Query lcl|NC_013644.1_cdsid_YP_003347388.1 [gene=gp34] [protein=portal protein] [protein_id=YP_003347388.1] [location=19717..21249] Match_columns 510 No_of_seqs 128 out of 470 Neff 9.5 Searched_HMMs 1612 Date Thu Nov 7 13:04:54 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_34 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_34_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:78083 Length: 537 100.0 4E-108 2E-111 609.7 54.7 508 1-510 1-534 (537) 2 protein:vir:105461 Length: 470 100.0 1.1E-98 7E-102 557.6 51.7 459 11-490 1-470 (470) 3 protein:vir:5961 Length: 503 # 100.0 1.8E-97 1E-100 551.0 53.5 484 1-503 15-503 (503) 4 protein:vir:102950 Length: 471 100.0 2.3E-96 1.4E-99 544.9 52.4 457 7-484 1-471 (471) 5 protein:vir:79043 Length: 479 100.0 2.5E-96 1.5E-99 544.7 52.1 468 1-483 7-479 (479) 6 protein:vir:99781 Length: 511 100.0 2.6E-95 1.6E-98 539.1 51.3 462 1-494 31-511 (511) 7 protein:vir:106571 Length: 499 100.0 4.3E-95 2.7E-98 537.9 52.0 467 1-510 1-496 (499) 8 protein:vir:97171 Length: 512 100.0 4.3E-95 2.7E-98 537.9 51.8 462 1-494 31-512 (512) 9 protein:vir:96266 Length: 474 100.0 2.7E-95 1.7E-98 539.1 49.8 453 1-492 17-474 (474) 10 protein:vir:95899 Length: 474 100.0 2.7E-95 1.7E-98 539.1 49.8 453 1-492 17-474 (474) 11 protein:vir:102330 Length: 451 100.0 2.8E-95 1.7E-98 538.9 49.3 432 11-476 1-451 (451) 12 protein:vir:1236 Length: 483 # 100.0 1.2E-94 7.1E-98 535.6 52.4 455 1-493 23-483 (483) 13 protein:vir:96240 Length: 511 100.0 9.2E-95 5.7E-98 536.1 51.6 462 1-494 31-511 (511) 14 protein:vir:9306 Length: 511 # 100.0 1.3E-94 7.9E-98 535.4 52.2 462 1-494 31-511 (511) 15 protein:vir:94805 Length: 492 100.0 1.7E-94 1E-97 534.7 52.1 455 1-494 32-492 (492) 16 protein:vir:103951 Length: 511 100.0 1.9E-94 1.2E-97 534.4 52.1 462 1-494 31-511 (511) 17 protein:vir:97336 Length: 492 100.0 1.8E-94 1.1E-97 534.5 52.0 455 1-493 32-492 (492) 18 protein:vir:78805 Length: 511 100.0 2.1E-94 1.3E-97 534.2 51.3 462 1-494 31-511 (511) 19 protein:vir:96366 Length: 511 100.0 2.1E-94 1.3E-97 534.2 51.3 462 1-494 31-511 (511) 20 protein:vir:94498 Length: 474 100.0 1.8E-93 1.1E-96 529.1 51.1 454 1-493 7-474 (474) 21 protein:vir:97447 Length: 474 100.0 1.8E-93 1.1E-96 529.1 51.1 454 1-493 7-474 (474) 22 protein:vir:96839 Length: 474 100.0 2.2E-93 1.4E-96 528.5 50.6 454 1-492 1-474 (474) 23 protein:vir:107112 Length: 478 100.0 8.1E-93 5E-96 525.5 50.7 457 1-492 1-478 (478) 24 protein:vir:93747 Length: 472 100.0 1.6E-92 1E-95 523.8 51.9 455 1-493 12-472 (472) 25 protein:vir:95113 Length: 474 100.0 1.5E-92 9.6E-96 523.9 50.8 454 1-492 1-474 (474) 26 protein:vir:105292 Length: 478 100.0 3E-92 1.9E-95 522.3 52.2 457 1-492 1-478 (478) 27 protein:vir:3609 Length: 452 # 100.0 7.7E-92 4.8E-95 520.1 51.0 437 1-493 1-452 (452) 28 protein:vir:3964 Length: 453 # 100.0 3.9E-91 2.4E-94 516.3 51.6 439 1-494 1-453 (453) 29 protein:vir:9871 Length: 429 # 100.0 5.4E-91 3.3E-94 515.5 50.4 423 11-488 1-429 (429) 30 protein:vir:96179 Length: 468 100.0 7.4E-91 4.6E-94 514.7 51.1 447 1-483 1-468 (468) 31 protein:vir:99522 Length: 470 100.0 1.3E-90 7.9E-94 513.4 51.8 446 1-492 1-470 (470) 32 protein:vir:94101 Length: 474 100.0 5.2E-91 3.3E-94 515.5 49.1 452 1-489 4-474 (474) 33 protein:vir:105889 Length: 474 100.0 5.2E-91 3.3E-94 515.5 49.1 452 1-489 4-474 (474) 34 protein:vir:2732 Length: 501 # 100.0 1.8E-90 1.1E-93 512.6 51.1 451 1-494 30-501 (501) 35 protein:vir:96494 Length: 501 100.0 2.1E-90 1.3E-93 512.3 50.4 450 1-493 30-501 (501) 36 protein:vir:4898 Length: 502 # 100.0 4.5E-90 2.8E-93 510.4 51.7 451 1-508 31-502 (502) 37 protein:vir:733 Length: 453 # 100.0 2.6E-90 1.6E-93 511.7 50.2 434 1-485 1-453 (453) 38 protein:vir:94546 Length: 506 100.0 1.9E-90 1.2E-93 512.4 48.2 455 1-495 11-506 (506) 39 protein:vir:106639 Length: 481 100.0 2.5E-88 1.6E-91 500.8 52.2 445 1-490 23-481 (481) 40 protein:vir:95806 Length: 440 100.0 9.7E-89 6E-92 503.1 47.4 424 19-482 1-440 (440) 41 protein:vir:9922 Length: 489 # 100.0 2.8E-86 1.7E-89 489.6 48.7 451 1-490 3-489 (489) 42 protein:vir:2427 Length: 485 # 100.0 4.2E-75 2.6E-78 428.3 47.9 452 1-498 1-485 (485) 43 protein:vir:2500 Length: 501 # 100.0 4.5E-75 2.8E-78 428.2 46.8 475 1-501 16-501 (501) 44 protein:vir:99072 Length: 479 100.0 5.9E-75 3.7E-78 427.5 44.9 458 1-510 1-478 (479) 45 protein:vir:78537 Length: 480 100.0 3.4E-74 2.1E-77 423.4 46.9 457 6-505 1-480 (480) 46 protein:vir:78227 Length: 480 100.0 6.9E-74 4.3E-77 421.7 46.5 457 6-504 1-480 (480) 47 protein:vir:4223 Length: 486 # 100.0 3E-73 1.9E-76 418.2 47.6 454 1-503 1-486 (486) 48 protein:vir:80680 Length: 441 100.0 1.7E-73 1E-76 419.6 46.0 423 5-474 1-441 (441) 49 protein:vir:104082 Length: 485 100.0 4.9E-73 3E-76 417.0 48.4 452 1-510 1-484 (485) 50 protein:vir:2341 Length: 488 # 100.0 5.6E-73 3.5E-76 416.7 47.0 449 4-498 1-488 (488) 51 protein:vir:7768 Length: 484 # 100.0 1.3E-72 8.1E-76 414.7 45.4 455 1-503 1-484 (484) 52 protein:vir:105819 Length: 456 100.0 1.2E-70 7.4E-74 403.9 44.3 440 4-484 1-456 (456) 53 protein:vir:102602 Length: 456 100.0 1.2E-70 7.4E-74 403.9 44.3 440 4-484 1-456 (456) 54 protein:vir:7987 Length: 456 # 100.0 4.4E-69 2.7E-72 395.3 44.5 439 4-484 1-456 (456) 55 protein:vir:38 Length: 496 # N 100.0 1.6E-66 1E-69 381.3 46.7 453 4-481 1-496 (496) 56 protein:vir:99916 Length: 504 100.0 1.4E-65 8.5E-69 376.2 47.0 463 1-510 1-504 (504) 57 protein:vir:80959 Length: 499 100.0 5.3E-65 3.3E-68 372.9 48.7 456 8-481 1-499 (499) 58 protein:vir:98444 Length: 434 100.0 2.6E-64 1.6E-67 369.2 41.2 409 45-497 1-434 (434) 59 protein:vir:94742 Length: 409 100.0 6.5E-61 4E-64 350.6 38.6 390 11-444 1-409 (409) 60 protein:vir:9568 Length: 410 # 100.0 5.5E-61 3.4E-64 351.0 38.2 391 20-459 1-410 (410) 61 protein:vir:9751 Length: 422 # 100.0 7.5E-61 4.7E-64 350.2 37.5 402 11-458 1-422 (422) 62 protein:vir:79703 Length: 505 100.0 5.4E-58 3.3E-61 334.6 45.5 457 1-479 1-505 (505) 63 protein:vir:1587 Length: 508 # 100.0 3.6E-58 2.2E-61 335.5 42.7 458 1-489 1-508 (508) 64 protein:vir:1634 Length: 409 # 100.0 5.5E-59 3.4E-62 340.0 38.0 390 11-444 1-409 (409) 65 protein:vir:8184 Length: 474 # 100.0 6E-58 3.8E-61 334.3 43.0 434 1-480 9-474 (474) 66 protein:vir:3028 Length: 500 # 100.0 2.2E-55 1.3E-58 320.3 45.2 453 6-486 1-500 (500) 67 protein:vir:9815 Length: 500 # 100.0 2.2E-55 1.3E-58 320.3 45.2 453 6-486 1-500 (500) 68 protein:vir:4782 Length: 522 # 100.0 1E-52 6.5E-56 305.6 48.3 464 6-495 1-522 (522) 69 protein:vir:78907 Length: 518 100.0 1.4E-50 8.9E-54 293.8 45.3 462 10-484 1-518 (518) 70 protein:vir:98883 Length: 517 100.0 1.6E-49 9.9E-53 288.1 44.0 458 6-491 1-517 (517) 71 protein:vir:101494 Length: 527 100.0 9.8E-48 6.1E-51 278.3 37.9 476 1-508 10-527 (527) 72 protein:vir:102239 Length: 527 100.0 1.1E-47 7.1E-51 277.9 37.9 476 1-508 10-527 (527) 73 protein:vir:7430 Length: 563 # 100.0 6.7E-43 4.1E-46 251.8 36.8 490 1-510 1-563 (563) 74 protein:vir:97265 Length: 513 100.0 3.8E-30 2.3E-33 181.9 35.1 466 1-510 1-512 (513) 75 protein:vir:94956 Length: 452 99.9 1.2E-26 7.3E-30 162.7 37.7 428 1-491 1-452 (452) 76 protein:vir:80453 Length: 535 99.9 2.5E-24 1.6E-27 149.9 40.3 453 1-501 32-535 (535) 77 protein:vir:95149 Length: 501 99.9 1.9E-24 1.2E-27 150.6 39.2 454 1-491 1-501 (501) 78 protein:vir:78393 Length: 489 99.9 2.7E-23 1.7E-26 144.3 39.9 447 1-493 1-489 (489) 79 protein:vir:95014 Length: 491 99.9 8.7E-22 5.4E-25 136.1 37.9 449 1-493 1-491 (491) 80 protein:vir:96783 Length: 488 99.9 1.1E-21 7.1E-25 135.4 33.2 432 1-474 14-488 (488) 81 protein:vir:93630 Length: 776 99.8 7.8E-20 4.9E-23 125.3 28.8 495 1-510 23-716 (776) 82 protein:vir:108295 Length: 711 99.8 3.1E-16 1.9E-19 105.6 39.0 488 1-510 6-682 (711) 83 protein:vir:80040 Length: 461 99.7 1.6E-17 9.8E-21 112.7 30.3 428 1-497 1-461 (461) 84 protein:vir:9950 Length: 714 # 99.7 1.2E-15 7.2E-19 102.5 35.3 471 1-510 1-641 (714) 85 protein:vir:3296 Length: 714 # 99.7 1.2E-15 7.2E-19 102.5 35.3 471 1-510 1-641 (714) 86 protein:vir:817 Length: 714 # 99.7 1.2E-15 7.2E-19 102.5 35.3 471 1-510 1-641 (714) 87 protein:vir:10117 Length: 714 99.7 1.2E-15 7.2E-19 102.5 35.3 471 1-510 1-641 (714) 88 protein:vir:2764 Length: 714 # 99.7 1.2E-15 7.2E-19 102.5 35.3 471 1-510 1-641 (714) 89 protein:vir:104437 Length: 714 99.7 7.3E-15 4.5E-18 98.1 36.6 471 1-510 1-636 (714) 90 protein:vir:5249 Length: 437 # 99.6 7.4E-15 4.6E-18 98.1 30.8 390 29-505 1-437 (437) 91 protein:vir:8846 Length: 705 # 99.6 3.2E-14 2E-17 94.6 32.7 480 1-510 1-628 (705) 92 protein:vir:105619 Length: 772 99.6 4E-14 2.5E-17 94.0 31.5 487 1-510 1-666 (772) 93 protein:vir:79538 Length: 502 99.6 3.6E-13 2.3E-16 88.8 35.2 435 13-505 1-502 (502) 94 protein:vir:95449 Length: 584 99.5 4.3E-13 2.6E-16 88.4 31.5 452 1-474 1-584 (584) 95 protein:vir:107742 Length: 537 99.5 1.1E-13 6.9E-17 91.6 27.4 422 1-510 25-532 (537) 96 protein:vir:80165 Length: 651 99.5 9.8E-12 6.1E-15 80.9 36.9 473 1-510 3-619 (651) 97 protein:vir:105429 Length: 708 99.5 1.1E-13 6.6E-17 91.7 23.8 484 1-510 1-648 (708) 98 protein:vir:96068 Length: 765 99.4 3E-12 1.9E-15 83.7 30.8 422 1-510 71-565 (765) 99 protein:vir:77597 Length: 725 99.4 2.6E-12 1.6E-15 84.1 30.3 473 1-510 1-618 (725) 100 protein:vir:100920 Length: 725 99.4 6.7E-12 4.1E-15 81.8 29.2 470 1-510 1-618 (725) 101 protein:vir:96738 Length: 505 99.4 4.9E-11 3.1E-14 77.1 37.4 436 1-502 11-505 (505) 102 protein:vir:105520 Length: 706 99.4 1.6E-11 9.8E-15 79.8 29.2 484 1-510 1-647 (706) 103 protein:vir:99563 Length: 862 99.4 3.5E-12 2.2E-15 83.4 25.5 444 1-510 66-592 (862) 104 protein:vir:9263 Length: 725 # 99.3 8.5E-12 5.3E-15 81.3 27.2 469 5-510 1-618 (725) 105 protein:vir:6382 Length: 553 # 99.3 7.7E-11 4.8E-14 76.0 37.5 455 1-501 2-553 (553) 106 protein:vir:102668 Length: 547 99.3 8.2E-11 5.1E-14 75.9 40.2 456 11-485 1-547 (547) 107 protein:vir:104338 Length: 422 99.3 4.1E-11 2.5E-14 77.5 29.8 381 4-496 1-422 (422) 108 protein:vir:172 Length: 708 # 99.3 2E-12 1.2E-15 84.8 22.2 492 1-510 1-659 (708) 109 protein:vir:80644 Length: 551 99.3 1E-10 6.5E-14 75.3 31.1 457 1-510 1-543 (551) 110 protein:vir:94049 Length: 532 99.3 6.7E-11 4.1E-14 76.4 29.8 434 1-510 23-525 (532) 111 protein:vir:3420 Length: 533 # 99.3 1.7E-10 1.1E-13 74.1 39.5 442 1-501 1-533 (533) 112 protein:vir:389 Length: 530 # 99.3 1.7E-10 1.1E-13 74.1 41.0 440 1-501 1-530 (530) 113 protein:vir:79647 Length: 435 99.3 5.2E-11 3.3E-14 76.9 27.2 391 1-503 1-435 (435) 114 protein:vir:107662 Length: 427 99.2 1.7E-10 1E-13 74.2 28.4 383 20-497 1-427 (427) 115 protein:vir:3520 Length: 720 # 99.2 2.9E-10 1.8E-13 72.9 29.4 476 5-510 1-632 (720) 116 protein:vir:10321 Length: 495 99.2 4.5E-10 2.8E-13 71.8 37.3 437 1-493 1-495 (495) 117 protein:vir:103765 Length: 549 99.2 6.4E-10 4E-13 71.0 38.1 463 5-503 1-549 (549) 118 protein:vir:63755 Length: 547 99.2 6.5E-10 4E-13 70.9 28.7 456 1-510 1-539 (547) 119 protein:vir:95542 Length: 548 99.1 1.4E-09 8.5E-13 69.2 38.7 455 1-510 1-537 (548) 120 protein:vir:94599 Length: 641 99.1 2.2E-09 1.4E-12 68.1 32.4 493 1-506 5-641 (641) 121 protein:vir:95315 Length: 559 99.1 2.6E-09 1.6E-12 67.7 40.7 477 1-506 1-559 (559) 122 protein:vir:3139 Length: 599 # 99.0 7.6E-09 4.7E-12 65.1 28.6 464 1-492 1-599 (599) 123 protein:vir:94709 Length: 522 98.8 2.8E-08 1.8E-11 62.0 42.6 440 1-497 1-522 (522) 124 protein:vir:102080 Length: 429 98.8 3.3E-08 2E-11 61.6 30.8 392 9-501 1-429 (429) 125 protein:vir:4156 Length: 542 # 98.8 3.2E-08 2E-11 61.6 24.8 423 4-510 1-474 (542) 126 protein:vir:8883 Length: 543 # 98.8 4.7E-08 2.9E-11 60.7 36.9 464 1-506 1-543 (543) 127 protein:vir:7321 Length: 556 # 98.8 5.3E-08 3.3E-11 60.5 38.8 472 1-505 1-556 (556) 128 protein:vir:1380 Length: 422 # 98.8 5.7E-08 3.6E-11 60.3 33.3 400 9-490 1-422 (422) 129 protein:vir:107822 Length: 555 98.7 7.3E-08 4.5E-11 59.7 40.9 464 1-499 1-555 (555) 130 protein:vir:98506 Length: 555 98.7 7.3E-08 4.5E-11 59.7 40.9 464 1-499 1-555 (555) 131 protein:vir:107404 Length: 555 98.7 7.3E-08 4.5E-11 59.7 40.9 464 1-499 1-555 (555) 132 protein:vir:9359 Length: 348 # 98.7 1E-07 6.2E-11 59.0 28.6 327 86-495 1-348 (348) 133 protein:vir:102727 Length: 945 98.7 1E-07 6.2E-11 59.0 31.7 437 1-510 4-536 (945) 134 protein:vir:3361 Length: 535 # 98.7 1.2E-07 7.7E-11 58.4 40.6 455 1-504 1-535 (535) 135 protein:vir:107605 Length: 432 98.6 1.7E-07 1.1E-10 57.7 34.5 400 9-501 1-432 (432) 136 protein:vir:105002 Length: 432 98.6 1.7E-07 1.1E-10 57.7 34.5 400 9-501 1-432 (432) 137 protein:vir:102855 Length: 432 98.6 1.7E-07 1.1E-10 57.7 34.5 400 9-501 1-432 (432) 138 protein:vir:79772 Length: 648 98.6 2.4E-07 1.5E-10 56.9 34.2 420 1-510 34-514 (648) 139 protein:vir:6240 Length: 457 # 98.6 2.4E-07 1.5E-10 56.8 32.9 408 32-498 1-457 (457) 140 protein:vir:96579 Length: 576 98.6 2.6E-07 1.6E-10 56.6 26.9 450 1-510 1-549 (576) 141 protein:vir:3153 Length: 467 # 98.5 3.2E-07 2E-10 56.2 33.7 393 60-510 1-462 (467) 142 protein:vir:1538 Length: 535 # 98.5 3.8E-07 2.4E-10 55.8 41.5 453 1-501 1-535 (535) 143 protein:vir:99312 Length: 563 98.5 3.8E-07 2.4E-10 55.8 31.6 454 1-510 1-550 (563) 144 protein:vir:95599 Length: 563 98.5 3.8E-07 2.4E-10 55.8 31.6 454 1-510 1-550 (563) 145 protein:vir:10447 Length: 536 98.5 4.6E-07 2.9E-10 55.3 40.6 454 1-509 1-536 (536) 146 protein:vir:95821 Length: 763 98.5 5.1E-07 3.1E-10 55.1 40.4 484 1-510 11-752 (763) 147 protein:vir:1326 Length: 457 # 98.5 5.6E-07 3.5E-10 54.9 32.9 415 32-505 1-457 (457) 148 protein:vir:93610 Length: 454 98.4 8E-07 5E-10 54.0 33.8 417 16-510 1-450 (454) 149 protein:vir:80796 Length: 574 98.4 8.1E-07 5.1E-10 54.0 33.7 451 1-510 5-553 (574) 150 protein:vir:1266 Length: 416 # 98.4 8.9E-07 5.5E-10 53.8 32.3 391 10-497 1-416 (416) 151 protein:vir:4454 Length: 414 # 98.4 9.2E-07 5.7E-10 53.7 34.7 389 9-496 1-414 (414) 152 protein:vir:4194 Length: 540 # 98.4 9.3E-07 5.7E-10 53.7 30.0 422 22-510 1-479 (540) 153 protein:vir:103330 Length: 517 98.4 9.6E-07 5.9E-10 53.6 38.7 437 1-489 1-517 (517) 154 protein:vir:2198 Length: 536 # 98.4 1.1E-06 6.9E-10 53.2 41.9 454 1-509 1-536 (536) 155 protein:vir:105782 Length: 449 98.3 1.5E-06 9.2E-10 52.5 25.2 413 6-498 1-449 (449) 156 protein:vir:102118 Length: 409 98.3 1.5E-06 9.4E-10 52.5 31.4 382 15-493 1-409 (409) 157 protein:vir:3843 Length: 397 # 98.3 1.7E-06 1.1E-09 52.2 35.6 380 1-508 1-397 (397) 158 protein:vir:96988 Length: 516 98.2 2.1E-06 1.3E-09 51.7 36.5 432 1-485 1-516 (516) 159 protein:vir:1785 Length: 555 # 98.2 2.3E-06 1.4E-09 51.5 39.3 450 9-508 1-555 (555) 160 protein:vir:78696 Length: 542 98.2 2.5E-06 1.6E-09 51.3 41.2 446 9-503 1-542 (542) 161 protein:vir:93943 Length: 409 98.2 3.4E-06 2.1E-09 50.6 29.9 389 4-495 1-409 (409) 162 protein:vir:100039 Length: 522 98.1 3.7E-06 2.3E-09 50.4 34.5 439 11-505 1-522 (522) 163 protein:vir:8418 Length: 409 # 98.1 4.2E-06 2.6E-09 50.1 33.0 384 32-499 1-409 (409) 164 protein:vir:94572 Length: 535 98.1 4.7E-06 2.9E-09 49.8 39.5 455 1-501 1-535 (535) 165 protein:vir:3868 Length: 417 # 98.1 4.9E-06 3E-09 49.7 29.9 384 36-505 1-417 (417) 166 protein:vir:94426 Length: 409 98.1 4.9E-06 3E-09 49.7 29.8 386 4-495 1-409 (409) 167 protein:vir:81095 Length: 416 98.1 5E-06 3.1E-09 49.6 36.8 384 32-505 1-416 (416) 168 protein:vir:4598 Length: 416 # 98.1 5E-06 3.1E-09 49.6 36.8 384 32-505 1-416 (416) 169 protein:vir:483 Length: 413 # 98.1 5.1E-06 3.1E-09 49.6 32.7 388 15-495 1-413 (413) 170 protein:vir:100150 Length: 437 98.1 5.4E-06 3.4E-09 49.5 30.5 405 1-504 1-437 (437) 171 protein:vir:99672 Length: 532 98.1 5.6E-06 3.5E-09 49.4 39.5 446 1-492 1-532 (532) 172 protein:vir:81152 Length: 411 98.0 6.1E-06 3.8E-09 49.2 32.0 382 9-490 1-411 (411) 173 protein:vir:99853 Length: 488 98.0 6.3E-06 3.9E-09 49.1 34.2 386 6-510 1-419 (488) 174 protein:vir:4952 Length: 386 # 98.0 6.4E-06 3.9E-09 49.1 31.5 366 1-495 1-386 (386) 175 protein:vir:105064 Length: 421 98.0 7.9E-06 4.9E-09 48.6 32.2 389 10-502 1-421 (421) 176 protein:vir:96980 Length: 409 98.0 8E-06 4.9E-09 48.5 30.2 388 4-495 1-409 (409) 177 protein:vir:79233 Length: 526 97.9 9.7E-06 6E-09 48.1 40.5 415 1-510 1-460 (526) 178 protein:vir:101648 Length: 518 97.9 1E-05 6.4E-09 47.9 38.1 408 36-510 1-451 (518) 179 protein:vir:2683 Length: 412 # 97.9 1E-05 6.4E-09 47.9 30.4 392 1-495 1-412 (412) 180 protein:vir:5737 Length: 419 # 97.9 1.1E-05 6.6E-09 47.8 32.0 383 32-501 1-419 (419) 181 protein:vir:1431 Length: 419 # 97.9 1.1E-05 6.8E-09 47.8 33.8 386 15-507 1-419 (419) 182 protein:vir:7017 Length: 515 # 97.9 1.4E-05 8.9E-09 47.1 39.5 431 1-484 1-515 (515) 183 protein:vir:103860 Length: 528 97.8 1.5E-05 9.3E-09 47.0 40.8 411 1-510 1-481 (528) 184 protein:vir:7853 Length: 518 # 97.8 1.7E-05 1.1E-08 46.7 35.4 408 36-510 1-452 (518) 185 protein:vir:79984 Length: 441 97.8 1.8E-05 1.1E-08 46.7 35.2 383 21-492 1-441 (441) 186 protein:vir:9408 Length: 441 # 97.8 1.8E-05 1.1E-08 46.7 35.2 383 21-492 1-441 (441) 187 protein:vir:100882 Length: 383 97.8 1.9E-05 1.2E-08 46.5 31.5 356 1-494 1-383 (383) 188 protein:vir:99232 Length: 526 97.6 3.3E-05 2E-08 45.2 41.2 405 1-510 1-447 (526) 189 protein:vir:7407 Length: 392 # 97.6 3.4E-05 2.1E-08 45.1 31.7 374 15-495 1-392 (392) 190 protein:vir:4337 Length: 434 # 97.6 3.5E-05 2.2E-08 45.0 28.9 396 1-509 1-434 (434) 191 protein:vir:4995 Length: 384 # 97.6 3.7E-05 2.3E-08 44.9 28.6 368 1-482 1-384 (384) 192 protein:vir:100691 Length: 535 97.6 3.8E-05 2.3E-08 44.8 35.8 443 1-510 1-533 (535) 193 protein:vir:78641 Length: 278 97.5 4.7E-05 2.9E-08 44.3 24.9 259 86-410 1-278 (278) 194 protein:vir:98396 Length: 441 97.5 5.2E-05 3.2E-08 44.1 35.5 379 36-492 1-441 (441) 195 protein:vir:105641 Length: 516 97.5 5.8E-05 3.6E-08 43.8 40.2 432 1-483 1-516 (516) 196 protein:vir:10362 Length: 432 97.4 7.2E-05 4.4E-08 43.3 35.1 396 23-505 1-432 (432) 197 protein:vir:4854 Length: 386 # 97.4 8.8E-05 5.4E-08 42.8 32.9 363 1-484 1-386 (386) 198 protein:vir:81072 Length: 432 97.3 9.5E-05 5.9E-08 42.6 36.3 395 24-505 1-432 (432) 199 protein:vir:4509 Length: 424 # 97.3 0.00012 7.2E-08 42.2 31.2 387 15-492 1-424 (424) 200 protein:vir:101541 Length: 694 97.2 0.00012 7.5E-08 42.1 24.9 446 1-510 54-578 (694) 201 protein:vir:4828 Length: 382 # 97.2 0.00014 8.7E-08 41.7 33.6 369 1-484 1-382 (382) 202 protein:vir:6322 Length: 510 # 97.1 0.00016 1E-07 41.4 42.8 428 12-478 1-510 (510) 203 protein:vir:1884 Length: 424 # 97.1 0.00016 1E-07 41.3 33.8 392 1-498 1-424 (424) 204 protein:vir:95378 Length: 406 97.1 0.00017 1E-07 41.3 31.3 376 1-494 1-406 (406) 205 protein:vir:3989 Length: 392 # 97.1 0.00017 1.1E-07 41.2 32.9 377 15-495 1-392 (392) 206 protein:vir:1023 Length: 392 # 97.1 0.00017 1.1E-07 41.2 32.9 377 15-495 1-392 (392) 207 protein:vir:3648 Length: 695 # 97.1 0.00018 1.1E-07 41.1 25.8 431 1-510 69-579 (695) 208 protein:vir:78942 Length: 510 97.0 0.0002 1.2E-07 40.9 44.1 431 12-498 1-510 (510) 209 protein:vir:107880 Length: 491 97.0 0.00021 1.3E-07 40.8 36.3 396 1-510 1-423 (491) 210 protein:vir:1986 Length: 512 # 96.9 0.00028 1.7E-07 40.1 36.8 416 1-510 1-469 (512) 211 protein:vir:100187 Length: 385 96.9 0.00029 1.8E-07 39.9 32.9 368 9-489 1-385 (385) 212 protein:vir:103219 Length: 201 96.8 0.00032 2E-07 39.7 15.2 181 249-496 1-201 (201) 213 protein:vir:106716 Length: 698 96.8 0.00032 2E-07 39.7 30.0 423 1-510 69-568 (698) 214 protein:vir:104500 Length: 537 96.8 0.00033 2E-07 39.7 27.3 420 1-510 36-535 (537) 215 protein:vir:9702 Length: 406 # 96.8 0.00035 2.2E-07 39.5 32.7 384 1-498 1-406 (406) 216 protein:vir:80333 Length: 419 96.8 0.00036 2.2E-07 39.5 31.9 384 4-508 1-419 (419) 217 protein:vir:960 Length: 413 # 96.7 0.00041 2.5E-07 39.2 33.7 371 1-490 1-413 (413) 218 protein:vir:97060 Length: 432 96.6 0.00047 2.9E-07 38.8 34.9 396 4-505 1-432 (432) 219 protein:vir:189 Length: 424 # 96.6 0.0005 3.1E-07 38.7 35.9 391 1-498 1-424 (424) 220 protein:vir:101647 Length: 460 96.5 0.00061 3.8E-07 38.2 32.5 400 12-505 1-460 (460) 221 protein:vir:79063 Length: 491 96.2 0.00085 5.3E-07 37.4 33.9 408 1-510 1-450 (491) 222 protein:vir:104892 Length: 558 96.2 0.00094 5.8E-07 37.2 25.3 447 1-508 1-558 (558) 223 protein:vir:100249 Length: 431 96.1 0.00099 6.1E-07 37.1 35.3 389 17-499 1-431 (431) 224 protein:vir:99452 Length: 651 96.1 0.001 6.4E-07 36.9 24.1 461 1-510 1-567 (651) 225 protein:vir:78589 Length: 695 96.0 0.0012 7.3E-07 36.7 30.4 433 1-510 69-579 (695) 226 protein:vir:80134 Length: 403 95.9 0.0013 8.2E-07 36.4 30.2 370 32-494 1-403 (403) 227 protein:vir:1082 Length: 359 # 95.8 0.0014 8.7E-07 36.2 30.7 340 1-442 1-359 (359) 228 protein:vir:104259 Length: 403 95.7 0.0016 1E-06 35.9 31.2 372 10-501 1-403 (403) 229 protein:vir:6210 Length: 394 # 95.4 0.0022 1.4E-06 35.1 29.0 366 9-495 1-394 (394) 230 protein:vir:108215 Length: 469 94.9 0.0031 1.9E-06 34.3 34.7 413 1-510 1-464 (469) 231 protein:vir:100650 Length: 395 94.5 0.0044 2.7E-06 33.5 28.6 360 32-510 1-394 (395) 232 protein:vir:101289 Length: 395 94.5 0.0044 2.7E-06 33.5 28.6 360 32-510 1-394 (395) 233 protein:vir:9507 Length: 395 # 94.5 0.0044 2.7E-06 33.5 28.6 360 32-510 1-394 (395) 234 protein:vir:77981 Length: 448 94.4 0.0046 2.9E-06 33.4 31.6 395 1-504 1-448 (448) 235 protein:vir:95965 Length: 385 94.3 0.0047 2.9E-06 33.3 26.2 348 32-489 1-385 (385) 236 protein:vir:81218 Length: 423 93.7 0.0068 4.2E-06 32.5 29.8 377 32-493 1-423 (423) 237 protein:vir:80211 Length: 514 93.1 0.0088 5.4E-06 31.9 41.7 433 1-480 1-514 (514) 238 protein:vir:78161 Length: 355 93.1 0.0089 5.5E-06 31.8 31.2 318 129-510 1-355 (355) 239 protein:vir:98816 Length: 446 92.6 0.011 6.6E-06 31.4 31.4 394 1-448 3-446 (446) 240 protein:vir:8100 Length: 466 # 92.4 0.011 7.1E-06 31.2 32.0 399 13-510 1-465 (466) 241 protein:vir:94666 Length: 723 90.4 0.021 1.3E-05 29.8 36.0 379 46-510 1-430 (723) 242 protein:vir:79511 Length: 448 90.1 0.022 1.4E-05 29.6 33.0 387 1-510 15-440 (448) 243 protein:vir:106999 Length: 564 88.5 0.032 2E-05 28.8 25.6 453 1-510 17-558 (564) 244 protein:vir:103177 Length: 533 87.0 0.042 2.6E-05 28.2 28.9 423 1-506 1-533 (533) 245 protein:vir:9641 Length: 395 # 86.9 0.043 2.6E-05 28.1 26.1 363 32-491 1-395 (395) 246 protein:vir:4089 Length: 395 # 86.6 0.044 2.8E-05 28.0 26.8 366 10-498 1-395 (395) 247 protein:vir:5665 Length: 511 # 85.7 0.051 3.2E-05 27.7 21.0 389 1-480 20-511 (511) 248 protein:vir:1661 Length: 378 # 82.8 0.074 4.6E-05 26.8 24.6 334 32-494 1-378 (378) 249 protein:vir:100598 Length: 516 82.1 0.08 5E-05 26.6 21.6 389 1-480 54-516 (516) 250 protein:vir:101189 Length: 516 81.1 0.089 5.5E-05 26.3 21.8 389 1-480 54-516 (516) 251 protein:vir:101806 Length: 516 81.1 0.089 5.5E-05 26.3 21.8 389 1-480 54-516 (516) 252 protein:vir:95254 Length: 488 77.8 0.12 7.5E-05 25.6 34.6 426 1-504 1-488 (488) 253 protein:vir:78310 Length: 376 74.7 0.15 9.6E-05 25.0 31.8 346 9-487 1-376 (376) 254 protein:vir:5839 Length: 533 # 72.6 0.18 0.00011 24.7 25.5 433 1-510 1-533 (533) 255 protein:vir:8317 Length: 409 # 69.7 0.22 0.00014 24.2 29.5 356 1-485 22-409 (409) 256 protein:vir:6896 Length: 523 # 68.6 0.23 0.00015 24.0 21.5 393 1-480 58-523 (523) 257 protein:vir:98643 Length: 395 68.2 0.24 0.00015 24.0 25.6 357 32-491 1-395 (395) 258 protein:vir:94002 Length: 378 65.8 0.28 0.00017 23.6 25.6 335 40-494 1-378 (378) 259 protein:vir:7208 Length: 524 # 64.3 0.3 0.00019 23.4 21.2 393 1-480 58-524 (524) 260 protein:vir:103458 Length: 524 63.8 0.31 0.00019 23.4 21.2 393 1-480 58-524 (524) 261 protein:vir:345 Length: 663 # 60.6 0.37 0.00023 22.9 25.4 460 1-510 1-576 (663) 262 protein:vir:94869 Length: 378 57.7 0.43 0.00027 22.6 24.3 337 17-494 1-378 (378) 263 protein:vir:93867 Length: 378 54.5 0.5 0.00031 22.2 24.9 336 32-494 1-378 (378) 264 protein:vir:79150 Length: 368 43.2 0.86 0.00053 21.0 21.6 311 25-426 1-368 (368) 265 protein:vir:78191 Length: 351 35.9 1.2 0.00075 20.1 23.2 298 25-417 1-351 (351) 266 protein:vir:858 Length: 378 # 31.8 1.5 0.00091 19.7 24.8 335 17-494 1-378 (378) 267 protein:vir:1150 Length: 350 # 24.5 2.2 0.0013 18.7 21.8 296 32-413 1-350 (350) 268 protein:vir:98567 Length: 340 24.1 2.2 0.0014 18.7 20.9 288 40-413 1-340 (340) 269 protein:vir:108049 Length: 524 23.3 2.3 0.0014 18.6 23.0 393 1-480 58-524 (524) 270 protein:vir:106282 Length: 521 20.1 2.8 0.0018 18.1 26.1 398 1-480 28-521 (521) No 1 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=100.00 E-value=3.5e-108 Score=609.67 Aligned_cols=508 Identities=44% Similarity=0.727 Sum_probs=421.2 Q ss_pred CCCc-cCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHH Q lcl|NC_013644. 1 MEAL-LSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIV 79 (510) Q Consensus 1 ~~~~-~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv 79 (510) |+.+ ++...+...++|.+.|..|.++.++.+++++++||+|+|+|++|+...++..+...++..+||+||++||+++|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~Iv 80 (537) T protein:vir:78 1 MTSPLLNKPIDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTELV 80 (537) T ss_pred CCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchHHHHH Confidence 8877 455557788889999999988888999999999999999999999999988999999999999999999999999 Q ss_pred HHHHhhhhcCCceeccCcH---HHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEc Q lcl|NC_013644. 80 DQKTQYLLSNPVEYETENE---ELKEYLAEYYNSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYN 156 (510) Q Consensus 80 ~~~~~~l~g~p~~~~~~d~---~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d 156 (510) ++.++||+|+||+|+++++ ++.+.|+.++++++++++.++++.++++|+||+++|+|++|+++++.++|+++||+|| T Consensus 81 d~~~~yl~G~Pv~~~~~d~~~~e~~~~l~~~~~~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~~~~~i~p~~~~pv~d 160 (537) T protein:vir:78 81 DQLAQYLLSNGVEVKVKDEDNTQLDEILQEYFDEDFQATIDTLVTNASKKGFEGIFARTTSEGKLKFQTVDGLTLIPVFD 160 (537) T ss_pred HHHhhhhcccCceeecCcchhHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeeEEEeeecCCCceEEEEEccceeEEEEc Confidence 9999999999999998654 4566778888899999999999999999999999999999999999999999999999 Q ss_pred CCCCceeEEEEEEEEEeeC--CceeEEEEEEEEcCCcEEEEEEcCCceeecccc---ccccccccc---------ccccc Q lcl|NC_013644. 157 EYNELQRICRHYITEIEKD--GETVDIHHAEVWTDQNVYFFVAEDNKDYELDEA---EPINPRPHV---------LAVDS 222 (510) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~---~~~~~~~~~---------~~~~~ 222 (510) +++++.+++|+|....... .....+.++++||++.+++|....++....... ....+.+.. ..... T Consensus 161 ~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 240 (537) T protein:vir:78 161 DYGVLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIEESTDADFEDT 240 (537) T ss_pred CCCCceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeeecccccccccccc Confidence 9999999999987665432 344567899999999999999887765432211 111111111 11223 Q ss_pred cccccccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeee Q lcl|NC_013644. 223 ENESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVV 302 (510) Q Consensus 223 ~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~ 302 (510) ......+|+||+||||+|+||++|+|||+++++|||+||.++|+++|.+++|++|+++++|+++++.+++..+++.++++ T Consensus 241 ~~~~~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~~~~~~~~l~~~~~i 320 (537) T protein:vir:78 241 DGYQVLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDSTDKLRQNIKAKKMI 320 (537) T ss_pred ccccccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCccchhHHHHHhhcCce Confidence 34466789999999999999999999999999999999999999999999999999999999998888888999998888 Q ss_pred ecc-CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 303 GTG-SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMN 381 (510) Q Consensus 303 ~~~-~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~ 381 (510) .++ ++++|+|++|+.+.++++.++++|++.||.+|++|++++.++||+||+||++++++|++||+.++++|+++|++++ T Consensus 321 ~v~~d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~~~~~~gn~SGvAlk~~~~~l~~ka~~ke~~f~~~l~~~~ 400 (537) T protein:vir:78 321 GVNGDNAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFNSTAVGDGNVTNVVIKSRYTLLAMKARKMETSLRKVLRWCA 400 (537) T ss_pred eecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCCCccccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH Confidence 776 5788999999999999999999999999999999999988999999999999999999999999999999999999 Q ss_pred HHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 382 KLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDV 461 (510) Q Consensus 382 ~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~ 461 (510) ++|+.++++.+...++..+|+|+|++++|.|+++.+++++++.++|++|+||+++++|+++|+|.+++++++.+.+.+.. T Consensus 401 ~~i~~~~~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~l~~~giiS~eT~l~~~p~vdd~e~ek~~~ee~~~~~~~~ 480 (537) T protein:vir:78 401 DMVVSDIALRGLGEYDSNDICFEIEPHVLANELDIATTRKTEAETEALKIGNIMTVAPRIGDDETLKLIAEELDLDYNEL 480 (537) T ss_pred HHHHHHHhhcCCcccccceeeEEeccCCCCCHHHHHHHHHHHHhcCcchHHHHHHhCCCCCCHHHHHHHHHHHHhhhhhh Confidence 99999999998888899999999999999999999999999999999999999999999999988888877766655554 Q ss_pred HHHHHhhhccCCCCC-------CCCCcccCCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 462 KEALEEAEYTKGLSD-------NTDEEETAVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 462 ~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) ....++...+..... .+....+++.++++.+..++..-+..+ -|..| T Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~--~~~~~ 534 (537) T protein:vir:78 481 KDALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVDPNQPVADPNVVPPT--DPNAV 534 (537) T ss_pred hhhhhhhcccccCcCcchhhhcCCCCCCCCCCCCCccCCCCCCCCCCCC--CCccC Confidence 444333322211110 111111122223333332222222111 12222 No 2 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=100.00 E-value=1.1e-98 Score=557.59 Aligned_cols=459 Identities=20% Similarity=0.318 Sum_probs=377.2 Q ss_pred hhHHHHHHHHHhhhh--hhhHHHHHHHHHHhccCCcchhcccceeccccc-cccccccccceeccchhHHHHHHHHhhhh Q lcl|NC_013644. 11 IIANALKAAIDKDRK--SSSKREAETGIRYYNHENDIMNNRIFYVDDEGI-LREDKYASNVRIPHGFFPEIVDQKTQYLL 87 (510) Q Consensus 11 ~~~~~i~~~i~~~~~--~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~-~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~ 87 (510) +..+.|.++|++++. ..+..+|.++++||.|+|+|+++....+...+. ......++++||++||+++||++.++||| T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 80 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGYVA 80 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhhee Confidence 334444555444433 245567999999999999999987665543332 23345688999999999999999999999 Q ss_pred cCCceeccCcHHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCC--CCceeEE Q lcl|NC_013644. 88 SNPVEYETENEELKEYLAEYYNSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEY--NELQRIC 165 (510) Q Consensus 88 g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~--~~~~~~~ 165 (510) |+||+|+++++++.+.|++++++++.+.+.++++.++++|+||+++|+|++|++++++++|.++||+||++ .++.+++ T Consensus 81 G~p~~~~~~d~~~~~~l~~~~~~~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~~~~~~~~p~~~~~v~d~~~~~~~~a~i 160 (470) T protein:vir:10 81 SVFPDIDVGKDADNKKIIDVLGDDRALTLNGLLVDSSNAGRAWLHYWIDEDGNFRYGIIQPDQITPIYATTLDNKLLGIL 160 (470) T ss_pred ccceeeecCchHHHHHHHHHHhhhHHHHHHHHHHHHhhcCeeEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999976 4577777 Q ss_pred EEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCC Q lcl|NC_013644. 166 RHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQ 245 (510) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~ 245 (510) ++|.....++ ...+.++++||++.+++|....++...........................+|+||+||||+|+||++ T Consensus 161 r~y~~~~~~~--~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~ 238 (470) T protein:vir:10 161 RSYKQLDPDS--GKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFSKNKY 238 (470) T ss_pred EEEEeeecCC--ceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccccccccccccCCCeeeEEEeecCCC Confidence 7776554443 34667899999999999998887777666555555555555556666778899999999999999999 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccC-----CCceeEEeecCCHH Q lcl|NC_013644. 246 ETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGS-----DGGLDVKTVTIPTE 320 (510) Q Consensus 246 g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~ 320 (510) |+|||+++++|||+||.++|++++.++++++|+++++|+.+.+..++..+++..+++.++. +++|+|++++.+.+ T Consensus 239 g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~lt~~~~~~ 318 (470) T protein:vir:10 239 RLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQFMNDLRKYKSIKINNTGNGDNSGVDKLQIDIPVE 318 (470) T ss_pred CCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccchhhhhhhhcCeEeccCCCCCcCceeEEEeecCChH Confidence 9999999999999999999999999999999999999999888888888888887777654 46799999999999 Q ss_pred HHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccce Q lcl|NC_013644. 321 GRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTE 400 (510) Q Consensus 321 ~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~ 400 (510) +++.++++|+++||.+|++|++++.++||+||+||++++++|++||+++++.|+++|++++++|+.+++.. ..+..+ T Consensus 319 ~~~~~~~~L~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~~~---~~d~~~ 395 (470) T protein:vir:10 319 ARDDALKITRKNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS---DADKRH 395 (470) T ss_pred HHHHHHHHHHHHHHHHhCCCCCCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc---Ccccce Confidence 99999999999999999999999888899999999999999999999999999999999999999988653 456788 Q ss_pred eeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHH-HHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCC Q lcl|NC_013644. 401 VSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDN-VLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTD 479 (510) Q Consensus 401 v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e-~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~ 479 (510) ++|+|++++|.|+++.+++++++ +|+||+||+++++|+++|++ +.++++++++++. .. ... .+.. T Consensus 396 i~i~f~~~~p~d~~e~~~~~~~~--~g~iS~et~l~~~p~v~D~~~E~eri~~E~~e~~---~~---~~~-~~~~----- 461 (470) T protein:vir:10 396 ISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIVDDWQQELKDLAKDKEEND---PY---SNQ-ADEL----- 461 (470) T ss_pred eeEEeccCCCCCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHH---Hh---hcc-cccc----- Confidence 99999999999999999999876 68999999999999999864 3333333222111 10 011 1111 Q ss_pred CcccCCCCCCc Q lcl|NC_013644. 480 EEETAVNPDDP 490 (510) Q Consensus 480 ~~~~~~~~~~~ 490 (510) .+++.++++ T Consensus 462 --~~~~~dde~ 470 (470) T protein:vir:10 462 --NGKGVNDEQ 470 (470) T ss_pred --CCCCCCCCC Confidence 111111111 No 3 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=100.00 E-value=1.8e-97 Score=551.03 Aligned_cols=484 Identities=26% Similarity=0.402 Sum_probs=389.1 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) +..++....+.+.+....+|.++....++.++.++++||.|+|+|+++.+......+...+...++++|+++||+++||+ T Consensus 15 ~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd 94 (503) T protein:vir:59 15 LNEIIVESAKEIAEPDTTMIQKLIDEHNPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKTNNRTSHAWHKLFVD 94 (503) T ss_pred HHHhhhhhhhhccchhHHHHHHHHHhhcHHHHHHHHHHhccccchhhccchhcccccccccccccccceeecchHHHHHH Confidence 33344444444444444445544444466789999999999999999998888888888888899999999999999999 Q ss_pred HHHhhhhcCCceeccCcHHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCC- Q lcl|NC_013644. 81 QKTQYLLSNPVEYETENEELKEYLAEYYNSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYN- 159 (510) Q Consensus 81 ~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~- 159 (510) +.++||||+||+|+++++++.++|+.|++|+++.++.++++.++++|++|+++|+|++|++++++++|.++||+||+.. T Consensus 95 ~~~~yl~g~~~~~~~~d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~~p~~~~~i~d~~~~ 174 (503) T protein:vir:59 95 QKTQYLVGEPVTFTSDNKTLLEYVNELADDDFDDILNETVKNMSNKGIEYWHPFVDEEGEFDYVIFPAEEMIVVYKDNTR 174 (503) T ss_pred HHHhhhhcCCeeeccCcHHHHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEEeecCCCceEEEEEccceeEEEEeCCCC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999999764 Q ss_pred -CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEE Q lcl|NC_013644. 160 -ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFY 238 (510) Q Consensus 160 -~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv 238 (510) ++.+++++|..... ....+.++++|+++++++|+...+++..........+... ......+|+||+|||| T Consensus 175 ~~~~~~ir~~~~~~~---~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~vPiv 245 (503) T protein:vir:59 175 RDILFALRYYSYKGI---MGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPH------MTKGGQAIGWGRVPII 245 (503) T ss_pred CceEEEEEEEEEecC---CCceEEEEEEEeCCcEEEEEEcCCcccccccccccccccc------eeecceeccCCccceE Confidence 45566655543322 2345678999999999999988777655443333222222 2234568999999999 Q ss_pred EecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeEEeecCC Q lcl|NC_013644. 239 RLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVTIP 318 (510) Q Consensus 239 ~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (510) +|+||+.|.|+|+++++|||+||.++|++++.++++++|+++++|+++++..++..+++..+++.++++++++|++++++ T Consensus 246 ~~~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 325 (503) T protein:vir:59 246 PFKNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPKEFTANLRYHSVIKVSGDGGVDTLRAEIP 325 (503) T ss_pred EecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccchhhhhhhcccceeccCCCcceeEeccCC Confidence 99999999999999999999999999999999999999999999999988888888999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhCCcccccc-ccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccc Q lcl|NC_013644. 319 TEGRKTKMEIDKENIYKFGMAFDSTQV-GDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFD 397 (510) Q Consensus 319 ~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~ 397 (510) .++++.++++|++.|+.+|++|++++. .+|++||+|+++++++|.+||.++++.|+.+|++++++|+.+++..+....+ T Consensus 326 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~ 405 (503) T protein:vir:59 326 VDSAAKELERIQDELYKSAQAVDNSPETIGGGATGPALENLYALLDLKANMAERKIRAGLRLFFWFFAEYLRNTGKGDFN 405 (503) T ss_pred HHHHHHHHHHHHHHHHHHhcccCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccc Confidence 999999999999999999999998765 4678999999999999999999999999999999999999999887766544 Q ss_pred -cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHHHHHHHHhhhccCCCC Q lcl|NC_013644. 398 -PTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWEDVKEALEEAEYTKGLS 475 (510) Q Consensus 398 -~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~~~~~~~~~~~~~~~~ 475 (510) ..+++|+|++++|.|+++.++++++++++|+||+||+++++|+++|+++ .++++++++... ........... T Consensus 406 ~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~GiiS~et~l~~l~~v~d~~~E~~ri~~E~~~~~------~~~~~~~~~~~ 479 (503) T protein:vir:59 406 PDKELTMTFTRTRIQNDSEIVQSLVQGVTGGIMSKETAVARNPFVQDPEEELARIEEEMNQYA------EMQGNLLDDEG 479 (503) T ss_pred cccceeEEeCCCCCCCHHHHHHHHHHHHhCCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHH------hhhccccCccC Confidence 4579999999999999999999999999999999999999999988643 333333222111 11111111111 Q ss_pred CCCCCcccCCCCCCcccccccCcccccc Q lcl|NC_013644. 476 DNTDEEETAVNPDDPTQQMAEGATGSTE 503 (510) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (510) +..++++.++.+++. ..+..|.+. T Consensus 480 ~~~~~~~~~~~~~~~----~~~~~g~~~ 503 (503) T protein:vir:59 480 GDDDLEEDDPNAGAA----ESGGAGQVS 503 (503) T ss_pred CCCCCCcCCCCCCcc----cCCCCCCcC Confidence 111111111111111 111112222 No 4 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=100.00 E-value=2.3e-96 Score=544.94 Aligned_cols=457 Identities=22% Similarity=0.373 Sum_probs=374.2 Q ss_pred CChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccc-----cccccccccccceeccchhHHHHHH Q lcl|NC_013644. 7 EDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDE-----GILREDKYASNVRIPHGFFPEIVDQ 81 (510) Q Consensus 7 ~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~-----~~~~~~~~~~~~ki~~n~~~~Iv~~ 81 (510) =+.+.+.+.|.++|.+|. .++.++.++++||+|+|+|++++....... ........++++||++||+++||++ T Consensus 1 ~~~e~~~~~i~~~~~~~~--~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~ 78 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHG--KFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQ 78 (471) T ss_pred CCHHHHHHHHHHHHHHHH--HHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHh Confidence 456667788888888775 456689999999999999998765433221 1222234568899999999999999 Q ss_pred HHhhhhcCCceeccCcHHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECC-CCceEEEEEcccceEEEEcCCC- Q lcl|NC_013644. 82 KTQYLLSNPVEYETENEELKEYLAEYYNSEFQVVLQELVEGSSQKGFEYVYARTNA-EDRLCFQVADSLNVFGVYNEYN- 159 (510) Q Consensus 82 ~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~-~g~~~i~~~~p~~~~~~~d~~~- 159 (510) .++||||+||+|+++++++.+.|+.|++|++++++.++++.++++|+||+++|+|+ +|++++++++|++++|+||++. T Consensus 79 ~~~yl~G~p~~~~~~~~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~g~~~~~~~~p~~~~~i~d~~~~ 158 (471) T protein:vir:10 79 KKAYALTYPPTFDVDDKKVNDMIVDVLGDDYERISKQLCVNAGNAGIAWLHVWKDASDNSFRYACVDSKEVIPIYSKSLD 158 (471) T ss_pred hhhhhcccCceeccCChHHHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEEEeeCCCCeeEEEEEcccceEEEEcCCCC Confidence 99999999999999999999999999999999999999999999999999999985 6999999999999999999765 Q ss_pred -CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEE Q lcl|NC_013644. 160 -ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFY 238 (510) Q Consensus 160 -~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv 238 (510) ++.+++++|......+ ...+.++++|+++++++|....++...........................+|+||+|||| T Consensus 159 ~~~~~~ir~~~~~~~~~--~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv 236 (471) T protein:vir:10 159 KKSIGVLRVYSSIDETD--GKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVPFI 236 (471) T ss_pred CceEEEEEEEEeeccCC--CceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCCceeEE Confidence 4666777765544433 3467789999999999999887765554443333333333344555667789999999999 Q ss_pred EecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccC-----CCceeEE Q lcl|NC_013644. 239 RLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGS-----DGGLDVK 313 (510) Q Consensus 239 ~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~ 313 (510) +|+||..|.|+|+++++|||+||.++|++++.++++++|+++++|+++.+.++....++.++++.+++ +++|+|+ T Consensus 237 ~~~n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l 316 (471) T protein:vir:10 237 PFKNNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQEFLEDLKRYKMIKMDNDGMGDQSGVTTI 316 (471) T ss_pred EeccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhHHHhhcCCeEEecCCCCccCccceEE Confidence 99999999999999999999999999999999999999999999998888888888888888877653 4589999 Q ss_pred eecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_013644. 314 TVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYT 393 (510) Q Consensus 314 ~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~ 393 (510) +|+++.++++.++++|+++||.+|++|++++..+||+||+||+++++++.+||+.+++.|+++|++++++|+.+++.. T Consensus 317 ~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~-- 394 (471) T protein:vir:10 317 AIDIPTEARNLILERTKKQIFISGQGVNPETDKLGNSSGVALKFLYSLLELKAGNMETQFRSGYATLVKMILKHLGLS-- 394 (471) T ss_pred eecCChHHHHHHHHHHHHHHHHHhCCcCCCcccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-- Confidence 999999999999999999999999999999989999999999999999999999999999999999999999988654 Q ss_pred CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHHHHHHHHhhhccC Q lcl|NC_013644. 394 KAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWEDVKEALEEAEYTK 472 (510) Q Consensus 394 ~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~~~~~~~~~~~~~ 472 (510) +..+++|+|++++|.|+++.+++++++ +|+||+||+++++|+++|+++ .+++++++++. .+.. . T Consensus 395 ---d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~~p~v~D~~~E~eri~~E~~~~-------~~~~---~ 459 (471) T protein:vir:10 395 ---DKLKIKQTWTRNSINNDTEMAQVVSTL--ATITSRENVAKSNPIVEDWQDELRLQKAEQEGR-------SEKL---Y 459 (471) T ss_pred ---CCceeEEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHH-------Hhcc---c Confidence 456789999999999999999999876 688999999999999998643 23332222211 1111 1 Q ss_pred CCCCCCCCcccC Q lcl|NC_013644. 473 GLSDNTDEEETA 484 (510) Q Consensus 473 ~~~~~~~~~~~~ 484 (510) ...+..++++.+ T Consensus 460 ~~~~~~~~~e~~ 471 (471) T protein:vir:10 460 DMEEVEHESEVE 471 (471) T ss_pred ccCCCCCccccC Confidence 111111111111 No 5 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=100.00 E-value=2.5e-96 Score=544.74 Aligned_cols=468 Identities=23% Similarity=0.351 Sum_probs=386.7 Q ss_pred CCCc--cCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEAL--LSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~--~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) |+.. ...-..++..++.+.|.+|+..+++.++.++++||+|+|+|++++...... +.......++++|+++||+++| T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~-~~~~~~~~~~~~ki~~~~~~~I 85 (479) T protein:vir:79 7 SETDLIKVQLKKESTINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLD-GAKVDDFTKVNNKAINNYHKLL 85 (479) T ss_pred cccceEeeccccCChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCcccccccccccc-cccccccccCcceeecchHHHH Confidence 3222 222234466777788888877778888999999999999999987655443 4556677889999999999999 Q ss_pred HHHHHhhhhcCCceeccCcHHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCC Q lcl|NC_013644. 79 VDQKTQYLLSNPVEYETENEELKEYLAEYYNSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEY 158 (510) Q Consensus 79 v~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~ 158 (510) |++.++||+|+||+|+++++++.+.|+.|++|++++.+.++++.++++|+||+++|.|++|++++++++|++++|+||++ T Consensus 86 vd~~~~~l~g~p~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~d~~ 165 (479) T protein:vir:79 86 VDQKVGYSVGNPIVFNADDDNLTKLLNDLLGEEFDDTITELYLNASNKGVEWLHPYINRKGEFKYVIIPAEEAIPIWDSK 165 (479) T ss_pred HHHHHhhhhcCCceeccCCHHHHHHHHHHHhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEccceeEEEEeCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999975 Q ss_pred C--CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCccc Q lcl|NC_013644. 159 N--ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIP 236 (510) Q Consensus 159 ~--~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP 236 (510) . ++.+++++|.....++ ..+.++++|+++.+++|...++++......... ................+|+||+|| T Consensus 166 ~~~~~~~~ir~y~~~~~~~---~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~vP 241 (479) T protein:vir:79 166 RQRELVAFIRFYYIEDIDG---NKIKRVEYYTENDITYFIERGNSFIQEFLYDEY-GKMTDIQEGHFRINNKEQGWGKVP 241 (479) T ss_pred CCCceEEEEEEEEEeecCC---ceEEEEEEEeCCcEEEEEecCCccccccccccc-ccccccccccccccccccCCCccc Confidence 4 5777777776554333 346789999999999999887665433222211 111222223344567789999999 Q ss_pred EEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeEEeec Q lcl|NC_013644. 237 FYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVT 316 (510) Q Consensus 237 vv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (510) ||+|+||+.|+|+|+++++|||+||.++|++++.++++++|+++++|+++.+.+++...++..+++.++++++++|++++ T Consensus 242 vv~~~nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~ 321 (479) T protein:vir:79 242 FIPFKNNEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQEFIDNIRYYKSIKVDGGGGVDKLEIN 321 (479) T ss_pred EEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccccchhhhhhccceecCCCCcceEEecc Confidence 99999999999999999999999999999999999999999999999988777777788888889999999999999999 Q ss_pred CCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCcc Q lcl|NC_013644. 317 IPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAF 396 (510) Q Consensus 317 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~ 396 (510) ++.++++.++++|++.|+.+|++|++++.++||+||+|++++++++++||..+++.|+++|++++++|+.+++..+..++ T Consensus 322 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~ 401 (479) T protein:vir:79 322 IPVEAKKELLDRLEKNIIIFGQGVNPESQNTGDKSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISGNKSY 401 (479) T ss_pred CCHHHHHHHHHHHHHHHHHHhCccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcc Confidence 99999999999999999999999999998899999999999999999999999999999999999999999999988888 Q ss_pred ccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHHHHHHHHhhhccCCCC Q lcl|NC_013644. 397 DPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWEDVKEALEEAEYTKGLS 475 (510) Q Consensus 397 ~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~~~~~~~~~~~~~~~~ 475 (510) +..+++|+|++++|.|+++.+++++++ +|+||+||+++++|+++|+++ .++++++++.+. ......++... T Consensus 402 ~~~~i~i~f~~~~p~~~~~~a~~~~kl--~g~iS~et~l~~l~~v~d~~~E~~ri~~E~~~~~------~~~~~~~~~~~ 473 (479) T protein:vir:79 402 DYKTVQITFNHSMIINEAEKIDMAAKS--TGIVSDETIVSNHPWVEDVNDELERLKKQEDTQK------EYDDLIPNNQD 473 (479) T ss_pred ccccceEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHH------HHHhccCcccC Confidence 999999999999999999999999877 589999999999999988642 233333322111 11111111111 Q ss_pred CCCCCccc Q lcl|NC_013644. 476 DNTDEEET 483 (510) Q Consensus 476 ~~~~~~~~ 483 (510) +..+ ++ T Consensus 474 ~~~~--e~ 479 (479) T protein:vir:79 474 GVID--ET 479 (479) T ss_pred CCcC--cC Confidence 1111 00 No 6 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=100.00 E-value=2.6e-95 Score=539.12 Aligned_cols=462 Identities=16% Similarity=0.227 Sum_probs=365.0 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |-... .+...+.+.|.++|.+|...+ +.+++++++||.|+|+|+++... .....++++|+++||+++||+ T Consensus 31 ~~~~e-~~~~~~~~~i~~~i~~~~~~~-~~r~~~l~~Yy~g~~~i~~~~~~--------~~~~~~~~~ki~~n~~k~Iv~ 100 (511) T protein:vir:99 31 YDGTE-SDLLQNVNEVSKYIEHHMDYQ-RPRLKVLSDYYEGKTKNLVELTR--------RKEEYMADNRVAHDYASYISD 100 (511) T ss_pred cchhh-hhhhccHHHHHHHHHHHHHhh-HHHHHHHHHHhcccCccccccCc--------ccccccCcceeecchHHHHHH Confidence 22111 122334577888999887554 45799999999999999876543 234567889999999999999 Q ss_pred HHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCC Q lcl|NC_013644. 81 QKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYN 159 (510) Q Consensus 81 ~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~ 159 (510) +.++||+|+||+|+++++++.+.|++||+ |+++.++.++++.++++|+||+++|.|++|++++++++|+++||+||++. T Consensus 101 ~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~~p~~~~~vyd~~~ 180 (511) T protein:vir:99 101 FINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTI 180 (511) T ss_pred HHHhhhcccCceeecCchHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCC Confidence 99999999999999999999999999995 78999999999999999999999999999999999999999999999764 Q ss_pred --CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccE Q lcl|NC_013644. 160 --ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPF 237 (510) Q Consensus 160 --~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 237 (510) ++.+++++|.....++.....+.++++|+++++++|+..+++..... .......+|+||.||| T Consensus 181 ~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~---------------~~~~~~~~~~~g~vPv 245 (511) T protein:vir:99 181 ERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLT---------------PRENGFESHSFERMPI 245 (511) T ss_pred CCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCcccccc---------------ccccccccCCCCccce Confidence 57777888877777777777888999999999999988765433211 1223567899999999 Q ss_pred EEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCee------------eecc Q lcl|NC_013644. 238 YRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKV------------VGTG 305 (510) Q Consensus 238 v~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~------------~~~~ 305 (510) |+|+||++|+|+|+++++|||+||.++|++++.++++++|+++++|+...+..+.........+ .... T Consensus 246 v~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (511) T protein:vir:99 246 TEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETE 325 (511) T ss_pred EEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcccCchhhcccccccceecccccccccccccCC Confidence 9999999999999999999999999999999999999999999999776655544333222211 1245 Q ss_pred CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 306 SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLV 384 (510) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i 384 (510) ++++++|++++.+.++++.++++|.+.||.+|++|++++.+ +||+||+||+++++++.+||.+|++.|+++|++++++| T Consensus 326 ~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li 405 (511) T protein:vir:99 326 GSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLL 405 (511) T ss_pred CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 67889999999999999999999999999999999988765 58999999999999999999999999999999999999 Q ss_pred HHHHhhccCC--ccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHH-HHHHHHHHHHHHHHHH Q lcl|NC_013644. 385 IDDINRRYTK--AFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDN-VLRLICEQFDLDWEDV 461 (510) Q Consensus 385 ~~~~~~~~~~--~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e-~~~~~~e~~e~~~~~~ 461 (510) +.+++..+.. ..++.+++|+|++++|.|.++.+++++++ +|+||+||+++++|+++|++ +.++++++++.. . T Consensus 406 ~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~~e~~~~~~kl--~GiiS~et~l~~l~~v~D~~~E~~ri~~E~~~~---~ 480 (511) T protein:vir:99 406 ETILKNTRSIDVSKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKES---I 480 (511) T ss_pred HHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCCHHHHHHhCCCCCCHHHHHHHHHHHHHHH---H Confidence 9999876543 34566899999999999999999999877 48999999999999999864 333333333322 1 Q ss_pred HHHHHhhhccCCCCCCCCCcccCCCCCCccccc Q lcl|NC_013644. 462 KEALEEAEYTKGLSDNTDEEETAVNPDDPTQQM 494 (510) Q Consensus 462 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (510) ...+... ..+..+..+.+.++...++..+++ T Consensus 481 ~~~~~~~--~~~~~~~~~~~~~~~~~~~~d~~e 511 (511) T protein:vir:99 481 KKAQKNM--YQDPRNINDDEQDDSTKDSIDKKE 511 (511) T ss_pred HHHhhcc--cccCCCCCCCCCCCCCcCcccccC Confidence 1122111 111111111111111111111111 No 7 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=100.00 E-value=4.3e-95 Score=537.92 Aligned_cols=467 Identities=16% Similarity=0.160 Sum_probs=366.3 Q ss_pred CCC-----ccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchh Q lcl|NC_013644. 1 MEA-----LLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFF 75 (510) Q Consensus 1 ~~~-----~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~ 75 (510) |-. ++.+..+.+.++|.++|++|.. +..+++++++||.|+|+|++++. +...++++|+++||+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~--~~~~~~~l~~Yy~g~~~i~~~~~----------~~~~~~~~ki~~n~~ 68 (499) T protein:vir:10 1 MAVVIDKDLLDDVNEPNIEAINYAIRELQN--RKKRLDKLSDYYNGKQEIEKHEF----------DNATVEAANVMVNHA 68 (499) T ss_pred CccchhhhHHhhhhcCCHHHHHHHHHHHHH--HHHHHHHHHHHhccccchhcCCc----------CcCCCCcceeecchH Confidence 332 2333345668889999999854 45679999999999999987653 345678899999999 Q ss_pred HHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCc-------------- Q lcl|NC_013644. 76 PEIVDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDR-------------- 140 (510) Q Consensus 76 ~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~-------------- 140 (510) ++||++.++||||+||+|++++++..+.|+++|+ |+++.++.++++.++++|+||+++|.+++|. T Consensus 69 ~~Iv~~~~~~l~g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~ 148 (499) T protein:vir:10 69 KYITDMNVGFMTGNPVKYVAEKGKNIDDILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLTP 148 (499) T ss_pred HHHHHHHhhhhcccCceeecCChhHHHHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEeccccccccccccccccccc Confidence 9999999999999999999999999999999985 7899999999999999999999999998874 Q ss_pred ---eEEEEEcccceEEEEcCCCC--ceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccc Q lcl|NC_013644. 141 ---LCFQVADSLNVFGVYNEYNE--LQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRP 215 (510) Q Consensus 141 ---~~i~~~~p~~~~~~~d~~~~--~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 215 (510) ++++.++|+++||+|++.+. +..++++|+....+ ....++++++||++++++|+...++.... T Consensus 149 ~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~--~~~~~~~~~iyt~~~i~~~~~~~~~~~~~---------- 216 (499) T protein:vir:10 149 NTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLE--GNTNGYSITVYMPQRIVEYRTKTTMEVSA---------- 216 (499) T ss_pred ccceEEEEEcccceEEEecCCCCcceEEEEEEEEEeecC--CCceEEEEEEEeCCeEEEEEecCCccccC---------- Confidence 67899999999999998665 44455555443333 34567899999999999998776543221 Q ss_pred ccccccccccccccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHh Q lcl|NC_013644. 216 HVLAVDSENESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQN 295 (510) Q Consensus 216 ~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~ 295 (510) ........+|+||+||||+|+|++.|+|+|+++++|||+||.++|++++.++++++|+++++|+...+..+.... T Consensus 217 -----~~~~~~~~~~~~g~vPvv~~~n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~ 291 (499) T protein:vir:10 217 -----NDPIVYDGENLFGAVPIIEFRNNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGLGDDKDDIQR 291 (499) T ss_pred -----cceecccccCCCCccceEEecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccccchhhh Confidence 122334568999999999999999999999999999999999999999999999999999999987766665565 Q ss_pred hhcCeeeec--cCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 296 VKSKKVVGT--GSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEAR 372 (510) Q Consensus 296 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~ 372 (510) ++.+.++.+ +++++++|++++.+.++++.++++|.+.|+.+|++|++++.. +||+||+||+++++++.+||.+|+++ T Consensus 292 ~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~k~~~k~~~ 371 (499) T protein:vir:10 292 LKRGAIEAPPREEGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDEKFMGNVSGEAMKFKLFGLENLLSIKQRY 371 (499) T ss_pred hhhcceeccCCCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCchhhcccchHHHHHHHHHHHHHHHHHHHHH Confidence 565555554 467789999999999999999999999999999999987765 58999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHH Q lcl|NC_013644. 373 LRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLIC 451 (510) Q Consensus 373 ~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~ 451 (510) |+++|++++++|+.+++..+ ...+..+++|+|++++|.|+++.+++++++ +|+||+||+++++|+++|+++ .++++ T Consensus 372 ~~~~l~~~~~li~~~~~~~~-~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~~E~~ri~ 448 (499) T protein:vir:10 372 FFDGLRRRLKLIQTIVNIKG-ANDDASGCKISLVANIPSNLSDVVNNVKNA--DGIIPRKYTYSWLPDVDNPQDVIDEMN 448 (499) T ss_pred HHHHHHHHHHHHHHHHhccC-CccccccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHH Confidence 99999999999999998765 456788999999999999999999999987 688999999999999998643 33333 Q ss_pred HHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 452 EQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 452 e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) ++++... ....+ ...+...+.. +.++++.+++..+..+.+....|+.| T Consensus 449 ~E~~~~~---~~~~~--~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (499) T protein:vir:10 449 QQDAETI---KKNQE--ALRGQDPDRL------ELEDKQDDSSENDKEAGSNHNQSHRT 496 (499) T ss_pred HHHHHHH---HHHHh--hhccCCCCCC------CCCCCCcccCCCCCCCccccccCCCC Confidence 3332211 11111 1111111100 01111111111122222344455555 No 8 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=100.00 E-value=4.3e-95 Score=537.91 Aligned_cols=462 Identities=16% Similarity=0.216 Sum_probs=365.4 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |..... +...+.+.|.++|.+|...+. .+++++++||.|+|+|+++.... ....++++|+++||+++||+ T Consensus 31 ~~~~e~-~~~~~~~~i~~~i~~~~~~~~-~r~~~l~~YY~g~~~i~~~~~~~--------~~~~~~~~ki~~n~~k~Ivd 100 (512) T protein:vir:97 31 YDGTES-DLLQNINEVSKYIEHHMDYQR-PRLKVLSDYYEGKTKNLVELTRR--------KEEYMADNRVAHDYASYISD 100 (512) T ss_pred cCchhh-hhhhhHHHHHHHHHHHHHhhH-HHHHHHHHHhcccCccccccCcc--------cccccCcceeecchHHHHHH Confidence 333222 233345778888888875544 46999999999999998765432 34567889999999999999 Q ss_pred HHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCC Q lcl|NC_013644. 81 QKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYN 159 (510) Q Consensus 81 ~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~ 159 (510) +.++||+|+||+|+++++++.+.|++||+ |+++.++.++++.++++|+||+++|.|++|++++++++|+++||+||++. T Consensus 101 ~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded~~~~i~~~~p~~~~~iyd~~~ 180 (512) T protein:vir:97 101 FINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTI 180 (512) T ss_pred HHhhhhcccCceeccCChHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEcccceEEEEcCCC Confidence 99999999999999999999999999995 78999999999999999999999999999999999999999999999764 Q ss_pred --CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccE Q lcl|NC_013644. 160 --ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPF 237 (510) Q Consensus 160 --~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 237 (510) ++.+++++|.....++.....+.++++||++.+++|...+++..... .......+|+||.||| T Consensus 181 ~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~---------------~~~~~~~~~~~g~vPv 245 (512) T protein:vir:97 181 ERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLT---------------PRENGFESHSFERMPI 245 (512) T ss_pred CCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccc---------------ccccccccccCcccce Confidence 57777888877777777777889999999999999988765543221 1223567899999999 Q ss_pred EEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCee-------------eec Q lcl|NC_013644. 238 YRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKV-------------VGT 304 (510) Q Consensus 238 v~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~-------------~~~ 304 (510) |+|+||++|+|+|+++++|||+||.++|++++.++++++|++|+.|+...+..++........+ +.. T Consensus 246 v~~~nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (512) T protein:vir:97 246 TEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYENRDTGIET 325 (512) T ss_pred EeecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCchhhhhhhhcccccccccchhhcccccCC Confidence 9999999999999999999999999999999999999999999999876655544332222211 123 Q ss_pred cCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc-CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 305 GSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGD-GNITNIVIKARYTLLNMKANKTEARLRALLEWMNKL 383 (510) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~ 383 (510) +++++++|++++.+.++++.++++|.+.||.+|++|+++++.+ ||+||+||+++++++.+||..|++.|+++|++++++ T Consensus 326 ~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l 405 (512) T protein:vir:97 326 EGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKL 405 (512) T ss_pred CCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5678899999999999999999999999999999999987764 789999999999999999999999999999999999 Q ss_pred HHHHHhhccCC--ccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHH Q lcl|NC_013644. 384 VIDDINRRYTK--AFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWED 460 (510) Q Consensus 384 i~~~~~~~~~~--~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~ 460 (510) |+.++...+.. ..+..+++|+|++++|.|.++.+++++++ +|+||+||+++++|+++|+++ .++++++++.. T Consensus 406 i~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl--~giiS~et~~~~l~~v~d~~~E~eri~~E~~~~--- 480 (512) T protein:vir:97 406 LETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKES--- 480 (512) T ss_pred HHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHH--- Confidence 99998876543 34566899999999999999999999877 588999999999999988643 33333332221 Q ss_pred HHHHHHhhhccCCCCCCCCCcccCCCCCCccccc Q lcl|NC_013644. 461 VKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQM 494 (510) Q Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (510) ....+ .....+..+..+.+.++.......+++ T Consensus 481 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (512) T protein:vir:97 481 IKKAQ--KGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) T ss_pred HHHHh--hcccCCCCCCCCCCCCCCccccccccC Confidence 11111 111122122111111111111111111 No 9 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=100.00 E-value=2.7e-95 Score=539.07 Aligned_cols=453 Identities=21% Similarity=0.341 Sum_probs=366.6 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) .-..+..+...+.++|.++|++|. .++.++.++++||.|+|+|++++.+.. ........++++||++||+++||+ T Consensus 17 ~~~~~~~~~~~~~~~i~~~i~~~~--~~~~~~~~l~~Yy~g~~~i~~~~~~~~---~~~~~~~~~~~~ki~~n~~k~Iv~ 91 (474) T protein:vir:96 17 VVEQMKPKVETQEEMIIRLINNHK--QKLKDINVGQKYYDKDNDINYQAYKQD---LHGNIDYTKPDWRITTNFHQNLVD 91 (474) T ss_pred hhhhccccccchHHHHHHHHHHHH--HHHHHHHHHHHHhcccCccccccchhh---hcccccccccccccccchHHHHHH Confidence 111233455677888999999996 456689999999999999998865422 222345577899999999999999 Q ss_pred HHHhhhhcCCceeccCcHHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCC- Q lcl|NC_013644. 81 QKTQYLLSNPVEYETENEELKEYLAEYYNSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYN- 159 (510) Q Consensus 81 ~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~- 159 (510) +.++||||+||+|+++++++.+.|+.|++|++++++.++++.++++|+||+++|+|++|++++++++|+++||+||++. T Consensus 92 ~~~~yl~g~p~~~~~~~~~~~~~l~~~~~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~ 171 (474) T protein:vir:96 92 QKVSYVAGKPVTYAHDDDKVLDVIHQVLDTRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRVPAEQAIPIWTDKER 171 (474) T ss_pred hhhhhhcccCceeccCChHHHHHHHHHHhccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEEcCCCC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999998754 Q ss_pred -CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEE Q lcl|NC_013644. 160 -ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFY 238 (510) Q Consensus 160 -~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv 238 (510) ++.++++.|.. . ...++++|+++++++|...+++....... ..........+|+||+|||| T Consensus 172 ~~~~a~ir~~~~--~------~~~~~~vy~~~~i~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~vPvv 233 (474) T protein:vir:96 172 EQLNAFIRIFTF--N------GETKVEYWTAETVTYYVYENGGLIPDFYY----------GDEHIQTHFSTGSWERVPFI 233 (474) T ss_pred CceEEEEEEEee--c------CeeEEEEEeCCeEEEEEEcCCceeecccc----------ccccccCcccccCCCccceE Confidence 56566555432 1 24578999999999999876654322111 11223335668999999999 Q ss_pred EecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeEEeecCC Q lcl|NC_013644. 239 RLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVTIP 318 (510) Q Consensus 239 ~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (510) +|+|+++|.|+|+++++|||+||.++|++++.+++|++|++|++|+++++..++...++.++++.++++++++|++++.+ T Consensus 234 ~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~~~ 313 (474) T protein:vir:96 234 AFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSEFMEGLKYYKAINVSSDGGVETIQVEVP 313 (474) T ss_pred EecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccchhhhhhccceeeccCCCceeEEeccCC Confidence 99999999999999999999999999999999999999999999999988888888889999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccc Q lcl|NC_013644. 319 TEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFD 397 (510) Q Consensus 319 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~ 397 (510) .++++.++++|+++||.+|++|++++.+ +||+||+||+++++++++||.++++.|+++|++++++|+.+++. ..+ T Consensus 314 ~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~----~~d 389 (474) T protein:vir:96 314 VASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKI----KLD 389 (474) T ss_pred HHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----Ccc Confidence 9999999999999999999999988765 46899999999999999999999999999999999999998754 346 Q ss_pred cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHHHHHHHHhhhccCCCCC Q lcl|NC_013644. 398 PTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWEDVKEALEEAEYTKGLSD 476 (510) Q Consensus 398 ~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~ 476 (510) ..+|+|+|++++|.|+++.+++++ .+|+||+||+++++|+++|+++ .++++++++.. .+..+ ...+. T Consensus 390 ~~~i~i~f~~~~p~~~~e~a~~~~---~~giiS~et~~~~lp~v~D~~~E~eri~~E~~~~-------~~~~~--~~~~~ 457 (474) T protein:vir:96 390 AKEIEITFNFNVMVNDLEQSQIGA---QSQYLSKETLVRHHPWVDDPKAELERLDEEQLEL-------NKQLP--NLDDG 457 (474) T ss_pred cceeeEEecCCCccCHHHHHHHHH---HcCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHH-------Hhhcc--ccccc Confidence 778999999999999999999865 3699999999999999988643 23333332211 11111 11111 Q ss_pred CCCC-cccCCCCCCccc Q lcl|NC_013644. 477 NTDE-EETAVNPDDPTQ 492 (510) Q Consensus 477 ~~~~-~~~~~~~~~~~~ 492 (510) +.+. .++++.++++.+ T Consensus 458 ~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:96 458 GADGAQQQQQSENNQSK 474 (474) T ss_pred cCCCCCCcCCCCccccC Confidence 1111 111111111111 No 10 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=100.00 E-value=2.7e-95 Score=539.07 Aligned_cols=453 Identities=21% Similarity=0.341 Sum_probs=366.6 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) .-..+..+...+.++|.++|++|. .++.++.++++||.|+|+|++++.+.. ........++++||++||+++||+ T Consensus 17 ~~~~~~~~~~~~~~~i~~~i~~~~--~~~~~~~~l~~Yy~g~~~i~~~~~~~~---~~~~~~~~~~~~ki~~n~~k~Iv~ 91 (474) T protein:vir:95 17 VVEQMKPKVETQEEMIIRLINNHK--QKLKDINVGQKYYDKDNDINYQAYKQD---LHGNIDYTKPDWRITTNFHQNLVD 91 (474) T ss_pred hhhhccccccchHHHHHHHHHHHH--HHHHHHHHHHHHhcccCccccccchhh---hcccccccccccccccchHHHHHH Confidence 111233455677888999999996 456689999999999999998865422 222345577899999999999999 Q ss_pred HHHhhhhcCCceeccCcHHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCC- Q lcl|NC_013644. 81 QKTQYLLSNPVEYETENEELKEYLAEYYNSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYN- 159 (510) Q Consensus 81 ~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~- 159 (510) +.++||||+||+|+++++++.+.|+.|++|++++++.++++.++++|+||+++|+|++|++++++++|+++||+||++. T Consensus 92 ~~~~yl~g~p~~~~~~~~~~~~~l~~~~~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~ 171 (474) T protein:vir:95 92 QKVSYVAGKPVTYAHDDDKVLDVIHQVLDTRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRVPAEQAIPIWTDKER 171 (474) T ss_pred hhhhhhcccCceeccCChHHHHHHHHHHhccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEEcCCCC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999998754 Q ss_pred -CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEE Q lcl|NC_013644. 160 -ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFY 238 (510) Q Consensus 160 -~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv 238 (510) ++.++++.|.. . ...++++|+++++++|...+++....... ..........+|+||+|||| T Consensus 172 ~~~~a~ir~~~~--~------~~~~~~vy~~~~i~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~vPvv 233 (474) T protein:vir:95 172 EQLNAFIRIFTF--N------GETKVEYWTAETVTYYVYENGGLIPDFYY----------GDEHIQTHFSTGSWERVPFI 233 (474) T ss_pred CceEEEEEEEee--c------CeeEEEEEeCCeEEEEEEcCCceeecccc----------ccccccCcccccCCCccceE Confidence 56566555432 1 24578999999999999876654322111 11223335668999999999 Q ss_pred EecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeEEeecCC Q lcl|NC_013644. 239 RLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVTIP 318 (510) Q Consensus 239 ~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (510) +|+|+++|.|+|+++++|||+||.++|++++.+++|++|++|++|+++++..++...++.++++.++++++++|++++.+ T Consensus 234 ~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~~~ 313 (474) T protein:vir:95 234 AFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSEFMEGLKYYKAINVSSDGGVETIQVEVP 313 (474) T ss_pred EecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccchhhhhhccceeeccCCCceeEEeccCC Confidence 99999999999999999999999999999999999999999999999988888888889999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccc Q lcl|NC_013644. 319 TEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFD 397 (510) Q Consensus 319 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~ 397 (510) .++++.++++|+++||.+|++|++++.+ +||+||+||+++++++++||.++++.|+++|++++++|+.+++. ..+ T Consensus 314 ~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~----~~d 389 (474) T protein:vir:95 314 VASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKI----KLD 389 (474) T ss_pred HHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----Ccc Confidence 9999999999999999999999988765 46899999999999999999999999999999999999998754 346 Q ss_pred cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHHHHHHHHhhhccCCCCC Q lcl|NC_013644. 398 PTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWEDVKEALEEAEYTKGLSD 476 (510) Q Consensus 398 ~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~ 476 (510) ..+|+|+|++++|.|+++.+++++ .+|+||+||+++++|+++|+++ .++++++++.. .+..+ ...+. T Consensus 390 ~~~i~i~f~~~~p~~~~e~a~~~~---~~giiS~et~~~~lp~v~D~~~E~eri~~E~~~~-------~~~~~--~~~~~ 457 (474) T protein:vir:95 390 AKEIEITFNFNVMVNDLEQSQIGA---QSQYLSKETLVRHHPWVDDPKAELERLDEEQLEL-------NKQLP--NLDDG 457 (474) T ss_pred cceeeEEecCCCccCHHHHHHHHH---HcCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHH-------Hhhcc--ccccc Confidence 778999999999999999999865 3699999999999999988643 23333332211 11111 11111 Q ss_pred CCCC-cccCCCCCCccc Q lcl|NC_013644. 477 NTDE-EETAVNPDDPTQ 492 (510) Q Consensus 477 ~~~~-~~~~~~~~~~~~ 492 (510) +.+. .++++.++++.+ T Consensus 458 ~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:95 458 GADGAQQQQQSENNQSK 474 (474) T ss_pred cCCCCCCcCCCCccccC Confidence 1111 111111111111 No 11 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=100.00 E-value=2.8e-95 Score=538.94 Aligned_cols=432 Identities=24% Similarity=0.397 Sum_probs=364.1 Q ss_pred hhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCC Q lcl|NC_013644. 11 IIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNP 90 (510) Q Consensus 11 ~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p 90 (510) ++.+.|.++|++|.. ++.++.++++||.|+|+|+++........ .....++++|+++||+++||++.++||||+| T Consensus 1 l~~~~i~~~i~~~~~--~~~r~~~~~~YY~g~~~i~~~~~~~~~~~---~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p 75 (451) T protein:vir:10 1 MELEKIRAIISADAA--RRQEILQAKSYYYNKNDILKKGVVVQNRD---ENPLRNADNRISHNFHEILVDEKASYMFTYP 75 (451) T ss_pred CCHHHHHHHHHHHHH--HHHHHHHHHHHhcccCccccccccccccc---cccccccccccccchHHHHHHhhhhheeccc Confidence 889999999999874 56789999999999999998876544332 2345788999999999999999999999999 Q ss_pred ceeccCc-HHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCC--------CceEEEEEcccceEEEEcCCC-- Q lcl|NC_013644. 91 VEYETEN-EELKEYLAEYYNSEFQVVLQELVEGSSQKGFEYVYARTNAE--------DRLCFQVADSLNVFGVYNEYN-- 159 (510) Q Consensus 91 ~~~~~~d-~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~--------g~~~i~~~~p~~~~~~~d~~~-- 159 (510) |+|++++ +...+.|+.|++|++++++.++++.++++|+||+++|+|++ |++++.+++|+++||+||++. T Consensus 76 ~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~vydd~~~~ 155 (451) T protein:vir:10 76 VLFDIDNNKELNEKVTDVLGNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIPIYRNGIER 155 (451) T ss_pred ceeecCCcHHHHHHHHHHhccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcccceEEEEcCCCCC Confidence 9998755 66778899999999999999999999999999999999975 789999999999999998754 Q ss_pred CceeEEEEEEEEEeeCCc--eeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccE Q lcl|NC_013644. 160 ELQRICRHYITEIEKDGE--TVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPF 237 (510) Q Consensus 160 ~~~~~~~~~~~~~~~~~~--~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 237 (510) ++.+++|+|.....+++. ...+.++++||+..+++|+...++... ........+|+||+||| T Consensus 156 ~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~g~vPv 219 (451) T protein:vir:10 156 ELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCG----------------SQIEHITVQHRFNSVPF 219 (451) T ss_pred ceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccc----------------cccccccccCCCCeeeE Confidence 577777777665554432 345788999999999999876544321 11233566899999999 Q ss_pred EEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeecc-----CCCceeE Q lcl|NC_013644. 238 YRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTG-----SDGGLDV 312 (510) Q Consensus 238 v~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~ 312 (510) |+|+||+.|.|+|+++++|||+||.++|++++.++++++|+++++|+++++..+....++..+++.+. ++++|+| T Consensus 220 v~~~nn~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 299 (451) T protein:vir:10 220 VEFSNNIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSEFLKELKRYKTIKTETDSEGDSGGLKT 299 (451) T ss_pred EEeccCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhhHHHHhhCCeEEecCcCCccCCcceE Confidence 99999999999999999999999999999999999999999999999988888888888888777665 4578999 Q ss_pred EeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_013644. 313 KTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRY 392 (510) Q Consensus 313 ~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~ 392 (510) ++|+.+.++++.++++|.++||.+|++|++++..+||+||+||+++++++++||++|++.|+++|++++++|+.+++.. T Consensus 300 l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~- 378 (451) T protein:vir:10 300 MQIEIPTEARKIILEILKKQIYESGQGLQQDTENFGNASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLGVT- 378 (451) T ss_pred EeecCCHHHHHHHHHHHHHHHHHHhCcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC- Confidence 9999999999999999999999999999999888899999999999999999999999999999999999999998643 Q ss_pred CCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHH-HHHHHHHHHHHHHHHHHHHhhhcc Q lcl|NC_013644. 393 TKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVL-RLICEQFDLDWEDVKEALEEAEYT 471 (510) Q Consensus 393 ~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~-~~~~e~~e~~~~~~~~~~~~~~~~ 471 (510) +..+++|+|++++|+|+++.+++++++ +|+||+||+++++|+++|++++ +++.++++++.... .+. . T Consensus 379 ----d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~~p~v~d~~~e~~~~~ee~~~~~~~~-----~~~-~ 446 (451) T protein:vir:10 379 ----DYKKIQQTYTRNMMSNDLEDADIATKS--VGIIPTKIILRHHPWVDDVEEAEKLYLEEKKIQASKV-----SDD-Y 446 (451) T ss_pred ----CccceeEEecCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHH-----Hhh-c Confidence 567899999999999999999999987 4889999999999999987543 33333332221111 111 1 Q ss_pred CCCCC Q lcl|NC_013644. 472 KGLSD 476 (510) Q Consensus 472 ~~~~~ 476 (510) +++++ T Consensus 447 ~~~~~ 451 (451) T protein:vir:10 447 NNFTE 451 (451) T ss_pred CCCCC Confidence 12222 No 12 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=100.00 E-value=1.2e-94 Score=535.59 Aligned_cols=455 Identities=20% Similarity=0.321 Sum_probs=370.0 Q ss_pred CCCc--cCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEAL--LSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~--~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) ..++ ...+.+.+.+.|.++|++|. .++.++.++++||.|+|+|+++++.... .......++++|+++||+++| T Consensus 23 ~~~~~~~~~~~e~~~~~i~~~i~~~~--~~~~r~~~l~~YY~g~~~i~~~~~~~~~---~~~~~~~~~~~ki~~n~~k~I 97 (483) T protein:vir:12 23 FDAIVRTNNKPETLEEMIVRYIKQHL--EKLPEISIGQEYYEQRPDIVKEPKPVDA---TGAVDPLKPDDRMITNFHANL 97 (483) T ss_pred hhcccccCCchhhHHHHHHHHHHHHH--HHHHHHHHHHHHhccccccccccccccc---cccccccccccccccchHHHH Confidence 1122 23455678889999999996 3556799999999999999988765432 223456778999999999999 Q ss_pred HHHHHhhhhcCCceeccCcHHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCC Q lcl|NC_013644. 79 VDQKTQYLLSNPVEYETENEELKEYLAEYYNSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEY 158 (510) Q Consensus 79 v~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~ 158 (510) |++.++||+|+||+|+++++++.+.|++|++|++++++.++++.++++|+||+++|.|++|++++++++|+++||+||++ T Consensus 98 vd~~~~~l~G~p~~~~~~d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~y~~v~~d~d~~~~i~~~~p~~~~~v~d~~ 177 (483) T protein:vir:12 98 VDQKVSYIVGKPIAFKHTDDEVVKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDK 177 (483) T ss_pred HHHHhhhhcccCceeccCChHHHHHHHHHHhccHHHHHHHHHHHHhhCCeEEEEEEEcCCCceEEEEEcccceEEEEcCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999865 Q ss_pred --CCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCccc Q lcl|NC_013644. 159 --NELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIP 236 (510) Q Consensus 159 --~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP 236 (510) .++.+++++|... + ..++++|++.++++|...++.......... ........+|+||+|| T Consensus 178 ~~~~~~~~ir~~~~~--~------~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~g~vP 239 (483) T protein:vir:12 178 EHEELEAFIRMYKLE--N------ETKVEYWDKVTVNYYVYENGSLIPDYSNNL----------ENSKTHFSTGSWGKIP 239 (483) T ss_pred CCCceEEEEEEEEee--c------ceEEEEEecCeEEEEEEeCCeeeecccccc----------cccccccccCCCCccc Confidence 4566666665432 1 246899999999999877655433222111 1223346789999999 Q ss_pred EEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeEEeec Q lcl|NC_013644. 237 FYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVT 316 (510) Q Consensus 237 vv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (510) ||+|+||++|+|+|+++++|||+||.++|++++.++++++|++|++|++.++..++...++..+++.++++++++|++++ T Consensus 240 vv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 319 (483) T protein:vir:12 240 FIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVE 319 (483) T ss_pred eEEecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhHHHhhhhccccccCCCCcceEEeec Confidence 99999999999999999999999999999999999999999999999998888888888888889999999999999999 Q ss_pred CCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCc Q lcl|NC_013644. 317 IPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKA 395 (510) Q Consensus 317 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~ 395 (510) ++.++++.++++|++.||.+|++|++++.. +||+||+||++++.++.+||.++++.|+++|++++++|+.+++... T Consensus 320 ~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~~~--- 396 (483) T protein:vir:12 320 VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG--- 396 (483) T ss_pred CCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--- Confidence 999999999999999999999999988765 4789999999999999999999999999999999999999887643 Q ss_pred cccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHHHHHHHHhhhccCCC Q lcl|NC_013644. 396 FDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWEDVKEALEEAEYTKGL 474 (510) Q Consensus 396 ~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~~~~~~~~~~~~~~~ 474 (510) +..+++|+|++++|+|+++.+++++++ +|+||+||+++++|+++|+++ .++++++++.. .... +... T Consensus 397 -~~~~i~v~f~~~~p~~~~~~a~~~~kl--~GiiS~et~~~~~~~v~d~~~E~~ri~~E~~~~------~~~~---~~~~ 464 (483) T protein:vir:12 397 -EHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEY------NKQL---PNLD 464 (483) T ss_pred -ccceeeEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHH------Hhhc---cccc Confidence 567899999999999999999999877 589999999999999988643 22222222211 1111 1111 Q ss_pred CCCCCCcccCCCCCCcccc Q lcl|NC_013644. 475 SDNTDEEETAVNPDDPTQQ 493 (510) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~ 493 (510) +.+.+...+..+.++.++| T Consensus 465 ~~~~d~~~~~~~~~~~e~e 483 (483) T protein:vir:12 465 DGGADGAQQQERSNNKESE 483 (483) T ss_pred ccccCCcccCCCCCcccCC Confidence 1222211111122211111 No 13 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=100.00 E-value=9.2e-95 Score=536.13 Aligned_cols=462 Identities=16% Similarity=0.235 Sum_probs=364.7 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |-.... +...+.+.|.++|.+|...+ +.+++++++||.|+|+|+.+... .....++++|+++||+++||+ T Consensus 31 ~~~~e~-~~~~~~~~i~~~i~~~~~~~-~~r~~~l~~Yy~g~~~i~~~~~~--------~~~~~~~~~ki~~n~~k~Iv~ 100 (511) T protein:vir:96 31 YDGTES-DLLQNVNEVSKYIEHHMDYQ-RPRLKVLSDYYEGKTKNLVELTR--------RKEEYMADNRVAHDYASYISD 100 (511) T ss_pred cchhhh-hhhccHHHHHHHHHHHHHhh-HHHHHHHHHHhcccCccccccCc--------CcccccCcceeecchHHHHHH Confidence 222222 22335677899999987554 45799999999999999876543 234567889999999999999 Q ss_pred HHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCC Q lcl|NC_013644. 81 QKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYN 159 (510) Q Consensus 81 ~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~ 159 (510) +.++||+|+||+|+++++++.+.|++||+ |+++.++.++++.++++|+||+++|.|++|++++++++|+++||+||++. T Consensus 101 ~~~~yl~g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~~p~~~~~vydd~~ 180 (511) T protein:vir:96 101 FINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTI 180 (511) T ss_pred HHHhhhccCCceeecCchHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCC Confidence 99999999999999999999999999995 78999999999999999999999999999999999999999999999764 Q ss_pred --CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccE Q lcl|NC_013644. 160 --ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPF 237 (510) Q Consensus 160 --~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 237 (510) ++.+++++|.....++.....+.++++||++++++|...++++.... .......+|+||.||| T Consensus 181 ~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~vPv 245 (511) T protein:vir:96 181 ERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLT---------------PRENGFESHSFERMPI 245 (511) T ss_pred CCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccc---------------ccccccccccCCceee Confidence 46677777777777777777888999999999999988766543321 1223457899999999 Q ss_pred EEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCee------------eecc Q lcl|NC_013644. 238 YRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKV------------VGTG 305 (510) Q Consensus 238 v~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~------------~~~~ 305 (510) |+|+|++.|+|+|+++++|||+||.++|++++.++++++|++|++|+...+..+.........+ ...+ T Consensus 246 v~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (511) T protein:vir:96 246 TEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETE 325 (511) T ss_pred EEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCCchhhcccccccceecccccccccccccCC Confidence 9999999999999999999999999999999999999999999999766555544333222211 1245 Q ss_pred CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 306 SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLV 384 (510) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i 384 (510) ++++++|++++.+.++++.++++|.+.||.+|++|++++.+ +||+||+||+++++++.+||.+|++.|+++|++++++| T Consensus 326 ~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li 405 (511) T protein:vir:96 326 GSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLL 405 (511) T ss_pred CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 67889999999999999999999999999999999988765 47899999999999999999999999999999999999 Q ss_pred HHHHhhccCC--ccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHHH Q lcl|NC_013644. 385 IDDINRRYTK--AFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWEDV 461 (510) Q Consensus 385 ~~~~~~~~~~--~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~~ 461 (510) +.+++..+.. ..+..+++|+|++++|.|.++.+++++++ +|+||+||+++++|+++|+++ .++++++++.. . T Consensus 406 ~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl--~G~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~---~ 480 (511) T protein:vir:96 406 ETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKES---I 480 (511) T ss_pred HHHHHhhcCcccccccccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHH---H Confidence 9998876554 44566899999999999999999999876 689999999999999988643 33333332211 1 Q ss_pred HHHHHhhhccCCCCCCCCCcccCCCCCCccccc Q lcl|NC_013644. 462 KEALEEAEYTKGLSDNTDEEETAVNPDDPTQQM 494 (510) Q Consensus 462 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (510) ...+. .......+..+.+..+.......+++ T Consensus 481 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:96 481 KKAQK--GIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHhh--ccccCCCCCCCCCCCCcccccccccC Confidence 11111 11111111111111111111111111 No 14 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=100.00 E-value=1.3e-94 Score=535.35 Aligned_cols=462 Identities=16% Similarity=0.226 Sum_probs=365.1 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |-.... +...+.+.|.++|.+|...+ +.+++++++||.|+|+|+.+.... ....++++|+++||+++||+ T Consensus 31 ~~~~e~-~~~~~~~~i~~~i~~~~~~~-~~r~~~l~~Yy~g~~~il~~~~~~--------~~~~~~~~ki~~n~~k~Iv~ 100 (511) T protein:vir:93 31 YDGTES-DLLQNVNEVSKYIEHHMDYQ-RPRLKVLSDYYEGKTKNLVELTRR--------KEEYMADNRVAHDYASYISD 100 (511) T ss_pred ccchhh-hhhccHHHHHHHHHHHHHhh-HHHHHHHHHHhcccCccccccCcC--------cccccCcceeecchHHHHHH Confidence 222222 12234677889999987554 457999999999999998765432 34567889999999999999 Q ss_pred HHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCC Q lcl|NC_013644. 81 QKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYN 159 (510) Q Consensus 81 ~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~ 159 (510) +.++||+|+||+|+++++++.+.|++||+ |+++.++.++++.++++|+||+++|.|++|++++++++|+++||+||++. T Consensus 101 ~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de~~~~~i~~~~p~~~~~vydd~~ 180 (511) T protein:vir:93 101 FINGYFLGNPIQYQDDDKDVLEVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTI 180 (511) T ss_pred HHhhhhcccCeeeccCChHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCC Confidence 99999999999999999999999999995 78999999999999999999999999999999999999999999999764 Q ss_pred --CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccE Q lcl|NC_013644. 160 --ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPF 237 (510) Q Consensus 160 --~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 237 (510) ++.+++++|.....++.....+.++++||++.+++|...+++..... .......+|+||.||| T Consensus 181 ~~~~~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~---------------~~~~~~~~~~~g~vPv 245 (511) T protein:vir:93 181 ERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLT---------------PRENGFESHSFERMPI 245 (511) T ss_pred CCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccc---------------cccccccccCCCccce Confidence 57778888877777777777888999999999999987765543221 1223456899999999 Q ss_pred EEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCee------------eecc Q lcl|NC_013644. 238 YRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKV------------VGTG 305 (510) Q Consensus 238 v~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~------------~~~~ 305 (510) |+|+|+++|+|+|+++++|||+||.++|++++.++++++|+++++|+.+.+..+.........+ .... T Consensus 246 v~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (511) T protein:vir:93 246 TEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETE 325 (511) T ss_pred EEecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcccCchhhcccccccceecccccccccccccCC Confidence 9999999999999999999999999999999999999999999999876665554333222211 1245 Q ss_pred CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 306 SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLV 384 (510) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i 384 (510) ++++++|++++.+.++++.++++|.+.|+.+|++|++++.+ +||+||+||+++++++.+||.+|++.|+++|++++++| T Consensus 326 ~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li 405 (511) T protein:vir:93 326 GSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLL 405 (511) T ss_pred CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 67889999999999999999999999999999999988765 48999999999999999999999999999999999999 Q ss_pred HHHHhhccCC--ccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHHH Q lcl|NC_013644. 385 IDDINRRYTK--AFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWEDV 461 (510) Q Consensus 385 ~~~~~~~~~~--~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~~ 461 (510) +.+++..+.. ..+..+++++|++++|.|.++.+++++++ +|+||+||+++++|+++|+++ .++++++++.+. T Consensus 406 ~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~--- 480 (511) T protein:vir:93 406 ETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESI--- 480 (511) T ss_pred HHHHHhccCcccccccccceEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHH--- Confidence 9998876654 34566899999999999999999999877 589999999999999998643 333333333211 Q ss_pred HHHHHhhhccCCCCCCCCCcccCCCCCCccccc Q lcl|NC_013644. 462 KEALEEAEYTKGLSDNTDEEETAVNPDDPTQQM 494 (510) Q Consensus 462 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (510) ...+ ........+..+.+..+...+...+++ T Consensus 481 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:93 481 KKAQ--KGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHh--hhcccCCCCCCCCCCCCcccccccccC Confidence 1111 111111111111111111111111111 No 15 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=100.00 E-value=1.7e-94 Score=534.68 Aligned_cols=455 Identities=20% Similarity=0.319 Sum_probs=369.2 Q ss_pred CCCc--cCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEAL--LSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~--~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) ..++ ...+.+.+.+.|.++|++|. .++.++.++++||.|+|+|+++++.... .......++++|+++||+++| T Consensus 32 ~~~~~~~~~~~~~~~~~i~~~i~~~~--~~~~r~~~l~~YY~g~~~I~~~~~~~~~---~~~~~~~~~~~ri~~n~~k~I 106 (492) T protein:vir:94 32 FDAIVRTNNKPETLEEMIVRYIKQHL--EKLPEISIGQEYYEQRPDIVKEPKPVDA---TGAVDPLKPDDRMITNFHANL 106 (492) T ss_pred hhcccccCCchhhHHHHHHHHHHHHH--HHHHHHHHHHHHhccccccccccccccc---cccccccccccccccchHHHH Confidence 3333 34566888999999999986 3556899999999999999988765432 223456788999999999999 Q ss_pred HHHHHhhhhcCCceeccCcHHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCC Q lcl|NC_013644. 79 VDQKTQYLLSNPVEYETENEELKEYLAEYYNSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEY 158 (510) Q Consensus 79 v~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~ 158 (510) |++.++||+|+||+|+++++.+.+.|+.|++|++++.+.+++++++++|+||+++|.|++|++++++++|.++||+||++ T Consensus 107 vd~~~~yl~G~p~~~~~~d~~~~~~l~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~~~~~~p~~~~~v~d~~ 186 (492) T protein:vir:94 107 VDQKVSYIVGKPIAFKHTDDEVVKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDK 186 (492) T ss_pred HHHHHhhhcccCceeccCchHHHHHHHHHHhccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEEEcccceEEEEcCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999864 Q ss_pred --CCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCccc Q lcl|NC_013644. 159 --NELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIP 236 (510) Q Consensus 159 --~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP 236 (510) .++.+++++|... + ..++++|++.++++|....+.......... ........+|+||+|| T Consensus 187 ~~~~~~a~ir~~~~~--~------~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~g~vP 248 (492) T protein:vir:94 187 EHEELEAFIRMYKLE--N------ETKVEYWDKVTVNYYVYENGSLIPDYSNNL----------ENSKTHFSTGSWGKIP 248 (492) T ss_pred CCCceEEEEEEEeec--c------ceeEEEEecCeEEEEEEecCeeeecccccc----------ccccccccccCCCccc Confidence 3566666655432 1 246899999999999887665543322211 1223345789999999 Q ss_pred EEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeEEeec Q lcl|NC_013644. 237 FYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVT 316 (510) Q Consensus 237 vv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (510) ||+|+||++|+|+|+++++|||+||.++|++++.++++++|++|++|++.++..++...++..+++.++++++++|++++ T Consensus 249 vv~~~nn~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 328 (492) T protein:vir:94 249 FIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVE 328 (492) T ss_pred eEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhHHHHhhccceecCCCCcceeEecc Confidence 99999999999999999999999999999999999999999999999998888888888888999999999999999999 Q ss_pred CCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCc Q lcl|NC_013644. 317 IPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKA 395 (510) Q Consensus 317 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~ 395 (510) .+.++++.++++|++.||.+|++|++++.. +||+||+||++++++|.+||.++++.|+++|++++++|+.+++... T Consensus 329 ~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~--- 405 (492) T protein:vir:94 329 VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG--- 405 (492) T ss_pred CCHHHHHHHHHHHHHHHHHHhCCcCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc--- Confidence 999999999999999999999999988765 4789999999999999999999999999999999999999987653 Q ss_pred cccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHHHHHHHHhhhccCCC Q lcl|NC_013644. 396 FDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWEDVKEALEEAEYTKGL 474 (510) Q Consensus 396 ~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~~~~~~~~~~~~~~~ 474 (510) +..+++|+|++++|.|+++.+++++++ +|++|+||+++++|+++|+++ .++++++++... +..+ ... T Consensus 406 -~~~~i~v~f~~~~p~~~~e~~~~~~kl--~giiS~et~~~~l~~v~d~~~E~eri~~E~~~~~-------~~~~--~~~ 473 (492) T protein:vir:94 406 -EHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYN-------KQLP--NLD 473 (492) T ss_pred -ccceeeEEecCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHH-------hhcc--ccc Confidence 456899999999999999999999877 589999999999999988643 222222222111 1111 111 Q ss_pred CCCCCCcccCCCCCCccccc Q lcl|NC_013644. 475 SDNTDEEETAVNPDDPTQQM 494 (510) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~~ 494 (510) ..+.+....+...+ +.+.+ T Consensus 474 ~~~~~~~~~~~~~~-~~e~e 492 (492) T protein:vir:94 474 DGGADSAQQQERSN-NKESE 492 (492) T ss_pred cccCCCCccccCCc-cccCC Confidence 11111111111111 11111 No 16 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=100.00 E-value=1.9e-94 Score=534.40 Aligned_cols=462 Identities=16% Similarity=0.226 Sum_probs=364.6 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |-... .+...+.+.|.++|.+|...++ .+++++++||.|+|+|+++... .....++++|+++||+++||+ T Consensus 31 ~~~~~-~~~~~~~~~i~~~i~~~~~~~~-~r~~~l~~Yy~g~~~i~~~~~~--------~~~~~~~~~ki~~n~~k~Iv~ 100 (511) T protein:vir:10 31 YDGTE-SDLLQNVNEVSKCIEHHMDYQR-PRLKVLSDYYEGKTKNLVELTR--------RKEEYMADNRVAHDYASYISD 100 (511) T ss_pred Cchhh-hhcccCHHHHHHHHHHHHHhhH-HHHHHHHHHhcccCccccccCc--------ccccccCcceeecchHHHHHH Confidence 21111 1223345778889988875544 4799999999999999876543 234567889999999999999 Q ss_pred HHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCC Q lcl|NC_013644. 81 QKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYN 159 (510) Q Consensus 81 ~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~ 159 (510) +.++||+|+||+|+++++++.+.|++||+ |+++.++.++++.++++|+||+++|.|++|++++++++|.++||+||++. T Consensus 101 ~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~dedg~~~i~~~~p~~~~~vydd~~ 180 (511) T protein:vir:10 101 FINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIRNQDDETRLYKSDAMSTFVIYDNTI 180 (511) T ss_pred HHhhhhcccCceeecCchHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCC Confidence 99999999999999999999999999995 78999999999999999999999999999999999999999999999765 Q ss_pred --CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccE Q lcl|NC_013644. 160 --ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPF 237 (510) Q Consensus 160 --~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 237 (510) ++.+++++|.....++.....+.++++||++++++|...++++.... .......+|+||.||| T Consensus 181 ~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~vPv 245 (511) T protein:vir:10 181 ERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLT---------------PRENGFESHSFERMPI 245 (511) T ss_pred CCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCCccccc---------------ccccccccccCcceeE Confidence 47777788877777777777888999999999999988765543221 1223456899999999 Q ss_pred EEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCee------------eecc Q lcl|NC_013644. 238 YRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKV------------VGTG 305 (510) Q Consensus 238 v~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~------------~~~~ 305 (510) |+|+|+++|+|+|+++++|||+||.++|++++.++++++|++|++|+...+..+.........+ ...+ T Consensus 246 v~f~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (511) T protein:vir:10 246 TEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETE 325 (511) T ss_pred EEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccCCchhhccchhccceecccccccccccccCC Confidence 9999999999999999999999999999999999999999999999766555554433222211 1235 Q ss_pred CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 306 SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLV 384 (510) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i 384 (510) ++++++|++++.+.++++.++++|.++||.+|++|++++.+ +||+||+||+++++++.+||.+|++.|+++|++++++| T Consensus 326 ~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li 405 (511) T protein:vir:10 326 GSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLL 405 (511) T ss_pred CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 57889999999999999999999999999999999988765 47899999999999999999999999999999999999 Q ss_pred HHHHhhccC--CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHH-HHHHHHHHHHHHHHHH Q lcl|NC_013644. 385 IDDINRRYT--KAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDN-VLRLICEQFDLDWEDV 461 (510) Q Consensus 385 ~~~~~~~~~--~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e-~~~~~~e~~e~~~~~~ 461 (510) +.+++..+. ...+..+++|+|++++|.|.++.+++++++ +|+||+||+++++|+++|++ +.++++++++.. . T Consensus 406 ~~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~G~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~---~ 480 (511) T protein:vir:10 406 ETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKES---I 480 (511) T ss_pred HHHHHhhCCcccccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHH---H Confidence 999987654 345667899999999999999999999987 48899999999999999863 333333333321 1 Q ss_pred HHHHHhhhccCCCCCCCCCcccCCCCCCccccc Q lcl|NC_013644. 462 KEALEEAEYTKGLSDNTDEEETAVNPDDPTQQM 494 (510) Q Consensus 462 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (510) ...+ ........+..+.+..+.......+++ T Consensus 481 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:10 481 KKAQ--KGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHh--hhcccCCCCCCCCCCCCcccCcccccC Confidence 1111 111111111111111111111111111 No 17 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=100.00 E-value=1.8e-94 Score=534.47 Aligned_cols=455 Identities=20% Similarity=0.324 Sum_probs=370.0 Q ss_pred CCCc--cCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEAL--LSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~--~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) -.++ ...+.+++.+.|.++|++|. .++.++.++++||.|+|+|+++++.... .......++++|+++||+++| T Consensus 32 ~~~~~~~~~~~~~~~~~i~~~i~~~~--~~~~r~~~l~~YY~g~~~i~~~~~~~~~---~~~~~~~~~~~ri~~n~~k~I 106 (492) T protein:vir:97 32 FDAIVRTNNKPETLEEMIVRYIKQHL--EKLPEISIGQEYYEQRPDIVKEPKPVDA---TGAVDPLKPDDRMITNFHANL 106 (492) T ss_pred hhhcccCCCchhhHHHHHHHHHHHHH--HHHHHHHHHHHHhcccCccccccccccc---cccccccccccccccchHHHH Confidence 2222 23466788889999999986 4567899999999999999988765432 223456778999999999999 Q ss_pred HHHHHhhhhcCCceeccCcHHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCC Q lcl|NC_013644. 79 VDQKTQYLLSNPVEYETENEELKEYLAEYYNSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEY 158 (510) Q Consensus 79 v~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~ 158 (510) |++.++||+|+||+|+++++...+.|++|++|++++++.+++++++++|+||+++|.|++|++++++++|+++||+||++ T Consensus 107 vd~~~~yl~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~dg~~~~~~~~p~~~~~i~d~~ 186 (492) T protein:vir:97 107 VDQKVSYIVGKPIAFKHTDDEVVKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDK 186 (492) T ss_pred HHHHhhhhcccCceeccCchHHHHHHHHHHhccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEEcCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999964 Q ss_pred --CCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCccc Q lcl|NC_013644. 159 --NELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIP 236 (510) Q Consensus 159 --~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP 236 (510) .++.+++++|... + ..++++|+++.+++|....+.......... ........+|+||.|| T Consensus 187 ~~~~~~~~vr~~~~~--~------~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~g~vP 248 (492) T protein:vir:97 187 EHEELEAFIRMYKLE--N------ETKVEYWDKVTVNYYVYENGSLIPDYSNNL----------ENSKTHFSTGSWGKIP 248 (492) T ss_pred CCCceEEEEEEEeec--c------ceeEEEEecCeEEEEEEecCeeeecccccc----------cccccccccCCCCCcc Confidence 4566666665432 1 246899999999999887665433222111 1223345789999999 Q ss_pred EEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeEEeec Q lcl|NC_013644. 237 FYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVT 316 (510) Q Consensus 237 vv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (510) ||+|+||++|+|+|+++++|||+||.++|++++.++++++|++|++|++..+..++...++..+++.++++++++|++++ T Consensus 249 vv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 328 (492) T protein:vir:97 249 FIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVE 328 (492) T ss_pred eEEecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhHHHHHhhccceecCCCCcceeEecc Confidence 99999999999999999999999999999999999999999999999998888888888899999999999999999999 Q ss_pred CCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCc Q lcl|NC_013644. 317 IPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKA 395 (510) Q Consensus 317 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~ 395 (510) .+.++++.++++|+++|+.+|++|++++.. +||+||+||++++++|.+||.++++.|+++|++++++|+.++++.. T Consensus 329 ~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~--- 405 (492) T protein:vir:97 329 VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG--- 405 (492) T ss_pred CCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc--- Confidence 999999999999999999999999988765 4789999999999999999999999999999999999999887643 Q ss_pred cccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHHHHHHHHhhhccCCC Q lcl|NC_013644. 396 FDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWEDVKEALEEAEYTKGL 474 (510) Q Consensus 396 ~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~~~~~~~~~~~~~~~ 474 (510) +..+++|+|++++|+|+++.+++++++ +|+||+||+++++|+++|+++ .++++++.+.. .. ..+ ... T Consensus 406 -~~~~i~v~f~~~~p~~~~e~a~~~~kl--~G~iS~et~l~~l~~v~d~~~Eleri~~E~~~~------~~-~~~--~~~ 473 (492) T protein:vir:97 406 -EHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQTEY------NK-QLP--NLD 473 (492) T ss_pred -ccceeeEEecCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHH------HH-hhh--ccc Confidence 567899999999999999999999877 589999999999999998643 22222222211 11 111 111 Q ss_pred CCCCCCcccCCCCCCcccc Q lcl|NC_013644. 475 SDNTDEEETAVNPDDPTQQ 493 (510) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~ 493 (510) ..+.+..+++..+++..+| T Consensus 474 ~~~~~~~~~~~~~~~~~~e 492 (492) T protein:vir:97 474 DGGADSAQQQERSNNKESE 492 (492) T ss_pred cCCCCCCcccccccccccC Confidence 1111111111111111111 No 18 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=100.00 E-value=2.1e-94 Score=534.16 Aligned_cols=462 Identities=17% Similarity=0.225 Sum_probs=363.4 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |-.... +...+.+.|.++|.+|+..++ .+++++++||.|+|+|+++... .....++++|+++||+++||+ T Consensus 31 ~~~~e~-~~~~~~~~i~~~i~~~~~~~~-~r~~~l~~Yy~g~~~il~~~~~--------~~~~~~~~~ki~~n~~k~Iv~ 100 (511) T protein:vir:78 31 YDGTES-DLLQNVNEVSKYIEHHMDYQR-PRLKVLSDYYEGKTKNLVELTR--------RKEEYMADNRVAHDYASYISD 100 (511) T ss_pred ccchhh-hhhcCHHHHHHHHHHHHHhhh-HHHHHHHHHhhccCccccccCc--------ccccccCcceeecchHHHHHH Confidence 222111 122345678888888875544 4799999999999999876542 234567889999999999999 Q ss_pred HHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCC Q lcl|NC_013644. 81 QKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYN 159 (510) Q Consensus 81 ~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~ 159 (510) +.++||+|+||+|+++++++.+.|++||+ |+++.++.++++.++++|+||+++|.|++|++++++++|+++||+||++. T Consensus 101 ~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~~p~~~~~v~dd~~ 180 (511) T protein:vir:78 101 FINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFIIYDNTV 180 (511) T ss_pred HHhhhhcccCceeecCchHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccceEEEEcCCC Confidence 99999999999999999999999999995 78999999999999999999999999999999999999999999999764 Q ss_pred --CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccE Q lcl|NC_013644. 160 --ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPF 237 (510) Q Consensus 160 --~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 237 (510) ++.+++++|.....++.....+.++++||++++++|....+++.... .......+|+||.||| T Consensus 181 ~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~---------------~~~~~~~~~~~g~vPv 245 (511) T protein:vir:78 181 ERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLT---------------PRENSFESHSFERMPI 245 (511) T ss_pred CCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccc---------------ccccccccCcCcccce Confidence 56777788877777777777788999999999999988765543221 1233567899999999 Q ss_pred EEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeee------------ecc Q lcl|NC_013644. 238 YRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVV------------GTG 305 (510) Q Consensus 238 v~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~------------~~~ 305 (510) |+|+|+++|+|+|+++++|||+||.++|++++.++++++|++|++|+...+..+.........++ ... T Consensus 246 v~~~n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (511) T protein:vir:78 246 TEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETE 325 (511) T ss_pred EEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCC Confidence 99999999999999999999999999999999999999999999997666555443322222111 134 Q ss_pred CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc-CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 306 SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGD-GNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLV 384 (510) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i 384 (510) ++++++|++++.+.++++.++++|.++|+.+|++|++++..+ ||+||+||+++++++.+||..|++.|+++|++++++| T Consensus 326 ~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li 405 (511) T protein:vir:78 326 GSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLL 405 (511) T ss_pred CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 578899999999999999999999999999999999987664 7899999999999999999999999999999999999 Q ss_pred HHHHhhccCC--ccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHH-HHHHHHHHHHHHHHHH Q lcl|NC_013644. 385 IDDINRRYTK--AFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDN-VLRLICEQFDLDWEDV 461 (510) Q Consensus 385 ~~~~~~~~~~--~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e-~~~~~~e~~e~~~~~~ 461 (510) +.+++..+.. ..+..+++++|++++|.|+++.+++++++ +|+||+||+++++|+++|++ +.++++++++.. . T Consensus 406 ~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl--~G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~---~ 480 (511) T protein:vir:78 406 ETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKES---I 480 (511) T ss_pred HHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHH---H Confidence 9999876543 45567899999999999999999999987 48899999999999999853 333333332211 1 Q ss_pred HHHHHhhhccCCCCCCCCCcccCCCCCCccccc Q lcl|NC_013644. 462 KEALEEAEYTKGLSDNTDEEETAVNPDDPTQQM 494 (510) Q Consensus 462 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (510) ...+. .......+..+.+..+......++++ T Consensus 481 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:78 481 KKAQK--GIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHhh--ccccCCCCCCCCCCCCCccCcccccC Confidence 11111 11111111111111111111111112 No 19 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=100.00 E-value=2.1e-94 Score=534.16 Aligned_cols=462 Identities=17% Similarity=0.225 Sum_probs=363.4 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |-.... +...+.+.|.++|.+|+..++ .+++++++||.|+|+|+++... .....++++|+++||+++||+ T Consensus 31 ~~~~e~-~~~~~~~~i~~~i~~~~~~~~-~r~~~l~~Yy~g~~~il~~~~~--------~~~~~~~~~ki~~n~~k~Iv~ 100 (511) T protein:vir:96 31 YDGTES-DLLQNVNEVSKYIEHHMDYQR-PRLKVLSDYYEGKTKNLVELTR--------RKEEYMADNRVAHDYASYISD 100 (511) T ss_pred ccchhh-hhhcCHHHHHHHHHHHHHhhh-HHHHHHHHHhhccCccccccCc--------ccccccCcceeecchHHHHHH Confidence 222111 122345678888888875544 4799999999999999876542 234567889999999999999 Q ss_pred HHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCC Q lcl|NC_013644. 81 QKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYN 159 (510) Q Consensus 81 ~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~ 159 (510) +.++||+|+||+|+++++++.+.|++||+ |+++.++.++++.++++|+||+++|.|++|++++++++|+++||+||++. T Consensus 101 ~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~~p~~~~~v~dd~~ 180 (511) T protein:vir:96 101 FINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFIIYDNTV 180 (511) T ss_pred HHhhhhcccCceeecCchHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccceEEEEcCCC Confidence 99999999999999999999999999995 78999999999999999999999999999999999999999999999764 Q ss_pred --CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccE Q lcl|NC_013644. 160 --ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPF 237 (510) Q Consensus 160 --~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 237 (510) ++.+++++|.....++.....+.++++||++++++|....+++.... .......+|+||.||| T Consensus 181 ~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~---------------~~~~~~~~~~~g~vPv 245 (511) T protein:vir:96 181 ERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLT---------------PRENSFESHSFERMPI 245 (511) T ss_pred CCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccc---------------ccccccccCcCcccce Confidence 56777788877777777777788999999999999988765543221 1233567899999999 Q ss_pred EEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeee------------ecc Q lcl|NC_013644. 238 YRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVV------------GTG 305 (510) Q Consensus 238 v~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~------------~~~ 305 (510) |+|+|+++|+|+|+++++|||+||.++|++++.++++++|++|++|+...+..+.........++ ... T Consensus 246 v~~~n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (511) T protein:vir:96 246 TEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETE 325 (511) T ss_pred EEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCC Confidence 99999999999999999999999999999999999999999999997666555443322222111 134 Q ss_pred CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc-CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 306 SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGD-GNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLV 384 (510) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i 384 (510) ++++++|++++.+.++++.++++|.++|+.+|++|++++..+ ||+||+||+++++++.+||..|++.|+++|++++++| T Consensus 326 ~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li 405 (511) T protein:vir:96 326 GSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLL 405 (511) T ss_pred CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 578899999999999999999999999999999999987664 7899999999999999999999999999999999999 Q ss_pred HHHHhhccCC--ccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHH-HHHHHHHHHHHHHHHH Q lcl|NC_013644. 385 IDDINRRYTK--AFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDN-VLRLICEQFDLDWEDV 461 (510) Q Consensus 385 ~~~~~~~~~~--~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e-~~~~~~e~~e~~~~~~ 461 (510) +.+++..+.. ..+..+++++|++++|.|+++.+++++++ +|+||+||+++++|+++|++ +.++++++++.. . T Consensus 406 ~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl--~G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~---~ 480 (511) T protein:vir:96 406 ETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKES---I 480 (511) T ss_pred HHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHH---H Confidence 9999876543 45567899999999999999999999987 48899999999999999853 333333332211 1 Q ss_pred HHHHHhhhccCCCCCCCCCcccCCCCCCccccc Q lcl|NC_013644. 462 KEALEEAEYTKGLSDNTDEEETAVNPDDPTQQM 494 (510) Q Consensus 462 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (510) ...+. .......+..+.+..+......++++ T Consensus 481 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:96 481 KKAQK--GIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHhh--ccccCCCCCCCCCCCCCccCcccccC Confidence 11111 11111111111111111111111112 No 20 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=100.00 E-value=1.8e-93 Score=529.10 Aligned_cols=454 Identities=21% Similarity=0.348 Sum_probs=365.3 Q ss_pred CC----------CccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceecccccccccccccccee Q lcl|NC_013644. 1 ME----------ALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRI 70 (510) Q Consensus 1 ~~----------~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki 70 (510) |. ..+....+++.+.|.++|++|+. +..++.++++||.|+|+|+++.+.... .......++++|| T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~--~~~~~~~~~~YY~g~~~i~~~~~~~~~---~~~~~~~~~~~ki 81 (474) T protein:vir:94 7 MPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRK--QLDKITVGQRYYDKDNDIVKQMKKVDV---HGNIDYDKPDWRI 81 (474) T ss_pred ccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHH--HHHHHHHHHHHhccccchhcccchhcc---ccccccccCccee Confidence 11 11223334677889999999863 567899999999999999987654332 2234567889999 Q ss_pred ccchhHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccc Q lcl|NC_013644. 71 PHGFFPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYNSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLN 150 (510) Q Consensus 71 ~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~ 150 (510) ++||+++||++.++||||+||+|+++++.+.+.|+.|++|++.+++.++++.++++|+||+++|.|++|++++++++|++ T Consensus 82 ~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~ 161 (474) T protein:vir:94 82 TTNFHQNLVDQKVSYVASKPVTYSCEDENVLKVIHDVLDTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAEQ 161 (474) T ss_pred ecchHHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEEEcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEcCC--CCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccc Q lcl|NC_013644. 151 VFGVYNEY--NELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLL 228 (510) Q Consensus 151 ~~~~~d~~--~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 228 (510) ++|+||++ .++.+++++|... ...++++|+++++++|+..+++....... .......... T Consensus 162 ~~~v~d~~~~~~~~~~ir~~~~~--------~~~~~~~yt~~~~~~y~~~~~~~~~~~~~----------~~~~~~~~~~ 223 (474) T protein:vir:94 162 AIPIWVDKEREELKSFIRYYKFN--------NEEKVEFWTDTTVTYYVLENGGLIPDYYY----------GANHVQSHFS 223 (474) T ss_pred eEEEEcCCCCCceEEEEEEEEec--------CeEEEEEEeCCeEEEEEEcCCcccccccc----------CcCccccccc Confidence 99999975 4566666665432 13578999999999999877654332111 1122334567 Q ss_pred cccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCC Q lcl|NC_013644. 229 QRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDG 308 (510) Q Consensus 229 ~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (510) +|+||+||||+|+||+.|+|+|+++++|||+||.++|++++.++++++|+++++|+++++..++..+++.++++.+++++ T Consensus 224 ~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~ 303 (474) T protein:vir:94 224 NGNWGRVPFIAFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLKYYKAINVDGDG 303 (474) T ss_pred ccCCCccceEEecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccceeeccCCC Confidence 89999999999999999999999999999999999999999999999999999999988888888888889999999999 Q ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 309 GLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDD 387 (510) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~ 387 (510) +++|++++++.++++.++++|++.||.+|++|++++.+ +||+||+||+++++++++||.+|++.|+++|++++++|+.+ T Consensus 304 ~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~ 383 (474) T protein:vir:94 304 GVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDF 383 (474) T ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999988665 57899999999999999999999999999999999999998 Q ss_pred HhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 388 INRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWEDVKEALE 466 (510) Q Consensus 388 ~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~~~~~~~ 466 (510) ++.. .+..+++|+|++++|.|+++.++++++ +|+||+||+++++|+++|+++ .++++++++.. .. T Consensus 384 ~~~~----~d~~~i~v~f~~~~p~~~~e~a~~~~~---~g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~-------~~ 449 (474) T protein:vir:94 384 NNLK----TDVKDIEISFNFNRMMNDAEQSQIIAQ---SQYLSRETLVKSSPLVDDYKAELERIEQEQMEY-------NK 449 (474) T ss_pred hCCC----cccceeeEEeccCcccCHHHHHHHHHH---cCCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHH-------Hh Confidence 7653 456789999999999999999988754 588999999999999998643 22222222211 11 Q ss_pred hhhccCCCCCCCCCcccCCCCCCcccc Q lcl|NC_013644. 467 EAEYTKGLSDNTDEEETAVNPDDPTQQ 493 (510) Q Consensus 467 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (510) ..+..+ ..+.+.+.+...+++...| T Consensus 450 ~~~~~~--~~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:94 450 QLPNLD--DGGADGAQQQEGSNNKESE 474 (474) T ss_pred hccccC--CCCCCCcccCCCCcccccC Confidence 111111 1111111111111111111 No 21 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=100.00 E-value=1.8e-93 Score=529.10 Aligned_cols=454 Identities=21% Similarity=0.348 Sum_probs=365.3 Q ss_pred CC----------CccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceecccccccccccccccee Q lcl|NC_013644. 1 ME----------ALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRI 70 (510) Q Consensus 1 ~~----------~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki 70 (510) |. ..+....+++.+.|.++|++|+. +..++.++++||.|+|+|+++.+.... .......++++|| T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~--~~~~~~~~~~YY~g~~~i~~~~~~~~~---~~~~~~~~~~~ki 81 (474) T protein:vir:97 7 MPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRK--QLDKITVGQRYYDKDNDIVKQMKKVDV---HGNIDYDKPDWRI 81 (474) T ss_pred ccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHH--HHHHHHHHHHHhccccchhcccchhcc---ccccccccCccee Confidence 11 11223334677889999999863 567899999999999999987654332 2234567889999 Q ss_pred ccchhHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccc Q lcl|NC_013644. 71 PHGFFPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYNSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLN 150 (510) Q Consensus 71 ~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~ 150 (510) ++||+++||++.++||||+||+|+++++.+.+.|+.|++|++.+++.++++.++++|+||+++|.|++|++++++++|++ T Consensus 82 ~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~ 161 (474) T protein:vir:97 82 TTNFHQNLVDQKVSYVASKPVTYSCEDENVLKVIHDVLDTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAEQ 161 (474) T ss_pred ecchHHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEEEcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEcCC--CCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccc Q lcl|NC_013644. 151 VFGVYNEY--NELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLL 228 (510) Q Consensus 151 ~~~~~d~~--~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 228 (510) ++|+||++ .++.+++++|... ...++++|+++++++|+..+++....... .......... T Consensus 162 ~~~v~d~~~~~~~~~~ir~~~~~--------~~~~~~~yt~~~~~~y~~~~~~~~~~~~~----------~~~~~~~~~~ 223 (474) T protein:vir:97 162 AIPIWVDKEREELKSFIRYYKFN--------NEEKVEFWTDTTVTYYVLENGGLIPDYYY----------GANHVQSHFS 223 (474) T ss_pred eEEEEcCCCCCceEEEEEEEEec--------CeEEEEEEeCCeEEEEEEcCCcccccccc----------CcCccccccc Confidence 99999975 4566666665432 13578999999999999877654332111 1122334567 Q ss_pred cccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCC Q lcl|NC_013644. 229 QRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDG 308 (510) Q Consensus 229 ~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (510) +|+||+||||+|+||+.|+|+|+++++|||+||.++|++++.++++++|+++++|+++++..++..+++.++++.+++++ T Consensus 224 ~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~ 303 (474) T protein:vir:97 224 NGNWGRVPFIAFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLKYYKAINVDGDG 303 (474) T ss_pred ccCCCccceEEecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccceeeccCCC Confidence 89999999999999999999999999999999999999999999999999999999988888888888889999999999 Q ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 309 GLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDD 387 (510) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~ 387 (510) +++|++++++.++++.++++|++.||.+|++|++++.+ +||+||+||+++++++++||.+|++.|+++|++++++|+.+ T Consensus 304 ~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~ 383 (474) T protein:vir:97 304 GVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDF 383 (474) T ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999988665 57899999999999999999999999999999999999998 Q ss_pred HhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 388 INRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWEDVKEALE 466 (510) Q Consensus 388 ~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~~~~~~~ 466 (510) ++.. .+..+++|+|++++|.|+++.++++++ +|+||+||+++++|+++|+++ .++++++++.. .. T Consensus 384 ~~~~----~d~~~i~v~f~~~~p~~~~e~a~~~~~---~g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~-------~~ 449 (474) T protein:vir:97 384 NNLK----TDVKDIEISFNFNRMMNDAEQSQIIAQ---SQYLSRETLVKSSPLVDDYKAELERIEQEQMEY-------NK 449 (474) T ss_pred hCCC----cccceeeEEeccCcccCHHHHHHHHHH---cCCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHH-------Hh Confidence 7653 456789999999999999999988754 588999999999999998643 22222222211 11 Q ss_pred hhhccCCCCCCCCCcccCCCCCCcccc Q lcl|NC_013644. 467 EAEYTKGLSDNTDEEETAVNPDDPTQQ 493 (510) Q Consensus 467 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (510) ..+..+ ..+.+.+.+...+++...| T Consensus 450 ~~~~~~--~~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:97 450 QLPNLD--DGGADGAQQQEGSNNKESE 474 (474) T ss_pred hccccC--CCCCCCcccCCCCcccccC Confidence 111111 1111111111111111111 No 22 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=100.00 E-value=2.2e-93 Score=528.53 Aligned_cols=454 Identities=21% Similarity=0.322 Sum_probs=364.7 Q ss_pred CCCc---------------cCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceecccccccccccc Q lcl|NC_013644. 1 MEAL---------------LSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYA 65 (510) Q Consensus 1 ~~~~---------------~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ 65 (510) |-.| +....+...++|+++|++|+ .+..++.++++||.|+|+|++++...... ......+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~--~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~---~~~~~~~ 75 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRLINDHK--PKIDDITVGERYYNHDPDVLRLAPKLDNK---GEIDPLK 75 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHHHHHHH--HHHHHHHHHHHHhccCCcchhccchhccc---ccccccc Confidence 2222 23444667788999999996 45678999999999999999987654332 2345678 Q ss_pred ccceeccchhHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEE Q lcl|NC_013644. 66 SNVRIPHGFFPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYNSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQV 145 (510) Q Consensus 66 ~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~ 145 (510) +++||++||+++||++.++||||+||+|+++++++.+.|++|++|++.+++.++++.++++|+||+++|+|++|++++++ T Consensus 76 ~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~~y~d~~~~~~i~~ 155 (474) T protein:vir:96 76 PDWRMFTNYHQNLVDQKVAYAVANPVTFSSDDDKSLKTIQEVLNHKWDDKLVDILTAASNKGIEWLQPYIDENGEFKTFR 155 (474) T ss_pred cchhcccchHHHHHHhhhhhhcccCceeecCchHHHHHHHHHHhcCHHHHHHHHHHHHHhcCeeEEEEEecCCCceEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EcccceEEEEcCC--CCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccccccccc Q lcl|NC_013644. 146 ADSLNVFGVYNEY--NELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSE 223 (510) Q Consensus 146 ~~p~~~~~~~d~~--~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (510) ++|+++||+||++ .++.+++++|... ...++++||++.+++|...++........... . ..... T Consensus 156 ~~p~~~~~v~d~~~~~~~~~~vr~~~~~--------~~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~-~-----~~~~~ 221 (474) T protein:vir:96 156 VPAEQAIPIWTNKERDTLKAFIRYYRLD--------GAERVEYWTDSDVTYYEYQDGILIPDYYHGEE-H-----IQSHY 221 (474) T ss_pred EcccceEEEEcCCCCCceEEEEEEEeec--------CceEEEEEeCCeEEEEEecCCceeeccccccc-c-----ccccc Confidence 9999999999974 4566666665422 13468999999999999876654432221111 1 11122 Q ss_pred ccccccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeee Q lcl|NC_013644. 224 NESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVG 303 (510) Q Consensus 224 ~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~ 303 (510) .....+|+||+||||+|+|+++|+|||+++++|||+||.++|++++.++++++|++|++|+++++..++..+++.++++. T Consensus 222 ~~~~~~~~~g~iPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~ 301 (474) T protein:vir:96 222 YVGNKRVSWGRVPFIPFKNNPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYEGQDLDEFMRNLKYYKAIN 301 (474) T ss_pred cccccccCCCceeEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccccchhhhhhcCceEE Confidence 33457899999999999999999999999999999999999999999999999999999999888788888888888888 Q ss_pred cc-CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 304 TG-SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMN 381 (510) Q Consensus 304 ~~-~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~ 381 (510) ++ ++++|+|++++++.++++.++++|+++|+.+|++|++++.+ +||+||+|++++++++++||.+|+++|+++|++++ T Consensus 302 ~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~ 381 (474) T protein:vir:96 302 VDGDGSGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELL 381 (474) T ss_pred ecCCCCceeEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 76 46789999999999999999999999999999999998766 46899999999999999999999999999999999 Q ss_pred HHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHH Q lcl|NC_013644. 382 KLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWED 460 (510) Q Consensus 382 ~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~ 460 (510) ++|+.+++. .++..+++|+|++++|.|+++.++++ +++|+||+||+++++|+++|+++ .++++++++. T Consensus 382 ~~i~~~~~~----~~~~~~i~i~f~~~~p~~~~e~~~~~---~~ag~iS~et~~~~~~~v~d~~~E~~ri~~E~~e---- 450 (474) T protein:vir:96 382 QYIIDFYKL----NIKVQDVEITFNFNVMVNELEQSQIG---VQSQYLSKETVVTNHPWVDDPVAELERIEQDNID---- 450 (474) T ss_pred HHHHHHhCC----CcccceeeEEeccCCCcCHHHHHHHH---HhcCCCchHHHHHhCCCCCCHHHHHHHHHHHHHH---- Confidence 999988753 45677899999999999999999875 45799999999999999988643 2222222111 Q ss_pred HHHHHHhhhccCCCCCCCCCcccCCCCCCccc Q lcl|NC_013644. 461 VKEALEEAEYTKGLSDNTDEEETAVNPDDPTQ 492 (510) Q Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (510) ......+...+. .....+++.++. T Consensus 451 --~~~~~~~~~~~~------~~~~~d~~~e~~ 474 (474) T protein:vir:96 451 --FNKQLPPLEGDA------NGRAQDNESETN 474 (474) T ss_pred --HHhccccccccc------ccccCCCcccCC Confidence 111111111111 111111111111 No 23 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=100.00 E-value=8.1e-93 Score=525.47 Aligned_cols=457 Identities=22% Similarity=0.313 Sum_probs=367.0 Q ss_pred CCCc---------------cCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceecccccccccccc Q lcl|NC_013644. 1 MEAL---------------LSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYA 65 (510) Q Consensus 1 ~~~~---------------~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ 65 (510) |-++ +......+.++|.++|.+|. .+..+++++++||.|+|+|++++.... ........+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~--~~~~r~~~~~~Yy~g~~~i~~~~~~~~---~~~~~~~~~ 75 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHK--ENIDNITMGERYYNHHPDILDAPFKRD---VNGDYDETK 75 (478) T ss_pred CccccccCCchhhhHHHHHhhhccCChHHHHHHHHHHHH--HHHHHHHHHHHHhcccccccccchhhh---ccccccccc Confidence 4443 22333467788999999986 456789999999999999998765532 234456778 Q ss_pred ccceeccchhHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEE Q lcl|NC_013644. 66 SNVRIPHGFFPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYNSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQV 145 (510) Q Consensus 66 ~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~ 145 (510) +++||++||++.||++.++||||+||+|+++++++.+.|+++++|++++++.++++.++++|++|++||.|++|++++++ T Consensus 76 ~~~ki~~n~~k~ivd~~~~yl~g~p~~~~~~~~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~~~~ 155 (478) T protein:vir:10 76 PDWRMYTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQHTLNHKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFR 155 (478) T ss_pred ccceeccchHHHHHHHHhhhhcccCceeecCChHHHHHHHHHHhccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EcccceEEEEcCC--CCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccccccccc Q lcl|NC_013644. 146 ADSLNVFGVYNEY--NELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSE 223 (510) Q Consensus 146 ~~p~~~~~~~d~~--~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (510) ++|++++|+||+. +++.++++.|... ...++++|+++++++|+..++.......... .... ... T Consensus 156 ~~p~~~~~v~d~~~~~~~~~~ir~~~~~--------~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~-~~~~-----~~~ 221 (478) T protein:vir:10 156 VPAEQAVPIWTNKERDELQAFIRVYELD--------GAERVEYWTKDDVTFYELKEGQLIPDFYRSE-DHIQ-----PHY 221 (478) T ss_pred EcccceEEEEcCCCCCceEEEEEEEeee--------CceEEEEEeCCcEEEEEecCCeeeccccccc-cccc-----cce Confidence 9999999999864 5677777665432 1357899999999999987665432211111 1111 112 Q ss_pred ccccccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeee Q lcl|NC_013644. 224 NESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVG 303 (510) Q Consensus 224 ~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~ 303 (510) .....+|+||+||||+|+|++.|+|+|+++++|||+||.++|++++.++++++|+++++|+++++..++..+++..+++. T Consensus 222 ~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~ 301 (478) T protein:vir:10 222 YQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHNLKYYKAIS 301 (478) T ss_pred ecccccccCCcceEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCcccccchhhhhhhCceeE Confidence 23456899999999999999999999999999999999999999999999999999999999888888888888877776 Q ss_pred cc--CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 304 TG--SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWM 380 (510) Q Consensus 304 ~~--~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~ 380 (510) ++ ++++++|++++++.++++.++++|++.|+.+|++|++++.+ +||+||+||++++++|.+||..|++.|+++|+++ T Consensus 302 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~ 381 (478) T protein:vir:10 302 VAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQEL 381 (478) T ss_pred ecCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 63 56889999999999999999999999999999999988765 4789999999999999999999999999999999 Q ss_pred HHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHH Q lcl|NC_013644. 381 NKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWE 459 (510) Q Consensus 381 ~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~ 459 (510) +++|+.+++. .++..+++|+|++++|.|+++.+++++++ +|++|+||+++++|+++|+++ .++++++.+.. T Consensus 382 ~~li~~~~~~----~~d~~~i~i~f~~~~p~~~~e~~~~~~~~--~g~iS~et~i~~~~~v~d~~~E~~ri~~E~~~~-- 453 (478) T protein:vir:10 382 LQYIIDFYRL----DVRVQDIEITFNFNVMVNELENSQIAMNS--TGLLSKETILGNHSWVQDPVAEMERIEQENIEL-- 453 (478) T ss_pred HHHHHHHhCC----CcccccceEEeCCCCCCCHHHHHHHHHHH--hCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHH-- Confidence 9999998753 45677899999999999999999999866 689999999999999988642 33333332211 Q ss_pred HHHHHHHhhhccCCCCCCCCCcccCCCCCCccc Q lcl|NC_013644. 460 DVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQ 492 (510) Q Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (510) . ...+....+..+++.+.+ ++++.| T Consensus 454 -----~--~~~~~~~~~~~d~~~~~~-~d~~~e 478 (478) T protein:vir:10 454 -----N--QQLPDIEEGLNDEQQRQS-EDNQSE 478 (478) T ss_pred -----H--HhccccCCCCcccccccC-cCCCCC Confidence 1 111111122222121111 111111 No 24 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=100.00 E-value=1.6e-92 Score=523.80 Aligned_cols=455 Identities=20% Similarity=0.324 Sum_probs=368.0 Q ss_pred CCCc--cCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEAL--LSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~--~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) ..+. ...+.+.+.+.|.++|++|+ .++.++.++++||+|+|+|+.+++..... ......++++|+++||+++| T Consensus 12 ~~~~~~~~~~~~~~~~~i~~~i~~~~--~~~~~~~~~~~YY~g~~~i~~~~~~~~~~---~~~~~~~~~~ri~~n~~~~i 86 (472) T protein:vir:93 12 FDAIVRTNNKPETLEEMIVRYIKQHL--EKLPEISIGQEYYEQRPDIVKEPKPVDAT---GAVDPLKPDDRMITNFHANL 86 (472) T ss_pred hhceeeecCchhhHHHHHHHHHHHHH--HHHHHHHHHHHHhccccccccccchhhcc---ccccccccccccccchHHHH Confidence 2222 33444778899999999996 45568999999999999999887654322 22345678899999999999 Q ss_pred HHHHHhhhhcCCceeccCcHHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCC Q lcl|NC_013644. 79 VDQKTQYLLSNPVEYETENEELKEYLAEYYNSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEY 158 (510) Q Consensus 79 v~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~ 158 (510) |++.++||||+||+|+++++.+.++|++|++|+++.++.++++.++++|+||+++|.|++|++++++++|++++|+||++ T Consensus 87 vd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~d~~~~i~~~~p~~~~~i~d~~ 166 (472) T protein:vir:93 87 VDQKVSYIVGKPIAFKHTDDEVVKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDK 166 (472) T ss_pred HHHHhhhhcccCeeeccCChHHHHHHHHHHhccHHHHHHHHHHHHhhcCeEEEEEEECCCCceEEEEEcccceEEEEcCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999864 Q ss_pred --CCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCccc Q lcl|NC_013644. 159 --NELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIP 236 (510) Q Consensus 159 --~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP 236 (510) .++.+++++|... + ..++++|++.++++|....+......... .........+|+||+|| T Consensus 167 ~~~~~~~~ir~~~~~--~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~vP 228 (472) T protein:vir:93 167 EHEELEAFIRMYKLE--N------ETKVEYWDKVTVNYYVYENGSLIPDYSNN----------LENSKTHFSTGSWGKIP 228 (472) T ss_pred CCCceEEEEEEEEee--c------ceeEEEEecCeEEEEEEecCeeeeccccc----------ccccccccccCCCCCcc Confidence 4566666665432 1 23679999999999987765543322211 11223346789999999 Q ss_pred EEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeEEeec Q lcl|NC_013644. 237 FYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVT 316 (510) Q Consensus 237 vv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (510) ||+|+||++|+|+|+++++|||+||.++|++++.++++++|++|++|++..+..++...++..+++.++++++++|++++ T Consensus 229 vv~~~nn~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 308 (472) T protein:vir:93 229 FIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVE 308 (472) T ss_pred eEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcccchhhHHHHhhccccccCCCCcceeEeec Confidence 99999999999999999999999999999999999999999999999988887888888888889999999999999999 Q ss_pred CCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCc Q lcl|NC_013644. 317 IPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKA 395 (510) Q Consensus 317 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~ 395 (510) ++.++++.++++|++.|+.+|++|+++++. +||+||+||++++.+|.+||.++++.|+++|++++++|+.+++.. T Consensus 309 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~---- 384 (472) T protein:vir:93 309 VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK---- 384 (472) T ss_pred CCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC---- Confidence 999999999999999999999999988765 478999999999999999999999999999999999999988654 Q ss_pred cccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHHHHHHHHhhhccCCC Q lcl|NC_013644. 396 FDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWEDVKEALEEAEYTKGL 474 (510) Q Consensus 396 ~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~~~~~~~~~~~~~~~ 474 (510) .+..+++|+|++++|+|+++.+++++++ +|++|+||+++++|+++|+++ .++++++++.. ..... ... T Consensus 385 ~~~~~i~v~f~~~~p~~~~~~~~~~~k~--~giis~et~l~~l~~~~d~~~E~~ri~~E~~~~------~~~~~---~~~ 453 (472) T protein:vir:93 385 GEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEY------NKQLP---NLD 453 (472) T ss_pred cccceeeEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHH------HHhcc---CcC Confidence 3466899999999999999999999876 589999999999999988543 23333322211 11111 111 Q ss_pred CCCCCCcccCCCCCCcccc Q lcl|NC_013644. 475 SDNTDEEETAVNPDDPTQQ 493 (510) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~ 493 (510) +.+.+..+++.++++.+.| T Consensus 454 ~~~~d~~~~~~~~~~~~~e 472 (472) T protein:vir:93 454 DGGADGAQQQERSNNKESE 472 (472) T ss_pred cccCCCCCCCCCCCcccCC Confidence 1111111111111111111 No 25 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=100.00 E-value=1.5e-92 Score=523.92 Aligned_cols=454 Identities=22% Similarity=0.343 Sum_probs=366.6 Q ss_pred CCCc----------------cCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccc Q lcl|NC_013644. 1 MEAL----------------LSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKY 64 (510) Q Consensus 1 ~~~~----------------~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~ 64 (510) |..+ +....+++.+.|.++|.+|+ .+..++.++++||.|+|+|+++.+.... ....... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~--~~~~~~~~~~~Yy~g~~~i~~r~~~~~~---~~~~~~~ 75 (474) T protein:vir:95 1 MFNIIRMPWDKPYGEEVVEQLKPQFETQEEMIIRLIDDHR--KQLDKITVGQRYYDKDNDIVKQMKKVDV---YGNIDYD 75 (474) T ss_pred CcceeecCCCCchhhHHHHhhhhccCChHHHHHHHHHHHH--HHHHHHHHHHHHhcccCchhcccccccc---ccccccc Confidence 2222 23444677889999999986 4566799999999999999987654332 2233457 Q ss_pred cccceeccchhHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEE Q lcl|NC_013644. 65 ASNVRIPHGFFPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYNSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQ 144 (510) Q Consensus 65 ~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~ 144 (510) ++++||++||++.||++.++||||+||+|+++++++.++|+.|++|+++.++.++++.++++|+||+++|+|++|+++++ T Consensus 76 ~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~~i~ 155 (474) T protein:vir:95 76 KPDWRITTNFHQNLVDQKVSYVASKPVTYSCEDESVLKIIHDVLDTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLF 155 (474) T ss_pred cccceeccchHHHHHHHHHhhhccCCceeccCchHHHHHHHHHHhccHHHHHHHHHHHHhhcCcEEEEEEecCCCceEEE Confidence 78999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcccceEEEEcCC--CCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccc Q lcl|NC_013644. 145 VADSLNVFGVYNEY--NELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDS 222 (510) Q Consensus 145 ~~~p~~~~~~~d~~--~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 222 (510) +++|++++|+||+. +++.+++++|... ...++++|+++++++|+...+++...... .... T Consensus 156 ~~~p~~~~~v~d~~~~~~~~~~i~~~~~~--------~~~~~~~y~~~~~~~~~~~~~~~~~~~~~----------~~~~ 217 (474) T protein:vir:95 156 RVPAEQAIPIWVDKEREELKSFIRYYKFN--------NEEKVEFWTDTTVTYYVLENGGLIPDYYY----------GANH 217 (474) T ss_pred EEcccceEEEEcCCCCCceEEEEEEEEEc--------CeeEEEEEeCCeEEEEEEcCCcccccccc----------Cccc Confidence 99999999999975 4566666655322 13478999999999999887665332111 1112 Q ss_pred cccccccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeee Q lcl|NC_013644. 223 ENESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVV 302 (510) Q Consensus 223 ~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~ 302 (510) ......+|+||+||||+|+||+.|+|+|+++++|||+||.++|++++.++++++|+++++|+++++..++..+++..+++ T Consensus 218 ~~~~~~~~~~g~iPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i 297 (474) T protein:vir:95 218 IQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLEEFMRGLKYYKAI 297 (474) T ss_pred ccccccccCCCccceEeecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhcccee Confidence 23356789999999999999999999999999999999999999999999999999999999998888888888999999 Q ss_pred eccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 303 GTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMN 381 (510) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~ 381 (510) .++++++++|++++++.++++.++++|+++|+.+|++|++++.+ +||+||+||+++++++.+||.+|++.|+++|++++ T Consensus 298 ~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~ 377 (474) T protein:vir:95 298 NVDGDGGVETIQVEVPVSSTKEYIDLMRAYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELI 377 (474) T ss_pred eccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999987664 57899999999999999999999999999999999 Q ss_pred HHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHH Q lcl|NC_013644. 382 KLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWED 460 (510) Q Consensus 382 ~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~ 460 (510) ++|+.+++. ..+..+++|+|++++|.|+++.++++++ +|+||+||+++++|+++|+++ .++++++++... T Consensus 378 ~li~~~~g~----~~d~~~i~v~f~~~~p~d~~e~a~~~~~---~g~iS~et~i~~l~~v~d~~~E~~ri~~E~~~~~-- 448 (474) T protein:vir:95 378 GFIIDFNNL----KMDVKDIEISFNFNRMMNDAEQSQIIAQ---SQYLSRETLVKSSPLVDDYKAELERIEQEQMEYN-- 448 (474) T ss_pred HHHHHHhCC----CcccceeeEEeccCCCcCHHHHHHHHHh---cCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHH-- Confidence 999998754 3567889999999999999999998754 599999999999999988642 333332222111 Q ss_pred HHHHHHhhhccCCCCCCCCCcccCCCCCCccc Q lcl|NC_013644. 461 VKEALEEAEYTKGLSDNTDEEETAVNPDDPTQ 492 (510) Q Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (510) . ..+..... +.++.++.+.+++.+.+ T Consensus 449 -~----~~~~~~~~-~~d~~~~~~~~~~~~~~ 474 (474) T protein:vir:95 449 -K----QLPNLDDG-GADGAQQQERSNDKESE 474 (474) T ss_pred -h----cccccccc-cCCCCcCCCCCccCCCC Confidence 1 11111111 11111111111111111 No 26 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=100.00 E-value=3e-92 Score=522.32 Aligned_cols=457 Identities=21% Similarity=0.315 Sum_probs=364.5 Q ss_pred CCCcc---------------CCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceecccccccccccc Q lcl|NC_013644. 1 MEALL---------------SEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYA 65 (510) Q Consensus 1 ~~~~~---------------~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ 65 (510) |-+|. ........++|.++|.+|. .+..++.++++||+|+|+|+++++.. .+.......+ T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~--~~~~~~~~~~~yY~g~~~i~~~~~~~---~~~~~~~~~~ 75 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHK--ENIDNITMGERYYNHHPDILDAPPKR---DVNGDYDETK 75 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHH--HHHHHHHHHHHHhcCCCchhcccccc---cccccccccc Confidence 55551 1222456788999999886 35567999999999999998876542 2334445678 Q ss_pred ccceeccchhHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEE Q lcl|NC_013644. 66 SNVRIPHGFFPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYNSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQV 145 (510) Q Consensus 66 ~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~ 145 (510) +++|+++||+++||++.++||||+||+|+++++++.+.|+.+++|++.+++.+++++++++|+||+++|.|++|++++++ T Consensus 76 ~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~d~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~~g~~~~~~ 155 (478) T protein:vir:10 76 PDWRMYTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQHTLNHKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFR 155 (478) T ss_pred ccceeccchHHHHHHHHHhhhccCCeeeecCChHHHHHHHHHHhcCHHHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EcccceEEEEcCC--CCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccccccccc Q lcl|NC_013644. 146 ADSLNVFGVYNEY--NELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSE 223 (510) Q Consensus 146 ~~p~~~~~~~d~~--~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (510) ++|+++||+||++ .++.++++.|... ...++++|+++++++|+...+.......... ... .... T Consensus 156 ~~p~~~~~i~d~~~~~~~~~~v~~~~~~--------~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~-~~~-----~~~~ 221 (478) T protein:vir:10 156 VPAEQAVPIWTNKERDELQAFIRVYELD--------GAERVEYWTKDDVTYYELKEGQLIPDFYRSD-DHI-----QPHY 221 (478) T ss_pred EcccceEEEEcCCCCCceEEEEEEEEec--------CceEEEEEeCCeEEEEEEcCCeeeccccccc-ccc-----ccce Confidence 9999999999864 4566666665422 2357899999999999887655432211111 010 1112 Q ss_pred ccccccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeee Q lcl|NC_013644. 224 NESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVG 303 (510) Q Consensus 224 ~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~ 303 (510) .....+|+||+||||+|+|+++|+|+|+++++|||+||.++|++++.++++++|+++++|+++++..+...+++..+++. T Consensus 222 ~~~~~~~~~~~vPvv~~~n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~~~~~~~~~~~~~~~ 301 (478) T protein:vir:10 222 YQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHNLKYYKAIS 301 (478) T ss_pred ecccccccCCccceEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhhhhhhhcceEE Confidence 23456899999999999999999999999999999999999999999999999999999999888788888888887777 Q ss_pred cc--CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc-CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 304 TG--SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGD-GNITNIVIKARYTLLNMKANKTEARLRALLEWM 380 (510) Q Consensus 304 ~~--~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~ 380 (510) ++ ++++++|++++++.++++.++++|++.|+.+|++|++++.++ ||+||+||++++++|.+||+++++.|+++|+++ T Consensus 302 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~ 381 (478) T protein:vir:10 302 VAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQEL 381 (478) T ss_pred ecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 65 568899999999999999999999999999999999887664 789999999999999999999999999999999 Q ss_pred HHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHH Q lcl|NC_013644. 381 NKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWE 459 (510) Q Consensus 381 ~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~ 459 (510) +++|+.+++. .++..+++|+|++++|+|+++.+++++++ +|+||+||+++++|+++|+++ .++++++.+. T Consensus 382 ~~li~~~~g~----~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~--- 452 (478) T protein:vir:10 382 LQYIIDFYRL----DVKVQDIEITFNFNVMVNELENSQIAMNS--TGLLSKETILSNHAWVEDPVAEMERIEQENIE--- 452 (478) T ss_pred HHHHHHHhCC----CcccccceEEecCCCCCCHHHHHHHHHHH--hCCCChHHHHHhCCCCCCHHHHHHHHHHHHHH--- Confidence 9999988743 45677899999999999999999999876 788999999999999988643 2222222211 Q ss_pred HHHHHHHhhhccCCCCCCCCCcccCCCCCCccc Q lcl|NC_013644. 460 DVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQ 492 (510) Q Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (510) ..........+..++.+ ..+++++.| T Consensus 453 ---~~~~~~~~~~~~~~~~~----~~~~~~~~~ 478 (478) T protein:vir:10 453 ---LNQQLPDIEEGLNGEQQ----RQSENNQPE 478 (478) T ss_pred ---HHhhccccccccCCCCC----CCCCCCCCC Confidence 11111111111111111 111111111 No 27 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=100.00 E-value=7.7e-92 Score=520.11 Aligned_cols=437 Identities=15% Similarity=0.141 Sum_probs=355.5 Q ss_pred CCC------ccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccch Q lcl|NC_013644. 1 MEA------LLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGF 74 (510) Q Consensus 1 ~~~------~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~ 74 (510) |.- .+..+.+++.+.|.++|++|+. +..++.++++||.|+|+|++++. +...++++|+++|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~--~~~r~~~~~~Yy~g~~~i~~~~~----------~~~~~~~~ki~~n~ 68 (452) T protein:vir:36 1 MKYKPPKLMTFSKDEPITVEVVTKFMEKHKL--EVARYEYLKNMYLGIMAIDDEPA----------KDSWKPDNRLAVNF 68 (452) T ss_pred CcccCceeEEcCCccCCCHHHHHHHHHHHHH--HHHHHHHHHHHhccccccccCcc----------ccccCccceeecch Confidence 321 2555677788999999999964 44579999999999999988754 24557889999999 Q ss_pred hHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEE Q lcl|NC_013644. 75 FPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFG 153 (510) Q Consensus 75 ~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~ 153 (510) +++||++.++||||+||+|+++++++.+.|+++|+ |+++.++.++++.++++|+||+++|.|++|++++++++|.+++| T Consensus 69 ~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~ 148 (452) T protein:vir:36 69 TKYIVDTFTGYFNGIPVKKSHSDKEILTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDEDTQTNVVYNSPENMFM 148 (452) T ss_pred HHHHHHHHhhhhcccCceeecCChhHHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEE Confidence 99999999999999999999999999999999996 89999999999999999999999999999999999999999999 Q ss_pred EEcCCCC--ceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccccccccccccccccc Q lcl|NC_013644. 154 VYNEYNE--LQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRS 231 (510) Q Consensus 154 ~~d~~~~--~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (510) +||++.. +.+++++| ...+ ...++++||++++++|+...+++... ...+|+ T Consensus 149 v~d~~~~~~~~~~i~~~--~~~~-----~~~~~~vyt~~~i~~~~~~~~~~~~~--------------------~~~~~~ 201 (452) T protein:vir:36 149 VYDDTVKQEPLFAVRYG--VDED-----KKLQGEVYTLLETIKISGENDEISFG--------------------EGTYNP 201 (452) T ss_pred EEcCCCCCceEEEEEEE--EecC-----ceEEEEEEecCeEEEEEEcCCceEEe--------------------cceecc Confidence 9998654 44444433 3222 24678999999999998776654432 245799 Q ss_pred CCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCC---- Q lcl|NC_013644. 232 YGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSD---- 307 (510) Q Consensus 232 ~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~---- 307 (510) ||+||||+|+|+++|+|+|+++++|||+||+++|++++.++++++|+++++|..... +...+++.++++.+..+ T Consensus 202 ~g~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g~~~~~--~~~~~~~~~~~~~~~~~~~~~ 279 (452) T protein:vir:36 202 YPDLPVVEFYFNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEE--EDLKNIRSNRVINYYADGEGK 279 (452) T ss_pred CCcccEEEecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcCc--hhhhhhhhcceEEecCCCCcc Confidence 999999999999999999999999999999999999999999999999999986543 44455566666666543 Q ss_pred -CceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 308 -GGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVID 386 (510) Q Consensus 308 -~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~ 386 (510) ++|+|++++.+.++++.++++|.++|+.+|++|++++..+||+||+||++++++|++||.++++.|+.+|++++++|+. T Consensus 280 ~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~ 359 (452) T protein:vir:36 280 NVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCE 359 (452) T ss_pred CCcceeEeecCCHHHHHHHHHHHHHHHHHHhCccccCcccccCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3699999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 387 DINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWEDVKEAL 465 (510) Q Consensus 387 ~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~~~~~~ 465 (510) +++..+. ..+..+|+|+|++++|.|+++.+++++++ +|+||+||+++++|+++|+++ .+++++++++. .+ T Consensus 360 ~~~~~~~-~~~~~~i~i~f~~~~p~d~~~~a~~~~k~--~g~iS~et~~~~~~~~~d~~~E~~ri~~E~~~~---~~--- 430 (452) T protein:vir:36 360 LSTNVSN-KDSWKDIEYTFTRNEPKDIKEQAETANIL--MGITSQETALSVISVIPDVQAEMEKIKKEEAST---AI--- 430 (452) T ss_pred HHhccCC-ccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHH---HH--- Confidence 9987754 45778899999999999999999999876 688999999999999987542 33333222211 11 Q ss_pred HhhhccCCCCCCCCCcccCCCCCCcccc Q lcl|NC_013644. 466 EEAEYTKGLSDNTDEEETAVNPDDPTQQ 493 (510) Q Consensus 466 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (510) .... ...+..+.++..+.+. +| T Consensus 431 -~~~~-~~~~~~~~~~~~~~~~----~e 452 (452) T protein:vir:36 431 -FDKD-KQPSEKGTDTVVSETN----EE 452 (452) T ss_pred -HHhh-ccCCCCcccccCcccc----CC Confidence 1111 1111111111111111 11 No 28 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=100.00 E-value=3.9e-91 Score=516.26 Aligned_cols=439 Identities=14% Similarity=0.130 Sum_probs=353.8 Q ss_pred CCC------ccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccch Q lcl|NC_013644. 1 MEA------LLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGF 74 (510) Q Consensus 1 ~~~------~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~ 74 (510) |.- .+..+.+++.++|.++|.+|.. +..+++++++||+|+|+|++++. +...++++|+++|| T Consensus 1 ~~~~~~~~~~~p~d~~~~~~~l~~~i~~~~~--~~~r~~~~~~yy~g~~~i~~~~~----------~~~~~~~~ki~~n~ 68 (453) T protein:vir:39 1 MKYKPPKLMTFPKDEPITNEVVTKFMEKHRL--EVARYEYLKNMYRGIMAIDAEPT----------KDLWKPDNRLTVNF 68 (453) T ss_pred CeecCCcceEcCCCCCCCHHHHHHHHHHHHH--HHHHHHHHHHHhhccCchhcCCC----------ccccCccceeecch Confidence 321 2556677888999999999864 45579999999999999988753 34567889999999 Q ss_pred hHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEE Q lcl|NC_013644. 75 FPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFG 153 (510) Q Consensus 75 ~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~ 153 (510) +++||++.++||||+||+|++++++..+.|+++|+ |+++..+.++++.++++|+||++||.|++|++++++++|++++| T Consensus 69 ~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~ 148 (453) T protein:vir:39 69 TKYIVDTFTGYFNGIPVKKSHSDKETLSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNEETQTNVIYNTPENMFM 148 (453) T ss_pred HHHHHHHHhhhhcccCceeccCChHHHHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEE Confidence 99999999999999999999999999999999995 78999999999999999999999999999999999999999999 Q ss_pred EEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCC Q lcl|NC_013644. 154 VYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYG 233 (510) Q Consensus 154 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 233 (510) +||+.......+.++++... ....++++|+++++++|....+.+.. ....+|+|| T Consensus 149 v~d~~~~~~~~~~ir~~~~~-----~~~~~~~~yt~~~i~~~~~~~~~~~~--------------------~~~~~~~~g 203 (453) T protein:vir:39 149 VYDDTIKQEPLFAVRYGYDD-----DYKLYGEVYTKETTYALNGTMGFYNM--------------------TEQAPNPFD 203 (453) T ss_pred EecCCCCCeEEEEEEEEEeC-----CeEEEEEEEeCCeEEEEEecCCceee--------------------ecccccCCC Confidence 99976543333333322221 23578999999999999877655432 234589999 Q ss_pred cccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeec------cCC Q lcl|NC_013644. 234 QIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGT------GSD 307 (510) Q Consensus 234 ~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~------~~~ 307 (510) .||||+|+|+++|+|+|+++++|||+||+++|++++.++++++|+++++|.+.++ +...+++..+++.+ +++ T Consensus 204 ~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 281 (453) T protein:vir:39 204 DLPVVEFYFNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEE--EDLKNIRSNRVINYYGESSEAKN 281 (453) T ss_pred ceeEEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCCCc--hhhhhhhhcceeeecCCCCCCCC Confidence 9999999999999999999999999999999999999999999999999976543 22344455555543 357 Q ss_pred CceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 308 GGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDD 387 (510) Q Consensus 308 ~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~ 387 (510) ++++|++++.+.++++.++++|++.||.+|++|++++..+||+||+||++++++|.+||.++++.|+.+|++++++|+.+ T Consensus 282 ~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~ 361 (453) T protein:vir:39 282 VDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCEL 361 (453) T ss_pred CceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 78999999999999999999999999999999999998899999999999999999999999999999999999999998 Q ss_pred HhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 388 INRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWEDVKEALE 466 (510) Q Consensus 388 ~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~~~~~~~ 466 (510) ++..+. ..+..+|+|+|++++|.|+++.+++++++ +|+||+||+++++|+++|+++ .++++++++... .... T Consensus 362 ~~~~~~-~~~~~~i~v~f~~~~p~~~~~~a~~~~kl--~g~is~et~l~~l~~v~D~~~E~~ri~~E~~~~~---~~~~- 434 (453) T protein:vir:39 362 STNVSN-KEAWKDIEYTFTRNEPKDIKEQAETANIL--MGITSQETALSVISVIPDVQAEMEKIKKEEASTA---IFDK- 434 (453) T ss_pred HhccCC-ccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHH---HHHH- Confidence 887654 45778999999999999999999999876 688999999999999988543 222222222111 1111 Q ss_pred hhhccCCCCCCCCCcccCCCCCCccccc Q lcl|NC_013644. 467 EAEYTKGLSDNTDEEETAVNPDDPTQQM 494 (510) Q Consensus 467 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (510) ...+ +..+.++....++ ++ T Consensus 435 -~~~~---~~~~~~~~~~~~~-----~e 453 (453) T protein:vir:39 435 -DKQP---SEKGTDTVVPETN-----EE 453 (453) T ss_pred -hccC---CCCCCCCCCCCcC-----CC Confidence 1111 1111111111111 11 No 29 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=100.00 E-value=5.4e-91 Score=515.46 Aligned_cols=423 Identities=14% Similarity=0.186 Sum_probs=351.3 Q ss_pred hhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCC Q lcl|NC_013644. 11 IIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNP 90 (510) Q Consensus 11 ~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p 90 (510) ++.++|.++|++|.. +..++.++++||+|+|+|+++.. +...++++|+++||+++||++.++||||+| T Consensus 1 l~~~~l~~~i~~~~~--~~~r~~~l~~yy~g~~~il~~~~----------~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~ 68 (429) T protein:vir:98 1 MTKDLLSELIQKHRS--FNLSYSAYKQLYEGDHAILQQKQ----------KEQYKPDNRLVVNFAKYIVDTFNGYFIGVP 68 (429) T ss_pred CCHHHHHHHHHHHHH--HHHHHHHHHHHhccccccccccc----------cccCCCcceeecchHHHHHHHHhhhhcccC Confidence 889999999999964 45689999999999999987654 245678899999999999999999999999 Q ss_pred ceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCCCceeEEEEEE Q lcl|NC_013644. 91 VEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYNELQRICRHYI 169 (510) Q Consensus 91 ~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~ 169 (510) |+|+++++.+.+.|+++|+ |+++..+.+++++++++|+||+++|.|++|++++++++|.+++|+||+.......+.+++ T Consensus 69 ~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~~~~~~p~~~~~v~dd~~~~~~~~~i~~ 148 (429) T protein:vir:98 69 VQTSHENKQVSNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDENAEAGITYLTPLEAFIVYDDSIRQKPLFAVRY 148 (429) T ss_pred ceeecCChHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCcEEEEEEcccceEEEEeCCCCCceEEEEEE Confidence 9999999999999999995 789999999999999999999999999999999999999999999997655334433333 Q ss_pred EEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCCc Q lcl|NC_013644. 170 TEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTD 249 (510) Q Consensus 170 ~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd 249 (510) +...+ ...++++|+.+.+++|....+++.. ....+|+||+||||+|+|+++|+|+ T Consensus 149 ~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~--------------------~~~~~~~~g~vPvv~~~n~~~g~sd 203 (429) T protein:vir:98 149 FYNKG-----GVLEGSYSDASNITYFKDGEKGIEI--------------------GESEPHPFDGVPMIEYVENEERQSL 203 (429) T ss_pred EEecC-----ceEEEEEEeCceEEEEEecCCceEe--------------------cccccccCCccceEEecCCCCCCCc Confidence 32221 3567789999999998876554432 2345899999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccC----CCceeEEeecCCHHHHHHH Q lcl|NC_013644. 250 LKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGS----DGGLDVKTVTIPTEGRKTK 325 (510) Q Consensus 250 ~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~ 325 (510) |+++++|||+||.++|++++.++++++|+++++|.++.+ +...+++..+++.+++ +++++|++++.+.++++.+ T Consensus 204 ~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 281 (429) T protein:vir:98 204 LASVVTLINAFNKAISEKANDVEYFADAYLKILGAELDD--ETLKSLRDTRIINLKDTDAQQLTVEFLQKPDADATQEHL 281 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCCCc--chhhhHhhCceeeccCCCCCCcceeEEeecCCHHHHHHH Confidence 999999999999999999999999999999999987653 4455667777777653 3479999999999999999 Q ss_pred HHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEe Q lcl|NC_013644. 326 MEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTF 405 (510) Q Consensus 326 ~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f 405 (510) +++|.+.|+.+|++|++++.++||+||+||+++++++.+|+.++++.|+++|++++++|+.+++..+. ..+..+++|+| T Consensus 282 ~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~-~~d~~~i~v~f 360 (429) T protein:vir:98 282 LDRLENLIFRTAMVANISDESFGTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKIG-PKDWIGIKYKF 360 (429) T ss_pred HHHHHHHHHHHhCccccCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC-ccccccceEEe Confidence 99999999999999999998899999999999999999999999999999999999999999887654 46778899999 Q ss_pred CCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccC Q lcl|NC_013644. 406 TREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETA 484 (510) Q Consensus 406 ~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (510) ++++|.|+++.+++++++ +|++|+||+++++|+++|+++ .++++++++.... .. ..+... +++. T Consensus 361 ~~~~p~~~~~~a~~~~kl--~g~is~et~~~~l~~v~d~~~E~~ri~~E~~~~~~---~~------~~~~~~----~~~~ 425 (429) T protein:vir:98 361 TRNLPANLLEESQIAGNL--AGIVSEETQVGVLSIVENPQKEIERKNSDKSTLIS---RQ------AGGLNG----QNTT 425 (429) T ss_pred CCCCCcCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHH---HH------HhhhcC----CCCC Confidence 999999999999999877 688999999999999988643 3333333322111 10 011011 0111 Q ss_pred CCCC Q lcl|NC_013644. 485 VNPD 488 (510) Q Consensus 485 ~~~~ 488 (510) ++.+ T Consensus 426 ~~~~ 429 (429) T protein:vir:98 426 TILE 429 (429) T ss_pred CCCC Confidence 1111 No 30 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=100.00 E-value=7.4e-91 Score=514.71 Aligned_cols=447 Identities=23% Similarity=0.348 Sum_probs=362.6 Q ss_pred CCCc---------------cCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceecccccccccccc Q lcl|NC_013644. 1 MEAL---------------LSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYA 65 (510) Q Consensus 1 ~~~~---------------~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ 65 (510) |-.+ +......+.+.|.++|.+|.. +..++.++++||.|+|+|++++..... .......+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~--~~~~~~~~~~yY~g~~~i~~~~~~~~~---~~~~~~~~ 75 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQYETQEEMILRLITKHKE--NVEDITVGERYYNHQPDVLFNAPKRNV---KGEIDPFK 75 (468) T ss_pred CccccCCcCceeehheeecccccccCcHHHHHHHHHHHHH--HHHHHHHHHHHhcCCCccccccccccc---cccccccc Confidence 3333 123336678889999999864 456799999999999999888765322 23345667 Q ss_pred ccceeccchhHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEE Q lcl|NC_013644. 66 SNVRIPHGFFPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYNSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQV 145 (510) Q Consensus 66 ~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~ 145 (510) +++|+++||++.||++.++||||+||+|+++++++.+.|+++++|++++.+.++++.++++|++|++||.|++|++++++ T Consensus 76 ~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~ 155 (468) T protein:vir:96 76 PDWRMYTNYHQNLVDQKVAYAVANPVTYGTEDEKSLKTIQEVLNHKWDDKLVDILTAASNKGVEWIQPYVDEQGEFKTFR 155 (468) T ss_pred cccccccchHHHHHHHHHhhhccCCceeccCChHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEE Confidence 88999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EcccceEEEEcCC--CCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccccccccc Q lcl|NC_013644. 146 ADSLNVFGVYNEY--NELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSE 223 (510) Q Consensus 146 ~~p~~~~~~~d~~--~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (510) ++|+++||+||+. .++.+++++|... ...++++|+++++++|+..++........... . ..... T Consensus 156 ~~p~~~~~v~~~~~~~~~~~~ir~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-----~~~~~ 221 (468) T protein:vir:96 156 VPAEQAIPIWTNKERDELKAFIRLYELD--------GGERVEYWTANDVTFYELKDGQLIPDYYQGEE-H-----VQAHY 221 (468) T ss_pred EcccceEEEEcCCCCCceEEEEEEEEec--------CceEEEEEeCCeEEEEEEcCCceeeccccccc-c-----cccce Confidence 9999999999864 4566666655322 13468999999999999887654432221111 1 11123 Q ss_pred ccccccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeee Q lcl|NC_013644. 224 NESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVG 303 (510) Q Consensus 224 ~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~ 303 (510) .....+|+||+||||+|+|++.|.|+|+++++|||+||.++|++++.++++++|+++++|+++++...+...++.++++. T Consensus 222 ~~~~~~~~~~~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~ 301 (468) T protein:vir:96 222 YVGNKSMSWNRVPFIPFKNNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEGEDLEEFMYNLKYYKAIN 301 (468) T ss_pred eeccccccCCcccEEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccccchhhhhhhcCceEE Confidence 34556899999999999999999999999999999999999999999999999999999999888888888888888887 Q ss_pred cc--CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 304 TG--SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWM 380 (510) Q Consensus 304 ~~--~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~ 380 (510) ++ ++++++|++++++.++++.++++|+++||.+|++|++++.+ +||+||+|++++++++++||.+|++.|+++|+++ T Consensus 302 ~~~d~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~ 381 (468) T protein:vir:96 302 VDGDGSGGVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQEL 381 (468) T ss_pred ecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 75 45779999999999999999999999999999999988765 4689999999999999999999999999999999 Q ss_pred HHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHH Q lcl|NC_013644. 381 NKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWE 459 (510) Q Consensus 381 ~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~ 459 (510) +++|+.+++. ..+..+++|+|++++|.|+++.|++++ .+|+||+||+++++|+++|+++ .++++++++.. T Consensus 382 ~~li~~~~g~----~~d~~~i~i~f~~~~p~d~~e~a~~~~---~~g~iS~et~i~~l~~v~D~~~E~~ri~~E~~~~-- 452 (468) T protein:vir:96 382 LQYIIDFYKL----SIKVQDVEITFNFNVMVNELEQSQIGV---NSQYLSKETVVTNHPWVDDPVAEMERIDQEELAL-- 452 (468) T ss_pred HHHHHHHhCC----CcccceeeEEecCCCCcCHHHHHHHHH---hcCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHH-- Confidence 9999998753 456778999999999999999998764 4699999999999999988642 23332222211 Q ss_pred HHHHHHHhhhccCCCCCCCCCccc Q lcl|NC_013644. 460 DVKEALEEAEYTKGLSDNTDEEET 483 (510) Q Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~ 483 (510) ...+. .+.++.+++.+ T Consensus 453 ----~~~~~----~~~~~~~~~~~ 468 (468) T protein:vir:96 453 ----PSIEE----GLNGKENNEPT 468 (468) T ss_pred ----HHHhh----ccCCCCCCCCC Confidence 11111 11222221221 No 31 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=100.00 E-value=1.3e-90 Score=513.41 Aligned_cols=446 Identities=14% Similarity=0.183 Sum_probs=357.0 Q ss_pred CCCc--------------cCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccc Q lcl|NC_013644. 1 MEAL--------------LSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYAS 66 (510) Q Consensus 1 ~~~~--------------~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~ 66 (510) |+++ ...+.+++.+.|.++|++|+... +.+++++++||.|+|+|+++.. ...++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~-~~~~~~l~~Yy~g~~~i~~~~~-----------~~~~~ 68 (470) T protein:vir:99 1 MKDINYGRDKVTGNSSFIFPKGEKLTSNELLGFIAYNETVL-KPRYRENMKLYLGKHKILTAPE-----------KETGA 68 (470) T ss_pred CccccCCcccccCCceEEeCCCCCcCHHHHHHHHHHHHHhh-HHHHHHHHHHhccccccccCcc-----------cccCC Confidence 7776 34566888899999999997443 4469999999999999987643 34578 Q ss_pred cceeccchhHHHHHHHHhhhhcCCceeccCcH-HHHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEE Q lcl|NC_013644. 67 NVRIPHGFFPEIVDQKTQYLLSNPVEYETENE-ELKEYLAEYY-NSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQ 144 (510) Q Consensus 67 ~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~-~~~~~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~ 144 (510) ++|+++||+++||++.++||+|+||+|+++++ +..+.|.++| +|+++.++.++++.++++|++|+++|.|++|+++++ T Consensus 69 ~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~ 148 (470) T protein:vir:99 69 DNRIVVNSAKYVVDVYNGYFCGIEPKLALLNDSSKIDEIARWNRQENFFDTINEISKQCDIFGRSIASIYQGEDARPHLM 148 (470) T ss_pred cceeecchHHHHHHHHhhhhccCCeeEeeCCchhHHHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeEEEE Confidence 89999999999999999999999999988654 5667788887 588999999999999999999999999999999999 Q ss_pred EEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccc Q lcl|NC_013644. 145 VADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSEN 224 (510) Q Consensus 145 ~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (510) +++|.+++|+||+.......+.++++.... ......++++|+++.+++|.....++. ... T Consensus 149 ~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~------------------~~~ 208 (470) T protein:vir:99 149 YSSPNHAFIIYDDTVQRQPLAFVHYQIDNS--NNWTDAYGVIQYADKFYKFKGYDIEED------------------TNA 208 (470) T ss_pred EEccceeEEEEcCCCCcceEEEEEEEEEec--CCeeEEEEEEEecCeEEEEEecccccc------------------ccc Confidence 999999999999876544433333333332 234567889999999998886654321 122 Q ss_pred cccccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCc--hhhhhHhhhcCeee Q lcl|NC_013644. 225 ESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDD--LSKLRQNVKSKKVV 302 (510) Q Consensus 225 ~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~--~~~~~~~~~~~~~~ 302 (510) ....+|+||+||||+|+|+++|+|+|+++++|||+||.++|++++.++++++|++|++|+...+ .++....+...+++ T Consensus 209 ~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~g~~~~~~~~~~~~ 288 (470) T protein:vir:99 209 AGYAINPYGLVPAVEFFENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLPEDDEGNPKFDFKNNRVL 288 (470) T ss_pred ccccccCCCccceEeecCCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccccccchhhhhhhccee Confidence 2456899999999999999999999999999999999999999999999999999999986543 23445556666665 Q ss_pred ec-----cCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 303 GT-----GSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRAL 376 (510) Q Consensus 303 ~~-----~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~ 376 (510) .+ +++++++|++++.+.++++.++++|++.|+.+|++|++++.+ +||+||+||++++++|.+||..+++.|+++ T Consensus 289 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~ 368 (470) T protein:vir:99 289 YVSQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKNFAGNSSGVALQYKLFAMKNKADSKERKFDKS 368 (470) T ss_pred eecCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 54 356789999999999999999999999999999999987766 478999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHH Q lcl|NC_013644. 377 LEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDL 456 (510) Q Consensus 377 l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~ 456 (510) |++++++|+.+++..+...++..+++++|++++|.|+++.++++++++ |+||+||+++++|+++++++.++++++++. T Consensus 369 l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl~--giis~et~l~~l~~vd~~~E~eri~~E~~~ 446 (470) T protein:vir:99 369 LMQLYRIVLATLFNNKQDQELWSELDFKFTRNLPEDMASAIDNAKNAE--GIVSKKTQLGMIPDIEPDAEMKQIAKEKAD 446 (470) T ss_pred HHHHHHHHHHHHhccCCcccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHHHhCCCCCHHHHHHHHHHHHHH Confidence 999999999999988888888889999999999999999999999774 889999999999999755444444443332 Q ss_pred HHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCccc Q lcl|NC_013644. 457 DWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQ 492 (510) Q Consensus 457 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (510) .... .. ..... .+..+.++++++ + T Consensus 447 ~~~~---~~---~~~~~----~d~~~~d~~~ee--~ 470 (470) T protein:vir:99 447 AIKQ---TQ---QLSMP----IDILKRDNNAEE--E 470 (470) T ss_pred HHHH---HH---hhcCC----CCcCCCCCCccC--C Confidence 2111 11 11111 111111111111 1 No 32 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=100.00 E-value=5.2e-91 Score=515.53 Aligned_cols=452 Identities=18% Similarity=0.241 Sum_probs=359.1 Q ss_pred CCCc-cCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCc---chhcccc----eeccccccccccccccceecc Q lcl|NC_013644. 1 MEAL-LSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHEND---IMNNRIF----YVDDEGILREDKYASNVRIPH 72 (510) Q Consensus 1 ~~~~-~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~---i~~~~~~----~~~~~~~~~~~~~~~~~ki~~ 72 (510) |+.+ .-++.+++++.|.++|++|.. .++++..+.+||+|.++ +..++.. .+...+...+...++++||++ T Consensus 4 ~~~~~~~~~~~~~~e~i~~~i~~~~~--~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~ 81 (474) T protein:vir:94 4 YKLIDDIEAQGILPKHIEALIESHKD--DRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNN 81 (474) T ss_pred HHHHhhccccCCCHHHHHHHHHHhhh--hhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCccccccc Confidence 3333 223346688899999999853 56678999999999765 3333322 122233445667789999999 Q ss_pred chhHHHHHHHHhhhhcCCceeccC-----cHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEE Q lcl|NC_013644. 73 GFFPEIVDQKTQYLLSNPVEYETE-----NEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVA 146 (510) Q Consensus 73 n~~~~Iv~~~~~~l~g~p~~~~~~-----d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~ 146 (510) ||+++||++.++||||+||+|+++ ++.+.+.|++||+ |+++.++.+++++++++|+||+++|.|++|+++++++ T Consensus 82 n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~~~~i 161 (474) T protein:vir:94 82 SFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDIRIKNI 161 (474) T ss_pred chHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCeeEEEEE Confidence 999999999999999999999874 4677889999985 7899999999999999999999999999999999999 Q ss_pred cccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccc Q lcl|NC_013644. 147 DSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENES 226 (510) Q Consensus 147 ~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (510) +|+++||+||+++++..++++|+..... ....++++++||+..+++|...+.+.+ .... T Consensus 162 ~p~~~~~v~d~~~~~~~~i~~~~~~~~~--~~~~~~~~~~y~~~~~~~~~~~~~~~~-------------------~~~~ 220 (474) T protein:vir:94 162 DPYNVIFVGDNILEPTYSLRYFYEKDDD--NGTDYVYAEFYDNAYYYVFRGEGIDAL-------------------QEVG 220 (474) T ss_pred cccceEEEEcCCCceEEEEEEEEEeeCC--CceEEEEEEEEcCceEEEEeecCCCcc-------------------cccc Confidence 9999999999988888888887665433 345677899999999999987643321 1223 Q ss_pred cccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeee-ecc Q lcl|NC_013644. 227 LLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVV-GTG 305 (510) Q Consensus 227 ~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~-~~~ 305 (510) ..+|+||+||||+|+|+++|+|+|+++++|||+||.++|++++.++++++|+++++|+++.+ +....++..+++ ..+ T Consensus 221 ~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~~--~~~~~~~~~~~i~~~~ 298 (474) T protein:vir:94 221 RYEHLFDYNPLFGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMSE--EMIQETQKSGAFELFD 298 (474) T ss_pred cccCCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCc--hhhhhhhhcceeEecC Confidence 46899999999999999999999999999999999999999999999999999999987764 333444444444 457 Q ss_pred CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 306 SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLV 384 (510) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i 384 (510) ++++++|++++.+.++++.++++|+++|+.+|++|++++.+ +||+||+||+++++++++||..+++.|+++|++++++| T Consensus 299 ~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li 378 (474) T protein:vir:94 299 KDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVI 378 (474) T ss_pred CCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 88999999999999999999999999999999999988765 47899999999999999999999999999999999999 Q ss_pred HHHHhhccC--CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHHH Q lcl|NC_013644. 385 IDDINRRYT--KAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWEDV 461 (510) Q Consensus 385 ~~~~~~~~~--~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~~ 461 (510) +.+++..+. .+.++.+++++|++++|.|+++.+++++++ +|++|+||+++++|+++|++. .++++++++... T Consensus 379 ~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~--- 453 (474) T protein:vir:94 379 LSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINL--KGQVSERTRLGQSQLVDDVDYELDEMEKESLEFN--- 453 (474) T ss_pred HHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHH--- Confidence 999988654 345667899999999999999999999877 589999999999999988642 233322222111 Q ss_pred HHHHHhhhccCCCCCCCCCcccCCCCCC Q lcl|NC_013644. 462 KEALEEAEYTKGLSDNTDEEETAVNPDD 489 (510) Q Consensus 462 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (510) ...+ +...++.+++++.+..+ T Consensus 454 ----~~~~---~~~~~~~~~~~~~~~s~ 474 (474) T protein:vir:94 454 ----DKLP---DIDEGDANDKSQNNQSE 474 (474) T ss_pred ----hhcc---cccCCCcCCCCccccCC Confidence 1111 11111111111111111 No 33 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=100.00 E-value=5.2e-91 Score=515.53 Aligned_cols=452 Identities=18% Similarity=0.241 Sum_probs=359.1 Q ss_pred CCCc-cCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCc---chhcccc----eeccccccccccccccceecc Q lcl|NC_013644. 1 MEAL-LSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHEND---IMNNRIF----YVDDEGILREDKYASNVRIPH 72 (510) Q Consensus 1 ~~~~-~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~---i~~~~~~----~~~~~~~~~~~~~~~~~ki~~ 72 (510) |+.+ .-++.+++++.|.++|++|.. .++++..+.+||+|.++ +..++.. .+...+...+...++++||++ T Consensus 4 ~~~~~~~~~~~~~~e~i~~~i~~~~~--~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~ 81 (474) T protein:vir:10 4 YKLIDDIEAQGILPKHIEALIESHKD--DRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNN 81 (474) T ss_pred HHHHhhccccCCCHHHHHHHHHHhhh--hhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCccccccc Confidence 3333 223346688899999999853 56678999999999765 3333322 122233445667789999999 Q ss_pred chhHHHHHHHHhhhhcCCceeccC-----cHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEE Q lcl|NC_013644. 73 GFFPEIVDQKTQYLLSNPVEYETE-----NEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVA 146 (510) Q Consensus 73 n~~~~Iv~~~~~~l~g~p~~~~~~-----d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~ 146 (510) ||+++||++.++||||+||+|+++ ++.+.+.|++||+ |+++.++.+++++++++|+||+++|.|++|+++++++ T Consensus 82 n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~~~~i 161 (474) T protein:vir:10 82 SFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDIRIKNI 161 (474) T ss_pred chHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCeeEEEEE Confidence 999999999999999999999874 4677889999985 7899999999999999999999999999999999999 Q ss_pred cccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccc Q lcl|NC_013644. 147 DSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENES 226 (510) Q Consensus 147 ~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (510) +|+++||+||+++++..++++|+..... ....++++++||+..+++|...+.+.+ .... T Consensus 162 ~p~~~~~v~d~~~~~~~~i~~~~~~~~~--~~~~~~~~~~y~~~~~~~~~~~~~~~~-------------------~~~~ 220 (474) T protein:vir:10 162 DPYNVIFVGDNILEPTYSLRYFYEKDDD--NGTDYVYAEFYDNAYYYVFRGEGIDAL-------------------QEVG 220 (474) T ss_pred cccceEEEEcCCCceEEEEEEEEEeeCC--CceEEEEEEEEcCceEEEEeecCCCcc-------------------cccc Confidence 9999999999988888888887665433 345677899999999999987643321 1223 Q ss_pred cccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeee-ecc Q lcl|NC_013644. 227 LLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVV-GTG 305 (510) Q Consensus 227 ~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~-~~~ 305 (510) ..+|+||+||||+|+|+++|+|+|+++++|||+||.++|++++.++++++|+++++|+++.+ +....++..+++ ..+ T Consensus 221 ~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~~--~~~~~~~~~~~i~~~~ 298 (474) T protein:vir:10 221 RYEHLFDYNPLFGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMSE--EMIQETQKSGAFELFD 298 (474) T ss_pred cccCCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCc--hhhhhhhhcceeEecC Confidence 46899999999999999999999999999999999999999999999999999999987764 333444444444 457 Q ss_pred CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 306 SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLV 384 (510) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i 384 (510) ++++++|++++.+.++++.++++|+++|+.+|++|++++.+ +||+||+||+++++++++||..+++.|+++|++++++| T Consensus 299 ~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li 378 (474) T protein:vir:10 299 KDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVI 378 (474) T ss_pred CCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 88999999999999999999999999999999999988765 47899999999999999999999999999999999999 Q ss_pred HHHHhhccC--CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHHH Q lcl|NC_013644. 385 IDDINRRYT--KAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWEDV 461 (510) Q Consensus 385 ~~~~~~~~~--~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~~ 461 (510) +.+++..+. .+.++.+++++|++++|.|+++.+++++++ +|++|+||+++++|+++|++. .++++++++... T Consensus 379 ~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~--- 453 (474) T protein:vir:10 379 LSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINL--KGQVSERTRLGQSQLVDDVDYELDEMEKESLEFN--- 453 (474) T ss_pred HHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHH--- Confidence 999988654 345667899999999999999999999877 589999999999999988642 233322222111 Q ss_pred HHHHHhhhccCCCCCCCCCcccCCCCCC Q lcl|NC_013644. 462 KEALEEAEYTKGLSDNTDEEETAVNPDD 489 (510) Q Consensus 462 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (510) ...+ +...++.+++++.+..+ T Consensus 454 ----~~~~---~~~~~~~~~~~~~~~s~ 474 (474) T protein:vir:10 454 ----DKLP---DIDEGDANDKSQNNQSE 474 (474) T ss_pred ----hhcc---cccCCCcCCCCccccCC Confidence 1111 11111111111111111 No 34 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=100.00 E-value=1.8e-90 Score=512.58 Aligned_cols=451 Identities=19% Similarity=0.211 Sum_probs=351.4 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCC-cchhcccceeccccccccccccccceeccchhHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHEN-DIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIV 79 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~-~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv 79 (510) ++. ..+......++|+++|.+|+..+. .+++++.+||.|+| .|.++.. .....++++|+++||+++|| T Consensus 30 ~~~-~~~~~~~~~~~l~~~i~~~~~~~~-~r~~~l~~yY~g~~~~i~~~~~---------~~~~~~~~~ki~~n~~k~Iv 98 (501) T protein:vir:27 30 ADN-LEELMVNNWELLKNFINHHKLRQA-PRIQELLDYARGENHDVLQFGR---------RKDREMADKRAVHNYGRMIS 98 (501) T ss_pred ccc-ccccccccHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCCccccccCc---------cCccccccceeccchHHHHH Confidence 322 223334455679999999975543 46899999999985 5544432 24456788999999999999 Q ss_pred HHHHhhhhcCCceeccCc----HHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEE Q lcl|NC_013644. 80 DQKTQYLLSNPVEYETEN----EELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGV 154 (510) Q Consensus 80 ~~~~~~l~g~p~~~~~~d----~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~ 154 (510) ++.++||+|+||+|++++ +.+.+.|+++|+ |+++..+.++++.++++|+||+++|.|++|++++++++|.+++|+ T Consensus 99 d~~~~yl~g~p~~~~~~d~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~ded~~~~i~~~~p~~~~~v 178 (501) T protein:vir:27 99 KFKTGYLAGNPIRVEYDDNDNNSQNDDTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRNEYDETRIKRLNPLETFVI 178 (501) T ss_pred HHHhhhhcccCeeEecCCccchHHHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeCCCCceEEEEEccceeEEE Confidence 999999999999999876 445677888884 899999999999999999999999999999999999999999999 Q ss_pred EcCCC--CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccC Q lcl|NC_013644. 155 YNEYN--ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSY 232 (510) Q Consensus 155 ~d~~~--~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (510) ||++. ++.+++++|.....++ .+.++++||++.+++|...++.+. ....+|+| T Consensus 179 ~d~~~~~~~~~~ir~~~~~~~~~----~~~~~~vyt~~~v~~~~~~~~~~~---------------------~~~~~~~~ 233 (501) T protein:vir:27 179 YDNSLEDNSIAAVRYYNRGTLQN----AKDVVEIYTNEHIYTLDASDDFNE---------------------ISVTTHAF 233 (501) T ss_pred ecCCCCCceEEEEEEEEeeecCC----cEEEEEEEeCCeEEEEEeCCceee---------------------ccccccCC Confidence 99864 4555666655433332 367899999999999886543321 23468999 Q ss_pred CcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeecc------- Q lcl|NC_013644. 233 GQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTG------- 305 (510) Q Consensus 233 g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~------- 305 (510) |+||||+|+||+.|+|+|+++++|||+||.++|++++.++++++|+++++|+...+.++....++..+++.+. T Consensus 234 g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (501) T protein:vir:27 234 GTVPITEFLNNVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLMQLKPPKSADG 313 (501) T ss_pred CcccEEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccchhhhhhcCceeecccccccC Confidence 9999999999999999999999999999999999999999999999999998877766666666666655543 Q ss_pred --CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 306 --SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNK 382 (510) Q Consensus 306 --~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~ 382 (510) ++++++|++++.+.++++.++++|++.|+.+|++|++++.+ +||+||+||++++++|.+||..+++.|+++|+++++ T Consensus 314 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~ 393 (501) T protein:vir:27 314 KEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYR 393 (501) T ss_pred CCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 34579999999999999999999999999999999988765 588999999999999999999999999999999999 Q ss_pred HHHHHHhhccC-CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH--HHHHHHHHHHHHH Q lcl|NC_013644. 383 LVIDDINRRYT-KAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV--LRLICEQFDLDWE 459 (510) Q Consensus 383 ~i~~~~~~~~~-~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~--~~~~~e~~e~~~~ 459 (510) +|+.+++..+. ..++..+++|+|++++|.|+++.+++++++ +|+||+||+++++|+++|+++ +++++|+++.+.. T Consensus 394 li~~~~~~~~~~~~~d~~~i~v~f~~~~p~n~~e~ad~~~kl--~g~iS~et~l~~l~~v~D~~~E~eri~~E~~e~~~~ 471 (501) T protein:vir:27 394 LAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGL--GGQVSQETALSLSGLVESPNEELDKINKEVSEIDFK 471 (501) T ss_pred HHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhHh Confidence 99999987654 456778899999999999999999999877 588999999999999998543 2333332221111 Q ss_pred HHHHHHHhhhccCCCCCCCCCcccCCCCCCccccc Q lcl|NC_013644. 460 DVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQM 494 (510) Q Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (510) . . ............++.++...++.+...+ T Consensus 472 ~----~-~~~~~~~~~~~~d~~~~~~~d~~e~~~~ 501 (501) T protein:vir:27 472 G----Y-SNDFNEHVGKYTDEVKETHTDDFERAYE 501 (501) T ss_pred h----h-cCccccccccccCCCCCCccccccccCC Confidence 1 1 1111111111112111111111111111 No 35 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=100.00 E-value=2.1e-90 Score=512.26 Aligned_cols=450 Identities=19% Similarity=0.231 Sum_probs=351.7 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccC-CcchhcccceeccccccccccccccceeccchhHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHE-NDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIV 79 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~-~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv 79 (510) +..+ -+.+....++|.++|.+|+..+. .+++++.+||.|+ |+|+.+.. .....++++|+++||+++|| T Consensus 30 ~~~~-~~~~~~~~~~i~~~i~~~~~~~~-~r~~~~~~yY~g~~~~i~~~~~---------~~~~~~~~~ri~~n~~k~Iv 98 (501) T protein:vir:96 30 ADNL-EELMVNNWELLKNFINHHKLRQA-PRIQELLDYARGENHDVLKSGR---------RKDNEMADKRAVHNYGRMIS 98 (501) T ss_pred cccc-ccccCChHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCCCcccCccc---------cCccccccceeecchHHHHH Confidence 3332 23333445679999999985544 4699999999997 46655432 23456788999999999999 Q ss_pred HHHHhhhhcCCceeccCc----HHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEE Q lcl|NC_013644. 80 DQKTQYLLSNPVEYETEN----EELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGV 154 (510) Q Consensus 80 ~~~~~~l~g~p~~~~~~d----~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~ 154 (510) ++.++||+|+||+|++.+ +.+.+.|+++|+ |+++..+.+++++++++|+||+++|.|++|++++++++|.+++|+ T Consensus 99 d~~~~yl~g~p~~~~~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~~~~p~~~~~v 178 (501) T protein:vir:96 99 KFKTGYLAGNPIRVEYDDNDDNSQNDDAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSEYDETRIKRLSPLETFVI 178 (501) T ss_pred HHHhhhhcccCeeEeeCCccchhHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEEEccceeEEE Confidence 999999999999998765 456777888885 889999999999999999999999999999999999999999999 Q ss_pred EcCC--CCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccC Q lcl|NC_013644. 155 YNEY--NELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSY 232 (510) Q Consensus 155 ~d~~--~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (510) ||++ .++.+++++|+....++ .+.++++||++++++|....+.+. ....+|+| T Consensus 179 ~d~~~~~~~~~~v~~~~~~~~~~----~~~~~~vyt~~~i~~~~~~~~~~~---------------------~~~~~~~~ 233 (501) T protein:vir:96 179 YDNSLEDNSIAAVRYYNRGTLQS----AKDVVEIYTDEHIYTLDASDDFNE---------------------ISVTTHAF 233 (501) T ss_pred EcCCCCCceEEEEEEEEeecCCC----cEEEEEEEcCCcEEEEeeCCCcee---------------------ccccccCC Confidence 9976 35666666665433322 357899999999999976543321 23468999 Q ss_pred CcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeecc------- Q lcl|NC_013644. 233 GQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTG------- 305 (510) Q Consensus 233 g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~------- 305 (510) |+||||+|+|+++|+|+|+++++|||+||.++|++++.++++++|+++++|+...+.++....++..+++.+. T Consensus 234 g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (501) T protein:vir:96 234 GTVPITEYLNNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLMQLKPPKSADG 313 (501) T ss_pred CccceEEecCCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccchhhhhhcCeeeecccccccc Confidence 9999999999999999999999999999999999999999999999999999877766666666666665542 Q ss_pred --CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 306 --SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNK 382 (510) Q Consensus 306 --~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~ 382 (510) .+++++|++++.+.++++.++++|++.|+.+|++|++++++ +||+||+||+++++++.+||..+++.|+++|+++++ T Consensus 314 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~ 393 (501) T protein:vir:96 314 KEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTKGLKRRYR 393 (501) T ss_pred cccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 34579999999999999999999999999999999988765 588999999999999999999999999999999999 Q ss_pred HHHHHHhhccC-CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHH-HHHH Q lcl|NC_013644. 383 LVIDDINRRYT-KAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFD-LDWE 459 (510) Q Consensus 383 ~i~~~~~~~~~-~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e-~~~~ 459 (510) +|+.+++..+. ..++..+++|+|++++|.|+++.+++++++ +|+||+||+++++|+++|+++ .++++++++ ++.. T Consensus 394 li~~~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~ad~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~ 471 (501) T protein:vir:96 394 LAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGL--GGQVSQETALSLSGLVESPNEELDKINKEMSEIDFK 471 (501) T ss_pred HHHHHHHhcccccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHhhcc Confidence 99999987654 456677899999999999999999999987 488999999999999998643 222222222 1100 Q ss_pred HHHHHHHhhhccCCCCCCCCC-cccCCCCCCcccc Q lcl|NC_013644. 460 DVKEALEEAEYTKGLSDNTDE-EETAVNPDDPTQQ 493 (510) Q Consensus 460 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 493 (510) . . ........+...++ .+...++++...+ T Consensus 472 ~----~-~~~~~~~~~~~~~~~~e~~~d~~e~~~~ 501 (501) T protein:vir:96 472 G----Y-SNDFNEHVGKYTDEVKETHTDDFEREYE 501 (501) T ss_pred c----c-ccchhhcccccCCcCCCCCCCccccccC Confidence 0 0 00000111111111 1111112111111 No 36 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=100.00 E-value=4.5e-90 Score=510.39 Aligned_cols=451 Identities=19% Similarity=0.208 Sum_probs=352.1 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccC-CcchhcccceeccccccccccccccceeccchhHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHE-NDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIV 79 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~-~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv 79 (510) ++. +.+......+.|+++|++|+..+. .+++++.+||.|+ |+|.++.. .....++++|+++||+++|| T Consensus 31 ~~~-~~~~~~~~~~~i~~~i~~h~~~~~-~rl~~l~~yY~g~~~~i~~~~~---------~~~~~~~~~ki~~n~~k~Iv 99 (502) T protein:vir:48 31 ADN-LEELMVNNWELLKNFINHHKLRQA-PRIQELLDYARGENHDVLKSGR---------RKDNEMADKRAVHNYGRMIS 99 (502) T ss_pred ccc-hhhhccccHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCCcccccccc---------ccccccccceeecchHHHHH Confidence 333 334444556779999999975544 4689999999997 46665432 23456788999999999999 Q ss_pred HHHHhhhhcCCceeccCc----HHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEE Q lcl|NC_013644. 80 DQKTQYLLSNPVEYETEN----EELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGV 154 (510) Q Consensus 80 ~~~~~~l~g~p~~~~~~d----~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~ 154 (510) ++.++||+|+||++++++ +.+.+.|+++|+ |+++..+.++++.++++|+||+++|.|++|++++++++|.+++|+ T Consensus 100 d~~~~yl~g~p~~~~~~d~~~~~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~~~~p~~~~~v 179 (502) T protein:vir:48 100 KFKTGYLAGNPIRVEYDDNEDNSQNDDAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSEYDETRIKRLSPLETFVI 179 (502) T ss_pred HHHhhhhcccCeeEecCCccchhHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCceEEEEEcccceEEE Confidence 999999999999998865 446677888884 889999999999999999999999999999999999999999999 Q ss_pred EcCCC--CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccC Q lcl|NC_013644. 155 YNEYN--ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSY 232 (510) Q Consensus 155 ~d~~~--~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (510) ||++. ++.+++++|.....++ .+.++++||++.+++|...++.. .....+|+| T Consensus 180 ydd~~~~~~~~~ir~~~~~~~~~----~~~~~~iyt~~~i~~~~~~~~~~---------------------~~~~~~~~~ 234 (502) T protein:vir:48 180 YDNSLEDNSIAAVRYYNRGTLQN----AKDVVEIYTNQHIYTLDASDSFN---------------------EISVTPHAF 234 (502) T ss_pred EcCCCCCceEEEEEEEEEeecCC----cEEEEEEEeCCeEEEEEeCCcee---------------------eccceecCC Confidence 99764 4666666665443332 35688999999999987554322 123468999 Q ss_pred CcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeec-------- Q lcl|NC_013644. 233 GQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGT-------- 304 (510) Q Consensus 233 g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~-------- 304 (510) |.||||+|+||++|+|+|+++++|||+||.++|++++.++++++|+++++|....+.......++..+++.+ T Consensus 235 g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (502) T protein:vir:48 235 GTVPITEFLNNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGMQASDMKRTRLMQLKPPKSADG 314 (502) T ss_pred CccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccccchhhhhhcceeeccccccccc Confidence 999999999999999999999999999999999999999999999999999876665555555665555543 Q ss_pred -cCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 305 -GSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNK 382 (510) Q Consensus 305 -~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~ 382 (510) +++++++|++++.+.++++.++++|.++||.+|++|++++.. +||+||+||++++++|.+|+..+++.|+++|+++++ T Consensus 315 ~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~ 394 (502) T protein:vir:48 315 KEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSGNASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYR 394 (502) T ss_pred cccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 245789999999999999999999999999999999987765 588999999999999999999999999999999999 Q ss_pred HHHHHHhhccC-CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHH Q lcl|NC_013644. 383 LVIDDINRRYT-KAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWED 460 (510) Q Consensus 383 ~i~~~~~~~~~-~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~ 460 (510) +|+.+++..+. .+++..+++|+|++++|.|.++.+++++++ +|+||+||+++++|+++|+++ .++++++++.. T Consensus 395 li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~--- 469 (502) T protein:vir:48 395 LAARIGSLVNEFKDFDESRLKITFTPNLPKSLYEQVSILNDL--GGQVSQETALSLSGLVENPTEELDKINEESSKI--- 469 (502) T ss_pred HHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhh--- Confidence 99999987654 466778899999999999999999999887 588999999999999998643 33332222211 Q ss_pred HHHHHHhhhcc-CCCCCCCCCcccCCCCCCcccccccCcccccccccCC Q lcl|NC_013644. 461 VKEALEEAEYT-KGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLPE 508 (510) Q Consensus 461 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (510) ......+.. +..+.+.++.+ +.+.++.+. +|+ T Consensus 470 --~~~~~~~~~~~~~~~~~d~~~--e~~~~~~~~------------~~~ 502 (502) T protein:vir:48 470 --DFKGYPSYFYDNVGKYTDEVK--ETHTDDFER------------VYE 502 (502) T ss_pred --hhhcccccccccccccCCCcc--CCCCcCcCC------------CCC Confidence 011111111 11111111111 111111111 122 No 37 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=100.00 E-value=2.6e-90 Score=511.69 Aligned_cols=434 Identities=15% Similarity=0.179 Sum_probs=353.2 Q ss_pred CCCc------cCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccch Q lcl|NC_013644. 1 MEAL------LSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGF 74 (510) Q Consensus 1 ~~~~------~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~ 74 (510) |.-. ...+.+++.+.|.++|++|. .+..++.++.+||.|+|+|+++.. ....++++|+++|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~--~~~~r~~~~~~yy~g~~~i~~~~~----------~~~~~~~~ki~~n~ 68 (453) T protein:vir:73 1 MNLKPIKLMTYSRDEEITDKVVNDFMKKHQ--EEVERYEYLGNMYKGIMEISSQKA----------KDSWKPDNRLTNNF 68 (453) T ss_pred CccccceeeeccccccCCHHHHHHHHHHHH--HHHHHHHHHHHHhccccchhcCCC----------CCccCccceeecch Confidence 3322 34567788899999999985 456789999999999999987643 23457889999999 Q ss_pred hHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEE Q lcl|NC_013644. 75 FPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFG 153 (510) Q Consensus 75 ~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~ 153 (510) +++||++.++||+|+||+|+++++.+.+.|++||+ |++...+.+++++++++|+||+++|.|++|++++++++|.+++| T Consensus 69 ~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~ 148 (453) T protein:vir:73 69 AKYIVDTFVGYFNGIPIKKTHDDKSVLEAMQLFDNLNDMEDEESELAKIACVYGRAYELMYQNESTESEVIYCSPLNVFM 148 (453) T ss_pred HHHHHHHhhhhhcccCceeecCChHHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEcccceEE Confidence 99999999999999999999999999999999995 78999999999999999999999999999999999999999999 Q ss_pred EEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCC Q lcl|NC_013644. 154 VYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYG 233 (510) Q Consensus 154 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 233 (510) +||+..+....++++++...+ ...++++||++++++|....+.+.. ....+|+|| T Consensus 149 v~dd~~~~~~~~~i~~~~~~~-----~~~~~~vyt~~~i~~~~~~~~~~~~--------------------~~~~~~~~g 203 (453) T protein:vir:73 149 VYDDSIKQKPLFAVYYGFDEE-----GNLSGTVYTLLETISITGKAGEVKF--------------------GESTYNVYS 203 (453) T ss_pred EEeCCCCceeEEEEEEEEecC-----ceEEEEEEeCCeEEEEEecCCceEE--------------------ccceeccCC Confidence 999876655555554443332 1357899999999999876554432 234689999 Q ss_pred cccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeee---------- Q lcl|NC_013644. 234 QIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVG---------- 303 (510) Q Consensus 234 ~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~---------- 303 (510) .||||+|+|+++|+|+|+++++|||+||.++|++++.++++++|+++++|+...+ +....++..+++. T Consensus 204 ~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 281 (453) T protein:vir:73 204 DLPIVEYNFNEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGAEVDE--EDAKNIKDNRLINFFDKNSNGQG 281 (453) T ss_pred ceeEEEecCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCc--hhhhccccccccccccccccccc Confidence 9999999999999999999999999999999999999999999999999986543 2223333332222 Q ss_pred -ccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 304 -TGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNK 382 (510) Q Consensus 304 -~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~ 382 (510) .+.+++++|++|+.+.++++.++++|++.||.+|++|++++..+||+||+|+++++++|.+||.++++.|+++|+++++ T Consensus 282 ~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~ 361 (453) T protein:vir:73 282 TNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAANISDENFGNSSGVALAYKLQAMSNLALSFQRKFQSALNRRYS 361 (453) T ss_pred ccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2346779999999999999999999999999999999999988999999999999999999999999999999999999 Q ss_pred HHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHH-HHHHHHHHHHHHHHH Q lcl|NC_013644. 383 LVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNV-LRLICEQFDLDWEDV 461 (510) Q Consensus 383 ~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~-~~~~~e~~e~~~~~~ 461 (510) +|+.+++..+. ..+..+++|+|++++|.|+++.++++++++ |++|+||+++++|+++|+++ .+++++++++... T Consensus 362 li~~~~~~~~~-~~~~~~i~v~f~~~~p~~~~~~a~~~~k~~--giis~et~~~~~~~~~d~~~E~~ri~~E~~~~~~-- 436 (453) T protein:vir:73 362 LWSSLSTNASN-KDAWKDIEYTFTRNEPKDIKEQAETANILK--GITSEETALSVISVIPDVQAEMEKIKKKKLLQLS-- 436 (453) T ss_pred HHHHHHhccCC-ccccccceEEeCCCCCCCHHHHHHHHHHHh--ccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHH-- Confidence 99998876654 457788999999999999999999998875 88999999999999988543 3333333332221 Q ss_pred HHHHHhhhccCCCCCCCCCcccCC Q lcl|NC_013644. 462 KEALEEAEYTKGLSDNTDEEETAV 485 (510) Q Consensus 462 ~~~~~~~~~~~~~~~~~~~~~~~~ 485 (510) ..+...... .++...+- T Consensus 437 -~~~~~~~~~------~~~~~~~~ 453 (453) T protein:vir:73 437 -LTRTSNLVR------MKQMRGNL 453 (453) T ss_pred -HHHhccCCc------chhhhcCC Confidence 111111100 00000000 No 38 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=100.00 E-value=1.9e-90 Score=512.44 Aligned_cols=455 Identities=16% Similarity=0.172 Sum_probs=358.5 Q ss_pred CCCccCCC-hhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHH Q lcl|NC_013644. 1 MEALLSED-VKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIV 79 (510) Q Consensus 1 ~~~~~~~~-~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv 79 (510) -.-+...+ ..++.+.|.++|++|+..+ +.+++++++||.|+|++..++.. ......++++|+++||+++|| T Consensus 11 ~~~~~~~~~~~l~~~~i~~li~~~~~~~-~~r~~~l~~YY~g~~~~i~~~~~-------~~~~~~~~~~ki~~n~~~~Iv 82 (506) T protein:vir:94 11 ANLIYQESLENLTPNKIMKFITHHFNYQ-RPRLEMLDDYYQGYNLKILDKQS-------RRHEDGKADHRATHSFAKYIA 82 (506) T ss_pred ceeecccchhcCCHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCCcccccccc-------ccccccCCcceeecchHHHHH Confidence 11133334 4567788899999987644 45799999999999976544321 223456788999999999999 Q ss_pred HHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCC Q lcl|NC_013644. 80 DQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEY 158 (510) Q Consensus 80 ~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~ 158 (510) ++.++||||+||+|+++++.+++.|++||+ |+++..+.++++.++++|+||+++|+|++|++++++++|.+++|+||+. T Consensus 83 ~~~~~~l~G~p~~~~~~d~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded~~~~i~~~~p~~~~~v~dd~ 162 (506) T protein:vir:94 83 DFQTSYSVGNPINVKLPDDGSNSGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGEDNEEHLAKLDPLDTFVIYSTD 162 (506) T ss_pred HHhhhhhcccCceeecCcchHHHHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEEecCC Confidence 999999999999999999999999999995 7899999999999999999999999999999999999999999999975 Q ss_pred C--CceeEEEEEEEEEeeCCce-eEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcc Q lcl|NC_013644. 159 N--ELQRICRHYITEIEKDGET-VDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQI 235 (510) Q Consensus 159 ~--~~~~~~~~~~~~~~~~~~~-~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~i 235 (510) . ++.+++++|.....++... ....++++|++.++++|.....++.. ....+|+||.| T Consensus 163 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~--------------------~~~~~~~~g~v 222 (506) T protein:vir:94 163 VDPKPIMAVRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTPIMGKM--------------------QVDTTKPITTF 222 (506) T ss_pred CCCceEEEEEEEeeeeccCCceeEEEEEEEEEeCceEEEeccccCccce--------------------eccccccCCcc Confidence 4 4666677666555544433 35578899999999988766544322 23457999999 Q ss_pred cEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCC------------------------chhh Q lcl|NC_013644. 236 PFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGD------------------------DLSK 291 (510) Q Consensus 236 Pvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~------------------------~~~~ 291 (510) |||+|+|+++|.|||+++++|||+||.++|++++.++++++|+++++|.... .... T Consensus 223 Pvv~~~n~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 302 (506) T protein:vir:94 223 PVVEFKNSNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLE 302 (506) T ss_pred ceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhH Confidence 9999999999999999999999999999999999999999999999986421 1223 Q ss_pred hhHhhhcCeeeeccC---------CCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHH Q lcl|NC_013644. 292 LRQNVKSKKVVGTGS---------DGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTL 361 (510) Q Consensus 292 ~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~ 361 (510) +...++..+++.+++ +++++|++++.+.++++.++++|.+.||.+|++|++++.+ +||+||+||++++++ T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Aik~~~~~ 382 (506) T protein:vir:94 303 LIKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENFASNSSGVAMQYKVLG 382 (506) T ss_pred HHhhhhhcCeeeecccccccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccchHHHHHHHHHH Confidence 444555666666554 4579999999999999999999999999999999987655 588999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC-CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC Q lcl|NC_013644. 362 LNMKANKTEARLRALLEWMNKLVIDDINRRYT-KAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR 440 (510) Q Consensus 362 l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~-~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~ 440 (510) +.+||.+|+++|+++|++++++|+.+++..+. ..++..+++|+|++++|.|+++.+++++++ +|+||+||+++++|+ T Consensus 383 l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~lp~ 460 (506) T protein:vir:94 383 TVELASTKRRMFERGLYARYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQIKALVQA--GATLPQKYLYQQLPG 460 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCC Confidence 99999999999999999999999999987544 466778899999999999999999999877 589999999999999 Q ss_pred CCcHHH-HHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccc Q lcl|NC_013644. 441 LDDDNV-LRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMA 495 (510) Q Consensus 441 v~d~e~-~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (510) ++|+++ .++++++++.. ++.........+.+..+...+++.+++- T Consensus 461 v~d~~~E~~ri~~E~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 506 (506) T protein:vir:94 461 VTNPQDIVDMMKEQSANG----------DYSFDQNGVISNDGQTNTTATQTDEEVR 506 (506) T ss_pred CCCHHHHHHHHHHHHHHH----------hhcchhhcCCCcccCccccccccccCCC Confidence 998542 23332222111 1111111112222222222222222222 No 39 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=100.00 E-value=2.5e-88 Score=500.82 Aligned_cols=445 Identities=17% Similarity=0.217 Sum_probs=355.4 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |+.+ +..++.+.|+++|++|+. .++.+++++++||.|+|.+..++.. .......++++|+++||++.||+ T Consensus 23 ~~~~---~~~~~~~~i~~~i~~~~~-~~~~~~~~~~~yY~g~~~~i~~~~~------~~~~~~~~~~~ki~~n~~~~ivd 92 (481) T protein:vir:10 23 VSDL---AELLKEENLRNFISRHQT-EQVPRLEMLESYYLNRNTDILAGER------RLQKYGDKADHRAVHNYAKYVSR 92 (481) T ss_pred eecc---hhhcCHHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCcccccCcc------ccccccccccceeecchHHHHHH Confidence 4443 456777889999999874 4556799999999999864332221 22244567788999999999999 Q ss_pred HHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCC Q lcl|NC_013644. 81 QKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYN 159 (510) Q Consensus 81 ~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~ 159 (510) +.++||+|+||+|++++++..+.|+++|+ |+++..+.++++.++++|+||+++|.|++|++++++++|++++|+||+.. T Consensus 93 ~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~~i~~~~p~~~~~v~d~~~ 172 (481) T protein:vir:10 93 FIVGYLTGNPITITHQDNQTNDKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRDFEDRDTFKVLDPKSTFVVYDQTL 172 (481) T ss_pred HHHhhhccCCceEecCChhHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCeEEEEEEcccceEEEEcCCC Confidence 99999999999999999999999999995 78999999999999999999999999999999999999999999999764 Q ss_pred --CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccE Q lcl|NC_013644. 160 --ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPF 237 (510) Q Consensus 160 --~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 237 (510) ++.+++++|.....+ ...+.++++|+++++++|...++++... ...+|+||.||| T Consensus 173 ~~~~~~~i~~~~~~~~~---~~~~~~~~~y~~~~i~~~~~~~~~~~~~--------------------~~~~~~~g~vPv 229 (481) T protein:vir:10 173 DKKVVAGVRYFEKQDKD---KVPVQHVEVYTTDKIYYIEIKGGTYHRV--------------------EEVEHYYNDVPI 229 (481) T ss_pred CCceEEEEEEEEEeeCC---CceEEEEEEEecCeEEEEEecCCceeec--------------------ccccccCCceeE Confidence 466666665443332 2356789999999999999877655332 346899999999 Q ss_pred EEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeee---------ccCCC Q lcl|NC_013644. 238 YRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVG---------TGSDG 308 (510) Q Consensus 238 v~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~---------~~~~~ 308 (510) |+|+|+++|+|+|+++++|||+||.++|++++.++++++|+++++|+...+...... ++..+++. .++++ T Consensus 230 v~~~n~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 308 (481) T protein:vir:10 230 IEYLNDQFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDLDSEDAKA-FRDANMIHLEPGTNANGSEGKA 308 (481) T ss_pred EEeecCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCCCccchhh-hhhccceeccccccccCCCCCc Confidence 999999999999999999999999999999999999999999999975544443322 22222222 23467 Q ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 309 GLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDD 387 (510) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~ 387 (510) +++|++++++.++++.++++|++.|+.+|++|+++++. +||+||+|+++++++|.+||.++++.|+.+|++++++++.+ T Consensus 309 ~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~ 388 (481) T protein:vir:10 309 EVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQFSGVQSGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNN 388 (481) T ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 89999999999999999999999999999999988765 47899999999999999999999999999999999999999 Q ss_pred HhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHH-HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 388 INRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDN-VLRLICEQFDLDWEDVKEALE 466 (510) Q Consensus 388 ~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e-~~~~~~e~~e~~~~~~~~~~~ 466 (510) ++..+...++..+++++|++++|.|+++.+++++++ +|+||.||+++++|+++|++ +.++++++++... + T Consensus 389 ~~~~~~~~~~~~~i~v~f~~~~~~~~~~~a~~~~kl--~g~is~et~~~~l~~i~d~~~E~~ri~~E~~~~~---~---- 459 (481) T protein:vir:10 389 VNLTGLKQHNYAELTITFTPNLPKSMMESINAFNAL--SGGVSESTRLSLLDFIDNPKEELEKMQEEEAQRE---K---- 459 (481) T ss_pred HhccCCCccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHH---h---- Confidence 999888888889999999999999999999999877 58899999999999998853 3333333322111 1 Q ss_pred hhhccCCCCCCCCCcccCCCCCCc Q lcl|NC_013644. 467 EAEYTKGLSDNTDEEETAVNPDDP 490 (510) Q Consensus 467 ~~~~~~~~~~~~~~~~~~~~~~~~ 490 (510) ... ..+..+..+ +..+++++++ T Consensus 460 ~~~-~~~~~~~~~-~~~~~dd~~g 481 (481) T protein:vir:10 460 QAD-KRGYGEAFE-NHLNVDDSNG 481 (481) T ss_pred hhh-hccCCccCC-CCCCCCCCCC Confidence 111 111111111 1111111111 No 40 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=100.00 E-value=9.7e-89 Score=503.11 Aligned_cols=424 Identities=15% Similarity=0.123 Sum_probs=336.6 Q ss_pred HHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCCceeccCc- Q lcl|NC_013644. 19 AIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNPVEYETEN- 97 (510) Q Consensus 19 ~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d- 97 (510) +|..|+. .++.+|+++++||+|+|+++.++.. .....++++|+++||+++||++.++||||+||+|++.+ T Consensus 1 ~~~~~~~-~~~~r~~~l~~yy~g~~~~~~~~~~--------~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~ 71 (440) T protein:vir:95 1 MLAAFLG-SQKQRLAILASYAQGDNFSILSGHR--------RLDDEKADYRVRHKWGGYISSFATGYVIGNPVSIGVMEG 71 (440) T ss_pred ChhhHHH-HHHHHHHHHHHHhccCCcccccccc--------cccccCCcceeecchHHHHHHhhhhheeccCceEeeCCC Confidence 5555543 3455799999999999998765432 23556788999999999999999999999999998655 Q ss_pred --HHHHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCCCceeEEEEEEEEEee Q lcl|NC_013644. 98 --EELKEYLAEYY-NSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYNELQRICRHYITEIEK 174 (510) Q Consensus 98 --~~~~~~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~ 174 (510) ++..+.|+++| +|+++.++.+++++++++|+||+++|.|++|++++++++|.+++|+||+.+.....+.++++...+ T Consensus 72 ~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~i~~~~~~~ 151 (440) T protein:vir:95 72 GSADQLSTIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDKDKVDRVVLISPLEMFVIRDLTVEQNIIAAVHLPIYAD 151 (440) T ss_pred ccHHHHHHHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEecC Confidence 44566788887 588999999999999999999999999999999999999999999999865433333333333322 Q ss_pred CCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCCcHHHHH Q lcl|NC_013644. 175 DGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTDLKPIK 254 (510) Q Consensus 175 ~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~ 254 (510) ..++++||+.++++|....++.. ........+|+||+||||+|+|++.|+|+|++++ T Consensus 152 ------~~~~~vyt~~~~~~~~~~~~~~~-----------------~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~ 208 (440) T protein:vir:95 152 ------KVNMTVYTKDKVITYKPYSNNSV-----------------RLVVDDVKKHSYNDVPVVEWWNNRFRMGDYESEI 208 (440) T ss_pred ------ceEEEEEeCCeEEEEEEecCCcc-----------------ceeecceeeccCceeeEEEeeCCCCCCCchhhhH Confidence 34689999999999987654321 1122345689999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhccceeEEecCCCCc--hhhhhHhhhcCeeee---------ccCCCceeEEeecCCHHHHH Q lcl|NC_013644. 255 ALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDD--LSKLRQNVKSKKVVG---------TGSDGGLDVKTVTIPTEGRK 323 (510) Q Consensus 255 ~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~--~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~ 323 (510) +|||+||.++|++++.+++|++|+++++|..... .++....++..+++. .+++++++|++++++.++++ T Consensus 209 ~lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~ 288 (440) T protein:vir:95 209 SLIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKLSPEDAAKMKDANMLFLKTGISTTGQQTTADASYIYKQYDVNGTE 288 (440) T ss_pred HHHHHHHHHHHHHHHHHHHhhcceeeeecccccCCCCccchhhhhhccceecccccccccCCCCcceeEEeecCCHHHHH Confidence 9999999999999999999999999999963221 222233333333322 25667899999999999999 Q ss_pred HHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceee Q lcl|NC_013644. 324 TKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVS 402 (510) Q Consensus 324 ~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~ 402 (510) .++++|++.|+.+|++|++++.. +||+||+||++++++|.+||.+|++.|+++|++++++|+.+++...+..++..+++ T Consensus 289 ~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~ 368 (440) T protein:vir:95 289 AYKNRLANDIHRFSRIPNLDDDRFNSTSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAINGPVIEANKLT 368 (440) T ss_pred HHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccce Confidence 99999999999999999988766 57899999999999999999999999999999999999999998888888889999 Q ss_pred EEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcc Q lcl|NC_013644. 403 FTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEE 482 (510) Q Consensus 403 i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 482 (510) |+|++++|.|+++.+++++++ +|+||+||+++++|++++++++++++++++..... .....+...+++.+.+ T Consensus 369 i~f~~~~p~~~~~~ad~~~kl--~g~iS~et~~~~l~~~d~~~E~~ri~~E~~~~~~~------~~~~~~~~~~~~~~~e 440 (440) T protein:vir:95 369 FTFHPNIPQDVWTEIKAYIEA--GGEISQETLMENASFTDYKTEHSRILKQGGSSDLE------IGQIVGDADVGQADTE 440 (440) T ss_pred EEeCCCCCCCHHHHHHHHHHH--hccCcHHHHHHhCCCCCcHHHHHHHHHHHHHhhhh------HHhhccCCCCCCcCCC Confidence 999999999999999999887 68899999999999998765554444443322111 1111111111111111 No 41 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=100.00 E-value=2.8e-86 Score=489.62 Aligned_cols=451 Identities=17% Similarity=0.175 Sum_probs=346.9 Q ss_pred CCCc--cCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEAL--LSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~--~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) -+++ +..+.+++.+.+.++|.+|+. .++.+++++++||.|+|+|.+++.. ....++++|+++||+++| T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~i~~~~~-~~~~r~~~~~~yy~g~~~i~~~~~~---------~~~~~~~~ki~~n~~~~i 72 (489) T protein:vir:99 3 QEDFEAIDYESKLWIDQLKNYISRFKA-EQLERLKELKRYYLGDNNIKYRPAK---------TDKYAADNRIASDFAKYI 72 (489) T ss_pred ccceeeeCCCCCCCHHHHHHHHHHHHH-HHHHHHHHHHHHhcccCcccccccc---------ccccCCcceeecchHHHH Confidence 1111 333446678889999999974 4556799999999999999877542 244567889999999999 Q ss_pred HHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEE----CCCCceEEEEEcccceEE Q lcl|NC_013644. 79 VDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYART----NAEDRLCFQVADSLNVFG 153 (510) Q Consensus 79 v~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~----d~~g~~~i~~~~p~~~~~ 153 (510) |++.++||||+||+|+++++.+.+.|+++|+ |+++..+.++++.++++|+||+++|. |++|++++.+++|++++| T Consensus 73 v~~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~~~i~~~~p~~~~~ 152 (489) T protein:vir:99 73 TVFEQGYMLGVPVEYKNENKDLQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTEVKLYQLPAEQTFV 152 (489) T ss_pred HHHHhhhhccCCceeecCChhHHHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcceEEEEEcccceEE Confidence 9999999999999999999999999999995 78999999999999999999999985 577899999999999999 Q ss_pred EEcCCC--CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccccccccccccccccc Q lcl|NC_013644. 154 VYNEYN--ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRS 231 (510) Q Consensus 154 ~~d~~~--~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (510) +||+.. ++..++++|..... ....+.++++|+++.+++|+........ .......+|+ T Consensus 153 v~dd~~~~~~~~~i~~~~~~~~---~~~~~~~~~~y~~~~i~~~~~~~~~~~~-----------------~~~~~~~~~~ 212 (489) T protein:vir:99 153 IYDDTYQRNSLMAVHFYDIDYG---SGKRKQIIKAYTSDTIYTYEDYNLETKG-----------------MRLKDYEGHF 212 (489) T ss_pred EEcCCCCCceEEEEEEEEEecC---CCceEEEEEEEeCCcEEEEEecCCCccc-----------------ceeccccccc Confidence 999765 45555655543322 2345678999999999999875432211 1223457899 Q ss_pred CCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCch--hhhh--------------Hh Q lcl|NC_013644. 232 YGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDL--SKLR--------------QN 295 (510) Q Consensus 232 ~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~--~~~~--------------~~ 295 (510) ||+||||+|+|++.|+|+|+++++|||+||.++|++++.++++++|+++++|+..... .+.. .. T Consensus 213 ~g~vPvv~~~n~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (489) T protein:vir:99 213 FKGVPVNEYANNEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTGADENDYLDDGRLNPNGRLAISIG 292 (489) T ss_pred CCceeEEEeecCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCcccccchhhhhhcccccccccccccc Confidence 9999999999999999999999999999999999999999999999999999743321 1111 11 Q ss_pred hhcCeeeeccC-------CCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHH Q lcl|NC_013644. 296 VKSKKVVGTGS-------DGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKAN 367 (510) Q Consensus 296 ~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~ 367 (510) .+..+++.+.+ +.+++|++++++.++++.++++|.+.||.+|++|++++.+ +||+||+||+++++++.+||. T Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~ 372 (489) T protein:vir:99 293 FKKAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDMKFSGVQSGESMKYKLMASDNYRE 372 (489) T ss_pred cccceeeeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHH Confidence 12233333322 4578999999999999999999999999999999987654 589999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccCCc---cccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcH Q lcl|NC_013644. 368 KTEARLRALLEWMNKLVIDDINRRYTKA---FDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDD 444 (510) Q Consensus 368 ~k~~~~~~~l~~~~~~i~~~~~~~~~~~---~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~ 444 (510) +|++.|+.+|++++++|+.+++..+... ....+++|+|++++|.|+++.+++++++ +|+||+||+++++|+++++ T Consensus 373 ~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl--~giis~et~~~~l~~v~~~ 450 (489) T protein:vir:99 373 KQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNL--YGIVSDQTIFEILNTVTGV 450 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCCHHHHHHhcCCCCch Confidence 9999999999999999999998765442 2345799999999999999999999877 4899999999999999865 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCc Q lcl|NC_013644. 445 NVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDP 490 (510) Q Consensus 445 e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (510) +.++.+++.++++.+ ...... ....++.. ++.++.++.+ T Consensus 451 d~~~E~~ri~~E~~~----~~~~~~-~~~~~~~~--~~~~~~~~~p 489 (489) T protein:vir:99 451 DAEAELKRLKEEADK----KQSLPE-PRLVGDAS--GQEEPTAEKP 489 (489) T ss_pred hHHHHHHHHHHHHHH----Hhcccc-ccccCCCC--CCcCCCCCCC Confidence 443323222221111 111111 11111111 1111111111 No 42 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=100.00 E-value=4.2e-75 Score=428.35 Aligned_cols=452 Identities=10% Similarity=0.032 Sum_probs=325.5 Q ss_pred CCCccCCCh--hhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEALLSEDV--KIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~~~~~~--~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) |+.++.... +.....+..+|.+|. .+..++.++.+||+|+|+|....... + ....++|+++||+++| T Consensus 1 ~~~~i~~~~~~~~~~~~~~~L~~~~~--~~~~r~~~~~~YY~G~~~i~~~~~~~--------~-~~~~~~~~~~n~~~~i 69 (485) T protein:vir:24 1 MTAPLPGQEEIADPAIARDEMVSAFE--DQNQNLRSNTSYYEAERRPEAIGVTV--------P-VQMQSLLAHVGYPRLY 69 (485) T ss_pred CCCCCCCCCcccchHHHHHHHHHHHH--HHHHHHHHHHHHHhccCchhhcCccc--------c-hhhhhhhhccchHHHH Confidence 999987665 445666677888884 34567899999999999885433211 1 1224568889999999 Q ss_pred HHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC--------CceEEEEEccc Q lcl|NC_013644. 79 VDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAE--------DRLCFQVADSL 149 (510) Q Consensus 79 v~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~--------g~~~i~~~~p~ 149 (510) |++.++||++++++.. +++...+.++++|+ |+++..+.+++++++++|+||++||.+++ +.++|+.++|. T Consensus 70 vd~~~~~l~~~g~~~~-~~~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~~p~ 148 (485) T protein:vir:24 70 VDSIAERQAVEGFRLG-DADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVEPPT 148 (485) T ss_pred HHHHhhhhccCceecC-CCchhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEEEeccc Confidence 9999999999988743 45667777888885 88999999999999999999999998865 55789999999 Q ss_pred ceEEEEcCCC-CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccc Q lcl|NC_013644. 150 NVFGVYNEYN-ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLL 228 (510) Q Consensus 150 ~~~~~~d~~~-~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 228 (510) +++++||+.. ++..++++|+ ..+ ...+.++++|+++.+++|...++++... ... T Consensus 149 ~~~~i~D~~~~~~~~~~~~~~--~~~---~~~~~~~~~y~~~~~~~~~~~~~~~~~~--------------------~~~ 203 (485) T protein:vir:24 149 RMYAEIDPRIGRPAKAIRVAY--DAE---GNEIQAATLYTPNETFGWFRAEGEWVEW--------------------FSD 203 (485) T ss_pred eeEEEeeCCcCceeEEEEEEE--eec---CCeEEEEEEEcCCcEEEEEecCCceEee--------------------ccc Confidence 9999999754 4545554443 222 2346778999999999998776554321 235 Q ss_pred cccCCcccEEEecCCC-----CCCCcHH-HHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhh-----hh-Hhh Q lcl|NC_013644. 229 QRSYGQIPFYRLSNNK-----QETTDLK-PIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSK-----LR-QNV 296 (510) Q Consensus 229 ~~~~g~iPvv~~~nn~-----~g~sd~~-~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~-----~~-~~~ 296 (510) +|+||.||||+|+|++ +|.|+++ .|++|||+||.++|++++.++++++|++++.|++..+... .. ... T Consensus 204 ~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~ 283 (485) T protein:vir:24 204 PHGLGAVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTLFDA 283 (485) T ss_pred ccCCCcccEEEeccCcccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCccccccccccccchhhh Confidence 7999999999999984 5889997 6899999999999999999999999999999976543211 01 111 Q ss_pred hcCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccC-----cccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 297 KSKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDG-----NITNIVIKARYTLLNMKANKTEA 371 (510) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g-----~~Sg~Ai~~~~~~l~~k~~~k~~ 371 (510) ..+ .+...+++++++.+++ ....+.++++|+..|+.++.+|++.+..+| ++||+||++++.+|.+||.+|++ T Consensus 284 ~~~-~i~~~~~~~~~~~q~~--~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~ 360 (485) T protein:vir:24 284 YLA-RILAFEDAEGKIQQFS--AAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNA 360 (485) T ss_pred ccc-ceeccCCCCceEEeec--ccchHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHH Confidence 222 3334456678876654 456778899999999998877776543322 36999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcC--CCchHHHHHhCCCCCcHHH-HH Q lcl|NC_013644. 372 RLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETR--KIILESILQVAPRLDDDNV-LR 448 (510) Q Consensus 372 ~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g--~iS~et~~~~~~~v~d~e~-~~ 448 (510) .|+++|++++++++.+.+.. ....+...++++|+++.|.|.++.++.+++++++| ++|+||+++++|+++++.+ .+ T Consensus 361 ~f~~~l~~~~~l~~~~~~~~-~~~~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~~e~~ 439 (485) T protein:vir:24 361 IFGGAWEEAMRLAYRLMKGG-DVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARKDMGYSIAEREEMR 439 (485) T ss_pred HHHHHHHHHHHHHHHHhcCC-CCccccceeeEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHhhCCCCHhHHHHHH Confidence 99999999999998876543 33456778999999999999999999999999866 7999999999999876532 22 Q ss_pred HHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCccc-CCCCCCcccccccCc Q lcl|NC_013644. 449 LICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEET-AVNPDDPTQQMAEGA 498 (510) Q Consensus 449 ~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 498 (510) ++.+++..+...... ...+.....++++.+ +.+++.++.+-++++ T Consensus 440 ~~~ee~~~~~~~~~~-----~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~a 485 (485) T protein:vir:24 440 RWDEEEAAMGLGLLG-----TMVDADPTVPGSPNPTPAPKPQPAIEGGDSA 485 (485) T ss_pred HHHHHHhhhhhhHHH-----hhcccCCCCCCCCCCCCCCCCccCCCCCCCC Confidence 222221111111111 111111111111111 112222222222222 No 43 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=100.00 E-value=4.5e-75 Score=428.19 Aligned_cols=475 Identities=12% Similarity=0.052 Sum_probs=335.1 Q ss_pred CCCc-cCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccc-cceeccchhHHH Q lcl|NC_013644. 1 MEAL-LSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYAS-NVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~-~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~-~~ki~~n~~~~I 78 (510) ++.+ ..-+.+.+...+.+++.+|.. +..+++++++||+|+|++...... ..+..++ +.++++||+++| T Consensus 16 ~~~p~~~~~~~~~~~l~~~l~~~~~~--~~~rl~~l~~YY~G~~~~~~~~~~--------~~~~~~~~~~~~v~n~~~~i 85 (501) T protein:vir:25 16 VEFPEDSMSREQLGALVADMWRLHIS--ERQWLDRIYEYTKGLRGRPEVPEG--------ASDEVKELAKLSVKNVLSLV 85 (501) T ss_pred ccCCcccCChHHHHHHHHHHHHHHHH--HHHHHHHHHHHHhcCCCchhcccc--------CChhhhhhHhhhhcChHHHH Confidence 3333 222445556677778877753 556799999999999986443221 1223333 335678999999 Q ss_pred HHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcC Q lcl|NC_013644. 79 VDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNE 157 (510) Q Consensus 79 v~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~ 157 (510) |++.+++++.++ |.+++++..+.++++|+ |+++....+++++++++|+||++||.+++| ++|++++|.+++++|++ T Consensus 86 vd~~a~~l~~~g--f~~~d~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~~-~~i~~~sp~~~~~iy~D 162 (501) T protein:vir:25 86 RDSFAQNLSVVG--YRNALAKENDPAWEMWQRNRMDARQAEVHRPALTYGASYVTVTPTDEG-PVFRTRSPRQILAVYAD 162 (501) T ss_pred HHHHHhhhcccc--eecCCccchHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCC-CeEEEeccccEEEEEec Confidence 999999998766 44556666677888885 899999999999999999999999999887 68999999999999965 Q ss_pred -CCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccc-cccccccccccccccccccccccCCcc Q lcl|NC_013644. 158 -YNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEA-EPINPRPHVLAVDSENESLLQRSYGQI 235 (510) Q Consensus 158 -~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~g~i 235 (510) ..+....+.++++..... .....++++|++..+|+|............. ............+.......+|+||.| T Consensus 163 ~~~~~~~~~ai~~~~~~~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 240 (501) T protein:vir:25 163 PSVDAWPQYALETWVAQKD--AKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVIEHGATFEGKPVC 240 (501) T ss_pred CCCCcceeEEEEEEeeccc--cCcceeEEEecCeeEEEEecCceeeeeccccccccccccccccccccccccccCCccce Confidence 444333333333332222 2234578899998888776443211110000 001111111222333445678999999 Q ss_pred cEEEecCC----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCcee Q lcl|NC_013644. 236 PFYRLSNN----KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLD 311 (510) Q Consensus 236 Pvv~~~nn----~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (510) |||+|+|+ .+|+|+|+++++|+|+||+++|++++.++++++|++++.|++.++...+ .++.++++.+ ++++++ T Consensus 241 Piv~f~N~~~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~--~~~~~~i~~~-~~~~~~ 317 (501) T protein:vir:25 241 PVVRFVNGRDADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVISGWTGSKAEVL--KASALRVWTF-EDPEVK 317 (501) T ss_pred eeEeccCccccCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHhCCCCCccchh--hhcccceecc-CCCCce Confidence 99999994 5689999999999999999999999999999999999999977654433 3444555544 456788 Q ss_pred EEeecC-CHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_013644. 312 VKTVTI-PTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDIN 389 (510) Q Consensus 312 ~~~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~ 389 (510) +.+++. +.+++..+++.+..+|+..|++|+..+++ ++|+||+||++++.+|.+|+.+|++.|+++|++++++++.+.+ T Consensus 318 ~~q~~~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~ 397 (501) T protein:vir:25 318 AQAFPPASVEPYNLILEEMLQHVAMVAQISPAQVTGKMINVSAEALAAAEANQQRKLAAKRESFGESWEQLLRLAAEMDD 397 (501) T ss_pred EEEecccChHHHHHHHHHHHHHHHhhcCCChhhhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Confidence 877764 56788888999999999999999987764 5789999999999999999999999999999999999988765 Q ss_pred hccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_013644. 390 RRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAE 469 (510) Q Consensus 390 ~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~ 469 (510) .. ...+..+++++|+++.|.|.++.++++++++++| ||.||++.++|++++++++++++++++.+......+..... T Consensus 398 ~~--~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~g-is~et~~~~~~g~~~~~ie~~~~~~~e~~~~~~~~~~~~~~ 474 (501) T protein:vir:25 398 DP--DTAADSGAEVLWRDTEARSFGAVVDGITKLASAG-IPIEHLLSMVPGMTQQTIQAIKDSLRGGEVKSLVDKLLSNE 474 (501) T ss_pred CC--ccccceeeeEEecCCCCCCHHHHHHHHHHHHhcC-CCHHHHHHHcCCCCHHHHHHHHHHHHHHhHHHHHHHhhccC Confidence 43 2334568999999999999999999999999988 59999999999999888766666555544333332221111 Q ss_pred ccCCCCCCCCCcccCCCCCCcccccccCcccc Q lcl|NC_013644. 470 YTKGLSDNTDEEETAVNPDDPTQQMAEGATGS 501 (510) Q Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (510) +.+....+.+..+.+++.+ +..+..|+ T Consensus 475 -~~~~~~~~~~~~~~~~~~~----~~~~~~g~ 501 (501) T protein:vir:25 475 -PAPVPPPPPQAAAQALNEG----GVNGNGGA 501 (501) T ss_pred -cCCCCCCCCCCCccccccc----cCCCCCCC Confidence 1111111111111111111 11222222 No 44 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=100.00 E-value=5.9e-75 Score=427.52 Aligned_cols=458 Identities=11% Similarity=0.069 Sum_probs=320.8 Q ss_pred CCCccCCChhhhHHHHHH-----HHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchh Q lcl|NC_013644. 1 MEALLSEDVKIIANALKA-----AIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFF 75 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~-----~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~ 75 (510) |=. ..+..++.+.|.+ ++.+|. .+..++.++++||+|+|+|.+....... +...+..+++++||+ T Consensus 1 ~~~--~p~~~l~~~~~~~~~~~~l~~~~~--~~~~r~~~~~~YY~g~~~i~~~~~~~~~------~~~~~~~~~~~~n~~ 70 (479) T protein:vir:99 1 MID--LPDEDLSSEGLAKYLETKVFPKMN--TECERLDDFEAWTKNGQEVPDLATRHKN------KEREVLQQLSRKPWM 70 (479) T ss_pred Ccc--CCcccCChhHHHHHHHHHHHHHHH--HHhHHHHHHHHHHhcCCcccccccccCC------hhHHHHHHHhhcCcH Confidence 322 2233455555544 444553 4556799999999999998765432111 111223345678999 Q ss_pred HHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEE-----CCCCceEEEEEccc Q lcl|NC_013644. 76 PEIVDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYART-----NAEDRLCFQVADSL 149 (510) Q Consensus 76 ~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~-----d~~g~~~i~~~~p~ 149 (510) ++||++.+++++.+++ ++.+.+..+.+.++|+ |+++..+.+++++++++|+||++||. |++|.+++++++|+ T Consensus 71 ~~iVd~~~~~l~~~gf--~~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~~~i~~~~p~ 148 (479) T protein:vir:99 71 GLMVNSFAQQLIVDGY--RKTGTNENAKGWDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGISPLDGTTVARIKCIDPR 148 (479) T ss_pred HHHHHHHHhhcccccc--cCCCchhhHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCcCCCCceEEEEechh Confidence 9999999999986664 5566666777888885 88999999999999999999999994 67789999999999 Q ss_pred ceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccccccccccccccc Q lcl|NC_013644. 150 NVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQ 229 (510) Q Consensus 150 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (510) +++++||+.......+ |.+.... ...+.+|+...+++|....+.+.. ....+ T Consensus 149 ~~~~iydd~~~~~~~~--~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~--------------------~~~~~ 200 (479) T protein:vir:99 149 DAFAIWEDPYWDEWPK--YLLERQP------NGQYWWWTEEDYSIFEFKQGKFIY--------------------RETVS 200 (479) T ss_pred heEEEecCCcccceee--EEEeecC------ceeEEEEecceEEEEEecCCceee--------------------ccccc Confidence 9999998765433322 2222221 224567888888777766554432 13468 Q ss_pred ccCCcccEEEecCC----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhh---HhhhcCeee Q lcl|NC_013644. 230 RSYGQIPFYRLSNN----KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLR---QNVKSKKVV 302 (510) Q Consensus 230 ~~~g~iPvv~~~nn----~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~---~~~~~~~~~ 302 (510) |+||+||||+|+|+ ++|+|+|+++++|||+||+++|++++.++++++|++++.|+...+..... ..+...+++ T Consensus 201 h~~g~vPvv~f~n~~~~~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~i~ 280 (479) T protein:vir:99 201 HDYGHIPFVRYVNVMDLRGVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATGLMLPEGANADQEKMRFAQESML 280 (479) T ss_pred cCCCCcceEEeecCCCcCcCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCCCcccccccchhccccccccce Confidence 99999999999998 57999999999999999999999999999999999999997654322221 122334444 Q ss_pred eccCCCceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 303 GTGSDGGLDVKTVTI-PTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMN 381 (510) Q Consensus 303 ~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~ 381 (510) . .+++++++.+++. +.+++.+.++.+..+|+..+++|+..++.++|+||+||++++.+|.+||..+++.|+.+|++++ T Consensus 281 ~-~~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~g~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~ 359 (479) T protein:vir:99 281 I-SQNEKASFGAIPAAPLDGLLNAYKESLLEFLALAQLPPHIAGQIVNVAADALAAGTRQTMQKLFEKQATWKASHNQTM 359 (479) T ss_pred e-ecCCCceEEEecccchHHHHHHHHHHHHHHhccCCCCHHHcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3 4566788877653 3455555555555566666678777777789999999999999999999999999999999999 Q ss_pred HHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHH- Q lcl|NC_013644. 382 KLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWED- 460 (510) Q Consensus 382 ~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~- 460 (510) ++++.+.+... ..+...++++|.++.+.|.++.++++++++++|++|.||+++++|++++++++++.+++++..... T Consensus 360 ~l~~~~~~~~~--~~~~~~i~~~w~~~~~~s~~~~ad~~~kl~~ag~is~et~l~~l~gv~~~~~e~~~~~~~~~~~~~~ 437 (479) T protein:vir:99 360 RLVNKIEGRTE--EATDLDFTITWQDVTIQSLAQFADAWAKMVESLKIPAEGVWDMIPNLDQSTVNGWKEIYDREGDFGK 437 (479) T ss_pred HHHHHHcCCCc--cccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHH Confidence 99988765432 234457899999999999999999999999999999999999999999887665544443332211 Q ss_pred HHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 461 VKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) ........+. +.+..+..++.+.. +..+.+.....|+|--| T Consensus 438 ~~~~~~~~~~--------~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 478 (479) T protein:vir:99 438 YMRKLQNGPD--------PAEQRGGPNGATNM-QQANNKTGEPASLNKSG 478 (479) T ss_pred HHHHHhcccC--------cccccCCCCCCCCC-CCCCCCCcchhccCCCC Confidence 1111111110 01111111111111 11112222344556556 No 45 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=100.00 E-value=3.4e-74 Score=423.38 Aligned_cols=457 Identities=11% Similarity=0.062 Sum_probs=312.8 Q ss_pred CCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhh Q lcl|NC_013644. 6 SEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQY 85 (510) Q Consensus 6 ~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~ 85 (510) +..+ .++|..+++.|. .+..++.++++||+|+|++.+.... .. ....++|+++||+++||++.++| T Consensus 1 ~~t~---~d~i~~L~~~~~--~~~~r~~~~~~Yy~G~~~i~~~~~~--------~~-~~~~~~~~~~n~~~~ivd~~~~~ 66 (480) T protein:vir:78 1 MTTY---HEHVERLQGLLA--RDLPNLLEAEAYRNGTRRLKTIGIG--------AP-PELAYLDVQPGWVATYLRTLSDR 66 (480) T ss_pred CCCH---HHHHHHHHHHHH--HHHHHHHHHHHHHhccccchhcccc--------cc-hhhhhhhhhcchHHHHHHHHHhh Confidence 2222 235666766663 3456789999999999987543221 11 11235688999999999999999 Q ss_pred hhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEE------CCCCceEEEEEcccceEEEEcCC Q lcl|NC_013644. 86 LLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYART------NAEDRLCFQVADSLNVFGVYNEY 158 (510) Q Consensus 86 l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~------d~~g~~~i~~~~p~~~~~~~d~~ 158 (510) |++++++. .++++..+.|+++|+ |+++.++.+++++++++|+||++||. |++|.++|.+++|.+++|+||++ T Consensus 67 l~~~g~~~-~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~i~~~~p~~~~~i~D~~ 145 (480) T protein:vir:78 67 LDIEGFRI-SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPR 145 (480) T ss_pred hccCceec-CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeEEEEEcccceEEEEcCC Confidence 99998875 356677888999995 89999999999999999999999985 56889999999999999999976 Q ss_pred C--CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCccc Q lcl|NC_013644. 159 N--ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIP 236 (510) Q Consensus 159 ~--~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP 236 (510) . ++..++++|...+ +. ..++++++|+++.+++|...++..... .......+|+||+|| T Consensus 146 ~~~~~~~~i~~~~~~d--~~--~~~~~~~~y~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~g~vP 205 (480) T protein:vir:78 146 NTRRVTRAVRLYTTRD--DV--AVPDRATLYLPDETVPLRRNGGLNDQW----------------VVDGDVIKHGLGVVP 205 (480) T ss_pred CccceEEEEEEEEeec--CC--cceEEEEEEeCCeEEEEEecCCCcccc----------------cccccccccCCCCcc Confidence 4 4555666554332 22 246789999999999998765432210 012245689999999 Q ss_pred EEEecCCC-----CCCCcHH-HHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhh---hHhhhcCeeeeccCC Q lcl|NC_013644. 237 FYRLSNNK-----QETTDLK-PIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKL---RQNVKSKKVVGTGSD 307 (510) Q Consensus 237 vv~~~nn~-----~g~sd~~-~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~---~~~~~~~~~~~~~~~ 307 (510) ||+|+|+. +|+|+++ .|++|+|+||+++|++++.++++++|+++++|++.++..+. .........+...++ T Consensus 206 vv~f~n~~~~~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~ 285 (480) T protein:vir:78 206 VVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLAS 285 (480) T ss_pred eEEeecccccCCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhCCCccccccccccchhhhhhhhhccCCC Confidence 99999974 5899997 59999999999999999999999999999999765432111 111111222334456 Q ss_pred CceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCcccccccc-Cc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 308 GGLDVKTVTI-PTEGRKTKMEIDKENIYKFGMAFDSTQVGD-GN-ITNIVIKARYTLLNMKANKTEARLRALLEWMNKLV 384 (510) Q Consensus 308 ~~~~~~~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~-~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i 384 (510) +++++.+++. +.+++.+.++.+..+|+..+++|+..+++. .| +||+||++++.+|.+||.+|++.|+.+|+++++++ T Consensus 286 ~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~rl~ 365 (480) T protein:vir:78 286 EAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIA 365 (480) T ss_pred CCceEEecCccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7789988764 334444444444444444455555555432 23 69999999999999999999999999999999999 Q ss_pred HHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcC--CCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 385 IDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETR--KIILESILQVAPRLDDDNVLRLICEQFDLDWEDVK 462 (510) Q Consensus 385 ~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g--~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~ 462 (510) +.+++.. ...+...++++|+++.|.|.++.++++++++++| ++|++|+++++|+++++.++ +++++++ +.+... T Consensus 366 ~~~~~~~--~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~e-~~~~~~~-~~~~~~ 441 (480) T protein:vir:78 366 MQIMGRE--VTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQ-MRDWDKQ-ETEDMI 441 (480) T ss_pred HHHcCCC--ccccceeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHhHHHH-HHHHHHH-HHHHHH Confidence 9886532 3345678999999999999999999999999876 68999999999988665322 2222211 111111 Q ss_pred HHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccc Q lcl|NC_013644. 463 EALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQ 505 (510) Q Consensus 463 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (510) .++.. + .. ........+...+.++..+++++++|-+..- T Consensus 442 ~~~~~-~--~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 442 DTLYS-T--TK-AQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) T ss_pred HHhhc-c--cc-CCCccccCCCCCCCCCccCCCcccCCCcCCC Confidence 11111 1 11 1111111111111111122222223322222 No 46 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=100.00 E-value=6.9e-74 Score=421.69 Aligned_cols=457 Identities=12% Similarity=0.066 Sum_probs=314.2 Q ss_pred CCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhh Q lcl|NC_013644. 6 SEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQY 85 (510) Q Consensus 6 ~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~ 85 (510) +..+ .++|..+++.|. .++.++.++++||+|+|++.+.... .+....++|+++||+++||++.++| T Consensus 1 ~~t~---~~~i~~L~~~~~--~~~~r~~~l~~Yy~G~~~i~~~~~~---------~~~~~~~~~~~~n~~~~ivd~~~~~ 66 (480) T protein:vir:78 1 MTTY---HEHVERLQGLLA--RDLPNLLEAEAYRNGTRRLKTIGIG---------APPELAYLDVQPGWVATYLRTLSDR 66 (480) T ss_pred CCCH---HHHHHHHHHHHH--HHHHHHHHHHHHHhccccccccccc---------cchhHhhhhhhcchHHHHHHHHHhh Confidence 1112 235666777663 3456789999999999987443221 1122335689999999999999999 Q ss_pred hhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEE------CCCCceEEEEEcccceEEEEcCC Q lcl|NC_013644. 86 LLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYART------NAEDRLCFQVADSLNVFGVYNEY 158 (510) Q Consensus 86 l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~------d~~g~~~i~~~~p~~~~~~~d~~ 158 (510) |++++++.. ++++..+.|+++|+ |+++.++.+++++++++|+||++||. |++|.++|.+++|.+++|+||++ T Consensus 67 l~~~g~~~~-~d~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~i~~~~p~~~~~~~D~~ 145 (480) T protein:vir:78 67 LDIEGFRIS-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPR 145 (480) T ss_pred hccCceecC-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeEEEEEcccceEEEEcCC Confidence 999988754 56677788889985 89999999999999999999999996 56889999999999999999975 Q ss_pred C--CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCccc Q lcl|NC_013644. 159 N--ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIP 236 (510) Q Consensus 159 ~--~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP 236 (510) . ++..++++|...+. ...+.++++|+++.+++|+..++..... .......+|+||+|| T Consensus 146 ~~~~~~~~i~~~~~~~~----~~~~~~~~~y~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~g~vP 205 (480) T protein:vir:78 146 NTRRVTRAVRLYTTRDD----VAVPDRATLYLPDETVPLRRNGGLNDQW----------------VVDGDVIKHGLGVVP 205 (480) T ss_pred CccceEEEEEEEEeecC----CCceEEEEEEeCCeEEEEEecCCCcccc----------------ccccccccCCCCCcc Confidence 3 56666666543322 2246788999999999998765432110 011235689999999 Q ss_pred EEEecCCC-----CCCCcHHH-HHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhh---hHhhhcCeeeeccCC Q lcl|NC_013644. 237 FYRLSNNK-----QETTDLKP-IKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKL---RQNVKSKKVVGTGSD 307 (510) Q Consensus 237 vv~~~nn~-----~g~sd~~~-v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~---~~~~~~~~~~~~~~~ 307 (510) ||+|+|++ +|+|+++. |++|+|+||+++|++++.++++++|++++.|++..+..+- .........+...++ T Consensus 206 vv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~ 285 (480) T protein:vir:78 206 VVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLAS 285 (480) T ss_pred eEEeecccccCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhhcCCccccccccccchhhhhhhhhccCCC Confidence 99999874 68899985 9999999999999999999999999999999865432211 011111222334456 Q ss_pred CceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCcccccccc--CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 308 GGLDVKTVTI-PTEGRKTKMEIDKENIYKFGMAFDSTQVGD--GNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLV 384 (510) Q Consensus 308 ~~~~~~~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i 384 (510) +++++.+++. +.+++.+.++.+..+|+..+++|+..+++. +++||+||++++..|..||.++++.|+++|+++++++ T Consensus 286 ~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~~~l~ 365 (480) T protein:vir:78 286 EAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIA 365 (480) T ss_pred CCceEEecCccCHHHHHHHHHHHHHHHhcccCCChHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7899988775 344444444444444444455555555432 2369999999999999999999999999999999999 Q ss_pred HHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcC--CCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 385 IDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETR--KIILESILQVAPRLDDDNVLRLICEQFDLDWEDVK 462 (510) Q Consensus 385 ~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g--~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~ 462 (510) +.+++.. ...+...++++|+++.+.|.++.++++++++++| ++|+||+++++|+++++. +++.+++++. .+... T Consensus 366 ~~~~g~~--~~~~~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~-~~~~~~~~e~-~~~~~ 441 (480) T protein:vir:78 366 MQIMGRE--VTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR-EQMRDWDKQE-TEDMI 441 (480) T ss_pred HHHcCCC--ccccceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcCCCCHhHH-HHHHHHHHHH-HHHHH Confidence 9887643 2345567999999999999999999999999876 789999999999987643 2222222221 11111 Q ss_pred HHHHhhhccCCCCCCCCCcccCCCCCCcccccccCccccccc Q lcl|NC_013644. 463 EALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTES 504 (510) Q Consensus 463 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (510) ..+. ....+.....+++..+..+ .++++.+.|..-+..+ T Consensus 442 ~~~~--~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 442 DTLY--STTKAQADATPKPTVTETK-TETQTSPSGFNRTKTR 480 (480) T ss_pred HHhh--ccccccCCCCCCCCCCCCC-CccccccCCCCcccCC Confidence 1111 1111111111111111111 1222222222222222 No 47 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=100.00 E-value=3e-73 Score=418.17 Aligned_cols=454 Identities=10% Similarity=0.044 Sum_probs=324.7 Q ss_pred CCCccC--CChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEALLS--EDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~~~--~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) |+.++. ++++...+++..++++|.. +..++.++.+||+|+|+|.+..... ... ..+.+++.||+++| T Consensus 1 ~~~~~~~~~e~~~~~~~~~~l~~~~~~--~~~r~~~l~~YY~G~~~i~~~~~~~--------~~~-~~~~~~v~n~~~~i 69 (486) T protein:vir:42 1 MTAPLPGMEEIEDPAVVREEMISAFED--ASKDLASNTSYYDAERRPEAIGVTV--------PRE-MQQLLAHVGYPRLY 69 (486) T ss_pred CCCCCCCCCCcccHHHHHHHHHHHHHH--HHHHHHHHHHHhcccCcchhccccc--------chh-HhhhhhccchHHHH Confidence 888854 4556667788899998854 4467888999999999886543211 111 12457788999999 Q ss_pred HHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECC--------CCceEEEEEccc Q lcl|NC_013644. 79 VDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNA--------EDRLCFQVADSL 149 (510) Q Consensus 79 v~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~--------~g~~~i~~~~p~ 149 (510) |++.+++|.+.+++.. +++...+.++++|+ |+++..+.+++++++++|+||++||.++ ++.++|++++|+ T Consensus 70 Vd~~~~~l~~~g~~~~-~~~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~ 148 (486) T protein:vir:42 70 VDSVAERQAVEGFRLG-DADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVEPPT 148 (486) T ss_pred HHHHHhhhcccceecC-CCchhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEeccc Confidence 9999999988877643 45556677788884 8899999999999999999999999765 556799999999 Q ss_pred ceEEEEcCCC-CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccc Q lcl|NC_013644. 150 NVFGVYNEYN-ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLL 228 (510) Q Consensus 150 ~~~~~~d~~~-~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 228 (510) +++++||+.. ++.+++++|+ ..+ ...++++++|+++.+++|...++++... ... T Consensus 149 ~~~~i~d~~~~~~~~~~~~~~--~~~---~~~~~~~~~y~~~~~~~~~~~~~~~~~~--------------------~~~ 203 (486) T protein:vir:42 149 RMHAEIDPRINRVSKAIRVAY--DKE---GNEIQAATLYTPMETIGWFRADGEWAEW--------------------FNV 203 (486) T ss_pred ceEEEEeCCCCCeEEEEEEEE--ecC---CCeEEEEEEEcCCcEEEEEecCCcEEee--------------------cce Confidence 9999999643 4555555543 222 2346788999999999998876655432 235 Q ss_pred cccCCcccEEEecCCC-----CCCCcHH-HHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhh-----hhHhhh Q lcl|NC_013644. 229 QRSYGQIPFYRLSNNK-----QETTDLK-PIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSK-----LRQNVK 297 (510) Q Consensus 229 ~~~~g~iPvv~~~nn~-----~g~sd~~-~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~-----~~~~~~ 297 (510) +|+||+||||+|+|++ +|.|+++ .|++|||+||+++|++++.++++++|+++++|++..+... ...... T Consensus 204 ~h~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~ 283 (486) T protein:vir:42 204 PHGLGVVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDSETGQTLFDA 283 (486) T ss_pred ecCCCCceEEEeccccccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhcCCccccccccccccchhhh Confidence 7999999999999984 5889998 5899999999999999999999999999999976533211 001111 Q ss_pred cCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccC----c-ccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 298 SKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDG----N-ITNIVIKARYTLLNMKANKTEAR 372 (510) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g----~-~Sg~Ai~~~~~~l~~k~~~k~~~ 372 (510) ....+...+++++++.+++ ....+.++++++..|+.+|.+|++.+..+| | +||+||++++.+|.+||.+|++. T Consensus 284 ~~~~~~~~~~~~~~~~q~~--~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~ 361 (486) T protein:vir:42 284 YLARILAFEDAEGKIQQFS--AAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLM 361 (486) T ss_pred hhchhcccCCCCceEEeec--ccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHH Confidence 1122334456778887654 445778889999999988877776543322 2 69999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhc--CCCchHHHHHhCCCCCcHH-HHHH Q lcl|NC_013644. 373 LRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAET--RKIILESILQVAPRLDDDN-VLRL 449 (510) Q Consensus 373 ~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~--g~iS~et~~~~~~~v~d~e-~~~~ 449 (510) |+++|++++++++.+++... ...+...++++|+++.|.|.++.++++++++++ |++|+||+++++|+++++. +.++ T Consensus 362 f~~~l~~~~~l~~~~~~~~~-~~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~~e~~~ 440 (486) T protein:vir:42 362 FGGAWEEAMRIAYRIMKGGD-VPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARIDMGYSVKEREEMRR 440 (486) T ss_pred HHHHHHHHHHHHHHHhcCCC-ccccceeeeEEecCCCCCCHHHHHHHHHHHHhcccCCCCHHHHHhcCCCChhHHHHHHH Confidence 99999999999988765433 234567899999999999999999999999886 6899999999999988763 2233 Q ss_pred HHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcc-cCCCCCCcccccccCcccccc Q lcl|NC_013644. 450 ICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEE-TAVNPDDPTQQMAEGATGSTE 503 (510) Q Consensus 450 ~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 503 (510) +.+++.........+.. +.....+..+. ++++.+++ +..++|.++ T Consensus 441 ~~~e~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~ 486 (486) T protein:vir:42 441 WDEEEAAMGLGLLGTMV-----DADPTVPGSPSPTAPPKPQP----AIESSGGDA 486 (486) T ss_pred HHHHHHHHHHHHHHHhh-----cCCCCCCCCCCCCCCCCCCc----ccCCCCCCC Confidence 32332222221111111 11111111111 11111111 111222222 No 48 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=100.00 E-value=1.7e-73 Score=419.58 Aligned_cols=423 Identities=12% Similarity=0.029 Sum_probs=310.5 Q ss_pred cCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHh Q lcl|NC_013644. 5 LSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQ 84 (510) Q Consensus 5 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~ 84 (510) ++++... +|+.++++|. ..+.++.++++||+|+|++....... ....+++|+++||+++||++.++ T Consensus 1 ~~~~~~~---~i~~l~~~~~--~~~~r~~~l~~Yy~G~~~i~~~~~~~---------~~~~~~~k~~~n~~~~ivd~~~~ 66 (441) T protein:vir:80 1 MNSDELA---LIEGMYDRIQ--RLSSWHCCIEGYYEGSNRVRDLGVAI---------PPELQRVQTVVSWPGIAVDALEE 66 (441) T ss_pred CCccHHH---HHHHHHHHHH--HHHHHHHHHHHHHhcCCcchhcCccc---------chhhhhhhhhcchHHHHHHHHHh Confidence 4444433 3666666664 33456889999999999875443221 12234679999999999999999 Q ss_pred hhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCC-Cce Q lcl|NC_013644. 85 YLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYN-ELQ 162 (510) Q Consensus 85 ~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~-~~~ 162 (510) |+.+.++ ++++. +.|+++|+ |+++..+.+++++++++|+||++||.|++|.+++++++|.+++|+||+.. ++. T Consensus 67 ~l~~~g~--~~~d~---~~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~ 141 (441) T protein:vir:80 67 RLDWLGW--TNGDG---YGLDGVYAANRLATASCDVHLDALIFGLSFVAIIPHGDGTVSVRPQSPKNCTGKFSADGSRLD 141 (441) T ss_pred hhccccc--cCCCh---HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeCCCCceEEEEEccceEEEEEeCCCCcee Confidence 9976654 44443 34677774 89999999999999999999999999999999999999999999999754 444 Q ss_pred eEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecC Q lcl|NC_013644. 163 RICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSN 242 (510) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n 242 (510) .++++|+... + ...++++|+++.+++|...+++.+.. ....+|+||+||||+|.| T Consensus 142 ~~~~~~~~~~--~----~~~~~~vy~~~~~~~~~~~~~~~~~~-------------------~~~~~~~~g~vPvv~~~n 196 (441) T protein:vir:80 142 AGLVVQQTCD--P----EVVEAELLLPDVIVQVERRGSREWVE-------------------VDRIPNVLGAVPLVPIVN 196 (441) T ss_pred EEEEEEEEec--C----ceEEEEEEecCeEEEEEEcCCcceee-------------------ccccccCCCceeEEEeec Confidence 5555554322 1 24578999999999887765443221 234689999999999998 Q ss_pred CC-----CCCCcHH-HHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCc---eeEE Q lcl|NC_013644. 243 NK-----QETTDLK-PIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGG---LDVK 313 (510) Q Consensus 243 n~-----~g~sd~~-~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~ 313 (510) ++ +|.|++. .|++|||+||.++|++++.++++++|+++++|+..++...........+++.++.+++ +++. T Consensus 197 ~~~~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~G~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 276 (441) T protein:vir:80 197 RRRTSRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVTGVSADEFSQPGWVLSMASVWAVDKDDDGDTPNVG 276 (441) T ss_pred cccCCccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeeecCCccccccchhhhcccccccCCCCCCCCcceeE Confidence 85 4889885 5999999999999999999999999999999987665444444455566777665543 4554 Q ss_pred eecCCHHHHHHHHHHHHHHHHHHhCCccc---cccccC-c-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 314 TVTIPTEGRKTKMEIDKENIYKFGMAFDS---TQVGDG-N-ITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDI 388 (510) Q Consensus 314 ~~~~~~~~~~~~~~~l~~~i~~~s~~p~~---~~~~~g-~-~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~ 388 (510) ++ +....+.++++++..|+.++.++++ .+++.+ | +||+||++++.+|.+||.+|++.|+++|++++++++.++ T Consensus 277 ~~--~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~ 354 (441) T protein:vir:80 277 SF--PVNSPTPYSDQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKAL 354 (441) T ss_pred ec--CccchHHHHHHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 44 3445566666676666666554444 444433 2 599999999999999999999999999999999999998 Q ss_pred hhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCC--chHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 389 NRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKI--ILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALE 466 (510) Q Consensus 389 ~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~i--S~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~ 466 (510) +..........+++++|++++|.|.++.++++++++++|++ |++++++++|+++++ .+++++++++........... T Consensus 355 ~~~~~~~~~~~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s~~~~~~~l~~~~~e-~~~~~~e~~e~~~~~~~~~~~ 433 (441) T protein:vir:80 355 DSRVDEADFFGDVGLRWRDASTPTRAATADAVTKLVGAGILPADSRTVLEMLGLDDVQ-VEAVMRHRAESSDPLAVLAGA 433 (441) T ss_pred cCCCcccccceeeeEEeCCCCCcCHHHHHHHHHHHHhcCcccccHHHHHHhCCCCHHH-HHHHHHHHHHHHHHHHHHhhh Confidence 87766666677899999999999999999999999999964 789999999987543 333333332221111111111 Q ss_pred hhhccCCC Q lcl|NC_013644. 467 EAEYTKGL 474 (510) Q Consensus 467 ~~~~~~~~ 474 (510) ....++.. T Consensus 434 ~~~~~~~~ 441 (441) T protein:vir:80 434 ISRQTNEV 441 (441) T ss_pred hhcccccC Confidence 11111111 No 49 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=100.00 E-value=4.9e-73 Score=417.02 Aligned_cols=452 Identities=10% Similarity=0.048 Sum_probs=319.2 Q ss_pred CCCccCCC--hhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEALLSED--VKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~~~~~--~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) |...+... .+.....+..++++|. .+..++.++++||+|+|++.+..+.. .... .++++++||+++| T Consensus 1 ~~~~i~~~~~~~~~~~~~~~l~~~~~--~~~~r~~~~~~Yy~G~~~i~~~~~~~--------~~~~-~~~~~~~n~~~~i 69 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIARDEMVSAFE--DSTQNLKTNTSYYEAERRPEAIGVTV--------PIQM-QSLLAHVGYPRLY 69 (485) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHH--HHHHHHHHHHHHHhcCCcchhcCCCC--------Chhh-hhhhhhcCcHHHH Confidence 88775444 5556666777877774 34457999999999999885533221 1122 2456778999999 Q ss_pred HHHHHhhhhcCCceeccCcHHHHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECC--------CCceEEEEEccc Q lcl|NC_013644. 79 VDQKTQYLLSNPVEYETENEELKEYLAEYY-NSEFQVVLQELVEGSSQKGFEYVYARTNA--------EDRLCFQVADSL 149 (510) Q Consensus 79 v~~~~~~l~g~p~~~~~~d~~~~~~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~d~--------~g~~~i~~~~p~ 149 (510) |++.++||++++++. .++++..+.++++| .|+++.++.++++.++++|+||+++|.++ ++.++|++++|. T Consensus 70 vd~~~~~l~~~g~~~-~~~~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~ 148 (485) T protein:vir:10 70 VDSIAERQAVEGFRF-GDADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRVEPPT 148 (485) T ss_pred HHHHHhhhcccceec-CCCchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEEEccc Confidence 999999999888764 35556677788888 48899999999999999999999999875 467889999999 Q ss_pred ceEEEEcCCCC-ceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccc Q lcl|NC_013644. 150 NVFGVYNEYNE-LQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLL 228 (510) Q Consensus 150 ~~~~~~d~~~~-~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 228 (510) +++++||+..+ +..++++|+ ..+ .....++++|+++.+++|....+++... ... T Consensus 149 ~~~~~~D~~~~~~~~~~~~~~--~~~---~~~~~~~~~y~~~~~~~~~~~~~~~~~~--------------------~~~ 203 (485) T protein:vir:10 149 RMYAEIDPRIGRVSKAIRVAY--DAE---GNEIQAATLYTPNDIFGWYRVENEWQEW--------------------FNN 203 (485) T ss_pred eeEEEEcCCCCceeEEEEEEE--eeC---CCeEEEEEEEeCCeEEEEEEcCCceEEe--------------------ccc Confidence 99999997554 444444333 222 2346788999999999999876655432 235 Q ss_pred cccCCcccEEEecCCC-----CCCCcHH-HHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhh-----h-hHhh Q lcl|NC_013644. 229 QRSYGQIPFYRLSNNK-----QETTDLK-PIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSK-----L-RQNV 296 (510) Q Consensus 229 ~~~~g~iPvv~~~nn~-----~g~sd~~-~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~-----~-~~~~ 296 (510) +|+||+||||+|+|+. +|+|+++ .|++|||+||+++|++++.++++++|+++++|++..+... . .... T Consensus 204 ~~~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~ 283 (485) T protein:vir:10 204 PHGLGVVPVVPIPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTLFDA 283 (485) T ss_pred cCCCCcccEEEeccccccCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcCCcccccccccccchhhhh Confidence 7999999999999984 4789997 5899999999999999999999999999999975543211 0 1111 Q ss_pred hcCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc---ccccc--CcccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 297 KSKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDS---TQVGD--GNITNIVIKARYTLLNMKANKTEA 371 (510) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~---~~~~~--g~~Sg~Ai~~~~~~l~~k~~~k~~ 371 (510) ... .+...+++++++.+++ ....+.++++|+..|+.++.+|++ .+++. +++||+||++++.+|.+||.+|++ T Consensus 284 ~~~-~i~~~~~~d~k~~q~~--~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~k~~ 360 (485) T protein:vir:10 284 YLA-RILAFEDAEGKIQQFS--AAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNS 360 (485) T ss_pred ccc-ceeccCCCCceEEeec--ccchHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHH Confidence 222 3334456778887755 344667777777777777655554 44332 236999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcC--CCchHHHHHhCCCCCcHH-HHH Q lcl|NC_013644. 372 RLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETR--KIILESILQVAPRLDDDN-VLR 448 (510) Q Consensus 372 ~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g--~iS~et~~~~~~~v~d~e-~~~ 448 (510) .|+.+|++++++++.+.+. .....+...++|+|++++|.|.++.++++++++++| ++|+||+++++|+++++. +.+ T Consensus 361 ~f~~~l~~~~~l~~~~~~~-~~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s~et~~~~lg~~~~~~~~~~ 439 (485) T protein:vir:10 361 IFGGAWEEAMRLAYRMMKG-GDVPPDMLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIPRERARKDMGYSIAEREEMR 439 (485) T ss_pred HHHHHHHHHHHHHHHHhCC-CCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCCHhHHHHHH Confidence 9999999999999886553 233445678999999999999999999999999877 899999999999876642 222 Q ss_pred HHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 449 LICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 449 ~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) ++.+++..+......++. . +. ...+++.+.++.++.+ +.+-|+.| T Consensus 440 ~~~ee~~~~~~~~~~~~~-~--~~--~~~~~~~~~~~~~~~~------------~~~~~~~~ 484 (485) T protein:vir:10 440 RWDEEEAAMGLGLIGTMV-D--PN--PTVPGSPSPAPAPKPA------------ALESGGDA 484 (485) T ss_pred HHHHHHHHHHHHHHHHhh-c--cC--CCCCCCCCccccccCc------------CCCCCCCC Confidence 222222222222111111 1 11 1111111111112111 11122222 No 50 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=100.00 E-value=5.6e-73 Score=416.68 Aligned_cols=449 Identities=9% Similarity=0.020 Sum_probs=317.0 Q ss_pred ccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHH Q lcl|NC_013644. 4 LLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKT 83 (510) Q Consensus 4 ~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~ 83 (510) |-..+..+...+|..++..|.. ++.++.++++||+|+|+|.+..... .... .++|+++||+++||++.+ T Consensus 1 ~~~~~~~d~~~~i~~L~~~~~~--~~~r~~~~~~Yy~g~~~i~~~~~~~--------~~~~-~~~~~~~n~~~~ivd~~a 69 (488) T protein:vir:23 1 MAETESIDPEKLRDQLLDAFEN--KQNELKSSKAYYDAERRPDAIGLAV--------PLDM-RKYLAHVGYPRTYVDAIA 69 (488) T ss_pred CCcccCCCHHHHHHHHHHHHHH--HHHHHHHHHHHHhcccchhhcCccc--------chhh-hhhhhhcchHHHHHHHHH Confidence 3333334445667777777743 3457899999999999886544321 1222 357899999999999999 Q ss_pred hhhhcCCce------e---ccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEE--------CCCCceEEEE Q lcl|NC_013644. 84 QYLLSNPVE------Y---ETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYART--------NAEDRLCFQV 145 (510) Q Consensus 84 ~~l~g~p~~------~---~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~--------d~~g~~~i~~ 145 (510) ++|+.+++. + .+++++..+.|+++|+ |+++..+.+++++++++|+||++||. ++++.++|++ T Consensus 70 ~~l~~~Gf~~~~~~~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~~i~~ 149 (488) T protein:vir:23 70 ERQELEGFRIPSANGEEPESGGENDPASELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVPLIRV 149 (488) T ss_pred HhhhccceeccCCcccccccccchhHHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcceEEE Confidence 766544433 2 2456777888888885 88999999999999999999999986 4567789999 Q ss_pred EcccceEEEEcCCCC-ceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccc Q lcl|NC_013644. 146 ADSLNVFGVYNEYNE-LQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSEN 224 (510) Q Consensus 146 ~~p~~~~~~~d~~~~-~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (510) ++|++++|+||+..+ +..++++| ...+++ .+.++++|+++.+++|...++++... T Consensus 150 ~~p~~~~~~~d~~~~~~~~~~~~~--~~~~~~---~~~~~~~y~~~~~~~~~~~~~~~~~~------------------- 205 (488) T protein:vir:23 150 EPPTALYAEVDPRTRKVLYAIRAI--YGADGN---EIVSATLYLPDTTMTWLRAEGEWEAP------------------- 205 (488) T ss_pred eccceeEEEEecCCCceEEEEEEE--EecCCC---cEEEEEEEecCcEEEEEecCCceEec------------------- Confidence 999999999997543 33333333 333332 35678999999999998776655332 Q ss_pred cccccccCCcccEEEecCCC-----CCCCcHH-HHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhh------ Q lcl|NC_013644. 225 ESLLQRSYGQIPFYRLSNNK-----QETTDLK-PIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKL------ 292 (510) Q Consensus 225 ~~~~~~~~g~iPvv~~~nn~-----~g~sd~~-~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~------ 292 (510) ...+|+||+||||+|+|+. +|+|+++ .|++|+|+||+++|++++.++++++|+++++|+...+.... T Consensus 206 -~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~ 284 (488) T protein:vir:23 206 -TSTPHGLEMVPVIPISNRTRLSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGINAETGQR 284 (488) T ss_pred -cccccCCCCcceEEeccccccCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCcccccccccccch Confidence 2457999999999999975 5889997 58999999999999999999999999999999765432110 Q ss_pred hHhhhcCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhC---Ccccccccc--CcccHHHHHHHHHHHHHHHH Q lcl|NC_013644. 293 RQNVKSKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGM---AFDSTQVGD--GNITNIVIKARYTLLNMKAN 367 (510) Q Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~---~p~~~~~~~--g~~Sg~Ai~~~~~~l~~k~~ 367 (510) .-....+.++.+++++++++.+++. ...+.++++|+..|+.++. +|+..+++. +++||+||++++.+|.+||. T Consensus 285 ~~~~~~~~v~~~~~g~~~~~~q~~~--~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~ 362 (488) T protein:vir:23 285 MFDAYMARILAFEGGEGAHAEQFSA--AELRNFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAESRLVKKVE 362 (488) T ss_pred hhhhhhhhhccCCCCCCceeEecCC--CChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHH Confidence 1112234466667777889877653 3456666666666666655 444444332 23699999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcC--CCchHHHHHhCCCCCcHH Q lcl|NC_013644. 368 KTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETR--KIILESILQVAPRLDDDN 445 (510) Q Consensus 368 ~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g--~iS~et~~~~~~~v~d~e 445 (510) ++++.|+++|++++++++.+++... ...+..+++++|+++.|.|.++.++++++++++| ++|+||+++++|+++++. T Consensus 363 ~~~~~f~~~l~~~~~l~~~~~~~~~-~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l~~~~d~~ 441 (488) T protein:vir:23 363 RKNKIFGGAWEQAMRLAYKMVKGGD-IPTEYYRMETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDMGYTIVER 441 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCC-cchhhccceEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhCCCCchHH Confidence 9999999999999999998876433 2345678999999999999999999999999876 799999999999988753 Q ss_pred H-HHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCc Q lcl|NC_013644. 446 V-LRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGA 498 (510) Q Consensus 446 ~-~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (510) + .+++.+++..+......++.. .....+.. .....++..+++++.+ T Consensus 442 ~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~---~~~~~~~~~~~e~~~a 488 (488) T protein:vir:23 442 EQMRQWLEQDQKQGLGLIGSLYG----ASTPEGKP---GEAPVGEPPAPEPDAA 488 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHhc----cCCCcccC---CCCCCCCCCCCCCCCC Confidence 2 222222222111111111111 11111111 1112233334444444 No 51 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=100.00 E-value=1.3e-72 Score=414.68 Aligned_cols=455 Identities=9% Similarity=0.043 Sum_probs=317.3 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |.+++..+..++++.+.+.+..+.... ..++.++.+||+|+|++.+..... +. ...+.++++||+++||+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~~-~~rl~~l~~Yy~G~~~i~~~~~~~--------~~-~~~~~~~~~n~~~~ivd 70 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTER-TQDLGDNTAYYESERRPDAVGVTV--------PQ-QMQKLLAHVGYPRLYID 70 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHHH-HHHHHHHHHHHhccccchhccccc--------ch-hHHhhhhhcCcHHHHHH Confidence 999999998888766555433333222 345788999999999875432211 11 11234678899999999 Q ss_pred HHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCc--------eEEEEEcccce Q lcl|NC_013644. 81 QKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDR--------LCFQVADSLNV 151 (510) Q Consensus 81 ~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~--------~~i~~~~p~~~ 151 (510) +.+++|++++++.. ++++..+.++++|+ |+++..+.+++++++++|+||++||.+++|. ++|++++|.++ T Consensus 71 ~~~~~l~~~g~~~~-~~~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~ 149 (484) T protein:vir:77 71 AIAARQELEGFRLG-GADKADEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEPPTNL 149 (484) T ss_pred HHHhhhccCceecC-CcchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEecccee Confidence 99999999988864 45556677788885 8999999999999999999999999998874 57999999999 Q ss_pred EEEEcCCC-CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccc Q lcl|NC_013644. 152 FGVYNEYN-ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQR 230 (510) Q Consensus 152 ~~~~d~~~-~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (510) |++||+.. ++.+++++|+ ..++ ..+.++++|+++.+++|....+++... ...+| T Consensus 150 ~~~~D~~~~~~~~a~~~~~--~~~~---~~~~~~~~y~~~~~~~~~~~~~~~~~~--------------------~~~~~ 204 (484) T protein:vir:77 150 YAQIDPRTRQVMRAIRAIE--DEEG---NEVIGATLYLPNNTVIWNREDGQWVQV--------------------ANVAH 204 (484) T ss_pred EEEecCCCCceEEEEEEEE--eecC---CcEEEEEEEecCeEEEEEecCCceEee--------------------ccccC Confidence 99999753 4444444433 3222 235678899999999998776655432 23579 Q ss_pred cCCcccEEEecCCC-----CCCCcHH-HHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhh----hHhhh-cC Q lcl|NC_013644. 231 SYGQIPFYRLSNNK-----QETTDLK-PIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKL----RQNVK-SK 299 (510) Q Consensus 231 ~~g~iPvv~~~nn~-----~g~sd~~-~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~----~~~~~-~~ 299 (510) +||+||||+|+|+. +|+|+|+ .|++|+|+||+++|++++.++++++|++++.|++.++.... ....+ .. T Consensus 205 ~~g~vPvv~f~N~~~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~ 284 (484) T protein:vir:77 205 NLEMVPVIPIPNRTRLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVKGEELGVDPETGQTLFDAYL 284 (484) T ss_pred CCCCcceEEeccccccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCCcchhcccccccchhhhhhh Confidence 99999999999975 5899997 59999999999999999999999999999999765432110 01111 11 Q ss_pred eeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcc---cccccc-Cc-ccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 300 KVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFD---STQVGD-GN-ITNIVIKARYTLLNMKANKTEARLR 374 (510) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~---~~~~~~-g~-~Sg~Ai~~~~~~l~~k~~~k~~~~~ 374 (510) ..+...+++++++.+++ ..+.+.++++|+..|+.+|.+++ ..+++. .| +||+||++++.+|.+||.+|++.|+ T Consensus 285 ~~~~~~~~~~~~~~q~~--~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~k~~~f~ 362 (484) T protein:vir:77 285 ARILAFEDHESKAQQFS--AAELRNFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAIRSSESRLVKTVERKNKIFG 362 (484) T ss_pred hhhcccCCCCceeEeec--CCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12333355678886655 34456777777777777765554 444432 23 6999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcC--CCchHHHHHhCCCCCcHHH-HHHHH Q lcl|NC_013644. 375 ALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETR--KIILESILQVAPRLDDDNV-LRLIC 451 (510) Q Consensus 375 ~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g--~iS~et~~~~~~~v~d~e~-~~~~~ 451 (510) ++|++++++++.+.+... ...+...++++|+++.|.|.++.++++++++++| ++|.||+++++|+++++.+ .+++. T Consensus 363 ~~l~~~~~l~~~~~~~~~-~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s~et~~~~l~~~~~~~~e~~~~~ 441 (484) T protein:vir:77 363 GAWEQAMRVAYKVMNGGD-IPPEYYRMESIWRDPSTPTYAAKADAATKLYNNGQGVIPKERARIDMGYSITEREEMRKWD 441 (484) T ss_pred HHHHHHHHHHHHHhCCCC-cccccccceEEecCCCCCCHHHHHHHHHHHHhccCCCCCHHHHHhcCCCChhHHHHHHHHH Confidence 999999999988765432 3345668999999999999999999999999876 8999999999999877532 22222 Q ss_pred HHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccc Q lcl|NC_013644. 452 EQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTE 503 (510) Q Consensus 452 e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (510) +++.........++... .+.+.+.++..+.++ ++....++.++ T Consensus 442 ~ee~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~--------~~~~~~~~~~~ 484 (484) T protein:vir:77 442 EEEQAQGLGLMGTMFGT-DPSGGGNPDNPETPE--------PQPNPAEEAAA 484 (484) T ss_pred HHHHHHHHHHHhhhccc-cccCCCCCCCCCccc--------ccCCCccccCC Confidence 22222221111111111 111111111111111 11111111111 No 52 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=100.00 E-value=1.2e-70 Score=403.92 Aligned_cols=440 Identities=7% Similarity=-0.046 Sum_probs=303.9 Q ss_pred ccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceecccccccccccc-ccceeccchhHHHHHHH Q lcl|NC_013644. 4 LLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYA-SNVRIPHGFFPEIVDQK 82 (510) Q Consensus 4 ~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~-~~~ki~~n~~~~Iv~~~ 82 (510) |-...++ +++..++.+|. .++.++.++++||+|+|+|.+..+.. ....+ .++|+++||+++||++. T Consensus 1 ~~~~t~~---~~~~~l~~~~~--~~~~r~~~l~~Yy~g~~~i~~~~~~~--------~~~~~~~~~k~~~n~~~~ivd~~ 67 (456) T protein:vir:10 1 MTASTPA---EWLPVLTKRID--DGMSRVRLLARYSNGDAPLPELTRNT--------SAAWRSFQREARTNWGLMVRDSV 67 (456) T ss_pred CCCCCHH---HHHHHHHHHHH--HHHHHHHHHHHHHhcCCCchhcCccc--------ChhhhhhhhhhhcchHHHHHHHH Confidence 2222232 44566666664 34567899999999999875543321 12222 35789999999999999 Q ss_pred HhhhhcCCceeccC-cHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCCC Q lcl|NC_013644. 83 TQYLLSNPVEYETE-NEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYNE 160 (510) Q Consensus 83 ~~~l~g~p~~~~~~-d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~~ 160 (510) ++|++|+|+++.++ +.+..+.++++|+ |+++....+++++++++|+||+++|.|++|.+++++++|.+++++||+... T Consensus 68 ~~~l~~~~~~~~~~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~ 147 (456) T protein:vir:10 68 ADRIIPNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQP 147 (456) T ss_pred HhhhccCCeecCCCCCcchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeEEEEcCCCC Confidence 99999999999765 3455566788885 889999999999999999999999999999999999999999999998764 Q ss_pred ceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEc-CCceeecccccccccccccccccccccccccccCCcccEEE Q lcl|NC_013644. 161 LQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAE-DNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYR 239 (510) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~ 239 (510) ......++++...++. ..+..+|.+.++..|... ...+......... ...........+|.+|+|||++ T Consensus 148 ~~~~~~i~~~~~~d~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~pvv~ 217 (456) T protein:vir:10 148 WRIRAAMRWWRDLDAE----SDFAIVWSGDGWQKFARPCFVQSSSRRRLVTR------ISDSWVPVGDAVVTGSPPPVVV 217 (456) T ss_pred cceEEEEEEEEecCCc----eeEEEEEeccceeEEEEEEEEeecccceeeee------cCCceeeccccCCCCCceeEEE Confidence 4333333333333322 233445554443333211 1100000000000 0011122244689999999987 Q ss_pred ecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCch-----hhh---h--HhhhcCeeeeccCCCc Q lcl|NC_013644. 240 LSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDL-----SKL---R--QNVKSKKVVGTGSDGG 309 (510) Q Consensus 240 ~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~-----~~~---~--~~~~~~~~~~~~~~~~ 309 (510) | +|++|.|+|+++++|||+||.++|++++..+++++|++++.|+..... +.. . .....+.++..+++++ T Consensus 218 ~-~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~~~~~ 296 (456) T protein:vir:10 218 Y-QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAAPGALWELPPGVD 296 (456) T ss_pred e-cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhhccccccCCCCcc Confidence 7 678999999999999999999999999999999999999999753321 100 0 0112234555566665 Q ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 310 LDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDI 388 (510) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~ 388 (510) +..++ ..+.+.+...++.+..+|+..+++|+..+++ ++|+||+||++++.+|.+||..|++.|+++|++++++++.+. T Consensus 297 ~~q~~-~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~ 375 (456) T protein:vir:10 297 IWESQ-ANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIE 375 (456) T ss_pred eEEec-ccChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 54443 3466788888888888888889999888765 578999999999999999999999999999999999987654 Q ss_pred hhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHH-HHHh Q lcl|NC_013644. 389 NRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKE-ALEE 467 (510) Q Consensus 389 ~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~-~~~~ 467 (510) +. .+...++++|+++.|.|.++.++++++++++|++|.+++++++++.++ +.++.. +++...+. .+.. T Consensus 376 g~-----~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~-~i~~~e-----~er~~~e~~~~~~ 444 (456) T protein:vir:10 376 GE-----SVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNAD-QIKQDD-----LDRAREQITLFAG 444 (456) T ss_pred CC-----CcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHH-HHHHHH-----HHHHHHHHHHHhh Confidence 32 234578999999999999999999999999999999999998876543 221111 11111111 1111 Q ss_pred hhccCCCCCCCCCcccC Q lcl|NC_013644. 468 AEYTKGLSDNTDEEETA 484 (510) Q Consensus 468 ~~~~~~~~~~~~~~~~~ 484 (510) .+... ++.+.+. T Consensus 445 ~~~~~-----~~~~~~~ 456 (456) T protein:vir:10 445 NPVQR-----PQEDGSR 456 (456) T ss_pred hhhhc-----CCCCCCC Confidence 10000 0000000 No 53 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=100.00 E-value=1.2e-70 Score=403.92 Aligned_cols=440 Identities=7% Similarity=-0.046 Sum_probs=303.9 Q ss_pred ccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceecccccccccccc-ccceeccchhHHHHHHH Q lcl|NC_013644. 4 LLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYA-SNVRIPHGFFPEIVDQK 82 (510) Q Consensus 4 ~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~-~~~ki~~n~~~~Iv~~~ 82 (510) |-...++ +++..++.+|. .++.++.++++||+|+|+|.+..+.. ....+ .++|+++||+++||++. T Consensus 1 ~~~~t~~---~~~~~l~~~~~--~~~~r~~~l~~Yy~g~~~i~~~~~~~--------~~~~~~~~~k~~~n~~~~ivd~~ 67 (456) T protein:vir:10 1 MTASTPA---EWLPVLTKRID--DGMSRVRLLARYSNGDAPLPELTRNT--------SAAWRSFQREARTNWGLMVRDSV 67 (456) T ss_pred CCCCCHH---HHHHHHHHHHH--HHHHHHHHHHHHHhcCCCchhcCccc--------ChhhhhhhhhhhcchHHHHHHHH Confidence 2222232 44566666664 34567899999999999875543321 12222 35789999999999999 Q ss_pred HhhhhcCCceeccC-cHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCCC Q lcl|NC_013644. 83 TQYLLSNPVEYETE-NEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYNE 160 (510) Q Consensus 83 ~~~l~g~p~~~~~~-d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~~ 160 (510) ++|++|+|+++.++ +.+..+.++++|+ |+++....+++++++++|+||+++|.|++|.+++++++|.+++++||+... T Consensus 68 ~~~l~~~~~~~~~~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~ 147 (456) T protein:vir:10 68 ADRIIPNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQP 147 (456) T ss_pred HhhhccCCeecCCCCCcchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeEEEEcCCCC Confidence 99999999999765 3455566788885 889999999999999999999999999999999999999999999998764 Q ss_pred ceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEc-CCceeecccccccccccccccccccccccccccCCcccEEE Q lcl|NC_013644. 161 LQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAE-DNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYR 239 (510) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~ 239 (510) ......++++...++. ..+..+|.+.++..|... ...+......... ...........+|.+|+|||++ T Consensus 148 ~~~~~~i~~~~~~d~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~pvv~ 217 (456) T protein:vir:10 148 WRIRAAMRWWRDLDAE----SDFAIVWSGDGWQKFARPCFVQSSSRRRLVTR------ISDSWVPVGDAVVTGSPPPVVV 217 (456) T ss_pred cceEEEEEEEEecCCc----eeEEEEEeccceeEEEEEEEEeecccceeeee------cCCceeeccccCCCCCceeEEE Confidence 4333333333333322 233445554443333211 1100000000000 0011122244689999999987 Q ss_pred ecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCch-----hhh---h--HhhhcCeeeeccCCCc Q lcl|NC_013644. 240 LSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDL-----SKL---R--QNVKSKKVVGTGSDGG 309 (510) Q Consensus 240 ~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~-----~~~---~--~~~~~~~~~~~~~~~~ 309 (510) | +|++|.|+|+++++|||+||.++|++++..+++++|++++.|+..... +.. . .....+.++..+++++ T Consensus 218 ~-~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~~~~~ 296 (456) T protein:vir:10 218 Y-QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAAPGALWELPPGVD 296 (456) T ss_pred e-cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhhccccccCCCCcc Confidence 7 678999999999999999999999999999999999999999753321 100 0 0112234555566665 Q ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 310 LDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDI 388 (510) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~ 388 (510) +..++ ..+.+.+...++.+..+|+..+++|+..+++ ++|+||+||++++.+|.+||..|++.|+++|++++++++.+. T Consensus 297 ~~q~~-~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~ 375 (456) T protein:vir:10 297 IWESQ-ANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIE 375 (456) T ss_pred eEEec-ccChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 54443 3466788888888888888889999888765 578999999999999999999999999999999999987654 Q ss_pred hhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHH-HHHh Q lcl|NC_013644. 389 NRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKE-ALEE 467 (510) Q Consensus 389 ~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~-~~~~ 467 (510) +. .+...++++|+++.|.|.++.++++++++++|++|.+++++++++.++ +.++.. +++...+. .+.. T Consensus 376 g~-----~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~-~i~~~e-----~er~~~e~~~~~~ 444 (456) T protein:vir:10 376 GE-----SVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNAD-QIKQDD-----LDRAREQITLFAG 444 (456) T ss_pred CC-----CcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHH-HHHHHH-----HHHHHHHHHHHhh Confidence 32 234578999999999999999999999999999999999998876543 221111 11111111 1111 Q ss_pred hhccCCCCCCCCCcccC Q lcl|NC_013644. 468 AEYTKGLSDNTDEEETA 484 (510) Q Consensus 468 ~~~~~~~~~~~~~~~~~ 484 (510) .+... ++.+.+. T Consensus 445 ~~~~~-----~~~~~~~ 456 (456) T protein:vir:10 445 NPVQR-----PQEDGSR 456 (456) T ss_pred hhhhc-----CCCCCCC Confidence 10000 0000000 No 54 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=100.00 E-value=4.4e-69 Score=395.34 Aligned_cols=439 Identities=8% Similarity=-0.027 Sum_probs=304.7 Q ss_pred ccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceecccccccccccc-ccceeccchhHHHHHHH Q lcl|NC_013644. 4 LLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYA-SNVRIPHGFFPEIVDQK 82 (510) Q Consensus 4 ~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~-~~~ki~~n~~~~Iv~~~ 82 (510) |-..+++.+ +..+|++|. .++.+++++++||+|+|+|.+..+.. ....+ .++++++||+++||++. T Consensus 1 ~~~~t~~~~---~~~l~~~~~--~~~~r~~~l~~Yy~g~~~i~~~~~~~--------~~~~~~~~~~~~~n~~~~ivd~~ 67 (456) T protein:vir:79 1 MTASTPAEW---LPVLTKRID--DGMSRVRLLARYSNGDAPLPELTRNT--------SAAWRSFQREARTNWGLMVRDSV 67 (456) T ss_pred CCCCCHHHH---HHHHHHHHH--HHHHHHHHHHHHHhccCChhhcCccc--------ChhhchhhhhhhcchHHHHHHHH Confidence 444444444 455555553 34556899999999999986543221 12222 34568899999999999 Q ss_pred HhhhhcCCceeccCc-HHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCCC Q lcl|NC_013644. 83 TQYLLSNPVEYETEN-EELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYNE 160 (510) Q Consensus 83 ~~~l~g~p~~~~~~d-~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~~ 160 (510) ++|++|+|+++.+++ .+..+.++++|+ |+++..+.+++++++++|+||+++|.+++|.+++++++|.+++++||+... T Consensus 68 ~~~l~~~g~~~~~~~d~~~~~~~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~~i~~~~p~~~~~i~d~~~~ 147 (456) T protein:vir:79 68 ADRIIPNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQP 147 (456) T ss_pred HhhhccCCeecCCCCCccHHHHHHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEeccceeEEEEcCCCC Confidence 999999999987654 455677888885 889999999999999999999999999999999999999999999997654 Q ss_pred --ceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEE Q lcl|NC_013644. 161 --LQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFY 238 (510) Q Consensus 161 --~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv 238 (510) +.+++++| ...+. ...+..+|++.++++|................. ...........+|.+|+|||+ T Consensus 148 ~~~~~~~~~~--~~~d~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~pvv 216 (456) T protein:vir:79 148 WRIRSAMRWW--RDLDA----ESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTR-----ISDSWVPVGDAVVTGSPPPVV 216 (456) T ss_pred CceEEEEEEE--EecCC----ceeEEEEEcCCceEEEEEEEEeeccccceeeec-----cCCceeecccccCCCCceeEE Confidence 33444443 23222 244566777777666543211111000000000 000112223468999999999 Q ss_pred EecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh-----hh---hH-h-hhcCeeeeccCCC Q lcl|NC_013644. 239 RLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS-----KL---RQ-N-VKSKKVVGTGSDG 308 (510) Q Consensus 239 ~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~-----~~---~~-~-~~~~~~~~~~~~~ 308 (510) +| +|+.|.|+|+++++|||+||.++|++++.++++++|++++.|+...... .. .. . ...+.++..+++. T Consensus 217 ~~-~N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i~~~~~~~~~~~~~~~~~~~~ 295 (456) T protein:vir:79 217 VY-QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAIDYASIFEAAPGALWELPPGV 295 (456) T ss_pred Ee-cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCcccccccccccccchhhhhhhhccccccCCCCc Confidence 98 5789999999999999999999999999999999999999997543211 10 01 1 1123344555555 Q ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 309 GLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDD 387 (510) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~ 387 (510) ++..+ .+.+.+.+...++.+..+|+..+++|+..+++ ++|+||+||++++.+|.+||..+++.|+++|++++++++.+ T Consensus 296 ~~~q~-~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~ 374 (456) T protein:vir:79 296 DIWES-QTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQI 374 (456) T ss_pred ceeee-cccChHHHHHHHHHHHHHHHhhcCCChhHhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 54332 23456777777888888888888888887765 57899999999999999999999999999999999998876 Q ss_pred HhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHH-HHH Q lcl|NC_013644. 388 INRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKE-ALE 466 (510) Q Consensus 388 ~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~-~~~ 466 (510) .+. .+...++++|+++.|.|.++.++++++++++|++|.+++++++++..+ +.++. ++++...+. .+. T Consensus 375 ~g~-----~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~G~~~~~~~~~~lg~~~~-~i~~~-----e~~r~~~e~~~~~ 443 (456) T protein:vir:79 375 EGE-----SVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNAD-QIKQD-----DLDRAREQITLFA 443 (456) T ss_pred cCC-----CccccceEEeCCCCCcCHHHHHHHHHHHHhcCCChHHHHHhcCCCCHH-HHHHH-----HHHHHHHHHHHHh Confidence 432 234578999999999999999999999999999999999988766443 22211 111111111 111 Q ss_pred hhhccCCCCCCCCCcccC Q lcl|NC_013644. 467 EAEYTKGLSDNTDEEETA 484 (510) Q Consensus 467 ~~~~~~~~~~~~~~~~~~ 484 (510) ..+. ..++.+.+. T Consensus 444 ~~~~-----~~~~~~~~~ 456 (456) T protein:vir:79 444 GNPV-----QRPQEDGSR 456 (456) T ss_pred hhHh-----hcCCCCCCC Confidence 1110 000000000 No 55 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=100.00 E-value=1.6e-66 Score=381.26 Aligned_cols=453 Identities=11% Similarity=0.049 Sum_probs=319.1 Q ss_pred ccCCChhh---------hHHHHHHHHHhhh---hhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceec Q lcl|NC_013644. 4 LLSEDVKI---------IANALKAAIDKDR---KSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIP 71 (510) Q Consensus 4 ~~~~~~~~---------~~~~i~~~i~~~~---~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~ 71 (510) |+-.-... ..+.|++.+++.. +..++.++.++++||+|+|+++.+...... ...+.++|++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~-------~~~~~~~~~~ 73 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHN-------GNPVNRRQLS 73 (496) T ss_pred ChhHHHHHHHHHHHHhccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccC-------CCccccceee Confidence 10000000 1122333443322 445667899999999999998876543322 2233456889 Q ss_pred cchhHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccc Q lcl|NC_013644. 72 HGFFPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLN 150 (510) Q Consensus 72 ~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~ 150 (510) +||++.|+++.++||+|+||+++++++..++.|+++++ |+|..++.+++..++++|.+|+++|+|++|++++.+++|.+ T Consensus 74 ~n~~k~i~~~~a~~l~~~p~~i~~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~~~~ 153 (496) T protein:vir:38 74 MNLPKVTAKYMSKLLFNEKVKINIDDKAAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFATADC 153 (496) T ss_pred cchHHHHHHHHhhhhhCCcceEeeCChHHHHHHHHHHhccCHHHHHHHHHHHHhhhCcEEEEEEEcCCCcEEEEEEcccc Confidence 99999999999999999999999999999999999996 68999999999999999999999999999999999999999 Q ss_pred eEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCc----EEE--EEEcCCceeecccccccccccccccccccc Q lcl|NC_013644. 151 VFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQN----VYF--FVAEDNKDYELDEAEPINPRPHVLAVDSEN 224 (510) Q Consensus 151 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~----i~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (510) +||+|++++++..++.++.+.. + ...+++++.|+... |.+ |+...+. ..+ .+.+......... T Consensus 154 ~~P~~~~~~~~~~~~f~~~~~~-~---~~~y~~le~h~~~~~~~~I~~~~y~~~~~~--~~g-----~~v~~~~~~~~~~ 222 (496) T protein:vir:38 154 MYPLSNDSENVDECVIANSFHK-N---NKYYTLLEWNEWQGDVYTVTTELYQSDDPN--ELG-----TKVSLTLLFDDIE 222 (496) T ss_pred eEEEEecCCcEEEEEEEEEEEe-C---CeEEEEEEEEEEeCceEEEEEEEEecCCcc--ccC-----ccccccccccccc Confidence 9999988877776665543322 2 23556677776322 111 2222111 111 1111111111122 Q ss_pred cccccccCCcccEEEecCC---------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeE-------EecCCCCc Q lcl|NC_013644. 225 ESLLQRSYGQIPFYRLSNN---------KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYV-------VSGFQGDD 288 (510) Q Consensus 225 ~~~~~~~~g~iPvv~~~nn---------~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv-------~~g~~~~~ 288 (510) .....+++.++|+++|+++ +.|.|+|+++++|||+||.++|++++.++.+..++++ ..+..+.. T Consensus 223 ~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~ 302 (496) T protein:vir:38 223 PVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGST 302 (496) T ss_pred cceeecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhhcccceecchHHhhccCCCCCcc Confidence 2344567788999998764 4589999999999999999999999999988777776 22222322 Q ss_pred hhhhhHhhhcCeeeeccCCC---ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccc--ccccCcccHHHHHHHHHHHH Q lcl|NC_013644. 289 LSKLRQNVKSKKVVGTGSDG---GLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDST--QVGDGNITNIVIKARYTLLN 363 (510) Q Consensus 289 ~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~--~~~~g~~Sg~Ai~~~~~~l~ 363 (510) ...+..+.+.+.++....++ .++.++.++..+++...++.+.+.|...++.++.. +...|++||.|++++++.|. T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~l~ 382 (496) T protein:vir:38 303 TQYFDSTDEAFFLYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETY 382 (496) T ss_pred ccCCCCccceEEEeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHHHHH Confidence 22233333333333333222 35666677888999999999999999888888754 44567789999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh---hccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC Q lcl|NC_013644. 364 MKANKTEARLRALLEWMNKLVIDDIN---RRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR 440 (510) Q Consensus 364 ~k~~~k~~~~~~~l~~~~~~i~~~~~---~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~ 440 (510) +++..|++.|+++|++++++++.+.. ...+..++..+++++|++++|.|+++.++++++++++|+||++|++..+|+ T Consensus 383 ~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~GiiS~et~l~~~~~ 462 (496) T protein:vir:38 383 QTKNSHSQLIEQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAKNQGMIPLKIALQRAWN 462 (496) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCC Confidence 99999999999999999999987643 233344566689999999999999999999999999999999999999999 Q ss_pred CCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCc Q lcl|NC_013644. 441 LDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEE 481 (510) Q Consensus 441 v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (510) ++++++++++++.+++.. .+.+..+..+..++++ T Consensus 463 ~~d~ea~~el~ri~~E~~-------~~~~~~d~~~~~~~~e 496 (496) T protein:vir:38 463 ITEAEADEWAEMLAKEKQ-------AEMPNNDMNGIFGEEE 496 (496) T ss_pred CChHHHHHHHHHHHHhhh-------ccCccccccCCCCCCC Confidence 998876654444333211 1111111111111111 No 56 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=100.00 E-value=1.4e-65 Score=376.18 Aligned_cols=463 Identities=10% Similarity=0.055 Sum_probs=317.1 Q ss_pred CCCccCCC----------hhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceecccccccccccccccee Q lcl|NC_013644. 1 MEALLSED----------VKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRI 70 (510) Q Consensus 1 ~~~~~~~~----------~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki 70 (510) |.-|.+.. ++.-..+|..++..|. .++.++.++++||+|+|.+.+.... .....+ ++++ T Consensus 1 ~~~~~~~~~~~~~~~~~l~~~e~~~i~~L~~~~~--~~~~r~~~l~~YY~G~~~i~~~~~~--------~p~~~~-~~~~ 69 (504) T protein:vir:99 1 MTEETTSASKFTFRIPELNDDVVDKVNGLYQQLV--DRTPRNLLRASFYDGKYAIRQIGNL--------IPPEYL-RTAT 69 (504) T ss_pred CCccCCcccccccccCCCCHHHHHHHHHHHHHHH--HHhHHHHHHHHHHhccccchhcccc--------ccHHHH-HHhh Confidence 55443222 1222345777777774 3446789999999999987543321 111222 4567 Q ss_pred ccchhHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCc--eEEEEEc Q lcl|NC_013644. 71 PHGFFPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDR--LCFQVAD 147 (510) Q Consensus 71 ~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~--~~i~~~~ 147 (510) ++||+++||++.++++..++++.. +++.....|+++|+ |+++....++++++++|||||++||.+++|+ ++|++++ T Consensus 70 v~n~~~~iVd~~a~rl~~~Gf~~~-d~~~~~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I~~~s 148 (504) T protein:vir:99 70 VLGWSAKAVDTLARRCNLESFVWP-DGDYGSIGGPDVWDENFFATKANNAMVSSLIHGPAFLINTEGGAGEPDSLIHVKS 148 (504) T ss_pred ccCcHHHHHHHHHhhhccceeeCC-CCChhhHHHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEec Confidence 899999999999999998887754 45555667888885 8999999999999999999999999998876 5688999 Q ss_pred ccceEEEEcCCCC-ceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccc Q lcl|NC_013644. 148 SLNVFGVYNEYNE-LQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENES 226 (510) Q Consensus 148 p~~~~~~~d~~~~-~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (510) |.++|++||+..+ +.+++++|.. ..+ .....+++|+++.+++|....++.+.. + T Consensus 149 P~~~~~iyD~~~~~~~~a~~~~~~--d~~---g~~~~~~~y~~~~~~~~~~~~~~~~~~--------------------~ 203 (504) T protein:vir:99 149 AMQATGEWNSRRNAMDSLLSITSR--DAE---GHPTGIALYEDGVTVTADMDDDGDWHA--------------------D 203 (504) T ss_pred cceeEEEEeCCCCceeEEEEEEEe--cCC---CeEEEEEEEcCCcEEEEEEcCCceeee--------------------c Confidence 9999999997554 3444433332 222 245678999999999998765543321 2 Q ss_pred cccccCCcccEEEecCC-----CCCCCcHH-HHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh----hhh--H Q lcl|NC_013644. 227 LLQRSYGQIPFYRLSNN-----KQETTDLK-PIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS----KLR--Q 294 (510) Q Consensus 227 ~~~~~~g~iPvv~~~nn-----~~g~sd~~-~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~----~~~--~ 294 (510) ..+|++| ||||+|.|+ .+|.|++. .|++|+|++|+++|++++..+++++|++++.|+...+.. ... - T Consensus 204 ~~~~~~g-vPvV~~~n~~~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~G~~~~~~~~~d~~~~~~~ 282 (504) T protein:vir:99 204 VRTHKLG-VPVEVLPYKPREDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILLGADAKNFRNKDGSMKPAW 282 (504) T ss_pred cccCCCC-cceEEecccccCccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhccCCccccccccccccchh Confidence 4579998 899999987 36888874 899999999999999999999999999999998654321 111 1 Q ss_pred hhhcCeeeeccCC--------CceeEEeec-CCHHHHHHHHHHHHHHHHHHhCCccccccc---cCcccHHHHHHHHHHH Q lcl|NC_013644. 295 NVKSKKVVGTGSD--------GGLDVKTVT-IPTEGRKTKMEIDKENIYKFGMAFDSTQVG---DGNITNIVIKARYTLL 362 (510) Q Consensus 295 ~~~~~~~~~~~~~--------~~~~~~~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~---~g~~Sg~Ai~~~~~~l 362 (510) .....+++.++++ .++++.+.+ .+.+.+...++.+..+|...|++|..+++. .+++||+||++++.+| T Consensus 283 ~~~~~~i~~~~~~~~~~~~~~~~~~~~q~~~~~l~~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L 362 (504) T protein:vir:99 283 QIALARVFALPDDEDEPDAARARADVKQFPASSPQPHIEMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIASREDL 362 (504) T ss_pred hhhhhhhhcCCCccccccccCccceeeecCCCChHHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHH Confidence 1222345555543 235665544 345555666666666666668899877653 3568999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCC---chHHHHHhCC Q lcl|NC_013644. 363 NMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKI---ILESILQVAP 439 (510) Q Consensus 363 ~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~i---S~et~~~~~~ 439 (510) .+|+.+|++.|+.+|++++++++.+.........+...++++|.++.+.+.++.++++++++++|.+ +.++++++++ T Consensus 363 ~~ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~~~~~~~~v~w~d~~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg 442 (504) T protein:vir:99 363 IAEAEGATDDWSPAFRRSMIRALAIKNGLDRIPPEWKTIDSKFRSPLYLSKAAQADAGAKMLGAGPEWLKETEVGLELLG 442 (504) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceeEecCCCccCHHHHHHHHHHHHhhccccccchHHHHhhcC Confidence 9999999999999999999999888776555555667899999999999999999999999999852 3578999996 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 440 RLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 440 ~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) + ++++++++++++.+.+......++........ +++++.+.+...+ ..++.+ +.++.-.| -| T Consensus 443 ~-~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~-~~~~~~~~~~~e~---a~~~~~-~~~~~p~~---~~ 504 (504) T protein:vir:99 443 L-TPQQAKRALAERRRASSVSIIEALNRRQQEAA-TAGEDQDQGAGEP---PANEPP-AALGRPTL---VG 504 (504) T ss_pred C-CHHHHHHHHHHHHHHhhHHHHHHHhcccCCCC-CCCCCCCcCCCCC---CCCCCC-ccCCCccc---CC Confidence 5 55566655555544333333333322111111 1111111111111 000111 00111111 12 No 57 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=100.00 E-value=5.3e-65 Score=372.94 Aligned_cols=456 Identities=10% Similarity=0.018 Sum_probs=320.9 Q ss_pred ChhhhHHHHHHHHHh-------------h---hhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceec Q lcl|NC_013644. 8 DVKIIANALKAAIDK-------------D---RKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIP 71 (510) Q Consensus 8 ~~~~~~~~i~~~i~~-------------~---~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~ 71 (510) =++.+..+|+..+.+ - .+...+.++.++++||+|+|+.+.+..... ....++++|++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~-------~~~~~~~~~~s 73 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEH-------NGNPVNRRQLS 73 (499) T ss_pred ChhHHHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhcccccc-------CCCccccceee Confidence 112222233333322 1 134566789999999999998776543321 23344567899 Q ss_pred cchhHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccc Q lcl|NC_013644. 72 HGFFPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLN 150 (510) Q Consensus 72 ~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~ 150 (510) +|+++.||++.|+||||+|++++++++..++.|+++++ |+|...+.+++..|+++|.+|+++|+|++|+++|.+++|.+ T Consensus 74 ~n~~~~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~a~~ 153 (499) T protein:vir:80 74 MNLPKVTAKYMSKLLFNEKVKINIDDETAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFATADC 153 (499) T ss_pred cchHHHHHHHHHHhhhCCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEEEEECCCCcEEEEEEcCCc Confidence 99999999999999999999999999999999999996 77999999999999999999999999999999999999999 Q ss_pred eEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCc--EEEEEEcCCceeecccccccccccccccccccccccc Q lcl|NC_013644. 151 VFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQN--VYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLL 228 (510) Q Consensus 151 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~--i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 228 (510) +||+|.+++++..++.++.... + ....+++|.|+... ...|......+..........+.+.........+... T Consensus 154 ~~Pi~~d~~~~~~~~f~~~~~~-~---~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~ 229 (499) T protein:vir:80 154 MYPLSNDSENVDECLIANSFHK-N---NKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVP 229 (499) T ss_pred eEEEEecCCCeEEEEEEEEEee-c---CeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCcCCcee Confidence 9999877777776665443332 2 23455566654321 1122211111110000000111111111111222333 Q ss_pred cccCCcccEEEecCC---------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeE-------EecCCCCchhhh Q lcl|NC_013644. 229 QRSYGQIPFYRLSNN---------KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYV-------VSGFQGDDLSKL 292 (510) Q Consensus 229 ~~~~g~iPvv~~~nn---------~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv-------~~g~~~~~~~~~ 292 (510) ..+++++|+++|+++ +.|+|+|+++++|||+||.++|+++++++....+++| ..+.++.....+ T Consensus 230 ~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~g~~~~~~ 309 (499) T protein:vir:80 230 LPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGSTTQYF 309 (499) T ss_pred ecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhcccceecchhhhhccCCCCCCcccCC Confidence 457889999999764 4589999999999999999999999999998888887 333344433444 Q ss_pred hHhhhcCeeeeccCC-C--ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccc--ccccCcccHHHHHHHHHHHHHHHH Q lcl|NC_013644. 293 RQNVKSKKVVGTGSD-G--GLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDST--QVGDGNITNIVIKARYTLLNMKAN 367 (510) Q Consensus 293 ~~~~~~~~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~--~~~~g~~Sg~Ai~~~~~~l~~k~~ 367 (510) ..+.+.+..+....+ + .++.++.++..+++...++.+.+.|...++.+... +...|+.||++++++++.+.+++. T Consensus 310 ~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l~~~~~ 389 (499) T protein:vir:80 310 DSTDEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKN 389 (499) T ss_pred CcccceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHHHHHHHHH Confidence 444444544443322 2 36667778899999999999999999888877644 445677899999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhh---ccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcH Q lcl|NC_013644. 368 KTEARLRALLEWMNKLVIDDINR---RYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDD 444 (510) Q Consensus 368 ~k~~~~~~~l~~~~~~i~~~~~~---~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~ 444 (510) .|++.|+.+|++++++|+.+... ..+...+..+++|.|++++|.|..+.++++.+++++|+||++|++..+++++++ T Consensus 390 ~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d~ 469 (499) T protein:vir:80 390 SHSQLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYTTAKNQGMIPLKIALQRAWNITEA 469 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHhhcCCCChH Confidence 99999999999999999976543 233445667899999999999999999999999999999999999999999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCc Q lcl|NC_013644. 445 NVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEE 481 (510) Q Consensus 445 e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (510) ++++++++.+++. +. ..+.++..+..++++ T Consensus 470 ea~~el~~i~~E~------~~-~~~~~d~~g~~ge~e 499 (499) T protein:vir:80 470 EADEWAEMLAKEK------QA-EIPNNDMTGIFGEEE 499 (499) T ss_pred HHHHHHHHHHHHh------hc-CCCCCCccccCCCCC Confidence 7665554443321 11 111111111111111 No 58 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=100.00 E-value=2.6e-64 Score=369.21 Aligned_cols=409 Identities=11% Similarity=0.019 Sum_probs=273.4 Q ss_pred chhcccceeccccccccccccccce-eccchhHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHH Q lcl|NC_013644. 45 IMNNRIFYVDDEGILREDKYASNVR-IPHGFFPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEG 122 (510) Q Consensus 45 i~~~~~~~~~~~~~~~~~~~~~~~k-i~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~ 122 (510) ++. +...+..++.+| .+.||+++||++.++++.+++++ ++|.+..+.++++|+ |+++..+.+++++ T Consensus 1 ~l~----------~~~~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~gf~--~~d~~~~~~~~~i~~~N~~d~~~~~~~~~ 68 (434) T protein:vir:98 1 MLP----------KNAEQAFLDFQRKARTNFCGLIANASVHRLLALGVT--GPDGEPDTRASRWWQANRLDSRQKLVWRM 68 (434) T ss_pred CCC----------CCccHHHHHhhhhhhccchHHHHHHHHhhhccCcee--cCCCchHHHHHHHHHhcChhHHHHHHHHH Confidence 111 111233344443 57899999999999999877754 667777888888885 8999999999999 Q ss_pred HHhcCeEEEEEEECCCC-------ceEEEEEcccceEEEEcCCCC-ceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEE Q lcl|NC_013644. 123 SSQKGFEYVYARTNAED-------RLCFQVADSLNVFGVYNEYNE-LQRICRHYITEIEKDGETVDIHHAEVWTDQNVYF 194 (510) Q Consensus 123 ~~~~G~~~~~v~~d~~g-------~~~i~~~~p~~~~~~~d~~~~-~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~ 194 (510) ++++|+||++||.++++ .+.|++++|.+++++||+..+ +..++++| ....++. ....+.+|+...+++ T Consensus 69 a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~--~~~~~~~--~~~~~~~~~~~~~~~ 144 (434) T protein:vir:98 69 AMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVW--HNDIDGF--GYARVFFDDTSFPYR 144 (434) T ss_pred HhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEE--EeccCCc--eEEEEEEeCcEEEEE Confidence 99999999999987644 467999999999999997654 44444443 3333222 222233333334333 Q ss_pred EEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCC----CCCCCcHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 195 FVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNN----KQETTDLKPIKALIDDYDLMNCFLSNN 270 (510) Q Consensus 195 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn----~~g~sd~~~v~~liD~~n~~~S~~~~~ 270 (510) +....++...... .............+|+||+||||+|+|| .+|+|+|+++++|||+||+++|++++. T Consensus 145 ~~~~~~~~~~~~~--------~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~~g~sd~e~vi~liDa~~~~~s~~~~~ 216 (434) T protein:vir:98 145 TRERTGARLPWGP--------DSWVYTGTADSGDVHDLGGMQLVEFARMPDLGEDPEPEFAGVLDIQDRVNLGILNRMAA 216 (434) T ss_pred Eeecccccccccc--------ccceecccccccccCCCCccceEEeccCCCcCcCCcchhhhHHHHHHHHHHHHHHHHHH Confidence 3332222111110 0001112233567899999999999998 679999999999999999999999999 Q ss_pred HHHhccceeEEecCCCCchhhh--------hHhhhcCeeeeccCCCceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCcc Q lcl|NC_013644. 271 LQDFAEAIYVVSGFQGDDLSKL--------RQNVKSKKVVGTGSDGGLDVKTVTI-PTEGRKTKMEIDKENIYKFGMAFD 341 (510) Q Consensus 271 ~~~~~~~~lv~~g~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~i~~~s~~p~ 341 (510) ++++++|+++++|+...+..+. .........+...+++++++.+.+. +.+++...++.+..+|+..+++|+ T Consensus 217 ~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~ 296 (434) T protein:vir:98 217 SRFSGFRQKWIKGHKFAKRTDPATGMTVVDQPFVPSPSAVWASEGENTQFGQLDATDLSGFLKEHASDVRDMLTISQTPT 296 (434) T ss_pred HHHhcchhhhhcCCCcccccccccccchhhhhhhccccccccCCCCCceEEEecCcchHHHHHHHHHHHHHHhcccCCCH Confidence 9999999999999765432211 0111122234455667788877543 344444444444555555556665 Q ss_pred ccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHH Q lcl|NC_013644. 342 STQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDE 420 (510) Q Consensus 342 ~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~ 420 (510) ..+++ ++|+||+||++++.+|.+||..|++.|+++|++++++++.+.+. ..+..+++++|+++.|.|.++.++++ T Consensus 297 ~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~g~----~~~~~~~~v~w~~~~~~s~~~~ada~ 372 (434) T protein:vir:98 297 YLYATDLVNISADTIGALDILHVAKVREHIASFSEGLESVLALAAAQAGV----PEDYTEAEVRWANPAHVTMAVKADAA 372 (434) T ss_pred HHhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC----ChhheeeeEEecCCCCCCHHHHHHHH Confidence 55553 36799999999999999999999999999999999998876432 34566899999999999999999999 Q ss_pred HHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCC-CCCCcccCCCCCCcccccccC Q lcl|NC_013644. 421 KTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSD-NTDEEETAVNPDDPTQQMAEG 497 (510) Q Consensus 421 ~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 497 (510) ++++++| +|.+++++++|+.+ +|.+++++++++..... ....+..++... +.++++. . .+| T Consensus 373 ~kl~~~g-~~~e~~~~~lg~~~-~e~~r~~~e~~~~~~~~----~~~~~~~~~~~~g~~~~~~~------~----~dg 434 (434) T protein:vir:98 373 TKLKSIG-YPLDVIAEELDESP-ARVRRIVAGAASQALLA----ASLLPAPGAPSAGNVPDSGG------A----VDG 434 (434) T ss_pred HHHHhcC-CcHHHHHHhCCCCH-HHHHHHHHHHHHHHHHH----HhhhccCCCCCCCCCCcccC------C----CCC Confidence 9999888 59999999999754 44554444433322111 111111111111 1111111 1 111 No 59 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=100.00 E-value=6.5e-61 Score=350.56 Aligned_cols=390 Identities=11% Similarity=0.018 Sum_probs=295.5 Q ss_pred hhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCC Q lcl|NC_013644. 11 IIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNP 90 (510) Q Consensus 11 ~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p 90 (510) +....|..++++.. .++.++.++.+||+|+|++.+.... .....++++|++.||+++||++.++++.-++ T Consensus 1 ~~~~~i~~L~~~~~--~~~~r~~~~~~yY~g~~~~~~~~~~--------~p~~~~~~~~~v~nw~~~iVds~a~rl~~~G 70 (409) T protein:vir:94 1 MTEKGIGYLRFKLS--VHKRRAEMRYDQYAMKYVDRFKGIT--------IPQALSQQYRSILGWCAKGVDSLADRLVFRE 70 (409) T ss_pred CCHHHHHHHHHHHH--HHhHHHHHHHHHhcccCchhhcChh--------hhHHHHHHHhhhcchhHHHHHHhHhhcccCc Confidence 44455555555543 2345688899999999987543221 2223345568889999999999999886665 Q ss_pred ceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCC-CceeEEEEE Q lcl|NC_013644. 91 VEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYN-ELQRICRHY 168 (510) Q Consensus 91 ~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~-~~~~~~~~~ 168 (510) + +++|.. ++++|+ |+++....++++.++++||||++||.+++|+++|++++|.+++.+||+.. ++.++++++ T Consensus 71 f--~~~d~~----l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~~i~~~sp~~~~~i~D~~~~~~~~a~~~~ 144 (409) T protein:vir:94 71 F--ENDDFT----VNEIFEENNPDIFFDSAVLSSLIASCSFTYISKGENDAVRLQVIEAVNATGIIDPITGLLTEGYAVL 144 (409) T ss_pred c--cCCchH----HHHHHHhcChhHHHHHHHHHHHHhcceeEEEecCCCCceEEEEeccceEEEEEecCCCceeeeEEEE Confidence 4 455543 566774 88999999999999999999999999999999999999999999999753 344444333 Q ss_pred EEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCC---- Q lcl|NC_013644. 169 ITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNK---- 244 (510) Q Consensus 169 ~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~---- 244 (510) . .+.. .......+|+++.+++|....+.+. ..+|++|.||+|+|.|++ T Consensus 145 ~---~d~~--~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~~~n~~g~vPvV~f~n~~~~~~ 196 (409) T protein:vir:94 145 E---RDEN--NNVVLEAHFLPDRTDYYYRDSRNNI-----------------------SIANPTGHPLLVPIIHRPDAVR 196 (409) T ss_pred E---ecCC--CceEEEEEEecCcEEEEEecCceeE-----------------------eeeCCCCCcceEEecccccccc Confidence 2 2221 2345567899999998876654432 247999999999999864 Q ss_pred -CCCCcH-HHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCC---CceeEEeec-CC Q lcl|NC_013644. 245 -QETTDL-KPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSD---GGLDVKTVT-IP 318 (510) Q Consensus 245 -~g~sd~-~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~-~~ 318 (510) +|+|++ +.|++|+|++|++++++....+++++|++++.|++.+......-.....+++.++++ .++++.+++ .+ T Consensus 197 ~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~d~dg~~~~v~q~~~~~ 276 (409) T protein:vir:94 197 PFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEPMETWKATVSSMLQFTKDEDGDKPTLGQFTQPS 276 (409) T ss_pred ccCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCcccchhhhhHHHhhcCCCCCCCCCceEEecCCCC Confidence 688988 689999999999999999999999999999999864321111111122446666533 346765554 35 Q ss_pred HHHHHHHHHHHHHHHHHHhCCcccccccc-Cc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCcc Q lcl|NC_013644. 319 TEGRKTKMEIDKENIYKFGMAFDSTQVGD-GN-ITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAF 396 (510) Q Consensus 319 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~-~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~ 396 (510) .+++.+.++.+..+++..|++|...+++. .| +||+||++.+.+|..++.+|++.|+.+|++++++++.+.+....... T Consensus 277 l~~~~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~~~~~ 356 (409) T protein:vir:94 277 MSPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAPYLRE 356 (409) T ss_pred hhHHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccc Confidence 56666666666677777778888877753 45 69999999999999999999999999999999999988776555555 Q ss_pred ccceeeEEeCCCCCCC---HHHHHHHHHHHHhcC--CCchHHHHHhCCCCCcH Q lcl|NC_013644. 397 DPTEVSFTFTREVMVN---ETDIVNDEKTEAETR--KIILESILQVAPRLDDD 444 (510) Q Consensus 397 ~~~~v~i~f~~~~p~d---~~e~~~~~~~~~~~g--~iS~et~~~~~~~v~d~ 444 (510) +..+++++|.+..|.+ .++.++.++|++++| +.+.++++.++++.+++ T Consensus 357 ~~~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 357 QFRKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIEGGE 409 (409) T ss_pred ccccceEEeccCCCcchHHHHHHHHHHHHHHHhcccccchhHHHHHcCCCCCC Confidence 6678999999776666 678889999999999 55779999999987766 No 60 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=100.00 E-value=5.5e-61 Score=350.96 Aligned_cols=391 Identities=10% Similarity=0.006 Sum_probs=296.4 Q ss_pred HHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCCceeccCcHH Q lcl|NC_013644. 20 IDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNPVEYETENEE 99 (510) Q Consensus 20 i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~ 99 (510) ++.| +.++..+.+||+|+|++.+-.. ......++++|++.||+++||++.++++.-++++ .+|.. T Consensus 1 l~~~-----~~r~~~~~~yY~g~~~~~~~~~--------~~p~~~~~~~~~v~nw~~~~Vds~a~rl~~~Gf~--~~d~~ 65 (410) T protein:vir:95 1 MNLY-----QSRVNLRYKHYAMQHYEAPTGI--------TIPAHIRAKYQAVLGWAAKGVDSLADRLIFRAFA--NDDFN 65 (410) T ss_pred CCcc-----hhhHHHHHHHhcCCCCccccch--------hccHHHHhHHHhhcchhHHHHHHhHhhhcccccc--CCCch Confidence 4444 3357788999999998744322 2223445567888999999999999998777654 45443 Q ss_pred HHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCC-CceeEEEEEEEEEeeCCc Q lcl|NC_013644. 100 LKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYN-ELQRICRHYITEIEKDGE 177 (510) Q Consensus 100 ~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~-~~~~~~~~~~~~~~~~~~ 177 (510) ++++|+ |+++....++++.++++||||++||.+++|+++|++++|.+++++||+.. ++.++++++. ..+ T Consensus 66 ----l~~i~~~N~ld~~~~~~~~~al~~G~sf~~v~~~~d~~~~i~~~sP~~~~~i~Dp~~~~~~~al~~~~--~~~--- 136 (410) T protein:vir:95 66 ----VTEIFDRNNPDIFFDSAILSALIGSCSFVYISKGEDDEVRLQVIESSNATGVIDPITGLLVEGYAVLA--RDD--- 136 (410) T ss_pred ----HHHHHhhcChHHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEEEeCCCCceEEEEEEEE--ecC--- Confidence 566674 89999999999999999999999999999999999999999999999743 3444444332 222 Q ss_pred eeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCC-----CCCCcH-H Q lcl|NC_013644. 178 TVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNK-----QETTDL-K 251 (510) Q Consensus 178 ~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~-----~g~sd~-~ 251 (510) ......+.+|+++.+++|....+.+ ..+|++|+||+|+|.|++ +|+|++ + T Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~~~------------------------~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~ 192 (410) T protein:vir:95 137 YNRPTLEAYFEPNATHFIPKDGEPY------------------------SVTNETGIPLLVPVIHRPDAVRPFGRSRITR 192 (410) T ss_pred CCeEEEEEEEeCCcEEEEeeCCccc------------------------cccCCCCCcceEEecccccCCccCCccccch Confidence 2245678899999999887543322 247999999999999863 588987 6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCC---ceeEEeec-CCHHHHHHHHH Q lcl|NC_013644. 252 PIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDG---GLDVKTVT-IPTEGRKTKME 327 (510) Q Consensus 252 ~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~-~~~~~~~~~~~ 327 (510) .|++|+|++|++++++....+++++|++++.|++.+......-.....+++.++++. .+++.+.+ .+.+++.+.++ T Consensus 193 ~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~~~~~~~~~v~q~~~~~l~~~~~~l~ 272 (410) T protein:vir:95 193 AGMYYQKYAKRTLERADITAEFYSWPQKYILGLDPDAEPMEKWKATVSSLLTISSSDKGVKPSVGQFTTASMSPFTEQLR 272 (410) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhcchhheeeccCCCCCcCchhhhhhhhheeccCCCCCCcceEEecCCCChHHHHHHHH Confidence 899999999999999999999999999999998643222111122334566666543 36776654 35667777777 Q ss_pred HHHHHHHHHhCCcccccccc-Cc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEe Q lcl|NC_013644. 328 IDKENIYKFGMAFDSTQVGD-GN-ITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTF 405 (510) Q Consensus 328 ~l~~~i~~~s~~p~~~~~~~-g~-~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f 405 (510) .+..+|...|++|...+++. .| +||+||++...+|..|+.+|++.|+.+|++++++++.+.+..........++++.| T Consensus 273 ~l~~~~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~v~W 352 (410) T protein:vir:95 273 TAAAGFAGEMGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFVRTAVKW 352 (410) T ss_pred HHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccccceeeEEe Confidence 77777777888998888754 45 69999999999999999999999999999999999988766554455566789999 Q ss_pred C---CCCCCCHHHHHHHHHHHHhc--CCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHH Q lcl|NC_013644. 406 T---REVMVNETDIVNDEKTEAET--RKIILESILQVAPRLDDDNVLRLICEQFDLDWE 459 (510) Q Consensus 406 ~---~~~p~d~~e~~~~~~~~~~~--g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~ 459 (510) . ++-..+.++.++.++|+.++ |+++.+++++++++.++++ .+++.++.+...+ T Consensus 353 ~p~~d~~~~s~a~~aDa~~Kl~~a~~g~~~~~~~~~~lg~~~~~~-~~~~~~e~~~~g~ 410 (410) T protein:vir:95 353 EPLFEADANTMTMIGDGVVKLNQALPGYINAETIRDLTGIAGDMS-AKPVVSEGGSNGE 410 (410) T ss_pred eecCCcchhhHHHHHHHHHHHHHhccCCccHHHHHHhcCCChHHH-HHHHHHHHHhCCC Confidence 8 45556899999999999998 7889999999999876543 3333333222211 No 61 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=100.00 E-value=7.5e-61 Score=350.19 Aligned_cols=402 Identities=10% Similarity=0.016 Sum_probs=289.4 Q ss_pred hhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCC Q lcl|NC_013644. 11 IIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNP 90 (510) Q Consensus 11 ~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p 90 (510) +....|..+++++. .++.++.++.+||+|+|++.+.... .....++.+|++.||++++|++.++.+.-++ T Consensus 1 m~~~~i~~L~~~~~--~~~~r~~~~~~yy~g~~~~~~~~~~--------~p~~~~~~~~~v~nw~~~~Vd~~a~rl~~~G 70 (422) T protein:vir:97 1 MNYMGMGYLRRKLA--LFKTGVDKRYRYYAMDDRDDTRSIV--------MPNNVREMYRSVLEWTAKGVDSLADRIIFRE 70 (422) T ss_pred CChHHHHHHHHHHH--HHHHHHHHHHHHHhcCCChhhcCcc--------ccHHHHHHHHhhcchhHHHHHHHHhccccce Confidence 34445555555543 2344688899999999987543322 2233444557778999999999999876666 Q ss_pred ceeccCcHHHHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECC-CCceEEEEEcccceEEEEcCCCCce-eEEEE Q lcl|NC_013644. 91 VEYETENEELKEYLAEYY-NSEFQVVLQELVEGSSQKGFEYVYARTNA-EDRLCFQVADSLNVFGVYNEYNELQ-RICRH 167 (510) Q Consensus 91 ~~~~~~d~~~~~~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~d~-~g~~~i~~~~p~~~~~~~d~~~~~~-~~~~~ 167 (510) ++ ++|.. ++++| .|+++....++++.++++||||++|+.++ +|.++|++++|.+++++||+..+.. +++.+ T Consensus 71 f~--~~d~~----l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p~i~~~sp~~~~~i~D~~~~~~~~a~~~ 144 (422) T protein:vir:97 71 FT--NDDFN----AWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAEDGLPKMQVIEASKATGILDPTTFLLTEGYAI 144 (422) T ss_pred ee--CCchh----HHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCCCCeeEEEEechhhEEEEEeCCCCcceeeEEE Confidence 54 55543 45566 49999999999999999999999999985 6889999999999999999765433 33332 Q ss_pred EEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCC--- Q lcl|NC_013644. 168 YITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNK--- 244 (510) Q Consensus 168 ~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~--- 244 (510) | ....++ ..+.+.+|++..++++.... .+ ...+|++|+||+|+|.|++ T Consensus 145 ~--~~~~~~---~~~~~~~~~~~~~~~~~~~~-~~-----------------------~~~~~~~g~vPvv~~~n~~~~~ 195 (422) T protein:vir:97 145 L--ESDSNG---NPTLEAYFTDKDIWYYPKKG-KP-----------------------YNIKNPTGHPLLVPIIHRPDAV 195 (422) T ss_pred E--EecCCC---cEEEEEEEcCceEEEEcCCC-cc-----------------------ccccCCCCCcceEEecccCCCc Confidence 2 222222 23445566776666655322 11 1247999999999999863 Q ss_pred --CCCCcH-HHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCC---ceeEEeec-C Q lcl|NC_013644. 245 --QETTDL-KPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDG---GLDVKTVT-I 317 (510) Q Consensus 245 --~g~sd~-~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~-~ 317 (510) +|.|++ +.|++|+|++|++++++....+++++|++++.|++.+......-.....+++.++++. .+++.+.+ . T Consensus 196 ~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~de~~~~~~v~q~~~~ 275 (422) T protein:vir:97 196 RPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYVLGMDPDAKPMEKWRATVSTLLEISKDEDGDKPTVGQFTTA 275 (422) T ss_pred cccCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcccCcccccCchhhhhhhhhhccCCCCCCCcceeeecCCC Confidence 688988 7899999999999999999999999999999998643221111112233566665433 36665444 3 Q ss_pred CHHHHHHHHHHHHHHHHHHhCCccccccccC-c-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCc Q lcl|NC_013644. 318 PTEGRKTKMEIDKENIYKFGMAFDSTQVGDG-N-ITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKA 395 (510) Q Consensus 318 ~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g-~-~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~ 395 (510) +.+++.+.++.+..++...|++|..++++.+ | +||+||++++.+|.+|+.+|++.|+.+|++++++++.+.+...... T Consensus 276 ~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~ 355 (422) T protein:vir:97 276 SMAPFMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEFPYLR 355 (422) T ss_pred ChhHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccc Confidence 4566666666666666666788888877643 4 6999999999999999999999999999999999998876654444 Q ss_pred cccceeeEEeCCCCCCC---HHHHHHHHHHHHhc--CCCchHHHHHhCCCCCcHHHHHHHHHHHHHHH Q lcl|NC_013644. 396 FDPTEVSFTFTREVMVN---ETDIVNDEKTEAET--RKIILESILQVAPRLDDDNVLRLICEQFDLDW 458 (510) Q Consensus 396 ~~~~~v~i~f~~~~p~d---~~e~~~~~~~~~~~--g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~ 458 (510) ....+++++|.++.|.+ .++.++.++|++++ |+++.+++++++++.+++.+. ...++.+.+. T Consensus 356 ~~~~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~~~~~~~~-~~~~~~~~d~ 422 (422) T protein:vir:97 356 NQFMDTVIKWEPLFEADANMLTLVGDGAIKLNQAIPGFMDADVIRDLTGVKGADKPI-PAITEVTTDG 422 (422) T ss_pred hhhccceEEEccCCCCChHHHHHHHHHHHHHHhhccccccHHHHHHHcCCCchhHHH-HHHHhhhccC Confidence 55678999999888888 78889999999999 678999999999984433221 1111111111 No 62 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=100.00 E-value=5.4e-58 Score=334.56 Aligned_cols=457 Identities=10% Similarity=0.083 Sum_probs=313.6 Q ss_pred CCC------ccCC--ChhhhHHHHHHHH---HhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccce Q lcl|NC_013644. 1 MEA------LLSE--DVKIIANALKAAI---DKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVR 69 (510) Q Consensus 1 ~~~------~~~~--~~~~~~~~i~~~i---~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~k 69 (510) |-- .+.. ......+.+.+.. +-..+...+.++..+++||+|+|+++++... ....+..++ T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~---------~~~~~~~~~ 71 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNS---------YGDTQKHEL 71 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCcccccccc---------CCCccccce Confidence 100 0000 0000111122211 1222334667788899999999987654321 122333457 Q ss_pred eccchhHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcc Q lcl|NC_013644. 70 IPHGFFPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADS 148 (510) Q Consensus 70 i~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p 148 (510) +++|+++.|+++.|+|+||+|++++++++..+++|+++++ |+|...+.+++..++..|.+++++|+| .|+++|.+++| T Consensus 72 ~slnl~~~i~~~~A~ll~~e~~~i~~~d~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D-~~~~~i~~v~a 150 (505) T protein:vir:79 72 QSVNVTKLASAKLASLIFNEQCQVTVSDETANDFLDDVFQQNDFYTTFEEKLEEWIALGSGCVRPYVD-SGKIKLAWATA 150 (505) T ss_pred eecchHHHHHHHHHhhhcCCCceeecCChHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCeEEEEEEe-CCceEEEEEcC Confidence 8889999999999999999999999999999999999995 679999999999999999999999998 57899999999 Q ss_pred cceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCC----cEEEEEEcCCceeecccccccccccccccccccc Q lcl|NC_013644. 149 LNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQ----NVYFFVAEDNKDYELDEAEPINPRPHVLAVDSEN 224 (510) Q Consensus 149 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~----~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (510) ++++|++.++++...++.+..+...++.....++++|+|+.. .|.+...........+.....+..+. ..... T Consensus 151 d~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~---~~~l~ 227 (505) T protein:vir:79 151 DQVYPLQADTNQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQ---YEGLE 227 (505) T ss_pred CeeEEEEEcCCCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhccc---ccccC Confidence 999999766667666666665666565566667788998732 33222222111111111111111111 11112 Q ss_pred cccccccCCcccEEEecCC---------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecC------C-CCc Q lcl|NC_013644. 225 ESLLQRSYGQIPFYRLSNN---------KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGF------Q-GDD 288 (510) Q Consensus 225 ~~~~~~~~g~iPvv~~~nn---------~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~------~-~~~ 288 (510) +.+...++.+.++++|+++ +.|.|+|++++++||++|.++|+++++++....++.|-..+ . +.. T Consensus 228 ~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~~~ 307 (505) T protein:vir:79 228 PQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQRRLIVPAEWLKTGSSYGGQA 307 (505) T ss_pred cceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcccCCCCccc Confidence 2333356777778887652 35899999999999999999999999999988888872221 1 110 Q ss_pred hhh----hhHhhhcCeeeec-cCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccc--ccccCcccHHHHHHHHHH Q lcl|NC_013644. 289 LSK----LRQNVKSKKVVGT-GSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDST--QVGDGNITNIVIKARYTL 361 (510) Q Consensus 289 ~~~----~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~--~~~~g~~Sg~Ai~~~~~~ 361 (510) ... +..+...+..+.. ++++.++.++.++..+++...++.+.+.|...++.+... +...|..||++++..++. T Consensus 308 ~~~~~~~fd~~~~~y~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~~~ 387 (505) T protein:vir:79 308 SETHPPMFDPDETVYQAMYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNNSQ 387 (505) T ss_pred ccccccCCCccceeeeeccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHHHHHHHHhH Confidence 000 1111111222222 223456777788899999999999999999888877643 445566799999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------CCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchH Q lcl|NC_013644. 362 LNMKANKTEARLRALLEWMNKLVIDDINRRY---------TKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILE 432 (510) Q Consensus 362 l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~---------~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~e 432 (510) |.++++.+++.|+.+|++++++|+.+....+ ..+.+..+++|.|++.++.|..+.++..++++++|++|+| T Consensus 388 l~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~v~~Gi~s~e 467 (505) T protein:vir:79 388 TYQTRSSYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRAADLQAVQAQVMPKK 467 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHH Confidence 9999999999999999999999998754322 2334456799999999999999999999999999999999 Q ss_pred HHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCC Q lcl|NC_013644. 433 SILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTD 479 (510) Q Consensus 433 t~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~ 479 (510) +++..+|+++++++++++++.++++. . ..|...++++ + T Consensus 468 ~~l~~~~~~~eeea~~el~ri~~E~~------~-~~p~~~~~gg--~ 505 (505) T protein:vir:79 468 QFLMRNYGLDEEEADEWLAQIDAENS------T-AEPEFNQFGG--D 505 (505) T ss_pred HHHHhcCCCChHHHHHHHHHHHHhcc------c-cCCCchhccC--C Confidence 99999999998877655554433221 1 1111111111 1 No 63 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=100.00 E-value=3.6e-58 Score=335.54 Aligned_cols=458 Identities=10% Similarity=0.071 Sum_probs=313.7 Q ss_pred CCCccCCChhhhHHHHHH-------------HH---HhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccc Q lcl|NC_013644. 1 MEALLSEDVKIIANALKA-------------AI---DKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKY 64 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~-------------~i---~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~ 64 (510) |- =.+-+..++++ .. +-..+..+..++..+++||+|+|+....... .... T Consensus 1 m~-----~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~---------~~~~ 66 (508) T protein:vir:15 1 MG-----LIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQAS---------DGIK 66 (508) T ss_pred CC-----hHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccC---------CCCc Confidence 11 11111111111 11 1112334667899999999999986543221 1122 Q ss_pred cccceeccchhHHHHHHHHhhhhcCCceecc-CcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceE Q lcl|NC_013644. 65 ASNVRIPHGFFPEIVDQKTQYLLSNPVEYET-ENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLC 142 (510) Q Consensus 65 ~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~-~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~ 142 (510) +...+++.|+++.|++..|+++||+|+++++ +++..+++|+++++ |+|...+.+++..++++|.+++.+|+|. ++++ T Consensus 67 ~~~~~~sln~~~~i~~~~A~lv~~e~~~i~v~~~~~~~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~ 145 (508) T protein:vir:15 67 KKRLKNTINMAKTAARRIASVVFNEKAEIHVKDNNEADKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRPYIDG-NHIK 145 (508) T ss_pred cccceeecchHHHHHHHHHhhhhCCCceEEeCCchHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEeC-CeeE Confidence 2335788999999999999999999999998 55667788999995 7799999999999999999999999985 6799 Q ss_pred EEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEc-----CCcEEEEEEcCCceeeccccccccccccc Q lcl|NC_013644. 143 FQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWT-----DQNVYFFVAEDNKDYELDEAEPINPRPHV 217 (510) Q Consensus 143 i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~-----~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (510) |.+++|++++|+.-+++++..++.++.....++.+...++++|.|+ +..|.+...........+....+...+. T Consensus 146 i~~v~ad~~~P~~~d~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e- 224 (508) T protein:vir:15 146 IAWVRADQFYPLQSNTNDISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPV- 224 (508) T ss_pred EEEEcCCeeEEEEEcCCCeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhccc- Confidence 9999999999975455666666655555555555666778888886 3333333222221111111111111111 Q ss_pred ccccccccccccccCCcccEEEecCC---------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecC---C Q lcl|NC_013644. 218 LAVDSENESLLQRSYGQIPFYRLSNN---------KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGF---Q 285 (510) Q Consensus 218 ~~~~~~~~~~~~~~~g~iPvv~~~nn---------~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~---~ 285 (510) .....+.+...++.++|+++|+++ +.|.|+|++++++||++|.++|+++++++....+++|..++ + T Consensus 225 --~~~l~~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~d 302 (508) T protein:vir:15 225 --YKELAPQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIRLGQKHIAVQPGMLRFD 302 (508) T ss_pred --ccCCCcceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcCC Confidence 111122334467788899998753 45999999999999999999999999999888888874333 3 Q ss_pred CCchhhhhHhhhcCeeeeccCC--CceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccc--cccCcccHHHHHHHHHH Q lcl|NC_013644. 286 GDDLSKLRQNVKSKKVVGTGSD--GGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQ--VGDGNITNIVIKARYTL 361 (510) Q Consensus 286 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~--~~~g~~Sg~Ai~~~~~~ 361 (510) ......+..+.+.+..+..+++ ..++.++.++..+++...++.+.+.|...++.+...+ ...|..||++++...+. T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~s~~~~ 382 (508) T protein:vir:15 303 DEHKPTFDTEQNVYVGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVVSNNSM 382 (508) T ss_pred CCCccccCCCCeeEEeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHHHHHHH Confidence 2222222222222333333333 3477888889999999999999999988887776543 44566799999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc----C-------CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCc Q lcl|NC_013644. 362 LNMKANKTEARLRALLEWMNKLVIDDINRRY----T-------KAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKII 430 (510) Q Consensus 362 l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~----~-------~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS 430 (510) +.+++..|++.|+.+|++++++|+.+....+ + ......+|+|.|+++++.|..+.++..++++++|++| T Consensus 383 ~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~aGi~s 462 (508) T protein:vir:15 383 TYQTRSSYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVLAIGALS 462 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHHhcCCCC Confidence 9999999999999999999999998765321 1 1123456899999999999999999999999999999 Q ss_pred hHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCC Q lcl|NC_013644. 431 LESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDD 489 (510) Q Consensus 431 ~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (510) +++++..+|+++++|+++++++.++++.. .+..+ +.....+++.|+ T Consensus 463 ~e~~i~~~~g~~deea~~el~ri~~E~~~------------~~~~~-~~~~~~~g~~ge 508 (508) T protein:vir:15 463 KQTFLQRNYGMTDEQAAEELAKIQSEAPT------------DTFEG-GRSAILNGGDGE 508 (508) T ss_pred HHHHHHhcCCCChHHHHHHHHHHHHhccc------------cCccc-cccccCCCCCCC Confidence 99999999999988776555544432210 01111 111111111111 No 64 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=100.00 E-value=5.5e-59 Score=339.97 Aligned_cols=390 Identities=10% Similarity=0.015 Sum_probs=293.2 Q ss_pred hhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCC Q lcl|NC_013644. 11 IIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNP 90 (510) Q Consensus 11 ~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p 90 (510) +....|..+++... .++.++.++.+||+|+|.+.+... ......+.++|.+.||+++||++.++++.-++ T Consensus 1 ~~~~~i~~L~~~~~--~~~~r~~~~~~yY~g~~~~~~~~~--------~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~G 70 (409) T protein:vir:16 1 MTEKGIGYLRFKLS--VHKRRAEMRYEQYAMKHVDRFKGI--------TIPQALSQQYRSILGWCAKGVDSLADRLVFRE 70 (409) T ss_pred CCHHHHHHHHHHHH--HHhHHHHHHHHHHhccCchhhcch--------hhhHHHHHHHhhhcChhHHHHHHhHhhccccc Confidence 44455555555442 344578889999999997644322 12233445567889999999999999886665 Q ss_pred ceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCC-CceeEEEEE Q lcl|NC_013644. 91 VEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYN-ELQRICRHY 168 (510) Q Consensus 91 ~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~-~~~~~~~~~ 168 (510) + +++|+. ++++|+ |+++....++++.++++||||++|+.+++|+++|++++|.+++++||+.. ++.+++++| T Consensus 71 f--~~~d~~----l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~~i~~~sP~~~~~i~D~~~~~~~~a~~~~ 144 (409) T protein:vir:16 71 F--ENDDFT----VNEIFEENNPDIFFDSTVLSALIASCSFTYISKGENDAVRLQVIEATNATGIIDPITGLLTEGYAVL 144 (409) T ss_pred c--cCcchH----HHHHHHhcChhHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEEeecccccceeeeEEE Confidence 4 455543 566774 89999999999999999999999999999999999999999999999753 444555444 Q ss_pred EEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCC---- Q lcl|NC_013644. 169 ITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNK---- 244 (510) Q Consensus 169 ~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~---- 244 (510) . ....+ ......+|+++.+++|....+.+ ...+|++|+||+|+|.|++ T Consensus 145 ~--~d~~~---~~~~~~~~~~~~~~~~~~~~~~~-----------------------~~~~~~~g~vPvV~f~n~~~~~~ 196 (409) T protein:vir:16 145 E--RDENN---NVVLEAHFLPDRTDYYYRDSRNN-----------------------ISIANPTGNPLLVPIIHRPDAVR 196 (409) T ss_pred E--ecCCC---ceEEEEEEecCcEEEEEecCccc-----------------------cceecCCCCcceEEecccccccc Confidence 3 22211 23456789999888877654433 1247999999999999873 Q ss_pred -CCCCcH-HHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCC---CceeEEeec-CC Q lcl|NC_013644. 245 -QETTDL-KPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSD---GGLDVKTVT-IP 318 (510) Q Consensus 245 -~g~sd~-~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~-~~ 318 (510) +|.|++ +.|++|+|++|++++++....+++++|++++.|++.+......-....++++.++++ .++++.+.+ .+ T Consensus 197 ~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~d~~g~~~~v~q~~~~~ 276 (409) T protein:vir:16 197 PFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEPMETWKATVSSMLQFTKDEDGDKPTLGQFTQPS 276 (409) T ss_pred cCCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCCccchhhhhhhHhhccCCCCCCCCceEEecCCCC Confidence 689988 689999999999999999999999999999999864321111111223456666533 346665544 34 Q ss_pred HHHHHHHHHHHHHHHHHHhCCcccccccc-Cc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCcc Q lcl|NC_013644. 319 TEGRKTKMEIDKENIYKFGMAFDSTQVGD-GN-ITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAF 396 (510) Q Consensus 319 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~-~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~ 396 (510) .+++.+.++.+..+++..|++|..++++. .| +||+||++...+|..|+.+|++.|+.+|++++++++.+.+....... T Consensus 277 l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~ 356 (409) T protein:vir:16 277 MSPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVPYLRE 356 (409) T ss_pred hhHHHHHHHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccch Confidence 56666667777777777778888887753 56 69999999999999999999999999999999999988766544444 Q ss_pred ccceeeEEeCCCCCCC---HHHHHHHHHHHHhcCCC--chHHHHHhCCCCCcH Q lcl|NC_013644. 397 DPTEVSFTFTREVMVN---ETDIVNDEKTEAETRKI--ILESILQVAPRLDDD 444 (510) Q Consensus 397 ~~~~v~i~f~~~~p~d---~~e~~~~~~~~~~~g~i--S~et~~~~~~~v~d~ 444 (510) ....++++|.++.+.+ .++.++.++|++++|.. ..+++++++++.+++ T Consensus 357 ~~~~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 357 QFSKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIKGAE 409 (409) T ss_pred hhccceEEecCCCCcchhhHHHHHHHHHHHHhhcccccchhHHHHhccCCCCC Confidence 4567899999877555 79999999999999853 468899999887655 No 65 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=100.00 E-value=6e-58 Score=334.28 Aligned_cols=434 Identities=12% Similarity=0.102 Sum_probs=298.8 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) +.++-.+ . ..++..+++++. ..+.++.++++||+|+|++.+.... .....+ +++++.||++++|+ T Consensus 9 ~~gl~~~-~---~~~~~~L~~~~~--~~~~~~~~~~~Yy~G~~~~~~~~~~--------~p~~~r-~~~~v~nw~~~~Vd 73 (474) T protein:vir:81 9 IPSLSND-E---NALINGLLAQIE--NLRWKNLLRTSYYENKRTIQYVGTL--------IPPQYF-NLGLVLGWTGKAVD 73 (474) T ss_pred CCCCChh-H---HHHHHHHHHHHH--HHhhHHHHHHHHhccCCChhhcccc--------ccHHHH-HHHhhcChHHHHHH Confidence 3322222 2 234555555553 3344688899999999987543322 122222 34678999999999 Q ss_pred HHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCc--eEEEEEcccceEEEEcC Q lcl|NC_013644. 81 QKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDR--LCFQVADSLNVFGVYNE 157 (510) Q Consensus 81 ~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~--~~i~~~~p~~~~~~~d~ 157 (510) +.++++.-++++.. +.+.....+.++|+ |+++....++++.+++|||||++|+.+++|+ ++|++++|.+++++||+ T Consensus 74 ~~a~rl~~~Gf~~~-d~~~~~~~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~~~~~~~D~ 152 (474) T protein:vir:81 74 ALARRCNLEGFVWP-DGDLDSLGGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDASEATGEWNR 152 (474) T ss_pred HHHhhhcccceECC-CCCccchHHHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceEEEEEeC Confidence 99999998888753 33344455677774 9999999999999999999999999876664 78999999999999997 Q ss_pred CCCc-eeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCccc Q lcl|NC_013644. 158 YNEL-QRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIP 236 (510) Q Consensus 158 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP 236 (510) ..+. .+++.++ ....++ ....+.+|+++.++++....+++.-. .+..+|++| || T Consensus 153 ~~~~~~~al~~~--~~~~~g---~~~~~~ly~~~~~~~~~~~~~~~~w~-------------------~~~~~~~~g-vP 207 (474) T protein:vir:81 153 RRRGLNNLLSII--DKDKEG---KVLSLALYLDNETVTAQRDKATLKWQ-------------------VDRDEHVYG-VP 207 (474) T ss_pred CCCcceeeeEEE--EEcCCC---cEEEEEEEeCCcEEEEEEcCccceee-------------------eccCCCCCC-cc Confidence 6543 3333322 222222 34567899999998887654432110 134579998 79 Q ss_pred EEEecCCC-----CCCCcH-HHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh----hhhHhhh--cCeeeec Q lcl|NC_013644. 237 FYRLSNNK-----QETTDL-KPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS----KLRQNVK--SKKVVGT 304 (510) Q Consensus 237 vv~~~nn~-----~g~sd~-~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~----~~~~~~~--~~~~~~~ 304 (510) ||+|.|++ +|+|.+ +++++|+|++|++++++....+++++|++++.|++..+.. ......+ ..+++.+ T Consensus 208 vV~~~n~~~~~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~~~~~~d~d~~~~~~~~~~~~~i~~~ 287 (474) T protein:vir:81 208 AQVLPYKPAPKRPFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLLGADESALKNADGTIKSVWEARLGRIKGL 287 (474) T ss_pred eEEecccccccCcCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeecCChhhcccccccccchhhhhHHHHhcC Confidence 99999874 688887 7999999999999999999999999999999998653321 1111111 2235555 Q ss_pred cCCCc--------eeEEeec-CCHHHHHHHHHHHHHHHHHHhCCcccccc--ccCc-ccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 305 GSDGG--------LDVKTVT-IPTEGRKTKMEIDKENIYKFGMAFDSTQV--GDGN-ITNIVIKARYTLLNMKANKTEAR 372 (510) Q Consensus 305 ~~~~~--------~~~~~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~--~~g~-~Sg~Ai~~~~~~l~~k~~~k~~~ 372 (510) +++.+ +++.+.+ .+.+++...++.+..++...|++|..+++ .+.| +||.||++....|..|+.+|++. T Consensus 288 ~~d~d~~~~~~~~~~~~q~~~a~l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~ 367 (474) T protein:vir:81 288 PDDADADIPQLARADVKQFPAASPDAHWSDINGLAKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDD 367 (474) T ss_pred CCcccccccccccccccccCCCChhHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 55543 3443332 34455555566666666667889988765 3455 69999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhccCCc--cccceeeEEeCCCCCCCHHHHHHHHHHHHhcCC-C-chHHHHHhCCCCCcHHHHH Q lcl|NC_013644. 373 LRALLEWMNKLVIDDINRRYTKA--FDPTEVSFTFTREVMVNETDIVNDEKTEAETRK-I-ILESILQVAPRLDDDNVLR 448 (510) Q Consensus 373 ~~~~l~~~~~~i~~~~~~~~~~~--~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~-i-S~et~~~~~~~v~d~e~~~ 448 (510) |+.+|++++++++.+.+...... .....++++|.++...+.++.+++++|++++|. + +.++++++ +++++++.++ T Consensus 368 fg~~l~~~~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~-lg~t~~~i~~ 446 (474) T protein:vir:81 368 FTPALRKAFIRALAMKNKVAIDEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQLAAVPWLAETEVGLEL-IGLTPQQARR 446 (474) T ss_pred HHHHHHHHHHHHHHHhCCCCccccchhhccceeEecCCCccCHHHHHHHHHHHHhcccCCCcHHHHHhh-cCCCHHHHHH Confidence 99999999999988765433222 234578999999999999999999999999873 4 45566665 4566666666 Q ss_pred HHHHHHHHHHHHHHHHHHhhhccCCCCCCCCC Q lcl|NC_013644. 449 LICEQFDLDWEDVKEALEEAEYTKGLSDNTDE 480 (510) Q Consensus 449 ~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (510) +++++.+.+......++... +.+.+..+ T Consensus 447 ~~~~~~~~~~~~~~~~l~~~----~~~~~~aq 474 (474) T protein:vir:81 447 AMADKRRVQGRGTLQALIDR----SNNGATAQ 474 (474) T ss_pred HHHHHHHHhHHHHHHHHHhc----CCCCCCCC Confidence 55554443333333222111 11111111 No 66 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=100.00 E-value=2.2e-55 Score=320.28 Aligned_cols=453 Identities=9% Similarity=0.026 Sum_probs=305.7 Q ss_pred CCChhhhHHH------------HHHHH---HhhhhhhhHHHHHHHHHHhccCCcchhcccceecccccccccccccccee Q lcl|NC_013644. 6 SEDVKIIANA------------LKAAI---DKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRI 70 (510) Q Consensus 6 ~~~~~~~~~~------------i~~~i---~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki 70 (510) +.=.+.+..+ +.+.. +--.+..+..++..+++||+|++....+.. . ....+..+++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~----~-----~~~~~~~~~~ 71 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLN----T-----DGETKKRDLN 71 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCccccc----C-----CCCcccCcee Confidence 1111111111 12111 112333556789999999999976432211 1 1122344578 Q ss_pred ccchhHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEccc Q lcl|NC_013644. 71 PHGFFPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSL 149 (510) Q Consensus 71 ~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~ 149 (510) +.|+++.|++..|+|+||+|++++++++..++.|+++++ |+|...+.+++..++..|.+++.+|+|. ++++|.+++|+ T Consensus 72 slnl~~~i~~~~A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~I~~v~ad 150 (500) T protein:vir:30 72 HLPIARTAAKKIASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-DKVRVAFVQAP 150 (500) T ss_pred ecchHHHHHHHHhhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCC Confidence 899999999999999999999999999999999999995 6899999999999999999999999985 68999999999 Q ss_pred ceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEc--CC---cEEEEEEcCCceeecccccccccccccccccccc Q lcl|NC_013644. 150 NVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWT--DQ---NVYFFVAEDNKDYELDEAEPINPRPHVLAVDSEN 224 (510) Q Consensus 150 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~--~~---~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (510) +++|+..++.....++.++.+....+++...++++|.|+ ++ .|.+...........+........ ..... T Consensus 151 ~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~-----~~~l~ 225 (500) T protein:vir:30 151 VFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEV-----YKDLK 225 (500) T ss_pred eeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccc-----cCCcC Confidence 999986666555555555555554555556677888886 22 232222222111111111111111 11122 Q ss_pred cccccccCCcccEEEecCC---------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecC--------CCC Q lcl|NC_013644. 225 ESLLQRSYGQIPFYRLSNN---------KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGF--------QGD 287 (510) Q Consensus 225 ~~~~~~~~g~iPvv~~~nn---------~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~--------~~~ 287 (510) +.....++.+.|+++|+++ +.|.|+|++++++||++|.++|+++++++....++.+-..+ ++. T Consensus 226 ~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~ 305 (500) T protein:vir:30 226 DEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGD 305 (500) T ss_pred cceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCcc Confidence 3344566777788887542 45899999999999999999999999999988888873332 111 Q ss_pred chhhhhHhhhc--Ceeeecc--CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc--cccccCcccHHHHHHHHHH Q lcl|NC_013644. 288 DLSKLRQNVKS--KKVVGTG--SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDS--TQVGDGNITNIVIKARYTL 361 (510) Q Consensus 288 ~~~~~~~~~~~--~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~--~~~~~g~~Sg~Ai~~~~~~ 361 (510) ......-+... +..+... ++..++.++.++..+++...++.+.+.|...++.+.. ++...|..||+++++.++. T Consensus 306 ~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~ 385 (500) T protein:vir:30 306 VVPRPRFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSD 385 (500) T ss_pred ccCCcccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHH Confidence 11111111111 2222222 2234777778888999999998888888776665554 3445567799999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhh---ccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhC Q lcl|NC_013644. 362 LNMKANKTEARLRALLEWMNKLVIDDINR---RYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVA 438 (510) Q Consensus 362 l~~k~~~k~~~~~~~l~~~~~~i~~~~~~---~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~ 438 (510) +.++++.+++.|+.+|++++++|+.+... .++......+++|.|+++++.|..+.++.+++++++|+||+++++..+ T Consensus 386 ~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~ 465 (500) T protein:vir:30 386 TYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKV 465 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhc Confidence 99999999999999999999999976543 222222344789999999999999999999999999999999999998 Q ss_pred CCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCC Q lcl|NC_013644. 439 PRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVN 486 (510) Q Consensus 439 ~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (510) ++++++|+++++++.+++. . +..+..+++...-++ T Consensus 466 ~g~~eeea~~~l~~i~~E~----------~---~~~~~~~~~~~~~g~ 500 (500) T protein:vir:30 466 LNVTEEKAQEIAAEINTGI----------V---DEINQQRTDTHLYGE 500 (500) T ss_pred CCCCHHHHHHHHHHHHHhc----------c---ccCCCCCccccccCC Confidence 8888877665554443311 1 111111111100000 No 67 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=100.00 E-value=2.2e-55 Score=320.28 Aligned_cols=453 Identities=9% Similarity=0.026 Sum_probs=305.7 Q ss_pred CCChhhhHHH------------HHHHH---HhhhhhhhHHHHHHHHHHhccCCcchhcccceecccccccccccccccee Q lcl|NC_013644. 6 SEDVKIIANA------------LKAAI---DKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRI 70 (510) Q Consensus 6 ~~~~~~~~~~------------i~~~i---~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki 70 (510) +.=.+.+..+ +.+.. +--.+..+..++..+++||+|++....+.. . ....+..+++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~----~-----~~~~~~~~~~ 71 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLN----T-----DGETKKRDLN 71 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCccccc----C-----CCCcccCcee Confidence 1111111111 12111 112333556789999999999976432211 1 1122344578 Q ss_pred ccchhHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEccc Q lcl|NC_013644. 71 PHGFFPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSL 149 (510) Q Consensus 71 ~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~ 149 (510) +.|+++.|++..|+|+||+|++++++++..++.|+++++ |+|...+.+++..++..|.+++.+|+|. ++++|.+++|+ T Consensus 72 slnl~~~i~~~~A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~I~~v~ad 150 (500) T protein:vir:98 72 HLPIARTAAKKIASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-DKVRVAFVQAP 150 (500) T ss_pred ecchHHHHHHHHhhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCC Confidence 899999999999999999999999999999999999995 6899999999999999999999999985 68999999999 Q ss_pred ceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEc--CC---cEEEEEEcCCceeecccccccccccccccccccc Q lcl|NC_013644. 150 NVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWT--DQ---NVYFFVAEDNKDYELDEAEPINPRPHVLAVDSEN 224 (510) Q Consensus 150 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~--~~---~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (510) +++|+..++.....++.++.+....+++...++++|.|+ ++ .|.+...........+........ ..... T Consensus 151 ~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~-----~~~l~ 225 (500) T protein:vir:98 151 VFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEV-----YKDLK 225 (500) T ss_pred eeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccc-----cCCcC Confidence 999986666555555555555554555556677888886 22 232222222111111111111111 11122 Q ss_pred cccccccCCcccEEEecCC---------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecC--------CCC Q lcl|NC_013644. 225 ESLLQRSYGQIPFYRLSNN---------KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGF--------QGD 287 (510) Q Consensus 225 ~~~~~~~~g~iPvv~~~nn---------~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~--------~~~ 287 (510) +.....++.+.|+++|+++ +.|.|+|++++++||++|.++|+++++++....++.+-..+ ++. T Consensus 226 ~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~ 305 (500) T protein:vir:98 226 DEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGD 305 (500) T ss_pred cceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCcc Confidence 3344566777788887542 45899999999999999999999999999988888873332 111 Q ss_pred chhhhhHhhhc--Ceeeecc--CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc--cccccCcccHHHHHHHHHH Q lcl|NC_013644. 288 DLSKLRQNVKS--KKVVGTG--SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDS--TQVGDGNITNIVIKARYTL 361 (510) Q Consensus 288 ~~~~~~~~~~~--~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~--~~~~~g~~Sg~Ai~~~~~~ 361 (510) ......-+... +..+... ++..++.++.++..+++...++.+.+.|...++.+.. ++...|..||+++++.++. T Consensus 306 ~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~ 385 (500) T protein:vir:98 306 VVPRPRFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSD 385 (500) T ss_pred ccCCcccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHH Confidence 11111111111 2222222 2234777778888999999998888888776665554 3445567799999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhh---ccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhC Q lcl|NC_013644. 362 LNMKANKTEARLRALLEWMNKLVIDDINR---RYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVA 438 (510) Q Consensus 362 l~~k~~~k~~~~~~~l~~~~~~i~~~~~~---~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~ 438 (510) +.++++.+++.|+.+|++++++|+.+... .++......+++|.|+++++.|..+.++.+++++++|+||+++++..+ T Consensus 386 ~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~ 465 (500) T protein:vir:98 386 TYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKV 465 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhc Confidence 99999999999999999999999976543 222222344789999999999999999999999999999999999998 Q ss_pred CCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCC Q lcl|NC_013644. 439 PRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVN 486 (510) Q Consensus 439 ~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (510) ++++++|+++++++.+++. . +..+..+++...-++ T Consensus 466 ~g~~eeea~~~l~~i~~E~----------~---~~~~~~~~~~~~~g~ 500 (500) T protein:vir:98 466 LNVTEEKAQEIAAEINTGI----------V---DEINQQRTDTHLYGE 500 (500) T ss_pred CCCCHHHHHHHHHHHHHhc----------c---ccCCCCCccccccCC Confidence 8888877665554443311 1 111111111100000 No 68 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=100.00 E-value=1e-52 Score=305.56 Aligned_cols=464 Identities=9% Similarity=-0.002 Sum_probs=302.9 Q ss_pred CCChhhhHHHHHHHH-----------Hhh----hhhhhHHHHHHHHHHhccCCcchhcccceecccccccccccccccee Q lcl|NC_013644. 6 SEDVKIIANALKAAI-----------DKD----RKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRI 70 (510) Q Consensus 6 ~~~~~~~~~~i~~~i-----------~~~----~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki 70 (510) +.=..-...+|++.. ..| .+..+..++..++.||+|+++...... . ......++|+ T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~----~-----~~~~~~~~~~ 71 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKN----T-----DGDIKSRPMN 71 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccc----c-----Ccchhcccce Confidence 111111112222211 111 134566789999999999876432211 1 1122234578 Q ss_pred ccchhHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEccc Q lcl|NC_013644. 71 PHGFFPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSL 149 (510) Q Consensus 71 ~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~ 149 (510) +.|+++.|++..|+++||+|++++++++..++.|+++++ |+|...+.+++..++..|.+++.+|+| .++++|.+++|+ T Consensus 72 slnl~~~i~~~~A~lv~~e~~~i~v~d~~~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d-~~~~~i~~v~ad 150 (522) T protein:vir:47 72 HLPIARTASKKIASLVYNEQATITTKNEILQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYID-GDKVRVAFIQAP 150 (522) T ss_pred ecchHHHHHHHHhhhhcCCcceeecCChHHHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEc-CCceEEEEEcCC Confidence 899999999999999999999999999999999999995 789999999999999999999999998 578999999999 Q ss_pred ceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcC----------------CcEEEEEEcCCceeeccccccccc Q lcl|NC_013644. 150 NVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTD----------------QNVYFFVAEDNKDYELDEAEPINP 213 (510) Q Consensus 150 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~----------------~~i~~~~~~~~~~~~~~~~~~~~~ 213 (510) +++|++.++.....++.++......+.....++.+|.|+- ..|.+..........++....+.. T Consensus 151 ~~~P~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~ 230 (522) T protein:vir:47 151 VFFPLESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLSE 230 (522) T ss_pred ceEEEEEcCCceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCccccccc Confidence 9999854444444444444444333333333445666531 122211111111111111111111 Q ss_pred ccccccccccccccccccCCcccEEEecCC---------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecC Q lcl|NC_013644. 214 RPHVLAVDSENESLLQRSYGQIPFYRLSNN---------KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGF 284 (510) Q Consensus 214 ~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn---------~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~ 284 (510) .+. ..+ ..+.+.-.++.+.++++|+++ +.|+|+|++++++||++|.++|+++++++....++.|-..+ T Consensus 231 ~~e--~~~-l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~ 307 (522) T protein:vir:47 231 LDK--YKN-LEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVIVPEHL 307 (522) T ss_pred ccc--ccC-CCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccceeecchHH Confidence 100 111 122333456677788888753 46999999999999999999999999999999988873222 Q ss_pred --------CCCc--hhhhhHhhhcCeeeec--cCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc--cccccCcc Q lcl|NC_013644. 285 --------QGDD--LSKLRQNVKSKKVVGT--GSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDS--TQVGDGNI 350 (510) Q Consensus 285 --------~~~~--~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~--~~~~~g~~ 350 (510) ++.. ...+..+.+.+..+.. +++++++.++.++..+.+...++.+.+.|-..++.... ++...+.. T Consensus 308 l~~~~~~~~g~~~~~~~fd~~~~~f~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~~k 387 (522) T protein:vir:47 308 TQRQYQRPDGTIDFRPRFDVEQNVYMQIGGSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQGMK 387 (522) T ss_pred hccCCCCCCcccccccccCcccceEeecCCCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccccccc Confidence 1110 0011111112222322 33446778888899998888888888887776655543 33445667 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---ccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcC Q lcl|NC_013644. 351 TNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINR---RYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETR 427 (510) Q Consensus 351 Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~---~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g 427 (510) |+++++...+.+.++++.+++.|+.+|+++++.|+.+... .++......+++|.|++.++.|..+.+++.++++++| T Consensus 388 TAtEi~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~~~~~~~~v~aG 467 (522) T protein:vir:47 388 TATEIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAELDYWAKMVAAG 467 (522) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHHHHHHHHHHhcC Confidence 8999999999999999999999999999999999977653 2334445567999999999999999999999999999 Q ss_pred CCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccc Q lcl|NC_013644. 428 KIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMA 495 (510) Q Consensus 428 ~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (510) +||+++++..+++++++++++++++.++++ . .+.+...+..++.+.++ ...|+. + T Consensus 468 ~~s~e~~i~~~~g~~eeea~~el~ri~~E~------~-~~~~~~~~~~~~~~~~~--~~~d~~----~ 522 (522) T protein:vir:47 468 FSTKKRAIGKTLNISGVEAEKELNAINSEL------L-PMNDAELAIYGMHDQNE--EKADDK----G 522 (522) T ss_pred CCCHHHHHHhcCCCChHHHHHHHHHHHHhh------c-cCCCCCCCCCCCCCccc--ccCCCC----C Confidence 999999999999998887665554443321 1 11111112222111111 111111 1 No 69 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=100.00 E-value=1.4e-50 Score=293.83 Aligned_cols=462 Identities=11% Similarity=0.005 Sum_probs=297.1 Q ss_pred hhhHHHHHHHHHhhh----hhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhh Q lcl|NC_013644. 10 KIIANALKAAIDKDR----KSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQY 85 (510) Q Consensus 10 ~~~~~~i~~~i~~~~----~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~ 85 (510) .-+-..++.+|+-+. +..+..+...+.++|.+......+. .+....|......+....+++.|+++.|++..|++ T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~w~~~~~~~~~~~~~~~~l~~~i~~~~A~l 79 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKD-SYLTSLWAQGYVPTVHDKLMNSGTGNEIVVVAAEY 79 (518) T ss_pred CcchhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhh-hhhhhhcccCCCCccccccccCChHHHHHHHHHHh Confidence 222233444444442 2345566666777777664332222 22222222222334455689999999999999999 Q ss_pred hhcCCceecc------CcHHHHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCC Q lcl|NC_013644. 86 LLSNPVEYET------ENEELKEYLAEYY-NSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEY 158 (510) Q Consensus 86 l~g~p~~~~~------~d~~~~~~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~ 158 (510) |||+|++++. +++.+++.|++++ +|+|...+.+++..++..|.+++.+|++ +|+++|.+++|++++|+|++ T Consensus 80 l~~e~~~i~v~~~~~~d~e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d-~~~~~i~~v~ad~~~P~~~~- 157 (518) T protein:vir:78 80 ISGKPLSIDVTGVNGSKDENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINIL-NGRPSISVHSSSQFWIDFKN- 157 (518) T ss_pred hcCCCceEEecCccccCcHHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEE-CCeeEEEEEcCCeeEEEeec- Confidence 9999998864 5677889999988 4789999999999999999999999997 58899999999999999976 Q ss_pred CCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCcee------------ec--ccccccccccccccccccc Q lcl|NC_013644. 159 NELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDY------------EL--DEAEPINPRPHVLAVDSEN 224 (510) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~------------~~--~~~~~~~~~~~~~~~~~~~ 224 (510) +++..++.+.... ..++...+++++.|..+.+.+.....+... .. ................... T Consensus 158 g~~~~~~f~~~~~--~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~~~ 235 (518) T protein:vir:78 158 NEPFRFNFFEEIP--TSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTNDIQ 235 (518) T ss_pred CcEEEEEEEEEee--cCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCcccccccccccccccccccccccCc Confidence 4566655443222 233444566677765443322111111100 00 0000000000001111111 Q ss_pred cccccccCCcccEEEecCC----------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecC-----CCCc- Q lcl|NC_013644. 225 ESLLQRSYGQIPFYRLSNN----------KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGF-----QGDD- 288 (510) Q Consensus 225 ~~~~~~~~g~iPvv~~~nn----------~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~-----~~~~- 288 (510) +...-......|+++|.+| +.|+|+|++++++||+||.++|+++++++....++.|...+ .+.. T Consensus 236 e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~~ 315 (518) T protein:vir:78 236 LNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEKTKTKIAASERMFRKKVNKSTD 315 (518) T ss_pred cceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHhCCceeeechhHhccCCCCCCC Confidence 1111122245566665322 34999999999999999999999999999988888874332 1111 Q ss_pred --hhhhhHhhhcCeeeeccC--CCc----eeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHH Q lcl|NC_013644. 289 --LSKLRQNVKSKKVVGTGS--DGG----LDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARY 359 (510) Q Consensus 289 --~~~~~~~~~~~~~~~~~~--~~~----~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~ 359 (510) ...+..+.+.+..+.... +++ ++.++.++..+++...++.+.+.|...++.+...++. .+..||++++... T Consensus 316 ~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~~~TATei~s~~ 395 (518) T protein:vir:78 316 KEEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNREVKATEIWSLQ 395 (518) T ss_pred ccccccCCCCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCcccccccHHHHHHHH Confidence 111222223333333222 222 5667788999999999999999998888777654432 3568999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC-----CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHH Q lcl|NC_013644. 360 TLLNMKANKTEARLRALLEWMNKLVIDDINRRYT-----KAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESI 434 (510) Q Consensus 360 ~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~-----~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~ 434 (510) +.+.+++..++..++.+|++++++++.+++.... ......+++|.|++.++.|..+.+++.++++++|+||+|++ T Consensus 396 ~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~v~aGimS~e~~ 475 (518) T protein:vir:78 396 DATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSSTLNNMNSALAMSVEEK 475 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHH Confidence 9999999999999999999999999988765322 12234579999999999999999999999999999999999 Q ss_pred HHh-CCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccC Q lcl|NC_013644. 435 LQV-APRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETA 484 (510) Q Consensus 435 ~~~-~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (510) +++ +|.++++++++..++.++++ +....+.+....+ -+.++| T Consensus 476 i~~~~~~~~deea~~e~~ri~~E~------~~~~~~~p~~~~g--~~~~~g 518 (518) T protein:vir:78 476 VKLIHPKWEDEEIQAEVKRIYLEN------AIGEVPDPEAIGG--METKGG 518 (518) T ss_pred HHHhCCCCCHHHHHHHHHHHHHHh------cccCCCCCccccC--CCCCCC Confidence 986 57888776665444443321 1111111111111 001111 No 70 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=100.00 E-value=1.6e-49 Score=288.10 Aligned_cols=458 Identities=12% Similarity=0.049 Sum_probs=302.4 Q ss_pred CCChhhhHHHHH---------------HHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceecccccccccccccccee Q lcl|NC_013644. 6 SEDVKIIANALK---------------AAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRI 70 (510) Q Consensus 6 ~~~~~~~~~~i~---------------~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki 70 (510) +.=..-+..+++ ..++-..+...+.++.++++||+|+++..+... .....+...++ T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~---------~~~~~~~~~~~ 71 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYIN---------SQGKIQERDYM 71 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCccccccc---------cccccccccee Confidence 111111222222 222222344567789999999999997654221 11122233578 Q ss_pred ccchhHHHHHHHHhhhhcCCceeccCc-----------HHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC Q lcl|NC_013644. 71 PHGFFPEIVDQKTQYLLSNPVEYETEN-----------EELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAE 138 (510) Q Consensus 71 ~~n~~~~Iv~~~~~~l~g~p~~~~~~d-----------~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~ 138 (510) +.|+++.|+...++++|++++++++++ ...+++|+++++ |+|...+.+++..++..|.+++.+|+|. T Consensus 72 sl~~~~~i~~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~- 150 (517) T protein:vir:98 72 TLNLRKLSADVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPYVDN- 150 (517) T ss_pred ecCcHHHHHHHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEEEeC- Confidence 899999999999999999999997764 336788999885 6799999999999999999999999984 Q ss_pred CceEEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcE------EEEE---EcCCceeeccccc Q lcl|NC_013644. 139 DRLCFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNV------YFFV---AEDNKDYELDEAE 209 (510) Q Consensus 139 g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i------~~~~---~~~~~~~~~~~~~ 209 (510) |.++|.+++|++++|+-.++.....++.++......+++...++++|.|..+.+ |+-+ ...+....++... T Consensus 151 ~~~~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v 230 (517) T protein:vir:98 151 GEIEFSWALANAFYPLRSNSNGISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGKRI 230 (517) T ss_pred CeeEEEEEcCCeeEEEEecCCCeEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEEecCCCccccccc Confidence 789999999999999543444444444344334444455566778899887643 1111 1111111111111 Q ss_pred ccccccccccccccccccccccCCcccEEEecC----C-----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeE Q lcl|NC_013644. 210 PINPRPHVLAVDSENESLLQRSYGQIPFYRLSN----N-----KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYV 280 (510) Q Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n----n-----~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv 280 (510) .+... .....+.+.-.++.+.++++|++ + +.|.|+|+++++++|++|.++|+++++++....++.| T Consensus 231 ~L~~~-----~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~~i~v 305 (517) T protein:vir:98 231 PLEEL-----YEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQRTVFV 305 (517) T ss_pred ccccc-----ccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCcceec Confidence 11111 01112223334556655667654 2 4699999999999999999999999999998888887 Q ss_pred EecCCC---Cc----h-hhhhHhhhcCeeeecc-CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccc--ccCc Q lcl|NC_013644. 281 VSGFQG---DD----L-SKLRQNVKSKKVVGTG-SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQV--GDGN 349 (510) Q Consensus 281 ~~g~~~---~~----~-~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~--~~g~ 349 (510) -..+-. +. . .-+......+..+..+ ++..++..+.++..+++.+.++.+.+.|...++.+...++ ..|. T Consensus 306 p~~~l~~~~~~~g~~~~~~~d~~~~~y~~~~~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~~ 385 (517) T protein:vir:98 306 SDVMLRTVPDESGMPPPQVFDPDVNVYKSIRMGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRSM 385 (517) T ss_pred ChhhhccccCCCCcccCCCCCcccceeeeccCCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCccccccccccc Confidence 443311 10 0 0011111122223322 2233556667788899999999999999998888875443 3455 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---ccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhc Q lcl|NC_013644. 350 ITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINR---RYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAET 426 (510) Q Consensus 350 ~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~---~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~ 426 (510) .|+++++...+.+.++++++++.|+.+|++++++|+.+... .++......+++|.|++.++.|..+.+++..+++++ T Consensus 386 kTATEi~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~D~~~~~~~~~~~v~a 465 (517) T protein:vir:98 386 KTATEIVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQDRSALLRFYGQAKTF 465 (517) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCCCHHHHHHHHHHHHhc Confidence 68999999999999999999999999999999999876543 333333455799999999999999999999999999 Q ss_pred CCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcc Q lcl|NC_013644. 427 RKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPT 491 (510) Q Consensus 427 g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (510) |+||+++++.++.+++++|+++++++.+++.. ..++ . ...+...+..+|++. T Consensus 466 G~ms~~~~i~~~~g~~eeeA~~e~~~i~~E~~-------~~~~----~--~~~~~~~~~~~gd~e 517 (517) T protein:vir:98 466 GFIPTVEAIQRIFKVPKKTAEQWLEEIRKDQI-------ELDP----V--TISQRAQKRMFGDEE 517 (517) T ss_pred CCCCHHHHHHHhCCCChHHHHHHHHHHHHhcc-------ccCC----C--CccccccCCCCCCCC Confidence 99999999999888887776655544433221 0111 0 111111112222221 No 71 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=100.00 E-value=9.8e-48 Score=278.29 Aligned_cols=476 Identities=11% Similarity=0.060 Sum_probs=304.2 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) -..++-.-....+ ..+..|- ..|...|+.+.+||.+.+.-+.-.. .... . +--.++.+|..++|+. T Consensus 10 ~~~~~~~g~~~~p----~~v~~~d-~~Rl~aY~l~~~~y~n~~~~~~~~l--rg~~------~-~~~r~~~~ps~~~~~~ 75 (527) T protein:vir:10 10 STQQLRAGEANFP----NAVTDFD-KARLASYRLYEDMYLTNTSDYQVIL--RGGD------E-GDQRPIYVPNGEKLIE 75 (527) T ss_pred CCcCcCCccccCc----ccCCHHH-HHHHHHHHHHHHHhcCchhheeeec--CCcc------c-cccceeeehhhHHhhC Confidence 0011100001111 1133332 3456679999999999874322110 0000 0 0012467788888877 Q ss_pred HHHhhhhcCCcee--ccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC----CceEEEEEcccceEE Q lcl|NC_013644. 81 QKTQYLLSNPVEY--ETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAE----DRLCFQVADSLNVFG 153 (510) Q Consensus 81 ~~~~~l~g~p~~~--~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~----g~~~i~~~~p~~~~~ 153 (510) ....|+ +.+..+ +..++++++.|+.|++ +++..++.++.+++++.|++.+++-+|++ ++++++.+||.++|| T Consensus 76 ~~~~~~-~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~~~f~ 154 (527) T protein:vir:10 76 AKMRFL-GQGLKWEFSKKDAKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPSTYFP 154 (527) T ss_pred Ccceee-ccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCcceeee Confidence 665443 444443 4457788999999885 88899999999999999998777766642 479999999999999 Q ss_pred EEcCCC--CceeEEEEEEEEEeeCCceeEE-EE-----EEEE-----cCCcEEEEEEc---CCceeeccccccccccccc Q lcl|NC_013644. 154 VYNEYN--ELQRICRHYITEIEKDGETVDI-HH-----AEVW-----TDQNVYFFVAE---DNKDYELDEAEPINPRPHV 217 (510) Q Consensus 154 ~~d~~~--~~~~~~~~~~~~~~~~~~~~~~-~~-----~e~y-----~~~~i~~~~~~---~~~~~~~~~~~~~~~~~~~ 217 (510) +.|+.. .+..+..++.+...++-+...+ -+ +++- ...+-.+|+-. -+.| ......+..+.... T Consensus 155 ~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w-~d~~e~p~~~~~~~ 233 (527) T protein:vir:10 155 YEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKW-DDRPESPLEPDDIK 233 (527) T ss_pred eecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeecccc-ccccccccchhhhh Confidence 987532 2333333323333333322211 10 1111 11111222110 0111 11112222233333 Q ss_pred ccccccccccccccCCcccEEEecCC-----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhh Q lcl|NC_013644. 218 LAVDSENESLLQRSYGQIPFYRLSNN-----KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKL 292 (510) Q Consensus 218 ~~~~~~~~~~~~~~~g~iPvv~~~nn-----~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~ 292 (510) ...++......++++++||||+|+|- ..|+|+++++++++|++|.++|+.+..+.+.+.|+.+++|+...+...- T Consensus 234 ~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd~~G~ 313 (527) T protein:vir:10 234 KLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRDSRGN 313 (527) T ss_pred hhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecccccccccCC Confidence 44556666788999999999999763 4699999999999999999999999999999999999999865432211 Q ss_pred --hHhhhcCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc---cCcccHHHHHHHHHHHHHHHH Q lcl|NC_013644. 293 --RQNVKSKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG---DGNITNIVIKARYTLLNMKAN 367 (510) Q Consensus 293 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~---~g~~Sg~Ai~~~~~~l~~k~~ 367 (510) .-.+..+.++.+++++++..+.......+++.|++.|.+.|+.+|++|.+.++. ++++||.||+..+++|.+++. T Consensus 314 ~~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLlar~~ 393 (527) T protein:vir:10 314 MVPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAILSSCA 393 (527) T ss_pred cCccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHHHHHH Confidence 113456778899999999988776788999999999999999999999998763 456799999999999999999 Q ss_pred HHHHHHHHHHHHHHH-HHHH----HHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhC---C Q lcl|NC_013644. 368 KTEARLRALLEWMNK-LVID----DINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVA---P 439 (510) Q Consensus 368 ~k~~~~~~~l~~~~~-~i~~----~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~---~ 439 (510) +|+..|+-..++..+ ++.. +.........+...+.|+|.+++|.|.++.++.+++++++|++|.+||+++| + T Consensus 394 rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aGi~S~~tAv~~L~~~~ 473 (527) T protein:vir:10 394 EQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNSEKRFNQLLQLWEAGLIPAKKLTEELSKIM 473 (527) T ss_pred HHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHHcCchhHHHHHHHHHhcc Confidence 999999998887543 3222 2333333333455789999999999999999999999999999999998887 6 Q ss_pred CCCcHHHH-HHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccccCC Q lcl|NC_013644. 440 RLDDDNVL-RLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLPE 508 (510) Q Consensus 440 ~v~d~e~~-~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (510) ++.|+|.+ +++.++.. ......+...++......+....+.. .+|+...+ +|. T Consensus 474 g~eD~E~E~~~I~~era--~~a~a~a~A~~~~~a~~~~~~g~~~~--~~d~~~~~------------~~~ 527 (527) T protein:vir:10 474 GFELTEEDFKQATEDKK--TQGIAQAEAADPFGAQMAAEQGIPDE--EDDQALNG------------QPL 527 (527) T ss_pred CCCChHHHHHHHHHHHH--HHhHHhhhhcCchhhhhccccCCCCC--CcccccCC------------CCC Confidence 78887543 22222222 22222333333333222221111111 11111111 222 No 72 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=100.00 E-value=1.1e-47 Score=277.93 Aligned_cols=476 Identities=11% Similarity=0.063 Sum_probs=304.0 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) -..++-.-....+ ..+..|- ..|...|+.+.+||.+.+.-+.-.. .... . +--.++.+|..++|+. T Consensus 10 ~~~~~~~g~~~~p----~~v~~~d-~~Rl~aY~l~~~~y~n~~~~~~~~l--rg~~------~-~~~r~~~~ps~~~~~~ 75 (527) T protein:vir:10 10 STQQLRAGEANFP----NAVTDFD-KARLASYRLYEDMYLTNTSDYQVIL--RGGD------E-GDQRPIYVPNGEKLIE 75 (527) T ss_pred CCcCcCCccccCc----ccCCHHH-HHHHHHHHHHHHHhcCchhheeeec--CCcc------c-cccceeeehhhHHhhC Confidence 0011100001111 1133332 3456679999999999874322110 0000 0 0012467787888877 Q ss_pred HHHhhhhcCCcee--ccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC----CceEEEEEcccceEE Q lcl|NC_013644. 81 QKTQYLLSNPVEY--ETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAE----DRLCFQVADSLNVFG 153 (510) Q Consensus 81 ~~~~~l~g~p~~~--~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~----g~~~i~~~~p~~~~~ 153 (510) ....|+ +.+..+ +..++++++.|+.|++ +++..++.++.+++++.|++.+++-+|++ ++++++.+||.++|| T Consensus 76 ~~~~~~-~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~~~f~ 154 (527) T protein:vir:10 76 AKMRFL-GQGLKWEFSKKDAKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPSTYFP 154 (527) T ss_pred Ccceee-ccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCcceeee Confidence 665443 444443 4457788999999885 88899999999999999998777766642 479999999999999 Q ss_pred EEcCCC--CceeEEEEEEEEEeeCCceeEE-EE-----EEEE-----cCCcEEEEEEc---CCceeeccccccccccccc Q lcl|NC_013644. 154 VYNEYN--ELQRICRHYITEIEKDGETVDI-HH-----AEVW-----TDQNVYFFVAE---DNKDYELDEAEPINPRPHV 217 (510) Q Consensus 154 ~~d~~~--~~~~~~~~~~~~~~~~~~~~~~-~~-----~e~y-----~~~~i~~~~~~---~~~~~~~~~~~~~~~~~~~ 217 (510) +.|+.. .+..+..++.+...++-+...+ -+ +++- ...+-.+|+-. -+.| ......+..+.... T Consensus 155 ~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w-~d~~e~p~~~~~~~ 233 (527) T protein:vir:10 155 YEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKW-DDRPESPLEPDDIK 233 (527) T ss_pred eecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeecccc-ccccccccchhhhh Confidence 987532 2333333323333333322211 10 1111 11111222110 0111 11112222233333 Q ss_pred ccccccccccccccCCcccEEEecCC-----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhh Q lcl|NC_013644. 218 LAVDSENESLLQRSYGQIPFYRLSNN-----KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKL 292 (510) Q Consensus 218 ~~~~~~~~~~~~~~~g~iPvv~~~nn-----~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~ 292 (510) ...++......++++++||||+|+|- ..|+|+++++++++|++|.++|+.+..+.+.+.|+.+++|+...+...- T Consensus 234 ~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd~~G~ 313 (527) T protein:vir:10 234 KLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRDSRGN 313 (527) T ss_pred hhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecccccccccCC Confidence 44556666788999999999999763 4699999999999999999999999999999999999999865432211 Q ss_pred --hHhhhcCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccc--c-cCcccHHHHHHHHHHHHHHHH Q lcl|NC_013644. 293 --RQNVKSKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQV--G-DGNITNIVIKARYTLLNMKAN 367 (510) Q Consensus 293 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~--~-~g~~Sg~Ai~~~~~~l~~k~~ 367 (510) .-.+..+.++.+++++++..+.......+++.|++.|.+.|+.+|++|.+.++ . ++++||.||+..+++|.+++. T Consensus 314 ~~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLlar~~ 393 (527) T protein:vir:10 314 MVPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAILSSCA 393 (527) T ss_pred cCccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHHHHHH Confidence 11345677889999999998877678899999999999999999999999876 3 456799999999999999999 Q ss_pred HHHHHHHHHHHHHHH-HHHH----HHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhC---C Q lcl|NC_013644. 368 KTEARLRALLEWMNK-LVID----DINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVA---P 439 (510) Q Consensus 368 ~k~~~~~~~l~~~~~-~i~~----~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~---~ 439 (510) +|+..|+-..++..+ ++.. +.........+...+.|+|.+++|.|.++.++.+++++++|++|.+||+++| + T Consensus 394 rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aGiiS~etAv~~L~~~~ 473 (527) T protein:vir:10 394 EQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNNEKRFAQLLELWEAGLIPAKKLTEELSKIM 473 (527) T ss_pred HHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHHcCchhHHHHHHHHHhcc Confidence 999999998887543 3222 2333333333455789999999999999999999999999999999998887 6 Q ss_pred CCCcHHHH-HHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccccCC Q lcl|NC_013644. 440 RLDDDNVL-RLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLPE 508 (510) Q Consensus 440 ~v~d~e~~-~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (510) ++.|+|.+ +++.++... .....+...++......+....+.. .+|+...+ +|. T Consensus 474 g~eD~E~E~~~I~~era~--~a~a~a~a~~~~~a~~~~~~g~~~~--~~d~~~~~------------~~~ 527 (527) T protein:vir:10 474 GFELTEEDFRQATEDKKT--QGIAQAEAADPFGAQMAAEQGIPDE--EDDQALNG------------QPL 527 (527) T ss_pred CCCchHHHHHHHHHHHHH--HhHHhhhhcCchhhhhccccCCCCC--CcccccCC------------CCC Confidence 78887644 222222222 2222333333333222221111111 11111111 222 No 73 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=100.00 E-value=6.7e-43 Score=251.80 Aligned_cols=490 Identities=14% Similarity=0.091 Sum_probs=284.8 Q ss_pred CC-----CccCCC--hhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccc Q lcl|NC_013644. 1 ME-----ALLSED--VKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHG 73 (510) Q Consensus 1 ~~-----~~~~~~--~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n 73 (510) |- -.-++- +.-.+.+ +..+- ..|-.+|+.+.+||.|++--+.-. .. +. .. .-+..| T Consensus 1 m~~~~~q~~p~~~~fp~~~a~w----V~~~D-~~RlaaY~ly~d~y~n~~~el~~i--l~---G~----dr---~~~~~p 63 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLRGGDDNI----VDEND-KNRVRAYDLYENIYLNSAETLKLV--LR---GD----DS---VPILMP 63 (563) T ss_pred CCccccccCCCccccccccccc----CCHHH-HHHHHHHHHHHHhhcCchhhhhhh--cC---CC----ce---eeeccc Confidence 21 000000 1122222 22221 235667999999999998432211 00 01 01 124456 Q ss_pred hhHHHHHHHHhhhhcCCceeccC----cH----HHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECC----CCc Q lcl|NC_013644. 74 FFPEIVDQKTQYLLSNPVEYETE----NE----ELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNA----EDR 140 (510) Q Consensus 74 ~~~~Iv~~~~~~l~g~p~~~~~~----d~----~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~----~g~ 140 (510) +++.||++.+ +++|.|++|+.+ ++ .++..|++|.+ +++..++.++.+++++.|++.+++-+|. .++ T Consensus 64 s~r~~V~~~~-~~Lg~~~~~~Ve~~~~de~~~~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~~g~R 142 (563) T protein:vir:74 64 SGRKIVEAVH-RFLGVGFDYLVEPDMGDEGIRQSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKKAGER 142 (563) T ss_pred hHHHHHHHHH-HhcCCCcEEecCccccCcchHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccccccCCC Confidence 8889999966 555999999543 22 34556677774 7788899999999999999877776664 358 Q ss_pred eEEEEEcccceEEEEcCCCCceeEE--EEE-EEEEeeCCceeEEEE---EEEEcCCcEE--EEEEcCCceee-----cc- Q lcl|NC_013644. 141 LCFQVADSLNVFGVYNEYNELQRIC--RHY-ITEIEKDGETVDIHH---AEVWTDQNVY--FFVAEDNKDYE-----LD- 206 (510) Q Consensus 141 ~~i~~~~p~~~~~~~d~~~~~~~~~--~~~-~~~~~~~~~~~~~~~---~e~y~~~~i~--~~~~~~~~~~~-----~~- 206 (510) +++..+||.+.|| |++......+. ++. .+...++-....... ...+.+...+ +|....+-|.. .. T Consensus 143 ~rv~~vDP~~~fp-~~dpd~v~g~~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg~~~~~~~~dae~w~lg~wd~r~~ 221 (563) T protein:vir:74 143 ISVDEVDPRQIFL-IEDGSTVVGFHMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEGMFTGRISSELTHWTLGNWDDRGA 221 (563) T ss_pred ceEeecCCceeee-ccCCCCcccceeeecccCCCCCcchhccceeeeeeeeeeCCCCCccceeeeccchhccccccccCc Confidence 9999999999999 44332221111 110 111111111111111 1111122221 12222111110 00 Q ss_pred ccccccccccc--ccccccccccccccCCcccEEEecCC-----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcccee Q lcl|NC_013644. 207 EAEPINPRPHV--LAVDSENESLLQRSYGQIPFYRLSNN-----KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIY 279 (510) Q Consensus 207 ~~~~~~~~~~~--~~~~~~~~~~~~~~~g~iPvv~~~nn-----~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~l 279 (510) ........... ...........|++++.||+|+|+|- ..|+|++++++++++++|.++|+.+..+..+..|+. T Consensus 222 ~~~~~~~~~~~~~~~~~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi~ 301 (563) T protein:vir:74 222 ISDEQARRKEQVRSAQHDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQGLGMY 301 (563) T ss_pred cchhhhcccchhhhhhhhchhhhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCeE Confidence 00000001111 11111234566899999999998763 469999999999999999999999999999999999 Q ss_pred EEecCCCCch--hhhh-HhhhcCeeeeccCCCc---eeEEeecCCHHHHHHHHHHHHH-HHHHHhCCcccccc--c-cCc Q lcl|NC_013644. 280 VVSGFQGDDL--SKLR-QNVKSKKVVGTGSDGG---LDVKTVTIPTEGRKTKMEIDKE-NIYKFGMAFDSTQV--G-DGN 349 (510) Q Consensus 280 v~~g~~~~~~--~~~~-~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~l~~-~i~~~s~~p~~~~~--~-~g~ 349 (510) ++.|....+. ++.. -++..+.++.++++.. ...+.-..+...+..|++.+.. .|+.+|++|...++ . +.. T Consensus 302 vl~~~~p~d~~~g~~~~w~vgpG~i~El~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~eral~~~s~tPavA~G~vD~~~~ 381 (563) T protein:vir:74 302 VTNASAPVDPNTGELTDWNIGPMQIVEIAGNRNDNYFERVSGVQDVSPFQDHMKWIDEKGIAEGSGTPEVAIGRVDVTSA 381 (563) T ss_pred EeccccccccccccccccccCCceeEeccCCccccceeeecchhhhHHHHHHHHHHHHHHHHhhccCcceeecccccccc Confidence 9987543321 1111 1245677888887655 4444444567889999988887 88999999998876 3 345 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHhh----------ccCCccccc-eeeEEeCCCCCCCHH Q lcl|NC_013644. 350 ITNIVIKARYTLLNMKANKTEARLRALLEW----MNKLVIDDINR----------RYTKAFDPT-EVSFTFTREVMVNET 414 (510) Q Consensus 350 ~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~----~~~~i~~~~~~----------~~~~~~~~~-~v~i~f~~~~p~d~~ 414 (510) +||+||+..+.+|.+++.+|++.+..++++ .+++++..... .+..++... .|+|+|.+.+|.|.+ T Consensus 382 ~SGiALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~~~v~ivf~p~~P~d~~ 461 (563) T protein:vir:74 382 ESGISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNECSVVCIFADPMPVNKT 461 (563) T ss_pred cchhhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCceEEEEEeCCCCCccHH Confidence 799999999999999999999988888888 45444433322 233333333 579999999999999 Q ss_pred HHHHHHHHHHhcCCCchHHHHHhC---CCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcc Q lcl|NC_013644. 415 DIVNDEKTEAETRKIILESILQVA---PRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPT 491 (510) Q Consensus 415 e~~~~~~~~~~~g~iS~et~~~~~---~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (510) +.++.+++++++|++|.|||+++| +|..++-+.++.+.+ ..+...+..++..+..+.+....++..-++...++.. T Consensus 462 ~vv~~~~tl~~aGiiSretAv~~L~~~g~~~pdae~e~~~ie-~~~i~~~~~a~a~ad~~~~~~a~~~~g~~~~~~dd~g 540 (563) T protein:vir:74 462 QVTQDTLLLQQAHLILRKMAVAKLRSIGWEYPEVDDQGNALT-DDDIADMLLAEAEADASLGLSAMDNGGAGEQQFDDQG 540 (563) T ss_pred HHHHHHHHHHHcCchhHHHHHHHHHhCCCCCCcHHHHHhhcC-HHHHHHHHHHHhhccCcccceecccCCCCcccccccC Confidence 999999999999999999998887 664433222222222 1111221112222222222222222222222222222 Q ss_pred cccccC---cc-cccccccCCCC Q lcl|NC_013644. 492 QQMAEG---AT-GSTESQLPENG 510 (510) Q Consensus 492 ~~~~~~---~~-~~~~~~~~~~~ 510 (510) .+.+.. +. -...-|+|..- T Consensus 541 ~p~~~~~~~~~~~~~~~~~~~~~ 563 (563) T protein:vir:74 541 NPIDQFGNPVEIPPDVTQVPLSP 563 (563) T ss_pred CchhHcCCcccCCccccccCCCC Confidence 222211 00 01122333333 No 74 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=100.00 E-value=3.8e-30 Score=181.90 Aligned_cols=466 Identities=12% Similarity=0.058 Sum_probs=270.4 Q ss_pred CCCccCCCh-hhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccc-e-eccchhHH Q lcl|NC_013644. 1 MEALLSEDV-KIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNV-R-IPHGFFPE 77 (510) Q Consensus 1 ~~~~~~~~~-~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~-k-i~~n~~~~ 77 (510) |-+---++. ...+++. ....++..+++-|.|...+......+..+........++.-. | +-.|+++. T Consensus 1 m~~~~~~~v~~~h~~y~----------a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~ 70 (513) T protein:vir:97 1 MADKDPKSPATTSGAYD----------QMLPRWHVIETLLGGTEAMREAGETYLPRHQEETDKGYQERLASAVLLNMVEQ 70 (513) T ss_pred CCCCCCCCCCcCCHHHH----------HHHHHHHHHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcccCCChHHH Confidence 322211111 1111111 122346667778888765443222221111111111121111 1 34799999 Q ss_pred HHHHHHhhhhcCCceeccCcH-HHHH-HHHHHh--ccCHHHHHHHHHHHHHhcCeEEEEEEECCCC-------------- Q lcl|NC_013644. 78 IVDQKTQYLLSNPVEYETENE-ELKE-YLAEYY--NSEFQVVLQELVEGSSQKGFEYVYARTNAED-------------- 139 (510) Q Consensus 78 Iv~~~~~~l~g~p~~~~~~d~-~~~~-~l~~~~--~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g-------------- 139 (510) +++..++++|-+||+++.+.. ...+ ++.++- .++++..+..+++.++.+|+++++|.++..+ T Consensus 71 tl~~l~G~vf~k~p~~~~~~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~~~~T~Ade~ 150 (513) T protein:vir:97 71 TLDTLSGKPFSEPIKLNEDVPKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDGQPRTLADDR 150 (513) T ss_pred HHHHHhhhhhhcCcccCcCchHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccchhHHhHHHHH Confidence 999999999999999864433 3333 445554 3788899999999999999999999876432 Q ss_pred ----ceEEEEEcccceEEEEcC-C-CC--ceeEEEEEE-EEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccc Q lcl|NC_013644. 140 ----RLCFQVADSLNVFGVYNE-Y-NE--LQRICRHYI-TEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEP 210 (510) Q Consensus 140 ----~~~i~~~~p~~~~~~~d~-~-~~--~~~~~~~~~-~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~ 210 (510) +|.+..++|.+++= |+. . +. ....++... +.+.++-..+.+..+.+++++.+..|+...++...... T Consensus 151 ~~~~rPy~~~~~~e~Iin-W~~~~v~G~~~L~~v~l~E~~~~~Dgf~~~~~~q~rvL~~g~~~v~r~~~~~~~~~~e--- 226 (513) T protein:vir:97 151 REGLRPYWVMIKPECLLF-ARSEVINGVEVLQHVRIIEHYMEQDGFAEVCKRRIRVLEPGLVQLWEPVKKSNAQKEE--- 226 (513) T ss_pred hhccCceEEEecHhhhcC-cceeccCcceeeeeEEEEEEEeecCCCcceEEEEEEEEeCceEEEEEeecCCCccccc--- Confidence 37899999998865 331 1 11 112222222 22334445667777888999988777765433222111 Q ss_pred cccccccccccccccccccccCCcccEEEecCCCC----CCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCC Q lcl|NC_013644. 211 INPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQ----ETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQG 286 (510) Q Consensus 211 ~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~----g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~ 286 (510) ........|+++.||||++..... +.+-|.++-.|--+.-...|++...+...++|++++.|++. T Consensus 227 -----------~~~~~~g~~~l~~IP~v~~~~~~~~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~G~~~ 295 (513) T protein:vir:97 227 -----------WALADEWATGLNYVPLVTFYADRQGFMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFPILACSGASG 295 (513) T ss_pred -----------eEEecCCCCcCCceeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccceeeeecCCc Confidence 112233467899999999875432 44567788888777778899999999999999999999855 Q ss_pred CchhhhhHhhhcCeeeeccC-CCceeEEeecCC-HHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHH Q lcl|NC_013644. 287 DDLSKLRQNVKSKKVVGTGS-DGGLDVKTVTIP-TEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNM 364 (510) Q Consensus 287 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~ 364 (510) .....+ -+..+.++.+++ +++++|++.+.+ .++.+..++.+.++|......+ -....++.||+|.+.......+ T Consensus 296 ~~~~~i--~iG~~~~~~lpe~~~~~~yie~~g~~i~~~~~~l~~le~qm~~~Ga~l--l~~~~~~~Ta~a~~~~~~~~~S 371 (513) T protein:vir:97 296 EDSDPV--VVGPNKVLYNPDPAGRFYYVEHTGQAIAAGRTDLKDLEEQMAGYGAEF--LKRKTGGQTATARALDSAEATS 371 (513) T ss_pred CCCCce--EeeccccccCCCCCCcceeeccCchhHHHHHHHHHHHHHHHHHHHHHh--hccCCccccHHHHHHHHHHHHH Confidence 432222 133445666775 788999999855 4778899999999998775432 2234567899999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCC-CCCC-HHHHHHHHHHHHhcCCCchHHHHHhC---C Q lcl|NC_013644. 365 KANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTRE-VMVN-ETDIVNDEKTEAETRKIILESILQVA---P 439 (510) Q Consensus 365 k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~-~p~d-~~e~~~~~~~~~~~g~iS~et~~~~~---~ 439 (510) ....+...+..++.+++++++.+++... ..++|..++. .... ..+.++.+.++.++|.||.+|.++.+ . T Consensus 372 ~L~~~a~~le~al~~~l~~~a~wlg~~~------~~~~v~in~dF~~~~~~~~~~~al~~a~~~G~is~~t~~~~L~r~g 445 (513) T protein:vir:97 372 DLSAMTGLFEDALAQALDITADWLRLGP------NGGTVELVKDYDLEEMDAPGLQALQVAREKRDISRKTYLNGLRLRG 445 (513) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCCC------CccEEEeccccCcccCCHHHHHHHHHHHhCCCCCHHHHHHHHHhcc Confidence 9999999999999999999999987421 1233333322 2222 24567788899999999999998764 2 Q ss_pred CC----CcHHHHHHHHHHHHHHHHHHHH-HHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 440 RL----DDDNVLRLICEQFDLDWEDVKE-ALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 440 ~v----~d~e~~~~~~e~~e~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) .+ ++++.. +++.++-.++... -....+.....+ ...+.++.++..+.+....|..|. -|+-- T Consensus 446 vl~~d~d~~~~~---e~~~~~~~~~~~~~~~d~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~ 512 (513) T protein:vir:97 446 VLPEDFDEDEDW---EELMEEISEAMGRAGLDLDPAQKNPP--EGGEGEGEGEGEGGEGGEGGEGGG----NPGGE 512 (513) T ss_pred CCCccCCHHHHH---HHHHHhhhhccCCCCccccccCCCCC--CCCCCCCCCCCCCCCCCCccccCC----CCCCC Confidence 22 222211 1111100000000 000001000000 000111111111111111111111 11100 No 75 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=99.95 E-value=1.2e-26 Score=162.72 Aligned_cols=428 Identities=10% Similarity=0.040 Sum_probs=254.5 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceecccccccccccccc--ceeccchhHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASN--VRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~--~ki~~n~~~~I 78 (510) |- -....++.. ....++..+++-|.|...+......+..+.....+..++.- .-+-.|+++.+ T Consensus 1 m~-----V~~~hp~y~----------a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t 65 (452) T protein:vir:94 1 MP-----IETKHPEYL----------AYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKT 65 (452) T ss_pred CC-----CCCcCHHHH----------HHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHH Confidence 11 111111111 12234556667777776553322211111111111111110 01337999999 Q ss_pred HHHHHhhhhcCCceeccCcHHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCC-ceEEEEEcccceEEEEc- Q lcl|NC_013644. 79 VDQKTQYLLSNPVEYETENEELKEYLAEYYNSEFQVVLQELVEGSSQKGFEYVYARTNAED-RLCFQVADSLNVFGVYN- 156 (510) Q Consensus 79 v~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g-~~~i~~~~p~~~~~~~d- 156 (510) ++..++++|.+||+++..+ .......+--.++++.....++..++.+|+++++|.++..| +|.+..++|.+++- |+ T Consensus 66 ~~~~~G~vf~k~p~~~~p~-~l~~~~~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~~~~~~~~Ii~-W~~ 143 (452) T protein:vir:94 66 LSALSGMVLDQPPVITHPD-AMSKYFEDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYISVYTTENILN-WEE 143 (452) T ss_pred HHHHhchhhcCCceecccH-HHHHHHhcccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEEEEechhhhcC-ccc Confidence 9999999999999986542 22222222234788999999999999999999999988665 79999999999874 54 Q ss_pred -CCCCceeEEEEEEEE-EeeC---CceeEEEEEEEEc--CCcEEE--EEEcCCceeeccccccccccccccccccccccc Q lcl|NC_013644. 157 -EYNELQRICRHYITE-IEKD---GETVDIHHAEVWT--DQNVYF--FVAEDNKDYELDEAEPINPRPHVLAVDSENESL 227 (510) Q Consensus 157 -~~~~~~~~~~~~~~~-~~~~---~~~~~~~~~e~y~--~~~i~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (510) ..+.+.. +...... ..+. -....+..+.+++ ++.+.. |+...++.+.... ...... T Consensus 144 ~~~g~l~~-v~lre~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~~~~~--------------~~~~~~ 208 (452) T protein:vir:94 144 DEDGRLLM-VVLREFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKVWELAK--------------TSTIQN 208 (452) T ss_pred cccCCeeE-EEEEEEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCceeeecc--------------ceeecC Confidence 2233322 2222221 1111 1223344444544 554433 3333333222111 112234 Q ss_pred ccccCCcccEEEecCCC----CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeee Q lcl|NC_013644. 228 LQRSYGQIPFYRLSNNK----QETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVG 303 (510) Q Consensus 228 ~~~~~g~iPvv~~~nn~----~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~ 303 (510) ..++|+.||+|.+.... .+.|-|.++-.|--+.....|++.+.+...++|++++.|.+... . ..+....++. T Consensus 209 ~~~~l~~IP~v~~~~~~~~~~~~~pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~~~~~--~--i~iG~~~~~~ 284 (452) T protein:vir:94 209 VGVTMDYIPFFCITPSGLSMTPAKPPMIDIVDINYSHYRTSADLEHGRHFTGLPTPWITGAESQS--T--MHIGSTKAWV 284 (452) T ss_pred CCcccceeEEEEEcCCCCCCCCCccchHHHHHHHHHHhcchhHHHHHHHHcccceeEeecCcCCC--c--eEeccccccc Confidence 56899999999986543 24566888888888888899999999999999999999975322 2 2244556777 Q ss_pred ccC-CCceeEEeecCC-HHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 304 TGS-DGGLDVKTVTIP-TEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMN 381 (510) Q Consensus 304 ~~~-~~~~~~~~~~~~-~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~ 381 (510) +++ +++++|++.+.+ .++.+..++.++++|.....- -+.....++.|++|.........+....+...++.++.+++ T Consensus 285 lpe~~~~~~yie~~g~~i~~~~~~l~~le~~m~~~Ga~-ll~~~~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al~~~l 363 (452) T protein:vir:94 285 IPEVAAKVGFLEFTGQGLQSLEKALSEKQAQLASLSAR-LIDNSTRGSEATETVKLRYMSETASLKSVTRAVEALLNKAY 363 (452) T ss_pred CCCCCCcceEEccCchhHHHHHHHHHHHHHHHHHHHHH-hhccCCCcchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH Confidence 885 889999998865 477899999999999886531 11222345678888777666666777777788899999999 Q ss_pred HHHHHHHhhccCCccccceeeEEeCCC--CCCCHHHHHHHHHHHHhcCCCchHHHHHhC---CCCCcHHHHHHHHHHHHH Q lcl|NC_013644. 382 KLVIDDINRRYTKAFDPTEVSFTFTRE--VMVNETDIVNDEKTEAETRKIILESILQVA---PRLDDDNVLRLICEQFDL 456 (510) Q Consensus 382 ~~i~~~~~~~~~~~~~~~~v~i~f~~~--~p~d~~e~~~~~~~~~~~g~iS~et~~~~~---~~v~d~e~~~~~~e~~e~ 456 (510) ++++.+++... ++.|..+.. .+.-..+.++.+.++.++|.||.+|++..+ +.++.+.+++++..+ T Consensus 364 ~~~a~w~g~~~-------~~~v~~n~dF~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~gvl~~~~e~~~i~~E--- 433 (452) T protein:vir:94 364 SCIMDMESMGG-------TLNIKLNSAFLDSKLTAAELKAWVEAYLSGGISKEIYIHALKVGKVLPPPGESMGVIPD--- 433 (452) T ss_pred HHHHHHcCCCC-------ceEEEeccccccccCCHHHHHHHHHHHhcCCCcHHHHHHHHHhCCCCCCccCHHHHHHH--- Confidence 99998876421 233333221 222235677778889999999999998866 433332211111111 Q ss_pred HHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcc Q lcl|NC_013644. 457 DWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPT 491 (510) Q Consensus 457 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (510) .....+.+.. +. +++++.. T Consensus 434 -------~~~~~~~~~~-~~--------~~~~~~~ 452 (452) T protein:vir:94 434 -------PPAPEPSPSN-TP--------PNPSSKA 452 (452) T ss_pred -------hhccCcccCC-CC--------CCCccCC Confidence 0001111110 11 1111111 No 76 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=99.93 E-value=2.5e-24 Score=149.93 Aligned_cols=453 Identities=9% Similarity=0.017 Sum_probs=248.4 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceecccccc-c----cccccc--cceeccc Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGIL-R----EDKYAS--NVRIPHG 73 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~-~----~~~~~~--~~ki~~n 73 (510) |.++-..-++- .....++..+++-+.|...+......+-.+.... . +..++. ..-+-.| T Consensus 32 m~dV~~~hp~y--------------~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~n 97 (535) T protein:vir:80 32 LPNVGYQRVEF--------------GEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIFYN 97 (535) T ss_pred CCCCCcCCHHH--------------HHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCCcCCHHHHHHHHhhccCCC Confidence 44432222221 1122345566677777655443322111110000 0 000111 0113479 Q ss_pred hhHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHh--ccCHHHHHHHHHHHHHhcCeEEEEEEECCCC------------ Q lcl|NC_013644. 74 FFPEIVDQKTQYLLSNPVEYETENEELKEYLAEYY--NSEFQVVLQELVEGSSQKGFEYVYARTNAED------------ 139 (510) Q Consensus 74 ~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~--~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g------------ 139 (510) +++.+++..++++|-+||.++. .+....++.++- .++++..+..++..++.+|+++++|.+...+ T Consensus 98 ~~~~tl~~l~G~vfrk~p~~~~-p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~~~~t~ade~~~~ 176 (535) T protein:vir:80 98 VTARTLDGMMGQVFSRDPIRQL-PPALEAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNVGRPVTVLEQKLGL 176 (535) T ss_pred hhHHHHHHHhchhhcCCcceec-cHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccHHHHHhcC Confidence 9999999999999999998853 344555555554 3678899999999999999999999876554 Q ss_pred -ceEEEEEcccceEEEEc-CCC-C--ceeEEEEEEEE--EeeCCceeEEEEEEEEcCC--cEEE---EEEcCCceeeccc Q lcl|NC_013644. 140 -RLCFQVADSLNVFGVYN-EYN-E--LQRICRHYITE--IEKDGETVDIHHAEVWTDQ--NVYF---FVAEDNKDYELDE 207 (510) Q Consensus 140 -~~~i~~~~p~~~~~~~d-~~~-~--~~~~~~~~~~~--~~~~~~~~~~~~~e~y~~~--~i~~---~~~~~~~~~~~~~ 207 (510) +|.+..++|.+++= |+ +.. . ....++..... ..++-....+.++.++..+ +.|. |+....+..... T Consensus 177 ~rPy~~~y~ae~Iin-W~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~RvL~~~~~G~y~v~~~~~~~~~~~~~~- 254 (535) T protein:vir:80 177 YRPTITLVHPTSIIN-WRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQWRVLQLNAEGNYQVERWRRETQEEMYYS- 254 (535) T ss_pred CCcEEEEechhhccC-ccccccCCccceeEEEEEEEEEecCCCcccceeEEEEEEEecCCceEEEEEEEeecCCccccc- Confidence 38899999998875 43 211 1 11222222221 1222234444455555542 2222 332221110000 Q ss_pred ccccccccccccccccccccccccCCcccEEEecCCC--C--CCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEec Q lcl|NC_013644. 208 AEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNK--Q--ETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSG 283 (510) Q Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~--~--g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g 283 (510) ...........|+|++||+|+|.... . +.+-|.++-.|.-+.-..-|++.+.+...++|++++.| T Consensus 255 -----------~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~il~~~~~P~l~i~G 323 (535) T protein:vir:80 255 -----------YSKHVPTDGNGNPFKEIPFQFIGPLDNNADIDHPPLLDLCEVNIGHYRNSADYEEMAFVAGQPTAFFTG 323 (535) T ss_pred -----------cceeecccCCCcccCeeEEEEeecCCCCCCCCccchHHHHHHHHHHhhchhHHHHHHHHhcCceeeeec Confidence 00111123456899999999885332 2 34557777777666777788899999999999999999 Q ss_pred CCCCchhhhh----HhhhcCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHH Q lcl|NC_013644. 284 FQGDDLSKLR----QNVKSKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARY 359 (510) Q Consensus 284 ~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~ 359 (510) ++.....+.. ..+....++.++++++++|+....+.-+. ..++.+.++|......+- ....++.+..+.+... T Consensus 324 ~~~~~~~~~~~~~~i~iG~~~~~~lP~~~~~~~~e~~~~~~a~-~~l~~~e~qM~~lGa~ll--~~~~~~~Ta~~a~~~~ 400 (535) T protein:vir:80 324 LTKDWVEDVFKDFKVHLGSRAIIPLPQGATAGILQITPNSVPF-EAMTHKESQMIAMGANLL--VKSGGNRTFGEAQQEE 400 (535) T ss_pred CchhhhhcCCCCcceEecCcccccCCCCCCcceeeeccchhHH-HHHHHHHHHHHHHHHHhh--ccCcccccHHHHHHHH Confidence 8654322211 11334457788999999999887655543 568888888877643221 2234455544445556 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCC-CCCC-HHHHHHHHHHHHhcCCCchHHHHHh Q lcl|NC_013644. 360 TLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTRE-VMVN-ETDIVNDEKTEAETRKIILESILQV 437 (510) Q Consensus 360 ~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~-~p~d-~~e~~~~~~~~~~~g~iS~et~~~~ 437 (510) +...+........++.++.+++++++.+++... +...+.|..+.. ...+ ..+.++.+.++.++|.||.+|++.. T Consensus 401 ~~~~S~L~~~a~~le~al~~aL~~~A~w~G~~~----~~~~~~i~~n~dF~~~~ld~~~~~all~~~~~G~Is~et~~~~ 476 (535) T protein:vir:80 401 ASEQSILSACTKNVSMAFRKALRWANQFQTGIV----NDETVEYNLNTDFPAARLTPNERAELILEWQQGAITFKEMRAG 476 (535) T ss_pred HHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCcc----CCCceEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHH Confidence 666677788888999999999999998876421 222344443321 2222 3456777888999999999999876 Q ss_pred C---CCCCc----HHHHHHHHHHHHHHHHHHHHHHHhhhccCCCC-CCCCCcccCCCCCCcccccccCcccc Q lcl|NC_013644. 438 A---PRLDD----DNVLRLICEQFDLDWEDVKEALEEAEYTKGLS-DNTDEEETAVNPDDPTQQMAEGATGS 501 (510) Q Consensus 438 ~---~~v~d----~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (510) | ..+++ ++++.+++. + ........ +...+ ..+..+....++++ ...+..|+ T Consensus 477 L~r~gvl~~~~~~eee~~ri~~------E--~~~~~~~~-g~~~d~~~~g~~~~~~~~~~----~~~~~~~~ 535 (535) T protein:vir:80 477 LRRAGVASEDDAKAETEGKATV------E--FIAKTAAA-GKVGDAASGGTNKAKLNNGN----GGGNQAGN 535 (535) T ss_pred HHhCCCCCcccchHHHHHHHHh------h--hhhccccC-CCCCCCCCCCCCcCcccCCc----cccccCCC Confidence 5 33322 111111100 0 00010010 00000 00000111111111 11122222 No 77 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=99.93 E-value=1.9e-24 Score=150.61 Aligned_cols=454 Identities=10% Similarity=0.016 Sum_probs=250.1 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceecc-----cccccccccccc-ce-eccc Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDD-----EGILREDKYASN-VR-IPHG 73 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~-----~~~~~~~~~~~~-~k-i~~n 73 (510) |.+.-..-++-. ....++..+++-+.|...+......+-.+ .....+..++.- .| +-.| T Consensus 1 m~~V~~~hp~y~--------------~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n 66 (501) T protein:vir:95 1 MPNVSFIRPELG--------------KLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYN 66 (501) T ss_pred CCCCCCCCHHHH--------------HHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCc Confidence 555433333211 12223555666777776543322111110 011000111110 11 3469 Q ss_pred hhHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHh--ccCHHHHHHHHHHHHHhcCeEEEEEEECCCC------------ Q lcl|NC_013644. 74 FFPEIVDQKTQYLLSNPVEYETENEELKEYLAEYY--NSEFQVVLQELVEGSSQKGFEYVYARTNAED------------ 139 (510) Q Consensus 74 ~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~--~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g------------ 139 (510) +++.+++..++++|-+||+++. .+....++.++- .++++..+..++..++.+|+++++|.++..+ T Consensus 67 ~~~~t~~~l~G~vf~k~p~~~~-p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~~~ 145 (501) T protein:vir:95 67 VARRTLFGLVGQVFMRDPVVKV-PALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADLEA 145 (501) T ss_pred hHHHHHHHHhhhhhcCCcceeC-cHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHHHh Confidence 9999999999999999999863 344555555553 3678899999999999999999999875432 Q ss_pred ---ceEEEEEcccceEEEEc-CCC-Cc--eeEEEEEEEEEeeCC--ceeEEEEEEEEcC--CcEEE---EEEcCCceeec Q lcl|NC_013644. 140 ---RLCFQVADSLNVFGVYN-EYN-EL--QRICRHYITEIEKDG--ETVDIHHAEVWTD--QNVYF---FVAEDNKDYEL 205 (510) Q Consensus 140 ---~~~i~~~~p~~~~~~~d-~~~-~~--~~~~~~~~~~~~~~~--~~~~~~~~e~y~~--~~i~~---~~~~~~~~~~~ 205 (510) +|.+..++|.+++= |+ +.. .. ...++........++ ....+..+.+.+. ++.+. |+....+...- T Consensus 146 ~~~rPy~~~~~~~~Iin-W~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~~~~~ 224 (501) T protein:vir:95 146 GRIRPTLYVYSPTEIIN-WRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQPTKADG 224 (501) T ss_pred ccCCcEEEEecHhhhcC-cceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEecCCcccCc Confidence 38899999988865 43 211 11 112222222111111 2233444444433 23332 33322211110 Q ss_pred ccccccccccccccccccccccccccCCcccEEEecCCCC----CCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEE Q lcl|NC_013644. 206 DEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQ----ETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVV 281 (510) Q Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~----g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~ 281 (510) . ..............+.....|+++.||+|++..... +.+-|.++-.|--+.-..-|++.+.+...++|++++ T Consensus 225 ~---~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~lA~lni~hy~~ssd~~~~l~~~~~P~l~i 301 (501) T protein:vir:95 225 S---KIPKGNYQQYVVYKPTDAQGKRLTEIPFMFIGSENNDSNPDNPNFYDLASLNMAHYRNSADYEESCYIVGQPTPVL 301 (501) T ss_pred c---eecCCcccccceeeeeccCCCcCCeeeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhHHHHHHHHcccceeee Confidence 0 000011111111222233458999999998744322 234455555554444455688888999999999999 Q ss_pred ecCCCCchhh---hhHhhhcCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHH Q lcl|NC_013644. 282 SGFQGDDLSK---LRQNVKSKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKAR 358 (510) Q Consensus 282 ~g~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~ 358 (510) +|.+...... ....+....++.++++++++|++.+.+.- .+..++.+.++|...... +-....++.||+|.+.. T Consensus 302 ~G~~~~~~~~~~~~~i~~G~~~~~~lP~~~~~~~ie~~~~~i-~~~~l~~l~~~m~~~Ga~--ll~~~~~~~Ta~~~~~~ 378 (501) T protein:vir:95 302 IGLTEEWVTNVLKGSVNFGSRGGIPLPVGADAKLLQASENTM-LKEAMDTKERQMVALGAK--LVEQKEVQRTATEAELE 378 (501) T ss_pred eCCcccccccCCCCceeecccccccCCCCCceeEEecChhhH-HHHHHHHHHHHHHHHHHh--hccCCccchhHHHHHHH Confidence 9976532111 11122334567788999999999865443 367788999988876432 22344567899999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCC-CCC-HHHHHHHHHHHHhcCCCchHHHHH Q lcl|NC_013644. 359 YTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREV-MVN-ETDIVNDEKTEAETRKIILESILQ 436 (510) Q Consensus 359 ~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~-p~d-~~e~~~~~~~~~~~g~iS~et~~~ 436 (510) .....+........++.++.+++++++.+++... ..++|..++.. +.. ..+.++.+.++.++|.||.+|+++ T Consensus 379 ~~~~~S~L~~~a~~le~al~~~l~~~a~w~g~~~------~~~~v~i~~df~~~~~~~~~~~al~~~~~~G~is~~t~~~ 452 (501) T protein:vir:95 379 AASEGSTLSSATKNVSAAFEWALKWAARWVGQAD------SGVKFELNTDFDIARMTPDERRSLVEEWQKGAITFEEMRT 452 (501) T ss_pred HHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCCC------CceEEEEecccccccCCHHHHHHHHHHHhCCCCcHHHHHH Confidence 8988999999999999999999999999987532 12333333332 222 345678888999999999999976 Q ss_pred hC---CCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcc Q lcl|NC_013644. 437 VA---PRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPT 491 (510) Q Consensus 437 ~~---~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (510) .+ ..++++ .....++.+++ ...................+.+-+++. T Consensus 453 ~L~~~~v~~~~-~~~e~e~i~~~--------~~~~~~~~~~~~~~~~~~gg~~~~~~~ 501 (501) T protein:vir:95 453 GLRKAGVATED-DSKAKEKIAKD--------TAEAMALATPANVPGDGSGGDNVGNSE 501 (501) T ss_pred HHHhCCCCChh-HHHHHHHHHhh--------hcCcccccccCCCCCCCcccccccCCC Confidence 54 333321 11101111110 000000011111111111111111111 No 78 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=99.92 E-value=2.7e-23 Score=144.31 Aligned_cols=447 Identities=9% Similarity=-0.024 Sum_probs=252.9 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccc-cccccccc--cceeccchhHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGI-LREDKYAS--NVRIPHGFFPE 77 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~-~~~~~~~~--~~ki~~n~~~~ 77 (510) |-+-..+-.+-. ..+.++ .....++..+++-|.|.+-...+..+. ....+ ..+..++. ..-+-.|+++. T Consensus 1 ~~~~~~~~~~V~----~~hp~y---~a~~~~W~~ird~~~G~~~~~~r~~yl-~~~~~~~~e~~Y~~rl~rA~~~n~~~~ 72 (489) T protein:vir:78 1 MLTENGQGSGVK----TKHREW---LHYAPKWQKVRHALAGELVSYLRNVGL-NEPDKAYGEARQAEYEAGGIVYNFTRR 72 (489) T ss_pred CccCCCccCCCC----ccCHHH---HHHHHHHHHHHHHhcCcccccccCCCC-CCCCCCCChHHHHHHHhccccCChHHH Confidence 322221111111 111111 122234666778888864221221111 11000 00111111 01134799999 Q ss_pred HHHHHHhhhhcCCceeccCcHHHHHHHHHHh--ccCHHHHHHHHHHHHHhcCeEEEEEEECCCC------------ceEE Q lcl|NC_013644. 78 IVDQKTQYLLSNPVEYETENEELKEYLAEYY--NSEFQVVLQELVEGSSQKGFEYVYARTNAED------------RLCF 143 (510) Q Consensus 78 Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~--~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g------------~~~i 143 (510) +++..++++|-+||+++.. +....++.++- .++++..+..++..++.+|+++++|.++..+ +|.+ T Consensus 73 tl~~l~G~vfrk~p~~~~p-~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ade~~~~~rPy~ 151 (489) T protein:vir:78 73 TLSGMVGSVMRKEPEINIP-KELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAEQNAGLLNPTI 151 (489) T ss_pred HHHHHhchhhcCCcceecc-HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHHHHHhcCCcEE Confidence 9999999999999998643 33555555554 3778899999999999999999999987655 5889 Q ss_pred EEEcccceEEEE-c--CCCCceeEEEEEEEEE--e--eCCceeEEEEEEEEcCC--c---EEEEEEcCCceeeccccccc Q lcl|NC_013644. 144 QVADSLNVFGVY-N--EYNELQRICRHYITEI--E--KDGETVDIHHAEVWTDQ--N---VYFFVAEDNKDYELDEAEPI 211 (510) Q Consensus 144 ~~~~p~~~~~~~-d--~~~~~~~~~~~~~~~~--~--~~~~~~~~~~~e~y~~~--~---i~~~~~~~~~~~~~~~~~~~ 211 (510) ..++|.+++=.- + +.......++...... + +.-....+..+.+++.+ + +..|+....+....... T Consensus 152 ~~~~~~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~g~~~~~~~--- 228 (489) T protein:vir:78 152 AFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQRLFRFDAEGGAQEDVV--- 228 (489) T ss_pred EEechhhhcCceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEEEEEEeecCCcccceee--- Confidence 999999886531 1 1111122232322211 1 23345566667777764 2 22233222221110000 Q ss_pred ccccccccccccccccccccCCcccEEEecCCC--C--CCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCC Q lcl|NC_013644. 212 NPRPHVLAVDSENESLLQRSYGQIPFYRLSNNK--Q--ETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGD 287 (510) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~--~--g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~ 287 (510) ........++++.||+|++.... . +.+-|.++-.|--+.-..-|++-+.+...++|++++.|.+.. T Consensus 229 ----------~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~G~d~~ 298 (489) T protein:vir:78 229 ----------EIYPDLGESLRGVIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGENL 298 (489) T ss_pred ----------EEeccCCCCccCeeeEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCccC Confidence 00112345789999999986432 2 344466666666666667888899999999999999997543 Q ss_pred chhhhh------HhhhcCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHH-hCCccccccccCcccHHHHHHHHH Q lcl|NC_013644. 288 DLSKLR------QNVKSKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKF-GMAFDSTQVGDGNITNIVIKARYT 360 (510) Q Consensus 288 ~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~-s~~p~~~~~~~g~~Sg~Ai~~~~~ 360 (510) ...... .-+.....+.++.+++++|++...+.- .+..++.+.++|... +.+.. ..++.|+++.+.... T Consensus 299 ~~~~~~~~~~~~i~~g~~~~~~lp~~~~~~~ie~~~~~~-~r~~l~~le~qm~~lGa~l~~----~~~~~Ta~~~~~~~~ 373 (489) T protein:vir:78 299 TPQAFKEANPNGIKFGSRRGHNLGYGGSAQLIQAGENNL-ARQNMLDKEQQAIQIGAQLIT----PTQQITAQSARIQRG 373 (489) T ss_pred CcccccccCccceeeCCcccccCCCCCCcceeccCcchH-HHHHHHHHHHHHHHHhhhhcc----CCcchhHHHHHHHHH Confidence 222111 012234466788899999998875443 477788888888875 44432 235688888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCC- Q lcl|NC_013644. 361 LLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAP- 439 (510) Q Consensus 361 ~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~- 439 (510) ...+........++.++.+++++++.+++....... ...+...|..... ..+.++.+.++.++|.||.+|++..|- T Consensus 374 ~~~S~L~~~a~~~e~al~~~l~~~a~w~G~~~~~~~-~i~~n~dF~~~~~--d~~~~~al~~~~~~G~is~~t~~~~L~~ 450 (489) T protein:vir:78 374 ADTSVMATIARNVSQAYTDALRWVAVMLGKPEDTEV-EFRLNMDFFLEPM--TAQDRAAWMADINAGLLPATAYYAALRK 450 (489) T ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCce-EEEeecccCcccC--CHHHHHHHHHHHhcCCCCHHHHHHHHHh Confidence 888889999999999999999999999876422110 0122333432211 245677788899999999999988652 Q ss_pred -CCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccc Q lcl|NC_013644. 440 -RLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQ 493 (510) Q Consensus 440 -~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (510) .+-|+..++.. .....++.+......++-+.+.-+. ++ T Consensus 451 ~gv~d~~~e~~~------------~ei~~~~~~~~~~~~g~~~~~~q~~----~~ 489 (489) T protein:vir:78 451 AGVTDWTDADIK------------DAVADQPLPVATEVQGEIPQSAQQQ----EK 489 (489) T ss_pred CCCCCccHHHHH------------HHHhhcCCCcccCCcccCCCCcccc----cC Confidence 23222211111 1111122111111111111111111 11 No 79 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=99.89 E-value=8.7e-22 Score=136.05 Aligned_cols=449 Identities=9% Similarity=-0.034 Sum_probs=248.2 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceecccccccccccccc--ceeccchhHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASN--VRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~--~ki~~n~~~~I 78 (510) |-+...+-.+-.. .+.++ .....++..+++-|.|.+-...+..+...-.....+..++.- .-+-.|+++.+ T Consensus 1 ~~~~~~~~~~V~~----~hp~y---~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~t 73 (491) T protein:vir:95 1 MLTANGQGSGVKT----KHREW---LHYAPKWQKVRHALAGDLVGYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRT 73 (491) T ss_pred CcccCCccCCCCc----cCHHH---HHHHHHHHHHHHHhcCcchhhcccCCCcCCCCCCCHHHHHHHHhcccCCChHHHH Confidence 3332222111110 11111 122334666777888854211111111100000111111110 11346999999 Q ss_pred HHHHHhhhhcCCceeccCcHHHHHHHHHHh--ccCHHHHHHHHHHHHHhcCeEEEEEEECCCC------------ceEEE Q lcl|NC_013644. 79 VDQKTQYLLSNPVEYETENEELKEYLAEYY--NSEFQVVLQELVEGSSQKGFEYVYARTNAED------------RLCFQ 144 (510) Q Consensus 79 v~~~~~~l~g~p~~~~~~d~~~~~~l~~~~--~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g------------~~~i~ 144 (510) ++..++++|-+||+++.. +....++.++- .++++..+..++..++.+|+++++|.++..+ +|.+. T Consensus 74 l~~l~G~vfrk~p~~~~p-~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~Ade~~~~~rPy~~ 152 (491) T protein:vir:95 74 LSGMVGSVMRKEPEINIP-KELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAAATAAEQNAGLLNPTIA 152 (491) T ss_pred HHHHhchhhcCCceeecc-HHHHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcccCHHHHHHhcCCcEEE Confidence 999999999999998643 34555656554 3778899999999999999999999887554 48899 Q ss_pred EEcccceEEEE-c--CCCCceeEEEEEEEE-Ee---eCCceeEEEEEEEEcC---Cc--EEEEEEcC-Cceeeccccccc Q lcl|NC_013644. 145 VADSLNVFGVY-N--EYNELQRICRHYITE-IE---KDGETVDIHHAEVWTD---QN--VYFFVAED-NKDYELDEAEPI 211 (510) Q Consensus 145 ~~~p~~~~~~~-d--~~~~~~~~~~~~~~~-~~---~~~~~~~~~~~e~y~~---~~--i~~~~~~~-~~~~~~~~~~~~ 211 (510) .++|.+++=.- + +.......++..... .. +.-....+..+.+++. +. +..|+... ++..... T Consensus 153 ~~~~~~IinW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~~~~~qyRvL~l~~~g~~~~~v~r~~~~g~~~~~~----- 227 (491) T protein:vir:95 153 FYTTENIVNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFETKYGEQYRVLDIDTDGNYRQRLFRFDAEGGAQEEV----- 227 (491) T ss_pred EechhhhcCceeeeeCCceeeeEEEEEEeEEeecCCCCcccceEEEEEEEeecCCCceEEEEEEEcCCCcceeee----- Confidence 99999886531 1 111112222232221 11 1223344445555543 32 22333322 1111111 Q ss_pred ccccccccccccccccccccCCcccEEEecCCC--C--CCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCC Q lcl|NC_013644. 212 NPRPHVLAVDSENESLLQRSYGQIPFYRLSNNK--Q--ETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGD 287 (510) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~--~--g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~ 287 (510) .........++++.||+|++.... . +.+-|.++-.|--+.-..-|++-+.+...++|++++.|.+.. T Consensus 228 ---------~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~~~G~d~~ 298 (491) T protein:vir:95 228 ---------VEIYPDLGESLRGVIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGDNL 298 (491) T ss_pred ---------eeeeecCCCcccCeeEEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCccc Confidence 001112345789999999986432 2 334466666665566667788889999999999999996543 Q ss_pred chhhhhH------hhhcCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHH-hCCccccccccCcccHHHHHHHHH Q lcl|NC_013644. 288 DLSKLRQ------NVKSKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKF-GMAFDSTQVGDGNITNIVIKARYT 360 (510) Q Consensus 288 ~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~-s~~p~~~~~~~g~~Sg~Ai~~~~~ 360 (510) ..+.... -+.....+.++.+++++|++...+.- .+..++.++.+|... +.+.. ..++.|+++.+.... T Consensus 299 ~~~~~~~~~~~~i~~g~~~~~~lP~~~~~~~ie~~~~~~-~~~~l~~~e~qm~~~Ga~l~~----~~~~~Ta~~~~~~~~ 373 (491) T protein:vir:95 299 TPQSFKEANPNGIKFGSRCGHNLGYGGSAQLIQAGENNL-ARQNMLDKEQQAIQIGAQLIT----PSQQITAESARIQRG 373 (491) T ss_pred CcchhhccCcceeEecCcCCcCCCCCCccceeecCcchH-HHHHHHHHHHHHHHHHHHhcc----CCcchhHHHHHHHHH Confidence 2221111 11223356677889999999875543 467788888877765 33332 235689999888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCC- Q lcl|NC_013644. 361 LLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAP- 439 (510) Q Consensus 361 ~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~- 439 (510) ...+........+..++.+++++++.+++....... .-.+...|..... ..+.++.+.++.++|.||.+|++..|- T Consensus 374 ~~~S~L~~~a~~~e~al~~~l~~~a~w~G~~~~~~v-~i~~n~dF~~~~~--~~~~~~all~~~~~G~is~~t~~~~L~~ 450 (491) T protein:vir:95 374 ADTSVMATIARNVSQAYTDALRWVAMMLGKPEDSEV-EFQLNMDFFLQPM--TAQDRAAWMADINAGLLPATAYYAALRK 450 (491) T ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCce-EEEeecccccccC--CHHHHHHHHHHHhcCCCCHHHHHHHHHh Confidence 889999999999999999999999999875421110 0122333432221 245677888999999999999987552 Q ss_pred -CCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccc Q lcl|NC_013644. 440 -RLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQ 493 (510) Q Consensus 440 -~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (510) .+-+...++... ..+.++ .....-.+.....++.-.++++ T Consensus 451 ~~vl~~~~e~~~~------------~ie~~~--~~~~~~~~~~~~~~~~~~~~~~ 491 (491) T protein:vir:95 451 AGVTDWTDEDILN------------AIEDAP--LPSGAVTQVAGEIPQAAQQQQE 491 (491) T ss_pred CCCCCccHHHHHH------------HHHhcC--CCCCccccccccchhhhhhccC Confidence 232221111111 111111 1111111111111111111111 No 80 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=99.87 E-value=1.1e-21 Score=135.40 Aligned_cols=432 Identities=12% Similarity=0.057 Sum_probs=231.7 Q ss_pred CC-CccCCChhhhHHHHHHHHHhhhhhhhHHHHHH-HHHHhccC--Ccchhcccceecccccccccccccc-c-e-eccc Q lcl|NC_013644. 1 ME-ALLSEDVKIIANALKAAIDKDRKSSSKREAET-GIRYYNHE--NDIMNNRIFYVDDEGILREDKYASN-V-R-IPHG 73 (510) Q Consensus 1 ~~-~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~-~~~YY~g~--~~i~~~~~~~~~~~~~~~~~~~~~~-~-k-i~~n 73 (510) |- +..+.+.......-. .+.. -.... ++. ...|-.-- ++-....+..+..........+... . | +-.| T Consensus 14 m~V~~~hp~y~a~~~~W~-~~~d-~g~~~---~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~~~~~~rA~~~n 88 (488) T protein:vir:96 14 MLTPIYHPDYLVNAPQWL-RNLD-CVMDN---IKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWEDLTWRLANYVN 88 (488) T ss_pred ecccccCHHHHHHhhhhh-Hhhh-hhhHH---HHHhhhhcCCCCCCccccccCcchhhhhhccchhhhHhhhhhccccCc Confidence 22 111111111100000 0100 00000 000 01111000 0000000000000000000000000 0 1 2359 Q ss_pred hhHHHHHHHHhhhhcCCceeccCc-HHHHHHHHHHh--ccCHHHHHHHHHHHHHhcCeEEEEEEECCCC----------- Q lcl|NC_013644. 74 FFPEIVDQKTQYLLSNPVEYETEN-EELKEYLAEYY--NSEFQVVLQELVEGSSQKGFEYVYARTNAED----------- 139 (510) Q Consensus 74 ~~~~Iv~~~~~~l~g~p~~~~~~d-~~~~~~l~~~~--~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g----------- 139 (510) +.+..++..++++|-+||+++..+ +....++.++- .++++.....+++.++.+|+++++|..++.+ T Consensus 89 ~~~~tl~~l~G~vfrk~p~~~~~~~~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~T~ade~~~~~ 168 (488) T protein:vir:96 89 IVNPTMNAITGAVMRREPEFDTMDNPVLIGLRDNIDGKGNGIDQECKQALNALQWGSRCGWLVRSHPESATMADWNKGKK 168 (488) T ss_pred hhHHHHHHhcchhhccCceeccCCcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCcCCHHHHHHhcC Confidence 999999999999999999997653 56767777665 4788999999999999999999999887644 Q ss_pred ceEEEEEcccceEEEEc-C-CCCcee--EEEEEEEE-EeeCC--ceeEEEEEEEEcCCcEEEEEEcCCceeecccccccc Q lcl|NC_013644. 140 RLCFQVADSLNVFGVYN-E-YNELQR--ICRHYITE-IEKDG--ETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPIN 212 (510) Q Consensus 140 ~~~i~~~~p~~~~~~~d-~-~~~~~~--~~~~~~~~-~~~~~--~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~ 212 (510) +|.+..++|.+++= |+ + .+.... .++..... +.++. .......+..++++.+..++...+.... T Consensus 169 rPy~~~~~a~~Iin-W~~~~v~G~~~L~~v~lrE~~~~~D~~~~~~~~~~~~~~l~~g~~~v~~~~~~~~~~-------- 239 (488) T protein:vir:96 169 LPTAAFYDALHIID-WEVEYIDGEEKLTYLSLLEDYQERDGGTYVSKQRLINHRLVDGLCEFQEVTDDEYSD-------- 239 (488) T ss_pred CcEEEEechhhhcC-cceeccCCceeeEEEEEEEEEEeccCCCcccceEEEEEEEECcEEEEEEEecCCccc-------- Confidence 48899999998875 33 1 111111 22222222 22221 2223333344666655544443322210 Q ss_pred cccccccccccccccccccCCcccEEEecCCCC----CCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCc Q lcl|NC_013644. 213 PRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQ----ETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDD 288 (510) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~----g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~ 288 (510) ...+.....++|+.||+|++..... +.+-|.++-.|--+.=..-|++-+.+.....|++++.+.+... T Consensus 240 --------e~~~~~~g~~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~Hy~~ssd~~~il~~~~~p~lv~~~~~~~~ 311 (488) T protein:vir:96 240 --------EWTPVLINSKQSDTIPFFLASSQSNEWCIDSTPLTSLAEISLSIYVMNAYSNKAMILANEAKWMVDMGDMNK 311 (488) T ss_pred --------ceEeecCCCcccCeeEEEEEecCCCCCCCCCCchHHHHHHHHHHHhhhhHHHHHHHhcCCceeeeccCCCCc Confidence 0011122457899999999864322 3455666666666666677888888888889988864333221 Q ss_pred hhhh-hH--hhhc-CeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHh-CCccccccccCcccHHHHHHHHHHHH Q lcl|NC_013644. 289 LSKL-RQ--NVKS-KKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFG-MAFDSTQVGDGNITNIVIKARYTLLN 363 (510) Q Consensus 289 ~~~~-~~--~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s-~~p~~~~~~~g~~Sg~Ai~~~~~~l~ 363 (510) .... .. .+.. .......+.|+++|+..+.+.- .+..++.++++|.... .++. ..++-||++.+....... T Consensus 312 ~~~~~~~~~g~~~~~~~~~~~~~g~~~~~e~~~~~l-~~~~l~~l~~qm~~~Ga~l~~----~~~~~Ta~~~~~~~~~~~ 386 (488) T protein:vir:96 312 TMASEMNPLGFTLAGRMPYYVKNGDVKVIQAQFSPE-TENKVEKLFEQAVKVGASLFT----QQSNETATGAAIRSGSST 386 (488) T ss_pred ccccccccceeeecccccccccCCceeecCCchhHH-HHHHHHHHHHHHHHHhHhhcc----CCCcchHHHHHHHHHHhh Confidence 1110 00 0111 1111223467788887665433 3677899999887754 3332 224568888888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCC-CCCC-HHHHHHHHHHHHhcCCCchHHHHHhC--- Q lcl|NC_013644. 364 MKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTRE-VMVN-ETDIVNDEKTEAETRKIILESILQVA--- 438 (510) Q Consensus 364 ~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~-~p~d-~~e~~~~~~~~~~~g~iS~et~~~~~--- 438 (510) +........++.++.+++++++.+++...... +..++++..++. .... ....++.+.++..+|.||.+|.++.+ T Consensus 387 S~L~~~a~~le~al~~~l~~~A~w~g~~~~~~-~~~~~~~~in~dF~~~~ld~~~~~al~~~~~~G~Is~~t~~~~L~~~ 465 (488) T protein:vir:96 387 ASMATLGNNVEDTVRNMLRFIMRYFEGTNLYV-NPDELVFKLNRDYFDVEVNPQMLQVAYAAMMEGNLPQVSWFELLKRA 465 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCc-CccceEEEeccCCCCccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhC Confidence 88899999999999999999999988654332 222344443322 2222 34567788899999999999998754 Q ss_pred CCCCcH-HHHHHHHHHHHHHHHHHHHHHHhhhccCCC Q lcl|NC_013644. 439 PRLDDD-NVLRLICEQFDLDWEDVKEALEEAEYTKGL 474 (510) Q Consensus 439 ~~v~d~-e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~ 474 (510) +.++++ ..++.+.+.++ . .-++ T Consensus 466 gvl~~d~~~e~~~~~ie~------------~--g~~~ 488 (488) T protein:vir:96 466 RVVRGDMSKEEFDEHIAE------------L--GFGM 488 (488) T ss_pred CcCCccCCHHHHHHHHhh------------c--CCCC Confidence 223221 11111111111 0 0111 No 81 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.81 E-value=7.8e-20 Score=125.33 Aligned_cols=495 Identities=11% Similarity=0.047 Sum_probs=225.8 Q ss_pred CCCc---------cCCChhhhHHHHHHHHHhhhh-----hhhHHHHHHHHHHhccCCcchhcccceeccccccccccccc Q lcl|NC_013644. 1 MEAL---------LSEDVKIIANALKAAIDKDRK-----SSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYAS 66 (510) Q Consensus 1 ~~~~---------~~~~~~~~~~~i~~~i~~~~~-----~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~ 66 (510) |+.. -..+.+...+...+++..++. ..-+....+..+||.|.|=- . ......+...+| T Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~-~-------~~~~~l~~~g~p 94 (776) T protein:vir:93 23 SPGEDAAQREKPANPLDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNIQWS-Q-------DEIDELKERGQA 94 (776) T ss_pred CCCCcccchhcccCCCCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCC-H-------HHHHHHHhcCCc Confidence 2222 112222333333333332221 12344566788999998610 0 000111223343 Q ss_pred cceeccchhHHHHHHHHhhhhcCCce--ecc---CcHHHHHH----HHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEEC Q lcl|NC_013644. 67 NVRIPHGFFPEIVDQKTQYLLSNPVE--YET---ENEELKEY----LAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTN 136 (510) Q Consensus 67 ~~ki~~n~~~~Iv~~~~~~l~g~p~~--~~~---~d~~~~~~----l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d 136 (510) .+++|..+.+|+..+++...+.+. +.+ ++.+..+. ++.+++ +++......+..+++++|.||+-++++ T Consensus 95 --~~~~N~i~~~i~~v~g~~~~nr~~~~~~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~af~d~~~~G~G~~~v~~d 172 (776) T protein:vir:93 95 --PTVYNVISQSVNWIIGSEKRGRSDFKVLPRRKDGGKAAERKTALLKYLSDVNHTPFERSMAFEETTKAGIGWLESQVQ 172 (776) T ss_pred --eEEecchHHHHHHHHHHHHhCCcceEEecCChhHHHHHHHHHHHHHHHHHhhcHHHHHHHHHHHhhhcCcceEEEEee Confidence 488999999999999998877544 433 23333343 444443 778889999999999999999999876 Q ss_pred CC--C-ceEEEEEcccceEEEEcCC-CC---ceeEEEEEEEEE------------------------------------- Q lcl|NC_013644. 137 AE--D-RLCFQVADSLNVFGVYNEY-NE---LQRICRHYITEI------------------------------------- 172 (510) Q Consensus 137 ~~--g-~~~i~~~~p~~~~~~~d~~-~~---~~~~~~~~~~~~------------------------------------- 172 (510) .+ + .+++.+++|.++|+=.+.. -+ -..+++...... T Consensus 173 ~~~~~~~~~~~~~~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 252 (776) T protein:vir:93 173 DENDGEPIYAGAESWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDIDGDDAMD 252 (776) T ss_pred ccCCCCceEeeccChhheeeccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccchhccccccccc Confidence 53 3 3556677888876422110 01 111111100000 Q ss_pred --------------eeCCceeEEEEEEEEcCCcEEEEEEcC--C--ceeecccccc-----c--cc----------cccc Q lcl|NC_013644. 173 --------------EKDGETVDIHHAEVWTDQNVYFFVAED--N--KDYELDEAEP-----I--NP----------RPHV 217 (510) Q Consensus 173 --------------~~~~~~~~~~~~e~y~~~~i~~~~~~~--~--~~~~~~~~~~-----~--~~----------~~~~ 217 (510) ........++.+|+|....+....... + .....+.... . .. .-.. T Consensus 253 ~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~~~~v~~~ 332 (776) T protein:vir:93 253 SPEYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAVSPMMRMHCA 332 (776) T ss_pred ccccccccccccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceeehheeeeeeEEE Confidence 000001123344555433221111110 0 0000000000 0 00 0000 Q ss_pred ccccc--cccccccccCCcccEEEecCC-----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh Q lcl|NC_013644. 218 LAVDS--ENESLLQRSYGQIPFYRLSNN-----KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS 290 (510) Q Consensus 218 ~~~~~--~~~~~~~~~~g~iPvv~~~nn-----~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~ 290 (510) ...+. ......+.+++++|+|+|+.. ..|.|.+..+++.++.+|..+|.+.+.+. ..++++..|.- ++.+ T Consensus 333 ~~~g~~~l~~~~~p~~~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l~--~~~~~~~~gav-~~~d 409 (776) T protein:vir:93 333 IMTTRDLMWAGPSPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKALYILS--TNKVLMEEGAV-DDID 409 (776) T ss_pred EEecchhhhccCCCCCCCccceEEecCceecccccccchHHhhhHHHHHHHHHHHHHHHhhc--CCceeeccccc-cchH Confidence 01111 112223456688999987653 34789999999999999999999988763 55666666653 2233 Q ss_pred hhhHh-hhcCeeeeccCCC--ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCc-ccHHHHHHHHHHHHHHH Q lcl|NC_013644. 291 KLRQN-VKSKKVVGTGSDG--GLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGN-ITNIVIKARYTLLNMKA 366 (510) Q Consensus 291 ~~~~~-~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Ai~~~~~~l~~k~ 366 (510) ++... .+.+.++.+..++ .+++.....-..++...+..+...|..+|++.+...+..+| .||+|+..+........ T Consensus 410 ~~~~~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~~~~~~~~~ 489 (776) T protein:vir:93 410 EFRREAARPDAVMTVKNGKLGAVKMDVDRDLAPAHLELASRSIQMIQQVGGVTDEMLGRTTNAVSGVAIQARQEQGSVAT 489 (776) T ss_pred HHHHhcccCCceeeeCCccccccccccCcCccHHHHHHHHHHHHHHHHhhCcChHHhCCCcchhhHHHHHHHHHHHHHHH Confidence 33322 3455666665554 33443333334678888899999999999888776665554 69999999888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCCc-------------c---c-----------cceeeEEeCCCCCCCHHHHHHH Q lcl|NC_013644. 367 NKTEARLRALLEWMNKLVIDDINRRYTKA-------------F---D-----------PTEVSFTFTREVMVNETDIVND 419 (510) Q Consensus 367 ~~k~~~~~~~l~~~~~~i~~~~~~~~~~~-------------~---~-----------~~~v~i~f~~~~p~d~~e~~~~ 419 (510) ....+.|..+++++.++++.++....... + + ..+|.|.=....+.-..+..+. T Consensus 490 ~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v~~~~~~~s~r~~~~~~ 569 (776) T protein:vir:93 490 NKLFDNLRLAFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFIIDEAEWRATMRQAAVAE 569 (776) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEEeecccchhHHHHHHHH Confidence 88888888888888888777764321100 0 0 0112222222222212222322 Q ss_pred HHHHHhcCCCchH-------HHHHhCCCCCcHHHHHHH----------------------HHH---HHHHHH-------- Q lcl|NC_013644. 420 EKTEAETRKIILE-------SILQVAPRLDDDNVLRLI----------------------CEQ---FDLDWE-------- 459 (510) Q Consensus 420 ~~~~~~~g~iS~e-------t~~~~~~~v~d~e~~~~~----------------------~e~---~e~~~~-------- 459 (510) ++.+.. .+..+ .+++..++-.-.+..+++ +.+ .+.... T Consensus 570 l~ql~~--~~~p~~~~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~~~~~e~~~~qq~q~~~~q~q~~~~~a~~~~ 647 (776) T protein:vir:93 570 LMEVIG--KMPPEIALTMLDLLVENMDIPNRDELVKRIRAVNGQKDPDQDEPTPEEIAREQAQQQQQQYNDALAIATLEE 647 (776) T ss_pred HHHHHh--hcChhhHHHHHHHHHHhcCccchHHHHHHHHHhhcccccchhhcchhHHHHHHHhhHHHHHHHHHhhhhhhH Confidence 332221 11111 111221110000000000 000 000000 Q ss_pred ----HHHHHHHhhhccCCCCCCC-CCcccCCCC---CCcccccccCccc----------ccccccCCCC Q lcl|NC_013644. 460 ----DVKEALEEAEYTKGLSDNT-DEEETAVNP---DDPTQQMAEGATG----------STESQLPENG 510 (510) Q Consensus 460 ----~~~~~~~~~~~~~~~~~~~-~~~~~~~~~---~~~~~~~~~~~~~----------~~~~~~~~~~ 510 (510) ..+...+............ ......... .-++.++....++ +...+.|..- T Consensus 648 ~qa~a~~~~aea~~~~aqa~~~~~~a~~~~~~a~q~a~qa~~~~~~~~~~a~~a~~~~~~a~~~~p~~p 716 (776) T protein:vir:93 648 QQAKARKAAAEAQVAEAKAKHISRMAIREGVGAVKDATDAATAIAFMPELAGLSDGILRESGWDDPNTP 716 (776) T ss_pred hhHHHHHHHHHHHHHhhhhhhhhhcchhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhccccccccccc Confidence 0000000000000000000 000000000 0000000000000 0000111111 No 82 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=99.76 E-value=3.1e-16 Score=105.65 Aligned_cols=488 Identities=14% Similarity=0.102 Sum_probs=234.4 Q ss_pred CCCc-----------cCCChhhhHHHHHHHHHhhh-----hhhhHHHHHHHHHHhccCCcchhcccceeccccccccccc Q lcl|NC_013644. 1 MEAL-----------LSEDVKIIANALKAAIDKDR-----KSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKY 64 (510) Q Consensus 1 ~~~~-----------~~~~~~~~~~~i~~~i~~~~-----~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~ 64 (510) -+.. ...+.+.+...+..+.+.++ ....+....+..+||.|.|= .. ......+... T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw-~~-------~~~~~l~~~g 77 (711) T protein:vir:10 6 KKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQW-PS-------QVRTERELEQ 77 (711) T ss_pred ccccccchhHHHHHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCC-CH-------HHHHHHHhcC Confidence 1111 11122233333444433322 12345557778999999861 00 0001112233 Q ss_pred cccceeccchhHHHHHHHHhhhhcCCcee--cc-------------------------CcHHHHHHHHH----Hhc-cCH Q lcl|NC_013644. 65 ASNVRIPHGFFPEIVDQKTQYLLSNPVEY--ET-------------------------ENEELKEYLAE----YYN-SEF 112 (510) Q Consensus 65 ~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~--~~-------------------------~d~~~~~~l~~----~~~-n~~ 112 (510) +| .+++|..+.+|+..+++--.+.+.+ .+ ++.+..+.|+. +.+ ++. T Consensus 78 ~p--~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~ 155 (711) T protein:vir:10 78 RP--CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDA 155 (711) T ss_pred CC--cEEEcchHHHHHHHhhhHhhCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHHhcCh Confidence 33 4789999999999999988776554 22 23344444444 333 667 Q ss_pred HHHHHHHHHHHHhcCeEEEEEEECC------CCceEEEEE-cccceEEEEcC-CC-----CceeEEEEEEEEEe------ Q lcl|NC_013644. 113 QVVLQELVEGSSQKGFEYVYARTNA------EDRLCFQVA-DSLNVFGVYNE-YN-----ELQRICRHYITEIE------ 173 (510) Q Consensus 113 ~~~~~e~~~~~~~~G~~~~~v~~d~------~g~~~i~~~-~p~~~~~~~d~-~~-----~~~~~~~~~~~~~~------ 173 (510) ......+..+++++|.||+-++.|. +|.++|..+ +|.++| ||. .. +-..+++....... T Consensus 156 ~~~~s~af~d~~~~G~G~~ev~~d~~~~d~~~~e~~i~~v~~p~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~y 233 (711) T protein:vir:10 156 ETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALY 233 (711) T ss_pred hHHHHHHHHHhhhcCcceEEEEecccCCCCCCCCeEEeeecChhhee--eCccccccChhhhcceeeeecCCHHHHHHhC Confidence 7888899999999999998876542 478888887 688864 453 11 11112221111100 Q ss_pred --------------eCC---ceeEEEEEEEEcCCcEEE--EEEcCCceeeccccccc-----c--------------c-c Q lcl|NC_013644. 174 --------------KDG---ETVDIHHAEVWTDQNVYF--FVAEDNKDYELDEAEPI-----N--------------P-R 214 (510) Q Consensus 174 --------------~~~---~~~~~~~~e~y~~~~i~~--~~~~~~~~~~~~~~~~~-----~--------------~-~ 214 (510) +.+ ....++..++|......+ +....+........... . . + T Consensus 234 p~~a~~~~~~~~~~~~~~~~~~~~vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~ 313 (711) T protein:vir:10 234 PDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTY 313 (711) T ss_pred CchhhhhhhcccccccCcccCcceeeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhceeeEE Confidence 000 012233445554432211 11111111111100000 0 0 0 Q ss_pred cccccccccccccccccCCcccEEEecCC-------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEE-ecCCC Q lcl|NC_013644. 215 PHVLAVDSENESLLQRSYGQIPFYRLSNN-------KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVV-SGFQG 286 (510) Q Consensus 215 ~~~~~~~~~~~~~~~~~~g~iPvv~~~nn-------~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~-~g~~~ 286 (510) -....+........|.+.+++|+|+|.-. ..+.|.+..+++.++.+|...|.+...+...+.+.+++ .|. . T Consensus 314 ~~~~~G~~~L~~~~p~~~~~~P~vp~~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~~ga-i 392 (711) T protein:vir:10 314 WRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGN-V 392 (711) T ss_pred EEEEecceeecCCCCCCCCcccEEEEeeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCceeecCcc-c Confidence 00011111222334556688888877422 23568899999999999999999999998888866654 443 3 Q ss_pred Cchhh-hhH-hhhcCeeeeccCCC----ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccC-cccHHHHHHHH Q lcl|NC_013644. 287 DDLSK-LRQ-NVKSKKVVGTGSDG----GLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDG-NITNIVIKARY 359 (510) Q Consensus 287 ~~~~~-~~~-~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g-~~Sg~Ai~~~~ 359 (510) ++..+ +.. ..+.+.++.+..++ .++++....-..++...++.....|-..|++.+...+..+ +.||+|+..+. T Consensus 393 ~~~~~~~~e~~~~~~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~q 472 (711) T protein:vir:10 393 EGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQ 472 (711) T ss_pred CChHHHHHhccccCCCeeEecccccCcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCccchHHHHHHHHH Confidence 33333 222 23445566555443 3566555556678888899999999999987776555544 46999999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc----------cC---Ccc---c--------------------cceeeE Q lcl|NC_013644. 360 TLLNMKANKTEARLRALLEWMNKLVIDDINRR----------YT---KAF---D--------------------PTEVSF 403 (510) Q Consensus 360 ~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~----------~~---~~~---~--------------------~~~v~i 403 (510) .............+..+++++.++++.++... +. ..+ + ..+|.| T Consensus 473 ~qg~~~l~~~~dn~~~~~~~~g~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i 552 (711) T protein:vir:10 473 RQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVV 552 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccceeeeccceeeeEEEE Confidence 88888888888888888888888777776421 10 000 0 012222 Q ss_pred EeCCCCCCCHHHHHHHHHHHHhcCCCch------HHHHHhCCCCCcHHHHHHHH------------------HHHHHHHH Q lcl|NC_013644. 404 TFTREVMVNETDIVNDEKTEAETRKIIL------ESILQVAPRLDDDNVLRLIC------------------EQFDLDWE 459 (510) Q Consensus 404 ~f~~~~p~d~~e~~~~~~~~~~~g~iS~------et~~~~~~~v~d~e~~~~~~------------------e~~e~~~~ 459 (510) .=.+..+.-..+.+..++.+.. .++. ..+++.+++..-++..+++. ...+.+.. T Consensus 553 ~~~p~~~s~r~~~~~~l~ql~~--~~p~~~~~~~~~il~~~d~p~~~el~e~lr~~~~~~~~~~~~~~~~qq~~~e~qq~ 630 (711) T protein:vir:10 553 TTGPAFATQRIEAAEAMIQFAQ--AVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEP 630 (711) T ss_pred eeccCchhHHHHHHHHHHHHHh--hcchhhhHHHHHHHHhcCCCCHHHHHHHHHhhcCcccCcchhhhHHHHHHHHHHHH Confidence 2233333333333333333322 1221 12333333221111111110 00000000 Q ss_pred HHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCccccccc---ccCC-C----C Q lcl|NC_013644. 460 DVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTES---QLPE-N----G 510 (510) Q Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~-~----~ 510 (510) ..+................ .+.... ..+ ...++....+ +.+. . . T Consensus 631 ~~~~q~~~~~~q~~~~qa~-ae~~~A----qae--~~qa~~e~~~~q~q~~~~~~~aq~ 682 (711) T protein:vir:10 631 TPEQQVEMAKSQADMAQAE-ADTAQA----QAD--MLKAQLETEEAQKQLAMIEDMAQG 682 (711) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHH----HHH--HHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000000000000000000 000000 000 0001111100 0000 0 0 No 83 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.75 E-value=1.6e-17 Score=112.70 Aligned_cols=428 Identities=11% Similarity=0.065 Sum_probs=208.9 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCC-cchhcccceeccccccccccccc--cceeccchhHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHEN-DIMNNRIFYVDDEGILREDKYAS--NVRIPHGFFPE 77 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~-~i~~~~~~~~~~~~~~~~~~~~~--~~ki~~n~~~~ 77 (510) |-.+..... +.+... ...+..|..|-. ...+ ....+...+......... ..--.+.+++. T Consensus 1 ~~~~~~a~~--------~~~~~~--------a~~~~~~~~~~g~~~~~-d~~~~~~~~~~~~~~~~~l~~lY~~~~l~r~ 63 (461) T protein:vir:80 1 MYSIDKAKQ--------AKIDSK--------IVNRNDFMVGHGKANSR-DKLTRQTPGNGQKLDLKACENLYASNSIAMN 63 (461) T ss_pred Cccchhhhh--------hhhhhh--------hhhhhHHHhhcCCcchh-hhhhccccCcccccCHHHHHHHHHhCCccch Confidence 222211110 001100 000111221110 0000 000000000000000000 00013578899 Q ss_pred HHHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCC---ceEEEEEcccceEE Q lcl|NC_013644. 78 IVDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAED---RLCFQVADSLNVFG 153 (510) Q Consensus 78 Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g---~~~i~~~~p~~~~~ 153 (510) ||+..+..++-+++.+++++++..+.++.+|+ -+....+.++.+.+..+|.|++++-..... ..-...+.|... T Consensus 64 iVd~~a~d~~r~g~~i~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~~pl~~~~~-- 141 (461) T protein:vir:80 64 IVDIISEDMVRAGWSLKTDNKEMKKNIESKWRKLKTKDRFQKLYADKRLYGDGFLSIGVVSSNREQADLSTAIDPKTI-- 141 (461) T ss_pred hhccchHHhhcCCeeeecCCHHHHHHHHHHHHHhhHHHHHHHHHHhhcccccEEEEEEeecCCccccCccCCcccccc-- Confidence 99999999999999999999988888888875 467788999999999999999887653221 111111222110 Q ss_pred EEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCC Q lcl|NC_013644. 154 VYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYG 233 (510) Q Consensus 154 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 233 (510) ..+..+..+|...............-.++.|.. |++..........+. .. .......+. T Consensus 142 -----~~~~~l~~~~~~~i~~~~~~~dp~sp~fg~P~~-y~i~~~~~~~~~~~~-~~--------------~~~~~~~iH 200 (461) T protein:vir:80 142 -----KSIPYINTFNTQKVTQLYLNQDMFSEHFGEVEF-FEVNRVSQLGEEILS-GT--------------TASTSEQIH 200 (461) T ss_pred -----cceeEEEeccccccchhhhcccCcCcccccceE-EEEeccccccccccc-cc--------------cCccceEEc Confidence 000000000000000000000000001112211 111111000000000 00 000001112 Q ss_pred cccEEEecCC-----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCC---CchhhhhHh----hhcCee Q lcl|NC_013644. 234 QIPFYRLSNN-----KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQG---DDLSKLRQN----VKSKKV 301 (510) Q Consensus 234 ~iPvv~~~nn-----~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~---~~~~~~~~~----~~~~~~ 301 (510) .-+|++|.+. -.|+|.++.+.+.+.+++.+.-..+..+..+..+.+...+... +........ ....++ T Consensus 201 ~SRii~~~~~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~~~~~~~~g~ 280 (461) T protein:vir:80 201 RSRIIHEQGLRFEGETKGRSIFESLYDIITVMDTSLWSVGQILYDFAFKVYKTDDIDALNKDDKANLTAMLDFMFRTEAL 280 (461) T ss_pred cccEEEecCCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCCceecchHHhhhchHHHHHHHHHHHhcCCceE Confidence 2355666543 3589999999999999999998888888777777776655321 111111111 123446 Q ss_pred eeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccc---ccCcccHHHHHHHHHHHHHHHHHHH-HHHHHHH Q lcl|NC_013644. 302 VGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQV---GDGNITNIVIKARYTLLNMKANKTE-ARLRALL 377 (510) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~---~~g~~Sg~Ai~~~~~~l~~k~~~k~-~~~~~~l 377 (510) +.++.+.+++.++ .+.......++.+...|...+++|-+-.. .++++||..=. .....++..++ ..++..| T Consensus 281 ~~~d~~e~~e~~~--~~lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~asge~D~---~~yyd~i~~~qe~~l~p~l 355 (461) T protein:vir:80 281 AIIKGDEQLTKES--TNVSGMKDLLDYGWDYLAGAVRMPKTVLKGQEAGTLTGAQYDV---MNYYARVSSIQENRLRPQL 355 (461) T ss_pred EEEcCCcceEEEe--cCcCCHHHHHHHHHHHHhhhhcCCeeeeecccCCccccchHHH---HHHHHHHHHHHHHHHHHHH Confidence 6667666655544 56667889999999999999999986432 24567776522 22344555555 5688999 Q ss_pred HHHHHHHHHHHhhcc-CCccccceeeEEeCCCCCCCHHHHHHH-------HHHHHhcCCCchHHHHHhCCCCCcHHHHHH Q lcl|NC_013644. 378 EWMNKLVIDDINRRY-TKAFDPTEVSFTFTREVMVNETDIVND-------EKTEAETRKIILESILQVAPRLDDDNVLRL 449 (510) Q Consensus 378 ~~~~~~i~~~~~~~~-~~~~~~~~v~i~f~~~~p~d~~e~~~~-------~~~~~~~g~iS~et~~~~~~~v~d~e~~~~ 449 (510) ++++++++......+ ..+.+..+++|.|++-.+.+++|+++. +.++.++|++|.+++.+.+- T Consensus 356 e~l~~~i~~s~~~~~~~~~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~~~~g~is~~e~r~~l~---------- 425 (461) T protein:vir:80 356 EYLTRLLMWASDDCGPSIDPDSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQIYIVNGVLDPDEVKETRF---------- 425 (461) T ss_pred HHHHHHHHHHhcccccccCccccceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHH---------- Confidence 999988775433222 223345689999999999999999876 44555556555554433210 Q ss_pred HHHHHHHHHHHHHHHHHhhhccCCCCCCCCC--cccCCCCCCcccccccC Q lcl|NC_013644. 450 ICEQFDLDWEDVKEALEEAEYTKGLSDNTDE--EETAVNPDDPTQQMAEG 497 (510) Q Consensus 450 ~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~ 497 (510) ......+ .......+++ +-...+.+.+.+++++| T Consensus 426 -------------~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~e~~~g 461 (461) T protein:vir:80 426 -------------GRFGLEN-SSKFSGDSAEIDKLAKLVYDAYAKKNADG 461 (461) T ss_pred -------------HhcCCCC-CccCCCCCchhhhhhhhccccccccCCCC Confidence 0000011 0011111111 11111222233333333 No 84 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=99.71 E-value=1.2e-15 Score=102.45 Aligned_cols=471 Identities=11% Similarity=0.088 Sum_probs=225.8 Q ss_pred CCCccC-----CChhhhHH-------HHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccc Q lcl|NC_013644. 1 MEALLS-----EDVKIIAN-------ALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNV 68 (510) Q Consensus 1 ~~~~~~-----~~~~~~~~-------~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ 68 (510) |++=.. .+.+++.+ ++...++.+. .-|....+..+||.|.|= .. ......+..++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~R~~a~~d~~fy~G~Qw-~~-------~~~~~l~~~g~p-- 68 (714) T protein:vir:99 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQP--KWRDAANKACAYYDGDQL-PP-------EVLQVLKDRGQP-- 68 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhH--HHHHHHHHHHHhhcCCCC-CH-------HHHHHHHhcCCC-- Confidence 433211 11122222 2333333332 235567788999999861 00 011111223344 Q ss_pred eeccchhHHHHHHHHhhhhcCCcee--ccC--c-H--HHHHH----HHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEEC Q lcl|NC_013644. 69 RIPHGFFPEIVDQKTQYLLSNPVEY--ETE--N-E--ELKEY----LAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTN 136 (510) Q Consensus 69 ki~~n~~~~Iv~~~~~~l~g~p~~~--~~~--d-~--~~~~~----l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d 136 (510) .+++|..+.+|+..+++--.+.+.+ .+. + + +..+. ++.+++ ++.......+..+++++|.||+-++.+ T Consensus 69 ~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~ 148 (714) T protein:vir:99 69 MTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRN 148 (714) T ss_pred cEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccc Confidence 5789999999999999998777664 331 2 2 23343 444444 678888899999999999999988876 Q ss_pred CC---CceEEEEEcccceEEEEcCC----CCceeEE-EEEEEEE------e----------------------------- Q lcl|NC_013644. 137 AE---DRLCFQVADSLNVFGVYNEY----NELQRIC-RHYITEI------E----------------------------- 173 (510) Q Consensus 137 ~~---g~~~i~~~~p~~~~~~~d~~----~~~~~~~-~~~~~~~------~----------------------------- 173 (510) .+ +.++|..++|.++|.=.+.. .+-..++ +.|.... . T Consensus 149 ~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~ 228 (714) T protein:vir:99 149 SDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPL 228 (714) T ss_pred cCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccccccc Confidence 43 56899999999977432110 0111111 1110000 0 Q ss_pred -----------------eCCceeEEEEEEEEcCCcEEE--EEEcCCceeecccccccc--------------c-----cc Q lcl|NC_013644. 174 -----------------KDGETVDIHHAEVWTDQNVYF--FVAEDNKDYELDEAEPIN--------------P-----RP 215 (510) Q Consensus 174 -----------------~~~~~~~~~~~e~y~~~~i~~--~~~~~~~~~~~~~~~~~~--------------~-----~~ 215 (510) .+.....+..+|+|....... +...+|.....+...... . .. T Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~ 308 (714) T protein:vir:99 229 MSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREA 308 (714) T ss_pred ccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEE Confidence 000011223345554322221 111122222111110000 0 00 Q ss_pred ccccccccccccccccCCcccEEEecCC---CC--CCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh Q lcl|NC_013644. 216 HVLAVDSENESLLQRSYGQIPFYRLSNN---KQ--ETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS 290 (510) Q Consensus 216 ~~~~~~~~~~~~~~~~~g~iPvv~~~nn---~~--g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~ 290 (510) +............|.+.+++|+|+|+-. .. ..|.+..+++.++.+|...|.+...+ .++..++..|....... T Consensus 309 ~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~d~ 386 (714) T protein:vir:99 309 WFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSDN 386 (714) T ss_pred EEecCcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcccccHH Confidence 0000000111122334456666665432 22 34778889999999999999988866 45555555554333222 Q ss_pred hhhHh-hhcCeeeeccCCC--------ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCc-ccHHHHHHHHH Q lcl|NC_013644. 291 KLRQN-VKSKKVVGTGSDG--------GLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGN-ITNIVIKARYT 360 (510) Q Consensus 291 ~~~~~-~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Ai~~~~~ 360 (510) .+... -+.+.++....+. .++......-...+...+......|-.+|++-+...+..+| .||+|+..+-. T Consensus 387 ~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~ 466 (714) T protein:vir:99 387 DLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVE 466 (714) T ss_pred HHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHH Confidence 33222 2334444443221 12333223345677788888888898888877765555444 59999998888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----------ccCCcc-c-----------------------cceeeEEeC Q lcl|NC_013644. 361 LLNMKANKTEARLRALLEWMNKLVIDDINR----------RYTKAF-D-----------------------PTEVSFTFT 406 (510) Q Consensus 361 ~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~----------~~~~~~-~-----------------------~~~v~i~f~ 406 (510) ............+..+.+++.++++.++.. .+..+- . ..+|.|.=. T Consensus 467 qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~ 546 (714) T protein:vir:99 467 QGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPV 546 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeec Confidence 777777777777777777777777766532 111000 0 012333333 Q ss_pred CCCCCCHHHHHHHHHHHHhc-----CCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCc Q lcl|NC_013644. 407 REVMVNETDIVNDEKTEAET-----RKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEE 481 (510) Q Consensus 407 ~~~p~d~~e~~~~~~~~~~~-----g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (510) +..|.-..+.++.++.+.+. +.+....+++.+++-.-++..+++.+ . .+.+.. .+.. T Consensus 547 p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~-----------~---~~~~~~----~~~~ 608 (714) T protein:vir:99 547 QQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRA-----------A---LGTPKS----PDEM 608 (714) T ss_pred cCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHH-----------H---cCCCCC----cccc Confidence 44444445555555555543 11223445555554221222111111 0 010000 0000 Q ss_pred ccCCCCCCcccccccCcccccccccC-----------CCC Q lcl|NC_013644. 482 ETAVNPDDPTQQMAEGATGSTESQLP-----------ENG 510 (510) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~ 510 (510) . +.+++....+.....+.. .+- T Consensus 609 ~-------~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~ 641 (714) T protein:vir:99 609 T-------PEEQEVAAQQQALQQQQAELQMREMAGRVAKL 641 (714) T ss_pred c-------hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 001111001100000000 000 No 85 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=99.71 E-value=1.2e-15 Score=102.45 Aligned_cols=471 Identities=11% Similarity=0.088 Sum_probs=225.8 Q ss_pred CCCccC-----CChhhhHH-------HHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccc Q lcl|NC_013644. 1 MEALLS-----EDVKIIAN-------ALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNV 68 (510) Q Consensus 1 ~~~~~~-----~~~~~~~~-------~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ 68 (510) |++=.. .+.+++.+ ++...++.+. .-|....+..+||.|.|= .. ......+..++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~R~~a~~d~~fy~G~Qw-~~-------~~~~~l~~~g~p-- 68 (714) T protein:vir:32 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQP--KWRDAANKACAYYDGDQL-PP-------EVLQVLKDRGQP-- 68 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhH--HHHHHHHHHHHhhcCCCC-CH-------HHHHHHHhcCCC-- Confidence 433211 11122222 2333333332 235567788999999861 00 011111223344 Q ss_pred eeccchhHHHHHHHHhhhhcCCcee--ccC--c-H--HHHHH----HHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEEC Q lcl|NC_013644. 69 RIPHGFFPEIVDQKTQYLLSNPVEY--ETE--N-E--ELKEY----LAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTN 136 (510) Q Consensus 69 ki~~n~~~~Iv~~~~~~l~g~p~~~--~~~--d-~--~~~~~----l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d 136 (510) .+++|..+.+|+..+++--.+.+.+ .+. + + +..+. ++.+++ ++.......+..+++++|.||+-++.+ T Consensus 69 ~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~ 148 (714) T protein:vir:32 69 MTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRN 148 (714) T ss_pred cEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccc Confidence 5789999999999999998777664 331 2 2 23343 444444 678888899999999999999988876 Q ss_pred CC---CceEEEEEcccceEEEEcCC----CCceeEE-EEEEEEE------e----------------------------- Q lcl|NC_013644. 137 AE---DRLCFQVADSLNVFGVYNEY----NELQRIC-RHYITEI------E----------------------------- 173 (510) Q Consensus 137 ~~---g~~~i~~~~p~~~~~~~d~~----~~~~~~~-~~~~~~~------~----------------------------- 173 (510) .+ +.++|..++|.++|.=.+.. .+-..++ +.|.... . T Consensus 149 ~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~ 228 (714) T protein:vir:32 149 SDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPL 228 (714) T ss_pred cCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccccccc Confidence 43 56899999999977432110 0111111 1110000 0 Q ss_pred -----------------eCCceeEEEEEEEEcCCcEEE--EEEcCCceeecccccccc--------------c-----cc Q lcl|NC_013644. 174 -----------------KDGETVDIHHAEVWTDQNVYF--FVAEDNKDYELDEAEPIN--------------P-----RP 215 (510) Q Consensus 174 -----------------~~~~~~~~~~~e~y~~~~i~~--~~~~~~~~~~~~~~~~~~--------------~-----~~ 215 (510) .+.....+..+|+|....... +...+|.....+...... . .. T Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~ 308 (714) T protein:vir:32 229 MSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREA 308 (714) T ss_pred ccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEE Confidence 000011223345554322221 111122222111110000 0 00 Q ss_pred ccccccccccccccccCCcccEEEecCC---CC--CCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh Q lcl|NC_013644. 216 HVLAVDSENESLLQRSYGQIPFYRLSNN---KQ--ETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS 290 (510) Q Consensus 216 ~~~~~~~~~~~~~~~~~g~iPvv~~~nn---~~--g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~ 290 (510) +............|.+.+++|+|+|+-. .. ..|.+..+++.++.+|...|.+...+ .++..++..|....... T Consensus 309 ~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~d~ 386 (714) T protein:vir:32 309 WFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSDN 386 (714) T ss_pred EEecCcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcccccHH Confidence 0000000111122334456666665432 22 34778889999999999999988866 45555555554333222 Q ss_pred hhhHh-hhcCeeeeccCCC--------ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCc-ccHHHHHHHHH Q lcl|NC_013644. 291 KLRQN-VKSKKVVGTGSDG--------GLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGN-ITNIVIKARYT 360 (510) Q Consensus 291 ~~~~~-~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Ai~~~~~ 360 (510) .+... -+.+.++....+. .++......-...+...+......|-.+|++-+...+..+| .||+|+..+-. T Consensus 387 ~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~ 466 (714) T protein:vir:32 387 DLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVE 466 (714) T ss_pred HHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHH Confidence 33222 2334444443221 12333223345677788888888898888877765555444 59999998888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----------ccCCcc-c-----------------------cceeeEEeC Q lcl|NC_013644. 361 LLNMKANKTEARLRALLEWMNKLVIDDINR----------RYTKAF-D-----------------------PTEVSFTFT 406 (510) Q Consensus 361 ~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~----------~~~~~~-~-----------------------~~~v~i~f~ 406 (510) ............+..+.+++.++++.++.. .+..+- . ..+|.|.=. T Consensus 467 qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~ 546 (714) T protein:vir:32 467 QGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPV 546 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeec Confidence 777777777777777777777777766532 111000 0 012333333 Q ss_pred CCCCCCHHHHHHHHHHHHhc-----CCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCc Q lcl|NC_013644. 407 REVMVNETDIVNDEKTEAET-----RKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEE 481 (510) Q Consensus 407 ~~~p~d~~e~~~~~~~~~~~-----g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (510) +..|.-..+.++.++.+.+. +.+....+++.+++-.-++..+++.+ . .+.+.. .+.. T Consensus 547 p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~-----------~---~~~~~~----~~~~ 608 (714) T protein:vir:32 547 QQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRA-----------A---LGTPKS----PDEM 608 (714) T ss_pred cCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHH-----------H---cCCCCC----cccc Confidence 44444445555555555543 11223445555554221222111111 0 010000 0000 Q ss_pred ccCCCCCCcccccccCcccccccccC-----------CCC Q lcl|NC_013644. 482 ETAVNPDDPTQQMAEGATGSTESQLP-----------ENG 510 (510) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~ 510 (510) . +.+++....+.....+.. .+- T Consensus 609 ~-------~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~ 641 (714) T protein:vir:32 609 T-------PEEQEVAAQQQALQQQQAELQMREMAGRVAKL 641 (714) T ss_pred c-------hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 001111001100000000 000 No 86 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=99.71 E-value=1.2e-15 Score=102.45 Aligned_cols=471 Identities=11% Similarity=0.088 Sum_probs=225.8 Q ss_pred CCCccC-----CChhhhHH-------HHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccc Q lcl|NC_013644. 1 MEALLS-----EDVKIIAN-------ALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNV 68 (510) Q Consensus 1 ~~~~~~-----~~~~~~~~-------~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ 68 (510) |++=.. .+.+++.+ ++...++.+. .-|....+..+||.|.|= .. ......+..++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~R~~a~~d~~fy~G~Qw-~~-------~~~~~l~~~g~p-- 68 (714) T protein:vir:81 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQP--KWRDAANKACAYYDGDQL-PP-------EVLQVLKDRGQP-- 68 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhH--HHHHHHHHHHHhhcCCCC-CH-------HHHHHHHhcCCC-- Confidence 433211 11122222 2333333332 235567788999999861 00 011111223344 Q ss_pred eeccchhHHHHHHHHhhhhcCCcee--ccC--c-H--HHHHH----HHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEEC Q lcl|NC_013644. 69 RIPHGFFPEIVDQKTQYLLSNPVEY--ETE--N-E--ELKEY----LAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTN 136 (510) Q Consensus 69 ki~~n~~~~Iv~~~~~~l~g~p~~~--~~~--d-~--~~~~~----l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d 136 (510) .+++|..+.+|+..+++--.+.+.+ .+. + + +..+. ++.+++ ++.......+..+++++|.||+-++.+ T Consensus 69 ~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~ 148 (714) T protein:vir:81 69 MTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRN 148 (714) T ss_pred cEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccc Confidence 5789999999999999998777664 331 2 2 23343 444444 678888899999999999999988876 Q ss_pred CC---CceEEEEEcccceEEEEcCC----CCceeEE-EEEEEEE------e----------------------------- Q lcl|NC_013644. 137 AE---DRLCFQVADSLNVFGVYNEY----NELQRIC-RHYITEI------E----------------------------- 173 (510) Q Consensus 137 ~~---g~~~i~~~~p~~~~~~~d~~----~~~~~~~-~~~~~~~------~----------------------------- 173 (510) .+ +.++|..++|.++|.=.+.. .+-..++ +.|.... . T Consensus 149 ~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~ 228 (714) T protein:vir:81 149 SDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPL 228 (714) T ss_pred cCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccccccc Confidence 43 56899999999977432110 0111111 1110000 0 Q ss_pred -----------------eCCceeEEEEEEEEcCCcEEE--EEEcCCceeecccccccc--------------c-----cc Q lcl|NC_013644. 174 -----------------KDGETVDIHHAEVWTDQNVYF--FVAEDNKDYELDEAEPIN--------------P-----RP 215 (510) Q Consensus 174 -----------------~~~~~~~~~~~e~y~~~~i~~--~~~~~~~~~~~~~~~~~~--------------~-----~~ 215 (510) .+.....+..+|+|....... +...+|.....+...... . .. T Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~ 308 (714) T protein:vir:81 229 MSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREA 308 (714) T ss_pred ccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEE Confidence 000011223345554322221 111122222111110000 0 00 Q ss_pred ccccccccccccccccCCcccEEEecCC---CC--CCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh Q lcl|NC_013644. 216 HVLAVDSENESLLQRSYGQIPFYRLSNN---KQ--ETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS 290 (510) Q Consensus 216 ~~~~~~~~~~~~~~~~~g~iPvv~~~nn---~~--g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~ 290 (510) +............|.+.+++|+|+|+-. .. ..|.+..+++.++.+|...|.+...+ .++..++..|....... T Consensus 309 ~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~d~ 386 (714) T protein:vir:81 309 WFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSDN 386 (714) T ss_pred EEecCcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcccccHH Confidence 0000000111122334456666665432 22 34778889999999999999988866 45555555554333222 Q ss_pred hhhHh-hhcCeeeeccCCC--------ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCc-ccHHHHHHHHH Q lcl|NC_013644. 291 KLRQN-VKSKKVVGTGSDG--------GLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGN-ITNIVIKARYT 360 (510) Q Consensus 291 ~~~~~-~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Ai~~~~~ 360 (510) .+... -+.+.++....+. .++......-...+...+......|-.+|++-+...+..+| .||+|+..+-. T Consensus 387 ~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~ 466 (714) T protein:vir:81 387 DLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVE 466 (714) T ss_pred HHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHH Confidence 33222 2334444443221 12333223345677788888888898888877765555444 59999998888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----------ccCCcc-c-----------------------cceeeEEeC Q lcl|NC_013644. 361 LLNMKANKTEARLRALLEWMNKLVIDDINR----------RYTKAF-D-----------------------PTEVSFTFT 406 (510) Q Consensus 361 ~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~----------~~~~~~-~-----------------------~~~v~i~f~ 406 (510) ............+..+.+++.++++.++.. .+..+- . ..+|.|.=. T Consensus 467 qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~ 546 (714) T protein:vir:81 467 QGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPV 546 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeec Confidence 777777777777777777777777766532 111000 0 012333333 Q ss_pred CCCCCCHHHHHHHHHHHHhc-----CCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCc Q lcl|NC_013644. 407 REVMVNETDIVNDEKTEAET-----RKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEE 481 (510) Q Consensus 407 ~~~p~d~~e~~~~~~~~~~~-----g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (510) +..|.-..+.++.++.+.+. +.+....+++.+++-.-++..+++.+ . .+.+.. .+.. T Consensus 547 p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~-----------~---~~~~~~----~~~~ 608 (714) T protein:vir:81 547 QQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRA-----------A---LGTPKS----PDEM 608 (714) T ss_pred cCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHH-----------H---cCCCCC----cccc Confidence 44444445555555555543 11223445555554221222111111 0 010000 0000 Q ss_pred ccCCCCCCcccccccCcccccccccC-----------CCC Q lcl|NC_013644. 482 ETAVNPDDPTQQMAEGATGSTESQLP-----------ENG 510 (510) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~ 510 (510) . +.+++....+.....+.. .+- T Consensus 609 ~-------~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~ 641 (714) T protein:vir:81 609 T-------PEEQEVAAQQQALQQQQAELQMREMAGRVAKL 641 (714) T ss_pred c-------hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 001111001100000000 000 No 87 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=99.71 E-value=1.2e-15 Score=102.45 Aligned_cols=471 Identities=11% Similarity=0.088 Sum_probs=225.8 Q ss_pred CCCccC-----CChhhhHH-------HHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccc Q lcl|NC_013644. 1 MEALLS-----EDVKIIAN-------ALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNV 68 (510) Q Consensus 1 ~~~~~~-----~~~~~~~~-------~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ 68 (510) |++=.. .+.+++.+ ++...++.+. .-|....+..+||.|.|= .. ......+..++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~R~~a~~d~~fy~G~Qw-~~-------~~~~~l~~~g~p-- 68 (714) T protein:vir:10 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQP--KWRDAANKACAYYDGDQL-PP-------EVLQVLKDRGQP-- 68 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhH--HHHHHHHHHHHhhcCCCC-CH-------HHHHHHHhcCCC-- Confidence 433211 11122222 2333333332 235567788999999861 00 011111223344 Q ss_pred eeccchhHHHHHHHHhhhhcCCcee--ccC--c-H--HHHHH----HHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEEC Q lcl|NC_013644. 69 RIPHGFFPEIVDQKTQYLLSNPVEY--ETE--N-E--ELKEY----LAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTN 136 (510) Q Consensus 69 ki~~n~~~~Iv~~~~~~l~g~p~~~--~~~--d-~--~~~~~----l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d 136 (510) .+++|..+.+|+..+++--.+.+.+ .+. + + +..+. ++.+++ ++.......+..+++++|.||+-++.+ T Consensus 69 ~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~ 148 (714) T protein:vir:10 69 MTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRN 148 (714) T ss_pred cEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccc Confidence 5789999999999999998777664 331 2 2 23343 444444 678888899999999999999988876 Q ss_pred CC---CceEEEEEcccceEEEEcCC----CCceeEE-EEEEEEE------e----------------------------- Q lcl|NC_013644. 137 AE---DRLCFQVADSLNVFGVYNEY----NELQRIC-RHYITEI------E----------------------------- 173 (510) Q Consensus 137 ~~---g~~~i~~~~p~~~~~~~d~~----~~~~~~~-~~~~~~~------~----------------------------- 173 (510) .+ +.++|..++|.++|.=.+.. .+-..++ +.|.... . T Consensus 149 ~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~ 228 (714) T protein:vir:10 149 SDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPL 228 (714) T ss_pred cCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccccccc Confidence 43 56899999999977432110 0111111 1110000 0 Q ss_pred -----------------eCCceeEEEEEEEEcCCcEEE--EEEcCCceeecccccccc--------------c-----cc Q lcl|NC_013644. 174 -----------------KDGETVDIHHAEVWTDQNVYF--FVAEDNKDYELDEAEPIN--------------P-----RP 215 (510) Q Consensus 174 -----------------~~~~~~~~~~~e~y~~~~i~~--~~~~~~~~~~~~~~~~~~--------------~-----~~ 215 (510) .+.....+..+|+|....... +...+|.....+...... . .. T Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~ 308 (714) T protein:vir:10 229 MSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREA 308 (714) T ss_pred ccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEE Confidence 000011223345554322221 111122222111110000 0 00 Q ss_pred ccccccccccccccccCCcccEEEecCC---CC--CCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh Q lcl|NC_013644. 216 HVLAVDSENESLLQRSYGQIPFYRLSNN---KQ--ETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS 290 (510) Q Consensus 216 ~~~~~~~~~~~~~~~~~g~iPvv~~~nn---~~--g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~ 290 (510) +............|.+.+++|+|+|+-. .. ..|.+..+++.++.+|...|.+...+ .++..++..|....... T Consensus 309 ~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~d~ 386 (714) T protein:vir:10 309 WFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSDN 386 (714) T ss_pred EEecCcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcccccHH Confidence 0000000111122334456666665432 22 34778889999999999999988866 45555555554333222 Q ss_pred hhhHh-hhcCeeeeccCCC--------ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCc-ccHHHHHHHHH Q lcl|NC_013644. 291 KLRQN-VKSKKVVGTGSDG--------GLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGN-ITNIVIKARYT 360 (510) Q Consensus 291 ~~~~~-~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Ai~~~~~ 360 (510) .+... -+.+.++....+. .++......-...+...+......|-.+|++-+...+..+| .||+|+..+-. T Consensus 387 ~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~ 466 (714) T protein:vir:10 387 DLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVE 466 (714) T ss_pred HHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHH Confidence 33222 2334444443221 12333223345677788888888898888877765555444 59999998888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----------ccCCcc-c-----------------------cceeeEEeC Q lcl|NC_013644. 361 LLNMKANKTEARLRALLEWMNKLVIDDINR----------RYTKAF-D-----------------------PTEVSFTFT 406 (510) Q Consensus 361 ~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~----------~~~~~~-~-----------------------~~~v~i~f~ 406 (510) ............+..+.+++.++++.++.. .+..+- . ..+|.|.=. T Consensus 467 qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~ 546 (714) T protein:vir:10 467 QGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPV 546 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeec Confidence 777777777777777777777777766532 111000 0 012333333 Q ss_pred CCCCCCHHHHHHHHHHHHhc-----CCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCc Q lcl|NC_013644. 407 REVMVNETDIVNDEKTEAET-----RKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEE 481 (510) Q Consensus 407 ~~~p~d~~e~~~~~~~~~~~-----g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (510) +..|.-..+.++.++.+.+. +.+....+++.+++-.-++..+++.+ . .+.+.. .+.. T Consensus 547 p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~-----------~---~~~~~~----~~~~ 608 (714) T protein:vir:10 547 QQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRA-----------A---LGTPKS----PDEM 608 (714) T ss_pred cCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHH-----------H---cCCCCC----cccc Confidence 44444445555555555543 11223445555554221222111111 0 010000 0000 Q ss_pred ccCCCCCCcccccccCcccccccccC-----------CCC Q lcl|NC_013644. 482 ETAVNPDDPTQQMAEGATGSTESQLP-----------ENG 510 (510) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~ 510 (510) . +.+++....+.....+.. .+- T Consensus 609 ~-------~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~ 641 (714) T protein:vir:10 609 T-------PEEQEVAAQQQALQQQQAELQMREMAGRVAKL 641 (714) T ss_pred c-------hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 001111001100000000 000 No 88 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=99.71 E-value=1.2e-15 Score=102.45 Aligned_cols=471 Identities=11% Similarity=0.088 Sum_probs=225.8 Q ss_pred CCCccC-----CChhhhHH-------HHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccc Q lcl|NC_013644. 1 MEALLS-----EDVKIIAN-------ALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNV 68 (510) Q Consensus 1 ~~~~~~-----~~~~~~~~-------~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ 68 (510) |++=.. .+.+++.+ ++...++.+. .-|....+..+||.|.|= .. ......+..++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~R~~a~~d~~fy~G~Qw-~~-------~~~~~l~~~g~p-- 68 (714) T protein:vir:27 1 MKNETNTMATKNDNGATPRFSQRQLQALCSDIDSQP--KWRDAANKACAYYDGDQL-PP-------EVLQVLKDRGQP-- 68 (714) T ss_pred CCcccccccCCCCcchhHHHHHHHHHHHHHHHHhhH--HHHHHHHHHHHhhcCCCC-CH-------HHHHHHHhcCCC-- Confidence 433211 11122222 2333333332 235567788999999861 00 011111223344 Q ss_pred eeccchhHHHHHHHHhhhhcCCcee--ccC--c-H--HHHHH----HHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEEC Q lcl|NC_013644. 69 RIPHGFFPEIVDQKTQYLLSNPVEY--ETE--N-E--ELKEY----LAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTN 136 (510) Q Consensus 69 ki~~n~~~~Iv~~~~~~l~g~p~~~--~~~--d-~--~~~~~----l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d 136 (510) .+++|..+.+|+..+++--.+.+.+ .+. + + +..+. ++.+++ ++.......+..+++++|.||+-++.+ T Consensus 69 ~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~ 148 (714) T protein:vir:27 69 MTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRN 148 (714) T ss_pred cEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccc Confidence 5789999999999999998777664 331 2 2 23343 444444 678888899999999999999988876 Q ss_pred CC---CceEEEEEcccceEEEEcCC----CCceeEE-EEEEEEE------e----------------------------- Q lcl|NC_013644. 137 AE---DRLCFQVADSLNVFGVYNEY----NELQRIC-RHYITEI------E----------------------------- 173 (510) Q Consensus 137 ~~---g~~~i~~~~p~~~~~~~d~~----~~~~~~~-~~~~~~~------~----------------------------- 173 (510) .+ +.++|..++|.++|.=.+.. .+-..++ +.|.... . T Consensus 149 ~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~ 228 (714) T protein:vir:27 149 SDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPL 228 (714) T ss_pred cCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccccccc Confidence 43 56899999999977432110 0111111 1110000 0 Q ss_pred -----------------eCCceeEEEEEEEEcCCcEEE--EEEcCCceeecccccccc--------------c-----cc Q lcl|NC_013644. 174 -----------------KDGETVDIHHAEVWTDQNVYF--FVAEDNKDYELDEAEPIN--------------P-----RP 215 (510) Q Consensus 174 -----------------~~~~~~~~~~~e~y~~~~i~~--~~~~~~~~~~~~~~~~~~--------------~-----~~ 215 (510) .+.....+..+|+|....... +...+|.....+...... . .. T Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~ 308 (714) T protein:vir:27 229 MSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREA 308 (714) T ss_pred ccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEE Confidence 000011223345554322221 111122222111110000 0 00 Q ss_pred ccccccccccccccccCCcccEEEecCC---CC--CCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh Q lcl|NC_013644. 216 HVLAVDSENESLLQRSYGQIPFYRLSNN---KQ--ETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS 290 (510) Q Consensus 216 ~~~~~~~~~~~~~~~~~g~iPvv~~~nn---~~--g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~ 290 (510) +............|.+.+++|+|+|+-. .. ..|.+..+++.++.+|...|.+...+ .++..++..|....... T Consensus 309 ~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~d~ 386 (714) T protein:vir:27 309 WFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSDN 386 (714) T ss_pred EEecCcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcccccHH Confidence 0000000111122334456666665432 22 34778889999999999999988866 45555555554333222 Q ss_pred hhhHh-hhcCeeeeccCCC--------ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCc-ccHHHHHHHHH Q lcl|NC_013644. 291 KLRQN-VKSKKVVGTGSDG--------GLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGN-ITNIVIKARYT 360 (510) Q Consensus 291 ~~~~~-~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Ai~~~~~ 360 (510) .+... -+.+.++....+. .++......-...+...+......|-.+|++-+...+..+| .||+|+..+-. T Consensus 387 ~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~ 466 (714) T protein:vir:27 387 DLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVE 466 (714) T ss_pred HHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHH Confidence 33222 2334444443221 12333223345677788888888898888877765555444 59999998888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----------ccCCcc-c-----------------------cceeeEEeC Q lcl|NC_013644. 361 LLNMKANKTEARLRALLEWMNKLVIDDINR----------RYTKAF-D-----------------------PTEVSFTFT 406 (510) Q Consensus 361 ~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~----------~~~~~~-~-----------------------~~~v~i~f~ 406 (510) ............+..+.+++.++++.++.. .+..+- . ..+|.|.=. T Consensus 467 qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~ 546 (714) T protein:vir:27 467 QGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPV 546 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeec Confidence 777777777777777777777777766532 111000 0 012333333 Q ss_pred CCCCCCHHHHHHHHHHHHhc-----CCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCc Q lcl|NC_013644. 407 REVMVNETDIVNDEKTEAET-----RKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEE 481 (510) Q Consensus 407 ~~~p~d~~e~~~~~~~~~~~-----g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (510) +..|.-..+.++.++.+.+. +.+....+++.+++-.-++..+++.+ . .+.+.. .+.. T Consensus 547 p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~-----------~---~~~~~~----~~~~ 608 (714) T protein:vir:27 547 QQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRA-----------A---LGTPKS----PDEM 608 (714) T ss_pred cCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHH-----------H---cCCCCC----cccc Confidence 44444445555555555543 11223445555554221222111111 0 010000 0000 Q ss_pred ccCCCCCCcccccccCcccccccccC-----------CCC Q lcl|NC_013644. 482 ETAVNPDDPTQQMAEGATGSTESQLP-----------ENG 510 (510) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~ 510 (510) . +.+++....+.....+.. .+- T Consensus 609 ~-------~e~q~~~~~~q~~~~~q~~lq~~~~~a~~~k~ 641 (714) T protein:vir:27 609 T-------PEEQEVAAQQQALQQQQAELQMREMAGRVAKL 641 (714) T ss_pred c-------hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 001111001100000000 000 No 89 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=99.68 E-value=7.3e-15 Score=98.10 Aligned_cols=471 Identities=11% Similarity=0.104 Sum_probs=225.7 Q ss_pred CCCc-----cCCCh----hhhH---HHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccc Q lcl|NC_013644. 1 MEAL-----LSEDV----KIIA---NALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNV 68 (510) Q Consensus 1 ~~~~-----~~~~~----~~~~---~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ 68 (510) |-.= ...+. .... ..+...++.+ ..-|....+..+||.|.| +.. ......+..++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~--~~~r~~a~~d~~fy~G~Q-w~~-------~~~~~l~~~g~p-- 68 (714) T protein:vir:10 1 MKNEINTTAMKNDHGSTPRFSQRQLLSLCSDIDSQ--PLWRDAANKACAYYDGDQ-LAP-------EVIQVLKDRGQP-- 68 (714) T ss_pred CCcCcCcccCCCcchhhhhhhHHHHHHHHHHHhhh--HHHHHHHHHHHHhhcCCC-CCH-------HHHHHHHhcCCC-- Confidence 2111 11111 1121 2222333332 223456778899999987 110 011111223344 Q ss_pred eeccchhHHHHHHHHhhhhcCCcee--ccC--cH---HHHHHH----HHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEEC Q lcl|NC_013644. 69 RIPHGFFPEIVDQKTQYLLSNPVEY--ETE--NE---ELKEYL----AEYYN-SEFQVVLQELVEGSSQKGFEYVYARTN 136 (510) Q Consensus 69 ki~~n~~~~Iv~~~~~~l~g~p~~~--~~~--d~---~~~~~l----~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d 136 (510) .+++|..+.+|+..+++.-.+.+.+ .+. ++ +..+.| +.+++ ++.......+..++.++|.||+-++.| T Consensus 69 ~~~~N~i~~~v~~v~g~~~~nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~d 148 (714) T protein:vir:10 69 MTIHNLIAPTVDGVLGMEAKTRTDLIVMSDDPNDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRN 148 (714) T ss_pred cEEeccHHHHHHHHHHHHHhCCcceEEecCCCChhhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcccceEEeeec Confidence 5789999999999999998777664 331 11 233443 34443 677888899999999999999988887 Q ss_pred CC---CceEEEEEcccceEEEEcCC-CC---ceeEEEE-EEE-------------------------------------- Q lcl|NC_013644. 137 AE---DRLCFQVADSLNVFGVYNEY-NE---LQRICRH-YIT-------------------------------------- 170 (510) Q Consensus 137 ~~---g~~~i~~~~p~~~~~~~d~~-~~---~~~~~~~-~~~-------------------------------------- 170 (510) .+ +.++|..++|.++|.=++.. .+ -..+++. |.. T Consensus 149 ~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~~~~~~~~ 228 (714) T protein:vir:10 149 SEPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPL 228 (714) T ss_pred cCCCCCCeEEEecChhheeeccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcccchhhhhhcccc Confidence 54 67999999999887532110 00 0001100 000 Q ss_pred ------------EE--eeCCceeEEEEEEEEcCCcEEEEEEc--CCceeeccccccc------ccc-------------c Q lcl|NC_013644. 171 ------------EI--EKDGETVDIHHAEVWTDQNVYFFVAE--DNKDYELDEAEPI------NPR-------------P 215 (510) Q Consensus 171 ------------~~--~~~~~~~~~~~~e~y~~~~i~~~~~~--~~~~~~~~~~~~~------~~~-------------~ 215 (510) .. ..+.....+..+|+|........... +|.....+..... ... . T Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~ 308 (714) T protein:vir:10 229 MSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREA 308 (714) T ss_pred cccchhhcccccccccccccCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHhccceecccceeeEEEE Confidence 00 00011123455666655433332222 1222111110000 000 0 Q ss_pred ccccccccccccccccCCcccEEEecCC---C--CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh Q lcl|NC_013644. 216 HVLAVDSENESLLQRSYGQIPFYRLSNN---K--QETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS 290 (510) Q Consensus 216 ~~~~~~~~~~~~~~~~~g~iPvv~~~nn---~--~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~ 290 (510) .............|.+.+++|+|+|+-. . ...|.+..+++.++.+|...|.+...+ .++..++..|....... T Consensus 309 ~~~g~~~L~~~~~p~p~~~fp~vP~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~gav~~~d~ 386 (714) T protein:vir:10 309 WFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSDN 386 (714) T ss_pred EEecchhhhcCCCCCCCCceeeEEecceeeeccCccceehhhhhhHHHHHHHHHHHHHHHH--hCCceeeccccccccHH Confidence 0000001112233455566777766532 2 245788889999999999999988876 34555555555433223 Q ss_pred hhhHh-hhcCeeeeccCC----C----ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCc-ccHHHHHHHHH Q lcl|NC_013644. 291 KLRQN-VKSKKVVGTGSD----G----GLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGN-ITNIVIKARYT 360 (510) Q Consensus 291 ~~~~~-~~~~~~~~~~~~----~----~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Ai~~~~~ 360 (510) .+... -+.+.++.+..+ + .++......-...+...+......|-.+|++-+...+..+| .||+|+..+.. T Consensus 387 ~~~e~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAI~~r~~ 466 (714) T protein:vir:10 387 DLMEQLERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVE 466 (714) T ss_pred HHHHhccCCCCeEEecccccccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCCCcchhHHHHHHHHHH Confidence 33322 233445544321 1 12322222335677888888899999998877766555444 69999998888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc----------CCc-cc-c----------------------ceeeEEeC Q lcl|NC_013644. 361 LLNMKANKTEARLRALLEWMNKLVIDDINRRY----------TKA-FD-P----------------------TEVSFTFT 406 (510) Q Consensus 361 ~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~----------~~~-~~-~----------------------~~v~i~f~ 406 (510) ............+..+.+++.++++.++.... ... .. . .+|.|.=. T Consensus 467 qg~~~l~~~~dnl~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~~~~~dv~i~~~ 546 (714) T protein:vir:10 467 QGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPV 546 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCCccccccceeeeEEEEEeec Confidence 77777777777788888888777777663211 000 00 0 01112222 Q ss_pred CCCCCCHHHHHHHHHHHHhcC-----CCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCc Q lcl|NC_013644. 407 REVMVNETDIVNDEKTEAETR-----KIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEE 481 (510) Q Consensus 407 ~~~p~d~~e~~~~~~~~~~~g-----~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (510) +..+.-..+.++.+..+.... .+....+++.+.+-..++..+++. .. .+.++... T Consensus 547 p~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~le~~d~p~~~ei~~~ir-----------~~---~~~~~~~~------ 606 (714) T protein:vir:10 547 QQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIR-----------AA---LGTPKSPD------ 606 (714) T ss_pred cCcHHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCcCHHHHHHHHH-----------HH---cCCCCCcc------ Confidence 223333334444444444321 112233444444321112111110 00 01000000 Q ss_pred ccCCCCCCcccccccCcccccccccCC------CC Q lcl|NC_013644. 482 ETAVNPDDPTQQMAEGATGSTESQLPE------NG 510 (510) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~ 510 (510) +..+.+++....+.....+... .. T Consensus 607 -----~~~~e~q~~q~~~~~~~~~q~~l~~~e~~a 636 (714) T protein:vir:10 607 -----EMTPEEQEVAAQQQALQQQQAELQMREMAG 636 (714) T ss_pred -----ccCcchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000011110001000000000 00 No 90 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.63 E-value=7.4e-15 Score=98.06 Aligned_cols=390 Identities=12% Similarity=0.132 Sum_probs=193.3 Q ss_pred HHHHHHHHHHhccCCcchhcccceeccccccccccccc-c-ceeccchhHHHHHHHHhhhhcCCceeccCc--HHHHHHH Q lcl|NC_013644. 29 KREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYAS-N-VRIPHGFFPEIVDQKTQYLLSNPVEYETEN--EELKEYL 104 (510) Q Consensus 29 ~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~-~-~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d--~~~~~~l 104 (510) .+.++-+...-.|-. ...+......+......... . .--.+.+++.+|+..+.-++-+++.+++++ ++..+.+ T Consensus 1 ~~~~D~~~~~~~~~g---~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~~d~~~~~~~~~ 77 (437) T protein:vir:52 1 MKFFDGIKSLALKLG---SKQEQTYYSPSLSLTDDLVQLEALWRDNWIANKVCIKRPEDMVRNWREIYSNDLNSKQLDLF 77 (437) T ss_pred CchhhhhHhHHhcCC---CccccceeecCccccccHHHHHHHHHhCchhhHHhhcchHHhhcCCceEecCCCCHHHHHHH Confidence 111222222221111 00000000000000000000 0 001357899999999999999999998864 3333456 Q ss_pred HHHhcc-CHHHHHHHHHHHHHhcCeEEEEEEECCC---------Cce-EEEEEcccceEEEEcCCCCcee-E---EEEEE Q lcl|NC_013644. 105 AEYYNS-EFQVVLQELVEGSSQKGFEYVYARTNAE---------DRL-CFQVADSLNVFGVYNEYNELQR-I---CRHYI 169 (510) Q Consensus 105 ~~~~~n-~~~~~~~e~~~~~~~~G~~~~~v~~d~~---------g~~-~i~~~~p~~~~~~~d~~~~~~~-~---~~~~~ 169 (510) +..++. ++...+.++.+.+..+|.|++++-.|.. |.+ .+.++++.++.|..-...++.. - -..|. T Consensus 78 ~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~~~~~dp~s~~fg~p~~y~ 157 (437) T protein:vir:52 78 TKFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPTGTKDDDVLSPNFGRYSEYS 157 (437) T ss_pred HHHHHhhcHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechhhccccccccccccccccCcceEEE Confidence 666653 6788999999999999999998877643 222 2555555555443211111110 0 00111 Q ss_pred EEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCCc Q lcl|NC_013644. 170 TEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTD 249 (510) Q Consensus 170 ~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd 249 (510) + . ++.. ...+.+.++.+|... .+| ...++-.|.|. T Consensus 158 v--~-~~~~-----~~~iH~SRii~~~~~-----------------------------------~~~--~~~~~~~G~s~ 192 (437) T protein:vir:52 158 I--L-GGSQ-----SITVHHSRLIILNAN-----------------------------------DAP--LSDNDIWGVSD 192 (437) T ss_pred E--e-cCCc-----ceeEccceeEEecCc-----------------------------------cCC--CccccccCCch Confidence 1 0 0000 001122333333211 112 11134458899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCC----CCchhhhhH------h-hhcCeeeeccCCCceeEEeecCC Q lcl|NC_013644. 250 LKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQ----GDDLSKLRQ------N-VKSKKVVGTGSDGGLDVKTVTIP 318 (510) Q Consensus 250 ~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~----~~~~~~~~~------~-~~~~~~~~~~~~~~~~~~~~~~~ 318 (510) ++.+.+-+.+++.+.-..+..+..+..+.+.+.|+. ........+ . ....+++.++.+.+.+.++ .+ T Consensus 193 le~~~~~i~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~e~~~--~~ 270 (437) T protein:vir:52 193 LEKIIDVLKRFDSASVNVGDLIFESKIDIFKIAGLSDKIAAGMENEVASVISAVQEIKSATNSLLLDAENEYDRKE--LT 270 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCceecchHHHHhcCCcHHHHHHHHHHHHHhcCCCceEEEcCCcceEEEe--cC Confidence 999999999999988888777777777777666531 111111111 1 1234566677666555544 56 Q ss_pred HHHHHHHHHHHHHHHHHHhCCcccccccc---CcccHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhccCC Q lcl|NC_013644. 319 TEGRKTKMEIDKENIYKFGMAFDSTQVGD---GNITNIVIKARYTLLNMKANKTE-ARLRALLEWMNKLVIDDINRRYTK 394 (510) Q Consensus 319 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~---g~~Sg~Ai~~~~~~l~~k~~~k~-~~~~~~l~~~~~~i~~~~~~~~~~ 394 (510) ...+...++...+.|...+++|-+-..+. |=+||..=...| +..+...+ ..++..+++++.+|+... .+.. T Consensus 271 ~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s~~Glasge~D~~~y---yd~i~~~Qe~~l~p~le~l~~~i~~~~--~g~~ 345 (437) T protein:vir:52 271 FTGLKDLLTEFRNAVAGAADMPVTILFGQSVSGLASGDEDIQNY---HEAIRRLQETRLRPIFEIIDPLICNEL--FGGL 345 (437) T ss_pred cCCHHHHHHHHHHHHHHHhcCchhhhcCcCcccccccHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHh--cCCC Confidence 66778888999999999999998644322 224555433223 33444444 568888888888766432 1211 Q ss_pred ccccceeeEEeCCCCCCCHHHHHHH-------HHHHHhcCCCchHHHHHhC------CCCCcHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 395 AFDPTEVSFTFTREVMVNETDIVND-------EKTEAETRKIILESILQVA------PRLDDDNVLRLICEQFDLDWEDV 461 (510) Q Consensus 395 ~~~~~~v~i~f~~~~p~d~~e~~~~-------~~~~~~~g~iS~et~~~~~------~~v~d~e~~~~~~e~~e~~~~~~ 461 (510) ..++++.|++-...+.+|.++. +.++.++|++|.+.+.+.| |.+++.+ T Consensus 346 ---~~~~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~g~i~~~e~r~~L~~~g~~~~i~~~~---------------- 406 (437) T protein:vir:52 346 ---PADWWFEFVPLTTVKQEQQINMLNTFATAANTLIQNGVLNEYQIANELRESGLFANISAEH---------------- 406 (437) T ss_pred ---CCcceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCCCCccc---------------- Confidence 2368899999888898888876 4445556665554444322 1111100 Q ss_pred HHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccc Q lcl|NC_013644. 462 KEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQ 505 (510) Q Consensus 462 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (510) .....+.++...+...+....+...+.++. | T Consensus 407 ------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 437 (437) T protein:vir:52 407 ------------IEELKNADEFAGNFEEPEKMEGAQVQNSED-Q 437 (437) T ss_pred ------------cccccCCCCCCCccCCCCCCCCCCCCCCCC-C Confidence 000000000000000000000000000001 1 No 91 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=99.61 E-value=3.2e-14 Score=94.56 Aligned_cols=480 Identities=12% Similarity=0.035 Sum_probs=207.5 Q ss_pred CCC---ccCCChhhhHHHHHHHHHhhhhh---hhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccch Q lcl|NC_013644. 1 MEA---LLSEDVKIIANALKAAIDKDRKS---SSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGF 74 (510) Q Consensus 1 ~~~---~~~~~~~~~~~~i~~~i~~~~~~---~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~ 74 (510) |-- .-.-+.+.+...+...|+.-.+. +......+..+||.|+..-. ...++ .+++.+. T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~~--------------~~~~~--s~~~~~~ 64 (705) T protein:vir:88 1 MAKRRKIKPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGN--------------ERPGK--SGIVSRD 64 (705) T ss_pred CCcccccccCCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCCc--------------ccCCC--CccccHH Confidence 211 11112233445555555444332 22223455678999974210 11122 3566666 Q ss_pred hHHHHHHHHhhh----hcCC--ceecc---CcHHHHHHHHHH----h--ccCHHHHHHHHHHHHHhcCeEEEEEEECCC- Q lcl|NC_013644. 75 FPEIVDQKTQYL----LSNP--VEYET---ENEELKEYLAEY----Y--NSEFQVVLQELVEGSSQKGFEYVYARTNAE- 138 (510) Q Consensus 75 ~~~Iv~~~~~~l----~g~p--~~~~~---~d~~~~~~l~~~----~--~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~- 138 (510) ....|+.....| ||.+ +.+.+ +|....+.++.+ + .|+....+...+++++++|.|++.||++.. T Consensus 65 v~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~ 144 (705) T protein:vir:88 65 VQETVDWIMPSLMKVFTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVL 144 (705) T ss_pred HHHHHHHHHHHHHHhhcCCCceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEecccccc Confidence 666666666655 3433 44443 444444444433 2 255567788899999999999998887432 Q ss_pred -----------------------------------------------CceEEEEEcccceEEEEcCC-C-CceeEEEEEE Q lcl|NC_013644. 139 -----------------------------------------------DRLCFQVADSLNVFGVYNEY-N-ELQRICRHYI 169 (510) Q Consensus 139 -----------------------------------------------g~~~i~~~~p~~~~~~~d~~-~-~~~~~~~~~~ 169 (510) |.+++..|+|.++++--+-. . +-..+++.++ T Consensus 145 ~~~~e~~~~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp~a~~~~d~~~~~~~~~ 224 (705) T protein:vir:88 145 KPTFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARFLCHREK 224 (705) T ss_pred chhhhhhccCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceecCCCCCcccCcEEEEEEe Confidence 66888889999887532211 1 1111112211 Q ss_pred EEEeeC----------------C-----------------ceeEEEEEEEEcCC---cEEEEEEcCCceeeccccccccc Q lcl|NC_013644. 170 TEIEKD----------------G-----------------ETVDIHHAEVWTDQ---NVYFFVAEDNKDYELDEAEPINP 213 (510) Q Consensus 170 ~~~~~~----------------~-----------------~~~~~~~~e~y~~~---~i~~~~~~~~~~~~~~~~~~~~~ 213 (510) ....+- . ........+.+++. .+..|.+.. .+.........+ T Consensus 225 ~t~~dl~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~--~~d~~~d~~~~~ 302 (705) T protein:vir:88 225 YTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYT--LLDVDGDGISEL 302 (705) T ss_pred ccHHHHHhhcCChhHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEEeee--EecccCCcceee Confidence 110000 0 00000000000000 011111000 000000000000 Q ss_pred ccccccccccccccccccCCcccEEEe-----cCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEE-ecCCCC Q lcl|NC_013644. 214 RPHVLAVDSENESLLQRSYGQIPFYRL-----SNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVV-SGFQGD 287 (510) Q Consensus 214 ~~~~~~~~~~~~~~~~~~~g~iPvv~~-----~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~-~g~~~~ 287 (510) +.... .+.... ...++|.+|++.+ +..-+|.|.++.+.++++.+|.+++.+.+.+....+|.+.+ .|+ . T Consensus 303 ~~~~~-~g~~il--~~~~~~~~PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~~g~--v 377 (705) T protein:vir:88 303 RRILY-VGDYII--SNEPWDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQ--V 377 (705) T ss_pred EEEEE-eCcccc--ccccCCCCCEEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHHHHhccCCceeccccc--c Confidence 00000 000000 1124566676654 44557899999999999999999999999999988886655 332 2 Q ss_pred chhhhhHhhhcCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-----cCcccHHHHHHHHHHH Q lcl|NC_013644. 288 DLSKLRQNVKSKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG-----DGNITNIVIKARYTLL 362 (510) Q Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-----~g~~Sg~Ai~~~~~~l 362 (510) +..+... .+.++++.+..++.+.++..+.-.......++.+...|-..|++++...+. .++.|+.|+..+.... T Consensus 378 ~~~d~~~-~~pg~vv~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~ 456 (705) T protein:vir:88 378 NLEDLLT-NEAAGIVRVKSMNSITPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAA 456 (705) T ss_pred Ccccccc-cCCCeeEEecCCCccccccCCcCcHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHHH Confidence 2223222 345566666666667887666666777888899999999999999875442 2345667777777777 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHhhccCCc-----------cc------cceeeEEeCCCCCCCHHHHHHHHHHHH Q lcl|NC_013644. 363 NMKANKTEARLR-ALLEWMNKLVIDDINRRYTKA-----------FD------PTEVSFTFTREVMVNETDIVNDEKTEA 424 (510) Q Consensus 363 ~~k~~~k~~~~~-~~l~~~~~~i~~~~~~~~~~~-----------~~------~~~v~i~f~~~~p~d~~e~~~~~~~~~ 424 (510) ..+.....+.|. .++++++++++.++....... ++ ..++.+.-... ..+..+....+..+. T Consensus 457 ~~r~~~~~r~~a~~~~~~l~~~~~~li~~~~~~~~~~ri~g~~v~v~~~~~~~~~~v~v~v~~~-~~~~eq~~a~l~~ll 535 (705) T protein:vir:88 457 EQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRERSDLTVTVGIG-NMNKDQQMLHLMRIW 535 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCceEEeeccchhccchHhhccCCceEEeeccc-cchHHHHHHHHHHHH Confidence 777777777775 456666666666553321110 00 01122221111 112222222221111 Q ss_pred hcCCCchHH-HHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccc-----c--cc Q lcl|NC_013644. 425 ETRKIILES-ILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQ-----M--AE 496 (510) Q Consensus 425 ~~g~iS~et-~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~--~~ 496 (510) +. .+.-+ .-.+.+.++.........+..+............++ ....... .............. + .. T Consensus 536 ~~--~q~l~~~~~~~~~~~~~~~~~~~~el~e~~~~k~~~~~~~~~--~~~e~~~-~~~~~~q~e~~~~~~~~~~q~e~~ 610 (705) T protein:vir:88 536 EM--AQAVVGGGGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNP--NSPEALQ-AKAIREQKEAQPKPEDIKAQADAQ 610 (705) T ss_pred HH--HHHhhcccchhhhcChHHHHHHHHHHHHhhhhhhHHHHhhhh--hhHHHHH-HHHhhhhhhhhHHHHHHHHHHHHH Confidence 10 00000 001112222222211111111110000000000000 0000000 00000000000000 0 00 Q ss_pred CcccccccccC-C---CC Q lcl|NC_013644. 497 GATGSTESQLP-E---NG 510 (510) Q Consensus 497 ~~~~~~~~~~~-~---~~ 510 (510) .++.....+-- . .. T Consensus 611 k~q~e~~~~q~e~q~~q~ 628 (705) T protein:vir:88 611 RAQSDALAKQAEAQMKQV 628 (705) T ss_pred HHHHHHHHHHHHHHHHHH Confidence 00000000000 0 00 No 92 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=99.59 E-value=4e-14 Score=94.03 Aligned_cols=487 Identities=11% Similarity=0.062 Sum_probs=221.5 Q ss_pred CCCccCC-----------ChhhhHH---HHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccc Q lcl|NC_013644. 1 MEALLSE-----------DVKIIAN---ALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYAS 66 (510) Q Consensus 1 ~~~~~~~-----------~~~~~~~---~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~ 66 (510) |+..... +..++.. .+...++.+. .-|....+..+||.|.|= .. ......+...+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~--~~r~~a~~d~~fy~G~QW-~~-------~~~~~l~~~g~p 70 (772) T protein:vir:10 1 MQITENDRQYLNGLPPAGDTPLTVDEYADINYEIEDQP--AWRAVADKEMDYADGNQL-DT-------ELLRRQQALGIP 70 (772) T ss_pred CCcchhhHHhhccCCcccccccCHHHHHHHHHHHhccH--HHHHHHHHHHHhhcCCCC-CH-------HHHHHHHhcCCC Confidence 4432111 1121222 2333444433 345567788899999861 00 000111223444 Q ss_pred cceeccchhHHHHHHHHhhhhcCCcee--cc----CcHHHHHHHH----HHhc-cCHHHHHHHHHHHHHhcCeEEEEEEE Q lcl|NC_013644. 67 NVRIPHGFFPEIVDQKTQYLLSNPVEY--ET----ENEELKEYLA----EYYN-SEFQVVLQELVEGSSQKGFEYVYART 135 (510) Q Consensus 67 ~~ki~~n~~~~Iv~~~~~~l~g~p~~~--~~----~d~~~~~~l~----~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~ 135 (510) .+++|..+.+|+..+++.-.+.+.+ .+ ++.+..+.|+ .+++ ++.......+..+++++|.||+-++. T Consensus 71 --~~~~N~i~~~v~~v~g~~~~nr~d~~v~Pr~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~~ 148 (772) T protein:vir:10 71 --PAVEDLIGPALLSLQGYEAVTRTDWRVTPNGDVGGQEVADALNYRLNTAERQSGADRACSEAFRPQIACGIGWVEVSR 148 (772) T ss_pred --cEEEcchHHHHHHHHHHHHhcCcceEEecCCCchHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhcCceeEEecc Confidence 4789999999999999998877664 33 2233344443 4443 67888899999999999999998887 Q ss_pred CCC---CceEEEEEcccceEEEEcCCCC--cee---EEEEEEEE----------------------------------E- Q lcl|NC_013644. 136 NAE---DRLCFQVADSLNVFGVYNEYNE--LQR---ICRHYITE----------------------------------I- 172 (510) Q Consensus 136 d~~---g~~~i~~~~p~~~~~~~d~~~~--~~~---~~~~~~~~----------------------------------~- 172 (510) +.+ +.++|..++|.++| ||...+ +.- +++.+... . T Consensus 149 ~~d~~~~~i~i~~v~p~~v~--~Dp~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~ 226 (772) T protein:vir:10 149 ESDPFKFPYRCRPIRRDEIH--WDMKCGDDWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEG 226 (772) T ss_pred ccCCCCCCeEEEeeCcccce--ecCCCCCCHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCccccccccc Confidence 654 46889999998876 443221 111 11100000 0 Q ss_pred ----------------------eeCCceeEEEEEEEEcCCcEEEEEE--cCCceeecccccc----------cc------ Q lcl|NC_013644. 173 ----------------------EKDGETVDIHHAEVWTDQNVYFFVA--EDNKDYELDEAEP----------IN------ 212 (510) Q Consensus 173 ----------------------~~~~~~~~~~~~e~y~~~~i~~~~~--~~~~~~~~~~~~~----------~~------ 212 (510) ..+.....++-+|+|......+... ..+.....+.... .. T Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~~~~~~~~~ 306 (772) T protein:vir:10 227 GTSTGLHNAWNEARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGRISPKKVTV 306 (772) T ss_pred ccccccccccchhhccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhcccchheeee Confidence 0000112234455554432222222 2222221111000 00 Q ss_pred -c--ccccccccccccccccccCCcccEEEecCC---C--CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecC Q lcl|NC_013644. 213 -P--RPHVLAVDSENESLLQRSYGQIPFYRLSNN---K--QETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGF 284 (510) Q Consensus 213 -~--~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn---~--~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~ 284 (510) . ...............|.+.+.+|+|+|+-. . ...|.+..+++.++.+|...|.+...+...+ +..-.|. T Consensus 307 ~rv~~~~~~g~~~L~~~~~p~~~~~fP~vP~~g~r~~~~g~~~G~vr~~kd~Qr~~N~~~S~~~~~l~~~~--~~~~~ga 384 (772) T protein:vir:10 307 SRVRRSYWLGPHCLHDGPTPYTHRHFPYVPFFGFREDATGIPYGYVRGMKYAQDSLNSGVSKLRWGMSVAR--VERTKGA 384 (772) T ss_pred eEEEEEEEecceeeccCCCCCCCCccceEEEeeeEeccCCcccchhhhhhhHHHHHHHHHHHHHHHHhccc--ccccCCC Confidence 0 000011111111233455567777776532 1 2347888999999999999999988775443 3333333 Q ss_pred CCCchhhhhHh-hhcCeeeeccCC------CceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCc-ccHHHHH Q lcl|NC_013644. 285 QGDDLSKLRQN-VKSKKVVGTGSD------GGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGN-ITNIVIK 356 (510) Q Consensus 285 ~~~~~~~~~~~-~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Ai~ 356 (510) -......+... -+.+.++.+..+ +.+++.....-..++...+......|-.++++-+...+..+| .||+|+. T Consensus 385 v~~~d~~~~e~~arp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~~~na~SGvAi~ 464 (772) T protein:vir:10 385 VAMTDAQFRRQIARPDADIVLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGRKGTATSGIQEQ 464 (772) T ss_pred ccchhHHHHHhccCCCCeEEeCCccccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcCCCcchhhHHHHH Confidence 22111223222 233445555443 223333333345677888888888899988777655555555 5999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc----------cCCcccc---------------------ceeeE-E Q lcl|NC_013644. 357 ARYTLLNMKANKTEARLRALLEWMNKLVIDDINRR----------YTKAFDP---------------------TEVSF-T 404 (510) Q Consensus 357 ~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~----------~~~~~~~---------------------~~v~i-~ 404 (510) .+-.............+..+.+++.++++.++... +...... .+|++ . T Consensus 465 ~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~ 544 (772) T protein:vir:10 465 QQIEQSNQSIGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAAYLSNDLLRTR 544 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceecccccccceeccceeee Confidence 88777777777777778888888777777766421 1100000 01100 0 Q ss_pred eC---CCCCCCHH---HHHHHHHHHHhcCCCchHHHH-------HhCCCCCcHHHHHHHHHHHHH-HHHHHH------HH Q lcl|NC_013644. 405 FT---REVMVNET---DIVNDEKTEAETRKIILESIL-------QVAPRLDDDNVLRLICEQFDL-DWEDVK------EA 464 (510) Q Consensus 405 f~---~~~p~d~~---e~~~~~~~~~~~g~iS~et~~-------~~~~~v~d~e~~~~~~e~~e~-~~~~~~------~~ 464 (510) +. ...|...+ +.++.+..+. +.++.+... +.+.+-..++..+++++.... +.+..+ .. T Consensus 545 yDv~i~~~p~~~t~r~~~~~~m~ql~--~~~~P~~~~~~~~~~le~~D~p~~~ei~~~ir~~~~~~~peq~~~~~~q~~q 622 (772) T protein:vir:10 545 IKVALEDVPSTNSYRGQQLNAMSEAV--KSMPPQYQAAVLPFLVSLMDVPFKRDVVEAIRAVDQQQTPEQIQQQIDQAVQ 622 (772) T ss_pred EEEEeeccccchHHHHHHHHHHHHHH--hccChhHHHHHHHHHHhhcCCCChHHHHHHHHHHhccCChHHHHHHHHHHHH Confidence 10 12222223 2233333222 223433322 222221112222222211000 000000 00 Q ss_pred HHhhhccCCCCC---CCCCcccCCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 465 LEEAEYTKGLSD---NTDEEETAVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 465 ~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) +.......+... .....+ .+......-..+.+ ...+-..++ T Consensus 623 q~~~~~~~el~~~q~~a~~~~----~~A~a~~~~aqa~~-~~~~a~~~a 666 (772) T protein:vir:10 623 DALAKAGNDIKLRELEIKERK----ADSEISGLNAKAVQ-IGVQAAFSA 666 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHH-HHHHHHHHH Confidence 000000000000 000000 00000000000000 000000011 No 93 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.57 E-value=3.6e-13 Score=88.79 Aligned_cols=435 Identities=7% Similarity=-0.023 Sum_probs=213.8 Q ss_pred HHHHHHHHHhhhhhh--hHHHHHHHHHHhccCCcchhcccceeccccccccccc---------cc-cceeccchhHHHHH Q lcl|NC_013644. 13 ANALKAAIDKDRKSS--SKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKY---------AS-NVRIPHGFFPEIVD 80 (510) Q Consensus 13 ~~~i~~~i~~~~~~~--~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~---------~~-~~ki~~n~~~~Iv~ 80 (510) -.+|.+.|.-+-+.. ++.......+-|.+-..- +... ............ ++ +.-..++|++-.|+ T Consensus 1 mn~~dr~i~~~sP~~~~~R~~ar~~~~~y~aa~~~--r~~~-~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~ 77 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKAARLRSRAVIQAYEAVKTT--RTHK-ARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFD 77 (502) T ss_pred CchHhhHHhhcChHHHHHHHhhHHHHhhccccCcc--cccC-CCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHH Confidence 233444444442221 112222222335554311 1000 000000000000 00 00112469999999 Q ss_pred HHHhhhhcC-Cceecc----C----cHHHHHHHHHHhc-----------cCHHHHHHHHHHHHHhcCeEEEEEEECCCCc Q lcl|NC_013644. 81 QKTQYLLSN-PVEYET----E----NEELKEYLAEYYN-----------SEFQVVLQELVEGSSQKGFEYVYARTNAEDR 140 (510) Q Consensus 81 ~~~~~l~g~-p~~~~~----~----d~~~~~~l~~~~~-----------n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~ 140 (510) ..++.++|. ++++.+ . +++..+.|+..|. .+|......+++.....|.+++...+++.+. T Consensus 78 ~~~~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~ 157 (502) T protein:vir:79 78 KLEERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINS 157 (502) T ss_pred HHHHhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCc Confidence 999999996 555422 2 2344455554442 2455666667888999999998877665432 Q ss_pred --------eEEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccc Q lcl|NC_013644. 141 --------LCFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPIN 212 (510) Q Consensus 141 --------~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~ 212 (510) +++..++|+.+---+++...+.. ++ +..+. ....-+| ++. ...+. T Consensus 158 ~~~g~~~~l~lq~iepd~l~~~~~~~~~i~~--GV---e~d~~-Gr~~aY~--i~~---------~hPgd---------- 210 (502) T protein:vir:79 158 LTPSAGVHFWLEALEPDFIPMTSDESNRLNQ--GV---FVDDW-GRPEKYL--VYK---------SRPVS---------- 210 (502) T ss_pred cCCCcccceEEEEecchhcCCCCCCCCeeEe--ee---EECCC-CceEEEE--Eee---------cCCCC---------- Confidence 58999999887322222211111 11 11111 1111111 111 10000 Q ss_pred cccccccccccccccccccCCccc---EEEecCC-----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecC Q lcl|NC_013644. 213 PRPHVLAVDSENESLLQRSYGQIP---FYRLSNN-----KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGF 284 (510) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~nn-----~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~ 284 (510) .....+.+|| |+|+... ..|.|+|..++..+..++.....-.......+....+++.. T Consensus 211 --------------~~~~~~~rvpA~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~ 276 (502) T protein:vir:79 211 --------------GRQMETKEVDAERMLHLKFVRRLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKG 276 (502) T ss_pred --------------CcccceeEechhheEEeecccCCccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecC Confidence 0011223455 6666543 45899999999888887775544444444433333344432 Q ss_pred CCC---------chhhhhHhhhcCeeee-ccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccc-ccCcccHH Q lcl|NC_013644. 285 QGD---------DLSKLRQNVKSKKVVG-TGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQV-GDGNITNI 353 (510) Q Consensus 285 ~~~---------~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~g~~Sg~ 353 (510) ... ........+..+.++. +..|.++++.+.+.+...+..++..+...|....++|-.... .++ .|-. T Consensus 277 ~~~~~~~~~~~~~~~~~~~~l~pG~i~~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s-~nyS 355 (502) T protein:vir:79 277 DGQSYEPDGNGSKENERELTIQPGIIYDDLKPGEEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYN-GTYS 355 (502) T ss_pred CCcccccccCCCCCccccccccCCccccccCCCceeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc-chHH Confidence 111 1111122344555554 678889999998888889999999999999998888743221 122 2556 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhccCCcc-----ccceeeEEe--CCCCCCCHHHHHHHHHHHHh Q lcl|NC_013644. 354 VIKARYTLLNMKANKTEARLRALLEW-MNKLVIDDINRRYTKAF-----DPTEVSFTF--TREVMVNETDIVNDEKTEAE 425 (510) Q Consensus 354 Ai~~~~~~l~~k~~~k~~~~~~~l~~-~~~~i~~~~~~~~~~~~-----~~~~v~i~f--~~~~p~d~~e~~~~~~~~~~ 425 (510) +++..+......+...+..|...+-+ +++..+...-..|.-+. ....+.+.| ..-...|....+++...++. T Consensus 356 s~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~ 435 (502) T protein:vir:79 356 AQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIR 435 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHHHHHHHHH Confidence 66776666666666666655543333 44443332222222111 112345666 34445799999999999999 Q ss_pred cCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccc Q lcl|NC_013644. 426 TRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQ 505 (510) Q Consensus 426 ~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (510) +|+.|.+.++...+ .|.++..+++.++.+...+. ......++...........+.+++++++. ...+ T Consensus 436 ~Gl~t~~~~~a~~G-~D~~~v~~q~a~e~~~~~~~-Gl~~~~~~~~~~~~~~~~~~~~e~~~~~~-----------~~e~ 502 (502) T protein:vir:79 436 GGAATESDWVRAGG-RNPDDVKRRRKAEIDENRKL-DLVFDTDPASDKGGSSAATKRQEPQHTDD-----------QSEE 502 (502) T ss_pred cCCCCHHHHHHHcC-CCHHHHHHHHHHHHHHHHHc-CCCCCCCCCCCCCCCCCCCCCCCCCCCCC-----------CCCC Confidence 99999999999885 44444444333333322110 00000111000000000111111111110 0011 No 94 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=99.52 E-value=4.3e-13 Score=88.40 Aligned_cols=452 Identities=9% Similarity=0.028 Sum_probs=223.4 Q ss_pred CCCccCCC-----hhhhHHHHHHHHHhhhhhhhHHHH--HHHHHHhccCCcchhcccceeccccccccccccccceeccc Q lcl|NC_013644. 1 MEALLSED-----VKIIANALKAAIDKDRKSSSKREA--ETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHG 73 (510) Q Consensus 1 ~~~~~~~~-----~~~~~~~i~~~i~~~~~~~~~~~~--~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n 73 (510) |.+-.++. ...+..+|.+.++.+.+....... .++++||.+... + ..... ...++ +++..| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~---~--~~~~~-----~~~~r--~~~~~~ 68 (584) T protein:vir:95 1 MSVKVAELNSLLVRDSSAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDT---T--TTSNQ-----GLPWK--NSTTLP 68 (584) T ss_pred CCcchhhhhhhccccchHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhh---h--hhhhc-----ccccc--cccchh Confidence 54433332 344567777777777655544433 788999987532 1 11110 11112 467778 Q ss_pred hhHHHHHHHHhhhhc----CC-----ceeccCcHH--HHHHHHHHhc-----cCHHHHHHHHHHHHHhcCeEEEEEEECC Q lcl|NC_013644. 74 FFPEIVDQKTQYLLS----NP-----VEYETENEE--LKEYLAEYYN-----SEFQVVLQELVEGSSQKGFEYVYARTNA 137 (510) Q Consensus 74 ~~~~Iv~~~~~~l~g----~p-----~~~~~~d~~--~~~~l~~~~~-----n~~~~~~~e~~~~~~~~G~~~~~v~~d~ 137 (510) .+.-+++..+++|++ +. +.+..++.+ ..+.++.+.. -++...+.++..++.++|.|+..+++.. T Consensus 69 k~~~~~~~i~~~l~~~~Fp~~~w~~~v~~~~~~~~~~~~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~~~~ 148 (584) T protein:vir:95 69 KLCQIRDNLHSNYFSSLFPNDDWLRWVGYGKGDSTKTKAKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVSFEA 148 (584) T ss_pred HHHHHHHHHHHHHHHhhcCccceeeeecCCCchhhHHHHHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEeEee Confidence 887777777777643 21 112223333 3556666653 3677888999999999999998887643 Q ss_pred C-------------CceEEEEEcccceEEEEcCCC---CceeEE-EEEEEE----------------------------- Q lcl|NC_013644. 138 E-------------DRLCFQVADSLNVFGVYNEYN---ELQRIC-RHYITE----------------------------- 171 (510) Q Consensus 138 ~-------------g~~~i~~~~p~~~~~~~d~~~---~~~~~~-~~~~~~----------------------------- 171 (510) . .++++..++|..+| ||.+- +-..++ +.++.. T Consensus 149 ~~~e~~e~~~v~~~~~prieriSP~d~~--~Dpsa~~i~d~~fivrs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~ 226 (584) T protein:vir:95 149 KYKEMTDGTLVPDYIGPRLVRISPLDIV--FNPLATSISDTFKIVRSVKTKGELMRLAQDEPEQSYWLEALKRREEICRH 226 (584) T ss_pred cceeeeccccccccccceEEeeChhhee--ecCCCCCccchhhhhhhhhhHHHHHHHHhhcCccccchHHHHHHHHhccC Confidence 2 26899999999988 45432 111111 111100 Q ss_pred -------EeeCCceeE----EEEEEEEcCCcEEEEEEcCCceeeccccccccccccc-cccccccc--ccccccCCcccE Q lcl|NC_013644. 172 -------IEKDGETVD----IHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHV-LAVDSENE--SLLQRSYGQIPF 237 (510) Q Consensus 172 -------~~~~~~~~~----~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~~~~~~~g~iPv 237 (510) ..+.-.... ....+.|.++.+..+...+ .+..... .....+... ........ ...+.+++.+|+ T Consensus 227 ~~~~~~~~~~~~~~~~~d~~~~~~ey~~~~~V~vl~~~g-~~~~~~~-~e~~~~~iv~v~~g~~iIR~~~np~~~~~~PF 304 (584) T protein:vir:95 227 LGGYSVEDFDKAAGFDVDGFGNLYEYYMSDWVEILEFYG-DYHDKET-GELQTNRIITVVDRSTEVRNESIPTWFGSAPI 304 (584) T ss_pred CCCCcccccccccccccccccccccccCCceeEEEeecc-ccccccc-CCCcccceEEEEeccEEEEeeecCCCCCCCCE Confidence 000000000 0011222232222222111 0000000 000011111 11111111 234566799998 Q ss_pred EEecC-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeE Q lcl|NC_013644. 238 YRLSN-----NKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDV 312 (510) Q Consensus 238 v~~~n-----n~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (510) +.+.. .-+|.|+..-+.++++.+|.+.-.+.+.+..+.+|.+...+.. .+. ..+.+..+.++..+++++ T Consensus 305 ~~~~~~p~~~s~yG~gi~~ll~d~Q~~lna~~r~~iDnl~l~~~pv~k~~~~~----~~~--~~~pg~~~~~~~~~~~q~ 378 (584) T protein:vir:95 305 YHVGWRFRPDNLWAMGPLDNLVGMQYRIDHLENAKADAVDLIIQPPLKIIGEV----EEF--VWGPGAEIHLDQGGDVQE 378 (584) T ss_pred EEEcceeeeccccCCCchhhhhhHHHHHhHHHHHHHHHHHHhcCcceeecccc----chh--cccCCceeecCCCCCcce Confidence 87654 3469999999999999999999999999999999966554431 121 134667788888888899 Q ss_pred EeecC-CHHHHHHHHHHHHHHHHHHhCCccccccc--cCcccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH Q lcl|NC_013644. 313 KTVTI-PTEGRKTKMEIDKENIYKFGMAFDSTQVG--DGNITNIVIKARYTLLNMKANKTEARLRALL-EWMNKLVIDDI 388 (510) Q Consensus 313 ~~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l-~~~~~~i~~~~ 388 (510) +.++. +..+..+.+..+...+-+.|++|...-+. .++.++..+.....++-.-...+.+.|...+ ++++.++..+- T Consensus 379 ~~p~a~~~~s~~~~lq~~e~~me~~sGvp~~~~G~~~~~~~TAtg~s~l~naa~~~~r~~~~~f~~~ll~~l~~ll~~~~ 458 (584) T protein:vir:95 379 IAKNVNYIINADNQIQMLEDRMELYAGAPREAMGIRTPGEKTAFEVQQLGNAAGRIFQEKVTTFEVELLEPVLNAMLETA 458 (584) T ss_pred ecCchhhhhHHHHHHHHHHHHHHhhhCCChhhcccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 88764 44455566788888888899999875442 3333444466666666666777777887776 66666666542 Q ss_pred hhc--cCCcc---------------ccceeeEEeC--CCCCCCHHHH---HHHHHHHHhc-------CCCchHHHHH--- Q lcl|NC_013644. 389 NRR--YTKAF---------------DPTEVSFTFT--REVMVNETDI---VNDEKTEAET-------RKIILESILQ--- 436 (510) Q Consensus 389 ~~~--~~~~~---------------~~~~v~i~f~--~~~p~d~~e~---~~~~~~~~~~-------g~iS~et~~~--- 436 (510) ... ..... ...++.-.|. .--..-..+. .+.+....++ +.++...+.. T Consensus 459 ~~nmd~~~~vr~~n~e~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~q~l~~ilq~~~~~~i~p~~~~~~l~~~la 538 (584) T protein:vir:95 459 TRNMDGSDVIRVMDTDLGVKEFMSVTREDITANGKIRPIGARHFGKQAQDLQNLVGIFNSQIGQMILPHTSGKALATFVD 538 (584) T ss_pred HhhccccCceeeeccccccccccccChhhhccCeeEEeehhhHHHHHHHHHHHHHHHHHhhhhhhccccchHHHHHHHHH Confidence 210 00000 0011111111 0011111122 2222222221 1112211111 Q ss_pred ---hCCC---CCcHHHHHHHHHHHHH--HHHHHHHHHHhhhccCCC Q lcl|NC_013644. 437 ---VAPR---LDDDNVLRLICEQFDL--DWEDVKEALEEAEYTKGL 474 (510) Q Consensus 437 ---~~~~---v~d~e~~~~~~e~~e~--~~~~~~~~~~~~~~~~~~ 474 (510) .+|. ..++-..++..+.+.. +..+....+++-+..+.. T Consensus 539 dl~~~p~~~~~~~~~~~~~Q~~~q~~~~~~q~~~~~~~~~~~~~~~ 584 (584) T protein:vir:95 539 DVTGLQGYEIFRPNVAVAEQAETQSLVAQAQEDLQLQAQMPAEGAI 584 (584) T ss_pred HHhCCCcccccCCCcccchhHHHHhhhHHHHHHHHHHHhhhhccCC Confidence 1231 1111000000000000 000111111111111111 No 95 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.51 E-value=1.1e-13 Score=91.61 Aligned_cols=422 Identities=10% Similarity=0.071 Sum_probs=203.5 Q ss_pred CCCccCCChhhhHH-HHH------------------------------------HHHHhhhhhhhHHHHHHHHHHhccCC Q lcl|NC_013644. 1 MEALLSEDVKIIAN-ALK------------------------------------AAIDKDRKSSSKREAETGIRYYNHEN 43 (510) Q Consensus 1 ~~~~~~~~~~~~~~-~i~------------------------------------~~i~~~~~~~~~~~~~~~~~YY~g~~ 43 (510) |-..-..+...... ... .+-... ...... .+..||.... T Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~-~~~~~~---~~~~~~~~~~ 100 (537) T protein:vir:10 25 VGIFGAGDDEKPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFSAYA-NPNLSE---GLVLWYAQQA 100 (537) T ss_pred cCCCcccchhhHHHHHHhhhhccCCCCCccCcccccccccccchhccccccchhhhhhhc-cccccc---hhhhhccccC Confidence 11111111100000 000 000000 000000 0111111111 Q ss_pred cchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCCceeccCcH-----HHHHHHHHHhc-cCHHHHHH Q lcl|NC_013644. 44 DIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNPVEYETENE-----ELKEYLAEYYN-SEFQVVLQ 117 (510) Q Consensus 44 ~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~-----~~~~~l~~~~~-n~~~~~~~ 117 (510) +..+. .. .-+ -.+.+++.+|+..+.-++-+++.+++++. +..+.|...++ -++...+. T Consensus 101 -~~~~~----------l~----a~Y-~~~~l~r~iVd~~A~d~~r~~~~i~~~~~~~~~~~~~~~l~~~~~~l~~~~~l~ 164 (537) T protein:vir:10 101 -FIGHQ----------MC----ALI-ATHWLVNKACSQMPRDAMRKGYKIISDDGNELDPKDAKFIDRYDRAFNIKKHAI 164 (537) T ss_pred -CccHH----------HH----HHH-HhCchhhhhhhhhhHHhhcCCceeecCCcccccHHHHHHHHHHHHHhhHHHHHH Confidence 00000 00 001 12578999999999999999999987643 23344444443 45778889 Q ss_pred HHHHHHHhcCeEEEEEEECC-CCce----------------EEEEEcccceEEEEcCCC--Cce-eEE---EEEEEEEee Q lcl|NC_013644. 118 ELVEGSSQKGFEYVYARTNA-EDRL----------------CFQVADSLNVFGVYNEYN--ELQ-RIC---RHYITEIEK 174 (510) Q Consensus 118 e~~~~~~~~G~~~~~v~~d~-~g~~----------------~i~~~~p~~~~~~~d~~~--~~~-~~~---~~~~~~~~~ 174 (510) ++.+.+..+|.+++++..+. ++.. .+.+++|.++.|...+.. ++. +-+ ..|.+ T Consensus 165 ~a~~~~rlyG~~~i~i~v~~~D~~~~~~Pl~~~~i~kg~~k~l~vidp~~~~~~~~~~~~~dp~sp~fg~P~~y~v---- 240 (537) T protein:vir:10 165 QFVRKGRIFGIRIALFKVDSPDPYYYEKPFNIDGVMPGAYKGIVQIDPYWCAPLLDAQASSNPVSMHFYEPTYWLI---- 240 (537) T ss_pred HHHHhcccccceEEEEeecCcCCcccccccccccccccceeEEEEechhhcccccchhhhccCCccccCCceeeee---- Confidence 99999999999988876542 2211 244556655554321100 000 000 01110 Q ss_pred CCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEe-cCCCCCCCcHHHH Q lcl|NC_013644. 175 DGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRL-SNNKQETTDLKPI 253 (510) Q Consensus 175 ~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~-~nn~~g~sd~~~v 253 (510) .+ ..+.+.++.+|.... +|-+.- .++-.|+|.++.+ T Consensus 241 ~g--------~~iH~SRli~f~g~~-----------------------------------~p~~~~~~~~~~G~Svlq~~ 277 (537) T protein:vir:10 241 NG--------KKYHRSHLAIYINDE-----------------------------------VVDFLKPSYIYGGVPLPQQI 277 (537) T ss_pred cC--------eEecceeEEEecCCC-----------------------------------CchhhhcccCcccccHHHHH Confidence 00 011233333332110 111110 1223588999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccceeEEecCCC-CchhhhhHh-------hhcCeeeeccCCCceeEEeecCCHHHHHHH Q lcl|NC_013644. 254 KALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQG-DDLSKLRQN-------VKSKKVVGTGSDGGLDVKTVTIPTEGRKTK 325 (510) Q Consensus 254 ~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~-~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (510) .+-+..++.+.-..+..+..+....+.+.+... .+....... ....+++.++.+++ +|-+...+...+... T Consensus 278 ~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l~~~~~~~~r~~~~~~~r~n~g~~~id~e~e-~~e~~~~~lsgl~~~ 356 (537) T protein:vir:10 278 MERVYAAERTANEGPMLAMTKRQTVLKVDAAQVLANKQQFDETMSWWTATRDNYQVRVVDKDNE-DVVQIDTTLNDLDKV 356 (537) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcCHHHHHHHHHHHHhhcCCcceeEecCCCc-eeEEEeccCCCHHHH Confidence 999999999888888888887877776665321 111122111 12234566665432 444444666778889 Q ss_pred HHHHHHHHHHHhCCccccccc----cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCcccccee Q lcl|NC_013644. 326 MEIDKENIYKFGMAFDSTQVG----DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEV 401 (510) Q Consensus 326 ~~~l~~~i~~~s~~p~~~~~~----~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v 401 (510) ++...+.|...+++|-+-.-+ +-|+||..=...|.. .+..++..++..+++++++|+... .+. ..++ T Consensus 357 l~~~~~~iAa~~~IP~t~L~G~sp~GlnatGe~D~~~yyd---~I~~~Qe~l~p~l~~l~~ll~~~~---~~~---~~~~ 427 (537) T protein:vir:10 357 IMNQYQLVCAIARTPAPKMLGTVPTGFNSTGDYEEASYHE---ECESTQDDMRPLIDRHHQLVCRSH---LRK---RIRV 427 (537) T ss_pred HHHHHHHHHhhhCCCceeeccCCccccccchhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhc---CCC---Ccce Confidence 999999999999999864322 224567654444443 344444457889999888877532 222 3468 Q ss_pred eEEeCCCCCCCHHHHHHH-------HHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCC Q lcl|NC_013644. 402 SFTFTREVMVNETDIVND-------EKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGL 474 (510) Q Consensus 402 ~i~f~~~~p~d~~e~~~~-------~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~ 474 (510) ++.|++-...+.+|+++. +.++.++|+||.+.+.+.|..-.+.....+.......+.+... . T Consensus 428 ~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~~~~g~~~l~~~~~~ed~e~~~--~--------- 496 (537) T protein:vir:10 428 KVEFPPMDAPKESERADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMDPTLGFTSITPAMRPTDAEDID--V--------- 496 (537) T ss_pred EEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccCccccccccCCCChhhhhccc--C--------- Confidence 999999999999987764 7888889999988877765321000000000000000000000 0 Q ss_pred CCCCCCcccCCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 475 SDNTDEEETAVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) +...........++++++..+.++.+....+-+++| T Consensus 497 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (537) T protein:vir:10 497 DDEGKPVRIIEDQPAPSEMFGATSSGESANDPRDSG 532 (537) T ss_pred CccCCcCCCCCCCCCccccCCCCccccccCCCccCc Confidence 000000000111111111112222222222233333 No 96 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=99.46 E-value=9.8e-12 Score=80.95 Aligned_cols=473 Identities=11% Similarity=0.078 Sum_probs=208.1 Q ss_pred CCCcc-------CCChhhhHHHHHHHHHhhhhhhhH--HHHHHHHHHhccCCcchhcccceeccccccc---cccccccc Q lcl|NC_013644. 1 MEALL-------SEDVKIIANALKAAIDKDRKSSSK--REAETGIRYYNHENDIMNNRIFYVDDEGILR---EDKYASNV 68 (510) Q Consensus 1 ~~~~~-------~~~~~~~~~~i~~~i~~~~~~~~~--~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~---~~~~~~~~ 68 (510) |...- ..+.+.+...+.+..++++..+.. ...+..+++|.++.+.+++ +++...... ...++ + T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y---~~~~~~~~~~~~~~~~r--s 77 (651) T protein:vir:80 3 LATTTTDKNRQTYDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDY---LRDQVLRSVGDVNADWR--H 77 (651) T ss_pred ccccccchhhhhhhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHh---hccccccccCCCCCCCC--c Confidence 22221 223444566667776666543321 1233333344443221111 111100000 01122 3 Q ss_pred eeccchhHHHHHHHHhhhhc----CC--ceecc-CcHH----HHHHHHHHhc-----cCHHHHHHHHHHHHHhcCeEEEE Q lcl|NC_013644. 69 RIPHGFFPEIVDQKTQYLLS----NP--VEYET-ENEE----LKEYLAEYYN-----SEFQVVLQELVEGSSQKGFEYVY 132 (510) Q Consensus 69 ki~~n~~~~Iv~~~~~~l~g----~p--~~~~~-~d~~----~~~~l~~~~~-----n~~~~~~~e~~~~~~~~G~~~~~ 132 (510) +++.+.....|+.....|+. .+ +.+.+ ++++ ..+.++.++. .+|.....++..+++++|.|.+. T Consensus 78 ~~~~~~v~~~ve~~~~~l~~~~~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~i~k 157 (651) T protein:vir:80 78 KITTGKAFEAIETIHAYLMSATFPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNSVLA 157 (651) T ss_pred cccChhHHHHHHHHHHHHHHhhcCCCceeEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCceEEE Confidence 67888888888877776653 22 22221 2222 3334555542 45777777888999999999998 Q ss_pred EEECC-------------------------------CCceEEEEEcccceEEEEcCCC--CceeEEEEEEEEEee----- Q lcl|NC_013644. 133 ARTNA-------------------------------EDRLCFQVADSLNVFGVYNEYN--ELQRICRHYITEIEK----- 174 (510) Q Consensus 133 v~~d~-------------------------------~g~~~i~~~~p~~~~~~~d~~~--~~~~~~~~~~~~~~~----- 174 (510) ||++. .|.++|..++|.++++--+-.. +-..+++.+...... T Consensus 158 v~we~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~dp~a~~~~d~~~v~~~~~t~~~l~~l~~ 237 (651) T protein:vir:80 158 LPWRVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFYDPNVTDPNRGAFIRKLTKTKADILNLLS 237 (651) T ss_pred EeecceeeeeehheeccccccccccceeeeccceeeeceeEEEEecHHHeeecCCCcCccccceeeeeeeeHHHHHHHHh Confidence 88753 2568899999999886322111 111122222110000 Q ss_pred CC----------------------c-------------eeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccccc Q lcl|NC_013644. 175 DG----------------------E-------------TVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLA 219 (510) Q Consensus 175 ~~----------------------~-------------~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 219 (510) .+ . ......+++|+- +.++..++.+.... ... T Consensus 238 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~--~~~~d~e~~~~~~~-----------~v~ 304 (651) T protein:vir:80 238 EGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEY--WGDIHLENKTYHDV-----------VVT 304 (651) T ss_pred cccccchhhHHHHhhhccccccCCccccccccCCCccccccccceEEEEE--EEEeeccCCceEEE-----------EEE Confidence 00 0 000011111110 00111111111100 000 Q ss_pred ccc-cccccccccC-CcccEEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhh Q lcl|NC_013644. 220 VDS-ENESLLQRSY-GQIPFYRLS-----NNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKL 292 (510) Q Consensus 220 ~~~-~~~~~~~~~~-g~iPvv~~~-----nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~ 292 (510) ..+ .......+++ ..+|++.++ ...+|+|..+.+.+.+..+|.+...+.+.+...++|.+.+..-...++.++ T Consensus 305 ~~g~~il~~~~~~~~~~~Pf~~~~~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~~l 384 (651) T protein:vir:80 305 IMGNEVLRFEQNPYWCGRPFVIGTYIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQPEDV 384 (651) T ss_pred EcCcEEecccccCCCCCCCeeeecceecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccHHHh Confidence 000 0001112221 234665543 345799999999999999999999999999999999987754334444444 Q ss_pred hHhhhcCeeeeccCCCceeEEeec-CCHHHHHHHHHHHHHHHHHHhCCcccccc----ccCcccHHHHHHHHHHHHHHHH Q lcl|NC_013644. 293 RQNVKSKKVVGTGSDGGLDVKTVT-IPTEGRKTKMEIDKENIYKFGMAFDSTQV----GDGNITNIVIKARYTLLNMKAN 367 (510) Q Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~----~~g~~Sg~Ai~~~~~~l~~k~~ 367 (510) . ...++++.++..+++..+... .+.......+..+...|-..+++++...+ ..++.++.++..+...+..... T Consensus 385 ~--~~pg~vi~~~~~~~~~~l~~~~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~ 462 (651) T protein:vir:80 385 Y--TEPGKVFLVSDHGDLQPLANQSSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLS 462 (651) T ss_pred h--cCCCceEEecCCCCceeeccCcccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHH Confidence 2 345667778888888888654 34566678899999999999988875432 2344566666666666666666 Q ss_pred HHHHHHHH-HHHHHHHHHHHHHhhccCCc-----------------cccceeeEEeCCCCCCCHHH---HHHHHHHHHhc Q lcl|NC_013644. 368 KTEARLRA-LLEWMNKLVIDDINRRYTKA-----------------FDPTEVSFTFTREVMVNETD---IVNDEKTEAET 426 (510) Q Consensus 368 ~k~~~~~~-~l~~~~~~i~~~~~~~~~~~-----------------~~~~~v~i~f~~~~p~d~~e---~~~~~~~~~~~ 426 (510) ..-+.|.. +++.+++.++.++...+..+ +...++++.|.- ++....+ ....+..+. T Consensus 463 ~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~i-v~~g~~~~~~r~~~~~~l~-- 539 (651) T protein:vir:80 463 GIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRL-VPIGSDHVIERKQYIEDRL-- 539 (651) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCccceeeeeee-eeccHHHHHHHHHHHHHHH-- Confidence 66666655 55656655555543221100 111123322221 1222211 111111111 Q ss_pred CCCchHHHHHhCCCCCcHH-HHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccC-CCCCCcccccccCcccccc- Q lcl|NC_013644. 427 RKIILESILQVAPRLDDDN-VLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETA-VNPDDPTQQMAEGATGSTE- 503 (510) Q Consensus 427 g~iS~et~~~~~~~v~d~e-~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~- 503 (510) +.-..+...|.+.... ..+... .-.+....++...--.+.+... ..+......+......... T Consensus 540 ---~~~q~~~~~p~~~~~~~~~~~~~-----------~l~~~~g~~~~~~~l~~~~q~~~~~~~~~~~~q~~~~~~~a~~ 605 (651) T protein:vir:80 540 ---TFIQAVAQVPEMGQLVDYKRILV-----------DLLQHWGFEEPEAYLKQQDQQAPANPQEALLSQAKDVGGQAMS 605 (651) T ss_pred ---HHHHhhccCCccchhhhHHHHHH-----------HHHHHcCCCCcHHhcCCCccchhhhhhHHHHhhHHHHHHHHHH Confidence 1111111122111110 000000 0000000000000000000000 0000000000000000000 Q ss_pred -------cccCCCC Q lcl|NC_013644. 504 -------SQLPENG 510 (510) Q Consensus 504 -------~~~~~~~ 510 (510) ++..+.. T Consensus 606 ~~~~~~~~~~~~~~ 619 (651) T protein:vir:80 606 NMLQNQLQADGGTQ 619 (651) T ss_pred HHHHHHHHHHHHHH Confidence 0000000 No 97 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=99.46 E-value=1.1e-13 Score=91.72 Aligned_cols=484 Identities=10% Similarity=0.028 Sum_probs=210.1 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhc--cCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYN--HENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~--g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) |-+.+.+--...-.++...++.. ..-++...+-.+||. |.|= ..-...... ......++| .+++|..+.+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~--~~~r~~~~~D~~f~~~~G~QW-~~~~~~~l~---~~~q~~grP--~~~~N~i~~~ 72 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSPQ--KEVREKCIEATRFARVPGGQW-EGATAAGTK---LDEQFEKYP--KFEINKVATE 72 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHhh--HHHHHHHHHHHHhhcCCCCCC-CHHHHHHHH---HhhhhcCCC--ceEEcchHHH Confidence 33222211111222222232222 233344555556664 6551 000000000 000111233 4778999999 Q ss_pred HHHHHhhhhcCCcee--ccC----cHHHHHHHH----HHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC--------- Q lcl|NC_013644. 79 VDQKTQYLLSNPVEY--ETE----NEELKEYLA----EYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAE--------- 138 (510) Q Consensus 79 v~~~~~~l~g~p~~~--~~~----d~~~~~~l~----~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~--------- 138 (510) |+..+++--.+.+.+ .+. +.+..+.|+ .+.+ ++.......+..++.++|.||+-++.|.. T Consensus 73 v~~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~e~d~~~~~ 152 (708) T protein:vir:10 73 LNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDR 152 (708) T ss_pred HHHHHHHHHhCCcceEEEcCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhcccceeeeeeccccccCCCCCc Confidence 999999988777664 322 333444444 3443 67888899999999999999987765421 Q ss_pred CceEEEE-Ecc-cceEEEEcCC-C--Cce---eEEEEEEEE----------------EeeC--------CceeEEEEEEE Q lcl|NC_013644. 139 DRLCFQV-ADS-LNVFGVYNEY-N--ELQ---RICRHYITE----------------IEKD--------GETVDIHHAEV 186 (510) Q Consensus 139 g~~~i~~-~~p-~~~~~~~d~~-~--~~~---~~~~~~~~~----------------~~~~--------~~~~~~~~~e~ 186 (510) ..+.+.. .+| ..+| ||.. . ++. .+++..... +..+ .....++..++ T Consensus 153 ~~i~i~~~~~p~~~v~--~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~~~~d~v~v~ey 230 (708) T protein:vir:10 153 QRIAIEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKY 230 (708) T ss_pred cccceEEeecchhhcc--cCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccccCCCceEEEEe Confidence 1222222 222 2332 2210 0 000 011000000 0000 00011222333 Q ss_pred EcCCcEE----EEEEcCCceeeccccccc---------cc--------cc------ccccccccccccccccCCcccEEE Q lcl|NC_013644. 187 WTDQNVY----FFVAEDNKDYELDEAEPI---------NP--------RP------HVLAVDSENESLLQRSYGQIPFYR 239 (510) Q Consensus 187 y~~~~i~----~~~~~~~~~~~~~~~~~~---------~~--------~~------~~~~~~~~~~~~~~~~~g~iPvv~ 239 (510) |...... ++....++.......... .. .. .............+.+++.+|+|+ T Consensus 231 ~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~~~p~~~fP~vP 310 (708) T protein:vir:10 231 YEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIP 310 (708) T ss_pred eeEEEEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhhccCCCCCCCceeeEE Confidence 3222111 111111111100000000 00 00 011122222344567778888888 Q ss_pred ecCC-------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCch-hhhhHhhhcCeeee-----ccC Q lcl|NC_013644. 240 LSNN-------KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDL-SKLRQNVKSKKVVG-----TGS 306 (510) Q Consensus 240 ~~nn-------~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~-~~~~~~~~~~~~~~-----~~~ 306 (510) |.-. +.+.|.+.++++.++.+|+..|.+.+.+......+.++........ ..+.........+. .+. T Consensus 311 ~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 390 (708) T protein:vir:10 311 VYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDK 390 (708) T ss_pred EeeeeeccCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHHHHhhccccchhhhcccccccc Confidence 7532 2235788899999999999999999888766665544322111110 01001000000000 011 Q ss_pred CC-------ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 307 DG-------GLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEW 379 (510) Q Consensus 307 ~~-------~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~ 379 (510) .| ....+....-..++...+......|-.+|+.-+...+..+|.||+|+..+-............-+..+.++ T Consensus 391 ~G~~~~~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~~sn~SG~aI~~rq~qg~~~l~~~~Dnl~~~~~~ 470 (708) T protein:vir:10 391 SGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKR 470 (708) T ss_pred ccccccccCCccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 12233333445667888888888999988877665555677899999999888888888888888888888 Q ss_pred HHHHHHHHHhh----------ccCCc-----------cc---------------cceeeEEeCCCCCCCHHHHHHHHHHH Q lcl|NC_013644. 380 MNKLVIDDINR----------RYTKA-----------FD---------------PTEVSFTFTREVMVNETDIVNDEKTE 423 (510) Q Consensus 380 ~~~~i~~~~~~----------~~~~~-----------~~---------------~~~v~i~f~~~~p~d~~e~~~~~~~~ 423 (510) +.++++.++.. .+... .+ ..+|.|.=.+..+.-..+.++.++.+ T Consensus 471 ~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~ql 550 (708) T protein:vir:10 471 AGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNV 550 (708) T ss_pred HHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccCchhHHHHHHHHHHHH Confidence 88877777632 11100 00 01233333344444445566666655 Q ss_pred HhcCCCc-hHH------HHHhCCCCCcHHHHHHHHHH-------------HHHHHHHHHHHH------HhhhccCCCCCC Q lcl|NC_013644. 424 AETRKII-LES------ILQVAPRLDDDNVLRLICEQ-------------FDLDWEDVKEAL------EEAEYTKGLSDN 477 (510) Q Consensus 424 ~~~g~iS-~et------~~~~~~~v~d~e~~~~~~e~-------------~e~~~~~~~~~~------~~~~~~~~~~~~ 477 (510) ..+..-. ..+ +++++.+.--++..+++... +..........+ ............ T Consensus 551 l~~~~p~~~~~~~~~~~~l~~~D~p~~~ei~erir~~~~~~~~~~~~~~ee~q~~~~~q~~~q~q~~~~~~e~qa~~~~~ 630 (708) T protein:vir:10 551 LSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAA 630 (708) T ss_pred HHhcCCCchhhHHHHHHHHHhcCCcChHHHHHHHHHhhcccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5543211 111 22333221111221111110 000000000000 000000000000 Q ss_pred -CCCcccCCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 478 -TDEEETAVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 478 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) ....+ .+. ...++-.++ T Consensus 631 qAe~~k--------a~a--------~a~~~~~~a 648 (708) T protein:vir:10 631 QAEAQK--------ATN--------ETAQTQIKA 648 (708) T ss_pred HHHHHH--------HHH--------HHHHHHHHH Confidence 00000 000 001111111 No 98 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.45 E-value=3e-12 Score=83.73 Aligned_cols=422 Identities=11% Similarity=0.006 Sum_probs=202.9 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |+.+..+-.-... ..+.--.-.+..+. .+..||....-+-+ . .. .-+ -.+.+++.||+ T Consensus 71 ~ds~~~~~~~~~~---~~~~~~~~~~~~~~---~~~~~~~~~~f~gy-q----------l~----alY-~~~~l~rkiVd 128 (765) T protein:vir:96 71 MDSAYGDGPTPAA---KAAAGGQNPYVVPT---MLQDWYNSQGFIGY-Q----------AC----AII-SQHWLVDKACS 128 (765) T ss_pred ccccccccccchH---HHhhhccCccchhh---HHHhhhcccCCccH-H----------HH----HHH-HhCchhhhhhh Confidence 7666433332211 11111100111111 12233333211100 0 00 001 12568899999 Q ss_pred HHHhhhhcCCceeccCcHH----HHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC-Cc-------------- Q lcl|NC_013644. 81 QKTQYLLSNPVEYETENEE----LKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAE-DR-------------- 140 (510) Q Consensus 81 ~~~~~l~g~p~~~~~~d~~----~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~-g~-------------- 140 (510) ..+.-++.+++.+++++++ ..+.|+..++ =++...+.++.+.+-.||.+|+++-.+.. +. T Consensus 129 ~pAeDa~R~g~~I~~~~~e~~~~~~~~l~~~~~rl~v~~~l~ea~~~~RlyGga~i~i~i~~~D~~~l~~PL~~~~I~kg 208 (765) T protein:vir:96 129 MSGEDAARNGWELKSDGRKLSDEQSALIARRDMEFRVKDNLVELNRFKNVFGVRIALFVVESDDPDYYEKPFNPDGIAPG 208 (765) T ss_pred cchHHhhcCCceeecCccccCHHHHHHHHHHHHHhhHHHHHHHHHHHhhhceeeEEEEEecccCcchhhccccccccccc Confidence 9999999999999876432 2334444443 35778899999999999999887765422 11 Q ss_pred -e-EEEEEcccceEEEEcC--CCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccc Q lcl|NC_013644. 141 -L-CFQVADSLNVFGVYNE--YNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPH 216 (510) Q Consensus 141 -~-~i~~~~p~~~~~~~d~--~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 216 (510) + .|..++|.++.|.... ..++..- .++... .+.+.-..+ .+.++.+|... T Consensus 209 ~~kgl~vldp~~~~~~~v~e~~~Dp~sp-~fg~P~------~y~i~g~~I-H~SRli~~~g~------------------ 262 (765) T protein:vir:96 209 SYKGISQIDPYWAMPQLTAESTADPSAE-HFYEPD------FWIISGKKY-HRSHLVVVRGP------------------ 262 (765) T ss_pred eeeEEEEechhhcccccchhcccccccc-ccCcce------eeeecCcee-ccceEEEecCC------------------ Confidence 1 1344455444442100 0000000 000000 000000001 12222222100 Q ss_pred cccccccccccccccCCcccEEEe-cCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCC-CchhhhhH Q lcl|NC_013644. 217 VLAVDSENESLLQRSYGQIPFYRL-SNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQG-DDLSKLRQ 294 (510) Q Consensus 217 ~~~~~~~~~~~~~~~~g~iPvv~~-~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~-~~~~~~~~ 294 (510) .+|-+.- .++-.|+|.++.+.+-+..++.+.-..+..+..+....+.+.+... .+...... T Consensus 263 -----------------~lpd~lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~v~k~~~~~~l~~~~~l~~ 325 (765) T protein:vir:96 263 -----------------QPPDILKPTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSKRTSTIHVDVEKAIANEDAFNA 325 (765) T ss_pred -----------------CchhhhccccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeeechHhhhccHHHHHH Confidence 0111100 1233588999999999999999888887777777777666555321 11122211 Q ss_pred -------hhhcCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc----cCcccHHHHHHHHHHHH Q lcl|NC_013644. 295 -------NVKSKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG----DGNITNIVIKARYTLLN 363 (510) Q Consensus 295 -------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~----~g~~Sg~Ai~~~~~~l~ 363 (510) .....+++.++.+.+.+.++ .+...+...++...+.|...+++|-+-.-+ +-|+||..=...|...+ T Consensus 326 r~~~~~~~r~n~g~~~id~ee~~e~~s--~~lsgl~d~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe~D~~nYyD~I 403 (765) T protein:vir:96 326 RLAFWIANRDNHGVKVIGIDETMEQFD--TNLSDFDSVIMNQYQLVAAIAKTPATKLLGTSPKGFNATGEHETISYHEEL 403 (765) T ss_pred HHHHHHHhcCCceeEEecCCcceeEEe--cccCCHHHHHHHHHHHHHhhhCCCeeeeccCCcccccCcchHHHHHHHHHH Confidence 11233466677766555544 567778899999999999999999753322 22567764333333333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHH-------HHHHHhcCCCchHHHHH Q lcl|NC_013644. 364 MKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVND-------EKTEAETRKIILESILQ 436 (510) Q Consensus 364 ~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~-------~~~~~~~g~iS~et~~~ 436 (510) . ...+..+...|++++.+|+.. +. ++ .+++|.|++-...+++|+++. ++++.++|+||...+.+ T Consensus 404 ~--s~Qe~~l~p~le~L~~li~~s----~~--i~-~d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~Gvis~dEvR~ 474 (765) T protein:vir:96 404 E--SIQEHIFDPLLERHYLLLAKS----ES--ID-VQLEIVWNPVDSTTSQQQAELNNKKAATDEIYINSGVVSPDEVRE 474 (765) T ss_pred H--HHHHHHHHHHHHHHHHHHHHh----cC--CC-CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHH Confidence 2 222366788899888887753 11 12 268999999998999888765 77788899999888887 Q ss_pred hCC------C--CCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccccC- Q lcl|NC_013644. 437 VAP------R--LDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLP- 507 (510) Q Consensus 437 ~~~------~--v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 507 (510) .+. + +++.+.+.. ................+.......+..++ ..+.+....+.+....| T Consensus 475 ~L~~~~~~g~~~l~d~~~e~~----------~~~~pe~~~~~~~~~~~~~~~~~e~~~~~--a~p~~~eg~~~~~~~~p~ 542 (765) T protein:vir:96 475 RLRDDPRSGYNRLTDDQAETE----------PGMSPENLAELEKAGAQSAKAKGEAERAE--AQAGAVEGAGDPVPAAPR 542 (765) T ss_pred HHhccccCCCCCCCccccccc----------cCCCccccccccCCCcccccccCcccccc--CCCCccCCCCcccccCCc Confidence 652 1 222221100 00000000000000000000000000000 00000000111111122 Q ss_pred --------------------CCC Q lcl|NC_013644. 508 --------------------ENG 510 (510) Q Consensus 508 --------------------~~~ 510 (510) .+. T Consensus 543 ~~~p~~~~~~~~~g~~~~~p~~~ 565 (765) T protein:vir:96 543 GTKPLAKAAEEGAGEAATPPSRP 565 (765) T ss_pred ccCCccccccccCccccCccccc Confidence 111 No 99 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=99.44 E-value=2.6e-12 Score=84.05 Aligned_cols=473 Identities=10% Similarity=0.003 Sum_probs=202.9 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |-+. .........++...++.. ..-+....+-.+||.|.+=- .-... ..+...+| ++|..+.+|+ T Consensus 1 m~d~-~~~~~~~~~~~~~~~~~~--~~~r~~a~~d~~fy~G~Qw~-~~~~~-------~l~~q~rp----~~N~i~~~i~ 65 (725) T protein:vir:77 1 MADN-ENRLESILSRFDADWTAS--DEARREAKNDLFFSRVSQWD-DWLSQ-------YTTLQYRG----QFDVVRPVVR 65 (725) T ss_pred CCch-HHHHHHHHHHHHHHHHhh--HHHHHHHHHHHHhhCCCCCC-HHHHH-------HHHhcCCC----ccccHHHHHH Confidence 3221 111122223333333333 23445677889999998710 00000 11122333 5688999999 Q ss_pred HHHhhhhcCCcee--cc---CcHHHHHHHHH----Hhc-cCHHHHHHHHHHHHHhcCeEEEEEEEC---C---CCceEEE Q lcl|NC_013644. 81 QKTQYLLSNPVEY--ET---ENEELKEYLAE----YYN-SEFQVVLQELVEGSSQKGFEYVYARTN---A---EDRLCFQ 144 (510) Q Consensus 81 ~~~~~l~g~p~~~--~~---~d~~~~~~l~~----~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d---~---~g~~~i~ 144 (510) ..+++---+.+.+ .+ ++.+..+.|+. +.+ ++.......+..+++++|.||+-++.| . ++.++|. T Consensus 66 ~v~g~~~~nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~ 145 (725) T protein:vir:77 66 KLVSEMRQNPIDVLYRPKDGARPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIR 145 (725) T ss_pred HHHhhHHhCCcceEEecCCccHHHHHHHHHHHHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeE Confidence 9888876666554 33 33444444443 333 677888889999999999999877644 2 2334443 Q ss_pred EE----cccceEEEEcCCCCcee-----EEEEEEEE---------------------------EeeCCceeEEEEEEEEc Q lcl|NC_013644. 145 VA----DSLNVFGVYNEYNELQR-----ICRHYITE---------------------------IEKDGETVDIHHAEVWT 188 (510) Q Consensus 145 ~~----~p~~~~~~~d~~~~~~~-----~~~~~~~~---------------------------~~~~~~~~~~~~~e~y~ 188 (510) .. ++.++|.-++ ..++.. +++..... ..+-.....++.+++|. T Consensus 146 ~~~~~~~~~~v~~Dp~-a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~E~~~ 224 (725) T protein:vir:77 146 REPIHSACSHVIWDSN-SKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYE 224 (725) T ss_pred EeecccChhhceeCch-hhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCeeEEEEEEE Confidence 32 3444442211 111000 00000000 00000112344455555 Q ss_pred CCcEE--EEEEcC--Cceeecccccccc---------cc-------------ccc-ccccccccccccccCCcccEEEec Q lcl|NC_013644. 189 DQNVY--FFVAED--NKDYELDEAEPIN---------PR-------------PHV-LAVDSENESLLQRSYGQIPFYRLS 241 (510) Q Consensus 189 ~~~i~--~~~~~~--~~~~~~~~~~~~~---------~~-------------~~~-~~~~~~~~~~~~~~~g~iPvv~~~ 241 (510) ...+. .+...+ ++........... +. -+. ..+.....+..+.+.+.+|+|+|. T Consensus 225 r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~~~~~~~P~vP~~ 304 (725) T protein:vir:77 225 VVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVF 304 (725) T ss_pred EEEEeeEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCcCCCCccceEEEe Confidence 33222 122211 1111000000000 00 000 111111123334444666776653 Q ss_pred C---C----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeE-EecCCCCchhhhhHhhhcCee-----eeccCC- Q lcl|NC_013644. 242 N---N----KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYV-VSGFQGDDLSKLRQNVKSKKV-----VGTGSD- 307 (510) Q Consensus 242 n---n----~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv-~~g~~~~~~~~~~~~~~~~~~-----~~~~~~- 307 (510) - . +.+.|.+.++++.++.+|...|.+...+.....-..+ ..|. .+.............. +....| T Consensus 305 g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~-i~~~~~~~~~~~~~~~~~~~~~~~~~g~ 383 (725) T protein:vir:77 305 GEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQ-IAGFEHMYDGNDDYPYYLLNRTDENSGD 383 (725) T ss_pred eeeeccCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhh-hhHHHHHHHhccCCceecccccccCCCc Confidence 2 2 2234788899999999999999998888655443332 2222 1111111111111111 111111 Q ss_pred ---CceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 308 ---GGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGN-ITNIVIKARYTLLNMKANKTEARLRALLEWMNKL 383 (510) Q Consensus 308 ---~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~ 383 (510) +.+..+..+.=...+...++.....|-.+|++-+-..+..+| .||+|+..+-......+.....-+..+.+++.++ T Consensus 384 ~~~~~i~~~~~~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~ 463 (725) T protein:vir:77 384 LPTQPLAYYENPEVPQANAYMLEAATSAVKEVATLGVDTEAVNGGQVAFDTVNQLNMRADLETYVFQDNLATAMRRDGEI 463 (725) T ss_pred ccccCccccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 122333333334566778888888998888766655555554 6999999998888888888888888888888777 Q ss_pred HHHHHhhccC---------Cccc--------------------------cceeeEEeCCCCCCCHHHHHHHHHHHHhcCC Q lcl|NC_013644. 384 VIDDINRRYT---------KAFD--------------------------PTEVSFTFTREVMVNETDIVNDEKTEAETRK 428 (510) Q Consensus 384 i~~~~~~~~~---------~~~~--------------------------~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~ 428 (510) ++.++..... .+-. ..+|.|.=.+..+.=..+.+..++.+..... T Consensus 464 lL~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~ 543 (725) T protein:vir:77 464 YQSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTP 543 (725) T ss_pred HHHHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeeccchHHHHHHHHHHHHHHHHhcc Confidence 7776532110 0000 0122222222222222233333333332211 Q ss_pred --Cch--HHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCccccccc Q lcl|NC_013644. 429 --IIL--ESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTES 504 (510) Q Consensus 429 --iS~--et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (510) .+. -++...++..+-+...+...... +..+.........+++. .......+....+ T Consensus 544 ~~~~~~~~~l~~~~~l~d~~~~~e~~erir-----------kq~~~~~~~q~~~~~e~---------q~~~~~qq~~~~q 603 (725) T protein:vir:77 544 QGTPEYQLLLLQYFTLLDGKGVEMMRDYAN-----------KQLIQMGVKKPETPEEQ---------QWLVEAQQAKQGQ 603 (725) T ss_pred ccchhHHHHHHHhhccccchHHHHHHHHHH-----------hhhhhhhccCCCChhhH---------HHHHHHHHHHHHh Confidence 111 11222222222111111110000 00000000000000000 0000000000011 Q ss_pred ccCC---------CC Q lcl|NC_013644. 505 QLPE---------NG 510 (510) Q Consensus 505 ~~~~---------~~ 510 (510) +-++ ++ T Consensus 604 ~~~e~~q~q~~~~~~ 618 (725) T protein:vir:77 604 QDPAMVQAQGVLLQG 618 (725) T ss_pred HHHHHHHHHHHHHHH Confidence 1110 01 No 100 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=99.39 E-value=6.7e-12 Score=81.85 Aligned_cols=470 Identities=10% Similarity=0.004 Sum_probs=203.9 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |-+- .........++...++.. ..-|+...+-.+||.|.|= ..-.. ...+...+| ++|..+.+|+ T Consensus 1 m~d~-~~~~~~~~~~~~~~~~~~--~~~R~~a~~d~~fy~G~QW-~~~~~-------~~l~~q~rp----~~N~i~~~v~ 65 (725) T protein:vir:10 1 MADN-ENRLESILSRFDADWTAS--DEARREAKNDLFFSRVSQW-DDWLS-------QYTTLQYRG----QFDVVRPVVR 65 (725) T ss_pred CCch-HHHHHHHHHHHHHHHHhh--HHHHHHHHHHHHhhcCCCC-CHHHH-------HHHHhcCCC----cccchHHHHH Confidence 3221 111122223333333333 2345567888999999871 00000 011222333 4699999999 Q ss_pred HHHhhhhcCCcee--cc---CcHHHHHHHHH----Hhc-cCHHHHHHHHHHHHHhcCeEEEEEEEC---CC---CceEEE Q lcl|NC_013644. 81 QKTQYLLSNPVEY--ET---ENEELKEYLAE----YYN-SEFQVVLQELVEGSSQKGFEYVYARTN---AE---DRLCFQ 144 (510) Q Consensus 81 ~~~~~l~g~p~~~--~~---~d~~~~~~l~~----~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d---~~---g~~~i~ 144 (510) ..+++---+.+.+ .+ ++.+..+.|+. +.+ ++.......+..+++++|.||+-|..| .+ +.++|. T Consensus 66 ~v~g~e~~nr~d~~v~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~ 145 (725) T protein:vir:10 66 KLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIR 145 (725) T ss_pred HHHhhHHhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccCCCCCCCceeee Confidence 9999877665554 33 33444444443 333 677788889999999999999877533 22 234333 Q ss_pred EE----cccceEEEEcC-CCCce-----eEEEEEEEEE---------------------------eeCCceeEEEEEEEE Q lcl|NC_013644. 145 VA----DSLNVFGVYNE-YNELQ-----RICRHYITEI---------------------------EKDGETVDIHHAEVW 187 (510) Q Consensus 145 ~~----~p~~~~~~~d~-~~~~~-----~~~~~~~~~~---------------------------~~~~~~~~~~~~e~y 187 (510) .+ ++.++| ||. ..++. .+++.+.... .+......++.+++| T Consensus 146 ~~~i~~~~~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~vrv~E~~ 223 (725) T protein:vir:10 146 REPIHSACSHVI--WDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQDTIQIAEFY 223 (725) T ss_pred eeecccCHhHcc--cCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCCeEEEEEEE Confidence 32 344444 442 11111 0111110000 000001123333444 Q ss_pred cCCcE--EEEEEcC--Cceeeccccccccc----------------------cccc-ccccccccccccccCCcccEEEe Q lcl|NC_013644. 188 TDQNV--YFFVAED--NKDYELDEAEPINP----------------------RPHV-LAVDSENESLLQRSYGQIPFYRL 240 (510) Q Consensus 188 ~~~~i--~~~~~~~--~~~~~~~~~~~~~~----------------------~~~~-~~~~~~~~~~~~~~~g~iPvv~~ 240 (510) ....+ ..|...+ ++.........+.. .-+. ..+.....+..+.+.+.+|+|+| T Consensus 224 ~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~fP~vP~ 303 (725) T protein:vir:10 224 EVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPV 303 (725) T ss_pred EEEEEeeEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCCCCCCceeEEEE Confidence 33211 1111111 11111100000000 0000 11111112223444456777665 Q ss_pred cC---C----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeec---c-CCC- Q lcl|NC_013644. 241 SN---N----KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGT---G-SDG- 308 (510) Q Consensus 241 ~n---n----~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~---~-~~~- 308 (510) .- . +.+.|.+.++++.++.+|...|.+...+........+...-..+..............+.. . .+| T Consensus 304 ~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~~~~~~~~~~~~~~~~~~g~ 383 (725) T protein:vir:10 304 FGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGE 383 (725) T ss_pred EeeeeccCCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHhccCCceeeecccccccCcc Confidence 32 1 2234889999999999999999999888655544333221111111111111111111111 1 111 Q ss_pred ----ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 309 ----GLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGN-ITNIVIKARYTLLNMKANKTEARLRALLEWMNKL 383 (510) Q Consensus 309 ----~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~ 383 (510) .+.+...+.-..++...+......|-.++++-+...+..+| .||+|+..+-............-+..+.+++.++ T Consensus 384 ~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~ 463 (725) T protein:vir:10 384 MPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEI 463 (725) T ss_pred cccccCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12333333344577778899999999999877655555454 6999999998888887777777788888887777 Q ss_pred HHHHHhhc----------cCC---cc---c-------------------cceeeEEeCCCCCCCHHHHHHHHHHHHhcC- Q lcl|NC_013644. 384 VIDDINRR----------YTK---AF---D-------------------PTEVSFTFTREVMVNETDIVNDEKTEAETR- 427 (510) Q Consensus 384 i~~~~~~~----------~~~---~~---~-------------------~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g- 427 (510) ++.++... +.. .+ + ..+|.|.=.+..+.=..+.+..++.+...- T Consensus 464 lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~ 543 (725) T protein:vir:10 464 YQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRSEILELLGKTP 543 (725) T ss_pred HHHHHHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeeccCcHHHHHHHHHHHHHHHHhcc Confidence 77765321 100 00 0 012333332332222223333333333221 Q ss_pred -CCch--HHHHHhCCCCCcH---HHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccc Q lcl|NC_013644. 428 -KIIL--ESILQVAPRLDDD---NVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGS 501 (510) Q Consensus 428 -~iS~--et~~~~~~~v~d~---e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (510) ..+. ..++..++..+-+ +..+++..+ .+.........+++. .+.....+.. T Consensus 544 ~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq--------------~~~~~~~~~~~~e~~---------q~~~e~qq~~ 600 (725) T protein:vir:10 544 QGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQ--------------LIQMGVKKPETPEEQ---------QWLVEAQQAK 600 (725) T ss_pred ccchhHHHHHHHHhhcCCchhHHHHHHHHHhh--------------hhhhccCCccccchh---------HHHHHHHHHH Confidence 0111 2233333322211 111111100 000000000000000 0000000111 Q ss_pred cccccCCC---------C Q lcl|NC_013644. 502 TESQLPEN---------G 510 (510) Q Consensus 502 ~~~~~~~~---------~ 510 (510) ..++-+.. + T Consensus 601 ~~q~~~e~~q~~~~~~~~ 618 (725) T protein:vir:10 601 QGQQDPAMVQAQGVLLQG 618 (725) T ss_pred HhhhHHHHHHHHHHHHHH Confidence 11111100 1 No 101 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.37 E-value=4.9e-11 Score=77.10 Aligned_cols=436 Identities=10% Similarity=0.017 Sum_probs=214.7 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccce-----eccccc----cccccccc-ccee Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFY-----VDDEGI----LREDKYAS-NVRI 70 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~-----~~~~~~----~~~~~~~~-~~ki 70 (510) ++-.+.- .. ...... .......|++-..-....... ...+.. ......++ +.-. T Consensus 11 ~dr~i~~------~~-~~~~~~---------~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~r 74 (505) T protein:vir:96 11 AQRMVNW------AW-YRYVEP---------QKNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSI 74 (505) T ss_pred hhcccch------hh-hhhHHH---------HHHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHh Confidence 2233221 00 101000 011123344332110000000 000000 00000000 0011 Q ss_pred ccchhHHHHHHHHhhhhc-CCceeccC--------cHHHHHHHHHHhc---c----------CHHHHHHHHHHHHHhcCe Q lcl|NC_013644. 71 PHGFFPEIVDQKTQYLLS-NPVEYETE--------NEELKEYLAEYYN---S----------EFQVVLQELVEGSSQKGF 128 (510) Q Consensus 71 ~~n~~~~Iv~~~~~~l~g-~p~~~~~~--------d~~~~~~l~~~~~---n----------~~~~~~~e~~~~~~~~G~ 128 (510) .+++++-+|+..+..++| .++++.+. +++..+.|+..|. . +|......+++.....|. T Consensus 75 Nn~~a~~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE 154 (505) T protein:vir:96 75 NNPYAKRFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGE 154 (505) T ss_pred cChHHHHHHHHHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCc Confidence 246999999999999999 68877542 5666666655542 1 244455667788899999 Q ss_pred EEEEEEECCCC--ceEEEEEcccceEEEEcCC--CCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceee Q lcl|NC_013644. 129 EYVYARTNAED--RLCFQVADSLNVFGVYNEY--NELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYE 204 (510) Q Consensus 129 ~~~~v~~d~~g--~~~i~~~~p~~~~~~~d~~--~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~ 204 (510) +++.......+ .+++..++|+.+---++.. ....-.-++ +.... ....-+++---.|+..+... T Consensus 155 ~f~~~~~~~~~~~~~~lqliepd~l~~~~n~~~~~~~~i~~GI---e~d~~-Gr~~aY~i~~~hPgd~~~~~-------- 222 (505) T protein:vir:96 155 VLVREHRGYPNKWGYALQILECDRLDLNYNADLQNGNRIRMSI---ELDAW-ERPVAYHLLVNHPGDNSYCY-------- 222 (505) T ss_pred eEEEEeecCCCCcceEEEEechhhcCCCCCcccCCcCeEEece---EECCC-CceEEEEEeecCCCcccccc-------- Confidence 98876554433 2689999998874322210 000001111 11111 11111111100111110000 Q ss_pred cccccccccccccccccccccccccccCCccc---EEEecCC-----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_013644. 205 LDEAEPINPRPHVLAVDSENESLLQRSYGQIP---FYRLSNN-----KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAE 276 (510) Q Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~nn-----~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~ 276 (510) ......+.+|| |+|+... ..|.|+|..++..+..++.............+. T Consensus 223 ---------------------~~~~~~~~rvpa~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~ 281 (505) T protein:vir:96 223 ---------------------HYAGQTYERVPADEIIHTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAK 281 (505) T ss_pred ---------------------ccccccccccCHhHhhhhhcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhh Confidence 00112234555 4554432 468999999998887777655554444444343 Q ss_pred ceeEEecCCC-------CchhhhhHhhhcCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccc-cccC Q lcl|NC_013644. 277 AIYVVSGFQG-------DDLSKLRQNVKSKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQ-VGDG 348 (510) Q Consensus 277 ~~lv~~g~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~~g 348 (510) --.+++.... +..+.....+..+.+..+..|.++++++.+.+...+..++..+.+.|....++|-... ...+ T Consensus 282 ~a~fi~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s 361 (505) T protein:vir:96 282 KVGFYEQDPEAYDQPPEDDQGEIVEEVEAGTYQLLPYGIRFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLE 361 (505) T ss_pred heeeeecCCccCCCccccccCccccccCCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc Confidence 3344443211 1111223345667777788899999999988889999999999999999888774332 2345 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhccCCcc---ccc-eeeEEeC--CCCCCCHHHHHHHHH Q lcl|NC_013644. 349 NITNIVIKARYTLLNMKANKTEARLRA-LLEWMNKLVIDDINRRYTKAF---DPT-EVSFTFT--REVMVNETDIVNDEK 421 (510) Q Consensus 349 ~~Sg~Ai~~~~~~l~~k~~~k~~~~~~-~l~~~~~~i~~~~~~~~~~~~---~~~-~v~i~f~--~~~p~d~~e~~~~~~ 421 (510) ++|-.+.+..+......+...+..|.. .++.+++..+...-..|.-+. +.. -..+.|. .-.-.|....+++.. T Consensus 362 ~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~ 441 (505) T protein:vir:96 362 GVNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHS 441 (505) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccCCccccChHHHHHHHH Confidence 556666777777777777766666654 333355544443322222111 111 1345553 333469999999999 Q ss_pred HHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccc Q lcl|NC_013644. 422 TEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGS 501 (510) Q Consensus 422 ~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (510) .++.+|+.|.+.++...+ .|.++..+++.++.+...+ ....+ . .+... ....+..+++ ..++. T Consensus 442 ~~i~~G~~t~~~~~a~~G-~D~~~v~~q~a~e~~~~~~-----~Gl~~------~-~~~~~---~~~~~~~~~~-~~~~d 504 (505) T protein:vir:96 442 ESIKNRTRSRSSIIRAAG-DDPEDVFDEIAWEEQLMRD-----KGVNP------T-PPEQE---SKDATTDEED-DSASD 504 (505) T ss_pred HHHHcCCCCHHHHHHHcC-CCHHHHHHHHHHHHHHHHH-----cCCCC------C-CCCCC---CCCCCCCCCC-CCCCC Confidence 999999999999999885 4444444433333322111 10000 0 00000 0000000001 11111 Q ss_pred c Q lcl|NC_013644. 502 T 502 (510) Q Consensus 502 ~ 502 (510) + T Consensus 505 ~ 505 (505) T protein:vir:96 505 D 505 (505) T ss_pred C Confidence 1 No 102 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=99.36 E-value=1.6e-11 Score=79.81 Aligned_cols=484 Identities=10% Similarity=0.029 Sum_probs=209.1 Q ss_pred CCCccCCChhhhH----HHHHHHHHhhhhhhhHHHHHHHHHHhc--cCCcchhcccceeccccccccccccccceeccch Q lcl|NC_013644. 1 MEALLSEDVKIIA----NALKAAIDKDRKSSSKREAETGIRYYN--HENDIMNNRIFYVDDEGILREDKYASNVRIPHGF 74 (510) Q Consensus 1 ~~~~~~~~~~~~~----~~i~~~i~~~~~~~~~~~~~~~~~YY~--g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~ 74 (510) |-. +...+. ..+...++.. ...++....-++||. |.|= ..-...... ......++| .+++|. T Consensus 1 m~e----~~~~~~~~~~~~~~~~~~~~--~~~r~~~~~d~~f~~~~G~QW-~~~~~~~l~---~~~q~~grP--~~~~N~ 68 (706) T protein:vir:10 1 MAE----SRQKQHERVMLRFDRAWSPQ--QVVREKCIEATRFVRVPGGQW-EGATVAGTK---LDEQFEKYP--KFEINK 68 (706) T ss_pred CCc----chHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHhhccCCccC-CHHHHHHHH---hhhhhcCCC--ceEecc Confidence 332 222121 2222222222 344555666677774 5541 000000000 000111333 578999 Q ss_pred hHHHHHHHHhhhhcCCcee--cc----CcHHHHHHHHH----Hhc-cCHHHHHHHHHHHHHhcCeEEEEEEECC------ Q lcl|NC_013644. 75 FPEIVDQKTQYLLSNPVEY--ET----ENEELKEYLAE----YYN-SEFQVVLQELVEGSSQKGFEYVYARTNA------ 137 (510) Q Consensus 75 ~~~Iv~~~~~~l~g~p~~~--~~----~d~~~~~~l~~----~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~------ 137 (510) .+.+|+..+++.--+.+.+ .+ ++.+..+.|+. +.+ ++.......+..+++++|.||.-++.|- T Consensus 69 i~~~v~~v~g~~~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~~~~d~ 148 (706) T protein:vir:10 69 VATELNRIISEYRNNRISVKFRPGDNAASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTTSFVNEYDP 148 (706) T ss_pred hHHHHHHHhhHHHhCCCceEEecCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEeeeccccccCC Confidence 9999999999987776554 32 23344444443 333 6788889999999999999998886541 Q ss_pred ---CCceEEEEE-cccceEEEEcCC-C--Cce---eEEEEEEEEEe-----------------------eCCceeEEEEE Q lcl|NC_013644. 138 ---EDRLCFQVA-DSLNVFGVYNEY-N--ELQ---RICRHYITEIE-----------------------KDGETVDIHHA 184 (510) Q Consensus 138 ---~g~~~i~~~-~p~~~~~~~d~~-~--~~~---~~~~~~~~~~~-----------------------~~~~~~~~~~~ 184 (510) ++.+.+..+ +|... +.||.. . ++. .+++....... +......+... T Consensus 149 ~~~~~~i~i~~v~~p~~~-v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~~~d~~~~~ 227 (706) T protein:vir:10 149 MDERQRIAVEPIYDPARS-VWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWFTPDVVYIA 227 (706) T ss_pred CCCCccceeeeeccchhc-eecCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhhhccccccccccCCCcceec Confidence 123444433 45422 124421 1 111 11111000000 00001122233 Q ss_pred EEEcCCcE----EEEEEcCCceeeccccccc-cccc---------------------cc-ccccccccccccccCCcccE Q lcl|NC_013644. 185 EVWTDQNV----YFFVAEDNKDYELDEAEPI-NPRP---------------------HV-LAVDSENESLLQRSYGQIPF 237 (510) Q Consensus 185 e~y~~~~i----~~~~~~~~~~~~~~~~~~~-~~~~---------------------~~-~~~~~~~~~~~~~~~g~iPv 237 (510) +.|+.... .+|.....+.......... .... +. ........+..+.+.+++|+ T Consensus 228 eyy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l~~~~p~~~~~~P~ 307 (706) T protein:vir:10 228 KYYEVRKESVDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFLEKPRRIPGEHIPL 307 (706) T ss_pred ccccccceeEEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeeccccccccCCCCCCCccce Confidence 43443221 1111111110100000000 0000 00 01111112233455588888 Q ss_pred EEecCC-------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcC-----eeee-- Q lcl|NC_013644. 238 YRLSNN-------KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSK-----KVVG-- 303 (510) Q Consensus 238 v~~~nn-------~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~-----~~~~-- 303 (510) |+|.-. ....|.+.++++.++.+|...|.+.+.+........+ |. .++...+....... ..+. T Consensus 308 vP~~g~r~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~--~~-~~~i~~~~~~~~~~~~~~~~~l~~~ 384 (706) T protein:vir:10 308 IPVYGKRWFIDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTPI--VD-MEQIRGLEQHWEGRNRKRPAFLPLR 384 (706) T ss_pred EEEeeccccccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCcccc--cc-hhHHHHHHHHhhhcccccccchhcc Confidence 887432 2245788899999999999999999887544443222 21 11111111100000 0000 Q ss_pred -cc-CCCc-------eeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 304 -TG-SDGG-------LDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLR 374 (510) Q Consensus 304 -~~-~~~~-------~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~ 374 (510) .+ .+|. ..++..+.-..++...+......|.+++++.+-..+..+|.||+|+..+-............-+. T Consensus 385 ~~~~~~g~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~ 464 (706) T protein:vir:10 385 TVTDKTGNVVAPANVAGYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQMPSNVARETVNSLLNRSDMASFIYLDNMA 464 (706) T ss_pred cccCCCCcccccccccccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 1111 23333333345667778888888999988777666666789999999998888888888888888 Q ss_pred HHHHHHHHHHHHHHhh----------ccC--C-c--------c-----------cc----ceeeEEeCCCCCCCHHHHHH Q lcl|NC_013644. 375 ALLEWMNKLVIDDINR----------RYT--K-A--------F-----------DP----TEVSFTFTREVMVNETDIVN 418 (510) Q Consensus 375 ~~l~~~~~~i~~~~~~----------~~~--~-~--------~-----------~~----~~v~i~f~~~~p~d~~e~~~ 418 (510) .+.+++.++++.++.. .+. . . . |. .+|.|.=.+..+.-..+.++ T Consensus 465 ~~~~~~g~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~r~~~~~ 544 (706) T protein:vir:10 465 KSLKRAGEIWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSARRDATVN 544 (706) T ss_pred HHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCcchHHHHHHH Confidence 8888887777777642 111 0 0 0 00 12333333444554556666 Q ss_pred HHHHHHhcCC-CchHH------HHHhCCCCCcHHHHHHHHHH--------------HHHHHHHHHHHHHhhhccCCCCCC Q lcl|NC_013644. 419 DEKTEAETRK-IILES------ILQVAPRLDDDNVLRLICEQ--------------FDLDWEDVKEALEEAEYTKGLSDN 477 (510) Q Consensus 419 ~~~~~~~~g~-iS~et------~~~~~~~v~d~e~~~~~~e~--------------~e~~~~~~~~~~~~~~~~~~~~~~ 477 (510) .++.+...+. ....+ +++.+++---++..+++... +++..+. ++.+............ T Consensus 545 ~m~el~~~~~p~~~~~~~l~~~~~~~~d~p~~~e~~e~irk~~~~q~~~~~~~~~eq~~~~q~-qq~q~~q~~~~~~~~~ 623 (706) T protein:vir:10 545 ALTQLLQGMLPQDPMRPALMGIIIDNMEGEGLDDFKAFNRRQLLTQGIVKPRNQQEQAIVQQA-QQAQATQPDPNMLLAQ 623 (706) T ss_pred HHHHHHHhcCCcchhhHHHHHHHHhhcCccchHHHHHHHHHhhcccCCccccchhHHHHHHHH-HHHHHHHHHHHHHHHH Confidence 6666665442 21122 23333221111111111100 0000000 0000000000000000 Q ss_pred CCCcccCCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 478 TDEEETAVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) ....+ . +++ ....+. ...|+=.+. T Consensus 624 aq~~~--~----qA~--~~k~~a-~~~q~~~~a 647 (706) T protein:vir:10 624 AQMVV--A----QAE--AQKSQN-ETVQTQIKA 647 (706) T ss_pred HHHHH--H----HHH--HHHHHH-HHHHHHHHH Confidence 00000 0 000 000000 000000000 No 103 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.35 E-value=3.5e-12 Score=83.39 Aligned_cols=444 Identities=11% Similarity=-0.002 Sum_probs=197.1 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHH----HHHHHHhccCCcchhcccce-----ec--cccccccccccccce Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREA----ETGIRYYNHENDIMNNRIFY-----VD--DEGILREDKYASNVR 69 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~----~~~~~YY~g~~~i~~~~~~~-----~~--~~~~~~~~~~~~~~k 69 (510) |+--..-+..........+..-...... .+. +-+..|-.+-...-....+. .. ..+..........+ T Consensus 66 ~~~~~~~~~~~~~~~~~a~~~a~~~~~~-~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~~~f~gyql~alY- 143 (862) T protein:vir:99 66 VEISDSVNAKSVSGKNFAMDSAVRSAIK-AITGFAMDDGGGAPVPIGAEGKQSSYAVPEALQDWYLSQGFIGHQACALI- 143 (862) T ss_pred ccccccccchhhhhhhhcchhhcchhhh-hhhhhhhhcchhhhhhccccccccccccchhccccccccCcccHHHHHHH- Confidence 1111000110000000000000000000 000 00111111110000000000 00 00000000000001 Q ss_pred eccchhHHHHHHHHhhhhcCCceeccCc------HHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC-Cc- Q lcl|NC_013644. 70 IPHGFFPEIVDQKTQYLLSNPVEYETEN------EELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAE-DR- 140 (510) Q Consensus 70 i~~n~~~~Iv~~~~~~l~g~p~~~~~~d------~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~-g~- 140 (510) -.+.+++.||+..+.-++-+++.+.+.+ ++..+.|...++ -++...+.++.+.+-.+|.+++++-.+.+ +. T Consensus 144 ~~~~larkiVd~pAeDatR~g~~I~~~~d~~e~~~e~~~~ie~~~~rL~v~~~l~eair~~RLyGga~ililv~~~D~~~ 223 (862) T protein:vir:99 144 AQHWLVDKACSLAGEDAIRNGWHLKSLGEGEEIDEESLEKFKAIDVEFKVKENLIEFNRFKNVFGIRVAIFVVDSEDPDY 223 (862) T ss_pred HhCchhhhhhhhhhHHHhhCCceEeecCcccccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEEecCcCchh Confidence 1257899999999999999999998642 233445555553 35778888999999999988776654322 21 Q ss_pred --------------e-EEEEEcccceEEEE-----cCCCCceeE-EEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcC Q lcl|NC_013644. 141 --------------L-CFQVADSLNVFGVY-----NEYNELQRI-CRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAED 199 (510) Q Consensus 141 --------------~-~i~~~~p~~~~~~~-----d~~~~~~~~-~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~ 199 (510) + .|.+++|.++.|.- ++...+... -..|.+ .+ .. +.+.++.+|... T Consensus 224 LsqPLn~e~I~kG~lkgl~vlDp~w~~p~~v~~~~~Dp~sp~yGkP~~y~I----~g-------~~-IH~SRliif~g~- 290 (862) T protein:vir:99 224 YEKPFNPDGITPGSYRGISQIDPYWMMPMLTAESTADPSSQFFYEPEFWII----SG-------QK-YHRSHLIIARGP- 290 (862) T ss_pred hhcCcCcccccccceeEEEEechhhhcccccccccccccccccCCceeeee----cC-------ee-eccceeEEecCC- Confidence 1 24455555554421 010000000 000000 00 01 112222222110 Q ss_pred CceeecccccccccccccccccccccccccccCCcccEEEe-cCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccce Q lcl|NC_013644. 200 NKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRL-SNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAI 278 (510) Q Consensus 200 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~-~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~ 278 (510) .+|-+.- .++-.|.|.++.+.+.+..++.+....+..+..+.... T Consensus 291 ----------------------------------~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~l~v 336 (862) T protein:vir:99 291 ----------------------------------QPADILKPTYIFGGIPLVQRIYERVYAAERTANEAPLLAMNKRTTA 336 (862) T ss_pred ----------------------------------CchhhhhccCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccce Confidence 0111000 12335889999999999999998888888787777777 Q ss_pred eEEecCCC-CchhhhhHh------hh-cCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc---c Q lcl|NC_013644. 279 YVVSGFQG-DDLSKLRQN------VK-SKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG---D 347 (510) Q Consensus 279 lv~~g~~~-~~~~~~~~~------~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~---~ 347 (510) +.+.++.. .+....... .+ ..+++.++.+.+++.++ .+...+...++...+.|...+++|-+-..+ . T Consensus 337 ~ktd~l~~l~~ed~l~~r~~~~~~~rdN~Gi~liD~eEe~e~ls--~slSGL~dll~~~~q~IAaas~IP~tiLfGqspa 414 (862) T protein:vir:99 337 IHTDTAKAIANEDKFIQRLMFWVRYRDNHAVKVLGTDETMEQFD--TSLADFDAVIMGQYQLVASIAKTPATKLLGTAPK 414 (862) T ss_pred eechhHhhhccHHHHHHHHHHHHhccCcceeEEecCCCceeEEe--cccCChHHHHHHHHHHHHhhhCCCceeecccCcc Confidence 66655421 111222111 11 23466677665555544 666678889999999999999999864322 2 Q ss_pred C-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHH------- Q lcl|NC_013644. 348 G-NITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVND------- 419 (510) Q Consensus 348 g-~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~------- 419 (510) | |+||..=...|...+... .+..++..|++++.++...++ . ..+++|.|++-...+++|+++. T Consensus 415 GlnATGE~D~~nYyD~I~s~--QE~~L~P~LerL~~li~~~lg----~---~~d~~ieFnpL~~~sekEkAEi~kk~Aea 485 (862) T protein:vir:99 415 GFNSTGEFETISYHEELESI--QEHVYMPFLQRHYLISRLSLG----I---QHEIDVVMEPVASMTAQQQADLNKTKAEG 485 (862) T ss_pred cccCchHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHhcC----C---CCcceEEeCCCCCCCHHHHHHHHHHHHHH Confidence 3 467764333343333322 245678888887765543221 1 2468999999999999988866 Q ss_pred HHHHHhcCCCchHHHHHhC--------CCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCC----- Q lcl|NC_013644. 420 EKTEAETRKIILESILQVA--------PRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVN----- 486 (510) Q Consensus 420 ~~~~~~~g~iS~et~~~~~--------~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 486 (510) +.++.++|+||.+.++.+| +.+++++.+...-...+.. ...+.+-........++...+.+ T Consensus 486 ~~~lv~sGvispdEvR~~L~~~~~~g~~~l~ded~E~d~~~~~e~~------~~~e~~g~a~~~ap~de~~aga~~~~~e 559 (862) T protein:vir:99 486 GKVLIDGGVISPDEERNRIRDDKRSGYNRLTKEDAEETPGASPENL------AAYQKAGAAQETASAKETQAGAAVTTAE 559 (862) T ss_pred HHHHHhcCCCCHHHHHHHHHhcCCcCCCCCCcccccccCCCCcccc------cccccCCcccccccccccccccCCcccc Confidence 5678889999998887753 2334332211000000000 00000000000000000011000 Q ss_pred CCCcccccccCc-ccccccc-------cCC-CC Q lcl|NC_013644. 487 PDDPTQQMAEGA-TGSTESQ-------LPE-NG 510 (510) Q Consensus 487 ~~~~~~~~~~~~-~~~~~~~-------~~~-~~ 510 (510) .+.+..+.+... -|....+ .|. .+ T Consensus 560 ~d~~~~p~~~~~~~g~~~~~t~~~~a~~p~~~~ 592 (862) T protein:vir:99 560 GDQPNVQMVPSMKPGQMVGPEVGITAPMPEDDA 592 (862) T ss_pred CCcccccccCCCCCCCccccccccccCCCcccc Confidence 000000000000 0000000 111 11 No 104 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=99.35 E-value=8.5e-12 Score=81.28 Aligned_cols=469 Identities=10% Similarity=0.019 Sum_probs=196.7 Q ss_pred cCCChhhh---HHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHH Q lcl|NC_013644. 5 LSEDVKII---ANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQ 81 (510) Q Consensus 5 ~~~~~~~~---~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~ 81 (510) +.++..+. ..++...++.. ..-+....+..+||.|.+=- .-.. ...+...+| ++|..+.+|+. T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~--~~~r~~a~~d~~fy~G~Qw~-~~~~-------~~l~~q~rp----~~N~i~~~i~~ 66 (725) T protein:vir:92 1 MADNENRLESILSRFDADWTAS--DEARREAKNDLFFSRISQWD-DWLS-------QYTTLQYRG----QFDVVRPVVRK 66 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhh--HHHHHHHHHHHHhhcCCCCC-HHHH-------HHHHhcCCC----cccchHHHHHH Confidence 22222222 22333333333 23445677889999998710 0000 011122333 46888999999 Q ss_pred HHhhhhcCCcee--cc---CcHHHHHHHHH----Hhc-cCHHHHHHHHHHHHHhcCeEEEEEEEC---C---CCceEEEE Q lcl|NC_013644. 82 KTQYLLSNPVEY--ET---ENEELKEYLAE----YYN-SEFQVVLQELVEGSSQKGFEYVYARTN---A---EDRLCFQV 145 (510) Q Consensus 82 ~~~~l~g~p~~~--~~---~d~~~~~~l~~----~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d---~---~g~~~i~~ 145 (510) .+++---+.+.+ .+ ++.+..+.|+. +.+ ++.......+..+++++|.||+-|+.| . ++.++|.. T Consensus 67 v~g~e~~nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~ 146 (725) T protein:vir:92 67 LVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPTSNNQVIRR 146 (725) T ss_pred HHhhHHhCCcceEEecCCccHHHHHHHHHHHHHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEE Confidence 888876555443 33 33444444443 333 677888889999999999999877543 2 23344433 Q ss_pred E---cc-cceEEEEcCC-CCcee-----EEEEEEEEE---------------------------eeCCceeEEEEEEEEc Q lcl|NC_013644. 146 A---DS-LNVFGVYNEY-NELQR-----ICRHYITEI---------------------------EKDGETVDIHHAEVWT 188 (510) Q Consensus 146 ~---~p-~~~~~~~d~~-~~~~~-----~~~~~~~~~---------------------------~~~~~~~~~~~~e~y~ 188 (510) . +| .++| ||.. .++.. +++...... .+-.....++.+++|. T Consensus 147 ~~i~~~~~~V~--~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~e~~~ 224 (725) T protein:vir:92 147 EPIHSACSHVI--WDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYE 224 (725) T ss_pred eeccCChhhcc--cCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCCCeEEEEEEEE Confidence 2 12 2232 2211 10000 000000000 0000112233445444 Q ss_pred CCcEE--EEEEcC--Cceeeccccccccc----------------------cccc-ccccccccccccccCCcccEEEec Q lcl|NC_013644. 189 DQNVY--FFVAED--NKDYELDEAEPINP----------------------RPHV-LAVDSENESLLQRSYGQIPFYRLS 241 (510) Q Consensus 189 ~~~i~--~~~~~~--~~~~~~~~~~~~~~----------------------~~~~-~~~~~~~~~~~~~~~g~iPvv~~~ 241 (510) ...+. .|...+ ++.........+.+ .-+. ..+.....+..+.+.+.+|+|+|. T Consensus 225 r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~~P~vP~~ 304 (725) T protein:vir:92 225 VVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVF 304 (725) T ss_pred EEEEeeeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCCCCceeeEEEE Confidence 33221 121111 11111100000000 0000 111111122334444567777653 Q ss_pred CC-------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEE-ecCCCCchhhhhHhhhcCeeee---cc-CCC- Q lcl|NC_013644. 242 NN-------KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVV-SGFQGDDLSKLRQNVKSKKVVG---TG-SDG- 308 (510) Q Consensus 242 nn-------~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~-~g~~~~~~~~~~~~~~~~~~~~---~~-~~~- 308 (510) -. +.+.|.+.++++.++.+|...|.+...+...+....++ .+. .+..............+. +. .+| T Consensus 305 g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~-i~~~~~~~~~~~~~~~~~~~~~~~~~g~ 383 (725) T protein:vir:92 305 GEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQ-IAGFEHMYDGNDDYPYYLLNRTDENNGE 383 (725) T ss_pred eeeeccCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhh-hhHHHHHHhccCccceeecccccccccc Confidence 21 23448899999999999999999988886555433332 221 111111111111111111 11 111 Q ss_pred ----ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccC-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 309 ----GLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDG-NITNIVIKARYTLLNMKANKTEARLRALLEWMNKL 383 (510) Q Consensus 309 ----~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g-~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~ 383 (510) .+++...+.-..++...+......|-.++++-+-..+..+ +.||+|+..+-............-+..+.+++.++ T Consensus 384 ~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~ 463 (725) T protein:vir:92 384 MPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEI 463 (725) T ss_pred ccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1233333344567777889999999999987655444444 46999999988877777777777777787777777 Q ss_pred HHHHHhhccCC--------ccc---------------------------cceeeEEeCCCCCCCHHHHHHHHHHHHhcCC Q lcl|NC_013644. 384 VIDDINRRYTK--------AFD---------------------------PTEVSFTFTREVMVNETDIVNDEKTEAETRK 428 (510) Q Consensus 384 i~~~~~~~~~~--------~~~---------------------------~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~ 428 (510) ++.++...... ... ..+|.|.=.+..+.-..+.+..++.+...-. T Consensus 464 lL~lI~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~ql~~~~~ 543 (725) T protein:vir:92 464 YQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTP 543 (725) T ss_pred HHHHHHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHHHHHHHHHHhcc Confidence 77765321100 000 0122222222222212222222222222110 Q ss_pred -C-ch--HHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCccccccc Q lcl|NC_013644. 429 -I-IL--ESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTES 504 (510) Q Consensus 429 -i-S~--et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (510) + +. -++...++..+-+-.. +..+. ..+..+.........++ ..+.....++....+ T Consensus 544 ~~~~~~~~~l~~~~~~~d~~~~~----e~~er-------irkq~~~~~~~~~~~~e---------~~q~~~~~qqa~~~q 603 (725) T protein:vir:92 544 QGTPEYQLLLLQYFTLLDGKGVE----MMRDY-------ANKQLIQMGVKKPETPE---------EQQWLVEAQQAKQGQ 603 (725) T ss_pred cchhHHHHHHHHHhhcccchHHH----HHHHH-------HHhhhchhccCCccchh---------hhHHHHHHHHHHHhh Confidence 0 10 0111112111111000 00000 00000000000000000 000000001111111 Q ss_pred ccCC---------CC Q lcl|NC_013644. 505 QLPE---------NG 510 (510) Q Consensus 505 ~~~~---------~~ 510 (510) +-++ ++ T Consensus 604 ~~~e~~~~qa~~~~~ 618 (725) T protein:vir:92 604 QDPAMVQAQGVLLQG 618 (725) T ss_pred hHHHHHHHHHHHHHH Confidence 1110 01 No 105 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.35 E-value=7.7e-11 Score=76.03 Aligned_cols=455 Identities=9% Similarity=-0.009 Sum_probs=203.8 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhcc--CCc-chhccc-ceeccc----cccccccccc-cceec Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNH--END-IMNNRI-FYVDDE----GILREDKYAS-NVRIP 71 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g--~~~-i~~~~~-~~~~~~----~~~~~~~~~~-~~ki~ 71 (510) |+..-.--.-.-...-..- - ...... |.| .++ .+.... .....+ ........++ +.-.. T Consensus 2 ~~~~~r~~~~~a~~~~~~~---~--------~~~~~~-y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rN 69 (553) T protein:vir:63 2 TKVTVRKLSEVTSGRPEQS---A--------SLGGGG-LEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADN 69 (553) T ss_pred cchhhhhhcccccccchhh---h--------hhhccc-ccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhc Confidence 2221100000000000000 0 000001 111 110 000000 000000 0000000000 00012 Q ss_pred cchhHHHHHHHHhhhhcCCceeccC-------------cHHHHHHH----HHHhcc-----------CHHHHHHHHHHHH Q lcl|NC_013644. 72 HGFFPEIVDQKTQYLLSNPVEYETE-------------NEELKEYL----AEYYNS-----------EFQVVLQELVEGS 123 (510) Q Consensus 72 ~n~~~~Iv~~~~~~l~g~p~~~~~~-------------d~~~~~~l----~~~~~n-----------~~~~~~~e~~~~~ 123 (510) ++|++-+|+..++.++|.+++..+. ++..++.+ +.|.++ +|......+++.. T Consensus 70 n~~a~~av~~~~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~ 149 (553) T protein:vir:63 70 DGFTNGAVGYQRDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGY 149 (553) T ss_pred ChHHHHHHHHHHHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHH Confidence 4699999999999999999886432 23333334 333221 3445555667888 Q ss_pred HhcCeEEEEEEECCC-C---ceEEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcC Q lcl|NC_013644. 124 SQKGFEYVYARTNAE-D---RLCFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAED 199 (510) Q Consensus 124 ~~~G~~~~~v~~d~~-g---~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~ 199 (510) ...|.+++...+... | .+++..++|+.+-.-++......-.-++ +.... ....-+|+--..|+..+...... T Consensus 150 ~~dGE~~~~~~~~~~~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GV---E~d~~-Gr~vaY~i~~~hPgd~~~~~~~~ 225 (553) T protein:vir:63 150 VKTGEVLATAEWDRAANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGV---QYDKR-GRPQGYWIQVAHPGDLYQMAPDM 225 (553) T ss_pred HhCCceEEEeeeccCCCCcccceEEEechhhcCCCCCCCCCCeeEeee---EECCC-CceEEEEeeccCCCccccccccc Confidence 999999887655443 2 3688999998875444322111111111 11111 22222222111222222111111 Q ss_pred CceeecccccccccccccccccccccccccccCCccc---EEEecC-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 200 NKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIP---FYRLSN-----NKQETTDLKPIKALIDDYDLMNCFLSNNL 271 (510) Q Consensus 200 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~n-----n~~g~sd~~~v~~liD~~n~~~S~~~~~~ 271 (510) ..+.... .+..|| |+|+.. -..|.|+|..++..+..++.....-.... T Consensus 226 ~~~~r~~------------------------~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a 281 (553) T protein:vir:63 226 YKWKFVQ------------------------QSKPWGRRQVIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNA 281 (553) T ss_pred cceeeec------------------------cccccChhHheecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHH Confidence 0010000 011222 344332 34689999999888877776544443333 Q ss_pred HHhccceeEEe-cCCCCch-----------------------------hhhhHhhhcCeeeeccCCCceeEEeecCCHHH Q lcl|NC_013644. 272 QDFAEAIYVVS-GFQGDDL-----------------------------SKLRQNVKSKKVVGTGSDGGLDVKTVTIPTEG 321 (510) Q Consensus 272 ~~~~~~~lv~~-g~~~~~~-----------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (510) ...+.-..+++ +.+.+.. +.....+..+.+..+..|.++++.+.+.+... T Consensus 282 ~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~ 361 (553) T protein:vir:63 282 VINASYAAAIESELPPEFIHSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGTPGGV 361 (553) T ss_pred HHhhhheeeeecCCChhhhhhhcccccccccccccccccccccccccccccceeecCceeeecCCCCeeeecCCCCCCCC Confidence 33333223333 2111000 00112345667777888899999998888889 Q ss_pred HHHHHHHHHHHHHHHhCCccccc-cccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhccCCcc--- Q lcl|NC_013644. 322 RKTKMEIDKENIYKFGMAFDSTQ-VGDGNITNIVIKARYTLLNMKANKTEARLRALLEW-MNKLVIDDINRRYTKAF--- 396 (510) Q Consensus 322 ~~~~~~~l~~~i~~~s~~p~~~~-~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~-~~~~i~~~~~~~~~~~~--- 396 (510) +..+...+...|....++|-... ...+++|-.+.+..+......+...+..|...+-+ +++..+...-..+.-+. T Consensus 362 ~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~ 441 (553) T protein:vir:63 362 GSEFEASLNRHLASAFGMSYEEFTRDFSKANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPG 441 (553) T ss_pred HHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCc Confidence 99999999999988877764322 33445555566666666666666555555444333 44443332212211110 Q ss_pred ---c--------cceeeEEeCCC--CCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 397 ---D--------PTEVSFTFTRE--VMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKE 463 (510) Q Consensus 397 ---~--------~~~v~i~f~~~--~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~ 463 (510) . ...+.+.|..+ ...|....+++...++.+|+.|.+.++...+ .|.++..+++.++.+...+. .. T Consensus 442 ~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~t~~~~~a~~G-~D~~~v~~q~a~e~~~~~~~-Gl 519 (553) T protein:vir:63 442 QTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVMRIDAGLSTYEREIARLG-GDFRKSFAQRAREDALLKKY-GL 519 (553) T ss_pred ccchhhcchhhhhhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhC-CCHHHHHHHHHHHHHHHHHc-CC Confidence 0 01234556433 3468999999999999999999999999885 44444444443332221110 00 Q ss_pred HHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccc Q lcl|NC_013644. 464 ALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGS 501 (510) Q Consensus 464 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (510) .....+.. ....+.+.+.+..+++... +.+.+|+ T Consensus 520 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~---~~~~~~e 553 (553) T protein:vir:63 520 TFNLSAKR-SLGDGRDAATGIAEDPAAA---QTSQQGE 553 (553) T ss_pred CCCCCCcc-ccCCCcccCCCCCCCCCCC---CcccccC Confidence 00011100 0011111111111111000 0001111 No 106 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=99.34 E-value=8.2e-11 Score=75.88 Aligned_cols=456 Identities=9% Similarity=0.006 Sum_probs=216.6 Q ss_pred hhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcC- Q lcl|NC_013644. 11 IIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSN- 89 (510) Q Consensus 11 ~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~- 89 (510) ...+.|.+..+..+..+. .....+++||+---+ .+.+... .....-....+.+.|+..+-+..-+++.++.|++- T Consensus 1 ~~~~~l~~r~~~l~~~R~-~~e~~w~e~~~~~lP--~~~~~~~-~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~l 76 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRK-NVEQIWDCIRKYIMP--MRSDFFS-DLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSL 76 (547) T ss_pred CCHHHHHHHHHHHHHHhh-HHHHHHHHHHHHhcc--ccccccc-CCCCCcccccccccccccchHHHHHHHHHHHHHHhh Confidence 566667776666654332 233444455433211 1111100 00000000112235677778888888888777642 Q ss_pred -Cce-----eccCc------HHHHHHHH-------HHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECC--CCceEEEEEc Q lcl|NC_013644. 90 -PVE-----YETEN------EELKEYLA-------EYY-NSEFQVVLQELVEGSSQKGFEYVYARTNA--EDRLCFQVAD 147 (510) Q Consensus 90 -p~~-----~~~~d------~~~~~~l~-------~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~d~--~g~~~i~~~~ 147 (510) |+. +...+ ..+...|. ..+ ..||.....++.++..++|.|.+++-.|+ .+.+++..++ T Consensus 77 tPp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d~~~~~~~r~~~~p 156 (547) T protein:vir:10 77 TSPATKWFELAFRDKELNSDDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEEDEDEEGSVVFQSSP 156 (547) T ss_pred cCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccCCCCCCceeEEEee Confidence 322 23222 22333332 333 36788889999999999999987776554 3678899999 Q ss_pred ccceEEEEcCCCCceeEEEEEEEEEee-------------------CCceeEEEEEEEEcCCcEEEEEEcCCceeeccc- Q lcl|NC_013644. 148 SLNVFGVYNEYNELQRICRHYITEIEK-------------------DGETVDIHHAEVWTDQNVYFFVAEDNKDYELDE- 207 (510) Q Consensus 148 p~~~~~~~d~~~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~- 207 (510) ..+++..-|..+++..++|.+...... .........+++++.- |..........+. T Consensus 157 l~~~~v~~d~~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v----~~~~~~~~~~~~~~ 232 (547) T protein:vir:10 157 IQDSYFEEDSRGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCV----FTRYDKKQNRNAGT 232 (547) T ss_pred cceEEEeeCCCcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEE----eeccCCCCCccccc Confidence 999999888888887777654432110 0000000112221110 0000000000000 Q ss_pred ccc--cccc-cccccccccccccccccCCcccEEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcccee Q lcl|NC_013644. 208 AEP--INPR-PHVLAVDSENESLLQRSYGQIPFYRLS-----NNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIY 279 (510) Q Consensus 208 ~~~--~~~~-~~~~~~~~~~~~~~~~~~g~iPvv~~~-----nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~l 279 (510) ... ..+. ......++...-....+|..+|++.++ ++.+|+|-.+...+-+..+|.+.-......+...+|.+ T Consensus 233 ~~~~~~~p~~s~~~e~~~~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~ 312 (547) T protein:vir:10 233 VLAPTERPFGKKWILKEGAVQLGEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVIDPAI 312 (547) T ss_pred eeeccccceeEEEEEecCceeeeecCCcccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 000 0000 000111111111223455667777654 34679999999999999999999999999999999988 Q ss_pred EEecCCCCchhhhhHhhhcCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHH Q lcl|NC_013644. 280 VVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARY 359 (510) Q Consensus 280 v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~ 359 (510) .+.-.+.... .+...++++..++..+++.+....+.......++.++..|-..-....+........|++.+..+ T Consensus 313 ~v~~~g~~~~----~~~~pgg~~~~~~~~~v~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~TAtEV~~r- 387 (547) T protein:vir:10 313 MVTERGLISD----IDLGASGLTVVRDMESMKPFESRARFDVSSIQLTDLRSAVRRIYYVDQLQMKDSPAMTATEVQVR- 387 (547) T ss_pred eccccccccc----ceecCCeeeecCCcccceeeecccchHHHHHHHHHHHHHHHHHhhhhhhhcCCCccccHHHHHHH- Confidence 6532111111 22345566666666778878777777777888888887765532111111122344566665553 Q ss_pred HHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHhhccCCcc--------ccceeeEEeCCCCCCCHHH-------- Q lcl|NC_013644. 360 TLLNMKANKTEARLRALLEW--------MNKLVIDDINRRYTKAF--------DPTEVSFTFTREVMVNETD-------- 415 (510) Q Consensus 360 ~~l~~k~~~k~~~~~~~l~~--------~~~~i~~~~~~~~~~~~--------~~~~v~i~f~~~~p~d~~e-------- 415 (510) ..++...++..+.+ ++.-++.++...+.-+- ....++|++..++-+.... T Consensus 388 ------~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~~~~~~v~~is~Laraq~~~~~~~i~~ 461 (547) T protein:vir:10 388 ------YELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIVYTGPLSRAQKIDQAASIER 461 (547) T ss_pred ------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhhccCcceEEEEeccHHHHHHHHHHHHHHHH Confidence 33444444443333 33333344443332111 2335677887666554211 Q ss_pred HHHHHHHHHhcCC-----CchHHHHHh----CCC----C-CcHHHHHHHHHHHHHHHHHHHHHHHhhhccC--CCCCCCC Q lcl|NC_013644. 416 IVNDEKTEAETRK-----IILESILQV----APR----L-DDDNVLRLICEQFDLDWEDVKEALEEAEYTK--GLSDNTD 479 (510) Q Consensus 416 ~~~~~~~~~~~g~-----iS~et~~~~----~~~----v-~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~--~~~~~~~ 479 (510) .++.+..+.+.+. +....++.. ++. + +++|.+++.+++++.++...+.++.+..... ..+.+.. T Consensus 462 ~~~~v~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~qaa~~~~~g~~m~~~~~~~a 541 (547) T protein:vir:10 462 WAGSTAQLAEINPEVLDIPDWDEMVRMLGSLLGAPQTLMRPKAKVTSIRKNRSQTQQKAEQAAIAEAEGNAMEAQGKGQA 541 (547) T ss_pred HHHHHHHhhccChhhhhcCCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccc Confidence 1111211111111 223333332 221 1 2344444444443333322222222111110 1111111 Q ss_pred CcccCC Q lcl|NC_013644. 480 EEETAV 485 (510) Q Consensus 480 ~~~~~~ 485 (510) .-++.. T Consensus 542 ~~~~~~ 547 (547) T protein:vir:10 542 ALKENQ 547 (547) T ss_pred chhccC Confidence 100000 No 107 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.33 E-value=4.1e-11 Score=77.54 Aligned_cols=381 Identities=13% Similarity=0.104 Sum_probs=177.6 Q ss_pred ccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHH Q lcl|NC_013644. 4 LLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKT 83 (510) Q Consensus 4 ~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~ 83 (510) |.+ .+-+...+.|-++--.....+......... .-+ -.+.+++.+|+..+ T Consensus 1 ~~~-------------------------~D~~~n~~~gg~~~~~~~~~~~~~~~~~l~----a~Y-~~~~l~~~~Vd~~a 50 (422) T protein:vir:10 1 MVK-------------------------TDSYANIFLGGSDGSEIYGSLQNQAPTILA----SLY-ADNALVRRIIDTIP 50 (422) T ss_pred Ccc-------------------------chhhHHHHcCCCCCccccCcccccCHHHHH----HHH-HhChhhHHHHhhhh Confidence 000 111112223322210000000000000000 001 13578999999999 Q ss_pred hhhhcCCceeccCcHHHHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCC----------Cce-EEEEEcccce Q lcl|NC_013644. 84 QYLLSNPVEYETENEELKEYLAEYY-NSEFQVVLQELVEGSSQKGFEYVYARTNAE----------DRL-CFQVADSLNV 151 (510) Q Consensus 84 ~~l~g~p~~~~~~d~~~~~~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~----------g~~-~i~~~~p~~~ 151 (510) .-++.+++.+++++++ +.+..-| +=++...+.++.+.+..+|.|++++-.... |.+ .+.++++.++ T Consensus 51 ed~~r~g~~i~~~~~~--~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i 128 (422) T protein:vir:10 51 ETALAAGFHIDGIDDE--PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQV 128 (422) T ss_pred HHHhcCCccccCCCHH--HHHHHHHHHhhHHHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccc Confidence 9999999999876543 1222222 335678899999999999999988776321 112 2444444444 Q ss_pred EEEEcCCCCcee---EEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccc Q lcl|NC_013644. 152 FGVYNEYNELQR---ICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLL 228 (510) Q Consensus 152 ~~~~d~~~~~~~---~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 228 (510) .|..-+..-..+ --..|.+.....+. ...+ .+.++.+|.. T Consensus 129 ~~~~~~~dp~s~~fg~P~~y~v~~~~~~~-----~~~i-H~SRli~~~g------------------------------- 171 (422) T protein:vir:10 129 KVQTREENPRNARFGEPLTYRITTNESDM-----FYDV-HYSRIHIIDG------------------------------- 171 (422) T ss_pred cchhcccCccccccCcceEEEEecCCCCc-----ceee-ccceeEEeCC------------------------------- Confidence 332100000000 00011110000000 0001 1122222210 Q ss_pred cccCCcccE-EEecCCCCCCCcHHH-HHHHHHHHHHHHHHHHHHHHHhccceeEEecCC-----CCchhhhhHh------ Q lcl|NC_013644. 229 QRSYGQIPF-YRLSNNKQETTDLKP-IKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQ-----GDDLSKLRQN------ 295 (510) Q Consensus 229 ~~~~g~iPv-v~~~nn~~g~sd~~~-v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~-----~~~~~~~~~~------ 295 (510) ..+|- ....++-.|.|.+.. +.+-+..++.+....+..+..+....+.+.|.. +......... T Consensus 172 ----~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~ 247 (422) T protein:vir:10 172 ----ERIPNVMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLKRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDN 247 (422) T ss_pred ----CCchhhhcccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHhcCCccchHHHHHHHHHHHH Confidence 00111 112244467887875 567788888887777777777777766665521 1111111110 Q ss_pred hh-cCeeeec-cCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc---cC-cccHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 296 VK-SKKVVGT-GSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG---DG-NITNIVIKARYTLLNMKANKT 369 (510) Q Consensus 296 ~~-~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~---~g-~~Sg~Ai~~~~~~l~~k~~~k 369 (510) .+ ....+.+ +++.+.+.+ +.+...+...++...+.|...+++|-+-..+ .| |+||..-...|...+.. .. T Consensus 248 ~~~~~~~~~l~~~~e~~e~~--~~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~Glnatgd~d~~~yyd~i~~--~Q 323 (422) T protein:vir:10 248 NSGVGQAIGIDAESEEYSVL--NSDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGVSSSQNTALETFHKLVDR--KR 323 (422) T ss_pred hcCCccceeEecCCcceEEE--ecccCChHHHHHHHHHHHHhhhCCCeeeeccCCcccccccchHHHHHHHHHHHH--HH Confidence 11 1222333 334445544 4566678889999999999999999764322 12 34565443333333331 22 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHH-------HHHHhcCCCchHHHHHhCCCCC Q lcl|NC_013644. 370 EARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDE-------KTEAETRKIILESILQVAPRLD 442 (510) Q Consensus 370 ~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~-------~~~~~~g~iS~et~~~~~~~v~ 442 (510) +..++..|++++++|+. ..+++|+|++-...++.|+|+.. +++.++|++|.+.+.+.|-. T Consensus 324 e~~l~p~l~~l~~~i~~-----------s~~~~~~f~pL~~~sekekaei~~~~a~a~~~~~~~g~i~~~e~r~~L~~-- 390 (422) T protein:vir:10 324 NAELLPILEFLIPFIVN-----------AEEWSVEFNPLAQESSKDKAEILEKNVNSIAALIAAGAMDIDEARDTLRT-- 390 (422) T ss_pred HHHHHHHHHHHHHHhcc-----------cCCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHhhh-- Confidence 45678888888887652 13688999999988999888763 34444455444444333210 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCccccccc Q lcl|NC_013644. 443 DDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAE 496 (510) Q Consensus 443 d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (510) ............+...++.... ++...++.++ T Consensus 391 ---------------------~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~d 422 (422) T protein:vir:10 391 ---------------------IAPEVKINDGSVETEVTISETS-NDPLEVPTDD 422 (422) T ss_pred ---------------------hcccccCCCCCCccccchhhcC-CCCCCCCCCC Confidence 0000010111111111000000 0000111111 No 108 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=99.32 E-value=2e-12 Score=84.77 Aligned_cols=492 Identities=10% Similarity=0.008 Sum_probs=199.5 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHH--HHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETG--IRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~--~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) |-..+.+.-...-..+...++... .-+.....- .+||.|.|= ..-...... ......++| .+++|..+.+ T Consensus 1 ma~~~~~~~~~~~~r~~~~~~~~~--~~r~~~~~d~~f~~y~G~Qw-~~~~~~~l~---~~~q~~~rP--~~~~N~i~~~ 72 (708) T protein:vir:17 1 MAETLEKKHERIMLRFDRAYSPQQ--EVREKCIEATRFARVPGGQW-EGATAAGTK---LDEQFEKYP--KFEINKVATE 72 (708) T ss_pred CchhHHHHHHHHHHHHHHHHhhhH--HHHHHHHHHHHhhccCCCCC-CHHHHHHHH---hhhhhcCCC--ceEEcchHHH Confidence 322222111111111222222211 112222222 368988761 000000000 000011233 4778999999 Q ss_pred HHHHHhhhhcCCcee--ccC----cHHHHHHHH----HHhc-cCHHHHHHHHHHHHHhcCeEEEEEEEC---CC------ Q lcl|NC_013644. 79 VDQKTQYLLSNPVEY--ETE----NEELKEYLA----EYYN-SEFQVVLQELVEGSSQKGFEYVYARTN---AE------ 138 (510) Q Consensus 79 v~~~~~~l~g~p~~~--~~~----d~~~~~~l~----~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d---~~------ 138 (510) |+..+++---+.+.+ .+. +.+..+.|+ .+.+ ++.......+..+++++|.||+-+..| +. T Consensus 73 i~~v~g~e~~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~~d~~~e~d~~~~~ 152 (708) T protein:vir:17 73 LNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDR 152 (708) T ss_pred HHHHHhhHhhCCcceEEecCCCcchHHHHHHHHHHHHHHHHhcCchhHHhHHHHHhhhcccceeeeeecccccCCCCCCc Confidence 999999976665554 332 233344444 3333 677888899999999999999877432 21 Q ss_pred CceEEEEE--cccceEEEEcCC-CCcee-----EEE-----------EEEE-------------EEeeCCceeEEEEEEE Q lcl|NC_013644. 139 DRLCFQVA--DSLNVFGVYNEY-NELQR-----ICR-----------HYIT-------------EIEKDGETVDIHHAEV 186 (510) Q Consensus 139 g~~~i~~~--~p~~~~~~~d~~-~~~~~-----~~~-----------~~~~-------------~~~~~~~~~~~~~~e~ 186 (510) ..+.|..+ ++..+| ||.. .++.. +++ .|.. ...+......++.+++ T Consensus 153 ~~i~i~~~~~~~~~v~--~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~~~~~~~d~vrv~e~ 230 (708) T protein:vir:17 153 QRIAIEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWEYDWFDADVIYIAKY 230 (708) T ss_pred cccceEeeccchhhee--cCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhhhhhccccccccCCCeEEEEEE Confidence 23333332 334554 4421 11110 000 0000 0000000112333343 Q ss_pred EcCC----cEEEEEEc-CCceeecccccc-------------------ccc--cc-ccccccccccccccccCCcccEEE Q lcl|NC_013644. 187 WTDQ----NVYFFVAE-DNKDYELDEAEP-------------------INP--RP-HVLAVDSENESLLQRSYGQIPFYR 239 (510) Q Consensus 187 y~~~----~i~~~~~~-~~~~~~~~~~~~-------------------~~~--~~-~~~~~~~~~~~~~~~~~g~iPvv~ 239 (510) |... .++.+... +|.......... ... .- .............+.+++.+|+|+ T Consensus 231 ~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l~~~~~~p~~~fP~vP 310 (708) T protein:vir:17 231 YEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIP 310 (708) T ss_pred EEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccccCCCCCCCCccceEE Confidence 3211 11111111 111111000000 000 00 001222223344566777888887 Q ss_pred ecCC---CCC----CCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEec-CCCCchhhhhHhhh-----------cCe Q lcl|NC_013644. 240 LSNN---KQE----TTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSG-FQGDDLSKLRQNVK-----------SKK 300 (510) Q Consensus 240 ~~nn---~~g----~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g-~~~~~~~~~~~~~~-----------~~~ 300 (510) |.-. ..| .|.+.++++.++.+|...|.+...+-.......++.. .-..-......... ... T Consensus 311 ~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~~~~~~~~~~~~~ 390 (708) T protein:vir:17 311 VYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDK 390 (708) T ss_pred EecccccccCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccchhhhhhhhccCCc Confidence 7532 122 4777899999999999999999888766554443221 10000000000000 011 Q ss_pred eeeccCCCc-eeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 301 VVGTGSDGG-LDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEW 379 (510) Q Consensus 301 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~ 379 (510) .-.+..++. ...+..+.-..++...+......|-.+|++-+...+..+|.||+|+..+-............-+..+.++ T Consensus 391 ~g~v~~~a~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G~~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~ 470 (708) T protein:vir:17 391 YGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKR 470 (708) T ss_pred ccccccccCCcccCCCccccHHHHHHHHHHHHHHHHhcCCChHHccCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111111 1222233344677888888889999998877766666678999999988888887777777777888888 Q ss_pred HHHHHHHHHhhcc----------C--C-c--------cc---------------cceeeEEeCCCCCCCHHHHHHHHHHH Q lcl|NC_013644. 380 MNKLVIDDINRRY----------T--K-A--------FD---------------PTEVSFTFTREVMVNETDIVNDEKTE 423 (510) Q Consensus 380 ~~~~i~~~~~~~~----------~--~-~--------~~---------------~~~v~i~f~~~~p~d~~e~~~~~~~~ 423 (510) ..++++.++.... . . . .+ ..+|.|.=.+..+.-..+..+.++.+ T Consensus 471 ~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t~r~~~~~~l~ql 550 (708) T protein:vir:17 471 AGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNV 550 (708) T ss_pred HHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEecccCchhHHHHHHHHHHHH Confidence 7777777653211 0 0 0 00 00122221222222222334444444 Q ss_pred HhcCCC-chHH------HHHhCCCCCcHHHHHHHHHH------------------HHH-HHHHHHHHHHhhhccCCCCC- Q lcl|NC_013644. 424 AETRKI-ILES------ILQVAPRLDDDNVLRLICEQ------------------FDL-DWEDVKEALEEAEYTKGLSD- 476 (510) Q Consensus 424 ~~~g~i-S~et------~~~~~~~v~d~e~~~~~~e~------------------~e~-~~~~~~~~~~~~~~~~~~~~- 476 (510) .....- ...+ +++.+++.--++..+++... .+. +.+..+.............. T Consensus 551 l~~~~~~~~~~~~~~~l~l~~~D~p~~~ei~e~ir~~~~~~~~~~~~~~e~~q~~~q~qq~~q~q~~~~~~eaqa~~~~~ 630 (708) T protein:vir:17 551 LSSMLPADPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAA 630 (708) T ss_pred HHhcCCccchhHHHHHHHHHhcCCCChHHHHHHHHHHhhccccccCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 433211 0011 22332221111111111100 000 00000000000000000000 Q ss_pred CCCCcccCCCCCCcccccccCcccccccc---cCCCC Q lcl|NC_013644. 477 NTDEEETAVNPDDPTQQMAEGATGSTESQ---LPENG 510 (510) Q Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~ 510 (510) ..+..+. +.+....+....++ .-... T Consensus 631 qAe~~ka--------~aea~~~q~~a~q~~~~~~~a~ 659 (708) T protein:vir:17 631 QAEAQKA--------TNETAQTQIKAFTAQQDAMESQ 659 (708) T ss_pred HHHHHHH--------HHHHHHHHHHHHHHHHHHHHHH Confidence 0000000 00000000000000 00000 No 109 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=99.31 E-value=1e-10 Score=75.30 Aligned_cols=457 Identities=11% Similarity=0.066 Sum_probs=179.0 Q ss_pred CCCccCC-----ChhhhHHHHHHHHHhhhh---hhhHHHHHHHHHHhccCCcchhcccc--eeccccccccccccccc-- Q lcl|NC_013644. 1 MEALLSE-----DVKIIANALKAAIDKDRK---SSSKREAETGIRYYNHENDIMNNRIF--YVDDEGILREDKYASNV-- 68 (510) Q Consensus 1 ~~~~~~~-----~~~~~~~~i~~~i~~~~~---~~~~~~~~~~~~YY~g~~~i~~~~~~--~~~~~~~~~~~~~~~~~-- 68 (510) |+.-+.= ......+.+.+++.+... ..+......+.++-.++.....++-. .....+...+...+|.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~~~~~~~~~~r~~~~~~~~l 80 (551) T protein:vir:80 1 MKNKLGLFESIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDL 80 (551) T ss_pred CchhhhhHHHhhhccCChhhcccccccccceeeecccccHHHHHHhhccCcceeecccccceecCcccccCccccChhHH Confidence 2111000 000011111222222100 11122344555666665433222111 11111111111222210 Q ss_pred -----eec-cchhHHHHHHHHhhhh-----------cCCceecc---------CcHHHHHHHHHHhc--c--------CH Q lcl|NC_013644. 69 -----RIP-HGFFPEIVDQKTQYLL-----------SNPVEYET---------ENEELKEYLAEYYN--S--------EF 112 (510) Q Consensus 69 -----ki~-~n~~~~Iv~~~~~~l~-----------g~p~~~~~---------~d~~~~~~l~~~~~--n--------~~ 112 (510) .+. .+....+|+..+.-+. |-+..+.. .+....+.+.+++. | .+ T Consensus 81 ~~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~~~s~ 160 (551) T protein:vir:80 81 HGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSF 160 (551) T ss_pred HHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCCccchH Confidence 011 2344455555544332 12222211 11222233444432 1 23 Q ss_pred HHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCce-eEEEEEEEEEeeCCceeEEEEEEEEcCC Q lcl|NC_013644. 113 QVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNELQ-RICRHYITEIEKDGETVDIHHAEVWTDQ 190 (510) Q Consensus 113 ~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~-~~~~~~~~~~~~~~~~~~~~~~e~y~~~ 190 (510) ...+..+..+.+.+|.+|+.+.++.+|++ .+.+++|..+.++.+..+... ..+++ +....+.. ...|... T Consensus 161 ~~f~~~lv~dlll~Gnay~~i~rd~~G~~~~L~~l~p~~V~v~~~~~g~~~~~~~~y--~~~~~g~~------~~~~~~~ 232 (551) T protein:vir:80 161 SSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRF--VQVIDQKI------VATFNAR 232 (551) T ss_pred HHHHHHHHHHHHhcCCEEEEEEECCCCcEEEEEEeCCceeEEEECCccccccCceEE--EEEeCCcE------EEEEccc Confidence 34555677788999999998888999986 488899999988876544221 11111 11111110 0112333 Q ss_pred cEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 191 NVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNN 270 (510) Q Consensus 191 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~ 270 (510) .+.+++... .........|.|-++.+...|.....+..-..+. T Consensus 233 eiiH~~~n~-------------------------------------~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~ 275 (551) T protein:vir:80 233 EMAFAVRNP-------------------------------------RSDIYATGYGYPELEIALKQFIAHENTEAFNDRF 275 (551) T ss_pred ceEEecccC-------------------------------------CCCcccccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 333332110 0000001236666666666665555544444555 Q ss_pred HHHhccceeE--EecCC-CCc--hhhhhHhhh-------c-CeeeeccCCCceeEEeec--CCHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 271 LQDFAEAIYV--VSGFQ-GDD--LSKLRQNVK-------S-KKVVGTGSDGGLDVKTVT--IPTEGRKTKMEIDKENIYK 335 (510) Q Consensus 271 ~~~~~~~~lv--~~g~~-~~~--~~~~~~~~~-------~-~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~ 335 (510) +...+.|-.+ +.+.. .++ ...++..+. . +++..+. +++++|.... .....+.+..+...+.|.. T Consensus 276 f~Ng~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~~~vl~-~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~ 354 (551) T protein:vir:80 276 FSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVS-AEDVKFVNMTPSARDMEFEKWLNYLINVISA 354 (551) T ss_pred HHcCCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcccccc-CCCceEEEccCChhHHHHHHHHHHHHHHHHH Confidence 5555656544 34422 221 122222221 1 1222232 2335554444 3445566677888888888 Q ss_pred HhCCccccccccCc-----ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCC Q lcl|NC_013644. 336 FGMAFDSTQVGDGN-----ITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVM 410 (510) Q Consensus 336 ~s~~p~~~~~~~g~-----~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p 410 (510) .-++|+.-.+..+. ..+..+-. +... ......+...|.-+++.|...+...--..+. ..+.+.|..... T Consensus 355 aFgVPp~~lG~~~~~~~~~~~~~s~t~--sn~e---~~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~-~~~~f~f~~~~~ 428 (551) T protein:vir:80 355 LYGIDPAEINIPNNGGATGSKGGSLNE--GNSA---EKNQASKNKGLQPLLGFIEDFINKHIVAEFG-DKYTFQFVGGDI 428 (551) T ss_pred HhcCCHHHcCcccccccccccccccch--hhHH---HHHHHHHHHHHHHHHHHHHHHHHhhhccccC-CceEEEeeccCh Confidence 88888753321111 00111100 0000 1112334444444444444444332111222 346788888878 Q ss_pred CCHHHHHHHHHHHHhcCCCchHHHHHhCCC---CCc-HHHH------HHHHHHHHHHHHHHHHHHHhh--h-ccCCCCCC Q lcl|NC_013644. 411 VNETDIVNDEKTEAETRKIILESILQVAPR---LDD-DNVL------RLICEQFDLDWEDVKEALEEA--E-YTKGLSDN 477 (510) Q Consensus 411 ~d~~e~~~~~~~~~~~g~iS~et~~~~~~~---v~d-~e~~------~~~~e~~e~~~~~~~~~~~~~--~-~~~~~~~~ 477 (510) .+.++.+.. .++..+|+|+.-.++++++. +.. +... ...+.......+......... . ..+...+. T Consensus 429 ~~~~~~~~~-~~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (551) T protein:vir:80 429 KSELESVKI-LAEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVST 507 (551) T ss_pred hhHHHHHHH-HHHHhcCCcCHHHHHHHhCCCCCCCCCceeecccccccccccccccCcchhhhhhccccccCcCCCCCCC Confidence 888777764 45677889999888887643 211 1000 000000000000000000000 0 00000000 Q ss_pred CCCcccCC---CCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 478 TDEEETAV---NPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 478 ~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) .+.+++.. .++.+.+.+.....+++.-.-+.+| T Consensus 508 ~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 543 (551) T protein:vir:80 508 DVEDIPDGKDTTGDIGKDGQRKDKDNANAGKQGMKG 543 (551) T ss_pred CCCCCCCccccCCCccccccccCccccchhhhhcCC Confidence 00011111 0111111112222222222234444 No 110 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.30 E-value=6.7e-11 Score=76.37 Aligned_cols=434 Identities=7% Similarity=0.030 Sum_probs=196.5 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhh---hHHHHHHHHHHhc---cC-Ccchhcccceeccccccccccccccceeccc Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSS---SKREAETGIRYYN---HE-NDIMNNRIFYVDDEGILREDKYASNVRIPHG 73 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~---~~~~~~~~~~YY~---g~-~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n 73 (510) |..-... -....++.-|.... .+..-..+..-.. |- +.-..+....+...........-.-++ .+. T Consensus 23 ~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~-~~~ 95 (532) T protein:vir:94 23 VDAKRAT------HTSLGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSFVEATSWPGFPTLALLA-QLP 95 (532) T ss_pred hhhhhhh------hhhhhhhhhhhhcccccccccccccccccccccccCcccccccccccccccccchHHHHHHHH-cCc Confidence 2221111 11111222221111 0000000000000 00 000000000000000000000000011 256 Q ss_pred hhHHHHHHHHhhhhcCCceeccCcH-----HHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCc------- Q lcl|NC_013644. 74 FFPEIVDQKTQYLLSNPVEYETENE-----ELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDR------- 140 (510) Q Consensus 74 ~~~~Iv~~~~~~l~g~p~~~~~~d~-----~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~------- 140 (510) +++.+|+..+.-++-++++++++++ .....|...++ =++...+.++.+.+..+|.|++++-++.+|. T Consensus 96 l~r~~Vd~~aed~~r~~~~i~~~~~~~~~~~~~~~i~~~~~~l~v~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p 175 (532) T protein:vir:94 96 EYRTMHETPADECVRAWGKITCSSKDELAADKATRITQKLEQYNVRTLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAP 175 (532) T ss_pred hhhhhhccchHHHhhCCceEeeCCccccchHHHHHHHHHHHhhhHHHHHHHHHHhhhcccceEEEEEeccCCcccccccc Confidence 7899999999999999999977432 22233443332 2567888899999999999988876654331 Q ss_pred -------------eEEEEEcccceEEEEcCCCCcee-EEE---EEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCcee Q lcl|NC_013644. 141 -------------LCFQVADSLNVFGVYNEYNELQR-ICR---HYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDY 203 (510) Q Consensus 141 -------------~~i~~~~p~~~~~~~d~~~~~~~-~~~---~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~ 203 (510) ..+.+++|.++.|-.-+..++.. -++ +|.+ . . ..-+.+.++.+|... T Consensus 176 ~~l~~~~I~~g~~~~l~vld~~~v~p~~~~~~dp~sp~fg~P~~y~v--~-~--------g~~iH~SRli~f~g~----- 239 (532) T protein:vir:94 176 LLLSPSFVQRGCLIGFATIEPMWLSPNAYNATDPTLPSFYKPDSWIA--T-S--------GKKIHSSRIHTVVGR----- 239 (532) T ss_pred ccccccccccceeeEEEeechheecccccccccccccccCCceeEEE--c-c--------CeeeccceEEEecCC----- Confidence 12445566555543211111100 000 0100 0 0 001223333433211 Q ss_pred ecccccccccccccccccccccccccccCCcccEEEec-CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEe Q lcl|NC_013644. 204 ELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLS-NNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVS 282 (510) Q Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~ 282 (510) .+|-+..+ ++-.|+|.++.+.+-+..++.+.-..+..+..+....+.. T Consensus 240 ------------------------------~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~- 288 (532) T protein:vir:94 240 ------------------------------PVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFSMTNLAT- 288 (532) T ss_pred ------------------------------CchhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeee- Confidence 11211111 2224889999898889999888888777776666665543 Q ss_pred cCC----CCchhhhhHh------hh-cCeeeeccC-CCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc---c Q lcl|NC_013644. 283 GFQ----GDDLSKLRQN------VK-SKKVVGTGS-DGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG---D 347 (510) Q Consensus 283 g~~----~~~~~~~~~~------~~-~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~---~ 347 (510) ++. ......+... .+ ..+++.++. +.+++.+ ..+...+...++...+.|...+++|-+-.-+ . T Consensus 289 ~~a~~ls~~~~~~~~~r~~~~~~~~~n~g~~~id~~~e~~e~~--~~~lsgl~~~l~~~~~~iAaa~~IP~t~LfG~sp~ 366 (532) T protein:vir:94 289 DMAQLLAPGGAQSLDARLQLFNLYRDNRNIGALDKGTEEIQQT--NTPLSGLDSLQAQSQEQMAAVSHIPLVKLLGITPN 366 (532) T ss_pred chHHhhcchhHHHHHHHHHHHHhhcCCccceEEcCCCceeEEE--ecccCCHHHHHHHHHHHHHhHhCCCeeeeecCCcc Confidence 321 1111121111 11 223455554 3444544 4566678889999999999999999864322 1 Q ss_pred C-cccHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHH------ Q lcl|NC_013644. 348 G-NITNIVIKARYTLLNMKANKTE-ARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVND------ 419 (510) Q Consensus 348 g-~~Sg~Ai~~~~~~l~~k~~~k~-~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~------ 419 (510) | |+||..=...|.. .+..++ ..+...|++++++++... .+.. ..++++.|++-...+.+|+++. T Consensus 367 GlnstGe~D~~~yyd---~I~s~Qe~~l~p~le~l~~~l~~s~--~g~~---~~d~~~~f~pL~~~s~kEkAei~~~~a~ 438 (532) T protein:vir:94 367 GLNASSDGEIRVWYD---FIAGYQATNLTPLMEWIIDLIQLSE--YGQI---DPGLAWEWSPLMELDDKELAEVRQLNAS 438 (532) T ss_pred cccccchHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHh--cCCC---CCCceEEeCCCCCCCHHHHHHHHHHHHH Confidence 2 3556643333433 333333 567888888888776432 1111 2358899999888888887664 Q ss_pred -HHHHHhcCCCchHHHHHhCCCCCc-------HHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcc Q lcl|NC_013644. 420 -EKTEAETRKIILESILQVAPRLDD-------DNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPT 491 (510) Q Consensus 420 -~~~~~~~g~iS~et~~~~~~~v~d-------~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (510) ..++.++|++|.+.+.+.+..-.. .+... ..+.+. .......... .+.. .+..+...++...++++ T Consensus 439 a~~~~~~~Gvi~~~Evr~~l~~~~~~~~~~~~~~~~~-~~~~~~---~~~~~~~~~~-~~~~-~~~~~~~~~~~~~~d~~ 512 (532) T protein:vir:94 439 TDSTLMELGVIDAKMVQQRLAADPTSGYAGALGERDE-LDDVEE---IAKQLMAAAL-NPPA-TAPQTPNPQPDSEDDQT 512 (532) T ss_pred HHHHHHhcCCCCHHHHHHHHhcCCccccccccccccc-cccccc---hhhhhccccc-CCCC-CCCCCCCCCCCCCCCCC Confidence 577888999999888876632110 00000 000000 0000000000 0011 00011111111112222 Q ss_pred cccccCcccccccccCCCC Q lcl|NC_013644. 492 QQMAEGATGSTESQLPENG 510 (510) Q Consensus 492 ~~~~~~~~~~~~~~~~~~~ 510 (510) +.++ .....|+-| T Consensus 513 ~~~~------~~~~~~~~~ 525 (532) T protein:vir:94 513 DNQP------DAQADPAQN 525 (532) T ss_pred CCcc------CCCcccccc Confidence 2111 222233333 No 111 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.29 E-value=1.7e-10 Score=74.13 Aligned_cols=442 Identities=8% Similarity=-0.048 Sum_probs=209.3 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhccc----ceecccc----ccccccccc-cceec Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRI----FYVDDEG----ILREDKYAS-NVRIP 71 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~----~~~~~~~----~~~~~~~~~-~~ki~ 71 (510) |..+-..-.....- .. . ......||.|-..--.+.. .....+. .......++ +.-.. T Consensus 1 ~~~p~~~~~~~~~~-~~----------~---~~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rN 66 (533) T protein:vir:34 1 MKTPTIPTLLGPDG-MT----------S---LREYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRN 66 (533) T ss_pred CCCchhhhhhcccc-cc----------h---HHHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhc Confidence 66663332222111 00 0 0112345544211000000 0000000 000000000 00112 Q ss_pred cchhHHHHHHHHhhhhcCCceeccC------------cHHHHHHHHHHh----cc-----------CHHHHHHHHHHHHH Q lcl|NC_013644. 72 HGFFPEIVDQKTQYLLSNPVEYETE------------NEELKEYLAEYY----NS-----------EFQVVLQELVEGSS 124 (510) Q Consensus 72 ~n~~~~Iv~~~~~~l~g~p~~~~~~------------d~~~~~~l~~~~----~n-----------~~~~~~~e~~~~~~ 124 (510) ++|++-.|+..+++++|.+++..+. +++..+.|+..| ++ +|......+++... T Consensus 67 n~~a~~av~~~~~nvVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~ 146 (533) T protein:vir:34 67 NGYAANAIQLHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHA 146 (533) T ss_pred ChHHHHHHHHHHHHhhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHH Confidence 4799999999999999999887542 233444444433 21 34455566778889 Q ss_pred hcCeEEEEEEECCCC----ceEEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCC Q lcl|NC_013644. 125 QKGFEYVYARTNAED----RLCFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDN 200 (510) Q Consensus 125 ~~G~~~~~v~~d~~g----~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~ 200 (510) +.|.+++...+.+.+ .+++..++|+.+---++......-.-++ +.... ....-+| ++.. ... T Consensus 147 ~dGE~f~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GI---e~d~~-Gr~~aY~--i~~~--------~~~ 212 (533) T protein:vir:34 147 FNGELFVQATWDTSSSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGV---QINDS-GAALGYY--VSED--------GYP 212 (533) T ss_pred hCCceEEEeeeccCCCCccceEEEEechhhcCCCCCCCCCCceEeee---EECCC-CCeEEEE--Eeec--------CCC Confidence 999999887665543 3688999998874333211111111111 11111 1111111 1110 000 Q ss_pred ceeecccccccccccccccccccccccccccCCccc---EEEecC-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 201 KDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIP---FYRLSN-----NKQETTDLKPIKALIDDYDLMNCFLSNNLQ 272 (510) Q Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~n-----n~~g~sd~~~v~~liD~~n~~~S~~~~~~~ 272 (510) +... . .+.. .-.+..+| |+|+.. -..|.|+|..++..+..++........... T Consensus 213 ~~~~-~---~~~~---------------~~~~~~v~a~~VlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~ 273 (533) T protein:vir:34 213 GWMP-Q---KWTW---------------IPRELPGGRASFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAI 273 (533) T ss_pred Cccc-c---ccce---------------eeeeeccChhHeeeeccccCCCcccCCchHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000 0 0000 00011222 555543 346899999998887776665444333333 Q ss_pred HhccceeEEecC-C------------CCch-hhh--------------hHhhhcCeeeeccCCCceeEEeecCCHHHHHH Q lcl|NC_013644. 273 DFAEAIYVVSGF-Q------------GDDL-SKL--------------RQNVKSKKVVGTGSDGGLDVKTVTIPTEGRKT 324 (510) Q Consensus 273 ~~~~~~lv~~g~-~------------~~~~-~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 324 (510) ..+.-..+++.. + ..+. ..+ ...+..+.+..+..|.++++++.+.+...+.. T Consensus 274 i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~ 353 (533) T protein:vir:34 274 VKAMYAATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYSV 353 (533) T ss_pred HhhhheeeeecCCCcccccccccCCCcccccccccccchhhhhccCcceeeccCceeeecCCCCeeeecCCCCCCCCHHH Confidence 322222233311 1 0000 000 01255667777888999999998888889999 Q ss_pred HHHHHHHHHHHHhCCccccc-cccCcccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHh------hccCCcc Q lcl|NC_013644. 325 KMEIDKENIYKFGMAFDSTQ-VGDGNITNIVIKARYTLLNMKANKTEARLRALL-EWMNKLVIDDIN------RRYTKAF 396 (510) Q Consensus 325 ~~~~l~~~i~~~s~~p~~~~-~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l-~~~~~~i~~~~~------~~~~~~~ 396 (510) +...+...|....++|-... ...++.|-.+.+..+......+...+..|...+ +-+++..+...- .+.+... T Consensus 354 f~~~~lr~iAaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~ 433 (533) T protein:vir:34 354 FEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARF 433 (533) T ss_pred HHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCC Confidence 99999999988877774332 234555555666666666666666555554433 223333332111 1221111 Q ss_pred ccc-----eeeEEe--CCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_013644. 397 DPT-----EVSFTF--TREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAE 469 (510) Q Consensus 397 ~~~-----~v~i~f--~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~ 469 (510) +.. ...+.| ..-...|....++....++.+|+.|.+.++...+ .|.++..+++.++.+...+ ... + T Consensus 434 ~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G-~D~~ev~~q~a~e~~~~~~-----~gl-~ 506 (533) T protein:vir:34 434 SFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRG-DDYQEIFAQQVRETMERRA-----AGL-K 506 (533) T ss_pred CchhhHHhhhceeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcC-CCHHHHHHHHHHHHHHHHh-----cCC-C Confidence 111 134555 4445679999999999999999999999999886 3444444333333222111 000 1 Q ss_pred ccCCCCCCCCCcccCCCCCCcccccccCcccc Q lcl|NC_013644. 470 YTKGLSDNTDEEETAVNPDDPTQQMAEGATGS 501 (510) Q Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (510) .+... .....++...+ .++.+.++.++ T Consensus 507 ~~~~~---~~~~~s~~~~~--~~~~~~~~~~~ 533 (533) T protein:vir:34 507 PPAWA---AAAFESGLRQS--TEEEKSDSRAA 533 (533) T ss_pred CCCCC---CcCccCCCCCC--CCCCcccCCCC Confidence 01110 01111111111 11112222222 No 112 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.29 E-value=1.7e-10 Score=74.10 Aligned_cols=440 Identities=8% Similarity=-0.059 Sum_probs=212.7 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCc---chhccc-cee------ccccccccccccc-cce Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHEND---IMNNRI-FYV------DDEGILREDKYAS-NVR 69 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~---i~~~~~-~~~------~~~~~~~~~~~~~-~~k 69 (510) |+.+-..-+.--. . ......||.|-.. ...... ... ....... ..++ +.- T Consensus 1 ~~~~~~~~~~~~~--------------~---~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~l--r~RaRdl~ 61 (530) T protein:vir:38 1 MKIPSLVGPDGKT--------------S---LREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRG--NARADDLV 61 (530) T ss_pred CccceeecCcccc--------------c---hHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHH--HHHHHHHH Confidence 7766444433100 0 1112345543211 000000 000 0000000 0000 001 Q ss_pred eccchhHHHHHHHHhhhhcCCceeccC------------cHHHHHHHHHHhc----c-----------CHHHHHHHHHHH Q lcl|NC_013644. 70 IPHGFFPEIVDQKTQYLLSNPVEYETE------------NEELKEYLAEYYN----S-----------EFQVVLQELVEG 122 (510) Q Consensus 70 i~~n~~~~Iv~~~~~~l~g~p~~~~~~------------d~~~~~~l~~~~~----n-----------~~~~~~~e~~~~ 122 (510) ..++|++-+|+..+..++|.+++..+. +++..+.|+..|. + +|.....-+++. T Consensus 62 rNn~~a~~av~~~~~nvVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~ 141 (530) T protein:vir:38 62 RNNGYAANAVQLHQDHIVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAM 141 (530) T ss_pred hcChHHHHHHHHHHHHhhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHH Confidence 124699999999999999999876541 3344445554442 1 344555567788 Q ss_pred HHhcCeEEEEEEECCCC----ceEEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEc Q lcl|NC_013644. 123 SSQKGFEYVYARTNAED----RLCFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAE 198 (510) Q Consensus 123 ~~~~G~~~~~v~~d~~g----~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~ 198 (510) ..+.|.+++...+++.+ .+++..++|+.+---++......-.-++ +.... ....-+| ++.. . T Consensus 142 ~~~dGE~~~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GI---e~d~~-Gr~~aY~--i~~~--------~ 207 (530) T protein:vir:38 142 HAFNGELCVQATWDSDSTRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGV---KINDS-GAALGYY--VSDD--------G 207 (530) T ss_pred HhhCCceEEEeeeccCCCCccceEEEEechhhcCCCCCCCCCCeeEeee---EECCC-CceEEEE--Eeec--------c Confidence 89999998887665443 3689999998864222211111111111 11111 1111111 1110 0 Q ss_pred CCceeecccccccccccccccccccccccccccCCcccEEEecCC-----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 199 DNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNN-----KQETTDLKPIKALIDDYDLMNCFLSNNLQD 273 (510) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn-----~~g~sd~~~v~~liD~~n~~~S~~~~~~~~ 273 (510) -.+... . .+.. .......+.--|+|+... ..|.|+|..++..+..++............ T Consensus 208 ~~~~~~-~---~~~~------------~~~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i 271 (530) T protein:vir:38 208 YPGWMA-Q---NWTY------------IPRELPGGRPSFIHVFEPMEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIV 271 (530) T ss_pred CCCccc-c---ccce------------eeeeeccChhHeEeeccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHH Confidence 000000 0 0000 000011122236666543 458999999988877776654443333333 Q ss_pred hccceeEEecC-------------CCCchhh---------------hhHhhhcCeeeeccCCCceeEEeecCCHHHHHHH Q lcl|NC_013644. 274 FAEAIYVVSGF-------------QGDDLSK---------------LRQNVKSKKVVGTGSDGGLDVKTVTIPTEGRKTK 325 (510) Q Consensus 274 ~~~~~lv~~g~-------------~~~~~~~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (510) .+.-..+++.. +..+... ....+..+.+..+..|.++++.+.+.+...+..+ T Consensus 272 ~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~f 351 (530) T protein:vir:38 272 KAMYAATIESELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLPGDSLNLQSAQDTDNGYSTF 351 (530) T ss_pred hhhheeeeeccCCccccccccccCCcccccccccccchhhhhcccccceeccCceeeecCCCCeeeeeCCCCCCCCHHHH Confidence 23222333321 1110000 0012456667778888999999998888899999 Q ss_pred HHHHHHHHHHHhCCccccc-cccCcccHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHh------hccCCccc Q lcl|NC_013644. 326 MEIDKENIYKFGMAFDSTQ-VGDGNITNIVIKARYTLLNMKANKTEARLRAL-LEWMNKLVIDDIN------RRYTKAFD 397 (510) Q Consensus 326 ~~~l~~~i~~~s~~p~~~~-~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~-l~~~~~~i~~~~~------~~~~~~~~ 397 (510) +..+...|....++|-... ...+++|-.+.+..+......+...+..|... ++.+++..+...- .+....++ T Consensus 352 ~~~~lr~iaaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~ 431 (530) T protein:vir:38 352 EQSLLRYIAAGLGVSYEQLSRNYSQMSYSTARASANESWAYFMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFS 431 (530) T ss_pred HHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCC Confidence 9999999998887775433 23455555666766666666666666655443 3333333333211 11211111 Q ss_pred cc-----eeeEEe--CCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|NC_013644. 398 PT-----EVSFTF--TREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEY 470 (510) Q Consensus 398 ~~-----~v~i~f--~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~ 470 (510) .. .+.+.| ..-...|....+++...++.+|+.|.+.++...+ .|.++..+++.++.+...+ ... T Consensus 432 ~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G-~D~~~v~~q~a~e~~~~~~-----~Gl--- 502 (530) T protein:vir:38 432 FQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRG-DDYQEIFAQQVRESMERRA-----AGL--- 502 (530) T ss_pred chhhHHhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcC-CCHHHHHHHHHHHHHHHHH-----cCC--- Confidence 11 123455 4455679999999999999999999999999886 3444444333333322111 100 Q ss_pred cCCCCCCCCCcccCCCCCCcccccccCcccc Q lcl|NC_013644. 471 TKGLSDNTDEEETAVNPDDPTQQMAEGATGS 501 (510) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (510) ... ......+......+.++..++.+++ T Consensus 503 ~~~---~~~~~~~~~~~~~~~~~~~d~~~~a 530 (530) T protein:vir:38 503 NPP---AWAAAAFEAGVKKSNEEEQDGARAA 530 (530) T ss_pred CCC---CCcccccCCCCCCCCCCCCCCCCCC Confidence 111 1111111111111111122223333 No 113 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.26 E-value=5.2e-11 Score=76.94 Aligned_cols=391 Identities=13% Similarity=0.126 Sum_probs=178.3 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccce--eccchhHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVR--IPHGFFPEI 78 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~k--i~~n~~~~I 78 (510) |--......+ .-...+-+....-+.-... +...... .......-.. -.+.+++.+ T Consensus 1 ~~~~m~~~~~-----------------~~~~~D~~~~~~~~~~g~~-~~~~~~~-----~~~~~~~l~~~Y~~~~l~~~~ 57 (435) T protein:vir:79 1 MGVFMSDKVK-----------------AITKEDGYNEIFGSKDGTF-RPNAFYM-----QRAAFKALSQFYEEDGMARRI 57 (435) T ss_pred CCcccccccc-----------------cchhhcchhhhhccccccc-ccCcccC-----CcCCHHHHHHHHhcCchhhhh Confidence 2221111100 0001111111111111000 0000000 0000000001 135789999 Q ss_pred HHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC----------Cce-EEEEE Q lcl|NC_013644. 79 VDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAE----------DRL-CFQVA 146 (510) Q Consensus 79 v~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~----------g~~-~i~~~ 146 (510) |+..+.-++.+++.+++++++ +.+...|+ =+....+.++.+.+..+|.|++++-.... |.+ .|.++ T Consensus 58 Vd~~aed~~r~g~~i~g~~~~--~~~~~~~~~l~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i~~i~v~ 135 (435) T protein:vir:79 58 VDVIPEEMVTPGFKVDGVKNE--KSFKSRWDELRLNAKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPGAQLEDIRVY 135 (435) T ss_pred hccchHHhhcCCceecCCChH--HHHHHHHHHhhHHHHHHHHHHhhhccccEEEEEEecCCCCcccccccCCceeeEEee Confidence 999999999999999765432 33444443 25678899999999999999888765321 111 23444 Q ss_pred cccceEEEEcCCCCceeE----EEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccc Q lcl|NC_013644. 147 DSLNVFGVYNEYNELQRI----CRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDS 222 (510) Q Consensus 147 ~p~~~~~~~d~~~~~~~~----~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 222 (510) +|.++.|-.-+. ++..- ...|.+.. .++.. ...+ .+.++.+|... T Consensus 136 d~~~i~~~~~~~-dp~sp~fg~P~~y~v~~-~~~~~----~~~i-H~SRli~~~g~------------------------ 184 (435) T protein:vir:79 136 DRYQITIHERET-NARSVRYGEPKLYKISP-GGDIP----EFFV-HYSRICIIDGE------------------------ 184 (435) T ss_pred chhhccchhhcc-CCcccccCcceEEEEec-CCCCC----ceEE-cceeEEEecCC------------------------ Confidence 444433211000 00000 00111100 00000 0001 11122222100 Q ss_pred cccccccccCCcccEEE-ecCCCCCCCcH-HHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCC-----CCchhhhhHh Q lcl|NC_013644. 223 ENESLLQRSYGQIPFYR-LSNNKQETTDL-KPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQ-----GDDLSKLRQN 295 (510) Q Consensus 223 ~~~~~~~~~~g~iPvv~-~~nn~~g~sd~-~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~-----~~~~~~~~~~ 295 (510) .+|-.. ..++-.|.|.+ +.+.+-+..++.+....+..+..+....+.++|+. .......... T Consensus 185 -----------~~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r 253 (435) T protein:vir:79 185 -----------RVSNEKRRQNDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQQAVWKARDLALMCDDEEGRYAARLR 253 (435) T ss_pred -----------cchhhhccccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccchhHHHhhcCccchHHHHHH Confidence 011110 12344567766 67778888888888888777777776666655531 1111111110 Q ss_pred ------hh-cCeeeec-cCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc---cC-cccHHHHHHHHHHHH Q lcl|NC_013644. 296 ------VK-SKKVVGT-GSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG---DG-NITNIVIKARYTLLN 363 (510) Q Consensus 296 ------~~-~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~---~g-~~Sg~Ai~~~~~~l~ 363 (510) .+ .+..+.+ +++.+.+.++ .+...+...++...+.|...+++|-+-..+ .| |+||..-...|...+ T Consensus 254 ~~~~~~~~~~~~~~~i~~~~e~~e~~~--~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnstgd~d~~~yyd~i 331 (435) T protein:vir:79 254 LAQVDDESGVGKAIGIDATDEEYEVLN--SDVSGVPEFLQEKIDRIVALTGIHEIIIKNKNTGGVSASQNTALETFYKLI 331 (435) T ss_pred HHHHHHhcCCCCceeEecCCcceEEEe--cccCCHHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHH Confidence 11 1233333 3344455544 566778899999999999999999854322 12 456765444444443 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHH-------HHHhcCCCchHHHHH Q lcl|NC_013644. 364 MKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEK-------TEAETRKIILESILQ 436 (510) Q Consensus 364 ~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~-------~~~~~g~iS~et~~~ 436 (510) .. ..+..++..|++++++++.- .+++++|++-...+++|+|+... ++.++|+++.+.+.+ T Consensus 332 ~~--~Qe~~l~p~l~~l~~li~~s-----------~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g~i~~~e~r~ 398 (435) T protein:vir:79 332 DR--KRVEDYKPILEFLLPFMISE-----------TEWSIEFEPLSVPSDKDKAEIMAKNVESVVKLKAEQAINLKETRD 398 (435) T ss_pred HH--HHHHHHHHHHHHHHHHhhcC-----------CCCeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHH Confidence 32 22466788888887776521 36889999999999988877643 344455555444333 Q ss_pred hCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccc Q lcl|NC_013644. 437 VAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTE 503 (510) Q Consensus 437 ~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (510) .+ + ..............+.+ +..++++ ++.. ..|.+. T Consensus 399 ~L------------~-----------~~~~~~~~~~~~~~~~~----~~~d~~~-~~~~--e~g~~~ 435 (435) T protein:vir:79 399 TL------------R-----------SICPDLKIMDNDNIELP----EPEDLDP-EPGQ--EGGLNK 435 (435) T ss_pred HH------------H-----------HhccccCCCCcccccCC----ccccCCC-CCCC--CCCCCC Confidence 22 0 00000001111000000 0000000 0000 000011 No 114 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.23 E-value=1.7e-10 Score=74.21 Aligned_cols=383 Identities=14% Similarity=0.121 Sum_probs=178.9 Q ss_pred HHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCCceeccCcHH Q lcl|NC_013644. 20 IDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNPVEYETENEE 99 (510) Q Consensus 20 i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~ 99 (510) +..++ .+-+.+..-|.++-..+.. .....+.... . .=-.+.+++.+|+..+.-++.+++.+++++++ T Consensus 1 ~~~~~-------~d~~~~~~~~~~~~~~~~~-~~~~~~~~l~----a-~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~~~~ 67 (427) T protein:vir:10 1 MKIVK-------HDGYNDIFNGGADGSPKPF-FMSDASYHVG----S-FYNDNATAKRIVDVIPEEMVTAGFKMSGVKDE 67 (427) T ss_pred CCccc-------cchHHHHhhcCCCCcccCc-cccCchHHHH----H-HHHcCchhhhhhccchHHhhcCCccccCccHH Confidence 11111 1111122222221111110 0000000000 0 00125789999999999999999999876433 Q ss_pred HHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCc-----------eEEEEEcccceEEEEcCCCCcee-E-- Q lcl|NC_013644. 100 LKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDR-----------LCFQVADSLNVFGVYNEYNELQR-I-- 164 (510) Q Consensus 100 ~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~-----------~~i~~~~p~~~~~~~d~~~~~~~-~-- 164 (510) +.+...|+ =++...+.++.+.+..+|.|++++-++.... ..+.++++.++.|-.-+. ++.. - T Consensus 68 --~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~~~~~-dp~s~~fg 144 (427) T protein:vir:10 68 --KEFKSLWDSYKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFAITVEKRVT-NARSPRYG 144 (427) T ss_pred --HHHHHHHHHhhHHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhccccccccc-CccccccC Confidence 33444443 3677889999999999999998876643221 123334443332211100 0000 0 Q ss_pred -EEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEE-ecC Q lcl|NC_013644. 165 -CRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYR-LSN 242 (510) Q Consensus 165 -~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~-~~n 242 (510) -..|.+. .++. .....+ .+.++.+|... .+|-.. ..+ T Consensus 145 ~P~~y~v~--~~~~---~~~~~i-H~SRli~~~g~-----------------------------------~~p~~~~~~~ 183 (427) T protein:vir:10 145 EPEIYKVS--PGDN---MQPYLI-HHSRVFIADGE-----------------------------------RVAQQARKQN 183 (427) T ss_pred cceEEEEe--cCCC---CcceEE-ccccEEEecCC-----------------------------------CchhhhcccC Confidence 0011110 0000 000011 11222222100 011110 123 Q ss_pred CCCCCCcHH-HHHHHHHHHHHHHHHHHHHHHHhccceeEEecCC-----CCchhhhhHh------hh-cCeeeecc-CCC Q lcl|NC_013644. 243 NKQETTDLK-PIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQ-----GDDLSKLRQN------VK-SKKVVGTG-SDG 308 (510) Q Consensus 243 n~~g~sd~~-~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~-----~~~~~~~~~~------~~-~~~~~~~~-~~~ 308 (510) +-.|.|.+. .+.+-+..++.+....+..+..+....+.++|+. +......... .+ ..+.+.+. ++. T Consensus 184 ~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~k~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e 263 (427) T protein:vir:10 184 QGWGASVLNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETE 263 (427) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHHhcCccchHHHHHHHHHHHHhcCcccceeeecCCC Confidence 445777775 4667677788887777777777777766665531 1111111111 11 12233333 344 Q ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc----cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 309 GLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG----DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLV 384 (510) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~----~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i 384 (510) +.+.+ +.+...+...++...+.|...+++|-+-..+ +-|+||..=...|...+.. ..+..++..|++++++| T Consensus 264 ~~e~~--~~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~sp~Glnstgd~D~~nyyd~i~~--~Qe~~l~p~l~~l~~~i 339 (427) T protein:vir:10 264 EYDVL--NSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDR--KREEDYRPLLEFLLPFI 339 (427) T ss_pred ceeEE--ecccCChHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHHH--HHHHHHHHHHHHHHHHh Confidence 44444 4666778888999999999999999764322 1245666533334433332 23356888888888776 Q ss_pred HHHHhhccCCccccceeeEEeCCCCCCCHHHHHHH-------HHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHH Q lcl|NC_013644. 385 IDDINRRYTKAFDPTEVSFTFTREVMVNETDIVND-------EKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLD 457 (510) Q Consensus 385 ~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~-------~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~ 457 (510) +. ..+++++|++-...++.|+++. +.++.++|+++.+.+.+.|- . T Consensus 340 ~~-----------s~~~~~~f~pL~~~s~kEkaei~~~~a~a~~~~~~~gvi~~~e~r~~L~------------~----- 391 (427) T protein:vir:10 340 VD-----------EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLR------------S----- 391 (427) T ss_pred hc-----------CCCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHH------------h----- Confidence 52 1368899999999999988765 44445555555544433221 0 Q ss_pred HHHHHHHHHhhhccC--CCCCCCCCcccCCCCCCcccccccC Q lcl|NC_013644. 458 WEDVKEALEEAEYTK--GLSDNTDEEETAVNPDDPTQQMAEG 497 (510) Q Consensus 458 ~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (510) ........+ .......++..+.+++.+.++.+++ T Consensus 392 ------~~~~~~~~~~~~~~~e~~~~~~e~~p~~~e~~~d~~ 427 (427) T protein:vir:10 392 ------IAPEFKLKDGNNINIREPEETTEPEPGLGEKLEDEN 427 (427) T ss_pred ------hhccccCCCCccccccccchhcCCCCCCCCCCCCCC Confidence 000111100 1111111111111111111111222 No 115 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=99.23 E-value=2.9e-10 Score=72.91 Aligned_cols=476 Identities=10% Similarity=0.047 Sum_probs=202.3 Q ss_pred cCCChhhhHHHHHHHHHhhhh--hhhHHHHHHHHHHhc--cCC---cchhcccceeccccccccccccccceeccchhHH Q lcl|NC_013644. 5 LSEDVKIIANALKAAIDKDRK--SSSKREAETGIRYYN--HEN---DIMNNRIFYVDDEGILREDKYASNVRIPHGFFPE 77 (510) Q Consensus 5 ~~~~~~~~~~~i~~~i~~~~~--~~~~~~~~~~~~YY~--g~~---~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ 77 (510) +.+........+...++.... ..-|+....-++||. |.+ .+..... ...+...+| .+.+|..+. T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~-------~~l~~~~~P--~~~~N~i~~ 71 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAGSE-------LGKHFEKYP--KFEINKIST 71 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHH-------HHHhhCCCC--eEEEccHHH Confidence 222212222222222222221 223344656677775 554 1110000 001112334 477899999 Q ss_pred HHHHHHhhhhcCCcee--ccC----cHHHHHHHH----HHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECC------C-- Q lcl|NC_013644. 78 IVDQKTQYLLSNPVEY--ETE----NEELKEYLA----EYYN-SEFQVVLQELVEGSSQKGFEYVYARTNA------E-- 138 (510) Q Consensus 78 Iv~~~~~~l~g~p~~~--~~~----d~~~~~~l~----~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~------~-- 138 (510) +|+..+++---+.+.+ .+. +.+..+.|+ .+.+ ++.......+..+++++|.||.-++.|- + T Consensus 72 ~v~~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~d~~~~ 151 (720) T protein:vir:35 72 ELNRIISEYRHNRITVKFRPGDKTASEALANKLNGLFRADYEETDGGEACDNAFDDGSTGGFGCFRLTTNLVNALDPMDE 151 (720) T ss_pred HHHHHHhHHHhCCCceEEEcCCCcchHHHHHHHHHHHHHHHHhcCchHHHhHHHHHhhhccceeEEeeecccccCCCCcc Confidence 9999999987666554 332 333344444 3333 6778888999999999999999887642 1 Q ss_pred -CceEEEEE--cccceEEEEcCC-CCce-----eEEEEEEEE-----------------------EeeCCceeEEEEEEE Q lcl|NC_013644. 139 -DRLCFQVA--DSLNVFGVYNEY-NELQ-----RICRHYITE-----------------------IEKDGETVDIHHAEV 186 (510) Q Consensus 139 -g~~~i~~~--~p~~~~~~~d~~-~~~~-----~~~~~~~~~-----------------------~~~~~~~~~~~~~e~ 186 (510) +.+++..+ ++.++| ||.. .++. .+++.+... ..+......++.+|+ T Consensus 152 ~~~i~i~~v~~~~~~v~--~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~d~~~~~~v~i~E~ 229 (720) T protein:vir:35 152 RQRICLEPIYDPARSVW--FDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMSGIERSWDYDWYDVDVVYIAKY 229 (720) T ss_pred cceeeEecccCchhhee--ecccccccChhhhhhhhhhcCCCHHHHHHhCCCccccccccccccccccccCCCceEEEEe Confidence 12333332 223333 2211 0100 011110000 000001122344444 Q ss_pred EcCCcE----EEEEEcCCceeeccccccc---------------ccc-------cc-cccccccccccccccCCcccEEE Q lcl|NC_013644. 187 WTDQNV----YFFVAEDNKDYELDEAEPI---------------NPR-------PH-VLAVDSENESLLQRSYGQIPFYR 239 (510) Q Consensus 187 y~~~~i----~~~~~~~~~~~~~~~~~~~---------------~~~-------~~-~~~~~~~~~~~~~~~~g~iPvv~ 239 (510) |.-..+ +.+....++.......... ... -+ ............+.+++.+|+|+ T Consensus 230 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~~~fP~vP 309 (720) T protein:vir:35 230 YEVKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPGEHIPLIP 309 (720) T ss_pred eEEEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCCCccceEE Confidence 433322 1111111111111000000 000 00 01111222234456677788888 Q ss_pred ecCC-------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhc----Cee---ee-- Q lcl|NC_013644. 240 LSNN-------KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKS----KKV---VG-- 303 (510) Q Consensus 240 ~~nn-------~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~----~~~---~~-- 303 (510) |... +...|.+.++++.++.+|...|.+.+.+.. .+...-.|.. ++...+...... +.. ++ T Consensus 310 ~~g~r~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~--~~~~~~~~a~-~~~~~~~~~~a~~~~~~~~~l~~~~~ 386 (720) T protein:vir:35 310 VYGKRWFIDDIERVEGHIAKAMDAQRLYNLQVSMLADSATQ--DTGSIPIVGK-SQIKTLEKYWANRNKNRPAFLPLNEI 386 (720) T ss_pred EEeeeeccCCCcccceeeecchhHHHHHHHHHHHHHHHHHc--CCccccccCc-chHHHHHHHhhccccccccccccccc Confidence 7532 123578888999999999999999998853 4444433321 111122111111 000 00 Q ss_pred ccCC-------CceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 304 TGSD-------GGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRAL 376 (510) Q Consensus 304 ~~~~-------~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~ 376 (510) .... +.+.+.....-...+...+..-...|-.+|++-+-..+..+|.||+|+..+-............-+..+ T Consensus 387 ~~~~G~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~ 466 (720) T protein:vir:35 387 VDKQGNIIAPPTPVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPMPSNIAKETVNHLMHRSDMSSFIYLDNMAKS 466 (720) T ss_pred cccCcccccCCCcccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCcccchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0111 123344444445666777788888888888777665666677899999998777777777777777778 Q ss_pred HHHHHHHHHHHHhhc----------cC--Cc-c-------------------cc----ceeeEEeCCCCCCCHHHHHHHH Q lcl|NC_013644. 377 LEWMNKLVIDDINRR----------YT--KA-F-------------------DP----TEVSFTFTREVMVNETDIVNDE 420 (510) Q Consensus 377 l~~~~~~i~~~~~~~----------~~--~~-~-------------------~~----~~v~i~f~~~~p~d~~e~~~~~ 420 (510) .+++.++++.++... +. .+ + +. .+|.|.=.+..+.-..+.++.+ T Consensus 467 ~~~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~~m 546 (720) T protein:vir:35 467 LKRAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDATVSVL 546 (720) T ss_pred HHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCcccHHHHHHHHH Confidence 888777777765321 10 00 0 00 1122222223333233334433 Q ss_pred HHHHhcCCCchH---------HHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcc Q lcl|NC_013644. 421 KTEAETRKIILE---------SILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPT 491 (510) Q Consensus 421 ~~~~~~g~iS~e---------t~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (510) +.+.. .++.+ .+++.+++---.+..+++ .+..+.........++++.....-... T Consensus 547 ~qll~--~~~p~~~~~~~~~~~ile~~d~p~~~e~~eri--------------rk~~~~~~~~~~~~~e~qq~~a~~qq~ 610 (720) T protein:vir:35 547 TNLLA--GMLPQDPMRQVLQGIILDNMEGEGLDEFKEYN--------------RKQLLTQGVVKPRNTEEEQMVAQMIQQ 610 (720) T ss_pred HHHHH--hcCCCchhHHHHHHHHHHhcCchhHHHHHHHH--------------HhhcchhcccCccChhHHHHHHHHHHH Confidence 33332 12221 122222221101111000 000100000000000000000000000 Q ss_pred ccccc---CcccccccccCCCC Q lcl|NC_013644. 492 QQMAE---GATGSTESQLPENG 510 (510) Q Consensus 492 ~~~~~---~~~~~~~~~~~~~~ 510 (510) .++.. ...+..-.|.-... T Consensus 611 ~qq~~~e~~~aqa~l~qaqae~ 632 (720) T protein:vir:35 611 AQQPNAELVAAQGVLMQGQAEV 632 (720) T ss_pred HHhHhHHHHHHHHHHHHHHHHH Confidence 00000 00000011111111 No 116 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.22 E-value=4.5e-10 Score=71.82 Aligned_cols=437 Identities=9% Similarity=-0.030 Sum_probs=199.8 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccce-ecccccc----ccccccc-cceeccch Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFY-VDDEGIL----REDKYAS-NVRIPHGF 74 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~-~~~~~~~----~~~~~~~-~~ki~~n~ 74 (510) |.-+-.--.-.-..... +. ...-|+|-..-....... ....... .....++ +.-.-++| T Consensus 1 m~~~~~~~~a~~~~~~~------------~~---~~~~y~aa~~~~~~~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~ 65 (495) T protein:vir:10 1 MNMTPSGYQSLASGLLV------------PV---GASAYEGASGGHRWQDIGDYGPDTAVASGIQTLRARSHHNVRNNPW 65 (495) T ss_pred CCcccccccccchhhhh------------HH---HhhhhhccccCcccCCCCCCChhHHHHHHHHHHHHHHHHHHhcChH Confidence 33221110000000000 00 011133321100000000 0000000 0000000 00112469 Q ss_pred hHHHHHHHHhhhhcCCceec--cCcHHHHHHHHHHhc-----------cCHHHHHHHHHHHHHhcCeEEEEEEECC--CC Q lcl|NC_013644. 75 FPEIVDQKTQYLLSNPVEYE--TENEELKEYLAEYYN-----------SEFQVVLQELVEGSSQKGFEYVYARTNA--ED 139 (510) Q Consensus 75 ~~~Iv~~~~~~l~g~p~~~~--~~d~~~~~~l~~~~~-----------n~~~~~~~e~~~~~~~~G~~~~~v~~d~--~g 139 (510) ++-.|+..+++++|.+++.. +++++..+.|+..|. .+|......+++.....|.+++...+.. +| T Consensus 66 a~~av~~~~~~vVG~Gi~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g 145 (495) T protein:vir:10 66 ATNAVATWVAAAVGNGLTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEG 145 (495) T ss_pred HHHHHHHHHHhhcCCCcccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCC Confidence 99999999999999988765 456666666665552 2455666667888999999987665432 33 Q ss_pred ---ceEEEEEcccceEEEEcCCCCc---eeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccc Q lcl|NC_013644. 140 ---RLCFQVADSLNVFGVYNEYNEL---QRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINP 213 (510) Q Consensus 140 ---~~~i~~~~p~~~~~~~d~~~~~---~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~ 213 (510) .+++..++|+.+---++..... .-.-++ +... .....-+++..-.++..+. .. T Consensus 146 ~~~~~~lqliepd~l~~~~~~~~~~~g~~i~~GI---e~d~-~Gr~vaY~i~~~hpgd~~~--~~--------------- 204 (495) T protein:vir:10 146 LSVPLQLQIIEPDMLASDIPDETLPSGGYVKGGI---RFSN-GGKRKAYCFYRNHPAESSL--IG--------------- 204 (495) T ss_pred CccceEEEEechhhcCCCCCCCCCCCCCEEEece---EECC-CCceEEEEEeecCCCcccc--cc--------------- Confidence 3689999999873222211100 001111 1111 1111112211111111000 00 Q ss_pred ccccccccccccccccccCCccc---EEEec----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCC Q lcl|NC_013644. 214 RPHVLAVDSENESLLQRSYGQIP---FYRLS----NNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQG 286 (510) Q Consensus 214 ~~~~~~~~~~~~~~~~~~~g~iP---vv~~~----nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~ 286 (510) ....+.+|| |+|+. .-..|.|.+..++.| ..++.....-.......+.-..+++.... T Consensus 205 --------------~~~~~~rvpA~~vlH~f~~r~gQ~RGis~la~i~~l-~~l~~y~dael~~a~i~A~~~~fi~~~~~ 269 (495) T protein:vir:10 205 --------------DPVDTVWIKAEHVLHVTVLTVRSDAGAPWFQLLLRL-NELDQYEDAELVRKKTAALFAAFIQEATA 269 (495) T ss_pred --------------cccceeeechhheEeccccCCCcccCcchhHHHHHH-HHhhHHHHHHHHHHHHhhhheeeeecCCC Confidence 000112233 33332 334588988877664 33333322222222222222233332111 Q ss_pred Cc-------------hhhhhHhhhcCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccc-cccCcccH Q lcl|NC_013644. 287 DD-------------LSKLRQNVKSKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQ-VGDGNITN 352 (510) Q Consensus 287 ~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~~g~~Sg 352 (510) .. .......+..+.+..+..|.++++.+.+.+...+..++..+...|....++|-... ...+++|- T Consensus 270 ~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nY 349 (495) T protein:vir:10 270 DSTGGPTIGQPKRSKGGKRITGLNPGTLQYLQPGQEVKFSNPADVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNY 349 (495) T ss_pred ccccccccCccccccCcccceecCCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccH Confidence 10 01112235566777788899999999888888899999999999988777764322 23344555 Q ss_pred HHHHHHHHHHHHHHHHHHH-HHHH-HHHHHHHHHHHHHhhccCCc----cccc--eeeEEeC--CCCCCCHHHHHHHHHH Q lcl|NC_013644. 353 IVIKARYTLLNMKANKTEA-RLRA-LLEWMNKLVIDDINRRYTKA----FDPT--EVSFTFT--REVMVNETDIVNDEKT 422 (510) Q Consensus 353 ~Ai~~~~~~l~~k~~~k~~-~~~~-~l~~~~~~i~~~~~~~~~~~----~~~~--~v~i~f~--~~~p~d~~e~~~~~~~ 422 (510) .+++..+......+...+. .+.. .++.+++..+...-..|.-. ++.. -+.+.|. .-.-.|....+++... T Consensus 350 SS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~ 429 (495) T protein:vir:10 350 SSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLG 429 (495) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhccccccCCccccChHHHHHHHHH Confidence 5666666666656655443 3433 33445554444332222211 1111 1345563 3345799999999999 Q ss_pred HHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccc Q lcl|NC_013644. 423 EAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQ 493 (510) Q Consensus 423 ~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (510) ++.+|+.|.+.++...+ .|.++..+++.++.+...+ .-+-.+..+..........++..++.++ ++ T Consensus 430 ~i~~G~~s~~~~~a~~G-~D~~~v~~q~a~e~~~~~~---~Gl~~~~~p~~~~~~~~~~~~~~~~~~~-~e 495 (495) T protein:vir:10 430 DVRAGFAPISDKQAERG-YDMEELFDMISDANQLIDE---YDLRLDSDPRYVNGSGAEQKSVMEAALN-NE 495 (495) T ss_pred HHHcCCCCHHHHHHHcC-CCHHHHHHHHHHHHHHHHH---cCCCCCCCCCcCCCccCCCCCCCCCCCC-CC Confidence 99999999999999885 4444444333333221111 0000000000000000000000000000 00 No 117 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=99.20 E-value=6.4e-10 Score=70.99 Aligned_cols=463 Identities=10% Similarity=0.009 Sum_probs=207.9 Q ss_pred cCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHh Q lcl|NC_013644. 5 LSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQ 84 (510) Q Consensus 5 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~ 84 (510) ++.|...+.+.+++..+..+..+ ......++++|+---+- +..+.. ..........+.+.++..+-+..-++..++ T Consensus 1 m~~d~~~~~~~l~~r~~~l~~~R-~~~e~~w~e~~~~~lP~--~~~~~~-~~~~~~~~~~~~~~~~~dstg~~a~~~LAs 76 (549) T protein:vir:10 1 MTNDDAKILQALNADHGRMKEKR-QSYEAVWNDVIDYLMPR--LDKFGQ-LPRPDSEKGRERSQKMFDSTAPLALRNFVA 76 (549) T ss_pred CCcchHHHHHHHHHHHHHHHHHh-hhHHHHHHHHHHHhccc--cccccc-cCCCCCCcccccccccccchHHHHHHHHHH Confidence 44466666666766666655332 22344455554332211 100000 000000011122345666777788888887 Q ss_pred hhhc--CCce-----eccCcHH------HHHHHH-------HHh---ccCHHHHHHHHHHHHHhcCeEEEEEEECCCCce Q lcl|NC_013644. 85 YLLS--NPVE-----YETENEE------LKEYLA-------EYY---NSEFQVVLQELVEGSSQKGFEYVYARTNAEDRL 141 (510) Q Consensus 85 ~l~g--~p~~-----~~~~d~~------~~~~l~-------~~~---~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~ 141 (510) .|++ .|+. +...++. +...|. ..+ ..||.....++.++..++|.|.+++..|..+.+ T Consensus 77 ~l~~~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~~~~~~~ 156 (549) T protein:vir:10 77 AMDSMITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHDVGKGI 156 (549) T ss_pred HHHhhccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEeecCCCee Confidence 7764 2322 3333321 222222 222 357888889999999999999888877766778 Q ss_pred EEEEEcccceEEEEcCCCCceeEEEEEEEEEee--------C--------CceeEEEEEEEEcCCcEEEEEEcCCceeec Q lcl|NC_013644. 142 CFQVADSLNVFGVYNEYNELQRICRHYITEIEK--------D--------GETVDIHHAEVWTDQNVYFFVAEDNKDYEL 205 (510) Q Consensus 142 ~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~--------~--------~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~ 205 (510) ++..++-.+++..-|..+++..++|.+...... . ........+++|+- + |.......... T Consensus 157 ~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~v~~~--V--~pr~~~~~~~~ 232 (549) T protein:vir:10 157 VYRNVPMQRLWFAENNSGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYHA--V--EPRADRDPRKL 232 (549) T ss_pred EEEEEEcCeEEEeeCCCCCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEEEEEE--e--ecCCCCCcccc Confidence 899999999998888888888877654433110 0 00011223333321 0 00000000000 Q ss_pred ccccccccccccccccccccccccccCCcccEEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeE Q lcl|NC_013644. 206 DEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLS-----NNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYV 280 (510) Q Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv 280 (510) + ... -++.......+...-....+|..+|++.++ ++.+|+|-.++..+-+..+|.+.-......+...+|.+. T Consensus 233 ~-~~~-~pf~sv~~e~~~~~il~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~ 310 (549) T protein:vir:10 233 D-GRN-MQFASYWLDEGRDRIVQNSGFRTFPFAIGRFYVGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLVDPPLL 310 (549) T ss_pred c-ccc-CceEEEEEEecCCEeeccCCcccCCcceeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 0 000 000000000111111223445566766554 346899999999999999999999999999999999887 Q ss_pred EecCCCCchhhhhHhhhcCee--eecc--CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccc-ccccCcccHHHH Q lcl|NC_013644. 281 VSGFQGDDLSKLRQNVKSKKV--VGTG--SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDST-QVGDGNITNIVI 355 (510) Q Consensus 281 ~~g~~~~~~~~~~~~~~~~~~--~~~~--~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~-~~~~g~~Sg~Ai 355 (510) +.-....++.+. ..++. +..+ ++..+..+....+.......++.++..|-..-..-.+. .......|++.+ T Consensus 311 v~~~g~~~~~~l----~pgg~~~~~~~~~~~~~~~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~~TAtEV 386 (549) T protein:vir:10 311 ANEDGVLDGFDL----RSGALNWGGLNDKGEEMVKPLLTGKQAQIGIEFAQDTRQTINQWFYVTLFQILVDSGDMTATEV 386 (549) T ss_pred ecccccccccee----ccCCccccccCCCCccceeeeccccchhHHHHHHHHHHHHHHHHHhhhhhhhhcCCCCccHHHH Confidence 643222222222 22221 1122 22335555555566777777777777665432111111 112334566665 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHhhccCCcc-------ccceeeEEeCCCCCCCHH-HH--- Q lcl|NC_013644. 356 KARYTLLNMKANKTEARLRALLEW--------MNKLVIDDINRRYTKAF-------DPTEVSFTFTREVMVNET-DI--- 416 (510) Q Consensus 356 ~~~~~~l~~k~~~k~~~~~~~l~~--------~~~~i~~~~~~~~~~~~-------~~~~v~i~f~~~~p~d~~-e~--- 416 (510) ..+. .++...++..+.+ ++.-.+.++...+.-+- ....+.|.|..++-+... +. T Consensus 387 ~~r~-------~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r~g~lP~~p~~l~~~~~~~~i~yis~La~aq~~~~~~~ 459 (549) T protein:vir:10 387 LQRA-------QEKGVLLAPTLGRTQSELLGPMIAREVDILAEAGQLPDMPQELIDAGADVDVEYDSPLNKAMRAGEGAA 459 (549) T ss_pred HHHH-------HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhhcCCceeEEEeecHHHHHHHHHHHHH Confidence 5533 3344444443333 22223333433332111 123466777665544211 11 Q ss_pred ----HHHHHHHHhcCC-----CchHHHHHh----CC-----CCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCC Q lcl|NC_013644. 417 ----VNDEKTEAETRK-----IILESILQV----AP-----RLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNT 478 (510) Q Consensus 417 ----~~~~~~~~~~g~-----iS~et~~~~----~~-----~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~ 478 (510) ++.+..+.+.+. +....++.. ++ ..+++|.+++.+++++.+......++...... ...+.. T Consensus 460 i~~~~~~~~~laq~~Pe~ld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~~~~~qqq~~~~~~~a~~a~~-~a~~~~ 538 (549) T protein:vir:10 460 ILQWLQQLGIVSQFDPAAAKVPNGARIARLLADYGGVPVEAMSTDEELQAQQAAEAQAAQMQQMLAAAPVAAG-AIKDLS 538 (549) T ss_pred HHHHHHHHHHHhccChhHHhcCCHHHHHHHHHHhcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhhh Confidence 111222211121 222233332 11 11334444333333222222111111111110 000000 Q ss_pred CCcccCCCCCCcccccccCcccccc Q lcl|NC_013644. 479 DEEETAVNPDDPTQQMAEGATGSTE 503 (510) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (510) + .... +|.... T Consensus 539 ~-----~~ta---------~~~~~~ 549 (549) T protein:vir:10 539 D-----AQTA---------AQTARV 549 (549) T ss_pred h-----hcCC---------CcccCC Confidence 0 0000 111111 No 118 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=99.17 E-value=6.5e-10 Score=70.94 Aligned_cols=456 Identities=12% Similarity=0.074 Sum_probs=168.2 Q ss_pred CCCccC--CChhhhHHHHHHHHHhhh---hhhhHHHHHHHHHHhccCCcchhccccee--cccccccccccccc------ Q lcl|NC_013644. 1 MEALLS--EDVKIIANALKAAIDKDR---KSSSKREAETGIRYYNHENDIMNNRIFYV--DDEGILREDKYASN------ 67 (510) Q Consensus 1 ~~~~~~--~~~~~~~~~i~~~i~~~~---~~~~~~~~~~~~~YY~g~~~i~~~~~~~~--~~~~~~~~~~~~~~------ 67 (510) |..... .+. .+...+.++++... ..........+.++-.++......+-... ...+...+...++. T Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~~l 79 (547) T protein:vir:63 1 MGLFESIRLAG-VNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGV 79 (547) T ss_pred Cchhhhhhhhc-CCccccccccccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecccccccCCccCChhHHHHH Confidence 211100 000 00000000000000 00000111122333222221111100000 00111111111111 Q ss_pred c-eec-cchhHHHHHHHHhhhh--cCC-----------ceec-------cCcHHHHHHHHHHhc-------c---CHHHH Q lcl|NC_013644. 68 V-RIP-HGFFPEIVDQKTQYLL--SNP-----------VEYE-------TENEELKEYLAEYYN-------S---EFQVV 115 (510) Q Consensus 68 ~-ki~-~n~~~~Iv~~~~~~l~--g~p-----------~~~~-------~~d~~~~~~l~~~~~-------n---~~~~~ 115 (510) . ... .++...+|+..+.-+. +.+ +++. ..++.....|.+++. + .+... T Consensus 80 ~~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s~~~f 159 (547) T protein:vir:63 80 LKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSF 159 (547) T ss_pred HHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccchHHHH Confidence 0 111 2344555544443322 111 1111 112222234444432 1 23345 Q ss_pred HHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCc-eeEEEEEEEEEeeCCceeEEEEEEEEcCCcEE Q lcl|NC_013644. 116 LQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNEL-QRICRHYITEIEKDGETVDIHHAEVWTDQNVY 193 (510) Q Consensus 116 ~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~-~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~ 193 (510) +..+..+.+.+|.+|+.+.++.+|++ .+.+++|..+.++.+..+.. ....+++ ....+. ....+....+. T Consensus 160 ~~~lv~d~ll~Gn~~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~--~~~~~~------~~~~~~~~eii 231 (547) T protein:vir:63 160 VKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRFV--QVIDQK------IVATFNAREMA 231 (547) T ss_pred HHHHHHHHHhhCCEEEEEEECCCCcEEEEEEecCceeEEEECCccccccCceEEE--EEcCCc------EEEEeccccEE Confidence 56677888999999999888998876 47889999998887654321 1111111 111110 00112333333 Q ss_pred EEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 194 FFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQD 273 (510) Q Consensus 194 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~ 273 (510) +++.. |.........|.|.++.+...|.....+..-..+.+.. T Consensus 232 h~r~n-------------------------------------~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~N 274 (547) T protein:vir:63 232 FAVRN-------------------------------------PRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSH 274 (547) T ss_pred Eeccc-------------------------------------CCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHc Confidence 33211 00000011236666666666555555444444455555 Q ss_pred hcccee--EEecCC-CCc--hhhhhHhhh-------c-CeeeeccCCCceeEEeecC--CHHHHHHHHHHHHHHHHHHhC Q lcl|NC_013644. 274 FAEAIY--VVSGFQ-GDD--LSKLRQNVK-------S-KKVVGTGSDGGLDVKTVTI--PTEGRKTKMEIDKENIYKFGM 338 (510) Q Consensus 274 ~~~~~l--v~~g~~-~~~--~~~~~~~~~-------~-~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~~i~~~s~ 338 (510) .+.|-. .+.+.. .++ ...++..+. . +++..+. +++++|..... ....+.+..+...+.|...-+ T Consensus 275 g~~p~giL~~~~~~~ls~e~~~~lk~~~~~~~~G~~nagk~~vl~-~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afg 353 (547) T protein:vir:63 275 GGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVS-AEDVKFVNMTPSARDMEFEKWLNYLINVISALYG 353 (547) T ss_pred CCCcceEEEecCCCCCCHHHHHHHHHHHHHHhcCccccccccccc-CCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhC Confidence 555543 344432 221 112222211 1 1222232 33455554443 444556667778888888888 Q ss_pred CccccccccCc-----ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCH Q lcl|NC_013644. 339 AFDSTQVGDGN-----ITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNE 413 (510) Q Consensus 339 ~p~~~~~~~g~-----~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~ 413 (510) +|++-.+..+. .++..+-. +.+. ......+...|.-+++.|...+...--..+. ..+.+.|+.....+. T Consensus 354 VPP~~lG~~~~~~~~~~~~~s~t~--sn~e---~~~~~~~~~tL~P~~~~ie~~ln~~L~~~~~-~~~~~~f~~~~~~~~ 427 (547) T protein:vir:63 354 IDPAEINIPNNGGATGSKGGSLNE--GNSA---EKNQASKNKGLQPLLGFIEDFINKHIVAEFG-DKYTFQFVGGDIKSE 427 (547) T ss_pred CCHHHcCcccccccccccccccch--hhHH---HHHHHHHHHHHHHHHHHHHHHHHhhcccccC-CceEEEeeccccccH Confidence 88753321110 01111110 0000 1112334444555554454444332211222 246778888888888 Q ss_pred HHHHHHHHHHHhcCCCchHHHHHhCCC---CCc-HHHH------HHHHHHHHHHHHHHHHHHHh---hhccCCCCCCCCC Q lcl|NC_013644. 414 TDIVNDEKTEAETRKIILESILQVAPR---LDD-DNVL------RLICEQFDLDWEDVKEALEE---AEYTKGLSDNTDE 480 (510) Q Consensus 414 ~e~~~~~~~~~~~g~iS~et~~~~~~~---v~d-~e~~------~~~~e~~e~~~~~~~~~~~~---~~~~~~~~~~~~~ 480 (510) ++.+. +.++..+|+++.-.++++++. +.. ++.. ..-+.......+........ .+..+...+.++. T Consensus 428 ~~~~~-~~~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (547) T protein:vir:63 428 LESVK-ILAEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVE 506 (547) T ss_pred HHHHH-HHHHHhCCCcCHHHHHHHhCCCCCCCCCceeecccccccccccccccCCccccchhhccccccccCCCCCCCCC Confidence 87776 445777899999888887643 211 1100 00000000000000000000 0000000101111 Q ss_pred cccCCCCCC---cccccccCcccccccccCCCC Q lcl|NC_013644. 481 EETAVNPDD---PTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 481 ~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~ 510 (510) +++...+.. +.+.......+.+.-.-+.+| T Consensus 507 ~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 539 (547) T protein:vir:63 507 DIPDGKDTTGDIGKDGQRKDKDNANAGKQGMKG 539 (547) T ss_pred CCCCCcccCCCcCccccccCccccchhhhhcCC Confidence 111100000 001111111111111123333 No 119 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.14 E-value=1.4e-09 Score=69.19 Aligned_cols=455 Identities=8% Similarity=-0.032 Sum_probs=203.2 Q ss_pred CCCccCCChhhhHHH-HHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccc---------cccc-cce Q lcl|NC_013644. 1 MEALLSEDVKIIANA-LKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILRED---------KYAS-NVR 69 (510) Q Consensus 1 ~~~~~~~~~~~~~~~-i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~---------~~~~-~~k 69 (510) |--+-.----..+.. ..-... .....-|+|-.. .+....... ...... ..++ +.- T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~a-----------r~~~~~y~aa~~--~r~~~~~~~-~~s~~~~i~~~~~~lr~RaRdL~ 66 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAA-----------REAIQAYEAARP--GRTHKAKRQ-PLGADTSLQKSAVSMREQCRKLD 66 (548) T ss_pred CchHHhHhhhcchHHHHHHHHh-----------HHHhccccccCc--cccccccCC-CCChHHHHHHHHHHHHHHHHHHH Confidence 322210000011111 100000 001112333211 000000000 000000 0000 000 Q ss_pred eccchhHHHHHHHHhhhhc-CCceecc----Cc----HHHHHHHHHHh----c-------cCHHHHHHHHHHHHHhcCeE Q lcl|NC_013644. 70 IPHGFFPEIVDQKTQYLLS-NPVEYET----EN----EELKEYLAEYY----N-------SEFQVVLQELVEGSSQKGFE 129 (510) Q Consensus 70 i~~n~~~~Iv~~~~~~l~g-~p~~~~~----~d----~~~~~~l~~~~----~-------n~~~~~~~e~~~~~~~~G~~ 129 (510) .-++|++-+|+..++.++| ..+.+.+ .+ ++..+.|+..| . .+|......+++...+.|.+ T Consensus 67 rNn~~a~~av~~~~~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~ 146 (548) T protein:vir:95 67 EDHDLVTGLLDRLEERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEG 146 (548) T ss_pred hcChHHHHHHHHHHHhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCce Confidence 1246899999999999998 3444432 22 23333344333 2 23566666678889999999 Q ss_pred EEEEEECCCC--------ceEEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCc Q lcl|NC_013644. 130 YVYARTNAED--------RLCFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNK 201 (510) Q Consensus 130 ~~~v~~d~~g--------~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~ 201 (510) ++...++..+ .+++..++|+.+---++..+. ...-++ +.... ....-+|+.-..++..+.+. . T Consensus 147 f~~~~~~~~~~~~~g~~~~~~lqliepd~l~~~~~~~~~-~i~~GI---E~D~~-Grp~aY~i~~~hPgd~~~~~-~--- 217 (548) T protein:vir:95 147 LAQKLMGRVPNYTFATSVPFALELLEPDYLPFSYNNLSK-GIVQGI---ERDTW-RRKRAYHLLKDHPGNLQTLG-G--- 217 (548) T ss_pred EEEeeecccccccCCcccceEEEEechhhcCCCCCCCCC-ceeeee---EECCC-CceEEEEEeecCCCcccccc-c--- Confidence 8877665432 258999999887322222111 001111 11111 11111221111122111100 0 Q ss_pred eeecccccccccccccccccccccccccccCCccc---EEEecC-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 202 DYELDEAEPINPRPHVLAVDSENESLLQRSYGQIP---FYRLSN-----NKQETTDLKPIKALIDDYDLMNCFLSNNLQD 273 (510) Q Consensus 202 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~n-----n~~g~sd~~~v~~liD~~n~~~S~~~~~~~~ 273 (510) ...+-+|| |+|+.. -..|.|+|..++..+..++.....-...... T Consensus 218 ---------------------------~~~~~rvpA~~VlHif~~~r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki 270 (548) T protein:vir:95 218 ---------------------------SLAVKRVEAERIIHIAYRKRIGQNRGVPMLHAVLIRLADLKDYEESERVAARI 270 (548) T ss_pred ---------------------------ccceeeechhHheecccccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHH Confidence 00112233 444432 3468999999988877777655544444443 Q ss_pred hccceeEEecCCCC--------chhhhhHhhhcCeeee-ccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccc Q lcl|NC_013644. 274 FAEAIYVVSGFQGD--------DLSKLRQNVKSKKVVG-TGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQ 344 (510) Q Consensus 274 ~~~~~lv~~g~~~~--------~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~ 344 (510) .+.--.+++..... ........+..+.++. +..|.++++++.+.+...+..+...+...|....++|-... T Consensus 271 ~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~pG~iv~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~l 350 (548) T protein:vir:95 271 SAALAMYIKKGNPDSYTVEPGKDRKNRTIPIAPGMVFDDLEPGEDVGMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSV 350 (548) T ss_pred hhhheeeeecCCCccccCCCCcccccccccccCCccccccCCCceeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHH Confidence 33333334331111 1111112244455554 67888899999888888999999999999998887774322 Q ss_pred c-ccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhccCCcc----c-cceeeEEeCC--CCCCCHHH Q lcl|NC_013644. 345 V-GDGNITNIVIKARYTLLNMKANKTEARLRALLEW-MNKLVIDDINRRYTKAF----D-PTEVSFTFTR--EVMVNETD 415 (510) Q Consensus 345 ~-~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~-~~~~i~~~~~~~~~~~~----~-~~~v~i~f~~--~~p~d~~e 415 (510) . .++ .|-.+.+..+...-..+...+..|...+-+ +++..+...-..+..+. + ...+.+.|.. -.-.|... T Consensus 351 tgD~s-~nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~K 429 (548) T protein:vir:95 351 SRAYD-GTYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWINPMH 429 (548) T ss_pred hcccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccChHH Confidence 1 122 355566666666666665555555443333 44444433222222111 1 1235677743 33479999 Q ss_pred HHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCC-CCCCCc-c-------cCCC Q lcl|NC_013644. 416 IVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLS-DNTDEE-E-------TAVN 486 (510) Q Consensus 416 ~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~-~~~~~~-~-------~~~~ 486 (510) .+++...++.+|+.|.+.++...+ .|.++..+++.++.+...+. ......++...... ..++.+ . ..+- T Consensus 430 ea~A~~~~i~~Gl~T~~~~~a~~G-~D~~ev~~q~a~E~~~~~~~-GL~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (548) T protein:vir:95 430 EANAWELLVKAGFADEAEVARARG-RDPRELKKSRETEIKANRAA-GLVFSSDAYHQLVKSGMDPVEAVQKVYLGVGKML 507 (548) T ss_pred HHHHHHHHHHcCCCCHHHHHHHhC-CCHHHHHHHHHHHHHHHHHc-CCCCCCcccccccccccCCCCchhhhcccccccc Confidence 999999999999999999999876 34444443333333222111 00000000000000 000000 0 0001 Q ss_pred CCCcccccccCcccccccccCC--------CC Q lcl|NC_013644. 487 PDDPTQQMAEGATGSTESQLPE--------NG 510 (510) Q Consensus 487 ~~~~~~~~~~~~~~~~~~~~~~--------~~ 510 (510) ++++.+++.. .=.++-.+|+ +| T Consensus 508 ~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~ 537 (548) T protein:vir:95 508 TADEARELVN--RYGAGLPVPGPDFPNESNNG 537 (548) T ss_pred ccchhHHhhc--cCCCCCcCCCCCCCcccccC Confidence 2222222221 0001111111 11 No 120 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=99.10 E-value=2.2e-09 Score=68.05 Aligned_cols=493 Identities=9% Similarity=0.016 Sum_probs=204.1 Q ss_pred CCCccCCCh---------hhhHHHHHHHHHhhhhhhhH--HHHHHHHHHhccCCcchhcccceeccccccccc-cccccc Q lcl|NC_013644. 1 MEALLSEDV---------KIIANALKAAIDKDRKSSSK--REAETGIRYYNHENDIMNNRIFYVDDEGILRED-KYASNV 68 (510) Q Consensus 1 ~~~~~~~~~---------~~~~~~i~~~i~~~~~~~~~--~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~-~~~~~~ 68 (510) |-..+-+|. +.+...|.+.++..+..+.. .+++++.+||...-.... +.........-. ...-.+ T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~r~ 81 (641) T protein:vir:94 5 MPTPIIEDKESAKRKLSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQ---NTRARNFQTTGADDADWRH 81 (641) T ss_pred CCcccccCCcchhhcCCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhh---hcccccccccccchhcccc Confidence 333333322 33556666666666543211 135566667654322111 111000000000 000123 Q ss_pred eeccchhHHHHHHHHhhhhc----CC--cee---ccCcHHHHHHHHHHh-----ccCHHHHHHHHHHHHHhcCeEEEEEE Q lcl|NC_013644. 69 RIPHGFFPEIVDQKTQYLLS----NP--VEY---ETENEELKEYLAEYY-----NSEFQVVLQELVEGSSQKGFEYVYAR 134 (510) Q Consensus 69 ki~~n~~~~Iv~~~~~~l~g----~p--~~~---~~~d~~~~~~l~~~~-----~n~~~~~~~e~~~~~~~~G~~~~~v~ 134 (510) |+..+.+...++..++.|++ .+ +++ ..++.+..+.++.+| .+++.+...+..++++.+|.+++.++ T Consensus 82 ki~~~~~~~~~~~l~s~Lm~~~~p~~~wf~~~p~~~ed~~~A~~~~~~~~~~l~~~~~~~~~~~~~~d~~~~g~~iv~~~ 161 (641) T protein:vir:94 82 RINTGHTFEVVETLVAYFKGATFPSDDWFDLKGMVPELADAARVVKQLTKTKLEAASIRDIFETYVRNLVLYGVSTYRLG 161 (641) T ss_pred cccchhHHHHHHHHhhHHhhhhcCCCceEEEecCCCChHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHhhcCceEEEee Confidence 67888888888777776654 22 223 234555555555554 25677777888999999999988877 Q ss_pred ECC----------------------------CCceEEEEEcccceEEEEcCCCCcee--EEEEEEEEEe------e---- Q lcl|NC_013644. 135 TNA----------------------------EDRLCFQVADSLNVFGVYNEYNELQR--ICRHYITEIE------K---- 174 (510) Q Consensus 135 ~d~----------------------------~g~~~i~~~~p~~~~~~~d~~~~~~~--~~~~~~~~~~------~---- 174 (510) ++. ...+++..++|.++++ |.+.+... +++++....+ + T Consensus 162 w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~~--dps~~~~~~~f~~~r~t~~t~~~l~~eg~~~ 239 (641) T protein:vir:94 162 WDTSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVWL--DTSGGKNTGTFVRLRHTREELHELVTSGYYD 239 (641) T ss_pred hhhHHHHhhhhhcccchhhcccccccceecccceeeEEecchhheee--cCCCCcccccceehhhhHHHHHHHHhcCCCC Confidence 541 1234666777776653 43332211 1111110000 0 Q ss_pred ----------CCce---eEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccc-cccccc-cccCCcccEEE Q lcl|NC_013644. 175 ----------DGET---VDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDS-ENESLL-QRSYGQIPFYR 239 (510) Q Consensus 175 ----------~~~~---~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~g~iPvv~ 239 (510) +... .....+..-....+..++..+ .... +...........++ ...... -..|...|++. T Consensus 240 ~d~v~~~~~~~~~~~~~d~~~d~~~~~~~~~~~~e~~g--d~~~---d~~~~~~~~~~~~g~~il~~~~~~~~d~~Pf~~ 314 (641) T protein:vir:94 240 LDLTQVEQYVDYKFADPDTPKDVNGTDTSGWDIIEYYG--PLLV---EGVQFWCVHAVFYGKQLIRLSDSKYWCGSPFVT 314 (641) T ss_pred hhhcchhhcccccccccccccccccccccccceeeeee--eecc---CCCceeeEEEEEeCCEEeecccccccCcCCeEE Confidence 0000 000000000000000110000 0000 00000001111111 011111 12345668776 Q ss_pred ecC-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeEEe Q lcl|NC_013644. 240 LSN-----NKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKT 314 (510) Q Consensus 240 ~~n-----n~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (510) ++- .-+|.|....+.+.+..+|.+.-...+.+..+.+|.+++..-....+.++ ....++++..+..++++++. T Consensus 315 ~r~~~~~~~~YG~gp~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~~~l--~~~PG~ii~~~~~~~v~pl~ 392 (641) T protein:vir:94 315 TTLLPDRDSVYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKREDV--KAKPGAVFKVAQHGSLQPID 392 (641) T ss_pred ecceecCCcccCCChHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeecccccccccee--eccCCcceeeCCCCcceeec Confidence 543 45799999999999999999999999999999999887644322222222 13355566666667788775 Q ss_pred ec-CCHHHHHHHHHHHHHHHHHHhCCcccc----ccccCcccHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_013644. 315 VT-IPTEGRKTKMEIDKENIYKFGMAFDST----QVGDGNITNIVIKARYTLLNMKANKTEARLR-ALLEWMNKLVIDDI 388 (510) Q Consensus 315 ~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~----~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~-~~l~~~~~~i~~~~ 388 (510) .. .+.......++.+...|-....+..+. ...+.+.++..+..+......+.....+.|. +++..+++.++.++ T Consensus 393 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~ 472 (641) T protein:vir:94 393 MGRQDFVVTYQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLL 472 (641) T ss_pred CCccccchhHHHHHHHHHHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 43 233333445555554444332222221 1111233555566666666666666666666 46666666655544 Q ss_pred hhc-----------------cCCccccceeeEEeCCCCCCCHHH---HHHH---HHHHHh-cCCCc-----------hHH Q lcl|NC_013644. 389 NRR-----------------YTKAFDPTEVSFTFTREVMVNETD---IVND---EKTEAE-TRKII-----------LES 433 (510) Q Consensus 389 ~~~-----------------~~~~~~~~~v~i~f~~~~p~d~~e---~~~~---~~~~~~-~g~iS-----------~et 433 (510) ... +-.+....++...|.- +|...++ .++. +..+.+ .+..+ .+. T Consensus 473 ~~~~~~p~i~R~~~~~~~~~~~~~~~p~~L~~~~~i-v~l~~~q~~~~~~~i~~l~~~~~~~a~~P~v~d~~d~~~~~~~ 551 (641) T protein:vir:94 473 QQFYVTPETIRMYVPEEQMDGFFEVSPEYLHYPYKF-LALGANYVVERERMVTDLLQLLDISGRVPQIGQSLDYALILED 551 (641) T ss_pred HHhccchhhhhhhchhhhcccCCCCCccceeeeeeE-eecchhHHHHHHHHHHHHHHHHHHhhcChhhhhcCCHHHHHHH Confidence 321 1112223334333322 2333222 2222 222111 11122 122 Q ss_pred HHHhCCCCC--------cHHHHHHHHHHHHHHHHHHHHHHH------hhhccCCCCC--CCCCcccCCCCCCcccccccC Q lcl|NC_013644. 434 ILQVAPRLD--------DDNVLRLICEQFDLDWEDVKEALE------EAEYTKGLSD--NTDEEETAVNPDDPTQQMAEG 497 (510) Q Consensus 434 ~~~~~~~v~--------d~e~~~~~~e~~e~~~~~~~~~~~------~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~ 497 (510) +++..+.-. +.+......++++.+......++. .+........ ..-.+.-+.++++-..++-.. T Consensus 552 ~~~~~g~~~p~~~ir~~~~~~~~~~~~~~~~q~~~~~~a~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 631 (641) T protein:vir:94 552 LLRQMRFTDPMRYIKKAEAPPAAPPIAPAEPGALPPEMMNSVGGGLNDQAIAGMTPEDVSDLASRIGIDTSDVAPEAMAA 631 (641) T ss_pred HHHHhCCCCchhhccCccCchhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHhhHHHHHHHHHhhcCCchhhhHHHHhc Confidence 233222111 111111111111111111110000 0000000000 000011112222222222221 Q ss_pred -ccccccccc Q lcl|NC_013644. 498 -ATGSTESQL 506 (510) Q Consensus 498 -~~~~~~~~~ 506 (510) -++-+++++ T Consensus 632 ~~~~~~~~~~ 641 (641) T protein:vir:94 632 ATQQITSGAL 641 (641) T ss_pred ccccccccCC Confidence 111122223 No 121 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=99.08 E-value=2.6e-09 Score=67.65 Aligned_cols=477 Identities=10% Similarity=0.051 Sum_probs=206.0 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |.. .+.+.+++..+..+..+ ......++++|+---+ .+.++... . .....+.+.++..+-+..-++ T Consensus 1 m~~-------~~~~~l~~r~~~l~~~R-~~~e~~w~e~~~~~lP--~~~~~~~~-~---~~~~~~~~~~~~dst~~~a~~ 66 (559) T protein:vir:95 1 MAE-------TTKERLNKQFAQLESER-QSFEPHWRELSDYINP--RGSRFLTS-E---VNRNDRRNTRIIDSTGTMAAR 66 (559) T ss_pred CCh-------hhHHHHHHHHHHHHHHh-hHHHHHHHHHHHHhcc--ccCCcCCC-C---CCcccccccccccchHHHHHH Confidence 221 12334444444443222 2223333444332111 11111000 0 001112234566777777788 Q ss_pred HHHhhhhcC--Cce-----eccCc------HHHHHHHH-------HHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCCC Q lcl|NC_013644. 81 QKTQYLLSN--PVE-----YETEN------EELKEYLA-------EYY-NSEFQVVLQELVEGSSQKGFEYVYARTNAED 139 (510) Q Consensus 81 ~~~~~l~g~--p~~-----~~~~d------~~~~~~l~-------~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g 139 (510) +.++.|++- |+. +...+ ..+.+.|. ..+ ..||.....++.++..++|.|.+++..|..+ T Consensus 67 ~Las~l~~~ltpp~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~d~~~ 146 (559) T protein:vir:95 67 TLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLDDDED 146 (559) T ss_pred HHHHHHHHhhcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeecCCCc Confidence 887777642 322 33222 22333332 233 4678888999999999999998877666666 Q ss_pred ceEEEEEcccceEEEEcCCCCceeEEEEEEEEEee--------C---------CceeEEEEEEEEcCCcEEEEEEcCCce Q lcl|NC_013644. 140 RLCFQVADSLNVFGVYNEYNELQRICRHYITEIEK--------D---------GETVDIHHAEVWTDQNVYFFVAEDNKD 202 (510) Q Consensus 140 ~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~--------~---------~~~~~~~~~e~y~~~~i~~~~~~~~~~ 202 (510) .+++..++..+++..-|..+++..++|.+...... . .....-..+++++- .|....... T Consensus 147 ~~r~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~----V~pr~~~~~ 222 (559) T protein:vir:95 147 IIRTMPFPIGSYYLANSPRGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHS----VYPNIDRDT 222 (559) T ss_pred eeEEEEeecCeEEEeeCCCCCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEEEE----Eeccccccc Confidence 78999999999999888888888887755443210 0 00000112222110 000000000 Q ss_pred eeccccccccccc-cccccc-ccccccccccCCcccEEEec-----CCCCCCCc-HHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_013644. 203 YELDEAEPINPRP-HVLAVD-SENESLLQRSYGQIPFYRLS-----NNKQETTD-LKPIKALIDDYDLMNCFLSNNLQDF 274 (510) Q Consensus 203 ~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~g~iPvv~~~-----nn~~g~sd-~~~v~~liD~~n~~~S~~~~~~~~~ 274 (510) ...+.. . -+.. ...... ....-....+|..+|++.++ +..+|+|. .....+-+..+|.+.-......+.. T Consensus 223 ~~~~~~-~-~pf~s~~~e~~~~~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~ 300 (559) T protein:vir:95 223 SKLDSK-N-KPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKA 300 (559) T ss_pred cccccc-c-ceEEEEEEEecCCCceeeecCCcccCCccceeeeecCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHH Confidence 000000 0 0000 000000 00011123344556666544 34678985 8888999999999999999999999 Q ss_pred ccceeEEecCCCCchhhhhHhhhcCeeeeccCC---CceeEE-eecCCHHHHHHHHHHHHHHHHHHhCCcc----ccccc Q lcl|NC_013644. 275 AEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSD---GGLDVK-TVTIPTEGRKTKMEIDKENIYKFGMAFD----STQVG 346 (510) Q Consensus 275 ~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~-~~~~~~~~~~~~~~~l~~~i~~~s~~p~----~~~~~ 346 (510) .+|.+.+..-..... .++..+++...+.. ..++.+ +.+.+...+...++.++..|-..-. .+ ...-. T Consensus 301 ~~pp~~v~~~~~~~~----~~l~pgg~~~~~~~~~~~~i~p~~~~~~~~~~~~~~i~~~~~rI~~af~-~d~~~~l~~r~ 375 (559) T protein:vir:95 301 TNPPMVAPTSLKNQR----ASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYF-VDLFMMLQNIN 375 (559) T ss_pred hcCceeccccccccc----eeeeccceeeeCCCCCcccceeecccccchHHHHHHHHHHHHHHHHHhh-hhhHHHhhcCC Confidence 999877643221111 12233333323222 223433 2234455555666666666644321 12 12223 Q ss_pred cCcccHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccCC-----ccccceeeEEeCCCCCCCHH------ Q lcl|NC_013644. 347 DGNITNIVIKARYTLLNMKAN-KTEARLRALLEWMNKLVIDDINRRYTK-----AFDPTEVSFTFTREVMVNET------ 414 (510) Q Consensus 347 ~g~~Sg~Ai~~~~~~l~~k~~-~k~~~~~~~l~~~~~~i~~~~~~~~~~-----~~~~~~v~i~f~~~~p~d~~------ 414 (510) ....|++.+..+...+.+... -..+.-.+.|.-++.-++.++...+.- .....+++|.|..++-+-.. T Consensus 376 ~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~i~v~~is~La~aqk~~~~~~ 455 (559) T protein:vir:95 376 TRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKNMLPPPPDVMEGMPLKVEYISVMAQAQKSIGLSS 455 (559) T ss_pred CCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccCcceEEEeecHHHHHHHHHHHHH Confidence 445576666654333333221 122222233333333344444443321 22234577788766644111 Q ss_pred --HHHHHHHHHHhcCC-----CchHHHHHhC---CCC------CcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCC Q lcl|NC_013644. 415 --DIVNDEKTEAETRK-----IILESILQVA---PRL------DDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNT 478 (510) Q Consensus 415 --e~~~~~~~~~~~g~-----iS~et~~~~~---~~v------~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~ 478 (510) ..++.+..+.+.+. +....++..+ -++ +++|.+++.+++.+.+. .++..+............+ T Consensus 456 i~~~~~~~~~laq~~Pevld~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~rqqr~~~qq-~~q~~~~~~~aa~~~~~~~ 534 (559) T protein:vir:95 456 LASTVNFIGQLAQVKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVEQARQQRAQQQQ-QQQMMAMGMAAAQGVKTLS 534 (559) T ss_pred HHHHHHHHHHHhccChhhhhcCCHHHHHHHHHHHhCCchhhcCCHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhhccc Confidence 11112222222211 2233333322 111 23333333333322222 1111111111111111222 Q ss_pred CCcccCCCCCCcccccccCccccccccc Q lcl|NC_013644. 479 DEEETAVNPDDPTQQMAEGATGSTESQL 506 (510) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (510) +...+++ +.-+......+|..++|- T Consensus 535 ~~~~~~~---~~l~~~~~~~~~~~~~~~ 559 (559) T protein:vir:95 535 EAKTSDP---SVLSAMANAVSGQGGQSQ 559 (559) T ss_pred cccCCCh---hHHHHHHHhhcCccccCC Confidence 2222211 111222222233333333 No 122 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=98.98 E-value=7.6e-09 Score=65.08 Aligned_cols=464 Identities=11% Similarity=0.036 Sum_probs=213.9 Q ss_pred CCCccCCC------hhhhHHHH---HHHHHhhhhh--hhHHHHHHHHHHhccCCcchhcccceeccccccccccccccce Q lcl|NC_013644. 1 MEALLSED------VKIIANAL---KAAIDKDRKS--SSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVR 69 (510) Q Consensus 1 ~~~~~~~~------~~~~~~~i---~~~i~~~~~~--~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~k 69 (510) |..-.+.- .+....++ ..+-+.+.+. +..+..+++++|..-. + .| +. ...+ -.++ ++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~e~~~yi~~~-~--tr--~t-~~~~----~~w~--~s 68 (599) T protein:vir:31 1 MSTDIKTLQKMLEGRDDDRAFIDELVVLFTNMENARAQKDREDKELMDYIDAT-D--TR--KT-SNSK----LPFK--NS 68 (599) T ss_pred CccchHHHHHHhhccCchHHHHHHHHHHHHhhhhhhhhhhcccHHHHHHHhhh-c--cc--cc-ccCC----CCcc--cc Confidence 33222111 12223332 3333333221 2223455666662211 1 01 00 0000 1111 46 Q ss_pred eccchhHHHHHHHHhhhhcCC------ce---eccC--cHHHHHHHHHHhc-----cCHHHHHHHHHHHHHhcCeEEEEE Q lcl|NC_013644. 70 IPHGFFPEIVDQKTQYLLSNP------VE---YETE--NEELKEYLAEYYN-----SEFQVVLQELVEGSSQKGFEYVYA 133 (510) Q Consensus 70 i~~n~~~~Iv~~~~~~l~g~p------~~---~~~~--d~~~~~~l~~~~~-----n~~~~~~~e~~~~~~~~G~~~~~v 133 (510) +..|..-.|++....++++-- +. +..+ .....+.++.+.+ -+|......++.+...+|.|+..+ T Consensus 69 ~t~~k~~~~~~~l~a~~~~~~fp~~~w~d~~~~~~~~~~~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G~~vat~ 148 (599) T protein:vir:31 69 TTINKLAHLHLMITTSYMEHLLPNRNWVDFVGFDNDSVNAEKREIARSYVRGKVEASNLEGVIERMVDDFAVRGFCVAHT 148 (599) T ss_pred cchHHHHHHHHHHHHHHHhhhcCCccceEeeecCCchhHHHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccCceeEee Confidence 777777778888887776531 11 2223 2344556666654 356777788889999999887766 Q ss_pred EEC------CCC-------ceEEEEEcccceEEEEcCC-CCc-eeE--EEEEEEEEee-----C------------Ccee Q lcl|NC_013644. 134 RTN------AED-------RLCFQVADSLNVFGVYNEY-NEL-QRI--CRHYITEIEK-----D------------GETV 179 (510) Q Consensus 134 ~~d------~~g-------~~~i~~~~p~~~~~~~d~~-~~~-~~~--~~~~~~~~~~-----~------------~~~~ 179 (510) .+- ++| .|++..++|..+|| |.+ +.+ ..+ +|.++....- + ...+ T Consensus 149 ~~er~~~~~~d~~v~~~~~~P~~ervsP~Di~~--Dp~A~si~d~~fivRs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~~ 226 (599) T protein:vir:31 149 RHVKRMTVTAENQVIKNYSGTVTERLSPSDVFW--DVTADSLPKAAKCIRQLYTLGSLKREIEEGTFPLMSMEDFQKLRE 226 (599) T ss_pred eEEEcceeecccccccccccceEEeecccceee--CCCCCCCCcceeeeehhhhHHHHHHHhccCCccccchHHHHHHHh Confidence 521 222 37899999998876 432 111 112 2222200000 0 0000 Q ss_pred EEEE--------------------------EEEEcCCcEEEEEEcCCceeeccccccccccccccccccc--cccccccc Q lcl|NC_013644. 180 DIHH--------------------------AEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSE--NESLLQRS 231 (510) Q Consensus 180 ~~~~--------------------------~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 231 (510) ...+ .+.|.+.-+..+...+ -.........+.+.-........ -....|.+ T Consensus 227 ~~~~~~~~~~d~~~~~~g~D~~~~d~~~~~~eY~~~~~VevLeywG-d~ydee~d~~~~~~ViTi~g~~~liR~e~np~~ 305 (599) T protein:vir:31 227 ERRTIREALADGYNGRRKFDSLHKKGYGSMMNYINEGVVEVLTFMG-DFYDEENDELWNNYEITVIDRKIIGRKQSKDTW 305 (599) T ss_pred hccCCCccccchhhhhhhccccccccccchhhhcccchhhhhhhhh-hhhcccCCccccceEEEEecCcEEeecccCCCC Confidence 0000 0111111111111100 00000000000011111111111 12334566 Q ss_pred CCcccEEEecC-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccC Q lcl|NC_013644. 232 YGQIPFYRLSN-----NKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGS 306 (510) Q Consensus 232 ~g~iPvv~~~n-----n~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~ 306 (510) .|..|++.... ..+|.|.+..+.++++.+|.+.-.+.+.++-+..|+++..|. .. +.+.. -..+.++.+.+ T Consensus 306 ~g~~Pyvv~~~~P~~~~~yG~G~l~~~~gaQ~~lN~~~Ng~iD~~~~~l~p~l~~~~d-l~-~eD~~--~~P~~v~~~~d 381 (599) T protein:vir:31 306 DGSQNLHIAVYEFQKDTLCPIGPLHRLTGMQYKLDKRENFREDLHDRFLHPSLKKVGD-VR-EKGMR--GGPNHVFEVEE 381 (599) T ss_pred CCCCCeEEEEeeeeccccCCCCCchhcchHHHHHHHHHHHhhhhhhhhhccccccccc-cc-ccCcc--CCCCcceeecC Confidence 77888876543 457899999999999999999888899999999998887764 11 11211 23567888999 Q ss_pred CCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccc--ccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_013644. 307 DGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQV--GDGNITNIVIKARYTLLNMKANKTEARLRALLEW-MNKL 383 (510) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~--~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~-~~~~ 383 (510) .+++.++..+++.......+..+...+-+.|+.|...-+ ..|..++..+..+....-.....+.+.|...+-+ +++. T Consensus 382 ~~~vq~~~p~s~~~~a~~~is~~e~~mee~sGvp~~~~G~~~ag~~TA~~is~l~naa~~~~~~~vr~~e~~~lepll~~ 461 (599) T protein:vir:31 382 TGDVQYMTPPAEVLQPDNQLSITLQLMEDLSGAPKESIGQRTAGEKTKFEVQLLDQGQNKVFRRKVKKFERELLTPVLND 461 (599) T ss_pred CCccccccCchhhhhHHHHHHHHHHHHHHhhccchhhcCCcccchhhHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHH Confidence 999999998888878888888888888899999987654 3345666677777777777777778888777655 5554 Q ss_pred HHHHHh-h---ccCC-ccc-------ccee-------eEEeCCCCCCCHHHHHHHHHH---HHhc----CC---CchHHH Q lcl|NC_013644. 384 VIDDIN-R---RYTK-AFD-------PTEV-------SFTFTREVMVNETDIVNDEKT---EAET----RK---IILESI 434 (510) Q Consensus 384 i~~~~~-~---~~~~-~~~-------~~~v-------~i~f~~~~p~d~~e~~~~~~~---~~~~----g~---iS~et~ 434 (510) ++++.. . .+.. -.+ ..+| ...+.+--..-..+..+.++. ..++ ++ +|++.. T Consensus 462 l~e~~~~f~D~~~tiri~~~e~~~~~f~~i~redl~~~~~~v~~Ga~~v~ere~~~q~l~~il~~~~~q~~~P~~~~k~l 541 (599) T protein:vir:31 462 YLEQGRNHLDASDTIKTFNSELGTATFLDITADDLNLNGQMVAQGATLFAEKANTLQNLNAILGGPLGAALAPHMSRTKL 541 (599) T ss_pred HHHHHHhhcccccceeeecccccceeeEEeehhhhhCCeeeeechhhHHHHHHHHHHHHHHHhcccCCCccchhhHHHHH Confidence 443221 1 0000 000 0001 111111111112333333322 2211 11 233222 Q ss_pred ---HHh---CC--CC-CcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCccc Q lcl|NC_013644. 435 ---LQV---AP--RL-DDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQ 492 (510) Q Consensus 435 ---~~~---~~--~v-~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (510) ++. +. .+ .+.-..+..+-+..+.+...+...+.+-.+...+.+..+.. + T Consensus 542 ~~~l~~~~~l~~~~~~~~~va~~eqq~~~~m~Q~~lq~~~~~~~~~~~~~~~~~~~~---------~ 599 (599) T protein:vir:31 542 FNAVEYLGDLDAYGIFTFGIGVQEDQQLARMAQKSTQQTEETALTQEEVGGPTTDTG---------Q 599 (599) T ss_pred HHHHHHHHhccccccCCCchhHHHHHHHHHHHHHHHHHhHhhhhhhhhcCCCCcccC---------C Confidence 111 11 11 11111111111111111111111111111111010000000 0 No 123 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=98.85 E-value=2.8e-08 Score=61.96 Aligned_cols=440 Identities=10% Similarity=0.076 Sum_probs=200.2 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhh--HHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSS--KREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~--~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) |.- -.....+.+++..+..+..+. -.++..+.+|..-.- ....... ......++..+-+... T Consensus 1 ~~~----~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~-------~~~~~~~-----~~~~~~~~~dst~~~a 64 (522) T protein:vir:94 1 MAE----REGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSL-------FPKESDN-----SSTEYTTPWQAVGARC 64 (522) T ss_pred Ccc----cchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc-------cCCCCCc-----ccccccccccccHHHH Confidence 333 333344555555555543221 123444444432210 0011110 1111224556677777 Q ss_pred HHHHHhhhhcC--C----ceeccCc-------------HHHHHHH-------HHHh-ccCHHHHHHHHHHHHHhcCeEEE Q lcl|NC_013644. 79 VDQKTQYLLSN--P----VEYETEN-------------EELKEYL-------AEYY-NSEFQVVLQELVEGSSQKGFEYV 131 (510) Q Consensus 79 v~~~~~~l~g~--p----~~~~~~d-------------~~~~~~l-------~~~~-~n~~~~~~~e~~~~~~~~G~~~~ 131 (510) +++.++.|++- | +++...+ ..+.+.| ...+ .+||.....++.++..++|.|.+ T Consensus 65 ~~~Las~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l 144 (522) T protein:vir:94 65 LNNLAAKLMLALFPQSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLL 144 (522) T ss_pred HHHHHHHHHhhcCCCCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeE Confidence 77777776541 2 1122111 1122222 2222 47788899999999999999988 Q ss_pred EEEECCCCce-EEEEEcccceEEEEcCCCCceeEEEEEEEEEee----------CCceeEEEEEEEEcC-----CcEEEE Q lcl|NC_013644. 132 YARTNAEDRL-CFQVADSLNVFGVYNEYNELQRICRHYITEIEK----------DGETVDIHHAEVWTD-----QNVYFF 195 (510) Q Consensus 132 ~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~e~y~~-----~~i~~~ 195 (510) ++..+..+.+ ++..++-.+++..-|..+++..+++-+++.... .........+++|+. +++.+| T Consensus 145 ~~~~~~~~~~~~~~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~~~~~~ 224 (522) T protein:vir:94 145 YIPEPEQGTYSPMRMYRLVSYVVQRDAFGNILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVYTHIYRQDDEYLRY 224 (522) T ss_pred eeeccCCCceeeEEEEEcceEEEeeCCCcCeEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEEEEEEeeCCceeEE Confidence 7766655543 567777777777777778887777765543210 011111223333321 111111 Q ss_pred EEcCCceeecccccccccccccccccccccccccccCCcccEEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 196 VAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLS-----NNKQETTDLKPIKALIDDYDLMNCFLSNN 270 (510) Q Consensus 196 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----nn~~g~sd~~~v~~liD~~n~~~S~~~~~ 270 (510) ....+... .......+|..+|++.++ ++.+|+|-.+...+-+..+|.+.-..... T Consensus 225 ~~~~g~~~--------------------~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~ 284 (522) T protein:vir:94 225 EEVEGIEV--------------------TGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITKM 284 (522) T ss_pred eeccCcee--------------------cccCCCCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHH Confidence 11000000 001112356677877654 34689999999999999999999999999 Q ss_pred HHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCccccccccC Q lcl|NC_013644. 271 LQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDSTQVGDG 348 (510) Q Consensus 271 ~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g 348 (510) .+...+|.+.+.--...+..++.. ...+.+..+..++++.+... .+.......++.++..|-..-..-....-... T Consensus 285 ~~~~~~p~~~v~~~g~~~~~~~~~--~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~ 362 (522) T protein:vir:94 285 AKVASKVVGLVNPNGITQPRRLNK--AATGEFVAGRVEDINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLNSAVQRNAE 362 (522) T ss_pred HHHHhCCceeecccccccchheec--cCCceeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhccCCCc Confidence 999999987763222222222221 11233445555666665433 46677778888888777653321122222334 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHhhccCC-ccccceeeEEeCCCCCCC-HHHHHH Q lcl|NC_013644. 349 NITNIVIKARYTLLNMKANKTEARLRALLEW--------MNKLVIDDINRRYTK-AFDPTEVSFTFTREVMVN-ETDIVN 418 (510) Q Consensus 349 ~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~--------~~~~i~~~~~~~~~~-~~~~~~v~i~f~~~~p~d-~~e~~~ 418 (510) ..|++.+..+ +.++...++..+.+ +++..+.++...+.- ......+++.+..++..- ....++ T Consensus 363 r~TAtEV~~r-------~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~v~v~~~s~La~~qr~~~~~ 435 (522) T protein:vir:94 363 RVTAEEIRYV-------AGELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGMIPDLPKEAVEPTVSTGLEALGRGQDLE 435 (522) T ss_pred cccHHHHHHH-------HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCcccEEeeEecHHHHHHHHHHHH Confidence 4566665553 34444444444443 333344444333321 222334677776655431 111111 Q ss_pred HHHHHHh-cCCCchHH---------HHHhC---CCCC-------cHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCC Q lcl|NC_013644. 419 DEKTEAE-TRKIILES---------ILQVA---PRLD-------DDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNT 478 (510) Q Consensus 419 ~~~~~~~-~g~iS~et---------~~~~~---~~v~-------d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~ 478 (510) .+....+ ...++.+. ++..+ -+|+ ++|.+.+.+++...+ ..+...... ..+..... T Consensus 436 ~l~~~~~~ia~l~P~~~~~~id~d~~~~~~a~~~Gv~~~~ivr~~ee~~~~~~q~~~~~--~~~~~~~~~--~~~~~a~~ 511 (522) T protein:vir:94 436 KLTQAVNMMTGLQPLSQDPDINLPTLKLRLLNALGIDTAGLLLTQDEKIQRMAEQSSQQ--AVVQGASAA--GANMGAAV 511 (522) T ss_pred HHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHH--HHHHHHHHH--HHHhhhhh Confidence 1111111 00122222 11111 1221 122222222221111 111111111 11111100 Q ss_pred CCcccCCCCCCcccccccC Q lcl|NC_013644. 479 DEEETAVNPDDPTQQMAEG 497 (510) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~ 497 (510) +....+..+.+ T Consensus 512 --------~~~~~~~~~~~ 522 (522) T protein:vir:94 512 --------GQGAGEDMAQA 522 (522) T ss_pred --------hcccchhhhcC Confidence 00011111111 No 124 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=98.83 E-value=3.3e-08 Score=61.61 Aligned_cols=392 Identities=9% Similarity=0.039 Sum_probs=171.5 Q ss_pred hhhhHHHHHHHHHhhhhhhh-HH---HHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHh Q lcl|NC_013644. 9 VKIIANALKAAIDKDRKSSS-KR---EAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQ 84 (510) Q Consensus 9 ~~~~~~~i~~~i~~~~~~~~-~~---~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~ 84 (510) +.....++. +-. +.... +. ....+..+.-.... +... ... .=+...-....|+..++ T Consensus 1 M~~~~~~f~-~~~--r~~~~~~~~~~~~~~~~~~~g~~~~------------~~~v--~~~--~al~~~~v~~~i~~ia~ 61 (429) T protein:vir:10 1 MDSVKKFFN-FEK--RQTSQVIELNKDDEKLLEWLGISPS------------TISV--KGK--NALKVATVFACIKILSE 61 (429) T ss_pred Cchhhhhhc-ccc--cCcccccccCCChHHHHHHhcCCCC------------ccee--chh--hhhccHHHHHHHHHHHH Confidence 111111000 000 00000 00 00000111100000 0000 000 00112223334555555 Q ss_pred hhhcCCceec--cCc---HHHHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceE Q lcl|NC_013644. 85 YLLSNPVEYE--TEN---EELKEYLAEYYN---S---EFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVF 152 (510) Q Consensus 85 ~l~g~p~~~~--~~d---~~~~~~l~~~~~---n---~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~ 152 (510) -+.+-|+.+- .++ +.....+..+++ | ........+....+.+|.+|+++..+..|++ .+.+++|..+- T Consensus 62 ~ia~l~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~ 141 (429) T protein:vir:10 62 SVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVT 141 (429) T ss_pred hhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeE Confidence 5556676641 111 111112333332 2 2345566778889999999999999988886 68889999998 Q ss_pred EEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccC Q lcl|NC_013644. 153 GVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSY 232 (510) Q Consensus 153 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (510) +..|+...+......|+.... ++.. ..+.+.. T Consensus 142 v~~~~~~~~~~~~~~~~~~~~-~g~~------~~~~~~e----------------------------------------- 173 (429) T protein:vir:10 142 VYIDDVGLLNSKTKMWYVVNT-GGQQ------RVLKPEE----------------------------------------- 173 (429) T ss_pred EEEcCcccccccceEEEEEcc-CCeE------EEEcccc----------------------------------------- Confidence 887765433322222222211 1110 1122333 Q ss_pred CcccEEEecC-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCch---hhhhHhhh------- Q lcl|NC_013644. 233 GQIPFYRLSN-----NKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDL---SKLRQNVK------- 297 (510) Q Consensus 233 g~iPvv~~~n-----n~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~---~~~~~~~~------- 297 (510) |+|+++ ...|.|.+..+...++.......-..+.+...+.|-.+++....-+. ..+...+. T Consensus 174 ----vih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~~~~~~~~~g~~ 249 (429) T protein:vir:10 174 ----ILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQ 249 (429) T ss_pred ----EEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccc Confidence 333332 12366667666666665555444445555565666666554222111 11222111 Q ss_pred -cCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc--CcccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 298 -SKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGD--GNITNIVIKARYTLLNMKANKTEARLR 374 (510) Q Consensus 298 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~~Sg~Ai~~~~~~l~~k~~~k~~~~~ 374 (510) .++++.++++.+++.+........+.+..+...+.|+..-++|+.-.+.. ++-|+ ++. .....+. T Consensus 250 n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn--~e~----------~~~~f~~ 317 (429) T protein:vir:10 250 NSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNN--IEQ----------QQQQFYT 317 (429) T ss_pred ccCceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HHH----------HHHHHHH Confidence 23455666666655555443444556667788888999888988544322 22222 111 1112334 Q ss_pred HHHHHHHHHHHHHHhhccCCccc---cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHH Q lcl|NC_013644. 375 ALLEWMNKLVIDDINRRYTKAFD---PTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLIC 451 (510) Q Consensus 375 ~~l~~~~~~i~~~~~~~~~~~~~---~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~ 451 (510) ..|.-+++.|...+..+--.... ...+++.+..-+..|..+.++.+.++..+|+++.-.++++++.-..+...+. T Consensus 318 ~~l~P~~~~ie~~ln~kl~~~~~~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~-- 395 (429) T protein:vir:10 318 DTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRL-- 395 (429) T ss_pred HHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCee-- Confidence 44444444444444332111100 1124444456667899999999999999999999888887643211100000 Q ss_pred HHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccc Q lcl|NC_013644. 452 EQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGS 501 (510) Q Consensus 452 e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (510) ....+..+. +..+. ...+.+.+.++.+++...| + T Consensus 396 ----------~~~~n~~~~--d~~~~-~~~k~g~~~~~~~~~~~e~---~ 429 (429) T protein:vir:10 396 ----------LVNGNMLPI--DMAGQ-AYLKGGDTNGEVSKEGNEG---N 429 (429) T ss_pred ----------eecccccch--hhccc-cccCCCCCCCCCCCCCCCC---C Confidence 000000000 00000 0000111111111111111 1 No 125 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=98.81 E-value=3.2e-08 Score=61.64 Aligned_cols=423 Identities=10% Similarity=0.050 Sum_probs=162.8 Q ss_pred ccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHH Q lcl|NC_013644. 4 LLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKT 83 (510) Q Consensus 4 ~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~ 83 (510) |...+..+..-.=.+.|.......+.-.......||.-. ..........-..++....|+..+ T Consensus 1 ~~~~~~~i~s~~~~~~i~~~~~~s~~~~~~~~~~~~~pp-----------------~~~~~la~l~~~n~~v~scI~~ia 63 (542) T protein:vir:41 1 MFNYHLSIRSLEKYKAIKREEVESQALGETRFEEYVEPK-----------------VNPLVLLSLLQVNPYHASACSIKA 63 (542) T ss_pred CccccccccccccchhhhhccccccccccccCCccccCC-----------------CCHHHHHHHHhhcHHHHHHHHHHH Confidence 222222111000011111000000000000000111000 000000011112356678888888 Q ss_pred hhhhcCCceeccCcHHHHHHHHHHhcc---CHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCC Q lcl|NC_013644. 84 QYLLSNPVEYETENEELKEYLAEYYNS---EFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYN 159 (510) Q Consensus 84 ~~l~g~p~~~~~~d~~~~~~l~~~~~n---~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~ 159 (510) +.+.+-|+.+...+.. .+..++-| +.......++.+...+|.||+.+..+.+|++ .+.+++|..+.+..|... T Consensus 64 ~~IA~l~~~~~~~~~~---~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~~L~~l~~~~v~v~~d~~~ 140 (542) T protein:vir:41 64 NDIIRTGYILEGDDEG---VVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPIRFEYIPSHTIRVHKDGSR 140 (542) T ss_pred HHHhhCceeeecccch---hhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcceEEEEcCCe Confidence 8888888887654433 23344432 2445666778899999999999989988876 478888888877655321 Q ss_pred CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEE Q lcl|NC_013644. 160 ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYR 239 (510) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~ 239 (510) ++.... ... ..++..|...+.+.. ..+.. ...+..=-|+| T Consensus 141 -------~~~~~~--~~~---~~~~~~y~~~~~~~~--~~g~~--------------------------~~~~~~~eIiH 180 (542) T protein:vir:41 141 -------YRQTWD--GVN---ITHFKDYRYEGEINP--ETGED--------------------------QDSVGANELVF 180 (542) T ss_pred -------eEeeec--CCc---ceeEEeecccccccc--ccccc--------------------------ccccCcccEEE Confidence 111100 000 111111111110000 00000 00011112566 Q ss_pred ecCCC-----CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeE--EecCCCCc-----------hhhhhHhh----h Q lcl|NC_013644. 240 LSNNK-----QETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYV--VSGFQGDD-----------LSKLRQNV----K 297 (510) Q Consensus 240 ~~nn~-----~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv--~~g~~~~~-----------~~~~~~~~----~ 297 (510) |++.. .|.|.+......+.....+..-..+.+.-.+.|-.+ +.|...++ ...+...+ . T Consensus 181 ir~~~~~~~~~Glspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~ 260 (542) T protein:vir:41 181 IHIPSPVCSYYGVPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFK 260 (542) T ss_pred ecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHh Confidence 66432 466766655554444333332233334444445443 44432111 11111111 1 Q ss_pred -----cCeeeecc----CCCceeEEeecC--CHHHHHHHHHHHHHHHHHHhCCccccccccCc--ccHHHHHHHHHHHHH Q lcl|NC_013644. 298 -----SKKVVGTG----SDGGLDVKTVTI--PTEGRKTKMEIDKENIYKFGMAFDSTQVGDGN--ITNIVIKARYTLLNM 364 (510) Q Consensus 298 -----~~~~~~~~----~~~~~~~~~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~--~Sg~Ai~~~~~~l~~ 364 (510) .++++.++ .+++++|..... ....+.+..+...+.|...-++|+.-.+.... .++.-++... T Consensus 261 g~~~n~gk~~vL~~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn~Eq~~----- 335 (542) T protein:vir:41 261 HLKEAPHTPLVFSIPGGDTVKVTFTPLNTSQKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNFAEVTR----- 335 (542) T ss_pred hhhcccCceeEeeccCCcccceeEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCcccccccHHHHH----- Confidence 12333332 234566654443 34455666677788888888888754322211 1111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeC--CCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCC Q lcl|NC_013644. 365 KANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFT--REVMVNETDIVNDEKTEAETRKIILESILQVAPRLD 442 (510) Q Consensus 365 k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~--~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~ 442 (510) ...+...|.-+++.|...+...-..... ..+.+.|+ .-+..|.. ..+.+++++|+++...+++.++.++ T Consensus 336 -----~~f~~~tL~P~~~~ie~~ln~~L~~~~~-~~~~~~f~~~~ll~~d~~---~~~~~~v~~GilT~NE~Re~L~g~~ 406 (542) T protein:vir:41 336 -----RTYYESVVRPQQNIISSILTDFFQVKFN-PKTRFKFNDETLLESDSV---RNCALLVQSGVLTPAEARERLFGLD 406 (542) T ss_pred -----HHHHHHHHHHHHHHHHHHHHhhcccccC-CceEEEecchhhcchHHH---HHHHHHHhCCCCCHHHHHHhhCCCC Confidence 1223333333333333333322111111 23455665 33344433 3455678999999988887665444 Q ss_pred cH-HHHH---------HHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 443 DD-NVLR---------LICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 443 d~-e~~~---------~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) +- +... ....+.+.+..+....++..... +++ -++..+....+.-+++....+.+..+ T Consensus 407 pgdd~~l~p~~~~~~~~~~~~~n~~~~~~~~~~k~~~k~------~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~ 474 (542) T protein:vir:41 407 GGPDIFMVPSKGAAKSVKRQERNYEKNQIREIRKIYAKY------RPR----FNEIISSKLSAEEKKKKIDESLAEFR 474 (542) T ss_pred CCCccccccccccccccccCCcCCCCCchhhhhhccccc------Ccc----ccccccccccchhhcccccchhhhhH Confidence 21 1100 00000000000000000000000 000 00000000000011111111222222 No 126 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=98.79 E-value=4.7e-08 Score=60.74 Aligned_cols=464 Identities=11% Similarity=0.087 Sum_probs=202.3 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhH--HHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSK--REAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~--~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) |-..+. .....+.+++..+..+..+.. .++..+.+|..-. .+. ...... .....++..+-+... T Consensus 1 ~~~~~~--~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~--~~~-----~~~~~~-----~~~~~~~~dst~~~a 66 (543) T protein:vir:88 1 MAETKR--EGLAEEGAKAVYERLKNDRVPYETRAENCAKVTIPS--LFP-----KDSDNS-----STDYTTPWQAVGARG 66 (543) T ss_pred Cccccc--CcchHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccc--cCC-----CCCCcc-----cccccccccchHHHH Confidence 433221 222334444554544432222 2344555554321 000 000000 011124555666777 Q ss_pred HHHHHhhhhcC--Cce----eccCcH-------------HHHHHH-------HHHh-ccCHHHHHHHHHHHHHhcCeEEE Q lcl|NC_013644. 79 VDQKTQYLLSN--PVE----YETENE-------------ELKEYL-------AEYY-NSEFQVVLQELVEGSSQKGFEYV 131 (510) Q Consensus 79 v~~~~~~l~g~--p~~----~~~~d~-------------~~~~~l-------~~~~-~n~~~~~~~e~~~~~~~~G~~~~ 131 (510) +++.++.|++- |.. +...+. .++..| ...+ .+||...+.++.++..++|.|.+ T Consensus 67 ~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l 146 (543) T protein:vir:88 67 LNNLSAKVMLALFPLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALI 146 (543) T ss_pred HHHHHHHHHHhhcCCCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceee Confidence 77777766541 322 222221 122222 2222 36788899999999999999976 Q ss_pred EEEECCCCceE---EEEEcccceEEEEcCCCCceeEEEEEEEEEee-----------CCceeEEEEEEEEcCCcEEEEEE Q lcl|NC_013644. 132 YARTNAEDRLC---FQVADSLNVFGVYNEYNELQRICRHYITEIEK-----------DGETVDIHHAEVWTDQNVYFFVA 197 (510) Q Consensus 132 ~v~~d~~g~~~---i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~e~y~~~~i~~~~~ 197 (510) ++-.|....++ ++.++-.+++..-|..+++..+++-+...... .........+++|+. .|.. T Consensus 147 y~~~~~~~~~~~~~~~~~pl~~y~v~~d~~G~v~~i~r~~~~~~~~l~~~~~~~v~~~~~~~p~~~~~v~~~----V~pr 222 (543) T protein:vir:88 147 YLPPPDASSNSYNPMKLYTLHNHVVQRDAFGNVLQIVTLDKVAYAALPEDVRNSLSGGQEYKPEQELEVYTH----IYID 222 (543) T ss_pred eeccCccccceecceEEeEcceEEEeeCCCCCeeeeeeeeeccHHHHhHHhhHHHHHHhhcCCccceEEEEE----EEee Confidence 66544432222 33344455555556677777776655443211 001111223444432 1111 Q ss_pred cCCceeecccccccccccccccccccccccccccCCcccEEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 198 EDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLS-----NNKQETTDLKPIKALIDDYDLMNCFLSNNLQ 272 (510) Q Consensus 198 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~ 272 (510) ...+.+... . +.... .+. ......++..+|++.++ ++.+|+|-.++..+-+..+|.+.-......+ T Consensus 223 ~~~~~~~~~--~---~~~~~-~v~---~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~ 293 (543) T protein:vir:88 223 DESGDFLSY--Q---EIEGV-EVD---GSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAM 293 (543) T ss_pred cCCCccccc--c---cccCe-eee---cCCCccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 111110000 0 00000 000 01112335667877654 3468999999999999999999998999999 Q ss_pred HhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCccccccccCcc Q lcl|NC_013644. 273 DFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNI 350 (510) Q Consensus 273 ~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~ 350 (510) ...+|.+.+.-....+..++.. ...+.+..+..+++..+... .+.......++.++..|-+.-..-....-..... T Consensus 294 ~~~~pp~~v~~~g~~~~~~~~~--~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~ 371 (543) T protein:vir:88 294 ISSKVVGLVNPNGITQVRRLVK--AQTGDFVAGRKADIEFLQLEKTADFTVAKSVADAIEARLSYVFMLNSAVQRSGERV 371 (543) T ss_pred HHhcCceeeccccccchhhccc--CCCceeecCCCCcceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhccCCCCcc Confidence 9999987763222222233221 12234444556667766543 4677778888888877754321111111223334 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHhhccCC-ccccceeeEEeCCCC-CCCHHHHHHHH Q lcl|NC_013644. 351 TNIVIKARYTLLNMKANKTEARLRALLEW--------MNKLVIDDINRRYTK-AFDPTEVSFTFTREV-MVNETDIVNDE 420 (510) Q Consensus 351 Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~--------~~~~i~~~~~~~~~~-~~~~~~v~i~f~~~~-p~d~~e~~~~~ 420 (510) |++.+.. ++.++...++..+.+ +++..+.++...+.- ......+++.+..++ +-.....++.+ T Consensus 372 TAtEV~~-------r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l 444 (543) T protein:vir:88 372 TAEEIRY-------VASELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQIPNLPQEAVEPTVTTGAEALGRGQDLDKL 444 (543) T ss_pred cHHHHHH-------HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhceeeeEEecHHHHHHHHHHHHH Confidence 6665554 445555555555444 333333444333321 122234566665332 21222222222 Q ss_pred HHHHh-cCCCch---------HHHHHhC---CCCC------cHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCc Q lcl|NC_013644. 421 KTEAE-TRKIIL---------ESILQVA---PRLD------DDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEE 481 (510) Q Consensus 421 ~~~~~-~g~iS~---------et~~~~~---~~v~------d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (510) ....+ .+.+++ ..++..+ -+++ .++....++++++++......+..+.. +...+.. T Consensus 445 ~~~~~~v~~~~~p~vld~id~d~~~~~~a~~~Gv~~~~i~r~~~e~~~~~~q~~~q~~~~~~~~~~~~-----~~~~~~~ 519 (543) T protein:vir:88 445 TQFLNAVATVSQLNGDPDLNVNNIKLRLANAIGIDTAGLLLTEAEKAQAQSQEMLKQGGLNAAAGIGS-----GVAAQAT 519 (543) T ss_pred HHHHHHHHhccchhhhccCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhh-----chhhhhc Confidence 22221 122332 2222221 1232 122222222222211111111111110 0001101 Q ss_pred ccCCCCCCcccccccCccccccccc Q lcl|NC_013644. 482 ETAVNPDDPTQQMAEGATGSTESQL 506 (510) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (510) .. +..-+..-..+.+++|..+.|+ T Consensus 520 ~~-~~~~~~~~~~~~~~~~p~~~~~ 543 (543) T protein:vir:88 520 AS-PEAMESAMDTAGVQPGPIATQV 543 (543) T ss_pred cC-hHHHHHHhhhcCCCCCCCCCCC Confidence 10 1110111123445666666666 No 127 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=98.78 E-value=5.3e-08 Score=60.47 Aligned_cols=472 Identities=9% Similarity=0.047 Sum_probs=203.4 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhh--HHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSS--KREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~--~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) |-. . ..+.|++..+..+..+. ..++.++.+|..- .+..+. ... .....+.+.++..+-+..- T Consensus 1 m~~---~----~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP-----~~~~~~-~~~---~~~~~~~~~~~~dst~~~a 64 (556) T protein:vir:73 1 MAE---T----EKERLLKQLAQLKNERTSFESHWLDLSDFINP-----RGSRFL-TSD---VNRDDRRNTKIVDPTGSMA 64 (556) T ss_pred CCh---h----hHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcc-----ccCCcC-CCC---CCcchhhcCccccchHHHH Confidence 322 1 13334444444432221 1234444444311 111110 000 0011112235667777777 Q ss_pred HHHHHhhhhcC--Cc-----eeccCc------HHHHHH-------HHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECC Q lcl|NC_013644. 79 VDQKTQYLLSN--PV-----EYETEN------EELKEY-------LAEYY-NSEFQVVLQELVEGSSQKGFEYVYARTNA 137 (510) Q Consensus 79 v~~~~~~l~g~--p~-----~~~~~d------~~~~~~-------l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~d~ 137 (510) +++.++.|++- |+ ++...+ ..+.+. +...+ ..||...+.++.++..++|.|.+++..+. T Consensus 65 ~~~Las~l~~~ltpp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~ 144 (556) T protein:vir:73 65 QRILSSGMMSGITSPARPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVMEDD 144 (556) T ss_pred HHHHHHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeeecC Confidence 77777777542 32 233322 222222 23333 36788889999999999999988887777 Q ss_pred CCceEEEEEcccceEEEEcCCCCceeEEEEEEEEEe--------eC---------CceeEEEEEEEEcCCcEEEEEEcCC Q lcl|NC_013644. 138 EDRLCFQVADSLNVFGVYNEYNELQRICRHYITEIE--------KD---------GETVDIHHAEVWTDQNVYFFVAEDN 200 (510) Q Consensus 138 ~g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~--------~~---------~~~~~~~~~e~y~~~~i~~~~~~~~ 200 (510) .+-+++..++..+++..-|..+++..++|.+..... +. .....-..+++++- .|..... T Consensus 145 ~~~~r~~~~~l~~~~~~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~~v~~~----V~pr~~~ 220 (556) T protein:vir:73 145 QDVIRTMPFPIGSYYLANSPRGSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHC----ITPNVNR 220 (556) T ss_pred CceEEEEEeecceeEEeeCCCCCeEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcCCccceEEEEEE----Eeccccc Confidence 777899999999999888888888888776554421 00 00000012222110 0000000 Q ss_pred ceeeccccccccccc-cccc-ccccccccccccCCcccEEEec-----CCCCCCCc-HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 201 KDYELDEAEPINPRP-HVLA-VDSENESLLQRSYGQIPFYRLS-----NNKQETTD-LKPIKALIDDYDLMNCFLSNNLQ 272 (510) Q Consensus 201 ~~~~~~~~~~~~~~~-~~~~-~~~~~~~~~~~~~g~iPvv~~~-----nn~~g~sd-~~~v~~liD~~n~~~S~~~~~~~ 272 (510) .....+... -++. .... ......-....+|..+|++.++ ++.+|+|. .....+-+..+|.+.-......+ T Consensus 221 ~~~~~~~~~--~p~~s~~~~~~~~~~~vl~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~~~~ 298 (556) T protein:vir:73 221 DSGKMDSKN--KPYRSVYFESGGDSDKLLRESGFDEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQVEQKRKAQLID 298 (556) T ss_pred cccccCccc--ceEEEEEEEecCCCceecccCCcccCCceeeeeeecCCcccccCccHHHhHHHHHHHHHHHHHHHHHHH Confidence 000000000 0000 0000 0000011123345566766654 35679995 88899999999999888899999 Q ss_pred HhccceeEEecCCCCchhhhhHhhhcCeeeec--cC-CCceeEEe-ecCCHHHHHHHHHHHHHHHHHHhCCcc----ccc Q lcl|NC_013644. 273 DFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGT--GS-DGGLDVKT-VTIPTEGRKTKMEIDKENIYKFGMAFD----STQ 344 (510) Q Consensus 273 ~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~--~~-~~~~~~~~-~~~~~~~~~~~~~~l~~~i~~~s~~p~----~~~ 344 (510) ...+|.+.+..-.... ..+...+++... .. ..+++.+. ...+.......++.++..|-. +...+ ... T Consensus 299 ~~~~pp~~v~~~~~~~----~~~~~pgg~~~~~~~~~~~~i~p~~~~~~d~~~~~~~i~~~~~rI~~-af~~d~~~~l~~ 373 (556) T protein:vir:73 299 KATNPPMVAPTSLKNQ----RVSLLPGDVTYLDVISGQDGFKPAYLVNPNTADLLADIQDTRQTINS-AYFVDLFMMLQN 373 (556) T ss_pred HHhcCceecccccccc----ceeeccCccccccCCCCccceeeeccccccHHHHHHHHHHHHHHHHH-Hhhcchhhhhcc Confidence 9999887764421111 112233332211 22 23455542 234556666667777766643 22222 222 Q ss_pred cccCcccHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccCC-----ccccceeeEEeCCCCCCCHH---- Q lcl|NC_013644. 345 VGDGNITNIVIKARYTLLNMKAN-KTEARLRALLEWMNKLVIDDINRRYTK-----AFDPTEVSFTFTREVMVNET---- 414 (510) Q Consensus 345 ~~~g~~Sg~Ai~~~~~~l~~k~~-~k~~~~~~~l~~~~~~i~~~~~~~~~~-----~~~~~~v~i~f~~~~p~d~~---- 414 (510) ......|++.+..+...+.+... -..+.-.+.|.-++.-++.++...+.- .....+++|.|..++-+... T Consensus 374 ~~~~r~TAtEv~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aqk~~~~ 453 (556) T protein:vir:73 374 INTRSMPVEAVIEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMARKNMLPEPPDVLQGMPLRIEYISVMAQAQKSIGL 453 (556) T ss_pred CCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeecHHHHHHHHHHH Confidence 23344566666554333333221 122222233333343444444443321 12233577788766643211 Q ss_pred ----HHHHHHHHHHhcCC-----CchHHHHHhC---CCC------CcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCC Q lcl|NC_013644. 415 ----DIVNDEKTEAETRK-----IILESILQVA---PRL------DDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSD 476 (510) Q Consensus 415 ----e~~~~~~~~~~~g~-----iS~et~~~~~---~~v------~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~ 476 (510) ..++.+..+.+.+. +....++..+ -++ +++|.+++.+++.+.+......++. ......... T Consensus 454 ~~i~~~~~~~~~laq~~Pe~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~r~~~qq~~~~~~~~-~~a~~~~~~ 532 (556) T protein:vir:73 454 TSLSQTVGFIGQLAQFKPEALDKLDVDQAIDAFSEMSGVSPTVIVPQEQVQGIREERAKQAQAAQAMAMG-QAAAQGAKT 532 (556) T ss_pred HHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCChhhcCCHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH Confidence 11111222212111 2223333321 112 2333333333322222211111111 111111011 Q ss_pred CCCCcccCCCCCCcccccccCcccccccc Q lcl|NC_013644. 477 NTDEEETAVNPDDPTQQMAEGATGSTESQ 505 (510) Q Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (510) ..+...++ .+.-+...++.| ..+| T Consensus 533 ~~~~~~~~----~~~l~~~~~~~g-~~~~ 556 (556) T protein:vir:73 533 LSETQTSD----PSALTAIANAAG-APQQ 556 (556) T ss_pred hhhccCCC----HHHHHHHHHhhc-CCCC Confidence 11111111 111111111222 1122 No 128 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=98.77 E-value=5.7e-08 Score=60.28 Aligned_cols=400 Identities=9% Similarity=0.037 Sum_probs=175.6 Q ss_pred hhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhc Q lcl|NC_013644. 9 VKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLS 88 (510) Q Consensus 9 ~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g 88 (510) +- ++..+..+.......+ ..+.....+.-.....+...+........+..-+..+-....|+..++-+.+ T Consensus 1 MG----~f~~lf~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~al~~~~v~~ci~~ia~~iA~ 70 (422) T protein:vir:13 1 MG----FLRGLFNKKNNNDEKR------SNYDEDIGIDISDSNFWEKFGIKLNFSVRGKRALKENTVYVCTKIRAESIGK 70 (422) T ss_pred Cc----hhhhhhhccCCccchh------hhhhhccccccCcchhhhhccccCCcccchhhhhccHHHHHHHHHHHHhhhh Confidence 11 1111111111111100 0000000000000000000000000000000001122233445555555666 Q ss_pred CCceeccCcHH-HHHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCC Q lcl|NC_013644. 89 NPVEYETENEE-LKEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNE 160 (510) Q Consensus 89 ~p~~~~~~d~~-~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~ 160 (510) -|+.+--..+. ....+..++. |. .......+....+.+|.||.++.++..|++ .+.+++|..+.++.|+.+. T Consensus 71 lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~~~~~~~~~ 150 (422) T protein:vir:13 71 LSLKIYKDKEEYKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGKIIGLYPINSDNVTKIIDDDNF 150 (422) T ss_pred CceEEEecCcccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCcc Confidence 67775222211 1112233332 22 235666778889999999999989888875 6888999999999876543 Q ss_pred ceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEe Q lcl|NC_013644. 161 LQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRL 240 (510) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~ 240 (510) ....-.+++.....++.. ..+.+..+.+++... + T Consensus 151 ~~~~~~~~y~~~~~~g~~------~~~~~~eiih~~~~~------------------------------------~---- 184 (422) T protein:vir:13 151 LSSLSKVWYVVTDKNGKE------HKLLPDEMLHFIGDI------------------------------------T---- 184 (422) T ss_pred eeccceEEEEEEeCCCeE------EEEcccceEEEcCCC------------------------------------C---- Confidence 332222332222221111 012222233322100 0 Q ss_pred cCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCch---hhhhHhhh--------cCeeeeccCCCc Q lcl|NC_013644. 241 SNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDL---SKLRQNVK--------SKKVVGTGSDGG 309 (510) Q Consensus 241 ~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~---~~~~~~~~--------~~~~~~~~~~~~ 309 (510) .+.-.|.|.+..+...|+....+..-..+.++..+.|-.+++-...-+. ..+...+. .++++.++++.+ T Consensus 185 ~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~ 264 (422) T protein:vir:13 185 LDGLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYVGDLDEKAKKIFKKEFESMSNGLENAHSISLLPFGYQ 264 (422) T ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCce Confidence 0112466777766666665555555555556666667766644222111 12222111 234566666666 Q ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_013644. 310 LDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDIN 389 (510) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~ 389 (510) ++.++.......+.+..+.....|+..-++|+.-.+...+.+...++.. ....+...|.-+++.|...+. T Consensus 265 ~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~e~~----------~~~f~~~~l~P~~~~ie~~l~ 334 (422) T protein:vir:13 265 FQPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERATFNNLTEQ----------QKDFYVTTLQSSLTVYEQEIQ 334 (422) T ss_pred eeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH----------HHHHHHHHHHHHHHHHHHHHH Confidence 6666554455556667777888899988899865443322222222211 112333344444444444333 Q ss_pred hccCCccc-cceeeEEe--CCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 390 RRYTKAFD-PTEVSFTF--TREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALE 466 (510) Q Consensus 390 ~~~~~~~~-~~~v~i~f--~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~ 466 (510) .+--...+ .....|.| ..-+..|..+.++.+.++.++|+++.-.++++++.-.-+.-.+ .....+ T Consensus 335 ~~Ll~~~~~~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~------------~~~~~n 402 (422) T protein:vir:13 335 DKLFSQYETLQDVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEANEARRRENLPPVEGGDR------------LLVNGN 402 (422) T ss_pred HhhCChhhhcCCceEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe------------eeeccC Confidence 22111111 11233444 4556678999999999999999999988888765321110000 000000 Q ss_pred hhhccCCCCCCCCCcccCCCCCCc Q lcl|NC_013644. 467 EAEYTKGLSDNTDEEETAVNPDDP 490 (510) Q Consensus 467 ~~~~~~~~~~~~~~~~~~~~~~~~ 490 (510) ..+- +..+.....++.++.. T Consensus 403 ~~~l----~~~~~~~~~~g~~~g~ 422 (422) T protein:vir:13 403 MIPI----EMAGEQYKKGGEKGGK 422 (422) T ss_pred ccch----hhcccccccCCCcCCC Confidence 0000 0000000000100000 No 129 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=98.74 E-value=7.3e-08 Score=59.70 Aligned_cols=464 Identities=9% Similarity=0.023 Sum_probs=203.2 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |... .. .+.+.+..+..+.. |......++++|+---+ .+..... ..+. ...+.+.++..+-+..-++ T Consensus 1 M~~~-----~~-~~~l~~r~~~l~~~-R~~~e~~w~e~~~~~lP--~~~~~~~-~~~~---~~~~~~~~~~dst~~~a~~ 67 (555) T protein:vir:10 1 MAEQ-----TE-RKLLLSRWGQLRTE-RESWMSHWKEISDYLLP--RAGRFFV-QDRN---RGEKRHNNILDNTGTRALR 67 (555) T ss_pred CCCc-----cc-HHHHHHHHHHHHHH-hhHHHHHHHHHHHHhCc--ccccccC-CCCC---cchhcccccccccHHHHHH Confidence 3332 22 23344444444322 11223333444322111 1111100 0000 0112234567777788888 Q ss_pred HHHhhhhcC--Cce-----eccCc------HHHHHHH-------HHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCCC Q lcl|NC_013644. 81 QKTQYLLSN--PVE-----YETEN------EELKEYL-------AEYY-NSEFQVVLQELVEGSSQKGFEYVYARTNAED 139 (510) Q Consensus 81 ~~~~~l~g~--p~~-----~~~~d------~~~~~~l-------~~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g 139 (510) +.++.|++- |+. +...+ ..+...| ...+ ..||.....++.++..++|.|.+++..|..+ T Consensus 68 ~LAa~L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~ 147 (555) T protein:vir:10 68 VLAAGMMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDA 147 (555) T ss_pred HHHHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCc Confidence 888777642 322 23322 1222322 2333 4678888999999999999998888777777 Q ss_pred ceEEEEEcccceEEEEcCCCCceeEEEEEEEEEee--------C---------CceeEEEEEEEEcCCcEEEEEEcCCce Q lcl|NC_013644. 140 RLCFQVADSLNVFGVYNEYNELQRICRHYITEIEK--------D---------GETVDIHHAEVWTDQNVYFFVAEDNKD 202 (510) Q Consensus 140 ~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~--------~---------~~~~~~~~~e~y~~~~i~~~~~~~~~~ 202 (510) .+++..++..+++..-|..+++..++|.+...... . .....-.++++++- .|....... T Consensus 148 ~~rf~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~----V~pr~~~~~ 223 (555) T protein:vir:10 148 VVYHHSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHA----IEPRADRDP 223 (555) T ss_pred eEEEEEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEE----EeeccCcCc Confidence 78999999999999888888888887754433210 0 00000012222211 000000000 Q ss_pred eecccc-cccccccccc--cccccccccccccCCcccEEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_013644. 203 YELDEA-EPINPRPHVL--AVDSENESLLQRSYGQIPFYRLS-----NNKQETTDLKPIKALIDDYDLMNCFLSNNLQDF 274 (510) Q Consensus 203 ~~~~~~-~~~~~~~~~~--~~~~~~~~~~~~~~g~iPvv~~~-----nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~ 274 (510) ...+.. .++ .+... ..++. .-....+|..+|++.++ .+.+|+|-.+...+-+..+|.+.-......+.. T Consensus 224 ~~~~~~~~p~--~s~~~~~~~d~~-~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~ 300 (555) T protein:vir:10 224 SKRDDRNMAW--KSVYFEPGADET-RTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYK 300 (555) T ss_pred CCCCccccce--EEEEEEeccCCc-cccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000 000 00000 00000 01123345567776654 346799999999999999999877788888888 Q ss_pred ccceeEEecCCCCchhhhhHhhhcCee--eeccCCCcee--EEeecCCHHHHHHHHHHHHHHHHHHhCCcc----ccccc Q lcl|NC_013644. 275 AEAIYVVSGFQGDDLSKLRQNVKSKKV--VGTGSDGGLD--VKTVTIPTEGRKTKMEIDKENIYKFGMAFD----STQVG 346 (510) Q Consensus 275 ~~~~lv~~g~~~~~~~~~~~~~~~~~~--~~~~~~~~~~--~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~----~~~~~ 346 (510) .+|.+.+..-..... .....+++ +..+..++.- .+....+.......++.++..|-.. ..-+ ..... T Consensus 301 ~~pp~~v~~~~~~~~----~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~a-f~~dlf~~l~~~~ 375 (555) T protein:vir:10 301 SNPPLQLPVSAKNQD----ISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKAS-FYADLFLMLANGT 375 (555) T ss_pred hcCceeecccccccc----ceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHH-hhcchhhhccCCC Confidence 887776533211111 12223322 2222222222 2233346677777788888877543 2222 22223 Q ss_pred cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHhhccCC-----ccccceeeEEeCCCCCCCH Q lcl|NC_013644. 347 DGNITNIVIKARYTLLNMKANKTEARLRALLEW--------MNKLVIDDINRRYTK-----AFDPTEVSFTFTREVMVNE 413 (510) Q Consensus 347 ~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~--------~~~~i~~~~~~~~~~-----~~~~~~v~i~f~~~~p~d~ 413 (510) ....|++.+..+ ..++...++..+.+ +++-.+.++...+.- .....+++|.|..++-+.. T Consensus 376 ~~~~TAtEV~~r-------~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq 448 (555) T protein:vir:10 376 NPQMTATEVAER-------HEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQ 448 (555) T ss_pred CCcccHHHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHH Confidence 344566666553 33444444444333 333344444443321 1223357777777665421 Q ss_pred HH--------HHHHHHHHHhcC-----CCchHHHHHh----CCC----C-CcHHHHHHHHHHHHHHHHHHHHHHHhhhc- Q lcl|NC_013644. 414 TD--------IVNDEKTEAETR-----KIILESILQV----APR----L-DDDNVLRLICEQFDLDWEDVKEALEEAEY- 470 (510) Q Consensus 414 ~e--------~~~~~~~~~~~g-----~iS~et~~~~----~~~----v-~d~e~~~~~~e~~e~~~~~~~~~~~~~~~- 470 (510) .. .++.+..+.+.+ .+....++.. ++. + +++|.+++++++++.++...+.++..+.. T Consensus 449 ~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~ 528 (555) T protein:vir:10 449 RAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGAD 528 (555) T ss_pred HHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 111122111111 0222222222 221 1 23444444444333332222222211111 Q ss_pred -cCCCCCCCCCcccCCCCCCcccccccCcc Q lcl|NC_013644. 471 -TKGLSDNTDEEETAVNPDDPTQQMAEGAT 499 (510) Q Consensus 471 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (510) ....++... ...+.=...-+-..|-+ T Consensus 529 ~~~~~~~~~~---~~~~~~~~~~~~~~~~~ 555 (555) T protein:vir:10 529 TAAKLGSVDT---SKQNALTDVTRAFSGYT 555 (555) T ss_pred HHHHhccccc---CcchhHHHHHhhhccCC Confidence 011111111 11111000001111111 No 130 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=98.74 E-value=7.3e-08 Score=59.70 Aligned_cols=464 Identities=9% Similarity=0.023 Sum_probs=203.2 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |... .. .+.+.+..+..+.. |......++++|+---+ .+..... ..+. ...+.+.++..+-+..-++ T Consensus 1 M~~~-----~~-~~~l~~r~~~l~~~-R~~~e~~w~e~~~~~lP--~~~~~~~-~~~~---~~~~~~~~~~dst~~~a~~ 67 (555) T protein:vir:98 1 MAEQ-----TE-RKLLLSRWGQLRTE-RESWMSHWKEISDYLLP--RAGRFFV-QDRN---RGEKRHNNILDNTGTRALR 67 (555) T ss_pred CCCc-----cc-HHHHHHHHHHHHHH-hhHHHHHHHHHHHHhCc--ccccccC-CCCC---cchhcccccccccHHHHHH Confidence 3332 22 23344444444322 11223333444322111 1111100 0000 0112234567777788888 Q ss_pred HHHhhhhcC--Cce-----eccCc------HHHHHHH-------HHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCCC Q lcl|NC_013644. 81 QKTQYLLSN--PVE-----YETEN------EELKEYL-------AEYY-NSEFQVVLQELVEGSSQKGFEYVYARTNAED 139 (510) Q Consensus 81 ~~~~~l~g~--p~~-----~~~~d------~~~~~~l-------~~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g 139 (510) +.++.|++- |+. +...+ ..+...| ...+ ..||.....++.++..++|.|.+++..|..+ T Consensus 68 ~LAa~L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~ 147 (555) T protein:vir:98 68 VLAAGMMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDA 147 (555) T ss_pred HHHHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCc Confidence 888777642 322 23322 1222322 2333 4678888999999999999998888777777 Q ss_pred ceEEEEEcccceEEEEcCCCCceeEEEEEEEEEee--------C---------CceeEEEEEEEEcCCcEEEEEEcCCce Q lcl|NC_013644. 140 RLCFQVADSLNVFGVYNEYNELQRICRHYITEIEK--------D---------GETVDIHHAEVWTDQNVYFFVAEDNKD 202 (510) Q Consensus 140 ~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~--------~---------~~~~~~~~~e~y~~~~i~~~~~~~~~~ 202 (510) .+++..++..+++..-|..+++..++|.+...... . .....-.++++++- .|....... T Consensus 148 ~~rf~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~----V~pr~~~~~ 223 (555) T protein:vir:98 148 VVYHHSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHA----IEPRADRDP 223 (555) T ss_pred eEEEEEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEE----EeeccCcCc Confidence 78999999999999888888888887754433210 0 00000012222211 000000000 Q ss_pred eecccc-cccccccccc--cccccccccccccCCcccEEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_013644. 203 YELDEA-EPINPRPHVL--AVDSENESLLQRSYGQIPFYRLS-----NNKQETTDLKPIKALIDDYDLMNCFLSNNLQDF 274 (510) Q Consensus 203 ~~~~~~-~~~~~~~~~~--~~~~~~~~~~~~~~g~iPvv~~~-----nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~ 274 (510) ...+.. .++ .+... ..++. .-....+|..+|++.++ .+.+|+|-.+...+-+..+|.+.-......+.. T Consensus 224 ~~~~~~~~p~--~s~~~~~~~d~~-~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~ 300 (555) T protein:vir:98 224 SKRDDRNMAW--KSVYFEPGADET-RTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYK 300 (555) T ss_pred CCCCccccce--EEEEEEeccCCc-cccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000 000 00000 00000 01123345567776654 346799999999999999999877788888888 Q ss_pred ccceeEEecCCCCchhhhhHhhhcCee--eeccCCCcee--EEeecCCHHHHHHHHHHHHHHHHHHhCCcc----ccccc Q lcl|NC_013644. 275 AEAIYVVSGFQGDDLSKLRQNVKSKKV--VGTGSDGGLD--VKTVTIPTEGRKTKMEIDKENIYKFGMAFD----STQVG 346 (510) Q Consensus 275 ~~~~lv~~g~~~~~~~~~~~~~~~~~~--~~~~~~~~~~--~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~----~~~~~ 346 (510) .+|.+.+..-..... .....+++ +..+..++.- .+....+.......++.++..|-.. ..-+ ..... T Consensus 301 ~~pp~~v~~~~~~~~----~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~a-f~~dlf~~l~~~~ 375 (555) T protein:vir:98 301 SNPPLQLPVSAKNQD----ISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKAS-FYADLFLMLANGT 375 (555) T ss_pred hcCceeecccccccc----ceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHH-hhcchhhhccCCC Confidence 887776533211111 12223322 2222222222 2233346677777788888877543 2222 22223 Q ss_pred cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHhhccCC-----ccccceeeEEeCCCCCCCH Q lcl|NC_013644. 347 DGNITNIVIKARYTLLNMKANKTEARLRALLEW--------MNKLVIDDINRRYTK-----AFDPTEVSFTFTREVMVNE 413 (510) Q Consensus 347 ~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~--------~~~~i~~~~~~~~~~-----~~~~~~v~i~f~~~~p~d~ 413 (510) ....|++.+..+ ..++...++..+.+ +++-.+.++...+.- .....+++|.|..++-+.. T Consensus 376 ~~~~TAtEV~~r-------~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq 448 (555) T protein:vir:98 376 NPQMTATEVAER-------HEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQ 448 (555) T ss_pred CCcccHHHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHH Confidence 344566666553 33444444444333 333344444443321 1223357777777665421 Q ss_pred HH--------HHHHHHHHHhcC-----CCchHHHHHh----CCC----C-CcHHHHHHHHHHHHHHHHHHHHHHHhhhc- Q lcl|NC_013644. 414 TD--------IVNDEKTEAETR-----KIILESILQV----APR----L-DDDNVLRLICEQFDLDWEDVKEALEEAEY- 470 (510) Q Consensus 414 ~e--------~~~~~~~~~~~g-----~iS~et~~~~----~~~----v-~d~e~~~~~~e~~e~~~~~~~~~~~~~~~- 470 (510) .. .++.+..+.+.+ .+....++.. ++. + +++|.+++++++++.++...+.++..+.. T Consensus 449 ~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~ 528 (555) T protein:vir:98 449 RAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGAD 528 (555) T ss_pred HHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 111122111111 0222222222 221 1 23444444444333332222222211111 Q ss_pred -cCCCCCCCCCcccCCCCCCcccccccCcc Q lcl|NC_013644. 471 -TKGLSDNTDEEETAVNPDDPTQQMAEGAT 499 (510) Q Consensus 471 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (510) ....++... ...+.=...-+-..|-+ T Consensus 529 ~~~~~~~~~~---~~~~~~~~~~~~~~~~~ 555 (555) T protein:vir:98 529 TAAKLGSVDT---SKQNALTDVTRAFSGYT 555 (555) T ss_pred HHHHhccccc---CcchhHHHHHhhhccCC Confidence 011111111 11111000001111111 No 131 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=98.74 E-value=7.3e-08 Score=59.70 Aligned_cols=464 Identities=9% Similarity=0.023 Sum_probs=203.2 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |... .. .+.+.+..+..+.. |......++++|+---+ .+..... ..+. ...+.+.++..+-+..-++ T Consensus 1 M~~~-----~~-~~~l~~r~~~l~~~-R~~~e~~w~e~~~~~lP--~~~~~~~-~~~~---~~~~~~~~~~dst~~~a~~ 67 (555) T protein:vir:10 1 MAEQ-----TE-RKLLLSRWGQLRTE-RESWMSHWKEISDYLLP--RAGRFFV-QDRN---RGEKRHNNILDNTGTRALR 67 (555) T ss_pred CCCc-----cc-HHHHHHHHHHHHHH-hhHHHHHHHHHHHHhCc--ccccccC-CCCC---cchhcccccccccHHHHHH Confidence 3332 22 23344444444322 11223333444322111 1111100 0000 0112234567777788888 Q ss_pred HHHhhhhcC--Cce-----eccCc------HHHHHHH-------HHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCCC Q lcl|NC_013644. 81 QKTQYLLSN--PVE-----YETEN------EELKEYL-------AEYY-NSEFQVVLQELVEGSSQKGFEYVYARTNAED 139 (510) Q Consensus 81 ~~~~~l~g~--p~~-----~~~~d------~~~~~~l-------~~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g 139 (510) +.++.|++- |+. +...+ ..+...| ...+ ..||.....++.++..++|.|.+++..|..+ T Consensus 68 ~LAa~L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~ 147 (555) T protein:vir:10 68 VLAAGMMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDA 147 (555) T ss_pred HHHHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCc Confidence 888777642 322 23322 1222322 2333 4678888999999999999998888777777 Q ss_pred ceEEEEEcccceEEEEcCCCCceeEEEEEEEEEee--------C---------CceeEEEEEEEEcCCcEEEEEEcCCce Q lcl|NC_013644. 140 RLCFQVADSLNVFGVYNEYNELQRICRHYITEIEK--------D---------GETVDIHHAEVWTDQNVYFFVAEDNKD 202 (510) Q Consensus 140 ~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~--------~---------~~~~~~~~~e~y~~~~i~~~~~~~~~~ 202 (510) .+++..++..+++..-|..+++..++|.+...... . .....-.++++++- .|....... T Consensus 148 ~~rf~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~----V~pr~~~~~ 223 (555) T protein:vir:10 148 VVYHHSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHA----IEPRADRDP 223 (555) T ss_pred eEEEEEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEE----EeeccCcCc Confidence 78999999999999888888888887754433210 0 00000012222211 000000000 Q ss_pred eecccc-cccccccccc--cccccccccccccCCcccEEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_013644. 203 YELDEA-EPINPRPHVL--AVDSENESLLQRSYGQIPFYRLS-----NNKQETTDLKPIKALIDDYDLMNCFLSNNLQDF 274 (510) Q Consensus 203 ~~~~~~-~~~~~~~~~~--~~~~~~~~~~~~~~g~iPvv~~~-----nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~ 274 (510) ...+.. .++ .+... ..++. .-....+|..+|++.++ .+.+|+|-.+...+-+..+|.+.-......+.. T Consensus 224 ~~~~~~~~p~--~s~~~~~~~d~~-~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~ 300 (555) T protein:vir:10 224 SKRDDRNMAW--KSVYFEPGADET-RTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYK 300 (555) T ss_pred CCCCccccce--EEEEEEeccCCc-cccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000 000 00000 00000 01123345567776654 346799999999999999999877788888888 Q ss_pred ccceeEEecCCCCchhhhhHhhhcCee--eeccCCCcee--EEeecCCHHHHHHHHHHHHHHHHHHhCCcc----ccccc Q lcl|NC_013644. 275 AEAIYVVSGFQGDDLSKLRQNVKSKKV--VGTGSDGGLD--VKTVTIPTEGRKTKMEIDKENIYKFGMAFD----STQVG 346 (510) Q Consensus 275 ~~~~lv~~g~~~~~~~~~~~~~~~~~~--~~~~~~~~~~--~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~----~~~~~ 346 (510) .+|.+.+..-..... .....+++ +..+..++.- .+....+.......++.++..|-.. ..-+ ..... T Consensus 301 ~~pp~~v~~~~~~~~----~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~a-f~~dlf~~l~~~~ 375 (555) T protein:vir:10 301 SNPPLQLPVSAKNQD----ISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKAS-FYADLFLMLANGT 375 (555) T ss_pred hcCceeecccccccc----ceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHH-hhcchhhhccCCC Confidence 887776533211111 12223322 2222222222 2233346677777788888877543 2222 22223 Q ss_pred cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHhhccCC-----ccccceeeEEeCCCCCCCH Q lcl|NC_013644. 347 DGNITNIVIKARYTLLNMKANKTEARLRALLEW--------MNKLVIDDINRRYTK-----AFDPTEVSFTFTREVMVNE 413 (510) Q Consensus 347 ~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~--------~~~~i~~~~~~~~~~-----~~~~~~v~i~f~~~~p~d~ 413 (510) ....|++.+..+ ..++...++..+.+ +++-.+.++...+.- .....+++|.|..++-+.. T Consensus 376 ~~~~TAtEV~~r-------~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq 448 (555) T protein:vir:10 376 NPQMTATEVAER-------HEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQ 448 (555) T ss_pred CCcccHHHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHH Confidence 344566666553 33444444444333 333344444443321 1223357777777665421 Q ss_pred HH--------HHHHHHHHHhcC-----CCchHHHHHh----CCC----C-CcHHHHHHHHHHHHHHHHHHHHHHHhhhc- Q lcl|NC_013644. 414 TD--------IVNDEKTEAETR-----KIILESILQV----APR----L-DDDNVLRLICEQFDLDWEDVKEALEEAEY- 470 (510) Q Consensus 414 ~e--------~~~~~~~~~~~g-----~iS~et~~~~----~~~----v-~d~e~~~~~~e~~e~~~~~~~~~~~~~~~- 470 (510) .. .++.+..+.+.+ .+....++.. ++. + +++|.+++++++++.++...+.++..+.. T Consensus 449 ~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~ 528 (555) T protein:vir:10 449 RAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGAD 528 (555) T ss_pred HHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 111122111111 0222222222 221 1 23444444444333332222222211111 Q ss_pred -cCCCCCCCCCcccCCCCCCcccccccCcc Q lcl|NC_013644. 471 -TKGLSDNTDEEETAVNPDDPTQQMAEGAT 499 (510) Q Consensus 471 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (510) ....++... ...+.=...-+-..|-+ T Consensus 529 ~~~~~~~~~~---~~~~~~~~~~~~~~~~~ 555 (555) T protein:vir:10 529 TAAKLGSVDT---SKQNALTDVTRAFSGYT 555 (555) T ss_pred HHHHhccccc---CcchhHHHHHhhhccCC Confidence 011111111 11111000001111111 No 132 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=98.70 E-value=1e-07 Score=58.95 Aligned_cols=327 Identities=9% Similarity=0.036 Sum_probs=146.5 Q ss_pred hhcCCceeccCcHHHHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCC Q lcl|NC_013644. 86 LLSNPVEYETENEELKEYLAEYYN---S---EFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEY 158 (510) Q Consensus 86 l~g~p~~~~~~d~~~~~~l~~~~~---n---~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~ 158 (510) +..-|+.+-..++....-+..++. | ........++...+.+|.||+++.++..|++ .+.+++|..+-++.++. T Consensus 1 ia~lp~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~l~~~~v~~~~~~~ 80 (348) T protein:vir:93 1 MASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ 80 (348) T ss_pred CcccceEeEecCcCcccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCCceEEEEeCC Confidence 333355542222222223333331 2 2234455667788999999999989888886 57778888887766543 Q ss_pred CCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEE Q lcl|NC_013644. 159 NELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFY 238 (510) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv 238 (510) +.. + .|.+.. .++. . ..|.+..+.+++.-. | T Consensus 81 ~~~---~-~y~~~~-~~g~-----~-~~~~~~eiih~r~~~------------------------------------~-- 111 (348) T protein:vir:93 81 SRE---L-YYSIHA-ATGN-----K-LIVHNMDMLHFKHIV------------------------------------A-- 111 (348) T ss_pred CcE---E-EEEEEc-CCCe-----E-EEEccccEEEecCCC------------------------------------C-- Confidence 221 0 111111 1110 0 112333333332100 0 Q ss_pred EecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccc-eeEE-ecCCCCch--hhhhHh----hh-cCeeeeccCCCc Q lcl|NC_013644. 239 RLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEA-IYVV-SGFQGDDL--SKLRQN----VK-SKKVVGTGSDGG 309 (510) Q Consensus 239 ~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~-~lv~-~g~~~~~~--~~~~~~----~~-~~~~~~~~~~~~ 309 (510) .+.-.|.|-++-+...++..+.+ ....+..+..+ -+++ .+...++. ..+... .. .++++.++++.+ T Consensus 112 --~~~~~G~s~~~~~~~~i~~~~~~---~~~~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~n~~~~~vl~~g~~ 186 (348) T protein:vir:93 112 --SNMVQGISPIDVLKNTTDFDNAV---RTFNLTEMQKPDSFMLKYGSNVSTEKRQQVLEDFKQYYEENGGILFQEPGVE 186 (348) T ss_pred --CCceeeccHHHHHHHHHHHHHHH---HHHHHHhcCCCceeEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCCce Confidence 00112556555554444433322 11223333443 2222 22222221 111111 11 234555666656 Q ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_013644. 310 LDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDIN 389 (510) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~ 389 (510) ++.+..+.....+.+..+...+.|+..-++|+.-.+..++.+...++.... ..+...|.-+++.|...+. T Consensus 187 ~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~e~~~~----------~~~~~~l~P~~~~ie~~l~ 256 (348) T protein:vir:93 187 IEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNR----------FYLQHTLLPIVKQYEEEFN 256 (348) T ss_pred EEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH----------HHHHHHHHHHHHHHHHHHH Confidence 555554444445666777788889998888875443322222222222111 2233334444444444333 Q ss_pred hccCCccc---cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCC--CcHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 390 RRYTKAFD---PTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRL--DDDNVLRLICEQFDLDWEDVKEA 464 (510) Q Consensus 390 ~~~~~~~~---~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v--~d~e~~~~~~e~~e~~~~~~~~~ 464 (510) .+--...+ ...+++.+..-+..|..+.++.+.+++.+|+++.-.+++.++.- ++-++. ... T Consensus 257 ~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~--------------~~~ 322 (348) T protein:vir:93 257 RKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKP--------------LIS 322 (348) T ss_pred HhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcCeE--------------eec Confidence 22111111 12244445566677899999999999999999998888877531 110000 000 Q ss_pred HHhhhccCCCCCCCCCcccCCCCCCcccccc Q lcl|NC_013644. 465 LEEAEYTKGLSDNTDEEETAVNPDDPTQQMA 495 (510) Q Consensus 465 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (510) .+..+. +...+.+.. ..+|++...++ T Consensus 323 ~n~~~~----~~~~~~~~~-~~gg~~n~~~~ 348 (348) T protein:vir:93 323 GDLYPI----DTPLELRKS-LKGGDKNVNES 348 (348) T ss_pred cccccc----ccchhhccc-ccCCCCCcCCC Confidence 111110 000111110 01111111011 No 133 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=98.70 E-value=1e-07 Score=58.95 Aligned_cols=437 Identities=11% Similarity=0.018 Sum_probs=165.5 Q ss_pred CCCc----cCCChhh----hHHHHHHHHHhhhhhhhH---HHHHHHHHHhccC-----------Ccchhcccc--eeccc Q lcl|NC_013644. 1 MEAL----LSEDVKI----IANALKAAIDKDRKSSSK---REAETGIRYYNHE-----------NDIMNNRIF--YVDDE 56 (510) Q Consensus 1 ~~~~----~~~~~~~----~~~~i~~~i~~~~~~~~~---~~~~~~~~YY~g~-----------~~i~~~~~~--~~~~~ 56 (510) ++.+ +....+- ....|+.-++..-.-..+ +-+..++..--.. +.++++... ++... T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~kk~~i~~pfkkk 83 (945) T protein:vir:10 4 LENIIKGFIVNANEQKRPSFSSNIKANVDSLSRGKDYPGFKPLLTYRALAWNSTVVYSIIIFRKNQVLKKEKIIVPYNHQ 83 (945) T ss_pred hhhHhhhheeccccccCccccccchhchhhhhcccCCCCcchhhhhhhhhccceeeeeeeeehhhhHHHhhccccccccc Confidence 2221 1111110 111222222221111111 1111111110000 011111000 00000 Q ss_pred ------------cc---cccccccc----------cceeccchhHHHHHHHHhhhhcCCcee--ccCcHH---------H Q lcl|NC_013644. 57 ------------GI---LREDKYAS----------NVRIPHGFFPEIVDQKTQYLLSNPVEY--ETENEE---------L 100 (510) Q Consensus 57 ------------~~---~~~~~~~~----------~~ki~~n~~~~Iv~~~~~~l~g~p~~~--~~~d~~---------~ 100 (510) +. .......| +......-....|+..++-+.+-|+++ ..++.. . T Consensus 84 ~~~~~~d~f~~s~es~s~vtsls~pdaf~~vnVs~~~AlknsaV~scI~~IA~sIAsLPlklYrr~edG~~~~~~kk~~~ 163 (945) T protein:vir:10 84 EPPFKFNLFEYSPESLMYLPSISDPDAFFLINLFRKYRFNNDSKLIKVSEIPKKLTSKELEIYKHIEDKHVNYYLKRIRD 163 (945) T ss_pred ccchhhhhhhccCccceecccccCccceeeehhhhhhhhccHHHHHHHHHHHhhhccCceEEEEecccCccccccccccc Confidence 00 00000000 001111223335555666666677764 111110 1 Q ss_pred HHHHHHHhc--cC------HH-HHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCceeEEEEEEE Q lcl|NC_013644. 101 KEYLAEYYN--SE------FQ-VVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNELQRICRHYIT 170 (510) Q Consensus 101 ~~~l~~~~~--n~------~~-~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~ 170 (510) ...+..++. |. +. .....+..+.+.+|.+|+.+.++.+|++ .+.+++|..+.+..++.+... ++++ T Consensus 164 ~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd~~G~ii~L~pLdPs~Vti~~ddDG~~~----y~Yv 239 (945) T protein:vir:10 164 ARNILEFLERPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRDEQGNLVAITPVDGTTIKPILSEDTGIV----VGYV 239 (945) T ss_pred chHHHHHHhCCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCCcEE----EEEE Confidence 112333432 21 11 2344567889999999999999989986 588899999988876544321 1111 Q ss_pred EEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCCcH Q lcl|NC_013644. 171 EIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTDL 250 (510) Q Consensus 171 ~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~ 250 (510) ... ++.. ...+.+..+.++.... +.-|.. ...|.|.+ T Consensus 240 ~~i-dG~~-----~~~v~a~DvIlhirn~------------------------------s~DG~~-------~GyGlSPI 276 (945) T protein:vir:10 240 QEV-DGAI-----VAHFDKRDVVLFRQNL------------------------------TPDVYM-------YGYSLPPI 276 (945) T ss_pred Eec-CCce-----EEEecCCceEEEeccC------------------------------CCCccc-------ccCCchHH Confidence 111 1110 0112222211111100 000000 01233444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH----hccce--eEEecCCC-----------CchhhhhHhhh-------cCeeeeccC Q lcl|NC_013644. 251 KPIKALIDDYDLMNCFLSNNLQD----FAEAI--YVVSGFQG-----------DDLSKLRQNVK-------SKKVVGTGS 306 (510) Q Consensus 251 ~~v~~liD~~n~~~S~~~~~~~~----~~~~~--lv~~g~~~-----------~~~~~~~~~~~-------~~~~~~~~~ 306 (510) + .+.+++...++-.....++ .+.|- +.+.+... ++...++..+. .++.+.+++ T Consensus 277 e---aa~~aI~~alAaek~aar~FskNGa~PsGILsvkg~~~~d~k~~~~LseEq~erlKe~wee~~sG~NnG~piVLde 353 (945) T protein:vir:10 277 E---ILYKVILSDIFIDKGNLDYYRKGGSIPEGILAIEPPSYKEGDIYPQLSREQLESIQRQLQAIMMGDYTQVPILSGG 353 (945) T ss_pred H---HHHHHHHHHHHHHHHHHHHHHhCCCccceEEEecCccccccccccccCHHHHHHHHHHHHHHhCCcccccceecCC Confidence 4 4444444433332222222 23453 33333211 11111222111 122344555 Q ss_pred CCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 307 DGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVID 386 (510) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~ 386 (510) |.+++.++.+.....+.+..+...+.|...-++|+.-.+.....++..++.... ..+..+|.-++..|.. T Consensus 354 Gmef~pLs~s~~DaQfLEsrkfs~eeIArAFGVPP~lLG~~e~st~SNiEqq~~----------~Fv~~tL~Pil~~IEq 423 (945) T protein:vir:10 354 KFTWIDFKGKRRDMQFKELAEFVARKICAVYQVSPQDVGILEGSNKATAEVMAS----------LTKAKGLEPLMATISK 423 (945) T ss_pred CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCcchHHHHHH----------HHHHHHHHHHHHHHHH Confidence 555555544444555667778888889998889875443322222222222211 2223333333333333 Q ss_pred HHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 387 DINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEA 464 (510) Q Consensus 387 ~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~ 464 (510) .++.+-........+.+.|+.....+..+.++.+.++.++|+++.-.++++++. ++.-+... . ..... T Consensus 424 eLNrkLl~~~eg~~i~fdFd~ldl~D~ksraEal~kli~sGiLTiNEvRe~lGLpPIeGGD~ll--i--------~~nn~ 493 (945) T protein:vir:10 424 GFDEVVSEFRNEKDIKLWFKEDDLEKERDWWNIIQGQLNTGFRSINEARMEKGLEPVPWGDVPF--S--------GLRNW 493 (945) T ss_pred HHHHhccccccCceeEEEecchhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceee--e--------ccccc Confidence 333221111223457888987777788999999999999999999888887643 21101000 0 00000 Q ss_pred HHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 465 LEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 465 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) ...+.......+..+ +.......++..+.+++. ++....|.+. T Consensus 494 ~P~d~~~ka~~ga~p-~q~aq~~~dqp~~kGGe~--dEns~~psE~ 536 (945) T protein:vir:10 494 KPEDEQAKAQQGAMP-PQLAQAMADQPSQQGGGV--DENSSVPSEQ 536 (945) T ss_pred cccccccccccCCCC-cccccCCCCCCCCCCCCC--CCCCCCCCcc Confidence 000000000000000 000000000000011000 0111122222 No 134 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=98.67 E-value=1.2e-07 Score=58.44 Aligned_cols=455 Identities=12% Similarity=0.071 Sum_probs=204.4 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhh--HHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSS--KREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~--~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) |-.-+.+ .+-++.+++..+..+..+. -.++..+.+|..-. .+. ..+.. ......++..+-+... T Consensus 1 m~~~~~~--~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~--~~~-------~~~~~---~~~~~~~~~dst~~~a 66 (535) T protein:vir:33 1 MADSKRT--GLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPS--LFP-------KESDN---ESTDYTTPWQAVGARG 66 (535) T ss_pred CChhhhh--ccChhHHHHHHHHHHHHhhHHHHHHHHHHHHhccc--ccC-------CCCCc---ccccccccccccHHHH Confidence 4333311 1233445555555543221 12344444443221 000 00000 0111123445566667 Q ss_pred HHHHHhhhhcC--Cce----eccCcH-------------HHHHH-------HHHHh-ccCHHHHHHHHHHHHHhcCeEEE Q lcl|NC_013644. 79 VDQKTQYLLSN--PVE----YETENE-------------ELKEY-------LAEYY-NSEFQVVLQELVEGSSQKGFEYV 131 (510) Q Consensus 79 v~~~~~~l~g~--p~~----~~~~d~-------------~~~~~-------l~~~~-~n~~~~~~~e~~~~~~~~G~~~~ 131 (510) +++.++.|++- |.+ +...+. ++... +...+ .+||...+.++.++..++|.|.+ T Consensus 67 ~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l 146 (535) T protein:vir:33 67 LNNLASKLMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALL 146 (535) T ss_pred HHHHHHHHHHhhcCCCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeE Confidence 77777666541 322 222221 12222 22223 47888999999999999999988 Q ss_pred EEEECCCCceEEEEEcccceEEEEcCCCCceeEEEEEEEEEe------------eCCceeEEEEEEEEcCCcEEEEEEcC Q lcl|NC_013644. 132 YARTNAEDRLCFQVADSLNVFGVYNEYNELQRICRHYITEIE------------KDGETVDIHHAEVWTDQNVYFFVAED 199 (510) Q Consensus 132 ~v~~d~~g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~e~y~~~~i~~~~~~~ 199 (510) ++-.+..+.++++.++-.+++..-|..+++..++|.+..... +...+..-..+++|+. .+..... T Consensus 147 ~~~~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~---v~~~~~~ 223 (535) T protein:vir:33 147 YLPEPEGSYNPMKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYTH---VYLDEES 223 (535) T ss_pred EeecCCCCceeeEEEEcCeeEEeeCCCCCeeEEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEEE---EEeeCCC Confidence 876666667888888888888877888888888776555421 0000111111222211 0011111 Q ss_pred Cceeeccccccccccccccccccccc--ccccccCCcccEEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 200 NKDYELDEAEPINPRPHVLAVDSENE--SLLQRSYGQIPFYRLS-----NNKQETTDLKPIKALIDDYDLMNCFLSNNLQ 272 (510) Q Consensus 200 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~g~iPvv~~~-----nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~ 272 (510) +.+.. ....++... .....+|..+|++.++ ++.+|+|-.++..+-+..+|.+.-......+ T Consensus 224 ~~~~~------------~~~~~~~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~ 291 (535) T protein:vir:33 224 GDYLK------------YEEVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSM 291 (535) T ss_pred CcEEE------------EEEEeCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111 111111111 1112346677877654 3468999999999999999999999999999 Q ss_pred HhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCcc-ccccccCc Q lcl|NC_013644. 273 DFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFD-STQVGDGN 349 (510) Q Consensus 273 ~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~-~~~~~~g~ 349 (510) ...+|.+.+.-....+..++.. ...+.+..+..+++..+... .+.......++.++..|-..- ..+ ...-.... T Consensus 292 ~~~~p~~lv~~~g~~~~~~~~~--~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af-~~~~~~~~~~~r 368 (535) T protein:vir:33 292 ISAKVIGLVNPAGITQPRRLTK--AQTGDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAF-MLNSAVQRTGER 368 (535) T ss_pred HHhcCceeeccccccchhhccc--CCceeeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCcc Confidence 9999987653222222222221 12234445556667766533 467777788888877775532 111 11122333 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHhhccCC-ccccceeeEEeCCCCCCCHH-HHHHH Q lcl|NC_013644. 350 ITNIVIKARYTLLNMKANKTEARLRALLEW--------MNKLVIDDINRRYTK-AFDPTEVSFTFTREVMVNET-DIVND 419 (510) Q Consensus 350 ~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~--------~~~~i~~~~~~~~~~-~~~~~~v~i~f~~~~p~d~~-e~~~~ 419 (510) .|++.+.. ++.++...++..+.+ +++.++.++...+.- ......++++|..++..-.. ..++. T Consensus 369 ~TAtEV~~-------r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~ 441 (535) T protein:vir:33 369 VTAEEIRY-------VASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDK 441 (535) T ss_pred ccHHHHHH-------HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHH Confidence 46655554 344555555555444 344444444443322 22334577888766654211 11111 Q ss_pred ----HHHHHhcC------CCchHHHHHhC---CCC------CcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCC Q lcl|NC_013644. 420 ----EKTEAETR------KIILESILQVA---PRL------DDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDE 480 (510) Q Consensus 420 ----~~~~~~~g------~iS~et~~~~~---~~v------~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (510) +..+.+.+ .+....++..+ -++ ..+|+....++++.++.. ..+.+.... .. ... . T Consensus 442 l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~Gvp~~~i~~~~ee~~~~~~q~~~~~~-~~~~~~~~g--~~--~~~-~ 515 (535) T protein:vir:33 442 LERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTG-VENAAAAGG--AG--VGA-L 515 (535) T ss_pred HHHHHHHHHhhChhhhhccCCHHHHHHHHHHHcCCCHhHhcCCHHHHHHHHHHHHHHHH-HHHHHHhhh--hh--hcc-h Confidence 11111111 01112222211 112 122211111111111111 111111100 00 000 0 Q ss_pred cccCCCCCCcccccccCccccccc Q lcl|NC_013644. 481 EETAVNPDDPTQQMAEGATGSTES 504 (510) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~ 504 (510) ...+ + +..+...+.-|=+.. T Consensus 516 ~~~~--~--~~~~~~~~~~g~~~~ 535 (535) T protein:vir:33 516 ATSS--P--EAMQGAAAKAGLNAT 535 (535) T ss_pred hhcC--C--hhHHHHHHhccCCCC Confidence 0100 0 001111111111111 No 135 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=98.63 E-value=1.7e-07 Score=57.67 Aligned_cols=400 Identities=10% Similarity=0.037 Sum_probs=173.5 Q ss_pred hhhhHHHHHHHHHhhhhhhhHHH-----HHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHH Q lcl|NC_013644. 9 VKIIANALKAAIDKDRKSSSKRE-----AETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKT 83 (510) Q Consensus 9 ~~~~~~~i~~~i~~~~~~~~~~~-----~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~ 83 (510) +-+. +.|.++..-++....... ...+..+. |-.. .+... ... .-+.++-....|+..+ T Consensus 1 M~~~-~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~-g~~~-----------~~~~v--~~~--~al~~~~v~~~i~~ia 63 (432) T protein:vir:10 1 MKIV-DSVKKFFNFEKRQTSQVIELNKDDEKLLEWL-GISP-----------STISV--KGK--NALKVATVFACIKILS 63 (432) T ss_pred CChH-HHHHHhcCccccCcccccccCCchHHHHHHh-CCCc-----------Ccccc--chh--hhhccHHHHHHHHHHH Confidence 1111 111111110000000000 00000010 0000 00000 000 0011222233455555 Q ss_pred hhhhcCCceec--cCc---HHHHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccce Q lcl|NC_013644. 84 QYLLSNPVEYE--TEN---EELKEYLAEYYN---S---EFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNV 151 (510) Q Consensus 84 ~~l~g~p~~~~--~~d---~~~~~~l~~~~~---n---~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~ 151 (510) +-+.+-|+.+- .++ +.....+..+++ | ........+....+.+|.+|+++..+..|++ .+.+++|..+ T Consensus 64 ~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v 143 (432) T protein:vir:10 64 ESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKV 143 (432) T ss_pred HhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcee Confidence 55556677641 111 111122333332 2 2345566778888999999999999988886 6788999999 Q ss_pred EEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccccccccccccccccc Q lcl|NC_013644. 152 FGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRS 231 (510) Q Consensus 152 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (510) -+..|+...+..-...|+.... ++.. ..+.+..+.+++... T Consensus 144 ~v~~d~~~~~~~~~~~~y~~~~-~g~~------~~~~~~eiih~r~~~-------------------------------- 184 (432) T protein:vir:10 144 TVYIDDVGLLNSKTKMWYVVNT-GGQQ------RVLKPEEILHFKNGI-------------------------------- 184 (432) T ss_pred EEEEcCcccccccceEEEEEec-CCeE------EEEccccEEEecCCC-------------------------------- Confidence 8887754433222222222111 1110 112233333332100 Q ss_pred CCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh---hhhHhhh--------cCe Q lcl|NC_013644. 232 YGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS---KLRQNVK--------SKK 300 (510) Q Consensus 232 ~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~---~~~~~~~--------~~~ 300 (510) | .+.-.|.|.+..+...++....+..-..+.+...+.|-.+++....-+++ .+...+. .++ T Consensus 185 ----~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~ 256 (432) T protein:vir:10 185 ----T----LDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHR 256 (432) T ss_pred ----C----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCc Confidence 0 01123667777666666665555555555566666677666542221111 1221111 134 Q ss_pred eeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 301 VVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWM 380 (510) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~ 380 (510) ++.++++.+++.+..+.....+.+..+...+.|+..-++|+.-.+..+..+...++. .....+...|+-+ T Consensus 257 ~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~----------~~~~~~~~~l~P~ 326 (432) T protein:vir:10 257 IALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQ----------QQQQFYTDTLQAT 326 (432) T ss_pred ceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHH----------HHHHHHHHHHHHH Confidence 556666666655554444445566677788899998889886443222222111111 1112334444444 Q ss_pred HHHHHHHHhhccC--Ccc-ccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHH Q lcl|NC_013644. 381 NKLVIDDINRRYT--KAF-DPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLD 457 (510) Q Consensus 381 ~~~i~~~~~~~~~--~~~-~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~ 457 (510) ++.|...+..+-- ... ....+++.++.-+..|..+.++.+.++..+|+++.-.++++++.-..+...+ T Consensus 327 ~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD~--------- 397 (432) T protein:vir:10 327 LTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDR--------- 397 (432) T ss_pred HHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCe--------- Confidence 4444444432211 111 1113445555677889999999999999999999988888775421110000 Q ss_pred HHHHHHHHHhhhccCCCCCCCCCc-ccCCCCCCcccccccCcccc Q lcl|NC_013644. 458 WEDVKEALEEAEYTKGLSDNTDEE-ETAVNPDDPTQQMAEGATGS 501 (510) Q Consensus 458 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 501 (510) .....+..+- +..+... +.+...+ +....|.+|+ T Consensus 398 ---~~~~~n~~~~----~~~~~~~~k~~~~~~---~~~~~~~~~~ 432 (432) T protein:vir:10 398 ---LLVNGNMLPI----DMAGQAYLKGGDTNG---EVSKEGNEGN 432 (432) T ss_pred ---Eeecccccch----hhccccccCCCCCCC---CCCCCCCCCC Confidence 0000000000 0000000 0000000 1111111122 No 136 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=98.63 E-value=1.7e-07 Score=57.67 Aligned_cols=400 Identities=10% Similarity=0.037 Sum_probs=173.5 Q ss_pred hhhhHHHHHHHHHhhhhhhhHHH-----HHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHH Q lcl|NC_013644. 9 VKIIANALKAAIDKDRKSSSKRE-----AETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKT 83 (510) Q Consensus 9 ~~~~~~~i~~~i~~~~~~~~~~~-----~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~ 83 (510) +-+. +.|.++..-++....... ...+..+. |-.. .+... ... .-+.++-....|+..+ T Consensus 1 M~~~-~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~-g~~~-----------~~~~v--~~~--~al~~~~v~~~i~~ia 63 (432) T protein:vir:10 1 MKIV-DSVKKFFNFEKRQTSQVIELNKDDEKLLEWL-GISP-----------STISV--KGK--NALKVATVFACIKILS 63 (432) T ss_pred CChH-HHHHHhcCccccCcccccccCCchHHHHHHh-CCCc-----------Ccccc--chh--hhhccHHHHHHHHHHH Confidence 1111 111111110000000000 00000010 0000 00000 000 0011222233455555 Q ss_pred hhhhcCCceec--cCc---HHHHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccce Q lcl|NC_013644. 84 QYLLSNPVEYE--TEN---EELKEYLAEYYN---S---EFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNV 151 (510) Q Consensus 84 ~~l~g~p~~~~--~~d---~~~~~~l~~~~~---n---~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~ 151 (510) +-+.+-|+.+- .++ +.....+..+++ | ........+....+.+|.+|+++..+..|++ .+.+++|..+ T Consensus 64 ~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v 143 (432) T protein:vir:10 64 ESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKV 143 (432) T ss_pred HhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcee Confidence 55556677641 111 111122333332 2 2345566778888999999999999988886 6788999999 Q ss_pred EEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccccccccccccccccc Q lcl|NC_013644. 152 FGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRS 231 (510) Q Consensus 152 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (510) -+..|+...+..-...|+.... ++.. ..+.+..+.+++... T Consensus 144 ~v~~d~~~~~~~~~~~~y~~~~-~g~~------~~~~~~eiih~r~~~-------------------------------- 184 (432) T protein:vir:10 144 TVYIDDVGLLNSKTKMWYVVNT-GGQQ------RVLKPEEILHFKNGI-------------------------------- 184 (432) T ss_pred EEEEcCcccccccceEEEEEec-CCeE------EEEccccEEEecCCC-------------------------------- Confidence 8887754433222222222111 1110 112233333332100 Q ss_pred CCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh---hhhHhhh--------cCe Q lcl|NC_013644. 232 YGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS---KLRQNVK--------SKK 300 (510) Q Consensus 232 ~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~---~~~~~~~--------~~~ 300 (510) | .+.-.|.|.+..+...++....+..-..+.+...+.|-.+++....-+++ .+...+. .++ T Consensus 185 ----~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~ 256 (432) T protein:vir:10 185 ----T----LDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHR 256 (432) T ss_pred ----C----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCc Confidence 0 01123667777666666665555555555566666677666542221111 1221111 134 Q ss_pred eeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 301 VVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWM 380 (510) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~ 380 (510) ++.++++.+++.+..+.....+.+..+...+.|+..-++|+.-.+..+..+...++. .....+...|+-+ T Consensus 257 ~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~----------~~~~~~~~~l~P~ 326 (432) T protein:vir:10 257 IALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQ----------QQQQFYTDTLQAT 326 (432) T ss_pred ceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHH----------HHHHHHHHHHHHH Confidence 556666666655554444445566677788899998889886443222222111111 1112334444444 Q ss_pred HHHHHHHHhhccC--Ccc-ccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHH Q lcl|NC_013644. 381 NKLVIDDINRRYT--KAF-DPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLD 457 (510) Q Consensus 381 ~~~i~~~~~~~~~--~~~-~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~ 457 (510) ++.|...+..+-- ... ....+++.++.-+..|..+.++.+.++..+|+++.-.++++++.-..+...+ T Consensus 327 ~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD~--------- 397 (432) T protein:vir:10 327 LTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDR--------- 397 (432) T ss_pred HHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCe--------- Confidence 4444444432211 111 1113445555677889999999999999999999988888775421110000 Q ss_pred HHHHHHHHHhhhccCCCCCCCCCc-ccCCCCCCcccccccCcccc Q lcl|NC_013644. 458 WEDVKEALEEAEYTKGLSDNTDEE-ETAVNPDDPTQQMAEGATGS 501 (510) Q Consensus 458 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 501 (510) .....+..+- +..+... +.+...+ +....|.+|+ T Consensus 398 ---~~~~~n~~~~----~~~~~~~~k~~~~~~---~~~~~~~~~~ 432 (432) T protein:vir:10 398 ---LLVNGNMLPI----DMAGQAYLKGGDTNG---EVSKEGNEGN 432 (432) T ss_pred ---Eeecccccch----hhccccccCCCCCCC---CCCCCCCCCC Confidence 0000000000 0000000 0000000 1111111122 No 137 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=98.63 E-value=1.7e-07 Score=57.67 Aligned_cols=400 Identities=10% Similarity=0.037 Sum_probs=173.5 Q ss_pred hhhhHHHHHHHHHhhhhhhhHHH-----HHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHH Q lcl|NC_013644. 9 VKIIANALKAAIDKDRKSSSKRE-----AETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKT 83 (510) Q Consensus 9 ~~~~~~~i~~~i~~~~~~~~~~~-----~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~ 83 (510) +-+. +.|.++..-++....... ...+..+. |-.. .+... ... .-+.++-....|+..+ T Consensus 1 M~~~-~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~-g~~~-----------~~~~v--~~~--~al~~~~v~~~i~~ia 63 (432) T protein:vir:10 1 MKIV-DSVKKFFNFEKRQTSQVIELNKDDEKLLEWL-GISP-----------STISV--KGK--NALKVATVFACIKILS 63 (432) T ss_pred CChH-HHHHHhcCccccCcccccccCCchHHHHHHh-CCCc-----------Ccccc--chh--hhhccHHHHHHHHHHH Confidence 1111 111111110000000000 00000010 0000 00000 000 0011222233455555 Q ss_pred hhhhcCCceec--cCc---HHHHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccce Q lcl|NC_013644. 84 QYLLSNPVEYE--TEN---EELKEYLAEYYN---S---EFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNV 151 (510) Q Consensus 84 ~~l~g~p~~~~--~~d---~~~~~~l~~~~~---n---~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~ 151 (510) +-+.+-|+.+- .++ +.....+..+++ | ........+....+.+|.+|+++..+..|++ .+.+++|..+ T Consensus 64 ~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v 143 (432) T protein:vir:10 64 ESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKV 143 (432) T ss_pred HhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcee Confidence 55556677641 111 111122333332 2 2345566778888999999999999988886 6788999999 Q ss_pred EEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccccccccccccccccc Q lcl|NC_013644. 152 FGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRS 231 (510) Q Consensus 152 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (510) -+..|+...+..-...|+.... ++.. ..+.+..+.+++... T Consensus 144 ~v~~d~~~~~~~~~~~~y~~~~-~g~~------~~~~~~eiih~r~~~-------------------------------- 184 (432) T protein:vir:10 144 TVYIDDVGLLNSKTKMWYVVNT-GGQQ------RVLKPEEILHFKNGI-------------------------------- 184 (432) T ss_pred EEEEcCcccccccceEEEEEec-CCeE------EEEccccEEEecCCC-------------------------------- Confidence 8887754433222222222111 1110 112233333332100 Q ss_pred CCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh---hhhHhhh--------cCe Q lcl|NC_013644. 232 YGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS---KLRQNVK--------SKK 300 (510) Q Consensus 232 ~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~---~~~~~~~--------~~~ 300 (510) | .+.-.|.|.+..+...++....+..-..+.+...+.|-.+++....-+++ .+...+. .++ T Consensus 185 ----~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~ 256 (432) T protein:vir:10 185 ----T----LDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHR 256 (432) T ss_pred ----C----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCc Confidence 0 01123667777666666665555555555566666677666542221111 1221111 134 Q ss_pred eeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 301 VVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWM 380 (510) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~ 380 (510) ++.++++.+++.+..+.....+.+..+...+.|+..-++|+.-.+..+..+...++. .....+...|+-+ T Consensus 257 ~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~----------~~~~~~~~~l~P~ 326 (432) T protein:vir:10 257 IALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQ----------QQQQFYTDTLQAT 326 (432) T ss_pred ceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHH----------HHHHHHHHHHHHH Confidence 556666666655554444445566677788899998889886443222222111111 1112334444444 Q ss_pred HHHHHHHHhhccC--Ccc-ccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHH Q lcl|NC_013644. 381 NKLVIDDINRRYT--KAF-DPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLD 457 (510) Q Consensus 381 ~~~i~~~~~~~~~--~~~-~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~ 457 (510) ++.|...+..+-- ... ....+++.++.-+..|..+.++.+.++..+|+++.-.++++++.-..+...+ T Consensus 327 ~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD~--------- 397 (432) T protein:vir:10 327 LTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDR--------- 397 (432) T ss_pred HHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCe--------- Confidence 4444444432211 111 1113445555677889999999999999999999988888775421110000 Q ss_pred HHHHHHHHHhhhccCCCCCCCCCc-ccCCCCCCcccccccCcccc Q lcl|NC_013644. 458 WEDVKEALEEAEYTKGLSDNTDEE-ETAVNPDDPTQQMAEGATGS 501 (510) Q Consensus 458 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 501 (510) .....+..+- +..+... +.+...+ +....|.+|+ T Consensus 398 ---~~~~~n~~~~----~~~~~~~~k~~~~~~---~~~~~~~~~~ 432 (432) T protein:vir:10 398 ---LLVNGNMLPI----DMAGQAYLKGGDTNG---EVSKEGNEGN 432 (432) T ss_pred ---Eeecccccch----hhccccccCCCCCCC---CCCCCCCCCC Confidence 0000000000 0000000 0000000 1111111122 No 138 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=98.58 E-value=2.4e-07 Score=56.85 Aligned_cols=420 Identities=9% Similarity=0.019 Sum_probs=162.9 Q ss_pred CCCc---------------cCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceecccccccccccc Q lcl|NC_013644. 1 MEAL---------------LSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYA 65 (510) Q Consensus 1 ~~~~---------------~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ 65 (510) |+.. ...|+..... . |-. ......-+ |.+++...+ +-+... .+ T Consensus 34 ~~~~~~p~~~~~~~~~~~~~~~d~~~~~~------~------r~g-~~~~~~~~-g~~~~~epp-~d~~~l-------~~ 91 (648) T protein:vir:79 34 MQLGEAPGAMPKGGGGGGSAKRDPKMSLV------K------RIG-LAIMDGGG-GGRDFEEPE-FDFNEI-------TS 91 (648) T ss_pred cccCCCccccCCCCcccccccccchhHHH------H------HhH-HHHHhhcC-CccccccCC-cCHHHH-------HH Confidence 2211 1122211110 0 000 00000111 233332111 000000 00 Q ss_pred ccceeccchhHHHHHHHHhhhhcCCceeccCcHHHHHH--HHHH-hc----cCHHHHHHHHHHHHHhcCeEEEEEEECCC Q lcl|NC_013644. 66 SNVRIPHGFFPEIVDQKTQYLLSNPVEYETENEELKEY--LAEY-YN----SEFQVVLQELVEGSSQKGFEYVYARTNAE 138 (510) Q Consensus 66 ~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~--l~~~-~~----n~~~~~~~e~~~~~~~~G~~~~~v~~d~~ 138 (510) ..-..++....|+..+.-+.+-|+.+...++...+. .... .. .+.......+..+.+.+|.||+.+-.+.+ T Consensus 92 --l~~~np~V~~aI~iia~~ia~l~~~i~~~~~~~~~~~~~~~ll~rPn~~~t~~~f~~~l~~~lll~GNAYveiiRd~~ 169 (648) T protein:vir:79 92 --AYNTEGYVRQAVDKYIEMMFKADWDFVSKNPNAVEYIRMRFTLMAEATQIPTNQLFIEIAEDLVKYCNVVIAKSRAKD 169 (648) T ss_pred --HHhcChHHHHHHHHHHHHHhhCcceEEecCCccchhhHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCC Confidence 001245566677777777777777765443221111 1111 11 23445666778889999999999988888 Q ss_pred CceE----------------EEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCce Q lcl|NC_013644. 139 DRLC----------------FQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKD 202 (510) Q Consensus 139 g~~~----------------i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~ 202 (510) |.+- +.+++|..+.+..++.+.. .. |.|...+++. T Consensus 170 G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~d~~g~~----~~-------------------------Y~y~~~g~~~ 220 (648) T protein:vir:79 170 ALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKRDKFGMI----KG-------------------------WQQEQEGQDK 220 (648) T ss_pred CccchhhhhhhhccccceeeeEeecCceeEEEEcCCCce----ee-------------------------eEEEecCCce Confidence 7321 1123333333332222110 00 1111111000 Q ss_pred eecccccccccccccccccccccccccccCCcccEEEecC-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_013644. 203 YELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSN-----NKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEA 277 (510) Q Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----n~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~ 277 (510) .. .|..=.|+||+. ...|+|.+..+...|+....+-....+.+...+.| T Consensus 221 ~~--------------------------~~~~~dIIHik~~~~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P 274 (648) T protein:vir:79 221 PQ--------------------------KFKPEDIVHIYYKREKGRAFGTPWLLPALDDIRALRQVEENVLRLVYRNLHP 274 (648) T ss_pred eE--------------------------EecCccEEEEccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCc Confidence 00 000012445542 23477777777666666555555555666667777 Q ss_pred eeEEec-CCCCchhh---hhHhhh-cC-eeeeccCCCceeEEeecC--C--HHHHHHHHHHHHHHHHHHhCCcccccccc Q lcl|NC_013644. 278 IYVVSG-FQGDDLSK---LRQNVK-SK-KVVGTGSDGGLDVKTVTI--P--TEGRKTKMEIDKENIYKFGMAFDSTQVGD 347 (510) Q Consensus 278 ~lv~~g-~~~~~~~~---~~~~~~-~~-~~~~~~~~~~~~~~~~~~--~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~ 347 (510) -.+++- .+...... ...... .. .+...+.+.+.+.+..+. . .-.+.+..+...+.|...-++|++-.+.. T Consensus 275 ~gil~~~~~~~~~e~~k~~~e~~~~~~~~~~i~gg~v~~~~~~i~~~~s~~dlqfle~rk~~~~eIa~aFgVPP~lLG~~ 354 (648) T protein:vir:79 275 LWHVKVGLEQEGFGAEEGEVDLVRGEVENMDVEGGMVTTERVNISSIASNQIIDAKEYLKHFEQRAFTVLGVSELMMGRG 354 (648) T ss_pred cEEEEeCCCccchHHHHHHHHHHHHhcccccccccccccceeeccccCCHHHHHHHHHHHHHHHHHHHHhCCCHhHcccC Confidence 776642 11111111 111111 11 122122222333333221 2 12355566777888999888998643322 Q ss_pred --Cc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHH Q lcl|NC_013644. 348 --GN-ITNIVIKARYTLLNMKANKTEARLRALLEWM-NKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTE 423 (510) Q Consensus 348 --g~-~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~-~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~ 423 (510) ++ .++.+....+... +.-.+..+...+... ++.++.-........ ....+++.|+.-+..|....++.+.++ T Consensus 355 ~~ss~stae~~~~~~~~~---i~~l~~~i~~~le~~~~~~ll~e~~l~~~l~-~d~~ieF~~~~Llr~D~~~~a~~~~~l 430 (648) T protein:vir:79 355 GTASRSTGDNLSSDFKDR---IKALQKVMATFINEFMVKEILMEGGFDPVLN-PDDKVEFRFNEIDMDSKIKLENQAVFL 430 (648) T ss_pred CCccchHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhhhhhcccccc-ccceEEEeecccchhhHHHHHHHHHHH Confidence 22 2233332222221 111112222222221 111110000000000 012467788888888999999999999 Q ss_pred HhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHH-hhhccCCCCCCCCCcccCC-CCCCcccccccCcc Q lcl|NC_013644. 424 AETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALE-EAEYTKGLSDNTDEEETAV-NPDDPTQQMAEGAT 499 (510) Q Consensus 424 ~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 499 (510) .++|+||.-.++++++. +.+..-.....-+ ......... ..+.+.+......+...+. ..+.+.+..+.+ T Consensus 431 ~~~GilT~NEaR~~lGlpPi~~g~~~~~l~~~----~~~~~~~~~~~~~~~~~~~~~~~~a~~eg~~~e~~~~~~~~~-- 504 (648) T protein:vir:79 431 YEHNAISEDEMRELIGRDPVDDGEGRAKMHLQ----MVTIAQATALAALAPTPAGGSSASASGDKKKKATDNKTKPTN-- 504 (648) T ss_pred HhCCCcCHHHHHHHhCCCCCCCCCCccccccc----cccchhccccccCCCCCCCCCCCCccccccccccCCCCCCCC-- Confidence 99999999999888643 2211100000000 000000000 0011111111000000000 000000001111 Q ss_pred cccccccCCCC Q lcl|NC_013644. 500 GSTESQLPENG 510 (510) Q Consensus 500 ~~~~~~~~~~~ 510 (510) ..+.|.+.+. T Consensus 505 -~~g~~~~~~~ 514 (648) T protein:vir:79 505 -QHGTKTSPKK 514 (648) T ss_pred -CCCcCCCCcc Confidence 1111111111 No 139 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=98.58 E-value=2.4e-07 Score=56.84 Aligned_cols=408 Identities=10% Similarity=0.005 Sum_probs=166.4 Q ss_pred HHHHHHHhc-cCCcchhccc--c--eec----ccccc-ccccc-cccceeccchhHHHHHHHHhhhhcCCceeccCcH-- Q lcl|NC_013644. 32 AETGIRYYN-HENDIMNNRI--F--YVD----DEGIL-REDKY-ASNVRIPHGFFPEIVDQKTQYLLSNPVEYETENE-- 98 (510) Q Consensus 32 ~~~~~~YY~-g~~~i~~~~~--~--~~~----~~~~~-~~~~~-~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~-- 98 (510) |-.+.+.+. +......... . .+. ..+.. ..... -+..-+.+.-.-..|+..++-+.+-|+++-.... T Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~ 80 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAEGRAWEPYDPSIYNLGATASSGERVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGGT 80 (457) T ss_pred CchhhhhhccccccccccccccccccchhhhhhccccccCCceechHHhhccHHHHHHHHHHHHhHhhCceEEEEecCCc Confidence 111111110 0000000000 0 000 00000 00000 0000000111122344444445555766421111 Q ss_pred --HH-HHHHHHHhc--cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCc-eeEEEEE Q lcl|NC_013644. 99 --EL-KEYLAEYYN--SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNEL-QRICRHY 168 (510) Q Consensus 99 --~~-~~~l~~~~~--n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~-~~~~~~~ 168 (510) .. ...+..++. |+ .......+....+.+|.||+++..+ .|++ .+.+++|..+.+.-+..... ......| T Consensus 81 ~~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~y 159 (457) T protein:vir:62 81 RKEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWA-GPNIAGLDVLDPTKIHVHMVMVDGLRRKVFEAY 159 (457) T ss_pred cccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEeccCCccceeEEEE Confidence 11 112233332 22 3455666778889999999988655 4554 67788888887755432221 1111111 Q ss_pred EEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCC Q lcl|NC_013644. 169 ITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETT 248 (510) Q Consensus 169 ~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~s 248 (510) . ...++. ......|.+..+.+++.-. .. ..-.|.| T Consensus 160 ~--~~~~g~---~~~~~~~~~~eiih~r~~~-------------------------------~~---------~~~~G~s 194 (457) T protein:vir:62 160 D--IDADGN---EVLLGWFTPRDVLHIPGMM-------------------------------LP---------GDFVGCS 194 (457) T ss_pred E--EccCCc---eeEEEeeCccceEEecCCC-------------------------------CC---------Cceeccc Confidence 1 111111 1122234444454443110 00 0113666 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh---hhhHhh----h----cCeeeeccCCCceeEEeecC Q lcl|NC_013644. 249 DLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS---KLRQNV----K----SKKVVGTGSDGGLDVKTVTI 317 (510) Q Consensus 249 d~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~---~~~~~~----~----~~~~~~~~~~~~~~~~~~~~ 317 (510) -++.+...|.....+..-..+.+...+.|-.+++-...-..+ .+...+ . .++++.++++.+++.++.+. T Consensus 195 p~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~ 274 (457) T protein:vir:62 195 PISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVPGTMSEEGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSP 274 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCCh Confidence 666666655555555444455556666666655432221111 121111 1 13356677666666665544 Q ss_pred CHHHHHHHHHHHHHHHHHHhCCccccccc--cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCc Q lcl|NC_013644. 318 PTEGRKTKMEIDKENIYKFGMAFDSTQVG--DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKA 395 (510) Q Consensus 318 ~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~ 395 (510) ....+.+..+..++.|...-++|+.-.+. .++.++..++..... .+...|.-+++.|...+..+--.. T Consensus 275 ~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~----------f~~~~l~P~~~~ie~~ln~~L~~~ 344 (457) T protein:vir:62 275 DEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIA----------FTMFSLRPWLERIEAGFNRLLFAE 344 (457) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHH----------HHHHHHHHHHHHHHHHHHhhhcCc Confidence 44456677777888899988898854322 222222222221111 222233333333333333211111 Q ss_pred c--ccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|NC_013644. 396 F--DPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALEEAEYT 471 (510) Q Consensus 396 ~--~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~ 471 (510) . ....+++.+..-+-.|..+.++.+.+++++|+++.-.++++++. +.+....+...- ..........+..+.+ T Consensus 345 ~~~~~~~i~fd~~~l~~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~D~~~~~---~n~~~~~~~~~~~~~~ 421 (457) T protein:vir:62 345 TADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGEKYRVP---LNLGEIGEEPEPEPAP 421 (457) T ss_pred cccCceEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeec---cccccccccccccccC Confidence 1 12234555556667799999999999999999999988887643 322100000000 0000000000000000 Q ss_pred CCC---------CCCCCCcccCCCCCCcccccccCc Q lcl|NC_013644. 472 KGL---------SDNTDEEETAVNPDDPTQQMAEGA 498 (510) Q Consensus 472 ~~~---------~~~~~~~~~~~~~~~~~~~~~~~~ 498 (510) ... ....+.+..++++++...+++..+ T Consensus 422 ~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~ 457 (457) T protein:vir:62 422 APPAIDPPAEEPADDEEPDNAEGDPDEGETEDDDDA 457 (457) T ss_pred CCccCCCCccCCCCCCCCCCCCCCCccccccccccC Confidence 000 000000111112222212222221 No 140 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=98.57 E-value=2.6e-07 Score=56.64 Aligned_cols=450 Identities=11% Similarity=0.074 Sum_probs=163.4 Q ss_pred CCCc------------cCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHH-------hccCCcchhcc--cceecccccc Q lcl|NC_013644. 1 MEAL------------LSEDVKIIANALKAAIDKDRKSSSKREAETGIRY-------YNHENDIMNNR--IFYVDDEGIL 59 (510) Q Consensus 1 ~~~~------------~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~Y-------Y~g~~~i~~~~--~~~~~~~~~~ 59 (510) |-+- ..++...+... ..-|+. ++..++.| -.|++.-...+ -......+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~--------~~~~~~~~~~~~~~~~~~~~~a~~~p~~~~~~~~~~~~ 71 (576) T protein:vir:96 1 MVTRLADIFKRLRLGRDYEDIIDTVPI-DDGLQA--------NIRNIEEKSKELNKSLYGKQQAYAEPFLEVMDTNPEFR 71 (576) T ss_pred ChhhHHHHHHHHhccCccccchhhhhc-ccChhH--------HHHHhhhhhhhhccccCCccchhhcceeeeeecCCCcc Confidence 1100 01111111000 000000 01111111 00111000000 0011111111 Q ss_pred ccccccccc----eec-----cchhHHHHHH----HHhhh---------hcCCceeccC-----cHHH--HHHHHHHh-- Q lcl|NC_013644. 60 REDKYASNV----RIP-----HGFFPEIVDQ----KTQYL---------LSNPVEYETE-----NEEL--KEYLAEYY-- 108 (510) Q Consensus 60 ~~~~~~~~~----ki~-----~n~~~~Iv~~----~~~~l---------~g~p~~~~~~-----d~~~--~~~l~~~~-- 108 (510) .++...... .+. .++...+|+. .+.|. .|=++..... +... ...+..++ T Consensus 72 ~~p~~~~~~~~~~~~l~~~~~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~ 151 (576) T protein:vir:96 72 TKRSYMKNSDNLHDVLKQFGNNPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILN 151 (576) T ss_pred ccCcchhhhhhhHHHHHHhhcCHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhh Confidence 111110000 000 1223333333 33321 1112222111 1111 11122222 Q ss_pred --c--c----CHHHHHHHHHHHHHhcCeEEEEEEECCC--Cce-EEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCc Q lcl|NC_013644. 109 --N--S----EFQVVLQELVEGSSQKGFEYVYARTNAE--DRL-CFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGE 177 (510) Q Consensus 109 --~--n----~~~~~~~e~~~~~~~~G~~~~~v~~d~~--g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 177 (510) . + .+......+..+.+.+|.+|+++.++.+ |++ .+.+++|..+.++.+..+........| +....+. T Consensus 152 ~~~~~~p~~~t~~~f~~~lv~dlll~Gna~~~i~~~rd~~g~~~~L~pl~p~~V~v~~~~dg~~~~~~~~~-~~~~~~~- 229 (576) T protein:vir:96 152 TGRDKDIDRDSFQSFCRKIVRDTYTYDQVNFEKVFNKKNATTMDKFIAVDPSTIFYATDKNGKIIKGGKRF-VQVINKK- 229 (576) T ss_pred ccCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEEecCCCCceEEEEEeCCceeEEEECCCCceeeeeeEE-EEecCCc- Confidence 1 1 2345666778889999999988776554 443 588899999998887654321111111 1111110 Q ss_pred eeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCCcHHHHHHHH Q lcl|NC_013644. 178 TVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTDLKPIKALI 257 (510) Q Consensus 178 ~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~li 257 (510) ....+....+.++..... + -......|.|-++.+...| T Consensus 230 -----~~~~~~~~dii~~~~~~~-----------------------------------~--d~~~~~~G~Spi~~a~~~i 267 (576) T protein:vir:96 230 -----VVASFTSREMAMGIRNPR-----------------------------------T--ELSSSGYGLSEVEIAMKQF 267 (576) T ss_pred -----eEEEecccceEEEeecCC-----------------------------------C--CcccCcccccHHHHHHHHH Confidence 111122222222221100 0 0000123556666555555 Q ss_pred HHHHHHHHHHHHHHHHhccceeEE--ecCC-CCc--hhhhhHhhh--------cCe-eeeccCCCceeEEeecCCHHHHH Q lcl|NC_013644. 258 DDYDLMNCFLSNNLQDFAEAIYVV--SGFQ-GDD--LSKLRQNVK--------SKK-VVGTGSDGGLDVKTVTIPTEGRK 323 (510) Q Consensus 258 D~~n~~~S~~~~~~~~~~~~~lv~--~g~~-~~~--~~~~~~~~~--------~~~-~~~~~~~~~~~~~~~~~~~~~~~ 323 (510) .....+..-..+.+.-.+.|-.++ .|.. .++ ...++..+. .++ ++.++++.+.+-++.......+. T Consensus 268 ~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~~nag~~p~vl~~G~~~~~ls~~~~d~qfl 347 (576) T protein:vir:96 268 IAYNNTETFNDRFFSHGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGINGSWQVPVVMADDIKFVNMTPTANDMQFE 347 (576) T ss_pred HHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEeccCChhhHHHH Confidence 555554444445555556665444 3421 222 122222211 122 34456655555555545556677 Q ss_pred HHHHHHHHHHHHHhCCccccccc--cCccc----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccc Q lcl|NC_013644. 324 TKMEIDKENIYKFGMAFDSTQVG--DGNIT----NIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFD 397 (510) Q Consensus 324 ~~~~~l~~~i~~~s~~p~~~~~~--~g~~S----g~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~ 397 (510) +..+...+.|...-++|+.-.+. .++.+ |.++.+ +.+. ......+...|.-+++.|...+..+--..+ T Consensus 348 e~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~--sn~e---~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~- 421 (576) T protein:vir:96 348 KWLTYLINIISALYGIDPAEIGFPNRGGATGGKGGNTLNE--ADPG---KKQQQSQNKGLQPLLRFIEDLINTHIISEY- 421 (576) T ss_pred HHHHHhHHHHHHHhCCCHHHcccccccccccccccccccc--ccHH---HHHHHHHHHHHHHHHHHHHHHHHhhhchhc- Confidence 78888889999988888753221 11111 111110 0000 112233344444444444443332211111 Q ss_pred cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCc-HHHH------HH---HH-HHHHH-HHHHHHH Q lcl|NC_013644. 398 PTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDD-DNVL------RL---IC-EQFDL-DWEDVKE 463 (510) Q Consensus 398 ~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d-~e~~------~~---~~-e~~e~-~~~~~~~ 463 (510) ...+.+.|.+.-+.+.++..... .+..+|+++.-.++++++. +.. ++.. .. .. ...+. ....... T Consensus 422 ~~~~~~~f~r~d~~~~~e~~~~~-~~~~~G~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~ 500 (576) T protein:vir:96 422 SDKYVFQFVGGDTKSELDKIKIL-QEEVKTYKTVNEARKEKGLKPIEGGDVLLDGSFIQSMSLNTQKEQYEDTKQKERFD 500 (576) T ss_pred cCceEEEeccCCHHHHHHHHHHH-HHHhcCccCHHHHHHHhCCCCCCCcceeccccccccccccccCCCCCCcccccccc Confidence 12356678777666666655433 3455799998888877633 221 0000 00 00 00000 0000000 Q ss_pred HHHhhhccCCCCCCCC-CcccCCCCCC-cccccccCcccccccccCCCC Q lcl|NC_013644. 464 ALEEAEYTKGLSDNTD-EEETAVNPDD-PTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 464 ~~~~~~~~~~~~~~~~-~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 510 (510) .......++....+.. ..++..++.. ......++.-|.+++|+|.+- T Consensus 501 ~~~~~~~~~~~~~~~~~s~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 549 (576) T protein:vir:96 501 MIQQFLNSPDDEEPQQESTEDKVDGRESNDPTKIDSPVGTDGQLKDQDN 549 (576) T ss_pred ccccccCCCCCCCCCCCCCCCcccccccccCCCCCCccccccccCCCCc Confidence 0000001111110000 0000000000 111112223445555544322 No 141 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=98.54 E-value=3.2e-07 Score=56.16 Aligned_cols=393 Identities=9% Similarity=0.005 Sum_probs=167.4 Q ss_pred ccccccccceeccchhHHHHHHHHhhhhcCCceeccC--------cHHHHHHHHHHhc----c-----------CHHHHH Q lcl|NC_013644. 60 REDKYASNVRIPHGFFPEIVDQKTQYLLSNPVEYETE--------NEELKEYLAEYYN----S-----------EFQVVL 116 (510) Q Consensus 60 ~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~--------d~~~~~~l~~~~~----n-----------~~~~~~ 116 (510) -+. .--..++....|+..++.+.|-|+.+... .....+.+..++. | .+...+ T Consensus 1 l~~-----l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~ 75 (467) T protein:vir:31 1 MAE-----LLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVL 75 (467) T ss_pred Chh-----hhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHH Confidence 000 00124678888888899898888876311 1112223333331 2 122445 Q ss_pred HHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEE Q lcl|NC_013644. 117 QELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFF 195 (510) Q Consensus 117 ~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~ 195 (510) ..+..+...+|.||+.+..+..|++ .+.+++|..+.+.-|... +... .... ..++.+|........ T Consensus 76 ~~~~~~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~-------~~~~--~~~~----~~~~~~~~~~~~~~~ 142 (467) T protein:vir:31 76 QTAWTDYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERG-------FVQL--LEEK----EKYFGVAGDRYQTNG 142 (467) T ss_pred HHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecce-------eEee--cCCc----eeeEEeccccceeec Confidence 5677888999999999888988875 588889988877655321 0000 0000 001111111100000 Q ss_pred EEcCCceeecccccccccccccccccccccccccccCCcccEEEecCC-----CCCCCcHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 196 VAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNN-----KQETTDLKPIKALIDDYDLMNCFLSNN 270 (510) Q Consensus 196 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn-----~~g~sd~~~v~~liD~~n~~~S~~~~~ 270 (510) . +........ ...........+..=-|+|++.. -.|.|.+......++....+-.-..+. T Consensus 143 ~----~~~~~~~~~-----------~~~~~~~~~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~ 207 (467) T protein:vir:31 143 N----GDLDPVFVD-----------ADDGSTGTSVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDF 207 (467) T ss_pred c----cceeeeeee-----------eccccccceeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHH Confidence 0 000000000 00000000011111125666543 246776665555454433333333333 Q ss_pred HHHhccceeEE--ecCCCCc--hhhhhHhhh-------------------cCeeeeccCCCceeEEee-----c---CCH Q lcl|NC_013644. 271 LQDFAEAIYVV--SGFQGDD--LSKLRQNVK-------------------SKKVVGTGSDGGLDVKTV-----T---IPT 319 (510) Q Consensus 271 ~~~~~~~~lv~--~g~~~~~--~~~~~~~~~-------------------~~~~~~~~~~~~~~~~~~-----~---~~~ 319 (510) +...+.|-.++ +|...++ ...+...+. ....+.+..+.+.+.+.. . ... T Consensus 208 f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d 287 (467) T protein:vir:31 208 FENDGVPRIAIIVKGAELTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEE 287 (467) T ss_pred HhccCCCceEEEecCcCCCHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhh Confidence 44445554443 4432222 111222111 011223334443333221 1 123 Q ss_pred HHHHHHHHHHHHHHHHHhCCccccccc--cCcc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC--- Q lcl|NC_013644. 320 EGRKTKMEIDKENIYKFGMAFDSTQVG--DGNI-TNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYT--- 393 (510) Q Consensus 320 ~~~~~~~~~l~~~i~~~s~~p~~~~~~--~g~~-Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~--- 393 (510) ..+....+...+.|...-++|+.-.+. .++. |+. +... ...+...|.-+++.|...++.+-- T Consensus 288 ~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~~s~~--e~~~----------~~f~~~~l~P~~~~ie~~ln~~l~~~~ 355 (467) T protein:vir:31 288 ASFLEFRGRNEHDILKVHDVPPVIAGVVESGAFSTDA--EEQR----------KEFAEETIQPKQHDFGELLYELVHKQG 355 (467) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCcccCH--HHHH----------HHHHHHHHHHHHHHHHHHHHHhhcchh Confidence 445667777788888888888743221 1221 221 1111 112222333333333333332111 Q ss_pred CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|NC_013644. 394 KAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALEEAEYT 471 (510) Q Consensus 394 ~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~ 471 (510) .......+++.+...+..|..+.++.+..++.+|+++.-.++++++. +.|+... . .......... T Consensus 356 ~~~~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~pi~d~~~~----------~---~~~~~~~~~~ 422 (467) T protein:vir:31 356 LDAPDWTIEFELAKPDTKLQDVEIASQRVQAMQGLLTVNELRDEFGFEPFPEEHVY----------G---GETLVAEVTG 422 (467) T ss_pred hccCCceEEEecchhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCccccc----------C---Cccccccccc Confidence 11112246667778888999999999999999999999999988754 2221100 0 0000000000 Q ss_pred CCCCCCCCCcccCCCCCCccccc-ccCcccccccccCCCC Q lcl|NC_013644. 472 KGLSDNTDEEETAVNPDDPTQQM-AEGATGSTESQLPENG 510 (510) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 510 (510) +.......+++....++++.++. +.-...-+..|.-+-| T Consensus 423 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 462 (467) T protein:vir:31 423 GSGPGGGIGDQIEQLVEDRADEIIDSYQADLETEQLIEIG 462 (467) T ss_pred ccCCCCcccCcCCCCCCCcccchHhhhhhccccchhhhhc Confidence 11111111111111111111111 1111111223333334 No 142 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=98.52 E-value=3.8e-07 Score=55.78 Aligned_cols=453 Identities=12% Similarity=0.076 Sum_probs=207.6 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhh--HHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSS--KREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~--~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) |-.-+ .+.+-.+.+++..+..+..+. -.++..+.+|..-. .+. ..+.. ......++..+-+... T Consensus 1 m~~~~--~~~~~~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~--~~~-------~~~~~---~~~~~~~~~dst~~~a 66 (535) T protein:vir:15 1 MADSK--RTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPS--LFP-------KESDN---ESTDYTTPWQAVGARG 66 (535) T ss_pred CCccc--hhccchHHHHHHHHHHHHHhhHHHHHHHHHHHHhccc--ccC-------CCCCc---ccccccccccccHHHH Confidence 32221 222234445555555553321 12344444443221 110 00000 0111124555666677 Q ss_pred HHHHHhhhhc--CCce----eccCcH-------------HHHHHH-------HHHh-ccCHHHHHHHHHHHHHhcCeEEE Q lcl|NC_013644. 79 VDQKTQYLLS--NPVE----YETENE-------------ELKEYL-------AEYY-NSEFQVVLQELVEGSSQKGFEYV 131 (510) Q Consensus 79 v~~~~~~l~g--~p~~----~~~~d~-------------~~~~~l-------~~~~-~n~~~~~~~e~~~~~~~~G~~~~ 131 (510) +++.++.|++ -|.+ +...+. ++...| ...+ .+||.....++.++..++|.|.+ T Consensus 67 ~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l 146 (535) T protein:vir:15 67 LNNLASKLMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALL 146 (535) T ss_pred HHHHHHHHHHhhcCCCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeE Confidence 7777766654 1322 222221 122222 2223 47788999999999999999987 Q ss_pred EEEECCCCceEEEEEcccceEEEEcCCCCceeEEEEEEEEEee------------CCceeEEEEEEEEcCCcEEEEEEcC Q lcl|NC_013644. 132 YARTNAEDRLCFQVADSLNVFGVYNEYNELQRICRHYITEIEK------------DGETVDIHHAEVWTDQNVYFFVAED 199 (510) Q Consensus 132 ~v~~d~~g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~e~y~~~~i~~~~~~~ 199 (510) ++-.+..+.++++.++-.+++..-|..+++..++|.++..... .........+++|+.- +...+. T Consensus 147 ~~~~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v---~~~~~~ 223 (535) T protein:vir:15 147 YLPEPEGSYNPMKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHV---YLDEES 223 (535) T ss_pred EeecCCCCceeeEEEEcCeeEEeeCCCCCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEEE---EEecCC Confidence 7766666778888888888888888888888887766554210 0111111222333211 111111 Q ss_pred Cceeeccccccccccccccccccccc--ccccccCCcccEEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 200 NKDYELDEAEPINPRPHVLAVDSENE--SLLQRSYGQIPFYRLS-----NNKQETTDLKPIKALIDDYDLMNCFLSNNLQ 272 (510) Q Consensus 200 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~g~iPvv~~~-----nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~ 272 (510) +.+. +....++... .....+|..+|++.++ ++.+|+|-.++..+-+..+|.+.-......+ T Consensus 224 ~~~~------------~~~e~~g~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~ 291 (535) T protein:vir:15 224 GDYL------------KYEEVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSM 291 (535) T ss_pred CcEE------------EEEEeeCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111 1111111111 1123356667777654 3468999999999999999999999999999 Q ss_pred HhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCcc-ccccccCc Q lcl|NC_013644. 273 DFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFD-STQVGDGN 349 (510) Q Consensus 273 ~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~-~~~~~~g~ 349 (510) ...+|.+.+.-....+..++.. ...+.+..+..+++..+... .+.......++.++..|-..- ..+ ...-.... T Consensus 292 ~~~~p~~lv~~~g~~~~~~l~~--~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af-~~~~~~~~~~~r 368 (535) T protein:vir:15 292 ISAKVIGLVNPAGITQPRRLTK--AQTGDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAF-MLNSAVQRTGER 368 (535) T ss_pred HHhcCceeecccccccchhccc--CCceeeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCcc Confidence 9999987653222222222211 12234445555667766533 467777788888777775522 121 11122333 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHhhccCC-ccccceeeEEeCCCCCCCHH-HHHHH Q lcl|NC_013644. 350 ITNIVIKARYTLLNMKANKTEARLRALLEW--------MNKLVIDDINRRYTK-AFDPTEVSFTFTREVMVNET-DIVND 419 (510) Q Consensus 350 ~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~--------~~~~i~~~~~~~~~~-~~~~~~v~i~f~~~~p~d~~-e~~~~ 419 (510) .|++.+.. ++.++...++..+.+ +++.++.++...+.- ......++++|..++..-.. ..++. T Consensus 369 ~TAtEV~~-------r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~ 441 (535) T protein:vir:15 369 VTAEEIRY-------VASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDK 441 (535) T ss_pred ccHHHHHH-------HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHH Confidence 56655554 344555555554444 344444444443322 22334577888766654211 11111 Q ss_pred ----HHHHHhcC------CCchHHHHHh----C--C---CCCc-HHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCC Q lcl|NC_013644. 420 ----EKTEAETR------KIILESILQV----A--P---RLDD-DNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTD 479 (510) Q Consensus 420 ----~~~~~~~g------~iS~et~~~~----~--~---~v~d-~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~ 479 (510) +..+.+.+ .+....++.. + | .+.. +|.+.+.++..+.. .....+.... ... .... T Consensus 442 l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~~~~eev~~~~~q~~~~~--~~~~~a~~~g--~~~-~~~~ 516 (535) T protein:vir:15 442 LERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQT--GIENAAATGG--AGV-GALA 516 (535) T ss_pred HHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHH--HHHHHHHHHH--hhc-cchh Confidence 21111111 0111222221 1 1 1212 22222222211111 1111111111 111 1111 Q ss_pred CcccCCCCCCc-ccccccCcccc Q lcl|NC_013644. 480 EEETAVNPDDP-TQQMAEGATGS 501 (510) Q Consensus 480 ~~~~~~~~~~~-~~~~~~~~~~~ 501 (510) -.+|++- .--...|...+ T Consensus 517 ----~~~p~~~~~~~~~~g~~~~ 535 (535) T protein:vir:15 517 ----TSSPEAMQGAAAQAGLDAT 535 (535) T ss_pred ----ccChHHHHHHHhccCCCCC Confidence 0111110 00001111111 No 143 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=98.52 E-value=3.8e-07 Score=55.76 Aligned_cols=454 Identities=11% Similarity=0.090 Sum_probs=161.8 Q ss_pred CCCcc-----CCChhh-----------hHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhccc--ceeccccccccc Q lcl|NC_013644. 1 MEALL-----SEDVKI-----------IANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRI--FYVDDEGILRED 62 (510) Q Consensus 1 ~~~~~-----~~~~~~-----------~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~--~~~~~~~~~~~~ 62 (510) |-++. ..|... ....-.+.|+ +....+..+.+--.+++..-..+- ......+...+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~ 75 (563) T protein:vir:99 1 MADLFKQFRLGKDYGNNSTIAQVPIDEGLQANIKKIE-----QDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKR 75 (563) T ss_pred ChhhhhhhhcccccccccccceeeccCChhhhHhhhh-----ccchhHHHHHhhhccCCCcchhhhHhhhcccccccccc Confidence 33221 111111 1111111111 111112222222222221100000 000000000000 Q ss_pred c-cccc------ce-ec-cchhHHHHHHHHhhhh-------------cCCceecc-----CcHH--HHHHHHHHhc---- Q lcl|NC_013644. 63 K-YASN------VR-IP-HGFFPEIVDQKTQYLL-------------SNPVEYET-----ENEE--LKEYLAEYYN---- 109 (510) Q Consensus 63 ~-~~~~------~k-i~-~n~~~~Iv~~~~~~l~-------------g~p~~~~~-----~d~~--~~~~l~~~~~---- 109 (510) . ..+. .| +. .++...+|++.+..+. |=|+.+.. ..+. ....|..++. T Consensus 76 ~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~ 155 (563) T protein:vir:99 76 SYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGK 155 (563) T ss_pred cCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCC Confidence 0 0000 01 11 1234444444433322 11232211 1111 1122333321 Q ss_pred c------CHHHHHHHHHHHHHhcCeEEEEEE--ECCCCce-EEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeE Q lcl|NC_013644. 110 S------EFQVVLQELVEGSSQKGFEYVYAR--TNAEDRL-CFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVD 180 (510) Q Consensus 110 n------~~~~~~~e~~~~~~~~G~~~~~v~--~d~~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 180 (510) + .+...+..+..+.+.+|.||+++. .|..|++ .+.+++|..+.+..++.+.+......|++ ...+. T Consensus 156 ~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~-~~~g~---- 230 (563) T protein:vir:99 156 DKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRFVQ-VVDKR---- 230 (563) T ss_pred CCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccceeEEE-EeCCc---- Confidence 1 234566667888999999988765 4556765 58889999999888765432211111111 11110 Q ss_pred EEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCCcHHHHHHHHHHH Q lcl|NC_013644. 181 IHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDY 260 (510) Q Consensus 181 ~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~ 260 (510) ....+....+.++..... .+ ......|+|-++.+...|... T Consensus 231 --~~~~~~~~evI~~~~~~~----------------------------~d---------~~~~~~G~Spi~~a~~~i~~~ 271 (563) T protein:vir:99 231 --VVASFTSRELAMGIRNPR----------------------------TE---------LSSSGYGLSEVEIAMKEFIAY 271 (563) T ss_pred --eeEEecCcceEEEeccCC----------------------------CC---------cccCcccchHHHHHHHHHHHH Confidence 011122222222211000 00 000123556666555555544 Q ss_pred HHHHHHHHHHHHHhccceeEE--ecCC-CCc--hhhhhHhhh----c----Ce-eeeccCCCceeEEeecCCHHHHHHHH Q lcl|NC_013644. 261 DLMNCFLSNNLQDFAEAIYVV--SGFQ-GDD--LSKLRQNVK----S----KK-VVGTGSDGGLDVKTVTIPTEGRKTKM 326 (510) Q Consensus 261 n~~~S~~~~~~~~~~~~~lv~--~g~~-~~~--~~~~~~~~~----~----~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 326 (510) ..+..-..+.+...+.|-.++ .|.. .++ ...+...+. + ++ ++.++++.+.+-++.+.....+.+.. T Consensus 272 ~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~ 351 (563) T protein:vir:99 272 NNTESFNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWL 351 (563) T ss_pred HHHHHHHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHHHHHH Confidence 444444455555556666554 3421 221 112222111 1 12 24455555555555444555667788 Q ss_pred HHHHHHHHHHhCCccccccc--cC----cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccce Q lcl|NC_013644. 327 EIDKENIYKFGMAFDSTQVG--DG----NITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTE 400 (510) Q Consensus 327 ~~l~~~i~~~s~~p~~~~~~--~g----~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~ 400 (510) ...++.|...-++|+.-.+. .+ +..|..+.. +.+ .......+...|.-+++.|...+..+--..+. .. T Consensus 352 ~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~--sn~---e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~-~~ 425 (563) T protein:vir:99 352 NYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNE--ADP---GKKQQQSQNKGLQPLLRFIEDLVNRHIISEYG-DK 425 (563) T ss_pred HHHHHHHHHHhCCCHHHccccccccccccccccchhh--ccH---HHHHHHHHHHHHHHHHHHHHHHHHhhhchhcc-cc Confidence 88888999988898753221 11 111111110 000 01112333444444444444433322111111 23 Q ss_pred eeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHH-------H--HHHHHHHHHHHHHHHHHhhh Q lcl|NC_013644. 401 VSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLR-------L--ICEQFDLDWEDVKEALEEAE 469 (510) Q Consensus 401 v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~-------~--~~e~~e~~~~~~~~~~~~~~ 469 (510) +.+.|.+.-+.+..+..+ +.++..+|+++.-.++++++. +..-+... . .........+..+....... T Consensus 426 ~~~~f~r~D~~~~~e~~~-~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (563) T protein:vir:99 426 YTFQFVGGDTKSATDKLN-ILKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMM 504 (563) T ss_pred cEEEeccCCHHHHHHHHH-HHHHhcCCccCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhcc Confidence 566787665555555443 345678899999888877643 22100000 0 00000000000000000000 Q ss_pred ccCCCCCCCCCcccCCCCCCcccccccCcccccc-----cccCCCC Q lcl|NC_013644. 470 YTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTE-----SQLPENG 510 (510) Q Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~ 510 (510) .....+...++.++..++.+...+++...+-... +|..-.| T Consensus 505 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 550 (563) T protein:vir:99 505 SLLEGDNDDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTSNKG 550 (563) T ss_pred cccCCCCCCCCCCCCCCCCCCccccccccccccccccccccCcccc Confidence 0000000001111110100000001111111001 1111111 No 144 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=98.52 E-value=3.8e-07 Score=55.76 Aligned_cols=454 Identities=11% Similarity=0.090 Sum_probs=161.8 Q ss_pred CCCcc-----CCChhh-----------hHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhccc--ceeccccccccc Q lcl|NC_013644. 1 MEALL-----SEDVKI-----------IANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRI--FYVDDEGILRED 62 (510) Q Consensus 1 ~~~~~-----~~~~~~-----------~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~--~~~~~~~~~~~~ 62 (510) |-++. ..|... ....-.+.|+ +....+..+.+--.+++..-..+- ......+...+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~ 75 (563) T protein:vir:95 1 MADLFKQFRLGKDYGNNSTIAQVPIDEGLQANIKKIE-----QDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKR 75 (563) T ss_pred ChhhhhhhhcccccccccccceeeccCChhhhHhhhh-----ccchhHHHHHhhhccCCCcchhhhHhhhcccccccccc Confidence 33221 111111 1111111111 111112222222222221100000 000000000000 Q ss_pred c-cccc------ce-ec-cchhHHHHHHHHhhhh-------------cCCceecc-----CcHH--HHHHHHHHhc---- Q lcl|NC_013644. 63 K-YASN------VR-IP-HGFFPEIVDQKTQYLL-------------SNPVEYET-----ENEE--LKEYLAEYYN---- 109 (510) Q Consensus 63 ~-~~~~------~k-i~-~n~~~~Iv~~~~~~l~-------------g~p~~~~~-----~d~~--~~~~l~~~~~---- 109 (510) . ..+. .| +. .++...+|++.+..+. |=|+.+.. ..+. ....|..++. T Consensus 76 ~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~ 155 (563) T protein:vir:95 76 SYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGK 155 (563) T ss_pred cCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCC Confidence 0 0000 01 11 1234444444433322 11232211 1111 1122333321 Q ss_pred c------CHHHHHHHHHHHHHhcCeEEEEEE--ECCCCce-EEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeE Q lcl|NC_013644. 110 S------EFQVVLQELVEGSSQKGFEYVYAR--TNAEDRL-CFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVD 180 (510) Q Consensus 110 n------~~~~~~~e~~~~~~~~G~~~~~v~--~d~~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 180 (510) + .+...+..+..+.+.+|.||+++. .|..|++ .+.+++|..+.+..++.+.+......|++ ...+. T Consensus 156 ~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~-~~~g~---- 230 (563) T protein:vir:95 156 DKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRFVQ-VVDKR---- 230 (563) T ss_pred CCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccceeEEE-EeCCc---- Confidence 1 234566667888999999988765 4556765 58889999999888765432211111111 11110 Q ss_pred EEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCCcHHHHHHHHHHH Q lcl|NC_013644. 181 IHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDY 260 (510) Q Consensus 181 ~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~ 260 (510) ....+....+.++..... .+ ......|+|-++.+...|... T Consensus 231 --~~~~~~~~evI~~~~~~~----------------------------~d---------~~~~~~G~Spi~~a~~~i~~~ 271 (563) T protein:vir:95 231 --VVASFTSRELAMGIRNPR----------------------------TE---------LSSSGYGLSEVEIAMKEFIAY 271 (563) T ss_pred --eeEEecCcceEEEeccCC----------------------------CC---------cccCcccchHHHHHHHHHHHH Confidence 011122222222211000 00 000123556666555555544 Q ss_pred HHHHHHHHHHHHHhccceeEE--ecCC-CCc--hhhhhHhhh----c----Ce-eeeccCCCceeEEeecCCHHHHHHHH Q lcl|NC_013644. 261 DLMNCFLSNNLQDFAEAIYVV--SGFQ-GDD--LSKLRQNVK----S----KK-VVGTGSDGGLDVKTVTIPTEGRKTKM 326 (510) Q Consensus 261 n~~~S~~~~~~~~~~~~~lv~--~g~~-~~~--~~~~~~~~~----~----~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 326 (510) ..+..-..+.+...+.|-.++ .|.. .++ ...+...+. + ++ ++.++++.+.+-++.+.....+.+.. T Consensus 272 ~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~ 351 (563) T protein:vir:95 272 NNTESFNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWL 351 (563) T ss_pred HHHHHHHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHHHHHH Confidence 444444455555556666554 3421 221 112222111 1 12 24455555555555444555667788 Q ss_pred HHHHHHHHHHhCCccccccc--cC----cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccce Q lcl|NC_013644. 327 EIDKENIYKFGMAFDSTQVG--DG----NITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTE 400 (510) Q Consensus 327 ~~l~~~i~~~s~~p~~~~~~--~g----~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~ 400 (510) ...++.|...-++|+.-.+. .+ +..|..+.. +.+ .......+...|.-+++.|...+..+--..+. .. T Consensus 352 ~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~--sn~---e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~-~~ 425 (563) T protein:vir:95 352 NYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNE--ADP---GKKQQQSQNKGLQPLLRFIEDLVNRHIISEYG-DK 425 (563) T ss_pred HHHHHHHHHHhCCCHHHccccccccccccccccchhh--ccH---HHHHHHHHHHHHHHHHHHHHHHHHhhhchhcc-cc Confidence 88888999988898753221 11 111111110 000 01112333444444444444433322111111 23 Q ss_pred eeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHH-------H--HHHHHHHHHHHHHHHHHhhh Q lcl|NC_013644. 401 VSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLR-------L--ICEQFDLDWEDVKEALEEAE 469 (510) Q Consensus 401 v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~-------~--~~e~~e~~~~~~~~~~~~~~ 469 (510) +.+.|.+.-+.+..+..+ +.++..+|+++.-.++++++. +..-+... . .........+..+....... T Consensus 426 ~~~~f~r~D~~~~~e~~~-~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (563) T protein:vir:95 426 YTFQFVGGDTKSATDKLN-ILKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMM 504 (563) T ss_pred cEEEeccCCHHHHHHHHH-HHHHhcCCccCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhcc Confidence 566787665555555443 345678899999888877643 22100000 0 00000000000000000000 Q ss_pred ccCCCCCCCCCcccCCCCCCcccccccCcccccc-----cccCCCC Q lcl|NC_013644. 470 YTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTE-----SQLPENG 510 (510) Q Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~ 510 (510) .....+...++.++..++.+...+++...+-... +|..-.| T Consensus 505 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 550 (563) T protein:vir:95 505 SLLEGDNDDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTSNKG 550 (563) T ss_pred cccCCCCCCCCCCCCCCCCCCccccccccccccccccccccCcccc Confidence 0000000001111110100000001111111001 1111111 No 145 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=98.49 E-value=4.6e-07 Score=55.32 Aligned_cols=454 Identities=10% Similarity=0.060 Sum_probs=192.8 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhh--HHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSS--KREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~--~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) |-. ..+-..++.+++..+..+..+. -.++..+.+|..-.- .....+. ..+...++..+-+..- T Consensus 1 m~~---~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~-------~~~~~~~-----~~~~~~~~~dst~~~a 65 (536) T protein:vir:10 1 MAE---KRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSL-------FPKDSDN-----ASTDYQTPWQAVGARG 65 (536) T ss_pred Ccc---hhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc-------cCCCCCc-----ccccccccccccHHHH Confidence 322 1223345666666666554321 123444444432210 1111111 1111234566667777 Q ss_pred HHHHHhhhhc--CCc----eeccCcH-------------HHHH-------HHHHHh-ccCHHHHHHHHHHHHHhcCeEEE Q lcl|NC_013644. 79 VDQKTQYLLS--NPV----EYETENE-------------ELKE-------YLAEYY-NSEFQVVLQELVEGSSQKGFEYV 131 (510) Q Consensus 79 v~~~~~~l~g--~p~----~~~~~d~-------------~~~~-------~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~ 131 (510) +++.++.|++ -|. ++...+. +++. .+...+ .+||...+.++.++..++|.|.+ T Consensus 66 ~~~Laa~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l 145 (536) T protein:vir:10 66 LNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLL 145 (536) T ss_pred HHHHHHHHHhhhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeE Confidence 7777776654 131 1222221 1111 222223 46788889999999999999877 Q ss_pred EEEECCCCce-EEEEEcccceEEEEcCCCCceeEEEEEEEEEe------------eCCceeEEEEEEEEcCCcEEEEEEc Q lcl|NC_013644. 132 YARTNAEDRL-CFQVADSLNVFGVYNEYNELQRICRHYITEIE------------KDGETVDIHHAEVWTDQNVYFFVAE 198 (510) Q Consensus 132 ~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~e~y~~~~i~~~~~~ 198 (510) ++-.+..+.+ .++.++-.+++..-|..+++..++|-+..... ...+......+++|+.- +.+.. T Consensus 146 y~~e~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V---~~~~~ 222 (536) T protein:vir:10 146 YLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHI---YLDEA 222 (536) T ss_pred EEeeCCCCceeeEEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEE---EEecC Confidence 6654444333 46677777888777888888888776554411 11111112223333210 01111 Q ss_pred CCceeeccccccccccccccccccc--ccccccccCCcccEEEecC-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 199 DNKDYELDEAEPINPRPHVLAVDSE--NESLLQRSYGQIPFYRLSN-----NKQETTDLKPIKALIDDYDLMNCFLSNNL 271 (510) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~g~iPvv~~~n-----n~~g~sd~~~v~~liD~~n~~~S~~~~~~ 271 (510) .+.+. +....++. .......+|..+|++.++- +.+|+|-.++..+-+..+|.+.-...... T Consensus 223 ~~~~~------------~~~e~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~ 290 (536) T protein:vir:10 223 SGEYL------------RYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMS 290 (536) T ss_pred CCcEE------------EEEeecCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 11111 00011111 1112234566788777653 46799999999999999998877777766 Q ss_pred HHhccceeEE-ecCCCCchhhhhHhhhcCeeeeccCCCceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCCccccccccC Q lcl|NC_013644. 272 QDFAEAIYVV-SGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKT--VTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDG 348 (510) Q Consensus 272 ~~~~~~~lv~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g 348 (510) .....|.+.+ .+ +..++..+.. ...+.+..+..+++..+. ...+.......++.++..|-..-..-....-... T Consensus 291 ~~a~~~~~lv~p~-g~~~~~~~~~--~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~~~~~ 367 (536) T protein:vir:10 291 MISSKVIGLVNPA-GITQPRRLTK--AQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGE 367 (536) T ss_pred HHHhcCCcccCcc-cccchhhhcc--CCCcceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhhcccCCCC Confidence 6666654433 22 1122222211 111233334444455443 3346666777777777776442211111112223 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHhhccCCc-cccceeeEEeCCCCCC-CHHHHHH Q lcl|NC_013644. 349 NITNIVIKARYTLLNMKANKTEARLRALLEW--------MNKLVIDDINRRYTKA-FDPTEVSFTFTREVMV-NETDIVN 418 (510) Q Consensus 349 ~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~--------~~~~i~~~~~~~~~~~-~~~~~v~i~f~~~~p~-d~~e~~~ 418 (510) ..|++.+..+ +.++...++..+.+ +++.++.++...+.-+ .....+.+.+.-++.. .....++ T Consensus 368 r~TAtEV~~r-------~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~ 440 (536) T protein:vir:10 368 RVTAEEIRYV-------ASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLD 440 (536) T ss_pred CccHHHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHHH Confidence 3466665553 34444444444333 3444444443333221 1122345555444421 1112222 Q ss_pred HHH----HHHhcC------CCchHHHHHhC---CCC-------CcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCC Q lcl|NC_013644. 419 DEK----TEAETR------KIILESILQVA---PRL-------DDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNT 478 (510) Q Consensus 419 ~~~----~~~~~g------~iS~et~~~~~---~~v-------~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~ 478 (510) .+. .+.+.+ .|....++..+ -++ +++|.+++.+++++.+......++............+ T Consensus 441 ~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~ 520 (536) T protein:vir:10 441 KLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASP 520 (536) T ss_pred HHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Confidence 221 111111 12222333221 122 2233333333222222111111111111111111111 Q ss_pred CCcccCCCCCCcccccccCcccccccccCCC Q lcl|NC_013644. 479 DEEETAVNPDDPTQQMAEGATGSTESQLPEN 509 (510) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (510) ..-....+. ... -|+- T Consensus 521 ~~~~~~~~~--~g~-------------~~~~ 536 (536) T protein:vir:10 521 EAMAAAADS--VGL-------------QPGI 536 (536) T ss_pred hhHHhhhhc--ccc-------------CCCC Confidence 100000000 000 0111 No 146 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=98.48 E-value=5.1e-07 Score=55.10 Aligned_cols=484 Identities=10% Similarity=-0.011 Sum_probs=185.0 Q ss_pred CCCc----c---CCChhhhHHHHHHHHHhhhhhh--hHHHHHHHHHHhccCCcchhcccceeccccccccccccccceec Q lcl|NC_013644. 1 MEAL----L---SEDVKIIANALKAAIDKDRKSS--SKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIP 71 (510) Q Consensus 1 ~~~~----~---~~~~~~~~~~i~~~i~~~~~~~--~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~ 71 (510) .+++ . =++ +.....+++.++..+... .+.+...+.+||.+..+-. ++..+ +++ +++ T Consensus 11 ~~~~~~~~~~~~~~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~--grs--~vv 75 (763) T protein:vir:95 11 LPDPSQATKLTSWKN-ELSLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEGKAK----------PPKVK--GRS--QVQ 75 (763) T ss_pred CccccchhcCCCCCC-hHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccccCc----------ccccC--CCc--ccc Confidence 1111 1 111 222333444444332221 2223344556654443211 11111 222 344 Q ss_pred cchhHHHHHH----HHhhhhcCC--ceecc---CcHHHHH----HHHHHh--ccCHHHHHHHHHHHHHhcCeEEEEEEEC Q lcl|NC_013644. 72 HGFFPEIVDQ----KTQYLLSNP--VEYET---ENEELKE----YLAEYY--NSEFQVVLQELVEGSSQKGFEYVYARTN 136 (510) Q Consensus 72 ~n~~~~Iv~~----~~~~l~g~p--~~~~~---~d~~~~~----~l~~~~--~n~~~~~~~e~~~~~~~~G~~~~~v~~d 136 (510) .+-....|+. ....|+|.+ |.|.+ +|....+ .++-++ .|+-.+.+...+++++.+|.|.+.|||+ T Consensus 76 ~~~v~~~ve~~~~~l~~~f~~~~~~~~~~P~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k~~W~ 155 (763) T protein:vir:95 76 PKLVRRQAEWRYSALTEPFLGSNKLFKVTPVTWEDVQGARQNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVRVGWN 155 (763) T ss_pred CHHHHHHHHHHHHHHHHhhcCCCcEEEEecCCcchHHHHHHHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEEEeee Confidence 4444444443 444445543 34543 3333323 344434 3566677889999999999999998764 Q ss_pred C------------------------------------------------------------------------------C Q lcl|NC_013644. 137 A------------------------------------------------------------------------------E 138 (510) Q Consensus 137 ~------------------------------------------------------------------------------~ 138 (510) . + T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k 235 (763) T protein:vir:95 156 REIRKEKQEVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLA 235 (763) T ss_pred eeeeeeeeeehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEec Confidence 1 1 Q ss_pred CceEEEEEcccceEEEEcCCCCce---eEEEEEEEEEeeC---C-----------ceeEEE-----------EEEEEc-- Q lcl|NC_013644. 139 DRLCFQVADSLNVFGVYNEYNELQ---RICRHYITEIEKD---G-----------ETVDIH-----------HAEVWT-- 188 (510) Q Consensus 139 g~~~i~~~~p~~~~~~~d~~~~~~---~~~~~~~~~~~~~---~-----------~~~~~~-----------~~e~y~-- 188 (510) ++|+|..|+|.++++-.+-..++. .+++..+....+- + ...... .....+ T Consensus 236 ~~p~ie~V~p~d~~iDp~a~sD~~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 315 (763) T protein:vir:95 236 NHPTVEMLNPENIIIDPSCQGDINKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQISDPM 315 (763) T ss_pred CceEEEeecHHHheecCCCCCchhhCceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccchhhccCCCcc Confidence 345777789988885332111111 1111111110000 0 000000 000000 Q ss_pred CCcEEEEEEcCCceeecccccccccccccccccccc--cccccccCCcccEEEec-----CCCCCCCcHHHHHHHHHHHH Q lcl|NC_013644. 189 DQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSEN--ESLLQRSYGQIPFYRLS-----NNKQETTDLKPIKALIDDYD 261 (510) Q Consensus 189 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~g~iPvv~~~-----nn~~g~sd~~~v~~liD~~n 261 (510) ...+..|++..... . ........+- ....+... ....|.++|++|++.++ ...+|.|.+..++++++.+| T Consensus 316 ~~~V~v~E~y~~~d-~-~gdg~~~~~~-v~~~g~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~~G~gi~~~~~d~Qr~~N 392 (763) T protein:vir:95 316 RKRVVAYEYWGFWD-I-EGNGVLEPIV-ATWIGSTLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDAELLGDNQAVLG 392 (763) T ss_pred cceEEEEEeeeeec-c-CCcceeEEEE-EEEEcCeeeecccccccCCCcCEEEecceeecCcccCCchHHHhhHHHHHHH Confidence 00111111000000 0 0000000000 00111111 12223445677776554 34568999999999999999 Q ss_pred HHHHHHHHHHHHhccceeE-EecCCCCchhhhhHhhhcCeeeeccCCCce----eEEeecCCHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 262 LMNCFLSNNLQDFAEAIYV-VSGFQGDDLSKLRQNVKSKKVVGTGSDGGL----DVKTVTIPTEGRKTKMEIDKENIYKF 336 (510) Q Consensus 262 ~~~S~~~~~~~~~~~~~lv-~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~l~~~i~~~ 336 (510) ..++.+.+.+....+|.+. ..|. .+ ..+... .+.+.++.+..++++ .++..+.........+..+...+-.. T Consensus 393 ~~~~~~~d~l~~~~~~~~~v~~ga-v~-~~d~~~-~~pg~v~~v~~g~~~~~~~~~~~~p~~~~~~~~~l~~~~~~~e~~ 469 (763) T protein:vir:95 393 AVMRGMIDLLGRSANGQRGMPKGM-LD-ALNSRR-YREGEDYEYNPTQNPAQMIIEHKFPELPQSALTMATLQNQEAESL 469 (763) T ss_pred HHHHHHHHHHHhhcCCcEEeeccc-cc-chhhhc-ccCCceEEeeCCCChhhhcccccCCCCcchHHHHHHHHHHHHHHh Confidence 9999999999988887554 3443 22 222221 234455555444432 23333333345555556665556666 Q ss_pred hCCcccccccc-----CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc----------cCC--ccc-- Q lcl|NC_013644. 337 GMAFDSTQVGD-----GNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRR----------YTK--AFD-- 397 (510) Q Consensus 337 s~~p~~~~~~~-----g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~----------~~~--~~~-- 397 (510) +++++.+.+.. +++||++ .+............+.|..+++.+.+.++.++... +.. .+. T Consensus 470 TGv~~~~~G~~~~~~~~tat~v~--~l~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v~v~~~ 547 (763) T protein:vir:95 470 TGVKAFAGGVTGESYGDVAAGIR--GVLDAASKREMAILRRLAKGMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTIKRE 547 (763) T ss_pred hCcchhhcCcCcccccchhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEeCCccccccHH Confidence 77776543322 2233333 33344444445556667677777777777665331 111 000 Q ss_pred ----cceeeEEeCCCCCCCH-HHHHHHHHHHHh-cC-CCch---HHHH----HhCC---CCC---------cHHHHH--- Q lcl|NC_013644. 398 ----PTEVSFTFTREVMVNE-TDIVNDEKTEAE-TR-KIIL---ESIL----QVAP---RLD---------DDNVLR--- 448 (510) Q Consensus 398 ----~~~v~i~f~~~~p~d~-~e~~~~~~~~~~-~g-~iS~---et~~----~~~~---~v~---------d~e~~~--- 448 (510) ..+|.|.-.. .+. .+.+..+..+.+ .| .+.. .-++ +... .+. ++..+. T Consensus 548 ~~~~~~DV~V~~~~---as~~~q~~~~l~~ll~~l~~~~~~~~~~~il~~~~d~~~~~~~~~~lr~~q~~~d~~~q~qaq 624 (763) T protein:vir:95 548 DLKGNFDLEVDIST---AEVDNQKSQDLGFMLQTIGPNVDQQITLNILAEIADLKRMPKLAHDLRTWQPQPDPVQEQLKQ 624 (763) T ss_pred HhcCCcceEEeccc---chHHHHHHHHHHHHHHHhccccChHHHHHHHHHHHhhhchhhhHHHHHhcCCCccchhhhHHH Confidence 1123332222 122 222332332222 11 1221 1111 1111 000 000000 Q ss_pred ----HHH-------HHHHHH--------HH------HH---------H------HHHHh-----------------hhcc Q lcl|NC_013644. 449 ----LIC-------EQFDLD--------WE------DV---------K------EALEE-----------------AEYT 471 (510) Q Consensus 449 ----~~~-------e~~e~~--------~~------~~---------~------~~~~~-----------------~~~~ 471 (510) ..+ .+.++. .+ .. . ....+ ...+ T Consensus 625 le~~~~q~e~~~~~akaq~~qaqa~~~~aq~e~~~~d~~~~e~~~Q~~~e~~~~~~~~eaq~~l~~~~a~~~~~~ea~~~ 704 (763) T protein:vir:95 625 LAVEKAQLENEELRSKIRLNDAQAQKAMAERDNKNLDYLEQESGTKHARDLEKMKAQSQGNQQLEITKALTKPRKEGELP 704 (763) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 000 000000 00 00 0 00000 0000 Q ss_pred CCCCCCC---CCcccCCCCCCcccc------cccCcccccccccCCCC Q lcl|NC_013644. 472 KGLSDNT---DEEETAVNPDDPTQQ------MAEGATGSTESQLPENG 510 (510) Q Consensus 472 ~~~~~~~---~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~ 510 (510) ....... ........+-.+..+ ..+.++|+...+.|--| T Consensus 705 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 752 (763) T protein:vir:95 705 PNLSAAIGYNALTNGEDTGIQSVSERDIAAEANPAYSLGSSQFDPTRD 752 (763) T ss_pred hhHHHhhhhcccccccCCCccchhhcccCccccccccCCCCCCCCCCc Confidence 0000000 000000000001111 11122233333333333 No 147 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=98.46 E-value=5.6e-07 Score=54.86 Aligned_cols=415 Identities=10% Similarity=-0.021 Sum_probs=168.4 Q ss_pred HHHHHHHhccCC-c----chhcccceeccccccccccccccceec------cchhHHHHHHHHhhhhcCCceecc--Cc- Q lcl|NC_013644. 32 AETGIRYYNHEN-D----IMNNRIFYVDDEGILREDKYASNVRIP------HGFFPEIVDQKTQYLLSNPVEYET--EN- 97 (510) Q Consensus 32 ~~~~~~YY~g~~-~----i~~~~~~~~~~~~~~~~~~~~~~~ki~------~n~~~~Iv~~~~~~l~g~p~~~~~--~d- 97 (510) |-.+.+...-.+ . ...+....++......-........|. +.=.-..|+..++-+.+-|+++-- ++ T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~V~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~ 80 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEARAWEPYDPSIYNLGAVAASGETVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGGS 80 (457) T ss_pred CchhhhhhcccccccccccccccccccchHHHhhcccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 222222111000 0 000000000000000000000000111 111122355555555566766421 11 Q ss_pred --HHHHHHHHHHhc---cC--HHHHHHHHHHHHHhcCeEEEEEEECCCCc-eEEEEEcccceEEEEcCCCC-ceeEEEEE Q lcl|NC_013644. 98 --EELKEYLAEYYN---SE--FQVVLQELVEGSSQKGFEYVYARTNAEDR-LCFQVADSLNVFGVYNEYNE-LQRICRHY 168 (510) Q Consensus 98 --~~~~~~l~~~~~---n~--~~~~~~e~~~~~~~~G~~~~~v~~d~~g~-~~i~~~~p~~~~~~~d~~~~-~~~~~~~~ 168 (510) +.....+..+++ |. .......+....+.+|.||+++..+ .|+ ..+.+++|..+.++.+.... .......| T Consensus 81 ~~~~~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~y 159 (457) T protein:vir:13 81 RKEIVTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQ-GPNIVGLDVLDPTKIHVHMVMVDGLRRKVFEAY 159 (457) T ss_pred ccccccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEccCceEEEEecCCCccceeEEEE Confidence 111222333332 22 2345566777888999999888665 455 46788889888876653322 11222222 Q ss_pred EEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCC Q lcl|NC_013644. 169 ITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETT 248 (510) Q Consensus 169 ~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~s 248 (510) .+ ..++. ......|.+..+.+++.-. .. +.-.|.| T Consensus 160 ~~--~~~~~---~~~~~~~~~~diih~~~~~-------------------------------~~---------~~~~G~s 194 (457) T protein:vir:13 160 DI--DADGN---EVLLGWFTPRDVLHIPGMM-------------------------------LP---------GDFVGCS 194 (457) T ss_pred EE--ecCCc---eeeEEeeCccceEEecCCC-------------------------------CC---------Ccccccc Confidence 11 11111 1122234444444443100 00 0124667 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh---hhhHhhh--------cCeeeeccCCCceeEEeecC Q lcl|NC_013644. 249 DLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS---KLRQNVK--------SKKVVGTGSDGGLDVKTVTI 317 (510) Q Consensus 249 d~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~---~~~~~~~--------~~~~~~~~~~~~~~~~~~~~ 317 (510) .+..+...|.....+-.-..+.+...+.|-.+++-...-..+ .++..+. .++++.++++.+++.++.+. T Consensus 195 ~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~ 274 (457) T protein:vir:13 195 PISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPGTMSEEGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSP 274 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCCh Confidence 676666666555554444455556666676666543221211 1221111 13456677766666665544 Q ss_pred CHHHHHHHHHHHHHHHHHHhCCcccccccc--CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCc Q lcl|NC_013644. 318 PTEGRKTKMEIDKENIYKFGMAFDSTQVGD--GNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKA 395 (510) Q Consensus 318 ~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~ 395 (510) ....+.+..+..++.|...-++|+.-.+.. ++.++..++-... ..+...|.-.++.|...+..+--.. T Consensus 275 ~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~----------~f~~~tl~P~~~~ie~~ln~~L~~~ 344 (457) T protein:vir:13 275 DEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNI----------AFTMFSLRPWLERIEAGFNRLLFAE 344 (457) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHH----------HHHHHHHHHHHHHHHHHHHHhhcCc Confidence 444556666777888888888887543222 2222222221111 1223333333333333333221111 Q ss_pred --cccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHHhhh-- Q lcl|NC_013644. 396 --FDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALEEAE-- 469 (510) Q Consensus 396 --~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~~~~-- 469 (510) .....+++.++.-+-.|..+.++.+.++.++|+++.-.++++++. +.+....+...- ..........+..+ T Consensus 345 ~~~~~~~i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~Pi~~g~~d~~~~~---~n~~~~~~~~~~~~~~ 421 (457) T protein:vir:13 345 TADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGEKYRVP---LNLGEVGEEPEPEPAP 421 (457) T ss_pred cccCceeEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccceeec---cccccccccccccccC Confidence 112235555667778899999999999999999999888877643 222100000000 00000000000000 Q ss_pred ccCCCCCCCCCcccCCCCCCcccccccCcccccccc Q lcl|NC_013644. 470 YTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQ 505 (510) Q Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (510) .+........+...+.+.++..++++.........+ T Consensus 422 ~~~~~~~~~~~~~~~~~~~g~~d~~~~~~~~~~~~~ 457 (457) T protein:vir:13 422 APPAIEPPAEEPDEEPEPEGKPDDEGATEEDDEDDA 457 (457) T ss_pred CCCCCCCCccccCCCCCCCCCCccccCCCCcccccC Confidence 000000000000000000010010111111111111 No 148 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=98.41 E-value=8e-07 Score=54.00 Aligned_cols=417 Identities=8% Similarity=0.003 Sum_probs=167.2 Q ss_pred HHHHHHhhhhhhhHHHHH---HHHHHhccCCcchhcccceeccccccccccccccce--eccchhHHHHHHHHhhhhcCC Q lcl|NC_013644. 16 LKAAIDKDRKSSSKREAE---TGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVR--IPHGFFPEIVDQKTQYLLSNP 90 (510) Q Consensus 16 i~~~i~~~~~~~~~~~~~---~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~k--i~~n~~~~Iv~~~~~~l~g~p 90 (510) +-.+....+..++..+.. -+-..+..-.+. .. +..... ...+.. +...=....|+..++-+.+-| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~--g~~~~g-~~v~~~~al~~~~V~~~v~~Ia~~iA~lp 70 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEP-------FA--GAWQQG-VKADPEAVLSFHAVFACISLISQDIAKMR 70 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhhhh-------hc--chhhcC-cccChHHhhccHHHHHHHHHHHHhhccCc Confidence 111111101001000000 000000000000 00 000000 000000 111112223444555555567 Q ss_pred ceec-cC-c---HHH-HHHHHHHhc--cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCC Q lcl|NC_013644. 91 VEYE-TE-N---EEL-KEYLAEYYN--SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEY 158 (510) Q Consensus 91 ~~~~-~~-d---~~~-~~~l~~~~~--n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~ 158 (510) +.+- .. + +.. ...+..++. |. .......++...+.+|.||+++-.+.+|++ .+.+++|..+-++.++. T Consensus 71 ~~~~~~~~~g~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~ 150 (454) T protein:vir:93 71 LRLMQTDAQGIRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRILDWNRVEPLVADD 150 (454) T ss_pred eEEEEeccCCccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCcceEEEEcCC Confidence 6641 11 1 111 112333332 22 235556677889999999999988888886 68889999998888765 Q ss_pred CCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEE Q lcl|NC_013644. 159 NELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFY 238 (510) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv 238 (510) +.+ .|.+......... ....+....+.+++... T Consensus 151 g~~-----~y~~~~~~~~~~~---~~~~~~~~eViH~k~~~--------------------------------------- 183 (454) T protein:vir:93 151 GEV-----FYRITPDRNCGIT---EAVTVPAREVIHDRFNC--------------------------------------- 183 (454) T ss_pred CcE-----EEEEEeccccccc---eeEEecCcceEEeccCC--------------------------------------- Confidence 432 1211111111000 01123333333332110 Q ss_pred EecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh---hhhHhhh-------cCeeeeccCCC Q lcl|NC_013644. 239 RLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS---KLRQNVK-------SKKVVGTGSDG 308 (510) Q Consensus 239 ~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~---~~~~~~~-------~~~~~~~~~~~ 308 (510) ..+.-.|.|.+......+.....+..-..+.+...+.|-.+++-...-+.+ .+...+. .++++.++++. T Consensus 184 -~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~ 262 (454) T protein:vir:93 184 -FFHPLIGLPPVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPGSITEENAKKLKSNWDSGYTGENAGKTAILSNGA 262 (454) T ss_pred -CCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCCCHHHHHHHHHHHHHHhcccccCCceeccCCc Confidence 001123666666555555544444444444455555565555432211111 1211111 23355566666 Q ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 309 GLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDI 388 (510) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~ 388 (510) +++.++.......+.+..+...+.|+..-++|+.-.+.....+...++.. ....+...|.-+++.|...+ T Consensus 263 ~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~----------~~~f~~~~l~P~~~~ie~~l 332 (454) T protein:vir:93 263 KYNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEAL----------EQQYYSQCLQTLIESIELLL 332 (454) T ss_pred eEEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHH----------HHHHHHHHHHHHHHHHHHHH Confidence 66666654444555667777888888888888854433222222111111 11222233333333333322 Q ss_pred hhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 389 NRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALE 466 (510) Q Consensus 389 ~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~ 466 (510) ..+--... ...+++.++.-+..|..+.++.+.++..+|+++.-.++++++. +..-++.. .. . .........+ T Consensus 333 n~~L~~~~-~~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~~-~~-~---~~~~~~~~~~ 406 (454) T protein:vir:93 333 DEALETGE-NESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGDALY-LQ-Q---QNYSLEALSR 406 (454) T ss_pred HHhhcCCC-CcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeee-ec-c---CccchHhhhc Confidence 22111111 1235556667778899999999999999999999888887644 21101000 00 0 0000000000 Q ss_pred hhhccCCCCCCCCCcccCCCCCCcccccccCcccccccccCCC----C Q lcl|NC_013644. 467 EAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLPEN----G 510 (510) Q Consensus 467 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~ 510 (510) .+. ..+ ...+..++...+... .+.+.+...++...-+.+ | T Consensus 407 ~~~-~~~--~~~~~~~~~~~~~~~-~~~d~~~~~~e~~~d~~~~~~~~ 450 (454) T protein:vir:93 407 RDA-RED--PFASSGKTASVPQAV-AASDGNKAITETEHDAVKAMFRG 450 (454) T ss_pred cCc-ccC--CCCCCccCCCCCCCC-CCCCCCCCccCCccchhhhhhhh Confidence 000 000 000011111111000 000001000000000000 0 No 149 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=98.40 E-value=8.1e-07 Score=53.97 Aligned_cols=451 Identities=12% Similarity=0.083 Sum_probs=164.5 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHH----------------HHHHHHHHhccCCcchhccccee--ccccccccc Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKR----------------EAETGIRYYNHENDIMNNRIFYV--DDEGILRED 62 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~----------------~~~~~~~YY~g~~~i~~~~~~~~--~~~~~~~~~ 62 (510) ++-.+.-+ +.-|++++.+..++ ....+.+.-.++......+-... ...+...+. T Consensus 5 ~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (574) T protein:vir:80 5 LDKALGIE--------KSSIEETRNMENYKMHLREIDTNVVNNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKP 76 (574) T ss_pred hhhhhccc--------hhhHHHHHhhhhhccccchhhhhhhhccCCCHHHHHHhHhhhcccccchhhhhccccccccCcC Confidence 11111111 11122222221111 11112222222221110000000 000001111 Q ss_pred cccccc------e-e-ccchhHHHHHHHHhhhh-----------cCCceeccC--c-------HHHHHHHHHHhcc---- Q lcl|NC_013644. 63 KYASNV------R-I-PHGFFPEIVDQKTQYLL-----------SNPVEYETE--N-------EELKEYLAEYYNS---- 110 (510) Q Consensus 63 ~~~~~~------k-i-~~n~~~~Iv~~~~~~l~-----------g~p~~~~~~--d-------~~~~~~l~~~~~n---- 110 (510) ..++.. + . ..+....+++..++-++ |-|..+-.. + ......|..++.+ T Consensus 77 ~~~~~~~~~~~l~~~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~ 156 (574) T protein:vir:80 77 SIRNSQDLHKTLKKFGNNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQF 156 (574) T ss_pred ccCCcccHHHHHHhhccChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCC Confidence 111100 0 0 01223344444433221 334433111 0 1111233444321 Q ss_pred ------CHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEE Q lcl|NC_013644. 111 ------EFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHH 183 (510) Q Consensus 111 ------~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 183 (510) .+......+..+.+.+|.+|+.+-++.+|++ .+.+++|..+.+..+..+....--..|+.. ..+.. T Consensus 157 ~nP~~~s~~ef~~~lv~~lll~Gnayi~i~r~~~G~~~~L~pl~p~~V~v~~d~~~~~~~~~~~y~~~-~~g~~------ 229 (574) T protein:vir:80 157 RDPNRDNFTTFCKKLVRATYMYDQVNFEKVFDKDGNFIKFDTVDPTTIFLATNGEGKLIKNGERFVQV-IDNRI------ 229 (574) T ss_pred CCCccccHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCccccccCceEEEEE-eCCce------ Confidence 2334556677888999999998888888886 478899999988876433211100111111 11110 Q ss_pred EEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHH Q lcl|NC_013644. 184 AEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLM 263 (510) Q Consensus 184 ~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~ 263 (510) ...+....+.+++.... ++ ......|.|.+..+...|+....+ T Consensus 230 ~~~~~~~eiih~~~~~~----------------------------~~---------~~~~~~G~spi~~a~~~i~~~~~a 272 (574) T protein:vir:80 230 VAKFNERELAFAVRNPR----------------------------AD---------IEVGQYGYPELEIALKQFIAHENT 272 (574) T ss_pred EEEEccccEEEEeccCC----------------------------CC---------cccccccccHHHHHHHHHHHHHHH Confidence 11223333444332100 00 001124666676666666655555 Q ss_pred HHHHHHHHHHhccceeEE--ecCC-CCc--hhhhhHhhh--------cCe-eeeccCCCceeEEeecCCHHHHHHHHHHH Q lcl|NC_013644. 264 NCFLSNNLQDFAEAIYVV--SGFQ-GDD--LSKLRQNVK--------SKK-VVGTGSDGGLDVKTVTIPTEGRKTKMEID 329 (510) Q Consensus 264 ~S~~~~~~~~~~~~~lv~--~g~~-~~~--~~~~~~~~~--------~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 329 (510) ..-..+.+...+.|-.++ .+.. .++ ...+...+. .++ ++.++++.++.-++.......+....+.. T Consensus 273 ~~~~~~~f~ng~~p~gil~~~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl~~~G~~~~~l~~s~~D~qfle~~~~~ 352 (574) T protein:vir:80 273 EVFNDRFFSHGGTTRGILHVKTGQQQSQQALDIFRREWRSSLAGINGSWQIPVVSAEDVKFVNMTPSANDMQFEKWLNYL 352 (574) T ss_pred HHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEEccCChhHHHHHHHHHHH Confidence 444455555556666444 3322 121 122222211 112 23334444444444444445566777778 Q ss_pred HHHHHHHhCCcccccccc--CcccHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeC Q lcl|NC_013644. 330 KENIYKFGMAFDSTQVGD--GNITNIVIK-ARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFT 406 (510) Q Consensus 330 ~~~i~~~s~~p~~~~~~~--g~~Sg~Ai~-~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~ 406 (510) .+.|...-++|+.-.+.. +...|.... ..++.+. ......+..+|.-+++.|...+...--..+. ..+.+.|. T Consensus 353 ~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E---~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~-~~~~~~f~ 428 (574) T protein:vir:80 353 INVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSK---EKMQASQNKGLQPLLRFIEDTVNTYIVAEFG-EKYQFQFR 428 (574) T ss_pred HHHHHHHhCCCHHHhcccccccccccccccccchhHH---HHHHHHHHHHHHHHHHHHHHHHHhhhhhhcC-CceEEEec Confidence 888888888887532211 111110000 0000011 1111223333333333333333322111121 24567888 Q ss_pred CCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCc-HHHHH-----HHHHHHHHH-HHHHHHHHHhh-hc--cCCC Q lcl|NC_013644. 407 REVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDD-DNVLR-----LICEQFDLD-WEDVKEALEEA-EY--TKGL 474 (510) Q Consensus 407 ~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d-~e~~~-----~~~e~~e~~-~~~~~~~~~~~-~~--~~~~ 474 (510) +.-..+.++.... ..+..+|+++.-.++++++. +.. +.... ......... .+......... +. .... T Consensus 429 ~~d~~~~~~~~~~-~~~~~~G~lT~NE~R~~lgl~Pi~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (574) T protein:vir:80 429 GGDLSAQLDKLKI-IEQEGKVFRTVNEIRHDKGLEPIKGGDVILNGVHIQAIGQALQEEQLEYQRSQDRLNRLLELSGGD 507 (574) T ss_pred ccchhhHHHHHHH-HHHHhCCccCHHHHHHHhCCCCCCCCCEeeeccceeecccccccccCCccchhccccccccccCCC Confidence 7666666665543 45677899999888887533 221 00000 000000000 00000000000 00 0000 Q ss_pred CCCCCCccc--CCCCCCcccccccCcccccccc----------cCCCC Q lcl|NC_013644. 475 SDNTDEEET--AVNPDDPTQQMAEGATGSTESQ----------LPENG 510 (510) Q Consensus 475 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~----------~~~~~ 510 (510) +..++..++ ..++++++. .+..++.++.. .=..| T Consensus 508 ~~~~~~~~p~~~~~d~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 553 (574) T protein:vir:80 508 VEQPEPEEPKDSQNDTDVSF--QDEQQGLNGKSKKVNGKVDDNVGKDG 553 (574) T ss_pred CCCCCCCCCCCccccccchh--hhhhhhhccchhhhcCCccccccccc Confidence 000000111 111111111 11111111111 11111 No 150 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=98.39 E-value=8.9e-07 Score=53.75 Aligned_cols=391 Identities=10% Similarity=0.023 Sum_probs=175.4 Q ss_pred hhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcC Q lcl|NC_013644. 10 KIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSN 89 (510) Q Consensus 10 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~ 89 (510) .+..+++.+.- ... .....-...+..++-|..-. .+..+. ++.-+..+.....|+..++-+.+- T Consensus 1 m~~~~~f~~~~-~~~-~~~~~~~~~~~~~~~~~~~~----------~~~~v~----~~~al~~~~v~~~i~~Ia~~ia~l 64 (416) T protein:vir:12 1 MLLERMFEKRS-GSS-DHEDGFNNILLNMFGGRKTA----------SGERVS----ESNSLVQPDIFACVNVLSDDIAKL 64 (416) T ss_pred Cccchhccccc-Ccc-ccCccchhHHHHhhcCcccc----------cCceec----hhhhhccHHHHHHHHHHHHhhhhC Confidence 11111111100 000 00000111223333322100 000000 001112233344556666666666 Q ss_pred Ccee-ccCcH---HHH--HHHHHHhc--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcC Q lcl|NC_013644. 90 PVEY-ETENE---ELK--EYLAEYYN--S---EFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNE 157 (510) Q Consensus 90 p~~~-~~~d~---~~~--~~l~~~~~--n---~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~ 157 (510) |+++ ...+. ... .....++. | ........++...+.+|.||+++..+..|.+ .+.+++|..+-++.++ T Consensus 65 ~~~~~~~~~~~~~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~ 144 (416) T protein:vir:12 65 PIHTYKRTDGGIERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEALFPLRPDYTNAYVHP 144 (416) T ss_pred ceEEEEecCCccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEeC Confidence 7664 21111 110 11122221 2 2335556677888999999999988888876 5888999988877654 Q ss_pred CCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccE Q lcl|NC_013644. 158 YNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPF 237 (510) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 237 (510) .+.. .|+.... ++.. + .+.+..+.+++.- + T Consensus 145 ~~~~-----~~~~~~~-~g~~-----~-~~~~~eiih~~~~-------------------------------------~- 174 (416) T protein:vir:12 145 TTGM-----LWYQTVL-NGKA-----I-ELYDYEVLHFKGL-------------------------------------S- 174 (416) T ss_pred CCcE-----EEEEEec-CCeE-----E-EecCccEEEecCc-------------------------------------C- Confidence 3321 1111111 1110 1 1223333333210 0 Q ss_pred EEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh---hhhHh----hhcCeeeeccCCCce Q lcl|NC_013644. 238 YRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS---KLRQN----VKSKKVVGTGSDGGL 310 (510) Q Consensus 238 v~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~---~~~~~----~~~~~~~~~~~~~~~ 310 (510) .+...|.|.+..+...++....+..-..+.++..+.|-.+++-...-+.+ .+... ...++++.++++.++ T Consensus 175 ---~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~vl~~g~~~ 251 (416) T protein:vir:12 175 ---TDGIHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPAFLDEKPKENVRKEWKRVNKVENIAIIDYGLEY 251 (416) T ss_pred ---CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCCCCCHHHHHHHHHHHHHHhcCCCeeecCCCceE Confidence 01124666666666666665555555556666667776666532221111 12221 123456667777666 Q ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_013644. 311 DVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINR 390 (510) Q Consensus 311 ~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~ 390 (510) +.++.......+.+..+...+.|...-++|+.-.+..+..+...++.. ....+...|.-+++.|...+.. T Consensus 252 ~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~----------~~~f~~~~l~P~~~~ie~~l~~ 321 (416) T protein:vir:12 252 QSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQ----------SIEYVRNTLQPWIVNFEQELNV 321 (416) T ss_pred EEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHH----------HHHHHHHHHHHHHHHHHHHHHH Confidence 666554444556677788888888888888864433222211111111 1123344555555555544443 Q ss_pred ccCCccc---cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 391 RYTKAFD---PTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEAL 465 (510) Q Consensus 391 ~~~~~~~---~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~ 465 (510) +--...+ ...+++.+..-+..|..+.++.+.++..+|+++.-.++++++. +++-+.. .... T Consensus 322 ~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ggd~~--------------~~~~ 387 (416) T protein:vir:12 322 KLFLDHDQKSGHYVKFNIDSELRGDSKTQAEYLKTLHETGVLNKDEIRELLERNPIENGDKY--------------ISSL 387 (416) T ss_pred hhcCchhhcCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccee--------------eecc Confidence 2111111 1234455566678899999999999999999999888887643 2211100 0001 Q ss_pred HhhhccCCCCCCCCCccc-CCCCCCcccccccC Q lcl|NC_013644. 466 EEAEYTKGLSDNTDEEET-AVNPDDPTQQMAEG 497 (510) Q Consensus 466 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 497 (510) +..+.. ..+..+.. .+....++++...| T Consensus 388 n~~~~~----~~~~~~~~~~~~~~~gge~~~~g 416 (416) T protein:vir:12 388 NYVFLD----FLEEYQRLKAGGAMKGGDNKNEG 416 (416) T ss_pred cccccc----ccchhhccccccccCCCCCcCCC Confidence 000000 00000000 00000011111111 No 151 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=98.38 E-value=9.2e-07 Score=53.68 Aligned_cols=389 Identities=9% Similarity=-0.039 Sum_probs=168.7 Q ss_pred hhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhc Q lcl|NC_013644. 9 VKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLS 88 (510) Q Consensus 9 ~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g 88 (510) +.+ +..+..+.. .........+.+.+.+..+- ..+..+... .=+..+-....|+..++-+.+ T Consensus 1 Mg~----f~~lf~r~~-~~~~~~~~~~~~~~~~~~~~---------~~g~~v~~~----~al~~~~v~~~i~~Ia~~ia~ 62 (414) T protein:vir:44 1 MVF----FSGLFQRKS-DAPVTTPAELADAIGLSYDT---------YTGKQISSQ----RAMRLTAVFSCVRVLAESVGM 62 (414) T ss_pred Cch----hhhhhccCc-cCcccchhhHhHhhccCccc---------cCCceechh----hhhccHHHHHHHHHHHHHhcc Confidence 000 001100000 00000001111111111000 000000000 001122233445555555666 Q ss_pred CCceeccCc-----HHHHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEc Q lcl|NC_013644. 89 NPVEYETEN-----EELKEYLAEYYN---S---EFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYN 156 (510) Q Consensus 89 ~p~~~~~~d-----~~~~~~l~~~~~---n---~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d 156 (510) -|+++--.+ .....-+..++. | ........+....+.+|.||+++..+ .|++ .+.+++|..+.+.++ T Consensus 63 ~p~~~~~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-~g~~~~L~~l~~~~v~~~~~ 141 (414) T protein:vir:44 63 LPCNLYHLNGSLKQRATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA-FGEVAELLPVDPGCVVPKLN 141 (414) T ss_pred CceEEEEecCCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCceEEEEEC Confidence 676642111 111111222221 2 23455566778889999999988766 5666 588899999988887 Q ss_pred CCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCccc Q lcl|NC_013644. 157 EYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIP 236 (510) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP 236 (510) +.+++ +|++.... +. ...+.+..+.+++.- + T Consensus 142 ~~~~~-----~y~~~~~~-g~------~~~~~~~evih~~~~-------------------------------------~ 172 (414) T protein:vir:44 142 SSWEP-----VYQVTFPD-GS------TDVLSQEDIWHVRTL-------------------------------------T 172 (414) T ss_pred CCCcE-----EEEEEecC-ce------EEEEccccEEEecCC-------------------------------------C Confidence 65432 22222221 11 112334444443310 0 Q ss_pred EEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCch---hhhhHhh----h----cCeeeecc Q lcl|NC_013644. 237 FYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDL---SKLRQNV----K----SKKVVGTG 305 (510) Q Consensus 237 vv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~---~~~~~~~----~----~~~~~~~~ 305 (510) .+...|.|.+..+...++....+..-..+.+...+.|-.+++....-+. ..+.... . .++++.++ T Consensus 173 ----~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~ 248 (414) T protein:vir:44 173 ----LDGLVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQTLSDQAYERLKKDFEERHTGLGNAHRPMILE 248 (414) T ss_pred ----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCcceecC Confidence 0112366666666666655555544455555666667666554322121 1222211 1 12355566 Q ss_pred CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 306 SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVI 385 (510) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~ 385 (510) ++.+.+.++.+.....+.+..+...+.|+..-++|+.-.+..+..+...++.. ....+..+|.-+++.|. T Consensus 249 ~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~----------~~~~~~~~l~P~~~~ie 318 (414) T protein:vir:44 249 MGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEEL----------GLGFINYSLVPYLTRIE 318 (414) T ss_pred CCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH----------HHHHHHHHHHHHHHHHH Confidence 65555555443334455666777778888888888854433222221111111 12233445555555554 Q ss_pred HHHhhccCCccc--cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 386 DDINRRYTKAFD--PTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKE 463 (510) Q Consensus 386 ~~~~~~~~~~~~--~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~ 463 (510) ..+..+--.... ...+++.+...+..|..+.++.+.++..+|+++.-.++++++.-.-+.- ..... T Consensus 319 ~~ln~~L~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~gg------------D~~~~ 386 (414) T protein:vir:44 319 QRINTGLVRKSKQGVFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGG------------DVYLT 386 (414) T ss_pred HHHHhhcCCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc------------ceecc Confidence 444332111111 1224444456667899999999999999999999888887654211000 00000 Q ss_pred HHHhhhccCCCCCCCCCcccCCCCCCccccccc Q lcl|NC_013644. 464 ALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAE 496 (510) Q Consensus 464 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (510) ..+....+. . ......++++++ .++.++ T Consensus 387 ~~n~~~~~~---~-~~~~~~~~~~~~-~d~~~~ 414 (414) T protein:vir:44 387 PMNMTTKPS---D-GSKAGKQKDNAN-ADETTS 414 (414) T ss_pred cccccccCC---c-cccCCCCCCCCC-CCCCCC Confidence 011110000 0 000000111111 111111 No 152 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=98.38 E-value=9.3e-07 Score=53.66 Aligned_cols=422 Identities=9% Similarity=0.027 Sum_probs=164.0 Q ss_pred hhhhhhhHHHHHHHHHHhccCCcchhcccceecccccccccccc----ccceeccchhHHHHHHHHhhhhcCCceeccCc Q lcl|NC_013644. 22 KDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYA----SNVRIPHGFFPEIVDQKTQYLLSNPVEYETEN 97 (510) Q Consensus 22 ~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~----~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d 97 (510) -|..... ...+.+|-.-+...... .......+...++... .+.--...+....|+..+..+.+-|+.+...+ T Consensus 1 ~~~~~~~---~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~pp~~~~~La~~~~~n~~v~scI~~ia~~ia~~~~~i~~~~ 76 (540) T protein:vir:41 1 MFNYHLS---IKSLEKYRAIKGDTDSQ-ALKEDRFEEYVEPKVHPLVLLSLLQVNPYHASACSIKANDILRTGYLIDGDD 76 (540) T ss_pred CCCcccC---hhhccchhhhhcccccc-ccccCCCCccccCCCCHHHHHHHHHhcHHHHHHHHHHHHHHhcCCceEecCc Confidence 1111111 11122221111110000 0011111111111110 00011235667778888888889998887666 Q ss_pred HHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCC Q lcl|NC_013644. 98 EELKEYLAEYYNSEFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNELQRICRHYITEIEKDG 176 (510) Q Consensus 98 ~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 176 (510) .....++-..+ .........+..+.+.+|.||+.+..+..|++ .+.+++|..+-+.-+... ++. ..++. T Consensus 77 ~~~~~~lpN~~-~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~~L~~i~~~~V~v~~~~~~-------~~~--~~d~~ 146 (540) T protein:vir:41 77 GGVEELLRACR-PSFEFILLQALEDLQVFNYCTLEVVRDDQGEPVRLDYIPAHTVRVHRDGSR-------YMQ--TWDGI 146 (540) T ss_pred cchhhhccCCC-CCHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCcceEEeEcCce-------eEe--eecCc Confidence 65544432111 12455666778889999999999988888875 578888888876554321 111 11110 Q ss_pred ceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCC-----CCCCCcHH Q lcl|NC_013644. 177 ETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNN-----KQETTDLK 251 (510) Q Consensus 177 ~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn-----~~g~sd~~ 251 (510) ...++..|........ ..+. ....+..=.|+|+++. ..|.|.+. T Consensus 147 ---~~~~~~~~~~~~~~~~--~~g~--------------------------~~~~~~~~eViHir~~~~~~~~~G~Spi~ 195 (540) T protein:vir:41 147 ---HVTYFKDYRYEGEVNP--DNGE--------------------------DQDGVGANEIIFIHLPSPICSYYGVPRYL 195 (540) T ss_pred ---eeeeeecccccceeec--cccc--------------------------cceeecccceEEecCCCCCCCcccccHHH Confidence 1111111111110000 0000 0001111135666542 24777666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccceeEEe--cCCCCch-----------hhhhHhh---------hcCeeeecc---- Q lcl|NC_013644. 252 PIKALIDDYDLMNCFLSNNLQDFAEAIYVVS--GFQGDDL-----------SKLRQNV---------KSKKVVGTG---- 305 (510) Q Consensus 252 ~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~--g~~~~~~-----------~~~~~~~---------~~~~~~~~~---- 305 (510) .....+.....+..-..+.+...+.|-.++. |.-.... ..+.... ..++++.+. T Consensus 196 ~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~ 275 (540) T protein:vir:41 196 SAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGG 275 (540) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEecCCC Confidence 5554444443333333444555556655543 3211110 0011111 112233332 Q ss_pred CCCceeEEeecC--CHHHHHHHHHHHHHHHHHHhCCcccccc----ccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 306 SDGGLDVKTVTI--PTEGRKTKMEIDKENIYKFGMAFDSTQV----GDGNITNIVIKARYTLLNMKANKTEARLRALLEW 379 (510) Q Consensus 306 ~~~~~~~~~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~----~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~ 379 (510) .+++++|..... ....+.+..+...+.|...-++|+.-.+ +..+-|... ..... .+...|.- T Consensus 276 ~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~e--q~~~~----------f~~~tL~P 343 (540) T protein:vir:41 276 DTVEVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAE--VARRT----------YYESVVRP 343 (540) T ss_pred cccceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHH--HHHHH----------HHHHHHHH Confidence 134566654443 3445667777888888888888875332 122222211 11111 11112222 Q ss_pred HHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCc-HHHHHHHHHHHHHHH Q lcl|NC_013644. 380 MNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDD-DNVLRLICEQFDLDW 458 (510) Q Consensus 380 ~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d-~e~~~~~~e~~e~~~ 458 (510) +++.|...++..-..... ..+.+.|+..-.... +.+..+.+++++|+++.-.+++.++.++. ++.... .- .... T Consensus 344 ~~~~ie~~ln~~L~~~~~-~~~~i~f~~~~ll~~-D~~~~~~~lv~~G~lT~NE~Re~L~g~e~gdd~~l~-p~--n~~~ 418 (540) T protein:vir:41 344 QQEIVSSVLTDFIQLKLD-PGARFVFNEEILMES-EFVHNYALLVQCGVLTPSEVREKLFGLDGGPDMFMV-PS--SIGK 418 (540) T ss_pred HHHHHHHHHHHhhhhccC-CceEEEecchhhcch-HHHHHHHHHHhCCCCCHHHHHHHhCcCcCCCccccc-cc--cccc Confidence 222222222111101111 235567765433322 34455667889999999888875543332 111000 00 0000 Q ss_pred HHHHHHHHhhh--ccCCCCCCCCCcccCCCCCCcccccccCcccccccccCCC------------C Q lcl|NC_013644. 459 EDVKEALEEAE--YTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLPEN------------G 510 (510) Q Consensus 459 ~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~ 510 (510) .......+..+ .+.....-..+.++..+++.+++. ..++....++. | T Consensus 419 ~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~ 479 (540) T protein:vir:41 419 SAMKRQKRNYEKNQINEIKRTYAKYKPRIQEIISSES-----PLEDKKKKIDEVLSDFRAEAYENG 479 (540) T ss_pred ccccccccccCCCCccccccccchhcccccCcccccc-----ccccccccccccccccCCccccch Confidence 00000000000 000000000001111111000000 00011111111 1 No 153 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=98.38 E-value=9.6e-07 Score=53.59 Aligned_cols=437 Identities=13% Similarity=0.058 Sum_probs=181.7 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |+-.+..+.+.+.+...++-.+.... -.++..+.+|..-. + ..... ......|+..+-+..-++ T Consensus 1 ~~~~~~~e~~~l~~r~~~Lk~~R~~~--e~~w~e~~~~~lP~-----~--~~~~~-------~~~~~~~~~dstg~~a~~ 64 (517) T protein:vir:10 1 MDMRFAGNKSKIPKLYEQLVGKRSPF--LSRAENYSRFTLPY-----L--MADVN-------DDLSSQNAWQDDGASATN 64 (517) T ss_pred CcccccccHHHHHHHHHHHHHhhhHH--HHHHHHHHHHhccc-----c--ccCCC-------CCccccccccchHHHHHH Confidence 77766666665555444442222111 12345555554321 0 00000 011123455666777777 Q ss_pred HHHhhhhc--CCce-----eccCcHH-------------HHHHH-------HH-HhccCHHHHHHHHHHHHHhcCeEEEE Q lcl|NC_013644. 81 QKTQYLLS--NPVE-----YETENEE-------------LKEYL-------AE-YYNSEFQVVLQELVEGSSQKGFEYVY 132 (510) Q Consensus 81 ~~~~~l~g--~p~~-----~~~~d~~-------------~~~~l-------~~-~~~n~~~~~~~e~~~~~~~~G~~~~~ 132 (510) +.++.|++ -|+. +...++. +...| .. +..+||.....++.++..++|.|.++ T Consensus 65 ~LAa~l~~~ltpp~~~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly 144 (517) T protein:vir:10 65 FLSNKLSQVLFPAQRSFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMMY 144 (517) T ss_pred HHHHHHHHhhcCCCCccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEE Confidence 77777654 2322 2332221 22222 12 22468889999999999999998654 Q ss_pred EEECCCCceEEEEEcccceEEEEcCCCCceeEEEEEEEEEee-----C---------CceeEEEEEEEEcCCcEEEEEEc Q lcl|NC_013644. 133 ARTNAEDRLCFQVADSLNVFGVYNEYNELQRICRHYITEIEK-----D---------GETVDIHHAEVWTDQNVYFFVAE 198 (510) Q Consensus 133 v~~d~~g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~-----~---------~~~~~~~~~e~y~~~~i~~~~~~ 198 (510) .++ +...++.++-.+++..-|..+++..+++-....... . .....-..+++|+. . +... T Consensus 145 --~~~-~~~~~~~~pl~~y~v~~d~~G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~---v-~~~~ 217 (517) T protein:vir:10 145 --HPD-KTSPIQAVPLHHYCVRRDNNGTVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTH---A-KRTK 217 (517) T ss_pred --EeC-CCCcEEEEEcCeEEEeeCCCcCeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEE---E-EEeC Confidence 443 333455566667666667777766665433222100 0 00001112233321 0 0111 Q ss_pred CCceeecccccccccccccccccccc-cccccccCCcccEEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 199 DNKDYELDEAEPINPRPHVLAVDSEN-ESLLQRSYGQIPFYRLS-----NNKQETTDLKPIKALIDDYDLMNCFLSNNLQ 272 (510) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~g~iPvv~~~-----nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~ 272 (510) ++.+. .....++.. ......+|..+|++.++ ++.+|+|-.++..+-+..+|.+.-....... T Consensus 218 ~~~~~------------~~~~~d~~~~~~~s~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~ 285 (517) T protein:vir:10 218 DGKYL------------IRQSADDVPVGKESTVTEDKSPFLILTWKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGMA 285 (517) T ss_pred CCceE------------EEEEeCceeeccccccccccCCeeeeeeeecCCCCcccchHHHhHHHHHHHHHHHHHHHHHHH Confidence 11110 000001111 11112335667777654 3467999888899999999988777777666 Q ss_pred HhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCccccccccCcc Q lcl|NC_013644. 273 DFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNI 350 (510) Q Consensus 273 ~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~ 350 (510) ....|.+.+.-....+...+.. ...+.+..+..+++..+... .+.......++.++..|-..-..-....-..... T Consensus 286 ~a~~~~~lv~~~~~~~~~~l~~--~~~g~~~~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~~~~~rv 363 (517) T protein:vir:10 286 LMADVKYLVKPGSYTDINQFVE--GGSGAVLHGVEGDIHIVQLGKYADYTPIQAVLNDYRQRIGRVFMMEAMTRRDAERV 363 (517) T ss_pred HhccCCcccCcccccchhhccC--CCccccccCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhhccCCccc Confidence 6676666542211222222211 11123334444566665533 3566667777777776655322111111122334 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHhhccCCccccceeeEEeCCCCCC-CHHHHHHHHH Q lcl|NC_013644. 351 TNIVIKARYTLLNMKANKTEARLRALLEWM--------NKLVIDDINRRYTKAFDPTEVSFTFTREVMV-NETDIVNDEK 421 (510) Q Consensus 351 Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~--------~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~-d~~e~~~~~~ 421 (510) |++.+.. +..+|...++..+.++ +..++..+..... ...+.+.+.-++.. .....++.+. T Consensus 364 TAtEV~~-------r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~l~~~l~----~~~v~~~~~s~la~l~r~~~~~~i~ 432 (517) T protein:vir:10 364 TAYEIQR-------DAMLVEQSLGGVYSLFATTFQGPLARWFMNGISSILT----SKNVSPTILTGIEALGRMAELDKLG 432 (517) T ss_pred cHHHHHH-------HHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhhhcC----CCCccceeeccHHHHHHHHHHHHHH Confidence 5555543 5566666677665552 2222222211111 11233333322211 1111111111 Q ss_pred HHHh-cCCC--chHHHHHh-------------C--C--CC-CcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCC Q lcl|NC_013644. 422 TEAE-TRKI--ILESILQV-------------A--P--RL-DDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDE 480 (510) Q Consensus 422 ~~~~-~g~i--S~et~~~~-------------~--~--~v-~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (510) ...+ .+.+ -.+.+... + | .+ +++|..+..++..+.+.. +..++... ......-.+ T Consensus 433 ~~~~~i~~~a~~~~~~~~~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~~~~~~~~~~~--~~~~~~ag--~~~~~~~~~ 508 (517) T protein:vir:10 433 TFNGYVSMTAQWPEPLQQAIKWPDFTDWVQGQISANFPFFKTQDELNAEAQAQQEQEAT--KYAAEQAG--KAIPDMVKN 508 (517) T ss_pred HHHHHHHHhhcCChHHHhcCCHHHHHHHHHHHhCCChhhcCCHHHHHHHHHHHHHHHHH--HHHHHHHH--HHHHHHHhC Confidence 1100 0000 01222211 1 1 11 123322222222111111 11111110 000010111 Q ss_pred cccCCCCCC Q lcl|NC_013644. 481 EETAVNPDD 489 (510) Q Consensus 481 ~~~~~~~~~ 489 (510) ....++++. T Consensus 509 ~~~~~~~~~ 517 (517) T protein:vir:10 509 GQINPQGGQ 517 (517) T ss_pred CCCCCCCCC Confidence 111111111 No 154 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=98.35 E-value=1.1e-06 Score=53.22 Aligned_cols=454 Identities=11% Similarity=0.075 Sum_probs=193.5 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhh--HHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSS--KREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~--~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) |-. ..+-..++.+++..+..+..+. -.++..+.+|..-.- .....+. ..+...++..+-+..- T Consensus 1 m~~---~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~-------~~~~~~~-----~~~~~~~~~dst~~~a 65 (536) T protein:vir:21 1 MAE---KRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSL-------FPKDSDN-----ASTDYQTPWQAVGARG 65 (536) T ss_pred Ccc---hhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc-------cCCCCCc-----ccccccccccccHHHH Confidence 322 1223345666666666554321 123444444432210 1111111 1111235666677777 Q ss_pred HHHHHhhhhc--CCc----eeccCcH-------------HHHH-------HHHHHh-ccCHHHHHHHHHHHHHhcCeEEE Q lcl|NC_013644. 79 VDQKTQYLLS--NPV----EYETENE-------------ELKE-------YLAEYY-NSEFQVVLQELVEGSSQKGFEYV 131 (510) Q Consensus 79 v~~~~~~l~g--~p~----~~~~~d~-------------~~~~-------~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~ 131 (510) +++.++.|++ -|. ++...+. +++. .+...+ .+||...+.++.++..++|.|.+ T Consensus 66 ~~~Laa~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l 145 (536) T protein:vir:21 66 LNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLL 145 (536) T ss_pred HHHHHHHHHHhhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeE Confidence 7777776654 131 1222221 1111 222223 46788889999999999999877 Q ss_pred EEEECCCCce-EEEEEcccceEEEEcCCCCceeEEEEEEEEEe------------eCCceeEEEEEEEEcCCcEEEEEEc Q lcl|NC_013644. 132 YARTNAEDRL-CFQVADSLNVFGVYNEYNELQRICRHYITEIE------------KDGETVDIHHAEVWTDQNVYFFVAE 198 (510) Q Consensus 132 ~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~e~y~~~~i~~~~~~ 198 (510) ++-.+..+.+ .++.++-.+++..-|..+++..++|-+..... ...+......+++|+. .|... T Consensus 146 y~~e~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~----v~~~~ 221 (536) T protein:vir:21 146 YLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTH----IYLDE 221 (536) T ss_pred EEeeCCCCceeeEEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEE----EEEec Confidence 6654444333 46677777887777888888888776554421 1111111222333321 01111 Q ss_pred CCceeeccccccccccccccccccc--ccccccccCCcccEEEecC-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 199 DNKDYELDEAEPINPRPHVLAVDSE--NESLLQRSYGQIPFYRLSN-----NKQETTDLKPIKALIDDYDLMNCFLSNNL 271 (510) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~g~iPvv~~~n-----n~~g~sd~~~v~~liD~~n~~~S~~~~~~ 271 (510) +++.+. +....++. .......+|..+|++.++- +.+|+|-.++..+-+..+|.+.-...... T Consensus 222 ~~~~~~-----------~~~e~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~ 290 (536) T protein:vir:21 222 DSGEYL-----------RYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMS 290 (536) T ss_pred CCCcEE-----------EEeccCCeeeccccCccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 111111 01111111 1122234577788887653 46799999999999999998877777766 Q ss_pred HHhccceeEE-ecCCCCchhhhhHhhhcCeeeeccCCCceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCCccccccccC Q lcl|NC_013644. 272 QDFAEAIYVV-SGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKT--VTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDG 348 (510) Q Consensus 272 ~~~~~~~lv~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g 348 (510) .....|.+.+ .+ +..++..+.. ...+.+..+..+++..+. ...+.......++.++..|-..-..-....-... T Consensus 291 ~~a~~~~~lv~p~-g~~~~~~~~~--~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~~~~~ 367 (536) T protein:vir:21 291 MISSKVIGLVNPA-GITQPRRLTK--AQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGE 367 (536) T ss_pred HHHhcCCcccCcc-cccchhhhcc--CCCcceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhhcccCCCC Confidence 6666654433 22 1122222211 111233334444455443 3346666777777777776442211111112223 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHhhccCCc-cccceeeEEeCCCCCC-CHHHHHH Q lcl|NC_013644. 349 NITNIVIKARYTLLNMKANKTEARLRALLEW--------MNKLVIDDINRRYTKA-FDPTEVSFTFTREVMV-NETDIVN 418 (510) Q Consensus 349 ~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~--------~~~~i~~~~~~~~~~~-~~~~~v~i~f~~~~p~-d~~e~~~ 418 (510) ..|++.+..+ +.++...++..+.+ +++.++.++...+.-+ .....+.+.+.-++.. .....++ T Consensus 368 r~TAtEV~~r-------~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~ 440 (536) T protein:vir:21 368 RVTAEEIRYV-------ASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLD 440 (536) T ss_pred CccHHHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHHH Confidence 3466665553 34444444444333 3444444443333211 1222345555444421 1112222 Q ss_pred HHHH----HHhcC------CCchHHHHHhC---CCC-------CcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCC Q lcl|NC_013644. 419 DEKT----EAETR------KIILESILQVA---PRL-------DDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNT 478 (510) Q Consensus 419 ~~~~----~~~~g------~iS~et~~~~~---~~v-------~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~ 478 (510) .+.. +.+.+ .|....++..+ -++ +++|.+++.+++.+.+......+.............+ T Consensus 441 ~l~~~~~~la~~~Pe~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~ 520 (536) T protein:vir:21 441 KLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASP 520 (536) T ss_pred HHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCh Confidence 2211 11111 12222333221 122 2233333332222222111111111111111111111 Q ss_pred CCcccCCCCCCcccccccCcccccccccCCC Q lcl|NC_013644. 479 DEEETAVNPDDPTQQMAEGATGSTESQLPEN 509 (510) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (510) ..-....+ +... -|+- T Consensus 521 ~~~~~~~~--~~g~-------------~~~~ 536 (536) T protein:vir:21 521 EAMAAAAD--SVGL-------------QPGI 536 (536) T ss_pred hhHHhhhh--cccc-------------CCCC Confidence 00000000 0000 0111 No 155 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=98.31 E-value=1.5e-06 Score=52.55 Aligned_cols=413 Identities=11% Similarity=0.065 Sum_probs=153.8 Q ss_pred CCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhc-cCCcchhcccceeccccccccccccc-ccee-ccchhHHHHHHH Q lcl|NC_013644. 6 SEDVKIIANALKAAIDKDRKSSSKREAETGIRYYN-HENDIMNNRIFYVDDEGILREDKYAS-NVRI-PHGFFPEIVDQK 82 (510) Q Consensus 6 ~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~-g~~~i~~~~~~~~~~~~~~~~~~~~~-~~ki-~~n~~~~Iv~~~ 82 (510) +.+. +.-..++.+..++. ....+-+. .-...-..+...+...+........- ...+ ....+..||+.. T Consensus 1 ~~~~--~~~~~~~~~~~~~~-------~~~rd~l~~~~~glg~~r~~~~~~~g~~~~~~~~~l~~~Yr~~~ia~~iVd~~ 71 (449) T protein:vir:10 1 MTDK--LTLAVNHALNDARM-------ARARMGLMVPTMGLDNKRHSAWCEYGFPELVTYENLYSLYRRGGIAHGAVEKL 71 (449) T ss_pred Cchh--hHHHHhhhcchhHH-------HHHHHHHHHHHhcCCcccchhhhhcCCcccCCHHHHHHHHhcCchhHHHHHhh Confidence 1111 11112333332221 11112111 10000001111121111111111000 0011 134677888888 Q ss_pred HhhhhcCCcee-ccCcH---H----HHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEE Q lcl|NC_013644. 83 TQYLLSNPVEY-ETENE---E----LKEYLAEYYNSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGV 154 (510) Q Consensus 83 ~~~l~g~p~~~-~~~d~---~----~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~ 154 (510) ++-..-+-+.+ ...+. . ....+++++.+.+...+.++.+++..+|.+++++-++ +|+..-.++.+. T Consensus 72 ~d~~~~~~~~i~~g~~~~~~~~~~~~e~~~~~l~~~~~~~~l~ea~~~~rl~Gga~i~i~v~-d~~~l~~Pl~~~----- 145 (449) T protein:vir:10 72 VGKCWQTNPEIIEGDDADDSEDETSWEKKSKQVFTNRLWRSFAEADRRRLVGRYAGILLHIR-DEKDWNLPATKG----- 145 (449) T ss_pred hhhhhhcCcccccCccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccCcEEEEEEec-CCCCCCcccccC----- Confidence 87654332222 22111 1 1233445555667778888999999999988877663 343322222221 Q ss_pred EcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCc Q lcl|NC_013644. 155 YNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQ 234 (510) Q Consensus 155 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 234 (510) ..+..+.-+|.....-.........-.++.|.. |++.....+. ......-|+--. T Consensus 146 ----~~i~~i~v~~~~~i~~~~~~~dp~sp~yg~P~~-y~v~~~~~g~--------------------~~~~~~iH~SRl 200 (449) T protein:vir:10 146 ----RGLQKVSVSWAGSLKVAEWDTGINSKTYGQPKL-WKYTERLPNG--------------------SSRRVDIHPDRV 200 (449) T ss_pred ----cceeeEEeeccccCChhhhhcCCCCCCCCCceE-EEEeeeccCC--------------------Cccceeecccee Confidence 111111111111100000000000001112211 1121110000 000001122111 Q ss_pred ccEEEecCCCCCCCcHHHHHHHHHHHHHHHHH-----HHHHHHHhccc----e-----eEEecCCCCchhh-h---hHhh Q lcl|NC_013644. 235 IPFYRLSNNKQETTDLKPIKALIDDYDLMNCF-----LSNNLQDFAEA----I-----YVVSGFQGDDLSK-L---RQNV 296 (510) Q Consensus 235 iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~-----~~~~~~~~~~~----~-----lv~~g~~~~~~~~-~---~~~~ 296 (510) |.+... ...|.|.++.+-.-+-.++.+.-. +.+..+..... + .-+.+.+.+...+ + ...+ T Consensus 201 ~~~~~~--~~~g~~~L~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~~~~~~e~~~~~~~~~~~~~ 278 (449) T protein:vir:10 201 FILGDY--SEDAIGFLEPAYNAFVSLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASLYGVSIDELQDKFNEVAGEI 278 (449) T ss_pred EeecCC--CCCChhHHHHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHHhhCCchHHHHHHHHHHHHH Confidence 221111 112444454432222122221110 11111111100 0 0011111111111 1 1111 Q ss_pred -hcCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc---cCc-ccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 297 -KSKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG---DGN-ITNIVIKARYTLLNMKANKTEA 371 (510) Q Consensus 297 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~---~g~-~Sg~Ai~~~~~~l~~k~~~k~~ 371 (510) +....+.++.+.+.+ +.+.+.......++...+.+...+++|-+-.-+ .|. +++ -++. | +..+..++. T Consensus 279 ~~~~~~~~i~~~~d~~--~~~~~~sgl~d~l~~~~q~iaaa~~IP~t~L~Gqsp~glnst~-D~~n-y---yd~i~~~Q~ 351 (449) T protein:vir:10 279 NRGNDVLMTTQGATVT--PLVTSVADPTATYNVNLQTAAAGVDIPTRILIGNQQAERSSTE-DQKY-F---NARCQSRRV 351 (449) T ss_pred hccchheeecCCcceE--EEecccCChhHHHHHHHHHHHHHhCCCeeeeeccCccccccch-hHHH-H---HHHHHHHHH Confidence 122234455555544 444566677778888888899999999753322 222 333 2333 3 334444555 Q ss_pred HHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHh--CCCCCcHHHHHH Q lcl|NC_013644. 372 RLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQV--APRLDDDNVLRL 449 (510) Q Consensus 372 ~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~--~~~v~d~e~~~~ 449 (510) .++..|++++.+|+.. +. + .. ..+++|.|++-...+++|+|+...+..++ .++++.. .+-++.+|.. T Consensus 352 ~l~p~le~l~~~l~~s-~~-g-~~--~~d~~i~f~pL~~~t~kEkAei~k~~A~a----~~~~~~ag~~~~~~~~EiR-- 420 (449) T protein:vir:10 352 DLSFEIEDFCDKLIEL-KI-I-DA--VAKKAVIWDDLNEQTGTEKLTNAKTMGEI----NQTMLGSGDNPAFSREEIR-- 420 (449) T ss_pred hhhHHHHHHHHHHHHh-hc-C-CC--CCceeEEeCCCCCCCHHHHHHHHHHHHHH----HHHHHHccccCCcCHHHHH-- Confidence 6899999998876543 22 1 11 23699999999999999998876554432 1222221 1223322211 Q ss_pred HHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCc Q lcl|NC_013644. 450 ICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGA 498 (510) Q Consensus 450 ~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (510) ......+. ... +.+ +.++++.....+..+ T Consensus 421 -------------~~~~~~~~--~~~-~~~----~e~~de~~~~~d~~a 449 (449) T protein:vir:10 421 -------------TAAGYDND--DEE-PLG----EEDGDEEDKATDSAA 449 (449) T ss_pred -------------HHhcccCC--CCC-CCC----CCCCccccccCCcCC Confidence 11111111 100 011 011111101111111 No 156 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=98.30 E-value=1.5e-06 Score=52.48 Aligned_cols=382 Identities=8% Similarity=0.006 Sum_probs=167.6 Q ss_pred HHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCCcee- Q lcl|NC_013644. 15 ALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNPVEY- 93 (510) Q Consensus 15 ~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~- 93 (510) ++-+.+-+.+.. ....-....-..-|... .+..+. +.+=+...-....|+..++-+.+-|+.+ T Consensus 1 m~f~~~~~~~~~-~~~~~~~~~~~~~g~~~-----------~~~~v~----~~~al~~~~v~~~i~~ia~~ia~lp~~~~ 64 (409) T protein:vir:10 1 MLFRKGFKNQSQ-EISIDDKKILEWLGINP-----------SETYVN----GKSCLKQATVFGCIRILSDNISKLPIKIY 64 (409) T ss_pred CcccccccCcCC-CCCCChHHHHHHhcCCc-----------Ccceec----hhhhhccHHHHHHHHHHHHhhhhCceEEE Confidence 110000000000 00000000000000000 000000 0000112222334455555555567654 Q ss_pred ccCc---HHHHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCcee Q lcl|NC_013644. 94 ETEN---EELKEYLAEYYN---S---EFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNELQR 163 (510) Q Consensus 94 ~~~d---~~~~~~l~~~~~---n---~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~~ 163 (510) ...+ .....-+..++. | ........++...+.+|.||+++.++..|++ .+.+++|..+-++.++.+.... T Consensus 65 ~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~ 144 (409) T protein:vir:10 65 QKKDGIKRVPDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYPLKSDGMKIFVDDTGLLNS 144 (409) T ss_pred EecCCeeeccCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCCceEEEEcCCccccc Confidence 1111 111112223221 2 2345556678889999999999988988876 5788999998888775443222 Q ss_pred EEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCC Q lcl|NC_013644. 164 ICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNN 243 (510) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn 243 (510) .-.+++......+. ...+....+.+++.-. .+. T Consensus 145 ~~~~~y~~~~~~g~------~~~~~~~evih~r~~~-----------------------------------------~d~ 177 (409) T protein:vir:10 145 ENNVWYLYTDDLGQ------RHKFMSDEILHFKGLT-----------------------------------------ADG 177 (409) T ss_pred cceEEEEEEeCCce------eEEeccccEEEecCcC-----------------------------------------CCC Confidence 11111111111110 0112222233222000 011 Q ss_pred CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCC-Cc--hhhhhHhh--------hcCeeeeccCCCceeE Q lcl|NC_013644. 244 KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQG-DD--LSKLRQNV--------KSKKVVGTGSDGGLDV 312 (510) Q Consensus 244 ~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~-~~--~~~~~~~~--------~~~~~~~~~~~~~~~~ 312 (510) ..|.|.++.+...++....+..-..+.++..+.|-.+++.... ++ ...+...+ ..++++.++++.+++. T Consensus 178 ~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~ 257 (409) T protein:vir:10 178 LAGLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYAGDLNPEAEEVFKENFERMSSGLKNAHRIAMLPIGYKFEP 257 (409) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCHHHHHHHHHHHHHHhccccccCCceecCCCceEEE Confidence 2366666666666665555555555556666667666654321 11 11121111 1233566666666665 Q ss_pred EeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_013644. 313 KTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRY 392 (510) Q Consensus 313 ~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~ 392 (510) +..+.....+.+..+...+.|+..-++|+.-.+..+..++..+.... ...+..+|.-+++.|...+..+- T Consensus 258 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~e~~~----------~~f~~~~l~P~~~~ie~~ln~kL 327 (409) T protein:vir:10 258 ISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRATHSNITEQN----------REFYIDTLQSILNMYELEINYKL 327 (409) T ss_pred ccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccccHHHHH----------HHHHHHHHHHHHHHHHHHHHHhh Confidence 55544445566777888889999889998654432222222222111 22334444444444444443221 Q ss_pred CC--cc-ccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_013644. 393 TK--AF-DPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALEE 467 (510) Q Consensus 393 ~~--~~-~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~~ 467 (510) -. .. ....+++.+..-+-.|..+.++.+.+++.+|+++.-.++++++. +++-... ....+. T Consensus 328 ~~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~ggD~~--------------~~~~n~ 393 (409) T protein:vir:10 328 FLISEIKNGFYSKFNVDTILRADIKTRYESYKEAIQNGFKTPNEIRELEEDEPLEGGDVL--------------LINGNM 393 (409) T ss_pred cCchhccCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCee--------------eeccCc Confidence 10 10 11234444556667899999999999999999998877777643 1110000 000000 Q ss_pred hhccCCCCCCCCCcccCCCCCCcccc Q lcl|NC_013644. 468 AEYTKGLSDNTDEEETAVNPDDPTQQ 493 (510) Q Consensus 468 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (510) .+. +..+++.. .+| +. T Consensus 394 ~~~----~~~~~~~~---kgG---e~ 409 (409) T protein:vir:10 394 IPV----KMAGEQYS---KGG---EK 409 (409) T ss_pred cch----hhcccccc---ccC---CC Confidence 000 00000000 000 00 No 157 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=98.28 E-value=1.7e-06 Score=52.19 Aligned_cols=380 Identities=12% Similarity=0.057 Sum_probs=163.6 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhh-hhHH-HHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKS-SSKR-EAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~-~~~~-~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) |--. +..+.. +... .-..+-.+..+.. .+..+.. ..-+.++-.... T Consensus 1 M~~f----------------~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~v~~----~~al~~~~V~~~ 48 (397) T protein:vir:38 1 MPLL----------------KLNKSHSQGFSLNDPDWVNFLTGGE------------AQKYVSA----DTALKNSDIFSL 48 (397) T ss_pred Ccch----------------hhhhcccCcccCCchhhhhhhcCCc------------CCceech----HHhhccHHHHHH Confidence 1100 000000 0000 0000111111100 0000000 000111122223 Q ss_pred HHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEc Q lcl|NC_013644. 79 VDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYN 156 (510) Q Consensus 79 v~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d 156 (510) |+..++-+.+-|+.. .+.....++.+-.. -........+....+.+|.||+.+-.|.+|.+ .+.+++|..+-+..+ T Consensus 49 v~~ia~~ia~~p~~~--~~~~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v~i~~~ 126 (397) T protein:vir:38 49 IMQLSGDLAMVRYTS--ESDRSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQVQPMLL 126 (397) T ss_pred HHHHHHHHhhCcccc--cccHHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEc Confidence 444444444445543 33333222221111 12345566778889999999999888888875 688899999887776 Q ss_pred CCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCccc Q lcl|NC_013644. 157 EYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIP 236 (510) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP 236 (510) ..+.. ..|.+....... . ....+....+.+++... T Consensus 127 ~~~~~----~~y~~~~~~~~~-~---~~~~~~~~eiih~~~~~------------------------------------- 161 (397) T protein:vir:38 127 QDGSG----LIYNINFDEPAI-G---YMENVPAADVIHIRLLS------------------------------------- 161 (397) T ss_pred CCCce----EEEEEEeccccc-c---ceeEecCccEEEecCCC------------------------------------- Confidence 44321 011111111000 0 00112333333332110 Q ss_pred EEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhh---hhHhh-------hcCeeeeccC Q lcl|NC_013644. 237 FYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSK---LRQNV-------KSKKVVGTGS 306 (510) Q Consensus 237 vv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~---~~~~~-------~~~~~~~~~~ 306 (510) ..+...|.|.+..+...++....+..-..+.+...+.|-.+++-......+. ..... ..++++.+++ T Consensus 162 ---~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~ 238 (397) T protein:vir:38 162 ---KNGGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKGGLLDAETRIARSKEISKQIHNSDGPVVIDA 238 (397) T ss_pred ---CCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCceecCC Confidence 0011246777777766666655555555566666677766665432222111 11111 1233455666 Q ss_pred CCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 307 DGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNI-TNIVIKARYTLLNMKANKTEARLRALLEWMNKLVI 385 (510) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~-Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~ 385 (510) +.+++-++.......+.+..+...+.|+..-++|+.-.+...+. |..+ . ....+..+|.-++..|. T Consensus 239 g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~e--~-----------~~~~~~~~l~P~~~~ie 305 (397) T protein:vir:38 239 LEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQSSIT--Q-----------ISGQYAKSLNRYVQAIV 305 (397) T ss_pred CceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHH--H-----------HHHHHHHHHHHHHHHHH Confidence 55555555444555667778888899998888887644332211 2111 0 01123344555444444 Q ss_pred HHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 386 DDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKE 463 (510) Q Consensus 386 ~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~ 463 (510) ..+..+--.. +++.+...+-.|..+.++.+.++..+|+++.-.+++.++. +.+.+.- . .. .. T Consensus 306 ~~ln~~l~~~-----~~~~~~~~~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~-------~--~~--~~ 369 (397) T protein:vir:38 306 GELNDKLHAN-----ISANIRFAIDAMGDQYASTISSSVKGGTIAGNQARFILQNSGYLAKDLP-------D--PE--KE 369 (397) T ss_pred HHHHHhccCh-----hcccccccccCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCccc-------c--cc--cc Confidence 4444322211 1222333445678899999999999999999888887643 1111100 0 00 00 Q ss_pred HHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccccCC Q lcl|NC_013644. 464 ALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLPE 508 (510) Q Consensus 464 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (510) .. . .......+.+.+++.++.+++.. |+ T Consensus 370 ~~---~-----~~~~~~~~~g~~~~~~~~e~~~~---------~~ 397 (397) T protein:vir:38 370 PQ---Q-----AIQLIQQEGGENDGNNSDERGSD---------PE 397 (397) T ss_pred cc---c-----cccccccccCCCCCCCCCCCCCC---------CC Confidence 00 0 00000000111111111111100 11 No 158 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=98.25 E-value=2.1e-06 Score=51.71 Aligned_cols=432 Identities=10% Similarity=0.045 Sum_probs=192.5 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhH--HHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSK--REAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~--~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) |.+-++.++....+.+++..+..+..+.. .+...+.+|..-. . ....+. .+...|+..+-+..- T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~--~-------~~~~~~-----~~~~~~~~dstg~~a 66 (516) T protein:vir:96 1 MKQSIDLEYGGKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPY--L-------MNDKGD-----NETSQNGWQGVGAQA 66 (516) T ss_pred CcchhhhhhhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhccc--c-------cCCCCC-----ccccCCcccchHHHH Confidence 88888888888888888888877654322 2344444444321 1 111111 111124556667777 Q ss_pred HHHHHhhhhc--CCce-----eccCcH-------------HHHHH-------HHHHh-ccCHHHHHHHHHHHHHhcCeEE Q lcl|NC_013644. 79 VDQKTQYLLS--NPVE-----YETENE-------------ELKEY-------LAEYY-NSEFQVVLQELVEGSSQKGFEY 130 (510) Q Consensus 79 v~~~~~~l~g--~p~~-----~~~~d~-------------~~~~~-------l~~~~-~n~~~~~~~e~~~~~~~~G~~~ 130 (510) +++.++-|++ -|+. +...++ .+.+. +...+ .+||...+.++.++..++|.|. T Consensus 67 ~~~LAa~l~~~ltpp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ 146 (516) T protein:vir:96 67 TNHLANKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCM 146 (516) T ss_pred HHHHHHHHHhhhcCCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEe Confidence 7777776654 2322 332221 12222 22223 4688889999999999999985 Q ss_pred EEEEECCCCceEEEEEcccceEEEEcCCCCceeEEEEEEEEE-------------------eeCCceeEEEEEEEEcCCc Q lcl|NC_013644. 131 VYARTNAEDRLCFQVADSLNVFGVYNEYNELQRICRHYITEI-------------------EKDGETVDIHHAEVWTDQN 191 (510) Q Consensus 131 ~~v~~d~~g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~~~e~y~~~~ 191 (510) +|.|+++.++ .++-.+++..-|..+++..+++-..... .+......+++...+.++. T Consensus 147 --l~~d~~~~~~--~~pl~~y~v~~d~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~ 222 (516) T protein:vir:96 147 --LYKPSKGAIS--AIPMHHYVVNRDTNGDLLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHAKYLGDG 222 (516) T ss_pred --EEecCCCCEE--EEEcCeEEEeeCCCCCeeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEeeeeeCCc Confidence 4557776554 4444555555566665555443211110 0011111111111222222 Q ss_pred EEE-EEEcCCceeecccccccccccccccccccccccccccCCcccEEEec-----CCCCCCCcHHHHHHHHHHHHHHHH Q lcl|NC_013644. 192 VYF-FVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLS-----NNKQETTDLKPIKALIDDYDLMNC 265 (510) Q Consensus 192 i~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----nn~~g~sd~~~v~~liD~~n~~~S 265 (510) .+. |...++.. .......+|..+|++.++ .+.+|+|-.++..+-+..+|.+.- T Consensus 223 ~~~~~~~~d~~~---------------------~~~es~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~ 281 (516) T protein:vir:96 223 FWELKQSADDIP---------------------VGKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSE 281 (516) T ss_pred eeEEEEEeCcee---------------------eccccccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHH Confidence 111 11111110 001112334567776654 346799988889999999998887 Q ss_pred HHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCcccc Q lcl|NC_013644. 266 FLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDST 343 (510) Q Consensus 266 ~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~ 343 (510) ...........|.+.+.-........+.. ...+.+..+..++++.+... .+.......++.++..|-..-..-... T Consensus 282 ~~l~~~~~a~~~~~lv~p~g~~~~~~l~~--~~~g~i~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~ 359 (516) T protein:vir:96 282 AVARGAALMADIKYLIRPGAQTDVDHFVN--SGTGEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETMT 359 (516) T ss_pred HHHHHHHHhcCCccccCcccccchhhhcc--CCCceeecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhc Confidence 88887888887776553211222222211 12234445555667776543 356777777777777775532111111 Q ss_pred ccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHhhccCCccccceeeEEeCCCCCC-CHHHHH Q lcl|NC_013644. 344 QVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVID-----DINRRYTKAFDPTEVSFTFTREVMV-NETDIV 417 (510) Q Consensus 344 ~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~-----~~~~~~~~~~~~~~v~i~f~~~~p~-d~~e~~ 417 (510) .-.....|++.+.. +..+|...++..+.++-.=++. .+...+. ......+.+.+..++.. -....+ T Consensus 360 ~r~~~rvTAtEV~~-------r~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~~p-~lp~~~v~~~~vs~l~~l~r~~~~ 431 (516) T protein:vir:96 360 RRDAERVTAVEIQR-------DALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGE-SFTSDLVDPVIITGIEALGRMAEL 431 (516) T ss_pred cCCCccccHHHHHH-------HHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhcCC-CCccccccceeechHHHHHHHHHH Confidence 11223346655553 5566777777776663211111 1111111 11112234443322211 011111 Q ss_pred HHHHHHHh-cCCC---c--------hHHHHHhC----C----CC-CcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCC Q lcl|NC_013644. 418 NDEKTEAE-TRKI---I--------LESILQVA----P----RL-DDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSD 476 (510) Q Consensus 418 ~~~~~~~~-~g~i---S--------~et~~~~~----~----~v-~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~ 476 (510) +.+....+ .|.+ + ...+++.+ + .+ +++|..+.++++.+.++... .++.. .+...+. T Consensus 432 ~~i~~~~~~i~~~~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~~~~~~~~q~~~~--~a~~~-~~~~~~~ 508 (516) T protein:vir:96 432 DKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMAQEQEAQMQAQQAQM--LEEGV-AKAVPGV 508 (516) T ss_pred HHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHH--HHHHh-hhhhhHH Confidence 11111110 0101 1 11222211 1 11 22333333322222221111 11111 0111111 Q ss_pred CCCCcccCC Q lcl|NC_013644. 477 NTDEEETAV 485 (510) Q Consensus 477 ~~~~~~~~~ 485 (510) ...+-+ +. T Consensus 509 ~~~~~~-~~ 516 (516) T protein:vir:96 509 IQQELK-EA 516 (516) T ss_pred hhcccc-cC Confidence 011000 00 No 159 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=98.23 E-value=2.3e-06 Score=51.54 Aligned_cols=450 Identities=11% Similarity=0.030 Sum_probs=190.0 Q ss_pred hhhhHHHHHHHHHhhhhhh--hHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhh Q lcl|NC_013644. 9 VKIIANALKAAIDKDRKSS--SKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYL 86 (510) Q Consensus 9 ~~~~~~~i~~~i~~~~~~~--~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l 86 (510) .+ +.+++..+..+..+ --.++..+.+|..-.- ... .+. .....+.++..+-+...+++.++.| T Consensus 1 m~---~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~-------~~~--~~~---~~~~~~~~~~dst~~~a~~~Laa~l 65 (555) T protein:vir:17 1 MK---HSAQAKYMMLRADREDYLDSGRQSARLTLPYI-------LTD--EGH---VQGGYLPTPWQSVGSKGVNVLASKL 65 (555) T ss_pred Ch---hHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc-------cCC--CCC---cccccccccccccHHHHHHHHHHHH Confidence 12 22333333333221 1123444555532210 000 110 0111223566677777788877777 Q ss_pred hc--CCce-----eccCc---------HHHHHHHHH-----------Hh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCC Q lcl|NC_013644. 87 LS--NPVE-----YETEN---------EELKEYLAE-----------YY-NSEFQVVLQELVEGSSQKGFEYVYARTNAE 138 (510) Q Consensus 87 ~g--~p~~-----~~~~d---------~~~~~~l~~-----------~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~ 138 (510) ++ -|+. +...+ +.....++. .+ .+||.....++.++..++|.+.+ |.+++ T Consensus 66 ~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--y~~~~ 143 (555) T protein:vir:17 66 MLSLFPVNTSFFKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALL--YQGKK 143 (555) T ss_pred HHhhcCCCCcccccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEE--EecCC Confidence 64 2322 23222 122222222 22 36788899999999999999865 45655 Q ss_pred CceEEEEEcccceEEEEcCCCCceeEEEEEEEEEee-----CC----------------------------ceeEEEEEE Q lcl|NC_013644. 139 DRLCFQVADSLNVFGVYNEYNELQRICRHYITEIEK-----DG----------------------------ETVDIHHAE 185 (510) Q Consensus 139 g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~-----~~----------------------------~~~~~~~~e 185 (510) + + ++++-.+++..-|..+++..++|-+...... +. .......++ T Consensus 144 ~-~--~~~pl~~y~v~~d~~G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 220 (555) T protein:vir:17 144 N-L--KLYPLDRFVVSRDGEGNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDKGKSNDAL 220 (555) T ss_pred c-e--eEEEcCeEEEeeCCCcCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhcccccCCCccee Confidence 4 3 4455566666667778888877755533211 00 000001112 Q ss_pred EEcCCcEEEEEEcCCceeecccccccccccccccccccc--cccccccCCcccEEEec-----CCCCCCCcHHHHHHHHH Q lcl|NC_013644. 186 VWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSEN--ESLLQRSYGQIPFYRLS-----NNKQETTDLKPIKALID 258 (510) Q Consensus 186 ~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~g~iPvv~~~-----nn~~g~sd~~~v~~liD 258 (510) +|+.- ....+. ..+....++.. ......+|..+|++.++ ++.+|+|-.++..+-+. T Consensus 221 v~t~~-----~~~~~~------------~~~~~e~~~~~v~~~l~e~g~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k 283 (555) T protein:vir:17 221 VYTYV-----CRKDGQ------------VKWHQECDGKVIPGSNSSAPYTHNPWIPLRFNIVDGEAYGRGRVEEFMGDLK 283 (555) T ss_pred Eeecc-----cccCCe------------eEEEEecCceeccccccccCcccCCeeeeeeeecCCCccccchHHHHHHHHH Confidence 22110 000000 00000011111 01124566677877665 34679999999999999 Q ss_pred HHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeEEeec--CCHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 259 DYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKF 336 (510) Q Consensus 259 ~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~ 336 (510) .+|.+.-......+...+|.+.+.-.......++.. ...+.+..+..++++.+... .+.......++.++..|-.. T Consensus 284 ~L~~l~~~~l~~~~~~~~pp~lv~~~g~~~~~~l~~--~~~g~v~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~a 361 (555) T protein:vir:17 284 SLEALSQAMVEGSAASAKVVFMVSPSATTKPQNLAL--AANGAIIQGRPDDVSVVQANKAADFRTVLEMIQKLEQRISDA 361 (555) T ss_pred HHHHHHHHHHHHHHHHhCCceeeccccccCcceeec--CCCceeecCCcccceeeeccccchhhHHHHHHHHHHHHHHHH Confidence 999998888999999999887653222222222221 11234444445556666543 35566677777777666443 Q ss_pred hCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHhhccCCc-cccceeeEEeCC Q lcl|NC_013644. 337 GMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWM--------NKLVIDDINRRYTKA-FDPTEVSFTFTR 407 (510) Q Consensus 337 s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~--------~~~i~~~~~~~~~~~-~~~~~v~i~f~~ 407 (510) -.. .........|++.+.. ++.++...++..+.++ ++-++.++...+.-+ ....-+.+.+.- T Consensus 362 Fm~--~~~~d~~r~TAtEV~~-------r~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~~v~~~i~~ 432 (555) T protein:vir:17 362 FLM--LQVRQSERTTATEVQA-------TVQELNEQIGGIYSNLTTELLQPYLARKLHLLQKQRKLPQLPKDLVQPTVVA 432 (555) T ss_pred Hhh--cCCCCcccchHHHHHH-------HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCHhhhccceee Confidence 211 1122334456655554 4455555555544443 333333443333211 111112333322 Q ss_pred CCCC-----CHHHHHHHHHHHHhcC-------CCchHHHHH----hCCC-----C-CcHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 408 EVMV-----NETDIVNDEKTEAETR-------KIILESILQ----VAPR-----L-DDDNVLRLICEQFDLDWEDVKEAL 465 (510) Q Consensus 408 ~~p~-----d~~e~~~~~~~~~~~g-------~iS~et~~~----~~~~-----v-~d~e~~~~~~e~~e~~~~~~~~~~ 465 (510) ++.. +.......+..+.+.+ .+....+++ .++. + ++++..++.++++.++..+....+ T Consensus 433 ~l~~l~r~~~~~~l~~~~~~laq~~~~p~~~d~id~d~~~~~~a~~~Gv~p~~ivrs~eev~~~rq~~~~~~~q~~~~~q 512 (555) T protein:vir:17 433 GLWGVGRGQDKQQLMEFITTLAQTMGPEIAMKYINPTEFIKRLAAAQGIDTLQLINSPETMKQLGDQQKQDMVQASLINQ 512 (555) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhhcCchhHhhcCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHH Confidence 2211 1111111222222221 022222222 2221 2 334444433333332222222222 Q ss_pred HhhhccCCCCCCCCCcccCCCCCCcccc--cccCcccccccccCC Q lcl|NC_013644. 466 EEAEYTKGLSDNTDEEETAVNPDDPTQQ--MAEGATGSTESQLPE 508 (510) Q Consensus 466 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~ 508 (510) ..+..+....+..- ........+..+ .+.-.+.|.+-+-|. T Consensus 513 a~~~~~~~~~~~~~--~~~~~~~~~a~~~~~a~~~~~~~~~~~~~ 555 (555) T protein:vir:17 513 AGQLAKTPMAEQAM--QLIQQQQEGAQDAGAAESETSSAEAQAGA 555 (555) T ss_pred HHHHHhhhhhhhHH--hccccchhhhhHHHHHHhhcCCcccccCC Confidence 11111111111000 000111101010 011111122222222 No 160 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=98.21 E-value=2.5e-06 Score=51.28 Aligned_cols=446 Identities=12% Similarity=0.044 Sum_probs=191.9 Q ss_pred hhhhHHHHHHHHHhhhhhh--hHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhh Q lcl|NC_013644. 9 VKIIANALKAAIDKDRKSS--SKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYL 86 (510) Q Consensus 9 ~~~~~~~i~~~i~~~~~~~--~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l 86 (510) .+.++ ++..+..+..+ --.++..+.+|..-. + .....+. ......++..+.+...+++.++.| T Consensus 1 mk~~a---~~r~~~l~~~R~~~e~~w~e~~~y~lP~-----~--~~~~~~~-----~~~~~~~~~dstg~~a~~~Laa~l 65 (542) T protein:vir:78 1 MKGLA---QARYSAMRADREDFLDMARRCAALTLPY-----L--LTEDGHA-----SGGRLQQPYQSLGSKGVNALSSKL 65 (542) T ss_pred ChhHH---HHHHHHHHHHhhHHHHHHHHHHHHhccc-----c--CCCCCCc-----ccccccccccchHHHHHHHHHHHH Confidence 22222 22333332211 112344555554221 0 0000000 011113455667777788877777 Q ss_pred hc--CCce-----eccCc----------HH----HHHHH-------H-HHhccCHHHHHHHHHHHHHhcCeEEEEEEECC Q lcl|NC_013644. 87 LS--NPVE-----YETEN----------EE----LKEYL-------A-EYYNSEFQVVLQELVEGSSQKGFEYVYARTNA 137 (510) Q Consensus 87 ~g--~p~~-----~~~~d----------~~----~~~~l-------~-~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~ 137 (510) ++ -|+. +...+ +. +...| . .+..+||.....++.++..++|.|.+ |.++ T Consensus 66 ~~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--~~~~ 143 (542) T protein:vir:78 66 MLSLFPIQTSFFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLV--FAGK 143 (542) T ss_pred HHhhcCCCCccccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEE--EecC Confidence 64 2322 23222 11 22222 2 22347888899999999999999865 4555 Q ss_pred CCceEEEEEcccceEEEEcCCCCceeEEEEEEEEEee---------------------CCceeEEEEE-EEEcCCcEEEE Q lcl|NC_013644. 138 EDRLCFQVADSLNVFGVYNEYNELQRICRHYITEIEK---------------------DGETVDIHHA-EVWTDQNVYFF 195 (510) Q Consensus 138 ~g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~---------------------~~~~~~~~~~-e~y~~~~i~~~ 195 (510) +. ++.++-.+++..-|..+++..++|.+...... .+....+.|. .-..+..++.+ T Consensus 144 ~~---~~~~pl~~y~v~~d~~G~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~~~ 220 (542) T protein:vir:78 144 KT---LKVYPLDRYVIERDGDGNVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTC 220 (542) T ss_pred CC---ceEEecceeEEeeCCCCCeEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCCccccc Confidence 43 44455566666667777888877765544211 0000111110 00011111111 Q ss_pred EEcCCceeeccccccccccccccccccccc--ccccccCCcccEEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 196 VAEDNKDYELDEAEPINPRPHVLAVDSENE--SLLQRSYGQIPFYRLS-----NNKQETTDLKPIKALIDDYDLMNCFLS 268 (510) Q Consensus 196 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~g~iPvv~~~-----nn~~g~sd~~~v~~liD~~n~~~S~~~ 268 (510) .....+. +.+....++... .....+|..+|++.++ .+.+|+|-.++..+-+..+|.+.-... T Consensus 221 ~~~~~~~-----------~s~~~e~~g~~v~~~~~e~g~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l 289 (542) T protein:vir:78 221 CKLVDGQ-----------HRWHQECDGKEIKGSRSSSPLKHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRSLI 289 (542) T ss_pred cccCCCe-----------EEEEEEeccccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHH Confidence 1111111 111111111111 1223466677877654 346799999999999999999998889 Q ss_pred HHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCccccccc Q lcl|NC_013644. 269 NNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDSTQVG 346 (510) Q Consensus 269 ~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~ 346 (510) ...+...+|.+.+.--+..+...+.. ...+.+..+..++++.+... .+.......++.++..|-..-. .. ..-. T Consensus 290 ~~~~~a~~pp~lv~~~g~~~~~~~~~--~~~g~iv~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~aFl-~~-~~~d 365 (542) T protein:vir:78 290 EGSAAAAKVVFMVSPSATTKPQSLAR--AGTGAIIQGRAEDVSVVQANKGADFRTVQEMIRDLSQRISDAFL-IL-NVRQ 365 (542) T ss_pred HHHHHHhcCceeeccccccchhhccc--CCCceeecCCccceeeeecccccchhHHHHHHHHHHHHHHHHhc-cc-ccCC Confidence 99999899887653222222222211 12223444555666665433 4677778888888877755321 11 1122 Q ss_pred cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHhhccCC-ccccceeeEEeCCCCCCCH-HHH Q lcl|NC_013644. 347 DGNITNIVIKARYTLLNMKANKTEARLRALLEWM--------NKLVIDDINRRYTK-AFDPTEVSFTFTREVMVNE-TDI 416 (510) Q Consensus 347 ~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~--------~~~i~~~~~~~~~~-~~~~~~v~i~f~~~~p~d~-~e~ 416 (510) ....|++.+.. ++.++...++..+.++ ++-++.++...+.- .....-+++.+..++..-- ... T Consensus 366 ~~rvTAtEV~~-------r~~E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~lv~~~~~s~La~~~r~~~ 438 (542) T protein:vir:78 366 SERTTATEVRE-------VQMELDRQLSGIYGSLTVELLTPYLNRKLHLMQRSKQLPSLPKGLVMPTVVAGLGGVGRGED 438 (542) T ss_pred cccccHHHHHH-------HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhceeeeeechHHHHHHHHH Confidence 23345555544 4455555555555543 22233344333321 1222236777776663311 111 Q ss_pred HHHH----HHHHhcCCCchHHHHHhC------------CCC------CcHHHHHHHHHHHHHHHHHHHHHH-HhhhccCC Q lcl|NC_013644. 417 VNDE----KTEAETRKIILESILQVA------------PRL------DDDNVLRLICEQFDLDWEDVKEAL-EEAEYTKG 473 (510) Q Consensus 417 ~~~~----~~~~~~g~iS~et~~~~~------------~~v------~d~e~~~~~~e~~e~~~~~~~~~~-~~~~~~~~ 473 (510) ++.+ ....+ ++..+.+...+ -++ ..+|+.+.++++... ...+.+. ++...... T Consensus 439 ~~~l~~~~~~i~~--~~~p~~l~~~id~d~~~~~~a~~~Gvp~~~i~~s~e~~~~~~~q~q~--~~~~~al~~~a~~~a~ 514 (542) T protein:vir:78 439 RAALIEFMQTVGQ--AMGPEALQQFIDPTEFLKRLAAASGIDTLNLVKSPETMANEAQQAQQ--QQMTASLMGQAGQLAK 514 (542) T ss_pred HHHHHHHHHHHHH--hcCChhHHhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHH--HHHHHHHHHhhhhccc Confidence 1111 11111 12223332222 122 222222222221111 1111111 11111011 Q ss_pred CCCCCCCcccCCCCCCcccccccC-cccccc Q lcl|NC_013644. 474 LSDNTDEEETAVNPDDPTQQMAEG-ATGSTE 503 (510) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 503 (510) ...+ +....+...+.++++.+ .+|+.- T Consensus 515 ~~~~---~~~~~~~~a~~~~~~~~~~~~~~~ 542 (542) T protein:vir:78 515 SPIG---EKMMQQINAPGQEAPAGPQTGEDL 542 (542) T ss_pred cccc---cchhhhcCCCCcCCCCCCcccccC Confidence 1111 11111111111222222 122222 No 161 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=98.16 E-value=3.4e-06 Score=50.56 Aligned_cols=389 Identities=10% Similarity=0.034 Sum_probs=157.9 Q ss_pred ccCCChhhhHHHHHHHH-HhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHH Q lcl|NC_013644. 4 LLSEDVKIIANALKAAI-DKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQK 82 (510) Q Consensus 4 ~~~~~~~~~~~~i~~~i-~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~ 82 (510) |.+ +.+...++..+ ......... .+..+-... ...+... ..+.-+.++-...-|+.. T Consensus 1 ~~~---~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~-----------~~~~~~v----~~~~~~~~~~V~~ci~~I 58 (409) T protein:vir:93 1 MAK---ENIVTRIKKKLIDNWIDQSTS----KLYDFSPWK-----------NRSFWGV----INNTLETNETIFSAITKL 58 (409) T ss_pred CCc---cchhhhhhhhhhhhhhccccc----ccccccccc-----------Ccccccc----chhhhhccHHHHHHHHHH Confidence 111 11111122211 111000000 000000000 0000000 000011122233344555 Q ss_pred HhhhhcCCceeccCcHHHHHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEE Q lcl|NC_013644. 83 TQYLLSNPVEYETENEELKEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVY 155 (510) Q Consensus 83 ~~~l~g~p~~~~~~d~~~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~ 155 (510) ++-+..-|+++-...+.....+..++. |. .......++...+.+|.||+++..+..|++ .+.+++|..+-+.. T Consensus 59 a~~ia~lp~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~~~~ 138 (409) T protein:vir:93 59 SNSMASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLI 138 (409) T ss_pred HHhhhhCceeEeeccccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEE Confidence 555555676653222222233333331 22 234456677888999999999999988875 67888998887776 Q ss_pred cCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcc Q lcl|NC_013644. 156 NEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQI 235 (510) Q Consensus 156 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~i 235 (510) ++.... + +|.+.. .++. . ..+.+..+.+++.-. + T Consensus 139 ~~~~~~--~--~y~~~~-~~g~-----~-~~~~~~eVih~r~~~-------------------------------~---- 172 (409) T protein:vir:93 139 ENQSRE--L--YYSIHA-ATGN-----K-LIVHNMDMLHFKHIV-------------------------------A---- 172 (409) T ss_pred eCCCcE--E--EEEEEc-CCce-----E-EEEccccEEEeCCCC-------------------------------C---- Confidence 543210 0 111111 1110 0 012233333332100 0 Q ss_pred cEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccc-eeEE-ecCCCCch--hhhhHh----hh-cCeeeeccC Q lcl|NC_013644. 236 PFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEA-IYVV-SGFQGDDL--SKLRQN----VK-SKKVVGTGS 306 (510) Q Consensus 236 Pvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~-~lv~-~g~~~~~~--~~~~~~----~~-~~~~~~~~~ 306 (510) .+.-.|.|.++.+...++..+.+ ..+ .+..+..+ -.++ .+...++. ...... .. .++++.+++ T Consensus 173 -----~~~~~G~s~i~~~~~~i~~~~~~-~~~--~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~ 244 (409) T protein:vir:93 173 -----SNMVQGISPIDVLKNTTDFDNAV-RTF--NLTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEP 244 (409) T ss_pred -----CCccccccHHHHHHHHHHHHHHH-HHH--HHHhcCCCCceEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCC Confidence 00113566665554444433322 111 23333332 2222 33222221 111111 11 234555666 Q ss_pred CCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 307 DGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVID 386 (510) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~ 386 (510) +.+++.++.+.....+.+..+..++.|+..-++|+.-.+..++.+...++... ...+...|.-+++.|.. T Consensus 245 g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~----------~~f~~~~l~P~~~~ie~ 314 (409) T protein:vir:93 245 GVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELN----------RFYLQHTLLPIVKQYEE 314 (409) T ss_pred CceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH----------HHHHHHHHHHHHHHHHH Confidence 55555554443344556666777888988888887644433332222222111 12333344444444444 Q ss_pred HHhhccCCcccc-ceeeEEe--CCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 387 DINRRYTKAFDP-TEVSFTF--TREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKE 463 (510) Q Consensus 387 ~~~~~~~~~~~~-~~v~i~f--~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~ 463 (510) .+..+--...+. ....+.| ..-+-.|..+.++.+.+++.+|+++.-.+++.++.-.-+.-.+ ... T Consensus 315 ~l~~~Ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~------------~~~ 382 (409) T protein:vir:93 315 EFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDK------------PLI 382 (409) T ss_pred HHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe------------eee Confidence 443321111111 1233455 4555678999999999999999999988888765321110000 000 Q ss_pred HHHhhhccCCCCCCCCCcccCCCCCCcccccc Q lcl|NC_013644. 464 ALEEAEYTKGLSDNTDEEETAVNPDDPTQQMA 495 (510) Q Consensus 464 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (510) ..+..+ .+.. .+......+|++...++ T Consensus 383 ~~n~~~----~~~~-~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:93 383 SGDLYP----IDTP-LELRKSLKGGDKNVNES 409 (409) T ss_pred cccccc----cccc-hhhcccccCCCCCcCCC Confidence 010110 0000 00000001111111111 No 162 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=98.14 E-value=3.7e-06 Score=50.35 Aligned_cols=439 Identities=8% Similarity=-0.017 Sum_probs=190.4 Q ss_pred hhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhc-- Q lcl|NC_013644. 11 IIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLS-- 88 (510) Q Consensus 11 ~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g-- 88 (510) .+++.-.+.....|..- ..+...+.+|..-.. -. ..+... ...+...++..+.+...+++.++-|++ T Consensus 1 m~~~~r~~~L~~~R~~~-e~~w~e~~~~tlP~~-----~~----~~~~~~-~~~~~~~~~~dstg~~a~~~LAa~l~~~l 69 (522) T protein:vir:10 1 MKARERYNQLTTARQMF-LDKAVECSELTLPYL-----ID----DDISSR-PNHKSLTVPWQSVGAKCCVTLAAKLMLAV 69 (522) T ss_pred CchHHHHHHHHHHhhHH-HHHHHHHHHHhhhcc-----cC----CCCCCC-cccccccccccchHHHHHHHHHHHHHHhh Confidence 22222222222222111 123444445532210 00 000000 011122356667777777777777654 Q ss_pred CCce-----eccCcHH------------HHHHHH-------H-HhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEE Q lcl|NC_013644. 89 NPVE-----YETENEE------------LKEYLA-------E-YYNSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCF 143 (510) Q Consensus 89 ~p~~-----~~~~d~~------------~~~~l~-------~-~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i 143 (510) -|+. +...+.. +.+.|. . +..+||.....++.++..++|.|.+ |.++++ + T Consensus 70 tpp~~~WF~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--y~~~~~---~ 144 (522) T protein:vir:10 70 LPPQTSFFKLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALI--FMGKDG---L 144 (522) T ss_pred cCCCCccccccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeE--EEcCCC---c Confidence 2322 2322211 122221 2 2247888899999999999999875 456654 4 Q ss_pred EEEcccceEEEEcCCCCceeEEEEEEEEEee------------C--CceeEEEEEEEEcCCcEEEEEEcCCceeeccccc Q lcl|NC_013644. 144 QVADSLNVFGVYNEYNELQRICRHYITEIEK------------D--GETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAE 209 (510) Q Consensus 144 ~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~------------~--~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~ 209 (510) +.++-.+++..-|..+++..+++-+...... . .....-..+++|+. .+.+.+.+.+.. T Consensus 145 ~~~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~---v~p~~~~~~~~~----- 216 (522) T protein:vir:10 145 KTFPLTRYVINRDGDGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTY---VKLDKSSGRWVW----- 216 (522) T ss_pred eEEEcceEEEeeCCCCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEE---EEeeccCCceEE----- Confidence 4555567666677778888777765543100 0 00111122333321 111111111110 Q ss_pred cccccccccccccc-cc-ccccccCCcccEEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEe Q lcl|NC_013644. 210 PINPRPHVLAVDSE-NE-SLLQRSYGQIPFYRLS-----NNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVS 282 (510) Q Consensus 210 ~~~~~~~~~~~~~~-~~-~~~~~~~g~iPvv~~~-----nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~ 282 (510) .....+. .. .....+|..+|++.++ .+.+|+|-.++..+-+..+|.+.-......+...+|.+.+. T Consensus 217 -------~~~~~~~~~~~~~s~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~ 289 (522) T protein:vir:10 217 -------HQEAFDKIIPDSRSTAPKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVS 289 (522) T ss_pred -------EEccCCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeec Confidence 0000000 00 0113466677877654 34679999999999999999998888888889999887763 Q ss_pred cCCCCchhhhhHhhhcCeeeeccCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHH Q lcl|NC_013644. 283 GFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYT 360 (510) Q Consensus 283 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~ 360 (510) --...+...+.. ...+.+..+..+++..+... .+.......++.++..|...-.. ...-.....|++.+..+ T Consensus 290 ~~~~~~~~~l~~--~~~~~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl~--~~~~d~~rvTAtEV~~r-- 363 (522) T protein:vir:10 290 PSSTTKPATIAK--AGNGAIVQGRPEDVAVIQVGKTADFSTAANMATAIEKRLLEAFLV--MNVRNAERVTAEEVRLT-- 363 (522) T ss_pred cccccccccccC--CCCcceecCCCccceeecccccccchHHHHHHHHHHHHHHHHHhh--ccCCCCCCCCHHHHHHH-- Confidence 222222222221 12234445556667666543 46677788888888877764221 11222344566666653 Q ss_pred HHHHHHHHHHHHHHHHHHH--------HHHHHHHHHhhccCCc---ccc-ceeeEEeCCCCCCCHHHHHHHHHHHHhc-- Q lcl|NC_013644. 361 LLNMKANKTEARLRALLEW--------MNKLVIDDINRRYTKA---FDP-TEVSFTFTREVMVNETDIVNDEKTEAET-- 426 (510) Q Consensus 361 ~l~~k~~~k~~~~~~~l~~--------~~~~i~~~~~~~~~~~---~~~-~~v~i~f~~~~p~d~~e~~~~~~~~~~~-- 426 (510) ..++...++..+.+ ++.-++.++...+.-+ .+. ....|++..++-+. +.++.+....+. T Consensus 364 -----~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~~v~~is~Lara--q~~~~l~~~~~~i~ 436 (522) T protein:vir:10 364 -----QLELEQQLGGIFSLLVIEFLIPYLNRTLLVLQRSNQIPKLPKDIVRPTIVAGVNALGRG--QDRESLTAFVGTIA 436 (522) T ss_pred -----HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccccccccccchhHHHHH--HHHHHHHHHHHHHH Confidence 34444444444444 2233333343333211 111 12234555555432 223332222111 Q ss_pred CCCchHHHHHhC------------CCC-------CcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCC Q lcl|NC_013644. 427 RKIILESILQVA------------PRL-------DDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNP 487 (510) Q Consensus 427 g~iS~et~~~~~------------~~v-------~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 487 (510) .++..+.+...+ -++ ++++..+..++.++......... .+..-.+....+ +..++ T Consensus 437 ~~~~p~~~~~~id~d~~~~~~a~~~Gvp~~~ivrt~eev~~~~q~~q~~~~~~~~~~--~a~~~~~~~~~~----~~~~~ 510 (522) T protein:vir:10 437 QTLGPEALMQYLNPLEAIKRLAAAQGIDVLNLVKTEQQLAEEQQAAQQQAAQQSLVD--QAGQMTGSPLMD----PTKNP 510 (522) T ss_pred HhhCchhhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHH--HHHHHhcccccC----ccccH Confidence 111123333222 112 22232222222222211111111 111111111111 11111 Q ss_pred CCcccccccCcccccccc Q lcl|NC_013644. 488 DDPTQQMAEGATGSTESQ 505 (510) Q Consensus 488 ~~~~~~~~~~~~~~~~~~ 505 (510) ..-.+.++.+. . T Consensus 511 ~~~~~~~~~~~------~ 522 (522) T protein:vir:10 511 QLMDEEQPPME------E 522 (522) T ss_pred HHHHHhCCCCC------C Confidence 11111111111 1 No 163 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=98.12 E-value=4.2e-06 Score=50.07 Aligned_cols=384 Identities=9% Similarity=-0.023 Sum_probs=161.0 Q ss_pred HHHHHHHhccCCcchhcccc--ee--ccccccccccccccceeccchhHHHHHHHHhhhhcCCceecc--CcHHH-HHHH Q lcl|NC_013644. 32 AETGIRYYNHENDIMNNRIF--YV--DDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNPVEYET--ENEEL-KEYL 104 (510) Q Consensus 32 ~~~~~~YY~g~~~i~~~~~~--~~--~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~--~d~~~-~~~l 104 (510) |-.+.+.+.+.......... .. ...+...-...-++.=+..+-....|+..++-+.+-|+.+-. ++... ..-+ T Consensus 1 Mgl~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~l 80 (409) T protein:vir:84 1 MSLFTRIFSGPSEERTLTKISGIPSPAEDWAMHGDRPGANSAMTLGAFYACVTLLADTVASLSIDAYRKKDNVRIPVSPA 80 (409) T ss_pred CchhhhhhcCCCcccccccccccccccchhhccCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcccccchH Confidence 11122222221100000000 00 000000000000000011223344466666666666765421 11111 1112 Q ss_pred HHHhc---c---CHHHHHHHHHHHHHhcCeEEEEE-EECCCCce-EEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCC Q lcl|NC_013644. 105 AEYYN---S---EFQVVLQELVEGSSQKGFEYVYA-RTNAEDRL-CFQVADSLNVFGVYNEYNELQRICRHYITEIEKDG 176 (510) Q Consensus 105 ~~~~~---n---~~~~~~~e~~~~~~~~G~~~~~v-~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 176 (510) -.++. | ........+....+.+|.+|.++ +.+..|++ .+.+++|..+.+........ +.++.....++ T Consensus 81 ~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p~~v~v~~~~~~~~----~~~~~~~~~~g 156 (409) T protein:vir:84 81 PKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIHPDCIHVTDAKDEDG----DWIEPVYRIDG 156 (409) T ss_pred HHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEcCceeEEEEcCCCcc----eEEEEEecCCc Confidence 22321 2 23455666778899999999765 46777775 58888998887654321111 01111000000 Q ss_pred ceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCCcHHHHHHH Q lcl|NC_013644. 177 ETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTDLKPIKAL 256 (510) Q Consensus 177 ~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~l 256 (510) ..+....+.+++.-. ..+...|.|.++.+... T Consensus 157 --------~~~~~~dvih~~~~~----------------------------------------~~~~~~G~s~i~~~~~~ 188 (409) T protein:vir:84 157 --------KVVPNHRIMHIKRYP----------------------------------------VAGCALGMSPIEKAASA 188 (409) T ss_pred --------eEEchhhEEEecCCC----------------------------------------CCcccccccHHHHHHHH Confidence 012233333322100 00112466767666666 Q ss_pred HHHHHHHHHHHHHHHHHhccceeEEecCCCCchhh---hhHh----h-hcCeeeeccCCCceeEEeecCCHHHHHHHHHH Q lcl|NC_013644. 257 IDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSK---LRQN----V-KSKKVVGTGSDGGLDVKTVTIPTEGRKTKMEI 328 (510) Q Consensus 257 iD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~---~~~~----~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 328 (510) ++....+..-..+.+...+.|-.+++....-+++. +.+. . ..++++.++++.+.+.+........+.+..+. T Consensus 189 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~~~~~~~d~q~~e~~~~ 268 (409) T protein:vir:84 189 IGLGLAAERYGLRWFRDSANPSGILSSDADLTPDQVKQTQKQWIQSHHNRRLPAVMSAGIKWQSVSITPNESQFLETRSF 268 (409) T ss_pred HHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhccCCCeeecCCCceEEEccCChhHHHHHHHHHH Confidence 65555544444555566666767665432222211 1111 1 12335556655555544433333455666677 Q ss_pred HHHHHHHHhCCccccccc--cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeC Q lcl|NC_013644. 329 DKENIYKFGMAFDSTQVG--DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFT 406 (510) Q Consensus 329 l~~~i~~~s~~p~~~~~~--~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~ 406 (510) ..+.|+..-++|+.-.+. .++.++..++...... +...|.-.++.|...+..+-. .-..+++.++ T Consensus 269 ~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f----------~~~~l~P~~~~ie~~l~~~L~---~g~~i~fd~~ 335 (409) T protein:vir:84 269 QRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINF----------VRHTLLPWLRCIEQALDTFLP---RGQFVKFNVD 335 (409) T ss_pred HHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHH----------HHHHHHHHHHHHHHHHHHhcc---CCCeEEEech Confidence 788898888888753322 2222222222222111 122222223333332222111 1123566666 Q ss_pred CCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCC Q lcl|NC_013644. 407 REVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVN 486 (510) Q Consensus 407 ~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (510) .-+..|..+.++.+.++.++|+++.-.+++.++.-.-+.-. ......+..+.. ..+...+.+++.++ T Consensus 336 ~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~ggD------------~~~~~~n~~~~~-~~~~~~~~~~~~~~ 402 (409) T protein:vir:84 336 GLMRGDVTARFTAYQMGLQNGIWSVNEVRAWEDAPPIPEGD------------IHLQPMNFVPLG-YVPPEEPAQEPQPN 402 (409) T ss_pred hhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc------------eeeecccccccc-cCCccccCcCCCCC Confidence 77778999999999999999999988888876532111100 000011111100 00111111110000 Q ss_pred CCCcccccccCcc Q lcl|NC_013644. 487 PDDPTQQMAEGAT 499 (510) Q Consensus 487 ~~~~~~~~~~~~~ 499 (510) .+ ..|.+ T Consensus 403 ~~------~~gn~ 409 (409) T protein:vir:84 403 SA------TEGNK 409 (409) T ss_pred Cc------cCCCC Confidence 00 00100 No 164 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=98.09 E-value=4.7e-06 Score=49.77 Aligned_cols=455 Identities=12% Similarity=0.081 Sum_probs=194.7 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhh--HHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSS--KREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~--~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) |-+-.+.+ .+..+..++..+..+..+. -.++..+.+|..-.- ....... ......++..+-+..- T Consensus 1 ~~~~~~~~-~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~-------~~~~~~~-----~~~~~~~~~dst~~~a 67 (535) T protein:vir:94 1 MASSQKRE-GFAENGAKAVYDALKNDRNSYETRAENCAKYTIPSL-------FPKDSDN-----ASTDYTTPWQAVGARG 67 (535) T ss_pred CCchhhhh-hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc-------CCCCCCc-----cccccCCcccccHHHH Confidence 44333322 2223334444444443221 123344444432210 0001100 1111234566667777 Q ss_pred HHHHHhhhhcC--Cce----eccCcH-------------HHHHHHHH-------Hh-ccCHHHHHHHHHHHHHhcCeEEE Q lcl|NC_013644. 79 VDQKTQYLLSN--PVE----YETENE-------------ELKEYLAE-------YY-NSEFQVVLQELVEGSSQKGFEYV 131 (510) Q Consensus 79 v~~~~~~l~g~--p~~----~~~~d~-------------~~~~~l~~-------~~-~n~~~~~~~e~~~~~~~~G~~~~ 131 (510) +++.++.|++- |.. +...+. ++.+.|.. .+ .+||...+.++.++..++|.|.+ T Consensus 68 ~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l 147 (535) T protein:vir:94 68 LNNLASKLMLALFPMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALL 147 (535) T ss_pred HHHHHHHHHhhhcCCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeE Confidence 77777666541 321 222221 12223322 22 47888999999999999999977 Q ss_pred EEEECCCCceEEEEEcccceEEEEcCCCCceeEEEEEEEEEee-----------CCceeEEEEEEEEcCCcEEEEEEcCC Q lcl|NC_013644. 132 YARTNAEDRLCFQVADSLNVFGVYNEYNELQRICRHYITEIEK-----------DGETVDIHHAEVWTDQNVYFFVAEDN 200 (510) Q Consensus 132 ~v~~d~~g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~e~y~~~~i~~~~~~~~ 200 (510) ++..+.....+++.++-.+++..-|..+++..+++-+++.... .........+++|+. .|....+ T Consensus 148 ~~~~~~~~~~~f~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~~~----v~~~~~~ 223 (535) T protein:vir:94 148 YIPEPEGTYNPMKLYRLSSYVVQRDAFGTVLQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTH----IYLDEES 223 (535) T ss_pred eeccCcCcccceEEEEcCeEEEeeCCCCCeEEEEeeeeccHHHhhHHHHHHHHhccccCCCceeEEEEE----EEeeCCC Confidence 7655544445677777777777778788887777665544210 001111223344432 1111111 Q ss_pred ceeeccccccccccccccccccccc--ccccccCCcccEEEecC-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 201 KDYELDEAEPINPRPHVLAVDSENE--SLLQRSYGQIPFYRLSN-----NKQETTDLKPIKALIDDYDLMNCFLSNNLQD 273 (510) Q Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~g~iPvv~~~n-----n~~g~sd~~~v~~liD~~n~~~S~~~~~~~~ 273 (510) ..+. .....++... .....+|..+|++.++- +.+|+|-.++..+-+..+|.+.-........ T Consensus 224 ~~~~-----------~~~e~~g~~~~~~~~~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~ 292 (535) T protein:vir:94 224 GEYL-----------KYEEIDGVEVEGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMI 292 (535) T ss_pred CcEE-----------EEEEecCeeeccccccCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111 1111111111 11234677788877653 4679999999999999999887777766666 Q ss_pred hccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCccccccccCccc Q lcl|NC_013644. 274 FAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNIT 351 (510) Q Consensus 274 ~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~S 351 (510) ...|.+.+.--+..++..+.. ...+.+..+..+++..+... .+.......++.++..|...-..-....-.....| T Consensus 293 a~~~~~lv~p~g~~~~~~~~~--~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~d~~rvT 370 (535) T protein:vir:94 293 SAKVIGLVNPAGITQVRRLTK--AQTGDFVSGRPEDISFLQLEKAADFSVARAVSEQIEGRLSYAFMLNSAVQRTGERVT 370 (535) T ss_pred hccCCcccccccccchhhccc--CCCceeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhHhhhccCCCCCcc Confidence 666665442111122222211 12234444555666665443 46677777777777777553221112112233346 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHhhccCCc-cccceeeEEeCCCCCC-CHHHHHHH-- Q lcl|NC_013644. 352 NIVIKARYTLLNMKANKTEARLRALLEW--------MNKLVIDDINRRYTKA-FDPTEVSFTFTREVMV-NETDIVND-- 419 (510) Q Consensus 352 g~Ai~~~~~~l~~k~~~k~~~~~~~l~~--------~~~~i~~~~~~~~~~~-~~~~~v~i~f~~~~p~-d~~e~~~~-- 419 (510) ++.++. +..++...++..+.+ +++..+.++...+.-+ ....-+.+.+..++.. .....++. T Consensus 371 AtEV~~-------r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~la~l~r~~~~~~l~ 443 (535) T protein:vir:94 371 AEEIRY-------VASELEDTLGGVYSILSQELQLPMVRVLLKQLQATNQIPELPKEAVEPTISTGMEALGRGQDLDKLE 443 (535) T ss_pred HHHHHH-------HHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEeehHHHHHHHHHHHHHH Confidence 666555 344555555554444 3333344443333221 1122345555443321 11111222 Q ss_pred --HHHHHhcC------CCchHHHHHhC------C---CC-CcHHHHHHHHHHHHHHHHHHHHHHHhhhcc-CCCCCCCCC Q lcl|NC_013644. 420 --EKTEAETR------KIILESILQVA------P---RL-DDDNVLRLICEQFDLDWEDVKEALEEAEYT-KGLSDNTDE 480 (510) Q Consensus 420 --~~~~~~~g------~iS~et~~~~~------~---~v-~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~-~~~~~~~~~ 480 (510) +..+.+.+ .+....++..+ | .+ +++|.+.+.++.++.+... ..++..... +......++ T Consensus 444 ~~~~~laq~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~rs~eev~~~~~q~~~~~~~~--~~~~~~g~~~~~~~~~~~~ 521 (535) T protein:vir:94 444 RCIAAWSALAPMQGDPDINIATIKLRIANAIGIDTSGILKTPEEKQQEMAEAAQGTAMQ--NAAASAGAGAGTMATASPE 521 (535) T ss_pred HHHHHHHhhChHHhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHH--HHHHHHHHhhhcccccChH Confidence 22111111 11122222221 2 11 2223222222222211111 111111111 110111110 Q ss_pred cccCCCCCCcccccccCcccc Q lcl|NC_013644. 481 EETAVNPDDPTQQMAEGATGS 501 (510) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~ 501 (510) .... .. ...|.+-+ T Consensus 522 ~~~~-----~~--~~~g~~~~ 535 (535) T protein:vir:94 522 NMKA-----AA--AQAGMAPN 535 (535) T ss_pred HHHH-----HH--HHhccCCC Confidence 0000 00 01111111 No 165 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=98.09 E-value=4.9e-06 Score=49.71 Aligned_cols=384 Identities=11% Similarity=0.038 Sum_probs=152.1 Q ss_pred HHHhccCCcchhccc-ceeccccc--cccccccc--cceeccchhHHHHHHHHhhhhcCCceec-c-CcHHHH-HHHHHH Q lcl|NC_013644. 36 IRYYNHENDIMNNRI-FYVDDEGI--LREDKYAS--NVRIPHGFFPEIVDQKTQYLLSNPVEYE-T-ENEELK-EYLAEY 107 (510) Q Consensus 36 ~~YY~g~~~i~~~~~-~~~~~~~~--~~~~~~~~--~~ki~~n~~~~Iv~~~~~~l~g~p~~~~-~-~d~~~~-~~l~~~ 107 (510) .++|++......-.- ......+. .....+.. -.|+.. .-..|+..++-+.+-|+.+- . .++... ..+..+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~Al~~~~--V~~cv~~ia~~iA~lp~~~~~~~~~~~~~~~~~~~l 78 (417) T protein:vir:38 1 MKLFRGLATEVDPHWADHLLDSGVIPSFRGGYLGISALRNSD--VLTAVSIVSGDVSRFPLVITDSSTDEVIDLANIEYL 78 (417) T ss_pred CccccccccCCCccchhhhcccccccccCCceechhhcccHH--HHHHHHHHHHhhccCeeEEEEcCCcceeccchHHHH Confidence 222333321110000 00000000 00000000 012211 11235555555555676642 1 111111 122333 Q ss_pred hc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCC-ce-EEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCcee Q lcl|NC_013644. 108 YN---SE---FQVVLQELVEGSSQKGFEYVYARTNAED-RL-CFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETV 179 (510) Q Consensus 108 ~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g-~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 179 (510) +. |. .......+....+.+|.||.++.++..| .+ .+.+++|..+.+..++.+++ .|++....++. T Consensus 79 L~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~~~~~~~~-----~y~~~~~~~~~-- 151 (417) T protein:vir:38 79 MNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVDTSDPDNI-----IYRFTPYNSSM-- 151 (417) T ss_pred HhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEEEcCCCeE-----EEEEEEcCCcE-- Confidence 31 22 2345556678889999999998887654 34 35678888887765543321 12211111110 Q ss_pred EEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCC----CCCCCcHHHHHH Q lcl|NC_013644. 180 DIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNN----KQETTDLKPIKA 255 (510) Q Consensus 180 ~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn----~~g~sd~~~v~~ 255 (510) ..++....+. ||++. -.|.|.+.-+.. T Consensus 152 ----~~~~~~~dvi---------------------------------------------H~r~~~~d~~~G~s~l~~~~~ 182 (417) T protein:vir:38 152 ----QKVCGFEDVI---------------------------------------------HWKFFSYDTIMGRSPLLSLGD 182 (417) T ss_pred ----EEEecCcceE---------------------------------------------EecCCCCCCccccCHHHHHHH Confidence 1112222233 33221 135566655555 Q ss_pred HHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh---hhhHhh-------hcCeeeeccCCCceeEEeecCCHHHHHHH Q lcl|NC_013644. 256 LIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS---KLRQNV-------KSKKVVGTGSDGGLDVKTVTIPTEGRKTK 325 (510) Q Consensus 256 liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~---~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (510) .|...+.+..-..+.+.-.+.|-.++.-...-+.+ .+++.+ ..++++.++++.+++.++.......+.+. T Consensus 183 ~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~ 262 (417) T protein:vir:38 183 EIGLQESGVSTLQKFFKSGLKGSIIKAKESRLSAEARQKIREDFERAQAGADAGSPIIVDATMDYQPLEVDTNVLNLINS 262 (417) T ss_pred HHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCceeccCCceEEEccCCHHHHHHHHH Confidence 55444444444444455555665555432211111 122111 12335555555544444433333344555 Q ss_pred HHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEe Q lcl|NC_013644. 326 MEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTF 405 (510) Q Consensus 326 ~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f 405 (510) .+...+.|...-++|+.-.+..+ ++..++ .....++...|.-+++.|...+..+--.........+.| T Consensus 263 ~~~~~~~Ia~~fgVPp~~lg~~~--~~s~~e----------~~~~~~~~~tl~P~~~~ie~~l~~~Ll~~~~~~~~~~~f 330 (417) T protein:vir:38 263 NNYSTAQIAKALRVPAYRLAQNS--PNQSVK----------QLADDYIRNDLPFYFEPITSEFELKLLDDAQRHQYCIGF 330 (417) T ss_pred HHhhHHHHHHHhCCCHHHhCCCC--cchhHH----------HHHHHHHHHHHHHHHHHHHHHHHhhhcChhhcccceEEe Confidence 66667788887788875543221 222111 111223444555555555544443222222223455677 Q ss_pred CCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHH-hhhccCCCCCCCCCcc Q lcl|NC_013644. 406 TREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALE-EAEYTKGLSDNTDEEE 482 (510) Q Consensus 406 ~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 482 (510) +..- .+.+. ...+.+++.+|+++.-.++++++. +++....+...- ..........+ +.+...... T Consensus 331 d~~~-l~~~~-~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~d~~~~~---~n~~~~d~~~~~~~~~~~~~k------- 398 (417) T protein:vir:38 331 DTKS-VNGLP-IADVNTAVNGGLWTGNEGRAELGKKPLKDPNMDRIQST---LNTVFLDQKEAYQAEHAAELK------- 398 (417) T ss_pred chhh-hhHHH-HHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeec---ccccccccccccccccccccC------- Confidence 5321 12222 334667888999999888887643 322111000000 00000000000 000000001 Q ss_pred cCCCCCCcccccccCcccccccc Q lcl|NC_013644. 483 TAVNPDDPTQQMAEGATGSTESQ 505 (510) Q Consensus 483 ~~~~~~~~~~~~~~~~~~~~~~~ 505 (510) ++++..+...+..++...+ T Consensus 399 ----gg~~~~~~~~~~~~~~~~~ 417 (417) T protein:vir:38 399 ----GGDTNAKGNQNGSGTNANS 417 (417) T ss_pred ----CCCCCCCCCCcCCCCcCCC Confidence 1111111111111222222 No 166 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=98.09 E-value=4.9e-06 Score=49.69 Aligned_cols=386 Identities=9% Similarity=0.037 Sum_probs=155.6 Q ss_pred ccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHh----ccCCcchhcccceeccccccccccccccceeccchhHHHH Q lcl|NC_013644. 4 LLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYY----NHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIV 79 (510) Q Consensus 4 ~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY----~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv 79 (510) |.++ .+...|++.+ +.++. -+-++... .. +....... .+.-+..+-...-| T Consensus 1 ~~~~---~~~~~~k~~~--------------~~~~~~~~~~~~~~~~~-----~~--~~~~~~v~-~~~a~~~~~v~~~i 55 (409) T protein:vir:94 1 MAKE---NIVTRIKKKL--------------IDNWIDQSASKLYDFSP-----WK--NKSFWGVI-NNTLETNETIFSAI 55 (409) T ss_pred Cccc---ccchhhhhHH--------------hhhhhcCCccccccccc-----cc--Cccccccc-hhhhhccHHHHHHH Confidence 1111 1111222211 11111 00001000 00 00000000 00001112223334 Q ss_pred HHHHhhhhcCCceeccCcHHHHHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceE Q lcl|NC_013644. 80 DQKTQYLLSNPVEYETENEELKEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVF 152 (510) Q Consensus 80 ~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~ 152 (510) +..++-+..-|+++-...+.....+..++. |. -......++...+.+|.||+++..+..|++ .+.+++|..+- T Consensus 56 ~~Ia~~ia~lp~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~ 135 (409) T protein:vir:94 56 TKLSNSMASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVE 135 (409) T ss_pred HHHHHhhhhCceeEeecccccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeE Confidence 444444555566642222222222233331 22 234445667888999999999989888875 67888998888 Q ss_pred EEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccC Q lcl|NC_013644. 153 GVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSY 232 (510) Q Consensus 153 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (510) ++.++.+.. + +|.+. ..++. .+ .+.+..+.+++.-. T Consensus 136 v~~~~~~~~--~--~y~~~-~~~g~-----~~-~~~~~dvih~r~~~--------------------------------- 171 (409) T protein:vir:94 136 MLIENQSRE--L--YYSIH-AATGN-----KL-IVHNMDMLHFKHIV--------------------------------- 171 (409) T ss_pred EEEeCCCcE--E--EEEEE-cCCce-----EE-EEccccEEEecCCC--------------------------------- Confidence 877643221 1 11111 11111 00 12233333332100 Q ss_pred CcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcc-ceeEE-ecCCCCch--hhhhHh----hh-cCeeee Q lcl|NC_013644. 233 GQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAE-AIYVV-SGFQGDDL--SKLRQN----VK-SKKVVG 303 (510) Q Consensus 233 g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~-~~lv~-~g~~~~~~--~~~~~~----~~-~~~~~~ 303 (510) | .+.-.|.|.+.-+...++..+.+.. + .+..+.. +-.++ .+...++. ...... .. .++++. T Consensus 172 ---~----~~~~~G~s~l~~~~~~i~~~~~~~~-~--~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~v 241 (409) T protein:vir:94 172 ---A----SNMVQGISPIDVLKNTTDFDNAVRT-F--NLTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILF 241 (409) T ss_pred ---C----CCccccccHHHHHHHHHHHHHHHHH-H--HHHhcCCCCeeEEecCCCCCHHHHHHHHHHHHHHhhcCCCeee Confidence 0 0111356666555555543333211 1 2223232 22233 33222221 111111 11 234556 Q ss_pred ccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 304 TGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKL 383 (510) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~ 383 (510) ++++.+++.++.+.....+.+..+...++|+..-++|+.-.+..++.+...++.... ..+...|.-+++. T Consensus 242 l~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~----------~f~~~~l~P~~~~ 311 (409) T protein:vir:94 242 QEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNR----------FYLQHTLLPIVKQ 311 (409) T ss_pred cCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH----------HHHHHHHHHHHHH Confidence 666655555554434445566667777888888888876443333222222221111 2233334444444 Q ss_pred HHHHHhhccCCcccc-ceeeEEe--CCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 384 VIDDINRRYTKAFDP-TEVSFTF--TREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWED 460 (510) Q Consensus 384 i~~~~~~~~~~~~~~-~~v~i~f--~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~ 460 (510) |...+..+--...+. ....+.| ..-+-.|..+.++.+.+++.+|+++.-.+++.++.-.-+--.+ T Consensus 312 ie~~ln~~Ll~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~------------ 379 (409) T protein:vir:94 312 YEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDK------------ 379 (409) T ss_pred HHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe------------ Confidence 444333321111111 1223444 4556778999999999999999999888877764311100000 Q ss_pred HHHHHHhhhccCCCCCCCCCcccCCCCCCcccccc Q lcl|NC_013644. 461 VKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMA 495 (510) Q Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (510) .....+..+ .+...+.+.. ..+|++...++ T Consensus 380 ~~~~~n~~~----~~~~~~~~~~-~kGG~~n~~e~ 409 (409) T protein:vir:94 380 PLISGDLYP----IDTPLELRKS-LKGGDKNVNES 409 (409) T ss_pred Eeecccccc----cccchhhccc-ccCCCCCcCCC Confidence 000000000 0000000000 01110000001 No 167 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=98.08 E-value=5e-06 Score=49.64 Aligned_cols=384 Identities=12% Similarity=0.068 Sum_probs=166.2 Q ss_pred HHHHHHHhccCC-cchhc-------ccceeccccccccccccccce-eccchhHHHHHHHHhhhhcCCceeccCcHH-HH Q lcl|NC_013644. 32 AETGIRYYNHEN-DIMNN-------RIFYVDDEGILREDKYASNVR-IPHGFFPEIVDQKTQYLLSNPVEYETENEE-LK 101 (510) Q Consensus 32 ~~~~~~YY~g~~-~i~~~-------~~~~~~~~~~~~~~~~~~~~k-i~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~-~~ 101 (510) |=.+ .+.+. .+... ........+. ......... +.+.-.-.-|+..++-+.+-|+++..+.+. .. T Consensus 1 Mg~f---~~~~~r~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~~~~~ 75 (416) T protein:vir:81 1 MGIF---YKNEKRDLQYNEDDLQMMVQTLPGFQGT--KLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINYS 75 (416) T ss_pred CCcc---cccccccccCCCcchhHHHHHhcccccc--CccccchhhhhcchHHHHHHHHHHHhhccCceEEecCcccccc Confidence 0000 00000 00000 0000000000 000000000 111111123555555566667776432221 12 Q ss_pred HHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCceeEEEEEEEEEee Q lcl|NC_013644. 102 EYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNELQRICRHYITEIEK 174 (510) Q Consensus 102 ~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~ 174 (510) ..+..++. |. .......+....+.+|.||+++.++..|++ .+.+++|..+.++.|+.+.+. |++...+ T Consensus 76 ~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~-----~~~~~~~ 150 (416) T protein:vir:81 76 DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLY-----YFHQRID 150 (416) T ss_pred chHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEE-----EEEEEec Confidence 22333332 22 234455667778899999999999988886 588899999988887655421 1111111 Q ss_pred CCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCCcHHHHH Q lcl|NC_013644. 175 DGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTDLKPIK 254 (510) Q Consensus 175 ~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~ 254 (510) .... .....+.+..+.+++.. |. +.-.|.|.++.+. T Consensus 151 ~~~~---~~~~~~~~~evihir~~-------------------------------------~~----d~~~G~s~i~~~~ 186 (416) T protein:vir:81 151 SNGN---NIERNVKFEDMLDIKFY-------------------------------------SL----DGINGLSLLDTLS 186 (416) T ss_pred CCCc---eeEEEEccccEEEeccC-------------------------------------CC----CCccccCHHHHHH Confidence 1100 01112333333333210 00 1113666666666 Q ss_pred HHHHHHHHHHHHHHHHHHHhccceeEEe--cCCCCch--hhhhHhh----h----cCeeeeccCCCceeEEeecCCHHHH Q lcl|NC_013644. 255 ALIDDYDLMNCFLSNNLQDFAEAIYVVS--GFQGDDL--SKLRQNV----K----SKKVVGTGSDGGLDVKTVTIPTEGR 322 (510) Q Consensus 255 ~liD~~n~~~S~~~~~~~~~~~~~lv~~--g~~~~~~--~~~~~~~----~----~~~~~~~~~~~~~~~~~~~~~~~~~ 322 (510) ..++.......-..+.+...+.|-.+++ |...++. ..++..+ . .++++.++++.+.+.++.+.....+ T Consensus 187 ~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~ 266 (416) T protein:vir:81 187 RTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKL 266 (416) T ss_pred HHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCceeecCCCceeEeccCCHHHHHH Confidence 6665555444444455566666666654 3221111 1111111 1 1335666666655555544444456 Q ss_pred HHHHHHHHHHHHHHhCCcccccccc-CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCcccccee Q lcl|NC_013644. 323 KTKMEIDKENIYKFGMAFDSTQVGD-GNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEV 401 (510) Q Consensus 323 ~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v 401 (510) .+..+..++.|+..-++|+.-.+.. ++.|...... .|...|.-++..|...+..+-........+ T Consensus 267 ~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~--------------~~~~~l~P~~~~ie~~ln~~l~~~~~~~~~ 332 (416) T protein:vir:81 267 IRENKSSTREIAGVFGIPLHKFGIETANMSITDANL--------------DYLSTLKPYITCVCAELNFKFNDEYVNREF 332 (416) T ss_pred HHHHHHHHHHHHHHhCCCHHHcCCCCCCccHHHHHH--------------HHHHHHHHHHHHHHHHHhhhccccccCceE Confidence 6677778888999888887543321 1112111111 122234444444444343322222222345 Q ss_pred eEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCC Q lcl|NC_013644. 402 SFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTD 479 (510) Q Consensus 402 ~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~ 479 (510) ++.+..-+-.|..+.++.+.++..+|+++.-.+.+.++. +++...... ....+..+.. ..+.... T Consensus 333 ~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~------------~~~~n~~~~~-~~~~~~~ 399 (416) T protein:vir:81 333 KFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIH------------RVDLNHVNIE-LVDEYQM 399 (416) T ss_pred EEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceE------------eecccccccc-cccccCc Confidence 555566667799999999999999999999888887643 222111000 0000000000 0000000 Q ss_pred CcccCCCCCCcccccccCcccccccc Q lcl|NC_013644. 480 EEETAVNPDDPTQQMAEGATGSTESQ 505 (510) Q Consensus 480 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (510) .+..... ...+|.++.+ T Consensus 400 ~~~~~~~---------~~~kgGe~n~ 416 (416) T protein:vir:81 400 NKSRATD---------KKLKGGEENE 416 (416) T ss_pred ccccccc---------cccCCCCCCC Confidence 0000000 0000000000 No 168 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=98.08 E-value=5e-06 Score=49.64 Aligned_cols=384 Identities=12% Similarity=0.068 Sum_probs=166.2 Q ss_pred HHHHHHHhccCC-cchhc-------ccceeccccccccccccccce-eccchhHHHHHHHHhhhhcCCceeccCcHH-HH Q lcl|NC_013644. 32 AETGIRYYNHEN-DIMNN-------RIFYVDDEGILREDKYASNVR-IPHGFFPEIVDQKTQYLLSNPVEYETENEE-LK 101 (510) Q Consensus 32 ~~~~~~YY~g~~-~i~~~-------~~~~~~~~~~~~~~~~~~~~k-i~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~-~~ 101 (510) |=.+ .+.+. .+... ........+. ......... +.+.-.-.-|+..++-+.+-|+++..+.+. .. T Consensus 1 Mg~f---~~~~~r~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~~~~~ 75 (416) T protein:vir:45 1 MGIF---YKNEKRDLQYNEDDLQMMVQTLPGFQGT--KLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINYS 75 (416) T ss_pred CCcc---cccccccccCCCcchhHHHHHhcccccc--CccccchhhhhcchHHHHHHHHHHHhhccCceEEecCcccccc Confidence 0000 00000 00000 0000000000 000000000 111111123555555566667776432221 12 Q ss_pred HHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCceeEEEEEEEEEee Q lcl|NC_013644. 102 EYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNELQRICRHYITEIEK 174 (510) Q Consensus 102 ~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~ 174 (510) ..+..++. |. .......+....+.+|.||+++.++..|++ .+.+++|..+.++.|+.+.+. |++...+ T Consensus 76 ~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~-----~~~~~~~ 150 (416) T protein:vir:45 76 DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLY-----YFHQRID 150 (416) T ss_pred chHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEE-----EEEEEec Confidence 22333332 22 234455667778899999999999988886 588899999988887655421 1111111 Q ss_pred CCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCCcHHHHH Q lcl|NC_013644. 175 DGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTDLKPIK 254 (510) Q Consensus 175 ~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~ 254 (510) .... .....+.+..+.+++.. |. +.-.|.|.++.+. T Consensus 151 ~~~~---~~~~~~~~~evihir~~-------------------------------------~~----d~~~G~s~i~~~~ 186 (416) T protein:vir:45 151 SNGN---NIERNVKFEDMLDIKFY-------------------------------------SL----DGINGLSLLDTLS 186 (416) T ss_pred CCCc---eeEEEEccccEEEeccC-------------------------------------CC----CCccccCHHHHHH Confidence 1100 01112333333333210 00 1113666666666 Q ss_pred HHHHHHHHHHHHHHHHHHHhccceeEEe--cCCCCch--hhhhHhh----h----cCeeeeccCCCceeEEeecCCHHHH Q lcl|NC_013644. 255 ALIDDYDLMNCFLSNNLQDFAEAIYVVS--GFQGDDL--SKLRQNV----K----SKKVVGTGSDGGLDVKTVTIPTEGR 322 (510) Q Consensus 255 ~liD~~n~~~S~~~~~~~~~~~~~lv~~--g~~~~~~--~~~~~~~----~----~~~~~~~~~~~~~~~~~~~~~~~~~ 322 (510) ..++.......-..+.+...+.|-.+++ |...++. ..++..+ . .++++.++++.+.+.++.+.....+ T Consensus 187 ~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~ 266 (416) T protein:vir:45 187 RTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKL 266 (416) T ss_pred HHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCceeecCCCceeEeccCCHHHHHH Confidence 6665555444444455566666666654 3221111 1111111 1 1335666666655555544444456 Q ss_pred HHHHHHHHHHHHHHhCCcccccccc-CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCcccccee Q lcl|NC_013644. 323 KTKMEIDKENIYKFGMAFDSTQVGD-GNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEV 401 (510) Q Consensus 323 ~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v 401 (510) .+..+..++.|+..-++|+.-.+.. ++.|...... .|...|.-++..|...+..+-........+ T Consensus 267 ~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~--------------~~~~~l~P~~~~ie~~ln~~l~~~~~~~~~ 332 (416) T protein:vir:45 267 IRENKSSTREIAGVFGIPLHKFGIETANMSITDANL--------------DYLSTLKPYITCVCAELNFKFNDEYVNREF 332 (416) T ss_pred HHHHHHHHHHHHHHhCCCHHHcCCCCCCccHHHHHH--------------HHHHHHHHHHHHHHHHHhhhccccccCceE Confidence 6677778888999888887543321 1112111111 122234444444444343322222222345 Q ss_pred eEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCC Q lcl|NC_013644. 402 SFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTD 479 (510) Q Consensus 402 ~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~ 479 (510) ++.+..-+-.|..+.++.+.++..+|+++.-.+.+.++. +++...... ....+..+.. ..+.... T Consensus 333 ~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~------------~~~~n~~~~~-~~~~~~~ 399 (416) T protein:vir:45 333 KFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIH------------RVDLNHVNIE-LVDEYQM 399 (416) T ss_pred EEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceE------------eecccccccc-cccccCc Confidence 555566667799999999999999999999888887643 222111000 0000000000 0000000 Q ss_pred CcccCCCCCCcccccccCcccccccc Q lcl|NC_013644. 480 EEETAVNPDDPTQQMAEGATGSTESQ 505 (510) Q Consensus 480 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (510) .+..... ...+|.++.+ T Consensus 400 ~~~~~~~---------~~~kgGe~n~ 416 (416) T protein:vir:45 400 NKSRATD---------KKLKGGEENE 416 (416) T ss_pred ccccccc---------cccCCCCCCC Confidence 0000000 0000000000 No 169 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=98.08 E-value=5.1e-06 Score=49.62 Aligned_cols=388 Identities=8% Similarity=-0.060 Sum_probs=169.3 Q ss_pred HHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCCceec Q lcl|NC_013644. 15 ALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNPVEYE 94 (510) Q Consensus 15 ~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~ 94 (510) ++-+-+-+.+..........+...+-+.... ..+..+.. ..=+.++-....|+..++-+.+-|+++- T Consensus 1 ~~f~~~f~r~~~~~~~~~~~~~~~~~~~~~~---------~~g~~v~~----~~~l~~~~v~~~i~~Ia~~iA~~p~~~~ 67 (413) T protein:vir:48 1 MFFSGLFQRKSDAPVTTPAELAEAIGLSYDT---------YTGKRISS----QRAMRLTAVYSCVRVLAESVGMLPCSLY 67 (413) T ss_pred CccchhhccCccCCccchHHHHHhhhcCccc---------ccCceech----hhhhccHHHHHHHHHHHHhhhhCceEEE Confidence 1111100001111110111121222111100 00000000 0001122233345555555556666542 Q ss_pred -cCcH----HHHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCce Q lcl|NC_013644. 95 -TENE----ELKEYLAEYYN---S---EFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNELQ 162 (510) Q Consensus 95 -~~d~----~~~~~l~~~~~---n---~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~ 162 (510) ..++ ....-+..++. | ........+....+.+|.||+++..+ .|++ .+.+++|..+.+..+..+.+ T Consensus 68 ~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~~~- 145 (413) T protein:vir:48 68 KISGTLKTRVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA-LGEVVELLPIDPGCVEPKLNSQWQP- 145 (413) T ss_pred EecCCcceeecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC-CCcEEEEEEEcCceEEEEEcCCceE- Confidence 1111 11112233331 2 23455666788899999999888765 4664 57788999988887754321 Q ss_pred eEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecC Q lcl|NC_013644. 163 RICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSN 242 (510) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n 242 (510) +|++....+ . ...|.+..+.+++.-. .+ T Consensus 146 ----~y~~~~~~g-~------~~~~~~~evih~~~~~-----------------------------------------~d 173 (413) T protein:vir:48 146 ----VYQVTFPDG-S------VDVLTQDEIWHVRTLT-----------------------------------------LD 173 (413) T ss_pred ----EEEEEecCc-e------EEEEccccEEEecCcC-----------------------------------------CC Confidence 222222111 1 1123444444443100 01 Q ss_pred CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCch---hhhhHhhh--------cCeeeeccCCCcee Q lcl|NC_013644. 243 NKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDL---SKLRQNVK--------SKKVVGTGSDGGLD 311 (510) Q Consensus 243 n~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~---~~~~~~~~--------~~~~~~~~~~~~~~ 311 (510) ...|.|-+..+...++....+..-..+.++..+.|-.+++....-+. ..+...+. .++++.++++.+++ T Consensus 174 ~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~~~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~ 253 (413) T protein:vir:48 174 GLVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTEQKLTPDAYERLKKDFEERHTGLGNAHRPMILEMGLDWK 253 (413) T ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEE Confidence 12466666666666665555544455555666667666654322121 22222211 12345566665555 Q ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_013644. 312 VKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRR 391 (510) Q Consensus 312 ~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~ 391 (510) .+........+.+..+...+.|+..-++|+.-.+..+..+...++... ...+...|.-+++.|...+..+ T Consensus 254 ~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~----------~~f~~~~i~P~~~~ie~~l~~~ 323 (413) T protein:vir:48 254 SMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELG----------LGFINYSLVPYLTRIEQRINTG 323 (413) T ss_pred eccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHH----------HHHHHHHHHHHHHHHHHHHHhh Confidence 555444444556777788888988888888544332222211111111 1223334444444444433322 Q ss_pred cCCcc--ccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_013644. 392 YTKAF--DPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAE 469 (510) Q Consensus 392 ~~~~~--~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~ 469 (510) --... ....+++.+..-+-.|..+.++.+.++.++|+++.-.++++++.-.-+.-. ......+..+ T Consensus 324 L~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~ggD------------~~~~~~n~~~ 391 (413) T protein:vir:48 324 LVRESKQGKFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGD------------VYLTPMNMTT 391 (413) T ss_pred ccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc------------eeeccccccc Confidence 11111 112344445566667999999999999999999998888876532111000 0000011010 Q ss_pred ccCCCCCCCCCcccCCCCCCcccccc Q lcl|NC_013644. 470 YTKGLSDNTDEEETAVNPDDPTQQMA 495 (510) Q Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (510) .....++..+..+.++.++..+ T Consensus 392 ----~~~~~~~~~~~~~~~~~~~~~~ 413 (413) T protein:vir:48 392 ----SPSAGDDNGKKKESGDADKTAS 413 (413) T ss_pred ----cccccccCCCCCCCCCccccCC Confidence 0111111111111111111111 No 170 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=98.07 E-value=5.4e-06 Score=49.45 Aligned_cols=405 Identities=11% Similarity=0.006 Sum_probs=167.9 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhH-HHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSK-REAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIV 79 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~-~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv 79 (510) |...... ....+++.+..+.....- ..-..+ +..-|... ..+..+ .++.-+.++-....| T Consensus 1 ~~~~~~~----~~~~~~~~~~~~~g~~~s~~~~~~~-~~~~~~~~----------~~g~~v----~~~~al~~~~v~~ci 61 (437) T protein:vir:10 1 MKQGKQR----ALGRIKSSFLKWLGVPISLTDGSFW-SAWGGMGS----------SSGETV----TADSALQLSAVWSCV 61 (437) T ss_pred CCcchhh----hhhhhHHhhhhhcCCcccCCchhHH-Hhhccccc----------CCCcee----chHhhhccHHHHHHH Confidence 2111111 111111111111000000 000000 00000000 000000 000001112233345 Q ss_pred HHHHhhhhcCCcee-ccC-c----HHHHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEE Q lcl|NC_013644. 80 DQKTQYLLSNPVEY-ETE-N----EELKEYLAEYYN---S---EFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVA 146 (510) Q Consensus 80 ~~~~~~l~g~p~~~-~~~-d----~~~~~~l~~~~~---n---~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~ 146 (510) +..++-+.+-|+++ ... + ......+..++. | ........++...+.+|.||+++..+. |++ .+.++ T Consensus 62 ~~Ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-g~~~~L~~l 140 (437) T protein:vir:10 62 RLIAETIATLPLNLYQTKPDGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRSA-GVLIGLELM 140 (437) T ss_pred HHHHHHHhhCceeEEEEcCCCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEecC-CcEEEEEEE Confidence 55555555556654 111 1 011112223331 2 233455667788899999999988874 765 47889 Q ss_pred cccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccc Q lcl|NC_013644. 147 DSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENES 226 (510) Q Consensus 147 ~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (510) +|..+-+..+..+.+ +|.+... ++. ...+.+..+.+++.-. T Consensus 141 ~p~~v~i~~~~~g~~-----~y~~~~~-~g~------~~~~~~~dIih~r~~~--------------------------- 181 (437) T protein:vir:10 141 LPQRTTVKRLTSGAL-----QYTYRNV-DGT------VSTLAEDDVFHVRGFS--------------------------- 181 (437) T ss_pred cCcceEEEECCCCeE-----EEEEEec-Cce------EEEEccccEEEecCcC--------------------------- Confidence 999888877644321 1221111 111 1123333344332100 Q ss_pred cccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh---hhhHhhh------ Q lcl|NC_013644. 227 LLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS---KLRQNVK------ 297 (510) Q Consensus 227 ~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~---~~~~~~~------ 297 (510) .+...|.|.+..+...++....+..-..+.+...+.|-.+++....-+.+ .+...+. T Consensus 182 --------------~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~ 247 (437) T protein:vir:10 182 --------------LDGLMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQILQKEKRAEIRTDLAEQFGGA 247 (437) T ss_pred --------------CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcCc Confidence 01124566666555555544444444455556666676666543221221 2222211 Q ss_pred --cCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc--CcccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 298 --SKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGD--GNITNIVIKARYTLLNMKANKTEARL 373 (510) Q Consensus 298 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~~Sg~Ai~~~~~~l~~k~~~k~~~~ 373 (510) .++++.++++.+.+.++.......+.+..+...+.|...-++|+.-.+.. ++..+..++.. ....+ T Consensus 248 ~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~----------~~~f~ 317 (437) T protein:vir:10 248 MQAGKTMVLEAGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQ----------TLGFL 317 (437) T ss_pred cccCcceeccCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHH----------HHHHH Confidence 13456666666555555444445556777778888988888887543322 22212222221 11233 Q ss_pred HHHHHHHHHHHHHHHhhcc--CCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCc-HHHHH Q lcl|NC_013644. 374 RALLEWMNKLVIDDINRRY--TKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDD-DNVLR 448 (510) Q Consensus 374 ~~~l~~~~~~i~~~~~~~~--~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d-~e~~~ 448 (510) ...|.-.+..|...+..+- ........+++.+..-+..|..+.++.+.++..+|+++.-.+++.++. +.. .+... T Consensus 318 ~~tl~P~~~~ie~~l~~kll~~~e~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gg~~~~~ 397 (437) T protein:vir:10 318 TFTLRPWLTRIEQAARRSLLRPGERDQFYAEFSVEGLLRADSAGRAAFYSTMTQNGLMTRDECRAKENLPPMGGNAAVLT 397 (437) T ss_pred HHHHHHHHHHHHHHHHhhccCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcceEe Confidence 3444444444444443211 111112235555566678899999999999999999999888887643 221 11000 Q ss_pred HHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCccccccc Q lcl|NC_013644. 449 LICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTES 504 (510) Q Consensus 449 ~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (510) . .. .........+.. ++ . . .+.+...++..+ ...++..++ T Consensus 398 -~--~~--~~~~~~~~~~~~--~~---~-~--~~~~~~~~~~~~---~~~~~~~e~ 437 (437) T protein:vir:10 398 -V--QS--ALLPIDKLGEHT--TA---T-A--AQDALKAWLYQE---EKTRATQER 437 (437) T ss_pred -e--cC--cccchhhccCcC--CC---c-c--hhccccccCCCC---CCCCccccC Confidence 0 00 000000000000 00 0 0 000000000000 111111222 No 171 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=98.06 E-value=5.6e-06 Score=49.37 Aligned_cols=446 Identities=11% Similarity=0.073 Sum_probs=191.2 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhh--HHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSS--KREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~--~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) |-..+.+ .+-++.+++..+..+..+. -.++..+.+|..-.. .. ..+. ...+...++..+-+..- T Consensus 1 m~~~~~~--~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~-------~~--~~~~---~~~~~~~~~~dst~~~a 66 (532) T protein:vir:99 1 MAEVEKT--GFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSV-------FP--SATA---DGSTSYTTPWQSIGARG 66 (532) T ss_pred Ccchhhc--cccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcc-------cC--CCCC---cchhhccccccchHHHH Confidence 4433321 1123445555554443221 123444444432211 00 0111 11112245666777777 Q ss_pred HHHHHhhhhc--CCce-----eccCcHH-------------HHHHH-------HHHh-ccCHHHHHHHHHHHHHhcCeEE Q lcl|NC_013644. 79 VDQKTQYLLS--NPVE-----YETENEE-------------LKEYL-------AEYY-NSEFQVVLQELVEGSSQKGFEY 130 (510) Q Consensus 79 v~~~~~~l~g--~p~~-----~~~~d~~-------------~~~~l-------~~~~-~n~~~~~~~e~~~~~~~~G~~~ 130 (510) +++.++.|++ -|+. +...+.. +...| ...+ .+||...+.++.++..++|.|. T Consensus 67 ~~~LAa~L~~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ 146 (532) T protein:vir:99 67 LNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVL 146 (532) T ss_pred HHHHHHHHHHhhcCCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEe Confidence 7777777764 2322 2332211 22222 2222 4788889999999999999998 Q ss_pred EEEEECCC---CceEEEEEcccceEEEEcCCCCceeEEEEEEEEEeeC------------CceeEEEEEEEEcCCcEEEE Q lcl|NC_013644. 131 VYARTNAE---DRLCFQVADSLNVFGVYNEYNELQRICRHYITEIEKD------------GETVDIHHAEVWTDQNVYFF 195 (510) Q Consensus 131 ~~v~~d~~---g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~e~y~~~~i~~~ 195 (510) +++..++. ...+++.++-.+++..-|..+++..+++-.++....- ........+++|+.- + T Consensus 147 l~~~~~~~~~~~~~~f~~~pl~~y~v~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~p~~~v~v~~~v----~ 222 (532) T protein:vir:99 147 LYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEDAQGDQNPSEEVTIYTHV----Y 222 (532) T ss_pred EEecccccccCcccceEEEEcCeEEEeeCCCCCeeeEeeeeeecHHhcChHHHHHhhccccccCCCcceEEEEEE----E Confidence 87755432 3456667777777776777777777766443331100 001112233443321 0 Q ss_pred EEcCCceeeccccccccccccccccccccc--ccccccCCcccEEEecC-----CCCCCCcHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 196 VAEDNKDYELDEAEPINPRPHVLAVDSENE--SLLQRSYGQIPFYRLSN-----NKQETTDLKPIKALIDDYDLMNCFLS 268 (510) Q Consensus 196 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~g~iPvv~~~n-----n~~g~sd~~~v~~liD~~n~~~S~~~ 268 (510) ...++..+. +.....+... .....+|..+|++.++- +.+|+|-..+..+-+..+|.+.-... T Consensus 223 ~~~~~~~~~-----------~~~~~~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l 291 (532) T protein:vir:99 223 RDPEAMVFR-----------SYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIV 291 (532) T ss_pred ecCCCCeeE-----------EEEeecCceecccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHH Confidence 001110000 0000111111 11122355677776553 46799999999999999998877777 Q ss_pred HHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCccccccc Q lcl|NC_013644. 269 NNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDSTQVG 346 (510) Q Consensus 269 ~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~ 346 (510) ........|.+.+.--+..++..+.. ...+.+..+..+++..+... .+.......++.++..|-..-..-....-. T Consensus 292 ~~~~~a~~~~~lv~p~g~~~~~~~~~--~~~g~~v~g~~~~i~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~d 369 (532) T protein:vir:99 292 KMSMISSKVLFFVNPNGVTQIRRVAK--ANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRG 369 (532) T ss_pred HHHHHHcCCCceeccccccchhhhcc--CCCcceecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCC Confidence 77777777775543111222222211 12233444445556665433 456777777877777775422111111122 Q ss_pred cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHhhccCCc---cccceee-EEeCCCCCCCHH Q lcl|NC_013644. 347 DGNITNIVIKARYTLLNMKANKTEARLRALLEW--------MNKLVIDDINRRYTKA---FDPTEVS-FTFTREVMVNET 414 (510) Q Consensus 347 ~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~--------~~~~i~~~~~~~~~~~---~~~~~v~-i~f~~~~p~d~~ 414 (510) ....|++.+.. ++.++...++..+.+ ++...+.++...+.-+ .+...+. +++-.++- .+ T Consensus 370 ~~r~TAtEV~~-------r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~iv~~is~La--ra 440 (532) T protein:vir:99 370 GDRVTAEEIRY-------VAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALG--RG 440 (532) T ss_pred CCcccHHHHHH-------HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhcccceeecchHHH--HH Confidence 33346655554 344555555554444 3333344443333211 1111222 23333332 22 Q ss_pred HHHHHHHHHHh--cCC-------CchHHHHHh----CC-----CCCc-HHHHHHHHHHHHHHHHHHHHHHHhhhccCCCC Q lcl|NC_013644. 415 DIVNDEKTEAE--TRK-------IILESILQV----AP-----RLDD-DNVLRLICEQFDLDWEDVKEALEEAEYTKGLS 475 (510) Q Consensus 415 e~~~~~~~~~~--~g~-------iS~et~~~~----~~-----~v~d-~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~ 475 (510) +.++.+....+ +.+ +....++.. ++ .+.. +|.+.+.++++..+.. ..+.++.....+.. T Consensus 441 q~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~GV~~~~i~r~~ee~~~~~~q~~~~~~~--~~a~~~~~~~~~~~ 518 (532) T protein:vir:99 441 HDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGM--VTAGQQMGAAGGQA 518 (532) T ss_pred HHHHHHHHHHHHHHhhcchhhhhCCHHHHHHHHHHHhCCChhhccCCHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHh Confidence 22222221111 111 112222221 11 1122 2333332222222211 11111111111111 Q ss_pred C-CCCCcccCCCCCCccc Q lcl|NC_013644. 476 D-NTDEEETAVNPDDPTQ 492 (510) Q Consensus 476 ~-~~~~~~~~~~~~~~~~ 492 (510) . ...+...+.+. + T Consensus 519 ~~~~~~~~~~~~~----~ 532 (532) T protein:vir:99 519 AAAMMQQQAGMPT----Q 532 (532) T ss_pred cchhHHhhcCCCC----C Confidence 1 11111111110 0 No 172 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=98.04 E-value=6.1e-06 Score=49.16 Aligned_cols=382 Identities=11% Similarity=0.022 Sum_probs=168.1 Q ss_pred hhhhHHHHHHHHHhhhhhhhHHH-HH-HHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhh Q lcl|NC_013644. 9 VKIIANALKAAIDKDRKSSSKRE-AE-TGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYL 86 (510) Q Consensus 9 ~~~~~~~i~~~i~~~~~~~~~~~-~~-~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l 86 (510) +-.. +.+..+. +....... .. .+-.+. |.. ...+..-+..+-....|+..++-+ T Consensus 1 MG~~-~~~~~~~---~~~~~~~~~~~~~~~~~~-g~~-------------------~~~~~~al~~~~V~~~v~~Ia~~i 56 (411) T protein:vir:81 1 MGWW-SRLTRFF---RPRNETVDMTNPLLLQWL-GVD-------------------PDTPRNQLSEATYFACLKILSESL 56 (411) T ss_pred CchH-HHHHhhc---cCcccccccchHHHHHHh-cCc-------------------ccChhhhhccHHHHHHHHHHHHhH Confidence 1111 1111111 10000000 00 000000 100 000111111222334456666666 Q ss_pred hcCCceec--cCc---HHHHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEE Q lcl|NC_013644. 87 LSNPVEYE--TEN---EELKEYLAEYYN---S---EFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGV 154 (510) Q Consensus 87 ~g~p~~~~--~~d---~~~~~~l~~~~~---n---~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~ 154 (510) .+-|+.+- .++ +.....+..+++ | ........+....+.+|.||+++..+. |++ .+.+++|..+-++ T Consensus 57 A~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-g~~~~l~~l~~~~v~~~ 135 (411) T protein:vir:81 57 GKLPLKMYQKTERGIVKSDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYSG-PQLQALWILPSQYVTIV 135 (411) T ss_pred hhCceeEEEecCCceeeecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC-CceEEEEEECCceEEEE Confidence 66676651 111 111112233321 2 234555667788899999999888874 554 5788999999998 Q ss_pred EcCCCCcee-EEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCC Q lcl|NC_013644. 155 YNEYNELQR-ICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYG 233 (510) Q Consensus 155 ~d~~~~~~~-~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 233 (510) .|+.+.... ...+|.+....++.. ..+..+.+.+++... T Consensus 136 ~~~~~~~~~~~~~~~~~~~~~~g~~------~~~~~~eiih~k~~~---------------------------------- 175 (411) T protein:vir:81 136 VDDRGLLGEKNAIWYRYNDPYDGKM------YVFRNDEILHFKTSV---------------------------------- 175 (411) T ss_pred EcCcccccccceEEEEEEecCCceE------EEEccccEEEEcCCC---------------------------------- Confidence 876542211 111222221111110 012233333332100 Q ss_pred cccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh---hhhHhhh--------cCeee Q lcl|NC_013644. 234 QIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS---KLRQNVK--------SKKVV 302 (510) Q Consensus 234 ~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~---~~~~~~~--------~~~~~ 302 (510) | .+.-.|.|.+.-+...++....+..-..+.+...+.|-.+++....-+.+ .+...+. .++++ T Consensus 176 --~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~ 249 (411) T protein:vir:81 176 --T----FDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTGDLNQEARDRLVKGFEQFANGSKNAGKII 249 (411) T ss_pred --C----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCCce Confidence 0 01123666666666666555555444555556666677776553221111 1121111 12355 Q ss_pred eccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 303 GTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNK 382 (510) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~ 382 (510) .++++.+++.+..+.....+.+..+...+.|...-++|+.-.+...+.+-..++. .....+..+|.-.+. T Consensus 250 vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~----------~~~~f~~~~l~P~~~ 319 (411) T protein:vir:81 250 PVPLGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSSYASAEA----------QNLAFYVDTLLYVLK 319 (411) T ss_pred ecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHH----------HHHHHHHHHHHHHHH Confidence 5666666555554434445556677888899998889976443222111111111 112334445555555 Q ss_pred HHHHHHhhccCC--cc-ccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHH Q lcl|NC_013644. 383 LVIDDINRRYTK--AF-DPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWE 459 (510) Q Consensus 383 ~i~~~~~~~~~~--~~-~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~ 459 (510) .|...+..+--. .. ....+++.+..-+-.|..+.++.+.++..+|+++.-.++++++.-..+.-.+. T Consensus 320 ~ie~~l~~~ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~~gl~p~~ggD~~---------- 389 (411) T protein:vir:81 320 QYEEEITYKILSNDLISQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDYLDMPADDYGNNL---------- 389 (411) T ss_pred HHHHHHHhhcCChhhcCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCee---------- Confidence 555544432111 11 11234445556677899999999999999999998888877654211100000 Q ss_pred HHHHHHHhhhccCCCCCCCCCcccCCCCCCc Q lcl|NC_013644. 460 DVKEALEEAEYTKGLSDNTDEEETAVNPDDP 490 (510) Q Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (510) ....+..+- +.-+++. ..+|+. T Consensus 390 --~~~~n~~pl----~~~~~~~---~kgGd~ 411 (411) T protein:vir:81 390 --MANGNYIPL----SMLGANY---GKGGDS 411 (411) T ss_pred --eeccCccch----hhhhhhh---ccCCCC Confidence 000000000 0000000 000100 No 173 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=98.04 E-value=6.3e-06 Score=49.10 Aligned_cols=386 Identities=10% Similarity=-0.033 Sum_probs=185.1 Q ss_pred CCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhc----cC----CcchhcccceeccccccccccccccceeccchhHH Q lcl|NC_013644. 6 SEDVKIIANALKAAIDKDRKSSSKREAETGIRYYN----HE----NDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPE 77 (510) Q Consensus 6 ~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~----g~----~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ 77 (510) -+-+.+. ++..-.....+++. |- .+|+..... ..-...+. . .......- T Consensus 1 v~~~~l~--------------~e~at~~~~~d~~~~~~~~l~~~~~~il~~a~~---g~~~~y~~-----l-~~D~~i~s 57 (488) T protein:vir:99 1 MEKPALG--------------REIATSGDGRDITRPFISGLQVPNDSILQRRGG---NDLRVYEE-----I-LSDAQVKT 57 (488) T ss_pred CCccchh--------------HHHHHHHhhhhhhccccCCCCCCChHHHHhhcc---CCHHHHHH-----H-hhChHHHH Confidence 1111111 11000000112211 11 111111000 00000000 0 11345666 Q ss_pred HHHHHHhhhhcCCceeccCc-----HHHHHHHHHHhcc-CHHHHHHHHHHHHHhcCeE-EEEEEECCCCceE---EEEEc Q lcl|NC_013644. 78 IVDQKTQYLLSNPVEYETEN-----EELKEYLAEYYNS-EFQVVLQELVEGSSQKGFE-YVYARTNAEDRLC---FQVAD 147 (510) Q Consensus 78 Iv~~~~~~l~g~p~~~~~~d-----~~~~~~l~~~~~n-~~~~~~~e~~~~~~~~G~~-~~~v~~d~~g~~~---i~~~~ 147 (510) .+++...-++|.+..+.+.+ ....++++++++. +|.+.+.++. ++..+|.+ ++++|.-.+|.+. +.+++ T Consensus 58 ~l~~rk~av~~~~w~i~p~~~~~~~~~~ae~v~~~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~ 136 (488) T protein:vir:99 58 VWGQRQLAVVSREWKVEAGGDRPIDQAAAEHLEQQLQRVGWDRVTSKML-FGVFYGYAVSELIYGRDDRYITLEAIKVRN 136 (488) T ss_pred HHHHHHHHHhcCCceEEcCCCChHHHHHHHHHHHHHhCCCHHHHHHHHH-hhhhhcceeEEEEEeecCCeeeEeeeeeec Confidence 77777788889998886533 3345678887764 6777766655 67888964 5667754455543 44445 Q ss_pred ccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccccccccccccc Q lcl|NC_013644. 148 SLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESL 227 (510) Q Consensus 148 p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (510) |+.+. ||..+.+. +....+... ... T Consensus 137 ~~~f~--~d~~~~l~--------------------------------~~~~~~~~~---------------------g~~ 161 (488) T protein:vir:99 137 RRRFR--YDQDGGLR--------------------------------LLTPNNMFE---------------------GEP 161 (488) T ss_pred cccee--ecCCCceE--------------------------------EeccCCCCC---------------------ccc Confidence 54332 33222110 000000000 000 Q ss_pred ccccCCcccEEEe--cCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCC-CCchhh------hhHhhhc Q lcl|NC_013644. 228 LQRSYGQIPFYRL--SNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQ-GDDLSK------LRQNVKS 298 (510) Q Consensus 228 ~~~~~g~iPvv~~--~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~-~~~~~~------~~~~~~~ 298 (510) .+.+++.|-.++- ..++.|.|.+..+-...--=+..+.+++..++.++.|+++.+-.. +.+..+ ....+.. T Consensus 162 lp~~~~~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a~~~ek~~l~~av~~~~~ 241 (488) T protein:vir:99 162 CPAPYFWHFSTGADNDDEPYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTATPEDKAKLLAALHAIQT 241 (488) T ss_pred cccCceEEEEeecCCCCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCCCHHHHHHHHHHHHHHhc Confidence 1112222211111 125668888887765544455567888999999999999877432 222222 1233445 Q ss_pred CeeeeccCCCceeEEeec-CCHHHHHHHHHHHHHHHHHH--hCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 299 KKVVGTGSDGGLDVKTVT-IPTEGRKTKMEIDKENIYKF--GMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRA 375 (510) Q Consensus 299 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~l~~~i~~~--s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~ 375 (510) .....++.+.++++++.. .+...++..++.+.+.|... .+|.....++++.+.|..-.- -....+..-.+.+.. T Consensus 242 ~~~~viP~~~~ie~~ea~~~~~~~~~~li~~~d~~Isk~iLGqtlts~~~~Gs~a~~~vh~~---v~~d~~~aDa~~i~~ 318 (488) T protein:vir:99 242 DSAIIMPAGMQAELLEAGRSGTADYKTLHDTMDATIAKVGLGQVASTQGTPGRLGNDDLQAD---VRLDLVKADADLICE 318 (488) T ss_pred CcEEEecCCceeEEeecCCCChHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHHH---HHHHHHHHHHHHHHH Confidence 567778889999999864 45567899999999988765 344333222223333433222 223334444466666 Q ss_pred HHH-HHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhc-CC-CchHHHHHhCCCCCcHHHHHHHHH Q lcl|NC_013644. 376 LLE-WMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAET-RK-IILESILQVAPRLDDDNVLRLICE 452 (510) Q Consensus 376 ~l~-~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~-g~-iS~et~~~~~~~v~d~e~~~~~~e 452 (510) .+. ++++.++.+ +.. . ..-..+.|...-+.|.++.++.+.++... |+ ++.+.+.+.++.-.....+ T Consensus 319 tln~~li~~l~~~-N~~--~---~~~p~~~~~~~e~edl~~~a~~~~~l~~~~G~~i~~~~i~e~~Gip~~~~~~----- 387 (488) T protein:vir:99 319 SFNLGPARWLTEW-NFP--G---AQPPRVYRVIEEPEDITAKAERDEKVFRMSGFRPTRGYVQETYGVEVESTQA----- 387 (488) T ss_pred HHHHHHHHHHHHh-CcC--C---cCCceeEecCCCcccHHHHHHHHHHHHhhcCCCCCHHHHHHHcCCCCccccc----- Confidence 664 466655543 221 1 12246788888999999999999999885 76 6777777777542211000 Q ss_pred HHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 453 QFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 453 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) . ...+.+.. ..... ..+.++. .....++.... T Consensus 388 -------~-----~~~~~~~~-~~~~~-----~~~~~~~--------~~~~~~~~~~~ 419 (488) T protein:vir:99 388 -------E-----ATAPTPST-EFAEG-----DQPSDPA--------AAMAPQLAEAM 419 (488) T ss_pred -------c-----cccCCCcc-cCCCC-----CCCCCch--------HHHHHHHHHHH Confidence 0 00000000 00000 0000000 00011111111 No 174 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=98.04 E-value=6.4e-06 Score=49.07 Aligned_cols=366 Identities=9% Similarity=0.015 Sum_probs=159.0 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |--. ..+++. +... ... ..++.+-- ...-......+..+. +..-+.++-.-..|+ T Consensus 1 M~~f---------~~~~~~----~~~~----~~~-~~~~~~~~---~~~~~~~~~~~~~v~----~~~al~~~~v~~~i~ 55 (386) T protein:vir:49 1 MPIF---------NITNLA----TESP----PIN-QESFFDIA---DSDFLASLNSSEWVS----AENALKNSDLFSIIS 55 (386) T ss_pred Cchh---------hhhccC----CCCc----ccc-hhhhhhhh---hccccccccCCceec----hhhhhccHHHHHHHH Confidence 1110 000000 0000 000 00000000 000000000000000 000111122233445 Q ss_pred HHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCC Q lcl|NC_013644. 81 QKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEY 158 (510) Q Consensus 81 ~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~ 158 (510) ..++-+.+-|+.+.- ......+.+=.. .........+....+.+|.||+++-.+.+|++ .+.+++|..+-++.++. T Consensus 56 ~ia~~ia~~p~~~~~--~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i~~~~v~v~~~~~ 133 (386) T protein:vir:49 56 QLSNDLATAKITTSR--KQLQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDN 133 (386) T ss_pred HHHHHhhhCceeecc--chhhhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEecCceeEEEEcCC Confidence 555555566766532 222222221111 12345556677888999999999888888875 67888998887776543 Q ss_pred CCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEE Q lcl|NC_013644. 159 NELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFY 238 (510) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv 238 (510) ... + +|.+...+..... ...+....+ + T Consensus 134 ~~~---~-~y~~~~~~~~~~~----~~~~~~~ev---------------------------------------------i 160 (386) T protein:vir:49 134 QNG---L-YYNITFDDPHIAP----KQHVPQNDI---------------------------------------------L 160 (386) T ss_pred Cce---E-EEEEEEcCccccc----eeEEccccE---------------------------------------------E Confidence 221 1 1111111110000 011222333 3 Q ss_pred EecCC-----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhh---hhHhh-----hcCeeeecc Q lcl|NC_013644. 239 RLSNN-----KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSK---LRQNV-----KSKKVVGTG 305 (510) Q Consensus 239 ~~~nn-----~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~---~~~~~-----~~~~~~~~~ 305 (510) ||++. -.|.|.+..+...++....+..-..+.+...+.|-.+++-........ ..... ..++++.++ T Consensus 161 h~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~n~g~~~vl~ 240 (386) T protein:vir:49 161 HFRLLSVDGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLD 240 (386) T ss_pred EecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChHHHHHHHHHHHHhccCCCCceecC Confidence 33321 246677776666666555554444555566666766654322222111 11111 123456666 Q ss_pred CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc--CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 306 SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGD--GNITNIVIKARYTLLNMKANKTEARLRALLEWMNKL 383 (510) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~ 383 (510) ++.+++.+..+.....+.+..+...+.|+..-++|+.-.+.. +..++..++.. +...+..+++. T Consensus 241 ~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~--------------~~~~i~~~l~~ 306 (386) T protein:vir:49 241 DLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDGDQQSSLEMIYNI--------------YFKSVSRYLRP 306 (386) T ss_pred CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccchHHHHHHH--------------HHHHHHHHHHH Confidence 666666555444555667778888899999889998644322 22333333322 22233333333 Q ss_pred HHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCC---CCCcHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 384 VIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAP---RLDDDNVLRLICEQFDLDWED 460 (510) Q Consensus 384 i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~---~v~d~e~~~~~~e~~e~~~~~ 460 (510) +...+..+-. ..+.+.....+-.|..+.+..+.++..+|+++.-.+++++. +..++ .- T Consensus 307 i~~~~~~~l~-----~~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~~~~-~~------------- 367 (386) T protein:vir:49 307 FVSEMSKKLS-----CEVDVDISPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQAEILPKE-LP------------- 367 (386) T ss_pred HHHHHHHHhc-----chhcccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCCCCc-Cc------------- Confidence 3333322111 12333344445567778888888999999988877766542 21111 00 Q ss_pred HHHHHHhhhccCCCCCCCCCcccCCCCCCcccccc Q lcl|NC_013644. 461 VKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMA 495 (510) Q Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (510) ....+..... + +++.++ +. T Consensus 368 ----~~~~~~~~~~-------~-gGd~~~----~~ 386 (386) T protein:vir:49 368 ----DGKNPNRTSL-------K-GGEINE----QD 386 (386) T ss_pred ----chhccCCCCC-------C-CCCCCC----CC Confidence 0000000000 0 000000 00 No 175 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=97.99 E-value=7.9e-06 Score=48.56 Aligned_cols=389 Identities=8% Similarity=-0.029 Sum_probs=163.0 Q ss_pred hhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcC Q lcl|NC_013644. 10 KIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSN 89 (510) Q Consensus 10 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~ 89 (510) .....+++..- +......-+..+-....+.. ...+..+- +..-+..+-....|+..++-+.+- T Consensus 1 m~~~~~~~~~~---~~~s~~~~w~~~~~~~~~~~----------~~~g~~vt----~~~al~~~~v~~~i~~Ia~~iA~l 63 (421) T protein:vir:10 1 MFIPQMFEGKK---RSVSGGGFWEAMLGGVRSSH----------SKAGVMIT----PETALALSAVRACVTLLAESVAQL 63 (421) T ss_pred CCCcchhcccc---cccCcchhhHHHhhhhccCc----------ccCCceec----hHHhhccHHHHHHHHHHHHhhccC Confidence 00111111000 00000000000000000000 00000000 000011222333455555555566 Q ss_pred Ccee-cc-CcH---HH-HHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEc Q lcl|NC_013644. 90 PVEY-ET-ENE---EL-KEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYN 156 (510) Q Consensus 90 p~~~-~~-~d~---~~-~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d 156 (510) |+.+ .. .+. .. ..-+..++. |. .......+....+.+|.||+++-.+.+|++ .+.+++|..+.+..+ T Consensus 64 p~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~ 143 (421) T protein:vir:10 64 PVELYRRDKNGGRQRATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGYPKELIPINPKKVIVLKG 143 (421) T ss_pred ceEEEEEcCCCceeecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEEC Confidence 7664 11 111 11 112223221 22 234455667889999999999999988876 477788888887665 Q ss_pred CCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCccc Q lcl|NC_013644. 157 EYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIP 236 (510) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP 236 (510) +.+.+ +|.+ ...+. .+....+++++.- + T Consensus 144 ~~g~~-----~y~~--~~~g~--------~~~~~eiih~~~~-------------------------------------~ 171 (421) T protein:vir:10 144 PDGMP-----YYEI--PEIGE--------TLPMRMMHHVKVF-------------------------------------S 171 (421) T ss_pred CCceE-----EEEE--cCCCc--------EEchhhEEEecCc-------------------------------------C Confidence 44321 1111 11110 1112222222100 0 Q ss_pred EEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCC---CC-ch---hhhhHhhh--------cCee Q lcl|NC_013644. 237 FYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQ---GD-DL---SKLRQNVK--------SKKV 301 (510) Q Consensus 237 vv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~---~~-~~---~~~~~~~~--------~~~~ 301 (510) .+.-.|.|-++.+...++....+..-..+.+...+.|-.+++-.. .. .. ..+...+. .+++ T Consensus 172 ----~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~n~~~~ 247 (421) T protein:vir:10 172 ----LDGYIGSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPKEAPAIKSQEKIDQLLAKWTDRYSGINNMFSV 247 (421) T ss_pred ----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCccCccCCHHHHHHHHHHHHHHhcCccccCcc Confidence 011235566665555555444444344445555666766654211 11 11 11111111 1345 Q ss_pred eeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 302 VGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMN 381 (510) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~ 381 (510) +.++++.+.+.++.......+.+..+...+.|+..-++|+.-.+..+..+...++. .....+...|.-++ T Consensus 248 ~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~----------~~~~f~~~tl~P~~ 317 (421) T protein:vir:10 248 ALLQEGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKATNNNIEH----------QGLQFVMYTLLAWL 317 (421) T ss_pred eecCCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCccccHHH----------HHHHHHHHHHHHHH Confidence 66666665555554444455566777788889888888875433222221111111 11223333444444 Q ss_pred HHHHHHHhhccCCccccce--eeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCC--CcHHHHHHHHHHHHHH Q lcl|NC_013644. 382 KLVIDDINRRYTKAFDPTE--VSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRL--DDDNVLRLICEQFDLD 457 (510) Q Consensus 382 ~~i~~~~~~~~~~~~~~~~--v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v--~d~e~~~~~~e~~e~~ 457 (510) +.|...+..+--....... +++.+..-+..|..+.++.+.+++.+|+++.-.+++.++.- .+-+.. T Consensus 318 ~~ie~~ln~kL~~~~~~~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~---------- 387 (421) T protein:vir:10 318 KRHEGALQRDLLLPSERRDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLPPIAGGDKY---------- 387 (421) T ss_pred HHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccee---------- Confidence 4444444332111111223 44444555678999999999999999999998888876532 111100 Q ss_pred HHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCccccc Q lcl|NC_013644. 458 WEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGST 502 (510) Q Consensus 458 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (510) ....+..+... ... .+.+. .... ..+.+...+.| T Consensus 388 ----~~~~n~~~~~~-~~~-~~~~~-~~~~----~~e~d~~~~~~ 421 (421) T protein:vir:10 388 ----LTPLNMVDSAQ-IIP-GDKKP-TAQQ----MAEIDTILSRT 421 (421) T ss_pred ----eeccccccccc-ccc-CCCCc-cccc----CcccccccccC Confidence 00111100000 000 00000 0000 00111111222 No 176 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=97.99 E-value=8e-06 Score=48.53 Aligned_cols=388 Identities=9% Similarity=0.028 Sum_probs=155.4 Q ss_pred ccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCC--cchhcccceeccccccccccccccceeccchhHHHHHH Q lcl|NC_013644. 4 LLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHEN--DIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQ 81 (510) Q Consensus 4 ~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~--~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~ 81 (510) |.++. +...+++.+ +.++- ++. ...... .+. +...... ..+.-+..+-...-|+. T Consensus 1 ~~~~~---~~~~~k~~~--------------~~~~~-~~~~~~~~~~~--~~~--~~~~~~v-~~~~a~~~~~V~~ci~~ 57 (409) T protein:vir:96 1 MAKEN---IVTRIKKKL--------------IDNWI-DQSASKLYDFS--PWK--NKSFWGV-INNTLETNETIFSAITK 57 (409) T ss_pred Ccccc---chhhhhhHH--------------hhhhh-ccccccccccc--ccc--Ccccccc-chhhHhhhHHHHHHHHH Confidence 11100 011111110 01111 110 000000 000 0000000 00000111222333444 Q ss_pred HHhhhhcCCceeccCcHHHHHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEE Q lcl|NC_013644. 82 KTQYLLSNPVEYETENEELKEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGV 154 (510) Q Consensus 82 ~~~~l~g~p~~~~~~d~~~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~ 154 (510) .++-+..-|+.+--..+.....+..++. |. -......++...+.+|.||+++..+..|++ .+.+++|..+-++ T Consensus 58 ia~~ia~lp~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~ 137 (409) T protein:vir:96 58 LSNSMASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEML 137 (409) T ss_pred HHHhhhhCceEEeecccccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEE Confidence 4444444566542222222222233331 22 234455677888999999999988888875 5777888888777 Q ss_pred EcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCc Q lcl|NC_013644. 155 YNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQ 234 (510) Q Consensus 155 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 234 (510) .++.... + +|.+. ..++. . ..+.+..+.+++.-. T Consensus 138 ~~~~~~~--~--~y~~~-~~~g~-----~-~~~~~~evih~r~~~----------------------------------- 171 (409) T protein:vir:96 138 IENQSRE--L--YYSIH-AATGN-----K-LIVHNMDMLHFKHIV----------------------------------- 171 (409) T ss_pred EeCCCcE--E--EEEEE-cCCce-----E-EEEccccEEEeCCCC----------------------------------- Confidence 6543211 0 11111 11110 0 012223333322100 Q ss_pred ccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccc--eeEEecCCCCch--hhhhHh----hh-cCeeeecc Q lcl|NC_013644. 235 IPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEA--IYVVSGFQGDDL--SKLRQN----VK-SKKVVGTG 305 (510) Q Consensus 235 iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~--~lv~~g~~~~~~--~~~~~~----~~-~~~~~~~~ 305 (510) | .+.-.|.|.+..+...++..+.+ ... .+..++.+ .++..+....+. ..+... .. .++++.++ T Consensus 172 -~----~~~~~G~s~l~~~~~~i~~~~~~-~~~--~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~n~g~~~vl~ 243 (409) T protein:vir:96 172 -A----SNMVQGISPIDVLKNTTDFDNAV-RTF--NLTEMQKPDSFMLKYGSNVSTEKRQQVLEDFKQYYEENGGILFQE 243 (409) T ss_pred -C----CCccccccHHHHHHHHHHHHHHH-HHH--HHHhcCCCceeEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecC Confidence 0 01113566665555544433322 111 22223332 222233222221 111111 11 23456666 Q ss_pred CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 306 SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVI 385 (510) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~ 385 (510) ++.+++.+..+.....+.+..+...+.|+..-++|+.-.+..++.+...++.. ....+...|.-+++.|. T Consensus 244 ~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~----------~~~f~~~~l~P~~~~ie 313 (409) T protein:vir:96 244 PGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEEL----------NRFYLQHTLLPIVKQYE 313 (409) T ss_pred CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH----------HHHHHHHHHHHHHHHHH Confidence 66666666544444455666777788898888888764433222222112211 12233334444444444 Q ss_pred HHHhhccCCcccc-ceeeEEe--CCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 386 DDINRRYTKAFDP-TEVSFTF--TREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVK 462 (510) Q Consensus 386 ~~~~~~~~~~~~~-~~v~i~f--~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~ 462 (510) ..+..+--...+. ....|.| ..-+-.|..+.++.+.++..+|+++.-.+++.++.-.-+.-.+ .. T Consensus 314 ~~l~~~Ll~~~~~~~g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi~ggD~------------~~ 381 (409) T protein:vir:96 314 EEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDK------------PL 381 (409) T ss_pred HHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcce------------ee Confidence 4443321111111 1233455 4555678999999999999999999988888764311100000 00 Q ss_pred HHHHhhhccCCCCCCCCCcccCCCCCCcccccc Q lcl|NC_013644. 463 EALEEAEYTKGLSDNTDEEETAVNPDDPTQQMA 495 (510) Q Consensus 463 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (510) ...+..+- +.. .+.+....+|++...++ T Consensus 382 ~~~n~~~~----~~~-~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:96 382 ISGDLYPI----DTP-LELRKSLKGGDKNVNES 409 (409) T ss_pred eccccccc----ccc-hhhcccccCCCCCcCCC Confidence 01111110 000 00000011111111111 No 177 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=97.95 E-value=9.7e-06 Score=48.07 Aligned_cols=415 Identities=11% Similarity=0.007 Sum_probs=188.5 Q ss_pred CCCccCCChhh------------hHHHHHHHHHhhhhhh-hHHHH-HHHHHHhccCCcchhcccceeccccccccccccc Q lcl|NC_013644. 1 MEALLSEDVKI------------IANALKAAIDKDRKSS-SKREA-ETGIRYYNHENDIMNNRIFYVDDEGILREDKYAS 66 (510) Q Consensus 1 ~~~~~~~~~~~------------~~~~i~~~i~~~~~~~-~~~~~-~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~ 66 (510) |-.|+-..-.. ....+.+.+..|..+- .+.++ ..++.--.|. +..... ..++-. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd--~~~~~~--------L~edm~-- 68 (526) T protein:vir:79 1 MAQIVDVYGNPIRPQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGN--LQAQAE--------LFMDME-- 68 (526) T ss_pred CCeeeCCCCCccCccccchhhhhhhhhhhhhcccCCCCCcCHHHHHHHHHHhhCCC--HHHHHH--------HHHHHH-- Confidence 43332222111 0111122222221110 11111 1122222221 100000 000000 Q ss_pred cceeccchhHHHHHHHHhhhhcCCceeccC------cHHHHHHHHHHhcc--CHHHHHHHHHHHHHhcCe-EEEEEEECC Q lcl|NC_013644. 67 NVRIPHGFFPEIVDQKTQYLLSNPVEYETE------NEELKEYLAEYYNS--EFQVVLQELVEGSSQKGF-EYVYARTNA 137 (510) Q Consensus 67 ~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~------d~~~~~~l~~~~~n--~~~~~~~e~~~~~~~~G~-~~~~v~~d~ 137 (510) -......-.+.+...-+.|.+..+... +....++++.++.+ +|.+.+..+ .++..+|. +.+++|.-. T Consensus 69 ---e~D~~i~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~-ldA~~~G~s~~Ei~w~~~ 144 (526) T protein:vir:79 69 ---ERDAHLFAEMSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDA-LDGIGHGYSCIELEWALQ 144 (526) T ss_pred ---hhChHHHHHHHHHHHHHhCCCceEecCCCCChHHHHHHHHHHHHHhcccCHHHHHHHH-HhhhhhcceeEEEEEeec Confidence 013445566666677788888887532 23456678888853 566666654 44778886 456666544 Q ss_pred CCceE---EEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccc Q lcl|NC_013644. 138 EDRLC---FQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPR 214 (510) Q Consensus 138 ~g~~~---i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~ 214 (510) +|... +.+.+|+.+. ||..+... ++. .++.... ..+ T Consensus 145 ~g~~~~~~l~~r~~~~F~--~~~~~~~~--l~~-----~~~~~~g----~~l---------------------------- 183 (526) T protein:vir:79 145 GREWMPLAFHHRPQSWFQ--LNPEDQNE--LRL-----RDNSPAG----EAL---------------------------- 183 (526) T ss_pred CCceeEEEeeeecccceE--eccCCCcE--EEe-----cCCCCCc----eee---------------------------- Confidence 55433 4444444332 33222110 000 0000000 000 Q ss_pred cccccccccccccccccCCcccEEEec--CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhh- Q lcl|NC_013644. 215 PHVLAVDSENESLLQRSYGQIPFYRLS--NNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSK- 291 (510) Q Consensus 215 ~~~~~~~~~~~~~~~~~~g~iPvv~~~--nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~- 291 (510) .+++.|-.++-. .++.|.|.+..+-...--=+..+.+++..++.++.|+++.+=-.+....+ T Consensus 184 ---------------~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~yG~P~~igky~~~a~~~ek 248 (526) T protein:vir:79 184 ---------------QPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPGTADEEK 248 (526) T ss_pred ---------------cCCceEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCCCCHHHH Confidence 112222222111 24567788876654444445577889999999999999986322222222 Q ss_pred --h---hHhhhcCeeeeccCCCceeEEeec-CCHHHHHHHHHHHHHHHHHHh--CCcccc--cccc-CcccHHHHHHHHH Q lcl|NC_013644. 292 --L---RQNVKSKKVVGTGSDGGLDVKTVT-IPTEGRKTKMEIDKENIYKFG--MAFDST--QVGD-GNITNIVIKARYT 360 (510) Q Consensus 292 --~---~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~l~~~i~~~s--~~p~~~--~~~~-g~~Sg~Ai~~~~~ 360 (510) . ...+....+..++.+..+++++.. .....++..++.+.+.|...- +|-... .+++ +.+.|..-.- T Consensus 249 ~~L~~av~~i~~da~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh~~--- 325 (526) T protein:vir:79 249 ATLLRAVTGLGHAAAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVHNE--- 325 (526) T ss_pred HHHHHHHHHHhcCcEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhhHHHHHH--- Confidence 1 223445567778999999999854 556778999999999997753 332221 1222 2222322211 Q ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCC-CchHHHHHhC Q lcl|NC_013644. 361 LLNMKANKTEARLRALLE-WMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRK-IILESILQVA 438 (510) Q Consensus 361 ~l~~k~~~k~~~~~~~l~-~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~-iS~et~~~~~ 438 (510) -....+..-.+.+...|. ++++.++.+ +..+..+. ..-+.++|...-+.|.+..++.+.++...|+ +|.+.+.+.+ T Consensus 326 v~~di~~aDa~~i~~tln~~Li~~l~~~-N~~~~~~~-~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~ 403 (526) T protein:vir:79 326 VRHDILASDARQLAATLSRDLLWPLLVL-NRPGSPDV-RRAPRLVFDLREQADITSMAQSIPALVNVGLEIPSAWVYDKL 403 (526) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCCcCCc-cccceEEeCCCCcccHHHHHHHHHHHHhCCCcCCHHHHHHHh Confidence 122233334455555664 466655543 22221111 2245788999999999999999999999997 7888888887 Q ss_pred CCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCccccc---ccccCCCC Q lcl|NC_013644. 439 PRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGST---ESQLPENG 510 (510) Q Consensus 439 ~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~ 510 (510) +.-...+-+... .....+............ .......+ ....+..- ....+... T Consensus 404 gip~~~~~e~~l---------------~~~~~~~~~~~~~~~~~~--~~~~~~~~-~~~~~~~~d~~l~~~~~~~ 460 (526) T protein:vir:79 404 GIPQPAKNEPVL---------------RPAAQPAILSRQHGQRVA--ALATIVGP-RYGDQQALDKALADLPAKD 460 (526) T ss_pred CCCCCCCchhhc---------------cccCCccccccccccccc--cccccccc-cCchhhHHHHHHHHHHHHH Confidence 652221100000 000000000000000000 00000000 00000000 00000101 No 178 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=97.93 E-value=1e-05 Score=47.93 Aligned_cols=408 Identities=10% Similarity=0.009 Sum_probs=169.1 Q ss_pred HHHhccCCcchhc-----ccceeccc---ccc-ccccccccc----eeccchhHHHHHHHHhhhhcCCcee---ccCc-- Q lcl|NC_013644. 36 IRYYNHENDIMNN-----RIFYVDDE---GIL-REDKYASNV----RIPHGFFPEIVDQKTQYLLSNPVEY---ETEN-- 97 (510) Q Consensus 36 ~~YY~g~~~i~~~-----~~~~~~~~---~~~-~~~~~~~~~----ki~~n~~~~Iv~~~~~~l~g~p~~~---~~~d-- 97 (510) ...-.|+. +... ........ +.. ......... -..++.....|+..++-+-+-|+.+ +.+. T Consensus 1 ~~~~~~~~-~~~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~V~acV~~IA~~iA~lpl~l~~~~~~~~~ 79 (518) T protein:vir:10 1 MLLANGQT-LSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) T ss_pred CcccCcee-ecCchhhhhhhhhhcccccccccceecccccchhhHHHhhhHHHHHHHHHHHHhhccCceEEEEEcCCCce Confidence 11122221 1110 00000000 000 000000000 0011233445555555555556654 1111 Q ss_pred HHHHHHHHHHhc--cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCceeEEEEEEEE Q lcl|NC_013644. 98 EELKEYLAEYYN--SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNELQRICRHYITE 171 (510) Q Consensus 98 ~~~~~~l~~~~~--n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~ 171 (510) +.....+..++. |. .......+....+.+|.+|+++..+.+|++ .+.+++|..+.+..+..... . .|.+. T Consensus 80 ~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~v~v~~~~~~~~---~-~y~~~ 155 (518) T protein:vir:10 80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGR---Y-EYYFQ 155 (518) T ss_pred eccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCCceEEEEcCCCCE---E-EEEEE Confidence 112223334442 32 234555667788899999999999999986 58889999998877643211 1 11111 Q ss_pred EeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCC-CCCCCcH Q lcl|NC_013644. 172 IEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNN-KQETTDL 250 (510) Q Consensus 172 ~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn-~~g~sd~ 250 (510) ...... .....+.+..+.+++.-. .+. ..|.|-+ T Consensus 156 ~~~~~~----~~~~~~~~~eViHir~~s-----------------------------------------~dg~~~G~spi 190 (518) T protein:vir:10 156 AGAGVG----TQLVSFADDEVVPIRFFN-----------------------------------------PDGLERGLSLM 190 (518) T ss_pred ecCCcc----ceEEEecCCcEEEecCCC-----------------------------------------CCcccccccHH Confidence 111110 011122333333332100 011 1355666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCch---hhhhHhhh--------cCeeeeccCCCceeEEeecCCH Q lcl|NC_013644. 251 KPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDL---SKLRQNVK--------SKKVVGTGSDGGLDVKTVTIPT 319 (510) Q Consensus 251 ~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~---~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~ 319 (510) ..+...|.....+..-..+.+...+.|-.+++....-+. ..++..+. .++++.++++.+++.++..... T Consensus 191 ~~a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~k~~~~~~~~G~~nag~v~vL~~G~~~~~l~~s~~D 270 (518) T protein:vir:10 191 ESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVE 270 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhcCccccCcceEcCCCceEEEccCChhH Confidence 555555554444444445555556667566544322111 11222211 1235666666666655544444 Q ss_pred HHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccc-c Q lcl|NC_013644. 320 EGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFD-P 398 (510) Q Consensus 320 ~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~-~ 398 (510) ..+.+..+...+.|...-++|+.-.+...+.+...++... ..++..+|.-+++.|...+..+-..... . T Consensus 271 ~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn~eq~~----------~~f~~~tL~P~l~~ie~~ln~~L~~~~~~~ 340 (518) T protein:vir:10 271 MQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQM----------RAFYRDTMAIPIARIQSAMDKYVGQYWVRK 340 (518) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHH----------HHHHHHHHHHHHHHHHHHHHHhhcccccCC Confidence 4566666777788888888887544322222211111111 1222333333333333333222111111 1 Q ss_pred ceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCC Q lcl|NC_013644. 399 TEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSD 476 (510) Q Consensus 399 ~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~ 476 (510) ..+++.+..-+..|..+.++.+.+++.+|+++.-.++++++. ++++...+..... . .......+.....+. T Consensus 341 ~~~~fd~~~llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~Gl~pie~~~gD~~~~~~---n----~~pl~~~~~~~~~g~ 413 (518) T protein:vir:10 341 NRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANS---A----LQPLGATPDGAVEGE 413 (518) T ss_pred ceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeecc---c----ceecccccccccCCC Confidence 234455556778899999999999999999999888887653 3221110000000 0 000000000000000 Q ss_pred -CCCCcccCCCCCCcccccccCccc---ccccccCCCC Q lcl|NC_013644. 477 -NTDEEETAVNPDDPTQQMAEGATG---STESQLPENG 510 (510) Q Consensus 477 -~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~ 510 (510) .....++...+....++...+..+ .+...-++.+ T Consensus 414 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (518) T protein:vir:10 414 EAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDS 451 (518) T ss_pred CCCCCCCCCccccccccccccccCCCCCcccccccccc Confidence 000001111111011111111111 0111111111 No 179 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=97.93 E-value=1e-05 Score=47.92 Aligned_cols=392 Identities=9% Similarity=0.040 Sum_probs=158.6 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccC-CcchhcccceeccccccccccccccceeccchhHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHE-NDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIV 79 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~-~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv 79 (510) |.-+-.+ ...... ++.... ++.... ....... ......+... -++.-+..+-....| T Consensus 1 m~~~~~~----------~~~~~~----~~~~~~---~~~~~~~~~~~~~~-~~~~~~~~~v----~~~~a~~~~~v~~~i 58 (412) T protein:vir:26 1 MNVIAKE----------NIVTRI----KKKLID---NWIDQSTSKLYDFS-PWKNRSFWGV----INNTLETNETIFSAI 58 (412) T ss_pred Cccchhh----------hhhhhh----hhhHhh---hhhccccccccccc-ccCCcccccc----chhhhhccHHHHHHH Confidence 2222110 010000 000000 111000 0000000 0000000000 000111223333345 Q ss_pred HHHHhhhhcCCceeccCcHHHHHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceE Q lcl|NC_013644. 80 DQKTQYLLSNPVEYETENEELKEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVF 152 (510) Q Consensus 80 ~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~ 152 (510) +..++-+.+-|+++--..+.....+..++. |. .......++...+.+|.||+++..+..|++ .+.+++|..+- T Consensus 59 ~~ia~~iA~lp~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~ 138 (412) T protein:vir:26 59 TKLSNSMASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVE 138 (412) T ss_pred HHHHHhHhhCceeEeeccccccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeE Confidence 555555555677653222222222333331 22 234456678889999999999999988875 67888999988 Q ss_pred EEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccC Q lcl|NC_013644. 153 GVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSY 232 (510) Q Consensus 153 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (510) +..++.... + +|.+.. .++. ...+.+..+.+++.-. . T Consensus 139 v~~~~~~~~--~--~y~~~~-~~g~------~~~~~~~evih~~~~~-------------------------------~- 175 (412) T protein:vir:26 139 MLIENQSRE--L--YYSIHA-ATGN------KLIVHNMDMLHFKHIV-------------------------------A- 175 (412) T ss_pred EEEeCCCcE--E--EEEEEc-CCce------EEEEccccEEEeCCCC-------------------------------C- Confidence 877643211 1 111111 1110 0123333333332100 0 Q ss_pred CcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcc-ceeEE-ecCCCCch--hhhhHh----hh-cCeeee Q lcl|NC_013644. 233 GQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAE-AIYVV-SGFQGDDL--SKLRQN----VK-SKKVVG 303 (510) Q Consensus 233 g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~-~~lv~-~g~~~~~~--~~~~~~----~~-~~~~~~ 303 (510) .+.-.|.|.++-+...++..+.+ ..+ .+..+.. +-.++ .+...++. ...... .. .++++. T Consensus 176 --------~~~~~G~s~i~~~~~~i~~~~a~-~~~--~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~v 244 (412) T protein:vir:26 176 --------SNMVQGISPIDVLKNTTDFDNAV-RTF--NLTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILF 244 (412) T ss_pred --------CCCcccccHHHHHHHHHHHHHHH-HHH--HHHhcCCCCceEEecCCCCCHHHHHHHHHHHHHHhhcCCCeee Confidence 01113556555444444433222 111 2222222 22333 22222221 111111 11 234555 Q ss_pred ccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 304 TGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKL 383 (510) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~ 383 (510) ++++.+++.++.......+.+..+..++.|+..-++|+.-.+..++.+...++... ...+...|.-.+.. T Consensus 245 l~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~~~----------~~f~~~~l~P~~~~ 314 (412) T protein:vir:26 245 QEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELN----------RFYLQHTLLPIVKQ 314 (412) T ss_pred cCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH----------HHHHHHHHHHHHHH Confidence 66655555554443344556666777888888888887544332222211111111 12233334444444 Q ss_pred HHHHHhhccCCccc---cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 384 VIDDINRRYTKAFD---PTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWED 460 (510) Q Consensus 384 i~~~~~~~~~~~~~---~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~ 460 (510) |...+..+--...+ ...+++.+..-+..|..+.++.+.++..+|+++.-.+++.++.-.-+.-.+ T Consensus 315 ie~~ln~kLl~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~------------ 382 (412) T protein:vir:26 315 YEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDK------------ 382 (412) T ss_pred HHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe------------ Confidence 44444332111111 112333444556789999999999999999999988888765321110000 Q ss_pred HHHHHHhhhccCCCCCCCCCcccCCCCCCcccccc Q lcl|NC_013644. 461 VKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMA 495 (510) Q Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (510) .....+..+ .+. ..+..+...+|++...++ T Consensus 383 ~~~~~n~~~----~~~-~~~~~~~~~gG~~n~~e~ 412 (412) T protein:vir:26 383 PLISGDLYP----IDT-PLELRKSLKGGDKNVNES 412 (412) T ss_pred eeecccccc----ccc-chhhcccccCCCCCcCCC Confidence 000000000 000 000000001111111111 No 180 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=97.92 E-value=1.1e-05 Score=47.84 Aligned_cols=383 Identities=8% Similarity=-0.066 Sum_probs=170.6 Q ss_pred HHHHHHHhccCCcchhcccceecccccccccccc------ccceeccchhHHHHHHHHhhhhcCCcee-c-cCc---HH- Q lcl|NC_013644. 32 AETGIRYYNHENDIMNNRIFYVDDEGILREDKYA------SNVRIPHGFFPEIVDQKTQYLLSNPVEY-E-TEN---EE- 99 (510) Q Consensus 32 ~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~------~~~ki~~n~~~~Iv~~~~~~l~g~p~~~-~-~~d---~~- 99 (510) |. +.++++++... .+.. ...-.+...-.... +..-+...-....|+..++-+.+-|+.+ . .++ +. T Consensus 1 m~-~~~~~~~~~~~-~~~~-~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~g~~~~~ 77 (419) T protein:vir:57 1 MF-IPQFWKGRPSE-NRVN-WQVVPGGMRSSSSQAGVIITPETALALSAVRACVTLLAESVAQLPCVLYRRTENGGREIA 77 (419) T ss_pred Cc-chhhhccCCcc-cccc-ccccccccccccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcCCCceecc Confidence 11 11222222100 0000 00000000000000 0011112233455555666566667664 1 111 11 Q ss_pred HHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCceeEEEEEEEEE Q lcl|NC_013644. 100 LKEYLAEYYN---S---EFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNELQRICRHYITEI 172 (510) Q Consensus 100 ~~~~l~~~~~---n---~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~ 172 (510) ...-+..++. | ........+......+|.+|.++..+.+|++ .+.+++|..+.+..+..+.+ +|.+ T Consensus 78 ~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~pl~~~~v~v~~~~~g~~-----~y~~-- 150 (419) T protein:vir:57 78 FDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITELIPINPHKVIVLKGPDGMP-----YYDI-- 150 (419) T ss_pred ccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcceEEEECCCceE-----EEEE-- Confidence 1112333331 2 2345556677889999999999999988885 67888998887766543321 1111 Q ss_pred eeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCCcHHH Q lcl|NC_013644. 173 EKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTDLKP 252 (510) Q Consensus 173 ~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~ 252 (510) ...+ .++....+++++.- | .+...|.|.+.. T Consensus 151 ~~~~--------~~~~~~~vih~r~~-------------------------------------~----~d~~~G~s~i~~ 181 (419) T protein:vir:57 151 PSIG--------EILPMRMVHHIKSF-------------------------------------S----LDGYIGTSPIQT 181 (419) T ss_pred cCCc--------eEEchhhEEEecCc-------------------------------------C----CCCcccccHHHH Confidence 1110 11222333332210 0 011236666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccceeEEecCC---CCch----hhhhHhhh--------cCeeeeccCCCceeEEeecC Q lcl|NC_013644. 253 IKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQ---GDDL----SKLRQNVK--------SKKVVGTGSDGGLDVKTVTI 317 (510) Q Consensus 253 v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~---~~~~----~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~ 317 (510) +...++....+-.-..+.+...+.|-.+++-.. .... ..+..... .++++.++++.+++-++... T Consensus 182 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~ 261 (419) T protein:vir:57 182 NPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEAKAIASQAAVDAILAKWTERYGGVRNAFSVGMLQEGMTYKQLSQDN 261 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHccCCccEEEEecCcCCcccCHHHHHHHHHHHHHHhccccccccceecCCCceEEEcCCCh Confidence 666666554444444445555566665554311 1111 11222111 13456666666665555444 Q ss_pred CHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccc Q lcl|NC_013644. 318 PTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFD 397 (510) Q Consensus 318 ~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~ 397 (510) ....+.+..+..++.|+..-++|+.-.+..+..++..++. .....+...|.-.++.|...+..+--.... T Consensus 262 ~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~----------~~~~f~~~~l~P~~~~ie~~l~~~ll~~~~ 331 (419) T protein:vir:57 262 EKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNNIEH----------QGLQYVIYTMLAILKRHESAMMRDLLLPSE 331 (419) T ss_pred hhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccccHHH----------HHHHHHHHHHHHHHHHHHHHHHhhccCccc Confidence 4556677778888899998889875433222221111111 112233444444444444444332111111 Q ss_pred --cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCC Q lcl|NC_013644. 398 --PTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLS 475 (510) Q Consensus 398 --~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~ 475 (510) ...+++.+..-+..|..+.++.+.++..+|+++.-.++++++.-.-+.-. ......+.. . T Consensus 332 ~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD------------~~~~~~n~~------~ 393 (419) T protein:vir:57 332 RRDFYIEFNVSSLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLTPIPGGD------------KYLTPLNMV------D 393 (419) T ss_pred cCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC------------eeeeccccc------c Confidence 12344444566678999999999999999999998888876542111000 000001000 0 Q ss_pred CCCCCcccCCCCCCcccccccCcccc Q lcl|NC_013644. 476 DNTDEEETAVNPDDPTQQMAEGATGS 501 (510) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (510) .....+...+.++...+.++..++-+ T Consensus 394 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 419 (419) T protein:vir:57 394 SKALTGIGKATPQQLKDIEAILCTRN 419 (419) T ss_pred ccccccccCCCcccCcchhhhhhccC Confidence 11111111111222222222222222 No 181 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=97.92 E-value=1.1e-05 Score=47.79 Aligned_cols=386 Identities=7% Similarity=-0.042 Sum_probs=160.0 Q ss_pred HHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCCceec Q lcl|NC_013644. 15 ALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNPVEYE 94 (510) Q Consensus 15 ~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~ 94 (510) .+-+.....+ .....+.. ..|.. .++.-... .-+..+- ++.=+...-....|+..++-+.+-|+.+- T Consensus 1 ~~~~r~~~~~--~~~~~~~~-~~~~~---~~~g~~~s---~~~~~vt----~~~al~~~~v~~~v~~ia~~iA~lp~~~~ 67 (419) T protein:vir:14 1 MFFSRQLLSN--LGQTQMSA-GGWVS---ALLGSSRS---DSGQVVT----PASALALTVLQNCVTLLAESIAQLPIELY 67 (419) T ss_pred Cccccccccc--ccccccCc-chhhH---HhhcCCCc---cCCcccc----hHHhhccHHHHHHHHHHHHhhccCceEEE Confidence 0000000000 00000000 00000 00000000 0000000 00001122234445555555556676642 Q ss_pred c--CcH---HHHHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCce Q lcl|NC_013644. 95 T--ENE---ELKEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNELQ 162 (510) Q Consensus 95 ~--~d~---~~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~ 162 (510) . ++. ....-+..++. |. .......++.....+|.+|+++..+.+|++ .+.+++|..+-+..+..+.+ T Consensus 68 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~~~~v~v~~~~~~~~- 146 (419) T protein:vir:14 68 ERSGEDRKPATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLDNEAVTVMRGSDLKP- 146 (419) T ss_pred EecCCccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCceE- Confidence 1 111 11112333331 22 234455567888999999999999988886 48888998888776543321 Q ss_pred eEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecC Q lcl|NC_013644. 163 RICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSN 242 (510) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n 242 (510) +|.+.... .+.... |+++++ T Consensus 147 ----~y~~~~~~-----------~~~~~~---------------------------------------------i~h~~~ 166 (419) T protein:vir:14 147 ----VYRVRGSD-----------PMPQRL---------------------------------------------VHHVRW 166 (419) T ss_pred ----EEEEccCc-----------ccchhh---------------------------------------------eeEecC Confidence 11110000 000111 122221 Q ss_pred ----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCC---CCchhh----hhHhhh--------cCeeee Q lcl|NC_013644. 243 ----NKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQ---GDDLSK----LRQNVK--------SKKVVG 303 (510) Q Consensus 243 ----n~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~---~~~~~~----~~~~~~--------~~~~~~ 303 (510) .-.|.|.+..+...++....+..-..+.+...+.|-.+++-.. .....+ +...+. .++++. T Consensus 167 ~~~dg~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~v 246 (419) T protein:vir:14 167 MSINGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDRITDGWNAKFGGSGNAKKVAL 246 (419) T ss_pred cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCcee Confidence 2246666666666665554444444455566666766654321 111111 222111 134566 Q ss_pred ccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 304 TGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKL 383 (510) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~ 383 (510) ++++.+++.+........+.+..+...+.|+..-++|+.-.+.....+...++... ...+...|.-.++. T Consensus 247 l~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~~E~~~----------~~f~~~~L~P~~~~ 316 (419) T protein:vir:14 247 LQEGMTFRPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQS----------LQFVIYTLLPWVKR 316 (419) T ss_pred cCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHH----------HHHHHHHHHHHHHH Confidence 66665555554433333455666677788888888887533322112211122111 12333344444444 Q ss_pred HHHHHhhccCCc--cccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 384 VIDDINRRYTKA--FDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDV 461 (510) Q Consensus 384 i~~~~~~~~~~~--~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~ 461 (510) |...+..+--.. .....+++.+..-+..|..+.++.+.+++++|+++.-.++++++.-.-+.-. .. T Consensus 317 ie~~l~~kll~~~~~~~~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD------------~~ 384 (419) T protein:vir:14 317 HEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGD------------IY 384 (419) T ss_pred HHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC------------ee Confidence 433333221111 1112344444566677999999999999999999987777776431100000 00 Q ss_pred HHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccccC Q lcl|NC_013644. 462 KEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLP 507 (510) Q Consensus 462 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (510) ....+.. +....+..++... +...+.....++-++ T Consensus 385 ~~~~n~~----------~~~~~~~~~~~~~-~~~~~~~~e~~~~l~ 419 (419) T protein:vir:14 385 LSPMNMV----------DASKPQQLPVGKS-EPTKAAIDEIGRILS 419 (419) T ss_pred eeccccc----------cccccccccCCCC-CCccccccchhcccC Confidence 0000000 0000000000000 001111122222233 No 182 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=97.86 E-value=1.4e-05 Score=47.14 Aligned_cols=431 Identities=11% Similarity=0.069 Sum_probs=186.4 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhh--HHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSS--KREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~--~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) |++..+..-- ..+.+++..+..+..+. -.++..+.+|..-. ++. ..+ ..+...++..+-+..- T Consensus 1 ~~~~~~~~~~-~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~--~~~-------~~~-----~~~~~~~~~dstg~~a 65 (515) T protein:vir:70 1 MQDTILEYGG-QRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY--LMN-------NKG-----DNETSQNGWQGVGAQA 65 (515) T ss_pred CcchhhhhcC-CHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccc--ccC-------CCC-----CcccccccccchHHHH Confidence 6665443322 22345555554432221 11344444444321 111 011 1111123455666666 Q ss_pred HHHHHhhhhc--CCce-----eccCcH---------H----HHHH-------HHH-HhccCHHHHHHHHHHHHHhcCeEE Q lcl|NC_013644. 79 VDQKTQYLLS--NPVE-----YETENE---------E----LKEY-------LAE-YYNSEFQVVLQELVEGSSQKGFEY 130 (510) Q Consensus 79 v~~~~~~l~g--~p~~-----~~~~d~---------~----~~~~-------l~~-~~~n~~~~~~~e~~~~~~~~G~~~ 130 (510) +++.++.|++ -|+. +...++ . +.+. +.. +..+||.....++.++..++|.|. T Consensus 66 ~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ 145 (515) T protein:vir:70 66 TNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCL 145 (515) T ss_pred HHHHHHHHHHhhcCCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEE Confidence 7777766654 2322 222211 1 1111 112 224688888999999999999986 Q ss_pred EEEEECCCCceEEEEEcccceEEEEcCCCCceeEEEEEEEEEee--------------CCceeEEEEEEEEc-----CCc Q lcl|NC_013644. 131 VYARTNAEDRLCFQVADSLNVFGVYNEYNELQRICRHYITEIEK--------------DGETVDIHHAEVWT-----DQN 191 (510) Q Consensus 131 ~~v~~d~~g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~e~y~-----~~~ 191 (510) + |.|+++.++ .++-.+++..-|..+++..+++-+...... ......-..+++|+ ++. T Consensus 146 l--~~d~~~~~~--~~pl~~y~v~~d~~G~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~ 221 (515) T protein:vir:70 146 L--YKPSKGAMS--AVPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEG 221 (515) T ss_pred E--EEeCCCCeE--EEEcCeEEEeeCCCcCeeEEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEEEEecCCC Confidence 5 457776554 455566666667777777776654433210 00000111222222 111 Q ss_pred EEEEEEcCCceeeccccccccccccccccccc-ccccccccCCcccEEEec-----CCCCCCCcHHHHHHHHHHHHHHHH Q lcl|NC_013644. 192 VYFFVAEDNKDYELDEAEPINPRPHVLAVDSE-NESLLQRSYGQIPFYRLS-----NNKQETTDLKPIKALIDDYDLMNC 265 (510) Q Consensus 192 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~g~iPvv~~~-----nn~~g~sd~~~v~~liD~~n~~~S 265 (510) .+.+..+ .++. .......+|..+|++.++ ++.+|+|-.++..+-+..+|.+.- T Consensus 222 ~~~~~~e---------------------~d~~~~~~es~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~ 280 (515) T protein:vir:70 222 FWKINQS---------------------ADDIPVGKESRIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSE 280 (515) T ss_pred ceEEEEe---------------------cCceeeccccccccccCCceeeeeeecCCCCcccchHHHhhHHHHHHHHHHH Confidence 1111110 0110 001112335567776654 346799999999999999999888 Q ss_pred HHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCcccc Q lcl|NC_013644. 266 FLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDST 343 (510) Q Consensus 266 ~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~ 343 (510) ..........+|.+.+.--...+...+.. ...+.+..+..++++.+... .+.......++.++..|-..-..-... T Consensus 281 ~~l~~~~~a~~p~~lv~~~g~~~~~~l~~--~~~g~iv~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~ 358 (515) T protein:vir:70 281 AMARGAALMADIKYLIRPGSQTDVDHFVN--SGTGEVITGVAEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMT 358 (515) T ss_pred HHHHHHHHhcCCCeeeCcccccchhhccc--cCCceeecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhh Confidence 88888888888877653222222222211 12234455555667776543 456777777888877775432211111 Q ss_pred ccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHhhccCCccccceeeEEeCCCCC-CCHHHHH Q lcl|NC_013644. 344 QVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVID-----DINRRYTKAFDPTEVSFTFTREVM-VNETDIV 417 (510) Q Consensus 344 ~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~-----~~~~~~~~~~~~~~v~i~f~~~~p-~d~~e~~ 417 (510) .......|++.+.. +..+|+..++..+.++-.-++. .+...... .....+.+.+..++. -.....+ T Consensus 359 ~rd~~rvTAtEV~~-------r~~E~~~~LGpv~srL~~Ell~Pli~r~~~~~~p~-~P~~~v~~~~vs~l~~L~r~q~~ 430 (515) T protein:vir:70 359 RRDAERVTAVEIQR-------DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDS-FTSELVDPVIVTGIEALGRMAEL 430 (515) T ss_pred ccCCccccHHHHHH-------HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHhhCCC-CChhhcccceehhHHHHHHHHHH Confidence 11222345555543 5667777777777774222222 11111111 111123333322221 1111111 Q ss_pred HHHHHHHh-cCCC--chHHHHHhC-----------------CCC-CcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCC Q lcl|NC_013644. 418 NDEKTEAE-TRKI--ILESILQVA-----------------PRL-DDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSD 476 (510) Q Consensus 418 ~~~~~~~~-~g~i--S~et~~~~~-----------------~~v-~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~ 476 (510) +.+....+ .+.+ -.+.++..+ ..+ +++|.+++.++.++.+.. ..+.+...+...+. T Consensus 431 ~~i~~~~q~i~~~~~~~p~~~~~id~d~~~~~~a~~~g~p~~~~rs~eev~~~r~q~~~~~~~---~~~~~~~~~a~~~~ 507 (515) T protein:vir:70 431 DKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQE---AMLNEGVAKAVPGV 507 (515) T ss_pred HHHHHHHHHHHHHhccChhHHhhCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHH---HHHHHhhhhhcccc Confidence 11111111 0100 012222221 111 223333322222221111 11111110111111 Q ss_pred CCCCcccC Q lcl|NC_013644. 477 NTDEEETA 484 (510) Q Consensus 477 ~~~~~~~~ 484 (510) ..+.-+++ T Consensus 508 ~~~~~~~~ 515 (515) T protein:vir:70 508 IQQEMKEG 515 (515) T ss_pred hhhhhccC Confidence 11111111 No 183 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=97.85 E-value=1.5e-05 Score=47.03 Aligned_cols=411 Identities=10% Similarity=-0.004 Sum_probs=187.5 Q ss_pred CCCccCCChh------------hhHHHHHHHHHhhhhh-hhHHHHHHH-HHHhccCCcchhcccceeccccccccccccc Q lcl|NC_013644. 1 MEALLSEDVK------------IIANALKAAIDKDRKS-SSKREAETG-IRYYNHENDIMNNRIFYVDDEGILREDKYAS 66 (510) Q Consensus 1 ~~~~~~~~~~------------~~~~~i~~~i~~~~~~-~~~~~~~~~-~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~ 66 (510) |-.|+-..-. .+...+.+....|..+ -.+.++..+ +.--.|. +...... .+.-. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd--~~~~~~L--------~~~m~-- 68 (528) T protein:vir:10 1 MAAIVDIYGNPLRTQQLRKQQTAHLAGLAKEFANHPAKGLTPAKLAHILIEAEQGH--LQAQAEL--------FMDME-- 68 (528) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCCCCCCHHHHHHHHHhhhCCC--HHHHHHH--------HHHHH-- Confidence 4333211111 1111112222212111 112122111 1111111 1100000 00000 Q ss_pred cceeccchhHHHHHHHHhhhhcCCceeccC------cHHHHHHHHHHhcc--CHHHHHHHHHHHHHhcCeE-EEEEEECC Q lcl|NC_013644. 67 NVRIPHGFFPEIVDQKTQYLLSNPVEYETE------NEELKEYLAEYYNS--EFQVVLQELVEGSSQKGFE-YVYARTNA 137 (510) Q Consensus 67 ~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~------d~~~~~~l~~~~~n--~~~~~~~e~~~~~~~~G~~-~~~v~~d~ 137 (510) -......-.+.+...-++|.+..+.+. +....+++++++.+ +|.+.+.. ..++..+|.+ .+++|.-. T Consensus 69 ---e~D~~i~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~i~~-~lda~~~G~s~~Ei~w~~~ 144 (528) T protein:vir:10 69 ---ERDAHLFAEMSKRKRAVLGLDWTIEPPRNASAAEKADAEYLHELLLDLEGIEDLMLD-CMDGVGHGYSAIELDWSLQ 144 (528) T ss_pred ---hhChHHHHHHHHHHHHHhcCCceEecCCCCCHHHHHHHHHHHHHHhCCccHHHHHHH-HHhhhhhcceeEEEEEeec Confidence 013455666777777788888887542 23455677887753 46665554 4567778864 56667544 Q ss_pred CCceE---EEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccc Q lcl|NC_013644. 138 EDRLC---FQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPR 214 (510) Q Consensus 138 ~g~~~---i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~ 214 (510) +|... +.+++++.+. |+..+.+. ++. .++... -..+ T Consensus 145 ~g~~~~~~~~~r~~~~f~--~~~~~~~~--l~~-----~~~~~~----g~~l---------------------------- 183 (528) T protein:vir:10 145 GREWLPQAFDHRPQSWFQ--LNPDDQDE--LRL-----RDNSIA----GEVL---------------------------- 183 (528) T ss_pred CCceeEEEeeeeccccee--eccCCCcE--Eec-----cCCCCC----ceee---------------------------- Confidence 45433 3444443221 22221110 000 000000 0000 Q ss_pred cccccccccccccccccCCcccEEEe--cCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhh- Q lcl|NC_013644. 215 PHVLAVDSENESLLQRSYGQIPFYRL--SNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSK- 291 (510) Q Consensus 215 ~~~~~~~~~~~~~~~~~~g~iPvv~~--~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~- 291 (510) .+++.|=.++- ..++.|.|.+..+-...--=+..+.+++..++.++.|+++.+=..+.+..+ T Consensus 184 ---------------~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek 248 (528) T protein:vir:10 184 ---------------QPFGWIMHKPRSRSGYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLPIRLGKYPPGTPDEEK 248 (528) T ss_pred ---------------cCCCeEEEeecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCCCCHHHH Confidence 01122111111 124567788877666655556778899999999999999876332222222 Q ss_pred --h---hHhhhcCeeeeccCCCceeEEeec-CCHHHHHHHHHHHHHHHHHHhCCccc--c--ccccC-cccHHHHHHHHH Q lcl|NC_013644. 292 --L---RQNVKSKKVVGTGSDGGLDVKTVT-IPTEGRKTKMEIDKENIYKFGMAFDS--T--QVGDG-NITNIVIKARYT 360 (510) Q Consensus 292 --~---~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~--~--~~~~g-~~Sg~Ai~~~~~ 360 (510) . ...+.......++.+..+++++.. .+...++..++.+.+.|...--+-.. . .+++| ++-|..- .. T Consensus 249 ~~L~~al~~i~~~~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~Alg~vh---~~ 325 (528) T protein:vir:10 249 VTLLRAVTGLGHAAAGIIPESMSIDFQEASKGSAEPFMAMMRWCDDSMSKAILGGTLTSQTSESGGGAYALGQVH---NE 325 (528) T ss_pred HHHHHHHHHHhhCcEEEecCCceeEEeecCCCChhHHHHHHHHHHHHHHHHHhhhhhhccccccccchhhhHHHH---HH Confidence 1 223444556778899999999854 56677899999999998876433222 1 11122 2222221 11 Q ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhccCCccc-cceeeEEeCCCCCCCHHHHHHHHHHHHhcCC-CchHHHHHh Q lcl|NC_013644. 361 LLNMKANKTEARLRALLE-WMNKLVIDDINRRYTKAFD-PTEVSFTFTREVMVNETDIVNDEKTEAETRK-IILESILQV 437 (510) Q Consensus 361 ~l~~k~~~k~~~~~~~l~-~~~~~i~~~~~~~~~~~~~-~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~-iS~et~~~~ 437 (510) -....+..-.+.+...|. ++++.++.+ .+....+ ..-+.++|...-+.|.++.++.+.++...|+ +|.+.+.+. T Consensus 326 v~~di~~aDa~~i~~tln~~li~~l~~~---N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~ 402 (528) T protein:vir:10 326 VRHDLLAADARQLAATLSRDLLWPLLVL---NRSGNLDARRAPRLVFDLKDRADLAAMATSLPPLVKLGVQVPVNWVQEQ 402 (528) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh---CCCCCCCccccceEEecCCCcccHHHHHHHHHHHHhCCCCCCHHHHHHH Confidence 122233444455566664 455555543 2222222 2346789999999999999999999999998 788888888 Q ss_pred CCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCccccccccc--------CC- Q lcl|NC_013644. 438 APRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQL--------PE- 508 (510) Q Consensus 438 ~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~- 508 (510) ++.-.+.+-+. ... .+.... .......+.......++ ..+. ....++. +. T Consensus 403 ~gip~p~~~e~------------~~~----~~~~~~--~~~~~~~~~~~~~~~~~--~~~~-~~~~~~~~d~~~~~~~~~ 461 (528) T protein:vir:10 403 LGIPLPANGEA------------VLG----DQAGAG--IAQLSRRPGPRIAALAQ--VIGP-RYRDQEALDQVLASLPAQ 461 (528) T ss_pred hCCCCCCCCcc------------ccc----CCCccc--ccccCcccccccccccc--cccc-cccccchHHHHHHHHHHH Confidence 76522111000 000 000000 00000000000000000 0000 0000000 00 Q ss_pred ------------------CC Q lcl|NC_013644. 509 ------------------NG 510 (510) Q Consensus 509 ------------------~~ 510 (510) ++ T Consensus 462 ~~~~~~~~~l~~i~~~l~~~ 481 (528) T protein:vir:10 462 DMQNQADSLVAPLLDVISRG 481 (528) T ss_pred HHHHHHHHHHHHHHHHHHhc Confidence 00 No 184 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=97.82 E-value=1.7e-05 Score=46.74 Aligned_cols=408 Identities=11% Similarity=0.010 Sum_probs=166.7 Q ss_pred HHHhccCCcchhc-----ccceeccc---cc-cccccccccc----eeccchhHHHHHHHHhhhhcCCceec--cCc--- Q lcl|NC_013644. 36 IRYYNHENDIMNN-----RIFYVDDE---GI-LREDKYASNV----RIPHGFFPEIVDQKTQYLLSNPVEYE--TEN--- 97 (510) Q Consensus 36 ~~YY~g~~~i~~~-----~~~~~~~~---~~-~~~~~~~~~~----ki~~n~~~~Iv~~~~~~l~g~p~~~~--~~d--- 97 (510) ...-.|+. +..- ........ +. ..+....... -...+.....|+..++-+-+-|+.+- .++ T Consensus 1 ~~~~~~~~-~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~~~~~~~~~ 79 (518) T protein:vir:78 1 MLLANGQT-LSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) T ss_pred CcccCcee-eccchhhhhhhhhhhcccccceeceecccccchhhHHhhhhHHHHHHHHHHHHhhccCceEEEEEcCCccc Confidence 11111221 1000 00000000 00 0000000000 01122334455656655556676641 111 Q ss_pred HHHHHHHHHHhc--cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCceeEEEEEEEE Q lcl|NC_013644. 98 EELKEYLAEYYN--SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNELQRICRHYITE 171 (510) Q Consensus 98 ~~~~~~l~~~~~--n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~ 171 (510) +.....+..++. |. .......+....+.+|.+|+++-.+..|++ .+.+++|..+.+..+..... . .|++. T Consensus 80 ~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~Vtv~~~~~~~~---~-~y~~~ 155 (518) T protein:vir:78 80 EEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGR---Y-EYYFQ 155 (518) T ss_pred cccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEECCCceEEEEcCCCCE---E-EEEEE Confidence 111122333432 32 234455677888899999999999988886 58889999988877643211 1 11111 Q ss_pred EeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCC-CCCCCcH Q lcl|NC_013644. 172 IEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNN-KQETTDL 250 (510) Q Consensus 172 ~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn-~~g~sd~ 250 (510) ...... .....+....+.+++.-. + +. ..|.|-+ T Consensus 156 ~~~~~~----~~~~~~~~~eIiHir~~~-------------------------------~----------dg~~~G~Spi 190 (518) T protein:vir:78 156 AGAGVG----TQLVSFADDEVVPIRFFN-------------------------------P----------DGLERGLSLM 190 (518) T ss_pred ecCCcc----ceeEEecCCcEEEecCCC-------------------------------C----------CcccccccHH Confidence 111110 011112333333332100 0 01 1355656 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh---hhhHhh--------hcCeeeeccCCCceeEEeecCCH Q lcl|NC_013644. 251 KPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS---KLRQNV--------KSKKVVGTGSDGGLDVKTVTIPT 319 (510) Q Consensus 251 ~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~---~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~ 319 (510) ..+...|.....+..-..+.+...+.|-.+++....-+.. .+...+ ..++++.++++.+++.++.+... T Consensus 191 ~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~~ls~e~~~~~k~~~~~~~~G~~nag~~~vL~~G~~~~~l~~~~~d 270 (518) T protein:vir:78 191 ESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSPEAQQRLREQFDRAHAGSSNTGKTMVVEEGMEPIPLQLTAVE 270 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhcCcccCCceeEcCCCceEEeccCChhH Confidence 5555555544444444455556666676666543221111 122211 12345666666666555544444 Q ss_pred HHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccc-c Q lcl|NC_013644. 320 EGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFD-P 398 (510) Q Consensus 320 ~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~-~ 398 (510) ..+.+..+...+.|...-++|+.-.+..++.+...++.. ....+...|.-++..|...+..+-...+. . T Consensus 271 ~q~le~r~~~~~eIa~afgVPp~~lg~~~~st~sn~e~~----------~~~f~~~tL~P~~~~ie~eln~~L~~~~~~~ 340 (518) T protein:vir:78 271 MQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQ----------MRAFYRDTMAIPIARIQSAMDKYVGQYWVRK 340 (518) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHH----------HHHHHHHHHHHHHHHHHHHHHHhhcccccCc Confidence 455666677778888888888753332222221111111 11222333333333333333221111111 1 Q ss_pred ceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCC Q lcl|NC_013644. 399 TEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSD 476 (510) Q Consensus 399 ~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~ 476 (510) ..+++..+.-+..|..+.++.+.+++.+|+++.-.++++++. ++++...+...- .-.......+.....+. T Consensus 341 ~~~~fd~~~Llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~gl~pie~~~gD~~~v~-------~n~~pl~~~~~~~~~g~ 413 (518) T protein:vir:78 341 NRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYAN-------SALQPLGATPDGAVEGE 413 (518) T ss_pred ceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeec-------ccceecccccccccCCC Confidence 234444556678899999999999999999999888887653 332111000000 00000000000000000 Q ss_pred C-CCCcccCCCCCCcccccccCccc---ccccccCC-CC Q lcl|NC_013644. 477 N-TDEEETAVNPDDPTQQMAEGATG---STESQLPE-NG 510 (510) Q Consensus 477 ~-~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~-~~ 510 (510) . ....++...+....++...+..+ .+...-+. .| T Consensus 414 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (518) T protein:vir:78 414 EAPAPKRPASTPVASLDQSPPASVPGLSPTNSDRSTDSG 452 (518) T ss_pred CCCCCCCCCcccccccccCccccCCCCCccccccccccc Confidence 0 00000000000000000000000 00000111 11 No 185 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=97.81 E-value=1.8e-05 Score=46.66 Aligned_cols=383 Identities=12% Similarity=0.053 Sum_probs=160.7 Q ss_pred HhhhhhhhHHHHHHHHHHhccC-Cc-----------chhcccc---eecccccc--------ccc-c---cccc--ceec Q lcl|NC_013644. 21 DKDRKSSSKREAETGIRYYNHE-ND-----------IMNNRIF---YVDDEGIL--------RED-K---YASN--VRIP 71 (510) Q Consensus 21 ~~~~~~~~~~~~~~~~~YY~g~-~~-----------i~~~~~~---~~~~~~~~--------~~~-~---~~~~--~ki~ 71 (510) -++.+- .-|+.+- .. ++.+..+ .....+.. ... . ..+. .|. T Consensus 1 ~~~~~~---------~~~~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~- 70 (441) T protein:vir:79 1 MHWYNT---------DCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRH- 70 (441) T ss_pred CccccC---------ccccccccccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhhhcc- Confidence 011100 0111111 00 0000000 00000000 000 0 0000 011 Q ss_pred cchhHHHHHHHHhhhhcCCceeccCcH-HHHHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCce-EE Q lcl|NC_013644. 72 HGFFPEIVDQKTQYLLSNPVEYETENE-ELKEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRL-CF 143 (510) Q Consensus 72 ~n~~~~Iv~~~~~~l~g~p~~~~~~d~-~~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i 143 (510) +=.-.-|+..++-+.+-|+.+.-+.+ ....-+-.++. |. .......+....+.+|.||+++.++..|++ .+ T Consensus 71 -~~V~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L 149 (441) T protein:vir:79 71 -SDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNL 149 (441) T ss_pred -HHHHHHHHHHHHhhccCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEE Confidence 00111244444444455666432211 11122222331 22 234455677888999999999989988986 58 Q ss_pred EEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccccccccc Q lcl|NC_013644. 144 QVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSE 223 (510) Q Consensus 144 ~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (510) .+++|..+.++.|+.+.+. |.+...+.... .....|.+..+.+++.- T Consensus 150 ~~i~~~~v~v~~d~~g~~~-----~~~~~~~~~~~---~~~~~~~~~dvih~k~~------------------------- 196 (441) T protein:vir:79 150 TFRKTSEIELKSDARGRLY-----YFHQRIDSNGN---NIERNVKFEDMLDIKFY------------------------- 196 (441) T ss_pred EEEcCceeEEEECCCccEE-----EEEEEeccCCc---eeEEEEccccEEEeccC------------------------- Confidence 8999999999887655321 11111111100 01112233333333210 Q ss_pred ccccccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEe--cCCCCch--hhhhHhh--- Q lcl|NC_013644. 224 NESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVS--GFQGDDL--SKLRQNV--- 296 (510) Q Consensus 224 ~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~--g~~~~~~--~~~~~~~--- 296 (510) | .+.-.|.|.++.+...++.......-..+.++..+.|-.+++ |...++. ..++... T Consensus 197 ------------~----~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~ 260 (441) T protein:vir:79 197 ------------S----LDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKS 260 (441) T ss_pred ------------C----CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHH Confidence 0 001135566665555555444443444445556666666654 3221111 1111111 Q ss_pred -h----cCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccC-cccHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 297 -K----SKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDG-NITNIVIKARYTLLNMKANKTE 370 (510) Q Consensus 297 -~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g-~~Sg~Ai~~~~~~l~~k~~~k~ 370 (510) . .++++.++++.+.+.++.+.....+.+..+...+.|...-++|+.-.+... +.|...... T Consensus 261 ~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~------------- 327 (441) T protein:vir:79 261 FSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANL------------- 327 (441) T ss_pred hcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHH------------- Confidence 1 133566666666665554444455667777788888888888875433211 112111111 Q ss_pred HHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHH Q lcl|NC_013644. 371 ARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLR 448 (510) Q Consensus 371 ~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~ 448 (510) .|...|.-+++.|...+..+-........+++.+..-+-.|..+.++.+.++..+|+++.-.+.++++. +.+.+... T Consensus 328 -~~~~tl~P~~~~ie~eln~kl~~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~ 406 (441) T protein:vir:79 328 -DYLSTLKPYITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSI 406 (441) T ss_pred -HHHHHHHHHHHHHHHHHhhhccccccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcce Confidence 112233333333333333221111112234444455567789999999999999999999888887643 22211000 Q ss_pred HHHHHHHHHHHHHHHHHHhhhc------cCCCCCCCCCcccCCCCCCccc Q lcl|NC_013644. 449 LICEQFDLDWEDVKEALEEAEY------TKGLSDNTDEEETAVNPDDPTQ 492 (510) Q Consensus 449 ~~~e~~e~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~ 492 (510) .....+..+. +..-....+... .+|+..| T Consensus 407 ------------~~~~~n~~~~~~~~~~~~~~~~~~~~~~---kgGe~~e 441 (441) T protein:vir:79 407 ------------HRVDLNHVNIELVDEYQMNKSRATDKKL---KGGEENE 441 (441) T ss_pred ------------Eeeccccccccccccccccccccccccc---CCCCCCC Confidence 0000000000 000000000000 0011101 No 186 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=97.81 E-value=1.8e-05 Score=46.66 Aligned_cols=383 Identities=12% Similarity=0.053 Sum_probs=160.7 Q ss_pred HhhhhhhhHHHHHHHHHHhccC-Cc-----------chhcccc---eecccccc--------ccc-c---cccc--ceec Q lcl|NC_013644. 21 DKDRKSSSKREAETGIRYYNHE-ND-----------IMNNRIF---YVDDEGIL--------RED-K---YASN--VRIP 71 (510) Q Consensus 21 ~~~~~~~~~~~~~~~~~YY~g~-~~-----------i~~~~~~---~~~~~~~~--------~~~-~---~~~~--~ki~ 71 (510) -++.+- .-|+.+- .. ++.+..+ .....+.. ... . ..+. .|. T Consensus 1 ~~~~~~---------~~~~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~- 70 (441) T protein:vir:94 1 MHWYNT---------DCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRH- 70 (441) T ss_pred CccccC---------ccccccccccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhhhcc- Confidence 011100 0111111 00 0000000 00000000 000 0 0000 011 Q ss_pred cchhHHHHHHHHhhhhcCCceeccCcH-HHHHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCce-EE Q lcl|NC_013644. 72 HGFFPEIVDQKTQYLLSNPVEYETENE-ELKEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRL-CF 143 (510) Q Consensus 72 ~n~~~~Iv~~~~~~l~g~p~~~~~~d~-~~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i 143 (510) +=.-.-|+..++-+.+-|+.+.-+.+ ....-+-.++. |. .......+....+.+|.||+++.++..|++ .+ T Consensus 71 -~~V~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L 149 (441) T protein:vir:94 71 -SDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNL 149 (441) T ss_pred -HHHHHHHHHHHHhhccCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEE Confidence 00111244444444455666432211 11122222331 22 234455677888999999999989988986 58 Q ss_pred EEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccccccccc Q lcl|NC_013644. 144 QVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSE 223 (510) Q Consensus 144 ~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (510) .+++|..+.++.|+.+.+. |.+...+.... .....|.+..+.+++.- T Consensus 150 ~~i~~~~v~v~~d~~g~~~-----~~~~~~~~~~~---~~~~~~~~~dvih~k~~------------------------- 196 (441) T protein:vir:94 150 TFRKTSEIELKSDARGRLY-----YFHQRIDSNGN---NIERNVKFEDMLDIKFY------------------------- 196 (441) T ss_pred EEEcCceeEEEECCCccEE-----EEEEEeccCCc---eeEEEEccccEEEeccC------------------------- Confidence 8999999999887655321 11111111100 01112233333333210 Q ss_pred ccccccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEe--cCCCCch--hhhhHhh--- Q lcl|NC_013644. 224 NESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVS--GFQGDDL--SKLRQNV--- 296 (510) Q Consensus 224 ~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~--g~~~~~~--~~~~~~~--- 296 (510) | .+.-.|.|.++.+...++.......-..+.++..+.|-.+++ |...++. ..++... T Consensus 197 ------------~----~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~ 260 (441) T protein:vir:94 197 ------------S----LDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKS 260 (441) T ss_pred ------------C----CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHH Confidence 0 001135566665555555444443444445556666666654 3221111 1111111 Q ss_pred -h----cCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccC-cccHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 297 -K----SKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDG-NITNIVIKARYTLLNMKANKTE 370 (510) Q Consensus 297 -~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g-~~Sg~Ai~~~~~~l~~k~~~k~ 370 (510) . .++++.++++.+.+.++.+.....+.+..+...+.|...-++|+.-.+... +.|...... T Consensus 261 ~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~------------- 327 (441) T protein:vir:94 261 FSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANL------------- 327 (441) T ss_pred hcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHH------------- Confidence 1 133566666666665554444455667777788888888888875433211 112111111 Q ss_pred HHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHH Q lcl|NC_013644. 371 ARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLR 448 (510) Q Consensus 371 ~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~ 448 (510) .|...|.-+++.|...+..+-........+++.+..-+-.|..+.++.+.++..+|+++.-.+.++++. +.+.+... T Consensus 328 -~~~~tl~P~~~~ie~eln~kl~~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~ 406 (441) T protein:vir:94 328 -DYLSTLKPYITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSI 406 (441) T ss_pred -HHHHHHHHHHHHHHHHHhhhccccccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcce Confidence 112233333333333333221111112234444455567789999999999999999999888887643 22211000 Q ss_pred HHHHHHHHHHHHHHHHHHhhhc------cCCCCCCCCCcccCCCCCCccc Q lcl|NC_013644. 449 LICEQFDLDWEDVKEALEEAEY------TKGLSDNTDEEETAVNPDDPTQ 492 (510) Q Consensus 449 ~~~e~~e~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~ 492 (510) .....+..+. +..-....+... .+|+..| T Consensus 407 ------------~~~~~n~~~~~~~~~~~~~~~~~~~~~~---kgGe~~e 441 (441) T protein:vir:94 407 ------------HRVDLNHVNIELVDEYQMNKSRATDKKL---KGGEENE 441 (441) T ss_pred ------------Eeeccccccccccccccccccccccccc---CCCCCCC Confidence 0000000000 000000000000 0011101 No 187 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=97.79 E-value=1.9e-05 Score=46.51 Aligned_cols=356 Identities=10% Similarity=0.008 Sum_probs=152.9 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhh---HHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSS---KREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPE 77 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~---~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ 77 (510) |--+-.. ... +.+.+ ......+..+..|.. .+.... +..-+..+-... T Consensus 1 Mg~~~~~-----------~~~--k~~~~~~~~~~~~~~~~~~~~~~------------~~~~v~----~~~~l~~~~v~~ 51 (383) T protein:vir:10 1 MGLLTPK-----------NFS--KRNAKNMVYPSNPAFFTTTVGGM------------QLSYVS----ALSALQNTNVYS 51 (383) T ss_pred CCccccc-----------ccc--cccccccccccchhhhhhhccCc------------cccccc----hhHhhcchHHHH Confidence 2111000 000 00000 000000001111100 000000 000011122233 Q ss_pred HHHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEc Q lcl|NC_013644. 78 IVDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYN 156 (510) Q Consensus 78 Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d 156 (510) .|+..++-+.+-|+++. +.....+|+.=.. .........+..+.+.+|.||+++..+. ..+...+|..+-+..+ T Consensus 52 ~i~~ia~~ia~~~~~~~--~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~~---~~~~p~~~~~v~~~~~ 126 (383) T protein:vir:10 52 VINRIASDVSSAHFKTE--NTATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN---LEHIPNSDVQINYLPG 126 (383) T ss_pred HHHHHHHhhccCceeec--ccchhhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc---eeEeecCcceEEEEEc Confidence 34555555555566653 2222223321111 1234555667788888999998775432 3344444444433332 Q ss_pred CCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCccc Q lcl|NC_013644. 157 EYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIP 236 (510) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP 236 (510) ... .+|++....++. ...|.+.. T Consensus 127 ~~~------~~~~~~~~~~~~------~~~~~~~e--------------------------------------------- 149 (383) T protein:vir:10 127 NMG------IVYTVLESNDRP------KMVLRQDQ--------------------------------------------- 149 (383) T ss_pred CCc------eEEEEEEcCCce------EEEEcccc--------------------------------------------- Confidence 211 011111111110 01122233 Q ss_pred EEEecCC-------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCC-Cch---hhhhHhhh-------c Q lcl|NC_013644. 237 FYRLSNN-------KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQG-DDL---SKLRQNVK-------S 298 (510) Q Consensus 237 vv~~~nn-------~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~-~~~---~~~~~~~~-------~ 298 (510) |+||++. ..|.|.++.+...++....+..-..+.+.....|-.++.-... .+. ..+...++ . T Consensus 150 vih~r~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~~e~~~~~~~~~~~~~~~~n~ 229 (383) T protein:vir:10 150 MLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNS 229 (383) T ss_pred eEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCcccc Confidence 3333321 2367777777777776666666666666666666555543211 111 11222221 1 Q ss_pred CeeeeccCCCceeEEeecCCHHH-HHHHHHHHHHHHHHHhCCccccccc--cCcccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 299 KKVVGTGSDGGLDVKTVTIPTEG-RKTKMEIDKENIYKFGMAFDSTQVG--DGNITNIVIKARYTLLNMKANKTEARLRA 375 (510) Q Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~~i~~~s~~p~~~~~~--~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~ 375 (510) ++++.++++.+++.+..+..... +.+..+...+.|+..-++|+.-.+. .++.++..++. ....|.. T Consensus 230 ~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~eq-----------~~~~~~~ 298 (383) T protein:vir:10 230 GRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQ-----------IKATYLA 298 (383) T ss_pred CCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHHH-----------HHHHHHH Confidence 23555666665555554433333 3466677788999988898753322 22222222211 1112223 Q ss_pred HHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHH Q lcl|NC_013644. 376 LLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQ 453 (510) Q Consensus 376 ~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~ 453 (510) .|.-+++.|...+..+--. ..+++.+..-+..|..+.++.+.++.++|+++...++++++. +.+.+ T Consensus 299 ~l~P~~~~ie~~l~~~l~~----~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d-------- 366 (383) T protein:vir:10 299 NLNSYVNPIVDELRLKMNA----PDLELDIKDMLDVDDSILINQVSNLAKSGVLGAEQAQFILTRSGFLPDN-------- 366 (383) T ss_pred HHHHHHHHHHHHHHHhhCC----ceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCcccCCc-------- Confidence 3444444444443332111 246667778888999999999999999999999888877643 11100 Q ss_pred HHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCccccc Q lcl|NC_013644. 454 FDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQM 494 (510) Q Consensus 454 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (510) .+......++-.+.+++ T Consensus 367 ------------------------~~~~~~~~~~~~gGd~e 383 (383) T protein:vir:10 367 ------------------------LPEFKPLTNETKGGDDK 383 (383) T ss_pred ------------------------ccccCCCcccCCCCCCC Confidence 00000000000000000 No 188 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=97.65 E-value=3.3e-05 Score=45.15 Aligned_cols=405 Identities=11% Similarity=0.003 Sum_probs=188.1 Q ss_pred CCCccCCChh------------hhHHHHHHHHHhhhhh-hhHHHHH-HHHHHhccCCcchhcccceeccccccccccccc Q lcl|NC_013644. 1 MEALLSEDVK------------IIANALKAAIDKDRKS-SSKREAE-TGIRYYNHENDIMNNRIFYVDDEGILREDKYAS 66 (510) Q Consensus 1 ~~~~~~~~~~------------~~~~~i~~~i~~~~~~-~~~~~~~-~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~ 66 (510) |-.|+-..-. .....+.+....|..+ -.+.++. .++.--.|. +..... ..++-.. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iLr~a~~gd--~~~~~~--------L~e~m~e- 69 (526) T protein:vir:99 1 MAQIVDVYGNPIRTQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGN--LQAQAE--------LFMDMEE- 69 (526) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCcCCCCHHHHHHHHHhhhCCC--HHHHHH--------HHHHHHh- Confidence 3333221111 1111111222111111 0111111 111111111 100000 0000000 Q ss_pred cceeccchhHHHHHHHHhhhhcCCceeccC------cHHHHHHHHHHhcc--CHHHHHHHHHHHHHhcCe-EEEEEEECC Q lcl|NC_013644. 67 NVRIPHGFFPEIVDQKTQYLLSNPVEYETE------NEELKEYLAEYYNS--EFQVVLQELVEGSSQKGF-EYVYARTNA 137 (510) Q Consensus 67 ~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~------d~~~~~~l~~~~~n--~~~~~~~e~~~~~~~~G~-~~~~v~~d~ 137 (510) ......-.+.+...-+.|.+..+.+. +....+++++++.+ +|.+.+..+. ++..+|. +.+++|.-. T Consensus 70 ----~D~~i~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~ 144 (526) T protein:vir:99 70 ----RDAHLFAEMSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDAL-DGIGHGYSCIELEWALQ 144 (526) T ss_pred ----hChHHHHHHHHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcccCHHHHHHHHH-HhhhhcceeEEEEEeec Confidence 12344555666667777888777532 23455678888853 5777666655 6788886 456677654 Q ss_pred CCceE---EEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccc Q lcl|NC_013644. 138 EDRLC---FQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPR 214 (510) Q Consensus 138 ~g~~~---i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~ 214 (510) +|... +.+.+|+.+. |+...... ++. .++.... ..+ T Consensus 145 ~g~~~~~~l~~r~~~~f~--~~~~~~~~--l~~-----~~~~~~g----~~l---------------------------- 183 (526) T protein:vir:99 145 GREWMPLAFHHRPQSWFQ--LNPEDQNE--LRL-----RDNSPAG----EAL---------------------------- 183 (526) T ss_pred CCceeEEEeeeeccccee--eccCCCcE--EEe-----cCCCCCc----eee---------------------------- Confidence 55433 4444554332 22221110 000 0000000 000 Q ss_pred cccccccccccccccccCCcccEEEec--CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhh- Q lcl|NC_013644. 215 PHVLAVDSENESLLQRSYGQIPFYRLS--NNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSK- 291 (510) Q Consensus 215 ~~~~~~~~~~~~~~~~~~g~iPvv~~~--nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~- 291 (510) .+++.|-.++-. .++.|.|.+..+-...--=+..+.+++..++.++.|+++.+=-.+.+..+ T Consensus 184 ---------------~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek 248 (526) T protein:vir:99 184 ---------------QPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPGTADEEK 248 (526) T ss_pred ---------------cCCCeEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCCCCHHHH Confidence 112222222211 34567788877655444445577899999999999999986322222222 Q ss_pred --h---hHhhhcCeeeeccCCCceeEEeec-CCHHHHHHHHHHHHHHHHHHh--CCcccc--cccc-CcccHHHHHHHHH Q lcl|NC_013644. 292 --L---RQNVKSKKVVGTGSDGGLDVKTVT-IPTEGRKTKMEIDKENIYKFG--MAFDST--QVGD-GNITNIVIKARYT 360 (510) Q Consensus 292 --~---~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~l~~~i~~~s--~~p~~~--~~~~-g~~Sg~Ai~~~~~ 360 (510) . ...+....+..++.+..+++++.. .....++..++.+.+.|...- +|-... .+++ +.+-|..-. . T Consensus 249 ~~L~~av~~i~~d~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh~---~ 325 (526) T protein:vir:99 249 ATLLRAVTGLGHAAAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVHN---E 325 (526) T ss_pred HHHHHHHHHHhhCcEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhHHHHHH---H Confidence 1 223445567778899999999854 456778999999999987753 332222 1221 222222211 1 Q ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCC-CchHHHHHhC Q lcl|NC_013644. 361 LLNMKANKTEARLRALLE-WMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRK-IILESILQVA 438 (510) Q Consensus 361 ~l~~k~~~k~~~~~~~l~-~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~-iS~et~~~~~ 438 (510) -....+..-.+.+...|. ++++.++.+ +..+..+ -...+.++|...-+.|.++.++.+.++...|+ +|.+.+.+.+ T Consensus 326 v~~di~~aDa~~i~~tln~~Li~~l~~~-N~~~~~~-~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~ 403 (526) T protein:vir:99 326 VRHDLLASDARQLAATLSRDLLWPLLVL-NRPGSPD-VRRAPRLVFDLREQADITSMAQSIPALVNVGLEIPSAWVYDKL 403 (526) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCCcCC-ccccceEEeCCCCcccHHHHHHHHHHHHhCCCccCHHHHHHHh Confidence 122233344455666664 466665543 2222111 12246789999999999999999999999997 7888888887 Q ss_pred CCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 439 PRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 439 ~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) +.-...+.+. .....+.+.... ...+.. ............|... T Consensus 404 Gip~~~~~e~---------------~l~~~~~~~~~~-----~~~~~~--------~~~~~~~~~~~~~~~~ 447 (526) T protein:vir:99 404 GIPQPAKNEP---------------VLRSAAQPAILS-----RQHGQR--------VAALATIVGPRYGDQQ 447 (526) T ss_pred CCCCCCCccc---------------ccCCCCCCcccc-----cccccc--------cccccccccccCcchh Confidence 6522211000 000000000000 000000 0000000011111111 No 189 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=97.64 E-value=3.4e-05 Score=45.09 Aligned_cols=374 Identities=11% Similarity=0.048 Sum_probs=161.0 Q ss_pred HHHHHHHhhhhhhhHHHHHHHHHHhccCC--cchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCCce Q lcl|NC_013644. 15 ALKAAIDKDRKSSSKREAETGIRYYNHEN--DIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNPVE 92 (510) Q Consensus 15 ~i~~~i~~~~~~~~~~~~~~~~~YY~g~~--~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~ 92 (510) +...+-+.++.......-.....+..... .+.. ......+..+. +..-+..+-....|+..++-+.+-|++ T Consensus 1 m~m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~g~~v~----~~~al~~~~v~~~v~~ia~~ia~lp~~ 73 (392) T protein:vir:74 1 MILPILNFINQTNDPPEAGSVQSYFPDGNDAQIME---SLLGDNNEWVS----ARAALRNSDLFSIILQLSSDLAIVKIN 73 (392) T ss_pred CcchhhhhhhcccCcccccccccccccCchhhhhh---hccCCCCcccc----hhhhhcchHHHHHHHHHHHhhccCcee Confidence 01111011110000000000000000000 0000 00000000000 000011122334455555555566766 Q ss_pred eccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCceeEEEEEEE Q lcl|NC_013644. 93 YETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNELQRICRHYIT 170 (510) Q Consensus 93 ~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~ 170 (510) +.-.. ...++.+=.. .........+....+.+|.||+++-.+.+|++ .+.+++|..+-+..+..+.. .+|.+ T Consensus 74 ~~~~~--~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~----~~y~~ 147 (392) T protein:vir:74 74 AEKKK--NQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENG----MYYNI 147 (392) T ss_pred eccch--hhhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCce----EEEEE Confidence 53221 1222222111 12234555667889999999999988988876 68888999888777543321 11211 Q ss_pred EEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCCcH Q lcl|NC_013644. 171 EIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTDL 250 (510) Q Consensus 171 ~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~ 250 (510) ....... .....+.++.+.+++.-. ....-.|.|-+ T Consensus 148 ~~~~~~~----~~~~~~~~~evih~~~~~----------------------------------------~~~~~~G~s~i 183 (392) T protein:vir:74 148 TFDDPKI----EPILQAPQSDLIHMKLLS----------------------------------------IDGGKTGISPL 183 (392) T ss_pred EecCCcc----ceeEEEcCccEEEecCCC----------------------------------------CCCccccccHH Confidence 1111100 001122333333332100 00012366767 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCC-CCchhh----hhHhh----hcCeeeeccCCCceeEEeecCCHHH Q lcl|NC_013644. 251 KPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQ-GDDLSK----LRQNV----KSKKVVGTGSDGGLDVKTVTIPTEG 321 (510) Q Consensus 251 ~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~-~~~~~~----~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (510) ..+...|+....+..-..+.+...+.|-.+++-.. ....++ +.+.. ..++++.++++.+++.+..+..... T Consensus 184 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q 263 (392) T protein:vir:74 184 YSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQ 263 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHH Confidence 76666665555555445555666666666654211 111111 11111 1234566666666665555444556 Q ss_pred HHHHHHHHHHHHHHHhCCccccccccCc-cc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccc Q lcl|NC_013644. 322 RKTKMEIDKENIYKFGMAFDSTQVGDGN-IT-NIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPT 399 (510) Q Consensus 322 ~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~S-g~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~ 399 (510) +.+..+...+.|...=++|+.-.+..+. .| ..+.+ ..+...|.-.++.|...+..+-.. T Consensus 264 ~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~e~~~--------------~~~~~~l~p~~~~ie~~l~~~l~~----- 324 (392) T protein:vir:74 264 LLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQIS--------------GMYASALNRYLRPAISELEYKLSD----- 324 (392) T ss_pred HHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhccc----- Confidence 6777788888888888888754332221 12 12221 233344444444444443332111 Q ss_pred eeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhC---CCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCC Q lcl|NC_013644. 400 EVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVA---PRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSD 476 (510) Q Consensus 400 ~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~---~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~ 476 (510) .+++.+..-+-.|..+.++.+.++..+|+++...+.+++ ++. +.|.- ...+..+ T Consensus 325 ~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~-pne~r---------------~~enl~~------- 381 (392) T protein:vir:74 325 HISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYI-PKDLP---------------APENTNK------- 381 (392) T ss_pred hhcccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCC-ccccc---------------hhcCCCC------- Confidence 122223333345677888899999999999998777654 332 22110 0000000 Q ss_pred CCCCcccCCCCCCcccccc Q lcl|NC_013644. 477 NTDEEETAVNPDDPTQQMA 495 (510) Q Consensus 477 ~~~~~~~~~~~~~~~~~~~ 495 (510) ..+|++.++.+ T Consensus 382 --------~~~Gd~~~p~p 392 (392) T protein:vir:74 382 --------KTTGQSNEPVP 392 (392) T ss_pred --------CCCCCCCCCCC Confidence 11111111111 No 190 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=97.63 E-value=3.5e-05 Score=45.00 Aligned_cols=396 Identities=10% Similarity=-0.038 Sum_probs=161.0 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHH-------H----HHHHHHhccCCcchhcccceeccccccccccccccce Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKRE-------A----ETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVR 69 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~-------~----~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~k 69 (510) |-..+ .+.+.......+... . ..+...+-|... ..+..+. +..- T Consensus 1 ~~~~l-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~----------~~g~~v~----~~~a 55 (434) T protein:vir:43 1 MSKSL-----------GKVLSSATSAPRSSLFGWGGKTIRLTDGAFWSQFLGRES----------SSGKKVT----VDKA 55 (434) T ss_pred Cccch-----------hhhhhhcccccchhhhcccccccccCchHHHHHHhcCCc----------cCCceec----hhhh Confidence 21111 111111111010000 0 000000111100 0000000 0000 Q ss_pred eccchhHHHHHHHHhhhhcCCcee-ccC-c---HH-HHHHHHHHh-c--cC---HHHHHHHHHHHHHhcCeEEEEEEECC Q lcl|NC_013644. 70 IPHGFFPEIVDQKTQYLLSNPVEY-ETE-N---EE-LKEYLAEYY-N--SE---FQVVLQELVEGSSQKGFEYVYARTNA 137 (510) Q Consensus 70 i~~n~~~~Iv~~~~~~l~g~p~~~-~~~-d---~~-~~~~l~~~~-~--n~---~~~~~~e~~~~~~~~G~~~~~v~~d~ 137 (510) +.+.-.-..|+..++-+..-|+++ ... + .. ..-.+..++ . |. .......++...+.+|.+|+++..+ T Consensus 56 l~~~~V~~~i~~ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~- 134 (434) T protein:vir:43 56 MKLSAVWACVRLISTSVAGLPLGVYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA- 134 (434) T ss_pred hccHHHHHHHHHHHHhhhhCceEEEEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC- Confidence 111112234555555555567664 211 1 11 111233333 1 32 2355566778889999999888766 Q ss_pred CCce-EEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccc Q lcl|NC_013644. 138 EDRL-CFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPH 216 (510) Q Consensus 138 ~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 216 (510) .|++ .+.+++|..+-+..++.+.+ .|++.. .++. ...+.+..+.+++.- T Consensus 135 ~G~~~~L~~l~p~~v~~~~~~~g~~-----~y~~~~-~~g~------~~~~~~~eVih~~~~------------------ 184 (434) T protein:vir:43 135 AGRPAALDFLLPSRVDLECDENGRL-----KYFYTT-KKGA------RREIERTNMLHIPAF------------------ 184 (434) T ss_pred CCcEEEEEEEcCcceEEEEcCCCeE-----EEEEEe-cCce------EEEEccccEEEecCc------------------ Confidence 5765 57788999988877654321 121111 1111 112233333333210 Q ss_pred cccccccccccccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCch---hhhh Q lcl|NC_013644. 217 VLAVDSENESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDL---SKLR 293 (510) Q Consensus 217 ~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~---~~~~ 293 (510) | .+...|.|-+..+...+........-..+.+...+.|-.+++-...-+. ..++ T Consensus 185 -------------------~----~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~r 241 (434) T protein:vir:43 185 -------------------T----LDGRIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDRILQPAQREEFR 241 (434) T ss_pred -------------------C----CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCCCCCHHHHHHHH Confidence 0 0112355555544444433333323333444445556555544222111 1222 Q ss_pred Hhhh-------cCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc--CcccHHHHHHHHHHHHH Q lcl|NC_013644. 294 QNVK-------SKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGD--GNITNIVIKARYTLLNM 364 (510) Q Consensus 294 ~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~~Sg~Ai~~~~~~l~~ 364 (510) ..++ .++++.++++.+.+.++.+.....+.+..+...+.|+..-++|+.-.+.. ++.++..++.... T Consensus 242 ~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~~~---- 317 (434) T protein:vir:43 242 EYVKSVSGAMNSGRSPVLEQGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQML---- 317 (434) T ss_pred HHHHHhcCccccCCccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHHHH---- Confidence 2221 13345555555555444434445566777888889999888887533221 2222222222211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccCCc--cccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCC Q lcl|NC_013644. 365 KANKTEARLRALLEWMNKLVIDDINRRYTKA--FDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLD 442 (510) Q Consensus 365 k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~--~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~ 442 (510) ..+...|.-.+..|...+..+--.. .....+++.+..-+..|..+.++.+.++..+|+++.-.+++.++.-. T Consensus 318 ------~f~~~~L~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~p 391 (434) T protein:vir:43 318 ------AFLTFSISSITNQIQQCVNKRLLTAPERIRYYAEFSLEGFLKADSAGRAAWYSTMAQNGFMTRNEGRRKENLPE 391 (434) T ss_pred ------HHHHHHHHHHHHHHHHHHHhhcCChhhhcCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Confidence 2333344444444444443221111 11123455555667789999999999999999999988888764321 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccccCCC Q lcl|NC_013644. 443 DDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLPEN 509 (510) Q Consensus 443 d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (510) -+.-.+. ....+..+. +..++.+++. +. ..+.+....+..|.. T Consensus 392 ~~ggD~~------------~~~~n~~~~----~~~~~~~~~~-~~-------~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 392 LPGGDIL------------TVQSNLVPI----DQLGQSNKSQ-AV-------RAALMNWFSQPEPQE 434 (434) T ss_pred CCCCCeE------------eeccCccch----hhhhccCCCc-ch-------hhhhhccCCCCCCCC Confidence 1100000 000000000 0000000000 00 000000011111111 No 191 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=97.61 E-value=3.7e-05 Score=44.85 Aligned_cols=368 Identities=9% Similarity=0.021 Sum_probs=148.5 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHH-HHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKR-EAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIV 79 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~-~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv 79 (510) |--.-. .......+. ....+..+. ...++.. . .-+..+. ...-+..+-....| T Consensus 1 Mglf~~---------------~~~~~~~~~~~~~~~~~~~--~~~~~~~----~-~~~~~v~----~~~al~~~~V~~~i 54 (384) T protein:vir:49 1 MPIFNI---------------TNLATESPPSNQDSFFDIT--DPEFLDA----L-NGSEWVS----AETALKNSDLFSII 54 (384) T ss_pred Cccccc---------------cccCcccccccchhhcccc--chhhccc----c-cCCceec----hhhhhccHHHHHHH Confidence 111100 000000000 000000000 0000000 0 0000000 00001122233445 Q ss_pred HHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcC Q lcl|NC_013644. 80 DQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNE 157 (510) Q Consensus 80 ~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~ 157 (510) +..++-+.+-|+++.- ......+.+=.. .........+....+.+|.||+++-.|..|++ .+.+++|..+-++.++ T Consensus 55 ~~Ia~~ia~l~~~~~~--~~~~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~ 132 (384) T protein:vir:49 55 SQLSNDLATAKITTSR--KQLQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLD 132 (384) T ss_pred HHHHHHHhhCceeeec--chhhhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcC Confidence 6566666666766532 112122211111 12345566778889999999999989988875 6888899988876653 Q ss_pred CCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccE Q lcl|NC_013644. 158 YNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPF 237 (510) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 237 (510) .... .+|.+...+.... ....+....+.+++.-. + T Consensus 133 ~~~~----~~y~~~~~~~~~~----~~~~~~~~eVih~~~~~-------------------------------~------ 167 (384) T protein:vir:49 133 NQNG----LYYNITFDDPRIP----PKQHVPQGDILHFRLLS-------------------------------V------ 167 (384) T ss_pred CCce----EEEEEEecCcccc----ceeEecCccEEEecCCC-------------------------------C------ Confidence 2211 0121111111000 00112233333332100 0 Q ss_pred EEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhh--------hcCeeeeccCCCc Q lcl|NC_013644. 238 YRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNV--------KSKKVVGTGSDGG 309 (510) Q Consensus 238 v~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~--------~~~~~~~~~~~~~ 309 (510) ...-.|.|-+..+...++....+..-..+.+...+.|-.+++-.+.....+..+.. ..++++.++++.+ T Consensus 168 ---~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~ 244 (384) T protein:vir:49 168 ---DGGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKTKQSRSRQAMKQMQGGPLVLDDLED 244 (384) T ss_pred ---CCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHhcccCCccceecCCCce Confidence 00123566666555555554444444455556666666665433222222211111 1234556666665 Q ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc--cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 310 LDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG--DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDD 387 (510) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~ 387 (510) ++.+........+.+..+.+.+.|+..-++|+.-.+. .+..++..++..+...+ ...++-++.. T Consensus 245 ~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~i--------------~~~l~pi~~~ 310 (384) T protein:vir:49 245 FTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEMIYNIYFKAV--------------SRFLRPFVSE 310 (384) T ss_pred EEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHHHHHHHHHHH--------------HHHHHHHHHH Confidence 5555444444555677788889999988899864432 22234444333322222 2222222222 Q ss_pred HhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhC---CCCCcHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 388 INRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVA---PRLDDDNVLRLICEQFDLDWEDVKEA 464 (510) Q Consensus 388 ~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~---~~v~d~e~~~~~~e~~e~~~~~~~~~ 464 (510) +...-...+ ........-.+.......+..+..++++++-.+++.+ |+.+ .|..+ . T Consensus 311 i~~~l~~~l-----~~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~-ne~r~---------------~ 369 (384) T protein:vir:49 311 LSKKLSCEV-----DADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILP-KDLPE---------------G 369 (384) T ss_pred HHHHhchhh-----hhhhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCC-hhHHH---------------H Confidence 111100000 0001111111112333344556777888876666653 4433 32111 0 Q ss_pred HHhhhccCCCCCCCCCcc Q lcl|NC_013644. 465 LEEAEYTKGLSDNTDEEE 482 (510) Q Consensus 465 ~~~~~~~~~~~~~~~~~~ 482 (510) . ..++-.++..+++- T Consensus 370 ~---~~~p~~gGd~~~~~ 384 (384) T protein:vir:49 370 E---TDSTLKGGETNEQY 384 (384) T ss_pred c---CCCCCCCCCCCCCC Confidence 1 11111111122222 No 192 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=97.61 E-value=3.8e-05 Score=44.84 Aligned_cols=443 Identities=11% Similarity=0.024 Sum_probs=159.9 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhccc---------ce---eccccccccccccccc Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRI---------FY---VDDEGILREDKYASNV 68 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~---------~~---~~~~~~~~~~~~~~~~ 68 (510) |-.+ +.+.....-+ .+.-..+.++..|- + |++.+.. .. ....+...-...++.. T Consensus 1 ~~~~---------~~~~~~~~~~--~~~~~~~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 66 (535) T protein:vir:10 1 MAIL---------KDLRNAFSLS--NKKSTSYIELGDYD-K--DIVNKAIRPGRASARDTVDGIDIADGNVAGQYSVASI 66 (535) T ss_pred Chhh---------HHHHHHHHhh--hhhhhhhHHHhhhh-H--HHHHhhhhhhhhhhhccccccccccCCcccccccCcc Confidence 2211 1111111100 00001111111111 1 1100000 00 0000000000001100 Q ss_pred e----------e--ccchhHHHHHHHHhhh-------------hcCCceec-c----CcHH--HHHHHHHHhc---cC-- Q lcl|NC_013644. 69 R----------I--PHGFFPEIVDQKTQYL-------------LSNPVEYE-T----ENEE--LKEYLAEYYN---SE-- 111 (510) Q Consensus 69 k----------i--~~n~~~~Iv~~~~~~l-------------~g~p~~~~-~----~d~~--~~~~l~~~~~---n~-- 111 (510) + . ..+..+.+|++.+... .|-|+.+. . .... ....|..++. |. T Consensus 67 ~~~~~~~~l~~~~~~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~ 146 (535) T protein:vir:10 67 SDVLSTKKLLKAYADNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKRAHEIEDFIYNTGSEYY 146 (535) T ss_pred ccccCHHHHHHHhccChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhhhhHHHHHHHhCCCCCC Confidence 0 0 0122333333333221 23344432 1 1111 1122444442 22 Q ss_pred ----HH-HHHHHHHHHHHhcC-eEEEEEEECCCCceE-EEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEE Q lcl|NC_013644. 112 ----FQ-VVLQELVEGSSQKG-FEYVYARTNAEDRLC-FQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHA 184 (510) Q Consensus 112 ----~~-~~~~e~~~~~~~~G-~~~~~v~~d~~g~~~-i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 184 (510) +. ..+..+..+.+.+| .+|+++..+..|++. +.+++|..+.+..+........ .|++.. .+.. . T Consensus 147 ~~~~~~~~~~~~lv~d~l~~~g~ay~~i~r~~~G~~~~L~~l~p~~V~v~~d~~~~~~~~--~~~~~~-~~~~------~ 217 (535) T protein:vir:10 147 EWRDTFPRLLTKIINDMYVQDQINIERIFKNDSNELDHFNAVDASKVVISYSPRSKDQPR--KFEQFV-SETK------S 217 (535) T ss_pred ChhHHHHHHHHHHHHHHHhhCCceEEEEEECCCCcEEEEEEeCCceeEEEEcCccccCce--EEEEEe-cCce------e Confidence 11 23445566666665 578888888888874 8899999999887754321111 111111 1110 0 Q ss_pred EEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHH Q lcl|NC_013644. 185 EVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMN 264 (510) Q Consensus 185 e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~ 264 (510) ..+.+..+.+++..... .......|+|.++.+...|.....+. T Consensus 218 ~~~~~~eiih~~~~~~~-------------------------------------~~~~~~~G~Spi~~~~~~i~~~~aa~ 260 (535) T protein:vir:10 218 VKFSERNLTFINYWNLS-------------------------------------DTDRRGYGYSPVEASIPLIRAIYDTE 260 (535) T ss_pred EEECcccEEEEeccCCC-------------------------------------CcccccccccHHHHHHHHHHHHHHHH Confidence 12334444444321000 00001135566666555555555444 Q ss_pred HHHHHHHHHhccceeEEe--cCC---C--CchhhhhHhhhc-------CeeeeccCCCceeEEeecC--CHHHHHHHHHH Q lcl|NC_013644. 265 CFLSNNLQDFAEAIYVVS--GFQ---G--DDLSKLRQNVKS-------KKVVGTGSDGGLDVKTVTI--PTEGRKTKMEI 328 (510) Q Consensus 265 S~~~~~~~~~~~~~lv~~--g~~---~--~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 328 (510) .-..+.+...+.|-.++. +.. . +....+...+.. ...+.+-.+.+++|..... ....+.+..+. T Consensus 261 ~~~~~~f~ng~~p~giL~~~~~~~~~ls~e~~e~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~ 340 (535) T protein:vir:10 261 QFNARFFSQGGTTRGILVIDQDGDAQANQMMLAGIRRQWTSQGSGLGGAWKIPILAAKDAKFVNMTQNSRDMEFDKFLNF 340 (535) T ss_pred HHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCcccccccccccCCCceEEecCCChhHHHHHHHHHH Confidence 444555555566655443 321 1 111222222211 1111122333456555444 34455666677 Q ss_pred HHHHHHHHhCCccccccc-----cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeE Q lcl|NC_013644. 329 DKENIYKFGMAFDSTQVG-----DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSF 403 (510) Q Consensus 329 l~~~i~~~s~~p~~~~~~-----~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i 403 (510) ..+.|...-++|++-.+. .+|.++..... +..... ......+..+|.-.++.|...+..+--...+ ..+.+ T Consensus 341 ~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~~~-~~s~~E--~~~~~~~~~~L~P~l~~ie~~ln~~Ll~~~~-~~~~f 416 (535) T protein:vir:10 341 MIYDTAAIFQMQPEEINFPNNGGSTGKSGTKSVN-EGSTAK--AKLESSKDKGLTPLLSFIEQVINDKIMRYVD-TDYRF 416 (535) T ss_pred HHHHHHHHhCCCHHHhccccCcccccchhhhhhh-hhhhHH--HHHHHHHHHHHHHHHHHHHHHHhhhcccccC-CeEEE Confidence 778888887888753321 12222111111 111111 1222233444555555555444432222222 24778 Q ss_pred EeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHH-HHHHHHHHHHHHHHHHHHHHhhhccCCC-CCCCC Q lcl|NC_013644. 404 TFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNV-LRLICEQFDLDWEDVKEALEEAEYTKGL-SDNTD 479 (510) Q Consensus 404 ~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~-~~~~~e~~e~~~~~~~~~~~~~~~~~~~-~~~~~ 479 (510) .|+..+..|.++..+... +..+|.++.-.++++++. +..-+. ........-... ........+.+.+. +...+ T Consensus 417 ~f~~l~~~d~~~r~~~~~-~~~~g~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~--~~~~~~~~p~~~~~~~~~~~ 493 (535) T protein:vir:10 417 SFTLGDAQDKLQEEQVWK-LKLANGYFINEYRKDHGLKTVDGLDVPGFIGSAENFINA--TGFGQPNVPDSSDDSGSTLG 493 (535) T ss_pred EeccccccCHHHHHHHHH-HHHcCCCCHHHHHHHhCCCCCCCccccccccchhhcccc--cccccccCCCCCCCccccCC Confidence 888888889888877665 444666898888887643 221010 000000000000 00000000000000 00000 Q ss_pred CcccCCCCCC-------cccccccC--cccccccccCCCC Q lcl|NC_013644. 480 EEETAVNPDD-------PTQQMAEG--ATGSTESQLPENG 510 (510) Q Consensus 480 ~~~~~~~~~~-------~~~~~~~~--~~~~~~~~~~~~~ 510 (510) ++.+.+..++ .+++...- ...+...|.-++| T Consensus 494 ~~~~q~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 533 (535) T protein:vir:10 494 ERERQERIQHSKDYEKGKDDPKSPLPKPSESDDVSNNEDA 533 (535) T ss_pred ccccCcccccccccccCCCCCCCCCCcCCCCCcccccccc Confidence 0000000000 00000000 0001111111222 No 193 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=97.55 E-value=4.7e-05 Score=44.31 Aligned_cols=259 Identities=10% Similarity=0.073 Sum_probs=122.2 Q ss_pred hhcCCceeccCcHHHHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCC Q lcl|NC_013644. 86 LLSNPVEYETENEELKEYLAEYYN---S---EFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEY 158 (510) Q Consensus 86 l~g~p~~~~~~d~~~~~~l~~~~~---n---~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~ 158 (510) +.+-|+.+-..++.....+...+. | ........++...+.+|.||+.+..+.+|.+ .+.+++|..+.+..++. T Consensus 1 ia~l~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~~l~~l~~~~v~v~~~~~ 80 (278) T protein:vir:78 1 MASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ 80 (278) T ss_pred CccceeEEEecCcccccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEEEEEEECCceeEEEEcCC Confidence 333444442222222222333321 2 2345667778899999999999988888875 67888999888777644 Q ss_pred CCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEE Q lcl|NC_013644. 159 NELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFY 238 (510) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv 238 (510) +.. +++.....++. ...+....+.+++.-. +. T Consensus 81 ~~~-----~~y~~~~~~g~------~~~~~~~evih~~~~~-------------------------------~~------ 112 (278) T protein:vir:78 81 SRE-----LYYSIHAATGN------KLIVHNMDMLHFKHIV-------------------------------AS------ 112 (278) T ss_pred Cce-----EEEEEEcCCce------EEEEccccEEEECCCC-------------------------------CC------ Confidence 321 11111111111 0112233333332100 00 Q ss_pred EecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcc-ceeEEec-CCCCc--hhhhhHh----hh-cCeeeeccCCCc Q lcl|NC_013644. 239 RLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAE-AIYVVSG-FQGDD--LSKLRQN----VK-SKKVVGTGSDGG 309 (510) Q Consensus 239 ~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~-~~lv~~g-~~~~~--~~~~~~~----~~-~~~~~~~~~~~~ 309 (510) +...|.|.+..+...++....+... .+..++. |-.++.. ...++ ...+... .. .++++.++++.+ T Consensus 113 ---~~~~G~s~~~~~~~~i~~~~~~~~~---~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~ 186 (278) T protein:vir:78 113 ---NMVQGISPIDVLKNTTDFDNAVRTF---NLTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEPGVE 186 (278) T ss_pred ---CCeeeccHHHHHHHHHHHHHHHHHH---HHHHhcCCCcEEEEeCCCCCHHHHHHHHHHHHHHhccCCCceecCCCce Confidence 1113666666665555544333221 2233332 3333332 22211 1111111 11 234666666666 Q ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc--CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 310 LDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGD--GNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDD 387 (510) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~ 387 (510) ++.++.......+.+..+...+.|+..-++|+.-.+.. ++-|. ++. .....+..+|.-+++.|... T Consensus 187 ~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn--~~~----------~~~~~~~~~l~P~~~~i~~~ 254 (278) T protein:vir:78 187 IEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAK--NEE----------LNRFYLQHTLLPIVKQYEEE 254 (278) T ss_pred EEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HHH----------HHHHHHHHHHHHHHHHHHHH Confidence 66665554555666777888889988888887533322 22221 111 11233444455555555555 Q ss_pred HhhccCCcccc-ceeeEEeCCCCC Q lcl|NC_013644. 388 INRRYTKAFDP-TEVSFTFTREVM 410 (510) Q Consensus 388 ~~~~~~~~~~~-~~v~i~f~~~~p 410 (510) +..+--...+. ....+.|+-+.- T Consensus 255 ln~~L~~~~e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 255 FNRKLLTKTDREKIGILNLTLNLI 278 (278) T ss_pred HHhhcCChhHhcCCceEEEecccC Confidence 54432111111 124566763333 No 194 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=97.52 E-value=5.2e-05 Score=44.06 Aligned_cols=379 Identities=13% Similarity=0.093 Sum_probs=159.6 Q ss_pred HHHhccCC---cchhccc---------------ce---ecccccc---------cccccc---ccceeccchhHHHHHHH Q lcl|NC_013644. 36 IRYYNHEN---DIMNNRI---------------FY---VDDEGIL---------REDKYA---SNVRIPHGFFPEIVDQK 82 (510) Q Consensus 36 ~~YY~g~~---~i~~~~~---------------~~---~~~~~~~---------~~~~~~---~~~ki~~n~~~~Iv~~~ 82 (510) ..+|.-.- |...|+. +. ....+.. .-.... +..=+.++=.-..|+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~acv~~I 80 (441) T protein:vir:98 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMI 80 (441) T ss_pred CceecCccceeccccccchhhhhhccccccccccccccCCCcchHHHHHHhhcccccCccccchhhhhccHHHHHHHHHH Confidence 12221110 0000000 00 0000000 000000 00000000011124444 Q ss_pred HhhhhcCCceeccCcH-HHHHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEE Q lcl|NC_013644. 83 TQYLLSNPVEYETENE-ELKEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGV 154 (510) Q Consensus 83 ~~~l~g~p~~~~~~d~-~~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~ 154 (510) ++-+.+-|+.+.-+.+ ....-+-.++. |. .......+....+.+|.||+++.++.+|++ .+.+++|..+.+. T Consensus 81 a~~iA~lpl~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~ 160 (441) T protein:vir:98 81 ASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELK 160 (441) T ss_pred HHhhccCceEEecCCcccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCceeEEE Confidence 4444455666532211 11122222331 22 234556677888999999999989988875 5888999999988 Q ss_pred EcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCc Q lcl|NC_013644. 155 YNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQ 234 (510) Q Consensus 155 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 234 (510) .++.+.+.. ++......+. . ....+.+..+.+++.- T Consensus 161 ~~~~g~~~~----~~~~~~~~~~-~---~~~~~~~~dviHir~~------------------------------------ 196 (441) T protein:vir:98 161 LDARGRLYY----FHQRIDSNGN-N---IERNVKFEDMLDIKFY------------------------------------ 196 (441) T ss_pred ECCCCcEEE----EEEEeccCcc-e---eeEEEccccEEEeccC------------------------------------ Confidence 876553211 1111111100 0 0112233333333210 Q ss_pred ccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEe--cCCCCch--hhhhHhhh--------cCeee Q lcl|NC_013644. 235 IPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVS--GFQGDDL--SKLRQNVK--------SKKVV 302 (510) Q Consensus 235 iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~--g~~~~~~--~~~~~~~~--------~~~~~ 302 (510) | .+.-.|.|-+..+...++..+.+..-..+.+.-.+.|-.+++ +...++. ..++.... .++++ T Consensus 197 -~----~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~G~~nag~~~ 271 (441) T protein:vir:98 197 -S----LDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVV 271 (441) T ss_pred -C----CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCcce Confidence 0 011135555665555555444444444445555566666654 3211111 11222111 12356 Q ss_pred eccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc-CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 303 GTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGD-GNITNIVIKARYTLLNMKANKTEARLRALLEWMN 381 (510) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~ 381 (510) .++++.+.+.++.+.....+.+..+...+.|...-++|+.-.+.. ++.|-...... |...|.-.+ T Consensus 272 vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~~--------------y~~tl~P~~ 337 (441) T protein:vir:98 272 VLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLD--------------YLSTLKPYI 337 (441) T ss_pred ecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHHH--------------HHHHHHHHH Confidence 667666666665554445566677777888888888887644321 11221111111 112233333 Q ss_pred HHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHH Q lcl|NC_013644. 382 KLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWE 459 (510) Q Consensus 382 ~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~ 459 (510) ..|...+..+-.....-..+++..+.-+-.|..+.++.+.++..+|+++.-.++++++. +.+.+... T Consensus 338 ~~ie~~ln~~L~~~~~~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGd~~~----------- 406 (441) T protein:vir:98 338 TCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSI----------- 406 (441) T ss_pred HHHHHHHHhhccccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcce----------- Confidence 33333232221111112234444455577899999999999999999999888887643 22111000 Q ss_pred HHHHHHHhhh------ccCCCCCCCCCcccCCCCCCccc Q lcl|NC_013644. 460 DVKEALEEAE------YTKGLSDNTDEEETAVNPDDPTQ 492 (510) Q Consensus 460 ~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~ 492 (510) .....+..+ ++..-..+.+... .+|+..| T Consensus 407 -~~~~~n~~~~~~~~~~q~~~~~~~~~~~---kgGe~ne 441 (441) T protein:vir:98 407 -HRVDLNHVNIELVDEYQMNKSRATDKKL---KGGEENE 441 (441) T ss_pred -Eeeccccccccccccccccccccccccc---CCCCCCC Confidence 000000000 0000000000000 0111101 No 195 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=97.48 E-value=5.8e-05 Score=43.80 Aligned_cols=432 Identities=9% Similarity=0.042 Sum_probs=186.4 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |..=..+......+.+++..+..+..+ -.....++++|+---+-+ ....+. .+...|+..+-+..-++ T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R-~~~e~~w~e~a~~~lP~~------~~~~~~-----~~~~~~~~dstg~~a~~ 68 (516) T protein:vir:10 1 MKQSTDLEYGGKRSKIPKLWEKFSTKR-SSFLDRAKHYSKLTLPYL------MNDKGD-----NETSQNGWQGVGAQATN 68 (516) T ss_pred CCchhhHhhhhHHHHHHHHHHHHHHhh-hHHHHHHHHHHHhhcccc------cCCCCC-----cccccccccchHHHHHH Confidence 554444555555566666666654322 222333444443322210 000111 11112455666677777 Q ss_pred HHHhhhhc--CCce-----eccCcH-------------HHHHHHH-------HHh-ccCHHHHHHHHHHHHHhcCeEEEE Q lcl|NC_013644. 81 QKTQYLLS--NPVE-----YETENE-------------ELKEYLA-------EYY-NSEFQVVLQELVEGSSQKGFEYVY 132 (510) Q Consensus 81 ~~~~~l~g--~p~~-----~~~~d~-------------~~~~~l~-------~~~-~n~~~~~~~e~~~~~~~~G~~~~~ 132 (510) +.++-|++ -|+. +...+. .+.+.|. ..+ .+||...+.++.++..++|.|. T Consensus 69 ~LAa~l~~~ltpp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~-- 146 (516) T protein:vir:10 69 HLANKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCM-- 146 (516) T ss_pred HHHHHHHhhhcCCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEe-- Confidence 77776654 2322 232221 1222222 222 4688889999999999999985 Q ss_pred EEECCCCceEEEEEcccceEEEEcCCCCceeEEEEEEEEEee------C--------CceeEEEEEEEEc-----CCcEE Q lcl|NC_013644. 133 ARTNAEDRLCFQVADSLNVFGVYNEYNELQRICRHYITEIEK------D--------GETVDIHHAEVWT-----DQNVY 193 (510) Q Consensus 133 v~~d~~g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~------~--------~~~~~~~~~e~y~-----~~~i~ 193 (510) +|.|+++.++ .++-.+++..-|..+++..+++........ . ........+++|+ ++..+ T Consensus 147 l~~d~~~~~~--~~pl~~y~v~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t~v~~~~~~~~ 224 (516) T protein:vir:10 147 LYKPSKGAIS--AIPMHHYVVNRDTNGDLLDIILLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYTHAKYLGEGFW 224 (516) T ss_pred EEecCCCCeE--EEEcCeEEEeeCCCCCeEEEeeeecccHHHHHHHhhhhhhhhhhhhccCCCCceEEEEEEEecCCCce Confidence 4567776654 444456666667667666655433221100 0 0001111223332 12111 Q ss_pred EEEEcCCceeecccccccccccccccccccccccccccCCcccEEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 194 FFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLS-----NNKQETTDLKPIKALIDDYDLMNCFLS 268 (510) Q Consensus 194 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----nn~~g~sd~~~v~~liD~~n~~~S~~~ 268 (510) .+....++. ........+|..+|++.++ .+.+|+|-.++..+-+..+|.+.-... T Consensus 225 ~~~~~~d~~--------------------~~~~~s~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l 284 (516) T protein:vir:10 225 ELKQSADDI--------------------PVGKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVA 284 (516) T ss_pred EEEEeeCce--------------------eeccccccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHH Confidence 111111000 0001112345567766654 346799988889999999998877777 Q ss_pred HHHHHhccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCccccccc Q lcl|NC_013644. 269 NNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDSTQVG 346 (510) Q Consensus 269 ~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~ 346 (510) ........|.+.+.-........+.. ...+.+..+..++++.+... .+.......++.++..|-..-..-....-. T Consensus 285 ~~~~~a~~~~~lv~p~g~~~~~~l~~--~~~g~~~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd 362 (516) T protein:vir:10 285 RGAALMADIKYLIRPGAQTDVDHFVN--SGTGEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTRRD 362 (516) T ss_pred HHHHHhcCCCcccCcccccchhhhcc--CCCceeecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhhhhccC Confidence 77777787776553211222222211 12234445555667776543 356777777877777775432211111112 Q ss_pred cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHh----hccCCccccceeeEEeCCCCCCCHHHHH---H Q lcl|NC_013644. 347 DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVID-DIN----RRYTKAFDPTEVSFTFTREVMVNETDIV---N 418 (510) Q Consensus 347 ~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~-~~~----~~~~~~~~~~~v~i~f~~~~p~d~~e~~---~ 418 (510) ....|++.+.. +..+|+..++..+.++-.=++. ++. .... .....-+.+.... +.+....+ + T Consensus 363 ~~rvTAtEV~~-------r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~~~p-~~P~~lv~~~~v~--~i~~L~raq~~~ 432 (516) T protein:vir:10 363 AERVTAVEIQR-------DALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGD-SFTSDLVDPVIIT--GIEALGRMAELD 432 (516) T ss_pred CccccHHHHHH-------HHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhCC-CCChhhcCcceeh--hHHHHHHHHHHH Confidence 23345555543 5677777777777664222211 111 1111 0111111111111 11111111 1 Q ss_pred HHHHHHh-cCC---Cch------------HHHHHhCC----CCCc-HHHHHHHHHHHHHHHHHHHHHHHhhh-ccCCCCC Q lcl|NC_013644. 419 DEKTEAE-TRK---IIL------------ESILQVAP----RLDD-DNVLRLICEQFDLDWEDVKEALEEAE-YTKGLSD 476 (510) Q Consensus 419 ~~~~~~~-~g~---iS~------------et~~~~~~----~v~d-~e~~~~~~e~~e~~~~~~~~~~~~~~-~~~~~~~ 476 (510) .+....+ .+. ++. +.+...++ .+.. +|..++.+++.+.+..... +.+... .++...+ T Consensus 433 ~i~~~~q~i~~~~q~~p~v~d~id~d~~~~~~a~~~gvp~~~irs~eev~~~r~~~~~~q~~~~~-~~~~~~~~~~~~~~ 511 (516) T protein:vir:10 433 KLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMEQEQEAQMQAQQAQML-EEGVAKAVPGVIQQ 511 (516) T ss_pred HHHHHHHHHHHHhcCChHHHhhcCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHHHHHH-HHHhhhcccchhhh Confidence 1111111 000 111 11122211 2222 3333333333222221111 111111 1111111 Q ss_pred CCCCccc Q lcl|NC_013644. 477 NTDEEET 483 (510) Q Consensus 477 ~~~~~~~ 483 (510) .-.+. T Consensus 512 --~~~~~ 516 (516) T protein:vir:10 512 --ELKEA 516 (516) T ss_pred --hhhcC Confidence 11111 No 196 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=97.42 E-value=7.2e-05 Score=43.31 Aligned_cols=396 Identities=10% Similarity=-0.005 Sum_probs=164.9 Q ss_pred hhhhhhHHHHHHHHHHhccCCcch--hcc-cceeccc----cccccccccc---cceeccchhHHHHHHHHhhhhcCCce Q lcl|NC_013644. 23 DRKSSSKREAETGIRYYNHENDIM--NNR-IFYVDDE----GILREDKYAS---NVRIPHGFFPEIVDQKTQYLLSNPVE 92 (510) Q Consensus 23 ~~~~~~~~~~~~~~~YY~g~~~i~--~~~-~~~~~~~----~~~~~~~~~~---~~ki~~n~~~~Iv~~~~~~l~g~p~~ 92 (510) ..+......+.+++..+......- ... ....... +......... +.=+..+-.-..|+..++-+.+-|+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~ 80 (432) T protein:vir:10 1 MPDEKKLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMPLT 80 (432) T ss_pred CCCCcccchhhhhHhhcCCccccccccccccccCcchhhhhcccccccCcccchhhhhcchHHHHHHHHHHHhhhhCcee Confidence 111111112222233332221100 000 0000000 0000000000 00011122333455555555566766 Q ss_pred e--ccCcH---HHHHHHHHHh-c--cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCC Q lcl|NC_013644. 93 Y--ETENE---ELKEYLAEYY-N--SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNE 160 (510) Q Consensus 93 ~--~~~d~---~~~~~l~~~~-~--n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~ 160 (510) + ...+. ....-+..++ . |. .......+....+.+|.||+++..+ +|++ .+.+++|..+.++.|..+. T Consensus 81 ~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~~~~v~v~~~~~g~ 159 (432) T protein:vir:10 81 MYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTKGN 159 (432) T ss_pred EEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEEcCCCc Confidence 4 11111 1112223333 1 22 2344556778889999999887775 4664 5778899999888775543 Q ss_pred ceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEe Q lcl|NC_013644. 161 LQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRL 240 (510) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~ 240 (510) + .|++... ++. ...+....+.+++.- + T Consensus 160 ~-----~y~~~~~-~g~------~~~~~~~~iih~~~~-------------------------------------~---- 186 (432) T protein:vir:10 160 T-----AYRYRRT-DGQ------MIDIPKQQIWKIMGY-------------------------------------S---- 186 (432) T ss_pred E-----EEEEEec-Cce------EEEEcCccEEEecCC-------------------------------------C---- Confidence 1 1222111 111 011223333333210 0 Q ss_pred cCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh---hhhHhh----hcCeeeeccCCCceeEE Q lcl|NC_013644. 241 SNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS---KLRQNV----KSKKVVGTGSDGGLDVK 313 (510) Q Consensus 241 ~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~---~~~~~~----~~~~~~~~~~~~~~~~~ 313 (510) .+.-.|.|-+..+...++.......-..+.+...+.|-.+++....-..+ .+.+.. ..++++.++++.+.+.+ T Consensus 187 ~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~nag~~~vl~~g~~~~~l 266 (432) T protein:vir:10 187 LDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSFAKKVSGSVEAGRAPLLEGGMDVKSL 266 (432) T ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHHHHHHHhhhhhCCCceecCCCceEEEc Confidence 01113555555444444433333222334445555676666543221211 122221 23456667776666666 Q ss_pred eecCCHHHHHHHHHHHHHHHHHHhCCcccccccc--C-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_013644. 314 TVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGD--G-NITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINR 390 (510) Q Consensus 314 ~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g-~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~ 390 (510) +.+.....+.+..+..+..|+..-++|+.-.+.. + ...|..++.... ..+...|.-.++.|...+.. T Consensus 267 ~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~~~~----------~f~~~tl~P~~~~ie~~ln~ 336 (432) T protein:vir:10 267 GLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQL----------GFLSMTLSPWLRRIEQSIAL 336 (432) T ss_pred cCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHHHHH----------HHHHHHHHHHHHHHHHHHHh Confidence 5544445566667888888999888888543321 1 122232322211 22233333333333333332 Q ss_pred ccCCccc--cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 391 RYTKAFD--PTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALE 466 (510) Q Consensus 391 ~~~~~~~--~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~ 466 (510) +--.... ...+++.+..-+..|..+.++.+.++.++|+++.-.++++++. +...... ...+ T Consensus 337 kL~~~~~~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~glppi~g~~~~---------------~~~~ 401 (432) T protein:vir:10 337 NLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNAAV---------------LTVQ 401 (432) T ss_pred hhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCcce---------------Eeec Confidence 1111111 1233444445567899999999999999999999888887653 2211000 0000 Q ss_pred hhhccCCCCCCCCCcccCCCCCCcccccccCcccccccc Q lcl|NC_013644. 467 EAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQ 505 (510) Q Consensus 467 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (510) ....+ .+.......+++..+.+.+++. +.++ T Consensus 402 ~~~~p--l~~~~~~~~~~~~~~~~~~~~~------~~~~ 432 (432) T protein:vir:10 402 SAMVP--LDSIGLQASPEPASGLGNQQQD------KVSK 432 (432) T ss_pred Ccccc--hhhhcccCCCCCCCCCCCcccc------cccC Confidence 00000 0000000011111111111111 1111 No 197 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=97.35 E-value=8.8e-05 Score=42.83 Aligned_cols=363 Identities=9% Similarity=0.012 Sum_probs=153.8 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhccccee---ccc--ccccccccc-ccceeccch Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYV---DDE--GILREDKYA-SNVRIPHGF 74 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~---~~~--~~~~~~~~~-~~~ki~~n~ 74 (510) |--. ..--...+.......... ... ......... +..-+..+- T Consensus 1 M~~f-------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~ 49 (386) T protein:vir:48 1 MPIF-------------------------------NITNLATESPPISQGGFFDITDPDFLSTLNGSEWVSAESALRNSD 49 (386) T ss_pred Cccc-------------------------------ccccccccccccccccccccccchhcccccCCceechhhhhcchH Confidence 1100 000000000000000000 000 000000000 000011122 Q ss_pred hHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceE Q lcl|NC_013644. 75 FPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVF 152 (510) Q Consensus 75 ~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~ 152 (510) ....|+..++-+.+-|+++. +.....++..-.. -........+..+.+.+|.||+++-.|..|.+ .+.+++|..+- T Consensus 50 v~~~i~~ia~~ia~~p~~~~--~~~~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~ 127 (386) T protein:vir:48 50 LFSIINQLSNDLATVKLTAS--RKQLQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVS 127 (386) T ss_pred HHHHHHHHHHhhccCceeec--cchhHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCceeE Confidence 23344444554555566543 2222222222211 12345556678889999999999888888875 67788898887 Q ss_pred EEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccC Q lcl|NC_013644. 153 GVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSY 232 (510) Q Consensus 153 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (510) +..+..+.. .+|.+...... ......+..+.+. T Consensus 128 v~~~~~~~~----~~y~~~~~~~~----~~~~~~~~~~evi--------------------------------------- 160 (386) T protein:vir:48 128 FNRLDNKDG----IYYNITFDDPR----IPPKQHVPQGDVL--------------------------------------- 160 (386) T ss_pred EEEcCCCce----EEEEEEecCcc----ccceeEecCccEE--------------------------------------- Confidence 776543321 11211111100 0011122233333 Q ss_pred CcccEEEecCC-----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhH---hh-----hcC Q lcl|NC_013644. 233 GQIPFYRLSNN-----KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQ---NV-----KSK 299 (510) Q Consensus 233 g~iPvv~~~nn-----~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~---~~-----~~~ 299 (510) |+++. -.|.|.+..+...+.....+..-..+.+...+.|-.+++-..........+ .. ..+ T Consensus 161 ------h~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~~~~e~~~~~~~~~~~~~~n~g 234 (386) T protein:vir:48 161 ------HFKLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKLSRSRQAMKQMQG 234 (386) T ss_pred ------EecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHhhcCCC Confidence 33321 136666666555555544444444555555666766665433222221111 11 123 Q ss_pred eeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 300 KVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEW 379 (510) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~ 379 (510) +++.++++.+++.++.+.....+.+..+...+.|+..-++|+.-.+..++-|. ... .....+..+|.- T Consensus 235 ~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--~e~----------~~~~~~~~~l~P 302 (386) T protein:vir:48 235 GPLVLDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQGDQQS--SLE----------MSLDLYNKAVSR 302 (386) T ss_pred CceecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccc--HHH----------HHHHHHHHHHHH Confidence 35556655555555444334456677788888999888888864432221111 110 001123333444 Q ss_pred HHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCC--CCCcHHHHHHHHHHHHHH Q lcl|NC_013644. 380 MNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAP--RLDDDNVLRLICEQFDLD 457 (510) Q Consensus 380 ~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~--~v~d~e~~~~~~e~~e~~ 457 (510) +++.|...++.+-... +...+...+..+....+..+.++..+|+++.-.+++.+. .+.+.+.-. T Consensus 303 ~~~~ie~~l~~~l~~~-----~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~~~~~--------- 368 (386) T protein:vir:48 303 YLRPFLSELSQKLSCD-----VDADILPAVDPTGSNSVSRINSMVKSGTLAQNQGLYILQQAEILPKELPE--------- 368 (386) T ss_pred HHHHHHHHHHHhhcch-----hhcchhhhhccChHHHHHHHHHHHhCCCcCHHHHHHHhhcCCCCCccchh--------- Confidence 4444444333321111 122222333345566777888899999999877777653 233221100 Q ss_pred HHHHHHHHHhhhccCCCCCCCCCcccC Q lcl|NC_013644. 458 WEDVKEALEEAEYTKGLSDNTDEEETA 484 (510) Q Consensus 458 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (510) . +........+ ++.+.++ T Consensus 369 -------~-~~~~~~~~~g-Gd~~~~~ 386 (386) T protein:vir:48 369 -------G-ENPNKTTLKG-GEINGED 386 (386) T ss_pred -------h-cCCCCCccCC-CCCCCCC Confidence 0 0000000000 0100000 No 198 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=97.32 E-value=9.5e-05 Score=42.65 Aligned_cols=395 Identities=10% Similarity=-0.012 Sum_probs=160.9 Q ss_pred hhhhhH-HHHHHHHHHhccCCcchhcc---cceecc----ccccccccccc---cceeccchhHHHHHHHHhhhhcCCce Q lcl|NC_013644. 24 RKSSSK-REAETGIRYYNHENDIMNNR---IFYVDD----EGILREDKYAS---NVRIPHGFFPEIVDQKTQYLLSNPVE 92 (510) Q Consensus 24 ~~~~~~-~~~~~~~~YY~g~~~i~~~~---~~~~~~----~~~~~~~~~~~---~~ki~~n~~~~Iv~~~~~~l~g~p~~ 92 (510) .+.++. ....+.+..+.....+-... ...... .+......... ..=+.++-.-..|+..++-+.+-|+. T Consensus 1 ~~~~~~mg~f~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~ 80 (432) T protein:vir:81 1 MPDEKKLGLFGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMPLT 80 (432) T ss_pred CCchhhcchhhhhhhhcccccccccccccccccCccchhhhcccccccCcccchHhhhccHHHHHHHHHHHHhhhhCcee Confidence 111110 01111111221111100000 000000 00000000000 00011122223445555555556766 Q ss_pred e-c-cCc---HHHHHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCC Q lcl|NC_013644. 93 Y-E-TEN---EELKEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNE 160 (510) Q Consensus 93 ~-~-~~d---~~~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~ 160 (510) + . ..+ +....-+..++. |. -......+....+.+|.||+++..+ +|++ .+.+++|..+-+..++.+. T Consensus 81 ~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~-~g~~~~L~~l~~~~v~v~~~~~g~ 159 (432) T protein:vir:81 81 MYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDPKGN 159 (432) T ss_pred eEEecCCcceecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEECCCCc Confidence 4 1 111 111112233331 22 2344556677889999999887775 4664 5778899998888775543 Q ss_pred ceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEe Q lcl|NC_013644. 161 LQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRL 240 (510) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~ 240 (510) + +|.+... ++.. ..+....+.+++.- + T Consensus 160 ~-----~y~~~~~-~g~~------~~~~~~~iih~r~~-------------------------------------~---- 186 (432) T protein:vir:81 160 T-----AYRYRRT-DGQM------IDIPKQQIWKIMGY-------------------------------------S---- 186 (432) T ss_pred E-----EEEEEec-CceE------EEEccccEEEecCC-------------------------------------C---- Confidence 2 1222111 1110 11223333333210 0 Q ss_pred cCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCch---hhhhHhh----hcCeeeeccCCCceeEE Q lcl|NC_013644. 241 SNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDL---SKLRQNV----KSKKVVGTGSDGGLDVK 313 (510) Q Consensus 241 ~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~---~~~~~~~----~~~~~~~~~~~~~~~~~ 313 (510) .+.-.|.|-+..+...|+.....-.-..+.+...+.|-.++.-...-+. ..+.+.. ..++++.++++.+++.+ T Consensus 187 ~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~nag~~~vl~~g~~~~~l 266 (432) T protein:vir:81 187 LDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSFAKKVSGSVEAGRAPLLEGGMDVKSL 266 (432) T ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHHHHHHHhhhhcCCCceecCCCceEEEc Confidence 0111355555544444444333333333444444556445443211111 1222221 23456677777666666 Q ss_pred eecCCHHHHHHHHHHHHHHHHHHhCCcccccccc--C-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_013644. 314 TVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGD--G-NITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINR 390 (510) Q Consensus 314 ~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g-~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~ 390 (510) +.......+.+..+...+.|...-++|+.-.+.. + ...|..++-... ..+...|.-.++.|...+.. T Consensus 267 ~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~~----------~f~~~tl~P~~~~ie~~l~~ 336 (432) T protein:vir:81 267 GLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQL----------GFLTMTLSPWLRRIEQSIAL 336 (432) T ss_pred cCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHHHH----------HHHHHHHHHHHHHHHHHHHh Confidence 5544455566677788888988888888543221 1 122232322111 22233444444444443332 Q ss_pred ccCCcccc--ceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 391 RYTKAFDP--TEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALE 466 (510) Q Consensus 391 ~~~~~~~~--~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~ 466 (510) +--..... ..+++.+..-+..|..+.++.+.++..+|+++.-.++++++. +.+.... .... T Consensus 337 kLl~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~~~glpp~~g~~~~---------------~~~~ 401 (432) T protein:vir:81 337 NLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNAAV---------------LTVQ 401 (432) T ss_pred hccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCcce---------------Eeec Confidence 21111111 234444445577899999999999999999999888887653 2211000 0000 Q ss_pred hhhccCCCCCCCCCcccCCCCCCcccccccCcccccccc Q lcl|NC_013644. 467 EAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQ 505 (510) Q Consensus 467 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (510) ....+ .+.......+.+.++.+.+ +.+..++ T Consensus 402 ~~~~p--l~~~~~~~~~~~~~~~~n~------~~~~~~~ 432 (432) T protein:vir:81 402 SAMVP--LDSIGLQASPEPASGLGNQ------QQDKVSK 432 (432) T ss_pred Ccccc--hhhhccCCCCCCCCCCCCc------ccccccC Confidence 00000 0000000000001111101 1111111 No 199 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=97.25 E-value=0.00012 Score=42.16 Aligned_cols=387 Identities=10% Similarity=-0.035 Sum_probs=160.4 Q ss_pred HHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceecccccccccccccccee------ccchhHHHHHHHHhhhhc Q lcl|NC_013644. 15 ALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRI------PHGFFPEIVDQKTQYLLS 88 (510) Q Consensus 15 ~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki------~~n~~~~Iv~~~~~~l~g 88 (510) .+.=+- .|.......++ .+..+++.+.- ..+................+...+ .++=....|+..++-+.+ T Consensus 1 ~~~~~~-~~~~~~~~~~~-~~~~lf~~~~~--~~~~~~~~~~~~~~~~~~~~~~~vs~~~al~~~~v~~cv~~Ia~~iA~ 76 (424) T protein:vir:45 1 MLYCWW-AHWLWPEGGRV-LLDALFRSKSL--ENPSTPITGDAVDTDGLFRADVYVSPETAMKLAAVYSCIYVLSSSLAQ 76 (424) T ss_pred CeeEee-eceecCcchhH-HHHhhccccCC--CCCccccchhhhhhhccccCCceechHHhhccHHHHHHHHHHHHHHhh Confidence 000000 01101111111 12223322210 000000000000000001111111 111222345555555556 Q ss_pred CCceec-cCc---HHH-HHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEc Q lcl|NC_013644. 89 NPVEYE-TEN---EEL-KEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYN 156 (510) Q Consensus 89 ~p~~~~-~~d---~~~-~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d 156 (510) -|+++- ..+ +.+ ..-+..++. |. .......+....+.+|.+|.++-.+..|++ .+.+++|..+.+.-+ T Consensus 77 lp~~v~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~i~~~ 156 (424) T protein:vir:45 77 MPLHVMRRHKGKVEPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEVISLDCCMPWETTLMNT 156 (424) T ss_pred CceEEEEecCCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEEc Confidence 677641 111 111 112233321 22 234445577888999999999888888886 578888887765433 Q ss_pred CCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCccc Q lcl|NC_013644. 157 EYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIP 236 (510) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP 236 (510) + +++ +|.+.. . ... ..+.+..+.+++.-. T Consensus 157 ~-~~~-----~y~~~~-~-~~~------~~~~~~eVih~r~~~------------------------------------- 185 (424) T protein:vir:45 157 G-GRY-----TYGLYN-E-YGA------FAISPDDMIHIRALG------------------------------------- 185 (424) T ss_pred C-CeE-----EEEEEe-c-Cce------EEECcccEEEecCcC------------------------------------- Confidence 2 111 111111 0 000 112233333332100 Q ss_pred EEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCch---hhhhHhh----h-----cCeeeec Q lcl|NC_013644. 237 FYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDL---SKLRQNV----K-----SKKVVGT 304 (510) Q Consensus 237 vv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~---~~~~~~~----~-----~~~~~~~ 304 (510) .+...|.|.+..+...|+....+-.-..+.+...+.|-.+++-...-+. ..+...+ . .++++.+ T Consensus 186 ----~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~~n~g~~~vl 261 (424) T protein:vir:45 186 ----NNQKMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKSGLNKESWGWLKDQWQKASQALRRQENKTMLL 261 (424) T ss_pred ----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhccccccCCceeEc Confidence 0112355666555555544333333334445556667666653222111 1122211 1 1245566 Q ss_pred cCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc--CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 305 GSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGD--GNITNIVIKARYTLLNMKANKTEARLRALLEWMNK 382 (510) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~ 382 (510) +++.+++.++.......+.+..+...+.|...-++|+.-.+.. ++-|+. +. .....+...|.-.++ T Consensus 262 ~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~--eq----------~~~~f~~~tL~P~~~ 329 (424) T protein:vir:45 262 PADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNI--SA----------QAIQFVRYTMMPWVT 329 (424) T ss_pred CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccH--HH----------HHHHHHHHHHHHHHH Confidence 6666555554433334456667777888888888887543322 222221 11 111233344444444 Q ss_pred HHHHHHhhccCCccc---cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCC--CcHHHHHHHHHHHHHH Q lcl|NC_013644. 383 LVIDDINRRYTKAFD---PTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRL--DDDNVLRLICEQFDLD 457 (510) Q Consensus 383 ~i~~~~~~~~~~~~~---~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v--~d~e~~~~~~e~~e~~ 457 (510) .|...+..+--...+ ...+++.+..-+-.|..+.++.+.+++++|+++.-.++++++.- ++-+. T Consensus 330 ~ie~~ln~kLl~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~NE~R~~~gl~pi~ggD~----------- 398 (424) T protein:vir:45 330 NWEQELNRRLFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEARAFEDMNPVEGLDE----------- 398 (424) T ss_pred HHHHHHHHhcCChhhhcCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcce----------- Confidence 444444322111111 11244444566677999999999999999999998888876432 11000 Q ss_pred HHHHHHHHHhhhccCCCCCCCCCcccCCCCCCccc Q lcl|NC_013644. 458 WEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQ 492 (510) Q Consensus 458 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (510) .....+..+ ..++...+..+++.+.+ T Consensus 399 ---~~~~~n~~~------~~~~~~~~~~~~~~~~~ 424 (424) T protein:vir:45 399 ---MLVSVNAAN------PAGDFKPPKNDEGKTNE 424 (424) T ss_pred ---eeecccccc------cccccCCCCCCCCCCCC Confidence 000111000 00011111111111111 No 200 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=97.24 E-value=0.00012 Score=42.06 Aligned_cols=446 Identities=9% Similarity=0.051 Sum_probs=175.8 Q ss_pred CCCccCCChhhhHHHHHHH-HH--hhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccc-ee-ccchh Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAA-ID--KDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNV-RI-PHGFF 75 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~-i~--~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~-ki-~~n~~ 75 (510) .+..-.-++......-+.+ ++ -.-+.+++ ...|-.+.+.+....-..+...+. .+-+-. .+ -++-. T Consensus 54 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~l~~~~~~~F----~Gy~~la~laQ~~ey 124 (694) T protein:vir:10 54 LDAAPVAEPSPSLRLARQFEVDVSNYTPRERR-----AASYALDFNGTSMDALSFVTSSGF----PGFPTLVLLAQLPEY 124 (694) T ss_pred hcccccCCCCcchhhhhhccccccCCCccccc-----hhhhhhccCcccccchhhhhccCc----chHHHHHHHhhccch Confidence 1111112222111111111 00 00000110 111211111100000000000000 000000 00 01122 Q ss_pred HHHHHHHHhhhhcCCceecc--------------------CcHHHHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEE Q lcl|NC_013644. 76 PEIVDQKTQYLLSNPVEYET--------------------ENEELKEYLAEYY-NSEFQVVLQELVEGSSQKGFEYVYAR 134 (510) Q Consensus 76 ~~Iv~~~~~~l~g~p~~~~~--------------------~d~~~~~~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~ 134 (510) +.++.+.+..+.-+-+.+++ .+.+..+.|..-+ +=++...+.++.+++-.||.+..++- T Consensus 125 r~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eaik~aRlfGGa~~~i~ 204 (694) T protein:vir:10 125 RAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFK 204 (694) T ss_pred hhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEE Confidence 33344444444322222111 1112333444433 23567889999999999999887776 Q ss_pred ECCCCc-----------------e-EEEEEcccceEEEEcCCCCce-eEEE---EEEEEEeeCCceeEEEEEEEEcCCcE Q lcl|NC_013644. 135 TNAEDR-----------------L-CFQVADSLNVFGVYNEYNELQ-RICR---HYITEIEKDGETVDIHHAEVWTDQNV 192 (510) Q Consensus 135 ~d~~g~-----------------~-~i~~~~p~~~~~~~d~~~~~~-~~~~---~~~~~~~~~~~~~~~~~~e~y~~~~i 192 (510) ++.++. + -+.+++|.++.|-.-+..++. +-+. +|++. +. ++ ...+. T Consensus 205 I~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~----G~-------~I-H~SRL 272 (694) T protein:vir:10 205 IKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMI----GT-------EV-HATRL 272 (694) T ss_pred eecCccccccccccccccccCcceeeeEeecccccccchhhhccchhhccCCCceEEEe----ce-------EE-eeeeE Confidence 655431 1 166677777777321111111 0000 01100 00 00 01111 Q ss_pred EEEEEcCCceeecccccccccccccccccccccccccccCCcccEEE-ecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 193 YFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYR-LSNNKQETTDLKPIKALIDDYDLMNCFLSNNL 271 (510) Q Consensus 193 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~-~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~ 271 (510) ..|... .+|-+. -.++-.|.|...-+.+-+++.+.+.-..+..+ T Consensus 273 ~~f~g~-----------------------------------plPd~LKp~y~~~G~Sv~q~~~e~V~~~~rT~~~v~~Li 317 (694) T protein:vir:10 273 HTIVSR-----------------------------------PVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIV 317 (694) T ss_pred EEecCC-----------------------------------CchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHH Confidence 111100 011000 00123467777777777777777665555555 Q ss_pred HHhccceeEE---ec-CCCCchh-----hhhHhhh-cCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcc Q lcl|NC_013644. 272 QDFAEAIYVV---SG-FQGDDLS-----KLRQNVK-SKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFD 341 (510) Q Consensus 272 ~~~~~~~lv~---~g-~~~~~~~-----~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~ 341 (510) ..+....+.. .. +.+.+.. +.....+ .++++.+++ ++=+|.+.+.+...+...+....+.|...+++|- T Consensus 318 ~~~~v~~lk~dla~~L~~g~~~~l~~R~eli~~~Rsn~G~~llDk-~~Eefeq~stslSGLddVi~qf~q~VAgaa~IPl 396 (694) T protein:vir:10 318 KQFSVSGILMDLAQALMPGANVDLSMRAELINRYRDNRNILFLDK-ATEEFFQFNTPLSGLDALQAQAQEQMSAVSHIPL 396 (694) T ss_pred HhhhhHHHHHHHHHhhcChhHHHHHHHHHHHHHhcCccceEEEec-CCcceEEEecccCCHHHHHHHHHHHHHhhhcCch Confidence 4333322210 00 0111111 1111122 344555653 2236777788999999999999999999999997 Q ss_pred cccccc---C-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHH Q lcl|NC_013644. 342 STQVGD---G-NITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIV 417 (510) Q Consensus 342 ~~~~~~---g-~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~ 417 (510) +-+-+. | |+||..=...|...+. ...++.++..|++++.+|.. +..+.. ++ +++++|++-..-+++|+| T Consensus 397 tkLfGqSPkGlNATGE~D~rnYYD~I~--s~Qe~~L~p~L~rl~~ii~r--S~~G~i--dp-~i~~~fnPL~qmtd~EkA 469 (694) T protein:vir:10 397 IKLLGITPTGLNASSEGEIRVWYDYVR--AYQRNALQQLMNDVIVMIQL--SLFGAV--DP-SIKWQWNALRELDDLEVA 469 (694) T ss_pred hhhhccCcccccccchhhHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH--HhcCCC--CC-cceEEeCCCCCcCHHHHH Confidence 643321 2 5788875555665555 44578899999998877643 333332 33 688999998888888888 Q ss_pred HHH-------HHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCC-CCCC Q lcl|NC_013644. 418 NDE-------KTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAV-NPDD 489 (510) Q Consensus 418 ~~~-------~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 489 (510) +.. +.....|+|+...+..++-.-.+----..++...+--.....+.....+.......+++...+++ ..|. T Consensus 470 eI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 549 (694) T protein:vir:10 470 ESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARAGA 549 (694) T ss_pred HHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCcccccc Confidence 763 34445677766555554311000000000000000000000000000000000000000000000 0111 Q ss_pred cccccccCccccc-----ccc---cCCCC Q lcl|NC_013644. 490 PTQQMAEGATGST-----ESQ---LPENG 510 (510) Q Consensus 490 ~~~~~~~~~~~~~-----~~~---~~~~~ 510 (510) +.-+.....-.+. +.| ++.-| T Consensus 550 ~~~~~v~~~~~~~~~~~ag~~~~~~~~ag 578 (694) T protein:vir:10 550 TAPPTVANVNANVNPREAGAQDAAMRAAG 578 (694) T ss_pred cCCCcccccccccCccccCCCCccceeeE Confidence 0000000000000 000 11111 No 201 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=97.18 E-value=0.00014 Score=41.72 Aligned_cols=369 Identities=9% Similarity=0.024 Sum_probs=156.4 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |--+ .++.. -+..............+-+-. ..+... -+..-+..+-....|+ T Consensus 1 Mg~f------------~~~~~-~~~~~~~~~~~~~~~~~~~~~-----------~~~~~v----~~~~~l~~~~v~~~i~ 52 (382) T protein:vir:48 1 MPIF------------NLATE-SPPDNQGGFFDVVDSDFLASL-----------KGNEWV----SAETALRNSDLFSIIN 52 (382) T ss_pred Cccc------------ccccc-CCcccccccccchhhhccccc-----------cCCccc----chHhhhccHHHHHHHH Confidence 1111 00000 000000000000000000000 000000 0000011122334455 Q ss_pred HHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCC Q lcl|NC_013644. 81 QKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEY 158 (510) Q Consensus 81 ~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~ 158 (510) ..++-+.+-|+++.-. ....++.+=.. .........+....+.+|.||+++-.|.+|++ .+.+++|..+-++.++. T Consensus 53 ~ia~~ia~~~~~~~~~--~~~~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~v~~~~~ 130 (382) T protein:vir:48 53 QLSNDLATVKLITSRK--KLQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDN 130 (382) T ss_pred HHHHhhccCceeeecc--hhhhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCC Confidence 5555555566664322 22222221111 12345566677889999999999989988875 78889999988776543 Q ss_pred CCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEE Q lcl|NC_013644. 159 NELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFY 238 (510) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv 238 (510) +.. + .|.+....... .....+.+..+.+++.-. T Consensus 131 ~~~---~-~y~~~~~~~~~----~~~~~~~~~evih~~~~~--------------------------------------- 163 (382) T protein:vir:48 131 KDG---I-YYNITFDDPRI----PPKQHVPQNDVLHFRLLS--------------------------------------- 163 (382) T ss_pred CCe---E-EEEEEecCccc----cceeEEcCccEEEecCCC--------------------------------------- Confidence 321 0 11111111000 000112233333332100 Q ss_pred EecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhH---hh-----hcCeeeeccCCCce Q lcl|NC_013644. 239 RLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQ---NV-----KSKKVVGTGSDGGL 310 (510) Q Consensus 239 ~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~---~~-----~~~~~~~~~~~~~~ 310 (510) ......|.|-+..+...++....+..-..+.++..+.|-.+++-....+.+...+ .. ..++++.++++.++ T Consensus 164 -~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~ 242 (382) T protein:vir:48 164 -VDGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGGGLLDFKTKLSRSRQAMKQMQGGPLVLDDLEDF 242 (382) T ss_pred -CCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHhhccCCCCeeEcCCCceE Confidence 0001246676776667676665555555666666777766654422222221111 11 12345666666665 Q ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_013644. 311 DVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINR 390 (510) Q Consensus 311 ~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~ 390 (510) +.+........+.+..+...+.|+..-++|+.-.+..++.+. .. ......+...|.-+++.|...+.. T Consensus 243 ~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~--~~----------~~~~~~~~~~l~p~~~~i~~~l~~ 310 (382) T protein:vir:48 243 TPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGDQQS--SL----------EMSSDLYSKAVSRYLRPFLSELSQ 310 (382) T ss_pred EEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccc--HH----------HHHHHHHHHHHHHHHHHHHHHHHH Confidence 555544444556677788888998888888754433222211 10 111223334444444444444433 Q ss_pred ccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhC---CCCCcHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_013644. 391 RYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVA---PRLDDDNVLRLICEQFDLDWEDVKEALEE 467 (510) Q Consensus 391 ~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~---~~v~d~e~~~~~~e~~e~~~~~~~~~~~~ 467 (510) +-...+.. ++...+. .+.......+.++..+|++++-.+++.+ ++..++... .. T Consensus 311 ~l~~~~~~-~~~~~~~----~~~~~~~~~~~~l~~~g~~t~~e~r~~l~~~g~~~~~~~~------------------~~ 367 (382) T protein:vir:48 311 KLSCDVDA-DIFPAVD----PTGSNYISRINSLVKTGTLAQNQGLYILQQAEILPKELPN------------------GE 367 (382) T ss_pred HhcChhhh-hhhhhhc----cchhHHHHHHHHHhhcCccCHHHHHHHHhhCCCCCcchhh------------------hh Confidence 22111111 1111111 1233445566777888998887776654 333322100 00 Q ss_pred hhccCCCCCCCCCcccC Q lcl|NC_013644. 468 AEYTKGLSDNTDEEETA 484 (510) Q Consensus 468 ~~~~~~~~~~~~~~~~~ 484 (510) ...+. .. ++++++.+ T Consensus 368 ~~~~~-~~-GGd~~~~~ 382 (382) T protein:vir:48 368 NPNST-LK-GGEEDGQD 382 (382) T ss_pred cCCCC-CC-CCCCCCCC Confidence 11010 01 11111100 No 202 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=97.12 E-value=0.00016 Score=41.37 Aligned_cols=428 Identities=8% Similarity=-0.008 Sum_probs=182.7 Q ss_pred hHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhc--C Q lcl|NC_013644. 12 IANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLS--N 89 (510) Q Consensus 12 ~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g--~ 89 (510) ..+.+++..+..+.+.--.+...+.+|..-. . .... +.. ......++..+-+..-+++.++.|++ - T Consensus 1 mk~~~~~~~~~lkR~~~e~~w~e~a~~tlP~--~-----~~~~--~~~---~~~~~~~~~dstg~~a~~~LAa~l~~~lt 68 (510) T protein:vir:63 1 MKTTAAMLWEKLRDGSVEQRAIEFAKTTLPY--L-----MVDP--MSG---SRGVVEHDFQSAGALLVNNLAAKLARSLF 68 (510) T ss_pred ChhHHHHHHHHHhccchHHHHHHHHHhhccc--c-----CCCC--CCc---cccccCCCccchHHHHHHHHHHHHHhhhc Confidence 3344555555544222222333344443221 0 0001 100 11111245556666777777766654 2 Q ss_pred Cce-----eccCcH-------------HHHHH-------HHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEE Q lcl|NC_013644. 90 PVE-----YETENE-------------ELKEY-------LAEYY-NSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCF 143 (510) Q Consensus 90 p~~-----~~~~d~-------------~~~~~-------l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i 143 (510) |+. +...++ ++.+. +...+ .+||...+.++.++...+|.+.+ |.++++. ++ T Consensus 69 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l--~~~~~~~-~~ 145 (510) T protein:vir:63 69 PTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRDSDAA-TV 145 (510) T ss_pred CCCCcccccCCChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEE--EEcCCCc-EE Confidence 322 222221 12222 22222 46888999999999999999754 4566553 56 Q ss_pred EEEcccceEEEEcCCCCceeEEEEEEEEEee------------CCceeEEEEEEEEcCCcEEEEEEcCC-ceeecccccc Q lcl|NC_013644. 144 QVADSLNVFGVYNEYNELQRICRHYITEIEK------------DGETVDIHHAEVWTDQNVYFFVAEDN-KDYELDEAEP 210 (510) Q Consensus 144 ~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~e~y~~~~i~~~~~~~~-~~~~~~~~~~ 210 (510) +.++-.+++..-|..+++..+++-+...... .........+++|+.- +...+. ... T Consensus 146 ~~~pl~~y~v~~d~~G~vd~i~rr~~~t~~~l~e~~~~~~~~~~~~~~~~~~v~v~~~V----~~~~~~~~~~------- 214 (510) T protein:vir:63 146 VAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDEEYKQDLMRAGRNLSGSGSVDLYTHV----QRKKGTAMEY------- 214 (510) T ss_pred EEEEcceeEEeeCCCcCeeEEEeeeeccHHHHhHHhhhhhhccccccCCCcceEEEEEE----EeecCCCceE------- Confidence 6666667666667777777776655443110 0011111223333210 111110 000 Q ss_pred cccccccccccccc-cccccccCCcccEEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecC Q lcl|NC_013644. 211 INPRPHVLAVDSEN-ESLLQRSYGQIPFYRLS-----NNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGF 284 (510) Q Consensus 211 ~~~~~~~~~~~~~~-~~~~~~~~g~iPvv~~~-----nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~ 284 (510) .......++.. ......+|..+|++.++ .+.+|+|-.+...+-+..+|.+.-...........|.+.+.-. T Consensus 215 ---~sv~~e~dg~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~ 291 (510) T protein:vir:63 215 ---AELYHEIDGVRVGKEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEA 291 (510) T ss_pred ---EEEEEEecCceeccccccccccCceeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCcc Confidence 00011111111 11123345667877665 3467999888999999999988777777666666665443211 Q ss_pred CCCchhhhhHhhhcCeeeeccCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHH Q lcl|NC_013644. 285 QGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLL 362 (510) Q Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l 362 (510) +...+..+.. ...+.+..+..++++.+... .+.......++.++..|-..- ..+...-.....|++.+..+ T Consensus 292 g~~~~~~~~~--~~~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af-~~~l~~~~~~rvTAtEV~~r---- 364 (510) T protein:vir:63 292 KGAVVDDYQD--AEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF-MYGANQRDAERVTAEEVRIT---- 364 (510) T ss_pred cccchhhhcc--CCCceeecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHH-HhhcccCCCCCcCHHHHHHH---- Confidence 1122222211 11123444444556666533 456777777777777665532 12222222334566665553 Q ss_pred HHHHHHHHHHHHHHHHH--------HHHHHHHHHhhccCCcccccee---eEEeCCCCCCCHH--HHHHHHHHHHhcCCC Q lcl|NC_013644. 363 NMKANKTEARLRALLEW--------MNKLVIDDINRRYTKAFDPTEV---SFTFTREVMVNET--DIVNDEKTEAETRKI 429 (510) Q Consensus 363 ~~k~~~k~~~~~~~l~~--------~~~~i~~~~~~~~~~~~~~~~v---~i~f~~~~p~d~~--e~~~~~~~~~~~g~i 429 (510) +.+|...++..+.+ +++..+.++...+.-+.....+ .|++-.++-+... .....++.+...+.+ T Consensus 365 ---~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl~p~p~~~~~~~~v~~is~Laraq~~~~l~~~~q~l~~~~~~ 441 (510) T protein:vir:63 365 ---AEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPI 441 (510) T ss_pred ---HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCchhcccceecchhHHHHHHHHHHHHHHHHHHHHhcCc Confidence 34444444444444 3444444443333222222222 2334333332221 111111112222222 Q ss_pred c-------hHHHHHh----CC-----CC-CcHHHHHHHHHHHHHHHHHHHH-HHHhh--hccCCCCCCC Q lcl|NC_013644. 430 I-------LESILQV----AP-----RL-DDDNVLRLICEQFDLDWEDVKE-ALEEA--EYTKGLSDNT 478 (510) Q Consensus 430 S-------~et~~~~----~~-----~v-~d~e~~~~~~e~~e~~~~~~~~-~~~~~--~~~~~~~~~~ 478 (510) . ...++.. ++ .+ +++|.+++.+++..+...+.+. +.... ...+....+- T Consensus 442 aq~~~~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~~~~qq~~~~~~~~~~~~~~a~~~~~~~~g~ 510 (510) T protein:vir:63 442 AQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEQQRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) T ss_pred hhhhccCCHHHHHHHHHHHhCCChhHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC Confidence 2 2222222 22 11 2223222222111111111111 11111 1111111111 No 203 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=97.12 E-value=0.00016 Score=41.33 Aligned_cols=392 Identities=11% Similarity=-0.003 Sum_probs=170.2 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceec---cccccccccccccceeccchhHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVD---DEGILREDKYASNVRIPHGFFPE 77 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~---~~~~~~~~~~~~~~ki~~n~~~~ 77 (510) |+.+.- +-+ ...++..+..++..+.|............. ..+...-....+..=+.++-.-. T Consensus 1 ~~~~~~-----~~~----------~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~ 65 (424) T protein:vir:18 1 MEEPKY-----TID----------LRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWR 65 (424) T ss_pred CCCCcc-----eEe----------ecCCCchHHHHHhhhcccccccccccccccccccccccccccccHHHhhccHHHHH Confidence 333211 000 011222333344444433211100000000 00000000000000011112233 Q ss_pred HHHHHHhhhhcCCcee-ccC-cH---H--HHHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCce-EE Q lcl|NC_013644. 78 IVDQKTQYLLSNPVEY-ETE-NE---E--LKEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRL-CF 143 (510) Q Consensus 78 Iv~~~~~~l~g~p~~~-~~~-d~---~--~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i 143 (510) .|+..++-+.+-|+.+ ..+ +. . ...-+-.++. |. .......+....+.+|.+|.++-++..|++ .+ T Consensus 66 cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L 145 (424) T protein:vir:18 66 CVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISL 145 (424) T ss_pred HHHHHHHhhccCceEEEEeecCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEE Confidence 4555555555667664 211 11 1 1112333332 22 234555677889999999999888888875 57 Q ss_pred EEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccccccccc Q lcl|NC_013644. 144 QVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSE 223 (510) Q Consensus 144 ~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (510) .+++|..+-+..++ +.+ +|.+. .++. ...|.+..+.+++.-. T Consensus 146 ~pl~~~~V~v~~~~-~~~-----~y~~~--~~g~------~~~~~~~eIih~r~~~------------------------ 187 (424) T protein:vir:18 146 LPLQSANMDVKLVG-KKV-----VYRYQ--RDSE------YADFSQKEIFHLKGFG------------------------ 187 (424) T ss_pred EEecCcceEEEEcC-CeE-----EEEEE--eCCe------EEEeccccEEEecCcC------------------------ Confidence 78889888765543 211 11111 1111 0123333444432100 Q ss_pred ccccccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCC--Cch--hhhhHhhh-- Q lcl|NC_013644. 224 NESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQG--DDL--SKLRQNVK-- 297 (510) Q Consensus 224 ~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~--~~~--~~~~~~~~-- 297 (510) .+...|.|-+..+...++.......-..+.+...+.|-.++.-... .+. ..+...+. T Consensus 188 -----------------~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~ 250 (424) T protein:vir:18 188 -----------------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI 250 (424) T ss_pred -----------------CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCCHHHHHHHHHHHHHH Confidence 0112355555555555544333333444555666667666643221 111 11111111 Q ss_pred -----cCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc--cCcccHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 298 -----SKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG--DGNITNIVIKARYTLLNMKANKTE 370 (510) Q Consensus 298 -----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~g~~Sg~Ai~~~~~~l~~k~~~k~ 370 (510) .++++.++++.+++.+........+.+..+..++.|...-++|+.-.+. .++..|..++.... T Consensus 251 ~~g~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~---------- 320 (424) T protein:vir:18 251 AGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL---------- 320 (424) T ss_pred hCCcccCCceeccCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHHHHH---------- Confidence 2345566666666665544445556667778888899888898754332 22222222222211 Q ss_pred HHHHHHHHHHHHHHHHHHhhccCCcc--ccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHH Q lcl|NC_013644. 371 ARLRALLEWMNKLVIDDINRRYTKAF--DPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLR 448 (510) Q Consensus 371 ~~~~~~l~~~~~~i~~~~~~~~~~~~--~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~ 448 (510) ..+...|.-.++.|...+..+--... .-..+++.+..-+..|..+.++.+.++..+|+++.-.+++.++.-.-+.-. T Consensus 321 ~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGD- 399 (424) T protein:vir:18 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD- 399 (424) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC- Confidence 22233444444444444433211111 112345555666788999999999999999999987777765431100000 Q ss_pred HHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCc Q lcl|NC_013644. 449 LICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGA 498 (510) Q Consensus 449 ~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (510) ......+..+- + +.+.+.++...|+ T Consensus 400 -----------~~~~~~n~~~l----~----------~~~~~~~p~~~ga 424 (424) T protein:vir:18 400 -----------VAMRQSQYVPI----T----------DLGTNKEPRNNGA 424 (424) T ss_pred -----------eeeeccCccch----H----------hhhccCCCccCCC Confidence 00000000000 0 0011112222222 No 204 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=97.11 E-value=0.00017 Score=41.29 Aligned_cols=376 Identities=9% Similarity=-0.015 Sum_probs=161.9 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhh---HHHHHHHHHHhccCCcchhcccceeccccccccccc-cccceeccchhH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSS---KREAETGIRYYNHENDIMNNRIFYVDDEGILREDKY-ASNVRIPHGFFP 76 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~---~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~-~~~~ki~~n~~~ 76 (510) |- +.+.++...+ .........++-+.... .... ....=...+... T Consensus 1 Mg----------------~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~v~ 49 (406) T protein:vir:95 1 MG----------------LFDRWRRTKRKSKIRADTGYVGLFMSGEDV---------------SFLVPGYVRLSDNPEVR 49 (406) T ss_pred Cc----------------chhhhccccccccccccchhhhhhccCccc---------------CccccCHHHHhhcHHHH Confidence 11 1111110000 00000000111110000 0000 000001234556 Q ss_pred HHHHHHHhhhhcCCceec--cCc--HHHHH-HHHHHhc--c---CHHHHHHHHHHHHHhcCeE--EEEEEECCCCce-EE Q lcl|NC_013644. 77 EIVDQKTQYLLSNPVEYE--TEN--EELKE-YLAEYYN--S---EFQVVLQELVEGSSQKGFE--YVYARTNAEDRL-CF 143 (510) Q Consensus 77 ~Iv~~~~~~l~g~p~~~~--~~d--~~~~~-~l~~~~~--n---~~~~~~~e~~~~~~~~G~~--~~~v~~d~~g~~-~i 143 (510) ..|+..++-+.+-|+.+- .++ +.... ....++. | ........++...+.+|.| |+.+-.+..|.+ .+ T Consensus 50 ~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g~~~~l 129 (406) T protein:vir:95 50 MAVHKIADLISSMTIYLMQNTEDGDIRIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADGLIDEL 129 (406) T ss_pred HHHHHHHHhhccCceEEEEecCCcceeecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCCcEEEE Confidence 667777777777777651 111 11111 2222321 2 2345556677777887665 545556667766 57 Q ss_pred EEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccccccccc Q lcl|NC_013644. 144 QVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSE 223 (510) Q Consensus 144 ~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (510) .+++|..+-++.+..+ |.+.. ++ ..|.+..+.+++.... T Consensus 130 ~~i~~~~v~~~~~~~~--------~~~~~--~~--------~~~~~~evih~~~~~~----------------------- 168 (406) T protein:vir:95 130 VPLTPSKVNFLDTPDG--------YQVLY--GG--------QTFNYDEVLHFIYNPD----------------------- 168 (406) T ss_pred EEEcCceeEEEEcCCe--------EEEEe--cc--------EEEchhHEEEeeccCC----------------------- Confidence 7788888877665431 11100 00 1122333333321100 Q ss_pred ccccccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCc---hhhhhHh----h Q lcl|NC_013644. 224 NESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDD---LSKLRQN----V 296 (510) Q Consensus 224 ~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~---~~~~~~~----~ 296 (510) |. +.-.|.|-+..+...++....+..-..+.+...+.|-.+++-...-+ ...+... . T Consensus 169 ------------~~----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~~~~~~~~ 232 (406) T protein:vir:95 169 ------------PE----RPYIGRGYRVVLKDIADNLKQATATKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFKKY 232 (406) T ss_pred ------------CC----CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHh Confidence 00 00136666666666666665555555555666666666654422211 1122211 1 Q ss_pred h----cCeeeeccCCC-ceeEEe-ecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 297 K----SKKVVGTGSDG-GLDVKT-VTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTE 370 (510) Q Consensus 297 ~----~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~ 370 (510) . .++++.+..++ ..+-++ .+.....+.+..+...+.|+..-++|+.-.+.. ++.. ..+ . T Consensus 233 ~g~~n~~~~~v~~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~---~~~~--~~~----------~ 297 (406) T protein:vir:95 233 LQATEAGQPWIIPAELLEVEQVKPLSLKDIAINEAVELDKRTVAGMFGVPAFLLGIG---EFNR--DEY----------N 297 (406) T ss_pred ccccccCCceeecCCCccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCC---CchH--HHH----------H Confidence 1 12233343333 232222 233334455677777888888888887533222 2211 111 1 Q ss_pred HHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHH Q lcl|NC_013644. 371 ARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLI 450 (510) Q Consensus 371 ~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~ 450 (510) .++..+|.-+++.|...+..+--... ...+.+.++.-+..|..+.++.+.++..+|+++...++++++.-..+...+. T Consensus 298 ~~~~~~l~P~~~~ie~~l~~~l~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~l~~~G~~t~NE~R~~~gl~p~~~gd~~- 375 (406) T protein:vir:95 298 NFINSTILPIAKGIEQELTRKLLISP-DLYFKFNPRSLYAYDLKELAEVGSNMYVRGIMEGNEVRDWLGLSPKEGLSEL- 375 (406) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCC-CcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccee- Confidence 24555566666656555543221111 1245566677777899999999999999999999999888754221110000 Q ss_pred HHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCccccc Q lcl|NC_013644. 451 CEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQM 494 (510) Q Consensus 451 ~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (510) ....+..+. +....... .+.+++.+++.+.+ T Consensus 376 -----------~~~~n~~~~-~~~~~~~~-~k~g~~~~~~~~~~ 406 (406) T protein:vir:95 376 -----------VILENYIPL-DKIGDQSK-LKGGDNSGADGQTD 406 (406) T ss_pred -----------eeccCccch-hhcccccc-cCCCCCCCCCCCCC Confidence 000111000 00000000 11111111110100 No 205 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=97.09 E-value=0.00017 Score=41.19 Aligned_cols=377 Identities=10% Similarity=0.035 Sum_probs=158.8 Q ss_pred HHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCCceec Q lcl|NC_013644. 15 ALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNPVEYE 94 (510) Q Consensus 15 ~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~ 94 (510) ++..+.+..+.............+.-...+..-. .......+..+... .-+.++-....|+..++-+.+-|+++. T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~v~~~----~al~~~~v~~~i~~ia~~ia~lp~~~~ 75 (392) T protein:vir:39 1 MILPILNFINQTNDPPEVGSVQSYFPDGNDAQIM-ESLLGDNNEWVSAR----AALRNSDLFSIILQLSSDLAIVKINAE 75 (392) T ss_pred CcchhhhhhhcccccccccccccccccCchhhhh-hhhcCCCCceechH----HhhccHHHHHHHHHHHHhhccCceeec Confidence 1111111111000000000000000000000000 00000000000000 001112233345555555555566543 Q ss_pred cCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCceeEEEEEEEEE Q lcl|NC_013644. 95 TENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNELQRICRHYITEI 172 (510) Q Consensus 95 ~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~ 172 (510) - .....++.+=.. .........+....+.+|.||+++..|.+|++ .+.+++|..+-+..+..+.. .+|.+.. T Consensus 76 ~--~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~----~~y~~~~ 149 (392) T protein:vir:39 76 K--KKNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENG----MYYNITF 149 (392) T ss_pred c--chhhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCce----EEEEEEe Confidence 2 222222221111 12244556677889999999999988988886 68888999887776543221 0111111 Q ss_pred eeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCCcHHH Q lcl|NC_013644. 173 EKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTDLKP 252 (510) Q Consensus 173 ~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~ 252 (510) ..... .....+.++.+.+++.-. ......|.|-+.. T Consensus 150 ~~~~~----~~~~~~~~~eiih~~~~~----------------------------------------~~~~~~G~s~i~~ 185 (392) T protein:vir:39 150 DDPKI----EPILQAPQSDLIHMKLLS----------------------------------------IDGGKTGISPLYS 185 (392) T ss_pred cCccc----ceeEEEccccEEEecCCC----------------------------------------CCCccccccHHHH Confidence 11100 001122233333332100 0001236676766 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccceeEE--ecCCCCchh---hhhHhh----hcCeeeeccCCCceeEEeecCCHHHHH Q lcl|NC_013644. 253 IKALIDDYDLMNCFLSNNLQDFAEAIYVV--SGFQGDDLS---KLRQNV----KSKKVVGTGSDGGLDVKTVTIPTEGRK 323 (510) Q Consensus 253 v~~liD~~n~~~S~~~~~~~~~~~~~lv~--~g~~~~~~~---~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~ 323 (510) +...++....+..-..+.++..+.|-.++ .+....+.. .+.+.. ..++++.++++.+++.+........+. T Consensus 186 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~ 265 (392) T protein:vir:39 186 LRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLL 265 (392) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHH Confidence 66666554444444445555556665544 332111111 111111 123456666665555555444455667 Q ss_pred HHHHHHHHHHHHHhCCccccccccCc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceee Q lcl|NC_013644. 324 TKMEIDKENIYKFGMAFDSTQVGDGN-ITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVS 402 (510) Q Consensus 324 ~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~ 402 (510) +..+..++.|+..=++|+.-.+..+. .|... ..+..+...|.-.++.|...+..+-... ++ T Consensus 266 e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~-------------~~~~f~~~~l~P~~~~ie~~l~~~L~~~-----~~ 327 (392) T protein:vir:39 266 SQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQ-------------QISGMYASALNRYLRPAISELEYKLSDH-----IS 327 (392) T ss_pred HHHHHHHHHHHHHhCCCHHHhCCCCCcccHHH-------------HHHHHHHHHHHHHHHHHHHHHHHhcccc-----cc Confidence 77788888898888888754433222 12111 1112334445554444444443322111 22 Q ss_pred EEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhC---CCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCC Q lcl|NC_013644. 403 FTFTREVMVNETDIVNDEKTEAETRKIILESILQVA---PRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTD 479 (510) Q Consensus 403 i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~---~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~ 479 (510) +.+..-+-.|..+.++.+.++..+|+++...+.+++ ++..+ |.-+ ..... . T Consensus 328 ~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~-e~r~---------------~e~l~---~------- 381 (392) T protein:vir:39 328 VNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPK-DLPA---------------PENTN---K------- 381 (392) T ss_pred ccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCcc-ccch---------------hcCCC---C------- Confidence 222233345667788888899999999987776644 44322 1100 00000 0 Q ss_pred CcccCCCCCCcccccc Q lcl|NC_013644. 480 EEETAVNPDDPTQQMA 495 (510) Q Consensus 480 ~~~~~~~~~~~~~~~~ 495 (510) ..+|++.++.+ T Consensus 382 -----~~~Gd~~~p~p 392 (392) T protein:vir:39 382 -----KTTGQSNEPVP 392 (392) T ss_pred -----CCCCCCCCCCC Confidence 11111112111 No 206 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=97.09 E-value=0.00017 Score=41.19 Aligned_cols=377 Identities=10% Similarity=0.035 Sum_probs=158.8 Q ss_pred HHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCCceec Q lcl|NC_013644. 15 ALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNPVEYE 94 (510) Q Consensus 15 ~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~ 94 (510) ++..+.+..+.............+.-...+..-. .......+..+... .-+.++-....|+..++-+.+-|+++. T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~v~~~----~al~~~~v~~~i~~ia~~ia~lp~~~~ 75 (392) T protein:vir:10 1 MILPILNFINQTNDPPEVGSVQSYFPDGNDAQIM-ESLLGDNNEWVSAR----AALRNSDLFSIILQLSSDLAIVKINAE 75 (392) T ss_pred CcchhhhhhhcccccccccccccccccCchhhhh-hhhcCCCCceechH----HhhccHHHHHHHHHHHHhhccCceeec Confidence 1111111111000000000000000000000000 00000000000000 001112233345555555555566543 Q ss_pred cCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCceeEEEEEEEEE Q lcl|NC_013644. 95 TENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNELQRICRHYITEI 172 (510) Q Consensus 95 ~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~ 172 (510) - .....++.+=.. .........+....+.+|.||+++..|.+|++ .+.+++|..+-+..+..+.. .+|.+.. T Consensus 76 ~--~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~----~~y~~~~ 149 (392) T protein:vir:10 76 K--KKNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENG----MYYNITF 149 (392) T ss_pred c--chhhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCce----EEEEEEe Confidence 2 222222221111 12244556677889999999999988988886 68888999887776543221 0111111 Q ss_pred eeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCCcHHH Q lcl|NC_013644. 173 EKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTDLKP 252 (510) Q Consensus 173 ~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~ 252 (510) ..... .....+.++.+.+++.-. ......|.|-+.. T Consensus 150 ~~~~~----~~~~~~~~~eiih~~~~~----------------------------------------~~~~~~G~s~i~~ 185 (392) T protein:vir:10 150 DDPKI----EPILQAPQSDLIHMKLLS----------------------------------------IDGGKTGISPLYS 185 (392) T ss_pred cCccc----ceeEEEccccEEEecCCC----------------------------------------CCCccccccHHHH Confidence 11100 001122233333332100 0001236676766 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccceeEE--ecCCCCchh---hhhHhh----hcCeeeeccCCCceeEEeecCCHHHHH Q lcl|NC_013644. 253 IKALIDDYDLMNCFLSNNLQDFAEAIYVV--SGFQGDDLS---KLRQNV----KSKKVVGTGSDGGLDVKTVTIPTEGRK 323 (510) Q Consensus 253 v~~liD~~n~~~S~~~~~~~~~~~~~lv~--~g~~~~~~~---~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~ 323 (510) +...++....+..-..+.++..+.|-.++ .+....+.. .+.+.. ..++++.++++.+++.+........+. T Consensus 186 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~ 265 (392) T protein:vir:10 186 LRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLL 265 (392) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHH Confidence 66666554444444445555556665544 332111111 111111 123456666665555555444455667 Q ss_pred HHHHHHHHHHHHHhCCccccccccCc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceee Q lcl|NC_013644. 324 TKMEIDKENIYKFGMAFDSTQVGDGN-ITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVS 402 (510) Q Consensus 324 ~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~ 402 (510) +..+..++.|+..=++|+.-.+..+. .|... ..+..+...|.-.++.|...+..+-... ++ T Consensus 266 e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~-------------~~~~f~~~~l~P~~~~ie~~l~~~L~~~-----~~ 327 (392) T protein:vir:10 266 SQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQ-------------QISGMYASALNRYLRPAISELEYKLSDH-----IS 327 (392) T ss_pred HHHHHHHHHHHHHhCCCHHHhCCCCCcccHHH-------------HHHHHHHHHHHHHHHHHHHHHHHhcccc-----cc Confidence 77788888898888888754433222 12111 1112334445554444444443322111 22 Q ss_pred EEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhC---CCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCC Q lcl|NC_013644. 403 FTFTREVMVNETDIVNDEKTEAETRKIILESILQVA---PRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTD 479 (510) Q Consensus 403 i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~---~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~ 479 (510) +.+..-+-.|..+.++.+.++..+|+++...+.+++ ++..+ |.-+ ..... . T Consensus 328 ~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~-e~r~---------------~e~l~---~------- 381 (392) T protein:vir:10 328 VNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPK-DLPA---------------PENTN---K------- 381 (392) T ss_pred ccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCcc-ccch---------------hcCCC---C------- Confidence 222233345667788888899999999987776644 44322 1100 00000 0 Q ss_pred CcccCCCCCCcccccc Q lcl|NC_013644. 480 EEETAVNPDDPTQQMA 495 (510) Q Consensus 480 ~~~~~~~~~~~~~~~~ 495 (510) ..+|++.++.+ T Consensus 382 -----~~~Gd~~~p~p 392 (392) T protein:vir:10 382 -----KTTGQSNEPVP 392 (392) T ss_pred -----CCCCCCCCCCC Confidence 11111112111 No 207 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=97.08 E-value=0.00018 Score=41.12 Aligned_cols=431 Identities=9% Similarity=0.037 Sum_probs=169.8 Q ss_pred CCCc------cCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccch Q lcl|NC_013644. 1 MEAL------LSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGF 74 (510) Q Consensus 1 ~~~~------~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~ 74 (510) +... ...-.+..+- ....+ + ..--++.+ .+|.+.. +..++....-. -++- T Consensus 69 ~~~~~~~~~~~~~~~~~~~~--~~~~~-~----~~~~~~~l-~~~~~~~-F~Gy~~la~la---------------Q~~e 124 (695) T protein:vir:36 69 LARQFEVDVSNYTPRERRAA--SYALD-F----NGTSMDAL-SFVTSSG-FPGFPTLVLLA---------------QLPE 124 (695) T ss_pred cceeceecccccCccccchh--hhhhc-c----cccccccc-hhhhccC-cchHHHHHHHh---------------hccc Confidence 0000 0000000000 00000 0 00001111 1222211 00000000000 0011 Q ss_pred hHHHHHHHHhhhhcCC---------------cee----cc-CcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEE Q lcl|NC_013644. 75 FPEIVDQKTQYLLSNP---------------VEY----ET-ENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYA 133 (510) Q Consensus 75 ~~~Iv~~~~~~l~g~p---------------~~~----~~-~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v 133 (510) .+.++.+.+..+.-+- +.. .. .+.+..+.|..-++ =++...+.++.+++-.||.+..++ T Consensus 125 yr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i 204 (695) T protein:vir:36 125 YRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYF 204 (695) T ss_pred hhhHHHHHHHHhhcccceecccchhhhhhccccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEE Confidence 1111122222111111 111 11 12233444544443 356788999999999999988777 Q ss_pred EECCCCc-----------------e-EEEEEcccceEEEEcCCCCce-eEEE---EEEEEEeeCCceeEEEEEEEEcCCc Q lcl|NC_013644. 134 RTNAEDR-----------------L-CFQVADSLNVFGVYNEYNELQ-RICR---HYITEIEKDGETVDIHHAEVWTDQN 191 (510) Q Consensus 134 ~~d~~g~-----------------~-~i~~~~p~~~~~~~d~~~~~~-~~~~---~~~~~~~~~~~~~~~~~~e~y~~~~ 191 (510) -++.++. + -+.+++|.++.|-.-+..++. +-+. +|++. +. ++ ...+ T Consensus 205 ~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~----G~-------kI-H~SR 272 (695) T protein:vir:36 205 KIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMI----GT-------EV-HATR 272 (695) T ss_pred EeccCccccccccccccccccCcceeeeEeecccccccchhhhccchhhccCCCceEEEe----ce-------EE-eeee Confidence 6655431 1 166677777777321111111 0000 01100 00 00 0111 Q ss_pred EEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEE-ecCCCCCCCcHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 192 VYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYR-LSNNKQETTDLKPIKALIDDYDLMNCFLSNN 270 (510) Q Consensus 192 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~-~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~ 270 (510) ...|... .+|-+. -.++-.|.|...-+.+-+++.+.+.-..+.. T Consensus 273 L~~f~g~-----------------------------------plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~L 317 (695) T protein:vir:36 273 LHTIVSR-----------------------------------PVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDI 317 (695) T ss_pred EEEecCC-----------------------------------CchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHH Confidence 1111100 011000 0012346677777777777777765555555 Q ss_pred HHHhccceeEE---ec-CCCCchh-----hhhHhhh-cCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_013644. 271 LQDFAEAIYVV---SG-FQGDDLS-----KLRQNVK-SKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAF 340 (510) Q Consensus 271 ~~~~~~~~lv~---~g-~~~~~~~-----~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p 340 (510) +..+....+.. .. ..+.+.. +.....+ .++++.+++ ++=+|.+.+.+...+...+....+.|...+++| T Consensus 318 i~~~~v~~lk~dla~aL~~g~~~~l~~R~eli~~~Rsn~G~~llDk-~~Eefeq~stslSGLddVi~qf~q~VAgaa~IP 396 (695) T protein:vir:36 318 VKQFSVSGILMDLAQALMPGANVDLSMRAELINRYRDNRNILFLDK-ATEEFFQFNTPLSGLDALQAQAQEQMSAVSHIP 396 (695) T ss_pred HHhhhHHHHHHHHHHhhcChhHHHHHHHHHHHHHhcCccceEEEec-CCcceEEEecccCCHHHHHHHHHHHHHhhhcCc Confidence 53322222110 00 0111111 1111122 344555653 223677778899999999999999999999999 Q ss_pred ccccccc---C-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHH Q lcl|NC_013644. 341 DSTQVGD---G-NITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDI 416 (510) Q Consensus 341 ~~~~~~~---g-~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~ 416 (510) -+-+-+. | |+||..=...|...+. ...++.++..|++++.+|.. +..+.. ++ +++++|++-..-+++|+ T Consensus 397 ltkLfGqSPkGlNATGE~D~rnYYD~I~--s~Qe~~L~p~L~rl~~ii~r--S~~G~i--dp-di~~~fnPL~qmtd~Ek 469 (695) T protein:vir:36 397 LIKLLGITPTGLNASSEGEIRVWYDYVR--AYQRNALQQLMNDVIVMIQL--SLFGAV--DP-SIKWQWNALRELDDLEV 469 (695) T ss_pred hhhhhccCcccccccchhhHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH--HhcCCC--CC-cceEEeCCCCCcCHHHH Confidence 7643321 2 5788875555665555 44578899999998877643 333332 33 68899999988888888 Q ss_pred HHHH-------HHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCC-CCC Q lcl|NC_013644. 417 VNDE-------KTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAV-NPD 488 (510) Q Consensus 417 ~~~~-------~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 488 (510) |+.. +.....|+|+...+..++-.-.+----..++...+--.....+.....+.......+++...+++ ..| T Consensus 470 AeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 549 (695) T protein:vir:36 470 AESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARAG 549 (695) T ss_pred HHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCccccc Confidence 8763 34445677766555554311000000000000000000000000000000000000010000000 011 Q ss_pred CcccccccCccccc-----ccc---cCCCC Q lcl|NC_013644. 489 DPTQQMAEGATGST-----ESQ---LPENG 510 (510) Q Consensus 489 ~~~~~~~~~~~~~~-----~~~---~~~~~ 510 (510) .+.-+.....-.+. +.| ++.-| T Consensus 550 ~~~~~~v~~~~~~~~~~~ag~~~~~~~aag 579 (695) T protein:vir:36 550 ATAPPTVANVNANVNPREAGAQDAAMRAAG 579 (695) T ss_pred ccCCCcccccccccCccccCCCCccceeeE Confidence 11000000000000 000 11111 No 208 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=97.04 E-value=0.0002 Score=40.88 Aligned_cols=431 Identities=10% Similarity=0.035 Sum_probs=179.0 Q ss_pred hHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhc--C Q lcl|NC_013644. 12 IANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLS--N 89 (510) Q Consensus 12 ~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g--~ 89 (510) ..+.+++..++.+.+.--.+...+.+|..-. . .....+. ....-.++..+-+..-+++.++.|++ - T Consensus 1 mk~~~~~~~~~lkr~~~e~~w~e~a~~tlP~--~-----~~~~~~~-----~~~~~~~~~dstg~~a~~~LAa~l~~~lt 68 (510) T protein:vir:78 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY--L-----MVDPMSG-----SRGVVEHDFQSAGALLVNNLAAKLARSLF 68 (510) T ss_pred ChhHHHHHHHHHhccchHHHHHHHHHhhccc--c-----ccCCCCc-----ccccccCcccchHHHHHHHHHHHHHHhhc Confidence 3344455544444222223344444443321 0 0001100 00111234455566666666666654 2 Q ss_pred Cce-----eccCcH-------------HHHHH-------HHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEE Q lcl|NC_013644. 90 PVE-----YETENE-------------ELKEY-------LAEYY-NSEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCF 143 (510) Q Consensus 90 p~~-----~~~~d~-------------~~~~~-------l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i 143 (510) |+. +...++ ++.+. +...+ .+||...+.++.++...+|.+.++ .++++. ++ T Consensus 69 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~--~~~~~~-~~ 145 (510) T protein:vir:78 69 PTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLY--RNSDEA-TV 145 (510) T ss_pred CCCCcccccCCChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEE--EeCCCC-eE Confidence 322 232221 12222 22222 468888999999999999998654 454443 45 Q ss_pred EEEcccceEEEEcCCCCceeEEEEEEEEEe------------eCCceeEEEEEEEEcCCcEEEEEEcC-Cceeecccccc Q lcl|NC_013644. 144 QVADSLNVFGVYNEYNELQRICRHYITEIE------------KDGETVDIHHAEVWTDQNVYFFVAED-NKDYELDEAEP 210 (510) Q Consensus 144 ~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~e~y~~~~i~~~~~~~-~~~~~~~~~~~ 210 (510) +.++-.+++..-|..+++..+++-+..... .......-..+++|+. .+.... +... T Consensus 146 ~~~pl~~y~v~~d~~G~vd~i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~----V~~~~~~~~~~------- 214 (510) T protein:vir:78 146 VAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTH----VQRRKGTAMDY------- 214 (510) T ss_pred EEEEcceeEEeeCCCcCeeEEEeeeeccHHHHHHHhhHHhhhhhhccCCCceEEEEEE----EEeecCCCCcE------- Confidence 566666666666777777777665544310 0001111122333321 011110 0000 Q ss_pred cccccccccccccc-cccccccCCcccEEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecC Q lcl|NC_013644. 211 INPRPHVLAVDSEN-ESLLQRSYGQIPFYRLS-----NNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGF 284 (510) Q Consensus 211 ~~~~~~~~~~~~~~-~~~~~~~~g~iPvv~~~-----nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~ 284 (510) .......++.. ......++..+|++.++ .+.+|+|-.++..+-+..+|.+.-...........|.+.+.-. T Consensus 215 ---~sv~~e~dg~~i~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~ 291 (510) T protein:vir:78 215 ---AEMYHEIDGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEA 291 (510) T ss_pred ---EEEEEEecCeeeccccccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCCc Confidence 00000011111 11122345567777654 3467999888999999999987766666666656655443211 Q ss_pred CCCchhhhhHhhhcCeeeeccCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHH Q lcl|NC_013644. 285 QGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLL 362 (510) Q Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l 362 (510) +...+..+.. ...+.+..+..++++.+... .+.......++.++..|-..- ..+...-.....|++.+..+ T Consensus 292 g~~~~~~l~~--~~~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF-~~~l~~~~~~rvTAtEV~~r---- 364 (510) T protein:vir:78 292 KGAVVDDYQD--AEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF-MYGANQRDAERVTAEEVRIT---- 364 (510) T ss_pred cccchhhhcc--CCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHH-hhccccCCCCCcCHHHHHHH---- Confidence 1122222211 12234444445566765533 456666777777777775532 12222222334566665553 Q ss_pred HHHHHHHHHHHHHHHHH--------HHHHHHHHHhhccCCcccc---ceeeEEeCCCCCCCHH--HHHHHHHHHHhcCCC Q lcl|NC_013644. 363 NMKANKTEARLRALLEW--------MNKLVIDDINRRYTKAFDP---TEVSFTFTREVMVNET--DIVNDEKTEAETRKI 429 (510) Q Consensus 363 ~~k~~~k~~~~~~~l~~--------~~~~i~~~~~~~~~~~~~~---~~v~i~f~~~~p~d~~--e~~~~~~~~~~~g~i 429 (510) +.++...++..+.+ +++..+.++...+.-+... ....|++..++-+... .....++.+...+.+ T Consensus 365 ---~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl~p~p~~~~~~~~v~~is~Laraq~~~~l~~~~q~l~~~~~~ 441 (510) T protein:vir:78 365 ---AEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPI 441 (510) T ss_pred ---HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccceeeecccHHHHHHHHHHHHHHHHHHHHhcCh Confidence 34444444444433 3444444443333221112 2233445444433211 111111112222221 Q ss_pred c-------hHHHHHhC---CCCC-------cHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCccc Q lcl|NC_013644. 430 I-------LESILQVA---PRLD-------DDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQ 492 (510) Q Consensus 430 S-------~et~~~~~---~~v~-------d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (510) . ...++..+ -+|+ ++|.+++.+++.+.+......++.....-++..+ ... T Consensus 442 ~q~~~~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~~~~~q~~~~~~~~~a~~~~~~~~~~--------~~~----- 508 (510) T protein:vir:78 442 AQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTN--------ALA----- 508 (510) T ss_pred hhhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcc--------cCC----- Confidence 1 22222221 1222 2333332222221111111111110000000000 000 Q ss_pred ccccCc Q lcl|NC_013644. 493 QMAEGA 498 (510) Q Consensus 493 ~~~~~~ 498 (510) |+ T Consensus 509 ----g~ 510 (510) T protein:vir:78 509 ----GV 510 (510) T ss_pred ----CC Confidence 00 No 209 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=97.02 E-value=0.00021 Score=40.78 Aligned_cols=396 Identities=7% Similarity=-0.015 Sum_probs=186.6 Q ss_pred CCCccCCCh-h-----hhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccch Q lcl|NC_013644. 1 MEALLSEDV-K-----IIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGF 74 (510) Q Consensus 1 ~~~~~~~~~-~-----~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~ 74 (510) |.--+.... . ...+.+...|...+....+. . .--+..--..|+.... ..-...+. . ..... T Consensus 1 m~~~i~~~~g~p~~~~~~~~~~~~~ia~~~~~~~~~--~-~~~~~~~~~~iLr~~~----~~~~~y~~-----m-~~D~~ 67 (491) T protein:vir:10 1 MSKGLWVSPTEFVTFGEPDKSLSSQIATRARSIDFF--A-LGMYLPNPDPVLKALG----KDIRVYRE-----L-RADAH 67 (491) T ss_pred CCCceeCCCCCccCcccCChHHHHHHHhhhcccccc--c-ccCCccchHHHHHhcC----CCHHHHHH-----H-hhChH Confidence 443322111 0 01111222222111000000 0 0000000011221100 00000000 0 12455 Q ss_pred hHHHHHHHHhhhhcCCceecc--CcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeE-EEEEEECCCCce---EEEEEc Q lcl|NC_013644. 75 FPEIVDQKTQYLLSNPVEYET--ENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFE-YVYARTNAEDRL---CFQVAD 147 (510) Q Consensus 75 ~~~Iv~~~~~~l~g~p~~~~~--~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~-~~~v~~d~~g~~---~i~~~~ 147 (510) ..-.+++...-++|.+..+.+ +++...+++.++++ -+|.+.+.++. ++..+|.+ ++++|.-.+|.. ++.+++ T Consensus 68 i~s~l~~Rk~av~~~~w~i~~~~~~~~~~e~v~e~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~ 146 (491) T protein:vir:10 68 VGGCVRRRKAAVKALEWGLDRGKAKSRVAKSIADVFADLDLSRIVTEML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKP 146 (491) T ss_pred HHHHHHHHHHHHhCCCcEEecCCCCHHHHHHHHHHHhcCCHHHHHHHHH-HhhhhcceeEEEEEeecCCeeEEEEeeeec Confidence 666777777778899988864 34556678888875 36777776664 78889975 556675445554 345555 Q ss_pred ccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccccccccccccc Q lcl|NC_013644. 148 SLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESL 227 (510) Q Consensus 148 p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (510) ++.+. ||..+.+ +++..++... .. T Consensus 147 ~~~f~--~d~~~~l--------------------------------~~~~~~~~~~----------------------g~ 170 (491) T protein:vir:10 147 ADWFV--YDPENQL--------------------------------RFRSKDHWMQ----------------------GE 170 (491) T ss_pred cccee--eccCCce--------------------------------EEecCCCCCC----------------------cc Confidence 54442 3322211 1110000000 00 Q ss_pred ccccCCcccEEEe--cCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhh------hhHhhhcC Q lcl|NC_013644. 228 LQRSYGQIPFYRL--SNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSK------LRQNVKSK 299 (510) Q Consensus 228 ~~~~~g~iPvv~~--~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~------~~~~~~~~ 299 (510) .-.+++.|-..+- ..++.|.|.+..+-...---+..+.+++..++.++.|+++.+--.+.+..+ ....+... T Consensus 171 ~l~~~k~i~~~~~~~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~l~~al~~~~~~ 250 (491) T protein:vir:10 171 ELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSASDGEKNLLLDCLEDMVQD 250 (491) T ss_pred eecCCCEEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecCCCCCHHHHHHHHHHHHHHhcC Confidence 0011122211111 124668888888777776777788999999999999999877433222222 12334455 Q ss_pred eeeeccCCCceeEEeec---CCHHHHHHHHHHHHHHHHHHh--CCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 300 KVVGTGSDGGLDVKTVT---IPTEGRKTKMEIDKENIYKFG--MAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLR 374 (510) Q Consensus 300 ~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~l~~~i~~~s--~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~ 374 (510) ....++.+.++++++.. .+...++..++.+.+.|...- +|... ..++|.+.|..-.-.. ...+..-.+... T Consensus 251 a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt-~~~gs~a~~~vh~~v~---~di~~~D~~~i~ 326 (491) T protein:vir:10 251 AVAVVPDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIALLGQNQTT-EATSTRASAQAGLEVT---DDIRDGDKAVVS 326 (491) T ss_pred cEEEecCCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHHhhhhccc-CcccchhHHHHHHHHH---HHHHHHHHHHHH Confidence 67778999999999764 235678888999988887653 33332 2233333333322222 222333345666 Q ss_pred HHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCC-CchHHHHHhCCCCCcHHHHHHHHHH Q lcl|NC_013644. 375 ALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRK-IILESILQVAPRLDDDNVLRLICEQ 453 (510) Q Consensus 375 ~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~-iS~et~~~~~~~v~d~e~~~~~~e~ 453 (510) ..+.++++-++.+- +.. ...+.+.|... +.+....++.+.++...|+ ++.+.+.+.++.-.....+. T Consensus 327 ~tln~li~~l~~~N---~~~---~~~p~f~~~~~-~e~~~~~a~~~~~L~~~G~~i~~~~i~e~~Gip~~~~~~~----- 394 (491) T protein:vir:10 327 EAMNMLIRWICDLN---FDG---ADRPVFDMWEQ-EQVDEIQAGRDQKLTQAGARFTPAYFKRAYNLQDGDLDER----- 394 (491) T ss_pred HHHHHHHHHHHHhc---CCC---CCcceEEecCc-CchhHHHHHHHHHHHhCCCcCCHHHHHHHhCCCCCCcCcc----- Confidence 67777666555432 221 11245666543 3444678899999999997 67777777775422111000 Q ss_pred HHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 454 FDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 454 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) . .+.... ........ ....+..+..++.- T Consensus 395 ----------~---~~~~~~-~~~~~~~~--------------~~~~~~~~~~~d~~ 423 (491) T protein:vir:10 395 ----------P---LPVSAV-DTVGAASF--------------AEFEAPDQDALDAA 423 (491) T ss_pred ----------c---cccCCC-CCcccccc--------------cccCCCCCCchHHH Confidence 0 000000 00000000 00000011111111 No 210 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=96.89 E-value=0.00028 Score=40.10 Aligned_cols=416 Identities=11% Similarity=-0.015 Sum_probs=186.2 Q ss_pred CCCccCCCh------------hhhHHHHHHHHHhhhhh-hhHHHHHHHHHHhccCCcchhcccceecccccccccccccc Q lcl|NC_013644. 1 MEALLSEDV------------KIIANALKAAIDKDRKS-SSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASN 67 (510) Q Consensus 1 ~~~~~~~~~------------~~~~~~i~~~i~~~~~~-~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~ 67 (510) |-.|+-..- ......+......|... ..+.++..+-+--.+. ++..+...+ ++-. T Consensus 1 m~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~g-d~~~~~~L~--------~dm~--- 68 (512) T protein:vir:19 1 MGRILDISGQPFDFDDEMQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAERG-DLTAQADLA--------FDME--- 68 (512) T ss_pred CcceeCCCCCccccccccccccchhcccchhhccccccCCCHHHHHHHHHHhhCC-CHHHHHHHH--------HHHH--- Confidence 332221111 11111111111111110 1112221111111111 111100000 0000 Q ss_pred ceeccchhHHHHHHHHhhhhcCCceeccC------cHHHHHHHHHHhcc--CHHHHHHHHHHHHHhcCeE-EEEEEECCC Q lcl|NC_013644. 68 VRIPHGFFPEIVDQKTQYLLSNPVEYETE------NEELKEYLAEYYNS--EFQVVLQELVEGSSQKGFE-YVYARTNAE 138 (510) Q Consensus 68 ~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~------d~~~~~~l~~~~~n--~~~~~~~e~~~~~~~~G~~-~~~v~~d~~ 138 (510) .......-.+.+...-+.|.+..+.+. +....+++++++.+ +|.+.+..+ .++..+|.+ .+++|.-.+ T Consensus 69 --~~D~hi~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~~~~l-ldA~~~G~s~~Ei~w~~~~ 145 (512) T protein:vir:19 69 --EKDTHLFSELSKRRLAIQALEWRIAPARDASAQEKKDADMLNEYLHDAAWFEDALFDA-GDAILKGYSMQEIEWGWLG 145 (512) T ss_pred --hhChHHHHHHHHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcCCCHHHHHHHH-HhhhhhcceeeeeEeeeeC Confidence 013445566666667788888887532 23455678888853 567666665 468888864 556665334 Q ss_pred Cce---EEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccc Q lcl|NC_013644. 139 DRL---CFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRP 215 (510) Q Consensus 139 g~~---~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 215 (510) |.. ++.+++|+.+. |+..+... ++.. ++.... ..+ T Consensus 146 g~~~~~~~~~r~~~~f~--~~~~~~~~--lr~~-----~~~~~G----~~l----------------------------- 183 (512) T protein:vir:19 146 KMRVPVALHHRDPALFC--ANPDNLNE--LRLR-----DASYHG----LEL----------------------------- 183 (512) T ss_pred Cceeeeeeeeeccccce--eccCCCcE--EEec-----CCCCCc----eee----------------------------- Confidence 433 34555554332 22211100 0000 000000 000 Q ss_pred ccccccccccccccccCCcccEEEe--cCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhh-- Q lcl|NC_013644. 216 HVLAVDSENESLLQRSYGQIPFYRL--SNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSK-- 291 (510) Q Consensus 216 ~~~~~~~~~~~~~~~~~g~iPvv~~--~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~-- 291 (510) .+++.|-.++- ..++.|.|.+..+-...--=+..+.+++..++.++.|+++.+=..+....+ T Consensus 184 --------------~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~ 249 (512) T protein:vir:19 184 --------------QPFGWFMHRAKSRTGYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIYGLPMRVGKYPTGSTNREKA 249 (512) T ss_pred --------------cCCceEEEeccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeeEEecCCCCCHHHHH Confidence 11222222211 135678888887766666666778899999999999999876332222222 Q ss_pred ----hhHhhhcCeeeeccCCCceeEEeec-CCHHHHHHHHHHHHHHHHHHh--CCccccccccC-cccHHHHHHHHHHHH Q lcl|NC_013644. 292 ----LRQNVKSKKVVGTGSDGGLDVKTVT-IPTEGRKTKMEIDKENIYKFG--MAFDSTQVGDG-NITNIVIKARYTLLN 363 (510) Q Consensus 292 ----~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~l~~~i~~~s--~~p~~~~~~~g-~~Sg~Ai~~~~~~l~ 363 (510) ....+....+..++.+..+++++.. .....++..++.+.+.|...- +|-....+.+| ++.|..-. .-.. T Consensus 250 ~L~~al~~~~~~a~~iiP~~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~iLGqtlTs~~g~~Gs~a~~~vh~---ev~~ 326 (512) T protein:vir:19 250 TLMQAVMDIGRRAGGIIPMGMTLDFQSAADGQSDPFMAMIGWAEKAISKAILGGTLTTEAGDKGARSLGEVHD---EVRR 326 (512) T ss_pred HHHHHHHHHhhCcEEEecCCceEEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHHH---HHHH Confidence 1233445567778899999999854 455678999999999888753 33333322222 23222221 2223 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCC-CchHHHHHhCCCC Q lcl|NC_013644. 364 MKANKTEARLRALLE-WMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRK-IILESILQVAPRL 441 (510) Q Consensus 364 ~k~~~k~~~~~~~l~-~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~-iS~et~~~~~~~v 441 (510) ..+..-.+.+...|. ++++-++.+ +.....+ ...-..++|...-+.|....++.+.++. .|+ +|.+.+.+.++.- T Consensus 327 di~~aDa~~i~~tln~~li~~l~~~-N~~~~~~-~~~~p~~~f~~~e~eDl~~~a~~~~~l~-~G~~i~~~~i~e~~Gip 403 (512) T protein:vir:19 327 EIRNADVGQLARSINRDLIYPLLAL-NSDSTID-INRLPGIVFDTSEAGDITALSDAIPKLA-AGMRIPVSWIQEKLHIP 403 (512) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh-CCCCCCC-ccccceEEecCCChhhHHHHHHHHHHHh-cCCCCCHHHHHHHhCCC Confidence 333444455666664 466655532 2221111 1224678899999999999999888876 565 7888888887642 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCccccc--ccc------cC------ Q lcl|NC_013644. 442 DDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGST--ESQ------LP------ 507 (510) Q Consensus 442 ~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~------~~------ 507 (510) ...+.+. .....+.... ..............+.+...+...... .++ .| T Consensus 404 ~~~~~e~---------------~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~i~~~~~ 466 (512) T protein:vir:19 404 QPVGDEA---------------VFTIQPVVPD--NGSQKEAALSAEDIPQEDDIDRMGVSPEDWQRSVDPLLKPVIFSVL 466 (512) T ss_pred CCCCccc---------------cccCCCcccc--ccccccccccccCCCchhhHhHHhhhHHHHHHHHHHHHHHHHHHHH Confidence 2111000 0000000000 000000000000000000000000000 000 00 Q ss_pred CCC Q lcl|NC_013644. 508 ENG 510 (510) Q Consensus 508 ~~~ 510 (510) ... T Consensus 467 ~~s 469 (512) T protein:vir:19 467 KDG 469 (512) T ss_pred hCC Confidence 000 No 211 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=96.86 E-value=0.00029 Score=39.94 Aligned_cols=368 Identities=10% Similarity=0.025 Sum_probs=148.0 Q ss_pred hhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhc Q lcl|NC_013644. 9 VKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLS 88 (510) Q Consensus 9 ~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g 88 (510) +-+. . . +..++...+. .++......... ......+..+ .+..-+...-....|+..++-+.+ T Consensus 1 Mg~~-~---~-~~~~~~~~~~-------~~~~~~~~~~~~--~~~~~~~~~v----~~~~al~~~~v~~~i~~ia~~ia~ 62 (385) T protein:vir:10 1 MGLL-T---P-RNFNKRKAKN-------MVYPSNPAFFTT--TVGGMQLSYV----SALSALQNTNVYSVINRIASDVAS 62 (385) T ss_pred Cccc-c---c-hhcccccccc-------cccccchhhhhh--hccccCcccc----CHHHhhccHHHHHHHHHHHHHHhh Confidence 0000 0 0 0000000000 000000000000 0000000000 000001122334456666666666 Q ss_pred CCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCCCceeEEEE Q lcl|NC_013644. 89 NPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYNELQRICRH 167 (510) Q Consensus 89 ~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~~~~~~~~~ 167 (510) -|+++. +......+++=.. .........+......+|.||+++..+. ..+.++++..+.+..+... .+ T Consensus 63 ~p~~v~--~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~~---~~~~p~~~~~v~~~~~~~~------~~ 131 (385) T protein:vir:10 63 AHFKTE--NTATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN---LEHIPNSDVQINYLPGNMG------IV 131 (385) T ss_pred Cceeee--ccchhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc---eeEeecCCceEEEEEcCCc------eE Confidence 677653 2222223321111 1223445556778888999998876542 3344444444444333211 01 Q ss_pred EEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCC Q lcl|NC_013644. 168 YITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQET 247 (510) Q Consensus 168 ~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~ 247 (510) |++....++. ...+.+..+.+++.-. .|. .+...|. T Consensus 132 ~~~~~~~~~~------~~~~~~~eiihik~~~-----------------------------------~~~---~~~~~G~ 167 (385) T protein:vir:10 132 YTVLESNDRP------QMVLRQDQMLHFRLMP-----------------------------------DPQ---YRYLIGR 167 (385) T ss_pred EEEEEcCCce------EEEEccccEEEeccCC-----------------------------------CCc---ccccccc Confidence 1111111110 0112222333322100 000 0112466 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCC-CCch---hhhhHhhh-------cCeeeeccCCCceeEEeec Q lcl|NC_013644. 248 TDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQ-GDDL---SKLRQNVK-------SKKVVGTGSDGGLDVKTVT 316 (510) Q Consensus 248 sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~-~~~~---~~~~~~~~-------~~~~~~~~~~~~~~~~~~~ 316 (510) |.+..+...++....+..-..+.+...+.|-.+++-.. ..+. ..+...++ .++++.++++.+++.++.. T Consensus 168 s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~ 247 (385) T protein:vir:10 168 SPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGRLMVLPDGFDYTQLEMK 247 (385) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCccccCCCceEEecCCC Confidence 77776666666555555445555666666666654322 1111 11211111 2235556665555555443 Q ss_pred CCHHH-HHHHHHHHHHHHHHHhCCccccccc--cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_013644. 317 IPTEG-RKTKMEIDKENIYKFGMAFDSTQVG--DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYT 393 (510) Q Consensus 317 ~~~~~-~~~~~~~l~~~i~~~s~~p~~~~~~--~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~ 393 (510) ..... +.+..+...+.|...-++|+.-.+. .++.++..++.. ... |...|.-.++.|...+..+-- T Consensus 248 ~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~-~~~----------~~~~l~P~~~~ie~~l~~~l~ 316 (385) T protein:vir:10 248 TDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQI-KAT----------YLANLNSYVNPIVDELRLKMN 316 (385) T ss_pred hhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHH-HHH----------HHHHHHHHHHHHHHHHHHhhC Confidence 22222 2456677788898888888754322 233322222211 111 112233333333333332211 Q ss_pred CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|NC_013644. 394 KAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALEEAEYT 471 (510) Q Consensus 394 ~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~ 471 (510) . ..+++.+..-+..|..+.++.+.++.++|+++.-.+++.++. +.+..... .... . T Consensus 317 ~----~~~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~p~~~~~~------------~~~~------~ 374 (385) T protein:vir:10 317 A----PDLELDIKDMLDVDDSALINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLPE------------FKPL------T 374 (385) T ss_pred C----ceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCccCCCCCcc------------ccCc------c Confidence 1 135566667778899999999999999999988777665432 11110000 0000 0 Q ss_pred CCCCCCCCCcccCCCCCC Q lcl|NC_013644. 472 KGLSDNTDEEETAVNPDD 489 (510) Q Consensus 472 ~~~~~~~~~~~~~~~~~~ 489 (510) .. -+.+.++++ T Consensus 375 ~~-------~~~g~~~dn 385 (385) T protein:vir:10 375 TQ-------VKGGDEGDN 385 (385) T ss_pred cc-------cCCCCCCCC Confidence 00 000001110 No 212 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=96.81 E-value=0.00032 Score=39.73 Aligned_cols=181 Identities=15% Similarity=0.129 Sum_probs=84.5 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhH------hhhc-CeeeeccCCCceeEEeecCCHHH Q lcl|NC_013644. 249 DLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQ------NVKS-KKVVGTGSDGGLDVKTVTIPTEG 321 (510) Q Consensus 249 d~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~------~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 321 (510) -|. +..|.+. +. .+...... ..+. ...+.+.++. -+|-+.+.+... T Consensus 1 V~k-~~~l~~~-----------~~--------------~~~~~~~~r~~~~~~~~~~~~~~~ld~~~-e~~e~~~~~lsG 53 (201) T protein:vir:10 1 MWK-AKGLADL-----------CD--------------DSDGAARLRLAQVDNNSGVGQAIGIDADS-EEYNVLNSDIGG 53 (201) T ss_pred Ccc-chHHHHH-----------hc--------------CChHHHHHHHHHHHHhhhhhhhheeecCC-cceeeeecCcCC Confidence 000 0011111 10 00001111 0111 1122233322 246566678889 Q ss_pred HHHHHHHHHHHHHHHhCCccccccc---cC-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccc Q lcl|NC_013644. 322 RKTKMEIDKENIYKFGMAFDSTQVG---DG-NITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFD 397 (510) Q Consensus 322 ~~~~~~~l~~~i~~~s~~p~~~~~~---~g-~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~ 397 (510) +...+....+.|...+++|-+-.-+ .| |+||..=...|...+. ...++.+++.|++++++++. T Consensus 54 l~d~l~~~~~~iaa~s~iP~t~LfG~sp~Glnatge~d~~nyyd~i~--~~Qe~~l~p~le~l~~~~~~----------- 120 (201) T protein:vir:10 54 IDTFLSQKFDRIVALSGIHEIILKGKNVGGVSASQNTALETFYGYVD--RKRKAELLPLLEFLLPFIVT----------- 120 (201) T ss_pred hHHHHHHHHHHHHhHhcCchhhhcCCCCccccccchhHHHHHHHHHH--HHHHHHHHHHHHHHHHhhcC----------- Confidence 9999999999999999999754322 12 4577654333443333 22247788888887775431 Q ss_pred cceeeEEeCCCCCCCHHHHHHHH-------HHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|NC_013644. 398 PTEVSFTFTREVMVNETDIVNDE-------KTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEY 470 (510) Q Consensus 398 ~~~v~i~f~~~~p~d~~e~~~~~-------~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~ 470 (510) ..+++|.|++-...+.+++|+.. .++.++|++|.+.+.+.|- ..+. T Consensus 121 ~~~~~~~f~pL~~~s~kekAei~~~~a~a~~~~~~~g~i~~~e~r~~L~---------------------------~~~~ 173 (201) T protein:vir:10 121 EQEWSVEFNPLSQVSDKDKSEILEKNVNSVAALIAAGIIDADEARDTLR---------------------------AIST 173 (201) T ss_pred CCCceEeeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHH---------------------------hcCC Confidence 24689999999999999888764 3444455555444443221 0110 Q ss_pred cCCCCCCC-CCcccCCCCCC-ccccccc Q lcl|NC_013644. 471 TKGLSDNT-DEEETAVNPDD-PTQQMAE 496 (510) Q Consensus 471 ~~~~~~~~-~~~~~~~~~~~-~~~~~~~ 496 (510) .+..+.+. +.+.......+ ++.++.. T Consensus 174 ~~~~~~~~~~~~~~~~e~~dp~~~~~~~ 201 (201) T protein:vir:10 174 EVKIGEGSIQTEVVINESEDPLDVSANN 201 (201) T ss_pred cCCCCCCCCCccccccccCCCCCCCCCC Confidence 11111100 00000000000 0001111 No 213 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=96.81 E-value=0.00032 Score=39.71 Aligned_cols=423 Identities=9% Similarity=0.008 Sum_probs=169.0 Q ss_pred CCCccC---CChhhhHHHHH-HHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhH Q lcl|NC_013644. 1 MEALLS---EDVKIIANALK-AAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFP 76 (510) Q Consensus 1 ~~~~~~---~~~~~~~~~i~-~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~ 76 (510) +....- ......+..-. ...+ + ..--++.+ .+|.+.. +..++....-. -++-.+ T Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~~-~----~~~~~~~l-~~~~~~~-F~Gy~~la~la---------------Q~~eyr 126 (698) T protein:vir:10 69 LARQFEVDVSNYTPRERRAASYALD-F----NGTSMDAL-SFVTSSG-FPGFPTLVLLA---------------QLPEYR 126 (698) T ss_pred ccccceeccccCCccccchhhhhhc-c----cccccccc-hhhhccC-cchHHHHHHHh---------------hccchh Confidence 000000 00000000000 0000 0 00001111 1222211 00000000000 001111 Q ss_pred HHHHHHHhhhhcCC---------------ce----ecc-CcHHHHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEE Q lcl|NC_013644. 77 EIVDQKTQYLLSNP---------------VE----YET-ENEELKEYLAEYY-NSEFQVVLQELVEGSSQKGFEYVYART 135 (510) Q Consensus 77 ~Iv~~~~~~l~g~p---------------~~----~~~-~d~~~~~~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~ 135 (510) .++.+.+..+.-+- +. ... .+.+..+.|..-+ +=++...+.++.+++-.||.+..++-+ T Consensus 127 ~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eai~~aRlfGGa~~~i~I 206 (698) T protein:vir:10 127 AMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKI 206 (698) T ss_pred hHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEEe Confidence 11222222111111 01 111 1223334444443 346778899999999999998777666 Q ss_pred CCCCc----e--------------EEEEEcccceEEEEcCCCCce-eEEE---EEEEEEeeCCceeEEEEEEEEcCCcEE Q lcl|NC_013644. 136 NAEDR----L--------------CFQVADSLNVFGVYNEYNELQ-RICR---HYITEIEKDGETVDIHHAEVWTDQNVY 193 (510) Q Consensus 136 d~~g~----~--------------~i~~~~p~~~~~~~d~~~~~~-~~~~---~~~~~~~~~~~~~~~~~~e~y~~~~i~ 193 (510) +.++. | -+.+++|.++.|-.-+..++. +-+. +|++. ++ ++ ...+.. T Consensus 207 ~gdd~~l~~PL~~~~~~I~kGslKGL~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~----G~-------~I-H~SRL~ 274 (698) T protein:vir:10 207 KGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMI----GS-------EV-HATRLH 274 (698) T ss_pred ecCccccccccccccccccCccceeeeeecccccccchhhhccchhhccCCCceEEEe----cc-------ee-cceeEE Confidence 54431 1 155666666666321111110 0000 00000 00 00 011111 Q ss_pred EEEEcCCceeecccccccccccccccccccccccccccCCcccEEE-ecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 194 FFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYR-LSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQ 272 (510) Q Consensus 194 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~-~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~ 272 (510) .|... .+|-.. -.++-.|.|....+.+-+++++.+.-..+..+. T Consensus 275 ~~vg~-----------------------------------pvpd~LKp~y~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li~ 319 (698) T protein:vir:10 275 TIVSR-----------------------------------PVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVK 319 (698) T ss_pred EecCC-----------------------------------CchhhhcchhccCCccHHHHHHHHHHHHHHHhhhHHHHHH Confidence 11100 011110 012234677777777777777776555555443 Q ss_pred HhccceeE-----EecCCCC-chh---hhhHhhh-cCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc Q lcl|NC_013644. 273 DFAEAIYV-----VSGFQGD-DLS---KLRQNVK-SKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDS 342 (510) Q Consensus 273 ~~~~~~lv-----~~g~~~~-~~~---~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~ 342 (510) .+....+. +.+.++. +.. +....++ ..+++.+++ ++=+|.+.+.+...+...+....+.|...+++|-+ T Consensus 320 ~~~~~~l~~dla~aL~~g~~~~l~~R~eli~~~Rsn~G~~llDk-~~Eefeq~st~lSGLddVi~qf~q~VAgaa~IPlt 398 (698) T protein:vir:10 320 QFSVSGILMDLAQALTPGANVDLSMRAELINRYRDNRNILFLDK-ATEEFFQFNTPLSGLDALQAQAQEQMSAVSHIPLI 398 (698) T ss_pred HhhHHHHHHHHHHhcCChhhHHHHHHHHHHHHhcCccceEEEec-CCcceEEEecCcCCHHHHHHHHHHHHHhhhcCchh Confidence 33222221 0011111 000 1111122 334555653 22366677889999999999999999999999976 Q ss_pred ccccc---C-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHH Q lcl|NC_013644. 343 TQVGD---G-NITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVN 418 (510) Q Consensus 343 ~~~~~---g-~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~ 418 (510) -.-+. | |+||.+=...|...+. ...++.++..|++++.+|.. +..+.. +. +++++|++-..-++.|+|+ T Consensus 399 kLfGqSPkGlNATGE~D~rnYYD~I~--s~Qe~~L~p~L~rl~~ii~r--S~~G~i--dp-~i~~~fnPL~qmtd~EkAe 471 (698) T protein:vir:10 399 KLLGITPTGLNASSEGEIRVWYDYVR--AYQRNALQQLMNDVIVMIQL--SLFGAV--DP-SIKWQWNALRELDDLEVAE 471 (698) T ss_pred hhhccCCcccCccchhhHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH--HhcCCC--CC-cceEEeCCCCCcCHHHHHH Confidence 43321 2 5788875555665554 44578899999998877643 333332 33 5889999999999999887 Q ss_pred HHH-------HHHhcCCCchHHHHHhCC------CC--CcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCccc Q lcl|NC_013644. 419 DEK-------TEAETRKIILESILQVAP------RL--DDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEET 483 (510) Q Consensus 419 ~~~-------~~~~~g~iS~et~~~~~~------~v--~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 483 (510) .-. .....|+|+...+..+|- +. -|.+-+=-..++... ............+.+.+.+.... T Consensus 472 I~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~d~~d~p~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~ 547 (698) T protein:vir:10 472 ARYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGAPADDDI----DGVLTYVQRMAEGGDTGAPTAPG 547 (698) T ss_pred HHhhhhHHHHHHHHhcCCCHHHHHHHHhccCCCccccccCCcccCCCCCCCcc----hHHHhhhcCCcCCCCcccccccc Confidence 633 233456665544444331 10 000000000000000 00000000000000111111111 Q ss_pred CCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 484 AVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) +..+|.+.-+-+.+ ...|.++ T Consensus 548 ~~~~~~~~~~~~~~------~~~~~~~ 568 (698) T protein:vir:10 548 GARAGATAPPAAAN------VNANANP 568 (698) T ss_pred cccCCCCCCccccc------ccCCCCc Confidence 11122211111111 1122222 No 214 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=96.80 E-value=0.00033 Score=39.68 Aligned_cols=420 Identities=10% Similarity=0.042 Sum_probs=166.5 Q ss_pred CCCccCCCh--hhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHH Q lcl|NC_013644. 1 MEALLSEDV--KIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEI 78 (510) Q Consensus 1 ~~~~~~~~~--~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~I 78 (510) +.+...... +-....-..+|.. |..+-.+++-. +-...| T Consensus 36 ~~~~~g~~~~~e~~~~~~~eLI~~---------YR~ma~~pEvd------------------------------~Av~eI 76 (537) T protein:vir:10 36 GGGYFGYSVDFDGTIRNDHELITR---------YREMVLNPECD------------------------------SAVDDV 76 (537) T ss_pred cccccccccccccccchHHHHHHH---------HHHHhhccchh------------------------------hHHHHh Confidence 111111110 1111111222221 22222222222 222334 Q ss_pred HHHHHhh-hhcCCceeccCc----HHHHHHHH----HHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC----CceEEE Q lcl|NC_013644. 79 VDQKTQY-LLSNPVEYETEN----EELKEYLA----EYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAE----DRLCFQ 144 (510) Q Consensus 79 v~~~~~~-l~g~p~~~~~~d----~~~~~~l~----~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~----g~~~i~ 144 (510) |+..+-+ ....||.+..++ +...+.|. .+++ =+|+.+.++..+.+.+.|+.|++..+|.+ |-..+. T Consensus 77 Vneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fhKiid~k~pk~GI~ELr 156 (537) T protein:vir:10 77 VNETICGNFDDVPISIDLHNLKQSEKIKKLIRSEFDEILRLLDFDNRAYEIFRRWYVDGRLFFHKVIDPKKPRQGLVELR 156 (537) T ss_pred hcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccccceeee Confidence 4433222 234556655443 33333333 3332 36788889999999999999999888744 667899 Q ss_pred EEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEE--EEEEEEcCCcEEEEEEcCCceeecccccccccccccccccc Q lcl|NC_013644. 145 VADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDI--HHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDS 222 (510) Q Consensus 145 ~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~--~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 222 (510) .+||+.+..|.--........++ .+.+..... .-+.+|.+.+.+. ..+.. T Consensus 157 ~lDPr~i~~vR~i~~~~~~~~~~-----~~~~~~v~~~~~eyf~ynp~g~~~---~~~~~-------------------- 208 (537) T protein:vir:10 157 YVDPRKIRKVTEYEAKRPEALRT-----QDLNQQLTQQSASYFLYNPKGLKN---STNQG-------------------- 208 (537) T ss_pred eeCCccceeeEeecccCCccceE-----Eecceeeeecccceeeeccccccc---cCCCc-------------------- Confidence 99999987665311111111111 111111111 1112333332210 00000 Q ss_pred cccccccccCCccc---EEEec------CCCCCCCcHHHHHHHHHHHHH--HHHHHHHHHHHhccceeEE----ec-CCC Q lcl|NC_013644. 223 ENESLLQRSYGQIP---FYRLS------NNKQETTDLKPIKALIDDYDL--MNCFLSNNLQDFAEAIYVV----SG-FQG 286 (510) Q Consensus 223 ~~~~~~~~~~g~iP---vv~~~------nn~~g~sd~~~v~~liD~~n~--~~S~~~~~~~~~~~~~lv~----~g-~~~ 286 (510) =+|| |++.. |+....|-+. ..|..+|. ++-|.+-..+..+.|=.=+ .| +.. T Consensus 209 ----------vkI~~dAI~y~hSGl~d~n~~~i~syLh---kAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk 275 (537) T protein:vir:10 209 ----------MKIAPDSIAYCHSGIQDLNKNMVLSHLH---KAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPK 275 (537) T ss_pred ----------eeccHhheeeecccceeCCCCeeeeeeh---hhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCc Confidence 0111 11100 1222223333 33344443 3444444444444332211 11 111 Q ss_pred Cchhh----hhHhhhcCeeee--------------------c---cCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHh Q lcl|NC_013644. 287 DDLSK----LRQNVKSKKVVG--------------------T---GSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFG 337 (510) Q Consensus 287 ~~~~~----~~~~~~~~~~~~--------------------~---~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s 337 (510) .-... .+...+...+.. + +++.+.+.-|.+ .+.. ...-+.-+++.+|+.- T Consensus 276 ~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnlg-em~DV~YF~kKLy~aL 354 (537) T protein:vir:10 276 NKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLG-ELEDVKYFQKKLYKAL 354 (537) T ss_pred hhHHHHHHHHHHhccceEEEeccCceecccchhhhhhhhhcccccCCCcccceeeccccCCcC-hHHHHHHHHHHHHHHh Confidence 11111 111111111111 1 122223333333 2222 2344555666677766 Q ss_pred CCccc--cccc---cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCcccc--ceeeEEeCCCCC Q lcl|NC_013644. 338 MAFDS--TQVG---DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDP--TEVSFTFTREVM 410 (510) Q Consensus 338 ~~p~~--~~~~---~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~--~~v~i~f~~~~p 410 (510) .+|-. ...+ .|..|. |.......-.-+.+-+..|...+.++++.=+-+-++....+|+. ..+.+.|...-. T Consensus 355 nVP~SRl~~e~~f~~Gr~~E--ItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~ 432 (537) T protein:vir:10 355 NVPSSRLETETTFNIGRAAE--ITRDEVKFQKFIARLRKRFSELFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNY 432 (537) T ss_pred CCCccccCCCCcccccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecch Confidence 67742 2222 233332 22223333344556666677777776665443334444445543 357778865555 Q ss_pred CCHHHHHHH-------HHHHH--hcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhcc-CCCCCCCCC Q lcl|NC_013644. 411 VNETDIVND-------EKTEA--ETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYT-KGLSDNTDE 480 (510) Q Consensus 411 ~d~~e~~~~-------~~~~~--~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~-~~~~~~~~~ 480 (510) -.+...++. +..+. -+..+|.+++.+.+=-.+|+|..++....+++..+... .+|.. .+++.+. T Consensus 433 f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~tDeeI~~~~k~I~~E~k~~~~----~~p~~~~~~~~~~-- 506 (537) T protein:vir:10 433 FTELKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVLKQTESEIKEIDKEIKQEIADGVI----MDPQAMQAMEMGI-- 506 (537) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHhccCHHHHHHHHHHHHHHhhCCCC----CCcccccccccCC-- Confidence 444433333 22221 23346999999886666666543322222221111100 01100 0111111 Q ss_pred cccCCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 481 EETAVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) .+..+-++.+.+++.+++.+. ..+-|..| T Consensus 507 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 535 (537) T protein:vir:10 507 GDEEPVPEGGEEPQTDPNSAV-SPADQKRG 535 (537) T ss_pred CCcccCCCCCCCcccCCccCC-CCCCccCC Confidence 111112333344444443322 33455566 No 215 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=96.77 E-value=0.00035 Score=39.52 Aligned_cols=384 Identities=10% Similarity=-0.032 Sum_probs=145.9 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |--. ++ .. ......-..+..+.-|.-. ..+ .. .+ + .|+ +-.-..|+ T Consensus 1 m~~f------------~~----~~-~~~~~~~~~~~~~~~~~~~-----~~~-------~~-~~-A-l~~--~~V~~~i~ 46 (406) T protein:vir:97 1 MSFF------------QP----LG-TSKVSYDDYISSVLAGDVS-----QKY-------LG-VS-A-LKN--SDILTATS 46 (406) T ss_pred Cccc------------cc----cC-CCCCCcchHHHHHhcCCCC-----ccc-------cc-ch-h-hcc--HHHHHHHH Confidence 1000 00 00 0000000001111111100 000 00 00 0 111 11111244 Q ss_pred HHHhhhhcCCceeccCcHH--HHHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECC-CCce-EEEEEcccc Q lcl|NC_013644. 81 QKTQYLLSNPVEYETENEE--LKEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNA-EDRL-CFQVADSLN 150 (510) Q Consensus 81 ~~~~~l~g~p~~~~~~d~~--~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~-~g~~-~i~~~~p~~ 150 (510) ..++-+..-|+.+...+.+ ....+..++. |. .......+....+..|.||+++..+. .|.+ .+.+++|.. T Consensus 47 ~Ia~~iA~lp~~~~~~~g~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~ 126 (406) T protein:vir:97 47 IIAGDIARFPLVKKDVNGDIIHDEDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYRPSE 126 (406) T ss_pred HHHHhhhhCeeEEEecCccccccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEECCCe Confidence 4444343446554322211 1122334332 22 23555667788899999999988875 4554 688889998 Q ss_pred eEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccc Q lcl|NC_013644. 151 VFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQR 230 (510) Q Consensus 151 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (510) +.+..++.+.+ +|.+....++.. ..+.+..+.+++.- T Consensus 127 v~v~~~~~~~~-----~y~~~~~~~~~~------~~~~~~evih~r~~-------------------------------- 163 (406) T protein:vir:97 127 TTVEETDNHEI-----VYTFTDMLTAKQ------VKCFAHDVIHWKFF-------------------------------- 163 (406) T ss_pred eEEEEcCCceE-----EEEEEecCCceE------EEEccccEEEecCC-------------------------------- Confidence 88766644321 122221111110 01222333333210 Q ss_pred cCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEE-ecCCCCch--hhhhHhhh-------cCe Q lcl|NC_013644. 231 SYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVV-SGFQGDDL--SKLRQNVK-------SKK 300 (510) Q Consensus 231 ~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~-~g~~~~~~--~~~~~~~~-------~~~ 300 (510) | .+.-.|.|.+..+...++....+..-..+.++....|-.++ .+....+. ..+...+. .++ T Consensus 164 -----~----~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~g~n~g~ 234 (406) T protein:vir:97 164 -----S----HDTILGRSPLLSLGDEIDLQTGGINTLIKFFKDGFSSGILTMKGAQLSGDARQRARQEFEKMREGSVGGS 234 (406) T ss_pred -----C----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEecCCCCCHHHHHHHHHHHHHHhcccccCc Confidence 0 00112556565555444433333333333344444443333 33222221 11111111 133 Q ss_pred eeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 301 VVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWM 380 (510) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~ 380 (510) ++.++++.+...++.......+.+..+...+.|...-++|+.-.+..+.-|..+ . .....+...|.-. T Consensus 235 ~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~~~~e--~----------~~~~f~~~~l~P~ 302 (406) T protein:vir:97 235 PLVFDSTMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVNSPNQSVA--Q----------LMEDYVTNDLPFY 302 (406) T ss_pred eeecCCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCCCcchHH--H----------HHHHHHHHHHHHH Confidence 555666555555543333333445555567778877788876543222222211 1 1112233344444 Q ss_pred HHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCC--CcHHHHHHHHHHHHHHH Q lcl|NC_013644. 381 NKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRL--DDDNVLRLICEQFDLDW 458 (510) Q Consensus 381 ~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v--~d~e~~~~~~e~~e~~~ 458 (510) ++.|...+..+--.+.......+.|+ +..+....++.+.++.++|+++.-.+++.++.- .++.-.+ T Consensus 303 ~~~ie~~l~~kll~~~~~~~~~i~fd--~~~~~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~~~~gD~---------- 370 (406) T protein:vir:97 303 FDAITSELGLKTLNDKDRRLYHIEFD--TRSVTGRNVDEIVKLVNNQILTPNQGLVELGKQKSTDPNMDR---------- 370 (406) T ss_pred HHHHHHHHhhhhcChhhccceeEEEe--cCccchhhHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCe---------- Confidence 44444433322111111223345554 223455666778888999999998888876432 1110000 Q ss_pred HHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCc Q lcl|NC_013644. 459 EDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGA 498 (510) Q Consensus 459 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (510) .....+..+- +..+...+.......++ +...+...+ T Consensus 371 --~~~~~n~~~~-~~~~~~~~~~~~~~~gg-~~~~~~~~~ 406 (406) T protein:vir:97 371 --YQSSLNYVFL-DKKEEYQDKVGIKGKGG-EVNAEEDKS 406 (406) T ss_pred --EeeccCccch-hcccccccccccccCCC-CCCCCCCCC Confidence 0000110000 00000000000000010 000000000 No 216 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=96.75 E-value=0.00036 Score=39.45 Aligned_cols=384 Identities=7% Similarity=-0.027 Sum_probs=160.5 Q ss_pred ccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHH Q lcl|NC_013644. 4 LLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKT 83 (510) Q Consensus 4 ~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~ 83 (510) |.-+ +.............-..+ ....|-.. . ..+..+ -+..-+.++-....|+..+ T Consensus 1 m~~~----------~~~~~~~~~~~~~~~~~~-~~~~g~~~------s---~~~~~v----~~~~al~~~~v~~cv~~ia 56 (419) T protein:vir:80 1 MFFS----------RQLLSNLGQTQPGSGGWV-SALLGSAR------S---EAGQVV----TPASALSLTVLQNCVTLLA 56 (419) T ss_pred CCcc----------cccccccCcCCCCcchhh-HHhhcccc------c---ccCccc----ChHHhhccHHHHHHHHHHH Confidence 0000 000000000000000000 00000000 0 000000 0001112233344555566 Q ss_pred hhhhcCCceecc--Cc--HH-HHHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCceE-EEEEcccce Q lcl|NC_013644. 84 QYLLSNPVEYET--EN--EE-LKEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRLC-FQVADSLNV 151 (510) Q Consensus 84 ~~l~g~p~~~~~--~d--~~-~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~-i~~~~p~~~ 151 (510) +-+.+-|+.+-- ++ +. ...-+..++. |. .......++.....+|.||+++..+.+|++. +.+++|..+ T Consensus 57 ~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~i~~~~v 136 (419) T protein:vir:80 57 ESIAQLPVELYERSGDDRKPATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLYPLDNEAV 136 (419) T ss_pred HhhccCceEEEEecCCCcccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceE Confidence 666666776411 11 11 1112333331 22 2344556677889999999999999889864 788889888 Q ss_pred EEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccccccccccccccccc Q lcl|NC_013644. 152 FGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRS 231 (510) Q Consensus 152 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (510) -+..+..+.+ +|.+. +. ..+.... T Consensus 137 ~i~~~~~~~~-----~y~~~----~~-------~~~~~~~---------------------------------------- 160 (419) T protein:vir:80 137 TVMKGPDLKP-----MYRVA----GA-------DPLPQRL---------------------------------------- 160 (419) T ss_pred EEEECCCceE-----EEEEc----Cc-------cccchhh---------------------------------------- Confidence 7766543211 11100 00 0011111 Q ss_pred CCcccEEEecC----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecC---CCC-chh---hhhHhh---- Q lcl|NC_013644. 232 YGQIPFYRLSN----NKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGF---QGD-DLS---KLRQNV---- 296 (510) Q Consensus 232 ~g~iPvv~~~n----n~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~---~~~-~~~---~~~~~~---- 296 (510) |+++++ .-.|.|.+..+...|+....+..-..+.+...+.|-.+++-. ... +.. .+...+ T Consensus 161 -----i~h~~~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (419) T protein:vir:80 161 -----VHHVRWMSINGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDRITDGWNAKF 235 (419) T ss_pred -----eEEecCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHHHHHHHHHHh Confidence 222222 124666666555555544444333344455556676665421 111 111 122111 Q ss_pred ----hcCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 297 ----KSKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEAR 372 (510) Q Consensus 297 ----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~ 372 (510) ..++++.++++.+++.++.......+.+..+...+.|+..-++|+.-.+..+..+...++... ... T Consensus 236 ~g~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~----------~~f 305 (419) T protein:vir:80 236 GGSGNAKKVALLQEGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQS----------LQF 305 (419) T ss_pred cCccccCCceecCCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHH----------HHH Confidence 123456677666665555444444556667777888988888887543322222211111111 122 Q ss_pred HHHHHHHHHHHHHHHHhhccCC--ccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHH Q lcl|NC_013644. 373 LRALLEWMNKLVIDDINRRYTK--AFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLR 448 (510) Q Consensus 373 ~~~~l~~~~~~i~~~~~~~~~~--~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~ 448 (510) +...|.-+++.|...+..+--. ......+++.+..-+..|..+.++.+.++..+|+++.-.+++.++. +.+-+.. T Consensus 306 ~~~~l~P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~gGD~~- 384 (419) T protein:vir:80 306 VIYTLLPWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGDIY- 384 (419) T ss_pred HHHHHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccee- Confidence 3333444444443333321111 1111234444456667899999999999999999998888887643 1110000 Q ss_pred HHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccccCC Q lcl|NC_013644. 449 LICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLPE 508 (510) Q Consensus 449 ~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (510) ....+ ....+...+.+.+.+ + +..+.-.+.+|+=. T Consensus 385 -------------~~~~n------~~~~~~~~~~~~~~~----~--~~~~~~~~~~~~l~ 419 (419) T protein:vir:80 385 -------------LSPMN------MVDASKPQPIPMGKT----E--PTKAALDEIGRILS 419 (419) T ss_pred -------------eeccc------cccccccccccCCCC----C--chhhhHHHHHhhcC Confidence 00000 000000000000000 0 00011112233222 No 217 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=96.69 E-value=0.00041 Score=39.16 Aligned_cols=371 Identities=9% Similarity=0.027 Sum_probs=161.9 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhccc---------ceeccccccccccc-ccccee Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRI---------FYVDDEGILREDKY-ASNVRI 70 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~---------~~~~~~~~~~~~~~-~~~~ki 70 (510) |-.+ ++.. -....+++..+........ .............. ....++ T Consensus 1 ~~~~--~~~~---------------------~~~~m~~F~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 57 (413) T protein:vir:96 1 MPGV--SEIR---------------------KDKNLKFFNNKRSPTEESKAKDEIPKAPQVVMTLPNFFKELISDGYTKL 57 (413) T ss_pred CCcc--chhh---------------------hhhcCCccccCCCcchhhhhhccccccccccccchhhHhhhccchhHHH Confidence 2111 0000 0001112211110000000 00000000000000 000111 Q ss_pred -ccchhHHHHHHHHhhhhcCCceecc--C--cHHHHHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCC Q lcl|NC_013644. 71 -PHGFFPEIVDQKTQYLLSNPVEYET--E--NEELKEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNAED 139 (510) Q Consensus 71 -~~n~~~~Iv~~~~~~l~g~p~~~~~--~--d~~~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g 139 (510) ..+.....|+..++-+.+-|+.+-- + .+.....+..++. |. .......+....+.+|.||+++..+.+| T Consensus 58 ~~~~~v~~cI~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g 137 (413) T protein:vir:96 58 SDSPEVRMAVDCIADLVSNMTIQLMQNGETGDKRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSG 137 (413) T ss_pred hhchHHHHHHHHHHHhhccCceEEEEecCCCccccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCC Confidence 1345556667667667677777511 1 1122222333331 22 3455667788899999999999998877 Q ss_pred c-e-EEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccc Q lcl|NC_013644. 140 R-L-CFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHV 217 (510) Q Consensus 140 ~-~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (510) . + .+.+++|..+-+..++.. +++.... ++. .+.+.. T Consensus 138 ~~~~~L~~l~~~~v~~~~~~~~-------~~y~~~~-~~~--------~~~~~e-------------------------- 175 (413) T protein:vir:96 138 DKIIGLTPISPYKVTFNVSDDD-------LDYSITF-DNK--------EYDPST-------------------------- 175 (413) T ss_pred CceEEEEEecCceeEEEEcCCe-------EEEEEee-cCc--------EEchhh-------------------------- Confidence 4 3 688899988877665321 1111100 000 112222 Q ss_pred ccccccccccccccCCcccEEEecCC------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCch-- Q lcl|NC_013644. 218 LAVDSENESLLQRSYGQIPFYRLSNN------KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDL-- 289 (510) Q Consensus 218 ~~~~~~~~~~~~~~~g~iPvv~~~nn------~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~-- 289 (510) |+||+.+ -.|.|-+..+...+.....+..-..+.+.-.+.|-.+++....-.. T Consensus 176 -------------------vih~k~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~ 236 (413) T protein:vir:96 176 -------------------LLHFVLNPSIERPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDSDSDELS 236 (413) T ss_pred -------------------EEEEeccCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHH Confidence 2333311 1255656555555544444444444555666667666553221111 Q ss_pred -hhhhHhhh--------cCeeeeccCCC-ceeEEe-ecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHH Q lcl|NC_013644. 290 -SKLRQNVK--------SKKVVGTGSDG-GLDVKT-VTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKAR 358 (510) Q Consensus 290 -~~~~~~~~--------~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~ 358 (510) ..+...+. .++++.+++++ +..-+. .+.....+.+..+...+.|+..-++|+.-.+...+....+. T Consensus 237 ~~~~~~~~~~~~~g~~n~g~~~vl~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~--- 313 (413) T protein:vir:96 237 DEEGRENFEEMYLKRKEAGKPWIIPEGMVNVQQIKPLTLNDLAINDAVTLDKKTVAGIFGVPAFLLGVGTYNKDEFN--- 313 (413) T ss_pred HHHHHHHHHHHhcCccccCceeeecCCcccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCcchHHHHH--- Confidence 11222111 12334444433 222221 12233445566667778888887888754432211111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhC Q lcl|NC_013644. 359 YTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVA 438 (510) Q Consensus 359 ~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~ 438 (510) ..+...|.-+++.|...++.+--. +...+++.++.-+..|..+.++.+.++..+|+++.-.+++++ T Consensus 314 ------------~~~~~~l~P~~~~ie~~ln~~ll~--~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~ 379 (413) T protein:vir:96 314 ------------NFINTKIMSIAQVIQQTYNKLIVE--EDMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRRNEFRNWV 379 (413) T ss_pred ------------HHHHHHHHHHHHHHHHHHHHhhCC--CCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh Confidence 234445555555555554432111 123455566677788999999999999999999998888877 Q ss_pred CCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCc Q lcl|NC_013644. 439 PRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDP 490 (510) Q Consensus 439 ~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (510) +.-..+... ......+..+ .+. ..+......+++ T Consensus 380 g~~p~~~gd------------~~~~~~n~~~----~~~--~~~~~~~~~~dt 413 (413) T protein:vir:96 380 GMPPDAEMD------------DLLVLENYLQ----QKD--LVNQKKLIQDET 413 (413) T ss_pred CCCCCCCcc------------eeeecccccc----hhh--cccccCCCCCCC Confidence 542211100 0000000000 000 000000111111 No 218 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=96.61 E-value=0.00047 Score=38.83 Aligned_cols=396 Identities=10% Similarity=-0.009 Sum_probs=162.0 Q ss_pred ccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcc--hhccc-ceec----cccccccccccc---cceeccc Q lcl|NC_013644. 4 LLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDI--MNNRI-FYVD----DEGILREDKYAS---NVRIPHG 73 (510) Q Consensus 4 ~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i--~~~~~-~~~~----~~~~~~~~~~~~---~~ki~~n 73 (510) |+. ..-.....+++..+...... -.+.. .... ..+......... +.=+.++ T Consensus 1 ~~~-------------------~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~a~~~~ 61 (432) T protein:vir:97 1 MPD-------------------EKKLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLD 61 (432) T ss_pred CCC-------------------cccCchhhhhHhhcCCccccccccccccccCchhhhhhcccccccCcccchHhhhcch Confidence 111 11111122222222211100 00000 0000 000000000000 0000111 Q ss_pred hhHHHHHHHHhhhhcCCcee-c-cCc---HHHHHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCce- Q lcl|NC_013644. 74 FFPEIVDQKTQYLLSNPVEY-E-TEN---EELKEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRL- 141 (510) Q Consensus 74 ~~~~Iv~~~~~~l~g~p~~~-~-~~d---~~~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~- 141 (510) -.-..|+..++-+-+-|+.+ . ..+ +....-+..++. |. .......+....+.+|.||+++..+ +|++ T Consensus 62 aV~~~v~~Ia~~ia~lp~~~y~~~~~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~ 140 (432) T protein:vir:97 62 AVAACVKLVSQAVAAMPLMMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIE 140 (432) T ss_pred HHHHHHHHHHHhhccCceEEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEE Confidence 12223444444455556653 1 111 111112223331 22 2345556778889999999888776 4664 Q ss_pred EEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccccccc Q lcl|NC_013644. 142 CFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVD 221 (510) Q Consensus 142 ~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 221 (510) .+.+++|..+-++.+..+.+ +|++.. .++. . ..+....+.+++.- T Consensus 141 ~L~~l~p~~v~v~~~~~g~~-----~y~~~~-~~g~-----~-~~~~~~~iih~r~~----------------------- 185 (432) T protein:vir:97 141 SLQYLANDRLTITTDTKGNT-----AYRYRR-TDGQ-----M-IDIPRQQIWKIMGY----------------------- 185 (432) T ss_pred EEEEEcCcceEEEEcCCCcE-----EEEEEe-cCce-----E-EEEccccEEEecCc----------------------- Confidence 56789999998887755431 222211 1111 0 11233333333210 Q ss_pred ccccccccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh---hhhHhh-- Q lcl|NC_013644. 222 SENESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS---KLRQNV-- 296 (510) Q Consensus 222 ~~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~---~~~~~~-- 296 (510) ++ +.-.|.|-++.....++....+..-..+.+...+.|-.+++-...-+.+ .+.+.. T Consensus 186 --------------~~----dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~ 247 (432) T protein:vir:97 186 --------------SL----DGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSFSKKVSG 247 (432) T ss_pred --------------CC----CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecCCCCCHHHHHHHHHHHhh Confidence 00 1113555555444444433333333334445556665555432221111 122211 Q ss_pred --hcCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc--cCc-ccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 297 --KSKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG--DGN-ITNIVIKARYTLLNMKANKTEA 371 (510) Q Consensus 297 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~g~-~Sg~Ai~~~~~~l~~k~~~k~~ 371 (510) ..++++.++++.+.+.++.+.....+.+..+.....|+..-++|+.-.+. .++ ..|..++.... . T Consensus 248 ~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~~~~----------~ 317 (432) T protein:vir:97 248 SVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQL----------G 317 (432) T ss_pred hhcCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHHHHH----------H Confidence 22446667766666666554444555666778888898888888754322 111 11222222111 2 Q ss_pred HHHHHHHHHHHHHHHHHhhccC--CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHH Q lcl|NC_013644. 372 RLRALLEWMNKLVIDDINRRYT--KAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVL 447 (510) Q Consensus 372 ~~~~~l~~~~~~i~~~~~~~~~--~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~ 447 (510) .+...|.-.++.|...+..+-- .......+++.+..-+-.|..+.++.+.++..+|+++.-.++++++. +.+.... T Consensus 318 f~~~tl~P~~~~ie~~ln~kLl~~~e~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~glpp~~g~~~~ 397 (432) T protein:vir:97 318 FLTMTLSPWLRRIEQSIALNLLTPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNAAV 397 (432) T ss_pred HHHHHHHHHHHHHHHHHhhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCcce Confidence 2233344444444433332211 11111234444456667899999999999999999999888887643 2211000 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccc Q lcl|NC_013644. 448 RLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQ 505 (510) Q Consensus 448 ~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (510) ...+....+ .+.......+++..+++.+++...+ + T Consensus 398 ---------------~~~~~~~~p--l~~~~~~~~~~~~~~~~~~~~~~~~------~ 432 (432) T protein:vir:97 398 ---------------LTVQSAMVP--LDSIGLQASPEPASGLGNQQQDKVS------K 432 (432) T ss_pred ---------------Eeecccccc--hhhhcccCCCCCCCCCCCccccccc------C Confidence 000000000 0000000011111111111111111 1 No 219 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=96.58 E-value=0.0005 Score=38.70 Aligned_cols=391 Identities=11% Similarity=0.000 Sum_probs=168.2 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhccccee---ccccccccccc-cccceeccchhH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYV---DDEGILREDKY-ASNVRIPHGFFP 76 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~---~~~~~~~~~~~-~~~~ki~~n~~~ 76 (510) |+.+ +-+-+. ..++..+..++..+.|............ ...+. .-... -+..-+..+-.. T Consensus 1 ~~~~-----~~~~~~----------~~~~g~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~~~~al~~~~v~ 64 (424) T protein:vir:18 1 MEEP-----KYTIDL----------RTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGY-LGDSSINDERILQISTVW 64 (424) T ss_pred CCCC-----cccccc----------CCCCchHHHHHhhccccccccccchhhccccccccc-cccccccHHHhhccHHHH Confidence 4333 211110 0122223334444444321111000000 00000 00000 000001112223 Q ss_pred HHHHHHHhhhhcCCcee-ccC-cH---H--HHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCce-E Q lcl|NC_013644. 77 EIVDQKTQYLLSNPVEY-ETE-NE---E--LKEYLAEYYN---S---EFQVVLQELVEGSSQKGFEYVYARTNAEDRL-C 142 (510) Q Consensus 77 ~Iv~~~~~~l~g~p~~~-~~~-d~---~--~~~~l~~~~~---n---~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~ 142 (510) ..|+..++-+.+-|+.+ ... +. . ...-+..++. | ........+....+.+|.||+++.++..|++ . T Consensus 65 ~cv~~Ia~~iA~lp~~vy~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~ 144 (424) T protein:vir:18 65 RCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVIS 144 (424) T ss_pred HHHHHHHHhhccCceEEEEeccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEE Confidence 34555555555667664 211 11 1 1112333331 2 2234455677889999999999988888875 5 Q ss_pred EEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccc Q lcl|NC_013644. 143 FQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDS 222 (510) Q Consensus 143 i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 222 (510) +.+++|..+-+..++. .+ +|.+. .++. ...|.+..+.+++.-. T Consensus 145 L~~l~~~~v~v~~~~~-~~-----~y~~~--~~g~------~~~~~~~eVihir~~~----------------------- 187 (424) T protein:vir:18 145 LLPLQSANMDVKLVGK-KV-----VYRYQ--RDSE------YADFSQKEIFHLKGFG----------------------- 187 (424) T ss_pred EEEecCcceEEEEcCC-eE-----EEEEE--eCCe------EEEeccccEEEecCcC----------------------- Confidence 7778888887655421 11 12111 1111 0123333343332100 Q ss_pred cccccccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCC--Cc-h-hhhhHhhh- Q lcl|NC_013644. 223 ENESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQG--DD-L-SKLRQNVK- 297 (510) Q Consensus 223 ~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~--~~-~-~~~~~~~~- 297 (510) .+...|.|-+..+...+.....+..-..+.+...+.|-.+++-... .+ . ..+...++ T Consensus 188 ------------------~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~~ 249 (424) T protein:vir:18 188 ------------------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKE 249 (424) T ss_pred ------------------CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCcCCCHHHHHHHHHHHHH Confidence 0111344555544444443333333334445555666555543221 11 1 11111111 Q ss_pred ------cCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc--cCcccHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 298 ------SKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG--DGNITNIVIKARYTLLNMKANKT 369 (510) Q Consensus 298 ------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~g~~Sg~Ai~~~~~~l~~k~~~k 369 (510) .++++.++++.+.+.++.......+.+..+..++.|+..-++|+.-.+. .++.+|..++.... T Consensus 250 ~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~--------- 320 (424) T protein:vir:18 250 IAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL--------- 320 (424) T ss_pred HhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHH--------- Confidence 1235566666555555544444556667777888888888888754322 22332333332222 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccC--CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHH Q lcl|NC_013644. 370 EARLRALLEWMNKLVIDDINRRYT--KAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVL 447 (510) Q Consensus 370 ~~~~~~~l~~~~~~i~~~~~~~~~--~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~ 447 (510) ..+...|.-+++.|...+..+-- ....-..+++.+..-+..|..+.++.+.++..+|+++.-.++++++.-.-+.-. T Consensus 321 -~f~~~tl~P~~~~ie~~ln~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD 399 (424) T protein:vir:18 321 -GFLQYTLQPYISRWENSIQRWLIPSKDVGRLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNMPPLPGGD 399 (424) T ss_pred -HHHHHHHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC Confidence 22233444444444444432211 111122355556677888999999999999999999987777776431100000 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCc Q lcl|NC_013644. 448 RLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGA 498 (510) Q Consensus 448 ~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (510) ......+..+- .+-+...++...|+ T Consensus 400 ------------~~~~~~n~~~l--------------~~~~~~~~~~~n~a 424 (424) T protein:vir:18 400 ------------VAMRQAQYVPI--------------TDLGTNKEPRNNGA 424 (424) T ss_pred ------------eeeeccCccch--------------hhhhccCCccccCC Confidence 00000000000 00011111222222 No 220 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=96.45 E-value=0.00061 Score=38.21 Aligned_cols=400 Identities=10% Similarity=0.028 Sum_probs=157.7 Q ss_pred hHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCCc Q lcl|NC_013644. 12 IANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNPV 91 (510) Q Consensus 12 ~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~ 91 (510) .++.|.++..+.+.... .....+-++. |.. + .. ....+..+-. ..-...+..-..|+..++-+.+-|+ T Consensus 1 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~-g~~-~-~~----~~~~~~~~~~----~~a~~~~~v~~~v~~ia~~iA~lp~ 68 (460) T protein:vir:10 1 MANRIIRALRELTGLDN-KFNDAFIKYI-GQT-F-TK----YDNNGKTYLE----QGYNINPDVYSCISQMAAKTVAVPY 68 (460) T ss_pred CchhHHHHHhhhhccCC-CchHHHHHhh-ccc-c-CC----CccchhhhhH----HHHhcchHHHHHHHHHHHhhhhCce Confidence 44555555443321111 1111222221 111 0 00 0000000000 0001123334445555555666666 Q ss_pred eecc--CcHH-------------------------------HHHHHHHHhc--cC---HHHHHHHHHHHHHhcCeEEEEE Q lcl|NC_013644. 92 EYET--ENEE-------------------------------LKEYLAEYYN--SE---FQVVLQELVEGSSQKGFEYVYA 133 (510) Q Consensus 92 ~~~~--~d~~-------------------------------~~~~l~~~~~--n~---~~~~~~e~~~~~~~~G~~~~~v 133 (510) .+-- .+.. ....+..++. |. .......+....+.+|.||.++ T Consensus 69 ~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i 148 (460) T protein:vir:10 69 TIKVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYL 148 (460) T ss_pred EEEeccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEE Confidence 5411 1100 0011122221 22 2344455677899999999988 Q ss_pred EECCC----Cce-EEEEEcccceEEEEcCCCCceeE-EEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccc Q lcl|NC_013644. 134 RTNAE----DRL-CFQVADSLNVFGVYNEYNELQRI-CRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDE 207 (510) Q Consensus 134 ~~d~~----g~~-~i~~~~p~~~~~~~d~~~~~~~~-~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~ 207 (510) ..+.. |.+ .+.+++|..+-+..++.+..... ...+.+....++. ...+.+..+.+++...... T Consensus 149 ~r~~~~~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~------~~~~~~~evih~r~~~~~~----- 217 (460) T protein:vir:10 149 MSPDDGINAGVPSQMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLIQGDQ------FIEFNEDEVIHTKYANPNF----- 217 (460) T ss_pred EecCCCccCceeEEEEEEcCceEEEEEcCCCceeeeeeeeeEEEEecCce------eEEecccceEEEecCCCCc----- Confidence 77543 555 37788998888877654422111 1111111111110 0122333333332110000 Q ss_pred ccccccccccccccccccccccccCCcccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCC Q lcl|NC_013644. 208 AEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGD 287 (510) Q Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~ 287 (510) -.-...-.|.|.+..+...+.....+..-..+.+...+.|-.++.....- T Consensus 218 ------------------------------~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~~~~i~~~~~~l 267 (460) T protein:vir:10 218 ------------------------------DLQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQNGGVFGFIHGGSTGL 267 (460) T ss_pred ------------------------------ccccCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceeeecCCCC Confidence 00000113556666555555554444444444455555565554432211 Q ss_pred chh---hhhHhhh--------cCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc--CcccHHH Q lcl|NC_013644. 288 DLS---KLRQNVK--------SKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGD--GNITNIV 354 (510) Q Consensus 288 ~~~---~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~~Sg~A 354 (510) +.+ .+...+. .++++.++++.+.+.++.......+.+..+...+.|+..-++|+.-.+.. ++.++.. T Consensus 268 ~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn 347 (460) T protein:vir:10 268 TQPQADSLKQRLTEMDKSPDRLSQIAGASGEIAFTKISLNTDELKPFDYLKYDQKAICNALGWSDKLLNNNEGGGLNTGN 347 (460) T ss_pred CHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCCcccc Confidence 111 1221111 23355666666555555444445566777788888888888887533221 1112222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccc-cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHH Q lcl|NC_013644. 355 IKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFD-PTEVSFTFTREVMVNETDIVNDEKTEAETRKIILES 433 (510) Q Consensus 355 i~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~-~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et 433 (510) ++.. ....+...|.-++..|...+..+--.... .....|.|+-.-.....+.......+..+|+++.-. T Consensus 348 ~e~~----------~~~f~~~~l~P~~~~ie~~ln~kl~~~~~~~~~~~i~~d~~~l~~l~~d~~~~~~~~~~g~~T~NE 417 (460) T protein:vir:10 348 LEEE----------RKRVVTDNIQPDLVILKQAFDKKFIKRFKGYENAVIEWDISELPEMQTDMVAMASWLNTIPVTPNE 417 (460) T ss_pred HHHH----------HHHHHHHHHHHHHHHHHHHHHHhhcCcccccCCceEEeecchhhhHHHHHHHHHHHHhCCCCCHHH Confidence 2211 11233334444444444433322111111 122344553222222333344555677889988877 Q ss_pred HHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccc Q lcl|NC_013644. 434 ILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQ 505 (510) Q Consensus 434 ~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (510) +++.++. ++++-..+ .....+..+. +... + +..++.+.+ +| T Consensus 418 ~R~~~g~~pi~~~~gD~------------~~~~~n~~~~----~~~~---~-~~~~~~~nq-----------~~ 460 (460) T protein:vir:10 418 IRIAMKYETLNQDGMDI------------VFMPSNKVRI----DDVS---N-NLIDSAFNQ-----------NQ 460 (460) T ss_pred HHHHhCCCCCCCCCCCe------------eeecccccch----hhcc---c-ccCCCcccC-----------CC Confidence 7777643 22110000 0000000000 0000 0 000000000 00 No 221 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=96.23 E-value=0.00085 Score=37.42 Aligned_cols=408 Identities=9% Similarity=0.030 Sum_probs=182.3 Q ss_pred CCCccCCCh-hhh-----HHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccch Q lcl|NC_013644. 1 MEALLSEDV-KII-----ANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGF 74 (510) Q Consensus 1 ~~~~~~~~~-~~~-----~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~ 74 (510) |.--+.... ... .+-+...|... + +.+ +.+.+.. +...+.......+...+ .+.. . ..... T Consensus 1 ~~~~i~~~~g~~~~~~~~~~~~~~~ia~~----~-~~~----~~~~~~~-~~p~~~~il~~~~~~~~-~y~~-m-~~D~~ 67 (491) T protein:vir:79 1 MSKGLWVSPTEFVKFGEPDKSLSSQIATR----A-RSI----DFFALGM-YLPNPDPVLKALGKDIR-VYRE-L-RADAH 67 (491) T ss_pred CCCeeeCCCCCcccccccchhHHHHHhhh----c-ccc----ccccccc-cCcchhHHHhhccCCHH-HHHH-H-hhChH Confidence 443322211 110 01111222210 0 000 0000000 10100000000000000 0000 0 12455 Q ss_pred hHHHHHHHHhhhhcCCceeccC--cHHHHHHHHHHhcc-CHHHHHHHHHHHHHhcCeE-EEEEEECCCCce---EEEEEc Q lcl|NC_013644. 75 FPEIVDQKTQYLLSNPVEYETE--NEELKEYLAEYYNS-EFQVVLQELVEGSSQKGFE-YVYARTNAEDRL---CFQVAD 147 (510) Q Consensus 75 ~~~Iv~~~~~~l~g~p~~~~~~--d~~~~~~l~~~~~n-~~~~~~~e~~~~~~~~G~~-~~~v~~d~~g~~---~i~~~~ 147 (510) ..-.+.+...-+.|.+..+.+. ++...+++.++++. +|.+.+.++ .++..+|.+ .+++|.-.+|.. ++.+++ T Consensus 68 i~s~l~~Rk~av~~~~w~i~~~~~~~~~a~~i~e~l~~~~~~~~i~~~-lda~~~G~s~~Ei~w~~~~g~~~~~~l~~r~ 146 (491) T protein:vir:79 68 VGGCVRRRKAAVKALEWGLDRGKAKSRVAKSIADVFADLDLSRIATEM-LDAVLYGYQPMEITWGKVGNYIVPIDVVGKP 146 (491) T ss_pred HHHHHHHHHHHHhCCCcEEecCCCCHHHHHHHHHHHhcCCHHHHHHHH-HHhhhhcceeEEEEEeecCCeeeEEeeeeec Confidence 6666777777788889888653 44556788888753 566666655 568888965 556665445554 455666 Q ss_pred ccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccccccccccccc Q lcl|NC_013644. 148 SLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESL 227 (510) Q Consensus 148 p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (510) |+.+. ||..+.+ +++...+... .. T Consensus 147 ~~~f~--~d~~~~l--------------------------------~l~~~~~~~~----------------------g~ 170 (491) T protein:vir:79 147 ADWFV--YDPENQL--------------------------------RFRSKEHWVQ----------------------GE 170 (491) T ss_pred cccee--eccCCce--------------------------------EEeecCCCCC----------------------ce Confidence 65443 3322211 0110000000 00 Q ss_pred ccccCCcccEEEec--CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhh------hhHhhhcC Q lcl|NC_013644. 228 LQRSYGQIPFYRLS--NNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSK------LRQNVKSK 299 (510) Q Consensus 228 ~~~~~g~iPvv~~~--nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~------~~~~~~~~ 299 (510) .-.+++.|-..+-. .++.|.|.+..+-...--=+..+.+++..++.++.|+++.+=..+....+ -...+... T Consensus 171 ~lp~~k~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~~a~~~ek~~l~~al~~~~~~ 250 (491) T protein:vir:79 171 ELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSASDAETNLLLDRLEDMVQD 250 (491) T ss_pred eecCCCeEEEEecCCCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCCCCHHHHHHHHHHHHHHhcC Confidence 00112222222111 35668888887766555556667899999999999999876322222221 12334455 Q ss_pred eeeeccCCCceeEEeec---CCHHHHHHHHHHHHHHHHHHh--CCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 300 KVVGTGSDGGLDVKTVT---IPTEGRKTKMEIDKENIYKFG--MAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLR 374 (510) Q Consensus 300 ~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~l~~~i~~~s--~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~ 374 (510) ....++.+.++++++.. .+...++..++.+.+.|...- +|... ..+++.+.|..-.- -....+..-.+.+. T Consensus 251 a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt-~~~gs~a~~~vh~~---v~~~i~~~D~~~i~ 326 (491) T protein:vir:79 251 AVAVIPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIALLGQNQTT-EATSTRASAQAGLE---VTDDIRDGDKAIVV 326 (491) T ss_pred eEEEecCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHHhhhhhcc-CcccchhhHHHHHH---HHHHHHHHHHHHHH Confidence 67778999999999754 245678889998888887653 33333 23333343333222 12233344456666 Q ss_pred HHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCH-HHHHHHHHHHHhcCC-CchHHHHHhCCCCCcHHHHHHHHH Q lcl|NC_013644. 375 ALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNE-TDIVNDEKTEAETRK-IILESILQVAPRLDDDNVLRLICE 452 (510) Q Consensus 375 ~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~-~e~~~~~~~~~~~g~-iS~et~~~~~~~v~d~e~~~~~~e 452 (510) ..+.++++-++.+- +.. ...+.+.|.. +.+. ...++.+.++...|+ ++.+.+.+.++.-.....+. T Consensus 327 ~tln~li~~l~~~N---~~~---~~~p~f~~~e--~ee~~~~~a~~~~~L~~~G~~i~~~~~~e~~Gip~~~~~e~---- 394 (491) T protein:vir:79 327 EAMNMLIRWICDLN---FDG---AARPVFDMWE--QEQVDEIQAGRDEKLTRAGARFTPAYFKRAYNLQDGDLDER---- 394 (491) T ss_pred HHHHHHHHHHHHhc---CCC---CCcceEeecC--cCchhHHHHHHHHHHHhCCCccCHHHHHHHhCCCCCCCCcc---- Confidence 77777666655442 211 1223344443 3333 457888999999997 68787877776422111000 Q ss_pred HHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccccC--------------CCC Q lcl|NC_013644. 453 QFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLP--------------ENG 510 (510) Q Consensus 453 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~ 510 (510) ... .+.+........... ..++..+ .+.-...-...+.. .++ T Consensus 395 -----------~~~-~~~~~~~~~~~~~~~--~~~~~~~--~d~~~~~~~~~~~~~~~~~~~~~i~~~l~~~ 450 (491) T protein:vir:79 395 -----------PLP-VSAVDAVGAASFAEF--EAPDQDA--LDAALNALSARDLNADAQALVAPLLKRIANG 450 (491) T ss_pred -----------ccC-cCccccccccccccc--CCCCCcc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 000 000000000000000 0000000 00000000000000 000 No 222 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=96.15 E-value=0.00094 Score=37.18 Aligned_cols=447 Identities=13% Similarity=0.069 Sum_probs=162.7 Q ss_pred CCCccCCChhhhHHHHHHHHHh---------------------------hhh-hhhHHHHHHHHHHhccCCcchhcccce Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDK---------------------------DRK-SSSKREAETGIRYYNHENDIMNNRIFY 52 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~---------------------------~~~-~~~~~~~~~~~~YY~g~~~i~~~~~~~ 52 (510) |--+..=..+...+...+..+- .++ .+-.++|+.+..+++- T Consensus 1 m~~lfgf~~~~~~~~~~~~~s~~~p~~ddg~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEv----------- 69 (558) T protein:vir:10 1 MAKLFGFSIEETQKKSTSIISPVPKNNEDGVDNFISSGFYGQYVDIEGAYRSEYDLIRRYREMALHPEA----------- 69 (558) T ss_pred CcchhcchhhhhhhhccCCccccCCCccccccceeccceeeeeecccchhhhHHHHHHHHHHHhhccch----------- Confidence 2222111111111100000000 000 0001111112111211 Q ss_pred eccccccccccccccceeccchhHHHHHHHHhh-hhcCCceeccCc----HHHHHHHHH----Hhc-cCHHHHHHHHHHH Q lcl|NC_013644. 53 VDDEGILREDKYASNVRIPHGFFPEIVDQKTQY-LLSNPVEYETEN----EELKEYLAE----YYN-SEFQVVLQELVEG 122 (510) Q Consensus 53 ~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~-l~g~p~~~~~~d----~~~~~~l~~----~~~-n~~~~~~~e~~~~ 122 (510) .+-...||+..+-+ ....||.+..++ +...+.|.+ +++ =+|+.+.++..+. T Consensus 70 -------------------d~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~ 130 (558) T protein:vir:10 70 -------------------DGAIEDVVNEAIVSDLYDSPVEVELSNLNASNTLKKKIREEFRYIKEMMDFDKKSHEIFRN 130 (558) T ss_pred -------------------hhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhh Confidence 12233444433322 234566655443 223344333 322 3678888999999 Q ss_pred HHhcCeEEEEEEECCC----CceEEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeE--EEEEEEEcCCcEEEEE Q lcl|NC_013644. 123 SSQKGFEYVYARTNAE----DRLCFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVD--IHHAEVWTDQNVYFFV 196 (510) Q Consensus 123 ~~~~G~~~~~v~~d~~----g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~--~~~~e~y~~~~i~~~~ 196 (510) +.+.|+.|++..+|.+ |-..+..+||+.+-.|..--.+..-.-.+....... +.... +..+-+|.+...++.. T Consensus 131 WYVDgRiyfHKiid~k~pk~GI~ELr~lDPr~i~~Vr~i~~~~~~~~~~~~~~~~~-~~~~~~~~~eyy~Y~~~~~~~~~ 209 (558) T protein:vir:10 131 WYVDGRVFYLKVIDTKNPQEGIQDLRYIDPLKIKFIRQEKRKPGNQDPAIRVRSEQ-DVVPNPEFEEFYIYTPKVQHPTG 209 (558) T ss_pred heeeeEEEEEEEEeCCCccccceeeeeeCcccceeeeeeccccccccceeeeeccc-ceeeccceeEeeeecCCcccccc Confidence 9999999999888743 667899999999877664211111010111111100 00000 1111223333221111 Q ss_pred EcCCceeecccccccccccccccccccccccccccCCccc--EEEecCC----CCCCCcHHHHHHHHHHHHH--HHHHHH Q lcl|NC_013644. 197 AEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIP--FYRLSNN----KQETTDLKPIKALIDDYDL--MNCFLS 268 (510) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP--vv~~~nn----~~g~sd~~~v~~liD~~n~--~~S~~~ 268 (510) . .+. .++ .+++ +|| .|.|... ..+--.+.-+...|..+|. ++-|.+ T Consensus 210 ~-~~~---------~~~---------------~~~v-kI~~dAI~y~hSGL~d~~~~~i~syLhkAIKp~NQLkmlEDAl 263 (558) T protein:vir:10 210 M-VGQ---------MGG---------------KNSI-KIAKDSITMCTSGLVDRNKNRVLSYLHKAIKALNQLRMIEDSL 263 (558) T ss_pred c-cee---------ecC---------------CCce-eechhheeeecccceecCCCeeeecchHhhHhHHhhHHHHhhH Confidence 0 000 000 0000 222 1111100 0000112223344444443 344444 Q ss_pred HHHHHhccceeEE----ec-CCCCchhh----hhHhhhcCeeee--------------------c---cCCCceeEEeec Q lcl|NC_013644. 269 NNLQDFAEAIYVV----SG-FQGDDLSK----LRQNVKSKKVVG--------------------T---GSDGGLDVKTVT 316 (510) Q Consensus 269 ~~~~~~~~~~lv~----~g-~~~~~~~~----~~~~~~~~~~~~--------------------~---~~~~~~~~~~~~ 316 (510) -..+..+.|=.=+ .| +...-... .+...+...+.. + +++.+.+.-|.+ T Consensus 264 VIYRitRAPERRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLp 343 (558) T protein:vir:10 264 VIYRLSRAPERRIFYIDVGNLPKVKAEQYLKEVMSRYRNKLVYDANTGEVRDDRKFMSMMEDFWLPRREGGRGTEITTLP 343 (558) T ss_pred HHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhhhcccccCCCCccceeecc Confidence 4444444432211 11 11111111 111111111111 1 112223333333 Q ss_pred --CCHHHHHHHHHHHHHHHHHHhCCccc--cccc---cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_013644. 317 --IPTEGRKTKMEIDKENIYKFGMAFDS--TQVG---DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDIN 389 (510) Q Consensus 317 --~~~~~~~~~~~~l~~~i~~~s~~p~~--~~~~---~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~ 389 (510) .+.. -..-+.-+++.+|+.-.+|-. ...+ .|..|. |.......-.-+.+-+..|...+.++++.=+-+-+ T Consensus 344 GgqnLg-em~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~E--ItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKg 420 (558) T protein:vir:10 344 GGQNLG-ELSDVDYFQKKLYRALGVPESRIAAEGGFNLGRSSE--ILRDELKFAKFVGRLRKRFAAMFNDMLKTQLVLKN 420 (558) T ss_pred ccCCcc-hHHHHHHHHHHHHHHhCCCccccCCCCcccccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Confidence 2222 233455566677776677742 2222 233332 32223333344556666777777776665443334 Q ss_pred hccCCcccc--ceeeEEeCCCCCCCHHHHHHH-------HHHHHh--cCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHH Q lcl|NC_013644. 390 RRYTKAFDP--TEVSFTFTREVMVNETDIVND-------EKTEAE--TRKIILESILQVAPRLDDDNVLRLICEQFDLDW 458 (510) Q Consensus 390 ~~~~~~~~~--~~v~i~f~~~~p~d~~e~~~~-------~~~~~~--~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~ 458 (510) +....+|+. ..+.+.|...-.-.+...++. +..+.. +..+|.+++.+.+=-.+|+|..++....+++.. T Consensus 421 iit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeeI~~~~kqI~~E~k 500 (558) T protein:vir:10 421 IVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYSTEYVRKRVLRQTDMEIEEIDTQIEDEIQ 500 (558) T ss_pred CCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHh Confidence 444445543 357778865555444433333 222222 334699999998666666654332222222111 Q ss_pred HH-HHHHHHhhhccCC-CCCCCCCcccCCCCCCcccccccCcccccc------cc-cCC Q lcl|NC_013644. 459 ED-VKEALEEAEYTKG-LSDNTDEEETAVNPDDPTQQMAEGATGSTE------SQ-LPE 508 (510) Q Consensus 459 ~~-~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~-~~~ 508 (510) .. ..+..+.++...+ ....++... ..-+..+.++...+++.... .+ .-. T Consensus 501 ~~~~~~p~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 558 (558) T protein:vir:10 501 KGIIPDPSQIDPITGEPLPQEGDPAM-EGMGEQPVDPDLEAQAQAVDAQYSKDTKKAEL 558 (558) T ss_pred CCCCCCccccChhhccccCccCCchh-ccCCCCCcccccccchhhhhhhhhhhhhhhcC Confidence 11 1111111111110 000000000 00011111111222111111 11 111 No 223 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=96.12 E-value=0.00099 Score=37.07 Aligned_cols=389 Identities=7% Similarity=-0.040 Sum_probs=153.9 Q ss_pred HHHHHhhhhhhhHHH-HH-HHHHHhccCCcchhc-ccceec-ccccc---cccc---c---cccceeccchhHHHHHHHH Q lcl|NC_013644. 17 KAAIDKDRKSSSKRE-AE-TGIRYYNHENDIMNN-RIFYVD-DEGIL---REDK---Y---ASNVRIPHGFFPEIVDQKT 83 (510) Q Consensus 17 ~~~i~~~~~~~~~~~-~~-~~~~YY~g~~~i~~~-~~~~~~-~~~~~---~~~~---~---~~~~ki~~n~~~~Iv~~~~ 83 (510) --+.+.++....+.- .. .++-..........- .+.... ..+.. .... . .+..=+.+.-....|+..+ T Consensus 1 Mgl~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~ci~~Ia 80 (431) T protein:vir:10 1 MGLFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGELNGGTGRETRALRNMAVLRCVTLIS 80 (431) T ss_pred CcchhhhhcCcccccccccccccccccccccccccccccccccchHHHHhhccCccCcceechhhhhccHHHHHHHHHHH Confidence 111111111000000 00 000000000000000 000000 00000 0000 0 0000011222334445555 Q ss_pred hhhhcCCcee-ccCcH---HHHHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEE Q lcl|NC_013644. 84 QYLLSNPVEY-ETENE---ELKEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFG 153 (510) Q Consensus 84 ~~l~g~p~~~-~~~d~---~~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~ 153 (510) +-+-+-|+++ ..++. ....-+..+++ |. -......++...+.+|.+|+++..+....+.+..++|..+.+ T Consensus 81 ~~iA~lp~~v~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~g~~~~L~pl~~~~v~~ 160 (431) T protein:vir:10 81 GTIGMLPMNLISSDDSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSGNRPIRLIPMDRGSAKG 160 (431) T ss_pred HhhccCceEEEEecCceeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCceEEEEEEcCceeEE Confidence 5555667664 22111 11122333332 22 234455677888999999999888853345678889988888 Q ss_pred EEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCC Q lcl|NC_013644. 154 VYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYG 233 (510) Q Consensus 154 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 233 (510) +.++.+.+ +|.+.... +.. ..+....+.+++.-. T Consensus 161 ~~~~~~~~-----~y~~~~~~-g~~------~~~~~~dViHir~~~---------------------------------- 194 (431) T protein:vir:10 161 RLTSTWQI-----VYDYTTPT-GDK------IELPAREVFHLRDLS---------------------------------- 194 (431) T ss_pred EEcCCCeE-----EEEEEeCC-ceE------EEEchhhEEEecCcC---------------------------------- Confidence 76644321 22221111 110 012222333322100 Q ss_pred cccEEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCch---hhhhHhhh--------cCeee Q lcl|NC_013644. 234 QIPFYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDL---SKLRQNVK--------SKKVV 302 (510) Q Consensus 234 ~iPvv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~---~~~~~~~~--------~~~~~ 302 (510) .+...|.|.++-+...+........-..+.+...+.|-.+++-...-+. ..++..+. .++++ T Consensus 195 -------~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~g~~n~g~~~ 267 (431) T protein:vir:10 195 -------IDGVSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIEVPKELSDNAYGRMKASVQENHTGSENAGSWM 267 (431) T ss_pred -------CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCCCHHHHHHHHHHHHHHhcCccccCCce Confidence 0112355655555544444333333334444555566555543222111 22222221 12456 Q ss_pred eccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 303 GTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNK 382 (510) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~ 382 (510) .++++.+.+.++.......+.+..+...+.|+..-++|+.-.+.....++..++.... ..+...|.-.++ T Consensus 268 vl~~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~----------~f~~~tL~P~~~ 337 (431) T protein:vir:10 268 LLEEGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTSWGSGIEQLAI----------FFIQYGLSHWFV 337 (431) T ss_pred ecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCCccccHHHHHH----------HHHHHHHHHHHH Confidence 6666666655554444445556667777888888888886443322222222222222 222223333333 Q ss_pred HHHHHHhhccC--CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCC----CchHHHHHhCCC--CCcHHHHHHHHHHH Q lcl|NC_013644. 383 LVIDDINRRYT--KAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRK----IILESILQVAPR--LDDDNVLRLICEQF 454 (510) Q Consensus 383 ~i~~~~~~~~~--~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~----iS~et~~~~~~~--v~d~e~~~~~~e~~ 454 (510) .|...++.+-- .......+++.+..-+..|..+.++.+.++..+|+ |+.-.++++++. ++++...+ T Consensus 338 ~ie~~ln~~Ll~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~~g~lT~NE~R~~~gl~p~~~~~gD~------ 411 (431) T protein:vir:10 338 SWEQAAARAFLPEKMLGQRQFKFNEGALLRGTLNDQAAFFSKALGAGGQSPWMKQNEVREMLDLPRADDPVADQ------ 411 (431) T ss_pred HHHHHHHhhccChhhcCCceEEEechhhhccCHHHHHHHHHHHHhcccccCccCHHHHHHHhCCCCCCCccccc------ Confidence 33333322110 01111234555556677899999999999988776 555555554332 22111000 Q ss_pred HHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcc Q lcl|NC_013644. 455 DLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGAT 499 (510) Q Consensus 455 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (510) .....+ .. ..++..+ ++.++ T Consensus 412 ------~~~p~n---------------~~--~~~~~~~--~p~~~ 431 (431) T protein:vir:10 412 ------LRNPMT---------------QK--QKGSGDE--PPATT 431 (431) T ss_pred ------eecccc---------------cc--cCCCCCC--CCCCC Confidence 000000 00 0000000 11111 No 224 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=96.08 E-value=0.001 Score=36.95 Aligned_cols=461 Identities=10% Similarity=0.015 Sum_probs=168.2 Q ss_pred CCCccCCChhhhHHHHHHHHHh-----hhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchh Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDK-----DRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFF 75 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~-----~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~ 75 (510) |.+-..+-. ..+.+.... -+.+..-+.+- -.++|.++.-| . +++ ......+..-..++. T Consensus 1 ~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~-~-p~~---------~~~~L~~~~e~~~~~ 64 (651) T protein:vir:99 1 MTDTTGETQ----ETKVHVEGLGGEADLAKSPNSTQIP-DHRIQSHNVGV-N-PPY---------NPDRLAAFLELNETL 64 (651) T ss_pred CCCccceee----eeEEEeecccccccccccccccccc-hhhhcccCCCC-C-CCC---------CHHHHHHHHhcChHH Confidence 444332111 111111100 00011111111 12334333322 1 111 011111111235788 Q ss_pred HHHHHHHHhhhhcCCceecc------C--cHHHHHHHHHHhcc----------------CHHHHHHHHHHHHHhcCeEEE Q lcl|NC_013644. 76 PEIVDQKTQYLLSNPVEYET------E--NEELKEYLAEYYNS----------------EFQVVLQELVEGSSQKGFEYV 131 (510) Q Consensus 76 ~~Iv~~~~~~l~g~p~~~~~------~--d~~~~~~l~~~~~n----------------~~~~~~~e~~~~~~~~G~~~~ 131 (510) ...|+..+..+.|-++.+.. + .+.-.+....+|.+ .+...+..+..+...+|.+|+ T Consensus 65 ~~~i~~~~~~iag~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~i 144 (651) T protein:vir:99 65 ATGIRKKSRYEVGFGFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLAL 144 (651) T ss_pred HHHHHHHhhhhhccCceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhh Confidence 99999999999888766432 1 12222333333321 233455556667778888887 Q ss_pred EEEECCCCce-EEEEEcccceEEEEcCC---------------CCceeE-----------EEEEEEEEeeCCceeEEEEE Q lcl|NC_013644. 132 YARTNAEDRL-CFQVADSLNVFGVYNEY---------------NELQRI-----------CRHYITEIEKDGETVDIHHA 184 (510) Q Consensus 132 ~v~~d~~g~~-~i~~~~p~~~~~~~d~~---------------~~~~~~-----------~~~~~~~~~~~~~~~~~~~~ 184 (510) -+..+..|.+ .+..+++.-+-..-++. ..+... ..+|........ .....+ T Consensus 145 eiIrn~~g~pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~--~~~~~~ 222 (651) T protein:vir:99 145 EMLTDIEGRPVGLAYVPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRY--RGQEVV 222 (651) T ss_pred hhhhcCccchhhhhhcChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccc--cceeee Confidence 7767666653 23333333221100000 000000 000000000000 000000 Q ss_pred EEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCccc---EEEecCC-----CCCCCcHHHHHHH Q lcl|NC_013644. 185 EVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIP---FYRLSNN-----KQETTDLKPIKAL 256 (510) Q Consensus 185 e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~nn-----~~g~sd~~~v~~l 256 (510) .......+............ .. ...+........ ........+| |+||++. ..|+|.+..+... T Consensus 223 ~~~~~~~v~~~~~~d~~~~~-~~-~~~~~~~g~~~~------~~~~~~~~~~~~eViHir~~~~~~g~~G~spl~~a~~~ 294 (651) T protein:vir:99 223 IDESGDEPTIRYREDEESER-EP-IFVDRETGDVTT------GDANGLENRPANELIFIPNPSILEDDYGVPDWVSAIRT 294 (651) T ss_pred eccCCcceeEEeccCcceee-ee-ecccceeeeEEE------cCCCceeEecccceEEecCCCCCCCcccccHHHHHHHH Confidence 00000000000000000000 00 000000000000 0000111223 6777643 2467777766665 Q ss_pred HHHHHHHHHHHHHHHHHhccceeEEe--cCCCCc--hhhhhHhhh-----cCeeeeccC---------CCceeEEeecCC Q lcl|NC_013644. 257 IDDYDLMNCFLSNNLQDFAEAIYVVS--GFQGDD--LSKLRQNVK-----SKKVVGTGS---------DGGLDVKTVTIP 318 (510) Q Consensus 257 iD~~n~~~S~~~~~~~~~~~~~lv~~--g~~~~~--~~~~~~~~~-----~~~~~~~~~---------~~~~~~~~~~~~ 318 (510) +.....+-.-..+.+...+.|-.++. |...++ ...+...++ .++++.++. +.+++|...... T Consensus 295 i~~a~~a~~~~~~~f~NG~~p~gil~~~~~~ls~e~~~~lr~~~~~~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~ 374 (651) T protein:vir:99 295 ISADEAAKDYNRDFFDNDTIPRMVIKVTGGELSEESKRDLRQMLNGLREESHRAVVLEVEKFQSQLDEDVEIELEPMGQG 374 (651) T ss_pred HHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHHhccCCceEEeecccccccccccCCceEEEcCcC Confidence 55444444444455555556666654 422221 112222111 123333332 235666654432 Q ss_pred ---HHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCc Q lcl|NC_013644. 319 ---TEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKA 395 (510) Q Consensus 319 ---~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~ 395 (510) ...+.+..+.....|...-++|+.-.+.....++..++... ...+...|.-+++.|...++.+--.. T Consensus 375 ~~~D~qfle~r~~~~~eIa~afgVPp~~lG~~~~~~~sn~E~~~----------~~f~~~tL~P~~~~ie~eln~kLl~~ 444 (651) T protein:vir:99 375 ISEEMDFRQFREKNEHEIAKVLEVPPVKIGVTDSANRSNSDQQD----------KDFALEVIQPEQHTFAEWLYQIIHQQ 444 (651) T ss_pred chhhHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCcccHHHHH----------HHHHHHHHHHHHHHHHHHHHHhhcCc Confidence 34556667777888888888887543322222111111111 12233334444444444333221110 Q ss_pred ---cccceeeEEeC--CCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_013644. 396 ---FDPTEVSFTFT--REVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALEEA 468 (510) Q Consensus 396 ---~~~~~v~i~f~--~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~~~ 468 (510) ..-..+.+.|+ .-+-.|....++.+.++.++|+++.-.++++++. +.++...... ...+.. T Consensus 445 ~e~~~~~~i~~ef~~~~llr~D~~~~~e~~~~~i~~G~~T~NE~R~~lglppi~~~~gd~~l------------~~~~~~ 512 (651) T protein:vir:99 445 ALGVTDWTIEYELRGADQPKQEAQLAEQRVRAMRLAGVGLVDEAREELGLDPLGEPYGEMTL------------SEFEAE 512 (651) T ss_pred cccccCceEEEEeccchhhhccHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccc------------cccccc Confidence 11113445554 4556788999999999999999999889888743 3322110000 000000 Q ss_pred hccCCCCCCCCCcccCCCCCCcccccccCcccccccccCC-------C-------C Q lcl|NC_013644. 469 EYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLPE-------N-------G 510 (510) Q Consensus 469 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~-------~ 510 (510) .... ...+++.+.....+.++....-.+.+..+.-+.|+ . | T Consensus 513 ~~g~-~~~gge~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~~~~v~ss~~~~~g 567 (651) T protein:vir:99 513 VAGD-VAGGGETEAVHEPPEENKIGEREWDTVKSELTTKDPIEQMQFSSSNLDEGL 567 (651) T ss_pred cccc-cccCCCCcccccCccccccccchhhhhhhhhcccchhhhhhHHHHHHHhhc Confidence 0000 00000000000000000000000001111111111 1 1 No 225 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=95.98 E-value=0.0012 Score=36.65 Aligned_cols=433 Identities=8% Similarity=0.043 Sum_probs=170.8 Q ss_pred CCCccC---CChhhhHHHHH-HHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhH Q lcl|NC_013644. 1 MEALLS---EDVKIIANALK-AAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFP 76 (510) Q Consensus 1 ~~~~~~---~~~~~~~~~i~-~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~ 76 (510) +....- ......+..-. ...+ + ..--++.+ .+|.+.. +..++....-. -++-.+ T Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~~-~----~~~~~~~l-~~~~~~~-F~Gy~~la~la---------------Q~~eyr 126 (695) T protein:vir:78 69 LARQFEVDVSNYTPRERRAASYALD-F----NGTSMDAL-SFVTSSG-FPGFPTLVLLA---------------QLPEYR 126 (695) T ss_pred cceeceeccccCCccccchhhhhhc-c----cccccccc-hhhhccC-cchHHHHHHHh---------------hccchh Confidence 000000 00000000000 0000 0 00001111 1222211 00000000000 001111 Q ss_pred HHHHHHHhhhhcCC---------------ce----ecc-CcHHHHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEE Q lcl|NC_013644. 77 EIVDQKTQYLLSNP---------------VE----YET-ENEELKEYLAEYY-NSEFQVVLQELVEGSSQKGFEYVYART 135 (510) Q Consensus 77 ~Iv~~~~~~l~g~p---------------~~----~~~-~d~~~~~~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~ 135 (510) .++.+.+..+.-+- +. ... .+.+..+.|..-+ +=++...+.++.+++-.||.+..++-+ T Consensus 127 ~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i~i 206 (695) T protein:vir:78 127 AMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKI 206 (695) T ss_pred hHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEe Confidence 11222222211111 01 111 1223334444443 335778899999999999998877766 Q ss_pred CCCCc-----------------e-EEEEEcccceEEEEcCCCCce-eEEE---EEEEEEeeCCceeEEEEEEEEcCCcEE Q lcl|NC_013644. 136 NAEDR-----------------L-CFQVADSLNVFGVYNEYNELQ-RICR---HYITEIEKDGETVDIHHAEVWTDQNVY 193 (510) Q Consensus 136 d~~g~-----------------~-~i~~~~p~~~~~~~d~~~~~~-~~~~---~~~~~~~~~~~~~~~~~~e~y~~~~i~ 193 (510) +.++. + -+.+++|.++.|-.-+..++. +-+. +|++. +. ++| ..+.. T Consensus 207 ~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~----G~-------kIH-~SRL~ 274 (695) T protein:vir:78 207 KGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMI----GT-------EVH-ATRLH 274 (695) T ss_pred ccCccccccccccccccccCcceeeeEeecccccccchhhhccchhhccCCCceEEEe----ce-------EEe-eeeEE Confidence 55431 1 166677777777321111111 0000 01100 00 000 11111 Q ss_pred EEEEcCCceeecccccccccccccccccccccccccccCCcccEEE-ecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 194 FFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYR-LSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQ 272 (510) Q Consensus 194 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~-~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~ 272 (510) .|... .+|-+. -.++-.|.|...-+.+-+++.+.+.-..+..+. T Consensus 275 ~f~g~-----------------------------------plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~ 319 (695) T protein:vir:78 275 TIVSR-----------------------------------PVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVK 319 (695) T ss_pred EecCC-----------------------------------CchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHH Confidence 11100 011000 002234677777777777777776655555554 Q ss_pred HhccceeEE---ec-CCCCchh-----hhhHhhh-cCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc Q lcl|NC_013644. 273 DFAEAIYVV---SG-FQGDDLS-----KLRQNVK-SKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDS 342 (510) Q Consensus 273 ~~~~~~lv~---~g-~~~~~~~-----~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~ 342 (510) .+....+.. .. +.+.+.. +.....+ .++++.+++ ++=+|.+.+.+...+...+....+.|...+++|-+ T Consensus 320 ~~~v~~lk~dla~~L~~g~~~~l~~R~eli~~~Rsn~G~~llDk-~~Eefeq~stslSGLddVi~qf~q~VAgaa~IPlt 398 (695) T protein:vir:78 320 QFSVSGILMDLAQALMPGANVDLSMRAELINRYRDNRNILFLDK-ATEEFFQFNTPLSGLDALQAQAQEQMSAVSHIPLI 398 (695) T ss_pred hhhhHHHHHHHHHhhcChhHHHHHHHHHHHHHhcCccceEEEec-CCcceEEEecccCCHHHHHHHHHHHHHhhhcCchh Confidence 333332210 00 0111111 1111122 344555653 22367777889999999999999999999999976 Q ss_pred ccccc---C-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHH Q lcl|NC_013644. 343 TQVGD---G-NITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVN 418 (510) Q Consensus 343 ~~~~~---g-~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~ 418 (510) -+-+. | |+||..=...|...+. ...++.++..|++++.+|.. +..+.. ++ +++++|++-..-+++|+|+ T Consensus 399 kLfGqSPkGlNATGE~D~rnYYD~I~--s~Qe~~L~p~L~rl~~ii~r--S~~G~i--dp-di~~~fnPL~qmtd~EkAe 471 (695) T protein:vir:78 399 KLLGITPTGLNASSEGEIRVWYDYVR--AYQRNALQQLMNDVIVMIQL--SLFGAV--DP-SIKWQWNALRELDDLEVAE 471 (695) T ss_pred hhhccCCccccccchhhHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH--HhcCCC--CC-cceEEeCCCCCcCHHHHHH Confidence 43321 2 5788875555665555 44578899999998877643 333332 33 5889999999888888887 Q ss_pred HH-------HHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccC-CCCCCc Q lcl|NC_013644. 419 DE-------KTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETA-VNPDDP 490 (510) Q Consensus 419 ~~-------~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 490 (510) .. +.....|+|+...+..++-.-.+----..++...+--.....+.....+.......+++...++ ..+|.+ T Consensus 472 I~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 551 (695) T protein:vir:78 472 SRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARAGAT 551 (695) T ss_pred HHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCCCCCCCC Confidence 63 3344567776655555431100000000000000000000000000000000000001000000 011111 Q ss_pred ccccccCccccc-----ccc---cCCCC Q lcl|NC_013644. 491 TQQMAEGATGST-----ESQ---LPENG 510 (510) Q Consensus 491 ~~~~~~~~~~~~-----~~~---~~~~~ 510 (510) .-+.....-++- +.| ++.-| T Consensus 552 ~~~~~~~~~~~~~~~~ag~~~~~~~aag 579 (695) T protein:vir:78 552 APPTVANVNANVKPREAGAQDAAMRAAG 579 (695) T ss_pred CCCceeeeeccccccccCCCCcccceeE Confidence 111000000000 000 11111 No 226 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=95.88 E-value=0.0013 Score=36.37 Aligned_cols=370 Identities=7% Similarity=0.005 Sum_probs=144.8 Q ss_pred HHHHHHHhccCCcchhcccceecc--cccccccc-ccccceec-cchhHHHHHHHHhhhhcCCcee-c-cCc--HHHHHH Q lcl|NC_013644. 32 AETGIRYYNHENDIMNNRIFYVDD--EGILREDK-YASNVRIP-HGFFPEIVDQKTQYLLSNPVEY-E-TEN--EELKEY 103 (510) Q Consensus 32 ~~~~~~YY~g~~~i~~~~~~~~~~--~~~~~~~~-~~~~~ki~-~n~~~~Iv~~~~~~l~g~p~~~-~-~~d--~~~~~~ 103 (510) |-.+ ++++-+..- ........ ........ ..+..++. .+.....|+..++-+.+-|+.+ . .++ ...... T Consensus 1 Mg~~-~~f~~k~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~V~~~I~~ia~~iA~~p~~~~~~~~~g~~~~~~~ 77 (403) T protein:vir:80 1 MGLF-NFFRRKTRS--EPTNAISWFLTQEAYDTLAIPGYTRLSDNPEVRMAVHKIAELISSMTIHLMQNTDNGDIRIKNE 77 (403) T ss_pred Cccc-ccccccccc--cccchhhhhcccccccccccchhhhhhhhHHHHHHHHHHHHhhhhCceEEEEecCCceeecCCh Confidence 0000 011100000 00000000 00000000 00001111 1223344566666666667764 1 111 112222 Q ss_pred HHHHhc---cCH---HHHHHHHHHHHHh--cCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCceeEEEEEEEEEee Q lcl|NC_013644. 104 LAEYYN---SEF---QVVLQELVEGSSQ--KGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNELQRICRHYITEIEK 174 (510) Q Consensus 104 l~~~~~---n~~---~~~~~e~~~~~~~--~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~ 174 (510) +...+. |.. ......++...+. .|.||+++..+..|++ .+.+++|..+-++.++.+. ++++. T Consensus 78 ~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g~~~~L~~l~p~~v~~~~~~~g~-----~~~y~---- 148 (403) T protein:vir:80 78 LSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSGLIDELIPLAPSKVSFVDTDTGY-----QIWYQ---- 148 (403) T ss_pred HHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCCcEEEEEEEcCCeeEEEEcCCce-----EEEEe---- Confidence 333332 221 2333444555555 4668888878887876 5778888888776654431 11111 Q ss_pred CCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCC-CCCCCcHHHH Q lcl|NC_013644. 175 DGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNN-KQETTDLKPI 253 (510) Q Consensus 175 ~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn-~~g~sd~~~v 253 (510) ...|....+.+|+... .| .+ -.|.|-+..+ T Consensus 149 ---------~~~~~~~eiih~~~~~-----------------------------------~~-----~~~~~G~s~~~~~ 179 (403) T protein:vir:80 149 ---------GKAYNYDEVLHFIVNP-----------------------------------DP-----EKPYMGRGYRVVL 179 (403) T ss_pred ---------ecccchhhEEEEeccC-----------------------------------CC-----cCccccccHHHHH Confidence 0112233333332110 00 01 1244544444 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccceeEEecCCC-C--chhhhhHhh--------hcCeeeeccCC-CceeEEe-ecCCHH Q lcl|NC_013644. 254 KALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQG-D--DLSKLRQNV--------KSKKVVGTGSD-GGLDVKT-VTIPTE 320 (510) Q Consensus 254 ~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~-~--~~~~~~~~~--------~~~~~~~~~~~-~~~~~~~-~~~~~~ 320 (510) ...+........-..+.+...+.|-.++.-... . ........+ ..++.+.++.+ .+..-++ .+.... T Consensus 180 ~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~l~~~d~ 259 (403) T protein:vir:80 180 KDIVNNLKQATTTKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFKKYLEASEAGQPWIIPAELLDVEQVKPLSLKDL 259 (403) T ss_pred HHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCChHHHHHHHHHHHHHHhhhhhcCCeeeecccccccceeccCCHHHH Confidence 444444333333333444445556666543221 1 111111111 12233333322 2222222 222233 Q ss_pred HHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccce Q lcl|NC_013644. 321 GRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTE 400 (510) Q Consensus 321 ~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~ 400 (510) .+.+..+.....|+..-++|+.-.+...+.+... . ..+...|.-+++.|...+..+--.. .+ T Consensus 260 q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~-----~----------~f~~~~l~P~~~~ie~~l~~kll~~---~~ 321 (403) T protein:vir:80 260 AIHETVELDKRTVAGIFGVPAFLLGVGKYDKDEY-----N----------NFINSTILPIAKGIEQELTRKLLIS---PD 321 (403) T ss_pred HHHHHHHHhHHHHHHHhCCCHHHcCCCCccHHHH-----H----------HHHHHHHHHHHHHHHHHHHHhccCC---CC Confidence 4456666777778887778864332211111111 1 1333445554444444443321111 12 Q ss_pred eeEEe--CCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCC Q lcl|NC_013644. 401 VSFTF--TREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNT 478 (510) Q Consensus 401 v~i~f--~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~ 478 (510) ..+.| +.-+..|..+.++.+.++..+|+++.-.+++.++.-..+.-.+ .....+..+- +..+.. T Consensus 322 ~~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~p~~ggd~------------~~~~~n~~pl--~~~~~~ 387 (403) T protein:vir:80 322 LYFKFNPRSLYAYDLKELAEVGSNMYVRGLMEGNEVRDWLGLSPKEGLSE------------LVILENYIPL--DKIGDQ 387 (403) T ss_pred cEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCe------------Eeecccccch--hhccch Confidence 33455 4667789999999999999999999988888765422110000 0000111100 000000 Q ss_pred CCcccCCCCCCccccc Q lcl|NC_013644. 479 DEEETAVNPDDPTQQM 494 (510) Q Consensus 479 ~~~~~~~~~~~~~~~~ 494 (510) ...+.+++.++..+.+ T Consensus 388 ~~~k~ge~~~~~~~~~ 403 (403) T protein:vir:80 388 NKLKGGEKGGADGQTD 403 (403) T ss_pred hhccCCCCCCCCCCCC Confidence 0011111111000100 No 227 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=95.82 E-value=0.0014 Score=36.23 Aligned_cols=340 Identities=10% Similarity=0.024 Sum_probs=138.7 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceec--cchhHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIP--HGFFPEI 78 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~--~n~~~~I 78 (510) |--.. .|+..... ....+.....+.. ....... .+.+.+ ..=.-.- T Consensus 1 M~~~~----------------~f~~r~~~-~~~~~~~~~~~~~--------------~~~~~~~-v~~~~al~~~av~~c 48 (359) T protein:vir:10 1 MSILN----------------PFERRSSI-TPNNYYPFMVQNG--------------SIVPNSL-VDATEALKNSDLYAV 48 (359) T ss_pred Ccccc----------------hhhccccC-CCCcchhhhhccc--------------cccCCcc-cCHHHhhcchHHHHH Confidence 11110 00000000 0000000000000 0000000 000000 1111123 Q ss_pred HHHHHhhhhcCCceeccCcHHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEc Q lcl|NC_013644. 79 VDQKTQYLLSNPVEYETENEELKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYN 156 (510) Q Consensus 79 v~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d 156 (510) |+..++-+-+-|+. ++.....++.+=.. ..-......+....+.+|.||+++-.+..|.+ .+.+++|..+.+..+ T Consensus 49 v~~ia~~ia~~p~~---~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~~l~~l~~~~v~i~~~ 125 (359) T protein:vir:10 49 TSLISSDIAGTRFI---GNQVFTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDNSLMKELRLIPSNAITIDLT 125 (359) T ss_pred HHHHHHhhhcCccc---cchHHHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEECCCCeEEEEEEeCCceEEEEEc Confidence 44444444455553 22223222221111 11233445566778889999999988888875 477888888776555 Q ss_pred CCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCccc Q lcl|NC_013644. 157 EYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIP 236 (510) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP 236 (510) +.. + .|.+....++. ...+.++.+.+++.-.... -| T Consensus 126 ~~~-----~-~y~~~~~~~~~------~~~~~~~evih~~~~~~~~--------------------------------~~ 161 (359) T protein:vir:10 126 DDT-----L-TYEVNQFDDYP------SAKYNASEMIHVKIMAYGV--------------------------------DT 161 (359) T ss_pred CCe-----E-EEEEEecCCce------EEEEcccceEEeccCCCCC--------------------------------Cc Confidence 321 1 12221111111 1123344444443210000 00 Q ss_pred EEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCC-Cch---hhhhHhhh-------cCeeeecc Q lcl|NC_013644. 237 FYRLSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQG-DDL---SKLRQNVK-------SKKVVGTG 305 (510) Q Consensus 237 vv~~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~-~~~---~~~~~~~~-------~~~~~~~~ 305 (510) . +.-.|.|-++.+...+.....+..-..+.++..+.|-.+++-..+ -+. ..+.+... .++++.++ T Consensus 162 ~----dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~~~~~~~n~g~~~vl~ 237 (359) T protein:vir:10 162 L----HNLVGHSPLESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQGTLSSEAKDSIRKEFEKANGGNNSGRVMVLD 237 (359) T ss_pred c----CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCceecC Confidence 0 111355656655555555444444445555556666666543211 111 11222221 12355566 Q ss_pred CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc--CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 306 SDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGD--GNITNIVIKARYTLLNMKANKTEARLRALLEWMNKL 383 (510) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~ 383 (510) ++.+.+.++.......+.+..+...+.|...-++|+.-.+.. .+.+...++..+...+ ...+.- T Consensus 238 ~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~e~~~~~~l--------------~~~l~p 303 (359) T protein:vir:10 238 QSADFSTVSINADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQSSLDQIKDLYVNAL--------------NRFIEP 303 (359) T ss_pred CCcceeeecCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCcccccHHHHHHHHHHHH--------------HHHHHH Confidence 655555554333333455667777888888888888644332 2233333333322221 111111 Q ss_pred HHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CC Q lcl|NC_013644. 384 VIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LD 442 (510) Q Consensus 384 i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~ 442 (510) ++.-+..+-... +.+.....+-.|.......+.++..+|+++.-.+++.++. |= T Consensus 304 ~~~~l~~~l~~~-----~~~~~~~~~~~d~~~~~~~~~~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 304 LISELRIKCDSS-----IGVDMSPITDYSNSVFKADILNWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred HHHHHHHHhhhh-----hcccchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 111111100000 0111111111223444556778899999998888776532 21 No 228 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=95.68 E-value=0.0016 Score=35.86 Aligned_cols=372 Identities=8% Similarity=0.004 Sum_probs=144.0 Q ss_pred hhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcC Q lcl|NC_013644. 10 KIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSN 89 (510) Q Consensus 10 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~ 89 (510) .-+-.|+...+. +.++. +.++ ..+-.+.... .....+.-.|. .-...-|+..++-+..- T Consensus 1 mg~~~~~~~~~~---~~~~~-----~~~~----~~~~~~~~~~-------~~~t~~~~~~~--~~v~~cv~~Ia~~ia~~ 59 (403) T protein:vir:10 1 MGFKSWITEKLN---PGQRI-----IRDM----EPVSHRTNRK-------PFTTGQAYSKI--EILNRTANMVIDSAAEC 59 (403) T ss_pred Ccchhhhhhccc---hhhhh-----hhcc----cccccccCCc-------ccccHHHHHHH--HHHHHHHHHHHHHHhhC Confidence 112233322211 01100 0011 0110000000 00000000111 11112233333434444 Q ss_pred Cceecc-------CcHHHHHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEc Q lcl|NC_013644. 90 PVEYET-------ENEELKEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYN 156 (510) Q Consensus 90 p~~~~~-------~d~~~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d 156 (510) |+++.. .+.....-+..+++ |. .......++.....+|.||++. +.. .+..++|..+.+.-+ T Consensus 60 p~~v~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~--~~~---~l~~l~~~~~~v~~~ 134 (403) T protein:vir:10 60 SYTVGDKYNIVTYANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYW--DGT---SLYHVPAALMQVEAD 134 (403) T ss_pred ceeEeecccccccccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEE--eCc---eeEeecCcceEEEEc Confidence 554311 11111122333332 22 2344555677888999998654 321 244556655544332 Q ss_pred CCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCccc Q lcl|NC_013644. 157 EYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIP 236 (510) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP 236 (510) ... .+..+. ..+ . ..|..+.+.+++ ... T Consensus 135 ~~~----~~~~~~---~~~-~-------~~~~~~eiih~~-------------------------------------~~~ 162 (403) T protein:vir:10 135 ANK----FIKKFI---FNN-Q-------INYRVDEIIFIK-------------------------------------DNS 162 (403) T ss_pred CCc----eEEEEE---ecC-c-------eeecccceEEec-------------------------------------ccc Confidence 211 110000 000 0 001111111111 000 Q ss_pred EEEe-cCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh---hhhHhhh--------cCeeeec Q lcl|NC_013644. 237 FYRL-SNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS---KLRQNVK--------SKKVVGT 304 (510) Q Consensus 237 vv~~-~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~---~~~~~~~--------~~~~~~~ 304 (510) +++. .+...|.|.+..+...++..+.+..-..+.+...+.|-.+++....-+.. .+...+. .++++.+ T Consensus 163 ~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl 242 (403) T protein:vir:10 163 YVCGTNSQISGQSRVATVIDSLEKRSKMLNFKEKFLDNGTVIGLILETDEILNKKLRERKQEELQLDYNPSTGQSSVLIL 242 (403) T ss_pred cccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhCCcccCcceeec Confidence 1111 12334666666666666655555544455565566676666543222211 2222111 1235556 Q ss_pred cCCCceeEEeecCC--HHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 305 GSDGGLDVKTVTIP--TEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNK 382 (510) Q Consensus 305 ~~~~~~~~~~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~ 382 (510) +++-+.+.++...+ ...+.+..+...+.|...-++|+.-.+...+.+-.. .....+...|.-.++ T Consensus 243 ~~g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~sn~e~-------------~~~~f~~~tl~P~~~ 309 (403) T protein:vir:10 243 DGGMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGGNNANIRP-------------NIELFYYMTIIPMLN 309 (403) T ss_pred CCCceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcCHHH-------------HHHHHHHHHHHHHHH Confidence 66656655554333 234455666777888888788875443221111111 112223333444444 Q ss_pred HHHHHHhhccCCccccceeeEEeCCC--CCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHH Q lcl|NC_013644. 383 LVIDDINRRYTKAFDPTEVSFTFTRE--VMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDW 458 (510) Q Consensus 383 ~i~~~~~~~~~~~~~~~~v~i~f~~~--~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~ 458 (510) .|...+..+-. ..+.+.|+.- +-.|..+.++.+.+++.+|+++.-.+++.++. ++++... T Consensus 310 ~ie~~l~~~L~-----~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl~pi~~~~~d----------- 373 (403) T protein:vir:10 310 KLTSSLTFFFG-----YKITPNTKEVAALTPDKEAEAKHLTSLVNNGIITGNEARSELNLEPLDDEQMN----------- 373 (403) T ss_pred HHHHHHHHhcC-----ceeeeccchhhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCccccc----------- Confidence 44443333221 1233344422 55578888999999999999999888887654 2221110 Q ss_pred HHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccc Q lcl|NC_013644. 459 EDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGS 501 (510) Q Consensus 459 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (510) ......+.........++. +++++++++ |+ T Consensus 374 -~~~~p~n~~~~~~~~~~~e-----~~~~~~~~~-------g~ 403 (403) T protein:vir:10 374 -KIRIPANVAGSATGVSGQE-----GGRPKGSTE-------GD 403 (403) T ss_pred -ccccccccccccccCCCCc-----CCCCCCCcC-------CC Confidence 0001111110000000000 001111111 11 No 229 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=95.36 E-value=0.0022 Score=35.13 Aligned_cols=366 Identities=10% Similarity=0.035 Sum_probs=153.2 Q ss_pred hhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhc Q lcl|NC_013644. 9 VKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLS 88 (510) Q Consensus 9 ~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g 88 (510) +- +-.++..... + ...+..+ +.....+.... .+.. .+ ..+=+..+-....|+..++-+.. T Consensus 1 MG-l~~~~~~~~~--~-~~~~~~~--~~~~~~~~~~~----------~~~~---vt-~~~al~~~~v~~~i~~Ia~~iA~ 60 (394) T protein:vir:62 1 MG-LRDRFSNYLF--K-KAEKRGY--LDNVLGKSIRY----------SGVY---VT-DSNILQSSDVYELLQDISNQMVL 60 (394) T ss_pred Cc-hhhhhhhhcc--C-CCCchhh--hhhhhhccccc----------Cccc---cC-hhhhhccHHHHHHHHHHHHhhcc Confidence 11 1122221110 0 0111000 11111111100 0000 00 00001223445556666666666 Q ss_pred CCceeccCc-HHH-HHHHHHHhc--cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCCCc Q lcl|NC_013644. 89 NPVEYETEN-EEL-KEYLAEYYN--SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYNEL 161 (510) Q Consensus 89 ~p~~~~~~d-~~~-~~~l~~~~~--n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~~~ 161 (510) -|+.+-..+ +.. ...+..++. |. .......++...+.+|.+|+++..+..+ -+..+.|..++.+ T Consensus 61 lp~~v~~~~g~~~~~~~~~~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~~~-------~~~~~~~~~~~~~-- 131 (394) T protein:vir:62 61 ADIVVEDEFGNEIKDDIALQILRNPNNYLTQSEFIKLMTNTYLLEGETFPILNGAQIH-------LASNVFTELDDNL-- 131 (394) T ss_pred cceEEEcCCCcccchhhHHHHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEEecceee-------ccccceEEECCce-- Confidence 677653221 111 122223332 22 2344556778889999998876322211 1223444443221 Q ss_pred eeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEec Q lcl|NC_013644. 162 QRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLS 241 (510) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~ 241 (510) ++++.. ++ ..|.+.. |+|++ T Consensus 132 -----~~~~~~--~~--------~~~~~~e---------------------------------------------iih~r 151 (394) T protein:vir:62 132 -----VEHFNI--GG--------HEIPPCM---------------------------------------------IRHVK 151 (394) T ss_pred -----EEEEee--CC--------EEechhh---------------------------------------------eEEec Confidence 000000 00 0011111 23333 Q ss_pred CC----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEe--cCCC-Cch--hhhhHh----hh----cCeeeec Q lcl|NC_013644. 242 NN----KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVS--GFQG-DDL--SKLRQN----VK----SKKVVGT 304 (510) Q Consensus 242 nn----~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~--g~~~-~~~--~~~~~~----~~----~~~~~~~ 304 (510) +. -.|.|.+..+...|+....+..-..+.+...+.|-.+++ +... ++. ..++.. .. .++++.+ T Consensus 152 ~~~~d~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl 231 (394) T protein:vir:62 152 NIGADHLRGKGILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDAHINPQNGAQSKLINAILDQLESIDEARSVKMI 231 (394) T ss_pred CcCCCCccccChHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCCCCcCHHHHHHHHHHHHHHhccccccCceeEe Confidence 21 135566665555555544444444455555566655544 3211 111 111111 11 1334556 Q ss_pred cCCCceeEEeecCC--HHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 305 GSDGGLDVKTVTIP--TEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNK 382 (510) Q Consensus 305 ~~~~~~~~~~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~ 382 (510) +.+.+.++.....+ ...+.+..+...+.|+..-++|+.-.+.... |..+ ......+..+|.-+++ T Consensus 232 ~~g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-sn~e------------~~~~~~~~~~l~P~~~ 298 (394) T protein:vir:62 232 PLGKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTELIK-EDIE------------KAMMYIHNKAVRPIMK 298 (394) T ss_pred eCCCceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC-cCHH------------HHHHHHHHHHHHHHHH Confidence 66777777655433 3445556677788888888888754433222 2111 1112333444555555 Q ss_pred HHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 383 LVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWED 460 (510) Q Consensus 383 ~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~ 460 (510) .|...+..+--.......+.+.|+.....+..+.++.+.++..+|+++.-.++++++. ++++..... T Consensus 299 ~ie~~l~~kll~~~~~~~~~~~fd~~~~~~~~~~~~~~~~~~~~g~~T~NE~R~~~gl~p~~~~~gd~~----------- 367 (394) T protein:vir:62 299 NFEDHLSLLFYAQNSGKRIKFKINILDFVTYSNKTNIGYNLVRTAITSPDNVADMLGFPKQNTKESQAI----------- 367 (394) T ss_pred HHHHHHhhhhcCccccCceEEEechhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCee----------- Confidence 5544444321111122357788887777777888899999999999999888887653 222211000 Q ss_pred HHHHHHhhhccCCCCCCCCCcccCCCCCCcccccc Q lcl|NC_013644. 461 VKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMA 495 (510) Q Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (510) -...+..+.. ......+. ..+|+..+ . T Consensus 368 -~~~~n~~~~~-~~~~~~~~----~kgge~~e--n 394 (394) T protein:vir:62 368 -YISNDVTEIG-KKEATDGS----LGGGEENE--N 394 (394) T ss_pred -eccccccccc-cccccccc----CCCCCCCC--C Confidence 0000000000 00000000 00111000 0 No 230 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=94.94 E-value=0.0031 Score=34.32 Aligned_cols=413 Identities=10% Similarity=0.016 Sum_probs=171.5 Q ss_pred CCCccCCCh-h-hhHHHHHHHHHhhhhhhhHHHHHHHHHHhc--cCCcchhcccceeccccccccccccccceeccchhH Q lcl|NC_013644. 1 MEALLSEDV-K-IIANALKAAIDKDRKSSSKREAETGIRYYN--HENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFP 76 (510) Q Consensus 1 ~~~~~~~~~-~-~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~--g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~ 76 (510) |..+.+.-. - ...+....-.. .. + ..|+ ...+.+..... -...+.- .-...... T Consensus 1 ~~~~~~~~~p~~~~g~~~~~~~~-----~~---~----~~~~~~e~~~~lr~~~~-----~~ly~~m-----~e~D~~i~ 58 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFGSGVV-----DG---W----TVWDPFEQTPELQWPQS-----VAVYSRM-----DNEDSRVT 58 (469) T ss_pred CCCcccCCCCccchhhhhhcccc-----cc---h----hhccccccccccccccc-----hHHHHHH-----HhhChHHH Confidence 433322111 1 11110000000 00 0 0000 00011100000 0000000 00134445 Q ss_pred HHHHHHHhhhhcCCceeccC--cHHHHHHHHHHh------------------ccCHHHHHHHHHHHHHhcCeE-EEEEEE Q lcl|NC_013644. 77 EIVDQKTQYLLSNPVEYETE--NEELKEYLAEYY------------------NSEFQVVLQELVEGSSQKGFE-YVYART 135 (510) Q Consensus 77 ~Iv~~~~~~l~g~p~~~~~~--d~~~~~~l~~~~------------------~n~~~~~~~e~~~~~~~~G~~-~~~v~~ 135 (510) -.+++....+.|-+.++.+. ++++.+++...+ ...+.+.+.++...+..+|.+ ++++|. T Consensus 59 s~l~~rk~av~~~~w~v~p~~~~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~ 138 (469) T protein:vir:10 59 SLLEAISLPIRSTPWRIRANGASDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYR 138 (469) T ss_pred HHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeee Confidence 55666666677777776532 333333333222 123556777777778888975 557775 Q ss_pred CC----CCceEEEEEccc--ceE--EEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccc Q lcl|NC_013644. 136 NA----EDRLCFQVADSL--NVF--GVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDE 207 (510) Q Consensus 136 d~----~g~~~i~~~~p~--~~~--~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~ 207 (510) .. +|...+..+.+. ..+ -.|++.+.+. .++........... T Consensus 139 ~~~~~~dG~~~~~~l~~rp~~~i~~~~~~~~~~l~-------------------------------~~~~~~~~~~~~~~ 187 (469) T protein:vir:10 139 PRNQSPDGRFWLRKLAPRPQWTISKFNVAPDGGLE-------------------------------SIEQIAPPARTRGS 187 (469) T ss_pred cccccCCCceeeeeeeecCcccceeeeeccCCcee-------------------------------eeeecCcccccccc Confidence 22 455544433222 111 1122211110 00000000000000 Q ss_pred ccccccccccccccccccccccccCCcccEEEec--CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCC Q lcl|NC_013644. 208 AEPINPRPHVLAVDSENESLLQRSYGQIPFYRLS--NNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQ 285 (510) Q Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~--nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~ 285 (510) ... . ........+...|-.++-. .|+.|.|.+..+-...--=+..+.+++..++.++.|+++.+-.. T Consensus 188 ~~~----------~-~~~~~~lp~~k~i~~~~~~~~g~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~~ 256 (469) T protein:vir:10 188 LYV----------A-NIAPPEIPVNRLVVYTRNKRPGQWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTASS 256 (469) T ss_pred ccc----------C-CCCccccccCcEEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEecCC Confidence 000 0 0000000111122111111 35678888887765544445577889999999999999877543 Q ss_pred CCchhh------hhHhhhc--CeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccc-c-cccCcccHHHH Q lcl|NC_013644. 286 GDDLSK------LRQNVKS--KKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDST-Q-VGDGNITNIVI 355 (510) Q Consensus 286 ~~~~~~------~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~-~-~~~g~~Sg~Ai 355 (510) +.+..+ ....+.. ...+.++.+.++++++...+...+...++.+.+.|...-.+..++ . .+++.+.|..- T Consensus 257 ~a~~~ek~~l~~a~~~~~~g~~a~~iip~~~~ie~~ea~g~~~~~~~li~~~d~~Isk~iLG~tlTs~~~gGS~a~~~vh 336 (469) T protein:vir:10 257 ATDEDEVRKMAALARSVRGGINAGVGLAQGQILELLGVSGNLPDIRRAIEGHDRSIALSGLAHFLNLDGKGGSYALASVL 336 (469) T ss_pred CCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEeecCCCchHHHHHHHHHHHHHHHHHhcccccccCccchhhHHHHH Confidence 333222 1222321 335568889999999988888889999999999997764433322 2 22222223322 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCC-----C Q lcl|NC_013644. 356 KARYTLLNMKANKTEARLRALLE-WMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRK-----I 429 (510) Q Consensus 356 ~~~~~~l~~k~~~k~~~~~~~l~-~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~-----i 429 (510) .-. ....+..-.+.+...|. ++++-++.+ +. +. +..-+.++|... ..+....++.+.++++.|+ + T Consensus 337 ~ev---~~d~~~sDa~~i~~tln~~li~~l~~l-N~--g~--~~~~P~~~~~~~-e~~~~~~a~~i~~l~~~G~~~~~~~ 407 (469) T protein:vir:10 337 EDP---FTQAVHAYATSICRIANQHIIEDLVDI-NF--GV--DTPAPVLTFDPI-GSRQDLTAAAVKLLYDAGVFDDDPA 407 (469) T ss_pred HHH---HHHHHHHHHHHHHHHHHHHHHHHHHHh-cC--CC--CCCccEEEecCC-CCcHHHHHHHHHHHHhcCCccCccc Confidence 221 22233333455556664 455555543 21 11 122356677543 4556677888999999997 3 Q ss_pred chHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccccCCC Q lcl|NC_013644. 430 ILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLPEN 509 (510) Q Consensus 430 S~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (510) +.+.+.+.++.-...+.+... ...+....+.....+......+.......+ -.|+. T Consensus 408 ~~~~~~e~~gip~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~ 463 (469) T protein:vir:10 408 VKRAIRQRFNLPSELNDTPSA------------EPEEPAAVPNQSAAPARTRSSGNADARARA------------PKADQ 463 (469) T ss_pred cHHHHHHHhCCCCCCCCcccc------------cchhcccCCCCCccccccCCCCCccccccc------------CCChH Confidence 445566666542221110000 000000000000000000000000000000 01111 Q ss_pred C Q lcl|NC_013644. 510 G 510 (510) Q Consensus 510 ~ 510 (510) + T Consensus 464 ~ 464 (469) T protein:vir:10 464 G 464 (469) T ss_pred H Confidence 1 No 231 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=94.45 E-value=0.0044 Score=33.52 Aligned_cols=360 Identities=9% Similarity=0.020 Sum_probs=131.3 Q ss_pred HHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhc-- Q lcl|NC_013644. 32 AETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYN-- 109 (510) Q Consensus 32 ~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-- 109 (510) |..+.+.+..+.+... .. .+....... ...-+........|+..++-+.+-|+.+-..+......+..++. T Consensus 1 Mg~f~~lf~~~~~~~~----~~--~~~~~~~v~-~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~ll~~~ 73 (395) T protein:vir:10 1 MSILEKIFKTRKDITY----ML--DLDMIEDLS-QQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYKLNIK 73 (395) T ss_pred CchhhhhhccCccccc----cc--cchhccccc-hhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHHHHhc Confidence 2222222222211000 00 000000000 00011233444555556666666676643333322232333331 Q ss_pred -cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccce--EEEEcCCCCceeEEEEEEEEEeeCCceeEEEE Q lcl|NC_013644. 110 -SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNV--FGVYNEYNELQRICRHYITEIEKDGETVDIHH 183 (510) Q Consensus 110 -n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~--~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 183 (510) |. .......+....+..|.+|.++..+ .+ + ..+++..+ ..++++. ++.+..... .. T Consensus 74 PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~-~~-~--~~~~~~~~~~~~~~~~~--------~~~~~~~~~------~~ 135 (395) T protein:vir:10 74 PNTDLSSDSFWQQVIYKLIYDNEVLIVVSDS-KE-L--LIADSFYREEYALYDDI--------FKDVTVKDY------TY 135 (395) T ss_pred cCcCCCHHHHHHHHHHHHhhCCceEEEEecC-CC-e--EecCCccceeEeecCcc--------eeEEEEcCc------ee Confidence 22 2233444566666777776554333 22 2 11222221 1122110 000000000 00 Q ss_pred EEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecC-----CCCCCCcHHHHHHHHH Q lcl|NC_013644. 184 AEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSN-----NKQETTDLKPIKALID 258 (510) Q Consensus 184 ~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----n~~g~sd~~~v~~liD 258 (510) -..+.+.. |+|++. ...|.|-++.+..+++ T Consensus 136 ~~~~~~~e---------------------------------------------vih~~~~~~~~~~~G~spi~~~~~~~~ 170 (395) T protein:vir:10 136 QRTFTMQE---------------------------------------------VIYLKYNNNKVTHFVESLFEDYGKIFG 170 (395) T ss_pred eeeecccc---------------------------------------------EEEEccCCCCcccccchHHHHHHHHHH Confidence 01122222 233321 1234444444433333 Q ss_pred HHHHHHHHHHHHHHHhccceeEE--ecCCCCch--hhhhHhhh---------cCeeeeccCCCceeEEeecCC-----HH Q lcl|NC_013644. 259 DYDLMNCFLSNNLQDFAEAIYVV--SGFQGDDL--SKLRQNVK---------SKKVVGTGSDGGLDVKTVTIP-----TE 320 (510) Q Consensus 259 ~~n~~~S~~~~~~~~~~~~~lv~--~g~~~~~~--~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~-----~~ 320 (510) .. .+.+...+.+--++ .+...++. ......+. ...++.++++.+.+.++.... .. T Consensus 171 ~~-------~~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~ 243 (395) T protein:vir:10 171 RM-------IGAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFS 243 (395) T ss_pred HH-------HHHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHH Confidence 22 22233333333333 22211111 11111111 111333455545544443221 22 Q ss_pred HHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccc-cc Q lcl|NC_013644. 321 GRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFD-PT 399 (510) Q Consensus 321 ~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~-~~ 399 (510) .+.+..+...+.|+..-++|+.-.+ |+-|+.+ .....++...|.-++..|...+..+--.... .. T Consensus 244 q~~e~~~~~~~~Ia~~f~VPp~~l~--~~~sn~e------------~~~~~~~~~~l~P~~~~ie~~l~~kL~~~~~~~~ 309 (395) T protein:vir:10 244 ELSELMRDAIKNVALMIGIPPGLIY--GETADLE------------KNTLVFEKFCLTPLLKKIQNELNAKLITQSMYLK 309 (395) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHhc--CcccCHH------------HHHHHHHHHHHHHHHHHHHHHHHHhhcChhhhcc Confidence 4566667778888888888876432 2222111 1111223333444444443333322111101 11 Q ss_pred eeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCC Q lcl|NC_013644. 400 EVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDN 477 (510) Q Consensus 400 ~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~ 477 (510) .+.+.++.-+-.|..+.++.+.+++.+|+++.-.++++++. +++....+. ....+..+........ T Consensus 310 ~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~------------~~~~n~~~~~~~~~~~ 377 (395) T protein:vir:10 310 DTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEY------------LITKNYEKANSGENDE 377 (395) T ss_pred cceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCcee------------eecccccccccccccc Confidence 23455666777899999999999999999999888887643 222110000 0000000000000000 Q ss_pred CCCcccCCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 478 TDEEETAVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) ....+....+|+ ..+|| T Consensus 378 ~~~~~~~~kgg~----------------~~~~g 394 (395) T protein:vir:10 378 KEKDENTLKGGD----------------EDESG 394 (395) T ss_pred CcccccccCCCC----------------CCCCC Confidence 000000000000 11112 No 232 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=94.45 E-value=0.0044 Score=33.52 Aligned_cols=360 Identities=9% Similarity=0.020 Sum_probs=131.3 Q ss_pred HHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhc-- Q lcl|NC_013644. 32 AETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYN-- 109 (510) Q Consensus 32 ~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-- 109 (510) |..+.+.+..+.+... .. .+....... ...-+........|+..++-+.+-|+.+-..+......+..++. T Consensus 1 Mg~f~~lf~~~~~~~~----~~--~~~~~~~v~-~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~ll~~~ 73 (395) T protein:vir:10 1 MSILEKIFKTRKDITY----ML--DLDMIEDLS-QQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYKLNIK 73 (395) T ss_pred CchhhhhhccCccccc----cc--cchhccccc-hhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHHHHhc Confidence 2222222222211000 00 000000000 00011233444555556666666676643333322232333331 Q ss_pred -cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccce--EEEEcCCCCceeEEEEEEEEEeeCCceeEEEE Q lcl|NC_013644. 110 -SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNV--FGVYNEYNELQRICRHYITEIEKDGETVDIHH 183 (510) Q Consensus 110 -n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~--~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 183 (510) |. .......+....+..|.+|.++..+ .+ + ..+++..+ ..++++. ++.+..... .. T Consensus 74 PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~-~~-~--~~~~~~~~~~~~~~~~~--------~~~~~~~~~------~~ 135 (395) T protein:vir:10 74 PNTDLSSDSFWQQVIYKLIYDNEVLIVVSDS-KE-L--LIADSFYREEYALYDDI--------FKDVTVKDY------TY 135 (395) T ss_pred cCcCCCHHHHHHHHHHHHhhCCceEEEEecC-CC-e--EecCCccceeEeecCcc--------eeEEEEcCc------ee Confidence 22 2233444566666777776554333 22 2 11222221 1122110 000000000 00 Q ss_pred EEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecC-----CCCCCCcHHHHHHHHH Q lcl|NC_013644. 184 AEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSN-----NKQETTDLKPIKALID 258 (510) Q Consensus 184 ~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----n~~g~sd~~~v~~liD 258 (510) -..+.+.. |+|++. ...|.|-++.+..+++ T Consensus 136 ~~~~~~~e---------------------------------------------vih~~~~~~~~~~~G~spi~~~~~~~~ 170 (395) T protein:vir:10 136 QRTFTMQE---------------------------------------------VIYLKYNNNKVTHFVESLFEDYGKIFG 170 (395) T ss_pred eeeecccc---------------------------------------------EEEEccCCCCcccccchHHHHHHHHHH Confidence 01122222 233321 1234444444433333 Q ss_pred HHHHHHHHHHHHHHHhccceeEE--ecCCCCch--hhhhHhhh---------cCeeeeccCCCceeEEeecCC-----HH Q lcl|NC_013644. 259 DYDLMNCFLSNNLQDFAEAIYVV--SGFQGDDL--SKLRQNVK---------SKKVVGTGSDGGLDVKTVTIP-----TE 320 (510) Q Consensus 259 ~~n~~~S~~~~~~~~~~~~~lv~--~g~~~~~~--~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~-----~~ 320 (510) .. .+.+...+.+--++ .+...++. ......+. ...++.++++.+.+.++.... .. T Consensus 171 ~~-------~~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~ 243 (395) T protein:vir:10 171 RM-------IGAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFS 243 (395) T ss_pred HH-------HHHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHH Confidence 22 22233333333333 22211111 11111111 111333455545544443221 22 Q ss_pred HHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccc-cc Q lcl|NC_013644. 321 GRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFD-PT 399 (510) Q Consensus 321 ~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~-~~ 399 (510) .+.+..+...+.|+..-++|+.-.+ |+-|+.+ .....++...|.-++..|...+..+--.... .. T Consensus 244 q~~e~~~~~~~~Ia~~f~VPp~~l~--~~~sn~e------------~~~~~~~~~~l~P~~~~ie~~l~~kL~~~~~~~~ 309 (395) T protein:vir:10 244 ELSELMRDAIKNVALMIGIPPGLIY--GETADLE------------KNTLVFEKFCLTPLLKKIQNELNAKLITQSMYLK 309 (395) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHhc--CcccCHH------------HHHHHHHHHHHHHHHHHHHHHHHHhhcChhhhcc Confidence 4566667778888888888876432 2222111 1111223333444444443333322111101 11 Q ss_pred eeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCC Q lcl|NC_013644. 400 EVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDN 477 (510) Q Consensus 400 ~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~ 477 (510) .+.+.++.-+-.|..+.++.+.+++.+|+++.-.++++++. +++....+. ....+..+........ T Consensus 310 ~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~------------~~~~n~~~~~~~~~~~ 377 (395) T protein:vir:10 310 DTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEY------------LITKNYEKANSGENDE 377 (395) T ss_pred cceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCcee------------eecccccccccccccc Confidence 23455666777899999999999999999999888887643 222110000 0000000000000000 Q ss_pred CCCcccCCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 478 TDEEETAVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) ....+....+|+ ..+|| T Consensus 378 ~~~~~~~~kgg~----------------~~~~g 394 (395) T protein:vir:10 378 KEKDENTLKGGD----------------EDESG 394 (395) T ss_pred CcccccccCCCC----------------CCCCC Confidence 000000000000 11112 No 233 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=94.45 E-value=0.0044 Score=33.52 Aligned_cols=360 Identities=9% Similarity=0.020 Sum_probs=131.3 Q ss_pred HHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhc-- Q lcl|NC_013644. 32 AETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYN-- 109 (510) Q Consensus 32 ~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-- 109 (510) |..+.+.+..+.+... .. .+....... ...-+........|+..++-+.+-|+.+-..+......+..++. T Consensus 1 Mg~f~~lf~~~~~~~~----~~--~~~~~~~v~-~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~ll~~~ 73 (395) T protein:vir:95 1 MSILEKIFKTRKDITY----ML--DLDMIEDLS-QQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYKLNIK 73 (395) T ss_pred CchhhhhhccCccccc----cc--cchhccccc-hhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHHHHhc Confidence 2222222222211000 00 000000000 00011233444555556666666676643333322232333331 Q ss_pred -cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccce--EEEEcCCCCceeEEEEEEEEEeeCCceeEEEE Q lcl|NC_013644. 110 -SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNV--FGVYNEYNELQRICRHYITEIEKDGETVDIHH 183 (510) Q Consensus 110 -n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~--~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 183 (510) |. .......+....+..|.+|.++..+ .+ + ..+++..+ ..++++. ++.+..... .. T Consensus 74 PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~-~~-~--~~~~~~~~~~~~~~~~~--------~~~~~~~~~------~~ 135 (395) T protein:vir:95 74 PNTDLSSDSFWQQVIYKLIYDNEVLIVVSDS-KE-L--LIADSFYREEYALYDDI--------FKDVTVKDY------TY 135 (395) T ss_pred cCcCCCHHHHHHHHHHHHhhCCceEEEEecC-CC-e--EecCCccceeEeecCcc--------eeEEEEcCc------ee Confidence 22 2233444566666777776554333 22 2 11222221 1122110 000000000 00 Q ss_pred EEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecC-----CCCCCCcHHHHHHHHH Q lcl|NC_013644. 184 AEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSN-----NKQETTDLKPIKALID 258 (510) Q Consensus 184 ~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----n~~g~sd~~~v~~liD 258 (510) -..+.+.. |+|++. ...|.|-++.+..+++ T Consensus 136 ~~~~~~~e---------------------------------------------vih~~~~~~~~~~~G~spi~~~~~~~~ 170 (395) T protein:vir:95 136 QRTFTMQE---------------------------------------------VIYLKYNNNKVTHFVESLFEDYGKIFG 170 (395) T ss_pred eeeecccc---------------------------------------------EEEEccCCCCcccccchHHHHHHHHHH Confidence 01122222 233321 1234444444433333 Q ss_pred HHHHHHHHHHHHHHHhccceeEE--ecCCCCch--hhhhHhhh---------cCeeeeccCCCceeEEeecCC-----HH Q lcl|NC_013644. 259 DYDLMNCFLSNNLQDFAEAIYVV--SGFQGDDL--SKLRQNVK---------SKKVVGTGSDGGLDVKTVTIP-----TE 320 (510) Q Consensus 259 ~~n~~~S~~~~~~~~~~~~~lv~--~g~~~~~~--~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~-----~~ 320 (510) .. .+.+...+.+--++ .+...++. ......+. ...++.++++.+.+.++.... .. T Consensus 171 ~~-------~~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~ 243 (395) T protein:vir:95 171 RM-------IGAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFS 243 (395) T ss_pred HH-------HHHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHH Confidence 22 22233333333333 22211111 11111111 111333455545544443221 22 Q ss_pred HHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccc-cc Q lcl|NC_013644. 321 GRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFD-PT 399 (510) Q Consensus 321 ~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~-~~ 399 (510) .+.+..+...+.|+..-++|+.-.+ |+-|+.+ .....++...|.-++..|...+..+--.... .. T Consensus 244 q~~e~~~~~~~~Ia~~f~VPp~~l~--~~~sn~e------------~~~~~~~~~~l~P~~~~ie~~l~~kL~~~~~~~~ 309 (395) T protein:vir:95 244 ELSELMRDAIKNVALMIGIPPGLIY--GETADLE------------KNTLVFEKFCLTPLLKKIQNELNAKLITQSMYLK 309 (395) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHhc--CcccCHH------------HHHHHHHHHHHHHHHHHHHHHHHHhhcChhhhcc Confidence 4566667778888888888876432 2222111 1111223333444444443333322111101 11 Q ss_pred eeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCC Q lcl|NC_013644. 400 EVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDN 477 (510) Q Consensus 400 ~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~ 477 (510) .+.+.++.-+-.|..+.++.+.+++.+|+++.-.++++++. +++....+. ....+..+........ T Consensus 310 ~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~------------~~~~n~~~~~~~~~~~ 377 (395) T protein:vir:95 310 DTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEY------------LITKNYEKANSGENDE 377 (395) T ss_pred cceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCcee------------eecccccccccccccc Confidence 23455666777899999999999999999999888887643 222110000 0000000000000000 Q ss_pred CCCcccCCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 478 TDEEETAVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) ....+....+|+ ..+|| T Consensus 378 ~~~~~~~~kgg~----------------~~~~g 394 (395) T protein:vir:95 378 KEKDENTLKGGD----------------EDESG 394 (395) T ss_pred CcccccccCCCC----------------CCCCC Confidence 000000000000 11112 No 234 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=94.36 E-value=0.0046 Score=33.39 Aligned_cols=395 Identities=9% Similarity=0.005 Sum_probs=163.0 Q ss_pred CCCccCCChhh--------hHHH------HHHHHHhhhhhhhHHHHHHHHHHhccCC-----cchhcccceecccccccc Q lcl|NC_013644. 1 MEALLSEDVKI--------IANA------LKAAIDKDRKSSSKREAETGIRYYNHEN-----DIMNNRIFYVDDEGILRE 61 (510) Q Consensus 1 ~~~~~~~~~~~--------~~~~------i~~~i~~~~~~~~~~~~~~~~~YY~g~~-----~i~~~~~~~~~~~~~~~~ 61 (510) |--= +..++. .... ....+..|. |.|-. .|+..... -...+ T Consensus 1 m~kk-~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~g~~~~~~~~iLr~~~~-----~~ly~ 60 (448) T protein:vir:77 1 MAKR-GRKPKELVPGPGSIDPSDVPKLEGASVPVMSTS--------------YDVVVDREFDELLQGKDG-----LLVYH 60 (448) T ss_pred CCCC-CCCCcccCCcccccchhhhhhhccchhhhcccc--------------cccccccchhHhhccccc-----hHHHH Confidence 2110 000100 0000 000000000 01100 11100000 00000 Q ss_pred ccccccceeccchhHHHHHHHHhhhhcCCceeccC-----cHHHHHHHHHHhcc--------CHHHHHHHHHHHHHhcCe Q lcl|NC_013644. 62 DKYASNVRIPHGFFPEIVDQKTQYLLSNPVEYETE-----NEELKEYLAEYYNS--------EFQVVLQELVEGSSQKGF 128 (510) Q Consensus 62 ~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~-----d~~~~~~l~~~~~n--------~~~~~~~e~~~~~~~~G~ 128 (510) + .. ......-.+.+-...+.|.+..+.+. +....+++.+++.. +|.+.+.++ .++..+|. T Consensus 61 ~-----m~-~D~hi~s~l~~Rk~av~~~~w~v~p~~~~~~d~~~ae~v~~~l~~~~~~~~~~~f~~~i~~~-lda~~~G~ 133 (448) T protein:vir:77 61 K-----ML-SDGTVKNALNYIFGRIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIY-ENAYIYGM 133 (448) T ss_pred H-----Hh-hChHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHhhchhhhhccCCHHHHHHHH-HHhhhhcc Confidence 0 00 13444555666667777888887642 23345567666532 455666555 68888997 Q ss_pred E-EEEEEE-CCCCceEEEE---Ecccce-EEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCce Q lcl|NC_013644. 129 E-YVYART-NAEDRLCFQV---ADSLNV-FGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKD 202 (510) Q Consensus 129 ~-~~~v~~-d~~g~~~i~~---~~p~~~-~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~ 202 (510) + .+++|. ..+|...+.. .++... +-.|+..+.+ +++...+.. T Consensus 134 s~~Eivw~~~~dg~~~~~~l~~r~~~~~~~f~~~~~~~l--------------------------------~~~~~~~~~ 181 (448) T protein:vir:77 134 AAGEIVLTLGADGKLILDKIVPIHPFNIDEVLYDEEGGP--------------------------------KALKLSGEV 181 (448) T ss_pred eeEEEEEeecCCCceeeccccccCCCccceeeeecCCce--------------------------------EEEecCCcc Confidence 5 556774 4567654332 233211 1112211111 111110000 Q ss_pred eecccccccccccccccccccccccccccCCcccEEEec--CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeE Q lcl|NC_013644. 203 YELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLS--NNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYV 280 (510) Q Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~--nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv 280 (510) .... ......+.|++++=+.+.. .|+.|.|.+..+--..--=+..+.+++..++.++.|+++ T Consensus 182 ~~~~----------------~~~~~~~lP~~~~i~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~v 245 (448) T protein:vir:77 182 KGGS----------------QFVNGLEIPIWKTVVFLHNDDGSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPT 245 (448) T ss_pred cccc----------------cCCCccccccceEEEEecCCcCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeE Confidence 0000 0000112233443222111 245677888776544444455678899999999999999 Q ss_pred EecCCCCc--hhh------hhHhhh--cCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcc Q lcl|NC_013644. 281 VSGFQGDD--LSK------LRQNVK--SKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNI 350 (510) Q Consensus 281 ~~g~~~~~--~~~------~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~ 350 (510) .+-..+.+ ..+ ...++. ......++.+.++++++.......+...++.+.+.|...-.+-.++.+..|.. T Consensus 246 gky~~ga~~~~~~~~~l~~av~~i~~g~~a~~iiP~g~~ie~~ea~~~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~g~ 325 (448) T protein:vir:77 246 LTIPKSVRQGTKQWEAAKEIVKNFVQKPRHGIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARALGIDFNTVQLNMGV 325 (448) T ss_pred EecCCCCCCCHHHHHHHHHHHHHHhcCCceEEEecCCceEEEEecCCCccCHHHHHHHHHHHHHHHHhccccccccccch Confidence 87433221 111 122222 23355688899999998776666677788888888876543333222222222 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCC Q lcl|NC_013644. 351 TNIVIKARYTLLNMKANKTEARLRALLE-WMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKI 429 (510) Q Consensus 351 Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~-~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~i 429 (510) +..+......-.......-.+.+...|. ++++-++. ++ .+.. ..-..+.|...-+.|.++.++.+.++++. T Consensus 326 ~~~~~~~~~~v~~~~~~aDa~~i~~tln~~Li~~l~~-lN--fg~~--~~~P~~~f~~~e~eDl~~~a~~~~~l~~~--- 397 (448) T protein:vir:77 326 QAVNIGEFVSLTQQTIISLQREFASAVNLYLIPKLVL-PN--WPGA--TRFPRLTFEMEERNDFSAAANLMGMLINA--- 397 (448) T ss_pred hhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hc--CCCC--CCCCEEEecCCChhhHHHHHHHhHHHHHH--- Confidence 2222221111111111222233444443 34544443 22 2211 12357889888889988888887776532 Q ss_pred chHHHHHh--CCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCccccccc Q lcl|NC_013644. 430 ILESILQV--APRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTES 504 (510) Q Consensus 430 S~et~~~~--~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (510) ..+. +|.-.+.. ..+....+.+..... +++.....++. +... ++.-.-+ T Consensus 398 ----~~~~~~ip~~~~~~----------------~~~~~~~~~~~~~~~-~~~~~~~~~~~---~~~~--~~~r~~~ 448 (448) T protein:vir:77 398 ----VKDSEDIPTELKAL----------------IDALPSKMRRALGVV-DEVREAVRQPA---DSRY--LYTRRRR 448 (448) T ss_pred ----HHHHhcCCccCCcC----------------CCCCchhcccccCCC-CCCCchhhcch---hhHH--HHhhhcC Confidence 1111 12100000 000000010111011 11111111121 1111 1111111 No 235 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=94.32 E-value=0.0047 Score=33.32 Aligned_cols=348 Identities=9% Similarity=0.036 Sum_probs=141.3 Q ss_pred HHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhc-- Q lcl|NC_013644. 32 AETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYN-- 109 (510) Q Consensus 32 ~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-- 109 (510) |-.+.+++.-+.... ...... ...... ...-+........|+..++-+.+-|+++--.+......+..++. T Consensus 1 Mg~f~~~f~~~~~~~----~~~~~~-~~~~~~--~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~~~~l~~lL~~~ 73 (385) T protein:vir:95 1 MGLFDSVFKRHSELS----WMYDLE-FLQDKS--KKAYLKQIALNTVVEMVARTISQSEFRVMKNNTKEKGTLYYLLNVR 73 (385) T ss_pred CchhhhhhccCcccc----cccchh-hhhccc--hhhhhhhHHHHHHHHHHHHHHcccceeeeecCccccchHHHHHhcc Confidence 111111111100000 000000 000000 00001123445566777776767777653222222223333332 Q ss_pred -cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCceE--EEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEE Q lcl|NC_013644. 110 -SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRLC--FQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHH 183 (510) Q Consensus 110 -n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~--i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 183 (510) |. .......++...+.+|.||++... ++... ..++.+..+ .++.. . ++....... .. T Consensus 74 PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~--~~~~~~~~~~~~~~~~-~~~~~-------~-~~~~~~~~~------~~ 136 (385) T protein:vir:95 74 PNRNQNAVDFWQKFIFKLIMDNEVLVVKND--EGHFFVADDFEKEDEL-GLYSH-------R-FTNVLVNDF------EF 136 (385) T ss_pred cCcCCCHHHHHHHHHHHHhhcCceEEEEec--CCCeeecccccccccc-ccccc-------c-ceeeeeccc------ce Confidence 22 234556677888899999976533 33321 111111111 10100 0 000000000 00 Q ss_pred EEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCC-----CCCCcHHHHHHHHH Q lcl|NC_013644. 184 AEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNK-----QETTDLKPIKALID 258 (510) Q Consensus 184 ~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~-----~g~sd~~~v~~liD 258 (510) ...+.+.. |+|++... .|.|.++.+.. T Consensus 137 ~~~~~~~e---------------------------------------------iih~~~~~~~~~~~G~s~~~~~~~--- 168 (385) T protein:vir:95 137 KRVFTMDD---------------------------------------------VIYLKYNNQKLDAFSLGLFEDYGE--- 168 (385) T ss_pred eeeecccc---------------------------------------------EEEecCCCCCcccccchHHHHHHH--- Confidence 01111222 34443321 23443333322 Q ss_pred HHHHHHHHHHHHHHHhcc--ceeEEecCCCCch---hhhhHhh---------hcCeeeeccCCCceeEEeecC------C Q lcl|NC_013644. 259 DYDLMNCFLSNNLQDFAE--AIYVVSGFQGDDL---SKLRQNV---------KSKKVVGTGSDGGLDVKTVTI------P 318 (510) Q Consensus 259 ~~n~~~S~~~~~~~~~~~--~~lv~~g~~~~~~---~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~------~ 318 (510) .+...+ +...+... .++++.+....+. ..+...+ ....++.++++.+.+.++... . T Consensus 169 ~i~~~~----~~~~~~~~~~g~l~~~~~~~~~~e~~~~~~~~~~~~~~g~~~~~~~i~~l~~g~~~~~l~~~~~~~~s~~ 244 (385) T protein:vir:95 169 IFGRMI----DLQMLNNQIRGILKVDATKFYNKEKQKELQAYIDTLFDAFQNNTIAVVPLTEGLAYEEHSNRGAAQSAQQ 244 (385) T ss_pred HHHHHH----HHHHhcCCCceEEEeCCccCCCHHHHHHHHHHHHHHhhhhhhcCCceEEcCCCceeEeecccccccCCHH Confidence 222222 22333333 3333433221111 1111111 122355567666666655322 1 Q ss_pred HHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccc- Q lcl|NC_013644. 319 TEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFD- 397 (510) Q Consensus 319 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~- 397 (510) ...+.+..+...+.|+..-++|+.-.. |+-|.. .......+...|.-+++.|...+..+--.... T Consensus 245 d~~~~e~~~~~~~~Ia~~fgVpp~~l~--~~~sn~------------e~~~~~~~~~~l~P~~~~ie~~l~~~L~~~~~~ 310 (385) T protein:vir:95 245 FSELNELKKTVLTDVARMIGVPPSLVL--GEMADL------------EKTIESYLQFCINPLLRKIEAELNSKFFYQDEY 310 (385) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHhc--CCCcCH------------HHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhc Confidence 345666777788888888888875432 221111 11223445555666555555555432211111 Q ss_pred -cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCC--CcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCC Q lcl|NC_013644. 398 -PTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRL--DDDNVLRLICEQFDLDWEDVKEALEEAEYTKGL 474 (510) Q Consensus 398 -~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v--~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~ 474 (510) ...+.+.+..-+..|..+.++.+.+++.+|+++.-.+++.++.- +++...+ .....+..+ T Consensus 311 ~~~~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~R~~~g~~p~~~~~gd~------------~~~~~n~~~----- 373 (385) T protein:vir:95 311 LNDDMHIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQVRIMTGEEPADDPELDK------------FIITKNLQS----- 373 (385) T ss_pred ccceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCce------------eeeccccee----- Confidence 11355555677788999999999999999999998888876542 1111000 000000000 Q ss_pred CCCCCCcccCCCCCC Q lcl|NC_013644. 475 SDNTDEEETAVNPDD 489 (510) Q Consensus 475 ~~~~~~~~~~~~~~~ 489 (510) .++.+.+++.++ T Consensus 374 ---~~~~kgge~~~e 385 (385) T protein:vir:95 374 ---ADAFKGGESNEE 385 (385) T ss_pred ---cccccCCCCCCC Confidence 011111111111 No 236 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=93.67 E-value=0.0068 Score=32.47 Aligned_cols=377 Identities=8% Similarity=-0.028 Sum_probs=146.1 Q ss_pred HHHHHHHhccCCcchhcccc-eeccccccccccccc--ccee--ccchhHHHHHHHHhhhhcCCcee---ccCc--HHH- Q lcl|NC_013644. 32 AETGIRYYNHENDIMNNRIF-YVDDEGILREDKYAS--NVRI--PHGFFPEIVDQKTQYLLSNPVEY---ETEN--EEL- 100 (510) Q Consensus 32 ~~~~~~YY~g~~~i~~~~~~-~~~~~~~~~~~~~~~--~~ki--~~n~~~~Iv~~~~~~l~g~p~~~---~~~d--~~~- 100 (510) |-.+...-.-.......... ..+.-.......... ...+ .++.....|+..++-+.+-|+.+ +.+. +.+ T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~~~~~~dg~~~~~~ 80 (423) T protein:vir:81 1 MGFLQKLGLAPSVVATPEPIELVGPIFESLKLSTKNMTVEQIWEDQPHLRTVTTFIARNVASLQLQAFERVEDGGRERVR 80 (423) T ss_pred CchhHhhccccccccCccccccccccccccccccchhhHHHHHHhhhHHHHHHHHHHHhHhhCceEEEEEecCCceeeec Confidence 11111100000000000000 000000000000000 0000 12344456666777666677764 1111 111 Q ss_pred HHHHHHHhc--cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccc---eEEEEcCCCCceeEEEEEEEEE Q lcl|NC_013644. 101 KEYLAEYYN--SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLN---VFGVYNEYNELQRICRHYITEI 172 (510) Q Consensus 101 ~~~l~~~~~--n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~---~~~~~d~~~~~~~~~~~~~~~~ 172 (510) ...+..++. |. .......+......+|.||.++..|..+...+..+.|.. +.+.....+. ..+++..... T Consensus 81 ~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~~~~l~p~~~~~v~~~~~~~~~--~~~~Y~~~~~ 158 (423) T protein:vir:81 81 EGHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTPTLDIRPIPVSWVQRRAYKDGW--GSLDYIIIES 158 (423) T ss_pred cchHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcceEEEeecccceeeeeeccCCC--cceEEEEEEe Confidence 112333332 32 334555667788899999988877754433333333322 2111110000 0000000000 Q ss_pred eeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCC-----CCCC Q lcl|NC_013644. 173 EKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNN-----KQET 247 (510) Q Consensus 173 ~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn-----~~g~ 247 (510) ...+. ..+ .+.+.. |+|+++. ..|. T Consensus 159 ~~~~g----~~~-~~~~~e---------------------------------------------vih~r~~~~~~~~~G~ 188 (423) T protein:vir:81 159 GDNDG----RSV-KVPGER---------------------------------------------VIHRHGYNPKTMKRGK 188 (423) T ss_pred cCCCc----eEE-EEcccc---------------------------------------------eEEecCCCCCCccccc Confidence 00000 000 011111 3444321 1466 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCC------CCc--hhhhhHhhh---------cCeeeeccCCCce Q lcl|NC_013644. 248 TDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQ------GDD--LSKLRQNVK---------SKKVVGTGSDGGL 310 (510) Q Consensus 248 sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~------~~~--~~~~~~~~~---------~~~~~~~~~~~~~ 310 (510) |.+..+...++....+..-..+.+...+.|-.+++-.. .++ ...+...+. .++++.++++.++ T Consensus 189 spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~ 268 (423) T protein:vir:81 189 SPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAESRTRFMANLRASFSPKSSDVGGTLLLEDGMKA 268 (423) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHHHHHHHHHHHHHHhccccccCCcceecCCCceE Confidence 76666555555444444334444555566666654211 111 111211111 1345666666666 Q ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_013644. 311 DVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINR 390 (510) Q Consensus 311 ~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~ 390 (510) +.++.......+.+..+.....|...-++|+.-.+..++.+...++... ...+...|.-.++.|...+.. T Consensus 269 ~~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn~e~~~----------~~f~~~~L~P~~~~ie~~l~~ 338 (423) T protein:vir:81 269 ENFHTTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNANYSNVREFR----------KALYGDNLGSWIRIIQDVMNL 338 (423) T ss_pred EeccCChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCCcccHHHHH----------HHHHHHHHHHHHHHHHHHHhh Confidence 6555443334445556677778888888887543322222211122111 122233344444444433332 Q ss_pred ccCCc--cccceeeEEe--CCCCCCCHHHHHHHHHHHH-hcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 391 RYTKA--FDPTEVSFTF--TREVMVNETDIVNDEKTEA-ETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEAL 465 (510) Q Consensus 391 ~~~~~--~~~~~v~i~f--~~~~p~d~~e~~~~~~~~~-~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~ 465 (510) +--.. .+....-+.| ..-+..|..+.++.+.++. ++|+++.-.+++.++.-..+.-. ...... T Consensus 339 ~L~~~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~NE~R~~~gl~p~~gGD------------~~~~p~ 406 (423) T protein:vir:81 339 FLLPRVGIDNEKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTINEVRAMDNLPSIDGGD------------DLARPL 406 (423) T ss_pred hhcCccccccCccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCHHHHHHHhCCCCCCCcc------------eeeccc Confidence 21111 1122333455 4556778888888888776 46888887777766432111000 000000 Q ss_pred HhhhccCCCCCCCCCcccCCCCCCcccc Q lcl|NC_013644. 466 EEAEYTKGLSDNTDEEETAVNPDDPTQQ 493 (510) Q Consensus 466 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (510) +.. ..+.+ ..++++.+. T Consensus 407 n~~--------~~~~~---~~~~~~~~t 423 (423) T protein:vir:81 407 NTE--------FGDSE---DAPGEEVET 423 (423) T ss_pred ccc--------cCccC---CCCCCCCCC Confidence 000 00000 011111111 No 237 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=93.11 E-value=0.0088 Score=31.86 Aligned_cols=433 Identities=10% Similarity=0.008 Sum_probs=172.5 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |+ +...++..+.+.+.--.+...+.+|..-. +.....+.... ... ..+...+-+..-++ T Consensus 1 m~-----------~~~~~l~~k~~R~~~e~~w~e~a~~~lP~-----~~~~~~~~~~~--~~~---~~~~~dstg~~a~~ 59 (514) T protein:vir:80 1 MR-----------QQASAMWAEYRDSTAIRKAEDFAKFTIAS-----LMVDPLDKTHQ--AEV---VEYDFQSAGAFLVN 59 (514) T ss_pred Cc-----------cchHHHHHHhhcchHHHHHHHHHHHhccc-----ccCCCCCCccc--ccc---cccccchhHHHHHH Confidence 11 11122222222111112334444443221 00000000000 000 01223445555566 Q ss_pred HHHhhhhc--CCce-----eccCcH-------------HHHHHH-------HH-HhccCHHHHHHHHHHHHHhcCeEEEE Q lcl|NC_013644. 81 QKTQYLLS--NPVE-----YETENE-------------ELKEYL-------AE-YYNSEFQVVLQELVEGSSQKGFEYVY 132 (510) Q Consensus 81 ~~~~~l~g--~p~~-----~~~~d~-------------~~~~~l-------~~-~~~n~~~~~~~e~~~~~~~~G~~~~~ 132 (510) +.++-|++ -|+. +..+++ ++...| .. +..+||.....++.++..++|.|.++ T Consensus 60 ~LAa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~ 139 (514) T protein:vir:80 60 NLTAKLALTLFPPGRPSFQIELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFY 139 (514) T ss_pred HHHHHHHhhhcCCCCcccccccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEE Confidence 66655543 2322 332221 122222 12 22478888999999999999998665 Q ss_pred EEECCCCceEEEEEcccceEEEEcCCCCceeEEEEEEEEEee------------CCceeEEEEEEEEcCCcEEEEEEcCC Q lcl|NC_013644. 133 ARTNAEDRLCFQVADSLNVFGVYNEYNELQRICRHYITEIEK------------DGETVDIHHAEVWTDQNVYFFVAEDN 200 (510) Q Consensus 133 v~~d~~g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~e~y~~~~i~~~~~~~~ 200 (510) + +++ .-.++.++-.+++..-|..+++..+++-....... .........+++|+. .++..... T Consensus 140 ~--~~~-~~~~~~~pl~~y~v~~d~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~---v~~~~~~~ 213 (514) T protein:vir:80 140 R--EPG-TGKMLVWTMQSYTVRRTSHGDPAVVVLRQQMPFRELTPEIQADAQAKQIAKRDSDKCDLYTV---IEWQPTPN 213 (514) T ss_pred E--ecC-CCcEEEEEcCeEEEeeCCCcCeEEEEeeeeecHHHhhhhhhhhhhhhhccCCCCCceEEEEE---EEeecCCC Confidence 5 332 22355566666666667777777666544333211 000111122333321 01111111 Q ss_pred ceeeccccccccccccccccccccc-ccccccCCcccEEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_013644. 201 KDYELDEAEPINPRPHVLAVDSENE-SLLQRSYGQIPFYRLS-----NNKQETTDLKPIKALIDDYDLMNCFLSNNLQDF 274 (510) Q Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~g~iPvv~~~-----nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~ 274 (510) +.. .......++... .....++..+|++.++ .+.+|+|-.+...+-+..+|.+.-......... T Consensus 214 ~~~----------~sv~~e~~g~~i~~es~y~~~e~P~i~~Rw~~~~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~~~a 283 (514) T protein:vir:80 214 GKR----------CAVWHELEGKRVGPESSYPAHLCPYVPVAWNVPDGEHYGRGYVEEYSGDFARLSILSERLGLYEFEA 283 (514) T ss_pred CeE----------EEEEEeccceeecccCccccccCCeeeeeeEecCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 000 000000111111 1111223346766654 346799999999999999998877777777777 Q ss_pred ccceeEEecCCCCchhhhhHhhhcCeeeeccCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCccccccccCcccH Q lcl|NC_013644. 275 AEAIYVVSGFQGDDLSKLRQNVKSKKVVGTGSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITN 352 (510) Q Consensus 275 ~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg 352 (510) ..|.+.+.-.....+..+.. ...+.+..+..++++.+... .+.......++.++..|-..-. ...........|+ T Consensus 284 ~~~~~~v~~~g~~~~~~l~~--~~~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aFm-l~~~~rd~~rvTA 360 (514) T protein:vir:80 284 LSLLNLVDEAKGGAVDDYRD--AETGDFVPGQVGSVASYERGDYNKIAQASASVESIVMRLNRAFM-YTGQVRDAERVTV 360 (514) T ss_pred cCCCceeCcccccchhhhcc--cCCceeecCCCccceeeecCcccchHHHHHHHHHHHHHHHHHHh-hhccCCCCCCCCH Confidence 77666542211222222211 12234445555667776543 4677777888888877754211 1111112233566 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHhhc--cCC-ccccceeeEEeCCCCC-CCHHHHHH-- Q lcl|NC_013644. 353 IVIKARYTLLNMKANKTEARLRALLEWM--------NKLVIDDINRR--YTK-AFDPTEVSFTFTREVM-VNETDIVN-- 418 (510) Q Consensus 353 ~Ai~~~~~~l~~k~~~k~~~~~~~l~~~--------~~~i~~~~~~~--~~~-~~~~~~v~i~f~~~~p-~d~~e~~~-- 418 (510) +.+.. +..+|+..++..+.++ ++..+.++... +.- +....-+.+.+..++. -.....++ T Consensus 361 tEV~~-------r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~il~r~~~g~lP~~p~~l~~~~~vs~la~l~r~~~~~~l 433 (514) T protein:vir:80 361 EEIRT-------VAEEAENLLGGVYSLLAETLQAPLAYLTMYEASRGNGGMLLGIAQGVYRPSIITGIPALTRNIETANI 433 (514) T ss_pred HHHHH-------HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCchhhcceeeecHHHHHHHHHHHHH Confidence 66654 4455555555555542 22222222211 111 1111123444433321 11111111 Q ss_pred -----HHHHHHhcC-----CCchHHHHHhC------C---CCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCC Q lcl|NC_013644. 419 -----DEKTEAETR-----KIILESILQVA------P---RLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTD 479 (510) Q Consensus 419 -----~~~~~~~~g-----~iS~et~~~~~------~---~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~ 479 (510) .+..+.+.. .+....++..+ | .+.++|.....+++...++.+...............+--+ T Consensus 434 ~~~~~~i~~l~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (514) T protein:vir:80 434 LRATQEASAIVPALVQLSKRFDPEKLVERIFANNSVDLSTLSKDPDVVAAEAEQEAALAQQQLDVASGALAAETSAGVLT 513 (514) T ss_pred HHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHhCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccC Confidence 111111110 01122333321 1 1223333332222222211111111110110000001011 Q ss_pred C Q lcl|NC_013644. 480 E 480 (510) Q Consensus 480 ~ 480 (510) + T Consensus 514 ~ 514 (514) T protein:vir:80 514 S 514 (514) T ss_pred C Confidence 1 No 238 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=93.06 E-value=0.0089 Score=31.82 Aligned_cols=318 Identities=8% Similarity=-0.051 Sum_probs=123.6 Q ss_pred EEEEEEECCCCceEEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccc Q lcl|NC_013644. 129 EYVYARTNAEDRLCFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEA 208 (510) Q Consensus 129 ~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~ 208 (510) +++++|.-.+|...+.-+.+. +. ..+ .++.+..+++...++...... T Consensus 1 v~Eivw~~~~g~~~~~~l~~r-------~~---~~~-----------------~~f~~~~~~~l~~~~~~~~~g------ 47 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAWR-------PP---RTI-----------------SRFDVAPDGGLVAIEQWGVFG------ 47 (355) T ss_pred CeEEEEEeeCCeEEEeeeeec-------Cc---cce-----------------eeeeeccCCceeEEEecCCCC------ Confidence 555555433332222111110 00 000 000111111111111110000 Q ss_pred cccccccccccccccccccccccCCcccEEEe--cCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCC Q lcl|NC_013644. 209 EPINPRPHVLAVDSENESLLQRSYGQIPFYRL--SNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQG 286 (510) Q Consensus 209 ~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~--~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~ 286 (510) .......+++.|-..+- ..|+.|.|.+..+--..--=+..+.+++..++.+..|+.+.+|..+ T Consensus 48 ---------------~~~~~lp~~kfi~~~~~~~~g~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~ 112 (355) T protein:vir:78 48 ---------------KATVRIPVDRLVVFVNEREGANWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPL 112 (355) T ss_pred ---------------CCcceeccCCEEEEEeCCCCCCccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCC Confidence 00000011122211111 1356788888776554444456678888899999888888777533 Q ss_pred Cch-------------------hhhhHhhhcC--eeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccc- Q lcl|NC_013644. 287 DDL-------------------SKLRQNVKSK--KVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQ- 344 (510) Q Consensus 287 ~~~-------------------~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~- 344 (510) ... ......+..+ ....++.+.++++++.......+...++.+.+.|...-.+..+.. T Consensus 113 ~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~~a~~iip~g~~ie~~ea~g~~~~~~~~i~~~d~~Isk~iLGqtlTs~ 192 (355) T protein:vir:78 113 PEAIARDTARAEQWLNDQKEEGLQLAKEFRAGEAAGGYIPHGANFTLTGVQGKLPEMDGPIRYHDEQIARAVLAHFLTLG 192 (355) T ss_pred CCcccchhhhHHHHHHHHHHHHHHHHHHhhCCcceeEeecCCceEEEeecCCCcccHHHHHHHHHHHHHHHHhhhhhccc Confidence 111 0011112112 355688888999998776666677888998888876543333221 Q ss_pred ---cccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHH Q lcl|NC_013644. 345 ---VGDGNITNIVIKARYTLLNMKANKTEARLRALLE-WMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDE 420 (510) Q Consensus 345 ---~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~-~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~ 420 (510) ++++.+.|..-. .-....+..-.+.+...|. ++++.++.+ +. +. ...-..++|.. .+.+....++.+ T Consensus 193 ~~~~gGS~Alg~vh~---~v~~~~~~aD~~~i~~~ln~~li~~l~~l-N~--~~--~~~~P~~~~~~-~~~~~~~~a~~~ 263 (355) T protein:vir:78 193 GDKSTGSYALGDTFA---SFFTGSLNAVMKHIADVTQQHVVEDLVDQ-NW--GP--EEPAPRLVPAQ-LGKEQPVTAEAI 263 (355) T ss_pred cCCccchhhHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cC--CC--CCCCCEEEecC-cChhHHHHHHHH Confidence 122223233322 1122233333355555563 466655543 21 11 11234567754 456667789999 Q ss_pred HHHHhcCCC-chH----HHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHh-hhccCCCCCCCCCcccCCCCCCccccc Q lcl|NC_013644. 421 KTEAETRKI-ILE----SILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEE-AEYTKGLSDNTDEEETAVNPDDPTQQM 494 (510) Q Consensus 421 ~~~~~~g~i-S~e----t~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (510) .++...|+. +.+ .+.+.++.-.+.+.+. ............. .....+.....+.....+...++.+.. T Consensus 264 ~~l~~~G~~~~~~~~~~~~~e~~gip~p~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~a~~~~~~~ 337 (355) T protein:vir:78 264 RALVECGAFTADPELEKDLRARYGLPAPAERDD------GADAAAAKAAGRRRAKRLPGQRQGAALPSRSPRADPPRRRG 337 (355) T ss_pred HHHHhCCCccccHHHHHHHHHHhCCCCCCCCCc------ccCCccccccccccccccCCccccccccccCCCCCChhhhH Confidence 999999974 432 2344444311111000 0000000000000 000000000000000000000000000 Q ss_pred ccCcccccccccC---CCC Q lcl|NC_013644. 495 AEGATGSTESQLP---ENG 510 (510) Q Consensus 495 ~~~~~~~~~~~~~---~~~ 510 (510) . --...-...-| ..| T Consensus 338 ~-~~~~~~~~~~~~~~~~~ 355 (355) T protein:vir:78 338 P-LRRRPRHPAHRRCAPDG 355 (355) T ss_pred H-HHHHhhccccCCCCCCC Confidence 0 00000000001 111 No 239 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=92.64 E-value=0.011 Score=31.42 Aligned_cols=394 Identities=11% Similarity=0.019 Sum_probs=171.8 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |+=-....+.+..+..... .+ +.... -|-+.-.++++........-...++-. -......-.+. T Consensus 3 ~~~~~~p~~~~~~~~~~~~--~~--------~~~~~-g~~~~D~~lr~~gg~~~~~~~l~~~m~-----e~D~~v~s~l~ 66 (446) T protein:vir:98 3 MEVRNAPTPAIRRRTIYAM--EH--------LGLAT-SYLSEDGGYKRAGKPTYQQLSAWDEAA-----QTEPIIAQGLD 66 (446) T ss_pred ccccCCCchhhhhhhhhcc--cc--------chhhc-ccCCcchHhhhcCCChHHHHHHHHHHH-----hcchHHHHHHH Confidence 4444444455444433221 11 11111 122222233221100000000000000 01344555556 Q ss_pred HHHhhhhcCCceeccCcHHHHHHHHHHhcc-CHHHHHHHHHHHHHhcCe-EEEEEEECCCCceE-EE------EEcccce Q lcl|NC_013644. 81 QKTQYLLSNPVEYETENEELKEYLAEYYNS-EFQVVLQELVEGSSQKGF-EYVYARTNAEDRLC-FQ------VADSLNV 151 (510) Q Consensus 81 ~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n-~~~~~~~e~~~~~~~~G~-~~~~v~~d~~g~~~-i~------~~~p~~~ 151 (510) +...-+.|-+.++.+.++++.+++.+++.+ .+...+. ...++..+|. +.+++|.-..|.-. .+ .+.|... T Consensus 67 ~Rk~av~~~~w~V~p~~~~~a~~v~~~l~~~~~~~~~~-~~ldai~~G~s~~Eivw~~~~g~~~p~~~~d~~~~~~~~~~ 145 (446) T protein:vir:98 67 SIALSVLNKVGPYQHGDKRIKKFIDDQLRNRAKTWISH-CVKSIMTYGFSLSEQIYAHGARDNMPATVLDDIVNYHPLQV 145 (446) T ss_pred HHHHHhhcCCceecCccHHHHHHHHHHHhhcCchhHHH-HHHHHHhhCceeeeEEEeecccccccchhhccccccccccc Confidence 666667788888988899999999999864 3333333 3568888996 45667754333211 11 1122111 Q ss_pred EEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEE-EEcCCcEEEEEEcCCceeecccccccccccccccccccccccccc Q lcl|NC_013644. 152 FGVYNEYNELQRICRHYITEIEKDGETVDIHHAE-VWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQR 230 (510) Q Consensus 152 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e-~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (510) -=.+|+...+... ..... ..... .|.+- ..+.... + ....... ....+. T Consensus 146 r~~~~~~~~~~~~---~~~~~--------~~~~~~~~~~~--~~~~~~~-------------~-~~~~~~~---g~~~~i 195 (446) T protein:vir:98 146 MLIANDNGRIVDG---DTVTA--------SQYKSGYWVPL--PPYRIGD-------------P-PKKVDVV---GSHVRL 195 (446) T ss_pred eeeeccCCccccc---cccch--------hhcccccccCc--ccchhhh-------------h-hhhcccC---cccccc Confidence 1112221110000 00000 00000 00000 0000000 0 0000000 000111 Q ss_pred cCCcccEEEec---CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchh----------------h Q lcl|NC_013644. 231 SYGQIPFYRLS---NNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLS----------------K 291 (510) Q Consensus 231 ~~g~iPvv~~~---nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~----------------~ 291 (510) |..+.=++.+. .++.|.|.+..+--.---=+..+-+++..++.++.|+.+.+-..+.... . T Consensus 196 P~~kfi~~~~~~~~~~p~G~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~ 275 (446) T protein:vir:98 196 PSHKRLFINYNTKGNNPWGTSCLTSVLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQ 275 (446) T ss_pred cccceEEEEecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHH Confidence 12221111111 2467888877654443333566778888999999999987643221110 1 Q ss_pred hhHhhh---cCeeee-----ccCCCceeEEeecCC-HHHHHHHHHHHHHHHHHHhCCcccc----cc-ccCcccHHHHHH Q lcl|NC_013644. 292 LRQNVK---SKKVVG-----TGSDGGLDVKTVTIP-TEGRKTKMEIDKENIYKFGMAFDST----QV-GDGNITNIVIKA 357 (510) Q Consensus 292 ~~~~~~---~~~~~~-----~~~~~~~~~~~~~~~-~~~~~~~~~~l~~~i~~~s~~p~~~----~~-~~g~~Sg~Ai~~ 357 (510) ..+.+. ...... .+++..+++++...+ ...++..++.+.+.|...-.+..+. .. +++++-|..-.- T Consensus 276 L~~av~~~~~da~~ii~~~~~P~g~eie~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~ala~vh~~ 355 (446) T protein:vir:98 276 AEDALRRLSTDSGLVLTQLSKEQPVQVGALTTGNNFSDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTGRASEIQLE 355 (446) T ss_pred HHHHHHhccccceeeeecccCCCCceEEeeccccCChhhHHHHHHHHHHHHHHHHhcccccccccccccchhhhHHHHHH Confidence 222221 111122 277888999877644 3468899999999998754333222 11 122222222211 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhccCCccccc---eeeEEeCCCCCCCHHHHHHHHHHHHhcCCC-c-- Q lcl|NC_013644. 358 RYTLLNMKANKTEARLRALLE-WMNKLVIDDINRRYTKAFDPT---EVSFTFTREVMVNETDIVNDEKTEAETRKI-I-- 430 (510) Q Consensus 358 ~~~~l~~k~~~k~~~~~~~l~-~~~~~i~~~~~~~~~~~~~~~---~v~i~f~~~~p~d~~e~~~~~~~~~~~g~i-S-- 430 (510) ... ..+..-.+.+...+. ++++-++.+ + ++....+. .-.++|...-+.|....++.+.+++..|++ + T Consensus 356 V~~---d~~~aDa~~i~~tln~~Li~~l~~l-N--f~~~~~~~~~~~~~~~~~~~e~eDl~~~a~~~~~L~~~G~~~p~~ 429 (446) T protein:vir:98 356 LFD---GKINSIFDTVIHAFTEQVIGNLIRL-N--FDPALYPLASNTGYITRLPGRATDLAALVEAIKQMHDMGFLVDGD 429 (446) T ss_pred HHH---HHHHHHHHHHHHHHHHHHHHHHHHh-C--CCccccccccccccceeccCChhhHHHHHHHHHHHHhCCcccccc Confidence 111 122233344445553 455555542 2 22111111 123456666788999999999999999973 3 Q ss_pred hHHHHHhCCCCCcHHHHH Q lcl|NC_013644. 431 LESILQVAPRLDDDNVLR 448 (510) Q Consensus 431 ~et~~~~~~~v~d~e~~~ 448 (510) .+.+.+.++.-.-.+ .. T Consensus 430 ~~~ire~~giP~~~~-~~ 446 (446) T protein:vir:98 430 KDHIRSITGLPDAIS-ST 446 (446) T ss_pred HHHHHHHhCcCCCCC-CC Confidence 344555554311110 00 No 240 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=92.45 E-value=0.011 Score=31.24 Aligned_cols=399 Identities=11% Similarity=-0.018 Sum_probs=151.1 Q ss_pred HHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhc----------cc---ceeccccccccccccc---cceeccchhH Q lcl|NC_013644. 13 ANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNN----------RI---FYVDDEGILREDKYAS---NVRIPHGFFP 76 (510) Q Consensus 13 ~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~----------~~---~~~~~~~~~~~~~~~~---~~ki~~n~~~ 76 (510) =.++..+....+...+ .. +..+-...+..... +. .............+.+ ..-+..+... T Consensus 1 M~~~~~l~~~~~~~~~-~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~g~~v~~~~a~~~~~v~ 76 (466) T protein:vir:81 1 MRLIDRLLSTRGAAPR-MS---IDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTELAPDTFVGLATQAYQANGPVF 76 (466) T ss_pred CchhHHHhhccCcccc-cc---hhhhhhhhhhhhccccccccccccHHHHHhhccccccccCccccccchhhhhccHHHH Confidence 0111111111110000 00 01111111100000 00 0000000000000000 0012234455 Q ss_pred HHHHHHHhhhhcCCceeccCcH----HH-HHHHHHHhc--cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCc------ Q lcl|NC_013644. 77 EIVDQKTQYLLSNPVEYETENE----EL-KEYLAEYYN--SE---FQVVLQELVEGSSQKGFEYVYARTNAEDR------ 140 (510) Q Consensus 77 ~Iv~~~~~~l~g~p~~~~~~d~----~~-~~~l~~~~~--n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~------ 140 (510) ..|+..+.-+.+-|+.+.-.++ .+ ...+..++. |. .......+....+.+|.||.++..+..|. T Consensus 77 ~~i~~Ia~~ia~lp~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l~~~~~ 156 (466) T protein:vir:81 77 ACMLVRQLVFSSVRFRWQRLRDGKPSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFVRMRPDWV 156 (466) T ss_pred HHHHHHHHhhccCceEEEEecCCceeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCccccccccC Confidence 5666666666667777532111 11 112233332 22 23455667788999999999998877654 Q ss_pred ---eEEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccc Q lcl|NC_013644. 141 ---LCFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHV 217 (510) Q Consensus 141 ---~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (510) ..+.+++|..+.+..+..+.... .+++ .. ++.... .....+.... T Consensus 157 g~~~~l~~l~~~~v~~~~~~~~~~~~--~y~~-~~--~~~~~~-~~~~~~~~~d-------------------------- 204 (466) T protein:vir:81 157 DVVVEERMVRGGRGELGGGQLGWRKV--GYLY-TE--GGRQSG-NESVGFLAED-------------------------- 204 (466) T ss_pred cceeEEEEecCcceEEEEcCCCceEE--EEEE-Ee--cCcccc-cceeeecccc-------------------------- Confidence 34666777777666543322111 1111 00 000000 0001112222 Q ss_pred ccccccccccccccCCcccEEEecCC------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCc--- Q lcl|NC_013644. 218 LAVDSENESLLQRSYGQIPFYRLSNN------KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDD--- 288 (510) Q Consensus 218 ~~~~~~~~~~~~~~~g~iPvv~~~nn------~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~--- 288 (510) |+||+.. -.|.|-+......|+....+..-..+.+.....|-.+++-...-+ T Consensus 205 -------------------viHir~~~~~~d~~~G~s~i~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~l~~e~ 265 (466) T protein:vir:81 205 -------------------VVHFAPIPDPLASYRGMSWLTPILREIRADQAMSKHQAKFFDNGATVNLVIKHNPMADPAA 265 (466) T ss_pred -------------------EEEEcCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHH Confidence 3444321 135666665555555544444444555666666766664322111 Q ss_pred hhhhhHhhh--------cCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccc---ccCcccHHHHHH Q lcl|NC_013644. 289 LSKLRQNVK--------SKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQV---GDGNITNIVIKA 357 (510) Q Consensus 289 ~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~---~~g~~Sg~Ai~~ 357 (510) ...++..+. .++++.++++.+++.++.......+.+..+...+.|...-++|++-.+ ..+.+++..++- T Consensus 266 ~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lG~~~~~~~st~sn~eq 345 (466) T protein:vir:81 266 VKKWADEVNSKHAGVDNAWKNLNLYPGADADVVGSNLQEIDFKNVRGGGETRIAAAAGVPPVIVGLSEGLAAATYSNYGQ 345 (466) T ss_pred HHHHHHHHHHHhcCccccccceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccccCCCccccccHHH Confidence 112222211 133566776666666665545556667778888899988889975432 122233222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeC--CCCCCCHHHHHHH-------HHHHHhcCC Q lcl|NC_013644. 358 RYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFT--REVMVNETDIVND-------EKTEAETRK 428 (510) Q Consensus 358 ~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~--~~~p~d~~e~~~~-------~~~~~~~g~ 428 (510) ... ..+...|.-.++.|...+..+--.......+.+.|+ .-+-.|..+.++. +..++++|+ T Consensus 346 ~~~----------~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~llr~d~~~r~~~~~~~~~~~~~~~~~g~ 415 (466) T protein:vir:81 346 ARR----------RLADGTAHPLWQNLSGCIGHVMPDMGPDVRLWYDADDVPFLREDEKDAADIQKVRAETINTLITAGY 415 (466) T ss_pred HHH----------HHHHHHHHHHHHHHHHHHHhhcCCcccCcceEEEecchhhhccCHHHHHHHHHHHHHHHHHHHHcCC Confidence 211 222333333333333333221111111122345554 4444566655543 334455553 Q ss_pred CchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhh--ccCCCCCCCCCcccCCCCCCcccccccCccccccccc Q lcl|NC_013644. 429 IILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAE--YTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQL 506 (510) Q Consensus 429 iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (510) ....++...+.-+.. .. ..... ......+ .........+..+.+. T Consensus 416 -t~nE~r~~~~~gd~~-~~---------~~~~~-~~~~~~~~~~~~~~~~~~~~~~Gg~--------------------- 462 (466) T protein:vir:81 416 -EPESVVAAVNSGDLR-LL---------KHTGL-TSVQLLPPGVSASASSDTPTSGGAD--------------------- 462 (466) T ss_pred -ChhhccccccCCccc-cc---------cCCCc-chhhhcccccccccCCCCcccCCCC--------------------- Confidence 443343322211110 00 00000 0000000 0000000000000000 Q ss_pred CCCC Q lcl|NC_013644. 507 PENG 510 (510) Q Consensus 507 ~~~~ 510 (510) .|| T Consensus 463 -~ng 465 (466) T protein:vir:81 463 -DNG 465 (466) T ss_pred -cCC Confidence 111 No 241 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=90.37 E-value=0.021 Score=29.76 Aligned_cols=379 Identities=11% Similarity=0.028 Sum_probs=150.0 Q ss_pred hhcccceeccccccc--cccccc-cce--eccchhHHHHHHHHhhhhcCCceeccCcHH--HHHHHHHHhc---cC---H Q lcl|NC_013644. 46 MNNRIFYVDDEGILR--EDKYAS-NVR--IPHGFFPEIVDQKTQYLLSNPVEYETENEE--LKEYLAEYYN---SE---F 112 (510) Q Consensus 46 ~~~~~~~~~~~~~~~--~~~~~~-~~k--i~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~--~~~~l~~~~~---n~---~ 112 (510) +. ....+..+... .....+ ..+ ...+....-|+..++-+-+-|+.+--.+.. ...-+-.++. |. . T Consensus 1 ~~--~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~~~~~~~~~l~~lL~~~PN~~~t~ 78 (723) T protein:vir:94 1 MT--TFPSGAGGWNAWSADSVFGNGAKGWSNSAVAYRCISMLANNAASVDLVVRGPDGELDELHPLSQLWNVMPNRAMPA 78 (723) T ss_pred Cc--ccccCCCccccccccccccccHHHHhhhHHHHHHHHHHHHhhccceeEEEcCCCccchhhHHHHHHhhCCCCCCCH Confidence 00 00000000000 000000 000 111222333444555555567665322211 1112333332 22 2 Q ss_pred HHHHHHHHHHHHhcCeEEEEEEECC---CCce-EEEEEcccceEEEEcCCCCceeEEE--EEEEEEeeCCceeEEEEEEE Q lcl|NC_013644. 113 QVVLQELVEGSSQKGFEYVYARTNA---EDRL-CFQVADSLNVFGVYNEYNELQRICR--HYITEIEKDGETVDIHHAEV 186 (510) Q Consensus 113 ~~~~~e~~~~~~~~G~~~~~v~~d~---~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~--~~~~~~~~~~~~~~~~~~e~ 186 (510) ......+......+|.+|+++..+. .|.+ .+..++|+.+.++..+..+...... .|.+.. .++. ... T Consensus 79 ~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~~~~~~~~~~~y~~~~-~~G~------~~~ 151 (723) T protein:vir:94 79 QVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAADAVPQAQIIGYVIER-TDGV------RVP 151 (723) T ss_pred HHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCCccceeeeeeEEEEEe-cCce------eEE Confidence 3344455677888999998876543 3443 4566677666555443322111100 010000 0000 000 Q ss_pred EcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecC-----CCCCCCcHHHHHHHHHHHH Q lcl|NC_013644. 187 WTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSN-----NKQETTDLKPIKALIDDYD 261 (510) Q Consensus 187 y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----n~~g~sd~~~v~~liD~~n 261 (510) +.... |+|++. .-.|.|.+......|.... T Consensus 152 ~~~~d---------------------------------------------IiHir~~~~~dg~~G~Spi~~a~~~i~~~~ 186 (723) T protein:vir:94 152 VLADE---------------------------------------------MLWLRFSDPYDPLAVMAPWKAARAAVDADF 186 (723) T ss_pred ecccc---------------------------------------------eEEecCCCCCCCcccccHHHHHHHHHHHHH Confidence 11111 334332 1246666665555555443 Q ss_pred HHHHHHHHHHHHhccceeEEecCCCCc--hhhhhHhhh--------cCeeeeccC--------CCceeEEeecCCH--HH Q lcl|NC_013644. 262 LMNCFLSNNLQDFAEAIYVVSGFQGDD--LSKLRQNVK--------SKKVVGTGS--------DGGLDVKTVTIPT--EG 321 (510) Q Consensus 262 ~~~S~~~~~~~~~~~~~lv~~g~~~~~--~~~~~~~~~--------~~~~~~~~~--------~~~~~~~~~~~~~--~~ 321 (510) .+..-..+.+...+.|-.++.--..++ ...+...++ .++.+.++. +.+++|.....+. .. T Consensus 187 aa~~~~~~~f~NG~~p~giL~~~~l~~e~~~~~~~~~~~~~~G~~Nagk~~vL~g~~~~~~vl~~G~~~~~l~~s~~D~q 266 (723) T protein:vir:94 187 YAATWQRQSFKNGARPGGVVNLGDMDEQTFTKTVAAFRSQVEGVQNAGRHLLIAGQGSDGGAAGKGATFTSLSMSPAEMD 266 (723) T ss_pred HHHHHHHHHHhcCCCcceEEEcCCCCHHHHHHHHHHHHHHhhchhhcCcceeecccccccccccCCceEEEccCCHHHHH Confidence 333333444555566666665322211 111111111 123343432 1245665544443 34 Q ss_pred HHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCcccccee Q lcl|NC_013644. 322 RKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEV 401 (510) Q Consensus 322 ~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v 401 (510) +.+......+.|...-++|+.-..+.++-|+.+ .... ..+...|.-.++.|...++.+--... ...+ T Consensus 267 ~le~r~~~~~eIa~afgVPp~~i~~~st~sN~e--~~~~----------~f~~~tL~P~~~~ie~~ln~~Ll~~~-g~~~ 333 (723) T protein:vir:94 267 YINSRMHSAEEVMLAFGIRKDALLGGSTYENQA--EAKA----------AVWTETLIPQMEVMASITDLQLLPDI-GWTV 333 (723) T ss_pred HHHHHHHhHHHHHHHhCCChhHcCCCCCcccHH--HHHH----------HHHHHHHHHHHHHHHHHHhHhhcccc-cCce Confidence 555566677788888888875332222112111 1111 12233344444444443332211111 1246 Q ss_pred eEEeCC--CCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCC Q lcl|NC_013644. 402 SFTFTR--EVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDN 477 (510) Q Consensus 402 ~i~f~~--~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~ 477 (510) .+.|+. -+..|..+.++.+.+++.+|+++.-.+.+.++. +..-..+ ....|....... T Consensus 334 ~~~f~~~~lLr~D~~~r~~~~~~~v~~G~~T~NE~R~~lglpPi~gGd~~-----------------~~~~p~~~~~a~- 395 (723) T protein:vir:94 334 EWDFNSVPALQEDLEAQAGRNQGYLVNDVLMVDEVRATIGLDPLPGGIGQ-----------------MTLTPYRAQFAP- 395 (723) T ss_pred EEeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCccc-----------------ceeccccccccC- Confidence 677764 456899999999999999999999888887643 2110000 000000000000 Q ss_pred CCCcccCCCCCCccccc---ccCcccccccccCCCC Q lcl|NC_013644. 478 TDEEETAVNPDDPTQQM---AEGATGSTESQLPENG 510 (510) Q Consensus 478 ~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~ 510 (510) .+...+...++. ..-. ...++-.....+|.++ T Consensus 396 ~~~~~p~~~e~~-~~~~~~~~~~~~~~p~~~~~~~~ 430 (723) T protein:vir:94 396 APAPAPAVEEGA-ARMLALLERVAADRPLPELPVRA 430 (723) T ss_pred CCCCCccchhhh-HhhhhhccccccccCcCCCCCCC Confidence 000000000000 0000 0000001122233333 No 242 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=90.13 E-value=0.022 Score=29.62 Aligned_cols=387 Identities=10% Similarity=0.023 Sum_probs=156.8 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCC-----cchhcccceeccccccccccccccceeccchh Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHEN-----DIMNNRIFYVDDEGILREDKYASNVRIPHGFF 75 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~-----~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~ 75 (510) +..+...+.... ..+...+..|. |.|-. .|+.... .-...+. .. ..... T Consensus 15 ~~~~~~~~~~~~-~~~~~~~~~~~--------------~~g~~~~~~~~iLr~~~-----~~~ly~~-----m~-~D~hi 68 (448) T protein:vir:79 15 PGSIDPSDVPKL-EGASVPVMSTS--------------YDVVVDREFDELLQGKD-----GLLVYHK-----ML-SDGTV 68 (448) T ss_pred ccccccccchhh-hhhhhhhcccc--------------cccccccchhHhhcccc-----chHHHHH-----Hh-hChHH Confidence 000000000000 00000000000 00000 1110000 0000000 00 13445 Q ss_pred HHHHHHHHhhhhcCCceeccCc-----HHHHHHHHHHhcc--------CHHHHHHHHHHHHHhcCeE-EEEEEE-CCCCc Q lcl|NC_013644. 76 PEIVDQKTQYLLSNPVEYETEN-----EELKEYLAEYYNS--------EFQVVLQELVEGSSQKGFE-YVYART-NAEDR 140 (510) Q Consensus 76 ~~Iv~~~~~~l~g~p~~~~~~d-----~~~~~~l~~~~~n--------~~~~~~~e~~~~~~~~G~~-~~~v~~-d~~g~ 140 (510) .-.+.+...-+.|.+..+.+.+ ....+++.+++.. +|.+.+. -+.++..+|.+ ++++|. ..+|+ T Consensus 69 ~s~l~~Rk~av~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~~~~f~~~~~-~~lda~~~G~s~~Eivw~~~~~g~ 147 (448) T protein:vir:79 69 KNALNYIFGRIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFA-IYENAYIYGMAAGEIVLTLGADGK 147 (448) T ss_pred HHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHhhhhhhhhccCCHHHHHH-HHHHhhhhcceeEEEEeeecCCCc Confidence 5556666677888888886422 2344556665532 3444433 35668888875 456664 45666 Q ss_pred eEEE---EEcccce-EEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccc Q lcl|NC_013644. 141 LCFQ---VADSLNV-FGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPH 216 (510) Q Consensus 141 ~~i~---~~~p~~~-~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 216 (510) ..+. +.++... .-.|+..+.+ .+....+..... T Consensus 148 ~~~~~l~~r~~~~~~~f~~~~d~~l--------------------------------~~~~~~~~~~~~----------- 184 (448) T protein:vir:79 148 LILDKIVPIHPFNIDEVLYDEEGGP--------------------------------KALKLSGEVKGG----------- 184 (448) T ss_pred eecccccccCCccccceeeecCCce--------------------------------EEeecCCccccc----------- Confidence 5332 2233211 1112211111 000000000000 Q ss_pred cccccccccccccccCCcccEEEecC----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCc--hh Q lcl|NC_013644. 217 VLAVDSENESLLQRSYGQIPFYRLSN----NKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDD--LS 290 (510) Q Consensus 217 ~~~~~~~~~~~~~~~~g~iPvv~~~n----n~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~--~~ 290 (510) .......+.+++++ +++.+ ++.|.|.+..+--..--=+..+.+++..++.++.|+++.+-..+.+ .. T Consensus 185 -----~~~~~~~~lP~~~~--i~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~ 257 (448) T protein:vir:79 185 -----SQFVSGLEIPIWKT--VVFLHNDDGSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGTK 257 (448) T ss_pred -----ccCCCccccccceE--EEEecCccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCceEEEecCCCCCcCHH Confidence 00000012233332 22222 4567788877665555556677899999999999999887543322 11 Q ss_pred h------hhHhhh--cCeeeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHH Q lcl|NC_013644. 291 K------LRQNVK--SKKVVGTGSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLL 362 (510) Q Consensus 291 ~------~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l 362 (510) + ...++. ......++++.++++++...+...+...++.+.+.|...-.+-..+.+..|+.+..+......-. T Consensus 258 ~~~~l~~av~~i~~g~~a~~iiP~~~~ie~~ea~~~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~g~~~~~~~~~~~v~ 337 (448) T protein:vir:79 258 QWEAAKEIVKNFVQKPRHGIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARALGIDFNTVQLNMGVQAINIGEFVSLT 337 (448) T ss_pred HHHHHHHHHHHHhcCCceEEEecCCceEEEEecCCCcccHHHHHHHHHHHHHHHHhhhhhccccccchhhhhhhhHHHHH Confidence 1 122222 23345688899999998776655666788888888866532222221121111111221111111 Q ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCC Q lcl|NC_013644. 363 NMKANKTEARLRALLE-WMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRL 441 (510) Q Consensus 363 ~~k~~~k~~~~~~~l~-~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v 441 (510) ...+..-.+.+...+. +++.-++.+ + .+.. ..-..+.|...-+.|.++.++.+.+++..+.++ T Consensus 338 ~~~~~aDa~~i~~tln~~li~~l~~l-N--fg~~--~~~P~~~f~~~e~~Dl~~~a~~~~~l~~~~~~~----------- 401 (448) T protein:vir:79 338 QQTIISLQREFASAVNLYLIPKLVLP-N--WPSA--TRFPRLTFEMEERNDFSAAANLMGMLINAVKDS----------- 401 (448) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh-c--CCCc--CCCcEEEecCCChHHHHHHHHHhhhhhccchhh----------- Confidence 1122223344455554 355544432 2 1111 112578888888888888887776554322110 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 442 DDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 442 ~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) + .+. ++ ....| ...+.+... ..+.. ...+.+..|-+++= T Consensus 402 ---~--~~~-------~~-----~~~~p------~~~~~~~~~-a~~~~------~~~~~~~~~~~~~~ 440 (448) T protein:vir:79 402 ---E--DIP-------TE-----LKALI------DALPSKMRR-ALGVV------DEVREAVRQPADSR 440 (448) T ss_pred ---H--HHH-------HH-----hhcCC------CCCCCcccc-ccCCC------CcccccccCCcccc Confidence 0 000 00 00000 000001100 01000 01111111111111 No 243 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=88.54 E-value=0.032 Score=28.80 Aligned_cols=453 Identities=11% Similarity=0.013 Sum_probs=163.9 Q ss_pred CCCccCCChhhhHHHHHH-----HHHhh-----hh-hhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccce Q lcl|NC_013644. 1 MEALLSEDVKIIANALKA-----AIDKD-----RK-SSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVR 69 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~-----~i~~~-----~~-~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~k 69 (510) -..+...+.+.-...|.. .++.. ++ .+-.++|+.+..+++- T Consensus 17 ~~S~vpp~~~~~~~~i~~g~~g~~v~~~g~~~~~n~~eLI~~YR~ma~~pEV---------------------------- 68 (564) T protein:vir:10 17 GQSPVPPNDEASVSTVAGGYFGTYVDTSGGQNSRNEYELIRRYRDMSLHPEV---------------------------- 68 (564) T ss_pred CCCcccCCcCCChhhhhccccceeeecccccchhhHHHHHHHHHHHhhccch---------------------------- Confidence 111111111111111100 00000 00 0011112222121211 Q ss_pred eccchhHHHHHHHHhh-hhcCCceeccCcHH--------HHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECC-- Q lcl|NC_013644. 70 IPHGFFPEIVDQKTQY-LLSNPVEYETENEE--------LKEYLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNA-- 137 (510) Q Consensus 70 i~~n~~~~Iv~~~~~~-l~g~p~~~~~~d~~--------~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~-- 137 (510) .+-...||+..+-+ -...||.+..++-+ ..+..+.+++ =+|+.+.++..+.+.+.|+.|++.-+|. T Consensus 69 --d~Av~eIVneaIv~d~~~~pV~vdL~~~~~s~siK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~ 146 (564) T protein:vir:10 69 --DSAIDEIVNEFVVNDGDDKPVEVDLQNLEIGSGVKKKIRDEFNRILRMMNFNVNAHEIIRNWYVDGRSHYHKVIDLDN 146 (564) T ss_pred --hhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeeCCC Confidence 12234444433222 23456666544422 2222233332 3677888999999999999999877763 Q ss_pred --CCceEEEEEcccceEEEEcCCCCcee-EEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccc Q lcl|NC_013644. 138 --EDRLCFQVADSLNVFGVYNEYNELQR-ICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPR 214 (510) Q Consensus 138 --~g~~~i~~~~p~~~~~~~d~~~~~~~-~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~ 214 (510) +|-..+..+||+.+-.|+-.-.+... ...++.. +...+...+-.+.+.|... +...... ...+.. T Consensus 147 pk~GI~eLr~lDPr~i~~vr~i~~~~~~~~~~v~k~---------~~~~~~y~~~~Eyy~Ynp~--~~~g~~~-~~~~~~ 214 (564) T protein:vir:10 147 PKKGILELRYIDSLKIRKVRQKLKDVDPNRKEIEKG---------TALQYDYGDFIEYYIYNPK--GFAGNIP-MVTGSM 214 (564) T ss_pred hhhhhhhhhhhcccceeeeeeeccccccccceeeee---------eeeeccccccccceeeccc--cccCccc-cccccc Confidence 35567888999998888732211111 1111110 0011111111122222111 0000000 000000 Q ss_pred cccccccccccccccccCCcccEEEecCCCCCC------CcHHHHHHHHHHHHH--HHHHHHHHHHHhccceeEE----e Q lcl|NC_013644. 215 PHVLAVDSENESLLQRSYGQIPFYRLSNNKQET------TDLKPIKALIDDYDL--MNCFLSNNLQDFAEAIYVV----S 282 (510) Q Consensus 215 ~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~------sd~~~v~~liD~~n~--~~S~~~~~~~~~~~~~lv~----~ 282 (510) .... ... =+||.-.+.+-..|. -.+.-+...|..+|. ++-|.+-..+..+.|=.=+ . T Consensus 215 ~~~~------~~~-----ikI~~daI~y~hSGL~d~~~~~i~gyLhkAIKp~NQLkmlEDAlVIYRitRAPeRRvFYIDV 283 (564) T protein:vir:10 215 DWSN------QEG-----IKIASDAIAQSTSGLMDLNKKMTLSFLHKAIKSLNQLRMIEDSLVIYRLSRAPERRIFYIDV 283 (564) T ss_pred cccc------ccc-----eeechhhcceecccceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEec Confidence 0000 000 022311111111111 122233444555553 3444444444444432211 1 Q ss_pred c-CCCCchhh----hhHhhhcCeeee--------------------c---cCCCceeEEeec--CCHHHHHHHHHHHHHH Q lcl|NC_013644. 283 G-FQGDDLSK----LRQNVKSKKVVG--------------------T---GSDGGLDVKTVT--IPTEGRKTKMEIDKEN 332 (510) Q Consensus 283 g-~~~~~~~~----~~~~~~~~~~~~--------------------~---~~~~~~~~~~~~--~~~~~~~~~~~~l~~~ 332 (510) | +...-... .+...+...+.. + +++.+.+.-|.+ .+.. -..-+.-+++. T Consensus 284 GnLPk~KAeqYlr~iM~k~KNklVYDa~TGevrddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLg-em~DV~YF~kK 362 (564) T protein:vir:10 284 GNLPKVKAEQYLRDVMSRYRNKLVYDGQTGEIRDDKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLG-ELKDVEYFKKK 362 (564) T ss_pred CCCCchhHHHHHHHHHHhcCceEEEeccCceecccchhhhhHhhhcccccCCCcccceeeccccCCcc-hHHHHHHHHHH Confidence 1 11111111 111111111111 1 112223333333 2322 23345556667 Q ss_pred HHHHhCCccc--cccc----cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCcccc--ceeeEE Q lcl|NC_013644. 333 IYKFGMAFDS--TQVG----DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDP--TEVSFT 404 (510) Q Consensus 333 i~~~s~~p~~--~~~~----~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~--~~v~i~ 404 (510) +|+.-.+|-. ...+ .|..|. |.......-.-+.+-+..|...+.++++.=+-+-++....+|+. ..+.+. T Consensus 363 LY~aLnVP~SRl~~e~~~f~~Gr~~E--ItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~ 440 (564) T protein:vir:10 363 LYNSLNLPPSRLTDDNKAFNLGKSTE--ILRDELKFTKFIGRLRKRFAQLFHDILKTQLILKGIITPEDWDDMEEHIQYD 440 (564) T ss_pred HHHHhCCCcccccCCCceeecccccc--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEE Confidence 7776677742 2221 133322 22222333344556666677777776665443334444445544 357778 Q ss_pred eCCCCCCCHHHHHHH-------HHHH--HhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHH-HHHHHhh---hcc Q lcl|NC_013644. 405 FTREVMVNETDIVND-------EKTE--AETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDV-KEALEEA---EYT 471 (510) Q Consensus 405 f~~~~p~d~~e~~~~-------~~~~--~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~-~~~~~~~---~~~ 471 (510) |...-.-.+...++. +..+ .-+..+|.+++.+.+=-.+|+|..++....+++..... ....+.. ..+ T Consensus 441 f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~~~~~~~ 520 (564) T protein:vir:10 441 FLFDNHFNELKEQEMQLQRVNLATQMDPFVGKYFSTEYIRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVNMLDDME 520 (564) T ss_pred eeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhhcCCCcc Confidence 865555444433333 2222 22334699999998666666654332222222211111 0110000 000 Q ss_pred CCCCCCCCCcccCCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 472 KGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) .+.....++... -.++...+++.....+.+..+..-+| T Consensus 521 ~~~~~~~p~~~~-~~~~~~~~~~~~~~~~a~~~~~~~~~ 558 (564) T protein:vir:10 521 KQNQAFAPELQA-AQDDLAAEREIKKLNSAPKPPPSQQS 558 (564) T ss_pred CCCCcCCcchhh-hccccccccChhhhccCCCCCCCCCC Confidence 011100111111 11111222222222222222222222 No 244 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=87.00 E-value=0.042 Score=28.15 Aligned_cols=423 Identities=9% Similarity=0.046 Sum_probs=157.3 Q ss_pred CCCccCCChhhhHH------------------------------------HHHHHHHhhhhhhhHHHHHHHHHHhccCCc Q lcl|NC_013644. 1 MEALLSEDVKIIAN------------------------------------ALKAAIDKDRKSSSKREAETGIRYYNHEND 44 (510) Q Consensus 1 ~~~~~~~~~~~~~~------------------------------------~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~ 44 (510) |--++.=..+...+ .-..+|. +|..+-.+++-. T Consensus 1 m~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~i~~~~~~~~~~~~e~~~~~~~eLI~---------~YR~ma~~pEvd-- 69 (533) T protein:vir:10 1 MSQLFGFSLERAKKAPKGPSFVQKDNLDGSQPVSGGGYYGYTVDFDGQVRNEYQLIS---------RYREMVLQPECD-- 69 (533) T ss_pred CccccccccccccccccCCCCCCCCcccccceeecccccceeeecccccchHHHHHH---------HHHHHhhccchh-- Confidence 22222111111111 1111111 111122222111 Q ss_pred chhcccceeccccccccccccccceeccchhHHHHHHHHhh-hhcCCceeccCc----HHHHHHHH----HHhc-cCHHH Q lcl|NC_013644. 45 IMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQY-LLSNPVEYETEN----EELKEYLA----EYYN-SEFQV 114 (510) Q Consensus 45 i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~-l~g~p~~~~~~d----~~~~~~l~----~~~~-n~~~~ 114 (510) +-...||+..+-+ ....||.+..++ +...+.|. .+++ =+|.. T Consensus 70 ----------------------------~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~ 121 (533) T protein:vir:10 70 ----------------------------SAVDDIVNETICGNFDDVPVSVELSNLKVSDKIKKLIREEFGEILRLLDFEN 121 (533) T ss_pred ----------------------------hHHHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccch Confidence 2233444433322 234566665544 23233333 2332 26778 Q ss_pred HHHHHHHHHHhcCeEEEEEEECCC----CceEEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCC Q lcl|NC_013644. 115 VLQELVEGSSQKGFEYVYARTNAE----DRLCFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQ 190 (510) Q Consensus 115 ~~~e~~~~~~~~G~~~~~v~~d~~----g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~ 190 (510) +.++..+.+.+.|+.|.+.-+|.+ |-..+..+||+.+-+|.--..+........+.....-. ...-+-+|++. T Consensus 122 ~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~ELr~lDPr~i~~vr~i~~~~~~~~~~~~~~~~v~~---~~~eyf~Ynp~ 198 (533) T protein:vir:10 122 RSYEIFRRWYVDGRLFYHKVIDPDNPQGGLIELRYIDPRKIRKINETEQKRPEQLRGLPLNQQLSP---KSAEYFLYDPK 198 (533) T ss_pred hhhHHHhhhhhcceEEEEEEecCCCccccceeeeeccccceeeeeeeeccCCCccceeecchhhhc---cceeeeeeccc Confidence 889999999999999998777643 66788999999987765211110000110000000000 01112233333 Q ss_pred cEEEEEEcCCceeecccccccccccccccccccccccccccCCccc---EEEecCCC---CCCCcHHHHHHHHHHHHH-- Q lcl|NC_013644. 191 NVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIP---FYRLSNNK---QETTDLKPIKALIDDYDL-- 262 (510) Q Consensus 191 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~nn~---~g~sd~~~v~~liD~~n~-- 262 (510) +... ..++. =+|| |++...+- .+--.+.-+...|..+|. T Consensus 199 g~~~---~~~~~------------------------------vkI~~dAI~y~hSGl~d~~~~~i~syLhkAiKp~NQLk 245 (533) T protein:vir:10 199 GLKN---STTQG------------------------------LKIAPDSICYVHSGIMDLNKNMTLSHLHKAIKAVNQLR 245 (533) T ss_pred cccc---cCCCc------------------------------eecchhheeeeeccceeCCCCceeccchHhHHHHHhhH Confidence 2210 00000 0111 11111100 011112233344445553 Q ss_pred HHHHHHHHHHHhccceeEE----ec-CCCCchhh----hhHhhhcCeeee--------------------c---cCCCce Q lcl|NC_013644. 263 MNCFLSNNLQDFAEAIYVV----SG-FQGDDLSK----LRQNVKSKKVVG--------------------T---GSDGGL 310 (510) Q Consensus 263 ~~S~~~~~~~~~~~~~lv~----~g-~~~~~~~~----~~~~~~~~~~~~--------------------~---~~~~~~ 310 (510) ++-|.+-..+..+.|=.=+ .| +...-... .+...+...+.. + +++.+. T Consensus 246 m~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLPRReGgrgT 325 (533) T protein:vir:10 246 MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGT 325 (533) T ss_pred HHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhhhcccccCCCCcc Confidence 3444444444444432211 11 11111111 111111111111 1 122223 Q ss_pred eEEeec--CCHHHHHHHHHHHHHHHHHHhCCccc--cccc---cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 311 DVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDS--TQVG---DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKL 383 (510) Q Consensus 311 ~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~--~~~~---~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~ 383 (510) +.-|.+ .+.. ...-+.-+++.+|+.-.+|-. ...+ .|..|. |.......-.-+.+-+..|...+.++++. T Consensus 326 EItTLpGgqnLg-em~DV~YF~kKLY~aLnVP~SRl~~e~~f~~Gr~~E--ItRDEiKF~KFI~RLR~rFs~lF~~~Lk~ 402 (533) T protein:vir:10 326 EITTLPGGQNLG-ELEDVKYFQKKLYKSLNVPGSRLETETTFNVGRAAE--ITRDEVKFQKFVARLRKRFSELFTDLLKT 402 (533) T ss_pred ceeeccccCCcC-hHHHHHHHHHHHHHHhCCCccccCCCCcccccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 333333 2222 234455566667776667742 2222 233332 32333333444556666777777776665 Q ss_pred HHHHHhhccCCcccc--ceeeEEeCCCCCCCHHHHHHH-------HHHH--HhcCCCchHHHHHhCCCCCcHHHHHHHHH Q lcl|NC_013644. 384 VIDDINRRYTKAFDP--TEVSFTFTREVMVNETDIVND-------EKTE--AETRKIILESILQVAPRLDDDNVLRLICE 452 (510) Q Consensus 384 i~~~~~~~~~~~~~~--~~v~i~f~~~~p~d~~e~~~~-------~~~~--~~~g~iS~et~~~~~~~v~d~e~~~~~~e 452 (510) =+-+-++....+|+. ..+.+.|...-.-.+...++. +..+ .-+..+|.+++.+.+=-.+|+|..++... T Consensus 403 qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kq 482 (533) T protein:vir:10 403 QLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLKQTDVEMKEIDKQ 482 (533) T ss_pred hhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHH Confidence 444334444445543 357777865554444433333 2222 12334699999988666666654322222 Q ss_pred HHHHHHHHHH-HH-HHhhhccCCCCCCCCCcccCCCCCCcccccccCccccccccc Q lcl|NC_013644. 453 QFDLDWEDVK-EA-LEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGATGSTESQL 506 (510) Q Consensus 453 ~~e~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (510) .+++...... +. .+..+..+.. +.+.++.+.++.-++.....-++--|+ T Consensus 483 I~~E~k~~~~~~p~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~ 533 (533) T protein:vir:10 483 IESEMESGIIADPAAEMDPAMAAG-----DPDAGGAPAEEVAPEGPDPSDERKAEF 533 (533) T ss_pred HHHHHhCCCCCCCcchhhHHhcCC-----CCCcCCcccccCCCCCCCcchhhccCC Confidence 2221110000 00 0000000000 000000000000000000000000111 No 245 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=86.86 E-value=0.043 Score=28.10 Aligned_cols=363 Identities=9% Similarity=0.031 Sum_probs=127.7 Q ss_pred HHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCCceeccCc-H-HHHHHHHHHhc Q lcl|NC_013644. 32 AETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNPVEYETEN-E-ELKEYLAEYYN 109 (510) Q Consensus 32 ~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d-~-~~~~~l~~~~~ 109 (510) |-....+..++..... ....+....... ++.-+...-....|+..++-+.+-|+.+...+ + .....+..+++ T Consensus 1 Mgl~d~~~~~~~~~~~-----~~~~~~~~~~~~-~~~~l~~~~v~~~i~~Ia~~ia~lp~~v~~~~~~~~~~~~~~~lL~ 74 (395) T protein:vir:96 1 MGILDFFSFKKSGTLS-----DDDSGSTTSEKL-TNVVLKEDALYKCVNYLARIISKSTFRIKAPEKLTENQKDWLYWIN 74 (395) T ss_pred CcchhhhcCCCCcccc-----ccccccchhhhc-chhhhhhHHHHHHHHHHHHhhccceeEEEeCCccccccchHHHHHh Confidence 1111111111110000 000000000000 00001122334455666666666676653221 1 11122333332 Q ss_pred ---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEE Q lcl|NC_013644. 110 ---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHH 183 (510) Q Consensus 110 ---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 183 (510) |. .......++...+.+|.||+++..+.... +.+ .+++.. .+... .++.+.. .+ ... T Consensus 75 ~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~~~~~-----~~~--~~~~~~---~~~~~-~~~~v~~-~~-----~~~ 137 (395) T protein:vir:96 75 TKANPNQSASQFWVEVVQKLLVDGETLIFVIPGKGIY-----VAD--AFTQDK---KLSGN-KFKVSRV-QG-----QTY 137 (395) T ss_pred hcCCCCCCHHHHHHHHHHHHhhcCceEEEEEcCCcee-----cCC--cccccc---ccccc-eeeeeee-cc-----cee Confidence 22 23455667788889999998876654211 111 111100 00000 0000000 00 000 Q ss_pred EEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCCcHH---HHHHHHHHH Q lcl|NC_013644. 184 AEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTDLK---PIKALIDDY 260 (510) Q Consensus 184 ~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~~---~v~~liD~~ 260 (510) -..+.+..+.+++.... +. ...+.+.+. .++.+.-++ T Consensus 138 ~~~~~~~dvih~k~~~~-----------------------------------~~-----~~~~~~~~~~~~~~~~~~i~~ 177 (395) T protein:vir:96 138 EKIFTFDQVIYLKNDNS-----------------------------------DL-----MLKVESLWEEYGELLGHVINN 177 (395) T ss_pred eeEeccCceEEecccCC-----------------------------------cc-----ccccccccchHHHHHHHHHHH Confidence 01122333333321100 00 011122222 233332222 Q ss_pred HHHH---HHHHHHHHHhccceeEEecCCCCchhhhhHh-------hhc--CeeeeccCCCceeEEeecCC-H-----HHH Q lcl|NC_013644. 261 DLMN---CFLSNNLQDFAEAIYVVSGFQGDDLSKLRQN-------VKS--KKVVGTGSDGGLDVKTVTIP-T-----EGR 322 (510) Q Consensus 261 n~~~---S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~-------~~~--~~~~~~~~~~~~~~~~~~~~-~-----~~~ 322 (510) ...- .-..+.......+..++.-.+........+. ... .+++.++++.+.+.++.... . ..+ T Consensus 178 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~d~q~~e~~~~ 257 (395) T protein:vir:96 178 QKIANQIRFTMTPPKDKVRERAQENSDGGRQPKSDKDFFKRTIEKIRTESVVGIPVTANTNYEEYGSKNTGSVKSYVDDI 257 (395) T ss_pred HHHHHHHHHHhhhcccccccceeeccCchhhHHHHHHHHHHHHHHhhcCCcceEEccCCceeEecccChhhhhhhhHHHH Confidence 1111 1111222222222233222111111111111 111 22334444444443333211 1 122 Q ss_pred HHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccc-ccee Q lcl|NC_013644. 323 KTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFD-PTEV 401 (510) Q Consensus 323 ~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~-~~~v 401 (510) ........+.|...=++|+.-.. ++-|... . .....+...|.-.++.|...+..+--.... ...+ T Consensus 258 ~~~~~~~~~eIa~~fgVPp~~l~--~~~sn~e--~----------~~~~f~~~~L~P~~~~ie~~l~~~Ll~~~e~~~~~ 323 (395) T protein:vir:96 258 KKLKDQYMAEFAEMLGIPISLLH--GDIADNQ--K----------NYELLLEGPIESLITNIVDGLEYAIFDKSETLEGS 323 (395) T ss_pred HHHHHHHHHHHHHHhCCCHHHhc--CCCccHH--H----------HHHHHHHHHHHHHHHHHHHHHHhhcCChhhhcCce Confidence 22333445667777778775442 2222111 0 112334444444444444444322111111 1134 Q ss_pred eEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCC Q lcl|NC_013644. 402 SFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTD 479 (510) Q Consensus 402 ~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~ 479 (510) .+.|+.-+..|..+.++.+.++..+|+++.-.+++.++. ++++...+ .....+..+ .+ T Consensus 324 ~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~~~~gD~------------~~~~~N~~~--------~~ 383 (395) T protein:vir:96 324 FIKVTGLKNYDLFSISSQADKLISSGFVFIDEVREEIGLPELPDGLGKV------------LYMTKNYES--------VL 383 (395) T ss_pred eEeecchhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCce------------eeeccccee--------ch Confidence 567788889999999999999999999998888777643 22211100 000000000 00 Q ss_pred CcccCCCCCCcc Q lcl|NC_013644. 480 EEETAVNPDDPT 491 (510) Q Consensus 480 ~~~~~~~~~~~~ 491 (510) +...+++++.++ T Consensus 384 ~~gge~~~~~~~ 395 (395) T protein:vir:96 384 ERGGEVDEEVET 395 (395) T ss_pred hccCCCCCCCCC Confidence 000000111010 No 246 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=86.61 E-value=0.044 Score=28.00 Aligned_cols=366 Identities=10% Similarity=0.039 Sum_probs=136.8 Q ss_pred hhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcC Q lcl|NC_013644. 10 KIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSN 89 (510) Q Consensus 10 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~ 89 (510) .-+-++|..+..+. .+. .. ..+.. .+....... ...-+...-....|+..++-+..- T Consensus 1 Mg~~~~~~~~~~~~----~~~----~~-----~~~~~---------~~~~~~~~~-~~~~l~~~~v~~~v~~Ia~~ia~~ 57 (395) T protein:vir:40 1 MGFKSWVSGFFNEE----QRT----LN-----LTDTV---------WCSIPSEKL-KELSIKKWAIDSCANKIANTLSCA 57 (395) T ss_pred CchHHHHHhhhccc----ccc----cc-----cccch---------hhccccccc-hhhhhhhHHHHHHHHHHHHHHhhC Confidence 11112222221110 000 00 00000 000000000 000111223344455555555555 Q ss_pred CceeccCcHHHHHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCCCcee Q lcl|NC_013644. 90 PVEYETENEELKEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYNELQR 163 (510) Q Consensus 90 p~~~~~~d~~~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~~~~~ 163 (510) |+.+--.++.....+..+++ |. .......+....+.+|.||+++..+. +. .+..+........+ T Consensus 58 p~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~~---~~----~~~~~~~~~~~~~~--- 127 (395) T protein:vir:40 58 EVLTYEKGEEVRKKNWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQDEY---IY----VADSFTKNDKSLYE--- 127 (395) T ss_pred ceeeccCCccccchHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEecCc---ee----ecCCcccccccccc--- Confidence 66653333333333434332 32 23444556788889999997664432 11 11111100000000 Q ss_pred EEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCC Q lcl|NC_013644. 164 ICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNN 243 (510) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn 243 (510) ..++.+. .++ ..+-..|.+.. |+||+.+ T Consensus 128 -~~~~~v~-~~~-----~~~~~~~~~~e---------------------------------------------vih~r~~ 155 (395) T protein:vir:40 128 -NTYTEVT-LKD-----LTLKKEFKESE---------------------------------------------VLHLTLN 155 (395) T ss_pred -ceeeeee-ecC-----ceeeeeecccc---------------------------------------------EEEeecC Confidence 0000000 000 00000112222 3344322 Q ss_pred C-CCCCcHHHHHHHHHHHHHHHHHHHHHHHH--hccceeEEecCCCCchh---hhhHhh---------hcCeeeeccCCC Q lcl|NC_013644. 244 K-QETTDLKPIKALIDDYDLMNCFLSNNLQD--FAEAIYVVSGFQGDDLS---KLRQNV---------KSKKVVGTGSDG 308 (510) Q Consensus 244 ~-~g~sd~~~v~~liD~~n~~~S~~~~~~~~--~~~~~lv~~g~~~~~~~---~~~~~~---------~~~~~~~~~~~~ 308 (510) . .+.+... ++...+....+...+.... ...+.+++......+.+ .....+ ..++++.++++. T Consensus 156 ~~~~~~~~~---~l~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vl~~g~ 232 (395) T protein:vir:40 156 NESIKSIID---GFYLLYGDLLTAAVNKYKKLNSRKIIVKLKAMFGQTPEAEEKLRLMLSERMKKFLAEGDSALPVEDGM 232 (395) T ss_pred CCCccccch---hHHHHHHHHHHHHHHHHHhcCCCCceEEEecccCCCHHHHHHHHHHHHHHHHHhhccCCceeecCCCc Confidence 2 1122222 2333333333333333332 23455555433222111 111111 122355566666 Q ss_pred ceeEEeecCCHHHH---HHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 309 GLDVKTVTIPTEGR---KTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVI 385 (510) Q Consensus 309 ~~~~~~~~~~~~~~---~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~ 385 (510) +++.+..+.....+ ..+.+.+.++|...=++|+.-.. ++-|++. ......+...|.-+++.|. T Consensus 233 ~~~~l~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l~--~~~sn~e------------~~~~~f~~~~L~P~~~~ie 298 (395) T protein:vir:40 233 EIDELAGDSKIAESRDIKKMIDDVFEMVANSFNIPLGLAK--GDTVGLS------------EQVNSFLMFSINPIAEMFT 298 (395) T ss_pred eEEeccCChhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhc--CCCcCHH------------HHHHHHHHHHHHHHHHHHH Confidence 55555443322222 22334445677777778775432 2222111 1112344455555555555 Q ss_pred HHHhhccCCc--c-ccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 386 DDINRRYTKA--F-DPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWED 460 (510) Q Consensus 386 ~~~~~~~~~~--~-~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~ 460 (510) ..+..+--.. . ....+++.+..-+-.|..+.++.+.++..+|+++.-.+++.++. ++++...+ T Consensus 299 ~~l~~kLl~~~~~~~g~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~~~~gD~------------ 366 (395) T protein:vir:40 299 DEGNRKFYGRDSVLERTYMKLDTTRIKVQDIQEIASSMDVLFHIGVNTIDDNLRMIGREPVMSPETQE------------ 366 (395) T ss_pred HHHHHhcCChhhhcCCceEEEechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCCce------------ Confidence 5444321111 1 11245566667778899999999999999999999888887653 22211100 Q ss_pred HHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCc Q lcl|NC_013644. 461 VKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGA 498 (510) Q Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (510) .....+..+ .+...+..+.+..+++ ..++ T Consensus 367 ~~~~~n~~~----~~~~~~~~kgge~~~~-----~~~~ 395 (395) T protein:vir:40 367 RFVTKNYAP----LGENEEDLKGGDINEN-----KGDS 395 (395) T ss_pred eeecccccc----ccccccccCCCCCCCC-----cCCC Confidence 000010000 0111111111111111 1111 No 247 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=85.68 E-value=0.051 Score=27.67 Aligned_cols=389 Identities=13% Similarity=0.053 Sum_probs=152.6 Q ss_pred CCCccCCChhhhH------------------------HH--HHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceec Q lcl|NC_013644. 1 MEALLSEDVKIIA------------------------NA--LKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVD 54 (510) Q Consensus 1 ~~~~~~~~~~~~~------------------------~~--i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~ 54 (510) ...+-..+.++=+ .. -+.+|.. |..+-.+++-... T Consensus 20 ~~S~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~---------YR~ma~~pEvd~A---------- 80 (511) T protein:vir:56 20 VRSFSAPDNVDGAKEIHTNLLAPQLGHAIIPSDAQSEGTIPVKELIKS---------YRALAEYHEVDDA---------- 80 (511) T ss_pred cccccCCCCCCCceEEecccccceecceeccccccccCccchHHHHHH---------HHHHhhccchhhH---------- Confidence 0001011110000 00 0123222 2222233322221 Q ss_pred cccccccccccccceeccchhHHHHHHHHhh-hhcCCceeccCc----HHHHH----HHHHHhc-cCHHHHHHHHHHHHH Q lcl|NC_013644. 55 DEGILREDKYASNVRIPHGFFPEIVDQKTQY-LLSNPVEYETEN----EELKE----YLAEYYN-SEFQVVLQELVEGSS 124 (510) Q Consensus 55 ~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~-l~g~p~~~~~~d----~~~~~----~l~~~~~-n~~~~~~~e~~~~~~ 124 (510) ...||+..+-+ -...||.+..++ +...+ ..+.+++ =+|+.+.++..+.+. T Consensus 81 --------------------v~eIvne~iv~d~~~~pV~l~ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WY 140 (511) T protein:vir:56 81 --------------------IQEIVDEAIVYENDKEVVWLNLDNTDFSENIKAKINEEFDRVVSLLQMRKHGYKWFRKWY 140 (511) T ss_pred --------------------HHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhh Confidence 23333332211 234555554433 22222 2233332 267788899999999 Q ss_pred hcCeEEEEEEECCC-CceEEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCcee Q lcl|NC_013644. 125 QKGFEYVYARTNAE-DRLCFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDY 203 (510) Q Consensus 125 ~~G~~~~~v~~d~~-g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~ 203 (510) +.|+.|.+.-+|++ |-..+..+||+.+-+|..--.+...-..+. ..+..+-+|.+.....+. +. T Consensus 141 VDgRi~fHkiid~k~GI~eLr~lDPr~i~~vr~i~~~~~~~~~v~----------~~~~ey~~Y~~~~~~~~~-----~~ 205 (511) T protein:vir:56 141 VDSRIYFHKILDKDNNIIELRPLNPMKMELVREIQKETIDGVEVV----------KGTLEYYVYKQSDYKMPS-----WM 205 (511) T ss_pred hcceEEEEEEeccccceeehhhcCcccchhhhhhhcccccccccc----------cceeeeeEecCCCcccCc-----cc Confidence 99999998777754 656788899998876653211111000000 001112233332211000 00 Q ss_pred ecccccccccccccccccccccccccccCCccc---EEEe--------cCCCCCCCcHHHHHHHHHHHHH--HHHHHHHH Q lcl|NC_013644. 204 ELDEAEPINPRPHVLAVDSENESLLQRSYGQIP---FYRL--------SNNKQETTDLKPIKALIDDYDL--MNCFLSNN 270 (510) Q Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~--------~nn~~g~sd~~~v~~liD~~n~--~~S~~~~~ 270 (510) .. +.. .+.-=+|| |++. .++....|-+. ..|..+|. ++-|.+-. T Consensus 206 ~~-------~~~-------------~~~~vkI~~daI~y~hSGL~d~~~~~g~i~syLh---kAiKp~NQLkm~EDAlVI 262 (511) T protein:vir:56 206 SA-------TNR-------------AQTSFRIPKDAIVFAHSGLMRGCADDPYIIGYLD---RAIKPANQLKMLEDALVI 262 (511) T ss_pred cc-------ccc-------------cccceeechhheeeecccceeccCCCCeeeccch---hhhHHHHhhHHHHhhHHH Confidence 00 000 00001122 1111 11112233333 33444443 34444444 Q ss_pred HHHhccceeEE----ec-CCCCchhh----hhHhhhcCeeee--------------------c---cCCCceeEEeec-- Q lcl|NC_013644. 271 LQDFAEAIYVV----SG-FQGDDLSK----LRQNVKSKKVVG--------------------T---GSDGGLDVKTVT-- 316 (510) Q Consensus 271 ~~~~~~~~lv~----~g-~~~~~~~~----~~~~~~~~~~~~--------------------~---~~~~~~~~~~~~-- 316 (510) .+..+.|=.=+ .| +...-... .+...+...+.. + +++.+.+.-|.+ T Consensus 263 YRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGg 342 (511) T protein:vir:56 263 YRLARAPERRVFYVDVGNLPTQKAQQYVNGIMQNVKNRVVYDTQTGQVKNTTNAMSMLEDYYLPRREGSKGTEVSTLPGG 342 (511) T ss_pred HhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchhhhhhHhhhcccccCCCCccceeecccc Confidence 44444432211 11 11111111 111111111111 1 122223333333 Q ss_pred CCHHHHHHHHHHHHHHHHHHhCCccccc--c----c--cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 317 IPTEGRKTKMEIDKENIYKFGMAFDSTQ--V----G--DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDI 388 (510) Q Consensus 317 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~--~----~--~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~ 388 (510) .+. ....-+.-+++.+|+.-.+|-.-. + + +|..| +|.......-.-+.+-+..|...+.++++.=+-+- T Consensus 343 qnl-gem~DV~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~--EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilK 419 (511) T protein:vir:56 343 QSL-GDIEDVLYFNRKLYKAMRIPTSRAASEDQTGGINFGQGA--EITRDELKFTKFVKRLQTKFETVITDPLKHQLIVN 419 (511) T ss_pred CCc-ChHHHHHHHHHHHHHHhCCCcccccCCCCccccccccch--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Confidence 222 223445556677777667774311 1 1 12222 23233333344456666677777777666544333 Q ss_pred hhccCCcccc--ceeeEEeCCCCCCCHHHHHHHH-------HHHHh--cCCCchHHHHHhCCCCCcHHHHHHHHHHHHHH Q lcl|NC_013644. 389 NRRYTKAFDP--TEVSFTFTREVMVNETDIVNDE-------KTEAE--TRKIILESILQVAPRLDDDNVLRLICEQFDLD 457 (510) Q Consensus 389 ~~~~~~~~~~--~~v~i~f~~~~p~d~~e~~~~~-------~~~~~--~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~ 457 (510) ++....+|+. ..+.+.|...-.-.+...++.+ ..+.. +..+|.+++.+.+=-.+|+|..++....+++ T Consensus 420 giit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~~yi~k~ILr~tDeei~~~~k~I~~E- 498 (511) T protein:vir:56 420 NIITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGKYYSHKYIQKNILRLSDDQITAMQSEIDEE- 498 (511) T ss_pred cCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhccccchHHHHHHHhccCHHHHHHHHHHHHHh- Confidence 4444445543 3577788655554444433332 22211 3346999999876556665533222111111 Q ss_pred HHHHHHHHHhhhccCCCCCCCCC Q lcl|NC_013644. 458 WEDVKEALEEAEYTKGLSDNTDE 480 (510) Q Consensus 458 ~~~~~~~~~~~~~~~~~~~~~~~ 480 (510) .+..-++.. + .+. T Consensus 499 -------~k~~~~~~~-e--~~f 511 (511) T protein:vir:56 499 -------ETNPRFQQD-D--QGF 511 (511) T ss_pred -------hcCCCCCCc-c--cCC Confidence 111111110 0 011 No 248 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=82.81 E-value=0.074 Score=26.79 Aligned_cols=334 Identities=8% Similarity=0.006 Sum_probs=129.4 Q ss_pred HHHHHHHhccCCcchhcccceeccccccccccccccceec--cchhHHHHHHHHhhhhcCCcee-cc-Cc--------HH Q lcl|NC_013644. 32 AETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIP--HGFFPEIVDQKTQYLLSNPVEY-ET-EN--------EE 99 (510) Q Consensus 32 ~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~--~n~~~~Iv~~~~~~l~g~p~~~-~~-~d--------~~ 99 (510) |-.+.+.+.-.......... .. ....+..+. .......|+..++-+..-|+.+ .. .+ +. T Consensus 1 Mg~f~~~~~~~~~~~~~~~~------~~---~~~~~~~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~~~~~~~~ 71 (378) T protein:vir:16 1 MNLFGKVVSFSRGKLNNDTQ------RV---TAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISM 71 (378) T ss_pred CccchhhhhhhcccccCCcc------ee---eecccchhhHHHHHHHHHHHHHHhhhhhCceeEEEEccccccccccccc Confidence 11111111000000000000 00 000000111 1223344555555555556643 11 00 11 Q ss_pred HHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECC-CCceEEEEEcccceEEEEcCCCCceeEEEEEEEEE Q lcl|NC_013644. 100 LKEYLAEYYN---S---EFQVVLQELVEGSSQKGFEYVYARTNA-EDRLCFQVADSLNVFGVYNEYNELQRICRHYITEI 172 (510) Q Consensus 100 ~~~~l~~~~~---n---~~~~~~~e~~~~~~~~G~~~~~v~~d~-~g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~ 172 (510) ....+..+++ | ........+....+.+|.+|++..+|. .|++. .+ +| + T Consensus 72 ~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~g~~~--~l-----~~--~---------------- 126 (378) T protein:vir:16 72 AGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELL--DL-----LF--A---------------- 126 (378) T ss_pred ccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEE--EE-----Ee--c---------------- Confidence 1223444442 2 223445567788889999998654432 22221 00 00 0 Q ss_pred eeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCCcHHH Q lcl|NC_013644. 173 EKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTDLKP 252 (510) Q Consensus 173 ~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~ 252 (510) ++. . .|..+.+ +|+++.-.+...... T Consensus 127 --~~~---~----~~~~~di---------------------------------------------ih~r~~~~~~~~~s~ 152 (378) T protein:vir:16 127 --DDK---K----EYKPEEL---------------------------------------------VRLTSPFYINEDTSI 152 (378) T ss_pred --CCe---e----Eecccce---------------------------------------------EEecCccCccchhHH Confidence 000 0 0111222 233221111122233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccceeEEecC-CCCch--h----hhhHhhh-------cCeeeeccCCCceeEEeecCC Q lcl|NC_013644. 253 IKALIDDYDLMNCFLSNNLQDFAEAIYVVSGF-QGDDL--S----KLRQNVK-------SKKVVGTGSDGGLDVKTVTIP 318 (510) Q Consensus 253 v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~-~~~~~--~----~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~ 318 (510) +..+.++++..++. +.+-.+++.. ...+. . .+....+ .++++.++++.+++.++.+.. T Consensus 153 l~~~~~~i~~~~~~--------~~~~g~l~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~ 224 (378) T protein:vir:16 153 LDNALASIQTKLEQ--------GKLRGLLKINAFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYS 224 (378) T ss_pred HHHHHHHHHHHHhc--------CccceeeEeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChh Confidence 44444554433321 2222222221 11111 1 1222111 234566776666665554433 Q ss_pred HHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC----- Q lcl|NC_013644. 319 TEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYT----- 393 (510) Q Consensus 319 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~----- 393 (510) ...+ ..++.+.+.|+..-++|+.-..+ +.+... ....+...|.-.++.|...+..+-- T Consensus 225 ~~~~-~~~~~~~~~Ia~~fgVPp~~l~g--~~~e~~--------------~~~f~~~tl~P~~~~ie~~l~~kLl~~~e~ 287 (378) T protein:vir:16 225 VLNK-DEIDLIKSELLTGYFMNENILLG--TASQEQ--------------QIYFYNSTIIPLLIQLEKELTYKLISTNRR 287 (378) T ss_pred hhhH-HHHHHHHHHHHHHhCCCHHHhcC--CchHHH--------------HHHHHHHHHHHHHHHHHHHHHhhcCChhhh Confidence 3333 45567778888888888743321 111111 1123444555555555554432211 Q ss_pred ----CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_013644. 394 ----KAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALEE 467 (510) Q Consensus 394 ----~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~~ 467 (510) .......+.+.++.-+-.|..+.++.+.+++.+|+++.-.++++++. +++-+.. ....+. T Consensus 288 ~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~--------------~~~~n~ 353 (378) T protein:vir:16 288 RVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDVY--------------IANLNA 353 (378) T ss_pred hhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeE--------------eecccc Confidence 11112235555667778899999999999999999999888887643 2110000 000000 Q ss_pred hhccCCCCCCCCCcccCCCCCCccccc Q lcl|NC_013644. 468 AEYTKGLSDNTDEEETAVNPDDPTQQM 494 (510) Q Consensus 468 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (510) .+.... .........+ .+++++..+ T Consensus 354 ~~~~~~-~~~~~~~~~~-~~~~e~~ne 378 (378) T protein:vir:16 354 VAVKNL-SDLQGSRKDV-TSTDETNNQ 378 (378) T ss_pred ccccch-hhhcCccCCC-CCCCCCCCC Confidence 000000 0000000000 111111111 No 249 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=82.08 E-value=0.08 Score=26.60 Aligned_cols=389 Identities=11% Similarity=0.052 Sum_probs=152.9 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |+.....+... +.-..+|.. |..+-.+++-+.. ...||+ T Consensus 54 ~~~~~d~~~~~--~~~~~LI~~---------YR~ma~~pEvd~A------------------------------v~eIvn 92 (516) T protein:vir:10 54 MQQFFGIDNNI--SGTKDLINT---------YRQLTNNPEVERA------------------------------VANIVN 92 (516) T ss_pred eeeeecccCcc--ccHHHHHHH---------HHHhhhccchhHH------------------------------HHHhhc Confidence 33333222211 112333332 2333333333322 233333 Q ss_pred HHHhh-hhcCCceeccCcH----HHHHHH----HHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEEC--CCCceEEEEEcc Q lcl|NC_013644. 81 QKTQY-LLSNPVEYETENE----ELKEYL----AEYYN-SEFQVVLQELVEGSSQKGFEYVYARTN--AEDRLCFQVADS 148 (510) Q Consensus 81 ~~~~~-l~g~p~~~~~~d~----~~~~~l----~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d--~~g~~~i~~~~p 148 (510) ..+-+ -...||.+..++- ...+.| +.+++ =+|+.+.++..+.+.+.|+.|++...| .+|-..+..+|| T Consensus 93 eaiv~d~~~~pV~l~l~~~e~s~sik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~~k~GI~elr~lDP 172 (516) T protein:vir:10 93 EAVVYEKGHKVVSLDLDDTEFSSSIKDKILEEFDEICRLLDASRKLDTLFRRWYIDSRIFFHKIMPNPKEGIVELRRLDP 172 (516) T ss_pred ceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhhhhcceEEEEEEecCcccceeeeeeeCC Confidence 32211 2345555554432 222222 22332 267788899999999999999986666 346678899999 Q ss_pred cceEEEEcCCCCceeEEEEEEEEEeeCCcee--EEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccc Q lcl|NC_013644. 149 LNVFGVYNEYNELQRICRHYITEIEKDGETV--DIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENES 226 (510) Q Consensus 149 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~--~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (510) +.+..+.--.. ...++... ....+-+|+.+.. .|... ++... + T Consensus 173 r~i~~vR~i~~------------~~~~~~~v~~~~~e~~~Y~~~~~-~~~~~-g~~~~--------~------------- 217 (516) T protein:vir:10 173 RHVEYYREIVT------------SDVGGTSVVKGYREFFVYTTGNE-GYAYN-GRLFE--------P------------- 217 (516) T ss_pred cceeeEEeeec------------ccCcchhhhhceeeeeeeecCcc-ceecc-ccccC--------C------------- Confidence 98877542100 00111100 0111222332221 11111 11100 0 Q ss_pred cccccCCccc---EEEecCCC---CCCCcHHHHHHHHHHHHH--HHHHHHHHHHHhccceeEE----ec-CCCCchhhhh Q lcl|NC_013644. 227 LLQRSYGQIP---FYRLSNNK---QETTDLKPIKALIDDYDL--MNCFLSNNLQDFAEAIYVV----SG-FQGDDLSKLR 293 (510) Q Consensus 227 ~~~~~~g~iP---vv~~~nn~---~g~sd~~~v~~liD~~n~--~~S~~~~~~~~~~~~~lv~----~g-~~~~~~~~~~ 293 (510) +.-=+|| |++...+- .+-..+.-+...|..+|. ++-|.+-.-+..+.|=.=+ .| +...-..... T Consensus 218 ---~~~ikI~~daI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl 294 (516) T protein:vir:10 218 ---NTRIKIPRSAIVYAHSGLQDCSDRGIVGYLHNAVKPANQLKLLEDALVIYRITRAPERRVFYIDVGNMPNRKATEYV 294 (516) T ss_pred ---CCceecchhheeeeecCcccCCCCceeceehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHH Confidence 0000222 11111110 011112223344555553 3444444444444432211 11 1111111111 Q ss_pred H----hhhcCeeee--------------------c---cCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCccc-- Q lcl|NC_013644. 294 Q----NVKSKKVVG--------------------T---GSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDS-- 342 (510) Q Consensus 294 ~----~~~~~~~~~--------------------~---~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~-- 342 (510) + ..+...+.. + +++.+.+.-|.+ .+. ....-+.=+++.+|+.-.+|-. T Consensus 295 ~~iM~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnl-gem~DV~YF~kkLy~aLnVP~SRl 373 (516) T protein:vir:10 295 NGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVTSLPGAQTM-GEMDDVRWFNKKLYEALRIPLSRM 373 (516) T ss_pred HHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCc-ChHHHHHHHHHHHHHHhCCCcccc Confidence 1 111111111 1 122223333333 222 2234455666777776677753 Q ss_pred ccccc-----CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccc--eeeEEeCCCCCCCHHH Q lcl|NC_013644. 343 TQVGD-----GNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPT--EVSFTFTREVMVNETD 415 (510) Q Consensus 343 ~~~~~-----g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~--~v~i~f~~~~p~d~~e 415 (510) ..++. |..|. |..-....-.-+.+-+..|...+.++++.=+-+-++....+|+.. .+.+.|...-.-.+.. T Consensus 374 ~~e~~~~~~~Gr~~E--ItRDEiKF~KFI~rLR~rFs~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElK 451 (516) T protein:vir:10 374 PRDDGGMVIGGQDMA--ITRDELDFRKFIVQLQHNFEEIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELK 451 (516) T ss_pred cCCCCceeeccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHH Confidence 21211 22222 222222223334445555555555555443322233333444432 5677776555444443 Q ss_pred HHHH-------HHHHH--hcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCC Q lcl|NC_013644. 416 IVND-------EKTEA--ETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDE 480 (510) Q Consensus 416 ~~~~-------~~~~~--~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (510) .++. +..+. -++.+|.+++.+.+=-.+|+|..++....++ ..+. +--.+...+.+. T Consensus 452 e~Eil~~Rl~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~--------E~~~-~~~~~p~~e~~f 516 (516) T protein:vir:10 452 DIETLRQRVDALSQIEPYVGKYVSHDYVMKNILQMTDEQIAQEEKQIEK--------EANV-KRFQNPENEDDF 516 (516) T ss_pred HHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhHHHHHHHHHHH--------hhhC-CCCCCCCccccC Confidence 3332 33222 3457899999998666666553221111111 0100 000111111111 No 250 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=81.07 E-value=0.089 Score=26.34 Aligned_cols=389 Identities=10% Similarity=0.031 Sum_probs=155.2 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |+.....+... +.-..+|+. |+.+..+++-... ...||+ T Consensus 54 ~~~~~~~~~~~--~~~~eLI~~---------YR~ma~~pEvd~A------------------------------v~eIVn 92 (516) T protein:vir:10 54 MQQFFGIDNNI--SGTKDLINT---------YRQLINNPEVERA------------------------------VANIVN 92 (516) T ss_pred eeeeecccccc--chHHHHHHH---------HHHHhhccchhhH------------------------------HHHhhc Confidence 22222222211 111233332 3333333333322 233333 Q ss_pred HHHhh-hhcCCceeccCcH----HHHHHH----HHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEEC--CCCceEEEEEcc Q lcl|NC_013644. 81 QKTQY-LLSNPVEYETENE----ELKEYL----AEYYN-SEFQVVLQELVEGSSQKGFEYVYARTN--AEDRLCFQVADS 148 (510) Q Consensus 81 ~~~~~-l~g~p~~~~~~d~----~~~~~l----~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d--~~g~~~i~~~~p 148 (510) ..+-+ -...||.+..++- ...+.| +.+++ =+|+.+.++..+.+.+.|+.|++...| .+|-..+..+|| T Consensus 93 eaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~~k~GI~Elr~lDP 172 (516) T protein:vir:10 93 EAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMPNPKKGIAELRRLDP 172 (516) T ss_pred ceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCccccceeeeeeCC Confidence 32211 2345555544432 222222 33332 367888899999999999999986666 346678899999 Q ss_pred cceEEEEcCCCCceeEEEEEEEEEeeCCceeE--EEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccc Q lcl|NC_013644. 149 LNVFGVYNEYNELQRICRHYITEIEKDGETVD--IHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENES 226 (510) Q Consensus 149 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~--~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (510) +.+..+.--. ... .++.... +..+-+|+++.. +|... ++....... T Consensus 173 r~i~~vR~i~-----------~~~-~~~~~v~~~~~e~~~Y~~~~~-~~~~~-g~~~~~~~~------------------ 220 (516) T protein:vir:10 173 RFMEYYREIV-----------TSD-IGGTTIVKGYREFFIYTTGNE-GYSYN-GRIFEPNTR------------------ 220 (516) T ss_pred cceeeEeeec-----------ccc-cccchhhhhhhheeeeccCcc-ccccc-cceeCCCcc------------------ Confidence 9887654210 001 1111000 111223333321 22111 111100000 Q ss_pred cccccCCccc---EEEecCC---CCCCCcHHHHHHHHHHHHH--HHHHHHHHHHHhccceeEE----ec-CCCCchhhh- Q lcl|NC_013644. 227 LLQRSYGQIP---FYRLSNN---KQETTDLKPIKALIDDYDL--MNCFLSNNLQDFAEAIYVV----SG-FQGDDLSKL- 292 (510) Q Consensus 227 ~~~~~~g~iP---vv~~~nn---~~g~sd~~~v~~liD~~n~--~~S~~~~~~~~~~~~~lv~----~g-~~~~~~~~~- 292 (510) =+|| |++.... ..+-..+.-+...|..+|. ++-|.+-.-+..+.|=.=+ .| +...-.... T Consensus 221 ------ikI~~dAI~y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl 294 (516) T protein:vir:10 221 ------IKIPRSAVVYASSGLMDCSDRGIIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYV 294 (516) T ss_pred ------eeechhheeeecccceeCCCCceeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHH Confidence 0122 1111100 0011112223344444443 3444444444444432211 11 111111111 Q ss_pred --------------------------hHhhhcCeeeeccCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCccc-- Q lcl|NC_013644. 293 --------------------------RQNVKSKKVVGTGSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDS-- 342 (510) Q Consensus 293 --------------------------~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~-- 342 (510) +..+.++.+--=+++.+.+.-|.+ .+. ....-+.=+++.+|+.-.+|-. T Consensus 295 ~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnl-gem~DV~YF~kkLy~aLnVP~sRl 373 (516) T protein:vir:10 295 NGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTM-GDMDDVRWFNKKLYEALRIPLSRI 373 (516) T ss_pred HHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCc-ChHHHHHHHHHHHHHHhCCCcccc Confidence 111111111001122223333333 222 2234455566777776677743 Q ss_pred ccccc-----CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCcccc--ceeeEEeCCCCCCCHHH Q lcl|NC_013644. 343 TQVGD-----GNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDP--TEVSFTFTREVMVNETD 415 (510) Q Consensus 343 ~~~~~-----g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~--~~v~i~f~~~~p~d~~e 415 (510) ..++. |..|.+. .-....-.-+.+-+..|...+.++++.=+-+-++....+|+. ..+.+.|...-.-.+.. T Consensus 374 ~~e~~~~~~~Gr~~EIt--RDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElK 451 (516) T protein:vir:10 374 PRDDGGMVIGGQDTAIT--RDELDFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELK 451 (516) T ss_pred cCCCCceeeccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHH Confidence 21111 2333222 222223334555566666666666655443334444445543 35777786555544443 Q ss_pred HHHH-------HHHHH--hcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCC Q lcl|NC_013644. 416 IVND-------EKTEA--ETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDE 480 (510) Q Consensus 416 ~~~~-------~~~~~--~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (510) .++. +..+. -++.+|.+++.+.+=-.+|+|..++....++ ..+..-+ ...+.+.+. T Consensus 452 e~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~e~k~I~~--------E~~~~~~-~~p~~~~~f 516 (516) T protein:vir:10 452 DIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMTEEQIAQEEKQIEQ--------EAGIKRF-QNPENEDDF 516 (516) T ss_pred HHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhhHHHHHHHHHH--------hhhCCCC-CCCCccccC Confidence 3332 33222 3457899999998666666543221111111 1100100 111111111 No 251 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=81.07 E-value=0.089 Score=26.34 Aligned_cols=389 Identities=10% Similarity=0.031 Sum_probs=155.2 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |+.....+... +.-..+|+. |+.+..+++-... ...||+ T Consensus 54 ~~~~~~~~~~~--~~~~eLI~~---------YR~ma~~pEvd~A------------------------------v~eIVn 92 (516) T protein:vir:10 54 MQQFFGIDNNI--SGTKDLINT---------YRQLINNPEVERA------------------------------VANIVN 92 (516) T ss_pred eeeeecccccc--chHHHHHHH---------HHHHhhccchhhH------------------------------HHHhhc Confidence 22222222211 111233332 3333333333322 233333 Q ss_pred HHHhh-hhcCCceeccCcH----HHHHHH----HHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEEC--CCCceEEEEEcc Q lcl|NC_013644. 81 QKTQY-LLSNPVEYETENE----ELKEYL----AEYYN-SEFQVVLQELVEGSSQKGFEYVYARTN--AEDRLCFQVADS 148 (510) Q Consensus 81 ~~~~~-l~g~p~~~~~~d~----~~~~~l----~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d--~~g~~~i~~~~p 148 (510) ..+-+ -...||.+..++- ...+.| +.+++ =+|+.+.++..+.+.+.|+.|++...| .+|-..+..+|| T Consensus 93 eaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~~k~GI~Elr~lDP 172 (516) T protein:vir:10 93 EAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMPNPKKGIAELRRLDP 172 (516) T ss_pred ceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCccccceeeeeeCC Confidence 32211 2345555544432 222222 33332 367888899999999999999986666 346678899999 Q ss_pred cceEEEEcCCCCceeEEEEEEEEEeeCCceeE--EEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccc Q lcl|NC_013644. 149 LNVFGVYNEYNELQRICRHYITEIEKDGETVD--IHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENES 226 (510) Q Consensus 149 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~--~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (510) +.+..+.--. ... .++.... +..+-+|+++.. +|... ++....... T Consensus 173 r~i~~vR~i~-----------~~~-~~~~~v~~~~~e~~~Y~~~~~-~~~~~-g~~~~~~~~------------------ 220 (516) T protein:vir:10 173 RFMEYYREIV-----------TSD-IGGTTIVKGYREFFIYTTGNE-GYSYN-GRIFEPNTR------------------ 220 (516) T ss_pred cceeeEeeec-----------ccc-cccchhhhhhhheeeeccCcc-ccccc-cceeCCCcc------------------ Confidence 9887654210 001 1111000 111223333321 22111 111100000 Q ss_pred cccccCCccc---EEEecCC---CCCCCcHHHHHHHHHHHHH--HHHHHHHHHHHhccceeEE----ec-CCCCchhhh- Q lcl|NC_013644. 227 LLQRSYGQIP---FYRLSNN---KQETTDLKPIKALIDDYDL--MNCFLSNNLQDFAEAIYVV----SG-FQGDDLSKL- 292 (510) Q Consensus 227 ~~~~~~g~iP---vv~~~nn---~~g~sd~~~v~~liD~~n~--~~S~~~~~~~~~~~~~lv~----~g-~~~~~~~~~- 292 (510) =+|| |++.... ..+-..+.-+...|..+|. ++-|.+-.-+..+.|=.=+ .| +...-.... T Consensus 221 ------ikI~~dAI~y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl 294 (516) T protein:vir:10 221 ------IKIPRSAVVYASSGLMDCSDRGIIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYV 294 (516) T ss_pred ------eeechhheeeecccceeCCCCceeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHH Confidence 0122 1111100 0011112223344444443 3444444444444432211 11 111111111 Q ss_pred --------------------------hHhhhcCeeeeccCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCccc-- Q lcl|NC_013644. 293 --------------------------RQNVKSKKVVGTGSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDS-- 342 (510) Q Consensus 293 --------------------------~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~-- 342 (510) +..+.++.+--=+++.+.+.-|.+ .+. ....-+.=+++.+|+.-.+|-. T Consensus 295 ~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnl-gem~DV~YF~kkLy~aLnVP~sRl 373 (516) T protein:vir:10 295 NGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTM-GDMDDVRWFNKKLYEALRIPLSRI 373 (516) T ss_pred HHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCc-ChHHHHHHHHHHHHHHhCCCcccc Confidence 111111111001122223333333 222 2234455566777776677743 Q ss_pred ccccc-----CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCcccc--ceeeEEeCCCCCCCHHH Q lcl|NC_013644. 343 TQVGD-----GNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDP--TEVSFTFTREVMVNETD 415 (510) Q Consensus 343 ~~~~~-----g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~--~~v~i~f~~~~p~d~~e 415 (510) ..++. |..|.+. .-....-.-+.+-+..|...+.++++.=+-+-++....+|+. ..+.+.|...-.-.+.. T Consensus 374 ~~e~~~~~~~Gr~~EIt--RDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElK 451 (516) T protein:vir:10 374 PRDDGGMVIGGQDTAIT--RDELDFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELK 451 (516) T ss_pred cCCCCceeeccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHH Confidence 21111 2333222 222223334555566666666666655443334444445543 35777786555544443 Q ss_pred HHHH-------HHHHH--hcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCC Q lcl|NC_013644. 416 IVND-------EKTEA--ETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDE 480 (510) Q Consensus 416 ~~~~-------~~~~~--~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (510) .++. +..+. -++.+|.+++.+.+=-.+|+|..++....++ ..+..-+ ...+.+.+. T Consensus 452 e~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~e~k~I~~--------E~~~~~~-~~p~~~~~f 516 (516) T protein:vir:10 452 DIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMTEEQIAQEEKQIEQ--------EAGIKRF-QNPENEDDF 516 (516) T ss_pred HHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhhHHHHHHHHHH--------hhhCCCC-CCCCccccC Confidence 3332 33222 3457899999998666666543221111111 1100100 111111111 No 252 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=77.76 E-value=0.12 Score=25.61 Aligned_cols=426 Identities=8% Similarity=-0.026 Sum_probs=163.0 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) |-....-..-+++..+...... -.+. ..-.||+..-.++.... .-...+. .. ......-.+. T Consensus 1 ~~~~~~~~~gl~p~rl~~i~~~---~~~~----~~~~~~~~~~~~Lr~~~-----~~~ly~~-----m~-~D~hi~s~l~ 62 (488) T protein:vir:95 1 MADITETQESLPPFRMGEVGSL---GLKV----KNGRIYEEPRQALRFPE-----SIKTFQL-----MM-RDPAVAASVN 62 (488) T ss_pred CCCccccCCCCCHHHHHHHHHH---hhcc----ccchhhccchhhhcccc-----hHHHHHH-----Hh-hChHHHHHHH Confidence 6655555555555433222110 0000 01122322112221100 0000000 01 1345666677 Q ss_pred HHHhhhhcCCceeccCc-----H---HHHHHHHHHhcc---CHHHHHHHHHHHHHhcCe-EEEEEEECCCCceEE---EE Q lcl|NC_013644. 81 QKTQYLLSNPVEYETEN-----E---ELKEYLAEYYNS---EFQVVLQELVEGSSQKGF-EYVYARTNAEDRLCF---QV 145 (510) Q Consensus 81 ~~~~~l~g~p~~~~~~d-----~---~~~~~l~~~~~n---~~~~~~~e~~~~~~~~G~-~~~~v~~d~~g~~~i---~~ 145 (510) +....+.|.+..+.+.+ . ...++++.++++ ++.+.+..+ .++..+|. +++++|....+.... .+ T Consensus 63 ~Rk~av~~~~w~v~p~~~~~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~-lda~~~G~s~~Eivw~~~~~~~~~~~~~~ 141 (488) T protein:vir:95 63 IIKMFVRKVNWRFVPPKGKEQDPKMLERADFFNSLMDDMEHDWADFINSV-MSFCTYGFCVNEKVYKKRQGKKGKYQSKF 141 (488) T ss_pred HHHHHHhcCCceEecCCCCchhHHHHHHHHHHHHHHhccCccHHHHHHHH-HHhhcccceeeeeeeeccccccccccccc Confidence 77777888887875421 1 234567777642 355666665 46888886 456677543221111 00 Q ss_pred Ec----ccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccccccc Q lcl|NC_013644. 146 AD----SLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVD 221 (510) Q Consensus 146 ~~----p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 221 (510) .+ |..+.+. +...++.+. .+.++.. +.. +........ ...+.... . T Consensus 142 ~dg~~~~~~i~~R------pq~~~~~f~-~d~d~~l---~~~----~~~~~~~~~-------------~~~~~~~~---~ 191 (488) T protein:vir:95 142 DDGLIGWAKLPIR------NQSTLDKWY-FDEDFRR---VTG----VRQNLRNVS-------------HIAGAINL---G 191 (488) T ss_pred cCCeeeeeeeeec------Cccccccee-eccCCCc---eee----ccccccccc-------------cccccccc---c Confidence 00 1111110 000000000 0000000 000 000000000 00000000 0 Q ss_pred ccccccccccCCccc---EEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCC---Cch- Q lcl|NC_013644. 222 SENESLLQRSYGQIP---FYRLS-----NNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQG---DDL- 289 (510) Q Consensus 222 ~~~~~~~~~~~g~iP---vv~~~-----nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~---~~~- 289 (510) .. ..--.|| +|.++ .++.|.|.+..+--..--=+..+..++..++.+..|+.+.+|... .+. T Consensus 192 ~~------~~~~~lP~~kfi~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~ 265 (488) T protein:vir:95 192 ER------PLTRKLPRAKFMLFKYDDEYGNPEGRSPLLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAE 265 (488) T ss_pred cc------cccccccccceEEEeecCCCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeeccCCCCCccc Confidence 00 0001133 12222 345677777665433333345667788888888888888777321 111 Q ss_pred hh---hhHhhh-------c--CeeeeccCCCceeE---------Eee-cCCHHHHHHHHHHHHHHHHHHh--CCcccccc Q lcl|NC_013644. 290 SK---LRQNVK-------S--KKVVGTGSDGGLDV---------KTV-TIPTEGRKTKMEIDKENIYKFG--MAFDSTQV 345 (510) Q Consensus 290 ~~---~~~~~~-------~--~~~~~~~~~~~~~~---------~~~-~~~~~~~~~~~~~l~~~i~~~s--~~p~~~~~ 345 (510) .+ ....+. . ...+.++.+-++++ +.. ......+...++.+.+.|...- +|.....+ T Consensus 266 ~e~~~l~~a~~~i~~~~~~~~~ag~iiP~g~~~~~k~~~~e~~l~~~~~~~~~~~~~li~~~d~~Isk~iLGqtLT~~~~ 345 (488) T protein:vir:95 266 PEKKAFVQYCKTVVNDMIANDRAGLIWPRYIDPDTKEDIFEFSLVSRQGAKAYDTGSIIDRYSKQIMMAFMSDVLAMGQS 345 (488) T ss_pred HHHHHHHHHHHHHHHHhhccchhheeeccccccccchhhhhhhccccccCCchhHHHHHHHHHHHHHHHHhccccccccC Confidence 11 111111 0 01223444433333 111 1233457778888888887653 33333222 Q ss_pred -ccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHH Q lcl|NC_013644. 346 -GDGNITNIVIKARYTLLNMKANKTEARLRALLE-WMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTE 423 (510) Q Consensus 346 -~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~-~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~ 423 (510) +++.+.|..-.- -....+..-.+.+...|. +++.-++.+ +. +. ...-+.++|...-+.|.++.++.+.++ T Consensus 346 ~~Gs~Al~~vh~e---v~~~i~~aDa~~i~~tln~~li~~l~~~-Nf--g~--~~~~P~~~~~~~e~~Dl~~~ae~~~~L 417 (488) T protein:vir:95 346 KYGSFSLADSKTS---LLAMSVDILLKQIKNVINRDLVAQTYAL-NM--WD--DEEHVQITYDDIETPDLEAIGSYIQKT 417 (488) T ss_pred cchhhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHh-cC--CC--CCCccEEEecCcChhhHHHHHHHHHHH Confidence 222222222111 112222333344445553 455544432 21 11 122357889888999999999999999 Q ss_pred HhcCCC-ch----HHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCcccccccCc Q lcl|NC_013644. 424 AETRKI-IL----ESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQMAEGA 498 (510) Q Consensus 424 ~~~g~i-S~----et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (510) +..|+. +. +.+.+.++.-...+.+ ..... .+.......+......+.....+..+++... T Consensus 418 ~~~G~~i~~~~~~~~i~e~~gip~~~~~e------------~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 482 (488) T protein:vir:95 418 VAVGALEVDKELSNKLREHIGLPPADESQ------------PVSEK---LSPNSQSRSGDGYKTAGEGTAKTPSAKDPST 482 (488) T ss_pred HhCCCccccHHHHHHHHHHhCCCCCCCCc------------ccccc---CCCCCCCCCCcccCCCcccCCcccccccchh Confidence 999974 42 3445555432111000 00000 0000000000000000000000000011111 Q ss_pred cccccc Q lcl|NC_013644. 499 TGSTES 504 (510) Q Consensus 499 ~~~~~~ 504 (510) ...+.+ T Consensus 483 a~~~~~ 488 (488) T protein:vir:95 483 ANKANK 488 (488) T ss_pred hhhccC Confidence 111111 No 253 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=74.72 E-value=0.15 Score=25.02 Aligned_cols=346 Identities=8% Similarity=-0.015 Sum_probs=133.1 Q ss_pred hhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccc-eeccccccccccccccceeccchhHHHHHHHHhhhh Q lcl|NC_013644. 9 VKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIF-YVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLL 87 (510) Q Consensus 9 ~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~-~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~ 87 (510) +-. +..+ +.+... .....+......+ ...-+...-....|+..++-+. T Consensus 1 Mg~----f~~l--------------------------~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~v~~~i~~Ia~~ia 49 (376) T protein:vir:78 1 MGF----FSEL--------------------------FKRNKEIEWMWDLDFLEDKT-TKVYLKKMALNTCVKHIARTIA 49 (376) T ss_pred Cch----hhhh--------------------------hccCCccccccchhhccccc-hhhhhhhHHHHHHHHHHHHhhc Confidence 000 0000 000000 0000000000000 0000112334455666666666 Q ss_pred cCCceeccCcHHHHHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCceE-EEEEcccceEEEEcCCCC Q lcl|NC_013644. 88 SNPVEYETENEELKEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRLC-FQVADSLNVFGVYNEYNE 160 (510) Q Consensus 88 g~p~~~~~~d~~~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~-i~~~~p~~~~~~~d~~~~ 160 (510) +-|+.+...+......+..++. |. .......+....+.+|.+|+++..+..|.+. ...+.+..+.+.. T Consensus 50 ~~p~~~~~~~~~~~~~l~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~~~~~~~~~~~~~~~~~~~----- 124 (376) T protein:vir:78 50 KSDFRLKNGETSVRDKLYYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDDFLIADSYVRKEFAFFPDV----- 124 (376) T ss_pred ccceeeccccccccchHHHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCCeeeccceeecccceeeee----- Confidence 6676653333222233333331 22 2344556677888899999888777665331 1111221111100 Q ss_pred ceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEe Q lcl|NC_013644. 161 LQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRL 240 (510) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~ 240 (510) ++.+.... ..+...+..+.+.++ T Consensus 125 ------~~~~~~~~------~~~~~~~~~~evih~--------------------------------------------- 147 (376) T protein:vir:78 125 ------FEGVTVKD------YRYNRNFSMDDVIFL--------------------------------------------- 147 (376) T ss_pred ------eeeeeeec------ceeeeeeccccEEEe--------------------------------------------- Confidence 00000000 000111222333333 Q ss_pred cCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHH--hccceeEEecCCCCchhh---hhHhh----h-----cCeeeeccC Q lcl|NC_013644. 241 SNNKQETTDLKPIKALIDDYDLMNCFLSNNLQD--FAEAIYVVSGFQGDDLSK---LRQNV----K-----SKKVVGTGS 306 (510) Q Consensus 241 ~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~--~~~~~lv~~g~~~~~~~~---~~~~~----~-----~~~~~~~~~ 306 (510) +.+.. +......+++..+..+.+...+.... ...+.+++......+.+. +.... . ...++.+++ T Consensus 148 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~~~~~~v~~l~~ 225 (376) T protein:vir:78 148 EYGNE--RLSAFTDGMFEDYGELFGKMIRAQMRNFQIRGAVNFKMAGVADKDKQTKLQEYIDKVYASFNNNEIAIVPQLE 225 (376) T ss_pred ccCCC--CchhhhhHHHHHHHHHHHHHHHHHHhcCCCceeEEEccCCCCCHHHHHHHHHHHHHHhccccccCcceEEcCC Confidence 21111 00111123334444444443333322 233444453222111111 11111 1 112444555 Q ss_pred CCceeEEeecC-----CHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 307 DGGLDVKTVTI-----PTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWM 380 (510) Q Consensus 307 ~~~~~~~~~~~-----~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~ 380 (510) +.+.+.++... ....+.+..+...+.|+..-++|+.-.++ .++.+... ...+..+|.-. T Consensus 226 g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~~~~s~~e~~~---------------~~f~~~~l~P~ 290 (376) T protein:vir:78 226 GFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLHGDMADLSNNM---------------KAYMEYCIDPL 290 (376) T ss_pred CceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCHHHHH---------------HHHHHHHHHHH Confidence 55555544332 12355666777788888888888854432 12222111 12333344444 Q ss_pred HHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHH Q lcl|NC_013644. 381 NKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDW 458 (510) Q Consensus 381 ~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~ 458 (510) ++.|...+..+--... ...+...|..-+-.|..+.++.+.+++.+|+++.-.+++.++. +++.+..+ T Consensus 291 ~~~ie~~l~~kll~~~-~~~~~~~~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~lg~~p~~~g~~d~---------- 359 (376) T protein:vir:78 291 TKKLEDELNAKLFTFS-EFLAGEHIKIIHKKDIIENAEAVDKLVASGSFNRNEVRELLGAERVDNPELDK---------- 359 (376) T ss_pred HHHHHHHHHhhhCCcc-cceecccchhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCce---------- Confidence 4444443332111110 0112233334456688999999999999999988777776543 11110000 Q ss_pred HHHHHHHHhhhccCCCCCCCCCcccCCCC Q lcl|NC_013644. 459 EDVKEALEEAEYTKGLSDNTDEEETAVNP 487 (510) Q Consensus 459 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 487 (510) .....+..+ -++.++++ T Consensus 360 --~~~~~n~~~----------~~~~~e~g 376 (376) T protein:vir:78 360 --YLITKNYQS----------ADEGGEDG 376 (376) T ss_pred --eeeccCcee----------hhccccCC Confidence 000000000 00000000 No 254 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=72.62 E-value=0.18 Score=24.66 Aligned_cols=433 Identities=13% Similarity=0.062 Sum_probs=157.3 Q ss_pred CCCc---cCCChhhhHHHHHHHHHhhh-------------hhhhH-HHHHHHHHHhccCCcchhcccceecccccccccc Q lcl|NC_013644. 1 MEAL---LSEDVKIIANALKAAIDKDR-------------KSSSK-REAETGIRYYNHENDIMNNRIFYVDDEGILREDK 63 (510) Q Consensus 1 ~~~~---~~~~~~~~~~~i~~~i~~~~-------------~~~~~-~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~ 63 (510) |-.+ ...+.....+.+.+.+..+. +.... ...-....||-|.- .+ ....+. ..+.- T Consensus 1 ~~~~~~w~~~de~~~~~~~~~~~~~~~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~gg~~--~n-~~eLI~----~YR~m 73 (533) T protein:vir:58 1 MPSLEKYKKLNEAVNFTNFLSPMYGMGAPHGAGGSSMIPINMYHPFATAGYASRFYGGIE--FN-RFFLYD----MYDRM 73 (533) T ss_pred CCCcchhhhhhHHHHHHHhhchhhcccCccCCCCCccccCCCCcchhhhhhhhhhhcccc--cc-HHHHHH----HHHHh Confidence 3222 22233333333322222110 00000 00111123343321 00 000000 00000 Q ss_pred ccccceeccchhHHHHHHHHhh-hhcCCceeccCcHHHHHHHHHHhc--cCHHHHHHHHHHHHHhcCeEEEEEEEC-C-C Q lcl|NC_013644. 64 YASNVRIPHGFFPEIVDQKTQY-LLSNPVEYETENEELKEYLAEYYN--SEFQVVLQELVEGSSQKGFEYVYARTN-A-E 138 (510) Q Consensus 64 ~~~~~ki~~n~~~~Iv~~~~~~-l~g~p~~~~~~d~~~~~~l~~~~~--n~~~~~~~e~~~~~~~~G~~~~~v~~d-~-~ 138 (510) .. ..=-+.+-...||+..+-+ ....||.+..++.++.+.+.+... -+|+.+.++..+.+.+.|+.|++.-.+ + + T Consensus 74 a~-~~pEVd~AideIvneaiv~d~~~~pV~v~l~~~e~s~~iK~kI~~lldf~~~~~~~fR~WYVDGriy~Hkiik~~k~ 152 (533) T protein:vir:58 74 DY-TDPLISTVLDIIADECTIPNENGNIVDVVTKDIELAKAILSYLDYVINIEKNAYPIIRNMIKYGDMFLHILEKGSDG 152 (533) T ss_pred hc-cCcchhhHHHhhhceeeEecCCCceeEeecccccccHHHHHHHHHHhcchhhhhHHHHhhhhcceeEEEeccCCccc Confidence 00 0001123334455444332 356788877766555554443332 368889999999999999999887543 2 3 Q ss_pred CceEEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccc Q lcl|NC_013644. 139 DRLCFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVL 218 (510) Q Consensus 139 g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 218 (510) |-..+..+||+.+-.+++--++ . .+-+|++....... +... T Consensus 153 GI~elr~lDPr~i~~vr~~~t~--------------------~-eyyvy~~~~~~~~s----~~~~-------------- 193 (533) T protein:vir:58 153 TIEKFQVVSPYIFSKRYNPETD--------------------T-WYYVITDVYRNVVS----GYFN-------------- 193 (533) T ss_pred chhhheecCCeeeEEEEeeccc--------------------e-EEEeeccccccccc----Cccc-------------- Confidence 4347888999998777653221 0 11234443211100 0000 Q ss_pred cccccccccccccCCccc---EEEecC------CCCCCCcHHHHHHHHHHHHH--HHHHHHHHHHHhccc---eeE--Ee Q lcl|NC_013644. 219 AVDSENESLLQRSYGQIP---FYRLSN------NKQETTDLKPIKALIDDYDL--MNCFLSNNLQDFAEA---IYV--VS 282 (510) Q Consensus 219 ~~~~~~~~~~~~~~g~iP---vv~~~n------n~~g~sd~~~v~~liD~~n~--~~S~~~~~~~~~~~~---~lv--~~ 282 (510) -+|| |+++.. .+.+.|-+. ..|..+|. ++-|.+-.-+..+.| +.. +- T Consensus 194 --------------~kI~~daI~y~~SGl~d~~~~~iisyLh---kAiKp~NQLkmiEDAlVIYRisRAPeRRvFYIDVG 256 (533) T protein:vir:58 194 --------------EDIPEEDVIHFSHKIDTNFFPYGRSYLE---SARAIWNQLRLMEDALMLYRVVRSVDRRVFYVDVG 256 (533) T ss_pred --------------cccchhheeeeeeccccCCCCceehhhh---HHHHHHHHHHHHHHHHHHHhhcCChhheEEEEeec Confidence 0122 222221 222334443 33444443 233333333433332 111 11 Q ss_pred cCCCCchhhhhHh----hhcCeeee-----------------------c---cCCCceeEEeecCCHHHHHHHHHHHHHH Q lcl|NC_013644. 283 GFQGDDLSKLRQN----VKSKKVVG-----------------------T---GSDGGLDVKTVTIPTEGRKTKMEIDKEN 332 (510) Q Consensus 283 g~~~~~~~~~~~~----~~~~~~~~-----------------------~---~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 332 (510) ++...-..+..++ .+...+.. + +++.+.+.-|.+...-....-+.=+++. T Consensus 257 Nlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRReGgrgTEI~TLpGg~lgemeDV~YF~kk 336 (533) T protein:vir:58 257 NVPPDKINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRRGDRRAVEIDILQGSKVDLAEDVEYMLNR 336 (533) T ss_pred CCCccCHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhhhcccccCCCccceeeecCCCCCCcHHHHHHHHHH Confidence 2111111111111 11100000 0 1222234444443222344566677788 Q ss_pred HHHHhCCccc---cccccCcccHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCC Q lcl|NC_013644. 333 IYKFGMAFDS---TQVGDGNITNIVI-KARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTRE 408 (510) Q Consensus 333 i~~~s~~p~~---~~~~~g~~Sg~Ai-~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~ 408 (510) +|..-.+|-. .+.++|..|.++. ..+++..+ .+-+..|...|++. |++ ++.+...+..+.|... T Consensus 337 Ly~ALnVP~sRl~~e~~fgr~~eItRDEiKF~KFI---~rLR~rF~~ll~~q--Lil-------k~iit~eew~~~f~~D 404 (533) T protein:vir:58 337 LISALKVPKAFIGYEGDVNAKNTLATQDIKFNNTI---KRIQGFFVEELERM--VRM-------NKEFADQDFRLVMNRS 404 (533) T ss_pred HHHHhCCCeeecCCCCCCccchhhhHHHHHHHHHH---HHHHHHHHHHHhcc--ccc-------ccCcchhheeeeeecc Confidence 8887778753 2233443333221 11122222 22333344444331 111 1223344456777655 Q ss_pred CCCCHHHHHHHHHH-----HHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHH----HHHhhhccCCCCCCCC Q lcl|NC_013644. 409 VMVNETDIVNDEKT-----EAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKE----ALEEAEYTKGLSDNTD 479 (510) Q Consensus 409 ~p~d~~e~~~~~~~-----~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~----~~~~~~~~~~~~~~~~ 479 (510) -.-.+...++.+.. ....+.+++.++.+.+=-.+|++.......++|.. ..... ..+..+.......+++ T Consensus 405 n~f~ElKe~Eil~~Ri~~l~~~dpyvgk~yi~k~ILr~tdei~~q~e~ie~E~~-~~~~~~~~~~~e~~~~~~~~~~~~p 483 (533) T protein:vir:58 405 NSIVEGERFAVIEQRIGIAERLKGWVREDWIYSNILQIPYDLKPQEEVAEAAGG-GGLFDTGGFGEETTPADFLGERGSP 483 (533) T ss_pred chHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHhcCChhhhHHHHHHHHhhc-CCCCCCCCcccccCCcccCccccCc Confidence 54444433333211 11235678888887654444432221111111100 00000 0000000000000000 Q ss_pred -----------------CcccCCCC--CCcccccccCcccccccccCCCC Q lcl|NC_013644. 480 -----------------EEETAVNP--DDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 480 -----------------~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) +...+..+ +...+.......++.+.+.|..- T Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~g~~~~~~~~p~~~ 533 (533) T protein:vir:58 484 IESPRGRTEFDFGTEGGEELGGELNLGGAFEEFEEETGGGEEELPFPEEE 533 (533) T ss_pred ccCCCChhhHhcccCCcccccccccccccchhhhhhcCCcccCCCCCCCC Confidence 00000100 11111111112222333344333 No 255 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=69.69 E-value=0.22 Score=24.20 Aligned_cols=356 Identities=11% Similarity=0.007 Sum_probs=140.9 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHH-HHHHHHHH--hccCCcchhcccceeccccccccccccc-cce--eccch Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKR-EAETGIRY--YNHENDIMNNRIFYVDDEGILREDKYAS-NVR--IPHGF 74 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~-~~~~~~~Y--Y~g~~~i~~~~~~~~~~~~~~~~~~~~~-~~k--i~~n~ 74 (510) ..+....+++.... .++..... +...+..- .-|... .+. .....+ ..+ +..+. T Consensus 22 ~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~g~~~-----------~~~--~~~~~~~t~~~~~~~~~ 80 (409) T protein:vir:83 22 PVDYNPGDPDMVEF--------RGPEEEPEARALPWIRPTAWSGYPE-----------SWA--TPSWGSAQDKLRTLIDV 80 (409) T ss_pred cccccCCCCceeec--------cCCCcchhhhhcccccccccccccc-----------ccc--ccCccccchhhHhhhHH Confidence 22223333222210 00000000 00000000 000000 000 000000 000 11122 Q ss_pred hHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHhc---cCH---HHHHHHHHHHHHhcCeEEEE-EEECCCCce-EEEEE Q lcl|NC_013644. 75 FPEIVDQKTQYLLSNPVEYETENEELKEYLAEYYN---SEF---QVVLQELVEGSSQKGFEYVY-ARTNAEDRL-CFQVA 146 (510) Q Consensus 75 ~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~---n~~---~~~~~e~~~~~~~~G~~~~~-v~~d~~g~~-~i~~~ 146 (510) ...-|+..++-+-+-|+.+--..... +.+..++. |.. ......++... ..|.+|++ +..+.+|.+ .+.++ T Consensus 81 v~acV~~Ia~~iA~lpl~~~~~~~~~-~~~~~ll~~~PN~~~t~~~f~~~l~~~l-llGnay~~~i~r~~~G~~~~L~pl 158 (409) T protein:vir:83 81 AWACIDLNASVLSSMPIYRMRNGRII-DSVAWMSNPDPEVYTSWQEFAKQLFWDF-QLGEAFVLPMAHGSDGYPIRFRVV 158 (409) T ss_pred HHHHHHHHHHhhccCceEEeeCCccc-cchhhhcccCCCCCCCHHHHHHHHHHHH-hhCCcEEEEEEECCCCcEEEEEEE Confidence 23345555555555566542111111 11222222 221 22333334443 44888875 557888875 57888 Q ss_pred cccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccc Q lcl|NC_013644. 147 DSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENES 226 (510) Q Consensus 147 ~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (510) +|..+-+..++.+.+ +|++. ..+.. T Consensus 159 ~p~~v~v~~~~~g~~-----~y~~~-------------~~~~~------------------------------------- 183 (409) T protein:vir:83 159 PPWLVNVELKKGARR-----EYRIG-------------GLNVT------------------------------------- 183 (409) T ss_pred CCcceEEEEcCCceE-----EEEEc-------------cccCc------------------------------------- Confidence 888877666543210 11100 00000 Q ss_pred cccccCCcccEEEecCC-----CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCch---hhhhHhhh- Q lcl|NC_013644. 227 LLQRSYGQIPFYRLSNN-----KQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDL---SKLRQNVK- 297 (510) Q Consensus 227 ~~~~~~g~iPvv~~~nn-----~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~---~~~~~~~~- 297 (510) =+|+|++.. -.|.|-++.....|+..+..-.-..+.+...+.|-.++.-...-+. ..+++... T Consensus 184 --------~eiiHir~~~~~~~~~G~spi~~~~~~i~~~~a~~~~~~~~f~nga~p~gil~~~~~ls~e~~~~~~~~~~~ 255 (409) T protein:vir:83 184 --------DEILHIRYQGNTADAHGHGPLESAAPRQVVIGLLQKYVQNLAETGGVPLYWLGVERRLSETEAVDLMDRWIE 255 (409) T ss_pred --------cceEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEeecCCCCCHHHHHHHHHHHHH Confidence 124454421 2466666666666665554444344445555667666643222111 22222221 Q ss_pred -----cCeeeeccCCCce-eEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc---cCcccHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 298 -----SKKVVGTGSDGGL-DVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVG---DGNITNIVIKARYTLLNMKANK 368 (510) Q Consensus 298 -----~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~---~g~~Sg~Ai~~~~~~l~~k~~~ 368 (510) .++.+.+.++.+. +.++.......+.+..+...+.|...-++|++-.+. .+..+...++...... T Consensus 256 ~~~~nag~~~il~~g~~~~~~~~~s~~d~q~le~r~~~~~eIa~~fgVPp~llg~~~~~~~~tysn~eq~~~~f------ 329 (409) T protein:vir:83 256 SRSKYAGHPALVTGGATLNQAKSMSAQDLSLMELTQFNEARIAILLGVPPFLVGLPGATGSLTYSNIEQLFSFH------ 329 (409) T ss_pred hhCCccCccceecCCcccccccCCCHHHHHHHHHHHhhHHHHHHHhCCCHHHccCCCCccccccccHHHHHHHH------ Confidence 1223444454443 233332222334555566677888888888753321 1111111111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHH Q lcl|NC_013644. 369 TEARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLR 448 (510) Q Consensus 369 k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~ 448 (510) +...|.-.++.|...++.+--. ....+++.+..-+-.|.++.++...++.++|+++.-.+.+.++. T Consensus 330 ----~~~tL~P~~~~ie~~l~~~Ll~--~~~~~~f~~~~llr~d~~~r~~~~~~~~~~G~lT~NE~R~~~gl-------- 395 (409) T protein:vir:83 330 ----DRSSLRPKATAVMAALDRWALP--SPQHLELNRDDYTRPSLVERATAYKIMIEAGVMEPNEARAMERL-------- 395 (409) T ss_pred ----HHHHHHHHHHHHHHHHHHhhCC--CCcEEEeehhhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCC-------- Confidence 1122222222222222211000 01235555556667888999999999999998887555554321 Q ss_pred HHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCcccCC Q lcl|NC_013644. 449 LICEQFDLDWEDVKEALEEAEYTKGLSDNTDEEETAV 485 (510) Q Consensus 449 ~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (510) + ..+++++-...+. T Consensus 396 ----------------------p-p~~ggd~l~~~gv 409 (409) T protein:vir:83 396 ----------------------H-SEAAAVRLSGGGV 409 (409) T ss_pred ----------------------C-CCCCCcccCCCCC Confidence 0 0011111111111 No 256 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=68.64 E-value=0.23 Score=24.04 Aligned_cols=393 Identities=11% Similarity=0.045 Sum_probs=154.7 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) ++..... .+...+.-..+|+ +|+.+..+++-.. -...||+ T Consensus 58 ~q~~y~~-~e~~~~~~~eLI~---------~YR~ma~~pEvd~------------------------------Av~eIVn 97 (523) T protein:vir:68 58 FQRMFGS-QEPGLKSTRELID---------TYRNLMTNYEVDN------------------------------AVSEIVS 97 (523) T ss_pred hhhhhhc-cccccchHHHHHH---------HHHHHhhccchhh------------------------------HHHHhhc Confidence 1111111 0111111112222 2222222222222 2334444 Q ss_pred HHHhh-hhcCCceeccCcH----HHHHHH----HHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC----CceEEEEE Q lcl|NC_013644. 81 QKTQY-LLSNPVEYETENE----ELKEYL----AEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAE----DRLCFQVA 146 (510) Q Consensus 81 ~~~~~-l~g~p~~~~~~d~----~~~~~l----~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~----g~~~i~~~ 146 (510) ..+-+ -...||.+..++- ...+.| +.+++ =+|+.+.++..+.+.+.|+.|++..+|.+ |-..+..+ T Consensus 98 eaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~l 177 (523) T protein:vir:68 98 DAIVYEDDTEVVSINLDNTKFSPNIKSMMLDEFNEVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRRL 177 (523) T ss_pred ceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhheeeeEEEEEEEeeCCCccccceeeeee Confidence 33322 2345666654432 222222 33332 26778889999999999999999888743 66788999 Q ss_pred cccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccc Q lcl|NC_013644. 147 DSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENES 226 (510) Q Consensus 147 ~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (510) ||+.+-.|.--..+......+ + . .+.-+-+|.+.... |... |.... T Consensus 178 DPr~i~~vr~i~~~~~~g~~v--i---~-----~~~e~f~Y~~~~~~-~~~~-g~~~~---------------------- 223 (523) T protein:vir:68 178 DPRQVQYVREVITTTEAGVKI--V---K-----GYKEYFIYDTSHES-YACD-GRIYE---------------------- 223 (523) T ss_pred CCcceeEEEeecCCCCcchhh--h---h-----hhhhheeecccccc-cccc-ccccC---------------------- Confidence 999775543210000000000 0 0 00111123332211 1000 00000 Q ss_pred cccccCCccc---EEEecCC---CCCCCcHHHHHHHHHHHHH--HHHHHHHHHHHhccceeEE----ec-CCCCchhhh- Q lcl|NC_013644. 227 LLQRSYGQIP---FYRLSNN---KQETTDLKPIKALIDDYDL--MNCFLSNNLQDFAEAIYVV----SG-FQGDDLSKL- 292 (510) Q Consensus 227 ~~~~~~g~iP---vv~~~nn---~~g~sd~~~v~~liD~~n~--~~S~~~~~~~~~~~~~lv~----~g-~~~~~~~~~- 292 (510) ++.-=+|| |++.... ..+--.+.-+...|..+|. ++-|.+-.-+..+.|=.=+ .| +...-.... T Consensus 224 --~~~~ikI~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl 301 (523) T protein:vir:68 224 --AGTKIKIPKAAIVYAHSGLVDCCGKNIIGYLHRAIKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPSRKAAEHM 301 (523) T ss_pred --CCcceecchhheeeeeccceeCCCCceeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHH Confidence 00000222 2211111 0011122233344555553 3444444444444432211 11 111111111 Q ss_pred ---hHhhhcCeeeec-----------------------cCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCccc-- Q lcl|NC_013644. 293 ---RQNVKSKKVVGT-----------------------GSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDS-- 342 (510) Q Consensus 293 ---~~~~~~~~~~~~-----------------------~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~-- 342 (510) +...+...+... +++.+.+.-|.+ .+.. ...-+.-+++.+|+.-.+|-. T Consensus 302 ~~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlg-em~DV~YF~kkLy~aLnVP~sRl 380 (523) T protein:vir:68 302 QHVMNTMKNRIAYDATTGKIKNQQHIMSMTEDYWLQRRDGKAVTEVDTLPGADNTG-NMEDVRWFRNALYMALRIPITRI 380 (523) T ss_pred HHHHHhhcceeEEeccCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcC-hHHHHHHHHHHHHHHhCCcceee Confidence 111111111111 112223333333 2322 234455566677776677742 Q ss_pred -cccc---cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCcccc--ceeeEEeCCCCCCCHHHH Q lcl|NC_013644. 343 -TQVG---DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDP--TEVSFTFTREVMVNETDI 416 (510) Q Consensus 343 -~~~~---~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~--~~v~i~f~~~~p~d~~e~ 416 (510) .+.+ +|..| .|.......-.-+.+-+..|...+.++++.=+-+-++....+|+. ..+.+.|...-.-.+... T Consensus 381 ~~~~~~f~~Gr~~--EItRDEikF~KFI~rLR~rFs~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe 458 (523) T protein:vir:68 381 PSDQGGIQFDAGT--SITRDELSFGKFIRELQHKFEEIFLDPLKTNLILKGIITEDEWNDEINNIKIKFHRDSYFSELKD 458 (523) T ss_pred cCCCcceeccccc--chhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHH Confidence 1211 23222 232333333444566666777777776665443334444445543 357778865555444433 Q ss_pred HHH-------HHHHHh--cCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCC Q lcl|NC_013644. 417 VND-------EKTEAE--TRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDE 480 (510) Q Consensus 417 ~~~-------~~~~~~--~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (510) ++. +..+.. +..+|.+++.+.+=-.+|+|..++....+++ .+..-+++......+. T Consensus 459 ~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~kqI~~E--------~k~~~~~~p~~e~~~f 523 (523) T protein:vir:68 459 AEILERRINMLQMAEPFIGKYISHRTAMKDILQMSDEEIEQEAKQIEEE--------SKEARFQDPDQEQEDF 523 (523) T ss_pred HHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHH--------hhcCCCCCCchhhhcC Confidence 333 222221 3346999999876556665533222111111 1111111100000111 No 257 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=68.21 E-value=0.24 Score=23.98 Aligned_cols=357 Identities=10% Similarity=-0.021 Sum_probs=127.5 Q ss_pred HHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCCceec-cCcHH-HHHHHHHHhc Q lcl|NC_013644. 32 AETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNPVEYE-TENEE-LKEYLAEYYN 109 (510) Q Consensus 32 ~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~-~~d~~-~~~~l~~~~~ 109 (510) |-.+..+-..+..- ......+....... ...-+........|+..++-+.+-|+.+- .+++. ...-+..++. T Consensus 1 MGlf~~~~~~~~~~-----~~~~~~~~~~~~~~-~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~~~~~~~~~~~lL~ 74 (395) T protein:vir:98 1 MGILDFFSFKKSGT-----LSDDDSGSTTSEKL-TNVVLKEDALYKCVNYLARIISKSTFRLKTPEKLTENQKDWLYWIN 74 (395) T ss_pred CcchhhhcCCCccc-----ccccccchhhhhhc-chhhhhhHHHHHHHHHHHHHHhhCceeEEecCCcccccchHHHHHh Confidence 11011110000000 00000000000000 00001122334445556665666676642 22222 1122333332 Q ss_pred ---cC---HHHHHHHHHHHHHhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEE Q lcl|NC_013644. 110 ---SE---FQVVLQELVEGSSQKGFEYVYARTNAEDRLCFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHH 183 (510) Q Consensus 110 ---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 183 (510) |. .......++...+.+|.||+++-.+..+ . + |......+..... . ++.+.. ++ ... T Consensus 75 ~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~~~--~---~-~~~~~~~~~~~~~----~-~~~~~~-~~-----~~~ 137 (395) T protein:vir:98 75 TKANPNQSASQFWVEVIQKLLVDGETLIFVIPGKGI--Y---V-ADSFTQDKKISGS----Q-FKVSRV-QG-----QTY 137 (395) T ss_pred hcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeCCce--e---c-CCcccccccccCc----c-cceeee-cC-----cee Confidence 32 2344556678888999999876655321 1 1 1111111100000 0 000000 00 000 Q ss_pred EEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCC-----CCCCcHHHHHHHHH Q lcl|NC_013644. 184 AEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNK-----QETTDLKPIKALID 258 (510) Q Consensus 184 ~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~-----~g~sd~~~v~~liD 258 (510) -..|.+..+ +||++.. .+.|.+.....++. T Consensus 138 ~~~~~~~ev---------------------------------------------ih~k~~~~~~~~~~~~~~~~~~~~~~ 172 (395) T protein:vir:98 138 EKTFTFDQV---------------------------------------------IYLKNDNSDLMSKVESLWEEYGELLG 172 (395) T ss_pred eeEecCccE---------------------------------------------EEecCCCCCccccccchhhhHHHHHH Confidence 011222233 3333211 11122222222211 Q ss_pred -HHHHHH-HHHHHHHHHhccceeEEecCCCCc---hh----hhhHh-h-----hcCeeeeccCCCceeEEeec------C Q lcl|NC_013644. 259 -DYDLMN-CFLSNNLQDFAEAIYVVSGFQGDD---LS----KLRQN-V-----KSKKVVGTGSDGGLDVKTVT------I 317 (510) Q Consensus 259 -~~n~~~-S~~~~~~~~~~~~~lv~~g~~~~~---~~----~~~~~-~-----~~~~~~~~~~~~~~~~~~~~------~ 317 (510) .++... ......+..+..+...+.+..... .. +..+. . ...+++.+++|.+.+.++.. . T Consensus 173 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~ 252 (395) T protein:vir:98 173 HVINNQKIANQIRFTMIPPKDKVRERAQENSDGGRQSKSDKDFFKRTVEKIRTESVVGIPVTANTNYEEYGSKNTGAVKS 252 (395) T ss_pred HHHHHHHHHHHHHHhhccccccccccccccCCcHHHHHHHHHHHHHHHhhhhcCCcceeecCCCceeEecccccccccCh Confidence 111111 111112222333333333222111 11 11111 1 11223444444444444321 1 Q ss_pred CHHHHHHHHHHHHHHHHHHhCCccccccc-cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCcc Q lcl|NC_013644. 318 PTEGRKTKMEIDKENIYKFGMAFDSTQVG-DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAF 396 (510) Q Consensus 318 ~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~ 396 (510) ....+....+...+.|...=++|+.-.++ .++.+...+ ..+...|.-.++.|...+..+--... T Consensus 253 ~~~q~~e~~~~~~~~Ia~~fgVP~~~l~~~~sn~e~~~~---------------~f~~~tl~P~~~~ie~~l~~kll~~~ 317 (395) T protein:vir:98 253 YVDDIKKLKDQYMAEFAEMLGIPISLLHGDIADNQKNYE---------------LLLEGPIESLITNIVDGLEYAIFDKS 317 (395) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHhcCCcccHHHHHH---------------HHHHHHHHHHHHHHHHHHHHhcCChh Confidence 22344555556667777777788754421 111111111 22233333333333333322111110 Q ss_pred c-cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCC Q lcl|NC_013644. 397 D-PTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALEEAEYTKG 473 (510) Q Consensus 397 ~-~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~ 473 (510) . ...+.+.|+.-+..|..+.++.+.++..+|+++.-.+++.++. ++++...+ .-...+..+. T Consensus 318 ~~~~g~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~Pi~~~~gD~------------~~~~~n~~~~--- 382 (395) T protein:vir:98 318 ETLQGSFIKVTGLKNYDLFSISNQADKLISSGFVFIDEVREEIGLPELPDGLGKV------------LYMTKNYESV--- 382 (395) T ss_pred hhcCcceeeehhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCce------------eeecccceec--- Confidence 0 1234567888889999999999999999999999888887643 22211100 0011111111 Q ss_pred CCCCCCCcccCCCCCCcc Q lcl|NC_013644. 474 LSDNTDEEETAVNPDDPT 491 (510) Q Consensus 474 ~~~~~~~~~~~~~~~~~~ 491 (510) +....+++++.++ T Consensus 383 -----~~~gge~~~~~~~ 395 (395) T protein:vir:98 383 -----LERGGEVDEEVET 395 (395) T ss_pred -----ccccCCCCCCCCC Confidence 0000001111111 No 258 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=65.80 E-value=0.28 Score=23.64 Aligned_cols=335 Identities=8% Similarity=0.035 Sum_probs=127.6 Q ss_pred ccC-Ccchhcccceeccccccccccccccceec--cchhHHHHHHHHhhhhcCCcee-c--cCc-------HHHHHHHHH Q lcl|NC_013644. 40 NHE-NDIMNNRIFYVDDEGILREDKYASNVRIP--HGFFPEIVDQKTQYLLSNPVEY-E--TEN-------EELKEYLAE 106 (510) Q Consensus 40 ~g~-~~i~~~~~~~~~~~~~~~~~~~~~~~ki~--~n~~~~Iv~~~~~~l~g~p~~~-~--~~d-------~~~~~~l~~ 106 (510) .|- +.+....+........ +.....+..+. .......|+..++-+.+-|+.+ . ..+ .....-+.. T Consensus 1 Mg~f~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~~ 78 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNNDTQ--RVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDLDE 78 (378) T ss_pred CCccccchhcccccccCCcc--eeeeeccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccCcccccccccccchHHH Confidence 110 1110000000000000 00000111111 1233445555555555667653 1 111 011123333 Q ss_pred Hhc---cC---HHHHHHHHHHHHHhcCeEEEEE-EECCCCceEEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCcee Q lcl|NC_013644. 107 YYN---SE---FQVVLQELVEGSSQKGFEYVYA-RTNAEDRLCFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETV 179 (510) Q Consensus 107 ~~~---n~---~~~~~~e~~~~~~~~G~~~~~v-~~d~~g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 179 (510) +++ |. .......+....+.+|.||+++ +.+..|++... +| + ++. T Consensus 79 lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~g~~~~l-------~p--~------------------~~~-- 129 (378) T protein:vir:94 79 VLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNTGELLDL-------LF--A------------------DDK-- 129 (378) T ss_pred HHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCCceEEEE-------Ee--c------------------CCe-- Confidence 432 22 2344556678889999999864 33333433211 01 0 000 Q ss_pred EEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCCcHHHHHHHHHH Q lcl|NC_013644. 180 DIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTDLKPIKALIDD 259 (510) Q Consensus 180 ~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~v~~liD~ 259 (510) + .|.++.+ +|+++.-.+......+..+..+ T Consensus 130 -~----~~~~~di---------------------------------------------iH~~~~~~~~~g~s~l~~~~~~ 159 (378) T protein:vir:94 130 -K----EYKPEEL---------------------------------------------VRLTSPFYINEDTSILDNALAS 159 (378) T ss_pred -e----Eeeeeee---------------------------------------------EEecCcCCccchhHHHHHHHHH Confidence 0 0111122 2222111111112223334444 Q ss_pred HHHHHHHHHHHHHHhccceeEEecCCCCc-hh----hhhHhhh-------cCeeeeccCCCceeEEeecCCHHHHHHHHH Q lcl|NC_013644. 260 YDLMNCFLSNNLQDFAEAIYVVSGFQGDD-LS----KLRQNVK-------SKKVVGTGSDGGLDVKTVTIPTEGRKTKME 327 (510) Q Consensus 260 ~n~~~S~~~~~~~~~~~~~lv~~g~~~~~-~~----~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (510) ++..++.- ...-++...+.-..+ .. .+..... .++++.++++.+++.++.......+ ...+ T Consensus 160 i~~~~~~~------~~~gil~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~ 232 (378) T protein:vir:94 160 IQTKLEQG------KLRGLLKINAFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEID 232 (378) T ss_pred HHHHHhcc------cccceeeeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceecCCCceEEEccCChhhhhH-HHHH Confidence 44333220 011222222221111 11 1111111 2235666666555555544333333 4556 Q ss_pred HHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC---------Ccccc Q lcl|NC_013644. 328 IDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYT---------KAFDP 398 (510) Q Consensus 328 ~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~---------~~~~~ 398 (510) .+.+.|+..-++|+.-.. ++.|... ....+...|.-.++.|...+..+-- ..... T Consensus 233 ~~~~~Ia~~fgVP~~~l~--~~~se~~--------------~~~f~~~tL~P~~~~ie~~l~~~Ll~~~er~~g~~~~~~ 296 (378) T protein:vir:94 233 LIKSELLTGYFMNENILL--GTASQEQ--------------QIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYY 296 (378) T ss_pred HHHHHHHHHhCCCHHHhc--CChHHHH--------------HHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccc Confidence 777888888888874332 1111111 1124444555555555544432111 11111 Q ss_pred ceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCC Q lcl|NC_013644. 399 TEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSD 476 (510) Q Consensus 399 ~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~ 476 (510) ..+.+.+..-+-.|..+.++.+.++.++|+++.-.++++++. +++-+.. ....+..+.. ..+. T Consensus 297 ~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD~~--------------~~~~n~~~~~-~~~~ 361 (378) T protein:vir:94 297 ERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDVY--------------IANLNAVAVK-NLSD 361 (378) T ss_pred cceeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCee--------------eecccccccc-cchh Confidence 234555566778899999999999999999998888777643 2211100 0001111000 0000 Q ss_pred CCCCcccCCCCCCccccc Q lcl|NC_013644. 477 NTDEEETAVNPDDPTQQM 494 (510) Q Consensus 477 ~~~~~~~~~~~~~~~~~~ 494 (510) ....+. +..+++++.++ T Consensus 362 ~~~~~~-~~~~~~e~~n~ 378 (378) T protein:vir:94 362 LQGSRK-DVTSTDETNNQ 378 (378) T ss_pred hcCCcC-CCCCCCCCCCC Confidence 000000 00111111111 No 259 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=64.28 E-value=0.3 Score=23.43 Aligned_cols=393 Identities=11% Similarity=0.038 Sum_probs=154.6 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) ++.+... .+...+.-..+|+ +|+.+..+++-... ...||+ T Consensus 58 ~~~~~g~-~e~~~~~~~eLI~---------~YR~ma~~pEvd~A------------------------------v~eIVn 97 (524) T protein:vir:72 58 FQTIFGS-YEPGMKTTRELID---------TYRNLMNNYEVDNA------------------------------VSEIVS 97 (524) T ss_pred eeehhcc-cccccchHHHHHH---------HHHHHhhccchhhH------------------------------HHHhhc Confidence 2222221 0000111122222 22223233332222 233333 Q ss_pred HHHhh-hhcCCceeccCcH----HHHH----HHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC----CceEEEEE Q lcl|NC_013644. 81 QKTQY-LLSNPVEYETENE----ELKE----YLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAE----DRLCFQVA 146 (510) Q Consensus 81 ~~~~~-l~g~p~~~~~~d~----~~~~----~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~----g~~~i~~~ 146 (510) ...-+ -...||.+..++- ...+ ..+.+++ =+|+.+.++..+.+.+.|+.|++..+|.+ |-..+..+ T Consensus 98 eaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~l 177 (524) T protein:vir:72 98 DAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFSDVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRRL 177 (524) T ss_pred ceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccccceeeeee Confidence 32222 2345555544332 2222 2233332 26788889999999999999999888743 66788999 Q ss_pred cccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccc Q lcl|NC_013644. 147 DSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENES 226 (510) Q Consensus 147 ~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (510) ||+.+-.|.---.+......+ + . .+.-+-+|.++. ..|... |..... T Consensus 178 DPr~i~~vr~i~~~~~~~~~v--i---~-----~~~e~f~Y~~~~-~~y~~~-g~~~~~--------------------- 224 (524) T protein:vir:72 178 DPRQVQYVREIITETEAGTKI--V---K-----GYKEYFIYDTAH-ESYACD-GRMYEA--------------------- 224 (524) T ss_pred CCccceeeeeeccCCCccchh--h---c-----chhhheeeccCc-cccccC-ccccCC--------------------- Confidence 999885544210010000000 0 0 001111333321 111100 000000 Q ss_pred cccccCCccc---EEEecCC---CCCCCcHHHHHHHHHHHHH--HHHHHHHHHHHhccceeEE----ec-CCCCchhhhh Q lcl|NC_013644. 227 LLQRSYGQIP---FYRLSNN---KQETTDLKPIKALIDDYDL--MNCFLSNNLQDFAEAIYVV----SG-FQGDDLSKLR 293 (510) Q Consensus 227 ~~~~~~g~iP---vv~~~nn---~~g~sd~~~v~~liD~~n~--~~S~~~~~~~~~~~~~lv~----~g-~~~~~~~~~~ 293 (510) +.-=+|| |++.... ..+--.+.-+...|..+|. ++-|.+-.-+..+.|=.=+ .| +...-..... T Consensus 225 ---~~~ikI~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl 301 (524) T protein:vir:72 225 ---GTKIKIPKAAVVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHM 301 (524) T ss_pred ---CcceecchhheeeeeccceeCCCCceeccchhhhHhHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHH Confidence 0000222 2211111 0011122233344455553 3444444444444432211 11 1111111111 Q ss_pred H----hhhcCeeee--------------------c---cCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCccccc Q lcl|NC_013644. 294 Q----NVKSKKVVG--------------------T---GSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDSTQ 344 (510) Q Consensus 294 ~----~~~~~~~~~--------------------~---~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~ 344 (510) + ..+...+.. + +++.+.+.-|.+ .+.. ...-+.-+++.+|+.-.+|-.-. T Consensus 302 ~~im~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlg-em~DV~YF~kkLy~aLnVP~sRl 380 (524) T protein:vir:72 302 QHVMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTG-NMEDIRWFRQALYMALRVPLSRI 380 (524) T ss_pred HHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcC-hHHHHHHHHHHHHHHhCCchhhc Confidence 1 111111111 1 122223333333 2222 23445556666777666774211 Q ss_pred --cc-----cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCcccc--ceeeEEeCCCCCCCHHH Q lcl|NC_013644. 345 --VG-----DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDP--TEVSFTFTREVMVNETD 415 (510) Q Consensus 345 --~~-----~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~--~~v~i~f~~~~p~d~~e 415 (510) +. +|..|. |.......-.-+.+-+..|...+.++++.=+-+-++....+|+. ..+.+.|...-.-.+.. T Consensus 381 ~~d~~~~f~~gr~~E--ItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElK 458 (524) T protein:vir:72 381 PQDQQGGVMFDSGTS--ITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFAELK 458 (524) T ss_pred CCCCCccccccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHH Confidence 11 232322 22223333344566666777777776665443334444445543 35777886655544444 Q ss_pred HHHH-------HHHHHh--cCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCC Q lcl|NC_013644. 416 IVND-------EKTEAE--TRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDE 480 (510) Q Consensus 416 ~~~~-------~~~~~~--~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (510) .++. +..+.. +..+|.+++.+.+=-.+|+|..++....+++ .+..-+++......+. T Consensus 459 e~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E--------~k~~~~~~~~~~~~~f 524 (524) T protein:vir:72 459 EAEILERRINMLTMAEPFIGKYISHRTAMKDILQMTDEEIEQEAKQIEEE--------SKEARFQDPDQEQEDF 524 (524) T ss_pred HHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHH--------hhcCCCCCCchhhhcC Confidence 3333 222221 3346999999876556665533222111111 1111111100000111 No 260 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=63.77 E-value=0.31 Score=23.36 Aligned_cols=393 Identities=11% Similarity=0.036 Sum_probs=154.7 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) ++.+... .+...+.-..+|+ +|+.+..+++-... ...||+ T Consensus 58 ~~~~~g~-~e~~~~~~~eLI~---------~YR~ma~~pEvd~A------------------------------v~eIVn 97 (524) T protein:vir:10 58 FQTIFGS-YEPGMKTTRELID---------TYRNLMNNYEVDNA------------------------------VSEIVS 97 (524) T ss_pred eeehhcc-cccccchHHHHHH---------HHHHHhhccchhhH------------------------------HHHhhc Confidence 2222221 0000111122222 22223233332222 233333 Q ss_pred HHHhh-hhcCCceeccCcH----HHHH----HHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC----CceEEEEE Q lcl|NC_013644. 81 QKTQY-LLSNPVEYETENE----ELKE----YLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNAE----DRLCFQVA 146 (510) Q Consensus 81 ~~~~~-l~g~p~~~~~~d~----~~~~----~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~----g~~~i~~~ 146 (510) ...-+ -...||.+..++- ...+ ..+.+++ =+|+.+.++..+.+.+.|+.|++..+|.+ |-..+..+ T Consensus 98 eaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~l 177 (524) T protein:vir:10 98 DAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFNDVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRRL 177 (524) T ss_pred ceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEeeCCCccccceeeeee Confidence 32222 2345555544332 2222 2233332 26788889999999999999999888743 66788999 Q ss_pred cccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccc Q lcl|NC_013644. 147 DSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENES 226 (510) Q Consensus 147 ~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (510) ||+.+-.|.---.+......+ .. .+.-+-+|.++. ..|... |..... T Consensus 178 DPr~i~~vr~i~~~~~~~~~v-----i~-----~~~e~f~Y~~~~-~~y~~~-g~~~~~--------------------- 224 (524) T protein:vir:10 178 DPRQVQYVREIITETEAGTKI-----VK-----GYKEYFIYDTAH-ESYACD-GRMYEA--------------------- 224 (524) T ss_pred CCccceeeeeeccCCCccchh-----hc-----chhhheeeccCc-cccccC-ccccCC--------------------- Confidence 999885544210010000000 00 001111333321 111100 000000 Q ss_pred cccccCCccc---EEEecCC---CCCCCcHHHHHHHHHHHHH--HHHHHHHHHHHhccceeEE----ec-CCCCchhhhh Q lcl|NC_013644. 227 LLQRSYGQIP---FYRLSNN---KQETTDLKPIKALIDDYDL--MNCFLSNNLQDFAEAIYVV----SG-FQGDDLSKLR 293 (510) Q Consensus 227 ~~~~~~g~iP---vv~~~nn---~~g~sd~~~v~~liD~~n~--~~S~~~~~~~~~~~~~lv~----~g-~~~~~~~~~~ 293 (510) +.-=+|| |++.... ..+--.+.-+...|..+|. ++-|.+-.-+..+.|=.=+ .| +...-..... T Consensus 225 ---~~~ikI~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl 301 (524) T protein:vir:10 225 ---GTKIKIPKAAIVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHM 301 (524) T ss_pred ---CcceecchhheeeeeccceeCCCCceeccchhhhHHHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHH Confidence 0000222 2211111 0011122233344455553 3444444444444432211 11 1111111111 Q ss_pred H----hhhcCeeee--------------------c---cCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCccccc Q lcl|NC_013644. 294 Q----NVKSKKVVG--------------------T---GSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDSTQ 344 (510) Q Consensus 294 ~----~~~~~~~~~--------------------~---~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~ 344 (510) + ..+...+.. + +++.+.+.-|.+ .+.. ...-+.-+++.+|+.-.+|-.-. T Consensus 302 ~~im~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlg-em~DV~YF~kkLy~aLnVP~sRl 380 (524) T protein:vir:10 302 QHVMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTG-NMEDVRWFRQALYMALRVPLSRI 380 (524) T ss_pred HHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcC-hHHHHHHHHHHHHHHhCCchhhc Confidence 1 111111111 1 122223333333 2222 23445556666777666774211 Q ss_pred --cc-----cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCcccc--ceeeEEeCCCCCCCHHH Q lcl|NC_013644. 345 --VG-----DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDP--TEVSFTFTREVMVNETD 415 (510) Q Consensus 345 --~~-----~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~--~~v~i~f~~~~p~d~~e 415 (510) +. +|..|. |.......-.-+.+-+..|...+.++++.=+-+-++....+|+. ..+.+.|...-.-.+.. T Consensus 381 ~~d~~~~f~~gr~~E--ItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElK 458 (524) T protein:vir:10 381 PQDQQGGVMFDSGTS--ITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFTELK 458 (524) T ss_pred CCCCCccccccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHH Confidence 11 232322 22223333344566666777777776665443334444445543 35777887655544444 Q ss_pred HHHH-------HHHHHh--cCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCC Q lcl|NC_013644. 416 IVND-------EKTEAE--TRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDE 480 (510) Q Consensus 416 ~~~~-------~~~~~~--~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (510) .++. +..+.. +..+|.+++.+.+=-.+|+|..++....+++ .+..-+++......+. T Consensus 459 e~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E--------~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 459 EAEILERRINMLTMAEPFIGKYISHRTAMKDILQMTDEEIEQEAKQIEEE--------SKEARFQDPDQEQEDF 524 (524) T ss_pred HHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHH--------hhcCCCCCCchhhhcC Confidence 3333 222221 3346999999876556665533222111111 1111111110000111 No 261 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=60.56 E-value=0.37 Score=22.95 Aligned_cols=460 Identities=13% Similarity=0.117 Sum_probs=177.8 Q ss_pred CCCccCCChhhhHHHHHH--------HHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceecc Q lcl|NC_013644. 1 MEALLSEDVKIIANALKA--------AIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPH 72 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~--------~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~ 72 (510) |..-...++..+++-+.+ .-.++. .-+++.....+-|.+... . ..... .| . T Consensus 1 m~~~~~~~~~~tpe~la~~W~~~I~~a~~~~~--~~h~r~~~~~k~y~~~~~-------------~-~~~~~---~r--~ 59 (663) T protein:vir:34 1 MNESQPTDFADTPQGWAQRWQEEMSAAREPLE--KWHTQGKEIVKRYRDERD-------------S-AHDAE---TR--W 59 (663) T ss_pred CCccccccchhcchhHHHHHHHHHHHHHhccc--hHHHHHHHHHHHhhcccc-------------C-CCccc---cc--c Confidence 777655566666554322 111111 112233344444444321 0 00000 11 1 Q ss_pred chhHHHHHHHHhhhhcCCceec------cCc----HHHHHHHHHHhc-------cCHHHHHHHHHHHHHhcCeEEEEEEE Q lcl|NC_013644. 73 GFFPEIVDQKTQYLLSNPVEYE------TEN----EELKEYLAEYYN-------SEFQVVLQELVEGSSQKGFEYVYART 135 (510) Q Consensus 73 n~~~~Iv~~~~~~l~g~p~~~~------~~d----~~~~~~l~~~~~-------n~~~~~~~e~~~~~~~~G~~~~~v~~ 135 (510) |++--=|....--+.+.+|..+ ..+ ..+.+.+.+.++ ++++..+...+++++.+|+|-+.+.+ T Consensus 60 nl~~sni~~i~P~iYar~P~p~V~~rf~d~d~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~v~Y 139 (663) T protein:vir:34 60 NLFSTNIQTQMASLYGQTPKVSVSRRFADADDDVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCRIRY 139 (663) T ss_pred chhhhhHHHHhhhhhcCCCcceeeecccCcccchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEEEEe Confidence 2222112222233445554432 222 234445555431 34666777778899999988776654 Q ss_pred --------------CCC-C----------------ceEEEEEcccceE----EEEcCCCCceeEEE-EEEE--------- Q lcl|NC_013644. 136 --------------NAE-D----------------RLCFQVADSLNVF----GVYNEYNELQRICR-HYIT--------- 170 (510) Q Consensus 136 --------------d~~-g----------------~~~i~~~~p~~~~----~~~d~~~~~~~~~~-~~~~--------- 170 (510) |+. + .++|..+.=+.+. =.|++ ...+++ +|.. T Consensus 140 e~~~~~~~~~~~~~D~~~~~~~a~~~~~~e~~a~E~v~id~v~~~dfl~~pAr~W~e---v~wva~r~~mtk~e~~~rf~ 216 (663) T protein:vir:34 140 EVEWEEVAGVDAILDEATGAELAAAVPPTQRKAYECVETDYLHWQDVLWSPARVWHE---VRWLAFRNLLDMREFNARFD 216 (663) T ss_pred ecccchhccccccCCCccccchhcccccchhhcccceeeeeechhhcccchhhcccc---ccceeeeccCCHHHHHHhhc Confidence 111 0 1233332222110 11221 111111 0000 Q ss_pred --------------EEeeC---Cce----eEEEEEEEEcCCcEEEEEEcCCceeeccccccccccccccccccccccccc Q lcl|NC_013644. 171 --------------EIEKD---GET----VDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQ 229 (510) Q Consensus 171 --------------~~~~~---~~~----~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (510) ....+ +.. ....-.|+|+...-..|-...|-...++.. ++.... T Consensus 217 ~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~VwEIWdK~~~~V~w~~eg~~~~L~~~---------------~p~lgl 281 (663) T protein:vir:34 217 ADGSRNLWASVPKVGKPKDGKDGQSCHPWDRAEVWEIWDKGGRKVDWYVEGYSAVLDTQ---------------PDPLGL 281 (663) T ss_pred CChhhhhhhhccCcCCccccCCCCCcchhcCcceeEEEecCCcEEEEEEcCcceecccC---------------CCCCCC Confidence 00000 000 122334777765333322211111211111 011111 Q ss_pred ccCCcccEEEecC----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCchhhhhHhhhcCeeeec- Q lcl|NC_013644. 230 RSYGQIPFYRLSN----NKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDDLSKLRQNVKSKKVVGT- 304 (510) Q Consensus 230 ~~~g~iPvv~~~n----n~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~- 304 (510) .+|-=||..-+++ +-...++|--.+.+++++|.+-.. .|.+.+.-.+-.+...-.+++.+........+.++.+ T Consensus 282 ~~ffPcPrpl~~~~~~ds~ipvpd~~~y~~~~~E~n~~t~R-in~l~d~ikv~gvy~~~~g~~i~~~l~~a~~n~lvpV~ 360 (663) T protein:vir:34 282 ESFFPCPKPLLANWTTDKVVPRPDFVLAQDLYKEIDLVSTR-ITLLERAIRVVGVYDKSSGLTIGRLLSEAAQNDLIPVE 360 (663) T ss_pred CCCCCCcccccceecCCCeecCCcHHHHHHHHHHHHHHHHH-HHHHHhhhhhceeeccccchhHHHHHHHhhCCCceecc Confidence 1222234333322 345779999889999999986433 4455444444444322223233332222222333322 Q ss_pred -----cCCCc----eeEEeecC---CHHHHHHHHHHHHHHHHHHhCCcccccccc-CcccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 305 -----GSDGG----LDVKTVTI---PTEGRKTKMEIDKENIYKFGMAFDSTQVGD-GNITNIVIKARYTLLNMKANKTEA 371 (510) Q Consensus 305 -----~~~~~----~~~~~~~~---~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Ai~~~~~~l~~k~~~k~~ 371 (510) .+.|+ +.++-.+. ...++-..-..++.++|++|+.-++.-+.+ .+-+..|-..+-+.+-.++.+++. T Consensus 361 ~~~~~~~~gg~~k~I~~~pi~~~~~aI~~l~~~r~qir~d~~qITGiaDi~Rga~~a~ETatAQ~IKsq~gS~RIqe~qd 440 (663) T protein:vir:34 361 NWLTFADKGGLRGVVDWFPLEPVVAALTSLRDYRRELVDALHQVTGMADIMRGASDPRETAMAQGVKAKFGSIRLQRLQD 440 (663) T ss_pred hhhhhhhhcCccchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHhHHHHhhcccCcchhhHHHHHHHHHHhHHHHHHHH Confidence 22233 33332221 224445666788889999998877654433 234666666777788889999999 Q ss_pred HHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHh-CCCCCcH--HHHH Q lcl|NC_013644. 372 RLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQV-APRLDDD--NVLR 448 (510) Q Consensus 372 ~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~-~~~v~d~--e~~~ 448 (510) ......+.+.++...++.-.. ....+.=.-.-.+|. ..++......+....+-.-..-++- .....|. +.+. T Consensus 441 evqR~arDi~ql~AEIl~~~~----~~etl~~m~~~elp~-~~ei~~~~~~L~n~~~r~~~ldIe~dsT~~~D~~~eK~~ 515 (663) T protein:vir:34 441 EVARFASDIQRLKAEVIAEHY----DVASILAQANAEFTF-DKELAPKAAELIKSRFSMYRVEVKPEAVSLQDFAALRNE 515 (663) T ss_pred HHHHHHHHHHHHHHHHHHHhc----CHHHHHHHhcCCCCc-ccchhHHHHHHhcCCCcceeeeeccCCCCcCChHHHHHH Confidence 999999999999999876321 111110001122222 1112222222222222000111110 1122222 1111 Q ss_pred HHHHHHHHHHHHHHHHHHhhhccCCCCCCC----CCcccCCCCCCcccccccCcccccccccCCCC Q lcl|NC_013644. 449 LICEQFDLDWEDVKEALEEAEYTKGLSDNT----DEEETAVNPDDPTQQMAEGATGSTESQLPENG 510 (510) Q Consensus 449 ~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (510) +.+...-.... ............+....- -..-.+-..+.. ..++...-+...+.-+ T Consensus 516 ~~E~l~~i~~~-~qq~~pl~~q~p~~~p~l~Ellk~~~~~f~~~~q----ie~ai~~~~~~~e~aa 576 (663) T protein:vir:34 516 KMEVLSGIASF-MQGVAPLAQQVPGSAPFLLQMLKWSVSGLRGSST----IEGVLDKAIAAAEEAQ 576 (663) T ss_pred HHHHHHHHHHH-HHHHHHHHHhhhhhHHHHHHHHHHHhhcCChhhh----HHHHHHHHHhhhHHHh Confidence 11111110000 000000000000000000 000000000000 1111122222222212 No 262 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=57.73 E-value=0.43 Score=22.60 Aligned_cols=337 Identities=9% Similarity=0.053 Sum_probs=125.8 Q ss_pred HHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCCcee-c- Q lcl|NC_013644. 17 KAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNPVEY-E- 94 (510) Q Consensus 17 ~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~-~- 94 (510) -.+ ..+...+.......-...-..+ .... .-+........|+..++-+..-|+.+ . T Consensus 1 M~i------------f~~~~~~~~~~~~~~~~~~~~~---------~~~~-~~~~~~~v~~~v~~Ia~~iA~lp~~~~~~ 58 (378) T protein:vir:94 1 MNL------------FGKVVSFSRGKLNNDTQRVTAW---------QNEA-VEYTSAFVTNIHNKIANEITKVEFNHVKY 58 (378) T ss_pred Cch------------hHHhHhhhhcccccCcceeeee---------ecch-hhhhhHHHHHHHHHHHHhHhhCceeeeee Confidence 111 1111111111110000000000 0000 00111234555666666666667642 1 Q ss_pred -cCc-------HHHHHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEE-EEECCCCceEEEEEcccceEEEEcCCC Q lcl|NC_013644. 95 -TEN-------EELKEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVY-ARTNAEDRLCFQVADSLNVFGVYNEYN 159 (510) Q Consensus 95 -~~d-------~~~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~-v~~d~~g~~~i~~~~p~~~~~~~d~~~ 159 (510) ..+ +....-|..+++ |. .......+....+..|.||++ ++.+..|.+... T Consensus 59 ~~~~~~~~~~~~~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~--------------- 123 (378) T protein:vir:94 59 KKSDVGSDTLISMAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSETGELLDL--------------- 123 (378) T ss_pred cccccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCCCcEEEE--------------- Confidence 111 111222334442 22 234445567888889999875 333333332110 Q ss_pred CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEE Q lcl|NC_013644. 160 ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYR 239 (510) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~ 239 (510) ++ ..++ ..|....+ +| T Consensus 124 -------~~----~~~~--------~~~~~~dv---------------------------------------------ih 139 (378) T protein:vir:94 124 -------LF----ANDK--------KEYKPEEL---------------------------------------------VR 139 (378) T ss_pred -------EE----ecCc--------EEechhce---------------------------------------------ee Confidence 00 0000 00111112 22 Q ss_pred ecCCCCCCCc-HHHHHHHHHHHHHHHHHHHHHHHHhccceeEEecCCCCc-h----hhhhHhhh-------cCeeeeccC Q lcl|NC_013644. 240 LSNNKQETTD-LKPIKALIDDYDLMNCFLSNNLQDFAEAIYVVSGFQGDD-L----SKLRQNVK-------SKKVVGTGS 306 (510) Q Consensus 240 ~~nn~~g~sd-~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~~~~~-~----~~~~~~~~-------~~~~~~~~~ 306 (510) +++. .+.+. ...+..+.++++..+.. . ....++...+.-..+ . +.+...++ .++++.+++ T Consensus 140 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~---~---~~~g~l~~~~~l~~~~~~~~~e~~~~~~~~~~~~~n~~~~~vl~~ 212 (378) T protein:vir:94 140 LTSP-FYINEDTSILDNALASIQTKLEQ---G---KLRGLLKINAFLDIDNTQEYREKALATIKNMQEGSSYNGLTPVDN 212 (378) T ss_pred ecCc-CCcccchhHHHHHHHHHHHHHhh---C---CcccceeeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceeccC Confidence 2211 00011 11122222333322211 0 111222222211111 1 11222111 124566666 Q ss_pred CCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 307 DGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVID 386 (510) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~ 386 (510) +.+++.++.+..... ...++.+.+.|+..-++|+.-..+. .+... ....+...|.-+++.|.. T Consensus 213 g~~~~~l~~~~~~~~-~~~~~~~~~~Ia~~fgvPp~~l~g~--~~e~~--------------~~~f~~~tl~P~~~~ie~ 275 (378) T protein:vir:94 213 KTEIVELKKDYSVLN-KDEIDLIKSELLTGYFMNENILLGT--ATQEQ--------------QIYFYNSTIIPLLIQLEK 275 (378) T ss_pred CceEEEccCChHHhh-HHHHHHHHHHHHHHhCCCHHHhcCC--chHHH--------------HHHHHHHHHHHHHHHHHH Confidence 666665554333323 3556777888888878876433221 11111 112333444444444444 Q ss_pred HHhhcc---------CCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHHHH Q lcl|NC_013644. 387 DINRRY---------TKAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQFD 455 (510) Q Consensus 387 ~~~~~~---------~~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~~e 455 (510) .+..+- -.......+.+.++.-+-.|..+.++.+.++..+|+++.-.++++++. ++.-+.. T Consensus 276 ~l~~~Ll~~~e~~~g~~~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~~t~NE~R~~~g~~p~~ggd~~-------- 347 (378) T protein:vir:94 276 ELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDVY-------- 347 (378) T ss_pred HHHhhcCChhHhhhhhhhcccceeEeecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCee-------- Confidence 433211 111112345555677778899999999999999999999888877643 2110000 Q ss_pred HHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCccccc Q lcl|NC_013644. 456 LDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQM 494 (510) Q Consensus 456 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (510) ....+..+-. ..+......+ +..+++++..+ T Consensus 348 ------~~~~n~~~~~-~~~~~~~~~~-~~~~~~e~~n~ 378 (378) T protein:vir:94 348 ------IANLNAVAVK-NLSDLQGNRK-DVTSTDETNNQ 378 (378) T ss_pred ------eecccccchh-cchhcccccC-CCCCCCCCCCC Confidence 0000000000 0000000000 00111111111 No 263 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=54.50 E-value=0.5 Score=22.22 Aligned_cols=336 Identities=8% Similarity=0.005 Sum_probs=128.6 Q ss_pred HHHHHHHhccCCcchhcccceeccccccccccccccceec--cchhHHHHHHHHhhhhcCCcee-cc-Cc----H----H Q lcl|NC_013644. 32 AETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIP--HGFFPEIVDQKTQYLLSNPVEY-ET-EN----E----E 99 (510) Q Consensus 32 ~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~--~n~~~~Iv~~~~~~l~g~p~~~-~~-~d----~----~ 99 (510) |-.+.+...-...... ....... .+ .+..+. .......|+..++-+.+-|+.+ .- .+ + . T Consensus 1 Mg~f~~~~~f~~~~~~------~~~~~~~--~~-~~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~ 71 (378) T protein:vir:93 1 MNLFGKVVSFSRGKLN------NDTQRVT--AW-QNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISM 71 (378) T ss_pred CccchhhhhhhccccC------CCcceee--ec-ccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEccccccccccccc Confidence 1111111000000000 0000000 00 000111 1223344555555566667653 11 11 0 1 Q ss_pred HHHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEEEEC-CCCceEEEEEcccceEEEEcCCCCceeEEEEEEEEE Q lcl|NC_013644. 100 LKEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYARTN-AEDRLCFQVADSLNVFGVYNEYNELQRICRHYITEI 172 (510) Q Consensus 100 ~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d-~~g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~ 172 (510) ....+..+++ |. .......++...+.+|.||+++..+ ..|++... +| ++ T Consensus 72 ~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~g~~~~l-------~~--~~--------------- 127 (378) T protein:vir:93 72 AGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDL-------LF--AD--------------- 127 (378) T ss_pred ccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEE-------Ee--cC--------------- Confidence 1123444442 22 2344555778899999999865443 22322111 00 00 Q ss_pred eeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecCCCCCCCcHHH Q lcl|NC_013644. 173 EKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSNNKQETTDLKP 252 (510) Q Consensus 173 ~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~~~ 252 (510) .. ..|....+. |+++.-.+...... T Consensus 128 ---~~-------~~~~~~dii---------------------------------------------h~r~~~~~~~~~s~ 152 (378) T protein:vir:93 128 ---DK-------KEYKTEELV---------------------------------------------RLTSPFYINEDTSI 152 (378) T ss_pred ---Ce-------eEeccceeE---------------------------------------------EecCccccchhhHH Confidence 00 001112222 22211111111222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccceeEEecC-CCCch--hh----hhHhh-------hcCeeeeccCCCceeEEeecCC Q lcl|NC_013644. 253 IKALIDDYDLMNCFLSNNLQDFAEAIYVVSGF-QGDDL--SK----LRQNV-------KSKKVVGTGSDGGLDVKTVTIP 318 (510) Q Consensus 253 v~~liD~~n~~~S~~~~~~~~~~~~~lv~~g~-~~~~~--~~----~~~~~-------~~~~~~~~~~~~~~~~~~~~~~ 318 (510) +..+..+++..++. +.+-.+++-. ...+. .. +.... ..++++.++++.+++.++.+.. T Consensus 153 l~~~~~~i~~~~~~--------~~~~g~l~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~g~~~~~l~~~~~ 224 (378) T protein:vir:93 153 LDNALASIQTKLEQ--------GKLRGLLKINAFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYS 224 (378) T ss_pred HHHHHHHHHHHHhc--------CcccceeeeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChh Confidence 33344444332221 2222233211 11111 11 11111 1224566666666555554433 Q ss_pred HHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC----- Q lcl|NC_013644. 319 TEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYT----- 393 (510) Q Consensus 319 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~----- 393 (510) ...+ ...+.+.+.|+..-++|+.-.. |+.|.. .....+...|.-.++.|...+..+-- T Consensus 225 ~~~~-~~~~~~~~~Ia~~fgVPp~~l~--g~~~e~--------------~~~~f~~~tl~P~~~~ie~~l~~kLl~~~er 287 (378) T protein:vir:93 225 VLNK-DEIDLIKSELLTGYFMNENILL--GTATQE--------------QQIYFYNSTIIPLLIQLEKELTYKLISTNRR 287 (378) T ss_pred hhhH-HHHHHHHHHHHHHhCCCHHHhc--CCcHHH--------------HHHHHHHHHHHHHHHHHHHHHHhhcCChhHh Confidence 3333 4556778888888888874332 211111 11234455555555555554442211 Q ss_pred ----CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_013644. 394 ----KAFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAE 469 (510) Q Consensus 394 ----~~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~ 469 (510) .......+.+.++.-+-.|..+.++.+.++..+|+++.-.++++++.-.-+.-.. .....+..+ T Consensus 288 ~~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~------------~~~~~n~~~ 355 (378) T protein:vir:93 288 RVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDV------------YIANLNAVA 355 (378) T ss_pred hhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCe------------eeecccccc Confidence 1111223455556777889999999999999999999988888765321110000 000000000 Q ss_pred ccCCCCCCCCCcccCCCCCCccccc Q lcl|NC_013644. 470 YTKGLSDNTDEEETAVNPDDPTQQM 494 (510) Q Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (510) .. ..+.... .+.+..+++++..+ T Consensus 356 ~~-~~~~~~~-~~~~~~~~~e~~n~ 378 (378) T protein:vir:93 356 VK-NLSDLQG-SRKDVTSTDETNNQ 378 (378) T ss_pred cc-chhhhcC-ccCCCCCCCCCCCC Confidence 00 0000000 00001111111111 No 264 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=43.15 E-value=0.86 Score=20.95 Aligned_cols=311 Identities=8% Similarity=0.028 Sum_probs=108.2 Q ss_pred hhhhHHHHHHHHHHhccCCc------chhcccceecccccccccccccc----ceeccchhHHHHHHHHhhhhcCCceec Q lcl|NC_013644. 25 KSSSKREAETGIRYYNHEND------IMNNRIFYVDDEGILREDKYASN----VRIPHGFFPEIVDQKTQYLLSNPVEYE 94 (510) Q Consensus 25 ~~~~~~~~~~~~~YY~g~~~------i~~~~~~~~~~~~~~~~~~~~~~----~ki~~n~~~~Iv~~~~~~l~g~p~~~~ 94 (510) .++++++.. .+-..+... ....+...... ...-..+.|. .+-..+|...+ ..+-....|+.+. T Consensus 1 m~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~fg~p~~~~~~~~~~~~~~~~---~~~~~~~~pi~~~ 73 (368) T protein:vir:79 1 MSRNKTRRA--ARAASAHVRTANTDAPTEHHTDRAAQ--AEVFSFGDPVEVLDRRELLDYVECM---RMGQWYEPPMPWD 73 (368) T ss_pred CCccccccc--hhccCcccccccccCcchhhccccCc--eEEEEcCCceeecchhhHHHHHHHH---hccchhccCcCHH Confidence 122211110 000100000 00000000000 0000000010 00011122111 1121222344332 Q ss_pred cC------c---HH---H-HHHHHHHhc-cC-H-HHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcC Q lcl|NC_013644. 95 TE------N---EE---L-KEYLAEYYN-SE-F-QVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNE 157 (510) Q Consensus 95 ~~------d---~~---~-~~~l~~~~~-n~-~-~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~ 157 (510) += + .. . ...+.-... |. + ...+.+++.+.+.+|.||+.+..+..|++ .+.+++|..+-..-+. T Consensus 74 ~la~~~~~~~~h~~~~~~~~n~l~l~~~Pn~~~t~~~f~~l~~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v~~~~~~ 153 (368) T protein:vir:79 74 GLARSFRAAAHHSSAVYVKRNILVSTFIPHPLLSRATFERLVLDWQVFGNAYLERRENVLGGTIRLDTPLAKYVRRGLDL 153 (368) T ss_pred HHHHHHhhccccchhhhhhcchhhhhcCCCcCCCHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEeCcccceeeccC Confidence 10 0 00 0 001111122 22 1 13345677888999999999988888874 5667777665432221 Q ss_pred CCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccE Q lcl|NC_013644. 158 YNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPF 237 (510) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 237 (510) . ++|++ .. .+. . ..|.++. | T Consensus 154 ~-------~~~~~-~~-~~~--~----~~~~~~d---------------------------------------------I 173 (368) T protein:vir:79 154 N-------TYFFV-QN-WQQ--P----YTFAAGS---------------------------------------------V 173 (368) T ss_pred C-------EEEEE-ec-CCe--E----EEEcccc---------------------------------------------E Confidence 1 01111 00 000 0 0111222 3 Q ss_pred EEecC-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeE--EecCCCCc--hhhhhHhhhc-------Cee Q lcl|NC_013644. 238 YRLSN-----NKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYV--VSGFQGDD--LSKLRQNVKS-------KKV 301 (510) Q Consensus 238 v~~~n-----n~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv--~~g~~~~~--~~~~~~~~~~-------~~~ 301 (510) +|+++ .-.|.|.+......++.-+.+-.-..+.++-.+.|-.+ +.|...++ .+.++..++. +++ T Consensus 174 ihir~~~~~~~~yGlsp~~~a~~si~l~~aa~~~~~~~~~NGa~~~gil~~~~~~l~~e~~~~lk~~~~~~~G~~N~g~~ 253 (368) T protein:vir:79 174 FHLQEPDINQEVYGLPEYLSALNATWLNESATLFRRRYYKNGSHAGFILYMTDAAQKQEDVDTLREAMKSAKGPGNFRNL 253 (368) T ss_pred EEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHHHHHHHHHhcCCcccCce Confidence 34332 12466666554444443222211122333434445444 34432222 1223222221 234 Q ss_pred eeccC---CCceeEEeecC--CHHHHHHHHHHHHHHHHHHhCCcccccccc-------CcccHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 302 VGTGS---DGGLDVKTVTI--PTEGRKTKMEIDKENIYKFGMAFDSTQVGD-------GNITNIVIKARYTLLNMKANKT 369 (510) Q Consensus 302 ~~~~~---~~~~~~~~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-------g~~Sg~Ai~~~~~~l~~k~~~k 369 (510) +.+.. ++++++..... ....+.+..+...++|...-++|+.-.+.. +|+...... T Consensus 254 ~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~e~~~~~------------- 320 (368) T protein:vir:79 254 FMYAPNGKKDGIQLLPVSEVAAKDEFWNIKNVTRDDQLAAHRVPPQLMGIIPNNTGGFGDVEKAAMV------------- 320 (368) T ss_pred eEecCCCCccceeEEEcCCCHHHHHHHHHHHHhHHHHHHHhCCCHHHccccCCCCCccccHHHHHHH------------- Confidence 44432 34566655443 345566777888889999888887533211 122222221 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCC--CCHHHHHHHHHHHHhc Q lcl|NC_013644. 370 EARLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVM--VNETDIVNDEKTEAET 426 (510) Q Consensus 370 ~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p--~d~~e~~~~~~~~~~~ 426 (510) .+...|.-+++.+.++....+. + .+.|++..- .|.+..++ ....++ T Consensus 321 --f~~~~l~Pl~~~ie~ln~~l~~------e-~~rF~~~~l~~~D~~a~a~--~~~rsa 368 (368) T protein:vir:79 321 --FARNEVKPLQDRLLAINDWIGD------E-VVRFAPYALGGHDQPAAAP--GGQRSA 368 (368) T ss_pred --HHHHHHHHHHHHHHHHHhccCc------c-eeeechhHhhcccccccCC--cccccC Confidence 1222222222222222111110 1 245543221 11111111 111111 No 265 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=35.88 E-value=1.2 Score=20.14 Aligned_cols=298 Identities=9% Similarity=-0.003 Sum_probs=103.5 Q ss_pred hhhhHHHHHHHHHHhccCCcchhcccceecccc--cc--ccccccccceeccchhHHHHHHHHhhhhcC----Cceecc- Q lcl|NC_013644. 25 KSSSKREAETGIRYYNHENDIMNNRIFYVDDEG--IL--REDKYASNVRIPHGFFPEIVDQKTQYLLSN----PVEYET- 95 (510) Q Consensus 25 ~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~--~~--~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~----p~~~~~- 95 (510) .++++. ...+.- ........... .. .-....| ..+.+- ..+.+-.--+..|+ |+.+.. T Consensus 1 ~~~~~~---------~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~p--~~v~~~-~~~~~~~~~~~~~~~~~pp~~~~~l 67 (351) T protein:vir:78 1 MSKRRS---------RAPRTF-AAAPNPSAGSAAPARAEVFTFDDP--TPVMNR-AEILDYVECWSNGEWFEPPVSFAGL 67 (351) T ss_pred CCCCCC---------CCCCCC-CCCCchhhhhcccceeEEEEcCCc--eeecCc-chhhhhhhhhccCceecCCCCHHHH Confidence 111100 000000 00000000000 00 0000000 000000 00111111111122 222110 Q ss_pred -----CcHHHHH-------HH-HHHhccCH--HHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCC Q lcl|NC_013644. 96 -----ENEELKE-------YL-AEYYNSEF--QVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYN 159 (510) Q Consensus 96 -----~d~~~~~-------~l-~~~~~n~~--~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~ 159 (510) .+.-... .| ..+.-|.. ...+.+++.+.+.+|.||+.+..+..|++ .+..++|..+.+..+.. T Consensus 68 a~~~~~~~~h~~~l~~k~n~l~~~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~v~~~~~~~- 146 (351) T protein:vir:78 68 AKSFRASTHHSSALFFKANVLASTFRPHRWLSRHAFERWALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFS- 146 (351) T ss_pred HHHHhhhHhhhhhhhhhhhHHhhcccCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCCEEEEEEecCcceEEeeeCC- Confidence 0000000 11 11111221 23356678888999999999988888864 57777777665543321 Q ss_pred CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEE Q lcl|NC_013644. 160 ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYR 239 (510) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~ 239 (510) ++|+.. ..+. ...|.++.+ +| T Consensus 147 ------~~~~~~--~~~~------~~~~~~~eV---------------------------------------------ih 167 (351) T protein:vir:78 147 ------GFVYVN--GWQE------RHEFAPDSV---------------------------------------------FQ 167 (351) T ss_pred ------eEEEEe--cCCe------EEEEccccE---------------------------------------------EE Confidence 111110 0000 001222222 33 Q ss_pred ecC-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeE--EecCCCCc--hhhhhHhhhc-------Ceeee Q lcl|NC_013644. 240 LSN-----NKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEAIYV--VSGFQGDD--LSKLRQNVKS-------KKVVG 303 (510) Q Consensus 240 ~~n-----n~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~~lv--~~g~~~~~--~~~~~~~~~~-------~~~~~ 303 (510) +++ .-.|.|.+......+..-+.+-.-..+.++-.+.|-.+ .+|...++ .+.++..++. +.++. T Consensus 168 ir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pggIl~~~~~~ls~e~~~~lr~~~~~~~G~~N~~~~~v 247 (351) T protein:vir:78 168 LVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFM 247 (351) T ss_pred EcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCcccccceee Confidence 321 12466666544443332222111112333334444444 34432222 1222222221 22333 Q ss_pred ccC---CCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCccccccc-------cCcccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 304 TGS---DGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDSTQVG-------DGNITNIVIKARYTLLNMKANKTEA 371 (510) Q Consensus 304 ~~~---~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-------~g~~Sg~Ai~~~~~~l~~k~~~k~~ 371 (510) +.. ++++++.... .....+.+..+..+++|...-++|+.-.+. .+++...+..+ T Consensus 248 ~~~~g~~~g~k~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f-------------- 313 (351) T protein:vir:78 248 YAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARVF-------------- 313 (351) T ss_pred ecCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHH-------------- Confidence 332 2345555433 334556777788888999988898743221 12222222221 Q ss_pred HHHHHHHHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCHHHHH Q lcl|NC_013644. 372 RLRALLEWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNETDIV 417 (510) Q Consensus 372 ~~~~~l~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~~e~~ 417 (510) +...|.-+++.|.++....+. + -|.|++.---.-.+.+ T Consensus 314 -~~~~l~P~~~~iee~n~~l~~------~-~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 314 -GRNEIRPLQARFAELNDWLGD------E-VVRFDDYEIPPAPVAA 351 (351) T ss_pred -HHHHHHHHHHHHHHHHhhcCc------c-ceecChhhhccccccC Confidence 222222222222222111111 1 1556533222111111 No 266 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=31.80 E-value=1.5 Score=19.67 Aligned_cols=335 Identities=8% Similarity=0.039 Sum_probs=123.8 Q ss_pred HHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHHHHHhhhhcCCceec-- Q lcl|NC_013644. 17 KAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVDQKTQYLLSNPVEYE-- 94 (510) Q Consensus 17 ~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~-- 94 (510) -. ...++..+.+.+.......-.. .... ..-.........|+..++-+..-|+.+- T Consensus 1 M~------------~f~k~~~~~~~~~~~~~~~~~~---------~~~~-~~~~~~~~v~~~v~~ia~~iA~lp~~~~~~ 58 (378) T protein:vir:85 1 MN------------LFGKVVSFSRGKLNNDTQRVTA---------WQNE-AVEYTSAFVTNIHNKIANEITKVEFNHVKY 58 (378) T ss_pred Cc------------hhhhhhhhhhcccccCCcceee---------eecc-chhhhhHHHHHHHHHHHHhHhhCceeEEEE Confidence 00 0111111222111100000000 0000 0001122333445555555555666531 Q ss_pred -cC----c---HHHHHHHHHHhc---cC---HHHHHHHHHHHHHhcCeEEEEE-EECCCCceEEEEEcccceEEEEcCCC Q lcl|NC_013644. 95 -TE----N---EELKEYLAEYYN---SE---FQVVLQELVEGSSQKGFEYVYA-RTNAEDRLCFQVADSLNVFGVYNEYN 159 (510) Q Consensus 95 -~~----d---~~~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v-~~d~~g~~~i~~~~p~~~~~~~d~~~ 159 (510) .+ + +....-|..+++ |. .......+....+..|.||+++ +.+..|.+...+ |. T Consensus 59 ~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~---------~~--- 126 (378) T protein:vir:85 59 KKSDVGSDTLISMAGSDLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSETGELLDLL---------FA--- 126 (378) T ss_pred eccccccccccccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCCCceEEEEE---------ec--- Confidence 11 0 112223444442 22 2334445677888899999752 333333221111 00 Q ss_pred CceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEE Q lcl|NC_013644. 160 ELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYR 239 (510) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~ 239 (510) .++ ..|.+..+.+++ T Consensus 127 --------------~~~--------~~~~~~dvih~~------------------------------------------- 141 (378) T protein:vir:85 127 --------------NDK--------KEYKPEELVRLV------------------------------------------- 141 (378) T ss_pred --------------CCC--------EEEcccceEEEe------------------------------------------- Confidence 000 011122222221 Q ss_pred ecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccc--eeEEecCCCCch--hh----hhHhh-------hcCeeeec Q lcl|NC_013644. 240 LSNNKQETTDLKPIKALIDDYDLMNCFLSNNLQDFAEA--IYVVSGFQGDDL--SK----LRQNV-------KSKKVVGT 304 (510) Q Consensus 240 ~~nn~~g~sd~~~v~~liD~~n~~~S~~~~~~~~~~~~--~lv~~g~~~~~~--~~----~~~~~-------~~~~~~~~ 304 (510) +.-...+....+..+.++++..+ .. +.+ ++...+. ..+. .. +...+ ..++++.+ T Consensus 142 --~~~~~~~~~~~~~~a~~~~~~~~-------~~-~~~~g~l~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl 210 (378) T protein:vir:85 142 --SPFYINEDTSILDNALASIQTKL-------EQ-GKLRGLLKINAF-LDIDNTQEYREKALATIKNMQEGSSYNGLTPV 210 (378) T ss_pred --cCcCccchhhHHHHHHHHHHHHH-------hc-CCcceEEEeCCc-CCHHHHHHHHHHHHHHHHHhhcccccccceec Confidence 11000011111222333332221 11 222 2222221 1111 11 11111 12345666 Q ss_pred cCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 305 GSDGGLDVKTVTIPTEGRKTKMEIDKENIYKFGMAFDSTQVGDGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLV 384 (510) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i 384 (510) +++.+++.++.+.....+ ..++.+++.|+..-++|+.-..+.. +... ....+...|.-.++.| T Consensus 211 ~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~~s~--~e~~--------------~~~f~~~tL~P~~~~i 273 (378) T protein:vir:85 211 DNKTEIVELKKDYSVLNK-DEIELIKSELLTGYFMNENILLGTA--TQEQ--------------QIYFYNSTIIPLLIQL 273 (378) T ss_pred CCCceEEeccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcCCc--hHHH--------------HHHHHHHHHHHHHHHH Confidence 666666655544333333 4456777888888888875432211 1111 1123444455544444 Q ss_pred HHHHhhccCC---------ccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCchHHHHHhCCC--CCcHHHHHHHHHH Q lcl|NC_013644. 385 IDDINRRYTK---------AFDPTEVSFTFTREVMVNETDIVNDEKTEAETRKIILESILQVAPR--LDDDNVLRLICEQ 453 (510) Q Consensus 385 ~~~~~~~~~~---------~~~~~~v~i~f~~~~p~d~~e~~~~~~~~~~~g~iS~et~~~~~~~--v~d~e~~~~~~e~ 453 (510) ...+..+--. .....++.+.+..-+-.|..+.++.+.++..+|+++.-.++++++. ++.-+.. T Consensus 274 e~~l~~kLl~~~er~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~gGD~~------ 347 (378) T protein:vir:85 274 EKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDIY------ 347 (378) T ss_pred HHHHHhhcCChhhhhhhhhccccceeeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeE------ Confidence 4444322110 0111223344456667899999999999999999999888887643 2211000 Q ss_pred HHHHHHHHHHHHHhhhccCCCCCCCCCcccCCCCCCccccc Q lcl|NC_013644. 454 FDLDWEDVKEALEEAEYTKGLSDNTDEEETAVNPDDPTQQM 494 (510) Q Consensus 454 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (510) ....+..+.. ..+.....++.. .+++++..+ T Consensus 348 --------~~~~N~~~~~-~~~~~~~~~~~~-~~~~e~~n~ 378 (378) T protein:vir:85 348 --------IANLNAVAVK-NLSDLQGSRKDV-ASTDETNNQ 378 (378) T ss_pred --------eecccccccc-cchhhcCccCCC-CCCCCCCCC Confidence 0000000000 000000000000 011111111 No 267 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=24.47 E-value=2.2 Score=18.74 Aligned_cols=296 Identities=7% Similarity=-0.021 Sum_probs=102.4 Q ss_pred HHHHHHHhccCCcchhcc----cceeccccccccccccc----cceeccchhHHHHHHHHhhhhcCCceecc------Cc Q lcl|NC_013644. 32 AETGIRYYNHENDIMNNR----IFYVDDEGILREDKYAS----NVRIPHGFFPEIVDQKTQYLLSNPVEYET------EN 97 (510) Q Consensus 32 ~~~~~~YY~g~~~i~~~~----~~~~~~~~~~~~~~~~~----~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~------~d 97 (510) |.+.++.-.-...-.... ...........-....| ..+-..+|.... ..+....-|+.+.. .+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~y~~~~---~~~~~~~pp~~~~~la~~~~~~ 77 (350) T protein:vir:11 1 MSKRRSHRRQQPVTVQSAQEGEFIPRQGGRAEAFTFGDPMPVLDGRGILDYLECW---PNGRWYEPPLSMEGLAKSVGSS 77 (350) T ss_pred CCccccCCCcCccccCCcchhhhccccccceEEEEeCCceeecCcchhhHHHHHh---hcCccccCCCCHHHHHHHHhhh Confidence 000000000000000000 00000000000000111 011011122111 11111111232211 00 Q ss_pred HH--------HHHHHHHHhccC-H-HHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCceeEEE Q lcl|NC_013644. 98 EE--------LKEYLAEYYNSE-F-QVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNELQRICR 166 (510) Q Consensus 98 ~~--------~~~~l~~~~~n~-~-~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~ 166 (510) .. .+-....+.-|. + ...+.+++.+.+.+|.||+.+..+..|++ .+.+++|..+-+.-+.. + T Consensus 78 ~~h~~~l~~k~n~l~~~~~Pn~~~t~~~f~~~v~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~~vr~~~~~~-------~ 150 (350) T protein:vir:11 78 VYLQSGLKFKRNMLAKTFIPHRLLSRATFEQFSLDWLTFGSAYLEQPRSRLGTRMPLQAPLAKYMRRGTDLE-------T 150 (350) T ss_pred hhhccchhhhhhhhhhcccCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcCCCCEEEEEEeCCceeEeeecCC-------e Confidence 00 000011111132 2 23356677888999999999999888864 57777776665432211 0 Q ss_pred EEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecC---- Q lcl|NC_013644. 167 HYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSN---- 242 (510) Q Consensus 167 ~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n---- 242 (510) +|.+. .++.. ..|.++.+ +|+++ T Consensus 151 ~~~~~--~~~~~------~~~~~~eV---------------------------------------------ihir~~~~~ 177 (350) T protein:vir:11 151 FYQVR--SWKDE------HEFEKGSV---------------------------------------------IQLREADIN 177 (350) T ss_pred EEEEe--eCCeE------EEECcccE---------------------------------------------EEeCCCCCC Confidence 11111 11100 01222222 33332 Q ss_pred -CCCCCCcHHHHHHHHHHHHHHHHHHH-HHHHHhccceeE--EecCCCCc--hhhhhHhhhc-------CeeeeccCC-- Q lcl|NC_013644. 243 -NKQETTDLKPIKALIDDYDLMNCFLS-NNLQDFAEAIYV--VSGFQGDD--LSKLRQNVKS-------KKVVGTGSD-- 307 (510) Q Consensus 243 -n~~g~sd~~~v~~liD~~n~~~S~~~-~~~~~~~~~~lv--~~g~~~~~--~~~~~~~~~~-------~~~~~~~~~-- 307 (510) .-.|.|.+......+.. +.....+. +.+.-.+.|-.+ ..|...++ .+.+...++. ++++.+..+ T Consensus 178 ~~~yGls~~~~a~~si~l-~~~a~~~~~~~f~NGa~~~gil~~~~~~ls~e~~~~l~~~~~~~~G~~N~~~~~v~~~~g~ 256 (350) T protein:vir:11 178 QEIYGVPEWFCALQSALL-NESATLFRRKYYNNGSHAGFILYMTDAAQNEEDIDALRTALKTAKGPGNFRNLFVYAPNGK 256 (350) T ss_pred CCcccccHHHHHHHHHHH-HHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeeecCCCC Confidence 12366666554433332 22222222 222333344444 44533322 2222222221 123333332 Q ss_pred -CceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCcccccc----c---cCcccHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 308 -GGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDSTQV----G---DGNITNIVIKARYTLLNMKANKTEARLRALL 377 (510) Q Consensus 308 -~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~----~---~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l 377 (510) .++++.-.. .....+.+..+...++|...-++|+.-.+ + ++++...+..+... .| T Consensus 257 ~~g~~~~pl~~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f~~~---------------~L 321 (350) T protein:vir:11 257 KEGIQLIPVSEVAAKDEFGSIKNISRDDQLAGLRVYPQLMGVVPQNAGGFGSISDAAAVWASL---------------EL 321 (350) T ss_pred ccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcCCHHHHHHHHHHH---------------HH Confidence 345555433 23456777778888899998888875222 1 22232222222111 12 Q ss_pred HHHHHHHHHHHhhccCCccccceeeEEeCCCCCCCH Q lcl|NC_013644. 378 EWMNKLVIDDINRRYTKAFDPTEVSFTFTREVMVNE 413 (510) Q Consensus 378 ~~~~~~i~~~~~~~~~~~~~~~~v~i~f~~~~p~d~ 413 (510) .-+++.+.++....+. + .+.|.+....++ T Consensus 322 ~P~~~~ie~ln~~l~~------~-~~~F~~~~~~~l 350 (350) T protein:vir:11 322 APMQTRLQQVNEMIGE------E-VVRFAQFDAPGL 350 (350) T ss_pred HHHHHHHHHHHhhcCc------c-ccccCcccccCC Confidence 2222222221111110 0 123433222222 No 268 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=24.07 E-value=2.2 Score=18.68 Aligned_cols=288 Identities=9% Similarity=0.002 Sum_probs=98.3 Q ss_pred ccCCcchhcccceeccccc-cccccccc----cceeccchhHHHHHHHHhhhhcCCceecc------CcH---HHHH--- Q lcl|NC_013644. 40 NHENDIMNNRIFYVDDEGI-LREDKYAS----NVRIPHGFFPEIVDQKTQYLLSNPVEYET------ENE---ELKE--- 102 (510) Q Consensus 40 ~g~~~i~~~~~~~~~~~~~-~~~~~~~~----~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~------~d~---~~~~--- 102 (510) ..++.-............. ..-....| +.+-..++....- .+-.+--|+.+.. .+. .... T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~---~~~~~~pp~~~~~la~l~~a~~~h~s~i~~k~ 77 (340) T protein:vir:98 1 MSKRKPRKAVAMTASAPQKMEAFTFGEPVPVLDKRDILDYVECIS---NGKWYEPPVSFSGLAKSLRSAVHHSSPIYVKR 77 (340) T ss_pred CCCCCCCccccccccCccceeEEEcCCceeecCcchhhhhhhhhh---cCceecCCCCHHHHHHHHHhccccchhhhhhh Confidence 1111000000000000000 00000000 0000011111000 0000001222110 000 0000 Q ss_pred -HHHHHhc-cCH--HHHHHHHHHHHHhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCc Q lcl|NC_013644. 103 -YLAEYYN-SEF--QVVLQELVEGSSQKGFEYVYARTNAEDRL-CFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGE 177 (510) Q Consensus 103 -~l~~~~~-n~~--~~~~~e~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 177 (510) .|...+. |.. ...+..++.+.+.+|.||+.+..+..|++ .+..++|..+....+. + ++|.+.. ++. T Consensus 78 n~l~~~~~Pn~~lt~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~~~--~-----~~~~~~~--~~~ 148 (340) T protein:vir:98 78 NVLASTYIPHPLLSRQDFSRFALDYLVFGNAFLEQRHSVTGQLIKLLTSPAKYTRRGVDD--S-----VFWFVEN--FTQ 148 (340) T ss_pred hHHhhccCCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceEEEcccC--c-----EEEEEec--CCe Confidence 1111111 221 13345677788899999999988888874 4566666555432221 1 1111110 110 Q ss_pred eeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccccccccCCcccEEEecC-----CCCCCCcHHH Q lcl|NC_013644. 178 TVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENESLLQRSYGQIPFYRLSN-----NKQETTDLKP 252 (510) Q Consensus 178 ~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----n~~g~sd~~~ 252 (510) . ..|.++.+ +|+++ .-.|.|.+.. T Consensus 149 --~----~~~~~~eV---------------------------------------------iHir~~~~~~~~~Gls~~~~ 177 (340) T protein:vir:98 149 --P----HEFAPDTV---------------------------------------------FHLLEPDINQEIYGLPEYLS 177 (340) T ss_pred --E----EEEccccE---------------------------------------------EEEcCCCCCCCcccccHHHH Confidence 0 01222222 33321 1135555554 Q ss_pred HHHHHHHHHHHHHHHH-HHHHHhccce--eEEecCCCCc--hhhhhHhhhc-------CeeeeccC---CCceeEEeec- Q lcl|NC_013644. 253 IKALIDDYDLMNCFLS-NNLQDFAEAI--YVVSGFQGDD--LSKLRQNVKS-------KKVVGTGS---DGGLDVKTVT- 316 (510) Q Consensus 253 v~~liD~~n~~~S~~~-~~~~~~~~~~--lv~~g~~~~~--~~~~~~~~~~-------~~~~~~~~---~~~~~~~~~~- 316 (510) ...-++. +.....+. +.++-.+.|- +.++|...++ .+.++..++. ++++.+.. ++++++.-.. T Consensus 178 a~~si~l-~~aa~~~~~~~f~NGa~pg~il~~~~~~ls~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~g~~~~pls~ 256 (340) T protein:vir:98 178 ALNSAWL-NESATLFRRKYYQNGAHAGYIMYVTDPAQSATDVESLRDAMRNSKGLGNFKNLFFYSPNGKPDGIKIVPLSE 256 (340) T ss_pred HHHHHHH-HHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCC Confidence 3333221 22222221 2222233343 4445543332 1222222221 22343432 2345554433 Q ss_pred -CCHHHHHHHHHHHHHHHHHHhCCcccccc----cc---CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 317 -IPTEGRKTKMEIDKENIYKFGMAFDSTQV----GD---GNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDI 388 (510) Q Consensus 317 -~~~~~~~~~~~~l~~~i~~~s~~p~~~~~----~~---g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~ 388 (510) .....+.+..+..+++|...-++|+.-.+ ++ |++...+..+ +...|.-+++.|.++. T Consensus 257 ~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f---------------~~~~l~Pl~~~iee~n 321 (340) T protein:vir:98 257 VATKDDFFNIKKASAADLMDAHRVPFQLMGGKPENIGSLGDVEKVAKVF---------------VRNELSPLQDRFREVN 321 (340) T ss_pred ChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHHHH---------------HHHHHHHHHHHHHHHH Confidence 34456777888888899998888875322 11 2222222221 1112222222222211 Q ss_pred hhccCCccccceeeEEeCCCC-CCCH Q lcl|NC_013644. 389 NRRYTKAFDPTEVSFTFTREV-MVNE 413 (510) Q Consensus 389 ~~~~~~~~~~~~v~i~f~~~~-p~d~ 413 (510) ...+. + -+.|++.. .+.+ T Consensus 322 ~~L~~------e-~~rF~~~~l~~~d 340 (340) T protein:vir:98 322 DWLGM------E-VIRFKEYTLDNPE 340 (340) T ss_pred hcccc------c-ccccCccccccCC Confidence 11111 1 14454322 2222 No 269 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=23.31 E-value=2.3 Score=18.58 Aligned_cols=393 Identities=11% Similarity=0.060 Sum_probs=153.6 Q ss_pred CCCccCCChhhhHHHHHHHHHhhhhhhhHHHHHHHHHHhccCCcchhcccceeccccccccccccccceeccchhHHHHH Q lcl|NC_013644. 1 MEALLSEDVKIIANALKAAIDKDRKSSSKREAETGIRYYNHENDIMNNRIFYVDDEGILREDKYASNVRIPHGFFPEIVD 80 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~ 80 (510) ||....+ .+.....-..+|+ +|+.+..+++-.. -...||+ T Consensus 58 ~q~~y~~-~e~~~~~~~eLI~---------~YR~ma~~pEvd~------------------------------Av~eIVn 97 (524) T protein:vir:10 58 MQQMFGS-NEPEVKNTRELID---------TYRNLMNNYEVDN------------------------------AVQEIVS 97 (524) T ss_pred hhhhhhc-ccchhhhHHHHHH---------HHHHHhhccchhh------------------------------HHHHhhc Confidence 1111111 0111111122222 2222323332222 1233333 Q ss_pred HHHhh-hhcCCceeccCc----HHHHH----HHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECC----CCceEEEEE Q lcl|NC_013644. 81 QKTQY-LLSNPVEYETEN----EELKE----YLAEYYN-SEFQVVLQELVEGSSQKGFEYVYARTNA----EDRLCFQVA 146 (510) Q Consensus 81 ~~~~~-l~g~p~~~~~~d----~~~~~----~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~----~g~~~i~~~ 146 (510) ..+-+ -...||.+..++ +...+ ..+.+++ =+|+.+.++..+.+.+.|+.|.+.-+|. +|-..+..+ T Consensus 98 eaiv~d~~~~pV~l~Ld~~~~s~siK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~~~pk~GI~Elr~l 177 (524) T protein:vir:10 98 DAIVYEDDKEVVALNLDGTDFSQSIKDKILAEFSEVLNLLNFQRKGTDHFQRWYVDSRIFFHKIINPKKMKDGVQELRRL 177 (524) T ss_pred ceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeceEEEEEEeeCCCccccceeeeee Confidence 32222 234555554443 22222 2233332 2678888999999999999999877763 366778889 Q ss_pred cccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccccccccccccccccccccc Q lcl|NC_013644. 147 DSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEAEPINPRPHVLAVDSENES 226 (510) Q Consensus 147 ~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (510) ||+.+-.|.--..+.... ++.+. .+..+-+|.++.- .|...+ .. .. T Consensus 178 DPr~i~~vr~i~~~~~~~--~~vi~--------~~~e~f~Y~~~~~-~~~~~~-~~--------~~-------------- 223 (524) T protein:vir:10 178 DPRQVQYIREIVTRMEDG--VKIVD--------GYREFFVYDTGHE-SYCADG-RI--------YS-------------- 223 (524) T ss_pred CCccceeeeeecccCccc--chhhc--------chhhheeecCCCc-ccccCc-ce--------ec-------------- Confidence 998885554211111000 00000 0011112222100 000000 00 00 Q ss_pred cccccCCccc---EEEecCCC---CCCCcHHHHHHHHHHHHH--HHHHHHHHHHHhccceeEE----ec-CCCCchhh-- Q lcl|NC_013644. 227 LLQRSYGQIP---FYRLSNNK---QETTDLKPIKALIDDYDL--MNCFLSNNLQDFAEAIYVV----SG-FQGDDLSK-- 291 (510) Q Consensus 227 ~~~~~~g~iP---vv~~~nn~---~g~sd~~~v~~liD~~n~--~~S~~~~~~~~~~~~~lv~----~g-~~~~~~~~-- 291 (510) ++.-=+|| |++....- .+--.+.-+...|..+|. ++-|.+-.-+..+.|=.=+ .| +...-... T Consensus 224 --~~~~ikI~~dAIvy~~SGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnlPk~KAeqYl 301 (524) T protein:vir:10 224 --AGTKVKIPRAAVVYAHSGLLDCCGKNIIGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRVFYIDTGNMPSRKAAAQM 301 (524) T ss_pred --CCcceecchhheeeeccCcccCCCCceeccchHhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHH Confidence 00000233 23222110 011122333444555553 3444444444444442211 11 11111111 Q ss_pred --hhHhhhcCeeeec-----------------------cCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCcccc- Q lcl|NC_013644. 292 --LRQNVKSKKVVGT-----------------------GSDGGLDVKTVT--IPTEGRKTKMEIDKENIYKFGMAFDST- 343 (510) Q Consensus 292 --~~~~~~~~~~~~~-----------------------~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~- 343 (510) .+...+...+... +++.+.+.-|.+ .+. ....-+.-+++.+|+.-.+|-.- T Consensus 302 ~~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnl-gem~DV~YF~kkLy~aLnVP~sRl 380 (524) T protein:vir:10 302 QHIMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTMPGATGM-SDMDDVLYFRTALYRALRIPESRI 380 (524) T ss_pred HHHHHhcCceeEEeccCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCc-ChHHHHHHHHHHHHHHhCCCchhc Confidence 1111111111111 112223333333 222 22344555666777766777421 Q ss_pred --cc--c--cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCcccc--ceeeEEeCCCCCCCHHH Q lcl|NC_013644. 344 --QV--G--DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFDP--TEVSFTFTREVMVNETD 415 (510) Q Consensus 344 --~~--~--~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~~--~~v~i~f~~~~p~d~~e 415 (510) +. + +|..|. |.......-.-+.+-+..|...+.++++.=+-+-++....+|+. ..+.+.|...-.-.+.. T Consensus 381 ~~e~~~~f~~gr~~E--ItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElK 458 (524) T protein:vir:10 381 PSESNSGVMFDAGTA--ITRDELKFAKWIRQLQNKFEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYFSEMK 458 (524) T ss_pred cCCCCccccccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHH Confidence 11 1 233332 22223333344556666677777776665443334444445543 35777786555544443 Q ss_pred HHHH-------HHHHHh--cCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCC Q lcl|NC_013644. 416 IVND-------EKTEAE--TRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEALEEAEYTKGLSDNTDE 480 (510) Q Consensus 416 ~~~~-------~~~~~~--~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (510) .++. +..+.. +..+|.+++.+.+=-.+|+|..++....+++ .+..-+++......+. T Consensus 459 e~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E--------~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 459 DAEIMERRINMLTMAEPFIGKYISHQTAMKDFLQMTDEEINQEAKQIEEE--------SKEARFQNPDEEEEDF 524 (524) T ss_pred HHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHH--------hhcCCCCCCChhhhcC Confidence 3333 222221 3346999999876556665533221111111 1111111110111111 No 270 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=20.10 E-value=2.8 Score=18.10 Aligned_cols=398 Identities=11% Similarity=0.054 Sum_probs=155.5 Q ss_pred CCCccCCChhhhHHHH--------------HHHHHhh---hh-hhhHHHHHHHHHHhccCCcchhcccceeccccccccc Q lcl|NC_013644. 1 MEALLSEDVKIIANAL--------------KAAIDKD---RK-SSSKREAETGIRYYNHENDIMNNRIFYVDDEGILRED 62 (510) Q Consensus 1 ~~~~~~~~~~~~~~~i--------------~~~i~~~---~~-~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~ 62 (510) +..+-..+..+=+..| ..++.-. ++ .+-.++|+.+..+++-. T Consensus 28 ~~s~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~~~~n~~eLI~~YR~ma~~pEvd-------------------- 87 (521) T protein:vir:10 28 IDSFAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAPKIQNTKDLINQYRSLSKYHEVD-------------------- 87 (521) T ss_pred ccccccccCCCCceeeccCCCccccccchhhhhhccccccchHHHHHHHHHHHhhccchh-------------------- Confidence 3333222222211111 1111100 00 00011122222222222 Q ss_pred cccccceeccchhHHHHHHHHhh-hhcCCceeccCc----HHHHHHHH----HHhc-cCHHHHHHHHHHHHHhcCeEEEE Q lcl|NC_013644. 63 KYASNVRIPHGFFPEIVDQKTQY-LLSNPVEYETEN----EELKEYLA----EYYN-SEFQVVLQELVEGSSQKGFEYVY 132 (510) Q Consensus 63 ~~~~~~ki~~n~~~~Iv~~~~~~-l~g~p~~~~~~d----~~~~~~l~----~~~~-n~~~~~~~e~~~~~~~~G~~~~~ 132 (510) +-...||+..+-+ -...||.+..++ +...+.|. .+++ =+|+.+.++..+.+.+.|+.|.+ T Consensus 88 ----------~Av~eIvneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fH 157 (521) T protein:vir:10 88 ----------NAIDEIINDAIVQEDNRDTVYLDLDKTDWNESVKEMVREEFRTILKLLKFEREGKRHFRRWYVDSRIYFH 157 (521) T ss_pred ----------hHHHhhhcceEEecCCCceEEEEecCcccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeeEEEE Confidence 2234444433322 234566665443 23333333 3332 26778889999999999999998 Q ss_pred EEECC----CCceEEEEEcccceEEEEcCCCCceeEEEEEEEEEeeCCceeEEEEEEEEcCCcEEEEEEcCCceeecccc Q lcl|NC_013644. 133 ARTNA----EDRLCFQVADSLNVFGVYNEYNELQRICRHYITEIEKDGETVDIHHAEVWTDQNVYFFVAEDNKDYELDEA 208 (510) Q Consensus 133 v~~d~----~g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~ 208 (510) .-+|. +|-..+..++|+.+-.+.-...+......+ +. .+.-+-+|.+..-.+|...++. T Consensus 158 kiid~~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v--~~--------~~~e~f~Y~~~~~~~~~~~g~~------- 220 (521) T protein:vir:10 158 KMIDPARPKDGIKELRLLDPRNVEYYRVNLKSNENGNDV--YK--------GVKEFFTYGATEDNRYNISGNS------- 220 (521) T ss_pred EEeeCCCccccceeeeeeCCcceeeeeeecCCCCCcchh--hc--------cceeeeeeccCCCceecCCCCC------- Confidence 77763 366788899999885554211110000000 00 0111123333221222111100 Q ss_pred cccccccccccccccccccccccCCccc---EEEec------CCCCCCCcHHHHHHHHHHHHH--HHHHHHHHHHHhccc Q lcl|NC_013644. 209 EPINPRPHVLAVDSENESLLQRSYGQIP---FYRLS------NNKQETTDLKPIKALIDDYDL--MNCFLSNNLQDFAEA 277 (510) Q Consensus 209 ~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~------nn~~g~sd~~~v~~liD~~n~--~~S~~~~~~~~~~~~ 277 (510) ...=+|| |++.. |.....|-+. ..|..+|. ++-|.+-..+..+.| T Consensus 221 ---------------------~~~vkI~~daI~y~hSGL~d~~~~~i~syLh---kAiKp~NQLkm~EDAlVIYRitRAP 276 (521) T protein:vir:10 221 ---------------------NNLVQIPIDAIVYSHSGKVDIDGKTIVGYLH---NVIKPANQLKMLEDAMVIYRITRAP 276 (521) T ss_pred ---------------------CcceeechhheeeecccceeCCCCceeccch---hhhHhHHhhHHHHhhHHHHhhhccc Confidence 0001122 11111 1222233333 33444443 344444444444443 Q ss_pred eeEE----ec-CCCCchhh----hhHhhhcCeeee--------------------c---cCCCceeEEeec--CCHHHHH Q lcl|NC_013644. 278 IYVV----SG-FQGDDLSK----LRQNVKSKKVVG--------------------T---GSDGGLDVKTVT--IPTEGRK 323 (510) Q Consensus 278 ~lv~----~g-~~~~~~~~----~~~~~~~~~~~~--------------------~---~~~~~~~~~~~~--~~~~~~~ 323 (510) =.=+ .| +...-... .+...+...+.. + +++.+.+.-|.+ .+.. .. T Consensus 277 eRRvFYIDvGnlpk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEI~TLpggqnlg-em 355 (521) T protein:vir:10 277 ERRVFYIDVGTMPNKKATQHLNNVMQGLKNRVVYDSSTGKVKNSSNNLAMTEDYWLMRRDGKATTEVSTLPGAQSMG-EM 355 (521) T ss_pred cceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchhhhhhHhhhcccccCCCCccceeeccccCCcC-hH Confidence 2211 11 11111111 111111111111 1 122223333333 2222 23 Q ss_pred HHHHHHHHHHHHHhCCccc--cccc----cCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccc Q lcl|NC_013644. 324 TKMEIDKENIYKFGMAFDS--TQVG----DGNITNIVIKARYTLLNMKANKTEARLRALLEWMNKLVIDDINRRYTKAFD 397 (510) Q Consensus 324 ~~~~~l~~~i~~~s~~p~~--~~~~----~g~~Sg~Ai~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~~ 397 (510) .-+.-+++.+|+.-.+|-. ...+ .|..|. |.......-.-+.+-+..|...+.++++.=+-+-++....+|+ T Consensus 356 ~DV~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr~~E--ItRDEikF~KFI~rLR~rFs~~f~~~L~~qLilKgiit~eew~ 433 (521) T protein:vir:10 356 DDVRWFNRKLYESMKIPLSRLPQEGAGVTFGAGND--ITRDELQFTKYIRGLQQQFEPIFLNPLRTNLMLKGKMSVSEWE 433 (521) T ss_pred HHHHHHHHHHHHHhCCCccccCCCCCceecccccc--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHH Confidence 4455566677776677742 2221 222222 2222333334455666667777777666544333444444554 Q ss_pred c--ceeeEEeCCCCCCCHHHHHH-------HHHHHH----hcCCCchHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013644. 398 P--TEVSFTFTREVMVNETDIVN-------DEKTEA----ETRKIILESILQVAPRLDDDNVLRLICEQFDLDWEDVKEA 464 (510) Q Consensus 398 ~--~~v~i~f~~~~p~d~~e~~~-------~~~~~~----~~g~iS~et~~~~~~~v~d~e~~~~~~e~~e~~~~~~~~~ 464 (510) . ..+.+.|...-.-.+...++ .+..+. -+..+|.+++.+.+=-.+|+|...+....++ . T Consensus 434 ~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~yvGky~s~dyi~k~ILr~tDeeik~~~k~I~~--------E 505 (521) T protein:vir:10 434 EQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEVTGKYLSHEYVMKNILRMSDEDIKTEREKIDG--------E 505 (521) T ss_pred HHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCccccccccchHHHHHHHhcCCHhHHHHHHHHHHH--------h Confidence 3 35777786555444443332 333332 2235799999988666666553321111111 1 Q ss_pred HHhhhccCCCCCCCCC Q lcl|NC_013644. 465 LEEAEYTKGLSDNTDE 480 (510) Q Consensus 465 ~~~~~~~~~~~~~~~~ 480 (510) .+..-++......++. T Consensus 506 ~~~~~~~~p~~e~~df 521 (521) T protein:vir:10 506 LKDSVYKNPEDPMEEF 521 (521) T ss_pred hhCCCCCCCcchhhcC Confidence 1111111110111111 Done!