Query lcl|NC_016654.1_cdsid_YP_005087228.1 [gene=RoPhREQ3_gp36] [protein=portal protein] [protein_id=YP_005087228.1] [location=17475..19076] Match_columns 533 No_of_seqs 156 out of 233 Neff 8.7 Searched_HMMs 1612 Date Thu Nov 7 13:14:47 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_36 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_36_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:78907 Length: 518 100.0 1E-111 8E-115 628.4 51.5 486 15-524 1-518 (518) 2 protein:vir:4782 Length: 522 # 100.0 6E-111 4E-114 624.9 50.4 474 1-532 21-522 (522) 3 protein:vir:79703 Length: 505 100.0 1E-109 7E-113 617.8 49.5 458 1-513 18-505 (505) 4 protein:vir:98883 Length: 517 100.0 3E-108 2E-111 610.1 49.5 466 1-533 18-517 (517) 5 protein:vir:1587 Length: 508 # 100.0 5E-108 3E-111 608.9 50.0 464 1-533 22-508 (508) 6 protein:vir:3028 Length: 500 # 100.0 1E-104 7E-108 590.6 49.6 460 1-525 21-500 (500) 7 protein:vir:9815 Length: 500 # 100.0 1E-104 7E-108 590.6 49.6 460 1-525 21-500 (500) 8 protein:vir:80959 Length: 499 100.0 1E-103 6E-107 585.4 49.7 463 1-527 16-499 (499) 9 protein:vir:38 Length: 496 # N 100.0 1.5E-98 9E-102 556.9 49.7 463 1-527 16-496 (496) 10 protein:vir:5961 Length: 503 # 100.0 1.6E-58 1E-61 337.4 45.9 475 1-533 20-501 (503) 11 protein:vir:105461 Length: 470 100.0 3.3E-57 2E-60 330.3 45.7 463 13-533 1-470 (470) 12 protein:vir:96240 Length: 511 100.0 4E-57 2.5E-60 329.8 44.3 468 1-533 31-511 (511) 13 protein:vir:79043 Length: 479 100.0 4.8E-57 2.9E-60 329.4 44.1 462 1-528 14-479 (479) 14 protein:vir:99781 Length: 511 100.0 4.3E-57 2.7E-60 329.6 43.4 469 1-533 31-511 (511) 15 protein:vir:3964 Length: 453 # 100.0 3.5E-56 2.2E-59 324.6 45.2 446 1-533 1-453 (453) 16 protein:vir:105292 Length: 478 100.0 3.8E-56 2.4E-59 324.4 45.2 468 3-529 1-478 (478) 17 protein:vir:4898 Length: 502 # 100.0 8.7E-56 5.4E-59 322.4 46.6 466 1-533 1-502 (502) 18 protein:vir:95113 Length: 474 100.0 6E-56 3.7E-59 323.3 45.6 455 1-533 7-472 (474) 19 protein:vir:96179 Length: 468 100.0 4E-56 2.5E-59 324.3 44.0 451 1-528 17-468 (468) 20 protein:vir:96839 Length: 474 100.0 1E-55 6.5E-59 322.0 45.4 464 3-531 1-474 (474) 21 protein:vir:96494 Length: 501 100.0 2.3E-55 1.4E-58 320.2 46.3 463 1-533 30-501 (501) 22 protein:vir:106571 Length: 499 100.0 1.8E-55 1.1E-58 320.7 45.7 458 1-533 1-485 (499) 23 protein:vir:2732 Length: 501 # 100.0 1.6E-55 1E-58 321.0 45.2 463 1-533 30-501 (501) 24 protein:vir:107112 Length: 478 100.0 1.9E-55 1.2E-58 320.6 45.2 467 3-533 1-477 (478) 25 protein:vir:102950 Length: 471 100.0 2.3E-55 1.4E-58 320.2 44.3 468 1-529 1-471 (471) 26 protein:vir:9306 Length: 511 # 100.0 2.7E-55 1.7E-58 319.7 43.5 467 1-533 31-511 (511) 27 protein:vir:106639 Length: 481 100.0 6.3E-55 3.9E-58 317.7 45.1 458 1-533 21-480 (481) 28 protein:vir:3609 Length: 452 # 100.0 2.5E-55 1.6E-58 319.9 42.7 439 1-533 9-452 (452) 29 protein:vir:96366 Length: 511 100.0 4.6E-55 2.9E-58 318.5 43.5 467 1-533 31-511 (511) 30 protein:vir:78805 Length: 511 100.0 4.6E-55 2.9E-58 318.5 43.5 467 1-533 31-511 (511) 31 protein:vir:97171 Length: 512 100.0 9.9E-55 6.2E-58 316.6 45.2 469 1-533 19-512 (512) 32 protein:vir:103951 Length: 511 100.0 8.6E-55 5.3E-58 317.0 44.4 468 1-533 31-511 (511) 33 protein:vir:94498 Length: 474 100.0 1.5E-54 9.4E-58 315.6 44.9 457 1-533 5-472 (474) 34 protein:vir:97447 Length: 474 100.0 1.5E-54 9.4E-58 315.6 44.9 457 1-533 5-472 (474) 35 protein:vir:1236 Length: 483 # 100.0 1.5E-54 9.2E-58 315.7 44.0 465 1-533 1-481 (483) 36 protein:vir:95899 Length: 474 100.0 1.7E-54 1.1E-57 315.3 43.8 455 1-533 7-471 (474) 37 protein:vir:96266 Length: 474 100.0 1.7E-54 1.1E-57 315.3 43.8 455 1-533 7-471 (474) 38 protein:vir:94101 Length: 474 100.0 4.6E-54 2.8E-57 313.0 45.3 464 1-533 1-474 (474) 39 protein:vir:105889 Length: 474 100.0 4.6E-54 2.8E-57 313.0 45.3 464 1-533 1-474 (474) 40 protein:vir:99522 Length: 470 100.0 8.3E-54 5.2E-57 311.6 46.1 451 1-531 17-470 (470) 41 protein:vir:9922 Length: 489 # 100.0 2.5E-54 1.5E-57 314.5 43.1 465 3-530 1-489 (489) 42 protein:vir:95806 Length: 440 100.0 3.8E-54 2.3E-57 313.5 44.0 436 17-527 1-440 (440) 43 protein:vir:97336 Length: 492 100.0 4E-54 2.5E-57 313.3 43.8 457 1-533 35-492 (492) 44 protein:vir:94546 Length: 506 100.0 3.4E-54 2.1E-57 313.7 42.3 461 1-533 13-504 (506) 45 protein:vir:94805 Length: 492 100.0 1E-53 6.2E-57 311.2 44.2 456 1-531 35-492 (492) 46 protein:vir:93747 Length: 472 100.0 1.4E-53 8.8E-57 310.3 44.3 457 1-533 1-470 (472) 47 protein:vir:78083 Length: 537 100.0 3E-53 1.9E-56 308.5 45.8 480 1-533 1-530 (537) 48 protein:vir:733 Length: 453 # 100.0 3.8E-53 2.3E-56 308.0 44.6 442 1-533 9-452 (453) 49 protein:vir:9871 Length: 429 # 100.0 3.6E-52 2.3E-55 302.6 43.7 425 31-527 1-429 (429) 50 protein:vir:102330 Length: 451 100.0 3.4E-50 2.1E-53 291.8 42.9 437 13-518 1-451 (451) 51 protein:vir:7768 Length: 484 # 100.0 5.4E-50 3.3E-53 290.7 42.9 463 1-533 1-481 (484) 52 protein:vir:2427 Length: 485 # 100.0 5.9E-49 3.7E-52 285.0 46.1 465 1-533 1-483 (485) 53 protein:vir:78537 Length: 480 100.0 1E-49 6.2E-53 289.2 41.5 458 1-533 1-472 (480) 54 protein:vir:104082 Length: 485 100.0 1.1E-48 7E-52 283.4 45.0 464 1-533 1-483 (485) 55 protein:vir:78227 Length: 480 100.0 4.2E-49 2.6E-52 285.8 41.7 455 1-533 1-474 (480) 56 protein:vir:4223 Length: 486 # 100.0 1.6E-48 1E-51 282.6 44.3 462 1-532 1-486 (486) 57 protein:vir:80680 Length: 441 100.0 1E-46 6.5E-50 272.7 42.5 437 3-529 1-441 (441) 58 protein:vir:2341 Length: 488 # 100.0 2.1E-46 1.3E-49 271.0 43.9 462 1-533 1-484 (488) 59 protein:vir:7987 Length: 456 # 100.0 5.9E-46 3.6E-49 268.6 41.0 448 1-526 1-456 (456) 60 protein:vir:2500 Length: 501 # 100.0 2.7E-45 1.7E-48 264.9 43.0 478 1-533 1-498 (501) 61 protein:vir:102602 Length: 456 100.0 1.7E-45 1E-48 266.0 41.1 448 7-526 1-456 (456) 62 protein:vir:105819 Length: 456 100.0 1.7E-45 1E-48 266.0 41.1 448 7-526 1-456 (456) 63 protein:vir:99916 Length: 504 100.0 1.1E-44 6.8E-48 261.6 43.8 467 1-532 2-504 (504) 64 protein:vir:99072 Length: 479 100.0 1.4E-44 8.7E-48 261.0 40.3 455 1-533 2-470 (479) 65 protein:vir:98444 Length: 434 100.0 4.5E-44 2.8E-47 258.2 40.3 425 40-532 1-434 (434) 66 protein:vir:8184 Length: 474 # 100.0 5E-39 3.1E-42 230.6 40.7 457 2-527 1-474 (474) 67 protein:vir:9568 Length: 410 # 100.0 1.7E-38 1.1E-41 227.7 38.4 400 15-505 1-410 (410) 68 protein:vir:7430 Length: 563 # 100.0 7.4E-38 4.6E-41 224.2 39.7 487 1-533 9-548 (563) 69 protein:vir:9751 Length: 422 # 100.0 3.3E-37 2E-40 220.6 35.8 413 1-503 1-422 (422) 70 protein:vir:101494 Length: 527 100.0 5.8E-37 3.6E-40 219.3 36.5 473 3-529 1-527 (527) 71 protein:vir:102239 Length: 527 100.0 6.3E-37 3.9E-40 219.0 36.6 473 3-529 1-527 (527) 72 protein:vir:94742 Length: 409 100.0 8.5E-36 5.3E-39 212.9 37.5 399 13-488 1-409 (409) 73 protein:vir:1634 Length: 409 # 100.0 8.2E-34 5.1E-37 202.0 32.8 400 13-488 1-409 (409) 74 protein:vir:95149 Length: 501 99.8 6.1E-18 3.8E-21 115.0 39.7 459 3-533 1-501 (501) 75 protein:vir:97265 Length: 513 99.8 9.6E-17 6E-20 108.4 41.8 458 1-533 1-501 (513) 76 protein:vir:94956 Length: 452 99.8 2.8E-16 1.7E-19 105.9 38.9 436 1-528 1-452 (452) 77 protein:vir:80453 Length: 535 99.7 9.1E-16 5.6E-19 103.1 38.8 457 1-530 23-535 (535) 78 protein:vir:95014 Length: 491 99.7 1.9E-14 1.2E-17 95.8 39.4 449 2-528 1-491 (491) 79 protein:vir:78393 Length: 489 99.6 1.2E-13 7.2E-17 91.5 39.3 445 2-528 1-489 (489) 80 protein:vir:93630 Length: 776 99.6 1.2E-14 7.7E-18 96.8 29.3 495 1-533 23-678 (776) 81 protein:vir:79538 Length: 502 99.6 2.1E-13 1.3E-16 90.1 33.6 459 1-530 1-502 (502) 82 protein:vir:80040 Length: 461 99.5 8.1E-14 5E-17 92.3 27.1 443 30-532 1-461 (461) 83 protein:vir:389 Length: 530 # 99.5 3.7E-12 2.3E-15 83.3 39.1 450 1-533 1-527 (530) 84 protein:vir:96738 Length: 505 99.5 4.1E-12 2.5E-15 83.0 40.3 456 1-531 8-505 (505) 85 protein:vir:96783 Length: 488 99.5 4.5E-12 2.8E-15 82.8 36.2 443 1-507 14-488 (488) 86 protein:vir:5249 Length: 437 # 99.5 1.2E-12 7.7E-16 85.9 32.3 410 22-529 1-437 (437) 87 protein:vir:817 Length: 714 # 99.4 1.6E-11 9.7E-15 79.8 34.5 482 1-533 6-633 (714) 88 protein:vir:9950 Length: 714 # 99.4 1.6E-11 9.7E-15 79.8 34.5 482 1-533 6-633 (714) 89 protein:vir:10117 Length: 714 99.4 1.6E-11 9.7E-15 79.8 34.5 482 1-533 6-633 (714) 90 protein:vir:3296 Length: 714 # 99.4 1.6E-11 9.7E-15 79.8 34.5 482 1-533 6-633 (714) 91 protein:vir:2764 Length: 714 # 99.4 1.6E-11 9.7E-15 79.8 34.5 482 1-533 6-633 (714) 92 protein:vir:6382 Length: 553 # 99.4 2E-11 1.2E-14 79.3 38.2 461 1-533 9-553 (553) 93 protein:vir:105619 Length: 772 99.4 3.9E-11 2.4E-14 77.7 32.6 489 1-533 12-647 (772) 94 protein:vir:104437 Length: 714 99.4 4.9E-11 3E-14 77.1 35.4 474 1-533 1-616 (714) 95 protein:vir:77597 Length: 725 99.4 1.9E-11 1.2E-14 79.3 29.7 491 15-533 1-607 (725) 96 protein:vir:3420 Length: 533 # 99.3 7.7E-11 4.8E-14 76.0 39.6 457 1-533 1-531 (533) 97 protein:vir:95542 Length: 548 99.3 9.6E-11 5.9E-14 75.5 42.0 459 1-533 1-513 (548) 98 protein:vir:104338 Length: 422 99.3 1.6E-10 9.7E-14 74.3 31.6 404 3-533 1-422 (422) 99 protein:vir:107742 Length: 537 99.3 1.8E-10 1.1E-13 74.1 30.9 454 1-533 48-532 (537) 100 protein:vir:80165 Length: 651 99.3 3.3E-11 2.1E-14 78.0 26.9 491 1-533 1-604 (651) 101 protein:vir:79647 Length: 435 99.3 1.2E-10 7.5E-14 74.9 29.9 419 1-533 1-433 (435) 102 protein:vir:108295 Length: 711 99.3 2.3E-10 1.4E-13 73.4 33.0 497 3-533 1-633 (711) 103 protein:vir:9263 Length: 725 # 99.2 1.8E-10 1.1E-13 73.9 28.4 491 15-533 1-607 (725) 104 protein:vir:107662 Length: 427 99.2 2.2E-10 1.3E-13 73.6 28.8 413 22-533 1-427 (427) 105 protein:vir:94049 Length: 532 99.2 4.1E-10 2.5E-13 72.1 29.6 473 1-533 1-521 (532) 106 protein:vir:8846 Length: 705 # 99.2 1.4E-10 8.6E-14 74.6 26.6 486 1-533 1-601 (705) 107 protein:vir:99563 Length: 862 99.2 2.9E-10 1.8E-13 72.8 26.9 468 1-533 37-569 (862) 108 protein:vir:100920 Length: 725 99.1 1.3E-09 7.8E-13 69.4 29.4 491 15-533 1-607 (725) 109 protein:vir:96068 Length: 765 99.1 2.2E-09 1.3E-12 68.1 28.1 458 1-533 35-545 (765) 110 protein:vir:10321 Length: 495 99.1 3.2E-09 2E-12 67.1 39.5 449 1-533 1-495 (495) 111 protein:vir:95449 Length: 584 99.0 1.6E-09 1E-12 68.8 25.0 473 1-533 1-583 (584) 112 protein:vir:95821 Length: 763 99.0 2.7E-09 1.7E-12 67.6 24.8 469 1-533 9-633 (763) 113 protein:vir:105429 Length: 708 98.7 1.4E-07 8.9E-11 58.1 33.3 494 15-533 1-619 (708) 114 protein:vir:1538 Length: 535 # 98.6 1.6E-07 9.9E-11 57.8 33.4 457 3-528 1-535 (535) 115 protein:vir:79772 Length: 648 98.6 1.8E-07 1.1E-10 57.6 32.7 441 1-533 34-515 (648) 116 protein:vir:94599 Length: 641 98.6 2.2E-07 1.4E-10 57.1 24.3 483 1-533 1-623 (641) 117 protein:vir:6240 Length: 457 # 98.6 2.7E-07 1.7E-10 56.6 31.7 422 22-533 1-450 (457) 118 protein:vir:1326 Length: 457 # 98.5 3.4E-07 2.1E-10 56.0 32.6 423 22-532 1-457 (457) 119 protein:vir:3520 Length: 720 # 98.5 4.6E-07 2.9E-10 55.3 33.2 493 15-533 1-617 (720) 120 protein:vir:102727 Length: 945 98.5 5E-07 3.1E-10 55.1 32.5 409 1-533 76-537 (945) 121 protein:vir:95315 Length: 559 98.5 5.7E-07 3.6E-10 54.8 28.7 475 1-527 1-559 (559) 122 protein:vir:107822 Length: 555 98.4 7.1E-07 4.4E-10 54.3 28.1 480 1-533 1-553 (555) 123 protein:vir:98506 Length: 555 98.4 7.1E-07 4.4E-10 54.3 28.1 480 1-533 1-553 (555) 124 protein:vir:107404 Length: 555 98.4 7.1E-07 4.4E-10 54.3 28.1 480 1-533 1-553 (555) 125 protein:vir:105002 Length: 432 98.4 8.2E-07 5.1E-10 53.9 32.1 413 30-533 1-432 (432) 126 protein:vir:107605 Length: 432 98.4 8.2E-07 5.1E-10 53.9 32.1 413 30-533 1-432 (432) 127 protein:vir:102855 Length: 432 98.4 8.2E-07 5.1E-10 53.9 32.1 413 30-533 1-432 (432) 128 protein:vir:172 Length: 708 # 98.4 8.6E-07 5.3E-10 53.8 35.3 491 3-533 1-633 (708) 129 protein:vir:80644 Length: 551 98.4 1E-06 6.5E-10 53.4 34.3 414 1-533 55-538 (551) 130 protein:vir:3153 Length: 467 # 98.4 1.1E-06 6.7E-10 53.3 34.4 404 72-533 1-463 (467) 131 protein:vir:3361 Length: 535 # 98.3 1.4E-06 8.6E-10 52.7 34.3 466 3-528 1-535 (535) 132 protein:vir:3843 Length: 397 # 98.3 1.4E-06 8.7E-10 52.7 28.7 386 1-533 1-397 (397) 133 protein:vir:80796 Length: 574 98.3 1.4E-06 8.8E-10 52.6 31.6 415 1-533 66-525 (574) 134 protein:vir:7321 Length: 556 # 98.3 1.5E-06 9.4E-10 52.5 31.3 476 1-528 1-556 (556) 135 protein:vir:100150 Length: 437 98.3 1.7E-06 1E-09 52.2 32.4 410 1-533 1-436 (437) 136 protein:vir:78696 Length: 542 98.3 1.8E-06 1.1E-09 52.0 32.4 467 12-533 1-540 (542) 137 protein:vir:94709 Length: 522 98.2 2.4E-06 1.5E-09 51.4 36.4 450 1-532 1-522 (522) 138 protein:vir:102080 Length: 429 98.2 3E-06 1.8E-09 50.9 31.6 409 30-533 1-429 (429) 139 protein:vir:96579 Length: 576 98.2 3.3E-06 2E-09 50.7 32.1 415 1-533 54-537 (576) 140 protein:vir:1266 Length: 416 # 98.1 3.6E-06 2.2E-09 50.5 30.5 396 1-532 1-416 (416) 141 protein:vir:483 Length: 413 # 98.1 3.6E-06 2.2E-09 50.4 34.0 391 27-533 1-409 (413) 142 protein:vir:8418 Length: 409 # 98.1 4.1E-06 2.6E-09 50.1 32.8 392 1-533 1-408 (409) 143 protein:vir:63755 Length: 547 98.1 5E-06 3.1E-09 49.6 34.5 405 1-533 58-534 (547) 144 protein:vir:4454 Length: 414 # 98.1 5.3E-06 3.3E-09 49.5 34.7 391 30-533 1-410 (414) 145 protein:vir:4156 Length: 542 # 98.1 5.4E-06 3.4E-09 49.5 31.0 411 1-533 15-472 (542) 146 protein:vir:105520 Length: 706 98.1 5.8E-06 3.6E-09 49.3 33.0 497 3-533 1-618 (706) 147 protein:vir:102118 Length: 409 98.0 6.6E-06 4.1E-09 49.0 32.1 391 1-533 1-408 (409) 148 protein:vir:102668 Length: 547 98.0 7.2E-06 4.5E-09 48.8 33.4 460 10-527 1-547 (547) 149 protein:vir:94572 Length: 535 98.0 8.5E-06 5.3E-09 48.4 32.2 468 1-527 1-535 (535) 150 protein:vir:97060 Length: 432 97.9 9.6E-06 5.9E-09 48.1 29.8 413 10-533 1-432 (432) 151 protein:vir:8883 Length: 543 # 97.9 1.2E-05 7.2E-09 47.6 32.9 469 1-533 1-541 (543) 152 protein:vir:93610 Length: 454 97.9 1.2E-05 7.2E-09 47.6 32.4 412 24-533 1-442 (454) 153 protein:vir:189 Length: 424 # 97.9 1.2E-05 7.7E-09 47.5 26.6 404 1-533 1-424 (424) 154 protein:vir:10362 Length: 432 97.9 1.3E-05 8.1E-09 47.4 30.4 413 10-533 1-432 (432) 155 protein:vir:81072 Length: 432 97.9 1.4E-05 8.8E-09 47.2 29.3 413 10-533 1-432 (432) 156 protein:vir:3868 Length: 417 # 97.8 1.7E-05 1E-08 46.7 28.7 391 1-533 1-415 (417) 157 protein:vir:4194 Length: 540 # 97.8 1.9E-05 1.2E-08 46.4 31.7 427 1-533 6-480 (540) 158 protein:vir:4337 Length: 434 # 97.8 2.1E-05 1.3E-08 46.3 30.3 410 1-532 1-434 (434) 159 protein:vir:104500 Length: 537 97.7 3E-05 1.9E-08 45.4 25.4 449 1-533 14-536 (537) 160 protein:vir:7017 Length: 515 # 97.7 3E-05 1.9E-08 45.4 33.3 436 22-530 1-515 (515) 161 protein:vir:101648 Length: 518 97.6 3.4E-05 2.1E-08 45.1 32.1 417 1-533 1-457 (518) 162 protein:vir:10447 Length: 536 97.6 3.8E-05 2.3E-08 44.8 32.7 459 3-529 1-536 (536) 163 protein:vir:1431 Length: 419 # 97.6 4.1E-05 2.5E-08 44.6 32.2 389 27-533 1-413 (419) 164 protein:vir:7407 Length: 392 # 97.6 4.1E-05 2.5E-08 44.6 32.2 374 28-530 1-392 (392) 165 protein:vir:5737 Length: 419 # 97.6 4.3E-05 2.7E-08 44.5 29.4 388 1-533 1-412 (419) 166 protein:vir:7853 Length: 518 # 97.6 4.4E-05 2.7E-08 44.5 32.1 417 1-533 1-457 (518) 167 protein:vir:95599 Length: 563 97.5 5E-05 3.1E-08 44.2 31.5 425 1-533 43-528 (563) 168 protein:vir:99312 Length: 563 97.5 5E-05 3.1E-08 44.2 31.5 425 1-533 43-528 (563) 169 protein:vir:2198 Length: 536 # 97.5 5.6E-05 3.4E-08 43.9 32.9 461 3-529 1-536 (536) 170 protein:vir:105782 Length: 449 97.5 6.1E-05 3.8E-08 43.7 27.8 414 3-532 1-449 (449) 171 protein:vir:95378 Length: 406 97.4 6.6E-05 4.1E-08 43.5 32.1 382 1-533 1-404 (406) 172 protein:vir:81152 Length: 411 97.4 6.6E-05 4.1E-08 43.5 34.0 394 30-533 1-410 (411) 173 protein:vir:4598 Length: 416 # 97.4 7.3E-05 4.5E-08 43.3 30.5 392 1-528 1-416 (416) 174 protein:vir:81095 Length: 416 97.4 7.3E-05 4.5E-08 43.3 30.5 392 1-528 1-416 (416) 175 protein:vir:100249 Length: 431 97.4 7.5E-05 4.6E-08 43.2 34.5 390 1-530 1-431 (431) 176 protein:vir:1380 Length: 422 # 97.3 0.0001 6.2E-08 42.5 32.5 405 30-533 1-422 (422) 177 protein:vir:1785 Length: 555 # 97.3 0.0001 6.5E-08 42.4 33.6 465 11-533 1-552 (555) 178 protein:vir:3139 Length: 599 # 97.3 0.00011 6.6E-08 42.4 30.3 475 1-525 1-599 (599) 179 protein:vir:9702 Length: 406 # 97.3 0.00011 6.8E-08 42.3 30.4 391 26-533 1-405 (406) 180 protein:vir:107880 Length: 491 97.3 0.00011 6.8E-08 42.3 33.8 403 1-533 1-423 (491) 181 protein:vir:103177 Length: 533 97.3 0.00011 7.1E-08 42.2 24.7 453 1-533 1-526 (533) 182 protein:vir:100039 Length: 522 97.2 0.00014 8.4E-08 41.8 31.8 451 13-533 1-521 (522) 183 protein:vir:98396 Length: 441 97.1 0.00016 9.7E-08 41.4 32.0 392 1-533 26-441 (441) 184 protein:vir:9408 Length: 441 # 97.1 0.00017 1E-07 41.3 31.8 396 1-533 26-441 (441) 185 protein:vir:79984 Length: 441 97.1 0.00017 1E-07 41.3 31.8 396 1-533 26-441 (441) 186 protein:vir:103330 Length: 517 97.1 0.00017 1E-07 41.3 33.1 450 1-525 1-517 (517) 187 protein:vir:4952 Length: 386 # 97.1 0.00018 1.1E-07 41.1 31.6 375 1-530 1-386 (386) 188 protein:vir:4509 Length: 424 # 97.0 0.00021 1.3E-07 40.8 31.1 402 20-531 1-424 (424) 189 protein:vir:99452 Length: 651 97.0 0.00022 1.4E-07 40.7 23.7 436 1-533 31-537 (651) 190 protein:vir:960 Length: 413 # 97.0 0.00024 1.5E-07 40.5 32.8 393 1-533 1-412 (413) 191 protein:vir:4854 Length: 386 # 96.9 0.00027 1.7E-07 40.2 29.4 377 1-530 1-386 (386) 192 protein:vir:93943 Length: 409 96.9 0.00028 1.7E-07 40.1 29.4 376 22-532 1-409 (409) 193 protein:vir:3989 Length: 392 # 96.8 0.00032 2E-07 39.7 32.4 377 28-530 1-392 (392) 194 protein:vir:1023 Length: 392 # 96.8 0.00032 2E-07 39.7 32.4 377 28-530 1-392 (392) 195 protein:vir:100187 Length: 385 96.7 0.00037 2.3E-07 39.4 31.0 372 1-530 1-385 (385) 196 protein:vir:100882 Length: 383 96.7 0.00038 2.4E-07 39.3 30.7 375 1-527 1-383 (383) 197 protein:vir:99672 Length: 532 96.7 0.00041 2.5E-07 39.2 34.2 459 1-527 1-532 (532) 198 protein:vir:103765 Length: 549 96.6 0.00051 3.2E-07 38.6 31.4 467 1-531 1-549 (549) 199 protein:vir:9359 Length: 348 # 96.6 0.00052 3.2E-07 38.6 30.4 336 89-532 1-348 (348) 200 protein:vir:96980 Length: 409 96.5 0.00055 3.4E-07 38.5 29.9 396 22-532 1-409 (409) 201 protein:vir:80333 Length: 419 96.5 0.00056 3.5E-07 38.4 31.5 388 1-533 1-413 (419) 202 protein:vir:105064 Length: 421 96.5 0.00058 3.6E-07 38.3 31.5 386 1-533 1-413 (421) 203 protein:vir:100691 Length: 535 96.5 0.0006 3.7E-07 38.2 34.1 434 1-533 1-523 (535) 204 protein:vir:1884 Length: 424 # 96.3 0.00074 4.6E-07 37.7 32.0 404 1-533 1-424 (424) 205 protein:vir:8317 Length: 409 # 96.3 0.00078 4.8E-07 37.6 24.7 372 22-519 1-409 (409) 206 protein:vir:5839 Length: 533 # 96.2 0.00087 5.4E-07 37.4 23.8 420 1-533 26-506 (533) 207 protein:vir:104892 Length: 558 96.2 0.00089 5.5E-07 37.3 26.0 465 15-533 1-539 (558) 208 protein:vir:4995 Length: 384 # 96.2 0.00091 5.6E-07 37.3 22.9 372 22-523 1-384 (384) 209 protein:vir:101541 Length: 694 96.2 0.00093 5.7E-07 37.2 28.5 432 1-533 39-541 (694) 210 protein:vir:101806 Length: 516 95.9 0.0012 7.6E-07 36.6 18.7 441 1-523 1-516 (516) 211 protein:vir:101189 Length: 516 95.9 0.0012 7.6E-07 36.6 18.7 441 1-523 1-516 (516) 212 protein:vir:94426 Length: 409 95.8 0.0014 8.6E-07 36.2 31.2 393 22-532 1-409 (409) 213 protein:vir:96988 Length: 516 95.8 0.0014 8.8E-07 36.2 31.6 441 1-530 1-516 (516) 214 protein:vir:103219 Length: 201 95.3 0.0024 1.5E-06 35.0 16.2 196 270-533 1-201 (201) 215 protein:vir:80134 Length: 403 95.3 0.0024 1.5E-06 34.9 29.7 374 22-533 1-401 (403) 216 protein:vir:6322 Length: 510 # 95.1 0.0027 1.7E-06 34.6 35.8 434 29-525 1-510 (510) 217 protein:vir:3648 Length: 695 # 94.9 0.0033 2E-06 34.2 28.6 432 1-533 62-542 (695) 218 protein:vir:78942 Length: 510 94.8 0.0036 2.2E-06 34.0 37.3 433 29-525 1-510 (510) 219 protein:vir:5665 Length: 511 # 94.6 0.0039 2.4E-06 33.8 24.4 442 1-514 16-511 (511) 220 protein:vir:105641 Length: 516 94.5 0.0042 2.6E-06 33.6 32.3 434 1-530 1-516 (516) 221 protein:vir:106282 Length: 521 94.4 0.0045 2.8E-06 33.4 24.2 428 1-533 31-520 (521) 222 protein:vir:8100 Length: 466 # 94.3 0.0048 3E-06 33.3 33.0 419 1-533 1-466 (466) 223 protein:vir:78589 Length: 695 94.2 0.005 3.1E-06 33.2 28.9 432 1-533 40-542 (695) 224 protein:vir:101647 Length: 460 94.0 0.0057 3.6E-06 32.9 31.7 410 15-529 1-460 (460) 225 protein:vir:106716 Length: 698 93.8 0.0062 3.9E-06 32.7 29.1 430 1-533 40-542 (698) 226 protein:vir:104259 Length: 403 93.5 0.0072 4.5E-06 32.3 26.5 380 30-533 1-403 (403) 227 protein:vir:100598 Length: 516 93.5 0.0073 4.5E-06 32.3 25.2 441 1-520 1-516 (516) 228 protein:vir:4828 Length: 382 # 93.5 0.0073 4.5E-06 32.3 29.5 374 1-530 1-382 (382) 229 protein:vir:2683 Length: 412 # 93.4 0.0075 4.7E-06 32.2 34.4 383 37-532 1-412 (412) 230 protein:vir:6596 Length: 521 # 93.3 0.0081 5E-06 32.0 23.8 439 18-533 1-520 (521) 231 protein:vir:9507 Length: 395 # 92.8 0.0098 6.1E-06 31.6 27.6 376 30-533 1-395 (395) 232 protein:vir:100650 Length: 395 92.8 0.0098 6.1E-06 31.6 27.6 376 30-533 1-395 (395) 233 protein:vir:101289 Length: 395 92.8 0.0098 6.1E-06 31.6 27.6 376 30-533 1-395 (395) 234 protein:vir:94666 Length: 723 91.8 0.014 8.8E-06 30.7 31.7 396 29-533 1-440 (723) 235 protein:vir:103458 Length: 524 91.4 0.016 9.9E-06 30.4 20.9 458 1-533 1-523 (524) 236 protein:vir:81218 Length: 423 91.4 0.016 1E-05 30.4 29.7 392 30-533 1-422 (423) 237 protein:vir:7208 Length: 524 # 91.3 0.016 1E-05 30.4 20.8 458 1-533 1-523 (524) 238 protein:vir:345 Length: 663 # 90.9 0.019 1.2E-05 30.1 22.5 469 1-533 1-587 (663) 239 protein:vir:103860 Length: 528 90.8 0.019 1.2E-05 30.0 31.2 422 1-533 1-449 (528) 240 protein:vir:108215 Length: 469 90.8 0.019 1.2E-05 30.0 31.8 429 3-533 1-465 (469) 241 protein:vir:80211 Length: 514 90.5 0.021 1.3E-05 29.8 37.1 428 29-518 1-514 (514) 242 protein:vir:99853 Length: 488 90.5 0.021 1.3E-05 29.8 31.5 393 1-533 1-410 (488) 243 protein:vir:98265 Length: 524 90.0 0.023 1.4E-05 29.6 23.0 452 1-533 1-523 (524) 244 protein:vir:1082 Length: 359 # 89.3 0.027 1.7E-05 29.2 29.0 343 26-490 1-359 (359) 245 protein:vir:6210 Length: 394 # 87.3 0.04 2.5E-05 28.3 31.4 376 30-530 1-394 (394) 246 protein:vir:95965 Length: 385 82.6 0.075 4.7E-05 26.7 27.8 367 30-533 1-385 (385) 247 protein:vir:78161 Length: 355 78.3 0.12 7.2E-05 25.7 21.6 313 172-533 1-335 (355) 248 protein:vir:79063 Length: 491 78.2 0.12 7.3E-05 25.7 33.2 400 1-533 1-421 (491) 249 protein:vir:108049 Length: 524 76.9 0.13 8.1E-05 25.4 24.7 447 15-533 1-523 (524) 250 protein:vir:106999 Length: 564 76.0 0.14 8.7E-05 25.3 26.6 467 1-533 13-560 (564) 251 protein:vir:81017 Length: 521 75.2 0.15 9.3E-05 25.1 27.7 439 15-533 1-520 (521) 252 protein:vir:99232 Length: 526 74.5 0.16 9.8E-05 25.0 36.4 419 1-533 1-447 (526) 253 protein:vir:6896 Length: 523 # 74.4 0.16 9.8E-05 25.0 23.6 441 1-533 1-522 (523) 254 protein:vir:98853 Length: 219 73.5 0.17 0.0001 24.8 15.7 204 193-456 1-219 (219) 255 protein:vir:79207 Length: 351 65.6 0.28 0.00017 23.6 19.6 294 29-459 1-351 (351) 256 protein:vir:78191 Length: 351 65.4 0.28 0.00018 23.6 22.7 326 1-459 1-351 (351) 257 protein:vir:98567 Length: 340 64.5 0.3 0.00019 23.5 20.2 286 62-456 1-340 (340) 258 protein:vir:94869 Length: 378 63.3 0.32 0.0002 23.3 22.6 354 30-533 1-378 (378) 259 protein:vir:1661 Length: 378 # 63.3 0.32 0.0002 23.3 21.3 354 22-533 1-378 (378) 260 protein:vir:79233 Length: 526 62.7 0.33 0.00021 23.2 37.1 417 1-533 1-447 (526) 261 protein:vir:94002 Length: 378 57.6 0.43 0.00027 22.6 21.1 354 30-533 1-378 (378) 262 protein:vir:95254 Length: 488 50.8 0.6 0.00037 21.8 30.8 440 1-533 1-478 (488) 263 protein:vir:78641 Length: 278 47.9 0.69 0.00043 21.5 26.3 265 89-452 1-278 (278) 264 protein:vir:858 Length: 378 # 46.7 0.72 0.00045 21.3 22.2 354 30-533 1-378 (378) 265 protein:vir:1986 Length: 512 # 46.5 0.73 0.00045 21.3 31.5 421 1-533 1-445 (512) 266 protein:vir:9641 Length: 395 # 41.7 0.91 0.00057 20.8 26.9 388 30-533 1-394 (395) 267 protein:vir:93867 Length: 378 40.8 0.95 0.00059 20.7 20.6 347 30-533 1-378 (378) 268 protein:vir:79150 Length: 368 37.2 1.1 0.0007 20.3 17.4 311 36-468 1-368 (368) 269 protein:vir:103971 Length: 376 33.4 1.4 0.00084 19.9 21.4 317 11-459 1-376 (376) 270 protein:vir:98643 Length: 395 23.9 2.2 0.0014 18.7 26.3 387 30-532 1-395 (395) 271 protein:vir:78310 Length: 376 21.5 2.6 0.0016 18.3 28.7 359 30-532 1-376 (376) 272 protein:vir:5691 Length: 344 # 20.9 2.7 0.0017 18.2 22.7 285 36-457 1-344 (344) No 1 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=100.00 E-value=1.3e-111 Score=628.42 Aligned_cols=486 Identities=17% Similarity=0.188 Sum_probs=403.1 Q ss_pred hHHHHHHHHh-hhHhhcCCHH----HHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHH Q lcl|NC_016654. 15 LAAVTARVAE-SHVWWEGDLD----KLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLS 89 (533) Q Consensus 15 ~~~~~~~~~~-~~~w~~gd~~----~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~ 89 (533) +- +...|.+ =..||+|+|+ +|.++|++..+ .++.......++++ +|+.+.+++..++++|+|||+.||+++ T Consensus 1 ~~-~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~-~w~~~~~~~~~~~~~~~~l~~~i~~~~ 76 (518) T protein:vir:78 1 MG-VWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVP--DNQKEWSKDSYLTS-LWAQGYVPTVHDKLMNSGTGNEIVVVA 76 (518) T ss_pred Cc-chhhHHHHHHHhhcCCCCccchhccHHHhhhcc--cchhhhhhhhhhhh-hcccCCCCccccccccCChHHHHHHHH Confidence 22 2223332 3569999998 67677765432 22222222334444 455666778889999999999999999 Q ss_pred HHhhcCCCceEeeCC----CchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEE Q lcl|NC_016654. 90 TTELFSEQLKFLDAG----KSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIP 165 (533) Q Consensus 90 a~ll~~e~~~i~~~~----~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P 165 (533) |+|||+|+++|++++ +++.++++|++++++|+|+.++.++++.|+++|++|+|||||. ++++|++|+|++++| T Consensus 77 A~ll~~e~~~i~v~~~~~~d~e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~---~~~~i~~v~ad~~~P 153 (518) T protein:vir:78 77 AEYISGKPLSIDVTGVNGSKDENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINILN---GRPSISVHSSSQFWI 153 (518) T ss_pred HHhhcCCCceEEecCccccCcHHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEEC---CeeEEEEEcCCeeEE Confidence 999999999999864 4677899999999999999999999999999999999999984 579999999999999 Q ss_pred EEecCCceEEEEEEEEeecCCceEEEEEEEec------------CeeEEEEEEeccCCcccceeehhhcccccccccc-- Q lcl|NC_016654. 166 EFRWGRLVAVTFWSELAGGDGQEVWRHLERHE------------SGYIVHAVYKGTATSLGWMMALTDHPATRDIAVE-- 231 (533) Q Consensus 166 ~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~------------~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~-- 231 (533) +|++|++++|+|++++...+++++||+||+|+ +|+|+|++|+++ .|..++++.++.+..+... T Consensus 154 ~~~~g~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~---~~~~v~~~~~~~~~~l~~~~~ 230 (518) T protein:vir:78 154 DFKNNEPFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKID---GDKTTPISAERLPEQITSYLH 230 (518) T ss_pred EeecCcEEEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeec---Ccccccccccccccccccccc Confidence 99999999999999988888889999999995 689999999864 4556777777777666542 Q ss_pred ccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHH Q lcl|NC_016654. 232 GADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESV 311 (533) Q Consensus 232 ~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~ 311 (533) ..+..+...+.||.+++|++|+||..+|..+++ ||||+|+|+++ +++||+||++||+|+|+|++|+.+|+||+++ T Consensus 231 ~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~----splG~S~~~~~-~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~ 305 (518) T protein:vir:78 231 TNDIQLNHSVSIGLKSMGAYLINNSPSNTRYPH----LNLGESDLSQC-TNYLFAVDYFFTVYMREGEKTKTKIAASERM 305 (518) T ss_pred cccCccceeeccCCccceEEeeccccccccccC----CCcCcchHhhh-hHHHHHHHHHHHHHHHHHHhCCceeeechhH Confidence 334555677889999999999999987777764 78999999986 5999999999999999999999999999999 Q ss_pred hcCCCCcc----ccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCC Q lcl|NC_016654. 312 LTNLGMGQ----GVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDE 387 (533) Q Consensus 312 l~~~~~~~----~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~ 387 (533) ++....+. ...|+.+.++|+++.. ..+.+++++..++.+||+||+++|+++++.+|++|+++||+|+++||++ + T Consensus 306 l~~~~~~~~~~~~~~fd~~~~~y~~i~~-~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~-~ 383 (518) T protein:vir:78 306 FRKKVNKSTDKEEWSMNVDEDYFMQFKG-TLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLG-N 383 (518) T ss_pred hccCCCCCCCccccccCCCCceEEEecC-cCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCcc-c Confidence 97664443 3457888899998875 3456677788999999999999999999999999999999999999986 4 Q ss_pred cchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC---CCCCCceeEEEEeCCCCCCCHHHHHHHHHH Q lcl|NC_016654. 388 VAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPG---KGAAPSEELELEWPKFARESDLAKAQTVQA 464 (533) Q Consensus 388 ~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~---~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~ 464 (533) +.+|||||++++++|++++++|++.++.+|++|++++|++.+..+.. ....+..+|+|+|+|++++|++++++++++ T Consensus 384 ~~~TATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~~~ 463 (518) T protein:vir:78 384 REVKATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSSTLNN 463 (518) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCHHHHHHHHHH Confidence 57899999999999999999999999999999999999998765332 234456789999999999999999999999 Q ss_pred HHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCc-c-ccccccCCCCCCCCCCC Q lcl|NC_016654. 465 WSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPT-F-GFGTDQPPLPTENDPAT 524 (533) Q Consensus 465 l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~-~-~~~~~~~~~~~~~~~~~ 524 (533) ++++|+||++++|+++|++|+|+||++|++||++|+++++++ . ..++ +.+ .+| T Consensus 464 ~v~aGimS~e~~i~~~~~~~~deea~~e~~ri~~E~~~~~~~~p~~~~g-~~~------~~g 518 (518) T protein:vir:78 464 MNSALAMSVEEKVKLIHPKWEDEEIQAEVKRIYLENAIGEVPDPEAIGG-MET------KGG 518 (518) T ss_pred HHhcCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHHhcccCCCCCccccC-CCC------CCC Confidence 999999999999999999999999999999999999875432 1 1111 111 111 No 2 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=100.00 E-value=5.8e-111 Score=624.92 Aligned_cols=474 Identities=15% Similarity=0.145 Sum_probs=389.9 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) .++=.-....+|+-.+..+.+|..|+.||+|++..+. |... .....+++++|+| T Consensus 21 ~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~--~~~~------------------------~~~~~~~~~~sln 74 (522) T protein:vir:47 21 SNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQ--YKNT------------------------DGDIKSRPMNHLP 74 (522) T ss_pred ccchhccccCCCCCCHHHHHHHHHHHHHhcCCccccc--cccc------------------------Ccchhcccceecc Confidence 2332223334444457899999999999999875441 1110 0112346789999 Q ss_pred hHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcC Q lcl|NC_016654. 81 IPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDA 160 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~ 160 (533) +|+.||+++|+|||+||++|+++ ++.++++|++++++|+|..++.++++.|+++|++|||||||. ++++|++|+| T Consensus 75 l~~~i~~~~A~lv~~e~~~i~v~--d~~~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~---~~~~i~~v~a 149 (522) T protein:vir:47 75 IARTASKKIASLVYNEQATITTK--NEILQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYIDG---DKVRVAFIQA 149 (522) T ss_pred hHHHHHHHHhhhhcCCcceeecC--ChHHHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEcC---CceEEEEEcC Confidence 99999999999999999999995 567899999999999999999999999999999999999984 5799999999 Q ss_pred CeEEEE-EecCCceEEEEEEE-EeecCC-ceEEEEEEEec----------------CeeEEEEEEecc-CCcccceeehh Q lcl|NC_016654. 161 DRAIPE-FRWGRLVAVTFWSE-LAGGDG-QEVWRHLERHE----------------SGYIVHAVYKGT-ATSLGWMMALT 220 (533) Q Consensus 161 ~~~~P~-~~~g~~~~v~f~~~-~~~~~~-~~~y~~lE~h~----------------~~~I~~~~y~~~-~~~lG~~v~l~ 220 (533) ++++|+ |+.+.+++++|+.+ +...++ ..+||+||+|+ +|+|+|++|++. ..+||.+|+|+ T Consensus 150 d~~~P~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~ 229 (522) T protein:vir:47 150 PVFFPLESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLS 229 (522) T ss_pred CceEEEEEcCCceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCcccccc Confidence 999997 66677777766554 443333 44789999996 699999999985 46899999999 Q ss_pred hccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_016654. 221 DHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRI 300 (533) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~ 300 (533) ++++|+++.+++ .+.+..+|+|+||+||+.|+.+.. ||+|+|+|++++ ++||+||++||+|+|+|++ T Consensus 230 ~~~e~~~l~~~~-------~~~~~~~Plf~y~~~~~~N~~~~~-----splG~S~~~~~~-~~id~lD~~~s~~~~e~~~ 296 (522) T protein:vir:47 230 ELDKYKNLEPVT-------VFENLSRPLFTYLKTPGMNNKDIN-----SPLGLSIFDNAK-TTIDFINRSYDEFMWEVRM 296 (522) T ss_pred ccccccCCCCce-------EeCCCCcceEEEecCCcccccccC-----CCcCCchhhhhH-HHHHHHHHHHHHHHHHHHh Confidence 999999987754 355667889999999999887653 889999999865 9999999999999999999 Q ss_pred CcceeeechHHhcCCCCcc------ccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHh Q lcl|NC_016654. 301 GAGKVHASESVLTNLGMGQ------GVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRK 374 (533) Q Consensus 301 ~~~~i~v~~~~l~~~~~~~------~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~ 374 (533) |+.+|+||++|++...++. ...||.++++|+++.....+ ...++.+||+||+++|.++++.++++++++ T Consensus 297 g~~~i~v~~~~l~~~~~~~~g~~~~~~~fd~~~~~f~~~~~~~~~-----~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~ 371 (522) T protein:vir:47 297 GQRRVIVPEHLTQRQYQRPDGTIDFRPRFDVEQNVYMQIGGSSMD-----AGGITDLTSPIRANDYILAISEGLKLFEMQ 371 (522) T ss_pred ccceeecchHHhccCCCCCCcccccccccCcccceEeecCCCCCC-----CCcceeeccccChHHHHHHHHHHHHHHHHH Confidence 9999999999998754433 23578788899887654332 234889999999999999999999999999 Q ss_pred hCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hccCCCCCCceeEEEEeCCCCCC Q lcl|NC_016654. 375 TGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAI-KFPGKGAAPSEELELEWPKFARE 453 (533) Q Consensus 375 ~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~-~~~~~~~~~~~~v~i~f~d~i~~ 453 (533) ||||+++||+++++.+|||||++++++|++++++|++.|+.+|++|+++|+++.+. .++++......+|+|+|+|++++ T Consensus 372 ~gls~~tf~~~~~~~kTAtEi~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~ 451 (522) T protein:vir:47 372 IGVSSGMFTFDGQGMKTATEIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFT 451 (522) T ss_pred hCCCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCC Confidence 99999999999999999999999999999999999999999999999999999764 35566677788999999999999 Q ss_pred CHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCcccc-ccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 454 SDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGF-GTDQPPLPTENDPATDPEAVDEG 532 (533) Q Consensus 454 d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~d~ 532 (533) |++++++++++++++|+||++++|+++| +|+|+||++|++||++|+++++|.... ++...+..+.++ .+| T Consensus 452 D~~~~~~~~~~~v~aG~~s~e~~i~~~~-g~~eeea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~d--------~~~ 522 (522) T protein:vir:47 452 DRHAELDYWAKMVAAGFSTKKRAIGKTL-NISGVEAEKELNAINSELLPMNDAELAIYGMHDQNEEKAD--------DKG 522 (522) T ss_pred CHHHHHHHHHHHHhcCCCCHHHHHHhcC-CCChHHHHHHHHHHHHhhccCCCCCCCCCCCCCcccccCC--------CCC Confidence 9999999999999999999999999865 599999999999999999877653221 111111111111 122 No 3 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=100.00 E-value=1.2e-109 Score=617.81 Aligned_cols=458 Identities=16% Similarity=0.180 Sum_probs=386.0 Q ss_pred CCCC--------CCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCc Q lcl|NC_016654. 1 MSLP--------EANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGR 72 (533) Q Consensus 1 ~~~~--------~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 72 (533) |.+= .+...+||. .+.+|+.|+.||+|++..|.+.. . ..... T Consensus 18 ~~~~~~~~~i~d~~~i~~~~~----~~~~i~~~~~~Y~g~~~~l~~~~-~-------------------------~~~~~ 67 (505) T protein:vir:79 18 VGMTKSLGQIIDDPRINLPAD----EVERIARDKRYYMDDFKQVTHKN-S-------------------------YGDTQ 67 (505) T ss_pred hcchhhhhhhhcccCCCCCHH----HHHHHHHHHHHhcCCCccccccc-c-------------------------CCCcc Confidence 1111 134455443 46789999999999987652110 0 01123 Q ss_pred ccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCc Q lcl|NC_016654. 73 APKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADN 152 (533) Q Consensus 73 ~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~ 152 (533) .++++|+|+|+.||+++|+|||+|||+|+++ ++..+++|++++++|+|+.++.++++.|+++|++|+|||||+ ++ T Consensus 68 ~~~~~slnl~~~i~~~~A~ll~~e~~~i~~~--d~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D~---~~ 142 (505) T protein:vir:79 68 KHELQSVNVTKLASAKLASLIFNEQCQVTVS--DETANDFLDDVFQQNDFYTTFEEKLEEWIALGSGCVRPYVDS---GK 142 (505) T ss_pred ccceeecchHHHHHHHHHhhhcCCCceeecC--ChHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCeEEEEEEeC---Cc Confidence 5678999999999999999999999999995 467899999999999999999999999999999999999994 57 Q ss_pred eEEEEEcCCeEEEE-EecCCceEEEEEEEEeecC--CceEEEEEEEec----CeeEEEEEEecc-CCcccceeehhhccc Q lcl|NC_016654. 153 AWIDFVDADRAIPE-FRWGRLVAVTFWSELAGGD--GQEVWRHLERHE----SGYIVHAVYKGT-ATSLGWMMALTDHPA 224 (533) Q Consensus 153 ~~i~~v~~~~~~P~-~~~g~~~~v~f~~~~~~~~--~~~~y~~lE~h~----~~~I~~~~y~~~-~~~lG~~v~l~~~~~ 224 (533) ++|++++|++++|+ |+++++++|+|+.++...+ +..+||++|+|+ +|+|+|++|++. .++||.+|+|+++++ T Consensus 143 ~~i~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~ 222 (505) T protein:vir:79 143 IKLAWATADQVYPLQADTNQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQ 222 (505) T ss_pred eEEEEEcCCeeEEEEEcCCCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhccc Confidence 99999999999998 6789999999988776533 456899999996 899999999985 468999999999999 Q ss_pred cccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcce Q lcl|NC_016654. 225 TRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGK 304 (533) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~ 304 (533) |+++.+++ .+.+..+|+|++|++|..||.+. .||+|+|+|+++ +++||+||++||+|+|+|++|+.+ T Consensus 223 ~~~l~~~~-------~~~g~~~p~f~~~~~~~~N~~~~-----~splG~S~~~~~-~~~id~lD~~~s~~~~e~~~g~~~ 289 (505) T protein:vir:79 223 YEGLEPQV-------KITGLKHPLFAFYRNKGANNKNF-----TSPMGMSLIDNS-YTVIDAINRTHDQFVDEVKKGQRR 289 (505) T ss_pred ccccCcce-------eecCCCcceEEEecCCccccccc-----CCccCCchhhhh-HHHHHHHHHHHHHHHHHHHhcccc Confidence 99987754 23444568888888888877665 378999999986 599999999999999999999999 Q ss_pred eeechHHhcCCCCccc-------cccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCC Q lcl|NC_016654. 305 VHASESVLTNLGMGQG-------VSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGY 377 (533) Q Consensus 305 i~v~~~~l~~~~~~~~-------~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~ 377 (533) |+||++|++....+.+ ..|+.+.++|+.+... +++.+++++||+||+++|+++++.++++++++||+ T Consensus 290 i~v~~~~l~~~~~~~~~~~~~~~~~fd~~~~~y~~~~~~------~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~ 363 (505) T protein:vir:79 290 LIVPAEWLKTGSSYGGQASETHPPMFDPDETVYQAMYGD------ASEVGFHDATSPIRVADYQATMDFFLREFENQTGL 363 (505) T ss_pred eeechHHhcccCCCCcccccccccCCCccceeeeeccCC------CCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCC Confidence 9999999987655443 2477788888776532 22346999999999999999999999999999999 Q ss_pred ChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-------CCCCCCceeEEEEeCCC Q lcl|NC_016654. 378 SPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFP-------GKGAAPSEELELEWPKF 450 (533) Q Consensus 378 s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~-------~~~~~~~~~v~i~f~d~ 450 (533) |+++||+++++.+|||||++++++|++++++|++.|+.+|++|+++|+++++.... .....+..+++|+|+|+ T Consensus 364 s~~~~~~~~~~~~TAtei~s~~~~l~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~ 443 (505) T protein:vir:79 364 SQGTFTTSPSGIQTATEVVTNNSQTYQTRSSYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDG 443 (505) T ss_pred ChhhcCCCccccchHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCC Confidence 99999999999999999999999999999999999999999999999999764321 12345677899999999 Q ss_pred CCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCcccccccc Q lcl|NC_016654. 451 ARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQ 513 (533) Q Consensus 451 i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~ 513 (533) +++|++++++.+++++++|+||+++++++ |++|||+||++|++||++|+++.+|.+...|++ T Consensus 444 i~~d~~~~~~~~~~~v~~Gi~s~e~~l~~-~~~~~eeea~~el~ri~~E~~~~~p~~~~~gg~ 505 (505) T protein:vir:79 444 VFVDQESKRAADLQAVQAQVMPKKQFLMR-NYGLDEEEADEWLAQIDAENSTAEPEFNQFGGD 505 (505) T ss_pred CCCCHHHHHHHHHHHHHcCCCCHHHHHHh-cCCCChHHHHHHHHHHHHhccccCCCchhccCC Confidence 99999999999999999999999999987 678999999999999999998877766443322 No 4 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=100.00 E-value=3e-108 Score=610.09 Aligned_cols=466 Identities=17% Similarity=0.170 Sum_probs=381.9 Q ss_pred CCCCC-------CcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcc Q lcl|NC_016654. 1 MSLPE-------ANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRA 73 (533) Q Consensus 1 ~~~~~-------~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 73 (533) |.+=. +.-.= |+. .+.+|+.|+.||+|++.++. |.. ....+++ T Consensus 18 ~~~~~~~~~~~~~~i~~-~~~---~~~~I~~w~~~Y~g~~~~~~--~~~------------------------~~~~~~~ 67 (517) T protein:vir:98 18 LSGQTLKSINDHEKINI-DPN---ELARIERNLRQYEGDYPQVE--YIN------------------------SQGKIQE 67 (517) T ss_pred hcccchhHhhcCCceec-CHH---HHHHHHHHHHHhcCCCcccc--ccc------------------------ccccccc Confidence 22111 11122 333 57799999999999987652 110 0112345 Q ss_pred cceeecChHHHHHHHHHHhhcCCCceEeeCCCc---------hHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEE Q lcl|NC_016654. 74 PKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKS---------KEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIV 144 (533) Q Consensus 74 ~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~---------~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~ 144 (533) ++++|+|+|+.||+++|+|||+|+++|++++.+ ..++++|++++++|+|..++.++++.|+++|++||||| T Consensus 68 ~~~~sl~~~~~i~~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~ 147 (517) T protein:vir:98 68 RDYMTLNLRKLSADVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPY 147 (517) T ss_pred cceeecCcHHHHHHHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEE Confidence 678999999999999999999999999997643 34789999999999999999999999999999999999 Q ss_pred EcCCCCCceEEEEEcCCeEEEE-EecCCceEEEEEEEEe--ecCCceEEEEEEEec---------CeeEEEEEEec-cCC Q lcl|NC_016654. 145 WDPTIADNAWIDFVDADRAIPE-FRWGRLVAVTFWSELA--GGDGQEVWRHLERHE---------SGYIVHAVYKG-TAT 211 (533) Q Consensus 145 ~D~~~~~~~~i~~v~~~~~~P~-~~~g~~~~v~f~~~~~--~~~~~~~y~~lE~h~---------~~~I~~~~y~~-~~~ 211 (533) ||+ ++++|++|+|++|+|+ |+.+++++++|+.... .+++..+||+||+|+ .|+|+|++|++ +++ T Consensus 148 ~d~---~~~~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~ 224 (517) T protein:vir:98 148 VDN---GEIEFSWALANAFYPLRSNSNGISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEG 224 (517) T ss_pred EeC---CeeEEEEEcCCeeEEEEecCCCeEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEEecCCCc Confidence 995 5799999999999995 7788899988765433 355667999999995 47899999996 466 Q ss_pred cccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHH Q lcl|NC_016654. 212 SLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIY 291 (533) Q Consensus 212 ~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~ 291 (533) +||.+|||+++ |+++.+++ .+.+..+|+|+||+|++.|+.+. .||+|+|+|++++ ++||+||++| T Consensus 225 ~lG~~v~L~~~--~e~l~~~~-------~~~g~~~Plf~y~~~p~~N~~~~-----~splG~S~~~~a~-~~~d~lD~~~ 289 (517) T protein:vir:98 225 EIGKRIPLEEL--YEGMQEKT-------YIQGLSRPLFNYLKPSGFNNINP-----HSPLGLGITDNSV-STLKKINDTY 289 (517) T ss_pred ccccccccccc--ccCCCcce-------eECCCCcceEEEecCCccccccc-----CCCCCCchhhhhH-HHHHHHHHHH Confidence 89999999988 45555432 33344568888999988887765 3889999999865 9999999999 Q ss_pred HHHHHHHHhCcceeeechHHhcCCC----CccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHH Q lcl|NC_016654. 292 SSLMRDFRIGAGKVHASESVLTNLG----MGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALL 367 (533) Q Consensus 292 s~~~~~~~~~~~~i~v~~~~l~~~~----~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~ 367 (533) |+|+|+|++|+.+|+||+++++... ...+..||.++++|+.+... .++.+++.+||+||+++|+++++.+ T Consensus 290 s~~~~e~~~g~~~i~vp~~~l~~~~~~~g~~~~~~~d~~~~~y~~~~~~------~~~~~i~~~~~~iR~e~~~~~~~~~ 363 (517) T protein:vir:98 290 DQFWWEIKMGQRTVFVSDVMLRTVPDESGMPPPQVFDPDVNVYKSIRMG------TDEEFVKDVTHDIRTEQYKEAINQA 363 (517) T ss_pred HHHHHHHHhCCcceecChhhhccccCCCCcccCCCCCcccceeeeccCC------CCCCceeeeccccchHHHHHHHHHH Confidence 9999999999999999999996443 33456688899999887643 2245699999999999999999999 Q ss_pred HHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hccCCCCCCceeEEEE Q lcl|NC_016654. 368 LREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAI-KFPGKGAAPSEELELE 446 (533) Q Consensus 368 l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~-~~~~~~~~~~~~v~i~ 446 (533) |++++++||+|+++||+++++.+|||||+++++++++|+++|++.|+.+|++|+++|+++.+. .++++...+..+|+|+ T Consensus 364 L~~i~~~~Gls~~t~~~~~~~~kTATEi~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~ 443 (517) T protein:vir:98 364 LRTLEMELKLSVGTFSFDGRSMKTATEIVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVD 443 (517) T ss_pred HHHHHHHhCCCcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEE Confidence 999999999999999999999999999999999999999999999999999999999998775 3556666778899999 Q ss_pred eCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCC Q lcl|NC_016654. 447 WPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDP 526 (533) Q Consensus 447 f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (533) |+|++++|++++++++++++++|+||++++|+++|+ ++|+||++|++||++|++.++|.. .. ++ .++ T Consensus 444 f~D~i~~D~~~~~~~~~~~v~aG~ms~~~~i~~~~g-~~eeeA~~e~~~i~~E~~~~~~~~-~~--~~---------~~~ 510 (517) T protein:vir:98 444 FDDGVFQDRSALLRFYGQAKTFGFIPTVEAIQRIFK-VPKKTAEQWLEEIRKDQIELDPVT-IS--QR---------AQK 510 (517) T ss_pred cCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHHhCC-CChHHHHHHHHHHHHhccccCCCC-cc--cc---------ccC Confidence 999999999999999999999999999999999885 899999999999999998765421 11 10 011 Q ss_pred CCCCCCC Q lcl|NC_016654. 527 EAVDEGE 533 (533) Q Consensus 527 ~~~~d~~ 533 (533) ....|+| T Consensus 511 ~~~gd~e 517 (517) T protein:vir:98 511 RMFGDEE 517 (517) T ss_pred CCCCCCC Confidence 1222333 No 5 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=100.00 E-value=4.8e-108 Score=608.94 Aligned_cols=464 Identities=17% Similarity=0.170 Sum_probs=384.4 Q ss_pred CCCC----CCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCC-CCCcccc Q lcl|NC_016654. 1 MSLP----EANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPT-ATGRAPK 75 (533) Q Consensus 1 ~~~~----~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~~~ 75 (533) ++|= .+.-.= |++ .+.+|..|+.||.|++..+ +.++. .....++ T Consensus 22 ~~~~~~~~~~~i~~-~~~---~~~ri~~~~~~y~g~~~~~---------------------------~~~~~~~~~~~~~ 70 (508) T protein:vir:15 22 GSLSKITDDPRISI-DPD---EYVRIQTDLDYYSDKLQYI---------------------------HYQASDGIKKKRL 70 (508) T ss_pred cchHHhhccccccc-CHH---HHHHHHHHHHHhcCCCccc---------------------------ccccCCCCccccc Confidence 2211 011112 333 5678999999999976422 11111 1122355 Q ss_pred eeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEE Q lcl|NC_016654. 76 RYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWI 155 (533) Q Consensus 76 ~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i 155 (533) ++|+|+|+.||+++|+|||+||++++++++ +.++++|++++++|+|..++.++++.|+++|++|+|||||. ++++| T Consensus 71 ~~sln~~~~i~~~~A~lv~~e~~~i~v~~~-~~~~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~---~~~~i 146 (508) T protein:vir:15 71 KNTINMAKTAARRIASVVFNEKAEIHVKDN-NEADKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRPYIDG---NHIKI 146 (508) T ss_pred eeecchHHHHHHHHHhhhhCCCceEEeCCc-hHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEeC---CeeEE Confidence 689999999999999999999999999754 45788999999999999999999999999999999999995 47999 Q ss_pred EEEcCCeEEEE-EecCCceEEEEEEEEeec--CCceEEEEEEEec-----CeeEEEEEEeccC-Ccccceeehhhccccc Q lcl|NC_016654. 156 DFVDADRAIPE-FRWGRLVAVTFWSELAGG--DGQEVWRHLERHE-----SGYIVHAVYKGTA-TSLGWMMALTDHPATR 226 (533) Q Consensus 156 ~~v~~~~~~P~-~~~g~~~~v~f~~~~~~~--~~~~~y~~lE~h~-----~~~I~~~~y~~~~-~~lG~~v~l~~~~~~~ 226 (533) ++|+|++++|+ |+.+++++|+|++++... ++..+||+||+|+ +|+|+|++|++.. .++|.+|+|+++++|+ T Consensus 147 ~~v~ad~~~P~~~d~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~ 226 (508) T protein:vir:15 147 AWVRADQFYPLQSNTNDISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYK 226 (508) T ss_pred EEEcCCeeEEEEEcCCCeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhccccc Confidence 99999999996 788999999999888654 4567899999997 8999999999854 6899999999999999 Q ss_pred cccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceee Q lcl|NC_016654. 227 DIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVH 306 (533) Q Consensus 227 ~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~ 306 (533) ++.+++ .+.+..+|+|++|++|+.|+.+.. ||+|+|+|+++ +++||+||++||+|+|+|++|+.+|+ T Consensus 227 ~l~~~~-------~~~g~~~p~f~y~~~~~~N~~~~~-----splG~S~~~~~-~~lid~lD~~~s~~~~e~~~~~~~i~ 293 (508) T protein:vir:15 227 ELAPQV-------TISGLQRPLFAYFKTPGANNINIE-----SPLGLGVVDNA-KHVLDDINDTHDQFIWEIRLGQKHIA 293 (508) T ss_pred CCCcce-------EecCCCcceeEEecCCccccccCC-----CCcCCchHhhh-HHHHHHHHHHHHHHHHHHHhccccee Confidence 987653 234445688999999988887653 78999999986 59999999999999999999999999 Q ss_pred echHHhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCC Q lcl|NC_016654. 307 ASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSD 386 (533) Q Consensus 307 v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~ 386 (533) ||+++++.++.+ +..|+.++++|+.+.... +....++++||+||+++|.++++.++++|+++||+|+++||+++ T Consensus 294 v~~~~l~~d~~~-~~~~~~~~~~~~~~~~~~-----~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~ 367 (508) T protein:vir:15 294 VQPGMLRFDDEH-KPTFDTEQNVYVGVLSDD-----NNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSN 367 (508) T ss_pred echHHhcCCCCC-ccccCCCCeeEEeccCCC-----CCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhccccc Confidence 999999876655 457899999998775432 22345899999999999999999999999999999999999999 Q ss_pred CcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-c-CC-------CCCCceeEEEEeCCCCCCCHHH Q lcl|NC_016654. 387 EVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKF-P-GK-------GAAPSEELELEWPKFARESDLA 457 (533) Q Consensus 387 ~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~-~-~~-------~~~~~~~v~i~f~d~i~~d~~e 457 (533) ++.+|||||++++++|++++++|++.|+.+|++|+++|+++.+... . ++ ....+.+|+|+|+|++++|+++ T Consensus 368 ~~~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~ 447 (508) T protein:vir:15 368 DGVKTATEVVSNNSMTYQTRSSYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDK 447 (508) T ss_pred CccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHH Confidence 9999999999999999999999999999999999999999976421 1 11 1234678999999999999999 Q ss_pred HHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 458 KAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 458 ~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +++.+++++++|+||+++++++ |++|||+||++|++||++|++..++..+ ...+...+||| T Consensus 448 ~~~~~~~~v~aGi~s~e~~i~~-~~g~~deea~~el~ri~~E~~~~~~~~~--------------~~~~~~g~~ge 508 (508) T protein:vir:15 448 QLEEDAKVLAIGALSKQTFLQR-NYGMTDEQAAEELAKIQSEAPTDTFEGG--------------RSAILNGGDGE 508 (508) T ss_pred HHHHHHHHHhcCCCCHHHHHHh-cCCCChHHHHHHHHHHHHhccccCcccc--------------ccccCCCCCCC Confidence 9999999999999999999987 6789999999999999999876543221 11233334444 No 6 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=100.00 E-value=1e-104 Score=590.63 Aligned_cols=460 Identities=16% Similarity=0.145 Sum_probs=376.2 Q ss_pred CCCC----CCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccce Q lcl|NC_016654. 1 MSLP----EANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKR 76 (533) Q Consensus 1 ~~~~----~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 76 (533) +.|= .+.- ..|++ .+++|+.|+.||+|++..+. ++... +....+++ T Consensus 21 ~~~~~~~~~~~i-~~~~~---~~~~i~~~~~~Y~g~~~~~~-------------------------~~~~~-~~~~~~~~ 70 (500) T protein:vir:30 21 QSLTNITDHPKI-AISKL---EYDRITTNLKYYKSDWDSVL-------------------------YLNTD-GETKKRDL 70 (500) T ss_pred chhhhhhccccc-cCCHH---HHHHHHHHHHHhcCCCCCcc-------------------------cccCC-CCcccCce Confidence 2221 1121 23333 67789999999999753321 11111 11235678 Q ss_pred eecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEE Q lcl|NC_016654. 77 YHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWID 156 (533) Q Consensus 77 ~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~ 156 (533) +|+|+|+.||+++|+|||+||++|+++ ++.++++|++++++|+|+.++.++++.|+++|++|+|||||. ++|+|+ T Consensus 71 ~slnl~~~i~~~~A~lv~~e~~~i~~~--d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~---~~~~I~ 145 (500) T protein:vir:30 71 NHLPIARTAAKKIASLVFNEQAEIKVD--DDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG---DKVRVA 145 (500) T ss_pred eecchHHHHHHHHhhhhcCCcceEecC--ChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC---CceEEE Confidence 999999999999999999999999995 567899999999999999999999999999999999999984 579999 Q ss_pred EEcCCeEEEE-EecCCceEEEEEEEEe--ecCCceEEEEEEEec-----CeeEEEEEEecc-CCcccceeehhhcccccc Q lcl|NC_016654. 157 FVDADRAIPE-FRWGRLVAVTFWSELA--GGDGQEVWRHLERHE-----SGYIVHAVYKGT-ATSLGWMMALTDHPATRD 227 (533) Q Consensus 157 ~v~~~~~~P~-~~~g~~~~v~f~~~~~--~~~~~~~y~~lE~h~-----~~~I~~~~y~~~-~~~lG~~v~l~~~~~~~~ 227 (533) +|+|++++|+ |+.+++++++|+.++. ..++..+||+||+|+ +|+|+|++|++. ..++|.+|+|+++ |+. T Consensus 146 ~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~--~~~ 223 (500) T protein:vir:30 146 FVQAPVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEV--YKD 223 (500) T ss_pred EEcCCeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccc--cCC Confidence 9999999997 6778888888776543 345667999999997 799999999985 4689999999988 444 Q ss_pred ccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeee Q lcl|NC_016654. 228 IAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHA 307 (533) Q Consensus 228 ~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v 307 (533) +.++ ..+.+..+|+|++|+||+.|+.+.. ||+|+|+|+++ +++||+||++||+|+|+|++|+.+|+| T Consensus 224 l~~~-------~~~~~~~~p~f~~~~~~~~N~~~~~-----sp~G~S~~~~~-~~lid~lD~~~s~~~~e~~~g~~~i~v 290 (500) T protein:vir:30 224 LKDE-------AKVTDVTRPIFTYLKTPGMNNKDIN-----SPLGLSIFDNA-KTTIDFINTTYDEFMWEVKMGQRRVAV 290 (500) T ss_pred cCcc-------eEeccCCCccEEEecCCccccccCC-----CccCCchhhhh-HHHHHHHHHHHHHHHHHHHhCcceeee Confidence 4433 2344445688999999999887654 78999999986 599999999999999999999999999 Q ss_pred chHHhcCCCCc------cccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhh Q lcl|NC_016654. 308 SESVLTNLGMG------QGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVS 381 (533) Q Consensus 308 ~~~~l~~~~~~------~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~ 381 (533) |++|++....+ ....|+.++++|+.+... .+++..++.++|+||+++|.++++.+|++++++||||+++ T Consensus 291 ~~~~l~~~~~~~~g~~~~~~~~d~~~~~~~~~~~~-----~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~ 365 (500) T protein:vir:30 291 PESLTALTVRTTDGDVVPRPRFESDQNVYIRMGGR-----DLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGL 365 (500) T ss_pred chHHhcccCCCCCccccCCcccCCCcceEEEcCCC-----CCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccc Confidence 99999765332 234577788888876533 2233469999999999999999999999999999999999 Q ss_pred cccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hccCCCCCCceeEEEEeCCCCCCCHHHHHH Q lcl|NC_016654. 382 LGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAI-KFPGKGAAPSEELELEWPKFARESDLAKAQ 460 (533) Q Consensus 382 ~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~-~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~ 460 (533) ||++++|.+|||||++++++|++++++|++.|+.+|++|+++|+++.+. .++++......+|+|+|+|++++|++++++ T Consensus 366 ~~~~~~g~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~ 445 (500) T protein:vir:30 366 FSFDGKSMKTATEIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELD 445 (500) T ss_pred cccCcCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHH Confidence 9999999999999999999999999999999999999999999999775 345556667788999999999999999999 Q ss_pred HHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCC Q lcl|NC_016654. 461 TVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATD 525 (533) Q Consensus 461 ~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (533) ++++++++|+||++++|+++| +|+|+||++|++||++|+..+...+. +..++.|+ T Consensus 446 ~~~~~v~aGi~s~~~~i~~~~-g~~eeea~~~l~~i~~E~~~~~~~~~---------~~~~~~g~ 500 (500) T protein:vir:30 446 YWIKVVNAGFGTREMAIQKVL-NVTEEKAQEIAAEINTGIVDEINQQR---------TDTHLYGE 500 (500) T ss_pred HHHHHHHcCCCCHHHHHHhcC-CCCHHHHHHHHHHHHHhccccCCCCC---------ccccccCC Confidence 999999999999999999876 59999999999999998744322111 11222222 No 7 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=100.00 E-value=1e-104 Score=590.63 Aligned_cols=460 Identities=16% Similarity=0.145 Sum_probs=376.2 Q ss_pred CCCC----CCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccce Q lcl|NC_016654. 1 MSLP----EANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKR 76 (533) Q Consensus 1 ~~~~----~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 76 (533) +.|= .+.- ..|++ .+++|+.|+.||+|++..+. ++... +....+++ T Consensus 21 ~~~~~~~~~~~i-~~~~~---~~~~i~~~~~~Y~g~~~~~~-------------------------~~~~~-~~~~~~~~ 70 (500) T protein:vir:98 21 QSLTNITDHPKI-AISKL---EYDRITTNLKYYKSDWDSVL-------------------------YLNTD-GETKKRDL 70 (500) T ss_pred chhhhhhccccc-cCCHH---HHHHHHHHHHHhcCCCCCcc-------------------------cccCC-CCcccCce Confidence 2221 1121 23333 67789999999999753321 11111 11235678 Q ss_pred eecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEE Q lcl|NC_016654. 77 YHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWID 156 (533) Q Consensus 77 ~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~ 156 (533) +|+|+|+.||+++|+|||+||++|+++ ++.++++|++++++|+|+.++.++++.|+++|++|+|||||. ++|+|+ T Consensus 71 ~slnl~~~i~~~~A~lv~~e~~~i~~~--d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~---~~~~I~ 145 (500) T protein:vir:98 71 NHLPIARTAAKKIASLVFNEQAEIKVD--DDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG---DKVRVA 145 (500) T ss_pred eecchHHHHHHHHhhhhcCCcceEecC--ChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC---CceEEE Confidence 999999999999999999999999995 567899999999999999999999999999999999999984 579999 Q ss_pred EEcCCeEEEE-EecCCceEEEEEEEEe--ecCCceEEEEEEEec-----CeeEEEEEEecc-CCcccceeehhhcccccc Q lcl|NC_016654. 157 FVDADRAIPE-FRWGRLVAVTFWSELA--GGDGQEVWRHLERHE-----SGYIVHAVYKGT-ATSLGWMMALTDHPATRD 227 (533) Q Consensus 157 ~v~~~~~~P~-~~~g~~~~v~f~~~~~--~~~~~~~y~~lE~h~-----~~~I~~~~y~~~-~~~lG~~v~l~~~~~~~~ 227 (533) +|+|++++|+ |+.+++++++|+.++. ..++..+||+||+|+ +|+|+|++|++. ..++|.+|+|+++ |+. T Consensus 146 ~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~--~~~ 223 (500) T protein:vir:98 146 FVQAPVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEV--YKD 223 (500) T ss_pred EEcCCeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccc--cCC Confidence 9999999997 6778888888776543 345667999999997 799999999985 4689999999988 444 Q ss_pred ccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeee Q lcl|NC_016654. 228 IAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHA 307 (533) Q Consensus 228 ~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v 307 (533) +.++ ..+.+..+|+|++|+||+.|+.+.. ||+|+|+|+++ +++||+||++||+|+|+|++|+.+|+| T Consensus 224 l~~~-------~~~~~~~~p~f~~~~~~~~N~~~~~-----sp~G~S~~~~~-~~lid~lD~~~s~~~~e~~~g~~~i~v 290 (500) T protein:vir:98 224 LKDE-------AKVTDVTRPIFTYLKTPGMNNKDIN-----SPLGLSIFDNA-KTTIDFINTTYDEFMWEVKMGQRRVAV 290 (500) T ss_pred cCcc-------eEeccCCCccEEEecCCccccccCC-----CccCCchhhhh-HHHHHHHHHHHHHHHHHHHhCcceeee Confidence 4433 2344445688999999999887654 78999999986 599999999999999999999999999 Q ss_pred chHHhcCCCCc------cccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhh Q lcl|NC_016654. 308 SESVLTNLGMG------QGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVS 381 (533) Q Consensus 308 ~~~~l~~~~~~------~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~ 381 (533) |++|++....+ ....|+.++++|+.+... .+++..++.++|+||+++|.++++.+|++++++||||+++ T Consensus 291 ~~~~l~~~~~~~~g~~~~~~~~d~~~~~~~~~~~~-----~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~ 365 (500) T protein:vir:98 291 PESLTALTVRTTDGDVVPRPRFESDQNVYIRMGGR-----DLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGL 365 (500) T ss_pred chHHhcccCCCCCccccCCcccCCCcceEEEcCCC-----CCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccc Confidence 99999765332 234577788888876533 2233469999999999999999999999999999999999 Q ss_pred cccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hccCCCCCCceeEEEEeCCCCCCCHHHHHH Q lcl|NC_016654. 382 LGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAI-KFPGKGAAPSEELELEWPKFARESDLAKAQ 460 (533) Q Consensus 382 ~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~-~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~ 460 (533) ||++++|.+|||||++++++|++++++|++.|+.+|++|+++|+++.+. .++++......+|+|+|+|++++|++++++ T Consensus 366 ~~~~~~g~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~ 445 (500) T protein:vir:98 366 FSFDGKSMKTATEIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELD 445 (500) T ss_pred cccCcCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHH Confidence 9999999999999999999999999999999999999999999999775 345556667788999999999999999999 Q ss_pred HHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCC Q lcl|NC_016654. 461 TVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATD 525 (533) Q Consensus 461 ~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (533) ++++++++|+||++++|+++| +|+|+||++|++||++|+..+...+. +..++.|+ T Consensus 446 ~~~~~v~aGi~s~~~~i~~~~-g~~eeea~~~l~~i~~E~~~~~~~~~---------~~~~~~g~ 500 (500) T protein:vir:98 446 YWIKVVNAGFGTREMAIQKVL-NVTEEKAQEIAAEINTGIVDEINQQR---------TDTHLYGE 500 (500) T ss_pred HHHHHHHcCCCCHHHHHHhcC-CCCHHHHHHHHHHHHHhccccCCCCC---------ccccccCC Confidence 999999999999999999876 59999999999999998744322111 11222222 No 8 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=100.00 E-value=9.5e-104 Score=585.38 Aligned_cols=463 Identities=15% Similarity=0.155 Sum_probs=384.0 Q ss_pred CCC--------CCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCc Q lcl|NC_016654. 1 MSL--------PEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGR 72 (533) Q Consensus 1 ~~~--------~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 72 (533) |.+ ..++.++|+.. +.+|..|+.||+|+++.+.+... .....+. T Consensus 16 ~~~~~~~~~~~~~~~i~~~~~~----~~~i~~~~~~Y~g~~~~~~~~~~------------------------~~~~~~~ 67 (499) T protein:vir:80 16 MGLLKSLKDVTDHKKVNANDED----YKYIDMWKRLYQGNYAEWHNLNY------------------------EHNGNPV 67 (499) T ss_pred hccccchhhhhcCCCCcCCHHH----HHHHHHHHHHhcCCcchhhcccc------------------------ccCCCcc Confidence 432 35666776644 57899999999998765432110 0112234 Q ss_pred ccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCc Q lcl|NC_016654. 73 APKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADN 152 (533) Q Consensus 73 ~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~ 152 (533) .++++|+|+|+.||+++|+|||++|++|+++ ++.++++|++++++|+|+.++.++++.|+++|++|+|||+|++ ++ T Consensus 68 ~~~~~s~n~~~~iv~~~a~~l~~ep~~i~~~--d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~--~~ 143 (499) T protein:vir:80 68 NRRQLSMNLPKVTAKYMSKLLFNEKVKINID--DETAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGN--KN 143 (499) T ss_pred ccceeecchHHHHHHHHHHhhhCCcceEeeC--CHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEEEEECCC--Cc Confidence 5778999999999999999999999999995 5678999999999999999999999999999999999999975 57 Q ss_pred eEEEEEcCCeEEEE-EecCCceEEEEEEEEeecCCceEEEEEEEec-------CeeEEEEEEeccC-Ccccceeehhhcc Q lcl|NC_016654. 153 AWIDFVDADRAIPE-FRWGRLVAVTFWSELAGGDGQEVWRHLERHE-------SGYIVHAVYKGTA-TSLGWMMALTDHP 223 (533) Q Consensus 153 ~~i~~v~~~~~~P~-~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~-------~~~I~~~~y~~~~-~~lG~~v~l~~~~ 223 (533) |+|++++|++++|+ |+.+++++|+|++.+..++ .+||+||+|+ .|+|+|.+|+++. ..+|.+|+|.++ T Consensus 144 ~~i~~v~a~~~~Pi~~d~~~~~~~~f~~~~~~~~--~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~- 220 (499) T protein:vir:80 144 VKVSFATADCMYPLSNDSENVDECLIANSFHKNN--KYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLL- 220 (499) T ss_pred EEEEEEcCCceEEEEecCCCeEEEEEEEEEeecC--eEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhh- Confidence 99999999999997 5678999999999887654 5799999996 6899999999865 579999999876 Q ss_pred ccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcc Q lcl|NC_016654. 224 ATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAG 303 (533) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~ 303 (533) ++++.+ ...+.+..+++|++|++|..|+.+.. +|+|+|+|++ ++++||+||++||+++|+|++++. T Consensus 221 -~~~~~~-------~~~~~~~~~p~f~~~~~~~~N~~~~~-----splG~S~~~~-~~~lid~lD~~~s~~~~e~~~~~~ 286 (499) T protein:vir:80 221 -FNDIEP-------VVPLPSLTRPTFIYIKPNIANNKNLT-----SPLGISVYAN-ALDTLKTLDLMFDSYYQEFKLGKK 286 (499) T ss_pred -ccCcCC-------ceeecCCCccceEeecCCccccccCC-----CccCCchHhh-HHHHHHHHHHHHHHHHHHHHhccc Confidence 334433 23455667889999999998887763 7899999998 569999999999999999999999 Q ss_pred eeeechHHhcCCCCcc---ccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChh Q lcl|NC_016654. 304 KVHASESVLTNLGMGQ---GVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPV 380 (533) Q Consensus 304 ~i~v~~~~l~~~~~~~---~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~ 380 (533) +|+||++|++...++. ...|+.+.++|+.+.....+ .+..+++++++||+++|+++|+.++++|+++||+|++ T Consensus 287 ~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~----~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~ 362 (499) T protein:vir:80 287 KVLVPSSFVKTAVNLDGSTTQYFDSTDEAFFLYQGEQDD----NGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAG 362 (499) T ss_pred ceecchhhhhccCCCCCCcccCCCcccceeeEeeccCCC----CcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChh Confidence 9999999997654433 35688888888877544322 2346999999999999999999999999999999999 Q ss_pred hcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-cCCCCCCceeEEEEeCCCCCCCHHHHH Q lcl|NC_016654. 381 SLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKF-PGKGAAPSEELELEWPKFARESDLAKA 459 (533) Q Consensus 381 ~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~-~~~~~~~~~~v~i~f~d~i~~d~~e~a 459 (533) +||++.+|.+|||||++++++|++++++|++.|+.+|++|+++|+++.+... .++...+..+++|+|+|++++|+++++ T Consensus 363 ~fg~~~~g~~TAtei~s~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~ 442 (499) T protein:vir:80 363 TFTFDENGLKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTI 442 (499) T ss_pred hcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHH Confidence 9999999999999999999999999999999999999999999999876432 234455678999999999999999999 Q ss_pred HHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCC Q lcl|NC_016654. 460 QTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPE 527 (533) Q Consensus 460 ~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (533) +++++++++|+||+++++++ +++++|+||++|++||++|++++.|....+ ...|+.| T Consensus 443 ~~~~~~~~~Gi~S~et~l~~-~~~~~d~ea~~el~~i~~E~~~~~~~~d~~----------g~~ge~e 499 (499) T protein:vir:80 443 NRYTTAKNQGMIPLKIALQR-AWNITEAEADEWAEMLAKEKQAEIPNNDMT----------GIFGEEE 499 (499) T ss_pred HHHHHHHHcCCCCHHHHHhh-cCCCChHHHHHHHHHHHHHhhcCCCCCCcc----------ccCCCCC Confidence 99999999999999999987 567899999999999999997765421111 0111111 No 9 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=100.00 E-value=1.5e-98 Score=556.88 Aligned_cols=463 Identities=16% Similarity=0.157 Sum_probs=380.4 Q ss_pred CCC--------CCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCc Q lcl|NC_016654. 1 MSL--------PEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGR 72 (533) Q Consensus 1 ~~~--------~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 72 (533) |-+ =.++.+|||.. +.+|..|+.||.|.+..+.+... .....+. T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~----~~~i~~~~~yy~g~~~~~~~~~~------------------------~~~~~~~ 67 (496) T protein:vir:38 16 MGLLKALKDVKDHKKVNANDED----YKYIDMWKRLYQGHYAEWHNLNY------------------------EHNGNPV 67 (496) T ss_pred hccchhhHHHHhcCCCcCCHHH----HHHHHHHHHHhcCCCchhhcchh------------------------ccCCCcc Confidence 432 13566777644 57899999999998765422110 0011223 Q ss_pred ccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCc Q lcl|NC_016654. 73 APKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADN 152 (533) Q Consensus 73 ~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~ 152 (533) .++++|+|+||.||+++|+|||++||+|+++ ++..+++|++++++|+|++++.++++.|+++|++|++||+|++ ++ T Consensus 68 ~~~~~~~n~~k~i~~~~a~~l~~~p~~i~~~--d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~--~~ 143 (496) T protein:vir:38 68 NRRQLSMNLPKVTAKYMSKLLFNEKVKINID--DKAAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGN--KN 143 (496) T ss_pred ccceeecchHHHHHHHHhhhhhCCcceEeeC--ChHHHHHHHHHHhccCHHHHHHHHHHHHhhhCcEEEEEEEcCC--Cc Confidence 5678999999999999999999999999994 5678999999999999999999999999999999999999975 57 Q ss_pred eEEEEEcCCeEEEEEe-cCCceEEEEEEEEeecCCceEEEEEEEec----CeeEEEEEEeccC-Ccccceeehhhccccc Q lcl|NC_016654. 153 AWIDFVDADRAIPEFR-WGRLVAVTFWSELAGGDGQEVWRHLERHE----SGYIVHAVYKGTA-TSLGWMMALTDHPATR 226 (533) Q Consensus 153 ~~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~~~~~~~y~~lE~h~----~~~I~~~~y~~~~-~~lG~~v~l~~~~~~~ 226 (533) ++|++++|++++|+|. .+++++|+|++.+..++ .+|++||+|+ .|+|+|.+|++.+ +++|.+|+++++++ T Consensus 144 ~~i~~v~~~~~~P~~~~~~~~~~~~f~~~~~~~~--~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~-- 219 (496) T protein:vir:38 144 VKVSFATADCMYPLSNDSENVDECVIANSFHKNN--KYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFD-- 219 (496) T ss_pred EEEEEEcccceEEEEecCCcEEEEEEEEEEEeCC--eEEEEEEEEEEeCceEEEEEEEEecCCccccCcccccccccc-- Confidence 9999999999999865 58899999999887644 4789999997 8999999999854 57999999988743 Q ss_pred cccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceee Q lcl|NC_016654. 227 DIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVH 306 (533) Q Consensus 227 ~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~ 306 (533) .+.+ ...+.+..+++|+++.+|..|+.++. +|+|+|+|++ ++++||+||+++|+++|+|++++.+|+ T Consensus 220 ~~~~-------~~~~~~~~~~~f~~~~~~~~N~~~~~-----~p~G~Sd~~~-~~~lid~ld~~~s~~~~~~~~~~~~i~ 286 (496) T protein:vir:38 220 DIEP-------VVPLPDFTRPTFIYIKPNIANNKNLT-----SPLGISVYAN-ALDTLKTLDLMFDSYYQEFKLGKKKVL 286 (496) T ss_pred cccc-------ceeecCCCcceEEEecCCcccccccC-----CcCCCchHhh-HHHHHHHHHHHHHHHHHHHhhccccee Confidence 2222 23445667788999999988877764 7899999997 569999999999999999999999999 Q ss_pred echHHhcCCCCcc---ccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcc Q lcl|NC_016654. 307 ASESVLTNLGMGQ---GVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLG 383 (533) Q Consensus 307 v~~~~l~~~~~~~---~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g 383 (533) ||+++++...++. ...|+.+.+.|..+..... ++...++.++++||+++|+++++.+++++++.||+|+++|| T Consensus 287 v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~ 362 (496) T protein:vir:38 287 VPSSFVKTAVNLDGSTTQYFDSTDEAFFLYQGDQD----DNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFT 362 (496) T ss_pred cchHHhhccCCCCCccccCCCCccceEEEeecCCC----cccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcC Confidence 9999997654433 3457777788876654322 23346899999999999999999999999999999999999 Q ss_pred cCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-ccCCCCCCceeEEEEeCCCCCCCHHHHHHHH Q lcl|NC_016654. 384 LSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIK-FPGKGAAPSEELELEWPKFARESDLAKAQTV 462 (533) Q Consensus 384 ~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~-~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~ 462 (533) ++.+|.+||+||++++++|++++++|++.|+.+|+++++++|++.+.. ..++......+++|+|+|++++|++++++++ T Consensus 363 ~~~~g~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~ 442 (496) T protein:vir:38 363 FDENGLKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRY 442 (496) T ss_pred CCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHH Confidence 999999999999999999999999999999999999999999987643 3344455677899999999999999999999 Q ss_pred HHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCC Q lcl|NC_016654. 463 QAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPE 527 (533) Q Consensus 463 ~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (533) ++++++|+||+++++++ ++++++++|++|++||++|+++++|.... +..+|+.| T Consensus 443 ~~~~~~GiiS~et~l~~-~~~~~d~ea~~el~ri~~E~~~~~~~~d~----------~~~~~~~e 496 (496) T protein:vir:38 443 TNAKNQGMIPLKIALQR-AWNITEAEADEWAEMLAKEKQAEMPNNDM----------NGIFGEEE 496 (496) T ss_pred HHHHhcCCCCHHHHHHh-cCCCChHHHHHHHHHHHHhhhccCccccc----------cCCCCCCC Confidence 99999999999999986 67899999999999999999766542111 11112211 No 10 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=100.00 E-value=1.6e-58 Score=337.42 Aligned_cols=475 Identities=13% Similarity=0.018 Sum_probs=298.7 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) -..+...+.+.+......+. .++. ....+|.+||.+.+... ...+.. ...+.......+..++|+++| T Consensus 20 ~~~~~~~~~~~~~~i~~~i~---~~~~---~~~~~~~~YY~g~~~i~-~~~~~~-----~~~~~~~~~~~~~~~~ri~~n 87 (503) T protein:vir:59 20 VESAKEIAEPDTTMIQKLID---EHNP---EPLLKGVRYYMCENDIE-KKRRTY-----YDAAGQQLVDDTKTNNRTSHA 87 (503) T ss_pred hhhhhhccchhHHHHHHHHH---hhcH---HHHHHHHHHhccccchh-hccchh-----cccccccccccccccceeecc Confidence 24445555555544333222 2211 12244555555543210 000000 000111122334557799999 Q ss_pred hHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcC Q lcl|NC_016654. 81 IPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDA 160 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~ 160 (533) |++.||++.|+||||+|+++++ +++.+++.|+.+++ |+|...+.++++.++++|.+|+++|+|++ ++++|.+++| T Consensus 88 ~~~~ivd~~~~yl~g~~~~~~~--~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~d--g~~~i~~~~p 162 (503) T protein:vir:59 88 WHKLFVDQKTQYLVGEPVTFTS--DNKTLLEYVNELAD-DDFDDILNETVKNMSNKGIEYWHPFVDEE--GEFDYVIFPA 162 (503) T ss_pred hHHHHHHHHHhhhhcCCeeecc--CcHHHHHHHHHHHh-cCHHHHHHHHHHHHhhCCeEEEEEeecCC--CceEEEEEcc Confidence 9999999999999999999876 55678888988876 78999999999999999999999999976 5799999999 Q ss_pred CeEEEEEecCCceEE-EEEEEEee-cCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCc Q lcl|NC_016654. 161 DRAIPEFRWGRLVAV-TFWSELAG-GDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRG 238 (533) Q Consensus 161 ~~~~P~~~~g~~~~v-~f~~~~~~-~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~ 238 (533) .+++|+|+++...++ +|++.+.. ......++++|.|++++|.+..+.+....++..... ......... T Consensus 163 ~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~-----~~~~~~~~~----- 232 (503) T protein:vir:59 163 EEMIVVYKDNTRRDILFALRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGE-----NNPRPHMTK----- 232 (503) T ss_pred ceeEEEEeCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCcccccccccc-----cccccceee----- Confidence 999999987543332 23333332 222345678999999999874444333222211110 000000000 Q ss_pred eeecCCCc-cceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCC Q lcl|NC_016654. 239 AYVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGM 317 (533) Q Consensus 239 ~~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~ 317 (533) ....++.. .|++.|.. +..|.|+|.. +.+|||++|.++|+++++++....++.|...+ .. T Consensus 233 ~~~~~~~~~vPiv~~~n--------------n~~~~sd~~~-~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~----~~ 293 (503) T protein:vir:59 233 GGQAIGWGRVPIIPFKN--------------NEEMVSDLKF-YKDLIDNYDSITSSTMDSFSDFQQIVYVLKNY----DG 293 (503) T ss_pred cceeccCCccceEEecC--------------CCCCCcchhh-hHHHHHHHHHHHHHHHHHHHHhcCCeeEeecC----Cc Confidence 01122222 33344332 2358999987 77999999999999999999888888874332 11 Q ss_pred ccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHH Q lcl|NC_016654. 318 GQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASG 397 (533) Q Consensus 318 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~ 397 (533) .....+..+...+..+.. ..+++ ++.++++++.+.+...++.+.+.|...++.+.-.++ ..++..||+||++ T Consensus 294 ~~~~~~~~~~~~~~~~~~---~~~~~----~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~~~~~Sg~Ai~~ 365 (503) T protein:vir:59 294 ENPKEFTANLRYHSVIKV---SGDGG----VDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPE-TIGGGATGPALEN 365 (503) T ss_pred cccchhhhhhhcccceec---cCCCc----ceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCcc-cccccccHHHHHH Confidence 111112222222333221 12222 456777888887777777777767666655432322 2245679999999 Q ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHH Q lcl|NC_016654. 398 KKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKV 477 (533) Q Consensus 398 ~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v 477 (533) +++.+.++++++++.|+.+|++++++|+.+.+.. ++........++|.|++++|.|..+.++++++++++|+||+||++ T Consensus 366 ~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~-~~~~~~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~GiiS~et~l 444 (503) T protein:vir:59 366 LYALLDLKANMAERKIRAGLRLFFWFFAEYLRNT-GKGDFNPDKELTMTFTRTRIQNDSEIVQSLVQGVTGGIMSKETAV 444 (503) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-cCcccccccceeEEeCCCCCCCHHHHHHHHHHHHhCCCCchHHHH Confidence 9999999999999999999999999998876532 222223456799999999999999999999999999999999999 Q ss_pred HHhCCCCCHHHHHHHHHHHHHhhhcccCcccccccc-CCCCCCC-CCCC--CCCCCCCCC Q lcl|NC_016654. 478 AYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQ-PPLPTEN-DPAT--DPEAVDEGE 533 (533) Q Consensus 478 ~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~-~~~~~~~-~~~~--~~~~~~d~~ 533 (533) .+ +|.+++ +++|++||++|+...........+. +...++. +.++ +.+....|. T Consensus 445 ~~-l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 501 (503) T protein:vir:59 445 AR-NPFVQD--PEEELARIEEEMNQYAEMQGNLLDDEGGDDDLEEDDPNAGAAESGGAGQ 501 (503) T ss_pred Hh-CCCCCC--HHHHHHHHHHHHHHHHhhhccccCccCCCCCCCcCCCCCCcccCCCCCC Confidence 86 666654 6789999999886443211111111 1111111 1111 122222222 No 11 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=100.00 E-value=3.3e-57 Score=330.25 Aligned_cols=463 Identities=11% Similarity=0.051 Sum_probs=298.9 Q ss_pred cchHHHHHHHHhhhHhhc---CCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHH Q lcl|NC_016654. 13 PELAAVTARVAESHVWWE---GDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLS 89 (533) Q Consensus 13 ~~~~~~~~~~~~~~~w~~---gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~ 89 (533) =+.+..-.-|..+..=.. ....+|.+||.+.+.-...... ....-..+.+...+.+++|+++||++.||++. T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~-----~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~ 75 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNG-----KAKLNKEGKKDPLRSADNRIPSNFYQLLVDQE 75 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccc-----hhcccccccccccccCCcccccchHHHHHHhh Confidence 111111111111110000 0112334444443321000000 00000011223445668899999999999999 Q ss_pred HHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEec Q lcl|NC_016654. 90 TTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRW 169 (533) Q Consensus 90 a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~ 169 (533) |+||||+|++++++ ++.+++.|+++++. +|...+.++++.++++|.+|+++|+|++ +.+++.+++|.+++|+|++ T Consensus 76 ~~yl~G~p~~~~~~--d~~~~~~l~~~~~~-~~~~~~~~l~~~~~~~G~a~~~~y~d~~--~~~~~~~~~p~~~~~v~d~ 150 (470) T protein:vir:10 76 AGYVASVFPDIDVG--KDADNKKIIDVLGD-DRALTLNGLLVDSSNAGRAWLHYWIDED--GNFRYGIIQPDQITPIYAT 150 (470) T ss_pred hhheeccceeeecC--chHHHHHHHHHHhh-hHHHHHHHHHHHHhhcCeeEEEEEecCC--CceEEEEEcccceEEEEcC Confidence 99999999999885 45678899999985 6899999999999999999999999986 4689999999999999986 Q ss_pred C---CceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCc Q lcl|NC_016654. 170 G---RLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVK 246 (533) Q Consensus 170 g---~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~ 246 (533) + ++..++.+......++...++++|.|+.+.|.|..+.++...+.... ..... .......+........++.. T Consensus 151 ~~~~~~~a~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~--~~~~~--~~~~~~~~~~~~~~~~~~~g 226 (470) T protein:vir:10 151 TLDNKLLGILRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPY--NIITS--YDLSAGYETGQSNTLKHNFG 226 (470) T ss_pred CCCCceEEEEEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceecccc--ccccc--cccccccccccccccccCCC Confidence 4 45554433233334444566889999999998866555433221110 00000 00000111111222233333 Q ss_pred c-ceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCCccccccCc Q lcl|NC_016654. 247 D-LTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGMGQGVSLDE 325 (533) Q Consensus 247 ~-~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~~~~~d~ 325 (533) . |++.|..| ..|.|+|.. +++|||++|.++|++++.++.....++|-..+ .......+.. T Consensus 227 ~vPvv~~~nn--------------~~g~sd~e~-v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~----~~~~~~~~~~ 287 (470) T protein:vir:10 227 RVPFIEFSKN--------------KYRLPELNK-YKGLIDAYDDIYNGFINDLDDVQTVILVLTNY----GGADLHQFMN 287 (470) T ss_pred eeeEEEeecC--------------CCCCCchhH-HHHHHHHHHHHHHHHHHHHHHhcCcceeeecC----Cccccchhhh Confidence 2 33333322 358999986 88999999999999999998766666662222 1111112222 Q ss_pred chhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHH Q lcl|NC_016654. 326 EQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKT 405 (533) Q Consensus 326 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~ 405 (533) +...+.++...... .+....+++++++++++.+...++.+.+.|...++.+.. ++...|..||.||+++++.+.++ T Consensus 288 ~~~~~~~i~~~~~~--~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~--~~~~~gn~Sg~Alk~~~~~l~~k 363 (470) T protein:vir:10 288 DLRKYKSIKINNTG--NGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGIDP--ANFESSNASGVAIKMLYSHLELK 363 (470) T ss_pred hhhhcCeEeccCCC--CCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCCC--CccccccchHHHHHHHHHHHHHH Confidence 23334444333221 112234788999999999999999999999988877653 33445678999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC Q lcl|NC_016654. 406 TRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWD 485 (533) Q Consensus 406 ~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~ 485 (533) |+++++.|+++|++++++|+.+.+ ....+...++|+|++.+|.|..+.+++++++ +|+||.||++++ +|.++ T Consensus 364 ~~~~~~~~~~~l~~~~~~i~~~l~-----~~~~d~~~i~i~f~~~~p~d~~e~~~~~~~~--~g~iS~et~l~~-~p~v~ 435 (470) T protein:vir:10 364 AAKTQTYFEHAINELVRAIMRYLN-----FSDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKA-NPIVD 435 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhc-----ccCcccceeeEEeccCCCCCHHHHHHHHHHH--hccCcHHHHHHh-CCCCC Confidence 999999999999999999987642 2334567899999999999999999999887 699999999986 67665 Q ss_pred HHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 486 DERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 486 dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) + +++|++||++|+....+.+.. ....+.+ .++|.| T Consensus 436 D--~~~E~eri~~E~~e~~~~~~~-------~~~~~~~----~~dde~ 470 (470) T protein:vir:10 436 D--WQQELKDLAKDKEENDPYSNQ-------ADELNGK----GVNDEQ 470 (470) T ss_pred C--HHHHHHHHHHHHHHHHHhhcc-------ccccCCC----CCCCCC Confidence 5 788999999998766543221 1111111 222222 No 12 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=100.00 E-value=4e-57 Score=329.78 Aligned_cols=468 Identities=11% Similarity=0.034 Sum_probs=296.6 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |+-.+-....=+..+..++.+....+ .-...+|.+||.+.+.... ........+..++|+++| T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~---~~r~~~l~~Yy~g~~~i~~--------------~~~~~~~~~~~~~ki~~n 93 (511) T protein:vir:96 31 YDGTESDLLQNVNEVSKYIEHHMDYQ---RPRLKVLSDYYEGKTKNLV--------------ELTRRKEEYMADNRVAHD 93 (511) T ss_pred cchhhhhhhccHHHHHHHHHHHHHhh---HHHHHHHHHHhcccCcccc--------------ccCcCcccccCcceeecc Confidence 44444433322333333332211110 1112344555544332100 001112234567899999 Q ss_pred hHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcC Q lcl|NC_016654. 81 IPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDA 160 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~ 160 (533) +|+.||+..++||||+|++++++ ++.+++.|+++++.|+|...+.++++.++++|.+|+++|+|++ +++++.+++| T Consensus 94 ~~k~Iv~~~~~yl~g~p~~~~~~--~~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded--~~~~i~~~~p 169 (511) T protein:vir:96 94 YASYISDFINGYFLGNPIQYQDD--DKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD--DETRLYKSDA 169 (511) T ss_pred hHHHHHHHHHhhhccCCceeecC--chHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCC--CceEEEEEcc Confidence 99999999999999999999874 4568899999999999999999999999999999999999875 4789999999 Q ss_pred CeEEEEEecCCceE-EEEEEEEee---c-CCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccccccc Q lcl|NC_016654. 161 DRAIPEFRWGRLVA-VTFWSELAG---G-DGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADE 235 (533) Q Consensus 161 ~~~~P~~~~g~~~~-v~f~~~~~~---~-~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~ 235 (533) .+++|+|++....+ ++|++.+.. . .......+.|.|++..|.+ |.......+ .+. ... T Consensus 170 ~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~--~~~~~~~~~---~~~---------~~~--- 232 (511) T protein:vir:96 170 MSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYR--YLTSRTNGL---KLT---------PRE--- 232 (511) T ss_pred ceeEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEE--EEecCCCcc---ccc---------ccc--- Confidence 99999998754333 334443332 1 1223345678899888765 333222110 000 000 Q ss_pred CCceeecCCCc-cceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcC Q lcl|NC_016654. 236 GRGAYVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTN 314 (533) Q Consensus 236 ~~~~~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~ 314 (533) .....++.. .|++.|. | ...|.|+|.+ +.+|||++|.++|++++.++....++.|...+... T Consensus 233 --~~~~~~~~~~vPvv~~~-n-------------n~~g~gd~e~-v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~ 295 (511) T protein:vir:96 233 --NGFESHSFERMPITEFS-N-------------NERRKGDYEK-VITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL 295 (511) T ss_pred --cccccccCCceeeEEec-C-------------CCCCCCchhh-hHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccC Confidence 001123322 2333333 2 1258899987 77999999999999999999887888775554322 Q ss_pred CCCcc-----ccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc Q lcl|NC_016654. 315 LGMGQ-----GVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA 389 (533) Q Consensus 315 ~~~~~-----~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~ 389 (533) ..... +..+......+......... ....+++++++++++.+...++.+.+.|+..++.+.-+++. .++. T Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~-~~~n 370 (511) T protein:vir:96 296 DPVEVRKQKEANVLFLEPTVYADSEGRETE----GSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDN-FSGT 370 (511) T ss_pred CchhhcccccccceecccccccccccccCC----CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cccc Confidence 11110 00000001111111111111 12236778888888888888888888888888776554432 2356 Q ss_pred hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCC Q lcl|NC_016654. 390 QTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVAS 469 (533) Q Consensus 390 ~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aG 469 (533) .||.||+++++.+.+++.+|++.|+.+|++++++|+.+.+.............++|.|++++|.|..+.+++++++ +| T Consensus 371 ~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl--~G 448 (511) T protein:vir:96 371 QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GG 448 (511) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccccccceEEeCCCCCCCHHHHHHHHHHH--hc Confidence 7999999999999999999999999999999999988755432222233445799999999999999999999987 69 Q ss_pred CCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCcccccc-ccC-CCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 470 AASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGT-DQP-PLPTENDPATDPEAVDEGE 533 (533) Q Consensus 470 i~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~-~~~-~~~~~~~~~~~~~~~~d~~ 533 (533) +||+||++++ +|.+++ +++|++||++|+....+...... ..+ +..++..++...+..+.+| T Consensus 449 ~iS~et~l~~-l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:96 449 KISQTTLMSL-FSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred cCChHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCcccccccccC Confidence 9999999986 565665 77899999999865433221111 111 1111111222223333333 No 13 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=100.00 E-value=4.8e-57 Score=329.37 Aligned_cols=462 Identities=11% Similarity=0.033 Sum_probs=303.8 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |+++...+.|.......... .++. -...++.+||.+.+... .+.+ .+. . -.+........++|+++| T Consensus 14 ~~~~~~~~~~~~~~i~~~~~---~~~~---~~~~~~~~yy~g~~~i~-~~~~-~~~---~--~~~~~~~~~~~~~ki~~~ 80 (479) T protein:vir:79 14 VQLKKESTINLVKVIEHYIL---KHRP---EKYKQGEEYYYGNTDVN-NKRR-YYL---L--DGAKVDDFTKVNNKAINN 80 (479) T ss_pred eccccCChhHHHHHHHHHHh---hhhH---HHHHHHHHHhccCCccc-cccc-ccc---c--ccccccccccCcceeecc Confidence 78887777776655444322 2221 01235566665544211 0000 000 0 001122334567799999 Q ss_pred hHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcC Q lcl|NC_016654. 81 IPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDA 160 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~ 160 (533) |++.||++.|+||||+|+++++ +++.+++.++.+++ |+|...+.++++.++++|.+|+++|+|++ ++++|.+++| T Consensus 81 ~~~~Ivd~~~~~l~g~p~~~~~--~~~~~~~~~~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~--~~~~i~~~~p 155 (479) T protein:vir:79 81 YHKLLVDQKVGYSVGNPIVFNA--DDDNLTKLLNDLLG-EEFDDTITELYLNASNKGVEWLHPYINRK--GEFKYVIIPA 155 (479) T ss_pred hHHHHHHHHHhhhhcCCceecc--CCHHHHHHHHHHHh-cCHHHHHHHHHHHHHhcCeEEEEEEeCCC--CceEEEEEcc Confidence 9999999999999999999977 45567788877765 78999999999999999999999999975 5799999999 Q ss_pred CeEEEEEecCCceEE-EEEEEEe--ecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCC Q lcl|NC_016654. 161 DRAIPEFRWGRLVAV-TFWSELA--GGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGR 237 (533) Q Consensus 161 ~~~~P~~~~g~~~~v-~f~~~~~--~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~ 237 (533) .+++|+|+++...++ +|++.+. ..+++ ...++|.|++..|.|..+.+ +.....+................ . T Consensus 156 ~~~~~v~d~~~~~~~~~~ir~y~~~~~~~~-~~~~~e~y~~~~i~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~---~ 229 (479) T protein:vir:79 156 EEAIPIWDSKRQRELVAFIRFYYIEDIDGN-KIKRVEYYTENDITYFIERG--NSFIQEFLYDEYGKMTDIQEGHF---R 229 (479) T ss_pred ceeEEEEeCCCCCceEEEEEEEEEeecCCc-eEEEEEEEeCCcEEEEEecC--Ccccccccccccccccccccccc---c Confidence 999999987643332 2233222 22333 34568999999998854433 33222222222221111111110 0 Q ss_pred ceeecCCCcc-ceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCC Q lcl|NC_016654. 238 GAYVETGVKD-LTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLG 316 (533) Q Consensus 238 ~~~~~~g~~~-~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~ 316 (533) ....+++... |++.|.+ ...|.|+|++ +.+|||++|.++|+++++++.....+.|...+ . T Consensus 230 ~~~~~~~~~~vPvv~~~n--------------n~~g~sd~~~-v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~----~ 290 (479) T protein:vir:79 230 INNKEQGWGKVPFIPFKN--------------NEKCVSDLTF-YKSLIDIYDNNISTLADNLDEIQEVIYVLKEY----P 290 (479) T ss_pred ccccccCCCcccEEEecC--------------CCCCCcchhh-hHHHHHHHHHHHHHHHHHHHHhhCceeeeecC----C Confidence 1112333332 3333322 2358999986 78999999999999999998877777763322 1 Q ss_pred CccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHH Q lcl|NC_016654. 317 MGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEAS 396 (533) Q Consensus 317 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~ 396 (533) +.....+.........+.. +.++ .++++++++..+.+...++.+.+.|+..++.+.. +++..|..||+|++ T Consensus 291 ~~~~~~~~~~~~~~~~i~~---~~~~----~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~--~~~~~gn~Sg~Ai~ 361 (479) T protein:vir:79 291 GTSLQEFIDNIRYYKSIKV---DGGG----GVDKLEINIPVEAKKELLDRLEKNIIIFGQGVNP--ESQNTGDKSGVALK 361 (479) T ss_pred ccccccchhhhhhccceec---CCCC----cceEEeccCCHHHHHHHHHHHHHHHHHHhCcccc--ccccccchhHHHHH Confidence 1111112222223333322 2222 2567888889998999999998888888876643 44445678999999 Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHH Q lcl|NC_016654. 397 GKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTK 476 (533) Q Consensus 397 ~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~ 476 (533) ++++.+.++|+++++.|+.+|+++++.++.+.+.. +.......+++|.|++++|.|..+.+++++++ +|+||.||+ T Consensus 362 ~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~--~~~~~~~~~i~i~f~~~~p~~~~~~a~~~~kl--~g~iS~et~ 437 (479) T protein:vir:79 362 FLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKIS--GNKSYDYKTVQITFNHSMIINEAEKIDMAAKS--TGIVSDETI 437 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--CCCccccccceEEeCCCCCcCHHHHHHHHHHH--hccCcHHHH Confidence 99999999999999999999999999998875532 44455677899999999999999999999987 599999999 Q ss_pred HHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCC Q lcl|NC_016654. 477 VAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEA 528 (533) Q Consensus 477 v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 528 (533) +++ +|.+++ +++|++||++|+....+.....+ .. +.++.++. T Consensus 438 l~~-l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~------~~-~~~~~~e~ 479 (479) T protein:vir:79 438 VSN-HPWVED--VNDELERLKKQEDTQKEYDDLIP------NN-QDGVIDET 479 (479) T ss_pred HHh-CCCCCC--HHHHHHHHHHHHHHHHHHHhccC------cc-cCCCcCcC Confidence 987 566665 77899999999865433211110 00 00111111 No 14 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=100.00 E-value=4.3e-57 Score=329.60 Aligned_cols=469 Identities=11% Similarity=0.023 Sum_probs=297.5 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) ||--+-....=+..+..++.+....+ .-...+|.+||.+.+..... ..........++|+++| T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~---~~r~~~l~~Yy~g~~~i~~~--------------~~~~~~~~~~~~ki~~n 93 (511) T protein:vir:99 31 YDGTESDLLQNVNEVSKYIEHHMDYQ---RPRLKVLSDYYEGKTKNLVE--------------LTRRKEEYMADNRVAHD 93 (511) T ss_pred cchhhhhhhccHHHHHHHHHHHHHhh---HHHHHHHHHHhcccCccccc--------------cCcccccccCcceeecc Confidence 55444443322333333333221111 11123455566554321100 00112233457899999 Q ss_pred hHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcC Q lcl|NC_016654. 81 IPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDA 160 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~ 160 (533) +|+.||+..++||||+|++++++ ++.+++.|+++++.|+|...+.++++.++++|.+|+++|+|++ +++++.+++| T Consensus 94 ~~k~Iv~~~~~yl~g~p~~~~~~--d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded--~~~~i~~~~p 169 (511) T protein:vir:99 94 YASYISDFINGYFLGNPIQYQDD--DKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD--DETRLYKSDA 169 (511) T ss_pred hHHHHHHHHHhhhcccCceeecC--chHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCC--CceEEEEEcc Confidence 99999999999999999999874 5568899999999999999999999999999999999999975 4799999999 Q ss_pred CeEEEEEecCCceE-EEEEEEEee---c-CCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccccccc Q lcl|NC_016654. 161 DRAIPEFRWGRLVA-VTFWSELAG---G-DGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADE 235 (533) Q Consensus 161 ~~~~P~~~~g~~~~-v~f~~~~~~---~-~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~ 235 (533) .+++|+|+++...+ ++|++.+.. . .......+.|.|++..|.+....+.... .+.. . T Consensus 170 ~~~~~vyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~-----~~~~--------~----- 231 (511) T protein:vir:99 170 MSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGL-----KLTP--------R----- 231 (511) T ss_pred ceeEEEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCccc-----cccc--------c----- Confidence 99999998753222 333443322 1 1223345678999988876333221110 0000 0 Q ss_pred CCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCC Q lcl|NC_016654. 236 GRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNL 315 (533) Q Consensus 236 ~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~ 315 (533) ......++...+++.+.+| ...|.|+|.+ +.+|||++|.++|++++.++.....+.+-..+.... T Consensus 232 -~~~~~~~~~g~vPvv~~~n-------------n~~g~sd~e~-v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~ 296 (511) T protein:vir:99 232 -ENGFESHSFERMPITEFSN-------------NERRKGDYEK-VITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLD 296 (511) T ss_pred -ccccccCCCCccceEEecC-------------CCCCCCchhh-hHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcccC Confidence 0011123333222333322 1358899986 789999999999999999987766666633322111 Q ss_pred CCcc-----ccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcch Q lcl|NC_016654. 316 GMGQ-----GVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQ 390 (533) Q Consensus 316 ~~~~-----~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~ 390 (533) ..+. ...+......+..........+ ..+++++++++++.+...++.+.+.|+..++.+.-+++ ..++.. T Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~-~~~gn~ 371 (511) T protein:vir:99 297 PVEVRKQKEANVLFLEPTVYADSEGRETEGS----VDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDD-NFSGTQ 371 (511) T ss_pred chhhcccccccceecccccccccccccCCCC----cceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccc-cccccc Confidence 1100 0001011111111111111222 23667888888888888888888888877777654443 223567 Q ss_pred hHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCC Q lcl|NC_016654. 391 TATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASA 470 (533) Q Consensus 391 Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi 470 (533) ||.||+++++.+..++..|++.|+.+|++++++|+.+...............++|.|++++|.|..+.+++++++ +|+ T Consensus 372 Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~~e~~~~~~kl--~Gi 449 (511) T protein:vir:99 372 SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGK 449 (511) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHH--hcc Confidence 999999999999999999999999999999999988765432222233445789999999999999999999988 499 Q ss_pred CCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccc-cccCCC-CCCCCCCCCCCCCCCCC Q lcl|NC_016654. 471 ASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFG-TDQPPL-PTENDPATDPEAVDEGE 533 (533) Q Consensus 471 ~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~~~~d~~ 533 (533) ||+||++++ +|.+++ +++|++||++|+....+..... +.++.. .++.+++......+++| T Consensus 450 iS~et~l~~-l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~e 511 (511) T protein:vir:99 450 ISQTTLMSL-FSFFQD--PELEVKKIEEDEKESIKKAQKNMYQDPRNINDDEQDDSTKDSIDKKE 511 (511) T ss_pred CCHHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHHHHhhcccccCCCCCCCCCCCCCcCcccccC Confidence 999999987 566665 7889999999986544322211 111111 22222222333444444 No 15 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=100.00 E-value=3.5e-56 Score=324.63 Aligned_cols=446 Identities=9% Similarity=0.006 Sum_probs=290.7 Q ss_pred CCCCCCcC-CCcC------cchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcc Q lcl|NC_016654. 1 MSLPEANT-AWPP------PELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRA 73 (533) Q Consensus 1 ~~~~~~~~-~~pp------~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 73 (533) |-.-++++ -+|+ ..+...++....... ...++.+||.+.+. +..+.+...+.. T Consensus 1 ~~~~~~~~~~~p~d~~~~~~~l~~~i~~~~~~~~----r~~~~~~yy~g~~~----------------i~~~~~~~~~~~ 60 (453) T protein:vir:39 1 MKYKPPKLMTFPKDEPITNEVVTKFMEKHRLEVA----RYEYLKNMYRGIMA----------------IDAEPTKDLWKP 60 (453) T ss_pred CeecCCcceEcCCCCCCCHHHHHHHHHHHHHHHH----HHHHHHHHhhccCc----------------hhcCCCccccCc Confidence 33333332 2333 222222222111100 11233444443221 222333445667 Q ss_pred cceeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCce Q lcl|NC_016654. 74 PKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNA 153 (533) Q Consensus 74 ~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~ 153 (533) ++|+++|||+.||+++|+||||+|++++++ ++..++.|++++++|+|...+.++++.++++|.+|+++|+|++ +.+ T Consensus 61 ~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~--d~~~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~~--g~~ 136 (453) T protein:vir:39 61 DNRLTVNFTKYIVDTFTGYFNGIPVKKSHS--DKETLSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNEE--TQT 136 (453) T ss_pred cceeecchHHHHHHHHhhhhcccCceeccC--ChHHHHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecCC--Cce Confidence 889999999999999999999999999874 4567899999999999999999999999999999999999975 468 Q ss_pred EEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccccc Q lcl|NC_016654. 154 WIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGA 233 (533) Q Consensus 154 ~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~ 233 (533) +|.+++|.+++|+|++....+.+++-++...++. .+++|.|++++|.+ |....+.+ . + . T Consensus 137 ~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~--~~~~~~yt~~~i~~--~~~~~~~~--~--~-------------~ 195 (453) T protein:vir:39 137 NVIYNTPENMFMVYDDTIKQEPLFAVRYGYDDDY--KLYGEVYTKETTYA--LNGTMGFY--N--M-------------T 195 (453) T ss_pred EEEEEcccceEEEecCCCCCeEEEEEEEEEeCCe--EEEEEEEeCCeEEE--EEecCCce--e--e-------------e Confidence 9999999999999987654455443333333332 34578999998865 33222110 0 0 0 Q ss_pred ccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhc Q lcl|NC_016654. 234 DEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLT 313 (533) Q Consensus 234 ~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~ 313 (533) ..++++...++++.++| .+.|.|+|.. +.+|+|++|.++|++++.++.....+.+-..+- T Consensus 196 -----~~~~~~~g~vPvv~~~n-------------~~~g~sd~e~-v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~~- 255 (453) T protein:vir:39 196 -----EQAPNPFDDLPVVEFYF-------------NEERMSIFES-VISLVNAFNKAISEKANDVDYFSDQYLTFLGAA- 255 (453) T ss_pred -----cccccCCCceeEEEecC-------------CCCCCcchhh-hHHHHHHHHHHHHHHHHHHHHhhCceeeeecCC- Confidence 01233333333333332 2358999986 789999999999999999976555555421110 Q ss_pred CCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHH Q lcl|NC_016654. 314 NLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTAT 393 (533) Q Consensus 314 ~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tat 393 (533) ........+.. ...+... ...+.+....+.++++++..+.+...++.+.+.|+..++.+. +++...|..||. T Consensus 256 -~~~~~~~~~~~----~~~~~~~-~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~~gn~Sg~ 327 (453) T protein:vir:39 256 -VEEEDLKNIRS----NRVINYY-GESSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVAN--ISDESFGSSSGV 327 (453) T ss_pred -CCchhhhhhhh----cceeeec-CCCCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcc--cccccccCChHH Confidence 00000011111 1111111 111112223467788888888888888888888888887653 333444567999 Q ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCH Q lcl|NC_016654. 394 EASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAAST 473 (533) Q Consensus 394 ai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~ 473 (533) ||+++++.|..+++++++.|+.+|++++++|+.+.+.. +.......|+|.|++++|.|..+.+++++++ +|+||+ T Consensus 328 Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~---~~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl--~g~is~ 402 (453) T protein:vir:39 328 SLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNV---SNKEAWKDIEYTFTRNEPKDIKEQAETANIL--MGITSQ 402 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc---CCccccccceEEeCCCCCcCHHHHHHHHHHH--hccCCh Confidence 99999999999999999999999999999998876532 2334556799999999999999999999987 689999 Q ss_pred HHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 474 KTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 474 et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ||++++ +|.+++ +++|++||++|+.......... . ...++.. +...+++.| T Consensus 403 et~l~~-l~~v~D--~~~E~~ri~~E~~~~~~~~~~~---~-~~~~~~~--~~~~~~~~e 453 (453) T protein:vir:39 403 ETALSV-ISVIPD--VQAEMEKIKKEEASTAIFDKDK---Q-PSEKGTD--TVVPETNEE 453 (453) T ss_pred HHHHHh-CCCCCC--HHHHHHHHHHHHHHHHHHHHhc---c-CCCCCCC--CCCCCcCCC Confidence 999986 565655 7789999999987544321111 0 0111111 112222222 No 16 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=100.00 E-value=3.8e-56 Score=324.40 Aligned_cols=468 Identities=9% Similarity=-0.016 Sum_probs=297.9 Q ss_pred CCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHH---HHHHHHHH------HHHhcccCCCCCcc Q lcl|NC_016654. 3 LPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGI---KARTKAAY------EAFHGRTPTATGRA 73 (533) Q Consensus 3 ~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~---~~~~~~~~------~~~~~~~~~~~g~~ 73 (533) |-+=|-||=+|...-+...++.-.. =+.+.|.++.... ..+...+ ..+|.|.. .+..+......+.. T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~---~~~~~i~~~i~~~-~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~ 76 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYE---TQEEMILRLVREH-KENIDNITMGERYYNHHPDILDAPPKRDVNGDYDETKP 76 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccC---CcHHHHHHHHHHH-HHHHHHHHHHHHHhcCCCchhccccccccccccccccc Confidence 5566667777776655555544211 0112222222111 1111111 11111110 01122223344566 Q ss_pred cceeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCce Q lcl|NC_016654. 74 PKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNA 153 (533) Q Consensus 74 ~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~ 153 (533) ++|+++|+|+.||+++|+||||+|+++++ +++.+++.|+++++ |+|..++.++++.++++|.+|+++|+|++ +++ T Consensus 77 ~~ki~~n~~~~ivd~~~~~l~g~~~~~~~--~~d~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~~~d~~--g~~ 151 (478) T protein:vir:10 77 DWRMYTNYHQNLVDQKVAYAVANPVTFGV--DNDKALKQIQHTLN-HKWDDKLVDILTAASNKGIEWVQPYVDEE--GEF 151 (478) T ss_pred cceeccchHHHHHHHHHhhhccCCeeeec--CChHHHHHHHHHHh-cCHHHHHHHHHHHHHhcCeEEEEEEecCC--Cee Confidence 78999999999999999999999999977 45567889999987 68999999999999999999999999976 578 Q ss_pred EEEEEcCCeEEEEEecCCceE-EEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccc Q lcl|NC_016654. 154 WIDFVDADRAIPEFRWGRLVA-VTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEG 232 (533) Q Consensus 154 ~i~~v~~~~~~P~~~~g~~~~-v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~ 232 (533) ++.+++|.+++|+|+++...+ ++|++.+...+. .++|.|++..|+|..+.. ..+... ............ T Consensus 152 ~~~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~----~~~~~y~~~~i~~~~~~~--~~~~~~----~~~~~~~~~~~~ 221 (478) T protein:vir:10 152 KTFRVPAEQAVPIWTNKERDELQAFIRVYELDGA----ERVEYWTKDDVTYYELKE--GQLIPD----FYRSDDHIQPHY 221 (478) T ss_pred EEEEEcccceEEEEcCCCCCceEEEEEEEEecCc----eEEEEEeCCeEEEEEEcC--Ceeecc----ccccccccccce Confidence 999999999999998765433 335555554433 235778888888744432 211100 000001111000 Q ss_pred cccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHh Q lcl|NC_016654. 233 ADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVL 312 (533) Q Consensus 233 ~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l 312 (533) .. ....++...+++.+.+| .+.|+|+|.. +.+|||++|.++|++++.++.....+++...+ T Consensus 222 ~~----~~~~~~~~~vPvv~~~n-------------~~~g~sd~~~-v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~- 282 (478) T protein:vir:10 222 YQ----GNKLMSWGRVPFIPFKN-------------NPQEVSDLFM-YKTIIDALDKRLSDTQNTFDESVELIYILKGY- 282 (478) T ss_pred ec----ccccccCCccceEEecc-------------CCCCCCcHHH-HHHHHHHHHHHHHHHHHHHHHhhCceeeeecC- Confidence 00 01122222222222222 3468999997 88999999999999999998766667764332 Q ss_pred cCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhH Q lcl|NC_016654. 313 TNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTA 392 (533) Q Consensus 313 ~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Ta 392 (533) .......+..+...++.+... .+.++ .+++++++++++++...++.+.+.|+..++.+..+++ ..+++.|| T Consensus 283 ---~~~~~~~~~~~~~~~~~~~~~-~~~~~----~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~~~n~Sg 353 (478) T protein:vir:10 283 ---EGEDMKDFMHNLKYYKAISVA-GESGS----GVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQD-KFGNSPSG 353 (478) T ss_pred ---CccccchhhhhhhhcceEEec-CCCCC----cceEEeecCChHHHHHHHHHHHHHHHHHhCccccCcc-ccccccHH Confidence 111111122222223333322 12222 2567788888888888888888888877776544433 22467799 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCC Q lcl|NC_016654. 393 TEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAAS 472 (533) Q Consensus 393 tai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S 472 (533) .||+++++.+.+++++|++.|+.+|++++++|+.+. +.......++|+|++++|.|..+.+++++++ +|+|| T Consensus 354 ~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~------g~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS 425 (478) T protein:vir:10 354 IALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFY------RLDVKVQDIEITFNFNVMVNELENSQIAMNS--TGLLS 425 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh------CCCcccccceEEecCCCCCCHHHHHHHHHHH--hCCCC Confidence 999999999999999999999999999999988763 2334566899999999999999999999987 79999 Q ss_pred HHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCC Q lcl|NC_016654. 473 TKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAV 529 (533) Q Consensus 473 ~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 529 (533) +||++++ +|.+++ +++|++||++|+....+....... ....++++.+.+.+++ T Consensus 426 ~et~~~~-l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 478 (478) T protein:vir:10 426 KETILSN-HAWVED--PVAEMERIEQENIELNQQLPDIEE-GLNGEQQRQSENNQPE 478 (478) T ss_pred hHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHhhcccccc-ccCCCCCCCCCCCCCC Confidence 9999986 565655 778999999998654443322211 1112222211112222 No 17 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=100.00 E-value=8.7e-56 Score=322.44 Aligned_cols=466 Identities=11% Similarity=-0.004 Sum_probs=295.0 Q ss_pred CCC------C-----CCcCCCcCcch----------------HHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHH Q lcl|NC_016654. 1 MSL------P-----EANTAWPPPEL----------------AAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKA 53 (533) Q Consensus 1 ~~~------~-----~~~~~~pp~~~----------------~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~ 53 (533) |-- . --+..|++... .-+...|..|+.=-.--..+|.+||.+..... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i------ 74 (502) T protein:vir:48 1 MMEQTLFTDSTGQDLVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDV------ 74 (502) T ss_pred CceeEEEEecchhHHHhhcccChhHHhhhcccchhhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc------ Confidence 000 0 00112332222 22222222222100001122333333321100 Q ss_pred HHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHhhcCCCceEeeCCC--chHHHHHHHHHHhhccHHHHHHHHHH Q lcl|NC_016654. 54 RTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGK--SKEVQARADLIFNTPRFHSSLVEAGE 131 (533) Q Consensus 54 ~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~--~~~~~~~l~~i~~~n~f~~~~~~~~~ 131 (533) .........+..++|+++|||+.||+..++||||+|+++++.++ ++.+++.|+++++.|+|...+.++++ T Consensus 75 --------~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~~~~~l~~~~~~N~~~~~~~~~~~ 146 (502) T protein:vir:48 75 --------LKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNEDNSQNDDAIKRIGRINDIDTHNRNLIR 146 (502) T ss_pred --------cccccccccccccceeecchHHHHHHHHhhhhcccCeeEecCCccchhHHHHHHHHHHhhcCHhHHHHHHHH Confidence 00011123355678999999999999999999999999998653 45678899999999999999999999 Q ss_pred HHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceE-EEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccC Q lcl|NC_016654. 132 SCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVA-VTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTA 210 (533) Q Consensus 132 ~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~-v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~ 210 (533) .++++|.+|+++|.|++ ++++|.+++|.+++|+|++....+ ++|++.+........+..+|.|++..|.+ |.... T Consensus 147 ~~~~~G~a~~~v~~ded--g~~~i~~~~p~~~~~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~iyt~~~i~~--~~~~~ 222 (502) T protein:vir:48 147 DLSQTGRAYEVIYRSEY--DETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNQHIYT--LDASD 222 (502) T ss_pred HHhhcCeEEEEEEeCCC--CceEEEEEcccceEEEEcCCCCCceEEEEEEEEEeecCCcEEEEEEEeCCeEEE--EEeCC Confidence 99999999999999975 468999999999999998643222 33444444332222345678898887754 33211 Q ss_pred CcccceeehhhccccccccccccccCCceeecCCCc-cceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHH Q lcl|NC_016654. 211 TSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDR 289 (533) Q Consensus 211 ~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~ 289 (533) . ...+ ...+++.. .|++.|.+| +.|.|+|.+ +.++||++|. T Consensus 223 ~--~~~~---------------------~~~~~~~g~vPvv~~~nn--------------~~g~sd~e~-v~~liDa~d~ 264 (502) T protein:vir:48 223 S--FNEI---------------------SVTPHAFGTVPITEFLNN--------------ADGIGDYET-ELYLIDLYDS 264 (502) T ss_pred c--eeec---------------------cceecCCCccceEEecCC--------------CCCCCchhh-hHHHHHHHHH Confidence 1 0000 01122222 334444332 358899987 7899999999 Q ss_pred HHHHHHHHHHhCcceeeechHHhcCCCCccccccCcchhhhhhccccc--cccccccccceeeechhhhhHHHHHHHHHH Q lcl|NC_016654. 290 IYSSLMRDFRIGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGG--FNANGDMETIFEFFQPAIRVLEHDQGAALL 367 (533) Q Consensus 290 ~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~--~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~ 367 (533) ++|++++.++....++.+...+.....+..+... ..+..+.... ...+......+++++++++++.+...++.+ T Consensus 265 ~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L 340 (502) T protein:vir:48 265 AESDTANHMSDMADAILAIYGDLALPQGMQASDM----KRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRL 340 (502) T ss_pred HHHHHHHHHHHhcCceeeeecCcccccccchhhh----hhcceeeccccccccccccCcceeEeeecCCHHHHHHHHHHH Confidence 9999999999888887774333211111111111 1112211111 111112233467888899999988889999 Q ss_pred HHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEe Q lcl|NC_016654. 368 LREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEW 447 (533) Q Consensus 368 l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f 447 (533) .+.|+..++++..+++.. ++..||.||+++++.+.+++.++++.|+.+|++++++|+.+.+.. ..........++|+| T Consensus 341 ~~~I~~~s~~p~~~~~~~-~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~-~~~~~~d~~~i~i~f 418 (502) T protein:vir:48 341 NKDIHVFTNTPDMSDNHF-SGNASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLV-NEFKDFDESRLKITF 418 (502) T ss_pred HHHHHHHhCCCCcCcccc-ccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-ccccccccccceEEe Confidence 999988888876555432 456799999999999999999999999999999999998876532 222334556799999 Q ss_pred CCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccC-ccc-cccccC-CCCCCCCCCC Q lcl|NC_016654. 448 PKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAP-TFG-FGTDQP-PLPTENDPAT 524 (533) Q Consensus 448 ~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~-~~~-~~~~~~-~~~~~~~~~~ 524 (533) ++++|.|..+.+++++++ +|+||++|++++ .|.+++ +++|++||++|+....- ... ...+.. ...++.+.+. T Consensus 419 ~~~~p~d~~e~a~~~~kl--~g~iS~et~l~~-l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~d~~~e~~ 493 (502) T protein:vir:48 419 TPNLPKSLYEQVSILNDL--GGQVSQETALSL-SGLVEN--PTEELDKINEESSKIDFKGYPSYFYDNVGKYTDEVKETH 493 (502) T ss_pred CCCCCcCHHHHHHHHHHH--hccCcHHHHHHh-CCCCCC--HHHHHHHHHHHHHhhhhhcccccccccccccCCCccCCC Confidence 999999999999999988 589999999987 565665 67899999999864321 111 111111 1111112222 Q ss_pred CCCCCCCCC Q lcl|NC_016654. 525 DPEAVDEGE 533 (533) Q Consensus 525 ~~~~~~d~~ 533 (533) .++.++--| T Consensus 494 ~~~~~~~~~ 502 (502) T protein:vir:48 494 TDDFERVYE 502 (502) T ss_pred CcCcCCCCC Confidence 233333333 No 18 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=100.00 E-value=6e-56 Score=323.35 Aligned_cols=455 Identities=11% Similarity=0.039 Sum_probs=290.3 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHH---HHHHHHH------HHHHhcccCCCCC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGI---KARTKAA------YEAFHGRTPTATG 71 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~---~~~~~~~------~~~~~~~~~~~~g 71 (533) ||+-+|.+ +-++..+.... .-+++.|.++.... ..+..++ ..+|.|. ..+.++......+ T Consensus 7 ~~~~~~~~-------~~~~~~~~~~~---~~~~~~i~~~i~~~-~~~~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~ 75 (474) T protein:vir:95 7 MPWDKPYG-------EEVVEQLKPQF---ETQEEMIIRLIDDH-RKQLDKITVGQRYYDKDNDIVKQMKKVDVYGNIDYD 75 (474) T ss_pred cCCCCchh-------hHHHHhhhhcc---CChHHHHHHHHHHH-HHHHHHHHHHHHHhcccCchhccccccccccccccc Confidence 55555554 11222221111 11222233222111 0011111 1111111 0112223334456 Q ss_pred cccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCC Q lcl|NC_016654. 72 RAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIAD 151 (533) Q Consensus 72 ~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~ 151 (533) .+++|+++|+++.||++.|+||||+|+++++ +++.+++.|+.+++ |+|...+.++++.++++|.+|+++|+|++ + T Consensus 76 ~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~--~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~--~ 150 (474) T protein:vir:95 76 KPDWRITTNFHQNLVDQKVSYVASKPVTYSC--EDESVLKIIHDVLD-TRWDNKLIDILTATSNKGIDWLQVYINEN--G 150 (474) T ss_pred cccceeccchHHHHHHHHHhhhccCCceecc--CchHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCcEEEEEEecCC--C Confidence 6778999999999999999999999999987 45668899999986 67999999999999999999999999875 5 Q ss_pred ceEEEEEcCCeEEEEEecCCceE-EEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccc Q lcl|NC_016654. 152 NAWIDFVDADRAIPEFRWGRLVA-VTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAV 230 (533) Q Consensus 152 ~~~i~~v~~~~~~P~~~~g~~~~-v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~ 230 (533) +++|.+++|.+++|+|+++...+ ++|++.+...+.. .++.|++..|.+..+.+ ..+... .+ ...... T Consensus 151 ~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~----~~~~y~~~~~~~~~~~~--~~~~~~-~~---~~~~~~-- 218 (474) T protein:vir:95 151 EMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEE----KVEFWTDTTVTYYVLEN--GGLIPD-YY---YGANHI-- 218 (474) T ss_pred ceEEEEEcccceEEEEcCCCCCceEEEEEEEEEcCee----EEEEEeCCeEEEEEEcC--Cccccc-cc---cCcccc-- Confidence 79999999999999998764333 3445555443332 35678877776633332 221110 00 000000 Q ss_pred cccccCCceeecCCC-ccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeech Q lcl|NC_016654. 231 EGADEGRGAYVETGV-KDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASE 309 (533) Q Consensus 231 ~~~~~~~~~~~~~g~-~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~ 309 (533) ......++. ..|++.|++| +.|.|+|+. +.+|||++|.++|+++++++.....+.+.. T Consensus 219 ------~~~~~~~~~g~iPvv~~~nn--------------~~g~sd~e~-v~~liDa~d~~~S~~~~~~~~~~~p~lv~~ 277 (474) T protein:vir:95 219 ------QSHFSNGNWGRVPFIAFKNN--------------PEEVSDIWM-YKSLIDAIDKRLSDAQNMFDESVELIYILK 277 (474) T ss_pred ------cccccccCCCccceEeecCC--------------CCCCCcHHH-HHHHHHHHHHHHHHHHHHHHHhcCceeeee Confidence 001112222 2344444432 358999987 789999999999999999987777776633 Q ss_pred HHhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc Q lcl|NC_016654. 310 SVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA 389 (533) Q Consensus 310 ~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~ 389 (533) .+ .......+......++.+.. +.++ .+++++++++++++...++.+.+.|+..++.+.-+++ ..++. T Consensus 278 g~----~~~~~~~~~~~~~~~~~i~~---~~~~----~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~~~n 345 (474) T protein:vir:95 278 GY----EGQDLEEFMRGLKYYKAINV---DGDG----GVETIQVEVPVSSTKEYIDLMRAYIMEFGQGVDFQTD-KFGSA 345 (474) T ss_pred cC----Ccccchhhhhhhhccceeec---cCCC----ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccc-ccccc Confidence 22 21111223333333444432 2222 2567888899999999999999999988877653332 23456 Q ss_pred hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCC Q lcl|NC_016654. 390 QTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVAS 469 (533) Q Consensus 390 ~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aG 469 (533) .||.||+++++.+..+|++|++.|+.+|+++++.|+.+.. .......++|+|++++|.|..+.++++++ +| T Consensus 346 ~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g------~~~d~~~i~v~f~~~~p~d~~e~a~~~~~---~g 416 (474) T protein:vir:95 346 PSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNN------LKMDVKDIEISFNFNRMMNDAEQSQIIAQ---SQ 416 (474) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCcccceeeEEeccCCCcCHHHHHHHHHh---cC Confidence 7999999999999999999999999999999999887632 23456789999999999999999887654 69 Q ss_pred CCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 470 AASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 470 i~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +||++|++++ +|.+++ +++|++||++|+......+....+.. .++..+++.++|.| T Consensus 417 ~iS~et~i~~-l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~-----~d~~~~~~~~~~~~ 472 (474) T protein:vir:95 417 YLSRETLVKS-SPLVDD--YKAELERIEQEQMEYNKQLPNLDDGG-----ADGAQQQERSNDKE 472 (474) T ss_pred CCchHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHhccccccccc-----CCCCcCCCCCccCC Confidence 9999999986 566665 67899999999865433221111110 11111122222222 No 19 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=100.00 E-value=4e-56 Score=324.30 Aligned_cols=451 Identities=11% Similarity=-0.003 Sum_probs=283.1 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) ..-|++....=+..+...+........+ ..++.+||.+.+...... .............+++|+++| T Consensus 17 ~~~~~~~~~~~~~~i~~~i~~~~~~~~~----~~~~~~yY~g~~~i~~~~---------~~~~~~~~~~~~~~~~ki~~n 83 (468) T protein:vir:96 17 VEQIKPQYETQEEMILRLITKHKENVED----ITVGERYYNHQPDVLFNA---------PKRNVKGEIDPFKPDWRMYTN 83 (468) T ss_pred eecccccccCcHHHHHHHHHHHHHHHHH----HHHHHHHhcCCCcccccc---------ccccccccccccccccccccc Confidence 1111111111111111111111111111 122333333332110000 000111122334467899999 Q ss_pred hHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcC Q lcl|NC_016654. 81 IPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDA 160 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~ 160 (533) ||+.||++.++||||+|++++++ ++.+++.|+++++ |+|...+.++++.++++|.+|+++|+|++ +.++|.+++| T Consensus 84 ~~~~Iv~~~~~~l~g~p~~~~~~--d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~--~~~~i~~~~p 158 (468) T protein:vir:96 84 YHQNLVDQKVAYAVANPVTYGTE--DEKSLKTIQEVLN-HKWDDKLVDILTAASNKGVEWIQPYVDEQ--GEFKTFRVPA 158 (468) T ss_pred hHHHHHHHHHhhhccCCceeccC--ChHHHHHHHHHHh-cCHHHHHHHHHHHHhhcCeEEEEEEEcCC--CceEEEEEcc Confidence 99999999999999999999874 5667899999997 67999999999999999999999999875 4699999999 Q ss_pred CeEEEEEecCCceEE-EEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCce Q lcl|NC_016654. 161 DRAIPEFRWGRLVAV-TFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGA 239 (533) Q Consensus 161 ~~~~P~~~~g~~~~v-~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~ 239 (533) .+++|+|+++...++ +|++.+...+. +.+|.|++++|.+..+... ....... .......... ... T Consensus 159 ~~~~~v~~~~~~~~~~~~ir~~~~~~~----~~~~~~~~~~~~~~~~~~~--~~~~~~~----~~~~~~~~~~----~~~ 224 (468) T protein:vir:96 159 EQAIPIWTNKERDELKAFIRLYELDGG----ERVEYWTANDVTFYELKDG--QLIPDYY----QGEEHVQAHY----YVG 224 (468) T ss_pred cceEEEEcCCCCCceEEEEEEEEecCc----eEEEEEeCCeEEEEEEcCC--ceeeccc----ccccccccce----eec Confidence 999999987654444 34444444332 2357888888876444322 1111100 0000000000 000 Q ss_pred eecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCCcc Q lcl|NC_016654. 240 YVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGMGQ 319 (533) Q Consensus 240 ~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~ 319 (533) ..+++...+++.+.+| .+.|.|+|+. +.+|||++|.++|++++.++.....+.+...+ .... T Consensus 225 ~~~~~~~~iPvv~~~n-------------~~~g~sd~e~-v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~----~~~~ 286 (468) T protein:vir:96 225 NKSMSWNRVPFIPFKN-------------NPQEVSDLFM-YKTIIDAMDKRLSDTQNTFDEATELIYVLKGY----EGED 286 (468) T ss_pred cccccCCcccEEEecC-------------CCCCCCchHH-HHHHHHHHHHHHHHHHHHHHHhcCceeeeecC----Cccc Confidence 1123333222222222 2358999987 78999999999999999998766666653222 1111 Q ss_pred ccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHh Q lcl|NC_016654. 320 GVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKK 399 (533) Q Consensus 320 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~ 399 (533) ...+......++.+... .+.++ .++++++++.++++...++.+.+.|+..++.+.-+++ ..++..||.|+++++ T Consensus 287 ~~~~~~~~~~~~~i~~~-~d~~~----~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~n~Sg~Alk~~~ 360 (468) T protein:vir:96 287 LEEFMYNLKYYKAINVD-GDGSG----GVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQQD-KFGNSPSGIALKFMY 360 (468) T ss_pred cchhhhhhhcCceEEec-CCCCC----cceEEeecCChHHHHHHHHHHHHHHHHHhCccccccc-ccccchHHHHHHHHH Confidence 11222222233333322 22222 3678889999999999999999999888887643332 334678999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHH Q lcl|NC_016654. 400 DLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAY 479 (533) Q Consensus 400 ~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~ 479 (533) +++..++++|++.|+.+|+++++.|+.+. +.......++|+|++++|.|..+.+++++ .+|+||+||++++ T Consensus 361 ~~l~~k~~~k~~~~~~~l~~~~~li~~~~------g~~~d~~~i~i~f~~~~p~d~~e~a~~~~---~~g~iS~et~i~~ 431 (468) T protein:vir:96 361 SNLDLKANKLKNKTLTALQELLQYIIDFY------KLSIKVQDVEITFNFNVMVNELEQSQIGV---NSQYLSKETVVTN 431 (468) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHh------CCCcccceeeEEecCCCCcCHHHHHHHHH---hcCCCchHHHHHh Confidence 99999999999999999999999988763 22345678999999999999999988765 4699999999986 Q ss_pred hCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCC Q lcl|NC_016654. 480 LHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEA 528 (533) Q Consensus 480 l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 528 (533) +|.++| +++|++||++|+....... .+. ...++ +.+. T Consensus 432 -l~~v~D--~~~E~~ri~~E~~~~~~~~-~~~----~~~~~----~~~~ 468 (468) T protein:vir:96 432 -HPWVDD--PVAEMERIDQEELALPSIE-EGL----NGKEN----NEPT 468 (468) T ss_pred -CCCCCC--HHHHHHHHHHHHHHHHHHh-hcc----CCCCC----CCCC Confidence 666665 7899999999986543211 000 00000 1111 No 20 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=100.00 E-value=1e-55 Score=322.00 Aligned_cols=464 Identities=11% Similarity=-0.002 Sum_probs=295.3 Q ss_pred CCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHH---HHHHH------HHHhcccCCCCCcc Q lcl|NC_016654. 3 LPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKAR---TKAAY------EAFHGRTPTATGRA 73 (533) Q Consensus 3 ~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~---~~~~~------~~~~~~~~~~~g~~ 73 (533) |-+=.=||=|+..+-....+..... -..+.|.++.... ..+..+++.. |.|.. .+...+........ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~i~~~i~~~-~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~ 76 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYE---TQEEMIIRLINDH-KPKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKP 76 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccC---ChHHHHHHHHHHH-HHHHHHHHHHHHHhccCCcchhccchhccccccccccc Confidence 3333334444444444433322211 1111222222111 1111111111 11100 00011112223356 Q ss_pred cceeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCce Q lcl|NC_016654. 74 PKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNA 153 (533) Q Consensus 74 ~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~ 153 (533) ++|+++|||+.||++.|+||||+|+++++ +++.+++.|+++++ |++...+.++++.++++|.+|+++|+|++ +++ T Consensus 77 ~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~--~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~y~d~~--~~~ 151 (474) T protein:vir:96 77 DWRMFTNYHQNLVDQKVAYAVANPVTFSS--DDDKSLKTIQEVLN-HKWDDKLVDILTAASNKGIEWLQPYIDEN--GEF 151 (474) T ss_pred chhcccchHHHHHHhhhhhhcccCceeec--CchHHHHHHHHHHh-cCHHHHHHHHHHHHHhcCeeEEEEEecCC--Cce Confidence 77899999999999999999999999987 45668899999987 57999999999999999999999999975 578 Q ss_pred EEEEEcCCeEEEEEecCCceEE-EEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccc Q lcl|NC_016654. 154 WIDFVDADRAIPEFRWGRLVAV-TFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEG 232 (533) Q Consensus 154 ~i~~v~~~~~~P~~~~g~~~~v-~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~ 232 (533) +|.+++|++++|+|+++....+ +|++.++..+.. ..|.|+...|.+..+.+. .+..... ........ T Consensus 152 ~i~~~~p~~~~~v~d~~~~~~~~~~vr~~~~~~~~----~~~~yt~~~v~~~~~~~~--~~~~~~~----~~~~~~~~-- 219 (474) T protein:vir:96 152 KTFRVPAEQAIPIWTNKERDTLKAFIRYYRLDGAE----RVEYWTDSDVTYYEYQDG--ILIPDYY----HGEEHIQS-- 219 (474) T ss_pred EEEEEcccceEEEEcCCCCCceEEEEEEEeecCce----EEEEEeCCeEEEEEecCC--ceeeccc----cccccccc-- Confidence 9999999999999987543333 344445443322 357788888877544332 1111000 00000000 Q ss_pred cccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHh Q lcl|NC_016654. 233 ADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVL 312 (533) Q Consensus 233 ~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l 312 (533) .......+++...+++.+.+| .+.|.|+|.. +.+|||++|.++|+++++++.....++|...+ T Consensus 220 --~~~~~~~~~~~g~iPvv~~~n-------------n~~g~sd~e~-v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~- 282 (474) T protein:vir:96 220 --HYYVGNKRVSWGRVPFIPFKN-------------NPQEMSDLFM-YKTIIDAMDKRLSDTQNTFDESTELIYILKGY- 282 (474) T ss_pred --cccccccccCCCceeEEEecc-------------CCCCCCcHHH-HHHHHHHHHHHHHHHHHHHHHhccceeeeecC- Confidence 000011233433333333333 2358999987 78999999999999999998887888774332 Q ss_pred cCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhH Q lcl|NC_016654. 313 TNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTA 392 (533) Q Consensus 313 ~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Ta 392 (533) .......+..+...++.+... ..++ .++++++++.++++...++.+.+.|+..++.+..+++ ..++..|| T Consensus 283 ---~~~~~~~~~~~~~~~~~i~~~--~~~~----~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~~~n~Sg 352 (474) T protein:vir:96 283 ---EGQDLDEFMRNLKYYKAINVD--GDGS----GVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQD-KFGNSPSG 352 (474) T ss_pred ---CcccccchhhhhhcCceEEec--CCCC----ceeEEeecCChHHHHHHHHHHHHHHHHHhCCcccccc-ccccccHH Confidence 221112222333334444332 1112 2678888999999999999999999999988765543 23456799 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCC Q lcl|NC_016654. 393 TEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAAS 472 (533) Q Consensus 393 tai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S 472 (533) .|++++++.+.++|++|++.|+++|++++++|+.+. +.......++|+|++++|.|..+.++.+ +.+|+|| T Consensus 353 ~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~------~~~~~~~~i~i~f~~~~p~~~~e~~~~~---~~ag~iS 423 (474) T protein:vir:96 353 IALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFY------KLNIKVQDVEITFNFNVMVNELEQSQIG---VQSQYLS 423 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh------CCCcccceeeEEeccCCCcCHHHHHHHH---HhcCCCc Confidence 999999999999999999999999999999988763 2334556799999999999999988865 4579999 Q ss_pred HHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 473 TKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDE 531 (533) Q Consensus 473 ~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 531 (533) +||++++ +|.+++ +++|++||++|+.......... ..++++..++++.+++ T Consensus 424 ~et~~~~-~~~v~d--~~~E~~ri~~E~~e~~~~~~~~-----~~~~~~~~~d~~~e~~ 474 (474) T protein:vir:96 424 KETVVTN-HPWVDD--PVAELERIEQDNIDFNKQLPPL-----EGDANGRAQDNESETN 474 (474) T ss_pred hHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHhccccc-----ccccccccCCCcccCC Confidence 9999986 676765 6789999999986544332211 1112222333344444 No 21 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=100.00 E-value=2.3e-55 Score=320.16 Aligned_cols=463 Identities=11% Similarity=-0.035 Sum_probs=294.6 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |+-=+..+.+. +.-+...|..|+.=-.-...+|.+||.+...... ........+.+++|+++| T Consensus 30 ~~~~~~~~~~~---~~~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i~--------------~~~~~~~~~~~~~ri~~n 92 (501) T protein:vir:96 30 ADNLEELMVNN---WELLKNFINHHKLRQAPRIQELLDYARGENHDVL--------------KSGRRKDNEMADKRAVHN 92 (501) T ss_pred ccccccccCCh---HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCccc--------------CccccCccccccceeecc Confidence 32222222222 1222333333321100111233444443211000 001112334567899999 Q ss_pred hHHHHHHHHHHhhcCCCceEeeCCC--chHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEE Q lcl|NC_016654. 81 IPGVIAKLSTTELFSEQLKFLDAGK--SKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFV 158 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e~~~i~~~~~--~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v 158 (533) +|+.||++.++||||+|+++++.++ ++.+++.|+++++.|+|...+.++++.++++|.+|+++|+|++ +.++|.++ T Consensus 93 ~~k~Ivd~~~~yl~g~p~~~~~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~ded--g~~~i~~~ 170 (501) T protein:vir:96 93 YGRMISKFKTGYLAGNPIRVEYDDNDDNSQNDDAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSEY--DETRIKRL 170 (501) T ss_pred hHHHHHHHHhhhhcccCeeEeeCCccchhHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCC--CceEEEEE Confidence 9999999999999999999998653 4567899999999999999999999999999999999999976 46899999 Q ss_pred cCCeEEEEEecCCceE-EEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCC Q lcl|NC_016654. 159 DADRAIPEFRWGRLVA-VTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGR 237 (533) Q Consensus 159 ~~~~~~P~~~~g~~~~-v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~ 237 (533) +|.+++|+|++....+ .+|++.+...+.....+.++.|++..|.+ |....+. ..+ T Consensus 171 ~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~vyt~~~i~~--~~~~~~~--~~~-------------------- 226 (501) T protein:vir:96 171 SPLETFVIYDNSLEDNSIAAVRYYNRGTLQSAKDVVEIYTDEHIYT--LDASDDF--NEI-------------------- 226 (501) T ss_pred ccceeEEEEcCCCCCceEEEEEEEEeecCCCcEEEEEEEcCCcEEE--EeeCCCc--eec-------------------- Confidence 9999999998743222 23444444333323345678888877754 3322110 000 Q ss_pred ceeecCCCc-cceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCC Q lcl|NC_016654. 238 GAYVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLG 316 (533) Q Consensus 238 ~~~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~ 316 (533) ...+++.. .|++.|+. .+.|+|+|.. +.+|+|++|.++|++++.++.....+.+...+..... T Consensus 227 -~~~~~~~g~vPvv~~~n--------------n~~g~sd~e~-v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~ 290 (501) T protein:vir:96 227 -SVTTHAFGTVPITEYLN--------------NIDGIGDYET-ELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPK 290 (501) T ss_pred -cccccCCCccceEEecC--------------CccCCCchhh-hHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCc Confidence 00122222 23333332 2458999987 7899999999999999999877777766433321111 Q ss_pred CccccccCcchhhhhhcccccc--ccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHH Q lcl|NC_016654. 317 MGQGVSLDEEQEVYSRVGSGGF--NANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATE 394 (533) Q Consensus 317 ~~~~~~~d~~~~~~~~~~~~~~--~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tata 394 (533) +..+... ..++.+..... ..+......++++++++..+.+...++.+.+.|+..++.+...++. .++..||.| T Consensus 291 ~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~-~~~n~Sg~A 365 (501) T protein:vir:96 291 GMQASDM----KRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTN-FSGNTSGEA 365 (501) T ss_pred ccchhhh----hhcCeeeecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCccc-ccccchHHH Confidence 1111111 11111111111 1112223346677888887777777888878887777776555542 235679999 Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHH Q lcl|NC_016654. 395 ASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTK 474 (533) Q Consensus 395 i~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~e 474 (533) |+++++.+..++.++++.|+.+|++++++|+.+.+.. ..........++|.|++.+|.|..+.+++++++ +|+||++ T Consensus 366 l~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~-~~~~~~d~~~i~i~f~~~~p~n~~e~ad~~~kl--~g~iS~e 442 (501) T protein:vir:96 366 LKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLV-NEFKDFDESLLKITFTPNLPKSLNEQVSILTGL--GGQVSQE 442 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-ccccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCchH Confidence 9999999999999999999999999999998876543 222234456799999999999999999999998 4899999 Q ss_pred HHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCC---CCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 475 TKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPP---LPTENDPATDPEAVDEGE 533 (533) Q Consensus 475 t~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~d~~ 533 (533) |++++ +|.+++ +++|++||++|+...+.....+...+. ..++......++++++-| T Consensus 443 t~~~~-l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~e~~~d~~e~~~~ 501 (501) T protein:vir:96 443 TALSL-SGLVES--PNEELDKINKEMSEIDFKGYSNDFNEHVGKYTDEVKETHTDDFEREYE 501 (501) T ss_pred HHHHh-CCCCCC--HHHHHHHHHHHHHHhhccccccchhhcccccCCcCCCCCCCccccccC Confidence 99997 565665 678999999998765433322221111 111112222233444444 No 22 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=100.00 E-value=1.8e-55 Score=320.72 Aligned_cols=458 Identities=11% Similarity=0.026 Sum_probs=290.1 Q ss_pred CCCC------CCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCccc Q lcl|NC_016654. 1 MSLP------EANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAP 74 (533) Q Consensus 1 ~~~~------~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 74 (533) |.+= +.-+.--+.-+...+.+...... ...+|.+||.+.+. ...+.....+..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~----~~~~l~~Yy~g~~~----------------i~~~~~~~~~~~~ 60 (499) T protein:vir:10 1 MAVVIDKDLLDDVNEPNIEAINYAIRELQNRKK----RLDKLSDYYNGKQE----------------IEKHEFDNATVEA 60 (499) T ss_pred CccchhhhHHhhhhcCCHHHHHHHHHHHHHHHH----HHHHHHHHhccccc----------------hhcCCcCcCCCCc Confidence 3221 11111111112222222111111 11233344433221 1112233445678 Q ss_pred ceeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCC--- Q lcl|NC_016654. 75 KRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIAD--- 151 (533) Q Consensus 75 ~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~--- 151 (533) +|+++|+|+.||+..|+||||+|++++++ ++..++.|+++++.|+|...+.++++.++++|.+|.++|+|+++.. T Consensus 61 ~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~--~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~ 138 (499) T protein:vir:10 61 ANVMVNHAKYITDMNVGFMTGNPVKYVAE--KGKNIDDILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVR 138 (499) T ss_pred ceeecchHHHHHHHHhhhhcccCceeecC--ChhHHHHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEeccccccccc Confidence 89999999999999999999999999874 4557888999999999999999999999999999999999987542 Q ss_pred ------------ceEEEEEcCCeEEEEEecCC---ceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccce Q lcl|NC_016654. 152 ------------NAWIDFVDADRAIPEFRWGR---LVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWM 216 (533) Q Consensus 152 ------------~~~i~~v~~~~~~P~~~~g~---~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~ 216 (533) .+++..++|.+++|+|++.. +..++.+......+....++.+|.|++.+|.+....+.....+. T Consensus 139 ~~~~~~~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~- 217 (499) T protein:vir:10 139 DELGNEKLTPNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEVSAN- 217 (499) T ss_pred ccccccccccccceEEEEEcccceEEEecCCCCcceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccccCc- Confidence 57789999999999998632 33334333333333334456789999999887443322111000 Q ss_pred eehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 217 MALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMR 296 (533) Q Consensus 217 v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~ 296 (533) + .++ ...+++...+++++.+| +..|.|+|.+ +++|||++|.++|++++ T Consensus 218 --------~----~~~------~~~~~~~g~vPvv~~~n-------------~~~~~~d~e~-v~~liD~~~~~~S~~~~ 265 (499) T protein:vir:10 218 --------D----PIV------YDGENLFGAVPIIEFRN-------------NEERQGDFEQ-LISLIDAYNLLQTDRIS 265 (499) T ss_pred --------c----eec------ccccCCCCccceEEecC-------------CCCCCCchHh-HHHHHHHHHHHHHHHHH Confidence 0 000 01223333333333322 2358899987 78999999999999999 Q ss_pred HHHhCcceeeechHHhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhC Q lcl|NC_016654. 297 DFRIGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTG 376 (533) Q Consensus 297 ~~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g 376 (533) .++.....+.+-..+ ................ +.....+.++ .++++++.+..+.+...++.+.+.|...++ T Consensus 266 ~~~~~~~~~lv~~G~----~~~~~~~~~~~~~~~~-~~~~~~~~~~----d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~ 336 (499) T protein:vir:10 266 DKEAFVDALLVTFGF----GLGDDKDDIQRLKRGA-IEAPPREEGA----DIEWLTKSFDETQVNLLSQSIENDIHKISY 336 (499) T ss_pred HHHHhcCceeeeecC----ccccccchhhhhhhcc-eeccCCCCCC----cceEEeccCCHHHHHHHHHHHHHHHHHHhC Confidence 998766666652221 1110000000111111 1111122222 256788888888888888888888888887 Q ss_pred CChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHH Q lcl|NC_016654. 377 YSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDL 456 (533) Q Consensus 377 ~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~ 456 (533) ++.-+++. .++..||.||+++++.+.+++.+|++.|+.+|++++++|+.+.+.. +.......++|.|++++|.|.. T Consensus 337 ~p~~~~~~-~~gn~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~---~~~~d~~~i~i~f~~~~p~n~~ 412 (499) T protein:vir:10 337 VPNMNDEK-FMGNVSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIK---GANDDASGCKISLVANIPSNLS 412 (499) T ss_pred cccCCchh-hcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc---CCccccccceEEeCCCCCCCHH Confidence 76444332 2456799999999999999999999999999999999999876532 2334556899999999999999 Q ss_pred HHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccc---cccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 457 AKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFG---FGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 457 e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +.+++++++ +|+||+||++++ +|.+++ +++|++||++|+....+... .+.+..+...++.+ +.+.++++| T Consensus 413 e~~~~~~kl--~g~iS~et~~~~-l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~ 485 (499) T protein:vir:10 413 DVVNNVKNA--DGIIPRKYTYSW-LPDVDN--PQDVIDEMNQQDAETIKKNQEALRGQDPDRLELEDKQ--DDSSENDKE 485 (499) T ss_pred HHHHHHHHH--hccCChHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCCC--cccCCCCCC Confidence 999999998 699999999986 666665 67899999998754322111 11111111111111 122222222 No 23 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=100.00 E-value=1.6e-55 Score=320.95 Aligned_cols=463 Identities=11% Similarity=-0.033 Sum_probs=292.8 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) ++-=+..+.|....+...+. .|+.=-.....+|.+||.+..... . ........+..++|+++| T Consensus 30 ~~~~~~~~~~~~~~l~~~i~---~~~~~~~~r~~~l~~yY~g~~~~i-~-------------~~~~~~~~~~~~~ki~~n 92 (501) T protein:vir:27 30 ADNLEELMVNNWELLKNFIN---HHKLRQAPRIQELLDYARGENHDV-L-------------QFGRRKDREMADKRAVHN 92 (501) T ss_pred cccccccccccHHHHHHHHH---HHHHHHHHHHHHHHHHhcCCCccc-c-------------ccCccCccccccceeccc Confidence 33333333444333332222 121100011123444444321100 0 001112234567899999 Q ss_pred hHHHHHHHHHHhhcCCCceEeeCCC--chHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEE Q lcl|NC_016654. 81 IPGVIAKLSTTELFSEQLKFLDAGK--SKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFV 158 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e~~~i~~~~~--~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v 158 (533) +|+.||++.++||||+|+++++.+. ++.+++.|+++++.|+|...+.++++.|+++|.+|+++|+|++ ++++|.++ T Consensus 93 ~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~ded--~~~~i~~~ 170 (501) T protein:vir:27 93 YGRMISKFKTGYLAGNPIRVEYDDNDNNSQNDDTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRNEY--DETRIKRL 170 (501) T ss_pred hHHHHHHHHhhhhcccCeeEecCCccchHHHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeCCC--CceEEEEE Confidence 9999999999999999999998653 3567889999999999999999999999999999999999975 46899999 Q ss_pred cCCeEEEEEecCCce-EEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCC Q lcl|NC_016654. 159 DADRAIPEFRWGRLV-AVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGR 237 (533) Q Consensus 159 ~~~~~~P~~~~g~~~-~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~ 237 (533) +|.+++|+|++.... .++|++.+...+.....+++|.|++..|.+ |...++. ..+ T Consensus 171 ~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~~~~vyt~~~v~~--~~~~~~~--~~~-------------------- 226 (501) T protein:vir:27 171 NPLETFVIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNEHIYT--LDASDDF--NEI-------------------- 226 (501) T ss_pred ccceeEEEecCCCCCceEEEEEEEEeeecCCcEEEEEEEeCCeEEE--EEeCCce--eec-------------------- Confidence 999999999875322 234455554433333345678888887754 3322110 000 Q ss_pred ceeecCCCc-cceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCC Q lcl|NC_016654. 238 GAYVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLG 316 (533) Q Consensus 238 ~~~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~ 316 (533) ...+++.. .|+++|. | .+.|.|+|.. +.+|||++|.++|++++.++....++.+...+..... T Consensus 227 -~~~~~~~g~vPvv~~~-n-------------n~~g~sd~e~-v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~ 290 (501) T protein:vir:27 227 -SVTTHAFGTVPITEFL-N-------------NVDGIGDYET-ELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPK 290 (501) T ss_pred -cccccCCCcccEEEec-C-------------CCCCCCchhh-hHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCc Confidence 01123333 2334433 2 2358999987 7899999999999999999988777777444322211 Q ss_pred CccccccCcchhhhhhccccc--cccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHH Q lcl|NC_016654. 317 MGQGVSLDEEQEVYSRVGSGG--FNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATE 394 (533) Q Consensus 317 ~~~~~~~d~~~~~~~~~~~~~--~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tata 394 (533) +...... .....+.... ...+......++++++++..+.+...++.+.+.|+..++.+..+++. .++..||.| T Consensus 291 ~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~-~~~n~Sg~A 365 (501) T protein:vir:27 291 GMQASDM----KRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSDTN-FSGNTSGEA 365 (501) T ss_pred ccchhhh----hhcCceeecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCccc-cccCchHHH Confidence 1111111 1111221111 11112222346778888877777777777777777777666544432 235679999 Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHH Q lcl|NC_016654. 395 ASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTK 474 (533) Q Consensus 395 i~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~e 474 (533) |+++++.+.+++.+|++.|+.+|++++++|+.+.+.. +.........|+|.|++++|.|..+.+++++++ +|++|++ T Consensus 366 l~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~-~~~~~~d~~~i~v~f~~~~p~n~~e~ad~~~kl--~g~iS~e 442 (501) T protein:vir:27 366 LKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLV-NEFKDFDESLLKITFTPNLPKSLNEQVSILTGL--GGQVSQE 442 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-ccccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCcHH Confidence 9999999999999999999999999999999876532 122234456799999999999999999999987 6999999 Q ss_pred HHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccc-cccC--CCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 475 TKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFG-TDQP--PLPTENDPATDPEAVDEGE 533 (533) Q Consensus 475 t~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~-~~~~--~~~~~~~~~~~~~~~~d~~ 533 (533) |++++ .|.+++ +++|++||++|+....+....+ ..++ ...++....+.++.++.-| T Consensus 443 t~l~~-l~~v~D--~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~d~~~~~~~d~~e~~~~ 501 (501) T protein:vir:27 443 TALSL-SGLVES--PNEELDKINKEVSEIDFKGYSNDFNEHVGKYTDEVKETHTDDFERAYE 501 (501) T ss_pred HHHHh-CCCCCC--HHHHHHHHHHHHHhhhHhhhcCccccccccccCCCCCCccccccccCC Confidence 99986 565665 7789999999976443322211 1111 0011111111112222222 No 24 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=100.00 E-value=1.9e-55 Score=320.63 Aligned_cols=467 Identities=9% Similarity=-0.002 Sum_probs=297.2 Q ss_pred CCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHH---HHHHHH------HHHhcccCCCCCcc Q lcl|NC_016654. 3 LPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKA---RTKAAY------EAFHGRTPTATGRA 73 (533) Q Consensus 3 ~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~---~~~~~~------~~~~~~~~~~~g~~ 73 (533) |-+-|-||=||.-.-.+..|+.... -..+.|.++.... ..+..+++. +|.|.. ...........+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~i~~~i~~~-~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~ 76 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYE---TQEEMILRLVREH-KENIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKP 76 (478) T ss_pred CccccccCCchhhhHHHHHhhhccC---ChHHHHHHHHHHH-HHHHHHHHHHHHHhcccccccccchhhhcccccccccc Confidence 5666667777776555544443311 1122222222211 111111111 111100 00011112234456 Q ss_pred cceeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCce Q lcl|NC_016654. 74 PKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNA 153 (533) Q Consensus 74 ~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~ 153 (533) ++|+++||++.||++.|+||||+|+++++ +++.+++.|+++++ |+|...+.++++.++++|.+|+++|+|++ +++ T Consensus 77 ~~ki~~n~~k~ivd~~~~yl~g~p~~~~~--~~~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~--~~~ 151 (478) T protein:vir:10 77 DWRMYTNYHQNLVDQKVAYAVANPVTFGV--DNDKALKQIQHTLN-HKWDDKLVDILTAASNKGIEWVQPYVDEE--GEF 151 (478) T ss_pred cceeccchHHHHHHHHhhhhcccCceeec--CChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEecCC--Cce Confidence 77999999999999999999999999977 45668899999986 78999999999999999999999999975 579 Q ss_pred EEEEEcCCeEEEEEecCCceEE-EEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccc Q lcl|NC_016654. 154 WIDFVDADRAIPEFRWGRLVAV-TFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEG 232 (533) Q Consensus 154 ~i~~v~~~~~~P~~~~g~~~~v-~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~ 232 (533) ++.+++|.+++|+|+++...+. +|++.+...+. ..+|.|++.+|.+..+ .+..+...+ ...+....... T Consensus 152 ~~~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~----~~~~~y~~~~i~~~~~--~~~~~~~~~----~~~~~~~~~~~ 221 (478) T protein:vir:10 152 KTFRVPAEQAVPIWTNKERDELQAFIRVYELDGA----ERVEYWTKDDVTFYEL--KEGQLIPDF----YRSEDHIQPHY 221 (478) T ss_pred EEEEEcccceEEEEcCCCCCceEEEEEEEeeeCc----eEEEEEeCCcEEEEEe--cCCeeeccc----cccccccccce Confidence 9999999999999987543333 33444443332 2357888888876433 322221111 11111110000 Q ss_pred cccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHh Q lcl|NC_016654. 233 ADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVL 312 (533) Q Consensus 233 ~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l 312 (533) .. . ...++...++++..+| ...|.|+|.. +.+|||++|.++|++++.++.....+.+...+ T Consensus 222 ~~-~---~~~~~~g~vPvv~~~n-------------~~~g~sd~e~-v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~- 282 (478) T protein:vir:10 222 YQ-G---NKLMSWGRVPFIPFKN-------------NPQEVSDLFM-YKTIIDALDKRLSDTQNTFDESVELIYILKGY- 282 (478) T ss_pred ec-c---cccccCCcceEEEecc-------------CCCCCCcHHH-HHHHHHHHHHHHHHHHHHHHHhhCcceeeecC- Confidence 00 0 1123333323333322 2358999987 88999999999999999998766666663322 Q ss_pred cCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhH Q lcl|NC_016654. 313 TNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTA 392 (533) Q Consensus 313 ~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Ta 392 (533) .......+..+...++.+... .+.++ .+++++++++++++...++.+.+.|+..++.+.-+++ ..++..|| T Consensus 283 ---~~~~~~~~~~~~~~~~~~~~~-~~~~~----~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~n~Sg 353 (478) T protein:vir:10 283 ---EGEDMKDFMHNLKYYKAISVA-GESGS----GVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQD-KFGNSPSG 353 (478) T ss_pred ---CcccccchhhhhhhCceeEec-CCCCC----cceEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCcc-ccccchHH Confidence 111111122222333333322 12222 3567888888999989899888888888876543332 22467799 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCC Q lcl|NC_016654. 393 TEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAAS 472 (533) Q Consensus 393 tai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S 472 (533) .||+++++.+..+|..|++.|+.+|++++++|+.+.. ......+++|+|++++|.|..+.+++++++ +|+|| T Consensus 354 ~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~------~~~d~~~i~i~f~~~~p~~~~e~~~~~~~~--~g~iS 425 (478) T protein:vir:10 354 IALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYR------LDVRVQDIEITFNFNVMVNELENSQIAMNS--TGLLS 425 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCcccccceEEeCCCCCCCHHHHHHHHHHH--hCCCC Confidence 9999999999999999999999999999999887632 234556799999999999999999999876 69999 Q ss_pred HHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 473 TKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 473 ~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ++|++++ +|.+++ +++|++||++|+....+... ...+...++.+++.+++++ T Consensus 426 ~et~i~~-~~~v~d--~~~E~~ri~~E~~~~~~~~~------~~~~~~~d~~~~~~~d~~~ 477 (478) T protein:vir:10 426 KETILGN-HSWVQD--PVAEMERIEQENIELNQQLP------DIEEGLNDEQQRQSEDNQS 477 (478) T ss_pred hHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHHhcc------ccCCCCcccccccCcCCCC Confidence 9999986 665655 77999999999876443211 1111111111222222222 No 25 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=100.00 E-value=2.3e-55 Score=320.16 Aligned_cols=468 Identities=11% Similarity=0.037 Sum_probs=298.1 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |-+..- .. -+...+..++... ....++.+||.+.+.....+... .........+..+...+..++|+++| T Consensus 1 ~~~e~~-----~~---~i~~~~~~~~~~~-~~~~~~~~Yy~g~hdi~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ki~~n 70 (471) T protein:vir:10 1 MEIEVI-----KK---IISSQMVKHGKFV-SQAAEAEKYYRNENDIKRKRKPA-DKKGAENEAKAEDNAFRNADNRISHN 70 (471) T ss_pred CCHHHH-----HH---HHHHHHHHHHHHH-HHHHHHHHHhccccccccccchh-hhhcccccccccccccccccceeccc Confidence 333211 11 1222222222211 12345666666654321111110 00011112223344555678899999 Q ss_pred hHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcC Q lcl|NC_016654. 81 IPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDA 160 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~ 160 (533) |++.||++.++||||+|+++++ +++.+++.|+.+++ |+|...+.++++.++++|.+|+++|+|+. .++++|.+++| T Consensus 71 ~~~~Ivd~~~~yl~G~p~~~~~--~~~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~-~g~~~~~~~~p 146 (471) T protein:vir:10 71 WHQLLLDQKKAYALTYPPTFDV--DDKKVNDMIVDVLG-DDYERISKQLCVNAGNAGIAWLHVWKDAS-DNSFRYACVDS 146 (471) T ss_pred hhHHHHHhhhhhhcccCceecc--CChHHHHHHHHHHh-cCHHHHHHHHHHHHhhCCeEEEEEEeeCC-CCeeEEEEEcc Confidence 9999999999999999999977 55678899999986 78999999999999999999999999864 35799999999 Q ss_pred CeEEEEEecCC---ceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCC Q lcl|NC_016654. 161 DRAIPEFRWGR---LVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGR 237 (533) Q Consensus 161 ~~~~P~~~~g~---~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~ 237 (533) .+++|+|+++. +..++.+............+++|.|+...+.+ |......+...+............. ..... T Consensus 147 ~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~--y~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~ 222 (471) T protein:vir:10 147 KEVIPIYSKSLDKKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSF--YRHEKEKPLEELETFQAISLIDTMN--GDRSS 222 (471) T ss_pred cceEEEEcCCCCCceEEEEEEEEeeccCCCceeEEEEEEeCCcEEE--EEecCCccccccccccccccccccc--ccccc Confidence 99999998653 33333221222222223455689998887776 4444433322222111110000000 01111 Q ss_pred ceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCC Q lcl|NC_016654. 238 GAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGM 317 (533) Q Consensus 238 ~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~ 317 (533) .....++...+++...+| ...|.|+|.. +++|||++|.++|++++.++.....++|...+ .. T Consensus 223 ~~~~~~~~g~iPvv~~~n-------------~~~~~sd~e~-v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~----~~ 284 (471) T protein:vir:10 223 DNSFKHDFGLVPFIPFKN-------------NEIETNDLKP-IKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNY----GG 284 (471) T ss_pred cccccCCCCceeEEEecc-------------CCCCCCchHH-HHHHHHHHHHHHHHHHHHHHHhhCceeeeecC----Cc Confidence 112334443333333333 1248899986 88999999999999999998777777763332 11 Q ss_pred ccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHH Q lcl|NC_016654. 318 GQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASG 397 (533) Q Consensus 318 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~ 397 (533) .....+......++.+...... .+....+++++++++.+++...++.+.+.|+..++.+..++ ...|..||.||++ T Consensus 285 ~~~~~~~~~~~~~~~i~~~~~~--~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~--~~~gn~Sg~Alk~ 360 (471) T protein:vir:10 285 QDKQEFLEDLKRYKMIKMDNDG--MGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNPET--DKLGNSSGVALKF 360 (471) T ss_pred cccchhHHHhhcCCeEEecCCC--CccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCc--ccccCccHHHHHH Confidence 1111111122223333332221 11223478889999999999999999999998888765433 3345679999999 Q ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHH Q lcl|NC_016654. 398 KKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKV 477 (533) Q Consensus 398 ~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v 477 (533) +++.+..+|+++++.|+.+|+++++.|+.+.+ ..+...++|.|++.+|.|..+.+++++++ +|+||.||++ T Consensus 361 ~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~-------~~d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~ 431 (471) T protein:vir:10 361 LYSLLELKAGNMETQFRSGYATLVKMILKHLG-------LSDKLKIKQTWTRNSINNDTEMAQVVSTL--ATITSRENVA 431 (471) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-------cCCCceeEEEeCCCCCCCHHHHHHHHHHH--hccCchHHHH Confidence 99999999999999999999999999887632 12346789999999999999999999987 6899999999 Q ss_pred HHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCC Q lcl|NC_016654. 478 AYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAV 529 (533) Q Consensus 478 ~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 529 (533) ++ +|.+++ +++|++||++|+......+ +...+.++ +++.+ T Consensus 432 ~~-~p~v~D--~~~E~eri~~E~~~~~~~~-------~~~~~~~~--~~e~~ 471 (471) T protein:vir:10 432 KS-NPIVED--WQDELRLQKAEQEGRSEKL-------YDMEEVEH--ESEVE 471 (471) T ss_pred Hh-CCCCCC--HHHHHHHHHHHHHHHHhcc-------cccCCCCC--ccccC Confidence 86 677765 7789999999985442211 11111111 11111 No 26 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=100.00 E-value=2.7e-55 Score=319.72 Aligned_cols=467 Identities=12% Similarity=0.025 Sum_probs=293.3 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |+--+-....=+......+.+....+ .-...+|.+||.+.+..... ....+.....++|+++| T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~---~~r~~~l~~Yy~g~~~il~~--------------~~~~~~~~~~~~ki~~n 93 (511) T protein:vir:93 31 YDGTESDLLQNVNEVSKYIEHHMDYQ---RPRLKVLSDYYEGKTKNLVE--------------LTRRKEEYMADNRVAHD 93 (511) T ss_pred ccchhhhhhccHHHHHHHHHHHHHhh---HHHHHHHHHHhcccCccccc--------------cCcCcccccCcceeecc Confidence 55444444333333333333211110 01123455555554321100 01112233457899999 Q ss_pred hHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcC Q lcl|NC_016654. 81 IPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDA 160 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~ 160 (533) +|+.||+..++||||+|++++++ ++.+++.|+++++.|+|...+.++++.++++|.+|+++|+|++ +++++.+++| T Consensus 94 ~~k~Iv~~~~~yl~g~p~~~~~~--d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de~--~~~~i~~~~p 169 (511) T protein:vir:93 94 YASYISDFINGYFLGNPIQYQDD--DKDVLEVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD--DETRLYKSDA 169 (511) T ss_pred hHHHHHHHHhhhhcccCeeeccC--ChHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCC--CceEEEEEcc Confidence 99999999999999999999874 5568899999999999999999999999999999999999975 4689999999 Q ss_pred CeEEEEEecCCceE-EEEEEEEee----cCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccccccc Q lcl|NC_016654. 161 DRAIPEFRWGRLVA-VTFWSELAG----GDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADE 235 (533) Q Consensus 161 ~~~~P~~~~g~~~~-v~f~~~~~~----~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~ 235 (533) .+++|+|++....+ ++|++.+.. ......+.++|.|++..|.+ |.......+. +.. . . T Consensus 170 ~~~~~vydd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~i~~--~~~~~~~~~~---~~~--------~-~--- 232 (511) T protein:vir:93 170 MSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYR--YLTSRTNGLK---LTP--------R-E--- 232 (511) T ss_pred ceeEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEE--EEecCCCccc---ccc--------c-c--- Confidence 99999998753222 333333321 12223445678999988876 3332221110 000 0 0 Q ss_pred CCceeecCCCc-cceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcC Q lcl|NC_016654. 236 GRGAYVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTN 314 (533) Q Consensus 236 ~~~~~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~ 314 (533) ....+++.. .|++.|.+ ...|.|+|.+ +.+|||++|.++|++++.++.....+.|...+... T Consensus 233 --~~~~~~~~g~vPvv~~~n--------------n~~g~gd~e~-v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~ 295 (511) T protein:vir:93 233 --NGFESHSFERMPITEFSN--------------NERRKGDYEK-VITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL 295 (511) T ss_pred --ccccccCCCccceEEecC--------------CCCCCCchhh-HHHHHHHHHHHHHHHHHHHHHhhCcceeeecCccc Confidence 001122222 23333332 2358899987 77999999999999999998777777764433211 Q ss_pred CCCc-c----ccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc Q lcl|NC_016654. 315 LGMG-Q----GVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA 389 (533) Q Consensus 315 ~~~~-~----~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~ 389 (533) .... . ...+......+........+ ....++++++++.++.+...++.+.+.|+..++++.-+++ ..++. T Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~-~~~~n 370 (511) T protein:vir:93 296 DPVEVRKQKEANVLFLEPTVYADSEGRETE----GSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDD-NFSGT 370 (511) T ss_pred CchhhcccccccceecccccccccccccCC----CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccc-ccccc Confidence 1110 0 00000011111111111111 2233667788888888888888888888877777654443 22356 Q ss_pred hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCC Q lcl|NC_016654. 390 QTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVAS 469 (533) Q Consensus 390 ~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aG 469 (533) .||.||+++++.+.+++..|++.|+.+|++++++|+.+.+.............+++.|++++|.|..+.+++++++ +| T Consensus 371 ~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl--~g 448 (511) T protein:vir:93 371 QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GG 448 (511) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccceEEeCCCCCCCHHHHHHHHHHH--hc Confidence 7999999999999999999999999999999999988755332222233445789999999999999999999988 69 Q ss_pred CCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCC---C Q lcl|NC_016654. 470 AASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEG---E 533 (533) Q Consensus 470 i~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~---~ 533 (533) +||+||++.+ +|.+++ +++|++||++|+....+.........+... +++++++++++.. | T Consensus 449 ~iS~et~~~~-l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:93 449 KISQTTLMSL-FSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDI-NDDEQDDDTKDTVDKKE 511 (511) T ss_pred cCchHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHHHHhhhcccCCCCC-CCCCCCCcccccccccC Confidence 9999999986 565665 678999999998654432211111111111 1111111111111 1 No 27 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=100.00 E-value=6.3e-55 Score=317.74 Aligned_cols=458 Identities=11% Similarity=0.051 Sum_probs=298.5 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) .-+|+...--=|..+..++......+ .....+|.+||.+.+.....+. .......+..++|+++| T Consensus 21 ~~~~~~~~~~~~~~i~~~i~~~~~~~---~~~~~~~~~yY~g~~~~i~~~~------------~~~~~~~~~~~~ki~~n 85 (481) T protein:vir:10 21 FVVSDLAELLKEENLRNFISRHQTEQ---VPRLEMLESYYLNRNTDILAGE------------RRLQKYGDKADHRAVHN 85 (481) T ss_pred eeeecchhhcCHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCcccccCc------------cccccccccccceeecc Confidence 22333333333444333333221110 0012344455544322110000 00011223456789999 Q ss_pred hHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcC Q lcl|NC_016654. 81 IPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDA 160 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~ 160 (533) +|+.||+..|+||||+|++++++ ++..++.|++++++|+|...+.++++.++++|.+|+++|+|++ ++++|.+++| T Consensus 86 ~~~~ivd~~~~~l~g~~~~~~~~--d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~d--g~~~i~~~~p 161 (481) T protein:vir:10 86 YAKYVSRFIVGYLTGNPITITHQ--DNQTNDKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRDFE--DRDTFKVLDP 161 (481) T ss_pred hHHHHHHHHHhhhccCCceEecC--ChhHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCC--CeEEEEEEcc Confidence 99999999999999999999884 5567899999999999999999999999999999999999976 4789999999 Q ss_pred CeEEEEEecCCceE-EEEEEEEeec-CCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCc Q lcl|NC_016654. 161 DRAIPEFRWGRLVA-VTFWSELAGG-DGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRG 238 (533) Q Consensus 161 ~~~~P~~~~g~~~~-v~f~~~~~~~-~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~ 238 (533) .+++|+|++....+ +++++.+... .....+.++|.|++..|.+....+....+ + + T Consensus 162 ~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~---~--~------------------ 218 (481) T protein:vir:10 162 KSTFVVYDQTLDKKVVAGVRYFEKQDKDKVPVQHVEVYTTDKIYYIEIKGGTYHR---V--E------------------ 218 (481) T ss_pred cceEEEEcCCCCCceEEEEEEEEEeeCCCceEEEEEEEecCeEEEEEecCCceee---c--c------------------ Confidence 99999998753322 2334444332 22334567899999888763332211110 0 0 Q ss_pred eeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCCc Q lcl|NC_016654. 239 AYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGMG 318 (533) Q Consensus 239 ~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~ 318 (533) ..+++...+++++.+| ...|.|+|.. +.+|+|++|.++|++.+.++.....+++-..+.... +. T Consensus 219 -~~~~~~g~vPvv~~~n-------------~~~g~~~~~~-v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~-~~ 282 (481) T protein:vir:10 219 -EVEHYYNDVPIIEYLN-------------DQFKQGDFEN-VIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDLD-SE 282 (481) T ss_pred -cccccCCceeEEEeec-------------CCCCCCchhh-HHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCCC-cc Confidence 1122222222222222 2358899986 889999999999999999986666666533222211 11 Q ss_pred cccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHH Q lcl|NC_016654. 319 QGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGK 398 (533) Q Consensus 319 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~ 398 (533) .+..+......+....... .+.+....+++++++++.+++...++.+.+.|+..++.+..+++ ..+++.||.|++++ T Consensus 283 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~~~n~Sg~Al~~~ 359 (481) T protein:vir:10 283 DAKAFRDANMIHLEPGTNA--NGSEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDLNDE-QFSGVQSGESMKYK 359 (481) T ss_pred chhhhhhccceeccccccc--cCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccc-ccccccHHHHHHHH Confidence 2222222222222211111 11122234677888888888888888888888888888766666 33466799999999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHH Q lcl|NC_016654. 399 KDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVA 478 (533) Q Consensus 399 ~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~ 478 (533) ++++..+++++++.|+.+|++++++++.+.+. .+.......++++.|++++|.|..+.+++++++ +|+||.+|+++ T Consensus 360 ~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~--~~~~~~~~~~i~v~f~~~~~~~~~~~a~~~~kl--~g~is~et~~~ 435 (481) T protein:vir:10 360 LFGLEQVRAIKERLFKKGLMKRYKLLLNNVNL--TGLKQHNYAELTITFTPNLPKSMMESINAFNAL--SGGVSESTRLS 435 (481) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--cCCCccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHH Confidence 99999999999999999999999999887653 234445567899999999999999999999988 58999999998 Q ss_pred HhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 479 YLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 479 ~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) + +|.+++ +++|++||++|+....+..... ..+++..+.+..|||+ T Consensus 436 ~-l~~i~d--~~~E~~ri~~E~~~~~~~~~~~-------~~~~~~~~~~~~dd~~ 480 (481) T protein:vir:10 436 L-LDFIDN--PKEELEKMQEEEAQREKQADKR-------GYGEAFENHLNVDDSN 480 (481) T ss_pred h-CCCCCC--HHHHHHHHHHHHHHHHhhhhhc-------cCCccCCCCCCCCCCC Confidence 6 565554 7789999999986554422111 0111111223334444 No 28 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=100.00 E-value=2.5e-55 Score=319.90 Aligned_cols=439 Identities=10% Similarity=0.023 Sum_probs=289.0 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHH---HHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCccccee Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDK---LATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRY 77 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~---l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 77 (533) +-||... +...+-+...|..| +....+ +.+||.+.+. +..+.....+..++|+ T Consensus 9 ~~~~~~~----~~~~~~i~~~i~~~----~~~~~r~~~~~~Yy~g~~~----------------i~~~~~~~~~~~~~ki 64 (452) T protein:vir:36 9 MTFSKDE----PITVEVVTKFMEKH----KLEVARYEYLKNMYLGIMA----------------IDDEPAKDSWKPDNRL 64 (452) T ss_pred EEcCCcc----CCCHHHHHHHHHHH----HHHHHHHHHHHHHhccccc----------------cccCccccccCcccee Confidence 2333222 11112222222222 222222 2334433221 2223334445677899 Q ss_pred ecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEE Q lcl|NC_016654. 78 HAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDF 157 (533) Q Consensus 78 ~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~ 157 (533) ++|+|+.||++.|+||||+|++++++ ++.+++.|++++++|+|...+.++++.++++|.+|+++|+|++ ++++|.+ T Consensus 65 ~~n~~~~ivd~~~~~l~g~~~~~~~~--d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~--g~~~i~~ 140 (452) T protein:vir:36 65 AVNFTKYIVDTFTGYFNGIPVKKSHS--DKEILTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDED--TQTNVVY 140 (452) T ss_pred ecchHHHHHHHHhhhhcccCceeecC--ChhHHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecCC--CeeEEEE Confidence 99999999999999999999999874 5567899999999999999999999999999999999999975 4789999 Q ss_pred EcCCeEEEEEecCCceEE-EEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccC Q lcl|NC_016654. 158 VDADRAIPEFRWGRLVAV-TFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEG 236 (533) Q Consensus 158 v~~~~~~P~~~~g~~~~v-~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~ 236 (533) ++|.+++|+|++..-.+. ++++.+...++. ..+|.|++.+|.+ |....+.+ .+ . T Consensus 141 ~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~---~~~~vyt~~~i~~--~~~~~~~~--~~---------------~--- 195 (452) T protein:vir:36 141 NSPENMFMVYDDTVKQEPLFAVRYGVDEDKK---LQGEVYTLLETIK--ISGENDEI--SF---------------G--- 195 (452) T ss_pred EcccceEEEEcCCCCCceEEEEEEEEecCce---EEEEEEecCeEEE--EEEcCCce--EE---------------e--- Confidence 999999999987543333 344444444432 2467788877754 33222111 00 0 Q ss_pred CceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCC Q lcl|NC_016654. 237 RGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLG 316 (533) Q Consensus 237 ~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~ 316 (533) ...+++...+++.+.+| ...|+|+|.. +.+|+|++|.++|++++.++.....+.+...+ . T Consensus 196 --~~~~~~~g~iPvv~~~n-------------~~~g~sd~e~-v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g~----~ 255 (452) T protein:vir:36 196 --EGTYNPYPDLPVVEFYF-------------NEERMSIFES-VISLVNAFNKAISEKANDVDYFSDQYLTFLGA----A 255 (452) T ss_pred --cceeccCCcccEEEecC-------------CCCCCcchHH-HHHHHHHHHHHHHHHHHHHHHhcCceeEeecC----C Confidence 01123333222222222 2358999986 88999999999999999997766666652211 1 Q ss_pred CccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHH Q lcl|NC_016654. 317 MGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEAS 396 (533) Q Consensus 317 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~ 396 (533) ... ....+...++.+..... +.+....++++++++..+.+...++.+.+.|+..++.+. ++++..|..||+||+ T Consensus 256 ~~~--~~~~~~~~~~~~~~~~~--~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~~gn~Sg~Al~ 329 (452) T protein:vir:36 256 VEE--EDLKNIRSNRVINYYAD--GEGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVAN--ISDESFGSSSGVSLA 329 (452) T ss_pred cCc--hhhhhhhhcceEEecCC--CCccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCccc--cCcccccCCcHHHHH Confidence 100 11111111233322211 122233467788888888888889988888888887764 444555677999999 Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHH Q lcl|NC_016654. 397 GKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTK 476 (533) Q Consensus 397 ~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~ 476 (533) ++++.+.++++++++.|+.+|++++++|+.+.+.. +.......|+|.|++++|.|..+.+++++++ +|+||+||+ T Consensus 330 ~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~---~~~~~~~~i~i~f~~~~p~d~~~~a~~~~k~--~g~iS~et~ 404 (452) T protein:vir:36 330 YKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNV---SNKDSWKDIEYTFTRNEPKDIKEQAETANIL--MGITSQETA 404 (452) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc---CCccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHH Confidence 99999999999999999999999999999886532 2234456799999999999999999999987 689999999 Q ss_pred HHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCC-CCCCCCCC Q lcl|NC_016654. 477 VAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATD-PEAVDEGE 533 (533) Q Consensus 477 v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~d~~ 533 (533) +++ +|.+++ +++|++||++|++...... +....++++.+ ...+++.| T Consensus 405 ~~~-~~~~~d--~~~E~~ri~~E~~~~~~~~-------~~~~~~~~~~~~~~~~~~~e 452 (452) T protein:vir:36 405 LSV-ISVIPD--VQAEMEKIKKEEASTAIFD-------KDKQPSEKGTDTVVSETNEE 452 (452) T ss_pred HHh-CCCCCC--HHHHHHHHHHHHHHHHHHH-------hhccCCCCcccccCccccCC Confidence 986 665654 7899999999986542211 11111112111 11222222 No 29 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=100.00 E-value=4.6e-55 Score=318.48 Aligned_cols=467 Identities=11% Similarity=0.018 Sum_probs=291.0 Q ss_pred CCCCCCcCCCcCcchHHHHHHHH-hhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeec Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVA-ESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHA 79 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~-~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 79 (533) |+-=+.....=+..+...+.+.. ..+.. ..+|.+||.+.+..... .......+..++|+++ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r----~~~l~~Yy~g~~~il~~--------------~~~~~~~~~~~~ki~~ 92 (511) T protein:vir:96 31 YDGTESDLLQNVNEVSKYIEHHMDYQRPR----LKVLSDYYEGKTKNLVE--------------LTRRKEEYMADNRVAH 92 (511) T ss_pred ccchhhhhhcCHHHHHHHHHHHHHhhhHH----HHHHHHHhhccCccccc--------------cCcccccccCcceeec Confidence 22222222222223333322211 11111 13455555554321100 0011223345789999 Q ss_pred ChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEc Q lcl|NC_016654. 80 PIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVD 159 (533) Q Consensus 80 n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~ 159 (533) |+|+.||+..++||||+|++++++ ++.+++.|+++++.|+|...+.++++.++++|.+|+++|+|++ +++++.+++ T Consensus 93 n~~k~Iv~~~~~yl~g~p~~~~~~--d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~d--g~~~i~~~~ 168 (511) T protein:vir:96 93 DYASYISDFINGYFLGNPIQYQDD--DKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD--DETRLYKSD 168 (511) T ss_pred chHHHHHHHHhhhhcccCceeecC--chHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCC--CceEEEEEc Confidence 999999999999999999999874 5568899999999999999999999999999999999999875 468999999 Q ss_pred CCeEEEEEecCCceE-EEEEEEEee---c-CCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccc Q lcl|NC_016654. 160 ADRAIPEFRWGRLVA-VTFWSELAG---G-DGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGAD 234 (533) Q Consensus 160 ~~~~~P~~~~g~~~~-v~f~~~~~~---~-~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~ 234 (533) |.+++|+|++....+ ++|++.+.. . .....+.++|.|++..|.+....+... ..+. .... T Consensus 169 p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~-----~~~~---------~~~~- 233 (511) T protein:vir:96 169 AMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNG-----LKLT---------PREN- 233 (511) T ss_pred ccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCc-----cccc---------cccc- Confidence 999999998753222 333333322 1 122334567899988886533322111 0000 0000 Q ss_pred cCCceeecCCCc-cceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhc Q lcl|NC_016654. 235 EGRGAYVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLT 313 (533) Q Consensus 235 ~~~~~~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~ 313 (533) ...+++.. .|++.|.. ...|.|+|.+ +.+|||++|.++|++++.++.....+.|-..+.. T Consensus 234 ----~~~~~~~g~vPvv~~~n--------------~~~g~gd~e~-v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~ 294 (511) T protein:vir:96 234 ----SFESHSFERMPITEFSN--------------NERRKGDYEK-VITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN 294 (511) T ss_pred ----ccccCcCcccceEEecC--------------CCCCCCchhh-hHHHHHHHHHHHHHHHHHHHHhhcchhheecCcc Confidence 01122222 22233221 2358899987 7899999999999999999876666665333211 Q ss_pred CCCC-c----cccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCc Q lcl|NC_016654. 314 NLGM-G----QGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEV 388 (533) Q Consensus 314 ~~~~-~----~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~ 388 (533) .... . .+..+......+......... ....+++++++++++.+...++.+.+.|+..++.+.-+++. .++ T Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~-~~~ 369 (511) T protein:vir:96 295 LDPVEVRKQKEANVLFLEPTVYVDAEGRETE----GSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDN-FSG 369 (511) T ss_pred CCchhhcccccccceeccccceeccccccCC----CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-ccc Confidence 1100 0 001111111111111111111 12235678888888888888888888888888776544432 235 Q ss_pred chhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhC Q lcl|NC_016654. 389 AQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVA 468 (533) Q Consensus 389 ~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~a 468 (533) ..||.||+++++.+..++..|++.|+.+|++++++|+.+.+.............++|.|++++|.|..+.+++++++ + T Consensus 370 n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl--~ 447 (511) T protein:vir:96 370 TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--G 447 (511) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHH--h Confidence 67999999999999999999999999999999999988765332222233446789999999999999999999988 4 Q ss_pred CCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccc-cccCCCCCCCCC-CCCCCCCCCCC Q lcl|NC_016654. 469 SAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFG-TDQPPLPTENDP-ATDPEAVDEGE 533 (533) Q Consensus 469 Gi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~-~~~~~~~~~~~~-~~~~~~~~d~~ 533 (533) |+||+||++.+ +|.+++ +++|++||++|+....+..... +..+...++.++ +...+..+++| T Consensus 448 G~iS~et~l~~-l~~v~d--~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:96 448 GKISQTTLMSL-FSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred ccCChHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCCccCcccccC Confidence 89999999986 565654 7899999999986543322111 111111111111 11122233333 No 30 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=100.00 E-value=4.6e-55 Score=318.48 Aligned_cols=467 Identities=11% Similarity=0.018 Sum_probs=291.0 Q ss_pred CCCCCCcCCCcCcchHHHHHHHH-hhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeec Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVA-ESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHA 79 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~-~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 79 (533) |+-=+.....=+..+...+.+.. ..+.. ..+|.+||.+.+..... .......+..++|+++ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r----~~~l~~Yy~g~~~il~~--------------~~~~~~~~~~~~ki~~ 92 (511) T protein:vir:78 31 YDGTESDLLQNVNEVSKYIEHHMDYQRPR----LKVLSDYYEGKTKNLVE--------------LTRRKEEYMADNRVAH 92 (511) T ss_pred ccchhhhhhcCHHHHHHHHHHHHHhhhHH----HHHHHHHhhccCccccc--------------cCcccccccCcceeec Confidence 22222222222223333322211 11111 13455555554321100 0011223345789999 Q ss_pred ChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEc Q lcl|NC_016654. 80 PIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVD 159 (533) Q Consensus 80 n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~ 159 (533) |+|+.||+..++||||+|++++++ ++.+++.|+++++.|+|...+.++++.++++|.+|+++|+|++ +++++.+++ T Consensus 93 n~~k~Iv~~~~~yl~g~p~~~~~~--d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~d--g~~~i~~~~ 168 (511) T protein:vir:78 93 DYASYISDFINGYFLGNPIQYQDD--DKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD--DETRLYKSD 168 (511) T ss_pred chHHHHHHHHhhhhcccCceeecC--chHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCC--CceEEEEEc Confidence 999999999999999999999874 5568899999999999999999999999999999999999875 468999999 Q ss_pred CCeEEEEEecCCceE-EEEEEEEee---c-CCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccc Q lcl|NC_016654. 160 ADRAIPEFRWGRLVA-VTFWSELAG---G-DGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGAD 234 (533) Q Consensus 160 ~~~~~P~~~~g~~~~-v~f~~~~~~---~-~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~ 234 (533) |.+++|+|++....+ ++|++.+.. . .....+.++|.|++..|.+....+... ..+. .... T Consensus 169 p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~-----~~~~---------~~~~- 233 (511) T protein:vir:78 169 AMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNG-----LKLT---------PREN- 233 (511) T ss_pred ccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCc-----cccc---------cccc- Confidence 999999998753222 333333322 1 122334567899988886533322111 0000 0000 Q ss_pred cCCceeecCCCc-cceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhc Q lcl|NC_016654. 235 EGRGAYVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLT 313 (533) Q Consensus 235 ~~~~~~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~ 313 (533) ...+++.. .|++.|.. ...|.|+|.+ +.+|||++|.++|++++.++.....+.|-..+.. T Consensus 234 ----~~~~~~~g~vPvv~~~n--------------~~~g~gd~e~-v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~ 294 (511) T protein:vir:78 234 ----SFESHSFERMPITEFSN--------------NERRKGDYEK-VITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN 294 (511) T ss_pred ----ccccCcCcccceEEecC--------------CCCCCCchhh-hHHHHHHHHHHHHHHHHHHHHhhcchhheecCcc Confidence 01122222 22233221 2358899987 7899999999999999999876666665333211 Q ss_pred CCCC-c----cccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCc Q lcl|NC_016654. 314 NLGM-G----QGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEV 388 (533) Q Consensus 314 ~~~~-~----~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~ 388 (533) .... . .+..+......+......... ....+++++++++++.+...++.+.+.|+..++.+.-+++. .++ T Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~-~~~ 369 (511) T protein:vir:78 295 LDPVEVRKQKEANVLFLEPTVYVDAEGRETE----GSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDN-FSG 369 (511) T ss_pred CCchhhcccccccceeccccceeccccccCC----CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-ccc Confidence 1100 0 001111111111111111111 12235678888888888888888888888888776544432 235 Q ss_pred chhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhC Q lcl|NC_016654. 389 AQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVA 468 (533) Q Consensus 389 ~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~a 468 (533) ..||.||+++++.+..++..|++.|+.+|++++++|+.+.+.............++|.|++++|.|..+.+++++++ + T Consensus 370 n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl--~ 447 (511) T protein:vir:78 370 TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--G 447 (511) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHH--h Confidence 67999999999999999999999999999999999988765332222233446789999999999999999999988 4 Q ss_pred CCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccc-cccCCCCCCCCC-CCCCCCCCCCC Q lcl|NC_016654. 469 SAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFG-TDQPPLPTENDP-ATDPEAVDEGE 533 (533) Q Consensus 469 Gi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~-~~~~~~~~~~~~-~~~~~~~~d~~ 533 (533) |+||+||++.+ +|.+++ +++|++||++|+....+..... +..+...++.++ +...+..+++| T Consensus 448 G~iS~et~l~~-l~~v~d--~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:78 448 GKISQTTLMSL-FSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred ccCChHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCCccCcccccC Confidence 89999999986 565654 7899999999986543322111 111111111111 11122233333 No 31 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=100.00 E-value=9.9e-55 Score=316.65 Aligned_cols=469 Identities=11% Similarity=0.020 Sum_probs=288.6 Q ss_pred CCCC-CCcCCCcCcchH--------HHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhc-ccCCCC Q lcl|NC_016654. 1 MSLP-EANTAWPPPELA--------AVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHG-RTPTAT 70 (533) Q Consensus 1 ~~~~-~~~~~~pp~~~~--------~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 70 (533) ---| ..|..++-...+ -+...|..+..=-.-...+|.+||.+.+.. +.. ...... T Consensus 19 ~~~~~~~~~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i---------------~~~~~~~~~~ 83 (512) T protein:vir:97 19 YLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKN---------------LVELTRRKEE 83 (512) T ss_pred eeeccccccccccCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCcc---------------ccccCccccc Confidence 1112 222222211111 111111111110011122334444433221 000 111223 Q ss_pred CcccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCC Q lcl|NC_016654. 71 GRAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIA 150 (533) Q Consensus 71 g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~ 150 (533) +..++|+++|||+.||+..++||||+|+++++ .++.+++.|+++++.|+|...+.++++.++++|.+|+++|+|++ T Consensus 84 ~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~--~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded-- 159 (512) T protein:vir:97 84 YMADNRVAHDYASYISDFINGYFLGNPIQCQD--DDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD-- 159 (512) T ss_pred ccCcceeecchHHHHHHHHhhhhcccCceecc--CChHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCCC-- Confidence 45678999999999999999999999999987 45568899999999999999999999999999999999999875 Q ss_pred CceEEEEEcCCeEEEEEecCCceE-EEEEEEEe--ecC--CceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccc Q lcl|NC_016654. 151 DNAWIDFVDADRAIPEFRWGRLVA-VTFWSELA--GGD--GQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPAT 225 (533) Q Consensus 151 ~~~~i~~v~~~~~~P~~~~g~~~~-v~f~~~~~--~~~--~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~ 225 (533) +.++|.+++|.+++|+|++....+ ++|++.+. ..+ ......++|.|++..|.+ |.....+.+. +. T Consensus 160 ~~~~i~~~~p~~~~~iyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~--~~~~~~~~~~---~~----- 229 (512) T protein:vir:97 160 DETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYR--YLTSRTNGLK---LT----- 229 (512) T ss_pred CceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEE--EEecCCCccc---cc----- Confidence 468999999999999998753222 23333332 211 223345678899888765 4333222110 00 Q ss_pred ccccccccccCCceeecCCCcc-ceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcce Q lcl|NC_016654. 226 RDIAVEGADEGRGAYVETGVKD-LTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGK 304 (533) Q Consensus 226 ~~~~~~~~~~~~~~~~~~g~~~-~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~ 304 (533) ... ....+++... |++.|.+ ...|.|+|.+ +.+|||++|.++|++++.++..... T Consensus 230 ----~~~-----~~~~~~~~g~vPvv~~~n--------------n~~~~gd~e~-v~~liDa~d~~~S~~~~~~~~~~~~ 285 (512) T protein:vir:97 230 ----PRE-----NGFESHSFERMPITEFSN--------------NERRKGDYEK-VITLIDLYDNAESDTANYMSDLNDA 285 (512) T ss_pred ----ccc-----cccccccCcccceEeecC--------------CCCCCCchhh-hHHHHHHHHHHHHHHHHHHHHhcCc Confidence 000 0011233332 2333322 2358899986 7899999999999999999887777 Q ss_pred eeechHHhcCCCC-cc------ccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCC Q lcl|NC_016654. 305 VHASESVLTNLGM-GQ------GVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGY 377 (533) Q Consensus 305 i~v~~~~l~~~~~-~~------~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~ 377 (533) +.|-..+...... .. ............... ...+. ...++++++++.++.+...++.+.+.|+..++. T Consensus 286 ~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~----~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~ 360 (512) T protein:vir:97 286 MLLIKGNLNLDPVEVRKQKEANVLFLEPTVYENRDTG-IETEG----SVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNT 360 (512) T ss_pred eeeeecCccCCchhhhhhhhcccccccccchhhcccc-cCCCC----CcceEEEeecCCHHHHHHHHHHHHHHHHHHhCC Confidence 7763332211110 00 000001111111110 01111 123567788888888888888888888877777 Q ss_pred ChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHH Q lcl|NC_016654. 378 SPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLA 457 (533) Q Consensus 378 s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e 457 (533) +.-+++. .++..||.||+++++.+.+++..|++.|+.+|++++++|+.+....-..........++|+|++++|.|..+ T Consensus 361 p~~~~~~-~~gn~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~~e 439 (512) T protein:vir:97 361 PNMKDDN-FSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIE 439 (512) T ss_pred cccCccc-ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHH Confidence 6555442 235679999999999999999999999999999999999887553321222334457899999999999999 Q ss_pred HHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCcccc--ccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 458 KAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGF--GTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 458 ~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) .+++++++ +|+||+||++.+ +|.+++ +++|++||++|+....+.... +.+..+..++.+++...+..+.+| T Consensus 440 ~~~~~~kl--~giiS~et~~~~-l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (512) T protein:vir:97 440 ELKAYIDS--GGKISQTTLMSL-FSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) T ss_pred HHHHHHHH--hccCchHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCccccccccC Confidence 99999988 599999999987 565665 678999999998654332211 111111111111222222233333 No 32 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=100.00 E-value=8.6e-55 Score=316.99 Aligned_cols=468 Identities=11% Similarity=0.026 Sum_probs=294.4 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |+-=+-.+..=+..+..++.+....+ +-...+|.+||.+.+.... ........+..++|+++| T Consensus 31 ~~~~~~~~~~~~~~i~~~i~~~~~~~---~~r~~~l~~Yy~g~~~i~~--------------~~~~~~~~~~~~~ki~~n 93 (511) T protein:vir:10 31 YDGTESDLLQNVNEVSKCIEHHMDYQ---RPRLKVLSDYYEGKTKNLV--------------ELTRRKEEYMADNRVAHD 93 (511) T ss_pred CchhhhhcccCHHHHHHHHHHHHHhh---HHHHHHHHHHhcccCcccc--------------ccCcccccccCcceeecc Confidence 33323333332333333332211110 0111344455554332100 001112234467899999 Q ss_pred hHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcC Q lcl|NC_016654. 81 IPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDA 160 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~ 160 (533) +|+.||+..++||||+|++++++ ++.+++.|+++++.|+|...+.++++.++++|.+|+++|+|++ +++++.+++| T Consensus 94 ~~k~Iv~~~~~yl~g~p~~~~~~--d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded--g~~~i~~~~p 169 (511) T protein:vir:10 94 YASYISDFINGYFLGNPIQYQDD--DKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIRNQD--DETRLYKSDA 169 (511) T ss_pred hHHHHHHHHhhhhcccCceeecC--chHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCC--CceEEEEEcc Confidence 99999999999999999999874 4568899999999999999999999999999999999999875 4689999999 Q ss_pred CeEEEEEecCCc-eEEEEEEEEee---c-CCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccccccc Q lcl|NC_016654. 161 DRAIPEFRWGRL-VAVTFWSELAG---G-DGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADE 235 (533) Q Consensus 161 ~~~~P~~~~g~~-~~v~f~~~~~~---~-~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~ 235 (533) .+++|+|++... ..++|++.+.. . .......+.|.|++..|.+ |........ .+. ... T Consensus 170 ~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~--~~~~~~~~~---~~~---------~~~--- 232 (511) T protein:vir:10 170 MSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYR--YLTSRTNGL---KLT---------PRE--- 232 (511) T ss_pred ceeEEEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEE--EEecCCCcc---ccc---------ccc--- Confidence 999999987532 22334443322 1 1223345678888887765 333222110 000 000 Q ss_pred CCceeecCCCc-cceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcC Q lcl|NC_016654. 236 GRGAYVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTN 314 (533) Q Consensus 236 ~~~~~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~ 314 (533) ....+++.. .|++.|. | ...|.|+|.+ +.++||++|.++|++++.++....++.|...+... T Consensus 233 --~~~~~~~~~~vPvv~f~-n-------------n~~g~gd~e~-v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~ 295 (511) T protein:vir:10 233 --NGFESHSFERMPITEFS-N-------------NERRKGDYEK-VITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL 295 (511) T ss_pred --cccccccCcceeEEEec-C-------------CCCCCCchhh-hHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccC Confidence 001122222 2333332 2 1258899986 77999999999999999998877777774443321 Q ss_pred CCCc-----cccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc Q lcl|NC_016654. 315 LGMG-----QGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA 389 (533) Q Consensus 315 ~~~~-----~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~ 389 (533) .... .+..+......+........+.+ ..+++++++++++.+...++.+.+.|+..++.+.-+++. .++. T Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~-~~~n 370 (511) T protein:vir:10 296 DPVEVRKQKEANVLFLEPTVYADSEGRETEGS----VDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDN-FSGT 370 (511) T ss_pred CchhhccchhccceecccccccccccccCCCC----cceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cccc Confidence 1110 01111111111111111111222 235778888888888888888888888888776544432 2356 Q ss_pred hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCC Q lcl|NC_016654. 390 QTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVAS 469 (533) Q Consensus 390 ~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aG 469 (533) .||.||+++++.+.+++..|++.|+.+|++++++|+.+...............++|.|++++|.|..+.+++++++ +| T Consensus 371 ~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~G 448 (511) T protein:vir:10 371 QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GG 448 (511) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccccceeeEEeCCCCCcCHHHHHHHHHHH--hc Confidence 7999999999999999999999999999999999988765332222233446799999999999999999999998 48 Q ss_pred CCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCcccc--ccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 470 AASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGF--GTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 470 i~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +||+||++++ .|.+++ +++|++||++|+....+.... +.+..+..++..++...+..+.+| T Consensus 449 ~iS~et~~~~-l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:10 449 KISQTTLMSL-FSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred cCcHHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHHHHhhhcccCCCCCCCCCCCCcccCcccccC Confidence 9999999987 565665 678999999998654332211 111111111112222223333333 No 33 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=100.00 E-value=1.5e-54 Score=315.63 Aligned_cols=457 Identities=11% Similarity=0.045 Sum_probs=290.5 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHH---HHHHHHH------HHHHhcccCCCCC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGI---KARTKAA------YEAFHGRTPTATG 71 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~---~~~~~~~------~~~~~~~~~~~~g 71 (533) ..|| |=+|..+-++..+..-..- +.+.|.+++... ..+..++ ..+|.|. ..+.++......+ T Consensus 5 ~~~~-----~~~~~~~~~~~~~~~~~~~---~~~~i~~~i~~~-~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~ 75 (474) T protein:vir:94 5 IRMP-----WDKPYGEEVVEQLKPQFET---QEEMIVRLIDDH-RKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYD 75 (474) T ss_pred cccc-----CCCchhhHHHHhhhhcccC---HHHHHHHHHHHH-HHHHHHHHHHHHHhccccchhcccchhccccccccc Confidence 3333 3333333333333322110 112233333221 1111111 1111111 1223344455667 Q ss_pred cccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCC Q lcl|NC_016654. 72 RAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIAD 151 (533) Q Consensus 72 ~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~ 151 (533) .+++|+++|+++.||+..|+||||+|+++++ +++.+++.|+.+++ |+|...+.++++.++++|.+|+++|+|++ + T Consensus 76 ~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~--~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~~~~d~~--~ 150 (474) T protein:vir:94 76 KPDWRITTNFHQNLVDQKVSYVASKPVTYSC--EDENVLKVIHDVLD-TRWDNKLIDILTATSNKGIDWLQVYINEN--G 150 (474) T ss_pred cCcceeecchHHHHHHHHHhhhhcCCceecc--CcHHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCceEEEEEecCC--C Confidence 7788999999999999999999999999877 45668899999876 67999999999999999999999999875 4 Q ss_pred ceEEEEEcCCeEEEEEecCCceE-EEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccc Q lcl|NC_016654. 152 NAWIDFVDADRAIPEFRWGRLVA-VTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAV 230 (533) Q Consensus 152 ~~~i~~v~~~~~~P~~~~g~~~~-v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~ 230 (533) +++|.+++|.+++|+|+++.... ++|++.+...+.. .+|.|++..|.+..+.+ +.......... .. T Consensus 151 ~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~----~~~~yt~~~~~~y~~~~--~~~~~~~~~~~----~~--- 217 (474) T protein:vir:94 151 EMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEE----KVEFWTDTTVTYYVLEN--GGLIPDYYYGA----NH--- 217 (474) T ss_pred eeEEEEEcccceEEEEcCCCCCceEEEEEEEEecCeE----EEEEEeCCeEEEEEEcC--CccccccccCc----Cc--- Confidence 79999999999999998764333 3455555544332 35778877776644332 22111110000 00 Q ss_pred cccccCCceeecCCCc-cceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeech Q lcl|NC_016654. 231 EGADEGRGAYVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASE 309 (533) Q Consensus 231 ~~~~~~~~~~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~ 309 (533) .......++.. .|+++|++ .+.|.|+|.+ +.+|||++|.++|++++.++.....+.|.. T Consensus 218 -----~~~~~~~~~~g~vPvv~~~n--------------n~~g~sd~e~-v~~liDa~n~~~s~~~~~~~~~~~~~lv~~ 277 (474) T protein:vir:94 218 -----VQSHFSNGNWGRVPFIAFKN--------------NPEEVSDIWM-YKSIIDAIDKRLSDAQNMFDESVELIYILK 277 (474) T ss_pred -----ccccccccCCCccceEEecC--------------CcCCCCcHHH-HHHHHHHHHHHHHHHHHHHHHhcCceeeee Confidence 00011223333 33444432 2368999997 789999999999999999987666666622 Q ss_pred HHhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc Q lcl|NC_016654. 310 SVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA 389 (533) Q Consensus 310 ~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~ 389 (533) .+ .......+......++.+.. +.++ .+++++++++++++...++.+.+.|+..++.+.-+++ ..++. T Consensus 278 g~----~~~~~~~~~~~~~~~~~i~~---~~~~----~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~n 345 (474) T protein:vir:94 278 GY----EGEDLEEFMRGLKYYKAINV---DGDG----GVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTD-KFGSA 345 (474) T ss_pred cC----Ccccchhhhhhhhccceeec---cCCC----ceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCcc-ccccc Confidence 21 11111122222233433332 2222 2667888899999999999998888888876543332 22456 Q ss_pred hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCC Q lcl|NC_016654. 390 QTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVAS 469 (533) Q Consensus 390 ~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aG 469 (533) .||.|++++++.+..+|.+|++.|+.+|++++++|+.+.. .......++|+|++++|.|..+.++++++ +| T Consensus 346 ~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~------~~~d~~~i~v~f~~~~p~~~~e~a~~~~~---~g 416 (474) T protein:vir:94 346 PSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNN------LKTDVKDIEISFNFNRMMNDAEQSQIIAQ---SQ 416 (474) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCcccceeeEEeccCcccCHHHHHHHHHH---cC Confidence 7999999999999999999999999999999999887632 23455779999999999999999887654 69 Q ss_pred CCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 470 AASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 470 i~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +||++|++++ +|.++| +++|++||++|+..........++... +.+.+++..++++ T Consensus 417 ~iS~et~l~~-l~~v~D--~~~E~eri~~E~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~ 472 (474) T protein:vir:94 417 YLSRETLVKS-SPLVDD--YKAELERIEQEQMEYNKQLPNLDDGGA-----DGAQQQEGSNNKE 472 (474) T ss_pred CCCHHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHhhccccCCCCC-----CCcccCCCCcccc Confidence 9999999987 565655 678999999998654332211111100 1111111122222 No 34 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=100.00 E-value=1.5e-54 Score=315.63 Aligned_cols=457 Identities=11% Similarity=0.045 Sum_probs=290.5 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHH---HHHHHHH------HHHHhcccCCCCC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGI---KARTKAA------YEAFHGRTPTATG 71 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~---~~~~~~~------~~~~~~~~~~~~g 71 (533) ..|| |=+|..+-++..+..-..- +.+.|.+++... ..+..++ ..+|.|. ..+.++......+ T Consensus 5 ~~~~-----~~~~~~~~~~~~~~~~~~~---~~~~i~~~i~~~-~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~ 75 (474) T protein:vir:97 5 IRMP-----WDKPYGEEVVEQLKPQFET---QEEMIVRLIDDH-RKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYD 75 (474) T ss_pred cccc-----CCCchhhHHHHhhhhcccC---HHHHHHHHHHHH-HHHHHHHHHHHHHhccccchhcccchhccccccccc Confidence 3333 3333333333333322110 112233333221 1111111 1111111 1223344455667 Q ss_pred cccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCC Q lcl|NC_016654. 72 RAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIAD 151 (533) Q Consensus 72 ~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~ 151 (533) .+++|+++|+++.||+..|+||||+|+++++ +++.+++.|+.+++ |+|...+.++++.++++|.+|+++|+|++ + T Consensus 76 ~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~--~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~~~~d~~--~ 150 (474) T protein:vir:97 76 KPDWRITTNFHQNLVDQKVSYVASKPVTYSC--EDENVLKVIHDVLD-TRWDNKLIDILTATSNKGIDWLQVYINEN--G 150 (474) T ss_pred cCcceeecchHHHHHHHHHhhhhcCCceecc--CcHHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCceEEEEEecCC--C Confidence 7788999999999999999999999999877 45668899999876 67999999999999999999999999875 4 Q ss_pred ceEEEEEcCCeEEEEEecCCceE-EEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccc Q lcl|NC_016654. 152 NAWIDFVDADRAIPEFRWGRLVA-VTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAV 230 (533) Q Consensus 152 ~~~i~~v~~~~~~P~~~~g~~~~-v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~ 230 (533) +++|.+++|.+++|+|+++.... ++|++.+...+.. .+|.|++..|.+..+.+ +.......... .. T Consensus 151 ~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~----~~~~yt~~~~~~y~~~~--~~~~~~~~~~~----~~--- 217 (474) T protein:vir:97 151 EMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEE----KVEFWTDTTVTYYVLEN--GGLIPDYYYGA----NH--- 217 (474) T ss_pred eeEEEEEcccceEEEEcCCCCCceEEEEEEEEecCeE----EEEEEeCCeEEEEEEcC--CccccccccCc----Cc--- Confidence 79999999999999998764333 3455555544332 35778877776644332 22111110000 00 Q ss_pred cccccCCceeecCCCc-cceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeech Q lcl|NC_016654. 231 EGADEGRGAYVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASE 309 (533) Q Consensus 231 ~~~~~~~~~~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~ 309 (533) .......++.. .|+++|++ .+.|.|+|.+ +.+|||++|.++|++++.++.....+.|.. T Consensus 218 -----~~~~~~~~~~g~vPvv~~~n--------------n~~g~sd~e~-v~~liDa~n~~~s~~~~~~~~~~~~~lv~~ 277 (474) T protein:vir:97 218 -----VQSHFSNGNWGRVPFIAFKN--------------NPEEVSDIWM-YKSIIDAIDKRLSDAQNMFDESVELIYILK 277 (474) T ss_pred -----ccccccccCCCccceEEecC--------------CcCCCCcHHH-HHHHHHHHHHHHHHHHHHHHHhcCceeeee Confidence 00011223333 33444432 2368999997 789999999999999999987666666622 Q ss_pred HHhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc Q lcl|NC_016654. 310 SVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA 389 (533) Q Consensus 310 ~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~ 389 (533) .+ .......+......++.+.. +.++ .+++++++++++++...++.+.+.|+..++.+.-+++ ..++. T Consensus 278 g~----~~~~~~~~~~~~~~~~~i~~---~~~~----~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~n 345 (474) T protein:vir:97 278 GY----EGEDLEEFMRGLKYYKAINV---DGDG----GVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTD-KFGSA 345 (474) T ss_pred cC----Ccccchhhhhhhhccceeec---cCCC----ceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCcc-ccccc Confidence 21 11111122222233433332 2222 2667888899999999999998888888876543332 22456 Q ss_pred hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCC Q lcl|NC_016654. 390 QTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVAS 469 (533) Q Consensus 390 ~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aG 469 (533) .||.|++++++.+..+|.+|++.|+.+|++++++|+.+.. .......++|+|++++|.|..+.++++++ +| T Consensus 346 ~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~------~~~d~~~i~v~f~~~~p~~~~e~a~~~~~---~g 416 (474) T protein:vir:97 346 PSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNN------LKTDVKDIEISFNFNRMMNDAEQSQIIAQ---SQ 416 (474) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCcccceeeEEeccCcccCHHHHHHHHHH---cC Confidence 7999999999999999999999999999999999887632 23455779999999999999999887654 69 Q ss_pred CCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 470 AASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 470 i~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +||++|++++ +|.++| +++|++||++|+..........++... +.+.+++..++++ T Consensus 417 ~iS~et~l~~-l~~v~D--~~~E~eri~~E~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~ 472 (474) T protein:vir:97 417 YLSRETLVKS-SPLVDD--YKAELERIEQEQMEYNKQLPNLDDGGA-----DGAQQQEGSNNKE 472 (474) T ss_pred CCCHHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHhhccccCCCCC-----CCcccCCCCcccc Confidence 9999999987 565655 678999999998654332211111100 1111111122222 No 35 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=100.00 E-value=1.5e-54 Score=315.68 Aligned_cols=465 Identities=13% Similarity=0.071 Sum_probs=286.0 Q ss_pred CC--CCCCcC---CCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHH---HHHHHH------HHhccc Q lcl|NC_016654. 1 MS--LPEANT---AWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKAR---TKAAYE------AFHGRT 66 (533) Q Consensus 1 ~~--~~~~~~---~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~---~~~~~~------~~~~~~ 66 (533) |. |-.+|. |+=|.+.. .+..+-+...--.-..+.|.++...+ ..+..++... |.|... ..+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~e~~~~~i~~~i~~~-~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~ 78 (483) T protein:vir:12 1 MAQALIKGGNILYPSQPTQTE-IFDAIVRTNNKPETLEEMIVRYIKQH-LEKLPEISIGQEYYEQRPDIVKEPKPVDATG 78 (483) T ss_pred CccchhcCCceeecCcchhhh-hhhcccccCCchhhHHHHHHHHHHHH-HHHHHHHHHHHHHhccccccccccccccccc Confidence 32 223332 23333321 11111110000000011222222111 1111112111 111100 001111 Q ss_pred CCCCCcccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEc Q lcl|NC_016654. 67 PTATGRAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWD 146 (533) Q Consensus 67 ~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D 146 (533) .......++|+++|||+.||+..|+||||+|+++++ +++..++.|+++++ |+|...+.++++.++++|.+|+++|+| T Consensus 79 ~~~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~~~~--~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~y~~v~~d 155 (483) T protein:vir:12 79 AVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH--TDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLD 155 (483) T ss_pred cccccccccccccchHHHHHHHHhhhhcccCceecc--CChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEEc Confidence 222344567899999999999999999999999977 45667899999986 679999999999999999999999999 Q ss_pred CCCCCceEEEEEcCCeEEEEEecCCceE-EEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccc Q lcl|NC_016654. 147 PTIADNAWIDFVDADRAIPEFRWGRLVA-VTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPAT 225 (533) Q Consensus 147 ~~~~~~~~i~~v~~~~~~P~~~~g~~~~-v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~ 225 (533) ++ +++++.+++|.+++|+|++....+ .+|++.+...+. +.+|.|++++|.+..+.+... +.-... . T Consensus 156 ~d--~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~----~~~~~y~~~~v~~~~~~~~~~-----~~~~~~-~- 222 (483) T protein:vir:12 156 EE--GEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENE----TKVEYWDKVTVNYYVYENGSL-----IPDYSN-N- 222 (483) T ss_pred CC--CceEEEEEcccceEEEEcCCCCCceEEEEEEEEeecc----eEEEEEecCeEEEEEEeCCee-----eecccc-c- Confidence 75 468999999999999998643222 233444443332 236788888888765543211 000000 0 Q ss_pred ccccccccccCCceeecCCCc-cceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcce Q lcl|NC_016654. 226 RDIAVEGADEGRGAYVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGK 304 (533) Q Consensus 226 ~~~~~~~~~~~~~~~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~ 304 (533) ..........++.. .|++.|. | +..|.|+|+. +.+|+|++|.++|++++.++..... T Consensus 223 -------~~~~~~~~~~~~~g~vPvv~~~-n-------------n~~g~sd~e~-v~~liDa~d~~~S~~~~~~~~~~~~ 280 (483) T protein:vir:12 223 -------LENSKTHFSTGSWGKIPFIPFK-N-------------NDLEISDIFM-YKTLIDAYNRRLSDLSNTFKDSNEL 280 (483) T ss_pred -------ccccccccccCCCCccceEEec-C-------------CCCCCCchhh-HHHHHHHHHHHHHHHHHHHHHhcCc Confidence 00001111233333 2333332 2 2358999986 8899999999999999999876666 Q ss_pred eeechHHhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhccc Q lcl|NC_016654. 305 VHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGL 384 (533) Q Consensus 305 i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~ 384 (533) +.+ +..........+....+.+..+.. +.+++ +++++++++++.+...++.+.+.|+..++.+.-+++ T Consensus 281 ~lv----~~g~~~~~~~~~~~~~~~~~~~~~---~~~~~----~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~- 348 (483) T protein:vir:12 281 TYV----LTNYDDQELPEFKRLLRYYGAIKV---SDNGG----VDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSD- 348 (483) T ss_pred eee----eecCCcccchhHHHhhhhcccccc---CCCCc----ceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCcc- Confidence 665 322222221222222223333322 22222 567778888888888888888888888877654443 Q ss_pred CCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHH Q lcl|NC_016654. 385 SDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQA 464 (533) Q Consensus 385 ~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~ 464 (533) ..++..||.||+++++.+..++.+|++.|+.+|++++++|+.+.. .......++|.|++.+|.|..+.++++++ T Consensus 349 ~~~~n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~------~~~~~~~i~v~f~~~~p~~~~~~a~~~~k 422 (483) T protein:vir:12 349 KFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD------IKGEHKDVDISFNYNKVANTELQVQTAQQ 422 (483) T ss_pred ccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc------CCCccceeeEEeCCCCCCCHHHHHHHHHH Confidence 224567999999999999999999999999999999999887632 22355789999999999999999999998 Q ss_pred HHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 465 WSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 465 l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) + +|+||+||++++ +|.+++ +++|++||++|+......+....+.. .+..++++..+++| T Consensus 423 l--~GiiS~et~~~~-~~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~-----~d~~~~~~~~~~~e 481 (483) T protein:vir:12 423 S--MGIVSHETVLEN-HPFVED--LQAELERIEQEQMEYNKQLPNLDDGG-----ADGAQQQERSNNKE 481 (483) T ss_pred H--hccCchHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHhhcccccccc-----cCCcccCCCCCccc Confidence 8 699999999986 565654 78899999999864433221111111 11122222223333 No 36 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=100.00 E-value=1.7e-54 Score=315.34 Aligned_cols=455 Identities=11% Similarity=0.036 Sum_probs=290.0 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHH--Hhc-------ccCCCCC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEA--FHG-------RTPTATG 71 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~--~~~-------~~~~~~g 71 (533) ||+-++++ +.++..+..-..- ..+.|.++...+. .+..++...+..+.++ ... ....... T Consensus 7 ~~~~~~~~-------~~~~~~~~~~~~~---~~~~i~~~i~~~~-~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~ 75 (474) T protein:vir:95 7 MPWDKPYG-------EEVVEQMKPKVET---QEEMIIRLINNHK-QKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYT 75 (474) T ss_pred CCCCCCCC-------cchhhhccccccc---hHHHHHHHHHHHH-HHHHHHHHHHHHhcccCccccccchhhhccccccc Confidence 55555554 2233333222110 0112223322211 1111111111100000 000 0011223 Q ss_pred cccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCC Q lcl|NC_016654. 72 RAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIAD 151 (533) Q Consensus 72 ~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~ 151 (533) ..++|+++|||+.||+..|+||||+|+++++ +++.+++.|+++++ |+|...+.++++.++++|.+|+++|+|++ + T Consensus 76 ~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~--~~~~~~~~l~~~~~-n~~~~~~~~l~~~~~~~G~~~~~~~~d~~--~ 150 (474) T protein:vir:95 76 KPDWRITTNFHQNLVDQKVSYVAGKPVTYAH--DDDKVLDVIHQVLD-TRWDNKLIDILTAASNKGIDWLQVYINED--G 150 (474) T ss_pred ccccccccchHHHHHHhhhhhhcccCceecc--CChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEeeeCCC--C Confidence 4567899999999999999999999999987 45567899999986 68999999999999999999999999975 4 Q ss_pred ceEEEEEcCCeEEEEEecCCceE-EEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccc Q lcl|NC_016654. 152 NAWIDFVDADRAIPEFRWGRLVA-VTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAV 230 (533) Q Consensus 152 ~~~i~~v~~~~~~P~~~~g~~~~-v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~ 230 (533) .++|.+++|.+++|+|+++...+ ++|++.++..+. +.+|.|++..|.+..+.+....+.. . ... T Consensus 151 ~~~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~----~~~~vy~~~~i~~~~~~~~~~~~~~-----~------~~~ 215 (474) T protein:vir:95 151 ELKLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGE----TKVEYWTAETVTYYVYENGGLIPDF-----Y------YGD 215 (474) T ss_pred ceEEEEEcccceEEEEcCCCCCceEEEEEEEeecCe----eEEEEEeCCeEEEEEEcCCceeecc-----c------ccc Confidence 68999999999999998764433 445665554332 3468899998887555433111100 0 000 Q ss_pred cccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechH Q lcl|NC_016654. 231 EGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASES 310 (533) Q Consensus 231 ~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~ 310 (533) .. .......++...+++...+| ...|.|+|.. +.+|||++|.++|++++.++.....++|... T Consensus 216 ~~---~~~~~~~~~~~~vPvv~~~n-------------n~~~~~d~e~-v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g 278 (474) T protein:vir:95 216 EH---IQTHFSTGSWERVPFIAFKN-------------NPEEVSDIWM-YKSFVDAIDKRLSDVQNMFDESVELIYILRG 278 (474) T ss_pred cc---ccCcccccCCCccceEEecC-------------CCCCCCchHH-HHHHHHHHHHHHHHHHHHHHHhhcchhhhcC Confidence 00 00011223333222222222 2358899987 8899999999999999999877777766333 Q ss_pred HhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcch Q lcl|NC_016654. 311 VLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQ 390 (533) Q Consensus 311 ~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~ 390 (533) + .......+..+.+.++.+.. +.++ .++++++++..+++...++.+.+.|+..++.+.-+++ ..++.. T Consensus 279 ~----~~~~~~~~~~~~~~~~~i~~---~~~~----~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~n~ 346 (474) T protein:vir:95 279 Y----EGEDLSEFMEGLKYYKAINV---SSDG----GVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTD-KFGSAT 346 (474) T ss_pred C----Ccccccchhhhhhccceeec---cCCC----ceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcccc-cccccc Confidence 2 22221222223333333332 1222 2577888899999999999999999988887644332 234567 Q ss_pred hHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCC Q lcl|NC_016654. 391 TATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASA 470 (533) Q Consensus 391 Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi 470 (533) ||.|++++++.+..+|.+|++.|+.+|++++++|+.+.. .......++|+|++++|.|..+.++++++ +|+ T Consensus 347 Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g------~~~d~~~i~i~f~~~~p~~~~e~a~~~~~---~gi 417 (474) T protein:vir:95 347 SGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNK------IKLDAKEIEITFNFNVMVNDLEQSQIGAQ---SQY 417 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCcccceeeEEecCCCccCHHHHHHHHHH---cCC Confidence 999999999999999999999999999999999887632 23456789999999999999999987654 699 Q ss_pred CCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 471 ASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 471 ~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ||+||++.+ +|.+++ +++|++||++|+......+....+..+.... ++.+++++ T Consensus 418 iS~et~~~~-lp~v~D--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~------~~~~~~~~ 471 (474) T protein:vir:95 418 LSKETLVRH-HPWVDD--PKAELERLDEEQLELNKQLPNLDDGGADGAQ------QQQQSENN 471 (474) T ss_pred CChHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHhhccccccccCCCCC------CcCCCCcc Confidence 999999986 565655 7789999999986544333222211111111 11111111 No 37 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=100.00 E-value=1.7e-54 Score=315.34 Aligned_cols=455 Identities=11% Similarity=0.036 Sum_probs=290.0 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHH--Hhc-------ccCCCCC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEA--FHG-------RTPTATG 71 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~--~~~-------~~~~~~g 71 (533) ||+-++++ +.++..+..-..- ..+.|.++...+. .+..++...+..+.++ ... ....... T Consensus 7 ~~~~~~~~-------~~~~~~~~~~~~~---~~~~i~~~i~~~~-~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~ 75 (474) T protein:vir:96 7 MPWDKPYG-------EEVVEQMKPKVET---QEEMIIRLINNHK-QKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYT 75 (474) T ss_pred CCCCCCCC-------cchhhhccccccc---hHHHHHHHHHHHH-HHHHHHHHHHHHhcccCccccccchhhhccccccc Confidence 55555554 2233333222110 0112223322211 1111111111100000 000 0011223 Q ss_pred cccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCC Q lcl|NC_016654. 72 RAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIAD 151 (533) Q Consensus 72 ~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~ 151 (533) ..++|+++|||+.||+..|+||||+|+++++ +++.+++.|+++++ |+|...+.++++.++++|.+|+++|+|++ + T Consensus 76 ~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~--~~~~~~~~l~~~~~-n~~~~~~~~l~~~~~~~G~~~~~~~~d~~--~ 150 (474) T protein:vir:96 76 KPDWRITTNFHQNLVDQKVSYVAGKPVTYAH--DDDKVLDVIHQVLD-TRWDNKLIDILTAASNKGIDWLQVYINED--G 150 (474) T ss_pred ccccccccchHHHHHHhhhhhhcccCceecc--CChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEeeeCCC--C Confidence 4567899999999999999999999999987 45567899999986 68999999999999999999999999975 4 Q ss_pred ceEEEEEcCCeEEEEEecCCceE-EEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccc Q lcl|NC_016654. 152 NAWIDFVDADRAIPEFRWGRLVA-VTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAV 230 (533) Q Consensus 152 ~~~i~~v~~~~~~P~~~~g~~~~-v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~ 230 (533) .++|.+++|.+++|+|+++...+ ++|++.++..+. +.+|.|++..|.+..+.+....+.. . ... T Consensus 151 ~~~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~----~~~~vy~~~~i~~~~~~~~~~~~~~-----~------~~~ 215 (474) T protein:vir:96 151 ELKLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGE----TKVEYWTAETVTYYVYENGGLIPDF-----Y------YGD 215 (474) T ss_pred ceEEEEEcccceEEEEcCCCCCceEEEEEEEeecCe----eEEEEEeCCeEEEEEEcCCceeecc-----c------ccc Confidence 68999999999999998764433 445665554332 3468899998887555433111100 0 000 Q ss_pred cccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechH Q lcl|NC_016654. 231 EGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASES 310 (533) Q Consensus 231 ~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~ 310 (533) .. .......++...+++...+| ...|.|+|.. +.+|||++|.++|++++.++.....++|... T Consensus 216 ~~---~~~~~~~~~~~~vPvv~~~n-------------n~~~~~d~e~-v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g 278 (474) T protein:vir:96 216 EH---IQTHFSTGSWERVPFIAFKN-------------NPEEVSDIWM-YKSFVDAIDKRLSDVQNMFDESVELIYILRG 278 (474) T ss_pred cc---ccCcccccCCCccceEEecC-------------CCCCCCchHH-HHHHHHHHHHHHHHHHHHHHHhhcchhhhcC Confidence 00 00011223333222222222 2358899987 8899999999999999999877777766333 Q ss_pred HhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcch Q lcl|NC_016654. 311 VLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQ 390 (533) Q Consensus 311 ~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~ 390 (533) + .......+..+.+.++.+.. +.++ .++++++++..+++...++.+.+.|+..++.+.-+++ ..++.. T Consensus 279 ~----~~~~~~~~~~~~~~~~~i~~---~~~~----~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~n~ 346 (474) T protein:vir:96 279 Y----EGEDLSEFMEGLKYYKAINV---SSDG----GVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTD-KFGSAT 346 (474) T ss_pred C----Ccccccchhhhhhccceeec---cCCC----ceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcccc-cccccc Confidence 2 22221222223333333332 1222 2577888899999999999999999988887644332 234567 Q ss_pred hHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCC Q lcl|NC_016654. 391 TATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASA 470 (533) Q Consensus 391 Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi 470 (533) ||.|++++++.+..+|.+|++.|+.+|++++++|+.+.. .......++|+|++++|.|..+.++++++ +|+ T Consensus 347 Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g------~~~d~~~i~i~f~~~~p~~~~e~a~~~~~---~gi 417 (474) T protein:vir:96 347 SGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNK------IKLDAKEIEITFNFNVMVNDLEQSQIGAQ---SQY 417 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCcccceeeEEecCCCccCHHHHHHHHHH---cCC Confidence 999999999999999999999999999999999887632 23456789999999999999999987654 699 Q ss_pred CCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 471 ASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 471 ~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ||+||++.+ +|.+++ +++|++||++|+......+....+..+.... ++.+++++ T Consensus 418 iS~et~~~~-lp~v~D--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~------~~~~~~~~ 471 (474) T protein:vir:96 418 LSKETLVRH-HPWVDD--PKAELERLDEEQLELNKQLPNLDDGGADGAQ------QQQQSENN 471 (474) T ss_pred CChHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHhhccccccccCCCCC------CcCCCCcc Confidence 999999986 565655 7789999999986544333222211111111 11111111 No 38 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=100.00 E-value=4.6e-54 Score=313.01 Aligned_cols=464 Identities=11% Similarity=0.010 Sum_probs=295.8 Q ss_pred CCCCCCcCCCcCcchHH--HHHHHHhhhHhhcCCHH---HHHHHHhccCcchhhHHHHHHHHHHHH-HhcccCCCCCccc Q lcl|NC_016654. 1 MSLPEANTAWPPPELAA--VTARVAESHVWWEGDLD---KLATFYGAEGRTSPSGIKARTKAAYEA-FHGRTPTATGRAP 74 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~--~~~~~~~~~~w~~gd~~---~l~~~y~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~g~~~ 74 (533) |-|=.--......++.+ +...|+.+.. ..+ ++.++|++.........+..+...... .+.......+.++ T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~----~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (474) T protein:vir:94 1 MTLYKLIDDIEAQGILPKHIEALIESHKD----DRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVN 76 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhh----hhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcc Confidence 55543333333333322 2333333321 222 223444432211111111100000000 0111223445667 Q ss_pred ceeecChHHHHHHHHHHhhcCCCceEeeCC---CchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCC Q lcl|NC_016654. 75 KRYHAPIPGVIAKLSTTELFSEQLKFLDAG---KSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIAD 151 (533) Q Consensus 75 ~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~---~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~ 151 (533) +|+++|||+.||++.++||||+|+++++.+ .++.++++|+++++.|+|...+.+++..++++|.+|+++|.|++ + T Consensus 77 ~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~--~ 154 (474) T protein:vir:94 77 NKLNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTN--G 154 (474) T ss_pred cccccchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCC--C Confidence 899999999999999999999999999853 35678899999999999999999999999999999999999876 4 Q ss_pred ceEEEEEcCCeEEEEEec-CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccc Q lcl|NC_016654. 152 NAWIDFVDADRAIPEFRW-GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAV 230 (533) Q Consensus 152 ~~~i~~v~~~~~~P~~~~-g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~ 230 (533) ++++.+++|.+++|+|++ +....++.+........+..+..+++|+...+.. |.+....-... + T Consensus 155 ~~~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~--~~~~~~~~~~~-----~-------- 219 (474) T protein:vir:94 155 DIRIKNIDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYV--FRGEGIDALQE-----V-------- 219 (474) T ss_pred eeEEEEEcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEE--EeecCCCcccc-----c-------- Confidence 689999999999999975 3333333222333334445666788888876643 44332110000 0 Q ss_pred cccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechH Q lcl|NC_016654. 231 EGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASES 310 (533) Q Consensus 231 ~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~ 310 (533) ...+++...++++..+| .+.|.|+|.. +++|+|++|.++|++++.++.....+.+ T Consensus 220 --------~~~~~~~g~vPvv~~~n-------------~~~g~sd~e~-v~~liDa~d~~~S~~~~~~~~~~~~~l~--- 274 (474) T protein:vir:94 220 --------GRYEHLFDYNPLFGVPN-------------NKEMIGDAEK-VIHLIDAYDLTMSDASSEISQTRLAYLV--- 274 (474) T ss_pred --------ccccCCCCccceEEecC-------------CCCCCCchHH-HHHHHHHHHHHHHHHHHHHHHhhcchhh--- Confidence 01223333233333332 2358999986 8899999999999999999866665555 Q ss_pred HhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcch Q lcl|NC_016654. 311 VLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQ 390 (533) Q Consensus 311 ~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~ 390 (533) +...+... ....+......+.. .+.++ .++++++++.++.+...++.+.+.|+..++.+..+++. .++.. T Consensus 275 -i~g~~~~~--~~~~~~~~~~~i~~--~~~~~----~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~-~~~n~ 344 (474) T protein:vir:94 275 -LRGMGMSE--EMIQETQKSGAFEL--FDKDM----DVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDE-FNGNV 344 (474) T ss_pred -hccCCCCc--hhhhhhhhcceeEe--cCCCC----ceeEEeccCCHHHHHHHHHHHHHHHHHHhCCccccccc-ccccc Confidence 22111110 00111111222222 12222 25688888889999999999989998888876544432 23567 Q ss_pred hHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCC Q lcl|NC_016654. 391 TATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASA 470 (533) Q Consensus 391 Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi 470 (533) ||.||+++++.+..++.++++.|+.+|+++++.|+.+.+....+........+++.|++++|.|..+.+++++++ +|+ T Consensus 345 Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl--~g~ 422 (474) T protein:vir:94 345 PIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINL--KGQ 422 (474) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHH--hcc Confidence 999999999999999999999999999999999988765432232333456799999999999999999999988 599 Q ss_pred CCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 471 ASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 471 ~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +|++|++++ +|..++ +++|++||++|+......... ..+++.+++ +..++.| T Consensus 423 iS~et~~~~-l~~v~d--~~~E~eri~~E~~e~~~~~~~-------~~~~~~~~~-~~~~~s~ 474 (474) T protein:vir:94 423 VSERTRLGQ-SQLVDD--VDYELDEMEKESLEFNDKLPD-------IDEGDANDK-SQNNQSE 474 (474) T ss_pred CchHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHhhccc-------ccCCCcCCC-CccccCC Confidence 999999987 565554 889999999998643322111 111111111 1122222 No 39 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=100.00 E-value=4.6e-54 Score=313.01 Aligned_cols=464 Identities=11% Similarity=0.010 Sum_probs=295.8 Q ss_pred CCCCCCcCCCcCcchHH--HHHHHHhhhHhhcCCHH---HHHHHHhccCcchhhHHHHHHHHHHHH-HhcccCCCCCccc Q lcl|NC_016654. 1 MSLPEANTAWPPPELAA--VTARVAESHVWWEGDLD---KLATFYGAEGRTSPSGIKARTKAAYEA-FHGRTPTATGRAP 74 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~--~~~~~~~~~~w~~gd~~---~l~~~y~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~g~~~ 74 (533) |-|=.--......++.+ +...|+.+.. ..+ ++.++|++.........+..+...... .+.......+.++ T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~----~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (474) T protein:vir:10 1 MTLYKLIDDIEAQGILPKHIEALIESHKD----DRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVN 76 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhh----hhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcc Confidence 55543333333333322 2333333321 222 223444432211111111100000000 0111223445667 Q ss_pred ceeecChHHHHHHHHHHhhcCCCceEeeCC---CchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCC Q lcl|NC_016654. 75 KRYHAPIPGVIAKLSTTELFSEQLKFLDAG---KSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIAD 151 (533) Q Consensus 75 ~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~---~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~ 151 (533) +|+++|||+.||++.++||||+|+++++.+ .++.++++|+++++.|+|...+.+++..++++|.+|+++|.|++ + T Consensus 77 ~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~--~ 154 (474) T protein:vir:10 77 NKLNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTN--G 154 (474) T ss_pred cccccchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCC--C Confidence 899999999999999999999999999853 35678899999999999999999999999999999999999876 4 Q ss_pred ceEEEEEcCCeEEEEEec-CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccc Q lcl|NC_016654. 152 NAWIDFVDADRAIPEFRW-GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAV 230 (533) Q Consensus 152 ~~~i~~v~~~~~~P~~~~-g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~ 230 (533) ++++.+++|.+++|+|++ +....++.+........+..+..+++|+...+.. |.+....-... + T Consensus 155 ~~~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~--~~~~~~~~~~~-----~-------- 219 (474) T protein:vir:10 155 DIRIKNIDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYV--FRGEGIDALQE-----V-------- 219 (474) T ss_pred eeEEEEEcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEE--EeecCCCcccc-----c-------- Confidence 689999999999999975 3333333222333334445666788888876643 44332110000 0 Q ss_pred cccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechH Q lcl|NC_016654. 231 EGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASES 310 (533) Q Consensus 231 ~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~ 310 (533) ...+++...++++..+| .+.|.|+|.. +++|+|++|.++|++++.++.....+.+ T Consensus 220 --------~~~~~~~g~vPvv~~~n-------------~~~g~sd~e~-v~~liDa~d~~~S~~~~~~~~~~~~~l~--- 274 (474) T protein:vir:10 220 --------GRYEHLFDYNPLFGVPN-------------NKEMIGDAEK-VIHLIDAYDLTMSDASSEISQTRLAYLV--- 274 (474) T ss_pred --------ccccCCCCccceEEecC-------------CCCCCCchHH-HHHHHHHHHHHHHHHHHHHHHhhcchhh--- Confidence 01223333233333332 2358999986 8899999999999999999866665555 Q ss_pred HhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcch Q lcl|NC_016654. 311 VLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQ 390 (533) Q Consensus 311 ~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~ 390 (533) +...+... ....+......+.. .+.++ .++++++++.++.+...++.+.+.|+..++.+..+++. .++.. T Consensus 275 -i~g~~~~~--~~~~~~~~~~~i~~--~~~~~----~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~-~~~n~ 344 (474) T protein:vir:10 275 -LRGMGMSE--EMIQETQKSGAFEL--FDKDM----DVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDE-FNGNV 344 (474) T ss_pred -hccCCCCc--hhhhhhhhcceeEe--cCCCC----ceeEEeccCCHHHHHHHHHHHHHHHHHHhCCccccccc-ccccc Confidence 22111110 00111111222222 12222 25688888889999999999989998888876544432 23567 Q ss_pred hHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCC Q lcl|NC_016654. 391 TATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASA 470 (533) Q Consensus 391 Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi 470 (533) ||.||+++++.+..++.++++.|+.+|+++++.|+.+.+....+........+++.|++++|.|..+.+++++++ +|+ T Consensus 345 Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl--~g~ 422 (474) T protein:vir:10 345 PIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINL--KGQ 422 (474) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHH--hcc Confidence 999999999999999999999999999999999988765432232333456799999999999999999999988 599 Q ss_pred CCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 471 ASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 471 ~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +|++|++++ +|..++ +++|++||++|+......... ..+++.+++ +..++.| T Consensus 423 iS~et~~~~-l~~v~d--~~~E~eri~~E~~e~~~~~~~-------~~~~~~~~~-~~~~~s~ 474 (474) T protein:vir:10 423 VSERTRLGQ-SQLVDD--VDYELDEMEKESLEFNDKLPD-------IDEGDANDK-SQNNQSE 474 (474) T ss_pred CchHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHhhccc-------ccCCCcCCC-CccccCC Confidence 999999987 565554 889999999998643322111 111111111 1122222 No 40 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=100.00 E-value=8.3e-54 Score=311.59 Aligned_cols=451 Identities=11% Similarity=0.022 Sum_probs=291.5 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) .=||... .+=|..+..++.. +..=-.-...+|.+||.+.+.. + .++...+..++|+++| T Consensus 17 ~~~~~~~-~~~~~~i~~~i~~---~~~~~~~~~~~l~~Yy~g~~~i----------------~-~~~~~~~~~~~ki~~n 75 (470) T protein:vir:99 17 FIFPKGE-KLTSNELLGFIAY---NETVLKPRYRENMKLYLGKHKI----------------L-TAPEKETGADNRIVVN 75 (470) T ss_pred EEeCCCC-CcCHHHHHHHHHH---HHHhhHHHHHHHHHHhcccccc----------------c-cCcccccCCcceeecc Confidence 1133332 3334444443332 2111111124556666554321 1 1122345667899999 Q ss_pred hHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcC Q lcl|NC_016654. 81 IPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDA 160 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~ 160 (533) ||+.||+..++||||+|+++++.+++ ..++.|.+++++|+|...+.++++.++++|.+|+++|+|++ ++++|.+++| T Consensus 76 ~~~~Ivd~~~~~l~g~p~~~~~~~d~-~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~d--g~~~i~~~~p 152 (470) T protein:vir:99 76 SAKYVVDVYNGYFCGIEPKLALLNDS-SKIDEIARWNRQENFFDTINEISKQCDIFGRSIASIYQGED--ARPHLMYSSP 152 (470) T ss_pred hHHHHHHHHhhhhccCCeeEeeCCch-hHHHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCCC--CeEEEEEEcc Confidence 99999999999999999999986543 45788999999999999999999999999999999999875 4689999999 Q ss_pred CeEEEEEecCCce-EEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCce Q lcl|NC_016654. 161 DRAIPEFRWGRLV-AVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGA 239 (533) Q Consensus 161 ~~~~P~~~~g~~~-~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~ 239 (533) .+++|+|++..-. -++|++.+...++....+..+.|.+.++.+ |... .++....+ .+ T Consensus 153 ~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~~~~~~--~~~~--~~~~~~~~-------------~~----- 210 (470) T protein:vir:99 153 NHAFIIYDDTVQRQPLAFVHYQIDNSNNWTDAYGVIQYADKFYK--FKGY--DIEEDTNA-------------AG----- 210 (470) T ss_pred ceeEEEEcCCCCcceEEEEEEEEEecCCeeEEEEEEEecCeEEE--EEec--cccccccc-------------cc----- Confidence 9999999975433 334455454444333334445555554432 2221 11111100 00 Q ss_pred eecCCCc-cceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcC-CCC Q lcl|NC_016654. 240 YVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTN-LGM 317 (533) Q Consensus 240 ~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~-~~~ 317 (533) ...++.. .|++.|.. .+.|.|+|.. +.+|||++|.++|++++.++.....+.+-..+... ... T Consensus 211 ~~~~~~g~vPvv~~~n--------------~~~g~sd~e~-v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~ 275 (470) T protein:vir:99 211 YAINPYGLVPAVEFFE--------------NEERQGIFDS-IKTLINALDKVISQKANQVEYFDNAYMYMIGFKLPEDDE 275 (470) T ss_pred ccccCCCccceEeecC--------------CCCCCcchHh-HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccccc Confidence 0122222 23333322 2358999987 88999999999999999998776666663332211 111 Q ss_pred ccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHH Q lcl|NC_016654. 318 GQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASG 397 (533) Q Consensus 318 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~ 397 (533) +... .+ ......+....... +....++++++++..+.+...++.+.+.|+..++.+..+++. .++..||.||++ T Consensus 276 g~~~-~~--~~~~~~~~~~~~~~--~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~~~n~Sg~Ai~~ 349 (470) T protein:vir:99 276 GNPK-FD--FKNNRVLYVSQLDP--DTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKN-FAGNSSGVALQY 349 (470) T ss_pred cchh-hh--hhhcceeeecCCCC--CCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCccccccc-cccCchHHHHHH Confidence 1111 11 11122222222212 222346788889999999999999999999999887655442 245679999999 Q ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHH Q lcl|NC_016654. 398 KKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKV 477 (533) Q Consensus 398 ~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v 477 (533) +++.+..++.++++.|+.+|++++++++.+.+.. +.......+++|.|++++|.|..+.+++++++ +|+||+||++ T Consensus 350 ~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~--~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl--~giis~et~l 425 (470) T protein:vir:99 350 KLFAMKNKADSKERKFDKSLMQLYRIVLATLFNN--KQDQELWSELDFKFTRNLPEDMASAIDNAKNA--EGIVSKKTQL 425 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--CCcccccccceEEeCCCCCcCHHHHHHHHHHH--hccCCHHHHH Confidence 9999999999999999999999999998775432 23344567899999999999999999999988 4899999999 Q ss_pred HHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 478 AYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDE 531 (533) Q Consensus 478 ~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 531 (533) .+ +|.++ +++|++||++|+......... ..+..+ ..++++++++| T Consensus 426 ~~-l~~vd---~~~E~eri~~E~~~~~~~~~~---~~~~~d--~~~~d~~~ee~ 470 (470) T protein:vir:99 426 GM-IPDIE---PDAEMKQIAKEKADAIKQTQQ---LSMPID--ILKRDNNAEEE 470 (470) T ss_pred Hh-CCCCC---HHHHHHHHHHHHHHHHHHHHh---hcCCCC--cCCCCCCccCC Confidence 87 56553 668999999998543221111 111111 11112222222 No 41 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=100.00 E-value=2.5e-54 Score=314.47 Aligned_cols=465 Identities=10% Similarity=-0.001 Sum_probs=292.3 Q ss_pred CCCCc-------CCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccc Q lcl|NC_016654. 3 LPEAN-------TAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPK 75 (533) Q Consensus 3 ~~~~~-------~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 75 (533) |+..+ ..+.|..+...+.+....+. ....++.+||.+.+.. .........+.+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~---~r~~~~~~yy~g~~~i---------------~~~~~~~~~~~~~~ 62 (489) T protein:vir:99 1 MLQEDFEAIDYESKLWIDQLKNYISRFKAEQL---ERLKELKRYYLGDNNI---------------KYRPAKTDKYAADN 62 (489) T ss_pred CCccceeeeCCCCCCCHHHHHHHHHHHHHHHH---HHHHHHHHHhcccCcc---------------ccccccccccCCcc Confidence 33222 23344444433333221110 0123444455443211 11111223345677 Q ss_pred eeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcC--CCCCce Q lcl|NC_016654. 76 RYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDP--TIADNA 153 (533) Q Consensus 76 ~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~--~~~~~~ 153 (533) |+++|||+.||+..|+||||+|+++++ .++.+++.|+++++.|+|...+.++++.++++|.+|+.+|+.+ +..+++ T Consensus 63 ki~~n~~~~iv~~~~~~l~g~~~~~~~--~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~~ 140 (489) T protein:vir:99 63 RIASDFAKYITVFEQGYMLGVPVEYKN--ENKDLQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTEV 140 (489) T ss_pred eeecchHHHHHHHHhhhhccCCceeec--CChhHHHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcce Confidence 999999999999999999999999987 4556889999999999999999999999999999999998743 345789 Q ss_pred EEEEEcCCeEEEEEecCCceE-EEEEEEEee-cCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccc Q lcl|NC_016654. 154 WIDFVDADRAIPEFRWGRLVA-VTFWSELAG-GDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVE 231 (533) Q Consensus 154 ~i~~v~~~~~~P~~~~g~~~~-v~f~~~~~~-~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~ 231 (533) +|.+++|.+++|+|++....+ +.|++.+.. .+....+.+.+.|+++.|.+ |.......+. +.+ T Consensus 141 ~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~--~~~~~~~~~~-~~~------------ 205 (489) T protein:vir:99 141 KLYQLPAEQTFVIYDDTYQRNSLMAVHFYDIDYGSGKRKQIIKAYTSDTIYT--YEDYNLETKG-MRL------------ 205 (489) T ss_pred EEEEEcccceEEEEcCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEE--EEecCCCccc-cee------------ Confidence 999999999999998643222 233333322 12223355678898887754 4332211100 000 Q ss_pred ccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHH Q lcl|NC_016654. 232 GADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESV 311 (533) Q Consensus 232 ~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~ 311 (533) ....+++...+++.+++| ...|.|+|.+ +.+|+|++|.++|++++.++....++.+-..+ T Consensus 206 ------~~~~~~~~g~vPvv~~~n-------------~~~~~s~~~~-v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~ 265 (489) T protein:vir:99 206 ------KDYEGHFFKGVPVNEYAN-------------NEERTGAYES-VLDNIDAYDLSQSELANFQQDSVNALLVIAGN 265 (489) T ss_pred ------cccccccCCceeEEEeec-------------CCCCCCchhh-hHHHHHHHHHHHHHHHHHHHHhhhhhhhhccC Confidence 001223333333333333 1248889986 78999999999999999998766655543222 Q ss_pred hcCCCC-cc---ccccCcc--------hhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCCh Q lcl|NC_016654. 312 LTNLGM-GQ---GVSLDEE--------QEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSP 379 (533) Q Consensus 312 l~~~~~-~~---~~~~d~~--------~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~ 379 (533) ...... .. ....+.. ....+.+.........+....+++++.+++++.+...++.+.+.|+..++.+. T Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~ 345 (489) T protein:vir:99 266 AYTGADENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPD 345 (489) T ss_pred CcccccchhhhhhcccccccccccccccccceeeeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCcc Confidence 111000 00 0000000 00001111111111112233567888899999999999999888888887764 Q ss_pred hhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCC-CCCceeEEEEeCCCCCCCHHHH Q lcl|NC_016654. 380 VSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKG-AAPSEELELEWPKFARESDLAK 458 (533) Q Consensus 380 ~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~-~~~~~~v~i~f~d~i~~d~~e~ 458 (533) -.+. ..++..||.||+++++.+.+++.+|++.|+.+|++++++|+.+.+....... .....+++|.|++++|.|..+. T Consensus 346 ~~~~-~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~ 424 (489) T protein:vir:99 346 TQDM-KFSGVQSGESMKYKLMASDNYREKQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEI 424 (489) T ss_pred cccc-cccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHH Confidence 3322 3345679999999999999999999999999999999999887653211111 1123468999999999999999 Q ss_pred HHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 459 AQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVD 530 (533) Q Consensus 459 a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (533) +++++++ +|+||+|+++++ .|.++++++++|++||++|+........ +...++.++++...+.+| T Consensus 425 ~~~~~kl--~giis~et~~~~-l~~v~~~d~~~E~~ri~~E~~~~~~~~~----~~~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 425 VTAAQNL--YGIVSDQTIFEI-LNTVTGVDAEAELKRLKEEADKKQSLPE----PRLVGDASGQEEPTAEKP 489 (489) T ss_pred HHHHHHH--hccCCHHHHHHh-cCCCCchhHHHHHHHHHHHHHHHhcccc----ccccCCCCCCcCCCCCCC Confidence 9999988 499999999987 5678888899999999999754432111 111111111111111222 No 42 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=100.00 E-value=3.8e-54 Score=313.48 Aligned_cols=436 Identities=9% Similarity=-0.004 Sum_probs=292.1 Q ss_pred HHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHhhcCC Q lcl|NC_016654. 17 AVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTELFSE 96 (533) Q Consensus 17 ~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e 96 (533) =+..++...+.+| .+|.+||.+.+.... ........+..++|+++|+|+.||++.|+||||+ T Consensus 1 ~~~~~~~~~~~r~----~~l~~yy~g~~~~~~--------------~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~ 62 (440) T protein:vir:95 1 MLAAFLGSQKQRL----AILASYAQGDNFSIL--------------SGHRRLDDEKADYRVRHKWGGYISSFATGYVIGN 62 (440) T ss_pred ChhhHHHHHHHHH----HHHHHHhccCCcccc--------------cccccccccCCcceeecchHHHHHHhhhhheecc Confidence 2333444444444 355666655432100 0011223456678999999999999999999999 Q ss_pred CceEeeCCC-chHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCce-E Q lcl|NC_016654. 97 QLKFLDAGK-SKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLV-A 174 (533) Q Consensus 97 ~~~i~~~~~-~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~-~ 174 (533) |+++++.+. +++..+.|+++++.|+|...+.++++.|+++|.+|+++|+|++ ++++|.+++|.+++|+|++.... . T Consensus 63 ~~~~~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~--~~~~i~~~~p~~~~~~~d~~~~~~~ 140 (440) T protein:vir:95 63 PVSIGVMEGGSADQLSTIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDKD--KVDRVVLISPLEMFVIRDLTVEQNI 140 (440) T ss_pred CceEeeCCCccHHHHHHHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCC--CceEEEEEcccceEEEEcCCCCCce Confidence 999988653 4567788999999999999999999999999999999999875 46899999999999999875432 3 Q ss_pred EEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEec Q lcl|NC_016654. 175 VTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVP 254 (533) Q Consensus 175 v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~p 254 (533) +++++.+...+.. ..+.|++..|.+ |.......+.-. . ....+++...+++++.+ T Consensus 141 ~~~i~~~~~~~~~----~~~vyt~~~~~~--~~~~~~~~~~~~--------------~-----~~~~~~~~g~vPvv~~~ 195 (440) T protein:vir:95 141 IAAVHLPIYADKV----NMTVYTKDKVIT--YKPYSNNSVRLV--------------V-----DDVKKHSYNDVPVVEWW 195 (440) T ss_pred EEEEEEEEecCce----EEEEEeCCeEEE--EEEecCCcccee--------------e-----cceeeccCceeeEEEee Confidence 3344444443332 245677666654 221111111000 0 01123444433333333 Q ss_pred CCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCC--CCccccccCcchhhhhh Q lcl|NC_016654. 255 NVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNL--GMGQGVSLDEEQEVYSR 332 (533) Q Consensus 255 n~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~--~~~~~~~~d~~~~~~~~ 332 (533) | ...|.|+|.. +.+|||++|.++|++++.++.....+.|...+.... .+..+..+......+.. T Consensus 196 n-------------~~~g~sd~e~-v~~lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~~~~~ 261 (440) T protein:vir:95 196 N-------------NRFRMGDYES-EISLIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKLSPEDAAKMKDANMLFLK 261 (440) T ss_pred C-------------CCCCCCchhh-hHHHHHHHHHHHHHHHHHHHHhhcceeeeecccccCCCCccchhhhhhccceecc Confidence 3 1258899987 789999999999999999988777766633321111 11111111112222221 Q ss_pred ccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHH Q lcl|NC_016654. 333 VGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARH 412 (533) Q Consensus 333 ~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~ 412 (533) ...... +.+....++++++++..+.+...++.+.+.|+..++++..+++.- ++..||.||+++++.+.+++++|+.. T Consensus 262 ~~~~~~--~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~k~~~ 338 (440) T protein:vir:95 262 TGISTT--GQQTTADASYIYKQYDVNGTEAYKNRLANDIHRFSRIPNLDDDRF-NSTSSGIALLYKMIGLEQVRKDKETY 338 (440) T ss_pred cccccc--cCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc-cccchHHHHHHHHHHHHHHHHHHHHH Confidence 111111 111222467888899999999999999999999998876555432 45679999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHH Q lcl|NC_016654. 413 FGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEE 492 (533) Q Consensus 413 ~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~E 492 (533) |+++|++++++|+.+.+.. ++.......++|.|++++|.|..+.+++++++ +|+||+||++.++ |.+++ ++| T Consensus 339 ~~~~l~~~~~li~~~~~~~--~~~~~~~~~v~i~f~~~~p~~~~~~ad~~~kl--~g~iS~et~~~~l-~~~d~---~~E 410 (440) T protein:vir:95 339 FTKALRRRYELISNIHKAI--NGPVIEANKLTFTFHPNIPQDVWTEIKAYIEA--GGEISQETLMENA-SFTDY---KTE 410 (440) T ss_pred HHHHHHHHHHHHHHHHhhc--CCcccccccceEEeCCCCCCCHHHHHHHHHHH--hccCcHHHHHHhC-CCCCc---HHH Confidence 9999999999998876543 23344567899999999999999999999988 6899999999874 55543 468 Q ss_pred HHHHHHhhhcccCccccccccCCCCCCCCCCCCCC Q lcl|NC_016654. 493 ADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPE 527 (533) Q Consensus 493 l~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (533) ++||++|+..+.+.+.... ++.++.++++| T Consensus 411 ~~ri~~E~~~~~~~~~~~~-----~~~~~~~~~~e 440 (440) T protein:vir:95 411 HSRILKQGGSSDLEIGQIV-----GDADVGQADTE 440 (440) T ss_pred HHHHHHHHHHhhhhHHhhc-----cCCCCCCcCCC Confidence 9999999876555432110 11111111112 No 43 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=100.00 E-value=4e-54 Score=313.34 Aligned_cols=457 Identities=12% Similarity=0.049 Sum_probs=285.8 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |-.+......-+..+...+.+...... ...+|.+||.+.+.-. .+. ...+..........++|+++| T Consensus 35 ~~~~~~~~~~~~~~i~~~i~~~~~~~~----r~~~l~~YY~g~~~i~-~~~--------~~~~~~~~~~~~~~~~ri~~n 101 (492) T protein:vir:97 35 IVRTNNKPETLEEMIVRYIKQHLEKLP----EISIGQEYYEQRPDIV-KEP--------KPVDATGAVDPLKPDDRMITN 101 (492) T ss_pred cccCCCchhhHHHHHHHHHHHHHHHHH----HHHHHHHHhcccCccc-ccc--------ccccccccccccccccccccc Confidence 433333322222222222211111111 1133455555543210 000 001112222334567799999 Q ss_pred hHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcC Q lcl|NC_016654. 81 IPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDA 160 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~ 160 (533) ||+.||+..++||||+|+++++ +++..++.|+++++ |+|...+.++++.++++|.+|+++|.|++ +++++.+++| T Consensus 102 ~~k~Ivd~~~~yl~g~p~~~~~--~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~a~~~v~~d~d--g~~~~~~~~p 176 (492) T protein:vir:97 102 FHANLVDQKVSYIVGKPIAFKH--TDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE--GEFKLFRVPA 176 (492) T ss_pred hHHHHHHHHhhhhcccCceecc--CchHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCeEEEEEEecCC--CceEEEEEcc Confidence 9999999999999999999877 45568899999986 68999999999999999999999999875 4689999999 Q ss_pred CeEEEEEecCCceE-EEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCce Q lcl|NC_016654. 161 DRAIPEFRWGRLVA-VTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGA 239 (533) Q Consensus 161 ~~~~P~~~~g~~~~-v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~ 239 (533) .+++|+|++....+ .+|++.+...+.. .+|.|++++|.+..+.+.... . ....... ..... T Consensus 177 ~~~~~i~d~~~~~~~~~~vr~~~~~~~~----~~~~y~~~~v~~~~~~~~~~~-~---~~~~~~~----------~~~~~ 238 (492) T protein:vir:97 177 EQGIPIWTDKEHEELEAFIRMYKLENET----KVEYWDKVTVNYYVYENGSLI-P---DYSNNLE----------NSKTH 238 (492) T ss_pred cceEEEEcCCCCCceEEEEEEEeeccce----eEEEEecCeEEEEEEecCeee-e---ccccccc----------ccccc Confidence 99999998754333 3445555444432 357888888887655432110 0 0000000 00011 Q ss_pred eecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCCcc Q lcl|NC_016654. 240 YVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGMGQ 319 (533) Q Consensus 240 ~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~ 319 (533) ...++...++++..+| ...|.|+|.. +.+|+|++|.++|++++.++.....+++...+ .... T Consensus 239 ~~~~~~g~vPvv~~~n-------------n~~g~sd~e~-v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~----~~~~ 300 (492) T protein:vir:97 239 FSTGSWGKIPFIPFKN-------------NDLEISDIFM-YKTLIDAYNRRLSDLSNTFKDSNELTYVLKNY----DDQE 300 (492) T ss_pred cccCCCCCcceEEecC-------------CCCCCCchHh-HHHHHHHHHHHHHHHHHHHHHhccceeeeecC----Cccc Confidence 1233333222332222 2358999987 78999999999999999998877777763322 1111 Q ss_pred ccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHh Q lcl|NC_016654. 320 GVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKK 399 (533) Q Consensus 320 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~ 399 (533) ...+......+..+.. +.+++ ++++++++.++.+...++.+.+.|+..++.+.-+++ ..++..||.||++++ T Consensus 301 ~~~~~~~~~~~~~~~~---~~~~~----~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~-~~~~n~Sg~Al~~~~ 372 (492) T protein:vir:97 301 LPEFKRLLRYYGAIKV---SDNGG----VDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSD-KFGSAPSGVALEFLY 372 (492) T ss_pred chhHHHHHhhccceec---CCCCc----ceeEeccCCHHHHHHHHHHHHHHHHHHhCCCCCCcc-ccccCcHHHHHHHHH Confidence 1112112222333322 22222 456777778888888888777777777766543332 223567999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHH Q lcl|NC_016654. 400 DLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAY 479 (533) Q Consensus 400 ~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~ 479 (533) +.+..+++++++.|+.+|++++++++.+.+ .......++|+|++.+|.|..+.+++++++ +|+||+||++++ T Consensus 373 ~~l~~ka~~~~~~f~~~l~~~~~li~~~~~------~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl--~G~iS~et~l~~ 444 (492) T protein:vir:97 373 TNLNLKADKLARKAKVAIQELLWFVFEHFD------IKGEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLEN 444 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhc------CCcccceeeEEecCCCCCCHHHHHHHHHHH--hccCchHHHHHh Confidence 999999999999999999999999887632 223557899999999999999999999988 699999999986 Q ss_pred hCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 480 LHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 480 l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +|.+++ +++|++||++|+...........+.... .+.+.++..+++ +| T Consensus 445 -l~~v~d--~~~Eleri~~E~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~--~e 492 (492) T protein:vir:97 445 -HPFVED--LQAELERIEQEQTEYNKQLPNLDDGGAD-SAQQQERSNNKE--SE 492 (492) T ss_pred -CCCCCC--HHHHHHHHHHHHHHHHHhhhccccCCCC-CCcccccccccc--cC Confidence 565665 6789999999986433221111110000 001111111111 11 No 44 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=100.00 E-value=3.4e-54 Score=313.73 Aligned_cols=461 Identities=9% Similarity=0.017 Sum_probs=290.5 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcc--cCCCCCcccceee Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGR--TPTATGRAPKRYH 78 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~g~~~~~~~ 78 (533) +-.|+.--..=+.++-..+.+...-+. --..+|.+||.+.+.. ++.. .....+..++|++ T Consensus 13 ~~~~~~~~~l~~~~i~~li~~~~~~~~---~r~~~l~~YY~g~~~~---------------i~~~~~~~~~~~~~~~ki~ 74 (506) T protein:vir:94 13 LIYQESLENLTPNKIMKFITHHFNYQR---PRLEMLDDYYQGYNLK---------------ILDKQSRRHEDGKADHRAT 74 (506) T ss_pred eecccchhcCCHHHHHHHHHHHHHHHH---HHHHHHHHHhcCCCcc---------------ccccccccccccCCcceee Confidence 333443333333333222222111000 0113344444443211 1111 1123355678999 Q ss_pred cChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEE Q lcl|NC_016654. 79 APIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFV 158 (533) Q Consensus 79 ~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v 158 (533) +|+|+.||++.|+||||+|++++++ ++.+++.|+++++.|+|...+.++++.++++|.+|+++|+|++ +++++.++ T Consensus 75 ~n~~~~Iv~~~~~~l~G~p~~~~~~--d~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded--~~~~i~~~ 150 (506) T protein:vir:94 75 HSFAKYIADFQTSYSVGNPINVKLP--DDGSNSGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGED--NEEHLAKL 150 (506) T ss_pred cchHHHHHHHhhhhhcccCceeecC--cchHHHHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecCC--CeeEEEEE Confidence 9999999999999999999999885 4557899999999999999999999999999999999999975 57999999 Q ss_pred cCCeEEEEEecCCceE-EEEEEEEe--ecCCce---EEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccc Q lcl|NC_016654. 159 DADRAIPEFRWGRLVA-VTFWSELA--GGDGQE---VWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEG 232 (533) Q Consensus 159 ~~~~~~P~~~~g~~~~-v~f~~~~~--~~~~~~---~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~ 232 (533) +|.+++|+|+++.-.+ ++|++.+. ..++.. ++...+.|+...+.+ |.+. ..+..+. T Consensus 151 ~p~~~~~v~dd~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~--~~~~--~~~~~~~-------------- 212 (506) T protein:vir:94 151 DPLDTFVIYSTDVDPKPIMAVRYHQIELVDDNQVSTINYVPETWTADTYTL--YNPT--PIMGKMQ-------------- 212 (506) T ss_pred cccceEEEecCCCCCceEEEEEEEeeeeccCCceeEEEEEEEEEeCceEEE--eccc--cCcccee-------------- Confidence 9999999998653222 33343332 222222 234455566655443 3222 1111100 Q ss_pred cccCCceeecCCCc-cceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHH Q lcl|NC_016654. 233 ADEGRGAYVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESV 311 (533) Q Consensus 233 ~~~~~~~~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~ 311 (533) ....++.. .|++.|..| ..|.|+|.+ +++|||++|.++|++++.++.....+++-..+ T Consensus 213 ------~~~~~~~g~vPvv~~~n~--------------~~~~sd~e~-~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~ 271 (506) T protein:vir:94 213 ------VDTTKPITTFPVVEFKNS--------------NFRLGDFEN-VLPLIDLYDAAQSDTANYMTDLNEAMLIIQGD 271 (506) T ss_pred ------ccccccCCccceEEecCC--------------CCCCCchhh-hHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcC Confidence 00122222 233333322 247889986 78999999999999999997655555442221 Q ss_pred hcC--------------CCCccccc------cCcchhhhhhccccc--cccccccccceeeechhhhhHHHHHHHHHHHH Q lcl|NC_016654. 312 LTN--------------LGMGQGVS------LDEEQEVYSRVGSGG--FNANGDMETIFEFFQPAIRVLEHDQGAALLLR 369 (533) Q Consensus 312 l~~--------------~~~~~~~~------~d~~~~~~~~~~~~~--~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~ 369 (533) ... ...+.... +.......+.+.... ...+......++++++++..+++...++.+.+ T Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~ 351 (506) T protein:vir:94 272 IDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAG 351 (506) T ss_pred ccccccchhccccccccccccccccccchhHHHhhhhhcCeeeecccccccCccccccceeeeecCCHHHHHHHHHHHHH Confidence 100 00000000 000111111111111 00111122346788899999999999999999 Q ss_pred HHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCC Q lcl|NC_016654. 370 EVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPK 449 (533) Q Consensus 370 ~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d 449 (533) .|+..++++.-+++ ..++..||.||+++++.+.++|++|++.|+.+|++++++|+.+.+.. .+........++|.|++ T Consensus 352 ~I~~~s~~p~~~~~-~~~~n~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~-~~~~~~d~~~i~i~f~~ 429 (506) T protein:vir:94 352 DIHKFSHTPDLTDE-NFASNSSGVAMQYKVLGTVELASTKRRMFERGLYARYQIISDIENSI-HGDWTFDPQELTFTFRD 429 (506) T ss_pred HHHHHhCccccccc-cccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCccccccccceEEeCC Confidence 99988887754432 23466799999999999999999999999999999999999886532 33344556689999999 Q ss_pred CCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCC Q lcl|NC_016654. 450 FARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAV 529 (533) Q Consensus 450 ~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 529 (533) ++|.|..+.+++++++ +|+||++|++.+ +|.+++ +++|++||++|+...++.+.......... ...+.+.+ T Consensus 430 ~~p~d~~e~a~~~~kl--~g~iS~et~~~~-lp~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~----~~~~~~~~ 500 (506) T protein:vir:94 430 NLPADNISQIKALVQA--GATLPQKYLYQQ-LPGVTN--PQDIVDMMKEQSANGDYSFDQNGVISNDG----QTNTTATQ 500 (506) T ss_pred CCCcCHHHHHHHHHHH--hccCChHHHHHh-CCCCCC--HHHHHHHHHHHHHHHhhcchhhcCCCccc----Cccccccc Confidence 9999999999999988 699999999986 576665 66899999999976555433222111111 11112222 Q ss_pred CCCC Q lcl|NC_016654. 530 DEGE 533 (533) Q Consensus 530 ~d~~ 533 (533) .+.| T Consensus 501 ~~~e 504 (506) T protein:vir:94 501 TDEE 504 (506) T ss_pred cccC Confidence 2333 No 45 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=100.00 E-value=1e-53 Score=311.15 Aligned_cols=456 Identities=12% Similarity=0.032 Sum_probs=281.4 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |-.+......-+..+...+........+| .+|.+||.+.+.-. .+ ..+.+..........++|+++| T Consensus 35 ~~~~~~~~~~~~~~i~~~i~~~~~~~~r~----~~l~~YY~g~~~I~-~~--------~~~~~~~~~~~~~~~~~ri~~n 101 (492) T protein:vir:94 35 IVRTNNKPETLEEMIVRYIKQHLEKLPEI----SIGQEYYEQRPDIV-KE--------PKPVDATGAVDPLKPDDRMITN 101 (492) T ss_pred ccccCCchhhHHHHHHHHHHHHHHHHHHH----HHHHHHhccccccc-cc--------cccccccccccccccccccccc Confidence 33332222222222222221111111111 23444444432110 00 0011112222234556789999 Q ss_pred hHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcC Q lcl|NC_016654. 81 IPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDA 160 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~ 160 (533) ||+.||++.++||||+|+++++ +++.+++.|+++++ |+|...+.++++.++++|.+|+++|+|++ +++++.+++| T Consensus 102 ~~k~Ivd~~~~yl~G~p~~~~~--~d~~~~~~l~~~~~-n~~~~~~~~~~~~a~~~G~a~~~v~~d~d--g~~~~~~~~p 176 (492) T protein:vir:94 102 FHANLVDQKVSYIVGKPIAFKH--TDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE--GEFKLFRVPA 176 (492) T ss_pred hHHHHHHHHHhhhcccCceecc--CchHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEecCC--CceEEEEEcc Confidence 9999999999999999999977 45668899999986 68999999999999999999999999975 4689999999 Q ss_pred CeEEEEEecCCceE-EEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCce Q lcl|NC_016654. 161 DRAIPEFRWGRLVA-VTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGA 239 (533) Q Consensus 161 ~~~~P~~~~g~~~~-v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~ 239 (533) .+++|+|++....+ ++|++.+...+.. .+|.|++..|.+..+... .+.. .......+ . ... T Consensus 177 ~~~~~v~d~~~~~~~~a~ir~~~~~~~~----~~~~y~~~~v~~~~~~~~--~~~~--~~~~~~~~----~------~~~ 238 (492) T protein:vir:94 177 EQGIPIWTDKEHEELEAFIRMYKLENET----KVEYWDKVTVNYYVYENG--SLIP--DYSNNLEN----S------KTH 238 (492) T ss_pred cceEEEEcCCCCCceEEEEEEEeeccce----eEEEEecCeEEEEEEecC--eeee--cccccccc----c------ccc Confidence 99999998654333 3345545443332 357888888877544322 1100 00000000 0 001 Q ss_pred eecCCCc-cceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCCc Q lcl|NC_016654. 240 YVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGMG 318 (533) Q Consensus 240 ~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~ 318 (533) ..+++.. .|++.|.. ...|.|+|.. +.+|+|++|.++|++++.++.....+.+...+ ... T Consensus 239 ~~~~~~g~vPvv~~~n--------------n~~~~sd~e~-v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~----~~~ 299 (492) T protein:vir:94 239 FSTGSWGKIPFIPFKN--------------NDLEISDIFM-YKTLIDAYNRRLSDLSNTFKDSNELTYVLKNY----DDQ 299 (492) T ss_pred ccccCCCccceEEecC--------------CCCCCCchHH-HHHHHHHHHHHHHHHHHHHHHhcCceeeeecC----Ccc Confidence 1223333 23333332 2358999986 78999999999999999998777666663222 221 Q ss_pred cccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHH Q lcl|NC_016654. 319 QGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGK 398 (533) Q Consensus 319 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~ 398 (533) ....+......+..+.. +.+++ ++++++++..+.+...++.+.+.|+..++.+.-+++ ..++..||.||+++ T Consensus 300 ~~~~~~~~~~~~~~~~~---~~~~~----~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~n~Sg~Al~~~ 371 (492) T protein:vir:94 300 ELPEFKRLLRYYGAIKV---SDNGG----VDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSD-KFGSAPSGVALEFL 371 (492) T ss_pred cchhhHHHHhhccceec---CCCCc----ceeEeccCCHHHHHHHHHHHHHHHHHHhCCcCCCcc-ccccCchHHHHHHH Confidence 11112222222333222 22222 456677777777766677766666666655433322 12356799999999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHH Q lcl|NC_016654. 399 KDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVA 478 (533) Q Consensus 399 ~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~ 478 (533) ++.+..++++|++.|+.+|++++++|+.+.. ......++.|+|++++|.|..+.+++++++ +|++|+||+++ T Consensus 372 ~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~------~~~~~~~i~v~f~~~~p~~~~e~~~~~~kl--~giiS~et~~~ 443 (492) T protein:vir:94 372 YTNLNLKADKLARKAKVAIQELLWFVFEHFD------IKGEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLE 443 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc------CCcccceeeEEecCCCCCCHHHHHHHHHHH--hccCchHHHHH Confidence 9999999999999999999999999887632 223456799999999999999999999988 59999999998 Q ss_pred HhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 479 YLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDE 531 (533) Q Consensus 479 ~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 531 (533) + +|.+++ +++|++||++|+...+.......+..+.... ++++.++.+.+ T Consensus 444 ~-l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~-~~~~~~~~e~e 492 (492) T protein:vir:94 444 N-HPFVED--LQAELERIEQEQMEYNKQLPNLDDGGADSAQ-QQERSNNKESE 492 (492) T ss_pred h-CCCCCC--HHHHHHHHHHHHHHHHhhccccccccCCCCc-cccCCccccCC Confidence 6 565655 7789999999976543332221111111111 11111122222 No 46 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=100.00 E-value=1.4e-53 Score=310.30 Aligned_cols=457 Identities=14% Similarity=0.085 Sum_probs=284.0 Q ss_pred CCCCCCcC-----------CCcCcch-HHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCC Q lcl|NC_016654. 1 MSLPEANT-----------AWPPPEL-AAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPT 68 (533) Q Consensus 1 ~~~~~~~~-----------~~pp~~~-~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 68 (533) |. |+..+ ++=.... .-+...+..++.=+. ...+|.+||.+.+.-. .+. ...+..... T Consensus 1 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~-~~~~~~~YY~g~~~i~-~~~--------~~~~~~~~~ 69 (472) T protein:vir:93 1 MY-PSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLP-EISIGQEYYEQRPDIV-KEP--------KPVDATGAV 69 (472) T ss_pred CC-CCCCcchhhhhceeeecCchhhHHHHHHHHHHHHHHHHH-HHHHHHHHhccccccc-ccc--------chhhccccc Confidence 21 11100 1100000 111111111111110 1123445555443210 000 001112222 Q ss_pred CCCcccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCC Q lcl|NC_016654. 69 ATGRAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPT 148 (533) Q Consensus 69 ~~g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~ 148 (533) .....++|+++|||+.||+..|+||||+|+++++ +++.+++.|+++++ |+|...+.++++.++++|.+|+++|+|++ T Consensus 70 ~~~~~~~ri~~n~~~~ivd~~~~~l~g~~~~~~~--~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~d 146 (472) T protein:vir:93 70 DPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH--TDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE 146 (472) T ss_pred cccccccccccchHHHHHHHHhhhhcccCeeecc--CChHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCeEEEEEEECCC Confidence 3345677899999999999999999999999977 45667899999986 68999999999999999999999999875 Q ss_pred CCCceEEEEEcCCeEEEEEecCCceEE-EEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccc Q lcl|NC_016654. 149 IADNAWIDFVDADRAIPEFRWGRLVAV-TFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRD 227 (533) Q Consensus 149 ~~~~~~i~~v~~~~~~P~~~~g~~~~v-~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~ 227 (533) ++++|.+++|.+++|+|++....+. +|++.+...+.. .+|.|+++.|.+..+.+... +.- .... T Consensus 147 --~~~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~----~~~~~~~~~~~~~~~~~~~~-----~~~--~~~~-- 211 (472) T protein:vir:93 147 --GEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENET----KVEYWDKVTVNYYVYENGSL-----IPD--YSNN-- 211 (472) T ss_pred --CceEEEEEcccceEEEEcCCCCCceEEEEEEEEeecce----eEEEEecCeEEEEEEecCee-----eec--cccc-- Confidence 4689999999999999986543333 344444443332 35778888887755443211 000 0000 Q ss_pred ccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeee Q lcl|NC_016654. 228 IAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHA 307 (533) Q Consensus 228 ~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v 307 (533) ..........++...++++..+| ...|.|+|+. +.+|+|++|.++|+++++++.....+.+ T Consensus 212 -----~~~~~~~~~~~~~~~vPvv~~~n-------------n~~g~s~~e~-v~~liDa~~~~~s~~~~~~~~~~~~~~~ 272 (472) T protein:vir:93 212 -----LENSKTHFSTGSWGKIPFIPFKN-------------NDLEISDIFM-YKTLIDAYNRRLSDLSNTFKDSNELTYV 272 (472) T ss_pred -----ccccccccccCCCCCcceEEecC-------------CCCCCCchhh-hHHHHHHHHHHHHHHHHHHHHhcCceeE Confidence 00001112234444333333333 2358999996 8899999999999999999876666665 Q ss_pred chHHhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCC Q lcl|NC_016654. 308 SESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDE 387 (533) Q Consensus 308 ~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~ 387 (533) +..........+......+..+. .+.+++ +++++++++++++...++.+.+.|+..++.+.-+++ ..+ T Consensus 273 ----~~g~~~~~~~~~~~~~~~~~~~~---~~~~~~----~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~~ 340 (472) T protein:vir:93 273 ----LTNYDDQELPEFKRLLRYYGAIK---VSDNGG----VDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSD-KFG 340 (472) T ss_pred ----eecCCcccchhhHHHHhhccccc---cCCCCc----ceeEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCcc-ccc Confidence 22211111111211222232222 122222 566777888888888888888888888877654443 234 Q ss_pred cchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHh Q lcl|NC_016654. 388 VAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSV 467 (533) Q Consensus 388 ~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~ 467 (533) +..||.||+++++.+..++++|++.|+.+|++++++|+.+.. .......++|+|++.+|.|..+.+++++++ T Consensus 341 ~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~------~~~~~~~i~v~f~~~~p~~~~~~~~~~~k~-- 412 (472) T protein:vir:93 341 SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD------IKGEHKDVDISFNYNKVANTELQVQTAQQS-- 412 (472) T ss_pred cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCcccceeeEEeCCCCCCCHHHHHHHHHHH-- Confidence 567999999999999999999999999999999999887632 223456799999999999999999999987 Q ss_pred CCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 468 ASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 468 aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +|++|++|++++ +|.+++ +++|++||++|+...+..+....+..+ +++.+.+.+++.| T Consensus 413 ~giis~et~l~~-l~~~~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~-----d~~~~~~~~~~~~ 470 (472) T protein:vir:93 413 MGIVSHETVLEN-HPFVED--LQAELERIEQEQMEYNKQLPNLDDGGA-----DGAQQQERSNNKE 470 (472) T ss_pred hccCchHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHHhccCcCcccC-----CCCCCCCCCCccc Confidence 699999999987 555655 778999999997543322221111111 1111111112222 No 47 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=100.00 E-value=3e-53 Score=308.51 Aligned_cols=480 Identities=12% Similarity=0.075 Sum_probs=295.8 Q ss_pred CCCCCCcCCCcCcchHHHH-HHHHhh-----hHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcc---cCCCCC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVT-ARVAES-----HVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGR---TPTATG 71 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~-~~~~~~-----~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~g 71 (533) |--|-=++ |+......+ ..|..+ +.+| ..+.+||.+.+.-. .+. .+++.. +..... T Consensus 1 ~~~~~~~~--~~~~~~~~~~~~i~~~~~~~~~~~~----~~~~~YY~g~h~Il-~r~--------~~~~~~~~~~~~d~~ 65 (537) T protein:vir:78 1 MTSPLLNK--PIDQLGGLLNTEITTYMASNHIKWA----HIGENYYNQENDIE-KSR--------IFYMNDKGQLREDNY 65 (537) T ss_pred CCcccccc--cHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHhcccchhh-hcc--------ccccccccccccccc Confidence 33222222 223333222 112211 1222 24556666554210 000 011111 112223 Q ss_pred cccceeecChHHHHHHHHHHhhcCCCceEeeCC-CchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCC Q lcl|NC_016654. 72 RAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAG-KSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIA 150 (533) Q Consensus 72 ~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~-~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~ 150 (533) .+++|+++||++.||++.++||||.|+++++.+ .++.+++.|+++++ ++|..++.++++.++++|.+|+++|+|++ T Consensus 66 ~~nnki~~nf~k~Ivd~~~~yl~G~Pv~~~~~d~~~~e~~~~l~~~~~-~~~~~~~~el~~~~s~~G~ay~~~y~de~-- 142 (537) T protein:vir:78 66 ASNVKISHGFFTELVDQLAQYLLSNGVEVKVKDEDNTQLDEILQEYFD-EDFQATIDTLVTNASKKGFEGIFARTTSE-- 142 (537) T ss_pred ccccccccchHHHHHHHHhhhhcccCceeecCcchhHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeeEEEeeecCC-- Confidence 467899999999999999999999999999864 34567788888875 78999999999999999999999999986 Q ss_pred CceEEEEEcCCeEEEEEec-CCceEEEEEEEEee----cCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccc Q lcl|NC_016654. 151 DNAWIDFVDADRAIPEFRW-GRLVAVTFWSELAG----GDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPAT 225 (533) Q Consensus 151 ~~~~i~~v~~~~~~P~~~~-g~~~~v~f~~~~~~----~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~ 225 (533) +.+++..++|.++||+|++ +.+..++.+..... ........++|.|++..|.+..+.+. +++....+...... T Consensus 143 ~~~~~~~i~p~~~~pv~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~--~~~~~~~~~~~~~~ 220 (537) T protein:vir:78 143 GKLKFQTVDGLTLIPVFDDYGVLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDE--GVSTTYKLDEAYNP 220 (537) T ss_pred CceEEEEEccceeEEEEcCCCCceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCC--cccccccccccccc Confidence 4689999999999999985 55555443322211 11223456789999999987544432 22111111111111 Q ss_pred cccccc----------ccccCCceeecCCCc-cceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHH Q lcl|NC_016654. 226 RDIAVE----------GADEGRGAYVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSL 294 (533) Q Consensus 226 ~~~~~~----------~~~~~~~~~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~ 294 (533) ..+... ...........++.. .|++.|..| ..|.|+|.. +++|||++|.++|++ T Consensus 221 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~nn--------------~~~~sd~e~-v~~LiDayd~~~S~~ 285 (537) T protein:vir:78 221 NPAPHVLAIEESTDADFEDTDGYQVLGRSYSKFPFQLLYNN--------------KDGMSDVKR-VKSIIDDYDVMNCFL 285 (537) T ss_pred cccceeeeccccccccccccccccccccCCcceeEEEeccC--------------ccCCCchhh-hHHHHHHHHHHHHhh Confidence 111000 000111111122222 233333332 248899986 889999999999999 Q ss_pred HHHHHhCcceeeechHHhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHh Q lcl|NC_016654. 295 MRDFRIGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRK 374 (533) Q Consensus 295 ~~~~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~ 374 (533) ++.++.....|+| +...+......+..+.+.++++... ..++ .++++++++..++....++.+.+.|... T Consensus 286 an~~~~~~~~ilv----i~g~~~~~~~~~~~~l~~~~~i~v~--~d~~----~v~~l~~~~~~~~~e~~ld~L~~~I~~~ 355 (537) T protein:vir:78 286 SNNLQDFSEAIYV----VKGFSGDSTDKLRQNIKAKKMIGVN--GDNA----GMEIQTVSIPYEARKAKMDIDVENIYRS 355 (537) T ss_pred hhHHHHhcCceee----eecCCCccchhHHHHHhhcCceeec--CCCC----ceeEEEecCCHHHHHHHHHHHHHHHHHh Confidence 9999877777777 3222211111122222334444332 1111 3678889998888888888888877765 Q ss_pred hCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCC Q lcl|NC_016654. 375 TGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARES 454 (533) Q Consensus 375 ~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d 454 (533) +..+ . ++....|..||.||+++++.+..+|++|++.|+++|++++++|+.+.+.. +........|.|.|++.+|.| T Consensus 356 s~~~-~-~~~~~~gn~SGvAlk~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~--~~~~~d~~~i~i~f~~~~P~n 431 (537) T protein:vir:78 356 GMGF-N-STAVGDGNVTNVVIKSRYTLLAMKARKMETSLRKVLRWCADMVVSDIALR--GLGEYDSNDICFEIEPHVLAN 431 (537) T ss_pred cCCC-C-CccccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--CCcccccceeeEEeccCCCCC Confidence 5332 2 23344566799999999999999999999999999999999999886532 334456678999999999999 Q ss_pred HHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhh---------------hc---ccCc---ccc---c Q lcl|NC_016654. 455 DLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNAN---------------TV---SAPT---FGF---G 510 (533) Q Consensus 455 ~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~---------------~~---~~~~---~~~---~ 510 (533) ..+.++++++++++|++|++|++++ +|.+++.|.++ ++++|. .. +.|. +.. . T Consensus 432 ~~e~a~~~~~l~~~giiS~eT~l~~-~p~vdd~e~ek---~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (537) T protein:vir:78 432 ELDIATTRKTEAETEALKIGNIMTV-APRIGDDETLK---LIAEELDLDYNELKDALAEQDAQSLDVSPDVQAMLDGLPV 507 (537) T ss_pred HHHHHHHHHHHHhcCcchHHHHHHh-CCCCCCHHHHH---HHHHHHHhhhhhhhhhhhhhcccccCcCcchhhhcCCCCC Confidence 9999999999999999999999986 67666643221 111111 00 0010 001 1 Q ss_pred cccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 511 TDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 511 ~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) .+.++..++..+.|+|.-.+-|+ T Consensus 508 ~~~~~~~d~~~~~~~~~~~~~~~ 530 (537) T protein:vir:78 508 NANQPPVDPNQPVADPNVVPPTD 530 (537) T ss_pred CCCCCCCCccCCCCCCCCCCCCC Confidence 11222234444555555555555 No 48 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=100.00 E-value=3.8e-53 Score=308.00 Aligned_cols=442 Identities=9% Similarity=0.011 Sum_probs=285.3 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |-+|.... -=+......+........+| .+|.+||.+.+. +........+..++|+++| T Consensus 9 ~~~~~~~~-~~~~~i~~~i~~~~~~~~r~----~~~~~yy~g~~~----------------i~~~~~~~~~~~~~ki~~n 67 (453) T protein:vir:73 9 MTYSRDEE-ITDKVVNDFMKKHQEEVERY----EYLGNMYKGIME----------------ISSQKAKDSWKPDNRLTNN 67 (453) T ss_pred eecccccc-CCHHHHHHHHHHHHHHHHHH----HHHHHHhccccc----------------hhcCCCCCccCccceeecc Confidence 33342211 11223333333222222222 234444444321 2223344556778899999 Q ss_pred hHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcC Q lcl|NC_016654. 81 IPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDA 160 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~ 160 (533) +|+.||++.|+||||+|+++++ +++..++.|+++++.|+|...+.++++.++++|.+|+++|+|++ +.++|.+++| T Consensus 68 ~~~~ivd~~~~~l~g~~~~~~~--~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~--~~~~i~~~~p 143 (453) T protein:vir:73 68 FAKYIVDTFVGYFNGIPIKKTH--DDKSVLEAMQLFDNLNDMEDEESELAKIACVYGRAYELMYQNES--TESEVIYCSP 143 (453) T ss_pred hHHHHHHHhhhhhcccCceeec--CChHHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCC--CceEEEEEcc Confidence 9999999999999999999877 45668899999999999999999999999999999999999975 4688999999 Q ss_pred CeEEEEEecCCceEEEE-EEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCce Q lcl|NC_016654. 161 DRAIPEFRWGRLVAVTF-WSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGA 239 (533) Q Consensus 161 ~~~~P~~~~g~~~~v~f-~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~ 239 (533) .+++|+|+++.-...+| ++.+...++. +..+.|++.+|.+ |....+.. . .. . T Consensus 144 ~~~~~v~dd~~~~~~~~~i~~~~~~~~~---~~~~vyt~~~i~~--~~~~~~~~----~-------------~~-----~ 196 (453) T protein:vir:73 144 LNVFMVYDDSIKQKPLFAVYYGFDEEGN---LSGTVYTLLETIS--ITGKAGEV----K-------------FG-----E 196 (453) T ss_pred cceEEEEeCCCCceeEEEEEEEEecCce---EEEEEEeCCeEEE--EEecCCce----E-------------Ec-----c Confidence 99999998764333333 3333333332 2367788777755 33322211 0 00 0 Q ss_pred eecCCCc-cceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCCc Q lcl|NC_016654. 240 YVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGMG 318 (533) Q Consensus 240 ~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~ 318 (533) ..+++.. .|++.|+. .+.|.|+|.. +.+|+|++|.++|++++.++.....+.+-..+-.. .. T Consensus 197 ~~~~~~g~vPvv~~~n--------------~~~g~s~~~~-v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~--~~ 259 (453) T protein:vir:73 197 STYNVYSDLPIVEYNF--------------NEERQSIFEP-VHSLINSYNKVTSEKANDVEYFSDQYLVFLGAEVD--EE 259 (453) T ss_pred ceeccCCceeEEEecC--------------CCCCCcchhh-HHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCC--ch Confidence 0122222 23333322 2358999986 88999999999999999997665555442111000 00 Q ss_pred cccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHH Q lcl|NC_016654. 319 QGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGK 398 (533) Q Consensus 319 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~ 398 (533) ....+........................+++.++++..+.+...++.+.+.|+..++.+. +++...|..||.|++++ T Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~~gn~Sg~Al~~~ 337 (453) T protein:vir:73 260 DAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAAN--ISDENFGNSSGVALAYK 337 (453) T ss_pred hhhcccccccccccccccccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCcc--cCcccccCccHHHHHHH Confidence 0000111100110000000111111122367788888888888888888888888777653 44455567899999999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHH Q lcl|NC_016654. 399 KDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVA 478 (533) Q Consensus 399 ~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~ 478 (533) ++.+..+++++++.|+.+|+++++.|+.+.+. .........++|.|++++|.|..+.++++++++ |++|.||+++ T Consensus 338 ~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~---~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~k~~--giis~et~~~ 412 (453) T protein:vir:73 338 LQAMSNLALSFQRKFQSALNRRYSLWSSLSTN---ASNKDAWKDIEYTFTRNEPKDIKEQAETANILK--GITSEETALS 412 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---cCCccccccceEEeCCCCCCCHHHHHHHHHHHh--ccCcHHHHHH Confidence 99999999999999999999999999887542 223344567999999999999999999999885 8999999998 Q ss_pred HhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 479 YLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 479 ~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) + +|.+++ +++|++||++|+.........+. + ..++..-|+ T Consensus 413 ~-~~~~~d--~~~E~~ri~~E~~~~~~~~~~~~-~-----------~~~~~~~~~ 452 (453) T protein:vir:73 413 V-ISVIPD--VQAEMEKIKKKKLLQLSLTRTSN-L-----------VRMKQMRGN 452 (453) T ss_pred h-CCCCCC--HHHHHHHHHHHHHHHHHHHHhcc-C-----------CcchhhhcC Confidence 6 666665 77899999999875543221110 0 111111111 No 49 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=100.00 E-value=3.6e-52 Score=302.59 Aligned_cols=425 Identities=11% Similarity=0.032 Sum_probs=285.9 Q ss_pred CCHHHHHHHHhccCcchhhHHH---HHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHhhcCCCceEeeCCCch Q lcl|NC_016654. 31 GDLDKLATFYGAEGRTSPSGIK---ARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSK 107 (533) Q Consensus 31 gd~~~l~~~y~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~ 107 (533) =+++.|.++.... ..+..+++ .+|.|.. .++.+.....+..++|+++|+|+.||+..++||||+|+++++ +++ T Consensus 1 l~~~~l~~~i~~~-~~~~~r~~~l~~yy~g~~-~il~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~--~~~ 76 (429) T protein:vir:98 1 MTKDLLSELIQKH-RSFNLSYSAYKQLYEGDH-AILQQKQKEQYKPDNRLVVNFAKYIVDTFNGYFIGVPVQTSH--ENK 76 (429) T ss_pred CCHHHHHHHHHHH-HHHHHHHHHHHHHhcccc-ccccccccccCCCcceeecchHHHHHHHHhhhhcccCceeec--CCh Confidence 1233343333221 11112222 2222221 123344455566788999999999999999999999999987 456 Q ss_pred HHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCce-EEEEEEEEeecCC Q lcl|NC_016654. 108 EVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLV-AVTFWSELAGGDG 186 (533) Q Consensus 108 ~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~-~v~f~~~~~~~~~ 186 (533) .+++.|+++++.|+|...+.++++.++++|.+|+++|+|++ +++++.+++|.+++|+|++..-. .+++++.+...++ T Consensus 77 ~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~--g~~~~~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~ 154 (429) T protein:vir:98 77 QVSNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDEN--AEAGITYLTPLEAFIVYDDSIRQKPLFAVRYFYNKGG 154 (429) T ss_pred HHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecCC--CcEEEEEEcccceEEEEeCCCCCceEEEEEEEEecCc Confidence 68899999999999999999999999999999999999875 57899999999999999864322 2344444443333 Q ss_pred ceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccc Q lcl|NC_016654. 187 QEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDP 266 (533) Q Consensus 187 ~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~ 266 (533) . ...++++...+++ |...... ..+ . ...+++...+++.+.+| T Consensus 155 ~---~~~~~~~~~~~~~--~~~~~~~--~~~---------------~-----~~~~~~~g~vPvv~~~n----------- 196 (429) T protein:vir:98 155 V---LEGSYSDASNITY--FKDGEKG--IEI---------------G-----ESEPHPFDGVPMIEYVE----------- 196 (429) T ss_pred e---EEEEEEeCceEEE--EEecCCc--eEe---------------c-----ccccccCCccceEEecC----------- Confidence 2 2345556555543 3221111 100 0 01123333333333333 Q ss_pred cccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCCccccccCcchhhhhhcccccccccccccc Q lcl|NC_016654. 267 KLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMET 346 (533) Q Consensus 267 ~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 346 (533) ...|.|+|.. +.+++|++|.++|++++.++.....+.+ +...... ..+..+...++.+.....+ +... T Consensus 197 --~~~g~sd~e~-v~~liD~~d~~~s~~~~~~~~~~~p~~~----i~g~~~~--~~~~~~~~~~~~~~~~~~~---~~~~ 264 (429) T protein:vir:98 197 --NEERQSLLAS-VVTLINAFNKAISEKANDVEYFADAYLK----ILGAELD--DETLKSLRDTRIINLKDTD---AQQL 264 (429) T ss_pred --CCCCCCcHHH-HHHHHHHHHHHHHHHHHHHHHhcCceee----eecCCCC--cchhhhHhhCceeeccCCC---CCCc Confidence 2358999986 7899999999999999999887777666 2211111 1122222233443332221 2223 Q ss_pred ceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 347 IFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLR 426 (533) Q Consensus 347 ~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~ 426 (533) .++++++++..+.+...++.+.+.|+..++.+. +++...|..||.|++++++.+.++++++++.|+.+|++++++|+. T Consensus 265 ~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~--~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~ 342 (429) T protein:vir:98 265 TVEFLQKPDADATQEHLLDRLENLIFRTAMVAN--ISDESFGTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIAS 342 (429) T ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCccc--cCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 467888888888888888999888888887764 344444677999999999999999999999999999999999988 Q ss_pred HHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCc Q lcl|NC_016654. 427 VDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPT 506 (533) Q Consensus 427 l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~ 506 (533) +.+.. ........++|.|++.+|.|..+.+++++++ +|+||+||++.+ .|.+++ +++|++||++|+....+. T Consensus 343 ~~~~~---~~~~d~~~i~v~f~~~~p~~~~~~a~~~~kl--~g~is~et~~~~-l~~v~d--~~~E~~ri~~E~~~~~~~ 414 (429) T protein:vir:98 343 YPTSK---IGPKDWIGIKYKFTRNLPANLLEESQIAGNL--AGIVSEETQVGV-LSIVEN--PQKEIERKNSDKSTLISR 414 (429) T ss_pred HhccC---CCccccccceEEeCCCCCcCHHHHHHHHHHH--hccCchHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHHH Confidence 75422 2234456799999999999999999999987 699999999987 465665 678999999998754332 Q ss_pred cccccccCCCCCCCCCCCCCC Q lcl|NC_016654. 507 FGFGTDQPPLPTENDPATDPE 527 (533) Q Consensus 507 ~~~~~~~~~~~~~~~~~~~~~ 527 (533) .+.+. .+++..++.+ T Consensus 415 ~~~~~------~~~~~~~~~~ 429 (429) T protein:vir:98 415 QAGGL------NGQNTTTILE 429 (429) T ss_pred HHhhh------cCCCCCCCCC Confidence 22111 1111111111 No 50 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=100.00 E-value=3.4e-50 Score=291.77 Aligned_cols=437 Identities=13% Similarity=0.107 Sum_probs=279.8 Q ss_pred cchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHh Q lcl|NC_016654. 13 PELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTE 92 (533) Q Consensus 13 ~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~l 92 (533) =.-+.+..-|..+..= .....++.+||.+.+.... + ....+.+........++|+++||++.||++.++| T Consensus 1 l~~~~i~~~i~~~~~~-~~r~~~~~~YY~g~~~i~~-~--------~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~y 70 (451) T protein:vir:10 1 MELEKIRAIISADAAR-RQEILQAKSYYYNKNDILK-K--------GVVVQNRDENPLRNADNRISHNFHEILVDEKASY 70 (451) T ss_pred CCHHHHHHHHHHHHHH-HHHHHHHHHHhcccCcccc-c--------cccccccccccccccccccccchHHHHHHhhhhh Confidence 1112222222222110 0111234444444321100 0 0001112222334567799999999999999999 Q ss_pred hcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCC------CCceEEEEEcCCeEEEE Q lcl|NC_016654. 93 LFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTI------ADNAWIDFVDADRAIPE 166 (533) Q Consensus 93 l~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~------~~~~~i~~v~~~~~~P~ 166 (533) |||+|+++++++ ++..++.|+.+++ |+|...+.++++.++++|.+|+++|+|++. .+++++..++|.+++|+ T Consensus 71 l~G~p~~~~~~~-~~~~~~~~~~~~~-n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~v 148 (451) T protein:vir:10 71 MFTYPVLFDIDN-NKELNEKVTDVLG-NEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIPI 148 (451) T ss_pred eecccceeecCC-cHHHHHHHHHHhc-cCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcccceEEE Confidence 999999998764 3456777887775 789999999999999999999999999753 25788999999999999 Q ss_pred EecC---CceEEEEEEEEeecC-C---ceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCce Q lcl|NC_016654. 167 FRWG---RLVAVTFWSELAGGD-G---QEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGA 239 (533) Q Consensus 167 ~~~g---~~~~v~f~~~~~~~~-~---~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~ 239 (533) |++. ++..++.+......+ + ...++++|.|+...|.+..+.+... .|..+ ... T Consensus 149 ydd~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~-~~~~~-------------------~~~ 208 (451) T protein:vir:10 149 YRNGIERELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSC-CGSQI-------------------EHI 208 (451) T ss_pred EcCCCCCceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCc-ccccc-------------------ccc Confidence 9864 444433222222211 1 2345678888887776533221111 01000 001 Q ss_pred eecCCCcc-ceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCCc Q lcl|NC_016654. 240 YVETGVKD-LTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGMG 318 (533) Q Consensus 240 ~~~~g~~~-~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~ 318 (533) .++++... |++.|..| ..|.|+|.. +++|||++|.++|++++.++.....+.+-..+ +.. T Consensus 209 ~~~~~~g~vPvv~~~nn--------------~~~~~d~e~-v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~----~~~ 269 (451) T protein:vir:10 209 TVQHRFNSVPFVEFSNN--------------IKKQSDLSK-YKKILDLYDRVMSGFANDLEDIQQIIYILENF----GGE 269 (451) T ss_pred cccCCCCeeeEEEeccC--------------CCCCCchhh-HHHHHHHHHHHHHHHHHHHHHhccceeeeecC----Ccc Confidence 12334333 33333321 247889986 88999999999999999998766666662221 111 Q ss_pred cccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHH Q lcl|NC_016654. 319 QGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGK 398 (533) Q Consensus 319 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~ 398 (533) ....+..+...+..+.......+ ....++++++++..+.+...++.+.+.|+..++.+. +++...|..||.||+++ T Consensus 270 ~~~~~~~~~~~~~~i~~~~~~~~--~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~~gn~Sg~Alk~~ 345 (451) T protein:vir:10 270 DTSEFLKELKRYKTIKTETDSEG--DSGGLKTMQIEIPTEARKIILEILKKQIYESGQGLQ--QDTENFGNASGVALKFF 345 (451) T ss_pred cchhhHHHHhhCCeEEecCcCCc--cCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCccc--ccccccccccHHHHHHH Confidence 11111111122233322221111 112367888889999999999999999988887764 34444466899999999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHH Q lcl|NC_016654. 399 KDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVA 478 (533) Q Consensus 399 ~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~ 478 (533) ++.+.++|+.|++.|+++|+++++.|+.+.+ ......+.|.|++++|.|..+.+++++++ +|+||+||+++ T Consensus 346 ~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~-------~~d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~ 416 (451) T protein:vir:10 346 YRKLELKSGLLETEFRTSFDKLIKAILYFLG-------VTDYKKIQQTYTRNMMSNDLEDADIATKS--VGIIPTKIILR 416 (451) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CCCccceeEEecCCCCCCHHHHHHHHHHH--hccCchHHHHH Confidence 9999999999999999999999999987632 22456789999999999999999999998 48999999998 Q ss_pred HhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCC Q lcl|NC_016654. 479 YLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPT 518 (533) Q Consensus 479 ~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~ 518 (533) + +|.+++ +++|+++|++|+..+...... +.++..+ T Consensus 417 ~-~p~v~d--~~~e~~~~~ee~~~~~~~~~~--~~~~~~~ 451 (451) T protein:vir:10 417 H-HPWVDD--VEEAEKLYLEEKKIQASKVSD--DYNNFTE 451 (451) T ss_pred h-CCCCCC--HHHHHHHHHHHHHHHHHHHHh--hcCCCCC Confidence 6 676665 667888888776543322211 1111111 No 51 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=100.00 E-value=5.4e-50 Score=290.70 Aligned_cols=463 Identities=13% Similarity=0.085 Sum_probs=288.8 Q ss_pred CCCCCCcC-CCcCcchHH-HHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceee Q lcl|NC_016654. 1 MSLPEANT-AWPPPELAA-VTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYH 78 (533) Q Consensus 1 ~~~~~~~~-~~pp~~~~~-~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 78 (533) |+.|-|.- +.-|.++.. .+.++..+.. ...+|.+||.+.+..+. .+...+...++.+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~----rl~~l~~Yy~G~~~i~~---------------~~~~~~~~~~~~~~~ 61 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTERTQ----DLGDNTAYYESERRPDA---------------VGVTVPQQMQKLLAH 61 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHHHHH----HHHHHHHHHhccccchh---------------cccccchhHHhhhhh Confidence 88775543 555555433 4444444433 23578888877654210 011111222344678 Q ss_pred cChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCC------Cc Q lcl|NC_016654. 79 APIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIA------DN 152 (533) Q Consensus 79 ~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~------~~ 152 (533) +|||++||+.++++|+.+. |++.+ ++..++.+++++++|+|.....+++..|+++|.+|+++|.|+++. +. T Consensus 62 ~n~~~~ivd~~~~~l~~~g--~~~~~-~~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~ 138 (484) T protein:vir:77 62 VGYPRLYIDAIAARQELEG--FRLGG-ADKADEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEV 138 (484) T ss_pred cCcHHHHHHHHHhhhccCc--eecCC-cchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCccccccccc Confidence 9999999999999998766 54443 345678899999999999999999999999999999999998753 24 Q ss_pred eEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccc Q lcl|NC_016654. 153 AWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEG 232 (533) Q Consensus 153 ~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~ 232 (533) ++|.+++|.+++++|+...-.-+.++..+...+... .+.++.|.++.|.+ ++..+.. |... T Consensus 139 ~~i~~~~p~~~~~~~D~~~~~~~~a~~~~~~~~~~~-~~~~~~y~~~~~~~-~~~~~~~-------------~~~~---- 199 (484) T protein:vir:77 139 PIIRVEPPTNLYAQIDPRTRQVMRAIRAIEDEEGNE-VIGATLYLPNNTVI-WNREDGQ-------------WVQV---- 199 (484) T ss_pred ceEEEeccceeEEEecCCCCceEEEEEEEEeecCCc-EEEEEEEecCeEEE-EEecCCc-------------eEee---- Confidence 689999999999999863222223333333333222 34466677776543 2332210 0000 Q ss_pred cccCCceeecCCCcc-ceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhC-cceeeechH Q lcl|NC_016654. 233 ADEGRGAYVETGVKD-LTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIG-AGKVHASES 310 (533) Q Consensus 233 ~~~~~~~~~~~g~~~-~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~-~~~i~v~~~ 310 (533) ...+++... |++.|+ | +.+. ..++|+|+|...|.+|+|++|+++|++.+..+.. -+..+| T Consensus 200 ------~~~~~~~g~vPvv~f~-N---~~~~-----~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i--- 261 (484) T protein:vir:77 200 ------ANVAHNLEMVPVIPIP-N---RTRL-----SDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLL--- 261 (484) T ss_pred ------ccccCCCCCcceEEec-c---cccc-----CccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHH--- Confidence 012233332 333333 3 2222 2568999999889999999999999999988743 222222 Q ss_pred HhcCCCCccc-cccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc Q lcl|NC_016654. 311 VLTNLGMGQG-VSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA 389 (533) Q Consensus 311 ~l~~~~~~~~-~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~ 389 (533) ........ ...+.....+...........++ ...+.+++ ....+.|++.|+.++++++..+++++.+||..+.++ T Consensus 262 --~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~q~~-~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~ 337 (484) T protein:vir:77 262 --FGVKGEELGVDPETGQTLFDAYLARILAFEDH-ESKAQQFS-AAELRNFVDALDALDRKAAAYTGLPPYYLSFSSENP 337 (484) T ss_pred --hCCCcchhcccccccchhhhhhhhhhcccCCC-CceeEeec-CCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcc Confidence 11000000 00000001111110000011111 11122222 234578999999999999999999999999877788 Q ss_pred hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCC Q lcl|NC_016654. 390 QTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVAS 469 (533) Q Consensus 390 ~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aG 469 (533) .||.||+++++.|.+++++|++.|+++|++++++++.+.+. .........++|.|.++.++|..+.+++++|++++| T Consensus 338 ~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~~~---~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g 414 (484) T protein:vir:77 338 ASAEAIRSSESRLVKTVERKNKIFGGAWEQAMRVAYKVMNG---GDIPPEYYRMESIWRDPSTPTYAAKADAATKLYNNG 414 (484) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC---CCcccccccceEEecCCCCCCHHHHHHHHHHHHhcc Confidence 89999999999999999999999999999999998876431 122234457899999999999999999999999886 Q ss_pred --CCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhccc----Ccc-ccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 470 --AASTKTKVAYLHEDWDDERVQEEADLIDNANTVSA----PTF-GFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 470 --i~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~----~~~-~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ++|.+++++.+ +++++++ +|++++++|+.+.. ..+ +...++++.++..++++ ++...+.+ T Consensus 415 ~gi~s~et~~~~l--~~~~~~~-~e~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 481 (484) T protein:vir:77 415 QGVIPKERARIDM--GYSITER-EEMRKWDEEEQAQGLGLMGTMFGTDPSGGGNPDNPETPE-PQPNPAEE 481 (484) T ss_pred CCCCCHHHHHhcC--CCChhHH-HHHHHHHHHHHHHHHHHHhhhccccccCCCCCCCCCccc-ccCCCccc Confidence 89999988864 4777765 46788877764321 111 11111211111111111 11111112 No 52 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=100.00 E-value=5.9e-49 Score=284.97 Aligned_cols=465 Identities=12% Similarity=0.073 Sum_probs=283.9 Q ss_pred CCCCCCcCC---CcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCccccee Q lcl|NC_016654. 1 MSLPEANTA---WPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRY 77 (533) Q Consensus 1 ~~~~~~~~~---~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 77 (533) |-+|=+|-. =|...+...+++..... .....|.+||.+.+... +.+...+...++.++ T Consensus 1 ~~~~i~~~~~~~~~~~~~~~L~~~~~~~~----~r~~~~~~YY~G~~~i~---------------~~~~~~~~~~~~~~~ 61 (485) T protein:vir:24 1 MTAPLPGQEEIADPAIARDEMVSAFEDQN----QNLRSNTSYYEAERRPE---------------AIGVTVPVQMQSLLA 61 (485) T ss_pred CCCCCCCCCcccchHHHHHHHHHHHHHHH----HHHHHHHHHHhccCchh---------------hcCcccchhhhhhhh Confidence 666655531 11111111222211111 12234455555543211 011111223346678 Q ss_pred ecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCC------CC Q lcl|NC_016654. 78 HAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTI------AD 151 (533) Q Consensus 78 ~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~------~~ 151 (533) ++|||++||+.+|++|+..+ |++++ ++..++.++++++.|+|...+.+++..|+++|.+|+++|.|+++ .+ T Consensus 62 ~~n~~~~ivd~~~~~l~~~g--~~~~~-~~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~ 138 (485) T protein:vir:24 62 HVGYPRLYVDSIAERQAVEG--FRLGD-ADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPN 138 (485) T ss_pred ccchHHHHHHHHhhhhccCc--eecCC-CchhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCC Confidence 89999999999999998776 55543 44567889999999999999999999999999999999998753 35 Q ss_pred ceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccc Q lcl|NC_016654. 152 NAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVE 231 (533) Q Consensus 152 ~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~ 231 (533) .++|.+++|.+++++|++..-.-..|+..+...++. ..+.++.|++..+.+ ++..+ .. -+... T Consensus 139 ~~~i~~~~p~~~~~i~D~~~~~~~~~~~~~~~~~~~-~~~~~~~y~~~~~~~-~~~~~-~~---~~~~~----------- 201 (485) T protein:vir:24 139 VPLIRVEPPTRMYAEIDPRIGRPAKAIRVAYDAEGN-EIQAATLYTPNETFG-WFRAE-GE---WVEWF----------- 201 (485) T ss_pred cceEEEeccceeEEEeeCCcCceeEEEEEEEeecCC-eEEEEEEEcCCcEEE-EEecC-Cc---eEeec----------- Confidence 678999999999999986422222333333333332 234567777775543 22211 11 00000 Q ss_pred ccccCCceeecCCCc-cceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCc-ceeeech Q lcl|NC_016654. 232 GADEGRGAYVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGA-GKVHASE 309 (533) Q Consensus 232 ~~~~~~~~~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~-~~i~v~~ 309 (533) .++++.. +|++.|++| ... ..++|+|++...|.+|+|++|.+.|++++..+... +.+++. T Consensus 202 --------~~~h~~g~vPvv~f~n~----~~~-----~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~- 263 (485) T protein:vir:24 202 --------SDPHGLGAVPVVPLPNR----TRL-----SDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIF- 263 (485) T ss_pred --------ccccCCCcccEEEeccC----ccc-----CCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhc- Confidence 1123333 344444322 222 24689999998889999999999999999887432 222221 Q ss_pred HHhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc Q lcl|NC_016654. 310 SVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA 389 (533) Q Consensus 310 ~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~ 389 (533) .+ .. .............+...........+ ....+.+++ ....+.|++.|+.++++++..+++++..||....++ T Consensus 264 G~-~~--~~~~~~~~~~~~~~~~~~~~i~~~~~-~~~~~~q~~-~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~ 338 (485) T protein:vir:24 264 GI-KP--EEIGVDPETGQTLFDAYLARILAFED-AEGKIQQFS-AAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNP 338 (485) T ss_pred cC-Cc--cccccccccccchhhhcccceeccCC-CCceEEeec-ccchHHHHHHHHHHHHHHhcccCCCHHHhccccCcc Confidence 00 00 00000000001111111111011111 111122222 235689999999999999999999999999777777 Q ss_pred hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCC Q lcl|NC_016654. 390 QTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVAS 469 (533) Q Consensus 390 ~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aG 469 (533) .||.||++++..+.++|++|++.|+.+|+++++.++.+.+. .........++|.|.++.++|..+.++.+.+++++| T Consensus 339 ~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~---~~~~~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g 415 (485) T protein:vir:24 339 ASAEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLMKG---GDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNG 415 (485) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC---CCCccccceeeEEecCCCCCCHHHHHHHHHHHHhcc Confidence 89999999999999999999999999999999998876432 223345578999999999999999999999999876 Q ss_pred --CCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccC----ccccccccC-CCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 470 --AASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAP----TFGFGTDQP-PLPTENDPATDPEAVDEGE 533 (533) Q Consensus 470 --i~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~----~~~~~~~~~-~~~~~~~~~~~~~~~~d~~ 533 (533) ++|+++++.. +| ++++++ +|++++++|+..... .+....... ..+++++...+.+..+-|+ T Consensus 416 ~~~~s~et~~~~-l~-~~~d~~-~e~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~ 483 (485) T protein:vir:24 416 QGVIPRERARKD-MG-YSIAER-EEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPNPTPAPKPQPAIEGGD 483 (485) T ss_pred cccCCHHHHHhh-CC-CCHhHH-HHHHHHHHHHhhhhhhHHHhhcccCCCCCCCCCCCCCCCCccCCCCCC Confidence 7999998875 54 777665 578888777643211 111111111 1112222222222222233 No 53 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=100.00 E-value=1e-49 Score=289.20 Aligned_cols=458 Identities=13% Similarity=0.063 Sum_probs=282.3 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |+-+. +-+..-+..|... ..-..+|.+||.+.+..+ +.+...+...+++++++| T Consensus 1 ~~t~~----------d~i~~L~~~~~~~-~~r~~~~~~Yy~G~~~i~---------------~~~~~~~~~~~~~~~~~n 54 (480) T protein:vir:78 1 MTTYH----------EHVERLQGLLARD-LPNLLEAEAYRNGTRRLK---------------TIGIGAPPELAYLDVQPG 54 (480) T ss_pred CCCHH----------HHHHHHHHHHHHH-HHHHHHHHHHHhccccch---------------hcccccchhhhhhhhhcc Confidence 33321 2222222222221 112235555665543211 011112223346678999 Q ss_pred hHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEc----CCCCCceEEE Q lcl|NC_016654. 81 IPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWD----PTIADNAWID 156 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D----~~~~~~~~i~ 156 (533) ||++||+.+|++|+.+. +.+.+ ++..++.|+++++.|+|...+.+++..++++|.+|+.+|-. .+..++++|. T Consensus 55 ~~~~ivd~~~~~l~~~g--~~~~~-d~~~~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~i~ 131 (480) T protein:vir:78 55 WVATYLRTLSDRLDIEG--FRISE-DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIR 131 (480) T ss_pred hHHHHHHHHHhhhccCc--eecCC-CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeEEE Confidence 99999999999998765 44443 44568889999999999999999999999999999998842 1235679999 Q ss_pred EEcCCeEEEEEecC---CceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccccc Q lcl|NC_016654. 157 FVDADRAIPEFRWG---RLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGA 233 (533) Q Consensus 157 ~v~~~~~~P~~~~g---~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~ 233 (533) +++|.+++|+|++. .++.++. .+...+....++..+.|+++.|.+..+.+.. ..+.. . T Consensus 132 ~~~p~~~~~i~D~~~~~~~~~~i~--~~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~-~~~~~----------~------ 192 (480) T protein:vir:78 132 VESPLYMYAELDPRNTRRVTRAVR--LYTTRDDVAVPDRATLYLPDETVPLRRNGGL-NDQWV----------V------ 192 (480) T ss_pred EEcccceEEEEcCCCccceEEEEE--EEEeecCCcceEEEEEEeCCeEEEEEecCCC-ccccc----------c------ Confidence 99999999999864 3444333 2333333344677888999888763332211 11000 0 Q ss_pred ccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhc Q lcl|NC_016654. 234 DEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLT 313 (533) Q Consensus 234 ~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~ 313 (533) +...++++...+++..++| +.+. ..++|+|++...|++|+|++|.++|++++.++.....+.+ +. T Consensus 193 ---~~~~~~~~~g~vPvv~f~n---~~~~-----~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~----i~ 257 (480) T protein:vir:78 193 ---DGDVIKHGLGVVPVVPLTN---DPRL-----GNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRV----IS 257 (480) T ss_pred ---cccccccCCCCcceEEeec---cccc-----CCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhh----hh Confidence 0112344444333333333 2222 2467999999888899999999999999998743222211 21 Q ss_pred CCCCccccccC-cchhhhhhccccccccccccccceeeechh-hhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchh Q lcl|NC_016654. 314 NLGMGQGVSLD-EEQEVYSRVGSGGFNANGDMETIFEFFQPA-IRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQT 391 (533) Q Consensus 314 ~~~~~~~~~~d-~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~-ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~T 391 (533) ..... ...+ .....+...........++. .++.+++ ...+.|++.++.++++++..+++++..||..+.++.| T Consensus 258 G~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~S 332 (480) T protein:vir:78 258 GVTTD--ELTNDGENTTLDIYYGRILTLASEA---AKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPAS 332 (480) T ss_pred CCCcc--ccccccccchhhhhhhhhccCCCCC---ceEEecCccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhH Confidence 10000 0000 01111211111111111111 1223322 3467899999999999999999999999977777789 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCC-- Q lcl|NC_016654. 392 ATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVAS-- 469 (533) Q Consensus 392 atai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aG-- 469 (533) |.||+++++.|..+|++|++.|+.+|++++++++.+.. +........++|.|.++.++|..+.++.+++++++| T Consensus 333 g~Al~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~~----~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~ 408 (480) T protein:vir:78 333 AEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMG----REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQG 408 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC----CCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhccc Confidence 99999999999999999999999999999998876632 223344567999999999999999999999999877 Q ss_pred CCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhccc-Ccccc-ccccC-CCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 470 AASTKTKVAYLHEDWDDERVQEEADLIDNANTVSA-PTFGF-GTDQP-PLPTENDPATDPEAVDEGE 533 (533) Q Consensus 470 i~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~-~~~~~-~~~~~-~~~~~~~~~~~~~~~~d~~ 533 (533) ++|+++++.. +| ++++++++ +++++++++... +.+.. ..+++ ..++++..+++++.+..+. T Consensus 409 ~~s~et~~~~-lg-~~~d~~~e-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (480) T protein:vir:78 409 PIPKEQARID-LG-YTATQREQ-MRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPS 472 (480) T ss_pred CCCHHHHHhc-CC-CCHhHHHH-HHHHHHHHHHHHHHHhhccccCCCccccCCCCCCCCCccCCCcc Confidence 6899997775 44 78776654 455555443211 11111 11111 1111222222233333332 No 54 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=100.00 E-value=1.1e-48 Score=283.43 Aligned_cols=464 Identities=13% Similarity=0.092 Sum_probs=283.2 Q ss_pred CCCCCCcCCCcCcc--hH-HHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCccccee Q lcl|NC_016654. 1 MSLPEANTAWPPPE--LA-AVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRY 77 (533) Q Consensus 1 ~~~~~~~~~~pp~~--~~-~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 77 (533) |..|=+++.=+-.+ +- ..+.++...+. ...+|.+||.+.+..+ ..+...+...++.++ T Consensus 1 ~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~----r~~~~~~Yy~G~~~i~---------------~~~~~~~~~~~~~~~ 61 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIARDEMVSAFEDSTQ----NLKTNTSYYEAERRPE---------------AIGVTVPIQMQSLLA 61 (485) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHH----HHHHHHHHHhcCCcch---------------hcCCCCChhhhhhhh Confidence 76665554333222 11 12222222211 1234555665543211 001111222235567 Q ss_pred ecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCC------CC Q lcl|NC_016654. 78 HAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTI------AD 151 (533) Q Consensus 78 ~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~------~~ 151 (533) ++|||++||+.+|++|+... |++.+ ++..++.++++++.|+|.....+++..|+++|.+|+.+|.|+.+ .+ T Consensus 62 ~~n~~~~ivd~~~~~l~~~g--~~~~~-~~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~ 138 (485) T protein:vir:10 62 HVGYPRLYVDSIAERQAVEG--FRFGD-ADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPN 138 (485) T ss_pred hcCcHHHHHHHHHhhhcccc--eecCC-CchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCC Confidence 88999999999999997654 55543 44567889999999999999999999999999999999998653 35 Q ss_pred ceEEEEEcCCeEEEEEec--CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccc Q lcl|NC_016654. 152 NAWIDFVDADRAIPEFRW--GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIA 229 (533) Q Consensus 152 ~~~i~~v~~~~~~P~~~~--g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~ 229 (533) .++|.+++|.+++++|++ +++...+ ..+...++ ..++.++.|++..|.+ |....+.. . . T Consensus 139 ~~~i~~~~p~~~~~~~D~~~~~~~~~~--~~~~~~~~-~~~~~~~~y~~~~~~~--~~~~~~~~--~----------~-- 199 (485) T protein:vir:10 139 TPIIRVEPPTRMYAEIDPRIGRVSKAI--RVAYDAEG-NEIQAATLYTPNDIFG--WYRVENEW--Q----------E-- 199 (485) T ss_pred eeEEEEEccceeEEEEcCCCCceeEEE--EEEEeeCC-CeEEEEEEEeCCeEEE--EEEcCCce--E----------E-- Confidence 688999999999999975 3333322 22222222 2345678888887754 22221110 0 0 Q ss_pred ccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeech Q lcl|NC_016654. 230 VEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASE 309 (533) Q Consensus 230 ~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~ 309 (533) . ...+++...++++.++| +.+. ..++|+|++...|.+|+|++|.++|++.+..+....++. T Consensus 200 ---~-----~~~~~~~g~vPvv~~~n---~~~~-----~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~--- 260 (485) T protein:vir:10 200 ---W-----FNNPHGLGVVPVVPIPN---RTRL-----SDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQR--- 260 (485) T ss_pred ---e-----ccccCCCCcccEEEecc---cccc-----CCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHH--- Confidence 0 01234444333333333 2222 256899999988889999999999999998864322221 Q ss_pred HHhcCCCCccccccCcc-hhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCc Q lcl|NC_016654. 310 SVLTNLGMGQGVSLDEE-QEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEV 388 (533) Q Consensus 310 ~~l~~~~~~~~~~~d~~-~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~ 388 (533) ++..........-+.. ...+...........+++ ..+.+++ ....+.|++.++.++++++..+++++..||....+ T Consensus 261 -~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~d-~k~~q~~-~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n 337 (485) T protein:vir:10 261 -LIFGIKPEEIGVDPETGQTLFDAYLARILAFEDAE-GKIQQFS-AAELANFTNALDQIAKQVAAYTGLPPQYLSTAADN 337 (485) T ss_pred -HHhcCCcccccccccccchhhhhcccceeccCCCC-ceEEeec-ccchHHHHHHHHHHHHHHhcccCCCHHHhccccCc Confidence 1211110000000000 011111111111111111 1122222 23467899999999999999999999999977777 Q ss_pred chhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhC Q lcl|NC_016654. 389 AQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVA 468 (533) Q Consensus 389 ~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~a 468 (533) ..||.||++++..+.+++++|++.|+.+|++++++++.+.+. .........+.|.|.++.++|..+.++++.+|+++ T Consensus 338 ~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~~~---~~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~a 414 (485) T protein:vir:10 338 PASAEAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYRMMKG---GDVPPDMLRMETVWRDPSTPTYAAKADAASKLYNG 414 (485) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC---CCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhc Confidence 789999999999999999999999999999999988876432 12233456899999999999999999999999998 Q ss_pred C--CCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhccc----Ccc-ccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 469 S--AASTKTKVAYLHEDWDDERVQEEADLIDNANTVSA----PTF-GFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 469 G--i~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~----~~~-~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) | ++|++++++. + +++++++ +|++++++|+.+.. +.+ ..+...+...+++.....+..+.-|+ T Consensus 415 g~~~~s~et~~~~-l-g~~~~~~-~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 483 (485) T protein:vir:10 415 GTGVIPRERARKD-M-GYSIAER-EEMRRWDEEEAAMGLGLIGTMVDPNPTVPGSPSPAPAPKPAALESGGD 483 (485) T ss_pred cccCCCHHHHHHh-C-CCCHhHH-HHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCCccccccCcCCCCCCC Confidence 7 8999998875 4 4787765 56777776664311 000 11110000111111111222233333 No 55 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=100.00 E-value=4.2e-49 Score=285.82 Aligned_cols=455 Identities=14% Similarity=0.088 Sum_probs=280.8 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |+-|.. .+...+..+...+. ...+|.+||.+.+..+ ..+...+...+++++++| T Consensus 1 ~~t~~~-------~i~~L~~~~~~~~~----r~~~l~~Yy~G~~~i~---------------~~~~~~~~~~~~~~~~~n 54 (480) T protein:vir:78 1 MTTYHE-------HVERLQGLLARDLP----NLLEAEAYRNGTRRLK---------------TIGIGAPPELAYLDVQPG 54 (480) T ss_pred CCCHHH-------HHHHHHHHHHHHHH----HHHHHHHHHhcccccc---------------ccccccchhHhhhhhhcc Confidence 222211 11112222211111 1234556665543211 011112223346678999 Q ss_pred hHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcC----CCCCceEEE Q lcl|NC_016654. 81 IPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDP----TIADNAWID 156 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~----~~~~~~~i~ 156 (533) ||++||+.++++|+.+. +++.+ ++..++.|+++++.|+|...+.+++..++++|.+|+.+|-.+ +..++++|. T Consensus 55 ~~~~ivd~~~~~l~~~g--~~~~~-d~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~i~ 131 (480) T protein:vir:78 55 WVATYLRTLSDRLDIEG--FRISE-DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPLIR 131 (480) T ss_pred hHHHHHHHHHhhhccCc--eecCC-CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeEEE Confidence 99999999999998665 45443 445688899999999999999999999999999999998532 345679999 Q ss_pred EEcCCeEEEEEecC---CceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccccc Q lcl|NC_016654. 157 FVDADRAIPEFRWG---RLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGA 233 (533) Q Consensus 157 ~v~~~~~~P~~~~g---~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~ 233 (533) +++|.+++|+|++. +++.++.+ +...+....++..+.|+++.|.+..+.+.... + +.. T Consensus 132 ~~~p~~~~~~~D~~~~~~~~~~i~~--~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~-~----------~~~------ 192 (480) T protein:vir:78 132 VESPLYMYAELDPRNTRRVTRAVRL--YTTRDDVAVPDRATLYLPDETVPLRRNGGLND-Q----------WVV------ 192 (480) T ss_pred EEcccceEEEEcCCCccceEEEEEE--EEeecCCCceEEEEEEeCCeEEEEEecCCCcc-c----------ccc------ Confidence 99999999999864 44443332 22333333456678899988876443322110 0 000 Q ss_pred ccCCceeecCCCcc-ceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHh-CcceeeechHH Q lcl|NC_016654. 234 DEGRGAYVETGVKD-LTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRI-GAGKVHASESV 311 (533) Q Consensus 234 ~~~~~~~~~~g~~~-~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~-~~~~i~v~~~~ 311 (533) +...++++... |++.|+ | +.+. ..++|+|++...|.+|+|++|+++|++++.++. +.+.++| T Consensus 193 ---~~~~~~~~~g~vPvv~f~-n---~~~~-----~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i---- 256 (480) T protein:vir:78 193 ---DGDVIKHGLGVVPVVPLT-N---DPRL-----GNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVI---- 256 (480) T ss_pred ---ccccccCCCCCcceEEee-c---cccc-----CCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhh---- Confidence 00123444443 333333 2 2222 246899999987889999999999999999874 3333333 Q ss_pred hcCCCCccccccCc--chhhhhhccccccccccccccceeeechh-hhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCc Q lcl|NC_016654. 312 LTNLGMGQGVSLDE--EQEVYSRVGSGGFNANGDMETIFEFFQPA-IRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEV 388 (533) Q Consensus 312 l~~~~~~~~~~~d~--~~~~~~~~~~~~~~~~~~~~~~i~~~~~~-ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~ 388 (533) ..... ..+.. ....+...........++. .++.+++ ...++|++.++.++++++..+++++..||..+.+ T Consensus 257 -~G~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n 329 (480) T protein:vir:78 257 -SGVTT---DELTNDGENTTLDIYYGRILTLASEA---AKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSEN 329 (480) T ss_pred -hcCCc---cccccccccchhhhhhhhhccCCCCC---ceEEecCccCHHHHHHHHHHHHHHHhcccCCChHHhccccCc Confidence 11000 00100 0111111111111111111 1223332 3568999999999999999999999999987777 Q ss_pred chhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhC Q lcl|NC_016654. 389 AQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVA 468 (533) Q Consensus 389 ~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~a 468 (533) +.||.||++++..|..++++|++.|+.+|+++++.|+.+.. +........+.|.|.++.++|..+.++++.+++++ T Consensus 330 ~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~g----~~~~~~~~~i~v~f~~~~~~s~~~~ad~~~kl~~~ 405 (480) T protein:vir:78 330 PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMG----REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYAN 405 (480) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC----CCccccceeeeEEecCCCCCCHHHHHHHHHHHHHh Confidence 78999999999999999999999999999999998877632 22334456789999999999999999999999987 Q ss_pred C--CCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhccc-C-ccccccccC---CCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 469 S--AASTKTKVAYLHEDWDDERVQEEADLIDNANTVSA-P-TFGFGTDQP---PLPTENDPATDPEAVDEGE 533 (533) Q Consensus 469 G--i~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~-~-~~~~~~~~~---~~~~~~~~~~~~~~~~d~~ 533 (533) | ++|+++++.. +| ++++++ +|++++++|++... + ......+++ +.++.++..+..+..+.|- T Consensus 406 g~~~~s~et~~~~-lg-~~~d~~-~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 474 (480) T protein:vir:78 406 GQGPIPKEQARID-LG-YTATQR-EQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGF 474 (480) T ss_pred ccccCCHHHHHhc-CC-CCHhHH-HHHHHHHHHHHHHHHHHhhccccccCCCCCCCCCCCCCCccccccCCC Confidence 7 7999998886 44 777655 44555555543211 1 111111111 1111112212222222222 No 56 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=100.00 E-value=1.6e-48 Score=282.58 Aligned_cols=462 Identities=13% Similarity=0.096 Sum_probs=284.9 Q ss_pred CCCCCCcCCC--cCcchH-HHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCccccee Q lcl|NC_016654. 1 MSLPEANTAW--PPPELA-AVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRY 77 (533) Q Consensus 1 ~~~~~~~~~~--pp~~~~-~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 77 (533) |++|=++..= .|..+- ..++++.... ....+|.+||.+.+.... .+...+...++.++ T Consensus 1 ~~~~~~~~~e~~~~~~~~~~l~~~~~~~~----~r~~~l~~YY~G~~~i~~---------------~~~~~~~~~~~~~~ 61 (486) T protein:vir:42 1 MTAPLPGMEEIEDPAVVREEMISAFEDAS----KDLASNTSYYDAERRPEA---------------IGVTVPREMQQLLA 61 (486) T ss_pred CCCCCCCCCCcccHHHHHHHHHHHHHHHH----HHHHHHHHHhcccCcchh---------------cccccchhHhhhhh Confidence 7777555421 222222 2222222221 122355666666543210 01111111234467 Q ss_pred ecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCC------CC Q lcl|NC_016654. 78 HAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTI------AD 151 (533) Q Consensus 78 ~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~------~~ 151 (533) ++|||++||+.+|++|+.. .|++.+ ++..++.++++++.|+|.....+++..|+++|.+|+.+|.++.+ .+ T Consensus 62 v~n~~~~iVd~~~~~l~~~--g~~~~~-~~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~ 138 (486) T protein:vir:42 62 HVGYPRLYVDSVAERQAVE--GFRLGD-ADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQN 138 (486) T ss_pred ccchHHHHHHHHHhhhccc--ceecCC-CchhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCC Confidence 8899999999999999654 466543 34456779999999999999999999999999999999987633 45 Q ss_pred ceEEEEEcCCeEEEEEecC--CceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccc Q lcl|NC_016654. 152 NAWIDFVDADRAIPEFRWG--RLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIA 229 (533) Q Consensus 152 ~~~i~~v~~~~~~P~~~~g--~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~ 229 (533) .++|..++|.+++++|++. ++.. |++.+...+++ ..+.+++|++..+.+....+ .. . ... T Consensus 139 ~~~i~~~~p~~~~~i~d~~~~~~~~--~~~~~~~~~~~-~~~~~~~y~~~~~~~~~~~~--~~--~-~~~---------- 200 (486) T protein:vir:42 139 VPIIRVEPPTRMHAEIDPRINRVSK--AIRVAYDKEGN-EIQAATLYTPMETIGWFRAD--GE--W-AEW---------- 200 (486) T ss_pred eeEEEEecccceEEEEeCCCCCeEE--EEEEEEecCCC-eEEEEEEEcCCcEEEEEecC--Cc--E-Eee---------- Confidence 6899999999999999853 4433 44333333333 34567888887766533221 11 0 000 Q ss_pred ccccccCCceeecCCCc-cceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeec Q lcl|NC_016654. 230 VEGADEGRGAYVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHAS 308 (533) Q Consensus 230 ~~~~~~~~~~~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~ 308 (533) ..++++.. .|++.|++| .+. ..++|+|++...|.+|+|++|.++|++.+..+....++.+ T Consensus 201 ---------~~~~h~~g~vPvv~~~n~----~~~-----~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~- 261 (486) T protein:vir:42 201 ---------FNVPHGLGVVPVVPLPNR----TRL-----SDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRL- 261 (486) T ss_pred ---------cceecCCCCceEEEeccc----ccc-----CCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHH- Confidence 01223333 333334322 222 2567999999888999999999999999887643222221 Q ss_pred hHHhcCCCCcc-ccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCC Q lcl|NC_016654. 309 ESVLTNLGMGQ-GVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDE 387 (533) Q Consensus 309 ~~~l~~~~~~~-~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~ 387 (533) +....... ..........+...........++. ..+.++ +....+.|++.++.++++++..+++++..||.... T Consensus 262 ---i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~q~-~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~ 336 (486) T protein:vir:42 262 ---IFGIKPEEIGVDSETGQTLFDAYLARILAFEDAE-GKIQQF-SAAELANFTNALDQIAKQVAAYTGLPPQYLSTAAD 336 (486) T ss_pred ---hhcCCccccccccccccchhhhhhchhcccCCCC-ceEEee-cccCHHHHHHHHHHHHHHHhcccCCCHHHhccccC Confidence 21111000 0000000111111111111111111 112222 23456899999999999999999999999998777 Q ss_pred cchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHh Q lcl|NC_016654. 388 VAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSV 467 (533) Q Consensus 388 ~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~ 467 (533) ++.||.||+++++.+.+++++|++.|+.+|++++++++.+.+. .........+.|.|.++.++|..+.++++.+|++ T Consensus 337 n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~---~~~~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~ 413 (486) T protein:vir:42 337 NPASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMKG---GDVPPDMLRMETVWRDPSTPTYAAKADAATKLYG 413 (486) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC---CCccccceeeeEEecCCCCCCHHHHHHHHHHHHh Confidence 7889999999999999999999999999999999998876432 1223344678999999999999999999999998 Q ss_pred C--CCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCc----cccccccCCC-CC-CCCCCCCC---CCCCCC Q lcl|NC_016654. 468 A--SAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPT----FGFGTDQPPL-PT-ENDPATDP---EAVDEG 532 (533) Q Consensus 468 a--Gi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~----~~~~~~~~~~-~~-~~~~~~~~---~~~~d~ 532 (533) + |++|+++++.. +| ++++++ +|++|+++|+...... +......++. .. .+.+.+++ +...+| T Consensus 414 ~~~g~~s~et~~~~-lg-~~~d~~-~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (486) T protein:vir:42 414 NGQGVIPRERARID-MG-YSVKER-EEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPSPTAPPKPQPAIESSGGDA 486 (486) T ss_pred cccCCCCHHHHHhc-CC-CChhHH-HHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCCCCCCCCCcccCCCCCCC Confidence 7 68999998764 54 776654 5888988877432111 1111111111 00 01122222 223333 No 57 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=100.00 E-value=1e-46 Score=272.66 Aligned_cols=437 Identities=12% Similarity=0.077 Sum_probs=265.3 Q ss_pred CCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChH Q lcl|NC_016654. 3 LPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIP 82 (533) Q Consensus 3 ~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~ 82 (533) |=+.-..| +...+..+...+.+| .+|.+||.+.+..+ ..+...+...+++++++||| T Consensus 1 ~~~~~~~~----i~~l~~~~~~~~~r~----~~l~~Yy~G~~~i~---------------~~~~~~~~~~~~~k~~~n~~ 57 (441) T protein:vir:80 1 MNSDELAL----IEGMYDRIQRLSSWH----CCIEGYYEGSNRVR---------------DLGVAIPPELQRVQTVVSWP 57 (441) T ss_pred CCccHHHH----HHHHHHHHHHHHHHH----HHHHHHHhcCCcch---------------hcCcccchhhhhhhhhcchH Confidence 11111111 222222222222222 24556665543221 01111222335678999999 Q ss_pred HHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCe Q lcl|NC_016654. 83 GVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADR 162 (533) Q Consensus 83 k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~ 162 (533) ++||+.+|++|+. ..|++.+ .+.|+++++.|+|...+.+++..++++|.+|+++|.|++ +.++|.+++|.+ T Consensus 58 ~~ivd~~~~~l~~--~g~~~~d-----~~~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~--g~~~i~~~~p~~ 128 (441) T protein:vir:80 58 GIAVDALEERLDW--LGWTNGD-----GYGLDGVYAANRLATASCDVHLDALIFGLSFVAIIPHGD--GTVSVRPQSPKN 128 (441) T ss_pred HHHHHHHHhhhcc--ccccCCC-----hHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeCCC--CceEEEEEccce Confidence 9999999999964 4566543 245888999999999999999999999999999999876 468899999999 Q ss_pred EEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeec Q lcl|NC_016654. 163 AIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVE 242 (533) Q Consensus 163 ~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~ 242 (533) ++|+|++..-....++..+...++. .+..+.|.++.|.+. ...+. |.-+ .+ ..++ T Consensus 129 ~~~i~d~~~~~~~~~~~~~~~~~~~--~~~~~vy~~~~~~~~--~~~~~--~~~~---~~----------------~~~~ 183 (441) T protein:vir:80 129 CTGKFSADGSRLDAGLVVQQTCDPE--VVEAELLLPDVIVQV--ERRGS--REWV---EV----------------DRIP 183 (441) T ss_pred EEEEEeCCCCceeEEEEEEEEecCc--eEEEEEEecCeEEEE--EEcCC--ccee---ec----------------cccc Confidence 9999986432222222222222222 234567777776552 22211 1000 00 0122 Q ss_pred CCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCCccccc Q lcl|NC_016654. 243 TGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGMGQGVS 322 (533) Q Consensus 243 ~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~~~~ 322 (533) ++...++++.++| +... ..++|.|++...|.+|+|++|.++|++.+.++....++.+ +....... T Consensus 184 ~~~g~vPvv~~~n---~~~~-----~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~----i~G~~~~~--- 248 (441) T protein:vir:80 184 NVLGAVPLVPIVN---RRRT-----SRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRW----VTGVSADE--- 248 (441) T ss_pred cCCCceeEEEeec---cccC-----CccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceee----eecCCccc--- Confidence 3333333333333 2222 2568999999889999999999999999998754433333 11101000 Q ss_pred cCcch-hhh-hhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhh Q lcl|NC_016654. 323 LDEEQ-EVY-SRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKD 400 (533) Q Consensus 323 ~d~~~-~~~-~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~ 400 (533) +..+. +.. ..+...+.+..++ ...+.+++ ....+.|++.|+.++++++..+++|+..||..+.+..||.||+++++ T Consensus 249 ~~~~~~~~~~~~i~~~~~~~~~~-~~~~~~~~-~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~ 326 (441) T protein:vir:80 249 FSQPGWVLSMASVWAVDKDDDGD-TPNVGSFP-VNSPTPYSDQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEES 326 (441) T ss_pred cccchhhhcccccccCCCCCCCC-cceeEecC-ccchHHHHHHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHH Confidence 00000 000 0111111111111 11122222 23568899999999999999999999999987777789999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCC--CCHHHHHH Q lcl|NC_016654. 401 LTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASA--ASTKTKVA 478 (533) Q Consensus 401 ~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi--~S~et~v~ 478 (533) .|..++++|++.|+.+|++++++++.+.... +........+++.|++++|+|..+.++++++++++|+ +|+++++. T Consensus 327 ~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~--~~~~~~~~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s~~~~~~ 404 (441) T protein:vir:80 327 RLVKRAERRQTSFGQGWLSVGFLAAKALDSR--VDEADFFGDVGLRWRDASTPTRAATADAVTKLVGAGILPADSRTVLE 404 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CcccccceeeeEEeCCCCCcCHHHHHHHHHHHHhcCcccccHHHHHH Confidence 9999999999999999999999888764321 1222234688999999999999999999999999996 47788776 Q ss_pred HhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCC Q lcl|NC_016654. 479 YLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAV 529 (533) Q Consensus 479 ~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 529 (533) . .+ ++++++++ +++.++|+. +.+...... ..+++... T Consensus 405 ~-l~-~~~~e~~~-~~~e~~e~~---~~~~~~~~~--------~~~~~~~~ 441 (441) T protein:vir:80 405 M-LG-LDDVQVEA-VMRHRAESS---DPLAVLAGA--------ISRQTNEV 441 (441) T ss_pred h-CC-CCHHHHHH-HHHHHHHHH---HHHHHHhhh--------hhcccccC Confidence 4 44 66655543 333333332 111110000 01111111 No 58 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=100.00 E-value=2.1e-46 Score=270.96 Aligned_cols=462 Identities=12% Similarity=0.059 Sum_probs=273.8 Q ss_pred CCCCCCcCCCcCcch-HHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeec Q lcl|NC_016654. 1 MSLPEANTAWPPPEL-AAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHA 79 (533) Q Consensus 1 ~~~~~~~~~~pp~~~-~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 79 (533) |+--+.. +|..+ ...+.++...+.. ...|.+||.+.+... ..+...+...+++++++ T Consensus 1 ~~~~~~~---d~~~~i~~L~~~~~~~~~r----~~~~~~Yy~g~~~i~---------------~~~~~~~~~~~~~~~~~ 58 (488) T protein:vir:23 1 MAETESI---DPEKLRDQLLDAFENKQNE----LKSSKAYYDAERRPD---------------AIGLAVPLDMRKYLAHV 58 (488) T ss_pred CCcccCC---CHHHHHHHHHHHHHHHHHH----HHHHHHHHhcccchh---------------hcCcccchhhhhhhhhc Confidence 4333333 23332 2222333322221 134555555443211 01111122334667899 Q ss_pred ChHHHHHHHHHHhhcCCCceE------ee-CCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcC----- Q lcl|NC_016654. 80 PIPGVIAKLSTTELFSEQLKF------LD-AGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDP----- 147 (533) Q Consensus 80 n~~k~i~~~~a~ll~~e~~~i------~~-~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~----- 147 (533) |||++||+.+|++|+-+...+ .. ...++...+.|+++++.|+|.....+++..++++|.+|+.++.++ T Consensus 59 n~~~~ivd~~a~~l~~~Gf~~~~~~~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~ 138 (488) T protein:vir:23 59 GYPRTYVDAIAERQELEGFRIPSANGEEPESGGENDPASELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDF 138 (488) T ss_pred chHHHHHHHHHHhhhccceeccCCcccccccccchhHHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCccccc Confidence 999999999998775443222 11 123456778899999999999999999999999999999988754 Q ss_pred -CCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccc Q lcl|NC_016654. 148 -TIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATR 226 (533) Q Consensus 148 -~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~ 226 (533) +..+.++|.+++|.+++|+|+...-....+++.+...+... +++.+.|++..|.+ |....+. .. + T Consensus 139 ~~~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~-~~~~~~y~~~~~~~--~~~~~~~--~~--~------- 204 (488) T protein:vir:23 139 DVDPEVPLIRVEPPTALYAEVDPRTRKVLYAIRAIYGADGNE-IVSATLYLPDTTMT--WLRAEGE--WE--A------- 204 (488) T ss_pred CCCCCcceEEEeccceeEEEEecCCCceEEEEEEEEecCCCc-EEEEEEEecCcEEE--EEecCCc--eE--e------- Confidence 22346789999999999999853211222233333333333 34467777776654 2221111 00 0 Q ss_pred cccccccccCCceeecCCCc-cceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCccee Q lcl|NC_016654. 227 DIAVEGADEGRGAYVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKV 305 (533) Q Consensus 227 ~~~~~~~~~~~~~~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i 305 (533) .+ ..+++.. +|++.|++| ... ..+.|+|++...|.+|+|++|+++|++++.++.....+ T Consensus 205 ------~~-----~~~h~~g~vPvv~f~n~----~~~-----~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~ 264 (488) T protein:vir:23 205 ------PT-----STPHGLEMVPVIPISNR----TRL-----SDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQ 264 (488) T ss_pred ------cc-----ccccCCCCcceEEeccc----ccc-----CCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHH Confidence 00 0123332 333444332 221 25679999998889999999999999999887432221 Q ss_pred eechHHhcCCCCcccc-ccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhccc Q lcl|NC_016654. 306 HASESVLTNLGMGQGV-SLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGL 384 (533) Q Consensus 306 ~v~~~~l~~~~~~~~~-~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~ 384 (533) .+ +......... ........+............++...+.+++ ....++|++.|+.++++++..+++++..||. T Consensus 265 ~~----i~G~~~~~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~q~~-~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~ 339 (488) T protein:vir:23 265 RL----IFGAKPEELGINAETGQRMFDAYMARILAFEGGEGAHAEQFS-AAELRNFVDALDALDRKAASYSGLPPQYLSS 339 (488) T ss_pred HH----HhCCCcccccccccccchhhhhhhhhhccCCCCCCceeEecC-CCChHHHHHHHHHHHHHHhcccCCCHHHhcc Confidence 11 2111100000 0000011121111111111112222333333 3467899999999999999999999999998 Q ss_pred CCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHH Q lcl|NC_016654. 385 SDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQA 464 (533) Q Consensus 385 ~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~ 464 (533) ...++.||.||+++++.+.+++++|++.|+.+|++++++++.+... .........+.+.|.++.++|..+.++++.+ T Consensus 340 ~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~---~~~~~~~~~i~v~f~~~~~~s~~~~ada~~k 416 (488) T protein:vir:23 340 SSDNPASAEAIKAAESRLVKKVERKNKIFGGAWEQAMRLAYKMVKG---GDIPTEYYRMETVWRDPSTPTYAAKADAAAK 416 (488) T ss_pred ccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC---CCcchhhccceEEecCCCCCCHHHHHHHHHH Confidence 7777889999999999999999999999999999999999876432 1122344679999999999999999999999 Q ss_pred HHhCC--CCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcc----cCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 465 WSVAS--AASTKTKVAYLHEDWDDERVQEEADLIDNANTVS----APTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 465 l~~aG--i~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ++++| ++|+|+++.+ +| +++++. +|++++++++.+. ...+.... .+....++. .....+|.| T Consensus 417 l~~~g~~~~s~et~~~~-l~-~~~d~~-~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~--~~~~~~~~e 484 (488) T protein:vir:23 417 LFANGAGLIPRERGWVD-MG-YTIVER-EQMRQWLEQDQKQGLGLIGSLYGAS--TPEGKPGEA--PVGEPPAPE 484 (488) T ss_pred HHhcccccCCHHHHHHh-CC-CCchHH-HHHHHHHHHHHHHHHHHHHHHhccC--CCcccCCCC--CCCCCCCCC Confidence 99977 7999998876 45 555443 4556554443211 11110000 000111111 111122222 No 59 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=100.00 E-value=5.9e-46 Score=268.55 Aligned_cols=448 Identities=11% Similarity=0.031 Sum_probs=267.6 Q ss_pred CCCCCCcCCCcCcch-HHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeec Q lcl|NC_016654. 1 MSLPEANTAWPPPEL-AAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHA 79 (533) Q Consensus 1 ~~~~~~~~~~pp~~~-~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 79 (533) |- +==|.++ ...+.++...+.. ..+|.+||.+.+... ......+...+..++++++ T Consensus 1 ~~------~~t~~~~~~~l~~~~~~~~~r----~~~l~~Yy~g~~~i~-------------~~~~~~~~~~~~~~~~~~~ 57 (456) T protein:vir:79 1 MT------ASTPAEWLPVLTKRIDDGMSR----VRLLARYSNGDAPLP-------------ELTRNTSAAWRSFQREART 57 (456) T ss_pred CC------CCCHHHHHHHHHHHHHHHHHH----HHHHHHHHhccCChh-------------hcCcccChhhchhhhhhhc Confidence 11 1111111 1122222222111 234455555433211 0001112222233456789 Q ss_pred ChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEc Q lcl|NC_016654. 80 PIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVD 159 (533) Q Consensus 80 n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~ 159 (533) |||++||+.+|++|+++|.++...++ ...++.+++++++|+|.....+++..++++|.+|+++|.|+++ .++|.+++ T Consensus 58 n~~~~ivd~~~~~l~~~g~~~~~~~d-~~~~~~~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~edg--~~~i~~~~ 134 (456) T protein:vir:79 58 NWGLMVRDSVADRIIPNGITVGGSAD-SDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDG--TATITADS 134 (456) T ss_pred chHHHHHHHHHhhhccCCeecCCCCC-ccHHHHHHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeCCCC--ceEEEEec Confidence 99999999999999999987765433 3456789999999999999999999999999999999998764 57899999 Q ss_pred CCeEEEEEecCCceEEE-EEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhh-ccccccccccccccCC Q lcl|NC_016654. 160 ADRAIPEFRWGRLVAVT-FWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTD-HPATRDIAVEGADEGR 237 (533) Q Consensus 160 ~~~~~P~~~~g~~~~v~-f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~-~~~~~~~~~~~~~~~~ 237 (533) |.+++++|++.....+. +++.+...++...+ ...+...+.++|............ ..+.. .+.+... T Consensus 135 p~~~~~i~d~~~~~~~~~~~~~~~~~d~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~--------- 203 (456) T protein:vir:79 135 PETMVVSVDPLQPWRIRSAMRWWRDLDAESDF-AIVWSGDGWQKFARPCFVQSSSRR-RLVTRISDSWVPV--------- 203 (456) T ss_pred cceeEEEEcCCCCCceEEEEEEEEecCCceeE-EEEEcCCceEEEEEEEEeeccccc-eeeeccCCceeec--------- Confidence 99999999875433332 22333333333222 233344444443322111111000 00000 0000000 Q ss_pred ceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCccee-eech---HHhc Q lcl|NC_016654. 238 GAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKV-HASE---SVLT 313 (533) Q Consensus 238 ~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i-~v~~---~~l~ 313 (533) ..+.++...+++.+.+| +.|.|+|.+ +.+|+|++|+++|+.+++.+....++ ++.- .+.. T Consensus 204 -~~~~~~~~~~pvv~~~N--------------~~~~gd~e~-v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~ 267 (456) T protein:vir:79 204 -GDAVVTGSPPPVVVYQN--------------PDGMGEVEP-HIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPK 267 (456) T ss_pred -ccccCCCCceeEEEecC--------------CCCCchhhh-hHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCccccc Confidence 01123333233322222 347889987 66999999999999988876422111 1110 0000 Q ss_pred CCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHH Q lcl|NC_016654. 314 NLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTAT 393 (533) Q Consensus 314 ~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tat 393 (533) .+..|+. .+. ...+...........++. .+.+++ +...+.|.+.++.++++++..+++++..||...++ .||. T Consensus 268 ~d~~g~~--i~~-~~~~~~~~~~~~~~~~~~--~~~q~~-~~~~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~N-~Sg~ 340 (456) T protein:vir:79 268 VDENGNA--IDY-ASIFEAAPGALWELPPGV--DIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAE 340 (456) T ss_pred ccccccc--cch-hhhhhhhccccccCCCCc--ceeeec-ccChHHHHHHHHHHHHHHHhhcCCChhHhcccccC-cHHH Confidence 0111111 110 011111111111111111 122332 34567899999999999999999999999966555 4999 Q ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCH Q lcl|NC_016654. 394 EASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAAST 473 (533) Q Consensus 394 ai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~ 473 (533) ||++++..+.++++.|++.|+++|++++++++.+. | ......+.|.|.++.++|..+.|+++++++++|++|. T Consensus 341 Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~-----g--~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~G~~~~ 413 (456) T protein:vir:79 341 GAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIE-----G--ESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWA 413 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----C--CCccccceEEeCCCCCcCHHHHHHHHHHHHhcCCChH Confidence 99999999999999999999999999999887653 2 2344679999999999999999999999999999999 Q ss_pred HHHHHHhCCCCCHHHH-HHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCC Q lcl|NC_016654. 474 KTKVAYLHEDWDDERV-QEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDP 526 (533) Q Consensus 474 et~v~~l~~~~~dee~-~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (533) ++++.. ++++++++ ++|++|+++|..+..... ... .++++.- T Consensus 414 ~~~~~~--lg~~~~~i~~~e~~r~~~e~~~~~~~~------~~~---~~~~~~~ 456 (456) T protein:vir:79 414 SIRRNI--LNYNADQIKQDDLDRAREQITLFAGNP------VQR---PQEDGSR 456 (456) T ss_pred HHHHhc--CCCCHHHHHHHHHHHHHHHHHHHhhhH------hhc---CCCCCCC Confidence 998764 36777764 468888888875432111 000 1111111 No 60 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=100.00 E-value=2.7e-45 Score=264.93 Aligned_cols=478 Identities=12% Similarity=0.051 Sum_probs=273.8 Q ss_pred CC--------CCCCcCCCcCcchHHHH--HHHHhhhHhhcCCH---HHHHHHHhccCcchhhHHHHHHHHHHHHHhcccC Q lcl|NC_016654. 1 MS--------LPEANTAWPPPELAAVT--ARVAESHVWWEGDL---DKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTP 67 (533) Q Consensus 1 ~~--------~~~~~~~~pp~~~~~~~--~~~~~~~~w~~gd~---~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~ 67 (533) |- =|+.+..||=..+.+.. .-+..+-.-+.+.. .+|.+||.+.+..+ ....+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY~G~~~~~-------------~~~~~~~ 67 (501) T protein:vir:25 1 MTVPVDVIADAPAADVEFPEDSMSREQLGALVADMWRLHISERQWLDRIYEYTKGLRGRP-------------EVPEGAS 67 (501) T ss_pred CcccchhhhccCcccccCCcccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCch-------------hccccCC Confidence 33 25566677744433311 11111111111222 23444555443211 0011111 Q ss_pred CCCCcccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcC Q lcl|NC_016654. 68 TATGRAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDP 147 (533) Q Consensus 68 ~~~g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~ 147 (533) ......+++.++|||++||+.+|++|+.+. |++.+.+ .++.+.++++.|+|.....++++.++++|.+|+.+|.|+ T Consensus 68 ~~~~~~~~~~v~n~~~~ivd~~a~~l~~~g--f~~~d~~--~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de 143 (501) T protein:vir:25 68 DEVKELAKLSVKNVLSLVRDSFAQNLSVVG--YRNALAK--ENDPAWEMWQRNRMDARQAEVHRPALTYGASYVTVTPTD 143 (501) T ss_pred hhhhhhHhhhhcChHHHHHHHHHhhhcccc--eecCCcc--chHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCC Confidence 111122446788999999999999997554 6664332 456789999999999999999999999999999999987 Q ss_pred CCCCceEEEEEcCCeEEEEEecCC----ceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcc Q lcl|NC_016654. 148 TIADNAWIDFVDADRAIPEFRWGR----LVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHP 223 (533) Q Consensus 148 ~~~~~~~i~~v~~~~~~P~~~~g~----~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~ 223 (533) ++ ++|.+++|.+++++|++.. +..++++.......+. .+..+.|.+..|.+ |+.+.......... ... T Consensus 144 ~~---~~i~~~sp~~~~~iy~D~~~~~~~~~ai~~~~~~~~~~~--~~~~~~y~~~~~~~--~~~~~~~~~~~~~~-~~~ 215 (501) T protein:vir:25 144 EG---PVFRTRSPRQILAVYADPSVDAWPQYALETWVAQKDAKP--HRRGVLYDDTYMYE--LDLGEVVLGDAGGG-QAT 215 (501) T ss_pred CC---CeEEEeccccEEEEEecCCCCcceeEEEEEEeeccccCc--ceeEEEecCeeEEE--EecCceeeeecccc-ccc Confidence 54 5788999999999997543 3334443322222221 23345555554422 22211100000000 000 Q ss_pred ccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcc Q lcl|NC_016654. 224 ATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAG 303 (533) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~ 303 (533) ........+.+..+....+++....++...+| +.. ..++|+|+|.. +.+|+|++|++.|++.+..+.... T Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N---~~~------~~~~g~sdie~-v~~l~Da~~~~~s~~~~~~e~~a~ 285 (501) T protein:vir:25 216 QQPVNVREVTDVIEHGATFEGKPVCPVVRFVN---GRD------ADDMIVGEVAP-LILLQQAINSVNFDRLIVSRFGAN 285 (501) T ss_pred cccccccccccccccccccCCccceeeEeccC---ccc------cCccccchhhh-hHHHHHHHHHHHHHHHHHHHhhcc Confidence 00000001111111111223322222222222 111 24579999986 779999999999999988875432 Q ss_pred eeeechHHhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcc Q lcl|NC_016654. 304 KVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLG 383 (533) Q Consensus 304 ~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g 383 (533) |..++....... ++.. ..+...... . .+++ ..+.++ +....+.|.+.++.++++|+..+++|+.+|| T Consensus 286 ----p~~~i~G~~~~~---~~~~-~~~~~~i~~-~-~~~~--~~~~q~-~~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~ 352 (501) T protein:vir:25 286 ----PQRVISGWTGSK---AEVL-KASALRVWT-F-EDPE--VKAQAF-PPASVEPYNLILEEMLQHVAMVAQISPAQVT 352 (501) T ss_pred ----HHHHHhCCCCCc---cchh-hhcccceec-c-CCCC--ceEEEe-cccChHHHHHHHHHHHHHHHhhcCCChhhhc Confidence 223332211111 1111 111111111 1 1111 122222 2334578999999999999999999999999 Q ss_pred cCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHH Q lcl|NC_016654. 384 LSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQ 463 (533) Q Consensus 384 ~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~ 463 (533) ...++ .||.||++++..|.+++.+|++.|+.+|++++++++.+.. +........+++.|.++.|+|..+.+++++ T Consensus 353 ~~~~N-~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~----~~~~~~~~~i~v~w~~~~~~s~~~~ada~~ 427 (501) T protein:vir:25 353 GKMIN-VSAEALAAAEANQQRKLAAKRESFGESWEQLLRLAAEMDD----DPDTAADSGAEVLWRDTEARSFGAVVDGIT 427 (501) T ss_pred cccCC-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC----CCccccceeeeEEecCCCCCCHHHHHHHHH Confidence 65554 5999999999999999999999999999999999887642 233345568999999999999999999999 Q ss_pred HHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhccc-C-ccc-cccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 464 AWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSA-P-TFG-FGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 464 ~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~-~-~~~-~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +++++|+ |.++.+.+ .+++++++++++.++.+++.+... . ... ......+...+...+.+++++.+++ T Consensus 428 kl~~~gi-s~et~~~~-~~g~~~~~ie~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (501) T protein:vir:25 428 KLASAGI-PIEHLLSM-VPGMTQQTIQAIKDSLRGGEVKSLVDKLLSNEPAPVPPPPPQAAAQALNEGGVNGN 498 (501) T ss_pred HHHhcCC-CHHHHHHH-cCCCCHHHHHHHHHHHHHHhHHHHHHHhhccCcCCCCCCCCCCCccccccccCCCC Confidence 9999886 99998776 567998887665554444432110 0 111 1011111111222222333333333 No 61 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=100.00 E-value=1.7e-45 Score=266.04 Aligned_cols=448 Identities=11% Similarity=0.036 Sum_probs=268.5 Q ss_pred cCCCcCcchHHH-HHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHH Q lcl|NC_016654. 7 NTAWPPPELAAV-TARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVI 85 (533) Q Consensus 7 ~~~~pp~~~~~~-~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i 85 (533) -++=.|.++... +.++...+ .....|.+||.+.+... .+....+...+..++|+++|||++| T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~----~r~~~l~~Yy~g~~~i~-------------~~~~~~~~~~~~~~~k~~~n~~~~i 63 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGM----SRVRLLARYSNGDAPLP-------------ELTRNTSAAWRSFQREARTNWGLMV 63 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHH----HHHHHHHHHHhcCCCch-------------hcCcccChhhhhhhhhhhcchHHHH Confidence 223334443322 12211111 11234555555543210 0001111122233568999999999 Q ss_pred HHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEE Q lcl|NC_016654. 86 AKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIP 165 (533) Q Consensus 86 ~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P 165 (533) |+.++++|+++|.++...++ .+..+.++++++.|+|.....+++..++++|.+|+.+|.|++ +.++|.+++|.++++ T Consensus 64 vd~~~~~l~~~~~~~~~~~d-~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~--g~~~i~~~~p~~~~~ 140 (456) T protein:vir:10 64 RDSVADRIIPNGITVGGSAD-SDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDD--GTATITADSPETMVV 140 (456) T ss_pred HHHHHhhhccCCeecCCCCC-cchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCC--CceEEEEEccceeEE Confidence 99999999999987765433 345677999999999999999999999999999999998875 468899999999999 Q ss_pred EEecCCceEE-EEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcc-cceeehhhccccccccccccccCCceeecC Q lcl|NC_016654. 166 EFRWGRLVAV-TFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSL-GWMMALTDHPATRDIAVEGADEGRGAYVET 243 (533) Q Consensus 166 ~~~~g~~~~v-~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~l-G~~v~l~~~~~~~~~~~~~~~~~~~~~~~~ 243 (533) +|++..-..+ .++..+...++...|. .++...+..++.......... ...... ....+ +... ...+ T Consensus 141 i~d~~~~~~~~~~i~~~~~~d~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-----~~~~-----~~~~ 208 (456) T protein:vir:10 141 SVDPLQPWRIRAAMRWWRDLDAESDFA-IVWSGDGWQKFARPCFVQSSSRRRLVTR-ISDSW-----VPVG-----DAVV 208 (456) T ss_pred EEcCCCCcceEEEEEEEEecCCceeEE-EEEeccceeEEEEEEEEeecccceeeee-cCCce-----eecc-----ccCC Confidence 9986422222 1222223334333332 233344433332211100000 000000 00000 0000 0112 Q ss_pred CCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCccee-eec---hHHhcCCCCcc Q lcl|NC_016654. 244 GVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKV-HAS---ESVLTNLGMGQ 319 (533) Q Consensus 244 g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i-~v~---~~~l~~~~~~~ 319 (533) +...+.+.+.+| +.|.|+|.+ +.+++|++|.+.|+.++..+....++ ++. ......+..+. T Consensus 209 ~~~~~pvv~~~N--------------~~g~gd~e~-vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~ 273 (456) T protein:vir:10 209 TGSPPPVVVYQN--------------PDGMGEVEP-HIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGN 273 (456) T ss_pred CCCceeEEEecC--------------CCCCchhhh-hHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccccccccccc Confidence 222222233222 358899987 66999999999999988876432211 110 00000011111 Q ss_pred ccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHh Q lcl|NC_016654. 320 GVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKK 399 (533) Q Consensus 320 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~ 399 (533) . .+. ...+............++ .+.+++ ....+.|.+.++.++++++..+++++..||..+++ .||.||++++ T Consensus 274 ~--~~~-~~~~~~~~~~~~~~~~~~--~~~q~~-~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N-~Sg~Ai~~~~ 346 (456) T protein:vir:10 274 A--IDY-ASIFEAAPGALWELPPGV--DIWESQ-ANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIE 346 (456) T ss_pred c--cch-hhhhhhhccccccCCCCc--ceEEec-ccChhHHHHHHHHHHHHHHhccCCChHHhcccccC-hHHHHHHHHH Confidence 1 111 111211111111111122 133333 34567899999999999999999999999976655 4999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHH Q lcl|NC_016654. 400 DLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAY 479 (533) Q Consensus 400 ~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~ 479 (533) ..+.+++.+|++.|+++|++++++++.+. | ......++|.|.++.|+|..+.++++++++++|++|.+++... T Consensus 347 ~~l~~k~~~~~~~f~~~l~~~~rl~~~~~-----g--~~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~ 419 (456) T protein:vir:10 347 KGFLFKCEDRLSIAKIGLEAILVKALQIE-----G--ESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNI 419 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----C--CCcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhh Confidence 99999999999999999999999987653 2 2345679999999999999999999999999999999997764 Q ss_pred hCCCCCHHHHH-HHHHHHHHhhhcccCccccccccCCCCCCCCCCCCC Q lcl|NC_016654. 480 LHEDWDDERVQ-EEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDP 526 (533) Q Consensus 480 l~~~~~dee~~-~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (533) +++++++++ +|++|+++|+.+..... ...+ ++++.. T Consensus 420 --lg~~~~~i~~~e~er~~~e~~~~~~~~------~~~~---~~~~~~ 456 (456) T protein:vir:10 420 --LNYNADQIKQDDLDRAREQITLFAGNP------VQRP---QEDGSR 456 (456) T ss_pred --CCCCHHHHHHHHHHHHHHHHHHHhhhh------hhcC---CCCCCC Confidence 358877654 58889888875432110 0000 111111 No 62 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=100.00 E-value=1.7e-45 Score=266.04 Aligned_cols=448 Identities=11% Similarity=0.036 Sum_probs=268.5 Q ss_pred cCCCcCcchHHH-HHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHH Q lcl|NC_016654. 7 NTAWPPPELAAV-TARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVI 85 (533) Q Consensus 7 ~~~~pp~~~~~~-~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i 85 (533) -++=.|.++... +.++...+ .....|.+||.+.+... .+....+...+..++|+++|||++| T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~----~r~~~l~~Yy~g~~~i~-------------~~~~~~~~~~~~~~~k~~~n~~~~i 63 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGM----SRVRLLARYSNGDAPLP-------------ELTRNTSAAWRSFQREARTNWGLMV 63 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHH----HHHHHHHHHHhcCCCch-------------hcCcccChhhhhhhhhhhcchHHHH Confidence 223334443322 12211111 11234555555543210 0001111122233568999999999 Q ss_pred HHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEE Q lcl|NC_016654. 86 AKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIP 165 (533) Q Consensus 86 ~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P 165 (533) |+.++++|+++|.++...++ .+..+.++++++.|+|.....+++..++++|.+|+.+|.|++ +.++|.+++|.++++ T Consensus 64 vd~~~~~l~~~~~~~~~~~d-~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~--g~~~i~~~~p~~~~~ 140 (456) T protein:vir:10 64 RDSVADRIIPNGITVGGSAD-SDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDD--GTATITADSPETMVV 140 (456) T ss_pred HHHHHhhhccCCeecCCCCC-cchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCC--CceEEEEEccceeEE Confidence 99999999999987765433 345677999999999999999999999999999999998875 468899999999999 Q ss_pred EEecCCceEE-EEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcc-cceeehhhccccccccccccccCCceeecC Q lcl|NC_016654. 166 EFRWGRLVAV-TFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSL-GWMMALTDHPATRDIAVEGADEGRGAYVET 243 (533) Q Consensus 166 ~~~~g~~~~v-~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~l-G~~v~l~~~~~~~~~~~~~~~~~~~~~~~~ 243 (533) +|++..-..+ .++..+...++...|. .++...+..++.......... ...... ....+ +... ...+ T Consensus 141 i~d~~~~~~~~~~i~~~~~~d~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-----~~~~-----~~~~ 208 (456) T protein:vir:10 141 SVDPLQPWRIRAAMRWWRDLDAESDFA-IVWSGDGWQKFARPCFVQSSSRRRLVTR-ISDSW-----VPVG-----DAVV 208 (456) T ss_pred EEcCCCCcceEEEEEEEEecCCceeEE-EEEeccceeEEEEEEEEeecccceeeee-cCCce-----eecc-----ccCC Confidence 9986422222 1222223334333332 233344433332211100000 000000 00000 0000 0112 Q ss_pred CCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCccee-eec---hHHhcCCCCcc Q lcl|NC_016654. 244 GVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKV-HAS---ESVLTNLGMGQ 319 (533) Q Consensus 244 g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i-~v~---~~~l~~~~~~~ 319 (533) +...+.+.+.+| +.|.|+|.+ +.+++|++|.+.|+.++..+....++ ++. ......+..+. T Consensus 209 ~~~~~pvv~~~N--------------~~g~gd~e~-vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~ 273 (456) T protein:vir:10 209 TGSPPPVVVYQN--------------PDGMGEVEP-HIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGN 273 (456) T ss_pred CCCceeEEEecC--------------CCCCchhhh-hHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccccccccccc Confidence 222222233222 358899987 66999999999999988876432211 110 00000011111 Q ss_pred ccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHh Q lcl|NC_016654. 320 GVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKK 399 (533) Q Consensus 320 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~ 399 (533) . .+. ...+............++ .+.+++ ....+.|.+.++.++++++..+++++..||..+++ .||.||++++ T Consensus 274 ~--~~~-~~~~~~~~~~~~~~~~~~--~~~q~~-~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N-~Sg~Ai~~~~ 346 (456) T protein:vir:10 274 A--IDY-ASIFEAAPGALWELPPGV--DIWESQ-ANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIE 346 (456) T ss_pred c--cch-hhhhhhhccccccCCCCc--ceEEec-ccChhHHHHHHHHHHHHHHhccCCChHHhcccccC-hHHHHHHHHH Confidence 1 111 111211111111111122 133333 34567899999999999999999999999976655 4999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHH Q lcl|NC_016654. 400 DLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAY 479 (533) Q Consensus 400 ~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~ 479 (533) ..+.+++.+|++.|+++|++++++++.+. | ......++|.|.++.|+|..+.++++++++++|++|.+++... T Consensus 347 ~~l~~k~~~~~~~f~~~l~~~~rl~~~~~-----g--~~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~ 419 (456) T protein:vir:10 347 KGFLFKCEDRLSIAKIGLEAILVKALQIE-----G--ESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNI 419 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----C--CCcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhh Confidence 99999999999999999999999987653 2 2345679999999999999999999999999999999997764 Q ss_pred hCCCCCHHHHH-HHHHHHHHhhhcccCccccccccCCCCCCCCCCCCC Q lcl|NC_016654. 480 LHEDWDDERVQ-EEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDP 526 (533) Q Consensus 480 l~~~~~dee~~-~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (533) +++++++++ +|++|+++|+.+..... ...+ ++++.. T Consensus 420 --lg~~~~~i~~~e~er~~~e~~~~~~~~------~~~~---~~~~~~ 456 (456) T protein:vir:10 420 --LNYNADQIKQDDLDRAREQITLFAGNP------VQRP---QEDGSR 456 (456) T ss_pred --CCCCHHHHHHHHHHHHHHHHHHHhhhh------hhcC---CCCCCC Confidence 358877654 58889888875432110 0000 111111 No 63 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=100.00 E-value=1.1e-44 Score=261.60 Aligned_cols=467 Identities=10% Similarity=-0.027 Sum_probs=283.2 Q ss_pred CCCCCCcCCCcC-------cchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcc Q lcl|NC_016654. 1 MSLPEANTAWPP-------PELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRA 73 (533) Q Consensus 1 ~~~~~~~~~~pp-------~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 73 (533) -|+=...+.+|. .+.+-+..-+..|.... ....+|.+||.+.+..+. .+...+...+ T Consensus 2 ~~~~~~~~~~~~~~~~l~~~e~~~i~~L~~~~~~~~-~r~~~l~~YY~G~~~i~~---------------~~~~~p~~~~ 65 (504) T protein:vir:99 2 TEETTSASKFTFRIPELNDDVVDKVNGLYQQLVDRT-PRNLLRASFYDGKYAIRQ---------------IGNLIPPEYL 65 (504) T ss_pred CccCCcccccccccCCCCHHHHHHHHHHHHHHHHHh-HHHHHHHHHHhccccchh---------------ccccccHHHH Confidence 223344444543 33333333333343322 234456666665543210 0011111123 Q ss_pred cceeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCce Q lcl|NC_016654. 74 PKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNA 153 (533) Q Consensus 74 ~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~ 153 (533) +.+.++|||++||+.+|++|+-+. |.+.+ ++..++.|+++++.|+|.....+++..++++|.+|+.+|-++++.+.+ T Consensus 66 ~~~~v~n~~~~iVd~~a~rl~~~G--f~~~d-~~~~~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~~ 142 (504) T protein:vir:99 66 RTATVLGWSAKAVDTLARRCNLES--FVWPD-GDYGSIGGPDVWDENFFATKANNAMVSSLIHGPAFLINTEGGAGEPDS 142 (504) T ss_pred HHhhccCcHHHHHHHHHhhhccce--eeCCC-CChhhHHHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecCCCCCcee Confidence 446789999999999999987665 55543 344567899999999999999999999999999999999998888889 Q ss_pred EEEEEcCCeEEEEEec--CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccc Q lcl|NC_016654. 154 WIDFVDADRAIPEFRW--GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVE 231 (533) Q Consensus 154 ~i~~v~~~~~~P~~~~--g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~ 231 (533) .|++++|.+++.+|++ +++..++++... +.+..++.++.|.++.|.+..+.+. |. |. .+ T Consensus 143 ~I~~~sP~~~~~iyD~~~~~~~~a~~~~~~---d~~g~~~~~~~y~~~~~~~~~~~~~----~~---------~~--~~- 203 (504) T protein:vir:99 143 LIHVKSAMQATGEWNSRRNAMDSLLSITSR---DAEGHPTGIALYEDGVTVTADMDDD----GD---------WH--AD- 203 (504) T ss_pred EEEEeccceeEEEEeCCCCceeEEEEEEEe---cCCCeEEEEEEEcCCcEEEEEEcCC----ce---------ee--ec- Confidence 9999999999999975 344444443332 2233456678899988866333211 11 00 00 Q ss_pred ccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhC-cceeee--- Q lcl|NC_016654. 232 GADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIG-AGKVHA--- 307 (533) Q Consensus 232 ~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~-~~~i~v--- 307 (533) ..+++...|.+.|+. +.+. ..++|+|.+...|.+|+|++|+++++.+...+.. -+..+| T Consensus 204 --------~~~~~~gvPvV~~~n----~~~~-----~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~G~ 266 (504) T protein:vir:99 204 --------VRTHKLGVPVEVLPY----KPRE-----DRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILLGA 266 (504) T ss_pred --------cccCCCCcceEEecc----cccC-----ccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhccC Confidence 011222223333332 2222 2568999998778899999999999999887642 222222 Q ss_pred -chHHhcCCCCcccc-ccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhh Q lcl|NC_016654. 308 -SESVLTNLGMGQGV-SLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVS 381 (533) Q Consensus 308 -~~~~l~~~~~~~~~-~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~ 381 (533) ++.... .+++.. .++. ....+...+.+..+ +.+..+.+++ ....+.|++.|+.++++++..+++|+++ T Consensus 267 ~~~~~~~--~d~~~~~~~~~---~~~~i~~~~~~~~~~~~~~~~~~~~q~~-~~~l~~~~~~l~~~i~~~a~~t~~P~~~ 340 (504) T protein:vir:99 267 DAKNFRN--KDGSMKPAWQI---ALARVFALPDDEDEPDAARARADVKQFP-ASSPQPHIEMLEQIAMMFSGETSIPVES 340 (504) T ss_pred Ccccccc--ccccccchhhh---hhhhhhcCCCccccccccCccceeeecC-CCChHHHHHHHHHHHHHHHhhhCCCHHH Confidence 111100 111111 0110 01111111111111 1122233332 2345689999999999999999999999 Q ss_pred cccCC-CcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHH Q lcl|NC_016654. 382 LGLSD-EVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQ 460 (533) Q Consensus 382 ~g~~~-~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~ 460 (533) ||+.+ .++.||.||++++..|..++.+|++.|+.+|++++++++.+.... +........+++.|.++.+++..+.++ T Consensus 341 lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~--~~~~~~~~~~~v~w~d~~~~s~a~~aD 418 (504) T protein:vir:99 341 LGFSNRANPTSADAYIASREDLIAEAEGATDDWSPAFRRSMIRALAIKNGL--DRIPPEWKTIDSKFRSPLYLSKAAQAD 418 (504) T ss_pred hcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CccccccccceeEecCCCccCHHHHHH Confidence 99765 477899999999999999999999999999999999988775421 122344577899999999999999999 Q ss_pred HHHHHHhCCC--CCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcc--------cCccccccccCCCCCCCCCCC------ Q lcl|NC_016654. 461 TVQAWSVASA--ASTKTKVAYLHEDWDDERVQEEADLIDNANTVS--------APTFGFGTDQPPLPTENDPAT------ 524 (533) Q Consensus 461 ~~~~l~~aGi--~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~--------~~~~~~~~~~~~~~~~~~~~~------ 524 (533) +++|++++|. ++..+.+..+. ++++++++++.++.+++++.. .+..+.+++.+..+ ..++.+ T Consensus 419 a~~Kl~~ag~~l~~~~~~l~~~l-g~~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~-~~e~a~~~~~~~ 496 (504) T protein:vir:99 419 AGAKMLGAGPEWLKETEVGLELL-GLTPQQAKRALAERRRASSVSIIEALNRRQQEAATAGEDQDQG-AGEPPANEPPAA 496 (504) T ss_pred HHHHHHhhccccccchHHHHhhc-CCCHHHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCCCCCCcC-CCCCCCCCCCcc Confidence 9999999985 33434444444 588888776555544444211 11111111111100 111111 Q ss_pred CCCCCCCC Q lcl|NC_016654. 525 DPEAVDEG 532 (533) Q Consensus 525 ~~~~~~d~ 532 (533) +.+..-+| T Consensus 497 ~~~p~~~~ 504 (504) T protein:vir:99 497 LGRPTLVG 504 (504) T ss_pred CCCcccCC Confidence 12222222 No 64 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=100.00 E-value=1.4e-44 Score=260.99 Aligned_cols=455 Identities=11% Similarity=0.046 Sum_probs=262.7 Q ss_pred CCCCCCcCCCcCcchHHHHH--HHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCC-CCccccee Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTA--RVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTA-TGRAPKRY 77 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~--~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~~~ 77 (533) .-+|+- -.=+.+...+.. .+..|..= .....+|.+||.+.+..... ....+.. ...-.+++ T Consensus 2 ~~~p~~--~l~~~~~~~~~~~~l~~~~~~~-~~r~~~~~~YY~g~~~i~~~-------------~~~~~~~~~~~~~~~~ 65 (479) T protein:vir:99 2 IDLPDE--DLSSEGLAKYLETKVFPKMNTE-CERLDDFEAWTKNGQEVPDL-------------ATRHKNKEREVLQQLS 65 (479) T ss_pred ccCCcc--cCChhHHHHHHHHHHHHHHHHH-hHHHHHHHHHHhcCCccccc-------------ccccCChhHHHHHHHh Confidence 233422 222333222111 11111110 01123445555554321100 0000111 11112345 Q ss_pred ecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEc---CCCCCceE Q lcl|NC_016654. 78 HAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWD---PTIADNAW 154 (533) Q Consensus 78 ~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D---~~~~~~~~ 154 (533) ++|||++||+.+|++|+.+. |++.+ ...++.+.++++.|+|.....+++..++++|.+|+.+|.. .+..+.++ T Consensus 66 ~~n~~~~iVd~~~~~l~~~g--f~~~d--~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~~~ 141 (479) T protein:vir:99 66 RKPWMGLMVNSFAQQLIVDG--YRKTG--TNENAKGWDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGISPLDGTTVAR 141 (479) T ss_pred hcCcHHHHHHHHHhhccccc--ccCCC--chhhHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCcCCCCceE Confidence 78999999999999997544 66543 3346778999999999999999999999999999998842 12346789 Q ss_pred EEEEcCCeEEEEEecCCceEE-EEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccccc Q lcl|NC_016654. 155 IDFVDADRAIPEFRWGRLVAV-TFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGA 233 (533) Q Consensus 155 i~~v~~~~~~P~~~~g~~~~v-~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~ 233 (533) |.+++|.+++++|++...... +++.+....... .+++...+. +|...... |.. . T Consensus 142 i~~~~p~~~~~iydd~~~~~~~~~~~~~~~~~~~------~~~~~~~~~--~~~~~~~~------------~~~-----~ 196 (479) T protein:vir:99 142 IKCIDPRDAFAIWEDPYWDEWPKYLLERQPNGQY------WWWTEEDYS--IFEFKQGK------------FIY-----R 196 (479) T ss_pred EEEechhheEEEecCCcccceeeEEEeecCceeE------EEEecceEE--EEEecCCc------------eee-----c Confidence 999999999999987654332 333222222111 122222222 22211111 000 0 Q ss_pred ccCCceeecCCCc-cceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCccee-eechHH Q lcl|NC_016654. 234 DEGRGAYVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKV-HASESV 311 (533) Q Consensus 234 ~~~~~~~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i-~v~~~~ 311 (533) + ..+++.. +|++.|+.|. + ..+.|.|+|.. +.+|+|++|.++|++.+.++.....+ ++.-.. T Consensus 197 ~-----~~~h~~g~vPvv~f~n~~----~------~~~~g~sd~e~-v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~ 260 (479) T protein:vir:99 197 E-----TVSHDYGHIPFVRYVNVM----D------LRGVCYGDVEP-LVTVAKAIDKTGLDILLVQHHQSFQIRWATGLM 260 (479) T ss_pred c-----ccccCCCCcceEEeecCC----C------cCcCCcchhHH-HHHHHHHHHHHHHHHHHHHHHhhchhhhhcCCC Confidence 0 1123322 3444455432 1 23469999986 78999999999999999987543333 221100 Q ss_pred hcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchh Q lcl|NC_016654. 312 LTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQT 391 (533) Q Consensus 312 l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~T 391 (533) +.....+....+... .-+.+. ..+.+. .+.+++ +..+++|.+.++.++++|+..++++++.||+. +..| T Consensus 261 ~~~~~~~~~~~~~~~--~~~i~~----~~~~~~--~~~q~~-~~~~~~~~~~l~~~i~~i~~~t~~p~~~~g~~--~n~S 329 (479) T protein:vir:99 261 LPEGANADQEKMRFA--QESMLI----SQNEKA--SFGAIP-AAPLDGLLNAYKESLLEFLALAQLPPHIAGQI--VNVA 329 (479) T ss_pred cccccccchhccccc--ccccee----ecCCCc--eEEEec-ccchHHHHHHHHHHHHHHhccCCCCHHHcccc--cchH Confidence 101111111111100 001111 111121 233333 34578999999999999999999999999853 3479 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCC Q lcl|NC_016654. 392 ATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAA 471 (533) Q Consensus 392 atai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~ 471 (533) |.||++++..+..++++|++.|+.+|++++++++.+.. +........+++.|.++.++|..+.++.+.+|+++|++ T Consensus 330 g~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~~----~~~~~~~~~i~~~w~~~~~~s~~~~ad~~~kl~~ag~i 405 (479) T protein:vir:99 330 ADALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKIEG----RTEEATDLDFTITWQDVTIQSLAQFADAWAKMVESLKI 405 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC----CCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCC Confidence 99999999999999999999999999999999877632 23334456799999999999999999999999999999 Q ss_pred CHHHHHHHhCCCCCHHHHHHHHHHHHHhh---hcccCcccccccc--CCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 472 STKTKVAYLHEDWDDERVQEEADLIDNAN---TVSAPTFGFGTDQ--PPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 472 S~et~v~~l~~~~~dee~~~El~rI~~E~---~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~d~~ 533 (533) |.||++++ ++++++++++++. ++++++ ......+..+.+. .....++.+..++.+...|+ T Consensus 406 s~et~l~~-l~gv~~~~~e~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 470 (479) T protein:vir:99 406 PAEGVWDM-IPNLDQSTVNGWK-EIYDREGDFGKYMRKLQNGPDPAEQRGGPNGATNMQQANNKTGE 470 (479) T ss_pred CHHHHHHh-cCCCCHHHHHHHH-HHHHHHHHHHHHHHHHhcccCcccccCCCCCCCCCCCCCCCCcc Confidence 99999876 5778887765432 222222 1111111111100 00111111122223333344 No 65 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=100.00 E-value=4.5e-44 Score=258.22 Aligned_cols=425 Identities=12% Similarity=0.039 Sum_probs=251.2 Q ss_pred HhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhh Q lcl|NC_016654. 40 YGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNT 119 (533) Q Consensus 40 y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~ 119 (533) |.. ...+.. .+.-+++.++|||++||+.++++|+.+ .|++.+ ...++.+++++++ T Consensus 1 ~l~------~~~~~~---------------~~~~~~~~v~n~~~~ivd~~~~~l~~~--gf~~~d--~~~~~~~~~i~~~ 55 (434) T protein:vir:98 1 MLP------KNAEQA---------------FLDFQRKARTNFCGLIANASVHRLLAL--GVTGPD--GEPDTRASRWWQA 55 (434) T ss_pred CCC------CCccHH---------------HHHhhhhhhccchHHHHHHHHhhhccC--ceecCC--CchHHHHHHHHHh Confidence 111 000000 011234568899999999999998755 466643 3467889999999 Q ss_pred ccHHHHHHHHHHHHhhhCCEEEEEEEcCC-----CCCceEEEEEcCCeEEEEEec--CCceEEEEEEEEeecCCceEEEE Q lcl|NC_016654. 120 PRFHSSLVEAGESCSALSGSFQRIVWDPT-----IADNAWIDFVDADRAIPEFRW--GRLVAVTFWSELAGGDGQEVWRH 192 (533) Q Consensus 120 n~f~~~~~~~~~~~~~~G~~~~~~~~D~~-----~~~~~~i~~v~~~~~~P~~~~--g~~~~v~f~~~~~~~~~~~~y~~ 192 (533) |+|.....++++.++++|.+|+.+|.+++ +.+.+.|++++|.+++++|++ +++..++++. ........+.. T Consensus 56 N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~--~~~~~~~~~~~ 133 (434) T protein:vir:98 56 NRLDSRQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVW--HNDIDGFGYAR 133 (434) T ss_pred cChhHHHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEE--EeccCCceEEE Confidence 99999999999999999999999998754 345788999999999999986 4555444433 22222233333 Q ss_pred EEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCc-cceeEEecCCccccccccccccccc Q lcl|NC_016654. 193 LERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYL 271 (533) Q Consensus 193 lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~ 271 (533) +..+...++.+ ++..... ....... .+ +.........+++.. .|++.|+.| .+. ... T Consensus 134 ~~~~~~~~~~~--~~~~~~~---~~~~~~~-~~------~~~~~~~~~~~h~~g~vPvv~f~N~----~~~------~~~ 191 (434) T protein:vir:98 134 VFFDDTSFPYR--TRERTGA---RLPWGPD-SW------VYTGTADSGDVHDLGGMQLVEFARM----PDL------GED 191 (434) T ss_pred EEEeCcEEEEE--Eeecccc---ccccccc-cc------eecccccccccCCCCccceEEeccC----CCc------CcC Confidence 44444333322 1111110 0000000 00 000000111223333 233333322 222 235 Q ss_pred ccchhhhhHHHHHHHHHHHHHHHHHHHHhC-cceeeechHHhcCCCCccccccCcchhhhhhccccccccccccccceee Q lcl|NC_016654. 272 GRADLSTDLFPTFHELDRIYSSLMRDFRIG-AGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEF 350 (533) Q Consensus 272 G~S~~~~~i~~lid~lD~~~s~~~~~~~~~-~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~ 350 (533) |.|+|.. +.+|+|++|+++|+..+..+.. .+..++.-.-+.......+.... ....+...........+ ....+.+ T Consensus 192 g~sd~e~-vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~-~~~~~~~~~~~i~~~~~-~~~~~~q 268 (434) T protein:vir:98 192 PEPEFAG-VLDIQDRVNLGILNRMAASRFSGFRQKWIKGHKFAKRTDPATGMTV-VDQPFVPSPSAVWASEG-ENTQFGQ 268 (434) T ss_pred Ccchhhh-HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccccch-hhhhhhccccccccCCC-CCceEEE Confidence 8999975 7899999999999999988743 33333311001000011110000 01111111100000011 1112222 Q ss_pred echhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_016654. 351 FQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAI 430 (533) Q Consensus 351 ~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~ 430 (533) ++ ....++|++.|+.++++++..+++++..||. ..+..||.||++++..|.+++.+|++.|+.+|++++++++.+. T Consensus 269 ~~-~~~~~~~~~~l~~~i~~~~~~~~~p~~~~~~-~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~-- 344 (434) T protein:vir:98 269 LD-ATDLSGFLKEHASDVRDMLTISQTPTYLYAT-DLVNISADTIGALDILHVAKVREHIASFSEGLESVLALAAAQA-- 344 (434) T ss_pred ec-CcchHHHHHHHHHHHHHHhcccCCCHHHhcc-ccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-- Confidence 22 3456899999999999999999999999984 3456799999999999999999999999999999999887652 Q ss_pred hccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccc Q lcl|NC_016654. 431 KFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFG 510 (533) Q Consensus 431 ~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~ 510 (533) +.......+++.|.++.++|..+.++++++|+++|+ |.++.+.. .| ++++|+++..++..+ ++...-..... T Consensus 345 ----g~~~~~~~~~v~w~~~~~~s~~~~ada~~kl~~~g~-~~e~~~~~-lg-~~~~e~~r~~~e~~~-~~~~~~~~~~~ 416 (434) T protein:vir:98 345 ----GVPEDYTEAEVRWANPAHVTMAVKADAATKLKSIGY-PLDVIAEE-LD-ESPARVRRIVAGAAS-QALLAASLLPA 416 (434) T ss_pred ----CCChhheeeeEEecCCCCCCHHHHHHHHHHHHhcCC-cHHHHHHh-CC-CCHHHHHHHHHHHHH-HHHHHHhhhcc Confidence 223455689999999999999999999999998886 88876654 44 677665544433222 22111101111 Q ss_pred cccCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 511 TDQPPLPTENDPATDPEAVDEG 532 (533) Q Consensus 511 ~~~~~~~~~~~~~~~~~~~~d~ 532 (533) ..+++... ++ +.+...|| T Consensus 417 ~~~~~~g~--~~--~~~~~~dg 434 (434) T protein:vir:98 417 PGAPSAGN--VP--DSGGAVDG 434 (434) T ss_pred CCCCCCCC--CC--cccCCCCC Confidence 11221111 11 11222344 No 66 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=100.00 E-value=5e-39 Score=230.57 Aligned_cols=457 Identities=11% Similarity=-0.017 Sum_probs=269.1 Q ss_pred CCCCCcCCC---cCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceee Q lcl|NC_016654. 2 SLPEANTAW---PPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYH 78 (533) Q Consensus 2 ~~~~~~~~~---pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 78 (533) -+|-.+++= ++.+..-+..-+..|..=.. ...++.+||.+.+..+.. +...+...++.+.+ T Consensus 1 ~~~~~~~~~~gl~~~~~~~~~~L~~~~~~~~~-~~~~~~~Yy~G~~~~~~~---------------~~~~p~~~r~~~~v 64 (474) T protein:vir:81 1 MIQQQTVRIPSLSNDENALINGLLAQIENLRW-KNLLRTSYYENKRTIQYV---------------GTLIPPQYFNLGLV 64 (474) T ss_pred CcCCCcCcCCCCChhHHHHHHHHHHHHHHHhh-HHHHHHHHhccCCChhhc---------------cccccHHHHHHHhh Confidence 233222211 12221112122222221111 123455666655432100 00011111233578 Q ss_pred cChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEE Q lcl|NC_016654. 79 APIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFV 158 (533) Q Consensus 79 ~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v 158 (533) +|+|++||+.+|+.|.-+. |.+++. +.....+.++++.|+|.....+++..|+++|.+|+.++.++++.+.++|.++ T Consensus 65 ~nw~~~~Vd~~a~rl~~~G--f~~~d~-~~~~~~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~ 141 (474) T protein:vir:81 65 LGWTGKAVDALARRCNLEG--FVWPDG-DLDSLGGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVK 141 (474) T ss_pred cChHHHHHHHHHhhhcccc--eECCCC-CccchHHHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEe Confidence 9999999999999987665 555433 2334568999999999999999999999999999999998888888999999 Q ss_pred cCCeEEEEEecC--CceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccC Q lcl|NC_016654. 159 DADRAIPEFRWG--RLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEG 236 (533) Q Consensus 159 ~~~~~~P~~~~g--~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~ 236 (533) +|.+++.+|++. ++...+.+.. ...+++ .+....|.++.+.+ +++ ...+. .| .. + T Consensus 142 sp~~~~~~~D~~~~~~~~al~~~~-~~~~g~--~~~~~ly~~~~~~~-~~~-~~~~~----------~w--~~----~-- 198 (474) T protein:vir:81 142 DASEATGEWNRRRRGLNNLLSIID-KDKEGK--VLSLALYLDNETVT-AQR-DKATL----------KW--QV----D-- 198 (474) T ss_pred ccceEEEEEeCCCCcceeeeEEEE-EcCCCc--EEEEEEEeCCcEEE-EEE-cCccc----------ee--ee----c-- Confidence 999999999863 4444333222 222222 33444566776643 222 11110 00 00 0 Q ss_pred CceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhC-cceeee----chHH Q lcl|NC_016654. 237 RGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIG-AGKVHA----SESV 311 (533) Q Consensus 237 ~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~-~~~i~v----~~~~ 311 (533) ..+++...|.+.|+.+ .+. ..+.|+|.+...+.+++|++|+++++.....+.. -+..+| ++.+ T Consensus 199 ---~~~~~~gvPvV~~~n~----~~~-----~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~~~~~ 266 (474) T protein:vir:81 199 ---RDEHVYGVPAQVLPYK----PAP-----KRPFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLLGADESAL 266 (474) T ss_pred ---cCCCCCCcceEEeccc----ccc-----cCcCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeecCChhhc Confidence 0122222233333322 222 2567999987778899999999999998877642 111222 0110 Q ss_pred hcCCCCccccccCcchhhhhhcccccccccccc--ccceeeech-hhhhHHHHHHHHHHHHHHHHhhCCChhhcccCC-C Q lcl|NC_016654. 312 LTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDM--ETIFEFFQP-AIRVLEHDQGAALLLREVLRKTGYSPVSLGLSD-E 387 (533) Q Consensus 312 l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~--~~~i~~~~~-~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~-~ 387 (533) . +.++.+. +........+.....+..++. ....+..|+ +...+.|.+.|+.++++++..+++|+++||+.+ . T Consensus 267 ~--d~d~~~~--~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~~a~l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~ 342 (474) T protein:vir:81 267 K--NADGTIK--SVWEARLGRIKGLPDDADADIPQLARADVKQFPAASPDAHWSDINGLAKLFAREASLPDTAVAISGLS 342 (474) T ss_pred c--ccccccc--chhhhhHHHHhcCCCcccccccccccccccccCCCChhHHHHHHHHHHHHHHhhhCCCHHHhcccccc Confidence 0 0111111 111111111111112211111 011122222 345678999999999999999999999999765 7 Q ss_pred cchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHh Q lcl|NC_016654. 388 VAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSV 467 (533) Q Consensus 388 ~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~ 467 (533) ++.||.+|++.+..|..++.+|++.|+.+|++++++++.+....-..........+.+.|.|...++..+.++.+.|+++ T Consensus 343 np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~ 422 (474) T protein:vir:81 343 NPTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVAIDEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQLA 422 (474) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccchhhccceeEecCCCccCHHHHHHHHHHHHh Confidence 88999999999999999999999999999999999998875321111112334678999999999999999999999999 Q ss_pred CCC--CCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhccc-CccccccccCCCCCCCCCCCCCC Q lcl|NC_016654. 468 ASA--ASTKTKVAYLHEDWDDERVQEEADLIDNANTVSA-PTFGFGTDQPPLPTENDPATDPE 527 (533) Q Consensus 468 aGi--~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 527 (533) +|. .+.++... .+++++++++++.+..+++++... +.+-..+ .+.. .+ | T Consensus 423 a~~~~~~~~~~~~--~lg~t~~~i~~~~~~~~~~~~~~~~~~l~~~~--~~~~---~a----q 474 (474) T protein:vir:81 423 AVPWLAETEVGLE--LIGLTPQQARRAMADKRRVQGRGTLQALIDRS--NNGA---TA----Q 474 (474) T ss_pred cccCCCcHHHHHh--hcCCCHHHHHHHHHHHHHHhHHHHHHHHHhcC--CCCC---CC----C Confidence 984 44555444 357999888766555555543211 1000000 0000 00 1 No 67 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=100.00 E-value=1.7e-38 Score=227.66 Aligned_cols=400 Identities=12% Similarity=0.013 Sum_probs=254.7 Q ss_pred hHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchh--hHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHh Q lcl|NC_016654. 15 LAAVTARVAESHVWWEGDLDKLATFYGAEGRTSP--SGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTE 92 (533) Q Consensus 15 ~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~l 92 (533) ++--+.+.++.. +||.+.+..+. ......+ ..+.++++|+|++||+.+|+. T Consensus 1 l~~~~~r~~~~~-----------~yY~g~~~~~~~~~~~p~~~----------------~~~~~~v~nw~~~~Vds~a~r 53 (410) T protein:vir:95 1 MNLYQSRVNLRY-----------KHYAMQHYEAPTGITIPAHI----------------RAKYQAVLGWAAKGVDSLADR 53 (410) T ss_pred CCcchhhHHHHH-----------HHhcCCCCccccchhccHHH----------------HhHHHhhcchhHHHHHHhHhh Confidence 333333333333 34433322210 0000000 123467899999999999998 Q ss_pred hcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEec--C Q lcl|NC_016654. 93 LFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRW--G 170 (533) Q Consensus 93 l~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~--g 170 (533) |.-+. |+.. +. .++++++.|+|.....+++..|+++|.+|+.++-+++ ++++|.+++|.+++.+|++ + T Consensus 54 l~~~G--f~~~--d~----~l~~i~~~N~ld~~~~~~~~~al~~G~sf~~v~~~~d--~~~~i~~~sP~~~~~i~Dp~~~ 123 (410) T protein:vir:95 54 LIFRA--FAND--DF----NVTEIFDRNNPDIFFDSAILSALIGSCSFVYISKGED--DEVRLQVIESSNATGVIDPITG 123 (410) T ss_pred hcccc--ccCC--Cc----hHHHHHhhcChHHHHHHHHHHHHHhCceeEEEecCCC--CceEEEEEcccceEEEEeCCCC Confidence 76554 5442 22 3778899999999999999999999999999987764 4689999999999999986 3 Q ss_pred CceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCcccee Q lcl|NC_016654. 171 RLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTA 250 (533) Q Consensus 171 ~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 250 (533) ++...+ +.+.. ++....+.+..|.++.|.+ |..+.. ...++++...+++ T Consensus 124 ~~~~al--~~~~~-~~~~~~~~~~~~~~~~~~~--~~~~~~--------------------------~~~~~~~~g~vPv 172 (410) T protein:vir:95 124 LLVEGY--AVLAR-DDYNRPTLEAYFEPNATHF--IPKDGE--------------------------PYSVTNETGIPLL 172 (410) T ss_pred ceEEEE--EEEEe-cCCCeEEEEEEEeCCcEEE--EeeCCc--------------------------cccccCCCCCcce Confidence 443333 22222 2222345555666665543 221100 0113444444444 Q ss_pred EEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCc-ceeeechHHhcCCCCccccccCcchhh Q lcl|NC_016654. 251 AYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGA-GKVHASESVLTNLGMGQGVSLDEEQEV 329 (533) Q Consensus 251 ~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~-~~i~v~~~~l~~~~~~~~~~~d~~~~~ 329 (533) +.++|.. +. ..++|+|.+...|.+++|++|+++++.....+..- +..++. . ++.++.+. ..|+.. T Consensus 173 V~f~n~~---~l-----~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-G-~d~d~~~~-~~~~~~--- 238 (410) T protein:vir:95 173 VPVIHRP---DA-----VRPFGRSRITRAGMYYQKYAKRTLERADITAEFYSWPQKYIL-G-LDPDAEPM-EKWKAT--- 238 (410) T ss_pred EEecccc---cC-----CccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheee-c-cCCCCCcC-chhhhh--- Confidence 4444432 22 25689999877788999999999999988876421 111210 1 11111111 011111 Q ss_pred hhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHH Q lcl|NC_016654. 330 YSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAK 409 (533) Q Consensus 330 ~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~ 409 (533) ...+...+.+.++ ....+.+++ ....+.|++.++.++++++..+++|++.||..+.++.||.||++.+..|..++.+| T Consensus 239 ~~~i~~~~~~~~~-~~~~v~q~~-~~~l~~~~~~l~~l~~~~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k 316 (410) T protein:vir:95 239 VSSLLTISSSDKG-VKPSVGQFT-TASMSPFTEQLRTAAAGFAGEMGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKA 316 (410) T ss_pred hhhheeccCCCCC-CcceEEecC-CCChHHHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHH Confidence 1112221222222 222333332 23456899999999999999999999999988888889999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeC---CCCCCCHHHHHHHHHHHHhC--CCCCHHHHHHHhCCCC Q lcl|NC_016654. 410 ARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWP---KFARESDLAKAQTVQAWSVA--SAASTKTKVAYLHEDW 484 (533) Q Consensus 410 ~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~---d~i~~d~~e~a~~~~~l~~a--Gi~S~et~v~~l~~~~ 484 (533) ++.|+.+|++++++++.+.... .........+.|.|. +.-.++..+.++.+.||+++ |++++++++..+ ++ T Consensus 317 ~~~fg~~l~~~~rla~~i~~~~--~~~~~~~~~~~v~W~p~~d~~~~s~a~~aDa~~Kl~~a~~g~~~~~~~~~~l--g~ 392 (410) T protein:vir:95 317 QRSLGAGLLNVAYVAACLRDEF--RYTRSQFVRTAVKWEPLFEADANTMTMIGDGVVKLNQALPGYINAETIRDLT--GI 392 (410) T ss_pred HHHHHHHHHHHHHHHHHHhcCC--CCcccccceeeEEeeecCCcchhhHHHHHHHHHHHHHhccCCccHHHHHHhc--CC Confidence 9999999999999988875321 112234567899998 77778899999999999998 799999988876 46 Q ss_pred CHHHHHHHHHHHHHhhhcccC Q lcl|NC_016654. 485 DDERVQEEADLIDNANTVSAP 505 (533) Q Consensus 485 ~dee~~~El~rI~~E~~~~~~ 505 (533) ++++..+ +..+|+.+... T Consensus 393 ~~~~~~~---~~~~e~~~~g~ 410 (410) T protein:vir:95 393 AGDMSAK---PVVSEGGSNGE 410 (410) T ss_pred ChHHHHH---HHHHHHHhCCC Confidence 7654332 22233322100 No 68 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=100.00 E-value=7.4e-38 Score=224.15 Aligned_cols=487 Identities=11% Similarity=0.051 Sum_probs=278.3 Q ss_pred CC----CCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccce Q lcl|NC_016654. 1 MS----LPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKR 76 (533) Q Consensus 1 ~~----~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 76 (533) || +|+....|=|+.-....+.++. |..||.++.. ...+.-.|..++. T Consensus 9 ~p~~~~fp~~~a~wV~~~D~~RlaaY~l-----------y~d~y~n~~~------------------el~~il~G~dr~~ 59 (563) T protein:vir:74 9 DPAKPFLRGGDDNIVDENDKNRVRAYDL-----------YENIYLNSAE------------------TLKLVLRGDDSVP 59 (563) T ss_pred CCCcccccccccccCCHHHHHHHHHHHH-----------HHHhhcCchh------------------hhhhhcCCCceee Confidence 33 6777778877774443333332 2334433211 1111122444444 Q ss_pred eecChHHHHHHHHHHhhcCCCceEeeCCCc------hHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCC-- Q lcl|NC_016654. 77 YHAPIPGVIAKLSTTELFSEQLKFLDAGKS------KEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPT-- 148 (533) Q Consensus 77 ~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~------~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~-- 148 (533) +..+.++++|++.+++| +++..|++.+.+ +.++.+|.++++.+++...+..+...|.++|+++|++.||++ T Consensus 60 ~~~ps~r~~V~~~~~~L-g~~~~~~Ve~~~~de~~~~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~ 138 (563) T protein:vir:74 60 ILMPSGRKIVEAVHRFL-GVGFDYLVEPDMGDEGIRQSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKK 138 (563) T ss_pred eccchHHHHHHHHHHhc-CCCcEEecCccccCcchHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccccc Confidence 55556789999977655 999999886432 346788899999999999999999999999999999999963 Q ss_pred CCCceEEEEEcCCeEEEEEecCCceEEEEEE---EEe--ecCCceEEEE----EEEecCe-eEEEEEEeccCCcccc--e Q lcl|NC_016654. 149 IADNAWIDFVDADRAIPEFRWGRLVAVTFWS---ELA--GGDGQEVWRH----LERHESG-YIVHAVYKGTATSLGW--M 216 (533) Q Consensus 149 ~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~---~~~--~~~~~~~y~~----lE~h~~~-~I~~~~y~~~~~~lG~--~ 216 (533) ...++++.-|+|.++||.-+.+.+..+..++ .+. ...++..+++ .|....+ +.+-.+|.-+...+|. . T Consensus 139 ~g~R~rv~~vDP~~~fp~~dpd~v~g~~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg~~~~~~~~dae~w~lg~wd~ 218 (563) T protein:vir:74 139 AGERISVDEVDPRQIFLIEDGSTVVGFHMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEGMFTGRISSELTHWTLGNWDD 218 (563) T ss_pred cCCCceEeecCCceeeeccCCCCcccceeeecccCCCCCcchhccceeeeeeeeeeCCCCCccceeeeccchhccccccc Confidence 2358999999999999965555554443222 111 1223323322 1111111 1111112222222321 0 Q ss_pred eehhhcccccccccc-ccccC--CceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHH Q lcl|NC_016654. 217 MALTDHPATRDIAVE-GADEG--RGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSS 293 (533) Q Consensus 217 v~l~~~~~~~~~~~~-~~~~~--~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~ 293 (533) .++.... +...... +.... +.....+....+.+++.||+ ++..+.||+|++++ +.++|++||...|+ T Consensus 219 r~~~~~~-~~~~~~~~~~~~~d~e~~~LP~pi~~iPiv~~~ti--------p~~~s~WG~S~La~-ll~~~~eLn~~~Td 288 (563) T protein:vir:74 219 RGAISDE-QARRKEQVRSAQHDEEEEELPEPISQLPLYRWRNK--------PPQNSSWGTSQLEG-METLAYALNQSLTD 288 (563) T ss_pred cCccchh-hhcccchhhhhhhhchhhhccccccCccEEEcCCC--------CCcccccchhhHHH-HHHHHHHHhhhhhH Confidence 0110000 0000000 01000 11111222223334443443 44568899999997 67999999999999 Q ss_pred HHHHHHhCcceeeechHHhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHH-HHH Q lcl|NC_016654. 294 LMRDFRIGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLR-EVL 372 (533) Q Consensus 294 ~~~~~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~-~i~ 372 (533) ..+.+..+..-|+|-++.-..++ ..+..-+..-+-=..+..+.... ...+..++-.-.+..+..+++.+.. -++ T Consensus 289 ~s~i~~~tG~pi~vl~~~~p~d~-~~g~~~~w~vgpG~i~El~~~~~----~g~l~~v~g~~~l~~~q~Hm~~l~eral~ 363 (563) T protein:vir:74 289 EDATIVFQGLGMYVTNASAPVDP-NTGELTDWNIGPMQIVEIAGNRN----DNYFERVSGVQDVSPFQDHMKWIDEKGIA 363 (563) T ss_pred HHHHHHhcCCCeEEecccccccc-ccccccccccCCceeEeccCCcc----ccceeeecchhhhHHHHHHHHHHHHHHHH Confidence 99999999988998554332221 11111111000000011111111 1234555544455566666666666 446 Q ss_pred HhhCCChhhcc-cCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHH----HHHHHHHHHHhh--------ccCCCCCC Q lcl|NC_016654. 373 RKTGYSPVSLG-LSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGP----LSTTCLRVDAIK--------FPGKGAAP 439 (533) Q Consensus 373 ~~~g~s~~~~g-~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~----li~~il~l~~~~--------~~~~~~~~ 439 (533) ..+|++..+|| .+.+...||.|++.+.++|++++++|+..+..++++ +++..|...... ..|....+ T Consensus 364 ~~s~tPavA~G~vD~~~~~SGiALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~ 443 (563) T protein:vir:74 364 EGSGTPEVAIGRVDVTSAESGISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLL 443 (563) T ss_pred hhccCcceeecccccccccchhhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccC Confidence 77899999999 355568899999999999999999999988888887 444444222211 11222222 Q ss_pred c-eeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCC-CCCHHHHHHHHHHHHHh--------hhcccCcccc Q lcl|NC_016654. 440 S-EELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHE-DWDDERVQEEADLIDNA--------NTVSAPTFGF 509 (533) Q Consensus 440 ~-~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~-~~~dee~~~El~rI~~E--------~~~~~~~~~~ 509 (533) . ..|+|.|.+.+|.|.++.++.+.+|+++||+|++||+++|-- +|...+|+.|.++|+.. ++..+++++. T Consensus 444 ~~~~v~ivf~p~~P~d~~~vv~~~~tl~~aGiiSretAv~~L~~~g~~~pdae~e~~~ie~~~i~~~~~a~a~ad~~~~~ 523 (563) T protein:vir:74 444 NECSVVCIFADPMPVNKTQVTQDTLLLQQAHLILRKMAVAKLRSIGWEYPEVDDQGNALTDDDIADMLLAEAEADASLGL 523 (563) T ss_pred CceEEEEEeCCCCCccHHHHHHHHHHHHHcCchhHHHHHHHHHhCCCCCCcHHHHHhhcCHHHHHHHHHHHhhccCcccc Confidence 2 347899999999999999999999999999999999988711 24444455555544433 3333444332 Q ss_pred -ccccCCCCCCC-CCCCCCCCCCCCC Q lcl|NC_016654. 510 -GTDQPPLPTEN-DPATDPEAVDEGE 533 (533) Q Consensus 510 -~~~~~~~~~~~-~~~~~~~~~~d~~ 533 (533) ..+.+..+++. |+.|+ +-..=|. T Consensus 524 ~a~~~~g~~~~~~dd~g~-p~~~~~~ 548 (563) T protein:vir:74 524 SAMDNGGAGEQQFDDQGN-PIDQFGN 548 (563) T ss_pred eecccCCCCcccccccCC-chhHcCC Confidence 11222222222 22322 1111121 No 69 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=100.00 E-value=3.3e-37 Score=220.58 Aligned_cols=413 Identities=11% Similarity=0.034 Sum_probs=255.4 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |.. .. .......+...+. ...++.+||.+.+....... ..|.. -....++.+| T Consensus 1 m~~--~~-------i~~L~~~~~~~~~----r~~~~~~yy~g~~~~~~~~~-------------~~p~~-~~~~~~~v~n 53 (422) T protein:vir:97 1 MNY--MG-------MGYLRRKLALFKT----GVDKRYRYYAMDDRDDTRSI-------------VMPNN-VREMYRSVLE 53 (422) T ss_pred CCh--HH-------HHHHHHHHHHHHH----HHHHHHHHHhcCCChhhcCc-------------cccHH-HHHHHHhhcc Confidence 111 11 1122222222222 12344555555432210000 00000 0123356789 Q ss_pred hHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcC Q lcl|NC_016654. 81 IPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDA 160 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~ 160 (533) +|+++|+.+|+.++-+. |++. +. .++++++.|+|.....+++..|+++|.+|+.++.|++ .+.++|.+++| T Consensus 54 w~~~~Vd~~a~rl~~~G--f~~~--d~----~l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~-~~~p~i~~~sp 124 (422) T protein:vir:97 54 WTAKGVDSLADRIIFRE--FTND--DF----NAWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAE-DGLPKMQVIEA 124 (422) T ss_pred hhHHHHHHHHhccccce--eeCC--ch----hHHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCC-CCeeEEEEech Confidence 99999999999775443 5553 22 3678899999999999999999999999999998864 35789999999 Q ss_pred CeEEEEEecC--CceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCc Q lcl|NC_016654. 161 DRAIPEFRWG--RLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRG 238 (533) Q Consensus 161 ~~~~P~~~~g--~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~ 238 (533) .+++.+|++. ++..+..+... ..++. .+...++....+.+ ++.+ . .. T Consensus 125 ~~~~~i~D~~~~~~~~a~~~~~~-~~~~~--~~~~~~~~~~~~~~--~~~~-~-------------------------~~ 173 (422) T protein:vir:97 125 SKATGILDPTTFLLTEGYAILES-DSNGN--PTLEAYFTDKDIWY--YPKK-G-------------------------KP 173 (422) T ss_pred hhEEEEEeCCCCcceeeEEEEEe-cCCCc--EEEEEEEcCceEEE--EcCC-C-------------------------cc Confidence 9999999863 44433322221 11222 22233344444332 2211 0 00 Q ss_pred eeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcC-CCC Q lcl|NC_016654. 239 AYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTN-LGM 317 (533) Q Consensus 239 ~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~-~~~ 317 (533) ..++++...++++.++|.. +. ..++|+|.+...+.+++|++|+++++.....+..-. |..++.. +.. T Consensus 174 ~~~~~~~g~vPvv~~~n~~---~~-----~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~----pqr~i~G~d~d 241 (422) T protein:vir:97 174 YNIKNPTGHPLLVPIIHRP---DA-----VRPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSF----PQKYVLGMDPD 241 (422) T ss_pred ccccCCCCCcceEEecccC---CC-----ccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcc----hhhhhcccCcc Confidence 1223444444444444432 11 367899999777889999999999999888764321 2222211 111 Q ss_pred cccc-ccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHH Q lcl|NC_016654. 318 GQGV-SLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEAS 396 (533) Q Consensus 318 ~~~~-~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~ 396 (533) +... .++.. ...+...+.+..++ ...+.+++ ....+.|++.|+.++++++..+++|++.||..+.++.||.||+ T Consensus 242 ~~~~~~~~~~---~~~i~~~~~de~~~-~~~v~q~~-~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~ 316 (422) T protein:vir:97 242 AKPMEKWRAT---VSTLLEISKDEDGD-KPTVGQFT-TASMAPFMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIK 316 (422) T ss_pred cccCchhhhh---hhhhhccCCCCCCC-cceeeecC-CCChhHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHH Confidence 1111 11111 11121122222221 22233332 2335689999999999999999999999998888888999999 Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCC---HHHHHHHHHHHHhC--CCC Q lcl|NC_016654. 397 GKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARES---DLAKAQTVQAWSVA--SAA 471 (533) Q Consensus 397 ~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d---~~e~a~~~~~l~~a--Gi~ 471 (533) +.+..|..++.+|++.|+.+|++++++++.+.... ........++.+.|....+.+ ..+.|+.++|++++ |++ T Consensus 317 a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~--~~~~~~~~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~~~~ 394 (422) T protein:vir:97 317 AAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEF--PYLRNQFMDTVIKWEPLFEADANMLTLVGDGAIKLNQAIPGFM 394 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--cccchhhccceEEEccCCCCChHHHHHHHHHHHHHHhhccccc Confidence 99999999999999999999999999988775321 111223457899999666666 67788889999999 689 Q ss_pred CHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcc Q lcl|NC_016654. 472 STKTKVAYLHEDWDDERVQEEADLIDNANTVS 503 (533) Q Consensus 472 S~et~v~~l~~~~~dee~~~El~rI~~E~~~~ 503 (533) +.+++..++ ++++ .+.|..+++++++.- T Consensus 395 ~~~~~~~~l--g~~~--~~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 395 DADVIRDLT--GVKG--ADKPIPAITEVTTDG 422 (422) T ss_pred cHHHHHHHc--CCCc--hhHHHHHHHhhhccC Confidence 999988877 3554 456777777665321 No 70 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=100.00 E-value=5.8e-37 Score=219.26 Aligned_cols=473 Identities=11% Similarity=0.078 Sum_probs=272.9 Q ss_pred CCCCcCCCcCcchHHHHHHHHhhhHhhcCCH---HHHHHHHhccCcchhhHHHHHHHHHHHHHhccc------CCCCCcc Q lcl|NC_016654. 3 LPEANTAWPPPELAAVTARVAESHVWWEGDL---DKLATFYGAEGRTSPSGIKARTKAAYEAFHGRT------PTATGRA 73 (533) Q Consensus 3 ~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~---~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~g~~ 73 (533) ||-..-++=|+. ..+.|+- +....+-+ .|...|+- +..++..+ ....|.. T Consensus 1 ~~~~~~~~~~~~------------~~~~g~~~~p~~v~~~d~----~Rl~aY~l-----~~~~y~n~~~~~~~~lrg~~~ 59 (527) T protein:vir:10 1 MGQDKRQYGSTQ------------QLRAGEANFPNAVTDFDK----ARLASYRL-----YEDMYLTNTSDYQVILRGGDE 59 (527) T ss_pred CCccccccCCCc------------CcCCccccCcccCCHHHH----HHHHHHHH-----HHHHhcCchhheeeecCCccc Confidence 443333333322 1112221 10000000 00001110 01111110 0011121 Q ss_pred --cceeecChHHHHHHHHHHhhcCCCceEeeC-------CCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEE Q lcl|NC_016654. 74 --PKRYHAPIPGVIAKLSTTELFSEQLKFLDA-------GKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIV 144 (533) Q Consensus 74 --~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~-------~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~ 144 (533) .+.+..+-. +++++....|++. ..++.+++.|..+++.+++...+.++...|.++|+++|++. T Consensus 60 ~~~r~~~~ps~--------~~~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~ 131 (527) T protein:vir:10 60 GDQRPIYVPNG--------EKLIEAKMRFLGQGLKWEFSKKDAKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLI 131 (527) T ss_pred cccceeeehhh--------HHhhCCcceeeccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEe Confidence 234444444 5555555555443 34567889999999999999999999999999999999999 Q ss_pred EcCCC--CCceEEEEEcCCeEEEEEec---CCceEEEEEEEEeec-C-CceEE-EEEEE-----e------cCeeEEEEE Q lcl|NC_016654. 145 WDPTI--ADNAWIDFVDADRAIPEFRW---GRLVAVTFWSELAGG-D-GQEVW-RHLER-----H------ESGYIVHAV 205 (533) Q Consensus 145 ~D~~~--~~~~~i~~v~~~~~~P~~~~---g~~~~v~f~~~~~~~-~-~~~~y-~~lE~-----h------~~~~I~~~~ 205 (533) ||++. ..+|++..++|.++||+.+. +.+..+.|+..|..- + ++.++ .++-+ . ..|.|.|.. T Consensus 132 wD~~k~~~~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~ 211 (527) T protein:vir:10 132 GDDEKDEGSRLSLHEVDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTE 211 (527) T ss_pred eccCCCcCCCceEeecCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeee Confidence 99753 25899999999999999876 467777777555432 2 22222 11111 1 135554432 Q ss_pred EeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHH Q lcl|NC_016654. 206 YKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFH 285 (533) Q Consensus 206 y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid 285 (533) +. ..+|.=-...+.+....-.....+..+....++....++++++||.. +..+.||+|++++ +.++++ T Consensus 212 ~~---w~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p--------~~~~~WG~S~La~-ll~l~d 279 (527) T protein:vir:10 212 EL---YEPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHP--------IMNAMFGRSGLAG-LESLIA 279 (527) T ss_pred ce---eeccccccccccccchhhhhhhcCceeeecccCCCCccceEeecCCC--------ccccccChhhHhH-HHHHHH Confidence 21 11111000111111111111111111111223444555566566653 3357899999996 679999 Q ss_pred HHHHHHHHHHHHHHhCcceeeechHHhcCCCCccccccCcchhhhhhccccc---cccccccccceeeechhhhhHHHHH Q lcl|NC_016654. 286 ELDRIYSSLMRDFRIGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGG---FNANGDMETIFEFFQPAIRVLEHDQ 362 (533) Q Consensus 286 ~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~---~~~~~~~~~~i~~~~~~ir~e~~~~ 362 (533) +||...|+..+.++.+...|++-..+--.+..|+... +.+++ .+-+..++ +..++-.-.+..|.. T Consensus 280 eLn~~~Td~s~is~~sG~Pi~~~tg~~~vd~~G~~~~----------~~VgPG~iweL~e~ak--~~~v~~~~~la~~~~ 347 (527) T protein:vir:10 280 SVNQTMTDEDLIMVFGGLGFYATDSAPPRDSRGNMVP----------WTISPLGMVEHGQNNK--IYRVNGVASLEPSQT 347 (527) T ss_pred HHhhhhhHHHHHHHHhCCceeeecccccccccCCcCc----------cccCCceeEecCCCcc--eeeccchhhhHHHHH Confidence 9999999999999998888887433311111111110 11110 01111111 222332234567888 Q ss_pred HHHHHHHHHHHhhCCChhhcc-cCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHH--HHHHHHHhh-ccCCCCC Q lcl|NC_016654. 363 GAALLLREVLRKTGYSPVSLG-LSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLST--TCLRVDAIK-FPGKGAA 438 (533) Q Consensus 363 ~l~~~l~~i~~~~g~s~~~~g-~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~--~il~l~~~~-~~~~~~~ 438 (533) ++..+++.|+..++++..+|| .+.++..|+.+++.+.++|++++.+|+..|+-..++..+ +..|+.++. +...... T Consensus 348 h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~ 427 (527) T protein:vir:10 348 HMTKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDAD 427 (527) T ss_pred HHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCc Confidence 899999999999999999999 455678899999999999999999999999999988664 234555532 2222233 Q ss_pred CceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCC-CCCHHHHHHHHHHHHHhhhcc-------cCccc-c Q lcl|NC_016654. 439 PSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHE-DWDDERVQEEADLIDNANTVS-------APTFG-F 509 (533) Q Consensus 439 ~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~-~~~dee~~~El~rI~~E~~~~-------~~~~~-~ 509 (533) ....+.|.|.+.+|.|..+.++.+.+|+++|++|+++|+++|-- .+ .+.+++|++||.++.+.+ ..+++ + T Consensus 428 ~~~~v~ivf~p~lP~D~~avie~v~tL~~aGi~S~~tAv~~L~~~~g-~eD~E~E~~~I~~era~~a~a~a~A~~~~~a~ 506 (527) T protein:vir:10 428 KKLTVTITFRDPKPVNSEKRFNQLLQLWEAGLIPAKKLTEELSKIMG-FELTEEDFKQATEDKKTQGIAQAEAADPFGAQ 506 (527) T ss_pred cccceEEEecccCCCCHHHHHHHHHHHHHcCchhHHHHHHHHHhccC-CCChHHHHHHHHHHHHHHhHHhhhhcCchhhh Confidence 44578999999999999999999999999999999999998710 01 223667788887775421 12222 1 Q ss_pred ccccC-CCCCCCCCCCCCCCC Q lcl|NC_016654. 510 GTDQP-PLPTENDPATDPEAV 529 (533) Q Consensus 510 ~~~~~-~~~~~~~~~~~~~~~ 529 (533) .++.. -..++.|+.+.+... T Consensus 507 ~~~~~g~~~~~~d~~~~~~~~ 527 (527) T protein:vir:10 507 MAAEQGIPDEEDDQALNGQPL 527 (527) T ss_pred hccccCCCCCCcccccCCCCC Confidence 12222 222223333433333 No 71 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=100.00 E-value=6.3e-37 Score=219.03 Aligned_cols=473 Identities=11% Similarity=0.077 Sum_probs=273.0 Q ss_pred CCCCcCCCcCcchHHHHHHHHhhhHhhcCCH---HHHHHHHhccCcchhhHHHHHHHHHHHHHhccc------CCCCCcc Q lcl|NC_016654. 3 LPEANTAWPPPELAAVTARVAESHVWWEGDL---DKLATFYGAEGRTSPSGIKARTKAAYEAFHGRT------PTATGRA 73 (533) Q Consensus 3 ~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~---~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~g~~ 73 (533) ||-..-++=|+. ..+.|+- +....+-+ .|...|+- +..++..+ ....|.. T Consensus 1 ~~~~~~~~~~~~------------~~~~g~~~~p~~v~~~d~----~Rl~aY~l-----~~~~y~n~~~~~~~~lrg~~~ 59 (527) T protein:vir:10 1 MGQDKRQYGSTQ------------QLRAGEANFPNAVTDFDK----ARLASYRL-----YEDMYLTNTSDYQVILRGGDE 59 (527) T ss_pred CCccccccCCCc------------CcCCccccCcccCCHHHH----HHHHHHHH-----HHHHhcCchhheeeecCCccc Confidence 443333333322 1112221 10000000 00001110 01111110 0011121 Q ss_pred --cceeecChHHHHHHHHHHhhcCCCceEeeC-------CCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEE Q lcl|NC_016654. 74 --PKRYHAPIPGVIAKLSTTELFSEQLKFLDA-------GKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIV 144 (533) Q Consensus 74 --~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~-------~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~ 144 (533) .+.+..+-. +++++....|++. ..++.+++.|..+++.+++...+.++...|.++|+++|++. T Consensus 60 ~~~r~~~~ps~--------~~~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~ 131 (527) T protein:vir:10 60 GDQRPIYVPNG--------EKLIEAKMRFLGQGLKWEFSKKDAKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLI 131 (527) T ss_pred cccceeeehhh--------HHhhCCcceeeccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEe Confidence 234444444 5555555555443 34567889999999999999999999999999999999999 Q ss_pred EcCCC--CCceEEEEEcCCeEEEEEec---CCceEEEEEEEEeec-C-CceEE-EEEEE-----e------cCeeEEEEE Q lcl|NC_016654. 145 WDPTI--ADNAWIDFVDADRAIPEFRW---GRLVAVTFWSELAGG-D-GQEVW-RHLER-----H------ESGYIVHAV 205 (533) Q Consensus 145 ~D~~~--~~~~~i~~v~~~~~~P~~~~---g~~~~v~f~~~~~~~-~-~~~~y-~~lE~-----h------~~~~I~~~~ 205 (533) ||++. ..+|++..++|.++||+.+. +.+..+.|+..|..- + ++.++ .++-+ . ..|.|.|.. T Consensus 132 wD~~k~~~~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~ 211 (527) T protein:vir:10 132 GDDEKDEGSRLSLHEVDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTE 211 (527) T ss_pred eccCCCcCCCceEeecCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeee Confidence 99753 25899999999999999876 467777777555432 2 22222 11111 1 135554432 Q ss_pred EeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHH Q lcl|NC_016654. 206 YKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFH 285 (533) Q Consensus 206 y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid 285 (533) +. ..+|.=-...+.+....-.....+..+....++....++++++||.. +..+.||+|++++ +.++++ T Consensus 212 ~~---w~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p--------~~~~~WG~S~La~-ll~l~d 279 (527) T protein:vir:10 212 EL---YEPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHP--------IMNAMFGRSGLAG-LESLIA 279 (527) T ss_pred ce---eeccccccccccccchhhhhhhcCceeeecccCCCCccceEeecCCC--------ccccccChhhHhH-HHHHHH Confidence 21 11111000111111111111111111111223444555566566653 3357899999996 679999 Q ss_pred HHHHHHHHHHHHHHhCcceeeechHHhcCCCCccccccCcchhhhhhccccc---cccccccccceeeechhhhhHHHHH Q lcl|NC_016654. 286 ELDRIYSSLMRDFRIGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGG---FNANGDMETIFEFFQPAIRVLEHDQ 362 (533) Q Consensus 286 ~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~---~~~~~~~~~~i~~~~~~ir~e~~~~ 362 (533) +||...|+..+.++.+...|++-..+--.+..|+... +.+++ .+-+..++ +..++-.-.+..|.. T Consensus 280 eLn~~~Td~s~is~~sG~Pi~~~tg~~~vd~~G~~~~----------~~VgPG~iweL~e~ak--~~~v~~~~~la~~~~ 347 (527) T protein:vir:10 280 SVNQTMTDEDLIMVFGGLGFYATDSAPPRDSRGNMVP----------WTISPLGMVEHGQNNK--IYRVNGVASLEPSQT 347 (527) T ss_pred HHhhhhhHHHHHHHHhCCceeeecccccccccCCcCc----------cccCCceeEecCCCcc--eeeccchhhhHHHHH Confidence 9999999999999998888887433311111111110 01110 01111111 222332234567888 Q ss_pred HHHHHHHHHHHhhCCChhhcc-cCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHH--HHHHHHHhh-ccCCCCC Q lcl|NC_016654. 363 GAALLLREVLRKTGYSPVSLG-LSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLST--TCLRVDAIK-FPGKGAA 438 (533) Q Consensus 363 ~l~~~l~~i~~~~g~s~~~~g-~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~--~il~l~~~~-~~~~~~~ 438 (533) +++.+++.|+..++++..+|| .+.++..|+.+++.+.++|++++.+|+..|+-..++..+ +..|+.++. +...... T Consensus 348 h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~ 427 (527) T protein:vir:10 348 HMNKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDAD 427 (527) T ss_pred HHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCc Confidence 899999999999999999999 455678899999999999999999999999999988664 234555532 2222233 Q ss_pred CceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCC-CCCHHHHHHHHHHHHHhhhcc-------cCccc-c Q lcl|NC_016654. 439 PSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHE-DWDDERVQEEADLIDNANTVS-------APTFG-F 509 (533) Q Consensus 439 ~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~-~~~dee~~~El~rI~~E~~~~-------~~~~~-~ 509 (533) ....+.|.|.+.+|.|..+.++.+.+|+++|++|+++|+++|-- .+ .+.+++|++||.++.+.+ ..+++ + T Consensus 428 ~~~~v~ivf~p~lP~D~~avie~v~tL~~aGiiS~etAv~~L~~~~g-~eD~E~E~~~I~~era~~a~a~a~a~~~~~a~ 506 (527) T protein:vir:10 428 KKLTVTITFRDPKPVNNEKRFAQLLELWEAGLIPAKKLTEELSKIMG-FELTEEDFRQATEDKKTQGIAQAEAADPFGAQ 506 (527) T ss_pred cccceEEEecccCCCCHHHHHHHHHHHHHcCchhHHHHHHHHHhccC-CCchHHHHHHHHHHHHHHhHHhhhhcCchhhh Confidence 44578999999999999999999999999999999999998710 01 223667788887775421 12222 1 Q ss_pred ccccC-CCCCCCCCCCCCCCC Q lcl|NC_016654. 510 GTDQP-PLPTENDPATDPEAV 529 (533) Q Consensus 510 ~~~~~-~~~~~~~~~~~~~~~ 529 (533) .++.. -..++.|+.+.+... T Consensus 507 ~~~~~g~~~~~~d~~~~~~~~ 527 (527) T protein:vir:10 507 MAAEQGIPDEEDDQALNGQPL 527 (527) T ss_pred hccccCCCCCCcccccCCCCC Confidence 12222 222223333433333 No 72 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=100.00 E-value=8.5e-36 Score=212.86 Aligned_cols=399 Identities=10% Similarity=-0.021 Sum_probs=247.7 Q ss_pred cchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCC-CcccceeecChHHHHHHHHHH Q lcl|NC_016654. 13 PELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTAT-GRAPKRYHAPIPGVIAKLSTT 91 (533) Q Consensus 13 ~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-g~~~~~~~~n~~k~i~~~~a~ 91 (533) =+.+-+..-...|..-.. -..++.+||.+.+..+- -+...+. -..+.++++|+|++||+.+|+ T Consensus 1 ~~~~~i~~L~~~~~~~~~-r~~~~~~yY~g~~~~~~---------------~~~~~p~~~~~~~~~v~nw~~~iVds~a~ 64 (409) T protein:vir:94 1 MTEKGIGYLRFKLSVHKR-RAEMRYDQYAMKYVDRF---------------KGITIPQALSQQYRSILGWCAKGVDSLAD 64 (409) T ss_pred CCHHHHHHHHHHHHHHhH-HHHHHHHHhcccCchhh---------------cChhhhHHHHHHHhhhcchhHHHHHHhHh Confidence 121111111111111110 11233444444332110 0000011 112446788999999999999 Q ss_pred hhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEec-- Q lcl|NC_016654. 92 ELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRW-- 169 (533) Q Consensus 92 ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~-- 169 (533) .+.-+. |+. ++ +.++++++.|+|.....+++..|+++|.+|+.++-+++ ++++|.+++|.+++.+|++ T Consensus 65 rl~~~G--f~~--~d----~~l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~~d--g~~~i~~~sp~~~~~i~D~~~ 134 (409) T protein:vir:94 65 RLVFRE--FEN--DD----FTVNEIFEENNPDIFFDSAVLSSLIASCSFTYISKGEN--DAVRLQVIEAVNATGIIDPIT 134 (409) T ss_pred hcccCc--ccC--Cc----hHHHHHHHhcChhHHHHHHHHHHHHhcceeEEEecCCC--CceEEEEeccceEEEEEecCC Confidence 775443 544 22 24788999999999999999999999999999988765 4689999999999999986 Q ss_pred CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccce Q lcl|NC_016654. 170 GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLT 249 (533) Q Consensus 170 g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 249 (533) +++... ++.+ ..+.....+....|.++.+.+ +++.++. ...++++...++ T Consensus 135 ~~~~~a--~~~~-~~d~~~~~~~~~~~~~~~~~~-~~~~~~~--------------------------~~~~~n~~g~vP 184 (409) T protein:vir:94 135 GLLTEG--YAVL-ERDENNNVVLEAHFLPDRTDY-YYRDSRN--------------------------NISIANPTGHPL 184 (409) T ss_pred Cceeee--EEEE-EecCCCceEEEEEEecCcEEE-EEecCce--------------------------eEeeeCCCCCcc Confidence 344333 2222 223223334455566665542 2221110 011234444333 Q ss_pred eEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCc-ceeeechHHhcCCCCcccc-ccCcch Q lcl|NC_016654. 250 AAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGA-GKVHASESVLTNLGMGQGV-SLDEEQ 327 (533) Q Consensus 250 ~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~-~~i~v~~~~l~~~~~~~~~-~~d~~~ 327 (533) ++..+|. .+. ..++|+|.+...+.+++|++|+++++.....+... +..++ +.-+.++... .|+.. T Consensus 185 vV~f~n~---~~~-----~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i----~G~d~d~~~~~~~~~~- 251 (409) T protein:vir:94 185 LVPIIHR---PDA-----VRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYV----TGLSDDAEPMETWKAT- 251 (409) T ss_pred eEEeccc---ccc-----ccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhhee----EecCCCCcccchhhhh- Confidence 4333332 222 25789999977788999999999999998876432 21222 1111111111 11111 Q ss_pred hhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHH Q lcl|NC_016654. 328 EVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTR 407 (533) Q Consensus 328 ~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~ 407 (533) ...+...+.+..+ ....+.+++ ....+.|.+.++.++++++..+++|++.||..+.++.||.||++.+..+..++. T Consensus 252 --~~~i~~~~~d~dg-~~~~v~q~~-~~~l~~~~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~ 327 (409) T protein:vir:94 252 --VSSMLQFTKDEDG-DKPTLGQFT-QPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGR 327 (409) T ss_pred --HHHhhcCCCCCCC-CCceEEecC-CCChhHHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHH Confidence 1112111222222 222333332 223468999999999999999999999999888888999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCC---HHHHHHHHHHHHhCC--CCCHHHHHHHhCC Q lcl|NC_016654. 408 AKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARES---DLAKAQTVQAWSVAS--AASTKTKVAYLHE 482 (533) Q Consensus 408 ~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d---~~e~a~~~~~l~~aG--i~S~et~v~~l~~ 482 (533) +|++.|+.+|++++++++.+.... .........+.+.|.+..+.+ ..+.|+.++||+++| +++.++...++ T Consensus 328 ~k~~~fg~~~~~~~rla~~i~~~~--~~~~~~~~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~l-- 403 (409) T protein:vir:94 328 KAQRSLGAGLLNVAYLAACLRDDA--PYLREQFRKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLT-- 403 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCC--CccccccccceEEeccCCCcchHHHHHHHHHHHHHHHhcccccchhHHHHHc-- Confidence 999999999999999988774321 111233467899999665555 567889999999999 67788877766 Q ss_pred CCCHHH Q lcl|NC_016654. 483 DWDDER 488 (533) Q Consensus 483 ~~~dee 488 (533) ++++.+ T Consensus 404 G~~~~d 409 (409) T protein:vir:94 404 GIEGGE 409 (409) T ss_pred CCCCCC Confidence 466654 No 73 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=100.00 E-value=8.2e-34 Score=201.97 Aligned_cols=400 Identities=10% Similarity=-0.014 Sum_probs=243.5 Q ss_pred cchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHh Q lcl|NC_016654. 13 PELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTE 92 (533) Q Consensus 13 ~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~l 92 (533) =+.+-+..-...+..-. .-..++.+||.+.+..+... ...|.. -..+.++++|+|++||+.+|+. T Consensus 1 ~~~~~i~~L~~~~~~~~-~r~~~~~~yY~g~~~~~~~~-------------~~~p~~-~~~~~~~v~nw~~~iVds~a~r 65 (409) T protein:vir:16 1 MTEKGIGYLRFKLSVHK-RRAEMRYEQYAMKHVDRFKG-------------ITIPQA-LSQQYRSILGWCAKGVDSLADR 65 (409) T ss_pred CCHHHHHHHHHHHHHHh-HHHHHHHHHHhccCchhhcc-------------hhhhHH-HHHHHhhhcChhHHHHHHhHhh Confidence 11111111111111100 01123444554433221000 000100 1123467889999999999997 Q ss_pred hcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEec--C Q lcl|NC_016654. 93 LFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRW--G 170 (533) Q Consensus 93 l~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~--g 170 (533) +.-+. |+. ++ +.++++++.|+|.....+++..|+++|.+|+.++-+++ ++++|.+++|.+++.+|++ + T Consensus 66 l~~~G--f~~--~d----~~l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~~d--g~~~i~~~sP~~~~~i~D~~~~ 135 (409) T protein:vir:16 66 LVFRE--FEN--DD----FTVNEIFEENNPDIFFDSTVLSALIASCSFTYISKGEN--DAVRLQVIEATNATGIIDPITG 135 (409) T ss_pred ccccc--ccC--cc----hHHHHHHHhcChhHHHHHHHHHHHHhCceeEEEecCCC--CceEEEEEcccceEEEeecccc Confidence 75443 544 22 24788999999999999999999999999999987764 4689999999999999986 3 Q ss_pred CceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCcccee Q lcl|NC_016654. 171 RLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTA 250 (533) Q Consensus 171 ~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 250 (533) ++... +..+...... .-+....|.++.+.+ .++.. .. ...++++...+++ T Consensus 136 ~~~~a--~~~~~~d~~~-~~~~~~~~~~~~~~~-~~~~~--~~------------------------~~~~~~~~g~vPv 185 (409) T protein:vir:16 136 LLTEG--YAVLERDENN-NVVLEAHFLPDRTDY-YYRDS--RN------------------------NISIANPTGNPLL 185 (409) T ss_pred cceee--eEEEEecCCC-ceEEEEEEecCcEEE-EEecC--cc------------------------ccceecCCCCcce Confidence 44433 2222222221 223333444544322 22211 10 0112344443333 Q ss_pred EEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhC-cceeeechHHhcCCCCcccc-ccCcchh Q lcl|NC_016654. 251 AYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIG-AGKVHASESVLTNLGMGQGV-SLDEEQE 328 (533) Q Consensus 251 ~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~-~~~i~v~~~~l~~~~~~~~~-~~d~~~~ 328 (533) +.++|. .+. ..++|+|.+...+.+++|++|+++++.....+.. -+..++ +.-+.++... .|+.. T Consensus 186 V~f~n~---~~~-----~~~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i----~G~d~d~~~~~~~~~~-- 251 (409) T protein:vir:16 186 VPIIHR---PDA-----VRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYV----TGLSDDAEPMETWKAT-- 251 (409) T ss_pred EEeccc---ccc-----cccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhhee----EecCCCCCccchhhhh-- Confidence 333332 222 2568999997778899999999999998887642 222222 1111111111 11111 Q ss_pred hhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHH Q lcl|NC_016654. 329 VYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRA 408 (533) Q Consensus 329 ~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~ 408 (533) ...+...+.+..+ ....+.+++ ....+.|.+.++.++++++..+++|++.||..+.++.||.||++.+..|..++.+ T Consensus 252 -~~~i~~~~~d~~g-~~~~v~q~~-~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~ 328 (409) T protein:vir:16 252 -VSSMLQFTKDEDG-DKPTLGQFT-QPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRK 328 (409) T ss_pred -hhHhhccCCCCCC-CCceEEecC-CCChhHHHHHHHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHH Confidence 0112111222222 223343443 2345689999999999999999999999998888889999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCC---CHHHHHHHHHHHHhCCC--CCHHHHHHHhCCC Q lcl|NC_016654. 409 KARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARE---SDLAKAQTVQAWSVASA--ASTKTKVAYLHED 483 (533) Q Consensus 409 ~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~---d~~e~a~~~~~l~~aGi--~S~et~v~~l~~~ 483 (533) |++.|+.+|++++++++.+.... +........+.+.|.+..++ +..+.++.++||+++|. +..++...++ + T Consensus 329 k~~~fg~~l~~~~rla~~~~~~~--~~~~~~~~~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~--g 404 (409) T protein:vir:16 329 AQRSLGAGLLNVAYLAACLRDDV--PYLREQFSKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLT--G 404 (409) T ss_pred HHHHHHHHHHHHHHHHHHHhcCC--CccchhhccceEEecCCCCcchhhHHHHHHHHHHHHhhcccccchhHHHHhc--c Confidence 99999999999999988874321 11122336789999977644 47899999999999984 4456666655 4 Q ss_pred CCHHH Q lcl|NC_016654. 484 WDDER 488 (533) Q Consensus 484 ~~dee 488 (533) +++.+ T Consensus 405 ~~~~d 409 (409) T protein:vir:16 405 IKGAE 409 (409) T ss_pred CCCCC Confidence 66654 No 74 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=99.82 E-value=6.1e-18 Score=114.96 Aligned_cols=459 Identities=13% Similarity=0.065 Sum_probs=231.2 Q ss_pred CCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHH---HHHhccCc--chhhHHHHHHHHHHHHHhcccCCCCCccccee Q lcl|NC_016654. 3 LPEANTAWPPPELAAVTARVAESHVWWEGDLDKLA---TFYGAEGR--TSPSGIKARTKAAYEAFHGRTPTATGRAPKRY 77 (533) Q Consensus 3 ~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~---~~y~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 77 (533) || +-..|=|+|.....+.+..|..|.|... +. ..|..... .....-+..|..+. .+-. T Consensus 1 m~--~V~~~hp~y~~~~~~W~~ird~~~G~~~-~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl--------------~rA~ 63 (501) T protein:vir:95 1 MP--NVSFIRPELGKLLPLYYLIRDAIAGEPT-VKGARTTYLPMPNAEDQSKENKARYEAYL--------------KRAV 63 (501) T ss_pred CC--CCCCCCHHHHHHHHHHHHHHHHhcChHH-HHhcccccCcCCCCCCCcccchHHHHHHh--------------hccc Confidence 55 3567888999999999999999999653 21 11221110 00000111122211 1134 Q ss_pred ecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHH-HhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCC----- Q lcl|NC_016654. 78 HAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLI-FNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIAD----- 151 (533) Q Consensus 78 ~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i-~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~----- 151 (533) -.|+++.+++.++.++|.++|++++. ..++.+++++ ..-++++..+..++..++.+|.+++.|=+....+. T Consensus 64 ~~n~~~~t~~~l~G~vf~k~p~~~~p---~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~ 140 (501) T protein:vir:95 64 FYNVARRTLFGLVGQVFMRDPVVKVP---ALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASI 140 (501) T ss_pred cCchHHHHHHHHhhhhhcCCcceeCc---HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccH Confidence 67999999999999999999998653 2233444333 23447999999999999999999998765433221 Q ss_pred --------ceEEEEEcCCeEEEE-E--ecC--CceEEEEEEEEeecCC------ceEEEEEEEecCeeEEEEEEeccCCc Q lcl|NC_016654. 152 --------NAWIDFVDADRAIPE-F--RWG--RLVAVTFWSELAGGDG------QEVWRHLERHESGYIVHAVYKGTATS 212 (533) Q Consensus 152 --------~~~i~~v~~~~~~P~-~--~~g--~~~~v~f~~~~~~~~~------~~~y~~lE~h~~~~I~~~~y~~~~~~ 212 (533) +|.|..+.|.+++=. + .+| +++.+++-+.+...++ ...|+.|+.-+.|...+++|+.+... T Consensus 141 a~~~~~~~rPy~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~ 220 (501) T protein:vir:95 141 ADLEAGRIRPTLYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQPT 220 (501) T ss_pred HHHHhccCCcEEEEecHhhhcCcceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEecCCc Confidence 477888888876542 1 123 4777777666554333 23478888777888888899765432 Q ss_pred c--cceeehhhccccccccccccccCCceeec-CCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHH Q lcl|NC_016654. 213 L--GWMMALTDHPATRDIAVEGADEGRGAYVE-TGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDR 289 (533) Q Consensus 213 l--G~~v~l~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~ 289 (533) . |.+++-.....+ . +...+. ++..-.++.|++....+..+. .|.+.|. +|. .++. T Consensus 221 ~~~~~~~~~~~~~~~---~-------~~~~~~~g~~~l~~IPfv~~~~~~~~~~-------~~~pPLl----~lA-~lni 278 (501) T protein:vir:95 221 KADGSKIPKGNYQQY---V-------VYKPTDAQGKRLTEIPFMFIGSENNDSN-------PDNPNFY----DLA-SLNM 278 (501) T ss_pred ccCcceecCCccccc---c-------eeeeeccCCCcCCeeeEEEEecCCCCCC-------CCccchH----HHH-HHHH Confidence 1 111111111000 0 000011 111111222222112121111 1222221 222 2222 Q ss_pred HH----HHHHHHHH-hCcceeeech--HHhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHH Q lcl|NC_016654. 290 IY----SSLMRDFR-IGAGKVHASE--SVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQ 362 (533) Q Consensus 290 ~~----s~~~~~~~-~~~~~i~v~~--~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~ 362 (533) .+ |++.+.+. .+.+..++.- ..-.......+..+-... ++ ....+++ ..+-.++++.-. .+ T Consensus 279 ~hy~~ssd~~~~l~~~~~P~l~i~G~~~~~~~~~~~~~i~~G~~~----~~---~lP~~~~--~~~ie~~~~~i~---~~ 346 (501) T protein:vir:95 279 AHYRNSADYEESCYIVGQPTPVLIGLTEEWVTNVLKGSVNFGSRG----GI---PLPVGAD--AKLLQASENTML---KE 346 (501) T ss_pred HHHhhhhHHHHHHHHcccceeeeeCCcccccccCCCCceeecccc----cc---cCCCCCc--eeEEecChhhHH---HH Confidence 21 22233333 2333333310 000000011111111110 11 1112222 222233432211 23 Q ss_pred HHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCcee Q lcl|NC_016654. 363 GAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEE 442 (533) Q Consensus 363 ~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~ 442 (533) .|+.+.+++.. .| ...+. ..++.+||++.+...+...+....+...++.+|.++++.+.... |.. ..... T Consensus 347 ~l~~l~~~m~~-~G--a~ll~-~~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~w~-----g~~-~~~~~ 416 (501) T protein:vir:95 347 AMDTKERQMVA-LG--AKLVE-QKEVQRTATEAELEAASEGSTLSSATKNVSAAFEWALKWAARWV-----GQA-DSGVK 416 (501) T ss_pred HHHHHHHHHHH-HH--Hhhcc-CCccchhHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHc-----CCC-CCceE Confidence 34444444332 12 12222 33456899999999988888888899999999999888665431 211 22223 Q ss_pred EEEEeCCCCCCC-HHHHHHHHHHHHhCCCCCHHHHHHHh-CCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCC Q lcl|NC_016654. 443 LELEWPKFARES-DLAKAQTVQAWSVASAASTKTKVAYL-HEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTEN 520 (533) Q Consensus 443 v~i~f~d~i~~d-~~e~a~~~~~l~~aGi~S~et~v~~l-~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~ 520 (533) +++. .|-.... ..+.++.+.+++.+|.+|.+|+++.+ -.++.+.+.++|.++|..|....+.... +...... T Consensus 417 v~i~-~df~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~~v~~~~~~~e~e~i~~~~~~~~~~~~-----~~~~~~~ 490 (501) T protein:vir:95 417 FELN-TDFDIARMTPDERRSLVEEWQKGAITFEEMRTGLRKAGVATEDDSKAKEKIAKDTAEAMALAT-----PANVPGD 490 (501) T ss_pred EEEe-cccccccCCHHHHHHHHHHHhCCCCcHHHHHHHHHhCCCCChhHHHHHHHHHhhhcCcccccc-----cCCCCCC Confidence 4432 2222222 45667888899999999999996554 1234444456677777776543222111 1111112 Q ss_pred CCCCCCCCCCCCC Q lcl|NC_016654. 521 DPATDPEAVDEGE 533 (533) Q Consensus 521 ~~~~~~~~~~d~~ 533 (533) .++|++ ..++| T Consensus 491 ~~gg~~--~~~~~ 501 (501) T protein:vir:95 491 GSGGDN--VGNSE 501 (501) T ss_pred Cccccc--ccCCC Confidence 233332 33334 No 75 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=99.79 E-value=9.6e-17 Score=108.39 Aligned_cols=458 Identities=15% Similarity=0.049 Sum_probs=224.2 Q ss_pred CCCCCCcC-CCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchh---hHHHHHHHHHHHHHhcccCCCCCcccce Q lcl|NC_016654. 1 MSLPEANT-AWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSP---SGIKARTKAAYEAFHGRTPTATGRAPKR 76 (533) Q Consensus 1 ~~~~~~~~-~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 76 (533) |+==++.+ ..|=|+|.....+.+.-|.-|.|... +. .+...|-+ ..-+..|..+. .+- T Consensus 1 m~~~~~~~v~~~h~~y~a~~~~W~~ird~~~G~~~-~r---~~g~~YLPk~~~E~~~~Y~~rl--------------~rA 62 (513) T protein:vir:97 1 MADKDPKSPATTSGAYDQMLPRWHVIETLLGGTEA-MR---EAGETYLPRHQEETDKGYQERL--------------ASA 62 (513) T ss_pred CCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHH-HH---hhcccCCCCCCCCCHHHHHHHH--------------hcc Confidence 33324344 56667888888888888889999632 21 11111111 11111122111 123 Q ss_pred eecChHHHHHHHHHHhhcCCCceEeeCCCchHHHH-HHHHH-HhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCC---- Q lcl|NC_016654. 77 YHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQA-RADLI-FNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIA---- 150 (533) Q Consensus 77 ~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~-~l~~i-~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~---- 150 (533) .-.|+++.+++.++.++|.++|+++.... ....+ +++++ ..-++++..++.++..++.+|.+++.|=+...+. T Consensus 63 ~~~n~~~~tl~~l~G~vf~k~p~~~~~~p-~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~ 141 (513) T protein:vir:97 63 VLLNMVEQTLDTLSGKPFSEPIKLNEDVP-KAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDG 141 (513) T ss_pred cCCChHHHHHHHHhhhhhhcCcccCcCch-HHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccch Confidence 46799999999999999999988854222 22222 22332 2445799999999999999999998875543321 Q ss_pred ------------CceEEEEEcCCeEEEE-E--ecC--CceEEEEEEEEeecCC--ceE---EEEEEEecCeeEEEEEEec Q lcl|NC_016654. 151 ------------DNAWIDFVDADRAIPE-F--RWG--RLVAVTFWSELAGGDG--QEV---WRHLERHESGYIVHAVYKG 208 (533) Q Consensus 151 ------------~~~~i~~v~~~~~~P~-~--~~g--~~~~v~f~~~~~~~~~--~~~---y~~lE~h~~~~I~~~~y~~ 208 (533) -+|-|..+.|++++=. + .+| .++.+++-+.+...|+ ... |++| ++|. +++|+- T Consensus 142 ~~~T~Ade~~~~~rPy~~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~Dgf~~~~~~q~rvL---~~g~--~~v~r~ 216 (513) T protein:vir:97 142 QPRTLADDRREGLRPYWVMIKPECLLFARSEVINGVEVLQHVRIIEHYMEQDGFAEVCKRRIRVL---EPGL--VQLWEP 216 (513) T ss_pred hHHhHHHHHhhccCceEEEecHhhhcCcceeccCcceeeeeEEEEEEEeecCCCcceEEEEEEEE---eCce--EEEEEe Confidence 1477888888876542 1 234 4677776655544442 222 3333 3442 345543 Q ss_pred cCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHH Q lcl|NC_016654. 209 TATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELD 288 (533) Q Consensus 209 ~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD 288 (533) ....-+ . ..+|. ..+.++ .+....||+.+.. . +..+ ..|.+-|.+ |-.+=.+.= T Consensus 217 ~~~~~~-~-----~~e~~-----~~~~g~----~~l~~IP~v~~~~-~--~~~~-------~~~~pPLl~-LA~ln~~hy 270 (513) T protein:vir:97 217 VKKSNA-Q-----KEEWA-----LADEWA----TGLNYVPLVTFYA-D--RQGF-------MMGKPPLLD-LAHLNVAHW 270 (513) T ss_pred ecCCCc-c-----ccceE-----EecCCC----CcCCceeEEEEec-C--CCCC-------CCCccchHH-HHHHHHHHH Confidence 211100 0 00110 111110 0112233333322 1 1111 113333321 111111221 Q ss_pred HHHHHHHHHHHh-CcceeeechHHhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhh-hHHHHHHHHH Q lcl|NC_016654. 289 RIYSSLMRDFRI-GAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIR-VLEHDQGAAL 366 (533) Q Consensus 289 ~~~s~~~~~~~~-~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir-~e~~~~~l~~ 366 (533) ...|.+-..+.. +.+..++. ...... ...+...-+......+.+++ ..+++++.. .+.+...|+. T Consensus 271 ~~~Sd~~~il~~~~~P~l~~~-----G~~~~~------~~~i~iG~~~~~~lpe~~~~--~~yie~~g~~i~~~~~~l~~ 337 (513) T protein:vir:97 271 QSASDQRHILTVSRFPILACS-----GASGED------SDPVVVGPNKVLYNPDPAGR--FYYVEHTGQAIAAGRTDLKD 337 (513) T ss_pred hhhhhHHHHHHhcccceeeee-----cCCcCC------CCceEeeccccccCCCCCCc--ceeeccCchhHHHHHHHHHH Confidence 223333344433 33333331 111110 00111000000001111112 344555432 2344555555 Q ss_pred HHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEE-- Q lcl|NC_016654. 367 LLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELE-- 444 (533) Q Consensus 367 ~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~-- 444 (533) +-+++. ..|. ..+. ..++.+||++.+...+...+....+...++.+|.++++.+.... |.. .....++ T Consensus 338 le~qm~-~~Ga--~ll~-~~~~~~Ta~a~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~wl-----g~~-~~~~~v~in 407 (513) T protein:vir:97 338 LEEQMA-GYGA--EFLK-RKTGGQTATARALDSAEATSDLSAMTGLFEDALAQALDITADWL-----RLG-PNGGTVELV 407 (513) T ss_pred HHHHHH-HHHH--Hhhc-cCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCC-CCccEEEec Confidence 555552 2222 2222 34567999999999999999999999999999999888765431 222 1122233 Q ss_pred EEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhC------CCCCH-HHHHHHHHHHHHhhhcccCccccccccCCCC Q lcl|NC_016654. 445 LEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLH------EDWDD-ERVQEEADLIDNANTVSAPTFGFGTDQPPLP 517 (533) Q Consensus 445 i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~------~~~~d-ee~~~El~rI~~E~~~~~~~~~~~~~~~~~~ 517 (533) -+|..... ..+.++.+.+++.+|.+|.+|+++.+- |+.++ ++.+++.+||++..+....+.....+.|+.. T Consensus 408 ~dF~~~~~--~~~~~~al~~a~~~G~is~~t~~~~L~r~gvl~~d~d~~~~~e~~~~~~~~~~~~~~~d~~~~~~~~~~~ 485 (513) T protein:vir:97 408 KDYDLEEM--DAPGLQALQVAREKRDISRKTYLNGLRLRGVLPEDFDEDEDWEELMEEISEAMGRAGLDLDPAQKNPPEG 485 (513) T ss_pred cccCcccC--CHHHHHHHHHHHhCCCCCHHHHHHHHHhccCCCccCCHHHHHHHHHHhhhhccCCCCccccccCCCCCCC Confidence 33432222 345677788899999999999876652 33343 3345566666555433222222223333333 Q ss_pred CCCCCCCCCCCCCCCC Q lcl|NC_016654. 518 TENDPATDPEAVDEGE 533 (533) Q Consensus 518 ~~~~~~~~~~~~~d~~ 533 (533) .+.+..+.++.-+++| T Consensus 486 ~~~~~~~~~~~~~~~~ 501 (513) T protein:vir:97 486 GEGEGEGEGEGGEGGE 501 (513) T ss_pred CCCCCCCCCCCCCCCC Confidence 3333333334444444 No 76 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=99.76 E-value=2.8e-16 Score=105.88 Aligned_cols=436 Identities=11% Similarity=0.041 Sum_probs=215.1 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchh---hHHHHHHHHHHHHHhcccCCCCCccccee Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSP---SGIKARTKAAYEAFHGRTPTATGRAPKRY 77 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 77 (533) || -..|=|+|.....+.+.-|.-|.|... + ..+...|-+ ..-+..|..+.+ +-. T Consensus 1 m~-----V~~~hp~y~a~~~~W~~~rd~~~G~~~-~---r~~g~~YLpk~~~E~~~~Y~~rl~--------------rA~ 57 (452) T protein:vir:94 1 MP-----IETKHPEYLAYENDWIDCRVASLGQRE-V---KKKGVRFLPKLSGQTDDMYNAYKQ--------------RAL 57 (452) T ss_pred CC-----CCCcCHHHHHHHHHHHHHHHHhcChHH-H---HcCCcccCCCCCCCCHHHHHHHHh--------------hcc Confidence 54 356678999999999999999999632 2 122111211 001111211111 123 Q ss_pred ecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEE Q lcl|NC_016654. 78 HAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDF 157 (533) Q Consensus 78 ~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~ 157 (533) -.|+++.+++.++.++|.++|++++.+ .+.....+ ..-++++..+...+..++.+|.+++.|=|+..+ .+|.|.. T Consensus 58 ~~n~~~~t~~~~~G~vf~k~p~~~~p~---~l~~~~~D-~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g-~rPy~~~ 132 (452) T protein:vir:94 58 FYSITSKTLSALSGMVLDQPPVITHPD---AMSKYFED-QSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTG-GDPYISV 132 (452) T ss_pred CCchHHHHHHHHhchhhcCCceecccH---HHHHHHhc-ccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCC-CceEEEE Confidence 469999999999999999999987632 23232223 566789999999999999999999999887654 5899999 Q ss_pred EcCCeEEEEEe---cCCceEEEEEEEEeecC--------CceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccc Q lcl|NC_016654. 158 VDADRAIPEFR---WGRLVAVTFWSELAGGD--------GQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATR 226 (533) Q Consensus 158 v~~~~~~P~~~---~g~~~~v~f~~~~~~~~--------~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~ 226 (533) ++|.+++= |+ .|+++-++|-+.....+ ....|+.|+.- +|..+-.+|+..+++. .+ +. + T Consensus 133 ~~~~~Ii~-W~~~~~g~l~~v~lre~~~~~d~~d~f~~~~~~~yRvL~l~-~g~~~v~~~~~~~~~~-~~--~~---~-- 202 (452) T protein:vir:94 133 YTTENILN-WEEDEDGRLLMVVLREFYTVRDTADRYVQNIRVRYRCLELV-DGLLQITVHETQDGKV-WE--LA---K-- 202 (452) T ss_pred echhhhcC-ccccccCCeeEEEEEEEEEEecCCCcccceeEEEEEEEEEe-CCeEEEEEEEccCCce-ee--ec---c-- Confidence 99998873 43 46777666543322211 12246665532 3433334454332221 00 00 0 Q ss_pred cccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceee Q lcl|NC_016654. 227 DIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVH 306 (533) Q Consensus 227 ~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~ 306 (533) ....+.+ .+....+++.++... +..+ ..|.+.|.+ +-.+--+.-..-|++.+.+......+. T Consensus 203 ---~~~~~~~-----~~~l~~IP~v~~~~~--~~~~-------~~~~pPLl~-LA~ln~~hy~~~sd~~~~l~~~~~P~l 264 (452) T protein:vir:94 203 ---TSTIQNV-----GVTMDYIPFFCITPS--GLSM-------TPAKPPMID-IVDINYSHYRTSADLEHGRHFTGLPTP 264 (452) T ss_pred ---ceeecCC-----CcccceeEEEEEcCC--CCCC-------CCCccchHH-HHHHHHHHhcchhHHHHHHHHccccee Confidence 0000000 011222222233222 2121 123333332 222322333344445555544433333 Q ss_pred echHHhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhh-hHHHHHHHHHHHHHHHHhhCCChhhcccC Q lcl|NC_016654. 307 ASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIR-VLEHDQGAALLLREVLRKTGYSPVSLGLS 385 (533) Q Consensus 307 v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir-~e~~~~~l~~~l~~i~~~~g~s~~~~g~~ 385 (533) + +.......+...-.. ..+. ..+.++ + ..+++++.. .+.+.+.|+.+-+++.. ++...+... T Consensus 265 ~----~~g~~~~~~i~iG~~----~~~~--lpe~~~--~--~~yie~~g~~i~~~~~~l~~le~~m~~---~Ga~ll~~~ 327 (452) T protein:vir:94 265 W----ITGAESQSTMHIGST----KAWV--IPEVAA--K--VGFLEFTGQGLQSLEKALSEKQAQLAS---LSARLIDNS 327 (452) T ss_pred E----eecCcCCCceEeccc----cccc--CCCCCC--c--ceEEccCchhHHHHHHHHHHHHHHHHH---HHHHhhccC Confidence 3 221111111111000 0000 011121 1 334444432 34555556555554432 112223323 Q ss_pred CCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHH Q lcl|NC_016654. 386 DEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAW 465 (533) Q Consensus 386 ~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l 465 (533) +.+..|+++.....+...+....+...++.+|.++++.+... .|.. ....+++.-+-....-..+.++.+.++ T Consensus 328 ~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al~~~l~~~a~w-----~g~~--~~~~v~~n~dF~~~~~~~~~~~al~~~ 400 (452) T protein:vir:94 328 TRGSEATETVKLRYMSETASLKSVTRAVEALLNKAYSCIMDM-----ESMG--GTLNIKLNSAFLDSKLTAAELKAWVEA 400 (452) T ss_pred CCcchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH-----cCCC--CceEEEeccccccccCCHHHHHHHHHH Confidence 334456666555554444555566666777777777755442 1222 233344332222223234677778889 Q ss_pred HhCCCCCHHHHHHHhCC-CCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCC Q lcl|NC_016654. 466 SVASAASTKTKVAYLHE-DWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEA 528 (533) Q Consensus 466 ~~aGi~S~et~v~~l~~-~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 528 (533) +.+|.+|.+|+++.+-- ++-+ +++|.++|..|..+..|. .+ +.|+++... . T Consensus 401 ~~~G~is~~t~~~~L~~~gvl~--~~~e~~~i~~E~~~~~~~--~~-~~~~~~~~~-------~ 452 (452) T protein:vir:94 401 YLSGGISKEIYIHALKVGKVLP--PPGESMGVIPDPPAPEPS--PS-NTPPNPSSK-------A 452 (452) T ss_pred HhcCCCcHHHHHHHHHhCCCCC--CccCHHHHHHHhhccCcc--cC-CCCCCCccC-------C Confidence 99999999998776511 2322 234455666665432221 11 122221110 0 No 77 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=99.74 E-value=9.1e-16 Score=103.05 Aligned_cols=457 Identities=10% Similarity=0.003 Sum_probs=219.1 Q ss_pred CCCC--CC---cCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhh----H----HHHHHHHHHHHHhcccC Q lcl|NC_016654. 1 MSLP--EA---NTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPS----G----IKARTKAAYEAFHGRTP 67 (533) Q Consensus 1 ~~~~--~~---~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~----~----~~~~~~~~~~~~~~~~~ 67 (533) =|-| -+ +-..+=|+|.....+.+..+..|.|... +. .+...|-|. . -+..|..+.+ T Consensus 23 ~~~~~~~~~m~dV~~~hp~y~a~~~~W~~ird~~~G~~~-~r---~~g~~YLP~~~~~~~~~E~~~~Y~~rl~------- 91 (535) T protein:vir:80 23 PPTSGLGPSLPNVGYQRVEFGEMLPKWRKIMDCLSGQEA-IK---AKREEYLPMPSVDSRDEEQRRRYETYLQ------- 91 (535) T ss_pred cCCCCCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHH-HH---hcccccCCCCCcccCCcCCHHHHHHHHh------- Confidence 1111 11 2335667788888888888889999643 21 111111111 0 0111222211 Q ss_pred CCCCcccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHH-HhhccHHHHHHHHHHHHhhhCCEEEEEEEc Q lcl|NC_016654. 68 TATGRAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLI-FNTPRFHSSLVEAGESCSALSGSFQRIVWD 146 (533) Q Consensus 68 ~~~g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i-~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D 146 (533) +-.-.|+++.+++.++.++|.++|++++. +.++.+++++ ..-++++..+..++..++.+|.+++.|=+. T Consensus 92 -------rA~~~n~~~~tl~~l~G~vfrk~p~~~~p---~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P 161 (535) T protein:vir:80 92 -------RAIFYNVTARTLDGMMGQVFSRDPIRQLP---PALEAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTDYP 161 (535) T ss_pred -------hccCCChhHHHHHHHhchhhcCCcceecc---HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeec Confidence 13467999999999999999999988663 2234444333 334479999999999999999999988665 Q ss_pred CCCC-----------CceEEEEEcCCeEEEEEe----cC--CceEEEEEEEEeecCC------ceEEEEEEEecCeeEEE Q lcl|NC_016654. 147 PTIA-----------DNAWIDFVDADRAIPEFR----WG--RLVAVTFWSELAGGDG------QEVWRHLERHESGYIVH 203 (533) Q Consensus 147 ~~~~-----------~~~~i~~v~~~~~~P~~~----~g--~~~~v~f~~~~~~~~~------~~~y~~lE~h~~~~I~~ 203 (533) ..++ -+|-|..+.|++++= |+ +| +++.+++-+.+...+. ...|+.|+....|..+- T Consensus 162 ~~~~~~t~ade~~~~~rPy~~~y~ae~Iin-W~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~RvL~~~~~G~y~v 240 (535) T protein:vir:80 162 NVGRPVTVLEQKLGLYRPTITLVHPTSIIN-WRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQWRVLQLNAEGNYQV 240 (535) T ss_pred CCCCcccHHHHHhcCCCcEEEEechhhccC-ccccccCCccceeEEEEEEEEEecCCCcccceeEEEEEEEecCCceEEE Confidence 5432 257888888887664 33 33 5777777666554332 23577787764443333 Q ss_pred EEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHH Q lcl|NC_016654. 204 AVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPT 283 (533) Q Consensus 204 ~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~l 283 (533) .+|+....+-+. ....+ +. ..+.+ ..+....||++ +. ..+..+ ..|.+-|. +| T Consensus 241 ~~~~~~~~~~~~-~~~~~---~~-----~~~~g----~~~l~~IPfv~-~~--~~~~~~-------~~~~pPLl----~L 293 (535) T protein:vir:80 241 ERWRRETQEEMY-YSYSK---HV-----PTDGN----GNPFKEIPFQF-IG--PLDNNA-------DIDHPPLL----DL 293 (535) T ss_pred EEEEeecCCccc-cccce---ee-----cccCC----CcccCeeEEEE-ee--cCCCCC-------CCCccchH----HH Confidence 456533221000 00000 00 00000 00111122332 21 111111 12222222 22 Q ss_pred HHHHHHHH----HHHHHHHHh-Ccceeeec--hHHh-cCCCCccccccCcchhhhhhccccccccccccccceeeechhh Q lcl|NC_016654. 284 FHELDRIY----SSLMRDFRI-GAGKVHAS--ESVL-TNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAI 355 (533) Q Consensus 284 id~lD~~~----s~~~~~~~~-~~~~i~v~--~~~l-~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i 355 (533) . .++..+ |.+.+.+.. +.+..++. +... +....+.+...-.. .++ ....+++ ..+-++++.- T Consensus 294 A-~lni~Hy~~ssd~~~il~~~~~P~l~i~G~~~~~~~~~~~~~~i~iG~~----~~~---~lP~~~~--~~~~e~~~~~ 363 (535) T protein:vir:80 294 C-EVNIGHYRNSADYEEMAFVAGQPTAFFTGLTKDWVEDVFKDFKVHLGSR----AII---PLPQGAT--AGILQITPNS 363 (535) T ss_pred H-HHHHHHhhchhHHHHHHHHhcCceeeeecCchhhhhcCCCCcceEecCc----ccc---cCCCCCC--cceeeeccch Confidence 2 222222 223333433 34433332 1110 00000111111000 111 1111222 2233444432 Q ss_pred hhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCC Q lcl|NC_016654. 356 RVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGK 435 (533) Q Consensus 356 r~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~ 435 (533) -.-+.++.++..+..+ +...+. ...+..||++.+...+...+....+...++.+|.++++.+... .|. T Consensus 364 ~a~~~l~~~e~qM~~l------Ga~ll~-~~~~~~Ta~~a~~~~~~~~S~L~~~a~~le~al~~aL~~~A~w-----~G~ 431 (535) T protein:vir:80 364 VPFEAMTHKESQMIAM------GANLLV-KSGGNRTFGEAQQEEASEQSILSACTKNVSMAFRKALRWANQF-----QTG 431 (535) T ss_pred hHHHHHHHHHHHHHHH------HHHhhc-cCcccccHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHH-----cCC Confidence 1122333333333322 222232 2345678888777777777777888888888888888765432 222 Q ss_pred CCC-CceeEE--EEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCC-CC-C-HHHHHHHHHHHHHhhhcccCcccc Q lcl|NC_016654. 436 GAA-PSEELE--LEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHE-DW-D-DERVQEEADLIDNANTVSAPTFGF 509 (533) Q Consensus 436 ~~~-~~~~v~--i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~-~~-~-dee~~~El~rI~~E~~~~~~~~~~ 509 (533) ... ....++ -+|... .-..+.++.+.+++.+|.||.+|++..+-- ++ + +.+.++|..||+.|........+. T Consensus 432 ~~~~~~~~i~~n~dF~~~--~ld~~~~~all~~~~~G~Is~et~~~~L~r~gvl~~~~~~eee~~ri~~E~~~~~~~~g~ 509 (535) T protein:vir:80 432 IVNDETVEYNLNTDFPAA--RLTPNERAELILEWQQGAITFKEMRAGLRRAGVASEDDAKAETEGKATVEFIAKTAAAGK 509 (535) T ss_pred ccCCCceEEEeccccccc--cCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCcccchHHHHHHHHhhhhhccccCCC Confidence 221 122232 333322 113456777888999999999998766521 22 1 123456778888884332222221 Q ss_pred -----ccccCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 510 -----GTDQPPLPTENDPATDPEAVD 530 (533) Q Consensus 510 -----~~~~~~~~~~~~~~~~~~~~~ 530 (533) ++.++..+-.|..+|.++..+ T Consensus 510 ~~d~~~~g~~~~~~~~~~~~~~~~~~ 535 (535) T protein:vir:80 510 VGDAASGGTNKAKLNNGNGGGNQAGN 535 (535) T ss_pred CCCCCCCCCCcCcccCCccccccCCC Confidence 122222222233333333333 No 78 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=99.68 E-value=1.9e-14 Score=95.82 Aligned_cols=449 Identities=12% Similarity=-0.018 Sum_probs=224.7 Q ss_pred CCCCC----cCCCcCcchHHHHHHHHhhhHhhcCCHHH-HHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccce Q lcl|NC_016654. 2 SLPEA----NTAWPPPELAAVTARVAESHVWWEGDLDK-LATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKR 76 (533) Q Consensus 2 ~~~~~----~~~~pp~~~~~~~~~~~~~~~w~~gd~~~-l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 76 (533) -|.+. +-..|=|+|.....+.+.-|.-|.|+... ..+-|...+...++. ..|..+.+ +- T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e--~~Y~~rl~--------------rA 64 (491) T protein:vir:95 1 MLTANGQGSGVKTKHREWLHYAPKWQKVRHALAGDLVGYLRNVGLNEPDKAYGE--ARQAEYEA--------------GG 64 (491) T ss_pred CcccCCccCCCCccCHHHHHHHHHHHHHHHHhcCcchhhcccCCCcCCCCCCCH--HHHHHHHh--------------cc Confidence 12222 23789999999999999999999996321 112222211111111 11222211 12 Q ss_pred eecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHH-HhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCC------ Q lcl|NC_016654. 77 YHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLI-FNTPRFHSSLVEAGESCSALSGSFQRIVWDPTI------ 149 (533) Q Consensus 77 ~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i-~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~------ 149 (533) .-.|+++.+++.++.++|.++|++++.+ .++.+++++ ..-++++..++..+..++.+|.+++.|=+...+ T Consensus 65 ~~~n~~~~tl~~l~G~vfrk~p~~~~p~---~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~Ad 141 (491) T protein:vir:95 65 IVYNFTRRTLSGMVGSVMRKEPEINIPK---ELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAAATAAE 141 (491) T ss_pred cCCChHHHHHHHHhchhhcCCceeeccH---HHHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcccCHHH Confidence 4569999999999999999999997632 244444433 345678999999999999999999987665442 Q ss_pred ----CCceEEEEEcCCeEEEE-E--ecC--CceEEEEEEEEeecC--------CceEEEEEEEecCeeEEEEEEeccCCc Q lcl|NC_016654. 150 ----ADNAWIDFVDADRAIPE-F--RWG--RLVAVTFWSELAGGD--------GQEVWRHLERHESGYIVHAVYKGTATS 212 (533) Q Consensus 150 ----~~~~~i~~v~~~~~~P~-~--~~g--~~~~v~f~~~~~~~~--------~~~~y~~lE~h~~~~I~~~~y~~~~~~ 212 (533) +-+|-|..+.|++++=. + .+| +++.++|-+.+...+ ....|++|+.-..|..+.++|+...++ T Consensus 142 e~~~~~rPy~~~~~~~~IinW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~~~~~qyRvL~l~~~g~~~~~v~r~~~~g 221 (491) T protein:vir:95 142 QNAGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFETKYGEQYRVLDIDTDGNYRQRLFRFDAEG 221 (491) T ss_pred HHHhcCCcEEEEechhhhcCceeeeeCCceeeeEEEEEEeEEeecCCCCcccceEEEEEEEeecCCCceEEEEEEEcCCC Confidence 22688889999887642 1 233 677787766543211 134578888766666666778654322 Q ss_pred ccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHH Q lcl|NC_016654. 213 LGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYS 292 (533) Q Consensus 213 lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s 292 (533) -.... ....+.+.++ .+....||+.+. ..+..+. .|.+-|. +|. .++..+= T Consensus 222 ~~~~~----------~~~~~~~~g~----~~l~~IPfv~~~---~~~~~~~-------~~~pPLl----~LA-~lni~Hy 272 (491) T protein:vir:95 222 GAQEE----------VVEIYPDLGE----SLRGVIPFTFIG---ATNNDAT-------IDDAPLL----PLA-ELNIGHY 272 (491) T ss_pred cceee----------eeeeeecCCC----cccCeeEEEEEe---cCCCCCC-------CCcCchH----HHH-HHHHHHh Confidence 10000 0000111110 011112333222 2222221 1222221 221 2222221 Q ss_pred H----HHHHHHh-Ccceeeech-HHhcCC----CCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHH Q lcl|NC_016654. 293 S----LMRDFRI-GAGKVHASE-SVLTNL----GMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQ 362 (533) Q Consensus 293 ~----~~~~~~~-~~~~i~v~~-~~l~~~----~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~ 362 (533) + +-+.+.. +.+..++.- +-+... +...+..+...... ..+ .+++ -.+-++++.--..+.++ T Consensus 273 ~~ssd~~~~l~~~~~P~l~~~G~d~~~~~~~~~~~~~~i~~g~~~~~-----~lP--~~~~--~~~ie~~~~~~~~~~l~ 343 (491) T protein:vir:95 273 RNSADNEESSFVVGQPTLFIYPGDNLTPQSFKEANPNGIKFGSRCGH-----NLG--YGGS--AQLIQAGENNLARQNML 343 (491) T ss_pred hhhhHHHHHHHHcccceeeeecCcccCcchhhccCcceeEecCcCCc-----CCC--CCCc--cceeecCcchHHHHHHH Confidence 1 2222322 333333311 000000 00001111111110 001 1111 12222332211222222 Q ss_pred HHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCcee Q lcl|NC_016654. 363 GAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEE 442 (533) Q Consensus 363 ~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~ 442 (533) .+...+.++ +...+. .++..||++.+...+...+....+...++.++.++++.+... .|........ T Consensus 344 ~~e~qm~~~------Ga~l~~--~~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w-----~G~~~~~~v~ 410 (491) T protein:vir:95 344 DKEQQAIQI------GAQLIT--PSQQITAESARIQRGADTSVMATIARNVSQAYTDALRWVAMM-----LGKPEDSEVE 410 (491) T ss_pred HHHHHHHHH------HHHhcc--CCcchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH-----cCCCCCCceE Confidence 222222222 222232 234689999999999888888999999999999988766543 1322222232 Q ss_pred --EEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhC-CCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCC Q lcl|NC_016654. 443 --LELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLH-EDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTE 519 (533) Q Consensus 443 --v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~-~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~ 519 (533) ++.+|..... ..+.++.+.++..+|.+|.++++..|- .++-+...++|.++|++|.....-....+++.|..- T Consensus 411 i~~n~dF~~~~~--~~~~~~all~~~~~G~is~~t~~~~L~~~~vl~~~~e~~~~~ie~~~~~~~~~~~~~~~~~~~~-- 486 (491) T protein:vir:95 411 FQLNMDFFLQPM--TAQDRAAWMADINAGLLPATAYYAALRKAGVTDWTDEDILNAIEDAPLPSGAVTQVAGEIPQAA-- 486 (491) T ss_pred EEeecccccccC--CHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCccHHHHHHHHHhcCCCCCccccccccchhhh-- Confidence 3444543332 244677788899999999999877552 133344466778888766521111111122222111 Q ss_pred CCCCCCCCC Q lcl|NC_016654. 520 NDPATDPEA 528 (533) Q Consensus 520 ~~~~~~~~~ 528 (533) .+.++ T Consensus 487 ----~~~~~ 491 (491) T protein:vir:95 487 ----QQQQE 491 (491) T ss_pred ----hhccC Confidence 11111 No 79 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=99.64 E-value=1.2e-13 Score=91.49 Aligned_cols=445 Identities=13% Similarity=0.010 Sum_probs=218.4 Q ss_pred CCCCC----cCCCcCcchHHHHHHHHhhhHhhcCCHH-HHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccce Q lcl|NC_016654. 2 SLPEA----NTAWPPPELAAVTARVAESHVWWEGDLD-KLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKR 76 (533) Q Consensus 2 ~~~~~----~~~~pp~~~~~~~~~~~~~~~w~~gd~~-~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 76 (533) -|.+. +-..|=|+|.....+.+.-|.-|.|+.. ....-|...+...++ ...|..+.. +- T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~--e~~Y~~rl~--------------rA 64 (489) T protein:vir:78 1 MLTENGQGSGVKTKHREWLHYAPKWQKVRHALAGELVSYLRNVGLNEPDKAYG--EARQAEYEA--------------GG 64 (489) T ss_pred CccCCCccCCCCccCHHHHHHHHHHHHHHHHhcCcccccccCCCCCCCCCCCC--hHHHHHHHh--------------cc Confidence 12222 2378999999999999999999999532 111112111111111 111222211 12 Q ss_pred eecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHH-HhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCC------ Q lcl|NC_016654. 77 YHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLI-FNTPRFHSSLVEAGESCSALSGSFQRIVWDPTI------ 149 (533) Q Consensus 77 ~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i-~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~------ 149 (533) .-.|+++.+++.++.++|.++|++++.+ .+..+++++ ..-++++..++..+..++.+|.+++.|=+...+ T Consensus 65 ~~~n~~~~tl~~l~G~vfrk~p~~~~p~---~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ad 141 (489) T protein:vir:78 65 IVYNFTRRTLSGMVGSVMRKEPEINIPK---ELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAE 141 (489) T ss_pred ccCChHHHHHHHHhchhhcCCcceeccH---HHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHH Confidence 3569999999999999999999987632 244444433 345678999999999999999999987766442 Q ss_pred ----CCceEEEEEcCCeEEEE-E--ecC--CceEEEEEEEEeecC--------CceEEEEEEEecCeeEEEEEEeccCCc Q lcl|NC_016654. 150 ----ADNAWIDFVDADRAIPE-F--RWG--RLVAVTFWSELAGGD--------GQEVWRHLERHESGYIVHAVYKGTATS 212 (533) Q Consensus 150 ----~~~~~i~~v~~~~~~P~-~--~~g--~~~~v~f~~~~~~~~--------~~~~y~~lE~h~~~~I~~~~y~~~~~~ 212 (533) +-+|-|..+.|++++=. + .+| +++.++|-+.+...+ ....|++|+.-..|....++|+...+. T Consensus 142 e~~~~~rPy~~~~~~~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~g 221 (489) T protein:vir:78 142 QNAGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQRLFRFDAEG 221 (489) T ss_pred HHHhcCCcEEEEechhhhcCceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEEEEEEeecCC Confidence 12688989999887642 1 244 577788766543221 123467776655565566677644321 Q ss_pred --ccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHH Q lcl|NC_016654. 213 --LGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRI 290 (533) Q Consensus 213 --lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~ 290 (533) .+..++ .+.+.++ .+....||+.+. ..+..+. .|.+-|. +|. .++.. T Consensus 222 ~~~~~~~~------------~~~~~g~----~~l~~IPfv~~~---~~~~~~~-------~~~pPLl----~LA-~lni~ 270 (489) T protein:vir:78 222 GAQEDVVE------------IYPDLGE----SLRGVIPFTFIG---ATNNDAT-------IDDAPLL----PLA-ELNIG 270 (489) T ss_pred cccceeeE------------EeccCCC----CccCeeeEEEEe---cCCCCCC-------CCcCchH----HHH-HHHHH Confidence 110000 0111110 011112333222 2222221 1222221 221 22222 Q ss_pred H----HHHHHHHH-hCcceeeech-HHhcCCC--C--ccccccCcchhhhhhccccccccccccccceeeechhhhhHHH Q lcl|NC_016654. 291 Y----SSLMRDFR-IGAGKVHASE-SVLTNLG--M--GQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEH 360 (533) Q Consensus 291 ~----s~~~~~~~-~~~~~i~v~~-~~l~~~~--~--~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~ 360 (533) + |.+-+.+. .+.+..++.- .-..... . ..+..+...... ..+ .++. ..++++...... T Consensus 271 Hy~~ssd~~~~l~~~~~P~l~i~G~d~~~~~~~~~~~~~~i~~g~~~~~-----~lp--~~~~----~~~ie~~~~~~~- 338 (489) T protein:vir:78 271 HYRNSADNEESSFVVGQPTLFIYPGENLTPQAFKEANPNGIKFGSRRGH-----NLG--YGGS----AQLIQAGENNLA- 338 (489) T ss_pred HhhhhhHHHHHHHHcccceeeeecCccCCcccccccCccceeeCCcccc-----cCC--CCCC----cceeccCcchHH- Confidence 2 12222233 2333333311 0000000 0 001111111110 001 1111 122333222111 Q ss_pred HHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCc Q lcl|NC_016654. 361 DQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPS 440 (533) Q Consensus 361 ~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~ 440 (533) .+.|+.+-+++.. ++...+. .++..||++.+...+...+....+...++.++.++++.+... .|...... T Consensus 339 r~~l~~le~qm~~---lGa~l~~--~~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w-----~G~~~~~~ 408 (489) T protein:vir:78 339 RQNMLDKEQQAIQ---IGAQLIT--PTQQITAQSARIQRGADTSVMATIARNVSQAYTDALRWVAVM-----LGKPEDTE 408 (489) T ss_pred HHHHHHHHHHHHH---Hhhhhcc--CCcchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH-----cCCCCCCc Confidence 2333333333321 2223332 234689999999998888888899999999999988866543 13222222 Q ss_pred ee--EEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCC-CCCHHHHHHHHHHHHHhhhcccCccccccccCCCC Q lcl|NC_016654. 441 EE--LELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHE-DWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLP 517 (533) Q Consensus 441 ~~--v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~-~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~ 517 (533) .. ++.+|..... ..+.++.+.+++.+|.||.+|+++.+-- ++-+.+.+++.++|..+. .| ..+...++.+ T Consensus 409 ~~i~~n~dF~~~~~--d~~~~~al~~~~~~G~is~~t~~~~L~~~gv~d~~~e~~~~ei~~~~---~~--~~~~~~g~~~ 481 (489) T protein:vir:78 409 VEFRLNMDFFLEPM--TAQDRAAWMADINAGLLPATAYYAALRKAGVTDWTDADIKDAVADQP---LP--VATEVQGEIP 481 (489) T ss_pred eEEEeecccCcccC--CHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCccHHHHHHHHhhcC---CC--cccCCcccCC Confidence 23 3445553322 2446777888899999999998775421 232222334445555432 11 1111222222 Q ss_pred CCCCCCCCCCC Q lcl|NC_016654. 518 TENDPATDPEA 528 (533) Q Consensus 518 ~~~~~~~~~~~ 528 (533) +++..+ +. T Consensus 482 ~~~q~~---~~ 489 (489) T protein:vir:78 482 QSAQQQ---EK 489 (489) T ss_pred CCcccc---cC Confidence 221111 11 No 80 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.60 E-value=1.2e-14 Score=96.81 Aligned_cols=495 Identities=8% Similarity=-0.044 Sum_probs=217.3 Q ss_pred CCC---CCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHH----hccCcchhhHHHHHHHHHHHHHhcccCCCCCcc Q lcl|NC_016654. 1 MSL---PEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFY----GAEGRTSPSGIKARTKAAYEAFHGRTPTATGRA 73 (533) Q Consensus 1 ~~~---~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 73 (533) |+- +++..+-=|-+-........+-..||..+.+.-..++ +...+|..+||.......... .. T Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~----------~g 92 (776) T protein:vir:93 23 SPGEDAAQREKPANPLDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNIQWSQDEIDELKE----------RG 92 (776) T ss_pred CCCCcccchhcccCCCCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHh----------cC Confidence 111 1111111111111211111121222222111111110 111111112222211111110 12 Q ss_pred cceeecChHHHHHHHHHHhhcCCCceEeeCCCc-------hHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEc Q lcl|NC_016654. 74 PKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKS-------KEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWD 146 (533) Q Consensus 74 ~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~-------~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D 146 (533) +..++.|+-+.+|+...++.....+.+.+.+.+ +.++..++.+.+.|++......+...+.+.|.+|++++|| T Consensus 93 ~p~~~~N~i~~~i~~v~g~~~~nr~~~~~~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~af~d~~~~G~G~~~v~~d 172 (776) T protein:vir:93 93 QAPTVYNVISQSVNWIIGSEKRGRSDFKVLPRRKDGGKAAERKTALLKYLSDVNHTPFERSMAFEETTKAGIGWLESQVQ 172 (776) T ss_pred CceEEecchHHHHHHHHHHHHhCCcceEEecCChhHHHHHHHHHHHHHHHHHhhcHHHHHHHHHHHhhhcCcceEEEEee Confidence 336899999999999999988877767665432 2345667778899999999999999999999999999998 Q ss_pred CCCC-CceEEEEEcCCeEEEEEe--cCCceEEEEE--EEEe-e------------------------------------- Q lcl|NC_016654. 147 PTIA-DNAWIDFVDADRAIPEFR--WGRLVAVTFW--SELA-G------------------------------------- 183 (533) Q Consensus 147 ~~~~-~~~~i~~v~~~~~~P~~~--~g~~~~v~f~--~~~~-~------------------------------------- 183 (533) .+.. +.+++.++++..|++-.+ ...+.+|-|+ ..+. . T Consensus 173 ~~~~~~~~~~~~~~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 252 (776) T protein:vir:93 173 DENDGEPIYAGAESWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDIDGDDAMD 252 (776) T ss_pred ccCCCCceEeeccChhheeeccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccchhccccccccc Confidence 6533 345556778887776432 1223333221 1100 0 Q ss_pred ----------------cCCceEEEEEEEecCeeEEEEEEeccC-Ccccceee-hhhccc---cccccc----------cc Q lcl|NC_016654. 184 ----------------GDGQEVWRHLERHESGYIVHAVYKGTA-TSLGWMMA-LTDHPA---TRDIAV----------EG 232 (533) Q Consensus 184 ----------------~~~~~~y~~lE~h~~~~I~~~~y~~~~-~~lG~~v~-l~~~~~---~~~~~~----------~~ 232 (533) ...+...++.|+|.+-.++..++.+.. +.-+..+. +..... -.+... .+ T Consensus 253 ~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~~~~v~~~ 332 (776) T protein:vir:93 253 SPEYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAVSPMMRMHCA 332 (776) T ss_pred ccccccccccccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceeehheeeeeeEEE Confidence 000012334555543333333332211 10010000 000000 000000 00 Q ss_pred cccCCceeecCC------CccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceee Q lcl|NC_016654. 233 ADEGRGAYVETG------VKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVH 306 (533) Q Consensus 233 ~~~~~~~~~~~g------~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~ 306 (533) .-.+. ..+..+ ...+|+.+.... ...+++|.|++.. +++..+.+|...|++.+.+ +..+++ T Consensus 333 ~~~g~-~~l~~~~~p~~~~~~Pfv~~~~~~---------~~~~~~~~G~v~~-~~d~Q~~~N~~~s~~~~~l--~~~~~~ 399 (776) T protein:vir:93 333 IMTTR-DLMWAGPSPYRHNRYPFTPIWGFR---------RARDGMPYGVIRF-MRGMQDDVNKRLSKALYIL--STNKVL 399 (776) T ss_pred EEecc-hhhhccCCCCCCCccceEEecCce---------ecccccccchHHh-hhHHHHHHHHHHHHHHHhh--cCCcee Confidence 00000 001111 112333332221 1125567788876 6799999999999999876 345677 Q ss_pred echHHhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCC Q lcl|NC_016654. 307 ASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSD 386 (533) Q Consensus 307 v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~ 386 (533) +.++.+.+...- .+ ...+.-.+-....+......++ ..+.+. ..+++.++.....+...+|++...+|..+ T Consensus 400 ~~~gav~~~d~~----~~---~~~rp~~vi~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~~~i~~~tGi~~~~~G~~~ 470 (776) T protein:vir:93 400 MEEGAVDDIDEF----RR---EAARPDAVMTVKNGKLGAVKMD-VDRDLA-PAHLELASRSIQMIQQVGGVTDEMLGRTT 470 (776) T ss_pred eccccccchHHH----HH---hcccCCceeeeCCccccccccc-cCcCcc-HHHHHHHHHHHHHHHHhhCcChHHhCCCc Confidence 776665432110 00 0000000000111111111121 123343 46888888888899999999999999654 Q ss_pred CcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCC-------Cce------------------ Q lcl|NC_016654. 387 EVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAA-------PSE------------------ 441 (533) Q Consensus 387 ~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~-------~~~------------------ 441 (533) +..||.+|..+..............+..+++++.+.++.+....+...... ... T Consensus 471 -n~~Sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~ 549 (776) T protein:vir:93 471 -NAVSGVAIQARQEQGSVATNKLFDNLRLAFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKA 549 (776) T ss_pred -chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhcccee Confidence 557899999988888888888888888899888888887755433211110 001 Q ss_pred eEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHH------HHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCC Q lcl|NC_016654. 442 ELELEWPKFARESDLAKAQTVQAWSVASAASTKTK------VAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPP 515 (533) Q Consensus 442 ~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~------v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~ 515 (533) +|.|.=..+.+.-..+..+.++++.. .+..+.. +.++.. +- .+.+-++++++.++..+|.......+.+ T Consensus 550 dv~v~~~~~~~s~r~~~~~~l~ql~~--~~~p~~~~~~~~~~~e~~d-~p--~~~e~~~~l~~~~~~~~p~q~~~~~e~~ 624 (776) T protein:vir:93 550 DFIIDEAEWRATMRQAAVAELMEVIG--KMPPEIALTMLDLLVENMD-IP--NRDELVKRIRAVNGQKDPDQDEPTPEEI 624 (776) T ss_pred eEEEeecccchhHHHHHHHHHHHHHh--hcChhhHHHHHHHHHHhcC-cc--chHHHHHHHHHhhcccccchhhcchhHH Confidence 12221111111113333333444432 2222211 111111 11 1112223333222221111100000000 Q ss_pred CC-------------------CCCCCCC------------C---CCCCCC--CC Q lcl|NC_016654. 516 LP-------------------TENDPAT------------D---PEAVDE--GE 533 (533) Q Consensus 516 ~~-------------------~~~~~~~------------~---~~~~~d--~~ 533 (533) .. ....... + .....+ +. T Consensus 625 ~~qq~q~~~~q~q~~~~~a~~~~~qa~a~~~~aea~~~~aqa~~~~~~a~~~~~ 678 (776) T protein:vir:93 625 AREQAQQQQQQYNDALAIATLEEQQAKARKAAAEAQVAEAKAKHISRMAIREGV 678 (776) T ss_pred HHHHHhhHHHHHHHHHhhhhhhHhhHHHHHHHHHHHHHhhhhhhhhhcchhhhh Confidence 00 0000000 0 000000 00 No 81 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.57 E-value=2.1e-13 Score=90.06 Aligned_cols=459 Identities=12% Similarity=-0.008 Sum_probs=211.4 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCC-cccceeec Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATG-RAPKRYHA 79 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~~~~~~~~ 79 (533) |.|=+.--.|--|.++... ...|..+++ |.+....+...+.................... .+.-.... T Consensus 1 mn~~dr~i~~~sP~~~~~R---~~ar~~~~~--------y~aa~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn 69 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKAAR---LRSRAVIQA--------YEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNH 69 (502) T ss_pred CchHhhHHhhcChHHHHHH---HhhHHHHhh--------ccccCcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcC Confidence 7776655555555544211 123332221 22221111000000000000000000000000 00112356 Q ss_pred ChHHHHHHHHHHhhcCC-CceEeeC--C----CchHHHHHHHHHHh----------hccHHHHHHHHHHHHhhhCCEEEE Q lcl|NC_016654. 80 PIPGVIAKLSTTELFSE-QLKFLDA--G----KSKEVQARADLIFN----------TPRFHSSLVEAGESCSALSGSFQR 142 (533) Q Consensus 80 n~~k~i~~~~a~ll~~e-~~~i~~~--~----~~~~~~~~l~~i~~----------~n~f~~~~~~~~~~~~~~G~~~~~ 142 (533) ++++.+++.+++.++|. ...+... . .++++++.|++.|+ ..+|......++...+..|.++++ T Consensus 70 ~~a~~av~~~~~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~ 149 (502) T protein:vir:79 70 DLVIGVFDKLEERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQ 149 (502) T ss_pred hHHHHHHHHHHHhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEE Confidence 79999999999999986 3333221 1 23445555555554 235777777777788899999999 Q ss_pred EEEcCCCC------CceEEEEEcCCeEEEE-EecCCceEEEEEEEEeecCCceEEEEEEEecCe-eEEEEEEeccCCccc Q lcl|NC_016654. 143 IVWDPTIA------DNAWIDFVDADRAIPE-FRWGRLVAVTFWSELAGGDGQEVWRHLERHESG-YIVHAVYKGTATSLG 214 (533) Q Consensus 143 ~~~D~~~~------~~~~i~~v~~~~~~P~-~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~-~I~~~~y~~~~~~lG 214 (533) +++++... -..+|..++|+++ |. ..+| ..+.--+|+-..| .+-|.+++..+.. + T Consensus 150 ~~~~~~~~~~~g~~~~l~lq~iepd~l-~~~~~~~----------------~~i~~GVe~d~~Gr~~aY~i~~~hPgd-~ 211 (502) T protein:vir:79 150 MVSGRINSLTPSAGVHFWLEALEPDFI-PMTSDES----------------NRLNQGVFVDDWGRPEKYLVYKSRPVS-G 211 (502) T ss_pred EeecccCccCCCcccceEEEEecchhc-CCCCCCC----------------CeeEeeeEECCCCceEEEEEeecCCCC-C Confidence 99876421 1357888998876 32 1121 1122223333322 2333344433222 1 Q ss_pred ceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHH Q lcl|NC_016654. 215 WMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSL 294 (533) Q Consensus 215 ~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~ 294 (533) ....... |+- ....-+|.| .. + ....|+|.|+..|. .+..+|...+.- T Consensus 212 ~~~~~~r-------------------vpA--~~vlH~f~~-~r-------~--gQ~RGis~lapvl~-~l~~l~~~~dae 259 (502) T protein:vir:79 212 RQMETKE-------------------VDA--ERMLHLKFV-RR-------L--HQMRGTSLLSGVLI-RLSALKEYEDSE 259 (502) T ss_pred cccceeE-------------------ech--hheEEeecc-cC-------C--ccccCCchHHHHHH-HHHHHhHHHHHH Confidence 0000000 100 001111221 11 1 13469999999774 556666544332 Q ss_pred HHHHH-hCcceeeechHHhcCCCCc-ccccc--CcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHH Q lcl|NC_016654. 295 MRDFR-IGAGKVHASESVLTNLGMG-QGVSL--DEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLRE 370 (533) Q Consensus 295 ~~~~~-~~~~~i~v~~~~l~~~~~~-~~~~~--d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~ 370 (533) ...-+ .+--..|| +...+. ..... +........+.-+..-..-.....++.++|.-+..+|..-++.+++. T Consensus 260 l~~a~i~A~~~~fi-----~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~ 334 (502) T protein:vir:79 260 LTAARIAAALGMYI-----RKGDGQSYEPDGNGSKENERELTIQPGIIYDDLKPGEEIGMVKSDRPNPNLETFRNGQLRA 334 (502) T ss_pred HHHHHHhhhheeee-----ecCCCcccccccCCCCCccccccccCCccccccCCCceeeeeCCCCCCCCHHHHHHHHHHH Confidence 22222 22222333 211111 00000 00000011111111000001112477778777777888888888999 Q ss_pred HHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCC-----ceeEEE Q lcl|NC_016654. 371 VLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAP-----SEELEL 445 (533) Q Consensus 371 i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~-----~~~v~i 445 (533) |...+|++++.++.+.++ |=.++++.....-..+...+..+...+-+-|+.. ||....+.|....+ ..-+.+ T Consensus 335 iaaglGi~ye~lt~D~s~--nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~-~l~~a~l~G~i~~p~~~~~~~~~~~ 411 (502) T protein:vir:79 335 VAAGSRLSFSSTARNYNG--TYSAQRQELVESTDGYLILQDWFIGAVTRPMYRA-WLKQAVASGVIRLPRDLDRSSLYTA 411 (502) T ss_pred HHhhcCCCHHHHhccccc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHcCCCCCCCCCCchhhcce Confidence 999999999999877543 4444455555555555555555554444433322 23333444433322 223466 Q ss_pred Ee--CCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHH---hhhcccC---ccccccccCCCC Q lcl|NC_016654. 446 EW--PKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDN---ANTVSAP---TFGFGTDQPPLP 517 (533) Q Consensus 446 ~f--~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~---E~~~~~~---~~~~~~~~~~~~ 517 (533) .| +.....|+..++++..+++.+|++|.++.+++. +.|-+++.+++++-.+ +.+...+ ....+....+.. T Consensus 412 ~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~~~a~~--G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~ 489 (502) T protein:vir:79 412 VYSGPVMPWIDPVKEAEAWKIQIRGGAATESDWVRAG--GRNPDDVKRRRKAEIDENRKLDLVFDTDPASDKGGSSAATK 489 (502) T ss_pred eeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHc--CCCHHHHHHHHHHHHHHHHHcCCCCCCCCCCCCCCCCCCCC Confidence 77 444557999999999999999999999999875 3555545443332111 1121111 111111111111 Q ss_pred CCCCCCCCCCCCC Q lcl|NC_016654. 518 TENDPATDPEAVD 530 (533) Q Consensus 518 ~~~~~~~~~~~~~ 530 (533) .+..+.++++.++ T Consensus 490 ~~e~~~~~~~~e~ 502 (502) T protein:vir:79 490 RQEPQHTDDQSEE 502 (502) T ss_pred CCCCCCCCCCCCC Confidence 1111122222222 No 82 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.52 E-value=8.1e-14 Score=92.35 Aligned_cols=443 Identities=9% Similarity=-0.048 Sum_probs=193.4 Q ss_pred cCCHHHHHHHHhccCcchhhHHHHHHH-HH-H---HHHhcccCCCCCcc---cceeecChHHHHHHHHHHhhcCCCceEe Q lcl|NC_016654. 30 EGDLDKLATFYGAEGRTSPSGIKARTK-AA-Y---EAFHGRTPTATGRA---PKRYHAPIPGVIAKLSTTELFSEQLKFL 101 (533) Q Consensus 30 ~gd~~~l~~~y~~~~~~~~~~~~~~~~-~~-~---~~~~~~~~~~~g~~---~~~~~~n~~k~i~~~~a~ll~~e~~~i~ 101 (533) .+..++-.+..-...+.....+...+. +. . ...+.+.+...+.. .-+.+..+++.||+..|..++.+...|+ T Consensus 1 ~~~~~~a~~~~~~~~a~~~~~~~~~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~~~~l~r~iVd~~a~d~~r~g~~i~ 80 (461) T protein:vir:80 1 MYSIDKAKQAKIDSKIVNRNDFMVGHGKANSRDKLTRQTPGNGQKLDLKACENLYASNSIAMNIVDIISEDMVRAGWSLK 80 (461) T ss_pred CccchhhhhhhhhhhhhhhhHHHhhcCCcchhhhhhccccCcccccCHHHHHHHHHhCCccchhhccchHHhhcCCeeee Confidence 111111110000000000000110100 00 0 00011111111111 1123667899999999999999988886 Q ss_pred eCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEE Q lcl|NC_016654. 102 DAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSEL 181 (533) Q Consensus 102 ~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~ 181 (533) +. +++..+.+.+.++.-+++..+.+++..+..+|++++.+-+.+.... .+...-|+ +.+.+..+.++..+ T Consensus 81 ~~--~~~~~~~~~~~~~~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~-------~~~~~~pl-~~~~~~~~~~l~~~ 150 (461) T protein:vir:80 81 TD--NKEMKKNIESKWRKLKTKDRFQKLYADKRLYGDGFLSIGVVSSNRE-------QADLSTAI-DPKTIKSIPYINTF 150 (461) T ss_pred cC--CHHHHHHHHHHHHHhhHHHHHHHHHHhhcccccEEEEEEeecCCcc-------ccCccCCc-ccccccceeEEEec Confidence 64 4556677888887778999999999999999999888766432110 11112222 22222222222211 Q ss_pred eecC--CceEEE--EEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCc Q lcl|NC_016654. 182 AGGD--GQEVWR--HLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVT 257 (533) Q Consensus 182 ~~~~--~~~~y~--~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~ 257 (533) .... .....+ +-+. -|..++ |.-.+...+.. +. .....+.....-+.+ ..+.|..... T Consensus 151 ~~~~i~~~~~~~dp~sp~--fg~P~~--y~i~~~~~~~~--~~----------~~~~~~~~~~~iH~S--Rii~~~~~~~ 212 (461) T protein:vir:80 151 NTQKVTQLYLNQDMFSEH--FGEVEF--FEVNRVSQLGE--EI----------LSGTTASTSEQIHRS--RIIHEQGLRF 212 (461) T ss_pred cccccchhhhcccCcCcc--cccceE--EEEeccccccc--cc----------cccccCccceEEccc--cEEEecCCCC Confidence 0000 000000 0000 011111 21111100000 00 000000000000111 1122222211 Q ss_pred ccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcC-CCCccccccCcchhhhhh-ccc Q lcl|NC_016654. 258 PNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTN-LGMGQGVSLDEEQEVYSR-VGS 335 (533) Q Consensus 258 ~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~-~~~~~~~~~d~~~~~~~~-~~~ 335 (533) + ...+|+|++.. +.+.+.+++.+.-....-+......++--.. +.. .+..... ....-..++. -.+ T Consensus 213 ~---------~~~~G~S~le~-~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~-l~~~~~~~~~~-~~~~~~~~~~~~g~ 280 (461) T protein:vir:80 213 E---------GETKGRSIFES-LYDIITVMDTSLWSVGQILYDFAFKVYKTDD-IDALNKDDKAN-LTAMLDFMFRTEAL 280 (461) T ss_pred C---------ccccCcchHHH-HHHHHHHHHHHHHHHHHHHHHhCCCceecch-HHhhhchHHHH-HHHHHHHhcCCceE Confidence 1 13469999987 5577888887776665555433333332111 110 0111000 0000001111 011 Q ss_pred cccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhh-cccCCCcchhHHHHHHHhhhHHHHHHHHH-HHH Q lcl|NC_016654. 336 GGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVS-LGLSDEVAQTATEASGKKDLTVKTTRAKA-RHF 413 (533) Q Consensus 336 ~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~-~g~~~~~~~Tatai~~~~~~l~~~~~~~~-~~~ 413 (533) ...+ . ...+++++.+ +......++.+.+.|+..+++|... ||-..++.+|+.+=. +-.+..+..++ ..+ T Consensus 281 ~~~d--~--~e~~e~~~~~--lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~asge~D~---~~yyd~i~~~qe~~l 351 (461) T protein:vir:80 281 AIIK--G--DEQLTKESTN--VSGMKDLLDYGWDYLAGAVRMPKTVLKGQEAGTLTGAQYDV---MNYYARVSSIQENRL 351 (461) T ss_pred EEEc--C--CcceEEEecC--cCCHHHHHHHHHHHHhhhhcCCeeeeecccCCccccchHHH---HHHHHHHHHHHHHHH Confidence 1111 1 1235555543 3455677888889999999999865 465556666776432 23455566666 568 Q ss_pred HHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHH Q lcl|NC_016654. 414 GSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEA 493 (533) Q Consensus 414 ~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El 493 (533) +..|..|+..++.-. ..+.....+..++++|.|++-...+..|.|++..+...+ ..+++. .+-.+.+|+.+++ T Consensus 352 ~p~le~l~~~i~~s~-~~~~~~~~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a----~~~~~~--~g~is~~e~r~~l 424 (461) T protein:vir:80 352 RPQLEYLTRLLMWAS-DDCGPSIDPDSFEWAIEFNPLWNLDSKTDAEVRKLTAEA----DQIYIV--NGVLDPDEVKETR 424 (461) T ss_pred HHHHHHHHHHHHHHh-cccccccCccccceEEEeCCCCCCCHHHHHHHHHHHHHH----HHHHHh--cCCCCHHHHHHHH Confidence 889999888776531 111122233457899999999999999988765443221 112222 2234444444444 Q ss_pred HHHHHhhhcccCccccccccCCCCC--CCCCCCCCCCCCCC Q lcl|NC_016654. 494 DLIDNANTVSAPTFGFGTDQPPLPT--ENDPATDPEAVDEG 532 (533) Q Consensus 494 ~rI~~E~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~d~ 532 (533) .. . ...+++....+..+...+ +.+.+...++++|| T Consensus 425 ~~---~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~g 461 (461) T protein:vir:80 425 FG---R-FGLENSSKFSGDSAEIDKLAKLVYDAYAKKNADG 461 (461) T ss_pred HH---h-cCCCCCccCCCCCchhhhhhhhccccccccCCCC Confidence 21 1 111121112222211110 01111112233333 No 83 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.51 E-value=3.7e-12 Score=83.27 Aligned_cols=450 Identities=11% Similarity=0.009 Sum_probs=202.1 Q ss_pred CCCCCCc---CCCcCcchHHHHHH----HHhhhHhhcC--CHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCC Q lcl|NC_016654. 1 MSLPEAN---TAWPPPELAAVTAR----VAESHVWWEG--DLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATG 71 (533) Q Consensus 1 ~~~~~~~---~~~pp~~~~~~~~~----~~~~~~w~~g--d~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 71 (533) |-+|+-- ..-++..+..++.- -.....|... +++... ..-+..+..+ +|.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i-----------~~~~~~lr~R-aRdl-------- 60 (530) T protein:vir:38 1 MKIPSLVGPDGKTSLREYAGYHGGGGGFGGQLRGWNPPSESADAAL-----------LPNYSRGNAR-ADDL-------- 60 (530) T ss_pred CccceeecCccccchHHHhhhhcccCCCCCcccccccCCCCHHHHH-----------HHHHHHHHHH-HHHH-------- Confidence 8888322 22333333322210 0111112221 111110 0111111111 1111 Q ss_pred cccceeecChHHHHHHHHHHhhcCCCceEeeCC----------CchHHHHHHHHHHhh--------------ccHHHHHH Q lcl|NC_016654. 72 RAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAG----------KSKEVQARADLIFNT--------------PRFHSSLV 127 (533) Q Consensus 72 ~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~----------~~~~~~~~l~~i~~~--------------n~f~~~~~ 127 (533) ....++++.+++.+++.+.|....+.... .++.+++.+++.|+. ..|..... T Consensus 61 ----~rNn~~a~~av~~~~~nvVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~ 136 (530) T protein:vir:38 61 ----VRNNGYAANAVQLHQDHIVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIR 136 (530) T ss_pred ----HhcChHHHHHHHHHHHHhhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHH Confidence 22456999999999999999876554432 234455556555532 24667777 Q ss_pred HHHHHHhhhCCEEEEEEEcCCCCC--ceEEEEEcCCeEE-EEE-ecCCceEEEEEEEEeecCCceEEEEEEEecCee-EE Q lcl|NC_016654. 128 EAGESCSALSGSFQRIVWDPTIAD--NAWIDFVDADRAI-PEF-RWGRLVAVTFWSELAGGDGQEVWRHLERHESGY-IV 202 (533) Q Consensus 128 ~~~~~~~~~G~~~~~~~~D~~~~~--~~~i~~v~~~~~~-P~~-~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~-I~ 202 (533) .++...+..|.+++++.+++..+. ..++..++|+.+- |.. .+| ..+.--+|+-..|+ +- T Consensus 137 l~~r~~~~dGE~~~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~----------------~~i~~GIe~d~~Gr~~a 200 (530) T protein:vir:38 137 EGVAMHAFNGELCVQATWDSDSTRLFRTQFKMVSPKRVSNPNNIGDT----------------RNCRAGVKINDSGAALG 200 (530) T ss_pred HHHHHHhhCCceEEEeeeccCCCCccceEEEEechhhcCCCCCCCCC----------------CeeEeeeEECCCCceEE Confidence 777778999999999988764321 3678888888653 211 111 11222233322222 22 Q ss_pred EEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHH Q lcl|NC_016654. 203 HAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFP 282 (533) Q Consensus 203 ~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~ 282 (533) |.+++....... ...+..+. .....+..-+.|+-... .+ ....|+|.|+..|.. T Consensus 201 Y~i~~~~~~~~~-------~~~~~~~~-----------~~~~v~a~~vlH~f~~~------r~--gQ~RGis~lapvl~~ 254 (530) T protein:vir:38 201 YYVSDDGYPGWM-------AQNWTYIP-----------RELPGGRPSFIHVFEPM------ED--GQTRGANAFYSVMEQ 254 (530) T ss_pred EEEeeccCCCcc-------ccccceee-----------eeeccChhHeEeecccc------CC--CcccCCchHHHHHHH Confidence 223322111000 00000000 00011111222221111 01 245699999997754 Q ss_pred HHHHHHHHHH-HHHHHHHhCcceeeechHHhcCCC----------Cccccc---cCcchhh--------hhhcccccccc Q lcl|NC_016654. 283 TFHELDRIYS-SLMRDFRIGAGKVHASESVLTNLG----------MGQGVS---LDEEQEV--------YSRVGSGGFNA 340 (533) Q Consensus 283 lid~lD~~~s-~~~~~~~~~~~~i~v~~~~l~~~~----------~~~~~~---~d~~~~~--------~~~~~~~~~~~ 340 (533) +..++.-.. .+.+..-.+--..||-... .... ...... +...... .....+..... T Consensus 255 -l~~l~~y~dael~~a~i~A~~a~fi~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~p 332 (530) T protein:vir:38 255 -MKMLDTLQNTQLQSAIVKAMYAATIESEL-DTQSAMDFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLP 332 (530) T ss_pred -HHHHhHHHHHHHHHHHHhhhheeeeeccC-CccccccccccCCcccccccccccchhhhhcccccceeccCceeeecCC Confidence 455554333 3332222222222331110 0000 000000 0000000 01111111112 Q ss_pred ccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc--hhHHHHHHHhhhHHHHHHHHHHHHHHHH- Q lcl|NC_016654. 341 NGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA--QTATEASGKKDLTVKTTRAKARHFGSAL- 417 (533) Q Consensus 341 ~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~--~Tatai~~~~~~l~~~~~~~~~~~~~al- 417 (533) | ..++.+++.-...+|..-++.+++.|...+|+|++.++-|.+++ .|+.+-... ....+...+..+...+ T Consensus 333 G----e~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~~nYSS~R~~~~e---~~r~~~~~q~~~~~~~~ 405 (530) T protein:vir:38 333 G----DSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQMSYSTARASANE---SWAYFMGRRKFVASRQA 405 (530) T ss_pred C----CeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHH---HHHHHHHHHHHHHHHHh Confidence 2 23677788777778888888899999999999999997665433 234444333 3334444444443333 Q ss_pred HHHHHHHHHHHHhhccCCCCCCc-----------eeEEEEe--CCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCC Q lcl|NC_016654. 418 GPLSTTCLRVDAIKFPGKGAAPS-----------EELELEW--PKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDW 484 (533) Q Consensus 418 ~~li~~il~l~~~~~~~~~~~~~-----------~~v~i~f--~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~ 484 (533) +.+.+. ||....+.|....+. ..+.+.| +.....|+..+++....++.+|+.|.++.+++. +. T Consensus 406 ~pi~~~--wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~--G~ 481 (530) T protein:vir:38 406 CQMFLC--WLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKR--GD 481 (530) T ss_pred hHHHHH--HHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHc--CC Confidence 222222 233333333332221 1134555 445567999999999999999999999999875 35 Q ss_pred CHHHHHHHHHHHHHhhhcccC-ccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 485 DDERVQEEADLIDNANTVSAP-TFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 485 ~dee~~~El~rI~~E~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) |-+++.+++ ..|+..... .....++....+..+.+..+ +.++||. T Consensus 482 D~~~v~~q~---a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~-~~~~d~~ 527 (530) T protein:vir:38 482 DYQEIFAQQ---VRESMERRAAGLNPPAWAAAAFEAGVKKSN-EEEQDGA 527 (530) T ss_pred CHHHHHHHH---HHHHHHHHHcCCCCCCCcccccCCCCCCCC-CCCCCCC Confidence 555454443 333321110 00000111111111111111 1112222 No 84 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.51 E-value=4.1e-12 Score=83.02 Aligned_cols=456 Identities=10% Similarity=-0.021 Sum_probs=202.8 Q ss_pred CCCC--CCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccC----cch-hhHHHHHHHHHHHHHhcccCCCCCcc Q lcl|NC_016654. 1 MSLP--EANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEG----RTS-PSGIKARTKAAYEAFHGRTPTATGRA 73 (533) Q Consensus 1 ~~~~--~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~----~~~-~~~~~~~~~~~~~~~~~~~~~~~g~~ 73 (533) |++= ..+-.||+.. ++.......+.+ +...+...-+..++ ... ....+..+..+ +|.+ T Consensus 8 ~~~~dr~i~~~~~~~~-~~~~~~~~~y~a---a~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~R-aRdL---------- 72 (505) T protein:vir:96 8 PSLAQRMVNWAWYRYV-EPQKNAARAFEA---ARRDRLGKAWLRRASRLSADEEIYADLASLVQR-AREQ---------- 72 (505) T ss_pred cchhhcccchhhhhhH-HHHHHhhhhccc---ccCCCccccccCCCCCCChHHHHHHHHHHHHHH-HHHH---------- Confidence 3321 1122455443 222222222111 11111111000000 000 01111111111 1111 Q ss_pred cceeecChHHHHHHHHHHhhcCC-CceEeeCC------CchHHHHHHHHHHhh------------ccHHHHHHHHHHHHh Q lcl|NC_016654. 74 PKRYHAPIPGVIAKLSTTELFSE-QLKFLDAG------KSKEVQARADLIFNT------------PRFHSSLVEAGESCS 134 (533) Q Consensus 74 ~~~~~~n~~k~i~~~~a~ll~~e-~~~i~~~~------~~~~~~~~l~~i~~~------------n~f~~~~~~~~~~~~ 134 (533) ....++++-+++.+++.++|. ...+.... .++.+++.|+..++. .+|......++...+ T Consensus 73 --~rNn~~a~~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~ 150 (505) T protein:vir:96 73 --SINNPYAKRFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLA 150 (505) T ss_pred --HhcChHHHHHHHHHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHh Confidence 224469999999999999984 55444321 244455555544332 136666677777778 Q ss_pred hhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeE-EEEEEeccCCcc Q lcl|NC_016654. 135 ALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYI-VHAVYKGTATSL 213 (533) Q Consensus 135 ~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I-~~~~y~~~~~~l 213 (533) ..|.++++..+..++.-..+|..++|+.+--- .++.. .+++.+.--+|+-..|+. -|.+++..+... T Consensus 151 ~dGE~f~~~~~~~~~~~~~~lqliepd~l~~~-~n~~~-----------~~~~~i~~GIe~d~~Gr~~aY~i~~~hPgd~ 218 (505) T protein:vir:96 151 RDGEVLVREHRGYPNKWGYALQILECDRLDLN-YNADL-----------QNGNRIRMSIELDAWERPVAYHLLVNHPGDN 218 (505) T ss_pred hCCceEEEEeecCCCCcceEEEEechhhcCCC-CCccc-----------CCcCeEEeceEECCCCceEEEEEeecCCCcc Confidence 89999999888765544568888998875311 11110 011112222233222222 222333222110 Q ss_pred cceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHH Q lcl|NC_016654. 214 GWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSS 293 (533) Q Consensus 214 G~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~ 293 (533) ...... ....+.- |. -....-+|.|- .+ ....|+|.|+..|. .+..++...+. T Consensus 219 ~~~~~~-~~~~~~r-------------vp--a~~vlH~f~~~--------r~--gQ~RGis~lapvl~-~l~~l~~y~da 271 (505) T protein:vir:96 219 SYCYHY-AGQTYER-------------VP--ADEIIHTFVPW--------RP--HQNRGIPWTHASMV-ELHHIGEYRKS 271 (505) T ss_pred cccccc-ccccccc-------------cC--HhHhhhhhccc--------CC--ccccCcchHHHHHH-HHHHHhHHHHH Confidence 000000 0000000 00 00011111111 01 23469999999774 45566544433 Q ss_pred HHHHHH-hCcceeeechHHhcCCCCccccc-cCcchhh---hhhccccccccccccccceeeechhhhhHHHHHHHHHHH Q lcl|NC_016654. 294 LMRDFR-IGAGKVHASESVLTNLGMGQGVS-LDEEQEV---YSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLL 368 (533) Q Consensus 294 ~~~~~~-~~~~~i~v~~~~l~~~~~~~~~~-~d~~~~~---~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l 368 (533) -...-+ .+--..|| +.+.+..+.. .+..... +.+..+.....| ..++.++++-+..+|..-++.++ T Consensus 272 el~~a~i~A~~a~fi-----~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pG----e~i~~~~~~~p~~~~~~f~~~~l 342 (505) T protein:vir:96 272 EMIAAELGAKKVGFY-----EQDPEAYDQPPEDDQGEIVEEVEAGTYQLLPYG----IRFKEHKIDHPHTNFGAFVKSSL 342 (505) T ss_pred HHHHHHHhhhheeee-----ecCCccCCCccccccCccccccCCceeeecCCC----CeeeeeCCCCCCCCHHHHHHHHH Confidence 322222 22222333 2222111111 1111111 111122222222 24777888877788888888999 Q ss_pred HHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhccCCCCCCce----eE Q lcl|NC_016654. 369 REVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGP-LSTTCLRVDAIKFPGKGAAPSE----EL 443 (533) Q Consensus 369 ~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~-li~~il~l~~~~~~~~~~~~~~----~v 443 (533) +.|...+|+|++.+..+.+++ |=.++++........+...+..+...+-+ +.+.. |....+.|....+.. -+ T Consensus 343 r~iaaglgi~ye~lt~D~s~~-nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~--l~~a~l~G~i~~p~~~~~~~~ 419 (505) T protein:vir:96 343 RGVAAGMGPAYNRLAHDLEGV-NFSSLRSGELDERDLYKLLQFFVVTELLERVAGNL--ISMSLLTQALPLNMVDIDRLS 419 (505) T ss_pred HHHHhhcCCCHHHHhcccccc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHcCCcCCCCccchhhc Confidence 999999999999997665432 22233333444444444444444443333 33332 233333343332221 23 Q ss_pred EEEeCC--CCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHH---hhhcccCccccccccCCCCC Q lcl|NC_016654. 444 ELEWPK--FARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDN---ANTVSAPTFGFGTDQPPLPT 518 (533) Q Consensus 444 ~i~f~d--~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~---E~~~~~~~~~~~~~~~~~~~ 518 (533) .+.|-- ....|+..++++...++.+|++|.++.++.. +.|-+++.+++++-.+ +.+...+.... +... T Consensus 420 ~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~t~~~~~a~~--G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~-----~~~~ 492 (505) T protein:vir:96 420 QYAFQPRGWDWVDPAKDSKAHSESIKNRTRSRSSIIRAA--GDDPEDVFDEIAWEEQLMRDKGVNPTPPEQ-----ESKD 492 (505) T ss_pred eeeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHc--CCCHHHHHHHHHHHHHHHHHcCCCCCCCCC-----CCCC Confidence 566643 3446999999999999999999999999885 3565555444332211 12211111001 1111 Q ss_pred CCCCCCCCCCCCC Q lcl|NC_016654. 519 ENDPATDPEAVDE 531 (533) Q Consensus 519 ~~~~~~~~~~~~d 531 (533) ...++.+++..|| T Consensus 493 ~~~~~~~~~~~d~ 505 (505) T protein:vir:96 493 ATTDEEDDSASDD 505 (505) T ss_pred CCCCCCCCCCCCC Confidence 1111111122222 No 85 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=99.50 E-value=4.5e-12 Score=82.77 Aligned_cols=443 Identities=10% Similarity=-0.005 Sum_probs=198.0 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCc--chhhHHHHHHHHHHHHHhcccCCCCCcc-ccee Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGR--TSPSGIKARTKAAYEAFHGRTPTATGRA-PKRY 77 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~g~~-~~~~ 77 (533) ||. ..|=|+|.....+.+..+.=+.|...+--.-|..+.. .....-+..|..+..+--..+. ... .+-. T Consensus 14 m~V-----~~~hp~y~a~~~~W~~~~d~g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~---~~~~~rA~ 85 (488) T protein:vir:96 14 MLT-----PIYHPDYLVNAPQWLRNLDCVMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWE---DLTWRLAN 85 (488) T ss_pred ecc-----cccCHHHHHHhhhhhHhhhhhhHHHHHhhhhcCCCCCCccccccCcchhhhhhccchhhhH---hhhhhccc Confidence 653 4455676665555544433122211110111221110 0000111111111110000000 000 0113 Q ss_pred ecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHH-HhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCC------- Q lcl|NC_016654. 78 HAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLI-FNTPRFHSSLVEAGESCSALSGSFQRIVWDPTI------- 149 (533) Q Consensus 78 ~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i-~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~------- 149 (533) -.|+++.+++.++.++|.++|+++.... ..++.+++++ ..-++++..++..+..++.+|.+++.|=+.+.+ T Consensus 86 ~~n~~~~tl~~l~G~vfrk~p~~~~~~~-~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~T~ade~ 164 (488) T protein:vir:96 86 YVNIVNPTMNAITGAVMRREPEFDTMDN-PVLIGLRDNIDGKGNGIDQECKQALNALQWGSRCGWLVRSHPESATMADWN 164 (488) T ss_pred cCchhHHHHHHhcchhhccCceeccCCc-HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCcCCHHHHH Confidence 4699999999999999999999976422 2355555443 345678999999999999999999987665432 Q ss_pred --CCceEEEEEcCCeEEEE-E--ecC--CceEEEEEEEEeecCC-----ceEEEEEEEecCeeEEEEEEeccCCccccee Q lcl|NC_016654. 150 --ADNAWIDFVDADRAIPE-F--RWG--RLVAVTFWSELAGGDG-----QEVWRHLERHESGYIVHAVYKGTATSLGWMM 217 (533) Q Consensus 150 --~~~~~i~~v~~~~~~P~-~--~~g--~~~~v~f~~~~~~~~~-----~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v 217 (533) +.+|-|..+.|++++=. + .+| +++.+++-+.+...|+ +..|+.+. .++|. |++|....+.-+. T Consensus 165 ~~~~rPy~~~~~a~~IinW~~~~v~G~~~L~~v~lrE~~~~~D~~~~~~~~~~~~~~-l~~g~--~~v~~~~~~~~~~-- 239 (488) T protein:vir:96 165 KGKKLPTAAFYDALHIIDWEVEYIDGEEKLTYLSLLEDYQERDGGTYVSKQRLINHR-LVDGL--CEFQEVTDDEYSD-- 239 (488) T ss_pred HhcCCcEEEEechhhhcCcceeccCCceeeEEEEEEEEEEeccCCCcccceEEEEEE-EECcE--EEEEEEecCCccc-- Confidence 23588888998877643 2 234 4777777665544332 11222221 13332 2333222111000 Q ss_pred ehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHH--- Q lcl|NC_016654. 218 ALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSL--- 294 (533) Q Consensus 218 ~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~--- 294 (533) ++ .+ .+.++ .+....||+.+ ...+..+. .|.+-|. +|. .++...=+. T Consensus 240 ------e~---~~--~~~g~----~~l~~IP~v~~---~~~~~~~~-------~~~pPLl----dLA-~lnl~Hy~~ssd 289 (488) T protein:vir:96 240 ------EW---TP--VLINS----KQSDTIPFFLA---SSQSNEWC-------IDSTPLT----SLA-EISLSIYVMNAY 289 (488) T ss_pred ------ce---Ee--ecCCC----cccCeeEEEEE---ecCCCCCC-------CCCCchH----HHH-HHHHHHHhhhhH Confidence 00 00 00000 01111233322 22222211 1222222 221 223222222 Q ss_pred -HHHHHhCcceeeechHHhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHH Q lcl|NC_016654. 295 -MRDFRIGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLR 373 (533) Q Consensus 295 -~~~~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~ 373 (533) -+.+..+...+.|.. + .....+...... ...+..+.........+ ...++++.+... ..+.|+.+.+++.. T Consensus 290 ~~~il~~~~~p~lv~~-~-~~~~~~~~~~~~-~~g~~~~~~~~~~~~~g----~~~~~e~~~~~l-~~~~l~~l~~qm~~ 361 (488) T protein:vir:96 290 SNKAMILANEAKWMVD-M-GDMNKTMASEMN-PLGFTLAGRMPYYVKNG----DVKVIQAQFSPE-TENKVEKLFEQAVK 361 (488) T ss_pred HHHHHHhcCCceeeec-c-CCCCcccccccc-cceeeecccccccccCC----ceeecCCchhHH-HHHHHHHHHHHHHH Confidence 222233333333310 0 000000000000 00111111111011111 123333333221 13345555444422 Q ss_pred hhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCC-CC Q lcl|NC_016654. 374 KTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKF-AR 452 (533) Q Consensus 374 ~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~-i~ 452 (533) ++...+. .++..||++.....+...+....+...++.++.++++.+....... .......+++|.-+.. .. T Consensus 362 ---~Ga~l~~--~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~A~w~g~~---~~~~~~~~~~~~in~dF~~ 433 (488) T protein:vir:96 362 ---VGASLFT--QQSNETATGAAIRSGSSTASMATLGNNVEDTVRNMLRFIMRYFEGT---NLYVNPDELVFKLNRDYFD 433 (488) T ss_pred ---HhHhhcc--CCCcchHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCC---CCCcCccceEEEeccCCCC Confidence 2222332 2345799999998888888888899999999999888765431110 1111222333333321 22 Q ss_pred CC-HHHHHHHHHHHHhCCCCCHHHHHHHhCC-CC-C-HHHHHHHHHHHHHhhhcccCcc Q lcl|NC_016654. 453 ES-DLAKAQTVQAWSVASAASTKTKVAYLHE-DW-D-DERVQEEADLIDNANTVSAPTF 507 (533) Q Consensus 453 ~d-~~e~a~~~~~l~~aGi~S~et~v~~l~~-~~-~-dee~~~El~rI~~E~~~~~~~~ 507 (533) .. ....++.+.++..+|.||.+|.++.+-- ++ + +-..++|.+||+++- ..+ T Consensus 434 ~~ld~~~~~al~~~~~~G~Is~~t~~~~L~~~gvl~~d~~~e~~~~~ie~~g----~~~ 488 (488) T protein:vir:96 434 VEVNPQMLQVAYAAMMEGNLPQVSWFELLKRARVVRGDMSKEEFDEHIAELG----FGM 488 (488) T ss_pred ccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCcCCccCCHHHHHHHHhhcC----CCC Confidence 21 3457777888999999999998766421 22 1 113456667776432 111 No 86 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.50 E-value=1.2e-12 Score=85.87 Aligned_cols=410 Identities=10% Similarity=-0.001 Sum_probs=184.2 Q ss_pred HHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHh-cccCCC-CCcc--cceeecChHHHHHHHHHHhhcCCC Q lcl|NC_016654. 22 VAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFH-GRTPTA-TGRA--PKRYHAPIPGVIAKLSTTELFSEQ 97 (533) Q Consensus 22 ~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~-~g~~--~~~~~~n~~k~i~~~~a~ll~~e~ 97 (533) |. -.+.|..+-.+.+..+.. .++ .+.+.. .+.. .-..+..+++.+|+..|.-++.+. T Consensus 1 ~~--------~~D~~~~~~~~~g~~~~~-----------~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~vd~~a~d~~r~~ 61 (437) T protein:vir:52 1 MK--------FFDGIKSLALKLGSKQEQ-----------TYYSPSLSLTDDLVQLEALWRDNWIANKVCIKRPEDMVRNW 61 (437) T ss_pred Cc--------hhhhhHhHHhcCCCcccc-----------ceeecCccccccHHHHHHHHHhCchhhHHhhcchHHhhcCC Confidence 00 112222222111110000 000 000000 0000 123466799999999999999999 Q ss_pred ceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEE Q lcl|NC_016654. 98 LKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTF 177 (533) Q Consensus 98 ~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f 177 (533) ..|.+++.++..-+.+.+.++.=+++..+.+++..+-.+|++++.+..|... + - .|+-..|.+..+.. T Consensus 62 ~~i~~~d~~~~~~~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~---~-~--------~pl~~~~~~~~~~v 129 (437) T protein:vir:52 62 REIYSNDLNSKQLDLFTKFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQN---T-S--------APLKPTERLKRLII 129 (437) T ss_pred ceEecCCCCHHHHHHHHHHHHhhcHHHHHHHHHHhcccccceEEEEEecCCC---c-c--------cccccCCceeEEEE Confidence 8888765544444667777777789999999999999999999887776321 0 0 11111233333222 Q ss_pred EEEEee--cCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecC Q lcl|NC_016654. 178 WSELAG--GDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPN 255 (533) Q Consensus 178 ~~~~~~--~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn 255 (533) +..+.- ......=-+-..+ |...+ |.-...+-+.. + +.++ .+.|... T Consensus 130 ~~~~~v~~~~~~~~dp~s~~f--g~p~~--y~v~~~~~~~~------------------------i-H~SR--ii~~~~~ 178 (437) T protein:vir:52 130 LPKWKISPTGTKDDDVLSPNF--GRYSE--YSILGGSQSIT------------------------V-HHSR--LIILNAN 178 (437) T ss_pred echhhcccccccccccccccc--CcceE--EEEecCCccee------------------------E-ccce--eEEecCc Confidence 211100 0000000000000 11111 11110000000 0 1111 1111111 Q ss_pred CcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceee-ec--hHHhcCCCCccccccCcchhhh-- Q lcl|NC_016654. 256 VTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVH-AS--ESVLTNLGMGQGVSLDEEQEVY-- 330 (533) Q Consensus 256 ~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~-v~--~~~l~~~~~~~~~~~d~~~~~~-- 330 (533) ..+ .+....+|+|.+... .+-|..++.+.-....-+...+..++ ++ ...+... .........+.+ T Consensus 179 ~~~------~~~~~~~G~s~le~~-~~~i~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~l~~~---~~~~~~~~~~~~~~ 248 (437) T protein:vir:52 179 DAP------LSDNDIWGVSDLEKI-IDVLKRFDSASVNVGDLIFESKIDIFKIAGLSDKIAAG---MENEVASVISAVQE 248 (437) T ss_pred cCC------CccccccCCchHHHH-HHHHHHHHHHHHHHHHHHHHcCCCceecchHHHHhcCC---cHHHHHHHHHHHHH Confidence 100 011245799999874 46666777666555544543344443 22 1223221 110000001111 Q ss_pred -hh-ccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhc-ccCCCcchhHHHHHHHhhhHHHHHH Q lcl|NC_016654. 331 -SR-VGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSL-GLSDEVAQTATEASGKKDLTVKTTR 407 (533) Q Consensus 331 -~~-~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~-g~~~~~~~Tatai~~~~~~l~~~~~ 407 (533) +. ......+. ...++.++.+ +......++.+..+|+..+++|...+ |-..+|-.|+.+-... .+..++ T Consensus 249 ~~~~~~~~~~d~----~~~~e~~~~~--~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s~~Glasge~D~~~---yyd~i~ 319 (437) T protein:vir:52 249 IKSATNSLLLDA----ENEYDRKELT--FTGLKDLLTEFRNAVAGAADMPVTILFGQSVSGLASGDEDIQN---YHEAIR 319 (437) T ss_pred hcCCCceEEEcC----CcceEEEecC--cCCHHHHHHHHHHHHHHHhcCchhhhcCcCcccccccHHHHHH---HHHHHH Confidence 10 11111121 1234444433 33445677788889999999997555 5545566666644333 344455 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHH-------HHHHHhCCCCCHHHHHHH Q lcl|NC_016654. 408 AKA-RHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQT-------VQAWSVASAASTKTKVAY 479 (533) Q Consensus 408 ~~~-~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~-------~~~l~~aGi~S~et~v~~ 479 (533) .++ ..++..|..|+..++.- .+ +. . ..+++|.|++-...+..|.+++ +++++++|++|.+++... T Consensus 320 ~~Qe~~l~p~le~l~~~i~~~---~~-g~-~--~~~~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~g~i~~~e~r~~ 392 (437) T protein:vir:52 320 RLQETRLRPIFEIIDPLICNE---LF-GG-L--PADWWFEFVPLTTVKQEQQINMLNTFATAANTLIQNGVLNEYQIANE 392 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHH---hc-CC-C--CCcceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHH Confidence 555 56788888888865432 11 21 1 2369999999988887777665 555666777777665554 Q ss_pred h-----CCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCC Q lcl|NC_016654. 480 L-----HEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAV 529 (533) Q Consensus 480 l-----~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 529 (533) + ++..++++.+ .....++.....++++...+. +.+.+.++ T Consensus 393 L~~~g~~~~i~~~~~~---------~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 437 (437) T protein:vir:52 393 LRESGLFANISAEHIE---------ELKNADEFAGNFEEPEKMEGA-QVQNSEDQ 437 (437) T ss_pred HHhcCCCCCCCccccc---------cccCCCCCCCccCCCCCCCCC-CCCCCCCC Confidence 3 2233322110 000011111111111111111 11111111 No 87 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=99.44 E-value=1.6e-11 Score=79.83 Aligned_cols=482 Identities=11% Similarity=0.009 Sum_probs=230.1 Q ss_pred CCCCCCcCCCcC------cchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCccc Q lcl|NC_016654. 1 MSLPEANTAWPP------PELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAP 74 (533) Q Consensus 1 ~~~~~~~~~~pp------~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 74 (533) +-|++++-- += ..++.+...+.....|++ +-.+-.+||.+ +||.......... +.+ T Consensus 6 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~R~-~a~~d~~fy~G------~Qw~~~~~~~l~~----------~g~ 67 (714) T protein:vir:81 6 NTMATKNDN-GATPRFSQRQLQALCSDIDSQPKWRD-AANKACAYYDG------DQLPPEVLQVLKD----------RGQ 67 (714) T ss_pred ccccCCCCc-chhHHHHHHHHHHHHHHHHhhHHHHH-HHHHHHHhhcC------CCCCHHHHHHHHh----------cCC Confidence 556655422 11 123333334444444543 23333445543 2443322222111 123 Q ss_pred ceeecChHHHHHHHHHHhhcCCCceEeeCCC--c-------hHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEE Q lcl|NC_016654. 75 KRYHAPIPGVIAKLSTTELFSEQLKFLDAGK--S-------KEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVW 145 (533) Q Consensus 75 ~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~--~-------~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~ 145 (533) ..++.|+.+.+|+...++--...+.+.+.+. + +.++..+..+...+++......+...+.+.|-+|+.+++ T Consensus 68 p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~ 147 (714) T protein:vir:81 68 PMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRR 147 (714) T ss_pred CcEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecc Confidence 3689999999999999998888887777542 1 123455667788889999999999999999999999998 Q ss_pred cCCC-CCceEEEEEcCCeEEEEEe--cCCceEEEE--EEEEeec--------C--------------------------- Q lcl|NC_016654. 146 DPTI-ADNAWIDFVDADRAIPEFR--WGRLVAVTF--WSELAGG--------D--------------------------- 185 (533) Q Consensus 146 D~~~-~~~~~i~~v~~~~~~P~~~--~g~~~~v~f--~~~~~~~--------~--------------------------- 185 (533) |.+. ++.|+|..|++..++.-++ ...+.++-| +..+... + T Consensus 148 ~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~ 227 (714) T protein:vir:81 148 NSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSP 227 (714) T ss_pred ccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccc Confidence 7553 3568999999998886432 233444422 2221100 0 Q ss_pred ---------------------CceEEEEEEEecCeeEEEEEEeccCCcccceeehhh----------------------- Q lcl|NC_016654. 186 ---------------------GQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTD----------------------- 221 (533) Q Consensus 186 ---------------------~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~----------------------- 221 (533) .+..+++.|.|..-.+...++...+ |.-+.++. T Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~---g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~r 304 (714) T protein:vir:81 228 LMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSN---GRVVAFDKNNLMQAVAVASGRVQVKVGRVSR 304 (714) T ss_pred cccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCC---CceEEeCccCHHHHHHHhhcchhhhccccce Confidence 0011234444433222222222211 11111110 Q ss_pred ccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_016654. 222 HPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIG 301 (533) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~ 301 (533) +.......+.+...+ +...++-.+.|+|......+.. ..++| .+ ..++|..+.+|...|+..+.+. T Consensus 305 v~~~~~~g~~~L~~~-----~~p~p~~~fp~vp~~g~~~~~~----g~~~G--~v-r~~~d~Qr~~N~~~s~~~~~l~-- 370 (714) T protein:vir:81 305 IREAWFVGPHFIVDR-----PCSAPQGMFPLVPFWGYRKDKT----GEPYG--LI-SRAIPAQDEVNFRRIKLTWLLQ-- 370 (714) T ss_pred EEEEEEecCcccccC-----CCCCCCCceeEEEEeeeeeecc----Cceee--hh-hhchhHHHHHHHHHHHHHHhhc-- Confidence 000001111111000 0111111244444322211111 22444 22 3456888999999999988763 Q ss_pred cceeeechHHhcCCCC---ccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCC Q lcl|NC_016654. 302 AGKVHASESVLTNLGM---GQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYS 378 (533) Q Consensus 302 ~~~i~v~~~~l~~~~~---~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s 378 (533) ..++++.+..+..... .....++.. -.|... ..++......++...+.--...+++.++.....+...+|++ T Consensus 371 ~~~~~~~~~a~~~~d~~~~e~~arp~~v-i~~~p~----~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~ 445 (714) T protein:vir:81 371 AKRVIMDEDATQLSDNDLMEQIERPDGI-IKLNPV----RKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVY 445 (714) T ss_pred CCceeeecCcccccHHHHHHhccCCCCc-eeeccc----ccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCC Confidence 3334443222211100 000111100 001100 01111111123333322223577888888888999999999 Q ss_pred hhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCC-------------------- Q lcl|NC_016654. 379 PVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAA-------------------- 438 (533) Q Consensus 379 ~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~-------------------- 438 (533) ...+|-. ++..||.+|..+..............+..+++.+.+.+|.+....+...... T Consensus 446 ~~~lG~~-~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~ 524 (714) T protein:vir:81 446 SAFLGQD-SGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAE 524 (714) T ss_pred hHHcCCC-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccc Confidence 9999965 4567999999888877777777777788888887777776644322111100 Q ss_pred ------------CceeEEEEeCCCCCCCHHHHHHHHHHHHhC-----CCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhh Q lcl|NC_016654. 439 ------------PSEELELEWPKFARESDLAKAQTVQAWSVA-----SAASTKTKVAYLHEDWDDERVQEEADLIDNANT 501 (533) Q Consensus 439 ------------~~~~v~i~f~d~i~~d~~e~a~~~~~l~~a-----Gi~S~et~v~~l~~~~~dee~~~El~rI~~E~~ 501 (533) ..++|.|.=..+.+..+.+.++.+.++.++ +.+.....+..+ ++. -+++.+++|++-++ T Consensus 525 ~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~--d~p--~~~el~~~ir~~~~ 600 (714) T protein:vir:81 525 GDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLL--DVP--QKQEFVERIRAALG 600 (714) T ss_pred cCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhc--CCC--CHHHHHHHHHHHcC Confidence 011233333333333456777777777542 111223333322 232 24556667765432 Q ss_pred cccCccccccccCCCCCCCCCCCC-CCCC---CC---CC Q lcl|NC_016654. 502 VSAPTFGFGTDQPPLPTENDPATD-PEAV---DE---GE 533 (533) Q Consensus 502 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~---~d---~~ 533 (533) ...+. .+..+++...... .+.+ .+ .+ T Consensus 601 ~~~~~------~~~~~e~q~~~~~~q~~~~~q~~lq~~~ 633 (714) T protein:vir:81 601 TPKSP------DEMTPEEQEVAAQQQALQQQQAELQMRE 633 (714) T ss_pred CCCCc------cccchhhHHHHHHHHHHHHHHHHHHHHH Confidence 21110 0000000000000 0000 00 00 No 88 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=99.44 E-value=1.6e-11 Score=79.83 Aligned_cols=482 Identities=11% Similarity=0.009 Sum_probs=230.1 Q ss_pred CCCCCCcCCCcC------cchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCccc Q lcl|NC_016654. 1 MSLPEANTAWPP------PELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAP 74 (533) Q Consensus 1 ~~~~~~~~~~pp------~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 74 (533) +-|++++-- += ..++.+...+.....|++ +-.+-.+||.+ +||.......... +.+ T Consensus 6 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~R~-~a~~d~~fy~G------~Qw~~~~~~~l~~----------~g~ 67 (714) T protein:vir:99 6 NTMATKNDN-GATPRFSQRQLQALCSDIDSQPKWRD-AANKACAYYDG------DQLPPEVLQVLKD----------RGQ 67 (714) T ss_pred ccccCCCCc-chhHHHHHHHHHHHHHHHHhhHHHHH-HHHHHHHhhcC------CCCCHHHHHHHHh----------cCC Confidence 556655422 11 123333334444444543 23333445543 2443322222111 123 Q ss_pred ceeecChHHHHHHHHHHhhcCCCceEeeCCC--c-------hHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEE Q lcl|NC_016654. 75 KRYHAPIPGVIAKLSTTELFSEQLKFLDAGK--S-------KEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVW 145 (533) Q Consensus 75 ~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~--~-------~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~ 145 (533) ..++.|+.+.+|+...++--...+.+.+.+. + +.++..+..+...+++......+...+.+.|-+|+.+++ T Consensus 68 p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~ 147 (714) T protein:vir:99 68 PMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRR 147 (714) T ss_pred CcEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecc Confidence 3689999999999999998888887777542 1 123455667788889999999999999999999999998 Q ss_pred cCCC-CCceEEEEEcCCeEEEEEe--cCCceEEEE--EEEEeec--------C--------------------------- Q lcl|NC_016654. 146 DPTI-ADNAWIDFVDADRAIPEFR--WGRLVAVTF--WSELAGG--------D--------------------------- 185 (533) Q Consensus 146 D~~~-~~~~~i~~v~~~~~~P~~~--~g~~~~v~f--~~~~~~~--------~--------------------------- 185 (533) |.+. ++.|+|..|++..++.-++ ...+.++-| +..+... + T Consensus 148 ~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~ 227 (714) T protein:vir:99 148 NSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSP 227 (714) T ss_pred ccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccc Confidence 7553 3568999999998886432 233444422 2221100 0 Q ss_pred ---------------------CceEEEEEEEecCeeEEEEEEeccCCcccceeehhh----------------------- Q lcl|NC_016654. 186 ---------------------GQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTD----------------------- 221 (533) Q Consensus 186 ---------------------~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~----------------------- 221 (533) .+..+++.|.|..-.+...++...+ |.-+.++. T Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~---g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~r 304 (714) T protein:vir:99 228 LMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSN---GRVVAFDKNNLMQAVAVASGRVQVKVGRVSR 304 (714) T ss_pred cccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCC---CceEEeCccCHHHHHHHhhcchhhhccccce Confidence 0011234444433222222222211 11111110 Q ss_pred ccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_016654. 222 HPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIG 301 (533) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~ 301 (533) +.......+.+...+ +...++-.+.|+|......+.. ..++| .+ ..++|..+.+|...|+..+.+. T Consensus 305 v~~~~~~g~~~L~~~-----~~p~p~~~fp~vp~~g~~~~~~----g~~~G--~v-r~~~d~Qr~~N~~~s~~~~~l~-- 370 (714) T protein:vir:99 305 IREAWFVGPHFIVDR-----PCSAPQGMFPLVPFWGYRKDKT----GEPYG--LI-SRAIPAQDEVNFRRIKLTWLLQ-- 370 (714) T ss_pred EEEEEEecCcccccC-----CCCCCCCceeEEEEeeeeeecc----Cceee--hh-hhchhHHHHHHHHHHHHHHhhc-- Confidence 000001111111000 0111111244444322211111 22444 22 3456888999999999988763 Q ss_pred cceeeechHHhcCCCC---ccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCC Q lcl|NC_016654. 302 AGKVHASESVLTNLGM---GQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYS 378 (533) Q Consensus 302 ~~~i~v~~~~l~~~~~---~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s 378 (533) ..++++.+..+..... .....++.. -.|... ..++......++...+.--...+++.++.....+...+|++ T Consensus 371 ~~~~~~~~~a~~~~d~~~~e~~arp~~v-i~~~p~----~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~ 445 (714) T protein:vir:99 371 AKRVIMDEDATQLSDNDLMEQIERPDGI-IKLNPV----RKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVY 445 (714) T ss_pred CCceeeecCcccccHHHHHHhccCCCCc-eeeccc----ccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCC Confidence 3334443222211100 000111100 001100 01111111123333322223577888888888999999999 Q ss_pred hhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCC-------------------- Q lcl|NC_016654. 379 PVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAA-------------------- 438 (533) Q Consensus 379 ~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~-------------------- 438 (533) ...+|-. ++..||.+|..+..............+..+++.+.+.+|.+....+...... T Consensus 446 ~~~lG~~-~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~ 524 (714) T protein:vir:99 446 SAFLGQD-SGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAE 524 (714) T ss_pred hHHcCCC-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccc Confidence 9999965 4567999999888877777777777788888887777776644322111100 Q ss_pred ------------CceeEEEEeCCCCCCCHHHHHHHHHHHHhC-----CCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhh Q lcl|NC_016654. 439 ------------PSEELELEWPKFARESDLAKAQTVQAWSVA-----SAASTKTKVAYLHEDWDDERVQEEADLIDNANT 501 (533) Q Consensus 439 ------------~~~~v~i~f~d~i~~d~~e~a~~~~~l~~a-----Gi~S~et~v~~l~~~~~dee~~~El~rI~~E~~ 501 (533) ..++|.|.=..+.+..+.+.++.+.++.++ +.+.....+..+ ++. -+++.+++|++-++ T Consensus 525 ~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~--d~p--~~~el~~~ir~~~~ 600 (714) T protein:vir:99 525 GDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLL--DVP--QKQEFVERIRAALG 600 (714) T ss_pred cCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhc--CCC--CHHHHHHHHHHHcC Confidence 011233333333333456777777777542 111223333322 232 24556667765432 Q ss_pred cccCccccccccCCCCCCCCCCCC-CCCC---CC---CC Q lcl|NC_016654. 502 VSAPTFGFGTDQPPLPTENDPATD-PEAV---DE---GE 533 (533) Q Consensus 502 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~---~d---~~ 533 (533) ...+. .+..+++...... .+.+ .+ .+ T Consensus 601 ~~~~~------~~~~~e~q~~~~~~q~~~~~q~~lq~~~ 633 (714) T protein:vir:99 601 TPKSP------DEMTPEEQEVAAQQQALQQQQAELQMRE 633 (714) T ss_pred CCCCc------cccchhhHHHHHHHHHHHHHHHHHHHHH Confidence 21110 0000000000000 0000 00 00 No 89 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=99.44 E-value=1.6e-11 Score=79.83 Aligned_cols=482 Identities=11% Similarity=0.009 Sum_probs=230.1 Q ss_pred CCCCCCcCCCcC------cchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCccc Q lcl|NC_016654. 1 MSLPEANTAWPP------PELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAP 74 (533) Q Consensus 1 ~~~~~~~~~~pp------~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 74 (533) +-|++++-- += ..++.+...+.....|++ +-.+-.+||.+ +||.......... +.+ T Consensus 6 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~R~-~a~~d~~fy~G------~Qw~~~~~~~l~~----------~g~ 67 (714) T protein:vir:10 6 NTMATKNDN-GATPRFSQRQLQALCSDIDSQPKWRD-AANKACAYYDG------DQLPPEVLQVLKD----------RGQ 67 (714) T ss_pred ccccCCCCc-chhHHHHHHHHHHHHHHHHhhHHHHH-HHHHHHHhhcC------CCCCHHHHHHHHh----------cCC Confidence 556655422 11 123333334444444543 23333445543 2443322222111 123 Q ss_pred ceeecChHHHHHHHHHHhhcCCCceEeeCCC--c-------hHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEE Q lcl|NC_016654. 75 KRYHAPIPGVIAKLSTTELFSEQLKFLDAGK--S-------KEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVW 145 (533) Q Consensus 75 ~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~--~-------~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~ 145 (533) ..++.|+.+.+|+...++--...+.+.+.+. + +.++..+..+...+++......+...+.+.|-+|+.+++ T Consensus 68 p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~ 147 (714) T protein:vir:10 68 PMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRR 147 (714) T ss_pred CcEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecc Confidence 3689999999999999998888887777542 1 123455667788889999999999999999999999998 Q ss_pred cCCC-CCceEEEEEcCCeEEEEEe--cCCceEEEE--EEEEeec--------C--------------------------- Q lcl|NC_016654. 146 DPTI-ADNAWIDFVDADRAIPEFR--WGRLVAVTF--WSELAGG--------D--------------------------- 185 (533) Q Consensus 146 D~~~-~~~~~i~~v~~~~~~P~~~--~g~~~~v~f--~~~~~~~--------~--------------------------- 185 (533) |.+. ++.|+|..|++..++.-++ ...+.++-| +..+... + T Consensus 148 ~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~ 227 (714) T protein:vir:10 148 NSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSP 227 (714) T ss_pred ccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccc Confidence 7553 3568999999998886432 233444422 2221100 0 Q ss_pred ---------------------CceEEEEEEEecCeeEEEEEEeccCCcccceeehhh----------------------- Q lcl|NC_016654. 186 ---------------------GQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTD----------------------- 221 (533) Q Consensus 186 ---------------------~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~----------------------- 221 (533) .+..+++.|.|..-.+...++...+ |.-+.++. T Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~---g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~r 304 (714) T protein:vir:10 228 LMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSN---GRVVAFDKNNLMQAVAVASGRVQVKVGRVSR 304 (714) T ss_pred cccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCC---CceEEeCccCHHHHHHHhhcchhhhccccce Confidence 0011234444433222222222211 11111110 Q ss_pred ccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_016654. 222 HPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIG 301 (533) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~ 301 (533) +.......+.+...+ +...++-.+.|+|......+.. ..++| .+ ..++|..+.+|...|+..+.+. T Consensus 305 v~~~~~~g~~~L~~~-----~~p~p~~~fp~vp~~g~~~~~~----g~~~G--~v-r~~~d~Qr~~N~~~s~~~~~l~-- 370 (714) T protein:vir:10 305 IREAWFVGPHFIVDR-----PCSAPQGMFPLVPFWGYRKDKT----GEPYG--LI-SRAIPAQDEVNFRRIKLTWLLQ-- 370 (714) T ss_pred EEEEEEecCcccccC-----CCCCCCCceeEEEEeeeeeecc----Cceee--hh-hhchhHHHHHHHHHHHHHHhhc-- Confidence 000001111111000 0111111244444322211111 22444 22 3456888999999999988763 Q ss_pred cceeeechHHhcCCCC---ccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCC Q lcl|NC_016654. 302 AGKVHASESVLTNLGM---GQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYS 378 (533) Q Consensus 302 ~~~i~v~~~~l~~~~~---~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s 378 (533) ..++++.+..+..... .....++.. -.|... ..++......++...+.--...+++.++.....+...+|++ T Consensus 371 ~~~~~~~~~a~~~~d~~~~e~~arp~~v-i~~~p~----~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~ 445 (714) T protein:vir:10 371 AKRVIMDEDATQLSDNDLMEQIERPDGI-IKLNPV----RKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVY 445 (714) T ss_pred CCceeeecCcccccHHHHHHhccCCCCc-eeeccc----ccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCC Confidence 3334443222211100 000111100 001100 01111111123333322223577888888888999999999 Q ss_pred hhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCC-------------------- Q lcl|NC_016654. 379 PVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAA-------------------- 438 (533) Q Consensus 379 ~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~-------------------- 438 (533) ...+|-. ++..||.+|..+..............+..+++.+.+.+|.+....+...... T Consensus 446 ~~~lG~~-~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~ 524 (714) T protein:vir:10 446 SAFLGQD-SGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAE 524 (714) T ss_pred hHHcCCC-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccc Confidence 9999965 4567999999888877777777777788888887777776644322111100 Q ss_pred ------------CceeEEEEeCCCCCCCHHHHHHHHHHHHhC-----CCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhh Q lcl|NC_016654. 439 ------------PSEELELEWPKFARESDLAKAQTVQAWSVA-----SAASTKTKVAYLHEDWDDERVQEEADLIDNANT 501 (533) Q Consensus 439 ------------~~~~v~i~f~d~i~~d~~e~a~~~~~l~~a-----Gi~S~et~v~~l~~~~~dee~~~El~rI~~E~~ 501 (533) ..++|.|.=..+.+..+.+.++.+.++.++ +.+.....+..+ ++. -+++.+++|++-++ T Consensus 525 ~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~--d~p--~~~el~~~ir~~~~ 600 (714) T protein:vir:10 525 GDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLL--DVP--QKQEFVERIRAALG 600 (714) T ss_pred cCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhc--CCC--CHHHHHHHHHHHcC Confidence 011233333333333456777777777542 111223333322 232 24556667765432 Q ss_pred cccCccccccccCCCCCCCCCCCC-CCCC---CC---CC Q lcl|NC_016654. 502 VSAPTFGFGTDQPPLPTENDPATD-PEAV---DE---GE 533 (533) Q Consensus 502 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~---~d---~~ 533 (533) ...+. .+..+++...... .+.+ .+ .+ T Consensus 601 ~~~~~------~~~~~e~q~~~~~~q~~~~~q~~lq~~~ 633 (714) T protein:vir:10 601 TPKSP------DEMTPEEQEVAAQQQALQQQQAELQMRE 633 (714) T ss_pred CCCCc------cccchhhHHHHHHHHHHHHHHHHHHHHH Confidence 21110 0000000000000 0000 00 00 No 90 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=99.44 E-value=1.6e-11 Score=79.83 Aligned_cols=482 Identities=11% Similarity=0.009 Sum_probs=230.1 Q ss_pred CCCCCCcCCCcC------cchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCccc Q lcl|NC_016654. 1 MSLPEANTAWPP------PELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAP 74 (533) Q Consensus 1 ~~~~~~~~~~pp------~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 74 (533) +-|++++-- += ..++.+...+.....|++ +-.+-.+||.+ +||.......... +.+ T Consensus 6 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~R~-~a~~d~~fy~G------~Qw~~~~~~~l~~----------~g~ 67 (714) T protein:vir:32 6 NTMATKNDN-GATPRFSQRQLQALCSDIDSQPKWRD-AANKACAYYDG------DQLPPEVLQVLKD----------RGQ 67 (714) T ss_pred ccccCCCCc-chhHHHHHHHHHHHHHHHHhhHHHHH-HHHHHHHhhcC------CCCCHHHHHHHHh----------cCC Confidence 556655422 11 123333334444444543 23333445543 2443322222111 123 Q ss_pred ceeecChHHHHHHHHHHhhcCCCceEeeCCC--c-------hHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEE Q lcl|NC_016654. 75 KRYHAPIPGVIAKLSTTELFSEQLKFLDAGK--S-------KEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVW 145 (533) Q Consensus 75 ~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~--~-------~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~ 145 (533) ..++.|+.+.+|+...++--...+.+.+.+. + +.++..+..+...+++......+...+.+.|-+|+.+++ T Consensus 68 p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~ 147 (714) T protein:vir:32 68 PMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRR 147 (714) T ss_pred CcEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecc Confidence 3689999999999999998888887777542 1 123455667788889999999999999999999999998 Q ss_pred cCCC-CCceEEEEEcCCeEEEEEe--cCCceEEEE--EEEEeec--------C--------------------------- Q lcl|NC_016654. 146 DPTI-ADNAWIDFVDADRAIPEFR--WGRLVAVTF--WSELAGG--------D--------------------------- 185 (533) Q Consensus 146 D~~~-~~~~~i~~v~~~~~~P~~~--~g~~~~v~f--~~~~~~~--------~--------------------------- 185 (533) |.+. ++.|+|..|++..++.-++ ...+.++-| +..+... + T Consensus 148 ~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~ 227 (714) T protein:vir:32 148 NSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSP 227 (714) T ss_pred ccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccc Confidence 7553 3568999999998886432 233444422 2221100 0 Q ss_pred ---------------------CceEEEEEEEecCeeEEEEEEeccCCcccceeehhh----------------------- Q lcl|NC_016654. 186 ---------------------GQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTD----------------------- 221 (533) Q Consensus 186 ---------------------~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~----------------------- 221 (533) .+..+++.|.|..-.+...++...+ |.-+.++. T Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~---g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~r 304 (714) T protein:vir:32 228 LMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSN---GRVVAFDKNNLMQAVAVASGRVQVKVGRVSR 304 (714) T ss_pred cccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCC---CceEEeCccCHHHHHHHhhcchhhhccccce Confidence 0011234444433222222222211 11111110 Q ss_pred ccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_016654. 222 HPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIG 301 (533) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~ 301 (533) +.......+.+...+ +...++-.+.|+|......+.. ..++| .+ ..++|..+.+|...|+..+.+. T Consensus 305 v~~~~~~g~~~L~~~-----~~p~p~~~fp~vp~~g~~~~~~----g~~~G--~v-r~~~d~Qr~~N~~~s~~~~~l~-- 370 (714) T protein:vir:32 305 IREAWFVGPHFIVDR-----PCSAPQGMFPLVPFWGYRKDKT----GEPYG--LI-SRAIPAQDEVNFRRIKLTWLLQ-- 370 (714) T ss_pred EEEEEEecCcccccC-----CCCCCCCceeEEEEeeeeeecc----Cceee--hh-hhchhHHHHHHHHHHHHHHhhc-- Confidence 000001111111000 0111111244444322211111 22444 22 3456888999999999988763 Q ss_pred cceeeechHHhcCCCC---ccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCC Q lcl|NC_016654. 302 AGKVHASESVLTNLGM---GQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYS 378 (533) Q Consensus 302 ~~~i~v~~~~l~~~~~---~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s 378 (533) ..++++.+..+..... .....++.. -.|... ..++......++...+.--...+++.++.....+...+|++ T Consensus 371 ~~~~~~~~~a~~~~d~~~~e~~arp~~v-i~~~p~----~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~ 445 (714) T protein:vir:32 371 AKRVIMDEDATQLSDNDLMEQIERPDGI-IKLNPV----RKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVY 445 (714) T ss_pred CCceeeecCcccccHHHHHHhccCCCCc-eeeccc----ccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCC Confidence 3334443222211100 000111100 001100 01111111123333322223577888888888999999999 Q ss_pred hhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCC-------------------- Q lcl|NC_016654. 379 PVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAA-------------------- 438 (533) Q Consensus 379 ~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~-------------------- 438 (533) ...+|-. ++..||.+|..+..............+..+++.+.+.+|.+....+...... T Consensus 446 ~~~lG~~-~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~ 524 (714) T protein:vir:32 446 SAFLGQD-SGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAE 524 (714) T ss_pred hHHcCCC-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccc Confidence 9999965 4567999999888877777777777788888887777776644322111100 Q ss_pred ------------CceeEEEEeCCCCCCCHHHHHHHHHHHHhC-----CCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhh Q lcl|NC_016654. 439 ------------PSEELELEWPKFARESDLAKAQTVQAWSVA-----SAASTKTKVAYLHEDWDDERVQEEADLIDNANT 501 (533) Q Consensus 439 ------------~~~~v~i~f~d~i~~d~~e~a~~~~~l~~a-----Gi~S~et~v~~l~~~~~dee~~~El~rI~~E~~ 501 (533) ..++|.|.=..+.+..+.+.++.+.++.++ +.+.....+..+ ++. -+++.+++|++-++ T Consensus 525 ~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~--d~p--~~~el~~~ir~~~~ 600 (714) T protein:vir:32 525 GDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLL--DVP--QKQEFVERIRAALG 600 (714) T ss_pred cCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhc--CCC--CHHHHHHHHHHHcC Confidence 011233333333333456777777777542 111223333322 232 24556667765432 Q ss_pred cccCccccccccCCCCCCCCCCCC-CCCC---CC---CC Q lcl|NC_016654. 502 VSAPTFGFGTDQPPLPTENDPATD-PEAV---DE---GE 533 (533) Q Consensus 502 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~---~d---~~ 533 (533) ...+. .+..+++...... .+.+ .+ .+ T Consensus 601 ~~~~~------~~~~~e~q~~~~~~q~~~~~q~~lq~~~ 633 (714) T protein:vir:32 601 TPKSP------DEMTPEEQEVAAQQQALQQQQAELQMRE 633 (714) T ss_pred CCCCc------cccchhhHHHHHHHHHHHHHHHHHHHHH Confidence 21110 0000000000000 0000 00 00 No 91 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=99.44 E-value=1.6e-11 Score=79.83 Aligned_cols=482 Identities=11% Similarity=0.009 Sum_probs=230.1 Q ss_pred CCCCCCcCCCcC------cchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCccc Q lcl|NC_016654. 1 MSLPEANTAWPP------PELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAP 74 (533) Q Consensus 1 ~~~~~~~~~~pp------~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 74 (533) +-|++++-- += ..++.+...+.....|++ +-.+-.+||.+ +||.......... +.+ T Consensus 6 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~R~-~a~~d~~fy~G------~Qw~~~~~~~l~~----------~g~ 67 (714) T protein:vir:27 6 NTMATKNDN-GATPRFSQRQLQALCSDIDSQPKWRD-AANKACAYYDG------DQLPPEVLQVLKD----------RGQ 67 (714) T ss_pred ccccCCCCc-chhHHHHHHHHHHHHHHHHhhHHHHH-HHHHHHHhhcC------CCCCHHHHHHHHh----------cCC Confidence 556655422 11 123333334444444543 23333445543 2443322222111 123 Q ss_pred ceeecChHHHHHHHHHHhhcCCCceEeeCCC--c-------hHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEE Q lcl|NC_016654. 75 KRYHAPIPGVIAKLSTTELFSEQLKFLDAGK--S-------KEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVW 145 (533) Q Consensus 75 ~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~--~-------~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~ 145 (533) ..++.|+.+.+|+...++--...+.+.+.+. + +.++..+..+...+++......+...+.+.|-+|+.+++ T Consensus 68 p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~ 147 (714) T protein:vir:27 68 PMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRR 147 (714) T ss_pred CcEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecc Confidence 3689999999999999998888887777542 1 123455667788889999999999999999999999998 Q ss_pred cCCC-CCceEEEEEcCCeEEEEEe--cCCceEEEE--EEEEeec--------C--------------------------- Q lcl|NC_016654. 146 DPTI-ADNAWIDFVDADRAIPEFR--WGRLVAVTF--WSELAGG--------D--------------------------- 185 (533) Q Consensus 146 D~~~-~~~~~i~~v~~~~~~P~~~--~g~~~~v~f--~~~~~~~--------~--------------------------- 185 (533) |.+. ++.|+|..|++..++.-++ ...+.++-| +..+... + T Consensus 148 ~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~ 227 (714) T protein:vir:27 148 NSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSP 227 (714) T ss_pred ccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccc Confidence 7553 3568999999998886432 233444422 2221100 0 Q ss_pred ---------------------CceEEEEEEEecCeeEEEEEEeccCCcccceeehhh----------------------- Q lcl|NC_016654. 186 ---------------------GQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTD----------------------- 221 (533) Q Consensus 186 ---------------------~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~----------------------- 221 (533) .+..+++.|.|..-.+...++...+ |.-+.++. T Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~---g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~r 304 (714) T protein:vir:27 228 LMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSN---GRVVAFDKNNLMQAVAVASGRVQVKVGRVSR 304 (714) T ss_pred cccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCC---CceEEeCccCHHHHHHHhhcchhhhccccce Confidence 0011234444433222222222211 11111110 Q ss_pred ccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_016654. 222 HPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIG 301 (533) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~ 301 (533) +.......+.+...+ +...++-.+.|+|......+.. ..++| .+ ..++|..+.+|...|+..+.+. T Consensus 305 v~~~~~~g~~~L~~~-----~~p~p~~~fp~vp~~g~~~~~~----g~~~G--~v-r~~~d~Qr~~N~~~s~~~~~l~-- 370 (714) T protein:vir:27 305 IREAWFVGPHFIVDR-----PCSAPQGMFPLVPFWGYRKDKT----GEPYG--LI-SRAIPAQDEVNFRRIKLTWLLQ-- 370 (714) T ss_pred EEEEEEecCcccccC-----CCCCCCCceeEEEEeeeeeecc----Cceee--hh-hhchhHHHHHHHHHHHHHHhhc-- Confidence 000001111111000 0111111244444322211111 22444 22 3456888999999999988763 Q ss_pred cceeeechHHhcCCCC---ccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCC Q lcl|NC_016654. 302 AGKVHASESVLTNLGM---GQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYS 378 (533) Q Consensus 302 ~~~i~v~~~~l~~~~~---~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s 378 (533) ..++++.+..+..... .....++.. -.|... ..++......++...+.--...+++.++.....+...+|++ T Consensus 371 ~~~~~~~~~a~~~~d~~~~e~~arp~~v-i~~~p~----~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~ 445 (714) T protein:vir:27 371 AKRVIMDEDATQLSDNDLMEQIERPDGI-IKLNPV----RKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVY 445 (714) T ss_pred CCceeeecCcccccHHHHHHhccCCCCc-eeeccc----ccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCC Confidence 3334443222211100 000111100 001100 01111111123333322223577888888888999999999 Q ss_pred hhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCC-------------------- Q lcl|NC_016654. 379 PVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAA-------------------- 438 (533) Q Consensus 379 ~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~-------------------- 438 (533) ...+|-. ++..||.+|..+..............+..+++.+.+.+|.+....+...... T Consensus 446 ~~~lG~~-~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~ 524 (714) T protein:vir:27 446 SAFLGQD-SGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAE 524 (714) T ss_pred hHHcCCC-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccc Confidence 9999965 4567999999888877777777777788888887777776644322111100 Q ss_pred ------------CceeEEEEeCCCCCCCHHHHHHHHHHHHhC-----CCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhh Q lcl|NC_016654. 439 ------------PSEELELEWPKFARESDLAKAQTVQAWSVA-----SAASTKTKVAYLHEDWDDERVQEEADLIDNANT 501 (533) Q Consensus 439 ------------~~~~v~i~f~d~i~~d~~e~a~~~~~l~~a-----Gi~S~et~v~~l~~~~~dee~~~El~rI~~E~~ 501 (533) ..++|.|.=..+.+..+.+.++.+.++.++ +.+.....+..+ ++. -+++.+++|++-++ T Consensus 525 ~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~--d~p--~~~el~~~ir~~~~ 600 (714) T protein:vir:27 525 GDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLL--DVP--QKQEFVERIRAALG 600 (714) T ss_pred cCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhc--CCC--CHHHHHHHHHHHcC Confidence 011233333333333456777777777542 111223333322 232 24556667765432 Q ss_pred cccCccccccccCCCCCCCCCCCC-CCCC---CC---CC Q lcl|NC_016654. 502 VSAPTFGFGTDQPPLPTENDPATD-PEAV---DE---GE 533 (533) Q Consensus 502 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~---~d---~~ 533 (533) ...+. .+..+++...... .+.+ .+ .+ T Consensus 601 ~~~~~------~~~~~e~q~~~~~~q~~~~~q~~lq~~~ 633 (714) T protein:vir:27 601 TPKSP------DEMTPEEQEVAAQQQALQQQQAELQMRE 633 (714) T ss_pred CCCCc------cccchhhHHHHHHHHHHHHHHHHHHHHH Confidence 21110 0000000000000 0000 00 00 No 92 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.43 E-value=2e-11 Score=79.30 Aligned_cols=461 Identities=12% Similarity=0.068 Sum_probs=206.6 Q ss_pred CCCCCCcCCCcCcchHHHHHHHH-------hhhHhhcC--CHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVA-------ESHVWWEG--DLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATG 71 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~-------~~~~w~~g--d~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 71 (533) -+.++++ +|..........+. ....|... +++.. ....+..+..+ +|. T Consensus 9 ~~~~a~~--~~~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~-----------~~~~~~~lr~R-aRd--------- 65 (553) T protein:vir:63 9 LSEVTSG--RPEQSASLGGGGLEGASRLSRETVSWNPSLRSPDAL-----------INPLKRIADAR-GRD--------- 65 (553) T ss_pred hcccccc--cchhhhhhhcccccccccCCCcccccccCCCChHHH-----------HHHHHHHHHHH-HHH--------- Confidence 2222333 22222110000000 00011110 01100 00111111111 111 Q ss_pred cccceeecChHHHHHHHHHHhhcCCCceEeeCCC-----------chHHHHHHHHHHh--------------hccHHHHH Q lcl|NC_016654. 72 RAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGK-----------SKEVQARADLIFN--------------TPRFHSSL 126 (533) Q Consensus 72 ~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~-----------~~~~~~~l~~i~~--------------~n~f~~~~ 126 (533) -....++++-+++.+++.++|...++....+ ++.+++.+++.|+ ..+|.... T Consensus 66 ---L~rNn~~a~~av~~~~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q 142 (553) T protein:vir:63 66 ---MADNDGFTNGAVGYQRDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLI 142 (553) T ss_pred ---HHhcChHHHHHHHHHHHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHH Confidence 1235579999999999999998766654321 2334444444332 11466667 Q ss_pred HHHHHHHhhhCCEEEEEEEcCCCCC--ceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCe-eEEE Q lcl|NC_016654. 127 VEAGESCSALSGSFQRIVWDPTIAD--NAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESG-YIVH 203 (533) Q Consensus 127 ~~~~~~~~~~G~~~~~~~~D~~~~~--~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~-~I~~ 203 (533) ..++...+..|.+++++.|.+..+. ..++..++|+++--... + .++..+.--+|+-..| .+-| T Consensus 143 ~l~~r~~~~dGE~~~~~~~~~~~~~~~~~~lq~ie~drl~~~~~-~-------------~~~~~i~~GVE~d~~Gr~vaY 208 (553) T protein:vir:63 143 RLGVVGYVKTGEVLATAEWDRAANRPYATCFQMVSTDRLSNPYQ-Q-------------LDTPTLRRGVQYDKRGRPQGY 208 (553) T ss_pred HHHHHHHHhCCceEEEeeeccCCCCcccceEEEechhhcCCCCC-C-------------CCCCeeEeeeEECCCCceEEE Confidence 7777788899999999988764321 35788888876543211 0 0111122223332222 2223 Q ss_pred EEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHH Q lcl|NC_016654. 204 AVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPT 283 (533) Q Consensus 204 ~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~l 283 (533) .+++..+....... .....|..+.. ....+-.....+|.| .. + ....|+|.|+..|.. T Consensus 209 ~i~~~hPgd~~~~~--~~~~~~~r~~~---------~~~v~a~~vlH~f~~-~r-------~--gQ~RGis~lapvl~~- 266 (553) T protein:vir:63 209 WIQVAHPGDLYQMA--PDMYKWKFVQQ---------SKPWGRRQVIHILEP-RE-------P--DQSRGIADIVSGLKD- 266 (553) T ss_pred EeeccCCCcccccc--ccccceeeecc---------ccccChhHheecccc-cC-------C--CcccCCchHHHHHHH- Confidence 33333222100000 00000000000 000111111112222 11 1 134699999998754 Q ss_pred HHHHHHHHHH-HHHHHHhCcceeeech-----HHhcCCCCcc--c--cccC----cch---------hhhhhcccccccc Q lcl|NC_016654. 284 FHELDRIYSS-LMRDFRIGAGKVHASE-----SVLTNLGMGQ--G--VSLD----EEQ---------EVYSRVGSGGFNA 340 (533) Q Consensus 284 id~lD~~~s~-~~~~~~~~~~~i~v~~-----~~l~~~~~~~--~--~~~d----~~~---------~~~~~~~~~~~~~ 340 (533) +..++.-.+. +.+..-.+--..||-. ......+.+. + .... ... ....+..+..... T Consensus 267 l~~l~~y~daeL~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~p 346 (553) T protein:vir:63 267 MRMAKRFKEMSLQNAVINASYAAAIESELPPEFIHSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFP 346 (553) T ss_pred HHHHhHHHHHHHHHHHHhhhheeeeecCCChhhhhhhcccccccccccccccccccccccccccccceeecCceeeecCC Confidence 4556544433 3333222233334311 1110000000 0 0000 000 0011111111222 Q ss_pred ccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc--hhHHHHHHHhhhHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 341 NGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA--QTATEASGKKDLTVKTTRAKARHFGSALG 418 (533) Q Consensus 341 ~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~--~Tatai~~~~~~l~~~~~~~~~~~~~al~ 418 (533) | ..++.+++.-+..+|..-.+.+++.|....|+|++.+..|.+++ .|+.+-.....+.+. ..+..|-..+. T Consensus 347 G----e~i~~~~p~~p~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS~R~~~~e~~r~~~---~~q~~~~~~~~ 419 (553) T protein:vir:63 347 G----TKLNLKPMGTPGGVGSEFEASLNRHLASAFGMSYEEFTRDFSKANYSSIQAGIAMTRRFLE---GRKKMCADRLA 419 (553) T ss_pred C----CeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHH---HHHHHHHHHHH Confidence 2 23677777777778888888999999999999999997665432 344444444444443 34444444443 Q ss_pred HHHHHHHHHHHhhccCCCCCCce--------------eEEEEeCCC--CCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCC Q lcl|NC_016654. 419 PLSTTCLRVDAIKFPGKGAAPSE--------------ELELEWPKF--ARESDLAKAQTVQAWSVASAASTKTKVAYLHE 482 (533) Q Consensus 419 ~li~~il~l~~~~~~~~~~~~~~--------------~v~i~f~d~--i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~ 482 (533) +-|+.. ||....+.|....+.. -+.+.|--+ ...|+..+++....++.+|+.|.+..+++. T Consensus 420 ~pi~~~-wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~t~~~~~a~~-- 496 (553) T protein:vir:63 420 TEFFTL-WLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVMRIDAGLSTYEREIARL-- 496 (553) T ss_pred HHHHHH-HHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHh-- Confidence 333221 3334444444332221 134555433 446999999999999999999999999986 Q ss_pred CCCHHHHHHHHHHHHH---hhhcc---cCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 483 DWDDERVQEEADLIDN---ANTVS---APTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 483 ~~~dee~~~El~rI~~---E~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +.|-+++.+++++-.+ +.+.. ++....+..+.....+.++...+..+++|| T Consensus 497 G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (553) T protein:vir:63 497 GGDFRKSFAQRAREDALLKKYGLTFNLSAKRSLGDGRDAATGIAEDPAAAQTSQQGE 553 (553) T ss_pred CCCHHHHHHHHHHHHHHHHHcCCCCCCCCccccCCCcccCCCCCCCCCCCCcccccC Confidence 3455545444332111 11211 111111111222222233333446666677 No 93 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=99.38 E-value=3.9e-11 Score=77.66 Aligned_cols=489 Identities=10% Similarity=-0.011 Sum_probs=239.5 Q ss_pred CCCC-CCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeec Q lcl|NC_016654. 1 MSLP-EANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHA 79 (533) Q Consensus 1 ~~~~-~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 79 (533) =.|| +...+..+..|..+...++....|++ +-.+-.+||.+ +||.......... +.+..++. T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~q~~~r~-~a~~d~~fy~G------~QW~~~~~~~l~~----------~g~p~~~~ 74 (772) T protein:vir:10 12 NGLPPAGDTPLTVDEYADINYEIEDQPAWRA-VADKEMDYADG------NQLDTELLRRQQA----------LGIPPAVE 74 (772) T ss_pred ccCCcccccccCHHHHHHHHHHHhccHHHHH-HHHHHHHhhcC------CCCCHHHHHHHHh----------cCCCcEEE Confidence 3455 45558888888888888888888866 34444567754 2444333222211 12236899 Q ss_pred ChHHHHHHHHHHhhcCCCceEeeCCCc--------hHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCC- Q lcl|NC_016654. 80 PIPGVIAKLSTTELFSEQLKFLDAGKS--------KEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIA- 150 (533) Q Consensus 80 n~~k~i~~~~a~ll~~e~~~i~~~~~~--------~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~- 150 (533) |+.+.+|+...++--...+.+.+.++. +.++..+..+.+.+++......+...+.+.|-+|+.++++.+.. T Consensus 75 N~i~~~v~~v~g~~~~nr~d~~v~Pr~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~d~~~ 154 (772) T protein:vir:10 75 DLIGPALLSLQGYEAVTRTDWRVTPNGDVGGQEVADALNYRLNTAERQSGADRACSEAFRPQIACGIGWVEVSRESDPFK 154 (772) T ss_pred cchHHHHHHHHHHHHhcCcceEEecCCCchHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhcCceeEEeccccCCCC Confidence 999999999999988888877775532 12345566678889999999999999999999999999986643 Q ss_pred CceEEEEEcCCeEEEEEec-CCceEE--EEEEEEee-------------------------------------------- Q lcl|NC_016654. 151 DNAWIDFVDADRAIPEFRW-GRLVAV--TFWSELAG-------------------------------------------- 183 (533) Q Consensus 151 ~~~~i~~v~~~~~~P~~~~-g~~~~v--~f~~~~~~-------------------------------------------- 183 (533) +.|+|..|++..++.-++. ..+.+| +|+..+-. T Consensus 155 ~~i~i~~v~p~~v~~Dp~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (772) T protein:vir:10 155 FPYRCRPIRRDEIHWDMKCGDDWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEGGTSTGLHN 234 (772) T ss_pred CCeEEEeeCcccceecCCCCCCHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCccccccccccccccccc Confidence 3588999999887754321 234443 12111000 Q ss_pred ----------------cCCceEEEEEEEecCeeEEEEEEeccCCcccceeehh---------------------hccccc Q lcl|NC_016654. 184 ----------------GDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALT---------------------DHPATR 226 (533) Q Consensus 184 ----------------~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~---------------------~~~~~~ 226 (533) ...+.-++++|+|..-.+.+.++.+.++. |..+.-. .+.... T Consensus 235 ~~~~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~-~~~~~~~~~~~~~~l~~g~~~~~~~~~~rv~~~~ 313 (772) T protein:vir:10 235 AWNEARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGR-VVEYDPNNLAHNIALASGRISPKKVTVSRVRRSY 313 (772) T ss_pred ccchhhccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCc-eEeeCcccHHHHHHHhhcccchheeeeeEEEEEE Confidence 00011245566544333333333332221 1111000 000111 Q ss_pred cccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceee Q lcl|NC_016654. 227 DIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVH 306 (533) Q Consensus 227 ~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~ 306 (533) .+.+.+...+. ...++-.+.|+|........ ...++| .+. .+++..+.+|...|+..+.+-.. ++. T Consensus 314 ~~g~~~L~~~~-----~p~~~~~fP~vP~~g~r~~~----~g~~~G--~vr-~~kd~Qr~~N~~~S~~~~~l~~~--~~~ 379 (772) T protein:vir:10 314 WLGPHCLHDGP-----TPYTHRHFPYVPFFGFREDA----TGIPYG--YVR-GMKYAQDSLNSGVSKLRWGMSVA--RVE 379 (772) T ss_pred EecceeeccCC-----CCCCCCccceEEEeeeEecc----CCcccc--hhh-hhhhHHHHHHHHHHHHHHHHhcc--ccc Confidence 11111111111 11111123333322111111 123444 343 36788999999999999877543 233 Q ss_pred echHHhcCCCC---ccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcc Q lcl|NC_016654. 307 ASESVLTNLGM---GQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLG 383 (533) Q Consensus 307 v~~~~l~~~~~---~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g 383 (533) .....+..... .....++. + ..++ .+..+.....++..++.---.+++..++.....|...+|++...+| T Consensus 380 ~~~gav~~~d~~~~e~~arp~~---v-i~~~---~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG 452 (772) T protein:vir:10 380 RTKGAVAMTDAQFRRQIARPDA---D-IVLD---ENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQG 452 (772) T ss_pred ccCCCccchhHHHHHhccCCCC---e-EEeC---CccccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcC Confidence 33322221100 00000100 0 0011 0100110112333332212347788888888999999999999999 Q ss_pred cCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCC---------Cce------------- Q lcl|NC_016654. 384 LSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAA---------PSE------------- 441 (533) Q Consensus 384 ~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~---------~~~------------- 441 (533) .. ++..||.+|..+..............+..+++.+.+.+|.+-...+...... ... T Consensus 453 ~~-~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~t 531 (772) T protein:vir:10 453 RK-GTATSGIQEQQQIEQSNQSIGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQT 531 (772) T ss_pred CC-cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceecccc Confidence 54 5567999999888887777777777888888888777777654332211100 001 Q ss_pred --------------eEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHh-----CCCCCHHHHHHHHHHHHHhhhc Q lcl|NC_016654. 442 --------------ELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYL-----HEDWDDERVQEEADLIDNANTV 502 (533) Q Consensus 442 --------------~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l-----~~~~~dee~~~El~rI~~E~~~ 502 (533) +|.|+=..+.+.=+.+.++.+.++. +.+..+.....+ .-++. -+++.+++|++-+++ T Consensus 532 g~~~~~NDi~~g~yDv~i~~~p~~~t~r~~~~~~m~ql~--~~~~P~~~~~~~~~~le~~D~p--~~~ei~~~ir~~~~~ 607 (772) T protein:vir:10 532 GAAYLSNDLLRTRIKVALEDVPSTNSYRGQQLNAMSEAV--KSMPPQYQAAVLPFLVSLMDVP--FKRDVVEAIRAVDQQ 607 (772) T ss_pred cccceeccceeeeEEEEeeccccchHHHHHHHHHHHHHH--hccChhHHHHHHHHHHhhcCCC--ChHHHHHHHHHHhcc Confidence 1111111111111345555555553 334444322111 11232 133444555544333 Q ss_pred ccCccccccccCCC---CCCCCCCCC------CCCCCCCC Q lcl|NC_016654. 503 SAPTFGFGTDQPPL---PTENDPATD------PEAVDEGE 533 (533) Q Consensus 503 ~~~~~~~~~~~~~~---~~~~~~~~~------~~~~~d~~ 533 (533) .+|......-+... -.....+-. .....+.+ T Consensus 608 ~~peq~~~~~~q~~qq~~~~~~~el~~~q~~a~~~~~~A~ 647 (772) T protein:vir:10 608 QTPEQIQQQIDQAVQDALAKAGNDIKLRELEIKERKADSE 647 (772) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22211000000000 000000000 00000000 No 94 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=99.37 E-value=4.9e-11 Score=77.12 Aligned_cols=474 Identities=11% Similarity=-0.031 Sum_probs=224.5 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhc---------CCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWE---------GDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATG 71 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~---------gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 71 (533) |.==-+.+.=+|++.....-+.+.+..|.+ ..-.+=.+||.+ +||.......... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~r~~a~~d~~fy~G------~Qw~~~~~~~l~~---------- 64 (714) T protein:vir:10 1 MKNEINTTAMKNDHGSTPRFSQRQLLSLCSDIDSQPLWRDAANKACAYYDG------DQLAPEVIQVLKD---------- 64 (714) T ss_pred CCcCcCcccCCCcchhhhhhhHHHHHHHHHHHhhhHHHHHHHHHHHHhhcC------CCCCHHHHHHHHh---------- Confidence 333222222333332322222233322221 111111233322 2332221111111 Q ss_pred cccceeecChHHHHHHHHHHhhcCCCceEeeCCC--c-------hHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEE Q lcl|NC_016654. 72 RAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGK--S-------KEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQR 142 (533) Q Consensus 72 ~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~--~-------~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~ 142 (533) ..+..++.|+.+.+|+...++--...+.+.+.+. + +.++..+..+...++.......+...+.+.|-+|+. T Consensus 65 ~g~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~ 144 (714) T protein:vir:10 65 RGQPMTIHNLIAPTVDGVLGMEAKTRTDLIVMSDDPNDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVE 144 (714) T ss_pred cCCCcEEeccHHHHHHHHHHHHHhCCcceEEecCCCChhhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcccceEE Confidence 1233689999999999999998888887777542 1 223455666788899999999999999999999999 Q ss_pred EEEcCC-CCCceEEEEEcCCeEEEEEe--cCCceEEEE--EEEEee---------------------------------- Q lcl|NC_016654. 143 IVWDPT-IADNAWIDFVDADRAIPEFR--WGRLVAVTF--WSELAG---------------------------------- 183 (533) Q Consensus 143 ~~~D~~-~~~~~~i~~v~~~~~~P~~~--~g~~~~v~f--~~~~~~---------------------------------- 183 (533) +++|.+ .++.|+|+.|+|..++.-++ ...+..+-| ..++.. T Consensus 145 ~~~d~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~~~~ 224 (714) T protein:vir:10 145 VRRNSEPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQ 224 (714) T ss_pred eeeccCCCCCCeEEEecChhheeeccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcccchhhhhh Confidence 998854 23579999999988876432 122333211 110000 Q ss_pred ----------------------cCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhc------------------- Q lcl|NC_016654. 184 ----------------------GDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDH------------------- 222 (533) Q Consensus 184 ----------------------~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~------------------- 222 (533) ...+..+++.|+|..-.+...++...+ |.-+.++.. T Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~---g~~~~~d~~~~~~~~~~~~g~~~~~~~~ 301 (714) T protein:vir:10 225 PSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSN---GRVVAFDKNNLMQAVAVASGRVQVKVGR 301 (714) T ss_pred cccccccchhhcccccccccccccCcceEEEEEEEEeEEEEEEeecCCC---CCeeeeCccCHHHHHHHHhccceecccc Confidence 000112456665543333333333221 111111100 Q ss_pred ----cccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 223 ----PATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDF 298 (533) Q Consensus 223 ----~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~ 298 (533) .......+.+...+ .+..++-++.|+|......+.. ..++| .+. .+++..+.+|...|+..+.+ T Consensus 302 ~~rv~~~~~~g~~~L~~~-----~~p~p~~~fp~vP~~g~~~~~~----g~~~G--~vr-~~~d~Qr~~N~~~s~~~~~l 369 (714) T protein:vir:10 302 VSRIREAWFVGPHFIVDR-----PCSAPQGMFPLVPFWGYRKDKT----GEPYG--LIS-RAIPAQDEVNFRRIKLTWLL 369 (714) T ss_pred eeeEEEEEEecchhhhcC-----CCCCCCCceeeEEecceeeecc----Cccce--ehh-hhhhHHHHHHHHHHHHHHHH Confidence 00000111111100 0011111234444322111111 23444 233 36688999999999998876 Q ss_pred HhCcceeeechHHhcCCCC---ccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhh Q lcl|NC_016654. 299 RIGAGKVHASESVLTNLGM---GQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKT 375 (533) Q Consensus 299 ~~~~~~i~v~~~~l~~~~~---~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~ 375 (533) . ..++++.+..+..... .+...++.. -.|.+.. .++......++..++.--...++..++.....|...+ T Consensus 370 ~--~~~~~~~~gav~~~d~~~~e~~~rp~~v-i~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~t 442 (714) T protein:vir:10 370 Q--AKRVIMDEDATQLSDNDLMEQLERPDGI-IKLNPVR----KNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTM 442 (714) T ss_pred h--CCceeeccccccccHHHHHHhccCCCCe-EEecccc----cccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhh Confidence 3 3345553333211100 000001000 0011110 1111111223333322123467888888888999999 Q ss_pred CCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCC----------CceeEEE Q lcl|NC_016654. 376 GYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAA----------PSEELEL 445 (533) Q Consensus 376 g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~----------~~~~v~i 445 (533) |++...+|-. ++..||.||..+..............+..+++.+.+.+|.+....+...... ....+.+ T Consensus 443 Gv~~~~lG~~-~na~SGvAI~~r~~qg~~~l~~~~dnl~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~ 521 (714) T protein:vir:10 443 GVYSAFLGQD-SGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVL 521 (714) T ss_pred CCCHHHcCCC-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEee Confidence 9999999965 4567999999988887777777778888888888887777654322211100 0111222 Q ss_pred Ee----------------------CCCCCCCHHHHHHHHHHHHhCC-----CCCHHHHHHHhCCCCCHHHHHHHHHHHHH Q lcl|NC_016654. 446 EW----------------------PKFARESDLAKAQTVQAWSVAS-----AASTKTKVAYLHEDWDDERVQEEADLIDN 498 (533) Q Consensus 446 ~f----------------------~d~i~~d~~e~a~~~~~l~~aG-----i~S~et~v~~l~~~~~dee~~~El~rI~~ 498 (533) ++ ..+.+.-+.+.++.+.++..+. .+.....+..+ ++. -+++.+++|.+ T Consensus 522 n~~~~~~~~~nDi~~~~~dv~i~~~p~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~le~~--d~p--~~~ei~~~ir~ 597 (714) T protein:vir:10 522 NAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLL--DVP--QKQEFVERIRA 597 (714) T ss_pred ccccCCccccccceeeeEEEEEeeccCcHHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhc--CCc--CHHHHHHHHHH Confidence 22 1122222455566666665421 11122223322 221 24455666655 Q ss_pred hhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 499 ANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 499 E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) -.+...+.- + ..+.+.... T Consensus 598 ~~~~~~~~~------~----------~~~e~q~~q 616 (714) T protein:vir:10 598 ALGTPKSPD------E----------MTPEEQEVA 616 (714) T ss_pred HcCCCCCcc------c----------cCcchhHHH Confidence 442211100 0 000000000 No 95 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=99.36 E-value=1.9e-11 Score=79.31 Aligned_cols=491 Identities=8% Similarity=-0.075 Sum_probs=225.7 Q ss_pred hHHHHHHHHhhhHhhcCCHHHHHHHH----hccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHH Q lcl|NC_016654. 15 LAAVTARVAESHVWWEGDLDKLATFY----GAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLST 90 (533) Q Consensus 15 ~~~~~~~~~~~~~w~~gd~~~l~~~y----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a 90 (533) +++....+.+.+.||.-+.+.-.+.+ ....+|..+||......... ...+..+|+.+.+|+.+. T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~------------~q~rp~~N~i~~~i~~v~ 68 (725) T protein:vir:77 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTT------------LQYRGQFDVVRPVVRKLV 68 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhCCCCCCHHHHHHHH------------hcCCCccccHHHHHHHHH Confidence 44444555556655554333222211 11112222233222211111 112336799999999988 Q ss_pred HhhcCCCceEeeCCCc-------hHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEc---CC-CCCceEEEEE- Q lcl|NC_016654. 91 TELFSEQLKFLDAGKS-------KEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWD---PT-IADNAWIDFV- 158 (533) Q Consensus 91 ~ll~~e~~~i~~~~~~-------~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D---~~-~~~~~~i~~v- 158 (533) ++--...+.+.+.+.+ +.++..+..+...++.......+...+.+.|.+|+.++.| ++ .++.+.|..+ T Consensus 69 g~~~~nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~ 148 (725) T protein:vir:77 69 SEMRQNPIDVLYRPKDGARPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREP 148 (725) T ss_pred hhHHhCCcceEEecCCccHHHHHHHHHHHHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEee Confidence 8877777777665432 2234556667778888999999999999999999999754 22 3345555544 Q ss_pred ---cCCeEEEEEec--CCceEE--EEEEEEee------------------------cC------CceEEEEEEEecCeeE Q lcl|NC_016654. 159 ---DADRAIPEFRW--GRLVAV--TFWSELAG------------------------GD------GQEVWRHLERHESGYI 201 (533) Q Consensus 159 ---~~~~~~P~~~~--g~~~~v--~f~~~~~~------------------------~~------~~~~y~~lE~h~~~~I 201 (533) ++.+++.-+.. -.++.+ +|...+.. .+ .....+++|+|+...+ T Consensus 149 ~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~E~~~r~~~ 228 (725) T protein:vir:77 149 IHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEK 228 (725) T ss_pred cccChhhceeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCeeEEEEEEEEEEE Confidence 34444432211 112211 11111100 00 0122467788776655 Q ss_pred EEEEEeccCCcccceeehhhccc--------ccccccc----------c-cccCCceeecC--CCccceeEEecCCcccc Q lcl|NC_016654. 202 VHAVYKGTATSLGWMMALTDHPA--------TRDIAVE----------G-ADEGRGAYVET--GVKDLTAAYVPNVTPNP 260 (533) Q Consensus 202 ~~~~y~~~~~~lG~~v~l~~~~~--------~~~~~~~----------~-~~~~~~~~~~~--g~~~~~~~~~pn~~~~~ 260 (533) .-.+|...+...|.-+.++.... -.++... + ....+....+. ..+.-.+.|+|..... T Consensus 229 ~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r- 307 (725) T protein:vir:77 229 KETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEW- 307 (725) T ss_pred eeEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCcCCCCccceEEEeeee- Confidence 54445433322222222110000 0000000 0 00000011111 0111123333322111 Q ss_pred cccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHh-CcceeeechHHhcCCCCccccccCcchhhhhhccccccc Q lcl|NC_016654. 261 EWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRI-GAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFN 339 (533) Q Consensus 261 ~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~-~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~ 339 (533) ..-..++++.+.+.+ ++|..+.+|...|...+.+-. .+.+..+....++... .... ..+...|...+..... T Consensus 308 ---~~~~g~~~~~G~vr~-~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~-~~~~--~~~~~~~~~~~~~~~~ 380 (725) T protein:vir:77 308 ---GFVEDKEVYEGVVRL-TKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFE-HMYD--GNDDYPYYLLNRTDEN 380 (725) T ss_pred ---eccCCcccccchhhh-hhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHH-HHHH--hccCCceecccccccC Confidence 111234554455654 679999999999999988854 3444445444442110 0000 0111111111111111 Q ss_pred cccccccceeee-chhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 340 ANGDMETIFEFF-QPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALG 418 (533) Q Consensus 340 ~~~~~~~~i~~~-~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~ 418 (533) .|......+..+ .++++ ..++..++.....|...+|+....+|-.++ ..||.+|..+..............+..+.+ T Consensus 381 ~g~~~~~~i~~~~~~~lp-~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n-~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~ 458 (725) T protein:vir:77 381 SGDLPTQPLAYYENPEVP-QANAYMLEAATSAVKEVATLGVDTEAVNGG-QVAFDTVNQLNMRADLETYVFQDNLATAMR 458 (725) T ss_pred CCcccccCccccCCCCch-HHHHHHHHHHHHHHHHHhCCCHHHhCCCch-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111112222 33333 366778888888999999999999996644 578999999998888888778888888888 Q ss_pred HHHHHHHHHHHhhccCC---------CCC------------------------CceeEEEEeCCCCCCCHHHHHHHHHHH Q lcl|NC_016654. 419 PLSTTCLRVDAIKFPGK---------GAA------------------------PSEELELEWPKFARESDLAKAQTVQAW 465 (533) Q Consensus 419 ~li~~il~l~~~~~~~~---------~~~------------------------~~~~v~i~f~d~i~~d~~e~a~~~~~l 465 (533) .+.+.+|.+-...+... +.. ..++|.|+=..+.+.=+++.+..++.+ T Consensus 459 ~~g~~lL~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~~~~l~ql 538 (725) T protein:vir:77 459 RDGEIYQSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILEL 538 (725) T ss_pred HHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeeccchHHHHHHHHHHHHHH Confidence 88777777644322110 000 012222222222222244555555555 Q ss_pred HhCC--CCCHH-HHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCC----CCCCCCCCCCCCC Q lcl|NC_016654. 466 SVAS--AASTK-TKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTEN----DPATDPEAVDEGE 533 (533) Q Consensus 466 ~~aG--i~S~e-t~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~d~~ 533 (533) ..+. .++.- ..+...-...+-+.+++.+++|++...+.... ++..+.+. ...-......+.| T Consensus 539 l~~~~~~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~------q~~~~~e~q~~~~~qq~~~~q~~~e 607 (725) T protein:vir:77 539 LGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVK------KPETPEEQQWLVEAQQAKQGQQDPA 607 (725) T ss_pred HHhccccchhHHHHHHHhhccccchHHHHHHHHHHhhhhhhhcc------CCCChhhHHHHHHHHHHHHHhHHHH Confidence 4322 11111 11222112233345666677777665332210 00000000 0000000011111 No 96 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.35 E-value=7.7e-11 Score=76.04 Aligned_cols=457 Identities=11% Similarity=0.027 Sum_probs=199.2 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCH---HHHHHHHhcc-Cc-chhhHHHHHHHHHHHHHhcccCCCCCcccc Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDL---DKLATFYGAE-GR-TSPSGIKARTKAAYEAFHGRTPTATGRAPK 75 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~---~~l~~~y~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 75 (533) |.+|.+-..=-|....+. ..+ ..-+.|-. .++..+.... .. ......+..+..+ +|. - T Consensus 1 ~~~p~~~~~~~~~~~~~~-~~~---~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~R-aRd------------l 63 (533) T protein:vir:34 1 MKTPTIPTLLGPDGMTSL-REY---AGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNAR-ADD------------L 63 (533) T ss_pred CCCchhhhhhcccccchH-HHH---HhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHH-HHH------------H Confidence 777733322112221111 111 11111100 0000000000 00 0001111111111 111 1 Q ss_pred eeecChHHHHHHHHHHhhcCCCceEeeCC----------CchHHHHHHHHHHh----h----------ccHHHHHHHHHH Q lcl|NC_016654. 76 RYHAPIPGVIAKLSTTELFSEQLKFLDAG----------KSKEVQARADLIFN----T----------PRFHSSLVEAGE 131 (533) Q Consensus 76 ~~~~n~~k~i~~~~a~ll~~e~~~i~~~~----------~~~~~~~~l~~i~~----~----------n~f~~~~~~~~~ 131 (533) ....++++-+++.+++.+.|....+.... .++.+++.++..++ + .+|......++. T Consensus 64 ~rNn~~a~~av~~~~~nvVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r 143 (533) T protein:vir:34 64 VRNNGYAANAIQLHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVA 143 (533) T ss_pred HhcChHHHHHHHHHHHHhhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHH Confidence 23457999999999999999876665432 23344455544432 1 146677777777 Q ss_pred HHhhhCCEEEEEEEcCCCCC--ceEEEEEcCCeEEEEE---ecCCceEEEEEEEEeecCCceEEEEEEEecCee-EEEEE Q lcl|NC_016654. 132 SCSALSGSFQRIVWDPTIAD--NAWIDFVDADRAIPEF---RWGRLVAVTFWSELAGGDGQEVWRHLERHESGY-IVHAV 205 (533) Q Consensus 132 ~~~~~G~~~~~~~~D~~~~~--~~~i~~v~~~~~~P~~---~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~-I~~~~ 205 (533) ..+..|.++++..|++..+. ..++..++|+.+---. +.+.+.. -+|+-..|+ +-|.+ T Consensus 144 ~~~~dGE~f~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~-----------------GIe~d~~Gr~~aY~i 206 (533) T protein:vir:34 144 MHAFNGELFVQATWDTSSSRLFRTQFRMVSPKRISNPNNTGDSRNCRA-----------------GVQINDSGAALGYYV 206 (533) T ss_pred HHHhCCceEEEeeeccCCCCccceEEEEechhhcCCCCCCCCCCceEe-----------------eeEECCCCCeEEEEE Confidence 77999999999998865322 3578888888654211 1122222 223322222 22333 Q ss_pred EeccCC-cccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHH Q lcl|NC_016654. 206 YKGTAT-SLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTF 284 (533) Q Consensus 206 y~~~~~-~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~li 284 (533) ++.... ..+ ..+..+.. .+. .+..-+.|+-... .+ ....|+|.|+..|. .+ T Consensus 207 ~~~~~~~~~~--------~~~~~~~~---------~~~--v~a~~VlH~f~~~------r~--gQ~RGis~lapvl~-~l 258 (533) T protein:vir:34 207 SEDGYPGWMP--------QKWTWIPR---------ELP--GGRASFIHVFEPV------ED--GQTRGANVFYSVME-QM 258 (533) T ss_pred eecCCCCccc--------cccceeee---------eec--cChhHeeeecccc------CC--CcccCCchHHHHHH-HH Confidence 322111 100 00000000 000 1111122221111 01 24569999999775 45 Q ss_pred HHHHHHHHH-HHHHHHhCcceeeechHH-----hc----CCCCccccccCc---c-------hh-hhhhccccccccccc Q lcl|NC_016654. 285 HELDRIYSS-LMRDFRIGAGKVHASESV-----LT----NLGMGQGVSLDE---E-------QE-VYSRVGSGGFNANGD 343 (533) Q Consensus 285 d~lD~~~s~-~~~~~~~~~~~i~v~~~~-----l~----~~~~~~~~~~d~---~-------~~-~~~~~~~~~~~~~~~ 343 (533) ..++.-.+. +....-.+--..||-... .. .........+.. . .. .+....+.....| T Consensus 259 ~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pG-- 336 (533) T protein:vir:34 259 KMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPG-- 336 (533) T ss_pred HHHHHHHHHHHHHHHHhhhheeeeecCCCcccccccccCCCcccccccccccchhhhhccCcceeeccCceeeecCCC-- Confidence 556544333 222222222223331110 00 000000000000 0 00 0111111111222 Q ss_pred cccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc--hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 344 METIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA--QTATEASGKKDLTVKTTRAKARHFGSALGPLS 421 (533) Q Consensus 344 ~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~--~Tatai~~~~~~l~~~~~~~~~~~~~al~~li 421 (533) ..++.+++.-...+|..-++.+++.|...+|+|++.++-|.+++ .|+.+-.....+ .+...+..+...+-+-+ T Consensus 337 --e~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r---~~~~~q~~~~~~~~~pi 411 (533) T protein:vir:34 337 --DSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWA---YFMGRRKFVASRQASQM 411 (533) T ss_pred --CeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHH Confidence 23667777766677777888889999999999999997765432 234444333333 33444443443333323 Q ss_pred HHHHHHHHhhccCCCCCCc-----------eeEEEEe--CCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHH Q lcl|NC_016654. 422 TTCLRVDAIKFPGKGAAPS-----------EELELEW--PKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDER 488 (533) Q Consensus 422 ~~il~l~~~~~~~~~~~~~-----------~~v~i~f--~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee 488 (533) +.. ||....+.|....+. ....+.| +.....|+..+++....++.+|++|.+..+++. +.|-++ T Consensus 412 ~~~-wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~--G~D~~e 488 (533) T protein:vir:34 412 FLC-WLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKR--GDDYQE 488 (533) T ss_pred HHH-HHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHc--CCCHHH Confidence 221 232233333332221 1134556 444567999999999999999999999999885 355555 Q ss_pred HHHHHHHHHHhhhcccCccccccc-cCC-CCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 489 VQEEADLIDNANTVSAPTFGFGTD-QPP-LPTENDPATDPEAVDEGE 533 (533) Q Consensus 489 ~~~El~rI~~E~~~~~~~~~~~~~-~~~-~~~~~~~~~~~~~~~d~~ 533 (533) +.+++ ..|...... .+...+ .+. ....+.+..+++...++. T Consensus 489 v~~q~---a~e~~~~~~-~gl~~~~~~~~~~~s~~~~~~~~~~~~~~ 531 (533) T protein:vir:34 489 IFAQQ---VRETMERRA-AGLKPPAWAAAAFESGLRQSTEEEKSDSR 531 (533) T ss_pred HHHHH---HHHHHHHHh-cCCCCCCCCCcCccCCCCCCCCCCcccCC Confidence 54443 333322111 000000 010 011111111112222222 No 97 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.33 E-value=9.6e-11 Score=75.51 Aligned_cols=459 Identities=9% Similarity=-0.063 Sum_probs=203.7 Q ss_pred CCCCCC-cCCCcCcc-hHHHHHHHHhhhHhhcCCHHHHHHHHhcc-Ccc-hhhHHHHHHHHHHHHHhcccCCCCCcccce Q lcl|NC_016654. 1 MSLPEA-NTAWPPPE-LAAVTARVAESHVWWEGDLDKLATFYGAE-GRT-SPSGIKARTKAAYEAFHGRTPTATGRAPKR 76 (533) Q Consensus 1 ~~~~~~-~~~~pp~~-~~~~~~~~~~~~~w~~gd~~~l~~~y~~~-~~~-~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 76 (533) |.|=+. ..++=|.. +..+.++... +.+=.+...+...-+... ... .....+..+.. .+|.+ . T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~ar~~~-~~y~aa~~~r~~~~~~~~~s~~~~i~~~~~~lr~-RaRdL------------~ 66 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAAREAI-QAYEAARPGRTHKAKRQPLGADTSLQKSAVSMRE-QCRKL------------D 66 (548) T ss_pred CchHHhHhhhcchHHHHHHHHhHHHh-ccccccCccccccccCCCCChHHHHHHHHHHHHH-HHHHH------------H Confidence 665532 22332221 1111111100 111112222211110000 000 00011111111 11111 1 Q ss_pred eecChHHHHHHHHHHhhcCC-CceEe--eCCCc----hHHHHH----HHHHHhh------ccHHHHHHHHHHHHhhhCCE Q lcl|NC_016654. 77 YHAPIPGVIAKLSTTELFSE-QLKFL--DAGKS----KEVQAR----ADLIFNT------PRFHSSLVEAGESCSALSGS 139 (533) Q Consensus 77 ~~~n~~k~i~~~~a~ll~~e-~~~i~--~~~~~----~~~~~~----l~~i~~~------n~f~~~~~~~~~~~~~~G~~ 139 (533) ...++++-+++.+++.+.|. ...+. .-..+ +.+++. ++++.++ .+|......++...+..|.+ T Consensus 67 rNn~~a~~av~~~~~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~ 146 (548) T protein:vir:95 67 EDHDLVTGLLDRLEERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEG 146 (548) T ss_pred hcChHHHHHHHHHHHhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCce Confidence 24569999999999999984 22222 21112 233333 3334332 24777777788888999999 Q ss_pred EEEEEEcCCCCC------ceEEEEEcCCeEE-EEEe-cCCceEEEEEEEEeecCCceEEEEEEEecCe-eEEEEEEeccC Q lcl|NC_016654. 140 FQRIVWDPTIAD------NAWIDFVDADRAI-PEFR-WGRLVAVTFWSELAGGDGQEVWRHLERHESG-YIVHAVYKGTA 210 (533) Q Consensus 140 ~~~~~~D~~~~~------~~~i~~v~~~~~~-P~~~-~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~-~I~~~~y~~~~ 210 (533) ++++.|++.... ..+|..++|+.+- |.-. .+.+. --+|+-..| .+-|.++...+ T Consensus 147 f~~~~~~~~~~~~~g~~~~~~lqliepd~l~~~~~~~~~~i~-----------------~GIE~D~~Grp~aY~i~~~hP 209 (548) T protein:vir:95 147 LAQKLMGRVPNYTFATSVPFALELLEPDYLPFSYNNLSKGIV-----------------QGIERDTWRRKRAYHLLKDHP 209 (548) T ss_pred EEEeeecccccccCCcccceEEEEechhhcCCCCCCCCCcee-----------------eeeEECCCCceEEEEEeecCC Confidence 999999765321 2478888888762 2111 12221 122322222 12233333222 Q ss_pred CcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHH Q lcl|NC_016654. 211 TSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRI 290 (533) Q Consensus 211 ~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~ 290 (533) ...... .....+.. |. -.....+|.| .. + ....|+|.|+..|.. +..++.. T Consensus 210 gd~~~~---~~~~~~~r-------------vp--A~~VlHif~~-~r-------~--gQ~RGvs~lapvl~~-l~~l~~y 260 (548) T protein:vir:95 210 GNLQTL---GGSLAVKR-------------VE--AERIIHIAYR-KR-------I--GQNRGVPMLHAVLIR-LADLKDY 260 (548) T ss_pred Cccccc---ccccceee-------------ec--hhHheecccc-cC-------C--ccccCcchHHHHHHH-HHHHhHH Confidence 211000 00000000 00 0001111211 11 1 234699999997754 4556544 Q ss_pred HHH-HHHHHHhCcceeeechHHhcCCC-Ccccc-ccCcchhhhhhccccc----cccccccccceeeechhhhhHHHHHH Q lcl|NC_016654. 291 YSS-LMRDFRIGAGKVHASESVLTNLG-MGQGV-SLDEEQEVYSRVGSGG----FNANGDMETIFEFFQPAIRVLEHDQG 363 (533) Q Consensus 291 ~s~-~~~~~~~~~~~i~v~~~~l~~~~-~~~~~-~~d~~~~~~~~~~~~~----~~~~~~~~~~i~~~~~~ir~e~~~~~ 363 (533) .+. +....-.+--..|| +... .+... ..+........+.-+. ...| ..++.+++.-...+|..- T Consensus 261 ~dael~~aki~A~~a~fi-----~~~~~~~~~~~~~~~~~~~~~~~~pG~iv~~L~pG----e~i~~~~p~~p~~~~~~f 331 (548) T protein:vir:95 261 EESERVAARISAALAMYI-----KKGNPDSYTVEPGKDRKNRTIPIAPGMVFDDLEPG----EDVGMIESNRPNPFLEGF 331 (548) T ss_pred HHHHHHHHHHhhhheeee-----ecCCCccccCCCCcccccccccccCCccccccCCC----ceeeecCCCCCCCCHHHH Confidence 433 22222222222232 2111 11110 0111111111111111 1111 236777777667777888 Q ss_pred HHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCC---- Q lcl|NC_016654. 364 AALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAP---- 439 (533) Q Consensus 364 l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~---- 439 (533) ++.+++.|...+|+|++.++.+.++ |=.+++......-......+..+...+-+-|+.. ||....+.|....+ T Consensus 332 ~~~~lr~IAaglGipYe~ltgD~s~--nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~-wle~a~l~G~i~lP~~~~ 408 (548) T protein:vir:95 332 RNGQLRMIGAGTRSTYSSVSRAYDG--TYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRS-WLQMYLLARKERLPADVD 408 (548) T ss_pred HHHHHHHHHhhcCCCHHHHhcccch--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHcCCcCCCCCCC Confidence 8888999999999999999877653 4444455554444445555544444444433221 34444444433322 Q ss_pred -ceeEEEEeC--CCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHH---hhhcccC--ccc--- Q lcl|NC_016654. 440 -SEELELEWP--KFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDN---ANTVSAP--TFG--- 508 (533) Q Consensus 440 -~~~v~i~f~--d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~---E~~~~~~--~~~--- 508 (533) ...+.+.|- .....|+..+++....++.+|++|.++.+++. +.|-+++.+++++-.+ +.+...+ +.. T Consensus 409 ~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a~~--G~D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~ 486 (548) T protein:vir:95 409 HRTLYAAVYQGPVMPWINPMHEANAWELLVKAGFADEAEVARAR--GRDPRELKKSRETEIKANRAAGLVFSSDAYHQLV 486 (548) T ss_pred chhheeeeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHh--CCCHHHHHHHHHHHHHHHHHcCCCCCCccccccc Confidence 224677784 34457999999999999999999999999985 3555555444332211 1111111 101 Q ss_pred cccccCCCCCC-CCCCCC-CCCCCCCC Q lcl|NC_016654. 509 FGTDQPPLPTE-NDPATD-PEAVDEGE 533 (533) Q Consensus 509 ~~~~~~~~~~~-~~~~~~-~~~~~d~~ 533 (533) .++.++..... ...+++ +...||+| T Consensus 487 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (548) T protein:vir:95 487 KSGMDPVEAVQKVYLGVGKMLTADEAR 513 (548) T ss_pred ccccCCCCchhhhccccccccccchhH Confidence 01111111111 112222 23333343 No 98 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.30 E-value=1.6e-10 Score=74.34 Aligned_cols=404 Identities=13% Similarity=0.062 Sum_probs=181.2 Q ss_pred CCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccC-----cchhh-HHHHHHHHHHHHHhcccCCCCCcccce Q lcl|NC_016654. 3 LPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEG-----RTSPS-GIKARTKAAYEAFHGRTPTATGRAPKR 76 (533) Q Consensus 3 ~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~-----~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~~ 76 (533) |+ ..+.+..+..+.. ...+. +-...+ ..-. T Consensus 1 ~~---------------------------~~D~~~n~~~gg~~~~~~~~~~~~~~~~~l-----------------~a~Y 36 (422) T protein:vir:10 1 MV---------------------------KTDSYANIFLGGSDGSEIYGSLQNQAPTIL-----------------ASLY 36 (422) T ss_pred Cc---------------------------cchhhHHHHcCCCCCccccCcccccCHHHH-----------------HHHH Confidence 11 1111222211100 00000 000000 0112 Q ss_pred eecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEE Q lcl|NC_016654. 77 YHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWID 156 (533) Q Consensus 77 ~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~ 156 (533) .+..+++.+|+..|.-++.+...|+..++.+...+.+++ =+++..+.+++..+-.+|++++.+-++.+.. +. T Consensus 37 ~~~~l~~~~Vd~~aed~~r~g~~i~~~~~~~~~~~~~~~----l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~--~~-- 108 (422) T protein:vir:10 37 ADNALVRRIIDTIPETALAAGFHIDGIDDEPAFWSRWDD----LEMTQNINDAWSWARLFGGAAIVAIVKDNRA--LT-- 108 (422) T ss_pred HhChhhHHHHhhhhHHHhcCCccccCCCHHHHHHHHHHH----hhHHHHHHHHHHhhccccceEEEEEecCCCC--cc-- Confidence 356799999999999999988777654333333344443 4789999999999999999998877742211 00 Q ss_pred EEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecC--eeEEEEEEeccCCcccceeehhhccccccccccccc Q lcl|NC_016654. 157 FVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHES--GYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGAD 234 (533) Q Consensus 157 ~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~--~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~ 234 (533) -|+-..|.+..+..+..+.-.-. .++. .-..+ |...+..+.+.....+.+| T Consensus 109 -------~Pl~~~g~~~~l~v~d~~~i~~~-~~~~--dp~s~~fg~P~~y~v~~~~~~~~~~i----------------- 161 (422) T protein:vir:10 109 -------SPVREGAELETVRVYDRTQVKVQ-TREE--NPRNARFGEPLTYRITTNESDMFYDV----------------- 161 (422) T ss_pred -------ccccccCceeeEEeeccccccch-hccc--CccccccCcceEEEEecCCCCcceee----------------- Confidence 12222343333322111100000 0000 00000 1111100111100000000 Q ss_pred cCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceee-ec--hHH Q lcl|NC_016654. 235 EGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVH-AS--ESV 311 (533) Q Consensus 235 ~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~-v~--~~~ 311 (533) +.++ .+.|.. .+..+. ..+....+|.|.+...+.+.+..++.+-.....-+...+.+++ ++ ..+ T Consensus 162 --------H~SR--li~~~g--~~~p~~-~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~ 228 (422) T protein:vir:10 162 --------HYSR--IHIIDG--ERIPNV-MRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLKRKQQAVWKAKGLAEL 228 (422) T ss_pred --------ccce--eEEeCC--CCchhh-hcccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHh Confidence 0011 111111 100111 1223456899999765556667777766655554543333333 22 122 Q ss_pred hcCCCCccccccCcchhhh---hhcccc-ccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhc-ccCC Q lcl|NC_016654. 312 LTNLGMGQGVSLDEEQEVY---SRVGSG-GFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSL-GLSD 386 (533) Q Consensus 312 l~~~~~~~~~~~d~~~~~~---~~~~~~-~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~-g~~~ 386 (533) +... .+..... .....+ +..... ..+. ....++.++-+ +......++.+..+++..+|+|-..+ |... T Consensus 229 ~~~~-~~~~~~~-~r~~~~~~~~~~~~~~~l~~---~~e~~e~~~~~--lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~ 301 (422) T protein:vir:10 229 CDDS-EGFGAAR-LRLAQVDNNSGVGQAIGIDA---ESEEYSVLNSD--IGGIDAFLDKKFDRIVALSGIHEIILKNKNV 301 (422) T ss_pred cCCc-cchHHHH-HHHHHHHHhcCCccceeEec---CCcceEEEecc--cCChHHHHHHHHHHHHhhhCCCeeeeccCCc Confidence 2111 1100000 000001 111100 0111 11235555433 33456778888899999999987644 5544 Q ss_pred Ccc-hhHHHHHHHhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHH Q lcl|NC_016654. 387 EVA-QTATEASGKKDLTVKTTRAKA-RHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQA 464 (533) Q Consensus 387 ~~~-~Tatai~~~~~~l~~~~~~~~-~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~ 464 (533) +|- .|+.+-...+ +..++.+| ..++..|..|+..+++ ..+++|.|++-..++..|+|++..+ T Consensus 302 ~Glnatgd~d~~~y---yd~i~~~Qe~~l~p~l~~l~~~i~~-------------s~~~~~~f~pL~~~sekekaei~~~ 365 (422) T protein:vir:10 302 GGVSSSQNTALETF---HKLVDRKRNAELLPILEFLIPFIVN-------------AEEWSVEFNPLAQESSKDKAEILEK 365 (422) T ss_pred ccccccchHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhcc-------------cCCcEEEeCCCCCCCHHHHHHHHHH Confidence 443 4565554444 34444444 5678888888876542 1468899999999999988886554 Q ss_pred HHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 465 WSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 465 l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ... . ..+++. .+-.+.+|++++|.......+. .....+...++.+.+.+++.++++| T Consensus 366 ~a~--a--~~~~~~--~g~i~~~e~r~~L~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~d 422 (422) T protein:vir:10 366 NVN--S--IAALIA--AGAMDIDEARDTLRTIAPEVKI------NDGSVETEVTISETSNDPLEVPTDD 422 (422) T ss_pred HHH--H--HHHHHh--cCCCCHHHHHHHhhhhcccccC------CCCCCccccchhhcCCCCCCCCCCC Confidence 322 1 122233 3346666776666433211111 0111122222222222333334444 No 99 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.28 E-value=1.8e-10 Score=74.06 Aligned_cols=454 Identities=9% Similarity=-0.005 Sum_probs=194.9 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcc--cceee Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRA--PKRYH 78 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~--~~~~~ 78 (533) |.++.|...=||. .+....-|...+ .++....|=......+.+. ...|+.... ..|.+ .-... T Consensus 48 ~~~~~~~~~~~~~-~~~~~~~~a~d~-----~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~-~~~~~l~a~Y~~ 112 (537) T protein:vir:10 48 MAIRDHAIAMMPK-VDGSHPDMAMDG-----LDVEGGTFSAYANPNLSEG--------LVLWYAQQA-FIGHQMCALIAT 112 (537) T ss_pred CCCCCccCccccc-ccccccchhccc-----cccchhhhhhhccccccch--------hhhhccccC-CccHHHHHHHHh Confidence 5655544333332 122111111111 1100000000000000000 000100000 00101 01234 Q ss_pred cChHHHHHHHHHHhhcCCCceEeeCCCc---hHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEE Q lcl|NC_016654. 79 APIPGVIAKLSTTELFSEQLKFLDAGKS---KEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWI 155 (533) Q Consensus 79 ~n~~k~i~~~~a~ll~~e~~~i~~~~~~---~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i 155 (533) ..+++.||+..|+-++.+...|++.+.+ ....+.|...++.-+++..+.+++..+-.+|++++.+.++..- +...- T Consensus 113 ~~l~r~iVd~~A~d~~r~~~~i~~~~~~~~~~~~~~~l~~~~~~l~~~~~l~~a~~~~rlyG~~~i~i~v~~~D-~~~~~ 191 (537) T protein:vir:10 113 HWLVNKACSQMPRDAMRKGYKIISDDGNELDPKDAKFIDRYDRAFNIKKHAIQFVRKGRIFGIRIALFKVDSPD-PYYYE 191 (537) T ss_pred CchhhhhhhhhhHHhhcCCceeecCCcccccHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEeecCcC-Ccccc Confidence 5799999999999999999888886432 2344667777777789999999999999999999887764221 11111 Q ss_pred EEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccccccc Q lcl|NC_016654. 156 DFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADE 235 (533) Q Consensus 156 ~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~ 235 (533) +-++.+. ...|.+..+..+..+.. .++.+.+..-.-....+|.+--+ .+. T Consensus 192 ~Pl~~~~----i~kg~~k~l~vidp~~~-------------~~~~~~~~~~dp~sp~fg~P~~y-----------~v~-- 241 (537) T protein:vir:10 192 KPFNIDG----VMPGAYKGIVQIDPYWC-------------APLLDAQASSNPVSMHFYEPTYW-----------LIN-- 241 (537) T ss_pred ccccccc----ccccceeEEEEechhhc-------------ccccchhhhccCCccccCCceee-----------eec-- Confidence 1111111 01122222221111000 00000000000000011111000 000 Q ss_pred CCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCC Q lcl|NC_016654. 236 GRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNL 315 (533) Q Consensus 236 ~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~ 315 (533) + ..| +.++ .+.|..+..+ +. ..+....+|+|.+... .+.|..++.+....+.-+.....+++--. .+... T Consensus 242 g--~~i-H~SR--li~f~g~~~p--~~-~~~~~~~~G~Svlq~~-~~~l~~~~~t~~~~~~l~~~~~~~v~k~~-~~~~l 311 (537) T protein:vir:10 242 G--KKY-HRSH--LAIYINDEVV--DF-LKPSYIYGGVPLPQQI-MERVYAAERTANEGPMLAMTKRQTVLKVD-AAQVL 311 (537) T ss_pred C--eEe-ccee--EEEecCCCCc--hh-hhcccCcccccHHHHH-HHHHHHHHHHHHHHHHHHHhcCCceeeec-hHHhh Confidence 0 000 0011 1112111111 11 1222346799999874 46667777777666665544444444211 11111 Q ss_pred CCccccccCcchhhhhhccc--cccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhh-cccCCCcc-hh Q lcl|NC_016654. 316 GMGQGVSLDEEQEVYSRVGS--GGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVS-LGLSDEVA-QT 391 (533) Q Consensus 316 ~~~~~~~~d~~~~~~~~~~~--~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~-~g~~~~~~-~T 391 (533) .+... +...-..+..... +..-.++ ....++.++. .+...-..++.+...|+..+|+|... ||-..+|. .| T Consensus 312 ~~~~~--~~~r~~~~~~~r~n~g~~~id~-e~e~~e~~~~--~lsgl~~~l~~~~~~iAa~~~IP~t~L~G~sp~Glnat 386 (537) T protein:vir:10 312 ANKQQ--FDETMSWWTATRDNYQVRVVDK-DNEDVVQIDT--TLNDLDKVIMNQYQLVCAIARTPAPKMLGTVPTGFNST 386 (537) T ss_pred cCHHH--HHHHHHHHHhhcCCcceeEecC-CCceeEEEec--cCCCHHHHHHHHHHHHHhhhCCCceeeccCCccccccc Confidence 11110 1000001111100 0011111 1123444442 33344556777778888899998764 56443343 56 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHH-------HHH Q lcl|NC_016654. 392 ATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQT-------VQA 464 (533) Q Consensus 392 atai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~-------~~~ 464 (533) |++=...+ +..++.+|..++..|..++..++... + + ...+++|.|+.-...|..|+|++ +++ T Consensus 387 Ge~D~~~y---yd~I~~~Qe~l~p~l~~l~~ll~~~~---~-~----~~~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~ 455 (537) T protein:vir:10 387 GDYEEASY---HEECESTQDDMRPLIDRHHQLVCRSH---L-R----KRIRVKVEFPPMDAPKESERADTFLKKMQAAKL 455 (537) T ss_pred hhHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHhc---C-C----CCcceEEEeCCCCCCCHHHHHHHHHHHHHHHHH Confidence 66554444 44445555567888888887766431 1 1 13469999999999998887764 888 Q ss_pred HHhCCCCCHHHHHHHhCC-----------CCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCC----CCCCCCCCCC Q lcl|NC_016654. 465 WSVASAASTKTKVAYLHE-----------DWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTE----NDPATDPEAV 529 (533) Q Consensus 465 l~~aGi~S~et~v~~l~~-----------~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~ 529 (533) ++++|+||.+++...+.- ..++++.+ ++ .+..|. ....+...++.+.+ +..+.+++.. T Consensus 456 ~~~~G~i~~~Evr~~L~~~~~~g~~~l~~~~~~ed~e-~~-~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 528 (537) T protein:vir:10 456 AFEMGAVDGVDVNEYLRMDPTLGFTSITPAMRPTDAE-DI-DVDDEG-----KPVRIIEDQPAPSEMFGATSSGESANDP 528 (537) T ss_pred HHHcCCCCHHHHHHHHhccCccccccccCCCChhhhh-cc-cCCccC-----CcCCCCCCCCCccccCCCCccccccCCC Confidence 889999999887665421 11111110 00 011110 00011111111111 0011111222 Q ss_pred CCCC Q lcl|NC_016654. 530 DEGE 533 (533) Q Consensus 530 ~d~~ 533 (533) .++. T Consensus 529 ~~~~ 532 (537) T protein:vir:10 529 RDSG 532 (537) T ss_pred ccCc Confidence 2222 No 100 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=99.28 E-value=3.3e-11 Score=78.04 Aligned_cols=491 Identities=10% Similarity=0.011 Sum_probs=196.4 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhH--hhcCCHHHHHHHHhccCcchhhHHHHHH----------HHHHHHHhcccCC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHV--WWEGDLDKLATFYGAEGRTSPSGIKART----------KAAYEAFHGRTPT 68 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~--w~~gd~~~l~~~y~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~ 68 (533) |-|- .|+==+. ..+.+.+.+. +......++..+.. .....|...+ ..++...+..... T Consensus 1 ~~~~--~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~r~----~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~~ 70 (651) T protein:vir:80 1 MKLA--TTTTDKN----RQTYDETHDVSSYVKKEYKRFCDARQ----VCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGD 70 (651) T ss_pred Cccc--ccccchh----hhhhhhhHHHHHHHHHHHHHHHHHhh----hhhhhHHHHHHhhcccHHHHHhhccccccccCC Confidence 3332 1111110 1112222221 11111111111100 0001111111 1111111111111 Q ss_pred CCCcccceeecChHHHHHHHHHHhh----cCCCceEeeCC-----CchHHHHHHHHHH----hhccHHHHHHHHHHHHhh Q lcl|NC_016654. 69 ATGRAPKRYHAPIPGVIAKLSTTEL----FSEQLKFLDAG-----KSKEVQARADLIF----NTPRFHSSLVEAGESCSA 135 (533) Q Consensus 69 ~~g~~~~~~~~n~~k~i~~~~a~ll----~~e~~~i~~~~-----~~~~~~~~l~~i~----~~n~f~~~~~~~~~~~~~ 135 (533) +.-..+.++..|.....|+.+...| |+.+..+.+.+ .+...++.++.++ ..++|......++.+|.. T Consensus 71 ~~~~~rs~~~~~~v~~~ve~~~~~l~~~~~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~ 150 (651) T protein:vir:80 71 VNADWRHKITTGKAFEAIETIHAYLMSATFPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLI 150 (651) T ss_pred CCCCCCccccChhHHHHHHHHHHHHHHhhcCCCceeEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcc Confidence 2222355788888888887665544 44443344322 1122445566664 477899999999999999 Q ss_pred hCCEEEEEEEcCCC-----------------------------CCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCC Q lcl|NC_016654. 136 LSGSFQRIVWDPTI-----------------------------ADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDG 186 (533) Q Consensus 136 ~G~~~~~~~~D~~~-----------------------------~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~ 186 (533) +|.+++|++||... .+.|+|+.|++..|++--+-..+..+.|+.+.....+ T Consensus 151 ~G~~i~kv~we~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~dp~a~~~~d~~~v~~~~~t~~ 230 (651) T protein:vir:80 151 TGNSVLALPWRVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFYDPNVTDPNRGAFIRKLTKTKA 230 (651) T ss_pred cCceEEEEeecceeeeeehheeccccccccccceeeeccceeeeceeEEEEecHHHeeecCCCcCccccceeeeeeeeHH Confidence 99999999997421 1357889999988887433345555555432211100 Q ss_pred -------ceEEEEE------EE-----ecCeeEEEEEEeccCC---cccceeehhhccc-----cccccccccccCCcee Q lcl|NC_016654. 187 -------QEVWRHL------ER-----HESGYIVHAVYKGTAT---SLGWMMALTDHPA-----TRDIAVEGADEGRGAY 240 (533) Q Consensus 187 -------~~~y~~l------E~-----h~~~~I~~~~y~~~~~---~lG~~v~l~~~~~-----~~~~~~~~~~~~~~~~ 240 (533) ..+|.-+ +. ++...-...-+++.+. .--++|.+-++-. ...+...+....+... T Consensus 231 ~l~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~~d~e~~~~~~~~v~~~g~~i 310 (651) T protein:vir:80 231 DILNLLSEGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHLENKTYHDVVVTIMGNEV 310 (651) T ss_pred HHHHHHhcccccchhhHHHHhhhccccccCCccccccccCCCccccccccceEEEEEEEEeeccCCceEEEEEEEcCcEE Confidence 0011000 00 0000000000000000 0000111110000 0000000000000001 Q ss_pred e---cCCC--ccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHH-hCcceeeechHHhcC Q lcl|NC_016654. 241 V---ETGV--KDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFR-IGAGKVHASESVLTN 314 (533) Q Consensus 241 ~---~~g~--~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~-~~~~~i~v~~~~l~~ 314 (533) + ++.. ..||+.+.+.. .+ .+.+|+|..... .+.+..+|....++.+.+. .+...+.|+.+.+.+ T Consensus 311 l~~~~~~~~~~~Pf~~~~~~~--------~~-~~~yG~g~~~~~-~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~ 380 (651) T protein:vir:80 311 LRFEQNPYWCGRPFVIGTYIP--------TA-RQPYAMGALQPN-LGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQ 380 (651) T ss_pred ecccccCCCCCCCeeeeccee--------cC-ccccCCChHHHH-hHHHHHHHHHHHHHHHHHHHHhCCcEEecCCcccc Confidence 1 1111 12444433322 11 367899999875 4888999999999988874 556666675443221 Q ss_pred CCCccccccCcchhhhhhccccccccccccccceeeechhh-hhHHHHHHHHHHHHHHHHhhCCChhhcccCC--Ccchh Q lcl|NC_016654. 315 LGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAI-RVLEHDQGAALLLREVLRKTGYSPVSLGLSD--EVAQT 391 (533) Q Consensus 315 ~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i-r~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~--~~~~T 391 (533) ... ..+.+. .+++. ...++ +..+++.- ........++.+-..+...+|++.-..|.+. .+..| T Consensus 381 ~~~---l~~~pg-~vi~~------~~~~~----~~~l~~~~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~T 446 (651) T protein:vir:80 381 PED---VYTEPG-KVFLV------SDHGD----LQPLANQSSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVT 446 (651) T ss_pred HHH---hhcCCC-ceEEe------cCCCC----ceeeccCcccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhcc Confidence 100 011111 11110 01111 22222211 1123345566666677888888876666533 34579 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcc--------CCCC-------CCceeEEEEeCCCCCCCH Q lcl|NC_016654. 392 ATEASGKKDLTVKTTRAKARHFGS-ALGPLSTTCLRVDAIKFP--------GKGA-------APSEELELEWPKFARESD 455 (533) Q Consensus 392 atai~~~~~~l~~~~~~~~~~~~~-al~~li~~il~l~~~~~~--------~~~~-------~~~~~v~i~f~d~i~~d~ 455 (533) |++|..+.+..........+.|.. .+..|++.++++...... +... ....+++++++= ++... T Consensus 447 AteI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~i-v~~g~ 525 (651) T protein:vir:80 447 AAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRL-VPIGS 525 (651) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCccceeeeeee-eeccH Confidence 999999998888888888888876 678888888776542211 1000 001123333321 11121 Q ss_pred ---HHHHHHHHHHHhCCCCCHHHHHHHhCCCCCH-HHHHHHHHHHHHhhhcccCcccc-ccccCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 456 ---LAKAQTVQAWSVASAASTKTKVAYLHEDWDD-ERVQEEADLIDNANTVSAPTFGF-GTDQPPLPTENDPATDPEAVD 530 (533) Q Consensus 456 ---~e~a~~~~~l~~aGi~S~et~v~~l~~~~~d-ee~~~El~rI~~E~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 530 (533) .+..+.++++.+ .-..+.. .|.... ......++++.+..+.-++..-. ..++.+....... -..+... T Consensus 526 ~~~~~r~~~~~~l~~-----~~q~~~~-~p~~~~~~~~~~~~~~l~~~~g~~~~~~~l~~~~q~~~~~~~~~-~~~q~~~ 598 (651) T protein:vir:80 526 DHVIERKQYIEDRLT-----FIQAVAQ-VPEMGQLVDYKRILVDLLQHWGFEEPEAYLKQQDQQAPANPQEA-LLSQAKD 598 (651) T ss_pred HHHHHHHHHHHHHHH-----HHHhhcc-CCccchhhhHHHHHHHHHHHcCCCCcHHhcCCCccchhhhhhHH-HHhhHHH Confidence 122222222211 1111111 111111 01112222222222211111000 0000000000000 0000000 Q ss_pred ---CCC Q lcl|NC_016654. 531 ---EGE 533 (533) Q Consensus 531 ---d~~ 533 (533) +.+ T Consensus 599 ~~~~a~ 604 (651) T protein:vir:80 599 VGGQAM 604 (651) T ss_pred HHHHHH Confidence 000 No 101 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.28 E-value=1.2e-10 Score=74.94 Aligned_cols=419 Identities=11% Similarity=0.029 Sum_probs=182.6 Q ss_pred CCCC--CCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceee Q lcl|NC_016654. 1 MSLP--EANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYH 78 (533) Q Consensus 1 ~~~~--~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 78 (533) |.|= +...+.- ..+.+...+.. .+......+|.... . .+... ..-+.+ T Consensus 1 ~~~~m~~~~~~~~--~~D~~~~~~~~------~~g~~~~~~~~~~~-~---~~~~l------------------~~~Y~~ 50 (435) T protein:vir:79 1 MGVFMSDKVKAIT--KEDGYNEIFGS------KDGTFRPNAFYMQR-A---AFKAL------------------SQFYEE 50 (435) T ss_pred CCcccccccccch--hhcchhhhhcc------cccccccCcccCCc-C---CHHHH------------------HHHHhc Confidence 5543 2211111 11111110100 00000000000000 0 00000 011234 Q ss_pred cChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEE Q lcl|NC_016654. 79 APIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFV 158 (533) Q Consensus 79 ~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v 158 (533) ..+++.||+..|+-++.+...|+...+.+ .+...++.=+++..+.+++..+-.+|++++.+-+..+. .+ T Consensus 51 ~~l~~~~Vd~~aed~~r~g~~i~g~~~~~----~~~~~~~~l~~~~~l~~a~~~~rl~G~~~i~i~~~d~~--~~----- 119 (435) T protein:vir:79 51 DGMARRIVDVIPEEMVTPGFKVDGVKNEK----SFKSRWDELRLNAKIIDALSWSRLFGGSAILAVVADNK--ML----- 119 (435) T ss_pred CchhhhhhccchHHhhcCCceecCCChHH----HHHHHHHHhhHHHHHHHHHHhhhccccEEEEEEecCCC--Cc----- Confidence 46999999999999999987776543333 34444444478899999999999999998887663221 11 Q ss_pred cCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecC--eeEEEEEEeccCC-cc-cceeehhhccccccccccccc Q lcl|NC_016654. 159 DADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHES--GYIVHAVYKGTAT-SL-GWMMALTDHPATRDIAVEGAD 234 (533) Q Consensus 159 ~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~--~~I~~~~y~~~~~-~l-G~~v~l~~~~~~~~~~~~~~~ 234 (533) --|+-..|.+..+..++.+.-.-. .+.+ .-..+ |..+ .|.-++. .+ +.+| T Consensus 120 ----~~Pl~~~g~i~~i~v~d~~~i~~~-~~~~--dp~sp~fg~P~--~y~v~~~~~~~~~~i----------------- 173 (435) T protein:vir:79 120 ----KSPVKPGAQLEDIRVYDRYQITIH-ERET--NARSVRYGEPK--LYKISPGGDIPEFFV----------------- 173 (435) T ss_pred ----ccccccCCceeeEEeechhhccch-hhcc--CCcccccCcce--EEEEecCCCCCceEE----------------- Confidence 013323344444332221100000 0000 00000 1111 1111110 00 0000 Q ss_pred cCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceee-ec--hHH Q lcl|NC_016654. 235 EGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVH-AS--ESV 311 (533) Q Consensus 235 ~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~-v~--~~~ 311 (533) +.++ .+.|.....+ +. ..+....+|.|++...+.+.+..++.+......-+...+.+++ ++ ..+ T Consensus 174 --------H~SR--li~~~g~~~p--~~-~~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~ 240 (435) T protein:vir:79 174 --------HYSR--ICIIDGERVS--NE-KRRQNDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQQAVWKARDLALM 240 (435) T ss_pred --------ccee--EEEecCCcch--hh-hccccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccchhHHHh Confidence 0011 0111111001 11 1233467899998655667777888777776665544444333 32 122 Q ss_pred hcCCCCccccccCcc--hhhhhhccc-cccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhh-cccCCC Q lcl|NC_016654. 312 LTNLGMGQGVSLDEE--QEVYSRVGS-GGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVS-LGLSDE 387 (533) Q Consensus 312 l~~~~~~~~~~~d~~--~~~~~~~~~-~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~-~g~~~~ 387 (533) +..... ........ ...++.... ...+ + ....++.++-+ +......++.+.+.++..+|+|... ||...+ T Consensus 241 ~~~~~~-~~~~~~r~~~~~~~~~~~~~~~i~-~--~~e~~e~~~~~--lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~ 314 (435) T protein:vir:79 241 CDDEEG-RYAARLRLAQVDDESGVGKAIGID-A--TDEEYEVLNSD--VSGVPEFLQEKIDRIVALTGIHEIIIKNKNTG 314 (435) T ss_pred hcCccc-hHHHHHHHHHHHHhcCCCCceeEe-c--CCcceEEEecc--cCCHHHHHHHHHHHHHhhhCCCeeeeccCCcc Confidence 211111 10000000 000111111 0111 1 11235555433 3455677888889999999999755 465555 Q ss_pred cc-hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHH Q lcl|NC_016654. 388 VA-QTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWS 466 (533) Q Consensus 388 ~~-~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~ 466 (533) |- .|+.+-...+...+... .+..++..|..|+..+++ ..+++|.|++-..++..|+|++..+.. T Consensus 315 glnstgd~d~~~yyd~i~~~--Qe~~l~p~l~~l~~li~~-------------s~d~~~~f~pL~~~sekEkAei~~~~a 379 (435) T protein:vir:79 315 GVSASQNTALETFYKLIDRK--RVEDYKPILEFLLPFMIS-------------ETEWSIEFEPLSVPSDKDKAEIMAKNV 379 (435) T ss_pred ccccchhHHHHHHHHHHHHH--HHHHHHHHHHHHHHHhhc-------------CCCCeEEeCCCCCCCHHHHHHHHHHHH Confidence 43 56666555554444332 235677788877776532 146889999999999988887655442 Q ss_pred hCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 467 VASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 467 ~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) . . ..+++. .+-.+.+++.++++.+-.+.+ ...+.....++.++...++....|| T Consensus 380 ~--a--~~~~~~--~g~i~~~e~r~~L~~~~~~~~-------~~~~~~~~~~~~~d~~~~~~~e~g~ 433 (435) T protein:vir:79 380 E--S--VVKLKA--EQAINLKETRDTLRSICPDLK-------IMDNDNIELPEPEDLDPEPGQEGGL 433 (435) T ss_pred H--H--HHHHHh--cCCCCHHHHHHHHHHhccccC-------CCCcccccCCccccCCCCCCCCCCC Confidence 2 1 112222 334566666666532221211 1111111111111111111112222 No 102 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=99.27 E-value=2.3e-10 Score=73.39 Aligned_cols=497 Identities=7% Similarity=-0.056 Sum_probs=219.6 Q ss_pred CCCCcCCCcCcch-HHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhccc-CCC------CCccc Q lcl|NC_016654. 3 LPEANTAWPPPEL-AAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRT-PTA------TGRAP 74 (533) Q Consensus 3 ~~~~~~~~pp~~~-~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~------~g~~~ 74 (533) |-...-.=|=|++ +..-.....++.+-+-...+|...|...... ...++...... -+|+.|. -+. .-+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~r~~a~~d-~~fy~G~Qw~~~~~~~l~~~g~ 78 (711) T protein:vir:10 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATY-WKDNWEAAEDD-LKFLGGEQWPSQVRTERELEQR 78 (711) T ss_pred CCcccccccccchhHHHHHhcccCcchHHHHHHHHHHHHHHHHhh-hHHHHHHHHHH-HHHhCCCCCCHHHHHHHHhcCC Confidence 2222222222221 1111111111111111112222222111100 01111111111 1112221 000 00123 Q ss_pred ceeecChHHHHHHHHHHhhcCCCceEeeCCC-----------------------------chHHHHHHHHHHhhccHHHH Q lcl|NC_016654. 75 KRYHAPIPGVIAKLSTTELFSEQLKFLDAGK-----------------------------SKEVQARADLIFNTPRFHSS 125 (533) Q Consensus 75 ~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~-----------------------------~~~~~~~l~~i~~~n~f~~~ 125 (533) ..++.|+.+.+|+...++--...+.+.+.+. ++.++..+..+.+.|+.... T Consensus 79 p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~ 158 (711) T protein:vir:10 79 PCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETE 158 (711) T ss_pred CcEEEcchHHHHHHHhhhHhhCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHHhcChhHH Confidence 3689999999999999998887777766542 12344555667788889999 Q ss_pred HHHHHHHHhhhCCEEEEEEEcC----CCCCceEEEEE-cCCeEEEEE--ecCCceEE--EEEEEEeec------------ Q lcl|NC_016654. 126 LVEAGESCSALSGSFQRIVWDP----TIADNAWIDFV-DADRAIPEF--RWGRLVAV--TFWSELAGG------------ 184 (533) Q Consensus 126 ~~~~~~~~~~~G~~~~~~~~D~----~~~~~~~i~~v-~~~~~~P~~--~~g~~~~v--~f~~~~~~~------------ 184 (533) ...+...+++.|.+|+++++|- ...+.|+|..| ++..++.=+ ..-++.++ +|...+... T Consensus 159 ~s~af~d~~~~G~G~~ev~~d~~~~d~~~~e~~i~~v~~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~yp~~a~ 238 (711) T protein:vir:10 159 YDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATA 238 (711) T ss_pred HHHHHHHhhhcCcceEEEEecccCCCCCCCCeEEeeecChhheeeCccccccChhhhcceeeeecCCHHHHHHhCCchhh Confidence 9999999999999999998762 22467888777 587765422 12234444 233222100 Q ss_pred ---------CC-----ceEEEEEEEecCeeEEEEEEeccCCcccceeeh-hhccc--------------------cc--c Q lcl|NC_016654. 185 ---------DG-----QEVWRHLERHESGYIVHAVYKGTATSLGWMMAL-TDHPA--------------------TR--D 227 (533) Q Consensus 185 ---------~~-----~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l-~~~~~--------------------~~--~ 227 (533) +. ..-.++.|++..-...+.++...++. +...+- ..+.. +. . T Consensus 239 ~~~~~~~~~~~~~~~~~~~vrv~E~~~r~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~ 317 (711) T protein:vir:10 239 EPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGR-SFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKI 317 (711) T ss_pred hhhhcccccccCcccCcceeeEEEEEeeeeeeeEEEeecCCc-eeccCcchhHHHHHHhcCchhhhhhhhceeeEEEEEE Confidence 00 01124455554433344444333221 111110 00000 00 0 Q ss_pred ccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHH-hCcceee Q lcl|NC_016654. 228 IAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFR-IGAGKVH 306 (533) Q Consensus 228 ~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~-~~~~~i~ 306 (533) ..+.+.+ + .... .+...||+.+..... +-..+..+-+.+.. ++|..+.+|...|++.+.+. .++.+++ T Consensus 318 ~G~~~L~-~-~~p~-~~~~~P~vp~~g~r~-------~~d~~~~~~G~vr~-~~d~Qr~~N~~~s~~~~~l~~~~~~~~~ 386 (711) T protein:vir:10 318 TGANVLE-G-PVEI-PSTTIPVIPVWGKSL-------IIKKKEIFRSIIRH-SKDAQRMANYWDSAATETVALAPKAPFI 386 (711) T ss_pred ecceeec-C-CCCC-CCCcccEEEEeeeee-------ccccccccchhhhh-hhhhHHHHHHHHHHHHHHHHhcCCCcee Confidence 0000100 0 0000 111123332221111 00112223334554 67999999999999999985 4677888 Q ss_pred echHHhcCCCCc-cccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccC Q lcl|NC_016654. 307 ASESVLTNLGMG-QGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLS 385 (533) Q Consensus 307 v~~~~l~~~~~~-~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~ 385 (533) ++.+.+.+.... ......+ ..+..++ .+......++...+.--...++..++.....+...+|++...+|.. T Consensus 387 ~~~gai~~~~~~~~e~~~~~--~~vi~~~-----~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~ 459 (711) T protein:vir:10 387 GSEGNVEGREDEWEQANTKN--FSLLTYI-----PQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAM 459 (711) T ss_pred ecCcccCChHHHHHhccccC--CCeeEec-----ccccCcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCC Confidence 877766432110 0000000 0111111 1111111234343222235678888888888999999999999976 Q ss_pred CCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCC-------C------------------- Q lcl|NC_016654. 386 DEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAA-------P------------------- 439 (533) Q Consensus 386 ~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~-------~------------------- 439 (533) + +..||.+|..+..............+..+++.+.+.+|.+....+...... . T Consensus 460 ~-n~~Sg~ai~~~q~qg~~~l~~~~dn~~~~~~~~g~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~ 538 (711) T protein:vir:10 460 G-NETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWV 538 (711) T ss_pred c-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccce Confidence 5 457999999999888877777888888888888887777654332111000 0 Q ss_pred --------ceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHH-----HHHhCCCCCHHHHHHHHHHHHHhhhcccCc Q lcl|NC_016654. 440 --------SEELELEWPKFARESDLAKAQTVQAWSVASAASTKTK-----VAYLHEDWDDERVQEEADLIDNANTVSAPT 506 (533) Q Consensus 440 --------~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~-----v~~l~~~~~dee~~~El~rI~~E~~~~~~~ 506 (533) .++|.|+=..+.+.-+.+.+..+..+. +.++.-.. +.++. ++.. +.+-.++|++-+.+..+. T Consensus 539 ~~nDi~~g~~Dv~i~~~p~~~s~r~~~~~~l~ql~--~~~p~~~~~~~~~il~~~-d~p~--~~el~e~lr~~~~~~~~~ 613 (711) T protein:vir:10 539 TIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFA--QAVPSAAAVMADLIAQNM-DWPG--ADVIAERLKKIVPPNVLS 613 (711) T ss_pred eeeccceeeeEEEEeeccCchhHHHHHHHHHHHHH--hhcchhhhHHHHHHHHhc-CCCC--HHHHHHHHHhhcCcccCc Confidence 012222222222222344444444442 33322111 11222 1211 222233333322211110 Q ss_pred cccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 507 FGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 507 ~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) . ++.+ ..............+ T Consensus 614 ~----~~~~---~~qq~~~e~qq~~~~ 633 (711) T protein:vir:10 614 K----DERE---AIEEDMPEQTEPTPE 633 (711) T ss_pred c----hhhh---HHHHHHHHHHHHHHH Confidence 0 0000 000000000000000 No 103 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=99.23 E-value=1.8e-10 Score=73.95 Aligned_cols=491 Identities=8% Similarity=-0.079 Sum_probs=220.0 Q ss_pred hHHHHHHHHhhhHhhcCCHHHHHHHH----hccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHH Q lcl|NC_016654. 15 LAAVTARVAESHVWWEGDLDKLATFY----GAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLST 90 (533) Q Consensus 15 ~~~~~~~~~~~~~w~~gd~~~l~~~y----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a 90 (533) +++....+.+.+.||.-+.+.-...+ .....|..+||......... ...+..+|+.+.+|+... T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~------------~q~rp~~N~i~~~i~~v~ 68 (725) T protein:vir:92 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRISQWDDWLSQYTT------------LQYRGQFDVVRPVVRKLV 68 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHH------------hcCCCcccchHHHHHHHH Confidence 33333445555555543332221111 11112222333322211111 011235799999999988 Q ss_pred HhhcCCCceEeeCCCc-------hHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEc---CC-CCCceEEEEEc Q lcl|NC_016654. 91 TELFSEQLKFLDAGKS-------KEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWD---PT-IADNAWIDFVD 159 (533) Q Consensus 91 ~ll~~e~~~i~~~~~~-------~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D---~~-~~~~~~i~~v~ 159 (533) ++--...+.+.+.+.+ +.++..+..+...++.......+...+.+.|.+|+.++.| ++ .++.++|..++ T Consensus 69 g~e~~nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~ 148 (725) T protein:vir:92 69 SEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPTSNNQVIRREP 148 (725) T ss_pred hhHHhCCcceEEecCCccHHHHHHHHHHHHHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEee Confidence 8876666666664432 2234556667778889999999999999999999998754 22 33455555432 Q ss_pred ----CCeEEEEEec--CCceEE--EEEE-------------EEee-----------cC------CceEEEEEEEecCeeE Q lcl|NC_016654. 160 ----ADRAIPEFRW--GRLVAV--TFWS-------------ELAG-----------GD------GQEVWRHLERHESGYI 201 (533) Q Consensus 160 ----~~~~~P~~~~--g~~~~v--~f~~-------------~~~~-----------~~------~~~~y~~lE~h~~~~I 201 (533) ..+++.-+.. -.++.+ +|+. .+.. .+ .....+++|+|+...+ T Consensus 149 i~~~~~~V~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~e~~~r~~~ 228 (725) T protein:vir:92 149 IHSACSHVIWDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEK 228 (725) T ss_pred ccCChhhcccCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCCCeEEEEEEEEEEEE Confidence 2234332211 111111 1111 1100 00 0122466777766555 Q ss_pred EEEEEeccCCcccceeehhhcccccccccccccc--------------------CCceeecCC--CccceeEEecCCccc Q lcl|NC_016654. 202 VHAVYKGTATSLGWMMALTDHPATRDIAVEGADE--------------------GRGAYVETG--VKDLTAAYVPNVTPN 259 (533) Q Consensus 202 ~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~--------------------~~~~~~~~g--~~~~~~~~~pn~~~~ 259 (533) .-.+|...+..-|.-+.++.... ........+. .+....+.. .+.-.+.|+|..... T Consensus 229 ~~~~~~~~d~~~g~~~~~~~~~~-~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r 307 (725) T protein:vir:92 229 KETAFIYQDPVTGEPVSYFKRDI-KDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEW 307 (725) T ss_pred eeeEEeecCCCCCceeecChhhH-HHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCCCCceeeEEEEeee Confidence 44455443332232222111000 0000000000 000000100 111123344432211 Q ss_pred ccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHH-hCcceeeechHHhcCCCCccccccCcchhhhhhcccccc Q lcl|NC_016654. 260 PEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFR-IGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGF 338 (533) Q Consensus 260 ~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~-~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~ 338 (533) . .-...+++-+.+.+ ++|..+.+|...|...+.+- .++.+..+....+.... ...... +...|...+.... T Consensus 308 ~----~~~g~~~~~G~vr~-~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~-~~~~~~--~~~~~~~~~~~~~ 379 (725) T protein:vir:92 308 G----FVEDKEVYEGVVRL-TKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFE-HMYDGN--DDYPYYLLNRTDE 379 (725) T ss_pred e----ccCCcccccceecc-chhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHH-HHHhcc--Cccceeecccccc Confidence 1 11234554455654 67999999999999999884 44555556555553210 000001 1111111111111 Q ss_pred ccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 339 NANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALG 418 (533) Q Consensus 339 ~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~ 418 (533) ..|.-....+..+.+.--..+++..++.....|...+|++...+|-.+ +..||.+|..+..............+..+.+ T Consensus 380 ~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~-n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~ 458 (725) T protein:vir:92 380 NNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNG-GQVAYDTVNQLNMRADLETYVFQDNLATAMR 458 (725) T ss_pred ccccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111122333333222347788888888899999999999999654 4578999999888877777777777777887 Q ss_pred HHHHHHHHHHHhhccCCC---------CC------------------------CceeEEEEeCCCCCCCHHHHHHHHHHH Q lcl|NC_016654. 419 PLSTTCLRVDAIKFPGKG---------AA------------------------PSEELELEWPKFARESDLAKAQTVQAW 465 (533) Q Consensus 419 ~li~~il~l~~~~~~~~~---------~~------------------------~~~~v~i~f~d~i~~d~~e~a~~~~~l 465 (533) .+.+.+|.+-...+.... .. ..++|.|+=..+.+.-+.+.+..++++ T Consensus 459 ~~g~~lL~lI~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~ql 538 (725) T protein:vir:92 459 RDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILEL 538 (725) T ss_pred HHHHHHHHHHHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHHHHHHHH Confidence 777777765433221110 00 012233322222222244555555555 Q ss_pred HhCC--CCCHH-HHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCC----CCCCCCCCCCCCC Q lcl|NC_016654. 466 SVAS--AASTK-TKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTEN----DPATDPEAVDEGE 533 (533) Q Consensus 466 ~~aG--i~S~e-t~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~d~~ 533 (533) ..+- ..+.- ..+...-...+-+-+.+.+++|++...+... . ++....+. ..........+.| T Consensus 539 ~~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~----~--~~~~~e~~q~~~~~qqa~~~q~~~e 607 (725) T protein:vir:92 539 LGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGV----K--KPETPEEQQWLVEAQQAKQGQQDPA 607 (725) T ss_pred HHhcccchhHHHHHHHHHhhcccchHHHHHHHHHHhhhchhcc----C--CccchhhhHHHHHHHHHHHhhhHHH Confidence 4321 11110 1121111112223345556666654432211 0 00000000 0000000001111 No 104 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.23 E-value=2.2e-10 Score=73.58 Aligned_cols=413 Identities=11% Similarity=0.055 Sum_probs=177.5 Q ss_pred HHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCc--ccceeecChHHHHHHHHHHhhcCCCce Q lcl|NC_016654. 22 VAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGR--APKRYHAPIPGVIAKLSTTELFSEQLK 99 (533) Q Consensus 22 ~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~--~~~~~~~n~~k~i~~~~a~ll~~e~~~ 99 (533) |...+ -+.|..+..+... +..+.......|- ..-..+..+++.||+..|.-++.+... T Consensus 1 ~~~~~------~d~~~~~~~~~~~--------------~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~ 60 (427) T protein:vir:10 1 MKIVK------HDGYNDIFNGGAD--------------GSPKPFFMSDASYHVGSFYNDNATAKRIVDVIPEEMVTAGFK 60 (427) T ss_pred CCccc------cchHHHHhhcCCC--------------CcccCccccCchHHHHHHHHcCchhhhhhccchHHhhcCCcc Confidence 11110 0112222211100 0000000000000 011336678999999999999998877 Q ss_pred EeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEE Q lcl|NC_016654. 100 FLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWS 179 (533) Q Consensus 100 i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~ 179 (533) |...++. +.+...++.=+++..+.+++..+-.+|++++.+-++.+ ..+. -|+-..|.+..+..+. T Consensus 61 i~g~~~~----~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~--~~l~---------~p~~~~g~l~~l~v~d 125 (427) T protein:vir:10 61 MSGVKDE----KEFKSLWDSYKLDSSLVDLLCWARLYGGAAMVAIIKDN--RMLT---------SQAKPGAKLEGVRVYD 125 (427) T ss_pred ccCccHH----HHHHHHHHHhhHHHHHHHHHHhccccceeEEEEEecCC--Cccc---------cccCCCcceeEEEEec Confidence 7653322 33444455557899999999999999999998877532 1111 1222234444432222 Q ss_pred EEeecCCceEEE--EEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCc Q lcl|NC_016654. 180 ELAGGDGQEVWR--HLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVT 257 (533) Q Consensus 180 ~~~~~~~~~~y~--~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~ 257 (533) .+.-.-.. .+. .-+.+ |..+ .|.-.+.+-+.. ..| +.++ .+.|..... T Consensus 126 ~~~~~~~~-~~~dp~s~~f--g~P~--~y~v~~~~~~~~----------------------~~i-H~SR--li~~~g~~~ 175 (427) T protein:vir:10 126 RFAITVEK-RVTNARSPRY--GEPE--IYKVSPGDNMQP----------------------YLI-HHSR--VFIADGERV 175 (427) T ss_pred hhcccccc-cccCcccccc--Ccce--EEEEecCCCCcc----------------------eEE-cccc--EEEecCCCc Confidence 11000000 000 00000 1111 111111000000 000 1111 111111110 Q ss_pred ccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceee-ec--hHHhcCCCCccccccCcchhhhhh-- Q lcl|NC_016654. 258 PNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVH-AS--ESVLTNLGMGQGVSLDEEQEVYSR-- 332 (533) Q Consensus 258 ~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~-v~--~~~l~~~~~~~~~~~d~~~~~~~~-- 332 (533) .+. ..+....+|.|++..++.+.+..++.+......-+...+.+++ ++ ..++...+ ..... ......+.. T Consensus 176 --p~~-~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~k~~~~v~k~~~l~~~~~~~~-~~~~~-~~r~~~~~~~~ 250 (427) T protein:vir:10 176 --AQQ-ARKQNQGWGASVLNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDD-AQYAA-RLRLAQVDDNS 250 (427) T ss_pred --hhh-hcccCCcccchhhhHHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHHhcCcc-chHHH-HHHHHHHHHhc Confidence 111 1234467899998766666666777766655554533333333 22 12222111 11000 000001110 Q ss_pred -ccc-cccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhc-ccCCCcc-hhHHHHHHHhhhHHHHHHH Q lcl|NC_016654. 333 -VGS-GGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSL-GLSDEVA-QTATEASGKKDLTVKTTRA 408 (533) Q Consensus 333 -~~~-~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~-g~~~~~~-~Tatai~~~~~~l~~~~~~ 408 (533) ... ...+. ....++.++-+ +......++.+.++++..+++|...+ |-..+|- .|+.+-...+... ++. T Consensus 251 ~~~~~~~l~~---~~e~~e~~~~~--lsgl~~~~~~~~~~iaaa~~IP~t~L~G~sp~Glnstgd~D~~nyyd~---i~~ 322 (427) T protein:vir:10 251 GVGRAIGIDA---ETEEYDVLNSD--ISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKL---VDR 322 (427) T ss_pred Ccccceeeec---CCCceeEEecc--cCChHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHH---HHH Confidence 000 01111 11234544433 34456678888889999999987644 5444443 5666544444333 444 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHH Q lcl|NC_016654. 409 KA-RHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDE 487 (533) Q Consensus 409 ~~-~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~de 487 (533) +| ..++..|..|+..+++ ..+++|.|++-..++..|+|++..+... . ..+++. .+-.+.+ T Consensus 323 ~Qe~~l~p~l~~l~~~i~~-------------s~~~~~~f~pL~~~s~kEkaei~~~~a~--a--~~~~~~--~gvi~~~ 383 (427) T protein:vir:10 323 KREEDYRPLLEFLLPFIVD-------------EEEWSIEFEPLSVPSKKEESEITKNNVE--S--VTKAIT--EQIIDLE 383 (427) T ss_pred HHHHHHHHHHHHHHHHhhc-------------CCCcEEEeCCCCCCCHHHHHHHHHHHHH--H--HHHHHh--cCCCCHH Confidence 43 5678888888776542 1368999999999999888776443321 1 112222 2235555 Q ss_pred HHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 488 RVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 488 e~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ++.++|..+-.+.+..+- -....++++...+.+|..+ +++.|.. T Consensus 384 e~r~~L~~~~~~~~~~~~-~~~~~e~~~~~~e~~p~~~-e~~~d~~ 427 (427) T protein:vir:10 384 EARDTLRSIAPEFKLKDG-NNINIREPEETTEPEPGLG-EKLEDEN 427 (427) T ss_pred HHHHHHHhhhccccCCCC-ccccccccchhcCCCCCCC-CCCCCCC Confidence 566666543322221100 0000111111111111111 1111111 No 105 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.21 E-value=4.1e-10 Score=72.06 Aligned_cols=473 Identities=12% Similarity=0.052 Sum_probs=198.0 Q ss_pred CCCCCCcCCCcCcchHHHHHH--HHhhhHhhcCCHHHHHHHHhccC-cchh----hHHH------HHHHHHHHHH--hcc Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTAR--VAESHVWWEGDLDKLATFYGAEG-RTSP----SGIK------ARTKAAYEAF--HGR 65 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~--~~~~~~w~~gd~~~l~~~y~~~~-~~~~----~~~~------~~~~~~~~~~--~~~ 65 (533) |.=-+ .||-|--.|+..... ++--|+=.. .+ .+++-++-.+ .+.+ .... ++..++.++. .+. T Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~ 77 (532) T protein:vir:94 1 MADTD-PTPRPEITYATLQQAQRVDAKRATHT-SL-GLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSFV 77 (532) T ss_pred CCCCC-CCCCcceehhhhhhHhhhhhhhhhhh-hh-hhhhhhhhcccccccccccccccccccccccCcccccccccccc Confidence 44333 346666666654322 111111100 11 1222111111 0000 0000 0000011110 011 Q ss_pred cC-CCCCcc--cceeecChHHHHHHHHHHhhcCCCceEeeCCCc---hHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCE Q lcl|NC_016654. 66 TP-TATGRA--PKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKS---KEVQARADLIFNTPRFHSSLVEAGESCSALSGS 139 (533) Q Consensus 66 ~~-~~~g~~--~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~---~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~ 139 (533) .+ ...|-. .-.....+++.+|+..|+-++.+..+|++.++. +.....|...++.=+++..+.+++..+-.+|++ T Consensus 78 ~~~~~~~~~l~a~Y~~~~l~r~~Vd~~aed~~r~~~~i~~~~~~~~~~~~~~~i~~~~~~l~v~~~l~~a~~~~rlyG~a 157 (532) T protein:vir:94 78 EATSWPGFPTLALLAQLPEYRTMHETPADECVRAWGKITCSSKDELAADKATRITQKLEQYNVRTLVRTVVIHDQAYGGA 157 (532) T ss_pred cccccchHHHHHHHHcCchhhhhhccchHHHhhCCceEeeCCccccchHHHHHHHHHHHhhhHHHHHHHHHHhhhcccce Confidence 11 011111 112356889999999999999999999875432 233345565555557889999999999999999 Q ss_pred EEEEEEcCCCCCceEEEEEcCCeEEEE-EecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEec-cC--Ccccc Q lcl|NC_016654. 140 FQRIVWDPTIADNAWIDFVDADRAIPE-FRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKG-TA--TSLGW 215 (533) Q Consensus 140 ~~~~~~D~~~~~~~~i~~v~~~~~~P~-~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~-~~--~~lG~ 215 (533) ++.+-++.++... .+-.+-.+-|. ...|.+..+..++.+. |+=..|.. ++ ..+|. T Consensus 158 ~i~i~v~~~~~~~---~~~~p~~l~~~~I~~g~~~~l~vld~~~------------------v~p~~~~~~dp~sp~fg~ 216 (532) T protein:vir:94 158 HVFPHLKMDGDSV---PADAPLLLSPSFVQRGCLIGFATIEPMW------------------LSPNAYNATDPTLPSFYK 216 (532) T ss_pred EEEEEeccCCccc---cccccccccccccccceeeEEEeechhe------------------ecccccccccccccccCC Confidence 9887775432110 00111111111 2233333332221110 00000000 00 00111 Q ss_pred eeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHH Q lcl|NC_016654. 216 MMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLM 295 (533) Q Consensus 216 ~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~ 295 (533) +--+. +. .+ ..| +.++ .+.|..+..+. . ..+....+|+|++.... +.+..++.+..... T Consensus 217 P~~y~-----------v~-~g--~~i-H~SR--li~f~g~~~p~--~-~~~~~~~~G~Svlq~~~-~~l~~~~~t~~~~~ 275 (532) T protein:vir:94 217 PDSWI-----------AT-SG--KKI-HSSR--IHTVVGRPVGD--M-LKAAYSFRGVSISQLAM-PYVDNWLRTRQSVS 275 (532) T ss_pred ceeEE-----------Ec-cC--eee-ccce--EEEecCCCchh--h-hccccccccccHHHHHH-HHHHHHHHHHHHHH Confidence 00000 00 00 000 0000 11121111111 1 11223457999998754 66667776665555 Q ss_pred HHHHhCcceeeechHHhcCCCCccccccCcchhhhhhccc--cccccccccccceeeechhhhhHHHHHHHHHHHHHHHH Q lcl|NC_016654. 296 RDFRIGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGS--GGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLR 373 (533) Q Consensus 296 ~~~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~--~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~ 373 (533) .-+......++.- .+..-...+....+...-..+..... +..-.. +....+++++- .+......++.+.+.++. T Consensus 276 ~l~~~~~~~v~k~-~~a~~ls~~~~~~~~~r~~~~~~~~~n~g~~~id-~~~e~~e~~~~--~lsgl~~~l~~~~~~iAa 351 (532) T protein:vir:94 276 DTVKQFSMTNLAT-DMAQLLAPGGAQSLDARLQLFNLYRDNRNIGALD-KGTEEIQQTNT--PLSGLDSLQAQSQEQMAA 351 (532) T ss_pred HHHHhcCCceeee-chHHhhcchhHHHHHHHHHHHHhhcCCccceEEc-CCCceeEEEec--ccCCHHHHHHHHHHHHHh Confidence 4444444333321 11111011111111100011111100 001111 11123555543 333456677888889999 Q ss_pred hhCCChhh-cccCCCcc-hhHHHHHHHhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCC Q lcl|NC_016654. 374 KTGYSPVS-LGLSDEVA-QTATEASGKKDLTVKTTRAKA-RHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKF 450 (533) Q Consensus 374 ~~g~s~~~-~g~~~~~~-~Tatai~~~~~~l~~~~~~~~-~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~ 450 (533) .+|+|... ||-..+|- .|+..-+..+ +..++.++ ..++..|..|+..++... + |. ...+++|.|++- T Consensus 352 a~~IP~t~LfG~sp~GlnstGe~D~~~y---yd~I~s~Qe~~l~p~le~l~~~l~~s~---~-g~---~~~d~~~~f~pL 421 (532) T protein:vir:94 352 VSHIPLVKLLGITPNGLNASSDGEIRVW---YDFIAGYQATNLTPLMEWIIDLIQLSE---Y-GQ---IDPGLAWEWSPL 421 (532) T ss_pred HhCCCeeeeecCCcccccccchHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHh---c-CC---CCCCceEEeCCC Confidence 99998763 56544443 4565443333 44455554 567788888887765421 1 21 234699999998 Q ss_pred CCCCHHHHHHH-------HHHHHhCCCCCHHHHHHHhCCC----CCH-----HHHHHHHHHHHHhhh-cccCccccc-cc Q lcl|NC_016654. 451 ARESDLAKAQT-------VQAWSVASAASTKTKVAYLHED----WDD-----ERVQEEADLIDNANT-VSAPTFGFG-TD 512 (533) Q Consensus 451 i~~d~~e~a~~-------~~~l~~aGi~S~et~v~~l~~~----~~d-----ee~~~El~rI~~E~~-~~~~~~~~~-~~ 512 (533) ...+..|+|++ .++++.+|++|.+++...+--. +.. ++. ++.+.+.+|.. ...++...+ .+ T Consensus 422 ~~~s~kEkAei~~~~a~a~~~~~~~Gvi~~~Evr~~l~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 500 (532) T protein:vir:94 422 MELDDKELAEVRQLNASTDSTLMELGVIDAKMVQQRLAADPTSGYAGALGERDEL-DDVEEIAKQLMAAALNPPATAPQT 500 (532) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhcCCcccccccccccccc-ccccchhhhhcccccCCCCCCCCC Confidence 88888876554 5778889999998876654211 100 000 01112222221 111111111 11 Q ss_pred cCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 513 QPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 513 ~~~~~~~~~~~~~~~~~~d~~ 533 (533) +.|.++..++..+.+.....+ T Consensus 501 ~~~~~~~~~d~~~~~~~~~~~ 521 (532) T protein:vir:94 501 PNPQPDSEDDQTDNQPDAQAD 521 (532) T ss_pred CCCCCCCCCCCCCCccCCCcc Confidence 111111111111111111111 No 106 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=99.20 E-value=1.4e-10 Score=74.62 Aligned_cols=486 Identities=9% Similarity=-0.032 Sum_probs=192.0 Q ss_pred CCCCCCcCCCcCc-chHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeec Q lcl|NC_016654. 1 MSLPEANTAWPPP-ELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHA 79 (533) Q Consensus 1 ~~~~~~~~~~pp~-~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 79 (533) |.==.....-=.- -++-+...+..+..||.|... .....+...+++..+...-.-+.+++. T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~------------------~~~~~~~~~y~g~~~~~~~~~~s~~~~ 62 (705) T protein:vir:88 1 MAKRRKIKPMDDEQVLRHLDQLVNDALDFNSSELS------------------KQRSEALKYYFGEPFGNERPGKSGIVS 62 (705) T ss_pred CCcccccccCCHHHHHHHHHHHHHHHHhhhhhHHH------------------HHHHHHHHHHhCCCCCcccCCCCcccc Confidence 2111111000000 112333444444455544221 000011111111111111111345666 Q ss_pred ChHHHHHHHHHHhh----cCCCceEeeCC---CchH----HHHHHHHH-HhhccHHHHHHHHHHHHhhhCCEEEEEEEcC Q lcl|NC_016654. 80 PIPGVIAKLSTTEL----FSEQLKFLDAG---KSKE----VQARADLI-FNTPRFHSSLVEAGESCSALSGSFQRIVWDP 147 (533) Q Consensus 80 n~~k~i~~~~a~ll----~~e~~~i~~~~---~~~~----~~~~l~~i-~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~ 147 (533) +.....|+.....| |+-+..+.+.+ .+.. .+++++-+ .+.|+....+..++..|+..|.++++++|+. T Consensus 63 ~~v~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~ 142 (705) T protein:vir:88 63 RDVQETVDWIMPSLMKVFTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEE 142 (705) T ss_pred HHHHHHHHHHHHHHHHhhcCCCceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEecccc Confidence 66666666655543 44444444432 2222 33445553 5566667889999999999999999999964 Q ss_pred CC----------------------------------------------CCceEEEEEcCCeEEEEEecCCceEEEEE--- Q lcl|NC_016654. 148 TI----------------------------------------------ADNAWIDFVDADRAIPEFRWGRLVAVTFW--- 178 (533) Q Consensus 148 ~~----------------------------------------------~~~~~i~~v~~~~~~P~~~~g~~~~v~f~--- 178 (533) .. .+.++|+.|++..|++--+-..+.++.|+ T Consensus 143 ~~~~~~e~~~~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp~a~~~~d~~~~~~~ 222 (705) T protein:vir:88 143 VLKPTFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARFLCHR 222 (705) T ss_pred ccchhhhhhccCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceecCCCCCcccCcEEEEE Confidence 20 04578888888877753221123333222 Q ss_pred EEEeecC-CceEE-----EEEEEec------------C---eeEE-EEEEeccCCcccceeehhhcccccccc-cc---- Q lcl|NC_016654. 179 SELAGGD-GQEVW-----RHLERHE------------S---GYIV-HAVYKGTATSLGWMMALTDHPATRDIA-VE---- 231 (533) Q Consensus 179 ~~~~~~~-~~~~y-----~~lE~h~------------~---~~I~-~~~y~~~~~~lG~~v~l~~~~~~~~~~-~~---- 231 (533) ...+..+ ....| ..+..++ . +.+. +....+.+.+-.+.|.+.++....+.. +. T Consensus 223 ~~~t~~dl~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~~~d~~~d~~~~~ 302 (705) T protein:vir:88 223 EKYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLDVDGDGISEL 302 (705) T ss_pred EeccHHHHHhhcCChhHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEEeeeEecccCCcceee Confidence 1111100 00000 0000000 0 0000 000000000000112111110000000 00 Q ss_pred -ccccCCcee--ecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHH-hCcceeee Q lcl|NC_016654. 232 -GADEGRGAY--VETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFR-IGAGKVHA 307 (533) Q Consensus 232 -~~~~~~~~~--~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~-~~~~~i~v 307 (533) .....+... ++....+||+.+-+...+ .+.+|.|++.. +.++.+.+|..++++++.+. .+..++.+ T Consensus 303 ~~~~~~g~~il~~~~~~~~PF~~~~~~p~~---------~~~~G~g~~~~-~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~ 372 (705) T protein:vir:88 303 RRILYVGDYIISNEPWDCRPFADLNAYRIA---------HKFHGMSVYDK-IRDIQEIRSVLMRNIMDNIYRTNQGRSVV 372 (705) T ss_pred EEEEEeCccccccccCCCCCEEEecceeec---------CccccCChHHH-HhHHHHHHHHHHHHHHHHHHhccCCceec Confidence 000000000 111122455543222111 25679999886 67999999999999999885 46667888 Q ss_pred chHHhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCC- Q lcl|NC_016654. 308 SESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSD- 386 (533) Q Consensus 308 ~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~- 386 (533) +.+++..... ....+.. ++. ...+ + .+..+.+.--.......++.+...+...+|++.-..|.+. T Consensus 373 ~~g~v~~~d~---~~~~pg~-vv~------~~~~-~---~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~ 438 (705) T protein:vir:88 373 LDGQVNLEDL---LTNEAAG-IVR------VKSM-N---SITPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQN 438 (705) T ss_pred cccccCcccc---cccCCCe-eEE------ecCC-C---ccccccCCcCcHHHHHHHHHHHHHHHHhhCCchHHcCCCcc Confidence 7776533211 1111111 110 0111 1 1333332222234566677777788889999988888543 Q ss_pred --CcchhHHHHHHHhhhHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhccCCCC---------------CCceeEEEEeC Q lcl|NC_016654. 387 --EVAQTATEASGKKDLTVKTTRAKARHFG-SALGPLSTTCLRVDAIKFPGKGA---------------APSEELELEWP 448 (533) Q Consensus 387 --~~~~Tatai~~~~~~l~~~~~~~~~~~~-~al~~li~~il~l~~~~~~~~~~---------------~~~~~v~i~f~ 448 (533) ++..||++|....+.--.......+.|. .+++++++.+++|....+..... ....++.++-. T Consensus 439 ~~~~~~Ta~~i~~~~~~~~~r~~~~~r~~a~~~~~~l~~~~~~li~~~~~~~~~~ri~g~~v~v~~~~~~~~~~v~v~v~ 518 (705) T protein:vir:88 439 TLHSNQAAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRERSDLTVTVG 518 (705) T ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCceEEeeccchhccchHhhccCCceEEeec Confidence 3357899988888777776777777774 57788888887765533221100 00112222211 Q ss_pred CCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCC---CCCC Q lcl|NC_016654. 449 KFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTEND---PATD 525 (533) Q Consensus 449 d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~---~~~~ 525 (533) .+ ..+..+....++.+.+ ..+.-.....+.+-.+.....+.++++.+..+.-.+. .....+....... ...+ T Consensus 519 ~~-~~~~eq~~a~l~~ll~--~~q~l~~~~~~~~~~~~~~~~~~~~el~e~~~~k~~~--~~~~~~~~~e~~~~~~~~~q 593 (705) T protein:vir:88 519 IG-NMNKDQQMLHLMRIWE--MAQAVVGGGGLGVLVSEQNLYNILKEVTENAGYKDPD--RFWTNPNSPEALQAKAIREQ 593 (705) T ss_pred cc-cchHHHHHHHHHHHHH--HHHHhhcccchhhhcChHHHHHHHHHHHHhhhhhhHH--HHhhhhhhHHHHHHHHhhhh Confidence 11 1122222222222211 0000000000111112221111111111100000000 0000000000000 0000 Q ss_pred CCCCCCCC Q lcl|NC_016654. 526 PEAVDEGE 533 (533) Q Consensus 526 ~~~~~d~~ 533 (533) .+.+...+ T Consensus 594 ~e~~~~~~ 601 (705) T protein:vir:88 594 KEAQPKPE 601 (705) T ss_pred hhhhHHHH Confidence 00000000 No 107 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.17 E-value=2.9e-10 Score=72.84 Aligned_cols=468 Identities=9% Similarity=-0.020 Sum_probs=192.1 Q ss_pred CCCCCCcCCCcCcch--HHHH------HHHHhhhHhhcCCHHHHHH-------HHhccCcchhhHHHHHHHHHHHHHhcc Q lcl|NC_016654. 1 MSLPEANTAWPPPEL--AAVT------ARVAESHVWWEGDLDKLAT-------FYGAEGRTSPSGIKARTKAAYEAFHGR 65 (533) Q Consensus 1 ~~~~~~~~~~pp~~~--~~~~------~~~~~~~~w~~gd~~~l~~-------~y~~~~~~~~~~~~~~~~~~~~~~~~~ 65 (533) =||-.-+..||=..- .|.+ -.++.. .+.+.-++.+ .++......+......+.++....... T Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~ 113 (862) T protein:vir:99 37 DPLARTRQNWPVQKEKPNPIIRSVKDFPFVEIS---DSVNAKSVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAE 113 (862) T ss_pred chHHhhcccCCcccccCCCCCCccccccccccc---ccccchhhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhcccc Confidence 223333344531110 0000 000000 0111111110 000000000000001111111100000 Q ss_pred -----------------cCCCCCcc--cceeecChHHHHHHHHHHhhcCCCceEeeCCC----chHHHHHHHHHHhhccH Q lcl|NC_016654. 66 -----------------TPTATGRA--PKRYHAPIPGVIAKLSTTELFSEQLKFLDAGK----SKEVQARADLIFNTPRF 122 (533) Q Consensus 66 -----------------~~~~~g~~--~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~----~~~~~~~l~~i~~~n~f 122 (533) +..-.|.+ .......+++.||+..|+-++.+...|.+.++ +++..+.|.+.++.-++ T Consensus 114 ~~~s~y~~~~~~~~~~~~~~f~gyql~alY~~~~larkiVd~pAeDatR~g~~I~~~~d~~e~~~e~~~~ie~~~~rL~v 193 (862) T protein:vir:99 114 GKQSSYAVPEALQDWYLSQGFIGHQACALIAQHWLVDKACSLAGEDAIRNGWHLKSLGEGEEIDEESLEKFKAIDVEFKV 193 (862) T ss_pred ccccccccchhccccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCceEeecCcccccCHHHHHHHHHHHHHhhH Confidence 00000110 12346789999999999999999999987433 22345667777777788 Q ss_pred HHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEE Q lcl|NC_016654. 123 HSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIV 202 (533) Q Consensus 123 ~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~ 202 (533) +..+.+++..+-.+|++++.+.++..- +.-+=+-++++. ...|.+..+..++.+... ++.+. T Consensus 194 ~~~l~eair~~RLyGga~ililv~~~D-~~~LsqPLn~e~----I~kG~lkgl~vlDp~w~~-------------p~~v~ 255 (862) T protein:vir:99 194 KENLIEFNRFKNVFGIRVAIFVVDSED-PDYYEKPFNPDG----ITPGSYRGISQIDPYWMM-------------PMLTA 255 (862) T ss_pred HHHHHHHHHhcccccceEEEEEecCcC-chhhhcCcCccc----ccccceeEEEEechhhhc-------------ccccc Confidence 999999999888899887766554211 100000011110 122333333322211000 00000 Q ss_pred EEEEeccC--CcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhH Q lcl|NC_016654. 203 HAVYKGTA--TSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDL 280 (533) Q Consensus 203 ~~~y~~~~--~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i 280 (533) .+..+. ..+|.+--+ . +.. ..| +.++ .+.|..... .++. .+....+|+|.+... T Consensus 256 --~~~~Dp~sp~yGkP~~y-------~----I~g----~~I-H~SR--liif~g~~v--pd~l-k~ay~f~G~SvLe~i- 311 (862) T protein:vir:99 256 --ESTADPSSQFFYEPEFW-------I----ISG----QKY-HRSH--LIIARGPQP--ADIL-KPTYIFGGIPLVQRI- 311 (862) T ss_pred --cccccccccccCCceee-------e----ecC----eee-ccce--eEEecCCCc--hhhh-hccCCccCccHHHHH- Confidence 000000 011110000 0 000 000 0000 011111111 1111 122346899999864 Q ss_pred HHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCCccccccCcchhhhhhccc--cccccccccccceeeechhhhhH Q lcl|NC_016654. 281 FPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGS--GGFNANGDMETIFEFFQPAIRVL 358 (533) Q Consensus 281 ~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~--~~~~~~~~~~~~i~~~~~~ir~e 358 (533) .+.|...+.+......-+.....+++-- ..+....+... +...-+.+..... +..-.+. ...+++++.+ +. T Consensus 312 yd~L~~~d~t~~saa~Ll~ka~l~v~kt-d~l~~l~~ed~--l~~r~~~~~~~rdN~Gi~liD~--eEe~e~ls~s--lS 384 (862) T protein:vir:99 312 YERVYAAERTANEAPLLAMNKRTTAIHT-DTAKAIANEDK--FIQRLMFWVRYRDNHAVKVLGT--DETMEQFDTS--LA 384 (862) T ss_pred HHHHHHHHHHHHHHHHHHHHhccceeec-hhHhhhccHHH--HHHHHHHHHhccCcceeEEecC--CCceeEEecc--cC Confidence 4666777776655554444444443321 11111111000 0000011111000 0011111 2235555433 33 Q ss_pred HHHHHHHHHHHHHHHhhCCChhh-cccCCCc-chhHHHHHHHhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhccCC Q lcl|NC_016654. 359 EHDQGAALLLREVLRKTGYSPVS-LGLSDEV-AQTATEASGKKDLTVKTTRAK-ARHFGSALGPLSTTCLRVDAIKFPGK 435 (533) Q Consensus 359 ~~~~~l~~~l~~i~~~~g~s~~~-~g~~~~~-~~Tatai~~~~~~l~~~~~~~-~~~~~~al~~li~~il~l~~~~~~~~ 435 (533) .....++.+.++|+..++++... ||-...| .+|+.+=...+ +..+..+ +..++..|..|+.++ .+ .+ + T Consensus 385 GL~dll~~~~q~IAaas~IP~tiLfGqspaGlnATGE~D~~nY---yD~I~s~QE~~L~P~LerL~~li-~~---~l-g- 455 (862) T protein:vir:99 385 DFDAVIMGQYQLVASIAKTPATKLLGTAPKGFNSTGEFETISY---HEELESIQEHVYMPFLQRHYLIS-RL---SL-G- 455 (862) T ss_pred ChHHHHHHHHHHHHhhhCCCceeecccCcccccCchHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHH-HH---hc-C- Confidence 45667788888999999998764 6654333 35666443333 3334444 356778887776543 22 11 1 Q ss_pred CCCCceeEEEEeCCCCCCCHHHHHHH-------HHHHHhCCCCCHHHHHHHh-------CCCCCHHHHHHHHHHHHHhhh Q lcl|NC_016654. 436 GAAPSEELELEWPKFARESDLAKAQT-------VQAWSVASAASTKTKVAYL-------HEDWDDERVQEEADLIDNANT 501 (533) Q Consensus 436 ~~~~~~~v~i~f~d~i~~d~~e~a~~-------~~~l~~aGi~S~et~v~~l-------~~~~~dee~~~El~rI~~E~~ 501 (533) ...+++|.|+.-...+..|+|++ +++++++|++|.+++..++ +...+++++++.- -+..|+. T Consensus 456 ---~~~d~~ieFnpL~~~sekEkAEi~kk~Aea~~~lv~sGvispdEvR~~L~~~~~~g~~~l~ded~E~d~-~~~~e~~ 531 (862) T protein:vir:99 456 ---IQHEIDVVMEPVASMTAQQQADLNKTKAEGGKVLIDGGVISPDEERNRIRDDKRSGYNRLTKEDAEETP-GASPENL 531 (862) T ss_pred ---CCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCcCCCCCCcccccccC-CCCcccc Confidence 12469999999999998888765 6788889999999877764 3335554443110 0111111 Q ss_pred cccCccccccccCCCCCC------CCCCCCCCCCCCCC Q lcl|NC_016654. 502 VSAPTFGFGTDQPPLPTE------NDPATDPEAVDEGE 533 (533) Q Consensus 502 ~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~d~~ 533 (533) .+....+....+.|..+. ...+++.+..+..+ T Consensus 532 ~~~e~~g~a~~~ap~de~~aga~~~~~e~d~~~~p~~~ 569 (862) T protein:vir:99 532 AAYQKAGAAQETASAKETQAGAAVTTAEGDQPNVQMVP 569 (862) T ss_pred cccccCCcccccccccccccccCCccccCCcccccccC Confidence 111111111111111000 00001111111111 No 108 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=99.14 E-value=1.3e-09 Score=69.39 Aligned_cols=491 Identities=8% Similarity=-0.078 Sum_probs=223.2 Q ss_pred hHHHHHHHHhhhHhhcCCHHHHHHHHh----ccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHH Q lcl|NC_016654. 15 LAAVTARVAESHVWWEGDLDKLATFYG----AEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLST 90 (533) Q Consensus 15 ~~~~~~~~~~~~~w~~gd~~~l~~~y~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a 90 (533) +++....+.+.+.||.-+.+.-.+.+. ....|..+||......... ...+..+|+.+.+|+... T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~QW~~~~~~~l~------------~q~rp~~N~i~~~v~~v~ 68 (725) T protein:vir:10 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTT------------LQYRGQFDVVRPVVRKLV 68 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHH------------hcCCCcccchHHHHHHHH Confidence 333444455555555543332222111 1112222333322211111 011235799999999999 Q ss_pred HhhcCCCceEeeCCCc---h----HHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEc---CC-CCCceEEEEE- Q lcl|NC_016654. 91 TELFSEQLKFLDAGKS---K----EVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWD---PT-IADNAWIDFV- 158 (533) Q Consensus 91 ~ll~~e~~~i~~~~~~---~----~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D---~~-~~~~~~i~~v- 158 (533) ++--...+.+.+.+.+ . .++..+..+...++.......+...+.+.|.+|+.+.+| ++ .++.+.|..+ T Consensus 69 g~e~~nr~d~~v~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~ 148 (725) T protein:vir:10 69 SEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREP 148 (725) T ss_pred hhHHhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccCCCCCCCceeeeeee Confidence 8877777777665432 2 234555667778888999999999999999999999755 22 2344555443 Q ss_pred ---cCCeEEEEEe--cCCceEEE--EEEEEee---------------------cC---------CceEEEEEEEecCeeE Q lcl|NC_016654. 159 ---DADRAIPEFR--WGRLVAVT--FWSELAG---------------------GD---------GQEVWRHLERHESGYI 201 (533) Q Consensus 159 ---~~~~~~P~~~--~g~~~~v~--f~~~~~~---------------------~~---------~~~~y~~lE~h~~~~I 201 (533) ++.+++.-+. ...+..+- |..++.. .+ .....+++|+|....+ T Consensus 149 i~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~vrv~E~~~r~~~ 228 (725) T protein:vir:10 149 IHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEK 228 (725) T ss_pred cccCHhHcccCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCCeEEEEEEEEEEEE Confidence 3344442211 11222221 1111100 00 0112356677665544 Q ss_pred EEEEEeccCCcccceeehhhcccccccccccccc--------------------CCceeecCC--CccceeEEecCCccc Q lcl|NC_016654. 202 VHAVYKGTATSLGWMMALTDHPATRDIAVEGADE--------------------GRGAYVETG--VKDLTAAYVPNVTPN 259 (533) Q Consensus 202 ~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~--------------------~~~~~~~~g--~~~~~~~~~pn~~~~ 259 (533) .-.+|...+..-|.-+.+..... ........+. .+....+.. .+.-.+.|+|..... T Consensus 229 ~~~~~~~~d~~~g~~~~~~~~~~-~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~fP~vP~~g~r 307 (725) T protein:vir:10 229 KETAFIYQDPVTGEPVSYFKRDI-KDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEW 307 (725) T ss_pred eeEEEEeccCCCCceeecchhhh-HHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCCCCCCceeEEEEEeee Confidence 44444433322232222111000 0000000000 000000110 111123344432211 Q ss_pred ccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHH-hCcceeeechHHhcCCCCccccccCcchhhhhhcccccc Q lcl|NC_016654. 260 PEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFR-IGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGF 338 (533) Q Consensus 260 ~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~-~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~ 338 (533) . ....++++-+.+.+ ++|..+.+|...|...+.+- .++....+....+.... .....+ +...|...+.... T Consensus 308 ~----~~~g~~~~~G~vr~-~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e-~~~~~~--~~~~~~~~~~~~~ 379 (725) T protein:vir:10 308 G----FVEDKEVYEGVVRL-TKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFE-HMYDGN--DDYPYYLLNRTDE 379 (725) T ss_pred e----ccCCcceeeeeecc-chhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHH-HHHhcc--CCceeeecccccc Confidence 1 11234554455654 67999999999999999984 45555556555543210 000001 1111111110011 Q ss_pred ccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 339 NANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALG 418 (533) Q Consensus 339 ~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~ 418 (533) ..|.-....+..+.+.--..+++..++.....|...+|++...+|-.+ +..||.+|..+..............+..+++ T Consensus 380 ~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~-n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~ 458 (725) T protein:vir:10 380 NNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNG-GQVAYDTVNQLNMRADLETYVFQDNLATAMR 458 (725) T ss_pred cCcccccccCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111112333332222246778888888899999999999998654 4578999999888877777777777788888 Q ss_pred HHHHHHHHHHHhhccCCCC---------C------------------------CceeEEEEeCCCCCCCHHHHHHHHHHH Q lcl|NC_016654. 419 PLSTTCLRVDAIKFPGKGA---------A------------------------PSEELELEWPKFARESDLAKAQTVQAW 465 (533) Q Consensus 419 ~li~~il~l~~~~~~~~~~---------~------------------------~~~~v~i~f~d~i~~d~~e~a~~~~~l 465 (533) .+.+.+|.+-...+..... . ..++|.|+=..+.+.-+.+.+..++++ T Consensus 459 ~~g~~lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~ql 538 (725) T protein:vir:10 459 RDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRSEILEL 538 (725) T ss_pred HHHHHHHHHHHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeeccCcHHHHHHHHHHHHHH Confidence 7777777664432211110 0 012333333222222244555555555 Q ss_pred HhCC--CCCHH-HHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCC----CCCCCCCCCCCCC Q lcl|NC_016654. 466 SVAS--AASTK-TKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTEN----DPATDPEAVDEGE 533 (533) Q Consensus 466 ~~aG--i~S~e-t~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~d~~ 533 (533) ..+- ..+.- ..+..+-+..+-+-+++.+++|++...+... .++....+. +..-......+.+ T Consensus 539 l~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~------~~~~~~e~~q~~~e~qq~~~~q~~~e 607 (725) T protein:vir:10 539 LGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGV------KKPETPEEQQWLVEAQQAKQGQQDPA 607 (725) T ss_pred HHhccccchhHHHHHHHHhhcCCchhHHHHHHHHHhhhhhhcc------CCccccchhHHHHHHHHHHHhhhHHH Confidence 4321 11111 1122212222233355566777665432211 001000000 0000001111111 No 109 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.09 E-value=2.2e-09 Score=68.09 Aligned_cols=458 Identities=10% Similarity=0.023 Sum_probs=188.5 Q ss_pred CCCC-----------------CCcC-CCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHH Q lcl|NC_016654. 1 MSLP-----------------EANT-AWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAF 62 (533) Q Consensus 1 ~~~~-----------------~~~~-~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~ 62 (533) =||| .... ++|.|.. .+.-..+-..|-.+.+.-+-.+...++... ....| T Consensus 35 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~a~ds~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~ 102 (765) T protein:vir:96 35 DPMIKLGKIRGWNVEPEKAPVIRSVKDFLEPGL-----SVAMDSAYGDGPTPAAKAAAGGQNPYVVPT-------MLQDW 102 (765) T ss_pred ccchhHHHHhhcccccccCCCCCCCCcccCccc-----ceeccccccccccchHHHhhhccCccchhh-------HHHhh Confidence 1111 1111 2222210 011111111111111111111111100000 00011 Q ss_pred hcccCCCCCc--ccceeecChHHHHHHHHHHhhcCCCceEeeCCC--chHHHHHHHHHHhhccHHHHHHHHHHHHhhhCC Q lcl|NC_016654. 63 HGRTPTATGR--APKRYHAPIPGVIAKLSTTELFSEQLKFLDAGK--SKEVQARADLIFNTPRFHSSLVEAGESCSALSG 138 (533) Q Consensus 63 ~~~~~~~~g~--~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~--~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~ 138 (533) +.. ....|. -.-+.+..+++.||+..|.-++.+...|++.++ .+...+.|++.++.=+++..+.++++.+-.+|+ T Consensus 103 ~~~-~~f~gyql~alY~~~~l~rkiVd~pAeDa~R~g~~I~~~~~e~~~~~~~~l~~~~~rl~v~~~l~ea~~~~RlyGg 181 (765) T protein:vir:96 103 YNS-QGFIGYQACAIISQHWLVDKACSMSGEDAARNGWELKSDGRKLSDEQSALIARRDMEFRVKDNLVELNRFKNVFGV 181 (765) T ss_pred hcc-cCCccHHHHHHHHhCchhhhhhhcchHHhhcCCceeecCccccCHHHHHHHHHHHHHhhHHHHHHHHHHHhhhcee Confidence 100 000010 111346679999999999999999988877432 233445677666666889999999999999999 Q ss_pred EEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccC--Ccccce Q lcl|NC_016654. 139 SFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTA--TSLGWM 216 (533) Q Consensus 139 ~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~--~~lG~~ 216 (533) +++.+-++..- +.-.=.-++.+. ...|.++.+..++.+.. .++.+.+ +..+. ..+|.+ T Consensus 182 a~i~i~i~~~D-~~~l~~PL~~~~----I~kg~~kgl~vldp~~~-------------~~~~v~e--~~~Dp~sp~fg~P 241 (765) T protein:vir:96 182 RIALFVVESDD-PDYYEKPFNPDG----IAPGSYKGISQIDPYWA-------------MPQLTAE--STADPSAEHFYEP 241 (765) T ss_pred eEEEEEecccC-cchhhccccccc----cccceeeEEEEechhhc-------------ccccchh--ccccccccccCcc Confidence 98876554211 100000011111 01222332222211000 0000000 00000 011110 Q ss_pred eehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 217 MALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMR 296 (533) Q Consensus 217 v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~ 296 (533) --+ . +. . ..| +.++ .+.|.... ..++ ..+....+|+|.+... .+.|..++.+...... T Consensus 242 ~~y-------~----i~---g-~~I-H~SR--li~~~g~~--lpd~-lk~~~~~~G~Svlq~~-yd~I~~~~~t~~~~a~ 299 (765) T protein:vir:96 242 DFW-------I----IS---G-KKY-HRSH--LVVVRGPQ--PPDI-LKPTYIFGGIPLTQRI-YERVYAAERTANEAPL 299 (765) T ss_pred eee-------e----ec---C-cee-ccce--EEEecCCC--chhh-hccccCccCccHHHHH-HHHHHHHHHHHHHHHH Confidence 000 0 00 0 000 0111 11111111 1111 1223345799999874 4666777776655555 Q ss_pred HHHhCcceeeechHHhcCCCCccccccCcchhhhhhcc-c-cccccccccccceeeechhhhhHHHHHHHHHHHHHHHHh Q lcl|NC_016654. 297 DFRIGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVG-S-GGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRK 374 (533) Q Consensus 297 ~~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~-~-~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~ 374 (533) -+.....+++-- .++....+... +...-..+.... . +..-.+. ...++.++-+ +......++.++++|+.. T Consensus 300 Ll~k~~~~v~k~-~~~~~l~~~~~--l~~r~~~~~~~r~n~g~~~id~--ee~~e~~s~~--lsgl~d~l~~~~~~iAaa 372 (765) T protein:vir:96 300 LAMSKRTSTIHV-DVEKAIANEDA--FNARLAFWIANRDNHGVKVIGI--DETMEQFDTN--LSDFDSVIMNQYQLVAAI 372 (765) T ss_pred HHHHhccceeee-chHhhhccHHH--HHHHHHHHHHhcCCceeEEecC--CcceeEEecc--cCCHHHHHHHHHHHHHhh Confidence 454444444321 11211111111 000000111110 0 0011111 2335555433 345567788888999999 Q ss_pred hCCChh-hcccCCCc-chhHHHHHHHhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCC Q lcl|NC_016654. 375 TGYSPV-SLGLSDEV-AQTATEASGKKDLTVKTTRAKA-RHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFA 451 (533) Q Consensus 375 ~g~s~~-~~g~~~~~-~~Tatai~~~~~~l~~~~~~~~-~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i 451 (533) ++++.. -||-..+| ..|+..=...+ +..+..+| ..++..|..|+..+++- +. . ..+++|.|++-. T Consensus 373 s~IP~t~LfGqsp~GlnATGe~D~~nY---yD~I~s~Qe~~l~p~le~L~~li~~s------~~-i--~~d~~i~FnpL~ 440 (765) T protein:vir:96 373 AKTPATKLLGTSPKGFNATGEHETISY---HEELESIQEHIFDPLLERHYLLLAKS------ES-I--DVQLEIVWNPVD 440 (765) T ss_pred hCCCeeeeccCCcccccCcchHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHh------cC-C--CCcceEEeCCCC Confidence 999863 34543233 35665433333 33444444 66788888888876542 11 1 236999999999 Q ss_pred CCCHHHHHHH-------HHHHHhCCCCCHHHHHHHhCC-------CCCHHHHHHHHHHHHHhhhcccCccccccccC--- Q lcl|NC_016654. 452 RESDLAKAQT-------VQAWSVASAASTKTKVAYLHE-------DWDDERVQEEADLIDNANTVSAPTFGFGTDQP--- 514 (533) Q Consensus 452 ~~d~~e~a~~-------~~~l~~aGi~S~et~v~~l~~-------~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~--- 514 (533) ..+..|+|++ +++++.+|++|.+++..++.. ..++++.+.+- -+..|...+.+..+...... T Consensus 441 ~~sekEkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~~-~~~pe~~~~~~~~~~~~~~~~~e 519 (765) T protein:vir:96 441 STTSQQQAELNNKKAATDEIYINSGVVSPDEVRERLRDDPRSGYNRLTDDQAETEP-GMSPENLAELEKAGAQSAKAKGE 519 (765) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhccccCCCCCCCcccccccc-CCCccccccccCCCcccccccCc Confidence 9988877664 777888999998887776521 23443322110 01111111011000000000 Q ss_pred -----CCCCCCCCCCCCCC--CCCCC Q lcl|NC_016654. 515 -----PLPTENDPATDPEA--VDEGE 533 (533) Q Consensus 515 -----~~~~~~~~~~~~~~--~~d~~ 533 (533) +.+...++++++.. ..+.+ T Consensus 520 ~~~~~a~p~~~eg~~~~~~~~p~~~~ 545 (765) T protein:vir:96 520 AERAEAQAGAVEGAGDPVPAAPRGTK 545 (765) T ss_pred cccccCCCCccCCCCcccccCCcccC Confidence 00000000000000 00000 No 110 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.06 E-value=3.2e-09 Score=67.14 Aligned_cols=449 Identities=13% Similarity=0.052 Sum_probs=198.8 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCC-HHHHHHHHhccCcchhh----HHHHHHHHHHHHHhcccCCCCCcccc Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGD-LDKLATFYGAEGRTSPS----GIKARTKAAYEAFHGRTPTATGRAPK 75 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd-~~~l~~~y~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~g~~~~ 75 (533) |.+=+.| +.+|.....- ...+.-|.|- ..+. ........++ ..+..+..+ +| .- T Consensus 1 m~~~~~~--~~a~~~~~~~---~~~~~~y~aa~~~~~---~~~~~~~s~d~~~~~~~~~lr~R-aR------------dl 59 (495) T protein:vir:10 1 MNMTPSG--YQSLASGLLV---PVGASAYEGASGGHR---WQDIGDYGPDTAVASGIQTLRAR-SH------------HN 59 (495) T ss_pred CCccccc--ccccchhhhh---HHHhhhhhccccCcc---cCCCCCCChhHHHHHHHHHHHHH-HH------------HH Confidence 6554433 2222211111 1111222221 0000 0000000000 001111111 11 11 Q ss_pred eeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHH----h------hccHHHHHHHHHHHHhhhCCEEEEEEE Q lcl|NC_016654. 76 RYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIF----N------TPRFHSSLVEAGESCSALSGSFQRIVW 145 (533) Q Consensus 76 ~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~----~------~n~f~~~~~~~~~~~~~~G~~~~~~~~ 145 (533) ....++++-+++.+++.+.|...++....+++.+++.|+..+ + ..+|......++...+..|.+++++.+ T Consensus 60 ~rNn~~a~~av~~~~~~vVG~Gi~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~ 139 (495) T protein:vir:10 60 VRNNPWATNAVATWVAAAVGNGLTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKP 139 (495) T ss_pred HhcChHHHHHHHHHHHhhcCCCcccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEee Confidence 235579999999999999998877766656666666665544 2 235777777777888899999998888 Q ss_pred cCCCCC---ceEEEEEcCCeEE-EEEecCCceEEEEEEEEeecCCceEEEEEEEecCe-eEEEEEEeccCCcccceeehh Q lcl|NC_016654. 146 DPTIAD---NAWIDFVDADRAI-PEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESG-YIVHAVYKGTATSLGWMMALT 220 (533) Q Consensus 146 D~~~~~---~~~i~~v~~~~~~-P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~-~I~~~~y~~~~~~lG~~v~l~ 220 (533) ++...+ ..++..++|+.+- |.-.... .++..+..-+|+-..| .+-|.+++..+.... ... T Consensus 140 ~~~~~g~~~~~~lqliepd~l~~~~~~~~~------------~~g~~i~~GIe~d~~Gr~vaY~i~~~hpgd~~---~~~ 204 (495) T protein:vir:10 140 RPLSEGLSVPLQLQIIEPDMLASDIPDETL------------PSGGYVKGGIRFSNGGKRKAYCFYRNHPAESS---LIG 204 (495) T ss_pred cccCCCCccceEEEEechhhcCCCCCCCCC------------CCCCEEEeceEECCCCceEEEEEeecCCCccc---ccc Confidence 654222 3688999998863 4311100 0011111122222211 222333333222100 000 Q ss_pred hccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_016654. 221 DHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRI 300 (533) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~ 300 (533) .-.++. .| +..-+.|+-... + ....|+|.|+. |.. +..+|..-+.-...-+. T Consensus 205 ~~~~~~-------------rv----pA~~vlH~f~~r-------~--gQ~RGis~la~-i~~-l~~l~~y~dael~~a~i 256 (495) T protein:vir:10 205 DPVDTV-------------WI----KAEHVLHVTVLT-------V--RSDAGAPWFQL-LLR-LNELDQYEDAELVRKKT 256 (495) T ss_pred ccccee-------------ee----chhheEeccccC-------C--CcccCcchhHH-HHH-HHHhhHHHHHHHHHHHH Confidence 000000 00 100112221111 1 13458998875 654 45555443322221121 Q ss_pred -CcceeeechHHhcCCCC-cc-ccc-----cCcchh---hhhhccccccccccccccceeeechhhhhHHHHHHHHHHHH Q lcl|NC_016654. 301 -GAGKVHASESVLTNLGM-GQ-GVS-----LDEEQE---VYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLR 369 (533) Q Consensus 301 -~~~~i~v~~~~l~~~~~-~~-~~~-----~d~~~~---~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~ 369 (533) +--..|| +...+ .. +.. -+.... -+.+..+.....| ..++.++|.-...+|..-+..+++ T Consensus 257 ~A~~~~fi-----~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pG----e~i~~~~p~~p~~~~~~f~~~~lr 327 (495) T protein:vir:10 257 AALFAAFI-----QEATADSTGGPTIGQPKRSKGGKRITGLNPGTLQYLQPG----QEVKFSNPADVGTTYEPWLRYQLL 327 (495) T ss_pred hhhheeee-----ecCCCccccccccCccccccCcccceecCCceeeecCCC----CeeeeeCCCCCCCCHHHHHHHHHH Confidence 2222232 11111 00 000 000000 1112122222222 237778877666777777888899 Q ss_pred HHHHhhCCChhhcccCCCcc--hhHHHHHHHhhhHHHHHHHHH-HHHHHHH-HHHHHHHHHHHHhhccCCCCCCc----- Q lcl|NC_016654. 370 EVLRKTGYSPVSLGLSDEVA--QTATEASGKKDLTVKTTRAKA-RHFGSAL-GPLSTTCLRVDAIKFPGKGAAPS----- 440 (533) Q Consensus 370 ~i~~~~g~s~~~~g~~~~~~--~Tatai~~~~~~l~~~~~~~~-~~~~~al-~~li~~il~l~~~~~~~~~~~~~----- 440 (533) .|....|+|++.+.-|.+++ .|+.+-.....+ .+...+ +.+-..+ +.+.+.. |....+.|....+. T Consensus 328 ~iaaglGi~Ye~ltgD~s~~nYSS~R~~~~e~~r---~~~~~q~~~~~~~~~~pi~~~~--l~~a~l~G~i~~p~~~~~~ 402 (495) T protein:vir:10 328 SIAKGYGITYEMLTGDLRGVNYSSIRAGLLEFRR---LCQQVQHHMIIHQFCRPVGRWF--MDFAVASGAVVIPDYLQRR 402 (495) T ss_pred HHHhhcCCCHHHHhcccccccHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHH--HHHHHHcCCCCCCCchhhh Confidence 99999999999997665442 234333333333 333322 2222222 2222222 22233333332221 Q ss_pred -eeEEEEe--CCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCC- Q lcl|NC_016654. 441 -EELELEW--PKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPL- 516 (533) Q Consensus 441 -~~v~i~f--~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~- 516 (533) .-+.+.| +.....|+..++++...++.+|++|+++.+++. +.|-+++.+|++ .|...... .+...+..|. T Consensus 403 ~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~--G~D~~~v~~q~a---~e~~~~~~-~Gl~~~~~p~~ 476 (495) T protein:vir:10 403 RYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAER--GYDMEELFDMIS---DANQLIDE-YDLRLDSDPRY 476 (495) T ss_pred HhhhccccccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHc--CCCHHHHHHHHH---HHHHHHHH-cCCCCCCCCCc Confidence 1245666 334456999999999999999999999999985 356555554443 33321110 1111111000 Q ss_pred --CCCCCCCCCCCCCCCCC Q lcl|NC_016654. 517 --PTENDPATDPEAVDEGE 533 (533) Q Consensus 517 --~~~~~~~~~~~~~~d~~ 533 (533) ..+...+..++..+..| T Consensus 477 ~~~~~~~~~~~~~~~~~~e 495 (495) T protein:vir:10 477 VNGSGAEQKSVMEAALNNE 495 (495) T ss_pred CCCccCCCCCCCCCCCCCC Confidence 01111111112222222 No 111 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=99.02 E-value=1.6e-09 Score=68.78 Aligned_cols=473 Identities=9% Similarity=-0.002 Sum_probs=195.1 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhh--HhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceee Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESH--VWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYH 78 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~--~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 78 (533) |+--.. .....+.+++ .|..-...+.+ .........|+.++..+...-....-..+-..++++. T Consensus 1 ~~~~~~----------~~~~~~~~~~~~~~v~~~~~~~~----~~r~~~~~~w~el~~y~~a~~~~~~~~~~~~~r~~~~ 66 (584) T protein:vir:95 1 MSVKVA----------ELNSLLVRDSSAQWVAYLWDRFN----NQRRQKIEEWKELRNYVFATDTTTTSNQGLPWKNSTT 66 (584) T ss_pred CCcchh----------hhhhhccccchHHHHHHHHHHHH----hhhchhhccCHHHHHHHHhhhhhhhhhcccccccccc Confidence 221110 0111111111 11110000111 1111122223222221111111111111122355777 Q ss_pred cChHHHHHHHHHHhh----cCCCceEeeC---CCc--hHHHHHHHHH----HhhccHHHHHHHHHHHHhhhCCEEEEEEE Q lcl|NC_016654. 79 APIPGVIAKLSTTEL----FSEQLKFLDA---GKS--KEVQARADLI----FNTPRFHSSLVEAGESCSALSGSFQRIVW 145 (533) Q Consensus 79 ~n~~k~i~~~~a~ll----~~e~~~i~~~---~~~--~~~~~~l~~i----~~~n~f~~~~~~~~~~~~~~G~~~~~~~~ 145 (533) +|-...+++.+.+.| |+..-.+... +.+ ...++.++.+ +.+.+|...+.+.+.++..+|.+++|++| T Consensus 67 ~~k~~~~~~~i~~~l~~~~Fp~~~w~~~v~~~~~~~~~~~~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~~ 146 (584) T protein:vir:95 67 LPKLCQIRDNLHSNYFSSLFPNDDWLRWVGYGKGDSTKTKAKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVSF 146 (584) T ss_pred hhHHHHHHHHHHHHHHHhhcCccceeeeecCCCchhhHHHHHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEeE Confidence 777777777666554 4433222221 111 1224555555 47779999999999999999999999999 Q ss_pred cCCC-----------CCceEEEEEcCCeEEEEEecCCceEEEEEEE--Eee--------cCC------------------ Q lcl|NC_016654. 146 DPTI-----------ADNAWIDFVDADRAIPEFRWGRLVAVTFWSE--LAG--------GDG------------------ 186 (533) Q Consensus 146 D~~~-----------~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~--~~~--------~~~------------------ 186 (533) .... ..+++|+.++|..+||--+-+.+..+.|+.+ ++. .++ T Consensus 147 ~~~~~e~~e~~~v~~~~~prieriSP~d~~~Dpsa~~i~d~~fivrs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~ 226 (584) T protein:vir:95 147 EAKYKEMTDGTLVPDYIGPRLVRISPLDIVFNPLATSISDTFKIVRSVKTKGELMRLAQDEPEQSYWLEALKRREEICRH 226 (584) T ss_pred eecceeeeccccccccccceEEeeChhheeecCCCCCccchhhhhhhhhhHHHHHHHHhhcCccccchHHHHHHHHhccC Confidence 7542 1258899999988885333344555443321 100 000 Q ss_pred -----------------ceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeec-----CC Q lcl|NC_016654. 187 -----------------QEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVE-----TG 244 (533) Q Consensus 187 -----------------~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~-----~g 244 (533) ...+-+-|.|++++|+=..|.+. -.|.+ .-+.+.+....+. .+...+- +. T Consensus 227 ~~~~~~~~~~~~~~~~~d~~~~~~ey~~~~~V~vl~~~g~--~~~~~----~~e~~~~~iv~v~--~g~~iIR~~~np~~ 298 (584) T protein:vir:95 227 LGGYSVEDFDKAAGFDVDGFGNLYEYYMSDWVEILEFYGD--YHDKE----TGELQTNRIITVV--DRSTEVRNESIPTW 298 (584) T ss_pred CCCCcccccccccccccccccccccccCCceeEEEeeccc--ccccc----cCCCcccceEEEE--eccEEEEeeecCCC Confidence 00011223333333332222110 00000 0000000000010 1111121 11 Q ss_pred -CccceeE--EecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCCcccc Q lcl|NC_016654. 245 -VKDLTAA--YVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGMGQGV 321 (533) Q Consensus 245 -~~~~~~~--~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~~~ 321 (533) ...||+. |+|.. -+.||.|+... +.++.+.+|.+.-++.+.+...-.. ++..++..... T Consensus 299 ~~~~PF~~~~~~p~~-----------~s~yG~gi~~l-l~d~Q~~lna~~r~~iDnl~l~~~p--v~k~~~~~~~~---- 360 (584) T protein:vir:95 299 FGSAPIYHVGWRFRP-----------DNLWAMGPLDN-LVGMQYRIDHLENAKADAVDLIIQP--PLKIIGEVEEF---- 360 (584) T ss_pred CCCCCEEEEcceeee-----------ccccCCCchhh-hhhHHHHHhHHHHHHHHHHHHhcCc--ceeeccccchh---- Confidence 1224433 33322 25689999887 5699999999999999988653332 12222221110 Q ss_pred ccCcchhhhhhccccccccccccccceeeechh-hhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhh Q lcl|NC_016654. 322 SLDEEQEVYSRVGSGGFNANGDMETIFEFFQPA-IRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKD 400 (533) Q Consensus 322 ~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~-ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~ 400 (533) .+.+.. .| ..+. ...++.+.|. ..+-+-...|+.+...+...+|+|+..-|.+..+.+||+.+....+ T Consensus 361 ~~~pg~-~~--------~~~~--~~~~q~~~p~a~~~~s~~~~lq~~e~~me~~sGvp~~~~G~~~~~~~TAtg~s~l~n 429 (584) T protein:vir:95 361 VWGPGA-EI--------HLDQ--GGDVQEIAKNVNYIINADNQIQMLEDRMELYAGAPREAMGIRTPGEKTAFEVQQLGN 429 (584) T ss_pred cccCCc-ee--------ecCC--CCCcceecCchhhhhHHHHHHHHHHHHHHhhhCCChhhcccccchhhhHHHHHHHHH Confidence 011110 01 1111 1123444442 1222233446667778889999999999988888999999988888 Q ss_pred hHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhccCCCCC---------------CceeEEEEeCC--CCCCCHHHHHHHH Q lcl|NC_016654. 401 LTVKTTRAKARHFGSAL-GPLSTTCLRVDAIKFPGKGAA---------------PSEELELEWPK--FARESDLAKAQTV 462 (533) Q Consensus 401 ~l~~~~~~~~~~~~~al-~~li~~il~l~~~~~~~~~~~---------------~~~~v~i~f~d--~i~~d~~e~a~~~ 462 (533) +.-.-+..+.+.|...| ++++.++..+....+...... ...+++-+|.= ....-..++++.. T Consensus 430 aa~~~~r~~~~~f~~~ll~~l~~ll~~~~~~nmd~~~~vr~~n~e~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~ 509 (584) T protein:vir:95 430 AAGRIFQEKVTTFEVELLEPVLNAMLETATRNMDGSDVIRVMDTDLGVKEFMSVTREDITANGKIRPIGARHFGKQAQDL 509 (584) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCceeeeccccccccccccChhhhccCeeEEeehhhHHHHHHHHH Confidence 88877788888887776 788887776643321111000 00112212110 0011112223322 Q ss_pred HHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCC--------CCC----CCCCCCC Q lcl|NC_016654. 463 QAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTEN--------DPA----TDPEAVD 530 (533) Q Consensus 463 ~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~--------~~~----~~~~~~~ 530 (533) +.+.+ ++-. ++-..+.|... ..++.++-++.... |..... ......+++ .++ -+.+-.. T Consensus 510 q~l~~--ilq~-~~~~~i~p~~~----~~~l~~~ladl~~~-p~~~~~-~~~~~~~~Q~~~q~~~~~~q~~~~~~~~~~~ 580 (584) T protein:vir:95 510 QNLVG--IFNS-QIGQMILPHTS----GKALATFVDDVTGL-QGYEIF-RPNVAVAEQAETQSLVAQAQEDLQLQAQMPA 580 (584) T ss_pred HHHHH--HHHh-hhhhhccccch----HHHHHHHHHHHhCC-Cccccc-CCCcccchhHHHHhhhHHHHHHHHHHHhhhh Confidence 22211 1100 11111111111 12222222111100 000000 000000000 000 0000000 Q ss_pred CCC Q lcl|NC_016654. 531 EGE 533 (533) Q Consensus 531 d~~ 533 (533) +|- T Consensus 581 ~~~ 583 (584) T protein:vir:95 581 EGA 583 (584) T ss_pred ccC Confidence 000 No 112 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=98.99 E-value=2.7e-09 Score=67.56 Aligned_cols=469 Identities=13% Similarity=0.086 Sum_probs=184.9 Q ss_pred CCCCCCcC-----CCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccc Q lcl|NC_016654. 1 MSLPEANT-----AWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPK 75 (533) Q Consensus 1 ~~~~~~~~-----~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 75 (533) .|+|+|-. -|+=.+. .+.+..+-.||..-. ....++....+.-++..--...+.+.| +. T Consensus 9 ~~~~~~~~~~~~~~~~~~~~---~~~l~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~g--rs 72 (763) T protein:vir:95 9 VPLPDPSQATKLTSWKNELS---LQALKADLDAAKPSH-----------TAMMIKVKEWNDLMRIEGKAKPPKVKG--RS 72 (763) T ss_pred CCCccccchhcCCCCCChHH---HHHHHHHHHhhhcch-----------hHHHHHHHHHHHhhhccccCcccccCC--Cc Confidence 67776653 4554432 223333322222211 111111111111111110011223333 33 Q ss_pred eeecChHHHHHHH----HHHhhcCCCceEeeCCC---ch----HHHHHHHH-HHhhccHHHHHHHHHHHHhhhCCEEEEE Q lcl|NC_016654. 76 RYHAPIPGVIAKL----STTELFSEQLKFLDAGK---SK----EVQARADL-IFNTPRFHSSLVEAGESCSALSGSFQRI 143 (533) Q Consensus 76 ~~~~n~~k~i~~~----~a~ll~~e~~~i~~~~~---~~----~~~~~l~~-i~~~n~f~~~~~~~~~~~~~~G~~~~~~ 143 (533) +++.+.-...++. +.+.+++-+..+.+.+. |. -...+++- +...|+=...+..++..|+..|.+++|+ T Consensus 73 ~vv~~~v~~~ve~~~~~l~~~f~~~~~~~~~~P~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k~ 152 (763) T protein:vir:95 73 QVQPKLVRRQAEWRYSALTEPFLGSNKLFKVTPVTWEDVQGARQNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVRV 152 (763) T ss_pred cccCHHHHHHHHHHHHHHHHhhcCCCcEEEEecCCcchHHHHHHHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEEE Confidence 5566655544443 33444555555555432 11 22345555 4455555677889999999999999999 Q ss_pred EEcCCC-------------------------------------------------------------------------- Q lcl|NC_016654. 144 VWDPTI-------------------------------------------------------------------------- 149 (533) Q Consensus 144 ~~D~~~-------------------------------------------------------------------------- 149 (533) |||... T Consensus 153 ~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (763) T protein:vir:95 153 GWNREIRKEKQEVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEV 232 (763) T ss_pred eeeeeeeeeeeeehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEE Confidence 996210 Q ss_pred --CCceEEEEEcCCeEEEEEe-cCCceEEEEE--EEE-eecC---CceEEEEEE------------------------Ee Q lcl|NC_016654. 150 --ADNAWIDFVDADRAIPEFR-WGRLVAVTFW--SEL-AGGD---GQEVWRHLE------------------------RH 196 (533) Q Consensus 150 --~~~~~i~~v~~~~~~P~~~-~g~~~~v~f~--~~~-~~~~---~~~~y~~lE------------------------~h 196 (533) .++|+|+.|+|..|++--+ .+++..+-|+ +.+ +..+ ....|..++ .+ T Consensus 233 ~~k~~p~ie~V~p~d~~iDp~a~sD~~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (763) T protein:vir:95 233 PLANHPTVEMLNPENIIIDPSCQGDINKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQIS 312 (763) T ss_pred EecCceEEEeecHHHheecCCCCCchhhCceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccchhhccCC Confidence 1355777777777765321 1233333332 111 1000 000010000 00 Q ss_pred c---CeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCC------CccceeEEecCCccccccccccc Q lcl|NC_016654. 197 E---SGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETG------VKDLTAAYVPNVTPNPEWRHDPK 267 (533) Q Consensus 197 ~---~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g------~~~~~~~~~pn~~~~~~~~~~~~ 267 (533) + .-...|+.|...+- .|..+ .+ ... +.- .+...+..+ ...||+.+.+... . T Consensus 313 d~~~~~V~v~E~y~~~d~-~gdg~-----~~--~~~--v~~-~g~~iL~~~~~p~~~~~~PFv~~~~~p~---------~ 372 (763) T protein:vir:95 313 DPMRKRVVAYEYWGFWDI-EGNGV-----LE--PIV--ATW-IGSTLIRLEKNPYPDGKLPFVLIPYMPV---------K 372 (763) T ss_pred CcccceEEEEEeeeeecc-CCcce-----eE--EEE--EEE-EcCeeeecccccccCCCcCEEEecceee---------c Confidence 0 00111222221100 00000 00 000 000 000111111 1134443322211 1 Q ss_pred ccccccchhhhhHHHHHHHHHHHHHHHHHHHH-hCcceeeechHHhcCCCCccccccCcchhhhhhcccccccccccccc Q lcl|NC_016654. 268 LRYLGRADLSTDLFPTFHELDRIYSSLMRDFR-IGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMET 346 (533) Q Consensus 268 ~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~-~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 346 (533) .+.+|.|++.. +.++.+.+|..+++..+.+. .++.++.|+.+.+..... ..+.+. .+.. + ..++.... T Consensus 373 ~~~~G~gi~~~-~~d~Qr~~N~~~~~~~d~l~~~~~~~~~v~~gav~~~d~---~~~~pg-~v~~-v-----~~g~~~~~ 441 (763) T protein:vir:95 373 RDMYGEPDAEL-LGDNQAVLGAVMRGMIDLLGRSANGQRGMPKGMLDALNS---RRYREG-EDYE-Y-----NPTQNPAQ 441 (763) T ss_pred CcccCCchHHH-hhHHHHHHHHHHHHHHHHHHhhcCCcEEeecccccchhh---hcccCC-ceEE-e-----eCCCChhh Confidence 25689999987 67999999999999999985 567788888766643211 001111 1110 0 01111111 Q ss_pred ceeeec-hhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCC-cchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 347 IFEFFQ-PAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDE-VAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTC 424 (533) Q Consensus 347 ~i~~~~-~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~-~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~i 424 (533) .+...+ +.+ -......+..+-..+...+|++....|.+.. ...||+++....+.--.....+.+.|..+++.+++.+ T Consensus 442 ~~~~~~~p~~-~~~~~~~l~~~~~~~e~~TGv~~~~~G~~~~~~~~tat~v~~l~qa~~~~~~~~~r~~~~~~k~l~~~~ 520 (763) T protein:vir:95 442 MIIEHKFPEL-PQSALTMATLQNQEAESLTGVKAFAGGVTGESYGDVAAGIRGVLDAASKREMAILRRLAKGMSEIGNKI 520 (763) T ss_pred hcccccCCCC-cchHHHHHHHHHHHHHHhhCcchhhcCcCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111 112 1133444444445667778888777775432 2357777766665555555666777777888888888 Q ss_pred HHHHHhhccCCCC--------C--------CceeEEEEeCCCCCCCH-HHHHHHHHHHH-hCC-CCCHHHHHHHhCCCC- Q lcl|NC_016654. 425 LRVDAIKFPGKGA--------A--------PSEELELEWPKFARESD-LAKAQTVQAWS-VAS-AASTKTKVAYLHEDW- 484 (533) Q Consensus 425 l~l~~~~~~~~~~--------~--------~~~~v~i~f~d~i~~d~-~e~a~~~~~l~-~aG-i~S~et~v~~l~~~~- 484 (533) +.+....+..... . ...+|+|.-. +.+. .+.+..++.+. ..| .+.... ...+.... T Consensus 521 l~Li~q~~d~~rviRI~g~e~v~v~~~~~~~~~DV~V~~~---~as~~~q~~~~l~~ll~~l~~~~~~~~-~~~il~~~~ 596 (763) T protein:vir:95 521 IAMNAVFLAEHEVVRITNEEFVTIKREDLKGNFDLEVDIS---TAEVDNQKSQDLGFMLQTIGPNVDQQI-TLNILAEIA 596 (763) T ss_pred HHHHHhhCCCCcEEEEeCCccccccHHHhcCCcceEEecc---cchHHHHHHHHHHHHHHHhccccChHH-HHHHHHHHH Confidence 8775533221100 0 0112222211 1111 12222222211 111 111110 00000000 Q ss_pred CHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 485 DDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 485 ~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +-....+.++.++.-+...++. ......-.....+.+ T Consensus 597 d~~~~~~~~~~lr~~q~~~d~~------------~q~qaqle~~~~q~e 633 (763) T protein:vir:95 597 DLKRMPKLAHDLRTWQPQPDPV------------QEQLKQLAVEKAQLE 633 (763) T ss_pred hhhchhhhHHHHHhcCCCccch------------hhhHHHHHHHHHHHH Confidence 0000000111111111000000 000000000001111 No 113 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=98.65 E-value=1.4e-07 Score=58.11 Aligned_cols=494 Identities=8% Similarity=-0.045 Sum_probs=214.3 Q ss_pred hHHHHHHH-HhhhHhhcCCHHHHHHHHhcc----Ccc--hhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHH Q lcl|NC_016654. 15 LAAVTARV-AESHVWWEGDLDKLATFYGAE----GRT--SPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAK 87 (533) Q Consensus 15 ~~~~~~~~-~~~~~w~~gd~~~l~~~y~~~----~~~--~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~ 87 (533) ++.-...+ ++-+.||.-+.+.-...++.. ..+ ..+||.......... +. ....+..++.|+.+.+|+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~----~~--q~~grP~~~~N~i~~~v~ 74 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKL----DE--QFEKYPKFEINKVATELN 74 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHH----hh--hhcCCCceEEcchHHHHH Confidence 44332222 333334433222111111100 000 011222211111000 00 001133689999999999 Q ss_pred HHHHhhcCCCceEeeCCC----ch----HHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCC-------CCCc Q lcl|NC_016654. 88 LSTTELFSEQLKFLDAGK----SK----EVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPT-------IADN 152 (533) Q Consensus 88 ~~a~ll~~e~~~i~~~~~----~~----~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~-------~~~~ 152 (533) ...++--...+.+.+.+. +. .++..+..+.+.++.......+...+.+.|-+|++++.|-. ...+ T Consensus 75 ~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~e~d~~~~~~~ 154 (708) T protein:vir:10 75 RIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQR 154 (708) T ss_pred HHHHHHHhCCcceEEEcCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhcccceeeeeeccccccCCCCCccc Confidence 999988887777776533 12 23455566778889999999999999999999999876521 1234 Q ss_pred eEEEEEc-C-CeEEE--EEecCCceEEEE--EEEE----------eec-----------C------CceEEEEEEEecCe Q lcl|NC_016654. 153 AWIDFVD-A-DRAIP--EFRWGRLVAVTF--WSEL----------AGG-----------D------GQEVWRHLERHESG 199 (533) Q Consensus 153 ~~i~~v~-~-~~~~P--~~~~g~~~~v~f--~~~~----------~~~-----------~------~~~~y~~lE~h~~~ 199 (533) +.|..+. | ..++. .-..-.++++-| ...+ .+. + .....++.|+|+.- T Consensus 155 i~i~~~~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~~~~d~v~v~ey~~r~ 234 (708) T protein:vir:10 155 IAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVR 234 (708) T ss_pred cceEEeecchhhcccCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccccCCCceEEEEeeeEE Confidence 5554442 2 34442 211123333321 1111 000 0 00123445655433 Q ss_pred eEEEEEEeccCCcccceeehhhcc--------cccccccc----cc-------ccCCceeec--CCCccceeEEecCCcc Q lcl|NC_016654. 200 YIVHAVYKGTATSLGWMMALTDHP--------ATRDIAVE----GA-------DEGRGAYVE--TGVKDLTAAYVPNVTP 258 (533) Q Consensus 200 ~I~~~~y~~~~~~lG~~v~l~~~~--------~~~~~~~~----~~-------~~~~~~~~~--~g~~~~~~~~~pn~~~ 258 (533) .+.-.++...+...|.-+.+.... +..+.... +. ...+....+ ...+.-.+.++|.... T Consensus 235 ~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~~~p~~~fP~vP~~g~ 314 (708) T protein:vir:10 235 KESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGK 314 (708) T ss_pred EEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhhccCCCCCCCceeeEEEeee Confidence 222222221111111111111000 00000000 00 000000011 1111122334443221 Q ss_pred cccccccccccc--cccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeee-chHHhcCCCCccccccCcchhhhhhccc Q lcl|NC_016654. 259 NPEWRHDPKLRY--LGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHA-SESVLTNLGMGQGVSLDEEQEVYSRVGS 335 (533) Q Consensus 259 ~~~~~~~~~~~~--~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v-~~~~l~~~~~~~~~~~d~~~~~~~~~~~ 335 (533) ....++ .+ +| .+.+ +++..+.+|...|...+.+-.....+++ ....+.... .....++.+...|..... T Consensus 315 r~~~d~----~~~~yG--~vr~-~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~-~~~~~~~~~~~~~~~~~~ 386 (708) T protein:vir:10 315 RWFIDD----IERVEG--HIAK-AMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLE-KHWEARNKKRPAFLPLRE 386 (708) T ss_pred eeccCC----Ccccce--eecc-cchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHH-HHHhhccccchhhhcccc Confidence 111111 22 33 2333 5788999999999999888655444433 222221100 000011111111211110 Q ss_pred ccccccc--ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHH Q lcl|NC_016654. 336 GGFNANG--DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHF 413 (533) Q Consensus 336 ~~~~~~~--~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~ 413 (533) .....|. ........+++.--...+++.++.....|...+|+++..+|-. ++ .||.+|..+..............+ T Consensus 387 ~~~~~G~~~~~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~~-sn-~SG~aI~~rq~qg~~~l~~~~Dnl 464 (708) T protein:vir:10 387 VRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMP-SN-IAQETVNNLMNRADMASFIYLDNM 464 (708) T ss_pred ccccccccccccCCccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccCc-cc-hHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000000 0001122233322234678888888889999999999999953 33 589999999888888888888888 Q ss_pred HHHHHHHHHHHHHHHHhhccCCCC----------------------------------CCceeEEEEeCCCCCCCHHHHH Q lcl|NC_016654. 414 GSALGPLSTTCLRVDAIKFPGKGA----------------------------------APSEELELEWPKFARESDLAKA 459 (533) Q Consensus 414 ~~al~~li~~il~l~~~~~~~~~~----------------------------------~~~~~v~i~f~d~i~~d~~e~a 459 (533) ..+++.+.+.+|.+-...+..... ...++|.|+=..+.+.-+.+.+ T Consensus 465 ~~~~~~~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~ 544 (708) T protein:vir:10 465 AKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATV 544 (708) T ss_pred HHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccCchhHHHHHH Confidence 888888888777764432211100 0012333333334444466777 Q ss_pred HHHHHHHhCCCC-CHHHH-----HHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCC--CCCC--CCCC Q lcl|NC_016654. 460 QTVQAWSVASAA-STKTK-----VAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTEND--PATD--PEAV 529 (533) Q Consensus 460 ~~~~~l~~aGi~-S~et~-----v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~--~~~~--~~~~ 529 (533) +.++++..+..- -..++ +-++. ++ .-+++.+++|++........ .+...++.. .+.+ .+.. T Consensus 545 ~~l~qll~~~~p~~~~~~~~~~~~l~~~-D~--p~~~ei~erir~~~~~~~~~------~~~~~ee~q~~~~~q~~~q~q 615 (708) T protein:vir:10 545 SVLTNVLSSMLPTDPMRPAIQGIILDNI-DG--EGLDDFKEYNRNQLLISGIA------KPRNEKEQQIVQQAQMAAQSQ 615 (708) T ss_pred HHHHHHHHhcCCCchhhHHHHHHHHHhc-CC--cChHHHHHHHHHhhcccccc------cccchhhHHHHHHHHHHHHHH Confidence 777777554321 11221 11111 22 22445566766654221110 010000000 0000 0000 Q ss_pred CCCC Q lcl|NC_016654. 530 DEGE 533 (533) Q Consensus 530 ~d~~ 533 (533) .+-+ T Consensus 616 ~~~~ 619 (708) T protein:vir:10 616 PNPE 619 (708) T ss_pred HHHH Confidence 0000 No 114 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=98.64 E-value=1.6e-07 Score=57.84 Aligned_cols=457 Identities=11% Similarity=0.022 Sum_probs=191.9 Q ss_pred CCCCc-CCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecCh Q lcl|NC_016654. 3 LPEAN-TAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPI 81 (533) Q Consensus 3 ~~~~~-~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~ 81 (533) |++.. +..+=......+..++..|..|...-.++.+|.... .+.......+.+..++--.- T Consensus 1 m~~~~~~~~~~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~------------------~~~~~~~~~~~~~~~~~dst 62 (535) T protein:vir:15 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPS------------------LFPKESDNESTDYTTPWQAV 62 (535) T ss_pred CCccchhccchHHHHHHHHHHHHHhhHHHHHHHHHHHHhccc------------------ccCCCCCccccccccccccc Confidence 44222 234444444567777777766665555555543221 11111112222333454556 Q ss_pred HHHHHHHHHHhhcCC----CceEeeCCCc-------------hHHHHHHH-------HHHhhccHHHHHHHHHHHHhhhC Q lcl|NC_016654. 82 PGVIAKLSTTELFSE----QLKFLDAGKS-------------KEVQARAD-------LIFNTPRFHSSLVEAGESCSALS 137 (533) Q Consensus 82 ~k~i~~~~a~ll~~e----~~~i~~~~~~-------------~~~~~~l~-------~i~~~n~f~~~~~~~~~~~~~~G 137 (533) +...++.+|+.|.+- .+.|.....+ ...+++|+ ..+..++|...+.++.+...+.| T Consensus 63 ~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G 142 (535) T protein:vir:15 63 GARGLNNLASKLMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAG 142 (535) T ss_pred HHHHHHHHHHHHHHhhcCCCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhC Confidence 677777777765443 2344432211 12334443 45888899999999999999999 Q ss_pred CEEEEEEEcCCCCCceEEEEEcCCeEEEEEe-cCCceEEEEEEEEeec--------------CCc------eEEEEEEEe Q lcl|NC_016654. 138 GSFQRIVWDPTIADNAWIDFVDADRAIPEFR-WGRLVAVTFWSELAGG--------------DGQ------EVWRHLERH 196 (533) Q Consensus 138 ~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~~--------------~~~------~~y~~lE~h 196 (533) .+.+++ +++.++.+++..++-..++-.-+ +|++..++.-.+++.. .++ .+|+++... T Consensus 143 ~a~l~~--~~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~ 220 (535) T protein:vir:15 143 NALLYL--PEPEGSYNPMKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLD 220 (535) T ss_pred ceeEEe--ecCCCCceeeEEEEcCeeEEeeCCCCCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEEEEEe Confidence 988664 44444567888888887776655 4777665543333210 000 122222111 Q ss_pred cCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchh Q lcl|NC_016654. 197 ESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADL 276 (533) Q Consensus 197 ~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~ 276 (533) ... -.|..|..-+ |..+++ ...+.+..-+.|++. +|... ..+.||+|-. T Consensus 221 ~~~-~~~~~~~e~~---g~~~~~---------------------~~~~~~~~~~P~i~~-----Rw~~~-~ge~YGrgp~ 269 (535) T protein:vir:15 221 EES-GDYLKYEEVE---DVEIDG---------------------SDATYPTDAMPYIPV-----RMVRI-DGESYGRSYC 269 (535) T ss_pred cCC-CcEEEEEEee---Cccccc---------------------cccccccccCCceee-----eeeec-CCCccccchH Confidence 000 0000110000 000000 000111001122222 23322 2477899987 Q ss_pred hhhHHHHHHHHHHHHHHHHHHH-HhCcceeeechHHhcCCCCccccccCc-chhhhhhccccccccccccccceeee-ch Q lcl|NC_016654. 277 STDLFPTFHELDRIYSSLMRDF-RIGAGKVHASESVLTNLGMGQGVSLDE-EQEVYSRVGSGGFNANGDMETIFEFF-QP 353 (533) Q Consensus 277 ~~~i~~lid~lD~~~s~~~~~~-~~~~~~i~v~~~~l~~~~~~~~~~~d~-~~~~~~~~~~~~~~~~~~~~~~i~~~-~~ 353 (533) ..++ +-+..|+..--...... ...++...|+++.. .+...+.+ ..+.+... .. ++. ..+... .. T Consensus 270 ~~~l-~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~-----~~~~~l~~~~~g~~v~g-----~~-~~v-~~~~~~~~~ 336 (535) T protein:vir:15 270 EEYL-GDLRSLENLQEAIVKMSMISAKVIGLVNPAGI-----TQPRRLTKAQTGDFVPG-----RR-EDI-DFLQLEKQA 336 (535) T ss_pred HHHH-HHHHHHHHHHHHHHHHHHHHhcCceeeccccc-----ccchhcccCCceeeecC-----Cc-ccc-eeeeccccc Confidence 7766 56678887666555554 34555555544322 11111101 11112110 00 010 011111 11 Q ss_pred hhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_016654. 354 AIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKA-RHFGSALGPLSTTCLRVDAIKF 432 (533) Q Consensus 354 ~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~-~~~~~al~~li~~il~l~~~~~ 432 (533) ++ ......++.+-..|.... +.. .+....+...|||||..+.+......+-.- +.-...|.-|+..++.+.... T Consensus 337 ~~--~~~~~~i~~~~~~I~~af-~~~-~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~- 411 (535) T protein:vir:15 337 DF--TVAKAVSDQIEARLSYAF-MLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQAT- 411 (535) T ss_pred ch--hHHHHHHHHHHHHHHHHH-hhh-hcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhc- Confidence 22 222233333333332221 110 122233445699999888877666555432 223334445555554443210 Q ss_pred cCCCCCCceeEEEEeCCCCCCC-----HHHHHHHHHHHHhCC------CCCHHHHHHHh---CC---C---CCHHHHHHH Q lcl|NC_016654. 433 PGKGAAPSEELELEWPKFARES-----DLAKAQTVQAWSVAS------AASTKTKVAYL---HE---D---WDDERVQEE 492 (533) Q Consensus 433 ~~~~~~~~~~v~i~f~d~i~~d-----~~e~a~~~~~l~~aG------i~S~et~v~~l---~~---~---~~dee~~~E 492 (533) +-....+...+++++--++..- .....+.++.+.+.+ .+....+++.+ .+ . -+++|+++. T Consensus 412 g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~~~~eev~~~ 491 (535) T protein:vir:15 412 SQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQAL 491 (535) T ss_pred CCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHH Confidence 1112234556777775554321 111112222221111 12233333332 11 1 256666655 Q ss_pred HHHHHHhhhc--ccCccccccccCCCCCCC------CCCCCCCC Q lcl|NC_016654. 493 ADLIDNANTV--SAPTFGFGTDQPPLPTEN------DPATDPEA 528 (533) Q Consensus 493 l~rI~~E~~~--~~~~~~~~~~~~~~~~~~------~~~~~~~~ 528 (533) .++.++.++. .+...+.+...++...+. +..|.+-. T Consensus 492 ~~q~~~~~~~~~~a~~~g~~~~~~~~~~p~~~~~~~~~~g~~~~ 535 (535) T protein:vir:15 492 MMQDAAQTGIENAAATGGAGVGALATSSPEAMQGAAAQAGLDAT 535 (535) T ss_pred HHHHHHHHHHHHHHHHHHhhccchhccChHHHHHHHhccCCCCC Confidence 5443322211 111111111111111000 11111111 No 115 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=98.62 E-value=1.8e-07 Score=57.60 Aligned_cols=441 Identities=10% Similarity=0.037 Sum_probs=170.1 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCc--ccceee Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGR--APKRYH 78 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~--~~~~~~ 78 (533) |-|=++.++=||-........ +||.... +- .. . .....+.........|+..-. .+..-. T Consensus 34 ~~~~~~p~~~~~~~~~~~~~~---------~d~~~~~-~~-r~------g-~~~~~~~~g~~~~~epp~d~~~l~~l~~~ 95 (648) T protein:vir:79 34 MQLGEAPGAMPKGGGGGGSAK---------RDPKMSL-VK-RI------G-LAIMDGGGGGRDFEEPEFDFNEITSAYNT 95 (648) T ss_pred cccCCCccccCCCCccccccc---------ccchhHH-HH-Hh------H-HHHHhhcCCccccccCCcCHHHHHHHHhc Confidence 666655545444442221111 1221100 00 00 0 000000000000000110000 011113 Q ss_pred cChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHH-HHHHhhc---cHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceE Q lcl|NC_016654. 79 APIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARA-DLIFNTP---RFHSSLVEAGESCSALSGSFQRIVWDPTIADNAW 154 (533) Q Consensus 79 ~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l-~~i~~~n---~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~ 154 (533) .+.....|+.+|+-+.+-+..+....+........ ..++..| .....+...+.+.+..|.+|+.+..|.++..-.. T Consensus 96 np~V~~aI~iia~~ia~l~~~i~~~~~~~~~~~~~~~ll~rPn~~~t~~~f~~~l~~~lll~GNAYveiiRd~~G~~~~~ 175 (648) T protein:vir:79 96 EGYVRQAVDKYIEMMFKADWDFVSKNPNAVEYIRMRFTLMAEATQIPTNQLFIEIAEDLVKYCNVVIAKSRAKDALPFQG 175 (648) T ss_pred ChHHHHHHHHHHHHHhhCcceEEecCCccchhhHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCCCccchh Confidence 34556677777777777665555443221111111 1122222 2334556667777889999999888876532222 Q ss_pred EEEEcCC------eEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccc Q lcl|NC_016654. 155 IDFVDAD------RAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDI 228 (533) Q Consensus 155 i~~v~~~------~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~ 228 (533) +..+... .++|+ +..+ +. +..-..+.+.++.|...++. ..+.+ T Consensus 176 l~~~~~~~~~~v~~l~pl-~p~~-----------------v~--v~~d~~g~~~~Y~y~~~g~~--~~~~~--------- 224 (648) T protein:vir:79 176 MNVMGVGDSMPVAGYFPL-NLAS-----------------MK--VKRDKFGMIKGWQQEQEGQD--KPQKF--------- 224 (648) T ss_pred hhhhhhccccceeeeEee-cCce-----------------eE--EEEcCCCceeeeEEEecCCc--eeEEe--------- Confidence 2111111 11221 0000 00 00011223333333322111 11111 Q ss_pred cccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCcceeee Q lcl|NC_016654. 229 AVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGAGKVHA 307 (533) Q Consensus 229 ~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~~~i~v 307 (533) ++--+.|+.... +....+|.|.+..+. ..|. +....+.+... |+.+...-.+ T Consensus 225 -----------------~~~dIIHik~~~--------~~d~~~GlSpi~~a~-~aI~-l~~aa~~~~~~fF~NGa~P~gi 277 (648) T protein:vir:79 225 -----------------KPEDIVHIYYKR--------EKGRAFGTPWLLPAL-DDIR-ALRQVEENVLRLVYRNLHPLWH 277 (648) T ss_pred -----------------cCccEEEEccCC--------CCCCceeccHHHHHH-HHHH-HHHHHHHHHHHHHhccCCccEE Confidence 011133333211 112456888877654 3442 33444444443 3544322222 Q ss_pred chHHhcCCCCccccccCcchh-------hhhhccccccccccccccceeeechh--hhhHHHHHHHHHHHHHHHHhhCCC Q lcl|NC_016654. 308 SESVLTNLGMGQGVSLDEEQE-------VYSRVGSGGFNANGDMETIFEFFQPA--IRVLEHDQGAALLLREVLRKTGYS 378 (533) Q Consensus 308 ~~~~l~~~~~~~~~~~d~~~~-------~~~~~~~~~~~~~~~~~~~i~~~~~~--ir~e~~~~~l~~~l~~i~~~~g~s 378 (533) +....+.. ..+..++ .|..... .++..+.....+++. -...++++..+...++|+...|+| T Consensus 278 ----l~~~~~~~--~~e~~k~~~e~~~~~~~~~~i----~gg~v~~~~~~i~~~~s~~dlqfle~rk~~~~eIa~aFgVP 347 (648) T protein:vir:79 278 ----VKVGLEQE--GFGAEEGEVDLVRGEVENMDV----EGGMVTTERVNISSIASNQIIDAKEYLKHFEQRAFTVLGVS 347 (648) T ss_pred ----EEeCCCcc--chHHHHHHHHHHHHhcccccc----cccccccceeeccccCCHHHHHHHHHHHHHHHHHHHHhCCC Confidence 21111111 1111111 1222111 122222111122221 134467777788889999999999 Q ss_pred hhhcccCCCc-chhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHH Q lcl|NC_016654. 379 PVSLGLSDEV-AQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLA 457 (533) Q Consensus 379 ~~~~g~~~~~-~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e 457 (533) |..+|+..++ -.|+.+....++..+ .-.+..+...+...+...+.+... +. ........+.++|++....|..+ T Consensus 348 P~lLG~~~~ss~stae~~~~~~~~~i---~~l~~~i~~~le~~~~~~ll~e~~-l~-~~l~~d~~ieF~~~~Llr~D~~~ 422 (648) T protein:vir:79 348 ELMMGRGGTASRSTGDNLSSDFKDRI---KALQKVMATFINEFMVKEILMEGG-FD-PVLNPDDKVEFRFNEIDMDSKIK 422 (648) T ss_pred HhHcccCCCccchHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHhhhhh-cc-ccccccceEEEeecccchhhHHH Confidence 9999975433 344555444333322 222223333333222111111111 11 11223456888999888889999 Q ss_pred HHHHHHHHHhCCCCCHHHHHHHhC-CCCCHHHHHHHHH-------HHHHhhhcccCccccccccCCCCCC--CCCCCC-C Q lcl|NC_016654. 458 KAQTVQAWSVASAASTKTKVAYLH-EDWDDERVQEEAD-------LIDNANTVSAPTFGFGTDQPPLPTE--NDPATD-P 526 (533) Q Consensus 458 ~a~~~~~l~~aGi~S~et~v~~l~-~~~~dee~~~El~-------rI~~E~~~~~~~~~~~~~~~~~~~~--~~~~~~-~ 526 (533) .++.+.+++++|+||..++.+++. +-..+.+-...+. ....+.+....+.+. +......+. .+.+.+ . T Consensus 423 ~a~~~~~l~~~GilT~NEaR~~lGlpPi~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~-~~~~a~~eg~~~e~~~~~~ 501 (648) T protein:vir:79 423 LENQAVFLYEHNAISEDEMRELIGRDPVDDGEGRAKMHLQMVTIAQATALAALAPTPAGG-SSASASGDKKKKATDNKTK 501 (648) T ss_pred HHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCccccccccccchhccccccCCCCCCCC-CCCCccccccccccCCCCC Confidence 999999999999999999877652 1122211111110 000000000000000 000000000 000001 0 Q ss_pred CCCCCC-------C Q lcl|NC_016654. 527 EAVDEG-------E 533 (533) Q Consensus 527 ~~~~d~-------~ 533 (533) ++...| + T Consensus 502 ~~~~~g~~~~~~~~ 515 (648) T protein:vir:79 502 PTNQHGTKTSPKKQ 515 (648) T ss_pred CCCCCCcCCCCccc Confidence 111111 1 No 116 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=98.59 E-value=2.2e-07 Score=57.06 Aligned_cols=483 Identities=13% Similarity=0.059 Sum_probs=176.6 Q ss_pred CCCCCCc--CC-CcCc-------c----hHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhccc Q lcl|NC_016654. 1 MSLPEAN--TA-WPPP-------E----LAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRT 66 (533) Q Consensus 1 ~~~~~~~--~~-~pp~-------~----~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~ 66 (533) |.+|.|- -. =++. . +...+...+..|.+|.-...++-++|......+ ..++... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~------------~~~~~~~ 68 (641) T protein:vir:94 1 MTIEMPTPIIEDKESAKRKLSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDR------------QNTRARN 68 (641) T ss_pred CccCCCcccccCCcchhhcCCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhh------------hhccccc Confidence 3333221 00 0111 1 122233333333333322222323322111100 0000000 Q ss_pred ----CCCCCcccceeecChHHHHHHHHHHhhcC----CCceEeeCC---Cch----HHHHHHHHHHhhccHHHHHHHHHH Q lcl|NC_016654. 67 ----PTATGRAPKRYHAPIPGVIAKLSTTELFS----EQLKFLDAG---KSK----EVQARADLIFNTPRFHSSLVEAGE 131 (533) Q Consensus 67 ----~~~~g~~~~~~~~n~~k~i~~~~a~ll~~----e~~~i~~~~---~~~----~~~~~l~~i~~~n~f~~~~~~~~~ 131 (533) ....+..++++..+-+...|+.+++-|.+ ....+...+ ++. .++++++..+.+++|...+.+.+. T Consensus 69 ~~~~~~~~~~~r~ki~~~~~~~~~~~l~s~Lm~~~~p~~~wf~~~p~~~ed~~~A~~~~~~~~~~l~~~~~~~~~~~~~~ 148 (641) T protein:vir:94 69 FQTTGADDADWRHRINTGHTFEVVETLVAYFKGATFPSDDWFDLKGMVPELADAARVVKQLTKTKLEAASIRDIFETYVR 148 (641) T ss_pred ccccccchhcccccccchhHHHHHHHHhhHHhhhhcCCCceEEEecCCCChHHHHHHHHHHHHHHHhhcchHHHHHHHHH Confidence 01112235577887777777777765544 333454322 222 244667777888999999999999 Q ss_pred HHhhhCCEEEEEEEcCC------------C--------------CCceEEEEEcCCeEEEEEecCCceEEEEEE--EEee Q lcl|NC_016654. 132 SCSALSGSFQRIVWDPT------------I--------------ADNAWIDFVDADRAIPEFRWGRLVAVTFWS--ELAG 183 (533) Q Consensus 132 ~~~~~G~~~~~~~~D~~------------~--------------~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~--~~~~ 183 (533) .++.+|.++++++|+.. + ...++++.+++..+++- ..++....+|+. .+.. T Consensus 149 d~~~~g~~iv~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~~d-ps~~~~~~~f~~~r~t~~ 227 (641) T protein:vir:94 149 NLVLYGVSTYRLGWDTSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVWLD-TSGGKNTGTFVRLRHTRE 227 (641) T ss_pred HHhhcCceEEEeehhhHHHHhhhhhcccchhhcccccccceecccceeeEEecchhheeec-CCCCcccccceehhhhHH Confidence 99999999999998621 0 01233444444433321 012222222321 1110 Q ss_pred ----cCCceEEEEEEEecCeeEEEEEEeccC-Ccc----cceeehhhccc-cc-------cccccccccCCceeec-CCC Q lcl|NC_016654. 184 ----GDGQEVWRHLERHESGYIVHAVYKGTA-TSL----GWMMALTDHPA-TR-------DIAVEGADEGRGAYVE-TGV 245 (533) Q Consensus 184 ----~~~~~~y~~lE~h~~~~I~~~~y~~~~-~~l----G~~v~l~~~~~-~~-------~~~~~~~~~~~~~~~~-~g~ 245 (533) --+..+|-+--......+. |+.+. +.. |......++-+ |. .+...+....+...+. .+. T Consensus 228 t~~~l~~eg~~~~d~v~~~~~~~---~~~~~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g~~il~~~~~ 304 (641) T protein:vir:94 228 ELHELVTSGYYDLDLTQVEQYVD---YKFADPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYGKQLIRLSDS 304 (641) T ss_pred HHHHHHhcCCCChhhcchhhccc---ccccccccccccccccccccceeeeeeeeccCCCceeeEEEEEeCCEEeecccc Confidence 0000001000000000000 00000 000 00000000000 00 0000001111111111 111 Q ss_pred ----ccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhC-cceeeechH-HhcCCCCcc Q lcl|NC_016654. 246 ----KDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIG-AGKVHASES-VLTNLGMGQ 319 (533) Q Consensus 246 ----~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~-~~~i~v~~~-~l~~~~~~~ 319 (533) ..||+.+.. ... ..+.+|+|....++ +.++.++...-+..+.+..+ +....++.+ .+.+.. T Consensus 305 ~~~d~~Pf~~~r~--------~~~-~~~~YG~gp~~~~l-~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~~~--- 371 (641) T protein:vir:94 305 KYWCGSPFVTTTL--------LPD-RDSVYGMSVLHPNL-GALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKRED--- 371 (641) T ss_pred cccCcCCeEEecc--------eec-CCcccCCChHHHHH-HHHHHHHHHHHHHHHHHHHHhCCeeeeccccccccce--- Confidence 123443322 211 13678999888755 78899999888888777543 333333322 222210 Q ss_pred ccccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCCh--hhcccCCCcchhHHHHHH Q lcl|NC_016654. 320 GVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSP--VSLGLSDEVAQTATEASG 397 (533) Q Consensus 320 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~--~~~g~~~~~~~Tatai~~ 397 (533) ..+.+.. ++. .. ..++. ..+...++++.+. ...++.+-..+....+.+. +......+...|||||.. T Consensus 372 -l~~~PG~-ii~-~~-----~~~~v-~pl~~~~~~~~~~--~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~ 440 (641) T protein:vir:94 372 -VKAKPGA-VFK-VA-----QHGSL-QPIDMGRQDFVVT--YQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQG 440 (641) T ss_pred -eeccCCc-cee-eC-----CCCcc-eeecCCccccchh--HHHHHHHHHHHHHhhhhhhhhcccccccchhccHHHHHH Confidence 0011111 111 00 00111 1111112222221 1222222223332222221 111111222359999999 Q ss_pred HhhhHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhc---------------cCCCCCCceeEEEEeCCCCCCCH---HHH Q lcl|NC_016654. 398 KKDLTVKTTRAKARHFG-SALGPLSTTCLRVDAIKF---------------PGKGAAPSEELELEWPKFARESD---LAK 458 (533) Q Consensus 398 ~~~~l~~~~~~~~~~~~-~al~~li~~il~l~~~~~---------------~~~~~~~~~~v~i~f~d~i~~d~---~e~ 458 (533) +.+........+.+.|. ..|..|++.++.+....+ ++....+...++.+|+- ++... .+. T Consensus 441 ~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~p~~L~~~~~i-v~l~~~q~~~~ 519 (641) T protein:vir:94 441 VRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVSPEYLHYPYKF-LALGANYVVER 519 (641) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCCCCccceeeeeeE-eecchhHHHHH Confidence 88888888888888877 477778887776543221 12223344455555543 23332 233 Q ss_pred HHHHHHHHhCCCCCH----HHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccc-cccCCCCCC-------------- Q lcl|NC_016654. 459 AQTVQAWSVASAAST----KTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFG-TDQPPLPTE-------------- 519 (533) Q Consensus 459 a~~~~~l~~aGi~S~----et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~-~~~~~~~~~-------------- 519 (533) ++.++.+. ++++. -.... + .+-+.+.+++.+ -.+.-.|....- .++++.... T Consensus 520 ~~~i~~l~--~~~~~~a~~P~v~d--~--~d~~~~~~~~~~---~~g~~~p~~~ir~~~~~~~~~~~~~~~~q~~~~~~a 590 (641) T protein:vir:94 520 ERMVTDLL--QLLDISGRVPQIGQ--S--LDYALILEDLLR---QMRFTDPMRYIKKAEAPPAAPPIAPAEPGALPPEMM 590 (641) T ss_pred HHHHHHHH--HHHHHhhcChhhhh--c--CCHHHHHHHHHH---HhCCCCchhhccCccCchhHHHHHHHHHHHHHHHHH Confidence 33333332 11110 01111 1 111222122211 111111110000 000000000 Q ss_pred CCCCCC-------------------CCCCCCCC Q lcl|NC_016654. 520 NDPATD-------------------PEAVDEGE 533 (533) Q Consensus 520 ~~~~~~-------------------~~~~~d~~ 533 (533) +..++. -=.-+.++ T Consensus 591 ~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 623 (641) T protein:vir:94 591 NSVGGGLNDQAIAGMTPEDVSDLASRIGIDTSD 623 (641) T ss_pred HHHHhhhHHHHHHHhhHHHHHHHHHhhcCCchh Confidence 000000 00000000 No 117 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=98.56 E-value=2.7e-07 Score=56.55 Aligned_cols=422 Identities=14% Similarity=0.063 Sum_probs=156.7 Q ss_pred HHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHH-HhcccCCCCCcccc---eeecChHHHHHHHHHHhhcCCC Q lcl|NC_016654. 22 VAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEA-FHGRTPTATGRAPK---RYHAPIPGVIAKLSTTELFSEQ 97 (533) Q Consensus 22 ~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~g~~~~---~~~~n~~k~i~~~~a~ll~~e~ 97 (533) |-.|..|. | +.............+...... +..+.....|.... -+.+.---..++.+|+-+-+=| T Consensus 1 Mg~~~~l~-~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~iA~lp 70 (457) T protein:vir:62 1 MGFWSALF-G---------RGHSPALDAAEGRAWEPYDPSIYNLGATASSGERVTPHDALQVSAVFASVRLLSETIATLP 70 (457) T ss_pred Cchhhhhh-c---------cccccccccccccccccchhhhhhccccccCCceechHHhhccHHHHHHHHHHHHhHhhCc Confidence 44443322 1 110000000000000000011 11111222221110 1111111223444444444335 Q ss_pred ceEeeCCCc--hHH-HHHHHHHHh-hcc---HHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEec- Q lcl|NC_016654. 98 LKFLDAGKS--KEV-QARADLIFN-TPR---FHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRW- 169 (533) Q Consensus 98 ~~i~~~~~~--~~~-~~~l~~i~~-~n~---f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~- 169 (533) ..+.-.... ... ...+..++. .|+ ....+...+...+..|.+|+.+..+. + .-..+..++|+++.+.-.. T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~~-g-~~~~l~~l~p~~v~v~~~~~ 148 (457) T protein:vir:62 71 LSTYSKRGGTRKEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWAG-P-NIAGLDVLDPTKIHVHMVMV 148 (457) T ss_pred eEEEEecCCccccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeCC-C-cEEEEEEEcCcceEEEEecc Confidence 443221111 111 111222222 222 33345555666777899998886552 2 2335555666666554221 Q ss_pred CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccce Q lcl|NC_016654. 170 GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLT 249 (533) Q Consensus 170 g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 249 (533) +....-.|. .|.-... |.+.-+..+ .+-- T Consensus 149 ~~~~~~~~~--------------------------~y~~~~~--g~~~~~~~~-----------------------~~~e 177 (457) T protein:vir:62 149 DGLRRKVFE--------------------------AYDIDAD--GNEVLLGWF-----------------------TPRD 177 (457) T ss_pred CCccceeEE--------------------------EEEEccC--CceeEEEee-----------------------Cccc Confidence 111100000 0110000 000000000 0001 Q ss_pred eEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCc-ceeeechHHhcCCCCccccccCcch Q lcl|NC_016654. 250 AAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGA-GKVHASESVLTNLGMGQGVSLDEEQ 327 (533) Q Consensus 250 ~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~-~~i~v~~~~l~~~~~~~~~~~d~~~ 327 (533) +.|+....++ ...+|.|.+..+. ..|. ++....++... |+.+. ...++ ...+.-......... T Consensus 178 iih~r~~~~~--------~~~~G~sp~~~~~-~~i~-~~~~~~~~~~~~f~ng~~p~gil-----~~~~~ls~e~~~~~~ 242 (457) T protein:vir:62 178 VLHIPGMMLP--------GDFVGCSPISYAR-ESIG-LALAAQKYGAHFFRNGAMPGAVV-----EVPGTMSEEGLARAR 242 (457) T ss_pred eEEecCCCCC--------CceecccHHHHHH-HHHH-HHHHHHHHHHHHHhccCCcceEE-----EcCCCCCHHHHHHHH Confidence 2233322111 1235777776543 4443 33444444444 35432 22222 111100000000011 Q ss_pred hhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHH Q lcl|NC_016654. 328 EVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTV 403 (533) Q Consensus 328 ~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~ 403 (533) ..+.....+..++++ +....++.++......++++..+....+|+...|++|..+|+..++..++..+...... T Consensus 243 ~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~-- 320 (457) T protein:vir:62 243 EAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIA-- 320 (457) T ss_pred HHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHH-- Confidence 112111111111111 11223555666666668888888888999999999999998766554433333222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhC-C Q lcl|NC_016654. 404 KTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLH-E 482 (533) Q Consensus 404 ~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~-~ 482 (533) .+..+|.-++..+..-.+..+..........+.++++.-.-.|..++++.+.+++++|+|+..++.+++. + T Consensus 321 --------f~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~ 392 (457) T protein:vir:62 321 --------FTMFSLRPWLERIEAGFNRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMT 392 (457) T ss_pred --------HHHHHHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Confidence 2222333333333222222221111112334666666777789999999999999999999999777642 1 Q ss_pred CCCHHHHHHHHH-----HHHHh---hhcccC-ccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 483 DWDDERVQEEAD-----LIDNA---NTVSAP-TFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 483 ~~~dee~~~El~-----rI~~E---~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) -..+..+++-+. .+... +....+ ......+.+ .. .+++++.....+++| T Consensus 393 pi~~g~~D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~d~~~ 450 (457) T protein:vir:62 393 PLPDGLGEKYRVPLNLGEIGEEPEPEPAPAPPAIDPPAEEP-AD-DEEPDNAEGDPDEGE 450 (457) T ss_pred CCCCCCcceeeeccccccccccccccccCCCccCCCCccCC-CC-CCCCCCCCCCCcccc Confidence 122211111110 01000 000000 000000011 00 011111111111111 No 118 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=98.53 E-value=3.4e-07 Score=56.01 Aligned_cols=423 Identities=13% Similarity=0.052 Sum_probs=162.6 Q ss_pred HHhhhHhhc-CCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccc-eeecChH--HHHHHHHHHhhcCCC Q lcl|NC_016654. 22 VAESHVWWE-GDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPK-RYHAPIP--GVIAKLSTTELFSEQ 97 (533) Q Consensus 22 ~~~~~~w~~-gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~-~~~~n~~--k~i~~~~a~ll~~e~ 97 (533) |-.|+.|.. +....+...- ...+ ..+.. .-+..+.....|.... ...+..+ -..|+.+|+-+-+=| T Consensus 1 Mg~~~~l~~r~~~~~~~~~~-------~~~~-~~~~~--~~~~~~~~~~~g~~V~~~~al~~~~V~~~v~~Ia~~iA~lp 70 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIE-------ARAW-EPYDP--SIYNLGAVAASGETVTPHDALQVSAVFASVRLLSETIATLP 70 (457) T ss_pred Cchhhhhhcccccccccccc-------cccc-cccch--HHHhhcccccCCceechHHhhccHHHHHHHHHHHHhhccCc Confidence 333332211 0000000000 0000 00000 0111111222222111 1112222 134455555554445 Q ss_pred ceEeeCC---CchHHHHHHHHHHhh-cc---HHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEec- Q lcl|NC_016654. 98 LKFLDAG---KSKEVQARADLIFNT-PR---FHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRW- 169 (533) Q Consensus 98 ~~i~~~~---~~~~~~~~l~~i~~~-n~---f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~- 169 (533) ..+--.. ..+.....+..+++. ++ ....+...+...+..|.+|+.+..+ ++ .-+.+..++|+++.+.... T Consensus 71 ~~~~~~~~~~~~~~~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~-~g-~~~~l~~l~p~~v~v~~~~~ 148 (457) T protein:vir:13 71 LSTYSKRGGSRKEIVTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQ-GP-NIVGLDVLDPTKIHVHMVMV 148 (457) T ss_pred eEEEEecCCcccccccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CC-cEEEEEEEccCceEEEEecC Confidence 4432211 111112234444432 11 2234555556667789999888665 22 3345666777766665321 Q ss_pred CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccce Q lcl|NC_016654. 170 GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLT 249 (533) Q Consensus 170 g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 249 (533) +....-.|. .|.-... |.++-+..+ ..-- T Consensus 149 ~~~~~~~~~--------------------------~y~~~~~--~~~~~~~~~-----------------------~~~d 177 (457) T protein:vir:13 149 DGLRRKVFE--------------------------AYDIDAD--GNEVLLGWF-----------------------TPRD 177 (457) T ss_pred CCccceeEE--------------------------EEEEecC--CceeeEEee-----------------------Cccc Confidence 111111111 1110000 000000000 0001 Q ss_pred eEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHH-HhC-cceeeechHHhcCCCCccccccCcch Q lcl|NC_016654. 250 AAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDF-RIG-AGKVHASESVLTNLGMGQGVSLDEEQ 327 (533) Q Consensus 250 ~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~-~~~i~v~~~~l~~~~~~~~~~~d~~~ 327 (533) +.|++....+ ...+|.|.+..+. ..| .++....++...+ +.| .+..++ ...+.-.....+... T Consensus 178 iih~~~~~~~--------~~~~G~s~i~~~~-~~i-~~~~~~~~~~~~~f~ng~~p~gil-----~~~~~ls~e~~~~~~ 242 (457) T protein:vir:13 178 VLHIPGMMLP--------GDFVGCSPISYAR-ESI-GLALAAQKYGSKFFANGAMPGAVV-----EVPGTMSEEGLARAR 242 (457) T ss_pred eEEecCCCCC--------CccccccHHHHHH-HHH-HHHHHHHHHHHHHHhcCCCcceEE-----EcCCCCCHHHHHHHH Confidence 2333322111 1235777776543 444 3344444454443 443 222222 111100000000011 Q ss_pred hhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHH Q lcl|NC_016654. 328 EVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTV 403 (533) Q Consensus 328 ~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~ 403 (533) ..+.....+..++++ +....++.++......++++..+...++|+...|++|..+|+..++..++..+..... T Consensus 243 ~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~--- 319 (457) T protein:vir:13 243 EAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNI--- 319 (457) T ss_pred HHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHH--- Confidence 112111111111111 1112355666666666788888888899999999999999876655443333322221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhC-C Q lcl|NC_016654. 404 KTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLH-E 482 (533) Q Consensus 404 ~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~-~ 482 (533) ..++.+|..++..+..-.+..+..........+.++++.-...|..++++.+.+++++|+|+..++.+.+. + T Consensus 320 -------~f~~~tl~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~ 392 (457) T protein:vir:13 320 -------AFTMFSLRPWLERIEAGFNRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMT 392 (457) T ss_pred -------HHHHHHHHHHHHHHHHHHHHhhcCccccCceeEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Confidence 12233343333333322222222221122345677777878889999999999999999999999766541 1 Q ss_pred CCCHHHHHHHHH-----HHHH---hhhcccCcc-ccccccC---CCCCC-CCCCC--CCCCCCCC Q lcl|NC_016654. 483 DWDDERVQEEAD-----LIDN---ANTVSAPTF-GFGTDQP---PLPTE-NDPAT--DPEAVDEG 532 (533) Q Consensus 483 ~~~dee~~~El~-----rI~~---E~~~~~~~~-~~~~~~~---~~~~~-~~~~~--~~~~~~d~ 532 (533) ...+..+++-+. .+.. .+.+..++. +...+++ +..++ .++++ ....++|. T Consensus 393 Pi~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~d~~~~~~~~~~~~~ 457 (457) T protein:vir:13 393 PLPDGLGEKYRVPLNLGEVGEEPEPEPAPAPPAIEPPAEEPDEEPEPEGKPDDEGATEEDDEDDA 457 (457) T ss_pred CCCCCcccceeeccccccccccccccccCCCCCCCCCccccCCCCCCCCCCccccCCCCcccccC Confidence 121211111110 0000 011111111 1111111 00010 11111 11111222 No 119 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=98.49 E-value=4.6e-07 Score=55.31 Aligned_cols=493 Identities=7% Similarity=-0.088 Sum_probs=212.6 Q ss_pred hHHH-HHHHHhhhHhhcCCHHHHHHHHhcc----Ccc--hhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHH Q lcl|NC_016654. 15 LAAV-TARVAESHVWWEGDLDKLATFYGAE----GRT--SPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAK 87 (533) Q Consensus 15 ~~~~-~~~~~~~~~w~~gd~~~l~~~y~~~----~~~--~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~ 87 (533) ++.. ...+...+.||..+.+.-...++.. ..| ..+||............. ...+..+..|+.+.+|+ T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~------~~~~P~~~~N~i~~~v~ 74 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKH------FEKYPKFEINKISTELN 74 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHh------hCCCCeEEEccHHHHHH Confidence 5544 3445556666665544332222211 001 022332221110000000 01123588899999999 Q ss_pred HHHHhhcCCCceEeeCCC----chH----HHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcC--C-----CCCc Q lcl|NC_016654. 88 LSTTELFSEQLKFLDAGK----SKE----VQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDP--T-----IADN 152 (533) Q Consensus 88 ~~a~ll~~e~~~i~~~~~----~~~----~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~--~-----~~~~ 152 (533) ...++--...+.+.+.+. +.. ++..+..+.+.++.......+...+.+.|-+|+++++|- + .... T Consensus 75 ~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~d~~~~~~~ 154 (720) T protein:vir:35 75 RIISEYRHNRITVKFRPGDKTASEALANKLNGLFRADYEETDGGEACDNAFDDGSTGGFGCFRLTTNLVNALDPMDERQR 154 (720) T ss_pred HHHhHHHhCCCceEEEcCCCcchHHHHHHHHHHHHHHHHhcCchHHHhHHHHHhhhccceeEEeeecccccCCCCcccce Confidence 999998777777766543 222 344556677788999999999999999999999998751 1 1234 Q ss_pred eEEEEEc--CCeEEEEEe--cCCceEEE--EEEEE----------e-----------ec-----CCceEEEEEEEecCee Q lcl|NC_016654. 153 AWIDFVD--ADRAIPEFR--WGRLVAVT--FWSEL----------A-----------GG-----DGQEVWRHLERHESGY 200 (533) Q Consensus 153 ~~i~~v~--~~~~~P~~~--~g~~~~v~--f~~~~----------~-----------~~-----~~~~~y~~lE~h~~~~ 200 (533) ++|..+. ...++.-++ .-.+..+- |..++ . .. -.....++.|++..-. T Consensus 155 i~i~~v~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~d~~~~~~v~i~E~~~~~~ 234 (720) T protein:vir:35 155 ICLEPIYDPARSVWFDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMSGIERSWDYDWYDVDVVYIAKYYEVKK 234 (720) T ss_pred eeEecccCchhheeecccccccChhhhhhhhhhcCCCHHHHHHhCCCccccccccccccccccccCCCceEEEEeeEEEE Confidence 5555432 233332211 11222221 11111 0 00 0111245677664332 Q ss_pred EEEEEEeccCCcccceeehhh--ccc-ccccccc--------------c--cccCCceeecC--CCccceeEEecCCccc Q lcl|NC_016654. 201 IVHAVYKGTATSLGWMMALTD--HPA-TRDIAVE--------------G--ADEGRGAYVET--GVKDLTAAYVPNVTPN 259 (533) Q Consensus 201 I~~~~y~~~~~~lG~~v~l~~--~~~-~~~~~~~--------------~--~~~~~~~~~~~--g~~~~~~~~~pn~~~~ 259 (533) +.-.++...+...|..+.+.. .++ ...+... + ....+....+. ..+.-.+.|+|..... T Consensus 235 ~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g~r 314 (720) T protein:vir:35 235 ESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPGEHIPLIPVYGKR 314 (720) T ss_pred EEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCCCccceEEEEeee Confidence 221111111111121111110 000 0000000 0 00000000000 0111122333332211 Q ss_pred ccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCC--ccccccCcchhhhhhccccc Q lcl|NC_016654. 260 PEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGM--GQGVSLDEEQEVYSRVGSGG 337 (533) Q Consensus 260 ~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~--~~~~~~d~~~~~~~~~~~~~ 337 (533) ...+ .++..-+.+. .++|..+.+|...|.+.+.+-.++..+.. ...+.... .....++.+...|-.++... T Consensus 315 ~~~d----~~~~~~G~vr-~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~--~a~~~~~~~~~~~a~~~~~~~~~l~~~~~~ 387 (720) T protein:vir:35 315 WFID----DIERVEGHIA-KAMDAQRLYNLQVSMLADSATQDTGSIPI--VGKSQIKTLEKYWANRNKNRPAFLPLNEIV 387 (720) T ss_pred eccC----CCcccceeee-cchhHHHHHHHHHHHHHHHHHcCCccccc--cCcchHHHHHHHhhcccccccccccccccc Confidence 1111 1221112343 36788999999999999988544332221 11100000 00000111111111110000 Q ss_pred cccccc--cccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHH Q lcl|NC_016654. 338 FNANGD--METIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGS 415 (533) Q Consensus 338 ~~~~~~--~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~ 415 (533) ...|.- -...+...++.---..++..++.-...|-..+|++...+|-.+ + .||.+|..+..............+.. T Consensus 388 ~~~G~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~s-n-~SG~Ai~~rq~qg~~~~~~~~Dnl~~ 465 (720) T protein:vir:35 388 DKQGNIIAPPTPVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPMPS-N-IAKETVNHLMHRSDMSSFIYLDNMAK 465 (720) T ss_pred ccCcccccCCCcccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCccc-c-hHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 001100 0112233333222346678888888889999999999999654 4 58999999887777777777777888 Q ss_pred HHHHHHHHHHHHHHhhccCCC---------C-------------------------CCceeEEEEeCCCCCCCHHHHHHH Q lcl|NC_016654. 416 ALGPLSTTCLRVDAIKFPGKG---------A-------------------------APSEELELEWPKFARESDLAKAQT 461 (533) Q Consensus 416 al~~li~~il~l~~~~~~~~~---------~-------------------------~~~~~v~i~f~d~i~~d~~e~a~~ 461 (533) +.+.+.+.+|.+-...+.... . ...++|.|+=..+.+.-.++..+. T Consensus 466 ~~~~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~~ 545 (720) T protein:vir:35 466 SLKRAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDATVSV 545 (720) T ss_pred HHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCcccHHHHHHHH Confidence 888777777766443221100 0 011233333333333335566666 Q ss_pred HHHHHhCCCCCHHHH--------HHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCC-CC-CCC-CCCCCC Q lcl|NC_016654. 462 VQAWSVASAASTKTK--------VAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTE-ND-PAT-DPEAVD 530 (533) Q Consensus 462 ~~~l~~aGi~S~et~--------v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~-~~-~~~-~~~~~~ 530 (533) ++++.. .|++... +.++. ++ ..+++.+++|++.+...+. ..+...+. .. ..- +..... T Consensus 546 m~qll~--~~~p~~~~~~~~~~~ile~~-d~--p~~~e~~erirk~~~~~~~------~~~~~~e~qq~~a~~qq~~qq~ 614 (720) T protein:vir:35 546 LTNLLA--GMLPQDPMRQVLQGIILDNM-EG--EGLDEFKEYNRKQLLTQGV------VKPRNTEEEQMVAQMIQQAQQP 614 (720) T ss_pred HHHHHH--hcCCCchhHHHHHHHHHHhc-Cc--hhHHHHHHHHHhhcchhcc------cCccChhHHHHHHHHHHHHHhH Confidence 666542 3332211 12111 22 2244555666555422111 00000000 00 000 000000 Q ss_pred CCC Q lcl|NC_016654. 531 EGE 533 (533) Q Consensus 531 d~~ 533 (533) ..+ T Consensus 615 ~~e 617 (720) T protein:vir:35 615 NAE 617 (720) T ss_pred hHH Confidence 000 No 120 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=98.48 E-value=5e-07 Score=55.11 Aligned_cols=409 Identities=12% Similarity=0.042 Sum_probs=157.3 Q ss_pred CCCC--CCcCCC-------cCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCC Q lcl|NC_016654. 1 MSLP--EANTAW-------PPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATG 71 (533) Q Consensus 1 ~~~~--~~~~~~-------pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 71 (533) |.-| .--.|| .|.... ....+.....|-..+. + T Consensus 76 i~~pfkkk~~~~~~d~f~~s~es~s-~vtsls~pdaf~~vnV--s----------------------------------- 117 (945) T protein:vir:10 76 IIVPYNHQEPPFKFNLFEYSPESLM-YLPSISDPDAFFLINL--F----------------------------------- 117 (945) T ss_pred ccccccccccchhhhhhhccCccce-ecccccCccceeeehh--h----------------------------------- Confidence 4333 111111 111000 0000000001111100 0 Q ss_pred cccceeecChHHHHHHHHHHhhcCCCceEe--eCCCc-------hHHHHHHHHHHhh-c------cHHHH-HHHHHHHHh Q lcl|NC_016654. 72 RAPKRYHAPIPGVIAKLSTTELFSEQLKFL--DAGKS-------KEVQARADLIFNT-P------RFHSS-LVEAGESCS 134 (533) Q Consensus 72 ~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~--~~~~~-------~~~~~~l~~i~~~-n------~f~~~-~~~~~~~~~ 134 (533) .+..+...--...++.+|+-+.+=|..+- ..+.. ......+..+++. | .|++. ++..+...+ T Consensus 118 -~~~AlknsaV~scI~~IA~sIAsLPlklYrr~edG~~~~~~kk~~~~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~dLL 196 (945) T protein:vir:10 118 -RKYRFNNDSKLIKVSEIPKKLTSKELEIYKHIEDKHVNYYLKRIRDARNILEFLERPDPYFSEVNSWEYLLGMVLDDIL 196 (945) T ss_pred -hhhhhccHHHHHHHHHHHhhhccCceEEEEecccCcccccccccccchHHHHHHhCCCcccChhHHHHHHHHHHHHHHh Confidence 00011111122233333333333333221 00000 0001122333322 1 23333 334556778 Q ss_pred hhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEe-cCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcc Q lcl|NC_016654. 135 ALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFR-WGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSL 213 (533) Q Consensus 135 ~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~l 213 (533) ..|.+|+.+..|.+|. -+.+..++|.++.+..+ +|.+. +. |+ +.. ++.. T Consensus 197 L~GNAYieIiRd~~G~-ii~L~pLdPs~Vti~~ddDG~~~----y~----------Yv--------------~~i-dG~~ 246 (945) T protein:vir:10 197 TIDRGAIVKIRDEQGN-LVAITPVDGTTIKPILSEDTGIV----VG----------YV--------------QEV-DGAI 246 (945) T ss_pred hcCCeEEEEEECCCCc-EEEEEEECCcceEEEEcCCCcEE----EE----------EE--------------Eec-CCce Confidence 8899999988876553 24577788888877643 23220 00 00 000 0000 Q ss_pred cceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHH Q lcl|NC_016654. 214 GWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSS 293 (533) Q Consensus 214 G~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~ 293 (533) ...++- .--++|+.+...+. ...++|.|.+..+. ..| .++..... T Consensus 247 ~~~v~a---------------------------~DvIlhirn~s~DG------~~~GyGlSPIeaa~-~aI-~~alAaek 291 (945) T protein:vir:10 247 VAHFDK---------------------------RDVVLFRQNLTPDV------YMYGYSLPPIEILY-KVI-LSDIFIDK 291 (945) T ss_pred EEEecC---------------------------CceEEEeccCCCCc------ccccCCchHHHHHH-HHH-HHHHHHHH Confidence 000000 00122332221111 11346777766543 333 22333444 Q ss_pred HHHH-HH-hC-cce--eeechHHhcCCCCccccc-cCc---chhhhhhcccccccccc----ccccceeeechhhhhHHH Q lcl|NC_016654. 294 LMRD-FR-IG-AGK--VHASESVLTNLGMGQGVS-LDE---EQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEH 360 (533) Q Consensus 294 ~~~~-~~-~~-~~~--i~v~~~~l~~~~~~~~~~-~d~---~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~ 360 (533) +... |. .| .+. |.++...... ....+.. .+. ..+.+.....+ .+.++ +....++.++......++ T Consensus 292 ~aar~FskNGa~PsGILsvkg~~~~d-~k~~~~LseEq~erlKe~wee~~sG-~NnG~piVLdeGmef~pLs~s~~DaQf 369 (945) T protein:vir:10 292 GNLDYYRKGGSIPEGILAIEPPSYKE-GDIYPQLSREQLESIQRQLQAIMMG-DYTQVPILSGGKFTWIDFKGKRRDMQF 369 (945) T ss_pred HHHHHHHhCCCccceEEEecCccccc-cccccccCHHHHHHHHHHHHHHhCC-cccccceecCCCceEEEccCChhHHHH Confidence 4333 33 33 222 2222111100 0011110 010 11112221111 11111 122345666666677788 Q ss_pred HHHHHHHHHHHHHhhCCChhhcccCCCcch-hHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCC Q lcl|NC_016654. 361 DQGAALLLREVLRKTGYSPVSLGLSDEVAQ-TATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAP 439 (533) Q Consensus 361 ~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~-Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~ 439 (533) .+..+...++|+...|+||..+|+..+... ++.+. ....++.+|..++..+....+..+. .... T Consensus 370 LEsrkfs~eeIArAFGVPP~lLG~~e~st~SNiEqq-------------~~~Fv~~tL~Pil~~IEqeLNrkLl--~~~e 434 (945) T protein:vir:10 370 KELAEFVARKICAVYQVSPQDVGILEGSNKATAEVM-------------ASLTKAKGLEPLMATISKGFDEVVS--EFRN 434 (945) T ss_pred HHHHHHHHHHHHHHhCCCHHHcccCCCCCcchHHHH-------------HHHHHHHHHHHHHHHHHHHHHHhcc--cccc Confidence 898988899999999999999987554322 22222 1122233333333333222222221 1112 Q ss_pred ceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHH-HHHHHHHHHH----------HhhhcccCccc Q lcl|NC_016654. 440 SEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDE-RVQEEADLID----------NANTVSAPTFG 508 (533) Q Consensus 440 ~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~de-e~~~El~rI~----------~E~~~~~~~~~ 508 (533) ...+.++|+.....+..+.++.+.+++++|+|+..++.+++ ++..- .-+.-+-... ..+++..+... T Consensus 435 g~~i~fdFd~ldl~D~ksraEal~kli~sGiLTiNEvRe~l--GLpPIeGGD~lli~~nn~~P~d~~~ka~~ga~p~q~a 512 (945) T protein:vir:10 435 EKDIKLWFKEDDLEKERDWWNIIQGQLNTGFRSINEARMEK--GLEPVPWGDVPFSGLRNWKPEDEQAKAQQGAMPPQLA 512 (945) T ss_pred CceeEEEecchhccCHHHHHHHHHHHHhCCCcCHHHHHHHh--CCCCCCCcceeeeccccccccccccccccCCCCcccc Confidence 35678899888888999999999999999999999977654 12110 0000000000 00111111111 Q ss_pred cc-cccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 509 FG-TDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 509 ~~-~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) .. .+++ ...+++++.+.+...+.. T Consensus 513 q~~~dqp-~~kGGe~dEns~~psE~k 537 (945) T protein:vir:10 513 QAMADQP-SQQGGGVDENSSVPSEQK 537 (945) T ss_pred cCCCCCC-CCCCCCCCCCCCCCCccc Confidence 11 1111 111111111111111111 No 121 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=98.46 E-value=5.7e-07 Score=54.80 Aligned_cols=475 Identities=11% Similarity=0.048 Sum_probs=194.6 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |.=+. ...+...+.++...|..|...-.++.+|.-.. ..++.....+..+.+..++.-+ T Consensus 1 m~~~~------~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~---------------~~~~~~~~~~~~~~~~~~~~ds 59 (559) T protein:vir:95 1 MAETT------KERLNKQFAQLESERQSFEPHWRELSDYINPR---------------GSRFLTSEVNRNDRRNTRIIDS 59 (559) T ss_pred CChhh------HHHHHHHHHHHHHHhhHHHHHHHHHHHHhccc---------------cCCcCCCCCCcccccccccccc Confidence 43332 22455556666666665554444443332110 1111222222333345566777 Q ss_pred hHHHHHHHHHHhhcCC-----CceEee--CCC----chHHHHH-------HHHHHhhccHHHHHHHHHHHHhhhCCEEEE Q lcl|NC_016654. 81 IPGVIAKLSTTELFSE-----QLKFLD--AGK----SKEVQAR-------ADLIFNTPRFHSSLVEAGESCSALSGSFQR 142 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e-----~~~i~~--~~~----~~~~~~~-------l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~ 142 (533) -+...|+.+|+.|.+- .+.|.. .+. ...++++ +.+.+..++|...+.++.....++|.+.+. T Consensus 60 t~~~a~~~Las~l~~~ltpp~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~ 139 (559) T protein:vir:95 60 TGTMAARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMA 139 (559) T ss_pred hHHHHHHHHHHHHHHhhcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeE Confidence 7888888888766553 233333 221 1233344 445788889999999999999999999876 Q ss_pred EEEcCCCCCceEEEEEcCCeEEEEEe-cCCceEEEEEEEEeec--------CC-----ceEEEEEEEecCeeEEEEEEec Q lcl|NC_016654. 143 IVWDPTIADNAWIDFVDADRAIPEFR-WGRLVAVTFWSELAGG--------DG-----QEVWRHLERHESGYIVHAVYKG 208 (533) Q Consensus 143 ~~~D~~~~~~~~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~~--------~~-----~~~y~~lE~h~~~~I~~~~y~~ 208 (533) + +++....+++..++...++-.-+ +|++..|+.-.+++.. +. +..+..-..+....|.|.+|-. T Consensus 140 ~--~~d~~~~~r~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr 217 (559) T protein:vir:95 140 V--LDDDEDIIRTMPFPIGSYYLANSPRGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPN 217 (559) T ss_pred e--ecCCCceeEEEEeecCeEEEeeCCCCCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEEEEEecc Confidence 5 44444567888899888887755 4676554422121110 00 0001000001123344444422 Q ss_pred cC---Cccc-ceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccc-hhhhhHHHH Q lcl|NC_016654. 209 TA---TSLG-WMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRA-DLSTDLFPT 283 (533) Q Consensus 209 ~~---~~lG-~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S-~~~~~i~~l 283 (533) .+ +.++ +..|+..+.. +....+....-+.|.. -+.|++. +|.. ...+.||+| .-..++ +- T Consensus 218 ~~~~~~~~~~~~~pf~s~~~------e~~~~~~~~l~esg~~--e~P~~~~-----Rw~~-~~ge~YGrg~P~~~al-~d 282 (559) T protein:vir:95 218 IDRDTSKLDSKNKPFKSVYY------EVGGDNDKLLRESGFD--EFPIMAP-----RWEV-NGEDVYGSSCPGMLAL-GP 282 (559) T ss_pred ccccccccccccceEEEEEE------EecCCCceeeecCCcc--cCCccce-----eeee-cCCccccccchHHHhh-HH Confidence 11 1111 1122222110 0000000000112221 1122222 2432 234788998 355555 56 Q ss_pred HHHHHHHHHHHHHHHH-hCcceeeechHHhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhh-HHHH Q lcl|NC_016654. 284 FHELDRIYSSLMRDFR-IGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRV-LEHD 361 (533) Q Consensus 284 id~lD~~~s~~~~~~~-~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~-e~~~ 361 (533) +..|+..--......+ ..++.+.||.+... ......+....|. ... .+.+.-......++++.. .+-+ T Consensus 283 ~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~-----~~~~l~pgg~~~~----~~~-~~~~~i~p~~~~~~~~~~~~~~i 352 (559) T protein:vir:95 283 VKALQLLQKRKSQLIDKATNPPMVAPTSLKN-----QRASLLPGDITYI----DQI-TGQDGFRPAYLVNPSTADLVADI 352 (559) T ss_pred HHHHHHHHHHHHHHHHHHhcCceeccccccc-----cceeeeccceeee----CCC-CCcccceeecccccchHHHHHHH Confidence 6788777666665553 44555666555321 0000111111111 001 111111111223344321 1212 Q ss_pred HHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhccCCCC-CC Q lcl|NC_016654. 362 QGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAK-ARHFGSALGPLSTTCLRVDAIKFPGKGA-AP 439 (533) Q Consensus 362 ~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~-~~~~~~al~~li~~il~l~~~~~~~~~~-~~ 439 (533) +.+..-++...+...+ ..+....+...|||||..+.+......+-. .+.-...|.-+|..++.+.... |..+ .+ T Consensus 353 ~~~~~rI~~af~~d~~--~~l~~r~~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~--g~lP~~p 428 (559) T protein:vir:95 353 QDTRQIINSAYFVDLF--MMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRK--NMLPPPP 428 (559) T ss_pred HHHHHHHHHHhhhhhH--HHhhcCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc--CCCCCCc Confidence 3333333333332211 112223344569999999888777665553 3333445666666666554321 2111 11 Q ss_pred ----ceeEEEEeCCCCCCCH-H-------HHHHHHHHHHhCC-----CCCHHHHHHHh---CC-----CCCHHHHHHHHH Q lcl|NC_016654. 440 ----SEELELEWPKFARESD-L-------AKAQTVQAWSVAS-----AASTKTKVAYL---HE-----DWDDERVQEEAD 494 (533) Q Consensus 440 ----~~~v~i~f~d~i~~d~-~-------e~a~~~~~l~~aG-----i~S~et~v~~l---~~-----~~~dee~~~El~ 494 (533) ...++|++--.+..-. . ..++.+..+-+++ .+....+++.+ .+ -.+++|+++.-+ T Consensus 429 ~~l~~~~i~v~~is~La~aqk~~~~~~i~~~~~~~~~laq~~Pevld~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~rq 508 (559) T protein:vir:95 429 DVMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQVKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVEQARQ 508 (559) T ss_pred ccccCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhhhcCCHHHHHHHHHHHhCCchhhcCCHHHHHHHHH Confidence 1245566543332210 0 1111122211111 13334444332 11 024444443221 Q ss_pred HHHHhhh-----cccCcc-ccccccCCCCCCC------------CCCCCCC Q lcl|NC_016654. 495 LIDNANT-----VSAPTF-GFGTDQPPLPTEN------------DPATDPE 527 (533) Q Consensus 495 rI~~E~~-----~~~~~~-~~~~~~~~~~~~~------------~~~~~~~ 527 (533) +-++.++ +....+ +......+....+ ..+++.+ T Consensus 509 qr~~~qq~~q~~~~~~~aa~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 559 (559) T protein:vir:95 509 QRAQQQQQQQMMAMGMAAAQGVKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhccccccCCChhHHHHHHHhhcCccccCC Confidence 1111111 000000 1011111111000 1111111 No 122 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=98.43 E-value=7.1e-07 Score=54.31 Aligned_cols=480 Identities=12% Similarity=0.056 Sum_probs=196.3 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |.-+. +...+...+..+...|..|..--.++.+|.... ..+++.........+..++.-. T Consensus 1 M~~~~-----~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~---------------~~~~~~~~~~~~~~~~~~~~ds 60 (555) T protein:vir:10 1 MAEQT-----ERKLLLSRWGQLRTERESWMSHWKEISDYLLPR---------------AGRFFVQDRNRGEKRHNNILDN 60 (555) T ss_pred CCCcc-----cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcc---------------cccccCCCCCcchhcccccccc Confidence 66554 446677788888888887766555554443211 1112222222222334566777 Q ss_pred hHHHHHHHHHHhhcCC-----CceEeeCCC------chHHHHH-------HHHHHhhccHHHHHHHHHHHHhhhCCEEEE Q lcl|NC_016654. 81 IPGVIAKLSTTELFSE-----QLKFLDAGK------SKEVQAR-------ADLIFNTPRFHSSLVEAGESCSALSGSFQR 142 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e-----~~~i~~~~~------~~~~~~~-------l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~ 142 (533) -+...++.+|+.|.+- .+.|..... ...++++ +.+.+..++|...+.++.....+.|.+.+. T Consensus 61 t~~~a~~~LAa~L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~ 140 (555) T protein:vir:10 61 TGTRALRVLAAGMMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSI 140 (555) T ss_pred cHHHHHHHHHHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEE Confidence 7888888888776553 233333221 1223333 445788899999999999999999999975 Q ss_pred EEEcCCCCCceEEEEEcCCeEEEEEe-cCCceEEEEEEEEeec--------CC-----ceEEEEEEEecCeeEEEEEEec Q lcl|NC_016654. 143 IVWDPTIADNAWIDFVDADRAIPEFR-WGRLVAVTFWSELAGG--------DG-----QEVWRHLERHESGYIVHAVYKG 208 (533) Q Consensus 143 ~~~D~~~~~~~~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~~--------~~-----~~~y~~lE~h~~~~I~~~~y~~ 208 (533) + +++....+++..++...++..-+ +|++..|....+++.. +. +..|..-.....-.|.|.+|-. T Consensus 141 ~--~~d~~~~~rf~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr 218 (555) T protein:vir:10 141 V--LPDFDAVVYHHSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPR 218 (555) T ss_pred E--ecCCCceEEEEEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeec Confidence 4 44445668888888888877655 4676444322111100 00 0000000000112333444422 Q ss_pred cC---Cccc-ceeehhhccccccccccccccCCceeecCCC-ccceeEEecCCcccccccccccccccccchhhhhHHHH Q lcl|NC_016654. 209 TA---TSLG-WMMALTDHPATRDIAVEGADEGRGAYVETGV-KDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPT 283 (533) Q Consensus 209 ~~---~~lG-~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~-~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~l 283 (533) .+ ...+ +-.|+..+....... +...--+.|. ..|++.. +|... ..+.||+|-...++ +- T Consensus 219 ~~~~~~~~~~~~~p~~s~~~~~~~d------~~~vl~esgy~e~P~i~~--------Rw~~~-~ge~YGrgp~~~~l-gD 282 (555) T protein:vir:10 219 ADRDPSKRDDRNMAWKSVYFEPGAD------ETRTLRESGYRSFRALCP--------RWALV-GGDIYGNSPAMEAL-GD 282 (555) T ss_pred cCcCcCCCCccccceEEEEEEeccC------CccccccCCcccCCceee--------eeeec-CCCccccchHHHHH-HH Confidence 11 1111 112222221100000 0000011221 1222222 23322 24778999877666 55 Q ss_pred HHHHHHHHHHHHHHHH-hCcceeeechHHhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHH Q lcl|NC_016654. 284 FHELDRIYSSLMRDFR-IGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQ 362 (533) Q Consensus 284 id~lD~~~s~~~~~~~-~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~ 362 (533) +..|+..--....... ..++...||++.. .......+...-|.. ....+......++ .++++. .-.+ T Consensus 283 ~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~-----~~~~~~~pgg~~~v~----~g~~~d~~~~~~~-~~~d~~--~~~~ 350 (555) T protein:vir:10 283 VRQLQHEQLRKAQAIDYKSNPPLQLPVSAK-----NQDISTVPGGLSYVD----AAAPNGGIRTAFE-VNLDLS--HLLA 350 (555) T ss_pred HHHHHHHHHHHHHHHHHHhcCceeeccccc-----cccceeccccccccc----cCCCCcceecccc-cccchH--HHHH Confidence 6777775444444443 3444555554432 111111111111110 0011111111111 122322 2223 Q ss_pred HHHHHHHHHHHhhCCCh--hhcccCCCcchhHHHHHHHhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhccCCCCCC Q lcl|NC_016654. 363 GAALLLREVLRKTGYSP--VSLGLSDEVAQTATEASGKKDLTVKTTRAK-ARHFGSALGPLSTTCLRVDAIKFPGKGAAP 439 (533) Q Consensus 363 ~l~~~l~~i~~~~g~s~--~~~g~~~~~~~Tatai~~~~~~l~~~~~~~-~~~~~~al~~li~~il~l~~~~~~~~~~~~ 439 (533) .|+.+-..|.... +.. ..++...+...|||||..+.+......+-. .+.-...|.-++..++.+... .|..+.. T Consensus 351 ~i~~~~~rI~~af-~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r--~g~lP~~ 427 (555) T protein:vir:10 351 DIVDVRERIKASF-YADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVE--ANILPPP 427 (555) T ss_pred HHHHHHHHHHHHh-hcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh--cCCCCCC Confidence 3333333332222 111 112223445679999988877766666553 333344555566555555331 1111111 Q ss_pred -----ceeEEEEeCCCCCCCH-HH-------HHHHHHHHHhCC-----CCCHHHHHHHh---CC-----CCCHHHHHHHH Q lcl|NC_016654. 440 -----SEELELEWPKFARESD-LA-------KAQTVQAWSVAS-----AASTKTKVAYL---HE-----DWDDERVQEEA 493 (533) Q Consensus 440 -----~~~v~i~f~d~i~~d~-~e-------~a~~~~~l~~aG-----i~S~et~v~~l---~~-----~~~dee~~~El 493 (533) ...++|++--.+.... .+ .++.+..+-+.+ .+....+++.+ .+ --+++|+++.. T Consensus 428 P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r 507 (555) T protein:vir:10 428 PQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIR 507 (555) T ss_pred chhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHH Confidence 2235555544433211 11 111111111111 12233333322 11 02445554432 Q ss_pred HH-HHHhhhcccC-ccccccccCCCCCCCCCC----CCCCCCCCCC Q lcl|NC_016654. 494 DL-IDNANTVSAP-TFGFGTDQPPLPTENDPA----TDPEAVDEGE 533 (533) Q Consensus 494 ~r-I~~E~~~~~~-~~~~~~~~~~~~~~~~~~----~~~~~~~d~~ 533 (533) ++ .++++++... ...++.+......+.+.. -..-...+|- T Consensus 508 ~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 553 (555) T protein:vir:10 508 KQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNALTDVTRAFSG 553 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchhHHHHHhhhcc Confidence 21 1111111111 011111001100000000 0001111111 No 123 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=98.43 E-value=7.1e-07 Score=54.31 Aligned_cols=480 Identities=12% Similarity=0.056 Sum_probs=196.3 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |.-+. +...+...+..+...|..|..--.++.+|.... ..+++.........+..++.-. T Consensus 1 M~~~~-----~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~---------------~~~~~~~~~~~~~~~~~~~~ds 60 (555) T protein:vir:98 1 MAEQT-----ERKLLLSRWGQLRTERESWMSHWKEISDYLLPR---------------AGRFFVQDRNRGEKRHNNILDN 60 (555) T ss_pred CCCcc-----cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcc---------------cccccCCCCCcchhcccccccc Confidence 66554 446677788888888887766555554443211 1112222222222334566777 Q ss_pred hHHHHHHHHHHhhcCC-----CceEeeCCC------chHHHHH-------HHHHHhhccHHHHHHHHHHHHhhhCCEEEE Q lcl|NC_016654. 81 IPGVIAKLSTTELFSE-----QLKFLDAGK------SKEVQAR-------ADLIFNTPRFHSSLVEAGESCSALSGSFQR 142 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e-----~~~i~~~~~------~~~~~~~-------l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~ 142 (533) -+...++.+|+.|.+- .+.|..... ...++++ +.+.+..++|...+.++.....+.|.+.+. T Consensus 61 t~~~a~~~LAa~L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~ 140 (555) T protein:vir:98 61 TGTRALRVLAAGMMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSI 140 (555) T ss_pred cHHHHHHHHHHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEE Confidence 7888888888776553 233333221 1223333 445788899999999999999999999975 Q ss_pred EEEcCCCCCceEEEEEcCCeEEEEEe-cCCceEEEEEEEEeec--------CC-----ceEEEEEEEecCeeEEEEEEec Q lcl|NC_016654. 143 IVWDPTIADNAWIDFVDADRAIPEFR-WGRLVAVTFWSELAGG--------DG-----QEVWRHLERHESGYIVHAVYKG 208 (533) Q Consensus 143 ~~~D~~~~~~~~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~~--------~~-----~~~y~~lE~h~~~~I~~~~y~~ 208 (533) + +++....+++..++...++..-+ +|++..|....+++.. +. +..|..-.....-.|.|.+|-. T Consensus 141 ~--~~d~~~~~rf~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr 218 (555) T protein:vir:98 141 V--LPDFDAVVYHHSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPR 218 (555) T ss_pred E--ecCCCceEEEEEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeec Confidence 4 44445668888888888877655 4676444322111100 00 0000000000112333444422 Q ss_pred cC---Cccc-ceeehhhccccccccccccccCCceeecCCC-ccceeEEecCCcccccccccccccccccchhhhhHHHH Q lcl|NC_016654. 209 TA---TSLG-WMMALTDHPATRDIAVEGADEGRGAYVETGV-KDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPT 283 (533) Q Consensus 209 ~~---~~lG-~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~-~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~l 283 (533) .+ ...+ +-.|+..+....... +...--+.|. ..|++.. +|... ..+.||+|-...++ +- T Consensus 219 ~~~~~~~~~~~~~p~~s~~~~~~~d------~~~vl~esgy~e~P~i~~--------Rw~~~-~ge~YGrgp~~~~l-gD 282 (555) T protein:vir:98 219 ADRDPSKRDDRNMAWKSVYFEPGAD------ETRTLRESGYRSFRALCP--------RWALV-GGDIYGNSPAMEAL-GD 282 (555) T ss_pred cCcCcCCCCccccceEEEEEEeccC------CccccccCCcccCCceee--------eeeec-CCCccccchHHHHH-HH Confidence 11 1111 112222221100000 0000011221 1222222 23322 24778999877666 55 Q ss_pred HHHHHHHHHHHHHHHH-hCcceeeechHHhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHH Q lcl|NC_016654. 284 FHELDRIYSSLMRDFR-IGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQ 362 (533) Q Consensus 284 id~lD~~~s~~~~~~~-~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~ 362 (533) +..|+..--....... ..++...||++.. .......+...-|.. ....+......++ .++++. .-.+ T Consensus 283 ~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~-----~~~~~~~pgg~~~v~----~g~~~d~~~~~~~-~~~d~~--~~~~ 350 (555) T protein:vir:98 283 VRQLQHEQLRKAQAIDYKSNPPLQLPVSAK-----NQDISTVPGGLSYVD----AAAPNGGIRTAFE-VNLDLS--HLLA 350 (555) T ss_pred HHHHHHHHHHHHHHHHHHhcCceeeccccc-----cccceeccccccccc----cCCCCcceecccc-cccchH--HHHH Confidence 6777775444444443 3444555554432 111111111111110 0011111111111 122322 2223 Q ss_pred HHHHHHHHHHHhhCCCh--hhcccCCCcchhHHHHHHHhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhccCCCCCC Q lcl|NC_016654. 363 GAALLLREVLRKTGYSP--VSLGLSDEVAQTATEASGKKDLTVKTTRAK-ARHFGSALGPLSTTCLRVDAIKFPGKGAAP 439 (533) Q Consensus 363 ~l~~~l~~i~~~~g~s~--~~~g~~~~~~~Tatai~~~~~~l~~~~~~~-~~~~~~al~~li~~il~l~~~~~~~~~~~~ 439 (533) .|+.+-..|.... +.. ..++...+...|||||..+.+......+-. .+.-...|.-++..++.+... .|..+.. T Consensus 351 ~i~~~~~rI~~af-~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r--~g~lP~~ 427 (555) T protein:vir:98 351 DIVDVRERIKASF-YADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVE--ANILPPP 427 (555) T ss_pred HHHHHHHHHHHHh-hcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh--cCCCCCC Confidence 3333333332222 111 112223445679999988877766666553 333344555566555555331 1111111 Q ss_pred -----ceeEEEEeCCCCCCCH-HH-------HHHHHHHHHhCC-----CCCHHHHHHHh---CC-----CCCHHHHHHHH Q lcl|NC_016654. 440 -----SEELELEWPKFARESD-LA-------KAQTVQAWSVAS-----AASTKTKVAYL---HE-----DWDDERVQEEA 493 (533) Q Consensus 440 -----~~~v~i~f~d~i~~d~-~e-------~a~~~~~l~~aG-----i~S~et~v~~l---~~-----~~~dee~~~El 493 (533) ...++|++--.+.... .+ .++.+..+-+.+ .+....+++.+ .+ --+++|+++.. T Consensus 428 P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r 507 (555) T protein:vir:98 428 PQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIR 507 (555) T ss_pred chhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHH Confidence 2235555544433211 11 111111111111 12233333322 11 02445554432 Q ss_pred HH-HHHhhhcccC-ccccccccCCCCCCCCCC----CCCCCCCCCC Q lcl|NC_016654. 494 DL-IDNANTVSAP-TFGFGTDQPPLPTENDPA----TDPEAVDEGE 533 (533) Q Consensus 494 ~r-I~~E~~~~~~-~~~~~~~~~~~~~~~~~~----~~~~~~~d~~ 533 (533) ++ .++++++... ...++.+......+.+.. -..-...+|- T Consensus 508 ~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 553 (555) T protein:vir:98 508 KQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNALTDVTRAFSG 553 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchhHHHHHhhhcc Confidence 21 1111111111 011111001100000000 0001111111 No 124 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=98.43 E-value=7.1e-07 Score=54.31 Aligned_cols=480 Identities=12% Similarity=0.056 Sum_probs=196.3 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |.-+. +...+...+..+...|..|..--.++.+|.... ..+++.........+..++.-. T Consensus 1 M~~~~-----~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~---------------~~~~~~~~~~~~~~~~~~~~ds 60 (555) T protein:vir:10 1 MAEQT-----ERKLLLSRWGQLRTERESWMSHWKEISDYLLPR---------------AGRFFVQDRNRGEKRHNNILDN 60 (555) T ss_pred CCCcc-----cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcc---------------cccccCCCCCcchhcccccccc Confidence 66554 446677788888888887766555554443211 1112222222222334566777 Q ss_pred hHHHHHHHHHHhhcCC-----CceEeeCCC------chHHHHH-------HHHHHhhccHHHHHHHHHHHHhhhCCEEEE Q lcl|NC_016654. 81 IPGVIAKLSTTELFSE-----QLKFLDAGK------SKEVQAR-------ADLIFNTPRFHSSLVEAGESCSALSGSFQR 142 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e-----~~~i~~~~~------~~~~~~~-------l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~ 142 (533) -+...++.+|+.|.+- .+.|..... ...++++ +.+.+..++|...+.++.....+.|.+.+. T Consensus 61 t~~~a~~~LAa~L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~ 140 (555) T protein:vir:10 61 TGTRALRVLAAGMMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSI 140 (555) T ss_pred cHHHHHHHHHHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEE Confidence 7888888888776553 233333221 1223333 445788899999999999999999999975 Q ss_pred EEEcCCCCCceEEEEEcCCeEEEEEe-cCCceEEEEEEEEeec--------CC-----ceEEEEEEEecCeeEEEEEEec Q lcl|NC_016654. 143 IVWDPTIADNAWIDFVDADRAIPEFR-WGRLVAVTFWSELAGG--------DG-----QEVWRHLERHESGYIVHAVYKG 208 (533) Q Consensus 143 ~~~D~~~~~~~~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~~--------~~-----~~~y~~lE~h~~~~I~~~~y~~ 208 (533) + +++....+++..++...++..-+ +|++..|....+++.. +. +..|..-.....-.|.|.+|-. T Consensus 141 ~--~~d~~~~~rf~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr 218 (555) T protein:vir:10 141 V--LPDFDAVVYHHSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPR 218 (555) T ss_pred E--ecCCCceEEEEEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeec Confidence 4 44445668888888888877655 4676444322111100 00 0000000000112333444422 Q ss_pred cC---Cccc-ceeehhhccccccccccccccCCceeecCCC-ccceeEEecCCcccccccccccccccccchhhhhHHHH Q lcl|NC_016654. 209 TA---TSLG-WMMALTDHPATRDIAVEGADEGRGAYVETGV-KDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPT 283 (533) Q Consensus 209 ~~---~~lG-~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~-~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~l 283 (533) .+ ...+ +-.|+..+....... +...--+.|. ..|++.. +|... ..+.||+|-...++ +- T Consensus 219 ~~~~~~~~~~~~~p~~s~~~~~~~d------~~~vl~esgy~e~P~i~~--------Rw~~~-~ge~YGrgp~~~~l-gD 282 (555) T protein:vir:10 219 ADRDPSKRDDRNMAWKSVYFEPGAD------ETRTLRESGYRSFRALCP--------RWALV-GGDIYGNSPAMEAL-GD 282 (555) T ss_pred cCcCcCCCCccccceEEEEEEeccC------CccccccCCcccCCceee--------eeeec-CCCccccchHHHHH-HH Confidence 11 1111 112222221100000 0000011221 1222222 23322 24778999877666 55 Q ss_pred HHHHHHHHHHHHHHHH-hCcceeeechHHhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhhHHHHH Q lcl|NC_016654. 284 FHELDRIYSSLMRDFR-IGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQ 362 (533) Q Consensus 284 id~lD~~~s~~~~~~~-~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~ 362 (533) +..|+..--....... ..++...||++.. .......+...-|.. ....+......++ .++++. .-.+ T Consensus 283 ~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~-----~~~~~~~pgg~~~v~----~g~~~d~~~~~~~-~~~d~~--~~~~ 350 (555) T protein:vir:10 283 VRQLQHEQLRKAQAIDYKSNPPLQLPVSAK-----NQDISTVPGGLSYVD----AAAPNGGIRTAFE-VNLDLS--HLLA 350 (555) T ss_pred HHHHHHHHHHHHHHHHHHhcCceeeccccc-----cccceeccccccccc----cCCCCcceecccc-cccchH--HHHH Confidence 6777775444444443 3444555554432 111111111111110 0011111111111 122322 2223 Q ss_pred HHHHHHHHHHHhhCCCh--hhcccCCCcchhHHHHHHHhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhccCCCCCC Q lcl|NC_016654. 363 GAALLLREVLRKTGYSP--VSLGLSDEVAQTATEASGKKDLTVKTTRAK-ARHFGSALGPLSTTCLRVDAIKFPGKGAAP 439 (533) Q Consensus 363 ~l~~~l~~i~~~~g~s~--~~~g~~~~~~~Tatai~~~~~~l~~~~~~~-~~~~~~al~~li~~il~l~~~~~~~~~~~~ 439 (533) .|+.+-..|.... +.. ..++...+...|||||..+.+......+-. .+.-...|.-++..++.+... .|..+.. T Consensus 351 ~i~~~~~rI~~af-~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r--~g~lP~~ 427 (555) T protein:vir:10 351 DIVDVRERIKASF-YADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVE--ANILPPP 427 (555) T ss_pred HHHHHHHHHHHHh-hcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh--cCCCCCC Confidence 3333333332222 111 112223445679999988877766666553 333344555566555555331 1111111 Q ss_pred -----ceeEEEEeCCCCCCCH-HH-------HHHHHHHHHhCC-----CCCHHHHHHHh---CC-----CCCHHHHHHHH Q lcl|NC_016654. 440 -----SEELELEWPKFARESD-LA-------KAQTVQAWSVAS-----AASTKTKVAYL---HE-----DWDDERVQEEA 493 (533) Q Consensus 440 -----~~~v~i~f~d~i~~d~-~e-------~a~~~~~l~~aG-----i~S~et~v~~l---~~-----~~~dee~~~El 493 (533) ...++|++--.+.... .+ .++.+..+-+.+ .+....+++.+ .+ --+++|+++.. T Consensus 428 P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r 507 (555) T protein:vir:10 428 PQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIR 507 (555) T ss_pred chhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHH Confidence 2235555544433211 11 111111111111 12233333322 11 02445554432 Q ss_pred HH-HHHhhhcccC-ccccccccCCCCCCCCCC----CCCCCCCCCC Q lcl|NC_016654. 494 DL-IDNANTVSAP-TFGFGTDQPPLPTENDPA----TDPEAVDEGE 533 (533) Q Consensus 494 ~r-I~~E~~~~~~-~~~~~~~~~~~~~~~~~~----~~~~~~~d~~ 533 (533) ++ .++++++... ...++.+......+.+.. -..-...+|- T Consensus 508 ~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 553 (555) T protein:vir:10 508 KQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNALTDVTRAFSG 553 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchhHHHHHhhhcc Confidence 21 1111111111 011111001100000000 0001111111 No 125 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=98.40 E-value=8.2e-07 Score=53.95 Aligned_cols=413 Identities=11% Similarity=0.037 Sum_probs=171.2 Q ss_pred cCCHHHHHHHHhccCcchhhH-HHHHHHHHHHHHhcccCCCCCc-ccceeecChHHHHHHHHHHhhcCCCceEeeCCCc- Q lcl|NC_016654. 30 EGDLDKLATFYGAEGRTSPSG-IKARTKAAYEAFHGRTPTATGR-APKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKS- 106 (533) Q Consensus 30 ~gd~~~l~~~y~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~g~-~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~- 106 (533) +|=.+++..++....+..... .--........|++..+....- ...-+...--..+++.+|+-+-+-|..+--..++ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~ 80 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYG 80 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 444455554442111110000 0000111223333332211110 0111222222334555555555545443211111 Q ss_pred --hHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEE Q lcl|NC_016654. 107 --KEVQARADLIFNT--P---RFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWS 179 (533) Q Consensus 107 --~~~~~~l~~i~~~--n---~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~ 179 (533) ......+..+|+. | .....+...+...+..|.+|+.+..|..+. -+.+..++|+++.+..+...+.. T Consensus 81 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~-~~~L~~i~~~~v~v~~d~~~~~~----- 154 (432) T protein:vir:10 81 IQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGK-VQALWPIDASKVTVYIDDVGLLN----- 154 (432) T ss_pred eeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc-EEEEEEEcCceeEEEEcCccccc----- Confidence 1112234444432 1 123344555666677899999998887653 34666778887776543321100 Q ss_pred EEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCccc Q lcl|NC_016654. 180 ELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPN 259 (533) Q Consensus 180 ~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~ 259 (533) ..+.+ +..+... |..+.+ +.--+.|+.+..+. T Consensus 155 -----------------~~~~~-~y~~~~~----g~~~~~--------------------------~~~eiih~r~~~~~ 186 (432) T protein:vir:10 155 -----------------SKTKM-WYVVNTG----GQQRVL--------------------------KPEEILHFKNGITL 186 (432) T ss_pred -----------------ccceE-EEEEecC----CeEEEE--------------------------ccccEEEecCCCCC Confidence 00000 0011110 111100 00012333321111 Q ss_pred ccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhC-cceeeechHHhcCCCCccccccCcchhhhhhccccc Q lcl|NC_016654. 260 PEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIG-AGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGG 337 (533) Q Consensus 260 ~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~-~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~ 337 (533) ....|.|.+..+. ..|+. ....+++... |+.| ...-++ ...+.-...........+.....+. T Consensus 187 --------~~~~G~s~~~~~~-~~i~~-~~~~~~~~~~~~~ng~~p~gil-----~~~~~l~~e~~~~~~~~~~~~~~g~ 251 (432) T protein:vir:10 187 --------DGLVGVPTMEYLK-STLEN-SASADKFINNFYKQGLQVKGLV-----QYVGDLNEDAKKVFRENFESMSSGL 251 (432) T ss_pred --------CCcccccHHHHHH-HHHHH-HHHHHHHHHHHHhccCCccEEE-----EcCCCCCHHHHHHHHHHHHHHhccc Confidence 1235777776543 44533 3334444433 3544 333222 1111100000001111121111111 Q ss_pred ccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcch-hHHHHHHHhhhHHHHHHHHHHH Q lcl|NC_016654. 338 FNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQ-TATEASGKKDLTVKTTRAKARH 412 (533) Q Consensus 338 ~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~-Tatai~~~~~~l~~~~~~~~~~ 412 (533) .+.++ +....++.++......++++..+...++|+...|+||..+|...++.. +..+. .... T Consensus 252 ~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~-------------~~~~ 318 (432) T protein:vir:10 252 QNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQ-------------QQQF 318 (432) T ss_pred ccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH-------------HHHH Confidence 11111 111235556655566778888888899999999999999986544322 22222 1223 Q ss_pred HHHHHHHHHHHHHHHHHhhcc-CCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHH-HHH Q lcl|NC_016654. 413 FGSALGPLSTTCLRVDAIKFP-GKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDE-RVQ 490 (533) Q Consensus 413 ~~~al~~li~~il~l~~~~~~-~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~de-e~~ 490 (533) ++.+|..++..+....+..+. .........+.++++.-...|..++++.+.+++.+|+|+..++.+.+ ++..- ..+ T Consensus 319 ~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~--g~~pi~ggD 396 (432) T protein:vir:10 319 YTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKE--DLPPEAGGD 396 (432) T ss_pred HHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHh--CCCCCCCCC Confidence 344455444444332222221 11112234566667777788999999999999999999999977654 23221 010 Q ss_pred HHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 491 EEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 491 ~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +-+- .. +. .|.-..+....+..+++.+.+ ...++|. T Consensus 397 ~~~~--~~-n~--~~~~~~~~~~~k~~~~~~~~~--~~~~~~~ 432 (432) T protein:vir:10 397 RLLV--NG-NM--LPIDMAGQAYLKGGDTNGEVS--KEGNEGN 432 (432) T ss_pred eEee--cc-cc--cchhhccccccCCCCCCCCCC--CCCCCCC Confidence 0000 00 00 000000011112222222211 2222233 No 126 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=98.40 E-value=8.2e-07 Score=53.95 Aligned_cols=413 Identities=11% Similarity=0.037 Sum_probs=171.2 Q ss_pred cCCHHHHHHHHhccCcchhhH-HHHHHHHHHHHHhcccCCCCCc-ccceeecChHHHHHHHHHHhhcCCCceEeeCCCc- Q lcl|NC_016654. 30 EGDLDKLATFYGAEGRTSPSG-IKARTKAAYEAFHGRTPTATGR-APKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKS- 106 (533) Q Consensus 30 ~gd~~~l~~~y~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~g~-~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~- 106 (533) +|=.+++..++....+..... .--........|++..+....- ...-+...--..+++.+|+-+-+-|..+--..++ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~ 80 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYG 80 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 444455554442111110000 0000111223333332211110 0111222222334555555555545443211111 Q ss_pred --hHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEE Q lcl|NC_016654. 107 --KEVQARADLIFNT--P---RFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWS 179 (533) Q Consensus 107 --~~~~~~l~~i~~~--n---~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~ 179 (533) ......+..+|+. | .....+...+...+..|.+|+.+..|..+. -+.+..++|+++.+..+...+.. T Consensus 81 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~-~~~L~~i~~~~v~v~~d~~~~~~----- 154 (432) T protein:vir:10 81 IQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGK-VQALWPIDASKVTVYIDDVGLLN----- 154 (432) T ss_pred eeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc-EEEEEEEcCceeEEEEcCccccc----- Confidence 1112234444432 1 123344555666677899999998887653 34666778887776543321100 Q ss_pred EEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCccc Q lcl|NC_016654. 180 ELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPN 259 (533) Q Consensus 180 ~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~ 259 (533) ..+.+ +..+... |..+.+ +.--+.|+.+..+. T Consensus 155 -----------------~~~~~-~y~~~~~----g~~~~~--------------------------~~~eiih~r~~~~~ 186 (432) T protein:vir:10 155 -----------------SKTKM-WYVVNTG----GQQRVL--------------------------KPEEILHFKNGITL 186 (432) T ss_pred -----------------ccceE-EEEEecC----CeEEEE--------------------------ccccEEEecCCCCC Confidence 00000 0011110 111100 00012333321111 Q ss_pred ccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhC-cceeeechHHhcCCCCccccccCcchhhhhhccccc Q lcl|NC_016654. 260 PEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIG-AGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGG 337 (533) Q Consensus 260 ~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~-~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~ 337 (533) ....|.|.+..+. ..|+. ....+++... |+.| ...-++ ...+.-...........+.....+. T Consensus 187 --------~~~~G~s~~~~~~-~~i~~-~~~~~~~~~~~~~ng~~p~gil-----~~~~~l~~e~~~~~~~~~~~~~~g~ 251 (432) T protein:vir:10 187 --------DGLVGVPTMEYLK-STLEN-SASADKFINNFYKQGLQVKGLV-----QYVGDLNEDAKKVFRENFESMSSGL 251 (432) T ss_pred --------CCcccccHHHHHH-HHHHH-HHHHHHHHHHHHhccCCccEEE-----EcCCCCCHHHHHHHHHHHHHHhccc Confidence 1235777776543 44533 3334444433 3544 333222 1111100000001111121111111 Q ss_pred ccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcch-hHHHHHHHhhhHHHHHHHHHHH Q lcl|NC_016654. 338 FNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQ-TATEASGKKDLTVKTTRAKARH 412 (533) Q Consensus 338 ~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~-Tatai~~~~~~l~~~~~~~~~~ 412 (533) .+.++ +....++.++......++++..+...++|+...|+||..+|...++.. +..+. .... T Consensus 252 ~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~-------------~~~~ 318 (432) T protein:vir:10 252 QNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQ-------------QQQF 318 (432) T ss_pred ccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH-------------HHHH Confidence 11111 111235556655566778888888899999999999999986544322 22222 1223 Q ss_pred HHHHHHHHHHHHHHHHHhhcc-CCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHH-HHH Q lcl|NC_016654. 413 FGSALGPLSTTCLRVDAIKFP-GKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDE-RVQ 490 (533) Q Consensus 413 ~~~al~~li~~il~l~~~~~~-~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~de-e~~ 490 (533) ++.+|..++..+....+..+. .........+.++++.-...|..++++.+.+++.+|+|+..++.+.+ ++..- ..+ T Consensus 319 ~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~--g~~pi~ggD 396 (432) T protein:vir:10 319 YTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKE--DLPPEAGGD 396 (432) T ss_pred HHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHh--CCCCCCCCC Confidence 344455444444332222221 11112234566667777788999999999999999999999977654 23221 010 Q ss_pred HHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 491 EEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 491 ~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +-+- .. +. .|.-..+....+..+++.+.+ ...++|. T Consensus 397 ~~~~--~~-n~--~~~~~~~~~~~k~~~~~~~~~--~~~~~~~ 432 (432) T protein:vir:10 397 RLLV--NG-NM--LPIDMAGQAYLKGGDTNGEVS--KEGNEGN 432 (432) T ss_pred eEee--cc-cc--cchhhccccccCCCCCCCCCC--CCCCCCC Confidence 0000 00 00 000000011112222222211 2222233 No 127 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=98.40 E-value=8.2e-07 Score=53.95 Aligned_cols=413 Identities=11% Similarity=0.037 Sum_probs=171.2 Q ss_pred cCCHHHHHHHHhccCcchhhH-HHHHHHHHHHHHhcccCCCCCc-ccceeecChHHHHHHHHHHhhcCCCceEeeCCCc- Q lcl|NC_016654. 30 EGDLDKLATFYGAEGRTSPSG-IKARTKAAYEAFHGRTPTATGR-APKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKS- 106 (533) Q Consensus 30 ~gd~~~l~~~y~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~g~-~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~- 106 (533) +|=.+++..++....+..... .--........|++..+....- ...-+...--..+++.+|+-+-+-|..+--..++ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~ 80 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYG 80 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 444455554442111110000 0000111223333332211110 0111222222334555555555545443211111 Q ss_pred --hHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEE Q lcl|NC_016654. 107 --KEVQARADLIFNT--P---RFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWS 179 (533) Q Consensus 107 --~~~~~~l~~i~~~--n---~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~ 179 (533) ......+..+|+. | .....+...+...+..|.+|+.+..|..+. -+.+..++|+++.+..+...+.. T Consensus 81 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~-~~~L~~i~~~~v~v~~d~~~~~~----- 154 (432) T protein:vir:10 81 IQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGK-VQALWPIDASKVTVYIDDVGLLN----- 154 (432) T ss_pred eeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc-EEEEEEEcCceeEEEEcCccccc----- Confidence 1112234444432 1 123344555666677899999998887653 34666778887776543321100 Q ss_pred EEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCccc Q lcl|NC_016654. 180 ELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPN 259 (533) Q Consensus 180 ~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~ 259 (533) ..+.+ +..+... |..+.+ +.--+.|+.+..+. T Consensus 155 -----------------~~~~~-~y~~~~~----g~~~~~--------------------------~~~eiih~r~~~~~ 186 (432) T protein:vir:10 155 -----------------SKTKM-WYVVNTG----GQQRVL--------------------------KPEEILHFKNGITL 186 (432) T ss_pred -----------------ccceE-EEEEecC----CeEEEE--------------------------ccccEEEecCCCCC Confidence 00000 0011110 111100 00012333321111 Q ss_pred ccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhC-cceeeechHHhcCCCCccccccCcchhhhhhccccc Q lcl|NC_016654. 260 PEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIG-AGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGG 337 (533) Q Consensus 260 ~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~-~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~ 337 (533) ....|.|.+..+. ..|+. ....+++... |+.| ...-++ ...+.-...........+.....+. T Consensus 187 --------~~~~G~s~~~~~~-~~i~~-~~~~~~~~~~~~~ng~~p~gil-----~~~~~l~~e~~~~~~~~~~~~~~g~ 251 (432) T protein:vir:10 187 --------DGLVGVPTMEYLK-STLEN-SASADKFINNFYKQGLQVKGLV-----QYVGDLNEDAKKVFRENFESMSSGL 251 (432) T ss_pred --------CCcccccHHHHHH-HHHHH-HHHHHHHHHHHHhccCCccEEE-----EcCCCCCHHHHHHHHHHHHHHhccc Confidence 1235777776543 44533 3334444433 3544 333222 1111100000001111121111111 Q ss_pred ccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcch-hHHHHHHHhhhHHHHHHHHHHH Q lcl|NC_016654. 338 FNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQ-TATEASGKKDLTVKTTRAKARH 412 (533) Q Consensus 338 ~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~-Tatai~~~~~~l~~~~~~~~~~ 412 (533) .+.++ +....++.++......++++..+...++|+...|+||..+|...++.. +..+. .... T Consensus 252 ~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~-------------~~~~ 318 (432) T protein:vir:10 252 QNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQ-------------QQQF 318 (432) T ss_pred ccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH-------------HHHH Confidence 11111 111235556655566778888888899999999999999986544322 22222 1223 Q ss_pred HHHHHHHHHHHHHHHHHhhcc-CCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHH-HHH Q lcl|NC_016654. 413 FGSALGPLSTTCLRVDAIKFP-GKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDE-RVQ 490 (533) Q Consensus 413 ~~~al~~li~~il~l~~~~~~-~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~de-e~~ 490 (533) ++.+|..++..+....+..+. .........+.++++.-...|..++++.+.+++.+|+|+..++.+.+ ++..- ..+ T Consensus 319 ~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~--g~~pi~ggD 396 (432) T protein:vir:10 319 YTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKE--DLPPEAGGD 396 (432) T ss_pred HHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHh--CCCCCCCCC Confidence 344455444444332222221 11112234566667777788999999999999999999999977654 23221 010 Q ss_pred HHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 491 EEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 491 ~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +-+- .. +. .|.-..+....+..+++.+.+ ...++|. T Consensus 397 ~~~~--~~-n~--~~~~~~~~~~~k~~~~~~~~~--~~~~~~~ 432 (432) T protein:vir:10 397 RLLV--NG-NM--LPIDMAGQAYLKGGDTNGEVS--KEGNEGN 432 (432) T ss_pred eEee--cc-cc--cchhhccccccCCCCCCCCCC--CCCCCCC Confidence 0000 00 00 000000011112222222211 2222233 No 128 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=98.39 E-value=8.6e-07 Score=53.84 Aligned_cols=491 Identities=8% Similarity=-0.062 Sum_probs=211.4 Q ss_pred CCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhcc--CcchhhHHHHHHHHHHHHHhcccCCCCCc--ccceee Q lcl|NC_016654. 3 LPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAE--GRTSPSGIKARTKAAYEAFHGRTPTATGR--APKRYH 78 (533) Q Consensus 3 ~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~g~--~~~~~~ 78 (533) |-+-... .|..++.++ +.|+..+.+.-.++.... .+|..+||.......... .|+ .+..++ T Consensus 1 ma~~~~~----~~~~~~~r~---~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~--------~~q~~~rP~~~ 65 (708) T protein:vir:17 1 MAETLEK----KHERIMLRF---DRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKL--------DEQFEKYPKFE 65 (708) T ss_pred CchhHHH----HHHHHHHHH---HHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHh--------hhhhcCCCceE Confidence 2221111 122222222 334444433333332221 122233443332222111 011 133689 Q ss_pred cChHHHHHHHHHHhhcCCCceEeeCCC----ch----HHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEc---C Q lcl|NC_016654. 79 APIPGVIAKLSTTELFSEQLKFLDAGK----SK----EVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWD---P 147 (533) Q Consensus 79 ~n~~k~i~~~~a~ll~~e~~~i~~~~~----~~----~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D---~ 147 (533) .|+.+.+|+...++=-...+.+.+.+. +. .++..+..+.+.|+.......+...+.+.|-+|++++.| + T Consensus 66 ~N~i~~~i~~v~g~e~~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~~d~~~e 145 (708) T protein:vir:17 66 INKVATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNE 145 (708) T ss_pred EcchHHHHHHHHhhHhhCCcceEEecCCCcchHHHHHHHHHHHHHHHHhcCchhHHhHHHHHhhhcccceeeeeeccccc Confidence 999999999999986666666665433 12 234556667788899999999999999999999998653 1 Q ss_pred ----CCCCceEEEEE--cCCeEEEEEe--cCCceEEE--EEEEE----------eec-----------C------CceEE Q lcl|NC_016654. 148 ----TIADNAWIDFV--DADRAIPEFR--WGRLVAVT--FWSEL----------AGG-----------D------GQEVW 190 (533) Q Consensus 148 ----~~~~~~~i~~v--~~~~~~P~~~--~g~~~~v~--f~~~~----------~~~-----------~------~~~~y 190 (533) +...++.|..+ ++..|+.=++ .-.++.|- |...+ .+. + ..... T Consensus 146 ~d~~~~~~~i~i~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~~~~~~~d~v 225 (708) T protein:vir:17 146 YDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWEYDWFDADVI 225 (708) T ss_pred CCCCCCccccceEeeccchhheecCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhhhhhccccccccCCCeE Confidence 12335555554 3445552111 11334432 21111 000 0 00112 Q ss_pred EEEEEecCeeEEEEEEeccCCcccceeeh-------------------------hhccccc--cccccccccCCceeecC Q lcl|NC_016654. 191 RHLERHESGYIVHAVYKGTATSLGWMMAL-------------------------TDHPATR--DIAVEGADEGRGAYVET 243 (533) Q Consensus 191 ~~lE~h~~~~I~~~~y~~~~~~lG~~v~l-------------------------~~~~~~~--~~~~~~~~~~~~~~~~~ 243 (533) ++.|+|..-...-.++...+..-|.-+.+ ..+.++. ...+.+.+. .. T Consensus 226 rv~e~~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l~~------~~ 299 (708) T protein:vir:17 226 YIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFLEK------PR 299 (708) T ss_pred EEEEEEEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccccC------CC Confidence 34454432211111222111111111110 0000000 000101000 00 Q ss_pred CCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHH-hCcceeeechHHhcCCCCccccc Q lcl|NC_016654. 244 GVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFR-IGAGKVHASESVLTNLGMGQGVS 322 (533) Q Consensus 244 g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~-~~~~~i~v~~~~l~~~~~~~~~~ 322 (533) ..+.-.+.|+|........++. ..++|. +. .++|..+.+|...|.+.+.+- .++...+++...+......- .. T Consensus 300 ~~p~~~fP~vP~~g~r~~~d~~--~~~yG~--vr-~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~-~~ 373 (708) T protein:vir:17 300 RIPGEHIPLIPVYGKRWFIDDI--ERVEGH--IA-KAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHW-EA 373 (708) T ss_pred CCCCCccceEEEecccccccCC--Ccccch--hh-hchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhh-hh Confidence 0111123334432211111111 112443 33 367889999999999998884 44555666666553221100 00 Q ss_pred cCcchhhh---hhccccccccccccccceeee-chhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHH Q lcl|NC_016654. 323 LDEEQEVY---SRVGSGGFNANGDMETIFEFF-QPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGK 398 (533) Q Consensus 323 ~d~~~~~~---~~~~~~~~~~~~~~~~~i~~~-~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~ 398 (533) ...+...| ..+.........++. ....+ .+++. ..+++.++.....|-..+|++...+|-.+ + .||.+|..+ T Consensus 374 ~~~~~~~~~~~~~~~~~~g~v~~~a~-~~~~~~~~~~~-~~~~~llq~~~~~i~~~tGi~d~~~G~~s-n-~SG~Ai~~r 449 (708) T protein:vir:17 374 RNKKRPAFLPLREVRDKYGNIIAGAT-PAGYTQPAVMN-QALAALLQQTSADIQEVTGGSQAMQQMPS-N-IAQETVNNL 449 (708) T ss_pred cccchhhhhhhhccCCcccccccccC-CcccCCCcccc-HHHHHHHHHHHHHHHHhcCCChHHccCcc-c-hHHHHHHHH Confidence 00111111 111000000000000 11112 23444 47788899889999999999999998543 3 589999988 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCC----------------------------------CceeEE Q lcl|NC_016654. 399 KDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAA----------------------------------PSEELE 444 (533) Q Consensus 399 ~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~----------------------------------~~~~v~ 444 (533) ..............+..+.+.+.+.+|.+-...+.....+ ..++|. T Consensus 450 q~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~ 529 (708) T protein:vir:17 450 MNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVT 529 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEE Confidence 8887777777777777777777777776644332111000 011222 Q ss_pred EEeCCCCCCCHHHHHHHHHHHHhCCC-CCHHHH-----HHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCC Q lcl|NC_016654. 445 LEWPKFARESDLAKAQTVQAWSVASA-ASTKTK-----VAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPT 518 (533) Q Consensus 445 i~f~d~i~~d~~e~a~~~~~l~~aGi-~S~et~-----v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~ 518 (533) |+=..+.+.-.++..+.++++..+.. .-..+. +.++. ++ .-+++.+++|++....... ..++..+ T Consensus 530 v~~~p~~~t~r~~~~~~l~qll~~~~~~~~~~~~~~~l~l~~~-D~--p~~~ei~e~ir~~~~~~~~------~~~~~~e 600 (708) T protein:vir:17 530 VDVGPSYTARRDATVSVLTNVLSSMLPADPMRPAIQGIILDNI-DG--EGLDDFKEYNRNQLLISGI------AKPRNEK 600 (708) T ss_pred EecccCchhHHHHHHHHHHHHHHhcCCccchhHHHHHHHHHhc-CC--CChHHHHHHHHHHhhcccc------ccCcchh Confidence 22222222223455555555543321 111111 11111 12 1133445555544422111 0111100 Q ss_pred CCC--CCCC--CCCCCCC--------------C Q lcl|NC_016654. 519 END--PATD--PEAVDEG--------------E 533 (533) Q Consensus 519 ~~~--~~~~--~~~~~d~--------------~ 533 (533) +.. .+.+ .+...+- | T Consensus 601 ~~q~~~q~qq~~q~q~~~~~~eaqa~~~~~qAe 633 (708) T protein:vir:17 601 EQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAE 633 (708) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000 0000 0000000 0 No 129 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=98.36 E-value=1e-06 Score=53.38 Aligned_cols=414 Identities=12% Similarity=0.067 Sum_probs=160.9 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |...-+.+++++---. ++...=-++...|.+-|. ..+ T Consensus 55 ~~~~~~~~~~~~~~~~-------r~~~~~~~~l~~~~~~~~------------------------------------~np 91 (551) T protein:vir:80 55 SQPVIGSMSANPGFKT-------KPSIRNNQDLHGVLKKFG------------------------------------GNI 91 (551) T ss_pred ecccccceecCccccc-------CccccChhHHHHHHHHhh------------------------------------cCH Confidence 2222233333332100 000000000111111111 111 Q ss_pred hHHHHHHHHHHhhcC-----------CCceEeeCCC-------chHHHHHHHHHHhhc---------cHHHHHHHHHHHH Q lcl|NC_016654. 81 IPGVIAKLSTTELFS-----------EQLKFLDAGK-------SKEVQARADLIFNTP---------RFHSSLVEAGESC 133 (533) Q Consensus 81 ~~k~i~~~~a~ll~~-----------e~~~i~~~~~-------~~~~~~~l~~i~~~n---------~f~~~~~~~~~~~ 133 (533) +...+++..|+.+.. -+..+...+. +....+.|.+++..- .|...+..++... T Consensus 92 iv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~~~s~~~f~~~lv~dl 171 (551) T protein:vir:80 92 ILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFVKKIVRDT 171 (551) T ss_pred HHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCCccchHHHHHHHHHHHH Confidence 222232222222211 0111221111 111112344443322 2334445556666 Q ss_pred hhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEec-CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCc Q lcl|NC_016654. 134 SALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRW-GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATS 212 (533) Q Consensus 134 ~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~-g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~ 212 (533) +..|.+|+.+..|..|. -..+..++|.++.++.+. |.+..-.+ .|. . ...+.+. T Consensus 172 ll~Gnay~~i~rd~~G~-~~~L~~l~p~~V~v~~~~~g~~~~~~~-----------~y~--~-~~~g~~~---------- 226 (551) T protein:vir:80 172 YMYDQVNFEKVFNRNQS-MVRFVAKDPTTIFFATTADGKIPDNGN-----------RFV--Q-VIDQKIV---------- 226 (551) T ss_pred HhcCCEEEEEEECCCCc-EEEEEEeCCceeEEEECCccccccCce-----------EEE--E-EeCCcEE---------- Confidence 77899999888887653 356777888888876433 32211000 000 0 0000000 Q ss_pred ccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHH Q lcl|NC_016654. 213 LGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYS 292 (533) Q Consensus 213 lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s 292 (533) ..++- .+ .++++-|.... +....+|.|.+..+. ..|. +..... T Consensus 227 --~~~~~----------~e-----------------iiH~~~n~~~~------~~~~~~G~spi~~a~-~~i~-~~~a~~ 269 (551) T protein:vir:80 227 --ATFNA----------RE-----------------MAFAVRNPRSD------IYATGYGYPELEIAL-KQFI-AHENTE 269 (551) T ss_pred --EEEcc----------cc-----------------eEEecccCCCC------cccccccccHHHHHH-HHHH-HHHHHH Confidence 00000 00 01122111110 112346888776543 4443 334444 Q ss_pred HHHHH-HHhCc-ceeeechHHhcCCCCccc--cccCcchhhhhhcccccccccc-----ccccceeeechhhhhHHHHHH Q lcl|NC_016654. 293 SLMRD-FRIGA-GKVHASESVLTNLGMGQG--VSLDEEQEVYSRVGSGGFNANG-----DMETIFEFFQPAIRVLEHDQG 363 (533) Q Consensus 293 ~~~~~-~~~~~-~~i~v~~~~l~~~~~~~~--~~~d~~~~~~~~~~~~~~~~~~-----~~~~~i~~~~~~ir~e~~~~~ 363 (533) .+... |+.|. +.-++ ...+.... ...+.....+.....+..+++. .....++.++......++++. T Consensus 270 ~~~~~~f~Ng~~p~giL-----~~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~~~~~D~qfle~ 344 (551) T protein:vir:80 270 AFNDRFFSHGGTTRGIL-----QIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVSAEDVKFVNMTPSARDMEFEKW 344 (551) T ss_pred HHHHHHHHcCCCcceEE-----EEcCCCCCCHHHHHHHHHHHHHHhcCccccCccccccCCCceEEEccCChhHHHHHHH Confidence 44443 35543 33222 11111100 0001111112211111111111 111235556666677789999 Q ss_pred HHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccCCCCCCcee Q lcl|NC_016654. 364 AALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRA-KARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEE 442 (533) Q Consensus 364 l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~-~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~ 442 (533) .+...+.|+...|+||..+|+...+..++....+.. ..++.. ....++.+|.-++..+-...+..+... .... T Consensus 345 ~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t---~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~L~~~---~~~~ 418 (551) T protein:vir:80 345 LNYLINVISALYGIDPAEINIPNNGGATGSKGGSLN---EGNSAEKNQASKNKGLQPLLGFIEDFINKHIVAE---FGDK 418 (551) T ss_pred HHHHHHHHHHHhcCCHHHcCcccccccccccccccc---hhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc---cCCc Confidence 999999999999999999997554332222222211 112222 234555666666665544333333211 1245 Q ss_pred EEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCH--HHHHHHH---------HHH-----HHhhhc-ccC Q lcl|NC_016654. 443 LELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDD--ERVQEEA---------DLI-----DNANTV-SAP 505 (533) Q Consensus 443 v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~d--ee~~~El---------~rI-----~~E~~~-~~~ 505 (533) +.+.|+.....+..+.++.. +++.+|+|+..++.+++. +.. +.-+.-+ +.. +.+... ..+ T Consensus 419 ~~f~f~~~~~~~~~~~~~~~-~~~~~g~lT~NE~R~~~g--l~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (551) T protein:vir:80 419 YTFQFVGGDIKSELESVKIL-AEKAKVAMTVNEVRKELN--LPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQ 495 (551) T ss_pred eEEEeeccChhhHHHHHHHH-HHHhcCCcCHHHHHHHhC--CCCCCCCCceeecccccccccccccccCcchhhhhhccc Confidence 77888877777777777644 466789999999777652 211 1000000 000 000000 000 Q ss_pred c-cc-cccccCC----CCCCCC------CCC---CCCCCCCCC Q lcl|NC_016654. 506 T-FG-FGTDQPP----LPTEND------PAT---DPEAVDEGE 533 (533) Q Consensus 506 ~-~~-~~~~~~~----~~~~~~------~~~---~~~~~~d~~ 533 (533) . .+ .+....+ .++..+ +++ +++..+.|+ T Consensus 496 ~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~ 538 (551) T protein:vir:80 496 MLQEQTGNRVSTDVEDIPDGKDTTGDIGKDGQRKDKDNANAGK 538 (551) T ss_pred cccCcCCCCCCCCCCCCCCccccCCCccccccccCccccchhh Confidence 0 00 0000000 011100 011 011111121 No 130 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=98.36 E-value=1.1e-06 Score=53.29 Aligned_cols=404 Identities=13% Similarity=0.052 Sum_probs=164.9 Q ss_pred cccceeecChHHHHHHHHHHhhcCCCceEeeCC------CchHHHHHHHHHHh-h-------------ccHHHHHHHHHH Q lcl|NC_016654. 72 RAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAG------KSKEVQARADLIFN-T-------------PRFHSSLVEAGE 131 (533) Q Consensus 72 ~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~------~~~~~~~~l~~i~~-~-------------n~f~~~~~~~~~ 131 (533) .+.-.-.......+++.+|+-+.+-|..+.... ......+.+.+.+. . ..+...+...+. T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 111111345667777777777777665553211 11111122222211 1 123455666777 Q ss_pred HHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEE-EEE----EEecCeeEEEEEE Q lcl|NC_016654. 132 SCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVW-RHL----ERHESGYIVHAVY 206 (533) Q Consensus 132 ~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y-~~l----E~h~~~~I~~~~y 206 (533) .....|.+|+.+..|..+. -+.+..+++..+.+.-+..+.. . ..+.+.+| ... -....+.+.+..+ T Consensus 81 ~l~l~Gn~~i~~~r~~~G~-~~~l~~l~~~~v~~~~d~~~~~-----~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 151 (467) T protein:vir:31 81 DYEAIGWLTIEILTQTDGT-PTGLAYVPGHTIRKRMDERGFV-----Q---LLEEKEKYFGVAGDRYQTNGNGDLDPVFV 151 (467) T ss_pred HHHhcCCeEEEEEECCCCc-EEEEEEeCCceeEeeeecceeE-----e---ecCCceeeEEeccccceeecccceeeeee Confidence 7888899999998887653 3577788888877764333211 0 00111111 000 0000111111111 Q ss_pred eccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHH Q lcl|NC_016654. 207 KGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHE 286 (533) Q Consensus 207 ~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~ 286 (533) .......|..+. .+.--+.|+....+ ....+|.|.+..+.. .+ . T Consensus 152 ~~~~~~~~~~~~--------------------------~~~~diih~r~~~~--------~~~~~G~s~~~~~~~-~i-~ 195 (467) T protein:vir:31 152 DADDGSTGTSVS--------------------------NPANELIFKRNHSP--------LYPHYGAPDIIPAVK-TI-R 195 (467) T ss_pred eeccccccceeE--------------------------eccccEEEecCCCC--------CCCcccccHHHHHHH-HH-H Confidence 000000011000 01111233322111 123468888877653 33 3 Q ss_pred HHHHHHHHHHHH-HhCcc-e--eeechHHhcCCCCccc-cccCcc-hhh----hh-------hccccccccccccc---c Q lcl|NC_016654. 287 LDRIYSSLMRDF-RIGAG-K--VHASESVLTNLGMGQG-VSLDEE-QEV----YS-------RVGSGGFNANGDME---T 346 (533) Q Consensus 287 lD~~~s~~~~~~-~~~~~-~--i~v~~~~l~~~~~~~~-~~~d~~-~~~----~~-------~~~~~~~~~~~~~~---~ 346 (533) ++.....+...+ +.|.. . |.++..++....-... ..+... .+. +. +........+.+.+ . T Consensus 196 ~~~~~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~ 275 (467) T protein:vir:31 196 GDSAAQDYNIDFFENDGVPRIAIIVKGAELTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEI 275 (467) T ss_pred HHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccce Confidence 455555565554 54432 2 2222222211000000 000000 000 00 00000001111100 0 Q ss_pred ceeeech-hhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 347 IFEFFQP-AIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCL 425 (533) Q Consensus 347 ~i~~~~~-~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il 425 (533) .++.++. .....++.+..+...+.|+...|+||..+|+..++. +++.+.... ...++.+|.-+++.+. T Consensus 276 ~~~~ls~~~~~d~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~-~~s~~e~~~----------~~f~~~~l~P~~~~ie 344 (467) T protein:vir:31 276 RLEPLTVGIDEEASFLEFRGRNEHDILKVHDVPPVIAGVVESGA-FSTDAEEQR----------KEFAEETIQPKQHDFG 344 (467) T ss_pred eEEeccccChhhHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCC-cccCHHHHH----------HHHHHHHHHHHHHHHH Confidence 1111221 123456778888888899999999999998654332 111121111 1122233333333332 Q ss_pred HHHHhhccC-CCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHh-CCCCCHHHHHHHHHHHHHh-hhc Q lcl|NC_016654. 426 RVDAIKFPG-KGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYL-HEDWDDERVQEEADLIDNA-NTV 502 (533) Q Consensus 426 ~l~~~~~~~-~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l-~~~~~dee~~~El~rI~~E-~~~ 502 (533) ...+..+.. ........+.+++......|..+.++...+++.+|+|+..++++++ ++.+.|+.... ..-+..+ ++. T Consensus 345 ~~ln~~l~~~~~~~~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~pi~d~~~~~-~~~~~~~~~~~ 423 (467) T protein:vir:31 345 ELLYELVHKQGLDAPDWTIEFELAKPDTKLQDVEIASQRVQAMQGLLTVNELRDEFGFEPFPEEHVYG-GETLVAEVTGG 423 (467) T ss_pred HHHHHhhcchhhccCCceEEEecchhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCcccccC-Ccccccccccc Confidence 222222221 1122345688888888999999999999999999999999987765 22222211000 0000000 000 Q ss_pred ccCccccccccCCCCCCCCCCC-------CCCCCCC---CC Q lcl|NC_016654. 503 SAPTFGFGTDQPPLPTENDPAT-------DPEAVDE---GE 533 (533) Q Consensus 503 ~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~d---~~ 533 (533) ..| .+..++++....+++++. +-+++.. |. T Consensus 424 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 463 (467) T protein:vir:31 424 SGP-GGGIGDQIEQLVEDRADEIIDSYQADLETEQLIEIGA 463 (467) T ss_pred cCC-CCcccCcCCCCCCCcccchHhhhhhccccchhhhhcc Confidence 000 011111111111111100 0000000 00 No 131 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=98.32 E-value=1.4e-06 Score=52.71 Aligned_cols=466 Identities=11% Similarity=0.012 Sum_probs=188.3 Q ss_pred CCCC-cCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecCh Q lcl|NC_016654. 3 LPEA-NTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPI 81 (533) Q Consensus 3 ~~~~-~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~ 81 (533) |++- -+..+=......+..++..|..|...-.++.+|.... .+.......+.+..++--.- T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~------------------~~~~~~~~~~~~~~~~~dst 62 (535) T protein:vir:33 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPS------------------LFPKESDNESTDYTTPWQAV 62 (535) T ss_pred CChhhhhccChhHHHHHHHHHHHHhhHHHHHHHHHHHHhccc------------------ccCCCCCccccccccccccc Confidence 4422 2244444445566677777666665445554443221 11111112222233444455 Q ss_pred HHHHHHHHHHhhcCC----CceEeeCCCc-------------hHHHHH-------HHHHHhhccHHHHHHHHHHHHhhhC Q lcl|NC_016654. 82 PGVIAKLSTTELFSE----QLKFLDAGKS-------------KEVQAR-------ADLIFNTPRFHSSLVEAGESCSALS 137 (533) Q Consensus 82 ~k~i~~~~a~ll~~e----~~~i~~~~~~-------------~~~~~~-------l~~i~~~n~f~~~~~~~~~~~~~~G 137 (533) +...++.+|+.|.+- .+.|.....+ .+.+++ +.+.+..++|...+.++.+...+.| T Consensus 63 ~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G 142 (535) T protein:vir:33 63 GARGLNNLASKLMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAG 142 (535) T ss_pred HHHHHHHHHHHHHHhhcCCCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhC Confidence 677777777765443 2344432211 123333 3445888899999999999999999 Q ss_pred CEEEEEEEcCCCCCceEEEEEcCCeEEEEEe-cCCceEEEEEEEEeec---------CCceEEEEEEEecCeeEEEEEEe Q lcl|NC_016654. 138 GSFQRIVWDPTIADNAWIDFVDADRAIPEFR-WGRLVAVTFWSELAGG---------DGQEVWRHLERHESGYIVHAVYK 207 (533) Q Consensus 138 ~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~~---------~~~~~y~~lE~h~~~~I~~~~y~ 207 (533) .+.+++ +++.+..+++..++-..++..-+ +|++..++.-.+++.. .....+..-. .+.-.|.+.+|. T Consensus 143 ~a~l~~--~~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~-~~~~~v~~~v~~ 219 (535) T protein:vir:33 143 NALLYL--PEPEGSYNPMKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKM-DEMVDVYTHVYL 219 (535) T ss_pred ceeEEe--ecCCCCceeeEEEEcCeeEEeeCCCCCeeEEEeeEeecHHHHHHHhhhhhccccccccc-ccCCeEEEEEEe Confidence 998775 44444567888888877766654 4777665543333210 0000000000 000111112221 Q ss_pred ccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHH Q lcl|NC_016654. 208 GTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHEL 287 (533) Q Consensus 208 ~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~l 287 (533) -..+ |.-.. +.+ ....... ....+.+..-+.|++. +|... ..+.||+|-...++ +-+..| T Consensus 220 ~~~~--~~~~~---~~~---~~~~~~~-----~~~~~~~~~~~P~i~~-----Rw~~~-~ge~YGrgp~~~~l-~D~k~L 279 (535) T protein:vir:33 220 DEES--GDYLK---YEE---VEDVEID-----GSDATYPTDAMPYIPV-----RMVRI-DGESYGRSYCEEYL-GDLRSL 279 (535) T ss_pred eCCC--CcEEE---EEE---EeCcccc-----ccccccccccCCceee-----eeeec-CCCccccchHHHHH-HHHHHH Confidence 1110 00000 000 0000000 0001111111122222 23322 24778999877766 566788 Q ss_pred HHHHHHHHHHH-HhCcceeeechHHhcCCCCccccccCc-chhhhhhccccccccccccccceeee-chhhhhHHHHHHH Q lcl|NC_016654. 288 DRIYSSLMRDF-RIGAGKVHASESVLTNLGMGQGVSLDE-EQEVYSRVGSGGFNANGDMETIFEFF-QPAIRVLEHDQGA 364 (533) Q Consensus 288 D~~~s~~~~~~-~~~~~~i~v~~~~l~~~~~~~~~~~d~-~~~~~~~~~~~~~~~~~~~~~~i~~~-~~~ir~e~~~~~l 364 (533) +..--...... ...++...|+++.. .+...+.+ ..+.+... .. ++. ..+... ..++ ......+ T Consensus 280 ~~l~~~~l~~~~~~~~p~~lv~~~g~-----~~~~~~~~~~~g~~v~g-----~~-~~v-~~~~~~~~~~~--~~~~~~i 345 (535) T protein:vir:33 280 ENLQEAIVKMSMISAKVIGLVNPAGI-----TQPRRLTKAQTGDFVPG-----RR-EDI-DFLQLEKQADF--TVAKAVS 345 (535) T ss_pred HHHHHHHHHHHHHHhcCceeeccccc-----cchhhcccCCceeeecC-----Cc-ccc-eeeecccccch--hHHHHHH Confidence 87666555554 34555555544322 11111101 11112110 00 010 011111 1122 2222333 Q ss_pred HHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhccCCCCCCceeE Q lcl|NC_016654. 365 ALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKA-RHFGSALGPLSTTCLRVDAIKFPGKGAAPSEEL 443 (533) Q Consensus 365 ~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~-~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v 443 (533) +.+-..|.... +.. .+....+...|||||..+.+......+-.- +.-...|..|+..++.+.... +-....+...+ T Consensus 346 ~~~~~~I~~af-~~~-~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~-g~lP~~p~~~v 422 (535) T protein:vir:33 346 DQIEARLSYAF-MLN-SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQAT-SQIPELPKEAV 422 (535) T ss_pred HHHHHHHHHHH-hhh-hcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCCccce Confidence 33333332221 110 122233445699999888877666555432 223334445555555443210 11122345567 Q ss_pred EEEeCCCCCCC-----HHHHHHHHHHHHhCC------CCCHHHHHHHh---CC---C---CCHHHHHHHHHHHHHhhhc- Q lcl|NC_016654. 444 ELEWPKFARES-----DLAKAQTVQAWSVAS------AASTKTKVAYL---HE---D---WDDERVQEEADLIDNANTV- 502 (533) Q Consensus 444 ~i~f~d~i~~d-----~~e~a~~~~~l~~aG------i~S~et~v~~l---~~---~---~~dee~~~El~rI~~E~~~- 502 (533) ++++--++..- .....+.++.+.+.+ .+....+++.+ .+ . -+++|+++..++-.+.++. T Consensus 423 ~~~yis~La~aqr~~~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~Gvp~~~i~~~~ee~~~~~~q~~~~~~~~ 502 (535) T protein:vir:33 423 EPTISTGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGVE 502 (535) T ss_pred eEEEecHHHHHHHHHHHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHHHHcCCCHhHhcCCHHHHHHHHHHHHHHHHHH Confidence 77775554321 111112222221111 12233333322 11 0 1455554443322211111 Q ss_pred -ccCccccccccCCCCCCC--------CCCCCCCC Q lcl|NC_016654. 503 -SAPTFGFGTDQPPLPTEN--------DPATDPEA 528 (533) Q Consensus 503 -~~~~~~~~~~~~~~~~~~--------~~~~~~~~ 528 (533) .+...+.+ +.....++ +..|-+-. T Consensus 503 ~~~~~~g~~--~~~~~~~~~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:33 503 NAAAAGGAG--VGALATSSPEAMQGAAAKAGLNAT 535 (535) T ss_pred HHHHhhhhh--hcchhhcCChhHHHHHHhccCCCC Confidence 11111111 11111110 00000000 No 132 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=98.31 E-value=1.4e-06 Score=52.66 Aligned_cols=386 Identities=10% Similarity=-0.016 Sum_probs=150.5 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcc---ccee Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRA---PKRY 77 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~---~~~~ 77 (533) |+|=... ... .+.--.+|++.+ .++.... .|.. ..-+ T Consensus 1 M~~f~~~----~~~----------~~~~~~~~~~~~------------------------~~~~~~~--~~~~v~~~~al 40 (397) T protein:vir:38 1 MPLLKLN----KSH----------SQGFSLNDPDWV------------------------NFLTGGE--AQKYVSADTAL 40 (397) T ss_pred Ccchhhh----hcc----------cCcccCCchhhh------------------------hhhcCCc--CCceechHHhh Confidence 4442110 000 000000111111 1111000 0000 0011 Q ss_pred ecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEE Q lcl|NC_016654. 78 HAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDF 157 (533) Q Consensus 78 ~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~ 157 (533) ...--...|+.+|+-+-+=| +.+ .+......+.+--..-.....++..+...+..|.+|+.+..|..+. -+.+.. T Consensus 41 ~~~~V~~~v~~ia~~ia~~p--~~~--~~~~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~-~~~l~~ 115 (397) T protein:vir:38 41 KNSDIFSLIMQLSGDLAMVR--YTS--ESDRSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGV-DLSWEY 115 (397) T ss_pred ccHHHHHHHHHHHHHHhhCc--ccc--cccHHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCc-EEEEEE Confidence 11112223444444443323 222 2222222221110111122234444556677899999888886543 356777 Q ss_pred EcCCeEEEEEe-cCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccC Q lcl|NC_016654. 158 VDADRAIPEFR-WGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEG 236 (533) Q Consensus 158 v~~~~~~P~~~-~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~ 236 (533) ++|..+.+..+ +|.. ++ |. |.......|..+.+ T Consensus 116 l~~~~v~i~~~~~~~~---~~------------y~--------------~~~~~~~~~~~~~~----------------- 149 (397) T protein:vir:38 116 LRPSQVQPMLLQDGSG---LI------------YN--------------INFDEPAIGYMENV----------------- 149 (397) T ss_pred EcCceeEEEEcCCCce---EE------------EE--------------EEeccccccceeEe----------------- Confidence 88887776532 2211 01 10 00000001111100 Q ss_pred CceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCc-ceeeechHHhcC Q lcl|NC_016654. 237 RGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGA-GKVHASESVLTN 314 (533) Q Consensus 237 ~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~-~~i~v~~~~l~~ 314 (533) +.--+.|++....+ ...+|.|.+..+. ..|. +.....++... |+.+. ...++ .. T Consensus 150 ---------~~~eiih~~~~~~~--------~~~~G~s~i~~~~-~~i~-~~~~~~~~~~~~f~ng~~~~~il-----~~ 205 (397) T protein:vir:38 150 ---------PAADVIHIRLLSKN--------GGKTGISPLSALI-NEQQ-IKDASNELTLKALKQSVTASAVL-----TI 205 (397) T ss_pred ---------cCccEEEecCCCCC--------CccccccHHHHHH-HHHH-HHHHHHHHHHHHHhccCCccEEE-----Ee Confidence 00012233321111 1235888777654 4443 33444444444 35443 22222 11 Q ss_pred CCCccccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcch Q lcl|NC_016654. 315 LGMGQGVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQ 390 (533) Q Consensus 315 ~~~~~~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~ 390 (533) .+.............+..... ..+.++ +....++.++......++.+..+...++|+...|+|+..+|...+... T Consensus 206 ~~~~~~e~~~~~~~~~~~~~~-~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~ 284 (397) T protein:vir:38 206 QKGGLLDAETRIARSKEISKQ-IHNSDGPVVIDALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQS 284 (397) T ss_pred CCCCCHHHHHHHHHHHHHHhc-ccccCCceecCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccc Confidence 111100000000111111111 111111 122345666666677788889999999999999999999986544332 Q ss_pred hHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCC Q lcl|NC_016654. 391 TATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASA 470 (533) Q Consensus 391 Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi 470 (533) +.++.. ..+..+|..++..+....+..+.. ..+. ++...+-.|..+.++.+.+++.+|+ T Consensus 285 ~~e~~~--------------~~~~~~l~P~~~~ie~~ln~~l~~-----~~~~--~~~~~~~~d~~~~~~~~~~~~~~G~ 343 (397) T protein:vir:38 285 SITQIS--------------GQYAKSLNRYVQAIVGELNDKLHA-----NISA--NIRFAIDAMGDQYASTISSSVKGGT 343 (397) T ss_pred HHHHHH--------------HHHHHHHHHHHHHHHHHHHHhccC-----hhcc--cccccccCCHHHHHHHHHHHHhCCC Confidence 222221 123344444444443322222221 1122 2333445688999999999999999 Q ss_pred CCHHHHHHHhC-CCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 471 ASTKTKVAYLH-EDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 471 ~S~et~v~~l~-~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) |+..++.+.+. +.+...+ +-..........+.....+......++++.+++ .| T Consensus 344 ~t~nE~R~~lg~~p~~~~d----~~~~~~~~~~~~~~~~~~~g~~~~~~~~e~~~~------~~ 397 (397) T protein:vir:38 344 IAGNQARFILQNSGYLAKD----LPDPEKEPQQAIQLIQQEGGENDGNNSDERGSD------PE 397 (397) T ss_pred cCHHHHHHHhCCCCCCCCc----cccccccccccccccccccCCCCCCCCCCCCCC------CC Confidence 99999776541 1111110 000000000000000000001111111112222 22 No 133 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=98.31 E-value=1.4e-06 Score=52.64 Aligned_cols=415 Identities=11% Similarity=0.002 Sum_probs=154.9 Q ss_pred CCCC-CCcCCC---cCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccce Q lcl|NC_016654. 1 MSLP-EANTAW---PPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKR 76 (533) Q Consensus 1 ~~~~-~~~~~~---pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 76 (533) |+.. ..++.+ ||..+...... |...+.. ...-.... . T Consensus 66 ~~~~~~~~~~~~~~~~~~~~~~l~~------------------~~~~~iv--~~~i~~~~-------------------~ 106 (574) T protein:vir:80 66 MSVNPGYKTKPSIRNSQDLHKTLKK------------------FGNNIIL--NAIINTRS-------------------N 106 (574) T ss_pred ccccccccCcCccCCcccHHHHHHh------------------hccChhH--HHHHHHHH-------------------H Confidence 2211 111111 11111111111 1100000 00000000 0 Q ss_pred eecChHHHHHHHHHHhhcCCCceEeeCC-------CchHHHHHHHHHHhh---------ccHHHHHHHHHHHHhhhCCEE Q lcl|NC_016654. 77 YHAPIPGVIAKLSTTELFSEQLKFLDAG-------KSKEVQARADLIFNT---------PRFHSSLVEAGESCSALSGSF 140 (533) Q Consensus 77 ~~~n~~k~i~~~~a~ll~~e~~~i~~~~-------~~~~~~~~l~~i~~~---------n~f~~~~~~~~~~~~~~G~~~ 140 (533) ......+.|+...|++ |..+-..+ ........|.+++.. ..|...+..++...+..|.+| T Consensus 107 ~V~~~~~~i~~~ia~l----p~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~~lll~Gnay 182 (574) T protein:vir:80 107 QVSMYCKPARNSETGV----GYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFRDPNRDNFTTFCKKLVRATYMYDQVN 182 (574) T ss_pred HHHHHHHHHHhhhccC----ceEEEEeccCCCccchhhhhhhHHHHHHhccCCCCCCccccHHHHHHHHHHHHHhcCCeE Confidence 0011222223333322 11121110 001111233444432 123344555666677889999 Q ss_pred EEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehh Q lcl|NC_016654. 141 QRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALT 220 (533) Q Consensus 141 ~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~ 220 (533) +.+..|..+. -+.+..++|..+.+..+..... ..+...+|.. ..+.+.. .++- T Consensus 183 i~i~r~~~G~-~~~L~pl~p~~V~v~~d~~~~~---------~~~~~~y~~~----~~g~~~~------------~~~~- 235 (574) T protein:vir:80 183 FEKVFDKDGN-FIKFDTVDPTTIFLATNGEGKL---------IKNGERFVQV----IDNRIVA------------KFNE- 235 (574) T ss_pred EEEEECCCCc-EEEEEEEcCceeEEEEcCcccc---------ccCceEEEEE----eCCceEE------------EEcc- Confidence 9888887653 3567778888888775432110 0011111100 0000000 0000 Q ss_pred hccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HH Q lcl|NC_016654. 221 DHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FR 299 (533) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~ 299 (533) .+ .++++-|..+.. ....+|.|.+..+. ..|. +.....++... |+ T Consensus 236 ---------~e-----------------iih~~~~~~~~~------~~~~~G~spi~~a~-~~i~-~~~~a~~~~~~~f~ 281 (574) T protein:vir:80 236 ---------RE-----------------LAFAVRNPRADI------EVGQYGYPELEIAL-KQFI-AHENTEVFNDRFFS 281 (574) T ss_pred ---------cc-----------------EEEEeccCCCCc------ccccccccHHHHHH-HHHH-HHHHHHHHHHHHHh Confidence 00 011221211111 11346888876543 4443 33444444444 35 Q ss_pred hCc-ceeeechHHhcCCCCcc--ccccCcchhhhhhcccccccccc-----ccccceeeechhhhhHHHHHHHHHHHHHH Q lcl|NC_016654. 300 IGA-GKVHASESVLTNLGMGQ--GVSLDEEQEVYSRVGSGGFNANG-----DMETIFEFFQPAIRVLEHDQGAALLLREV 371 (533) Q Consensus 300 ~~~-~~i~v~~~~l~~~~~~~--~~~~d~~~~~~~~~~~~~~~~~~-----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i 371 (533) .|. ..-++ ...++.. ....+.....|.....+..+.+. +....++.++......++++..+...+.| T Consensus 282 ng~~p~gil-----~~~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl~~~G~~~~~l~~s~~D~qfle~~~~~~~~I 356 (574) T protein:vir:80 282 HGGTTRGIL-----HVKTGQQQSQQALDIFRREWRSSLAGINGSWQIPVVSAEDVKFVNMTPSANDMQFEKWLNYLINVI 356 (574) T ss_pred ccCCCceEE-----EeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEEccCChhHHHHHHHHHHHHHHH Confidence 543 23222 1111110 00011111112211111111111 11123555666667778889888899999 Q ss_pred HHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCC Q lcl|NC_016654. 372 LRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRA-KARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKF 450 (533) Q Consensus 372 ~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~-~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~ 450 (533) +...|+||..+|+...+..+++.....+ +.+++. ....++.+|.-+++.+....+..+... ....+.+.|+.. T Consensus 357 a~afgVPp~~lG~~~~~t~~gs~~~~~n---~sn~E~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~---~~~~~~~~f~~~ 430 (574) T protein:vir:80 357 SALYGIDPAEINFPNNGGATGSKGGSLN---EGNSKEKMQASQNKGLQPLLRFIEDTVNTYIVAE---FGEKYQFQFRGG 430 (574) T ss_pred HHHhCCCHHHhccccccccccccccccc---chhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh---cCCceEEEeccc Confidence 9999999999997654333232221111 111211 233444455555554443333222211 123466778776 Q ss_pred CCCCHHHHHHHHHHHHhCCCCCHHHHHHHhC-CCCCHHHHHHHH--------HHHHHhhhcccCccccccccCCCCCCCC Q lcl|NC_016654. 451 ARESDLAKAQTVQAWSVASAASTKTKVAYLH-EDWDDERVQEEA--------DLIDNANTVSAPTFGFGTDQPPLPTEND 521 (533) Q Consensus 451 i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~-~~~~dee~~~El--------~rI~~E~~~~~~~~~~~~~~~~~~~~~~ 521 (533) -..+..+..++ .+++.+|+|+..++.+.+. +.+.. .+.-+ .........+........+++......+ T Consensus 431 d~~~~~~~~~~-~~~~~~G~lT~NE~R~~lgl~Pi~g--GD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (574) T protein:vir:80 431 DLSAQLDKLKI-IEQEGKVFRTVNEIRHDKGLEPIKG--GDVILNGVHIQAIGQALQEEQLEYQRSQDRLNRLLELSGGD 507 (574) T ss_pred chhhHHHHHHH-HHHHhCCccCHHHHHHHhCCCCCCC--CCEeeeccceeecccccccccCCccchhccccccccccCCC Confidence 66666655544 3466789999999776541 11110 00000 0000000000000000000000000011 Q ss_pred CCCC------CCCCCCCC Q lcl|NC_016654. 522 PATD------PEAVDEGE 533 (533) Q Consensus 522 ~~~~------~~~~~d~~ 533 (533) ++++ +...+|+| T Consensus 508 ~~~~~~~~p~~~~~d~~~ 525 (574) T protein:vir:80 508 VEQPEPEEPKDSQNDTDV 525 (574) T ss_pred CCCCCCCCCCCccccccc Confidence 1111 11222222 No 134 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=98.30 E-value=1.5e-06 Score=52.50 Aligned_cols=476 Identities=13% Similarity=0.052 Sum_probs=194.4 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |.-++. ..+...+..+...|..|...-.++.+|... +...|........+.+..++.-+ T Consensus 1 m~~~~~------~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP---------------~~~~~~~~~~~~~~~~~~~~~ds 59 (556) T protein:vir:73 1 MAETEK------ERLLKQLAQLKNERTSFESHWLDLSDFINP---------------RGSRFLTSDVNRDDRRNTKIVDP 59 (556) T ss_pred CChhhH------HHHHHHHHHHHHHhhHHHHHHHHHHHHhcc---------------ccCCcCCCCCCcchhhcCccccc Confidence 333321 133444444444444444333333332210 01112222222333344567777 Q ss_pred hHHHHHHHHHHhhcCC-----CceEeeCCCc------hHHH-------HHHHHHHhhccHHHHHHHHHHHHhhhCCEEEE Q lcl|NC_016654. 81 IPGVIAKLSTTELFSE-----QLKFLDAGKS------KEVQ-------ARADLIFNTPRFHSSLVEAGESCSALSGSFQR 142 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e-----~~~i~~~~~~------~~~~-------~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~ 142 (533) -+...|+.+|+.|.+- .+.|.....+ ..++ +.+.+.|..++|...+.++.....++|.+.+. T Consensus 60 t~~~a~~~Las~l~~~ltpp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~ 139 (556) T protein:vir:73 60 TGSMAQRILSSGMMSGITSPARPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMA 139 (556) T ss_pred hHHHHHHHHHHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeee Confidence 7888888888766543 2333332211 2233 34455788889999999999999999999986 Q ss_pred EEEcCCCCCceEEEEEcCCeEEEEEe-cCCceEEEEEEEEeec--------CC-----ceEEEEEEEecCeeEEEEEEec Q lcl|NC_016654. 143 IVWDPTIADNAWIDFVDADRAIPEFR-WGRLVAVTFWSELAGG--------DG-----QEVWRHLERHESGYIVHAVYKG 208 (533) Q Consensus 143 ~~~D~~~~~~~~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~~--------~~-----~~~y~~lE~h~~~~I~~~~y~~ 208 (533) + +++....+++..++...++-.-+ +|++..|+...+++.. ++ +..|..-.......|.|.+|.. T Consensus 140 ~--~~~~~~~~r~~~~~l~~~~~~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~~v~~~V~pr 217 (556) T protein:vir:73 140 V--MEDDQDVIRTMPFPIGSYYLANSPRGSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHCITPN 217 (556) T ss_pred e--eecCCceEEEEEeecceeEEeeCCCCCeEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcCCccceEEEEEEEecc Confidence 4 44444567888888888777655 4676554322222100 00 0011100001123344444532 Q ss_pred cC---Cccc-ceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccch-hhhhHHHH Q lcl|NC_016654. 209 TA---TSLG-WMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRAD-LSTDLFPT 283 (533) Q Consensus 209 ~~---~~lG-~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~-~~~~i~~l 283 (533) .+ +..+ +-.|+..+...... .+....-+.|... +.|++. +|... ..+.||+|. ...++ +- T Consensus 218 ~~~~~~~~~~~~~p~~s~~~~~~~------~~~~vl~esg~~e--~P~~~~-----Rw~~~-~ge~YGrg~P~~~~l-gD 282 (556) T protein:vir:73 218 VNRDSGKMDSKNKPYRSVYFESGG------DSDKLLRESGFDE--FPILAP-----RWEVN-GEDVYASSCPGMLAL-GQ 282 (556) T ss_pred ccccccccCcccceEEEEEEEecC------CCceecccCCccc--CCceee-----eeeec-CCcccccCccHHHhH-HH Confidence 21 1111 11122111100000 0000001122211 122222 24322 347889984 55555 55 Q ss_pred HHHHHHHHHHHHHHHH-hCcceeeechHHhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhh-HHHH Q lcl|NC_016654. 284 FHELDRIYSSLMRDFR-IGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRV-LEHD 361 (533) Q Consensus 284 id~lD~~~s~~~~~~~-~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~-e~~~ 361 (533) +..|+..--......+ ..++.+.||.+.. + ......+....|..+. .+.+.-..+...++++.. .+.+ T Consensus 283 ~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~-~----~~~~~~pgg~~~~~~~-----~~~~~i~p~~~~~~d~~~~~~~i 352 (556) T protein:vir:73 283 VKALQVEQKRKAQLIDKATNPPMVAPTSLK-N----QRVSLLPGDVTYLDVI-----SGQDGFKPAYLVNPNTADLLADI 352 (556) T ss_pred HHHHHHHHHHHHHHHHHHhcCceecccccc-c----cceeeccCccccccCC-----CCccceeeeccccccHHHHHHHH Confidence 6777776666555553 3455556655431 1 1111111111121111 111111112233444322 1223 Q ss_pred HHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhccCCCCC-C Q lcl|NC_016654. 362 QGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKA-RHFGSALGPLSTTCLRVDAIKFPGKGAA-P 439 (533) Q Consensus 362 ~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~-~~~~~al~~li~~il~l~~~~~~~~~~~-~ 439 (533) +.+..-++...+.-.| ..++...+...|||||..+.+......+-.- +.-...|.-+|..++.+... .|..+. + T Consensus 353 ~~~~~rI~~af~~d~~--~~l~~~~~~r~TAtEv~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r--~g~lP~~P 428 (556) T protein:vir:73 353 QDTRQTINSAYFVDLF--MMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMAR--KNMLPEPP 428 (556) T ss_pred HHHHHHHHHHhhcchh--hhhccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh--cCCCCCCc Confidence 3333333333322111 1123234445799999998888777655533 33344566666666655332 121111 1 Q ss_pred ----ceeEEEEeCCCCCCCH-HHH-------HHHHHHHHhCC-----CCCHHHHHHHh---CC-----CCCHHHHHHHHH Q lcl|NC_016654. 440 ----SEELELEWPKFARESD-LAK-------AQTVQAWSVAS-----AASTKTKVAYL---HE-----DWDDERVQEEAD 494 (533) Q Consensus 440 ----~~~v~i~f~d~i~~d~-~e~-------a~~~~~l~~aG-----i~S~et~v~~l---~~-----~~~dee~~~El~ 494 (533) ...++|++--.+.... ... ++.+..+-+++ .+..+.+++.+ .+ -.+++|+++.-+ T Consensus 429 ~~l~~~~i~v~yis~La~aqk~~~~~~i~~~~~~~~~laq~~Pe~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq 508 (556) T protein:vir:73 429 DVLQGMPLRIEYISVMAQAQKSIGLTSLSQTVGFIGQLAQFKPEALDKLDVDQAIDAFSEMSGVSPTVIVPQEQVQGIRE 508 (556) T ss_pred hhhcCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCChhhcCCHHHHHHHHH Confidence 2246666644433211 111 11111111111 13334444332 11 025555544322 Q ss_pred HHHHhh-hcc----cCc-cccccccCCCCCC--------CCCCCCCCC Q lcl|NC_016654. 495 LIDNAN-TVS----APT-FGFGTDQPPLPTE--------NDPATDPEA 528 (533) Q Consensus 495 rI~~E~-~~~----~~~-~~~~~~~~~~~~~--------~~~~~~~~~ 528 (533) +-.+++ +++ ... .+.+....+.... -...|.++. T Consensus 509 ~r~~~qq~~~~~~~~~~a~~~~~~~~~~~~~~~~~l~~~~~~~g~~~~ 556 (556) T protein:vir:73 509 ERAKQAQAAQAMAMGQAAAQGAKTLSETQTSDPSALTAIANAAGAPQQ 556 (556) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCHHHHHHHHHhhcCCCC Confidence 211111 110 000 0001111111000 012233333 No 135 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=98.28 E-value=1.7e-06 Score=52.24 Aligned_cols=410 Identities=11% Similarity=0.039 Sum_probs=165.0 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhh-hHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCccc---ce Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAES-HVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAP---KR 76 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~-~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~---~~ 76 (533) |-.|.-. +...+..- ..|. |-| .+. ..+.+-.+|.+..+..|... .- T Consensus 1 ~~~~~~~----------~~~~~~~~~~~~~-g~~--------------~s~----~~~~~~~~~~~~~~~~g~~v~~~~a 51 (437) T protein:vir:10 1 MKQGKQR----------ALGRIKSSFLKWL-GVP--------------ISL----TDGSFWSAWGGMGSSSGETVTADSA 51 (437) T ss_pred CCcchhh----------hhhhhHHhhhhhc-CCc--------------ccC----CchhHHHhhcccccCCCceechHhh Confidence 2211111 11111111 1111 111 000 01122223334333333321 11 Q ss_pred eecChHHHHHHHHHHhhcCCCceEe-eCCC---chHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhhCCEEEEEEEcC Q lcl|NC_016654. 77 YHAPIPGVIAKLSTTELFSEQLKFL-DAGK---SKEVQARADLIFNT--P---RFHSSLVEAGESCSALSGSFQRIVWDP 147 (533) Q Consensus 77 ~~~n~~k~i~~~~a~ll~~e~~~i~-~~~~---~~~~~~~l~~i~~~--n---~f~~~~~~~~~~~~~~G~~~~~~~~D~ 147 (533) +...--..+++.+|+-+.+=|..+- ...+ .......+..+|.. | ....-....+...+..|.+|+.+..|. T Consensus 52 l~~~~v~~ci~~Ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~ 131 (437) T protein:vir:10 52 LQLSAVWSCVRLIAETIATLPLNLYQTKPDGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRSA 131 (437) T ss_pred hccHHHHHHHHHHHHHHhhCceeEEEEcCCCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEecC Confidence 2222223345555555444343321 1111 00111223333322 1 122334444556677899998887763 Q ss_pred CCCCceEEEEEcCCeEEEEEe-cCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccc Q lcl|NC_016654. 148 TIADNAWIDFVDADRAIPEFR-WGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATR 226 (533) Q Consensus 148 ~~~~~~~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~ 226 (533) + .-+.+..++|+.+.+... +|.+ .|.+.... |....+ T Consensus 132 -g-~~~~L~~l~p~~v~i~~~~~g~~-----------------------------~y~~~~~~----g~~~~~------- 169 (437) T protein:vir:10 132 -G-VLIGLELMLPQRTTVKRLTSGAL-----------------------------QYTYRNVD----GTVSTL------- 169 (437) T ss_pred -C-cEEEEEEEcCcceEEEECCCCeE-----------------------------EEEEEecC----ceEEEE------- Confidence 2 223455677776665532 2221 11111000 111100 Q ss_pred cccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCc-ce Q lcl|NC_016654. 227 DIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGA-GK 304 (533) Q Consensus 227 ~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~-~~ 304 (533) ..--+.|+.+.. +. ..+|.|.+..+ ...|. +.....++... |+.+. .. T Consensus 170 -------------------~~~dIih~r~~~----~d-----~~~G~spi~~~-~~~i~-~~~~~~~~~~~~f~ng~~p~ 219 (437) T protein:vir:10 170 -------------------AEDDVFHVRGFS----LD-----GLMGLTPIQYA-REVLG-NSTAANKTSASVFRNGLRPS 219 (437) T ss_pred -------------------ccccEEEecCcC----CC-----CcccccHHHHH-HHHHH-HHHHHHHHHHHHHhccCCcc Confidence 000122332211 11 23577776643 34443 23344444444 35443 33 Q ss_pred eeechHHhcCCCCccccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChh Q lcl|NC_016654. 305 VHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPV 380 (533) Q Consensus 305 i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~ 380 (533) .++ ...+.-.....+.....+.....+..++++ +....++.++......++++..+...+.|+...|+||. T Consensus 220 gil-----~~~~~l~~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~ 294 (437) T protein:vir:10 220 GVL-----STDQILQKEKRAEIRTDLAEQFGGAMQAGKTMVLEAGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPF 294 (437) T ss_pred EEE-----EcCCCCCHHHHHHHHHHHHHHhcCccccCcceeccCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHH Confidence 332 111110000001111112111111111111 11234566666667778888888888999999999999 Q ss_pred hcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHH Q lcl|NC_016654. 381 SLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQ 460 (533) Q Consensus 381 ~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~ 460 (533) .+|+...+..++..+... ....++.+|..++..+..-.+..+..........+.++++.-+..|..++++ T Consensus 295 ~lg~~~~~t~~~sn~e~~----------~~~f~~~tl~P~~~~ie~~l~~kll~~~e~~~~~~~fd~~~ll~~d~~~r~~ 364 (437) T protein:vir:10 295 MVGHSEKSTSWGTGIEQQ----------TLGFLTFTLRPWLTRIEQAARRSLLRPGERDQFYAEFSVEGLLRADSAGRAA 364 (437) T ss_pred HhCCCCCcccccchHHHH----------HHHHHHHHHHHHHHHHHHHHHhhccCccccCceEEEEechhhhccCHHHHHH Confidence 998765543333333222 2233444555544444332222222222222344666667777889999999 Q ss_pred HHHHHHhCCCCCHHHHHHHhC-CCCCHHHHHHHHHHHHHhhhcccCcccc-ccccCCCCC----CCCCCCCCCCCCCCC Q lcl|NC_016654. 461 TVQAWSVASAASTKTKVAYLH-EDWDDERVQEEADLIDNANTVSAPTFGF-GTDQPPLPT----ENDPATDPEAVDEGE 533 (533) Q Consensus 461 ~~~~l~~aGi~S~et~v~~l~-~~~~dee~~~El~rI~~E~~~~~~~~~~-~~~~~~~~~----~~~~~~~~~~~~d~~ 533 (533) .+.+++.+|+|+..++.+.+. +.+... .++-.+ +....| ... +.+.++... .+.+.+..+...+.| T Consensus 365 ~~~~~~~~G~~T~NE~R~~~gl~pi~gg---~~~~~~---~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 436 (437) T protein:vir:10 365 FYSTMTQNGLMTRDECRAKENLPPMGGN---AAVLTV---QSALLP-IDKLGEHTTATAAQDALKAWLYQEEKTRATQE 436 (437) T ss_pred HHHHHHhCCCcCHHHHHHHhCCCCCCCC---cceEee---cCcccc-hhhccCcCCCcchhccccccCCCCCCCCcccc Confidence 999999999999999777651 111111 010000 000001 000 011111100 111111223333333 No 136 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=98.27 E-value=1.8e-06 Score=52.04 Aligned_cols=467 Identities=10% Similarity=-0.031 Sum_probs=183.4 Q ss_pred Ccc-hHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHH Q lcl|NC_016654. 12 PPE-LAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLST 90 (533) Q Consensus 12 p~~-~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a 90 (533) =.. ....+..+...|..|...-.++.+|... ..+...-...+.+..++--+-+...++.+| T Consensus 1 mk~~a~~r~~~l~~~R~~~e~~w~e~~~y~lP------------------~~~~~~~~~~~~~~~~~~dstg~~a~~~La 62 (542) T protein:vir:78 1 MKGLAQARYSAMRADREDFLDMARRCAALTLP------------------YLLTEDGHASGGRLQQPYQSLGSKGVNALS 62 (542) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcc------------------ccCCCCCCcccccccccccchHHHHHHHHH Confidence 111 1234444444444444333333333211 100000011122233455566777888888 Q ss_pred HhhcCC-----CceEeeCCCc--------------hHHH-------HHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEE Q lcl|NC_016654. 91 TELFSE-----QLKFLDAGKS--------------KEVQ-------ARADLIFNTPRFHSSLVEAGESCSALSGSFQRIV 144 (533) Q Consensus 91 ~ll~~e-----~~~i~~~~~~--------------~~~~-------~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~ 144 (533) +.|.+- .+.|.....+ ..++ +.+.+.+..++|...+.++.+...+.|.+++ | T Consensus 63 a~l~~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--~ 140 (542) T protein:vir:78 63 SKLMLSLFPIQTSFFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLV--F 140 (542) T ss_pred HHHHHhhcCCCCccccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEE--E Confidence 766553 2344333211 1122 3344567888999999999999999999875 4 Q ss_pred EcCCCCCceEEEEEcCCeEEEEEe-cCCceEEEEEEEEeec------CCce----E--EEEEEEecCeeEEEEEEeccCC Q lcl|NC_016654. 145 WDPTIADNAWIDFVDADRAIPEFR-WGRLVAVTFWSELAGG------DGQE----V--WRHLERHESGYIVHAVYKGTAT 211 (533) Q Consensus 145 ~D~~~~~~~~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~~------~~~~----~--y~~lE~h~~~~I~~~~y~~~~~ 211 (533) .|++. ++.++-..++..-+ +|++..|+...+++.. .... + ...-+......|.+..+...+- T Consensus 141 ~~~~~-----~~~~pl~~y~v~~d~~G~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~ 215 (542) T protein:vir:78 141 AGKKT-----LKVYPLDRYVIERDGDGNVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDA 215 (542) T ss_pred ecCCC-----ceEEecceeEEeeCCCCCeEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCC Confidence 56532 55666666554444 4677665433222210 0000 0 0000111123334444432222 Q ss_pred cccceeehhhccccccccccccccCCceee-cCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHH Q lcl|NC_016654. 212 SLGWMMALTDHPATRDIAVEGADEGRGAYV-ETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRI 290 (533) Q Consensus 212 ~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~-~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~ 290 (533) ...+.+... .+.+... .++.+..-.... +.|.. -+.|++. +|... ..+.||+|-...++ +-+..|+.. T Consensus 216 ~~~~~~~~~-~~~~s~~-~e~~g~~v~~~~~e~g~~--~~P~i~~-----Rw~~~-~ge~YGrgp~~~~l-~D~k~L~~l 284 (542) T protein:vir:78 216 EVFTCCKLV-DGQHRWH-QECDGKEIKGSRSSSPLK--HSPWLPL-----RFNVV-DGESYGRGRVEEFF-GDLSSLDAL 284 (542) T ss_pred ccccccccC-CCeEEEE-EEeccccccccccccccc--cCCceee-----eeeec-CCCccccchHHHHH-HHHHHHHHH Confidence 221111110 0111110 011110000001 11111 1112221 24322 34788999877766 556788776 Q ss_pred HHHHHHHH-HhCcceeeechHH-hcCCCCccccccCcchhhhhhccccccccccccccceeee-chhhhhHHHHHHHHHH Q lcl|NC_016654. 291 YSSLMRDF-RIGAGKVHASESV-LTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFF-QPAIRVLEHDQGAALL 367 (533) Q Consensus 291 ~s~~~~~~-~~~~~~i~v~~~~-l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~-~~~ir~e~~~~~l~~~ 367 (533) --...... ...+....|+++. ++...- .+...+.+..- ..++...+... ..++. .-...++.+ T Consensus 285 ~~~~l~~~~~a~~pp~lv~~~g~~~~~~~-----~~~~~g~iv~g-------~~~~v~~~~~~~~~~~~--~~~~~i~~~ 350 (542) T protein:vir:78 285 TRSLIEGSAAAAKVVFMVSPSATTKPQSL-----ARAGTGAIIQG-------RAEDVSVVQANKGADFR--TVQEMIRDL 350 (542) T ss_pred HHHHHHHHHHHhcCceeeccccccchhhc-----ccCCCceeecC-------Cccceeeeecccccchh--HHHHHHHHH Confidence 65555544 3455555664432 221110 01111111110 00111111111 11221 122333333 Q ss_pred HHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhccCCCCCCceeEEEE Q lcl|NC_016654. 368 LREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHF-GSALGPLSTTCLRVDAIKFPGKGAAPSEELELE 446 (533) Q Consensus 368 l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~-~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~ 446 (533) -..|....-+. ....+...|||||..+.+......+-.-..+ ...|.-+|..++.+.... +-....+..-++++ T Consensus 351 ~~rI~~aFl~~----~~~d~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r~-g~lP~~p~~lv~~~ 425 (542) T protein:vir:78 351 SQRISDAFLIL----NVRQSERTTATEVREVQMELDRQLSGIYGSLTVELLTPYLNRKLHLMQRS-KQLPSLPKGLVMPT 425 (542) T ss_pred HHHHHHHhccc----ccCCcccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCchhceeee Confidence 33333222111 1122334699999888877666655533333 334444555555543210 11122344456666 Q ss_pred eCCCCCCC-----HHHHHHHHHHHHh-CC------CCCHHHHHHHh---CC------CCCHHHHHHHHHHHHHhhhcccC Q lcl|NC_016654. 447 WPKFARES-----DLAKAQTVQAWSV-AS------AASTKTKVAYL---HE------DWDDERVQEEADLIDNANTVSAP 505 (533) Q Consensus 447 f~d~i~~d-----~~e~a~~~~~l~~-aG------i~S~et~v~~l---~~------~~~dee~~~El~rI~~E~~~~~~ 505 (533) +--++..- .....+.++.+-+ .| .+....+++.+ .+ --++|+++++.++.++.+.++.. T Consensus 426 ~~s~La~~~r~~~~~~l~~~~~~i~~~~~p~~l~~~id~d~~~~~~a~~~Gvp~~~i~~s~e~~~~~~~q~q~~~~~~al 505 (542) T protein:vir:78 426 VVAGLGGVGRGEDRAALIEFMQTVGQAMGPEALQQFIDPTEFLKRLAAASGIDTLNLVKSPETMANEAQQAQQQQMTASL 505 (542) T ss_pred eechHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHH Confidence 65554321 1111111121111 11 12233333322 11 01456666655554443322111 Q ss_pred --ccccc-----cccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 506 --TFGFG-----TDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 506 --~~~~~-----~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ..+.. +++.+.-..-+.+..|+...-|| T Consensus 506 ~~~a~~~a~~~~~~~~~~~~~a~~~~~~~~~~~~~ 540 (542) T protein:vir:78 506 MGQAGQLAKSPIGEKMMQQINAPGQEAPAGPQTGE 540 (542) T ss_pred HHhhhhccccccccchhhhcCCCCcCCCCCCcccc Confidence 01110 01110000011233455555566 No 137 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=98.22 E-value=2.4e-06 Score=51.41 Aligned_cols=450 Identities=11% Similarity=0.009 Sum_probs=191.7 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |.- .+.++=..+...+..+...|..|...-.++.+|..... +.......+.+..++--+ T Consensus 1 ~~~---~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~------------------~~~~~~~~~~~~~~~~ds 59 (522) T protein:vir:94 1 MAE---REGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSL------------------FPKESDNSSTEYTTPWQA 59 (522) T ss_pred Ccc---cchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc------------------cCCCCCcccccccccccc Confidence 433 56677777777777777777766655555554432210 000001112233456667 Q ss_pred hHHHHHHHHHHhhcCC--C--ceEeeCCC-------------chHHHHH-------HHHHHhhccHHHHHHHHHHHHhhh Q lcl|NC_016654. 81 IPGVIAKLSTTELFSE--Q--LKFLDAGK-------------SKEVQAR-------ADLIFNTPRFHSSLVEAGESCSAL 136 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e--~--~~i~~~~~-------------~~~~~~~-------l~~i~~~n~f~~~~~~~~~~~~~~ 136 (533) -+...++.+|+.|.+- | +.|..... ...++++ +.+.+..++|...+.++.+...+. T Consensus 60 t~~~a~~~Las~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~ 139 (522) T protein:vir:94 60 VGARCLNNLAAKLMLALFPQSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVS 139 (522) T ss_pred cHHHHHHHHHHHHHhhcCCCCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhh Confidence 7778888888776553 2 44443211 1123333 334577789999999999999999 Q ss_pred CCEEEEEEEcCCCCC-ceEEEEEcCCeEEEEEe-cCCceEEEEEEEEee------------cCC------ceEEEEEEEe Q lcl|NC_016654. 137 SGSFQRIVWDPTIAD-NAWIDFVDADRAIPEFR-WGRLVAVTFWSELAG------------GDG------QEVWRHLERH 196 (533) Q Consensus 137 G~~~~~~~~D~~~~~-~~~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~------------~~~------~~~y~~lE~h 196 (533) |.+++. ++++..+ ...+.+++-..++-.-+ +|++..+++-.++.. .++ =.+|+.++.. T Consensus 140 G~a~l~--~~~~~~~~~~~~~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~v~~~ 217 (522) T protein:vir:94 140 GNCLLY--IPEPEQGTYSPMRMYRLVSYVVQRDAFGNILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVYTHIYRQ 217 (522) T ss_pred CcEeEe--eeccCCCceeeEEEEEcceEEEeeCCCcCeEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEEEEEEee Confidence 998864 4444333 23577777766555444 478877765444321 111 0123333322 Q ss_pred cCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchh Q lcl|NC_016654. 197 ESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADL 276 (533) Q Consensus 197 ~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~ 276 (533) +...+ .|...+ |..++.+ +.. ......||+.+ +|... ..+.||+|-. T Consensus 218 ~~~~~---~~~~~~---g~~~~~~----------------~~~--~~~~e~P~~~~--------Rw~~~-~ge~YGrgp~ 264 (522) T protein:vir:94 218 DDEYL---RYEEVE---GIEVTGT----------------DGS--YPLTACPYIPV--------RMVRL-DGEDYGRSYC 264 (522) T ss_pred CCcee---EEeecc---Cceeccc----------------CCC--CccccCCceee--------eeeec-CCCccccchH Confidence 22111 111100 1111100 000 00011222222 23222 2477899877 Q ss_pred hhhHHHHHHHHHHHHHHHHHHH-HhCcceeeechHHhcCCCCcccccc-Ccchhhhhhccccccccccccccceeee-ch Q lcl|NC_016654. 277 STDLFPTFHELDRIYSSLMRDF-RIGAGKVHASESVLTNLGMGQGVSL-DEEQEVYSRVGSGGFNANGDMETIFEFF-QP 353 (533) Q Consensus 277 ~~~i~~lid~lD~~~s~~~~~~-~~~~~~i~v~~~~l~~~~~~~~~~~-d~~~~~~~~~~~~~~~~~~~~~~~i~~~-~~ 353 (533) ..++ +-+..|+..--...... ...++.+.||++.+. +...+ +...+.+..- .. ++...+... .. T Consensus 265 ~~~l-~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~-----~~~~~~~~~~g~~v~g-----~~--~~v~~~~~~~~~ 331 (522) T protein:vir:94 265 EEYL-GDLNSLETITEAITKMAKVASKVVGLVNPNGIT-----QPRRLNKAATGEFVAG-----RV--EDINFLQLTKGQ 331 (522) T ss_pred HHHH-HHHHHHHHHHHHHHHHHHHHhCCceeecccccc-----cchheeccCCceeecC-----Cc--ccceeeeccccc Confidence 7766 66778887666665555 455666666543221 11111 0011111110 00 000011111 11 Q ss_pred hhhh-HHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhh Q lcl|NC_016654. 354 AIRV-LEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKA-RHFGSALGPLSTTCLRVDAIK 431 (533) Q Consensus 354 ~ir~-e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~-~~~~~al~~li~~il~l~~~~ 431 (533) ++.+ .+-++.++.-++.+.... .++...+...|||||..+.+...+...-.- +.-...|.-|+..++.+... T Consensus 332 ~~~~~~~~i~~~~~rI~~af~~~-----~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r- 405 (522) T protein:vir:94 332 DFTIAKSVADAIEQRLGWAFLLN-----SAVQRNAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQS- 405 (522) T ss_pred chhHHHHHHHHHHHHHHHHHhhh-----hhccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHh- Confidence 2221 222333333333333222 122233445799999988877666665533 33333455555555544321 Q ss_pred ccCC-CCCCceeEEEEeCCCCCCC-----HHHHHHHHHHHHhCC---C---CCHHHHHHHh---CCC------CCHHHHH Q lcl|NC_016654. 432 FPGK-GAAPSEELELEWPKFARES-----DLAKAQTVQAWSVAS---A---ASTKTKVAYL---HED------WDDERVQ 490 (533) Q Consensus 432 ~~~~-~~~~~~~v~i~f~d~i~~d-----~~e~a~~~~~l~~aG---i---~S~et~v~~l---~~~------~~dee~~ 490 (533) .|. ...+...+++++--++..- .....+.++.+-+.+ + +....+++.+ .+- -+++|++ T Consensus 406 -~g~lP~~p~~~v~v~~~s~La~~qr~~~~~~l~~~~~~ia~l~P~~~~~~id~d~~~~~~a~~~Gv~~~~ivr~~ee~~ 484 (522) T protein:vir:94 406 -AGMIPDLPKEAVEPTVSTGLEALGRGQDLEKLTQAVNMMTGLQPLSQDPDINLPTLKLRLLNALGIDTAGLLLTQDEKI 484 (522) T ss_pred -cCCCCCCCcccEEeeEecHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHH Confidence 111 2334555777765443321 111111111111111 0 1222222221 110 1344454 Q ss_pred HHHHHHHHhhh--cccCccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 491 EEADLIDNANT--VSAPTFGFGTDQPPLPTENDPATDPEAVDEG 532 (533) Q Consensus 491 ~El~rI~~E~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 532 (533) ++.++-+..+. +.....+ ... .... +.+.+++..-+ T Consensus 485 ~~~~q~~~~~~~~~~~~~~~--~~~-~a~~---~~~~~~~~~~~ 522 (522) T protein:vir:94 485 QRMAEQSSQQAVVQGASAAG--ANM-GAAV---GQGAGEDMAQA 522 (522) T ss_pred HHHHHHHHHHHHHHHHHHHH--HHh-hhhh---hcccchhhhcC Confidence 44433111111 0000000 000 0000 00001110011 No 138 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=98.18 E-value=3e-06 Score=50.90 Aligned_cols=409 Identities=11% Similarity=0.032 Sum_probs=168.4 Q ss_pred cCCHHHHHHHHhccCcch-hhHHHHHHHHHHHHHhcccCCCCCc-ccceeecChHHHHHHHHHHhhcCCCceEeeCCC-- Q lcl|NC_016654. 30 EGDLDKLATFYGAEGRTS-PSGIKARTKAAYEAFHGRTPTATGR-APKRYHAPIPGVIAKLSTTELFSEQLKFLDAGK-- 105 (533) Q Consensus 30 ~gd~~~l~~~y~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~g~-~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~-- 105 (533) +|= +.+|+....+.. ..............|++..+....- ...-+...--..+++.+|+-+-+=|..+--..+ T Consensus 1 M~~---~~~~f~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~l~~~~~~~~~~~ 77 (429) T protein:vir:10 1 MDS---VKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYG 77 (429) T ss_pred Cch---hhhhhcccccCcccccccCCChHHHHHHhcCCCCcceechhhhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 331 222221100000 0000000111223333322211000 001122222234455555555444443321111 Q ss_pred -chHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEec-CCceEEEEE Q lcl|NC_016654. 106 -SKEVQARADLIFNT--P---RFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRW-GRLVAVTFW 178 (533) Q Consensus 106 -~~~~~~~l~~i~~~--n---~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~-g~~~~v~f~ 178 (533) .......+..+|+. | .....++.++...+..|.+|+.+..|..+. -+.+-.++++++.+..+. +.+.. T Consensus 78 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~-~~~L~~i~~~~v~v~~~~~~~~~~---- 152 (429) T protein:vir:10 78 IQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGK-VQALWPIDASKVTVYIDDVGLLNS---- 152 (429) T ss_pred eeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc-EEEEEEEcCceeEEEEcCcccccc---- Confidence 11112234444432 1 122344555666778899999998886553 346677888877765433 22110 Q ss_pred EEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcc Q lcl|NC_016654. 179 SELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTP 258 (533) Q Consensus 179 ~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~ 258 (533) .+.+ +..+... |....+ +.--+.|+....+ T Consensus 153 -------------------~~~~-~~~~~~~----g~~~~~--------------------------~~~evih~~~~~~ 182 (429) T protein:vir:10 153 -------------------KTKM-WYVVNTG----GQQRVL--------------------------KPEEILHFKNGIT 182 (429) T ss_pred -------------------cceE-EEEEccC----CeEEEE--------------------------ccccEEEecCCCC Confidence 0000 0011100 111110 0001233332111 Q ss_pred cccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHH-HhC-cceeeechHHhcCCCCccccccCcchhhhhhcccc Q lcl|NC_016654. 259 NPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDF-RIG-AGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSG 336 (533) Q Consensus 259 ~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~-~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~ 336 (533) . ....|.|.+..+. ..+. +....+++...+ +.+ ....++ ...+.-...........+.....+ T Consensus 183 ~--------~~~~G~s~i~~~~-~~i~-~~~~~~~~~~~~~~ng~~~~~il-----~~~~~l~~e~~~~~~~~~~~~~~g 247 (429) T protein:vir:10 183 L--------DGLVGVPTMEYLK-STLE-NSASADKFINNFYKQGLQVKGLV-----QYVGDLNEDAKKVFRENFESMSSG 247 (429) T ss_pred C--------CCcccccHHHHHH-HHHH-HHHHHHHHHHHHHhccCCccEEE-----EcCCCCCHHHHHHHHHHHHHHhcc Confidence 1 1234777776543 4443 334444444443 544 333332 111100000000011112111111 Q ss_pred cccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc-hhHHHHHHHhhhHHHHHHHHHH Q lcl|NC_016654. 337 GFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA-QTATEASGKKDLTVKTTRAKAR 411 (533) Q Consensus 337 ~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~-~Tatai~~~~~~l~~~~~~~~~ 411 (533) ..+.++ +....++.++......++++..+...++|+...|+||..+|...++. .++.+. ... T Consensus 248 ~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e~~-------------~~~ 314 (429) T protein:vir:10 248 LQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQ-------------QQQ 314 (429) T ss_pred ccccCceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH-------------HHH Confidence 111111 11223555655556667888888889999999999999998644432 222222 223 Q ss_pred HHHHHHHHHHHHHHHHHHhhccC-CCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHH-HH Q lcl|NC_016654. 412 HFGSALGPLSTTCLRVDAIKFPG-KGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDE-RV 489 (533) Q Consensus 412 ~~~~al~~li~~il~l~~~~~~~-~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~de-e~ 489 (533) .++.+|..++..+....+..+.. ........+.++++.-...|..++++.+.+++.+|+|+..++.+.+ ++..- .. T Consensus 315 f~~~~l~P~~~~ie~~ln~kl~~~~~~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~--gl~p~~gg 392 (429) T protein:vir:10 315 FYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKE--DLPPEAGG 392 (429) T ss_pred HHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHh--CCCCCCCc Confidence 34555555555554433322221 1112234566666677788999999999999999999999977654 22210 01 Q ss_pred HHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 490 QEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 490 ~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ++-+.. . + ..|.-..+....+..+++.+.+ .+.++|. T Consensus 393 D~~~~~--~-n--~~~~d~~~~~~~k~g~~~~~~~--~~~~e~~ 429 (429) T protein:vir:10 393 DRLLVN--G-N--MLPIDMAGQAYLKGGDTNGEVS--KEGNEGN 429 (429) T ss_pred Ceeeec--c-c--ccchhhccccccCCCCCCCCCC--CCCCCCC Confidence 111000 0 0 0000000001111222222222 2222333 No 139 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=98.17 E-value=3.3e-06 Score=50.66 Aligned_cols=415 Identities=9% Similarity=0.006 Sum_probs=154.3 Q ss_pred CC--------CCCC-cCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCC Q lcl|NC_016654. 1 MS--------LPEA-NTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATG 71 (533) Q Consensus 1 ~~--------~~~~-~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 71 (533) |+ +|.. +-..||.. ......+...-..|+.+|- T Consensus 54 ~a~~~p~~~~~~~~~~~~~~p~~-~~~~~~~~~~l~~~~~npi------------------------------------- 95 (576) T protein:vir:96 54 QAYAEPFLEVMDTNPEFRTKRSY-MKNSDNLHDVLKQFGNNPI------------------------------------- 95 (576) T ss_pred chhhcceeeeeecCCCccccCcc-hhhhhhhHHHHHHhhcCHH------------------------------------- Confidence 21 2311 22222222 1111111111011111100 Q ss_pred cccceeecChHHHHHHHHHHhhc----CC---CceEeeCC-----CchH--HHHHHHHHH----hh-----ccHHHHHHH Q lcl|NC_016654. 72 RAPKRYHAPIPGVIAKLSTTELF----SE---QLKFLDAG-----KSKE--VQARADLIF----NT-----PRFHSSLVE 128 (533) Q Consensus 72 ~~~~~~~~n~~k~i~~~~a~ll~----~e---~~~i~~~~-----~~~~--~~~~l~~i~----~~-----n~f~~~~~~ 128 (533) +-.--..|++..|.|.. .+ ...+.... .... ....+..++ .. ..+...+.. T Consensus 96 ------v~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~ 169 (576) T protein:vir:96 96 ------LNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDIDRDSFQSFCRK 169 (576) T ss_pred ------HHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCccccHHHHHHH Confidence 00011122222222210 00 00010000 0000 011122222 11 124455666 Q ss_pred HHHHHhhhCCEEEEEEEcCCCCCc-eEEEEEcCCeEEEEEec-CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEE Q lcl|NC_016654. 129 AGESCSALSGSFQRIVWDPTIADN-AWIDFVDADRAIPEFRW-GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVY 206 (533) Q Consensus 129 ~~~~~~~~G~~~~~~~~D~~~~~~-~~i~~v~~~~~~P~~~~-g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y 206 (533) ++...+..|.+++.+.++.++.++ +.+-.++|.++.++.+. |.+ |+.. +.|..+ T Consensus 170 lv~dlll~Gna~~~i~~~rd~~g~~~~L~pl~p~~V~v~~~~dg~~-----------------~~~~-------~~~~~~ 225 (576) T protein:vir:96 170 IVRDTYTYDQVNFEKVFNKKNATTMDKFIAVDPSTIFYATDKNGKI-----------------IKGG-------KRFVQV 225 (576) T ss_pred HHHHHHhcCCeEEEEEEecCCCCceEEEEEeCCceeEEEECCCCce-----------------eeee-------eEEEEe Confidence 667778899999998888765544 34666888888876543 321 0000 000000 Q ss_pred eccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHH Q lcl|NC_016654. 207 KGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHE 286 (533) Q Consensus 207 ~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~ 286 (533) . +......++.. . -++|+.+...... ...+|.|.+..+. ..|. T Consensus 226 ~--~~~~~~~~~~~--------------------------d-ii~~~~~~~~d~~------~~~~G~Spi~~a~-~~i~- 268 (576) T protein:vir:96 226 I--NKKVVASFTSR--------------------------E-MAMGIRNPRTELS------SSGYGLSEVEIAM-KQFI- 268 (576) T ss_pred c--CCceEEEeccc--------------------------c-eEEEeecCCCCcc------cCcccccHHHHHH-HHHH- Confidence 0 00000000000 0 0223322211111 1346888876544 4443 Q ss_pred HHHHHHHHHHH-HHhCc-ceeeechHHhcCCCCcc--ccccCcchhhhhhcccccccccc-----ccccceeeechhhhh Q lcl|NC_016654. 287 LDRIYSSLMRD-FRIGA-GKVHASESVLTNLGMGQ--GVSLDEEQEVYSRVGSGGFNANG-----DMETIFEFFQPAIRV 357 (533) Q Consensus 287 lD~~~s~~~~~-~~~~~-~~i~v~~~~l~~~~~~~--~~~~d~~~~~~~~~~~~~~~~~~-----~~~~~i~~~~~~ir~ 357 (533) +......+... |+.|. ..-++ ...+... ....+.....+.....+..++++ +....++.++..... T Consensus 269 ~~~~~~~~~~~~f~Ng~~p~giL-----~~~~~~~ls~e~~~~lr~~~~~~~~G~~nag~~p~vl~~G~~~~~ls~~~~d 343 (576) T protein:vir:96 269 AYNNTETFNDRFFSHGGTTRGIL-----QIKSEQQQSQRALENFKREWKSSFSGINGSWQVPVVMADDIKFVNMTPTAND 343 (576) T ss_pred HHHHHHHHHHHHHhccCCCceEE-----EeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEeccCChhh Confidence 33444445444 35442 22222 1111110 00011111222221111111111 112346666777778 Q ss_pred HHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccCCC Q lcl|NC_016654. 358 LEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRA-KARHFGSALGPLSTTCLRVDAIKFPGKG 436 (533) Q Consensus 358 e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~-~~~~~~~al~~li~~il~l~~~~~~~~~ 436 (533) .++++..+...+.|+...|++|..+|+...+..++..-.... ++..++. ....++.+|..+++.+....+..+... T Consensus 344 ~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~--t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~- 420 (576) T protein:vir:96 344 MQFEKWLTYLINIISALYGIDPAEIGFPNRGGATGGKGGNTL--NEADPGKKQQQSQNKGLQPLLRFIEDLINTHIISE- 420 (576) T ss_pred HHHHHHHHHhHHHHHHHhCCCHHHcccccccccccccccccc--ccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchh- Confidence 899999999999999999999999997654433322111111 1111111 234455556666555544333222211 Q ss_pred CCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhC-CCCCHHHHHHHH-----HHHH-----------Hh Q lcl|NC_016654. 437 AAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLH-EDWDDERVQEEA-----DLID-----------NA 499 (533) Q Consensus 437 ~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~-~~~~dee~~~El-----~rI~-----------~E 499 (533) ....+.+.|.+.-+.++.+...+. .+..+|+|+..++.+.+. +.+.. -+.-+ ..+. .+ T Consensus 421 --~~~~~~~~f~r~d~~~~~e~~~~~-~~~~~G~lT~NE~R~~~gl~pieg--GD~~~~~~~~~~~~~~~~~~~~e~~~~ 495 (576) T protein:vir:96 421 --YSDKYVFQFVGGDTKSELDKIKIL-QEEVKTYKTVNEARKEKGLKPIEG--GDVLLDGSFIQSMSLNTQKEQYEDTKQ 495 (576) T ss_pred --ccCceEEEeccCCHHHHHHHHHHH-HHHhcCccCHHHHHHHhCCCCCCC--cceeccccccccccccccCCCCCCccc Confidence 123467778776555555554433 345579999999766541 11110 00000 0000 00 Q ss_pred hhcccCcc---ccccccCCCCCCCC--CCC---CCCCCCCCC Q lcl|NC_016654. 500 NTVSAPTF---GFGTDQPPLPTEND--PAT---DPEAVDEGE 533 (533) Q Consensus 500 ~~~~~~~~---~~~~~~~~~~~~~~--~~~---~~~~~~d~~ 533 (533) +....+.. ..+...+|.....+ +++ +++..-|++ T Consensus 496 ~~~~~~~~~~~~~~~~~~~~~~s~~~~~~g~~~~~~~~~~~~ 537 (576) T protein:vir:96 496 KERFDMIQQFLNSPDDEEPQQESTEDKVDGRESNDPTKIDSP 537 (576) T ss_pred cccccccccccCCCCCCCCCCCCCCCcccccccccCCCCCCc Confidence 00000000 00000001000000 000 011111111 No 140 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=98.15 E-value=3.6e-06 Score=50.46 Aligned_cols=396 Identities=7% Similarity=0.010 Sum_probs=166.2 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCccc---cee Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAP---KRY 77 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~---~~~ 77 (533) |= |.+.++.+..... ...........+|.+.+...|... .-+ T Consensus 1 m~---------------------------------~~~~f~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~v~~~~al 45 (416) T protein:vir:12 1 ML---------------------------------LERMFEKRSGSSD--HEDGFNNILLNMFGGRKTASGERVSESNSL 45 (416) T ss_pred Cc---------------------------------cchhcccccCccc--cCccchhHHHHhhcCcccccCceechhhhh Confidence 11 1122211111110 000111222333443333222211 111 Q ss_pred ecChHHHHHHHHHHhhcCCCceEeeCCCc---hHHHHHHHHHHh-h-cc---HHHHHHHHHHHHhhhCCEEEEEEEcCCC Q lcl|NC_016654. 78 HAPIPGVIAKLSTTELFSEQLKFLDAGKS---KEVQARADLIFN-T-PR---FHSSLVEAGESCSALSGSFQRIVWDPTI 149 (533) Q Consensus 78 ~~n~~k~i~~~~a~ll~~e~~~i~~~~~~---~~~~~~l~~i~~-~-n~---f~~~~~~~~~~~~~~G~~~~~~~~D~~~ 149 (533) ...--...++.+|+-+-+=|..+--..+. .....-+..+|. . |. ...-+...+......|.+|+.+..|..+ T Consensus 46 ~~~~v~~~i~~Ia~~ia~l~~~~~~~~~~~~~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G 125 (416) T protein:vir:12 46 VQPDIFACVNVLSDDIAKLPIHTYKRTDGGIERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHG 125 (416) T ss_pred ccHHHHHHHHHHHHhhhhCceEEEEecCCccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC Confidence 22222345556665555545433111110 000011222221 1 11 1233445556667789999988887654 Q ss_pred CCceEEEEEcCCeEEEEEec-CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccc Q lcl|NC_016654. 150 ADNAWIDFVDADRAIPEFRW-GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDI 228 (533) Q Consensus 150 ~~~~~i~~v~~~~~~P~~~~-g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~ 228 (533) . -..+-.++|+.+-++.+. +.. .|+. +..+ |..+.+. T Consensus 126 ~-~~~L~~l~~~~v~v~~~~~~~~---~~~~--------------------------~~~~----g~~~~~~-------- 163 (416) T protein:vir:12 126 Y-PEALFPLRPDYTNAYVHPTTGM---LWYQ--------------------------TVLN----GKAIELY-------- 163 (416) T ss_pred c-EEEEEEECCcceEEEEeCCCcE---EEEE--------------------------EecC----CeEEEec-------- Confidence 3 245666777777665322 110 1110 0000 1111000 Q ss_pred cccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhC-cceee Q lcl|NC_016654. 229 AVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIG-AGKVH 306 (533) Q Consensus 229 ~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~-~~~i~ 306 (533) .--+.|+.+... . ...|.|.+..+ ...++ ++....++... |+.+ ....+ T Consensus 164 ------------------~~eiih~~~~~~----~-----~~~G~s~i~~~-~~~i~-~~~~~~~~~~~~~~ng~~p~~i 214 (416) T protein:vir:12 164 ------------------DYEVLHFKGLST----D-----GIHGKSPIGVV-REHIG-AQAAATKYNAKLYKNEATPRGI 214 (416) T ss_pred ------------------CccEEEecCcCC----C-----CcccccHHHHH-HHHHH-HHHHHHHHHHHHHhcCCCCceE Confidence 001222322111 1 23577777654 34553 34445555544 4554 33333 Q ss_pred echHHhcCCCCccccccCcchhhhhh----ccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhc Q lcl|NC_016654. 307 ASESVLTNLGMGQGVSLDEEQEVYSR----VGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSL 382 (533) Q Consensus 307 v~~~~l~~~~~~~~~~~d~~~~~~~~----~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~ 382 (533) +. ..+.-.....+.....|.. -.....+ +...++.++......++++..+...++|+...|+||..+ T Consensus 215 l~-----~~~~~~~e~~~~~~~~~~~~~~~~~~~vl~----~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l 285 (416) T protein:vir:12 215 LK-----VPAFLDEKPKENVRKEWKRVNKVENIAIID----YGLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKL 285 (416) T ss_pred Ee-----cCCCCCHHHHHHHHHHHHHHhcCCCeeecC----CCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHh Confidence 31 1110000000001111211 1111111 122356666666777889999989999999999999999 Q ss_pred ccCCCcc-hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCC-CCCCceeEEEEeCCCCCCCHHHHHH Q lcl|NC_016654. 383 GLSDEVA-QTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGK-GAAPSEELELEWPKFARESDLAKAQ 460 (533) Q Consensus 383 g~~~~~~-~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~-~~~~~~~v~i~f~d~i~~d~~e~a~ 460 (533) |...++. .++++.. ...++.+|..++..+....+..+... .......+.+++++-+..|..++++ T Consensus 286 g~~~~~t~sn~e~~~-------------~~f~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~ 352 (416) T protein:vir:12 286 NELDKATFSNIEHQS-------------IEYVRNTLQPWIVNFEQELNVKLFLDHDQKSGHYVKFNIDSELRGDSKTQAE 352 (416) T ss_pred CCccCCCcccHHHHH-------------HHHHHHHHHHHHHHHHHHHHHhhcCchhhcCCceEEeechhhhccCHHHHHH Confidence 8654432 2233321 12334445554444433222222111 1112345777777878889999999 Q ss_pred HHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 461 TVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEG 532 (533) Q Consensus 461 ~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 532 (533) .+.+++.+|+|+..++.+.+ ++.+-+--.++- ..-+-. + .+.. +.....+.+.+.+..+..++| T Consensus 353 ~~~~~~~~G~~T~NE~R~~~--gl~Pi~ggd~~~--~~~n~~--~-~~~~-~~~~~~~~~~~~~gge~~~~g 416 (416) T protein:vir:12 353 YLKTLHETGVLNKDEIRELL--ERNPIENGDKYI--SSLNYV--F-LDFL-EEYQRLKAGGAMKGGDNKNEG 416 (416) T ss_pred HHHHHHhCCCcCHHHHHHHh--CCCCCCCcceee--eccccc--c-cccc-chhhccccccccCCCCCcCCC Confidence 99999999999999977764 222210001110 000000 0 0000 111111111111112233444 No 141 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=98.15 E-value=3.6e-06 Score=50.44 Aligned_cols=391 Identities=9% Similarity=-0.010 Sum_probs=160.4 Q ss_pred HhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcc---cceeecChHHHHHHHHHHhhcCCCceEeeC Q lcl|NC_016654. 27 VWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRA---PKRYHAPIPGVIAKLSTTELFSEQLKFLDA 103 (533) Q Consensus 27 ~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~---~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~ 103 (533) -|+. .+++.+............. .+.++.++..|.. ..-+...---..++.+|+-+-+=|..+--. T Consensus 1 ~~f~-------~~f~r~~~~~~~~~~~~~~----~~~~~~~~~~g~~v~~~~~l~~~~v~~~i~~Ia~~iA~~p~~~~~~ 69 (413) T protein:vir:48 1 MFFS-------GLFQRKSDAPVTTPAELAE----AIGLSYDTYTGKRISSQRAMRLTAVYSCVRVLAESVGMLPCSLYKI 69 (413) T ss_pred Cccc-------hhhccCccCCccchHHHHH----hhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEe Confidence 1111 1222222111111111100 0111111111111 111222223344555555555445433211 Q ss_pred C-C--chHHHHHHHHHHhh-----ccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEE Q lcl|NC_016654. 104 G-K--SKEVQARADLIFNT-----PRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAV 175 (533) Q Consensus 104 ~-~--~~~~~~~l~~i~~~-----n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v 175 (533) . + .......+..+|.. -....-+...+...+..|.+|+.+..+. + .-+.+-.++++++.++.+... T Consensus 70 ~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~-g-~~~~L~~l~~~~v~~~~~~~~---- 143 (413) T protein:vir:48 70 SGTLKTRVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKAL-G-EVVELLPIDPGCVEPKLNSQW---- 143 (413) T ss_pred cCCcceeecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeCC-C-cEEEEEEEcCceEEEEEcCCc---- Confidence 1 1 01111223334321 1122334445556677899988876652 2 223455567776666543211 Q ss_pred EEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecC Q lcl|NC_016654. 176 TFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPN 255 (533) Q Consensus 176 ~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn 255 (533) ..+|. ++... |....+ + .+ -+.|+.. T Consensus 144 -----------~~~y~-------------~~~~~----g~~~~~---~-----~~------------------evih~~~ 169 (413) T protein:vir:48 144 -----------QPVYQ-------------VTFPD----GSVDVL---T-----QD------------------EIWHVRT 169 (413) T ss_pred -----------eEEEE-------------EEecC----ceEEEE---c-----cc------------------cEEEecC Confidence 11111 00000 000000 0 00 1122221 Q ss_pred CcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhC-cceeeechHHhcCCCCccccccCcchhhhhhc Q lcl|NC_016654. 256 VTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIG-AGKVHASESVLTNLGMGQGVSLDEEQEVYSRV 333 (533) Q Consensus 256 ~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~-~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~ 333 (533) .. +. ...|.|.+..+. ..|+ ++....++... |+.+ .+..++ ...+.-.....+...+.+... T Consensus 170 ~~----~d-----~~~G~s~i~~~~-~~i~-~~~~~~~~~~~~~~ng~~p~gil-----~~~~~~~~e~~~~~~~~~~~~ 233 (413) T protein:vir:48 170 LT----LD-----GLVGLNPIAYAR-EAIS-LAAATEEHGARLFGNGAVTSGVL-----RTEQKLTPDAYERLKKDFEER 233 (413) T ss_pred cC----CC-----CcccccHHHHHH-HHHH-HHHHHHHHHHHHHhccCCcceEE-----EeCCCCCHHHHHHHHHHHHHH Confidence 10 11 235777766543 4453 33344444444 3543 333332 111110000001111112211 Q ss_pred ccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc-hhHHHHHHHhhhHHHHHHH Q lcl|NC_016654. 334 GSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA-QTATEASGKKDLTVKTTRA 408 (533) Q Consensus 334 ~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~-~Tatai~~~~~~l~~~~~~ 408 (533) ..+..+.++ +....++.++......++.+..+...++|+...|+||..+|...++. .+.++.. T Consensus 234 ~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~------------ 301 (413) T protein:vir:48 234 HTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELG------------ 301 (413) T ss_pred hcCccccCcceecCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHH------------ Confidence 111111111 11223555666666778888888889999999999999998654322 2333222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHH Q lcl|NC_016654. 409 KARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDER 488 (533) Q Consensus 409 ~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee 488 (533) ...++.+|.-++..+....+..+..........+.++++.-.-.|..++++.+++++++|+|+..++.+++ ++..-+ T Consensus 302 -~~f~~~~i~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~--g~~p~~ 378 (413) T protein:vir:48 302 -LGFINYSLVPYLTRIEQRINTGLVRESKQGKFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLE--DMNPRP 378 (413) T ss_pred -HHHHHHHHHHHHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh--CCCCCC Confidence 22334445555544433222222211112234566777777778999999999999999999999977654 232200 Q ss_pred HHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 489 VQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 489 ~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) .- +..-...+..+....++..+.+.++++.+ T Consensus 379 ---gg-----------D~~~~~~n~~~~~~~~~~~~~~~~~~~~~ 409 (413) T protein:vir:48 379 ---GG-----------DVYLTPMNMTTSPSAGDDNGKKKESGDAD 409 (413) T ss_pred ---Cc-----------ceeeccccccccccccccCCCCCCCCCcc Confidence 00 00000111112222222222222222222 No 142 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=98.12 E-value=4.1e-06 Score=50.09 Aligned_cols=392 Identities=13% Similarity=0.007 Sum_probs=153.4 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhh-HHHHHHHHHHHHHhcccCCCCCcccceeec Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPS-GIKARTKAAYEAFHGRTPTATGRAPKRYHA 79 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 79 (533) |+|= .++++.....+.. ...............+.... ...-+.+ T Consensus 1 Mgl~--------------------------------~~~f~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~---~~~al~~ 45 (409) T protein:vir:84 1 MSLF--------------------------------TRIFSGPSEERTLTKISGIPSPAEDWAMHGDRPG---ANSAMTL 45 (409) T ss_pred Cchh--------------------------------hhhhcCCCcccccccccccccccchhhccCcccc---hhhhhcc Confidence 3332 1222221111100 00000000000000010000 0111222 Q ss_pred ChHHHHHHHHHHhhcCCCceEeeCCCc-hHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCce Q lcl|NC_016654. 80 PIPGVIAKLSTTELFSEQLKFLDAGKS-KEVQARADLIFNT--P---RFHSSLVEAGESCSALSGSFQRIVWDPTIADNA 153 (533) Q Consensus 80 n~~k~i~~~~a~ll~~e~~~i~~~~~~-~~~~~~l~~i~~~--n---~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~ 153 (533) .--...++.+|+-+-+=|..+--..+. .....-+.++|.. | .-...+...+...+..|.+|+.+.+....+.-+ T Consensus 46 ~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~ 125 (409) T protein:vir:84 46 GAFYACVTLLADTVASLSIDAYRKKDNVRIPVSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPT 125 (409) T ss_pred HHHHHHHHHHHHhhhhCceEEEEecCCcccccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceE Confidence 223344555555554444333111111 1111123333321 1 112334445556677799887775433332234 Q ss_pred EEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccccc Q lcl|NC_016654. 154 WIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGA 233 (533) Q Consensus 154 ~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~ 233 (533) .+..++|+.+.+....+. +.. ..+..|... |+.++-++ T Consensus 126 ~L~~l~p~~v~v~~~~~~-------------~~~-------------~~~~~~~~~----g~~~~~~d------------ 163 (409) T protein:vir:84 126 AIMPIHPDCIHVTDAKDE-------------DGD-------------WIEPVYRID----GKVVPNHR------------ 163 (409) T ss_pred EEEEEcCceeEEEEcCCC-------------cce-------------EEEEEecCC----ceEEchhh------------ Confidence 566677776654421110 000 001112111 11111110 Q ss_pred ccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhC-cceeeechHH Q lcl|NC_016654. 234 DEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIG-AGKVHASESV 311 (533) Q Consensus 234 ~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~-~~~i~v~~~~ 311 (533) +.|+....++ ...+|.|.+..+ ...|+.. ...+++... |+.+ ....++ T Consensus 164 ----------------vih~~~~~~~--------~~~~G~s~i~~~-~~~i~~~-~~~~~~~~~~f~ng~~p~gil---- 213 (409) T protein:vir:84 164 ----------------IMHIKRYPVA--------GCALGMSPIEKA-ASAIGLG-LAAERYGLRWFRDSANPSGIL---- 213 (409) T ss_pred ----------------EEEecCCCCC--------cccccccHHHHH-HHHHHHH-HHHHHHHHHHHhcCCCccEEE---- Confidence 2233221111 123577777653 3444333 333444433 4543 333332 Q ss_pred hcCCCCccccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCC Q lcl|NC_016654. 312 LTNLGMGQGVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDE 387 (533) Q Consensus 312 l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~ 387 (533) ...+. ...+..++....+.....++++ +....++.++......++++..+...++|+...|+||..+|+..+ T Consensus 214 -~~~~~---l~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~ 289 (409) T protein:vir:84 214 -SSDAD---LTPDQVKQTQKQWIQSHHNRRLPAVMSAGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEK 289 (409) T ss_pred -ecCCC---CCHHHHHHHHHHHHHHhccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC Confidence 11110 0000001111101000011111 112235666666667788888888899999999999999987554 Q ss_pred cchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHh Q lcl|NC_016654. 388 VAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSV 467 (533) Q Consensus 388 ~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~ 467 (533) +..++..++...... +..+|..++..+..-.+..+ .....+.++++.-.-.|..++++.+.++++ T Consensus 290 ~~~~~sn~e~~~~~f----------~~~~l~P~~~~ie~~l~~~L-----~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~ 354 (409) T protein:vir:84 290 STSWGTGIEEQGINF----------VRHTLLPWLRCIEQALDTFL-----PRGQFVKFNVDGLMRGDVTARFTAYQMGLQ 354 (409) T ss_pred cccccchHHHHHHHH----------HHHHHHHHHHHHHHHHHHhc-----cCCCeEEEechhhhccCHHHHHHHHHHHHh Confidence 443333332222211 22223333332222111111 123457777778778899999999999999 Q ss_pred CCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccC-ccccccccCCCCCCCCCC--CCCCCCCCCC Q lcl|NC_016654. 468 ASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAP-TFGFGTDQPPLPTENDPA--TDPEAVDEGE 533 (533) Q Consensus 468 aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~-~~~~~~~~~~~~~~~~~~--~~~~~~~d~~ 533 (533) +|+|+..++.+++ ++..- .+-+++. .+ .+...++.+ .+++. .++...-+|- T Consensus 355 ~G~~t~NE~R~~~--g~~p~---~ggD~~~------~~~n~~~~~~~~----~~~~~~~~~~~~~~~gn 408 (409) T protein:vir:84 355 NGIWSVNEVRAWE--DAPPI---PEGDIHL------QPMNFVPLGYVP----PEEPAQEPQPNSATEGN 408 (409) T ss_pred CCCcCHHHHHHHh--CCCCC---CCcceee------ecccccccccCC----ccccCcCCCCCCccCCC Confidence 9999999977664 23221 0000000 00 000000000 01111 1111111111 No 143 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=98.08 E-value=5e-06 Score=49.62 Aligned_cols=405 Identities=12% Similarity=0.090 Sum_probs=158.6 Q ss_pred CCC-CCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeec Q lcl|NC_016654. 1 MSL-PEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHA 79 (533) Q Consensus 1 ~~~-~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 79 (533) |+- |--++.+.|..+. ++...-..|..+| T Consensus 58 ~~~~~g~~~~~~~~~~~----~l~~l~~~~~~np---------------------------------------------- 87 (547) T protein:vir:63 58 MSANPGFKTKPSIRNNQ----DLHGVLKKFGGNI---------------------------------------------- 87 (547) T ss_pred eecccccccCCccCChh----HHHHHHHHhhcCH---------------------------------------------- Confidence 221 1112222222111 1111111111111 Q ss_pred ChHHHHHHHHHHhhcC---------CCc--eEeeCC-------CchHHHHHHHHHHhhc---------cHHHHHHHHHHH Q lcl|NC_016654. 80 PIPGVIAKLSTTELFS---------EQL--KFLDAG-------KSKEVQARADLIFNTP---------RFHSSLVEAGES 132 (533) Q Consensus 80 n~~k~i~~~~a~ll~~---------e~~--~i~~~~-------~~~~~~~~l~~i~~~n---------~f~~~~~~~~~~ 132 (533) +...+++..|+-+.+ ..+ .+.+.+ .+...-..|.+++..- .+...+...+.. T Consensus 88 -iv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s~~~f~~~lv~d 166 (547) T protein:vir:63 88 -ILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFVKKIVRD 166 (547) T ss_pred -HHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccchHHHHHHHHHHH Confidence 122222222211110 000 111111 0111112334433321 233445556667 Q ss_pred HhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEec-CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCC Q lcl|NC_016654. 133 CSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRW-GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTAT 211 (533) Q Consensus 133 ~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~-g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~ 211 (533) .+..|.+|+.+..|..+. -+.+.+++|.++.++.+. |.+.. +. ..| ..+. ++ T Consensus 167 ~ll~Gn~~~~i~rd~~G~-~~~L~~l~p~~V~~~~~~~g~~~~----------~~-~~y-------------~~~~--~~ 219 (547) T protein:vir:63 167 TYMYDQVNFEKVFNRNQS-MVRFVAKDPTTIFFATTADGKIPD----------NG-NRF-------------VQVI--DQ 219 (547) T ss_pred HHhhCCEEEEEEECCCCc-EEEEEEecCceeEEEECCcccccc----------Cc-eEE-------------EEEc--CC Confidence 788899999888887653 356777888888776433 22110 00 001 0000 00 Q ss_pred cccceeehhhccccccccccccccCCceeecCCCccceeEEec-CCcccccccccccccccccchhhhhHHHHHHHHHHH Q lcl|NC_016654. 212 SLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVP-NVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRI 290 (533) Q Consensus 212 ~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~p-n~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~ 290 (533) .....++- .+ +.|+. |... .+....+|.|.+..+. ..|. +... T Consensus 220 ~~~~~~~~----------~e------------------iih~r~n~~~------~~~~~~~G~Spi~~~~-~~i~-~~~~ 263 (547) T protein:vir:63 220 KIVATFNA----------RE------------------MAFAVRNPRS------DIYATGYGYPELEIAL-KQFI-AHEN 263 (547) T ss_pred cEEEEecc----------cc------------------EEEecccCCC------CcccccccccHHHHHH-HHHH-HHHH Confidence 00000000 00 11111 1100 0112456888877654 4443 3344 Q ss_pred HHHHHHH-HHhCc-ceee--echHHhcCCCCccccccCcchhhhhhcccccccccc-----ccccceeeechhhhhHHHH Q lcl|NC_016654. 291 YSSLMRD-FRIGA-GKVH--ASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG-----DMETIFEFFQPAIRVLEHD 361 (533) Q Consensus 291 ~s~~~~~-~~~~~-~~i~--v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~-----~~~~~i~~~~~~ir~e~~~ 361 (533) ...+... |+.|. ..-+ ++... .....+ .+...+.+.....+..+++. .....++.++......+++ T Consensus 264 a~~~~~~~f~Ng~~p~giL~~~~~~--~ls~e~---~~~lk~~~~~~~~G~~nagk~~vl~~~g~~~~~l~~~~~d~qfl 338 (547) T protein:vir:63 264 TEAFNDRFFSHGGTTRGILQIKAAQ--QQSQHA---LEIFKREWKNSLSGINGSWQIPVVSAEDVKFVNMTPSARDMEFE 338 (547) T ss_pred HHHHHHHHHHcCCCcceEEEecCCC--CCCHHH---HHHHHHHHHHHhcCcccccccccccCCCceEEEcCCChhHHHHH Confidence 4444444 35543 2222 22110 000000 01111112111111111111 1122355666667778899 Q ss_pred HHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccCCCCCCc Q lcl|NC_016654. 362 QGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRA-KARHFGSALGPLSTTCLRVDAIKFPGKGAAPS 440 (533) Q Consensus 362 ~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~-~~~~~~~al~~li~~il~l~~~~~~~~~~~~~ 440 (533) +..+...+.|+...|++|..+|+...+..++....+.. ..++.. ....++.+|.-+++.+....+..+... .. T Consensus 339 e~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t---~sn~e~~~~~~~~~tL~P~~~~ie~~ln~~L~~~---~~ 412 (547) T protein:vir:63 339 KWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLN---EGNSAEKNQASKNKGLQPLLGFIEDFINKHIVAE---FG 412 (547) T ss_pred HHHHHHHHHHHHHhCCCHHHcCcccccccccccccccc---hhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---cC Confidence 99999999999999999999997554332222221111 111222 234455666666666554433333211 12 Q ss_pred eeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCH--HHHHHHH---------HHHHHhh-----hcc- Q lcl|NC_016654. 441 EELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDD--ERVQEEA---------DLIDNAN-----TVS- 503 (533) Q Consensus 441 ~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~d--ee~~~El---------~rI~~E~-----~~~- 503 (533) ..+.+.|+.....+..+.++. .+++.+|+|+.-++.+++. +.. +..+.-+ +..+.++ ..+ T Consensus 413 ~~~~~~f~~~~~~~~~~~~~~-~~~~~~g~lT~NE~R~~~g--l~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (547) T protein:vir:63 413 DKYTFQFVGGDIKSELESVKI-LAEKAKVAMTVNEVRKELN--LPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSN 489 (547) T ss_pred CceEEEeeccccccHHHHHHH-HHHHhCCCcCHHHHHHHhC--CCCCCCCCceeecccccccccccccccCCccccchhh Confidence 356788887777787777664 4567789999999776652 211 0000000 0000000 000 Q ss_pred cC-ccc-ccccc----CCCCCCCCC------CC---CCCCCCCCC Q lcl|NC_016654. 504 AP-TFG-FGTDQ----PPLPTENDP------AT---DPEAVDEGE 533 (533) Q Consensus 504 ~~-~~~-~~~~~----~~~~~~~~~------~~---~~~~~~d~~ 533 (533) .+ ..+ .++.. +..+.+++. ++ ++...+.|+ T Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~ 534 (547) T protein:vir:63 490 LQMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDGQRKDKDNANAGK 534 (547) T ss_pred ccccccccCCCCCCCCCCCCCCcccCCCcCccccccCccccchhh Confidence 00 000 00001 111111000 00 011111121 No 144 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=98.07 E-value=5.3e-06 Score=49.51 Aligned_cols=391 Identities=11% Similarity=-0.014 Sum_probs=157.4 Q ss_pred cCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCccc---ceeecChHHHHHHHHHHhhcCCCceEeeCC-- Q lcl|NC_016654. 30 EGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAP---KRYHAPIPGVIAKLSTTELFSEQLKFLDAG-- 104 (533) Q Consensus 30 ~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~---~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~-- 104 (533) +|=.++| ++............ + ...+....++..|... .-+...--...++.+|+-+-+-|..+--.. T Consensus 1 Mg~f~~l---f~r~~~~~~~~~~~-~---~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~ 73 (414) T protein:vir:44 1 MVFFSGL---FQRKSDAPVTTPAE-L---ADAIGLSYDTYTGKQISSQRAMRLTAVFSCVRVLAESVGMLPCNLYHLNGS 73 (414) T ss_pred Cchhhhh---hccCccCcccchhh-H---hHhhccCccccCCceechhhhhccHHHHHHHHHHHHHhccCceEEEEecCC Confidence 3322111 11111111110000 0 0111112222222110 112222233455556665554454332111 Q ss_pred -CchHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEec-CCceEEEE Q lcl|NC_016654. 105 -KSKEVQARADLIFNT--P---RFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRW-GRLVAVTF 177 (533) Q Consensus 105 -~~~~~~~~l~~i~~~--n---~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~-g~~~~v~f 177 (533) ........+..+|+. | .....+...+......|.+|+.+..+ ++ .-+.+..++|.++.+..+. +++ T Consensus 74 ~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-~g-~~~~L~~l~~~~v~~~~~~~~~~----- 146 (414) T protein:vir:44 74 LKQRATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA-FG-EVAELLPVDPGCVVPKLNSSWEP----- 146 (414) T ss_pred ceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CC-cEEEEEEEcCceEEEEECCCCcE----- Confidence 111111223333321 1 12233444555667779898887654 32 2234566777777665432 211 Q ss_pred EEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCc Q lcl|NC_016654. 178 WSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVT 257 (533) Q Consensus 178 ~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~ 257 (533) +| .+.... |....+ ..--+.|+.+.. T Consensus 147 -----------~y-------------~~~~~~----g~~~~~--------------------------~~~evih~~~~~ 172 (414) T protein:vir:44 147 -----------VY-------------QVTFPD----GSTDVL--------------------------SQEDIWHVRTLT 172 (414) T ss_pred -----------EE-------------EEEecC----ceEEEE--------------------------ccccEEEecCCC Confidence 11 111000 000000 000122332211 Q ss_pred ccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCc-ceeeechHHhcCCCCccccccCcchhhhhhccc Q lcl|NC_016654. 258 PNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGA-GKVHASESVLTNLGMGQGVSLDEEQEVYSRVGS 335 (533) Q Consensus 258 ~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~-~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~ 335 (533) +. ...|.|.+..+. ..++ +.....++... |+.|. +..++ ...+.-.....+...+.+..... T Consensus 173 ----~d-----~~~G~s~i~~~~-~~i~-~~~~~~~~~~~~f~ng~~p~gil-----~~~~~l~~e~~~~~~~~~~~~~~ 236 (414) T protein:vir:44 173 ----LD-----GLVGLNPIAYAR-EAIS-LAAATEEHGARLFSNGAVTSGVL-----RTEQTLSDQAYERLKKDFEERHT 236 (414) T ss_pred ----CC-----CcccccHHHHHH-HHHH-HHHHHHHHHHHHHhccCCCceEE-----EeCCCCCHHHHHHHHHHHHHHhc Confidence 11 235777776543 4443 33444444444 35433 22222 11111000000011111211111 Q ss_pred ccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc-hhHHHHHHHhhhHHHHHHHHH Q lcl|NC_016654. 336 GGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA-QTATEASGKKDLTVKTTRAKA 410 (533) Q Consensus 336 ~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~-~Tatai~~~~~~l~~~~~~~~ 410 (533) +..++++ +....++.++......++++..+...++|+...|+||..+|...++. .++++. .. T Consensus 237 g~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~-------------~~ 303 (414) T protein:vir:44 237 GLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEEL-------------GL 303 (414) T ss_pred CccccCcceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH-------------HH Confidence 1111111 11223556666666778888888888999999999999998654332 232222 12 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHH Q lcl|NC_016654. 411 RHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQ 490 (533) Q Consensus 411 ~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~ 490 (533) ..++.+|..+++.+....+..+..........+.++++.-+..|..++++.+.+++++|+|+..++.+.+ ++..-+-- T Consensus 304 ~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~--gl~p~~gg 381 (414) T protein:vir:44 304 GFINYSLVPYLTRIEQRINTGLVRKSKQGVFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLE--DMNPRPGG 381 (414) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh--CCCCCCCc Confidence 3444555555555533333222222111233456666666778999999999999999999999977654 23321000 Q ss_pred HHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 491 EEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 491 ~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) .++- .+ .+....+......+.+.+..++| T Consensus 382 D~~~---------~~-----~n~~~~~~~~~~~~~~~~~~~~d 410 (414) T protein:vir:44 382 DVYL---------TP-----MNMTTKPSDGSKAGKQKDNANAD 410 (414) T ss_pred ceec---------cc-----ccccccCCccccCCCCCCCCCCC Confidence 0000 00 00000000001111111111111 No 145 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=98.07 E-value=5.4e-06 Score=49.45 Aligned_cols=411 Identities=10% Similarity=0.036 Sum_probs=158.0 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcC--CHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceee Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEG--DLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYH 78 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~g--d~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 78 (533) +++....+..++-. ......||+- ++..|++.+. + T Consensus 15 ~~i~~~~~~s~~~~-------~~~~~~~~~pp~~~~~la~l~~------------------------------------~ 51 (542) T protein:vir:41 15 KAIKREEVESQALG-------ETRFEEYVEPKVNPLVLLSLLQ------------------------------------V 51 (542) T ss_pred hhhhhccccccccc-------cccCCccccCCCCHHHHHHHHh------------------------------------h Confidence 22221111111100 0000111110 1122222221 1 Q ss_pred cChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHH-hhc-cHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEE Q lcl|NC_016654. 79 APIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIF-NTP-RFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWID 156 (533) Q Consensus 79 ~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~-~~n-~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~ 156 (533) ......+++.+|+-+.+-|..+.. ... ..+..++ +.+ .+...+...+......|.+|+.+..|..+. -..+. T Consensus 52 n~~v~scI~~ia~~IA~l~~~~~~--~~~---~~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~-~~~L~ 125 (542) T protein:vir:41 52 NPYHASACSIKANDIIRTGYILEG--DDE---GVVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGD-PIRFE 125 (542) T ss_pred cHHHHHHHHHHHHHHhhCceeeec--ccc---hhhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCc-EEEEE Confidence 223445556666655555544422 211 1122222 111 123344555667778899999988887653 35677 Q ss_pred EEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccC Q lcl|NC_016654. 157 FVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEG 236 (533) Q Consensus 157 ~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~ 236 (533) .+++.++.+.-+.+.. .. +...... +| +.-|..... +.. .+ +. T Consensus 126 ~l~~~~v~v~~d~~~~-----~~-~~~~~~~-~~------------~~~y~~~~~-----~~~----~~-g~-------- 168 (542) T protein:vir:41 126 YIPSHTIRVHKDGSRY-----RQ-TWDGVNI-TH------------FKDYRYEGE-----INP----ET-GE-------- 168 (542) T ss_pred EEcCcceEEEEcCCee-----Ee-eecCCcc-ee------------EEeeccccc-----ccc----cc-cc-------- Confidence 7888877766443321 11 1111111 11 011110000 000 00 00 Q ss_pred CceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHH-HhCc-ce--eeechHHh Q lcl|NC_016654. 237 RGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDF-RIGA-GK--VHASESVL 312 (533) Q Consensus 237 ~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~~-~~--i~v~~~~l 312 (533) .....+.--+.|+.+..+ ....+|.|.+..++. .+ .++.....+.+.+ +.|. +. |.++..+. T Consensus 169 ----~~~~~~~~eIiHir~~~~--------~~~~~Glspi~~~~~-~i-~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~ 234 (542) T protein:vir:41 169 ----DQDSVGANELVFIHIPSP--------VCSYYGVPRYVSAAP-AI-LAMQKIDEYNYAFFDNYTIPSYVITVTGEFE 234 (542) T ss_pred ----cccccCcccEEEecCCCC--------CCCcccccHHHHHHH-HH-HHHHHHHHHHHHHHhccCCccEEEEeCCccc Confidence 000011112344433211 124468888876653 33 3445555555443 5432 22 23333222 Q ss_pred cCCCCccccccCc---chhhhhhcccccc-cc----------ccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCC Q lcl|NC_016654. 313 TNLGMGQGVSLDE---EQEVYSRVGSGGF-NA----------NGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYS 378 (533) Q Consensus 313 ~~~~~~~~~~~d~---~~~~~~~~~~~~~-~~----------~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s 378 (533) +..........+. ..+.+.....+.. +. +.++...++.++......++.+..+...++|+...|+| T Consensus 235 de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVP 314 (542) T protein:vir:41 235 DELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSIPGGDTVKVTFTPLNTSQKELSFREYAAEKKYDIAAAHMID 314 (542) T ss_pred cccccccccCHHHHHHHHHHHHHHHhhhhcccCceeEeeccCCcccceeEEEcCCChhHHHHHHHHHHHHHHHHHHhCCC Confidence 1111111000000 0111111100100 00 11122234455555567788888888899999999999 Q ss_pred hhhcccCCCcc---hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeC--CCCCC Q lcl|NC_016654. 379 PVSLGLSDEVA---QTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWP--KFARE 453 (533) Q Consensus 379 ~~~~g~~~~~~---~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~--d~i~~ 453 (533) |..+|+..++. .++++.. ...++..|..+++.+....+..+... ....+.+.|+ +.... T Consensus 315 p~~lG~~~~~t~n~sn~Eq~~-------------~~f~~~tL~P~~~~ie~~ln~~L~~~---~~~~~~~~f~~~~ll~~ 378 (542) T protein:vir:41 315 PYRLGIADTGPLGGNFAEVTR-------------RTYYESVVRPQQNIISSILTDFFQVK---FNPKTRFKFNDETLLES 378 (542) T ss_pred HHHhCcCCCcccccccHHHHH-------------HHHHHHHHHHHHHHHHHHHHhhcccc---cCCceEEEecchhhcch Confidence 99998754332 2233221 22333334444433332222222111 1123445555 44444 Q ss_pred CHHHHHHHHHHHHhCCCCCHHHHHHHhCC--CCCHHHH------HHH------------HHHHHHhhhcccCcccccccc Q lcl|NC_016654. 454 SDLAKAQTVQAWSVASAASTKTKVAYLHE--DWDDERV------QEE------------ADLIDNANTVSAPTFGFGTDQ 513 (533) Q Consensus 454 d~~e~a~~~~~l~~aGi~S~et~v~~l~~--~~~dee~------~~E------------l~rI~~E~~~~~~~~~~~~~~ 513 (533) |.. ..+..++++|+|+..++.+.+.+ ..++.-. .+. ++++++-.+...|.+..... T Consensus 379 d~~---~~~~~~v~~GilT~NE~Re~L~g~~pgdd~~l~p~~~~~~~~~~~~~n~~~~~~~~~~k~~~k~~~~~~~~~~- 454 (542) T protein:vir:41 379 DSV---RNCALLVQSGVLTPAEARERLFGLDGGPDIFMVPSKGAAKSVKRQERNYEKNQIREIRKIYAKYRPRFNEIIS- 454 (542) T ss_pred HHH---HHHHHHHhCCCCCHHHHHHhhCCCCCCCccccccccccccccccCCcCCCCCchhhhhhcccccCcccccccc- Confidence 433 33556788999999997655532 1221100 000 00000000011111100000 Q ss_pred CCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 514 PPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 514 ~~~~~~~~~~~~~~~~~d~~ 533 (533) ... ..+....+..++++| T Consensus 455 ~~~--~~~~~~~~~~~~~~~ 472 (542) T protein:vir:41 455 SKL--SAEEKKKKIDESLAE 472 (542) T ss_pred ccc--cchhhcccccchhhh Confidence 000 001111223334444 No 146 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=98.05 E-value=5.8e-06 Score=49.30 Aligned_cols=497 Identities=7% Similarity=-0.054 Sum_probs=208.5 Q ss_pred CCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhcc----Ccc--hhhHHHHHHHHHHHHHhcccCCCCCcccce Q lcl|NC_016654. 3 LPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAE----GRT--SPSGIKARTKAAYEAFHGRTPTATGRAPKR 76 (533) Q Consensus 3 ~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~----~~~--~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 76 (533) |-+.+ ...+++.+.||.-+.++-...++.. ..+ ..+||..........- . ....+.. T Consensus 1 m~e~~-----------~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~----~--q~~grP~ 63 (706) T protein:vir:10 1 MAESR-----------QKQHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLD----E--QFEKYPK 63 (706) T ss_pred CCcch-----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhh----h--hhcCCCc Confidence 22222 1122222333332222211111110 000 0112222221111100 0 0011236 Q ss_pred eecChHHHHHHHHHHhhcCCCceEeeCC---C-chH----HHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcC- Q lcl|NC_016654. 77 YHAPIPGVIAKLSTTELFSEQLKFLDAG---K-SKE----VQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDP- 147 (533) Q Consensus 77 ~~~n~~k~i~~~~a~ll~~e~~~i~~~~---~-~~~----~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~- 147 (533) ++.|+.+.+|+...++.-...+.+.+.+ . +.. ++..+..+...++.......+...+...|-+|++++.|- T Consensus 64 ~~~N~i~~~v~~v~g~~~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~ 143 (706) T protein:vir:10 64 FEINKVATELNRIISEYRNNRISVKFRPGDNAASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTTSFV 143 (706) T ss_pred eEecchHHHHHHHhhHHHhCCCceEEecCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEeeeccc Confidence 8999999999999999887777776543 1 222 344556677888999999999999999999999997652 Q ss_pred ------CCCCceEEEEEc-C-CeEEEEE--ecCCceEE--EEEEEEeecC--------------------------CceE Q lcl|NC_016654. 148 ------TIADNAWIDFVD-A-DRAIPEF--RWGRLVAV--TFWSELAGGD--------------------------GQEV 189 (533) Q Consensus 148 ------~~~~~~~i~~v~-~-~~~~P~~--~~g~~~~v--~f~~~~~~~~--------------------------~~~~ 189 (533) .+...|.|+.+. | +.|+.=+ ...+++.+ +|...+-..+ .... T Consensus 144 ~~~d~~~~~~~i~i~~v~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~~~d~ 223 (706) T protein:vir:10 144 NEYDPMDERQRIAVEPIYDPARSVWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWFTPDV 223 (706) T ss_pred cccCCCCCCccceeeeeccchhceecCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhhhccccccccccCCCc Confidence 233456666542 3 3443221 12344443 2322211000 0001 Q ss_pred EEEEEEecCee--EEEEEEeccCCcccceeehhhcccc-ccccc----cc------------cccCCceeecC--CCccc Q lcl|NC_016654. 190 WRHLERHESGY--IVHAVYKGTATSLGWMMALTDHPAT-RDIAV----EG------------ADEGRGAYVET--GVKDL 248 (533) Q Consensus 190 y~~lE~h~~~~--I~~~~y~~~~~~lG~~v~l~~~~~~-~~~~~----~~------------~~~~~~~~~~~--g~~~~ 248 (533) .+..|+|+... +....|+.....-+..+....+..- ..+.. .+ ....+....+. ..++- T Consensus 224 ~~~~eyy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l~~~~p~~~~ 303 (706) T protein:vir:10 224 VYIAKYYEVRKESVDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFLEKPRRIPGE 303 (706) T ss_pred ceecccccccceeEEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeeccccccccCCCCCCC Confidence 23344443221 1111122110000000000000000 00000 00 00000000000 01111 Q ss_pred eeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeee-chHHhcCCCCccccccCcch Q lcl|NC_016654. 249 TAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHA-SESVLTNLGMGQGVSLDEEQ 327 (533) Q Consensus 249 ~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v-~~~~l~~~~~~~~~~~d~~~ 327 (533) .+.|+|........++ ...++|. +.+ +++..+.+|...|.+.+.+-.++....+ ...-++... .......... T Consensus 304 ~~P~vP~~g~r~~~d~--~~~~~G~--vr~-~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~-~~~~~~~~~~ 377 (706) T protein:vir:10 304 HIPLIPVYGKRWFIDD--VERVEGH--IAK-AMDPQRLYNLQVSMLADAAAQDPGQTPIVDMEQIRGLE-QHWEGRNRKR 377 (706) T ss_pred ccceEEEeeccccccc--cCcccce--ecc-chhhHHHHHHHHHHHHHHHHhcCCcccccchhHHHHHH-HHhhhccccc Confidence 1233333221111111 1123443 333 6789999999999999887444332221 111110000 0000000000 Q ss_pred hhhhhcc-cccccccc-ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHH Q lcl|NC_016654. 328 EVYSRVG-SGGFNANG-DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKT 405 (533) Q Consensus 328 ~~~~~~~-~~~~~~~~-~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~ 405 (533) ..|..+. .+..++.. .....+..+++.--...+.+.++.....|...+|++...+|-.+ + .||.+|..+....... T Consensus 378 ~~~l~~~~~~~~~g~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~s-n-~SG~Ai~~rq~qg~~~ 455 (706) T protein:vir:10 378 PAFLPLRTVTDKTGNVVAPANVAGYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQMPS-N-VARETVNSLLNRSDMA 455 (706) T ss_pred ccchhcccccCCCCcccccccccccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCcc-c-hHHHHHHHHHHHHHHH Confidence 0111110 00000000 00111122222222345677788888888899999999998644 3 5999999998888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccCCCC----------------------------------CCceeEEEEeCCCC Q lcl|NC_016654. 406 TRAKARHFGSALGPLSTTCLRVDAIKFPGKGA----------------------------------APSEELELEWPKFA 451 (533) Q Consensus 406 ~~~~~~~~~~al~~li~~il~l~~~~~~~~~~----------------------------------~~~~~v~i~f~d~i 451 (533) .......+..+++.+-+.+|.+-...+..... ...++|.|+=..+. T Consensus 456 ~~~~~Dnl~~~~~~~g~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~ 535 (706) T protein:vir:10 456 SFIYLDNMAKSLKRAGEIWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSY 535 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCc Confidence 88888888888888877777664432211000 01123333333344 Q ss_pred CCCHHHHHHHHHHHHhCCC-CCHHHH-----HHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCC- Q lcl|NC_016654. 452 RESDLAKAQTVQAWSVASA-ASTKTK-----VAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPAT- 524 (533) Q Consensus 452 ~~d~~e~a~~~~~l~~aGi-~S~et~-----v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~- 524 (533) +.-+.+..+.++.+..++. ....+. +-++. ++ .-+++-+++|++......+. .+....+....- T Consensus 536 ~t~r~~~~~~m~el~~~~~p~~~~~~~l~~~~~~~~-d~--p~~~e~~e~irk~~~~q~~~------~~~~~~eq~~~~q 606 (706) T protein:vir:10 536 SARRDATVNALTQLLQGMLPQDPMRPALMGIIIDNM-EG--EGLDDFKAFNRRQLLTQGIV------KPRNQQEQAIVQQ 606 (706) T ss_pred chHHHHHHHHHHHHHHhcCCcchhhHHHHHHHHhhc-Cc--cchHHHHHHHHHhhcccCCc------cccchhHHHHHHH Confidence 4446777777777765442 211221 11111 22 22444556665544222110 000000000000 Q ss_pred ---CCCCCCCCC Q lcl|NC_016654. 525 ---DPEAVDEGE 533 (533) Q Consensus 525 ---~~~~~~d~~ 533 (533) ....+.+-+ T Consensus 607 ~qq~q~~q~~~~ 618 (706) T protein:vir:10 607 AQQAQATQPDPN 618 (706) T ss_pred HHHHHHHHHHHH Confidence 000000000 No 147 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=98.03 E-value=6.6e-06 Score=48.98 Aligned_cols=391 Identities=9% Similarity=0.006 Sum_probs=162.0 Q ss_pred CCCCCC-cCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCC-cccceee Q lcl|NC_016654. 1 MSLPEA-NTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATG-RAPKRYH 78 (533) Q Consensus 1 ~~~~~~-~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~~~~~~~ 78 (533) |-.=.. ..++.+-.. .......|+...+.... ....-+. T Consensus 1 m~f~~~~~~~~~~~~~---------------------------------------~~~~~~~~~g~~~~~~~v~~~~al~ 41 (409) T protein:vir:10 1 MLFRKGFKNQSQEISI---------------------------------------DDKKILEWLGINPSETYVNGKSCLK 41 (409) T ss_pred CcccccccCcCCCCCC---------------------------------------ChHHHHHHhcCCcCcceechhhhhc Confidence 222111 001111000 00111222222111000 0001112 Q ss_pred cChHHHHHHHHHHhhcCCCceEeeC-C-CchHHHHHHHHHHhh--cc---HHHHHHHHHHHHhhhCCEEEEEEEcCCCCC Q lcl|NC_016654. 79 APIPGVIAKLSTTELFSEQLKFLDA-G-KSKEVQARADLIFNT--PR---FHSSLVEAGESCSALSGSFQRIVWDPTIAD 151 (533) Q Consensus 79 ~n~~k~i~~~~a~ll~~e~~~i~~~-~-~~~~~~~~l~~i~~~--n~---f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~ 151 (533) ..-....++.+|+-+.+=|..+--. + ........+..+|.. |. ...-+...+...+..|.+|+.+..|..+. T Consensus 42 ~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~- 120 (409) T protein:vir:10 42 QATVFGCIRILSDNISKLPIKIYQKKDGIKRVPDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGE- 120 (409) T ss_pred cHHHHHHHHHHHHhhhhCceEEEEecCCeeeccCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCc- Confidence 2222334444444444334333111 1 111111223333321 11 12334455666778899999998887654 Q ss_pred ceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccc Q lcl|NC_016654. 152 NAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVE 231 (533) Q Consensus 152 ~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~ 231 (533) -+.+-.++|+++-++.++..... ..+.+.|..... .|....+ T Consensus 121 ~~~L~~i~~~~V~v~~~~~~~~~----------------------~~~~~~y~~~~~----~g~~~~~------------ 162 (409) T protein:vir:10 121 IKGLYPLKSDGMKIFVDDTGLLN----------------------SENNVWYLYTDD----LGQRHKF------------ 162 (409) T ss_pred EEEEEEEcCCceEEEEcCCcccc----------------------ccceEEEEEEeC----CceeEEe------------ Confidence 34666788888777654422110 011111111100 0111100 Q ss_pred ccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCc-ceeeech Q lcl|NC_016654. 232 GADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGA-GKVHASE 309 (533) Q Consensus 232 ~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~-~~i~v~~ 309 (533) ..--+.|+.+... . ...|.|.+..+. ..|. ++....++... |+.+. ..-++ T Consensus 163 --------------~~~evih~r~~~~----d-----~~~G~s~i~~~~-~~i~-~~~~~~~~~~~~f~ng~~~~gil-- 215 (409) T protein:vir:10 163 --------------MSDEILHFKGLTA----D-----GLAGLSVIELLN-HLIE-NGKSSETYLNNFFKNGLQVKGLV-- 215 (409) T ss_pred --------------ccccEEEecCcCC----C-----CcccccHHHHHH-HHHH-HHHHHHHHHHHHHhccCCCcEEE-- Confidence 0001223222111 1 235777766533 4443 33333444433 35433 22222 Q ss_pred HHhcCCCCccccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccC Q lcl|NC_016654. 310 SVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLS 385 (533) Q Consensus 310 ~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~ 385 (533) ...+.-.....+.....|.....+..++++ +....++.++......++++..+...++|+...|+||..+|.. T Consensus 216 ---~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~ 292 (409) T protein:vir:10 216 ---QYAGDLNPEAEEVFKENFERMSSGLKNAHRIAMLPIGYKFEPISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDL 292 (409) T ss_pred ---EcCCCCCHHHHHHHHHHHHHHhccccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCC Confidence 111110000001111112111111111111 1122355666666677888888889999999999999999865 Q ss_pred CCcc-hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC-CCCCCceeEEEEeCCCCCCCHHHHHHHHH Q lcl|NC_016654. 386 DEVA-QTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPG-KGAAPSEELELEWPKFARESDLAKAQTVQ 463 (533) Q Consensus 386 ~~~~-~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~-~~~~~~~~v~i~f~d~i~~d~~e~a~~~~ 463 (533) .++. .++++. ....++.+|..+++.|....+..+.. ........+.++++.-.-.|..++++.+. T Consensus 293 ~~~~~~~~e~~-------------~~~f~~~~l~P~~~~ie~~ln~kL~~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~ 359 (409) T protein:vir:10 293 DRATHSNITEQ-------------NREFYIDTLQSILNMYELEINYKLFLISEIKNGFYSKFNVDTILRADIKTRYESYK 359 (409) T ss_pred CCCccccHHHH-------------HHHHHHHHHHHHHHHHHHHHHHhhcCchhccCCcEEEEechhhhccCHHHHHHHHH Confidence 4432 232222 22344455555555543332322221 11122345777777777889999999999 Q ss_pred HHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 464 AWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 464 ~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +++.+|+|+..++.+.+ ++.. + |.+ +..-...+..|..+ .++ +....|| T Consensus 360 ~~~~~G~~T~NE~R~~l--gl~p---------~--~gg---D~~~~~~n~~~~~~----~~~-~~~kgGe 408 (409) T protein:vir:10 360 EAIQNGFKTPNEIRELE--EDEP---------L--EGG---DVLLINGNMIPVKM----AGE-QYSKGGE 408 (409) T ss_pred HHHhCCCcCHHHHHHHh--CCCC---------C--CCc---CeeeeccCccchhh----ccc-cccccCC Confidence 99999999999976654 2221 0 000 00000011111111 111 1112233 No 148 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=98.01 E-value=7.2e-06 Score=48.77 Aligned_cols=460 Identities=12% Similarity=0.066 Sum_probs=186.5 Q ss_pred CcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCC----cccceeecChHHHH Q lcl|NC_016654. 10 WPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATG----RAPKRYHAPIPGVI 85 (533) Q Consensus 10 ~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g----~~~~~~~~n~~k~i 85 (533) ==...+...+..+...|..|...-.++.+|.- -+..+++.. ....+ .++.++--+-+... T Consensus 1 ~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~l---------------P~~~~~~~~-~~~~~~~~~~~~~~i~dst~~~a 64 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNVEQIWDCIRKYIM---------------PMRSDFFSD-LRSEGSINWNQNREVFDSTAGDG 64 (547) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHHHHHhc---------------ccccccccC-CCCCcccccccccccccchHHHH Confidence 00111112222223333322222222211110 000111211 11111 23455666788888 Q ss_pred HHHHHHhhcCC--C---ceEeeC--C----CchHHHHH-------HHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcC Q lcl|NC_016654. 86 AKLSTTELFSE--Q---LKFLDA--G----KSKEVQAR-------ADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDP 147 (533) Q Consensus 86 ~~~~a~ll~~e--~---~~i~~~--~----~~~~~~~~-------l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~ 147 (533) |+.+|+-|.+- | +.|... + +...++++ +.+.+..++|...+.++.....+.|.+.+++--|+ T Consensus 65 ~~~Las~L~~~ltPp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d~ 144 (547) T protein:vir:10 65 LETLSSSLHGSLTSPATKWFELAFRDKELNSDDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEEDE 144 (547) T ss_pred HHHHHHHHHHhhcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccCC Confidence 88888776553 1 334322 1 11233333 44578888999999999999999999988876665 Q ss_pred CCCCceEEEEEcCCeEEEEEe-cCCceEEEEEEEEee---------------------cCCceEEEEEEEecCeeEEEEE Q lcl|NC_016654. 148 TIADNAWIDFVDADRAIPEFR-WGRLVAVTFWSELAG---------------------GDGQEVWRHLERHESGYIVHAV 205 (533) Q Consensus 148 ~~~~~~~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~---------------------~~~~~~y~~lE~h~~~~I~~~~ 205 (533) +..+.+++..++...++..-+ +|++..|....+++. .+....... ..|.+.+ T Consensus 145 ~~~~~~r~~~~pl~~~~v~~d~~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~------~~v~~~v 218 (547) T protein:vir:10 145 DEEGSVVFQSSPIQDSYFEEDSRGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALK------QEVVMCV 218 (547) T ss_pred CCCCceeEEEeecceEEEeeCCCcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccce------EEEEEEE Confidence 545578899999888877655 477755432111110 000000001 1223333 Q ss_pred EeccCC---cccce-eehhhccccccccccccccCCc-e-eecCCCccceeEEecCCcccccccccccccccccchhhhh Q lcl|NC_016654. 206 YKGTAT---SLGWM-MALTDHPATRDIAVEGADEGRG-A-YVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTD 279 (533) Q Consensus 206 y~~~~~---~lG~~-v~l~~~~~~~~~~~~~~~~~~~-~-~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~ 279 (533) |...+. ..+.. +.....+ +...+.+..+. . .-+.|.. -+.|++. +|... ..+.||+|-...+ T Consensus 219 ~~~~~~~~~~~~~~~~~~~~~p----~~s~~~e~~~~~~~l~esg~~--e~P~~~~-----Rw~~~-~ge~YGrgp~~~~ 286 (547) T protein:vir:10 219 FTRYDKKQNRNAGTVLAPTERP----FGKKWILKEGAVQLGEEGGYY--EMPAYAI-----RWRKS-AGSQWGFGPSHLA 286 (547) T ss_pred eeccCCCCCccccceeeccccc----eeEEEEEecCceeeeecCCcc--cCCeeee-----eeeec-CCcccccchHHHH Confidence 332211 11100 0000000 00001110100 0 0112221 1122222 24322 2478899987776 Q ss_pred HHHHHHHHHHHHHHHHHHHH-hCcceeeechH-HhcCCCCccccccCcchhhhhhccccccccccccccceeeechhhhh Q lcl|NC_016654. 280 LFPTFHELDRIYSSLMRDFR-IGAGKVHASES-VLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRV 357 (533) Q Consensus 280 i~~lid~lD~~~s~~~~~~~-~~~~~i~v~~~-~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~ 357 (533) + +-+..|+..--....... ..++.+.||++ ++.+.... +....+. + ...+.++ ++ ..+++.+ T Consensus 287 l-~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~~~~------pgg~~~~----~---~~~~v~p-l~-~~~~~~~ 350 (547) T protein:vir:10 287 L-PDVLTANRYVELVLRSSEKVIDPAIMVTERGLISDIDLG------ASGLTVV----R---DMESMKP-FE-SRARFDV 350 (547) T ss_pred H-HHHHHHHHHHHHHHHHHHHHhcCceecccccccccceec------CCeeeec----C---Cccccee-ee-cccchHH Confidence 6 566888877776666653 34555556543 33221111 1100110 0 0111111 21 2223222 Q ss_pred -HHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhccCC Q lcl|NC_016654. 358 -LEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHF-GSALGPLSTTCLRVDAIKFPGK 435 (533) Q Consensus 358 -e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~-~~al~~li~~il~l~~~~~~~~ 435 (533) .+-++.+..-++...+... |....+...|||||..+.+...+..+-.-..+ ...|..++..++.+... .|. T Consensus 351 ~~~~i~~~~~rI~~af~~d~-----~~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r--~g~ 423 (547) T protein:vir:10 351 SSIQLTDLRSAVRRIYYVDQ-----LQMKDSPAMTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRFR--AGK 423 (547) T ss_pred HHHHHHHHHHHHHHHhhhhh-----hhcCCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh--cCC Confidence 2233333333343333222 22234456799999998887776655543333 34555566555554321 111 Q ss_pred CCC--------CceeEEEEeCCCCCCCH-HHH-------HHHHHHHHhCC-----CCCHHHHHHHh---CC-----CCCH Q lcl|NC_016654. 436 GAA--------PSEELELEWPKFARESD-LAK-------AQTVQAWSVAS-----AASTKTKVAYL---HE-----DWDD 486 (533) Q Consensus 436 ~~~--------~~~~v~i~f~d~i~~d~-~e~-------a~~~~~l~~aG-----i~S~et~v~~l---~~-----~~~d 486 (533) -+. ....+.|++-..+.... .+. ++.+..+-+++ .+....+++.+ .+ -.++ T Consensus 424 lP~~p~~l~~~~~~~~~v~~is~Laraq~~~~~~~i~~~~~~v~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~ 503 (547) T protein:vir:10 424 LGELPSKLLESGKAAMDIVYTGPLSRAQKIDQAASIERWAGSTAQLAEINPEVLDIPDWDEMVRMLGSLLGAPQTLMRPK 503 (547) T ss_pred CCCCchhhhccCcceEEEEeccHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhhcCCHHHHHHHHHHHhCCChhccCCH Confidence 111 12345566654443321 111 11111111111 13344444332 11 0255 Q ss_pred HHHHHHHHHHHHh-hhccc-Ccccccc-ccCCCCCCCCCC-CCCC Q lcl|NC_016654. 487 ERVQEEADLIDNA-NTVSA-PTFGFGT-DQPPLPTENDPA-TDPE 527 (533) Q Consensus 487 ee~~~El~rI~~E-~~~~~-~~~~~~~-~~~~~~~~~~~~-~~~~ 527 (533) +|+++..++-++. |.+.. ......+ .+..... ..+. .+++ T Consensus 504 eev~~~r~qr~~~~q~~~qaa~~~~~g~~m~~~~~-~~a~~~~~~ 547 (547) T protein:vir:10 504 AKVTSIRKNRSQTQQKAEQAAIAEAEGNAMEAQGK-GQAALKENQ 547 (547) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC-cccchhccC Confidence 5555433321111 11000 0001111 1111111 1111 1112 No 149 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=97.97 E-value=8.5e-06 Score=48.38 Aligned_cols=468 Identities=10% Similarity=0.013 Sum_probs=180.9 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |+-+.--..+.=......+..++..|..|..--.++.+|... ..+.......+.+..++.-. T Consensus 1 ~~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP------------------~~~~~~~~~~~~~~~~~~ds 62 (535) T protein:vir:94 1 MASSQKREGFAENGAKAVYDALKNDRNSYETRAENCAKYTIP------------------SLFPKDSDNASTDYTTPWQA 62 (535) T ss_pred CCchhhhhhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcc------------------ccCCCCCCccccccCCcccc Confidence 555543333333334444444444444444333333332211 10111111122333456667 Q ss_pred hHHHHHHHHHHhhcCC----CceEeeCCCc-------------hHHHHHHHH-------HHhhccHHHHHHHHHHHHhhh Q lcl|NC_016654. 81 IPGVIAKLSTTELFSE----QLKFLDAGKS-------------KEVQARADL-------IFNTPRFHSSLVEAGESCSAL 136 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e----~~~i~~~~~~-------------~~~~~~l~~-------i~~~n~f~~~~~~~~~~~~~~ 136 (533) -+...|+.+|+.|.+- .+.|.....+ ..++++|.+ .+..++|...+.++.+...+. T Consensus 63 t~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~ 142 (535) T protein:vir:94 63 VGARGLNNLASKLMLALFPMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVA 142 (535) T ss_pred cHHHHHHHHHHHHHhhhcCCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhh Confidence 7777888888766543 2444432221 124444444 477889999999999999999 Q ss_pred CCEEEEEEEcCCCCCceEEEEEcCCeEEEEEe-cCCceEEEEEEEEeecC-----CceEEEEEEEe--cCeeEEEEEEec Q lcl|NC_016654. 137 SGSFQRIVWDPTIADNAWIDFVDADRAIPEFR-WGRLVAVTFWSELAGGD-----GQEVWRHLERH--ESGYIVHAVYKG 208 (533) Q Consensus 137 G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~~~-----~~~~y~~lE~h--~~~~I~~~~y~~ 208 (533) |.+.+++ +++.+..+++..++-..++-.-+ +|++..|++-.++.... ++.+....+.. +.-.|.+.+|.- T Consensus 143 G~a~l~~--~~~~~~~~~f~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~~~v~~~ 220 (535) T protein:vir:94 143 GNALLYI--PEPEGTYNPMKLYRLSSYVVQRDAFGTVLQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTHIYLD 220 (535) T ss_pred CcEeEee--ccCcCcccceEEEEcCeEEEeeCCCCCeEEEEeeeeccHHHhhHHHHHHHHhccccCCCceeEEEEEEEee Confidence 9998764 44433446677788777666544 58887777544432100 00000000000 000111122221 Q ss_pred cCCcccceeehhhccccccccccccccCCceeecCCC-ccceeEEecCCcccccccccccccccccchhhhhHHHHHHHH Q lcl|NC_016654. 209 TATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGV-KDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHEL 287 (533) Q Consensus 209 ~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~-~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~l 287 (533) .++. ++..+.+ .......... -+.|. ..||+. . +|... ..+.||+|-...++ +-+..| T Consensus 221 ~~~~-----~~~~~~e---~~g~~~~~~~---~~~g~~~~P~~~---~-----Rw~~~-~ge~YGrgp~~~~l-~D~k~L 279 (535) T protein:vir:94 221 EESG-----EYLKYEE---IDGVEVEGTD---ASYPVDACPYIP---V-----RMVRI-DGESYGRSYCEEYL-GDLRSL 279 (535) T ss_pred CCCC-----cEEEEEE---ecCeeecccc---ccCccccCCcee---e-----eeeec-CCCccccchHHHHH-HHHHHH Confidence 1100 0000000 0000000000 01111 122222 1 23322 24678998777666 556677 Q ss_pred HHHHHHHHHHH-HhCcceeeech-HHhcCCCCcccccc-CcchhhhhhccccccccccccccceeeechhhhhHHHHHHH Q lcl|NC_016654. 288 DRIYSSLMRDF-RIGAGKVHASE-SVLTNLGMGQGVSL-DEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGA 364 (533) Q Consensus 288 D~~~s~~~~~~-~~~~~~i~v~~-~~l~~~~~~~~~~~-d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l 364 (533) +..--...... ...+....|++ ..+.+. .+ +...+.+... ......+..+....+...-...+ T Consensus 280 ~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~------~~~~~~~g~~v~g--------~~~~v~~~~~~~~~~~~~~~~~i 345 (535) T protein:vir:94 280 ENLQEAIVKMSMISAKVIGLVNPAGITQVR------RLTKAQTGDFVSG--------RPEDISFLQLEKAADFSVARAVS 345 (535) T ss_pred HHHHHHHHHHHHHhccCCcccccccccchh------hcccCCCceeecC--------CcccceeeecccccchhHHHHHH Confidence 76544333322 22333323322 222111 11 0011111110 00000011111111112222233 Q ss_pred HHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhccCCC-CCCcee Q lcl|NC_016654. 365 ALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKA-RHFGSALGPLSTTCLRVDAIKFPGKG-AAPSEE 442 (533) Q Consensus 365 ~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~-~~~~~al~~li~~il~l~~~~~~~~~-~~~~~~ 442 (533) +.+-..|.... + ...+....+...|||||..+.+......+-.- +.-...|.-|+..++.+... .|.- ..+.+- T Consensus 346 ~~~~~rI~~af-~-~~~~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r--~g~lP~~p~~~ 421 (535) T protein:vir:94 346 EQIEGRLSYAF-M-LNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLLKQLQA--TNQIPELPKEA 421 (535) T ss_pred HHHHHHHHHHH-h-HhhhccCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHh--CCCCCCCChhh Confidence 33333332211 1 00122223344699999888777665555432 22233444555555544321 1111 223444 Q ss_pred EEEEeCCCCCC-----CHHHHHHHHHHHHhCC------CCCHHHHHHHh---CC----C--CCHHHHHHHHHHHHHhhhc Q lcl|NC_016654. 443 LELEWPKFARE-----SDLAKAQTVQAWSVAS------AASTKTKVAYL---HE----D--WDDERVQEEADLIDNANTV 502 (533) Q Consensus 443 v~i~f~d~i~~-----d~~e~a~~~~~l~~aG------i~S~et~v~~l---~~----~--~~dee~~~El~rI~~E~~~ 502 (533) +.+++--++.. +.+...+.++.+-+.+ .+....+++.+ .+ . -+++|++++.++.++.+++ T Consensus 422 v~~~~vs~la~l~r~~~~~~l~~~~~~laq~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~rs~eev~~~~~q~~~~~~~ 501 (535) T protein:vir:94 422 VEPTISTGMEALGRGQDLDKLERCIAAWSALAPMQGDPDINIATIKLRIANAIGIDTSGILKTPEEKQQEMAEAAQGTAM 501 (535) T ss_pred ccceEeehHHHHHHHHHHHHHHHHHHHHHhhChHHhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHH Confidence 55555322211 1111111122221111 12333333332 11 0 2667777666544333321 Q ss_pred c--cCccccccccCCCCC-CC------CCCCCCC Q lcl|NC_016654. 503 S--APTFGFGTDQPPLPT-EN------DPATDPE 527 (533) Q Consensus 503 ~--~~~~~~~~~~~~~~~-~~------~~~~~~~ 527 (533) . +...+.+.......+ ++ --+..|+ T Consensus 502 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:94 502 QNAAASAGAGAGTMATASPENMKAAAAQAGMAPN 535 (535) T ss_pred HHHHHHHHHhhhcccccChHHHHHHHHHhccCCC Confidence 1 111111111000000 00 1111222 No 150 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=97.95 E-value=9.6e-06 Score=48.10 Aligned_cols=413 Identities=11% Similarity=0.010 Sum_probs=162.2 Q ss_pred CcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCc-chhhHHHHHHHHHHHHHhcccCCCCCcccc---eeecChHHHH Q lcl|NC_016654. 10 WPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGR-TSPSGIKARTKAAYEAFHGRTPTATGRAPK---RYHAPIPGVI 85 (533) Q Consensus 10 ~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~---~~~~n~~k~i 85 (533) -||..- .|=..++..++..... ...............+.+....+..|.... -+...---.. T Consensus 1 ~~~~~~--------------~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~a~~~~aV~~~ 66 (432) T protein:vir:97 1 MPDEKK--------------LGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAAC 66 (432) T ss_pred CCCccc--------------CchhhhhHhhcCCccccccccccccccCchhhhhhcccccccCcccchHhhhcchHHHHH Confidence 444431 2333344444432110 000000000001111112222222222211 1112222234 Q ss_pred HHHHHHhhcCCCceEeeCCC---chHHHHHHHHHHh--hccH---HHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEE Q lcl|NC_016654. 86 AKLSTTELFSEQLKFLDAGK---SKEVQARADLIFN--TPRF---HSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDF 157 (533) Q Consensus 86 ~~~~a~ll~~e~~~i~~~~~---~~~~~~~l~~i~~--~n~f---~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~ 157 (533) ++.+|+-+-+=|..+--... ......-+..+|. -|.. ..-....+...+..|.+|+.+..+. + .-..+.+ T Consensus 67 v~~Ia~~ia~lp~~~y~~~~~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~~-g-~~~~L~~ 144 (432) T protein:vir:97 67 VKLVSQAVAAMPLMMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTD-G-RIESLQY 144 (432) T ss_pred HHHHHHhhccCceEEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecC-C-cEEEEEE Confidence 44445444444543321111 1111122333332 1211 1233444556677899998887763 2 2245566 Q ss_pred EcCCeEEEEEe-cCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccC Q lcl|NC_016654. 158 VDADRAIPEFR-WGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEG 236 (533) Q Consensus 158 v~~~~~~P~~~-~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~ 236 (533) ++|+++.++.+ +|++ +++ +...+ |..+.+ + T Consensus 145 l~p~~v~v~~~~~g~~----~y~-~~~~~----------------------------g~~~~~---~------------- 175 (432) T protein:vir:97 145 LANDRLTITTDTKGNT----AYR-YRRTD----------------------------GQMIDI---P------------- 175 (432) T ss_pred EcCcceEEEEcCCCcE----EEE-EEecC----------------------------ceEEEE---c------------- Confidence 78877777643 2321 111 00001 000000 0 Q ss_pred CceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCcceeeechHHhcCC Q lcl|NC_016654. 237 RGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGAGKVHASESVLTNL 315 (533) Q Consensus 237 ~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~~~i~v~~~~l~~~ 315 (533) .--+.|+.+.. +. ...|.|.+..+. ..| .+.....++... |+.|...-.| +... T Consensus 176 ----------~~~iih~r~~~----~d-----g~~G~spi~~~~-~~i-~~~~a~~~~~~~~f~ng~~~~gi----l~~~ 230 (432) T protein:vir:97 176 ----------RQQIWKIMGYS----LD-----GENGLSAIRYGA-QIF-GTAIAAEAQAARAFRNGQLQSVY----YQID 230 (432) T ss_pred ----------cccEEEecCcC----CC-----CcccccHHHHHH-HHH-HHHHHHHHHHHHHHhccCCccee----EecC Confidence 00022332211 11 124666665433 333 223333444433 3544332111 2211 Q ss_pred CCccccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc-h Q lcl|NC_016654. 316 GMGQGVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA-Q 390 (533) Q Consensus 316 ~~~~~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~-~ 390 (533) +.-.....+.....|.. ..++++ +....++.++......++++..+....+|+...|+||..+|+...+. . T Consensus 231 ~~l~~e~~~~~~~~~~~----~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~ 306 (432) T protein:vir:97 231 RFLTDDQYDSFSKKVSG----SVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTS 306 (432) T ss_pred CCCCHHHHHHHHHHHhh----hhcCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccc Confidence 11000000001111111 111111 11223566666667778888888889999999999999998754432 2 Q ss_pred hHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCC Q lcl|NC_016654. 391 TATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASA 470 (533) Q Consensus 391 Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi 470 (533) ++..++... ...++.+|..+++.+..-.+..+..........+.++++.-+-.|..++++.+.+++++|+ T Consensus 307 ~~s~~e~~~----------~~f~~~tl~P~~~~ie~~ln~kLl~~~e~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~ 376 (432) T protein:vir:97 307 WGSGIESQQ----------LGFLTMTLSPWLRRIEQSIALNLLTPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGL 376 (432) T ss_pred cchhHHHHH----------HHHHHHHHHHHHHHHHHHHhhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCC Confidence 233332221 2233344444444443222222222211222345555556667899999999999999999 Q ss_pred CCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 471 ASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 471 ~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) |+..++.+++ ++..-+-...+ +. -+....| ....+. .+.++....+++.+...+.+ T Consensus 377 ~T~NE~R~~~--glpp~~g~~~~--~~-~~~~~~p-l~~~~~-~~~~~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:97 377 MTRDEAREIE--GLPKLGGNAAV--LT-VQSAMVP-LDSIGL-QASPEPASGLGNQQQDKVSK 432 (432) T ss_pred CCHHHHHHHh--CCCCCCCCcce--Ee-ecccccc-hhhhcc-cCCCCCCCCCCCcccccccC Confidence 9999977654 22210000000 00 0000011 000011 11111111111111222222 No 151 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=97.91 E-value=1.2e-05 Score=47.63 Aligned_cols=469 Identities=9% Similarity=0.003 Sum_probs=184.6 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |.==+ ....|=..+...+..++..|..|...-.++.+|.... .+.......+....++--. T Consensus 1 ~~~~~-~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~------------------~~~~~~~~~~~~~~~~~ds 61 (543) T protein:vir:88 1 MAETK-REGLAEEGAKAVYERLKNDRVPYETRAENCAKVTIPS------------------LFPKDSDNSSTDYTTPWQA 61 (543) T ss_pred Ccccc-cCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccc------------------cCCCCCCcccccccccccc Confidence 32211 2345555566667777777776665444554443221 0111111112223345556 Q ss_pred hHHHHHHHHHHhhcCC----CceEeeCCCc-------------hHHHH-------HHHHHHhhccHHHHHHHHHHHHhhh Q lcl|NC_016654. 81 IPGVIAKLSTTELFSE----QLKFLDAGKS-------------KEVQA-------RADLIFNTPRFHSSLVEAGESCSAL 136 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e----~~~i~~~~~~-------------~~~~~-------~l~~i~~~n~f~~~~~~~~~~~~~~ 136 (533) -+...++.+|+.|.+- .+.|.....+ ..+++ .+...+..++|...+.++.+...+. T Consensus 62 t~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~ 141 (543) T protein:vir:88 62 VGARGLNNLSAKVMLALFPLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALA 141 (543) T ss_pred hHHHHHHHHHHHHHHhhcCCCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhh Confidence 6677777777765543 2344432221 12333 3444677789999999999999999 Q ss_pred CCEEEEEEEcCCCCCceEE---EEEcCCeEEEEEe-cCCceEEEEEEEEeec-------------CCceEEEEEEEecCe Q lcl|NC_016654. 137 SGSFQRIVWDPTIADNAWI---DFVDADRAIPEFR-WGRLVAVTFWSELAGG-------------DGQEVWRHLERHESG 199 (533) Q Consensus 137 G~~~~~~~~D~~~~~~~~i---~~v~~~~~~P~~~-~g~~~~v~f~~~~~~~-------------~~~~~y~~lE~h~~~ 199 (533) |.+.+ |.+++....+++ ..++-..++-..+ +|++..+++..+++.. .++..+. .- T Consensus 142 G~a~l--y~~~~~~~~~~~~~~~~~pl~~y~v~~d~~G~v~~i~r~~~~~~~~l~~~~~~~v~~~~~~~p~~------~~ 213 (543) T protein:vir:88 142 GTALI--YLPPPDASSNSYNPMKLYTLHNHVVQRDAFGNVLQIVTLDKVAYAALPEDVRNSLSGGQEYKPEQ------EL 213 (543) T ss_pred Cceee--eeccCccccceecceEEeEcceEEEeeCCCCCeeeeeeeeeccHHHHhHHhhHHHHHHhhcCCcc------ce Confidence 99986 455554433333 3344344333323 4777666654443210 0111111 12 Q ss_pred eEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhh Q lcl|NC_016654. 200 YIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTD 279 (533) Q Consensus 200 ~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~ 279 (533) .|.|.+|.-.+.. +. ..+..+........+..... ...| |++. +|... ..+.||+|-...+ T Consensus 214 ~v~~~V~pr~~~~---~~-----~~~~~~~~~~v~~~~~~~~~--~e~P---~i~~-----Rw~~~-~ge~YGrgp~~~~ 274 (543) T protein:vir:88 214 EVYTHIYIDDESG---DF-----LSYQEIEGVEVDGSDGQYPQ--DALP---WIAV-----RWTKR-DGEHYGRSHVEEY 274 (543) T ss_pred EEEEEEEeecCCC---cc-----cccccccCeeeecCCCcccc--ccCC---ceee-----eeeec-CCCccccchHHHH Confidence 3333344221110 00 00111111111111100000 0112 2221 24322 2477899987776 Q ss_pred HHHHHHHHHHHHHHHHHHH-HhCcceeeechHHhcCCCCccccccCc-chhhhhhccccccccccccccceeee-chhhh Q lcl|NC_016654. 280 LFPTFHELDRIYSSLMRDF-RIGAGKVHASESVLTNLGMGQGVSLDE-EQEVYSRVGSGGFNANGDMETIFEFF-QPAIR 356 (533) Q Consensus 280 i~~lid~lD~~~s~~~~~~-~~~~~~i~v~~~~l~~~~~~~~~~~d~-~~~~~~~~~~~~~~~~~~~~~~i~~~-~~~ir 356 (533) + +-+..|+..--...... ...++...||++.. .+...+.+ ..+.+... ..+ ....++.. ++++ T Consensus 275 l-~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~-----~~~~~~~~~~~g~~v~g------~~~-~v~~~~~~~~~~~- 340 (543) T protein:vir:88 275 L-GDLNSLESLNEAMIKFAMISSKVVGLVNPNGI-----TQVRRLVKAQTGDFVAG------RKA-DIEFLQLEKTADF- 340 (543) T ss_pred H-HHHHHHHHHHHHHHHHHHHHhcCceeeccccc-----cchhhcccCCCceeecC------CCC-cceeeecccccch- Confidence 6 56678877665555544 44555556654422 11111111 11111110 001 11111111 1122 Q ss_pred hHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhccCC Q lcl|NC_016654. 357 VLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKAR-HFGSALGPLSTTCLRVDAIKFPGK 435 (533) Q Consensus 357 ~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~-~~~~al~~li~~il~l~~~~~~~~ 435 (533) ......|+.+-..|....=+. .+....+...|||||..+.+......+-.-. .-...|.-|+..++.+.... +-. T Consensus 341 -~~~~~~i~~~~~rI~~af~~~--~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~-g~l 416 (543) T protein:vir:88 341 -TVAKSVADAIEARLSYVFMLN--SAVQRSGERVTAEEIRYVASELEDTLGGVYSILSQELQLPIVRVLLNQLQAT-QQI 416 (543) T ss_pred -hHHHHHHHHHHHHHHHHHhhh--hhccCCCCcccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhc-CCC Confidence 222333333333332222011 1222334456999998888776665554322 22334445555544442210 111 Q ss_pred CCCCceeEEEEeCCCCCC-CHHHHHHHHHHHH-hCCC---------CCHHHHHHHhC--CCC-------CHHHHHHHHHH Q lcl|NC_016654. 436 GAAPSEELELEWPKFARE-SDLAKAQTVQAWS-VASA---------ASTKTKVAYLH--EDW-------DDERVQEEADL 495 (533) Q Consensus 436 ~~~~~~~v~i~f~d~i~~-d~~e~a~~~~~l~-~aGi---------~S~et~v~~l~--~~~-------~dee~~~El~r 495 (533) ...+...+++++--++.. .+...++.+.... ..|. +..+.+++.+. -++ +++|++++.++ T Consensus 417 P~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~v~~~~~p~vld~id~d~~~~~~a~~~Gv~~~~i~r~~~e~~~~~~q 496 (543) T protein:vir:88 417 PNLPQEAVEPTVTTGAEALGRGQDLDKLTQFLNAVATVSQLNGDPDLNVNNIKLRLANAIGIDTAGLLLTEAEKAQAQSQ 496 (543) T ss_pred CCCchhceeeeEEecHHHHHHHHHHHHHHHHHHHHHhccchhhhccCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHH Confidence 123444566665433221 1111111111111 1122 22333343321 012 44445443332 Q ss_pred HHHhhhcccCccccccccCCCCCCCCC-------CCCCCCCCCCC Q lcl|NC_016654. 496 IDNANTVSAPTFGFGTDQPPLPTENDP-------ATDPEAVDEGE 533 (533) Q Consensus 496 I~~E~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~d~~ 533 (533) -+.+++...-....++........+.. .-+-+..+.|- T Consensus 497 ~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~ 541 (543) T protein:vir:88 497 EMLKQGGLNAAAGIGSGVAAQATASPEAMESAMDTAGVQPGPIAT 541 (543) T ss_pred HHHHHHHHHHHHHHhhchhhhhccChHHHHHHhhhcCCCCCCCCC Confidence 111111111111111111111111100 00011111111 No 152 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=97.91 E-value=1.2e-05 Score=47.63 Aligned_cols=412 Identities=8% Similarity=-0.014 Sum_probs=163.9 Q ss_pred hhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCccc---ceeecChHHHHHHHHHHhhcCCCceE Q lcl|NC_016654. 24 ESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAP---KRYHAPIPGVIAKLSTTELFSEQLKF 100 (533) Q Consensus 24 ~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~---~~~~~n~~k~i~~~~a~ll~~e~~~i 100 (533) .|.-|-++...+- .....+...+.. ..+.....|++.. ..|... .-+.+.--...++.+|+-+-+=|..+ T Consensus 1 ~~~~~~~~~~~~~-----~~~~~~~~~~~~-~~~~~~~~~~g~~-~~g~~v~~~~al~~~~V~~~v~~Ia~~iA~lp~~~ 73 (454) T protein:vir:93 1 MWNLLRRTRKNQK-----SGRDVREAGWTS-LFQAVAEPFAGAW-QQGVKADPEAVLSFHAVFACISLISQDIAKMRLRL 73 (454) T ss_pred CCCccccCccccc-----ccccccchhhhh-hhhhhhhhhcchh-hcCcccChHHhhccHHHHHHHHHHHHhhccCceEE Confidence 3333333221100 000011111111 1122222222211 111110 01111112224455555554445444 Q ss_pred eeCC---C-chHHHHHHHHHHhhccHH----HHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEec-CC Q lcl|NC_016654. 101 LDAG---K-SKEVQARADLIFNTPRFH----SSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRW-GR 171 (533) Q Consensus 101 ~~~~---~-~~~~~~~l~~i~~~n~f~----~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~-g~ 171 (533) --.. . .......+..++..-+-. .-+..++...+..|.+|+.+..|..+. -..+-.++|+++-++.+. |. T Consensus 74 ~~~~~~g~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~-~~~L~~i~~~~v~v~~~~~g~ 152 (454) T protein:vir:93 74 MQTDAQGIRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQ-IKELRILDWNRVEPLVADDGE 152 (454) T ss_pred EEeccCCccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCc-EEEEEEEcCcceEEEEcCCCc Confidence 2111 1 111112233333321111 334444556778899999988876543 235667788877776543 32 Q ss_pred ceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeE Q lcl|NC_016654. 172 LVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAA 251 (533) Q Consensus 172 ~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 251 (533) + + |+ +........|..+.+ ..--+. T Consensus 153 ~----~------------y~-------------~~~~~~~~~~~~~~~--------------------------~~~eVi 177 (454) T protein:vir:93 153 V----F------------YR-------------ITPDRNCGITEAVTV--------------------------PAREVI 177 (454) T ss_pred E----E------------EE-------------EEeccccccceeEEe--------------------------cCcceE Confidence 2 1 11 000000000000000 000122 Q ss_pred EecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCc-ceeeechHHhcCCCCccccccCcchhh Q lcl|NC_016654. 252 YVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGA-GKVHASESVLTNLGMGQGVSLDEEQEV 329 (533) Q Consensus 252 ~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~-~~i~v~~~~l~~~~~~~~~~~d~~~~~ 329 (533) |+..... ....+|.|.+..+. ..+. +......+... |+.|. ..-++ ...+.-.....+..... T Consensus 178 H~k~~~~--------~~~~~G~sp~~~~~-~~i~-~~~~~~~~~~~~f~ng~~p~gil-----~~~~~l~~e~~~~~~~~ 242 (454) T protein:vir:93 178 HDRFNCF--------FHPLIGLPPVYAAG-LAAT-QGHHIQENSTSFFRNGGRPSGVI-----EIPGSITEENAKKLKSN 242 (454) T ss_pred EeccCCC--------CCCceeccHHHHHH-HHHH-HHHHHHHHHHHHHhccCCccEEE-----ecCCCCCHHHHHHHHHH Confidence 3321110 01335777766543 3342 33444444433 35432 22222 11111000000111112 Q ss_pred hhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcch-hHHHHHHHhhhHHH Q lcl|NC_016654. 330 YSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQ-TATEASGKKDLTVK 404 (533) Q Consensus 330 ~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~-Tatai~~~~~~l~~ 404 (533) +.....+ .+.++ +....++.++......++++..+....+|+...|+||..+|+..+..- ++++. T Consensus 243 ~~~~~~g-~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~--------- 312 (454) T protein:vir:93 243 WDSGYTG-ENAGKTAILSNGAKYNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEAL--------- 312 (454) T ss_pred HHHHhcc-cccCCceeccCCceEEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHH--------- Confidence 2222111 11111 122245556655566788888888889999999999999986544321 22222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCC Q lcl|NC_016654. 405 TTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDW 484 (533) Q Consensus 405 ~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~ 484 (533) .+..++.+|.-++..+-...+..+.. .....+.+++++-+..|..++++.+.+++.+|+|+..++.+++ ++ T Consensus 313 ----~~~f~~~~l~P~~~~ie~~ln~~L~~---~~~~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~--gl 383 (454) T protein:vir:93 313 ----EQQYYSQCLQTLIESIELLLDEALET---GENESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRE--NL 383 (454) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHhhcC---CCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh--CC Confidence 22233344444444332222222211 1234567777777788999999999999999999999977654 22 Q ss_pred CHH-HHHH--------HHHHHHHhhhcccCccccc--cccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 485 DDE-RVQE--------EADLIDNANTVSAPTFGFG--TDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 485 ~de-e~~~--------El~rI~~E~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ..= .-++ -+..+.+.+....|....+ ...++... ..+.+....+.+.+ T Consensus 384 ~pi~ggD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~d~~~~~~e~~~d 442 (454) T protein:vir:93 384 PPLAGGDALYLQQQNYSLEALSRRDAREDPFASSGKTASVPQAVA-ASDGNKAITETEHD 442 (454) T ss_pred CCCCCCCeeeeccCccchHhhhccCcccCCCCCCccCCCCCCCCC-CCCCCCCccCCccc Confidence 110 0000 0111111111111111111 11111111 11122222222222 No 153 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=97.89 E-value=1.2e-05 Score=47.47 Aligned_cols=404 Identities=10% Similarity=0.073 Sum_probs=169.8 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHH--HHHhcccCCCCCcccceee Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAY--EAFHGRTPTATGRAPKRYH 78 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~g~~~~~~~ 78 (533) |--|+ +.-.+...+-||. +|..++.+.....+..- ...+.. ..|+.+.. +. ...-+. T Consensus 1 ~~~~~------------~~~~~~~~~g~~~----~~~~~f~~~~~~~~~~~--~~~~~~~~~~~~~~~~-v~--~~~al~ 59 (424) T protein:vir:18 1 MEEPK------------YTIDLRTNNGWWA----RLKSWFVGGRLVTPNQG--SQTGPVSAHGYLGDSS-IN--DERILQ 59 (424) T ss_pred CCCCc------------cccccCCCCchHH----HHHhhccccccccccch--hhcccccccccccccc-cc--HHHhhc Confidence 33332 2223344444442 23333322211111110 000000 00111111 10 011122 Q ss_pred cChHHHHHHHHHHhhcCCCceEee--CCCc-h--HHHHHHHHHHhh--c---cHHHHHHHHHHHHhhhCCEEEEEEEcCC Q lcl|NC_016654. 79 APIPGVIAKLSTTELFSEQLKFLD--AGKS-K--EVQARADLIFNT--P---RFHSSLVEAGESCSALSGSFQRIVWDPT 148 (533) Q Consensus 79 ~n~~k~i~~~~a~ll~~e~~~i~~--~~~~-~--~~~~~l~~i~~~--n---~f~~~~~~~~~~~~~~G~~~~~~~~D~~ 148 (533) +.---..++.+|+-+-+=|..+-- .+.. . .....+..+|.. | .-..-....+...+..|.+|+.+..+.. T Consensus 60 ~~~v~~cv~~Ia~~iA~lp~~vy~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~ 139 (424) T protein:vir:18 60 ISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA 139 (424) T ss_pred cHHHHHHHHHHHHhhccCceEEEEeccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC Confidence 222334566666665554544311 1110 0 011223444432 1 1112234445566778999998877765 Q ss_pred CCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccc Q lcl|NC_016654. 149 IADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDI 228 (533) Q Consensus 149 ~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~ 228 (533) +. -+.+-.++|..+.+..+.+++ +++ +... |..+.+. T Consensus 140 G~-~~~L~~l~~~~v~v~~~~~~~----~y~-~~~~-----------------------------g~~~~~~-------- 176 (424) T protein:vir:18 140 GD-VISLLPLQSANMDVKLVGKKV----VYR-YQRD-----------------------------SEYADFS-------- 176 (424) T ss_pred Cc-EEEEEEecCcceEEEEcCCeE----EEE-EEeC-----------------------------CeEEEec-------- Confidence 43 345666777777665443332 111 1110 0000000 Q ss_pred cccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhC-cceee Q lcl|NC_016654. 229 AVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIG-AGKVH 306 (533) Q Consensus 229 ~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~-~~~i~ 306 (533) .--+.|+.+.. +. ...|.|.+..+. ..| .++....++... |+.| +...+ T Consensus 177 ------------------~~eVihir~~~----~d-----g~~G~spi~~~~-~~i-~~~~~~~~~~~~~f~ng~~~~gi 227 (424) T protein:vir:18 177 ------------------QKEIFHLKGFG----FT-----GLVGLSPIAFAC-KSA-GVAVAMEDQQRDFFANGAKSPQI 227 (424) T ss_pred ------------------cccEEEecCcC----CC-----CcccccHHHHHH-HHH-HHHHHHHHHHHHHHhccCCcceE Confidence 00122332211 11 234666665433 333 234444444444 3443 33333 Q ss_pred e--chHHhcCCCCccccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChh Q lcl|NC_016654. 307 A--SESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPV 380 (533) Q Consensus 307 v--~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~ 380 (533) + |..++.. .. .+.-...+.....+ .++++ +....++.++......++++..+...++|+...|+||. T Consensus 228 l~~~~~~l~~---e~---~~~~~~~~~~~~~~-~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~ 300 (424) T protein:vir:18 228 LSTGEKVLTE---QQ---RSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPH 300 (424) T ss_pred EEeCCcCCCH---HH---HHHHHHHHHHHhCC-cccCCceeccCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHH Confidence 2 2221110 00 00011112211111 11111 11223566666666778888888888999999999999 Q ss_pred hcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHH Q lcl|NC_016654. 381 SLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQ 460 (533) Q Consensus 381 ~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~ 460 (533) .+|+..++..++..++.... ..++.+|..+++.+..-.+..+..........+.++++.-+..|..++++ T Consensus 301 ~lg~~~~~t~~~sn~eq~~~----------~f~~~tl~P~~~~ie~~ln~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~ 370 (424) T protein:vir:18 301 LVGDVEKSTSWGSGIEQQNL----------GFLQYTLQPYISRWENSIQRWLIPSKDVGRLHAEHNLDGLLRGDSASRAA 370 (424) T ss_pred HhCCCCCcccccccHHHHHH----------HHHHHHHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHH Confidence 99976655443343332222 23334444444444332222222222222345777777888889999999 Q ss_pred HHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 461 TVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 461 ~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) .+.+++.+|+|+..++.+.+ +... | +.+ +..-...+..|..+ .+.+++..+.|= T Consensus 371 ~~~~~~~~G~~T~NE~R~~~--gl~p---------i--~gg---D~~~~~~n~~~l~~---~~~~~~~~~n~a 424 (424) T protein:vir:18 371 FMKAMGESGLRTINEMRRTD--NMPP---------L--PGG---DVAMRQAQYVPITD---LGTNKEPRNNGA 424 (424) T ss_pred HHHHHHhCCCcCHHHHHHHh--CCCC---------C--CCc---CeeeeccCccchhh---hhccCCccccCC Confidence 99999999999999866654 2221 0 000 00000011111100 111112222222 No 154 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=97.88 E-value=1.3e-05 Score=47.36 Aligned_cols=413 Identities=11% Similarity=0.013 Sum_probs=162.3 Q ss_pred CcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcc-hhhHHHHHHHHHHHHHhcccCCCCCccc---ceeecChHHHH Q lcl|NC_016654. 10 WPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRT-SPSGIKARTKAAYEAFHGRTPTATGRAP---KRYHAPIPGVI 85 (533) Q Consensus 10 ~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~g~~~---~~~~~n~~k~i 85 (533) -||..- .|=.+++..++.....- ..............+.+....+..|... .-+...---.. T Consensus 1 ~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~~~al~~~~V~~~ 66 (432) T protein:vir:10 1 MPDEKK--------------LGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAAC 66 (432) T ss_pred CCCCcc--------------cchhhhhHhhcCCccccccccccccccCcchhhhhcccccccCcccchhhhhcchHHHHH Confidence 444441 12233343343321110 0000000000011111111111122211 11122222334 Q ss_pred HHHHHHhhcCCCceEeeCCC---chHHHHHHHHHHh--hccH---HHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEE Q lcl|NC_016654. 86 AKLSTTELFSEQLKFLDAGK---SKEVQARADLIFN--TPRF---HSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDF 157 (533) Q Consensus 86 ~~~~a~ll~~e~~~i~~~~~---~~~~~~~l~~i~~--~n~f---~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~ 157 (533) ++.+|+-+-+=|..+--... ......-+..+|. -|.+ ...+...+...+..|.+|+.+..+. + .-+.+.. T Consensus 67 i~~Ia~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~~-g-~~~~L~~ 144 (432) T protein:vir:10 67 VKLVSQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTD-G-RIESLQY 144 (432) T ss_pred HHHHHHhhhhCceeEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecC-C-cEEEEEE Confidence 45555544444543321111 1111122333332 1221 1223445556677899998887752 2 2345666 Q ss_pred EcCCeEEEEEec-CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccC Q lcl|NC_016654. 158 VDADRAIPEFRW-GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEG 236 (533) Q Consensus 158 v~~~~~~P~~~~-g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~ 236 (533) ++++++.++.+. |++ .|+ ++..+ |..+.+. T Consensus 145 l~~~~v~v~~~~~g~~----------------~y~-------------~~~~~----g~~~~~~---------------- 175 (432) T protein:vir:10 145 LANDRLTITTDTKGNT----------------AYR-------------YRRTD----GQMIDIP---------------- 175 (432) T ss_pred EcCCceEEEEcCCCcE----------------EEE-------------EEecC----ceEEEEc---------------- Confidence 778777766432 321 111 11000 1000000 Q ss_pred CceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCcceeeechHHhcCC Q lcl|NC_016654. 237 RGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGAGKVHASESVLTNL 315 (533) Q Consensus 237 ~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~~~i~v~~~~l~~~ 315 (533) .--+.|+.+.. +. ...|.|.+..+. ..| .+.....++... |+.|...-.| +... T Consensus 176 ----------~~~iih~~~~~----~d-----g~~G~spi~~~~-~~i-~~~~~~~~~~~~~f~ng~~~~gi----l~~~ 230 (432) T protein:vir:10 176 ----------KQQIWKIMGYS----LD-----GENGLSAIRYGA-QIF-GTAIAAEAQAARAFRNGQLQSVY----YQID 230 (432) T ss_pred ----------CccEEEecCCC----CC-----CcccccHHHHHH-HHH-HHHHHHHHHHHHHHhcCCCcceE----EecC Confidence 00122332211 11 224677665433 333 223344444444 3543322222 2211 Q ss_pred CCccccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc-h Q lcl|NC_016654. 316 GMGQGVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA-Q 390 (533) Q Consensus 316 ~~~~~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~-~ 390 (533) +.-.....+.....|.. ..++++ +....++.++......++++..+....+|+...|+||..+|+...+. . T Consensus 231 ~~l~~e~~~~~~~~~~~----~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~ 306 (432) T protein:vir:10 231 RFLTDDQYDSFAKKVSG----SVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTS 306 (432) T ss_pred CCCCHHHHHHHHHHHhh----hhhCCCceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCccc Confidence 11000001111111211 111111 11223566666667778889888999999999999999999765432 2 Q ss_pred hHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCC Q lcl|NC_016654. 391 TATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASA 470 (533) Q Consensus 391 Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi 470 (533) ++..++... ...++..|..++..+..-.+..+..........+.++.+.-+-.|..++++.+.+++++|+ T Consensus 307 ~~sn~e~~~----------~~f~~~tl~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~ 376 (432) T protein:vir:10 307 WGSGIESQQ----------LGFLSMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGL 376 (432) T ss_pred ccchHHHHH----------HHHHHHHHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCC Confidence 233332222 2223334444444332222222222111122345555555566899999999999999999 Q ss_pred CCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 471 ASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 471 ~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) |+..++.+++ ++..-+=...+ +. -+....| ....+.+ +.+++...++..+...+.+ T Consensus 377 ~T~NE~R~~~--glppi~g~~~~--~~-~~~~~~p-l~~~~~~-~~~~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 377 MTRDEAREIE--GLPKLGGNAAV--LT-VQSAMVP-LDSIGLQ-ASPEPASGLGNQQQDKVSK 432 (432) T ss_pred CCHHHHHHHh--CCCCCCCCcce--Ee-ecCcccc-hhhhccc-CCCCCCCCCCCcccccccC Confidence 9999977765 22210000000 00 0011011 0111111 1111111111111111222 No 155 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=97.86 E-value=1.4e-05 Score=47.16 Aligned_cols=413 Identities=10% Similarity=0.018 Sum_probs=157.6 Q ss_pred CcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcch-hhHHHHHHHHHHHHHhcccCCCCCccc---ceeecChHHHH Q lcl|NC_016654. 10 WPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTS-PSGIKARTKAAYEAFHGRTPTATGRAP---KRYHAPIPGVI 85 (533) Q Consensus 10 ~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~g~~~---~~~~~n~~k~i 85 (533) -||. +.|-.| .++..++....... .............+.+...++..|... .-+...---.. T Consensus 1 ~~~~------~~mg~f--------~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~ 66 (432) T protein:vir:81 1 MPDE------KKLGLF--------GQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAAC 66 (432) T ss_pred CCch------hhcchh--------hhhhhhcccccccccccccccccCccchhhhcccccccCcccchHhhhccHHHHHH Confidence 1111 111111 22222221111000 000000000001111111121112111 01111112234 Q ss_pred HHHHHHhhcCCCceEeeC---CCchHHHHHHHHHHh--hccH--H-HHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEE Q lcl|NC_016654. 86 AKLSTTELFSEQLKFLDA---GKSKEVQARADLIFN--TPRF--H-SSLVEAGESCSALSGSFQRIVWDPTIADNAWIDF 157 (533) Q Consensus 86 ~~~~a~ll~~e~~~i~~~---~~~~~~~~~l~~i~~--~n~f--~-~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~ 157 (533) ++.+|+-+-+-|..+--. +.......-+..+|. -|.. . ..+...+...+..|.+|+.+..+. + .-+.+-. T Consensus 67 i~~Ia~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~~-g-~~~~L~~ 144 (432) T protein:vir:81 67 VKLVSQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTD-G-RIESLQY 144 (432) T ss_pred HHHHHHhhhhCceeeEEecCCcceecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecC-C-cEEEEEE Confidence 444555444445433111 111111122333432 1222 1 223334445677788888777653 2 2244556 Q ss_pred EcCCeEEEEEe-cCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccC Q lcl|NC_016654. 158 VDADRAIPEFR-WGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEG 236 (533) Q Consensus 158 v~~~~~~P~~~-~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~ 236 (533) ++++.+.+..+ +|++ .|+ ++..+ |..+.+. .. T Consensus 145 l~~~~v~v~~~~~g~~----------------~y~-------------~~~~~----g~~~~~~--------~~------ 177 (432) T protein:vir:81 145 LANDRLTITTDPKGNT----------------AYR-------------YRRTD----GQMIDIP--------KQ------ 177 (432) T ss_pred EcCCceEEEECCCCcE----------------EEE-------------EEecC----ceEEEEc--------cc------ Confidence 77777666543 2221 111 11000 1000000 00 Q ss_pred CceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCcceeeechHHhcCC Q lcl|NC_016654. 237 RGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGAGKVHASESVLTNL 315 (533) Q Consensus 237 ~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~~~i~v~~~~l~~~ 315 (533) -+.|+.+.. +. ...|.|.+..+. ..|. +.....++... |+.|...-.| +... T Consensus 178 ------------~iih~r~~~----~d-----g~~G~spi~~~~-~~i~-~~~~~~~~~~~~f~ng~~~~gi----l~~~ 230 (432) T protein:vir:81 178 ------------QIWKIMGYS----LD-----GENGLSAIRYGA-QIFG-TAIAAEAQAARAFRNGQLQSVY----YQID 230 (432) T ss_pred ------------cEEEecCCC----CC-----CcccccHHHHHH-HHHH-HHHHHHHHHHHHHhcCCCcceE----EecC Confidence 112222111 11 124666665432 3442 23333444433 3543322111 2111 Q ss_pred CCccccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc-h Q lcl|NC_016654. 316 GMGQGVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA-Q 390 (533) Q Consensus 316 ~~~~~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~-~ 390 (533) +.-.....+.....|.. ..++++ +....++.++......++++..+...++|+...|+||..+|+...+. . T Consensus 231 ~~l~~e~~~~~~~~~~~----~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~ 306 (432) T protein:vir:81 231 RFLTDDQYDSFAKKVSG----SVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTS 306 (432) T ss_pred CCCCHHHHHHHHHHHhh----hhcCCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccc Confidence 11000000001111111 111111 11223566666667778899988999999999999999999765432 2 Q ss_pred hHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCC Q lcl|NC_016654. 391 TATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASA 470 (533) Q Consensus 391 Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi 470 (533) ++..++... ...++..|..++..+..-.+..+..........+.++++.-+..|..++++.+.+++++|+ T Consensus 307 ~~sn~eq~~----------~~f~~~tl~P~~~~ie~~l~~kLl~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~ 376 (432) T protein:vir:81 307 WGSGIESQQ----------LGFLTMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGL 376 (432) T ss_pred ccchHHHHH----------HHHHHHHHHHHHHHHHHHHHhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCC Confidence 333332222 1222334444444332222222222111122345555556677899999999999999999 Q ss_pred CCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 471 ASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 471 ~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) |+..++.+.+ ++..-+-...+ +. -+....| ....+.+ +.+++...+++.+.+.+.+ T Consensus 377 ~t~NE~R~~~--glpp~~g~~~~--~~-~~~~~~p-l~~~~~~-~~~~~~~~~~n~~~~~~~~ 432 (432) T protein:vir:81 377 MTRDEAREIE--GLPKLGGNAAV--LT-VQSAMVP-LDSIGLQ-ASPEPASGLGNQQQDKVSK 432 (432) T ss_pred CCHHHHHHHh--CCCCCCCCcce--Ee-ecCcccc-hhhhccC-CCCCCCCCCCCcccccccC Confidence 9999977664 22210000000 00 0111111 1111111 1122222222333333333 No 156 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=97.82 E-value=1.7e-05 Score=46.75 Aligned_cols=391 Identities=12% Similarity=0.085 Sum_probs=143.7 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |.|=..-..+ +++..+. ++... +-.+...|.-...-.+. T Consensus 1 m~~~~~~~~~--------------------~~~~~~~-~~~~~--------------------~~~~~~~g~~~~~~Al~ 39 (417) T protein:vir:38 1 MKLFRGLATE--------------------VDPHWAD-HLLDS--------------------GVIPSFRGGYLGISALR 39 (417) T ss_pred CccccccccC--------------------CCccchh-hhccc--------------------ccccccCCceechhhcc Confidence 4432110000 0110000 00000 00000000000000111 Q ss_pred hHH--HHHHHHHHhhcCCCceEeeCCCchHH-HHHHHHHHhh--ccH---HHHHHHHHHHHhhhCCEEEEEEEcCCCCCc Q lcl|NC_016654. 81 IPG--VIAKLSTTELFSEQLKFLDAGKSKEV-QARADLIFNT--PRF---HSSLVEAGESCSALSGSFQRIVWDPTIADN 152 (533) Q Consensus 81 ~~k--~i~~~~a~ll~~e~~~i~~~~~~~~~-~~~l~~i~~~--n~f---~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~ 152 (533) .+. ..++.+|+-+-+-|..+.-...+... ...+..+|.. |.. ..-....+...+..|.+|+.+..|..++.- T Consensus 40 ~~~V~~cv~~ia~~iA~lp~~~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~ 119 (417) T protein:vir:38 40 NSDVLTAVSIVSGDVSRFPLVITDSSTDEVIDLANIEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEP 119 (417) T ss_pred cHHHHHHHHHHHHhhccCeeEEEEcCCcceeccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEE Confidence 111 23455555554445444222111111 1123333321 211 122233345566779999988877554333 Q ss_pred eEEEEEcCCeEEEEEec-CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccc Q lcl|NC_016654. 153 AWIDFVDADRAIPEFRW-GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVE 231 (533) Q Consensus 153 ~~i~~v~~~~~~P~~~~-g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~ 231 (533) ..+.+++|+++.+..++ +++ +|+ |..........++. . T Consensus 120 ~~l~~l~p~~v~v~~~~~~~~----------------~y~--------------~~~~~~~~~~~~~~----------~- 158 (417) T protein:vir:38 120 AMFEFYAPSQTQVDTSDPDNI----------------IYR--------------FTPYNSSMQKVCGF----------E- 158 (417) T ss_pred EEEEEeCCceEEEEEcCCCeE----------------EEE--------------EEEcCCcEEEEecC----------c- Confidence 44566777777654322 221 111 10000000000000 0 Q ss_pred ccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCc-ceeeech Q lcl|NC_016654. 232 GADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGA-GKVHASE 309 (533) Q Consensus 232 ~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~-~~i~v~~ 309 (533) -+.|+.... +. ...|.|.+..+. ..|. ++....++... |+.|. +..+ T Consensus 159 -----------------dviH~r~~~----~d-----~~~G~s~l~~~~-~~i~-~~~~~~~~~~~~f~ng~~p~~i--- 207 (417) T protein:vir:38 159 -----------------DVIHWKFFS----YD-----TIMGRSPLLSLG-DEIG-LQESGVSTLQKFFKSGLKGSII--- 207 (417) T ss_pred -----------------ceEEecCCC----CC-----CccccCHHHHHH-HHHH-HHHHHHHHHHHHHhccCCCcEE--- Confidence 012222110 11 134777766533 4442 34444444443 35442 2222 Q ss_pred HHhcCCCCccccccCcchhhhhhccccccccccc----cccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccC Q lcl|NC_016654. 310 SVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGD----METIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLS 385 (533) Q Consensus 310 ~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~----~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~ 385 (533) +...+.-.....+.....|.....+ .++++. ....++.++......++++..+...++|+...|+||..+|.. T Consensus 208 --l~~~~~l~~e~~~~~~~~~~~~~~g-~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~ 284 (417) T protein:vir:38 208 --KAKESRLSAEARQKIREDFERAQAG-ADAGSPIIVDATMDYQPLEVDTNVLNLINSNNYSTAQIAKALRVPAYRLAQN 284 (417) T ss_pred --EEeCCCCCHHHHHHHHHHHHHHhcc-cccCCceeccCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHhCCC Confidence 2111110000011111222222211 111111 122355555555666788888888899999999999999843 Q ss_pred CCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHH Q lcl|NC_016654. 386 DEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAW 465 (533) Q Consensus 386 ~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l 465 (533) .. ..|+++. ...+++..|..+++.+..-.+..+.... ......|.|+.... +... ...+.++ T Consensus 285 ~~-~s~~e~~-------------~~~~~~~tl~P~~~~ie~~l~~~Ll~~~--~~~~~~~~fd~~~l-~~~~-~~~~~~~ 346 (417) T protein:vir:38 285 SP-NQSVKQL-------------ADDYIRNDLPFYFEPITSEFELKLLDDA--QRHQYCIGFDTKSV-NGLP-IADVNTA 346 (417) T ss_pred Cc-chhHHHH-------------HHHHHHHHHHHHHHHHHHHHHhhhcChh--hcccceEEechhhh-hHHH-HHHHHHH Confidence 22 2222222 1223444555555554332222222211 11235567764321 2222 3346677 Q ss_pred HhCCCCCHHHHHHHhC-CCCCHHHHHHH--------HHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 466 SVASAASTKTKVAYLH-EDWDDERVQEE--------ADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 466 ~~aGi~S~et~v~~l~-~~~~dee~~~E--------l~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +.+|+|+..++.+++. |.++..++++- ++...+++..... ..-|++ +..+++.+..+.++ T Consensus 347 ~~~G~~T~NE~R~~~gl~pi~~g~~d~~~~~~n~~~~d~~~~~~~~~~~-~~kgg~-------~~~~~~~~~~~~~~ 415 (417) T protein:vir:38 347 VNGGLWTGNEGRAELGKKPLKDPNMDRIQSTLNTVFLDQKEAYQAEHAA-ELKGGD-------TNAKGNQNGSGTNA 415 (417) T ss_pred HhCCCcCHHHHHHHhCCCCCCCCCCCeeeeccccccccccccccccccc-ccCCCC-------CCCCCCCcCCCCcC Confidence 8899999999776641 11222111110 0111111110000 000111 11111212222222 No 157 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=97.79 E-value=1.9e-05 Score=46.45 Aligned_cols=427 Identities=11% Similarity=0.063 Sum_probs=160.8 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |+++.-+ .|.+-. +. ....+-+.+. ...+|.. .+-+.. .++-..+.. T Consensus 6 ~~~~~~~-~~~~~~-----~~-~~~~~~~~~~---~~~~~~p--p~~~~~---------------------La~~~~~n~ 52 (540) T protein:vir:41 6 LSIKSLE-KYRAIK-----GD-TDSQALKEDR---FEEYVEP--KVHPLV---------------------LLSLLQVNP 52 (540) T ss_pred cChhhcc-chhhhh-----cc-ccccccccCC---CCccccC--CCCHHH---------------------HHHHHHhcH Confidence 8887655 232222 11 0111111110 0001100 000000 001011234 Q ss_pred hHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcC Q lcl|NC_016654. 81 IPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDA 160 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~ 160 (533) ....+|+..|+.+.+-+..+.. ++..+.+.+-.- .-.+...+...+......|.+|+.+..|..|. -+.+..++| T Consensus 53 ~v~scI~~ia~~ia~~~~~i~~--~~~~~~~~lpN~--~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~-~~~L~~i~~ 127 (540) T protein:vir:41 53 YHASACSIKANDILRTGYLIDG--DDGGVEELLRAC--RPSFEFILLQALEDLQVFNYCTLEVVRDDQGE-PVRLDYIPA 127 (540) T ss_pred HHHHHHHHHHHHHhcCCceEec--CccchhhhccCC--CCCHHHHHHHHHHHHHhcCCeEEEEEECCCCc-EEEEEEeCC Confidence 5566777777777777755544 333333322111 01133444555566778899999988886543 356777888 Q ss_pred CeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCcee Q lcl|NC_016654. 161 DRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAY 240 (533) Q Consensus 161 ~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~ 240 (533) .++-+.-+..+. +. +.. +...+| +..|.... .+... .+.. T Consensus 128 ~~V~v~~~~~~~-----~~-~~d-~~~~~~------------~~~~~~~~-----~~~~~--------------~g~~-- 167 (540) T protein:vir:41 128 HTVRVHRDGSRY-----MQ-TWD-GIHVTY------------FKDYRYEG-----EVNPD--------------NGED-- 167 (540) T ss_pred cceEEeEcCcee-----Ee-eec-Cceeee------------eecccccc-----eeecc--------------cccc-- Confidence 877654332221 10 111 111111 01111000 00000 0000 Q ss_pred ecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCcce---eeechHHhcCCC Q lcl|NC_016654. 241 VETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGAGK---VHASESVLTNLG 316 (533) Q Consensus 241 ~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~~~---i~v~~~~l~~~~ 316 (533) ....+.--+.|+.+..+. ...+|.|.+..+.. .| .++.....+... |+.|... |.++..+.+... T Consensus 168 -~~~~~~~eViHir~~~~~--------~~~~G~Spi~~~~~-~i-~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~ 236 (540) T protein:vir:41 168 -QDGVGANEIIFIHLPSPI--------CSYYGVPRYLSAAP-SI-LAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEME 236 (540) T ss_pred -ceeecccceEEecCCCCC--------CCcccccHHHHHHH-HH-HHHHHHHHHHHHHHhccCCCceEEEeCcccCchhc Confidence 000011123344322111 23468888876543 33 233344444443 3544322 222222211110 Q ss_pred Ccccccc---Ccchhhh----hhc-------cccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhc Q lcl|NC_016654. 317 MGQGVSL---DEEQEVY----SRV-------GSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSL 382 (533) Q Consensus 317 ~~~~~~~---d~~~~~~----~~~-------~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~ 382 (533) ....... ......+ ... .......+.+....++.++......++++..+...++|+...|++|..+ T Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~l 316 (540) T protein:vir:41 237 LGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGGDTVEVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPYRL 316 (540) T ss_pred cchHHHHHHHHHHHHHHHHHhccccccccceEEEecCCCcccceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCHHHc Confidence 0000000 0000001 000 0000011111223455555666677889999999999999999999999 Q ss_pred ccCCCcc---hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHH Q lcl|NC_016654. 383 GLSDEVA---QTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKA 459 (533) Q Consensus 383 g~~~~~~---~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a 459 (533) |+..++. .++.+....+ ...+..-....++..|.. .+.-. ......|.|+..-.... +.+ T Consensus 317 G~~~~~~~n~sn~eq~~~~f--~~~tL~P~~~~ie~~ln~-----------~L~~~---~~~~~~i~f~~~~ll~~-D~~ 379 (540) T protein:vir:41 317 GITDVGPLGGNFAEVARRTY--YESVVRPQQEIVSSVLTD-----------FIQLK---LDPGARFVFNEEILMES-EFV 379 (540) T ss_pred CcccCCCCCcccHHHHHHHH--HHHHHHHHHHHHHHHHHH-----------hhhhc---cCCceEEEecchhhcch-HHH Confidence 8654322 2333332222 111122222222222222 11111 11234566665444332 344 Q ss_pred HHHHHHHhCCCCCHHHHHHHhCC--CCCHHH------HHHHHHHHHHhhhcccC-cccc--ccccC-CCCCCCC--CCCC Q lcl|NC_016654. 460 QTVQAWSVASAASTKTKVAYLHE--DWDDER------VQEEADLIDNANTVSAP-TFGF--GTDQP-PLPTEND--PATD 525 (533) Q Consensus 460 ~~~~~l~~aGi~S~et~v~~l~~--~~~dee------~~~El~rI~~E~~~~~~-~~~~--~~~~~-~~~~~~~--~~~~ 525 (533) ..+.+++++|+|+..++.+.+.| ..++.- ...++..-..+.....+ .... ....+ .....++ +..+ T Consensus 380 ~~~~~lv~~G~lT~NE~Re~L~g~e~gdd~~l~p~n~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~ 459 (540) T protein:vir:41 380 HNYALLVQCGVLTPSEVREKLFGLDGGPDMFMVPSSIGKSAMKRQKRNYEKNQINEIKRTYAKYKPRIQEIISSESPLED 459 (540) T ss_pred HHHHHHHhCCCCCHHHHHHHhCcCcCCCcccccccccccccccccccccCCCCccccccccchhcccccCcccccccccc Confidence 45677889999999997655532 122210 00111110000000000 0000 00000 0000000 0000 Q ss_pred -------------CCCCCCCC Q lcl|NC_016654. 526 -------------PEAVDEGE 533 (533) Q Consensus 526 -------------~~~~~d~~ 533 (533) ++.+.-|+ T Consensus 460 ~~~~~~~~~~~~~~~~~~~~~ 480 (540) T protein:vir:41 460 KKKKIDEVLSDFRAEAYENGK 480 (540) T ss_pred ccccccccccccCCccccchh Confidence 11111111 No 158 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=97.77 E-value=2.1e-05 Score=46.26 Aligned_cols=410 Identities=12% Similarity=0.032 Sum_probs=159.8 Q ss_pred CCCC---CCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccc-e Q lcl|NC_016654. 1 MSLP---EANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPK-R 76 (533) Q Consensus 1 ~~~~---~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~-~ 76 (533) |.=- .-...+..+. +..++. ..+.....| ......|+ +.++..|.... . T Consensus 1 ~~~~l~~~~~~~~~~~~-~~~~~~--~~~~~~~~~-----------------------~~~~~~~~-g~~~~~g~~v~~~ 53 (434) T protein:vir:43 1 MSKSLGKVLSSATSAPR-SSLFGW--GGKTIRLTD-----------------------GAFWSQFL-GRESSSGKKVTVD 53 (434) T ss_pred Cccchhhhhhhcccccc-hhhhcc--cccccccCc-----------------------hHHHHHHh-cCCccCCceechh Confidence 1100 0000111110 000000 000000000 01112232 22332222211 0 Q ss_pred eecChH--HHHHHHHHHhhcCCCceE-eeCCC---chHHHHHHHHHHh--hccHH---HHHHHHHHHHhhhCCEEEEEEE Q lcl|NC_016654. 77 YHAPIP--GVIAKLSTTELFSEQLKF-LDAGK---SKEVQARADLIFN--TPRFH---SSLVEAGESCSALSGSFQRIVW 145 (533) Q Consensus 77 ~~~n~~--k~i~~~~a~ll~~e~~~i-~~~~~---~~~~~~~l~~i~~--~n~f~---~~~~~~~~~~~~~G~~~~~~~~ 145 (533) -.+..+ -.+++.+|+-+-+=|..+ ....+ .....-.+..+|. -|.+. .-....+...+..|.+|+.+.. T Consensus 54 ~al~~~~V~~~i~~ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~ 133 (434) T protein:vir:43 54 KAMKLSAVWACVRLISTSVAGLPLGVYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRR 133 (434) T ss_pred hhhccHHHHHHHHHHHHhhhhCceEEEEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEe Confidence 011122 234555555555445433 11111 1111223444442 23221 3344445566778999888765 Q ss_pred cCCCCCceEEEEEcCCeEEEEEec-CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccc Q lcl|NC_016654. 146 DPTIADNAWIDFVDADRAIPEFRW-GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPA 224 (533) Q Consensus 146 D~~~~~~~~i~~v~~~~~~P~~~~-g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~ 224 (533) + ++ .-+.+..++|+.+.+..+. |++ .|..+... |..+.+. T Consensus 134 ~-~G-~~~~L~~l~p~~v~~~~~~~g~~-----------------------------~y~~~~~~----g~~~~~~---- 174 (434) T protein:vir:43 134 A-AG-RPAALDFLLPSRVDLECDENGRL-----------------------------KYFYTTKK----GARREIE---- 174 (434) T ss_pred C-CC-cEEEEEEEcCcceEEEEcCCCeE-----------------------------EEEEEecC----ceEEEEc---- Confidence 5 22 2244556777777665432 221 11111110 1111100 Q ss_pred cccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCc- Q lcl|NC_016654. 225 TRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGA- 302 (533) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~- 302 (533) .--+.|+++.. +. ..+|.|.+..+. ..| .++....++... |+.|. T Consensus 175 ----------------------~~eVih~~~~~----~d-----g~~G~spi~~~~-~~i-~~~~~~~~~~~~~f~ng~~ 221 (434) T protein:vir:43 175 ----------------------RTNMLHIPAFT----LD-----GRIGLSAIRYGV-DVF-GSVMSAEDAANGTFKNGLL 221 (434) T ss_pred ----------------------cccEEEecCcC----CC-----CccccCHHHHHH-HHH-HHHHHHHHHHHHHHhccCC Confidence 00122332211 11 234777665543 344 334444455444 35432 Q ss_pred ceeeechHHhcCCCCccccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCC Q lcl|NC_016654. 303 GKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYS 378 (533) Q Consensus 303 ~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s 378 (533) ...+ +...+.-.....+..+..+.... +..+.++ +....++.++......++++..+....+|+...|+| T Consensus 222 ~~gi-----l~~~~~l~~e~~~~~r~~~~~~~-g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP 295 (434) T protein:vir:43 222 PTVA-----FKVDRILQPAQREEFREYVKSVS-GAMNSGRSPVLEQGITPETIGINPVDAQLLETREHGVIEICRWFGVP 295 (434) T ss_pred cceE-----EecCCCCCHHHHHHHHHHHHHhc-CccccCCccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCC Confidence 2222 21111100000000111111111 1111111 112235566666677788888888899999999999 Q ss_pred hhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHH Q lcl|NC_016654. 379 PVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAK 458 (533) Q Consensus 379 ~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~ 458 (533) |..+|+...+..++..+.... ...+..+|..++..+..-.+..+..........+.++++.-+..|..++ T Consensus 296 p~~lg~~~~~~~~~s~~e~~~----------~~f~~~~L~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~llr~d~~~r 365 (434) T protein:vir:43 296 PWMIGQTDKGSNWGTGLEQQM----------LAFLTFSISSITNQIQQCVNKRLLTAPERIRYYAEFSLEGFLKADSAGR 365 (434) T ss_pred HHHhCCCcCCccccchHHHHH----------HHHHHHHHHHHHHHHHHHHHhhcCChhhhcCceEEEechhhhccCHHHH Confidence 999986554332233232211 2233444555554443322222222211123456666667777899999 Q ss_pred HHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccc--cCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 459 AQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTD--QPPLPTENDPATDPEAVDEG 532 (533) Q Consensus 459 a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~d~ 532 (533) ++...+++.+|+|+..++.+.+ ++..- +.-+++.- +.-..| ....++ .++...+......++.++.. T Consensus 366 ~~~~~~~~~~G~~T~NE~R~~~--gl~p~---~ggD~~~~-~~n~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 366 AAWYSTMAQNGFMTRNEGRRKE--NLPEL---PGGDILTV-QSNLVP-IDQLGQSNKSQAVRAALMNWFSQPEPQE 434 (434) T ss_pred HHHHHHHHhCCCcCHHHHHHHh--CCCCC---CCCCeEee-ccCccc-hhhhhccCCCcchhhhhhccCCCCCCCC Confidence 9999999999999999977654 22220 00000000 000000 000011 11111111111111222222 No 159 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=97.67 E-value=3e-05 Score=45.38 Aligned_cols=449 Identities=14% Similarity=0.108 Sum_probs=195.8 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) =.+|+..++=||-..+..+.-+.. =|-| .|.+......+ . ..++..|+.+- ..+ T Consensus 14 ~~~~~~~s~~~~~~~dg~~~~~~~---~~~g-------~~~~~e~~~~~-~-~eLI~~YR~ma--------------~~p 67 (537) T protein:vir:10 14 KKVPKGPSFVQKDSLDGSQPIVGG---GYFG-------YSVDFDGTIRN-D-HELITRYREMV--------------LNP 67 (537) T ss_pred cccccCCcccCCCcccccceeecc---cccc-------cccccccccch-H-HHHHHHHHHHh--------------hcc Confidence 112344444455553322210000 0000 00000000000 0 11122222110 111 Q ss_pred hHHHHHHHHHHh-h----cCCCceEeeCC--Cch----HHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCC- Q lcl|NC_016654. 81 IPGVIAKLSTTE-L----FSEQLKFLDAG--KSK----EVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPT- 148 (533) Q Consensus 81 ~~k~i~~~~a~l-l----~~e~~~i~~~~--~~~----~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~- 148 (533) -+.-.++..++= + ..+|+++.++. -++ ...+..+.|++-=+|++..++......+.|..||+.++|.. T Consensus 68 Evd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fhKiid~k~ 147 (537) T protein:vir:10 68 ECDSAVDDVVNETICGNFDDVPISIDLHNLKQSEKIKKLIRSEFDEILRLLDFDNRAYEIFRRWYVDGRLFFHKVIDPKK 147 (537) T ss_pred chhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCC Confidence 222223333321 1 11245555543 112 24455666777778999999999999999999999999854 Q ss_pred -CCCceEEEEEcCCeEEEEEec-----CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEecc----CCcccceee Q lcl|NC_016654. 149 -IADNAWIDFVDADRAIPEFRW-----GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGT----ATSLGWMMA 218 (533) Q Consensus 149 -~~~~~~i~~v~~~~~~P~~~~-----g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~----~~~lG~~v~ 218 (533) ..+=..+.+++|.++-++... ..+...-+.+. ++ .+.-.|.+|.-. ..+-|.++| T Consensus 148 pk~GI~ELr~lDPr~i~~vR~i~~~~~~~~~~~~~~~~--------v~-------~~~~eyf~ynp~g~~~~~~~~vkI~ 212 (537) T protein:vir:10 148 PRQGLVELRYVDPRKIRKVTEYEAKRPEALRTQDLNQQ--------LT-------QQSASYFLYNPKGLKNSTNQGMKIA 212 (537) T ss_pred ccccceeeeeeCCccceeeEeecccCCccceEEeccee--------ee-------ecccceeeeccccccccCCCceecc Confidence 234456777888888765321 11100001111 11 111122233211 111123333 Q ss_pred hhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHH--HHH Q lcl|NC_016654. 219 LTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSS--LMR 296 (533) Q Consensus 219 l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~--~~~ 296 (533) -+.+ .|..-..-.. ..+...|-+-.||+++ ..|=-+-+. +.| T Consensus 213 ~dAI----------------------------~y~hSGl~d~-------n~~~i~syLhkAiKp~-NQLkm~EDAlVIYR 256 (537) T protein:vir:10 213 PDSI----------------------------AYCHSGIQDL-------NKNMVLSHLHKAIKAV-NQLRMIEDSLVIYR 256 (537) T ss_pred Hhhe----------------------------eeecccceeC-------CCCeeeeeehhhhHHH-HhhHHHHhhHHHHh Confidence 2211 1111000000 0122333344455432 222111111 223 Q ss_pred HHHhCcceeee-c---------hHHhcC----------CCCccccccCcchhhhhhcc---ccccccccccccceeeech Q lcl|NC_016654. 297 DFRIGAGKVHA-S---------ESVLTN----------LGMGQGVSLDEEQEVYSRVG---SGGFNANGDMETIFEFFQP 353 (533) Q Consensus 297 ~~~~~~~~i~v-~---------~~~l~~----------~~~~~~~~~d~~~~~~~~~~---~~~~~~~~~~~~~i~~~~~ 353 (533) .-|.-.+|||- . +.+++. +....|..-| ++.|..+. +.+--.|+ ....|+++.. T Consensus 257 itRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~d--drk~msMlEDyWLPRReGg-rgTEItTLpG 333 (537) T protein:vir:10 257 LSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKD--DKKFMSMLEDFWLPRREGG-RGTEISTLPG 333 (537) T ss_pred hhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecc--cchhhhhhhhhcccccCCC-cccceeeccc Confidence 34566777763 1 111110 0111111111 11111100 01111122 2223444433 Q ss_pred hhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCC-cchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_016654. 354 AIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDE-VAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKF 432 (533) Q Consensus 354 ~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~-~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~ 432 (533) .-.. .-+..+..+.+.+....++|.+.++.+++ +..-++||...+-.--..+.+.+..|..-|.++++.-|.|.... T Consensus 334 gqnl-gem~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgii- 411 (537) T protein:vir:10 334 GQNL-GELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFVDLLKTQLILKGIC- 411 (537) T ss_pred cCCc-ChHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCC- Confidence 2222 23556677788899999999888875543 22245677777766777788888888888888888776553211 Q ss_pred cCCCCC--CceeEEEEeCCCCCCCHHHHH-------HHHHHHH--hCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhh Q lcl|NC_016654. 433 PGKGAA--PSEELELEWPKFARESDLAKA-------QTVQAWS--VASAASTKTKVAYLHEDWDDERVQEEADLIDNANT 501 (533) Q Consensus 433 ~~~~~~--~~~~v~i~f~d~i~~d~~e~a-------~~~~~l~--~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~ 501 (533) ..... ....+.++|...---.+...+ ..++++. -+...|.++..++.. -.+|+|.++|.++|++|.. T Consensus 412 -t~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~IL-r~tDeeI~~~~k~I~~E~k 489 (537) T protein:vir:10 412 -SIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVL-KQTESEIKEIDKEIKQEIA 489 (537) T ss_pred -CHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHh-ccCHHHHHHHHHHHHHHhh Confidence 00001 124577777654333333222 2333321 122569998766654 4899999999999999964 Q ss_pred cc---cCc----cccc-cccCCCCCCC-CC------CCCCCCCCCCC Q lcl|NC_016654. 502 VS---APT----FGFG-TDQPPLPTEN-DP------ATDPEAVDEGE 533 (533) Q Consensus 502 ~~---~~~----~~~~-~~~~~~~~~~-~~------~~~~~~~~d~~ 533 (533) .. +|. ++++ ++..+.+.++ +| ++.|+...-|| T Consensus 490 ~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 536 (537) T protein:vir:10 490 DGVIMDPQAMQAMEMGIGDEEPVPEGGEEPQTDPNSAVSPADQKRGE 536 (537) T ss_pred CCCCCCcccccccccCCCCcccCCCCCCCcccCCccCCCCCCccCCC Confidence 21 111 1111 1111111111 12 22344444555 No 160 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=97.67 E-value=3e-05 Score=45.37 Aligned_cols=436 Identities=9% Similarity=0.015 Sum_probs=170.6 Q ss_pred HHhhhHhhcCCHHHHHHHHhccCc---chhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHhhcCC-- Q lcl|NC_016654. 22 VAESHVWWEGDLDKLATFYGAEGR---TSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTELFSE-- 96 (533) Q Consensus 22 ~~~~~~w~~gd~~~l~~~y~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e-- 96 (533) |..-..-|.|+...|.+.|..... ...+.|+..........+. ....+....++--+-+...++.+|+-|.+- T Consensus 1 ~~~~~~~~~~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~~~~--~~~~~~~~~~~~dstg~~a~~~LAa~l~~~lt 78 (515) T protein:vir:70 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMN--NKGDNETSQNGWQGVGAQATNHLANKLAQVLF 78 (515) T ss_pred CcchhhhhcCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcccccC--CCCCcccccccccchHHHHHHHHHHHHHHhhc Confidence 555666677888888777754321 2223333332222221111 111222333444556677777777765543 Q ss_pred ---CceEeeCCCc-------------hHHHHH-------HHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCce Q lcl|NC_016654. 97 ---QLKFLDAGKS-------------KEVQAR-------ADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNA 153 (533) Q Consensus 97 ---~~~i~~~~~~-------------~~~~~~-------l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~ 153 (533) .+.|.....+ ..++++ +...+..++|...+.++.......|.+.+ |.|+.+ . T Consensus 79 pp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--~~d~~~--~- 153 (515) T protein:vir:70 79 PAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YKPSKG--A- 153 (515) T ss_pred CCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEEE--EEeCCC--C- Confidence 1344432111 122233 33347788999999999999999999875 457543 2 Q ss_pred EEEEEcCCeEEEEEe-cCCceEEEEEEEEeec----------------------CCceEEEEEEEecCeeEEEEEEeccC Q lcl|NC_016654. 154 WIDFVDADRAIPEFR-WGRLVAVTFWSELAGG----------------------DGQEVWRHLERHESGYIVHAVYKGTA 210 (533) Q Consensus 154 ~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~~----------------------~~~~~y~~lE~h~~~~I~~~~y~~~~ 210 (533) +..++-..++..-+ +|++..+++-.+++-. +.-.+|+++++...++ +.+|..-+ T Consensus 154 -~~~~pl~~y~v~~d~~G~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~--~~~~~e~d 230 (515) T protein:vir:70 154 -MSAVPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGF--WKINQSAD 230 (515) T ss_pred -eEEEEcCeEEEeeCCCcCeeEEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEEEEecCCCc--eEEEEecC Confidence 55677666555444 5787776644333210 0001233333221110 01111000 Q ss_pred CcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHH Q lcl|NC_016654. 211 TSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRI 290 (533) Q Consensus 211 ~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~ 290 (533) +.....+.+.+..-+.|++. +|... ..+.||+|--..++ +-+..|+.. T Consensus 231 -------------------------~~~~~~es~y~~~e~P~~~~-----Rw~~~-~ge~YGrgp~~~~l-~D~k~L~~l 278 (515) T protein:vir:70 231 -------------------------DIPVGKESRIKSEKLPFIPL-----TWKRS-YGEDWGRPLAEDYS-GDLFVIQFL 278 (515) T ss_pred -------------------------ceeeccccccccccCCceee-----eeeec-CCCCcccchHHHhh-HHHHHHHHH Confidence 00000122221111222222 24322 24678998777665 556777765 Q ss_pred HHHHHHHH-HhCcceeeechHHhcCCCCcccccc-CcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHH Q lcl|NC_016654. 291 YSSLMRDF-RIGAGKVHASESVLTNLGMGQGVSL-DEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLL 368 (533) Q Consensus 291 ~s~~~~~~-~~~~~~i~v~~~~l~~~~~~~~~~~-d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l 368 (533) --...... ...+....||++.. .+...+ +...+.+.. |.......-.+........-...|+.+- T Consensus 279 ~~~~l~~~~~a~~p~~lv~~~g~-----~~~~~l~~~~~g~iv~--------g~~~~v~~~~~~~~~d~~~~~~~i~~~~ 345 (515) T protein:vir:70 279 SEAMARGAALMADIKYLIRPGSQ-----TDVDHFVNSGTGEVIT--------GVAEDIHIVQLGKYADLTPISAVLEVYT 345 (515) T ss_pred HHHHHHHHHHhcCCCeeeCcccc-----cchhhccccCCceeec--------CCcccceeeecCcccchhHHHHHHHHHH Confidence 55554433 44555556654322 111111 000111111 1000000111111111121222233332 Q ss_pred HHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEe Q lcl|NC_016654. 369 REVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKA-RHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEW 447 (533) Q Consensus 369 ~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~-~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f 447 (533) +.|....=+. .+.-..+...|||||..+.+...+..+-.- +.-..-|..|+.. +.+..+ ...+...+.++. T Consensus 346 ~rI~~af~~~--~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~srL~~Ell~Pli~r---~~~~~~---p~~P~~~v~~~~ 417 (515) T protein:vir:70 346 RRIGVIFMME--TMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMW---GLQEAG---DSFTSELVDPVI 417 (515) T ss_pred HHHHHHHhhh--hhhccCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH---HHHhhC---CCCChhhcccce Confidence 2332211011 111112234699999877666544333211 1111122223222 112221 223333344433 Q ss_pred CCCCCCCHHHHHHHHHH------HHh--CC-------CCCHHHHHHHh---CC---C--CCHHHHHHHHHHHHHhhhccc Q lcl|NC_016654. 448 PKFARESDLAKAQTVQA------WSV--AS-------AASTKTKVAYL---HE---D--WDDERVQEEADLIDNANTVSA 504 (533) Q Consensus 448 ~d~i~~d~~e~a~~~~~------l~~--aG-------i~S~et~v~~l---~~---~--~~dee~~~El~rI~~E~~~~~ 504 (533) -.++ +...+++.+.. .++ ++ .+-...+++.+ .+ . -+++|++++.++.++.+.+.. T Consensus 418 vs~l--~~L~r~q~~~~i~~~~q~i~~~~~~~p~~~~~id~d~~~~~~a~~~g~p~~~~rs~eev~~~r~q~~~~~~~~~ 495 (515) T protein:vir:70 418 VTGI--EALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAM 495 (515) T ss_pred ehhH--HHHHHHHHHHHHHHHHHHHHHHhccChhHHhhCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHH Confidence 2222 22222111111 111 11 11122222221 10 1 266777766554333322211 Q ss_pred --CccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 505 --PTFGFGTDQPPLPTENDPATDPEAVD 530 (533) Q Consensus 505 --~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (533) ...+..... ..+|.-++. T Consensus 496 ~~~~~~~a~~~--------~~~~~~~~~ 515 (515) T protein:vir:70 496 LNEGVAKAVPG--------VIQQEMKEG 515 (515) T ss_pred HHHhhhhhccc--------chhhhhccC Confidence 111111111 111111111 No 161 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=97.64 E-value=3.4e-05 Score=45.08 Aligned_cols=417 Identities=11% Similarity=0.023 Sum_probs=154.2 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhh--HhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceee Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESH--VWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYH 78 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~--~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 78 (533) |=|- .|..-==|.-+.....+.... .|..|- ++.. ....+.+. ... T Consensus 1 ~~~~-~~~~~~~p~~~e~~~~~~~~~~~~~~~~~--~~~~------------~~~~~~~~-----------------a~~ 48 (518) T protein:vir:10 1 MLLA-NGQTLSAPAMAELSPQMQDSYYYAPAVGM--QLER------------QFSLYGGI-----------------YKN 48 (518) T ss_pred Cccc-CceeecCchhhhhhhhhhcccccccccce--eccc------------ccchhhHH-----------------Hhh Confidence 3332 222211111111111111100 011110 0000 00000000 000 Q ss_pred cChHHHHHHHHHHhhcCCCceEee---CCCchHHHHHHHHHHhhccHH----HHHHHHHHHHhhhCCEEEEEEEcCCCCC Q lcl|NC_016654. 79 APIPGVIAKLSTTELFSEQLKFLD---AGKSKEVQARADLIFNTPRFH----SSLVEAGESCSALSGSFQRIVWDPTIAD 151 (533) Q Consensus 79 ~n~~k~i~~~~a~ll~~e~~~i~~---~~~~~~~~~~l~~i~~~n~f~----~~~~~~~~~~~~~G~~~~~~~~D~~~~~ 151 (533) ..--..+|+.+|+-+-+=|..+-- ++..+.....+..++..-+-. .-....+...+..|.+|+.+..|..|. T Consensus 49 ~~~V~acV~~IA~~iA~lpl~l~~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~- 127 (518) T protein:vir:10 49 QPWVRTVIAKRAQALARLPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGT- 127 (518) T ss_pred hHHHHHHHHHHHHhhccCceEEEEEcCCCceeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc- Confidence 011123444444444333333211 111111112233333322211 123333445567799999888876543 Q ss_pred ceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccc Q lcl|NC_016654. 152 NAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVE 231 (533) Q Consensus 152 ~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~ 231 (533) -+.+..++++.+.+..+.... ...|+ |......-+..+.+ T Consensus 128 ~~~L~~l~p~~v~v~~~~~~~--------------~~~y~--------------~~~~~~~~~~~~~~------------ 167 (518) T protein:vir:10 128 PEKLMPMHPSRVAIKRNSRTG--------------RYEYY--------------FQAGAGVGTQLVSF------------ 167 (518) T ss_pred EEEEEEECCCceEEEEcCCCC--------------EEEEE--------------EEecCCccceEEEe------------ Confidence 245667777777665432110 00010 11000000000000 Q ss_pred ccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCc-ceeeech Q lcl|NC_016654. 232 GADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGA-GKVHASE 309 (533) Q Consensus 232 ~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~-~~i~v~~ 309 (533) +.--+.|+.+...+ ....|.|.+..+. ..|. +......+... |+.|. ...++ T Consensus 168 --------------~~~eViHir~~s~d--------g~~~G~spi~~a~-~~i~-~~~a~~~~~~~~f~ng~~p~gil-- 221 (518) T protein:vir:10 168 --------------ADDEVVPIRFFNPD--------GLERGLSLMESLK-STIF-SEDSSRNATAAMWKNAGRPNLVL-- 221 (518) T ss_pred --------------cCCcEEEecCCCCC--------cccccccHHHHHH-HHHH-HHHHHHHHHHHHHhcCCCccEEE-- Confidence 00012233222111 1124777765433 3443 23333333333 45543 33332 Q ss_pred HHhcCCCCccccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccC Q lcl|NC_016654. 310 SVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLS 385 (533) Q Consensus 310 ~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~ 385 (533) ...+.-...........|.....+..++++ +....++.++......++++..+...++|....|++|..+|+. T Consensus 222 ---~~~~~ls~e~~~~~k~~~~~~~~G~~nag~v~vL~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~ 298 (518) T protein:vir:10 222 ---RHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHIL 298 (518) T ss_pred ---ecCCCCCHHHHHHHHHHHHHHhcCccccCcceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccC Confidence 111100000000011112111111111111 1122355566656667888888888999999999999999865 Q ss_pred CCcch-hHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHH Q lcl|NC_016654. 386 DEVAQ-TATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQA 464 (533) Q Consensus 386 ~~~~~-Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~ 464 (533) .++.- ++++. ....+..+|.-++..|....+..+... ......+.++.+.-+..|..++++.+.+ T Consensus 299 ~~~t~sn~eq~-------------~~~f~~~tL~P~l~~ie~~ln~~L~~~-~~~~~~~~fd~~~llr~D~~~r~~~~~~ 364 (518) T protein:vir:10 299 DRATFSNISAQ-------------MRAFYRDTMAIPIARIQSAMDKYVGQY-WVRKNRMKFDIDDVIQPDWEAKSESTQK 364 (518) T ss_pred CCCCchhHHHH-------------HHHHHHHHHHHHHHHHHHHHHHhhccc-ccCCceEEEechhhhccCHHHHHHHHHH Confidence 44322 22221 122333444444444432222222111 1123446666667778899999999999 Q ss_pred HHhCCCCCHHHHHHHhCC-CCCHHHHHHHH-----HHHHHhh-----hcccCccccc-------cccCCCCCC------C Q lcl|NC_016654. 465 WSVASAASTKTKVAYLHE-DWDDERVQEEA-----DLIDNAN-----TVSAPTFGFG-------TDQPPLPTE------N 520 (533) Q Consensus 465 l~~aGi~S~et~v~~l~~-~~~dee~~~El-----~rI~~E~-----~~~~~~~~~~-------~~~~~~~~~------~ 520 (533) ++.+|+|+..++.+.+.- ..+++..++-+ ..|..-. +.+.+..... .++.+...+ . T Consensus 365 ~~~~G~lT~NE~R~~~Gl~pie~~~gD~~~~~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (518) T protein:vir:10 365 MVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPASTPVASLDQSPPTSVPGLSPTN 444 (518) T ss_pred HHhCCCcCHHHHHHHhCCCCCCCCCCCeeeecccceecccccccccCCCCCCCCCCCCccccccccccccccCCCCCccc Confidence 999999999997766521 12221221110 1110000 0000000000 000000000 0 Q ss_pred CCCCCCCCCCCCC Q lcl|NC_016654. 521 DPATDPEAVDEGE 533 (533) Q Consensus 521 ~~~~~~~~~~d~~ 533 (533) .+..+..+.+|+| T Consensus 445 ~~~~~~~~~~~~~ 457 (518) T protein:vir:10 445 SDRSTDSGKTEPR 457 (518) T ss_pred ccccccccccchh Confidence 0000111122222 No 162 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=97.61 E-value=3.8e-05 Score=44.84 Aligned_cols=459 Identities=10% Similarity=0.037 Sum_probs=177.8 Q ss_pred CCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChH Q lcl|NC_016654. 3 LPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIP 82 (533) Q Consensus 3 ~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~ 82 (533) |-+-.+-.==+.+...+..+...|..|...-.++.+|... ..+.......+.+..++.-+-+ T Consensus 1 m~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP------------------~~~~~~~~~~~~~~~~~~dst~ 62 (536) T protein:vir:10 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIP------------------SLFPKDSDNASTDYQTPWQAVG 62 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcc------------------cccCCCCCcccccccccccccH Confidence 3333332222334444444444444444333333333211 1111111111223345666778 Q ss_pred HHHHHHHHHhhcCC--C--ceEeeCCCc-------------hHHH-------HHHHHHHhhccHHHHHHHHHHHHhhhCC Q lcl|NC_016654. 83 GVIAKLSTTELFSE--Q--LKFLDAGKS-------------KEVQ-------ARADLIFNTPRFHSSLVEAGESCSALSG 138 (533) Q Consensus 83 k~i~~~~a~ll~~e--~--~~i~~~~~~-------------~~~~-------~~l~~i~~~n~f~~~~~~~~~~~~~~G~ 138 (533) ...|+.+|+.|.+- | +.|.....+ .+++ +.+...+..++|...+.++.+...+.|. T Consensus 63 ~~a~~~Laa~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~ 142 (536) T protein:vir:10 63 ARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGN 142 (536) T ss_pred HHHHHHHHHHHHhhhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCc Confidence 88888888766553 2 444432211 1222 3445568888999999999999999998 Q ss_pred EEEEEEEcCCCCCce-EEEEEcCCeEEEEEe-cCCceEEEEEEEEe--------------ecCCceEEEEEEEecCeeEE Q lcl|NC_016654. 139 SFQRIVWDPTIADNA-WIDFVDADRAIPEFR-WGRLVAVTFWSELA--------------GGDGQEVWRHLERHESGYIV 202 (533) Q Consensus 139 ~~~~~~~D~~~~~~~-~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~--------------~~~~~~~y~~lE~h~~~~I~ 202 (533) +.++ .+++....+ .+..++-..++..-+ +|++..++.-.+++ ...++..+. .-.|. T Consensus 143 a~ly--~~e~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~------~v~v~ 214 (536) T protein:vir:10 143 VLLY--LPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADE------TIDVY 214 (536) T ss_pred EeEE--EeeCCCCceeeEEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccCccc------ceEEE Confidence 8864 454443333 367777666665544 47777665333332 011111111 11222 Q ss_pred EEEEeccCCcccceeehhhccccccccccccccCCceeecCCCc-cceeEEecCCcccccccccccccccccchhhhhHH Q lcl|NC_016654. 203 HAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLF 281 (533) Q Consensus 203 ~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~ 281 (533) +.+|.-..+. . +.-+ ....... ..-+.|.. ..-+.|++. +|... ..+.||+|-...++ T Consensus 215 ~~V~~~~~~~--~---~~~~---~e~~g~~------v~~~~g~~~f~~~P~i~~-----Rw~~~-~ge~YGrgp~~~~l- 273 (536) T protein:vir:10 215 THIYLDEASG--E---YLRY---EEVEGME------VQGSDGTYPKEACPYIPI-----RMVRL-DGESYGRSYIEEYL- 273 (536) T ss_pred EEEEEecCCC--c---EEEE---EeecCcc------ccccccccccccCCceee-----eeeec-CCCccccchHHHHH- Confidence 3333221110 0 0000 0000000 00011111 001122222 23322 24778998777666 Q ss_pred HHHHHHHHHHHHHHHH-HHhCcceeeec-hHHhcCCCCccccccCcchhhhhhcccccccccccccccee-eechhhh-h Q lcl|NC_016654. 282 PTFHELDRIYSSLMRD-FRIGAGKVHAS-ESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFE-FFQPAIR-V 357 (533) Q Consensus 282 ~lid~lD~~~s~~~~~-~~~~~~~i~v~-~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~-~~~~~ir-~ 357 (533) +-+..|+..--..... ....+....|+ ..++.+..- .+...+.+... .. ++...+. ....++. + T Consensus 274 ~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~-----~~~~~g~~v~g--~~-----~~v~~~~~~~~~~~~~~ 341 (536) T protein:vir:10 274 GDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRL-----TKAQTGDFVTG--RP-----EDISFLQLEKQADFTVA 341 (536) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhh-----ccCCCcceecC--Cc-----ccceeeeccccccchHH Confidence 5567777665444432 23334333332 222211100 00011111110 00 0000011 0112221 1 Q ss_pred HHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhhccCCC Q lcl|NC_016654. 358 LEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARH-FGSALGPLSTTCLRVDAIKFPGKG 436 (533) Q Consensus 358 e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~-~~~al~~li~~il~l~~~~~~~~~ 436 (533) .+-++.++.-++....... +....+...|||||..+.+......+-.-.. -...|.-|+..++.+... .|.- T Consensus 342 ~~~i~~~~~rI~~af~~~~-----l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r--~g~l 414 (536) T protein:vir:10 342 KAVSDAIEARLSFAFMLNS-----AVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQA--TQQI 414 (536) T ss_pred HHHHHHHHHHHHHHHhhhh-----cccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh--CCCC Confidence 1223333333333322111 2222334469999998887777655543222 223344455554444311 1111 Q ss_pred -CCCceeEEEEeCCCCCCCHHHHHHH-------HHHHHhCC------CCCHHHHHHHh---CCC------CCHHHHHHHH Q lcl|NC_016654. 437 -AAPSEELELEWPKFARESDLAKAQT-------VQAWSVAS------AASTKTKVAYL---HED------WDDERVQEEA 493 (533) Q Consensus 437 -~~~~~~v~i~f~d~i~~d~~e~a~~-------~~~l~~aG------i~S~et~v~~l---~~~------~~dee~~~El 493 (533) ..+...+.+++--++. .....+. ++.+-+.+ .+....+++.+ .+- -+++|++++. T Consensus 415 P~~p~~~v~~~~vs~l~--~l~r~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r 492 (536) T protein:vir:10 415 PELPKEAVEPTISTGLE--AIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKM 492 (536) T ss_pred CCCChhhccceEEecHH--HHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCCchhhcCCHHHHHHHH Confidence 1233445555543332 2222222 22221111 12334444332 220 1555565554 Q ss_pred HHHHHhhhcc--cCccccc-----cccCCCCCC-CCCCCCCCCC Q lcl|NC_016654. 494 DLIDNANTVS--APTFGFG-----TDQPPLPTE-NDPATDPEAV 529 (533) Q Consensus 494 ~rI~~E~~~~--~~~~~~~-----~~~~~~~~~-~~~~~~~~~~ 529 (533) ++-+++++.. ....+.+ ...|+.-.. -+..|..+.- T Consensus 493 ~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 536 (536) T protein:vir:10 493 AQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPGI 536 (536) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCchhHHhhhhccccCCCC Confidence 3322221110 0000000 000000000 0111111111 No 163 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=97.59 E-value=4.1e-05 Score=44.64 Aligned_cols=389 Identities=10% Similarity=0.011 Sum_probs=159.7 Q ss_pred HhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCccc---ceeecChHHHHHHHHHHhhcCCCceEeeC Q lcl|NC_016654. 27 VWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAP---KRYHAPIPGVIAKLSTTELFSEQLKFLDA 103 (533) Q Consensus 27 ~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~---~~~~~n~~k~i~~~~a~ll~~e~~~i~~~ 103 (533) -||.-.+. ........ ...+....++++.++-.|... .-+.+.--..+++.+|+-+-+=|..+--. T Consensus 1 ~~~~r~~~------~~~~~~~~-----~~~~~~~~~~g~~~s~~~~~vt~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~ 69 (419) T protein:vir:14 1 MFFSRQLL------SNLGQTQM-----SAGGWVSALLGSSRSDSGQVVTPASALALTVLQNCVTLLAESIAQLPIELYER 69 (419) T ss_pred Cccccccc------cccccccc-----CcchhhHHhhcCCCccCCcccchHHhhccHHHHHHHHHHHHhhccCceEEEEe Confidence 12111000 00000111 111233444444433222221 11222334445666666555545444221 Q ss_pred C-Cc--hHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEE Q lcl|NC_016654. 104 G-KS--KEVQARADLIFNT--P---RFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAV 175 (533) Q Consensus 104 ~-~~--~~~~~~l~~i~~~--n---~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v 175 (533) . +. ......|..+|.. | ....-....+...+..|.+|+.+..|..+. -+.+-.++|+++.+..+.+ T Consensus 70 ~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~-~~~l~pl~~~~v~v~~~~~----- 143 (419) T protein:vir:14 70 SGEDRKPATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGV-IQGLYPLDNEAVTVMRGSD----- 143 (419) T ss_pred cCCccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc-EEEEEEecCceEEEEECCC----- Confidence 1 11 1111224444432 1 112223334556677899998888775542 2346666777776553221 Q ss_pred EEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecC Q lcl|NC_016654. 176 TFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPN 255 (533) Q Consensus 176 ~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn 255 (533) ...+| .+. +. . + ++- --+.|+.. T Consensus 144 ----------~~~~y-------------~~~-~~--~-~--~~~----------------------------~~i~h~~~ 166 (419) T protein:vir:14 144 ----------LKPVY-------------RVR-GS--D-P--MPQ----------------------------RLVHHVRW 166 (419) T ss_pred ----------ceEEE-------------EEc-cC--c-c--cch----------------------------hheeEecC Confidence 11111 111 00 0 0 000 01222222 Q ss_pred CcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhC-cceeeechHHhcCCCCcccc----ccCcchhh Q lcl|NC_016654. 256 VTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIG-AGKVHASESVLTNLGMGQGV----SLDEEQEV 329 (533) Q Consensus 256 ~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~-~~~i~v~~~~l~~~~~~~~~----~~d~~~~~ 329 (533) .. +. ..+|.|.+..+. ..|+ ++....++... |+.| ....++ ...+..... ..+..... T Consensus 167 ~~----~d-----g~~G~s~i~~~~-~~i~-~~~~~~~~~~~~f~ng~~p~gil-----~~~~~~~~~~~~~~~~~~~~~ 230 (419) T protein:vir:14 167 MS----IN-----GYTGLSPVLLHA-NAIG-HAQAIQQYAGKSFMNGTALSGVI-----ERPKDAPALKDQASVDRITDG 230 (419) T ss_pred cC----CC-----CcccccHHHHHH-HHHH-HHHHHHHHHHHHHhccCCccEEE-----EecCCCCcccCHHHHHHHHHH Confidence 11 11 235777776543 4443 23333344433 4543 333333 111111000 00001111 Q ss_pred hhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHH Q lcl|NC_016654. 330 YSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKT 405 (533) Q Consensus 330 ~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~ 405 (533) +.....+..++++ +....++.++......++++..+...++|+...|++|..+|...++.-+. +... T Consensus 231 ~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~--~E~~------- 301 (419) T protein:vir:14 231 WNAKFGGSGNAKKVALLQEGMTFRPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSN--IEHQ------- 301 (419) T ss_pred HHHHhcCccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccc--HHHH------- Confidence 1111111111110 11123455555555567778888888999999999999998654332221 1111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC Q lcl|NC_016654. 406 TRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWD 485 (533) Q Consensus 406 ~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~ 485 (533) .+..++..|.-++..+....+..+..........+.++++.-+-.|..++++.+.+++.+|+|+..++.+++ ++. T Consensus 302 ---~~~f~~~~L~P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~--gl~ 376 (419) T protein:vir:14 302 ---SLQFVIYTLLPWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLE--NMP 376 (419) T ss_pred ---HHHHHHHHHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh--CCC Confidence 123344445554444432222222211112234566666676778999999999999999999999976654 222 Q ss_pred HHHHHHHHHHHHHhhhcccCccccccccCCCCC-CCCCCCCC--CCCCCCC Q lcl|NC_016654. 486 DERVQEEADLIDNANTVSAPTFGFGTDQPPLPT-ENDPATDP--EAVDEGE 533 (533) Q Consensus 486 dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~~~~d~~ 533 (533) .-+ .-++. ..| .+..+... +..+.+.+ ......| T Consensus 377 p~~---gGD~~------~~~-----~n~~~~~~~~~~~~~~~~~~~~~~~e 413 (419) T protein:vir:14 377 PVK---GGDIY------LSP-----MNMVDASKPQQLPVGKSEPTKAAIDE 413 (419) T ss_pred CCC---CcCee------eec-----cccccccccccccCCCCCCccccccc Confidence 100 00000 000 01111111 11122222 2222233 No 164 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=97.59 E-value=4.1e-05 Score=44.64 Aligned_cols=374 Identities=10% Similarity=-0.006 Sum_probs=153.9 Q ss_pred hhcCCHHHHHHHHhccCc-chhhHHHHHH----HHHHHHHhcccCCCCCcc---cceeecChHHHHHHHHHHhhcCCCce Q lcl|NC_016654. 28 WWEGDLDKLATFYGAEGR-TSPSGIKART----KAAYEAFHGRTPTATGRA---PKRYHAPIPGVIAKLSTTELFSEQLK 99 (533) Q Consensus 28 w~~gd~~~l~~~y~~~~~-~~~~~~~~~~----~~~~~~~~~~~~~~~g~~---~~~~~~n~~k~i~~~~a~ll~~e~~~ 99 (533) =|.|=. .+++.... ..-....... ......++.+. .|.. +.-+...--..+|+.+|+-+-+=|.. T Consensus 1 m~m~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~g~~v~~~~al~~~~v~~~v~~ia~~ia~lp~~ 73 (392) T protein:vir:74 1 MILPIL----NFINQTNDPPEAGSVQSYFPDGNDAQIMESLLGD---NNEWVSARAALRNSDLFSIILQLSSDLAIVKIN 73 (392) T ss_pred Ccchhh----hhhhcccCcccccccccccccCchhhhhhhccCC---CCcccchhhhhcchHHHHHHHHHHHhhccCcee Confidence 223321 22222111 0000000000 00011111111 1111 11122233334556666655444433 Q ss_pred EeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEec-CCceEEEEE Q lcl|NC_016654. 100 FLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRW-GRLVAVTFW 178 (533) Q Consensus 100 i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~-g~~~~v~f~ 178 (533) +.... ....+++=-..-.-..-+...+...+..|.+|+.+..|..+. -+.+..++|+++.+..+. +.. T Consensus 74 --~~~~~--~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~-~~~L~~i~~~~v~v~~~~~~~~------ 142 (392) T protein:vir:74 74 --AEKKK--NQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGA-DMKWEYLRPSQVNTYYFEYENG------ 142 (392) T ss_pred --eccch--hhhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCc-EEEEEEEcCceeEEEEcCCCce------ Confidence 32221 112222110000112334444557788899998888886543 246666777777655322 211 Q ss_pred EEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcc Q lcl|NC_016654. 179 SELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTP 258 (533) Q Consensus 179 ~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~ 258 (533) ..|+ |...+...+..+.+ +.--+.|++.... T Consensus 143 ---------~~y~--------------~~~~~~~~~~~~~~--------------------------~~~evih~~~~~~ 173 (392) T protein:vir:74 143 ---------MYYN--------------ITFDDPKIEPILQA--------------------------PQSDLIHMKLLSI 173 (392) T ss_pred ---------EEEE--------------EEecCCccceeEEE--------------------------cCccEEEecCCCC Confidence 1111 00000010100000 0001233332211 Q ss_pred cccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCcc-eee--echHHhcCCCCccccccCcchhhhhhcc Q lcl|NC_016654. 259 NPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGAG-KVH--ASESVLTNLGMGQGVSLDEEQEVYSRVG 334 (533) Q Consensus 259 ~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~~-~i~--v~~~~l~~~~~~~~~~~d~~~~~~~~~~ 334 (533) + ....|.|.+..+ ...| .++....++... |+.+-. ..+ ++...... +.....+..-. T Consensus 174 ~--------~~~~G~s~i~~~-~~~i-~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~---------~~~~~~~~~~~ 234 (392) T protein:vir:74 174 D--------GGKTGISPLYSL-RRES-KIQRASDRLTISSLNSSLNVPGVLTVKGGGLLS---------DKDKASRSRSF 234 (392) T ss_pred C--------CccccccHHHHH-HHHH-HHHHHHHHHHHHHHhccCCCceEEEeCCCCCch---------HHHHHHHHHHH Confidence 1 123577777654 3555 344444555444 355432 222 22211100 11111111111 Q ss_pred cccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHH Q lcl|NC_016654. 335 SGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKA 410 (533) Q Consensus 335 ~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~ 410 (533) .+..++++ +....++.++.+..+.++++..+....+|+...|+||..+|+......++.+.+. T Consensus 235 ~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~e~~~~------------- 301 (392) T protein:vir:74 235 MKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQISG------------- 301 (392) T ss_pred hccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHH------------- Confidence 11111111 1223456666666677888888888999999999999999865443322222222 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHh-CCCCCHHHH Q lcl|NC_016654. 411 RHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYL-HEDWDDERV 489 (533) Q Consensus 411 ~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l-~~~~~dee~ 489 (533) .++.+|..+++.+..-.+..+. ..+.++....+-.|..+.++.+.+++.+|++++.++.+.+ ..++...++ T Consensus 302 -~~~~~l~p~~~~ie~~l~~~l~-------~~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~pne~ 373 (392) T protein:vir:74 302 -MYASALNRYLRPAISELEYKLS-------DHISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDL 373 (392) T ss_pred -HHHHHHHHHHHHHHHHHHHhcc-------chhcccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCcccc Confidence 2333444444443322222221 1122333334446788899999999999999999987653 223433322 Q ss_pred HHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 490 QEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVD 530 (533) Q Consensus 490 ~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (533) .+ .|+ .|+...| +++ +..| T Consensus 374 r~------~en---l~~~~~G-------d~~------~p~p 392 (392) T protein:vir:74 374 PA------PEN---TNKKTTG-------QSN------EPVP 392 (392) T ss_pred ch------hcC---CCCCCCC-------CCC------CCCC Confidence 21 111 1111111 001 1111 No 165 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=97.57 E-value=4.3e-05 Score=44.51 Aligned_cols=388 Identities=7% Similarity=-0.036 Sum_probs=160.9 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcc---ccee Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRA---PKRY 77 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~---~~~~ 77 (533) |-+ -+|++.+.......+.... ..+...+...|.. ..-+ T Consensus 1 m~~---------------------------------~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~g~~v~~~~al 42 (419) T protein:vir:57 1 MFI---------------------------------PQFWKGRPSENRVNWQVVP-----GGMRSSSSQAGVIITPETAL 42 (419) T ss_pred Ccc---------------------------------hhhhccCCccccccccccc-----cccccccccCCceechHHhh Confidence 222 2222222111111111000 0011111111111 0112 Q ss_pred ecChHHHHHHHHHHhhcCCCceE-eeCCCc---hHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhhCCEEEEEEEcCC Q lcl|NC_016654. 78 HAPIPGVIAKLSTTELFSEQLKF-LDAGKS---KEVQARADLIFNT--P---RFHSSLVEAGESCSALSGSFQRIVWDPT 148 (533) Q Consensus 78 ~~n~~k~i~~~~a~ll~~e~~~i-~~~~~~---~~~~~~l~~i~~~--n---~f~~~~~~~~~~~~~~G~~~~~~~~D~~ 148 (533) .+.--...++.+|+-+-+=|..+ ....+. ......|.++|.. | ....-....+......|.+|+.+..|.. T Consensus 43 ~~~~v~~~i~~ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~ 122 (419) T protein:vir:57 43 ALSAVRACVTLLAESVAQLPCVLYRRTENGGREIAFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGR 122 (419) T ss_pred ccHHHHHHHHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC Confidence 22223445555555554444333 111110 0011223444421 1 1223334445566778999888887765 Q ss_pred CCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccc Q lcl|NC_016654. 149 IADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDI 228 (533) Q Consensus 149 ~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~ 228 (533) |. -+.+..++|..+.++-+.+ +..+|+ +.+ .|..++.+ T Consensus 123 G~-~~~L~pl~~~~v~v~~~~~---------------g~~~y~--------------~~~----~~~~~~~~-------- 160 (419) T protein:vir:57 123 GD-ITELIPINPHKVIVLKGPD---------------GMPYYD--------------IPS----IGEILPMR-------- 160 (419) T ss_pred Cc-EEEEEEEcCcceEEEECCC---------------ceEEEE--------------EcC----CceEEchh-------- Confidence 43 2456666776666543211 110110 110 01111110 Q ss_pred cccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhC-cceee Q lcl|NC_016654. 229 AVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIG-AGKVH 306 (533) Q Consensus 229 ~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~-~~~i~ 306 (533) -+.|+.+.. + ...+|.|.+..+. ..|+. +....++... |+.| .+..+ T Consensus 161 --------------------~vih~r~~~----~-----d~~~G~s~i~~~~-~~i~~-~~~~~~~~~~~f~ng~~p~gi 209 (419) T protein:vir:57 161 --------------------MVHHIKSFS----L-----DGYIGTSPIQTNP-DVLGL-GIAVEQHAAQVFARGTTMSGV 209 (419) T ss_pred --------------------hEEEecCcC----C-----CCcccccHHHHHH-HHHHH-HHHHHHHHHHHHHccCCccEE Confidence 122332211 1 1235777766533 44432 3334444444 3443 33322 Q ss_pred echHHhcCCCC--cc--ccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCC Q lcl|NC_016654. 307 ASESVLTNLGM--GQ--GVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYS 378 (533) Q Consensus 307 v~~~~l~~~~~--~~--~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s 378 (533) + ...+. .. ....+.....|.....+..++++ +....++.++......++.+..+...++|+...|++ T Consensus 210 l-----~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP 284 (419) T protein:vir:57 210 I-----ERPFEAKAIASQAAVDAILAKWTERYGGVRNAFSVGMLQEGMTYKQLSQDNEKAQLLQSRQYTVNEVCRLYKVP 284 (419) T ss_pred E-----EecCcCCcccCHHHHHHHHHHHHHHhccccccccceecCCCceEEEcCCChhhHHHHHHHHHHHHHHHHHhCCC Confidence 2 11110 00 00011111112111111111111 122345666666677788888888889999999999 Q ss_pred hhhcccCCCcch-hHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHH Q lcl|NC_016654. 379 PVSLGLSDEVAQ-TATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLA 457 (533) Q Consensus 379 ~~~~g~~~~~~~-Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e 457 (533) |..+|....+.- ++++. ....++..|..++..+....+..+..........+.++++.-+..|..+ T Consensus 285 p~~lg~~~~~t~sn~e~~-------------~~~f~~~~l~P~~~~ie~~l~~~ll~~~~~~~~~i~fd~~~ll~~d~~~ 351 (419) T protein:vir:57 285 PHMIQDLQKSTNNNIEHQ-------------GLQYVIYTMLAILKRHESAMMRDLLLPSERRDFYIEFNVSSLLRGDQKS 351 (419) T ss_pred HHHhCCCCCCccccHHHH-------------HHHHHHHHHHHHHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHH Confidence 999986544322 22221 2233445555555544332222222111112345666677777889999 Q ss_pred HHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCC-CCCCCCCCCCCCCC Q lcl|NC_016654. 458 KAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTE-NDPATDPEAVDEGE 533 (533) Q Consensus 458 ~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~d~~ 533 (533) +++.+.+++.+|+|+..++.+.+ ++..-+ .+ +..-...+..+.... ......|+..+|.+ T Consensus 352 ~~~~~~~~~~~G~~T~NE~R~~~--gl~p~~-----------gg---D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~ 412 (419) T protein:vir:57 352 RYESYALGRQWGWLSVNDIRRME--NLTPIP-----------GG---DKYLTPLNMVDSKALTGIGKATPQQLKDIE 412 (419) T ss_pred HHHHHHHHHhCCCcCHHHHHHHh--CCCCCC-----------Cc---CeeeeccccccccccccccCCCcccCcchh Confidence 99999999999999999977654 232200 00 000000011111000 00111233344444 No 166 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=97.56 E-value=4.4e-05 Score=44.45 Aligned_cols=417 Identities=11% Similarity=0.005 Sum_probs=156.4 Q ss_pred CCCCCCcCCCcCc--chHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceee Q lcl|NC_016654. 1 MSLPEANTAWPPP--ELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYH 78 (533) Q Consensus 1 ~~~~~~~~~~pp~--~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 78 (533) |=|--+-+.==|. +..+.+...-.| .|..|-+ + +.....+.+. ... T Consensus 1 ~~~~~~~~~~~p~~~~~~~~~~~~~~~-~~~~g~~--~------------~~~~~~~~~~-----------------~~~ 48 (518) T protein:vir:78 1 MLLANGQTLSAPAMAELSPQMQDSYYY-APAVGMQ--L------------ERQFSLYGGI-----------------YKN 48 (518) T ss_pred CcccCceeeccchhhhhhhhhhhcccc-cceecee--c------------ccccchhhHH-----------------hhh Confidence 4333222222221 111100000000 0111100 0 0000000000 000 Q ss_pred cChHHHHHHHHHHhhcCCCceEeeCCC---chHHHHHHHHHHhhccHH----HHHHHHHHHHhhhCCEEEEEEEcCCCCC Q lcl|NC_016654. 79 APIPGVIAKLSTTELFSEQLKFLDAGK---SKEVQARADLIFNTPRFH----SSLVEAGESCSALSGSFQRIVWDPTIAD 151 (533) Q Consensus 79 ~n~~k~i~~~~a~ll~~e~~~i~~~~~---~~~~~~~l~~i~~~n~f~----~~~~~~~~~~~~~G~~~~~~~~D~~~~~ 151 (533) .+.-..+|+.+|+-+-+=|..+--... .+.....+..++..-+-. .-....+...+..|.+|+.+..|..+. T Consensus 49 ~~~V~acV~~IA~~iA~lp~~l~~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~- 127 (518) T protein:vir:78 49 QPWVRTVIAKRAQALARLPVKCMFTSGDTETEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGT- 127 (518) T ss_pred hHHHHHHHHHHHHhhccCceEEEEEcCCccccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCc- Confidence 111133455555544443433321111 111111233333332221 223334445566799999888776543 Q ss_pred ceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccc Q lcl|NC_016654. 152 NAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVE 231 (533) Q Consensus 152 ~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~ 231 (533) -+.+-.++|+++.+..+.... ...| . |......-+..+.+ + T Consensus 128 ~~~L~~l~p~~Vtv~~~~~~~--------------~~~y-------------~-~~~~~~~~~~~~~~---~-------- 168 (518) T protein:vir:78 128 PEKLMPMHPSRVAIKRNSRTG--------------RYEY-------------Y-FQAGAGVGTQLVSF---A-------- 168 (518) T ss_pred EEEEEEECCCceEEEEcCCCC--------------EEEE-------------E-EEecCCccceeEEe---c-------- Confidence 245666777777665432110 0000 0 11000000000000 0 Q ss_pred ccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCc-ceeeech Q lcl|NC_016654. 232 GADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGA-GKVHASE 309 (533) Q Consensus 232 ~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~-~~i~v~~ 309 (533) .--+.|+.+...+ ....|.|.+..+. ..|. +......+... |+.|. ...++ T Consensus 169 ---------------~~eIiHir~~~~d--------g~~~G~Spi~~~~-~~i~-~~~aa~~~~~~~f~Ng~~p~gvl-- 221 (518) T protein:vir:78 169 ---------------DDEVVPIRFFNPD--------GLERGLSLMESLK-STIF-SEDSSRNATAAMWKNAGRPNLVL-- 221 (518) T ss_pred ---------------CCcEEEecCCCCC--------cccccccHHHHHH-HHHH-HHHHHHHHHHHHHhcCCCccEEE-- Confidence 0012233221111 1124677665433 3443 23333444433 45543 22222 Q ss_pred HHhcCCCCccccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccC Q lcl|NC_016654. 310 SVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLS 385 (533) Q Consensus 310 ~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~ 385 (533) ...+.-...........|.....+..+.++ +....++.++......++++..+...++|+...|++|..+|+. T Consensus 222 ---~~~~~ls~e~~~~~k~~~~~~~~G~~nag~~~vL~~G~~~~~l~~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~ 298 (518) T protein:vir:78 222 ---RHEKRLSPEAQQRLREQFDRAHAGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHIL 298 (518) T ss_pred ---ecCCCCCHHHHHHHHHHHHHHhcCcccCCceeEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccC Confidence 111110000000011112211111111211 1122355566666677888888888899999999999999875 Q ss_pred CCcch-hHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHH Q lcl|NC_016654. 386 DEVAQ-TATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQA 464 (533) Q Consensus 386 ~~~~~-Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~ 464 (533) .++.- +.++. ....+..+|.-++..+....+..+... ......+.++.+.-+..|..++++.+.+ T Consensus 299 ~~st~sn~e~~-------------~~~f~~~tL~P~~~~ie~eln~~L~~~-~~~~~~~~fd~~~Llr~D~~~r~~~~~~ 364 (518) T protein:vir:78 299 DRATFSNISAQ-------------MRAFYRDTMAIPIARIQSAMDKYVGQY-WVRKNRMKFDIDDVIQPDWEAKSESTQK 364 (518) T ss_pred CCCCchhHHHH-------------HHHHHHHHHHHHHHHHHHHHHHhhccc-ccCcceEEeechhhhccCHHHHHHHHHH Confidence 44321 22221 222333444444444332222222111 1123456666667778899999999999 Q ss_pred HHhCCCCCHHHHHHHhCC-CCCHHHHHHHH-----HHHHHhh-----hcccCcccc-------ccccCCCCCC------C Q lcl|NC_016654. 465 WSVASAASTKTKVAYLHE-DWDDERVQEEA-----DLIDNAN-----TVSAPTFGF-------GTDQPPLPTE------N 520 (533) Q Consensus 465 l~~aGi~S~et~v~~l~~-~~~dee~~~El-----~rI~~E~-----~~~~~~~~~-------~~~~~~~~~~------~ 520 (533) ++.+|+|+..++.+.+.- -.++....+-+ ..+..-. +.+.+.... ..++.+.... . T Consensus 365 ~~~~G~lT~NE~R~~~gl~pie~~~gD~~~v~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (518) T protein:vir:78 365 MVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPASTPVASLDQSPPASVPGLSPTN 444 (518) T ss_pred HHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccceecccccccccCCCCCCCCCCCCcccccccccCccccCCCCCccc Confidence 999999999997765421 12221121110 1110000 000000000 0000000000 0 Q ss_pred CCCCCCCCCCCCC Q lcl|NC_016654. 521 DPATDPEAVDEGE 533 (533) Q Consensus 521 ~~~~~~~~~~d~~ 533 (533) .+..+..+.+|+| T Consensus 445 ~~~~~~~~~~~~~ 457 (518) T protein:vir:78 445 SDRSTDSGKTEPR 457 (518) T ss_pred ccccccccccchh Confidence 0000111222222 No 167 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=97.53 E-value=5e-05 Score=44.17 Aligned_cols=425 Identities=11% Similarity=0.041 Sum_probs=151.9 Q ss_pred CCC-----CCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhh-HHHHHHHHHHHHHhcccCCCCCccc Q lcl|NC_016654. 1 MSL-----PEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPS-GIKARTKAAYEAFHGRTPTATGRAP 74 (533) Q Consensus 1 ~~~-----~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~ 74 (533) +.+ =... |+..+|.+..+.... .||.......+. .....+ +.+ . T Consensus 43 ~~~~~~~~~~~~----~a~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~l~~~l----~~~----------~- 92 (563) T protein:vir:95 43 YQDLTKSLYGQQ----QAYAEPFIEMMDTNP-----------EFRDKRSYMKNEHNLHDVL----KKF----------G- 92 (563) T ss_pred HHHHHhhhccCC----CcchhhhHhhhcccc-----------cccccccCCCCcccHHHHH----HHh----------h- Confidence 000 0000 011122222211110 011110000000 000000 000 0 Q ss_pred ceeecChHHHHHHHHHHhhc--------CC-----CceE---eeCC--CchHHHHHHHHHHh----h-----ccHHHHHH Q lcl|NC_016654. 75 KRYHAPIPGVIAKLSTTELF--------SE-----QLKF---LDAG--KSKEVQARADLIFN----T-----PRFHSSLV 127 (533) Q Consensus 75 ~~~~~n~~k~i~~~~a~ll~--------~e-----~~~i---~~~~--~~~~~~~~l~~i~~----~-----n~f~~~~~ 127 (533) ...+...+++..++.+. ++ +..+ +... ........|..++. . ..|...+. T Consensus 93 ---~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~ 169 (563) T protein:vir:95 93 ---NNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCK 169 (563) T ss_pred ---cchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHH Confidence 01122222222221111 00 0000 0000 00011122333221 1 12455666 Q ss_pred HHHHHHhhhCCEEEEEEEcCCCCCc-eEEEEEcCCeEEEEEecC-CceEEEEEEEEeecCCceEEEEEEEecCeeEEEEE Q lcl|NC_016654. 128 EAGESCSALSGSFQRIVWDPTIADN-AWIDFVDADRAIPEFRWG-RLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAV 205 (533) Q Consensus 128 ~~~~~~~~~G~~~~~~~~D~~~~~~-~~i~~v~~~~~~P~~~~g-~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~ 205 (533) .++...+..|.+++.+.+..++.++ +.+..++|..+.++.+.+ .+ |+ +.+.|.. T Consensus 170 ~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~-----------------~~-------~~~~y~~ 225 (563) T protein:vir:95 170 KIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKI-----------------IK-------GGKRFVQ 225 (563) T ss_pred HHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCce-----------------ec-------cceeEEE Confidence 6777888899999887766554443 457778888888765432 21 00 0000000 Q ss_pred EeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHH Q lcl|NC_016654. 206 YKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFH 285 (533) Q Consensus 206 y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid 285 (533) +. . +.....++. .+ -++|+.+..... ....+|.|.+..+. ..| T Consensus 226 ~~-~-g~~~~~~~~----------~e-----------------vI~~~~~~~~d~------~~~~~G~Spi~~a~-~~i- 268 (563) T protein:vir:95 226 VV-D-KRVVASFTS----------RE-----------------LAMGIRNPRTEL------SSSGYGLSEVEIAM-KEF- 268 (563) T ss_pred Ee-C-CceeEEecC----------cc-----------------eEEEeccCCCCc------ccCcccchHHHHHH-HHH- Confidence 00 0 000000000 00 022333321111 12456888877654 444 Q ss_pred HHHHHHHHHHHH-HHhC-cceeeechHHhcCCCCcc--ccccCcchhhhhhcccccccccc-----ccccceeeechhhh Q lcl|NC_016654. 286 ELDRIYSSLMRD-FRIG-AGKVHASESVLTNLGMGQ--GVSLDEEQEVYSRVGSGGFNANG-----DMETIFEFFQPAIR 356 (533) Q Consensus 286 ~lD~~~s~~~~~-~~~~-~~~i~v~~~~l~~~~~~~--~~~~d~~~~~~~~~~~~~~~~~~-----~~~~~i~~~~~~ir 356 (533) .+......+... |+.| ...-+ |...++.. ....+...+.+.....+..+++. +....++.++.... T Consensus 269 ~~~~~~~~~~~~~f~ng~~p~gi-----L~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~ 343 (563) T protein:vir:95 269 IAYNNTESFNDRFFSHGGTTRGI-----LQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTAN 343 (563) T ss_pred HHHHHHHHHHHHHHHccCCCceE-----EEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChh Confidence 334444444444 4544 23322 21111100 00000111112211111111110 11123556666667 Q ss_pred hHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhccCC Q lcl|NC_016654. 357 VLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTR-AKARHFGSALGPLSTTCLRVDAIKFPGK 435 (533) Q Consensus 357 ~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~-~~~~~~~~al~~li~~il~l~~~~~~~~ 435 (533) ..++++..+...++|+...|++|..+|+...+..+++.-.+.. ...++. .....++.+|..++..+....+..+... T Consensus 344 d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~--~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~ 421 (563) T protein:vir:95 344 DMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTL--NEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISE 421 (563) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccch--hhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchh Confidence 7788999999999999999999999997554332211111000 111111 1234455555555555443333222111 Q ss_pred CCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCH-HHHHHHH--------HHH----HHhh-- Q lcl|NC_016654. 436 GAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDD-ERVQEEA--------DLI----DNAN-- 500 (533) Q Consensus 436 ~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~d-ee~~~El--------~rI----~~E~-- 500 (533) ....+.+.|.+.-+.++.+..+ +.+++.+|+|+..++.+.+ + +.. +.-+.-+ ... ..+. T Consensus 422 ---~~~~~~~~f~r~D~~~~~e~~~-~~~~~~~G~lT~NE~R~~~-g-l~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~ 495 (563) T protein:vir:95 422 ---YGDKYTFQFVGGDTKSATDKLN-ILKLETQIFKTVNEAREEQ-G-KKPIEGGDIILDASFLQGTAQLQQDKQYNDGK 495 (563) T ss_pred ---cccccEEEeccCCHHHHHHHHH-HHHHhcCCccCHHHHHHHh-C-CCCCCCcceeecccccccccccccccCCCccc Confidence 1234667776664444444433 3456788999999976654 2 211 0000000 000 0000 Q ss_pred -hcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 501 -TVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 501 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ....+....+...+....+.+++++ ..+++++ T Consensus 496 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 528 (563) T protein:vir:95 496 QKERLQMMMSLLEGDNDDSEEGQSTD-SSNDDKE 528 (563) T ss_pred cchhhhhcccccCCCCCCCCCCCCCC-CCCCccc Confidence 0000000000000100001111110 0011111 No 168 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=97.53 E-value=5e-05 Score=44.17 Aligned_cols=425 Identities=11% Similarity=0.041 Sum_probs=151.9 Q ss_pred CCC-----CCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhh-HHHHHHHHHHHHHhcccCCCCCccc Q lcl|NC_016654. 1 MSL-----PEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPS-GIKARTKAAYEAFHGRTPTATGRAP 74 (533) Q Consensus 1 ~~~-----~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~ 74 (533) +.+ =... |+..+|.+..+.... .||.......+. .....+ +.+ . T Consensus 43 ~~~~~~~~~~~~----~a~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~l~~~l----~~~----------~- 92 (563) T protein:vir:99 43 YQDLTKSLYGQQ----QAYAEPFIEMMDTNP-----------EFRDKRSYMKNEHNLHDVL----KKF----------G- 92 (563) T ss_pred HHHHHhhhccCC----CcchhhhHhhhcccc-----------cccccccCCCCcccHHHHH----HHh----------h- Confidence 000 0000 011122222211110 011110000000 000000 000 0 Q ss_pred ceeecChHHHHHHHHHHhhc--------CC-----CceE---eeCC--CchHHHHHHHHHHh----h-----ccHHHHHH Q lcl|NC_016654. 75 KRYHAPIPGVIAKLSTTELF--------SE-----QLKF---LDAG--KSKEVQARADLIFN----T-----PRFHSSLV 127 (533) Q Consensus 75 ~~~~~n~~k~i~~~~a~ll~--------~e-----~~~i---~~~~--~~~~~~~~l~~i~~----~-----n~f~~~~~ 127 (533) ...+...+++..++.+. ++ +..+ +... ........|..++. . ..|...+. T Consensus 93 ---~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~ 169 (563) T protein:vir:99 93 ---NNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCK 169 (563) T ss_pred ---cchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHH Confidence 01122222222221111 00 0000 0000 00011122333221 1 12455666 Q ss_pred HHHHHHhhhCCEEEEEEEcCCCCCc-eEEEEEcCCeEEEEEecC-CceEEEEEEEEeecCCceEEEEEEEecCeeEEEEE Q lcl|NC_016654. 128 EAGESCSALSGSFQRIVWDPTIADN-AWIDFVDADRAIPEFRWG-RLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAV 205 (533) Q Consensus 128 ~~~~~~~~~G~~~~~~~~D~~~~~~-~~i~~v~~~~~~P~~~~g-~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~ 205 (533) .++...+..|.+++.+.+..++.++ +.+..++|..+.++.+.+ .+ |+ +.+.|.. T Consensus 170 ~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~-----------------~~-------~~~~y~~ 225 (563) T protein:vir:99 170 KIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKI-----------------IK-------GGKRFVQ 225 (563) T ss_pred HHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCce-----------------ec-------cceeEEE Confidence 6777888899999887766554443 457778888888765432 21 00 0000000 Q ss_pred EeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHH Q lcl|NC_016654. 206 YKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFH 285 (533) Q Consensus 206 y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid 285 (533) +. . +.....++. .+ -++|+.+..... ....+|.|.+..+. ..| T Consensus 226 ~~-~-g~~~~~~~~----------~e-----------------vI~~~~~~~~d~------~~~~~G~Spi~~a~-~~i- 268 (563) T protein:vir:99 226 VV-D-KRVVASFTS----------RE-----------------LAMGIRNPRTEL------SSSGYGLSEVEIAM-KEF- 268 (563) T ss_pred Ee-C-CceeEEecC----------cc-----------------eEEEeccCCCCc------ccCcccchHHHHHH-HHH- Confidence 00 0 000000000 00 022333321111 12456888877654 444 Q ss_pred HHHHHHHHHHHH-HHhC-cceeeechHHhcCCCCcc--ccccCcchhhhhhcccccccccc-----ccccceeeechhhh Q lcl|NC_016654. 286 ELDRIYSSLMRD-FRIG-AGKVHASESVLTNLGMGQ--GVSLDEEQEVYSRVGSGGFNANG-----DMETIFEFFQPAIR 356 (533) Q Consensus 286 ~lD~~~s~~~~~-~~~~-~~~i~v~~~~l~~~~~~~--~~~~d~~~~~~~~~~~~~~~~~~-----~~~~~i~~~~~~ir 356 (533) .+......+... |+.| ...-+ |...++.. ....+...+.+.....+..+++. +....++.++.... T Consensus 269 ~~~~~~~~~~~~~f~ng~~p~gi-----L~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~ 343 (563) T protein:vir:99 269 IAYNNTESFNDRFFSHGGTTRGI-----LQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTAN 343 (563) T ss_pred HHHHHHHHHHHHHHHccCCCceE-----EEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChh Confidence 334444444444 4544 23322 21111100 00000111112211111111110 11123556666667 Q ss_pred hHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhccCC Q lcl|NC_016654. 357 VLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTR-AKARHFGSALGPLSTTCLRVDAIKFPGK 435 (533) Q Consensus 357 ~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~-~~~~~~~~al~~li~~il~l~~~~~~~~ 435 (533) ..++++..+...++|+...|++|..+|+...+..+++.-.+.. ...++. .....++.+|..++..+....+..+... T Consensus 344 d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~--~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~ 421 (563) T protein:vir:99 344 DMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTL--NEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISE 421 (563) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccch--hhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchh Confidence 7788999999999999999999999997554332211111000 111111 1234455555555555443333222111 Q ss_pred CCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCH-HHHHHHH--------HHH----HHhh-- Q lcl|NC_016654. 436 GAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDD-ERVQEEA--------DLI----DNAN-- 500 (533) Q Consensus 436 ~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~d-ee~~~El--------~rI----~~E~-- 500 (533) ....+.+.|.+.-+.++.+..+ +.+++.+|+|+..++.+.+ + +.. +.-+.-+ ... ..+. T Consensus 422 ---~~~~~~~~f~r~D~~~~~e~~~-~~~~~~~G~lT~NE~R~~~-g-l~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~ 495 (563) T protein:vir:99 422 ---YGDKYTFQFVGGDTKSATDKLN-ILKLETQIFKTVNEAREEQ-G-KKPIEGGDIILDASFLQGTAQLQQDKQYNDGK 495 (563) T ss_pred ---cccccEEEeccCCHHHHHHHHH-HHHHhcCCccCHHHHHHHh-C-CCCCCCcceeecccccccccccccccCCCccc Confidence 1234667776664444444433 3456788999999976654 2 211 0000000 000 0000 Q ss_pred -hcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 501 -TVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 501 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ....+....+...+....+.+++++ ..+++++ T Consensus 496 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 528 (563) T protein:vir:99 496 QKERLQMMMSLLEGDNDDSEEGQSTD-SSNDDKE 528 (563) T ss_pred cchhhhhcccccCCCCCCCCCCCCCC-CCCCccc Confidence 0000000000000100001111110 0011111 No 169 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=97.50 E-value=5.6e-05 Score=43.91 Aligned_cols=461 Identities=10% Similarity=0.015 Sum_probs=177.7 Q ss_pred CCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChH Q lcl|NC_016654. 3 LPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIP 82 (533) Q Consensus 3 ~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~ 82 (533) |-+-.+-.==+.+...+..+...|..|...-.++.+|... ..+.......+.+..++.-+-+ T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP------------------~~~~~~~~~~~~~~~~~~dst~ 62 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIP------------------SLFPKDSDNASTDYQTPWQAVG 62 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcc------------------cccCCCCCcccccccccccccH Confidence 3333332222334444444444444444333333333211 1111111111223345666777 Q ss_pred HHHHHHHHHhhcCC--C--ceEeeCCCc-------------hHHH-------HHHHHHHhhccHHHHHHHHHHHHhhhCC Q lcl|NC_016654. 83 GVIAKLSTTELFSE--Q--LKFLDAGKS-------------KEVQ-------ARADLIFNTPRFHSSLVEAGESCSALSG 138 (533) Q Consensus 83 k~i~~~~a~ll~~e--~--~~i~~~~~~-------------~~~~-------~~l~~i~~~n~f~~~~~~~~~~~~~~G~ 138 (533) ...|+.+|+.|.+- | +.|.....+ .+++ +.+...+..++|...+.++.+...+.|. T Consensus 63 ~~a~~~Laa~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~ 142 (536) T protein:vir:21 63 ARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGN 142 (536) T ss_pred HHHHHHHHHHHHHhhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCc Confidence 88888888766543 2 444432211 1222 3445568888999999999999999998 Q ss_pred EEEEEEEcCCCCCce-EEEEEcCCeEEEEEe-cCCceEEEEEEEEee--------------cCCceEEEEEEEecCeeEE Q lcl|NC_016654. 139 SFQRIVWDPTIADNA-WIDFVDADRAIPEFR-WGRLVAVTFWSELAG--------------GDGQEVWRHLERHESGYIV 202 (533) Q Consensus 139 ~~~~~~~D~~~~~~~-~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~--------------~~~~~~y~~lE~h~~~~I~ 202 (533) +.++ .+++....+ .+..++-..++..-+ +|++..++.-.+++. ..++..+.. -.|. T Consensus 143 a~ly--~~e~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~------v~v~ 214 (536) T protein:vir:21 143 VLLY--LPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADET------IDVY 214 (536) T ss_pred EeEE--EeeCCCCceeeEEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccccccc------eeEE Confidence 8864 454443333 367777666665544 477766653333221 011111111 1222 Q ss_pred EEEEeccCCcccceeehhhccccccccccccccCCceeecCCCc-cceeEEecCCcccccccccccccccccchhhhhHH Q lcl|NC_016654. 203 HAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVK-DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLF 281 (533) Q Consensus 203 ~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~-~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~ 281 (533) +.+|.-.++. +. .-+.+ +. +....-+.|.. ..-+.|++. +|... ..+.||+|-...++ T Consensus 215 ~~v~~~~~~~---~~--~~~~e-------~~--g~~v~~~~g~~~f~~~P~i~~-----Rw~~~-~ge~YGrgp~~~~l- 273 (536) T protein:vir:21 215 THIYLDEDSG---EY--LRYEE-------VE--GMEVQGSDGTYPKEACPYIPI-----RMVRL-DGESYGRSYIEEYL- 273 (536) T ss_pred EEEEEecCCC---cE--EEEec-------cC--CeeeccccCccccccCCeeee-----eeeec-CCCccccchHHHHH- Confidence 3333211110 00 00000 00 00000011211 101122222 23322 24778998777666 Q ss_pred HHHHHHHHHHHHHHHH-HHhCcceeeec-hHHhcCCCCccccccCcchhhhhhcccccccccccccccee-eechhhh-h Q lcl|NC_016654. 282 PTFHELDRIYSSLMRD-FRIGAGKVHAS-ESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFE-FFQPAIR-V 357 (533) Q Consensus 282 ~lid~lD~~~s~~~~~-~~~~~~~i~v~-~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~-~~~~~ir-~ 357 (533) +-+..|+..--..... ....+....|+ ..++.+..- .+...+.+... .. ++...+. ....++. + T Consensus 274 ~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~-----~~~~~g~~v~g--~~-----~~v~~~~~~~~~~~~~~ 341 (536) T protein:vir:21 274 GDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRL-----TKAQTGDFVTG--RP-----EDISFLQLEKQADFTVA 341 (536) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhh-----ccCCCcceecC--Cc-----ccceeeeccccccchHH Confidence 5567777665444432 23334333332 222211100 00011111110 00 0000011 0112221 1 Q ss_pred HHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhhccCCC Q lcl|NC_016654. 358 LEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARH-FGSALGPLSTTCLRVDAIKFPGKG 436 (533) Q Consensus 358 e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~-~~~al~~li~~il~l~~~~~~~~~ 436 (533) .+-++.++.-++....... +....+...|||||..+.+...+..+-.-.. -...|.-|+..++.+... .|.- T Consensus 342 ~~~i~~~~~rI~~af~~~~-----l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r--~g~l 414 (536) T protein:vir:21 342 KAVSDAIEARLSFAFMLNS-----AVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQA--TQQI 414 (536) T ss_pred HHHHHHHHHHHHHHHhhhh-----cccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh--CCCC Confidence 1223333333333322111 2222334469999998887777655543222 223344455554444311 1111 Q ss_pred -CCCceeEEEEeCCCCCC-----CHHHHHHHHHHHHhCC------CCCHHHHHHHh---CCC------CCHHHHHHHHHH Q lcl|NC_016654. 437 -AAPSEELELEWPKFARE-----SDLAKAQTVQAWSVAS------AASTKTKVAYL---HED------WDDERVQEEADL 495 (533) Q Consensus 437 -~~~~~~v~i~f~d~i~~-----d~~e~a~~~~~l~~aG------i~S~et~v~~l---~~~------~~dee~~~El~r 495 (533) ..+...+.+++--++.. +.+...+.++.+-+.+ .+....+++.+ .+- -+++|++++.++ T Consensus 415 P~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~Pe~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q 494 (536) T protein:vir:21 415 PELPKEAVEPTISTGLEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQ 494 (536) T ss_pred CCCChhhccceEEecHHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHH Confidence 12334455555433321 1111111122221111 12334444332 220 155555554433 Q ss_pred HHHhhhcc--cCcccc--c---cccCCCCCC-CCCCCCCCCC Q lcl|NC_016654. 496 IDNANTVS--APTFGF--G---TDQPPLPTE-NDPATDPEAV 529 (533) Q Consensus 496 I~~E~~~~--~~~~~~--~---~~~~~~~~~-~~~~~~~~~~ 529 (533) -+++++.. ....+. + ...|+.... -+..|..+.- T Consensus 495 ~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 536 (536) T protein:vir:21 495 QSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPGI 536 (536) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcChhhHHhhhhccccCCCC Confidence 22211110 000000 0 000100000 0111111111 No 170 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=97.47 E-value=6.1e-05 Score=43.68 Aligned_cols=414 Identities=10% Similarity=0.047 Sum_probs=153.5 Q ss_pred CCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHH-------Hhc--ccCCCCCcc Q lcl|NC_016654. 3 LPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEA-------FHG--RTPTATGRA 73 (533) Q Consensus 3 ~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~-------~~~--~~~~~~g~~ 73 (533) |++. ++ +..+++ .++ .+..+.|..+.+.... -|. +++...... T Consensus 1 ~~~~--------~~-----~~~~~~--~~~-------------~~~~~~rd~l~~~~~glg~~r~~~~~~~g~~~~~~~~ 52 (449) T protein:vir:10 1 MTDK--------LT-----LAVNHA--LND-------------ARMARARMGLMVPTMGLDNKRHSAWCEYGFPELVTYE 52 (449) T ss_pred Cchh--------hH-----HHHhhh--cch-------------hHHHHHHHHHHHHHhcCCcccchhhhhcCCcccCCHH Confidence 2221 11 111111 000 0111111111111000 010 222211100 Q ss_pred ---cceeecChHHHHHHHHHHhhcCCCceEeeCCCchH------HHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEE Q lcl|NC_016654. 74 ---PKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKE------VQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIV 144 (533) Q Consensus 74 ---~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~------~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~ 144 (533) ..+.+..+++.||+..|+-+..+-+.|.-+.+.+. ....+++++.. +++..+.++...+..+|++++.+. T Consensus 53 ~l~~~Yr~~~ia~~iVd~~~d~~~~~~~~i~~g~~~~~~~~~~~~e~~~~~l~~~-~~~~~l~ea~~~~rl~Gga~i~i~ 131 (449) T protein:vir:10 53 NLYSLYRRGGIAHGAVEKLVGKCWQTNPEIIEGDDADDSEDETSWEKKSKQVFTN-RLWRSFAEADRRRLVGRYAGILLH 131 (449) T ss_pred HHHHHHhcCchhHHHHHhhhhhhhhcCcccccCccccchhhhHHHHHHHHHHHHH-HHHHHHHHHHHhhhccCcEEEEEE Confidence 12335578999999999977666555543222211 22334444443 678889999999888888888776 Q ss_pred EcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEE--EEEEecCeeEEEEEEeccCCcccceeehhhc Q lcl|NC_016654. 145 WDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWR--HLERHESGYIVHAVYKGTATSLGWMMALTDH 222 (533) Q Consensus 145 ~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~--~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~ 222 (533) ++.+ +..- -|+-..+.|..+..+....-.-. .+++ .-+.+ |.-+ .|+-+....|... T Consensus 132 v~d~---~~l~--------~Pl~~~~~i~~i~v~~~~~i~~~-~~~~dp~sp~y--g~P~--~y~v~~~~~g~~~----- 190 (449) T protein:vir:10 132 IRDE---KDWN--------LPATKGRGLQKVSVSWAGSLKVA-EWDTGINSKTY--GQPK--LWKYTERLPNGSS----- 190 (449) T ss_pred ecCC---CCCC--------cccccCcceeeEEeeccccCChh-hhhcCCCCCCC--CCce--EEEEeeeccCCCc----- Confidence 7432 1110 13322234444322111100000 0000 00000 0001 1110000000000 Q ss_pred cccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHH-----HHHH Q lcl|NC_016654. 223 PATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSS-----LMRD 297 (533) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~-----~~~~ 297 (533) ....| +.++ + +.| +. ...-|.|.+....+. +-.++.+-.. +.+. T Consensus 191 --------------~~~~i-H~SR-l-~~~--~~-----------~~~~g~~~L~~~yn~-l~~~~~~~~~~a~~~l~~~ 239 (449) T protein:vir:10 191 --------------RRVDI-HPDR-V-FIL--GD-----------YSEDAIGFLEPAYNA-FVSLEKVEGGSGESFLKNA 239 (449) T ss_pred --------------cceee-ccce-e-Eee--cC-----------CCCCChhHHHHHHHH-hhhHHHhhhhHHHHHHHHH Confidence 00000 1111 0 011 00 011144544432221 1122222111 1111 Q ss_pred HHhC----cceeeechHHhcCCCCccc---cccCcchhhh-hhccccccccccccccceeeechhhhhHHHHHHHHHHHH Q lcl|NC_016654. 298 FRIG----AGKVHASESVLTNLGMGQG---VSLDEEQEVY-SRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLR 369 (533) Q Consensus 298 ~~~~----~~~i~v~~~~l~~~~~~~~---~~~d~~~~~~-~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~ 369 (533) ++.. ..++-+ ..+..-.+.+.. ..+......+ +.......+.+ + .+++++- .+......++.+.+ T Consensus 240 ~rq~~~~~~~~~~~-~~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~i~~~--~--d~~~~~~--~~sgl~d~l~~~~q 312 (449) T protein:vir:10 240 ARQLNVNFEKEIDF-TNLASLYGVSIDELQDKFNEVAGEINRGNDVLMTTQG--A--TVTPLVT--SVADPTATYNVNLQ 312 (449) T ss_pred HHHHhhhhhhhhhh-hhhhHHhhCCchHHHHHHHHHHHHHhccchheeecCC--c--ceEEEec--ccCChhHHHHHHHH Confidence 1100 000100 011100011100 0010000001 11111111111 1 2333322 22344556788888 Q ss_pred HHHHhhCCChhh-cccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeC Q lcl|NC_016654. 370 EVLRKTGYSPVS-LGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWP 448 (533) Q Consensus 370 ~i~~~~g~s~~~-~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~ 448 (533) .++..+|+|... ||-..+|..|...++ --+..+..++..++..|+.|+..++.. . .+.. ..+++|.|+ T Consensus 313 ~iaaa~~IP~t~L~Gqsp~glnst~D~~----nyyd~i~~~Q~~l~p~le~l~~~l~~s---~---~g~~-~~d~~i~f~ 381 (449) T protein:vir:10 313 TAAAGVDIPTRILIGNQQAERSSTEDQK----YFNARCQSRRVDLSFEIEDFCDKLIEL---K---IIDA-VAKKAVIWD 381 (449) T ss_pred HHHHHhCCCeeeeeccCccccccchhHH----HHHHHHHHHHHhhhHHHHHHHHHHHHh---h---cCCC-CCceeEEeC Confidence 899999998543 465555544333332 245556666667899999998876543 1 1222 247999999 Q ss_pred CCCCCCHHHHHHHHHHHHhCCCCCHHHHHHH-hCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCC Q lcl|NC_016654. 449 KFARESDLAKAQTVQAWSVASAASTKTKVAY-LHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPE 527 (533) Q Consensus 449 d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~-l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (533) +--.++..|+|++..+...+ ..+++.. -.+-++.+|+++-+ ..+|..+ + +.+++.+++.++. T Consensus 382 pL~~~t~kEkAei~k~~A~a----~~~~~~ag~~~~~~~~EiR~~~--------~~~~~~~---~--~~~~e~~de~~~~ 444 (449) T protein:vir:10 382 DLNEQTGTEKLTNAKTMGEI----NQTMLGSGDNPAFSREEIRTAA--------GYDNDDE---E--PLGEEDGDEEDKA 444 (449) T ss_pred CCCCCCHHHHHHHHHHHHHH----HHHHHHccccCCcCHHHHHHHh--------cccCCCC---C--CCCCCCCcccccc Confidence 99999999998866554321 1111110 01124554443221 1111111 1 1111112222222 Q ss_pred CCCCC Q lcl|NC_016654. 528 AVDEG 532 (533) Q Consensus 528 ~~~d~ 532 (533) .+... T Consensus 445 ~d~~a 449 (449) T protein:vir:10 445 TDSAA 449 (449) T ss_pred CCcCC Confidence 22222 No 171 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=97.45 E-value=6.6e-05 Score=43.52 Aligned_cols=382 Identities=9% Similarity=-0.038 Sum_probs=145.2 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcc---ccee Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRA---PKRY 77 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~---~~~~ 77 (533) |.|= +.|..+.+.... . .......++.......+.. ..-+ T Consensus 1 Mg~f-----------------------------~~~~~~~~~~~~-~-------~~~~~~~~~~~~~~~~~~~~~~~~~~ 43 (406) T protein:vir:95 1 MGLF-----------------------------DRWRRTKRKSKI-R-------ADTGYVGLFMSGEDVSFLVPGYVRLS 43 (406) T ss_pred Ccch-----------------------------hhhccccccccc-c-------ccchhhhhhccCcccCccccCHHHHh Confidence 2222 111111000000 0 0001111111111111100 0012 Q ss_pred ecChHHHHHHHHHHhhcCCCceEeeC--CCchHHHHHHHHHHh-h-c---cHHHHHHHHHHHHhhhCCEEE--EEEEcCC Q lcl|NC_016654. 78 HAPIPGVIAKLSTTELFSEQLKFLDA--GKSKEVQARADLIFN-T-P---RFHSSLVEAGESCSALSGSFQ--RIVWDPT 148 (533) Q Consensus 78 ~~n~~k~i~~~~a~ll~~e~~~i~~~--~~~~~~~~~l~~i~~-~-n---~f~~~~~~~~~~~~~~G~~~~--~~~~D~~ 148 (533) .......+++.+|+-+.+-|..+--. +........+..+|. . | .....+...+...+..|.++. .+..+.. T Consensus 44 ~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~ 123 (406) T protein:vir:95 44 DNPEVRMAVHKIADLISSMTIYLMQNTEDGDIRIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTAD 123 (406) T ss_pred hcHHHHHHHHHHHHhhccCceEEEEecCCcceeecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCC Confidence 23344566666776666555433111 110111111222221 1 1 122333444444455565543 3444433 Q ss_pred CCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccc Q lcl|NC_016654. 149 IADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDI 228 (533) Q Consensus 149 ~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~ 228 (533) +. -..+-.++|.++-++.+.+. | .++ |. |..++.. T Consensus 124 g~-~~~l~~i~~~~v~~~~~~~~------------------~---------~~~---~~------~~~~~~~-------- 158 (406) T protein:vir:95 124 GL-IDELVPLTPSKVNFLDTPDG------------------Y---------QVL---YG------GQTFNYD-------- 158 (406) T ss_pred Cc-EEEEEEEcCceeEEEEcCCe------------------E---------EEE---ec------cEEEchh-------- Confidence 21 12444566665554432220 1 111 10 1111111 Q ss_pred cccccccCCceeecCCCccceeEEec-CCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCcceee Q lcl|NC_016654. 229 AVEGADEGRGAYVETGVKDLTAAYVP-NVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGAGKVH 306 (533) Q Consensus 229 ~~~~~~~~~~~~~~~g~~~~~~~~~p-n~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~~~i~ 306 (533) -+.|+. +..+. ...+|.|.+..+ ...++. +....++... ++.|...-. T Consensus 159 --------------------evih~~~~~~~~--------~~~~G~s~i~~~-~~~i~~-~~~~~~~~~~~~~ng~~~~~ 208 (406) T protein:vir:95 159 --------------------EVLHFIYNPDPE--------RPYIGRGYRVVL-KDIADN-LKQATATKKSFMSGKYMPSL 208 (406) T ss_pred --------------------HEEEeeccCCCC--------CCccccCHHHHH-HHHHHH-HHHHHHHHHHHHhccCCcce Confidence 122222 11110 123577776653 344533 3333444443 344433212 Q ss_pred echHHhcCCCCccccccCcchhhhhhcccccc--------ccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCC Q lcl|NC_016654. 307 ASESVLTNLGMGQGVSLDEEQEVYSRVGSGGF--------NANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYS 378 (533) Q Consensus 307 v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~--------~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s 378 (533) + +...+.-.........+.+.....+.. ..++.....+..+ .....++++..+...+.|+...|+| T Consensus 209 i----l~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~v~~~~~~~~~~~~~~--~~~d~q~~e~~~~~~~~Ia~~fgVp 282 (406) T protein:vir:95 209 I----VKVDAATAELSSEEGRNAVFKKYLQATEAGQPWIIPAELLEVEQVKPL--SLKDIAINEAVELDKRTVAGMFGVP 282 (406) T ss_pred E----EEeCCCCCHHHHHHHHHHHHHHhccccccCCceeecCCCccccccccC--ChhHHHHHHHHHHHHHHHHHHhCCC Confidence 1 111111000000000111111111111 1111111111122 3344567788888889999999999 Q ss_pred hhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHH Q lcl|NC_016654. 379 PVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAK 458 (533) Q Consensus 379 ~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~ 458 (533) |..+|..... +. . ....++.+|..+++.+....+..+.. .....+.+++++-...|..++ T Consensus 283 ~~~lg~~~~~-----~~--~----------~~~~~~~~l~P~~~~ie~~l~~~l~~---~~~~~~~fd~~~l~~~d~~~~ 342 (406) T protein:vir:95 283 AFLLGIGEFN-----RD--E----------YNNFINSTILPIAKGIEQELTRKLLI---SPDLYFKFNPRSLYAYDLKEL 342 (406) T ss_pred HHHcCCCCch-----HH--H----------HHHHHHHHHHHHHHHHHHHHHHhcCC---CCCcEEEeechhhhcCCHHHH Confidence 9998743221 11 1 11245556666666554333333322 223467777777778899999 Q ss_pred HHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 459 AQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 459 a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ++.+.+++.+|+|+..++.+++ ++..- ++.+++.--. .+-+..... +.....|.....++|+ T Consensus 343 ~~~~~~l~~~G~~t~NE~R~~~--gl~p~---~~gd~~~~~~--n~~~~~~~~------~~~~~k~g~~~~~~~~ 404 (406) T protein:vir:95 343 AEVGSNMYVRGIMEGNEVRDWL--GLSPK---EGLSELVILE--NYIPLDKIG------DQSKLKGGDNSGADGQ 404 (406) T ss_pred HHHHHHHHhCCCcCHHHHHHHh--CCCCC---CCcceeeecc--Cccchhhcc------cccccCCCCCCCCCCC Confidence 9999999999999999977764 23321 0111110000 000000000 0000001111111222 No 172 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=97.44 E-value=6.6e-05 Score=43.49 Aligned_cols=394 Identities=10% Similarity=0.035 Sum_probs=165.7 Q ss_pred cCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHhhcCCCceEeeCC---Cc Q lcl|NC_016654. 30 EGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAG---KS 106 (533) Q Consensus 30 ~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~---~~ 106 (533) +|=.+++..+++........ .......|+.+..... ..-+...--..+++.+|+-+-+=|..+--.. .. T Consensus 1 MG~~~~~~~~~~~~~~~~~~-----~~~~~~~~~g~~~~~~---~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~~~ 72 (411) T protein:vir:81 1 MGWWSRLTRFFRPRNETVDM-----TNPLLLQWLGVDPDTP---RNQLSEATYFACLKILSESLGKLPLKMYQKTERGIV 72 (411) T ss_pred CchHHHHHhhccCccccccc-----chHHHHHHhcCcccCh---hhhhccHHHHHHHHHHHHhHhhCceeEEEecCCcee Confidence 34334444444332211110 0112233333322111 1112222233445555555554454442111 00 Q ss_pred hHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEE Q lcl|NC_016654. 107 KEVQARADLIFNT--P---RFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSEL 181 (533) Q Consensus 107 ~~~~~~l~~i~~~--n---~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~ 181 (533) +.....+..+|+. | ....-+...+...+..|.+|+.+..|. +.-..+..++|+.+.++.+++.... T Consensus 73 ~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~--g~~~~l~~l~~~~v~~~~~~~~~~~------- 143 (411) T protein:vir:81 73 KSDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYSG--PQLQALWILPSQYVTIVVDDRGLLG------- 143 (411) T ss_pred eecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC--CceEEEEEECCceEEEEEcCccccc------- Confidence 1111223333332 1 122334444555677799998887773 2334566788888887755432110 Q ss_pred eecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCccccc Q lcl|NC_016654. 182 AGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPE 261 (533) Q Consensus 182 ~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~ 261 (533) ..+.+.|. |....+ |..+.+ +.--+.|+....+ T Consensus 144 ---------------~~~~~~~~-~~~~~~--g~~~~~--------------------------~~~eiih~k~~~~--- 176 (411) T protein:vir:81 144 ---------------EKNAIWYR-YNDPYD--GKMYVF--------------------------RNDEILHFKTSVT--- 176 (411) T ss_pred ---------------ccceEEEE-EEecCC--ceEEEE--------------------------ccccEEEEcCCCC--- Confidence 01111111 110000 111100 0001223321100 Q ss_pred ccccccccccccchhhhhHHHHHHHHHHHHHHHHHHH-HhCc-ceeeechHHhcCCCCccccccCcchhhhhhccccccc Q lcl|NC_016654. 262 WRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDF-RIGA-GKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFN 339 (533) Q Consensus 262 ~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~~-~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~ 339 (533) . ...+|.|....+. ..+. +.....++...+ +.|. +.-++ ...+.-...........+.....+..+ T Consensus 177 ~-----~~~~G~s~~~~~~-~~i~-~~~~~~~~~~~~f~ng~~p~gil-----~~~~~l~~e~~~~~~~~~~~~~~g~~n 244 (411) T protein:vir:81 177 F-----DGITGLSVRDVLK-HTVD-GALESQKFMNNLYKTGLTGKAVL-----EYTGDLNQEARDRLVKGFEQFANGSKN 244 (411) T ss_pred C-----CCcccccHHHHHH-HHHH-HHHHHHHHHHHHHhccCCCceEE-----EeCCCCCHHHHHHHHHHHHHHhcCccc Confidence 0 1235777766543 4443 334444444443 5542 22222 111100000000011112211111111 Q ss_pred ccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc-hhHHHHHHHhhhHHHHHHHHHHHHH Q lcl|NC_016654. 340 ANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA-QTATEASGKKDLTVKTTRAKARHFG 414 (533) Q Consensus 340 ~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~-~Tatai~~~~~~l~~~~~~~~~~~~ 414 (533) +++ +....++.++......++++..+....+|+...|+||..+|...++. .++.+. ....++ T Consensus 245 ~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~-------------~~~f~~ 311 (411) T protein:vir:81 245 AGKIIPVPLGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQ-------------NLAFYV 311 (411) T ss_pred cCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHH-------------HHHHHH Confidence 111 11123555665556667788888888999999999999998654432 222222 223444 Q ss_pred HHHHHHHHHHHHHHHhhccCC-CCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHH Q lcl|NC_016654. 415 SALGPLSTTCLRVDAIKFPGK-GAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEA 493 (533) Q Consensus 415 ~al~~li~~il~l~~~~~~~~-~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El 493 (533) .+|..++..+..-.+..+... .......+.++++.-+-.|..++++.+.+++.+|+|+..++.+.+ ++..-+ .- T Consensus 312 ~~l~P~~~~ie~~l~~~ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~~--gl~p~~---gg 386 (411) T protein:vir:81 312 DTLLYVLKQYEEEITYKILSNDLISQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDYL--DMPADD---YG 386 (411) T ss_pred HHHHHHHHHHHHHHHhhcCChhhcCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh--CCCCCC---CC Confidence 555555554443223222211 112334567777777788999999999999999999999976654 232210 00 Q ss_pred HHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 494 DLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 494 ~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ++ .-...+..|...- + .+..-.|| T Consensus 387 D~-----------~~~~~n~~pl~~~----~-~~~~kgGd 410 (411) T protein:vir:81 387 NN-----------LMANGNYIPLSML----G-ANYGKGGD 410 (411) T ss_pred Ce-----------eeeccCccchhhh----h-hhhccCCC Confidence 00 0000011111000 0 01111222 No 173 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=97.41 E-value=7.3e-05 Score=43.26 Aligned_cols=392 Identities=13% Similarity=0.048 Sum_probs=157.7 Q ss_pred CCCCC----CcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccce Q lcl|NC_016654. 1 MSLPE----ANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKR 76 (533) Q Consensus 1 ~~~~~----~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 76 (533) |.|=. ..+..|+-.+......+-.|. ...... +. ...- T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~~~~~~~---~~~~~~----~~-------------------------------~~~a 42 (416) T protein:vir:45 1 MGIFYKNEKRDLQYNEDDLQMMVQTLPGFQ---GTKLRQ----YK-------------------------------DIEA 42 (416) T ss_pred CCcccccccccccCCCcchhHHHHHhcccc---ccCccc----cc-------------------------------hhhh Confidence 55431 112333333222221111110 000000 00 0000 Q ss_pred eecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhh--ccH---HHHHHHHHHHHhhhCCEEEEEEEcCCCCC Q lcl|NC_016654. 77 YHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNT--PRF---HSSLVEAGESCSALSGSFQRIVWDPTIAD 151 (533) Q Consensus 77 ~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~--n~f---~~~~~~~~~~~~~~G~~~~~~~~D~~~~~ 151 (533) ++..=--..|+.+|+-+.+=|..+.-++.. .....+..+|.. |.+ ..-....+...+..|.+|+.+..|..+. T Consensus 43 l~~~~v~~cv~~Ia~~iA~~p~~~~~~~~~-~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~- 120 (416) T protein:vir:45 43 IRHSDIFTAVMMIASDLARMPIRVTVNGQI-NYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGE- 120 (416) T ss_pred hcchHHHHHHHHHHHhhccCceEEecCccc-cccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc- Confidence 000000112344444444434444322211 112223334332 211 1223344455567899999888876543 Q ss_pred ceEEEEEcCCeEEEEEec-CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccc--eeehhhccccccc Q lcl|NC_016654. 152 NAWIDFVDADRAIPEFRW-GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGW--MMALTDHPATRDI 228 (533) Q Consensus 152 ~~~i~~v~~~~~~P~~~~-g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~--~v~l~~~~~~~~~ 228 (533) -+.+-.++|+++.++.+. |++. | ..+.......+. .++.+ T Consensus 121 ~~~L~~i~~~~v~v~~~~~g~~~----------------~-------------~~~~~~~~~~~~~~~~~~~-------- 163 (416) T protein:vir:45 121 PMNLTFRKTSEIELKSDARGRLY----------------Y-------------FHQRIDSNGNNIERNVKFE-------- 163 (416) T ss_pred EEEEEEEcCceeEEEECCCccEE----------------E-------------EEEEecCCCceeEEEEccc-------- Confidence 245667788888766432 3321 1 000000000000 00000 Q ss_pred cccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHH-HhCc-ceee Q lcl|NC_016654. 229 AVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDF-RIGA-GKVH 306 (533) Q Consensus 229 ~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~~-~~i~ 306 (533) -+.|+.... .. ...|.|.+..+. ..|+ ++.....+...+ +.+. ...+ T Consensus 164 --------------------evihir~~~----~d-----~~~G~s~i~~~~-~~i~-~~~~~~~~~~~~f~ng~~~~gi 212 (416) T protein:vir:45 164 --------------------DMLDIKFYS----LD-----GINGLSLLDTLS-RTIE-SDNNGKDFLNNFLRNGTHAGGI 212 (416) T ss_pred --------------------cEEEeccCC----CC-----CccccCHHHHHH-HHHH-HHHHHHHHHHHHHhccCCCcEE Confidence 012222110 11 235777766543 4443 344445555443 5432 3332 Q ss_pred e--chHHhcCCCCccccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChh Q lcl|NC_016654. 307 A--SESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPV 380 (533) Q Consensus 307 v--~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~ 380 (533) + +..+. +... .+.-...|.....+..++++ +....++.++......++++..+...++|+...|+||. T Consensus 213 l~~~~~~~----~~~~--~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~ 286 (416) T protein:vir:45 213 LKMKGVLD----NKKA--RDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLH 286 (416) T ss_pred EEeCCCCC----CHHH--HHHHHHHHHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHH Confidence 2 11110 0000 00001112111111111111 11223556666667778888888888999999999999 Q ss_pred hcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHH Q lcl|NC_016654. 381 SLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQ 460 (533) Q Consensus 381 ~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~ 460 (533) .+|.+.++. +.++... .|...|..++..+....+..+... .....+.++++.-...|..++++ T Consensus 287 ~lg~~~~~~-~~~~~~~--------------~~~~~l~P~~~~ie~~ln~~l~~~--~~~~~~~f~~~~l~~~D~~~~~~ 349 (416) T protein:vir:45 287 KFGIETANM-SITDANL--------------DYLSTLKPYITCVCAELNFKFNDE--YVNREFKFDTTEIRVVDEKTQAE 349 (416) T ss_pred HcCCCCCCc-cHHHHHH--------------HHHHHHHHHHHHHHHHHhhhcccc--ccCceEEEechhhhccCHHHHHH Confidence 998654432 2222211 122234444444433222222211 12345666667767789999999 Q ss_pred HHHHHHhCCCCCHHHHHHHhCCCCCHH-HHHHHHHHHHHhhhcccCccccccccCCCCCCC---CCCCCCCC Q lcl|NC_016654. 461 TVQAWSVASAASTKTKVAYLHEDWDDE-RVQEEADLIDNANTVSAPTFGFGTDQPPLPTEN---DPATDPEA 528 (533) Q Consensus 461 ~~~~l~~aGi~S~et~v~~l~~~~~de-e~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~ 528 (533) .+.+++.+|+|+..++.+++ ++..- .-.+.+-.+.......+. ....++.....+ -.+|+.++ T Consensus 350 ~~~~~~~~G~~T~NE~R~~~--gl~p~~~gd~~~~~~~~n~~~~~~---~~~~~~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:45 350 IDKINIDSGKMNIDEIRQRD--GLAPIPGGNGSIHRVDLNHVNIEL---VDEYQMNKSRATDKKLKGGEENE 416 (416) T ss_pred HHHHHHhCCCcCHHHHHHHh--CCCCCCCCCcceEeeccccccccc---ccccCcccccccccccCCCCCCC Confidence 99999999999999977664 22210 000000011000000000 000111111111 11222111 No 174 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=97.41 E-value=7.3e-05 Score=43.26 Aligned_cols=392 Identities=13% Similarity=0.048 Sum_probs=157.7 Q ss_pred CCCCC----CcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccce Q lcl|NC_016654. 1 MSLPE----ANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKR 76 (533) Q Consensus 1 ~~~~~----~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 76 (533) |.|=. ..+..|+-.+......+-.|. ...... +. ...- T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~~~~~~~---~~~~~~----~~-------------------------------~~~a 42 (416) T protein:vir:81 1 MGIFYKNEKRDLQYNEDDLQMMVQTLPGFQ---GTKLRQ----YK-------------------------------DIEA 42 (416) T ss_pred CCcccccccccccCCCcchhHHHHHhcccc---ccCccc----cc-------------------------------hhhh Confidence 55431 112333333222221111110 000000 00 0000 Q ss_pred eecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhh--ccH---HHHHHHHHHHHhhhCCEEEEEEEcCCCCC Q lcl|NC_016654. 77 YHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNT--PRF---HSSLVEAGESCSALSGSFQRIVWDPTIAD 151 (533) Q Consensus 77 ~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~--n~f---~~~~~~~~~~~~~~G~~~~~~~~D~~~~~ 151 (533) ++..=--..|+.+|+-+.+=|..+.-++.. .....+..+|.. |.+ ..-....+...+..|.+|+.+..|..+. T Consensus 43 l~~~~v~~cv~~Ia~~iA~~p~~~~~~~~~-~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~- 120 (416) T protein:vir:81 43 IRHSDIFTAVMMIASDLARMPIRVTVNGQI-NYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGE- 120 (416) T ss_pred hcchHHHHHHHHHHHhhccCceEEecCccc-cccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc- Confidence 000000112344444444434444322211 112223334332 211 1223344455567899999888876543 Q ss_pred ceEEEEEcCCeEEEEEec-CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccc--eeehhhccccccc Q lcl|NC_016654. 152 NAWIDFVDADRAIPEFRW-GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGW--MMALTDHPATRDI 228 (533) Q Consensus 152 ~~~i~~v~~~~~~P~~~~-g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~--~v~l~~~~~~~~~ 228 (533) -+.+-.++|+++.++.+. |++. | ..+.......+. .++.+ T Consensus 121 ~~~L~~i~~~~v~v~~~~~g~~~----------------~-------------~~~~~~~~~~~~~~~~~~~-------- 163 (416) T protein:vir:81 121 PMNLTFRKTSEIELKSDARGRLY----------------Y-------------FHQRIDSNGNNIERNVKFE-------- 163 (416) T ss_pred EEEEEEEcCceeEEEECCCccEE----------------E-------------EEEEecCCCceeEEEEccc-------- Confidence 245667788888766432 3321 1 000000000000 00000 Q ss_pred cccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHH-HhCc-ceee Q lcl|NC_016654. 229 AVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDF-RIGA-GKVH 306 (533) Q Consensus 229 ~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~~-~~i~ 306 (533) -+.|+.... .. ...|.|.+..+. ..|+ ++.....+...+ +.+. ...+ T Consensus 164 --------------------evihir~~~----~d-----~~~G~s~i~~~~-~~i~-~~~~~~~~~~~~f~ng~~~~gi 212 (416) T protein:vir:81 164 --------------------DMLDIKFYS----LD-----GINGLSLLDTLS-RTIE-SDNNGKDFLNNFLRNGTHAGGI 212 (416) T ss_pred --------------------cEEEeccCC----CC-----CccccCHHHHHH-HHHH-HHHHHHHHHHHHHhccCCCcEE Confidence 012222110 11 235777766543 4443 344445555443 5432 3332 Q ss_pred e--chHHhcCCCCccccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChh Q lcl|NC_016654. 307 A--SESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPV 380 (533) Q Consensus 307 v--~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~ 380 (533) + +..+. +... .+.-...|.....+..++++ +....++.++......++++..+...++|+...|+||. T Consensus 213 l~~~~~~~----~~~~--~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~ 286 (416) T protein:vir:81 213 LKMKGVLD----NKKA--RDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLH 286 (416) T ss_pred EEeCCCCC----CHHH--HHHHHHHHHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHH Confidence 2 11110 0000 00001112111111111111 11223556666667778888888888999999999999 Q ss_pred hcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHH Q lcl|NC_016654. 381 SLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQ 460 (533) Q Consensus 381 ~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~ 460 (533) .+|.+.++. +.++... .|...|..++..+....+..+... .....+.++++.-...|..++++ T Consensus 287 ~lg~~~~~~-~~~~~~~--------------~~~~~l~P~~~~ie~~ln~~l~~~--~~~~~~~f~~~~l~~~D~~~~~~ 349 (416) T protein:vir:81 287 KFGIETANM-SITDANL--------------DYLSTLKPYITCVCAELNFKFNDE--YVNREFKFDTTEIRVVDEKTQAE 349 (416) T ss_pred HcCCCCCCc-cHHHHHH--------------HHHHHHHHHHHHHHHHHhhhcccc--ccCceEEEechhhhccCHHHHHH Confidence 998654432 2222211 122234444444433222222211 12345666667767789999999 Q ss_pred HHHHHHhCCCCCHHHHHHHhCCCCCHH-HHHHHHHHHHHhhhcccCccccccccCCCCCCC---CCCCCCCC Q lcl|NC_016654. 461 TVQAWSVASAASTKTKVAYLHEDWDDE-RVQEEADLIDNANTVSAPTFGFGTDQPPLPTEN---DPATDPEA 528 (533) Q Consensus 461 ~~~~l~~aGi~S~et~v~~l~~~~~de-e~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~ 528 (533) .+.+++.+|+|+..++.+++ ++..- .-.+.+-.+.......+. ....++.....+ -.+|+.++ T Consensus 350 ~~~~~~~~G~~T~NE~R~~~--gl~p~~~gd~~~~~~~~n~~~~~~---~~~~~~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:81 350 IDKINIDSGKMNIDEIRQRD--GLAPIPGGNGSIHRVDLNHVNIEL---VDEYQMNKSRATDKKLKGGEENE 416 (416) T ss_pred HHHHHHhCCCcCHHHHHHHh--CCCCCCCCCcceEeeccccccccc---ccccCcccccccccccCCCCCCC Confidence 99999999999999977664 22210 000000011000000000 000111111111 11222111 No 175 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=97.40 E-value=7.5e-05 Score=43.20 Aligned_cols=390 Identities=13% Similarity=0.035 Sum_probs=155.7 Q ss_pred CCCC------------------------CCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHH Q lcl|NC_016654. 1 MSLP------------------------EANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTK 56 (533) Q Consensus 1 ~~~~------------------------~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~ 56 (533) |.|= .+-+.||- +.|+..+-..+..+..+..... T Consensus 1 Mgl~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g-------------~~~~~~~~~~~~~~~~~~~~~g--------- 58 (431) T protein:vir:10 1 MGLFDFIRREKQPEAQARPHVEPSFQASTPTTSIPG-------------ETFEGLDDPRLKEYIRRGELNG--------- 58 (431) T ss_pred CcchhhhhcCcccccccccccccccccccccccccc-------------cccccccchHHHHhhccCccCc--------- Confidence 3321 12222221 2333322222333322111000 Q ss_pred HHHHHHhcccCCCCCcccceeecChHHHHHHHHHHhhcCCCceEeeCCCc--hHHHHHHHHHHhh--ccH--H-HHHHHH Q lcl|NC_016654. 57 AAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKS--KEVQARADLIFNT--PRF--H-SSLVEA 129 (533) Q Consensus 57 ~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~--~~~~~~l~~i~~~--n~f--~-~~~~~~ 129 (533) .. . ....-+...--...++.+|+-+-+=|..+--.++. ......+..+|+. |.+ . ...... T Consensus 59 ---------~~-v--~~~~al~~~~V~~ci~~Ia~~iA~lp~~v~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l 126 (431) T protein:vir:10 59 ---------GT-G--RETRALRNMAVLRCVTLISGTIGMLPMNLISSDDSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLM 126 (431) T ss_pred ---------ce-e--chhhhhccHHHHHHHHHHHHhhccCceEEEEecCceeeeccchHHHHHhhccCCCCCHHHHHHHH Confidence 00 0 00111222223344455555554445443111111 1111223334332 111 1 223344 Q ss_pred HHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEecc Q lcl|NC_016654. 130 GESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGT 209 (533) Q Consensus 130 ~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~ 209 (533) +...+..|.+|+.+..|. + .-+.+-.+++.++.+..+.+ +.+.|.+.... T Consensus 127 ~~~lll~Gna~~~i~r~~-g-~~~~L~pl~~~~v~~~~~~~----------------------------~~~~y~~~~~~ 176 (431) T protein:vir:10 127 QLRALLDGESMARIVWSG-N-RPIRLIPMDRGSAKGRLTST----------------------------WQIVYDYTTPT 176 (431) T ss_pred HHHHhhcCCeEEEEEEcC-C-ceEEEEEEcCceeEEEEcCC----------------------------CeEEEEEEeCC Confidence 556667799999888873 2 23445455666665543321 11111111111 Q ss_pred CCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHH Q lcl|NC_016654. 210 ATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDR 289 (533) Q Consensus 210 ~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~ 289 (533) |..+.+. .--+.|+.+.. +. ...|.|.+..+ ...| .++. T Consensus 177 ----g~~~~~~--------------------------~~dViHir~~~----~d-----g~~G~spi~~~-~~~i-~~~~ 215 (431) T protein:vir:10 177 ----GDKIELP--------------------------AREVFHLRDLS----ID-----GVSGVSRVKLS-GNAL-ELAE 215 (431) T ss_pred ----ceEEEEc--------------------------hhhEEEecCcC----CC-----CcccccHHHHH-HHHH-HHHH Confidence 1111000 00122332211 11 23477766543 3444 3455 Q ss_pred HHHHHHHHH-HhCcceeeechHHhcCCCCccccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHH Q lcl|NC_016654. 290 IYSSLMRDF-RIGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGA 364 (533) Q Consensus 290 ~~s~~~~~~-~~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l 364 (533) ...++...+ +.|...-.| +...+.-.....+.-...+.....+..+.++ +....++.++......++++.. T Consensus 216 ~~~~~~~~~f~ng~~p~gi----l~~~~~ls~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~r 291 (431) T protein:vir:10 216 QAERAASRTFRTGVMAGGA----IEVPKELSDNAYGRMKASVQENHTGSENAGSWMLLEEGATAKQFSNTAASAQQIENR 291 (431) T ss_pred HHHHHHHHHHhccCCccEE----EecCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHH Confidence 555555544 543322111 2211110000011111112211111111111 1112355566666677888888 Q ss_pred HHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEE Q lcl|NC_016654. 365 ALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELE 444 (533) Q Consensus 365 ~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~ 444 (533) +...++|+...|++|..+|+..+...| .++.. ....++..|..++..+..-.+..+..........+. T Consensus 292 ~~~~~~Ia~~fgVPp~~lg~~~~~t~s--n~eq~----------~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~~~~~~~ 359 (431) T protein:vir:10 292 NHQIEEVARMYGVPRPLLMMDDTSWGS--GIEQL----------AIFFIQYGLSHWFVSWEQAAARAFLPEKMLGQRQFK 359 (431) T ss_pred HHhHHHHHHHhCCCHHHhCCCCCCccc--cHHHH----------HHHHHHHHHHHHHHHHHHHHHhhccChhhcCCceEE Confidence 888899999999999999875433222 22111 122333344444444332222222211111234566 Q ss_pred EEeCCCCCCCHHHHHHHHHHHHhCCC----CCHHHHHHHhC-CCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCC Q lcl|NC_016654. 445 LEWPKFARESDLAKAQTVQAWSVASA----ASTKTKVAYLH-EDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTE 519 (533) Q Consensus 445 i~f~d~i~~d~~e~a~~~~~l~~aGi----~S~et~v~~l~-~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~ 519 (533) ++++.-+..|..++++.+.+++.+|+ |+.-++.+.+. |-.++...+ ++ -.|.+..+ T Consensus 360 fd~~~llr~d~~~r~~~~~~~~~~G~~~g~lT~NE~R~~~gl~p~~~~~gD-~~------------------~~p~n~~~ 420 (431) T protein:vir:10 360 FNEGALLRGTLNDQAAFFSKALGAGGQSPWMKQNEVREMLDLPRADDPVAD-QL------------------RNPMTQKQ 420 (431) T ss_pred EechhhhccCHHHHHHHHHHHHhcccccCccCHHHHHHHhCCCCCCCcccc-ce------------------eccccccc Confidence 66777778899999999999998776 77777555431 111111111 00 00111111 Q ss_pred CCCCCCCCCCC Q lcl|NC_016654. 520 NDPATDPEAVD 530 (533) Q Consensus 520 ~~~~~~~~~~~ 530 (533) ..+..+++..- T Consensus 421 ~~~~~~~p~~~ 431 (431) T protein:vir:10 421 KGSGDEPPATT 431 (431) T ss_pred CCCCCCCCCCC Confidence 11111111111 No 176 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=97.30 E-value=0.0001 Score=42.50 Aligned_cols=405 Identities=11% Similarity=0.004 Sum_probs=163.6 Q ss_pred cCCHHHHHHHHhccCcchhhHH-HHHHHHHHHHHh---cccCCCCCcccceeecChHHHHHHHHHHhhcCCCceEeeCCC Q lcl|NC_016654. 30 EGDLDKLATFYGAEGRTSPSGI-KARTKAAYEAFH---GRTPTATGRAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGK 105 (533) Q Consensus 30 ~gd~~~l~~~y~~~~~~~~~~~-~~~~~~~~~~~~---~~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~ 105 (533) +|=.++|-...+.....+.... ...+......+| .......=....-+...--...++.+|+-+.+=|..+--.. T Consensus 1 MG~f~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~al~~~~v~~ci~~ia~~iA~lp~~~~~~~- 79 (422) T protein:vir:13 1 MGFLRGLFNKKNNNDEKRSNYDEDIGIDISDSNFWEKFGIKLNFSVRGKRALKENTVYVCTKIRAESIGKLSLKIYKDK- 79 (422) T ss_pred CchhhhhhhccCCccchhhhhhhccccccCcchhhhhccccCCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEecC- Confidence 4432222111010000000000 000000001111 11110000001112333334455666665555554442111 Q ss_pred chHHHHHHHHHHhh--ccH---HHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEE Q lcl|NC_016654. 106 SKEVQARADLIFNT--PRF---HSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSE 180 (533) Q Consensus 106 ~~~~~~~l~~i~~~--n~f---~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~ 180 (533) ++.....+.++|.. |.. ..-+...+...+..|.+|+.+..|..+. -+.+..++|+++.++.+.+..... T Consensus 80 ~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~-~~~L~~i~~~~v~~~~~~~~~~~~----- 153 (422) T protein:vir:13 80 EEYKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGK-IIGLYPINSDNVTKIIDDDNFLSS----- 153 (422) T ss_pred cccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc-EEEEEEECCcceEEEEcCCcceec----- Confidence 11111123333321 222 1344555566677899999998886543 356777888888887654332100 Q ss_pred EeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccc Q lcl|NC_016654. 181 LAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNP 260 (533) Q Consensus 181 ~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~ 260 (533) .+.+.|.+...+ |....+ .+--+.|+.... T Consensus 154 -----------------~~~~~y~~~~~~----g~~~~~--------------------------~~~eiih~~~~~--- 183 (422) T protein:vir:13 154 -----------------LSKVWYVVTDKN----GKEHKL--------------------------LPDEMLHFIGDI--- 183 (422) T ss_pred -----------------cceEEEEEEeCC----CeEEEE--------------------------cccceEEEcCCC--- Confidence 000111111000 111000 000112221100 Q ss_pred cccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhC-cceeeechHHhcCCCCccccccCcchhhhhhcccccc Q lcl|NC_016654. 261 EWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIG-AGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGF 338 (533) Q Consensus 261 ~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~-~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~ 338 (533) +....+|.|.+..+. ..|. +.....++... |+.| ...-++ ...+.-...........+.....+.. T Consensus 184 -----~~~~~~G~s~~~~~~-~~i~-~~~~~~~~~~~~f~ng~~p~gil-----~~~~~l~~e~~~~~~~~~~~~~~g~~ 251 (422) T protein:vir:13 184 -----TLDGLIGIKPLDYLR-CTIE-NGRATQEFINKFFKNGLSIKGIV-----QYVGDLDEKAKKIFKKEFESMSNGLE 251 (422) T ss_pred -----CCCCcccccHHHHHH-HHHH-HHHHHHHHHHHHHhccCCccEEE-----EeCCCCCHHHHHHHHHHHHHHhcCcc Confidence 001345788776543 4553 34444444444 3543 233222 11110000000011111211111111 Q ss_pred cccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc-hhHHHHHHHhhhHHHHHHHHHHHH Q lcl|NC_016654. 339 NANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA-QTATEASGKKDLTVKTTRAKARHF 413 (533) Q Consensus 339 ~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~-~Tatai~~~~~~l~~~~~~~~~~~ 413 (533) +.++ +....++.++......++++..+...++|+...|+||..+|...++. .+.++. ....+ T Consensus 252 n~~~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~e~~-------------~~~f~ 318 (422) T protein:vir:13 252 NAHSISLLPFGYQFQPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERATFNNLTEQ-------------QKDFY 318 (422) T ss_pred ccCCceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH-------------HHHHH Confidence 1111 11123455565556677888888888999999999999998654332 122221 22234 Q ss_pred HHHHHHHHHHHHHHHHhhccC-CCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHH Q lcl|NC_016654. 414 GSALGPLSTTCLRVDAIKFPG-KGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEE 492 (533) Q Consensus 414 ~~al~~li~~il~l~~~~~~~-~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~E 492 (533) +..|..+++.+..-.+..+.. ........+.++++.-.-.|..++++.+++++.+|+|+..++.+++ ++..-+ . T Consensus 319 ~~~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~--gl~p~~---g 393 (422) T protein:vir:13 319 VTTLQSSLTVYEQEIQDKLFSQYETLQDVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEANEARRRE--NLPPVE---G 393 (422) T ss_pred HHHHHHHHHHHHHHHHHhhCChhhhcCCceEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHh--CCCCCC---C Confidence 445555555443322222211 1111234455666666778999999999999999999999977654 232210 0 Q ss_pred HHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 493 ADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 493 l~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) -+++. ...+..|....++ ...+.++..|+ T Consensus 394 gD~~~-----------~~~n~~~l~~~~~-~~~~~g~~~g~ 422 (422) T protein:vir:13 394 GDRLL-----------VNGNMIPIEMAGE-QYKKGGEKGGK 422 (422) T ss_pred cCeee-----------eccCccchhhccc-ccccCCCcCCC Confidence 00000 0000001000000 00011112222 No 177 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=97.29 E-value=0.0001 Score=42.42 Aligned_cols=465 Identities=10% Similarity=-0.026 Sum_probs=172.4 Q ss_pred cCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHH Q lcl|NC_016654. 11 PPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLST 90 (533) Q Consensus 11 pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a 90 (533) ==..+...+..+...|..|...-.++.+|.. .+.+.......+.+..++.-+-+...++.+| T Consensus 1 m~~~~~~r~~~l~~~R~~~e~~w~e~~~y~l------------------P~~~~~~~~~~~~~~~~~~dst~~~a~~~La 62 (555) T protein:vir:17 1 MKHSAQAKYMMLRADREDYLDSGRQSARLTL------------------PYILTDEGHVQGGYLPTPWQSVGSKGVNVLA 62 (555) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhc------------------ccccCCCCCcccccccccccccHHHHHHHHH Confidence 0011334444444444444433333333221 1111111112223344566677888888888 Q ss_pred HhhcCC-----CceEeeCCCc---------hH----HH-------HHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEE Q lcl|NC_016654. 91 TELFSE-----QLKFLDAGKS---------KE----VQ-------ARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVW 145 (533) Q Consensus 91 ~ll~~e-----~~~i~~~~~~---------~~----~~-------~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~ 145 (533) +.|.+- .+.|.....+ .. ++ +.+...+..++|...+.++.+...+.|.+++ |. T Consensus 63 a~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--y~ 140 (555) T protein:vir:17 63 SKLMLSLFPVNTSFFKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALL--YQ 140 (555) T ss_pred HHHHHhhcCCCCcccccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEE--Ee Confidence 776553 2344443221 11 11 2333446668999999999999999999875 56 Q ss_pred cCCCCCceEEEEEcCCeEEEEEe-cCCceEEEEEEEEeecC-----Cce-----E---EEEEEEecCeeEEEEEEeccCC Q lcl|NC_016654. 146 DPTIADNAWIDFVDADRAIPEFR-WGRLVAVTFWSELAGGD-----GQE-----V---WRHLERHESGYIVHAVYKGTAT 211 (533) Q Consensus 146 D~~~~~~~~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~~~-----~~~-----~---y~~lE~h~~~~I~~~~y~~~~~ 211 (533) |++. +.+++-..++...+ +|++..|+.-.+++... ++. + ....+.+ ...+.+.++..+.. T Consensus 141 ~~~~-----~~~~pl~~y~v~~d~~G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~-~~~~~~~~~~~~~~ 214 (555) T protein:vir:17 141 GKKN-----LKLYPLDRFVVSRDGEGNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDG-PKMGVTAPGGRDKG 214 (555) T ss_pred cCCc-----eeEEEcCeEEEeeCCCcCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccc-hhhhhhhhcccccC Confidence 6542 45566566555444 47776665433332110 000 0 0000000 00000000000000 Q ss_pred cccceeehhhccccccccccccccCCceee-----cCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHH Q lcl|NC_016654. 212 SLGWMMALTDHPATRDIAVEGADEGRGAYV-----ETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHE 286 (533) Q Consensus 212 ~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~-----~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~ 286 (533) .- ..+.+..+..-..-...+....+...+ +.|.. -+.|++. +|... ..+.||+|-...++ +-+.. T Consensus 215 ~~-~~~~v~t~~~~~~~~~~~~~e~~~~~v~~~l~e~g~~--e~P~i~~-----Rw~~~-~ge~YGrgp~~~~l-~D~k~ 284 (555) T protein:vir:17 215 KS-NDALVYTYVCRKDGQVKWHQECDGKVIPGSNSSAPYT--HNPWIPL-----RFNIV-DGEAYGRGRVEEFM-GDLKS 284 (555) T ss_pred CC-cceeEeecccccCCeeEEEEecCceeccccccccCcc--cCCeeee-----eeeec-CCCccccchHHHHH-HHHHH Confidence 00 000000000000000000000000111 11111 1122222 23322 24778998777766 66778 Q ss_pred HHHHHHHHHHHH-HhCcceeeechH-HhcCCCCccccccCcch-hhhhhccccccccccccccceeeec-hhhh-hHHHH Q lcl|NC_016654. 287 LDRIYSSLMRDF-RIGAGKVHASES-VLTNLGMGQGVSLDEEQ-EVYSRVGSGGFNANGDMETIFEFFQ-PAIR-VLEHD 361 (533) Q Consensus 287 lD~~~s~~~~~~-~~~~~~i~v~~~-~l~~~~~~~~~~~d~~~-~~~~~~~~~~~~~~~~~~~~i~~~~-~~ir-~e~~~ 361 (533) |+..--...... ...++...|+++ .+.+.. +.+.. +.+.. . ..++-..+.... .++. ..+-+ T Consensus 285 L~~l~~~~l~~~~~~~~pp~lv~~~g~~~~~~------l~~~~~g~v~~-----g--~~~~v~~~~~~~~~~~~~~~~~i 351 (555) T protein:vir:17 285 LEALSQAMVEGSAASAKVVFMVSPSATTKPQN------LALAANGAIIQ-----G--RPDDVSVVQANKAADFRTVLEMI 351 (555) T ss_pred HHHHHHHHHHHHHHHhCCceeeccccccCcce------eecCCCceeec-----C--CcccceeeeccccchhhHHHHHH Confidence 887765555554 445666566443 222211 10000 11110 0 000000111111 1222 12233 Q ss_pred HHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhccCCC-CCC Q lcl|NC_016654. 362 QGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHF-GSALGPLSTTCLRVDAIKFPGKG-AAP 439 (533) Q Consensus 362 ~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~-~~al~~li~~il~l~~~~~~~~~-~~~ 439 (533) +.+..-++.+.... + ...+...|||||..+.+......+-.-..+ ...|.-+|..++++... .|.- ..+ T Consensus 352 ~~~~~~I~~aFm~~--~-----~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r--~g~lP~~p 422 (555) T protein:vir:17 352 QKLEQRISDAFLML--Q-----VRQSERTTATEVQATVQELNEQIGGIYSNLTTELLQPYLARKLHLLQK--QRKLPQLP 422 (555) T ss_pred HHHHHHHHHHHhhc--C-----CCCcccchHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHh--CCCCCCCC Confidence 33333344333221 1 123345699999988887776666544444 34555566666665432 1211 222 Q ss_pred ceeEEEEeCCCCCC-----CHHHHHHHHHHHHhCC-------CCCHHHHHHHh---CCC------CCHHHHHHHHHHHHH Q lcl|NC_016654. 440 SEELELEWPKFARE-----SDLAKAQTVQAWSVAS-------AASTKTKVAYL---HED------WDDERVQEEADLIDN 498 (533) Q Consensus 440 ~~~v~i~f~d~i~~-----d~~e~a~~~~~l~~aG-------i~S~et~v~~l---~~~------~~dee~~~El~rI~~ 498 (533) .+.+.++..-++.. +.....+.++.+.+.+ .+....+++.+ ++- -+++++++..+.-++ T Consensus 423 ~~~v~~~i~~~l~~l~r~~~~~~l~~~~~~laq~~~~p~~~d~id~d~~~~~~a~~~Gv~p~~ivrs~eev~~~rq~~~~ 502 (555) T protein:vir:17 423 KDLVQPTVVAGLWGVGRGQDKQQLMEFITTLAQTMGPEIAMKYINPTEFIKRLAAAQGIDTLQLINSPETMKQLGDQQKQ 502 (555) T ss_pred HhhhccceeehHHHHHHHHHHHHHHHHHHHHHhhcCchhHhhcCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHH Confidence 33233322212111 1111222222221111 13333444432 210 155555543322111 Q ss_pred hhhcc------cCcccccc-cc-----CCCCCCCCCCC---CCCCCCCCC Q lcl|NC_016654. 499 ANTVS------APTFGFGT-DQ-----PPLPTENDPAT---DPEAVDEGE 533 (533) Q Consensus 499 E~~~~------~~~~~~~~-~~-----~~~~~~~~~~~---~~~~~~d~~ 533 (533) ++... ...++... ++ .+.......+| ..-..++|- T Consensus 503 ~~~q~~~~~qa~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~~~~~~ 552 (555) T protein:vir:17 503 DMVQASLINQAGQLAKTPMAEQAMQLIQQQQEGAQDAGAAESETSSAEAQ 552 (555) T ss_pred HHHHHHHHHHHHHHHhhhhhhhHHhccccchhhhhHHHHHHhhcCCcccc Confidence 11000 00000000 00 00000000111 000011111 No 178 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=97.28 E-value=0.00011 Score=42.35 Aligned_cols=475 Identities=12% Similarity=0.017 Sum_probs=197.8 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHh---ccCcchhhHHHHHHHHHHHHHhcccCCCCCcc---- Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYG---AEGRTSPSGIKARTKAAYEAFHGRTPTATGRA---- 73 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~---- 73 (533) |++--. ++.....-+..-++|- .++...|+ ..+....+.|+..+. + .+.-..+..|.. T Consensus 1 m~~~~~-------~~~~~~~~~~~~~~~~----~~v~~~~~~~~~~r~~~~~~w~e~~~-y---i~~~~tr~t~~~~~~w 65 (599) T protein:vir:31 1 MSTDIK-------TLQKMLEGRDDDRAFI----DELVVLFTNMENARAQKDREDKELMD-Y---IDATDTRKTSNSKLPF 65 (599) T ss_pred CccchH-------HHHHHhhccCchHHHH----HHHHHHHHhhhhhhhhhhcccHHHHH-H---HhhhcccccccCCCCc Confidence 443211 1111111111222221 12222222 222233334433322 2 222112222222 Q ss_pred cceeecChHHHHHHHHHHhhcCC----CceEeeCC---C--chH----HHHHHHHHHhhccHHHHHHHHHHHHhhhCCEE Q lcl|NC_016654. 74 PKRYHAPIPGVIAKLSTTELFSE----QLKFLDAG---K--SKE----VQARADLIFNTPRFHSSLVEAGESCSALSGSF 140 (533) Q Consensus 74 ~~~~~~n~~k~i~~~~a~ll~~e----~~~i~~~~---~--~~~----~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~ 140 (533) ++++.+|-.-.|++.+..++|+- .-.+...+ + +.. .+.++++=+.+.+|...+...+-+-..+|-++ T Consensus 66 ~~s~t~~k~~~~~~~l~a~~~~~~fp~~~w~d~~~~~~~~~~~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G~~v 145 (599) T protein:vir:31 66 KNSTTINKLAHLHLMITTSYMEHLLPNRNWVDFVGFDNDSVNAEKREIARSYVRGKVEASNLEGVIERMVDDFAVRGFCV 145 (599) T ss_pred ccccchHHHHHHHHHHHHHHHhhhcCCccceEeeecCCchhHHHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccCcee Confidence 45566666666777777666553 22333221 1 112 23444555777789999888889999999888 Q ss_pred EEEEEcCCC-----------CCceEEEEEcCCeEEEEEecCCceEEEEEEEEee--c--------CCceEEEE------- Q lcl|NC_016654. 141 QRIVWDPTI-----------ADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAG--G--------DGQEVWRH------- 192 (533) Q Consensus 141 ~~~~~D~~~-----------~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~--~--------~~~~~y~~------- 192 (533) .++-+-... .-.|+++.++|..+||=-+-+.+..+.|+.+... . +...+|.+ T Consensus 146 at~~~er~~~~~~d~~v~~~~~~P~~ervsP~Di~~Dp~A~si~d~~fivRs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~ 225 (599) T protein:vir:31 146 AHTRHVKRMTVTAENQVIKNYSGTVTERLSPSDVFWDVTADSLPKAAKCIRQLYTLGSLKREIEEGTFPLMSMEDFQKLR 225 (599) T ss_pred EeeeEEEcceeecccccccccccceEEeecccceeeCCCCCCCCcceeeeehhhhHHHHHHHhccCCccccchHHHHHHH Confidence 876542211 1147888899988887555567777765543211 0 00011110 Q ss_pred -EEEecCeeEEE--EEEeccC----Ccccc------eeehhhcccccccccccccc---------CC-ceee-----cCC Q lcl|NC_016654. 193 -LERHESGYIVH--AVYKGTA----TSLGW------MMALTDHPATRDIAVEGADE---------GR-GAYV-----ETG 244 (533) Q Consensus 193 -lE~h~~~~I~~--~~y~~~~----~~lG~------~v~l~~~~~~~~~~~~~~~~---------~~-~~~~-----~~g 244 (533) -.+|....-.. .-+.+.+ +..|. +.+.+-+..|-.+.++-.+. -+ ...+ +.+ T Consensus 226 ~~~~~~~~~~~d~~~~~~g~D~~~~d~~~~~~eY~~~~~VevLeywGd~ydee~d~~~~~~ViTi~g~~~liR~e~np~~ 305 (599) T protein:vir:31 226 EERRTIREALADGYNGRRKFDSLHKKGYGSMMNYINEGVVEVLTFMGDFYDEENDELWNNYEITVIDRKIIGRKQSKDTW 305 (599) T ss_pred hhccCCCccccchhhhhhhccccccccccchhhhcccchhhhhhhhhhhhcccCCccccceEEEEecCcEEeecccCCCC Confidence 00110000000 0011111 00000 00111111111111110000 00 0000 011 Q ss_pred C---ccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCCcccc Q lcl|NC_016654. 245 V---KDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGMGQGV 321 (533) Q Consensus 245 ~---~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~~~ 321 (533) . +.+...|.|-. .+.+|..++... -++++.||.+.....+.++..=..+++....+.+.+ . T Consensus 306 ~g~~Pyvv~~~~P~~-----------~~~yG~G~l~~~-~gaQ~~lN~~~Ng~iD~~~~~l~p~l~~~~dl~~eD----~ 369 (599) T protein:vir:31 306 DGSQNLHIAVYEFQK-----------DTLCPIGPLHRL-TGMQYKLDKRENFREDLHDRFLHPSLKKVGDVREKG----M 369 (599) T ss_pred CCCCCeEEEEeeeec-----------cccCCCCCchhc-chHHHHHHHHHHHhhhhhhhhhcccccccccccccC----c Confidence 1 11222344432 256777788764 489999999987777665532221222111111110 1 Q ss_pred ccCcchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhh Q lcl|NC_016654. 322 SLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDL 401 (533) Q Consensus 322 ~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~ 401 (533) .+.+.. +|. .+. ...++.+.|..+.-+...-++.+...+...+|.|+..-|..+.|.+||+++...... T Consensus 370 ~~~P~~-v~~------~~d----~~~vq~~~p~s~~~~a~~~is~~e~~mee~sGvp~~~~G~~~ag~~TA~~is~l~na 438 (599) T protein:vir:31 370 RGGPNH-VFE------VEE----TGDVQYMTPPAEVLQPDNQLSITLQLMEDLSGAPKESIGQRTAGEKTKFEVQLLDQG 438 (599) T ss_pred cCCCCc-cee------ecC----CCccccccCchhhhhHHHHHHHHHHHHHHhhccchhhcCCcccchhhHHHHHHHHhh Confidence 111111 111 001 112344444333323333455555667778899999999888888999999999888 Q ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhccCCCC----CC----ceeEEEEeCC------CCC---CCHHHHHHHHH Q lcl|NC_016654. 402 TVKTTRAKARHFGSALGP-LSTTCLRVDAIKFPGKGA----AP----SEELELEWPK------FAR---ESDLAKAQTVQ 463 (533) Q Consensus 402 l~~~~~~~~~~~~~al~~-li~~il~l~~~~~~~~~~----~~----~~~v~i~f~d------~i~---~d~~e~a~~~~ 463 (533) .-....++.+.|...+-+ |++.++......+..... .+ ..-++|.=+| .++ .-..++.+..+ T Consensus 439 a~~~~~~~vr~~e~~~lepll~~l~e~~~~f~D~~~tiri~~~e~~~~~f~~i~redl~~~~~~v~~Ga~~v~ere~~~q 518 (599) T protein:vir:31 439 QNKVFRRKVKKFERELLTPVLNDYLEQGRNHLDASDTIKTFNSELGTATFLDITADDLNLNGQMVAQGATLFAEKANTLQ 518 (599) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeeecccccceeeEEeehhhhhCCeeeeechhhHHHHHHHHHH Confidence 888888888888887655 888777654422111000 00 0011221111 011 11234445444 Q ss_pred HHHh---C----CC---CCHHH---HH---HHhCC-C-----CCHHHHHHHHHHH----HHhhh-cccCccccccccCCC Q lcl|NC_016654. 464 AWSV---A----SA---ASTKT---KV---AYLHE-D-----WDDERVQEEADLI----DNANT-VSAPTFGFGTDQPPL 516 (533) Q Consensus 464 ~l~~---a----Gi---~S~et---~v---~~l~~-~-----~~dee~~~El~rI----~~E~~-~~~~~~~~~~~~~~~ 516 (533) .+.+ + ++ ||... ++ +.+|. . +--.|.+.++.-. ++++. +.+..+ .| .| T Consensus 519 ~l~~il~~~~~q~~~P~~~~k~l~~~l~~~~~l~~~~~~~~~va~~eqq~~~~m~Q~~lq~~~~~~~~~~~-~~---~~- 593 (599) T protein:vir:31 519 NLNAILGGPLGAALAPHMSRTKLFNAVEYLGDLDAYGIFTFGIGVQEDQQLARMAQKSTQQTEETALTQEE-VG---GP- 593 (599) T ss_pred HHHHHhcccCCCccchhhHHHHHHHHHHHHHhccccccCCCchhHHHHHHHHHHHHHHHHHhHhhhhhhhh-cC---CC- Confidence 4432 1 12 23211 11 11221 0 1111222232222 12211 111110 00 11 Q ss_pred CCCCCCCCC Q lcl|NC_016654. 517 PTENDPATD 525 (533) Q Consensus 517 ~~~~~~~~~ 525 (533) +-+.++ T Consensus 594 ---~~~~~~ 599 (599) T protein:vir:31 594 ---TTDTGQ 599 (599) T ss_pred ---CcccCC Confidence 122222 No 179 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=97.27 E-value=0.00011 Score=42.31 Aligned_cols=391 Identities=10% Similarity=0.023 Sum_probs=143.3 Q ss_pred hHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHH--HHHHHHHHhhcCCCceEeeC Q lcl|NC_016654. 26 HVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPG--VIAKLSTTELFSEQLKFLDA 103 (533) Q Consensus 26 ~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k--~i~~~~a~ll~~e~~~i~~~ 103 (533) =. |++.+..... ........++.+......-... .+..+. ..++.+|+-+-+=|....-. T Consensus 1 m~-----------~f~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~--Al~~~~V~~~i~~Ia~~iA~lp~~~~~~ 62 (406) T protein:vir:97 1 MS-----------FFQPLGTSKV-----SYDDYISSVLAGDVSQKYLGVS--ALKNSDILTATSIIAGDIARFPLVKKDV 62 (406) T ss_pred Cc-----------cccccCCCCC-----CcchHHHHHhcCCCCcccccch--hhccHHHHHHHHHHHHhhhhCeeEEEec Confidence 11 2211110000 0111222233322211111111 112211 13344444333324332221 Q ss_pred CCchHHHHHHHHHHh--hccHH---HHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEE Q lcl|NC_016654. 104 GKSKEVQARADLIFN--TPRFH---SSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFW 178 (533) Q Consensus 104 ~~~~~~~~~l~~i~~--~n~f~---~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~ 178 (533) +........+..+|. -|.++ .-....+...+..|.+|+.+..|...+.-..+-.++|+++.+..+.+. T Consensus 63 ~g~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~v~v~~~~~~------- 135 (406) T protein:vir:97 63 NGDIIHDEDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYRPSETTVEETDNH------- 135 (406) T ss_pred CccccccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEECCCeeEEEEcCCc------- Confidence 111111223444443 12221 333445555666798998887764322234666677777766543221 Q ss_pred EEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcc Q lcl|NC_016654. 179 SELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTP 258 (533) Q Consensus 179 ~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~ 258 (533) .+.|.+.... .|..+.+. .. -+.|+.... T Consensus 136 ---------------------~~~y~~~~~~---~~~~~~~~--------~~------------------evih~r~~~- 164 (406) T protein:vir:97 136 ---------------------EIVYTFTDML---TAKQVKCF--------AH------------------DVIHWKFFS- 164 (406) T ss_pred ---------------------eEEEEEEecC---CceEEEEc--------cc------------------cEEEecCCC- Confidence 1111110000 01111100 00 022332210 Q ss_pred cccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCc-ceeeechHHhcCCCCccccccCcchhhhhhcccc Q lcl|NC_016654. 259 NPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGA-GKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSG 336 (533) Q Consensus 259 ~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~-~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~ 336 (533) +. ...|.|.+..+ ...|. ++....++... |+.|. +.+++ ...+.-.....+...+.|.....+ T Consensus 165 ---~d-----g~~G~spi~~~-~~~i~-~~~a~~~~~~~~f~ng~~~~~i~-----~~~~~l~~e~~~~~~~~~~~~~~g 229 (406) T protein:vir:97 165 ---HD-----TILGRSPLLSL-GDEID-LQTGGINTLIKFFKDGFSSGILT-----MKGAQLSGDARQRARQEFEKMREG 229 (406) T ss_pred ---CC-----CcccccHHHHH-HHHHH-HHHHHHHHHHHHHhccCCCceEE-----ecCCCCCHHHHHHHHHHHHHHhcc Confidence 11 12477776643 34443 23333333333 34432 22222 111100000011111222222211 Q ss_pred cccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHH Q lcl|NC_016654. 337 GFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARH 412 (533) Q Consensus 337 ~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~ 412 (533) .+.++ +....++.++......++++..+...++|+...|+||..+|....+..++.. .... T Consensus 230 -~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~~~~e~~--------------~~~f 294 (406) T protein:vir:97 230 -SVGGSPLVFDSTMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVNSPNQSVAQL--------------MEDY 294 (406) T ss_pred -cccCceeecCCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCCCcchHHHH--------------HHHH Confidence 11111 1223455555556667788888888899999999999999854332211111 1122 Q ss_pred HHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhC-CCCCHHHHHH Q lcl|NC_016654. 413 FGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLH-EDWDDERVQE 491 (533) Q Consensus 413 ~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~-~~~~dee~~~ 491 (533) +..+|..+++.+..-.+..+....... ...|+|+ +..+..+.++.+.+++++|+|+..++...+. +...+...++ T Consensus 295 ~~~~l~P~~~~ie~~l~~kll~~~~~~--~~~i~fd--~~~~~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~~~~gD~ 370 (406) T protein:vir:97 295 VTNDLPFYFDAITSELGLKTLNDKDRR--LYHIEFD--TRSVTGRNVDEIVKLVNNQILTPNQGLVELGKQKSTDPNMDR 370 (406) T ss_pred HHHHHHHHHHHHHHHHhhhhcChhhcc--ceeEEEe--cCccchhhHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCe Confidence 334444444433322222222111111 2334554 2234556677778889999999999777651 1111110000 Q ss_pred HHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 492 EADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 492 El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ..--.. ..| .. ..+++.....+...|++....+++ T Consensus 371 ----~~~~~n-~~~-~~-~~~~~~~~~~~~~~gg~~~~~~~~ 405 (406) T protein:vir:97 371 ----YQSSLN-YVF-LD-KKEEYQDKVGIKGKGGEVNAEEDK 405 (406) T ss_pred ----EeeccC-ccc-hh-cccccccccccccCCCCCCCCCCC Confidence 000000 000 00 000111111111111111111111 No 180 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=97.27 E-value=0.00011 Score=42.28 Aligned_cols=403 Identities=11% Similarity=-0.019 Sum_probs=165.4 Q ss_pred CCCC--C-CcCCCcCcch-HHHHHHHHhhh----Hhhc-CCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCC Q lcl|NC_016654. 1 MSLP--E-ANTAWPPPEL-AAVTARVAESH----VWWE-GDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATG 71 (533) Q Consensus 1 ~~~~--~-~~~~~pp~~~-~~~~~~~~~~~----~w~~-gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 71 (533) |.-= + +|.|-+.+.. .+....+..-+ .|-. ..|.++...++.+... .. .+.++. T Consensus 1 m~~~i~~~~g~p~~~~~~~~~~~~~ia~~~~~~~~~~~~~~~~~~~~iLr~~~~~-~~--------~y~~m~-------- 63 (491) T protein:vir:10 1 MSKGLWVSPTEFVTFGEPDKSLSSQIATRARSIDFFALGMYLPNPDPVLKALGKD-IR--------VYRELR-------- 63 (491) T ss_pred CCCceeCCCCCccCcccCChHHHHHHHhhhcccccccccCCccchHHHHHhcCCC-HH--------HHHHHh-------- Confidence 4311 2 2223322221 11222222111 1211 2344444444322211 01 112210 Q ss_pred cccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCC Q lcl|NC_016654. 72 RAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIAD 151 (533) Q Consensus 72 ~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~ 151 (533) + ..-+. .++++...-+++.+-.|....+++...+.+.++++.-.|...+...+ .|..+|-.++-+.|...++ T Consensus 64 -~----D~~i~-s~l~~Rk~av~~~~w~i~~~~~~~~~~e~v~e~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g- 135 (491) T protein:vir:10 64 -A----DAHVG-GCVRRRKAAVKALEWGLDRGKAKSRVAKSIADVFADLDLSRIVTEML-DAVLYGYQPMEITWGKVGN- 135 (491) T ss_pred -h----ChHHH-HHHHHHHHHHhCCCcEEecCCCCHHHHHHHHHHHhcCCHHHHHHHHH-HhhhhcceeEEEEEeecCC- Confidence 0 11122 23344445566767666655555566788999998888888888775 6888999999888875432 Q ss_pred ceE---EEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCc-ccceeehhhcccccc Q lcl|NC_016654. 152 NAW---IDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATS-LGWMMALTDHPATRD 227 (533) Q Consensus 152 ~~~---i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~-lG~~v~l~~~~~~~~ 227 (533) .+. +.++++..|.+- .+++ .. |...++. -|.++ T Consensus 136 ~~~~~~l~~r~~~~f~~d-~~~~-----------------------------l~---~~~~~~~~~g~~l---------- 172 (491) T protein:vir:10 136 YIVPIDVVGKPADWFVYD-PENQ-----------------------------LR---FRSKDHWMQGEEL---------- 172 (491) T ss_pred eeEEEEeeeecccceeec-cCCc-----------------------------eE---EecCCCCCCccee---------- Confidence 222 233333222210 0011 00 1111110 01111 Q ss_pred ccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHH-hCcceee Q lcl|NC_016654. 228 IAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFR-IGAGKVH 306 (533) Q Consensus 228 ~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~-~~~~~i~ 306 (533) .+.-|+.|+.... ...|+|.+.+..+. ...---...+..|+.=++ .|.+.++ T Consensus 173 -----------------~~~k~i~~~~~~~---------~~~p~g~gLl~~~~-w~~~fK~~~~~~w~~f~E~yG~P~~i 225 (491) T protein:vir:10 173 -----------------PARKFLVPRQEAT---------YLNPYGFPDLSMCF-WPTTFKKGGLKFWVQFTEKYGSPMLV 225 (491) T ss_pred -----------------cCCCEEEEEecCC---------CCCcccchhHHHHH-HHHHHHHHHHHHHHHHHHHcCCCeEE Confidence 1122455553321 23567887777654 322233344444544343 3443333 Q ss_pred echHHhcCCCCccccccCcchhhhhhcccccccccc--ccccceeeechhh---hhHHHHHHHHHHHHHHHHhhCCChhh Q lcl|NC_016654. 307 ASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG--DMETIFEFFQPAI---RVLEHDQGAALLLREVLRKTGYSPVS 381 (533) Q Consensus 307 v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~--~~~~~i~~~~~~i---r~e~~~~~l~~~l~~i~~~~g~s~~~ 381 (533) . + +..+.. -+....+..++..-..++++ +....|+.++... ..+.|.+.++.+=++|+..+ ++ +| T Consensus 226 g-----k-y~~~a~--~~ek~~l~~al~~~~~~a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~i-LG-qt 295 (491) T protein:vir:10 226 G-----K-HPRSAS--DGEKNLLLDCLEDMVQDAVAVVPDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIAL-LG-QN 295 (491) T ss_pred E-----e-cCCCCC--HHHHHHHHHHHHHHhcCcEEEecCCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHH-hh-hh Confidence 2 1 111110 01111222222211111111 1122344444322 22345555555545555443 32 33 Q ss_pred cccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHH Q lcl|NC_016654. 382 LGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQT 461 (533) Q Consensus 382 ~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~ 461 (533) ++-+.+|..+..++-..-. ...++.-.+.+...|.++++-++.+ |.+.. ..+.+.|... ..+..+.++. T Consensus 296 lTt~~~gs~a~~~vh~~v~--~di~~~D~~~i~~tln~li~~l~~~------N~~~~--~~p~f~~~~~-~e~~~~~a~~ 364 (491) T protein:vir:10 296 QTTEATSTRASAQAGLEVT--DDIRDGDKAVVSEAMNMLIRWICDL------NFDGA--DRPVFDMWEQ-EQVDEIQAGR 364 (491) T ss_pred cccCcccchhHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHh------cCCCC--CcceEEecCc-CchhHHHHHH Confidence 4333333333334433322 2223334566777888888876655 22222 2456777654 3444678999 Q ss_pred HHHHHhCCC-CCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 462 VQAWSVASA-ASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 462 ~~~l~~aGi-~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +.+|+..|+ ++.+ ++++.++ ..+.+..++.. +.... ...+........+.++...|.. T Consensus 365 ~~~L~~~G~~i~~~-~i~e~~G-ip~~~~~~~~~----------~~~~~--~~~~~~~~~~~~~~~~~~~d~~ 423 (491) T protein:vir:10 365 DQKLTQAGARFTPA-YFKRAYN-LQDGDLDERPL----------PVSAV--DTVGAASFAEFEAPDQDALDAA 423 (491) T ss_pred HHHHHhCCCcCCHH-HHHHHhC-CCCCCcCcccc----------ccCCC--CCcccccccccCCCCCCchHHH Confidence 999999998 5544 5665554 43321111110 00000 0000000011111111111111 No 181 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=97.26 E-value=0.00011 Score=42.20 Aligned_cols=453 Identities=12% Similarity=0.055 Sum_probs=193.2 Q ss_pred CC------------CCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCC Q lcl|NC_016654. 1 MS------------LPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPT 68 (533) Q Consensus 1 ~~------------~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 68 (533) |+ -|...++=||-..+-+.. +... . |-| .|.+...-..+ . ..++..|+.+- T Consensus 1 m~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~-i~~~-~-~~~-------~~~~~e~~~~~-~-~eLI~~YR~ma----- 63 (533) T protein:vir:10 1 MSQLFGFSLERAKKAPKGPSFVQKDNLDGSQP-VSGG-G-YYG-------YTVDFDGQVRN-E-YQLISRYREMV----- 63 (533) T ss_pred CccccccccccccccccCCCCCCCCcccccce-eecc-c-ccc-------eeeecccccch-H-HHHHHHHHHHh----- Confidence 11 133344445554332111 0000 0 000 01110000000 1 11122222110 Q ss_pred CCCcccceeecChHHHHHHHHHHh-h----cCCCceEeeCCC--c----hHHHHHHHHHHhhccHHHHHHHHHHHHhhhC Q lcl|NC_016654. 69 ATGRAPKRYHAPIPGVIAKLSTTE-L----FSEQLKFLDAGK--S----KEVQARADLIFNTPRFHSSLVEAGESCSALS 137 (533) Q Consensus 69 ~~g~~~~~~~~n~~k~i~~~~a~l-l----~~e~~~i~~~~~--~----~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G 137 (533) ..+-+.-.++..++= + ..+|+++.++.. + +...+..+.|++-=+|++..++......+.| T Consensus 64 ---------~~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDg 134 (533) T protein:vir:10 64 ---------LQPECDSAVDDIVNETICGNFDDVPVSVELSNLKVSDKIKKLIREEFGEILRLLDFENRSYEIFRRWYVDG 134 (533) T ss_pred ---------hccchhhHHHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcc Confidence 111222223333321 1 112455555331 1 1234556667777789999999999999999 Q ss_pred CEEEEEEEcCC--CCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccC----C Q lcl|NC_016654. 138 GSFQRIVWDPT--IADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTA----T 211 (533) Q Consensus 138 ~~~~~~~~D~~--~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~----~ 211 (533) ..|+|..+|.+ ..+=..+.+++|.++-++..- ........+.++...+ -.++..+|.+|.-.. . T Consensus 135 Ri~fHkiid~~~pk~GI~ELr~lDPr~i~~vr~i---------~~~~~~~~~~~~~~~~-v~~~~~eyf~Ynp~g~~~~~ 204 (533) T protein:vir:10 135 RLFYHKVIDPDNPQGGLIELRYIDPRKIRKINET---------EQKRPEQLRGLPLNQQ-LSPKSAEYFLYDPKGLKNST 204 (533) T ss_pred eEEEEEEecCCCccccceeeeeccccceeeeeee---------eccCCCccceeecchh-hhccceeeeeeccccccccC Confidence 99999999854 223455677888877776310 0000000000011011 123334444553211 1 Q ss_pred cccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHH Q lcl|NC_016654. 212 SLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIY 291 (533) Q Consensus 212 ~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~ 291 (533) +-|..+|-+.+ .|+--.. -+.. .+.=.|-+-.||+++ ..| +++ T Consensus 205 ~~~vkI~~dAI----------------------------~y~hSGl--~d~~-----~~~i~syLhkAiKp~-NQL-km~ 247 (533) T protein:vir:10 205 TQGLKIAPDSI----------------------------CYVHSGI--MDLN-----KNMTLSHLHKAIKAV-NQL-RMI 247 (533) T ss_pred CCceecchhhe----------------------------eeeeccc--eeCC-----CCceeccchHhHHHH-Hhh-HHH Confidence 11222221111 1111000 0000 000112223344432 222 111 Q ss_pred HH---HHHHHHhCcceeee-c---------hHHhcC----------CCCccccccCcchhhhhhcc---ccccccccccc Q lcl|NC_016654. 292 SS---LMRDFRIGAGKVHA-S---------ESVLTN----------LGMGQGVSLDEEQEVYSRVG---SGGFNANGDME 345 (533) Q Consensus 292 s~---~~~~~~~~~~~i~v-~---------~~~l~~----------~~~~~~~~~d~~~~~~~~~~---~~~~~~~~~~~ 345 (533) .+ +.|.-|.-.+|||- . +.+++. +....|..-| ++.|..+. +.+--.|+ .. T Consensus 248 EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~d--drk~msMlEDyWLPRReGg-rg 324 (533) T protein:vir:10 248 EDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKD--DKKFMSMLEDFWLPRREGG-RG 324 (533) T ss_pred HhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecc--cchhhhhHhhhcccccCCC-Cc Confidence 11 22334566777763 1 111110 0111111111 11111100 01111122 22 Q ss_pred cceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCC-cchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 346 TIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDE-VAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTC 424 (533) Q Consensus 346 ~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~-~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~i 424 (533) ..|+++...-.. .-+..+..+.+.+....++|.+.++.+++ +..-++||...+-.--..+.+.+..|..-|.++++.- T Consensus 325 TEItTLpGgqnL-gem~DV~YF~kKLY~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~q 403 (533) T protein:vir:10 325 TEITTLPGGQNL-GELEDVKYFQKKLYKSLNVPGSRLETETTFNVGRAAEITRDEVKFQKFVARLRKRFSELFTDLLKTQ 403 (533) T ss_pred cceeeccccCCc-ChHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 234444332222 23556677788899999999888875543 2224567777776677778888888888888888877 Q ss_pred HHHHHhhccCCCCC--CceeEEEEeCCCCCCCHHHHH-------HHHHHH--HhCCCCCHHHHHHHhCCCCCHHHHHHHH Q lcl|NC_016654. 425 LRVDAIKFPGKGAA--PSEELELEWPKFARESDLAKA-------QTVQAW--SVASAASTKTKVAYLHEDWDDERVQEEA 493 (533) Q Consensus 425 l~l~~~~~~~~~~~--~~~~v~i~f~d~i~~d~~e~a-------~~~~~l--~~aGi~S~et~v~~l~~~~~dee~~~El 493 (533) |.|.... ..... ....+.++|...---.+...+ ..++++ .-+...|.++..++.. -.+|+|.++|. T Consensus 404 LiLKgii--t~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~IL-r~tDeei~~~~ 480 (533) T protein:vir:10 404 LVLKGVI--SIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVL-KQTDVEMKEID 480 (533) T ss_pred hhhccCC--CHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHh-ccCHHHHHHHH Confidence 6553211 00001 124577777644333333222 223322 1123569998777654 48999999999 Q ss_pred HHHHHhhhcc---cCcccc-ccccCCCCCCCCCCC--CCCCCCCCC Q lcl|NC_016654. 494 DLIDNANTVS---APTFGF-GTDQPPLPTENDPAT--DPEAVDEGE 533 (533) Q Consensus 494 ~rI~~E~~~~---~~~~~~-~~~~~~~~~~~~~~~--~~~~~~d~~ 533 (533) ++|++|.... +|.... ....++.++.+...+ ..|..++.+ T Consensus 481 kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (533) T protein:vir:10 481 KQIESEMESGIIADPAAEMDPAMAAGDPDAGGAPAEEVAPEGPDPS 526 (533) T ss_pred HHHHHHHhCCCCCCCcchhhHHhcCCCCCcCCcccccCCCCCCCcc Confidence 9999996421 111100 001111222222211 233333333 No 182 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=97.19 E-value=0.00014 Score=41.79 Aligned_cols=451 Identities=9% Similarity=-0.014 Sum_probs=174.0 Q ss_pred cchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHh Q lcl|NC_016654. 13 PELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTE 92 (533) Q Consensus 13 ~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~l 92 (533) =.+...+..+...|..|...-.++.+|..... ..+ .+.....+.+..++--.-+...++.+|+- T Consensus 1 m~~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~---------------~~~-~~~~~~~~~~~~~~~dstg~~a~~~LAa~ 64 (522) T protein:vir:10 1 MKARERYNQLTTARQMFLDKAVECSELTLPYL---------------IDD-DISSRPNHKSLTVPWQSVGAKCCVTLAAK 64 (522) T ss_pred CchHHHHHHHHHHhhHHHHHHHHHHHHhhhcc---------------cCC-CCCCCcccccccccccchHHHHHHHHHHH Confidence 22445666666666666554444444432110 000 00011112233445556677777777776 Q ss_pred hcCC-----CceEeeCCCc------------hHHHHH-------HHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCC Q lcl|NC_016654. 93 LFSE-----QLKFLDAGKS------------KEVQAR-------ADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPT 148 (533) Q Consensus 93 l~~e-----~~~i~~~~~~------------~~~~~~-------l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~ 148 (533) |.+- .+.|.....+ ..++++ +...+..++|...+.++.+...+.|.+.+ |.|++ T Consensus 65 l~~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--y~~~~ 142 (522) T protein:vir:10 65 LMLAVLPPQTSFFKLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALI--FMGKD 142 (522) T ss_pred HHHhhcCCCCccccccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeE--EEcCC Confidence 5543 2344332211 112333 33457888999999999999999999885 56764 Q ss_pred CCCceEEEEEcCCeEEEEEe-cCCceEEEEEEEEee--------cCC--ceEEEEEEEecCeeEEEEEEeccCCccccee Q lcl|NC_016654. 149 IADNAWIDFVDADRAIPEFR-WGRLVAVTFWSELAG--------GDG--QEVWRHLERHESGYIVHAVYKGTATSLGWMM 217 (533) Q Consensus 149 ~~~~~~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~--------~~~--~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v 217 (533) . +..++-..++..-+ +|++..+++-.+++. .+. ..+...-...+.-.|.+.+|...+. |.-. T Consensus 143 ~-----~~~~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~v~p~~~~--~~~~ 215 (522) T protein:vir:10 143 G-----LKTFPLTRYVINRDGDGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTYVKLDKSS--GRWV 215 (522) T ss_pred C-----ceEEEcceEEEeeCCCCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEEEEeeccC--CceE Confidence 2 45566666555444 477777665444321 000 0000000000111222223321110 0000 Q ss_pred ehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 218 ALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD 297 (533) Q Consensus 218 ~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~ 297 (533) ... ...+......+. +.|.. -+.|++. +|... ..+.||+|-...++ +-+..|+..--..... T Consensus 216 ~~~------~~~~~~~~~~~s---~~g~~--~~P~~~~-----Rw~~~-~ge~YGrgp~~~~l-~D~k~L~~l~~~~~~~ 277 (522) T protein:vir:10 216 WHQ------EAFDKIIPDSRS---TAPKN--ASPWLPL-----RFNTV-DGEDYGRGRVEEFL-GDLKSLDGLSQSLIEG 277 (522) T ss_pred EEE------ccCCcccccccc---ccccc--cCCceee-----eeeec-CCCccccchHHHHH-HHHHHHHHHHHHHHHH Confidence 000 000000000000 01111 0112221 23322 24678998777666 5567777665555444 Q ss_pred H-HhCcceeeechHHh-cCCCCccccccCcchhhhhhccccccccccccccceeee-chhhhh-HHHHHHHHHHHHHHHH Q lcl|NC_016654. 298 F-RIGAGKVHASESVL-TNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFF-QPAIRV-LEHDQGAALLLREVLR 373 (533) Q Consensus 298 ~-~~~~~~i~v~~~~l-~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~-~~~ir~-e~~~~~l~~~l~~i~~ 373 (533) . ...+....|+++.+ +...- .....+.+.. .. .+ ....++.- ..++.+ .+-++.+..-++.+.. T Consensus 278 ~~~a~~p~~lv~~~~~~~~~~l-----~~~~~~~~v~-----g~-~~-~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl 345 (522) T protein:vir:10 278 AAAASKVVFLVSPSSTTKPATI-----AKAGNGAIVQ-----GR-PE-DVAVIQVGKTADFSTAANMATAIEKRLLEAFL 345 (522) T ss_pred HHHhcCCceeeccccccccccc-----cCCCCcceec-----CC-Cc-cceeecccccccchHHHHHHHHHHHHHHHHHh Confidence 4 44566666644322 21110 0111111111 00 00 00111111 122321 1222223322332221 Q ss_pred hhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhccCCCCCCceeE----EEEeC Q lcl|NC_016654. 374 KTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHF-GSALGPLSTTCLRVDAIKFPGKGAAPSEEL----ELEWP 448 (533) Q Consensus 374 ~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~-~~al~~li~~il~l~~~~~~~~~~~~~~~v----~i~f~ 448 (533) + ..-..+...|||||..+.+...+..+-.-..+ ...|.-+|..++.+... .|.-+....++ .|++- T Consensus 346 ---~----~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r--~g~lP~~p~~~~~~~~v~~i 416 (522) T protein:vir:10 346 ---V----MNVRNAERVTAEEVRLTQLELEQQLGGIFSLLVIEFLIPYLNRTLLVLQR--SNQIPKLPKDIVRPTIVAGV 416 (522) T ss_pred ---h----ccCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh--cCCCCCCCccccccccccch Confidence 1 11122345699999988877666555533222 33444555555555331 12111111111 12222 Q ss_pred CCCCCCHHHHHHHHHHHH-------h-CC------CCCHHHHHHHh---CC----C--CCHHHHHHHHHHHHHhhh--cc Q lcl|NC_016654. 449 KFARESDLAKAQTVQAWS-------V-AS------AASTKTKVAYL---HE----D--WDDERVQEEADLIDNANT--VS 503 (533) Q Consensus 449 d~i~~d~~e~a~~~~~l~-------~-aG------i~S~et~v~~l---~~----~--~~dee~~~El~rI~~E~~--~~ 503 (533) +..++++.++++. + +| .+-...+++.+ .+ . -++++++++.++-++.+. +. T Consensus 417 -----s~Laraq~~~~l~~~~~~i~~~~~p~~~~~~id~d~~~~~~a~~~Gvp~~~ivrt~eev~~~~q~~q~~~~~~~~ 491 (522) T protein:vir:10 417 -----NALGRGQDRESLTAFVGTIAQTLGPEALMQYLNPLEAIKRLAAAQGIDVLNLVKTEQQLAEEQQAAQQQAAQQSL 491 (522) T ss_pred -----hHHHHHHHHHHHHHHHHHHHHhhCchhhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHH Confidence 2222222222211 0 11 12222333322 21 0 155555443332222211 11 Q ss_pred cCcccc-ccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 504 APTFGF-GTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 504 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ....++ ++.....+.++ +++-++....+. T Consensus 492 ~~~a~~~~~~~~~~~~~~-~~~~~~~~~~~~ 521 (522) T protein:vir:10 492 VDQAGQMTGSPLMDPTKN-PQLMDEEQPPME 521 (522) T ss_pred HHHHHHHhcccccCcccc-HHHHHHhCCCCC Confidence 111111 11111111111 112222222222 No 183 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=97.14 E-value=0.00016 Score=41.44 Aligned_cols=392 Identities=14% Similarity=0.058 Sum_probs=156.0 Q ss_pred CCC---C-CCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccce Q lcl|NC_016654. 1 MSL---P-EANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKR 76 (533) Q Consensus 1 ~~~---~-~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 76 (533) |+| + ..++.+|+...+.....+..+. ...... +. . . T Consensus 26 ~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~---~~~~~~----~~-------------------------------~--~ 65 (441) T protein:vir:98 26 VGIFYKNEKRDLQYNEDDLQMMVQTLPGFQ---GTKLRQ----YK-------------------------------D--I 65 (441) T ss_pred cccccccccccccCCCcchHHHHHHhhccc---ccCccc----cc-------------------------------h--h Confidence 332 1 1222333333222211111110 000000 00 0 0 Q ss_pred eecChH--HHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhh--ccH---HHHHHHHHHHHhhhCCEEEEEEEcCCC Q lcl|NC_016654. 77 YHAPIP--GVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNT--PRF---HSSLVEAGESCSALSGSFQRIVWDPTI 149 (533) Q Consensus 77 ~~~n~~--k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~--n~f---~~~~~~~~~~~~~~G~~~~~~~~D~~~ 149 (533) -.+..+ -..|+.+|+-+-+=|..+.-.+.. .....+..+|.. |.+ ..-+...+...+..|.+|+.+..|..+ T Consensus 66 ~al~~~~V~acv~~Ia~~iA~lpl~~~~~~~~-~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G 144 (441) T protein:vir:98 66 EAIRHSDIFTAVMMIASDLARMPIRVTVNGQI-NYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTG 144 (441) T ss_pred hhhccHHHHHHHHHHHHhhccCceEEecCCcc-cccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCC Confidence 001111 112344444433334443322111 112223333321 211 122344455567779999998888654 Q ss_pred CCceEEEEEcCCeEEEEEe-cCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccc Q lcl|NC_016654. 150 ADNAWIDFVDADRAIPEFR-WGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDI 228 (533) Q Consensus 150 ~~~~~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~ 228 (533) . -+.+-.++|+.+.+..+ +|++. |. .+..+....+....+ T Consensus 145 ~-~~~L~~i~~~~v~v~~~~~g~~~----------------~~-------------~~~~~~~~~~~~~~~--------- 185 (441) T protein:vir:98 145 E-PMNLTFRKTSEIELKLDARGRLY----------------YF-------------HQRIDSNGNNIERNV--------- 185 (441) T ss_pred c-EEEEEEEcCceeEEEECCCCcEE----------------EE-------------EEEeccCcceeeEEE--------- Confidence 3 35677788888887654 23321 10 000000000000000 Q ss_pred cccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHH-HhCc-ceee Q lcl|NC_016654. 229 AVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDF-RIGA-GKVH 306 (533) Q Consensus 229 ~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~~-~~i~ 306 (533) ++--+.|+.... +. ...|.|.+..+. ..| .+.....++...+ +.|. ...+ T Consensus 186 -----------------~~~dviHir~~~----~d-----g~~G~spi~~~~-~~i-~~~~a~~~~~~~~f~ng~~~~gi 237 (441) T protein:vir:98 186 -----------------KFEDMLDIKFYS----LD-----GINGLSLLDTLS-RTI-ESDNNGKDFLNNFLRNGTHAGGI 237 (441) T ss_pred -----------------ccccEEEeccCC----CC-----CccccCHHHHHH-HHH-HHHHHHHHHHHHHHhccCCCcEE Confidence 000012222110 11 224677665433 444 3445555555543 5543 2222 Q ss_pred e--chHHhcCCCCccccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChh Q lcl|NC_016654. 307 A--SESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPV 380 (533) Q Consensus 307 v--~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~ 380 (533) + +..+. +... .+.....|.....+..++++ +....++.++......++++..+...++|+...|+||. T Consensus 238 l~~~~~~~----~~e~--~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~ 311 (441) T protein:vir:98 238 LKMKGVLD----NKKA--RDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLH 311 (441) T ss_pred EEeCCCCC----CHHH--HHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHH Confidence 2 11110 0000 00001112211111111111 12234566666667778889888889999999999999 Q ss_pred hcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHH Q lcl|NC_016654. 381 SLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQ 460 (533) Q Consensus 381 ~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~ 460 (533) .+|.+..+. +.++.... |...|..++..+..-.+..+... .....+.++.+.-.-.|..++++ T Consensus 312 ~lg~~~~~~-s~~q~~~~--------------y~~tl~P~~~~ie~~ln~~L~~~--~~~~~~~fd~~~llr~d~~~~~~ 374 (441) T protein:vir:98 312 KFGIETANM-SITDANLD--------------YLSTLKPYITCVCAELNFKFNDE--YVNREFKFDTTEIRVVDEKTQAE 374 (441) T ss_pred HcCCCCCCc-cHHHHHHH--------------HHHHHHHHHHHHHHHHHhhcccc--ccCceEEEechhhhccCHHHHHH Confidence 998654432 22222111 11233333333322222222211 12345566666667789999999 Q ss_pred HHHHHHhCCCCCHHHHHHHhCCCCCHH-HHHHHHHHHHHhhhcccCccccccccCCCCCC---CCCCCCCCCCCCCC Q lcl|NC_016654. 461 TVQAWSVASAASTKTKVAYLHEDWDDE-RVQEEADLIDNANTVSAPTFGFGTDQPPLPTE---NDPATDPEAVDEGE 533 (533) Q Consensus 461 ~~~~l~~aGi~S~et~v~~l~~~~~de-e~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~d~~ 533 (533) .+.+++.+|+|+..++.+.+ ++..- .-.+-+-.+.. + ..|.-..+..+.....+ ...+|+ ++| T Consensus 375 ~~~~~~~~G~~T~NE~R~~~--gl~pi~gGd~~~~~~~~-n--~~~~~~~~~~q~~~~~~~~~~~kgGe-----~ne 441 (441) T protein:vir:98 375 IDKINIDSGKMNIDEIRQRD--GLAPIPGGNGSIHRVDL-N--HVNIELVDEYQMNKSRATDKKLKGGE-----ENE 441 (441) T ss_pred HHHHHHhCCCcCHHHHHHHh--CCCCCCCCCcceEeecc-c--ccccccccccccccccccccccCCCC-----CCC Confidence 99999999999999976654 22210 00000000000 0 00000000011111111 111111 111 No 184 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=97.11 E-value=0.00017 Score=41.30 Aligned_cols=396 Identities=14% Similarity=0.048 Sum_probs=157.1 Q ss_pred CCC---C-CCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccce Q lcl|NC_016654. 1 MSL---P-EANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKR 76 (533) Q Consensus 1 ~~~---~-~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 76 (533) |+| + ..++.+|+..++.....+..+ -..... .|... T Consensus 26 ~~lf~~~e~R~~~~~~~~~~~~~~~~~~~---~~~~~~----~~~~~--------------------------------- 65 (441) T protein:vir:94 26 VGIFYKNEKRDLQYNEDDLQMMVQTLPGF---QGTKLR----QYKDI--------------------------------- 65 (441) T ss_pred cccccccccccccCCCcchHHHHHHhccc---Cccccc----ccchh--------------------------------- Confidence 444 2 233344444433222211111 000000 00000 Q ss_pred eecChH--HHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHh--hccH---HHHHHHHHHHHhhhCCEEEEEEEcCCC Q lcl|NC_016654. 77 YHAPIP--GVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFN--TPRF---HSSLVEAGESCSALSGSFQRIVWDPTI 149 (533) Q Consensus 77 ~~~n~~--k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~--~n~f---~~~~~~~~~~~~~~G~~~~~~~~D~~~ 149 (533) -.+..+ -..|+.+|+-+-+=|..+.-.... .....+..+|. -|.+ .......+...+..|.+|+.+..|..| T Consensus 66 ~al~~~~V~~cv~~Ia~~iA~lp~~~~~~~~~-~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G 144 (441) T protein:vir:94 66 EAIRHSDIFTAVMMIASDLARMPIRVTVNGQI-NYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTG 144 (441) T ss_pred hhhccHHHHHHHHHHHHhhccCceeeecCccc-cccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC Confidence 001111 112333333333334333221111 11122333332 1211 122334445567779999998887654 Q ss_pred CCceEEEEEcCCeEEEEEec-CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccc Q lcl|NC_016654. 150 ADNAWIDFVDADRAIPEFRW-GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDI 228 (533) Q Consensus 150 ~~~~~i~~v~~~~~~P~~~~-g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~ 228 (533) . -+.+-.++|+++.++.+. |++. | ..+.......+....+ T Consensus 145 ~-~~~L~~i~~~~v~v~~d~~g~~~----------------~-------------~~~~~~~~~~~~~~~~--------- 185 (441) T protein:vir:94 145 E-PMNLTFRKTSEIELKSDARGRLY----------------Y-------------FHQRIDSNGNNIERNV--------- 185 (441) T ss_pred c-EEEEEEEcCceeEEEECCCccEE----------------E-------------EEEEeccCCceeEEEE--------- Confidence 3 245777888888776543 3321 1 0010000000000000 Q ss_pred cccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHH-HhCc-ceee Q lcl|NC_016654. 229 AVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDF-RIGA-GKVH 306 (533) Q Consensus 229 ~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~~-~~i~ 306 (533) ..--+.|+.... .. ...|.|.+..+. ..| .++.....+...+ +.|. ...+ T Consensus 186 -----------------~~~dvih~k~~~----~d-----g~~G~spl~~~~-~~i-~~~~~~~~~~~~~f~ng~~p~gi 237 (441) T protein:vir:94 186 -----------------KFEDMLDIKFYS----LD-----GINGLSLLDTLS-RTI-ESDNNGKDFLNNFLRNGTHAGGI 237 (441) T ss_pred -----------------ccccEEEeccCC----CC-----CccccCHHHHHH-HHH-HHHHHHHHHHHHHHhccCCCcEE Confidence 000012222110 11 235777766543 444 3455555565554 5442 3333 Q ss_pred echHHhcCCCCcc-ccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhh Q lcl|NC_016654. 307 ASESVLTNLGMGQ-GVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVS 381 (533) Q Consensus 307 v~~~~l~~~~~~~-~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~ 381 (533) + ...+... ....+.-...|.....+..++++ +....++.++......++++..+...++|+...|+||.. T Consensus 238 l-----~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~ 312 (441) T protein:vir:94 238 L-----KMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHK 312 (441) T ss_pred E-----EcCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHH Confidence 2 1111100 00000001112221111111111 112235666666677788888888899999999999999 Q ss_pred cccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHH Q lcl|NC_016654. 382 LGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQT 461 (533) Q Consensus 382 ~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~ 461 (533) +|.+.++. +.++.... |...|..++..+..-.+..+... .....+.++++.-.-.|..++++. T Consensus 313 lg~~~~~~-s~~q~~~~--------------~~~tl~P~~~~ie~eln~kl~~~--~~~~~~~fd~~~llr~D~~~~~~~ 375 (441) T protein:vir:94 313 FGIETANM-SITDANLD--------------YLSTLKPYITCVCAELNFKFNDE--YVNREFKFDTTEIRVVDEKTQAEI 375 (441) T ss_pred cCCCCCCc-cHHHHHHH--------------HHHHHHHHHHHHHHHHhhhcccc--ccCceEEeechhhhccCHHHHHHH Confidence 98654432 22222111 11233333333322212222111 123456666666677899999999 Q ss_pred HHHHHhCCCCCHHHHHHHhCCCCCHH-HHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 462 VQAWSVASAASTKTKVAYLHEDWDDE-RVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 462 ~~~l~~aGi~S~et~v~~l~~~~~de-e~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +.+++.+|+|+..++.+.+ ++..- .-.+.+-.+.. ...|.-..+.-+.....+.+ ....+.+++| T Consensus 376 ~~~~i~~G~~T~NE~R~~~--gl~Pi~ggd~~~~~~~~---n~~~~~~~~~~~~~~~~~~~--~~~kgGe~~e 441 (441) T protein:vir:94 376 DKINIDSGKMNIDEIRQRD--GLAPIPGGNGSIHRVDL---NHVNIELVDEYQMNKSRATD--KKLKGGEENE 441 (441) T ss_pred HHHHHhCCCcCHHHHHHHh--CCCCCCCCCcceEeecc---cccccccccccccccccccc--cccCCCCCCC Confidence 9999999999999976654 22210 00000000000 00000000000111111000 0011111122 No 185 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=97.11 E-value=0.00017 Score=41.30 Aligned_cols=396 Identities=14% Similarity=0.048 Sum_probs=157.1 Q ss_pred CCC---C-CCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccce Q lcl|NC_016654. 1 MSL---P-EANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKR 76 (533) Q Consensus 1 ~~~---~-~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 76 (533) |+| + ..++.+|+..++.....+..+ -..... .|... T Consensus 26 ~~lf~~~e~R~~~~~~~~~~~~~~~~~~~---~~~~~~----~~~~~--------------------------------- 65 (441) T protein:vir:79 26 VGIFYKNEKRDLQYNEDDLQMMVQTLPGF---QGTKLR----QYKDI--------------------------------- 65 (441) T ss_pred cccccccccccccCCCcchHHHHHHhccc---Cccccc----ccchh--------------------------------- Confidence 444 2 233344444433222211111 000000 00000 Q ss_pred eecChH--HHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHh--hccH---HHHHHHHHHHHhhhCCEEEEEEEcCCC Q lcl|NC_016654. 77 YHAPIP--GVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFN--TPRF---HSSLVEAGESCSALSGSFQRIVWDPTI 149 (533) Q Consensus 77 ~~~n~~--k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~--~n~f---~~~~~~~~~~~~~~G~~~~~~~~D~~~ 149 (533) -.+..+ -..|+.+|+-+-+=|..+.-.... .....+..+|. -|.+ .......+...+..|.+|+.+..|..| T Consensus 66 ~al~~~~V~~cv~~Ia~~iA~lp~~~~~~~~~-~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G 144 (441) T protein:vir:79 66 EAIRHSDIFTAVMMIASDLARMPIRVTVNGQI-NYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTG 144 (441) T ss_pred hhhccHHHHHHHHHHHHhhccCceeeecCccc-cccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC Confidence 001111 112333333333334333221111 11122333332 1211 122334445567779999998887654 Q ss_pred CCceEEEEEcCCeEEEEEec-CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccc Q lcl|NC_016654. 150 ADNAWIDFVDADRAIPEFRW-GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDI 228 (533) Q Consensus 150 ~~~~~i~~v~~~~~~P~~~~-g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~ 228 (533) . -+.+-.++|+++.++.+. |++. | ..+.......+....+ T Consensus 145 ~-~~~L~~i~~~~v~v~~d~~g~~~----------------~-------------~~~~~~~~~~~~~~~~--------- 185 (441) T protein:vir:79 145 E-PMNLTFRKTSEIELKSDARGRLY----------------Y-------------FHQRIDSNGNNIERNV--------- 185 (441) T ss_pred c-EEEEEEEcCceeEEEECCCccEE----------------E-------------EEEEeccCCceeEEEE--------- Confidence 3 245777888888776543 3321 1 0010000000000000 Q ss_pred cccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHH-HhCc-ceee Q lcl|NC_016654. 229 AVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDF-RIGA-GKVH 306 (533) Q Consensus 229 ~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~~-~~i~ 306 (533) ..--+.|+.... .. ...|.|.+..+. ..| .++.....+...+ +.|. ...+ T Consensus 186 -----------------~~~dvih~k~~~----~d-----g~~G~spl~~~~-~~i-~~~~~~~~~~~~~f~ng~~p~gi 237 (441) T protein:vir:79 186 -----------------KFEDMLDIKFYS----LD-----GINGLSLLDTLS-RTI-ESDNNGKDFLNNFLRNGTHAGGI 237 (441) T ss_pred -----------------ccccEEEeccCC----CC-----CccccCHHHHHH-HHH-HHHHHHHHHHHHHHhccCCCcEE Confidence 000012222110 11 235777766543 444 3455555565554 5442 3333 Q ss_pred echHHhcCCCCcc-ccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhh Q lcl|NC_016654. 307 ASESVLTNLGMGQ-GVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVS 381 (533) Q Consensus 307 v~~~~l~~~~~~~-~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~ 381 (533) + ...+... ....+.-...|.....+..++++ +....++.++......++++..+...++|+...|+||.. T Consensus 238 l-----~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~ 312 (441) T protein:vir:79 238 L-----KMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHK 312 (441) T ss_pred E-----EcCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHH Confidence 2 1111100 00000001112221111111111 112235666666677788888888899999999999999 Q ss_pred cccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHH Q lcl|NC_016654. 382 LGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQT 461 (533) Q Consensus 382 ~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~ 461 (533) +|.+.++. +.++.... |...|..++..+..-.+..+... .....+.++++.-.-.|..++++. T Consensus 313 lg~~~~~~-s~~q~~~~--------------~~~tl~P~~~~ie~eln~kl~~~--~~~~~~~fd~~~llr~D~~~~~~~ 375 (441) T protein:vir:79 313 FGIETANM-SITDANLD--------------YLSTLKPYITCVCAELNFKFNDE--YVNREFKFDTTEIRVVDEKTQAEI 375 (441) T ss_pred cCCCCCCc-cHHHHHHH--------------HHHHHHHHHHHHHHHHhhhcccc--ccCceEEeechhhhccCHHHHHHH Confidence 98654432 22222111 11233333333322212222111 123456666666677899999999 Q ss_pred HHHHHhCCCCCHHHHHHHhCCCCCHH-HHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 462 VQAWSVASAASTKTKVAYLHEDWDDE-RVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 462 ~~~l~~aGi~S~et~v~~l~~~~~de-e~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +.+++.+|+|+..++.+.+ ++..- .-.+.+-.+.. ...|.-..+.-+.....+.+ ....+.+++| T Consensus 376 ~~~~i~~G~~T~NE~R~~~--gl~Pi~ggd~~~~~~~~---n~~~~~~~~~~~~~~~~~~~--~~~kgGe~~e 441 (441) T protein:vir:79 376 DKINIDSGKMNIDEIRQRD--GLAPIPGGNGSIHRVDL---NHVNIELVDEYQMNKSRATD--KKLKGGEENE 441 (441) T ss_pred HHHHHhCCCcCHHHHHHHh--CCCCCCCCCcceEeecc---cccccccccccccccccccc--cccCCCCCCC Confidence 9999999999999976654 22210 00000000000 00000000000111111000 0011111122 No 186 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=97.11 E-value=0.00017 Score=41.29 Aligned_cols=450 Identities=10% Similarity=0.010 Sum_probs=168.7 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |-|==... ...+...+..++.-|..|...-.++.+|... ..+ .....+....++--. T Consensus 1 ~~~~~~~e---~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP------------------~~~--~~~~~~~~~~~~~ds 57 (517) T protein:vir:10 1 MDMRFAGN---KSKIPKLYEQLVGKRSPFLSRAENYSRFTLP------------------YLM--ADVNDDLSSQNAWQD 57 (517) T ss_pred Cccccccc---HHHHHHHHHHHHHhhhHHHHHHHHHHHHhcc------------------ccc--cCCCCCccccccccc Confidence 33320000 0133333444444444443333333332211 000 001112223345556 Q ss_pred hHHHHHHHHHHhhcCC-----CceEeeCCCc-------------hHHHH-------HHHHHHhhccHHHHHHHHHHHHhh Q lcl|NC_016654. 81 IPGVIAKLSTTELFSE-----QLKFLDAGKS-------------KEVQA-------RADLIFNTPRFHSSLVEAGESCSA 135 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e-----~~~i~~~~~~-------------~~~~~-------~l~~i~~~n~f~~~~~~~~~~~~~ 135 (533) -+...++.+|+-|.+- .+.|.....+ +.+++ .+...+..++|...+.++.....+ T Consensus 58 tg~~a~~~LAa~l~~~ltpp~~~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~ 137 (517) T protein:vir:10 58 DGASATNFLSNKLSQVLFPAQRSFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIV 137 (517) T ss_pred hHHHHHHHHHHHHHHhhcCCCCccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHh Confidence 6777777777765543 2344433221 12333 334467788999999999999999 Q ss_pred hCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEe-cCCceEEEEEEEEeecC-----Cce-----EEEEEEEecCeeEEEE Q lcl|NC_016654. 136 LSGSFQRIVWDPTIADNAWIDFVDADRAIPEFR-WGRLVAVTFWSELAGGD-----GQE-----VWRHLERHESGYIVHA 204 (533) Q Consensus 136 ~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~~~-----~~~-----~y~~lE~h~~~~I~~~ 204 (533) .|.+.+ |.++.. ..|..++-..++-.-+ +|++..+++-.+++... ++. ......-...-.|.+. T Consensus 138 ~G~a~l--y~~~~~---~~~~~~pl~~y~v~~d~~G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~ 212 (517) T protein:vir:10 138 TGNVMM--YHPDKT---SPIQAVPLHHYCVRRDNNGTVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTH 212 (517) T ss_pred HCeEEE--EEeCCC---CcEEEEEcCeEEEeeCCCcCeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEE Confidence 999765 456432 3466676666555444 57887776544332100 000 0000000011112222 Q ss_pred EEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHH Q lcl|NC_016654. 205 VYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTF 284 (533) Q Consensus 205 ~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~li 284 (533) ++...+ |.-.... ...... ...+.+.+..-+.|++. +|...+ .+.||+|--..++ +-+ T Consensus 213 v~~~~~---~~~~~~~------~~d~~~------~~~~s~y~~~e~P~~~~-----Rw~~~~-ge~YGrgp~~~~L-~D~ 270 (517) T protein:vir:10 213 AKRTKD---GKYLIRQ------SADDVP------VGKESTVTEDKSPFLIL-----TWKRSY-GEDYGRGMAEDHA-GAF 270 (517) T ss_pred EEEeCC---CceEEEE------EeCcee------eccccccccccCCeeee-----eeeecC-CCCcccchHHHhH-HHH Confidence 222111 1000000 000000 00112221111223332 243333 4788999766655 556 Q ss_pred HHHHHHHHHHHH-HHHhCcceeeechHHhcCCCCcccccc-Ccchhhhhhccccccccccccccceee-echhhh-hHHH Q lcl|NC_016654. 285 HELDRIYSSLMR-DFRIGAGKVHASESVLTNLGMGQGVSL-DEEQEVYSRVGSGGFNANGDMETIFEF-FQPAIR-VLEH 360 (533) Q Consensus 285 d~lD~~~s~~~~-~~~~~~~~i~v~~~~l~~~~~~~~~~~-d~~~~~~~~~~~~~~~~~~~~~~~i~~-~~~~ir-~e~~ 360 (533) ..|+..--.... .....+....||++... +...+ +...+.+.+.. . ++...++. ...++. +.+- T Consensus 271 k~L~~l~~~~~~~~~~a~~~~~lv~~~~~~-----~~~~l~~~~~g~~~~g~-----~--~~v~~~~~~~~~d~~~~~~~ 338 (517) T protein:vir:10 271 FVIQFLSEALARGMALMADVKYLVKPGSYT-----DINQFVEGGSGAVLHGV-----E--GDIHIVQLGKYADYTPIQAV 338 (517) T ss_pred HHHHHHHHHHHHHHHHhccCCcccCccccc-----chhhccCCCccccccCC-----c--ccceeeecccccchhHHHHH Confidence 677765433333 33455555556543321 11111 11111121100 0 00001110 111222 1233 Q ss_pred HHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhccCCCCCC Q lcl|NC_016654. 361 DQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKA-RHFGSALGPLSTTCLRVDAIKFPGKGAAP 439 (533) Q Consensus 361 ~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~-~~~~~al~~li~~il~l~~~~~~~~~~~~ 439 (533) ++.++.-++.+.+..+ +....+...|||||..+.+......+-.- +.-...|..++..++.+.. .....+ T Consensus 339 i~~~~~rI~~af~~~~-----l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~l~----~~l~~~ 409 (517) T protein:vir:10 339 LNDYRQRIGRVFMMEA-----MTRRDAERVTAYEIQRDAMLVEQSLGGVYSLFATTFQGPLARWFMNGIS----SILTSK 409 (517) T ss_pred HHHHHHHHHHHHhhhh-----hhccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhh----hhcCCC Confidence 3344444444443222 12122234699999877766555444321 1122223334444443221 111222 Q ss_pred ceeEEEEeCCCCCCCHHHHHHHHHHH------Hh--CC-------CCCHHHHHHHh---CC-----CCCHHHHHHHHHHH Q lcl|NC_016654. 440 SEELELEWPKFARESDLAKAQTVQAW------SV--AS-------AASTKTKVAYL---HE-----DWDDERVQEEADLI 496 (533) Q Consensus 440 ~~~v~i~f~d~i~~d~~e~a~~~~~l------~~--aG-------i~S~et~v~~l---~~-----~~~dee~~~El~rI 496 (533) . +.++.--++ +...+.+.+..+ ++ ++ .+-.+.+++.+ .+ --+++|++++.++- T Consensus 410 ~--v~~~~~s~l--a~l~r~~~~~~i~~~~~~i~~~a~~~~~~~~~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~~~~~ 485 (517) T protein:vir:10 410 N--VSPTILTGI--EALGRMAELDKLGTFNGYVSMTAQWPEPLQQAIKWPDFTDWVQGQISANFPFFKTQDELNAEAQAQ 485 (517) T ss_pred C--ccceeeccH--HHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHHhCCChhhcCCHHHHHHHHHHH Confidence 2 233221111 111111111111 10 11 11222233221 11 02556666555443 Q ss_pred HHhhhccc--Cccccc-cccCCCCCCCCCCCC Q lcl|NC_016654. 497 DNANTVSA--PTFGFG-TDQPPLPTENDPATD 525 (533) Q Consensus 497 ~~E~~~~~--~~~~~~-~~~~~~~~~~~~~~~ 525 (533) +..++..+ ...+.+ +.....+.++-++++ T Consensus 486 ~~~~~~~~~~~~ag~~~~~~~~~~~~~~~~~~ 517 (517) T protein:vir:10 486 QEQEATKYAAEQAGKAIPDMVKNGQINPQGGQ 517 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCCCCCCC Confidence 33332211 111111 112222223333333 No 187 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=97.07 E-value=0.00018 Score=41.07 Aligned_cols=375 Identities=8% Similarity=-0.046 Sum_probs=149.7 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcc---ccee Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRA---PKRY 77 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~---~~~~ 77 (533) |.| ++.|-+. . ..+. .-+....+...-.+. .....|.. ..-+ T Consensus 1 M~~---------------------f~~~~~~-----------~-~~~~-~~~~~~~~~~~~~~~-~~~~~~~~v~~~~al 45 (386) T protein:vir:49 1 MPI---------------------FNITNLA-----------T-ESPP-INQESFFDIADSDFL-ASLNSSEWVSAENAL 45 (386) T ss_pred Cch---------------------hhhhccC-----------C-CCcc-cchhhhhhhhhcccc-ccccCCceechhhhh Confidence 222 2221110 0 0000 000000000000000 00000100 0111 Q ss_pred ecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEE Q lcl|NC_016654. 78 HAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDF 157 (533) Q Consensus 78 ~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~ 157 (533) ...---..++.+|+-+.+-|..+.- . ..+..+.+=-..-....-+...+...+..|.+|+.+..|..+. -+.+.+ T Consensus 46 ~~~~v~~~i~~ia~~ia~~p~~~~~--~--~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~-~~~l~~ 120 (386) T protein:vir:49 46 KNSDLFSIISQLSNDLATAKITTSR--K--QLQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGR-DMKWEY 120 (386) T ss_pred ccHHHHHHHHHHHHHhhhCceeecc--c--hhhhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCc-EEEEEE Confidence 1111223456666655554543322 1 1122221110111123334445556677899999888886543 356777 Q ss_pred EcCCeEEEEEec-CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccC Q lcl|NC_016654. 158 VDADRAIPEFRW-GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEG 236 (533) Q Consensus 158 v~~~~~~P~~~~-g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~ 236 (533) ++|+++-++.+. +.. .++. |...+...|..+.+. T Consensus 121 i~~~~v~v~~~~~~~~---~~y~--------------------------~~~~~~~~~~~~~~~---------------- 155 (386) T protein:vir:49 121 LRPSQVSFNRLDNQNG---LYYN--------------------------ITFDDPHIAPKQHVP---------------- 155 (386) T ss_pred ecCceeEEEEcCCCce---EEEE--------------------------EEEcCccccceeEEc---------------- Confidence 888877665332 211 1110 100000111110000 Q ss_pred CceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhC-cceeeechHHhcC Q lcl|NC_016654. 237 RGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIG-AGKVHASESVLTN 314 (533) Q Consensus 237 ~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~-~~~i~v~~~~l~~ 314 (533) .--+.|+....++ ...+|.|.+..+. ..|+ +.....++... ++.+ ....++ .. T Consensus 156 ----------~~evih~~~~~~~--------~~~~G~s~l~~~~-~~i~-~~~~~~~~~~~~~~ng~~~~~il-----~~ 210 (386) T protein:vir:49 156 ----------QNDILHFRLLSVD--------GGLTSVSPLMALG-REFN-IQKASDKLTISALKNALNANGIL-----KI 210 (386) T ss_pred ----------cccEEEecCCCCC--------CccccccHHHHHH-HHHH-HHHHHHHHHHHHHHccCCccEEE-----Ee Confidence 0012333321111 1345778776543 4443 33344444444 3543 333322 11 Q ss_pred CCCccccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcch Q lcl|NC_016654. 315 LGMGQGVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQ 390 (533) Q Consensus 315 ~~~~~~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~ 390 (533) .+.... +.....-........++++ +....++.++......++++..+...++|+...|+||..+|.+.++.. T Consensus 211 ~~~~~~---~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~ 287 (386) T protein:vir:49 211 KGGGLL---DFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDGDQQS 287 (386) T ss_pred CCCCCh---HHHHHHHHHHHHhccCCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccc Confidence 111100 0000000000001111111 122235666666677788888888999999999999999986555444 Q ss_pred hHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCC Q lcl|NC_016654. 391 TATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASA 470 (533) Q Consensus 391 Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi 470 (533) ++..++..+ ...|...++.+..-.+..+. ..+.++.....-.|..+.+..+.+++.+|+ T Consensus 288 ~~~~~~~~~--------------~~~i~~~l~~i~~~~~~~l~-------~~~~~~~~~~~~~d~~~~~~~~~~l~~~g~ 346 (386) T protein:vir:49 288 SLEMIYNIY--------------FKSVSRYLRPFVSEMSKKLS-------CEVDVDISPAVDPTGSNYISLINSMVKSGT 346 (386) T ss_pred hHHHHHHHH--------------HHHHHHHHHHHHHHHHHHhc-------chhcccchhhhccCHHHHHHHHHHHHhCCC Confidence 454443222 22222222222211111111 123344455566677888999999999999 Q ss_pred CCHHHHHHHhC-CCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 471 ASTKTKVAYLH-EDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVD 530 (533) Q Consensus 471 ~S~et~v~~l~-~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (533) +++-++.+.+. .++...+. + . ++++-.. ...+||.+++| T Consensus 347 ~t~nE~r~~l~~~~~~~~~~---------------~---~-~~~~~~~--~~~gGd~~~~~ 386 (386) T protein:vir:49 347 LAQNQGLYILQQAEILPKEL---------------P---D-GKNPNRT--SLKGGEINEQD 386 (386) T ss_pred cCHHHHHHHHhhCCCCCCcC---------------c---c-hhccCCC--CCCCCCCCCCC Confidence 99988766531 01111111 0 0 1110000 01122222222 No 188 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=97.02 E-value=0.00021 Score=40.78 Aligned_cols=402 Identities=8% Similarity=-0.014 Sum_probs=159.9 Q ss_pred HHHHhhhHhhcCC--HHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcc---cceeecChHHHHHHHHHHhhc Q lcl|NC_016654. 20 ARVAESHVWWEGD--LDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRA---PKRYHAPIPGVIAKLSTTELF 94 (533) Q Consensus 20 ~~~~~~~~w~~gd--~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~---~~~~~~n~~k~i~~~~a~ll~ 94 (533) --+--|.-|+.|+ -.-+..|++.+....+... +.+..-.+ ......|.. ..-+.+.---..++.+|+-+- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~lf~~~~~~~~~~~---~~~~~~~~--~~~~~~~~~vs~~~al~~~~v~~cv~~Ia~~iA 75 (424) T protein:vir:45 1 MLYCWWAHWLWPEGGRVLLDALFRSKSLENPSTP---ITGDAVDT--DGLFRADVYVSPETAMKLAAVYSCIYVLSSSLA 75 (424) T ss_pred CeeEeeeceecCcchhHHHHhhccccCCCCCccc---cchhhhhh--hccccCCceechHHhhccHHHHHHHHHHHHHHh Confidence 0111233344433 2333444443322222211 00000000 001111111 111111112223444444444 Q ss_pred CCCceEeeC--CCchH-HHHHHHHHHh--hccH--H-HHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEE Q lcl|NC_016654. 95 SEQLKFLDA--GKSKE-VQARADLIFN--TPRF--H-SSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPE 166 (533) Q Consensus 95 ~e~~~i~~~--~~~~~-~~~~l~~i~~--~n~f--~-~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~ 166 (533) +=|..+--. +..+. ....+..+|. -|.. . .-....+...+..|.+|+.+..|..+. -+.+..++++.+.+. T Consensus 76 ~lp~~v~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~-~~~L~~l~~~~v~i~ 154 (424) T protein:vir:45 76 QMPLHVMRRHKGKVEPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGE-VISLDCCMPWETTLM 154 (424) T ss_pred hCceEEEEecCCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCc-EEEEEEecCceEEEE Confidence 445433211 11110 1112333332 1211 1 223344556677799999887776543 245666666666544 Q ss_pred EecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCc Q lcl|NC_016654. 167 FRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVK 246 (533) Q Consensus 167 ~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~ 246 (533) -..+++ .| .++... .+ ..++ T Consensus 155 ~~~~~~----------------~y-------------~~~~~~-~~--~~~~---------------------------- 174 (424) T protein:vir:45 155 NTGGRY----------------TY-------------GLYNEY-GA--FAIS---------------------------- 174 (424) T ss_pred EcCCeE----------------EE-------------EEEecC-ce--EEEC---------------------------- Confidence 322221 11 111100 00 0000 Q ss_pred cceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHH-HhCc-ceeeechHHhcCCCCccccccC Q lcl|NC_016654. 247 DLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDF-RIGA-GKVHASESVLTNLGMGQGVSLD 324 (533) Q Consensus 247 ~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~~-~~i~v~~~~l~~~~~~~~~~~d 324 (533) +--+.|+..... . ...|.|.+..+. ..|. +......+...+ +.|- +..++ ...+.-.....+ T Consensus 175 ~~eVih~r~~~~----d-----~~~G~spi~~~~-~~i~-~~~~~~~~~~~~f~ng~~p~gil-----~~~~~l~~e~~~ 238 (424) T protein:vir:45 175 PDDMIHIRALGN----N-----QKMGLSPIMQHA-ETIG-MGMSGQKYTESFFSGNARPAGIV-----SVKSGLNKESWG 238 (424) T ss_pred cccEEEecCcCC----C-----CcccccHHHHHH-HHHH-HHHHHHHHHHHHHhccCCccEEE-----EeCCCCCHHHHH Confidence 001223322111 1 234777766433 4442 334444454443 5433 23232 111110000000 Q ss_pred cchhhhhhccccc-ccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc-hhHHHHHHH Q lcl|NC_016654. 325 EEQEVYSRVGSGG-FNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA-QTATEASGK 398 (533) Q Consensus 325 ~~~~~~~~~~~~~-~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~-~Tatai~~~ 398 (533) .....+.....+. .++++ +....++.++......++++..+...++|+...|++|..+|...++. .++++. T Consensus 239 ~~~~~~~~~~~g~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~--- 315 (424) T protein:vir:45 239 WLKDQWQKASQALRRQENKTMLLPADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNISAQ--- 315 (424) T ss_pred HHHHHHHHHhccccccCCceeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH--- Confidence 0011111111110 01111 11223455555555667888888889999999999999998654332 222222 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCC-CCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHH Q lcl|NC_016654. 399 KDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKG-AAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKV 477 (533) Q Consensus 399 ~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~-~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v 477 (533) ....+..+|..+++.+..-.+..+.... ......+.++.+.-+-.|..++++.+.+++++|+|+..++. T Consensus 316 ----------~~~f~~~tL~P~~~~ie~~ln~kLl~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~NE~R 385 (424) T protein:vir:45 316 ----------AIQFVRYTMMPWVTNWEQELNRRLFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEAR 385 (424) T ss_pred ----------HHHHHHHHHHHHHHHHHHHHHHhcCChhhhcCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHH Confidence 2233444455555544332222222111 11224466666677778999999999999999999999966 Q ss_pred HHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 478 AYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDE 531 (533) Q Consensus 478 ~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 531 (533) +.+ ++..-+ .-+++ ..+. .-.++..+.+.+..+...++| T Consensus 386 ~~~--gl~pi~---ggD~~------~~~~----n~~~~~~~~~~~~~~~~~~~~ 424 (424) T protein:vir:45 386 AFE--DMNPVE---GLDEM------LVSV----NAANPAGDFKPPKNDEGKTNE 424 (424) T ss_pred HHh--CCCCCC---Cccee------eecc----cccccccccCCCCCCCCCCCC Confidence 654 232200 00000 0000 000111111111111111122 No 189 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=97.00 E-value=0.00022 Score=40.66 Aligned_cols=436 Identities=13% Similarity=0.081 Sum_probs=145.7 Q ss_pred CCCCCC----cCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccce Q lcl|NC_016654. 1 MSLPEA----NTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKR 76 (533) Q Consensus 1 ~~~~~~----~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 76 (533) =.||.. ..++-||.+ +|..|..+-. T Consensus 31 ~~~~~~~~~~~~~~~~p~~----------------~~~~L~~~~e----------------------------------- 59 (651) T protein:vir:99 31 TQIPDHRIQSHNVGVNPPY----------------NPDRLAAFLE----------------------------------- 59 (651) T ss_pred cccchhhhcccCCCCCCCC----------------CHHHHHHHHh----------------------------------- Confidence 011100 001111111 3344443322 Q ss_pred eecChHHHHHHHHHHhhcCCCceEe----eCCC--chHHHHHHHHHHhh---------------ccHHHHHHHHHHHHhh Q lcl|NC_016654. 77 YHAPIPGVIAKLSTTELFSEQLKFL----DAGK--SKEVQARADLIFNT---------------PRFHSSLVEAGESCSA 135 (533) Q Consensus 77 ~~~n~~k~i~~~~a~ll~~e~~~i~----~~~~--~~~~~~~l~~i~~~---------------n~f~~~~~~~~~~~~~ 135 (533) +.+....+++..++.+.|=+..+. +..+ ...-.+.....+.. ..+...+...+.+-.. T Consensus 60 -~~~~~~~~i~~~~~~iag~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~ 138 (651) T protein:vir:99 60 -LNETLATGIRKKSRYEVGFGFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHG 138 (651) T ss_pred -cChHHHHHHHHHhhhhhccCceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHH Confidence 223444555555555444332221 1111 11101111111100 1222333334444444 Q ss_pred hCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEec-----------------CCceEEEEEEEEeecCCceEEEEEEEecC Q lcl|NC_016654. 136 LSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRW-----------------GRLVAVTFWSELAGGDGQEVWRHLERHES 198 (533) Q Consensus 136 ~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~-----------------g~~~~v~f~~~~~~~~~~~~y~~lE~h~~ 198 (533) .|-+++.+.-+..+ .-+.+.++++..+- +-.. +..+...|...+........|-+ .+-.. T Consensus 139 tGna~ieiIrn~~g-~pv~L~~lp~~~~R-v~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~-~~g~~ 215 (651) T protein:vir:99 139 VGWLALEMLTDIEG-RPVGLAYVPARTVR-VRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFG-EAGDR 215 (651) T ss_pred HhhHhhhhhhcCcc-chhhhhhcChhhee-eecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEE-Eeecc Confidence 55555555443221 11223233332110 0000 00000000000000000011110 00000 Q ss_pred e-eEEEEEEeccCCcccceeehhhcccccc--ccccccccCCceeec-CC---CccceeEEecCCccccccccccccccc Q lcl|NC_016654. 199 G-YIVHAVYKGTATSLGWMMALTDHPATRD--IAVEGADEGRGAYVE-TG---VKDLTAAYVPNVTPNPEWRHDPKLRYL 271 (533) Q Consensus 199 ~-~I~~~~y~~~~~~lG~~v~l~~~~~~~~--~~~~~~~~~~~~~~~-~g---~~~~~~~~~pn~~~~~~~~~~~~~~~~ 271 (533) | ++. .+.......+.............. ..... +...... ++ .+.--+.|++... +....+ T Consensus 216 ~~~~~-~~~~~~~~~v~~~~~~d~~~~~~~~~~~~~~---g~~~~~~~~~~~~~~~~eViHir~~~--------~~~g~~ 283 (651) T protein:vir:99 216 YRGQE-VVIDESGDEPTIRYREDEESEREPIFVDRET---GDVTTGDANGLENRPANELIFIPNPS--------ILEDDY 283 (651) T ss_pred cccee-eeeccCCcceeEEeccCcceeeeeeccccee---eeEEEcCCCceeEecccceEEecCCC--------CCCCcc Confidence 0 000 000000000000000000000000 00000 0000000 00 0111234443221 112457 Q ss_pred ccchhhhhHHHHHHHHHHHHHHHHHHH-HhCc-ceee--echHHhcCCCCccccccCcchhhhhhccccc-----ccc-- Q lcl|NC_016654. 272 GRADLSTDLFPTFHELDRIYSSLMRDF-RIGA-GKVH--ASESVLTNLGMGQGVSLDEEQEVYSRVGSGG-----FNA-- 340 (533) Q Consensus 272 G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~~-~~i~--v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~-----~~~-- 340 (533) |.|.+..+. ..| .++.....+...+ +.|. ...+ +|...+.. .+. +.-...+....... ... T Consensus 284 G~spl~~a~-~~i-~~a~~a~~~~~~~f~NG~~p~gil~~~~~~ls~---e~~---~~lr~~~~~~~~nagk~~vL~~~~ 355 (651) T protein:vir:99 284 GVPDWVSAI-RTI-SADEAAKDYNRDFFDNDTIPRMVIKVTGGELSE---ESK---RDLRQMLNGLREESHRAVVLEVEK 355 (651) T ss_pred cccHHHHHH-HHH-HHHHHHHHHHHHHHhccCCCceEEEecCCCCCH---HHH---HHHHHHHHHHhccCCceEEeeccc Confidence 888887754 344 3445555555443 5543 2222 22111110 000 00011111110000 000 Q ss_pred -----ccccccceeeechhh-hhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc-hhHHHHHHHhhhHHHHHHHHHHHH Q lcl|NC_016654. 341 -----NGDMETIFEFFQPAI-RVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA-QTATEASGKKDLTVKTTRAKARHF 413 (533) Q Consensus 341 -----~~~~~~~i~~~~~~i-r~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~-~Tatai~~~~~~l~~~~~~~~~~~ 413 (533) ..+....++.++... ...++.+..+...++|+...|++|..+|+..++. .++++.. ...+ T Consensus 356 ~~~~~~~~~g~~~~pls~~~~~D~qfle~r~~~~~eIa~afgVPp~~lG~~~~~~~sn~E~~~-------------~~f~ 422 (651) T protein:vir:99 356 FQSQLDEDVEIELEPMGQGISEEMDFRQFREKNEHEIAKVLEVPPVKIGVTDSANRSNSDQQD-------------KDFA 422 (651) T ss_pred ccccccccCCceEEEcCcCchhhHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCcccHHHHH-------------HHHH Confidence 001112233333332 3457788778888999999999999998754432 2233221 1223 Q ss_pred HHHHHHHHHHHHHHHHhhccC-CCCCCceeEEEEeCC--CCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhC-CCCCHHHH Q lcl|NC_016654. 414 GSALGPLSTTCLRVDAIKFPG-KGAAPSEELELEWPK--FARESDLAKAQTVQAWSVASAASTKTKVAYLH-EDWDDERV 489 (533) Q Consensus 414 ~~al~~li~~il~l~~~~~~~-~~~~~~~~v~i~f~d--~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~-~~~~dee~ 489 (533) +.+|.-+++.+....+..+.. ........+.+.|+. -.-.|..++++.+.+++++|+|+..++.+.+. |.+.++.. T Consensus 423 ~~tL~P~~~~ie~eln~kLl~~~e~~~~~~i~~ef~~~~llr~D~~~~~e~~~~~i~~G~~T~NE~R~~lglppi~~~~g 502 (651) T protein:vir:99 423 LEVIQPEQHTFAEWLYQIIHQQALGVTDWTIEYELRGADQPKQEAQLAEQRVRAMRLAGVGLVDEAREELGLDPLGEPYG 502 (651) T ss_pred HHHHHHHHHHHHHHHHHhhcCccccccCceEEEEeccchhhhccHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccc Confidence 334444444443322222221 111223345566654 45578999999999999999999999877652 22332222 Q ss_pred HHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 490 QEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 490 ~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ..-+..++... . ++.....+.......++..+.++ T Consensus 503 d~~l~~~~~~~------~---g~~~~gge~~~~~~~~~~~~~~~ 537 (651) T protein:vir:99 503 EMTLSEFEAEV------A---GDVAGGGETEAVHEPPEENKIGE 537 (651) T ss_pred ccccccccccc------c---cccccCCCCcccccCcccccccc Confidence 11111111000 0 00000000000000111222222 No 190 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=96.96 E-value=0.00024 Score=40.45 Aligned_cols=393 Identities=7% Similarity=-0.043 Sum_probs=156.1 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHH--HHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceee Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLD--KLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYH 78 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~--~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 78 (533) ||+=++- ... .+=.|++..-. +-.+.+............ ..+....+..... ..-.. T Consensus 1 ~~~~~~~-----------~~~--~~m~~F~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~------~~~~~ 59 (413) T protein:vir:96 1 MPGVSEI-----------RKD--KNLKFFNNKRSPTEESKAKDEIPKAPQVVMT--LPNFFKELISDGY------TKLSD 59 (413) T ss_pred CCccchh-----------hhh--hcCCccccCCCcchhhhhhcccccccccccc--chhhHhhhccchh------HHHhh Confidence 4332110 000 00001111000 000000000000000000 0000000000000 00011 Q ss_pred cChHHHHHHHHHHhhcCCCceEeeCC--CchHHHHHHHHHHh--hcc---HHHHHHHHHHHHhhhCCEEEEEEEcCCCCC Q lcl|NC_016654. 79 APIPGVIAKLSTTELFSEQLKFLDAG--KSKEVQARADLIFN--TPR---FHSSLVEAGESCSALSGSFQRIVWDPTIAD 151 (533) Q Consensus 79 ~n~~k~i~~~~a~ll~~e~~~i~~~~--~~~~~~~~l~~i~~--~n~---f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~ 151 (533) ......+++.+|+-+.+-|..+--.. ........+..++. -|. ...-++.++...+..|.+|+.+..|..+.. T Consensus 60 ~~~v~~cI~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~~ 139 (413) T protein:vir:96 60 SPEVRMAVDCIADLVSNMTIQLMQNGETGDKRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGDK 139 (413) T ss_pred chHHHHHHHHHHHhhccCceEEEEecCCCccccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCc Confidence 23344555666666655454432111 01111122333332 121 223445556667788999999888866543 Q ss_pred ceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccc Q lcl|NC_016654. 152 NAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVE 231 (533) Q Consensus 152 ~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~ 231 (533) -..+..++|+.+.+..+.+.+ ++. +. + . |..++- .+ T Consensus 140 ~~~L~~l~~~~v~~~~~~~~~----~y~-~~-----------------------~--~----~~~~~~----------~e 175 (413) T protein:vir:96 140 IIGLTPISPYKVTFNVSDDDL----DYS-IT-----------------------F--D----NKEYDP----------ST 175 (413) T ss_pred eEEEEEecCceeEEEEcCCeE----EEE-Ee-----------------------e--c----CcEEch----------hh Confidence 336777888877765444321 110 00 0 0 001100 00 Q ss_pred ccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCcceeeechH Q lcl|NC_016654. 232 GADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGAGKVHASES 310 (533) Q Consensus 232 ~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~~~i~v~~~ 310 (533) -++++-+..++ ....|.|.+..+ ...| .++...+.+... |+.+...-.+ T Consensus 176 -----------------vih~k~~~~~~--------~~~~G~s~~~~~-~~~i-~~~~~~~~~~~~~~~ng~~p~gi--- 225 (413) T protein:vir:96 176 -----------------LLHFVLNPSIE--------RPFIGTGYKVAL-KDIV-GNLKQASVTKKGFMASEYMPNLI--- 225 (413) T ss_pred -----------------EEEEeccCCCC--------CccccccHHHHH-HHHH-HHHHHHHHHHHHHHhccCCccEE--- Confidence 01122111111 112477766543 3444 334444555444 3544322111 Q ss_pred HhcCCCCccccccCcchhhhhhccccc--------cccccccccceeeec-hhhhhHHHHHHHHHHHHHHHHhhCCChhh Q lcl|NC_016654. 311 VLTNLGMGQGVSLDEEQEVYSRVGSGG--------FNANGDMETIFEFFQ-PAIRVLEHDQGAALLLREVLRKTGYSPVS 381 (533) Q Consensus 311 ~l~~~~~~~~~~~d~~~~~~~~~~~~~--------~~~~~~~~~~i~~~~-~~ir~e~~~~~l~~~l~~i~~~~g~s~~~ 381 (533) +...+.-.....+.....+.....+. ...++. .++.+. ......++++..+...++|+...|+||.. T Consensus 226 -l~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~~~~---~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~ 301 (413) T protein:vir:96 226 -VSVDSDSDELSDEEGRENFEEMYLKRKEAGKPWIIPEGMV---NVQQIKPLTLNDLAINDAVTLDKKTVAGIFGVPAFL 301 (413) T ss_pred -EEeCCCCCHHHHHHHHHHHHHHhcCccccCceeeecCCcc---cccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHH Confidence 21111100000001111121111111 111111 112221 13334567777778888999999999999 Q ss_pred cccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHH Q lcl|NC_016654. 382 LGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQT 461 (533) Q Consensus 382 ~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~ 461 (533) +|...+. .++. ...++.+|..+++.+....+..+.. +...+.+++++-+..|..+.++. T Consensus 302 lg~~~~~--~~~~---------------~~~~~~~l~P~~~~ie~~ln~~ll~----~~~~~~fd~~~ll~~d~~~~~~~ 360 (413) T protein:vir:96 302 LGVGTYN--KDEF---------------NNFINTKIMSIAQVIQQTYNKLIVE----EDMYFSLNPRSLYNYSLTEMVSA 360 (413) T ss_pred cCCCcch--HHHH---------------HHHHHHHHHHHHHHHHHHHHHhhCC----CCcEEEEechhhhccCHHHHHHH Confidence 9743221 1111 1244555666666655444433322 34567777778888899999999 Q ss_pred HHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 462 VQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 462 ~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +.+++.+|+|+..++.+++ +...-+ +. +..-...+..|. +..++.+....|| T Consensus 361 ~~~~~~~G~~t~NE~R~~~--g~~p~~---~g-----------d~~~~~~n~~~~----~~~~~~~~~~~~d 412 (413) T protein:vir:96 361 GAQMTQLNALRRNEFRNWV--GMPPDA---EM-----------DDLLVLENYLQQ----KDLVNQKKLIQDE 412 (413) T ss_pred HHHHHhCCCcCHHHHHHHh--CCCCCC---Cc-----------ceeeecccccch----hhcccccCCCCCC Confidence 9999999999999976654 233210 00 000000111111 1111112222222 No 191 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=96.91 E-value=0.00027 Score=40.18 Aligned_cols=377 Identities=8% Similarity=-0.011 Sum_probs=147.8 Q ss_pred CCCCCCcC-CCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeec Q lcl|NC_016654. 1 MSLPEANT-AWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHA 79 (533) Q Consensus 1 ~~~~~~~~-~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 79 (533) |+|=.... ....|. +. -..|+.- ..+....++.+.... ....-+.. T Consensus 1 M~~f~~~~~~~~~~~-~~-------~~~~~~~-----------------------~~~~~~~~~~~~~~v--~~~~~~~~ 47 (386) T protein:vir:48 1 MPIFNITNLATESPP-IS-------QGGFFDI-----------------------TDPDFLSTLNGSEWV--SAESALRN 47 (386) T ss_pred Ccccccccccccccc-cc-------ccccccc-----------------------ccchhcccccCCcee--chhhhhcc Confidence 55532211 111111 00 0001000 000000000000000 00001111 Q ss_pred ChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEc Q lcl|NC_016654. 80 PIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVD 159 (533) Q Consensus 80 n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~ 159 (533) .--...++.+|+-+-+=|.. +... .....+.+--..-.....+...+...+..|.+|+.+..|..+. -+.+.+++ T Consensus 48 ~~v~~~i~~ia~~ia~~p~~--~~~~--~~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~-~~~L~~l~ 122 (386) T protein:vir:48 48 SDLFSIINQLSNDLATVKLT--ASRK--QLQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGR-DMKWEYLR 122 (386) T ss_pred hHHHHHHHHHHHhhccCcee--eccc--hhHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCc-EEEEEEec Confidence 22224555555555444433 2211 1222222111111122333444556677899998888876543 35666677 Q ss_pred CCeEEEEEec-CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCc Q lcl|NC_016654. 160 ADRAIPEFRW-GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRG 238 (533) Q Consensus 160 ~~~~~P~~~~-g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~ 238 (533) ++.+.++... |.. +++. |...+...|..+.+ T Consensus 123 ~~~v~v~~~~~~~~---~~y~--------------------------~~~~~~~~~~~~~~------------------- 154 (386) T protein:vir:48 123 PSQVSFNRLDNKDG---IYYN--------------------------ITFDDPRIPPKQHV------------------- 154 (386) T ss_pred CceeEEEEcCCCce---EEEE--------------------------EEecCccccceeEe------------------- Confidence 7777655322 210 1111 11111111111000 Q ss_pred eeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhC-cceeeechHHhcCCC Q lcl|NC_016654. 239 AYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIG-AGKVHASESVLTNLG 316 (533) Q Consensus 239 ~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~-~~~i~v~~~~l~~~~ 316 (533) +.--+.|+.+...+ ...+|.|.+..+. ..|.. .....++... ++.| .+..++ ...+ T Consensus 155 -------~~~evih~~~~~~~--------~~~~G~s~i~~~~-~~i~~-~~~~~~~~~~~~~ng~~~~~ii-----~~~~ 212 (386) T protein:vir:48 155 -------PQGDVLHFKLLSVD--------GGLTSVSPLMALS-RELNI-QKASDKLTLNSLKNALNANGIL-----KIKG 212 (386) T ss_pred -------cCccEEEecCCCCC--------CceeeccHHHHHH-HHHHH-HHHHHHHHHHHHhccCCcceEE-----EeCC Confidence 00012333322111 1245777776543 33432 2233333333 3443 333332 2111 Q ss_pred CccccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhH Q lcl|NC_016654. 317 MGQGVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTA 392 (533) Q Consensus 317 ~~~~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Ta 392 (533) ... .+..............++++ +....++.++......++++..+...++|+...|+||..+|+.+++. ++ T Consensus 213 ~~~---~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~-~~ 288 (386) T protein:vir:48 213 GGL---LDFKTKLSRSRQAMKQMQGGPLVLDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQGDQQ-SS 288 (386) T ss_pred CCC---HHHHHHHHHHHHHhhcCCCCceecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc-cH Confidence 111 01111111111111111111 12234566666666678888888889999999999999998644332 22 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCC Q lcl|NC_016654. 393 TEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAAS 472 (533) Q Consensus 393 tai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S 472 (533) .+. ....++.+|..++..+..-.+..+.. ++.+++...+-.+....+..+.+++.+|+++ T Consensus 289 e~~-------------~~~~~~~~l~P~~~~ie~~l~~~l~~-------~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t 348 (386) T protein:vir:48 289 LEM-------------SLDLYNKAVSRYLRPFLSELSQKLSC-------DVDADILPAVDPTGSNSVSRINSMVKSGTLA 348 (386) T ss_pred HHH-------------HHHHHHHHHHHHHHHHHHHHHHhhcc-------hhhcchhhhhccChHHHHHHHHHHHhCCCcC Confidence 211 11133344444444443322222211 1222233333456677888888999999999 Q ss_pred HHHHHHHh-CCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 473 TKTKVAYL-HEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVD 530 (533) Q Consensus 473 ~et~v~~l-~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (533) +.++.+.+ .+.+...++ .+. + . .+.+| .++++++++ + T Consensus 349 ~nE~r~~lg~~~~~~~~~----~~~--~----~------~~~~~-~~gGd~~~~----~ 386 (386) T protein:vir:48 349 QNQGLYILQQAEILPKEL----PEG--E----N------PNKTT-LKGGEINGE----D 386 (386) T ss_pred HHHHHHHhhcCCCCCccc----hhh--c----C------CCCCc-cCCCCCCCC----C Confidence 99977654 222332221 111 0 0 01111 111122111 1 No 192 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=96.89 E-value=0.00028 Score=40.08 Aligned_cols=376 Identities=10% Similarity=-0.004 Sum_probs=150.5 Q ss_pred HHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhccc------C-CCCCc------ccceeecChHHHHHHH Q lcl|NC_016654. 22 VAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRT------P-TATGR------APKRYHAPIPGVIAKL 88 (533) Q Consensus 22 ~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~------~-~~~g~------~~~~~~~n~~k~i~~~ 88 (533) |..-.+ +.++.. .....|.... + .+.+. .+.-++..--...++. T Consensus 1 ~~~~~~--------~~~~~~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~V~~ci~~ 57 (409) T protein:vir:93 1 MAKENI--------VTRIKK---------------KLIDNWIDQSTSKLYDFSPWKNRSFWGVINNTLETNETIFSAITK 57 (409) T ss_pred CCccch--------hhhhhh---------------hhhhhhhccccccccccccccCccccccchhhhhccHHHHHHHHH Confidence 111100 111100 0011111100 0 00000 0111122222334445 Q ss_pred HHHhhcCCCceEeeCCCchHHHHHHHHHHhh--cc---HHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeE Q lcl|NC_016654. 89 STTELFSEQLKFLDAGKSKEVQARADLIFNT--PR---FHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRA 163 (533) Q Consensus 89 ~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~--n~---f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~ 163 (533) +|+-+-.=|..+--. .+.....+..+|.. |. -..-....+...+..|.+|+.+..|..+. -+.+-.++|+.+ T Consensus 58 Ia~~ia~lp~~~~~~--~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~-~~~L~~l~~~~v 134 (409) T protein:vir:93 58 LSNSMASLPLKMYED--YKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQ-PSKLFLLNPDVV 134 (409) T ss_pred HHHhhhhCceeEeec--cccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCc-EEEEEEEcCcee Confidence 555444434333211 11122223334322 21 12223445556677899999888876543 245666778777 Q ss_pred EEEEec-CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeec Q lcl|NC_016654. 164 IPEFRW-GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVE 242 (533) Q Consensus 164 ~P~~~~-g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~ 242 (533) .++.++ +. .+| |.+.... |.++.+ T Consensus 135 ~~~~~~~~~---~~~-------------------------y~~~~~~----g~~~~~----------------------- 159 (409) T protein:vir:93 135 EMLIENQSR---ELY-------------------------YSIHAAT----GNKLIV----------------------- 159 (409) T ss_pred EEEEeCCCc---EEE-------------------------EEEEcCC----ceEEEE----------------------- Confidence 765433 21 011 1111000 111100 Q ss_pred CCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcc---eeeechHHhcCCCCcc Q lcl|NC_016654. 243 TGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAG---KVHASESVLTNLGMGQ 319 (533) Q Consensus 243 ~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~---~i~v~~~~l~~~~~~~ 319 (533) +.--+.|+++..+ + ...+|.|.+..+ ...++- +.....+ .+..++. .|+....-+. T Consensus 160 ---~~~eVih~r~~~~---~-----~~~~G~s~i~~~-~~~i~~-~~~~~~~--~~~~~~~~~~~i~~~~~~l~------ 218 (409) T protein:vir:93 160 ---HNMDMLHFKHIVA---S-----NMVQGISPIDVL-KNTTDF-DNAVRTF--NLTEMQKPDSFMLKYGSNVG------ 218 (409) T ss_pred ---ccccEEEeCCCCC---C-----CccccccHHHHH-HHHHHH-HHHHHHH--HHHhcCCCCceEEecCCCCC------ Confidence 0001233332211 1 123477766542 334332 2222222 1222221 1211111110 Q ss_pred ccccCcchhhhhhcc-----ccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc-hhHH Q lcl|NC_016654. 320 GVSLDEEQEVYSRVG-----SGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA-QTAT 393 (533) Q Consensus 320 ~~~~d~~~~~~~~~~-----~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~-~Tat 393 (533) ....+.....|.... ....+ ....++.++......++++..+...+.|+...|+||..+|...+.. .+.. T Consensus 219 ~e~~~~~~~~~~~~~~~~g~~~vl~----~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e 294 (409) T protein:vir:93 219 KEKRQQVLEDFKQYYEENGGILFQE----PGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNE 294 (409) T ss_pred HHHHHHHHHHHHHHhhcCCCeeecC----CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHH Confidence 000000111111100 00111 1123555555556668888888888999999999999998644322 2222 Q ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-CCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCC Q lcl|NC_016654. 394 EASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFP-GKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAAS 472 (533) Q Consensus 394 ai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~-~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S 472 (533) +.. ...+..+|..++..|..-.+..+. .........+.++++.-+-.|..+.++.+.+++.+|+|+ T Consensus 295 ~~~-------------~~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T 361 (409) T protein:vir:93 295 ELN-------------RFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYT 361 (409) T ss_pred HHH-------------HHHHHHHHHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcC Confidence 221 223334455554444322222221 111112234555555556679999999999999999999 Q ss_pred HHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCC--CC--CCCCCCCCCCCCC Q lcl|NC_016654. 473 TKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLP--TE--NDPATDPEAVDEG 532 (533) Q Consensus 473 ~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~--~~--~~~~~~~~~~~d~ 532 (533) .-++.+.+ ++..-+ .-+++ + ...+..|.. .+ ....|.++..++| T Consensus 362 ~NE~R~~~--g~~p~~---ggD~~----------~-~~~n~~~~~~~~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:93 362 INDIREWE--DLPPVE---GGDKP----------L-ISGDLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred HHHHHHHh--CCCCCC---CcCee----------e-ecccccccccchhhcccccCCCCCcCCC Confidence 99977764 232210 00000 0 000111110 00 1122333444444 No 193 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=96.81 E-value=0.00032 Score=39.72 Aligned_cols=377 Identities=10% Similarity=-0.011 Sum_probs=152.6 Q ss_pred hhcCCHHHHHHHHhccCcchhhHHHHHH-HHHHHHHhcccCCCCCccc-ce--eecChHHHHHHHHHHhhcCCCceEeeC Q lcl|NC_016654. 28 WWEGDLDKLATFYGAEGRTSPSGIKART-KAAYEAFHGRTPTATGRAP-KR--YHAPIPGVIAKLSTTELFSEQLKFLDA 103 (533) Q Consensus 28 w~~gd~~~l~~~y~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~g~~~-~~--~~~n~~k~i~~~~a~ll~~e~~~i~~~ 103 (533) -+.|=.+. +.+.+............ .+....++.......|... .+ +...--...++.+|+-+-+=|. .+. T Consensus 1 m~m~~f~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~--~~~ 75 (392) T protein:vir:39 1 MILPILNF---INQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKI--NAE 75 (392) T ss_pred Ccchhhhh---hhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCce--eec Confidence 33342221 11111111100000000 0000111111111111110 01 1222234455666665554443 333 Q ss_pred CCchHHHHHHHHHHhhccH---HHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEec-CCceEEEEEE Q lcl|NC_016654. 104 GKSKEVQARADLIFNTPRF---HSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRW-GRLVAVTFWS 179 (533) Q Consensus 104 ~~~~~~~~~l~~i~~~n~f---~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~-g~~~~v~f~~ 179 (533) .... ...+. +-|.. ..-+...+...+..|.+|+.+..|..+. -+.+..++|+.+-+..+. +.. .+ T Consensus 76 ~~~~--~~l~~---~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~-~~~L~~l~~~~v~~~~~~~~~~---~~-- 144 (392) T protein:vir:39 76 KKKN--QGIID---NPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGA-DMKWEYLRPSQVNTYYFEYENG---MY-- 144 (392) T ss_pred cchh--hhHhh---cCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCc-EEEEEEEcCceeEEEEcCCCce---EE-- Confidence 2221 11121 11111 2334445557788899999888876543 246666777777655322 211 01 Q ss_pred EEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCccc Q lcl|NC_016654. 180 ELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPN 259 (533) Q Consensus 180 ~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~ 259 (533) |+ |...+...+..+.+ +.--+.|+..... T Consensus 145 ----------y~--------------~~~~~~~~~~~~~~--------------------------~~~eiih~~~~~~- 173 (392) T protein:vir:39 145 ----------YN--------------ITFDDPKIEPILQA--------------------------PQSDLIHMKLLSI- 173 (392) T ss_pred ----------EE--------------EEecCcccceeEEE--------------------------ccccEEEecCCCC- Confidence 11 00000000000000 0001233332111 Q ss_pred ccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhC-cceeeechHHhcCCCCccccccCcchhhhhhccccc Q lcl|NC_016654. 260 PEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIG-AGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGG 337 (533) Q Consensus 260 ~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~-~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~ 337 (533) .....|.|.+..+. ..| .++....++... |+.+ ...-++ ...+... ..+.....+..-..+. T Consensus 174 -------~~~~~G~s~i~~~~-~~i-~~~~~~~~~~~~~f~ng~~p~gil-----~~~~~~~--~~~~~~~~~~~~~~~~ 237 (392) T protein:vir:39 174 -------DGGKTGISPLYSLR-RES-KIQRASDRLTISSLNSSLNVPGVL-----TVKGGGL--LSDKDKASRSRSFMKR 237 (392) T ss_pred -------CCccccccHHHHHH-HHH-HHHHHHHHHHHHHHhccCCCceEE-----EeCCCCC--chHHHHHHHHHHHhcc Confidence 11345888776543 455 344444555444 3543 333222 1111100 0011111111111111 Q ss_pred ccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHH Q lcl|NC_016654. 338 FNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHF 413 (533) Q Consensus 338 ~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~ 413 (533) .++++ +....++.++....+.++++..+...++|+...|+||..+|+......+.++. +..+ T Consensus 238 ~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~~~--------------~~f~ 303 (392) T protein:vir:39 238 SRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQI--------------SGMY 303 (392) T ss_pred ccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHH--------------HHHH Confidence 11111 12234555665556678888888888999999999999998644333222222 2233 Q ss_pred HHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHh-CCCCCHHHHHHH Q lcl|NC_016654. 414 GSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYL-HEDWDDERVQEE 492 (533) Q Consensus 414 ~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l-~~~~~dee~~~E 492 (533) +.+|..+++.+..-.+..+.. .+.++.....-.|..+.+..+.+++.+|++++.++.+.+ ..++...|+.+ T Consensus 304 ~~~l~P~~~~ie~~l~~~L~~-------~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r~- 375 (392) T protein:vir:39 304 ASALNRYLRPAISELEYKLSD-------HISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPA- 375 (392) T ss_pred HHHHHHHHHHHHHHHHHhccc-------cccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccch- Confidence 444444444443222222211 122233333345778888999999999999999976643 12343332211 Q ss_pred HHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 493 ADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVD 530 (533) Q Consensus 493 l~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (533) .|. .|+...| ++ ++..| T Consensus 376 -----~e~---l~~~~~G-------d~------~~p~p 392 (392) T protein:vir:39 376 -----PEN---TNKKTTG-------QS------NEPVP 392 (392) T ss_pred -----hcC---CCCCCCC-------CC------CCCCC Confidence 111 1111111 01 11111 No 194 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=96.81 E-value=0.00032 Score=39.72 Aligned_cols=377 Identities=10% Similarity=-0.011 Sum_probs=152.6 Q ss_pred hhcCCHHHHHHHHhccCcchhhHHHHHH-HHHHHHHhcccCCCCCccc-ce--eecChHHHHHHHHHHhhcCCCceEeeC Q lcl|NC_016654. 28 WWEGDLDKLATFYGAEGRTSPSGIKART-KAAYEAFHGRTPTATGRAP-KR--YHAPIPGVIAKLSTTELFSEQLKFLDA 103 (533) Q Consensus 28 w~~gd~~~l~~~y~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~g~~~-~~--~~~n~~k~i~~~~a~ll~~e~~~i~~~ 103 (533) -+.|=.+. +.+.+............ .+....++.......|... .+ +...--...++.+|+-+-+=|. .+. T Consensus 1 m~m~~f~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~--~~~ 75 (392) T protein:vir:10 1 MILPILNF---INQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKI--NAE 75 (392) T ss_pred Ccchhhhh---hhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCce--eec Confidence 33342221 11111111100000000 0000111111111111110 01 1222234455666665554443 333 Q ss_pred CCchHHHHHHHHHHhhccH---HHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEec-CCceEEEEEE Q lcl|NC_016654. 104 GKSKEVQARADLIFNTPRF---HSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRW-GRLVAVTFWS 179 (533) Q Consensus 104 ~~~~~~~~~l~~i~~~n~f---~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~-g~~~~v~f~~ 179 (533) .... ...+. +-|.. ..-+...+...+..|.+|+.+..|..+. -+.+..++|+.+-+..+. +.. .+ T Consensus 76 ~~~~--~~l~~---~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~-~~~L~~l~~~~v~~~~~~~~~~---~~-- 144 (392) T protein:vir:10 76 KKKN--QGIID---NPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGA-DMKWEYLRPSQVNTYYFEYENG---MY-- 144 (392) T ss_pred cchh--hhHhh---cCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCc-EEEEEEEcCceeEEEEcCCCce---EE-- Confidence 2221 11121 11111 2334445557788899999888876543 246666777777655322 211 01 Q ss_pred EEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCccc Q lcl|NC_016654. 180 ELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPN 259 (533) Q Consensus 180 ~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~ 259 (533) |+ |...+...+..+.+ +.--+.|+..... T Consensus 145 ----------y~--------------~~~~~~~~~~~~~~--------------------------~~~eiih~~~~~~- 173 (392) T protein:vir:10 145 ----------YN--------------ITFDDPKIEPILQA--------------------------PQSDLIHMKLLSI- 173 (392) T ss_pred ----------EE--------------EEecCcccceeEEE--------------------------ccccEEEecCCCC- Confidence 11 00000000000000 0001233332111 Q ss_pred ccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhC-cceeeechHHhcCCCCccccccCcchhhhhhccccc Q lcl|NC_016654. 260 PEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIG-AGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGG 337 (533) Q Consensus 260 ~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~-~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~ 337 (533) .....|.|.+..+. ..| .++....++... |+.+ ...-++ ...+... ..+.....+..-..+. T Consensus 174 -------~~~~~G~s~i~~~~-~~i-~~~~~~~~~~~~~f~ng~~p~gil-----~~~~~~~--~~~~~~~~~~~~~~~~ 237 (392) T protein:vir:10 174 -------DGGKTGISPLYSLR-RES-KIQRASDRLTISSLNSSLNVPGVL-----TVKGGGL--LSDKDKASRSRSFMKR 237 (392) T ss_pred -------CCccccccHHHHHH-HHH-HHHHHHHHHHHHHHhccCCCceEE-----EeCCCCC--chHHHHHHHHHHHhcc Confidence 11345888776543 455 344444555444 3543 333222 1111100 0011111111111111 Q ss_pred ccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHH Q lcl|NC_016654. 338 FNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHF 413 (533) Q Consensus 338 ~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~ 413 (533) .++++ +....++.++....+.++++..+...++|+...|+||..+|+......+.++. +..+ T Consensus 238 ~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~~~--------------~~f~ 303 (392) T protein:vir:10 238 SRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQI--------------SGMY 303 (392) T ss_pred ccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHH--------------HHHH Confidence 11111 12234555665556678888888888999999999999998644333222222 2233 Q ss_pred HHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHh-CCCCCHHHHHHH Q lcl|NC_016654. 414 GSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYL-HEDWDDERVQEE 492 (533) Q Consensus 414 ~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l-~~~~~dee~~~E 492 (533) +.+|..+++.+..-.+..+.. .+.++.....-.|..+.+..+.+++.+|++++.++.+.+ ..++...|+.+ T Consensus 304 ~~~l~P~~~~ie~~l~~~L~~-------~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r~- 375 (392) T protein:vir:10 304 ASALNRYLRPAISELEYKLSD-------HISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPA- 375 (392) T ss_pred HHHHHHHHHHHHHHHHHhccc-------cccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccch- Confidence 444444444443222222211 122233333345778888999999999999999976643 12343332211 Q ss_pred HHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 493 ADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVD 530 (533) Q Consensus 493 l~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (533) .|. .|+...| ++ ++..| T Consensus 376 -----~e~---l~~~~~G-------d~------~~p~p 392 (392) T protein:vir:10 376 -----PEN---TNKKTTG-------QS------NEPVP 392 (392) T ss_pred -----hcC---CCCCCCC-------CC------CCCCC Confidence 111 1111111 01 11111 No 195 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=96.74 E-value=0.00037 Score=39.39 Aligned_cols=372 Identities=11% Similarity=0.030 Sum_probs=138.7 Q ss_pred CCCCCCcCCCcCc-chHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeec Q lcl|NC_016654. 1 MSLPEANTAWPPP-ELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHA 79 (533) Q Consensus 1 ~~~~~~~~~~pp~-~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 79 (533) |.|=.... |.++ .-..++.. ++... ..++.+...+.=....-++. T Consensus 1 Mg~~~~~~-~~~~~~~~~~~~~----------~~~~~-----------------------~~~~~~~~~~~v~~~~al~~ 46 (385) T protein:vir:10 1 MGLLTPRN-FNKRKAKNMVYPS----------NPAFF-----------------------TTTVGGMQLSYVSALSALQN 46 (385) T ss_pred Cccccchh-ccccccccccccc----------chhhh-----------------------hhhccccCccccCHHHhhcc Confidence 43332110 1111 10000000 00000 00000000000001112233 Q ss_pred ChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHH-HHHHHhhhCCEEEEEEEcCCCCCceEEEEE Q lcl|NC_016654. 80 PIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVE-AGESCSALSGSFQRIVWDPTIADNAWIDFV 158 (533) Q Consensus 80 n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~-~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v 158 (533) .--..+++.+|+-+.+-|..+.- . .....|++= +...-...+.+ .+...+..|.+|+.+..|. +.++ T Consensus 47 ~~v~~~i~~ia~~ia~~p~~v~~--~--~~~~ll~~P-N~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~~-------~~~~ 114 (385) T protein:vir:10 47 TNVYSVINRIASDVASAHFKTEN--T--ATLNRLESP-SSLIGRFSFWQGALMQLCLSGNDYIPLVGQN-------LEHI 114 (385) T ss_pred HHHHHHHHHHHHHHhhCceeeec--c--chhhhhhcC-CCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc-------eeEe Confidence 33445666666666655544421 1 122222210 01111223333 3334456788888765431 2233 Q ss_pred cCCe--EEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccC Q lcl|NC_016654. 159 DADR--AIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEG 236 (533) Q Consensus 159 ~~~~--~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~ 236 (533) +++. +.+..+.+ ...|. ++.. .+.....++- T Consensus 115 p~~~~~v~~~~~~~----------------~~~~~-------------~~~~-~~~~~~~~~~----------------- 147 (385) T protein:vir:10 115 PNSDVQINYLPGNM----------------GIVYT-------------VLES-NDRPQMVLRQ----------------- 147 (385) T ss_pred ecCCceEEEEEcCC----------------ceEEE-------------EEEc-CCceEEEEcc----------------- Confidence 3322 22211111 11111 1100 0000000100 Q ss_pred CceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhC-cceeeechHHhcC Q lcl|NC_016654. 237 RGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIG-AGKVHASESVLTN 314 (533) Q Consensus 237 ~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~-~~~i~v~~~~l~~ 314 (533) --+.|+....++. + ....|.|.+..+. ..|+ +.....++... ++.| .+..++ .. T Consensus 148 -----------~eiihik~~~~~~-~-----~~~~G~s~i~~~~-~~i~-~~~~~~~~~~~~~~ng~~~~gil-----~~ 203 (385) T protein:vir:10 148 -----------DQMLHFRLMPDPQ-Y-----RYLIGRSPLESLQ-NALN-LDDKASKSNMSAMENQINPAGKL-----TI 203 (385) T ss_pred -----------ccEEEeccCCCCc-c-----cccccccHHHHHH-HHHH-HHHHHHHHHHHHHhccCCcceEE-----Ee Confidence 0122222211110 0 1235777776543 4553 33444444443 3554 333332 11 Q ss_pred CCCc-cccccCcchhhhhhccccccccc----cccccceeeechhhhhHHHH-HHHHHHHHHHHHhhCCChhhcccCCCc Q lcl|NC_016654. 315 LGMG-QGVSLDEEQEVYSRVGSGGFNAN----GDMETIFEFFQPAIRVLEHD-QGAALLLREVLRKTGYSPVSLGLSDEV 388 (533) Q Consensus 315 ~~~~-~~~~~d~~~~~~~~~~~~~~~~~----~~~~~~i~~~~~~ir~e~~~-~~l~~~l~~i~~~~g~s~~~~g~~~~~ 388 (533) .+.. .....+.....+.....+ .+.+ .+....++.++......+++ +..+...++|+...|+||..+|....+ T Consensus 204 ~~~~~~~e~~~~~~~~~~~~~~~-~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~ 282 (385) T protein:vir:10 204 SNYLSDGKDLESAREEFEKANTG-DNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTST 282 (385) T ss_pred CCCCCCHHHHHHHHHHHHHHhCc-cccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCC Confidence 1110 000001111112221111 0111 01112355566655666765 666777889999999999999864322 Q ss_pred chhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhC Q lcl|NC_016654. 389 AQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVA 468 (533) Q Consensus 389 ~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~a 468 (533) ..+...+..... .+ ..+|..+++.+..-.+..+.+ ..+.++++.-+..|..++++.+.+++++ T Consensus 283 ~~~~sn~eq~~~-~~----------~~~l~P~~~~ie~~l~~~l~~------~~~~f~~~~ll~~d~~~~~~~~~~~~~~ 345 (385) T protein:vir:10 283 ESQHSNIDQIKA-TY----------LANLNSYVNPIVDELRLKMNA------PDLELDIKDMLDVDDSALINQVSNLAKS 345 (385) T ss_pred CcccccHHHHHH-HH----------HHHHHHHHHHHHHHHHHhhCC------ceEEeechhhhccCHHHHHHHHHHHHhC Confidence 222222221111 11 112333333322221222221 2466777777889999999999999999 Q ss_pred CCCCHHHHHHHhC-CCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 469 SAASTKTKVAYLH-EDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVD 530 (533) Q Consensus 469 Gi~S~et~v~~l~-~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (533) |+|+..++...+. +.+.++ +++.... ..+...++++.+| T Consensus 346 G~~T~NE~R~~~g~~p~p~~----------------------~~~~~~~-~~~~~~~g~~~dn 385 (385) T protein:vir:10 346 GVLGAEQAQFILTRSGFLPD----------------------NLPEFKP-LTTQVKGGDEGDN 385 (385) T ss_pred CCcCHHHHHHHhCCCccCCC----------------------CCccccC-cccccCCCCCCCC Confidence 9999888665431 111110 0101000 0011111111111 No 196 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=96.73 E-value=0.00038 Score=39.33 Aligned_cols=375 Identities=11% Similarity=0.023 Sum_probs=138.7 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |.+=... -|......+.. .. .+...+..+..+. ...... ....+... T Consensus 1 Mg~~~~~-~~~k~~~~~~~---~~------~~~~~~~~~~~~~----------------------~~~~v~-~~~~l~~~ 47 (383) T protein:vir:10 1 MGLLTPK-NFSKRNAKNMV---YP------SNPAFFTTTVGGM----------------------QLSYVS-ALSALQNT 47 (383) T ss_pred CCccccc-ccccccccccc---cc------cchhhhhhhccCc----------------------cccccc-hhHhhcch Confidence 5553322 11111111000 00 0000000000000 000000 00111122 Q ss_pred hHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcC Q lcl|NC_016654. 81 IPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDA 160 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~ 160 (533) --...++.+|+-+.+-|. .+... .....|++--..-.....+...+...+-.|.+|+.+.-| . +.++++ T Consensus 48 ~v~~~i~~ia~~ia~~~~--~~~~~--~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~-----~--~~~~p~ 116 (383) T protein:vir:10 48 NVYSVINRIASDVSSAHF--KTENT--ATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ-----N--LEHIPN 116 (383) T ss_pred HHHHHHHHHHHhhccCce--eeccc--chhhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC-----c--eeEeec Confidence 223344555554444343 33221 122222211000012223344455555668887765322 1 223333 Q ss_pred CeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCcee Q lcl|NC_016654. 161 DRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAY 240 (533) Q Consensus 161 ~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~ 240 (533) +.+... . .......+|. ++.. .+ |..+.+. T Consensus 117 ~~~~v~-------------~-~~~~~~~~~~-------------~~~~-~~--~~~~~~~-------------------- 146 (383) T protein:vir:10 117 SDVQIN-------------Y-LPGNMGIVYT-------------VLES-ND--RPKMVLR-------------------- 146 (383) T ss_pred CcceEE-------------E-EEcCCceEEE-------------EEEc-CC--ceEEEEc-------------------- Confidence 221111 0 0111111111 0000 00 0000000 Q ss_pred ecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCc-ceeeechHHhcCCCCc Q lcl|NC_016654. 241 VETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGA-GKVHASESVLTNLGMG 318 (533) Q Consensus 241 ~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~-~~i~v~~~~l~~~~~~ 318 (533) .--+.|+.+..++.. ....|.|.+..+. ..|+.. ....++... |+.|. +..++ ...+.. T Consensus 147 ------~~evih~r~~~~~~~------~~~~G~s~l~~~~-~~i~~~-~~~~~~~~~~f~ng~~~~~il-----~~~~~~ 207 (383) T protein:vir:10 147 ------QDQMLHFRLMPDPQY------RYLIGRSPLESLQ-NALNLD-DKASKSNMSAMENQINPAGKL-----TISNYL 207 (383) T ss_pred ------ccceEEeccCCCCcc------cccccccHHHHHH-HHHHHH-HHHHHHHHHHHhccCCcceEE-----EeCCCC Confidence 001222222111100 1235777776543 445433 333334333 45442 22222 111100 Q ss_pred -cccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHH-HHHHHHHHHHHHhhCCChhhcccCCCcchhH Q lcl|NC_016654. 319 -QGVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHD-QGAALLLREVLRKTGYSPVSLGLSDEVAQTA 392 (533) Q Consensus 319 -~~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~-~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Ta 392 (533) ...........+.....+ .++++ +....++.++.+....+++ +..+...++|+...|+||..+|....+..+. T Consensus 208 ~~~e~~~~~~~~~~~~~~~-~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~ 286 (383) T protein:vir:10 208 SDGKDLESAREEFEKANTG-DNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQH 286 (383) T ss_pred CCHHHHHHHHHHHHHHhCc-cccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCcc Confidence 000000111112211111 11111 1223466666666666765 5667778999999999999998643322222 Q ss_pred HHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCC Q lcl|NC_016654. 393 TEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAAS 472 (533) Q Consensus 393 tai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S 472 (533) ..+... ...+..+|..+++.+..-.+..+. ...+.++++..+..|..+.++.+.+++++|+|+ T Consensus 287 sn~eq~-----------~~~~~~~l~P~~~~ie~~l~~~l~------~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t 349 (383) T protein:vir:10 287 SNIDQI-----------KATYLANLNSYVNPIVDELRLKMN------APDLELDIKDMLDVDDSILINQVSNLAKSGVLG 349 (383) T ss_pred ccHHHH-----------HHHHHHHHHHHHHHHHHHHHHhhC------CceEEeechhhhccCHHHHHHHHHHHHhCCCcC Confidence 222111 112222344444333222122222 135778888888899999999999999999999 Q ss_pred HHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCC Q lcl|NC_016654. 473 TKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPE 527 (533) Q Consensus 473 ~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (533) ..++.+.+ ++..- + +............-.+||+| T Consensus 350 ~nE~R~~l--g~~p~-----------------~--~~d~~~~~~~~~~~~gGd~e 383 (383) T protein:vir:10 350 AEQAQFIL--TRSGF-----------------L--PDNLPEFKPLTNETKGGDDK 383 (383) T ss_pred HHHHHHHh--CCCcc-----------------c--CCcccccCCCcccCCCCCCC Confidence 99876654 12110 0 00000000000011122322 No 197 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=96.69 E-value=0.00041 Score=39.17 Aligned_cols=459 Identities=12% Similarity=0.033 Sum_probs=182.4 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |.==+ .....=+.....+..+...|..|...-.++.+|..... +...-...+.+..++--. T Consensus 1 m~~~~-~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~------------------~~~~~~~~~~~~~~~~ds 61 (532) T protein:vir:99 1 MAEVE-KTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSV------------------FPSATADGSTSYTTPWQS 61 (532) T ss_pred Ccchh-hccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcc------------------cCCCCCcchhhccccccc Confidence 32211 11333344556666666666666554444444432210 100011111223445556 Q ss_pred hHHHHHHHHHHhhcCC-----CceEeeCCCc-------------hHHHHH-------HHHHHhhccHHHHHHHHHHHHhh Q lcl|NC_016654. 81 IPGVIAKLSTTELFSE-----QLKFLDAGKS-------------KEVQAR-------ADLIFNTPRFHSSLVEAGESCSA 135 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e-----~~~i~~~~~~-------------~~~~~~-------l~~i~~~n~f~~~~~~~~~~~~~ 135 (533) -+...++.+|+.|.+- .+.|.....+ ..++++ +...+..++|...+.++.....+ T Consensus 62 t~~~a~~~LAa~L~~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~ 141 (532) T protein:vir:99 62 IGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLV 141 (532) T ss_pred hHHHHHHHHHHHHHHhhcCCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHh Confidence 6777777777766543 2344433221 123333 34467778999999999999999 Q ss_pred hCCEEEEEEEcCC-CCCceEEEEEcCCeEEEEEe-cCCceEEEEEEEEeec--------------CCceEEEEEEEecCe Q lcl|NC_016654. 136 LSGSFQRIVWDPT-IADNAWIDFVDADRAIPEFR-WGRLVAVTFWSELAGG--------------DGQEVWRHLERHESG 199 (533) Q Consensus 136 ~G~~~~~~~~D~~-~~~~~~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~~--------------~~~~~y~~lE~h~~~ 199 (533) .|.+.+.+-.++. ......|..++-..++-.-+ +|++..+++-.++... .++..+..++ T Consensus 142 ~G~a~l~~~~~~~~~~~~~~f~~~pl~~y~v~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~p~~~v~----- 216 (532) T protein:vir:99 142 AGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEDAQGDQNPSEEVT----- 216 (532) T ss_pred HCcEeEEecccccccCcccceEEEEcCeEEEeeCCCCCeeeEeeeeeecHHhcChHHHHHhhccccccCCCcceE----- Confidence 9999987654432 23345677788777665544 5788777654443210 0001111111 Q ss_pred eEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhh Q lcl|NC_016654. 200 YIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTD 279 (533) Q Consensus 200 ~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~ 279 (533) |.+.++.-.+ |.+ ...+.+ ....... ....+.+..-+.|++. +|... ..+.||+|-...+ T Consensus 217 -v~~~v~~~~~---~~~--~~~~~~---~~g~~~~-----~~~~~~~~~e~P~~~~-----Rw~~~-~ge~YGrgp~~~~ 276 (532) T protein:vir:99 217 -IYTHVYRDPE---AMV--FRSYQE---IDGEIVA-----GTEGEYPLDSCPWIPV-----RLIKM-PNEDYGRSFVEEY 276 (532) T ss_pred -EEEEEEecCC---CCe--eEEEEe---ecCceec-----ccccccccccCCceee-----eeeec-CCCccccchHHHH Confidence 1122222111 000 000000 0000000 0011111101122222 23322 3477899877766 Q ss_pred HHHHHHHHHHHHHHHHHHH-HhCcceeeech-HHhcCCCCccccccCcchhhhhhccccccccccccccceeee-chhhh Q lcl|NC_016654. 280 LFPTFHELDRIYSSLMRDF-RIGAGKVHASE-SVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFF-QPAIR 356 (533) Q Consensus 280 i~~lid~lD~~~s~~~~~~-~~~~~~i~v~~-~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~-~~~ir 356 (533) + +-+..|+..--...... ...+....|++ ..++...- .+...+.+... .. ++...+... ..++. T Consensus 277 l-~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~-----~~~~~g~~v~g-----~~--~~i~~~~~~~~~~~~ 343 (532) T protein:vir:99 277 L-GDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRV-----AKANTGDFVAG-----RK--QDVEVFQLEKYNDFQ 343 (532) T ss_pred H-HHHHHHHHHHHHHHHHHHHHcCCCceeccccccchhhh-----ccCCCcceecC-----Cc--ccceeeecccccchh Confidence 6 55677776544443322 34444444432 22211100 00111111110 00 000011111 11221 Q ss_pred h-HHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_016654. 357 V-LEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKA-RHFGSALGPLSTTCLRVDAIKFPG 434 (533) Q Consensus 357 ~-e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~-~~~~~al~~li~~il~l~~~~~~~ 434 (533) + .+-++.+..-++.+..... +....+...|||||..+.+......+-.- +.-...|..|+..++.+... .| T Consensus 344 ~~~~~i~~~~~rI~~af~~~~-----~~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r--~g 416 (532) T protein:vir:99 344 VAKATADDIEKRLSYAFMLNS-----AVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQA--TS 416 (532) T ss_pred HHHHHHHHHHHHHHHHHhhhh-----cccCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh--cC Confidence 1 1222333333333332221 12123344699999888777666555422 22233344455555444321 11 Q ss_pred CCC-CCce--eEEEEeCCCCCCCHHHHHHHHHHHHh-----CC-------CCCHHHHHHHh---CC------CCCHHHHH Q lcl|NC_016654. 435 KGA-APSE--ELELEWPKFARESDLAKAQTVQAWSV-----AS-------AASTKTKVAYL---HE------DWDDERVQ 490 (533) Q Consensus 435 ~~~-~~~~--~v~i~f~d~i~~d~~e~a~~~~~l~~-----aG-------i~S~et~v~~l---~~------~~~dee~~ 490 (533) .-+ .+.+ .+.+. .-.+...+++.+..+.+ +. .+....+++.+ .+ --+++|++ T Consensus 417 ~lP~~p~~~~~~~iv----~~is~Laraq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~GV~~~~i~r~~ee~~ 492 (532) T protein:vir:99 417 KIPNLPKEAVEPAIA----TGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQ 492 (532) T ss_pred CCCCCChhhccccee----ecchHHHHHHHHHHHHHHHHHHHhhcchhhhhCCHHHHHHHHHHHhCCChhhccCCHHHHH Confidence 111 1221 22221 11223333333322211 11 12333344332 11 02556666 Q ss_pred HHHHHHHHhhhcccC--ccccccccC-CCCCCCCCCCCCC Q lcl|NC_016654. 491 EEADLIDNANTVSAP--TFGFGTDQP-PLPTENDPATDPE 527 (533) Q Consensus 491 ~El~rI~~E~~~~~~--~~~~~~~~~-~~~~~~~~~~~~~ 527 (533) ++.++.+..+++.+- .++..+.+. -....+..+-+.+ T Consensus 493 ~~~~q~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (532) T protein:vir:99 493 AKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGMPTQ 532 (532) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcchhHHhhcCCCCC Confidence 665544433322110 011111111 1111111222222 No 198 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=96.56 E-value=0.00051 Score=38.64 Aligned_cols=467 Identities=15% Similarity=0.123 Sum_probs=182.8 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccC--CCCCc-cccee Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTP--TATGR-APKRY 77 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~g~-~~~~~ 77 (533) |.= .+ .=+=.++...+.++...|..|...-.++.+|-.. +..+|+...+ ...|. +..++ T Consensus 1 m~~--d~-~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP---------------~~~~~~~~~~~~~~~~~~~~~~~ 62 (549) T protein:vir:10 1 MTN--DD-AKILQALNADHGRMKEKRQSYEAVWNDVIDYLMP---------------RLDKFGQLPRPDSEKGRERSQKM 62 (549) T ss_pred CCc--ch-HHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcc---------------ccccccccCCCCCCccccccccc Confidence 221 11 1111222233333333333333322222221100 0111211111 11122 23355 Q ss_pred ecChHHHHHHHHHHhhcCC--C---ceEeeCCCc------hHHHHHHHH-------HH--hhccHHHHHHHHHHHHhhhC Q lcl|NC_016654. 78 HAPIPGVIAKLSTTELFSE--Q---LKFLDAGKS------KEVQARADL-------IF--NTPRFHSSLVEAGESCSALS 137 (533) Q Consensus 78 ~~n~~k~i~~~~a~ll~~e--~---~~i~~~~~~------~~~~~~l~~-------i~--~~n~f~~~~~~~~~~~~~~G 137 (533) --.-+...++.+|+.|.+- | +.|.....+ ....++|+. ++ ..++|...+.++.....++| T Consensus 63 ~dstg~~a~~~LAs~l~~~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~G 142 (549) T protein:vir:10 63 FDSTAPLALRNFVAAMDSMITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFG 142 (549) T ss_pred ccchHHHHHHHHHHHHHhhccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhc Confidence 6667888888888776553 2 333332221 122334442 22 35789999999999999999 Q ss_pred CEEEEEEEcCCCCCceEEEEEcCCeEEEEEe-cCCceEEEEEEEEeec--------CC-----ceEEEEEEEecCeeEEE Q lcl|NC_016654. 138 GSFQRIVWDPTIADNAWIDFVDADRAIPEFR-WGRLVAVTFWSELAGG--------DG-----QEVWRHLERHESGYIVH 203 (533) Q Consensus 138 ~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~~--------~~-----~~~y~~lE~h~~~~I~~ 203 (533) .+.+++ +++....+++..++-..++..-+ +|++..|+...+++.. +. +..+.. .-++...|.| T Consensus 143 ta~l~~--~~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~-~~~~~~~v~~ 219 (549) T protein:vir:10 143 PGALMI--EHDVGKGIVYRNVPMQRLWFAENNSGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEK-DPEKSAIFYH 219 (549) T ss_pred ceeeEE--eecCCCeeEEEEEEcCeEEEeeCCCCCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhc-CCCceEEEEE Confidence 999775 45555668888888888877655 4777554321111100 00 000000 0012233444 Q ss_pred EEEeccCCc---c-cceeehhhccccccccccccccCCceee-cCCCccceeEEecCCcccccccccccccccccchhhh Q lcl|NC_016654. 204 AVYKGTATS---L-GWMMALTDHPATRDIAVEGADEGRGAYV-ETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLST 278 (533) Q Consensus 204 ~~y~~~~~~---l-G~~v~l~~~~~~~~~~~~~~~~~~~~~~-~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~ 278 (533) .+|...+.. . +.-.|+..+ +...+....+ ++|... +.|++. +|... ..+.||+|-... T Consensus 220 ~V~pr~~~~~~~~~~~~~pf~sv---------~~e~~~~~il~esg~~e--~P~~~~-----Rw~~~-~ge~YGrgp~~~ 282 (549) T protein:vir:10 220 AVEPRADRDPRKLDGRNMQFASY---------WLDEGRDRIVQNSGFRT--FPFAIG-----RFYVG-TDDVYGGSPAYD 282 (549) T ss_pred EeecCCCCCccccccccCceEEE---------EEEecCCEeeccCCccc--CCccee-----eeeec-CCCccccchHHH Confidence 455322211 1 111122111 1111111112 222211 122222 24333 246789998777 Q ss_pred hHHHHHHHHHHHHHHHHHHH-HhCcceeeechHHh-cCCCCccccccCcchhhhhhccccccccccccccceeee--chh Q lcl|NC_016654. 279 DLFPTFHELDRIYSSLMRDF-RIGAGKVHASESVL-TNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFF--QPA 354 (533) Q Consensus 279 ~i~~lid~lD~~~s~~~~~~-~~~~~~i~v~~~~l-~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~--~~~ 354 (533) ++ +-+..|+..--...... ...++.+.||++.. .+.. ..+....+.. ...++ ...+..+ ..+ T Consensus 283 ~l-~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~------l~pgg~~~~~-----~~~~~--~~~~~pl~~~~~ 348 (549) T protein:vir:10 283 AM-PDVRMANDMAKTNIRGAQKLVDPPLLANEDGVLDGFD------LRSGALNWGG-----LNDKG--EEMVKPLLTGKQ 348 (549) T ss_pred HH-HHHHHHHHHHHHHHHHHHHHhcCceeeccccccccce------eccCCccccc-----cCCCC--ccceeeeccccc Confidence 66 55678877665555554 34566666665432 2111 1111111111 00111 1111111 112 Q ss_pred hhh-HHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhc Q lcl|NC_016654. 355 IRV-LEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHF-GSALGPLSTTCLRVDAIKF 432 (533) Q Consensus 355 ir~-e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~-~~al~~li~~il~l~~~~~ 432 (533) +.+ .+-++.++.-++.+.+..-|.. . ..+...|||||..+.+......+-.-..+ ...|.-+|..++++... T Consensus 349 ~~~~~~~i~~~~~rI~~af~~d~~~~--~--~~~~~~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r-- 422 (549) T protein:vir:10 349 AQIGIEFAQDTRQTINQWFYVTLFQI--L--VDSGDMTATEVLQRAQEKGVLLAPTLGRTQSELLGPMIAREVDILAE-- 422 (549) T ss_pred hhHHHHHHHHHHHHHHHHHhhhhhhh--h--cCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-- Confidence 211 1223333333333332221111 1 12344799999998888777655533333 34555666655555332 Q ss_pred cCCCCC-------CceeEEEEeCCCCCCC-HHHHH-------HHHHHHHhCC-----CCCHHHHHHHh---CC-----CC Q lcl|NC_016654. 433 PGKGAA-------PSEELELEWPKFARES-DLAKA-------QTVQAWSVAS-----AASTKTKVAYL---HE-----DW 484 (533) Q Consensus 433 ~~~~~~-------~~~~v~i~f~d~i~~d-~~e~a-------~~~~~l~~aG-----i~S~et~v~~l---~~-----~~ 484 (533) .|..+. +...+.|++--.+... ..+.+ +.+..+-+.+ .+..+.+++.+ .+ -. T Consensus 423 ~g~lP~~p~~l~~~~~~~~i~yis~La~aq~~~~~~~i~~~~~~~~~laq~~Pe~ld~id~d~~~~~~a~~~Gvp~~~ir 502 (549) T protein:vir:10 423 AGQLPDMPQELIDAGADVDVEYDSPLNKAMRAGEGAAILQWLQQLGIVSQFDPAAAKVPNGARIARLLADYGGVPVEAMS 502 (549) T ss_pred cCCCCCCChhhhcCCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhcCCHHHHHHHHHHhcCCCccccC Confidence 122111 1224556553332221 01111 1111111111 12333333332 11 02 Q ss_pred CHHHHHHHHHHHHHhhh-----cccCcccccc-ccCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 485 DDERVQEEADLIDNANT-----VSAPTFGFGT-DQPPLPTENDPATDPEAVDE 531 (533) Q Consensus 485 ~dee~~~El~rI~~E~~-----~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~d 531 (533) +++|+++..+.-++.+. ++.+....+. +..+. .+..+.--. T Consensus 503 s~eev~~~r~~~~~qqq~~~~~~~a~~a~~~a~~~~~~------~ta~~~~~~ 549 (549) T protein:vir:10 503 TDEELQAQQAAEAQAAQMQQMLAAAPVAAGAIKDLSDA------QTAAQTARV 549 (549) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh------cCCCcccCC Confidence 55665543321111111 1111111111 11110 000000000 No 199 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=96.55 E-value=0.00052 Score=38.60 Aligned_cols=336 Identities=9% Similarity=-0.004 Sum_probs=133.7 Q ss_pred HHHhhcCCCceEeeCCCchHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeE Q lcl|NC_016654. 89 STTELFSEQLKFLDAGKSKEVQARADLIFNT--P---RFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRA 163 (533) Q Consensus 89 ~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~--n---~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~ 163 (533) +|+ =|..+.-. .+....-+.++|.. | .-..-+...+...+..|.+|+.+..|..|. -+.+-.++|+.+ T Consensus 1 ia~----lp~~~~~~--~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~-~~~L~~l~~~~v 73 (348) T protein:vir:93 1 MAS----LPLKMYED--YKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQ-PSKLFLLNPDVV 73 (348) T ss_pred Ccc----cceEeEec--CcCcccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc-EEEEEEEcCCce Confidence 111 12221110 01111112222221 1 111223334445677899999888776543 234555666666 Q ss_pred EEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecC Q lcl|NC_016654. 164 IPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVET 243 (533) Q Consensus 164 ~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~ 243 (533) -++.+.+. +.+.|.++... |..+.+ T Consensus 74 ~~~~~~~~---------------------------~~~~y~~~~~~----g~~~~~------------------------ 98 (348) T protein:vir:93 74 EMLIENQS---------------------------RELYYSIHAAT----GNKLIV------------------------ 98 (348) T ss_pred EEEEeCCC---------------------------cEEEEEEEcCC----CeEEEE------------------------ Confidence 55433210 11111111111 111100 Q ss_pred CCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcce-eeechHHhcCCCCccccc Q lcl|NC_016654. 244 GVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGK-VHASESVLTNLGMGQGVS 322 (533) Q Consensus 244 g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~-i~v~~~~l~~~~~~~~~~ 322 (533) +.--+.|+.+..+. ...+|.|.+..+ ...++.... ...+ .+..++.. .++ +...+.-.... T Consensus 99 --~~~eiih~r~~~~~--------~~~~G~s~~~~~-~~~i~~~~~-~~~~--~~~~~~~~~~~i----~~~~~~l~~e~ 160 (348) T protein:vir:93 99 --HNMDMLHFKHIVAS--------NMVQGISPIDVL-KNTTDFDNA-VRTF--NLTEMQKPDSFM----LKYGSNVSTEK 160 (348) T ss_pred --ccccEEEecCCCCC--------CceeeccHHHHH-HHHHHHHHH-HHHH--HHHhcCCCceeE----EecCCCCCHHH Confidence 00012333322111 123466665443 233332221 2222 23322221 111 11111100000 Q ss_pred cCcchhhhhhccc---cccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc-hhHHHHHHH Q lcl|NC_016654. 323 LDEEQEVYSRVGS---GGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA-QTATEASGK 398 (533) Q Consensus 323 ~d~~~~~~~~~~~---~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~-~Tatai~~~ 398 (533) .+.....|..... ...-. +....++.++......++++..+...+.|+...|+||..+|...++. .+..+. T Consensus 161 ~~~~~~~~~~~~~n~~~~~vl--~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~e~~--- 235 (348) T protein:vir:93 161 RQQVLEDFKQYYEENGGILFQ--EPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEEL--- 235 (348) T ss_pred HHHHHHHHHHHhhcCCCeeec--CCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH--- Confidence 0000111111110 00000 11223555665556668888888889999999999999998544322 222222 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-cCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHH Q lcl|NC_016654. 399 KDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKF-PGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKV 477 (533) Q Consensus 399 ~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~-~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v 477 (533) ....++.+|.-+++.+-...+..+ ..........+.++++.-.-.|..++++.+.+++.+|+|+..++. T Consensus 236 ----------~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R 305 (348) T protein:vir:93 236 ----------NRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIR 305 (348) T ss_pred ----------HHHHHHHHHHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHH Confidence 112233344444444322222221 111111233456666666678999999999999999999999977 Q ss_pred HHhCCCCCHHH-HHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 478 AYLHEDWDDER-VQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEG 532 (533) Q Consensus 478 ~~l~~~~~dee-~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 532 (533) +.+ ++..-+ .++-+ +. ....| . +.+..... ...|.++..++| T Consensus 306 ~~~--g~~p~~ggD~~~--~~---~n~~~-~----~~~~~~~~-~~~gg~~n~~~~ 348 (348) T protein:vir:93 306 EWE--DLPPVEGGDKPL--IS---GDLYP-I----DTPLELRK-SLKGGDKNVNES 348 (348) T ss_pred HHh--CCCCCCCcCeEe--ec---ccccc-c----ccchhhcc-cccCCCCCcCCC Confidence 764 232200 00000 00 00000 0 00000000 012222333333 No 200 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=96.52 E-value=0.00055 Score=38.46 Aligned_cols=396 Identities=9% Similarity=-0.004 Sum_probs=150.3 Q ss_pred HHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCc-ccceeecChHHHHHHHHHHhhcCCCceE Q lcl|NC_016654. 22 VAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGR-APKRYHAPIPGVIAKLSTTELFSEQLKF 100 (533) Q Consensus 22 ~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~-~~~~~~~n~~k~i~~~~a~ll~~e~~~i 100 (533) |....+|=+=...-|.+.-+. .... .+. ...|..+ .-.+- .+..+...--...++.+|+-+-.=|..+ T Consensus 1 ~~~~~~~~~~k~~~~~~~~~~-~~~~------~~~--~~~~~~~--~~~~v~~~~a~~~~~V~~ci~~ia~~ia~lp~~~ 69 (409) T protein:vir:96 1 MAKENIVTRIKKKLIDNWIDQ-SASK------LYD--FSPWKNK--SFWGVINNTLETNETIFSAITKLSNSMASLPLKM 69 (409) T ss_pred CccccchhhhhhHHhhhhhcc-cccc------ccc--cccccCc--cccccchhhHhhhHHHHHHHHHHHHhhhhCceEE Confidence 333333211000000000000 0000 000 0000000 00000 0111121222333344444443334322 Q ss_pred eeCCCchHHHHHHHHHHhh--cc---HHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEE Q lcl|NC_016654. 101 LDAGKSKEVQARADLIFNT--PR---FHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAV 175 (533) Q Consensus 101 ~~~~~~~~~~~~l~~i~~~--n~---f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v 175 (533) --. .+.....+..+|.. |. -..-....+...+..|.+|+.+..|..+. -+.+-.++|+++-++.+.+. T Consensus 70 ~~~--~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~-~~~L~~l~~~~v~v~~~~~~---- 142 (409) T protein:vir:96 70 YED--YKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQ-PSKLFLLNPDVVEMLIENQS---- 142 (409) T ss_pred eec--ccccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCc-EEEEEEEcCceeEEEEeCCC---- Confidence 111 11111223333321 21 12223445566678899999888776543 24555567776665543211 Q ss_pred EEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecC Q lcl|NC_016654. 176 TFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPN 255 (533) Q Consensus 176 ~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn 255 (533) . .+.|.++... |.++.+. .--+.|+.. T Consensus 143 ----------~-------------~~~y~~~~~~----g~~~~~~--------------------------~~evih~r~ 169 (409) T protein:vir:96 143 ----------R-------------ELYYSIHAAT----GNKLIVH--------------------------NMDMLHFKH 169 (409) T ss_pred ----------c-------------EEEEEEEcCC----ceEEEEc--------------------------cccEEEeCC Confidence 0 0111111110 1111000 001223321 Q ss_pred CcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcc-eeeechHHhcCCCCccccccCcchhhhhhcc Q lcl|NC_016654. 256 VTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAG-KVHASESVLTNLGMGQGVSLDEEQEVYSRVG 334 (533) Q Consensus 256 ~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~-~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~ 334 (533) ..+ + ...+|.|.+.. +...++. ......+ .+..++. .-++ +.....-.....+.....|.... T Consensus 170 ~~~---~-----~~~~G~s~l~~-~~~~i~~-~~~~~~~--~~~~~~~~~~~i----~~~~~~l~~e~~~~~~~~~~~~~ 233 (409) T protein:vir:96 170 IVA---S-----NMVQGISPIDV-LKNTTDF-DNAVRTF--NLTEMQKPDSFM----LKYGSNVSTEKRQQVLEDFKQYY 233 (409) T ss_pred CCC---C-----CccccccHHHH-HHHHHHH-HHHHHHH--HHHhcCCCceeE----EecCCCCCHHHHHHHHHHHHHHh Confidence 111 0 12346776654 2344432 2222222 1322221 1111 11111000000011111121110 Q ss_pred cccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc-hhHHHHHHHhhhHHHHHHHH Q lcl|NC_016654. 335 SGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA-QTATEASGKKDLTVKTTRAK 409 (533) Q Consensus 335 ~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~-~Tatai~~~~~~l~~~~~~~ 409 (533) .++++ +....++.++......++++..+...++|+...|+||..+|....+. .+.++. . T Consensus 234 ---~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~-------------~ 297 (409) T protein:vir:96 234 ---EENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEEL-------------N 297 (409) T ss_pred ---hcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH-------------H Confidence 01111 11223556666667778888888888999999999999998644322 223322 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcc-CCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHH Q lcl|NC_016654. 410 ARHFGSALGPLSTTCLRVDAIKFP-GKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDER 488 (533) Q Consensus 410 ~~~~~~al~~li~~il~l~~~~~~-~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee 488 (533) ...++.+|.-++..+-...+..+. .........+.++.+.-+-.|..+.++.+.+++.+|+|+.-++.+.+ ++.+-+ T Consensus 298 ~~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~--g~~pi~ 375 (409) T protein:vir:96 298 RFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWE--DLPPVE 375 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHh--CCCCCC Confidence 233344444444444322222221 11111223455555566677999999999999999999999976654 222200 Q ss_pred HHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 489 VQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEG 532 (533) Q Consensus 489 ~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 532 (533) .-+++-- +....| . +.+..... ...|+++..++| T Consensus 376 ---ggD~~~~-~~n~~~-~----~~~~~~~~-~~~gG~~n~~e~ 409 (409) T protein:vir:96 376 ---GGDKPLI-SGDLYP-I----DTPLELRK-SLKGGDKNVNES 409 (409) T ss_pred ---Ccceeee-cccccc-c----ccchhhcc-cccCCCCCcCCC Confidence 0000000 000000 0 00000000 122333444444 No 201 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=96.51 E-value=0.00056 Score=38.42 Aligned_cols=388 Identities=9% Similarity=-0.013 Sum_probs=154.0 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCccc---cee Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAP---KRY 77 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~---~~~ 77 (533) |-+- ++ .+...... .-...++...++.+.++..|... .-+ T Consensus 1 m~~~-------~~--------------------------~~~~~~~~----~~~~~~~~~~~~g~~~s~~~~~v~~~~al 43 (419) T protein:vir:80 1 MFFS-------RQ--------------------------LLSNLGQT----QPGSGGWVSALLGSARSEAGQVVTPASAL 43 (419) T ss_pred CCcc-------cc--------------------------cccccCcC----CCCcchhhHHhhcccccccCcccChHHhh Confidence 1111 00 00000000 00001122233333332222111 112 Q ss_pred ecChHHHHHHHHHHhhcCCCceEeeCC-Cc--hHHHHHHHHHHhh--c--c-HHHHHHHHHHHHhhhCCEEEEEEEcCCC Q lcl|NC_016654. 78 HAPIPGVIAKLSTTELFSEQLKFLDAG-KS--KEVQARADLIFNT--P--R-FHSSLVEAGESCSALSGSFQRIVWDPTI 149 (533) Q Consensus 78 ~~n~~k~i~~~~a~ll~~e~~~i~~~~-~~--~~~~~~l~~i~~~--n--~-f~~~~~~~~~~~~~~G~~~~~~~~D~~~ 149 (533) .+.---..++.+|+-+-+=|..+--.. +. ...+..+..+|.. | . ...-....+...+..|.+|+.+..|..| T Consensus 44 ~~~~v~~cv~~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G 123 (419) T protein:vir:80 44 SLTVLQNCVTLLAESIAQLPVELYERSGDDRKPATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDG 123 (419) T ss_pred ccHHHHHHHHHHHHhhccCceEEEEecCCCcccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC Confidence 222233455555555554454432111 11 1111224444431 1 1 1123344455667789999888877654 Q ss_pred CCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccc Q lcl|NC_016654. 150 ADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIA 229 (533) Q Consensus 150 ~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~ 229 (533) . -..+-.++|+.+.+..+.+ ...+| .+ .+. . .++ T Consensus 124 ~-~~~L~~i~~~~v~i~~~~~---------------~~~~y-------------~~-~~~--~---~~~----------- 157 (419) T protein:vir:80 124 V-IQGLYPLDNEAVTVMKGPD---------------LKPMY-------------RV-AGA--D---PLP----------- 157 (419) T ss_pred c-EEEEEEecCceEEEEECCC---------------ceEEE-------------EE-cCc--c---ccc----------- Confidence 2 2345566776665542211 11111 11 000 0 000 Q ss_pred ccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCc-ceeee Q lcl|NC_016654. 230 VEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGA-GKVHA 307 (533) Q Consensus 230 ~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~-~~i~v 307 (533) .-.+.|+.+.. + ...+|.|.+..+. ..|+ ++....++... |+.|. ..-++ T Consensus 158 -----------------~~~i~h~~~~~----~-----d~~~G~s~i~~~~-~~i~-~~~~~~~~~~~~f~ng~~~~gil 209 (419) T protein:vir:80 158 -----------------QRLVHHVRWMS----I-----NGYTGLSPVLLHA-NAIG-HAQAIQQYAGKSFMNGTALSGVI 209 (419) T ss_pred -----------------hhheEEecCCC----C-----CCcccccHHHHHH-HHHH-HHHHHHHHHHHHHhcCCCccEEE Confidence 00123333211 1 1245777766543 3442 33334444444 35433 33222 Q ss_pred chHHhcCCCCcccc----ccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCCh Q lcl|NC_016654. 308 SESVLTNLGMGQGV----SLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSP 379 (533) Q Consensus 308 ~~~~l~~~~~~~~~----~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~ 379 (533) ...+..... ..+.....+.....+..+.++ +....++.++......++++..+...++|+...|++| T Consensus 210 -----~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp 284 (419) T protein:vir:80 210 -----ERPTDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQEGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPA 284 (419) T ss_pred -----EecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCH Confidence 111111000 000001111111111111100 1112345555555666778888888899999999999 Q ss_pred hhcccCCCcch-hHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHH Q lcl|NC_016654. 380 VSLGLSDEVAQ-TATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAK 458 (533) Q Consensus 380 ~~~g~~~~~~~-Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~ 458 (533) ..+|...++.- ++.+.. ...++..|.-++..+..-.+..+........+.+.++++.-...|..++ T Consensus 285 ~llg~~~~~t~~n~e~~~-------------~~f~~~~l~P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~~~d~~~~ 351 (419) T protein:vir:80 285 HMVNELERATFSNIEHQS-------------LQFVIYTLLPWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSR 351 (419) T ss_pred HHhcCCCCCCcccHHHHH-------------HHHHHHHHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHH Confidence 99986543322 222221 2233334444444332222222211111223456666667777899999 Q ss_pred HHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccC-ccccccccCCCCCCCCCCCC--CCCCCCCC Q lcl|NC_016654. 459 AQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAP-TFGFGTDQPPLPTENDPATD--PEAVDEGE 533 (533) Q Consensus 459 a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~-~~~~~~~~~~~~~~~~~~~~--~~~~~d~~ 533 (533) ++.+.+++.+|+|+..++.+.+ ++..-+--.++ .+| .+.. .+.+. ..++++ +.+..-.| T Consensus 352 ~~~~~~~~~~G~~T~NE~R~~~--g~~p~~gGD~~---------~~~~n~~~-~~~~~----~~~~~~~~~~~~~~~~ 413 (419) T protein:vir:80 352 YAAYAVGRQWGWLSINDIRRLE--NMPPVKGGDIY---------LSPMNMVD-ASKPQ----PIPMGKTEPTKAALDE 413 (419) T ss_pred HHHHHHHHhCCCcCHHHHHHHh--CCCCCCCccee---------eecccccc-ccccc----cccCCCCCchhhhHHH Confidence 9999999999999999976654 22210000000 001 0000 00110 011111 11111122 No 202 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=96.49 E-value=0.00058 Score=38.34 Aligned_cols=386 Identities=11% Similarity=0.027 Sum_probs=155.6 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcc---cCCCCCccc--- Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGR---TPTATGRAP--- 74 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~g~~~--- 74 (533) |-++. |..+. .+..-.... +..++.+ .++..|... T Consensus 1 m~~~~----------------------~~~~~---------~~~~s~~~~--------w~~~~~~~~~~~~~~g~~vt~~ 41 (421) T protein:vir:10 1 MFIPQ----------------------MFEGK---------KRSVSGGGF--------WEAMLGGVRSSHSKAGVMITPE 41 (421) T ss_pred CCCcc----------------------hhccc---------ccccCcchh--------hHHHhhhhccCcccCCceechH Confidence 33321 00000 000000000 1111111 111112111 Q ss_pred ceeecChHHHHHHHHHHhhcCCCceEe-eCCCc--h-HHHHHHHHHHhh--cc---HHHHHHHHHHHHhhhCCEEEEEEE Q lcl|NC_016654. 75 KRYHAPIPGVIAKLSTTELFSEQLKFL-DAGKS--K-EVQARADLIFNT--PR---FHSSLVEAGESCSALSGSFQRIVW 145 (533) Q Consensus 75 ~~~~~n~~k~i~~~~a~ll~~e~~~i~-~~~~~--~-~~~~~l~~i~~~--n~---f~~~~~~~~~~~~~~G~~~~~~~~ 145 (533) .-+...--...++.+|+-+-+=|..+- ...+. . .....+..+|.. |. ........+......|.+|+.+.. T Consensus 42 ~al~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r 121 (421) T protein:vir:10 42 TALALSAVRACVTLLAESVAQLPVELYRRDKNGGRQRATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDR 121 (421) T ss_pred HhhccHHHHHHHHHHHHhhccCceEEEEEcCCCceeecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEE Confidence 011121222344445544444343331 11010 0 011123333321 11 122233445566778999988887 Q ss_pred cCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccc Q lcl|NC_016654. 146 DPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPAT 225 (533) Q Consensus 146 D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~ 225 (533) |..+. -+.+-.++|+++.++.+.. +..+| .++ .. |..++.+ T Consensus 122 ~~~G~-~~~L~~l~~~~v~v~~~~~---------------g~~~y-------------~~~-~~----g~~~~~~----- 162 (421) T protein:vir:10 122 DGKGY-PKELIPINPKKVIVLKGPD---------------GMPYY-------------EIP-EI----GETLPMR----- 162 (421) T ss_pred cCCCc-EEEEEEecCceEEEEECCC---------------ceEEE-------------EEc-CC----CcEEchh----- Confidence 76543 2345556677666543221 11111 111 00 1111111 Q ss_pred ccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHH-HhC-cc Q lcl|NC_016654. 226 RDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDF-RIG-AG 303 (533) Q Consensus 226 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~-~~ 303 (533) -+.|+.+.. +. ...|.|.+..+ ...| .++....++...+ +.| +. T Consensus 163 -----------------------eiih~~~~~----~d-----~~~G~spi~~~-~~~i-~~~~~~~~~~~~~f~ng~~~ 208 (421) T protein:vir:10 163 -----------------------MMHHVKVFS----LD-----GYIGSSPIQTN-ADVL-GLNLAVEEHASAVFRRGATM 208 (421) T ss_pred -----------------------hEEEecCcC----CC-----CcccccHHHHH-HHHH-HHHHHHHHHHHHHHhcCCCc Confidence 122222211 11 23477776653 3444 3344555555443 543 33 Q ss_pred eeeechHHhcCCCCcccc-c---cCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhh Q lcl|NC_016654. 304 KVHASESVLTNLGMGQGV-S---LDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKT 375 (533) Q Consensus 304 ~i~v~~~~l~~~~~~~~~-~---~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~ 375 (533) ..++ ...+...+. . .+.....|.....+..+.++ +....++.++......++++..+...++|+... T Consensus 209 ~gil-----~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~f 283 (421) T protein:vir:10 209 SGVI-----ERPKEAPAIKSQEKIDQLLAKWTDRYSGINNMFSVALLQEGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLY 283 (421) T ss_pred cEEE-----EecCccCccCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEecCCChhHHHHHHHHHHhHHHHHHHh Confidence 3332 111111000 0 00001111111111111111 222346677777777788888888999999999 Q ss_pred CCChhhcccCCCcch-hHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCC Q lcl|NC_016654. 376 GYSPVSLGLSDEVAQ-TATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARES 454 (533) Q Consensus 376 g~s~~~~g~~~~~~~-Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d 454 (533) |+||..+|+...+.- +.++. ....++.+|..++..+..-.+..+..........+.++.+.-+..| T Consensus 284 gVPp~~lg~~~~~t~sn~e~~-------------~~~f~~~tl~P~~~~ie~~ln~kL~~~~~~~~~~v~fd~~~l~~~d 350 (421) T protein:vir:10 284 KIPPHMVQMLAKATNNNIEHQ-------------GLQFVMYTLLAWLKRHEGALQRDLLLPSERRDLYIEFNVSGLLRGD 350 (421) T ss_pred CCCHHHcCCCcCCccccHHHH-------------HHHHHHHHHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhccC Confidence 999999986543321 22222 1233344455555444332222221111112334556666667789 Q ss_pred HHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccC-ccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 455 DLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAP-TFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 455 ~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ..++++.+.+++++|+|+..++.+.+ ++..-+--.++ - .| .+.. .+.. .+++ +.+..+..+| T Consensus 351 ~~~~~~~~~~~~~~G~~T~NE~R~~~--gl~p~~ggD~~---~------~~~n~~~-~~~~---~~~~--~~~~~~~~~e 413 (421) T protein:vir:10 351 QKSRYESYALGRQWGWLSVNDIRRME--NLPPIAGGDKY---L------TPLNMVD-SAQI---IPGD--KKPTAQQMAE 413 (421) T ss_pred HHHHHHHHHHHHhCCCcCHHHHHHHh--CCCCCCCccee---e------ecccccc-cccc---ccCC--CCcccccCcc Confidence 99999999999999999999977764 22210000010 0 00 0000 0000 0001 1111111111 No 203 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=96.46 E-value=0.0006 Score=38.24 Aligned_cols=434 Identities=11% Similarity=0.036 Sum_probs=158.4 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHH--------HHHHHHhcccCCCC-- Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTK--------AAYEAFHGRTPTAT-- 70 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~-- 70 (533) |.+- .++--++..- + .+....-+.++|.+..+ ...+..+.+-+-.. T Consensus 1 ~~~~--------~~~~~~~~~~--~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 56 (535) T protein:vir:10 1 MAIL--------KDLRNAFSLS--N--------------KKSTSYIELGDYDKDIVNKAIRPGRASARDTVDGIDIADGN 56 (535) T ss_pred Chhh--------HHHHHHHHhh--h--------------hhhhhhHHHhhhhHHHHHhhhhhhhhhhhccccccccccCC Confidence 1111 0000000000 0 00001111111111110 01111221111101 Q ss_pred --Cccc-----c--------e--eecChHHHHHHHHHHhhc-------------CCCceEee---CCCc--hHHHHHHHH Q lcl|NC_016654. 71 --GRAP-----K--------R--YHAPIPGVIAKLSTTELF-------------SEQLKFLD---AGKS--KEVQARADL 115 (533) Q Consensus 71 --g~~~-----~--------~--~~~n~~k~i~~~~a~ll~-------------~e~~~i~~---~~~~--~~~~~~l~~ 115 (533) |... + + ....+.+.+++..++.+. +=+..+-- .... ......|.. T Consensus 57 ~~g~~~~~~~~~~~~~~~l~~~~~~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~ 136 (535) T protein:vir:10 57 VAGQYSVASISDVLSTKKLLKAYADNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKRAHEIED 136 (535) T ss_pred cccccccCccccccCHHHHHHHhccChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhhhhHHHH Confidence 1000 0 0 011222344444443321 11222111 1111 111223444 Q ss_pred HHhh--cc------HH-HHHHHHHHHHhhhCC-EEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCc-eEEEEEEEEeec Q lcl|NC_016654. 116 IFNT--PR------FH-SSLVEAGESCSALSG-SFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRL-VAVTFWSELAGG 184 (533) Q Consensus 116 i~~~--n~------f~-~~~~~~~~~~~~~G~-~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~-~~v~f~~~~~~~ 184 (533) +|.. |. |. ..+..++..++.+|+ +|+.+..|..+. -+.+-.++|+++.+..+...- ....|+. T Consensus 137 lL~~~PN~~~~~~~~~~~~~~~lv~d~l~~~g~ay~~i~r~~~G~-~~~L~~l~p~~V~v~~d~~~~~~~~~~~~----- 210 (535) T protein:vir:10 137 FIYNTGSEYYEWRDTFPRLLTKIINDMYVQDQINIERIFKNDSNE-LDHFNAVDASKVVISYSPRSKDQPRKFEQ----- 210 (535) T ss_pred HHHhCCCCCCChhHHHHHHHHHHHHHHHhhCCceEEEEEECCCCc-EEEEEEeCCceeEEEEcCccccCceEEEE----- Confidence 4431 22 22 344455566666665 577777765543 245777888888776542111 0111110 Q ss_pred CCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccc Q lcl|NC_016654. 185 DGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRH 264 (533) Q Consensus 185 ~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~ 264 (533) |.. +..+..++-+ + .++++-|..+ +. T Consensus 211 ---------------------~~~--~~~~~~~~~~----------e-----------------iih~~~~~~~--~~-- 236 (535) T protein:vir:10 211 ---------------------FVS--ETKSVKFSER----------N-----------------LTFINYWNLS--DT-- 236 (535) T ss_pred ---------------------Eec--CceeEEECcc----------c-----------------EEEEeccCCC--Cc-- Confidence 100 0000001000 0 0112111110 00 Q ss_pred cccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCcceeeechHHhcCCCC-ccccc---cCcchhhhhhccccccc Q lcl|NC_016654. 265 DPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGAGKVHASESVLTNLGM-GQGVS---LDEEQEVYSRVGSGGFN 339 (533) Q Consensus 265 ~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~~~i~v~~~~l~~~~~-~~~~~---~d~~~~~~~~~~~~~~~ 339 (533) ....+|.|.+..+. ..| .+.....++... |+.|...-.| |...+. ..... .+.....+.....+..+ T Consensus 237 --~~~~~G~Spi~~~~-~~i-~~~~aa~~~~~~~f~ng~~p~gi----L~~~~~~~~~ls~e~~e~lk~~~~~~~~G~~n 308 (535) T protein:vir:10 237 --DRRGYGYSPVEASI-PLI-RAIYDTEQFNARFFSQGGTTRGI----LVIDQDGDAQANQMMLAGIRRQWTSQGSGLGG 308 (535) T ss_pred --ccccccccHHHHHH-HHH-HHHHHHHHHHHHHHhccCCccEE----EEecCCCCcccCHHHHHHHHHHHHHHhcCccc Confidence 11346888776543 444 334455555544 3554322111 211111 00010 01111112111111111 Q ss_pred ccc-----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhH--HHHHHHhhhHHHHHH-HHHH Q lcl|NC_016654. 340 ANG-----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTA--TEASGKKDLTVKTTR-AKAR 411 (533) Q Consensus 340 ~~~-----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Ta--tai~~~~~~l~~~~~-~~~~ 411 (533) ++. +....++.++......++++..+...++|+...|++|..+|+...+.-+. ..-...+.. +++ .... T Consensus 309 ag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~~~~~s---~~E~~~~~ 385 (535) T protein:vir:10 309 AWKIPILAAKDAKFVNMTQNSRDMEFDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGKSGTKSVNEGS---TAKAKLES 385 (535) T ss_pred ccccccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccccCcccccchhhhhhhhhh---hHHHHHHH Confidence 111 01223455666667778899999999999999999999999755432211 111111111 112 2233 Q ss_pred HHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhC-CCCCHHHHH Q lcl|NC_016654. 412 HFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLH-EDWDDERVQ 490 (533) Q Consensus 412 ~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~-~~~~dee~~ 490 (533) .++.+|..+++.+....+..+... ....+.+.|+.....|..++++..... .+|.|+..++.+++. |.+..-++- T Consensus 386 ~~~~~L~P~l~~ie~~ln~~Ll~~---~~~~~~f~f~~l~~~d~~~r~~~~~~~-~~g~lT~NE~R~~~gl~piegGD~~ 461 (535) T protein:vir:10 386 SKDKGLTPLLSFIEQVINDKIMRY---VDTDYRFSFTLGDAQDKLQEEQVWKLK-LANGYFINEYRKDHGLKTVDGLDVP 461 (535) T ss_pred HHHHHHHHHHHHHHHHHhhhcccc---cCCeEEEEeccccccCHHHHHHHHHHH-HcCCCCHHHHHHHhCCCCCCCcccc Confidence 445566666665544333332211 124578899888888888887766544 567799999666541 111110110 Q ss_pred H-HH--HHHHHhhh---cccCccc-----cccccCCCCC--------CCCCCCCCCCCCCCC Q lcl|NC_016654. 491 E-EA--DLIDNANT---VSAPTFG-----FGTDQPPLPT--------ENDPATDPEAVDEGE 533 (533) Q Consensus 491 ~-El--~rI~~E~~---~~~~~~~-----~~~~~~~~~~--------~~~~~~~~~~~~d~~ 533 (533) - .+ ......++ ...|... ...++.+... .+.+++.++...+.| T Consensus 462 ~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~q~~~~~~~~~~~g~~~~~~~~~~~~~ 523 (535) T protein:vir:10 462 GFIGSAENFINATGFGQPNVPDSSDDSGSTLGERERQERIQHSKDYEKGKDDPKSPLPKPSE 523 (535) T ss_pred ccccchhhcccccccccccCCCCCCCccccCCccccCcccccccccccCCCCCCCCCCcCCC Confidence 0 00 00000000 0001000 0000000000 011111111111111 No 204 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=96.32 E-value=0.00074 Score=37.75 Aligned_cols=404 Identities=11% Similarity=0.094 Sum_probs=169.8 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHH--HhcccCCCCCcccceee Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEA--FHGRTPTATGRAPKRYH 78 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~g~~~~~~~ 78 (533) |--|. +.=+++..+-||. +|..++.+..+..+.. ....+.... ++.+. ... ...-+. T Consensus 1 ~~~~~------------~~~~~~~~~g~~~----~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~-~v~--~~~al~ 59 (424) T protein:vir:18 1 MEEPK------------YTIDLRTNNGWWA----RLQSWFVGGRLVTPNQ--GSQTGPVSAHGHLGDS-SIN--DERILQ 59 (424) T ss_pred CCCCc------------ceEeecCCCchHH----HHHhhhcccccccccc--cccccccccccccccc-ccc--HHHhhc Confidence 32222 2223455555663 4544443322211111 000010010 00000 000 001112 Q ss_pred cChHHHHHHHHHHhhcCCCceEe-e-CCCch---HHHHHHHHHHhh--cc---HHHHHHHHHHHHhhhCCEEEEEEEcCC Q lcl|NC_016654. 79 APIPGVIAKLSTTELFSEQLKFL-D-AGKSK---EVQARADLIFNT--PR---FHSSLVEAGESCSALSGSFQRIVWDPT 148 (533) Q Consensus 79 ~n~~k~i~~~~a~ll~~e~~~i~-~-~~~~~---~~~~~l~~i~~~--n~---f~~~~~~~~~~~~~~G~~~~~~~~D~~ 148 (533) +.---..++.+|+-+-+=|..+- . .+... ....-+.++|.. |. -..-....+...+..|.+|+.+..|.. T Consensus 60 ~~~v~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~ 139 (424) T protein:vir:18 60 ISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA 139 (424) T ss_pred cHHHHHHHHHHHHhhccCceEEEEeecCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC Confidence 22222455555555554454331 1 11100 011223344431 11 122234445566778999998888765 Q ss_pred CCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccc Q lcl|NC_016654. 149 IADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDI 228 (533) Q Consensus 149 ~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~ 228 (533) +. -+.+..++|..+.+..+.+++ .|+ |... |..+.+ T Consensus 140 G~-~~~L~pl~~~~V~v~~~~~~~----------------~y~--------------~~~~----g~~~~~--------- 175 (424) T protein:vir:18 140 GD-VISLLPLQSANMDVKLVGKKV----------------VYR--------------YQRD----SEYADF--------- 175 (424) T ss_pred Cc-EEEEEEecCcceEEEEcCCeE----------------EEE--------------EEeC----CeEEEe--------- Confidence 43 345666777777654433321 111 1000 000000 Q ss_pred cccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHH-HhC-cceee Q lcl|NC_016654. 229 AVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDF-RIG-AGKVH 306 (533) Q Consensus 229 ~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~-~~~i~ 306 (533) ..--+.|+.+.. +. ...|.|.+..+ ...| .++....++...+ +.| ++..+ T Consensus 176 -----------------~~~eIih~r~~~----~d-----g~~G~spi~~~-~~~i-~~~~a~~~~~~~~f~ng~~p~gi 227 (424) T protein:vir:18 176 -----------------SQKEIFHLKGFG----FT-----GLVGLSPIAFA-CKSA-GVAVAMEDQQRDFFANGAKSPQI 227 (424) T ss_pred -----------------ccccEEEecCcC----CC-----CcccccHHHHH-HHHH-HHHHHHHHHHHHHHHccCCcceE Confidence 000122333211 11 23466666543 2344 2344444554443 543 33333 Q ss_pred e--chHHhcCCCCccccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChh Q lcl|NC_016654. 307 A--SESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPV 380 (533) Q Consensus 307 v--~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~ 380 (533) + |..++.. .. .+.-...+.....+ .++++ +....++.++......++++..+...++|+...|+||. T Consensus 228 l~~~~~~l~~---e~---~~~~~~~~~~~~~g-~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~ 300 (424) T protein:vir:18 228 LSTGEKVLTE---QQ---RSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPH 300 (424) T ss_pred EEeCCcCCCH---HH---HHHHHHHHHHHhCC-cccCCceeccCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHH Confidence 2 2221110 00 00011112211111 11111 11223566666667778888888889999999999999 Q ss_pred hcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHH Q lcl|NC_016654. 381 SLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQ 460 (533) Q Consensus 381 ~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~ 460 (533) .+|+..++..++..++.... ..++.+|..+++.+..-.+..+..........+.++++.-+..|..++++ T Consensus 301 ~lg~~~~~t~~~sn~eq~~~----------~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~ 370 (424) T protein:vir:18 301 LVGDVEKSTSWGSGIEQQNL----------GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAA 370 (424) T ss_pred HhCCCCCcccccccHHHHHH----------HHHHHHHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHH Confidence 99876554433333322221 22334444444444332222222221122345677777778889999999 Q ss_pred HHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 461 TVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 461 ~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) .+.+++.+|+|+..++.+.+ ++.. | +.+ +..-...+..|.. +.+.+++..++|- T Consensus 371 ~~~~~~~~G~~T~NE~R~~~--gl~p---------i--~gG---D~~~~~~n~~~l~---~~~~~~~p~~~ga 424 (424) T protein:vir:18 371 FMKAMGEAGLRTINEMRRTD--NLPP---------L--PGG---DVAMRQSQYVPIT---DLGTNKEPRNNGA 424 (424) T ss_pred HHHHHHhCCCcCHHHHHHHh--CCCC---------C--CCc---CeeeeccCccchH---hhhccCCCccCCC Confidence 99999999999999866654 2221 0 000 0000001111110 1111222222222 No 205 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=96.29 E-value=0.00078 Score=37.62 Aligned_cols=372 Identities=12% Similarity=-0.005 Sum_probs=148.2 Q ss_pred HHhhhHhhc------------------CCHHHHHHHHhccCcchhhHHHHHHHHHHHH-HhcccC----CCCCcc---cc Q lcl|NC_016654. 22 VAESHVWWE------------------GDLDKLATFYGAEGRTSPSGIKARTKAAYEA-FHGRTP----TATGRA---PK 75 (533) Q Consensus 22 ~~~~~~w~~------------------gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~-~~~~~~----~~~g~~---~~ 75 (533) |--|..|.. |+|+... |- +..... ...... ..+. .|.+.+ +..|.. .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~-~~~~~~---~~~~~~~~g~~~~~~~~~~~~~t~~~ 74 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVDYNPGDPDMVE-FR-GPEEEP-EARALP---WIRPTAWSGYPESWATPSWGSAQDKL 74 (409) T ss_pred CchhhhhcccccCCCcccccccccccCCCCceee-cc-CCCcch-hhhhcc---cccccccccccccccccCccccchhh Confidence 344444433 3443221 11 111100 000000 0000 011111 111111 11 Q ss_pred eeecChHHHHHHHHHHhhcCCCceEeeCCC-chHHHHHHHHHHhhc--cHHHHHHHHHHHHhhhCCEEEEE-EEcCCCCC Q lcl|NC_016654. 76 RYHAPIPGVIAKLSTTELFSEQLKFLDAGK-SKEVQARADLIFNTP--RFHSSLVEAGESCSALSGSFQRI-VWDPTIAD 151 (533) Q Consensus 76 ~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~-~~~~~~~l~~i~~~n--~f~~~~~~~~~~~~~~G~~~~~~-~~D~~~~~ 151 (533) -+.+.-.-..|+.+|+-+-+=|..+--.+. .+.....|.. +-| .-...+.+.+...+.+|.+|+.+ ..|.++. T Consensus 75 ~~~~~~v~acV~~Ia~~iA~lpl~~~~~~~~~~~~~~ll~~--~PN~~~t~~~f~~~l~~~lllGnay~~~i~r~~~G~- 151 (409) T protein:vir:83 75 RTLIDVAWACIDLNASVLSSMPIYRMRNGRIIDSVAWMSNP--DPEVYTSWQEFAKQLFWDFQLGEAFVLPMAHGSDGY- 151 (409) T ss_pred HhhhHHHHHHHHHHHHhhccCceEEeeCCccccchhhhccc--CCCCCCCHHHHHHHHHHHHhhCCcEEEEEEECCCCc- Confidence 122233344566666655554543321111 1111111111 111 12233444445555568887654 4555442 Q ss_pred ceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccc Q lcl|NC_016654. 152 NAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVE 231 (533) Q Consensus 152 ~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~ 231 (533) -+.+..++|+++.+..+.+ + .+.|. +.+ .. + .+ T Consensus 152 ~~~L~pl~p~~v~v~~~~~---------------g-------------~~~y~-~~~--~~-~--------------~~- 184 (409) T protein:vir:83 152 PIRFRVVPPWLVNVELKKG---------------A-------------RREYR-IGG--LN-V--------------TD- 184 (409) T ss_pred EEEEEEECCcceEEEEcCC---------------c-------------eEEEE-Ecc--cc-C--------------cc- Confidence 2345566676655443221 1 11111 100 00 0 00 Q ss_pred ccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHH-HhCcceeeechH Q lcl|NC_016654. 232 GADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDF-RIGAGKVHASES 310 (533) Q Consensus 232 ~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~~~~i~v~~~ 310 (533) -+.|++...+. ...+|.|.+..+ ...| .+.....++...+ +.+... .. T Consensus 185 -----------------eiiHir~~~~~--------~~~~G~spi~~~-~~~i-~~~~a~~~~~~~~f~nga~p----~g 233 (409) T protein:vir:83 185 -----------------EILHIRYQGNT--------ADAHGHGPLESA-APRQ-VVIGLLQKYVQNLAETGGVP----LY 233 (409) T ss_pred -----------------ceEEeCCCCCC--------CCcccccHHHHH-HHHH-HHHHHHHHHHHHHHhcCCCc----ce Confidence 12333322111 123577766543 3444 3445555555554 443221 11 Q ss_pred HhcCCCCccccccCcchhhhhhcccccccccc-----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccC Q lcl|NC_016654. 311 VLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG-----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLS 385 (533) Q Consensus 311 ~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~-----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~ 385 (533) ++...+.-.....+.....+.....+ +++. ++....+.++..-...++++..+...++|+...|++|..+|.. T Consensus 234 il~~~~~ls~e~~~~~~~~~~~~~~~--nag~~~il~~g~~~~~~~~~s~~d~q~le~r~~~~~eIa~~fgVPp~llg~~ 311 (409) T protein:vir:83 234 WLGVERRLSETEAVDLMDRWIESRSK--YAGHPALVTGGATLNQAKSMSAQDLSLMELTQFNEARIAILLGVPPFLVGLP 311 (409) T ss_pred EeecCCCCCHHHHHHHHHHHHHhhCC--ccCccceecCCcccccccCCCHHHHHHHHHHHhhHHHHHHHhCCCHHHccCC Confidence 12221110000000011111111100 1111 0001112223334455788888888899999999999999864 Q ss_pred CCcc-hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHH Q lcl|NC_016654. 386 DEVA-QTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQA 464 (533) Q Consensus 386 ~~~~-~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~ 464 (533) ..+. .|-..++.... ..++..|.-++..+..-.+..+. ...+.+.++++.-+-.|..++++.+++ T Consensus 312 ~~~~~~tysn~eq~~~----------~f~~~tL~P~~~~ie~~l~~~Ll----~~~~~~~f~~~~llr~d~~~r~~~~~~ 377 (409) T protein:vir:83 312 GATGSLTYSNIEQLFS----------FHDRSSLRPKATAVMAALDRWAL----PSPQHLELNRDDYTRPSLVERATAYKI 377 (409) T ss_pred CCccccccccHHHHHH----------HHHHHHHHHHHHHHHHHHHHhhC----CCCcEEEeehhhhhccCHHHHHHHHHH Confidence 4321 12111211121 12223333333333222121111 123467777777788899999999999 Q ss_pred HHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCC Q lcl|NC_016654. 465 WSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTE 519 (533) Q Consensus 465 l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~ 519 (533) ++++|+|+..++.+.+ +... ..|+|.-+.... T Consensus 378 ~~~~G~lT~NE~R~~~--glpp---------------------~~ggd~l~~~gv 409 (409) T protein:vir:83 378 MIEAGVMEPNEARAME--RLHS---------------------EAAAVRLSGGGV 409 (409) T ss_pred HHhCCCcCHHHHHHHh--CCCC---------------------CCCCcccCCCCC Confidence 9999999999865543 2211 111222111111 No 206 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=96.21 E-value=0.00087 Score=37.37 Aligned_cols=420 Identities=12% Similarity=0.099 Sum_probs=167.8 Q ss_pred CCCC--CCcC-CCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCccccee Q lcl|NC_016654. 1 MSLP--EANT-AWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRY 77 (533) Q Consensus 1 ~~~~--~~~~-~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 77 (533) |..| +.|+ +=||-...+.... - ...+||.+..... . .++..|+.+. + T Consensus 26 ~~~p~~~dG~s~i~~~~~~~~~~~-~-----------~~~~~~gg~~~n~----~-eLI~~YR~ma-------------~ 75 (533) T protein:vir:58 26 MGAPHGAGGSSMIPINMYHPFATA-G-----------YASRFYGGIEFNR----F-FLYDMYDRMD-------------Y 75 (533) T ss_pred ccCccCCCCCccccCCCCcchhhh-h-----------hhhhhhccccccH----H-HHHHHHHHhh-------------c Confidence 7777 4443 4444332221110 0 1123343221110 1 1111222210 0 Q ss_pred ecChHHHHHHHHHHhh-----cCCCceEeeCCC--chHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCC Q lcl|NC_016654. 78 HAPIPGVIAKLSTTEL-----FSEQLKFLDAGK--SKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIA 150 (533) Q Consensus 78 ~~n~~k~i~~~~a~ll-----~~e~~~i~~~~~--~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~ 150 (533) ..+-....++..++-. +..|+++..++. ++...+.|.+++ +|++..++.+....+.|.+|+|...++. . T Consensus 76 ~~pEVd~AideIvneaiv~d~~~~pV~v~l~~~e~s~~iK~kI~~ll---df~~~~~~~fR~WYVDGriy~Hkiik~~-k 151 (533) T protein:vir:58 76 TDPLISTVLDIIADECTIPNENGNIVDVVTKDIELAKAILSYLDYVI---NIEKNAYPIIRNMIKYGDMFLHILEKGS-D 151 (533) T ss_pred cCcchhhHHHhhhceeeEecCCCceeEeecccccccHHHHHHHHHHh---cchhhhhHHHHhhhhcceeEEEeccCCc-c Confidence 1122233333333321 233555554321 223344455444 6999999999999999999999855432 2 Q ss_pred Cce-EEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCC-cccceeehhhccccccc Q lcl|NC_016654. 151 DNA-WIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTAT-SLGWMMALTDHPATRDI 228 (533) Q Consensus 151 ~~~-~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~-~lG~~v~l~~~~~~~~~ 228 (533) .+| .+.+++|.++-++..-. ++ ..+| |-+..|.+... ..+..+|-+ T Consensus 152 ~GI~elr~lDPr~i~~vr~~~--t~------------~eyy----------vy~~~~~~~~s~~~~~kI~~d-------- 199 (533) T protein:vir:58 152 GTIEKFQVVSPYIFSKRYNPE--TD------------TWYY----------VITDVYRNVVSGYFNEDIPEE-------- 199 (533) T ss_pred cchhhheecCCeeeEEEEeec--cc------------eEEE----------eecccccccccCccccccchh-------- Confidence 333 66778887765553110 00 0111 11111111110 011111110 Q ss_pred cccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHH--HHHHHhCcceee Q lcl|NC_016654. 229 AVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSL--MRDFRIGAGKVH 306 (533) Q Consensus 229 ~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~--~~~~~~~~~~i~ 306 (533) .+.|..-.. ....++.+.|-+-.||+++ ..|=-+-+.+ .|.-|.-.+||| T Consensus 200 --------------------aI~y~~SGl-------~d~~~~~iisyLhkAiKp~-NQLkmiEDAlVIYRisRAPeRRvF 251 (533) T protein:vir:58 200 --------------------DVIHFSHKI-------DTNFFPYGRSYLESARAIW-NQLRLMEDALMLYRVVRSVDRRVF 251 (533) T ss_pred --------------------heeeeeecc-------ccCCCCceehhhhHHHHHH-HHHHHHHHHHHHHhhcCChhheEE Confidence 111111110 0012445566666666543 3332222211 122244455666 Q ss_pred e-ch---------HHhcCC--CCccccccCcch------hhhh----hc--cccccccccccccceeeechhhhhHHHHH Q lcl|NC_016654. 307 A-SE---------SVLTNL--GMGQGVSLDEEQ------EVYS----RV--GSGGFNANGDMETIFEFFQPAIRVLEHDQ 362 (533) Q Consensus 307 v-~~---------~~l~~~--~~~~~~~~d~~~------~~~~----~~--~~~~~~~~~~~~~~i~~~~~~ir~e~~~~ 362 (533) - .- .+++.- .-.+....|... +-|. ++ .+.+--.| +....|+++... . -.-+. T Consensus 252 YIDVGNlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRReG-grgTEI~TLpGg-~-lgeme 328 (533) T protein:vir:58 252 YVDVGNVPPDKINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRRGD-RRAVEIDILQGS-K-VDLAE 328 (533) T ss_pred EEeecCCCccCHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhhhcccccCC-CccceeeecCCC-C-CCcHH Confidence 2 11 111100 000000111110 0110 00 00011111 122235555432 3 24467 Q ss_pred HHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCcee Q lcl|NC_016654. 363 GAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEE 442 (533) Q Consensus 363 ~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~ 442 (533) .+..+.+.+....++|.+.++.+++. --+++|...+-.--..+.+.+.. +.+++...|.+ ++....++ T Consensus 329 DV~YF~kkLy~ALnVP~sRl~~e~~f-gr~~eItRDEiKF~KFI~rLR~r----F~~ll~~qLil-------k~iit~ee 396 (533) T protein:vir:58 329 DVEYMLNRLISALKVPKAFIGYEGDV-NAKNTLATQDIKFNNTIKRIQGF----FVEELERMVRM-------NKEFADQD 396 (533) T ss_pred HHHHHHHHHHHHhCCCeeecCCCCCC-ccchhhhHHHHHHHHHHHHHHHH----HHHHHhccccc-------ccCcchhh Confidence 77888889999999998888765542 23445544333333334444443 34444433322 23334456 Q ss_pred EEEEeCCCCCCCHHHHH-------HHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcc-cCccccccccC Q lcl|NC_016654. 443 LELEWPKFARESDLAKA-------QTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVS-APTFGFGTDQP 514 (533) Q Consensus 443 v~i~f~d~i~~d~~e~a-------~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~-~~~~~~~~~~~ 514 (533) ..++|...---.+...+ ..++++ .+.+++....++.. -.+|+.. ++.+.|++|.... -+....+++.. T Consensus 397 w~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~--dpyvgk~yi~k~IL-r~tdei~-~q~e~ie~E~~~~~~~~~~~~~e~~ 472 (533) T protein:vir:58 397 FRLVMNRSNSIVEGERFAVIEQRIGIAERL--KGWVREDWIYSNIL-QIPYDLK-PQEEVAEAAGGGGLFDTGGFGEETT 472 (533) T ss_pred eeeeeeccchHHHHHHHHHHHHHHHHHHHh--cchhhHHHHHHHHh-cCChhhh-HHHHHHHHhhcCCCCCCCCcccccC Confidence 67777654333333222 233322 35677776544433 3566544 4456788774321 11111111111 Q ss_pred ---------------CCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 515 ---------------PLPTENDPATDPEAVDEGE 533 (533) Q Consensus 515 ---------------~~~~~~~~~~~~~~~~d~~ 533 (533) +...+.+-++.++.+.-|+ T Consensus 473 ~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~ 506 (533) T protein:vir:58 473 PADFLGERGSPIESPRGRTEFDFGTEGGEELGGE 506 (533) T ss_pred CcccCccccCcccCCCChhhHhcccCCccccccc Confidence 1111111112222222222 No 207 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=96.19 E-value=0.00089 Score=37.30 Aligned_cols=465 Identities=10% Similarity=0.028 Sum_probs=191.9 Q ss_pred hHHHHHHHHhhhHhhcCCHHHHHHHHhccCcc-hhhHHHHH----HHHHHHHHhcccCCCCCcc---cce--e-ecChHH Q lcl|NC_016654. 15 LAAVTARVAESHVWWEGDLDKLATFYGAEGRT-SPSGIKAR----TKAAYEAFHGRTPTATGRA---PKR--Y-HAPIPG 83 (533) Q Consensus 15 ~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~-~~~~~~~~----~~~~~~~~~~~~~~~~g~~---~~~--~-~~n~~k 83 (533) ++..+...-.. -++.-...... .++...+. ..++++.+..-........ +.+ + ..+-+. T Consensus 1 m~~lfgf~~~~----------~~~~~~~~~s~~~p~~ddg~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd 70 (558) T protein:vir:10 1 MAKLFGFSIEE----------TQKKSTSIISPVPKNNEDGVDNFISSGFYGQYVDIEGAYRSEYDLIRRYREMALHPEAD 70 (558) T ss_pred Ccchhcchhhh----------hhhhccCCccccCCCccccccceeccceeeeeecccchhhhHHHHHHHHHHHhhccchh Confidence 22222110000 00000000000 11111111 0011111111000000000 000 0 112222 Q ss_pred HHHHHHHHh-hc----CCCceEeeCCCc------hHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCC--CC Q lcl|NC_016654. 84 VIAKLSTTE-LF----SEQLKFLDAGKS------KEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPT--IA 150 (533) Q Consensus 84 ~i~~~~a~l-l~----~e~~~i~~~~~~------~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~--~~ 150 (533) -.++..++= +. .+|+++.+++-+ +...+..+.|++-=+|++..++......+.|..||+.++|.. .. T Consensus 71 ~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRiyfHKiid~k~pk~ 150 (558) T protein:vir:10 71 GAIEDVVNEAIVSDLYDSPVEVELSNLNASNTLKKKIREEFRYIKEMMDFDKKSHEIFRNWYVDGRVFYLKVIDTKNPQE 150 (558) T ss_pred hHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccc Confidence 233333321 11 125555554322 233455566777778999999999999999999999999854 22 Q ss_pred CceEEEEEcCCeEEEEEe----cCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCc---ccceeehhhcc Q lcl|NC_016654. 151 DNAWIDFVDADRAIPEFR----WGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATS---LGWMMALTDHP 223 (533) Q Consensus 151 ~~~~i~~v~~~~~~P~~~----~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~---lG~~v~l~~~~ 223 (533) +=..+.+++|.++-++.. ......+.- ++...+ . ...+++.+|.+|...... .+-.++. T Consensus 151 GI~ELr~lDPr~i~~Vr~i~~~~~~~~~~~~---~~~~~~-~------~~~~~~~eyy~Y~~~~~~~~~~~~~~~~---- 216 (558) T protein:vir:10 151 GIQDLRYIDPLKIKFIRQEKRKPGNQDPAIR---VRSEQD-V------VPNPEFEEFYIYTPKVQHPTGMVGQMGG---- 216 (558) T ss_pred cceeeeeeCcccceeeeeeccccccccceee---eecccc-e------eeccceeEeeeecCCcccccccceeecC---- Confidence 345577788887766532 111111111 111111 0 123555666666432110 0000000 Q ss_pred ccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHH---HHHHHHh Q lcl|NC_016654. 224 ATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSS---LMRDFRI 300 (533) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~---~~~~~~~ 300 (533) ...+ .|+ .-.+.|..-+.-..+ ...=.|-+-.||+++ ..| +++.+ +.|.-|. T Consensus 217 -~~~v-----------kI~----~dAI~y~hSGL~d~~-------~~~i~syLhkAIKp~-NQL-kmlEDAlVIYRitRA 271 (558) T protein:vir:10 217 -KNSI-----------KIA----KDSITMCTSGLVDRN-------KNRVLSYLHKAIKAL-NQL-RMIEDSLVIYRLSRA 271 (558) T ss_pred -CCce-----------eec----hhheeeecccceecC-------CCeeeecchHhhHhH-Hhh-HHHHhhHHHHhhhcc Confidence 0000 000 001111111100000 000012222344432 222 11111 2233466 Q ss_pred Ccceeee-c---------hHHhcC----------CCCccccccCcchhhhhhcc---ccccccccccccceeeechhhhh Q lcl|NC_016654. 301 GAGKVHA-S---------ESVLTN----------LGMGQGVSLDEEQEVYSRVG---SGGFNANGDMETIFEFFQPAIRV 357 (533) Q Consensus 301 ~~~~i~v-~---------~~~l~~----------~~~~~~~~~d~~~~~~~~~~---~~~~~~~~~~~~~i~~~~~~ir~ 357 (533) -.+|||- . +.+++. +....|..-| ++.|..+. +.+--.|+ ....|+++...-. T Consensus 272 PERRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~d--drk~msMlEDyWLpRReGg-rgTEItTLpGgqn- 347 (558) T protein:vir:10 272 PERRIFYIDVGNLPKVKAEQYLKEVMSRYRNKLVYDANTGEVRD--DRKFMSMMEDFWLPRREGG-RGTEITTLPGGQN- 347 (558) T ss_pred ccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecc--cchhhhhHhhhcccccCCC-CccceeeccccCC- Confidence 6777763 1 111110 0111111111 11111100 01111122 2223444433222 Q ss_pred HHHHHHHHHHHHHHHHhhCCChhhcccCCC-cchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCC Q lcl|NC_016654. 358 LEHDQGAALLLREVLRKTGYSPVSLGLSDE-VAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKG 436 (533) Q Consensus 358 e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~-~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~ 436 (533) -.-+..+..+.+.+....++|.+.++.+++ +..-++||...+-.--..+.+.+..|..-|.++++.-|.|....-.... T Consensus 348 Lgem~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW 427 (558) T protein:vir:10 348 LGELSDVDYFQKKLYRALGVPESRIAAEGGFNLGRSSEILRDELKFAKFVGRLRKRFAAMFNDMLKTQLVLKNIVTPEDW 427 (558) T ss_pred cchHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHH Confidence 223556677778889999999888875543 2224567766666667778888888888888888877655321100000 Q ss_pred CCCceeEEEEeCCCCCCCHHHH-------HHHHHHHH--hCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcc---c Q lcl|NC_016654. 437 AAPSEELELEWPKFARESDLAK-------AQTVQAWS--VASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVS---A 504 (533) Q Consensus 437 ~~~~~~v~i~f~d~i~~d~~e~-------a~~~~~l~--~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~---~ 504 (533) ......+.++|...---.+... +..++++. -+...|.++..++.. -.+|+|.++|.++|++|.... + T Consensus 428 ~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~IL-r~tDeeI~~~~kqI~~E~k~~~~~~ 506 (558) T protein:vir:10 428 KTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYSTEYVRKRVL-RQTDMEIEEIDTQIEDEIQKGIIPD 506 (558) T ss_pred HHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHh-ccCHHHHHHHHHHHHHHHhCCCCCC Confidence 0112457777765433333322 23333332 122569998777654 489999999999999996421 1 Q ss_pred Cc---cccccccCCCCCCCC-CCCCCCCCCCCC Q lcl|NC_016654. 505 PT---FGFGTDQPPLPTEND-PATDPEAVDEGE 533 (533) Q Consensus 505 ~~---~~~~~~~~~~~~~~~-~~~~~~~~~d~~ 533 (533) |. +-.++-.|+..+..- .-+.++.+++-+ T Consensus 507 p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 539 (558) T protein:vir:10 507 PSQIDPITGEPLPQEGDPAMEGMGEQPVDPDLE 539 (558) T ss_pred ccccChhhccccCccCCchhccCCCCCcccccc Confidence 11 111222222222211 112223233322 No 208 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=96.18 E-value=0.00091 Score=37.27 Aligned_cols=372 Identities=9% Similarity=-0.059 Sum_probs=142.4 Q ss_pred HHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHH-HHHHhc---ccCCCCCcccceeecChHHHHHHHHHHhhcCCC Q lcl|NC_016654. 22 VAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAA-YEAFHG---RTPTATGRAPKRYHAPIPGVIAKLSTTELFSEQ 97 (533) Q Consensus 22 ~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~-~~~~~~---~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e~ 97 (533) |..++.|-.+ ..... . .+....+. ...++. ...... ...-++..--..+++.+|+-+-+=| T Consensus 1 Mglf~~~~~~-----------~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~v~--~~~al~~~~V~~~i~~Ia~~ia~l~ 65 (384) T protein:vir:49 1 MPIFNITNLA-----------TESPP-S-NQDSFFDITDPEFLDALNGSEWVS--AETALKNSDLFSIISQLSNDLATAK 65 (384) T ss_pred CccccccccC-----------ccccc-c-cchhhccccchhhcccccCCceec--hhhhhccHHHHHHHHHHHHHHhhCc Confidence 2222221111 11100 0 00000000 011110 000000 0111222223345566666665545 Q ss_pred ceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEec-CCceEEE Q lcl|NC_016654. 98 LKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRW-GRLVAVT 176 (533) Q Consensus 98 ~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~-g~~~~v~ 176 (533) ..+. ... ....+.+--..-....-....+...+..|.+|+.+..|..+. -+.+..++|+.+-++.+. +.. + T Consensus 66 ~~~~--~~~--~~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~-~~~L~~l~~~~v~v~~~~~~~~---~ 137 (384) T protein:vir:49 66 ITTS--RKQ--LQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGR-DMKWEYLRPSQVSFNRLDNQNG---L 137 (384) T ss_pred eeee--cch--hhhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCc-EEEEEEEcCceeEEEEcCCCce---E Confidence 4332 211 111111110111123334455566777899999988886543 346667777777665322 110 1 Q ss_pred EEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCC Q lcl|NC_016654. 177 FWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNV 256 (533) Q Consensus 177 f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~ 256 (533) | | . |...+...|..+.+. .--+.|+... T Consensus 138 ~------------y-------------~-~~~~~~~~~~~~~~~--------------------------~~eVih~~~~ 165 (384) T protein:vir:49 138 Y------------Y-------------N-ITFDDPRIPPKQHVP--------------------------QGDILHFRLL 165 (384) T ss_pred E------------E-------------E-EEecCccccceeEec--------------------------CccEEEecCC Confidence 1 1 1 111111111111100 0012333321 Q ss_pred cccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhC-cceeeechHHhcCCCCccccccCcchhhhhhcc Q lcl|NC_016654. 257 TPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIG-AGKVHASESVLTNLGMGQGVSLDEEQEVYSRVG 334 (533) Q Consensus 257 ~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~-~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~ 334 (533) .++ ....|.|.+..+ ...|+ ++....++... |+.| ....++ ...+... .+...+...... T Consensus 166 ~~~--------~~~~G~s~i~~~-~~~i~-~~~~~~~~~~~~~~ng~~~~~il-----~~~~~~~---~~~~~~~~~~~~ 227 (384) T protein:vir:49 166 SVD--------GGLTSVSPLMAL-GRELN-IQKASDKLTLNALKNALNANGIL-----KIKGGGL---LDFKTKQSRSRQ 227 (384) T ss_pred CCC--------CceeeccHHHHH-HHHHH-HHHHHHHHHHHHHhccCCCceEE-----EeCCCCC---hHHHHHHHHHHH Confidence 111 123577776653 34443 33333334333 4543 333322 1111100 000011111111 Q ss_pred cccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHH Q lcl|NC_016654. 335 SGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKA 410 (533) Q Consensus 335 ~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~ 410 (533) ....+.++ +....++.++......++++..+...++|+...|+||..+|...++..|+..++..+...+.. .. T Consensus 228 ~~~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~i~~---~l 304 (384) T protein:vir:49 228 AMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEMIYNIYFKAVSR---FL 304 (384) T ss_pred hcccCCccceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHHHHHHHHHHHHH---HH Confidence 11111111 111235556656667788888889999999999999999997655555665554433222211 11 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhC-CCCCHHHH Q lcl|NC_016654. 411 RHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLH-EDWDDERV 489 (533) Q Consensus 411 ~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~-~~~~dee~ 489 (533) .-+...|.+.+.. .+ .++.....-.+.......+..+..+|++++.++...+. .++...|+ T Consensus 305 ~pi~~~i~~~l~~-------~l-----------~~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~ne~ 366 (384) T protein:vir:49 305 RPFVSELSKKLSC-------EV-----------DADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILPKDL 366 (384) T ss_pred HHHHHHHHHHhch-------hh-----------hhhhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCChhH Confidence 1111111111110 00 00011111122233333455677789999888776541 12433333 Q ss_pred HHHHHHHHHhhhcccCccccccccCCCCCCCCCC Q lcl|NC_016654. 490 QEEADLIDNANTVSAPTFGFGTDQPPLPTENDPA 523 (533) Q Consensus 490 ~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~ 523 (533) .+ +. . .++. +.+++++-- T Consensus 367 r~----~~--~---~~p~-------~gGd~~~~~ 384 (384) T protein:vir:49 367 PE----GE--T---DSTL-------KGGETNEQY 384 (384) T ss_pred HH----Hc--C---CCCC-------CCCCCCCCC Confidence 22 11 1 1111 111111111 No 209 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=96.16 E-value=0.00093 Score=37.21 Aligned_cols=432 Identities=13% Similarity=0.066 Sum_probs=164.0 Q ss_pred CCCC-------------------------------CCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhh Q lcl|NC_016654. 1 MSLP-------------------------------EANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPS 49 (533) Q Consensus 1 ~~~~-------------------------------~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~ 49 (533) -||| ..-+.+-|.+..++.-.+..++.-|.+ | -||.+.. T Consensus 39 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----l-~~~~~~~----- 108 (694) T protein:vir:10 39 QPVPADFARRGALNALDAAPVAEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDA----L-SFVTSSG----- 108 (694) T ss_pred CcccCCccccccchhhcccccCCCCcchhhhhhccccccCCCccccchhhhhhccCcccccc----h-hhhhccC----- Confidence 1111 112233333433333333322221111 1 0111110 Q ss_pred HHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHhhcCCCceE-------------ee----CCCch-HHHH Q lcl|NC_016654. 50 GIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTELFSEQLKF-------------LD----AGKSK-EVQA 111 (533) Q Consensus 50 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i-------------~~----~~~~~-~~~~ 111 (533) |-|++.=- .--.++.-+.++..+|.-+..+=..+ ++ ...++ +.-+ T Consensus 109 -------------F~Gy~~la----~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~ 171 (694) T protein:vir:10 109 -------------FPGFPTLV----LLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLK 171 (694) T ss_pred -------------cchHHHHH----HHhhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHH Confidence 00111000 00011122223333333332221111 11 11111 2335 Q ss_pred HHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCC--CceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceE Q lcl|NC_016654. 112 RADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIA--DNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEV 189 (533) Q Consensus 112 ~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~--~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~ 189 (533) .|..-++.=+.+..++++++.+-.+|+++..+-.+.+.. ..|.+ ..+.. ...|.++.++.++.+.- T Consensus 172 ~L~~e~erl~V~~~l~eaik~aRlfGGa~~~i~I~gdd~~l~~PL~--~~~~~----I~kGslKGl~ViDp~~v------ 239 (694) T protein:vir:10 172 QINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLV--PRPYT----VPKGSFQGLRVVEPYWV------ 239 (694) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeecCccccccccc--ccccc----ccCcceeeeEeeccccc------ Confidence 566666677899999999999999999986665542210 01110 00000 11233333222211100 Q ss_pred EEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCccccccccccccc Q lcl|NC_016654. 190 WRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLR 269 (533) Q Consensus 190 y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~ 269 (533) .+. .|... -|++.. +..++.-...+ ..| +.+ .++.|+....+ +.- .+... T Consensus 240 -------tP~-----~~n~~-------dP~spd----fgkP~~y~V~G-~~I-H~S--RL~~f~g~plP--d~L-Kp~y~ 289 (694) T protein:vir:10 240 -------TPN-----NYNSI-------NPVADD----FYKPSTWWMIG-TEV-HAT--RLHTIVSRPVG--DML-KPTYS 289 (694) T ss_pred -------ccc-----hhhhc-------cchhhc----cCCCceEEEec-eEE-eee--eEEEecCCCch--hhh-hcccc Confidence 000 00000 000000 00000000000 000 001 11222222111 111 12234 Q ss_pred ccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCCccccccCcch---hhhhhcccc-ccccccccc Q lcl|NC_016654. 270 YLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGMGQGVSLDEEQ---EVYSRVGSG-GFNANGDME 345 (533) Q Consensus 270 ~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~---~~~~~~~~~-~~~~~~~~~ 345 (533) .+|.|....+. +-+++.+++-.....-+......++ -.++.....++........- +.|+..... ..|. +. T Consensus 290 ~~G~Sv~q~~~-e~V~~~~rT~~~v~~Li~~~~v~~l-k~dla~~L~~g~~~~l~~R~eli~~~Rsn~G~~llDk---~~ 364 (694) T protein:vir:10 290 FAGISMTQLAM-PYIDNWLRTRQSVSDIVKQFSVSGI-LMDLAQALMPGANVDLSMRAELINRYRDNRNILFLDK---AT 364 (694) T ss_pred cCcccHHHHHH-HHHHHHHHHHhHHHHHHHhhhhHHH-HHHHHHhhcChhHHHHHHHHHHHHHhcCccceEEEec---CC Confidence 57899877643 5556666555444433322111111 11222111111111010000 112211111 1111 11 Q ss_pred cceeeechhhhhHHHHHHHHHHHHHHHHhhCCChh-hcccCCCcc-hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 346 TIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPV-SLGLSDEVA-QTATEASGKKDLTVKTTRAKARHFGSALGPLSTT 423 (533) Q Consensus 346 ~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~-~~g~~~~~~-~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~ 423 (533) + ++.|.++.+...-..+..++.+++..+++|.. -||-...|- .||+.=...|-+.+... .+..++..|+.++.+ T Consensus 365 E--efeq~stslSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~--Qe~~L~p~L~rl~~i 440 (694) T protein:vir:10 365 E--EFFQFNTPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAY--QRNALQQLMNDVIVM 440 (694) T ss_pred c--ceEEEecccCCHHHHHHHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHHH--HHHHHHHHHHHHHHH Confidence 2 23344555666677778888888888999854 456655553 67774444444444322 367788888888776 Q ss_pred HHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHH-------HHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHH Q lcl|NC_016654. 424 CLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQA-------WSVASAASTKTKVAYLHEDWDDERVQEEADLI 496 (533) Q Consensus 424 il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~-------l~~aGi~S~et~v~~l~~~~~dee~~~El~rI 496 (533) |..- .| |. .+ .++++.|+.--..++.|+|++..+ ++.+|+++..+...++ T Consensus 441 i~rS---~~-G~--id-p~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL---------------- 497 (694) T protein:vir:10 441 IQLS---LF-GA--VD-PSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARL---------------- 497 (694) T ss_pred HHHH---hc-CC--CC-CcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHH---------------- Confidence 5331 22 22 22 368999998888888877775433 3344444444433332 Q ss_pred HHhhhcccCccccccccCCCCCCCCCCCC-------CCCCCCCC Q lcl|NC_016654. 497 DNANTVSAPTFGFGTDQPPLPTENDPATD-------PEAVDEGE 533 (533) Q Consensus 497 ~~E~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~d~~ 533 (533) ..+..----......+.|+.+.+++-+|. .+.++.|+ T Consensus 498 ~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 541 (694) T protein:vir:10 498 NTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGA 541 (694) T ss_pred hcCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCC Confidence 22110000000012233333333322221 11111111 No 210 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=95.94 E-value=0.0012 Score=36.56 Aligned_cols=441 Identities=11% Similarity=0.054 Sum_probs=190.5 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhH-------------hhcCCHH------HHHHHHhccCcchhhHHHHHHHHHHHH Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHV-------------WWEGDLD------KLATFYGAEGRTSPSGIKARTKAAYEA 61 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~-------------w~~gd~~------~l~~~y~~~~~~~~~~~~~~~~~~~~~ 61 (533) |-|++-=..|-+.+-..+...+ ..++ =+.+++. -..+|+...... +....++..|+. T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~-~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~---~~~~eLI~~YR~ 76 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERL-KLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNI---SGTKDLINTYRQ 76 (516) T ss_pred CCchHhcccccchhhhHHhhhh-cCCcCcccCCCCCCCceeeecCCCcccccceeeeeecccccc---chHHHHHHHHHH Confidence 6665444445543311111111 1100 0000000 000111000000 011112222222 Q ss_pred HhcccCCCCCcccceeecChHHHHHHHHHHh-hc----CCCceEeeCCCc--h----HHHHHHHHHHhhccHHHHHHHHH Q lcl|NC_016654. 62 FHGRTPTATGRAPKRYHAPIPGVIAKLSTTE-LF----SEQLKFLDAGKS--K----EVQARADLIFNTPRFHSSLVEAG 130 (533) Q Consensus 62 ~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~l-l~----~e~~~i~~~~~~--~----~~~~~l~~i~~~n~f~~~~~~~~ 130 (533) + ...+-+.-.++..++= +. .+|+++.++.-+ + ...+..+.|++-=+|++..++.. T Consensus 77 m--------------a~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~f 142 (516) T protein:vir:10 77 L--------------INNPEVERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLF 142 (516) T ss_pred H--------------hhccchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHH Confidence 1 0111222233333321 11 125555553322 1 24455666777778999999999 Q ss_pred HHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecC--CceEEEE-EEE--ecCeeEEEEE Q lcl|NC_016654. 131 ESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGD--GQEVWRH-LER--HESGYIVHAV 205 (533) Q Consensus 131 ~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~--~~~~y~~-lE~--h~~~~I~~~~ 205 (533) ....+.|..||+.+.|....+=..+..++|.++.++.. +...+ +..++.. .|+ |.++.-.| . T Consensus 143 R~WYVDgRi~fhKiid~~k~GI~Elr~lDPr~i~~vR~------------i~~~~~~~~~v~~~~~e~~~Y~~~~~~~-~ 209 (516) T protein:vir:10 143 RRWYVDSRIFFHKIMPNPKKGIAELRRLDPRFMEYYRE------------IVTSDIGGTTIVKGYREFFIYTTGNEGY-S 209 (516) T ss_pred hhhhhcceEEEEEEecCccccceeeeeeCCcceeeEee------------ecccccccchhhhhhhheeeeccCcccc-c Confidence 99999999999988886555556677888888777521 11111 1111110 011 11111111 1 Q ss_pred Eecc--CCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHH Q lcl|NC_016654. 206 YKGT--ATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPT 283 (533) Q Consensus 206 y~~~--~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~l 283 (533) |.+. ..+.+..++-+ .+.|..-+.-..+ +... .|-+-.||+++ T Consensus 210 ~~g~~~~~~~~ikI~~d----------------------------AI~y~hSGL~d~~--~~~i-----~syLhkAiKp~ 254 (516) T protein:vir:10 210 YNGRIFEPNTRIKIPRS----------------------------AVVYASSGLMDCS--DRGI-----IGYLHNAVKPA 254 (516) T ss_pred cccceeCCCcceeechh----------------------------heeeecccceeCC--CCce-----eeeehhhhHhH Confidence 1110 00001111110 1122221110000 0001 22223344432 Q ss_pred HHHHHHHHHH--HHHHHHhCcceeee-c---------hHHhcC----------CCCccccccCcchhhhhhcc--ccccc Q lcl|NC_016654. 284 FHELDRIYSS--LMRDFRIGAGKVHA-S---------ESVLTN----------LGMGQGVSLDEEQEVYSRVG--SGGFN 339 (533) Q Consensus 284 id~lD~~~s~--~~~~~~~~~~~i~v-~---------~~~l~~----------~~~~~~~~~d~~~~~~~~~~--~~~~~ 339 (533) ..|=-+-+. +.|.-|.-.+|||- . +.+++. +....|. ...+.....++. +.+-- T Consensus 255 -NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGe-v~ddrk~msMlEDyWLpRR 332 (516) T protein:vir:10 255 -NQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGT-VKNQKRNLSMTEDYWLMRR 332 (516) T ss_pred -HhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCe-eccchhhhhhHhhhccccc Confidence 222111111 22334666777763 1 111110 0111111 111111000000 01111 Q ss_pred cccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc---hhHHHHHHHhhhHHHHHHHHHHHHHHH Q lcl|NC_016654. 340 ANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA---QTATEASGKKDLTVKTTRAKARHFGSA 416 (533) Q Consensus 340 ~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~---~Tatai~~~~~~l~~~~~~~~~~~~~a 416 (533) .|+ ....|++....-.. .-+..+..+.+.+....++|.+.+..+++.. .-++||...+-.--.-+.+.+..|..- T Consensus 333 eGg-rgTEItTLpGgqnl-gem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~l 410 (516) T protein:vir:10 333 DGK-SVTEVSSLPGAQTM-GDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQHDFEEI 410 (516) T ss_pred CCC-CccceeeccccCCc-ChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHH Confidence 122 22234444332222 2355677778889999999988887554422 346677666666667788888888888 Q ss_pred HHHHHHHHHHHHHhhccCCCCC--CceeEEEEeCCCCCCCHHH-------HHHHHHHHH--hCCCCCHHHHHHHhCCCCC Q lcl|NC_016654. 417 LGPLSTTCLRVDAIKFPGKGAA--PSEELELEWPKFARESDLA-------KAQTVQAWS--VASAASTKTKVAYLHEDWD 485 (533) Q Consensus 417 l~~li~~il~l~~~~~~~~~~~--~~~~v~i~f~d~i~~d~~e-------~a~~~~~l~--~aGi~S~et~v~~l~~~~~ 485 (533) |.++++.-|.|.... ..... ....+.++|...---.+.. .+..++++. -++..|.++..++.. -.+ T Consensus 411 f~~~L~~qLilKgii--t~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~IL-r~t 487 (516) T protein:vir:10 411 FLDPLKTNLIYKRII--TEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNIL-QMT 487 (516) T ss_pred HHHHHHHhhhhccCC--CHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHh-cCC Confidence 888888776552211 00001 1245777776443333322 233333332 345789998777654 489 Q ss_pred HHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCC Q lcl|NC_016654. 486 DERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPA 523 (533) Q Consensus 486 dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~ 523 (533) |+|..+|.++|++|....- + +.+++.++= T Consensus 488 Deei~~e~k~I~~E~~~~~--~-------~~p~~~~~f 516 (516) T protein:vir:10 488 EEQIAQEEKQIEQEAGIKR--F-------QNPENEDDF 516 (516) T ss_pred HhhHHHHHHHHHHhhhCCC--C-------CCCCccccC Confidence 9999999999999974210 0 000000000 No 211 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=95.94 E-value=0.0012 Score=36.56 Aligned_cols=441 Identities=11% Similarity=0.054 Sum_probs=190.5 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhH-------------hhcCCHH------HHHHHHhccCcchhhHHHHHHHHHHHH Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHV-------------WWEGDLD------KLATFYGAEGRTSPSGIKARTKAAYEA 61 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~-------------w~~gd~~------~l~~~y~~~~~~~~~~~~~~~~~~~~~ 61 (533) |-|++-=..|-+.+-..+...+ ..++ =+.+++. -..+|+...... +....++..|+. T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~-~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~---~~~~eLI~~YR~ 76 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERL-KLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNI---SGTKDLINTYRQ 76 (516) T ss_pred CCchHhcccccchhhhHHhhhh-cCCcCcccCCCCCCCceeeecCCCcccccceeeeeecccccc---chHHHHHHHHHH Confidence 6665444445543311111111 1100 0000000 000111000000 011112222222 Q ss_pred HhcccCCCCCcccceeecChHHHHHHHHHHh-hc----CCCceEeeCCCc--h----HHHHHHHHHHhhccHHHHHHHHH Q lcl|NC_016654. 62 FHGRTPTATGRAPKRYHAPIPGVIAKLSTTE-LF----SEQLKFLDAGKS--K----EVQARADLIFNTPRFHSSLVEAG 130 (533) Q Consensus 62 ~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~l-l~----~e~~~i~~~~~~--~----~~~~~l~~i~~~n~f~~~~~~~~ 130 (533) + ...+-+.-.++..++= +. .+|+++.++.-+ + ...+..+.|++-=+|++..++.. T Consensus 77 m--------------a~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~f 142 (516) T protein:vir:10 77 L--------------INNPEVERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLF 142 (516) T ss_pred H--------------hhccchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHH Confidence 1 0111222233333321 11 125555553322 1 24455666777778999999999 Q ss_pred HHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecC--CceEEEE-EEE--ecCeeEEEEE Q lcl|NC_016654. 131 ESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGD--GQEVWRH-LER--HESGYIVHAV 205 (533) Q Consensus 131 ~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~--~~~~y~~-lE~--h~~~~I~~~~ 205 (533) ....+.|..||+.+.|....+=..+..++|.++.++.. +...+ +..++.. .|+ |.++.-.| . T Consensus 143 R~WYVDgRi~fhKiid~~k~GI~Elr~lDPr~i~~vR~------------i~~~~~~~~~v~~~~~e~~~Y~~~~~~~-~ 209 (516) T protein:vir:10 143 RRWYVDSRIFFHKIMPNPKKGIAELRRLDPRFMEYYRE------------IVTSDIGGTTIVKGYREFFIYTTGNEGY-S 209 (516) T ss_pred hhhhhcceEEEEEEecCccccceeeeeeCCcceeeEee------------ecccccccchhhhhhhheeeeccCcccc-c Confidence 99999999999988886555556677888888777521 11111 1111110 011 11111111 1 Q ss_pred Eecc--CCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHH Q lcl|NC_016654. 206 YKGT--ATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPT 283 (533) Q Consensus 206 y~~~--~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~l 283 (533) |.+. ..+.+..++-+ .+.|..-+.-..+ +... .|-+-.||+++ T Consensus 210 ~~g~~~~~~~~ikI~~d----------------------------AI~y~hSGL~d~~--~~~i-----~syLhkAiKp~ 254 (516) T protein:vir:10 210 YNGRIFEPNTRIKIPRS----------------------------AVVYASSGLMDCS--DRGI-----IGYLHNAVKPA 254 (516) T ss_pred cccceeCCCcceeechh----------------------------heeeecccceeCC--CCce-----eeeehhhhHhH Confidence 1110 00001111110 1122221110000 0001 22223344432 Q ss_pred HHHHHHHHHH--HHHHHHhCcceeee-c---------hHHhcC----------CCCccccccCcchhhhhhcc--ccccc Q lcl|NC_016654. 284 FHELDRIYSS--LMRDFRIGAGKVHA-S---------ESVLTN----------LGMGQGVSLDEEQEVYSRVG--SGGFN 339 (533) Q Consensus 284 id~lD~~~s~--~~~~~~~~~~~i~v-~---------~~~l~~----------~~~~~~~~~d~~~~~~~~~~--~~~~~ 339 (533) ..|=-+-+. +.|.-|.-.+|||- . +.+++. +....|. ...+.....++. +.+-- T Consensus 255 -NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGe-v~ddrk~msMlEDyWLpRR 332 (516) T protein:vir:10 255 -NQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGT-VKNQKRNLSMTEDYWLMRR 332 (516) T ss_pred -HhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCe-eccchhhhhhHhhhccccc Confidence 222111111 22334666777763 1 111110 0111111 111111000000 01111 Q ss_pred cccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc---hhHHHHHHHhhhHHHHHHHHHHHHHHH Q lcl|NC_016654. 340 ANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA---QTATEASGKKDLTVKTTRAKARHFGSA 416 (533) Q Consensus 340 ~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~---~Tatai~~~~~~l~~~~~~~~~~~~~a 416 (533) .|+ ....|++....-.. .-+..+..+.+.+....++|.+.+..+++.. .-++||...+-.--.-+.+.+..|..- T Consensus 333 eGg-rgTEItTLpGgqnl-gem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~l 410 (516) T protein:vir:10 333 DGK-SVTEVSSLPGAQTM-GDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQHDFEEI 410 (516) T ss_pred CCC-CccceeeccccCCc-ChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHH Confidence 122 22234444332222 2355677778889999999988887554422 346677666666667788888888888 Q ss_pred HHHHHHHHHHHHHhhccCCCCC--CceeEEEEeCCCCCCCHHH-------HHHHHHHHH--hCCCCCHHHHHHHhCCCCC Q lcl|NC_016654. 417 LGPLSTTCLRVDAIKFPGKGAA--PSEELELEWPKFARESDLA-------KAQTVQAWS--VASAASTKTKVAYLHEDWD 485 (533) Q Consensus 417 l~~li~~il~l~~~~~~~~~~~--~~~~v~i~f~d~i~~d~~e-------~a~~~~~l~--~aGi~S~et~v~~l~~~~~ 485 (533) |.++++.-|.|.... ..... ....+.++|...---.+.. .+..++++. -++..|.++..++.. -.+ T Consensus 411 f~~~L~~qLilKgii--t~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~IL-r~t 487 (516) T protein:vir:10 411 FLDPLKTNLIYKRII--TEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNIL-QMT 487 (516) T ss_pred HHHHHHHhhhhccCC--CHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHh-cCC Confidence 888888776552211 00001 1245777776443333322 233333332 345789998777654 489 Q ss_pred HHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCC Q lcl|NC_016654. 486 DERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPA 523 (533) Q Consensus 486 dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~ 523 (533) |+|..+|.++|++|....- + +.+++.++= T Consensus 488 Deei~~e~k~I~~E~~~~~--~-------~~p~~~~~f 516 (516) T protein:vir:10 488 EEQIAQEEKQIEQEAGIKR--F-------QNPENEDDF 516 (516) T ss_pred HhhHHHHHHHHHHhhhCCC--C-------CCCCccccC Confidence 9999999999999974210 0 000000000 No 212 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=95.83 E-value=0.0014 Score=36.24 Aligned_cols=393 Identities=9% Similarity=0.003 Sum_probs=147.8 Q ss_pred HHhhhH--hhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCC-cccceeecChHHHHHHHHHHhhcCCCc Q lcl|NC_016654. 22 VAESHV--WWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATG-RAPKRYHAPIPGVIAKLSTTELFSEQL 98 (533) Q Consensus 22 ~~~~~~--w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~~~~~~~~n~~k~i~~~~a~ll~~e~~ 98 (533) |...++ |..- .-|.+..... ..+......|....-.+ ..+.-+...--...++.+|+-+-.=|. T Consensus 1 ~~~~~~~~~~k~--~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~Ia~~ia~lp~ 67 (409) T protein:vir:94 1 MAKENIVTRIKK--KLIDNWIDQS-----------ASKLYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPL 67 (409) T ss_pred Ccccccchhhhh--HHhhhhhcCC-----------cccccccccccCccccccchhhhhccHHHHHHHHHHHHhhhhCce Confidence 111111 0000 0000000000 00000000000000000 000111112223333444444433343 Q ss_pred eEeeCCCchHHHHHHHHHHhh--c--cHHHH-HHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEec-CCc Q lcl|NC_016654. 99 KFLDAGKSKEVQARADLIFNT--P--RFHSS-LVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRW-GRL 172 (533) Q Consensus 99 ~i~~~~~~~~~~~~l~~i~~~--n--~f~~~-~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~-g~~ 172 (533) .+--..+ .....+..+|.. | .=... ....+...+..|.+|+.+..|..+. -+.+-.++|+.+.++.+. +. T Consensus 68 ~~~~~~~--~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~-~~~L~~l~~~~v~v~~~~~~~- 143 (409) T protein:vir:94 68 KMYEDYK--VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQ-PSKLFLLNPDVVEMLIENQSR- 143 (409) T ss_pred eEeeccc--ccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc-EEEEEEEcCceeEEEEeCCCc- Confidence 3311111 111223333321 1 11122 3344556677899998888776543 245666778777766443 11 Q ss_pred eEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEE Q lcl|NC_016654. 173 VAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAY 252 (533) Q Consensus 173 ~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 252 (533) .++ | .+...+ |.++.+ +.--+.| T Consensus 144 --~~~------------y-------------~~~~~~----g~~~~~--------------------------~~~dvih 166 (409) T protein:vir:94 144 --ELY------------Y-------------SIHAAT----GNKLIV--------------------------HNMDMLH 166 (409) T ss_pred --EEE------------E-------------EEEcCC----ceEEEE--------------------------ccccEEE Confidence 011 1 111000 111100 0001233 Q ss_pred ecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHh-CcceeeechHHhcCCCCccccccCcchhhhh Q lcl|NC_016654. 253 VPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRI-GAGKVHASESVLTNLGMGQGVSLDEEQEVYS 331 (533) Q Consensus 253 ~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~-~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~ 331 (533) +++..+. ...+|.|.+..+ ...++. +.....+ .+.. +...-++ +.....-.....+.....|. T Consensus 167 ~r~~~~~--------~~~~G~s~l~~~-~~~i~~-~~~~~~~--~~~~~~~~~~~i----~~~~~~l~~e~~~~~~~~~~ 230 (409) T protein:vir:94 167 FKHIVAS--------NMVQGISPIDVL-KNTTDF-DNAVRTF--NLTEMQKPDSFM----LKYGSNVGKEKRQQVLEDFK 230 (409) T ss_pred ecCCCCC--------CccccccHHHHH-HHHHHH-HHHHHHH--HHHhcCCCCeeE----EecCCCCCHHHHHHHHHHHH Confidence 3321111 123577766542 344432 2222222 1222 2111111 11111100000011111111 Q ss_pred hcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc-hhHHHHHHHhhhHHHHH Q lcl|NC_016654. 332 RVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA-QTATEASGKKDLTVKTT 406 (533) Q Consensus 332 ~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~-~Tatai~~~~~~l~~~~ 406 (533) ... .+.++ +....++.++......++++..+...++|+...|+||..+|....+. .+.++. T Consensus 231 ~~~---~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~----------- 296 (409) T protein:vir:94 231 QYY---EENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEEL----------- 296 (409) T ss_pred HHh---hcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH----------- Confidence 110 01110 11223555666666778888888888999999999999998644322 122222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccC-CCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC Q lcl|NC_016654. 407 RAKARHFGSALGPLSTTCLRVDAIKFPG-KGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWD 485 (533) Q Consensus 407 ~~~~~~~~~al~~li~~il~l~~~~~~~-~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~ 485 (533) ....++.+|..++..+..-.+..+.. ........+.++.+.-+-.|..++++.+.+++.+|+|+.-++.+.+ ++. T Consensus 297 --~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~--g~~ 372 (409) T protein:vir:94 297 --NRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWE--DLP 372 (409) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh--CCC Confidence 12233334444444443222222211 1111223355555555678999999999999999999999976654 232 Q ss_pred HHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 486 DERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEG 532 (533) Q Consensus 486 dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 532 (533) +-+--.++ .. +....| .+.+.... ....|+++..++| T Consensus 373 p~~ggD~~---~~-~~n~~~-----~~~~~~~~-~~~kGG~~n~~e~ 409 (409) T protein:vir:94 373 PVEGGDKP---LI-SGDLYP-----IDTPLELR-KSLKGGDKNVNES 409 (409) T ss_pred CCCCcCeE---ee-cccccc-----cccchhhc-ccccCCCCCcCCC Confidence 21000000 00 000000 00000000 0122233444444 No 213 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=95.81 E-value=0.0014 Score=36.21 Aligned_cols=441 Identities=9% Similarity=-0.010 Sum_probs=163.9 Q ss_pred CCCC-CCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeec Q lcl|NC_016654. 1 MSLP-EANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHA 79 (533) Q Consensus 1 ~~~~-~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 79 (533) |.=- +--+-..=..+...+..++.-|..|..--.++.+|... ..+-+. ..+....++-= T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP------------------~~~~~~--~~~~~~~~~~d 60 (516) T protein:vir:96 1 MKQSIDLEYGGKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLP------------------YLMNDK--GDNETSQNGWQ 60 (516) T ss_pred CcchhhhhhhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhcc------------------cccCCC--CCccccCCccc Confidence 2111 11111222444444445444444444333333333211 111111 11122334444 Q ss_pred ChHHHHHHHHHHhhcCC-----CceEeeCCCc-------------hHHHH-------HHHHHHhhccHHHHHHHHHHHHh Q lcl|NC_016654. 80 PIPGVIAKLSTTELFSE-----QLKFLDAGKS-------------KEVQA-------RADLIFNTPRFHSSLVEAGESCS 134 (533) Q Consensus 80 n~~k~i~~~~a~ll~~e-----~~~i~~~~~~-------------~~~~~-------~l~~i~~~n~f~~~~~~~~~~~~ 134 (533) +-+...++.+|+-|.+- .+.|.....+ ..+++ .+...+..++|...+.++..... T Consensus 61 stg~~a~~~LAa~l~~~ltpp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~ 140 (516) T protein:vir:96 61 GVGAQATNHLANKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLI 140 (516) T ss_pred chHHHHHHHHHHHHHhhhcCCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHH Confidence 56677777777765543 1344433211 12333 34446778899999999999999 Q ss_pred hhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEe-cCCceEEEEEEEEe---------e-------------cCCceEEE Q lcl|NC_016654. 135 ALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFR-WGRLVAVTFWSELA---------G-------------GDGQEVWR 191 (533) Q Consensus 135 ~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~---------~-------------~~~~~~y~ 191 (533) +.|.+.+ |.|+.+ .+..++-..++..-+ +|++..+++-.+.. . .+.=.+|+ T Consensus 141 ~~G~a~l--~~d~~~----~~~~~pl~~y~v~~d~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~ 214 (516) T protein:vir:96 141 VAGSCML--YKPSKG----AISAIPMHHYVVNRDTNGDLLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYT 214 (516) T ss_pred hHCeEeE--EecCCC----CEEEEEcCeEEEeeCCCCCeeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEE Confidence 9999875 457543 255666666555444 47776655322110 0 00011344 Q ss_pred EEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCccccccccccccccc Q lcl|NC_016654. 192 HLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYL 271 (533) Q Consensus 192 ~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~ 271 (533) ++++...++ +.+|..-++ .....+.+.+..-+.|++. +|... ..+.| T Consensus 215 ~v~~~~~~~--~~~~~~~d~-------------------------~~~~~es~~~~~e~P~~~~-----Rw~~~-~ge~Y 261 (516) T protein:vir:96 215 HAKYLGDGF--WELKQSADD-------------------------IPVGKVSKIKSEKLPFIPL-----TWKRS-YGEDW 261 (516) T ss_pred eeeeeCCce--eEEEEEeCc-------------------------eeeccccccccccCCeeee-----eeeec-CCCCc Confidence 444332211 111111000 0000112222111222322 23322 24788 Q ss_pred ccchhhhhHHHHHHHHHHHHHHHHHHH-HhCcceeeechH-HhcCCCCccccccCcchhhhhhcccccccccccccccee Q lcl|NC_016654. 272 GRADLSTDLFPTFHELDRIYSSLMRDF-RIGAGKVHASES-VLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFE 349 (533) Q Consensus 272 G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~~~~i~v~~~-~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~ 349 (533) |+|--..++ +-+..|+..--...... ...+....|+++ .+++..-. +...+.+..- ..+....++ T Consensus 262 Grgp~~~~L-~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~-----~~~~g~i~~g-------~~~~v~~~q 328 (516) T protein:vir:96 262 GRPLAEDYS-GDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFV-----NSGTGEVVTG-------VEEDIHIVQ 328 (516) T ss_pred ccchHHHhh-HHHHHHHHHHHHHHHHHHHhcCCccccCcccccchhhhc-----cCCCceeecC-------Ccccceeee Confidence 998766655 55567765444443333 344455455432 22211100 0000111110 000000111 Q ss_pred eechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 350 FFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKA-RHFGSALGPLSTTCLRVD 428 (533) Q Consensus 350 ~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~-~~~~~al~~li~~il~l~ 428 (533) +........-...++.+-+.|....=+. .+.-..+...|||||..+.+......+-.- +.-...|..++..++.. T Consensus 329 -~~~~~d~~~~~~~i~~~~~rI~~af~~~--~l~~r~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~- 404 (516) T protein:vir:96 329 -LGKYADLTPISAVLEVYTRRIGVVFMME--TMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGLLE- 404 (516) T ss_pred -cCcccchhHHHHHHHHHHHHHHHHHhhh--hhccCCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHh- Confidence 1111111212222333322332211011 112122334699999877766555443321 11122233334333322 Q ss_pred HhhccCCCCCCceeEEEEeCCCCCC-----CHHH---HHHHHHHHHhCC-----CCCHHHHHHHh---CC-----CCCHH Q lcl|NC_016654. 429 AIKFPGKGAAPSEELELEWPKFARE-----SDLA---KAQTVQAWSVAS-----AASTKTKVAYL---HE-----DWDDE 487 (533) Q Consensus 429 ~~~~~~~~~~~~~~v~i~f~d~i~~-----d~~e---~a~~~~~l~~aG-----i~S~et~v~~l---~~-----~~~de 487 (533) . + ...+...+.++.-.++.. +... .++.+..+.++. .+-...+++.+ .+ --+++ T Consensus 405 ---~-~-p~lp~~~v~~~~vs~l~~l~r~~~~~~i~~~~~~i~~~~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~irs~e 479 (516) T protein:vir:96 405 ---A-G-ESFTSDLVDPVIITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAE 479 (516) T ss_pred ---c-C-CCCccccccceeechHHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHhCCCccccCCHH Confidence 1 1 223333344443222211 1111 111111111000 11223333322 11 02556 Q ss_pred HHHHHHHHHHHhhhccc--CccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 488 RVQEEADLIDNANTVSA--PTFGFGTDQPPLPTENDPATDPEAVD 530 (533) Q Consensus 488 e~~~El~rI~~E~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (533) |++++.+.-++.++..+ ...+.+..... ++.-+|. T Consensus 480 ev~~~~~~~~~~q~~~~~a~~~~~~~~~~~--------~~~~~~~ 516 (516) T protein:vir:96 480 EMAQEQEAQMQAQQAQMLEEGVAKAVPGVI--------QQELKEA 516 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhHHh--------hcccccC Confidence 66555443222222111 11111111100 0111111 No 214 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=95.28 E-value=0.0024 Score=34.96 Aligned_cols=196 Identities=12% Similarity=0.080 Sum_probs=86.0 Q ss_pred ccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCCccccccCcchhhhhhcccccccccccccccee Q lcl|NC_016654. 270 YLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFE 349 (533) Q Consensus 270 ~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~ 349 (533) .+....++..+.+--.++.+.+..+ +..+ +...+...+... ..++ T Consensus 1 V~k~~~l~~~~~~~~~~~~~r~~~~-~~~~----------------~~~~~~~ld~~~------------------e~~e 45 (201) T protein:vir:10 1 MWKAKGLADLCDDSDGAARLRLAQV-DNNS----------------GVGQAIGIDADS------------------EEYN 45 (201) T ss_pred CccchHHHHHhcCChHHHHHHHHHH-HHhh----------------hhhhhheeecCC------------------ccee Confidence 1111112111100001111222111 0000 000011111111 1122 Q ss_pred eechhhhhHHHHHHHHHHHHHHHHhhCCCh-hhcccCCCcc-hhHHHHHHHhhhHHHHHHHHH-HHHHHHHHHHHHHHHH Q lcl|NC_016654. 350 FFQPAIRVLEHDQGAALLLREVLRKTGYSP-VSLGLSDEVA-QTATEASGKKDLTVKTTRAKA-RHFGSALGPLSTTCLR 426 (533) Q Consensus 350 ~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~-~~~g~~~~~~-~Tatai~~~~~~l~~~~~~~~-~~~~~al~~li~~il~ 426 (533) +++. .+...-..+..+..+++..+|+|- .-||-..+|- .|+..-...| |..+..+| ..++..|.+|+..++ T Consensus 46 ~~~~--~lsGl~d~l~~~~~~iaa~s~iP~t~LfG~sp~Glnatge~d~~ny---yd~i~~~Qe~~l~p~le~l~~~~~- 119 (201) T protein:vir:10 46 VLNS--DIGGIDTFLSQKFDRIVALSGIHEIILKGKNVGGVSASQNTALETF---YGYVDRKRKAELLPLLEFLLPFIV- 119 (201) T ss_pred eeec--CcCChHHHHHHHHHHHHhHhcCchhhhcCCCCccccccchhHHHHH---HHHHHHHHHHHHHHHHHHHHHhhc- Confidence 2222 222334556777888888889885 3446555554 4665444334 33344444 667777777665321 Q ss_pred HHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCc Q lcl|NC_016654. 427 VDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPT 506 (533) Q Consensus 427 l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~ 506 (533) ...+++|.|+.-...+..++|++..+... . ...++. .+-.+.+++.++|.+... T Consensus 120 ------------~~~~~~~~f~pL~~~s~kekAei~~~~a~--a--~~~~~~--~g~i~~~e~r~~L~~~~~-------- 173 (201) T protein:vir:10 120 ------------TEQEWSVEFNPLSQVSDKDKSEILEKNVN--S--VAALIA--AGIIDADEARDTLRAIST-------- 173 (201) T ss_pred ------------CCCCceEeeCCCCCCCHHHHHHHHHHHHH--H--HHHHHH--cCCCCHHHHHHHHHhcCC-------- Confidence 12468999999999999998887665432 1 122232 344666667666654211 Q ss_pred ccccccc--CCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 507 FGFGTDQ--PPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 507 ~~~~~~~--~~~~~~~~~~~~~~~~~d~~ 533 (533) .+.-++. .+..+.++ +.+|+++++-| T Consensus 174 ~~~~~~~~~~~~~~~~e-~~dp~~~~~~~ 201 (201) T protein:vir:10 174 EVKIGEGSIQTEVVINE-SEDPLDVSANN 201 (201) T ss_pred cCCCCCCCCCccccccc-cCCCCCCCCCC Confidence 1111111 11111111 23444444444 No 215 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=95.27 E-value=0.0024 Score=34.95 Aligned_cols=374 Identities=11% Similarity=0.032 Sum_probs=138.3 Q ss_pred HHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhccc--CCCCCcccceeec-ChHHHHHHHHHHhhcCCCc Q lcl|NC_016654. 22 VAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRT--PTATGRAPKRYHA-PIPGVIAKLSTTELFSEQL 98 (533) Q Consensus 22 ~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~g~~~~~~~~-n~~k~i~~~~a~ll~~e~~ 98 (533) |-.|. |+ +...+..+. .....+.... .........++.. .--...++.+|+-+.+=|. T Consensus 1 Mg~~~-~f-----------~~k~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~V~~~I~~ia~~iA~~p~ 61 (403) T protein:vir:80 1 MGLFN-FF-----------RRKTRSEPT-------NAISWFLTQEAYDTLAIPGYTRLSDNPEVRMAVHKIAELISSMTI 61 (403) T ss_pred Ccccc-cc-----------ccccccccc-------chhhhhcccccccccccchhhhhhhhHHHHHHHHHHHHhhhhCce Confidence 33332 21 111100000 0011111100 0000001111211 1122445566665555454 Q ss_pred eEeeC--CCchHHHHHHHHHHh--hccH--HHHHHHH-HHHHhhh--CCEEEEEEEcCCCCCceEEEEEcCCeEEEEEec Q lcl|NC_016654. 99 KFLDA--GKSKEVQARADLIFN--TPRF--HSSLVEA-GESCSAL--SGSFQRIVWDPTIADNAWIDFVDADRAIPEFRW 169 (533) Q Consensus 99 ~i~~~--~~~~~~~~~l~~i~~--~n~f--~~~~~~~-~~~~~~~--G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~ 169 (533) .+--. +........+..+|. -|.+ ...+.+. +...+-. |-+|+.+.+|..+. -..+..++|+.+.++.+. T Consensus 62 ~~~~~~~~g~~~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g~-~~~L~~l~p~~v~~~~~~ 140 (403) T protein:vir:80 62 HLMQNTDNGDIRIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSGL-IDELIPLAPSKVSFVDTD 140 (403) T ss_pred EEEEecCCceeecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCCc-EEEEEEEcCCeeEEEEcC Confidence 43211 111111222333333 2211 1123333 3334433 55777777765432 234555667666544332 Q ss_pred CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccce Q lcl|NC_016654. 170 GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLT 249 (533) Q Consensus 170 g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 249 (533) +. +.|. |.+ ..++. +++ T Consensus 141 ~g---------------------------~~~~---y~~------~~~~~----------~ei----------------- 157 (403) T protein:vir:80 141 TG---------------------------YQIW---YQG------KAYNY----------DEV----------------- 157 (403) T ss_pred Cc---------------------------eEEE---Eee------cccch----------hhE----------------- Confidence 21 1111 110 00110 000 Q ss_pred eEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhC-cceeee--chHHhcCCCCccccccCc Q lcl|NC_016654. 250 AAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIG-AGKVHA--SESVLTNLGMGQGVSLDE 325 (533) Q Consensus 250 ~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~-~~~i~v--~~~~l~~~~~~~~~~~d~ 325 (533) ++++-+..+. ....|.|.... +...+.. .....++... ++.| ....++ +..+ . ... .+. T Consensus 158 ih~~~~~~~~--------~~~~G~s~~~~-~~~~i~~-~~~~~~~~~~~~~ng~~p~~il~~~~~~-~---~~~---~~~ 220 (403) T protein:vir:80 158 LHFIVNPDPE--------KPYMGRGYRVV-LKDIVNN-LKQATTTKKSFMSGKYMPSLIVKVDAAT-A---ELS---SEE 220 (403) T ss_pred EEEeccCCCc--------CccccccHHHH-HHHHHHH-HHHHHHHHHHHHhccCCcceEEEeCCCC-C---hHH---HHH Confidence 1111111111 11235665443 3334432 2233334333 3443 222222 2111 0 000 000 Q ss_pred chh-hhhhcccccccccccc-----ccceeeec-hhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHH Q lcl|NC_016654. 326 EQE-VYSRVGSGGFNANGDM-----ETIFEFFQ-PAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGK 398 (533) Q Consensus 326 ~~~-~~~~~~~~~~~~~~~~-----~~~i~~~~-~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~ 398 (533) ... .+.... +..++++.. ....+.++ .+....++++..+....+|+...|+||..+|..... +++.. T Consensus 221 ~~~~~~~~~~-~~~~~g~~~~~~~~~~~~~~~~~l~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~--~~~~~--- 294 (403) T protein:vir:80 221 GRNAVFKKYL-EASEAGQPWIIPAELLDVEQVKPLSLKDLAIHETVELDKRTVAGIFGVPAFLLGVGKYD--KDEYN--- 294 (403) T ss_pred HHHHHHHHHh-hhhhcCCeeeecccccccceeccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCcc--HHHHH--- Confidence 011 111110 111111110 00122233 244455777888888889999999999999753222 11111 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHH Q lcl|NC_016654. 399 KDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVA 478 (533) Q Consensus 399 ~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~ 478 (533) ..+..+|..+++.+..-.+..+.. ..+..+.++.+.-+..|..++++.+.+++.+|+|+..++.+ T Consensus 295 ------------~f~~~~l~P~~~~ie~~l~~kll~---~~~~~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t~NE~R~ 359 (403) T protein:vir:80 295 ------------NFINSTILPIAKGIEQELTRKLLI---SPDLYFKFNPRSLYAYDLKELAEVGSNMYVRGLMEGNEVRD 359 (403) T ss_pred ------------HHHHHHHHHHHHHHHHHHHHhccC---CCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHH Confidence 133344555554443322222211 22344555555667789999999999999999999999766 Q ss_pred HhCCCCCHHH-HHHHHHHHHHhhhcccCccccccccCCC---CCCCCCCCCCCCCCCCC Q lcl|NC_016654. 479 YLHEDWDDER-VQEEADLIDNANTVSAPTFGFGTDQPPL---PTENDPATDPEAVDEGE 533 (533) Q Consensus 479 ~l~~~~~dee-~~~El~rI~~E~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~d~~ 533 (533) .+ ++.+-+ .+ ++ -...+..|. .+.+...|++....+|+ T Consensus 360 ~~--gl~p~~ggd-~~--------------~~~~n~~pl~~~~~~~~~k~ge~~~~~~~ 401 (403) T protein:vir:80 360 WL--GLSPKEGLS-EL--------------VILENYIPLDKIGDQNKLKGGEKGGADGQ 401 (403) T ss_pred Hh--CCCCCCCCC-eE--------------eecccccchhhccchhhccCCCCCCCCCC Confidence 54 232210 00 00 000111111 00011111111111122 No 216 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=95.12 E-value=0.0027 Score=34.65 Aligned_cols=434 Identities=12% Similarity=0.023 Sum_probs=162.8 Q ss_pred hcCCHHHHHHHHhc-cCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHhhcCC--C---ceEee Q lcl|NC_016654. 29 WEGDLDKLATFYGA-EGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTELFSE--Q---LKFLD 102 (533) Q Consensus 29 ~~gd~~~l~~~y~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e--~---~~i~~ 102 (533) ++.. +.+.|.. +.....+.++..........+...-...+.+..++--.-+...++.+|+-|.+- | +.|.. T Consensus 1 mk~~---~~~~~~~lkR~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l 77 (510) T protein:vir:63 1 MKTT---AAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) T ss_pred ChhH---HHHHHHHHhccchHHHHHHHHHhhccccCCCCCCccccccCCCccchHHHHHHHHHHHHHhhhcCCCCccccc Confidence 2221 1111110 011112222222111111111000001112223344456667777777765543 1 34433 Q ss_pred CCCc-------------hHHHH-------HHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCe Q lcl|NC_016654. 103 AGKS-------------KEVQA-------RADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADR 162 (533) Q Consensus 103 ~~~~-------------~~~~~-------~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~ 162 (533) ...+ ..+.+ .+...+..++|...+.++.......|.+.+ |.|+++ ..+..++-.. T Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l--~~~~~~---~~~~~~pl~~ 152 (510) T protein:vir:63 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRDSDA---ATVVAWSLRS 152 (510) T ss_pred CCChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEE--EEcCCC---cEEEEEEcce Confidence 2111 12333 345567888999999999999999999754 467653 3466677666 Q ss_pred EEEEEe-cCCceEEEEEEEEeec--------------CCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccc Q lcl|NC_016654. 163 AIPEFR-WGRLVAVTFWSELAGG--------------DGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRD 227 (533) Q Consensus 163 ~~P~~~-~g~~~~v~f~~~~~~~--------------~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~ 227 (533) ++-.-+ +|++..+++-.+++.. .++..+. .-.|.+.++...+. + .|...+. T Consensus 153 y~v~~d~~G~vd~i~rr~~~t~~~l~e~~~~~~~~~~~~~~~~~------~v~v~~~V~~~~~~--~--~~~~sv~---- 218 (510) T protein:vir:63 153 YAVRRDATGRWMDIVLKQRYKSKDLDEEYKQDLMRAGRNLSGSG------SVDLYTHVQRKKGT--A--MEYAELY---- 218 (510) T ss_pred eEEeeCCCcCeeEEEeeeeccHHHHhHHhhhhhhccccccCCCc------ceEEEEEEEeecCC--C--ceEEEEE---- Confidence 555444 4788776654443210 0001111 11122233322110 0 0110000 Q ss_pred ccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCcceee Q lcl|NC_016654. 228 IAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGAGKVH 306 (533) Q Consensus 228 ~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~~~i~ 306 (533) .++ ++.....+.+.+..-+.|++. +|... ..+.||+|--..++ +-+..|+..--..... .+..+.... T Consensus 219 --~e~--dg~~~~~~~~~~~~e~P~~~~-----Rw~~~-~ge~YGrgp~~~~l-~D~k~L~~l~~~~l~~a~~a~~~~~l 287 (510) T protein:vir:63 219 --HEI--DGVRVGKEGRWPIHLCPYIVP-----TWNLA-PGEHYGRGHVEDYI-GDFAKLSLLSEKLGLYELESLEVLNL 287 (510) T ss_pred --EEe--cCceeccccccccccCceeee-----eeeec-CCCccccchHHHHH-HHHHHHHHHHHHHHHHHHHhccCCcc Confidence 000 000000111111111223332 23322 24778998776665 5567777654443332 233444444 Q ss_pred echH-HhcCCCCccccccCcchhhhhhccccccccccccccceeeec----hhhhh-HHHHHHHHHHHHHHHHhhCCChh Q lcl|NC_016654. 307 ASES-VLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQ----PAIRV-LEHDQGAALLLREVLRKTGYSPV 380 (533) Q Consensus 307 v~~~-~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~----~~ir~-e~~~~~l~~~l~~i~~~~g~s~~ 380 (533) |+++ ++.+..- .....+.+.. |. ...+..++ .++.+ .+-++.+..-++.+.+.. + T Consensus 288 v~p~g~~~~~~~-----~~~~~g~~v~--------g~--~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~-l--- 348 (510) T protein:vir:63 288 VDEAKGAVVDDY-----QDAEMGDYVP--------GG--AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-A--- 348 (510) T ss_pred cCcccccchhhh-----ccCCCceeec--------CC--cccceeeecCcccchHHHHHHHHHHHHHHHHHHHhh-c--- Confidence 4332 1111100 0000011111 00 01111111 12221 233333333344433311 1 Q ss_pred hcccCCCcchhHHHHHHHhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHH Q lcl|NC_016654. 381 SLGLSDEVAQTATEASGKKDLTVKTTRA-KARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKA 459 (533) Q Consensus 381 ~~g~~~~~~~Tatai~~~~~~l~~~~~~-~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a 459 (533) .-..+...|||||..+.+...+..+- ..+.-...|..|++.++.+... .+..+.+.+.+....-. ..+...++ T Consensus 349 --~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r--~gl~p~p~~~~~~~~v~--~is~Lara 422 (510) T protein:vir:63 349 --NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD--ALLQGLITKQHKPAIET--GLPALSRS 422 (510) T ss_pred --ccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh--ccCCCCCchhcccceec--chhHHHHH Confidence 11122345999999887776665544 2333333444555555444321 12222233333221111 12222222 Q ss_pred HHHH-------HHHhCCC-------CCHHHHHHHh---CC-C-----CCHHHHHHHHHHHHHhhhcc----cCccccccc Q lcl|NC_016654. 460 QTVQ-------AWSVASA-------ASTKTKVAYL---HE-D-----WDDERVQEEADLIDNANTVS----APTFGFGTD 512 (533) Q Consensus 460 ~~~~-------~l~~aGi-------~S~et~v~~l---~~-~-----~~dee~~~El~rI~~E~~~~----~~~~~~~~~ 512 (533) +.+. .+-..|. +-...+++.+ .+ + -+++|++++.++.+++.... ...+...+. T Consensus 423 q~~~~l~~~~q~l~~~~~~aq~~~~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~~~~qq~~~~~~~~~~~~~~a~~ 502 (510) T protein:vir:63 423 AAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEQQRQQAAQAQAAQETLLEGASD 502 (510) T ss_pred HHHHHHHHHHHHHHHhcCchhhhccCCHHHHHHHHHHHhCCChhHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 2222 1111121 1233333332 22 0 16667766654422222111 111111111 Q ss_pred cCCCCCCCCCCCC Q lcl|NC_016654. 513 QPPLPTENDPATD 525 (533) Q Consensus 513 ~~~~~~~~~~~~~ 525 (533) ... .+.|- T Consensus 503 ~~~-----~~~g~ 510 (510) T protein:vir:63 503 MTN-----ALAGV 510 (510) T ss_pred hcc-----cccCC Confidence 111 11111 No 217 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=94.87 E-value=0.0033 Score=34.20 Aligned_cols=432 Identities=12% Similarity=0.058 Sum_probs=163.7 Q ss_pred CCCCC---------CcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCC Q lcl|NC_016654. 1 MSLPE---------ANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATG 71 (533) Q Consensus 1 ~~~~~---------~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 71 (533) -|-|. .-+-.-|.+..++.--+.-++.-|.+ | -||.+.. |-+++.=- T Consensus 62 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----l-~~~~~~~------------------F~Gy~~la- 117 (695) T protein:vir:36 62 EPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDA----L-SFVTSSG------------------FPGFPTLV- 117 (695) T ss_pred CCCcccccceeceecccccCccccchhhhhhccccccccc----c-hhhhccC------------------cchHHHHH- Confidence 11111 11122344434333333332221111 1 1111110 00000000 Q ss_pred cccceeecChHHHHHHHHHHhhcCCCceE-------------eeC----CC-chHHHHHHHHHHhhccHHHHHHHHHHHH Q lcl|NC_016654. 72 RAPKRYHAPIPGVIAKLSTTELFSEQLKF-------------LDA----GK-SKEVQARADLIFNTPRFHSSLVEAGESC 133 (533) Q Consensus 72 ~~~~~~~~n~~k~i~~~~a~ll~~e~~~i-------------~~~----~~-~~~~~~~l~~i~~~n~f~~~~~~~~~~~ 133 (533) .--.++.-+.++..+|.-+..+=..+ ++. .. +.+.-+.|+.-++.=+.+..++++++.+ T Consensus 118 ---~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~erL~V~~~l~eaik~a 194 (695) T protein:vir:36 118 ---LLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHD 194 (695) T ss_pred ---HHhhccchhhHHHHHHHHhhcccceecccchhhhhhccccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 00011112222333333332221111 111 11 1123355666667778999999999999 Q ss_pred hhhCCEEEEEEEcCCCC--CceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCC Q lcl|NC_016654. 134 SALSGSFQRIVWDPTIA--DNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTAT 211 (533) Q Consensus 134 ~~~G~~~~~~~~D~~~~--~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~ 211 (533) -.+|+++..+-.+.+.. ..|.+ ..+.. ...|.++.++.++.+.- .+. .|... T Consensus 195 RlfGGa~~~i~i~gdd~~l~~PL~--~~~~~----I~kGslKGl~ViDp~~v-------------tP~-----~~n~~-- 248 (695) T protein:vir:36 195 QAFGRAHPYFKIKGDDQIMDTPLV--PRPYT----VPKGSFQGLRVVEPYWV-------------TPN-----NYNSI-- 248 (695) T ss_pred ccccceEEEEEeccCccccccccc--ccccc----ccCcceeeeEeeccccc-------------ccc-----hhhhc-- Confidence 99999986665542210 01110 00000 11233333322211100 000 00000 Q ss_pred cccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHH Q lcl|NC_016654. 212 SLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIY 291 (533) Q Consensus 212 ~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~ 291 (533) -|++.. +..++.-...+. .| +.+ .++.|+....+ +.- .+....+|.|....+. +-+++.+++- T Consensus 249 -----dP~spd----fgkP~~y~V~G~-kI-H~S--RL~~f~g~plP--d~L-Kp~y~~~GiSv~q~~~-e~V~~~~rT~ 311 (695) T protein:vir:36 249 -----NPVADD----FYKPSTWWMIGT-EV-HAT--RLHTIVSRPVG--DML-KPTYSFAGISMTQLAM-PYIDNWLRTR 311 (695) T ss_pred -----cchhhc----cCCCceEEEece-EE-eee--eEEEecCCCch--hhh-hcccccCcccHHHHHH-HHHHHHHHHH Confidence 000000 000000000000 00 001 11222222111 111 1223457898877643 5556666554 Q ss_pred HHHHHHHHhCcceeeechHHhcCCCCccccccCcch---hhhhhcccc-ccccccccccceeeechhhhhHHHHHHHHHH Q lcl|NC_016654. 292 SSLMRDFRIGAGKVHASESVLTNLGMGQGVSLDEEQ---EVYSRVGSG-GFNANGDMETIFEFFQPAIRVLEHDQGAALL 367 (533) Q Consensus 292 s~~~~~~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~---~~~~~~~~~-~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~ 367 (533) .....-+......++ -.++..-..++........- +.|+..... ..|. +.+ ++.|.++.+...-..+..+ T Consensus 312 ~~v~~Li~~~~v~~l-k~dla~aL~~g~~~~l~~R~eli~~~Rsn~G~~llDk---~~E--efeq~stslSGLddVi~qf 385 (695) T protein:vir:36 312 QSVSDIVKQFSVSGI-LMDLAQALMPGANVDLSMRAELINRYRDNRNILFLDK---ATE--EFFQFNTPLSGLDALQAQA 385 (695) T ss_pred hHHHHHHHhhhHHHH-HHHHHHhhcChhHHHHHHHHHHHHHhcCccceEEEec---CCc--ceEEEecccCCHHHHHHHH Confidence 444433322111111 11221111111111010000 112211111 1111 112 2334455566667777888 Q ss_pred HHHHHHhhCCChh-hcccCCCcc-hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEE Q lcl|NC_016654. 368 LREVLRKTGYSPV-SLGLSDEVA-QTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELEL 445 (533) Q Consensus 368 l~~i~~~~g~s~~-~~g~~~~~~-~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i 445 (533) +.+++..+++|.. -||-...|- .|++.=...|-+.+... .+..++..|+.++.+|..- .| |. .+ .++++ T Consensus 386 ~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~--Qe~~L~p~L~rl~~ii~rS---~~-G~--id-pdi~~ 456 (695) T protein:vir:36 386 QEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAY--QRNALQQLMNDVIVMIQLS---LF-GA--VD-PSIKW 456 (695) T ss_pred HHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHH---hc-CC--CC-CcceE Confidence 8888888999854 456655553 67774444444444322 3677888888887765331 22 22 22 36899 Q ss_pred EeCCCCCCCHHHHHHHHHH-------HHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCC Q lcl|NC_016654. 446 EWPKFARESDLAKAQTVQA-------WSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPT 518 (533) Q Consensus 446 ~f~d~i~~d~~e~a~~~~~-------l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~ 518 (533) +|+.--..++.|+|++..+ ++.+|+++..+...++ ..+..----......+.|+.+. T Consensus 457 ~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL----------------~~d~~s~Y~~~~D~~d~p~~~~ 520 (695) T protein:vir:36 457 QWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARL----------------NTEPDGPYAGKLDANDDPGVPA 520 (695) T ss_pred EeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHH----------------hcCCCcccccccccccCCCcCc Confidence 9998888888887775433 3344444444433322 2211000000001223333333 Q ss_pred CCCCCCC-------CCCCCCCC Q lcl|NC_016654. 519 ENDPATD-------PEAVDEGE 533 (533) Q Consensus 519 ~~~~~~~-------~~~~~d~~ 533 (533) +++-+|. .+.++.|+ T Consensus 521 ~~~~~~~~~~~~~~~~~~~~~~ 542 (695) T protein:vir:36 521 DDDIDGVLTYVQRLAEGGDTGA 542 (695) T ss_pred cchhhhhHhhhcCcccccccCC Confidence 3322221 11111111 No 218 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=94.77 E-value=0.0036 Score=34.01 Aligned_cols=433 Identities=11% Similarity=0.001 Sum_probs=159.7 Q ss_pred hcCCHH-HHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHhhcCC--C---ceEee Q lcl|NC_016654. 29 WEGDLD-KLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTELFSE--Q---LKFLD 102 (533) Q Consensus 29 ~~gd~~-~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e--~---~~i~~ 102 (533) ++.... +++++ + .....+.++..........+...-...+.+..++-=.-+...++.+|+-|.+- | +.|.. T Consensus 1 mk~~~~~~~~~l-k--r~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l 77 (510) T protein:vir:78 1 MKSTAAMLWEKL-R--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) T ss_pred ChhHHHHHHHHH-h--ccchHHHHHHHHHhhccccccCCCCcccccccCcccchHHHHHHHHHHHHHHhhcCCCCccccc Confidence 222211 11111 1 11112222222211111111000001111112333445666777777665443 1 34433 Q ss_pred CCCc-------------hHHHHH-------HHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCe Q lcl|NC_016654. 103 AGKS-------------KEVQAR-------ADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADR 162 (533) Q Consensus 103 ~~~~-------------~~~~~~-------l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~ 162 (533) ...+ ..+.++ +...+..++|...+.++.+...+.|.+.+ |.++++. .+..++-.. T Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--~~~~~~~---~~~~~pl~~ 152 (510) T protein:vir:78 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSDEA---TVVAWSLRS 152 (510) T ss_pred CCChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEE--EEeCCCC---eEEEEEcce Confidence 2111 123333 34457788999999999999999998765 4565432 356666666 Q ss_pred EEEEEe-cCCceEEEEEEEEee--------------cCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccc Q lcl|NC_016654. 163 AIPEFR-WGRLVAVTFWSELAG--------------GDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRD 227 (533) Q Consensus 163 ~~P~~~-~g~~~~v~f~~~~~~--------------~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~ 227 (533) ++-.-+ +|++..+++-.+++. ..++..+.. -.|.+.++.-.... .|...+. T Consensus 153 y~v~~d~~G~vd~i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~~------v~v~~~V~~~~~~~----~~~~sv~---- 218 (510) T protein:vir:78 153 YAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGS------VDLYTHVQRRKGTA----MDYAEMY---- 218 (510) T ss_pred eEEeeCCCcCeeEEEeeeeccHHHHHHHhhHHhhhhhhccCCCce------EEEEEEEEeecCCC----CcEEEEE---- Confidence 555444 478877665444321 001111111 12222233211100 0000000 Q ss_pred ccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCcceee Q lcl|NC_016654. 228 IAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGAGKVH 306 (533) Q Consensus 228 ~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~~~i~ 306 (533) .++. +.....+.+.+..-+.|++. +|... ..+.||+|--..++ +-+..|+..--..... .+..+.... T Consensus 219 --~e~d--g~~i~~~~~~~~~e~P~~~~-----Rw~~~-~ge~YGrgp~~~~l-~D~k~L~~l~~~~l~~a~~a~~~~~l 287 (510) T protein:vir:78 219 --HEID--GVRVGETGRWPIHLCPYIVP-----TWNLA-PGEHYGRGHVEDYI-GDFAKLSLLSEKLGLYELESLEVLNL 287 (510) T ss_pred --EEec--CeeeccccccccccCCeeee-----eeeec-CCCccccchHHHHH-HHHHHHHHHHHHHHHHHHHhhcCCcc Confidence 0000 00000112221111223332 23322 24788998776665 5567777654444332 233333333 Q ss_pred ech-HHhcCCCCccccccCcchhhhhhccccccccccccccceeee----chhhhh-HHHHHHHHHHHHHHHHhhCCChh Q lcl|NC_016654. 307 ASE-SVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFF----QPAIRV-LEHDQGAALLLREVLRKTGYSPV 380 (533) Q Consensus 307 v~~-~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~----~~~ir~-e~~~~~l~~~l~~i~~~~g~s~~ 380 (533) |++ .++.+..- .+...+.+.. |.. ..+..+ ..++.+ .+-++.+..-++.+.+.. +. T Consensus 288 v~p~g~~~~~~l-----~~~~~g~~v~--------g~~--~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF~~~-l~-- 349 (510) T protein:vir:78 288 VDEAKGAVVDDY-----QDAEMGDYVP--------GGA--EAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-AN-- 349 (510) T ss_pred cCCccccchhhh-----ccCCCceeec--------CCc--ccccccccCcccchHHHHHHHHHHHHHHHHHHhhc-cc-- Confidence 332 21111100 0000011111 000 011111 122222 233334444444443311 11 Q ss_pred hcccCCCcchhHHHHHHHhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHH Q lcl|NC_016654. 381 SLGLSDEVAQTATEASGKKDLTVKTTRA-KARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKA 459 (533) Q Consensus 381 ~~g~~~~~~~Tatai~~~~~~l~~~~~~-~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a 459 (533) -..+...|||||..+.+...+..+- ..+.-...|..|++.++.+... .+..+.+.+.+....=.+ .+....+ T Consensus 350 ---~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r--~gl~p~p~~~~~~~~v~~--is~Lara 422 (510) T protein:vir:78 350 ---QRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD--ALLQGLITKQHKPAIETG--LPALSRS 422 (510) T ss_pred ---cCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh--ccCCCCCcccccceeeec--ccHHHHH Confidence 1122345999999887776665554 2333333445555555444321 122222222222111111 2222222 Q ss_pred HHHH-------HHHhCCC-------CCHHHHHHHh---CCCC-------CHHHHHHHHHHHHHhh----hcccCcccccc Q lcl|NC_016654. 460 QTVQ-------AWSVASA-------ASTKTKVAYL---HEDW-------DDERVQEEADLIDNAN----TVSAPTFGFGT 511 (533) Q Consensus 460 ~~~~-------~l~~aGi-------~S~et~v~~l---~~~~-------~dee~~~El~rI~~E~----~~~~~~~~~~~ 511 (533) +.+. .+-..|. +....+++.+ .+ + +++|++++.++-++.. ++..-.+.+++ T Consensus 423 q~~~~l~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~G-v~p~~ivrs~eev~a~~~~~~~q~~~~~~~~~a~~~~~~ 501 (510) T protein:vir:78 423 AAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFS-VDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGAS 501 (510) T ss_pred HHHHHHHHHHHHHHHhcChhhhhhcCCHHHHHHHHHHHhC-CChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 2222 1211121 2233333332 22 2 5566665544321111 11111112222 Q ss_pred ccCCCCCCCCCCCC Q lcl|NC_016654. 512 DQPPLPTENDPATD 525 (533) Q Consensus 512 ~~~~~~~~~~~~~~ 525 (533) +..+. ..|- T Consensus 502 ~~~~~-----~~g~ 510 (510) T protein:vir:78 502 DMTNA-----LAGV 510 (510) T ss_pred hhccc-----CCCC Confidence 11111 1111 No 219 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=94.65 E-value=0.0039 Score=33.82 Aligned_cols=442 Identities=11% Similarity=0.030 Sum_probs=193.1 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHh--hhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceee Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAE--SHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYH 78 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~--~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 78 (533) +..+.. ++=||-.-+-+ ..|+. +..=+.| .+-..|.+......+ + .++..|+.+- . T Consensus 16 ~~~~~~-S~~~p~~~DGa-~~i~~~~~~~~~~g---~~~~~~~~~~~~~~~--~-eLI~~YR~ma--------------~ 73 (511) T protein:vir:56 16 EKNPVR-SFSAPDNVDGA-KEIHTNLLAPQLGH---AIIPSDAQSEGTIPV--K-ELIKSYRALA--------------E 73 (511) T ss_pred ccCCcc-cccCCCCCCCc-eEEecccccceecc---eeccccccccCccch--H-HHHHHHHHHh--------------h Confidence 333322 23333331110 00000 0000001 001111111111001 1 1222222210 1 Q ss_pred cChHHHHHHHHHHh-h----cCCCceEeeCCCc------hHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcC Q lcl|NC_016654. 79 APIPGVIAKLSTTE-L----FSEQLKFLDAGKS------KEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDP 147 (533) Q Consensus 79 ~n~~k~i~~~~a~l-l----~~e~~~i~~~~~~------~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~ 147 (533) .+-+.-.++..++= + ..+|+++.+++-+ +...+..+.|++-=+|++..++......+.|..|+|..+|+ T Consensus 74 ~pEvd~Av~eIvne~iv~d~~~~pV~l~ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~ 153 (511) T protein:vir:56 74 YHEVDDAIQEIVDEAIVYENDKEVVWLNLDNTDFSENIKAKINEEFDRVVSLLQMRKHGYKWFRKWYVDSRIYFHKILDK 153 (511) T ss_pred ccchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecc Confidence 11122222332221 1 1235555553322 12445566677777899999999999999999999999987 Q ss_pred CCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeec--CCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccc Q lcl|NC_016654. 148 TIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGG--DGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPAT 225 (533) Q Consensus 148 ~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~--~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~ 225 (533) . .+=..+.+++|.++-++.. +... ++..+ -.++..|.+|...+-.-+..+ .... T Consensus 154 k-~GI~eLr~lDPr~i~~vr~------------i~~~~~~~~~v-------~~~~~ey~~Y~~~~~~~~~~~--~~~~-- 209 (511) T protein:vir:56 154 D-NNIIELRPLNPMKMELVRE------------IQKETIDGVEV-------VKGTLEYYVYKQSDYKMPSWM--SATN-- 209 (511) T ss_pred c-cceeehhhcCcccchhhhh------------hhccccccccc-------ccceeeeeEecCCCcccCccc--cccc-- Confidence 5 3445566677776665421 1111 11111 123456666642211100000 0000 Q ss_pred ccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHH--HHHHHHhCcc Q lcl|NC_016654. 226 RDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSS--LMRDFRIGAG 303 (533) Q Consensus 226 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~--~~~~~~~~~~ 303 (533) .......+ +.-.+.|...+. ..+....++..|-+-.||+++ ..|=-+-+. +.|.-|.-.+ T Consensus 210 --------~~~~~vkI----~~daI~y~hSGL-----~d~~~~~g~i~syLhkAiKp~-NQLkm~EDAlVIYRitRAPeR 271 (511) T protein:vir:56 210 --------RAQTSFRI----PKDAIVFAHSGL-----MRGCADDPYIIGYLDRAIKPA-NQLKMLEDALVIYRLARAPER 271 (511) T ss_pred --------ccccceee----chhheeeecccc-----eeccCCCCeeeccchhhhHHH-HhhHHHHhhHHHHhhhccccc Confidence 00000000 011122222221 111122344555565666553 333111111 2233466677 Q ss_pred eeee-c---------hHHhcC----------CCCccccccCcchhhhhhcc--ccccccccccccceeeechhhhhHHHH Q lcl|NC_016654. 304 KVHA-S---------ESVLTN----------LGMGQGVSLDEEQEVYSRVG--SGGFNANGDMETIFEFFQPAIRVLEHD 361 (533) Q Consensus 304 ~i~v-~---------~~~l~~----------~~~~~~~~~d~~~~~~~~~~--~~~~~~~~~~~~~i~~~~~~ir~e~~~ 361 (533) |||- . +.+++. +....|. ...+.....++. +.+--.|+ ....|+++...-.. .-+ T Consensus 272 RvFYIDVGnLPk~KAeqYl~~iM~k~kNklVYDa~TGe-v~ddrk~msMlEDyWLpRReGg-rgTEItTLpGgqnl-gem 348 (511) T protein:vir:56 272 RVFYVDVGNLPTQKAQQYVNGIMQNVKNRVVYDTQTGQ-VKNTTNAMSMLEDYYLPRREGS-KGTEVSTLPGGQSL-GDI 348 (511) T ss_pred eEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCce-eccchhhhhhHhhhcccccCCC-CccceeeccccCCc-ChH Confidence 8873 1 111110 0111111 111111000000 01111122 22234444332222 235 Q ss_pred HHHHHHHHHHHHhhCCChhhcccCCC--c--chhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCC Q lcl|NC_016654. 362 QGAALLLREVLRKTGYSPVSLGLSDE--V--AQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGA 437 (533) Q Consensus 362 ~~l~~~l~~i~~~~g~s~~~~g~~~~--~--~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~ 437 (533) ..+..+.+.+....++|.+.+..+.+ + .--++||...+-.--.-+.+.+..|..-|.++++.-|.|.... .... T Consensus 349 ~DV~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgii--t~ee 426 (511) T protein:vir:56 349 EDVLYFNRKLYKAMRIPTSRAASEDQTGGINFGQGAEITRDELKFTKFVKRLQTKFETVITDPLKHQLIVNNII--TEEE 426 (511) T ss_pred HHHHHHHHHHHHHhCCCcccccCCCCccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCC--CHHH Confidence 56677778889999999888863321 1 1135677766666677788888888888888888776552211 0000 Q ss_pred C--CceeEEEEeCCCCCCCHHHHHH-------HHHHHH--hCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCc Q lcl|NC_016654. 438 A--PSEELELEWPKFARESDLAKAQ-------TVQAWS--VASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPT 506 (533) Q Consensus 438 ~--~~~~v~i~f~d~i~~d~~e~a~-------~~~~l~--~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~ 506 (533) . ....+.++|...---.+...++ .++++. -+...|.++..++.. -.+|+|.++|.++|++|... |. T Consensus 427 W~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~~yi~k~IL-r~tDeei~~~~k~I~~E~k~--~~ 503 (511) T protein:vir:56 427 WDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGKYYSHKYIQKNIL-RLSDDQITAMQSEIDEEETN--PR 503 (511) T ss_pred HHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhccccchHHHHHHHh-ccCHHHHHHHHHHHHHhhcC--CC Confidence 1 1245777776543333333333 222221 122469998777654 48999999999999999753 11 Q ss_pred cccccccC Q lcl|NC_016654. 507 FGFGTDQP 514 (533) Q Consensus 507 ~~~~~~~~ 514 (533) +....++. T Consensus 504 ~~~~e~~f 511 (511) T protein:vir:56 504 FQQDDQGF 511 (511) T ss_pred CCCcccCC Confidence 11100000 No 220 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=94.50 E-value=0.0042 Score=33.59 Aligned_cols=434 Identities=8% Similarity=0.008 Sum_probs=158.1 Q ss_pred CCCC-CCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeec Q lcl|NC_016654. 1 MSLP-EANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHA 79 (533) Q Consensus 1 ~~~~-~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 79 (533) |-=- +-.+...=..+...+..+..-|..|..--.++.+|. ....+.. ...+....++-= T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~------------------lP~~~~~--~~~~~~~~~~~d 60 (516) T protein:vir:10 1 MKQSTDLEYGGKRSKIPKLWEKFSTKRSSFLDRAKHYSKLT------------------LPYLMND--KGDNETSQNGWQ 60 (516) T ss_pred CCchhhHhhhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhh------------------cccccCC--CCCccccccccc Confidence 1100 111112222233333333333333332222222221 1111111 111222334545 Q ss_pred ChHHHHHHHHHHhhcCC-----CceEeeCCCc-------------hHHHHH-------HHHHHhhccHHHHHHHHHHHHh Q lcl|NC_016654. 80 PIPGVIAKLSTTELFSE-----QLKFLDAGKS-------------KEVQAR-------ADLIFNTPRFHSSLVEAGESCS 134 (533) Q Consensus 80 n~~k~i~~~~a~ll~~e-----~~~i~~~~~~-------------~~~~~~-------l~~i~~~n~f~~~~~~~~~~~~ 134 (533) +-+...++.+|+-|.+- .+.|.....+ ..++++ +...+..++|...+.++..... T Consensus 61 stg~~a~~~LAa~l~~~ltpp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~ 140 (516) T protein:vir:10 61 GVGAQATNHLANKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLI 140 (516) T ss_pred chHHHHHHHHHHHHHhhhcCCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHH Confidence 56677777777765543 1344433111 123333 3346778899999999999999 Q ss_pred hhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEe-cCCceEEEEEEEEeec----------------------CCceEEE Q lcl|NC_016654. 135 ALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFR-WGRLVAVTFWSELAGG----------------------DGQEVWR 191 (533) Q Consensus 135 ~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~-~g~~~~v~f~~~~~~~----------------------~~~~~y~ 191 (533) +.|.+++ |.|+.+ . +..++-..++..-+ +|++..+++-.+++.. +.-.+|+ T Consensus 141 ~~G~a~l--~~d~~~--~--~~~~pl~~y~v~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t 214 (516) T protein:vir:10 141 VAGSCML--YKPSKG--A--ISAIPMHHYVVNRDTNGDLLDIILLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYT 214 (516) T ss_pred hHCeEeE--EecCCC--C--eEEEEcCeEEEeeCCCCCeEEEeeeecccHHHHHHHhhhhhhhhhhhhccCCCCceEEEE Confidence 9999864 567643 2 55666666555444 4788776644332110 0112344 Q ss_pred EEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCccccccccccccccc Q lcl|NC_016654. 192 HLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYL 271 (533) Q Consensus 192 ~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~ 271 (533) ++++...++ +.+|...++ . ....+.+.+..-+.|++. +|... ..+.| T Consensus 215 ~v~~~~~~~--~~~~~~~d~---~----------------------~~~~~s~~~~~e~P~~~~-----Rw~~~-~ge~Y 261 (516) T protein:vir:10 215 HAKYLGEGF--WELKQSADD---I----------------------PVGKVSKIKSEKLPFIPL-----TWKRS-YGEDW 261 (516) T ss_pred EEEecCCCc--eEEEEeeCc---e----------------------eeccccccccccCCeeee-----eeeec-CCCCc Confidence 444322111 111110000 0 000012221111223322 23322 24778 Q ss_pred ccchhhhhHHHHHHHHHHHHHHHHHHH-HhCcceeeechHH-hcCCCCccccccCcchhhhhhcccccccccccccccee Q lcl|NC_016654. 272 GRADLSTDLFPTFHELDRIYSSLMRDF-RIGAGKVHASESV-LTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFE 349 (533) Q Consensus 272 G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~~~~i~v~~~~-l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~ 349 (533) |+|--..++ +-+..|+..--...... ...+....|+++. .++..- .+...+.+.. |.. ..+. T Consensus 262 Grgp~~~~L-~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l-----~~~~~g~~~~--------g~~--~~v~ 325 (516) T protein:vir:10 262 GRPLAEDYS-GDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHF-----VNSGTGEVVT--------GVE--EDIH 325 (516) T ss_pred ccchHHHhh-HHHHHHHHHHHHHHHHHHHhcCCCcccCcccccchhhh-----ccCCCceeec--------CCc--ccce Confidence 998766655 45567765444443332 3445555554332 221100 0011111111 000 0111 Q ss_pred eec--h--hhhh-HHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHH-HHHHHHHHHHH Q lcl|NC_016654. 350 FFQ--P--AIRV-LEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARH-FGSALGPLSTT 423 (533) Q Consensus 350 ~~~--~--~ir~-e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~-~~~al~~li~~ 423 (533) .++ . ++.+ .+-++.+..-++.+.+. . .+.-..+...|||||..+.+...+..+-.-.. -...|..++.. T Consensus 326 ~~q~~~~~d~~~~~~~i~~~~~rI~~af~~---~--~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r 400 (516) T protein:vir:10 326 IVQLGKYADLTPISAVLEVYTRRIGVVFMM---E--TMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMW 400 (516) T ss_pred eeecCcccchHHHHHHHHHHHHHHHHHHhh---h--hhhccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHH Confidence 111 1 2211 12223333333333221 1 11111233469999987766644443332111 11122233322 Q ss_pred HHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHH------Hh--CCC-------CCHHHHHHH---hCC--- Q lcl|NC_016654. 424 CLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAW------SV--ASA-------ASTKTKVAY---LHE--- 482 (533) Q Consensus 424 il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l------~~--aGi-------~S~et~v~~---l~~--- 482 (533) ++ ... ....+..-+.++-- ...+...+++.+..+ ++ +++ +....+++. ..+ T Consensus 401 ~~---~~~---~p~~P~~lv~~~~v--~~i~~L~raq~~~~i~~~~q~i~~~~q~~p~v~d~id~d~~~~~~a~~~gvp~ 472 (516) T protein:vir:10 401 GL---LEA---GDSFTSDLVDPVII--TGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAEL 472 (516) T ss_pred HH---Hhh---CCCCChhhcCccee--hhHHHHHHHHHHHHHHHHHHHHHHHhcCChHHHhhcCHHHHHHHHHHHhCCCh Confidence 21 111 11222222322211 112222222222221 00 111 011111111 111 Q ss_pred --CCCHHHHHHHHHHHHHhhhcc--cCccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 483 --DWDDERVQEEADLIDNANTVS--APTFGFGTDQPPLPTENDPATDPEAVD 530 (533) Q Consensus 483 --~~~dee~~~El~rI~~E~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (533) --+++|++++.++-++.|+.. .+..+.+.+.+ .. +.=++. T Consensus 473 ~~irs~eev~~~r~~~~~~q~~~~~~~~~~~~~~~~-~~-------~~~~~~ 516 (516) T protein:vir:10 473 PFLKSAEEMEQEQEAQMQAQQAQMLEEGVAKAVPGV-IQ-------QELKEA 516 (516) T ss_pred hccCCHHHHHHHHHHHHHHHHHHHHHHHhhhcccch-hh-------hhhhcC Confidence 025566665544332222211 11111111110 00 000000 No 221 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=94.40 E-value=0.0045 Score=33.44 Aligned_cols=428 Identities=11% Similarity=0.058 Sum_probs=186.6 Q ss_pred CCCC--CCcCC------CcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCc Q lcl|NC_016654. 1 MSLP--EANTA------WPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGR 72 (533) Q Consensus 1 ~~~~--~~~~~------~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 72 (533) +..| +.|+. .=|.....++. +|+...... .+ .. .++..|+.+. T Consensus 31 ~~~p~~~dGa~~I~~~~~~~~~~~~~~~-----------------~~~~~~~~~-~n-~~-eLI~~YR~ma--------- 81 (521) T protein:vir:10 31 FAVPDTADGAIEVDKQIDTTAPKTAIVQ-----------------SVLGYAPKI-QN-TK-DLINQYRSLS--------- 81 (521) T ss_pred cccccCCCCceeeccCCCccccccchhh-----------------hhhcccccc-ch-HH-HHHHHHHHHh--------- Confidence 3333 22221 00100111111 111111100 00 00 1111222210 Q ss_pred ccceeecChHHHHHHHHHHh-h----cCCCceEeeCCC--ch----HHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEE Q lcl|NC_016654. 73 APKRYHAPIPGVIAKLSTTE-L----FSEQLKFLDAGK--SK----EVQARADLIFNTPRFHSSLVEAGESCSALSGSFQ 141 (533) Q Consensus 73 ~~~~~~~n~~k~i~~~~a~l-l----~~e~~~i~~~~~--~~----~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~ 141 (533) ..+-+.-.++..++= + ..+|+++.++.. ++ ...+..+.|++-=+|++..++......+.|..|+ T Consensus 82 -----~~pEvd~Av~eIvneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~f 156 (521) T protein:vir:10 82 -----KYHEVDNAIDEIINDAIVQEDNRDTVYLDLDKTDWNESVKEMVREEFRTILKLLKFEREGKRHFRRWYVDSRIYF 156 (521) T ss_pred -----hccchhhHHHhhhcceEEecCCCceEEEEecCcccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeeEEE Confidence 111222223333321 1 123555655332 22 2445566677777899999999999999999999 Q ss_pred EEEEcCC--CCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecC--CceEEEEEEEecCeeEEEEEEeccCCccccee Q lcl|NC_016654. 142 RIVWDPT--IADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGD--GQEVWRHLERHESGYIVHAVYKGTATSLGWMM 217 (533) Q Consensus 142 ~~~~D~~--~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~--~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v 217 (533) |..+|.+ ..+=..+..++|.++-++ +++.+.+ +..++ +++-.|.+|....+. .. T Consensus 157 Hkiid~~~pk~GI~Elr~lDPr~i~~v------------r~i~k~~~~~~~v~-------~~~~e~f~Y~~~~~~---~~ 214 (521) T protein:vir:10 157 HKMIDPARPKDGIKELRLLDPRNVEYY------------RVNLKSNENGNDVY-------KGVKEFFTYGATEDN---RY 214 (521) T ss_pred EEEeeCCCccccceeeeeeCCcceeee------------eeecCCCCCcchhh-------ccceeeeeeccCCCc---ee Confidence 9999843 123345666777766544 2222111 11111 122344455422111 00 Q ss_pred ehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHH--HH Q lcl|NC_016654. 218 ALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSS--LM 295 (533) Q Consensus 218 ~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~--~~ 295 (533) +.+- ..+....++ .-.+.|..-+.-.. ..+...|-+-.||+++ ..|=-+-+. +. T Consensus 215 ~~~g------------~~~~~vkI~----~daI~y~hSGL~d~-------~~~~i~syLhkAiKp~-NQLkm~EDAlVIY 270 (521) T protein:vir:10 215 NISG------------NSNNLVQIP----IDAIVYSHSGKVDI-------DGKTIVGYLHNVIKPA-NQLKMLEDAMVIY 270 (521) T ss_pred cCCC------------CCCcceeec----hhheeeecccceeC-------CCCceeccchhhhHhH-HhhHHHHhhHHHH Confidence 0000 000000010 01122222111000 1233444555566543 333111111 22 Q ss_pred HHHHhCcceeee-c---------hHHhcC----------CCCccccccCcchhhhhhcc--ccccccccccccceeeech Q lcl|NC_016654. 296 RDFRIGAGKVHA-S---------ESVLTN----------LGMGQGVSLDEEQEVYSRVG--SGGFNANGDMETIFEFFQP 353 (533) Q Consensus 296 ~~~~~~~~~i~v-~---------~~~l~~----------~~~~~~~~~d~~~~~~~~~~--~~~~~~~~~~~~~i~~~~~ 353 (533) |.-|.-.+|||- . +.+++. +....|. ...+.....++. +.+--.|+ ....|++... T Consensus 271 RitRAPeRRvFYIDvGnlpk~KAeqYl~~iM~k~kNklVYDa~TGe-v~ddrk~msMlEDyWLpRReGg-rgTEI~TLpg 348 (521) T protein:vir:10 271 RITRAPERRVFYIDVGTMPNKKATQHLNNVMQGLKNRVVYDSSTGK-VKNSSNNLAMTEDYWLMRRDGK-ATTEVSTLPG 348 (521) T ss_pred hhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCce-eccchhhhhhHhhhcccccCCC-Cccceeeccc Confidence 444666777763 1 111110 0111111 111111000000 01111122 2223444433 Q ss_pred hhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCc--chhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_016654. 354 AIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEV--AQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIK 431 (533) Q Consensus 354 ~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~--~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~ 431 (533) .-.. .-+.-+..+.+.+....++|.+.++.++++ .--++||...+-.--..+.+.+..|..-|.++++.-|.|.... T Consensus 349 gqnl-gem~DV~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr~~EItRDEikF~KFI~rLR~rFs~~f~~~L~~qLilKgii 427 (521) T protein:vir:10 349 AQSM-GEMDDVRWFNRKLYESMKIPLSRLPQEGAGVTFGAGNDITRDELQFTKYIRGLQQQFEPIFLNPLRTNLMLKGKM 427 (521) T ss_pred cCCc-ChHHHHHHHHHHHHHHhCCCccccCCCCCceecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCC Confidence 2222 235566777788888899988888655321 1124566666666667788888888888888888776552211 Q ss_pred ccCCCCC--CceeEEEEeCCCCCCCHH-------HHHHHHHHHHh---CC-CCCHHHHHHHhCCCCCHHHHHHHHHHHHH Q lcl|NC_016654. 432 FPGKGAA--PSEELELEWPKFARESDL-------AKAQTVQAWSV---AS-AASTKTKVAYLHEDWDDERVQEEADLIDN 498 (533) Q Consensus 432 ~~~~~~~--~~~~v~i~f~d~i~~d~~-------e~a~~~~~l~~---aG-i~S~et~v~~l~~~~~dee~~~El~rI~~ 498 (533) ..... ....+.++|...---.+. +.+..++++.. .| ..|.++..++.. -.+|+|.++|.++|++ T Consensus 428 --t~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~yvGky~s~dyi~k~IL-r~tDeeik~~~k~I~~ 504 (521) T protein:vir:10 428 --SVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEVTGKYLSHEYVMKNIL-RMSDEDIKTEREKIDG 504 (521) T ss_pred --CHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCccccccccchHHHHHHHh-cCCHhHHHHHHHHHHH Confidence 00001 124577777644332222 33344444422 23 689998776654 4899999999999999 Q ss_pred hhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 499 ANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 499 E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) |.... -+ +.+ .++++| T Consensus 505 E~~~~--~~-------~~p----------~~e~~d 520 (521) T protein:vir:10 505 ELKDS--VY-------KNP----------EDPMEE 520 (521) T ss_pred hhhCC--CC-------CCC----------cchhhc Confidence 97431 00 000 111111 No 222 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=94.29 E-value=0.0048 Score=33.28 Aligned_cols=419 Identities=15% Similarity=0.074 Sum_probs=144.8 Q ss_pred CCCCCC---cCCCcCcchHHHHHHHHhh------hHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCC-- Q lcl|NC_016654. 1 MSLPEA---NTAWPPPELAAVTARVAES------HVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTA-- 69 (533) Q Consensus 1 ~~~~~~---~~~~pp~~~~~~~~~~~~~------~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 69 (533) |.|=+. ....+|.--.......... ..++.+.- .+...|+++.... T Consensus 1 M~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~~~~~~~~g~~~~~~ 57 (466) T protein:vir:81 1 MRLIDRLLSTRGAAPRMSIDDYAQMLNEFAFNGIGYGFGGGV-----------------------PRIQQTLAGPSTELA 57 (466) T ss_pred CchhHHHhhccCcccccchhhhhhhhhhhhcccccccccccc-----------------------HHHHHhhcccccccc Confidence 443310 1111221111111000000 00000000 1112222221111 Q ss_pred --CCcc---cceeecChHHHHHHHHHHhhcCCCceEeeCCCc---hHHHHHHHHHHhhcc----HHHHHHHHHHHHhhhC Q lcl|NC_016654. 70 --TGRA---PKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKS---KEVQARADLIFNTPR----FHSSLVEAGESCSALS 137 (533) Q Consensus 70 --~g~~---~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~---~~~~~~l~~i~~~n~----f~~~~~~~~~~~~~~G 137 (533) .|.. ..-+.+.--..+++.+|+-+.+=|..+.-..+. +.....+..++..-+ ........+...+..| T Consensus 58 ~~~g~~v~~~~a~~~~~v~~~i~~Ia~~ia~lp~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~G 137 (466) T protein:vir:81 58 PDTFVGLATQAYQANGPVFACMLVRQLVFSSVRFRWQRLRDGKPSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAG 137 (466) T ss_pred CccccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCceeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcC Confidence 1111 112334445566777777766656544322111 111122333433222 1223344455667789 Q ss_pred CEEEEEEEcCCCC-------CceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccC Q lcl|NC_016654. 138 GSFQRIVWDPTIA-------DNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTA 210 (533) Q Consensus 138 ~~~~~~~~D~~~~-------~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~ 210 (533) .+|+.+..+..+. .-+.+..++++++.+..... +. ..+.| .|.-.. T Consensus 138 nay~~i~r~~~g~l~~~~~g~~~~l~~l~~~~v~~~~~~~---------------~~-----------~~~~y-~~~~~~ 190 (466) T protein:vir:81 138 NSYWTIVDGEFVRMRPDWVDVVVEERMVRGGRGELGGGQL---------------GW-----------RKVGY-LYTEGG 190 (466) T ss_pred CeEEEEEecCccccccccCcceeEEEEecCcceEEEEcCC---------------Cc-----------eEEEE-EEEecC Confidence 9999887764321 01223333344333332111 00 00111 011100 Q ss_pred Cccc-ceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHH Q lcl|NC_016654. 211 TSLG-WMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDR 289 (533) Q Consensus 211 ~~lG-~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~ 289 (533) ...+ ..+. .+.--+.|+....+. + ...+|.|.+..+. ..| .+.. T Consensus 191 ~~~~~~~~~--------------------------~~~~dviHir~~~~~--~-----d~~~G~s~i~~~~-~~i-~~~~ 235 (466) T protein:vir:81 191 RQSGNESVG--------------------------FLAEDVVHFAPIPDP--L-----ASYRGMSWLTPIL-REI-RADQ 235 (466) T ss_pred cccccceee--------------------------eccccEEEEcCCCCc--c-----cccccccHHHHHH-HHH-HHHH Confidence 0000 0000 000112333321110 0 1235777776543 444 3344 Q ss_pred HHHHHHHH-HHhCcceeeechHHhcCCCCccccccCcchhhhhhccccccccc----cccccceeeechhhhhHHHHHHH Q lcl|NC_016654. 290 IYSSLMRD-FRIGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNAN----GDMETIFEFFQPAIRVLEHDQGA 364 (533) Q Consensus 290 ~~s~~~~~-~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~----~~~~~~i~~~~~~ir~e~~~~~l 364 (533) +..++... |+.+...-.| +...+.-.....+.....+.....+..+.+ .+....++.++......++++.. T Consensus 236 a~~~~~~~~f~ng~~p~gi----l~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~ 311 (466) T protein:vir:81 236 AMSKHQAKFFDNGATVNLV----IKHNPMADPAAVKKWADEVNSKHAGVDNAWKNLNLYPGADADVVGSNLQEIDFKNVR 311 (466) T ss_pred HHHHHHHHHHhcCCCcceE----EecCCCCCHHHHHHHHHHHHHHhcCccccccceEcCCCceEEEccCChhHHHHHHHH Confidence 44455444 3544322111 222111000001111111211111111111 11123356666666777888888 Q ss_pred HHHHHHHHHhhCCChhhcccCCC-cchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeE Q lcl|NC_016654. 365 ALLLREVLRKTGYSPVSLGLSDE-VAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEEL 443 (533) Q Consensus 365 ~~~l~~i~~~~g~s~~~~g~~~~-~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v 443 (533) +...++|+...|+||..+|...+ +..|...++... +..++.+|..++..+....+..+.... ..... T Consensus 312 ~~~~~~Ia~~fgVPp~~lG~~~~~~~st~sn~eq~~----------~~f~~~tl~P~~~~ie~~l~~~L~~~~--~~~~~ 379 (466) T protein:vir:81 312 GGGETRIAAAAGVPPVIVGLSEGLAAATYSNYGQAR----------RRLADGTAHPLWQNLSGCIGHVMPDMG--PDVRL 379 (466) T ss_pred HHHHHHHHHHhCCCHHHcccccCCCccccccHHHHH----------HHHHHHHHHHHHHHHHHHHHhhcCCcc--cCcce Confidence 88999999999999999986543 222322222222 223334444444443332222222211 11223 Q ss_pred EEEeC--CCCCCCHHHHHHH-------HHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcc-cCcccccccc Q lcl|NC_016654. 444 ELEWP--KFARESDLAKAQT-------VQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVS-APTFGFGTDQ 513 (533) Q Consensus 444 ~i~f~--d~i~~d~~e~a~~-------~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~-~~~~~~~~~~ 513 (533) .++|+ .-+-.|..+.++. +..++++|+ ++.++.....+ -+. . .+. -+... .+....+... T Consensus 380 ~~~f~~~~llr~d~~~r~~~~~~~~~~~~~~~~~g~-t~nE~r~~~~~-gd~-----~--~~~-~~~~~~~~~~~~~~~~ 449 (466) T protein:vir:81 380 WYDADDVPFLREDEKDAADIQKVRAETINTLITAGY-EPESVVAAVNS-GDL-----R--LLK-HTGLTSVQLLPPGVSA 449 (466) T ss_pred EEEecchhhhccCHHHHHHHHHHHHHHHHHHHHcCC-ChhhccccccC-Ccc-----c--ccc-CCCcchhhhccccccc Confidence 45554 4455676666554 344555554 34433221110 000 0 000 00000 0000000000 Q ss_pred CCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 514 PPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 514 ~~~~~~~~~~~~~~~~~d~~ 533 (533) .+..++...+|. +++|. T Consensus 450 ~~~~~~~~~~Gg---~~ngn 466 (466) T protein:vir:81 450 SASSDTPTSGGA---DDNGN 466 (466) T ss_pred ccCCCCcccCCC---CcCCC Confidence 000111111111 11122 No 223 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=94.25 E-value=0.005 Score=33.22 Aligned_cols=432 Identities=13% Similarity=0.068 Sum_probs=161.6 Q ss_pred CCCCC--------------------C-----------cCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhh Q lcl|NC_016654. 1 MSLPE--------------------A-----------NTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPS 49 (533) Q Consensus 1 ~~~~~--------------------~-----------~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~ 49 (533) -|.|+ | -+-.-|-+-.++.--+.-++.-|.+ | -||.+.. T Consensus 40 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----l-~~~~~~~----- 109 (695) T protein:vir:78 40 QPVPADMGRRGALNALDAAPVAEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDA----L-SFVTSSG----- 109 (695) T ss_pred cccchhhcccccccccccccccCCCcccccceeceeccccCCccccchhhhhhccccccccc----c-hhhhccC----- Confidence 11111 0 0111222222222222222111110 1 0111100 Q ss_pred HHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHhhcCCCceE-------------ee----CCCch-HHHH Q lcl|NC_016654. 50 GIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTELFSEQLKF-------------LD----AGKSK-EVQA 111 (533) Q Consensus 50 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i-------------~~----~~~~~-~~~~ 111 (533) |-+++.=- .--.++.-+.++..+|.-+..+=..+ ++ ...++ +.-+ T Consensus 110 -------------F~Gy~~la----~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~ 172 (695) T protein:vir:78 110 -------------FPGFPTLV----LLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLK 172 (695) T ss_pred -------------cchHHHHH----HHhhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHH Confidence 00110000 00011122223333333332221111 11 11111 2335 Q ss_pred HHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCC--CceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceE Q lcl|NC_016654. 112 RADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIA--DNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEV 189 (533) Q Consensus 112 ~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~--~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~ 189 (533) .|..-++.=+.+..++++++.+-.+|+++..+-.+.+.. ..|.+ ..+.. ...|.++.++.++.+.- T Consensus 173 ~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~--~~~~~----I~kGslKGl~ViDp~~v------ 240 (695) T protein:vir:78 173 QINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLV--PRPYT----VPKGSFQGLRVVEPYWV------ 240 (695) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeccCccccccccc--ccccc----ccCcceeeeEeeccccc------ Confidence 566666677899999999999999999986665542210 01110 00000 11233333222211100 Q ss_pred EEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCccccccccccccc Q lcl|NC_016654. 190 WRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLR 269 (533) Q Consensus 190 y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~ 269 (533) .+. .|... -|++.. +..++.-...+ ..| +.+ .++.|+....+ +.- .+... T Consensus 241 -------tP~-----~~n~~-------dP~spd----fgkP~~y~V~G-~kI-H~S--RL~~f~g~plP--d~L-Kp~y~ 290 (695) T protein:vir:78 241 -------TPN-----NYNSI-------NPVADD----FYKPSTWWMIG-TEV-HAT--RLHTIVSRPVG--DML-KPTYS 290 (695) T ss_pred -------ccc-----hhhhc-------cchhhc----cCCCceEEEec-eEE-eee--eEEEecCCCch--hhh-hcccc Confidence 000 00000 000000 00000000000 000 001 11222222111 111 12234 Q ss_pred ccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCCccccccCcch---hhhhhcccc-ccccccccc Q lcl|NC_016654. 270 YLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGMGQGVSLDEEQ---EVYSRVGSG-GFNANGDME 345 (533) Q Consensus 270 ~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~---~~~~~~~~~-~~~~~~~~~ 345 (533) .+|.|....+. +-+++.+++-.....-+......++ -.++.....++........- +.|+..... ..|. +. T Consensus 291 ~~GiSv~q~~~-e~V~~~~rT~~~v~~Li~~~~v~~l-k~dla~~L~~g~~~~l~~R~eli~~~Rsn~G~~llDk---~~ 365 (695) T protein:vir:78 291 FAGISMTQLAM-PYIDNWLRTRQSVSDIVKQFSVSGI-LMDLAQALMPGANVDLSMRAELINRYRDNRNILFLDK---AT 365 (695) T ss_pred cCcccHHHHHH-HHHHHHHHHHhHHHHHHHhhhhHHH-HHHHHHhhcChhHHHHHHHHHHHHHhcCccceEEEec---CC Confidence 57899877643 5566666555444433322111111 11222111111111010000 112211111 1111 11 Q ss_pred cceeeechhhhhHHHHHHHHHHHHHHHHhhCCChh-hcccCCCcc-hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 346 TIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPV-SLGLSDEVA-QTATEASGKKDLTVKTTRAKARHFGSALGPLSTT 423 (533) Q Consensus 346 ~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~-~~g~~~~~~-~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~ 423 (533) + ++.|.++.+...-..+..++.+++..+++|.. -||-...|- .|++.=...|-+.+... .+..++..|+.++.+ T Consensus 366 E--efeq~stslSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~--Qe~~L~p~L~rl~~i 441 (695) T protein:vir:78 366 E--EFFQFNTPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAY--QRNALQQLMNDVIVM 441 (695) T ss_pred c--ceEEEecccCCHHHHHHHHHHHHHhhhcCchhhhhccCCccccccchhhHHHHHHHHHHH--HHHHHHHHHHHHHHH Confidence 2 23344555666677778888888888999854 456655553 67774444444444322 367788888888776 Q ss_pred HHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHH-------HHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHH Q lcl|NC_016654. 424 CLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQA-------WSVASAASTKTKVAYLHEDWDDERVQEEADLI 496 (533) Q Consensus 424 il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~-------l~~aGi~S~et~v~~l~~~~~dee~~~El~rI 496 (533) |..- .| |. .+ .+++++|+.--..++.|+|++..+ ++.+|+++..+...++ T Consensus 442 i~rS---~~-G~--id-pdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL---------------- 498 (695) T protein:vir:78 442 IQLS---LF-GA--VD-PSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARL---------------- 498 (695) T ss_pred HHHH---hc-CC--CC-CcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHH---------------- Confidence 5331 22 22 22 358999998888888877775433 3344554444433332 Q ss_pred HHhhhcccCccccccccCCCCCCCCCCCC-------CCCCCCCC Q lcl|NC_016654. 497 DNANTVSAPTFGFGTDQPPLPTENDPATD-------PEAVDEGE 533 (533) Q Consensus 497 ~~E~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~d~~ 533 (533) ..+..----......+.|+.+.+++-+|. .+.++.|+ T Consensus 499 ~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 542 (695) T protein:vir:78 499 NTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGA 542 (695) T ss_pred hcCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCC Confidence 22110000000012233333333322221 11111111 No 224 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=93.99 E-value=0.0057 Score=32.88 Aligned_cols=410 Identities=7% Similarity=-0.021 Sum_probs=142.9 Q ss_pred hHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcc-cceeecChH--HHHHHHHHH Q lcl|NC_016654. 15 LAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRA-PKRYHAPIP--GVIAKLSTT 91 (533) Q Consensus 15 ~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~-~~~~~~n~~--k~i~~~~a~ 91 (533) ++..+.+...-.+..... +........+..+.+++. .|.. .....++.| -..++.+|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~------------------~~~~~~~~~g~~~~~~~~-~~~~~~~~~a~~~~~v~~~v~~ia~ 61 (460) T protein:vir:10 1 MANRIIRALRELTGLDNK------------------FNDAFIKYIGQTFTKYDN-NGKTYLEQGYNINPDVYSCISQMAA 61 (460) T ss_pred CchhHHHHHhhhhccCCC------------------chHHHHHhhccccCCCcc-chhhhhHHHHhcchHHHHHHHHHHH Confidence 333333332221111100 000000000111111111 0111 000111222 233444444 Q ss_pred hhcCCCceEeeCCCchHH-------------------------------HHHHHHHHhhc----cHHHHHHHHHHHHhhh Q lcl|NC_016654. 92 ELFSEQLKFLDAGKSKEV-------------------------------QARADLIFNTP----RFHSSLVEAGESCSAL 136 (533) Q Consensus 92 ll~~e~~~i~~~~~~~~~-------------------------------~~~l~~i~~~n----~f~~~~~~~~~~~~~~ 136 (533) -+.+=|..+--...+... ...+..++..- .........+...+.. T Consensus 62 ~iA~lp~~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~ 141 (460) T protein:vir:10 62 KTVAVPYTIKVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLN 141 (460) T ss_pred hhhhCceEEEeccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhc Confidence 444434333211111000 01111122111 1223334445577788 Q ss_pred CCEEEEEEEcCCC--CCce-EEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcc Q lcl|NC_016654. 137 SGSFQRIVWDPTI--ADNA-WIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSL 213 (533) Q Consensus 137 G~~~~~~~~D~~~--~~~~-~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~l 213 (533) |.+|+.+..+..+ .+.+ .+-.++|+++.+.-+.+... ++. .+.+.+..|.. + T Consensus 142 Gnay~~i~r~~~~~~~G~~~~L~~l~~~~v~v~~~~~~~~---------------~~~------~~~~~~~~~~~--~-- 196 (460) T protein:vir:10 142 GNCYFYLMSPDDGINAGVPSQMYVLPAHLIKIVLKDDINL---------------LST------DSPIKSYMLIQ--G-- 196 (460) T ss_pred CCeEEEEEecCCCccCceeEEEEEEcCceEEEEEcCCCce---------------eee------eeeeeEEEEec--C-- Confidence 9999888776432 2222 35556777776654332110 000 00011101110 0 Q ss_pred cceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHH Q lcl|NC_016654. 214 GWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSS 293 (533) Q Consensus 214 G~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~ 293 (533) |....+ ..--+.|+....++.... +...+|.|.+..+. ..|. +...... T Consensus 197 g~~~~~--------------------------~~~evih~r~~~~~~~~~---~~~~~G~sp~~~~~-~~i~-~~~~~~~ 245 (460) T protein:vir:10 197 DQFIEF--------------------------NEDEVIHTKYANPNFDLQ---GSHLYGMSPIRAIL-RNIN-SQNSTID 245 (460) T ss_pred ceeEEe--------------------------cccceEEEecCCCCcccc---cCccccccHHHHHH-HHHH-HHHHHHH Confidence 111100 000112222111111111 11235777776543 4443 3333444 Q ss_pred HHHH-HHhCcceeeechHHhcCCCCccccccCcchhhhhhccccccccc----cccccceeeechhhhhHHHHHHHHHHH Q lcl|NC_016654. 294 LMRD-FRIGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNAN----GDMETIFEFFQPAIRVLEHDQGAALLL 368 (533) Q Consensus 294 ~~~~-~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~----~~~~~~i~~~~~~ir~e~~~~~l~~~l 368 (533) +... |+.|...-.+ +...+.-.....+.....+.....+..+.+ .+....++.++......++++..+... T Consensus 246 ~~~~~f~ng~~~~~i----~~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~ 321 (460) T protein:vir:10 246 NNVKTMQNGGVFGFI----HGGSTGLTQPQADSLKQRLTEMDKSPDRLSQIAGASGEIAFTKISLNTDELKPFDYLKYDQ 321 (460) T ss_pred HHHHHHhcCCCccee----eecCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHHH Confidence 4443 3554322222 211111000001111111211111111111 111223555666666778888888889 Q ss_pred HHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC-CCCCCceeEEEEe Q lcl|NC_016654. 369 REVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPG-KGAAPSEELELEW 447 (533) Q Consensus 369 ~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~-~~~~~~~~v~i~f 447 (533) ++|+...|+||..+|...++..+...++.. ....++.+|..++..+....+..+.. ........+.++| T Consensus 322 ~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~----------~~~f~~~~l~P~~~~ie~~ln~kl~~~~~~~~~~~i~~d~ 391 (460) T protein:vir:10 322 KAICNALGWSDKLLNNNEGGGLNTGNLEEE----------RKRVVTDNIQPDLVILKQAFDKKFIKRFKGYENAVIEWDI 391 (460) T ss_pred HHHHHHhCCCHHHhCCCCCCCCccccHHHH----------HHHHHHHHHHHHHHHHHHHHHHhhcCcccccCCceEEeec Confidence 999999999999998754433222122111 12233334444444443222222211 1111223344444 Q ss_pred CCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCC---CCC Q lcl|NC_016654. 448 PKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTEND---PAT 524 (533) Q Consensus 448 ~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~---~~~ 524 (533) +.- ....+...+...++.+|+|+..++.+.+ ++.. |..|.+ +..-...+..+..+..+ +++ T Consensus 392 ~~l--~~l~~d~~~~~~~~~~g~~T~NE~R~~~--g~~p---------i~~~~g---D~~~~~~n~~~~~~~~~~~~~~~ 455 (460) T protein:vir:10 392 SEL--PEMQTDMVAMASWLNTIPVTPNEIRIAM--KYET---------LNQDGM---DIVFMPSNKVRIDDVSNNLIDSA 455 (460) T ss_pred chh--hhHHHHHHHHHHHHhCCCCCHHHHHHHh--CCCC---------CCCCCC---CeeeecccccchhhcccccCCCc Confidence 332 2223334444557788999988866654 2221 000000 00000001111100000 111 Q ss_pred CCCCC Q lcl|NC_016654. 525 DPEAV 529 (533) Q Consensus 525 ~~~~~ 529 (533) +++.+ T Consensus 456 ~nq~~ 460 (460) T protein:vir:10 456 FNQNQ 460 (460) T ss_pred ccCCC Confidence 11111 No 225 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=93.83 E-value=0.0062 Score=32.67 Aligned_cols=430 Identities=12% Similarity=0.049 Sum_probs=161.3 Q ss_pred CCCCC--------------------C-----------cCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhh Q lcl|NC_016654. 1 MSLPE--------------------A-----------NTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPS 49 (533) Q Consensus 1 ~~~~~--------------------~-----------~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~ 49 (533) -|.|+ | -+-.-|-+-.++.--+.-++.-|.+ | -||.+.. T Consensus 40 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----l-~~~~~~~----- 109 (698) T protein:vir:10 40 QPVPADMGRRGALNALDAAPVAEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDA----L-SFVTSSG----- 109 (698) T ss_pred cccchhhcccccccccccccccCCCccccccccceeccccCCccccchhhhhhccccccccc----c-hhhhccC----- Confidence 11111 0 0111122222222222211111100 1 0111100 Q ss_pred HHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHhhcCCCceE-------------eeC----CCch-HHHH Q lcl|NC_016654. 50 GIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTELFSEQLKF-------------LDA----GKSK-EVQA 111 (533) Q Consensus 50 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i-------------~~~----~~~~-~~~~ 111 (533) |-+++.=- .--.++.-+.++..+|.-+..+=..+ ++. ..++ +.-+ T Consensus 110 -------------F~Gy~~la----~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~ 172 (698) T protein:vir:10 110 -------------FPGFPTLV----LLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLK 172 (698) T ss_pred -------------cchHHHHH----HHhhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHH Confidence 00111000 00011122223333333332221111 111 1111 2335 Q ss_pred HHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCC--CceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceE Q lcl|NC_016654. 112 RADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIA--DNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEV 189 (533) Q Consensus 112 ~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~--~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~ 189 (533) .|..-++.=+.+..++++++.+-.+|++...+-++.+.. ..|.+ ..+.. ...|.++.++.++.+.- T Consensus 173 ~L~~e~erl~V~~~l~eai~~aRlfGGa~~~i~I~gdd~~l~~PL~--~~~~~----I~kGslKGL~ViDp~~v------ 240 (698) T protein:vir:10 173 QINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLV--PRPYT----VPKGSFQGLRVVEPYWV------ 240 (698) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccccceEEEEEeecCccccccccc--ccccc----ccCccceeeeeeccccc------ Confidence 566666667899999999999999999986665542210 01110 00000 12233333322221100 Q ss_pred EEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCccccccccccccc Q lcl|NC_016654. 190 WRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLR 269 (533) Q Consensus 190 y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~ 269 (533) .+. .|... -|++.. +..++.-...+. .| +.++ ++.|+....+ +.- .+... T Consensus 241 -------tP~-----~~n~~-------dP~spd----fgkP~~y~V~G~-~I-H~SR--L~~~vg~pvp--d~L-Kp~y~ 290 (698) T protein:vir:10 241 -------TPN-----NYNSI-------NPVADD----FYKPSTWWMIGS-EV-HATR--LHTIVSRPVG--DML-KPTYS 290 (698) T ss_pred -------ccc-----hhhhc-------cchhhc----cCCCceEEEecc-ee-ccee--EEEecCCCch--hhh-cchhc Confidence 010 00000 000000 000000000000 00 0111 1122222111 111 12234 Q ss_pred ccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCCccccccCcch-----hhhhhcccc-ccccccc Q lcl|NC_016654. 270 YLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGMGQGVSLDEEQ-----EVYSRVGSG-GFNANGD 343 (533) Q Consensus 270 ~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~-----~~~~~~~~~-~~~~~~~ 343 (533) .+|.|..... .+-+++.+++-.....-+.......+ ..++..-..++.. .+... +.|+..... ..| + T Consensus 291 f~G~Sv~q~~-~e~V~~~~rT~~~v~~Li~~~~~~~l-~~dla~aL~~g~~--~~l~~R~eli~~~Rsn~G~~llD---k 363 (698) T protein:vir:10 291 FAGISMTQLA-MPYIDNWLRTRQSVSDIVKQFSVSGI-LMDLAQALTPGAN--VDLSMRAELINRYRDNRNILFLD---K 363 (698) T ss_pred cCCccHHHHH-HHHHHHHHHHhhhHHHHHHHhhHHHH-HHHHHHhcCChhh--HHHHHHHHHHHHhcCccceEEEe---c Confidence 5799988764 35566666655444443322111111 1122211122211 11111 112211111 111 1 Q ss_pred cccceeeechhhhhHHHHHHHHHHHHHHHHhhCCCh-hhcccCCCcc-hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 344 METIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSP-VSLGLSDEVA-QTATEASGKKDLTVKTTRAKARHFGSALGPLS 421 (533) Q Consensus 344 ~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~-~~~g~~~~~~-~Tatai~~~~~~l~~~~~~~~~~~~~al~~li 421 (533) +.+.++ |.+..+...-.-+..++.+++..+++|. .-||-...|- .|+..=...|-+.+... .+..++..|+.++ T Consensus 364 ~~Eefe--q~st~lSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~--Qe~~L~p~L~rl~ 439 (698) T protein:vir:10 364 ATEEFF--QFNTPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAY--QRNALQQLMNDVI 439 (698) T ss_pred CCcceE--EEecCcCCHHHHHHHHHHHHHhhhcCchhhhhccCCcccCccchhhHHHHHHHHHHH--HHHHHHHHHHHHH Confidence 122233 4445555666777778888888888885 4456655553 67774444444444322 3577888888887 Q ss_pred HHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHH-------HHhCCCCCHHHHHHHhCCCCCHHHHHHHHH Q lcl|NC_016654. 422 TTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQA-------WSVASAASTKTKVAYLHEDWDDERVQEEAD 494 (533) Q Consensus 422 ~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~-------l~~aGi~S~et~v~~l~~~~~dee~~~El~ 494 (533) .++..- .+ |. .+ .++++.|+.--..++.|+|++..+ ++..|+++..+..+++ T Consensus 440 ~ii~rS---~~-G~--id-p~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL-------------- 498 (698) T protein:vir:10 440 VMIQLS---LF-GA--VD-PSIKWQWNALRELDDLEVAEARYKQAQSDVLYVQEQVIRPDQVAARL-------------- 498 (698) T ss_pred HHHHHH---hc-CC--CC-CcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHH-------------- Confidence 765331 22 22 22 358999998888888888776443 2334444444433333 Q ss_pred HHHHhhhcccCccccccccCCCCCCCCCCC-------CCCCCCCCC Q lcl|NC_016654. 495 LIDNANTVSAPTFGFGTDQPPLPTENDPAT-------DPEAVDEGE 533 (533) Q Consensus 495 rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~d~~ 533 (533) ..+..----......+.|+.+++++-++ .+++.+.|+ T Consensus 499 --~~d~~s~Y~~~~d~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 542 (698) T protein:vir:10 499 --NTEPDGPYAGKLDANDDPGAPADDDIDGVLTYVQRMAEGGDTGA 542 (698) T ss_pred --hccCCCccccccCCcccCCCCCCCcchHHHhhhcCCcCCCCccc Confidence 2111000000000122222233322221 122222222 No 226 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=93.53 E-value=0.0072 Score=32.32 Aligned_cols=380 Identities=11% Similarity=0.043 Sum_probs=137.4 Q ss_pred cCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhc---ccCCCCCcccceeecChHHHHHHHHHHhhcCCCceEee---- Q lcl|NC_016654. 30 EGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHG---RTPTATGRAPKRYHAPIPGVIAKLSTTELFSEQLKFLD---- 102 (533) Q Consensus 30 ~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~---- 102 (533) .|=.++|...... ..+. ......... ..+.+.++ ......--...|+.+|+-+.+=|..+.- T Consensus 1 mg~~~~~~~~~~~--~~~~-------~~~~~~~~~~~~~~~~~t~~--~~~~~~~v~~cv~~Ia~~ia~~p~~v~~~~~~ 69 (403) T protein:vir:10 1 MGFKSWITEKLNP--GQRI-------IRDMEPVSHRTNRKPFTTGQ--AYSKIEILNRTANMVIDSAAECSYTVGDKYNI 69 (403) T ss_pred Ccchhhhhhccch--hhhh-------hhcccccccccCCcccccHH--HHHHHHHHHHHHHHHHHHHhhCceeEeecccc Confidence 3322222211100 0000 000000000 00000010 0111111222333444444333332211 Q ss_pred -CCCchHHHHHHHHHHhh--ccHH--HHHHHH-HHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEE Q lcl|NC_016654. 103 -AGKSKEVQARADLIFNT--PRFH--SSLVEA-GESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVT 176 (533) Q Consensus 103 -~~~~~~~~~~l~~i~~~--n~f~--~~~~~~-~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~ 176 (533) ...+.....-+..+|+. |... ..+.+. +......|.+|+.+ +. ..+-.++++.+...-+.+.+ T Consensus 70 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~--~~-----~~l~~l~~~~~~v~~~~~~~---- 138 (403) T protein:vir:10 70 VTYANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYW--DG-----TSLYHVPAALMQVEADANKF---- 138 (403) T ss_pred cccccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEE--eC-----ceeEeecCcceEEEEcCCce---- Confidence 11111111223344432 2111 233333 44455667777543 21 12334555544332221111 Q ss_pred EEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCC Q lcl|NC_016654. 177 FWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNV 256 (533) Q Consensus 177 f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~ 256 (533) +| +..|. + +..++.+ + +.|+... T Consensus 139 ------------~~------------~~~~~-~----~~~~~~~----------e------------------iih~~~~ 161 (403) T protein:vir:10 139 ------------IK------------KFIFN-N----QINYRVD----------E------------------IIFIKDN 161 (403) T ss_pred ------------EE------------EEEec-C----ceeeccc----------c------------------eEEeccc Confidence 10 00010 0 0000000 0 0111100 Q ss_pred cccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCcceeeechHHhcCCCCccccccCcchhhhhhccc Q lcl|NC_016654. 257 TPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGS 335 (533) Q Consensus 257 ~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~ 335 (533) .. +... .....|.|.+..+. ..++ +......+.+. |+.|...-.| +...+.-.....+.-.+.+..... T Consensus 162 ~~---~~~~-~~~~~G~s~i~~~~-~~i~-~~~~~~~~~~~~f~ng~~~~gi----l~~~~~l~~e~~~~~~~~~~~~~~ 231 (403) T protein:vir:10 162 SY---VCGT-NSQISGQSRVATVI-DSLE-KRSKMLNFKEKFLDNGTVIGLI----LETDEILNKKLRERKQEELQLDYN 231 (403) T ss_pred cc---ccCC-CCCcccccHHHHHH-HHHH-HHHHHHHHHHHHHhccCCcceE----EEeCCCCCHHHHHHHHHHHHHHhC Confidence 00 0000 01234667665433 3332 23333334333 4554332222 221111110001111111221111 Q ss_pred cccccccc----cccceeeec--hhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHH Q lcl|NC_016654. 336 GGFNANGD----METIFEFFQ--PAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAK 409 (533) Q Consensus 336 ~~~~~~~~----~~~~i~~~~--~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~ 409 (533) +..+++.. ....++.++ .+....++++..+...++|+...|+||..+|.... .+..+. . T Consensus 232 g~~n~g~~~vl~~g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~--sn~e~~-------------~ 296 (403) T protein:vir:10 232 PSTGQSSVLILDGGMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGGNN--ANIRPN-------------I 296 (403) T ss_pred CcccCcceeecCCCceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC--cCHHHH-------------H Confidence 11111110 011233333 23445577888888899999999999999974322 122111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCC--CCCCHHHHHHHHHHHHhCCCCCHHHHHHHh-CCCCCH Q lcl|NC_016654. 410 ARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKF--ARESDLAKAQTVQAWSVASAASTKTKVAYL-HEDWDD 486 (533) Q Consensus 410 ~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~--i~~d~~e~a~~~~~l~~aGi~S~et~v~~l-~~~~~d 486 (533) ...++..|..++..+..-.+..+ ...+.++++.- +..|..+.++.+.+++.+|+|+..++...+ .+..++ T Consensus 297 ~~f~~~tl~P~~~~ie~~l~~~L-------~~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl~pi~~ 369 (403) T protein:vir:10 297 ELFYYMTIIPMLNKLTSSLTFFF-------GYKITPNTKEVAALTPDKEAEAKHLTSLVNNGIITGNEARSELNLEPLDD 369 (403) T ss_pred HHHHHHHHHHHHHHHHHHHHHhc-------CceeeeccchhhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCc Confidence 22333344444443333222222 12355555533 567899999999999999999999976654 112332 Q ss_pred HHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 487 ERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 487 ee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +.+.+-+ .| ....+ ...+..+.+++++....+|| T Consensus 370 ~~~d~~~----------~p-~n~~~--~~~~~~~~e~~~~~~~~~g~ 403 (403) T protein:vir:10 370 EQMNKIR----------IP-ANVAG--SATGVSGQEGGRPKGSTEGD 403 (403) T ss_pred ccccccc----------cc-ccccc--ccccCCCCcCCCCCCCcCCC Confidence 2221111 01 00000 11122233444555566666 No 227 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=93.52 E-value=0.0073 Score=32.31 Aligned_cols=441 Identities=11% Similarity=0.054 Sum_probs=184.3 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhcc-Cc-chhhHHHH-----------HHHHHHHHHhcccC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAE-GR-TSPSGIKA-----------RTKAAYEAFHGRTP 67 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~-~~-~~~~~~~~-----------~~~~~~~~~~~~~~ 67 (533) |-|++-=..|--.+ ..+.++--+.. .. ..+....+ .+.|+++.++.... T Consensus 1 ~~~~~lf~f~~~~d------------------~~~~~~~~~~~~~s~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~d~~~ 62 (516) T protein:vir:10 1 MKFLDLFKFWDRVD------------------QNEYDERLKQGHESIATPKKDDGATEIEAREGESSYNALMQQFFGIDN 62 (516) T ss_pred CCchHhcccccchh------------------hHHHHhhhcCCCCcccCCCCccCceeeecCcccccccceeeeeecccC Confidence 55543332332211 11111111000 00 00000000 01111111111111 Q ss_pred CCCCcc---cce--e-ecChHHHHHHHHHHh-hc----CCCceEeeCCC--ch----HHHHHHHHHHhhccHHHHHHHHH Q lcl|NC_016654. 68 TATGRA---PKR--Y-HAPIPGVIAKLSTTE-LF----SEQLKFLDAGK--SK----EVQARADLIFNTPRFHSSLVEAG 130 (533) Q Consensus 68 ~~~g~~---~~~--~-~~n~~k~i~~~~a~l-l~----~e~~~i~~~~~--~~----~~~~~l~~i~~~n~f~~~~~~~~ 130 (533) ...... +++ + ..+=+.-.++..++= +. .+|+++.+++- ++ ...+..+.|++-=+|++..++.. T Consensus 63 ~~~~~~~LI~~YR~ma~~pEvd~Av~eIvneaiv~d~~~~pV~l~l~~~e~s~sik~kI~eeF~~Il~ll~F~~~~~~~f 142 (516) T protein:vir:10 63 NISGTKDLINTYRQLTNNPEVERAVANIVNEAVVYEKGHKVVSLDLDDTEFSSSIKDKILEEFDEICRLLDASRKLDTLF 142 (516) T ss_pred ccccHHHHHHHHHHhhhccchhHHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHH Confidence 000000 000 0 000111122222221 11 12555555432 11 23455566777778999999999 Q ss_pred HHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeec--CCceEEEE-EEE--ecCeeEEEEE Q lcl|NC_016654. 131 ESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGG--DGQEVWRH-LER--HESGYIVHAV 205 (533) Q Consensus 131 ~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~--~~~~~y~~-lE~--h~~~~I~~~~ 205 (533) ....+.|..||+.+.|....+=..+..++|.++.++.. +.+. ++..+++- .|+ |.+| +.. T Consensus 143 R~WYVDgRi~fhKiid~~k~GI~elr~lDPr~i~~vR~------------i~~~~~~~~~v~~~~~e~~~Y~~~---~~~ 207 (516) T protein:vir:10 143 RRWYIDSRIFFHKIMPNPKEGIVELRRLDPRHVEYYRE------------IVTSDVGGTSVVKGYREFFVYTTG---NEG 207 (516) T ss_pred HhhhhcceEEEEEEecCcccceeeeeeeCCcceeeEEe------------eecccCcchhhhhceeeeeeeecC---ccc Confidence 99999999999988986555556677888888777521 1111 11111110 010 1111 111 Q ss_pred EeccCC----cccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHH Q lcl|NC_016654. 206 YKGTAT----SLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLF 281 (533) Q Consensus 206 y~~~~~----~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~ 281 (533) |.-++. +.+.+++-+. +.|+.-.. .+..+... .|-+-.||+ T Consensus 208 ~~~~g~~~~~~~~ikI~~da----------------------------I~y~hSGl--~d~~~~~i-----~syLhkAiK 252 (516) T protein:vir:10 208 YAYNGRLFEPNTRIKIPRSA----------------------------IVYAHSGL--QDCSDRGI-----VGYLHNAVK 252 (516) T ss_pred eeccccccCCCCceecchhh----------------------------eeeeecCc--ccCCCCce-----eceehhhhH Confidence 111111 0011111111 11111100 00000001 122233444 Q ss_pred HHHHHHHHHHHH--HHHHHHhCcceeee-c---------hHHhcC----------CCCccccccCcchhhhhhcc--ccc Q lcl|NC_016654. 282 PTFHELDRIYSS--LMRDFRIGAGKVHA-S---------ESVLTN----------LGMGQGVSLDEEQEVYSRVG--SGG 337 (533) Q Consensus 282 ~lid~lD~~~s~--~~~~~~~~~~~i~v-~---------~~~l~~----------~~~~~~~~~d~~~~~~~~~~--~~~ 337 (533) ++ ..|=-+-+. +.|.-|.-.+|||- . +.+++. +....|. ...+.....++. +.+ T Consensus 253 p~-NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~KNklvYDa~TGe-v~ddrk~msMlEDyWLp 330 (516) T protein:vir:10 253 PA-NQLKLLEDALVIYRITRAPERRVFYIDVGNMPNRKATEYVNGIMQSLKNRVVYDSNTGT-VKNQKRNLSMTEDYWLM 330 (516) T ss_pred hH-HhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCe-eccchhhhhhHhhhccc Confidence 32 222111111 22334566777763 1 111110 0111111 111111000000 011 Q ss_pred cccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc---hhHHHHHHHhhhHHHHHHHHHHHHH Q lcl|NC_016654. 338 FNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA---QTATEASGKKDLTVKTTRAKARHFG 414 (533) Q Consensus 338 ~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~---~Tatai~~~~~~l~~~~~~~~~~~~ 414 (533) --.|+ ....|++....-.. .-+..+..+.+.+....++|.+.+..+++.. .-++||...+-.--.-+.+.+..|. T Consensus 331 RReGg-rgTEItTLpGgqnl-gem~DV~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs 408 (516) T protein:vir:10 331 RRDGK-SVTEVTSLPGAQTM-GEMDDVRWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITRDELDFRKFIVQLQHNFE 408 (516) T ss_pred ccCCC-cccceeeccccCCc-ChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHH Confidence 11122 22234444332222 2355677778889999999988887544321 2456776666666677888888888 Q ss_pred HHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHH-------HHHHHHHHH--hCCCCCHHHHHHHhCCCCC Q lcl|NC_016654. 415 SALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLA-------KAQTVQAWS--VASAASTKTKVAYLHEDWD 485 (533) Q Consensus 415 ~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e-------~a~~~~~l~--~aGi~S~et~v~~l~~~~~ 485 (533) .-+.++++.-|.|....-..........+.++|...---.+.. .+..++++. -++..|.++..++.. -.+ T Consensus 409 ~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~~yi~k~IL-r~t 487 (516) T protein:vir:10 409 EIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVGKYVSHDYVMKNIL-QMT 487 (516) T ss_pred HHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHh-cCC Confidence 8888888876655221100000011235677776443333322 333333332 345789998777654 489 Q ss_pred HHHHHHHHHHHHHhhhcccCccccccccCCCCCCC Q lcl|NC_016654. 486 DERVQEEADLIDNANTVSAPTFGFGTDQPPLPTEN 520 (533) Q Consensus 486 dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~ 520 (533) |+|..+|.++|++|.... -+ .+|...++. T Consensus 488 Deei~~~~k~I~~E~~~~--~~----~~p~~e~~f 516 (516) T protein:vir:10 488 DEQIAQEEKQIEKEANVK--RF----QNPENEDDF 516 (516) T ss_pred HhHHHHHHHHHHHhhhCC--CC----CCCCccccC Confidence 999999999999997431 00 000000000 No 228 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=93.51 E-value=0.0073 Score=32.30 Aligned_cols=374 Identities=9% Similarity=-0.020 Sum_probs=144.2 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) |+|=.....=+|.. ..|+..-.. ..+. ..+..+.... ...-+... T Consensus 1 Mg~f~~~~~~~~~~-----------~~~~~~~~~---~~~~------------------~~~~~~~~v~---~~~~l~~~ 45 (382) T protein:vir:48 1 MPIFNLATESPPDN-----------QGGFFDVVD---SDFL------------------ASLKGNEWVS---AETALRNS 45 (382) T ss_pred CccccccccCCccc-----------ccccccchh---hhcc------------------ccccCCcccc---hHhhhccH Confidence 55543332211111 001110000 0000 0000000000 00011111 Q ss_pred hHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcC Q lcl|NC_016654. 81 IPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDA 160 (533) Q Consensus 81 ~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~ 160 (533) --..+++.+|+-+-+-|..+.- .. ....+.+=-..-.....+..++...+..|.+|+.+..|..+. -+.+..++| T Consensus 46 ~v~~~i~~ia~~ia~~~~~~~~--~~--~~~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~-~~~l~~i~~ 120 (382) T protein:vir:48 46 DLFSIINQLSNDLATVKLITSR--KK--LQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGR-DMKWEYLRP 120 (382) T ss_pred HHHHHHHHHHHhhccCceeeec--ch--hhhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCc-EEEEEEEcC Confidence 2233455555555444433221 11 111111100000123334445556677899999888776543 356777788 Q ss_pred CeEEEEEec-CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCce Q lcl|NC_016654. 161 DRAIPEFRW-GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGA 239 (533) Q Consensus 161 ~~~~P~~~~-g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~ 239 (533) +.+.++.+. +.. .+ |+ |...+...|..+.+. T Consensus 121 ~~v~v~~~~~~~~---~~------------y~--------------~~~~~~~~~~~~~~~------------------- 152 (382) T protein:vir:48 121 SQVSFNRLDNKDG---IY------------YN--------------ITFDDPRIPPKQHVP------------------- 152 (382) T ss_pred ceeEEEEcCCCCe---EE------------EE--------------EEecCccccceeEEc------------------- Confidence 877665322 211 01 10 111111111111110 Q ss_pred eecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhC-cceeeechHHhcCCCC Q lcl|NC_016654. 240 YVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIG-AGKVHASESVLTNLGM 317 (533) Q Consensus 240 ~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~-~~~i~v~~~~l~~~~~ 317 (533) .--+.|+..... .....|.|.+..+. ..| .+....+++... |+.+ .+..++ ...+. T Consensus 153 -------~~evih~~~~~~--------~~~~~G~s~l~~~~-~~i-~~~~~~~~~~~~~~~ng~~p~~il-----~~~~~ 210 (382) T protein:vir:48 153 -------QNDVLHFRLLSV--------DGGMTSVSPLMALS-REL-DIQKASGNLTINSLKNALNANGIL-----KIKGG 210 (382) T ss_pred -------CccEEEecCCCC--------CCccccccHHHHHH-HHH-HHHHHHHHHHHHHHhccCCCceEE-----EeCCC Confidence 001223322111 11346888776543 455 344445555544 3544 333332 11111 Q ss_pred ccccccCcchhhhhhcccccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHH Q lcl|NC_016654. 318 GQGVSLDEEQEVYSRVGSGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTAT 393 (533) Q Consensus 318 ~~~~~~d~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tat 393 (533) ... +..............+.++ +....++.++......++++..+...++|+...|+||..+|....+..+.. T Consensus 211 ~~~---e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~~ 287 (382) T protein:vir:48 211 GLL---DFKTKLSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGDQQSSLE 287 (382) T ss_pred CCh---HHHHHHHHHHHhhccCCCCeeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHH Confidence 110 0000000000001111111 112235566666667788888888899999999999999986443322221 Q ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCH Q lcl|NC_016654. 394 EASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAAST 473 (533) Q Consensus 394 ai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~ 473 (533) +. ...+..+|..+++.+..-.+..+.... ...+.. .+-.+.......+.++..+|++++ T Consensus 288 ~~--------------~~~~~~~l~p~~~~i~~~l~~~l~~~~---~~~~~~----~~~~~~~~~~~~~~~l~~~g~~t~ 346 (382) T protein:vir:48 288 MS--------------SDLYSKAVSRYLRPFLSELSQKLSCDV---DADIFP----AVDPTGSNYISRINSLVKTGTLAQ 346 (382) T ss_pred HH--------------HHHHHHHHHHHHHHHHHHHHHHhcChh---hhhhhh----hhccchhHHHHHHHHHhhcCccCH Confidence 21 223334444444444322222221110 001111 111234455566778888999999 Q ss_pred HHHHHHhC-CCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 474 KTKVAYLH-EDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVD 530 (533) Q Consensus 474 et~v~~l~-~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (533) .++.+.+- .++...++ .+ .++ ..+.. .+||+.+++ T Consensus 347 ~e~r~~l~~~g~~~~~~----~~--~~~------------~~~~~----~GGd~~~~~ 382 (382) T protein:vir:48 347 NQGLYILQQAEILPKEL----PN--GEN------------PNSTL----KGGEEDGQD 382 (382) T ss_pred HHHHHHHhhCCCCCcch----hh--hhc------------CCCCC----CCCCCCCCC Confidence 98876531 12322211 11 011 00111 122222222 No 229 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=93.45 E-value=0.0075 Score=32.22 Aligned_cols=383 Identities=10% Similarity=0.008 Sum_probs=152.8 Q ss_pred HHHHhccCcchhhHHHHHHHHHHHHHhccc-------CCCCCc------ccceeecChHHHHHHHHHHhhcCCCceEeeC Q lcl|NC_016654. 37 ATFYGAEGRTSPSGIKARTKAAYEAFHGRT-------PTATGR------APKRYHAPIPGVIAKLSTTELFSEQLKFLDA 103 (533) Q Consensus 37 ~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~g~------~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~ 103 (533) -.|.+... ...........+|.... ..+.+. .+.-+...--...++.+|+-+..=|..+-- T Consensus 1 m~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~ia~~iA~lp~~~~~- 74 (412) T protein:vir:26 1 MNVIAKEN-----IVTRIKKKLIDNWIDQSTSKLYDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYE- 74 (412) T ss_pred Cccchhhh-----hhhhhhhhHhhhhhcccccccccccccCCccccccchhhhhccHHHHHHHHHHHHhHhhCceeEee- Confidence 11111000 00000000111111000 000000 011122222334444555544443433311 Q ss_pred CCchHHHHHHHHHHh--hccH--HH-HHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEE Q lcl|NC_016654. 104 GKSKEVQARADLIFN--TPRF--HS-SLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFW 178 (533) Q Consensus 104 ~~~~~~~~~l~~i~~--~n~f--~~-~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~ 178 (533) +.+.....+..+|. -|.. .. -....+...+..|.+|+.+..|..+. -..+..++|+.+.++.+.+.- .+| T Consensus 75 -~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~-~~~L~~l~~~~v~v~~~~~~~--~~~- 149 (412) T protein:vir:26 75 -DYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQ-PSKLFLLNPDVVEMLIENQSR--ELY- 149 (412) T ss_pred -ccccccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCc-EEEEEEEcCceeEEEEeCCCc--EEE- Confidence 11111222333333 1222 22 23445667788899998888776553 245667788877776543210 111 Q ss_pred EEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcc Q lcl|NC_016654. 179 SELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTP 258 (533) Q Consensus 179 ~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~ 258 (533) |.+.... |.++.+. .--+.|+.+... T Consensus 150 ------------------------y~~~~~~----g~~~~~~--------------------------~~evih~~~~~~ 175 (412) T protein:vir:26 150 ------------------------YSIHAAT----GNKLIVH--------------------------NMDMLHFKHIVA 175 (412) T ss_pred ------------------------EEEEcCC----ceEEEEc--------------------------cccEEEeCCCCC Confidence 1111000 1111000 001233332111 Q ss_pred cccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcce-eeechHHhcCCCCccccccCcchhhhhhccccc Q lcl|NC_016654. 259 NPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGK-VHASESVLTNLGMGQGVSLDEEQEVYSRVGSGG 337 (533) Q Consensus 259 ~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~-i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~ 337 (533) . ...+|.|.+..+ ...++ ++.....+ .+..++.. -++ +.....-.....+.-...|.... T Consensus 176 ~--------~~~~G~s~i~~~-~~~i~-~~~a~~~~--~~~~~~~~~~~i----~~~~~~l~~e~~~~~~~~~~~~~--- 236 (412) T protein:vir:26 176 S--------NMVQGISPIDVL-KNTTD-FDNAVRTF--NLTEMQKPDSFM----LKYGSNVGKEKRQQVLEDFKQYY--- 236 (412) T ss_pred C--------CCcccccHHHHH-HHHHH-HHHHHHHH--HHHhcCCCCceE----EecCCCCCHHHHHHHHHHHHHHh--- Confidence 1 123577766543 23332 22222222 23222211 111 11111100000011111121111 Q ss_pred ccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc-hhHHHHHHHhhhHHHHHHHHHHH Q lcl|NC_016654. 338 FNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA-QTATEASGKKDLTVKTTRAKARH 412 (533) Q Consensus 338 ~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~-~Tatai~~~~~~l~~~~~~~~~~ 412 (533) .+.++ +....++.++......++++..+...++|+...|+||..+|...++. .++++.. ... T Consensus 237 ~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~~~-------------~~f 303 (412) T protein:vir:26 237 EENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELN-------------RFY 303 (412) T ss_pred hcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH-------------HHH Confidence 01110 11223555665566678888888888999999999999998644322 2232221 122 Q ss_pred HHHHHHHHHHHHHHHHHhhcc-CCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHH Q lcl|NC_016654. 413 FGSALGPLSTTCLRVDAIKFP-GKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQE 491 (533) Q Consensus 413 ~~~al~~li~~il~l~~~~~~-~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~ 491 (533) +..+|..++..+..-.+..+. .........+.+++++-+..|..+.++.+.+++.+|+|+..++.+.+ ++.+-+ T Consensus 304 ~~~~l~P~~~~ie~~ln~kLl~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~--gl~p~~--- 378 (412) T protein:vir:26 304 LQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWE--DLPPVE--- 378 (412) T ss_pred HHHHHHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh--CCCCCC--- Confidence 333444444444322222221 11111223455556666778999999999999999999999977664 232210 Q ss_pred HHHHHHHhhhcccCccccccccCCCCC--C--CCCCCCCCCCCCC Q lcl|NC_016654. 492 EADLIDNANTVSAPTFGFGTDQPPLPT--E--NDPATDPEAVDEG 532 (533) Q Consensus 492 El~rI~~E~~~~~~~~~~~~~~~~~~~--~--~~~~~~~~~~~d~ 532 (533) .-+++ + ...+..|... + ....|..+..++| T Consensus 379 ggD~~----------~-~~~n~~~~~~~~~~~~~~~gG~~n~~e~ 412 (412) T protein:vir:26 379 GGDKP----------L-ISGDLYPIDTPLELRKSLKGGDKNVNES 412 (412) T ss_pred CcCee----------e-ecccccccccchhhcccccCCCCCcCCC Confidence 00000 0 0011111100 0 0122223444444 No 230 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=93.28 E-value=0.0081 Score=32.05 Aligned_cols=439 Identities=10% Similarity=0.034 Sum_probs=183.3 Q ss_pred HHHHHHhhhHhhcCCHHHHHHHHhccCc-c-hhhHHHHH-------------HHHHHHHHhcccCCCCCcc---cce--- Q lcl|NC_016654. 18 VTARVAESHVWWEGDLDKLATFYGAEGR-T-SPSGIKAR-------------TKAAYEAFHGRTPTATGRA---PKR--- 76 (533) Q Consensus 18 ~~~~~~~~~~w~~gd~~~l~~~y~~~~~-~-~~~~~~~~-------------~~~~~~~~~~~~~~~~g~~---~~~--- 76 (533) .+..+.--.-|..=|..++++--+.... . .|....+. +.|+...+..-.....+.. +.+ T Consensus 1 ~~~~l~~~~~~~~~d~~~~~e~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~~~~~~~eLI~~YR~m 80 (521) T protein:vir:65 1 MFSRLKMLARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQKISTTKQLVNTYRGL 80 (521) T ss_pred CccchhhhhhccCchhhHHHhhhccCCCcccCCCCCCCceeecccCCccccccccceeeeccccchhhhHHHHHHHHHHH Confidence 1111222222333333333222211110 0 00000000 0000000000000000000 000 Q ss_pred eecChHHHHHHHHHHh-hc----CCCceEeeCCC--ch----HHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEE Q lcl|NC_016654. 77 YHAPIPGVIAKLSTTE-LF----SEQLKFLDAGK--SK----EVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVW 145 (533) Q Consensus 77 ~~~n~~k~i~~~~a~l-l~----~e~~~i~~~~~--~~----~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~ 145 (533) ...+-+.-.++..++= +. .+|+++.+++. ++ ...+..+.|++-=+|++..++......+.|..|++..+ T Consensus 81 a~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkii 160 (521) T protein:vir:65 81 MNNHEVENAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKII 160 (521) T ss_pred hhccchhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceeEEEEEE Confidence 0112222233333321 11 12555555322 11 23455666777778999999999999999999999999 Q ss_pred cCCCC-CceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCC-----------cc Q lcl|NC_016654. 146 DPTIA-DNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTAT-----------SL 213 (533) Q Consensus 146 D~~~~-~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~-----------~l 213 (533) |++.. +=..+..++|.++.++...-+ ...++..++ .++..|.+|.-... +. T Consensus 161 d~~pk~GI~ELr~lDPr~i~~vr~i~k----------~~~~~~~v~-------~~~~e~f~Y~~~~~~~~~~g~~~~~~~ 223 (521) T protein:vir:65 161 GKNPKDGIVELRQLDPRNLEYVREIIT----------EDTPEGKIY-------KATKEYFIYTVGNSSYCAGGQVFSPNS 223 (521) T ss_pred cCCccccceeeeeeCCcceeeeeeecc----------cccCCccee-------cceeeeeeeecCCcceeccceeecCCc Confidence 85533 334566788887776632100 000111111 12223333321110 01 Q ss_pred cceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHH Q lcl|NC_016654. 214 GWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSS 293 (533) Q Consensus 214 G~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~ 293 (533) +.+++-+. +.|+.-.. -+.. .+.=.|-+-.||+++ ..|=-+-+. T Consensus 224 ~vkI~~dA----------------------------I~y~hSGl--~d~~-----~~~i~syLhkAiKp~-NQLkm~EDA 267 (521) T protein:vir:65 224 RVKIPRSA----------------------------ITYAHSGL--MDCD-----DKYIIGYLHRAVKPA-NQLKLLEDA 267 (521) T ss_pred ceeechhh----------------------------eeeeeccc--eeCC-----CCeeeecchhhhHhH-HhhHHHHhh Confidence 11111111 11111100 0000 000112233344432 222111111 Q ss_pred --HHHHHHhCcceeee-c---------hHHhc-------C---CCCccccccCcchhhhhhcc---ccccccccccccce Q lcl|NC_016654. 294 --LMRDFRIGAGKVHA-S---------ESVLT-------N---LGMGQGVSLDEEQEVYSRVG---SGGFNANGDMETIF 348 (533) Q Consensus 294 --~~~~~~~~~~~i~v-~---------~~~l~-------~---~~~~~~~~~d~~~~~~~~~~---~~~~~~~~~~~~~i 348 (533) +.|.-|.-.+|||- . +.+++ + +....|..-| + +.+..+. +.+--.|+ ....| T Consensus 268 lVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~d-d-rk~msMlEDyWLpRReGg-rgTEI 344 (521) T protein:vir:65 268 MVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKN-Q-QANLSMTEDYWLQRRDGK-AITDV 344 (521) T ss_pred HHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccc-c-ccccchhhhhcccccCCC-Cccce Confidence 22334666777763 1 11111 0 0011111111 1 1110000 01111122 22234 Q ss_pred eeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc---hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 349 EFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA---QTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCL 425 (533) Q Consensus 349 ~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~---~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il 425 (533) ++....-.. .-+.-+..+.+.+....++|.+.++..+++. --++||...+-.--.-+.+.+..|..-|.++++.-| T Consensus 345 tTLpGgqnl-gem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qL 423 (521) T protein:vir:65 345 TTLPGASGM-SDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIRTLQSQFSEVLRDPLKYNL 423 (521) T ss_pred eecccCCCc-ChHHHHHHHHHHHHHHhCCCceeccCCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 544432222 2355667777888999999988875433211 235677666666667788888888888888888776 Q ss_pred HHHHhhccCCCCCCceeEEEEeCCCCCCCHHHH-------HHHHHHHH--hCCCCCHHHHHHHhCCCCCHHHHHHHHHHH Q lcl|NC_016654. 426 RVDAIKFPGKGAAPSEELELEWPKFARESDLAK-------AQTVQAWS--VASAASTKTKVAYLHEDWDDERVQEEADLI 496 (533) Q Consensus 426 ~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~-------a~~~~~l~--~aGi~S~et~v~~l~~~~~dee~~~El~rI 496 (533) .|....-..........+.++|...---.+... +..++++. -+...|.++..++.. -.+|+|.++|.++| T Consensus 424 ilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~S~dyi~k~IL-r~tDeei~~~~k~I 502 (521) T protein:vir:65 424 ILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDIL-KYTDDQMDTEKKQI 502 (521) T ss_pred hhhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHh-ccCHHHHHHHHHHH Confidence 552211000000012347777765433333322 23333332 122569998777654 48999999999999 Q ss_pred HHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 497 DNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 497 ~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ++|....- + +. +..++++ T Consensus 503 ~~E~~~~~--~-------~~----------p~~~~~~ 520 (521) T protein:vir:65 503 EEEANDPR--F-------KQ----------TPDEIED 520 (521) T ss_pred HHhhhCCC--C-------CC----------CcccccC Confidence 99974211 0 00 0001111 No 231 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=92.83 E-value=0.0098 Score=31.59 Aligned_cols=376 Identities=11% Similarity=0.021 Sum_probs=123.5 Q ss_pred cCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHH Q lcl|NC_016654. 30 EGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEV 109 (533) Q Consensus 30 ~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~ 109 (533) .|=.++ +++..... ..... ... .. ... ....+....-..+++.+|+-+.+-|..+--. .... T Consensus 1 Mg~f~~---lf~~~~~~--~~~~~-----~~~---~~-~v~--~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~--~~~~ 62 (395) T protein:vir:95 1 MSILEK---IFKTRKDI--TYMLD-----LDM---IE-DLS--QQAYVKRLAIDSCIEFVARAVAQSHFKVLEG--NRIQ 62 (395) T ss_pred Cchhhh---hhccCccc--ccccc-----chh---cc-ccc--hhhhhhhHHHHHHHHHHHHhhccceeEeccC--Cccc Confidence 342222 22221110 00000 000 00 000 0111222233444555555444434332211 1111 Q ss_pred HHHHHHHHhh--c--cHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecC Q lcl|NC_016654. 110 QARADLIFNT--P--RFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGD 185 (533) Q Consensus 110 ~~~l~~i~~~--n--~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~ 185 (533) ...+..+|.. | .-...+.+.+...+.+|+.++.+..+.. + +. ..++-...|....+.. |.. +...+ T Consensus 63 ~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~--~-~~--~~~~~~~~~~~~~~~~----~~~-~~~~~ 132 (395) T protein:vir:95 63 KNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK--E-LL--IADSFYREEYALYDDI----FKD-VTVKD 132 (395) T ss_pred cchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC--C-eE--ecCCccceeEeecCcc----eeE-EEEcC Confidence 2223333321 1 1123344444445556666665433321 1 11 1111111121111110 000 00000 Q ss_pred CceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCccccccccc Q lcl|NC_016654. 186 GQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHD 265 (533) Q Consensus 186 ~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~ 265 (533) . ..-..+....|.|-.| +-+.. T Consensus 133 ~----~~~~~~~~~evih~~~---------------------------------------------~~~~~--------- 154 (395) T protein:vir:95 133 Y----TYQRTFTMQEVIYLKY---------------------------------------------NNNKV--------- 154 (395) T ss_pred c----eeeeeeccccEEEEcc---------------------------------------------CCCCc--------- Confidence 0 0000001111111111 00111 Q ss_pred ccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeee--chHHhcCCCCccccccCcchhhhhhcccccccccc- Q lcl|NC_016654. 266 PKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHA--SESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG- 342 (533) Q Consensus 266 ~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v--~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~- 342 (533) ..+|.|.+..+ -..++.+...+. +.+...-+| +...+. .... +..++.+....... +.++ T Consensus 155 ---~~~G~spi~~~----~~~~~~~~~~~~---~~~~~~gii~~~~~~~~---~e~~---~~~~~~~~~~~~~~-~~~~~ 217 (395) T protein:vir:95 155 ---THFVESLFEDY----GKIFGRMIGAQL---KNYQIRGILKSASSAYD---EKNI---EKLQAFTNKLFNTF-NKNQL 217 (395) T ss_pred ---ccccchHHHHH----HHHHHHHHHHHH---hcCCCceEEEeCCCCCC---HHHH---HHHHHHHHHHhccc-cccCc Confidence 01233333221 122222222222 222222222 111100 0000 00000011000000 0000 Q ss_pred -----ccccceeeec--h---hhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHH Q lcl|NC_016654. 343 -----DMETIFEFFQ--P---AIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARH 412 (533) Q Consensus 343 -----~~~~~i~~~~--~---~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~ 412 (533) +....++.++ + +....++.+..+...++|+...|+||..++-..+ +.++. .... T Consensus 218 ~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~~s---n~e~~-------------~~~~ 281 (395) T protein:vir:95 218 AIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYGETA---DLEKN-------------TLVF 281 (395) T ss_pred ceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcCccc---CHHHH-------------HHHH Confidence 0001122221 1 2234467888888889999999999999862211 22221 1223 Q ss_pred HHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHh-CCCCCHHHHHH Q lcl|NC_016654. 413 FGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYL-HEDWDDERVQE 491 (533) Q Consensus 413 ~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l-~~~~~dee~~~ 491 (533) ++.+|..++..+....+..+..... ....+.++++.-+-.|..+.++.+.+++.+|+|+..++.+.+ ++-.++.+.++ T Consensus 282 ~~~~l~P~~~~ie~~l~~kL~~~~~-~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~ 360 (395) T protein:vir:95 282 EKFCLTPLLKKIQNELNAKLITQSM-YLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDE 360 (395) T ss_pred HHHHHHHHHHHHHHHHHHhhcChhh-hcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCce Confidence 3334444444333222222221111 112245666666778999999999999999999999977664 11122211111 Q ss_pred HHHHHHHhhhcccCccccccccCCCCCCCC-CCCCCCCCCCCC Q lcl|NC_016654. 492 EADLIDNANTVSAPTFGFGTDQPPLPTEND-PATDPEAVDEGE 533 (533) Q Consensus 492 El~rI~~E~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~d~~ 533 (533) -+ +.... .+ ...+.+......+.. .+|++. ++|+ T Consensus 361 ~~--~~~n~---~~-~~~~~~~~~~~~~~~~kgg~~~--~~g~ 395 (395) T protein:vir:95 361 YL--ITKNY---EK-ANSGENDEKEKDENTLKGGDED--ESGD 395 (395) T ss_pred ee--ecccc---cc-ccccccccCcccccccCCCCCC--CCCC Confidence 00 00000 00 011111111111111 222222 2233 No 232 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=92.83 E-value=0.0098 Score=31.59 Aligned_cols=376 Identities=11% Similarity=0.021 Sum_probs=123.5 Q ss_pred cCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHH Q lcl|NC_016654. 30 EGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEV 109 (533) Q Consensus 30 ~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~ 109 (533) .|=.++ +++..... ..... ... .. ... ....+....-..+++.+|+-+.+-|..+--. .... T Consensus 1 Mg~f~~---lf~~~~~~--~~~~~-----~~~---~~-~v~--~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~--~~~~ 62 (395) T protein:vir:10 1 MSILEK---IFKTRKDI--TYMLD-----LDM---IE-DLS--QQAYVKRLAIDSCIEFVARAVAQSHFKVLEG--NRIQ 62 (395) T ss_pred Cchhhh---hhccCccc--ccccc-----chh---cc-ccc--hhhhhhhHHHHHHHHHHHHhhccceeEeccC--Cccc Confidence 342222 22221110 00000 000 00 000 0111222233444555555444434332211 1111 Q ss_pred HHHHHHHHhh--c--cHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecC Q lcl|NC_016654. 110 QARADLIFNT--P--RFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGD 185 (533) Q Consensus 110 ~~~l~~i~~~--n--~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~ 185 (533) ...+..+|.. | .-...+.+.+...+.+|+.++.+..+.. + +. ..++-...|....+.. |.. +...+ T Consensus 63 ~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~--~-~~--~~~~~~~~~~~~~~~~----~~~-~~~~~ 132 (395) T protein:vir:10 63 KNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK--E-LL--IADSFYREEYALYDDI----FKD-VTVKD 132 (395) T ss_pred cchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC--C-eE--ecCCccceeEeecCcc----eeE-EEEcC Confidence 2223333321 1 1123344444445556666665433321 1 11 1111111121111110 000 00000 Q ss_pred CceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCccccccccc Q lcl|NC_016654. 186 GQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHD 265 (533) Q Consensus 186 ~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~ 265 (533) . ..-..+....|.|-.| +-+.. T Consensus 133 ~----~~~~~~~~~evih~~~---------------------------------------------~~~~~--------- 154 (395) T protein:vir:10 133 Y----TYQRTFTMQEVIYLKY---------------------------------------------NNNKV--------- 154 (395) T ss_pred c----eeeeeeccccEEEEcc---------------------------------------------CCCCc--------- Confidence 0 0000001111111111 00111 Q ss_pred ccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeee--chHHhcCCCCccccccCcchhhhhhcccccccccc- Q lcl|NC_016654. 266 PKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHA--SESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG- 342 (533) Q Consensus 266 ~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v--~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~- 342 (533) ..+|.|.+..+ -..++.+...+. +.+...-+| +...+. .... +..++.+....... +.++ T Consensus 155 ---~~~G~spi~~~----~~~~~~~~~~~~---~~~~~~gii~~~~~~~~---~e~~---~~~~~~~~~~~~~~-~~~~~ 217 (395) T protein:vir:10 155 ---THFVESLFEDY----GKIFGRMIGAQL---KNYQIRGILKSASSAYD---EKNI---EKLQAFTNKLFNTF-NKNQL 217 (395) T ss_pred ---ccccchHHHHH----HHHHHHHHHHHH---hcCCCceEEEeCCCCCC---HHHH---HHHHHHHHHHhccc-cccCc Confidence 01233333221 122222222222 222222222 111100 0000 00000011000000 0000 Q ss_pred -----ccccceeeec--h---hhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHH Q lcl|NC_016654. 343 -----DMETIFEFFQ--P---AIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARH 412 (533) Q Consensus 343 -----~~~~~i~~~~--~---~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~ 412 (533) +....++.++ + +....++.+..+...++|+...|+||..++-..+ +.++. .... T Consensus 218 ~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~~s---n~e~~-------------~~~~ 281 (395) T protein:vir:10 218 AIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYGETA---DLEKN-------------TLVF 281 (395) T ss_pred ceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcCccc---CHHHH-------------HHHH Confidence 0001122221 1 2234467888888889999999999999862211 22221 1223 Q ss_pred HHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHh-CCCCCHHHHHH Q lcl|NC_016654. 413 FGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYL-HEDWDDERVQE 491 (533) Q Consensus 413 ~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l-~~~~~dee~~~ 491 (533) ++.+|..++..+....+..+..... ....+.++++.-+-.|..+.++.+.+++.+|+|+..++.+.+ ++-.++.+.++ T Consensus 282 ~~~~l~P~~~~ie~~l~~kL~~~~~-~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~ 360 (395) T protein:vir:10 282 EKFCLTPLLKKIQNELNAKLITQSM-YLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDE 360 (395) T ss_pred HHHHHHHHHHHHHHHHHHhhcChhh-hcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCce Confidence 3334444444333222222221111 112245666666778999999999999999999999977664 11122211111 Q ss_pred HHHHHHHhhhcccCccccccccCCCCCCCC-CCCCCCCCCCCC Q lcl|NC_016654. 492 EADLIDNANTVSAPTFGFGTDQPPLPTEND-PATDPEAVDEGE 533 (533) Q Consensus 492 El~rI~~E~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~d~~ 533 (533) -+ +.... .+ ...+.+......+.. .+|++. ++|+ T Consensus 361 ~~--~~~n~---~~-~~~~~~~~~~~~~~~~kgg~~~--~~g~ 395 (395) T protein:vir:10 361 YL--ITKNY---EK-ANSGENDEKEKDENTLKGGDED--ESGD 395 (395) T ss_pred ee--ecccc---cc-ccccccccCcccccccCCCCCC--CCCC Confidence 00 00000 00 011111111111111 222222 2233 No 233 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=92.83 E-value=0.0098 Score=31.59 Aligned_cols=376 Identities=11% Similarity=0.021 Sum_probs=123.5 Q ss_pred cCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHH Q lcl|NC_016654. 30 EGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEV 109 (533) Q Consensus 30 ~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~ 109 (533) .|=.++ +++..... ..... ... .. ... ....+....-..+++.+|+-+.+-|..+--. .... T Consensus 1 Mg~f~~---lf~~~~~~--~~~~~-----~~~---~~-~v~--~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~--~~~~ 62 (395) T protein:vir:10 1 MSILEK---IFKTRKDI--TYMLD-----LDM---IE-DLS--QQAYVKRLAIDSCIEFVARAVAQSHFKVLEG--NRIQ 62 (395) T ss_pred Cchhhh---hhccCccc--ccccc-----chh---cc-ccc--hhhhhhhHHHHHHHHHHHHhhccceeEeccC--Cccc Confidence 342222 22221110 00000 000 00 000 0111222233444555555444434332211 1111 Q ss_pred HHHHHHHHhh--c--cHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecC Q lcl|NC_016654. 110 QARADLIFNT--P--RFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGD 185 (533) Q Consensus 110 ~~~l~~i~~~--n--~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~ 185 (533) ...+..+|.. | .-...+.+.+...+.+|+.++.+..+.. + +. ..++-...|....+.. |.. +...+ T Consensus 63 ~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~--~-~~--~~~~~~~~~~~~~~~~----~~~-~~~~~ 132 (395) T protein:vir:10 63 KNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK--E-LL--IADSFYREEYALYDDI----FKD-VTVKD 132 (395) T ss_pred cchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC--C-eE--ecCCccceeEeecCcc----eeE-EEEcC Confidence 2223333321 1 1123344444445556666665433321 1 11 1111111121111110 000 00000 Q ss_pred CceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCccccccccc Q lcl|NC_016654. 186 GQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHD 265 (533) Q Consensus 186 ~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~ 265 (533) . ..-..+....|.|-.| +-+.. T Consensus 133 ~----~~~~~~~~~evih~~~---------------------------------------------~~~~~--------- 154 (395) T protein:vir:10 133 Y----TYQRTFTMQEVIYLKY---------------------------------------------NNNKV--------- 154 (395) T ss_pred c----eeeeeeccccEEEEcc---------------------------------------------CCCCc--------- Confidence 0 0000001111111111 00111 Q ss_pred ccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeee--chHHhcCCCCccccccCcchhhhhhcccccccccc- Q lcl|NC_016654. 266 PKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHA--SESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG- 342 (533) Q Consensus 266 ~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v--~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~- 342 (533) ..+|.|.+..+ -..++.+...+. +.+...-+| +...+. .... +..++.+....... +.++ T Consensus 155 ---~~~G~spi~~~----~~~~~~~~~~~~---~~~~~~gii~~~~~~~~---~e~~---~~~~~~~~~~~~~~-~~~~~ 217 (395) T protein:vir:10 155 ---THFVESLFEDY----GKIFGRMIGAQL---KNYQIRGILKSASSAYD---EKNI---EKLQAFTNKLFNTF-NKNQL 217 (395) T ss_pred ---ccccchHHHHH----HHHHHHHHHHHH---hcCCCceEEEeCCCCCC---HHHH---HHHHHHHHHHhccc-cccCc Confidence 01233333221 122222222222 222222222 111100 0000 00000011000000 0000 Q ss_pred -----ccccceeeec--h---hhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHH Q lcl|NC_016654. 343 -----DMETIFEFFQ--P---AIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARH 412 (533) Q Consensus 343 -----~~~~~i~~~~--~---~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~ 412 (533) +....++.++ + +....++.+..+...++|+...|+||..++-..+ +.++. .... T Consensus 218 ~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~~s---n~e~~-------------~~~~ 281 (395) T protein:vir:10 218 AIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYGETA---DLEKN-------------TLVF 281 (395) T ss_pred ceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcCccc---CHHHH-------------HHHH Confidence 0001122221 1 2234467888888889999999999999862211 22221 1223 Q ss_pred HHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHh-CCCCCHHHHHH Q lcl|NC_016654. 413 FGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYL-HEDWDDERVQE 491 (533) Q Consensus 413 ~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l-~~~~~dee~~~ 491 (533) ++.+|..++..+....+..+..... ....+.++++.-+-.|..+.++.+.+++.+|+|+..++.+.+ ++-.++.+.++ T Consensus 282 ~~~~l~P~~~~ie~~l~~kL~~~~~-~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~ 360 (395) T protein:vir:10 282 EKFCLTPLLKKIQNELNAKLITQSM-YLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDE 360 (395) T ss_pred HHHHHHHHHHHHHHHHHHhhcChhh-hcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCce Confidence 3334444444333222222221111 112245666666778999999999999999999999977664 11122211111 Q ss_pred HHHHHHHhhhcccCccccccccCCCCCCCC-CCCCCCCCCCCC Q lcl|NC_016654. 492 EADLIDNANTVSAPTFGFGTDQPPLPTEND-PATDPEAVDEGE 533 (533) Q Consensus 492 El~rI~~E~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~d~~ 533 (533) -+ +.... .+ ...+.+......+.. .+|++. ++|+ T Consensus 361 ~~--~~~n~---~~-~~~~~~~~~~~~~~~~kgg~~~--~~g~ 395 (395) T protein:vir:10 361 YL--ITKNY---EK-ANSGENDEKEKDENTLKGGDED--ESGD 395 (395) T ss_pred ee--ecccc---cc-ccccccccCcccccccCCCCCC--CCCC Confidence 00 00000 00 011111111111111 222222 2233 No 234 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=91.78 E-value=0.014 Score=30.70 Aligned_cols=396 Identities=12% Similarity=0.029 Sum_probs=147.7 Q ss_pred hcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcc-cceeecChHHHHHHHHHHhhcCCCceEeeCCCch Q lcl|NC_016654. 29 WEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRA-PKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSK 107 (533) Q Consensus 29 ~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~-~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~ 107 (533) ++..|. +.... .-|. .....+.. ...+...-....|+.+|+-+-+=|..+.-.+... T Consensus 1 ~~~~~~-------~~g~~-------------~~~~--~~~~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~~~~ 58 (723) T protein:vir:94 1 MTTFPS-------GAGGW-------------NAWS--ADSVFGNGAKGWSNSAVAYRCISMLANNAASVDLVVRGPDGEL 58 (723) T ss_pred Cccccc-------CCCcc-------------cccc--ccccccccHHHHhhhHHHHHHHHHHHHhhccceeEEEcCCCcc Confidence 222221 00000 0000 00000000 0011222223445555555444454332211111 Q ss_pred HHHHHHHHHHhh--ccH--HHHHHHHHH-HHhhhCCEEEEEEEcCC-CCCc-eEEEEEcCCeEEEEEecCCceEEEEEEE Q lcl|NC_016654. 108 EVQARADLIFNT--PRF--HSSLVEAGE-SCSALSGSFQRIVWDPT-IADN-AWIDFVDADRAIPEFRWGRLVAVTFWSE 180 (533) Q Consensus 108 ~~~~~l~~i~~~--n~f--~~~~~~~~~-~~~~~G~~~~~~~~D~~-~~~~-~~i~~v~~~~~~P~~~~g~~~~v~f~~~ 180 (533) ....-+-.+|.. |.+ ...+.+.+. .....|.+|+.+..+.. ..+. ..+..++++.+.++..++. T Consensus 59 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~--------- 129 (723) T protein:vir:94 59 DELHPLSQLWNVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAA--------- 129 (723) T ss_pred chhhHHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCC--------- Confidence 111223444432 221 223444444 44556888887765421 1111 2233344443333322211 Q ss_pred EeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccc Q lcl|NC_016654. 181 LAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNP 260 (533) Q Consensus 181 ~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~ 260 (533) ..++ ..+.+.|.++..+ |..+++. .--+.|+....+ T Consensus 130 ------~~~~------~~~~~~y~~~~~~----G~~~~~~--------------------------~~dIiHir~~~~-- 165 (723) T protein:vir:94 130 ------DAVP------QAQIIGYVIERTD----GVRVPVL--------------------------ADEMLWLRFSDP-- 165 (723) T ss_pred ------ccce------eeeeeEEEEEecC----ceeEEec--------------------------ccceEEecCCCC-- Confidence 0000 1111112111111 2211110 001233332111 Q ss_pred cccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCc-ceeeechHHhcCCCCccccccCcchhhhhhcccccc Q lcl|NC_016654. 261 EWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGA-GKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGF 338 (533) Q Consensus 261 ~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~-~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~ 338 (533) + ....|.|.+..+. ..| .++....++... |+.|. +.-+ |....-.. .........|.....+.. T Consensus 166 -~-----dg~~G~Spi~~a~-~~i-~~~~aa~~~~~~~f~NG~~p~gi-----L~~~~l~~-e~~~~~~~~~~~~~~G~~ 231 (723) T protein:vir:94 166 -Y-----DPLAVMAPWKAAR-AAV-DADFYAATWQRQSFKNGARPGGV-----VNLGDMDE-QTFTKTVAAFRSQVEGVQ 231 (723) T ss_pred -C-----CCcccccHHHHHH-HHH-HHHHHHHHHHHHHHhcCCCcceE-----EEcCCCCH-HHHHHHHHHHHHHhhchh Confidence 1 1235777776543 344 233444444444 35443 2222 22111000 000011111211111111 Q ss_pred cccc--------------ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHH Q lcl|NC_016654. 339 NANG--------------DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVK 404 (533) Q Consensus 339 ~~~~--------------~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~ 404 (533) +.+. +....++.++.+....++++..+....+|....|+||..++..+...-...+.+ T Consensus 232 Nagk~~vL~g~~~~~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~~st~sN~e~~~~-------- 303 (723) T protein:vir:94 232 NAGRHLLIAGQGSDGGAAGKGATFTSLSMSPAEMDYINSRMHSAEEVMLAFGIRKDALLGGSTYENQAEAKA-------- 303 (723) T ss_pred hcCcceeecccccccccccCCceEEEccCCHHHHHHHHHHHHhHHHHHHHhCCChhHcCCCCCcccHHHHHH-------- Confidence 1110 112235556666667788898888899999999999998864332211111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCC--CCCCHHHHHHHHHHHHhCCCCCHHHHHHHhC- Q lcl|NC_016654. 405 TTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKF--ARESDLAKAQTVQAWSVASAASTKTKVAYLH- 481 (533) Q Consensus 405 ~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~--i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~- 481 (533) ..+...|..+++.+....+..+.. .....+.++|+.. +-.|..+.++.+.+++.+|+|+..++.+.+. T Consensus 304 ------~f~~~tL~P~~~~ie~~ln~~Ll~---~~g~~~~~~f~~~~lLr~D~~~r~~~~~~~v~~G~~T~NE~R~~lgl 374 (723) T protein:vir:94 304 ------AVWTETLIPQMEVMASITDLQLLP---DIGWTVEWDFNSVPALQEDLEAQAGRNQGYLVNDVLMVDEVRATIGL 374 (723) T ss_pred ------HHHHHHHHHHHHHHHHHHhHhhcc---cccCceEEeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 123344444444443332322221 1124567888753 5689999999999999999999999776641 Q ss_pred CCCCHHHHHHHHHHHHHhhhccc-Ccc--cc------------ccccCCCCCCCCCCC--CCCCCCCCC Q lcl|NC_016654. 482 EDWDDERVQEEADLIDNANTVSA-PTF--GF------------GTDQPPLPTENDPAT--DPEAVDEGE 533 (533) Q Consensus 482 ~~~~dee~~~El~rI~~E~~~~~-~~~--~~------------~~~~~~~~~~~~~~~--~~~~~~d~~ 533 (533) |.+..-+.+--+.-....-+..+ |.. .. ..+.| .++.+.. -....++|. T Consensus 375 pPi~gGd~~~~~~p~~~~~a~~~~~~p~~~e~~~~~~~~~~~~~~~~p---~~~~~~~~~~~~~~~~~~ 440 (723) T protein:vir:94 375 DPLPGGIGQMTLTPYRAQFAPAPAPAPAVEEGAARMLALLERVAADRP---LPELPVRATTVLHHDPGP 440 (723) T ss_pred CCCCCCcccceeccccccccCCCCCCccchhhhHhhhhhccccccccC---cCCCCCCCCCCCCCCccc Confidence 11211110000110000000000 000 00 01111 1111111 011122222 No 235 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=91.42 E-value=0.016 Score=30.44 Aligned_cols=458 Identities=12% Similarity=0.074 Sum_probs=187.1 Q ss_pred CCCC--CCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcc--hhhHHHHHHHHHHHHHhcccCCCCCcc--- Q lcl|NC_016654. 1 MSLP--EANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRT--SPSGIKARTKAAYEAFHGRTPTATGRA--- 73 (533) Q Consensus 1 ~~~~--~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~g~~--- 73 (533) |-.| .--.+|-+.+-..+... ...+.-=..-|... -++... ..+.....+.|.+..++.+........ T Consensus 1 m~~~~L~~~~~w~~~de~~~~~~-~~~~~~S~~~p~~~----Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eL 75 (524) T protein:vir:10 1 MKFNVLSLFAPWAKMDERNFKDQ-EKEDLVSITAPKLD----DGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTREL 75 (524) T ss_pred CCCchhhHhhccccCcchhhhhh-hccCCccccCccCC----CCceeeeecccccccccceeeeehhcccccccchHHHH Confidence 5444 22235555542222222 11111000000000 000000 000000111222222222211100000 Q ss_pred -cce---eecChHHHHHHHHHHh-hc----CCCceEeeCCCc--h----HHHHHHHHHHhhccHHHHHHHHHHHHhhhCC Q lcl|NC_016654. 74 -PKR---YHAPIPGVIAKLSTTE-LF----SEQLKFLDAGKS--K----EVQARADLIFNTPRFHSSLVEAGESCSALSG 138 (533) Q Consensus 74 -~~~---~~~n~~k~i~~~~a~l-l~----~e~~~i~~~~~~--~----~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~ 138 (533) +.+ ...+-+.-.++..++= +. .+|+++.++.-+ + ...+..+.|++-=+|++..++......+.|. T Consensus 76 I~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgR 155 (524) T protein:vir:10 76 IDTYRNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFNDVLNHLSFQRKGSDHFRRWYVDSR 155 (524) T ss_pred HHHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeE Confidence 000 0112222233333321 11 125555553321 2 2445566677777899999999999999999 Q ss_pred EEEEEEEcCC--CCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeec--CCceEEEEEEEecCeeEEEEEEeccCCc-- Q lcl|NC_016654. 139 SFQRIVWDPT--IADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGG--DGQEVWRHLERHESGYIVHAVYKGTATS-- 212 (533) Q Consensus 139 ~~~~~~~D~~--~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~--~~~~~y~~lE~h~~~~I~~~~y~~~~~~-- 212 (533) .||+.++|.. ..+=..+..++|.++-++. ++... ++..+++ ++-.|.+|.-...+ T Consensus 156 i~fhKiid~k~pk~GI~Elr~lDPr~i~~vr------------~i~~~~~~~~~vi~-------~~~e~f~Y~~~~~~y~ 216 (524) T protein:vir:10 156 IFFHKIIDPKRPKEGIKELRRLDPRQVQYVR------------EIITETEAGTKIVK-------GYKEYFIYDTAHESYA 216 (524) T ss_pred EEEEEEeeCCCccccceeeeeeCCccceeee------------eeccCCCccchhhc-------chhhheeeccCccccc Confidence 9999999854 2233456677777665542 22111 1111221 22223333311000 Q ss_pred -ccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHH Q lcl|NC_016654. 213 -LGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIY 291 (533) Q Consensus 213 -lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~ 291 (533) -|. +. ..+....|+ .-.+.|..-+. ..+.+ ..=.|-+-.||+++ ..|=-+- T Consensus 217 ~~g~-~~---------------~~~~~ikI~----~dAI~y~hSGL-----~d~~~--~~i~gyLhkAiKp~-NQLkmlE 268 (524) T protein:vir:10 217 CDGR-MY---------------EAGTKIKIP----KAAIVYAHSGL-----VDCCG--KNIIGYLHRAVKPA-NQLKLLE 268 (524) T ss_pred cCcc-cc---------------CCCcceecc----hhheeeeeccc-----eeCCC--CceeccchhhhHHH-HhhhHHH Confidence 010 00 000000000 00112221110 00000 00012233344432 2221111 Q ss_pred HH--HHHHHHhCcceeee-c---------hHHhcC----------CCCccccccCcchhhhhhcc--ccccccccccccc Q lcl|NC_016654. 292 SS--LMRDFRIGAGKVHA-S---------ESVLTN----------LGMGQGVSLDEEQEVYSRVG--SGGFNANGDMETI 347 (533) Q Consensus 292 s~--~~~~~~~~~~~i~v-~---------~~~l~~----------~~~~~~~~~d~~~~~~~~~~--~~~~~~~~~~~~~ 347 (533) +. +.|.-|.-.+|||- . +.+++. +....|. ...+.....++. +.+--.|+ .... T Consensus 269 DAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGe-v~ddrk~msMlEDyWLpRReGg-rgTE 346 (524) T protein:vir:10 269 DAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGK-IKNQQHNMSMTEDYWLQRRDGK-AVTE 346 (524) T ss_pred hhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCe-eccchhhhhhHhhhcccccCCC-cccc Confidence 11 22334666777763 1 111110 0111111 111111000000 01111122 2223 Q ss_pred eeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCC-Cc--chhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 348 FEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSD-EV--AQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTC 424 (533) Q Consensus 348 i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~-~~--~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~i 424 (533) |+++...-.. .-+..+..+.+.+....++|.+.+..++ ++ .--++||...+-.--.-+.+.+..|..-|.++++.- T Consensus 347 ItTLpGgqnl-gem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~q 425 (524) T protein:vir:10 347 VDTLPGADNT-GNMEDVRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIRELQHKFEEVFLDPLKTN 425 (524) T ss_pred eeeccccCCc-ChHHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 4544432222 2355667777888888999888884332 11 124567766666666778888888888888888877 Q ss_pred HHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHH-------HHHHHHH--hCCCCCHHHHHHHhCCCCCHHHHHHHHHH Q lcl|NC_016654. 425 LRVDAIKFPGKGAAPSEELELEWPKFARESDLAKA-------QTVQAWS--VASAASTKTKVAYLHEDWDDERVQEEADL 495 (533) Q Consensus 425 l~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a-------~~~~~l~--~aGi~S~et~v~~l~~~~~dee~~~El~r 495 (533) |.|....-..........+.++|...---.+...+ ..++++. -+..+|.+++.++.. -.+|+|.++|.++ T Consensus 426 LilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~IL-r~tDeei~~~~k~ 504 (524) T protein:vir:10 426 LLLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINMLTMAEPFIGKYISHRTAMKDIL-QMTDEEIEQEAKQ 504 (524) T ss_pred hhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHh-ccCHHHHHHHHHH Confidence 65522110000001124577777654433333333 3333332 122569998777654 4899999999999 Q ss_pred HHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 496 IDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 496 I~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) |++|....- + +.++ +++.| T Consensus 505 I~~E~k~~~--~-------~~~~----------~~~~~ 523 (524) T protein:vir:10 505 IEEESKEAR--F-------QDPD----------QEQED 523 (524) T ss_pred HHHHhhcCC--C-------CCCc----------hhhhc Confidence 999963210 0 0000 00000 No 236 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=91.39 E-value=0.016 Score=30.42 Aligned_cols=392 Identities=13% Similarity=0.057 Sum_probs=140.6 Q ss_pred cCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCc-ccce--eecChHHHHHHHHHHhhcCCCceEe--e-C Q lcl|NC_016654. 30 EGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGR-APKR--YHAPIPGVIAKLSTTELFSEQLKFL--D-A 103 (533) Q Consensus 30 ~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~-~~~~--~~~n~~k~i~~~~a~ll~~e~~~i~--~-~ 103 (533) +|=.++|.. +......+.... +. ..+.......... .... ....--..+++.+|+-+.+=|..+- . . T Consensus 1 Mg~~~~~~~--~~~~~~~~~~~~--~~---~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~~~~~~d 73 (423) T protein:vir:81 1 MGFLQKLGL--APSVVATPEPIE--LV---GPIFESLKLSTKNMTVEQIWEDQPHLRTVTTFIARNVASLQLQAFERVED 73 (423) T ss_pred CchhHhhcc--ccccccCccccc--cc---cccccccccccchhhHHHHHHhhhHHHHHHHHHHHhHhhCceEEEEEecC Confidence 343333210 000000000000 00 0000000000000 0000 0111223455566665555454331 1 1 Q ss_pred CCch-HHHHHHHHHHhhcc----HHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEE--EcCCeEEEEEecCCceEEE Q lcl|NC_016654. 104 GKSK-EVQARADLIFNTPR----FHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDF--VDADRAIPEFRWGRLVAVT 176 (533) Q Consensus 104 ~~~~-~~~~~l~~i~~~n~----f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~--v~~~~~~P~~~~g~~~~v~ 176 (533) +..+ ..+..+.+++..-+ ....+...+......|.+|+.+..|..+... .+.. +++..+.+. T Consensus 74 g~~~~~~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~-~~~l~p~~~~~v~~~---------- 142 (423) T protein:vir:81 74 GGRERVREGHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTP-TLDIRPIPVSWVQRR---------- 142 (423) T ss_pred CceeeeccchHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcc-eEEEeecccceeeee---------- Confidence 1111 11122334443222 2222333444556778888777666432211 1111 111111110 Q ss_pred EEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCC Q lcl|NC_016654. 177 FWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNV 256 (533) Q Consensus 177 f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~ 256 (533) .... ..+.+.|.++.... .-|..+.+. .--+.|+.+. T Consensus 143 ---~~~~-------------~~~~~~Y~~~~~~~-~~g~~~~~~--------------------------~~evih~r~~ 179 (423) T protein:vir:81 143 ---AYKD-------------GWGSLDYIIIESGD-NDGRSVKVP--------------------------GERVIHRHGY 179 (423) T ss_pred ---eccC-------------CCcceEEEEEEecC-CCceEEEEc--------------------------ccceEEecCC Confidence 0000 11122222221111 112221110 0012333322 Q ss_pred cccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHH-HhC-cceeeechHHhcCCCCccccccCcc-----hhh Q lcl|NC_016654. 257 TPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDF-RIG-AGKVHASESVLTNLGMGQGVSLDEE-----QEV 329 (533) Q Consensus 257 ~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~-~~~i~v~~~~l~~~~~~~~~~~d~~-----~~~ 329 (533) .++ ....|.|....+. ..| .++....++...+ +.| .+..++ ...........+.+ ... T Consensus 180 ~~~--------~~~~G~spi~~~~-~~i-~~~~~~~~~~~~~f~ng~~p~gvi-----~~~~~~~~~~l~~e~~~~~~~~ 244 (423) T protein:vir:81 180 NPK--------TMKRGKSPVQSLR-DIL-GEQIEAAIFRAQMWRNGPRPGMVI-----MRDPESKAGKWDAESRTRFMAN 244 (423) T ss_pred CCC--------CccccccHHHHHH-HHH-HHHHHHHHHHHHHHhccCCCceEE-----EecCcccCccCCHHHHHHHHHH Confidence 111 1235777776543 444 3344455555443 443 222222 11110000001111 111 Q ss_pred hhhcc-cccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcch-hHHHHHHHhhhHH Q lcl|NC_016654. 330 YSRVG-SGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQ-TATEASGKKDLTV 403 (533) Q Consensus 330 ~~~~~-~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~-Tatai~~~~~~l~ 403 (533) +.... .+..+.++ +....++.++......++++..+....+|+...|+||..+|+..++.- +.++. T Consensus 245 ~~~~~~~~~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn~e~~-------- 316 (423) T protein:vir:81 245 LRASFSPKSSDVGGTLLLEDGMKAENFHTTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNANYSNVREF-------- 316 (423) T ss_pred HHHHhccccccCCcceecCCCceEEeccCChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCCcccHHHH-------- Confidence 11111 11111111 011234555555556677888778888999999999999986443321 22222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCC--CCceeEEEEeCCCCCCCHHHHHHHHHHHH-hCCCCCHHHHHHHh Q lcl|NC_016654. 404 KTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGA--APSEELELEWPKFARESDLAKAQTVQAWS-VASAASTKTKVAYL 480 (533) Q Consensus 404 ~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~--~~~~~v~i~f~d~i~~d~~e~a~~~~~l~-~aGi~S~et~v~~l 480 (533) ....++.+|.-++..+..-.+..+..... .....+.++++.-+-.|..++++.+++++ ++|+|+..++.+.+ T Consensus 317 -----~~~f~~~~L~P~~~~ie~~l~~~L~~~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~NE~R~~~ 391 (423) T protein:vir:81 317 -----RKALYGDNLGSWIRIIQDVMNLFLLPRVGIDNEKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTINEVRAMD 391 (423) T ss_pred -----HHHHHHHHHHHHHHHHHHHHhhhhcCccccccCccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCHHHHHHHh Confidence 12222333444333332222222211111 12233555555667789999999888876 56999998866543 Q ss_pred CCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCC-CCCCCCCC Q lcl|NC_016654. 481 HEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATD-PEAVDEGE 533 (533) Q Consensus 481 ~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~d~~ 533 (533) ++..-+ ++|..-.+..-.++.. +...++.| T Consensus 392 --gl~p~~---------------------gGD~~~~p~n~~~~~~~~~~~~~~~ 422 (423) T protein:vir:81 392 --NLPSID---------------------GGDDLARPLNTEFGDSEDAPGEEVE 422 (423) T ss_pred --CCCCCC---------------------CcceeecccccccCccCCCCCCCCC Confidence 232200 0111100000000000 11111111 No 237 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=91.34 E-value=0.016 Score=30.38 Aligned_cols=458 Identities=12% Similarity=0.072 Sum_probs=187.0 Q ss_pred CCCC--CCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcc--hhhHHHHHHHHHHHHHhcccCCCCCcc--- Q lcl|NC_016654. 1 MSLP--EANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRT--SPSGIKARTKAAYEAFHGRTPTATGRA--- 73 (533) Q Consensus 1 ~~~~--~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~g~~--- 73 (533) |-.| .--.+|-+.+-..+... ...+.-=..-|... -++... ..+.....+.|.+..++.+........ T Consensus 1 m~~~~L~~~~~w~~~de~~~~~~-~~~~~~S~~~p~~~----Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eL 75 (524) T protein:vir:72 1 MKFNVLSLFAPWAKMDERNFKDQ-EKEDLVSITAPKLD----DGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTREL 75 (524) T ss_pred CCCchhhHhhccccCcchhhhhh-hccCCccccCccCC----CCceeeeecccccccccceeeeehhcccccccchHHHH Confidence 5444 22235555542222222 11111000000000 000000 000000111222222222211100000 Q ss_pred -cce---eecChHHHHHHHHHHh-hc----CCCceEeeCCCc--h----HHHHHHHHHHhhccHHHHHHHHHHHHhhhCC Q lcl|NC_016654. 74 -PKR---YHAPIPGVIAKLSTTE-LF----SEQLKFLDAGKS--K----EVQARADLIFNTPRFHSSLVEAGESCSALSG 138 (533) Q Consensus 74 -~~~---~~~n~~k~i~~~~a~l-l~----~e~~~i~~~~~~--~----~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~ 138 (533) +.+ ...+-+.-.++..++= +. .+|+++.++.-+ + ...+..+.|++-=+|++..++......+.|. T Consensus 76 I~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgR 155 (524) T protein:vir:72 76 IDTYRNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFSDVLNHLSFQRKGSDHFRRWYVDSR 155 (524) T ss_pred HHHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeE Confidence 000 0112222233333321 11 125555553321 2 2445566677777899999999999999999 Q ss_pred EEEEEEEcCC--CCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeec--CCceEEEEEEEecCeeEEEEEEeccCCc-- Q lcl|NC_016654. 139 SFQRIVWDPT--IADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGG--DGQEVWRHLERHESGYIVHAVYKGTATS-- 212 (533) Q Consensus 139 ~~~~~~~D~~--~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~--~~~~~y~~lE~h~~~~I~~~~y~~~~~~-- 212 (533) .||+.++|.. ..+=..+..++|.++-++. ++... ++..+++ ++-.|.+|.-...+ T Consensus 156 i~fhKiid~k~pk~GI~Elr~lDPr~i~~vr------------~i~~~~~~~~~vi~-------~~~e~f~Y~~~~~~y~ 216 (524) T protein:vir:72 156 IFFHKIIDPKRPKEGIKELRRLDPRQVQYVR------------EIITETEAGTKIVK-------GYKEYFIYDTAHESYA 216 (524) T ss_pred EEEEEEEeCCCccccceeeeeeCCccceeee------------eeccCCCccchhhc-------chhhheeeccCccccc Confidence 9999999854 2233456677777665542 22111 1111221 22223333311000 Q ss_pred -ccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHH Q lcl|NC_016654. 213 -LGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIY 291 (533) Q Consensus 213 -lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~ 291 (533) -|. +. ..+....|+ .-.+.|..-+. ..+.+ ..=.|-+-.||+++ ..|=-+- T Consensus 217 ~~g~-~~---------------~~~~~ikI~----~dAI~y~hSGL-----~d~~~--~~i~gyLhkAiKp~-NQLkmlE 268 (524) T protein:vir:72 217 CDGR-MY---------------EAGTKIKIP----KAAVVYAHSGL-----VDCCG--KNIIGYLHRAVKPA-NQLKLLE 268 (524) T ss_pred cCcc-cc---------------CCCcceecc----hhheeeeeccc-----eeCCC--CceeccchhhhHhH-HhhhHHH Confidence 010 00 000000000 00112221110 00000 00012233344432 2221111 Q ss_pred HH--HHHHHHhCcceeee-c---------hHHhcC----------CCCccccccCcchhhhhhcc--ccccccccccccc Q lcl|NC_016654. 292 SS--LMRDFRIGAGKVHA-S---------ESVLTN----------LGMGQGVSLDEEQEVYSRVG--SGGFNANGDMETI 347 (533) Q Consensus 292 s~--~~~~~~~~~~~i~v-~---------~~~l~~----------~~~~~~~~~d~~~~~~~~~~--~~~~~~~~~~~~~ 347 (533) +. +.|.-|.-.+|||- . +.+++. +....|. ...+.....++. +.+--.|+ .... T Consensus 269 DAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGe-v~ddrk~msMlEDyWLpRReGg-rgTE 346 (524) T protein:vir:72 269 DAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGK-IKNQQHNMSMTEDYWLQRRDGK-AVTE 346 (524) T ss_pred hhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCe-eccchhhhhhHhhhcccccCCC-cccc Confidence 11 22334666777763 1 111110 0111111 111111000000 01111122 2223 Q ss_pred eeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCC-Cc--chhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 348 FEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSD-EV--AQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTC 424 (533) Q Consensus 348 i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~-~~--~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~i 424 (533) |+++...-.. .-+..+..+.+.+....++|.+.+..++ ++ .--++||...+-.--.-+.+.+..|..-|.++++.- T Consensus 347 ItTLpGgqnl-gem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~q 425 (524) T protein:vir:72 347 VDTLPGADNT-GNMEDIRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIRELQHKFEEVFLDPLKTN 425 (524) T ss_pred eeeccccCCc-ChHHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 4544432222 2355667777888889999888884332 11 124567766666666778888888888888888877 Q ss_pred HHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHH-------HHHHHHH--hCCCCCHHHHHHHhCCCCCHHHHHHHHHH Q lcl|NC_016654. 425 LRVDAIKFPGKGAAPSEELELEWPKFARESDLAKA-------QTVQAWS--VASAASTKTKVAYLHEDWDDERVQEEADL 495 (533) Q Consensus 425 l~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a-------~~~~~l~--~aGi~S~et~v~~l~~~~~dee~~~El~r 495 (533) |.|....-..........+.++|...---.+...+ ..++++. -+..+|.+++.++.. -.+|+|.++|.++ T Consensus 426 LilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~IL-r~tDeei~~~~k~ 504 (524) T protein:vir:72 426 LLLKGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINMLTMAEPFIGKYISHRTAMKDIL-QMTDEEIEQEAKQ 504 (524) T ss_pred hhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHh-ccCHHHHHHHHHH Confidence 65522110000001124577777654433333333 3333332 122569998777654 4899999999999 Q ss_pred HHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 496 IDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 496 I~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) |++|....- + +.++ +++.| T Consensus 505 I~~E~k~~~--~-------~~~~----------~~~~~ 523 (524) T protein:vir:72 505 IEEESKEAR--F-------QDPD----------QEQED 523 (524) T ss_pred HHHHhhcCC--C-------CCCc----------hhhhc Confidence 999964210 0 0000 00000 No 238 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=90.88 E-value=0.019 Score=30.07 Aligned_cols=469 Identities=11% Similarity=0.053 Sum_probs=178.6 Q ss_pred CCC--CCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceee Q lcl|NC_016654. 1 MSL--PEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYH 78 (533) Q Consensus 1 ~~~--~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 78 (533) |+= |+.-..= |..++ ..|-.-.+---.++..|++. ....+. .|..+-+ .+.+...+-+-+| T Consensus 1 m~~~~~~~~~~t-pe~la------~~W~~~I~~a~~~~~~~h~r-----~~~~~k----~y~~~~~-~~~~~~~r~nl~~ 63 (663) T protein:vir:34 1 MNESQPTDFADT-PQGWA------QRWQEEMSAAREPLEKWHTQ-----GKEIVK----RYRDERD-SAHDAETRWNLFS 63 (663) T ss_pred CCccccccchhc-chhHH------HHHHHHHHHHHhccchHHHH-----HHHHHH----Hhhcccc-CCCccccccchhh Confidence 542 3211111 23322 12211111000011111110 000001 1111111 1111111112233 Q ss_pred cChHHHHHHHHHHhhcCCCceEeeCC----Cc----hHHHHHHHHHH------hhccHHHHHHHHHHHHhhhCCEEEEEE Q lcl|NC_016654. 79 APIPGVIAKLSTTELFSEQLKFLDAG----KS----KEVQARADLIF------NTPRFHSSLVEAGESCSALSGSFQRIV 144 (533) Q Consensus 79 ~n~~k~i~~~~a~ll~~e~~~i~~~~----~~----~~~~~~l~~i~------~~n~f~~~~~~~~~~~~~~G~~~~~~~ 144 (533) .|+-... =-|.+.+|.+++.. .+ ..+.+.+.+.+ ++++|...+...+..++..|.+.+++. T Consensus 64 sni~~i~-----P~iYar~P~p~V~~rf~d~d~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~v~ 138 (663) T protein:vir:34 64 TNIQTQM-----ASLYGQTPKVSVSRRFADADDDVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCRIR 138 (663) T ss_pred hhHHHHh-----hhhhcCCCcceeeecccCcccchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEEEE Confidence 3333322 22344556665532 12 23455666655 566799999999999999999999999 Q ss_pred EcC--------------CCC---------------CceEEEEEcCCeEE--E--EEecCCceEEEEEEEEeecC--Cc-- Q lcl|NC_016654. 145 WDP--------------TIA---------------DNAWIDFVDADRAI--P--EFRWGRLVAVTFWSELAGGD--GQ-- 187 (533) Q Consensus 145 ~D~--------------~~~---------------~~~~i~~v~~~~~~--P--~~~~g~~~~v~f~~~~~~~~--~~-- 187 (533) |-. .+. .+++|++|.-..|. | .|+ .+.-|+|....++.+ +. T Consensus 139 Ye~~~~~~~~~~~~~D~~~~~~~a~~~~~~e~~a~E~v~id~v~~~dfl~~pAr~W~--ev~wva~r~~mtk~e~~~rf~ 216 (663) T protein:vir:34 139 YEVEWEEVAGVDAILDEATGAELAAAVPPTQRKAYECVETDYLHWQDVLWSPARVWH--EVRWLAFRNLLDMREFNARFD 216 (663) T ss_pred eecccchhccccccCCCccccchhcccccchhhcccceeeeeechhhcccchhhccc--cccceeeeccCCHHHHHHhhc Confidence 822 111 15667666554442 2 232 333333322221110 00 Q ss_pred -eEEE--EEEEecCe----------------eEEEEEEeccCCcc-----cceeehhhccccccccccccccCCceeecC Q lcl|NC_016654. 188 -EVWR--HLERHESG----------------YIVHAVYKGTATSL-----GWMMALTDHPATRDIAVEGADEGRGAYVET 243 (533) Q Consensus 188 -~~y~--~lE~h~~~----------------~I~~~~y~~~~~~l-----G~~v~l~~~~~~~~~~~~~~~~~~~~~~~~ 243 (533) ..|. ..+.-..+ --..++|....... |..+.|.. .+..--.. T Consensus 217 ~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~VwEIWdK~~~~V~w~~eg~~~~L~~--------------~~p~lgl~ 282 (663) T protein:vir:34 217 ADGSRNLWASVPKVGKPKDGKDGQSCHPWDRAEVWEIWDKGGRKVDWYVEGYSAVLDT--------------QPDPLGLE 282 (663) T ss_pred CChhhhhhhhccCcCCccccCCCCCcchhcCcceeEEEecCCcEEEEEEcCcceeccc--------------CCCCCCCC Confidence 0000 00000000 00112222221110 11111110 00000011 Q ss_pred CCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHH----hcC--CCC Q lcl|NC_016654. 244 GVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESV----LTN--LGM 317 (533) Q Consensus 244 g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~----l~~--~~~ 317 (533) |.-|-+....|+..+. +-..+++|. -.+++.+++|.+-.++.--.+.-+.+-+.|.+. .+. ... T Consensus 283 ~ffPcPrpl~~~~~~d---------s~ipvpd~~-~y~~~~~E~n~~t~Rin~l~d~ikv~gvy~~~~g~~i~~~l~~a~ 352 (663) T protein:vir:34 283 SFFPCPKPLLANWTTD---------KVVPRPDFV-LAQDLYKEIDLVSTRITLLERAIRVVGVYDKSSGLTIGRLLSEAA 352 (663) T ss_pred CCCCCcccccceecCC---------CeecCCcHH-HHHHHHHHHHHHHHHHHHHHhhhhhceeeccccchhHHHHHHHhh Confidence 1111111122222211 334566776 478999999987666643334444444544211 000 000 Q ss_pred ccccccCcchhhhhhccccccccccccccceeeechhhhh---HHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHH Q lcl|NC_016654. 318 GQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFFQPAIRV---LEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATE 394 (533) Q Consensus 318 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~---e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tata 394 (533) .+...+-.+...+ .+. ++.+..|..+..+--+ .+....-..+...+++.+|++.-.=| .....+|||| T Consensus 353 ~n~lvpV~~~~~~-------~~~-gg~~k~I~~~pi~~~~~aI~~l~~~r~qir~d~~qITGiaDi~Rg-a~~a~ETatA 423 (663) T protein:vir:34 353 QNDLIPVENWLTF-------ADK-GGLRGVVDWFPLEPVVAALTSLRDYRRELVDALHQVTGMADIMRG-ASDPRETAMA 423 (663) T ss_pred CCCceecchhhhh-------hhh-cCccchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHhHHHHhhc-ccCcchhhHH Confidence 0011111111111 111 1222223333322111 11122223444566788898844333 2233567777 Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc--------CCCC-----------------CCceeEEEEeCC Q lcl|NC_016654. 395 ASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFP--------GKGA-----------------APSEELELEWPK 449 (533) Q Consensus 395 i~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~--------~~~~-----------------~~~~~v~i~f~d 449 (533) ...+.+-+-.++..++..+++.++++++..-++....|. +... .....+.|.=+- T Consensus 424 Q~IKsq~gS~RIqe~qdevqR~arDi~ql~AEIl~~~~~~etl~~m~~~elp~~~ei~~~~~~L~n~~~r~~~ldIe~ds 503 (663) T protein:vir:34 424 QGVKAKFGSIRLQRLQDEVARFASDIQRLKAEVIAEHYDVASILAQANAEFTFDKELAPKAAELIKSRFSMYRVEVKPEA 503 (663) T ss_pred HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHHhcCCCCcccchhHHHHHHhcCCCcceeeeeccCC Confidence 777778888999999999999999998876554332111 1111 133456677777 Q ss_pred CCCCCHHHHHHHHHHHHhCCCCCHHHHHH---HhCCCCCHHHHHHHHHHHHHhhhcccCcccc---ccccCCCCCCCCCC Q lcl|NC_016654. 450 FARESDLAKAQTVQAWSVASAASTKTKVA---YLHEDWDDERVQEEADLIDNANTVSAPTFGF---GTDQPPLPTENDPA 523 (533) Q Consensus 450 ~i~~d~~e~a~~~~~l~~aGi~S~et~v~---~l~~~~~dee~~~El~rI~~E~~~~~~~~~~---~~~~~~~~~~~~~~ 523 (533) .+.+|..++.+..+++.. ++-+--+.+. ..-|. +- ....| +.++...-.....+ ..+......+ +.+ T Consensus 504 T~~~D~~~eK~~~~E~l~-~i~~~~qq~~pl~~q~p~-~~-p~l~E---llk~~~~~f~~~~qie~ai~~~~~~~e-~aa 576 (663) T protein:vir:34 504 VSLQDFAALRNEKMEVLS-GIASFMQGVAPLAQQVPG-SA-PFLLQ---MLKWSVSGLRGSSTIEGVLDKAIAAAE-EAQ 576 (663) T ss_pred CCcCChHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhh-hH-HHHHH---HHHHHhhcCChhhhHHHHHHHHHhhhH-HHh Confidence 888888777776666553 2222222221 11111 00 01111 11111000000000 0000000000 000 Q ss_pred CC-CCCCCCCC Q lcl|NC_016654. 524 TD-PEAVDEGE 533 (533) Q Consensus 524 ~~-~~~~~d~~ 533 (533) .+ .+.++..+ T Consensus 577 ~~~~~~~pa~~ 587 (663) T protein:vir:34 577 KQAAQQSPAPQ 587 (663) T ss_pred hccCCCCcccc Confidence 00 01111111 No 239 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=90.81 E-value=0.019 Score=30.03 Aligned_cols=422 Identities=10% Similarity=-0.031 Sum_probs=162.4 Q ss_pred CCCC--CCcCCCcCcchHH-HHHHHHh-hhHhh----cC-CHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCC Q lcl|NC_016654. 1 MSLP--EANTAWPPPELAA-VTARVAE-SHVWW----EG-DLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATG 71 (533) Q Consensus 1 ~~~~--~~~~~~pp~~~~~-~~~~~~~-~~~w~----~g-d~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 71 (533) ||-= --|.|.+=+.+.. ....+.. ++.|. +| +|.+|.+..+....-....+..++. .+.. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~~----~m~e------- 69 (528) T protein:vir:10 1 MAAIVDIYGNPLRTQQLRKQQTAHLAGLAKEFANHPAKGLTPAKLAHILIEAEQGHLQAQAELFM----DMEE------- 69 (528) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCCCCCCHHHHHHHHHhhhCCCHHHHHHHHH----HHHh------- Confidence 3210 1111222221111 1111111 22221 12 4556655554333221222222211 1100 Q ss_pred cccceeecChHHHHHHHHHHhhcCCCceEeeCCC----chHHHHHHHHHHhh-ccHHHHHHHHHHHHhhhCCEEEEEEEc Q lcl|NC_016654. 72 RAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGK----SKEVQARADLIFNT-PRFHSSLVEAGESCSALSGSFQRIVWD 146 (533) Q Consensus 72 ~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~----~~~~~~~l~~i~~~-n~f~~~~~~~~~~~~~~G~~~~~~~~D 146 (533) + ..-+. .++++-..-+++-+-.|....+ +...-+++++++.+ ..|...+.. +..|..+|-.++-+.|. T Consensus 70 -~----D~~i~-s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~i~~-~lda~~~G~s~~Ei~w~ 142 (528) T protein:vir:10 70 -R----DAHLF-AEMSKRKRAVLGLDWTIEPPRNASAAEKADAEYLHELLLDLEGIEDLMLD-CMDGVGHGYSAIELDWS 142 (528) T ss_pred -h----ChHHH-HHHHHHHHHHhcCCceEecCCCCCHHHHHHHHHHHHHHhCCccHHHHHHH-HHhhhhhcceeEEEEEe Confidence 0 01122 2333444455665555544222 22344567777765 347766655 45577889999888886 Q ss_pred CCCCCceE---EEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcc Q lcl|NC_016654. 147 PTIADNAW---IDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHP 223 (533) Q Consensus 147 ~~~~~~~~---i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~ 223 (533) .+++ ... +.++++..|.. ...++. .++ .+.+ ..-|.++| T Consensus 143 ~~~g-~~~~~~~~~r~~~~f~~----------------~~~~~~-~l~--------------~~~~-~~~g~~l~----- 184 (528) T protein:vir:10 143 LQGR-EWLPQAFDHRPQSWFQL----------------NPDDQD-ELR--------------LRDN-SIAGEVLQ----- 184 (528) T ss_pred ecCC-ceeEEEeeeecccceee----------------ccCCCc-EEe--------------ccCC-CCCceeec----- Confidence 5432 211 22222211110 000110 000 0000 00011110 Q ss_pred ccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHH-hCc Q lcl|NC_016654. 224 ATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFR-IGA 302 (533) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~-~~~ 302 (533) +.-|++|+.... ...++|.+.+..+. ...--=...+..|+.=++ .|- T Consensus 185 ----------------------~~k~iv~~~~~~---------~g~p~g~gLlr~~~-w~~~fK~~~~~~w~~f~E~yG~ 232 (528) T protein:vir:10 185 ----------------------PFGWIMHKPRSR---------SGYVARSGLFRVLA-WPYLFKHYSTADLAEMLEIYGL 232 (528) T ss_pred ----------------------CCCeEEEeecCC---------CCCccccchHHHHH-HHHHHHHhhHHHHHHHHHHcCC Confidence 112344443211 13556776666543 222122334444444343 333 Q ss_pred ceeeechHHhcCCCCccccccCcchhhhhhcccccccccc--ccccceeeechh-hhhHHHHHHHHHHHHHHHHhhCCCh Q lcl|NC_016654. 303 GKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG--DMETIFEFFQPA-IRVLEHDQGAALLLREVLRKTGYSP 379 (533) Q Consensus 303 ~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~--~~~~~i~~~~~~-ir~e~~~~~l~~~l~~i~~~~g~s~ 379 (533) +..+. + +..+.. -.....+..++..-..++++ +....|+.++.. ...+.|.+.++.+=++|+..+ ++ T Consensus 233 P~~ig-----k-y~~~a~--~~ek~~L~~al~~i~~~~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i-LG- 302 (528) T protein:vir:10 233 PIRLG-----K-YPPGTP--DEEKVTLLRAVTGLGHAAAGIIPESMSIDFQEASKGSAEPFMAMMRWCDDSMSKAI-LG- 302 (528) T ss_pred CeEEE-----e-cCCCCC--HHHHHHHHHHHHHHhhCcEEEecCCceeEEeecCCCChhHHHHHHHHHHHHHHHHH-hh- Confidence 32222 1 111110 01112222222211111111 112234444432 233455556665555666555 33 Q ss_pred hhccc-CC---CcchhHHHHHHHhhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCC Q lcl|NC_016654. 380 VSLGL-SD---EVAQTATEASGKKDLTVKTTRAKARHFGSALG-PLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARES 454 (533) Q Consensus 380 ~~~g~-~~---~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~-~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d 454 (533) +|++- .+ +|..+.-++.. .-....++.-.+.+...|. +|++.++.+. + +....+...+.+.|+..-++| T Consensus 303 qtlTs~~~~g~~gS~Alg~vh~--~v~~di~~aDa~~i~~tln~~li~~l~~~N---~-~~~~~~~~~p~~~~~~~e~eD 376 (528) T protein:vir:10 303 GTLTSQTSESGGGAYALGQVHN--EVRHDLLAADARQLAATLSRDLLWPLLVLN---R-SGNLDARRAPRLVFDLKDRAD 376 (528) T ss_pred hhhhccccccccchhhhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhC---C-CCCCCccccceEEecCCCccc Confidence 33322 11 22222222321 1122334445566677775 5777776551 1 122233456789999999999 Q ss_pred HHHHHHHHHHHHhCCC-CCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 455 DLAKAQTVQAWSVASA-ASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 455 ~~e~a~~~~~l~~aGi-~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ..+.++.+++|+..|+ +|.+. +++.++ ....+..+++.. .+.......... ...+... .-.....+..++-+ T Consensus 377 l~~~a~~~~~L~~~G~~i~~~~-i~e~~g-ip~p~~~e~~~~--~~~~~~~~~~~~--~~~~~~~-~~~~~~~~~~~~~~ 449 (528) T protein:vir:10 377 LAAMATSLPPLVKLGVQVPVNW-VQEQLG-IPLPANGEAVLG--DQAGAGIAQLSR--RPGPRIA-ALAQVIGPRYRDQE 449 (528) T ss_pred HHHHHHHHHHHHhCCCCCCHHH-HHHHhC-CCCCCCCccccc--CCCcccccccCc--ccccccc-cccccccccccccc Confidence 9999999999999998 66664 555554 432211112111 111000000000 0000000 00000111111111 No 240 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=90.80 E-value=0.019 Score=30.02 Aligned_cols=429 Identities=13% Similarity=0.087 Sum_probs=149.0 Q ss_pred CCCCcC-CCcCcchHHHHHH--HHhhhHhhc-CCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceee Q lcl|NC_016654. 3 LPEANT-AWPPPELAAVTAR--VAESHVWWE-GDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYH 78 (533) Q Consensus 3 ~~~~~~-~~pp~~~~~~~~~--~~~~~~w~~-gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 78 (533) |-+--+ |=|+++..-+..+ ..-+++|=+ -+.++| + +...+. .+.++.. ++ T Consensus 1 ~~~~~~~~~p~~~~g~~~~~~~~~~~~~~~~~e~~~~l----------r---~~~~~~-ly~~m~e--------~D---- 54 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFGSGVVDGWTVWDPFEQTPEL----------Q---WPQSVA-VYSRMDN--------ED---- 54 (469) T ss_pred CCCcccCCCCccchhhhhhcccccchhhcccccccccc----------c---cccchH-HHHHHHh--------hC---- Confidence 211111 1233332222221 001111100 000111 0 000000 0001000 00 Q ss_pred cChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHh-----------------hccHHHHHHHHHHHHhhhCCEEE Q lcl|NC_016654. 79 APIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFN-----------------TPRFHSSLVEAGESCSALSGSFQ 141 (533) Q Consensus 79 ~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~-----------------~n~f~~~~~~~~~~~~~~G~~~~ 141 (533) .-+. .++++....|.+-+-+|.-.+++++.-+++.+.+. ...|...+.+.+..+..+|-.++ T Consensus 55 ~~i~-s~l~~rk~av~~~~w~v~p~~~~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~ 133 (469) T protein:vir:10 55 SRVT-SLLEAISLPIRSTPWRIRANGASDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVF 133 (469) T ss_pred hHHH-HHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceee Confidence 0011 11222222333433333322222222222222111 12366777777777888899999 Q ss_pred EEEEcCCCC---CceEEEEEcCCeEEEEEecCC-ceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCccccee Q lcl|NC_016654. 142 RIVWDPTIA---DNAWIDFVDADRAIPEFRWGR-LVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMM 217 (533) Q Consensus 142 ~~~~D~~~~---~~~~i~~v~~~~~~P~~~~g~-~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v 217 (533) -+.|..... +.+.+.-+ . |++.+ +....| ...++...++. ..-.+.+....|... T Consensus 134 Eivw~~~~~~~dG~~~~~~l-----~--~rp~~~i~~~~~----~~~~~l~~~~~--~~~~~~~~~~~~~~~-------- 192 (469) T protein:vir:10 134 EQVYRPRNQSPDGRFWLRKL-----A--PRPQWTISKFNV----APDGGLESIEQ--IAPPARTRGSLYVAN-------- 192 (469) T ss_pred eeeeecccccCCCceeeeee-----e--ecCcccceeeee----ccCCceeeeee--cCcccccccccccCC-------- Confidence 888864321 22222111 1 11111 100000 00111000000 000000000000000 Q ss_pred ehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 218 ALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD 297 (533) Q Consensus 218 ~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~ 297 (533) .+... ..+.-|+.|+.+.. ...++|.|.+..+.-. .--=+..+..|+.= T Consensus 193 ------------------~~~~~---lp~~k~i~~~~~~~---------~g~p~g~gLlr~~~~~-~~fK~~~~~~w~~f 241 (469) T protein:vir:10 193 ------------------IAPPE---IPVNRLVVYTRNKR---------PGQWQGKSILRSAYKH-WLLKDKLLRIEAAT 241 (469) T ss_pred ------------------CCccc---cccCcEEEEEecCC---------CCCcccchhHHHHHHH-HHHHHHHHHHHHHH Confidence 00000 01122455554321 2356787777765432 11223334444443 Q ss_pred HH-hCcceeeechHHhcCCCCccccccCcchhhhhhcccc--ccccc--cccccceeeechhhhhHHHHHHHHHHHHHHH Q lcl|NC_016654. 298 FR-IGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSG--GFNAN--GDMETIFEFFQPAIRVLEHDQGAALLLREVL 372 (533) Q Consensus 298 ~~-~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~--~~~~~--~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~ 372 (533) ++ .|-+..+. + +..+. .-+....+..++..- ..+++ -+....|+.++.......|.+.++.+-++|+ T Consensus 242 ~EryG~P~~vg-----k-y~~~a--~~~ek~~l~~a~~~~~~g~~a~~iip~~~~ie~~ea~g~~~~~~~li~~~d~~Is 313 (469) T protein:vir:10 242 AERNGMGIPVG-----T-ASSAT--DEDEVRKMAALARSVRGGINAGVGLAQGQILELLGVSGNLPDIRRAIEGHDRSIA 313 (469) T ss_pred HHHcCCcceEE-----e-cCCCC--CHHHHHHHHHHHHHHhcCCceEEEccCCceEEEeecCCCchHHHHHHHHHHHHHH Confidence 33 33322211 1 11111 011111222222210 11111 1122345566555555667777777767776 Q ss_pred HhhCCChhhcccC-CCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhccCCCCCCceeEEEEeCCC Q lcl|NC_016654. 373 RKTGYSPVSLGLS-DEVAQTATEASGKKDLTVKTTRAKARHFGSALG-PLSTTCLRVDAIKFPGKGAAPSEELELEWPKF 450 (533) Q Consensus 373 ~~~g~s~~~~g~~-~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~-~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~ 450 (533) ..+ ++. +++.+ .+|..+..++...- ....++.-.+.+...|. +|++-++.+ |.+ .....+.+.|+.. T Consensus 314 k~i-LG~-tlTs~~~gGS~a~~~vh~ev--~~d~~~sDa~~i~~tln~~li~~l~~l------N~g-~~~~~P~~~~~~~ 382 (469) T protein:vir:10 314 LSG-LAH-FLNLDGKGGSYALASVLEDP--FTQAVHAYATSICRIANQHIIEDLVDI------NFG-VDTPAPVLTFDPI 382 (469) T ss_pred HHH-hcc-cccccCccchhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHh------cCC-CCCCccEEEecCC Confidence 655 332 22222 22332222332222 22234445556667775 577776654 211 1223467888754 Q ss_pred CCCCHHHHHHHHHHHHhCCCCC----HHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCC Q lcl|NC_016654. 451 ARESDLAKAQTVQAWSVASAAS----TKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDP 526 (533) Q Consensus 451 i~~d~~e~a~~~~~l~~aGi~S----~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (533) -.+.+..++.+++|+++|++. .++++++.++ ..+.+-.+.+..-.+.++ .+....+.+........+..... T Consensus 383 -e~~~~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~g-ip~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 458 (469) T protein:vir:10 383 -GSRQDLTAAAVKLLYDAGVFDDDPAVKRAIRQRFN-LPSELNDTPSAEPEEPAA--VPNQSAAPARTRSSGNADARARA 458 (469) T ss_pred -CCcHHHHHHHHHHHHhcCCccCccccHHHHHHHhC-CCCCCCCcccccchhccc--CCCCCccccccCCCCCccccccc Confidence 456677899999999999842 3445666554 332211111111111111 11111111111111111111112 Q ss_pred CCCCCCC Q lcl|NC_016654. 527 EAVDEGE 533 (533) Q Consensus 527 ~~~~d~~ 533 (533) +..++++ T Consensus 459 ~~~~~~~ 465 (469) T protein:vir:10 459 PKADQGV 465 (469) T ss_pred CCChHHh Confidence 2222222 No 241 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=90.50 E-value=0.021 Score=29.84 Aligned_cols=428 Identities=12% Similarity=-0.001 Sum_probs=157.4 Q ss_pred hcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcc--cCCCCCcccceeecChHHHHHHHHHHhhcCC-----CceEe Q lcl|NC_016654. 29 WEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGR--TPTATGRAPKRYHAPIPGVIAKLSTTELFSE-----QLKFL 101 (533) Q Consensus 29 ~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e-----~~~i~ 101 (533) ++...+.|-. +.......+.++......+.+.+.. .....+.+..+.-=.-+...++.+|+-|.+- .+.|. T Consensus 1 m~~~~~~l~~--k~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 78 (514) T protein:vir:80 1 MRQQASAMWA--EYRDSTAIRKAEDFAKFTIASLMVDPLDKTHQAEVVEYDFQSAGAFLVNNLTAKLALTLFPPGRPSFQ 78 (514) T ss_pred CccchHHHHH--HhhcchHHHHHHHHHHHhcccccCCCCCCcccccccccccchhHHHHHHHHHHHHHhhhcCCCCcccc Confidence 3322222211 1111111222222222111111100 0011111222222345566666766655443 13444 Q ss_pred eCCCc-------------hHHHHH-------HHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCC Q lcl|NC_016654. 102 DAGKS-------------KEVQAR-------ADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDAD 161 (533) Q Consensus 102 ~~~~~-------------~~~~~~-------l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~ 161 (533) ....+ ..++++ +...+..++|...+.++.....+.|.+.++ .+++.. .+..++-. T Consensus 79 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~--~~~~~~---~~~~~pl~ 153 (514) T protein:vir:80 79 IELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFY--REPGTG---KMLVWTMQ 153 (514) T ss_pred cccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEE--EecCCC---cEEEEEcC Confidence 32211 123333 444577789999999999999999998754 565432 25566666 Q ss_pred eEEEEEe-cCCceEEEEEEEEeec--------------------CCceEEEEEEEecCe-eEEEEEEeccCCcccceeeh Q lcl|NC_016654. 162 RAIPEFR-WGRLVAVTFWSELAGG--------------------DGQEVWRHLERHESG-YIVHAVYKGTATSLGWMMAL 219 (533) Q Consensus 162 ~~~P~~~-~g~~~~v~f~~~~~~~--------------------~~~~~y~~lE~h~~~-~I~~~~y~~~~~~lG~~v~l 219 (533) .++-.-+ +|++..+++-.+++.. ++-.+|+++++.... .=.+.+|.. T Consensus 154 ~y~v~~d~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e----------- 222 (514) T protein:vir:80 154 SYTVRRTSHGDPAVVVLRQQMPFRELTPEIQADAQAKQIAKRDSDKCDLYTVIEWQPTPNGKRCAVWHE----------- 222 (514) T ss_pred eEEEeeCCCcCeEEEEeeeeecHHHhhhhhhhhhhhhhccCCCCCceEEEEEEEeecCCCCeEEEEEEe----------- Confidence 6555444 4788776654443210 000123333322110 000111110 Q ss_pred hhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHH- Q lcl|NC_016654. 220 TDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDF- 298 (533) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~- 298 (533) .. +....-+.+.+..-+.|++. +|... ..+.||+|--..++ +-+..|+..--...... T Consensus 223 --------~~------g~~i~~es~y~~~e~P~i~~-----Rw~~~-~ge~YGrgp~~~al-~D~k~L~~l~~~~l~~~~ 281 (514) T protein:vir:80 223 --------LE------GKRVGPESSYPAHLCPYVPV-----AWNVP-DGEHYGRGYVEEYS-GDFARLSILSERLGLYEF 281 (514) T ss_pred --------cc------ceeecccCccccccCCeeee-----eeEec-CCCCcccchHHHHH-HHHHHHHHHHHHHHHHHH Confidence 00 00000112222111223322 24322 24778998777665 55677775544333322 Q ss_pred HhCcceeeechH-HhcCCCCccccccCcchhhhhhccccccccccccccceeee----chhhhh-HHHHHHHHHHHHHHH Q lcl|NC_016654. 299 RIGAGKVHASES-VLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDMETIFEFF----QPAIRV-LEHDQGAALLLREVL 372 (533) Q Consensus 299 ~~~~~~i~v~~~-~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~i~~~----~~~ir~-e~~~~~l~~~l~~i~ 372 (533) ...+....|+++ .+.+..-. ....+.+.. |.. ..+..+ +.++.+ .+-++.+..-++... T Consensus 282 ~a~~~~~~v~~~g~~~~~~l~-----~~~~g~~v~--------g~~--~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF 346 (514) T protein:vir:80 282 EALSLLNLVDEAKGGAVDDYR-----DAETGDFVP--------GQV--GSVASYERGDYNKIAQASASVESIVMRLNRAF 346 (514) T ss_pred HhcCCCceeCcccccchhhhc-----ccCCceeec--------CCC--ccceeeecCcccchHHHHHHHHHHHHHHHHHH Confidence 334444455432 22111000 000011111 100 111111 122321 122333333333222 Q ss_pred HhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhccCC-CCCCceeEEEEeCCC Q lcl|NC_016654. 373 RKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAK-ARHFGSALGPLSTTCLRVDAIKFPGK-GAAPSEELELEWPKF 450 (533) Q Consensus 373 ~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~-~~~~~~al~~li~~il~l~~~~~~~~-~~~~~~~v~i~f~d~ 450 (533) +.. ... ..+...|||||..+.+...+..+-. .+.-...|..+++.++.+..-...+. ...+...+.+++--+ T Consensus 347 ml~----~~~--rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~il~r~~~g~lP~~p~~l~~~~~vs~ 420 (514) T protein:vir:80 347 MYT----GQV--RDAERVTVEEIRTVAEEAENLLGGVYSLLAETLQAPLAYLTMYEASRGNGGMLLGIAQGVYRPSIITG 420 (514) T ss_pred hhh----ccC--CCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCchhhcceeeec Confidence 211 111 1223359999998877766654442 22222334444554444432111111 122333444444222 Q ss_pred CCC-----C---HHHHHHHHHHHHhCC-----CCCHHHHHHHh---CC------CCCHHHHHHHHHHHHHhhhc-----c Q lcl|NC_016654. 451 ARE-----S---DLAKAQTVQAWSVAS-----AASTKTKVAYL---HE------DWDDERVQEEADLIDNANTV-----S 503 (533) Q Consensus 451 i~~-----d---~~e~a~~~~~l~~aG-----i~S~et~v~~l---~~------~~~dee~~~El~rI~~E~~~-----~ 503 (533) +.. + ....++.++.+..+. .+-.+.+++.+ .+ --++|+++.+.+|.++.+.. . T Consensus 421 la~l~r~~~~~~l~~~~~~i~~l~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~ 500 (514) T protein:vir:80 421 IPALTRNIETANILRATQEASAIVPALVQLSKRFDPEKLVERIFANNSVDLSTLSKDPDVVAAEAEQEAALAQQQLDVAS 500 (514) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHhCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHH Confidence 210 1 111122222221110 12233444332 11 01444444444443322211 1 Q ss_pred cCcc-ccccccCCCCC Q lcl|NC_016654. 504 APTF-GFGTDQPPLPT 518 (533) Q Consensus 504 ~~~~-~~~~~~~~~~~ 518 (533) ...+ +.+. .-.+. T Consensus 501 ~~~~~~~~~--~~~~~ 514 (514) T protein:vir:80 501 GALAAETSA--GVLTS 514 (514) T ss_pred HHHHHhhhc--cccCC Confidence 1111 1111 11111 No 242 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=90.50 E-value=0.021 Score=29.84 Aligned_cols=393 Identities=10% Similarity=-0.027 Sum_probs=155.9 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHH---HHHHHHhcccCCCCCccccee Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTK---AAYEAFHGRTPTATGRAPKRY 77 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~g~~~~~~ 77 (533) +.-| .+...++-...++.+|++=+..| . .+. ..-.+.... ..+.++. ++ T Consensus 1 v~~~---------~l~~e~at~~~~~d~~~~~~~~l---~--~~~--~~il~~a~~g~~~~y~~l~---------~D--- 52 (488) T protein:vir:99 1 MEKP---------ALGREIATSGDGRDITRPFISGL---Q--VPN--DSILQRRGGNDLRVYEEIL---------SD--- 52 (488) T ss_pred CCcc---------chhHHHHHHHhhhhhhccccCCC---C--CCC--hHHHHhhccCCHHHHHHHh---------hC--- Confidence 1111 12222222233333333211100 0 000 000000000 0111111 00 Q ss_pred ecChHHHHHHHHHHhhcCCCceEeeCCC---chHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceE Q lcl|NC_016654. 78 HAPIPGVIAKLSTTELFSEQLKFLDAGK---SKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIADNAW 154 (533) Q Consensus 78 ~~n~~k~i~~~~a~ll~~e~~~i~~~~~---~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~ 154 (533) .-+ ..++++.-.-+++-+-.|...++ +....+.+.+.++.-.|...+...+ .|..+|-+++-+.|..++ +.+. T Consensus 53 -~~i-~s~l~~rk~av~~~~w~i~p~~~~~~~~~~ae~v~~~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~-g~~~ 128 (488) T protein:vir:99 53 -AQV-KTVWGQRQLAVVSREWKVEAGGDRPIDQAAAEHLEQQLQRVGWDRVTSKML-FGVFYGYAVSELIYGRDD-RYIT 128 (488) T ss_pred -hHH-HHHHHHHHHHHhcCCceEEcCCCChHHHHHHHHHHHHHhCCCHHHHHHHHH-hhhhhcceeEEEEEeecC-Ceee Confidence 012 22334444556666666653322 2334577888888878888888776 578899999988886542 2222 Q ss_pred ---EEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccc Q lcl|NC_016654. 155 ---IDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVE 231 (533) Q Consensus 155 ---i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~ 231 (533) |.++++..|.+ ...++. .+ .......-|.++| T Consensus 129 ~~~l~~r~~~~f~~----------------d~~~~l-~~---------------~~~~~~~~g~~lp------------- 163 (488) T protein:vir:99 129 LEAIKVRNRRRFRY----------------DQDGGL-RL---------------LTPNNMFEGEPCP------------- 163 (488) T ss_pred Eeeeeeecccceee----------------cCCCce-EE---------------eccCCCCCccccc------------- Confidence 22222221110 000100 00 0000000011110 Q ss_pred ccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHH-HHHHHHHHHHHHHHHHH-hCcceeeech Q lcl|NC_016654. 232 GADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFP-TFHELDRIYSSLMRDFR-IGAGKVHASE 309 (533) Q Consensus 232 ~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~-lid~lD~~~s~~~~~~~-~~~~~i~v~~ 309 (533) .+.-|++++... ....|+|.+.+..+.-. +++ ...+..|+.=++ .|-+..+. T Consensus 164 -------------~~~~~i~~~~~~---------~~g~p~g~gLl~~~~w~~~fK--~~~~~~w~~f~E~yG~P~~ig-- 217 (488) T protein:vir:99 164 -------------APYFWHFSTGAD---------NDDEPYGLGLAHWLYWPVFFK--RNGIKFWLIFLDKFGMPTAVG-- 217 (488) T ss_pred -------------cCceEEEEeecC---------CCCCcccchHHHHHHHHHHHH--HhhHHHHHHHHHHcCCceeee-- Confidence 011122222211 11367788888765432 232 333444444343 33332221 Q ss_pred HHhcCCCCccccccCcchhhhhhcccccccccc--ccccceeeechh-hhhHHHHHHHHHHHHHHHHhhCCChhhcc-cC Q lcl|NC_016654. 310 SVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG--DMETIFEFFQPA-IRVLEHDQGAALLLREVLRKTGYSPVSLG-LS 385 (533) Q Consensus 310 ~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~--~~~~~i~~~~~~-ir~e~~~~~l~~~l~~i~~~~g~s~~~~g-~~ 385 (533) + +.+ .+........+..++..-..++++ +....|+.++.. ...+.|.+.++.+-++|+..+ ++ ++++ .+ T Consensus 218 ---k-y~~-~~a~~~ek~~l~~av~~~~~~~~~viP~~~~ie~~ea~~~~~~~~~~li~~~d~~Isk~i-LG-qtlts~~ 290 (488) T protein:vir:99 218 ---R-YDD-KTATPEDKAKLLAALHAIQTDSAIIMPAGMQAELLEAGRSGTADYKTLHDTMDATIAKVG-LG-QVASTQG 290 (488) T ss_pred ---e-cCC-CCCCHHHHHHHHHHHHHHhcCcEEEecCCceeEEeecCCCChHHHHHHHHHHHHHHHHHH-hh-hhhcccc Confidence 1 111 011111122223332221121111 112234444432 222345555555555555444 33 2333 22 Q ss_pred CCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHH Q lcl|NC_016654. 386 DEVAQTATEASGKKDLTVKTTRAKARHFGSALG-PLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQA 464 (533) Q Consensus 386 ~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~-~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~ 464 (533) .+|..+..++... -....++.-.+.+...|. +|++.++.+ |.. ....+.+.|...-++|..+.++.+.+ T Consensus 291 ~~Gs~a~~~vh~~--v~~d~~~aDa~~i~~tln~~li~~l~~~------N~~--~~~~p~~~~~~~e~edl~~~a~~~~~ 360 (488) T protein:vir:99 291 TPGRLGNDDLQAD--VRLDLVKADADLICESFNLGPARWLTEW------NFP--GAQPPRVYRVIEEPEDITAKAERDEK 360 (488) T ss_pred cccchhhHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHh------CcC--CcCCceeEecCCCcccHHHHHHHHHH Confidence 2332222233222 233334455566677774 577766654 221 12346778888888999999999999 Q ss_pred HHhC-CCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 465 WSVA-SAASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 465 l~~a-Gi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ++.. |+--.+..+++.++ ...++..++ .. .+.... ....++.+..+... T Consensus 361 l~~~~G~~i~~~~i~e~~G-ip~~~~~~~-------~~--~~~~~~----------~~~~~~~~~~~~~~ 410 (488) T protein:vir:99 361 VFRMSGFRPTRGYVQETYG-VEVESTQAE-------AT--APTPST----------EFAEGDQPSDPAAA 410 (488) T ss_pred HHhhcCCCCCHHHHHHHcC-CCCcccccc-------cc--cCCCcc----------cCCCCCCCCCchHH Confidence 9985 87333445666554 443221111 00 011000 00011111111111 No 243 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=90.03 E-value=0.023 Score=29.56 Aligned_cols=452 Identities=12% Similarity=0.069 Sum_probs=187.0 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccC-c-chhhHHHHH-----------HHHHHHHHhcccC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEG-R-TSPSGIKAR-----------TKAAYEAFHGRTP 67 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~-~-~~~~~~~~~-----------~~~~~~~~~~~~~ 67 (533) |-|+..++ .+..+.-|.. -|-.+.++-.+... . ..|....+. +.|.+..++.+.. T Consensus 1 ~~~~~~~~---------~l~~~~~~~~---~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~y~~~e 68 (524) T protein:vir:98 1 MNFLGFGN---------VLSFFKNFAR---EDEIELEQQLKNDTGSVAPPKNNDGAYEIETDLNNQKYAGVFQQFYSGQD 68 (524) T ss_pred CCCcchhh---------HHHHhhhhhh---hhhhhHhhhhcCCcccccCCCCCCCceeecCCCCcceecceeeeeccccc Confidence 66664442 3333333322 22222222111110 0 001111000 1111111111110 Q ss_pred CCCCcc----cce--e-ecChHHHHHHHHHHh-hcC----CCceEeeCCCc--h----HHHHHHHHHHhhccHHHHHHHH Q lcl|NC_016654. 68 TATGRA----PKR--Y-HAPIPGVIAKLSTTE-LFS----EQLKFLDAGKS--K----EVQARADLIFNTPRFHSSLVEA 129 (533) Q Consensus 68 ~~~g~~----~~~--~-~~n~~k~i~~~~a~l-l~~----e~~~i~~~~~~--~----~~~~~l~~i~~~n~f~~~~~~~ 129 (533) ...... +.+ + ..+-+.-.++..++= ++. +|+++.++..+ + ...+..+.|++-=+|++..++. T Consensus 69 ~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaIv~~~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~ 148 (524) T protein:vir:98 69 PAIQNKEQLINTYRGIMSYPEVENAVSEIIDDAIVNEQGKDIITMDLAKTNFSKAIQDKIVEEFDNVLNIYDFDNMGARL 148 (524) T ss_pred cccchHHHHHHHHHHHhhccchhhHHHhhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHH Confidence 000000 000 0 111222222222221 121 25555554322 1 2445566677777899999999 Q ss_pred HHHHhhhCCEEEEEEEcCCCCCc-eEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEec Q lcl|NC_016654. 130 GESCSALSGSFQRIVWDPTIADN-AWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKG 208 (533) Q Consensus 130 ~~~~~~~G~~~~~~~~D~~~~~~-~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~ 208 (533) .....+.|..|++..+|++...+ ..+..++|.++-++...- ... ..++..++ .++..|.+|.- T Consensus 149 fR~WYVDgRi~fhkiid~~~~kGI~ELr~lDPr~i~~vr~~~--------~~~-~~~~~~v~-------~~~~e~f~Y~~ 212 (524) T protein:vir:98 149 FRDWYVDSRIYFHKIMHKDESKGIRELRQLDPRCMELIRESI--------TET-LDGGVKVF-------RGYREFFVYSA 212 (524) T ss_pred HhhhhhcceeEEEEEEcCCCCcceeeeeeeCCccceeeeecc--------ccc-cccchhhc-------cceeeeeeecc Confidence 99999999999999998654332 345567777766542110 000 00111111 12223333431 Q ss_pred cCC--cc-cceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHH Q lcl|NC_016654. 209 TAT--SL-GWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFH 285 (533) Q Consensus 209 ~~~--~l-G~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid 285 (533) ... +. |.-.+. +....| +.-.+.|..-+. ..+.+ .+ +|-+-.||+++ . T Consensus 213 ~~~~~~~~g~~~~~----------------~~~ikI----~~dAIvy~hSGL-----~d~~~--~i-isyLhkAiKp~-N 263 (524) T protein:vir:98 213 PKAGYTYNGQIYQA----------------NQKIKI----PRSAIVYAHSGL-----EDCSN--NI-IGYLHRAVKPA-N 263 (524) T ss_pred CCCccccccceecC----------------CCceee----chhheeeeccCc-----ccCCC--Ce-eeehhHhhHhH-H Confidence 110 00 000000 000000 001122222111 00110 01 13333455442 2 Q ss_pred HHHHHHHH---HHHHHHhCcceeee-c---------hHHhcC----------CCCccccccCcchhhhhhcc--cccccc Q lcl|NC_016654. 286 ELDRIYSS---LMRDFRIGAGKVHA-S---------ESVLTN----------LGMGQGVSLDEEQEVYSRVG--SGGFNA 340 (533) Q Consensus 286 ~lD~~~s~---~~~~~~~~~~~i~v-~---------~~~l~~----------~~~~~~~~~d~~~~~~~~~~--~~~~~~ 340 (533) .| +++.+ +.|.-|.-.+|||- . +.+++. +....|.. ..++....++. +.+--. T Consensus 264 QL-km~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev-rddrk~msMlEDyWLpRRe 341 (524) T protein:vir:98 264 QL-RLLEDAMVIYRITRAPERRVFYIDVGQMGGNKATQYVNNIAQGLKNRVVYDARTGTV-KNQQNNLSMTEDYWLMRRD 341 (524) T ss_pred hh-HHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeeccCcee-eccccccchhhhhcccccC Confidence 22 11111 22334666778873 1 111110 00111111 11111100000 011111 Q ss_pred ccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCC--cchhHHHHHHHhhhHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 341 NGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDE--VAQTATEASGKKDLTVKTTRAKARHFGSALG 418 (533) Q Consensus 341 ~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~--~~~Tatai~~~~~~l~~~~~~~~~~~~~al~ 418 (533) |+ ....|++....-.. .-+.-+..+.+.+....++|.+.++.+.+ +.--++||...+-.--.-+.+.+..|..-|. T Consensus 342 Gg-rgTEItTLpggqnl-gem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~ 419 (524) T protein:vir:98 342 GK-AITEVSTLPGGQNF-SDMDDIKWFNRKLYEALRVPLSRMPRDDGGMQIGGGGEITRDELKFSKFIRTLQIQFSPVLS 419 (524) T ss_pred CC-CccceeeccccCCc-ChHHHHHHHHHHHHHHhCCCceeccCCCCccccccccchhHHHHHHHHHHHHHHHHHHHHHH Confidence 22 22234444332222 23556677778888888898888753322 1222556666666666778888888888888 Q ss_pred HHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHH-------HHHHHHHHh-CC-CCCHHHHHHHhCCCCCHHHH Q lcl|NC_016654. 419 PLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAK-------AQTVQAWSV-AS-AASTKTKVAYLHEDWDDERV 489 (533) Q Consensus 419 ~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~-------a~~~~~l~~-aG-i~S~et~v~~l~~~~~dee~ 489 (533) ++++.-|.|....-..........+.++|...---.+... +..++++.. .| ..|.++..++.. -.+|+|. T Consensus 420 ~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~IL-r~tDeei 498 (524) T protein:vir:98 420 DPLKTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQVEGVVGKYVSHKYIMKEIL-RMSDEDI 498 (524) T ss_pred HHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhccccccccchHHHHHHHh-ccCHHHH Confidence 8888766552211000000012347777764433333322 333333321 23 689998776654 4899999 Q ss_pred HHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 490 QEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 490 ~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +++.++|++|.... -+ + ++..+++| T Consensus 499 ~~~~k~I~~E~k~~--~~-------~----------~p~~e~~~ 523 (524) T protein:vir:98 499 DEQAKLIEEESKEE--RF-------K----------NPEAEEEN 523 (524) T ss_pred HHHHHHHHHHHhCC--CC-------c----------CCcccccc Confidence 99999999886321 00 0 01111111 No 244 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=89.29 E-value=0.027 Score=29.17 Aligned_cols=343 Identities=8% Similarity=0.033 Sum_probs=131.6 Q ss_pred hHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCccc-ceeecChHH--HHHHHHHHhhcCCCceEee Q lcl|NC_016654. 26 HVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAP-KRYHAPIPG--VIAKLSTTELFSEQLKFLD 102 (533) Q Consensus 26 ~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~-~~~~~n~~k--~i~~~~a~ll~~e~~~i~~ 102 (533) =.|| + .|. .+....+..+. ..+........|... .+..+..+. ..|+.+|+-+-+-|. .. T Consensus 1 M~~~----~---~f~-~r~~~~~~~~~-------~~~~~~~~~~~~~~v~~~~al~~~av~~cv~~ia~~ia~~p~--~~ 63 (359) T protein:vir:10 1 MSIL----N---PFE-RRSSITPNNYY-------PFMVQNGSIVPNSLVDATEALKNSDLYAVTSLISSDIAGTRF--IG 63 (359) T ss_pred Cccc----c---hhh-ccccCCCCcch-------hhhhccccccCCcccCHHHhhcchHHHHHHHHHHHhhhcCcc--cc Confidence 1111 1 111 11111111111 111111111111111 011222222 344555555544442 11 Q ss_pred CCCchHHHHHHHHHHhhccHH---HHH-HHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEE Q lcl|NC_016654. 103 AGKSKEVQARADLIFNTPRFH---SSL-VEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFW 178 (533) Q Consensus 103 ~~~~~~~~~~l~~i~~~n~f~---~~~-~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~ 178 (533) +.. +..++..-+-. ..+ ...+...+..|.+|+.+..|..+. -..+..++++.+.+..+++.+ |+ T Consensus 64 ---~~~----~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~-~~~l~~l~~~~v~i~~~~~~~----~y 131 (359) T protein:vir:10 64 ---NQV----FTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDNSL-MKELRLIPSNAITIDLTDDTL----TY 131 (359) T ss_pred ---chH----HHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEECCCCe-EEEEEEeCCceEEEEEcCCeE----EE Confidence 111 22222222211 122 233334455688988887775432 234556677766655444321 11 Q ss_pred EEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcc Q lcl|NC_016654. 179 SELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTP 258 (533) Q Consensus 179 ~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~ 258 (533) . ++...+. .+..++- + -+.|+..... T Consensus 132 ~-------------------------~~~~~~~-~~~~~~~----------~------------------evih~~~~~~ 157 (359) T protein:vir:10 132 E-------------------------VNQFDDY-PSAKYNA----------S------------------EMIHVKIMAY 157 (359) T ss_pred E-------------------------EEecCCc-eEEEEcc----------c------------------ceEEeccCCC Confidence 1 1000000 0000000 0 0122221111 Q ss_pred cccccccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhCc-ceeee--chHHhcCCCCccccccCcchhhhhhcc Q lcl|NC_016654. 259 NPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIGA-GKVHA--SESVLTNLGMGQGVSLDEEQEVYSRVG 334 (533) Q Consensus 259 ~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~~-~~i~v--~~~~l~~~~~~~~~~~d~~~~~~~~~~ 334 (533) +.+. .....|.|.+..+ ...+. +.....++... |+.|. +.-++ |...+.. .. .+.-.+.+.... T Consensus 158 ~~~~----~dg~~G~spi~~~-~~~i~-~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~---e~---~~~~~~~~~~~~ 225 (359) T protein:vir:10 158 GVDT----LHNLVGHSPLESL-TSEIG-QQKEANRLSLSTLKGALNPTSVVKVPQGTLSS---EA---KDSIRKEFEKAN 225 (359) T ss_pred CCCc----cCccccccHHHHH-HHHHH-HHHHHHHHHHHHHhccCCcceEEEeCCCCCCH---HH---HHHHHHHHHHHh Confidence 1000 0123477766543 34443 23333334333 45543 22222 2111110 00 011111222221 Q ss_pred cccccccc----ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHH-HHHHHH Q lcl|NC_016654. 335 SGGFNANG----DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTV-KTTRAK 409 (533) Q Consensus 335 ~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~-~~~~~~ 409 (533) .+ .++++ +....++.++......++++..+...+.|+...|+||+.+|...+...|...++..+...+ .+.... T Consensus 226 ~~-~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~e~~~~~~l~~~l~p~ 304 (359) T protein:vir:10 226 GG-NNSGRVMVLDQSADFSTVSINADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQSSLDQIKDLYVNALNRFIEPL 304 (359) T ss_pred Cc-cccCCceecCCCcceeeecCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCcccccHHHHHHHHHHHHHHHHHHH Confidence 11 11111 1223355555555666788888888999999999999999864444445555544332221 112222 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHH Q lcl|NC_016654. 410 ARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERV 489 (533) Q Consensus 410 ~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~ 489 (533) ...++..|.. . +.++....+-.|.......+.+++.+|+|+..++.+.+. ... + T Consensus 305 ~~~l~~~l~~---~-------------------~~~~~~~~~~~d~~~~~~~~~~~~~~G~~t~NE~R~~l~--~~p--v 358 (359) T protein:vir:10 305 ISELRIKCDS---S-------------------IGVDMSPITDYSNSVFKADILNWVKEGIIEPTEAKTLLE--SKG--I 358 (359) T ss_pred HHHHHHHhhh---h-------------------hcccchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhC--CCC--C Confidence 2222221111 0 011111112223445556677788899999888766541 111 1 Q ss_pred H Q lcl|NC_016654. 490 Q 490 (533) Q Consensus 490 ~ 490 (533) . T Consensus 359 ~ 359 (359) T protein:vir:10 359 I 359 (359) T ss_pred C Confidence 1 No 245 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=87.30 E-value=0.04 Score=28.27 Aligned_cols=376 Identities=9% Similarity=-0.003 Sum_probs=144.8 Q ss_pred cCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcc---cceeecChHHHHHHHHHHhhcCCCceEeeCCCc Q lcl|NC_016654. 30 EGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRA---PKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKS 106 (533) Q Consensus 30 ~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~---~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~ 106 (533) +|=.++|..+....... ......++.......|.. ..-+...-...+++.+|+-+-+=|..+--.+.. T Consensus 1 MGl~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~v~~~~g~ 71 (394) T protein:vir:62 1 MGLRDRFSNYLFKKAEK---------RGYLDNVLGKSIRYSGVYVTDSNILQSSDVYELLQDISNQMVLADIVVEDEFGN 71 (394) T ss_pred CchhhhhhhhccCCCCc---------hhhhhhhhhcccccCccccChhhhhccHHHHHHHHHHHHhhcccceEEEcCCCc Confidence 33222221110000000 011122222222111111 111333445566666666665555444322221 Q ss_pred hHHHHHHHHHHhhccH----HHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEe Q lcl|NC_016654. 107 KEVQARADLIFNTPRF----HSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELA 182 (533) Q Consensus 107 ~~~~~~l~~i~~~n~f----~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~ 182 (533) ......+..++..-+- ..-....+...+..|.+|+.+ +.+. +. -+..+.|+.+. T Consensus 72 ~~~~~~~~~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i--~~~~-----~~--~~~~~~~~~~~------------- 129 (394) T protein:vir:62 72 EIKDDIALQILRNPNNYLTQSEFIKLMTNTYLLEGETFPIL--NGAQ-----IH--LASNVFTELDD------------- 129 (394) T ss_pred ccchhhHHHHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEE--ecce-----ee--ccccceEEECC------------- Confidence 1112223333332111 122233444556667777654 3211 11 11233332211 Q ss_pred ecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccc Q lcl|NC_016654. 183 GGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEW 262 (533) Q Consensus 183 ~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~ 262 (533) ...+ .|... |..++-+ -+.|+.... + T Consensus 130 ----~~~~--------------~~~~~----~~~~~~~----------------------------eiih~r~~~----~ 155 (394) T protein:vir:62 130 ----NLVE--------------HFNIG----GHEIPPC----------------------------MIRHVKNIG----A 155 (394) T ss_pred ----ceEE--------------EEeeC----CEEechh----------------------------heEEecCcC----C Confidence 0010 01100 1111100 012222110 1 Q ss_pred cccccccccccchhhhhHHHHHHHHHHHHHHHHHH-HHhC-cceeeechHHhcCCCCcc--ccccCcchhhhhhcccccc Q lcl|NC_016654. 263 RHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRD-FRIG-AGKVHASESVLTNLGMGQ--GVSLDEEQEVYSRVGSGGF 338 (533) Q Consensus 263 ~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~-~~~~-~~~i~v~~~~l~~~~~~~--~~~~d~~~~~~~~~~~~~~ 338 (533) . ..+|.|.+..+ ..+|.. +....++... ++.| ....++ ...+... ....+.....|.....+.. T Consensus 156 d-----~~~G~s~~~~~-~~~i~~-~~~~~~~~~~~~~ng~~~~~il-----~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 223 (394) T protein:vir:62 156 D-----HLRGKGILDLG-RDTLEG-VMSAEKTLTDKYKKGGLLTFLL-----NLDAHINPQNGAQSKLINAILDQLESID 223 (394) T ss_pred C-----CccccChHHHH-HHHHHH-HHHHHHHHHHHHHccCCcceEE-----EeCCCCCcCHHHHHHHHHHHHHHhcccc Confidence 1 12477776643 344433 2333333333 4554 332222 1111100 0000000111211111111 Q ss_pred ccc------cccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHH Q lcl|NC_016654. 339 NAN------GDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARH 412 (533) Q Consensus 339 ~~~------~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~ 412 (533) +.+ ++.+..+..++......++++..+...++|+...|+||..+|.... .+.++. .+.. T Consensus 224 n~g~~~vl~~g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~--sn~e~~-------------~~~~ 288 (394) T protein:vir:62 224 EARSVKMIPLGKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTELIK--EDIEKA-------------MMYI 288 (394) T ss_pred ccCceeEeeCCCceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC--cCHHHH-------------HHHH Confidence 111 1112223445555566778888888899999999999999974221 111111 2333 Q ss_pred HHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhC-CCCCHHHHHH Q lcl|NC_016654. 413 FGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLH-EDWDDERVQE 491 (533) Q Consensus 413 ~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~-~~~~dee~~~ 491 (533) ++.+|..++..+..-.+..+.... ....+.|+|+.....+..++++.+.+++.+|+|+..++.+.+. +..++++... T Consensus 289 ~~~~l~P~~~~ie~~l~~kll~~~--~~~~~~~~fd~~~~~~~~~~~~~~~~~~~~g~~T~NE~R~~~gl~p~~~~~gd~ 366 (394) T protein:vir:62 289 HNKAVRPIMKNFEDHLSLLFYAQN--SGKRIKFKINILDFVTYSNKTNIGYNLVRTAITSPDNVADMLGFPKQNTKESQA 366 (394) T ss_pred HHHHHHHHHHHHHHHHhhhhcCcc--ccCceEEEechhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCe Confidence 445555555555433333222221 2245788898887778888899999999999999999766542 1111111111 Q ss_pred HHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 492 EADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVD 530 (533) Q Consensus 492 El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (533) +.--... .+ . +.....++...+|+ +.++ T Consensus 367 ----~~~~~n~-~~-~----~~~~~~~~~~kgge-~~en 394 (394) T protein:vir:62 367 ----IYISNDV-TE-I----GKKEATDGSLGGGE-ENEN 394 (394) T ss_pred ----eeccccc-cc-c----cccccccccCCCCC-CCCC Confidence 0000000 00 0 00000011111111 1111 No 246 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=82.60 E-value=0.075 Score=26.74 Aligned_cols=367 Identities=10% Similarity=-0.023 Sum_probs=135.2 Q ss_pred cCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHH Q lcl|NC_016654. 30 EGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEV 109 (533) Q Consensus 30 ~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~ 109 (533) +|=. ..+++...... ..+ .+. .+.. .. ....+...--..+++.+|+-+.+-|..+--.+ ... T Consensus 1 Mg~f---~~~f~~~~~~~-~~~----~~~---~~~~-~~----~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~--~~~ 62 (385) T protein:vir:95 1 MGLF---DSVFKRHSELS-WMY----DLE---FLQD-KS----KKAYLKQIALNTVVEMVARTISQSEFRVMKNN--TKE 62 (385) T ss_pred Cchh---hhhhccCcccc-ccc----chh---hhhc-cc----hhhhhhhHHHHHHHHHHHHHHcccceeeeecC--ccc Confidence 4422 22332211110 000 000 0000 00 01112222234556666665555454332111 111 Q ss_pred HHHHHHHHhh--cc---HHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeec Q lcl|NC_016654. 110 QARADLIFNT--PR---FHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGG 184 (533) Q Consensus 110 ~~~l~~i~~~--n~---f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~ 184 (533) ..-+..+|.. |. .......++...+-.|.+|+.+ +.++.. + .+..+.+....+ +..-.|.. T Consensus 63 ~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~--~~~~~~-~-----~~~~~~~~~~~~-~~~~~~~~----- 128 (385) T protein:vir:95 63 KGTLYYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVK--NDEGHF-F-----VADDFEKEDELG-LYSHRFTN----- 128 (385) T ss_pred cchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEE--ecCCCe-e-----eccccccccccc-ccccccee----- Confidence 2234444431 11 1222333445555567776543 333221 1 111111100000 00000000 Q ss_pred CCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccc Q lcl|NC_016654. 185 DGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRH 264 (533) Q Consensus 185 ~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~ 264 (533) .-. ..+.+. ..++ .--+.|+..... T Consensus 129 --------~~~-~~~~~~------------~~~~----------------------------~~eiih~~~~~~------ 153 (385) T protein:vir:95 129 --------VLV-NDFEFK------------RVFT----------------------------MDDVIYLKYNNQ------ 153 (385) T ss_pred --------eee-ccccee------------eeec----------------------------cccEEEecCCCC------ Confidence 000 000000 0000 001223322111 Q ss_pred cccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCCccc--cccCcchhhhhhcccccccccc Q lcl|NC_016654. 265 DPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGMGQG--VSLDEEQEVYSRVGSGGFNANG 342 (533) Q Consensus 265 ~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~~--~~~d~~~~~~~~~~~~~~~~~~ 342 (533) ....+|.|.+..+ ...++..++... ..+..+-++ ........ ...+..+..+.....+..+.++ T Consensus 154 --~~~~~G~s~~~~~----~~~i~~~~~~~~---~~~~~~g~l-----~~~~~~~~~~e~~~~~~~~~~~~~~g~~~~~~ 219 (385) T protein:vir:95 154 --KLDAFSLGLFEDY----GEIFGRMIDLQM---LNNQIRGIL-----KVDATKFYNKEKQKELQAYIDTLFDAFQNNTI 219 (385) T ss_pred --CcccccchHHHHH----HHHHHHHHHHHH---hcCCCceEE-----EeCCccCCCHHHHHHHHHHHHHHhhhhhhcCC Confidence 1123466665443 233344444332 223322222 11000000 0000001111111111101111 Q ss_pred -----ccccceeeech------hhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHH Q lcl|NC_016654. 343 -----DMETIFEFFQP------AIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKAR 411 (533) Q Consensus 343 -----~~~~~i~~~~~------~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~ 411 (533) +....++.++. .....++.+..+...++|+...|+||..++.. ..++++ .... T Consensus 220 ~i~~l~~g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~---~sn~e~-------------~~~~ 283 (385) T protein:vir:95 220 AVVPLTEGLAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVLGE---MADLEK-------------TIES 283 (385) T ss_pred ceEEcCCCceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhcCC---CcCHHH-------------HHHH Confidence 01111233321 12346788888888999999999999998521 112222 2334 Q ss_pred HHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHH Q lcl|NC_016654. 412 HFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQE 491 (533) Q Consensus 412 ~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~ 491 (533) .++.+|..++..+....+..+..........+.++++.-+..|..+.++.+.+++.+|+|+..++.+.+ ++.. T Consensus 284 ~~~~~l~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~R~~~--g~~p----- 356 (385) T protein:vir:95 284 YLQFCINPLLRKIEAELNSKFFYQDEYLNDDMHIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQVRIMT--GEEP----- 356 (385) T ss_pred HHHHHHHHHHHHHHHHHHhhcCChhhcccceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh--CCCC----- Confidence 455555555555544333333222222233577777788888999999999999999999999977654 2221 Q ss_pred HHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 492 EADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 492 El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +..+.+ +..-...+..+. +..+.+++++| T Consensus 357 ----~~~~~g---d~~~~~~n~~~~------~~~kgge~~~e 385 (385) T protein:vir:95 357 ----ADDPEL---DKFIITKNLQSA------DAFKGGESNEE 385 (385) T ss_pred ----CCCCCC---ceeeecccceec------ccccCCCCCCC Confidence 000100 000000111111 00112222222 No 247 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=78.32 E-value=0.12 Score=25.72 Aligned_cols=313 Identities=12% Similarity=0.023 Sum_probs=103.7 Q ss_pred ceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeE Q lcl|NC_016654. 172 LVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAA 251 (533) Q Consensus 172 ~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 251 (533) +.|+++- ..++......+.+..+-++.+-.|..++ .+ +.+...+ . ...+. ++ ....-|+. T Consensus 1 v~Eivw~----~~~g~~~~~~l~~r~~~~~~~f~~~~~~-~l---~~~~~~~--------~-~g~~~--~~-lp~~kfi~ 60 (355) T protein:vir:78 1 MFEQVYR----IENGRARLGKLAWRPPRTISRFDVAPDG-GL---VAIEQWG--------V-FGKAT--VR-IPVDRLVV 60 (355) T ss_pred CeEEEEE----eeCCeEEEeeeeecCccceeeeeeccCC-ce---eEEEecC--------C-CCCCc--ce-eccCCEEE Confidence 2222221 1111111122222222222221111111 10 0000000 0 00000 00 01122455 Q ss_pred EecCCcccccccccccccccccchhhhhHHH-HHHHHHHHHHHHHHHHH-h--CcceeeechHHhcCCCCcc--cccc-C Q lcl|NC_016654. 252 YVPNVTPNPEWRHDPKLRYLGRADLSTDLFP-TFHELDRIYSSLMRDFR-I--GAGKVHASESVLTNLGMGQ--GVSL-D 324 (533) Q Consensus 252 ~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~-lid~lD~~~s~~~~~~~-~--~~~~i~v~~~~l~~~~~~~--~~~~-d 324 (533) |+.... ...++|.+.+..+.-. +++ ...+..|+.=++ . +-+....|...-..+.+.. .... + T Consensus 61 ~~~~~~---------~g~p~G~gLlr~~~w~~~fK--~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~ 129 (355) T protein:vir:78 61 FVNERE---------GANWLGQSLLRQAYKNWLLK--DRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLND 129 (355) T ss_pred EEeCCC---------CCCccchhhHHHHHHHHHHH--HhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHH Confidence 553321 2356788777665432 222 233333333332 2 2122222211000000000 0000 0 Q ss_pred cchhhhhh---ccccccccc--cccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhccc---CCCcchhHHHHH Q lcl|NC_016654. 325 EEQEVYSR---VGSGGFNAN--GDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGL---SDEVAQTATEAS 396 (533) Q Consensus 325 ~~~~~~~~---~~~~~~~~~--~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~---~~~~~~Tatai~ 396 (533) ........ +..+. +++ -+....|+.+........+...++.+=++|+... ++. ++.. +.+|..+.-++. T Consensus 130 ~~~~l~~~~~~i~~g~-~a~~iip~g~~ie~~ea~g~~~~~~~~i~~~d~~Isk~i-LGq-tlTs~~~~~gGS~Alg~vh 206 (355) T protein:vir:78 130 QKEEGLQLAKEFRAGE-AAGGYIPHGANFTLTGVQGKLPEMDGPIRYHDEQIARAV-LAH-FLTLGGDKSTGSYALGDTF 206 (355) T ss_pred HHHHHHHHHHHhhCCc-ceeEeecCCceEEEeecCCCcccHHHHHHHHHHHHHHHH-hhh-hhccccCCccchhhHHHHH Confidence 00011111 11110 000 0111234444433333345555665555665555 332 2221 122322223332 Q ss_pred HHhhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCH-- Q lcl|NC_016654. 397 GKKDLTVKTTRAKARHFGSALG-PLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAAST-- 473 (533) Q Consensus 397 ~~~~~l~~~~~~~~~~~~~al~-~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~-- 473 (533) ..- ....++.-.+.+...|. +||+.++.+ |.+. ....+.+.|+. ..++..+.++.+++++.+|++-. T Consensus 207 ~~v--~~~~~~aD~~~i~~~ln~~li~~l~~l------N~~~-~~~~P~~~~~~-~~~~~~~~a~~~~~l~~~G~~~~~~ 276 (355) T protein:vir:78 207 ASF--FTGSLNAVMKHIADVTQQHVVEDLVDQ------NWGP-EEPAPRLVPAQ-LGKEQPVTAEAIRALVECGAFTADP 276 (355) T ss_pred HHH--HHHHHHHHHHHHHHHHHHHHHHHHHHh------cCCC-CCCCCEEEecC-cChhHHHHHHHHHHHHhCCCccccH Confidence 222 22333444455666774 577766654 2111 22346778864 56677788999999999998543 Q ss_pred --HHHHHHhCCCCCHHH-HHHHHHHHHHhhhcccCccccccccCCCCCCCCC-CCCCCCCCCCC Q lcl|NC_016654. 474 --KTKVAYLHEDWDDER-VQEEADLIDNANTVSAPTFGFGTDQPPLPTENDP-ATDPEAVDEGE 533 (533) Q Consensus 474 --et~v~~l~~~~~dee-~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~d~~ 533 (533) ++++++.++ ..+.+ .++++. -..+... +...... .+......+. .-.+...+..+ T Consensus 277 ~~~~~~~e~~g-ip~p~~~~~~~~-~~~~~~~--~~~~~~~-~~~~~~~~~~~a~~~~a~~~~~ 335 (355) T protein:vir:78 277 ELEKDLRARYG-LPAPAERDDGAD-AAAAKAA--GRRRAKR-LPGQRQGAALPSRSPRADPPRR 335 (355) T ss_pred HHHHHHHHHhC-CCCCCCCCcccC-Ccccccc--ccccccc-cCCccccccccccCCCCCChhh Confidence 345666554 32211 111111 0001000 0000000 0000001111 11111122222 No 248 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=78.19 E-value=0.12 Score=25.69 Aligned_cols=400 Identities=11% Similarity=-0.024 Sum_probs=156.6 Q ss_pred CCCC---CCcCCCcCcc-hHHHHHHHHhhh----Hh-hcCC-HHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCC Q lcl|NC_016654. 1 MSLP---EANTAWPPPE-LAAVTARVAESH----VW-WEGD-LDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTAT 70 (533) Q Consensus 1 ~~~~---~~~~~~pp~~-~~~~~~~~~~~~----~w-~~gd-~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 70 (533) |.-= .+|.|-+-+. -.+....+...+ .+ +.|- |+... .+.++... ...+.++. T Consensus 1 ~~~~i~~~~g~~~~~~~~~~~~~~~ia~~~~~~~~~~~~~~~p~~~~-il~~~~~~---------~~~y~~m~------- 63 (491) T protein:vir:79 1 MSKGLWVSPTEFVKFGEPDKSLSSQIATRARSIDFFALGMYLPNPDP-VLKALGKD---------IRVYRELR------- 63 (491) T ss_pred CCCeeeCCCCCcccccccchhHHHHHhhhccccccccccccCcchhH-HHhhccCC---------HHHHHHHh------- Confidence 3210 1222222211 133444444332 22 2221 11111 11111110 01122221 Q ss_pred CcccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCC Q lcl|NC_016654. 71 GRAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIA 150 (533) Q Consensus 71 g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~ 150 (533) + ..-+.. ++++.-.-+.+.+-.|....+++...+.+.++++.-.|...+...+ .|..+|-+++-+.|+.+++ T Consensus 64 --~----D~~i~s-~l~~Rk~av~~~~w~i~~~~~~~~~a~~i~e~l~~~~~~~~i~~~l-da~~~G~s~~Ei~w~~~~g 135 (491) T protein:vir:79 64 --A----DAHVGG-CVRRRKAAVKALEWGLDRGKAKSRVAKSIADVFADLDLSRIATEML-DAVLYGYQPMEITWGKVGN 135 (491) T ss_pred --h----ChHHHH-HHHHHHHHHhCCCcEEecCCCCHHHHHHHHHHHhcCCHHHHHHHHH-HhhhhcceeEEEEEeecCC Confidence 0 111222 2333344455666666655555566788999988878888887764 5888999999888876432 Q ss_pred Cce---EEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCC-cccceeehhhccccc Q lcl|NC_016654. 151 DNA---WIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTAT-SLGWMMALTDHPATR 226 (533) Q Consensus 151 ~~~---~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~-~lG~~v~l~~~~~~~ 226 (533) .+ .|.++++..|.+. ..+++ . +...++ .-|.++ T Consensus 136 -~~~~~~l~~r~~~~f~~d-~~~~l----------------~----------------l~~~~~~~~g~~l--------- 172 (491) T protein:vir:79 136 -YIVPIDVVGKPADWFVYD-PENQL----------------R----------------FRSKEHWVQGEEL--------- 172 (491) T ss_pred -eeeEEeeeeecccceeec-cCCce----------------E----------------EeecCCCCCceee--------- Confidence 22 2333333322210 01110 0 111000 001111 Q ss_pred cccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHH-hCccee Q lcl|NC_016654. 227 DIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFR-IGAGKV 305 (533) Q Consensus 227 ~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~-~~~~~i 305 (533) .+.-|+.|+.... ...|+|.+.+..+. ...--=...+..|+.=++ .|-+.. T Consensus 173 ------------------p~~k~i~~~~~~~---------~g~p~g~gLl~~~~-w~~~fK~~~~~~w~~f~E~~G~P~~ 224 (491) T protein:vir:79 173 ------------------PARKFLVPRQEAT---------YLNPYGFPDLSMCF-WPTTFKKGGLKFWVQFTEKYGSPML 224 (491) T ss_pred ------------------cCCCeEEEEecCC---------CCCcccchhHHHHH-HHHHHHHhhHHHHHHHHHHcCCCeE Confidence 0112444443221 13567777776643 222122333444444343 343322 Q ss_pred eechHHhcCCCCccccccCcchhhhhhcccccccccc--ccccceeeechhh---hhHHHHHHHHHHHHHHHHhhCCChh Q lcl|NC_016654. 306 HASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG--DMETIFEFFQPAI---RVLEHDQGAALLLREVLRKTGYSPV 380 (533) Q Consensus 306 ~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~--~~~~~i~~~~~~i---r~e~~~~~l~~~l~~i~~~~g~s~~ 380 (533) +. + +..+.. -+....+..++..-..++++ +....|+.++... ..+.|.+.++.+=++|+..+ ++ + T Consensus 225 ig-----k-y~~~a~--~~ek~~l~~al~~~~~~a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~i-LG-q 294 (491) T protein:vir:79 225 VG-----K-HPRSAS--DAETNLLLDRLEDMVQDAVAVIPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIAL-LG-Q 294 (491) T ss_pred EE-----e-cCCCCC--HHHHHHHHHHHHHHhcCeEEEecCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHH-hh-h Confidence 21 1 111110 01112222222211111111 1122344443321 22335555554444554443 22 2 Q ss_pred hcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHH Q lcl|NC_016654. 381 SLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQ 460 (533) Q Consensus 381 ~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~ 460 (533) +++-+.+|..+..++...- ....++.-.+.+...|.++++-++.+ |... ...+.+.|.+.- ....+.++ T Consensus 295 tlTt~~~gs~a~~~vh~~v--~~~i~~~D~~~i~~tln~li~~l~~~------N~~~--~~~p~f~~~e~e-e~~~~~a~ 363 (491) T protein:vir:79 295 NQTTEATSTRASAQAGLEV--TDDIRDGDKAIVVEAMNMLIRWICDL------NFDG--AARPVFDMWEQE-QVDEIQAG 363 (491) T ss_pred hhccCcccchhhHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHh------cCCC--CCcceEeecCcC-chhHHHHH Confidence 2332333333333333222 22334444666677888888876654 2222 223456665432 22356789 Q ss_pred HHHHHHhCCC-CCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 461 TVQAWSVASA-ASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 461 ~~~~l~~aGi-~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) .+++++..|+ ++.+ .+++.++ ..+.+..++..... .+...... .......+..+..+ T Consensus 364 ~~~~L~~~G~~i~~~-~~~e~~G-ip~~~~~e~~~~~~------~~~~~~~~--------~~~~~~~~~~~~~d 421 (491) T protein:vir:79 364 RDEKLTRAGARFTPA-YFKRAYN-LQDGDLDERPLPVS------AVDAVGAA--------SFAEFEAPDQDALD 421 (491) T ss_pred HHHHHHhCCCccCHH-HHHHHhC-CCCCCCCccccCcC------cccccccc--------cccccCCCCCcchH Confidence 9999999998 5544 5655554 43322111111000 00000000 00000111111111 No 249 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=76.87 E-value=0.13 Score=25.43 Aligned_cols=447 Identities=13% Similarity=0.106 Sum_probs=180.8 Q ss_pred hHHHHHHHHhhhHhhcCCHHHHHHHHhcc--CcchhhHHH-----------HHHHHHHHHHhcccCCCCCcc----cce- Q lcl|NC_016654. 15 LAAVTARVAESHVWWEGDLDKLATFYGAE--GRTSPSGIK-----------ARTKAAYEAFHGRTPTATGRA----PKR- 76 (533) Q Consensus 15 ~~~~~~~~~~~~~w~~gd~~~l~~~y~~~--~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~g~~----~~~- 76 (533) ++-.-...+.-.=|-.=|-.+..+--+.. ....+.... ..+.++.+.++.+........ +.+ T Consensus 1 ~~~~~~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR 80 (524) T protein:vir:10 1 MANFNTILSFLKPWANEDEKEYKQQINNNLESVTAPKLDDGAREIETQEQNIPYNALMQQMFGSNEPEVKNTRELIDTYR 80 (524) T ss_pred CCchhhHHHHhhhhhcchhhhhhhhhccCCCccccCCCCCCceeeccCcccccchhhhhhhhhcccchhhhHHHHHHHHH Confidence 22111122222222221111111111000 000000000 001111111111111000000 000 Q ss_pred -e-ecChHHHHHHHHHHh-hc----CCCceEeeCCCc--h----HHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEE Q lcl|NC_016654. 77 -Y-HAPIPGVIAKLSTTE-LF----SEQLKFLDAGKS--K----EVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRI 143 (533) Q Consensus 77 -~-~~n~~k~i~~~~a~l-l~----~e~~~i~~~~~~--~----~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~ 143 (533) + ..+-+.-.++..++= +. .+|+++.+++-+ + ...+..+.|++-=+|++..++......+.|..|+|. T Consensus 81 ~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~Ld~~~~s~siK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHk 160 (524) T protein:vir:10 81 NLMNNYEVDNAVQEIVSDAIVYEDDKEVVALNLDGTDFSQSIKDKILAEFSEVLNLLNFQRKGTDHFQRWYVDSRIFFHK 160 (524) T ss_pred HHhhccchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeceEEEEE Confidence 0 112222223333321 11 125555553322 1 244556667777789999999999999999999999 Q ss_pred EEcCC--CCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeec--CCceEEEEEEEecCeeEEEEEEe-ccCC-c-ccce Q lcl|NC_016654. 144 VWDPT--IADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGG--DGQEVWRHLERHESGYIVHAVYK-GTAT-S-LGWM 216 (533) Q Consensus 144 ~~D~~--~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~--~~~~~y~~lE~h~~~~I~~~~y~-~~~~-~-lG~~ 216 (533) .+|.+ ..+=..+..++|.++-++. ++.+. ++..+++ +.-.|.+|. ++.. . -|.- T Consensus 161 iid~~~pk~GI~Elr~lDPr~i~~vr------------~i~~~~~~~~~vi~-------~~~e~f~Y~~~~~~~~~~~~~ 221 (524) T protein:vir:10 161 IINPKKMKDGVQELRRLDPRQVQYIR------------EIVTRMEDGVKIVD-------GYREFFVYDTGHESYCADGRI 221 (524) T ss_pred EeeCCCccccceeeeeeCCccceeee------------eecccCcccchhhc-------chhhheeecCCCcccccCcce Confidence 99843 1233456667777665542 22111 1111121 112233332 1000 0 0000 Q ss_pred eehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHH--H Q lcl|NC_016654. 217 MALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSS--L 294 (533) Q Consensus 217 v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~--~ 294 (533) .+. +....| +.-.+.|..-+. ..+.+ ..=.|-+-.||+++ ..|=-+-+. + T Consensus 222 ~~~----------------~~~ikI----~~dAIvy~~SGL-----~d~~~--~~i~syLhkAiKp~-NQLkm~EDAlVI 273 (524) T protein:vir:10 222 YSA----------------GTKVKI----PRAAVVYAHSGL-----LDCCG--KNIIGYLQRAIKPA-NQLKLMEDAMVI 273 (524) T ss_pred ecC----------------Ccceec----chhheeeeccCc-----ccCCC--CceeccchHhhHHH-HhhHHHHhhHHH Confidence 000 000000 000122221110 00000 00012233344432 222111111 2 Q ss_pred HHHHHhCcceeee-c---------hHHhc-------C---CCCccccccCcchhhhhhcc--ccccccccccccceeeec Q lcl|NC_016654. 295 MRDFRIGAGKVHA-S---------ESVLT-------N---LGMGQGVSLDEEQEVYSRVG--SGGFNANGDMETIFEFFQ 352 (533) Q Consensus 295 ~~~~~~~~~~i~v-~---------~~~l~-------~---~~~~~~~~~d~~~~~~~~~~--~~~~~~~~~~~~~i~~~~ 352 (533) .|.-|.-.+|||- . +.+++ + +....|. ...+.....++. +.+--.|+ ....|+++. T Consensus 274 YRitRAPeRRvFYIDVGnlPk~KAeqYl~~im~k~kNKlvYDa~TGe-v~ddrk~msMlEDyWLpRReGg-rgTEItTLp 351 (524) T protein:vir:10 274 YRITRAPDRRVFYIDTGNMPSRKAAAQMQHIMNTMKNRVVYDASTGK-IKNQQHNMSMTEDYWLQRRDGK-AVTEVDTMP 351 (524) T ss_pred HhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeccCCe-eccchhhhhhHhhhcccccCCC-Cccceeecc Confidence 2334666777763 1 11111 0 0111111 111111000000 01111122 222344443 Q ss_pred hhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCC-C--cchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 353 PAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSD-E--VAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDA 429 (533) Q Consensus 353 ~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~-~--~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~ 429 (533) ..-.. .-+..+..+.+.+....++|.+.++.+. + +.--++||...+-.--.-+.+.+..|..-|.++++.-|.|.. T Consensus 352 Ggqnl-gem~DV~YF~kkLy~aLnVP~sRl~~e~~~~f~~gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKg 430 (524) T protein:vir:10 352 GATGM-SDMDDVLYFRTALYRALRIPESRIPSESNSGVMFDAGTAITRDELKFAKWIRQLQNKFEEIFLDPLKTNLILKK 430 (524) T ss_pred ccCCc-ChHHHHHHHHHHHHHHhCCCchhccCCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Confidence 32222 2355667777888999999988885332 1 122456776666666677888888888888888887765522 Q ss_pred hhccCCCCC--CceeEEEEeCCCCCCCHHHHH-------HHHHHHH--hCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHH Q lcl|NC_016654. 430 IKFPGKGAA--PSEELELEWPKFARESDLAKA-------QTVQAWS--VASAASTKTKVAYLHEDWDDERVQEEADLIDN 498 (533) Q Consensus 430 ~~~~~~~~~--~~~~v~i~f~d~i~~d~~e~a-------~~~~~l~--~aGi~S~et~v~~l~~~~~dee~~~El~rI~~ 498 (533) .. ..... ....+.++|...---.+...+ ..++++. -+...|.++..++.. -.+|+|.++|.++|++ T Consensus 431 ii--t~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~IL-r~tDeei~~~~k~I~~ 507 (524) T protein:vir:10 431 II--TEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRINMLTMAEPFIGKYISHQTAMKDFL-QMTDEEINQEAKQIEE 507 (524) T ss_pred CC--CHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHh-ccCHHHHHHHHHHHHH Confidence 11 00001 124577777654333333332 3333332 122569998776654 4899999999999999 Q ss_pred hhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 499 ANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 499 E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) |....- + +.++ +++.| T Consensus 508 E~k~~~--~-------~~~~----------~~~~~ 523 (524) T protein:vir:10 508 ESKEAR--F-------QNPD----------EEEED 523 (524) T ss_pred HhhcCC--C-------CCCC----------hhhhc Confidence 963210 0 0000 00001 No 250 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=75.96 E-value=0.14 Score=25.25 Aligned_cols=467 Identities=12% Similarity=0.057 Sum_probs=191.2 Q ss_pred CCCCCCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP 80 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n 80 (533) +.. ...++=||-.-+++ ..+. .. |.|. ..+++.++.. .. ...++..|+.+. .. + T Consensus 13 ~~~-~~~S~vpp~~~~~~-~~i~--~g-~~g~---~v~~~g~~~~---~n-~~eLI~~YR~ma-~~-------------p 66 (564) T protein:vir:10 13 EGQ-KGQSPVPPNDEASV-STVA--GG-YFGT---YVDTSGGQNS---RN-EYELIRRYRDMS-LH-------------P 66 (564) T ss_pred ccC-CCCCcccCCcCCCh-hhhh--cc-ccce---eeecccccch---hh-HHHHHHHHHHHh-hc-------------c Confidence 433 33345566554432 2221 11 1111 1112221100 00 111222222220 01 1 Q ss_pred hHHHHHHHHHH-hhcC----CCceEeeCCCc------hHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCC Q lcl|NC_016654. 81 IPGVIAKLSTT-ELFS----EQLKFLDAGKS------KEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTI 149 (533) Q Consensus 81 ~~k~i~~~~a~-ll~~----e~~~i~~~~~~------~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~ 149 (533) -+.-.++..++ .++. +|+++..++.. +...+..+.|++-=+|++..++......+.|..|||..+|.+- T Consensus 67 EVd~Av~eIVneaIv~d~~~~pV~vdL~~~~~s~siK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~ 146 (564) T protein:vir:10 67 EVDSAIDEIVNEFVVNDGDDKPVEVDLQNLEIGSGVKKKIRDEFNRILRMMNFNVNAHEIIRNWYVDGRSHYHKVIDLDN 146 (564) T ss_pred chhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeeCCC Confidence 11222222222 1222 24555443211 1244556667777789999999999999999999999998431 Q ss_pred --CCceEEEEEcCCeEEEEEecC-----CceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhc Q lcl|NC_016654. 150 --ADNAWIDFVDADRAIPEFRWG-----RLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDH 222 (533) Q Consensus 150 --~~~~~i~~v~~~~~~P~~~~g-----~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~ 222 (533) .+=..+.+++|.++-+++..- ..++|.- +.. +-+...+...|.+|....-. |.. +. T Consensus 147 pk~GI~eLr~lDPr~i~~vr~i~~~~~~~~~~v~k------~~~------~~~~y~~~~Eyy~Ynp~~~~-g~~-~~--- 209 (564) T protein:vir:10 147 PKKGILELRYIDSLKIRKVRQKLKDVDPNRKEIEK------GTA------LQYDYGDFIEYYIYNPKGFA-GNI-PM--- 209 (564) T ss_pred hhhhhhhhhhhcccceeeeeeeccccccccceeee------eee------eeccccccccceeecccccc-Ccc-cc--- Confidence 223346678888777775211 1111110 000 00001112233444322100 000 00 Q ss_pred cccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHH--HHHHHHh Q lcl|NC_016654. 223 PATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSS--LMRDFRI 300 (533) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~--~~~~~~~ 300 (533) ..+.. +...+....++ .-.+.|..-+.-..+ .+.=.|-+-.||+++ ..|=-+-+. +.|.-|. T Consensus 210 --~~~~~--~~~~~~~ikI~----~daI~y~hSGL~d~~-------~~~i~gyLhkAIKp~-NQLkmlEDAlVIYRitRA 273 (564) T protein:vir:10 210 --VTGSM--DWSNQEGIKIA----SDAIAQSTSGLMDLN-------KKMTLSFLHKAIKSL-NQLRMIEDSLVIYRLSRA 273 (564) T ss_pred --ccccc--ccccccceeec----hhhcceecccceeCC-------CCceeccchhhhHhH-HhhHHHHhhHHHHhhhcc Confidence 00000 00000000000 111222221110000 000112222344432 222111111 2233466 Q ss_pred Ccceeee-c---------hHHhcC----------CCCccccccCcchhhhhhcc---ccccccccccccceeeechhhhh Q lcl|NC_016654. 301 GAGKVHA-S---------ESVLTN----------LGMGQGVSLDEEQEVYSRVG---SGGFNANGDMETIFEFFQPAIRV 357 (533) Q Consensus 301 ~~~~i~v-~---------~~~l~~----------~~~~~~~~~d~~~~~~~~~~---~~~~~~~~~~~~~i~~~~~~ir~ 357 (533) -.+|||- . +.+++. +....|..-| ++.|..+. +.+--.|+ ....|+++...-. T Consensus 274 PeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGevrd--drk~msMlEDyWLPRReGg-rgTEItTLpGgqn- 349 (564) T protein:vir:10 274 PERRIFYIDVGNLPKVKAEQYLRDVMSRYRNKLVYDGQTGEIRD--DKKHMSMLEDFWLPRREGG-RGTEITTLPGGQN- 349 (564) T ss_pred ccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceecc--cchhhhhHhhhcccccCCC-cccceeeccccCC- Confidence 6777763 1 111110 0111111111 11111100 01111122 2223444433222 Q ss_pred HHHHHHHHHHHHHHHHhhCCChhhcccCCC--cchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCC Q lcl|NC_016654. 358 LEHDQGAALLLREVLRKTGYSPVSLGLSDE--VAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGK 435 (533) Q Consensus 358 e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~--~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~ 435 (533) -.-+..+..+.+.+....++|.+.+..+++ ..--++||...+-.--.-+.+.+..|..-|.++++.-|.|.... .. T Consensus 350 Lgem~DV~YF~kKLY~aLnVP~SRl~~e~~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgii--t~ 427 (564) T protein:vir:10 350 LGELKDVEYFKKKLYNSLNLPPSRLTDDNKAFNLGKSTEILRDELKFTKFIGRLRKRFAQLFHDILKTQLILKGII--TP 427 (564) T ss_pred cchHHHHHHHHHHHHHHhCCCcccccCCCceeecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCC--CH Confidence 123556677778888889998888875532 11234566666666667788888888888888888776553211 00 Q ss_pred CCC--CceeEEEEeCCCCCCCHHHHHH-------HHHHH--HhCCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcc- Q lcl|NC_016654. 436 GAA--PSEELELEWPKFARESDLAKAQ-------TVQAW--SVASAASTKTKVAYLHEDWDDERVQEEADLIDNANTVS- 503 (533) Q Consensus 436 ~~~--~~~~v~i~f~d~i~~d~~e~a~-------~~~~l--~~aGi~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~- 503 (533) ... ....+.++|...---.+...++ .++++ .-+...|.++..++.. -.+|+|.++|.++|++|.... T Consensus 428 eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~IL-r~tDeei~~~~kqI~~E~k~~~ 506 (564) T protein:vir:10 428 EDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVGKYFSTEYIRRKIL-MQTENEFKEIDKQMKSDIESGL 506 (564) T ss_pred HHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHh-ccCHHHHHHHHHHHHHHhhcCC Confidence 001 1245777776443333332222 22222 1123569998777654 489999999999999996421 Q ss_pred --cC-------cccc-ccccCCC-----------CCCCCCCC--C-CCCCCCCC Q lcl|NC_016654. 504 --AP-------TFGF-GTDQPPL-----------PTENDPAT--D-PEAVDEGE 533 (533) Q Consensus 504 --~~-------~~~~-~~~~~~~-----------~~~~~~~~--~-~~~~~d~~ 533 (533) +| ++.. ++..+|. +++.-.++ + ++..+-++ T Consensus 507 ~~~P~e~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 560 (564) T protein:vir:10 507 AIDPIQVNMLDDMEKQNQAFAPELQAAQDDLAAEREIKKLNSAPKPPPSQQSKS 560 (564) T ss_pred CCCchhhhcCCCccCCCCcCCcchhhhccccccccChhhhccCCCCCCCCCCcC Confidence 11 1111 1111111 11100000 1 11111112 No 251 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=75.18 E-value=0.15 Score=25.11 Aligned_cols=439 Identities=10% Similarity=0.035 Sum_probs=185.8 Q ss_pred hHHHHHHHHhhhHhhcCCHHHHHHHHhccC--cchhhHHHHHH---------HHHHHHHhcccCCCCCcc-------cce Q lcl|NC_016654. 15 LAAVTARVAESHVWWEGDLDKLATFYGAEG--RTSPSGIKART---------KAAYEAFHGRTPTATGRA-------PKR 76 (533) Q Consensus 15 ~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~--~~~~~~~~~~~---------~~~~~~~~~~~~~~~g~~-------~~~ 76 (533) +=++... -..|..=+..++++-.+... ...|....+.+ ..+++-+..-+-...+.. +.+ T Consensus 1 ~~~~l~~---~~~~~~~~~~~~~~~~~~~~~s~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~~~~~~~eLI~~Y 77 (521) T protein:vir:81 1 MFSRLKM---LARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNLPASAWNSLTQQFYSTDQKISTTKQLVNTY 77 (521) T ss_pred Ccchhhh---hHhhcCchhhhHHhhhccCccccccCCCCCCceEecccCCCcceeecceeeeecccccchhhHHHHHHHH Confidence 2222333 33444444445443222110 00111111000 000000000000000000 000 Q ss_pred --e-ecChHHHHHHHHHHh-hc----CCCceEeeCCC--ch----HHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEE Q lcl|NC_016654. 77 --Y-HAPIPGVIAKLSTTE-LF----SEQLKFLDAGK--SK----EVQARADLIFNTPRFHSSLVEAGESCSALSGSFQR 142 (533) Q Consensus 77 --~-~~n~~k~i~~~~a~l-l~----~e~~~i~~~~~--~~----~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~ 142 (533) + ..+-+.-.++..++= +. .+|+++.+++. ++ ...+..+.|++-=+|++..++......+.|..|++ T Consensus 78 R~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fh 157 (521) T protein:vir:81 78 RGLMNNHEVENAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFH 157 (521) T ss_pred HHHhhccchhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEE Confidence 0 112222233333321 11 12555555322 11 23455666777778999999999999999999999 Q ss_pred EEEcCCCC-CceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCC---------- Q lcl|NC_016654. 143 IVWDPTIA-DNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTAT---------- 211 (533) Q Consensus 143 ~~~D~~~~-~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~---------- 211 (533) ..+|++.. +=..+..++|.++.++...-+ ...++..++ +++..|.+|.-... T Consensus 158 kiid~~pk~GI~Elr~lDPr~i~~vr~i~k----------~~~~~~~v~-------~~~~e~f~Y~~~~~~~~~~g~~~~ 220 (521) T protein:vir:81 158 KIIGKNPKDGIVELRQLDPRNLEYVREIIT----------EDTPEGKIY-------KATKEYFIYTVGNSSYCAGGQVFS 220 (521) T ss_pred EEEcCCccccceeeeeeCCcceeeeeeecc----------cccCcccee-------cceeeeeeeecCCccccccceeec Confidence 99985533 334566788887776632100 000111111 12223333321100 Q ss_pred -cccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHH Q lcl|NC_016654. 212 -SLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRI 290 (533) Q Consensus 212 -~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~ 290 (533) +.+.+++-+. +.|+.-.. ..+. .+.=.|-+-.||+++ ..|=-+ T Consensus 221 ~~~~vkI~~dA----------------------------I~y~hSGl-----~d~~--~~~i~syLhkAiKp~-NQLkm~ 264 (521) T protein:vir:81 221 PNSRVKIPRSA----------------------------ITYAHSGL-----MDCD--DKYIIGYLHRAVKPA-NQLKLL 264 (521) T ss_pred CCcceeechhh----------------------------eeeeeccc-----eeCC--CCeeeecchhhhHhH-HhhHHH Confidence 0111111111 11111100 0000 000112233344432 222111 Q ss_pred HHH--HHHHHHhCcceeee-c---------hHHhc-------C---CCCccccccCcchhhhhhcc---ccccccccccc Q lcl|NC_016654. 291 YSS--LMRDFRIGAGKVHA-S---------ESVLT-------N---LGMGQGVSLDEEQEVYSRVG---SGGFNANGDME 345 (533) Q Consensus 291 ~s~--~~~~~~~~~~~i~v-~---------~~~l~-------~---~~~~~~~~~d~~~~~~~~~~---~~~~~~~~~~~ 345 (533) -+. +.|.-|.-.+|||- . +.+++ + +....|..-| + +.+..+. +.+--.|+ .. T Consensus 265 EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~im~k~kNklvYDa~TGev~d-d-rk~msMlEDyWLpRReGg-rg 341 (521) T protein:vir:81 265 EDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKN-Q-QANLSMTEDYWLQRRDGK-AI 341 (521) T ss_pred HhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccc-c-ccccchhhhhcccccCCC-cc Confidence 111 22334666777763 1 11111 0 0011111111 1 1110000 01111122 22 Q ss_pred cceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCC-c--chhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 346 TIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDE-V--AQTATEASGKKDLTVKTTRAKARHFGSALGPLST 422 (533) Q Consensus 346 ~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~-~--~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~ 422 (533) ..|++....-.. .-+..+..+.+.+....++|.+.++..++ + .--++||...+-.--.-+.+.+..|..-|.++++ T Consensus 342 TEItTLpGgqnl-gem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~ 420 (521) T protein:vir:81 342 TDVTTLPGASGM-SDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIRTRQSQFSEVLRDPLK 420 (521) T ss_pred cceeecccCCCC-ChHHHHHHHHHHHHHHhCCccccccCCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 234544432222 23556677778899999999888853322 1 1235667666666667788888888888888888 Q ss_pred HHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHH-------HHHHHHHHh--CCCCCHHHHHHHhCCCCCHHHHHHHH Q lcl|NC_016654. 423 TCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAK-------AQTVQAWSV--ASAASTKTKVAYLHEDWDDERVQEEA 493 (533) Q Consensus 423 ~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~-------a~~~~~l~~--aGi~S~et~v~~l~~~~~dee~~~El 493 (533) .-|.|....-..........+.++|...---.+... +..++++.- +...|.++..++.. -.+|+|.++|. T Consensus 421 ~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~IL-r~tDeei~~~~ 499 (521) T protein:vir:81 421 YNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDIL-KYTDDQMDTEK 499 (521) T ss_pred HhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHh-ccCHHHHHHHH Confidence 776552211000000112347777765433333322 233333321 22569998776654 48999999999 Q ss_pred HHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 494 DLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 494 ~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ++|++|....- + +. +.++.++ T Consensus 500 k~I~~E~~~~~--~-------~~----------p~~~~~~ 520 (521) T protein:vir:81 500 KQIEEEANDPR--F-------KQ----------TPDEIED 520 (521) T ss_pred HHHHHHhhCCC--C-------CC----------CcccccC Confidence 99999974211 0 00 0111111 No 252 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=74.50 E-value=0.16 Score=24.98 Aligned_cols=419 Identities=11% Similarity=-0.000 Sum_probs=165.3 Q ss_pred CCCC--CCcCCCcCcchHH-HHHHH-HhhhHhh----cC-CHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCC Q lcl|NC_016654. 1 MSLP--EANTAWPPPELAA-VTARV-AESHVWW----EG-DLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATG 71 (533) Q Consensus 1 ~~~~--~~~~~~pp~~~~~-~~~~~-~~~~~w~----~g-d~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 71 (533) ||-= --|.|.+=+.+.. ....+ ..++.|- +| +|.+|.+..+.+..--..++-.++. .+.. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iLr~a~~gd~~~~~~L~e----~m~e------- 69 (526) T protein:vir:99 1 MAQIVDVYGNPIRTQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFM----DMEE------- 69 (526) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCcCCCCHHHHHHHHHhhhCCCHHHHHHHHH----HHHh------- Confidence 3311 2233444222211 11111 1223321 12 4566666555433211122211211 1100 Q ss_pred cccceeecChHHHHHHHHHHhhcCCCceEeeCCC----chHHHHHHHHHHhhc-cHHHHHHHHHHHHhhhCCEEEEEEEc Q lcl|NC_016654. 72 RAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGK----SKEVQARADLIFNTP-RFHSSLVEAGESCSALSGSFQRIVWD 146 (533) Q Consensus 72 ~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~----~~~~~~~l~~i~~~n-~f~~~~~~~~~~~~~~G~~~~~~~~D 146 (533) + ..-+...+ .+--.-+.+.+-.|....+ +....+.+++++.+- +|...+...+ .|..+|-.++-+.|+ T Consensus 70 -~----D~~i~s~l-~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~l-da~~~G~s~~Eivw~ 142 (526) T protein:vir:99 70 -R----DAHLFAEM-SKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDAL-DGIGHGYSCIELEWA 142 (526) T ss_pred -h----ChHHHHHH-HHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcccCHHHHHHHHH-HhhhhcceeEEEEEe Confidence 0 11122222 2333344555545543211 234456778777653 5777777655 688899999988887 Q ss_pred CCCCCceE---EEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcc Q lcl|NC_016654. 147 PTIADNAW---IDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHP 223 (533) Q Consensus 147 ~~~~~~~~---i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~ 223 (533) .+++ ... +.+.++..|. +...++. .+. ++.. ..-|.++ T Consensus 143 ~~~g-~~~~~~l~~r~~~~f~----------------~~~~~~~------------~l~---~~~~-~~~g~~l------ 183 (526) T protein:vir:99 143 LQGR-EWMPLAFHHRPQSWFQ----------------LNPEDQN------------ELR---LRDN-SPAGEAL------ 183 (526) T ss_pred ecCC-ceeEEEeeeeccccee----------------eccCCCc------------EEE---ecCC-CCCceee------ Confidence 6432 111 2222222111 0001110 000 1110 0001111 Q ss_pred ccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHH-HHHHHHHHHHHHHHHHH-hC Q lcl|NC_016654. 224 ATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFP-TFHELDRIYSSLMRDFR-IG 301 (533) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~-lid~lD~~~s~~~~~~~-~~ 301 (533) .+.-|++|+.... ...|+|.+.+..+.-. +++ ...+..|+.=++ .| T Consensus 184 ---------------------~~~k~i~~~~~~~---------~g~p~g~gLlr~~~w~~~fK--~~~~~~w~~f~E~yG 231 (526) T protein:vir:99 184 ---------------------QPFGWIIHRPRAR---------SGYVARSGLFRVLAWPYLFR--HYATSDLAEMLEIYG 231 (526) T ss_pred ---------------------cCCCeEEEeecCC---------cCCccccchHHHHHHHHHHH--HhhHHHHHHHHHHcC Confidence 0112344443221 1356677776654322 222 234444444343 34 Q ss_pred cceeeechHHhcCCCCccccccCcchhhhhhcccccccccc--ccccceeeechh-hhhHHHHHHHHHHHHHHHHhhCCC Q lcl|NC_016654. 302 AGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG--DMETIFEFFQPA-IRVLEHDQGAALLLREVLRKTGYS 378 (533) Q Consensus 302 ~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~--~~~~~i~~~~~~-ir~e~~~~~l~~~l~~i~~~~g~s 378 (533) -+..+. .+..+.. -+....+..++..-..++++ .....|+.++.. ...+.|.+.++.+=++|+..+ ++ T Consensus 232 ~P~~ig------ky~~~a~--~~ek~~L~~av~~i~~d~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i-LG 302 (526) T protein:vir:99 232 LPIRLG------KYPPGTA--DEEKATLLRAVTGLGHAAAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV-LG 302 (526) T ss_pred CceEEE------ecCCCCC--HHHHHHHHHHHHHHhhCcEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH-hh Confidence 333222 1111111 01122222322211111111 112234444432 233445555665555665544 33 Q ss_pred hhhcccC----CCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCC Q lcl|NC_016654. 379 PVSLGLS----DEVAQTATEASGKKDLTVKTTRAKARHFGSALG-PLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARE 453 (533) Q Consensus 379 ~~~~g~~----~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~-~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~ 453 (533) +|++.+ .+|.....++... -....++.-.+.+...|. +|++.++.+. + +........+.+.|+..-++ T Consensus 303 -qtlTs~~~~g~~gS~a~g~vh~~--v~~di~~aDa~~i~~tln~~Li~~l~~~N---~-~~~~~~~~~p~~~~~~~e~e 375 (526) T protein:vir:99 303 -GTLTSTTSQSGGGAFALGQVHNE--VRHDLLASDARQLAATLSRDLLWPLLVLN---R-PGSPDVRRAPRLVFDLREQA 375 (526) T ss_pred -hhhccccccCcchhhhHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHhC---C-CCcCCccccceEEeCCCCcc Confidence 233221 1122222222211 122234445566667774 5877776651 1 11122345678899999999 Q ss_pred CHHHHHHHHHHHHhCCC-CCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 454 SDLAKAQTVQAWSVASA-ASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEG 532 (533) Q Consensus 454 d~~e~a~~~~~l~~aGi-~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 532 (533) |..+.++.+.+|+..|+ +|.+. +++.++ ..+.+..+++- .... .+.... ..++...........+..++- T Consensus 376 Dl~~~a~~~~~L~~~G~~i~~~~-i~e~~G-ip~~~~~e~~l---~~~~--~~~~~~--~~~~~~~~~~~~~~~~~~~~~ 446 (526) T protein:vir:99 376 DITSMAQSIPALVNVGLEIPSAW-VYDKLG-IPQPAKNEPVL---RSAA--QPAILS--RQHGQRVAALATIVGPRYGDQ 446 (526) T ss_pred cHHHHHHHHHHHHhCCCccCHHH-HHHHhC-CCCCCCccccc---CCCC--CCcccc--cccccccccccccccccCcch Confidence 99999999999999998 66665 555454 43321111111 0000 010000 000000000011111111111 Q ss_pred C Q lcl|NC_016654. 533 E 533 (533) Q Consensus 533 ~ 533 (533) + T Consensus 447 ~ 447 (526) T protein:vir:99 447 Q 447 (526) T ss_pred h Confidence 1 No 253 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=74.41 E-value=0.16 Score=24.97 Aligned_cols=441 Identities=12% Similarity=0.081 Sum_probs=182.3 Q ss_pred CCC---------------------CCC-cCCCcCcchHHHHHHHHhhhH----hhcCCHHHHHHHHhccCcchhhHHHHH Q lcl|NC_016654. 1 MSL---------------------PEA-NTAWPPPELAAVTARVAESHV----WWEGDLDKLATFYGAEGRTSPSGIKAR 54 (533) Q Consensus 1 ~~~---------------------~~~-~~~~pp~~~~~~~~~~~~~~~----w~~gd~~~l~~~y~~~~~~~~~~~~~~ 54 (533) |-. -.. .++=|| ..+.-...+..... =|.| =.+++|.+...-..+. .. T Consensus 1 m~f~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p-~~dDGa~~i~~~~~~~~~~~~~---~~q~~y~~~e~~~~~~--~e 74 (523) T protein:vir:68 1 MKFNILSLFAPWAKMDERDYKDQEKENLESITSP-KLDDGAKEYEVSENEAQQTYNA---MFQRMFGSQEPGLKST--RE 74 (523) T ss_pred CCCchhhhhhhhhhhhhhhhhhhhhccCCCcccc-CCCCcceeeeccccccccccch---hhhhhhhccccccchH--HH Confidence 111 000 011222 22211111110000 0011 0122333221111111 11 Q ss_pred HHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHh-h----cCCCceEeeCCC--ch----HHHHHHHHHHhhccHH Q lcl|NC_016654. 55 TKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTE-L----FSEQLKFLDAGK--SK----EVQARADLIFNTPRFH 123 (533) Q Consensus 55 ~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~l-l----~~e~~~i~~~~~--~~----~~~~~l~~i~~~n~f~ 123 (533) ++..|+.+. ..+-+.-.++..++= + ..+|+++.++.. ++ ...+..+.|++-=+|+ T Consensus 75 LI~~YR~ma--------------~~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~ 140 (523) T protein:vir:68 75 LIDTYRNLM--------------TNYEVDNAVSEIVSDAIVYEDDTEVVSINLDNTKFSPNIKSMMLDEFNEVLNHLSFQ 140 (523) T ss_pred HHHHHHHHh--------------hccchhhHHHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccc Confidence 122222210 111222223333321 1 123556655432 11 2445566677777899 Q ss_pred HHHHHHHHHHhhhCCEEEEEEEcCC--CCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCC--ceEEEEEEEecCe Q lcl|NC_016654. 124 SSLVEAGESCSALSGSFQRIVWDPT--IADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDG--QEVWRHLERHESG 199 (533) Q Consensus 124 ~~~~~~~~~~~~~G~~~~~~~~D~~--~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~--~~~y~~lE~h~~~ 199 (533) +..++......+.|..||+.++|.. ..+=..+..++|.++-++. ++...+. ..+++ + T Consensus 141 ~~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr------------~i~~~~~~g~~vi~-------~ 201 (523) T protein:vir:68 141 RKGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVR------------EVITTTEAGVKIVK-------G 201 (523) T ss_pred hhhhHHHHhheeeeEEEEEEEeeCCCccccceeeeeeCCcceeEEE------------eecCCCCcchhhhh-------h Confidence 9999999999999999999999854 2234456677887665542 2221111 11111 2 Q ss_pred eEEEEEEeccCC--c-ccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchh Q lcl|NC_016654. 200 YIVHAVYKGTAT--S-LGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADL 276 (533) Q Consensus 200 ~I~~~~y~~~~~--~-lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~ 276 (533) +-.|.+|.-... + .|. +. ..+....++ .-.+.|..-+. ..+.+ ..=.|-+ T Consensus 202 ~~e~f~Y~~~~~~~~~~g~-~~---------------~~~~~ikI~----~dAI~y~hSGL-----~d~~~--~~i~gyL 254 (523) T protein:vir:68 202 YKEYFIYDTSHESYACDGR-IY---------------EAGTKIKIP----KAAIVYAHSGL-----VDCCG--KNIIGYL 254 (523) T ss_pred hhhheeecccccccccccc-cc---------------CCCcceecc----hhheeeeeccc-----eeCCC--Cceeccc Confidence 222233321110 0 010 00 000000000 00111221110 00000 0001223 Q ss_pred hhhHHHHHHHHHHHHHH--HHHHHHhCcceeee-c---------hHHhcC----------CCCccccccCcchhhhhhcc Q lcl|NC_016654. 277 STDLFPTFHELDRIYSS--LMRDFRIGAGKVHA-S---------ESVLTN----------LGMGQGVSLDEEQEVYSRVG 334 (533) Q Consensus 277 ~~~i~~lid~lD~~~s~--~~~~~~~~~~~i~v-~---------~~~l~~----------~~~~~~~~~d~~~~~~~~~~ 334 (533) -.||+++ ..|=-+-+. +.|.-|.-.+|||- . +.+++. +....|. ...+.....++. T Consensus 255 hkAiKp~-NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNKlvYDa~TGe-v~ddrk~msMlE 332 (523) T protein:vir:68 255 HRAIKPA-NQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPSRKAAEHMQHVMNTMKNRIAYDATTGK-IKNQQHIMSMTE 332 (523) T ss_pred hhhhHHH-HhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhhcceeEEeccCCe-eccchhhhhhHh Confidence 3344432 222111111 22334666777763 1 111110 0011111 111111000000 Q ss_pred --ccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCc--chhHHHHHHHhhhHHHHHHHHH Q lcl|NC_016654. 335 --SGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEV--AQTATEASGKKDLTVKTTRAKA 410 (533) Q Consensus 335 --~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~--~~Tatai~~~~~~l~~~~~~~~ 410 (533) +.+--.|+ ....|+++...-.. .-+.-+..+.+.+....++|.+.+..+.++ .--++||...+-.--.-+.+.+ T Consensus 333 DyWLpRReGg-rgTEItTLpGgqnl-gem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDEikF~KFI~rLR 410 (523) T protein:vir:68 333 DYWLQRRDGK-AVTEVDTLPGADNT-GNMEDVRWFRNALYMALRIPITRIPSDQGGIQFDAGTSITRDELSFGKFIRELQ 410 (523) T ss_pred hhcccccCCC-cccceeeccccCCc-ChHHHHHHHHHHHHHHhCCcceeecCCCcceecccccchhHHHHHHHHHHHHHH Confidence 01111122 22234544432222 235566777788888889988877433221 1125567666666667788888 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCCCCC--CceeEEEEeCCCCCCCHHHH-------HHHHHHHH--hCCCCCHHHHHHH Q lcl|NC_016654. 411 RHFGSALGPLSTTCLRVDAIKFPGKGAA--PSEELELEWPKFARESDLAK-------AQTVQAWS--VASAASTKTKVAY 479 (533) Q Consensus 411 ~~~~~al~~li~~il~l~~~~~~~~~~~--~~~~v~i~f~d~i~~d~~e~-------a~~~~~l~--~aGi~S~et~v~~ 479 (533) ..|..-|.++++.-|.|.... ..... ....+.++|...---.+... +..++++. -+..+|.+++.++ T Consensus 411 ~rFs~lf~~~Lk~qLilKgii--t~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ 488 (523) T protein:vir:68 411 HKFEEIFLDPLKTNLILKGII--TEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINMLQMAEPFIGKYISHRTAMKD 488 (523) T ss_pred HHHHHHHHHHHHHhhhhccCC--CHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHH Confidence 888888888888776552211 00001 12457777765433333332 23333332 1225699987776 Q ss_pred hCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 480 LHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 480 l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) .. -.+|+|.++|.++|++|....- + +.++ +++.| T Consensus 489 IL-r~tDeei~~~~kqI~~E~k~~~--~-------~~p~----------~e~~~ 522 (523) T protein:vir:68 489 IL-QMSDEEIEQEAKQIEEESKEAR--F-------QDPD----------QEQED 522 (523) T ss_pred Hh-ccCHHHHHHHHHHHHHHhhcCC--C-------CCCc----------hhhhc Confidence 54 4899999999999999963210 0 0000 00000 No 254 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=73.54 E-value=0.17 Score=24.82 Aligned_cols=204 Identities=11% Similarity=0.047 Sum_probs=85.3 Q ss_pred EEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccc Q lcl|NC_016654. 193 LERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLG 272 (533) Q Consensus 193 lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G 272 (533) +.+...|.+.|.........-|..+.+ .+--+.|+++..+. ...+| T Consensus 1 ~r~~~dg~~~y~~~~~~~~~~g~~~~~--------------------------~~~eilH~r~~~~~--------~~~~G 46 (219) T protein:vir:98 1 MRVCKDGNYKYLMKKSLYDTKSEIYEY--------------------------NKNDVIFIKLYDPM--------QQVYG 46 (219) T ss_pred CceeecCeEEEEEecceecCCceeEEe--------------------------ccccEEEecCCCCC--------CCcce Confidence 111222222211100000000111100 00113344432111 12358 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHH-HhCcc-eee--echHHhcCCCCccccccCcchhhhhhcccc-------ccccc Q lcl|NC_016654. 273 RADLSTDLFPTFHELDRIYSSLMRDF-RIGAG-KVH--ASESVLTNLGMGQGVSLDEEQEVYSRVGSG-------GFNAN 341 (533) Q Consensus 273 ~S~~~~~i~~lid~lD~~~s~~~~~~-~~~~~-~i~--v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~-------~~~~~ 341 (533) .|.+..++. .+ .++....++...+ +.|.. .-+ +|...+.. .. .+.-...+...... ....+ T Consensus 47 lspi~~a~~-~i-~~~~aa~~~~~~~f~Ng~~p~gil~~~~~~l~~---e~---~~~~~~~~~~~~g~~n~~~~~l~~~g 118 (219) T protein:vir:98 47 SPDYVGGIT-SA-LLNSDATIFRRRYYSNGAHMGFILYSTDPDMTE---EM---EDEIAERIRDSKGVGNFRSMFVNIAG 118 (219) T ss_pred ecHHHHHHH-HH-HHHHHHHHHHHHHHhcCCCCceEEEeCCCCCCH---HH---HHHHHHHHHHhcCcccccceeEecCC Confidence 887776653 33 3456666666554 44422 211 22111111 00 00001111111000 00111 Q ss_pred c-ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcch---hHHHHHHHhhhHHHHHHHHHHHHHHHH Q lcl|NC_016654. 342 G-DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQ---TATEASGKKDLTVKTTRAKARHFGSAL 417 (533) Q Consensus 342 ~-~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~---Tatai~~~~~~l~~~~~~~~~~~~~al 417 (533) + .....++.++......++++.-+....+|+..-|++|..+|+...+.. +.++... ..++..| T Consensus 119 g~~~G~~~~~~~~~~~d~qfle~rk~~~~eIa~~fgVPp~~lG~~~~~~~~~sn~eq~~~-------------~f~~~tL 185 (219) T protein:vir:98 119 GHPDGLKVIPIGDTGQKDEFANIKNISAQDVLTSHRFPPGLSGIIPVNTAGLGDPLKIRE-------------AYQADEV 185 (219) T ss_pred CCccceeEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcccccCCCCCccCHHHHHH-------------HHHHHHH Confidence 1 112345556666677789998888899999999999999986433222 2332221 2223333 Q ss_pred HHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHH Q lcl|NC_016654. 418 GPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDL 456 (533) Q Consensus 418 ~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~ 456 (533) ..++..+....+..+ .-...+.+.|++..+.|.. T Consensus 186 ~P~~~~ie~~ln~~~-----~~~~~~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 186 LPLQEIIAESINSDY-----EIKSALKVNFKQPEKRDKN 219 (219) T ss_pred HHHHHHHHHHhhhhh-----cCCCccEEeecCcccccCC Confidence 333333322211111 1123467889988888776 No 255 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=65.60 E-value=0.28 Score=23.61 Aligned_cols=294 Identities=11% Similarity=0.051 Sum_probs=110.9 Q ss_pred hcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHH---HHHHhhcCCCceE-eeCC Q lcl|NC_016654. 29 WEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAK---LSTTELFSEQLKF-LDAG 104 (533) Q Consensus 29 ~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~---~~a~ll~~e~~~i-~~~~ 104 (533) |+ .++.+. +....+.+-++.-.. ....+-||+|..+ +..+ T Consensus 1 ~~---------------------------------~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~ 44 (351) T protein:vir:79 1 MS---------------------------------KRRSRA---PRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAE 44 (351) T ss_pred CC---------------------------------CCCCCC---CCCCCCCCchhhhhcccceeEEEEcCCceeecCcch Confidence 00 000000 000000000000000 0011223332111 0000 Q ss_pred ---------Cc-----------------------hHHHH---HHHHHHhhccH--HHHHHHHHHHHhhhCCEEEEEEEcC Q lcl|NC_016654. 105 ---------KS-----------------------KEVQA---RADLIFNTPRF--HSSLVEAGESCSALSGSFQRIVWDP 147 (533) Q Consensus 105 ---------~~-----------------------~~~~~---~l~~i~~~n~f--~~~~~~~~~~~~~~G~~~~~~~~D~ 147 (533) ++ ..+.. .|...+.-|.+ ...+.+++.+.+.+|.+|+.+..|. T Consensus 45 ~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h~~~l~~k~n~l~~~~~Pnp~~t~~~f~~~v~d~ll~Gnay~~~~r~~ 124 (351) T protein:vir:79 45 ILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSSALFFKANVLASTFRPHRWLSRHAFERWALDFLTFGNGYLERRRNM 124 (351) T ss_pred hhhhhhhhhcCceecCCCCHHHHHHHHhhhHhhhhhhhhhhhHHhhcccCCCCCCHHHHHHHHHHHHhcCCeEEEEEECC Confidence 00 00000 00001111111 2225566667777888888877775 Q ss_pred CCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccc Q lcl|NC_016654. 148 TIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRD 227 (533) Q Consensus 148 ~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~ 227 (533) .|. -+.+..++|..+-+..+.+ ..+|+ ... |.++.+. T Consensus 125 ~G~-~~~L~~l~~~~v~~~~~~~----------------~~~~~---------------~~~----g~~~~~~------- 161 (351) T protein:vir:79 125 VGG-TLRLEPALAKYVRRKADFS----------------GFVYV---------------NGW----QERHEFE------- 161 (351) T ss_pred CCC-EEEEEEeCCcceeeeecCC----------------eEEEE---------------ecC----ceEEEEc------- Confidence 542 3455555555443221111 00110 000 1111000 Q ss_pred ccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHH-HhCcceee Q lcl|NC_016654. 228 IAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDF-RIGAGKVH 306 (533) Q Consensus 228 ~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~~~~i~ 306 (533) +--+.|+.+..++ ...+|.|.+..++.. + .++..-+.+.+.+ +.|-..=+ T Consensus 162 -------------------~~eIihir~~~~~--------~~~yGl~~~~~a~~s-i-~l~~~a~~~~~~~f~NGa~pg~ 212 (351) T protein:vir:79 162 -------------------PDSVFQLVRPDIN--------QEVYGLPEYLSSLHS-A-WLNESSTLFRRKYYENGSHAGF 212 (351) T ss_pred -------------------CccEEEeCCCCCC--------CCcccccHHHHHHHH-H-HHHHHHHHHHHHHHhccCCCce Confidence 0012333322111 134688888776643 3 3566677776665 55532211 Q ss_pred ---echHHhcCCCCccccccCcchhhhhhcccccc--------cc-ccccccceeeechhhhhHHHHHHHHHHHHHHHHh Q lcl|NC_016654. 307 ---ASESVLTNLGMGQGVSLDEEQEVYSRVGSGGF--------NA-NGDMETIFEFFQPAIRVLEHDQGAALLLREVLRK 374 (533) Q Consensus 307 ---v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~--------~~-~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~ 374 (533) ++...+... . .+.-...+.... +.. .. +..+...++.++......++.+.-+...+.|+.. T Consensus 213 il~~~~~~ls~e---~---~~~lk~~~~~~~-G~~N~~~~~v~~~~g~~~gi~~~pl~~~~~d~ef~e~k~~s~~eI~~a 285 (351) T protein:vir:79 213 ILYMTDAAQKQD---D---VDNMRDALKNAK-GPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAA 285 (351) T ss_pred EEEecCCCCCHH---H---HHHHHHHHHHhc-CccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHH Confidence 221111100 0 000011111110 000 11 1112223444444556678888888888999999 Q ss_pred hCCChhhcccCCCcc---hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCC Q lcl|NC_016654. 375 TGYSPVSLGLSDEVA---QTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFA 451 (533) Q Consensus 375 ~g~s~~~~g~~~~~~---~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i 451 (533) .|+||..+|...++. .++++. .+..++..|..++..+.++.. .+ + .+ -|.|++.. T Consensus 286 ~~VPp~llGi~~~~t~~~~n~e~~-------------~~~f~~~~l~Pl~~~ie~ln~-~l-g------~~-~~~F~~~~ 343 (351) T protein:vir:79 286 HRVPPQLLGIVPSNSGGFGTPDTA-------------ARVFGRNEIRPLQARFAELND-WL-G------DE-VVTFDDYE 343 (351) T ss_pred hCCCHHHhcccCCCCCCcccHHHH-------------HHHHHHHHHHHHHHHHHHHHh-hc-C------cc-eeeeChhh Confidence 999999998743322 222222 123333444444444444322 11 1 11 25677654 Q ss_pred CCCHHHHH Q lcl|NC_016654. 452 RESDLAKA 459 (533) Q Consensus 452 ~~d~~e~a 459 (533) ...-...+ T Consensus 344 llr~d~~a 351 (351) T protein:vir:79 344 IPPAPVAA 351 (351) T ss_pred hccccccC Confidence 33332222 No 256 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=65.40 E-value=0.28 Score=23.58 Aligned_cols=326 Identities=10% Similarity=0.007 Sum_probs=121.3 Q ss_pred CCCCCCcCCCcCcchHHHHHHHH---hhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhccc--CCCCC---c Q lcl|NC_016654. 1 MSLPEANTAWPPPELAAVTARVA---ESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRT--PTATG---R 72 (533) Q Consensus 1 ~~~~~~~~~~pp~~~~~~~~~~~---~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~g---~ 72 (533) |+=....-+=.+++......... .-..+=-|||+-. ..++ .+..+...|+.+. .+|.. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v---~~~~----------~~~~~~~~~~~~~~~~pp~~~~~l 67 (351) T protein:vir:78 1 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPV---MNRA----------EILDYVECWSNGEWFEPPVSFAGL 67 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhhcccceeEEEEcCCceee---cCcc----------hhhhhhhhhccCceecCCCCHHHH Confidence 33221111111111000000000 0001111333200 0000 0000111111110 00000 0 Q ss_pred ccceeecChHHHHHHHHHHhhcCCC-ceEeeCCCchHHHHHHHHHHhhccHHHHHHHHHHHHhhhCCEEEEEEEcCCCCC Q lcl|NC_016654. 73 APKRYHAPIPGVIAKLSTTELFSEQ-LKFLDAGKSKEVQARADLIFNTPRFHSSLVEAGESCSALSGSFQRIVWDPTIAD 151 (533) Q Consensus 73 ~~~~~~~n~~k~i~~~~a~ll~~e~-~~i~~~~~~~~~~~~l~~i~~~n~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~ 151 (533) .+-......+..++...+++|.+.- |.-.. . ...+.+++.+.+.+|.+|+.+..|..|. T Consensus 68 a~~~~~~~~h~~~l~~k~n~l~~~~~Pn~~~-----t--------------~~~f~~~~~d~ll~Gnay~~~~rn~~G~- 127 (351) T protein:vir:78 68 AKSFRASTHHSSALFFKANVLASTFRPHRWL-----S--------------RHAFERWALDFLTFGNGYLERRRNMVGG- 127 (351) T ss_pred HHHHhhhHhhhhhhhhhhhHHhhcccCCCCC-----C--------------HHHHHHHHHHHHhcCCeEEEEEECCCCC- Confidence 0000011122233344455554321 11100 0 1113344555667799999888876543 Q ss_pred ceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccc Q lcl|NC_016654. 152 NAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVE 231 (533) Q Consensus 152 ~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~ 231 (533) -+.+..+++.++.+..+.++ .+|+ . +. |.++.+ T Consensus 128 ~~~L~pl~~~~v~~~~~~~~----------------~~~~--------------~-~~----~~~~~~------------ 160 (351) T protein:vir:78 128 TLRLEPALAKYVRRKADFSG----------------FVYV--------------N-GW----QERHEF------------ 160 (351) T ss_pred EEEEEEecCcceEEeeeCCe----------------EEEE--------------e-cC----CeEEEE------------ Confidence 35555566655443322211 1110 0 00 111000 Q ss_pred ccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHH-HhCcceeee--- Q lcl|NC_016654. 232 GADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDF-RIGAGKVHA--- 307 (533) Q Consensus 232 ~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~~~~i~v--- 307 (533) . .--+.|+.+.-++ ...+|.|.+..++.. + .++..-+.+.+.+ +.|...=+| T Consensus 161 ----------~----~~eVihir~~~~~--------~~~yGl~~~~~a~~s-i-~l~~~a~~~~~~~f~NGa~pggIl~~ 216 (351) T protein:vir:78 161 ----------A----PDSVFQLVRPDIN--------QEVYGLPEYLSSLHS-A-WLNESSTLFRRKYYENGSHAGFILYM 216 (351) T ss_pred ----------c----cccEEEEcCCCCC--------CCcccccHHHHHHHH-H-HHHHHHHHHHHHHHhccCCCceEEEe Confidence 0 0012233222111 244688888877643 3 3556666666554 554322222 Q ss_pred chHHhcCCCCccccccCcchhhhhhcccccc--------cc-ccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCC Q lcl|NC_016654. 308 SESVLTNLGMGQGVSLDEEQEVYSRVGSGGF--------NA-NGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYS 378 (533) Q Consensus 308 ~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~--------~~-~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s 378 (533) +...+.. ...+.-...+.... +.. .. +.++...++.++......++.+.-+...+.|+...|+| T Consensus 217 ~~~~ls~------e~~~~lr~~~~~~~-G~~N~~~~~v~~~~g~~~g~k~~pls~~~~d~qf~e~k~~~~~eIa~a~~VP 289 (351) T protein:vir:78 217 TDAAQKQ------DDVDNMRDALKNAK-GPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVP 289 (351) T ss_pred cCCCCCH------HHHHHHHHHHHHhc-CcccccceeeecCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCC Confidence 1111100 00000011111111 000 11 11122234455555566788888888889999999999 Q ss_pred hhhcccCCCcc---hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCH Q lcl|NC_016654. 379 PVSLGLSDEVA---QTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESD 455 (533) Q Consensus 379 ~~~~g~~~~~~---~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~ 455 (533) |..+|+..++. .+.++. .+..++..|..+++.+.++.+. + ..+ -|.|++.....- T Consensus 290 p~llGi~~~~t~~~sn~e~~-------------~~~f~~~~l~P~~~~iee~n~~-l------~~~--~~~F~~~~Llr~ 347 (351) T protein:vir:78 290 PQLLGIVPSNSGGFGTPDTA-------------ARVFGRNEIRPLQARFAELNDW-L------GDE--VVRFDDYEIPPA 347 (351) T ss_pred HHHhcccCCCCCCcccHHHH-------------HHHHHHHHHHHHHHHHHHHHhh-c------Ccc--ceecChhhhccc Confidence 99998743322 222222 1233334444444444443221 1 111 256776654444 Q ss_pred HHHH Q lcl|NC_016654. 456 LAKA 459 (533) Q Consensus 456 ~e~a 459 (533) ++.+ T Consensus 348 d~ka 351 (351) T protein:vir:78 348 PVAA 351 (351) T ss_pred cccC Confidence 4333 No 257 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=64.50 E-value=0.3 Score=23.46 Aligned_cols=286 Identities=10% Similarity=0.031 Sum_probs=109.0 Q ss_pred HhcccCCCCCcccceeecChHHHHHHHHHHhhcCCCceEeeCCC---------------------------------chH Q lcl|NC_016654. 62 FHGRTPTATGRAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGK---------------------------------SKE 108 (533) Q Consensus 62 ~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~---------------------------------~~~ 108 (533) +..+++.+... ....-....-.+-||+|..+....+ ... T Consensus 1 m~~~~~~~~~~--------~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~a~~~h~s~ 72 (340) T protein:vir:98 1 MSKRKPRKAVA--------MTASAPQKMEAFTFGEPVPVLDKRDILDYVECISNGKWYEPPVSFSGLAKSLRSAVHHSSP 72 (340) T ss_pred CCCCCCCcccc--------ccccCccceeEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHHhccccchh Confidence 00000000000 0000000001122333211110000 000 Q ss_pred HHH---HHHHHHhhccH--HHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEee Q lcl|NC_016654. 109 VQA---RADLIFNTPRF--HSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAG 183 (533) Q Consensus 109 ~~~---~l~~i~~~n~f--~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~ 183 (533) +.. .|...+.-|.+ ...+..++..-+..|.+|+.+..|..|. -+.+..+++.++-... T Consensus 73 i~~k~n~l~~~~~Pn~~lt~~~f~~~~~d~ll~Gnay~~~~rn~~G~-~~~L~pl~~~~vr~~~---------------- 135 (340) T protein:vir:98 73 IYVKRNVLASTYIPHPLLSRQDFSRFALDYLVFGNAFLEQRHSVTGQ-LIKLLTSPAKYTRRGV---------------- 135 (340) T ss_pred hhhhhhHHhhccCCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCc-EEEEEEeCCceEEEcc---------------- Confidence 000 01111111111 1334455556666788888877765443 2344444443332211 Q ss_pred cCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCccccccc Q lcl|NC_016654. 184 GDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWR 263 (533) Q Consensus 184 ~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~ 263 (533) ++..+|. +... |..+.+ . .--+.|+.+..++ T Consensus 136 -~~~~~~~--------------~~~~----~~~~~~----------------------~----~~eViHir~~~~~---- 166 (340) T protein:vir:98 136 -DDSVFWF--------------VENF----TQPHEF----------------------A----PDTVFHLLEPDIN---- 166 (340) T ss_pred -cCcEEEE--------------EecC----CeEEEE----------------------c----cccEEEEcCCCCC---- Confidence 1111111 0000 111100 0 0012333321111 Q ss_pred ccccccccccchhhhhHHHHHHHHHHHHHHHHHHH-HhCccee---eechHHhcCCCCccccccCcchhhhhh------- Q lcl|NC_016654. 264 HDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDF-RIGAGKV---HASESVLTNLGMGQGVSLDEEQEVYSR------- 332 (533) Q Consensus 264 ~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~~~~i---~v~~~~l~~~~~~~~~~~d~~~~~~~~------- 332 (533) ...+|.|.+..++..+ .++..-..+.+.+ +.|...= +++...+... . .+.-.+.+.. T Consensus 167 ----~~~~Gls~~~~a~~si--~l~~aa~~~~~~~f~NGa~pg~il~~~~~~ls~e---~---~~~lk~~~~~~~G~~n~ 234 (340) T protein:vir:98 167 ----QEIYGLPEYLSALNSA--WLNESATLFRRKYYQNGAHAGYIMYVTDPAQSAT---D---VESLRDAMRNSKGLGNF 234 (340) T ss_pred ----CCcccccHHHHHHHHH--HHHHHHHHHHHHHHhccCCCceEEEecCCCCCHH---H---HHHHHHHHHHhcCcccc Confidence 1346888888776433 3566666666654 5543222 2221111100 0 0000111111 Q ss_pred --ccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc---hhHHHHHHHhhhHHHHHH Q lcl|NC_016654. 333 --VGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA---QTATEASGKKDLTVKTTR 407 (533) Q Consensus 333 --~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~---~Tatai~~~~~~l~~~~~ 407 (533) +.+... .+..+...++.++......++++.-+...+.|+..-|+||+..|+..++. .+.++. T Consensus 235 ~~~~vl~~-~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~------------ 301 (340) T protein:vir:98 235 KNLFFYSP-NGKPDGIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPFQLMGGKPENIGSLGDVEKV------------ 301 (340) T ss_pred CceeEecC-CCCccceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHH------------ Confidence 111100 01112223445555556678888888889999999999999999743322 222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHH Q lcl|NC_016654. 408 AKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDL 456 (533) Q Consensus 408 ~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~ 456 (533) .+..++..|.-++..+.++.+. + +. + .|.|++....+.+ T Consensus 302 -~~~f~~~~l~Pl~~~iee~n~~-L-~~------e-~~rF~~~~l~~~d 340 (340) T protein:vir:98 302 -AKVFVRNELSPLQDRFREVNDW-L-GM------E-VIRFKEYTLDNPE 340 (340) T ss_pred -HHHHHHHHHHHHHHHHHHHHhc-c-cc------c-ccccCccccccCC Confidence 1233334445555544443221 1 11 1 1567766554444 No 258 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=63.33 E-value=0.32 Score=23.30 Aligned_cols=354 Identities=11% Similarity=-0.012 Sum_probs=110.9 Q ss_pred cCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecCh--HHHHHHHHHHhhcCCCceE-eeCC-- Q lcl|NC_016654. 30 EGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPI--PGVIAKLSTTELFSEQLKF-LDAG-- 104 (533) Q Consensus 30 ~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~--~k~i~~~~a~ll~~e~~~i-~~~~-- 104 (533) +|=..++..|... ...+.....+ ...|. .+.++. -..+|+.+|+-+.+=|... .... T Consensus 1 M~if~~~~~~~~~-----------~~~~~~~~~~----~~~~~---~~~~~~~~v~~~v~~Ia~~iA~lp~~~~~~~~~~ 62 (378) T protein:vir:94 1 MNLFGKVVSFSRG-----------KLNNDTQRVT----AWQNE---AVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSD 62 (378) T ss_pred CchhHHhHhhhhc-----------ccccCcceee----eeecc---hhhhhhHHHHHHHHHHHHhHhhCceeeeeecccc Confidence 3322222221100 0000000000 01111 112222 2344555555444434321 1100 Q ss_pred --Cc---hHHHHHHHHHHhh--c--cHHHHHHH-HHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceE Q lcl|NC_016654. 105 --KS---KEVQARADLIFNT--P--RFHSSLVE-AGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVA 174 (533) Q Consensus 105 --~~---~~~~~~l~~i~~~--n--~f~~~~~~-~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~ 174 (533) .+ .....-|..+|+. | .=...+.+ .+...+-.|.+|+.+.++... +.+... +| .++ T Consensus 63 ~~~~~~~~~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~-g~~~~~-------~~--~~~---- 128 (378) T protein:vir:94 63 VGSDTLISMAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSET-GELLDL-------LF--AND---- 128 (378) T ss_pred cccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCC-CcEEEE-------EE--ecC---- Confidence 00 0111223344432 1 11122333 344455568788765554221 222110 00 001 Q ss_pred EEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEec Q lcl|NC_016654. 175 VTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVP 254 (533) Q Consensus 175 v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~p 254 (533) + ..|..-| --+|++-++.+...+ ++..... .+. ....++.. .++.-.| T Consensus 129 -----------~-~~~~~~d---vih~~~~~~~~~~~~-----~~~~~~~------~~~-----~~~~~~~~-~g~l~~~ 176 (378) T protein:vir:94 129 -----------K-KEYKPEE---LVRLTSPFYINEDTS-----ILDNALA------SIQ-----TKLEQGKL-RGLLKIN 176 (378) T ss_pred -----------c-EEechhc---eeeecCcCCcccchh-----HHHHHHH------HHH-----HHHhhCCc-ccceeeC Confidence 0 0010000 000000000000000 0000000 000 00001110 0110000 Q ss_pred CCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCc--ceeeechHHhcCCCCccccccCcchhhhhh Q lcl|NC_016654. 255 NVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGA--GKVHASESVLTNLGMGQGVSLDEEQEVYSR 332 (533) Q Consensus 255 n~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~--~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~ 332 (533) . .+++ .+.+.+-+.+...|. ....+. .++.| T Consensus 177 ~-----------~l~~-------~~~~~~~e~~~~~~~----~~~~~~n~~~~~v------------------------- 209 (378) T protein:vir:94 177 A-----------FLDI-------DNTQEYREKALATIK----NMQEGSSYNGLTP------------------------- 209 (378) T ss_pred C-----------cCCH-------HHHHHHHHHHHHHHH----Hhhccccccccee------------------------- Confidence 0 0000 001111111111111 100000 00111 Q ss_pred ccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHH Q lcl|NC_016654. 333 VGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARH 412 (533) Q Consensus 333 ~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~ 412 (533) .+.+ ..++.++......+ +..++...++|+...|+||+.++- |+.+... ... T Consensus 210 -----l~~g----~~~~~l~~~~~~~~-~~~~~~~~~~Ia~~fgvPp~~l~g------~~~e~~~------------~~f 261 (378) T protein:vir:94 210 -----VDNK----TEIVELKKDYSVLN-KDEIDLIKSELLTGYFMNENILLG------TATQEQQ------------IYF 261 (378) T ss_pred -----ccCC----ceEEEccCChHHhh-HHHHHHHHHHHHHHhCCCHHHhcC------CchHHHH------------HHH Confidence 1111 11333333222223 345566677899999999988741 1112111 123 Q ss_pred HHHHHHHHHHHHHHHHHhhcc-------CCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC Q lcl|NC_016654. 413 FGSALGPLSTTCLRVDAIKFP-------GKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWD 485 (533) Q Consensus 413 ~~~al~~li~~il~l~~~~~~-------~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~ 485 (533) ++.+|..++..+-.-.+..+. +........+.++++.-.-.|..+.++.+.+++.+|+|+..++.+++ ++. T Consensus 262 ~~~tl~P~~~~ie~~l~~~Ll~~~e~~~g~~~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~~t~NE~R~~~--g~~ 339 (378) T protein:vir:94 262 YNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKM--GEQ 339 (378) T ss_pred HHHHHHHHHHHHHHHHHhhcCChhHhhhhhhhcccceeEeecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh--CCC Confidence 333444444433222221111 11111224466777778889999999999999999999999976654 232 Q ss_pred HHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 486 DERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 486 dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) .-+--.++- +.. + ..+ .+...+..+ +..++.+-+++++| T Consensus 340 p~~ggd~~~-~~~-n--~~~-~~~~~~~~~----~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 340 PIEGGDVYI-ANL-N--AVA-VKNLSDLQG----NRKDVTSTDETNNQ 378 (378) T ss_pred CCCCCCeee-ecc-c--ccc-hhcchhccc----ccCCCCCCCCCCCC Confidence 200000000 000 0 000 000001111 11111112222223 No 259 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=63.31 E-value=0.32 Score=23.30 Aligned_cols=354 Identities=8% Similarity=-0.036 Sum_probs=113.4 Q ss_pred HHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChH--HHHHHHHHHhhcCCCce Q lcl|NC_016654. 22 VAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIP--GVIAKLSTTELFSEQLK 99 (533) Q Consensus 22 ~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~--k~i~~~~a~ll~~e~~~ 99 (533) |-.|+. +..|-... . ......... -....+.++.+ ..+++.+|+-+-.=|.. T Consensus 1 Mg~f~~--------~~~~~~~~-~----------------~~~~~~~~~-~~~~~~~~~~~~v~~~i~~Ia~~iA~l~~~ 54 (378) T protein:vir:16 1 MNLFGK--------VVSFSRGK-L----------------NNDTQRVTA-WQNEAVEYTSAFVTNIHNKIANEITKVEFN 54 (378) T ss_pred Cccchh--------hhhhhccc-c----------------cCCcceeee-cccchhhHHHHHHHHHHHHHHhhhhhCcee Confidence 222211 11000000 0 000000000 00111112222 23344444444433432 Q ss_pred Ee-eCCC-------chHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEE Q lcl|NC_016654. 100 FL-DAGK-------SKEVQARADLIFNT--P---RFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPE 166 (533) Q Consensus 100 i~-~~~~-------~~~~~~~l~~i~~~--n---~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~ 166 (533) +- -... ......-|..+|+. | ....-....+...+..|.+|+.+.+|... +.+. . .+|. T Consensus 55 ~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~-g~~~-~------l~~~ 126 (378) T protein:vir:16 55 HVKYKKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNT-GELL-D------LLFA 126 (378) T ss_pred EEEEcccccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCC-ceEE-E------EEec Confidence 21 0000 00112334445542 1 11222333445556678888877666321 1211 0 1111 Q ss_pred EecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEE--EeccCCcccceeehhhccccccccccccccCCceeecCC Q lcl|NC_016654. 167 FRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAV--YKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETG 244 (533) Q Consensus 167 ~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~--y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g 244 (533) ++. ..|. ...|.|-- |.+.. |. -++.... ..+. ....++ T Consensus 127 --~~~----------------~~~~------~~diih~r~~~~~~~---~~-s~l~~~~------~~i~-----~~~~~~ 167 (378) T protein:vir:16 127 --DDK----------------KEYK------PEELVRLTSPFYINE---DT-SILDNAL------ASIQ-----TKLEQG 167 (378) T ss_pred --CCe----------------eEec------ccceEEecCccCccc---hh-HHHHHHH------HHHH-----HHHhcC Confidence 000 0000 00011100 00000 00 0010000 0000 000111 Q ss_pred CccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCCccccccC Q lcl|NC_016654. 245 VKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGMGQGVSLD 324 (533) Q Consensus 245 ~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~~~~~d 324 (533) . +.++.-.|.. + +. .+.+.+.+.+...|+.....- ...++.| T Consensus 168 ~-~~g~l~~~~~--------------l--~~--~~~~~~~~~~~~~~~~~~~~~--~~g~~~v----------------- 209 (378) T protein:vir:16 168 K-LRGLLKINAF--------------L--DI--DNTQEYREKALTTIKNMQEGS--SYNGLTP----------------- 209 (378) T ss_pred c-cceeeEeCCc--------------C--CH--HHHHHHHHHHHHHHHHhhccc--ccccceE----------------- Confidence 1 1111100000 0 00 011111111111111100000 0000111 Q ss_pred cchhhhhhccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHH Q lcl|NC_016654. 325 EEQEVYSRVGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVK 404 (533) Q Consensus 325 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~ 404 (533) .+. ...++.++.+..+.+ +..++...++|+...|+||..++- + ++ +.. T Consensus 210 -------------l~~----g~~~~~l~~~~~~~~-~~~~~~~~~~Ia~~fgVPp~~l~g--~---~~-e~~-------- 257 (378) T protein:vir:16 210 -------------VDN----KTEIVELKKDYSVLN-KDEIDLIKSELLTGYFMNENILLG--T---AS-QEQ-------- 257 (378) T ss_pred -------------cCC----CceEEEccCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC--C---ch-HHH-------- Confidence 011 111333333323333 344566677899999999988731 1 11 111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcc-------CCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHH Q lcl|NC_016654. 405 TTRAKARHFGSALGPLSTTCLRVDAIKFP-------GKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKV 477 (533) Q Consensus 405 ~~~~~~~~~~~al~~li~~il~l~~~~~~-------~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v 477 (533) ....+..+|..+++.+..-.+..+. +........+.++++.-...|..+.++.+.+++.+|+|+..++. T Consensus 258 ----~~~f~~~tl~P~~~~ie~~l~~kLl~~~e~~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R 333 (378) T protein:vir:16 258 ----QIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLL 333 (378) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHhhcCChhhhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHH Confidence 1123444455554444332222211 11111223466677778888999999999999999999999977 Q ss_pred HHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 478 AYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 478 ~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +++ ++..-+ .-+++.--. -..+ .+...+.. .+..++.+-+++++| T Consensus 334 ~~~--g~~p~~---ggD~~~~~~-n~~~-~~~~~~~~----~~~~~~~~~~e~~ne 378 (378) T protein:vir:16 334 VKM--GEQPIE---GGDVYIANL-NAVA-VKNLSDLQ----GSRKDVTSTDETNNQ 378 (378) T ss_pred HHh--CCCCCC---CCCeEeecc-cccc-ccchhhhc----CccCCCCCCCCCCCC Confidence 664 222100 000000000 0000 00000111 111111122222333 No 260 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=62.67 E-value=0.33 Score=23.22 Aligned_cols=417 Identities=11% Similarity=-0.003 Sum_probs=165.2 Q ss_pred CCCC--CCcCCCcCcchH-HHHHHHHh-hhHhhcC------CHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCC Q lcl|NC_016654. 1 MSLP--EANTAWPPPELA-AVTARVAE-SHVWWEG------DLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTAT 70 (533) Q Consensus 1 ~~~~--~~~~~~pp~~~~-~~~~~~~~-~~~w~~g------d~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 70 (533) |+-= -.|.|..++.+. +....+.. ++.| ++ +|.+|.+..+....-....+-.++ ..+.. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~-~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~----edm~e------ 69 (526) T protein:vir:79 1 MAQIVDVYGNPIRPQQLREPQTSRLAGLAKEF-AQHPAKGLTPAKLARILVEAEQGNLQAQAELF----MDMEE------ 69 (526) T ss_pred CCeeeCCCCCccCccccchhhhhhhhhhhhhc-ccCCCCCcCHHHHHHHHHHhhCCCHHHHHHHH----HHHHh------ Confidence 4321 334455444432 22222222 3333 22 455665555433221111111111 11100 Q ss_pred CcccceeecChHHHHHHHHHHhhcCCCceEeeCC----CchHHHHHHHHHHhhc-cHHHHHHHHHHHHhhhCCEEEEEEE Q lcl|NC_016654. 71 GRAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAG----KSKEVQARADLIFNTP-RFHSSLVEAGESCSALSGSFQRIVW 145 (533) Q Consensus 71 g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~----~~~~~~~~l~~i~~~n-~f~~~~~~~~~~~~~~G~~~~~~~~ 145 (533) + ..-+.. ++.+--.-+.+-+-.|.... .+....+.+++++.+- +|...+..++ .|..+|-+++-+.| T Consensus 70 --~----D~~i~s-~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~l-dA~~~G~s~~Ei~w 141 (526) T protein:vir:79 70 --R----DAHLFA-EMSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDAL-DGIGHGYSCIELEW 141 (526) T ss_pred --h----ChHHHH-HHHHHHHHHhCCCceEecCCCCChHHHHHHHHHHHHHhcccCHHHHHHHHH-hhhhhcceeEEEEE Confidence 0 011222 22333344555555554321 1234456677777653 4777776654 48888999998888 Q ss_pred cCCCCCceE---EEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhc Q lcl|NC_016654. 146 DPTIADNAW---IDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDH 222 (533) Q Consensus 146 D~~~~~~~~---i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~ 222 (533) +.+++ ... +.+.++..|. +...++. .+. ++. +..-|.++ T Consensus 142 ~~~~g-~~~~~~l~~r~~~~F~----------------~~~~~~~------------~l~---~~~-~~~~g~~l----- 183 (526) T protein:vir:79 142 ALQGR-EWMPLAFHHRPQSWFQ----------------LNPEDQN------------ELR---LRD-NSPAGEAL----- 183 (526) T ss_pred eecCC-ceeEEEeeeecccceE----------------eccCCCc------------EEE---ecC-CCCCceee----- Confidence 76432 111 1112221111 0001110 000 111 00001111 Q ss_pred cccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHH-HHHHHHHHHHHHHHHHH-h Q lcl|NC_016654. 223 PATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFP-TFHELDRIYSSLMRDFR-I 300 (533) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~-lid~lD~~~s~~~~~~~-~ 300 (533) .+.-|++|+.... ...|+|.+.+..+.-. +++ ...+..|+.=++ . T Consensus 184 ----------------------~~~k~iv~~~~~~---------~g~p~g~gLlr~~~w~~~fK--~~~~~~w~~F~E~y 230 (526) T protein:vir:79 184 ----------------------QPFGWIIHRPRAR---------SGYVARSGLFRVLAWPYLFR--HYATSDLAEMLEIY 230 (526) T ss_pred ----------------------cCCceEEEeecCC---------cCCccccchHHHHHHHHHHH--HhhHHHHHHHHHHc Confidence 0112344443211 1355677666654322 222 233444443333 3 Q ss_pred CcceeeechHHhcCCCCccccccCcchhhhhhcccccccccc--ccccceeeech-hhhhHHHHHHHHHHHHHHHHhhCC Q lcl|NC_016654. 301 GAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG--DMETIFEFFQP-AIRVLEHDQGAALLLREVLRKTGY 377 (533) Q Consensus 301 ~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~--~~~~~i~~~~~-~ir~e~~~~~l~~~l~~i~~~~g~ 377 (533) |-+..+. .+..+.. -+....+..++..-..++++ .....|+.++. ....+.|.+.++.+=++|+..+ + T Consensus 231 G~P~~ig------ky~~~a~--~~ek~~L~~av~~i~~da~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i-L 301 (526) T protein:vir:79 231 GLPIRLG------KYPPGTA--DEEKATLLRAVTGLGHAAAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV-L 301 (526) T ss_pred CCceEEE------ecCCCCC--HHHHHHHHHHHHHHhcCcEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH-h Confidence 4332222 1111111 01122223322221111111 11223444443 2233456666666655665554 3 Q ss_pred ChhhcccC----CCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCC Q lcl|NC_016654. 378 SPVSLGLS----DEVAQTATEASGKKDLTVKTTRAKARHFGSALG-PLSTTCLRVDAIKFPGKGAAPSEELELEWPKFAR 452 (533) Q Consensus 378 s~~~~g~~----~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~-~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~ 452 (533) + +|++.+ .+|..+..++...- ....++.-.+.+...|. +|++.++.+. + +........+.+.|+..-+ T Consensus 302 G-qtlTs~~~~g~~gS~a~g~vh~~v--~~di~~aDa~~i~~tln~~Li~~l~~~N---~-~~~~~~~~~p~~~~~~~e~ 374 (526) T protein:vir:79 302 G-GTLTSTTSQSGGGAFALGQVHNEV--RHDILASDARQLAATLSRDLLWPLLVLN---R-PGSPDVRRAPRLVFDLREQ 374 (526) T ss_pred h-hhhccccccCcchhhhhHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHhC---C-CCcCCccccceEEeCCCCc Confidence 3 233221 11222222222111 22234445566667774 5777776551 1 1111234457889999999 Q ss_pred CCHHHHHHHHHHHHhCCC-CCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhc-ccCccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 453 ESDLAKAQTVQAWSVASA-ASTKTKVAYLHEDWDDERVQEEADLIDNANTV-SAPTFGFGTDQPPLPTENDPATDPEAVD 530 (533) Q Consensus 453 ~d~~e~a~~~~~l~~aGi-~S~et~v~~l~~~~~dee~~~El~rI~~E~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (533) +|..+.++.+.+|+..|+ +|.+ ++++.++ ....+..+++ ...... ..+....+. ..... .....+..+ T Consensus 375 eDl~~~a~~~~~L~~~G~~i~~~-~i~e~~g-ip~~~~~e~~---l~~~~~~~~~~~~~~~-~~~~~----~~~~~~~~~ 444 (526) T protein:vir:79 375 ADITSMAQSIPALVNVGLEIPSA-WVYDKLG-IPQPAKNEPV---LRPAAQPAILSRQHGQ-RVAAL----ATIVGPRYG 444 (526) T ss_pred ccHHHHHHHHHHHHhCCCcCCHH-HHHHHhC-CCCCCCchhh---ccccCCcccccccccc-ccccc----cccccccCc Confidence 999999999999999998 6665 4555454 4332111121 111110 000000000 00000 001111111 Q ss_pred CCC Q lcl|NC_016654. 531 EGE 533 (533) Q Consensus 531 d~~ 533 (533) +-+ T Consensus 445 ~~~ 447 (526) T protein:vir:79 445 DQQ 447 (526) T ss_pred hhh Confidence 111 No 261 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=57.62 E-value=0.43 Score=22.59 Aligned_cols=354 Identities=10% Similarity=-0.034 Sum_probs=115.5 Q ss_pred cCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecC--hHHHHHHHHHHhhcCCCceE-eeCCCc Q lcl|NC_016654. 30 EGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAP--IPGVIAKLSTTELFSEQLKF-LDAGKS 106 (533) Q Consensus 30 ~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n--~~k~i~~~~a~ll~~e~~~i-~~~~~~ 106 (533) .|=...+..|.+..... ...+ .+. .....+..+ ....+++.+|+-+-+=|+.+ .....+ T Consensus 1 Mg~f~~~~~~~~~~~~~-----------~~~~-----~~~--~~~~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~ 62 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNN-----------DTQR-----VTA--WQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSD 62 (378) T ss_pred CCccccchhcccccccC-----------Ccce-----eee--eccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccC Confidence 22111111110000000 0000 000 001112222 22234555555555445432 111000 Q ss_pred -------hHHHHHHHHHHhh--cc---HHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceE Q lcl|NC_016654. 107 -------KEVQARADLIFNT--PR---FHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVA 174 (533) Q Consensus 107 -------~~~~~~l~~i~~~--n~---f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~ 174 (533) ...+.-|.++|+. |. ...-....+...+..|.+|+.+.++... +++..- +|. ++. T Consensus 63 ~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~-g~~~~l-------~p~--~~~--- 129 (378) T protein:vir:94 63 VGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNT-GELLDL-------LFA--DDK--- 129 (378) T ss_pred cccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCC-ceEEEE-------Eec--CCe--- Confidence 0011234445442 11 1123344455567778888776665321 222110 110 110 Q ss_pred EEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCc-ccceeehhhccccccccccccccCCceeecCCCccceeEEe Q lcl|NC_016654. 175 VTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATS-LGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYV 253 (533) Q Consensus 175 v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~-lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 253 (533) ..|. ...|.| +++.-+. -|. -+++..... +. ....++. +.++.-. T Consensus 130 -------------~~~~------~~diiH--~~~~~~~~~g~-s~l~~~~~~------i~-----~~~~~~~-~~gil~~ 175 (378) T protein:vir:94 130 -------------KEYK------PEELVR--LTSPFYINEDT-SILDNALAS------IQ-----TKLEQGK-LRGLLKI 175 (378) T ss_pred -------------eEee------eeeeEE--ecCcCCccchh-HHHHHHHHH------HH-----HHHhccc-ccceeee Confidence 0000 000111 0000000 000 011100000 00 0000110 0011000 Q ss_pred cCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCCccccccCcchhhhhhc Q lcl|NC_016654. 254 PNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRV 333 (533) Q Consensus 254 pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~ 333 (533) |. .+ +. .+.+.+.+.+...|+.....-. ..++.| T Consensus 176 ~~--------------~l--~~--~~~~~~~~~~~~~~~~~~~~~~--~g~~~v-------------------------- 209 (378) T protein:vir:94 176 NA--------------FL--DI--DNTQEYREKALTTIKNMQEGSS--YNGLTP-------------------------- 209 (378) T ss_pred CC--------------cC--CH--HHHHHHHHHHHHHHHHhhcccc--ccccee-------------------------- Confidence 00 00 00 0011111111111111000000 000000 Q ss_pred cccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHH Q lcl|NC_016654. 334 GSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHF 413 (533) Q Consensus 334 ~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~ 413 (533) .+. ...++.++.+..+.+ +..++...++|+...|+||..++- |..+.. ....+ T Consensus 210 ----l~~----g~~~~~l~~~~~~~~-~~~~~~~~~~Ia~~fgVP~~~l~~------~~se~~------------~~~f~ 262 (378) T protein:vir:94 210 ----VDN----KTEIVELKKDYSVLN-KDEIDLIKSELLTGYFMNENILLG------TASQEQ------------QIYFY 262 (378) T ss_pred ----cCC----CceEEEccCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC------ChHHHH------------HHHHH Confidence 111 112333333333333 345566677899999999988731 111211 12244 Q ss_pred HHHHHHHHHHHHHHHHhhccC-------CCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCH Q lcl|NC_016654. 414 GSALGPLSTTCLRVDAIKFPG-------KGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDD 486 (533) Q Consensus 414 ~~al~~li~~il~l~~~~~~~-------~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~d 486 (533) ..+|..+++.+..-.+..+.. ........+.++++.-...|..+.++.+.+++.+|+|+.-++.+++ ++.. T Consensus 263 ~~tL~P~~~~ie~~l~~~Ll~~~er~~g~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~--gl~p 340 (378) T protein:vir:94 263 NSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKM--GEQP 340 (378) T ss_pred HHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh--CCCC Confidence 445555554443322222211 1111123466667777888999999999999999999999976654 2221 Q ss_pred H-HHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 487 E-RVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 487 e-e~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) - .-+ ++- +.. + ..| .+...+..+ +..++.+.++++.| T Consensus 341 ~~gGD-~~~-~~~-n--~~~-~~~~~~~~~----~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 341 IEGGD-VYI-ANL-N--AVA-VKNLSDLQG----SRKDVTSTDETNNQ 378 (378) T ss_pred CCCCC-eee-ecc-c--ccc-cccchhhcC----CcCCCCCCCCCCCC Confidence 0 000 000 000 0 000 000000000 01111122222333 No 262 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=50.78 E-value=0.6 Score=21.80 Aligned_cols=440 Identities=9% Similarity=0.050 Sum_probs=149.3 Q ss_pred CCCC-CCcCCCcCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeec Q lcl|NC_016654. 1 MSLP-EANTAWPPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHA 79 (533) Q Consensus 1 ~~~~-~~~~~~pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 79 (533) |.=- ..++-=+|..++.+..... +|++.. +|..... ..++...+ ..+.++. + .. T Consensus 1 ~~~~~~~~~gl~p~rl~~i~~~~~------~~~~~~---~~~~~~~--~Lr~~~~~-~ly~~m~---------~----D~ 55 (488) T protein:vir:95 1 MADITETQESLPPFRMGEVGSLGL------KVKNGR---IYEEPRQ--ALRFPESI-KTFQLMM---------R----DP 55 (488) T ss_pred CCCccccCCCCCHHHHHHHHHHhh------ccccch---hhccchh--hhcccchH-HHHHHHh---------h----Ch Confidence 5433 3333456666666543321 111111 1110000 00000000 0111110 0 01 Q ss_pred ChHHHHHHHHHHhhcCCCceEeeCC---Cch---HHHHHHHHHHhhcc--HHHHHHHHHHHHhhhCCEEEEEEEcCCCCC Q lcl|NC_016654. 80 PIPGVIAKLSTTELFSEQLKFLDAG---KSK---EVQARADLIFNTPR--FHSSLVEAGESCSALSGSFQRIVWDPTIAD 151 (533) Q Consensus 80 n~~k~i~~~~a~ll~~e~~~i~~~~---~~~---~~~~~l~~i~~~n~--f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~ 151 (533) -+.. +.++--.-|.+-.-.|...+ ++. ...+.++.++++-. |...+..+ ..|..+|-+++-+.|...... T Consensus 56 hi~s-~l~~Rk~av~~~~w~v~p~~~~~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~-lda~~~G~s~~Eivw~~~~~~ 133 (488) T protein:vir:95 56 AVAA-SVNIIKMFVRKVNWRFVPPKGKEQDPKMLERADFFNSLMDDMEHDWADFINSV-MSFCTYGFCVNEKVYKKRQGK 133 (488) T ss_pred HHHH-HHHHHHHHHhcCCceEecCCCCchhHHHHHHHHHHHHHHhccCccHHHHHHHH-HHhhcccceeeeeeeeccccc Confidence 1222 22333334445444454221 111 23456777766433 44555555 468888999988888653211 Q ss_pred ceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCc-ccceeehhhccccccccc Q lcl|NC_016654. 152 NAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATS-LGWMMALTDHPATRDIAV 230 (533) Q Consensus 152 ~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~-lG~~v~l~~~~~~~~~~~ 230 (533) .... ...+++|++.-.-+.- +-+..+..-.|..++.. ++..-.+........... T Consensus 134 ~~~~--------~~~~~dg~~~~~~i~~----------------Rpq~~~~~f~~d~d~~l~~~~~~~~~~~~~~~~~~~ 189 (488) T protein:vir:95 134 KGKY--------QSKFDDGLIGWAKLPI----------------RNQSTLDKWYFDEDFRRVTGVRQNLRNVSHIAGAIN 189 (488) T ss_pred cccc--------cccccCCeeeeeeeee----------------cCcccccceeeccCCCceeecccccccccccccccc Confidence 1011 1112222211100000 00000000000000000 000000000000000000 Q ss_pred cccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHH-HHHHHHHHHHHHHHHHH---hCcceee Q lcl|NC_016654. 231 EGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFP-TFHELDRIYSSLMRDFR---IGAGKVH 306 (533) Q Consensus 231 ~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~-lid~lD~~~s~~~~~~~---~~~~~i~ 306 (533) +....... .....-|+.|+.... ...|+|.+.+..+.-. +++ +..+..|+.-++ .+-+.+. T Consensus 190 -~~~~~~~~---~lP~~kfi~~~~~~~---------~g~p~g~gLlr~~~w~~~fK--~~~~~~w~~f~Er~g~g~p~~~ 254 (488) T protein:vir:95 190 -LGERPLTR---KLPRAKFMLFKYDDE---------YGNPEGRSPLLNAYVPWKYK--VQIEEYEAVGVSRDLVGMPKIG 254 (488) T ss_pred -cccccccc---cccccceEEEeecCC---------CCccchhhHHHHHHHHHHHH--HHHHHHHHHHHHHhcccceeEe Confidence 00000000 011123455554321 2356777777654422 222 222222332222 2222233 Q ss_pred echHHhcCCCCccccccCcchhhhhhccccc------ccccc--------cccc-c--eeeech-hhhhHHHHHHHHHHH Q lcl|NC_016654. 307 ASESVLTNLGMGQGVSLDEEQEVYSRVGSGG------FNANG--------DMET-I--FEFFQP-AIRVLEHDQGAALLL 368 (533) Q Consensus 307 v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~------~~~~~--------~~~~-~--i~~~~~-~ir~e~~~~~l~~~l 368 (533) .|..+...... +....+..++..-. ..++. +.+. . ++.... ......|.+.++.+= T Consensus 255 ~p~~~~~~~~~------~e~~~l~~a~~~i~~~~~~~~~ag~iiP~g~~~~~k~~~~e~~l~~~~~~~~~~~~~li~~~d 328 (488) T protein:vir:95 255 LPPDYLDENAE------PEKKAFVQYCKTVVNDMIANDRAGLIWPRYIDPDTKEDIFEFSLVSRQGAKAYDTGSIIDRYS 328 (488) T ss_pred eccCCCCCccc------HHHHHHHHHHHHHHHHhhccchhheeeccccccccchhhhhhhccccccCCchhHHHHHHHHH Confidence 33322111000 01111111111000 00000 0000 0 000100 111223444455544 Q ss_pred HHHHHhhCCChh-hcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhccCCCCCCceeEEEE Q lcl|NC_016654. 369 REVLRKTGYSPV-SLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALG-PLSTTCLRVDAIKFPGKGAAPSEELELE 446 (533) Q Consensus 369 ~~i~~~~g~s~~-~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~-~li~~il~l~~~~~~~~~~~~~~~v~i~ 446 (533) ++|+..+ ++.. |-+.+.+|..+..++..+- ....++.-.+.+...|. +||.-++.+ |.+ .....+.+. T Consensus 329 ~~Isk~i-LGqtLT~~~~~~Gs~Al~~vh~ev--~~~i~~aDa~~i~~tln~~li~~l~~~------Nfg-~~~~~P~~~ 398 (488) T protein:vir:95 329 KQIMMAF-MSDVLAMGQSKYGSFSLADSKTSL--LAMSVDILLKQIKNVINRDLVAQTYAL------NMW-DDEEHVQIT 398 (488) T ss_pred HHHHHHH-hccccccccCcchhhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHh------cCC-CCCCccEEE Confidence 5555444 3221 2122222322222332222 22223334455556664 577666554 211 123346788 Q ss_pred eCCCCCCCHHHHHHHHHHHHhCCCCCH----HHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCC Q lcl|NC_016654. 447 WPKFARESDLAKAQTVQAWSVASAAST----KTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDP 522 (533) Q Consensus 447 f~d~i~~d~~e~a~~~~~l~~aGi~S~----et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~ 522 (533) |+..-++|..+.++.+++|+.+|+.-. ++++++.++ ....+-.+.+. ........+..+.+ ...+ T Consensus 399 ~~~~e~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~g-ip~~~~~e~~~--~~~~~~~~~~~~~~--------~~~~ 467 (488) T protein:vir:95 399 YDDIETPDLEAIGSYIQKTVAVGALEVDKELSNKLREHIG-LPPADESQPVS--EKLSPNSQSRSGDG--------YKTA 467 (488) T ss_pred ecCcChhhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhC-CCCCCCCcccc--ccCCCCCCCCCCcc--------cCCC Confidence 998889999999999999999998543 456766553 43221111110 00000000111000 0011 Q ss_pred CCCCCCCCCCC Q lcl|NC_016654. 523 ATDPEAVDEGE 533 (533) Q Consensus 523 ~~~~~~~~d~~ 533 (533) +......+++| T Consensus 468 ~~~~~~~~~~~ 478 (488) T protein:vir:95 468 GEGTAKTPSAK 478 (488) T ss_pred cccCCcccccc Confidence 11112222222 No 263 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=47.93 E-value=0.69 Score=21.48 Aligned_cols=265 Identities=10% Similarity=-0.020 Sum_probs=106.8 Q ss_pred HHHhhcCCCceEeeCCCchHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeE Q lcl|NC_016654. 89 STTELFSEQLKFLDAGKSKEVQARADLIFNT--P---RFHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRA 163 (533) Q Consensus 89 ~a~ll~~e~~~i~~~~~~~~~~~~l~~i~~~--n---~f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~ 163 (533) .|+ -|..+.-..+ .....+..++.. | .....+...+...+..|.+++.+..|..+. -+.+..++|+.+ T Consensus 1 ia~----l~~~~~~~~~--~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~-~~~l~~l~~~~v 73 (278) T protein:vir:78 1 MAS----LPLKMYEDYK--VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQ-PSKLFLLNPDVV 73 (278) T ss_pred Ccc----ceeEEEecCc--ccccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCc-EEEEEEECCcee Confidence 121 1221211111 011112222211 1 122334555667777899999888876543 245666777777 Q ss_pred EEEEec-CCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeec Q lcl|NC_016654. 164 IPEFRW-GRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVE 242 (533) Q Consensus 164 ~P~~~~-g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~ 242 (533) .+..++ +.. +|..++. .. |..+.+ T Consensus 74 ~v~~~~~~~~----------------~~y~~~~------------~~----g~~~~~----------------------- 98 (278) T protein:vir:78 74 EMLIENQSRE----------------LYYSIHA------------AT----GNKLIV----------------------- 98 (278) T ss_pred EEEEcCCCce----------------EEEEEEc------------CC----ceEEEE----------------------- Confidence 654322 211 1100000 00 111100 Q ss_pred CCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHH-HHHHhCcceeeechHHhcCCCCcccc Q lcl|NC_016654. 243 TGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLM-RDFRIGAGKVHASESVLTNLGMGQGV 321 (533) Q Consensus 243 ~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~-~~~~~~~~~i~v~~~~l~~~~~~~~~ 321 (533) +.--+.|+.+..+. ...+|.|.+..+. ..++....+ ..+. ..+..+...|+.....+.. .. T Consensus 99 ---~~~evih~~~~~~~--------~~~~G~s~~~~~~-~~i~~~~~~-~~~~~~~~~~~~~~i~~~~~~l~~---e~-- 160 (278) T protein:vir:78 99 ---HNMDMLHFKHIVAS--------NMVQGISPIDVLK-NTTDFDNAV-RTFNLTEMQKPDSFMLKYGSNVGK---EK-- 160 (278) T ss_pred ---ccccEEEECCCCCC--------CCeeeccHHHHHH-HHHHHHHHH-HHHHHHHhcCCCcEEEEeCCCCCH---HH-- Confidence 00012333321111 1345777766543 444433222 2221 1122222222221111100 00 Q ss_pred ccCcchhhhhhc-----cccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCc-chhHHHH Q lcl|NC_016654. 322 SLDEEQEVYSRV-----GSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEV-AQTATEA 395 (533) Q Consensus 322 ~~d~~~~~~~~~-----~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~-~~Tatai 395 (533) .......|... .....+ ....++.++......++.+..+...++|+...|+||..+|...++ -.|+++. T Consensus 161 -~~~~~~~~~~~~~~~g~~~vl~----~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~ 235 (278) T protein:vir:78 161 -RQQVLEDFKQYYEENGGILFQE----PGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEEL 235 (278) T ss_pred -HHHHHHHHHHHhccCCCceecC----CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH Confidence 00000111110 011111 112356677777777888888999999999999999999865432 2333332 Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCC Q lcl|NC_016654. 396 SGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFAR 452 (533) Q Consensus 396 ~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~ 452 (533) ....++.+|..++..+..-.+..+...... ....-|.|+-... T Consensus 236 -------------~~~~~~~~l~P~~~~i~~~ln~~L~~~~e~-~~g~~~~f~~~~l 278 (278) T protein:vir:78 236 -------------NRFYLQHTLLPIVKQYEEEFNRKLLTKTDR-EKIGILNLTLNLI 278 (278) T ss_pred -------------HHHHHHHHHHHHHHHHHHHHHhhcCChhHh-cCCceEEEecccC Confidence 123333344444444433222222111100 0113355553333 No 264 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=46.73 E-value=0.72 Score=21.35 Aligned_cols=354 Identities=10% Similarity=-0.002 Sum_probs=110.7 Q ss_pred cCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChH--HHHHHHHHHhhcCCCceEee---CC Q lcl|NC_016654. 30 EGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIP--GVIAKLSTTELFSEQLKFLD---AG 104 (533) Q Consensus 30 ~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~--k~i~~~~a~ll~~e~~~i~~---~~ 104 (533) .|=..++..|.+... ..+.............++.+ ..+|+.+|+-+.+=|..+-- ++ T Consensus 1 M~~f~k~~~~~~~~~------------------~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~ 62 (378) T protein:vir:85 1 MNLFGKVVSFSRGKL------------------NNDTQRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSD 62 (378) T ss_pred Cchhhhhhhhhhccc------------------ccCCcceeeeeccchhhhhHHHHHHHHHHHHhHhhCceeEEEEeccc Confidence 332222221211100 00111000001112222322 23455555555544533210 00 Q ss_pred C--c---hHHHHHHHHHHhh----ccHHHHHHHH-HHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceE Q lcl|NC_016654. 105 K--S---KEVQARADLIFNT----PRFHSSLVEA-GESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVA 174 (533) Q Consensus 105 ~--~---~~~~~~l~~i~~~----n~f~~~~~~~-~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~ 174 (533) . + +....-|.++|+. +.=...+.+. +...+..|.+|+.+.++.. .+.+. +..+.++. T Consensus 63 ~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~-~g~~~---------~~~~~~~~--- 129 (378) T protein:vir:85 63 VGSDTLISMAGSDLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSE-TGELL---------DLLFANDK--- 129 (378) T ss_pred cccccccccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCC-CceEE---------EEEecCCC--- Confidence 0 0 0111224444431 1111223333 3344556888876554422 11111 11111111 Q ss_pred EEEEEEEeecCCceEEEEEEEecCeeEEEEE--EeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEE Q lcl|NC_016654. 175 VTFWSELAGGDGQEVWRHLERHESGYIVHAV--YKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAY 252 (533) Q Consensus 175 v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~--y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 252 (533) ..|. +..|.|.. |... ..+|. +... ...+. ....++. +.++.- T Consensus 130 -------------~~~~------~~dvih~~~~~~~~-~~~~~---~~~a------~~~~~-----~~~~~~~-~~g~l~ 174 (378) T protein:vir:85 130 -------------KEYK------PEELVRLVSPFYIN-EDTSI---LDNA------LASIQ-----TKLEQGK-LRGLLK 174 (378) T ss_pred -------------EEEc------ccceEEEecCcCcc-chhhH---HHHH------HHHHH-----HHHhcCC-cceEEE Confidence 1111 00111100 0000 00000 0000 00000 0001111 111111 Q ss_pred ecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCCccccccCcchhhhhh Q lcl|NC_016654. 253 VPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGMGQGVSLDEEQEVYSR 332 (533) Q Consensus 253 ~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~ 332 (533) .+.. +++ .+.+.+-+.+...++.....-..+ ++.| T Consensus 175 ~~~~-----------l~~-------~~~~~~~~~~~~~~~~~~~~~~~g--~~~v------------------------- 209 (378) T protein:vir:85 175 INAF-----------LDI-------DNTQEYREKALATIKNMQEGSSYN--GLTP------------------------- 209 (378) T ss_pred eCCc-----------CCH-------HHHHHHHHHHHHHHHHhhcccccc--ccee------------------------- Confidence 1100 000 001111111111111110000000 0111 Q ss_pred ccccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHH Q lcl|NC_016654. 333 VGSGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARH 412 (533) Q Consensus 333 ~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~ 412 (533) .+.+ ..++.++.+....+ ++.++...++|+...|+||+.++. + ..+.. .... T Consensus 210 -----l~~g----~~~~~l~~~~~~~~-~~~~~~~~~~Ia~~fgVPp~~l~~--s----~~e~~------------~~~f 261 (378) T protein:vir:85 210 -----VDNK----TEIVELKKDYSVLN-KDEIELIKSELLTGYFMNENILLG--T----ATQEQ------------QIYF 261 (378) T ss_pred -----cCCC----ceEEeccCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC--C----chHHH------------HHHH Confidence 1111 11222222222222 344555567899999999988841 1 11111 1123 Q ss_pred HHHHHHHHHHHHHHHHHhhcc-------CCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC Q lcl|NC_016654. 413 FGSALGPLSTTCLRVDAIKFP-------GKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWD 485 (533) Q Consensus 413 ~~~al~~li~~il~l~~~~~~-------~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~ 485 (533) +..+|..++..+..-.+..+. +.......++.++++.-.-.|..+.++.+.+++.+|+|+.-++.+++ ++. T Consensus 262 ~~~tL~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~l--gl~ 339 (378) T protein:vir:85 262 YNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKM--GEQ 339 (378) T ss_pred HHHHHHHHHHHHHHHHHhhcCChhhhhhhhhccccceeeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh--CCC Confidence 444455544444322222211 11111123355556666778999999999999999999999977654 222 Q ss_pred HHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 486 DERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 486 dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) .-+--.++ .+.. + ..+.-..+..++.. .++.+.++++.| T Consensus 340 p~~gGD~~-~~~~-N--~~~~~~~~~~~~~~-----~~~~~~~e~~n~ 378 (378) T protein:vir:85 340 PIEGGDIY-IANL-N--AVAVKNLSDLQGSR-----KDVASTDETNNQ 378 (378) T ss_pred CCCCCCeE-eecc-c--ccccccchhhcCcc-----CCCCCCCCCCCC Confidence 10000000 0000 0 00000000001100 001111111122 No 265 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=46.49 E-value=0.73 Score=21.32 Aligned_cols=421 Identities=11% Similarity=0.019 Sum_probs=157.0 Q ss_pred CCCC--CCcCCCcCcch-HHHHHHH-HhhhHhh----cC-CHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCC Q lcl|NC_016654. 1 MSLP--EANTAWPPPEL-AAVTARV-AESHVWW----EG-DLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATG 71 (533) Q Consensus 1 ~~~~--~~~~~~pp~~~-~~~~~~~-~~~~~w~----~g-d~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 71 (533) |+== --|.|-+=+.+ .+....+ ..|+.|- +| +|.+|....+....--...+..++. .+. T Consensus 1 m~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~gd~~~~~~L~~----dm~-------- 68 (512) T protein:vir:19 1 MGRILDISGQPFDFDDEMQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAERGDLTAQADLAF----DME-------- 68 (512) T ss_pred CcceeCCCCCccccccccccccchhcccchhhccccccCCCHHHHHHHHHHhhCCCHHHHHHHHH----HHH-------- Confidence 2210 11111111110 0000111 1233332 11 2556655544332211111111111 000 Q ss_pred cccceeecChHHHHHHHHHHhhcCCCceEeeCCC----chHHHHHHHHHHhhc-cHHHHHHHHHHHHhhhCCEEEEEEEc Q lcl|NC_016654. 72 RAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGK----SKEVQARADLIFNTP-RFHSSLVEAGESCSALSGSFQRIVWD 146 (533) Q Consensus 72 ~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~----~~~~~~~l~~i~~~n-~f~~~~~~~~~~~~~~G~~~~~~~~D 146 (533) .+ ..-+... ..+--.-+.+-+-.|.-..+ +....+.+++.+.+- .|...+..++ .|..+|-.++-+.|. T Consensus 69 ~~----D~hi~s~-l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~~~~ll-dA~~~G~s~~Ei~w~ 142 (512) T protein:vir:19 69 EK----DTHLFSE-LSKRRLAIQALEWRIAPARDASAQEKKDADMLNEYLHDAAWFEDALFDAG-DAILKGYSMQEIEWG 142 (512) T ss_pred hh----ChHHHHH-HHHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcCCCHHHHHHHHH-hhhhhcceeeeeEee Confidence 00 1112222 23333445565555543222 224456677777653 4777777654 688889999888786 Q ss_pred CCCCC-ce-EEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccc Q lcl|NC_016654. 147 PTIAD-NA-WIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPA 224 (533) Q Consensus 147 ~~~~~-~~-~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~ 224 (533) ..++. .| .|.++++..|... ..+. ..+. ++. +..-|.++ T Consensus 143 ~~~g~~~~~~~~~r~~~~f~~~----------------~~~~------------~~lr---~~~-~~~~G~~l------- 183 (512) T protein:vir:19 143 WLGKMRVPVALHHRDPALFCAN----------------PDNL------------NELR---LRD-ASYHGLEL------- 183 (512) T ss_pred eeCCceeeeeeeeeccccceec----------------cCCC------------cEEE---ecC-CCCCceee------- Confidence 43221 11 1223333222110 0000 0010 110 00001110 Q ss_pred cccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHH-hCcc Q lcl|NC_016654. 225 TRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFR-IGAG 303 (533) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~-~~~~ 303 (533) .+.-|++++.... ...|+|.+.+..+. ...--=...+..|+.=++ .|.+ T Consensus 184 --------------------~~~k~i~~~~~~~---------~g~p~g~gLlr~~~-w~~~fK~~~~~~w~~f~E~yG~P 233 (512) T protein:vir:19 184 --------------------QPFGWFMHRAKSR---------TGYVGTNGLVRTLI-WPFIFKNYSVRDFAEFLEIYGLP 233 (512) T ss_pred --------------------cCCceEEEeccCC---------CCCcccccHHHHHH-HHHHHHHHHHHHHHHHHHHcCCC Confidence 0111344443221 13566777776543 222122344444444443 3433 Q ss_pred eeeechHHhcCCCCccccccCcchhhhhhcccccccccc--ccccceeeechh-hhhHHHHHHHHHHHHHHHHhhCCChh Q lcl|NC_016654. 304 KVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG--DMETIFEFFQPA-IRVLEHDQGAALLLREVLRKTGYSPV 380 (533) Q Consensus 304 ~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~--~~~~~i~~~~~~-ir~e~~~~~l~~~l~~i~~~~g~s~~ 380 (533) .++. .+..+.. -+....+..++..-..++++ .....|+..+.. ...+.|...++.+-++|+..+ ++ + T Consensus 234 ~~ig------ky~~~a~--~~ek~~L~~al~~~~~~a~~iiP~~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~i-LG-q 303 (512) T protein:vir:19 234 MRVG------KYPTGST--NREKATLMQAVMDIGRRAGGIIPMGMTLDFQSAADGQSDPFMAMIGWAEKAISKAI-LG-G 303 (512) T ss_pred eeEE------ecCCCCC--HHHHHHHHHHHHHHhhCcEEEecCCceEEEeecCCCCHHHHHHHHHHHHHHHHHHH-hh-h Confidence 2221 1111110 11111222222211111111 111234444322 223345555555555555443 22 2 Q ss_pred hcccC--CCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHH Q lcl|NC_016654. 381 SLGLS--DEVAQTATEASGKKDLTVKTTRAKARHFGSALG-PLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLA 457 (533) Q Consensus 381 ~~g~~--~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~-~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e 457 (533) |++.+ .+|..+..++...- ....++.-.+.+...|. +|++.++.+. + +........+.+.|+..-++|... T Consensus 304 tlTs~~g~~Gs~a~~~vh~ev--~~di~~aDa~~i~~tln~~li~~l~~~N---~-~~~~~~~~~p~~~f~~~e~eDl~~ 377 (512) T protein:vir:19 304 TLTTEAGDKGARSLGEVHDEV--RREIRNADVGQLARSINRDLIYPLLALN---S-DSTIDINRLPGIVFDTSEAGDITA 377 (512) T ss_pred hhcccccccchhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHhC---C-CCCCCccccceEEecCCChhhHHH Confidence 22212 22222233332222 33334455666777774 6777766541 1 111222346788999999999999 Q ss_pred HHHHHHHHHhCCC-CCHHHHHHHhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 458 KAQTVQAWSVASA-ASTKTKVAYLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 458 ~a~~~~~l~~aGi-~S~et~v~~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) .++.+.++. .|+ +|.+ .+++.++ ..+.+..+.+... +. ..++.+ ............+. +++.+..++ T Consensus 378 ~a~~~~~l~-~G~~i~~~-~i~e~~G-ip~~~~~e~~~~~---~~-~~~~~~-~~~~~~~~~~~~~~-~~~~d~~~~ 445 (512) T protein:vir:19 378 LSDAIPKLA-AGMRIPVS-WIQEKLH-IPQPVGDEAVFTI---QP-VVPDNG-SQKEAALSAEDIPQ-EDDIDRMGV 445 (512) T ss_pred HHHHHHHHh-cCCCCCHH-HHHHHhC-CCCCCCccccccC---CC-cccccc-ccccccccccCCCc-hhhHhHHhh Confidence 999999886 787 5555 4655554 4322111111110 00 001000 00000000000000 000000011 No 266 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=41.73 E-value=0.91 Score=20.80 Aligned_cols=388 Identities=9% Similarity=-0.056 Sum_probs=123.1 Q ss_pred cCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHH Q lcl|NC_016654. 30 EGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEV 109 (533) Q Consensus 30 ~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~ 109 (533) .|=.+++ ............ ++....... ....+...--..+++.+|+-+-+=|..+.-.+..... T Consensus 1 Mgl~d~~---~~~~~~~~~~~~----------~~~~~~~~~--~~~~l~~~~v~~~i~~Ia~~ia~lp~~v~~~~~~~~~ 65 (395) T protein:vir:96 1 MGILDFF---SFKKSGTLSDDD----------SGSTTSEKL--TNVVLKEDALYKCVNYLARIISKSTFRIKAPEKLTEN 65 (395) T ss_pred Ccchhhh---cCCCCccccccc----------cccchhhhc--chhhhhhHHHHHHHHHHHHhhccceeEEEeCCccccc Confidence 5643332 111111100000 000000000 0001111122234455555554445444322222222 Q ss_pred HHHHHHHHhh--cc---HHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeec Q lcl|NC_016654. 110 QARADLIFNT--PR---FHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGG 184 (533) Q Consensus 110 ~~~l~~i~~~--n~---f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~ 184 (533) ...+..+|+. |. ...-....+...+..|.+|+.+..|..+ +.++.+...+ .+....+.. +... T Consensus 66 ~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~~~~--------~~~~~~~~~~---~~~~~~~~~-v~~~ 133 (395) T protein:vir:96 66 QKDWLYWINTKANPNQSASQFWVEVVQKLLVDGETLIFVIPGKGI--------YVADAFTQDK---KLSGNKFKV-SRVQ 133 (395) T ss_pred cchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEcCCce--------ecCCcccccc---ccccceeee-eeec Confidence 2334445432 21 1222334455556668888776655321 1111111110 000001111 1100 Q ss_pred CCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccc Q lcl|NC_016654. 185 DGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRH 264 (533) Q Consensus 185 ~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~ 264 (533) + |+.-..+....|.|-.+..........-.+................ .+..+...-.+.+.. T Consensus 134 ~----~~~~~~~~~~dvih~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~------~~~~~~~~~~~~~~~-------- 195 (395) T protein:vir:96 134 G----QTYEKIFTFDQVIYLKNDNSDLMLKVESLWEEYGELLGHVINNQKI------ANQIRFTMTPPKDKV-------- 195 (395) T ss_pred c----ceeeeEeccCceEEecccCCccccccccccchHHHHHHHHHHHHHH------HHHHHHHhhhccccc-------- Confidence 1 1000011222233311111000000000000000000000000000 000000000000000 Q ss_pred cccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCCccccccCcchhhhhhcccccccccccc Q lcl|NC_016654. 265 DPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANGDM 344 (533) Q Consensus 265 ~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 344 (533) .+-|.....+.. .-+.....+.++......+..++.+ +. ..-.|..+.....+.+ T Consensus 196 ----~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~v~~----l~------------~g~~~~~l~~~~~d~q--- 250 (395) T protein:vir:96 196 ----RERAQENSDGGR--QPKSDKDFFKRTIEKIRTESVVGIP----VT------------ANTNYEEYGSKNTGSV--- 250 (395) T ss_pred ----ccceeeccCchh--hHHHHHHHHHHHHHHhhcCCcceEE----cc------------CCceeEecccChhhhh--- Confidence 000000000000 0112222222222222222222222 00 0001111111100000 Q ss_pred ccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 345 ETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTC 424 (533) Q Consensus 345 ~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~i 424 (533) -+-...+.+......++|+...|+||..++.+.+ +..+. ....++.+|..++..+ T Consensus 251 ---------~~e~~~~~~~~~~~~~eIa~~fgVPp~~l~~~~s---n~e~~-------------~~~f~~~~L~P~~~~i 305 (395) T protein:vir:96 251 ---------KSYVDDIKKLKDQYMAEFAEMLGIPISLLHGDIA---DNQKN-------------YELLLEGPIESLITNI 305 (395) T ss_pred ---------hhhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCc---cHHHH-------------HHHHHHHHHHHHHHHH Confidence 0112234444556678899999999999862211 22221 2234444555555444 Q ss_pred HHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhC-CCCCHHHHHHHHHHHHHhhhcc Q lcl|NC_016654. 425 LRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLH-EDWDDERVQEEADLIDNANTVS 503 (533) Q Consensus 425 l~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~-~~~~dee~~~El~rI~~E~~~~ 503 (533) ..-.+..+..... -...+.++|+.-+..|..+.++.+.+++.+|+|+..++.+.+. |..++.+. T Consensus 306 e~~l~~~Ll~~~e-~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~~~~g-------------- 370 (395) T protein:vir:96 306 VDGLEYAIFDKSE-TLEGSFIKVTGLKNYDLFSISSQADKLISSGFVFIDEVREEIGLPELPDGLG-------------- 370 (395) T ss_pred HHHHHhhcCChhh-hcCceeEeecchhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCC-------------- Confidence 3322222221111 1123457788888889999999999999999999998766541 11111111 Q ss_pred cCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 504 APTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 504 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) +..-...+..+.. +.+|+ .+++.| T Consensus 371 -D~~~~~~N~~~~~---~~gge--~~~~~~ 394 (395) T protein:vir:96 371 -KVLYMTKNYESVL---ERGGE--VDEEVE 394 (395) T ss_pred -ceeeecccceech---hccCC--CCCCCC Confidence 0000011111111 11121 111111 No 267 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=40.84 E-value=0.95 Score=20.70 Aligned_cols=347 Identities=10% Similarity=-0.027 Sum_probs=114.2 Q ss_pred cCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccC-CCCCcccceeecChH--HHHHHHHHHhhcCCCceEee-CCC Q lcl|NC_016654. 30 EGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTP-TATGRAPKRYHAPIP--GVIAKLSTTELFSEQLKFLD-AGK 105 (533) Q Consensus 30 ~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~~~~~~~n~~--k~i~~~~a~ll~~e~~~i~~-~~~ 105 (533) .|=...+..|-.. +..... ....-....+.+|.+ ..+++.+|+-+.+-|..+-- ... T Consensus 1 Mg~f~~~~~f~~~-------------------~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~ 61 (378) T protein:vir:93 1 MNLFGKVVSFSRG-------------------KLNNDTQRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKS 61 (378) T ss_pred Cccchhhhhhhcc-------------------ccCCCcceeeecccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEccc Confidence 2211111111000 000000 000000111122222 23345555555544543311 000 Q ss_pred ----c---hHHHHHHHHHHhh--cc---HHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCce Q lcl|NC_016654. 106 ----S---KEVQARADLIFNT--PR---FHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLV 173 (533) Q Consensus 106 ----~---~~~~~~l~~i~~~--n~---f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~ 173 (533) . .....-|..+|+. |. ...-....+...+..|.+|+.+..|... +++.. .+|. ++. T Consensus 62 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~-g~~~~-------l~~~--~~~-- 129 (378) T protein:vir:93 62 DVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNT-GELLD-------LLFA--DDK-- 129 (378) T ss_pred ccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCC-ceEEE-------EEec--CCe-- Confidence 0 0112234455542 21 1123333445566678888776655321 11111 0110 000 Q ss_pred EEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEe Q lcl|NC_016654. 174 AVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYV 253 (533) Q Consensus 174 ~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 253 (533) ..| ....|. |+ T Consensus 130 --------------~~~------~~~dii-------------------------------------------------h~ 140 (378) T protein:vir:93 130 --------------KEY------KTEELV-------------------------------------------------RL 140 (378) T ss_pred --------------eEe------ccceeE-------------------------------------------------Ee Confidence 000 000011 11 Q ss_pred cCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeec-hHHhcCCCCccccccCcchhhhhh Q lcl|NC_016654. 254 PNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHAS-ESVLTNLGMGQGVSLDEEQEVYSR 332 (533) Q Consensus 254 pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~-~~~l~~~~~~~~~~~d~~~~~~~~ 332 (533) .+.. ...-|.|.+. .+..+++..+ +.+..+-++. ...+.... .....+.....|.. T Consensus 141 r~~~----------~~~~~~s~l~----~~~~~i~~~~-------~~~~~~g~l~~~~~l~~~~--~~~~~~~~~~~~~~ 197 (378) T protein:vir:93 141 TSPF----------YINEDTSILD----NALASIQTKL-------EQGKLRGLLKINAFLDIDN--TQEYREKALTTIKN 197 (378) T ss_pred cCcc----------ccchhhHHHH----HHHHHHHHHH-------hcCcccceeeeCCcCCHHH--HHHHHHHHHHHHHH Confidence 0000 0000111111 1111221111 1111111100 00000000 00000000000100 Q ss_pred cc-------ccccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHH Q lcl|NC_016654. 333 VG-------SGGFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKT 405 (533) Q Consensus 333 ~~-------~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~ 405 (533) .. ....+. ...++.++.+..+.+ +..++...++|+...|+||..++ + |+.+.. T Consensus 198 ~~~~~~~~~~~~l~~----g~~~~~l~~~~~~~~-~~~~~~~~~~Ia~~fgVPp~~l~----g--~~~e~~--------- 257 (378) T protein:vir:93 198 MQEGSSYNGLTPVDN----KTEIVELKKDYSVLN-KDEIDLIKSELLTGYFMNENILL----G--TATQEQ--------- 257 (378) T ss_pred hhcccccccceEcCC----CceEEEccCChhhhh-HHHHHHHHHHHHHHhCCCHHHhc----C--CcHHHH--------- Confidence 00 000111 112334443333333 35556667899999999998873 1 111211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccC-------CCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHH Q lcl|NC_016654. 406 TRAKARHFGSALGPLSTTCLRVDAIKFPG-------KGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVA 478 (533) Q Consensus 406 ~~~~~~~~~~al~~li~~il~l~~~~~~~-------~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~ 478 (533) ....+..+|..+++.+..-.+..+.. ........+.++++.-...|..+.++.+.+++.+|+|+..++.+ T Consensus 258 ---~~~f~~~tl~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~ 334 (378) T protein:vir:93 258 ---QIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLV 334 (378) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHH Confidence 12234445555554443322222211 11111234666777778899999999999999999999999766 Q ss_pred HhCCCCCHHHHHHHHHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 479 YLHEDWDDERVQEEADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEGE 533 (533) Q Consensus 479 ~l~~~~~dee~~~El~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 533 (533) ++ ++..-+--.++- +.. +. .| .+...+..+. ..+..+.++++.| T Consensus 335 ~~--gl~p~~ggD~~~-~~~-n~--~~-~~~~~~~~~~----~~~~~~~~e~~n~ 378 (378) T protein:vir:93 335 KM--GEQPIEGGDVYI-ANL-NA--VA-VKNLSDLQGS----RKDVTSTDETNNQ 378 (378) T ss_pred Hh--CCCCCCCCCeee-ecc-cc--cc-ccchhhhcCc----cCCCCCCCCCCCC Confidence 54 222200000000 000 00 00 0000000000 1111111222222 No 268 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=37.20 E-value=1.1 Score=20.29 Aligned_cols=311 Identities=11% Similarity=0.039 Sum_probs=111.8 Q ss_pred HHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHH--HHHHHhhcCCCceEeeCC--------- Q lcl|NC_016654. 36 LATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIA--KLSTTELFSEQLKFLDAG--------- 104 (533) Q Consensus 36 l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~--~~~a~ll~~e~~~i~~~~--------- 104 (533) ..+-++.+.+. . ....+.+... +.+...... .....+-||+|..+.... T Consensus 1 m~~~~~~~~~~---~------------~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~fg~p~~~~~~~~~~~~~~~~ 60 (368) T protein:vir:79 1 MSRNKTRRAAR---A------------ASAHVRTANT-----DAPTEHHTDRAAQAEVFSFGDPVEVLDRRELLDYVECM 60 (368) T ss_pred CCccccccchh---c------------cCcccccccc-----cCcchhhccccCceEEEEcCCceeecchhhHHHHHHHH Confidence 11111100000 0 0000000000 000000000 000012222221111000 Q ss_pred -----------------------C-chHH---HHHHHHHHhhccH--HHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEE Q lcl|NC_016654. 105 -----------------------K-SKEV---QARADLIFNTPRF--HSSLVEAGESCSALSGSFQRIVWDPTIADNAWI 155 (533) Q Consensus 105 -----------------------~-~~~~---~~~l~~i~~~n~f--~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i 155 (533) . .... ...+.-++.-|.+ ...+.+++...+..|.+|+.+..|..|. -+.+ T Consensus 61 ~~~~~~~~pi~~~~la~~~~~~~~h~~~~~~~~n~l~l~~~Pn~~~t~~~f~~l~~d~ll~Gnay~~~~r~~~G~-~~~L 139 (368) T protein:vir:79 61 RMGQWYEPPMPWDGLARSFRAAAHHSSAVYVKRNILVSTFIPHPLLSRATFERLVLDWQVFGNAYLERRENVLGG-TIRL 139 (368) T ss_pred hccchhccCcCHHHHHHHHhhccccchhhhhhcchhhhhcCCCcCCCHHHHHHHHHHHhhcCCeEEEEEEcCCCC-EEEE Confidence 0 0000 0001111122211 1234555556666788877776665432 2344 Q ss_pred EEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccccccc Q lcl|NC_016654. 156 DFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADE 235 (533) Q Consensus 156 ~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~ 235 (533) ..+++..+-..- +.+.+|.. . +. |..+.+ T Consensus 140 ~~l~~~~v~~~~-----------------~~~~~~~~-------------~-~~----~~~~~~---------------- 168 (368) T protein:vir:79 140 DTPLAKYVRRGL-----------------DLNTYFFV-------------Q-NW----QQPYTF---------------- 168 (368) T ss_pred EEeCcccceeec-----------------cCCEEEEE-------------e-cC----CeEEEE---------------- Confidence 444444332111 11111100 0 00 000000 Q ss_pred CCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHH-HhCcceee---echHH Q lcl|NC_016654. 236 GRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDF-RIGAGKVH---ASESV 311 (533) Q Consensus 236 ~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~~~~i~---v~~~~ 311 (533) . .--+.|+.+..++. ..+|.|.+..++. .+ .++..-+.|.+.+ +.|...=+ ++... T Consensus 169 ------~----~~dIihir~~~~~~--------~~yGlsp~~~a~~-si-~l~~aa~~~~~~~~~NGa~~~gil~~~~~~ 228 (368) T protein:vir:79 169 ------A----AGSVFHLQEPDINQ--------EVYGLPEYLSALN-AT-WLNESATLFRRRYYKNGSHAGFILYMTDAA 228 (368) T ss_pred ------c----cccEEEecCCCCCC--------CcccccHHHHHHH-HH-HHHHHHHHHHHHHHhccCCCceEEEeCCCC Confidence 0 00133444322221 3368888877653 33 3566666676665 55432222 22111 Q ss_pred hcCCCCccccccCcchhhhhhccc-------ccccc-ccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcc Q lcl|NC_016654. 312 LTNLGMGQGVSLDEEQEVYSRVGS-------GGFNA-NGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLG 383 (533) Q Consensus 312 l~~~~~~~~~~~d~~~~~~~~~~~-------~~~~~-~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g 383 (533) +.. . ..+.-.+.+..... ..... +..+...++.++......++.+..+...++|+...|+||..+| T Consensus 229 l~~---e---~~~~lk~~~~~~~G~~N~g~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llG 302 (368) T protein:vir:79 229 QKQ---E---DVDTLREAMKSAKGPGNFRNLFMYAPNGKKDGIQLLPVSEVAAKDEFWNIKNVTRDDQLAAHRVPPQLMG 302 (368) T ss_pred CCH---H---HHHHHHHHHHHhcCCcccCceeEecCCCCccceeEEEcCCCHHHHHHHHHHHHhHHHHHHHhCCCHHHcc Confidence 110 0 00000111111100 00011 1122234555566667788888888889999999999999998 Q ss_pred cCCCcc---hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCC--CHHHH Q lcl|NC_016654. 384 LSDEVA---QTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARE--SDLAK 458 (533) Q Consensus 384 ~~~~~~---~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~--d~~e~ 458 (533) +..++. .+.++. .+..++..|.-++..+.++... + + .+ .+.|+..... |..+. T Consensus 303 i~~~~t~~~sn~e~~-------------~~~f~~~~l~Pl~~~ie~ln~~-l-~-----~e--~~rF~~~~l~~~D~~a~ 360 (368) T protein:vir:79 303 IIPNNTGGFGDVEKA-------------AMVFARNEVKPLQDRLLAINDW-I-G-----DE--VVRFAPYALGGHDQPAA 360 (368) T ss_pred ccCCCCCccccHHHH-------------HHHHHHHHHHHHHHHHHHHHhc-c-C-----cc--eeeechhHhhccccccc Confidence 744332 222222 2233444455555544443221 1 1 11 2456654332 22233 Q ss_pred HHHHHHHHhC Q lcl|NC_016654. 459 AQTVQAWSVA 468 (533) Q Consensus 459 a~~~~~l~~a 468 (533) +. ...++| T Consensus 361 a~--~~~rsa 368 (368) T protein:vir:79 361 AP--GGQRSA 368 (368) T ss_pred CC--cccccC Confidence 22 223344 No 269 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=33.43 E-value=1.4 Score=19.86 Aligned_cols=317 Identities=11% Similarity=0.053 Sum_probs=119.1 Q ss_pred cCcchHHHHHHHHhhhHhhcCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHH- Q lcl|NC_016654. 11 PPPELAAVTARVAESHVWWEGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLS- 89 (533) Q Consensus 11 pp~~~~~~~~~~~~~~~w~~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~- 89 (533) -|..-.|.-+..+.+..-+.-..-...+-.+... .-+.-.+.....+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------------------------~~~~~~~~~~~~~~~ 49 (376) T protein:vir:10 1 MPARDRPRAARRRRHSFIFIHGVLRMSKRRSRAP-------------------------------RTFAAAPNPSAGSAA 49 (376) T ss_pred CCCCccchhhhhhcccchhhcccccchhccCCCc-------------------------------ccchhhhhHhhhccC Confidence 1111122222222222211110000000000000 00000000000000 Q ss_pred ----HHhhcCCCceEeeCC----------Cch----HH-HHHHHH------------HHhhcc----H-------HHHHH Q lcl|NC_016654. 90 ----TTELFSEQLKFLDAG----------KSK----EV-QARADL------------IFNTPR----F-------HSSLV 127 (533) Q Consensus 90 ----a~ll~~e~~~i~~~~----------~~~----~~-~~~l~~------------i~~~n~----f-------~~~~~ 127 (533) ..+-||+|..+.... .++ +. ...|.+ .++.|. | ...+. T Consensus 50 ~~~~~~f~fg~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~La~~~~~~~~h~s~l~~k~n~l~~~~~Pnp~lT~~~f~ 129 (376) T protein:vir:10 50 PARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSSALFFKANVLASTFRPHRWLSRHAFE 129 (376) T ss_pred cceeEEEEcCCceeccCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhHHhhhhHHHHhHHHHhccCCCCCCCHHHHH Confidence 122333331110000 000 00 000001 011111 1 23456 Q ss_pred HHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEe Q lcl|NC_016654. 128 EAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYK 207 (533) Q Consensus 128 ~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~ 207 (533) +++...+.+|.+|+.+..|..|. -+.+..++|.++-+..+.++ .+|+. .+ T Consensus 130 ~~v~d~ll~Gnay~~~~rn~~G~-~~~L~pl~~~~vr~~~d~~~----------------~~~~~-----~~-------- 179 (376) T protein:vir:10 130 RWALDFLTFGNGYLERRRNMVGG-TLRLEPALAKYVRRKADFNG----------------FVYVN-----GW-------- 179 (376) T ss_pred HHHHHHHhcCCeEEEEEECCCCC-EEEEEEeCCcceEEEeeCCe----------------EEEEE-----cC-------- Confidence 66677778899998887776543 35566666665544322221 01100 00 Q ss_pred ccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHH Q lcl|NC_016654. 208 GTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHEL 287 (533) Q Consensus 208 ~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~l 287 (533) |..+.+. .--+.|+.+.-++ ...+|.|.+..++.. + .+ T Consensus 180 ------~~~~~~~--------------------------~~eViHir~~~~~--------~~~yGls~~~~a~~s-i-~l 217 (376) T protein:vir:10 180 ------QERHEFE--------------------------PDSVFQLVRPDIN--------QEVYGLPEYLSSLHS-A-WL 217 (376) T ss_pred ------CeEEEEc--------------------------cccEEEecCCCCC--------CCcccccHHHHHHHH-H-HH Confidence 0111000 0012333322111 234688888876643 3 46 Q ss_pred HHHHHHHHHHH-HhCcce---eeechHHhcCCCCccccccCcchhhhhh---------ccccccccccccccceeeechh Q lcl|NC_016654. 288 DRIYSSLMRDF-RIGAGK---VHASESVLTNLGMGQGVSLDEEQEVYSR---------VGSGGFNANGDMETIFEFFQPA 354 (533) Q Consensus 288 D~~~s~~~~~~-~~~~~~---i~v~~~~l~~~~~~~~~~~d~~~~~~~~---------~~~~~~~~~~~~~~~i~~~~~~ 354 (533) +..-+.+.+.+ +.|... |+++...++.. . .+.-.+.+.. +.+... .+.++...++.++.. T Consensus 218 ~~aa~~f~~~~f~NGa~pggIl~~~d~~l~~e---~---~~~lr~~~~~~~G~~N~~~~~vl~~-~g~~~Gi~~~pls~~ 290 (376) T protein:vir:10 218 NESSTLFRRKYYENGSHAGFILYMTDAAQKQD---D---VDNMRDALKNAKGPGNFRNVFMYAP-GGKKDGIQLIPVSEV 290 (376) T ss_pred HHHHHHHHHHHHhccCCCceEEEecCCCCCHH---H---HHHHHHHHHHhcCccccCceeEecC-CCCccceEEEEccCC Confidence 67777776665 554322 12221111110 0 0000011111 111100 111122234445555 Q ss_pred hhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcc---hhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_016654. 355 IRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVA---QTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIK 431 (533) Q Consensus 355 ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~---~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~ 431 (533) ....++.+.-+...+.|+...|++|..+|+..++. .++++.. +..++..|.-++..+.++.+ . T Consensus 291 ~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~eq~~-------------~~f~~~~L~Pl~~~ieeln~-~ 356 (376) T protein:vir:10 291 AAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAA-------------RVFGRNEIRPLQARFAELND-W 356 (376) T ss_pred HHHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHH-------------HHHHHHHHHHHHHHHHHHHh-h Confidence 57778888888889999999999999998643322 2333221 22333334444444433322 1 Q ss_pred ccCCCCCCceeEEEEeCCCCCCCHHHHH Q lcl|NC_016654. 432 FPGKGAAPSEELELEWPKFARESDLAKA 459 (533) Q Consensus 432 ~~~~~~~~~~~v~i~f~d~i~~d~~e~a 459 (533) + + .+ -|.|++.....-++.+ T Consensus 357 L-~------~~-~~~F~~~~Llr~d~ka 376 (376) T protein:vir:10 357 L-G------EE-VVRFDDYEIPPAPVAA 376 (376) T ss_pred c-c------cc-ccccChhHhhcccccC Confidence 1 1 11 1567665443333333 No 270 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=23.86 E-value=2.2 Score=18.65 Aligned_cols=387 Identities=8% Similarity=-0.052 Sum_probs=130.0 Q ss_pred cCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHH Q lcl|NC_016654. 30 EGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEV 109 (533) Q Consensus 30 ~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~ 109 (533) .|=.++|. ..........+.. ..+ ... .....+...--..+++.+|+-+.+=|..+--.+..... T Consensus 1 MGlf~~~~---~~~~~~~~~~~~~------~~~----~~~--~~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~~~~~ 65 (395) T protein:vir:98 1 MGILDFFS---FKKSGTLSDDDSG------STT----SEK--LTNVVLKEDALYKCVNYLARIISKSTFRLKTPEKLTEN 65 (395) T ss_pred Ccchhhhc---CCCcccccccccc------hhh----hhh--cchhhhhhHHHHHHHHHHHHHHhhCceeEEecCCcccc Confidence 56433331 1111100000000 000 000 00001112222234455555555445443222221111 Q ss_pred HHHHHHHHhh--cc---HHHHHHHHHHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeec Q lcl|NC_016654. 110 QARADLIFNT--PR---FHSSLVEAGESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGG 184 (533) Q Consensus 110 ~~~l~~i~~~--n~---f~~~~~~~~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~ 184 (533) ..-+..+|+. |. ...-....+...+..|.+|+.+..|.. . ++ ++.+.... .+....+... ... T Consensus 66 ~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~~----~---~~-~~~~~~~~---~~~~~~~~~~-~~~ 133 (395) T protein:vir:98 66 QKDWLYWINTKANPNQSASQFWVEVIQKLLVDGETLIFVIPGKG----I---YV-ADSFTQDK---KISGSQFKVS-RVQ 133 (395) T ss_pred cchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeCCc----e---ec-CCcccccc---cccCccccee-eec Confidence 2224444432 21 122234445556667888887765531 1 11 22111111 1111111111 111 Q ss_pred CCceEEEEEEEecCeeEEEEEEeccCCcccceeehhh-ccccccccccccccC-CceeecCCCccceeEEecCCcccccc Q lcl|NC_016654. 185 DGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTD-HPATRDIAVEGADEG-RGAYVETGVKDLTAAYVPNVTPNPEW 262 (533) Q Consensus 185 ~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~-~~~~~~~~~~~~~~~-~~~~~~~g~~~~~~~~~pn~~~~~~~ 262 (533) +. .+. . .+....|.|--|...+...+. .++.. ............... ......++....+....+ T Consensus 134 ~~--~~~-~-~~~~~evih~k~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------- 200 (395) T protein:vir:98 134 GQ--TYE-K-TFTFDQVIYLKNDNSDLMSKV-ESLWEEYGELLGHVINNQKIANQIRFTMIPPKDKVRERAQ-------- 200 (395) T ss_pred Cc--eee-e-EecCccEEEecCCCCCccccc-cchhhhHHHHHHHHHHHHHHHHHHHHhhcccccccccccc-------- Confidence 10 010 0 112223333222111111000 00000 000000000000000 000000111100000000 Q ss_pred cccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcCCCCccccccCcchhhhhhcccccccccc Q lcl|NC_016654. 263 RHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDFRIGAGKVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG 342 (533) Q Consensus 263 ~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 342 (533) .. ... ....+...+.+.++...+..++.++++ +. ..-.|..+.....+ T Consensus 201 ---------~~--~~~--~~~~~~~~~~~~~~~~~~~~~~~~v~~----l~------------~g~~~~~l~~~~~~--- 248 (395) T protein:vir:98 201 ---------EN--SDG--GRQSKSDKDFFKRTVEKIRTESVVGIP----VT------------ANTNYEEYGSKNTG--- 248 (395) T ss_pred ---------cc--CCc--HHHHHHHHHHHHHHHhhhhcCCcceee----cC------------CCceeEeccccccc--- Confidence 00 000 011122223333332222223333322 10 00011111111000 Q ss_pred ccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016654. 343 DMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARHFGSALGPLST 422 (533) Q Consensus 343 ~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~ 422 (533) .....++++.+..+...++|+...|+||+.++.+.+ +.++. ....++..|.-++. T Consensus 249 ---------~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l~~~~s---n~e~~-------------~~~f~~~tl~P~~~ 303 (395) T protein:vir:98 249 ---------AVKSYVDDIKKLKDQYMAEFAEMLGIPISLLHGDIA---DNQKN-------------YELLLEGPIESLIT 303 (395) T ss_pred ---------ccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCcc---cHHHH-------------HHHHHHHHHHHHHH Confidence 012334577777788889999999999999862211 11111 12233344444443 Q ss_pred HHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHh-CCCCCHHHHHHHHHHHHHhhh Q lcl|NC_016654. 423 TCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYL-HEDWDDERVQEEADLIDNANT 501 (533) Q Consensus 423 ~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l-~~~~~dee~~~El~rI~~E~~ 501 (533) .+-.-.+..+..... ....+.|+|+.-...|..++++.+.+++..|+|+..++.+.+ .|.++++..++ T Consensus 304 ~ie~~l~~kll~~~~-~~~g~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~Pi~~~~gD~---------- 372 (395) T protein:vir:98 304 NIVDGLEYAIFDKSE-TLQGSFIKVTGLKNYDLFSISNQADKLISSGFVFIDEVREEIGLPELPDGLGKV---------- 372 (395) T ss_pred HHHHHHHHhcCChhh-hcCcceeeehhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCce---------- Confidence 332222222211111 122345788888889999999999999999999999977664 11122211110 Q ss_pred cccCccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 502 VSAPTFGFGTDQPPLPTENDPATDPEAVDEG 532 (533) Q Consensus 502 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 532 (533) .-...+..|..+ .+|++.++++- T Consensus 373 -----~~~~~n~~~~~~---~gge~~~~~~~ 395 (395) T protein:vir:98 373 -----LYMTKNYESVLE---RGGEVDEEVET 395 (395) T ss_pred -----eeecccceeccc---ccCCCCCCCCC Confidence 000111111111 11111111111 No 271 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=21.53 E-value=2.6 Score=18.32 Aligned_cols=359 Identities=10% Similarity=-0.007 Sum_probs=120.6 Q ss_pred cCCHHHHHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHHHHHhhcCCCceEeeCCCchHH Q lcl|NC_016654. 30 EGDLDKLATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKLSTTELFSEQLKFLDAGKSKEV 109 (533) Q Consensus 30 ~gd~~~l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~~a~ll~~e~~~i~~~~~~~~~ 109 (533) +|=.++| ++.+.. ....+. . ..+.. .+ ....+...--..+++.+|+-+..-|..+- ...... T Consensus 1 Mg~f~~l---~~~~~~-~~~~~~------~-~~~~~-~~----~~~~l~~~~v~~~i~~Ia~~ia~~p~~~~--~~~~~~ 62 (376) T protein:vir:78 1 MGFFSEL---FKRNKE-IEWMWD------L-DFLED-KT----TKVYLKKMALNTCVKHIARTIAKSDFRLK--NGETSV 62 (376) T ss_pred Cchhhhh---hccCCc-cccccc------h-hhccc-cc----hhhhhhhHHHHHHHHHHHHhhcccceeec--cccccc Confidence 5533332 111110 000000 0 00000 00 01112222334445555554444443332 111112 Q ss_pred HHHHHHHHhh--c--cHHHHHHHH-HHHHhhhCCEEEEEEEcCCCCCceEEEEEcCCeEEEEEecCCceEEEEEEEEeec Q lcl|NC_016654. 110 QARADLIFNT--P--RFHSSLVEA-GESCSALSGSFQRIVWDPTIADNAWIDFVDADRAIPEFRWGRLVAVTFWSELAGG 184 (533) Q Consensus 110 ~~~l~~i~~~--n--~f~~~~~~~-~~~~~~~G~~~~~~~~D~~~~~~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~ 184 (533) ...+..+|.. | .=...+.+. +......|.+|+.+..+.. +.+ .. .+|+-. ..+....+. T Consensus 63 ~~~l~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~--~~~-~~------~~~~~~-~~~~~~~~~------ 126 (376) T protein:vir:78 63 RDKLYYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDD--FLI-AD------SYVRKE-FAFFPDVFE------ 126 (376) T ss_pred cchHHHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCC--eee-cc------ceeecc-cceeeeeee------ Confidence 2223333321 2 112233333 3344445766666544432 211 11 122210 011000000 Q ss_pred CCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhccccccccccccccCCceeecCCCccceeEEecCCcccccccc Q lcl|NC_016654. 185 DGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVEGADEGRGAYVETGVKDLTAAYVPNVTPNPEWRH 264 (533) Q Consensus 185 ~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~ 264 (533) .+.. ..|.+.. .| +- . -+.|+... . T Consensus 127 -------~~~~-~~~~~~~-~~-----------~~----------~------------------evih~~~~-------~ 151 (376) T protein:vir:78 127 -------GVTV-KDYRYNR-NF-----------SM----------D------------------DVIFLEYG-------N 151 (376) T ss_pred -------eeee-ecceeee-ee-----------cc----------c------------------cEEEeccC-------C Confidence 0000 0000000 00 00 0 01111100 0 Q ss_pred cccccccccchhhhhHHHHHHHHHHHHHHHHHHH-HhCcc-eeeechHHhcCCCCccccccCcchhhhhhcccccccccc Q lcl|NC_016654. 265 DPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDF-RIGAG-KVHASESVLTNLGMGQGVSLDEEQEVYSRVGSGGFNANG 342 (533) Q Consensus 265 ~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~~~-~i~v~~~~l~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 342 (533) .+ +..++. .++..+........+.. ..+.. .+.+ +...........+..+..+.....+..+.++ T Consensus 152 ~~-~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~e~~~~~~~~~~~~~~g~~~~~~ 218 (376) T protein:vir:78 152 ER-LSAFTD--------GMFEDYGELFGKMIRAQMRNFQIRGAVN----FKMAGVADKDKQTKLQEYIDKVYASFNNNEI 218 (376) T ss_pred CC-chhhhh--------HHHHHHHHHHHHHHHHHHhcCCCceeEE----EccCCCCCHHHHHHHHHHHHHHhccccccCc Confidence 00 111111 12222223333322222 22221 1111 1110000000000011111111111101110 Q ss_pred -----ccccceeeec--h-h--hhhHHHHHHHHHHHHHHHHhhCCChhhcccCCCcchhHHHHHHHhhhHHHHHHHHHHH Q lcl|NC_016654. 343 -----DMETIFEFFQ--P-A--IRVLEHDQGAALLLREVLRKTGYSPVSLGLSDEVAQTATEASGKKDLTVKTTRAKARH 412 (533) Q Consensus 343 -----~~~~~i~~~~--~-~--ir~e~~~~~l~~~l~~i~~~~g~s~~~~g~~~~~~~Tatai~~~~~~l~~~~~~~~~~ 412 (533) +....++.++ + + ....++.+..+...++|+...|+||..+|.+.++ .++. .... T Consensus 219 ~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~~~~s~---~e~~-------------~~~f 282 (376) T protein:vir:78 219 AIVPQLEGFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLHGDMAD---LSNN-------------MKAY 282 (376) T ss_pred ceEEcCCCceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC---HHHH-------------HHHH Confidence 0011122222 2 1 1234678888888899999999999999733221 1111 1233 Q ss_pred HHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCHHHHHHH Q lcl|NC_016654. 413 FGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDLAKAQTVQAWSVASAASTKTKVAYLHEDWDDERVQEE 492 (533) Q Consensus 413 ~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~e~a~~~~~l~~aGi~S~et~v~~l~~~~~dee~~~E 492 (533) +..+|..++..+..-.+..+.. .....+.+++....-.|..+.++.+.+++.+|+|+..++.+.+ ++.. T Consensus 283 ~~~~l~P~~~~ie~~l~~kll~---~~~~~~~~~~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~l--g~~p------ 351 (376) T protein:vir:78 283 MEYCIDPLTKKLEDELNAKLFT---FSEFLAGEHIKIIHKKDIIENAEAVDKLVASGSFNRNEVRELL--GAER------ 351 (376) T ss_pred HHHHHHHHHHHHHHHHHhhhCC---cccceecccchhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHh--CCCC------ Confidence 4444555544443332222221 1122344455555667899999999999999999998866654 2221 Q ss_pred HHHHHHhhhcccCccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_016654. 493 ADLIDNANTVSAPTFGFGTDQPPLPTENDPATDPEAVDEG 532 (533) Q Consensus 493 l~rI~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 532 (533) +. .+. .+..-...+..|..+. +++| T Consensus 352 ---~~--~g~-~d~~~~~~n~~~~~~~---------~e~g 376 (376) T protein:vir:78 352 ---VD--NPE-LDKYLITKNYQSADEG---------GEDG 376 (376) T ss_pred ---CC--CCC-CceeeeccCceehhcc---------ccCC Confidence 00 000 0000001112221111 1111 No 272 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=20.91 E-value=2.7 Score=18.23 Aligned_cols=285 Identities=10% Similarity=0.048 Sum_probs=106.4 Q ss_pred HHHHHhccCcchhhHHHHHHHHHHHHHhcccCCCCCcccceeecChHHHHHHH---HHHhhcCCCceEeeCCCchHHHHH Q lcl|NC_016654. 36 LATFYGAEGRTSPSGIKARTKAAYEAFHGRTPTATGRAPKRYHAPIPGVIAKL---STTELFSEQLKFLDAGKSKEVQAR 112 (533) Q Consensus 36 l~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~n~~k~i~~~---~a~ll~~e~~~i~~~~~~~~~~~~ 112 (533) ..+ ++++...+-.+..... ...+-||+|..+.. .. ...++ T Consensus 1 ~~~----------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~-~~--~~~~~ 43 (344) T protein:vir:56 1 MSK----------------------------------KKGKTPQPAAKTMTASAPKMEAFTFGEPVPVLD-RR--DILDY 43 (344) T ss_pred CCC----------------------------------CCCCCCchhhHHhhcCCCceEEEEcCCceeecC-cc--hhhhH Confidence 000 0000000000000000 01123333311110 00 00111 Q ss_pred HH------------------H---------------------HHhhccH--HHHHHHHHHHHhhhCCEEEEEEEcCCCCC Q lcl|NC_016654. 113 AD------------------L---------------------IFNTPRF--HSSLVEAGESCSALSGSFQRIVWDPTIAD 151 (533) Q Consensus 113 l~------------------~---------------------i~~~n~f--~~~~~~~~~~~~~~G~~~~~~~~D~~~~~ 151 (533) +. + .+.-|.. ...+..++..-+..|.+|+.+..+..|. T Consensus 44 ~~~~~~~~~~~pp~~~~~la~~~~a~~~h~s~i~~k~n~l~~~~~Pnp~~t~~~f~~~~~d~ll~Gnay~~~~rn~~G~- 122 (344) T protein:vir:56 44 VECISNGRWYEPPVSFTGLAKSLRAAVHHSSPIYVKRNILASTFIPHPWLSQQDFSRFVLDFLVFGNAFLEKRYSTTGK- 122 (344) T ss_pred HHhhhcCccccCCCCHHHHHHHHhhhhhhCccceehhhhHHhhcCCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCc- Confidence 11 1 1111111 1234444555566677777666655432 Q ss_pred ceEEEEEcCCeEEEEEecCCceEEEEEEEEeecCCceEEEEEEEecCeeEEEEEEeccCCcccceeehhhcccccccccc Q lcl|NC_016654. 152 NAWIDFVDADRAIPEFRWGRLVAVTFWSELAGGDGQEVWRHLERHESGYIVHAVYKGTATSLGWMMALTDHPATRDIAVE 231 (533) Q Consensus 152 ~~~i~~v~~~~~~P~~~~g~~~~v~f~~~~~~~~~~~~y~~lE~h~~~~I~~~~y~~~~~~lG~~v~l~~~~~~~~~~~~ 231 (533) -+.+..++|..+-.. . ++..+|.. .. . |..+.+ T Consensus 123 ~~~L~pl~~~~v~~~----------------~-~~~~~~~~--------------~~--~--g~~~~~------------ 155 (344) T protein:vir:56 123 VIRLETSPAKYTRRG----------------V-EEDVYWWV--------------PS--F--NEPTAF------------ 155 (344) T ss_pred EEEEEEeCCceeEEe----------------e-cCCEEEEE--------------ec--C--CeEEEE------------ Confidence 233444444332211 0 11111100 00 0 111100 Q ss_pred ccccCCceeecCCCccceeEEecCCcccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHH-HhCcceee---e Q lcl|NC_016654. 232 GADEGRGAYVETGVKDLTAAYVPNVTPNPEWRHDPKLRYLGRADLSTDLFPTFHELDRIYSSLMRDF-RIGAGKVH---A 307 (533) Q Consensus 232 ~~~~~~~~~~~~g~~~~~~~~~pn~~~~~~~~~~~~~~~~G~S~~~~~i~~lid~lD~~~s~~~~~~-~~~~~~i~---v 307 (533) ..--+.|+.+.-++ ...+|.|.+..++.. + .++..-..+.+.+ +.|...=+ + T Consensus 156 --------------~~~dIiHir~~~~~--------~~~~Gls~~~~a~~s-i-~l~~~a~~~~~~~f~NGa~pg~Il~~ 211 (344) T protein:vir:56 156 --------------APGSVFHLLEPDIN--------QELYGLPEYLSALNS-A-WLNESATLFRRKYYENGAHAGYIMYV 211 (344) T ss_pred --------------cCccEEEECCCCCC--------CCcccccHHHHHHHH-H-HHHHHHHHHHHHHHhccCCCceEEEe Confidence 00013344332111 134688888776643 3 3566666666554 55432222 2 Q ss_pred chHHhcCCCCccccccCcchhhhhhcccc--------ccccccccccceeeechhhhhHHHHHHHHHHHHHHHHhhCCCh Q lcl|NC_016654. 308 SESVLTNLGMGQGVSLDEEQEVYSRVGSG--------GFNANGDMETIFEFFQPAIRVLEHDQGAALLLREVLRKTGYSP 379 (533) Q Consensus 308 ~~~~l~~~~~~~~~~~d~~~~~~~~~~~~--------~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~ 379 (533) +...+... . .+.-...+...... ....+..+...++.++......++++.-+...+.|+..-|++| T Consensus 212 ~d~~ls~e---~---~~~lk~~~~~~~g~~~~r~l~l~~p~g~~~G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp 285 (344) T protein:vir:56 212 TDAVQDRN---D---IEMLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPF 285 (344) T ss_pred cCCCCCHH---H---HHHHHHHHHHhcCCCCccceEEecCCCCccceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCH Confidence 21111100 0 00001111111100 0001111222344455555667888888888899999999999 Q ss_pred hhcccCCCc---chhHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCceeEEEEeCCCCCCCHH Q lcl|NC_016654. 380 VSLGLSDEV---AQTATEASGKKDLTVKTTRAKARHFGSALGPLSTTCLRVDAIKFPGKGAAPSEELELEWPKFARESDL 456 (533) Q Consensus 380 ~~~g~~~~~---~~Tatai~~~~~~l~~~~~~~~~~~~~al~~li~~il~l~~~~~~~~~~~~~~~v~i~f~d~i~~d~~ 456 (533) ..+|+-.++ -.+.++... ..++..|.-++..+.++.+. + +. -.+.|++....+.. T Consensus 286 ~llGi~~~~t~~~~n~eq~~~-------------~f~~~tL~Pl~~~ie~~n~~-l-~~-------~~~~F~~y~l~~~~ 343 (344) T protein:vir:56 286 QLMGGKPENVGSLGDIEKVAK-------------VFVRNELIPLQDRIREINGW-I-GQ-------EVIRFKNYSLDTDN 343 (344) T ss_pred HHhccCCCCCCccccHHHHHH-------------HHHHHHHHHHHHHHHHHHhh-h-cc-------ccccCCCccccccC Confidence 999863322 222332211 22333334444433333221 1 10 12457766665555 Q ss_pred H Q lcl|NC_016654. 457 A 457 (533) Q Consensus 457 e 457 (533) + T Consensus 344 ~ 344 (344) T protein:vir:56 344 G 344 (344) T ss_pred C Confidence 5 Done!