Query lcl|NC_019418.1_cdsid_YP_006990343.1 [gene=phiNJ2_0024] [protein=putative phage portal protein] [protein_id=YP_006990343.1] [location=complement(19199..20782)] Match_columns 527 No_of_seqs 131 out of 186 Neff 8.2 Searched_HMMs 1612 Date Thu Nov 7 17:42:51 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_24 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_24_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:4782 Length: 522 # 100.0 3E-158 2E-161 884.2 58.7 520 1-522 1-522 (522) 2 protein:vir:98883 Length: 517 100.0 4E-153 2E-156 856.3 58.7 506 1-525 1-517 (517) 3 protein:vir:3028 Length: 500 # 100.0 1E-152 6E-156 853.9 58.6 500 1-516 1-500 (500) 4 protein:vir:9815 Length: 500 # 100.0 1E-152 6E-156 853.9 58.6 500 1-516 1-500 (500) 5 protein:vir:79703 Length: 505 100.0 3E-149 2E-152 835.2 57.6 495 1-508 1-505 (505) 6 protein:vir:1587 Length: 508 # 100.0 4E-147 3E-150 823.1 58.7 496 1-525 1-508 (508) 7 protein:vir:80959 Length: 499 100.0 8E-135 5E-138 755.7 56.0 492 3-526 1-499 (499) 8 protein:vir:38 Length: 496 # N 100.0 5E-127 3E-130 713.0 56.3 491 3-526 1-496 (496) 9 protein:vir:78907 Length: 518 100.0 2E-117 1E-120 660.8 53.0 485 1-510 1-518 (518) 10 protein:vir:5961 Length: 503 # 100.0 1E-58 6.3E-62 338.5 46.2 476 1-527 9-496 (503) 11 protein:vir:96240 Length: 511 100.0 5.1E-58 3.1E-61 334.7 49.8 464 1-526 39-511 (511) 12 protein:vir:99781 Length: 511 100.0 8E-58 4.9E-61 333.6 48.7 464 1-526 39-511 (511) 13 protein:vir:96366 Length: 511 100.0 6.4E-57 4E-60 328.7 49.2 464 1-526 39-511 (511) 14 protein:vir:78805 Length: 511 100.0 6.4E-57 4E-60 328.7 49.2 464 1-526 39-511 (511) 15 protein:vir:103951 Length: 511 100.0 4.7E-56 2.9E-59 323.9 49.9 464 1-526 39-511 (511) 16 protein:vir:9306 Length: 511 # 100.0 4.7E-56 2.9E-59 323.9 49.2 464 1-526 39-511 (511) 17 protein:vir:105461 Length: 470 100.0 1.2E-55 7.2E-59 321.7 47.5 451 2-526 1-470 (470) 18 protein:vir:97171 Length: 512 100.0 3.8E-55 2.4E-58 318.9 49.3 463 1-526 31-512 (512) 19 protein:vir:79043 Length: 479 100.0 1.5E-55 9.3E-59 321.1 46.5 445 1-527 18-479 (479) 20 protein:vir:2732 Length: 501 # 100.0 3.7E-55 2.3E-58 319.0 47.3 449 1-527 38-498 (501) 21 protein:vir:9922 Length: 489 # 100.0 5.5E-55 3.4E-58 318.0 47.5 458 1-521 13-489 (489) 22 protein:vir:96179 Length: 468 100.0 1.7E-55 1.1E-58 320.8 44.6 453 3-514 1-468 (468) 23 protein:vir:94498 Length: 474 100.0 3.4E-55 2.1E-58 319.2 45.3 455 3-526 1-474 (474) 24 protein:vir:97447 Length: 474 100.0 3.4E-55 2.1E-58 319.2 45.3 455 3-526 1-474 (474) 25 protein:vir:106571 Length: 499 100.0 1E-54 6.4E-58 316.6 47.6 454 1-527 1-488 (499) 26 protein:vir:96494 Length: 501 100.0 1.2E-54 7.6E-58 316.2 47.8 449 1-526 38-501 (501) 27 protein:vir:4898 Length: 502 # 100.0 5.2E-55 3.2E-58 318.2 45.7 468 1-527 1-497 (502) 28 protein:vir:96839 Length: 474 100.0 4.3E-55 2.7E-58 318.6 44.5 457 1-523 1-474 (474) 29 protein:vir:106639 Length: 481 100.0 3.2E-54 2E-57 313.9 48.6 462 1-527 6-481 (481) 30 protein:vir:95113 Length: 474 100.0 3E-55 1.8E-58 319.5 42.6 457 1-526 7-474 (474) 31 protein:vir:105292 Length: 478 100.0 1.6E-54 9.7E-58 315.6 45.7 458 1-524 1-478 (478) 32 protein:vir:94546 Length: 506 100.0 2.8E-54 1.8E-57 314.2 45.5 475 2-527 1-504 (506) 33 protein:vir:1236 Length: 483 # 100.0 5.8E-54 3.6E-57 312.4 46.3 441 1-526 34-483 (483) 34 protein:vir:107112 Length: 478 100.0 4.3E-54 2.7E-57 313.2 45.4 458 1-524 1-478 (478) 35 protein:vir:3964 Length: 453 # 100.0 1.1E-53 6.6E-57 311.0 47.0 436 1-526 11-453 (453) 36 protein:vir:99522 Length: 470 100.0 3.9E-53 2.4E-56 307.9 47.8 445 1-526 19-470 (470) 37 protein:vir:93747 Length: 472 100.0 1.7E-53 1.1E-56 309.9 45.6 457 1-526 5-472 (472) 38 protein:vir:102950 Length: 471 100.0 1.1E-52 6.7E-56 305.5 47.4 445 1-515 1-471 (471) 39 protein:vir:3609 Length: 452 # 100.0 4.5E-53 2.8E-56 307.6 45.3 434 1-526 17-452 (452) 40 protein:vir:94805 Length: 492 100.0 1E-52 6.3E-56 305.6 45.4 459 1-526 4-492 (492) 41 protein:vir:97336 Length: 492 100.0 1.9E-52 1.2E-55 304.2 45.5 463 1-526 4-492 (492) 42 protein:vir:96266 Length: 474 100.0 3.6E-52 2.2E-55 302.6 44.7 441 1-526 26-474 (474) 43 protein:vir:95899 Length: 474 100.0 3.6E-52 2.2E-55 302.6 44.7 441 1-526 26-474 (474) 44 protein:vir:95806 Length: 440 100.0 5E-52 3.1E-55 301.8 43.9 430 6-525 1-440 (440) 45 protein:vir:9871 Length: 429 # 100.0 4.8E-52 3E-55 301.9 42.7 426 6-525 1-429 (429) 46 protein:vir:733 Length: 453 # 100.0 2.2E-52 1.3E-55 303.8 39.4 446 1-517 1-453 (453) 47 protein:vir:78083 Length: 537 100.0 2.7E-50 1.7E-53 292.4 47.8 480 1-527 8-526 (537) 48 protein:vir:105889 Length: 474 100.0 5.1E-50 3.2E-53 290.8 44.9 452 1-526 1-474 (474) 49 protein:vir:94101 Length: 474 100.0 5.1E-50 3.2E-53 290.8 44.9 452 1-526 1-474 (474) 50 protein:vir:102330 Length: 451 100.0 1.6E-48 9.9E-52 282.6 42.7 431 1-511 1-451 (451) 51 protein:vir:78537 Length: 480 100.0 2.4E-46 1.5E-49 270.7 43.0 453 1-527 1-470 (480) 52 protein:vir:78227 Length: 480 100.0 2.3E-46 1.4E-49 270.8 42.7 454 1-527 1-470 (480) 53 protein:vir:2427 Length: 485 # 100.0 4E-46 2.5E-49 269.5 41.8 457 16-527 1-485 (485) 54 protein:vir:7768 Length: 484 # 100.0 1.9E-45 1.2E-48 265.8 44.0 443 1-527 14-482 (484) 55 protein:vir:104082 Length: 485 100.0 7.1E-45 4.4E-48 262.6 43.2 453 1-525 8-485 (485) 56 protein:vir:80680 Length: 441 100.0 2.1E-44 1.3E-47 260.1 43.8 433 3-514 1-441 (441) 57 protein:vir:2500 Length: 501 # 100.0 2.8E-44 1.7E-47 259.4 44.5 463 19-527 1-499 (501) 58 protein:vir:4223 Length: 486 # 100.0 1.9E-44 1.2E-47 260.3 43.4 451 1-524 6-486 (486) 59 protein:vir:99072 Length: 479 100.0 4E-44 2.5E-47 258.5 41.4 445 1-527 9-471 (479) 60 protein:vir:2341 Length: 488 # 100.0 1.1E-43 6.7E-47 256.1 43.4 450 1-527 10-486 (488) 61 protein:vir:7987 Length: 456 # 100.0 7.3E-44 4.5E-47 257.1 40.9 444 1-521 4-456 (456) 62 protein:vir:102602 Length: 456 100.0 9E-43 5.6E-46 251.1 41.8 445 1-517 1-456 (456) 63 protein:vir:105819 Length: 456 100.0 9E-43 5.6E-46 251.1 41.8 445 1-517 1-456 (456) 64 protein:vir:98444 Length: 434 100.0 1.3E-41 7.8E-45 244.8 39.4 425 55-527 1-434 (434) 65 protein:vir:99916 Length: 504 100.0 4.9E-41 3E-44 241.6 41.4 458 1-527 1-498 (504) 66 protein:vir:8184 Length: 474 # 100.0 8.8E-36 5.4E-39 212.8 38.9 439 1-513 17-474 (474) 67 protein:vir:9568 Length: 410 # 100.0 6.4E-35 3.9E-38 208.1 40.0 397 32-497 1-410 (410) 68 protein:vir:9751 Length: 422 # 100.0 4.2E-34 2.6E-37 203.6 39.2 406 1-498 1-422 (422) 69 protein:vir:94742 Length: 409 100.0 3.1E-34 1.9E-37 204.3 36.4 395 1-483 1-409 (409) 70 protein:vir:1634 Length: 409 # 100.0 2.9E-34 1.8E-37 204.4 35.5 395 1-483 1-409 (409) 71 protein:vir:101494 Length: 527 100.0 7.3E-34 4.6E-37 202.2 33.7 474 14-526 1-527 (527) 72 protein:vir:102239 Length: 527 100.0 8.2E-34 5.1E-37 202.0 33.7 474 14-526 1-527 (527) 73 protein:vir:7430 Length: 563 # 100.0 4E-33 2.5E-36 198.2 33.8 479 14-527 1-552 (563) 74 protein:vir:94956 Length: 452 99.7 6.2E-15 3.9E-18 98.5 37.1 429 18-527 1-452 (452) 75 protein:vir:97265 Length: 513 99.7 4.6E-14 2.8E-17 93.7 38.5 441 14-525 1-513 (513) 76 protein:vir:79538 Length: 502 99.7 3.6E-14 2.2E-17 94.3 37.9 438 1-526 1-502 (502) 77 protein:vir:80453 Length: 535 99.6 1.6E-13 9.9E-17 90.7 42.7 472 4-527 1-534 (535) 78 protein:vir:95149 Length: 501 99.6 6.6E-13 4.1E-16 87.4 40.5 439 29-525 1-501 (501) 79 protein:vir:95542 Length: 548 99.6 4.7E-13 2.9E-16 88.2 35.8 442 1-527 1-514 (548) 80 protein:vir:96738 Length: 505 99.5 1.1E-12 6.6E-16 86.2 34.8 436 1-524 8-505 (505) 81 protein:vir:78393 Length: 489 99.5 1.6E-12 1E-15 85.2 35.0 436 18-514 1-489 (489) 82 protein:vir:93630 Length: 776 99.5 2.2E-14 1.4E-17 95.4 24.7 486 1-527 44-659 (776) 83 protein:vir:80040 Length: 461 99.5 1.7E-13 1E-16 90.7 29.2 427 1-527 1-459 (461) 84 protein:vir:8846 Length: 705 # 99.5 4.1E-13 2.5E-16 88.5 31.3 473 1-527 10-609 (705) 85 protein:vir:95014 Length: 491 99.5 3.2E-12 2E-15 83.6 34.3 435 18-514 1-491 (491) 86 protein:vir:5249 Length: 437 # 99.5 3.1E-12 1.9E-15 83.7 31.1 401 1-524 1-437 (437) 87 protein:vir:80165 Length: 651 99.4 2E-11 1.3E-14 79.2 32.5 489 1-527 1-605 (651) 88 protein:vir:95449 Length: 584 99.3 4.1E-11 2.5E-14 77.6 30.8 467 1-505 1-584 (584) 89 protein:vir:108295 Length: 711 99.3 8.1E-11 5E-14 75.9 30.5 476 1-527 29-625 (711) 90 protein:vir:96783 Length: 488 99.3 2.2E-10 1.4E-13 73.5 34.9 427 1-498 14-488 (488) 91 protein:vir:107742 Length: 537 99.3 5E-11 3.1E-14 77.0 27.2 451 1-527 25-531 (537) 92 protein:vir:80644 Length: 551 99.3 1.6E-11 9.9E-15 79.8 23.8 444 1-527 5-526 (551) 93 protein:vir:105002 Length: 432 99.2 1.8E-10 1.1E-13 74.0 29.2 408 1-522 1-432 (432) 94 protein:vir:107605 Length: 432 99.2 1.8E-10 1.1E-13 74.0 29.2 408 1-522 1-432 (432) 95 protein:vir:102855 Length: 432 99.2 1.8E-10 1.1E-13 74.0 29.2 408 1-522 1-432 (432) 96 protein:vir:3420 Length: 533 # 99.2 3.6E-10 2.2E-13 72.4 35.6 450 19-527 1-530 (533) 97 protein:vir:63755 Length: 547 99.2 1.6E-10 9.8E-14 74.3 26.9 442 1-527 1-522 (547) 98 protein:vir:10321 Length: 495 99.2 6.2E-10 3.8E-13 71.1 34.8 437 1-526 1-495 (495) 99 protein:vir:79647 Length: 435 99.2 4.6E-10 2.8E-13 71.8 28.0 415 1-526 1-435 (435) 100 protein:vir:389 Length: 530 # 99.2 1.1E-09 6.6E-13 69.8 39.4 445 18-527 1-527 (530) 101 protein:vir:95821 Length: 763 99.1 1.2E-09 7.3E-13 69.5 31.4 460 1-527 26-627 (763) 102 protein:vir:6382 Length: 553 # 99.1 1.6E-09 1E-12 68.8 34.2 456 3-525 1-553 (553) 103 protein:vir:3139 Length: 599 # 99.1 1.8E-09 1.1E-12 68.6 28.6 470 1-511 1-599 (599) 104 protein:vir:104437 Length: 714 99.1 1.2E-09 7.7E-13 69.4 27.3 481 1-527 17-636 (714) 105 protein:vir:6240 Length: 457 # 99.1 1.1E-09 6.6E-13 69.8 27.0 421 1-527 1-449 (457) 106 protein:vir:102080 Length: 429 99.1 2.5E-09 1.6E-12 67.7 29.0 405 1-525 1-429 (429) 107 protein:vir:3296 Length: 714 # 99.1 6.2E-10 3.9E-13 71.1 24.6 495 1-527 6-624 (714) 108 protein:vir:9950 Length: 714 # 99.1 6.2E-10 3.9E-13 71.1 24.6 495 1-527 6-624 (714) 109 protein:vir:2764 Length: 714 # 99.1 6.2E-10 3.9E-13 71.1 24.6 495 1-527 6-624 (714) 110 protein:vir:817 Length: 714 # 99.1 6.2E-10 3.9E-13 71.1 24.6 495 1-527 6-624 (714) 111 protein:vir:10117 Length: 714 99.1 6.2E-10 3.9E-13 71.1 24.6 495 1-527 6-624 (714) 112 protein:vir:1326 Length: 457 # 99.1 1.3E-09 8E-13 69.3 26.2 419 1-527 1-457 (457) 113 protein:vir:105619 Length: 772 99.1 3.2E-09 2E-12 67.2 28.0 494 1-527 1-646 (772) 114 protein:vir:96068 Length: 765 99.1 6.5E-10 4E-13 71.0 24.1 462 1-527 37-539 (765) 115 protein:vir:105429 Length: 708 99.1 3.6E-09 2.2E-12 66.9 29.7 498 3-527 1-625 (708) 116 protein:vir:99563 Length: 862 99.1 9.1E-10 5.7E-13 70.1 24.7 439 1-527 66-592 (862) 117 protein:vir:81152 Length: 411 99.0 4.2E-09 2.6E-12 66.5 30.8 391 1-524 1-411 (411) 118 protein:vir:107662 Length: 427 99.0 4.4E-10 2.7E-13 71.9 21.5 406 1-525 1-427 (427) 119 protein:vir:104338 Length: 422 99.0 1.9E-09 1.2E-12 68.4 24.7 407 1-527 1-422 (422) 120 protein:vir:94049 Length: 532 99.0 3.7E-09 2.3E-12 66.8 26.1 452 1-527 17-514 (532) 121 protein:vir:107404 Length: 555 98.9 1.1E-08 7.1E-12 64.1 30.0 481 3-527 1-553 (555) 122 protein:vir:98506 Length: 555 98.9 1.1E-08 7.1E-12 64.1 30.0 481 3-527 1-553 (555) 123 protein:vir:107822 Length: 555 98.9 1.1E-08 7.1E-12 64.1 30.0 481 3-527 1-553 (555) 124 protein:vir:102668 Length: 547 98.9 1.4E-08 8.5E-12 63.7 33.9 458 1-521 1-547 (547) 125 protein:vir:94709 Length: 522 98.9 1.6E-08 1E-11 63.3 32.5 450 1-527 1-520 (522) 126 protein:vir:7321 Length: 556 # 98.9 2.1E-08 1.3E-11 62.7 29.8 467 3-515 1-556 (556) 127 protein:vir:1785 Length: 555 # 98.8 2.8E-08 1.7E-11 62.0 31.8 457 7-527 1-551 (555) 128 protein:vir:95315 Length: 559 98.8 2.9E-08 1.8E-11 61.9 30.8 478 3-527 1-558 (559) 129 protein:vir:77597 Length: 725 98.8 3.5E-08 2.2E-11 61.4 24.8 489 1-527 1-613 (725) 130 protein:vir:3520 Length: 720 # 98.8 5.1E-08 3.2E-11 60.6 27.9 468 3-527 1-602 (720) 131 protein:vir:81072 Length: 432 98.8 5.4E-08 3.4E-11 60.4 26.4 402 1-527 7-431 (432) 132 protein:vir:94599 Length: 641 98.8 6.1E-08 3.8E-11 60.1 26.1 471 1-527 23-603 (641) 133 protein:vir:4194 Length: 540 # 98.7 7E-08 4.4E-11 59.8 27.2 424 18-527 1-467 (540) 134 protein:vir:78696 Length: 542 98.7 7.8E-08 4.8E-11 59.6 38.7 466 7-525 1-542 (542) 135 protein:vir:1538 Length: 535 # 98.7 9.1E-08 5.6E-11 59.2 37.1 454 19-527 1-535 (535) 136 protein:vir:3361 Length: 535 # 98.7 1E-07 6.2E-11 59.0 36.6 450 19-527 1-535 (535) 137 protein:vir:3843 Length: 397 # 98.7 1.3E-07 8E-11 58.4 30.1 389 1-526 1-397 (397) 138 protein:vir:9263 Length: 725 # 98.6 1.1E-07 6.5E-11 58.8 22.7 489 1-527 5-613 (725) 139 protein:vir:79772 Length: 648 98.6 1.7E-07 1E-10 57.7 32.1 452 1-527 8-506 (648) 140 protein:vir:4454 Length: 414 # 98.6 1.7E-07 1E-10 57.7 32.1 394 1-527 1-414 (414) 141 protein:vir:8883 Length: 543 # 98.6 1.9E-07 1.2E-10 57.4 31.1 472 15-527 1-542 (543) 142 protein:vir:10362 Length: 432 98.6 2.1E-07 1.3E-10 57.2 26.5 403 1-527 7-431 (432) 143 protein:vir:78942 Length: 510 98.6 2.1E-07 1.3E-10 57.2 36.7 452 7-516 1-510 (510) 144 protein:vir:99672 Length: 532 98.6 2.4E-07 1.5E-10 56.9 30.7 457 4-525 1-532 (532) 145 protein:vir:6322 Length: 510 # 98.6 2.8E-07 1.8E-10 56.5 36.9 451 7-516 1-510 (510) 146 protein:vir:97060 Length: 432 98.6 2.9E-07 1.8E-10 56.4 27.4 404 1-527 7-431 (432) 147 protein:vir:103765 Length: 549 98.5 3.1E-07 1.9E-10 56.3 29.7 473 1-518 1-549 (549) 148 protein:vir:100920 Length: 725 98.5 3.3E-07 2.1E-10 56.1 24.3 483 1-527 1-601 (725) 149 protein:vir:1380 Length: 422 # 98.5 3.5E-07 2.2E-10 56.0 30.3 404 1-523 1-422 (422) 150 protein:vir:4952 Length: 386 # 98.5 3.5E-07 2.2E-10 55.9 30.4 376 1-527 1-385 (386) 151 protein:vir:80796 Length: 574 98.5 3.7E-07 2.3E-10 55.9 26.7 446 1-527 1-526 (574) 152 protein:vir:8418 Length: 409 # 98.5 4.1E-07 2.5E-10 55.6 23.3 389 1-524 1-409 (409) 153 protein:vir:94572 Length: 535 98.4 6.3E-07 3.9E-10 54.6 32.3 455 1-522 1-535 (535) 154 protein:vir:100882 Length: 383 98.4 6.4E-07 4E-10 54.5 26.7 373 1-526 1-383 (383) 155 protein:vir:1266 Length: 416 # 98.4 1E-06 6.2E-10 53.5 26.5 399 2-527 1-416 (416) 156 protein:vir:172 Length: 708 # 98.3 1.5E-06 9.1E-10 52.6 30.4 493 3-527 1-623 (708) 157 protein:vir:4854 Length: 386 # 98.3 1.5E-06 9.1E-10 52.6 26.6 373 1-524 1-386 (386) 158 protein:vir:7407 Length: 392 # 98.3 1.6E-06 1E-09 52.3 30.2 381 1-514 3-392 (392) 159 protein:vir:483 Length: 413 # 98.2 2.2E-06 1.3E-09 51.6 30.8 391 2-527 1-411 (413) 160 protein:vir:100039 Length: 522 98.2 2.2E-06 1.3E-09 51.6 36.8 459 1-525 1-522 (522) 161 protein:vir:99312 Length: 563 98.2 2.4E-06 1.5E-09 51.3 23.0 435 7-527 1-529 (563) 162 protein:vir:95599 Length: 563 98.2 2.4E-06 1.5E-09 51.3 23.0 435 7-527 1-529 (563) 163 protein:vir:102727 Length: 945 98.2 2.7E-06 1.6E-09 51.2 30.0 424 1-527 60-539 (945) 164 protein:vir:2683 Length: 412 # 98.2 3.3E-06 2.1E-09 50.6 26.1 397 1-527 1-412 (412) 165 protein:vir:100249 Length: 431 98.1 3.7E-06 2.3E-09 50.3 26.2 398 1-516 1-431 (431) 166 protein:vir:100187 Length: 385 98.1 3.7E-06 2.3E-09 50.3 26.8 375 1-520 1-385 (385) 167 protein:vir:6210 Length: 394 # 98.1 3.9E-06 2.4E-09 50.2 24.1 371 1-527 1-394 (394) 168 protein:vir:104500 Length: 537 98.0 6.1E-06 3.8E-09 49.2 23.9 452 14-527 1-524 (537) 169 protein:vir:98396 Length: 441 98.0 6.6E-06 4.1E-09 49.0 25.7 379 46-526 1-441 (441) 170 protein:vir:3989 Length: 392 # 98.0 6.6E-06 4.1E-09 49.0 30.2 378 1-514 3-392 (392) 171 protein:vir:1023 Length: 392 # 98.0 6.6E-06 4.1E-09 49.0 30.2 378 1-514 3-392 (392) 172 protein:vir:103330 Length: 517 98.0 7.2E-06 4.4E-09 48.8 36.7 430 1-517 1-517 (517) 173 protein:vir:2198 Length: 536 # 98.0 8.3E-06 5.1E-09 48.4 36.2 463 1-527 1-535 (536) 174 protein:vir:960 Length: 413 # 98.0 8.6E-06 5.3E-09 48.4 23.8 387 1-524 1-413 (413) 175 protein:vir:100150 Length: 437 97.9 1E-05 6.4E-09 47.9 27.3 408 1-526 1-437 (437) 176 protein:vir:105520 Length: 706 97.9 1E-05 6.4E-09 47.9 31.1 488 3-527 1-607 (706) 177 protein:vir:95378 Length: 406 97.9 1.2E-05 7.3E-09 47.6 24.7 383 1-526 1-406 (406) 178 protein:vir:4995 Length: 384 # 97.9 1.3E-05 7.9E-09 47.4 28.7 369 1-527 1-383 (384) 179 protein:vir:103177 Length: 533 97.8 1.6E-05 1E-08 46.8 21.5 449 15-527 1-529 (533) 180 protein:vir:4337 Length: 434 # 97.8 1.6E-05 1E-08 46.8 25.5 404 3-525 1-434 (434) 181 protein:vir:105782 Length: 449 97.8 1.7E-05 1.1E-08 46.7 26.2 417 1-527 1-447 (449) 182 protein:vir:3153 Length: 467 # 97.8 2E-05 1.3E-08 46.3 33.6 399 56-527 1-462 (467) 183 protein:vir:7853 Length: 518 # 97.8 2.2E-05 1.3E-08 46.2 29.7 409 14-527 1-451 (518) 184 protein:vir:101648 Length: 518 97.8 2.2E-05 1.4E-08 46.1 29.1 407 14-527 1-451 (518) 185 protein:vir:4156 Length: 542 # 97.7 2.5E-05 1.5E-08 45.8 27.5 422 3-527 1-471 (542) 186 protein:vir:102118 Length: 409 97.7 2.8E-05 1.7E-08 45.6 32.7 389 17-524 1-409 (409) 187 protein:vir:96980 Length: 409 97.7 3E-05 1.9E-08 45.4 27.1 392 1-527 4-409 (409) 188 protein:vir:93943 Length: 409 97.7 3.1E-05 1.9E-08 45.3 26.4 392 1-527 4-409 (409) 189 protein:vir:1082 Length: 359 # 97.7 3.2E-05 2E-08 45.2 25.5 346 1-480 1-359 (359) 190 protein:vir:10447 Length: 536 97.6 3.4E-05 2.1E-08 45.1 38.5 463 1-527 1-535 (536) 191 protein:vir:4828 Length: 382 # 97.6 3.5E-05 2.2E-08 45.0 26.7 375 1-516 1-382 (382) 192 protein:vir:189 Length: 424 # 97.6 4E-05 2.5E-08 44.7 27.9 388 1-522 14-424 (424) 193 protein:vir:79984 Length: 441 97.6 4.3E-05 2.7E-08 44.5 27.2 379 46-526 1-441 (441) 194 protein:vir:9408 Length: 441 # 97.6 4.3E-05 2.7E-08 44.5 27.2 379 46-526 1-441 (441) 195 protein:vir:1884 Length: 424 # 97.5 5.6E-05 3.5E-08 43.9 29.0 387 1-522 14-424 (424) 196 protein:vir:3868 Length: 417 # 97.5 6.2E-05 3.8E-08 43.7 28.3 390 16-527 1-416 (417) 197 protein:vir:81218 Length: 423 97.5 6.2E-05 3.8E-08 43.7 26.8 396 1-527 1-423 (423) 198 protein:vir:9359 Length: 348 # 97.4 7.2E-05 4.4E-08 43.3 26.1 338 50-527 1-348 (348) 199 protein:vir:104259 Length: 403 97.4 7.6E-05 4.7E-08 43.2 27.1 381 1-524 1-403 (403) 200 protein:vir:96579 Length: 576 97.4 7.7E-05 4.8E-08 43.1 27.8 438 3-527 1-531 (576) 201 protein:vir:8100 Length: 466 # 97.3 9.3E-05 5.8E-08 42.7 28.6 420 1-527 1-466 (466) 202 protein:vir:4598 Length: 416 # 97.2 0.00013 8.2E-08 41.8 26.5 394 1-526 1-416 (416) 203 protein:vir:81095 Length: 416 97.2 0.00013 8.2E-08 41.8 26.5 394 1-526 1-416 (416) 204 protein:vir:96988 Length: 516 97.1 0.00016 9.7E-08 41.5 31.9 431 3-512 1-516 (516) 205 protein:vir:93610 Length: 454 97.0 0.00022 1.4E-07 40.6 29.7 408 3-527 1-441 (454) 206 protein:vir:7017 Length: 515 # 97.0 0.00024 1.5E-07 40.5 32.3 431 3-517 1-515 (515) 207 protein:vir:101647 Length: 460 96.9 0.00026 1.6E-07 40.3 28.4 402 3-523 1-460 (460) 208 protein:vir:94426 Length: 409 96.8 0.00032 2E-07 39.7 29.8 392 1-527 4-409 (409) 209 protein:vir:105064 Length: 421 96.7 0.00037 2.3E-07 39.4 28.3 394 1-524 1-421 (421) 210 protein:vir:4509 Length: 424 # 96.7 0.0004 2.5E-07 39.2 31.7 381 1-526 16-424 (424) 211 protein:vir:105641 Length: 516 96.5 0.00052 3.2E-07 38.6 30.2 433 3-517 1-516 (516) 212 protein:vir:80211 Length: 514 96.5 0.00057 3.6E-07 38.4 36.6 425 7-507 1-514 (514) 213 protein:vir:101289 Length: 395 96.4 0.00062 3.8E-07 38.2 22.4 372 4-527 1-393 (395) 214 protein:vir:9507 Length: 395 # 96.4 0.00062 3.8E-07 38.2 22.4 372 4-527 1-393 (395) 215 protein:vir:100650 Length: 395 96.4 0.00062 3.8E-07 38.2 22.4 372 4-527 1-393 (395) 216 protein:vir:6896 Length: 523 # 96.4 0.00064 3.9E-07 38.1 22.2 451 1-518 1-523 (523) 217 protein:vir:100691 Length: 535 96.4 0.00065 4E-07 38.1 32.7 436 1-527 1-514 (535) 218 protein:vir:101806 Length: 516 96.3 0.00076 4.7E-07 37.7 28.8 432 1-518 1-516 (516) 219 protein:vir:101189 Length: 516 96.3 0.00076 4.7E-07 37.7 28.8 432 1-518 1-516 (516) 220 protein:vir:104892 Length: 558 96.3 0.00082 5.1E-07 37.5 23.8 443 1-527 5-545 (558) 221 protein:vir:345 Length: 663 # 96.1 0.00095 5.9E-07 37.1 23.3 463 1-527 1-591 (663) 222 protein:vir:80134 Length: 403 95.9 0.0013 8.3E-07 36.3 22.2 375 1-526 1-403 (403) 223 protein:vir:3648 Length: 695 # 95.7 0.0015 9.5E-07 36.0 18.7 448 1-527 67-569 (695) 224 protein:vir:5737 Length: 419 # 95.5 0.002 1.2E-06 35.4 27.4 386 1-527 1-413 (419) 225 protein:vir:1431 Length: 419 # 95.3 0.0024 1.5E-06 35.0 30.9 388 2-527 1-414 (419) 226 protein:vir:8317 Length: 409 # 95.3 0.0024 1.5E-06 35.0 23.3 385 1-511 1-409 (409) 227 protein:vir:106282 Length: 521 95.1 0.0027 1.6E-06 34.7 29.6 431 1-518 1-521 (521) 228 protein:vir:9702 Length: 406 # 95.1 0.0028 1.7E-06 34.6 26.7 383 14-527 1-406 (406) 229 protein:vir:107880 Length: 491 94.9 0.0033 2.1E-06 34.2 24.7 395 19-527 1-422 (491) 230 protein:vir:7208 Length: 524 # 94.6 0.0039 2.4E-06 33.8 25.7 432 1-518 1-524 (524) 231 protein:vir:103458 Length: 524 94.6 0.0041 2.5E-06 33.7 25.7 432 1-518 1-524 (524) 232 protein:vir:103219 Length: 201 94.5 0.0037 2.3E-06 33.9 11.9 196 265-524 1-201 (201) 233 protein:vir:106999 Length: 564 94.5 0.0042 2.6E-06 33.6 25.2 463 15-527 1-545 (564) 234 protein:vir:78161 Length: 355 94.5 0.0042 2.6E-06 33.6 19.2 313 162-527 1-336 (355) 235 protein:vir:108049 Length: 524 94.1 0.0053 3.3E-06 33.1 28.2 430 1-518 1-524 (524) 236 protein:vir:78589 Length: 695 93.8 0.0063 3.9E-06 32.7 21.7 416 1-527 102-569 (695) 237 protein:vir:80333 Length: 419 93.7 0.0066 4.1E-06 32.5 29.4 391 18-527 1-414 (419) 238 protein:vir:101541 Length: 694 93.5 0.0074 4.6E-06 32.3 21.6 423 1-527 91-568 (694) 239 protein:vir:99853 Length: 488 93.4 0.0077 4.8E-06 32.2 27.6 385 29-527 1-417 (488) 240 protein:vir:93867 Length: 378 92.7 0.01 6.4E-06 31.5 18.1 345 1-527 1-377 (378) 241 protein:vir:4089 Length: 395 # 92.0 0.013 8.3E-06 30.8 25.5 379 1-527 1-395 (395) 242 protein:vir:99452 Length: 651 91.9 0.014 8.5E-06 30.8 27.0 434 1-527 55-542 (651) 243 protein:vir:81017 Length: 521 91.4 0.016 9.9E-06 30.4 30.3 429 1-518 2-521 (521) 244 protein:vir:6596 Length: 521 # 90.7 0.019 1.2E-05 30.0 30.1 428 1-518 2-521 (521) 245 protein:vir:5665 Length: 511 # 90.6 0.02 1.2E-05 29.9 26.8 419 30-513 1-511 (511) 246 protein:vir:100598 Length: 516 90.6 0.02 1.2E-05 29.9 29.4 446 1-514 1-516 (516) 247 protein:vir:78641 Length: 278 89.4 0.026 1.6E-05 29.2 22.1 266 83-448 1-278 (278) 248 protein:vir:1986 Length: 512 # 89.4 0.026 1.6E-05 29.2 24.9 416 1-527 1-449 (512) 249 protein:vir:1661 Length: 378 # 88.7 0.031 1.9E-05 28.9 18.5 358 1-527 1-377 (378) 250 protein:vir:106716 Length: 698 88.3 0.033 2.1E-05 28.7 24.4 418 1-527 102-572 (698) 251 protein:vir:94002 Length: 378 88.1 0.035 2.1E-05 28.6 19.0 345 1-527 1-377 (378) 252 protein:vir:103860 Length: 528 87.6 0.038 2.3E-05 28.4 29.1 422 1-527 1-454 (528) 253 protein:vir:95965 Length: 385 83.5 0.068 4.2E-05 27.0 24.0 363 4-523 1-385 (385) 254 protein:vir:5839 Length: 533 # 80.5 0.094 5.8E-05 26.2 25.3 422 1-527 17-527 (533) 255 protein:vir:108215 Length: 469 79.0 0.11 6.7E-05 25.9 27.3 408 26-527 1-469 (469) 256 protein:vir:98853 Length: 219 78.9 0.11 6.8E-05 25.8 13.5 207 172-452 1-219 (219) 257 protein:vir:99232 Length: 526 76.4 0.14 8.4E-05 25.3 29.4 414 1-527 1-451 (526) 258 protein:vir:79063 Length: 491 75.9 0.14 8.8E-05 25.2 29.6 395 18-527 1-422 (491) 259 protein:vir:94666 Length: 723 74.0 0.16 0.0001 24.9 30.7 391 32-527 1-446 (723) 260 protein:vir:98265 Length: 524 71.2 0.2 0.00012 24.4 29.9 433 1-518 1-524 (524) 261 protein:vir:79233 Length: 526 69.9 0.22 0.00013 24.2 31.4 417 1-527 1-452 (526) 262 protein:vir:94869 Length: 378 59.4 0.39 0.00024 22.8 21.6 365 1-527 1-377 (378) 263 protein:vir:95254 Length: 488 53.3 0.53 0.00033 22.1 29.1 439 19-526 1-488 (488) 264 protein:vir:858 Length: 378 # 50.2 0.62 0.00038 21.7 21.0 353 1-521 1-378 (378) 265 protein:vir:105154 Length: 525 46.9 0.72 0.00045 21.4 19.3 432 1-527 35-520 (525) 266 protein:vir:98643 Length: 395 26.9 1.9 0.0012 19.1 26.7 371 14-520 1-395 (395) 267 protein:vir:9641 Length: 395 # 25.9 2 0.0012 18.9 28.0 364 1-527 1-392 (395) 268 protein:vir:78310 Length: 376 25.3 2.1 0.0013 18.8 25.1 354 1-521 1-376 (376) No 1 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=100.00 E-value=3e-158 Score=884.21 Aligned_cols=520 Identities=77% Similarity=1.176 Sum_probs=496.5 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) ||||++||+|||||+.++++|+++++++|++|+++++|+.||+.|++||+|+++++.+++..++.++++++|+|||+.|| T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIKSRPMNHLPIARTAS 80 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchhcccceecchHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcCCceEEEEEcCC Q lcl|NC_019418. 81 KKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQAPVFLPLQSNTQ 160 (527) Q Consensus 81 ~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a~~~~P~~~d~~ 160 (527) +++|+|||+|||+|++++++++++|++++++|+|..+++++++.|+++|++|||||||+++++|++|+|++|+|++|+++ T Consensus 81 ~~~A~lv~~e~~~i~v~d~~~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~~~~~i~~v~ad~~~P~~~~~~ 160 (522) T protein:vir:47 81 KKIASLVYNEQATITTKNEILQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYIDGDKVRVAFIQAPVFFPLESNTQ 160 (522) T ss_pred HHHhhhhcCCcceeecCChHHHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEcCCceEEEEEcCCceEEEEEcCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccc--cCCcccc Q lcl|NC_019418. 161 DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSEL--YPDLQPV 238 (527) Q Consensus 161 ~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~--~~~l~~~ 238 (527) ++++||+|+++++.++++++|||+||+|+|...+......+..+++|+|+|+||++.+.++||.+|||+++ |++|+++ T Consensus 161 ~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~ 240 (522) T protein:vir:47 161 DVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLSELDKYKNLEPV 240 (522) T ss_pred ceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCccccccccccccCCCCc Confidence 99999999999999999999999999999999888888888899999999999999999999999999987 8899999 Q ss_pred eeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCcccc Q lcl|NC_019418. 239 TPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGNIA 318 (527) Q Consensus 239 ~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~~~ 318 (527) ++++|+++|+|+|||+|.+||++.+||||+|+|++++++||+||++||+|+|||++|+++|+||++|++...++.+|+.. T Consensus 241 ~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~ 320 (522) T protein:vir:47 241 TVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVIVPEHLTQRQYQRPDGTID 320 (522) T ss_pred eEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccceeecchHHhccCCCCCCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999888 Q ss_pred cccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHH Q lcl|NC_019418. 319 FKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTY 398 (527) Q Consensus 319 ~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~ 398 (527) .++.||.++++|++++.+.+++.+|+++||+||+++|.++++.+|++|+++||||+++||++++|.+|||||++++++++ T Consensus 321 ~~~~fd~~~~~f~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~~kTAtEi~s~~~~~~ 400 (522) T protein:vir:47 321 FRPRFDVEQNVYMQIGGSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQGMKTATEIVSENSDTY 400 (522) T ss_pred cccccCcccceEeecCCCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccccccccHHHHHHHHHHHH Confidence 88899999999999998888888999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCC Q lcl|NC_019418. 399 QMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLG 478 (527) Q Consensus 399 ~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~ 478 (527) +|+++|++.|+.+|++|+++|++|+++++++++.++..++|+|+|+|++++|++++++++++++++|+||+++||+++|| T Consensus 401 ~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~~~~~~~~v~aG~~s~e~~i~~~~g 480 (522) T protein:vir:47 401 QMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAELDYWAKMVAAGFSTKKRAIGKTLN 480 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019418. 479 ITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVD 522 (527) Q Consensus 479 ~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (527) |||+||++|++||++|++++.+...++++. +++.+.++|+.| T Consensus 481 ~~eeea~~el~ri~~E~~~~~~~~~~~~~~--~~~~~~~~d~~~ 522 (522) T protein:vir:47 481 ISGVEAEKELNAINSELLPMNDAELAIYGM--HDQNEEKADDKG 522 (522) T ss_pred CChHHHHHHHHHHHHhhccCCCCCCCCCCC--CCcccccCCCCC Confidence 999999999999999998877666666542 223223333333 No 2 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=100.00 E-value=3.7e-153 Score=856.29 Aligned_cols=506 Identities=41% Similarity=0.680 Sum_probs=469.4 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) |+||++||+|||||+.+|+.|+++++++|++|++|++|++||++|++||+|+++|+.+++..++.++++++|+|+|+.|| T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~~~i~ 80 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLRKLSA 80 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecCcHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhhhhcccceEeeCCH-----------HHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcC Q lcl|NC_019418. 81 KKIASLVYNEQAEISAEDE-----------TLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQA 149 (527) Q Consensus 81 ~~~A~ll~~e~~~i~~~d~-----------~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a 149 (527) +++|+|||+|+|+|++++. .++++|++++++|+|..+++++++.|+++|+++||||||+++++|++|+| T Consensus 81 ~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~~~~~I~~v~a 160 (517) T protein:vir:98 81 DVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPYVDNGEIEFSWALA 160 (517) T ss_pred HHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEEEeCCeeEEEEEcC Confidence 9999999999999999863 47899999999999999999999999999999999999999999999999 Q ss_pred CceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecc Q lcl|NC_019418. 150 PVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLS 229 (527) Q Consensus 150 ~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~ 229 (527) ++|+|+.++++++++|||++..++..+++.+|||+||+|+|... ....+.|+|+|+||++.+++.||.+|||+ T Consensus 161 d~~~Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~-------~~~~~~y~I~n~ly~s~~~~~lG~~v~L~ 233 (517) T protein:vir:98 161 NAFYPLRSNSNGISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKT-------EEGESLYVITNELYKSDNEGEIGKRIPLE 233 (517) T ss_pred CeeEEEEecCCCeEEEEEEEEEEEeecCCceEEEEEEEEecCce-------eccCCcEEEEEEEEecCCCcccccccccc Confidence 99999999999999999998888888888899999999998643 23356899999999999999999999999 Q ss_pred cccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCC Q lcl|NC_019418. 230 ELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLK 309 (527) Q Consensus 230 ~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~ 309 (527) ++|++|++.++++|+++|+|+||++|++||++.+||||+|+|++++++||+||++||+|+|||++|+++|+||++|++.. T Consensus 234 ~~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~~i~vp~~~l~~~ 313 (517) T protein:vir:98 234 ELYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQRTVFVSDVMLRTV 313 (517) T ss_pred ccccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCcceecChhhhccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999776 Q ss_pred CCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHH Q lcl|NC_019418. 310 VQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATE 389 (527) Q Consensus 310 ~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAte 389 (527) .++.+ ...++.||.++++|++++++. ++.+++++||+||+++|.++++.+|++|+++||||+++||++++|.+|||| T Consensus 314 ~~~~g--~~~~~~~d~~~~~y~~~~~~~-~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~~kTATE 390 (517) T protein:vir:98 314 PDESG--MPPPQVFDPDVNVYKSIRMGT-DEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRSMKTATE 390 (517) T ss_pred cCCCC--cccCCCCCcccceeeeccCCC-CCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCcccccccccccccHHH Confidence 66544 445678999999999998754 456899999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCH Q lcl|NC_019418. 390 IVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQ 469 (527) Q Consensus 390 i~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~ 469 (527) |+++++++++|+++|++.|+++|++|+++|++|+++++++++..+..++|+|+|+|++++|++++++++++++++|+||+ T Consensus 391 i~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~D~~~~~~~~~~~v~aG~ms~ 470 (517) T protein:vir:98 391 IVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQDRSALLRFYGQAKTFGFIPT 470 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCCCHHHHHHHHHHHHhcCCCCH Confidence 99999999999999999999999999999999999999999999899999999999999999999999999999999999 Q ss_pred HHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccc Q lcl|NC_019418. 470 KRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDED 525 (527) Q Consensus 470 ~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (527) ++||+++||+||+||++|++||++|+++.++. +. ....++...||+| T Consensus 471 ~~~i~~~~g~~eeeA~~e~~~i~~E~~~~~~~----~~-----~~~~~~~~~gd~e 517 (517) T protein:vir:98 471 VEAIQRIFKVPKKTAEQWLEEIRKDQIELDPV----TI-----SQRAQKRMFGDEE 517 (517) T ss_pred HHHHHHhCCCChHHHHHHHHHHHHhccccCCC----Cc-----cccccCCCCCCCC Confidence 99999999999999999999999999754321 11 1111233344444 No 3 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=100.00 E-value=1e-152 Score=853.85 Aligned_cols=500 Identities=63% Similarity=1.015 Sum_probs=474.5 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) |+||++||+||||++++|++|+++++++|++|++|++|++||++|++||+|+++++.+++..+..+.++++|+|+|+.|| T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999998999999999999999 Q ss_pred HHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcCCceEEEEEcCC Q lcl|NC_019418. 81 KKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQAPVFLPLQSNTQ 160 (527) Q Consensus 81 ~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a~~~~P~~~d~~ 160 (527) +++|+|||+|||+|++++++++++|++++++|+|..+++++++.|+++|++|||||||+++++|++|+|++|+|++++++ T Consensus 81 ~~~A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~I~~v~ad~~~P~~~d~~ 160 (500) T protein:vir:30 81 KKIASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDGDKVRVAFVQAPVFLPLQSNTQ 160 (500) T ss_pred HHHhhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeeEEEEEcCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCccccee Q lcl|NC_019418. 161 DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTP 240 (527) Q Consensus 161 ~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~ 240 (527) +++++||++++++..+.+.+|||+||+|+|. ++++|+|+|++|++.+.+.+|.+|||+++|++++++++ T Consensus 161 ~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~-----------~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~ 229 (500) T protein:vir:30 161 DVSSAAVVIKSVKTINGKEVYYTLIEFHEWQ-----------SSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAK 229 (500) T ss_pred CeEEEEEEEEEeeeecCCceEEEEEEEEEEe-----------CCceeEEEEEEEecccccccCcccccccccCCcCcceE Confidence 9999999999888888788999999999963 35689999999999999999999999999999999999 Q ss_pred ecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCcccccc Q lcl|NC_019418. 241 IQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGNIAFK 320 (527) Q Consensus 241 ~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~~~~~ 320 (527) ++|+++|+|+||++|.+||++.+||||+|+|++++++||+||++||+|+|||++|+++|+||++|++...++.+|+...+ T Consensus 230 ~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~~~ 309 (500) T protein:vir:30 230 VTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGDVVPR 309 (500) T ss_pred eccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCccccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999888888888 Q ss_pred cccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHH Q lcl|NC_019418. 321 RRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQM 400 (527) Q Consensus 321 ~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~ 400 (527) +.|++++++|+.++++++++.+|++++|+||+++|.++++.+|++++++||||+++||++++|.+|||||+++++++++| T Consensus 310 ~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t 389 (500) T protein:vir:30 310 PRFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQM 389 (500) T ss_pred cccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHH Confidence 89999999999998888888899999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCC Q lcl|NC_019418. 401 RNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLGIT 480 (527) Q Consensus 401 ~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~ 480 (527) ++++++.|+++|++|+++|++++++++++++.++..++|+|+|+|++++|++++++++++++++|+||+++||+++||++ T Consensus 390 ~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~~g~~ 469 (500) T protein:vir:30 390 RNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKVLNVT 469 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCCCC Confidence 99999999999999999999999999999999889999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCC Q lcl|NC_019418. 481 EEEAEKELAEINGELPPESDAELALYGKGQQNTVGN 516 (527) Q Consensus 481 deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 516 (527) |+||++|++||++|++++.+.+.. +.+..++ T Consensus 470 eeea~~~l~~i~~E~~~~~~~~~~-----~~~~~g~ 500 (500) T protein:vir:30 470 EEKAQEIAAEINTGIVDEINQQRT-----DTHLYGE 500 (500) T ss_pred HHHHHHHHHHHHHhccccCCCCCc-----cccccCC Confidence 999999999999998754433211 1111111 No 4 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=100.00 E-value=1e-152 Score=853.85 Aligned_cols=500 Identities=63% Similarity=1.015 Sum_probs=474.5 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) |+||++||+||||++++|++|+++++++|++|++|++|++||++|++||+|+++++.+++..+..+.++++|+|+|+.|| T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999998999999999999999 Q ss_pred HHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcCCceEEEEEcCC Q lcl|NC_019418. 81 KKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQAPVFLPLQSNTQ 160 (527) Q Consensus 81 ~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a~~~~P~~~d~~ 160 (527) +++|+|||+|||+|++++++++++|++++++|+|..+++++++.|+++|++|||||||+++++|++|+|++|+|++++++ T Consensus 81 ~~~A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~I~~v~ad~~~P~~~d~~ 160 (500) T protein:vir:98 81 KKIASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDGDKVRVAFVQAPVFLPLQSNTQ 160 (500) T ss_pred HHHhhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeeEEEEEcCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCccccee Q lcl|NC_019418. 161 DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTP 240 (527) Q Consensus 161 ~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~ 240 (527) +++++||++++++..+.+.+|||+||+|+|. ++++|+|+|++|++.+.+.+|.+|||+++|++++++++ T Consensus 161 ~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~-----------~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~ 229 (500) T protein:vir:98 161 DVSSAAVVIKSVKTINGKEVYYTLIEFHEWQ-----------SSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAK 229 (500) T ss_pred CeEEEEEEEEEeeeecCCceEEEEEEEEEEe-----------CCceeEEEEEEEecccccccCcccccccccCCcCcceE Confidence 9999999999888888788999999999963 35689999999999999999999999999999999999 Q ss_pred ecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCcccccc Q lcl|NC_019418. 241 IQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGNIAFK 320 (527) Q Consensus 241 ~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~~~~~ 320 (527) ++|+++|+|+||++|.+||++.+||||+|+|++++++||+||++||+|+|||++|+++|+||++|++...++.+|+...+ T Consensus 230 ~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~~~ 309 (500) T protein:vir:98 230 VTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGDVVPR 309 (500) T ss_pred eccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCccccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999888888888 Q ss_pred cccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHH Q lcl|NC_019418. 321 RRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQM 400 (527) Q Consensus 321 ~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~ 400 (527) +.|++++++|+.++++++++.+|++++|+||+++|.++++.+|++++++||||+++||++++|.+|||||+++++++++| T Consensus 310 ~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t 389 (500) T protein:vir:98 310 PRFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQM 389 (500) T ss_pred cccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHH Confidence 89999999999998888888899999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCC Q lcl|NC_019418. 401 RNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLGIT 480 (527) Q Consensus 401 ~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~ 480 (527) ++++++.|+++|++|+++|++++++++++++.++..++|+|+|+|++++|++++++++++++++|+||+++||+++||++ T Consensus 390 ~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~~g~~ 469 (500) T protein:vir:98 390 RNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKVLNVT 469 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCCCC Confidence 99999999999999999999999999999999889999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCC Q lcl|NC_019418. 481 EEEAEKELAEINGELPPESDAELALYGKGQQNTVGN 516 (527) Q Consensus 481 deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 516 (527) |+||++|++||++|++++.+.+.. +.+..++ T Consensus 470 eeea~~~l~~i~~E~~~~~~~~~~-----~~~~~g~ 500 (500) T protein:vir:98 470 EEKAQEIAAEINTGIVDEINQQRT-----DTHLYGE 500 (500) T ss_pred HHHHHHHHHHHHHhccccCCCCCc-----cccccCC Confidence 999999999999998754433211 1111111 No 5 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=100.00 E-value=2.5e-149 Score=835.23 Aligned_cols=495 Identities=37% Similarity=0.617 Sum_probs=464.6 Q ss_pred CChHHHHHHHHHHHHHHh-hcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNM-TTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~-~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i 79 (527) |+||++||+|||||..++ +.++++++++|++|+++++|++||+.|++||+|+++|+.+++..|+.+.++++|+|+|+.| T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~slnl~~~i 80 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQKHELQSVNVTKLA 80 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccCCCccccceeecchHHHH Confidence 999999999999999887 7999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcCCceEEEEEcC Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQAPVFLPLQSNT 159 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a~~~~P~~~d~ 159 (527) |+++|+|||+|||+|++++++.+++|++++++|+|..+++++++.|+++|++|||||||+++++|++|+|++|+|+++++ T Consensus 81 ~~~~A~ll~~e~~~i~~~d~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D~~~~~i~~v~ad~~~P~~~d~ 160 (505) T protein:vir:79 81 SAKLASLIFNEQCQVTVSDETANDFLDDVFQQNDFYTTFEEKLEEWIALGSGCVRPYVDSGKIKLAWATADQVYPLQADT 160 (505) T ss_pred HHHHHhhhcCCCceeecCChHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCeEEEEEEeCCceEEEEEcCCeeEEEEEcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccc--cCCccc Q lcl|NC_019418. 160 QDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSEL--YPDLQP 237 (527) Q Consensus 160 ~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~--~~~l~~ 237 (527) +++.++||+.++++.++++..|||+||+|+| ++++|+|+|+||++.+.++||.+|||+++ |++|++ T Consensus 161 ~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~------------~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~ 228 (505) T protein:vir:79 161 NQVNELAIASRTTEVENHRTIYYTLLEFHQW------------DHGDYVITNELYRSEAAETVGINVPLNSLEQYEGLEP 228 (505) T ss_pred CCeEEEEEEEEEEEecCCcceEEEEEEEEEe------------cCceEEEEEEEEecCCCCccCcccchhhcccccccCc Confidence 9999999999999888888899999999986 46789999999999999999999999987 889999 Q ss_pred ceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCccc Q lcl|NC_019418. 238 VTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGNI 317 (527) Q Consensus 238 ~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~~ 317 (527) +++++|+++|+|+|||||.+||++++||||+|+|++++++||+||++||+|+|||++|+++|+||++|++..+++.+... T Consensus 229 ~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~~~~ 308 (505) T protein:vir:79 229 QVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQRRLIVPAEWLKTGSSYGGQAS 308 (505) T ss_pred ceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcccCCCCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999988877766543 Q ss_pred c-cccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHH Q lcl|NC_019418. 318 A-FKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSD 396 (527) Q Consensus 318 ~-~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~ 396 (527) . .++.|+.++++|.++..++ ++.+++++||+||+++|.++++.++++|+++||||+++||++++|.+|||||++++++ T Consensus 309 ~~~~~~fd~~~~~y~~~~~~~-~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~~~ 387 (505) T protein:vir:79 309 ETHPPMFDPDETVYQAMYGDA-SEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNNSQ 387 (505) T ss_pred cccccCCCccceeeeeccCCC-CCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHHHHHHHHhH Confidence 3 3356899999999987654 4568999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCC------cccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHH Q lcl|NC_019418. 397 TYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRG------TIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQK 470 (527) Q Consensus 397 ~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~------~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~ 470 (527) |++|+++|++.|+++|++|+++|+++++.+++.+. ..++..+++|+|+|++++|++++++++++++++|+||++ T Consensus 388 l~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~v~~Gi~s~e 467 (505) T protein:vir:79 388 TYQTRSSYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRAADLQAVQAQVMPKK 467 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHH Confidence 99999999999999999999999999988765432 334567899999999999999999999999999999999 Q ss_pred HHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCC Q lcl|NC_019418. 471 RGIAKTLGITEEEAEKELAEINGELPPESDAELALYGK 508 (527) Q Consensus 471 ~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~ 508 (527) +|++++|||||+||++|++||++|+++..+....++|+ T Consensus 468 ~~l~~~~~~~eeea~~el~ri~~E~~~~~p~~~~~gg~ 505 (505) T protein:vir:79 468 QFLMRNYGLDEEEADEWLAQIDAENSTAEPEFNQFGGD 505 (505) T ss_pred HHHHhcCCCChHHHHHHHHHHHHhccccCCCchhccCC Confidence 99999999999999999999999998766565556654 No 6 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=100.00 E-value=4.2e-147 Score=823.09 Aligned_cols=496 Identities=44% Similarity=0.736 Sum_probs=456.0 Q ss_pred CChHHHHHHHHHHHHHHh-hcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNM-TTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~-~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i 79 (527) |+||++||+||||++++| +.+++.++++|++|++|++|+.||++|++||+|+++++.+++..|.++.++++|+|+|+.| T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~~~~sln~~~~i 80 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKRLKNTINMAKTA 80 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccCCCCccccceeecchHHHH Confidence 999999999999999998 6999999999999999999999999999999999999999999999888999999999999 Q ss_pred HHHHhhhhhcccceEeeC-CHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcCCceEEEEEc Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAE-DETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQAPVFLPLQSN 158 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~-d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a~~~~P~~~d 158 (527) |+++|+|||+|||+|+++ ++..+++|++++++|+|..+++++++.|+++|++|||||||+++++|++|+|++|+|+.++ T Consensus 81 ~~~~A~lv~~e~~~i~v~~~~~~~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~i~~v~ad~~~P~~~d 160 (508) T protein:vir:15 81 ARRIASVVFNEKAEIHVKDNNEADKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRPYIDGNHIKIAWVRADQFYPLQSN 160 (508) T ss_pred HHHHHhhhhCCCceEEeCCchHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEeCCeeEEEEEcCCeeEEEEEc Confidence 999999999999999995 5667789999999999999999999999999999999999999999999999999999999 Q ss_pred CCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccc--cCCcc Q lcl|NC_019418. 159 TQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSEL--YPDLQ 236 (527) Q Consensus 159 ~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~--~~~l~ 236 (527) +++++++||+.+..+.+..+.+|||+||+|+|. ++++|+|+|++|++++.+++|.+|||+++ |++|+ T Consensus 161 ~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~-----------~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~ 229 (508) T protein:vir:15 161 TNDISEAAIASRTQRTESNQTKYYTLLEFHQWQ-----------DNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELA 229 (508) T ss_pred CCCeEEEEEEEEEEeecCCCceEEEEEEEEEEe-----------cCcceEEEEEEEecCCchhcCcccchhhcccccCCC Confidence 999999999999888888888899999999863 46799999999999999999999999987 88999 Q ss_pred cceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCcc Q lcl|NC_019418. 237 PVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGN 316 (527) Q Consensus 237 ~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~ 316 (527) ++++++|+++|+|+|||+|++||++.+||||+|+|++++++||+||++||+|+|||++|+++|+||+++++.+++ ++ T Consensus 230 ~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~d~~--~~- 306 (508) T protein:vir:15 230 PQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIRLGQKHIAVQPGMLRFDDE--HK- 306 (508) T ss_pred cceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcCCCC--Cc- Confidence 999999999999999999999999999999999999999999999999999999999999999999999975443 32 Q ss_pred cccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHH Q lcl|NC_019418. 317 IAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSD 396 (527) Q Consensus 317 ~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~ 396 (527) +.|+.++++|++++.+++++.+|+++||+||+++|.++++.++++|+++||+|+++||++++|.+|||||++++++ T Consensus 307 ----~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~s~~~~ 382 (508) T protein:vir:15 307 ----PTFDTEQNVYVGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVVSNNSM 382 (508) T ss_pred ----cccCCCCeeEEeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHHHHHHH Confidence 4578889999999888777788999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCc--------ccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCC Q lcl|NC_019418. 397 TYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGT--------IPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFAT 468 (527) Q Consensus 397 ~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~--------~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s 468 (527) |++|+++|++.|+.+|++|+++|+++++++++.++. +....+|+|+|+|++++|++++++++++++++|+|| T Consensus 383 ~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~aGi~s 462 (508) T protein:vir:15 383 TYQTRSSYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVLAIGALS 462 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHHhcCCCC Confidence 999999999999999999999999999987765543 245678999999999999999999999999999999 Q ss_pred HHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccc Q lcl|NC_019418. 469 QKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDED 525 (527) Q Consensus 469 ~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (527) +++||+++|||||+||++|++||++|++......+.+...++. +.+ T Consensus 463 ~e~~i~~~~g~~deea~~el~ri~~E~~~~~~~~~~~~~~~g~-----------~ge 508 (508) T protein:vir:15 463 KQTFLQRNYGMTDEQAAEELAKIQSEAPTDTFEGGRSAILNGG-----------DGE 508 (508) T ss_pred HHHHHHhcCCCChHHHHHHHHHHHHhccccCccccccccCCCC-----------CCC Confidence 9999999999999999999999999987544333322211111 111 No 7 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=100.00 E-value=8.2e-135 Score=755.70 Aligned_cols=492 Identities=32% Similarity=0.592 Sum_probs=444.2 Q ss_pred hHHHHHHHHHHHHHHhh-cccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccc--cCccccCceeecchHHHH Q lcl|NC_019418. 3 LIQKVKDFFNRGRYNMT-TSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNT--DGDRKRRKMQHLPIARTA 79 (527) Q Consensus 3 ~~~~~k~~~~~~~~~~~-~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~--~~~~~~~~~~~lnl~~~i 79 (527) ||+++|+|||.++.+|+ .+++.++++|++|+++++|+.||.+|++||+|+++.|..++. .+..+.++++|+|+|+.| T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~s~n~~~~i 80 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNRRQLSMNLPKVT 80 (499) T ss_pred ChhHHHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccccceeecchHHHH Confidence 99999999999998885 778999999999999999999999999999999988765543 345566888999999999 Q ss_pred HHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe-CCeeEEEEEcCCceEEEEEc Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD-GDKIRVAFIQAPVFLPLQSN 158 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d-~~~~~i~~v~a~~~~P~~~d 158 (527) |+++|+|||+||++|++++++++++|++++++|+|..++.++++.|+++|++|+|||+| +++++|++|+|+++||++++ T Consensus 81 v~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~a~~~~Pi~~d 160 (499) T protein:vir:80 81 AKYMSKLLFNEKVKINIDDETAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFATADCMYPLSND 160 (499) T ss_pred HHHHHHhhhCCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEEEEECCCCcEEEEEEcCCceEEEEec Confidence 99999999999999999999999999999999999999999999999999999999998 47899999999999999999 Q ss_pred CCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccc Q lcl|NC_019418. 159 TQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPV 238 (527) Q Consensus 159 ~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~ 238 (527) ++++..++|+.... .+++|||+||+|+|.. ...+.|+|+|++|++++.+.+|.+|||+++|+++++. T Consensus 161 ~~~~~~~~f~~~~~----~~~~~y~~lE~h~~~~---------~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~ 227 (499) T protein:vir:80 161 SENVDECLIANSFH----KNNKYYKLLEWNEWKG---------EKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPV 227 (499) T ss_pred CCCeEEEEEEEEEe----ecCeEEEEEEEEEecc---------cceeeEEEEEEEEeccCccccCcccchhhhccCcCCc Confidence 88888777765443 2456999999999743 2356899999999999999999999999999999999 Q ss_pred eeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCcccc Q lcl|NC_019418. 239 TPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGNIA 318 (527) Q Consensus 239 ~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~~~ 318 (527) +.++|+++|+|+|||+|++|+++.+||||+|+|++++++||+||++||+|+|+|++|+++|+||++|++...+++|+.. T Consensus 228 ~~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~g~~~- 306 (499) T protein:vir:80 228 VPLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGSTT- 306 (499) T ss_pred eeecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhcccceecchhhhhccCCCCCCcc- Confidence 9999999999999999999999999999999999999999999999999999999999999999999998877776643 Q ss_pred cccccccccceeeeccCCCC-CCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHH Q lcl|NC_019418. 319 FKRRFDVEQNVYMQVGAGNM-DSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDT 397 (527) Q Consensus 319 ~~~~~d~~~~~~~~~~~~~~-~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~ 397 (527) +.|+.++++|..+.+..+ ++.+|++++|+||+++|.++++.++++|+++||+|+++||++++|.+|||||+++++++ T Consensus 307 --~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l 384 (499) T protein:vir:80 307 --QYFDSTDEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSET 384 (499) T ss_pred --cCCCcccceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHHHH Confidence 457788889988765443 34589999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcC Q lcl|NC_019418. 398 YQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTL 477 (527) Q Consensus 398 ~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~ 477 (527) +++++++++.|+++|++|+++|++++++++..++..++..+++|+|+|++++|++++++++++++++|+||++++|++++ T Consensus 385 ~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~ 464 (499) T protein:vir:80 385 YQTKNSHSQLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYTTAKNQGMIPLKIALQRAW 464 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHhhcC Confidence 99999999999999999999999999988887777778889999999999999999999999999999999999999999 Q ss_pred CCCHHHHHHHHHHHHHhcccccc--cccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 478 GITEEEAEKELAEINGELPPESD--AELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 478 ~~~deea~~el~ri~~E~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) |++|+||++|++||++|++...+ +..+++|+ +| T Consensus 465 ~~~d~ea~~el~~i~~E~~~~~~~~d~~g~~ge----------------~e 499 (499) T protein:vir:80 465 NITEAEADEWAEMLAKEKQAEIPNNDMTGIFGE----------------EE 499 (499) T ss_pred CCChHHHHHHHHHHHHHhhcCCCCCCccccCCC----------------CC Confidence 99999999999999999865321 11122221 11 No 8 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=100.00 E-value=5.1e-127 Score=712.97 Aligned_cols=491 Identities=32% Similarity=0.590 Sum_probs=442.9 Q ss_pred hHHHHHHHHHHHHHHh-hcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccc--cCccccCceeecchHHHH Q lcl|NC_019418. 3 LIQKVKDFFNRGRYNM-TTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNT--DGDRKRRKMQHLPIARTA 79 (527) Q Consensus 3 ~~~~~k~~~~~~~~~~-~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~--~~~~~~~~~~~lnl~~~i 79 (527) ||++|++|||.++++| +.++++++.++++|+++++|+.||.+|++||.|+|+.|..+.. .++.+.++++|+|+|+.| T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~~k~i 80 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSMNLPKVT 80 (496) T ss_pred ChhHHHHHHHHHHHHhccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeecchHHHH Confidence 9999999999999998 5799999999999999999999999999999999998865443 456667888999999999 Q ss_pred HHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEEEEEcCCceEEEEEc Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRVAFIQAPVFLPLQSN 158 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i~~v~a~~~~P~~~d 158 (527) |+++|+|||++||+|++++++.+++|++++++|+|..++.++++.|+++|++|++||+|. ++++|++|+|+++||++++ T Consensus 81 ~~~~a~~l~~~p~~i~~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~~~~~~P~~~~ 160 (496) T protein:vir:38 81 AKYMSKLLFNEKVKINIDDKAAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFATADCMYPLSND 160 (496) T ss_pred HHHHhhhhhCCcceEeeCChHHHHHHHHHHhccCHHHHHHHHHHHHhhhCcEEEEEEEcCCCcEEEEEEcccceEEEEec Confidence 999999999999999999999999999999999999999999999999999999999984 7899999999999999998 Q ss_pred CCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccc Q lcl|NC_019418. 159 TQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPV 238 (527) Q Consensus 159 ~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~ 238 (527) ++++..+||+..+. .++++|++||+|++ +++.|+|+|++|++.+++.+|++||++++|+++++. T Consensus 161 ~~~~~~~~f~~~~~----~~~~~y~~le~h~~------------~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~~ 224 (496) T protein:vir:38 161 SENVDECVIANSFH----KNNKYYTLLEWNEW------------QGDVYTVTTELYQSDDPNELGTKVSLTLLFDDIEPV 224 (496) T ss_pred CCcEEEEEEEEEEE----eCCeEEEEEEEEEE------------eCceEEEEEEEEecCCccccCccccccccccccccc Confidence 88777777765442 24568999999985 467899999999999999999999999999999999 Q ss_pred eeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCcccc Q lcl|NC_019418. 239 TPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGNIA 318 (527) Q Consensus 239 ~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~~~ 318 (527) +.++|+++|+|+||++|.+|+.+.+||+|+|+|++++++||+||++||+++|+|++++++|+||+++++..++++|+.. T Consensus 225 ~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~- 303 (496) T protein:vir:38 225 VPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGSTT- 303 (496) T ss_pred eeecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhhcccceecchHHhhccCCCCCccc- Confidence 9999999999999999999999999999999999999999999999999999999999999999999988888776643 Q ss_pred cccccccccceeeeccCCC-CCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHH Q lcl|NC_019418. 319 FKRRFDVEQNVYMQVGAGN-MDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDT 397 (527) Q Consensus 319 ~~~~~d~~~~~~~~~~~~~-~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~ 397 (527) +.|+.+.++|..+.... ++..+++.++++||+++|.++++.+++++++.||+|+++||++++|.+|||||++++++| T Consensus 304 --~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~l 381 (496) T protein:vir:38 304 --QYFDSTDEAFFLYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSET 381 (496) T ss_pred --cCCCCccceEEEeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHHHH Confidence 34677778887766543 344589999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcC Q lcl|NC_019418. 398 YQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTL 477 (527) Q Consensus 398 ~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~ 477 (527) +++++++++.|+++|++|+++|+++++++..+++......+++|+|+|++++|++++++++++++++|+||++++|++++ T Consensus 382 ~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~GiiS~et~l~~~~ 461 (496) T protein:vir:38 382 YQTKNSHSQLIEQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAKNQGMIPLKIALQRAW 461 (496) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcC Confidence 99999999999999999999999999988888887788888999999999999999999999999999999999999999 Q ss_pred CCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 478 GITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 478 ~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) +++|+||++|++||++|++.+.. .+++++.++ ++| T Consensus 462 ~~~d~ea~~el~ri~~E~~~~~~-~~d~~~~~~-------------~~e 496 (496) T protein:vir:38 462 NITEAEADEWAEMLAKEKQAEMP-NNDMNGIFG-------------EEE 496 (496) T ss_pred CCChHHHHHHHHHHHHhhhccCc-cccccCCCC-------------CCC Confidence 99999999999999999865431 112211111 111 No 9 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=100.00 E-value=1.7e-117 Score=660.81 Aligned_cols=485 Identities=16% Similarity=0.087 Sum_probs=393.3 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccc------cc--ccCccccCceee Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEY------TN--TDGDRKRRKMQH 72 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~------~~--~~~~~~~~~~~~ 72 (527) ||+|+.||+||+.+. . +..++..+++|..++++|.+....+.+ .+ ...+...++++| T Consensus 1 ~~~~~~~~~~i~~w~---~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~ 65 (518) T protein:vir:78 1 MGVWSVMTRFIKGWL---N------------GKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVHDKLMN 65 (518) T ss_pred CcchhhHHHHHHHhh---c------------CCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCcccccccc Confidence 999999999998654 2 222445677788887777776443321 11 122344567889 Q ss_pred cchHHHHHHHHhhhhhcccceEee------CCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEE Q lcl|NC_019418. 73 LPIARTAAKKIASLVYNEQAEISA------EDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVAF 146 (527) Q Consensus 73 lnl~~~i~~~~A~ll~~e~~~i~~------~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~ 146 (527) +|||+.||+++|+|||+|+|+|+| +++.++++|++++++|+|..+++++++.|+++|++|||||||+++++|++ T Consensus 66 ~~l~~~i~~~~A~ll~~e~~~i~v~~~~~~d~e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~i~~ 145 (518) T protein:vir:78 66 SGTGNEIVVVAAEYISGKPLSIDVTGVNGSKDENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINILNGRPSISV 145 (518) T ss_pred CChHHHHHHHHHHhhcCCCceEEecCccccCcHHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEECCeeEEEE Confidence 999999999999999999999998 46778999999999999999999999999999999999999999999999 Q ss_pred EcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCcccc-Cce Q lcl|NC_019418. 147 IQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQL-GER 225 (527) Q Consensus 147 v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~l-G~~ 225 (527) |+|++|+|++++ +..+.|+ |++.+..+++ ..|||+||+|++.... .....+++|+|+|+||++.....+ +.. T Consensus 146 v~ad~~~P~~~~-g~~~~~~-f~~~~~~~~k-~~~y~~lE~he~~~~~----~~~~~~~~~~I~n~ly~~~~~~~v~~~~ 218 (518) T protein:vir:78 146 HSSSQFWIDFKN-NEPFRFN-FFEEIPTSNK-ADIYYLVESREIKQWD----KEGKKLSGGFVTYSVIKIDGDKTTPISA 218 (518) T ss_pred EcCCeeEEEeec-CcEEEEE-EEEEeecCCc-ceeEEEEEeecccccc----ceeecccceeEEEEEeeecCcccccccc Confidence 999999999765 4455544 4455555444 4589999999986543 233457789999999987533322 233 Q ss_pred eeccc------ccCCcccceee-cCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcce Q lcl|NC_019418. 226 VNLSE------LYPDLQPVTPI-QGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRR 298 (527) Q Consensus 226 v~l~~------~~~~l~~~~~~-~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~ 298 (527) +++.. .|+++.+.+.+ +|+.+|+|.|+||+.+||++.+||||+|+|++++++||+||++||+|+|||++|+++ T Consensus 219 ~~~~~~l~~~~~~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~ 298 (518) T protein:vir:78 219 ERLPEQITSYLHTNDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEKTKTK 298 (518) T ss_pred cccccccccccccccCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHhCCce Confidence 33322 25566666554 566677777778889999999999999999999999999999999999999999999 Q ss_pred eeechhHhcCCCCCCCcccccccccccccceeeeccCCCC----CCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_019418. 299 VIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNM----DSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSS 374 (527) Q Consensus 299 i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~----~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~ 374 (527) |+||++|++...++.++ ...+.|+.++++|+++++..+ .+..|+.+||+||+++|.++++.+|++++++||+|| T Consensus 299 i~v~~~~l~~~~~~~~~--~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~ 376 (518) T protein:vir:78 299 IAASERMFRKKVNKSTD--KEEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNP 376 (518) T ss_pred eeechhHhccCCCCCCC--ccccccCCCCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCCh Confidence 99999999876665543 344668889999998865432 233699999999999999999999999999999999 Q ss_pred ccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc--cCCcccCccceEEEeCCCccCCHH Q lcl|NC_019418. 375 GMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGI--YRGTIPELDDISVNLDDGVFTDRH 452 (527) Q Consensus 375 ~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~--~~~~~~~~~~v~v~f~d~i~~d~~ 452 (527) ++||.+ ++.+|||||++++++|++|+++|++.|+++|++|+++|+++++.+.. ......+..+|+|+|+|++++|++ T Consensus 377 ~tfg~~-~~~~TATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~ 455 (518) T protein:vir:78 377 ATFNLG-NREVKATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLN 455 (518) T ss_pred hhcCcc-cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCHH Confidence 999986 56799999999999999999999999999999999999999887543 233455677899999999999999 Q ss_pred HHHHHHHHHHhcCCCCHHHHHHh-cCCCCHHHHHHHHHHHHHhcccccccccC----CCCCCC Q lcl|NC_019418. 453 AELDYWMKMVAAGFATQKRGIAK-TLGITEEEAEKELAEINGELPPESDAELA----LYGKGQ 510 (527) Q Consensus 453 ~~~~~~~~~~~aGi~s~~~~i~~-~~~~~deea~~el~ri~~E~~~~~~~~~~----~~~~~~ 510 (527) ++++++++++++|+||+++++++ +++|+|+||++|++||++|++....+++. ++.+++ T Consensus 456 ~~~~~~~~~v~aGimS~e~~i~~~~~~~~deea~~e~~ri~~E~~~~~~~~p~~~~g~~~~~g 518 (518) T protein:vir:78 456 ELSSTLNNMNSALAMSVEEKVKLIHPKWEDEEIQAEVKRIYLENAIGEVPDPEAIGGMETKGG 518 (518) T ss_pred HHHHHHHHHHhcCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHHhcccCCCCCccccCCCCCCC Confidence 99999999999999999999977 56899999999999999999865433332 333333 No 10 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=100.00 E-value=1e-58 Score=338.53 Aligned_cols=476 Identities=11% Similarity=0.085 Sum_probs=305.3 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc---------CccccCcee Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD---------GDRKRRKMQ 71 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~---------~~~~~~~~~ 71 (527) =+.+..+..+++.....+.......+.+ -+......++.++.+||.|+++++...... ...+.+.++ T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~----~i~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ri 84 (503) T protein:vir:59 9 KTHTEELNEIIVESAKEIAEPDTTMIQK----LIDEHNPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKTNNRT 84 (503) T ss_pred hhhHHhHHHhhhhhhhhccchhHHHHHH----HHHhhcHHHHHHHHHHhccccchhhccchhccccccccccccccccee Confidence 1122222233222211111111111100 012224578999999999998866433211 112234578 Q ss_pred ecchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEEEEEcCC Q lcl|NC_019418. 72 HLPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRVAFIQAP 150 (527) Q Consensus 72 ~lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i~~v~a~ 150 (527) ++|||+.||+..|+|+|++|++++++++..+++|+.+++ |+|...+.+++..++++|.+|+++|+|. +++++.+++|. T Consensus 85 ~~n~~~~ivd~~~~yl~g~~~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~~p~ 163 (503) T protein:vir:59 85 SHAWHKLFVDQKTQYLVGEPVTFTSDNKTLLEYVNELAD-DDFDDILNETVKNMSNKGIEYWHPFVDEEGEFDYVIFPAE 163 (503) T ss_pred ecchHHHHHHHHHhhhhcCCeeeccCcHHHHHHHHHHHh-cCHHHHHHHHHHHHhhCCeEEEEEeecCCCceEEEEEccc Confidence 899999999999999999999999999999999999885 7899999999999999999999999974 68999999999 Q ss_pred ceEEEEEcCC-ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCC-ccccCceeec Q lcl|NC_019418. 151 VFLPLQSNTQ-DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTS-DSQLGERVNL 228 (527) Q Consensus 151 ~~~P~~~d~~-~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~-~~~lG~~v~l 228 (527) +++|++.++. +...+++. .+.........++++|+|+... |.+ |.... ...++...+. T Consensus 164 ~~~~i~d~~~~~~~~~~ir--~~~~~~~~~~~~~~~evy~~~~----------------i~~--~~~~~~~~~~~~~~~~ 223 (503) T protein:vir:59 164 EMIVVYKDNTRRDILFALR--YYSYKGIMGEETQKAELYTDTH----------------VYY--YEKIDGVYQMDYSYGE 223 (503) T ss_pred eeEEEEeCCCCCceEEEEE--EEEEecCCCceEEEEEEEeCCc----------------EEE--EEEcCCcccccccccc Confidence 9999976653 44444442 2333333334556688876322 221 22111 1111111111 Q ss_pred ccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcC Q lcl|NC_019418. 229 SELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQL 308 (527) Q Consensus 229 ~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~ 308 (527) ......+......+++.+++|++|++ +++|.|+|++++++||+||.++|+++++++....++.+...+- T Consensus 224 ~~~~~~~~~~~~~~~~~~vPiv~~~n---------n~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~-- 292 (503) T protein:vir:59 224 NNPRPHMTKGGQAIGWGRVPIIPFKN---------NEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYD-- 292 (503) T ss_pred cccccceeecceeccCCccceEEecC---------CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCC-- Confidence 11111112223346777888888864 2569999999999999999999999999998777777743321 Q ss_pred CCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHH Q lcl|NC_019418. 309 KVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTAT 388 (527) Q Consensus 309 ~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAt 388 (527) +.. ... ....... +..+..+ +.+.++.++++++.+.+...++.+.+.|...++.+.-.+ ...+|..||+ T Consensus 293 ---~~~--~~~-~~~~~~~--~~~~~~~--~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~~~~~~Sg~ 361 (503) T protein:vir:59 293 ---GEN--PKE-FTANLRY--HSVIKVS--GDGGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSP-ETIGGGATGP 361 (503) T ss_pred ---ccc--cch-hhhhhhc--ccceecc--CCCcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCc-ccccccccHH Confidence 111 000 0001111 1112222 234588899999999988888877776655544332222 1224567899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCC Q lcl|NC_019418. 389 EIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFAT 468 (527) Q Consensus 389 ei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s 468 (527) ++++..+.+.++++.+++.|+.+|+++++.|+.+.+.. .........+++|.|++++|.|..++++.+++++++|+|| T Consensus 362 Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~--~~~~~~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~GiiS 439 (503) T protein:vir:59 362 ALENLYALLDLKANMAERKIRAGLRLFFWFFAEYLRNT--GKGDFNPDKELTMTFTRTRIQNDSEIVQSLVQGVTGGIMS 439 (503) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--cCcccccccceeEEeCCCCCCCHHHHHHHHHHHHhCCCCc Confidence 99999999999999999999999999999999876542 2233334566999999999999999999999999999999 Q ss_pred HHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 469 QKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 469 ~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) .+|++..++++++ +++|++||++|+.........+......++.++.++...++++. T Consensus 440 ~et~l~~l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (503) T protein:vir:59 440 KETAVARNPFVQD--PEEELARIEEEMNQYAEMQGNLLDDEGGDDDLEEDDPNAGAAES 496 (503) T ss_pred hHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhhccccCccCCCCCCCcCCCCCCcccC Confidence 9999987755543 67889999887754333222221111111111111111111111 No 11 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=100.00 E-value=5.1e-58 Score=334.70 Aligned_cols=464 Identities=9% Similarity=0.006 Sum_probs=309.8 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccc-cCccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNT-DGDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~-~~~~~~~~~~~lnl~~~i 79 (527) +...+.|+++|++.. .....|++++++||.|+++.+..... ..+.+.++++++|+|+.| T Consensus 39 ~~~~~~i~~~i~~~~--------------------~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~I 98 (511) T protein:vir:96 39 LQNVNEVSKYIEHHM--------------------DYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYI 98 (511) T ss_pred hccHHHHHHHHHHHH--------------------HhhHHHHHHHHHHhcccCccccccCcCcccccCcceeecchHHHH Confidence 223344555554322 12345899999999999998755433 334445678899999999 Q ss_pred HHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEEEEEcCCceEEEEEc Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRVAFIQAPVFLPLQSN 158 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i~~v~a~~~~P~~~d 158 (527) |+..++|||++|++++++++..++.|++++++|+|...+.++++.++++|.+|.++|+|. +++++.+++|.+++|++.+ T Consensus 99 v~~~~~yl~g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~~p~~~~~vydd 178 (511) T protein:vir:96 99 SDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDN 178 (511) T ss_pred HHHHHhhhccCCceeecCchHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcC Confidence 999999999999999999999999999999999999999999999999999999999974 7899999999999999766 Q ss_pred CC-ceEEEEEEEEEEee-CCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcc Q lcl|NC_019418. 159 TQ-DVSSAAILTKTIKT-ENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQ 236 (527) Q Consensus 159 ~~-~~~~~a~~~~~~~~-~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~ 236 (527) +. +...+++.+..... +.....+....|.|.. ..|.. |...... ...++. . . T Consensus 179 ~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~----------------~~i~~--~~~~~~~----~~~~~~--~--~ 232 (511) T protein:vir:96 179 TIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTS----------------HGVYR--YLTSRTN----GLKLTP--R--E 232 (511) T ss_pred CCCCceEEEEEEEEeeeccccccceEEEEEEEeC----------------CcEEE--EEecCCC----cccccc--c--c Confidence 54 33333332222211 1112222223444321 11211 2221111 111110 0 0 Q ss_pred cceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCcc Q lcl|NC_019418. 237 PVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGN 316 (527) Q Consensus 237 ~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~ 316 (527) ....-+++..+++++|++ ++.|+|+|++++++||++|.++|++++.++....++.|...+....... ... T Consensus 233 ~~~~~~~~~~vPvv~~~n---------n~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~-~~~ 302 (511) T protein:vir:96 233 NGFESHSFERMPITEFSN---------NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE-VRK 302 (511) T ss_pred cccccccCCceeeEEecC---------CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCCchh-hcc Confidence 111234667777888864 2468999999999999999999999999998777777755544222111 111 Q ss_pred cccccccccccce-eeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHH Q lcl|NC_019418. 317 IAFKRRFDVEQNV-YMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENS 395 (527) Q Consensus 317 ~~~~~~~d~~~~~-~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~ 395 (527) ....+.+...... +.+.....++.+.+++++++++++.+...++.+.+.|...++.+.-+++. .+|..||.+++++.+ T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~-~~~n~Sg~Al~~~~~ 381 (511) T protein:vir:96 303 QKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDN-FSGTQSGEAMKYKLF 381 (511) T ss_pred cccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-ccccchHHHHHHHHH Confidence 1111111111111 11222233445568999999999999888888888887777655443322 235668999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHh Q lcl|NC_019418. 396 DTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAK 475 (527) Q Consensus 396 ~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~ 475 (527) .+.++++.+++.|+++|+++++.|+.+..... ......+..+++|+|++++|.|..+++++++++ +|+||.+|++.. T Consensus 382 ~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~-~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl--~G~iS~et~l~~ 458 (511) T protein:vir:96 382 GLEQRTKTKEGLFTKGLRRRAKLLETILKNTW-SIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSL 458 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-CcccccccccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHh Confidence 99999999999999999999999998765422 111223456799999999999999999988876 699999999977 Q ss_pred cCCCCHHHHHHHHHHHHHhccccccc----ccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 476 TLGITEEEAEKELAEINGELPPESDA----ELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 476 ~~~~~deea~~el~ri~~E~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) +++++| +++|++||++|+...... ....+....+.+..+..++..+++| T Consensus 459 l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:96 459 FSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCC--HHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCcccccccccC Confidence 755554 678899998887543222 1222222222233333344444444 No 12 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=100.00 E-value=8e-58 Score=333.62 Aligned_cols=464 Identities=10% Similarity=0.029 Sum_probs=306.0 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccc-cCccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNT-DGDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~-~~~~~~~~~~~lnl~~~i 79 (527) +.-.+.|+++|++.. .....|++++++||.|+|+.+..... ....+.++++++|+|+.| T Consensus 39 ~~~~~~i~~~i~~~~--------------------~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~I 98 (511) T protein:vir:99 39 LQNVNEVSKYIEHHM--------------------DYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYI 98 (511) T ss_pred hccHHHHHHHHHHHH--------------------HhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHH Confidence 223344555544322 12345788999999999998755433 333445678899999999 Q ss_pred HHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe-CCeeEEEEEcCCceEEEEEc Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD-GDKIRVAFIQAPVFLPLQSN 158 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d-~~~~~i~~v~a~~~~P~~~d 158 (527) |+..++|||++|++++++++..++.|++++++|+|...+.++++.++++|.+|+++|+| ++++++.+++|.++||++.+ T Consensus 99 v~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~~p~~~~~vyd~ 178 (511) T protein:vir:99 99 SDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDN 178 (511) T ss_pred HHHHHhhhcccCceeecCchHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcC Confidence 99999999999999999999999999999999999999999999999999999999997 57899999999999999766 Q ss_pred CC-ceEEEEEEEEEEee-CCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcc Q lcl|NC_019418. 159 TQ-DVSSAAILTKTIKT-ENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQ 236 (527) Q Consensus 159 ~~-~~~~~a~~~~~~~~-~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~ 236 (527) +. +...+++.+..+.. +..........|.|.. ..|.+ |...... ...+.. .. T Consensus 179 ~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~----------------~~i~~--~~~~~~~----~~~~~~----~~ 232 (511) T protein:vir:99 179 TIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTS----------------HGVYR--YLTSRTN----GLKLTP----RE 232 (511) T ss_pred CCCCceEEEEEEEEeeecccCccceEEEEEEEeC----------------CcEEE--EEecCCc----cccccc----cc Confidence 53 34444432222221 1112122223455431 11211 2111101 001000 01 Q ss_pred cceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCcc Q lcl|NC_019418. 237 PVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGN 316 (527) Q Consensus 237 ~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~ 316 (527) ....-+++..+++++|++ ++.|+|+|++++++||+||.++|++++.++....++.+-..+...... .... T Consensus 233 ~~~~~~~~g~vPvv~~~n---------n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~-~~~~ 302 (511) T protein:vir:99 233 NGFESHSFERMPITEFSN---------NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPV-EVRK 302 (511) T ss_pred cccccCCCCccceEEecC---------CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcccCch-hhcc Confidence 112235667777888864 246999999999999999999999999998755555553333211111 1111 Q ss_pred ccccccccccccee-eeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHH Q lcl|NC_019418. 317 IAFKRRFDVEQNVY-MQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENS 395 (527) Q Consensus 317 ~~~~~~~d~~~~~~-~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~ 395 (527) ....+.+......+ .+......++..+++++++++++.+...++.+.+.|...++.+.-+++. .+|..||.+++++.+ T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~-~~gn~Sg~Alk~~~~ 381 (511) T protein:vir:99 303 QKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDN-FSGTQSGEAMKYKLF 381 (511) T ss_pred cccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-ccccchHHHHHHHHH Confidence 11111111111111 2222333445568899999999998888888888777666654433322 235678999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHh Q lcl|NC_019418. 396 DTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAK 475 (527) Q Consensus 396 ~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~ 475 (527) .+.++++.+++.|+.+|+++++.|+.+....+-. ........++|.|.+++|.|..++++.+++++ |++|.+|++.. T Consensus 382 ~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~-~~~~~~~~i~i~f~~~~p~n~~e~~~~~~kl~--GiiS~et~l~~ 458 (511) T protein:vir:99 382 GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSI-DVSKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSL 458 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc-ccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHHHh Confidence 9999999999999999999999999876543211 12334557899999999999999999988874 99999999988 Q ss_pred cCCCCHHHHHHHHHHHHHhccccccc-ccCCCCCC---CCCCCCCCCCCCCcccc Q lcl|NC_019418. 476 TLGITEEEAEKELAEINGELPPESDA-ELALYGKG---QQNTVGNSKDTVDDEDE 526 (527) Q Consensus 476 ~~~~~deea~~el~ri~~E~~~~~~~-~~~~~~~~---~~~~~~~~~~~~~~~~~ 526 (527) +++++| +++|++||++|+...... ....+.++ ++.+..+..+.+.|++| T Consensus 459 l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~e 511 (511) T protein:vir:99 459 FSFFQD--PELEVKKIEEDEKESIKKAQKNMYQDPRNINDDEQDDSTKDSIDKKE 511 (511) T ss_pred CCCCCC--HHHHHHHHHHHHHHHHHHHhhcccccCCCCCCCCCCCCCcCcccccC Confidence 866654 678899998887532221 11222221 12222222233333333 No 13 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=100.00 E-value=6.4e-57 Score=328.67 Aligned_cols=464 Identities=10% Similarity=0.016 Sum_probs=308.6 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccc-cCccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNT-DGDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~-~~~~~~~~~~~lnl~~~i 79 (527) |...+.|+++|+++. .....|++++++||.|+++.+..... ..+.+.++++++|+|+.| T Consensus 39 ~~~~~~i~~~i~~~~--------------------~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~I 98 (511) T protein:vir:96 39 LQNVNEVSKYIEHHM--------------------DYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYI 98 (511) T ss_pred hcCHHHHHHHHHHHH--------------------HhhhHHHHHHHHHhhccCccccccCcccccccCcceeecchHHHH Confidence 334455555555432 11345788899999999998755433 334445678999999999 Q ss_pred HHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEEEEEcCCceEEEEEc Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRVAFIQAPVFLPLQSN 158 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i~~v~a~~~~P~~~d 158 (527) |+..++|||+.|++++++++..++.|+++++.|+|.....++++.++++|.+|.++|+|. +++++.+++|.+++|++.+ T Consensus 99 v~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~~p~~~~~v~dd 178 (511) T protein:vir:96 99 SDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFIIYDN 178 (511) T ss_pred HHHHhhhhcccCceeecCchHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccceEEEEcC Confidence 999999999999999999999999999999999999999999999999999999999974 6899999999999999766 Q ss_pred CC-ceEEEEE-EEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcc Q lcl|NC_019418. 159 TQ-DVSSAAI-LTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQ 236 (527) Q Consensus 159 ~~-~~~~~a~-~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~ 236 (527) +. +...+++ ++.....++.....+..+|.|.. ..|.+ |...... ..++.... T Consensus 179 ~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~----------------~~i~~--~~~~~~~----~~~~~~~~---- 232 (511) T protein:vir:96 179 TVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTS----------------HGVYR--YLTNRTN----GLKLTPRE---- 232 (511) T ss_pred CCCCceEEEEEEEEeeeccccccceEEEEEEEeC----------------CcEEE--EEecCCC----cccccccc---- Confidence 54 3333333 22222122222222233455431 12211 2111111 11111101 Q ss_pred cceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCcc Q lcl|NC_019418. 237 PVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGN 316 (527) Q Consensus 237 ~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~ 316 (527) ....-+++..+++++|++ .++|.|+|++++++||++|.++|++++.++....++.+-..+....... ... T Consensus 233 ~~~~~~~~g~vPvv~~~n---------~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~-~~~ 302 (511) T protein:vir:96 233 NSFESHSFERMPITEFSN---------NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE-VRK 302 (511) T ss_pred cccccCcCcccceEEecC---------CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchh-hcc Confidence 111224566667777754 2469999999999999999999999999987555555533322111110 001 Q ss_pred cccccccccccceee-eccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHH Q lcl|NC_019418. 317 IAFKRRFDVEQNVYM-QVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENS 395 (527) Q Consensus 317 ~~~~~~~d~~~~~~~-~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~ 395 (527) ....+.+......+. +...+..+.+.+++++++++++.+...++.+.+.|...++.+.-+++.- +|..||.+++.+++ T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~-~~n~Sg~Al~~~~~ 381 (511) T protein:vir:96 303 QKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNF-SGTQSGEAMKYKLF 381 (511) T ss_pred cccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc-ccccHHHHHHHHHH Confidence 111111111222222 2222334455688999999999998898888888877776554333222 35668999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHh Q lcl|NC_019418. 396 DTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAK 475 (527) Q Consensus 396 ~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~ 475 (527) .+.+++..+++.|+.+|++++++|+.+....+ .........+++|.|++++|.|..++++.+++++ |+||.+|++.. T Consensus 382 ~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~-~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~--G~iS~et~l~~ 458 (511) T protein:vir:96 382 GLEQRTKTKEGLFTKGLRRRAKLLETILKNTR-SIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSL 458 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CCccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHh Confidence 99999999999999999999999988765322 1112334567899999999999999999988875 99999999987 Q ss_pred cCCCCHHHHHHHHHHHHHhccccccc----ccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 476 TLGITEEEAEKELAEINGELPPESDA----ELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 476 ~~~~~deea~~el~ri~~E~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) +++++| +++|++||++|+...... ....+...++.+..++.++..+|+| T Consensus 459 l~~v~d--~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:96 459 FSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCC--HHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCCccCcccccC Confidence 755543 778999999887543222 1222222223333333444444444 No 14 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=100.00 E-value=6.4e-57 Score=328.67 Aligned_cols=464 Identities=10% Similarity=0.016 Sum_probs=308.6 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccc-cCccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNT-DGDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~-~~~~~~~~~~~lnl~~~i 79 (527) |...+.|+++|+++. .....|++++++||.|+++.+..... ..+.+.++++++|+|+.| T Consensus 39 ~~~~~~i~~~i~~~~--------------------~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~I 98 (511) T protein:vir:78 39 LQNVNEVSKYIEHHM--------------------DYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYI 98 (511) T ss_pred hcCHHHHHHHHHHHH--------------------HhhhHHHHHHHHHhhccCccccccCcccccccCcceeecchHHHH Confidence 334455555555432 11345788899999999998755433 334445678999999999 Q ss_pred HHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEEEEEcCCceEEEEEc Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRVAFIQAPVFLPLQSN 158 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i~~v~a~~~~P~~~d 158 (527) |+..++|||+.|++++++++..++.|+++++.|+|.....++++.++++|.+|.++|+|. +++++.+++|.+++|++.+ T Consensus 99 v~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~~p~~~~~v~dd 178 (511) T protein:vir:78 99 SDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFIIYDN 178 (511) T ss_pred HHHHhhhhcccCceeecCchHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccceEEEEcC Confidence 999999999999999999999999999999999999999999999999999999999974 6899999999999999766 Q ss_pred CC-ceEEEEE-EEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcc Q lcl|NC_019418. 159 TQ-DVSSAAI-LTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQ 236 (527) Q Consensus 159 ~~-~~~~~a~-~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~ 236 (527) +. +...+++ ++.....++.....+..+|.|.. ..|.+ |...... ..++.... T Consensus 179 ~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~----------------~~i~~--~~~~~~~----~~~~~~~~---- 232 (511) T protein:vir:78 179 TVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTS----------------HGVYR--YLTNRTN----GLKLTPRE---- 232 (511) T ss_pred CCCCceEEEEEEEEeeeccccccceEEEEEEEeC----------------CcEEE--EEecCCC----cccccccc---- Confidence 54 3333333 22222122222222233455431 12211 2111111 11111101 Q ss_pred cceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCcc Q lcl|NC_019418. 237 PVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGN 316 (527) Q Consensus 237 ~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~ 316 (527) ....-+++..+++++|++ .++|.|+|++++++||++|.++|++++.++....++.+-..+....... ... T Consensus 233 ~~~~~~~~g~vPvv~~~n---------~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~-~~~ 302 (511) T protein:vir:78 233 NSFESHSFERMPITEFSN---------NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE-VRK 302 (511) T ss_pred cccccCcCcccceEEecC---------CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchh-hcc Confidence 111224566667777754 2469999999999999999999999999987555555533322111110 001 Q ss_pred cccccccccccceee-eccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHH Q lcl|NC_019418. 317 IAFKRRFDVEQNVYM-QVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENS 395 (527) Q Consensus 317 ~~~~~~~d~~~~~~~-~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~ 395 (527) ....+.+......+. +...+..+.+.+++++++++++.+...++.+.+.|...++.+.-+++.- +|..||.+++.+++ T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~-~~n~Sg~Al~~~~~ 381 (511) T protein:vir:78 303 QKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNF-SGTQSGEAMKYKLF 381 (511) T ss_pred cccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc-ccccHHHHHHHHHH Confidence 111111111222222 2222334455688999999999998898888888877776554333222 35668999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHh Q lcl|NC_019418. 396 DTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAK 475 (527) Q Consensus 396 ~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~ 475 (527) .+.+++..+++.|+.+|++++++|+.+....+ .........+++|.|++++|.|..++++.+++++ |+||.+|++.. T Consensus 382 ~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~-~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~--G~iS~et~l~~ 458 (511) T protein:vir:78 382 GLEQRTKTKEGLFTKGLRRRAKLLETILKNTR-SIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSL 458 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CCccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHh Confidence 99999999999999999999999988765322 1112334567899999999999999999988875 99999999987 Q ss_pred cCCCCHHHHHHHHHHHHHhccccccc----ccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 476 TLGITEEEAEKELAEINGELPPESDA----ELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 476 ~~~~~deea~~el~ri~~E~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) +++++| +++|++||++|+...... ....+...++.+..++.++..+|+| T Consensus 459 l~~v~d--~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:78 459 FSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCC--HHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCCccCcccccC Confidence 755543 778999999887543222 1222222223333333444444444 No 15 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=100.00 E-value=4.7e-56 Score=323.90 Aligned_cols=464 Identities=10% Similarity=0.010 Sum_probs=310.0 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc-CccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD-GDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~-~~~~~~~~~~lnl~~~i 79 (527) |.-.+.|+++|.+.. .....|++++++||.|+++.+...... .+.+.++++++|+|+.| T Consensus 39 ~~~~~~i~~~i~~~~--------------------~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~I 98 (511) T protein:vir:10 39 LQNVNEVSKCIEHHM--------------------DYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYI 98 (511) T ss_pred ccCHHHHHHHHHHHH--------------------HhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHH Confidence 334455555554322 112457889999999999987554432 33445678899999999 Q ss_pred HHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEEEEEcCCceEEEEEc Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRVAFIQAPVFLPLQSN 158 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i~~v~a~~~~P~~~d 158 (527) |+..++|||++|++++++++..++.|++++++|+|.....+++..++++|.+|.++|+|. +++++.+++|.+++|++.+ T Consensus 99 v~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~dedg~~~i~~~~p~~~~~vydd 178 (511) T protein:vir:10 99 SDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIRNQDDETRLYKSDAMSTFVIYDN 178 (511) T ss_pred HHHHhhhhcccCceeecCchHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcC Confidence 999999999999999999999999999999999999999999999999999999999975 7899999999999999776 Q ss_pred CC-ceEEEEEEEEEEe-eCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcc Q lcl|NC_019418. 159 TQ-DVSSAAILTKTIK-TENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQ 236 (527) Q Consensus 159 ~~-~~~~~a~~~~~~~-~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~ 236 (527) +. +...+++.+..++ .+.........+|.+.. ..|.. |...... ...++. . . T Consensus 179 ~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~----------------~~i~~--~~~~~~~----~~~~~~--~--~ 232 (511) T protein:vir:10 179 TIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTS----------------HGVYR--YLTSRTN----GLKLTP--R--E 232 (511) T ss_pred CCCCceEEEEEEEEeeecccCccceEEEEEEEeC----------------CcEEE--EEecCCC----cccccc--c--c Confidence 54 3344433222222 12112222223444431 11111 2111111 001100 0 0 Q ss_pred cceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCcc Q lcl|NC_019418. 237 PVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGN 316 (527) Q Consensus 237 ~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~ 316 (527) ....-+++..+++++|++ + +.|.|+|++++++||+||.++|++++.++....++.|-..+....... ... T Consensus 233 ~~~~~~~~~~vPvv~f~n----n-----~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~-~~~ 302 (511) T protein:vir:10 233 NGFESHSFERMPITEFSN----N-----ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE-VRK 302 (511) T ss_pred cccccccCcceeEEEecC----C-----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccCCchh-hcc Confidence 111224566777888764 2 368999999999999999999999999988777777755443221111 111 Q ss_pred ccccccccccccee-eeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHH Q lcl|NC_019418. 317 IAFKRRFDVEQNVY-MQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENS 395 (527) Q Consensus 317 ~~~~~~~d~~~~~~-~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~ 395 (527) ......+......+ .+...+.++++.+++++++++++.+...++.+.+.|...++.+.-+++. .+|..||.+++++.+ T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~-~~~n~Sg~Al~~~~~ 381 (511) T protein:vir:10 303 QKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDN-FSGTQSGEAMKYKLF 381 (511) T ss_pred chhccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-ccccchHHHHHHHHH Confidence 11111111111111 2222333445568899999999999999998888887776655433322 235668999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHh Q lcl|NC_019418. 396 DTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAK 475 (527) Q Consensus 396 ~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~ 475 (527) .+.++++.+++.|+.+|+++++.|+.+....+- ........+++|.|.+++|.|..++++++.+++ |++|.+|++.. T Consensus 382 ~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~-~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~--G~iS~et~~~~ 458 (511) T protein:vir:10 382 GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRS-IDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSL 458 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCC-cccccccceeeEEeCCCCCcCHHHHHHHHHHHh--ccCcHHHHHHh Confidence 999999999999999999999999887654321 112334567999999999999999999998885 99999999988 Q ss_pred cCCCCHHHHHHHHHHHHHhccccccc----ccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 476 TLGITEEEAEKELAEINGELPPESDA----ELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 476 ~~~~~deea~~el~ri~~E~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) ++++++ +++|++||++|+...... ....+....+.+..+..++..+++| T Consensus 459 l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:10 459 FSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCC--HHHHHHHHHHHHHHHHHHHhhhcccCCCCCCCCCCCCcccCcccccC Confidence 766654 567899998886543221 1222222222233333344444444 No 16 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=100.00 E-value=4.7e-56 Score=323.89 Aligned_cols=464 Identities=9% Similarity=0.007 Sum_probs=307.5 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc-CccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD-GDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~-~~~~~~~~~~lnl~~~i 79 (527) ....+.|+++|++.. .....|++++++||.|+|+.+...... .+.+.++++++|+|+.| T Consensus 39 ~~~~~~i~~~i~~~~--------------------~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~I 98 (511) T protein:vir:93 39 LQNVNEVSKYIEHHM--------------------DYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYI 98 (511) T ss_pred hccHHHHHHHHHHHH--------------------HhhHHHHHHHHHHhcccCccccccCcCcccccCcceeecchHHHH Confidence 112333444443321 223568899999999999987554332 33345678899999999 Q ss_pred HHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe-CCeeEEEEEcCCceEEEEEc Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD-GDKIRVAFIQAPVFLPLQSN 158 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d-~~~~~i~~v~a~~~~P~~~d 158 (527) |+..++|||+.|++++++++..++.|++++++|+|.....+++..++++|.+|+++|+| ++.+++.+++|.+++|++.+ T Consensus 99 v~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de~~~~~i~~~~p~~~~~vydd 178 (511) T protein:vir:93 99 SDFINGYFLGNPIQYQDDDKDVLEVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDN 178 (511) T ss_pred HHHHhhhhcccCeeeccCChHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcC Confidence 99999999999999999999999999999999999999999999999999999999997 57899999999999999766 Q ss_pred CC-ceEEEEEEEEEEe-eCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcc Q lcl|NC_019418. 159 TQ-DVSSAAILTKTIK-TENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQ 236 (527) Q Consensus 159 ~~-~~~~~a~~~~~~~-~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~ 236 (527) +. +...+++....+. .++........+|.|.. ..|.+ |...... ...+.. + . T Consensus 179 ~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~iyt~----------------~~i~~--~~~~~~~----~~~~~~-~---~ 232 (511) T protein:vir:93 179 TIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTS----------------HGVYR--YLTSRTN----GLKLTP-R---E 232 (511) T ss_pred CCCCceEEEEEEEEeeeccccccceEEEEEEEeC----------------CcEEE--EEecCCC----cccccc-c---c Confidence 53 3344433222222 12222222223455431 11211 2221111 001100 0 0 Q ss_pred cceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCcc Q lcl|NC_019418. 237 PVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGN 316 (527) Q Consensus 237 ~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~ 316 (527) ....-+++..+++++|++ ++.|.|+|+++++++|++|.++|++++.++....++.|-..+....... ... T Consensus 233 ~~~~~~~~g~vPvv~~~n---------n~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~-~~~ 302 (511) T protein:vir:93 233 NGFESHSFERMPITEFSN---------NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE-VRK 302 (511) T ss_pred ccccccCCCccceEEecC---------CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcccCchh-hcc Confidence 111224667777887764 2468999999999999999999999999987667666644443211111 001 Q ss_pred cccccccccccce-eeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHH Q lcl|NC_019418. 317 IAFKRRFDVEQNV-YMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENS 395 (527) Q Consensus 317 ~~~~~~~d~~~~~-~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~ 395 (527) ......+...... ..+...+..+++.++++++++.++.+...++.+.+.|...++.+.-+++. .+|..||.+++.+.+ T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~-~~~n~Sg~Al~~~~~ 381 (511) T protein:vir:93 303 QKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDN-FSGTQSGEAMKYKLF 381 (511) T ss_pred cccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-ccccchHHHHHHHHH Confidence 1111111111111 11122233445678899999999988888888888777766655433322 235668999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHh Q lcl|NC_019418. 396 DTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAK 475 (527) Q Consensus 396 ~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~ 475 (527) .+.++++.+++.|+.+|+++++.|+.+....+ .........++++.|++++|.|..++++++.++ +|+||.+|++.. T Consensus 382 ~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~-~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~ 458 (511) T protein:vir:93 382 GLEQRTKTKEGLFTKGLRRRAKLLETILKNTW-SIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSL 458 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-CcccccccccceEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHh Confidence 99999999999999999999999998754322 111223456789999999999999999988887 599999999988 Q ss_pred cCCCCHHHHHHHHHHHHHhccccccc----ccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 476 TLGITEEEAEKELAEINGELPPESDA----ELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 476 ~~~~~deea~~el~ri~~E~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) ++++++ +++|++||++|+...... ....+....+.+..+..++..+++| T Consensus 459 l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:93 459 FSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCC--HHHHHHHHHHHHHHHHHHHhhhcccCCCCCCCCCCCCcccccccccC Confidence 765554 567889998887543322 1122222223333334444444444 No 17 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=100.00 E-value=1.2e-55 Score=321.75 Aligned_cols=451 Identities=14% Similarity=0.111 Sum_probs=305.3 Q ss_pred ChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc----------CccccCcee Q lcl|NC_019418. 2 SLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD----------GDRKRRKMQ 71 (527) Q Consensus 2 ~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~----------~~~~~~~~~ 71 (527) .-.+.++.++.++... -.+...|+...++||.|+|+.+..+... ...+..+++ T Consensus 1 ~~~~~~~~~i~~~~~~-----------------~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki 63 (470) T protein:vir:10 1 MELDALKKLIQNTSTS-----------------RNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRI 63 (470) T ss_pred CchHHHHHHHHHHHHH-----------------HHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCccc Confidence 3345555555543311 1235678889999999999877554321 122335688 Q ss_pred ecchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEEEEEcCC Q lcl|NC_019418. 72 HLPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRVAFIQAP 150 (527) Q Consensus 72 ~lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i~~v~a~ 150 (527) ++|+++.|++..|+||||+|++++++++..++.|++++++ +|...+.++++.+++.|.+|.++|+|. +.+++..++|. T Consensus 64 ~~n~~k~Iv~~~~~yl~G~p~~~~~~d~~~~~~l~~~~~~-~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~~~~~~~~p~ 142 (470) T protein:vir:10 64 PSNFYQLLVDQEAGYVASVFPDIDVGKDADNKKIIDVLGD-DRALTLNGLLVDSSNAGRAWLHYWIDEDGNFRYGIIQPD 142 (470) T ss_pred ccchHHHHHHhhhhheeccceeeecCchHHHHHHHHHHhh-hHHHHHHHHHHHHhhcCeeEEEEEecCCCceEEEEEccc Confidence 9999999999999999999999999999999999999975 688899999999999999999999975 67999999999 Q ss_pred ceEEEEEcCC-ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCcee--- Q lcl|NC_019418. 151 VFLPLQSNTQ-DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERV--- 226 (527) Q Consensus 151 ~~~P~~~d~~-~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v--- 226 (527) ++||++.++. +...+++.+... .+..+..+++.+|.|... .|.+-...+.. ..+.... T Consensus 143 ~~~~v~d~~~~~~~~a~ir~y~~-~~~~~~~~~~~~e~yt~~----------------~~~~~~~~~~~-~~~~~~~~~~ 204 (470) T protein:vir:10 143 QITPIYATTLDNKLLGILRSYKQ-LDPDSGKYFTVHEYWTDK----------------EAQFFRTNATD-STVIEPYNII 204 (470) T ss_pred ceEEEEcCCCCCceEEEEEEEEe-eecCCceEEEEEEEEcCC----------------cEEEEEeecCc-ceeccccccc Confidence 9999976653 455555433222 233334456667776421 22221111111 1110000 Q ss_pred ecc---cccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeech Q lcl|NC_019418. 227 NLS---ELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPE 303 (527) Q Consensus 227 ~l~---~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~ 303 (527) +.. ...+.......-+++++.+|++|++ | +.|+|+|+++++|||++|.++|+++++++.-...+++-. T Consensus 205 ~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n----n-----~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~ 275 (470) T protein:vir:10 205 TSYDLSAGYETGQSNTLKHNFGRVPFIEFSK----N-----KYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLT 275 (470) T ss_pred cccccccccccccccccccCCCeeeEEEeec----C-----CCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeee Confidence 000 0111112223335677777877764 2 469999999999999999999999999986444454422 Q ss_pred hHhcCCCCCCCcccccccccccccceeeecc-CCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccccc Q lcl|NC_019418. 304 QMTQLKVQDNQGNIAFKRRFDVEQNVYMQVG-AGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQ 382 (527) Q Consensus 304 ~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~-~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~ 382 (527) .+-. .+. ++.. . +..+.....++ .+.+..+++++++++++++.+...++.+.+.|...++.+.. ++... T Consensus 276 g~~~--~~~--~~~~--~--~~~~~~~i~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~--~~~~~ 345 (470) T protein:vir:10 276 NYGG--ADL--HQFM--N--DLRKYKSIKINNTGNGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGIDP--ANFES 345 (470) T ss_pred cCCc--ccc--chhh--h--hhhhcCeEeccCCCCCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCCC--Ccccc Confidence 2210 000 1100 1 11111111122 23345667999999999999999999999988776655532 33445 Q ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHH Q lcl|NC_019418. 383 GVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMV 462 (527) Q Consensus 383 g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~ 462 (527) |..|+.++++.++.+.++++++++.|+++|++++++|+.+.+. ...+..+++|+|++.+|.|..+.+++.+++ T Consensus 346 gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~~------~~~d~~~i~i~f~~~~p~d~~e~~~~~~~~- 418 (470) T protein:vir:10 346 SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF------SDADKRHISQHWTRTKVEDSLTKAQIVSTV- 418 (470) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc------cCcccceeeEEeccCCCCCHHHHHHHHHHH- Confidence 7789999999999999999999999999999999999876542 224556799999999999999999888776 Q ss_pred hcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 463 AAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 463 ~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) +|+||.+|++..++.++ + +++|++||++|+....+.... .+.. ...+.|+|| T Consensus 419 -~g~iS~et~l~~~p~v~-D-~~~E~eri~~E~~e~~~~~~~----~~~~-----~~~~~dde~ 470 (470) T protein:vir:10 419 -ANYSSKEAVAKANPIVD-D-WQQELKDLAKDKEENDPYSNQ----ADEL-----NGKGVNDEQ 470 (470) T ss_pred -hccCcHHHHHHhCCCCC-C-HHHHHHHHHHHHHHHHHhhcc----cccc-----CCCCCCCCC Confidence 59999999987764444 3 678999999987654332211 1111 111112222 No 18 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=100.00 E-value=3.8e-55 Score=318.94 Aligned_cols=463 Identities=9% Similarity=0.001 Sum_probs=307.4 Q ss_pred CCh--------HHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccc-cCccccCcee Q lcl|NC_019418. 1 MSL--------IQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNT-DGDRKRRKMQ 71 (527) Q Consensus 1 m~~--------~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~-~~~~~~~~~~ 71 (527) |.= .+.|+.+|.+.. .....|++++.+||.|+++.+..... ..+.+.++++ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~--------------------~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~ki 90 (512) T protein:vir:97 31 YDGTESDLLQNINEVSKYIEHHM--------------------DYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRV 90 (512) T ss_pred cCchhhhhhhhHHHHHHHHHHHH--------------------HhhHHHHHHHHHHhcccCccccccCcccccccCccee Confidence 221 233444444322 11345788999999999998755433 2334456789 Q ss_pred ecchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe-CCeeEEEEEcCC Q lcl|NC_019418. 72 HLPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD-GDKIRVAFIQAP 150 (527) Q Consensus 72 ~lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d-~~~~~i~~v~a~ 150 (527) ++|+|+.||+..++|+|++|++++++++..++.|++++++|+|...+.+++..++++|.+|.++|+| .+++++.+++|. T Consensus 91 ~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded~~~~i~~~~p~ 170 (512) T protein:vir:97 91 AHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAM 170 (512) T ss_pred ecchHHHHHHHHhhhhcccCceeccCChHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEccc Confidence 9999999999999999999999999999999999999999999999999999999999999999997 478999999999 Q ss_pred ceEEEEEcCC-ceEEEEEEEEEEeeCC-CcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeec Q lcl|NC_019418. 151 VFLPLQSNTQ-DVSSAAILTKTIKTEN-RKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNL 228 (527) Q Consensus 151 ~~~P~~~d~~-~~~~~a~~~~~~~~~~-~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l 228 (527) +++|++.++. +...+++-...++... ........+|.+. ...|.+ |...... ...+ T Consensus 171 ~~~~iyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt----------------~~~i~~--~~~~~~~----~~~~ 228 (512) T protein:vir:97 171 STFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFT----------------SHGVYR--YLTSRTN----GLKL 228 (512) T ss_pred ceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEe----------------CCcEEE--EEecCCC----cccc Confidence 9999976543 3444443222222111 1111222344432 111211 2221111 0011 Q ss_pred ccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcC Q lcl|NC_019418. 229 SELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQL 308 (527) Q Consensus 229 ~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~ 308 (527) .. ......-+++..+++++|++ .+.|.|+|+++++++|++|.++|++++.++....++.+-..+... T Consensus 229 ~~----~~~~~~~~~~g~vPvv~~~n---------n~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~ 295 (512) T protein:vir:97 229 TP----RENGFESHSFERMPITEFSN---------NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL 295 (512) T ss_pred cc----cccccccccCcccceEeecC---------CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccC Confidence 00 01112234667777777764 256999999999999999999999999998766666663333211 Q ss_pred CCCCCCc-cccccccc--ccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccc Q lcl|NC_019418. 309 KVQDNQG-NIAFKRRF--DVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVK 385 (527) Q Consensus 309 ~~~~~~~-~~~~~~~~--d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~ 385 (527) . .... .......+ ......+.....+.++++.++++++++.++.+...++.+.+.|...++.+.-+++. .+|.. T Consensus 296 ~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~-~~gn~ 372 (512) T protein:vir:97 296 D--PVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDN-FSGTQ 372 (512) T ss_pred C--chhhhhhhhcccccccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCccc-ccccc Confidence 1 1110 00000000 11111111222233444568899999999988888888888777666655443332 23556 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcC Q lcl|NC_019418. 386 TATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAG 465 (527) Q Consensus 386 TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aG 465 (527) ||.+++++++.+.++++.+++.|+.+|++++++|+.+....+-. ....+..+++|.|++++|.|..+.++.+.++ +| T Consensus 373 Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~-~~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl--~g 449 (512) T protein:vir:97 373 SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSI-DANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GG 449 (512) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc-ccccccccceEEeCCCCCcCHHHHHHHHHHH--hc Confidence 89999999999999999999999999999999999876543211 1234456799999999999999999988887 49 Q ss_pred CCCHHHHHHhcCCCCHHHHHHHHHHHHHhccccccc----ccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 466 FATQKRGIAKTLGITEEEAEKELAEINGELPPESDA----ELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 466 i~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) ++|.+|++..++++++ +++|++||++|+...... ....++..++.+..+..++..+++| T Consensus 450 iiS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (512) T protein:vir:97 450 KISQTTLMSLFSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) T ss_pred cCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCccccccccC Confidence 9999999988766654 567889998876543222 1222223333334444555555555 No 19 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=100.00 E-value=1.5e-55 Score=321.15 Aligned_cols=445 Identities=13% Similarity=0.115 Sum_probs=304.0 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc-C-------ccccCceee Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD-G-------DRKRRKMQH 72 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~-~-------~~~~~~~~~ 72 (527) +..-..+++++.+.. ...+..+++++.+||.|+|+.+..+... + ..+..++++ T Consensus 18 ~~~~~~~~~~i~~~~-------------------~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~ 78 (479) T protein:vir:79 18 KESTINLVKVIEHYI-------------------LKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAI 78 (479) T ss_pred cCChhHHHHHHHHHH-------------------hhhhHHHHHHHHHHhccCCcccccccccccccccccccccCcceee Confidence 333333444433211 2234568999999999999876543321 1 112345788 Q ss_pred cchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe-CCeeEEEEEcCCc Q lcl|NC_019418. 73 LPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD-GDKIRVAFIQAPV 151 (527) Q Consensus 73 lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d-~~~~~i~~v~a~~ 151 (527) +|||+.||+..|+|+|++|++++++++..++.|+.+++ |+|...+.++++.++++|.+|+++|+| ++++++.+++|.+ T Consensus 79 ~~~~~~Ivd~~~~~l~g~p~~~~~~~~~~~~~~~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~ 157 (479) T protein:vir:79 79 NNYHKLLVDQKVGYSVGNPIVFNADDDNLTKLLNDLLG-EEFDDTITELYLNASNKGVEWLHPYINRKGEFKYVIIPAEE 157 (479) T ss_pred cchHHHHHHHHHhhhhcCCceeccCCHHHHHHHHHHHh-cCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEccce Confidence 99999999999999999999999999999999988775 789999999999999999999999997 4689999999999 Q ss_pred eEEEEEcCC-ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeeccc Q lcl|NC_019418. 152 FLPLQSNTQ-DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSE 230 (527) Q Consensus 152 ~~P~~~d~~-~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~ 230 (527) ++|++.+.. +...+++.+........ ...+.+|.|... .|.+ |.... ......+.... T Consensus 158 ~~~v~d~~~~~~~~~~ir~y~~~~~~~--~~~~~~e~y~~~----------------~i~~--~~~~~-~~~~~~~~~~~ 216 (479) T protein:vir:79 158 AIPIWDSKRQRELVAFIRFYYIEDIDG--NKIKRVEYYTEN----------------DITY--FIERG-NSFIQEFLYDE 216 (479) T ss_pred eEEEEeCCCCCceEEEEEEEEEeecCC--ceEEEEEEEeCC----------------cEEE--EEecC-Ccccccccccc Confidence 999965543 34444443222222222 223346776532 2222 11111 11111111100 Q ss_pred ------ccCC-cccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeech Q lcl|NC_019418. 231 ------LYPD-LQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPE 303 (527) Q Consensus 231 ------~~~~-l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~ 303 (527) ..+. ......-+++++++|++|++ +++|+|+|+++++++|++|.++|+++++++....++.+.. T Consensus 217 ~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n---------n~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~ 287 (479) T protein:vir:79 217 YGKMTDIQEGHFRINNKEQGWGKVPFIPFKN---------NEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLK 287 (479) T ss_pred cccccccccccccccccccCCCcccEEEecC---------CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeee Confidence 0000 01112235667777877764 2569999999999999999999999999998777666633 Q ss_pred hHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccc Q lcl|NC_019418. 304 QMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQG 383 (527) Q Consensus 304 ~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g 383 (527) .+- ....++ +..+......+..+ +.+.++++++++..+.+...++.+.+.|...++.+. +++...| T Consensus 288 g~~---~~~~~~-------~~~~~~~~~~i~~~--~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~--~~~~~~g 353 (479) T protein:vir:79 288 EYP---GTSLQE-------FIDNIRYYKSIKVD--GGGGVDKLEINIPVEAKKELLDRLEKNIIIFGQGVN--PESQNTG 353 (479) T ss_pred cCC---cccccc-------chhhhhhccceecC--CCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCccc--ccccccc Confidence 321 111110 11111122223332 234588999999999999999999888877765543 3444457 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHh Q lcl|NC_019418. 384 VKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVA 463 (527) Q Consensus 384 ~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~ 463 (527) ..|++++++.++.+.++++.+++.|+++|+++++.|+.+.+. .++...+..+++|.|++++|.|..+.+++.+++ T Consensus 354 n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~---~~~~~~~~~~i~i~f~~~~p~~~~~~a~~~~kl-- 428 (479) T protein:vir:79 354 DKSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKI---SGNKSYDYKTVQITFNHSMIINEAEKIDMAAKS-- 428 (479) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---cCCCccccccceEEeCCCCCcCHHHHHHHHHHH-- Confidence 789999999999999999999999999999999999987653 344555677899999999999999999988886 Q ss_pred cCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 464 AGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 464 aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) +|+||.+|++..+++++| +++|++||++|+....+.....+++.++.. ||+ T Consensus 429 ~g~iS~et~l~~l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~-----------~e~ 479 (479) T protein:vir:79 429 TGIVSDETIVSNHPWVED--VNDELERLKKQEDTQKEYDDLIPNNQDGVI-----------DET 479 (479) T ss_pred hccCcHHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHhccCcccCCCc-----------CcC Confidence 499999999988766654 668899999988765544444443222221 111 No 20 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=100.00 E-value=3.7e-55 Score=318.97 Aligned_cols=449 Identities=13% Similarity=0.078 Sum_probs=298.4 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccc-cccccCccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIE-YTNTDGDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~-~~~~~~~~~~~~~~~lnl~~~i 79 (527) ++-++.|+.+|.++. +....||+++.+||.|+++.+. +.......+..+++++|+|+.| T Consensus 38 ~~~~~~l~~~i~~~~--------------------~~~~~r~~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~I 97 (501) T protein:vir:27 38 VNNWELLKNFINHHK--------------------LRQAPRIQELLDYARGENHDVLQFGRRKDREMADKRAVHNYGRMI 97 (501) T ss_pred cccHHHHHHHHHHHH--------------------HHHHHHHHHHHHHhcCCCccccccCccCccccccceeccchHHHH Confidence 444445555544321 2345689999999999876553 3333334445678899999999 Q ss_pred HHHHhhhhhcccceEeeCC----HHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEEEEEcCCceEE Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAED----ETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRVAFIQAPVFLP 154 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~d----~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i~~v~a~~~~P 154 (527) |+..++|+|++|+++++++ +.++++|++++..|+|...+.++++.|+++|.+|.++|+|. ++++|.+++|.+++| T Consensus 98 vd~~~~yl~g~p~~~~~~d~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~ded~~~~i~~~~p~~~~~ 177 (501) T protein:vir:27 98 SKFKTGYLAGNPIRVEYDDNDNNSQNDDTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRNEYDETRIKRLNPLETFV 177 (501) T ss_pred HHHHhhhhcccCeeEecCCccchHHHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeCCCCceEEEEEccceeEE Confidence 9999999999999999876 45678899999999999999999999999999999999974 679999999999999 Q ss_pred EEEcCC-ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccC Q lcl|NC_019418. 155 LQSNTQ-DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYP 233 (527) Q Consensus 155 ~~~d~~-~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~ 233 (527) ++.++. +...+++.+....... +.++ .+|.+.. +..|+ |...... . + T Consensus 178 v~d~~~~~~~~~~ir~~~~~~~~-~~~~--~~~vyt~-------------~~v~~-----~~~~~~~---~---~----- 225 (501) T protein:vir:27 178 IYDNSLEDNSIAAVRYYNRGTLQ-NAKD--VVEIYTN-------------EHIYT-----LDASDDF---N---E----- 225 (501) T ss_pred EecCCCCCceEEEEEEEEeeecC-CcEE--EEEEEeC-------------CeEEE-----EEeCCce---e---e----- Confidence 976653 4444444322222222 2222 2454431 11111 2111100 0 0 Q ss_pred CcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCC Q lcl|NC_019418. 234 DLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDN 313 (527) Q Consensus 234 ~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~ 313 (527) ....-++++++++++|++ ++.|+|+|+++++++|++|.++|++++.++....++.+...+........ T Consensus 226 ---~~~~~~~~g~vPvv~~~n---------n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~ 293 (501) T protein:vir:27 226 ---ISVTTHAFGTVPITEFLN---------NVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQ 293 (501) T ss_pred ---ccccccCCCcccEEEecC---------CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccc Confidence 001124567778888864 25699999999999999999999999999987777777554432111111 Q ss_pred Ccccccccccccccceeeecc---CCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHH Q lcl|NC_019418. 314 QGNIAFKRRFDVEQNVYMQVG---AGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEI 390 (527) Q Consensus 314 ~~~~~~~~~~d~~~~~~~~~~---~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei 390 (527) +.. ......+..... .+......++++++++.++.+...++.+.+.|...++.+..+++. .+|..||.++ T Consensus 294 ~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~-~~~n~Sg~Al 366 (501) T protein:vir:27 294 ASD------MKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSDTN-FSGNTSGEAL 366 (501) T ss_pred hhh------hhhcCceeecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCccc-cccCchHHHH Confidence 111 111111111111 112233457889999888888777887777776666654333322 1355688999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHH Q lcl|NC_019418. 391 VSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQK 470 (527) Q Consensus 391 ~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~ 470 (527) ++..+.+.++++.+++.|+.+|+++++.|+.+.+..+ .+...+..+|+|+|++++|.|..+.++.++++ +|++|.+ T Consensus 367 ~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~--~~~~~d~~~i~v~f~~~~p~n~~e~ad~~~kl--~g~iS~e 442 (501) T protein:vir:27 367 KYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVN--EFKDFDESLLKITFTPNLPKSLNEQVSILTGL--GGQVSQE 442 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc--cccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCcHH Confidence 9999999999999999999999999999998765422 22334556799999999999999999988886 5999999 Q ss_pred HHHHhcCCCCHHHHHHHHHHHHHhccccccc--ccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 471 RGIAKTLGITEEEAEKELAEINGELPPESDA--ELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 471 ~~i~~~~~~~deea~~el~ri~~E~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) |++..+++++| +++|++||++|+...... ..++....+... .+..++++|+.|. T Consensus 443 t~l~~l~~v~D--~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~-d~~~~~~~d~~e~ 498 (501) T protein:vir:27 443 TALSLSGLVES--PNEELDKINKEVSEIDFKGYSNDFNEHVGKYT-DEVKETHTDDFER 498 (501) T ss_pred HHHHhCCCCCC--HHHHHHHHHHHHHhhhHhhhcCcccccccccc-CCCCCCccccccc Confidence 99988866654 567899998887543211 111221111111 1112222333333 No 21 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=100.00 E-value=5.5e-55 Score=318.04 Aligned_cols=458 Identities=11% Similarity=0.050 Sum_probs=305.6 Q ss_pred CCh-HHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHH Q lcl|NC_019418. 1 MSL-IQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~-~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i 79 (527) +.+ .+.+++++.+.. .+...|++++++||.|+++.+......++.+..+++++|+|+.| T Consensus 13 ~~~~~~~~~~~i~~~~--------------------~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~~~~i 72 (489) T protein:vir:99 13 SKLWIDQLKNYISRFK--------------------AEQLERLKELKRYYLGDNNIKYRPAKTDKYAADNRIASDFAKYI 72 (489) T ss_pred CCCCHHHHHHHHHHHH--------------------HHHHHHHHHHHHHhcccCccccccccccccCCcceeecchHHHH Confidence 554 344555554421 12356899999999999988765544444445667899999999 Q ss_pred HHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe-----CCeeEEEEEcCCceEE Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD-----GDKIRVAFIQAPVFLP 154 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d-----~~~~~i~~v~a~~~~P 154 (527) |+..|+|+|++|++++++++..+++|+.++++|+|...+.++++.+++.|.+|..+|+. .++++|.+++|.+++| T Consensus 73 v~~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~~~i~~~~p~~~~~ 152 (489) T protein:vir:99 73 TVFEQGYMLGVPVEYKNENKDLQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTEVKLYQLPAEQTFV 152 (489) T ss_pred HHHHhhhhccCCceeecCChhHHHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcceEEEEEcccceEE Confidence 99999999999999999999999999999999999999999999999999999999973 3579999999999999 Q ss_pred EEEcCC-ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccC Q lcl|NC_019418. 155 LQSNTQ-DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYP 233 (527) Q Consensus 155 ~~~d~~-~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~ 233 (527) ++.+.. +...+++.. +..+......+..++.+.. ..|.. |+......-| ..+. T Consensus 153 v~dd~~~~~~~~~i~~--~~~~~~~~~~~~~~~~y~~----------------~~i~~--~~~~~~~~~~--~~~~---- 206 (489) T protein:vir:99 153 IYDDTYQRNSLMAVHF--YDIDYGSGKRKQIIKAYTS----------------DTIYT--YEDYNLETKG--MRLK---- 206 (489) T ss_pred EEcCCCCCceEEEEEE--EEEecCCCceEEEEEEEeC----------------CcEEE--EEecCCCccc--ceec---- Confidence 976554 344444432 2222222223334455431 11111 2221111111 1111 Q ss_pred CcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCC Q lcl|NC_019418. 234 DLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDN 313 (527) Q Consensus 234 ~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~ 313 (527) ...-++++++++++|++ + +.|.|+|+++++++|++|.++|+++++++....++.+-..+.....+.. T Consensus 207 ----~~~~~~~g~vPvv~~~n----~-----~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~~~~~ 273 (489) T protein:vir:99 207 ----DYEGHFFKGVPVNEYAN----N-----EERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTGADEN 273 (489) T ss_pred ----ccccccCCceeEEEeec----C-----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCcccccch Confidence 11124566777777764 2 4589999999999999999999999999765554544322211111100 Q ss_pred ----------Ccccccccccccccceeeecc-CCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccccc Q lcl|NC_019418. 314 ----------QGNIAFKRRFDVEQNVYMQVG-AGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQ 382 (527) Q Consensus 314 ----------~~~~~~~~~~d~~~~~~~~~~-~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~ 382 (527) .+........+..+.+...-+ ...+....+++++.+++++.+...++.+.+.|...++.+.-++ ...+ T Consensus 274 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~~~ 352 (489) T protein:vir:99 274 DYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQD-MKFS 352 (489) T ss_pred hhhhhcccccccccccccccccceeeeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCccccc-cccc Confidence 000000001111111111101 1122234578899999999999999998888877666543222 2234 Q ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHH Q lcl|NC_019418. 383 GVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMV 462 (527) Q Consensus 383 g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~ 462 (527) |..||.+++++.+.+.++++.+++.|+.+|+++++.|+.+.+..+........+.+++|+|++++|.|..+.++.+++++ T Consensus 353 ~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl~ 432 (489) T protein:vir:99 353 GVQSGESMKYKLMASDNYREKQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNLY 432 (489) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHHHHHHHHHh Confidence 66789999999999999999999999999999999999887543322222334567999999999999999999988874 Q ss_pred hcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccc-cccCCCCCCCCCCCCCCCCCC Q lcl|NC_019418. 463 AAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESD-AELALYGKGQQNTVGNSKDTV 521 (527) Q Consensus 463 ~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 521 (527) |+||.++++..+++++++++++|++||++|+..... ..+..+++..+.+.. ..+.+ T Consensus 433 --giis~et~~~~l~~v~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~-~~~~p 489 (489) T protein:vir:99 433 --GIVSDQTIFEILNTVTGVDAEAELKRLKEEADKKQSLPEPRLVGDASGQEEP-TAEKP 489 (489) T ss_pred --ccCCHHHHHHhcCCCCchhHHHHHHHHHHHHHHHhccccccccCCCCCCcCC-CCCCC Confidence 999999999999999988899999999988754322 222222211111100 00111 No 22 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=100.00 E-value=1.7e-55 Score=320.81 Aligned_cols=453 Identities=13% Similarity=0.110 Sum_probs=292.6 Q ss_pred hHHH----HHHHHHHHHHHh-hcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc------CccccCcee Q lcl|NC_019418. 3 LIQK----VKDFFNRGRYNM-TTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD------GDRKRRKMQ 71 (527) Q Consensus 3 ~~~~----~k~~~~~~~~~~-~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~------~~~~~~~~~ 71 (527) |++. =|.|+.+....+ ....+..-....-|..-.....++.++++||.|+|+.+..+... ...+.++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki 80 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQYETQEEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWRM 80 (468) T ss_pred CccccCCcCceeehheeecccccccCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccccccccccccccccc Confidence 1000 001111100000 00000000000001112235678999999999999876543221 122335678 Q ss_pred ecchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe-CCeeEEEEEcCC Q lcl|NC_019418. 72 HLPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD-GDKIRVAFIQAP 150 (527) Q Consensus 72 ~lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d-~~~~~i~~v~a~ 150 (527) ++|+|+.||+..++|+|++|++++++++..++.|+++++ |+|...+.++++.|+++|.+|+++|+| .+.+++.+++|+ T Consensus 81 ~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~ 159 (468) T protein:vir:96 81 YTNYHQNLVDQKVAYAVANPVTYGTEDEKSLKTIQEVLN-HKWDDKLVDILTAASNKGVEWIQPYVDEQGEFKTFRVPAE 159 (468) T ss_pred ccchHHHHHHHHHhhhccCCceeccCChHHHHHHHHHHh-cCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEEEccc Confidence 899999999999999999999999999999999999996 689999999999999999999999997 467999999999 Q ss_pred ceEEEEEcCC-ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeec- Q lcl|NC_019418. 151 VFLPLQSNTQ-DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNL- 228 (527) Q Consensus 151 ~~~P~~~d~~-~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l- 228 (527) +++|++.++. +...+++ +.+...+.. + +|.|.. .+|.+ |...+.. .....+- T Consensus 160 ~~~~v~~~~~~~~~~~~i--r~~~~~~~~--~---~~~~~~----------------~~~~~--~~~~~~~-~~~~~~~~ 213 (468) T protein:vir:96 160 QAIPIWTNKERDELKAFI--RLYELDGGE--R---VEYWTA----------------NDVTF--YELKDGQ-LIPDYYQG 213 (468) T ss_pred ceEEEEcCCCCCceEEEE--EEEEecCce--E---EEEEeC----------------CeEEE--EEEcCCc-eeeccccc Confidence 9999975542 3433333 333322221 1 233321 11111 1111100 0000000 Q ss_pred -ccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhc Q lcl|NC_019418. 229 -SELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQ 307 (527) Q Consensus 229 -~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~ 307 (527) ............-.++.++++++|++ ++.|+|+|+++++++|++|.++|+++++++.....+.+...+- T Consensus 214 ~~~~~~~~~~~~~~~~~~~iPvv~~~n---------~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~- 283 (468) T protein:vir:96 214 EEHVQAHYYVGNKSMSWNRVPFIPFKN---------NPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYE- 283 (468) T ss_pred ccccccceeeccccccCCcccEEEecC---------CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC- Confidence 00000000111225667788888864 2569999999999999999999999999987555555533221 Q ss_pred CCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchH Q lcl|NC_019418. 308 LKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTA 387 (527) Q Consensus 308 ~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TA 387 (527) ....++ ...+... +..+.+..++++.++++++++.++.+...++.+.+.|...++.+.-++ ...+|..|| T Consensus 284 --~~~~~~-----~~~~~~~--~~~i~~~~d~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~~~~~n~Sg 353 (468) T protein:vir:96 284 --GEDLEE-----FMYNLKY--YKAINVDGDGSGGVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQQ-DKFGNSPSG 353 (468) T ss_pred --ccccch-----hhhhhhc--CceEEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCcccccc-cccccchHH Confidence 000000 1111111 222333333445689999999999999999998888877776543222 222467789 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCC Q lcl|NC_019418. 388 TEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFA 467 (527) Q Consensus 388 tei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~ 467 (527) .++++.++.+.++++.+++.|+++|+++++.|+.+.. ...+..+++|.|++++|.|..+.+++ ++.+|+| T Consensus 354 ~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g-------~~~d~~~i~i~f~~~~p~d~~e~a~~---~~~~g~i 423 (468) T protein:vir:96 354 IALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYK-------LSIKVQDVEITFNFNVMVNELEQSQI---GVNSQYL 423 (468) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CCcccceeeEEecCCCCcCHHHHHHH---HHhcCCC Confidence 9999999999999999999999999999999987642 23455679999999999998877665 4457999 Q ss_pred CHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCC Q lcl|NC_019418. 468 TQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTV 514 (527) Q Consensus 468 s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~ 514 (527) |.+|++..+++++| +++|++||++|+....+...++++.++++|- T Consensus 424 S~et~i~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 468 (468) T protein:vir:96 424 SKETVVTNHPWVDD--PVAEMERIDQEELALPSIEEGLNGKENNEPT 468 (468) T ss_pred chHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHhhccCCCCCCCCC Confidence 99999987766654 6789999999987666655555543333332 No 23 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=100.00 E-value=3.4e-55 Score=319.22 Aligned_cols=455 Identities=13% Similarity=0.116 Sum_probs=294.5 Q ss_pred hHHHHHHHHHH---------HHHH--hhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccc---cC---cc Q lcl|NC_019418. 3 LIQKVKDFFNR---------GRYN--MTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNT---DG---DR 65 (527) Q Consensus 3 ~~~~~k~~~~~---------~~~~--~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~---~~---~~ 65 (527) ||..++-=..| +.-+ +..+-+.+. |..-.....||.++++||.|+|+.+..... .+ .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-----i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~ 75 (474) T protein:vir:94 1 MFNIIRMPWDKPYGEEVVEQLKPQFETQEEMIVRL-----IDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYD 75 (474) T ss_pred CcccccccCCCchhhHHHHhhhhcccCHHHHHHHH-----HHHHHHHHHHHHHHHHHhccccchhcccchhccccccccc Confidence 22222111111 0000 000111110 111223567899999999999987643221 11 22 Q ss_pred ccCceeecchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEE Q lcl|NC_019418. 66 KRRKMQHLPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRV 144 (527) Q Consensus 66 ~~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i 144 (527) +..+++++|+|+.||+..|+|+|++|++++++++..++.|+.+++ |+|...+.++++.++++|.+|+++|+|. ++++| T Consensus 76 ~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i 154 (474) T protein:vir:94 76 KPDWRITTNFHQNLVDQKVSYVASKPVTYSCEDENVLKVIHDVLD-TRWDNKLIDILTATSNKGIDWLQVYINENGEMKL 154 (474) T ss_pred cCcceeecchHHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEE Confidence 334578999999999999999999999999999999999999885 6899999999999999999999999975 67999 Q ss_pred EEEcCCceEEEEEcCC-ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccC Q lcl|NC_019418. 145 AFIQAPVFLPLQSNTQ-DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLG 223 (527) Q Consensus 145 ~~v~a~~~~P~~~d~~-~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG 223 (527) .+++|.+++|++.++. +...+++ +++...+. .+ +|.+.. .....+... . ...+ T Consensus 155 ~~~~p~~~~~v~d~~~~~~~~~~i--r~~~~~~~--~~---~~~yt~---------------~~~~~y~~~---~-~~~~ 208 (474) T protein:vir:94 155 FRVPAEQAIPIWVDKEREELKSFI--RYYKFNNE--EK---VEFWTD---------------TTVTYYVLE---N-GGLI 208 (474) T ss_pred EEEcccceEEEEcCCCCCceEEEE--EEEEecCe--EE---EEEEeC---------------CeEEEEEEc---C-Cccc Confidence 9999999999976553 3444333 22222221 11 222211 111111111 1 1111 Q ss_pred ceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeech Q lcl|NC_019418. 224 ERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPE 303 (527) Q Consensus 224 ~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~ 303 (527) ....... .........+++.++++++|++ ++.|+|+|+++++++|++|.++|+++++++.....+.+-. T Consensus 209 ~~~~~~~--~~~~~~~~~~~~g~vPvv~~~n---------n~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~ 277 (474) T protein:vir:94 209 PDYYYGA--NHVQSHFSNGNWGRVPFIAFKN---------NPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILK 277 (474) T ss_pred cccccCc--CcccccccccCCCccceEEecC---------CcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee Confidence 1000000 1112233446778888888864 2579999999999999999999999999987555555522 Q ss_pred hHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccc Q lcl|NC_019418. 304 QMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQG 383 (527) Q Consensus 304 ~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g 383 (527) .+ .+... ..+..+...+..+..+. .+++++++++++.+.+.+.++.+.+.|...++.+.-+++ ..+| T Consensus 278 g~-----~~~~~-----~~~~~~~~~~~~i~~~~--~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~ 344 (474) T protein:vir:94 278 GY-----EGEDL-----EEFMRGLKYYKAINVDG--DGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTD-KFGS 344 (474) T ss_pred cC-----Ccccc-----hhhhhhhhccceeeccC--CCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCcc-cccc Confidence 22 11110 01111222233333332 345889999999999999999988888776665422221 1235 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHh Q lcl|NC_019418. 384 VKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVA 463 (527) Q Consensus 384 ~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~ 463 (527) ..||.++++.++.+.++++.+++.|+++|+++++.|+.+.. ...+..+++|+|++++|.|..+.++.. +. T Consensus 345 n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~-------~~~d~~~i~v~f~~~~p~~~~e~a~~~---~~ 414 (474) T protein:vir:94 345 APSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNN-------LKTDVKDIEISFNFNRMMNDAEQSQII---AQ 414 (474) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CCcccceeeEEeccCcccCHHHHHHHH---HH Confidence 67899999999999999999999999999999999987643 234567789999999999987766654 45 Q ss_pred cCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 464 AGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 464 aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) +|+||.++++..+++++| +++|++||++|+....+..+.+.+.+.+...+ +.....++.| T Consensus 415 ~g~iS~et~l~~l~~v~D--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~e 474 (474) T protein:vir:94 415 SQYLSRETLVKSSPLVDD--YKAELERIEQEQMEYNKQLPNLDDGGADGAQQ-QEGSNNKESE 474 (474) T ss_pred cCCCCHHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhccccCCCCCCCccc-CCCCcccccC Confidence 699999999988866654 56889999998876555555554333322222 1222222222 No 24 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=100.00 E-value=3.4e-55 Score=319.22 Aligned_cols=455 Identities=13% Similarity=0.116 Sum_probs=294.5 Q ss_pred hHHHHHHHHHH---------HHHH--hhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccc---cC---cc Q lcl|NC_019418. 3 LIQKVKDFFNR---------GRYN--MTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNT---DG---DR 65 (527) Q Consensus 3 ~~~~~k~~~~~---------~~~~--~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~---~~---~~ 65 (527) ||..++-=..| +.-+ +..+-+.+. |..-.....||.++++||.|+|+.+..... .+ .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-----i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~ 75 (474) T protein:vir:97 1 MFNIIRMPWDKPYGEEVVEQLKPQFETQEEMIVRL-----IDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYD 75 (474) T ss_pred CcccccccCCCchhhHHHHhhhhcccCHHHHHHHH-----HHHHHHHHHHHHHHHHHhccccchhcccchhccccccccc Confidence 22222111111 0000 000111110 111223567899999999999987643221 11 22 Q ss_pred ccCceeecchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEE Q lcl|NC_019418. 66 KRRKMQHLPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRV 144 (527) Q Consensus 66 ~~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i 144 (527) +..+++++|+|+.||+..|+|+|++|++++++++..++.|+.+++ |+|...+.++++.++++|.+|+++|+|. ++++| T Consensus 76 ~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i 154 (474) T protein:vir:97 76 KPDWRITTNFHQNLVDQKVSYVASKPVTYSCEDENVLKVIHDVLD-TRWDNKLIDILTATSNKGIDWLQVYINENGEMKL 154 (474) T ss_pred cCcceeecchHHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEE Confidence 334578999999999999999999999999999999999999885 6899999999999999999999999975 67999 Q ss_pred EEEcCCceEEEEEcCC-ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccC Q lcl|NC_019418. 145 AFIQAPVFLPLQSNTQ-DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLG 223 (527) Q Consensus 145 ~~v~a~~~~P~~~d~~-~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG 223 (527) .+++|.+++|++.++. +...+++ +++...+. .+ +|.+.. .....+... . ...+ T Consensus 155 ~~~~p~~~~~v~d~~~~~~~~~~i--r~~~~~~~--~~---~~~yt~---------------~~~~~y~~~---~-~~~~ 208 (474) T protein:vir:97 155 FRVPAEQAIPIWVDKEREELKSFI--RYYKFNNE--EK---VEFWTD---------------TTVTYYVLE---N-GGLI 208 (474) T ss_pred EEEcccceEEEEcCCCCCceEEEE--EEEEecCe--EE---EEEEeC---------------CeEEEEEEc---C-Cccc Confidence 9999999999976553 3444333 22222221 11 222211 111111111 1 1111 Q ss_pred ceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeech Q lcl|NC_019418. 224 ERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPE 303 (527) Q Consensus 224 ~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~ 303 (527) ....... .........+++.++++++|++ ++.|+|+|+++++++|++|.++|+++++++.....+.+-. T Consensus 209 ~~~~~~~--~~~~~~~~~~~~g~vPvv~~~n---------n~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~ 277 (474) T protein:vir:97 209 PDYYYGA--NHVQSHFSNGNWGRVPFIAFKN---------NPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILK 277 (474) T ss_pred cccccCc--CcccccccccCCCccceEEecC---------CcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee Confidence 1000000 1112233446778888888864 2579999999999999999999999999987555555522 Q ss_pred hHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccc Q lcl|NC_019418. 304 QMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQG 383 (527) Q Consensus 304 ~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g 383 (527) .+ .+... ..+..+...+..+..+. .+++++++++++.+.+.+.++.+.+.|...++.+.-+++ ..+| T Consensus 278 g~-----~~~~~-----~~~~~~~~~~~~i~~~~--~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~ 344 (474) T protein:vir:97 278 GY-----EGEDL-----EEFMRGLKYYKAINVDG--DGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTD-KFGS 344 (474) T ss_pred cC-----Ccccc-----hhhhhhhhccceeeccC--CCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCcc-cccc Confidence 22 11110 01111222233333332 345889999999999999999988888776665422221 1235 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHh Q lcl|NC_019418. 384 VKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVA 463 (527) Q Consensus 384 ~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~ 463 (527) ..||.++++.++.+.++++.+++.|+++|+++++.|+.+.. ...+..+++|+|++++|.|..+.++.. +. T Consensus 345 n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~-------~~~d~~~i~v~f~~~~p~~~~e~a~~~---~~ 414 (474) T protein:vir:97 345 APSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNN-------LKTDVKDIEISFNFNRMMNDAEQSQII---AQ 414 (474) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CCcccceeeEEeccCcccCHHHHHHHH---HH Confidence 67899999999999999999999999999999999987643 234567789999999999987766654 45 Q ss_pred cCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 464 AGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 464 aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) +|+||.++++..+++++| +++|++||++|+....+..+.+.+.+.+...+ +.....++.| T Consensus 415 ~g~iS~et~l~~l~~v~D--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~e 474 (474) T protein:vir:97 415 SQYLSRETLVKSSPLVDD--YKAELERIEQEQMEYNKQLPNLDDGGADGAQQ-QEGSNNKESE 474 (474) T ss_pred cCCCCHHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhccccCCCCCCCccc-CCCCcccccC Confidence 699999999988866654 56889999998876555555554333322222 1222222222 No 25 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=100.00 E-value=1e-54 Score=316.57 Aligned_cols=454 Identities=10% Similarity=0.026 Sum_probs=296.7 Q ss_pred CChH------HHH----HHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCce Q lcl|NC_019418. 1 MSLI------QKV----KDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKM 70 (527) Q Consensus 1 m~~~------~~~----k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~ 70 (527) |-+. +.+ ..+++++... -.....|+.++++||.|+|+.+.. ...+..+.+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~i~~-----------------~~~~~~~~~~l~~Yy~g~~~i~~~-~~~~~~~~~~k 62 (499) T protein:vir:10 1 MAVVIDKDLLDDVNEPNIEAINYAIRE-----------------LQNRKKRLDKLSDYYNGKQEIEKH-EFDNATVEAAN 62 (499) T ss_pred CccchhhhHHhhhhcCCHHHHHHHHHH-----------------HHHHHHHHHHHHHHhccccchhcC-CcCcCCCCcce Confidence 2221 111 1112221110 112346788899999999887643 33444556778 Q ss_pred eecchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCC---------- Q lcl|NC_019418. 71 QHLPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGD---------- 140 (527) Q Consensus 71 ~~lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~---------- 140 (527) +++|+|+.||+..|+|||++|++++++++..++.|++++++|+|...+.+++..++++|.+|.++|++.+ T Consensus 63 i~~n~~~~Iv~~~~~~l~g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~ 142 (499) T protein:vir:10 63 VMVNHAKYITDMNVGFMTGNPVKYVAEKGKNIDDILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELG 142 (499) T ss_pred eecchHHHHHHHHhhhhcccCceeecCChhHHHHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEeccccccccccccc Confidence 8999999999999999999999999999999999999999999999999999999999999999999754 Q ss_pred --------eeEEEEEcCCceEEEEEcCCc-eEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEE Q lcl|NC_019418. 141 --------KIRVAFIQAPVFLPLQSNTQD-VSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITN 211 (527) Q Consensus 141 --------~~~i~~v~a~~~~P~~~d~~~-~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n 211 (527) .+++..++|.++||++.+..+ ...+++.+....... +...++.+|.|.. .+|.+ T Consensus 143 ~~~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~-~~~~~~~~~iyt~----------------~~i~~ 205 (499) T protein:vir:10 143 NEKLTPNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLE-GNTNGYSITVYMP----------------QRIVE 205 (499) T ss_pred ccccccccceEEEEEcccceEEEecCCCCcceEEEEEEEEEeecC-CCceEEEEEEEeC----------------CeEEE Confidence 367899999999999776654 344444333222222 2222334566542 22322 Q ss_pred EEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_019418. 212 ELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWE 291 (527) Q Consensus 212 ~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e 291 (527) |.......++...++. ...-++++++++++|++ +++|.|+|+++++|+|++|.++|++++. T Consensus 206 --~~~~~~~~~~~~~~~~--------~~~~~~~g~vPvv~~~n---------~~~~~~d~e~v~~liD~~~~~~S~~~~~ 266 (499) T protein:vir:10 206 --YRTKTTMEVSANDPIV--------YDGENLFGAVPIIEFRN---------NEERQGDFEQLISLIDAYNLLQTDRISD 266 (499) T ss_pred --EEecCCccccCcceec--------ccccCCCCccceEEecC---------CCCCCCchHhHHHHHHHHHHHHHHHHHH Confidence 2221111111100110 11124677788888864 2468999999999999999999999999 Q ss_pred HHcCcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcC Q lcl|NC_019418. 292 IKMGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIG 371 (527) Q Consensus 292 ~~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g 371 (527) ++.....+.+-..+- ....... .... .. +.....+.++...++++++++..+.+...++.+.+.|...++ T Consensus 267 ~~~~~~~~lv~~G~~---~~~~~~~---~~~~--~~--~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~ 336 (499) T protein:vir:10 267 KEAFVDALLVTFGFG---LGDDKDD---IQRL--KR--GAIEAPPREEGADIEWLTKSFDETQVNLLSQSIENDIHKISY 336 (499) T ss_pred HHHhcCceeeeecCc---cccccch---hhhh--hh--cceeccCCCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhC Confidence 987555555522211 0100000 0001 11 111112223344588999999999999999999888877766 Q ss_pred CCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCH Q lcl|NC_019418. 372 VSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDR 451 (527) Q Consensus 372 ~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~ 451 (527) .+.-+++. .+|..||.+++++.+.+.++++.+++.|+.+|+++++.|+.+.+.. +...+..+++|.|++++|.|. T Consensus 337 ~p~~~~~~-~~gn~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~----~~~~d~~~i~i~f~~~~p~n~ 411 (499) T protein:vir:10 337 VPNMNDEK-FMGNVSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIK----GANDDASGCKISLVANIPSNL 411 (499) T ss_pred cccCCchh-hcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----CCccccccceEEeCCCCCCCH Confidence 55333222 2355689999999999999999999999999999999999886532 234456689999999999999 Q ss_pred HHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhccccccc----ccCCCCC-CCCCCCCCCCCCCCcccc Q lcl|NC_019418. 452 HAELDYWMKMVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDA----ELALYGK-GQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 452 ~~~~~~~~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~----~~~~~~~-~~~~~~~~~~~~~~~~~~ 526 (527) .+.++.++++ +|++|.+|++..++++++ +++|++||++|+...... ..+..++ +.+++..++....+++.. T Consensus 412 ~e~~~~~~kl--~g~iS~et~~~~l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 487 (499) T protein:vir:10 412 SDVVNNVKNA--DGIIPRKYTYSWLPDVDN--PQDVIDEMNQQDAETIKKNQEALRGQDPDRLELEDKQDDSSENDKEAG 487 (499) T ss_pred HHHHHHHHHH--hccCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCCCcccCCCCCCCc Confidence 9999999987 599999999988766654 567788887776432111 1111111 111111111111111222 Q ss_pred C Q lcl|NC_019418. 527 A 527 (527) Q Consensus 527 ~ 527 (527) + T Consensus 488 ~ 488 (499) T protein:vir:10 488 S 488 (499) T ss_pred c Confidence 2 No 26 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=100.00 E-value=1.2e-54 Score=316.16 Aligned_cols=449 Identities=12% Similarity=0.067 Sum_probs=300.4 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcc-cccccccCccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDD-IEYTNTDGDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~-l~~~~~~~~~~~~~~~~lnl~~~i 79 (527) ++-++.|++++.+.. .....||+++.+||.|+++. +.........+.++++++|+|+.| T Consensus 38 ~~~~~~i~~~i~~~~--------------------~~~~~r~~~~~~yY~g~~~~i~~~~~~~~~~~~~~ri~~n~~k~I 97 (501) T protein:vir:96 38 VNNWELLKNFINHHK--------------------LRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMI 97 (501) T ss_pred CChHHHHHHHHHHHH--------------------HHHHHHHHHHHHHhcCCCCcccCccccCccccccceeecchHHHH Confidence 455555666554321 22346899999999998654 443333444455678899999999 Q ss_pred HHHHhhhhhcccceEeeCC----HHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe-CCeeEEEEEcCCceEE Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAED----ETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD-GDKIRVAFIQAPVFLP 154 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~d----~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d-~~~~~i~~v~a~~~~P 154 (527) |+..++|+|++|+++++++ +.++++|++++++|+|...+.+++..|+++|.+|+++|+| .+.+++.+++|.+++| T Consensus 98 vd~~~~yl~g~p~~~~~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~~~~p~~~~~ 177 (501) T protein:vir:96 98 SKFKTGYLAGNPIRVEYDDNDDNSQNDDAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSEYDETRIKRLSPLETFV 177 (501) T ss_pred HHHHhhhhcccCeeEeeCCccchhHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEEEccceeEE Confidence 9999999999999999865 4567889999999999999999999999999999999997 4789999999999999 Q ss_pred EEEcCC-ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccC Q lcl|NC_019418. 155 LQSNTQ-DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYP 233 (527) Q Consensus 155 ~~~d~~-~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~ 233 (527) ++.++. +...+++.+...... .+.+. .++.|.. +..| + |... ... . + T Consensus 178 v~d~~~~~~~~~~v~~~~~~~~-~~~~~--~~~vyt~-------------~~i~--~---~~~~--~~~-~---~----- 225 (501) T protein:vir:96 178 IYDNSLEDNSIAAVRYYNRGTL-QSAKD--VVEIYTD-------------EHIY--T---LDAS--DDF-N---E----- 225 (501) T ss_pred EEcCCCCCceEEEEEEEEeecC-CCcEE--EEEEEcC-------------CcEE--E---EeeC--CCc-e---e----- Confidence 976653 455555433322222 22222 2344321 1111 1 1111 100 0 0 Q ss_pred CcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCC Q lcl|NC_019418. 234 DLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDN 313 (527) Q Consensus 234 ~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~ 313 (527) ....-++++++++++|++ +|.|+|+|+++++++|++|.++|++++.++....++.+-..+........ T Consensus 226 ---~~~~~~~~g~vPvv~~~n---------n~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~ 293 (501) T protein:vir:96 226 ---ISVTTHAFGTVPITEYLN---------NIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQ 293 (501) T ss_pred ---ccccccCCCccceEEecC---------CccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccc Confidence 011124567778888864 36799999999999999999999999999876666665444321111111 Q ss_pred Ccccccccccccccceeeec-c--CCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHH Q lcl|NC_019418. 314 QGNIAFKRRFDVEQNVYMQV-G--AGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEI 390 (527) Q Consensus 314 ~~~~~~~~~~d~~~~~~~~~-~--~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei 390 (527) +.. ......+.... . .+......++++++++..+.+...++.+.+.|...++.+..+++.. +|..||+++ T Consensus 294 ~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Al 366 (501) T protein:vir:96 294 ASD------MKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNF-SGNTSGEAL 366 (501) T ss_pred hhh------hhhcCeeeecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCcccc-cccchHHHH Confidence 111 11111111111 1 1112334578889999888888888888777777666654443322 356689999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHH Q lcl|NC_019418. 391 VSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQK 470 (527) Q Consensus 391 ~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~ 470 (527) ++..+.+.++++.+++.|+++|+++++.|+.+.+..+ .+...+..+++|+|++.+|.|..+.++.+++++ |++|.+ T Consensus 367 ~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~--~~~~~d~~~i~i~f~~~~p~n~~e~ad~~~kl~--g~iS~e 442 (501) T protein:vir:96 367 KYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVN--EFKDFDESLLKITFTPNLPKSLNEQVSILTGLG--GQVSQE 442 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--cccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCchH Confidence 9999999999999999999999999999998765422 233345567999999999999999999988874 999999 Q ss_pred HHHHhcCCCCHHHHHHHHHHHHHhcccccccc--cC---CCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 471 RGIAKTLGITEEEAEKELAEINGELPPESDAE--LA---LYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 471 ~~i~~~~~~~deea~~el~ri~~E~~~~~~~~--~~---~~~~~~~~~~~~~~~~~~~~~~ 526 (527) |++..+++++| +++|++||++|+....... .. ..++..+......-++++++.| T Consensus 443 t~~~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~e~~~d~~e~~~~ 501 (501) T protein:vir:96 443 TALSLSGLVES--PNEELDKINKEMSEIDFKGYSNDFNEHVGKYTDEVKETHTDDFEREYE 501 (501) T ss_pred HHHHhCCCCCC--HHHHHHHHHHHHHHhhccccccchhhcccccCCcCCCCCCCccccccC Confidence 99988866654 5678999988876432211 11 1111111112222333344444 No 27 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=100.00 E-value=5.2e-55 Score=318.20 Aligned_cols=468 Identities=12% Similarity=0.059 Sum_probs=299.4 Q ss_pred CCh-----HHHHH------HHHHHHHHHhhcccchhhhccC-c-----cc-cCHHHHHHHHHHHHHhcCCCc-ccccccc Q lcl|NC_019418. 1 MSL-----IQKVK------DFFNRGRYNMTTSHLSSILDHP-K-----VA-VTQSEFRRIQHNLAYYQSKFD-DIEYTNT 61 (527) Q Consensus 1 m~~-----~~~~k------~~~~~~~~~~~~~~~~~~~~~~-~-----i~-~~~~~~~~i~~~~~~y~g~~~-~l~~~~~ 61 (527) |.- .+-.+ +|-++.........+.+.+... + |. -......||+++.+||.|+++ .+..... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~~~~~~ 80 (502) T protein:vir:48 1 MMEQTLFTDSTGQDLVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRR 80 (502) T ss_pred CceeEEEEecchhHHHhhcccChhHHhhhcccchhhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccc Confidence 000 00000 1111111111111111111000 0 00 013345689999999999765 4444444 Q ss_pred cCccccCceeecchHHHHHHHHhhhhhcccceEeeCCH----HHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEE Q lcl|NC_019418. 62 DGDRKRRKMQHLPIARTAAKKIASLVYNEQAEISAEDE----TLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYV 137 (527) Q Consensus 62 ~~~~~~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~d~----~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~ 137 (527) ....+.++++++|+|+.||+..++|||++|++++++++ ..+++|++++++|+|...+.+++..++++|.+|+++|. T Consensus 81 ~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ 160 (502) T protein:vir:48 81 KDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNEDNSQNDDAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYR 160 (502) T ss_pred cccccccceeecchHHHHHHHHhhhhcccCeeEecCCccchhHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEe Confidence 44445567889999999999999999999999998753 46778999999999999999999999999999999999 Q ss_pred eC-CeeEEEEEcCCceEEEEEcC-CceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEe Q lcl|NC_019418. 138 DG-DKIRVAFIQAPVFLPLQSNT-QDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYK 215 (527) Q Consensus 138 d~-~~~~i~~v~a~~~~P~~~d~-~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~ 215 (527) |. +++++.+++|.+++|++.++ .+...+++.+....... ..++ .+|.|... ..|+ |. T Consensus 161 dedg~~~i~~~~p~~~~~vydd~~~~~~~~~ir~~~~~~~~-~~~~--~~~iyt~~-------------~i~~-----~~ 219 (502) T protein:vir:48 161 SEYDETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQ-NAKD--VVEIYTNQ-------------HIYT-----LD 219 (502) T ss_pred CCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEEeecC-CcEE--EEEEEeCC-------------eEEE-----EE Confidence 74 78999999999999997654 34455544332222222 2222 34555311 1111 11 Q ss_pred cCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_019418. 216 STSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMG 295 (527) Q Consensus 216 ~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~ 295 (527) .. .. ..+ ....-++++++|+++|++ .+.|+|+|++++++||++|.++|++++.++.. T Consensus 220 ~~--~~----~~~--------~~~~~~~~g~vPvv~~~n---------n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~ 276 (502) T protein:vir:48 220 AS--DS----FNE--------ISVTPHAFGTVPITEFLN---------NADGIGDYETELYLIDLYDSAESDTANHMSDM 276 (502) T ss_pred eC--Cc----eee--------ccceecCCCccceEEecC---------CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHh Confidence 11 10 000 011124566777888764 25699999999999999999999999999987 Q ss_pred cceeeechhHhcCCCCCCCcccccccccccccceeeecc---CCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCC Q lcl|NC_019418. 296 QRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVG---AGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGV 372 (527) Q Consensus 296 ~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~---~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~ 372 (527) ..++.+-..+.... .+..+ ..+.....++.... .+.+....+++++++++++.+...++.+.+.|...++. T Consensus 277 ~~~~lv~~g~~~~~-~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~ 350 (502) T protein:vir:48 277 ADAILAIYGDLALP-QGMQA-----SDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNT 350 (502) T ss_pred cCceeeeecCcccc-cccch-----hhhhhcceeeccccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCC Confidence 77777644432111 11111 11111111222111 11233456889999999999999999999888877776 Q ss_pred CcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHH Q lcl|NC_019418. 373 SSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRH 452 (527) Q Consensus 373 s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~ 452 (527) +..+++.. +|..||++++++.+.+.++++.+++.|+.+|+++++.|+.+.+..+ .....+..+++|+|.+.+|.|.. T Consensus 351 p~~~~~~~-~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~--~~~~~d~~~i~i~f~~~~p~d~~ 427 (502) T protein:vir:48 351 PDMSDNHF-SGNASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVN--EFKDFDESRLKITFTPNLPKSLY 427 (502) T ss_pred CCcCcccc-ccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc--cccccccccceEEeCCCCCcCHH Confidence 65444322 3567899999999999999999999999999999999998865422 22334556799999999999999 Q ss_pred HHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccc-cccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 453 AELDYWMKMVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESD-AELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 453 ~~~~~~~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) ++++.+.++ +|++|.+|++..++.++| +++|++||++|+..... ..+..+.+ +...+.+....+.++|. T Consensus 428 e~a~~~~kl--~g~iS~et~l~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~--~~~~~~d~~~e~~~~~~ 497 (502) T protein:vir:48 428 EQVSILNDL--GGQVSQETALSLSGLVEN--PTEELDKINEESSKIDFKGYPSYFYD--NVGKYTDEVKETHTDDF 497 (502) T ss_pred HHHHHHHHH--hccCcHHHHHHhCCCCCC--HHHHHHHHHHHHHhhhhhcccccccc--cccccCCCccCCCCcCc Confidence 999998887 599999999888755554 56889999888764221 11111111 11111111111112222 No 28 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=100.00 E-value=4.3e-55 Score=318.62 Aligned_cols=457 Identities=12% Similarity=0.110 Sum_probs=296.2 Q ss_pred CChH------HHHHHHHHHHHHHh--hcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccc-cC-----ccc Q lcl|NC_019418. 1 MSLI------QKVKDFFNRGRYNM--TTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNT-DG-----DRK 66 (527) Q Consensus 1 m~~~------~~~k~~~~~~~~~~--~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~-~~-----~~~ 66 (527) |--+ .+..++++.+.... ..+-+.+. |.--.....++.++++||.|+|+.+..... .+ ..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-----i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~ 75 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRL-----INDHKPKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLK 75 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHH-----HHHHHHHHHHHHHHHHHhccCCcchhccchhcccccccccc Confidence 3322 12223332211110 00000011 111123567899999999999876643321 11 123 Q ss_pred cCceeecchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe-CCeeEEE Q lcl|NC_019418. 67 RRKMQHLPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD-GDKIRVA 145 (527) Q Consensus 67 ~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d-~~~~~i~ 145 (527) ..+++++|+|+.||+..|+|||++|++++++++..++.|+++++ +++.....+++..++++|.+|+++|+| .+++++. T Consensus 76 ~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~y~d~~~~~~i~ 154 (474) T protein:vir:96 76 PDWRMFTNYHQNLVDQKVAYAVANPVTFSSDDDKSLKTIQEVLN-HKWDDKLVDILTAASNKGIEWLQPYIDENGEFKTF 154 (474) T ss_pred cchhcccchHHHHHHhhhhhhcccCceeecCchHHHHHHHHHHh-cCHHHHHHHHHHHHHhcCeeEEEEEecCCCceEEE Confidence 34578899999999999999999999999999999999999986 578999999999999999999999997 4789999 Q ss_pred EEcCCceEEEEEcC-CceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCc Q lcl|NC_019418. 146 FIQAPVFLPLQSNT-QDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGE 224 (527) Q Consensus 146 ~v~a~~~~P~~~d~-~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~ 224 (527) +++|++++|++.++ .+...+++. ++..++.. + .|.|.. ..|.+ |...+...+.. T Consensus 155 ~~~p~~~~~v~d~~~~~~~~~~vr--~~~~~~~~--~---~~~yt~----------------~~v~~--~~~~~~~~~~~ 209 (474) T protein:vir:96 155 RVPAEQAIPIWTNKERDTLKAFIR--YYRLDGAE--R---VEYWTD----------------SDVTY--YEYQDGILIPD 209 (474) T ss_pred EEcccceEEEEcCCCCCceEEEEE--EEeecCce--E---EEEEeC----------------CeEEE--EEecCCceeec Confidence 99999999997654 344444432 23322221 2 233221 11111 11111111100 Q ss_pred eeecccccCCcc-cceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeech Q lcl|NC_019418. 225 RVNLSELYPDLQ-PVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPE 303 (527) Q Consensus 225 ~v~l~~~~~~l~-~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~ 303 (527) .+......+... ....-.++.++++++|++ ++.|+|+|+.+++++|++|.++|+++++++.....++|.. T Consensus 210 ~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n---------n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~ 280 (474) T protein:vir:96 210 YYHGEEHIQSHYYVGNKRVSWGRVPFIPFKN---------NPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILK 280 (474) T ss_pred cccccccccccccccccccCCCceeEEEecc---------CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeee Confidence 000000011000 011225677788888875 2569999999999999999999999999988777777744 Q ss_pred hHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccc Q lcl|NC_019418. 304 QMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQG 383 (527) Q Consensus 304 ~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g 383 (527) .+- ....++ +..+-..++.+.. +++++.++++++++..+.+...++.+.+.|...++.+.-+++. .++ T Consensus 281 g~~---~~~~~~-------~~~~~~~~~~i~~-~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~~~ 348 (474) T protein:vir:96 281 GYE---GQDLDE-------FMRNLKYYKAINV-DGDGSGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQDK-FGN 348 (474) T ss_pred cCC---cccccc-------hhhhhhcCceEEe-cCCCCceeEEeecCChHHHHHHHHHHHHHHHHHhCCccccccc-ccc Confidence 331 111011 1111112233332 2344568999999999999999999988888877765443322 245 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHh Q lcl|NC_019418. 384 VKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVA 463 (527) Q Consensus 384 ~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~ 463 (527) ..||.++++..+.+.++++.+++.|+++|+++++.|+.+.. ......+++|+|++++|.|..+.++. +++ T Consensus 349 n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~-------~~~~~~~i~i~f~~~~p~~~~e~~~~---~~~ 418 (474) T protein:vir:96 349 SPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYK-------LNIKVQDVEITFNFNVMVNELEQSQI---GVQ 418 (474) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CCcccceeeEEeccCCCcCHHHHHHH---HHh Confidence 67899999999999999999999999999999999987642 23345678999999999997776654 456 Q ss_pred cCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCc Q lcl|NC_019418. 464 AGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDD 523 (527) Q Consensus 464 aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 523 (527) +|+||.+|++..+++++| +++|++||++|++...+..++..++.... ..+++..++ T Consensus 419 ag~iS~et~~~~~~~v~d--~~~E~~ri~~E~~e~~~~~~~~~~~~~~~--~~d~~~e~~ 474 (474) T protein:vir:96 419 SQYLSKETVVTNHPWVDD--PVAELERIEQDNIDFNKQLPPLEGDANGR--AQDNESETN 474 (474) T ss_pred cCCCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhcccccccccccc--cCCCcccCC Confidence 899999999988766654 66899999998876555544443322111 111111111 No 29 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=100.00 E-value=3.2e-54 Score=313.90 Aligned_cols=462 Identities=11% Similarity=0.066 Sum_probs=305.0 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCcc--cc---CHHHHHHHHHHHHHhcCCCcccccccc---cCccccCceee Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKV--AV---TQSEFRRIQHNLAYYQSKFDDIEYTNT---DGDRKRRKMQH 72 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i--~~---~~~~~~~i~~~~~~y~g~~~~l~~~~~---~~~~~~~~~~~ 72 (527) |+.+.-+.+.+..-. .+-..+...++...| .| ..+.+.+++++.+||.|+++.+..+.. ....+..++++ T Consensus 6 ~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki~ 83 (481) T protein:vir:10 6 INNINTKFSPLANDD--FVVSDLAELLKEENLRNFISRHQTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRAV 83 (481) T ss_pred eehhchhcccccCce--eeeecchhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCccccccccccccceee Confidence 666555444433111 011111111111111 11 245678899999999999876532221 22223345789 Q ss_pred cchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe-CCeeEEEEEcCCc Q lcl|NC_019418. 73 LPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD-GDKIRVAFIQAPV 151 (527) Q Consensus 73 lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d-~~~~~i~~v~a~~ 151 (527) +|+|+.||+..|+|+|++|++++++++..++.|++++++|+|...+.++++.+++.|.+|+++|+| ++++++.+++|++ T Consensus 84 ~n~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~~i~~~~p~~ 163 (481) T protein:vir:10 84 HNYAKYVSRFIVGYLTGNPITITHQDNQTNDKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRDFEDRDTFKVLDPKS 163 (481) T ss_pred cchHHHHHHHHHhhhccCCceEecCChhHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCeEEEEEEcccc Confidence 999999999999999999999999999999999999999999999999999999999999999997 4679999999999 Q ss_pred eEEEEEcCC-ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeeccc Q lcl|NC_019418. 152 FLPLQSNTQ-DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSE 230 (527) Q Consensus 152 ~~P~~~d~~-~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~ 230 (527) ++|++.+.. +...+++. ++...+.....++.+|.|.. ..|.+ |+... | ...+.+ T Consensus 164 ~~~v~d~~~~~~~~~~i~--~~~~~~~~~~~~~~~~~y~~----------------~~i~~--~~~~~----~-~~~~~~ 218 (481) T protein:vir:10 164 TFVVYDQTLDKKVVAGVR--YFEKQDKDKVPVQHVEVYTT----------------DKIYY--IEIKG----G-TYHRVE 218 (481) T ss_pred eEEEEcCCCCCceEEEEE--EEEEeeCCCceEEEEEEEec----------------CeEEE--EEecC----C-ceeecc Confidence 999965543 34444432 22222222222334555531 12211 21111 0 011100 Q ss_pred ccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCC Q lcl|NC_019418. 231 LYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKV 310 (527) Q Consensus 231 ~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~ 310 (527) ..-++++++++++|++ +++|+|+|+++++++|++|.++|++.++++....++++-..+.. . T Consensus 219 --------~~~~~~g~vPvv~~~n---------~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~--~ 279 (481) T protein:vir:10 219 --------EVEHYYNDVPIIEYLN---------DQFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVD--L 279 (481) T ss_pred --------cccccCCceeEEEeec---------CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcC--C Confidence 0113456666777764 24699999999999999999999999999865555555333321 1 Q ss_pred CCCCcccccccccccccceeeecc---CCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchH Q lcl|NC_019418. 311 QDNQGNIAFKRRFDVEQNVYMQVG---AGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTA 387 (527) Q Consensus 311 ~~~~~~~~~~~~~d~~~~~~~~~~---~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TA 387 (527) +++.+. .+..+..++.... .+.++.+.+++++++++.+++...++.+.+.|...++.+..+++. .+|..|| T Consensus 280 ~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~~~n~Sg 353 (481) T protein:vir:10 280 DSEDAK-----AFRDANMIHLEPGTNANGSEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQ-FSGVQSG 353 (481) T ss_pred Cccchh-----hhhhccceeccccccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cccccHH Confidence 222211 1111222222211 123445578899999999999999988888787777766555543 3466789 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCC Q lcl|NC_019418. 388 TEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFA 467 (527) Q Consensus 388 tei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~ 467 (527) .++++..+.+.++++++++.|+.+|+++++.|+.+.+. .++......++++.|+++++.|..+.++.+++++ |++ T Consensus 354 ~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~---~~~~~~~~~~i~v~f~~~~~~~~~~~a~~~~kl~--g~i 428 (481) T protein:vir:10 354 ESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNL---TGLKQHNYAELTITFTPNLPKSMMESINAFNALS--GGV 428 (481) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---cCCCccccceeeEEeCCCCCcCHHHHHHHHHHHh--ccC Confidence 99999999999999999999999999999999987654 3344455678999999999999999999988874 999 Q ss_pred CHHHHHHhcCCCCHHHHHHHHHHHHHhccccccccc-CCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 468 TQKRGIAKTLGITEEEAEKELAEINGELPPESDAEL-ALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 468 s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) |.+|++..++.++| +++|++||++|+........ ...++..++. +..|++++ T Consensus 429 s~et~~~~l~~i~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~------~~~dd~~g 481 (481) T protein:vir:10 429 SESTRLSLLDFIDN--PKEELEKMQEEEAQREKQADKRGYGEAFENH------LNVDDSNG 481 (481) T ss_pred ChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhhhhccCCccCCCC------CCCCCCCC Confidence 99999977755543 67889999888754332211 1122222221 11122222 No 30 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=100.00 E-value=3e-55 Score=319.53 Aligned_cols=457 Identities=13% Similarity=0.098 Sum_probs=295.4 Q ss_pred CChH-HHHHHHHHHHHHHh--hcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccc---cC---ccccCcee Q lcl|NC_019418. 1 MSLI-QKVKDFFNRGRYNM--TTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNT---DG---DRKRRKMQ 71 (527) Q Consensus 1 m~~~-~~~k~~~~~~~~~~--~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~---~~---~~~~~~~~ 71 (527) |++= .....+|....-+. ..+-+.+ -|.-......|+.++++||.|+|+.+..... .+ ..+..+++ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~-----~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ki 81 (474) T protein:vir:95 7 MPWDKPYGEEVVEQLKPQFETQEEMIIR-----LIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDWRI 81 (474) T ss_pred cCCCCchhhHHHHhhhhccCChHHHHHH-----HHHHHHHHHHHHHHHHHHhcccCchhcccccccccccccccccccee Confidence 2110 01112222222110 1111111 1112234567889999999999987643322 11 12234578 Q ss_pred ecchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEEEEEcCC Q lcl|NC_019418. 72 HLPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRVAFIQAP 150 (527) Q Consensus 72 ~lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i~~v~a~ 150 (527) ++|+|+.||+..|+|||++|++++++++..++.|+.+++ |+|...+.++++.++++|.+|+++|+|. +++++.+++|. T Consensus 82 ~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~ 160 (474) T protein:vir:95 82 TTNFHQNLVDQKVSYVASKPVTYSCEDESVLKIIHDVLD-TRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAE 160 (474) T ss_pred ccchHHHHHHHHHhhhccCCceeccCchHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCcEEEEEEecCCCceEEEEEccc Confidence 899999999999999999999999999999999999986 6799999999999999999999999975 68999999999 Q ss_pred ceEEEEEcCC-ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecc Q lcl|NC_019418. 151 VFLPLQSNTQ-DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLS 229 (527) Q Consensus 151 ~~~P~~~d~~-~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~ 229 (527) +++|++.++. +...++ ++. +...+.. + ++.+.. ...+.+....+ .... .++. T Consensus 161 ~~~~v~d~~~~~~~~~~-i~~-~~~~~~~--~---~~~y~~---------------~~~~~~~~~~~----~~~~-~~~~ 213 (474) T protein:vir:95 161 QAIPIWVDKEREELKSF-IRY-YKFNNEE--K---VEFWTD---------------TTVTYYVLENG----GLIP-DYYY 213 (474) T ss_pred ceEEEEcCCCCCceEEE-EEE-EEEcCee--E---EEEEeC---------------CeEEEEEEcCC----cccc-cccc Confidence 9999976653 333333 322 2222221 1 222211 11111111111 0000 0000 Q ss_pred cccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCC Q lcl|NC_019418. 230 ELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLK 309 (527) Q Consensus 230 ~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~ 309 (527) . .........-+++.+++|++|++ +|.|+|+|+++++++|+||.++|+++++++.....+.+...+- T Consensus 214 ~-~~~~~~~~~~~~~g~iPvv~~~n---------n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~--- 280 (474) T protein:vir:95 214 G-ANHIQSHFSNGNWGRVPFIAFKN---------NPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYE--- 280 (474) T ss_pred C-cccccccccccCCCccceEeecC---------CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC--- Confidence 0 00112222335667778888764 3579999999999999999999999999987666666533321 Q ss_pred CCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHH Q lcl|NC_019418. 310 VQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATE 389 (527) Q Consensus 310 ~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAte 389 (527) +.. . ..+..+...+..+..+ +++++++++++++++++...++.+.+.|...++.+.-+++ ..+|..||.+ T Consensus 281 --~~~--~---~~~~~~~~~~~~i~~~--~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~~~n~Sg~A 350 (474) T protein:vir:95 281 --GQD--L---EEFMRGLKYYKAINVD--GDGGVETIQVEVPVSSTKEYIDLMRAYIMEFGQGVDFQTD-KFGSAPSGIA 350 (474) T ss_pred --ccc--c---hhhhhhhhccceeecc--CCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccc-cccccchHHH Confidence 110 0 1112222223334333 2346888999999999999999998888777665532221 1235678999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCH Q lcl|NC_019418. 390 IVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQ 469 (527) Q Consensus 390 i~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~ 469 (527) ++++++.+.++++.+++.|+++|+++++.|+.+.. ...+..+++|+|++++|.|..+.++.+ +.+|+||. T Consensus 351 lk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g-------~~~d~~~i~v~f~~~~p~d~~e~a~~~---~~~g~iS~ 420 (474) T protein:vir:95 351 LKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNN-------LKMDVKDIEISFNFNRMMNDAEQSQII---AQSQYLSR 420 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CCcccceeeEEeccCCCcCHHHHHHHH---HhcCCCch Confidence 99999999999999999999999999999987642 234567799999999999987776654 45799999 Q ss_pred HHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 470 KRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 470 ~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) ++++.++++++| +++|++||++|+............. +.+...+...+.+++.| T Consensus 421 et~i~~l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~-~~d~~~~~~~~~~~~~~ 474 (474) T protein:vir:95 421 ETLVKSSPLVDD--YKAELERIEQEQMEYNKQLPNLDDG-GADGAQQQERSNDKESE 474 (474) T ss_pred HHHHHhCCCCCC--HHHHHHHHHHHHHHHHhcccccccc-cCCCCcCCCCCccCCCC Confidence 999988765654 5688999998886555444443322 22222222222222222 No 31 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=100.00 E-value=1.6e-54 Score=315.56 Aligned_cols=458 Identities=11% Similarity=0.095 Sum_probs=294.3 Q ss_pred CChHH------HHHHHHHHHHHH--hhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc-C-----ccc Q lcl|NC_019418. 1 MSLIQ------KVKDFFNRGRYN--MTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD-G-----DRK 66 (527) Q Consensus 1 m~~~~------~~k~~~~~~~~~--~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~-~-----~~~ 66 (527) |.=+. ..+.+++...-. ...+-+.+. |..-.....++.++++||.|+|+.+...... + ..+ T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~-----i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~ 75 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRL-----VREHKENIDNITMGERYYNHHPDILDAPPKRDVNGDYDETK 75 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccCCcHHHHHHH-----HHHHHHHHHHHHHHHHHhcCCCchhcccccccccccccccc Confidence 22111 122222211100 000111111 1111234567888999999999866543211 1 122 Q ss_pred cCceeecchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe-CCeeEEE Q lcl|NC_019418. 67 RRKMQHLPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD-GDKIRVA 145 (527) Q Consensus 67 ~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d-~~~~~i~ 145 (527) ..+++++|+|+.||+.+|+|+|++|++++++++..++.|+++++ |+|...+.++++.|+++|.+|+++|+| ++++++. T Consensus 76 ~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~d~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~~~d~~g~~~~~ 154 (478) T protein:vir:10 76 PDWRMYTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQHTLN-HKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTF 154 (478) T ss_pred ccceeccchHHHHHHHHHhhhccCCeeeecCChHHHHHHHHHHh-cCHHHHHHHHHHHHHhcCeEEEEEEecCCCeeEEE Confidence 34578899999999999999999999999999999999999986 689999999999999999999999997 4789999 Q ss_pred EEcCCceEEEEEcCC-ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCcccc-- Q lcl|NC_019418. 146 FIQAPVFLPLQSNTQ-DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQL-- 222 (527) Q Consensus 146 ~v~a~~~~P~~~d~~-~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~l-- 222 (527) +++|.+++|++.++. +...+++. .+...+.. + .|.|.. ..|.+ |.......+ T Consensus 155 ~~~p~~~~~i~d~~~~~~~~~~v~--~~~~~~~~--~---~~~y~~----------------~~i~~--~~~~~~~~~~~ 209 (478) T protein:vir:10 155 RVPAEQAVPIWTNKERDELQAFIR--VYELDGAE--R---VEYWTK----------------DDVTY--YELKEGQLIPD 209 (478) T ss_pred EEcccceEEEEcCCCCCceEEEEE--EEEecCce--E---EEEEeC----------------CeEEE--EEEcCCeeecc Confidence 999999999976543 44444432 22222221 1 222221 11111 111100000 Q ss_pred -Cc-eeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceee Q lcl|NC_019418. 223 -GE-RVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVI 300 (527) Q Consensus 223 -G~-~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~ 300 (527) .. .......+ ......+++.++++++|++ +|+|+|+|+++++|+|++|.++|+++++++.....+. T Consensus 210 ~~~~~~~~~~~~---~~~~~~~~~~~vPvv~~~n---------~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~ 277 (478) T protein:vir:10 210 FYRSDDHIQPHY---YQGNKLMSWGRVPFIPFKN---------NPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIY 277 (478) T ss_pred ccccccccccce---ecccccccCCccceEEecc---------CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcee Confidence 00 00000000 0011124566777777754 3679999999999999999999999999987666666 Q ss_pred echhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccc Q lcl|NC_019418. 301 VPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFD 380 (527) Q Consensus 301 v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~ 380 (527) +...+- ....+ . ...+... +..+.....+++.+++++++++++++...++.+.+.|...++.+..+++. T Consensus 278 ~~~g~~---~~~~~-~----~~~~~~~--~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~- 346 (478) T protein:vir:10 278 ILKGYE---GEDMK-D----FMHNLKY--YKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDK- 346 (478) T ss_pred eeecCC---ccccc-h----hhhhhhh--cceEEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCccc- Confidence 643331 11100 0 1111111 22222332344568889999999999888888888776666554322221 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHH Q lcl|NC_019418. 381 GQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMK 460 (527) Q Consensus 381 ~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~ 460 (527) .+|..||.++++.++.+.++++.+++.|+.+|++++++|+.+.. ...+..+++|+|++++|.|..+.++++++ T Consensus 347 ~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~g-------~~~~~~~i~i~f~~~~p~d~~e~a~~~~k 419 (478) T protein:vir:10 347 FGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYR-------LDVKVQDIEITFNFNVMVNELENSQIAMN 419 (478) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CCcccccceEEecCCCCCCHHHHHHHHHH Confidence 23567899999999999999999999999999999999987642 23456678999999999999999988877 Q ss_pred HHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_019418. 461 MVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDE 524 (527) Q Consensus 461 ~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (527) + +|+||.+|++..++.++| +++|++||++|+....+....+..+ ...+.++.+++..+| T Consensus 420 l--~g~iS~et~~~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 478 (478) T protein:vir:10 420 S--TGLLSKETILSNHAWVED--PVAEMERIEQENIELNQQLPDIEEG-LNGEQQRQSENNQPE 478 (478) T ss_pred H--hCCCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhccccccc-cCCCCCCCCCCCCCC Confidence 6 699999999877744443 6688999999887655544444322 222222233333333 No 32 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=100.00 E-value=2.8e-54 Score=314.15 Aligned_cols=475 Identities=11% Similarity=0.029 Sum_probs=299.2 Q ss_pred ChHHHHHHHHHHHHHHhhcccchhhhccCcccc----CHHHHHHHHHHHHHhcCCCccccccc-c-cCccccCceeecch Q lcl|NC_019418. 2 SLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAV----TQSEFRRIQHNLAYYQSKFDDIEYTN-T-DGDRKRRKMQHLPI 75 (527) Q Consensus 2 ~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~----~~~~~~~i~~~~~~y~g~~~~l~~~~-~-~~~~~~~~~~~lnl 75 (527) ++++ +.+...... .+...++.++...=..+ -.....|++++.+||.|+++.+..+. . ....+..+++++|+ T Consensus 1 ~~~~-~~~~~~~~~--~~~~~~~~l~~~~i~~li~~~~~~~~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki~~n~ 77 (506) T protein:vir:94 1 MDYD-LTEHKQANL--IYQESLENLTPNKIMKFITHHFNYQRPRLEMLDDYYQGYNLKILDKQSRRHEDGKADHRATHSF 77 (506) T ss_pred CCcc-hhhhhccee--ecccchhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccCCcceeecch Confidence 1111 222211111 01122222111100000 12345678999999999987653332 1 23334456789999 Q ss_pred HHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEEEEEcCCceEE Q lcl|NC_019418. 76 ARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRVAFIQAPVFLP 154 (527) Q Consensus 76 ~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i~~v~a~~~~P 154 (527) |+.||+..|+|||++|++++++++..++.|++++++|+|...+.++++.++++|.+|+++|+|. +++++.+++|.+++| T Consensus 78 ~~~Iv~~~~~~l~G~p~~~~~~d~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded~~~~i~~~~p~~~~~ 157 (506) T protein:vir:94 78 AKYIADFQTSYSVGNPINVKLPDDGSNSGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGEDNEEHLAKLDPLDTFV 157 (506) T ss_pred HHHHHHHhhhhhcccCceeecCcchHHHHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEE Confidence 9999999999999999999999999999999999999999999999999999999999999974 789999999999999 Q ss_pred EEEcCC-ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccC Q lcl|NC_019418. 155 LQSNTQ-DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYP 233 (527) Q Consensus 155 ~~~d~~-~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~ 233 (527) +++++. +...+++............. ++...+++.+. ...+. .|.. ...|..+ T Consensus 158 v~dd~~~~~~~~~v~~~~~~~~~~~~~-~~~~~~~~~yt-------------~~~~~--~~~~---~~~~~~~------- 211 (506) T protein:vir:94 158 IYSTDVDPKPIMAVRYHQIELVDDNQV-STINYVPETWT-------------ADTYT--LYNP---TPIMGKM------- 211 (506) T ss_pred EecCCCCCceEEEEEEEeeeeccCCce-eEEEEEEEEEe-------------CceEE--Eecc---ccCccce------- Confidence 987654 33333332222222222222 22233333211 11111 1211 1111111 Q ss_pred CcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCC-- Q lcl|NC_019418. 234 DLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQ-- 311 (527) Q Consensus 234 ~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~-- 311 (527) .....+++.++++++|+++ +.|+|+|+++++++|++|.++|++++.++.....+++-..+...... T Consensus 212 ---~~~~~~~~g~vPvv~~~n~---------~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~ 279 (506) T protein:vir:94 212 ---QVDTTKPITTFPVVEFKNS---------NFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGS 279 (506) T ss_pred ---eccccccCCccceEEecCC---------CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccch Confidence 0112256677778877642 35899999999999999999999999887533333332211100000 Q ss_pred ----------CCCc-cccc-ccccccccceeeeccC-------CCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCC Q lcl|NC_019418. 312 ----------DNQG-NIAF-KRRFDVEQNVYMQVGA-------GNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGV 372 (527) Q Consensus 312 ----------~~~~-~~~~-~~~~d~~~~~~~~~~~-------~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~ 372 (527) ..+. .... ...+......+..+.. +......++++++++..+.+...++.+.+.|...++. T Consensus 280 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~ 359 (506) T protein:vir:94 280 DMMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHT 359 (506) T ss_pred hccccccccccccccccccchhHHHhhhhhcCeeeecccccccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCc Confidence 0000 0000 0000000000111111 1122345888999999999999999999988887776 Q ss_pred CcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHH Q lcl|NC_019418. 373 SSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRH 452 (527) Q Consensus 373 s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~ 452 (527) +..+++ ..+|..||.+++++++.+.++++.+++.|+++|+++++.|+.+.+.. .++...+..+++|.|++++|.|.. T Consensus 360 p~~~~~-~~~~n~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~--~~~~~~d~~~i~i~f~~~~p~d~~ 436 (506) T protein:vir:94 360 PDLTDE-NFASNSSGVAMQYKVLGTVELASTKRRMFERGLYARYQIISDIENSI--HGDWTFDPQELTFTFRDNLPADNI 436 (506) T ss_pred cccccc-cccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCccccccccceEEeCCCCCcCHH Confidence 643322 22356789999999999999999999999999999999999887542 233345566799999999999999 Q ss_pred HHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 453 AELDYWMKMVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 453 ~~~~~~~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) +.++++.++ +|+||.+|++.++++++| +++|++||++|++..+........ .++.+...+..+.++|. T Consensus 437 e~a~~~~kl--~g~iS~et~~~~lp~v~d--~~~E~~ri~~E~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~e 504 (506) T protein:vir:94 437 SQIKALVQA--GATLPQKYLYQQLPGVTN--PQDIVDMMKEQSANGDYSFDQNGV---ISNDGQTNTTATQTDEE 504 (506) T ss_pred HHHHHHHHH--hccCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHhhcchhhcC---CCcccCccccccccccC Confidence 999988887 599999999988866665 567899999988654333222211 11111122222222333 No 33 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=100.00 E-value=5.8e-54 Score=312.43 Aligned_cols=441 Identities=14% Similarity=0.155 Sum_probs=297.2 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc------CccccCceeecc Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD------GDRKRRKMQHLP 74 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~------~~~~~~~~~~ln 74 (527) +...+-|++++.+. ...+.++.++++||.|+|+.+...... ...+.++++++| T Consensus 34 e~~~~~i~~~i~~~---------------------~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n 92 (483) T protein:vir:12 34 ETLEEMIVRYIKQH---------------------LEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITN 92 (483) T ss_pred hhHHHHHHHHHHHH---------------------HHHHHHHHHHHHHhccccccccccccccccccccccccccccccc Confidence 23333333333221 124567888999999999877543221 122334578899 Q ss_pred hHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEEEEEcCCceE Q lcl|NC_019418. 75 IARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRVAFIQAPVFL 153 (527) Q Consensus 75 l~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i~~v~a~~~~ 153 (527) +|+.||+..|+|||++|++++++++..++.|+++++ |+|...+.+++..++++|.+|+.+|+|. +++++.+++|.+++ T Consensus 93 ~~k~Ivd~~~~~l~G~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~y~~v~~d~d~~~~i~~~~p~~~~ 171 (483) T protein:vir:12 93 FHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGI 171 (483) T ss_pred hHHHHHHHHhhhhcccCceeccCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEEcCCCceEEEEEcccceE Confidence 999999999999999999999999999999999986 6899999999999999999999999975 67999999999999 Q ss_pred EEEEcC-CceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecc-cc Q lcl|NC_019418. 154 PLQSNT-QDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLS-EL 231 (527) Q Consensus 154 P~~~d~-~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~-~~ 231 (527) |++.++ .+...+++.. +..++.. + +|+|. +..|.+-.+.. |..++-. .. T Consensus 172 ~v~d~~~~~~~~~~ir~--~~~~~~~--~---~~~y~----------------~~~v~~~~~~~------~~~~~~~~~~ 222 (483) T protein:vir:12 172 PIWTDKEHEELEAFIRM--YKLENET--K---VEYWD----------------KVTVNYYVYEN------GSLIPDYSNN 222 (483) T ss_pred EEEcCCCCCceEEEEEE--EEeecce--E---EEEEe----------------cCeEEEEEEeC------Ceeeeccccc Confidence 997544 3455444432 2222221 1 23322 11222111111 0000000 00 Q ss_pred cCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCC Q lcl|NC_019418. 232 YPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQ 311 (527) Q Consensus 232 ~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~ 311 (527) .........-.++.++|+++|++ ++.|.|+|+++++|+|++|.++|++++.++....++.+-..+ . T Consensus 223 ~~~~~~~~~~~~~g~vPvv~~~n---------n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~-----~ 288 (483) T protein:vir:12 223 LENSKTHFSTGSWGKIPFIPFKN---------NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNY-----D 288 (483) T ss_pred ccccccccccCCCCccceEEecC---------CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecC-----C Confidence 11111222335677778888864 256999999999999999999999999998655444441111 1 Q ss_pred CCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHH Q lcl|NC_019418. 312 DNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIV 391 (527) Q Consensus 312 ~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~ 391 (527) ... . ..+..+-..+..+..+ +++.+++++++++++.+...++.+.+.|...++.+.-+++ ..+|..||.+++ T Consensus 289 ~~~--~---~~~~~~~~~~~~~~~~--~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~n~Sg~Al~ 360 (483) T protein:vir:12 289 DQE--L---PEFKRLLRYYGAIKVS--DNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSD-KFGSAPSGVALE 360 (483) T ss_pred ccc--c---hhHHHhhhhccccccC--CCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCcc-ccccCcHHHHHH Confidence 110 0 0011111122233333 2346888999999999988888888877666655433332 123456889999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHH Q lcl|NC_019418. 392 SENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKR 471 (527) Q Consensus 392 s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~ 471 (527) +.++.+..+++.+++.|+.+|+++++.|+.+.. ......+++|.|++.+|.|..+.++..+++ +|+||.+| T Consensus 361 ~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~-------~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl--~GiiS~et 431 (483) T protein:vir:12 361 FLYTNLNLKADKLARKAKVAIQELLWFVFEHFD-------IKGEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHET 431 (483) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-------CCCccceeeEEeCCCCCCCHHHHHHHHHHH--hccCchHH Confidence 999999999999999999999999999987643 223567789999999999999999998887 59999999 Q ss_pred HHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 472 GIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 472 ~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) ++..+++++| +++|++||++|+.......++.++.+.++. .++.+.++.|.| T Consensus 432 ~~~~~~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~-~~~~~~~~~e~e 483 (483) T protein:vir:12 432 VLENHPFVED--LQAELERIEQEQMEYNKQLPNLDDGGADGA-QQQERSNNKESE 483 (483) T ss_pred HHHhCCCCCC--HHHHHHHHHHHHHHHHhhcccccccccCCc-ccCCCCCcccCC Confidence 9987766554 678999999988766555555544333332 223333333334 No 34 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=100.00 E-value=4.3e-54 Score=313.15 Aligned_cols=458 Identities=12% Similarity=0.097 Sum_probs=296.3 Q ss_pred CChH------HHHHHHHHHHHHH--hhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc------Cccc Q lcl|NC_019418. 1 MSLI------QKVKDFFNRGRYN--MTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD------GDRK 66 (527) Q Consensus 1 m~~~------~~~k~~~~~~~~~--~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~------~~~~ 66 (527) |.=+ ...+.+|+.+.-+ +....+.+. |.--.....|+.++++||.|+|+.+...... ...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-----i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~ 75 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRL-----VREHKENIDNITMGERYYNHHPDILDAPFKRDVNGDYDETK 75 (478) T ss_pred CccccccCCchhhhHHHHHhhhccCChHHHHHHH-----HHHHHHHHHHHHHHHHHhcccccccccchhhhccccccccc Confidence 1111 1112222211100 000000010 1111235678899999999999876533221 2233 Q ss_pred cCceeecchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEEE Q lcl|NC_019418. 67 RRKMQHLPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRVA 145 (527) Q Consensus 67 ~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i~ 145 (527) ...++++|+|+.||+..|+|+|++|++++++++..++.|+.+++ |+|...+.++++.|+++|.+|+++|+|. +++++. T Consensus 76 ~~~ki~~n~~k~ivd~~~~yl~g~p~~~~~~~~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~~~ 154 (478) T protein:vir:10 76 PDWRMYTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQHTLN-HKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTF 154 (478) T ss_pred ccceeccchHHHHHHHHhhhhcccCceeecCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEE Confidence 34578899999999999999999999999999999999999985 7899999999999999999999999984 789999 Q ss_pred EEcCCceEEEEEcC-CceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCc Q lcl|NC_019418. 146 FIQAPVFLPLQSNT-QDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGE 224 (527) Q Consensus 146 ~v~a~~~~P~~~d~-~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~ 224 (527) +++|.+++|++.++ .+...+++. .+...+.. + +|.|.. ..|.+ |.... ..+.. T Consensus 155 ~~~p~~~~~v~d~~~~~~~~~~ir--~~~~~~~~--~---~~~y~~----------------~~i~~--~~~~~-~~~~~ 208 (478) T protein:vir:10 155 RVPAEQAVPIWTNKERDELQAFIR--VYELDGAE--R---VEYWTK----------------DDVTF--YELKE-GQLIP 208 (478) T ss_pred EEcccceEEEEcCCCCCceEEEEE--EEeeeCce--E---EEEEeC----------------CcEEE--EEecC-Ceeec Confidence 99999999997654 345544432 23222221 1 233321 11111 11110 11110 Q ss_pred eeecccccCCcc----cceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceee Q lcl|NC_019418. 225 RVNLSELYPDLQ----PVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVI 300 (527) Q Consensus 225 ~v~l~~~~~~l~----~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~ 300 (527) ..... +.... .....+++.++++++|++ ++.|+|+|+++++++|++|.++|+++++++....++. T Consensus 209 ~~~~~--~~~~~~~~~~~~~~~~~g~vPvv~~~n---------~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~ 277 (478) T protein:vir:10 209 DFYRS--EDHIQPHYYQGNKLMSWGRVPFIPFKN---------NPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIY 277 (478) T ss_pred ccccc--ccccccceecccccccCCcceEEEecc---------CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcce Confidence 00000 00000 111125667777777764 2569999999999999999999999999987555555 Q ss_pred echhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccc Q lcl|NC_019418. 301 VPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFD 380 (527) Q Consensus 301 v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~ 380 (527) +...+- .+..+ . +..+..-+..+.....+++.+++++++++.+++...++.+.+.|...++.+.-+++ . T Consensus 278 ~~~g~~---~~~~~-~------~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~ 346 (478) T protein:vir:10 278 ILKGYE---GEDMK-D------FMHNLKYYKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQD-K 346 (478) T ss_pred eeecCC---ccccc-c------hhhhhhhCceeEecCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCcc-c Confidence 533221 11111 1 11111112222233334456899999999999999999888887776664422221 1 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHH Q lcl|NC_019418. 381 GQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMK 460 (527) Q Consensus 381 ~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~ 460 (527) .+|..||.++++.++.+.++++.+++.|+++|+++++.|+.+.. ...+..+++|+|++++|.|..+.++..++ T Consensus 347 ~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~-------~~~d~~~i~i~f~~~~p~~~~e~~~~~~~ 419 (478) T protein:vir:10 347 FGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYR-------LDVRVQDIEITFNFNVMVNELENSQIAMN 419 (478) T ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CCcccccceEEeCCCCCCCHHHHHHHHHH Confidence 23567899999999999999999999999999999999987643 22455678999999999999998888776 Q ss_pred HHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_019418. 461 MVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDE 524 (527) Q Consensus 461 ~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (527) + +|+||.+|++..+++++| +++|++||++|+....+..+.+++... ++.....++.+.| T Consensus 420 ~--~g~iS~et~i~~~~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~-d~~~~~~~d~~~e 478 (478) T protein:vir:10 420 S--TGLLSKETILGNHSWVQD--PVAEMERIEQENIELNQQLPDIEEGLN-DEQQRQSEDNQSE 478 (478) T ss_pred H--hCCCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHhccccCCCCc-ccccccCcCCCCC Confidence 5 699999999977755554 679999999998876666555443222 2222222222222 No 35 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=100.00 E-value=1.1e-53 Score=311.01 Aligned_cols=436 Identities=11% Similarity=0.071 Sum_probs=298.6 Q ss_pred CCh-----HHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecch Q lcl|NC_019418. 1 MSL-----IQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPI 75 (527) Q Consensus 1 m~~-----~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl 75 (527) |+- .+.|++++++. .....|++++++||.|+++++..+.. ...+..+++++|+ T Consensus 11 ~p~d~~~~~~~l~~~i~~~---------------------~~~~~r~~~~~~yy~g~~~i~~~~~~-~~~~~~~ki~~n~ 68 (453) T protein:vir:39 11 FPKDEPITNEVVTKFMEKH---------------------RLEVARYEYLKNMYRGIMAIDAEPTK-DLWKPDNRLTVNF 68 (453) T ss_pred cCCCCCCCHHHHHHHHHHH---------------------HHHHHHHHHHHHHhhccCchhcCCCc-cccCccceeecch Confidence 111 12233333221 12346888899999999987765433 2233456788999 Q ss_pred HHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEEEEEcCCceEE Q lcl|NC_019418. 76 ARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRVAFIQAPVFLP 154 (527) Q Consensus 76 ~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i~~v~a~~~~P 154 (527) |+.||+.+|+|||++|++++++++..++.|++++++|+|...+.++++.+++.|.+|+++|+|. +.+++.+++|.+++| T Consensus 69 ~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~ 148 (453) T protein:vir:39 69 TKYIVDTFTGYFNGIPVKKSHSDKETLSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNEETQTNVIYNTPENMFM 148 (453) T ss_pred HHHHHHHHhhhhcccCceeccCChHHHHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEE Confidence 9999999999999999999999999999999999999999999999999999999999999975 679999999999999 Q ss_pred EEEcCCce-EEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccC Q lcl|NC_019418. 155 LQSNTQDV-SSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYP 233 (527) Q Consensus 155 ~~~d~~~~-~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~ 233 (527) ++.+..+. ..+++ +++..++ . .+.+|.|. +.+|.+ |....+. ..+ T Consensus 149 v~d~~~~~~~~~~i--r~~~~~~--~--~~~~~~yt----------------~~~i~~--~~~~~~~-----~~~----- 194 (453) T protein:vir:39 149 VYDDTIKQEPLFAV--RYGYDDD--Y--KLYGEVYT----------------KETTYA--LNGTMGF-----YNM----- 194 (453) T ss_pred EecCCCCCeEEEEE--EEEEeCC--e--EEEEEEEe----------------CCeEEE--EEecCCc-----eee----- Confidence 97655443 33333 3332222 1 22345543 222221 2211100 000 Q ss_pred CcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCC Q lcl|NC_019418. 234 DLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDN 313 (527) Q Consensus 234 ~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~ 313 (527) ....-++++++++++|++ .+.|+|+|+.+++++|++|+++|++++.++.....+.+-..+ ..+++ T Consensus 195 ---~~~~~~~~g~vPvv~~~n---------~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~---~~~~~ 259 (453) T protein:vir:39 195 ---TEQAPNPFDDLPVVEFYF---------NEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGA---AVEEE 259 (453) T ss_pred ---ecccccCCCceeEEEecC---------CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecC---CCCch Confidence 011124667777777764 246999999999999999999999999997644444441111 11111 Q ss_pred CcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHH Q lcl|NC_019418. 314 QGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSE 393 (527) Q Consensus 314 ~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~ 393 (527) .. ..+....-+......+.+.++.+.++++++..+.+.+.++.+.+.|...++.+. +++...|..|+.+++++ T Consensus 260 ~~-----~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~~gn~Sg~Al~~~ 332 (453) T protein:vir:39 260 DL-----KNIRSNRVINYYGESSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVAN--ISDESFGSSSGVSLAYK 332 (453) T ss_pred hh-----hhhhhcceeeecCCCCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcc--cccccccCChHHHHHHH Confidence 10 112222212222222333456789999999999999999998888877766542 33444456789999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHH Q lcl|NC_019418. 394 NSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGI 473 (527) Q Consensus 394 ~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i 473 (527) .+.+..+++.+++.|+.+|+++++.|+.+....+ ......+|+|+|+++++.|..+.++..+++ +|+||.+|++ T Consensus 333 ~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~----~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl--~g~is~et~l 406 (453) T protein:vir:39 333 LQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVS----NKEAWKDIEYTFTRNEPKDIKEQAETANIL--MGITSQETAL 406 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC----CccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHH Confidence 9999999999999999999999999998865422 234556789999999999999999988876 5999999999 Q ss_pred HhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 474 AKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 474 ~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) ..+++++| +++|++||++|+....+..... .++..+.+++.+.+++| T Consensus 407 ~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~----~~~~~~~~~~~~~~~~e 453 (453) T protein:vir:39 407 SVISVIPD--VQAEMEKIKKEEASTAIFDKDK----QPSEKGTDTVVPETNEE 453 (453) T ss_pred HhCCCCCC--HHHHHHHHHHHHHHHHHHHHhc----cCCCCCCCCCCCCcCCC Confidence 77755543 6788999999987554332211 12222223333334444 No 36 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=100.00 E-value=3.9e-53 Score=307.93 Aligned_cols=445 Identities=12% Similarity=0.073 Sum_probs=297.8 Q ss_pred CC-----hHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecch Q lcl|NC_019418. 1 MS-----LIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPI 75 (527) Q Consensus 1 m~-----~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl 75 (527) |+ ..+.|+++++++. .....|+++.++||.|+++.+......+ +..+++++|+ T Consensus 19 ~~~~~~~~~~~i~~~i~~~~--------------------~~~~~~~~~l~~Yy~g~~~i~~~~~~~~--~~~~ki~~n~ 76 (470) T protein:vir:99 19 FPKGEKLTSNELLGFIAYNE--------------------TVLKPRYRENMKLYLGKHKILTAPEKET--GADNRIVVNS 76 (470) T ss_pred eCCCCCcCHHHHHHHHHHHH--------------------HhhHHHHHHHHHHhccccccccCccccc--CCcceeecch Confidence 11 1233444444321 1234678889999999998775543332 3356788999 Q ss_pred HHHHHHHHhhhhhcccceEeeCC-HHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe-CCeeEEEEEcCCceE Q lcl|NC_019418. 76 ARTAAKKIASLVYNEQAEISAED-ETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD-GDKIRVAFIQAPVFL 153 (527) Q Consensus 76 ~~~i~~~~A~ll~~e~~~i~~~d-~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d-~~~~~i~~v~a~~~~ 153 (527) |+.||+..++|+|++|+++++++ ...++.|++++.+|+|...+.+++..++++|.+|+++|++ ++++++.+++|.+++ T Consensus 77 ~~~Ivd~~~~~l~g~p~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~~p~~~~ 156 (470) T protein:vir:99 77 AKYVVDVYNGYFCGIEPKLALLNDSSKIDEIARWNRQENFFDTINEISKQCDIFGRSIASIYQGEDARPHLMYSSPNHAF 156 (470) T ss_pred HHHHHHHHhhhhccCCeeEeeCCchhHHHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeEEEEEEccceeE Confidence 99999999999999999999865 4567899999999999999999999999999999999997 467999999999999 Q ss_pred EEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccC Q lcl|NC_019418. 154 PLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYP 233 (527) Q Consensus 154 P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~ 233 (527) |++.++.+....++++.+...++....+|. +.+. ....++ |... .++....+ T Consensus 157 ~i~d~~~~~~~~~~vr~~~~~~~~~~~~~~--~~~~-------------~~~~~~-----~~~~---~~~~~~~~----- 208 (470) T protein:vir:99 157 IIYDDTVQRQPLAFVHYQIDNSNNWTDAYG--VIQY-------------ADKFYK-----FKGY---DIEEDTNA----- 208 (470) T ss_pred EEEcCCCCcceEEEEEEEEEecCCeeEEEE--EEEe-------------cCeEEE-----EEec---cccccccc----- Confidence 997665443333332222222222222222 2111 011111 1111 11111111 Q ss_pred CcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCC Q lcl|NC_019418. 234 DLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDN 313 (527) Q Consensus 234 ~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~ 313 (527) .....+++.++++++|++ .++|+|+|+++++++|++|.++|++++.++....++.+-..+... ..+ T Consensus 209 ---~~~~~~~~g~vPvv~~~n---------~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~--~~~ 274 (470) T protein:vir:99 209 ---AGYAINPYGLVPAVEFFE---------NEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLP--EDD 274 (470) T ss_pred ---ccccccCCCccceEeecC---------CCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcc--ccc Confidence 111224566777777764 246999999999999999999999999998766666664443211 111 Q ss_pred CcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHH Q lcl|NC_019418. 314 QGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSE 393 (527) Q Consensus 314 ~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~ 393 (527) .|+.. ..+.....+... ..+.+..+.++++++++..+.+...++.+.+.|...++.+..+++.. +|..||+++++. T Consensus 275 ~g~~~--~~~~~~~~~~~~-~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Ai~~~ 350 (470) T protein:vir:99 275 EGNPK--FDFKNNRVLYVS-QLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKNF-AGNSSGVALQYK 350 (470) T ss_pred ccchh--hhhhhcceeeec-CCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCcccccccc-ccCchHHHHHHH Confidence 22211 111111111111 22234455689999999999999999999998888888765443322 456789999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHH Q lcl|NC_019418. 394 NSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGI 473 (527) Q Consensus 394 ~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i 473 (527) .+.+.+++..+++.|+.+|+++++.|+.+... .........+++|.|++++|.|..+.++.+.+++ |+||.+|++ T Consensus 351 ~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~---~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl~--giis~et~l 425 (470) T protein:vir:99 351 LFAMKNKADSKERKFDKSLMQLYRIVLATLFN---NKQDQELWSELDFKFTRNLPEDMASAIDNAKNAE--GIVSKKTQL 425 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---cCCcccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHH Confidence 99999999999999999999999999877543 2233345668999999999999999999888875 999999999 Q ss_pred HhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 474 AKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 474 ~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) ..++++ + +++|++||++|+.............. +. .+..+++||| T Consensus 426 ~~l~~v-d--~~~E~eri~~E~~~~~~~~~~~~~~~----d~-~~~d~~~ee~ 470 (470) T protein:vir:99 426 GMIPDI-E--PDAEMKQIAKEKADAIKQTQQLSMPI----DI-LKRDNNAEEE 470 (470) T ss_pred HhCCCC-C--HHHHHHHHHHHHHHHHHHHHhhcCCC----Cc-CCCCCCccCC Confidence 887655 3 56788999888754332222111100 00 1112233333 No 37 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=100.00 E-value=1.7e-53 Score=309.88 Aligned_cols=457 Identities=14% Similarity=0.139 Sum_probs=294.9 Q ss_pred CChHHHHHHHHHHHHHHh-h-cccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc-C-----ccccCceee Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNM-T-TSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD-G-----DRKRRKMQH 72 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~-~-~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~-~-----~~~~~~~~~ 72 (527) |+.-..+++=+=++-... . .+-+.. -|.-..+.+.|+.++.+||.|+|+.+.+.... + ..+.+++++ T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~i~~-----~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~ 79 (472) T protein:vir:93 5 QPTQTEIFDAIVRTNNKPETLEEMIVR-----YIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMI 79 (472) T ss_pred CCcchhhhhceeeecCchhhHHHHHHH-----HHHHHHHHHHHHHHHHHHhccccccccccchhhccccccccccccccc Confidence 222111111110100000 0 000000 01112345678999999999999876543221 1 122345678 Q ss_pred cchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEEEEEcCCc Q lcl|NC_019418. 73 LPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRVAFIQAPV 151 (527) Q Consensus 73 lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i~~v~a~~ 151 (527) +|+|+.||+..|+++|++|++++++++...+.|+.+++ |+|...+.+++..++++|.+|+.+|+|. +++++.+++|.+ T Consensus 80 ~n~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~d~~~~i~~~~p~~ 158 (472) T protein:vir:93 80 TNFHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQ 158 (472) T ss_pred cchHHHHHHHHhhhhcccCeeeccCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCeEEEEEEECCCCceEEEEEcccc Confidence 89999999999999999999999999999999999985 6899999999999999999999999975 679999999999 Q ss_pred eEEEEEcC-CceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeec-c Q lcl|NC_019418. 152 FLPLQSNT-QDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNL-S 229 (527) Q Consensus 152 ~~P~~~d~-~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l-~ 229 (527) ++|++.++ .+...+++ +.+..++... +|++. .+.+.+-.+.. .. .+.- . T Consensus 159 ~~~i~d~~~~~~~~~~i--r~~~~~~~~~-----~~~~~----------------~~~~~~~~~~~--~~----~~~~~~ 209 (472) T protein:vir:93 159 GIPIWTDKEHEELEAFI--RMYKLENETK-----VEYWD----------------KVTVNYYVYEN--GS----LIPDYS 209 (472) T ss_pred eEEEEcCCCCCceEEEE--EEEEeeccee-----EEEEe----------------cCeEEEEEEec--Ce----eeeccc Confidence 99997544 34444443 2333322221 22221 11111111111 00 0000 0 Q ss_pred cccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCC Q lcl|NC_019418. 230 ELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLK 309 (527) Q Consensus 230 ~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~ 309 (527) ...........-+++.++++++|++ +++|+|+|+++++++|++|.++|+++++++....++.+-..+- T Consensus 210 ~~~~~~~~~~~~~~~~~vPvv~~~n---------n~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~--- 277 (472) T protein:vir:93 210 NNLENSKTHFSTGSWGKIPFIPFKN---------NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD--- 277 (472) T ss_pred ccccccccccccCCCCCcceEEecC---------CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCC--- Confidence 0011112233346778888888874 2579999999999999999999999999987555555522210 Q ss_pred CCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHH Q lcl|NC_019418. 310 VQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATE 389 (527) Q Consensus 310 ~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAte 389 (527) ..+.+ .+...-..+..+..+ ++++++++++++.++++...++.+.+.|...++.+..+++. .+|..||.+ T Consensus 278 -~~~~~------~~~~~~~~~~~~~~~--~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~~~n~Sg~A 347 (472) T protein:vir:93 278 -DQELP------EFKRLLRYYGAIKVS--DNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDK-FGSAPSGVA 347 (472) T ss_pred -cccch------hhHHHHhhccccccC--CCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccc-cccCchHHH Confidence 00001 011111122233333 23458888889999999999998888777776655433322 235568889 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCH Q lcl|NC_019418. 390 IVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQ 469 (527) Q Consensus 390 i~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~ 469 (527) +++.++.+..+++++++.|+.+|+++++.|+.+.. ......+++|.|++.+|.|..++++..+++ +|++|. T Consensus 348 l~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~-------~~~~~~~i~v~f~~~~p~~~~~~~~~~~k~--~giis~ 418 (472) T protein:vir:93 348 LEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD-------IKGEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSH 418 (472) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CCcccceeeEEeCCCCCCCHHHHHHHHHHH--hccCch Confidence 99999999999999999999999999999887643 223566789999999999999999998886 599999 Q ss_pred HHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 470 KRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 470 ~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) +|++..+++++| +++|++||++|+........+.+..+.++. .+..+++..+-| T Consensus 419 et~l~~l~~~~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~-~~~~~~~~~~~e 472 (472) T protein:vir:93 419 ETVLENHPFVED--LQAELERIEQEQMEYNKQLPNLDDGGADGA-QQQERSNNKESE 472 (472) T ss_pred HHHHHhCCCCCC--HHHHHHHHHHHHHHHHHhccCcCcccCCCC-CCCCCCCcccCC Confidence 999988866654 678999999887655555544433222211 111122112222 No 38 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=100.00 E-value=1.1e-52 Score=305.47 Aligned_cols=445 Identities=12% Similarity=0.109 Sum_probs=301.8 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccc--------------cCccc Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNT--------------DGDRK 66 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~--------------~~~~~ 66 (527) |.+ +.++..|.+.. .+. .+...++.+.++||.|+|+.+..+.. ....+ T Consensus 1 ~~~-e~~~~~i~~~~----~~~-------------~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~ 62 (471) T protein:vir:10 1 MEI-EVIKKIISSQM----VKH-------------GKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRN 62 (471) T ss_pred CCH-HHHHHHHHHHH----HHH-------------HHHHHHHHHHHHHhccccccccccchhhhhccccccccccccccc Confidence 433 33333333322 111 22456899999999999887643211 11223 Q ss_pred cCceeecchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe--CCeeEE Q lcl|NC_019418. 67 RRKMQHLPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD--GDKIRV 144 (527) Q Consensus 67 ~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d--~~~~~i 144 (527) ..+++++||++.||+..++|+||+|++++++++..++.|+.+++ |+|.....+++..+++.|.+|+++|+| .+++++ T Consensus 63 ~~~ki~~n~~~~Ivd~~~~yl~G~p~~~~~~~~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~g~~~~ 141 (471) T protein:vir:10 63 ADNRISHNWHQLLLDQKKAYALTYPPTFDVDDKKVNDMIVDVLG-DDYERISKQLCVNAGNAGIAWLHVWKDASDNSFRY 141 (471) T ss_pred ccceeccchhHHHHHhhhhhhcccCceeccCChHHHHHHHHHHh-cCHHHHHHHHHHHHhhCCeEEEEEEeeCCCCeeEE Confidence 34578999999999999999999999999999999999999985 789999999999999999999999998 378999 Q ss_pred EEEcCCceEEEEEcCC-ceEEEEE-EEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCcccc Q lcl|NC_019418. 145 AFIQAPVFLPLQSNTQ-DVSSAAI-LTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQL 222 (527) Q Consensus 145 ~~v~a~~~~P~~~d~~-~~~~~a~-~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~l 222 (527) .+++|.+++|++.++. +...+++ ++..........++ .+|.+... .+.+ |...... + T Consensus 142 ~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~--~~~vy~~~----------------~~~~--y~~~~~~-~ 200 (471) T protein:vir:10 142 ACVDSKEVIPIYSKSLDKKSIGVLRVYSSIDETDGKNYT--VYEYWNDK----------------ECSF--YRHEKEK-P 200 (471) T ss_pred EEEcccceEEEEcCCCCCceEEEEEEEEeeccCCCceeE--EEEEEeCC----------------cEEE--EEecCCc-c Confidence 9999999999976553 3444444 33222222222222 34544211 0110 1111111 0 Q ss_pred Ccee------ecccc--cCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHc Q lcl|NC_019418. 223 GERV------NLSEL--YPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKM 294 (527) Q Consensus 223 G~~v------~l~~~--~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~ 294 (527) ...+ +.... ..-.......+++++.++++|++ + ..|.|+|+.+++|+|++|.++|++++.++. T Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n----~-----~~~~sd~e~v~~liDa~d~~~S~~~~~~~~ 271 (471) T protein:vir:10 201 LEELETFQAISLIDTMNGDRSSDNSFKHDFGLVPFIPFKN----N-----EIETNDLKPIKDLVDVYDKVFSGFVNDTDD 271 (471) T ss_pred cccccccccccccccccccccccccccCCCCceeEEEecc----C-----CCCCCchHHHHHHHHHHHHHHHHHHHHHHH Confidence 0111 00000 00112222345777778888865 2 348999999999999999999999999987 Q ss_pred CcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_019418. 295 GQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSS 374 (527) Q Consensus 295 ~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~ 374 (527) ....+.+...+- ....+ +. ....... .+...-+.+.+..+++++++++++.+.+...++.+.+.|...++.+. T Consensus 272 ~~~~~lv~~g~~---~~~~~-~~--~~~~~~~-~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~ 344 (471) T protein:vir:10 272 VQEVIFVLTNYG---GQDKQ-EF--LEDLKRY-KMIKMDNDGMGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVN 344 (471) T ss_pred hhCceeeeecCC---ccccc-hh--HHHhhcC-CeEEecCCCCccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcC Confidence 555565533321 00000 00 0011111 11111122334456799999999999999999999888877766543 Q ss_pred ccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHH Q lcl|NC_019418. 375 GMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAE 454 (527) Q Consensus 375 ~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~ 454 (527) . ++...|..|+++++++++.+.++++.+++.|+++|+++++.|+.+.+. .+..+++|+|++.+|.|..+. T Consensus 345 ~--~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~--------~d~~~i~i~f~~~~p~n~~e~ 414 (471) T protein:vir:10 345 P--ETDKLGNSSGVALKFLYSLLELKAGNMETQFRSGYATLVKMILKHLGL--------SDKLKIKQTWTRNSINNDTEM 414 (471) T ss_pred C--CcccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--------CCCceeEEEeCCCCCCCHHHH Confidence 2 333346678999999999999999999999999999999999876542 234578999999999999999 Q ss_pred HHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCC Q lcl|NC_019418. 455 LDYWMKMVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVG 515 (527) Q Consensus 455 ~~~~~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~ 515 (527) ++.++++ +|+||.+|++..+++++| +++|++||++|+....+..+++++...+++.. T Consensus 415 ~~~~~kl--~g~iS~et~~~~~p~v~D--~~~E~eri~~E~~~~~~~~~~~~~~~~~~e~~ 471 (471) T protein:vir:10 415 AQVVSTL--ATITSRENVAKSNPIVED--WQDELRLQKAEQEGRSEKLYDMEEVEHESEVE 471 (471) T ss_pred HHHHHHH--hccCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhcccccCCCCCccccC Confidence 9988886 599999999988866664 67899999999877666555554444333332 No 39 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=100.00 E-value=4.5e-53 Score=307.55 Aligned_cols=434 Identities=12% Similarity=0.110 Sum_probs=297.5 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) |. .+.|.+++++.. ....|+.+..+||.|+++.+...... ..+..+++++|+|+.|| T Consensus 17 ~~-~~~i~~~i~~~~---------------------~~~~r~~~~~~Yy~g~~~i~~~~~~~-~~~~~~ki~~n~~~~iv 73 (452) T protein:vir:36 17 IT-VEVVTKFMEKHK---------------------LEVARYEYLKNMYLGIMAIDDEPAKD-SWKPDNRLAVNFTKYIV 73 (452) T ss_pred CC-HHHHHHHHHHHH---------------------HHHHHHHHHHHHhccccccccCcccc-ccCccceeecchHHHHH Confidence 32 234444443311 23457888899999999877654433 33345678899999999 Q ss_pred HHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe-CCeeEEEEEcCCceEEEEEcC Q lcl|NC_019418. 81 KKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD-GDKIRVAFIQAPVFLPLQSNT 159 (527) Q Consensus 81 ~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d-~~~~~i~~v~a~~~~P~~~d~ 159 (527) +..|+|||++|++++++++..++.|++++++|+|...+.++++.++++|.+|+++|+| .+++++.+++|.+++|++.+. T Consensus 74 d~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~ 153 (452) T protein:vir:36 74 DTFTGYFNGIPVKKSHSDKEILTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDEDTQTNVVYNSPENMFMVYDDT 153 (452) T ss_pred HHHhhhhcccCceeecCChhHHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEEEcCC Confidence 9999999999999999999999999999999999999999999999999999999997 478999999999999997654 Q ss_pred C-ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccc Q lcl|NC_019418. 160 Q-DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPV 238 (527) Q Consensus 160 ~-~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~ 238 (527) . +...+++ +++...+ +..+ +|.+.. ...+. |.... . |. .+ .. T Consensus 154 ~~~~~~~~i--~~~~~~~-~~~~---~~vyt~---------------~~i~~---~~~~~-~--~~--~~--------~~ 196 (452) T protein:vir:36 154 VKQEPLFAV--RYGVDED-KKLQ---GEVYTL---------------LETIK---ISGEN-D--EI--SF--------GE 196 (452) T ss_pred CCCceEEEE--EEEEecC-ceEE---EEEEec---------------CeEEE---EEEcC-C--ce--EE--------ec Confidence 3 3343333 2222222 2222 333321 11111 11111 0 00 00 01 Q ss_pred eeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCcccc Q lcl|NC_019418. 239 TPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGNIA 318 (527) Q Consensus 239 ~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~~~ 318 (527) ..-++++++++++|++ .+.|+|+|+++++++|++|.++|++++.++.....+.+.... ....... T Consensus 197 ~~~~~~g~iPvv~~~n---------~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g~-----~~~~~~~- 261 (452) T protein:vir:36 197 GTYNPYPDLPVVEFYF---------NEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGA-----AVEEEDL- 261 (452) T ss_pred ceeccCCcccEEEecC---------CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecC-----CcCchhh- Confidence 1124566777777754 245999999999999999999999999997655555552221 1111100 Q ss_pred cccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHH Q lcl|NC_019418. 319 FKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTY 398 (527) Q Consensus 319 ~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~ 398 (527) ......+-+.. ...+.+....++++++++..+.+...++.+.+.|...++.+. +++...|..||+++++.++.+. T Consensus 262 --~~~~~~~~~~~-~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~~gn~Sg~Al~~~~~~l~ 336 (452) T protein:vir:36 262 --KNIRSNRVINY-YADGEGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVAN--ISDESFGSSSGVSLAYKLQAMS 336 (452) T ss_pred --hhhhhcceEEe-cCCCCccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCccc--cCcccccCCcHHHHHHHHHHHH Confidence 00111111111 111233345688899999999999999988888877776543 4445556778999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCC Q lcl|NC_019418. 399 QMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLG 478 (527) Q Consensus 399 ~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~ 478 (527) ++++.+++.|+.+|+++++.|+.+....+ ......+|+|.|++++|.|..+.++.++++ +|+||.+|++..+++ T Consensus 337 ~k~~~~~~~~~~~l~~~~~li~~~~~~~~----~~~~~~~i~i~f~~~~p~d~~~~a~~~~k~--~g~iS~et~~~~~~~ 410 (452) T protein:vir:36 337 NLALSFQRKFQSSLNSRYKLFCELSTNVS----NKDSWKDIEYTFTRNEPKDIKEQAETANIL--MGITSQETALSVISV 410 (452) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccC----CccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCC Confidence 99999999999999999999998876422 233556789999999999999999888876 599999999876644 Q ss_pred CCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 479 ITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 479 ~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) ++| +++|++||++|++....... +...+..+..++.+.+++| T Consensus 411 ~~d--~~~E~~ri~~E~~~~~~~~~----~~~~~~~~~~~~~~~~~~e 452 (452) T protein:vir:36 411 IPD--VQAEMEKIKKEEASTAIFDK----DKQPSEKGTDTVVSETNEE 452 (452) T ss_pred CCC--HHHHHHHHHHHHHHHHHHHh----hccCCCCcccccCccccCC Confidence 443 67899999998865432211 1222223333344444444 No 40 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=100.00 E-value=1e-52 Score=305.64 Aligned_cols=459 Identities=16% Similarity=0.190 Sum_probs=299.7 Q ss_pred CChHHHHHHHHHHHHHHh--hcccchhhhccC-----cccc-----------CHHHHHHHHHHHHHhcCCCcccccccc- Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNM--TTSHLSSILDHP-----KVAV-----------TQSEFRRIQHNLAYYQSKFDDIEYTNT- 61 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~--~~~~~~~~~~~~-----~i~~-----------~~~~~~~i~~~~~~y~g~~~~l~~~~~- 61 (527) .-+++.+-.-+-|+|.-+ |++.-.+.+.+. +..+ -.+...|+.++++||.|+|+.+..... T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~~~~ 83 (492) T protein:vir:94 4 IQLISQVAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPV 83 (492) T ss_pred HHHHHHHHHHHhcCCceeecCccchhhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Confidence 555555554444444332 333322222110 0011 123456788889999999987654322 Q ss_pred cC-----ccccCceeecchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEE Q lcl|NC_019418. 62 DG-----DRKRRKMQHLPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPY 136 (527) Q Consensus 62 ~~-----~~~~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~ 136 (527) .+ ..+..+++++|+|+.||+..|+|+|+.|++++++++..++.|+.+++ |+|...+.+++..|+++|.+|+.+| T Consensus 84 ~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~G~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~a~~~G~a~~~v~ 162 (492) T protein:vir:94 84 DATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPY 162 (492) T ss_pred cccccccccccccccccchHHHHHHHHHhhhcccCceeccCchHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEE Confidence 11 12234568899999999999999999999999999999999999985 6899999999999999999999999 Q ss_pred EeC-CeeEEEEEcCCceEEEEEcC-CceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEE Q lcl|NC_019418. 137 VDG-DKIRVAFIQAPVFLPLQSNT-QDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELY 214 (527) Q Consensus 137 ~d~-~~~~i~~v~a~~~~P~~~d~-~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly 214 (527) +|. +++++.+++|.+++|++.++ .+...+++ +++..+... .+|.|.. ..|.+-.+ T Consensus 163 ~d~dg~~~~~~~~p~~~~~v~d~~~~~~~~a~i--r~~~~~~~~-----~~~~y~~----------------~~v~~~~~ 219 (492) T protein:vir:94 163 LDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFI--RMYKLENET-----KVEYWDK----------------VTVNYYVY 219 (492) T ss_pred ecCCCceEEEEEcccceEEEEcCCCCCceEEEE--EEEeeccce-----eEEEEec----------------CeEEEEEE Confidence 974 78999999999999997544 34444433 223222221 1222211 11111111 Q ss_pred ecCCccccCceeec-ccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019418. 215 KSTSDSQLGERVNL-SELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIK 293 (527) Q Consensus 215 ~~~~~~~lG~~v~l-~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~ 293 (527) .. .. .++. ............-++++.+|+++|++ +++|+|+|+++++++|++|.++|++++.++ T Consensus 220 ~~--~~----~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n---------n~~~~sd~e~v~~liDa~d~~~S~~~~~~~ 284 (492) T protein:vir:94 220 EN--GS----LIPDYSNNLENSKTHFSTGSWGKIPFIPFKN---------NDLEISDIFMYKTLIDAYNRRLSDLSNTFK 284 (492) T ss_pred ec--Ce----eeeccccccccccccccccCCCccceEEecC---------CCCCCCchHHHHHHHHHHHHHHHHHHHHHH Confidence 10 00 0000 00011111222335677777888864 246999999999999999999999999998 Q ss_pred cCcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcC-- Q lcl|NC_019418. 294 MGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIG-- 371 (527) Q Consensus 294 ~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g-- 371 (527) .....+.+-..+- +... ..+..+-..+..+..+ +++.++++++++..+.+...++.+.+.|...++ T Consensus 285 ~~~~p~lv~~g~~-----~~~~-----~~~~~~~~~~~~~~~~--~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p 352 (492) T protein:vir:94 285 DSNELTYVLKNYD-----DQEL-----PEFKRLLRYYGAIKVS--DNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAV 352 (492) T ss_pred HhcCceeeeecCC-----cccc-----hhhHHHHhhccceecC--CCCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCc Confidence 7555555532221 1100 0011111112223333 234578888888888877777777766655554 Q ss_pred -CCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCC Q lcl|NC_019418. 372 -VSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTD 450 (527) Q Consensus 372 -~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d 450 (527) ++.+.|+ |..||.++++..+.+..+++.+++.|+.+|+++++.|+.+.+. ..+..++.|+|++++|.| T Consensus 353 ~~~~~~~~----~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~-------~~~~~~i~v~f~~~~p~~ 421 (492) T protein:vir:94 353 DFSSDKFG----SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI-------KGEHKDVDISFNYNKVAN 421 (492) T ss_pred CCCccccc----cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-------CcccceeeEEecCCCCCC Confidence 3444443 4567888999999999999999999999999999999876432 234567899999999999 Q ss_pred HHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 451 RHAELDYWMKMVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 451 ~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) ..+.++...+++ |++|.+|++..+++++| +++|++||++|+....+..+.+++.+.++.. ++.++.+.|.| T Consensus 422 ~~e~~~~~~kl~--giiS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~-~~~~~~~~e~e 492 (492) T protein:vir:94 422 TELQVQTAQQSM--GIVSHETVLENHPFVED--LQAELERIEQEQMEYNKQLPNLDDGGADSAQ-QQERSNNKESE 492 (492) T ss_pred HHHHHHHHHHHh--ccCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhccccccccCCCCc-cccCCccccCC Confidence 999999888874 99999999887755554 6789999998876655555555443333332 23333333333 No 41 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=100.00 E-value=1.9e-52 Score=304.19 Aligned_cols=463 Identities=15% Similarity=0.171 Sum_probs=302.9 Q ss_pred CChHHHHHHHHHHHHHHh--hcccchhhhccC-----cccc-----------CHHHHHHHHHHHHHhcCCCcccccccc- Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNM--TTSHLSSILDHP-----KVAV-----------TQSEFRRIQHNLAYYQSKFDDIEYTNT- 61 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~--~~~~~~~~~~~~-----~i~~-----------~~~~~~~i~~~~~~y~g~~~~l~~~~~- 61 (527) .-+++.+-.-+-|+|.-+ +++...+.+.+. +..+ -.+...|+.++++||.|+|+.+..... T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~ 83 (492) T protein:vir:97 4 IQLISQVAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPV 83 (492) T ss_pred HHHHHHHHHHHhcCCceeeccchhhhhHhhhcccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccc Confidence 555655555554544332 333333332211 1111 112456777788999999987654322 Q ss_pred c-----CccccCceeecchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEE Q lcl|NC_019418. 62 D-----GDRKRRKMQHLPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPY 136 (527) Q Consensus 62 ~-----~~~~~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~ 136 (527) . ...+.++++++|+|+.||+..++|+|+.|++++++++...+.|+++++ |+|...+.+++..++++|.+|+.+| T Consensus 84 ~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~a~~~v~ 162 (492) T protein:vir:97 84 DATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPY 162 (492) T ss_pred cccccccccccccccccchHHHHHHHHhhhhcccCceeccCchHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCeEEEEEE Confidence 1 122334578899999999999999999999999999999999999985 6899999999999999999999999 Q ss_pred Ee-CCeeEEEEEcCCceEEEEEcC-CceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEE Q lcl|NC_019418. 137 VD-GDKIRVAFIQAPVFLPLQSNT-QDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELY 214 (527) Q Consensus 137 ~d-~~~~~i~~v~a~~~~P~~~d~-~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly 214 (527) .| .+++++.+++|.+++|++.++ .+...+++ +.+..+... + +|+|. .+.|.+-.+ T Consensus 163 ~d~dg~~~~~~~~p~~~~~i~d~~~~~~~~~~v--r~~~~~~~~--~---~~~y~----------------~~~v~~~~~ 219 (492) T protein:vir:97 163 LDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFI--RMYKLENET--K---VEYWD----------------KVTVNYYVY 219 (492) T ss_pred ecCCCceEEEEEcccceEEEEcCCCCCceEEEE--EEEeeccce--e---EEEEe----------------cCeEEEEEE Confidence 97 468999999999999997654 34444443 233322221 1 23332 112221111 Q ss_pred ecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHc Q lcl|NC_019418. 215 KSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKM 294 (527) Q Consensus 215 ~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~ 294 (527) .. ...+.. .............-++++.+|+++|++ ++.|+|+|+++++++|++|.++|++++.++. T Consensus 220 ~~--~~~~~~---~~~~~~~~~~~~~~~~~g~vPvv~~~n---------n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~ 285 (492) T protein:vir:97 220 EN--GSLIPD---YSNNLENSKTHFSTGSWGKIPFIPFKN---------NDLEISDIFMYKTLIDAYNRRLSDLSNTFKD 285 (492) T ss_pred ec--Ceeeec---ccccccccccccccCCCCCcceEEecC---------CCCCCCchHhHHHHHHHHHHHHHHHHHHHHH Confidence 11 010000 000011111222335677788888864 2469999999999999999999999999987 Q ss_pred CcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_019418. 295 GQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSS 374 (527) Q Consensus 295 ~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~ 374 (527) ...++.+...+-. .+. + .+..+-..+..+..+ +++.++++++++.++.+...++.+.+.|...++.+. T Consensus 286 ~~~~~l~~~g~~~--~~~--~------~~~~~~~~~~~~~~~--~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~ 353 (492) T protein:vir:97 286 SNELTYVLKNYDD--QEL--P------EFKRLLRYYGAIKVS--DNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVD 353 (492) T ss_pred hccceeeeecCCc--ccc--h------hHHHHHhhccceecC--CCCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCCC Confidence 6666666333210 000 0 011111112223333 234578888999999888888888777766655443 Q ss_pred ccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHH Q lcl|NC_019418. 375 GMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAE 454 (527) Q Consensus 375 ~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~ 454 (527) -+++ ..+|..||.++++.++.+..+++.+++.|+++|+++++.|+.+.+. ..+..+++|.|++.+|.|..+. T Consensus 354 ~~~~-~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~-------~~~~~~i~v~f~~~~p~~~~e~ 425 (492) T protein:vir:97 354 FSSD-KFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI-------KGEHKDVDISFNYNKVANTELQ 425 (492) T ss_pred CCcc-ccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-------CcccceeeEEecCCCCCCHHHH Confidence 2221 1234567889999999999999999999999999999998876532 2356779999999999999999 Q ss_pred HHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 455 LDYWMKMVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 455 ~~~~~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) ++..+++ +|++|.+|++.+++.++| +++|++||++|+....+........+.++ ..+..++...+.| T Consensus 426 a~~~~kl--~G~iS~et~l~~l~~v~d--~~~Eleri~~E~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~e 492 (492) T protein:vir:97 426 VQTAQQS--MGIVSHETVLENHPFVED--LQAELERIEQEQTEYNKQLPNLDDGGADS-AQQQERSNNKESE 492 (492) T ss_pred HHHHHHH--hccCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHhhhccccCCCCC-CcccccccccccC Confidence 9998887 599999999987766654 56899999888765544444443322222 2222222223333 No 42 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=100.00 E-value=3.6e-52 Score=302.65 Aligned_cols=441 Identities=11% Similarity=0.121 Sum_probs=289.7 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccc---c---CccccCceeecc Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNT---D---GDRKRRKMQHLP 74 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~---~---~~~~~~~~~~ln 74 (527) --..+.|++++.+ -.+...++.+.++||.|+|+.+..... . .+.+.++++++| T Consensus 26 ~~~~~~i~~~i~~---------------------~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n 84 (474) T protein:vir:96 26 ETQEEMIIRLINN---------------------HKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTN 84 (474) T ss_pred cchHHHHHHHHHH---------------------HHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccc Confidence 1111112222211 123456788899999999986643221 1 122335578899 Q ss_pred hHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe-CCeeEEEEEcCCceE Q lcl|NC_019418. 75 IARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD-GDKIRVAFIQAPVFL 153 (527) Q Consensus 75 l~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d-~~~~~i~~v~a~~~~ 153 (527) +|+.||+..|+||||+|++++++++..++.|+.+++ |+|...+.+++..++++|.+|+++|+| .+.+++.+++|.++| T Consensus 85 ~~k~Iv~~~~~yl~g~p~~~~~~~~~~~~~l~~~~~-n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~~~ 163 (474) T protein:vir:96 85 FHQNLVDQKVSYVAGKPVTYAHDDDKVLDVIHQVLD-TRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRVPAEQAI 163 (474) T ss_pred hHHHHHHhhhhhhcccCceeccCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceE Confidence 999999999999999999999999999999999985 689999999999999999999999997 568999999999999 Q ss_pred EEEEcCC-ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeeccccc Q lcl|NC_019418. 154 PLQSNTQ-DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELY 232 (527) Q Consensus 154 P~~~d~~-~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~ 232 (527) |++.++. +...++ + +.+..... .+ +|.|. ...|.+-.+.+.. .... ....+.+ T Consensus 164 ~v~d~~~~~~~~a~-i-r~~~~~~~--~~---~~vy~----------------~~~i~~~~~~~~~-~~~~--~~~~~~~ 217 (474) T protein:vir:96 164 PIWTDKEREQLNAF-I-RIFTFNGE--TK---VEYWT----------------AETVTYYVYENGG-LIPD--FYYGDEH 217 (474) T ss_pred EEEcCCCCCceEEE-E-EEEeecCe--eE---EEEEe----------------CCeEEEEEEcCCc-eeec--ccccccc Confidence 9976543 333333 3 22322221 12 34432 1122211111100 0000 0000001 Q ss_pred CCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCC Q lcl|NC_019418. 233 PDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQD 312 (527) Q Consensus 233 ~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~ 312 (527) ......-+++.++++++|++ + +.|.|+|+++++++|++|.++|++++.++.....+++...+- ... T Consensus 218 --~~~~~~~~~~~~vPvv~~~n----n-----~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~---~~~ 283 (474) T protein:vir:96 218 --IQTHFSTGSWERVPFIAFKN----N-----PEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYE---GED 283 (474) T ss_pred --ccCcccccCCCccceEEecC----C-----CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCC---ccc Confidence 11112224667777777754 2 468999999999999999999999999987665565533321 111 Q ss_pred CCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHH Q lcl|NC_019418. 313 NQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVS 392 (527) Q Consensus 313 ~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s 392 (527) .++ +..+-..+..+..+ +.+.++++++++..+.+...++.+.+.|...++.+.-++. ..+|..||.++++ T Consensus 284 ~~~-------~~~~~~~~~~i~~~--~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~n~Sg~Alk~ 353 (474) T protein:vir:96 284 LSE-------FMEGLKYYKAINVS--SDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTD-KFGSATSGIALKF 353 (474) T ss_pred ccc-------hhhhhhccceeecc--CCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcccc-ccccccHHHHHHH Confidence 010 11111112223332 2346899999999999999999999888777765432221 2235678999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHH Q lcl|NC_019418. 393 ENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRG 472 (527) Q Consensus 393 ~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~ 472 (527) +++.+.++++.+++.|+++|+++++.|+.+.. ...+..+|+|+|.+++|.|..+.++.. +.+|+||.+|+ T Consensus 354 ~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g-------~~~d~~~i~i~f~~~~p~~~~e~a~~~---~~~giiS~et~ 423 (474) T protein:vir:96 354 LYTNLNLKANKLKNKANVALQELMQFILDFNK-------IKLDAKEIEITFNFNVMVNDLEQSQIG---AQSQYLSKETL 423 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CCcccceeeEEecCCCccCHHHHHHHH---HHcCCCChHHH Confidence 99999999999999999999999999987642 234567799999999999988777654 45799999999 Q ss_pred HHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 473 IAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 473 i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) +..++.++| +++|++||++|+.........+.+.+ .+...++.+..+++.| T Consensus 424 ~~~lp~v~D--~~~E~eri~~E~~~~~~~~~~~~~~~-~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:96 424 VRHHPWVDD--PKAELERLDEEQLELNKQLPNLDDGG-ADGAQQQQQSENNQSK 474 (474) T ss_pred HHhCCCCCC--HHHHHHHHHHHHHHHHhhcccccccc-CCCCCCcCCCCccccC Confidence 977755544 67889999988765444433333222 2222222222222333 No 43 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=100.00 E-value=3.6e-52 Score=302.65 Aligned_cols=441 Identities=11% Similarity=0.121 Sum_probs=289.7 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccc---c---CccccCceeecc Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNT---D---GDRKRRKMQHLP 74 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~---~---~~~~~~~~~~ln 74 (527) --..+.|++++.+ -.+...++.+.++||.|+|+.+..... . .+.+.++++++| T Consensus 26 ~~~~~~i~~~i~~---------------------~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n 84 (474) T protein:vir:95 26 ETQEEMIIRLINN---------------------HKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTN 84 (474) T ss_pred cchHHHHHHHHHH---------------------HHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccc Confidence 1111112222211 123456788899999999986643221 1 122335578899 Q ss_pred hHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe-CCeeEEEEEcCCceE Q lcl|NC_019418. 75 IARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD-GDKIRVAFIQAPVFL 153 (527) Q Consensus 75 l~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d-~~~~~i~~v~a~~~~ 153 (527) +|+.||+..|+||||+|++++++++..++.|+.+++ |+|...+.+++..++++|.+|+++|+| .+.+++.+++|.++| T Consensus 85 ~~k~Iv~~~~~yl~g~p~~~~~~~~~~~~~l~~~~~-n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~~~ 163 (474) T protein:vir:95 85 FHQNLVDQKVSYVAGKPVTYAHDDDKVLDVIHQVLD-TRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRVPAEQAI 163 (474) T ss_pred hHHHHHHhhhhhhcccCceeccCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceE Confidence 999999999999999999999999999999999985 689999999999999999999999997 568999999999999 Q ss_pred EEEEcCC-ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeeccccc Q lcl|NC_019418. 154 PLQSNTQ-DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELY 232 (527) Q Consensus 154 P~~~d~~-~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~ 232 (527) |++.++. +...++ + +.+..... .+ +|.|. ...|.+-.+.+.. .... ....+.+ T Consensus 164 ~v~d~~~~~~~~a~-i-r~~~~~~~--~~---~~vy~----------------~~~i~~~~~~~~~-~~~~--~~~~~~~ 217 (474) T protein:vir:95 164 PIWTDKEREQLNAF-I-RIFTFNGE--TK---VEYWT----------------AETVTYYVYENGG-LIPD--FYYGDEH 217 (474) T ss_pred EEEcCCCCCceEEE-E-EEEeecCe--eE---EEEEe----------------CCeEEEEEEcCCc-eeec--ccccccc Confidence 9976543 333333 3 22322221 12 34432 1122211111100 0000 0000001 Q ss_pred CCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCC Q lcl|NC_019418. 233 PDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQD 312 (527) Q Consensus 233 ~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~ 312 (527) ......-+++.++++++|++ + +.|.|+|+++++++|++|.++|++++.++.....+++...+- ... T Consensus 218 --~~~~~~~~~~~~vPvv~~~n----n-----~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~---~~~ 283 (474) T protein:vir:95 218 --IQTHFSTGSWERVPFIAFKN----N-----PEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYE---GED 283 (474) T ss_pred --ccCcccccCCCccceEEecC----C-----CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCC---ccc Confidence 11112224667777777754 2 468999999999999999999999999987665565533321 111 Q ss_pred CCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHH Q lcl|NC_019418. 313 NQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVS 392 (527) Q Consensus 313 ~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s 392 (527) .++ +..+-..+..+..+ +.+.++++++++..+.+...++.+.+.|...++.+.-++. ..+|..||.++++ T Consensus 284 ~~~-------~~~~~~~~~~i~~~--~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~n~Sg~Alk~ 353 (474) T protein:vir:95 284 LSE-------FMEGLKYYKAINVS--SDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTD-KFGSATSGIALKF 353 (474) T ss_pred ccc-------hhhhhhccceeecc--CCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcccc-ccccccHHHHHHH Confidence 010 11111112223332 2346899999999999999999999888777765432221 2235678999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHH Q lcl|NC_019418. 393 ENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRG 472 (527) Q Consensus 393 ~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~ 472 (527) +++.+.++++.+++.|+++|+++++.|+.+.. ...+..+|+|+|.+++|.|..+.++.. +.+|+||.+|+ T Consensus 354 ~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g-------~~~d~~~i~i~f~~~~p~~~~e~a~~~---~~~giiS~et~ 423 (474) T protein:vir:95 354 LYTNLNLKANKLKNKANVALQELMQFILDFNK-------IKLDAKEIEITFNFNVMVNDLEQSQIG---AQSQYLSKETL 423 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CCcccceeeEEecCCCccCHHHHHHHH---HHcCCCChHHH Confidence 99999999999999999999999999987642 234567799999999999988777654 45799999999 Q ss_pred HHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 473 IAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 473 i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) +..++.++| +++|++||++|+.........+.+.+ .+...++.+..+++.| T Consensus 424 ~~~lp~v~D--~~~E~eri~~E~~~~~~~~~~~~~~~-~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:95 424 VRHHPWVDD--PKAELERLDEEQLELNKQLPNLDDGG-ADGAQQQQQSENNQSK 474 (474) T ss_pred HHhCCCCCC--HHHHHHHHHHHHHHHHhhcccccccc-CCCCCCcCCCCccccC Confidence 977755544 67889999988765444433333222 2222222222222333 No 44 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=100.00 E-value=5e-52 Score=301.84 Aligned_cols=430 Identities=10% Similarity=0.030 Sum_probs=294.3 Q ss_pred HHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccc-cCccccCceeecchHHHHHHHHh Q lcl|NC_019418. 6 KVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNT-DGDRKRRKMQHLPIARTAAKKIA 84 (527) Q Consensus 6 ~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~-~~~~~~~~~~~lnl~~~i~~~~A 84 (527) .|. ....+...|+++..+||.|+++.+..+.. ....+..+++++|+|+.||+..| T Consensus 1 ~~~------------------------~~~~~~~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~ 56 (440) T protein:vir:95 1 MLA------------------------AFLGSQKQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFAT 56 (440) T ss_pred Chh------------------------hHHHHHHHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhh Confidence 011 12334567899999999999987643322 23334456789999999999999 Q ss_pred hhhhcccceEeeCCH---HHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEEEEEcCCceEEEEEcCC Q lcl|NC_019418. 85 SLVYNEQAEISAEDE---TLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRVAFIQAPVFLPLQSNTQ 160 (527) Q Consensus 85 ~ll~~e~~~i~~~d~---~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i~~v~a~~~~P~~~d~~ 160 (527) +|||++|++++++++ ...+.|++++.+|+|...+.+++..|+++|.+|+++|+|. +++++.+++|.+++|++.++. T Consensus 57 ~~l~g~~~~~~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~i~~~~p~~~~~~~d~~~ 136 (440) T protein:vir:95 57 GYVIGNPVSIGVMEGGSADQLSTIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDKDKVDRVVLISPLEMFVIRDLTV 136 (440) T ss_pred hheeccCceEeeCCCccHHHHHHHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEEcCCC Confidence 999999999987654 4556889999999999999999999999999999999975 679999999999999976554 Q ss_pred -ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccce Q lcl|NC_019418. 161 -DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVT 239 (527) Q Consensus 161 -~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~ 239 (527) +...+++.+. ...+. .+ .+.+. ....+++...... .+.. .+ ... T Consensus 137 ~~~~~~~i~~~--~~~~~--~~---~~vyt---------------~~~~~~~~~~~~~----~~~~-~~--------~~~ 181 (440) T protein:vir:95 137 EQNIIAAVHLP--IYADK--VN---MTVYT---------------KDKVITYKPYSNN----SVRL-VV--------DDV 181 (440) T ss_pred CCceEEEEEEE--EecCc--eE---EEEEe---------------CCeEEEEEEecCC----ccce-ee--------cce Confidence 3444443322 22221 11 12221 1222233322111 1110 00 111 Q ss_pred eecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcC-CCCCCCcccc Q lcl|NC_019418. 240 PIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQL-KVQDNQGNIA 318 (527) Q Consensus 240 ~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~-~~~~~~~~~~ 318 (527) .-+++.+.|+++|++ + ++|+|+|+.+++++|+||.++|+++++++.....+.+...+... ..+++.+ T Consensus 182 ~~~~~g~vPvv~~~n----~-----~~g~sd~e~v~~lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~--- 249 (440) T protein:vir:95 182 KKHSYNDVPVVEWWN----N-----RFRMGDYESEISLIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKLSPEDA--- 249 (440) T ss_pred eeccCceeeEEEeeC----C-----CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcceeeeecccccCCCCccch--- Confidence 224667777787764 2 35999999999999999999999999998866666553332111 1111111 Q ss_pred cccccccccceeeecc---CCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHH Q lcl|NC_019418. 319 FKRRFDVEQNVYMQVG---AGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENS 395 (527) Q Consensus 319 ~~~~~d~~~~~~~~~~---~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~ 395 (527) ..+.....++.... ...+..+.++++++++..+.+...++.+.+.|...++.+..+++.- +|..||.++++.++ T Consensus 250 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~ 326 (440) T protein:vir:95 250 --AKMKDANMLFLKTGISTTGQQTTADASYIYKQYDVNGTEAYKNRLANDIHRFSRIPNLDDDRF-NSTSSGIALLYKMI 326 (440) T ss_pred --hhhhhccceecccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc-cccchHHHHHHHHH Confidence 11111222232211 1123345689999999999999999999999988887765444332 35678999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHh Q lcl|NC_019418. 396 DTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAK 475 (527) Q Consensus 396 ~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~ 475 (527) .+.++++++++.|+++|+++++.|+.+... ..+......+++|.|.+.+|.|..+.++.+.++ +|+||.++++.+ T Consensus 327 ~l~~k~~~k~~~~~~~l~~~~~li~~~~~~---~~~~~~~~~~v~i~f~~~~p~~~~~~ad~~~kl--~g~iS~et~~~~ 401 (440) T protein:vir:95 327 GLEQVRKDKETYFTKALRRRYELISNIHKA---INGPVIEANKLTFTFHPNIPQDVWTEIKAYIEA--GGEISQETLMEN 401 (440) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---cCCcccccccceEEeCCCCCCCHHHHHHHHHHH--hccCcHHHHHHh Confidence 999999999999999999999999887653 233445567799999999999999999998886 599999999988 Q ss_pred cCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccc Q lcl|NC_019418. 476 TLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDED 525 (527) Q Consensus 476 ~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (527) ++++++ .+|++||++|+..+........++ .+++++++| T Consensus 402 l~~~d~---~~E~~ri~~E~~~~~~~~~~~~~~--------~~~~~~~~e 440 (440) T protein:vir:95 402 ASFTDY---KTEHSRILKQGGSSDLEIGQIVGD--------ADVGQADTE 440 (440) T ss_pred CCCCCc---HHHHHHHHHHHHHhhhhHHhhccC--------CCCCCcCCC Confidence 866643 357888888876554433222221 112222222 No 45 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=100.00 E-value=4.8e-52 Score=301.91 Aligned_cols=426 Identities=9% Similarity=0.052 Sum_probs=293.3 Q ss_pred HHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHHHHhh Q lcl|NC_019418. 6 KVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIAS 85 (527) Q Consensus 6 ~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~ 85 (527) =-+++|++.... | .....|+.+.++||.|+++.+..+.... .+..+++++|+|+.||+..++ T Consensus 1 l~~~~l~~~i~~-----------~------~~~~~r~~~l~~yy~g~~~il~~~~~~~-~~~~~ki~~n~~~~ivd~~~~ 62 (429) T protein:vir:98 1 MTKDLLSELIQK-----------H------RSFNLSYSAYKQLYEGDHAILQQKQKEQ-YKPDNRLVVNFAKYIVDTFNG 62 (429) T ss_pred CCHHHHHHHHHH-----------H------HHHHHHHHHHHHHhcccccccccccccc-CCCcceeecchHHHHHHHHhh Confidence 022333322211 1 1245788899999999999886554332 334567899999999999999 Q ss_pred hhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEEEEEcCCceEEEEEcCCc-eE Q lcl|NC_019418. 86 LVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRVAFIQAPVFLPLQSNTQD-VS 163 (527) Q Consensus 86 ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i~~v~a~~~~P~~~d~~~-~~ 163 (527) +||++|++++++++..++.|++++++|+|...+.+++..++++|.+|+.+|+|. +++++.+++|.+++|++.+... .. T Consensus 63 ~l~g~~~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~~~~~~p~~~~~v~dd~~~~~~ 142 (429) T protein:vir:98 63 YFIGVPVQTSHENKQVSNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDENAEAGITYLTPLEAFIVYDDSIRQKP 142 (429) T ss_pred hhcccCceeecCChHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCcEEEEEEcccceEEEEeCCCCCce Confidence 999999999999999999999999999999999999999999999999999974 6899999999999999876543 33 Q ss_pred EEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecC Q lcl|NC_019418. 164 SAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQG 243 (527) Q Consensus 164 ~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g 243 (527) .+++ +++...+ ..++ .++.. ...++ .|.... -|. .+. ...-++ T Consensus 143 ~~~i--~~~~~~~--~~~~--~~~~~----------------~~~~~--~~~~~~---~~~--~~~--------~~~~~~ 185 (429) T protein:vir:98 143 LFAV--RYFYNKG--GVLE--GSYSD----------------ASNIT--YFKDGE---KGI--EIG--------ESEPHP 185 (429) T ss_pred EEEE--EEEEecC--ceEE--EEEEe----------------CceEE--EEEecC---Cce--Eec--------cccccc Confidence 3333 3332222 1111 11110 00111 121110 010 000 011245 Q ss_pred CCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCccccccccc Q lcl|NC_019418. 244 LSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGNIAFKRRF 323 (527) Q Consensus 244 ~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~ 323 (527) +.++++++|++ +++|+|+|+++++++|++|.++|++++.++.....+.+-..+ .+.... .... T Consensus 186 ~g~vPvv~~~n---------~~~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~-----~~~~~~---~~~~ 248 (429) T protein:vir:98 186 FDGVPMIEYVE---------NEERQSLLASVVTLINAFNKAISEKANDVEYFADAYLKILGA-----ELDDET---LKSL 248 (429) T ss_pred CCccceEEecC---------CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecC-----CCCcch---hhhH Confidence 66777777764 256999999999999999999999999998766666552211 111110 0110 Q ss_pred ccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHH Q lcl|NC_019418. 324 DVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNS 403 (527) Q Consensus 324 d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~ 403 (527) ...++ ..+..+++....++++++++..+.+...++.+.+.|...++.+. +++...|..||.++++..+.+.++++. T Consensus 249 -~~~~~-~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~--~~~~~~gn~Sg~Al~~~~~~l~~k~~~ 324 (429) T protein:vir:98 249 -RDTRI-INLKDTDAQQLTVEFLQKPDADATQEHLLDRLENLIFRTAMVAN--ISDESFGTASGIALRYRLQAMDNLAKT 324 (429) T ss_pred -hhCce-eeccCCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCccc--cCccccccchHHHHHHHHHHHHHHHHH Confidence 01111 11222333445688999999999998889988888877776542 344445677999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHH Q lcl|NC_019418. 404 IVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLGITEEE 483 (527) Q Consensus 404 ~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~dee 483 (527) +++.|+.+|+++++.|+.+.+.. + ......+++|.|++.+|.|..+.++..+++ +|+||.+|++..+++++| T Consensus 325 ~~~~~~~~l~~~~~li~~~~~~~---~-~~~d~~~i~v~f~~~~p~~~~~~a~~~~kl--~g~is~et~~~~l~~v~d-- 396 (429) T protein:vir:98 325 KERKFMSGMNRRYKLIASYPTSK---I-GPKDWIGIKYKFTRNLPANLLEESQIAGNL--AGIVSEETQVGVLSIVEN-- 396 (429) T ss_pred HHHHHHHHHHHHHHHHHHHhccC---C-CccccccceEEeCCCCCcCHHHHHHHHHHH--hccCchHHHHHhCCCCCC-- Confidence 99999999999999999876432 1 233456789999999999999999988886 699999999877755654 Q ss_pred HHHHHHHHHHhccccccc-ccCCCCCCCCCCCCCCCCCCCccc Q lcl|NC_019418. 484 AEKELAEINGELPPESDA-ELALYGKGQQNTVGNSKDTVDDED 525 (527) Q Consensus 484 a~~el~ri~~E~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 525 (527) +++|++||++|+....+. ..+++++..++ +.| T Consensus 397 ~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~----------~~~ 429 (429) T protein:vir:98 397 PQKEIERKNSDKSTLISRQAGGLNGQNTTT----------ILE 429 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhcCCCCCC----------CCC Confidence 568899999988754322 22333222211 111 No 46 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=100.00 E-value=2.2e-52 Score=303.82 Aligned_cols=446 Identities=10% Similarity=0.042 Sum_probs=291.0 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) |+| ..+|-|...-...+..+.+.++ |.--.....|+.++++||.|+++.+..... ...+..+++++|+|+.|| T Consensus 1 ~~~-~~~~~~~~~~~~~~~~~~i~~~-----i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~-~~~~~~~ki~~n~~~~iv 73 (453) T protein:vir:73 1 MNL-KPIKLMTYSRDEEITDKVVNDF-----MKKHQEEVERYEYLGNMYKGIMEISSQKAK-DSWKPDNRLTNNFAKYIV 73 (453) T ss_pred Ccc-ccceeeeccccccCCHHHHHHH-----HHHHHHHHHHHHHHHHHhccccchhcCCCC-CccCccceeecchHHHHH Confidence 433 1111111110111111111111 011123457888899999999987654333 233345678899999999 Q ss_pred HHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe-CCeeEEEEEcCCceEEEEEcC Q lcl|NC_019418. 81 KKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD-GDKIRVAFIQAPVFLPLQSNT 159 (527) Q Consensus 81 ~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d-~~~~~i~~v~a~~~~P~~~d~ 159 (527) +..|+++|++|++++++++..++.|++++++|+|...+.++++.++++|.+|+++|+| ++.+++.+++|.+++|++.++ T Consensus 74 d~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~dd~ 153 (453) T protein:vir:73 74 DTFVGYFNGIPIKKTHDDKSVLEAMQLFDNLNDMEDEESELAKIACVYGRAYELMYQNESTESEVIYCSPLNVFMVYDDS 153 (453) T ss_pred HHhhhhhcccCceeecCChHHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEcccceEEEEeCC Confidence 9999999999999999999999999999999999999999999999999999999997 467999999999999998776 Q ss_pred Cce-EEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccc Q lcl|NC_019418. 160 QDV-SSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPV 238 (527) Q Consensus 160 ~~~-~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~ 238 (527) .+. ..+++. ++... ++..+ .+.+.. .+.+. |...... ..+ .. T Consensus 154 ~~~~~~~~i~--~~~~~-~~~~~---~~vyt~---------------~~i~~---~~~~~~~-----~~~--------~~ 196 (453) T protein:vir:73 154 IKQKPLFAVY--YGFDE-EGNLS---GTVYTL---------------LETIS---ITGKAGE-----VKF--------GE 196 (453) T ss_pred CCceeEEEEE--EEEec-CceEE---EEEEeC---------------CeEEE---EEecCCc-----eEE--------cc Confidence 544 333333 22222 22222 222210 11111 2211100 000 00 Q ss_pred eeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCcccc Q lcl|NC_019418. 239 TPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGNIA 318 (527) Q Consensus 239 ~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~~~ 318 (527) ..-+++.++++++|++ .+.|.|+|+++++++|++|.++|++++.++.....+.+-..+ ...+... T Consensus 197 ~~~~~~g~vPvv~~~n---------~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~---~~~~~~~--- 261 (453) T protein:vir:73 197 STYNVYSDLPIVEYNF---------NEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGA---EVDEEDA--- 261 (453) T ss_pred ceeccCCceeEEEecC---------CCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecC---CCCchhh--- Confidence 1124556677777754 357999999999999999999999999997644433331111 0011100 Q ss_pred cccccccccceeeec-----cCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHH Q lcl|NC_019418. 319 FKRRFDVEQNVYMQV-----GAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSE 393 (527) Q Consensus 319 ~~~~~d~~~~~~~~~-----~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~ 393 (527) ..+...+.+.... .........++++++++..+.+...++.+.+.|...++.+ .+++...|..||.++++. T Consensus 262 --~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p--~~~~~~~gn~Sg~Al~~~ 337 (453) T protein:vir:73 262 --KNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAA--NISDENFGNSSGVALAYK 337 (453) T ss_pred --hcccccccccccccccccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCc--ccCcccccCccHHHHHHH Confidence 0111111111100 0111222347888999999999888888888776666544 244455567799999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHH Q lcl|NC_019418. 394 NSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGI 473 (527) Q Consensus 394 ~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i 473 (527) ++.+..+++.+++.|+.+|+++++.|+.+.... +......+++|.|++++|.|..+.++..++++ |++|.+|++ T Consensus 338 ~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~----~~~~~~~~i~v~f~~~~p~~~~~~a~~~~k~~--giis~et~~ 411 (453) T protein:vir:73 338 LQAMSNLALSFQRKFQSALNRRYSLWSSLSTNA----SNKDAWKDIEYTFTRNEPKDIKEQAETANILK--GITSEETAL 411 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----CCccccccceEEeCCCCCCCHHHHHHHHHHHh--ccCcHHHHH Confidence 999999999999999999999999998875432 22345567899999999999999999988875 999999999 Q ss_pred HhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCC Q lcl|NC_019418. 474 AKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNS 517 (527) Q Consensus 474 ~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 517 (527) ..+++++| +++|++||++|+...............+.+-++- T Consensus 412 ~~~~~~~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 453 (453) T protein:vir:73 412 SVISVIPD--VQAEMEKIKKKKLLQLSLTRTSNLVRMKQMRGNL 453 (453) T ss_pred HhCCCCCC--HHHHHHHHHHHHHHHHHHHHhccCCcchhhhcCC Confidence 77765554 6788899988876544333222222222221111 No 47 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=100.00 E-value=2.7e-50 Score=292.35 Aligned_cols=480 Identities=12% Similarity=0.055 Sum_probs=295.3 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc---------CccccCcee Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD---------GDRKRRKMQ 71 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~---------~~~~~~~~~ 71 (527) |-+ ..++.+|.+...+. ..+.+..++.+.++||.|+|+.+.++... ...+..+++ T Consensus 8 ~~~-~~~~~~~~~~i~~~---------------~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki 71 (537) T protein:vir:78 8 KPI-DQLGGLLNTEITTY---------------MASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKI 71 (537) T ss_pred ccH-HHHHHHHHHHHHHH---------------HHHHHHHHHHHHHHHhcccchhhhccccccccccccccccccccccc Confidence 333 55566665433221 13345678899999999999887654321 111234578 Q ss_pred ecchHHHHHHHHhhhhhcccceEeeCCH---HHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEEEEE Q lcl|NC_019418. 72 HLPIARTAAKKIASLVYNEQAEISAEDE---TLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRVAFI 147 (527) Q Consensus 72 ~lnl~~~i~~~~A~ll~~e~~~i~~~d~---~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i~~v 147 (527) ++|+|+.||++.++||||.|++++++++ ..++.|+.+++ ++|...+.+++..++..|.+|.++|+|. +.+++..+ T Consensus 72 ~~nf~k~Ivd~~~~yl~G~Pv~~~~~d~~~~e~~~~l~~~~~-~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~~~~~i 150 (537) T protein:vir:78 72 SHGFFTELVDQLAQYLLSNGVEVKVKDEDNTQLDEILQEYFD-EDFQATIDTLVTNASKKGFEGIFARTTSEGKLKFQTV 150 (537) T ss_pred ccchHHHHHHHHhhhhcccCceeecCcchhHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeeEEEeeecCCCceEEEEE Confidence 9999999999999999999999998754 45677888774 7899999999999999999999999984 67999999 Q ss_pred cCCceEEEEEcCCceEEEEEEEEE-Eee--CCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCc Q lcl|NC_019418. 148 QAPVFLPLQSNTQDVSSAAILTKT-IKT--ENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGE 224 (527) Q Consensus 148 ~a~~~~P~~~d~~~~~~~a~~~~~-~~~--~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~ 224 (527) +|.++||++.+++ ...+++.... ... .+...+..+.+|++.....+.. ....+...-.+.+........+.. T Consensus 151 ~p~~~~pv~d~~~-~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y----~~~~~~~~~~~~~~~~~~~~~i~~ 225 (537) T protein:vir:78 151 DGLTLIPVFDDYG-VLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYY----IQDDEGVSTTYKLDEAYNPNPAPH 225 (537) T ss_pred ccceeEEEEcCCC-CceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEE----EecCCcccccccccccccccccce Confidence 9999999976544 4444433222 211 1222233444666643221110 000000000001000000011100 Q ss_pred eeeccc-----ccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCccee Q lcl|NC_019418. 225 RVNLSE-----LYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRV 299 (527) Q Consensus 225 ~v~l~~-----~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i 299 (527) -....+ ..+.......-+++.+.+|+.|++ | ..|+|+|+++++|||++|.++|++++.++.-...| T Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~n----n-----~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~i 296 (537) T protein:vir:78 226 VLAIEESTDADFEDTDGYQVLGRSYSKFPFQLLYN----N-----KDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAI 296 (537) T ss_pred eeeccccccccccccccccccccCCcceeEEEecc----C-----ccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCce Confidence 000000 000011112225667777777764 2 35899999999999999999999999998755556 Q ss_pred eechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccc Q lcl|NC_019418. 300 IVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTF 379 (527) Q Consensus 300 ~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~ 379 (527) ++-..+ ..... +. +..+-.-+..+.. +++++.++++++++..+.+...++.+.+.|...+ ..+. ++. T Consensus 297 lvi~g~---~~~~~-~~------~~~~l~~~~~i~v-~~d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s-~~~~-~~~ 363 (537) T protein:vir:78 297 YVVKGF---SGDST-DK------LRQNIKAKKMIGV-NGDNAGMEIQTVSIPYEARKAKMDIDVENIYRSG-MGFN-STA 363 (537) T ss_pred eeeecC---CCccc-hh------HHHHHhhcCceee-cCCCCceeEEEecCCHHHHHHHHHHHHHHHHHhc-CCCC-Ccc Confidence 552221 00100 10 1111111222222 2344568999999999999999998888775443 2222 233 Q ss_pred cccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHH Q lcl|NC_019418. 380 DGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWM 459 (527) Q Consensus 380 ~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~ 459 (527) ...|..|+.+++++++.+.++++.+++.|+++|++++++|+.+.+.. +....+...|.|+|.+.+|.|..+.+++.+ T Consensus 364 ~~~gn~SGvAlk~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~---~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~ 440 (537) T protein:vir:78 364 VGDGNVTNVVIKSRYTLLAMKARKMETSLRKVLRWCADMVVSDIALR---GLGEYDSNDICFEIEPHVLANELDIATTRK 440 (537) T ss_pred ccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc---CCcccccceeeEEeccCCCCCHHHHHHHHH Confidence 44567789999999999999999999999999999999999887542 333445678999999999999999999999 Q ss_pred HHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHH-----------hcc-cc------cccccCCCCCCCCCCCCCCCCCC Q lcl|NC_019418. 460 KMVAAGFATQKRGIAKTLGITEEEAEKELAEING-----------ELP-PE------SDAELALYGKGQQNTVGNSKDTV 521 (527) Q Consensus 460 ~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~-----------E~~-~~------~~~~~~~~~~~~~~~~~~~~~~~ 521 (527) +++++|++|.+|++..++.++|.|.+++.++..+ ++. +. .+.........++.++.++.+.+ T Consensus 441 ~l~~~giiS~eT~l~~~p~vdd~e~ek~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 520 (537) T protein:vir:78 441 TEAETEALKIGNIMTVAPRIGDDETLKLIAEELDLDYNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVDPNQPV 520 (537) T ss_pred HHHhcCcchHHHHHHhCCCCCCHHHHHHHHHHHHhhhhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCCCCCccCCC Confidence 9999999999999877754555443322222110 111 10 01111111112222333332222 Q ss_pred CccccC Q lcl|NC_019418. 522 DDEDEA 527 (527) Q Consensus 522 ~~~~~~ 527 (527) ||-.-. T Consensus 521 ~~~~~~ 526 (537) T protein:vir:78 521 ADPNVV 526 (537) T ss_pred CCCCCC Confidence 222222 No 48 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=100.00 E-value=5.1e-50 Score=290.81 Aligned_cols=452 Identities=11% Similarity=0.066 Sum_probs=294.9 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccc---ccc-------------ccCc Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIE---YTN-------------TDGD 64 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~---~~~-------------~~~~ 64 (527) |+|..-+...=-+.. ..+-+.+. |.--.....|+.+..+||.|.+..+. .+. .... T Consensus 1 ~~~~~~~~~~~~~~~---~~e~i~~~-----i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (474) T protein:vir:10 1 MTLYKLIDDIEAQGI---LPKHIEAL-----IESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLD 72 (474) T ss_pred CchHHHHhhccccCC---CHHHHHHH-----HHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccc Confidence 888766654411100 00000000 11111235677778889988654321 110 0111 Q ss_pred cccCceeecchHHHHHHHHhhhhhcccceEeeC-----CHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC Q lcl|NC_019418. 65 RKRRKMQHLPIARTAAKKIASLVYNEQAEISAE-----DETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG 139 (527) Q Consensus 65 ~~~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~-----d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~ 139 (527) .+..+++++|+|+.||+..++|+||.|++++++ ++..+++|++++++|+|.....+++..+++.|.+|.++|.|. T Consensus 73 ~~~~~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~ 152 (474) T protein:vir:10 73 VSVNNKLNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDT 152 (474) T ss_pred cCcccccccchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCC Confidence 223457889999999999999999999999985 456788999999999999999999999999999999999974 Q ss_pred -CeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCC Q lcl|NC_019418. 140 -DKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTS 218 (527) Q Consensus 140 -~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~ 218 (527) +++++.+++|.+++|++.++...+.++.++. ...+..+. .|..+++++. ..+ ..|.... T Consensus 153 ~~~~~~~~i~p~~~~~v~d~~~~~~~~i~~~~-~~~~~~~~-~~~~~~~y~~----------------~~~--~~~~~~~ 212 (474) T protein:vir:10 153 NGDIRIKNIDPYNVIFVGDNILEPTYSLRYFY-EKDDDNGT-DYVYAEFYDN----------------AYY--YVFRGEG 212 (474) T ss_pred CCeeEEEEEcccceEEEEcCCCceEEEEEEEE-EeeCCCce-EEEEEEEEcC----------------ceE--EEEeecC Confidence 6799999999999999766555554443333 33333332 3334555421 111 1132211 Q ss_pred ccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcce Q lcl|NC_019418. 219 DSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRR 298 (527) Q Consensus 219 ~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~ 298 (527) .|. +... ...-+++++++++.|++ ++.|+|+|+++++++|++|.++|++++.++..... T Consensus 213 ---~~~-------~~~~--~~~~~~~g~vPvv~~~n---------~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~ 271 (474) T protein:vir:10 213 ---IDA-------LQEV--GRYEHLFDYNPLFGVPN---------NKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLA 271 (474) T ss_pred ---CCc-------cccc--ccccCCCCccceEEecC---------CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 111 0000 01124567777777754 34699999999999999999999999999864444 Q ss_pred eeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccc Q lcl|NC_019418. 299 VIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFT 378 (527) Q Consensus 299 i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~ 378 (527) +.+-..+ ..+.+. ...... +..+.. .++++.++++++++.++.+...++.+.+.|...++.+..+++ T Consensus 272 ~l~i~g~---~~~~~~-----~~~~~~----~~~i~~-~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~ 338 (474) T protein:vir:10 272 YLVLRGM---GMSEEM-----IQETQK----SGAFEL-FDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSD 338 (474) T ss_pred hhhhccC---CCCchh-----hhhhhh----cceeEe-cCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccc Confidence 4331111 111111 011111 111111 123456899999999999999999988888777665543332 Q ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHH Q lcl|NC_019418. 379 FDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYW 458 (527) Q Consensus 379 ~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~ 458 (527) . .+|..||.++++.++.+.++++.+++.|+++|+++++.|+.+....+ .+.......++++.|.+++|.|..+.++.. T Consensus 339 ~-~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~-~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~ 416 (474) T protein:vir:10 339 E-FNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKG-YNLDDDSYLNLIFKFTRNIPVNKLEESQVL 416 (474) T ss_pred c-ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-CCCCccccccceEEeCCCCCCCHHHHHHHH Confidence 2 23567899999999999999999999999999999999998765422 111223446799999999999999999998 Q ss_pred HHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 459 MKMVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 459 ~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) ++++ |++|.+|++.+++.++| +++|++||++|+....+..+.... ++.++ ....+++| T Consensus 417 ~kl~--g~iS~et~~~~l~~v~d--~~~E~eri~~E~~e~~~~~~~~~~-~~~~~-----~~~~~~s~ 474 (474) T protein:vir:10 417 INLK--GQVSERTRLGQSQLVDD--VDYELDEMEKESLEFNDKLPDIDE-GDAND-----KSQNNQSE 474 (474) T ss_pred HHHh--ccCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhcccccC-CCcCC-----CCccccCC Confidence 8874 99999999887755553 788999999888655544433321 11111 11112222 No 49 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=100.00 E-value=5.1e-50 Score=290.81 Aligned_cols=452 Identities=11% Similarity=0.066 Sum_probs=294.9 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccc---ccc-------------ccCc Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIE---YTN-------------TDGD 64 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~---~~~-------------~~~~ 64 (527) |+|..-+...=-+.. ..+-+.+. |.--.....|+.+..+||.|.+..+. .+. .... T Consensus 1 ~~~~~~~~~~~~~~~---~~e~i~~~-----i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (474) T protein:vir:94 1 MTLYKLIDDIEAQGI---LPKHIEAL-----IESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLD 72 (474) T ss_pred CchHHHHhhccccCC---CHHHHHHH-----HHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccc Confidence 888766654411100 00000000 11111235677778889988654321 110 0111 Q ss_pred cccCceeecchHHHHHHHHhhhhhcccceEeeC-----CHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC Q lcl|NC_019418. 65 RKRRKMQHLPIARTAAKKIASLVYNEQAEISAE-----DETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG 139 (527) Q Consensus 65 ~~~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~-----d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~ 139 (527) .+..+++++|+|+.||+..++|+||.|++++++ ++..+++|++++++|+|.....+++..+++.|.+|.++|.|. T Consensus 73 ~~~~~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~ 152 (474) T protein:vir:94 73 VSVNNKLNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDT 152 (474) T ss_pred cCcccccccchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCC Confidence 223457889999999999999999999999985 456788999999999999999999999999999999999974 Q ss_pred -CeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCC Q lcl|NC_019418. 140 -DKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTS 218 (527) Q Consensus 140 -~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~ 218 (527) +++++.+++|.+++|++.++...+.++.++. ...+..+. .|..+++++. ..+ ..|.... T Consensus 153 ~~~~~~~~i~p~~~~~v~d~~~~~~~~i~~~~-~~~~~~~~-~~~~~~~y~~----------------~~~--~~~~~~~ 212 (474) T protein:vir:94 153 NGDIRIKNIDPYNVIFVGDNILEPTYSLRYFY-EKDDDNGT-DYVYAEFYDN----------------AYY--YVFRGEG 212 (474) T ss_pred CCeeEEEEEcccceEEEEcCCCceEEEEEEEE-EeeCCCce-EEEEEEEEcC----------------ceE--EEEeecC Confidence 6799999999999999766555554443333 33333332 3334555421 111 1132211 Q ss_pred ccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcce Q lcl|NC_019418. 219 DSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRR 298 (527) Q Consensus 219 ~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~ 298 (527) .|. +... ...-+++++++++.|++ ++.|+|+|+++++++|++|.++|++++.++..... T Consensus 213 ---~~~-------~~~~--~~~~~~~g~vPvv~~~n---------~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~ 271 (474) T protein:vir:94 213 ---IDA-------LQEV--GRYEHLFDYNPLFGVPN---------NKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLA 271 (474) T ss_pred ---CCc-------cccc--ccccCCCCccceEEecC---------CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 111 0000 01124567777777754 34699999999999999999999999999864444 Q ss_pred eeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccc Q lcl|NC_019418. 299 VIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFT 378 (527) Q Consensus 299 i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~ 378 (527) +.+-..+ ..+.+. ...... +..+.. .++++.++++++++.++.+...++.+.+.|...++.+..+++ T Consensus 272 ~l~i~g~---~~~~~~-----~~~~~~----~~~i~~-~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~ 338 (474) T protein:vir:94 272 YLVLRGM---GMSEEM-----IQETQK----SGAFEL-FDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSD 338 (474) T ss_pred hhhhccC---CCCchh-----hhhhhh----cceeEe-cCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccc Confidence 4331111 111111 011111 111111 123456899999999999999999988888777665543332 Q ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHH Q lcl|NC_019418. 379 FDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYW 458 (527) Q Consensus 379 ~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~ 458 (527) . .+|..||.++++.++.+.++++.+++.|+++|+++++.|+.+....+ .+.......++++.|.+++|.|..+.++.. T Consensus 339 ~-~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~-~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~ 416 (474) T protein:vir:94 339 E-FNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKG-YNLDDDSYLNLIFKFTRNIPVNKLEESQVL 416 (474) T ss_pred c-ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-CCCCccccccceEEeCCCCCCCHHHHHHHH Confidence 2 23567899999999999999999999999999999999998765422 111223446799999999999999999998 Q ss_pred HHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 459 MKMVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 459 ~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) ++++ |++|.+|++.+++.++| +++|++||++|+....+..+.... ++.++ ....+++| T Consensus 417 ~kl~--g~iS~et~~~~l~~v~d--~~~E~eri~~E~~e~~~~~~~~~~-~~~~~-----~~~~~~s~ 474 (474) T protein:vir:94 417 INLK--GQVSERTRLGQSQLVDD--VDYELDEMEKESLEFNDKLPDIDE-GDAND-----KSQNNQSE 474 (474) T ss_pred HHHh--ccCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhcccccC-CCcCC-----CCccccCC Confidence 8874 99999999887755553 788999999888655544433321 11111 11112222 No 50 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=100.00 E-value=1.6e-48 Score=282.63 Aligned_cols=431 Identities=14% Similarity=0.129 Sum_probs=286.0 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc-C-----ccccCceeecc Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD-G-----DRKRRKMQHLP 74 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~-~-----~~~~~~~~~ln 74 (527) |.. +.|++++++ -.+...|+...++||.|+|+.+...... + ..+..++++.| T Consensus 1 l~~-~~i~~~i~~---------------------~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n 58 (451) T protein:vir:10 1 MEL-EKIRAIISA---------------------DAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHN 58 (451) T ss_pred CCH-HHHHHHHHH---------------------HHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccc Confidence 332 334444332 1234678999999999999876543221 1 12234578889 Q ss_pred hHHHHHHHHhhhhhcccceEeeCC-HHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC---------CeeEE Q lcl|NC_019418. 75 IARTAAKKIASLVYNEQAEISAED-ETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG---------DKIRV 144 (527) Q Consensus 75 l~~~i~~~~A~ll~~e~~~i~~~d-~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~---------~~~~i 144 (527) +++.||+..++|+||.|+++++++ +...+.|+.++ +|+|.....+++..++..|.+|..+|+|. +++++ T Consensus 59 ~~~~Ivd~~~~yl~G~p~~~~~~~~~~~~~~~~~~~-~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~ 137 (451) T protein:vir:10 59 FHEILVDEKASYMFTYPVLFDIDNNKELNEKVTDVL-GNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKY 137 (451) T ss_pred hHHHHHHhhhhheecccceeecCCcHHHHHHHHHHh-ccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeE Confidence 999999999999999999999866 45677888777 47899999999999999999999999974 57889 Q ss_pred EEEcCCceEEEEEcCC-ceEEEEEEEEEEeeCCCc---ceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCcc Q lcl|NC_019418. 145 AFIQAPVFLPLQSNTQ-DVSSAAILTKTIKTENRK---NVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDS 220 (527) Q Consensus 145 ~~v~a~~~~P~~~d~~-~~~~~a~~~~~~~~~~~~---~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~ 220 (527) ..++|.+++|++.++. +...+++.+........+ ...++++|.++. . .|.+ |+..... T Consensus 138 ~~i~p~~~~~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~---------------~-~~~~--~~~~~~~ 199 (451) T protein:vir:10 138 GVVNTEEIIPIYRNGIERELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTD---------------K-ILDK--YKFFGVS 199 (451) T ss_pred EEEcccceEEEEcCCCCCceEEEEEEEEeeecccccccceEEEEEEEEeC---------------C-eEEE--EEecccC Confidence 9999999999976653 444444432222222111 122333444431 1 1111 2221112 Q ss_pred ccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceee Q lcl|NC_019418. 221 QLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVI 300 (527) Q Consensus 221 ~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~ 300 (527) ..|..+ .....-+++++.++++|++ | ..|.|+|+++++|+|++|.++|++++.++.-...+. T Consensus 200 ~~~~~~---------~~~~~~~~~g~vPvv~~~n----n-----~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l 261 (451) T protein:vir:10 200 CCGSQI---------EHITVQHRFNSVPFVEFSN----N-----IKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIY 261 (451) T ss_pred cccccc---------ccccccCCCCeeeEEEecc----C-----CCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcccee Confidence 122111 1112235777888888864 2 348999999999999999999999999986555555 Q ss_pred echhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccc Q lcl|NC_019418. 301 VPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFD 380 (527) Q Consensus 301 v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~ 380 (527) +-..+-. .+... . ...+. ...+......+.+..+.+++++++++.+.+...++.+.+.|...++.+. +++. T Consensus 262 ~~~g~~~--~~~~~--~--~~~~~-~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~ 332 (451) T protein:vir:10 262 ILENFGG--EDTSE--F--LKELK-RYKTIKTETDSEGDSGGLKTMQIEIPTEARKIILEILKKQIYESGQGLQ--QDTE 332 (451) T ss_pred eeecCCc--ccchh--h--HHHHh-hCCeEEecCcCCccCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCccc--cccc Confidence 4222210 00000 0 00011 1111121122334456799999999999999999999988877766542 3344 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHH Q lcl|NC_019418. 381 GQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMK 460 (527) Q Consensus 381 ~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~ 460 (527) ..|..|+.++++.++.+.++++.+++.|+++|+++++.|+.+.+. ....++.|+|++++|.|..+.++..++ T Consensus 333 ~~gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~--------~d~~~i~i~f~~~~p~n~~e~~~~~~k 404 (451) T protein:vir:10 333 NFGNASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLGV--------TDYKKIQQTYTRNMMSNDLEDADIATK 404 (451) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--------CCccceeEEecCCCCCCHHHHHHHHHH Confidence 446678999999999999999999999999999999999976532 235678899999999999999998888 Q ss_pred HHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCC Q lcl|NC_019418. 461 MVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQ 511 (527) Q Consensus 461 ~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~ 511 (527) ++ |++|.+|++..++++++ ++++++++++|...+.....+..++-.+ T Consensus 405 l~--g~iS~et~~~~~p~v~d--~~~e~~~~~ee~~~~~~~~~~~~~~~~~ 451 (451) T protein:vir:10 405 SV--GIIPTKIILRHHPWVDD--VEEAEKLYLEEKKIQASKVSDDYNNFTE 451 (451) T ss_pred Hh--ccCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHHhhcCCCCC Confidence 75 99999999888766654 4566667655544322221111111111 No 51 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=100.00 E-value=2.4e-46 Score=270.67 Aligned_cols=453 Identities=10% Similarity=0.017 Sum_probs=287.8 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccc-cCccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNT-DGDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~-~~~~~~~~~~~lnl~~~i 79 (527) |+- -..++.++...+ .....|+.+..+||.|+++. .+... ..+..++.+..+|+|+.| T Consensus 1 ~~t---~~d~i~~L~~~~-----------------~~~~~r~~~~~~Yy~G~~~i-~~~~~~~~~~~~~~~~~~n~~~~i 59 (480) T protein:vir:78 1 MTT---YHEHVERLQGLL-----------------ARDLPNLLEAEAYRNGTRRL-KTIGIGAPPELAYLDVQPGWVATY 59 (480) T ss_pred CCC---HHHHHHHHHHHH-----------------HHHHHHHHHHHHHHhccccc-hhcccccchhhhhhhhhcchHHHH Confidence 443 444444433221 12356888889999999874 22222 222233456778999999 Q ss_pred HHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEE------e-CCeeEEEEEcCCce Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYV------D-GDKIRVAFIQAPVF 152 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~------d-~~~~~i~~v~a~~~ 152 (527) |+.+|++|+.....+ .+++..++.|++++++|+|.....+++..++.+|.+|+.+|- | .+.++|.+++|.++ T Consensus 60 vd~~~~~l~~~g~~~-~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~i~~~~p~~~ 138 (480) T protein:vir:78 60 LRTLSDRLDIEGFRI-SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYM 138 (480) T ss_pred HHHHHhhhccCceec-CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeEEEEEcccce Confidence 999999998765432 245677889999999999999999999999999999999984 2 46799999999999 Q ss_pred EEEEEcC-CceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccc Q lcl|NC_019418. 153 LPLQSNT-QDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSEL 231 (527) Q Consensus 153 ~P~~~d~-~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~ 231 (527) +|++.++ .+...+++.+. ....+.+. ++..+.|.. ..|.+ |........+..+ T Consensus 139 ~~i~D~~~~~~~~~~i~~~-~~~d~~~~--~~~~~~y~~----------------~~~~~--~~~~~~~~~~~~~----- 192 (480) T protein:vir:78 139 YAELDPRNTRRVTRAVRLY-TTRDDVAV--PDRATLYLP----------------DETVP--LRRNGGLNDQWVV----- 192 (480) T ss_pred EEEEcCCCccceEEEEEEE-EeecCCcc--eEEEEEEeC----------------CeEEE--EEecCCCcccccc----- Confidence 9997654 34555554332 22222222 334555532 11221 2221111111100 Q ss_pred cCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhh-hHHHHHHHHHHHHHHHHHHHcCc-ceeeechhHhcCC Q lcl|NC_019418. 232 YPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDN-AKTTIDFINRTYDEFMWEIKMGQ-RRVIVPEQMTQLK 309 (527) Q Consensus 232 ~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~-~~~lid~ld~~~s~~~~e~~~~~-~~i~v~~~~l~~~ 309 (527) .....-+++++++++.|+ |+...+.|+|+|+++. +++|+|++|.++|++++.++... +.+++ + + T Consensus 193 ----~~~~~~~~~g~vPvv~f~----n~~~~~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i----~--G 258 (480) T protein:vir:78 193 ----DGDVIKHGLGVVPVVPLT----NDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVI----S--G 258 (480) T ss_pred ----cccccccCCCCcceEEee----cccccCCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhh----h--C Confidence 011223567777777664 7777888999999985 89999999999999999987422 22222 1 1 Q ss_pred CCCCC-cccccccccccccceeeeccCCCCCCCcceEeccc-cChHHHHHHHHHHHHHHHHhcCCCcccccccccccchH Q lcl|NC_019418. 310 VQDNQ-GNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTP-IRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTA 387 (527) Q Consensus 310 ~~~~~-~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~-ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TA 387 (527) .+... ........+.... ..+-..+++. .++.+++ ...+.|.+.++.+++++...+++++..||..+.+..|| T Consensus 259 ~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~--~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg 333 (480) T protein:vir:78 259 VTTDELTNDGENTTLDIYY---GRILTLASEA--AKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASA 333 (480) T ss_pred CCccccccccccchhhhhh---hhhccCCCCC--ceEEecCccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHH Confidence 11000 0000001111000 0011112222 3333332 34688999999999999999999999999877777789 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcC-- Q lcl|NC_019418. 388 TEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAG-- 465 (527) Q Consensus 388 tei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aG-- 465 (527) .+++++++.|.++++.+++.|+.+|+++++.|+.+.. +....+...+.|.|.+..+.+..+.++...+++++| T Consensus 334 ~Al~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~~-----~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~ 408 (480) T protein:vir:78 334 EAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMG-----REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQG 408 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC-----CCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhccc Confidence 9999999999999999999999999999998887643 122345567999999999999999999999999887 Q ss_pred CCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccC--CCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 466 FATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELA--LYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 466 i~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) ++|.++++ .++|+++++++++ ++++++++....+... ..++.+..+.+..++..+....+ T Consensus 409 ~~s~et~~-~~lg~~~d~~~e~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 470 (480) T protein:vir:78 409 PIPKEQAR-IDLGYTATQREQM-RDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTS 470 (480) T ss_pred CCCHHHHH-hcCCCCHhHHHHH-HHHHHHHHHHHHHHhhccccCCCccccCCCCCCCCCccCCC Confidence 68998866 4559998876654 3443333222111111 11122222222222221111111 No 52 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=100.00 E-value=2.3e-46 Score=270.76 Aligned_cols=454 Identities=11% Similarity=0.049 Sum_probs=288.2 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccc-cCccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNT-DGDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~-~~~~~~~~~~~lnl~~~i 79 (527) |+- -..+++++...+ .....|+.+..+||.|+++ +.+... ..+..++++..+|+|+.| T Consensus 1 ~~t---~~~~i~~L~~~~-----------------~~~~~r~~~l~~Yy~G~~~-i~~~~~~~~~~~~~~~~~~n~~~~i 59 (480) T protein:vir:78 1 MTT---YHEHVERLQGLL-----------------ARDLPNLLEAEAYRNGTRR-LKTIGIGAPPELAYLDVQPGWVATY 59 (480) T ss_pred CCC---HHHHHHHHHHHH-----------------HHHHHHHHHHHHHHhcccc-ccccccccchhHhhhhhhcchHHHH Confidence 443 333333322111 1235678889999999987 433322 222233456778999999 Q ss_pred HHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEE------e-CCeeEEEEEcCCce Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYV------D-GDKIRVAFIQAPVF 152 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~------d-~~~~~i~~v~a~~~ 152 (527) |+.++++++....++. +++..++.|+++++.|+|...+.+++..|+.+|.+|+.+|. | .+.++|.+++|.++ T Consensus 60 vd~~~~~l~~~g~~~~-~d~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~i~~~~p~~~ 138 (480) T protein:vir:78 60 LRTLSDRLDIEGFRIS-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPLIRVESPLYM 138 (480) T ss_pred HHHHHhhhccCceecC-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeEEEEEcccce Confidence 9999999977654322 35567889999999999999999999999999999999985 2 46799999999999 Q ss_pred EEEEEcC-CceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccc Q lcl|NC_019418. 153 LPLQSNT-QDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSEL 231 (527) Q Consensus 153 ~P~~~d~-~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~ 231 (527) +|++.++ .+...+++.+. ......+. +++.+.|.. ..|.+ |........+. + T Consensus 139 ~~~~D~~~~~~~~~~i~~~-~~~~~~~~--~~~~~~y~~----------------~~~~~--~~~~~~~~~~~-~----- 191 (480) T protein:vir:78 139 YAELDPRNTRRVTRAVRLY-TTRDDVAV--PDRATLYLP----------------DETVP--LRRNGGLNDQW-V----- 191 (480) T ss_pred EEEEcCCCccceEEEEEEE-EeecCCCc--eEEEEEEeC----------------CeEEE--EEecCCCcccc-c----- Confidence 9987654 45555554332 22222222 223454431 12221 22111111110 0 Q ss_pred cCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhh-hHHHHHHHHHHHHHHHHHHHc-CcceeeechhHhcCC Q lcl|NC_019418. 232 YPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDN-AKTTIDFINRTYDEFMWEIKM-GQRRVIVPEQMTQLK 309 (527) Q Consensus 232 ~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~-~~~lid~ld~~~s~~~~e~~~-~~~~i~v~~~~l~~~ 309 (527) + .....-+++++++++.|+ |+...+.|+|+|+|+. +++|+|++|+++|++++.++. +-+.+++ ++.. T Consensus 192 ~---~~~~~~~~~g~vPvv~f~----n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i----~G~~ 260 (480) T protein:vir:78 192 V---DGDVIKHGLGVVPVVPLT----NDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVI----SGVT 260 (480) T ss_pred c---ccccccCCCCCcceEEee----cccccCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhh----hcCC Confidence 0 011223567777777664 7777888999999985 899999999999999998874 3332222 1100 Q ss_pred CCCCCcccccccccccccceeee-ccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHH Q lcl|NC_019418. 310 VQDNQGNIAFKRRFDVEQNVYMQ-VGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTAT 388 (527) Q Consensus 310 ~~~~~~~~~~~~~~d~~~~~~~~-~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAt 388 (527) .+.... ......+.. +.+ +-..+++...+..++. ...+.|.+.++.++++|...+++++..||..+.+..||. T Consensus 261 ~~~~~~-~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~ 334 (480) T protein:vir:78 261 TDELTN-DGENTTLDI----YYGRILTLASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAE 334 (480) T ss_pred cccccc-ccccchhhh----hhhhhccCCCCCceEEecCc-cCHHHHHHHHHHHHHHHhcccCCChHHhccccCcchHHH Confidence 000000 000011110 010 1111222223333222 246889999999999999999999999998777778899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcC--C Q lcl|NC_019418. 389 EIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAG--F 466 (527) Q Consensus 389 ei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aG--i 466 (527) +++..+..+..+++++++.|..+|+++++.|+.+.. +....+...+.|.|.+..+.+..++++...+++++| + T Consensus 335 Alk~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~g-----~~~~~~~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~ 409 (480) T protein:vir:78 335 AIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMG-----REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGP 409 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC-----CCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhcccc Confidence 999999999999999999999999999999887643 223345567899999999999999999999999877 7 Q ss_pred CCHHHHHHhcCCCCHHHHHHHHHHHHHhccccccc-ccC-CCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 467 ATQKRGIAKTLGITEEEAEKELAEINGELPPESDA-ELA-LYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 467 ~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~-~~~-~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) +|.++++.. +|+++++++++ ++.++|++.+... ... ..++.+..+.++.++..++.+++ T Consensus 410 ~s~et~~~~-lg~~~d~~~~~-~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 470 (480) T protein:vir:78 410 IPKEQARID-LGYTATQREQM-RDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTS 470 (480) T ss_pred CCHHHHHhc-CCCCHhHHHHH-HHHHHHHHHHHHHHhhccccccCCCCCCCCCCCCCCccccc Confidence 999997755 59998876544 3333333221111 111 11111112212222222222222 No 53 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=100.00 E-value=4e-46 Score=269.47 Aligned_cols=457 Identities=14% Similarity=0.115 Sum_probs=283.8 Q ss_pred HHhhcccchhhhccCc-----cccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHHHHhhhhhcc Q lcl|NC_019418. 16 YNMTTSHLSSILDHPK-----VAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIASLVYNE 90 (527) Q Consensus 16 ~~~~~~~~~~~~~~~~-----i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~ll~~e 90 (527) ....++.+.+.-+... |.--.....|+.+..+||.|+++............++.+..+|+|+.||+.+|++|+.. T Consensus 1 ~~~~i~~~~~~~~~~~~~~~L~~~~~~~~~r~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~ 80 (485) T protein:vir:24 1 MTAPLPGQEEIADPAIARDEMVSAFEDQNQNLRSNTSYYEAERRPEAIGVTVPVQMQSLLAHVGYPRLYVDSIAERQAVE 80 (485) T ss_pred CCCCCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCchhhcCcccchhhhhhhhccchHHHHHHHHhhhhccC Confidence 0111222221111100 01112345688888999999987422111112222345566799999999999999876 Q ss_pred cceEeeC-CHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC---------CeeEEEEEcCCceEEEEEcCC Q lcl|NC_019418. 91 QAEISAE-DETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG---------DKIRVAFIQAPVFLPLQSNTQ 160 (527) Q Consensus 91 ~~~i~~~-d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~---------~~~~i~~v~a~~~~P~~~d~~ 160 (527) . |+++ ++..++.+++++++|+|.....+++..|+.+|.+|+.+|.+. +.++|..++|.+++|++.++. T Consensus 81 g--~~~~~~~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~i~D~~~ 158 (485) T protein:vir:24 81 G--FRLGDADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVEPPTRMYAEIDPRI 158 (485) T ss_pred c--eecCCCchhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEEEeccceeEEEeeCCc Confidence 5 4554 456678899999999999999999999999999999999864 456899999999999976665 Q ss_pred ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCccccee Q lcl|NC_019418. 161 DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTP 240 (527) Q Consensus 161 ~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~ 240 (527) +...+++... .. ...+. .+.++.+.. + ..+ .++.. + |..+... .. T Consensus 159 ~~~~~~~~~~-~~-~~~~~--~~~~~~y~~-------------~--~~~--~~~~~-~----~~~~~~~---------~~ 203 (485) T protein:vir:24 159 GRPAKAIRVA-YD-AEGNE--IQAATLYTP-------------N--ETF--GWFRA-E----GEWVEWF---------SD 203 (485) T ss_pred CceeEEEEEE-Ee-ecCCe--EEEEEEEcC-------------C--cEE--EEEec-C----CceEeec---------cc Confidence 6555544322 22 22222 222333321 1 111 11111 1 1111110 11 Q ss_pred ecCCCcccEEEecCCccccccCCCccCcchhhh-hHHHHHHHHHHHHHHHHHHHc-Ccceeeech-hHhcCCCCCCCccc Q lcl|NC_019418. 241 IQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDN-AKTTIDFINRTYDEFMWEIKM-GQRRVIVPE-QMTQLKVQDNQGNI 317 (527) Q Consensus 241 ~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~-~~~lid~ld~~~s~~~~e~~~-~~~~i~v~~-~~l~~~~~~~~~~~ 317 (527) -+++++++++.|+ |+.....|+|+|++++ +++++|++|.+.|++++..+. +-+.+++.- +.-........+. T Consensus 204 ~h~~g~vPvv~f~----n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~- 278 (485) T protein:vir:24 204 PHGLGAVPVVPLP----NRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQ- 278 (485) T ss_pred ccCCCcccEEEec----cCcccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCcccccccccccc- Confidence 2456777777774 6667788999999985 899999999999999988764 222222210 0000000001110 Q ss_pred ccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHH Q lcl|NC_019418. 318 AFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDT 397 (527) Q Consensus 318 ~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~ 397 (527) ..+... ...+-..+++...+..++ ..-.+.|.+.++.++++++..+++++..||..+.+..||.+++..+..+ T Consensus 279 ---~~~~~~---~~~i~~~~~~~~~~~q~~-~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l 351 (485) T protein:vir:24 279 ---TLFDAY---LARILAFEDAEGKIQQFS-AAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRL 351 (485) T ss_pred ---chhhhc---ccceeccCCCCceEEeec-ccchHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHH Confidence 111110 001111122222333332 2346789999999999999999999999998777777899999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcC--CCCHHHHHHh Q lcl|NC_019418. 398 YQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAG--FATQKRGIAK 475 (527) Q Consensus 398 ~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aG--i~s~~~~i~~ 475 (527) .++++++++.|+++|+++++.++.+... .........++|.|.++.+.+..+.++...+++++| ++|.++++ + T Consensus 352 ~~ka~~~~~~f~~~l~~~~~l~~~~~~~----~~~~~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~-~ 426 (485) T protein:vir:24 352 IKKVERKNAIFGGAWEEAMRLAYRLMKG----GDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERAR-K 426 (485) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcC----CCCccccceeeEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHH-h Confidence 9999999999999999999998876442 223345678999999999999999999999999876 89999976 6 Q ss_pred cCCCCHHHHHHHHHHHHHhccccccccc-CC-------CCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 476 TLGITEEEAEKELAEINGELPPESDAEL-AL-------YGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 476 ~~~~~deea~~el~ri~~E~~~~~~~~~-~~-------~~~~~~~~~~~~~~~~~~~~~~ 527 (527) ++|+++++++ +++++++|+........ .+ ++.+.+.+.+++....++.|-+ T Consensus 427 ~l~~~~d~~~-e~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~a 485 (485) T protein:vir:24 427 DMGYSIAERE-EMRRWDEEEAAMGLGLLGTMVDADPTVPGSPNPTPAPKPQPAIEGGDSA 485 (485) T ss_pred hCCCCHhHHH-HHHHHHHHHhhhhhhHHHhhcccCCCCCCCCCCCCCCCCccCCCCCCCC Confidence 6799988764 56776655533222111 11 1111111111110011111111 No 54 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=100.00 E-value=1.9e-45 Score=265.80 Aligned_cols=443 Identities=13% Similarity=0.100 Sum_probs=280.6 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc-CccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD-GDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~-~~~~~~~~~~lnl~~~i 79 (527) .-++++|.+.++ +...|+.++.+||.|+++. .+.... .+..++.+.++|+|+.| T Consensus 14 ~~~~~~l~~~~~------------------------~~~~rl~~l~~Yy~G~~~i-~~~~~~~~~~~~~~~~~~n~~~~i 68 (484) T protein:vir:77 14 EKAREEMLNLFT------------------------ERTQDLGDNTAYYESERRP-DAVGVTVPQQMQKLLAHVGYPRLY 68 (484) T ss_pred HHHHHHHHHHHH------------------------HHHHHHHHHHHHHhccccc-hhcccccchhHHhhhhhcCcHHHH Confidence 112222222221 2335677788999998773 222221 12222334577999999 Q ss_pred HHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCe---------eEEEEEcCC Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDK---------IRVAFIQAP 150 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~---------~~i~~v~a~ 150 (527) |+.++++++....++. +++..++.+++++++|+|.....+++..|+.+|.+|+.+|.+.++ ++|.+++|. T Consensus 69 vd~~~~~l~~~g~~~~-~~~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~ 147 (484) T protein:vir:77 69 IDAIAARQELEGFRLG-GADKADEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEPPT 147 (484) T ss_pred HHHHHhhhccCceecC-CcchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEeccc Confidence 9999999987654322 344567889999999999999999999999999999999997542 578999999 Q ss_pred ceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeeccc Q lcl|NC_019418. 151 VFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSE 230 (527) Q Consensus 151 ~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~ 230 (527) ++++++.+..+...+++.+ +..+..+.++ .++.+.. ..| +.+++.. |.. .+. T Consensus 148 ~~~~~~D~~~~~~~~a~~~--~~~~~~~~~~--~~~~y~~----------------~~~-~~~~~~~-----~~~-~~~- 199 (484) T protein:vir:77 148 NLYAQIDPRTRQVMRAIRA--IEDEEGNEVI--GATLYLP----------------NNT-VIWNRED-----GQW-VQV- 199 (484) T ss_pred eeEEEecCCCCceEEEEEE--EEeecCCcEE--EEEEEec----------------CeE-EEEEecC-----Cce-Eee- Confidence 9999976665655554432 2233322222 1232221 111 1112211 110 000 Q ss_pred ccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhh-hhHHHHHHHHHHHHHHHHHHHc-Ccceeeec----hh Q lcl|NC_019418. 231 LYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFD-NAKTTIDFINRTYDEFMWEIKM-GQRRVIVP----EQ 304 (527) Q Consensus 231 ~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~-~~~~lid~ld~~~s~~~~e~~~-~~~~i~v~----~~ 304 (527) ...-+++++++++.|+ |+...+.|+|+|+|+ .+++|+|++|+++|++++..+. +-+..++- .. T Consensus 200 -------~~~~~~~g~vPvv~f~----N~~~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~ 268 (484) T protein:vir:77 200 -------ANVAHNLEMVPVIPIP----NRTRLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVKGEE 268 (484) T ss_pred -------ccccCCCCCcceEEec----cccccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCCcch Confidence 0112566777777664 667788899999998 5889999999999999988774 22222220 00 Q ss_pred HhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccccccc Q lcl|NC_019418. 305 MTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGV 384 (527) Q Consensus 305 ~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~ 384 (527) .. .....+.. .+...-. .+-..+++...+.+++ ..-.+.|++.+..++++|+..+++++..||..+.+. T Consensus 269 ~~---~~~~~~~~----~~~~~~~---~~~~~~~~~~~~~q~~-~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~ 337 (484) T protein:vir:77 269 LG---VDPETGQT----LFDAYLA---RILAFEDHESKAQQFS-AAELRNFVDALDALDRKAAAYTGLPPYYLSFSSENP 337 (484) T ss_pred hc---ccccccch----hhhhhhh---hhcccCCCCceeEeec-CCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcc Confidence 00 00111111 1111000 0111122223333332 234578999999999999999999999999887777 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhc Q lcl|NC_019418. 385 KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAA 464 (527) Q Consensus 385 ~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~a 464 (527) .||.+++..++.+.++++++++.|+++|+++++.++.+... .........++|.|.+..+.+..+.++...+++++ T Consensus 338 ~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~~~----~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~ 413 (484) T protein:vir:77 338 ASAEAIRSSESRLVKTVERKNKIFGGAWEQAMRVAYKVMNG----GDIPPEYYRMESIWRDPSTPTYAAKADAATKLYNN 413 (484) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CCcccccccceEEecCCCCCCHHHHHHHHHHHHhc Confidence 89999999999999999999999999999999998876532 12233456789999999999999999999999998 Q ss_pred C--CCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccc--------cccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 465 G--FATQKRGIAKTLGITEEEAEKELAEINGELPPESD--------AELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 465 G--i~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) | ++|.++++.. +|+++++++ +++++++|...... .....+++++.+++.+ ...+.++++ T Consensus 414 g~gi~s~et~~~~-l~~~~~~~~-e~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~ 482 (484) T protein:vir:77 414 GQGVIPKERARID-MGYSITERE-EMRKWDEEEQAQGLGLMGTMFGTDPSGGGNPDNPETPE--PQPNPAEEA 482 (484) T ss_pred cCCCCCHHHHHhc-CCCChhHHH-HHHHHHHHHHHHHHHHHhhhccccccCCCCCCCCCccc--ccCCCcccc Confidence 6 8999997654 599988764 46666655432211 1111222222222111 112222222 No 55 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=100.00 E-value=7.1e-45 Score=262.62 Aligned_cols=453 Identities=12% Similarity=0.047 Sum_probs=281.2 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) |--.+-...++..+... -.....|+++..+||.|+++............++.+..+|+|+.|| T Consensus 8 ~~~~~~~~~~~~~l~~~-----------------~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iv 70 (485) T protein:vir:10 8 QEEIEDPAIARDEMVSA-----------------FEDSTQNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRLYV 70 (485) T ss_pred CCCCCCHHHHHHHHHHH-----------------HHHHHHHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHHHH Confidence 11111111111111100 1123467888999999998753222222223334556679999999 Q ss_pred HHHhhhhhcccceEee-CCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC---------CeeEEEEEcCC Q lcl|NC_019418. 81 KKIASLVYNEQAEISA-EDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG---------DKIRVAFIQAP 150 (527) Q Consensus 81 ~~~A~ll~~e~~~i~~-~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~---------~~~~i~~v~a~ 150 (527) +.+|++|+... |++ +++..++.+++++.+|+|.....+++..|+.+|.+|+.+|.+. +.++|.+++|. T Consensus 71 d~~~~~l~~~g--~~~~~~~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~ 148 (485) T protein:vir:10 71 DSIAERQAVEG--FRFGDADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRVEPPT 148 (485) T ss_pred HHHHhhhcccc--eecCCCchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEEEccc Confidence 99999997654 444 4456778899999999999999999999999999999999863 46789999999 Q ss_pred ceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeeccc Q lcl|NC_019418. 151 VFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSE 230 (527) Q Consensus 151 ~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~ 230 (527) +++|++.+..+...+++.+. .... .+. ++.++++.. ...++ +++. . |. ..+ T Consensus 149 ~~~~~~D~~~~~~~~~~~~~-~~~~-~~~--~~~~~~y~~---------------~~~~~--~~~~-~----~~-~~~-- 199 (485) T protein:vir:10 149 RMYAEIDPRIGRVSKAIRVA-YDAE-GNE--IQAATLYTP---------------NDIFG--WYRV-E----NE-WQE-- 199 (485) T ss_pred eeEEEEcCCCCceeEEEEEE-EeeC-CCe--EEEEEEEeC---------------CeEEE--EEEc-C----Cc-eEE-- Confidence 99999866666666555432 2222 221 223444321 11111 1111 1 00 000 Q ss_pred ccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhh-hHHHHHHHHHHHHHHHHHHHcCc-ceeeechhHhcC Q lcl|NC_019418. 231 LYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDN-AKTTIDFINRTYDEFMWEIKMGQ-RRVIVPEQMTQL 308 (527) Q Consensus 231 ~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~-~~~lid~ld~~~s~~~~e~~~~~-~~i~v~~~~l~~ 308 (527) ....-+++++++++.| +|+...+.|+|+|+++. +++|+|++|+++|++.+..+... +..++ ++. T Consensus 200 ------~~~~~~~~g~vPvv~~----~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i----~G~ 265 (485) T protein:vir:10 200 ------WFNNPHGLGVVPVVPI----PNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLI----FGI 265 (485) T ss_pred ------eccccCCCCcccEEEe----ccccccCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHH----hcC Confidence 0011245667777666 47778889999999985 89999999999999998776422 11111 110 Q ss_pred CCCCCCc-ccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchH Q lcl|NC_019418. 309 KVQDNQG-NIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTA 387 (527) Q Consensus 309 ~~~~~~~-~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TA 387 (527) ..+.... .......+.... -.+-..+++...+.+++ .-..+.|.+.++.++++|+..+++++..||..+.+..|| T Consensus 266 ~~~~~~~~~~~~~~~~~~~~---~~i~~~~~~d~k~~q~~-~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg 341 (485) T protein:vir:10 266 KPEEIGVDPETGQTLFDAYL---ARILAFEDAEGKIQQFS-AAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASA 341 (485) T ss_pred Ccccccccccccchhhhhcc---cceeccCCCCceEEeec-ccchHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHH Confidence 0000000 000001111100 00111122222333332 233678999999999999999999999998877777789 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcC-- Q lcl|NC_019418. 388 TEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAG-- 465 (527) Q Consensus 388 tei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aG-- 465 (527) .+++.....+.++++.+++.|..+|+++++.++.+... .........+.|.|.++.+.|..+.++...+++++| T Consensus 342 ~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~~~----~~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~ 417 (485) T protein:vir:10 342 EAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYRMMKG----GDVPPDMLRMETVWRDPSTPTYAAKADAASKLYNGGTG 417 (485) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhcccc Confidence 99999999999999999999999999999988876542 223345568999999999999999999999999987 Q ss_pred CCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccc--------cccCCCCCCC--CCCCCCCCCCCCccc Q lcl|NC_019418. 466 FATQKRGIAKTLGITEEEAEKELAEINGELPPESD--------AELALYGKGQ--QNTVGNSKDTVDDED 525 (527) Q Consensus 466 i~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~--------~~~~~~~~~~--~~~~~~~~~~~~~~~ 525 (527) ++|.++++ +++|++++++ ++++++++|+..... ......+.++ +.+......++|+.- T Consensus 418 ~~s~et~~-~~lg~~~~~~-~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 418 VIPRERAR-KDMGYSIAER-EEMRRWDEEEAAMGLGLIGTMVDPNPTVPGSPSPAPAPKPAALESGGDAA 485 (485) T ss_pred CCCHHHHH-HhCCCCHhHH-HHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCCccccccCcCCCCCCCCC Confidence 89999976 5679998875 455665554432111 1111111111 111110111111111 No 56 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=100.00 E-value=2.1e-44 Score=260.08 Aligned_cols=433 Identities=13% Similarity=0.014 Sum_probs=276.9 Q ss_pred hHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHHH Q lcl|NC_019418. 3 LIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKK 82 (527) Q Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~ 82 (527) |=.--+.|++++... -.....|+++..+||.|+++.........+..++.++.+|+|+.||+. T Consensus 1 ~~~~~~~~i~~l~~~-----------------~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~ 63 (441) T protein:vir:80 1 MNSDELALIEGMYDR-----------------IQRLSSWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDA 63 (441) T ss_pred CCccHHHHHHHHHHH-----------------HHHHHHHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHH Confidence 222222222221111 112345788899999999874322222223333566788999999999 Q ss_pred HhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEEEEEcCCceEEEEEcCCc Q lcl|NC_019418. 83 IASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRVAFIQAPVFLPLQSNTQD 161 (527) Q Consensus 83 ~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i~~v~a~~~~P~~~d~~~ 161 (527) +|++++. ..|+++++ +.|++++++|+|.....+++..++..|.+|+.+|.|. +.++|.+++|.+++|++.++.+ T Consensus 64 ~~~~l~~--~g~~~~d~---~~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~ 138 (441) T protein:vir:80 64 LEERLDW--LGWTNGDG---YGLDGVYAANRLATASCDVHLDALIFGLSFVAIIPHGDGTVSVRPQSPKNCTGKFSADGS 138 (441) T ss_pred HHhhhcc--ccccCCCh---HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeCCCCceEEEEEccceEEEEEeCCCC Confidence 9999954 45666553 4588888999999999999999999999999999974 6799999999999999776666 Q ss_pred eEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceee Q lcl|NC_019418. 162 VSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPI 241 (527) Q Consensus 162 ~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~ 241 (527) ...+++..... . .+...+ .+.|.. ...++ |..... |..+. ....- T Consensus 139 ~~~~~~~~~~~-~-~~~~~~---~~vy~~---------------~~~~~---~~~~~~---~~~~~---------~~~~~ 183 (441) T protein:vir:80 139 RLDAGLVVQQT-C-DPEVVE---AELLLP---------------DVIVQ---VERRGS---REWVE---------VDRIP 183 (441) T ss_pred ceeEEEEEEEE-e-cCceEE---EEEEec---------------CeEEE---EEEcCC---cceee---------ccccc Confidence 55555433222 1 222111 232211 11111 111110 11000 01112 Q ss_pred cCCCcccEEEecCCccccccCCCccCcchhhh-hHHHHHHHHHHHHHHHHHHHc-CcceeeechhHhcCCCCCCCccccc Q lcl|NC_019418. 242 QGLSRPLFTYLKTPGMNNKDINSPLGLSIFDN-AKTTIDFINRTYDEFMWEIKM-GQRRVIVPEQMTQLKVQDNQGNIAF 319 (527) Q Consensus 242 ~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~-~~~lid~ld~~~s~~~~e~~~-~~~~i~v~~~~l~~~~~~~~~~~~~ 319 (527) +++++++++.|+ |+.....|+|.|++.+ +++|+|++|.++|++.+..+. +-+.+++. .+ ..+.... T Consensus 184 ~~~g~vPvv~~~----n~~~~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~-G~---~~~~~~~---- 251 (441) T protein:vir:80 184 NVLGAVPLVPIV----NRRRTSRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVT-GV---SADEFSQ---- 251 (441) T ss_pred cCCCceeEEEee----ccccCCccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeee-cC---Ccccccc---- Confidence 455666666554 6777788999999975 899999999999999998875 33333331 11 1111000 Q ss_pred ccccccccceeeeccCCCCCCCcceEecc-ccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHH Q lcl|NC_019418. 320 KRRFDVEQNVYMQVGAGNMDSGGIVDLTT-PIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTY 398 (527) Q Consensus 320 ~~~~d~~~~~~~~~~~~~~~~~~i~~~~~-~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~ 398 (527) ..+......+..+..+ .+..+++..+. .-..+.|++.++.++++|...+++++..||..+.+..||.+++.+.+.+. T Consensus 252 -~~~~~~~~~i~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~ 329 (441) T protein:vir:80 252 -PGWVLSMASVWAVDKD-DDGDTPNVGSFPVNSPTPYSDQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEESRLV 329 (441) T ss_pred -chhhhcccccccCCCC-CCCCcceeEecCccchHHHHHHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHHHHH Confidence 0010000011122211 12223443332 23468899999999999999999999999988777788999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCC--CHHHHHHhc Q lcl|NC_019418. 399 QMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFA--TQKRGIAKT 476 (527) Q Consensus 399 ~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~--s~~~~i~~~ 476 (527) .+++.+++.|+++|+++++.++.+... .........++++.|.++++.|..++++...+++++|++ |+++++ .+ T Consensus 330 ~k~~~~~~~f~~~l~~~~~l~~~~~~~---~~~~~~~~~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s~~~~~-~~ 405 (441) T protein:vir:80 330 KRAERRQTSFGQGWLSVGFLAAKALDS---RVDEADFFGDVGLRWRDASTPTRAATADAVTKLVGAGILPADSRTVL-EM 405 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcC---CCcccccceeeeEEeCCCCCcCHHHHHHHHHHHHhcCcccccHHHHH-Hh Confidence 999999999999999999988876432 223333456789999999999999999999999999974 666654 66 Q ss_pred CCCCHHHHHHHHHHHHHhcccccccccCCCC--CCCCCCC Q lcl|NC_019418. 477 LGITEEEAEKELAEINGELPPESDAELALYG--KGQQNTV 514 (527) Q Consensus 477 ~~~~deea~~el~ri~~E~~~~~~~~~~~~~--~~~~~~~ 514 (527) .|+++++++++. +.++|+.. ....+++ +.+++.. T Consensus 406 l~~~~~e~~~~~-~e~~e~~~---~~~~~~~~~~~~~~~~ 441 (441) T protein:vir:80 406 LGLDDVQVEAVM-RHRAESSD---PLAVLAGAISRQTNEV 441 (441) T ss_pred CCCCHHHHHHHH-HHHHHHHH---HHHHHhhhhhcccccC Confidence 699888766443 33333321 1111111 1222222 No 57 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=100.00 E-value=2.8e-44 Score=259.39 Aligned_cols=463 Identities=11% Similarity=0.054 Sum_probs=279.8 Q ss_pred hcccchhhhcc--CccccCH--------------------HHHHHHHHHHHHhcCCCcccccc-cccCcccc-Cceeecc Q lcl|NC_019418. 19 TTSHLSSILDH--PKVAVTQ--------------------SEFRRIQHNLAYYQSKFDDIEYT-NTDGDRKR-RKMQHLP 74 (527) Q Consensus 19 ~~~~~~~~~~~--~~i~~~~--------------------~~~~~i~~~~~~y~g~~~~l~~~-~~~~~~~~-~~~~~ln 74 (527) ++.++..|.+. .+|.+|. ....|+++..+||.|+++..... ......+. .++.++| T Consensus 1 ~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v~n 80 (501) T protein:vir:25 1 MTVPVDVIADAPAADVEFPEDSMSREQLGALVADMWRLHISERQWLDRIYEYTKGLRGRPEVPEGASDEVKELAKLSVKN 80 (501) T ss_pred CcccchhhhccCcccccCCcccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccCChhhhhhHhhhhcC Confidence 22222222211 1232221 12346777788999998742211 11111111 2245679 Q ss_pred hHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcCCceEE Q lcl|NC_019418. 75 IARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQAPVFLP 154 (527) Q Consensus 75 l~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a~~~~P 154 (527) +|+.||+.+|++|+-+ .|++.+...++.+++++++|+|.....+++..|+.+|.+|+.+|.+.+.++|.+++|.++++ T Consensus 81 ~~~~ivd~~a~~l~~~--gf~~~d~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~~~~i~~~sp~~~~~ 158 (501) T protein:vir:25 81 VLSLVRDSFAQNLSVV--GYRNALAKENDPAWEMWQRNRMDARQAEVHRPALTYGASYVTVTPTDEGPVFRTRSPRQILA 158 (501) T ss_pred hHHHHHHHHHhhhccc--ceecCCccchHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCCeEEEeccccEEE Confidence 9999999999999754 47777777788899999999999999999999999999999999998888999999999999 Q ss_pred EEEcCC--ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEE-EEEEEecCCccccCceeeccc- Q lcl|NC_019418. 155 LQSNTQ--DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRI-TNELYKSTSDSQLGERVNLSE- 230 (527) Q Consensus 155 ~~~d~~--~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I-~n~ly~~~~~~~lG~~v~l~~- 230 (527) ++.|.. +...+++.+.......+...+.+ ++... . -|++ .+.++............+.+. T Consensus 159 iy~D~~~~~~~~~ai~~~~~~~~~~~~~~~~---~y~~~------~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 222 (501) T protein:vir:25 159 VYADPSVDAWPQYALETWVAQKDAKPHRRGV---LYDDT------Y-------MYELDLGEVVLGDAGGGQATQQPVNVR 222 (501) T ss_pred EEecCCCCcceeEEEEEEeeccccCcceeEE---EecCe------e-------EEEEecCceeeeecccccccccccccc Confidence 976532 23444444333222222211111 11000 0 0000 000000000000000011110 Q ss_pred -ccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCC Q lcl|NC_019418. 231 -LYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLK 309 (527) Q Consensus 231 -~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~ 309 (527) ..+..+.....++++..+++.|+ |+.. ..++|+|+|+.+++|+|++|++.|++.+..+.... |.-++. + T Consensus 223 ~~~~~~~~~~~~~~~~~vPiv~f~----N~~~-~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~----p~~~i~-G 292 (501) T protein:vir:25 223 EVTDVIEHGATFEGKPVCPVVRFV----NGRD-ADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGAN----PQRVIS-G 292 (501) T ss_pred ccccccccccccCCccceeeEecc----Cccc-cCccccchhhhhHHHHHHHHHHHHHHHHHHHhhcc----HHHHHh-C Confidence 01111112223455666666654 5444 36789999999999999999999999988875322 222221 1 Q ss_pred CCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHH Q lcl|NC_019418. 310 VQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATE 389 (527) Q Consensus 310 ~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAte 389 (527) ..++. ...|.... -.+-..+++...+..++ ..-.+.|.+.++.++++|...+++++.+||...+. .||.+ T Consensus 293 ~~~~~-----~~~~~~~~---~~i~~~~~~~~~~~q~~-~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N-~Sg~A 362 (501) T protein:vir:25 293 WTGSK-----AEVLKASA---LRVWTFEDPEVKAQAFP-PASVEPYNLILEEMLQHVAMVAQISPAQVTGKMIN-VSAEA 362 (501) T ss_pred CCCCc-----cchhhhcc---cceeccCCCCceEEEec-ccChHHHHHHHHHHHHHHHhhcCCChhhhccccCC-hHHHH Confidence 11111 01111111 01111122223343332 23457899999999999999999999999865444 48999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCH Q lcl|NC_019418. 390 IVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQ 469 (527) Q Consensus 390 i~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~ 469 (527) +++.+..+.+++.++++.|.++|+++++.++.+.. +.......+++|.|.+..+.+..++++.+.+++++|+ |. T Consensus 363 l~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~-----~~~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~gi-s~ 436 (501) T protein:vir:25 363 LAAAEANQQRKLAAKRESFGESWEQLLRLAAEMDD-----DPDTAADSGAEVLWRDTEARSFGAVVDGITKLASAGI-PI 436 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCC-CH Confidence 99999999999999999999999999999887643 1223455689999999999999999999999998885 99 Q ss_pred HHHHHhcCCCCHHHHHHHHHHHHHhcccccccc-------cCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 470 KRGIAKTLGITEEEAEKELAEINGELPPESDAE-------LALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 470 ~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) ++.+.+++|+++++++++.++.+++.+...... +...+.+++ ..+..+.++-..++ T Consensus 437 et~~~~~~g~~~~~ie~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~ 499 (501) T protein:vir:25 437 EHLLSMVPGMTQQTIQAIKDSLRGGEVKSLVDKLLSNEPAPVPPPPPQA--AAQALNEGGVNGNG 499 (501) T ss_pred HHHHHHcCCCCHHHHHHHHHHHHHHhHHHHHHHhhccCcCCCCCCCCCC--CccccccccCCCCC Confidence 999999999999887766655544433111110 111111111 00111111111111 No 58 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=100.00 E-value=1.9e-44 Score=260.25 Aligned_cols=451 Identities=12% Similarity=0.058 Sum_probs=280.9 Q ss_pred CCh--HHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHH Q lcl|NC_019418. 1 MSL--IQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIART 78 (527) Q Consensus 1 m~~--~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~ 78 (527) +++ .+-.+.++.++...+ .....|+.+..+||.|+++.........+..++.+..+|+|+. T Consensus 6 ~~~~e~~~~~~~~~~l~~~~-----------------~~~~~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~~~ 68 (486) T protein:vir:42 6 PGMEEIEDPAVVREEMISAF-----------------EDASKDLASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYPRL 68 (486) T ss_pred CCCCCcccHHHHHHHHHHHH-----------------HHHHHHHHHHHHHhcccCcchhcccccchhHhhhhhccchHHH Confidence 111 111122222211110 1134678888899999986432111111222233456799999 Q ss_pred HHHHHhhhhhcccceEeeCC-HHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC---------CeeEEEEEc Q lcl|NC_019418. 79 AAKKIASLVYNEQAEISAED-ETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG---------DKIRVAFIQ 148 (527) Q Consensus 79 i~~~~A~ll~~e~~~i~~~d-~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~---------~~~~i~~v~ 148 (527) ||+.+|++|+... +++++ +..++.+++++++|+|.....+++..|+.+|.+|+.+|.+. +.++|..++ T Consensus 69 iVd~~~~~l~~~g--~~~~~~~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~ 146 (486) T protein:vir:42 69 YVDSVAERQAVEG--FRLGDADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVEP 146 (486) T ss_pred HHHHHHhhhcccc--eecCCCchhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEec Confidence 9999999986544 55554 44567789999999999999999999999999999998753 457899999 Q ss_pred CCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeec Q lcl|NC_019418. 149 APVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNL 228 (527) Q Consensus 149 a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l 228 (527) |.+++|++.+..+...+++.+ +. ....+.+ +.++++.. ..++++ ...+ |.... T Consensus 147 p~~~~~i~d~~~~~~~~~~~~-~~-~~~~~~~--~~~~~y~~---------------~~~~~~---~~~~----~~~~~- 199 (486) T protein:vir:42 147 PTRMHAEIDPRINRVSKAIRV-AY-DKEGNEI--QAATLYTP---------------METIGW---FRAD----GEWAE- 199 (486) T ss_pred ccceEEEEeCCCCCeEEEEEE-EE-ecCCCeE--EEEEEEcC---------------CcEEEE---EecC----CcEEe- Confidence 999999977666665555532 22 2222222 22333221 111111 1111 11111 Q ss_pred ccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhh-hHHHHHHHHHHHHHHHHHHHcCcceeeechhHhc Q lcl|NC_019418. 229 SELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDN-AKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQ 307 (527) Q Consensus 229 ~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~-~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~ 307 (527) ....-+++++++++.| +||.....|+|+|+|+. +++++|++|+++|++.+..+...-++ -++. T Consensus 200 --------~~~~~h~~g~vPvv~~----~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~----~~i~ 263 (486) T protein:vir:42 200 --------WFNVPHGLGVVPVVPL----PNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQ----RLIF 263 (486) T ss_pred --------ecceecCCCCceEEEe----ccccccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchH----HHhh Confidence 0111256677777666 47778889999999995 88999999999999998766422111 1111 Q ss_pred CCCCCCCc---ccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccccccc Q lcl|NC_019418. 308 LKVQDNQG---NIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGV 384 (527) Q Consensus 308 ~~~~~~~~---~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~ 384 (527) +.+.... .......|.... -.+-..+++...+..+ +....+.|.+.++.++++++..+++++..||..+.+. T Consensus 264 -G~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~q~-~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~ 338 (486) T protein:vir:42 264 -GIKPEEIGVDSETGQTLFDAYL---ARILAFEDAEGKIQQF-SAAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNP 338 (486) T ss_pred -cCCccccccccccccchhhhhh---chhcccCCCCceEEee-cccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCch Confidence 1000000 000001111100 0111111222233333 2345688999999999999999999999999888777 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhc Q lcl|NC_019418. 385 KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAA 464 (527) Q Consensus 385 ~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~a 464 (527) .||.+++..++.+.++++.+++.|+.+|+++++.++.+... .....+...+.|.|.+..+.|..+.++...+++++ T Consensus 339 ~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~----~~~~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~ 414 (486) T protein:vir:42 339 ASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMKG----GDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGN 414 (486) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC----CCccccceeeeEEecCCCCCCHHHHHHHHHHHHhc Confidence 88999999999999999999999999999999998876542 12234556799999999999999999999999987 Q ss_pred --CCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccC-CCCCCC---------CCCCCCC--CCCCCcc Q lcl|NC_019418. 465 --GFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELA-LYGKGQ---------QNTVGNS--KDTVDDE 524 (527) Q Consensus 465 --Gi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~-~~~~~~---------~~~~~~~--~~~~~~~ 524 (527) |++|.++++ .++|++++++ ++++|+++|+........+ +.+... +++.+++ ...+|+. T Consensus 415 ~~g~~s~et~~-~~lg~~~d~~-~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (486) T protein:vir:42 415 GQGVIPRERAR-IDMGYSVKER-EEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPSPTAPPKPQPAIESSGGDA 486 (486) T ss_pred ccCCCCHHHHH-hcCCCChhHH-HHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCCCCCCCCCcccCCCCCCC Confidence 789999976 6679998764 5677876665432222111 111100 0000000 1122222 No 59 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=100.00 E-value=4e-44 Score=258.51 Aligned_cols=445 Identities=10% Similarity=0.037 Sum_probs=273.8 Q ss_pred CChHHHHHHHHH-HHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCcc---ccCceeecchH Q lcl|NC_019418. 1 MSLIQKVKDFFN-RGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDR---KRRKMQHLPIA 76 (527) Q Consensus 1 m~~~~~~k~~~~-~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~---~~~~~~~lnl~ 76 (527) |.- +.++.++. ++. .+ =.....|+.+..+||.|+++........... +-+++.++|+| T Consensus 9 l~~-~~~~~~~~~~l~----~~-------------~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~ 70 (479) T protein:vir:99 9 LSS-EGLAKYLETKVF----PK-------------MNTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWM 70 (479) T ss_pred CCh-hHHHHHHHHHHH----HH-------------HHHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcH Confidence 110 00111111 100 00 0123457888899999998754332221111 11233467999 Q ss_pred HHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEE-----e-CCeeEEEEEcCC Q lcl|NC_019418. 77 RTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYV-----D-GDKIRVAFIQAP 150 (527) Q Consensus 77 ~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~-----d-~~~~~i~~v~a~ 150 (527) +.||+.+|++++.+ .|++.++..++.+++++++|+|.....+++..++.+|.+|+.+|. | .+.++|.+++|. T Consensus 71 ~~iVd~~~~~l~~~--gf~~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~~~i~~~~p~ 148 (479) T protein:vir:99 71 GLMVNSFAQQLIVD--GYRKTGTNENAKGWDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGISPLDGTTVARIKCIDPR 148 (479) T ss_pred HHHHHHHHhhcccc--cccCCCchhhHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCcCCCCceEEEEechh Confidence 99999999998754 577888888888999999999999999999999999999999985 2 356899999999 Q ss_pred ceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeeccc Q lcl|NC_019418. 151 VFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSE 230 (527) Q Consensus 151 ~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~ 230 (527) +++|++.+.......++.. .+. ......||+...++ .|.... |.. .+ T Consensus 149 ~~~~iydd~~~~~~~~~~~-~~~-~~~~~~~~~~~~~~------------------------~~~~~~----~~~-~~-- 195 (479) T protein:vir:99 149 DAFAIWEDPYWDEWPKYLL-ERQ-PNGQYWWWTEEDYS------------------------IFEFKQ----GKF-IY-- 195 (479) T ss_pred heEEEecCCcccceeeEEE-eec-CceeEEEEecceEE------------------------EEEecC----Cce-ee-- Confidence 9999975544333223222 111 11122233211111 111111 100 00 Q ss_pred ccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCccee-eechhHhcCC Q lcl|NC_019418. 231 LYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRV-IVPEQMTQLK 309 (527) Q Consensus 231 ~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i-~v~~~~l~~~ 309 (527) ....-+++++++++.|+ |+... .++|+|+|+.+++++|++|++.|++.+..+.....+ ++. .+. . T Consensus 196 ------~~~~~h~~g~vPvv~f~----n~~~~-~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~-G~~--~ 261 (479) T protein:vir:99 196 ------RETVSHDYGHIPFVRYV----NVMDL-RGVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWAT-GLM--L 261 (479) T ss_pred ------ccccccCCCCcceEEee----cCCCc-CcCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhc-CCC--c Confidence 01112456778888775 44433 568999999999999999999999999887533222 221 110 0 Q ss_pred CCCCCcccccccccccc-cceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHH Q lcl|NC_019418. 310 VQDNQGNIAFKRRFDVE-QNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTAT 388 (527) Q Consensus 310 ~~~~~~~~~~~~~~d~~-~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAt 388 (527) .+...+.. ..+... .++. . ..++...+..++ ..-.+.|.+.++.++++|+..+++++..||.. +..||. T Consensus 262 ~~~~~~~~---~~~~~~~~~i~---~-~~~~~~~~~q~~-~~~~~~~~~~l~~~i~~i~~~t~~p~~~~g~~--~n~Sg~ 331 (479) T protein:vir:99 262 PEGANADQ---EKMRFAQESML---I-SQNEKASFGAIP-AAPLDGLLNAYKESLLEFLALAQLPPHIAGQI--VNVAAD 331 (479) T ss_pred ccccccch---hccccccccce---e-ecCCCceEEEec-ccchHHHHHHHHHHHHHHhccCCCCHHHcccc--cchHHH Confidence 11111111 001110 1111 1 122223343333 34478999999999999999999999999864 347899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCC Q lcl|NC_019418. 389 EIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFAT 468 (527) Q Consensus 389 ei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s 468 (527) +++.++..+..+++.+++.|..+|+++++.++.+... ........+++.|.+..+.+..+.++.+.+++++|++| T Consensus 332 Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~~~-----~~~~~~~~i~~~w~~~~~~s~~~~ad~~~kl~~ag~is 406 (479) T protein:vir:99 332 ALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKIEGR-----TEEATDLDFTITWQDVTIQSLAQFADAWAKMVESLKIP 406 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC-----CccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCCC Confidence 9999999999999999999999999999998876431 12234567999999999999999999999999999999 Q ss_pred HHHHHHhcCCCCHHHHHHHHHHHHHhcc--cccccccC--CCCCCCCCCCCCCC--CCCCccccC Q lcl|NC_019418. 469 QKRGIAKTLGITEEEAEKELAEINGELP--PESDAELA--LYGKGQQNTVGNSK--DTVDDEDEA 527 (527) Q Consensus 469 ~~~~i~~~~~~~deea~~el~ri~~E~~--~~~~~~~~--~~~~~~~~~~~~~~--~~~~~~~~~ 527 (527) .++++..++|+++++++++.+..+++.+ ........ .+.+..+.+.+... ..+++.++. T Consensus 407 ~et~l~~l~gv~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (479) T protein:vir:99 407 AEGVWDMIPNLDQSTVNGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNGATNMQQANNKTGEP 471 (479) T ss_pred HHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCCCCCCCCCCCCCCCCcch Confidence 9999999989998876544322222211 11111110 11111111111111 111212222 No 60 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=100.00 E-value=1.1e-43 Score=256.13 Aligned_cols=450 Identities=12% Similarity=0.084 Sum_probs=276.9 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) ..|+++|.+.+. ....|+.+..+||.|+++.........+..+++++.+|+|+.|| T Consensus 10 ~~~i~~L~~~~~------------------------~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iv 65 (488) T protein:vir:23 10 EKLRDQLLDAFE------------------------NKQNELKSSKAYYDAERRPDAIGLAVPLDMRKYLAHVGYPRTYV 65 (488) T ss_pred HHHHHHHHHHHH------------------------HHHHHHHHHHHHHhcccchhhcCcccchhhhhhhhhcchHHHHH Confidence 223222222221 12357888899999998743222222233335567889999999 Q ss_pred HHHhhhhhcccce------E---eeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe---------CCee Q lcl|NC_019418. 81 KKIASLVYNEQAE------I---SAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD---------GDKI 142 (527) Q Consensus 81 ~~~A~ll~~e~~~------i---~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d---------~~~~ 142 (527) +.+|++|+-.... . ..++....+.|++++++|+|......++..++.+|.+|+.+|.. .+.+ T Consensus 66 d~~a~~l~~~Gf~~~~~~~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~ 145 (488) T protein:vir:23 66 DAIAERQELEGFRIPSANGEEPESGGENDPASELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVP 145 (488) T ss_pred HHHHHhhhccceeccCCcccccccccchhHHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcc Confidence 9999877533221 1 23456678889999999999999999999999999999998863 2457 Q ss_pred EEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCcccc Q lcl|NC_019418. 143 RVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQL 222 (527) Q Consensus 143 ~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~l 222 (527) +|..++|.+++|++.+..+...+++. .++.. +.+.+++ .+.+.. +..++ ..+. + T Consensus 146 ~i~~~~p~~~~~~~d~~~~~~~~~~~-~~~~~-~~~~~~~--~~~y~~-------------~~~~~----~~~~-~---- 199 (488) T protein:vir:23 146 LIRVEPPTALYAEVDPRTRKVLYAIR-AIYGA-DGNEIVS--ATLYLP-------------DTTMT----WLRA-E---- 199 (488) T ss_pred eEEEeccceeEEEEecCCCceEEEEE-EEEec-CCCcEEE--EEEEec-------------CcEEE----EEec-C---- Confidence 89999999999997766665555443 22222 2232222 233221 11111 1111 1 Q ss_pred CceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhh-hhHHHHHHHHHHHHHHHHHHHcCcc-eee Q lcl|NC_019418. 223 GERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFD-NAKTTIDFINRTYDEFMWEIKMGQR-RVI 300 (527) Q Consensus 223 G~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~-~~~~lid~ld~~~s~~~~e~~~~~~-~i~ 300 (527) |.. .+ ....-+++++++++.|+ |+.....|+|+|+++ .+++|+|++|+++|++++.++...- ..+ T Consensus 200 ~~~-~~--------~~~~~h~~g~vPvv~f~----n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~ 266 (488) T protein:vir:23 200 GEW-EA--------PTSTPHGLEMVPVIPIS----NRTRLSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRL 266 (488) T ss_pred Cce-Ee--------ccccccCCCCcceEEec----cccccCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHH Confidence 110 00 11122566777777664 666777899999998 5799999999999999998774221 111 Q ss_pred echhHhcCCCCCCC-cccccccccccc-cceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccc Q lcl|NC_019418. 301 VPEQMTQLKVQDNQ-GNIAFKRRFDVE-QNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFT 378 (527) Q Consensus 301 v~~~~l~~~~~~~~-~~~~~~~~~d~~-~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~ 378 (527) + ++...+... ........|... ..+ +...+++...+.+++ ....+.|.+.++.++++|...+++++..|| T Consensus 267 i----~G~~~~~~~~~~~~~~~~~~~~~~~v---~~~~~g~~~~~~q~~-~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g 338 (488) T protein:vir:23 267 I----FGAKPEELGINAETGQRMFDAYMARI---LAFEGGEGAHAEQFS-AAELRNFVDALDALDRKAASYSGLPPQYLS 338 (488) T ss_pred H----hCCCcccccccccccchhhhhhhhhh---ccCCCCCCceeEecC-CCChHHHHHHHHHHHHHHhcccCCCHHHhc Confidence 1 100000000 000000111100 011 112233333444333 345789999999999999999999999998 Q ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHH Q lcl|NC_019418. 379 FDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYW 458 (527) Q Consensus 379 ~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~ 458 (527) ..+.+..||.+++..++.+.++++.+++.|+.+|+++++.++.+.... ........+.|.|.++.+.+..+.++.. T Consensus 339 ~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~----~~~~~~~~i~v~f~~~~~~s~~~~ada~ 414 (488) T protein:vir:23 339 SSSDNPASAEAIKAAESRLVKKVERKNKIFGGAWEQAMRLAYKMVKGG----DIPTEYYRMETVWRDPSTPTYAAKADAA 414 (488) T ss_pred cccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC----CcchhhccceEEecCCCCCCHHHHHHHH Confidence 877777889999999999999999999999999999999998765421 1233456799999999999999999999 Q ss_pred HHHHhcC--CCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccc-cCCCCCC-CCCCCCC-CCCCCCccccC Q lcl|NC_019418. 459 MKMVAAG--FATQKRGIAKTLGITEEEAEKELAEINGELPPESDAE-LALYGKG-QQNTVGN-SKDTVDDEDEA 527 (527) Q Consensus 459 ~~~~~aG--i~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~-~~~~~~~-~~~~~~~-~~~~~~~~~~~ 527 (527) .+++++| ++|.++++.. +|+++++. +++++++++........ ..+.+.. ++...++ +.....+++.. T Consensus 415 ~kl~~~g~~~~s~et~~~~-l~~~~d~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 486 (488) T protein:vir:23 415 AKLFANGAGLIPRERGWVD-MGYTIVER-EQMRQWLEQDQKQGLGLIGSLYGASTPEGKPGEAPVGEPPAPEPD 486 (488) T ss_pred HHHHhcccccCCHHHHHHh-CCCCchHH-HHHHHHHHHHHHHHHHHHHHHhccCCCcccCCCCCCCCCCCCCCC Confidence 9999976 7999997655 48877654 34444433221111111 1111111 1111110 11111111111 No 61 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=100.00 E-value=7.3e-44 Score=257.08 Aligned_cols=444 Identities=11% Similarity=0.074 Sum_probs=278.2 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccc-ccCcc-ccCceeecchHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTN-TDGDR-KRRKMQHLPIART 78 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~-~~~~~-~~~~~~~lnl~~~ 78 (527) +.--+.++.++++ =.....|+++..+||.|+++...... ..... ...++.++|+|+. T Consensus 4 ~t~~~~~~~l~~~---------------------~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ 62 (456) T protein:vir:79 4 STPAEWLPVLTKR---------------------IDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLM 62 (456) T ss_pred CCHHHHHHHHHHH---------------------HHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHH Confidence 1111112222211 11235578888999999988532111 11111 1234466799999 Q ss_pred HHHHHhhhhhcccceEeeCC-HHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe-CCeeEEEEEcCCceEEEE Q lcl|NC_019418. 79 AAKKIASLVYNEQAEISAED-ETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD-GDKIRVAFIQAPVFLPLQ 156 (527) Q Consensus 79 i~~~~A~ll~~e~~~i~~~d-~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d-~~~~~i~~v~a~~~~P~~ 156 (527) ||+.+|+++++++.++...+ ...++.+++++++|+|.....+++..++.+|.+|+.+|.+ .+.+++..++|.+++|++ T Consensus 63 ivd~~~~~l~~~g~~~~~~~d~~~~~~~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~~i~~~~p~~~~~i~ 142 (456) T protein:vir:79 63 VRDSVADRIIPNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSV 142 (456) T ss_pred HHHHHHhhhccCCeecCCCCCccHHHHHHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEeccceeEEEE Confidence 99999999999998876654 4567789999999999999999999999999999999997 467999999999999997 Q ss_pred EcCCc-eEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCc Q lcl|NC_019418. 157 SNTQD-VSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDL 235 (527) Q Consensus 157 ~d~~~-~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l 235 (527) .+..+ .+.+++.+ +...++...+++ .|..... ..|+.....+.. .. ...+......+ T Consensus 143 d~~~~~~~~~~~~~--~~~~d~~~~~~~--~~~~~~~------------~~~~~~~~~~~~----~~--~~~~~~~~~~~ 200 (456) T protein:vir:79 143 DPLQPWRIRSAMRW--WRDLDAESDFAI--VWSGDGW------------QKFARPCFVQSS----SR--RRLVTRISDSW 200 (456) T ss_pred cCCCCCceEEEEEE--EEecCCceeEEE--EEcCCce------------EEEEEEEEeecc----cc--ceeeeccCCce Confidence 76544 44444432 222233332222 1211000 000000001100 00 00011001111 Q ss_pred ccc-eeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcC-c-ceeeechhHhcCCCCC Q lcl|NC_019418. 236 QPV-TPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMG-Q-RRVIVPEQMTQLKVQD 312 (527) Q Consensus 236 ~~~-~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~-~-~~i~v~~~~l~~~~~~ 312 (527) .+. ...+++++|++++|.| +.|+|+|+++++++|++|++.|+.+++.+.. - .+++.....-....+. T Consensus 201 ~~~~~~~~~~~~~pvv~~~N----------~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~ 270 (456) T protein:vir:79 201 VPVGDAVVTGSPPPVVVYQN----------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDE 270 (456) T ss_pred eecccccCCCCceeEEEecC----------CCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCcccccccc Confidence 111 1224567788877632 5689999999999999999999998877641 1 1111111110001111 Q ss_pred CCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHH Q lcl|NC_019418. 313 NQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVS 392 (527) Q Consensus 313 ~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s 392 (527) .|........|.... -.+-..+++ ..+..++ +...+.|.+.++.++++|...+++++..||...++ .||.+++. T Consensus 271 ~g~~i~~~~~~~~~~---~~~~~~~~~-~~~~q~~-~~~~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~N-~Sg~Al~~ 344 (456) T protein:vir:79 271 NGNAIDYASIFEAAP---GALWELPPG-VDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHN 344 (456) T ss_pred cccccchhhhhhhhc---cccccCCCC-cceeeec-ccChHHHHHHHHHHHHHHHhhcCCChhHhcccccC-cHHHHHHH Confidence 121111111111111 011111122 2233332 23457899999999999999999999999876544 48999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHH Q lcl|NC_019418. 393 ENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRG 472 (527) Q Consensus 393 ~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~ 472 (527) .+..+.++++.+++.|+++|+++++.++.+.. ......+.|.|.+..+.+..++++..++++++|++|.+++ T Consensus 345 ~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~g--------~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~G~~~~~~~ 416 (456) T protein:vir:79 345 IEKGFLFKCEDRLSIAKIGLEAILVKALQIEG--------ESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIR 416 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--------CCccccceEEeCCCCCcCHHHHHHHHHHHHhcCCChHHHH Confidence 99999999999999999999999999876532 1344578999999999999999999999999999999987 Q ss_pred HHhcCCCCHHHH-HHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCC Q lcl|NC_019418. 473 IAKTLGITEEEA-EKELAEINGELPPESDAELALYGKGQQNTVGNSKDTV 521 (527) Q Consensus 473 i~~~~~~~deea-~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 521 (527) + ..+|++++++ ++|++|+++|........... ++++- +. T Consensus 417 ~-~~lg~~~~~i~~~e~~r~~~e~~~~~~~~~~~---~~~~~------~~ 456 (456) T protein:vir:79 417 R-NILNYNADQIKQDDLDRAREQITLFAGNPVQR---PQEDG------SR 456 (456) T ss_pred H-hcCCCCHHHHHHHHHHHHHHHHHHHhhhHhhc---CCCCC------CC Confidence 6 5669998775 467888888765433222111 11111 00 No 62 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=100.00 E-value=9e-43 Score=251.10 Aligned_cols=445 Identities=10% Similarity=0.057 Sum_probs=277.4 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc---CccccCceeecchHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD---GDRKRRKMQHLPIAR 77 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~---~~~~~~~~~~lnl~~ 77 (527) |.=-+ =..|+.++... =.....|+++..+||.|+++.. +.... ..+...++.++|+|+ T Consensus 1 ~~~~t-~~~~~~~l~~~-----------------~~~~~~r~~~l~~Yy~g~~~i~-~~~~~~~~~~~~~~~k~~~n~~~ 61 (456) T protein:vir:10 1 MTAST-PAEWLPVLTKR-----------------IDDGMSRVRLLARYSNGDAPLP-ELTRNTSAAWRSFQREARTNWGL 61 (456) T ss_pred CCCCC-HHHHHHHHHHH-----------------HHHHHHHHHHHHHHHhcCCCch-hcCcccChhhhhhhhhhhcchHH Confidence 22111 00111111100 0123567888899999998642 22111 111223567889999 Q ss_pred HHHHHHhhhhhcccceEeeCC-HHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe-CCeeEEEEEcCCceEEE Q lcl|NC_019418. 78 TAAKKIASLVYNEQAEISAED-ETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD-GDKIRVAFIQAPVFLPL 155 (527) Q Consensus 78 ~i~~~~A~ll~~e~~~i~~~d-~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d-~~~~~i~~v~a~~~~P~ 155 (527) .||+.+++++++++.++..++ ....+.+++++++|+|.....+++..|+.+|.+|..+|.+ .+.++|..++|.+++|+ T Consensus 62 ~ivd~~~~~l~~~~~~~~~~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i 141 (456) T protein:vir:10 62 MVRDSVADRIIPNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVS 141 (456) T ss_pred HHHHHHHhhhccCCeecCCCCCcchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeEEE Confidence 999999999999998876653 4456778999999999999999999999999999999986 46799999999999999 Q ss_pred EEcCCc-eEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCC Q lcl|NC_019418. 156 QSNTQD-VSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPD 234 (527) Q Consensus 156 ~~d~~~-~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~ 234 (527) +.+..+ ...+++.+. ...+....+.+ ++.... ...+++....+.... ....+.. ... T Consensus 142 ~d~~~~~~~~~~i~~~--~~~d~~~~~~~--~~~~~~------------~~~~~~~~~~~~~~~----~~~~~~~--~~~ 199 (456) T protein:vir:10 142 VDPLQPWRIRAAMRWW--RDLDAESDFAI--VWSGDG------------WQKFARPCFVQSSSR----RRLVTRI--SDS 199 (456) T ss_pred EcCCCCcceEEEEEEE--EecCCceeEEE--EEeccc------------eeEEEEEEEEeeccc----ceeeeec--CCc Confidence 766554 444444332 22233333322 221100 001111111111000 0000000 000 Q ss_pred cccce-eecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcc-eeeechhHhc--CCC Q lcl|NC_019418. 235 LQPVT-PIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQR-RVIVPEQMTQ--LKV 310 (527) Q Consensus 235 l~~~~-~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~-~i~v~~~~l~--~~~ 310 (527) ..... .-.+..+|+++++.| +.|+|+|+.+++++|++|.+.|+.++..+...- ..++. .+.. ... T Consensus 200 ~~~~~~~~~~~~~~pvv~~~N----------~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~-G~~~~~~~~ 268 (456) T protein:vir:10 200 WVPVGDAVVTGSPPPVVVYQN----------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALK-STEHGLPNV 268 (456) T ss_pred eeeccccCCCCCceeEEEecC----------CCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhh-ccCcccccc Confidence 01100 113456777776632 469999999999999999999999887764221 11111 1000 001 Q ss_pred CCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHH Q lcl|NC_019418. 311 QDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEI 390 (527) Q Consensus 311 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei 390 (527) +..+........|..... .+-..+ +...+.+++ .-..+.|.+.++.++++|...+++++..||..+++ .||.+| T Consensus 269 d~~g~~~~~~~~~~~~~~---~~~~~~-~~~~~~q~~-~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N-~Sg~Ai 342 (456) T protein:vir:10 269 DENGNAIDYASIFEAAPG---ALWELP-PGVDIWESQ-ANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGA 342 (456) T ss_pred cccccccchhhhhhhhcc---ccccCC-CCcceEEec-ccChhHHHHHHHHHHHHHHhccCCChHHhcccccC-hHHHHH Confidence 111111111111111100 011111 122344443 23467899999999999999999999999876554 489999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHH Q lcl|NC_019418. 391 VSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQK 470 (527) Q Consensus 391 ~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~ 470 (527) +.++..+.++++.+++.|+++|+++++.++.+.. .+....+.|.|.+..+.|..++++..++++++|++|.+ T Consensus 343 ~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g--------~~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~ 414 (456) T protein:vir:10 343 HNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEG--------ESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWAS 414 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--------CCcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHH Confidence 9999999999999999999999999998876532 23456789999999999999999999999999999999 Q ss_pred HHHHhcCCCCHHHHH-HHHHHHHHhcccccccccCCCCCCCCCCCCCC Q lcl|NC_019418. 471 RGIAKTLGITEEEAE-KELAEINGELPPESDAELALYGKGQQNTVGNS 517 (527) Q Consensus 471 ~~i~~~~~~~deea~-~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 517 (527) +++ .++|+++++++ +|++|+++|+..........+. +.+.. T Consensus 415 ~~~-~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~~~-----~~~~~ 456 (456) T protein:vir:10 415 IRR-NILNYNADQIKQDDLDRAREQITLFAGNPVQRPQ-----EDGSR 456 (456) T ss_pred HHH-hhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhcCC-----CCCCC Confidence 865 56799988764 6788888887644333222221 11000 No 63 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=100.00 E-value=9e-43 Score=251.10 Aligned_cols=445 Identities=10% Similarity=0.057 Sum_probs=277.4 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc---CccccCceeecchHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD---GDRKRRKMQHLPIAR 77 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~---~~~~~~~~~~lnl~~ 77 (527) |.=-+ =..|+.++... =.....|+++..+||.|+++.. +.... ..+...++.++|+|+ T Consensus 1 ~~~~t-~~~~~~~l~~~-----------------~~~~~~r~~~l~~Yy~g~~~i~-~~~~~~~~~~~~~~~k~~~n~~~ 61 (456) T protein:vir:10 1 MTAST-PAEWLPVLTKR-----------------IDDGMSRVRLLARYSNGDAPLP-ELTRNTSAAWRSFQREARTNWGL 61 (456) T ss_pred CCCCC-HHHHHHHHHHH-----------------HHHHHHHHHHHHHHHhcCCCch-hcCcccChhhhhhhhhhhcchHH Confidence 22111 00111111100 0123567888899999998642 22111 111223567889999 Q ss_pred HHHHHHhhhhhcccceEeeCC-HHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe-CCeeEEEEEcCCceEEE Q lcl|NC_019418. 78 TAAKKIASLVYNEQAEISAED-ETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD-GDKIRVAFIQAPVFLPL 155 (527) Q Consensus 78 ~i~~~~A~ll~~e~~~i~~~d-~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d-~~~~~i~~v~a~~~~P~ 155 (527) .||+.+++++++++.++..++ ....+.+++++++|+|.....+++..|+.+|.+|..+|.+ .+.++|..++|.+++|+ T Consensus 62 ~ivd~~~~~l~~~~~~~~~~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i 141 (456) T protein:vir:10 62 MVRDSVADRIIPNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVS 141 (456) T ss_pred HHHHHHHhhhccCCeecCCCCCcchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeEEE Confidence 999999999999998876653 4456778999999999999999999999999999999986 46799999999999999 Q ss_pred EEcCCc-eEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCC Q lcl|NC_019418. 156 QSNTQD-VSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPD 234 (527) Q Consensus 156 ~~d~~~-~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~ 234 (527) +.+..+ ...+++.+. ...+....+.+ ++.... ...+++....+.... ....+.. ... T Consensus 142 ~d~~~~~~~~~~i~~~--~~~d~~~~~~~--~~~~~~------------~~~~~~~~~~~~~~~----~~~~~~~--~~~ 199 (456) T protein:vir:10 142 VDPLQPWRIRAAMRWW--RDLDAESDFAI--VWSGDG------------WQKFARPCFVQSSSR----RRLVTRI--SDS 199 (456) T ss_pred EcCCCCcceEEEEEEE--EecCCceeEEE--EEeccc------------eeEEEEEEEEeeccc----ceeeeec--CCc Confidence 766554 444444332 22233333322 221100 001111111111000 0000000 000 Q ss_pred cccce-eecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcc-eeeechhHhc--CCC Q lcl|NC_019418. 235 LQPVT-PIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQR-RVIVPEQMTQ--LKV 310 (527) Q Consensus 235 l~~~~-~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~-~i~v~~~~l~--~~~ 310 (527) ..... .-.+..+|+++++.| +.|+|+|+.+++++|++|.+.|+.++..+...- ..++. .+.. ... T Consensus 200 ~~~~~~~~~~~~~~pvv~~~N----------~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~-G~~~~~~~~ 268 (456) T protein:vir:10 200 WVPVGDAVVTGSPPPVVVYQN----------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALK-STEHGLPNV 268 (456) T ss_pred eeeccccCCCCCceeEEEecC----------CCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhh-ccCcccccc Confidence 01100 113456777776632 469999999999999999999999887764221 11111 1000 001 Q ss_pred CCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHH Q lcl|NC_019418. 311 QDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEI 390 (527) Q Consensus 311 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei 390 (527) +..+........|..... .+-..+ +...+.+++ .-..+.|.+.++.++++|...+++++..||..+++ .||.+| T Consensus 269 d~~g~~~~~~~~~~~~~~---~~~~~~-~~~~~~q~~-~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N-~Sg~Ai 342 (456) T protein:vir:10 269 DENGNAIDYASIFEAAPG---ALWELP-PGVDIWESQ-ANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGA 342 (456) T ss_pred cccccccchhhhhhhhcc---ccccCC-CCcceEEec-ccChhHHHHHHHHHHHHHHhccCCChHHhcccccC-hHHHHH Confidence 111111111111111100 011111 122344443 23467899999999999999999999999876554 489999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHH Q lcl|NC_019418. 391 VSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQK 470 (527) Q Consensus 391 ~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~ 470 (527) +.++..+.++++.+++.|+++|+++++.++.+.. .+....+.|.|.+..+.|..++++..++++++|++|.+ T Consensus 343 ~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g--------~~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~ 414 (456) T protein:vir:10 343 HNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEG--------ESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWAS 414 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--------CCcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHH Confidence 9999999999999999999999999998876532 23456789999999999999999999999999999999 Q ss_pred HHHHhcCCCCHHHHH-HHHHHHHHhcccccccccCCCCCCCCCCCCCC Q lcl|NC_019418. 471 RGIAKTLGITEEEAE-KELAEINGELPPESDAELALYGKGQQNTVGNS 517 (527) Q Consensus 471 ~~i~~~~~~~deea~-~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 517 (527) +++ .++|+++++++ +|++|+++|+..........+. +.+.. T Consensus 415 ~~~-~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~~~-----~~~~~ 456 (456) T protein:vir:10 415 IRR-NILNYNADQIKQDDLDRAREQITLFAGNPVQRPQ-----EDGSR 456 (456) T ss_pred HHH-hhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhcCC-----CCCCC Confidence 865 56799988764 6788888887644333222221 11000 No 64 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=100.00 E-value=1.3e-41 Score=244.83 Aligned_cols=425 Identities=10% Similarity=0.019 Sum_probs=259.8 Q ss_pred ccccccccCccccCceeecchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEE Q lcl|NC_019418. 55 DIEYTNTDGDRKRRKMQHLPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMR 134 (527) Q Consensus 55 ~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~ 134 (527) .+......--+...++.++|+|+.||+.+++++... .|++.|...++.+++++++|+|.....+++..|+.+|.+|+. T Consensus 1 ~l~~~~~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~--gf~~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~ 78 (434) T protein:vir:98 1 MLPKNAEQAFLDFQRKARTNFCGLIANASVHRLLAL--GVTGPDGEPDTRASRWWQANRLDSRQKLVWRMAMAQSAGYML 78 (434) T ss_pred CCCCCccHHHHHhhhhhhccchHHHHHHHHhhhccC--ceecCCCchHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEE Confidence 111111100011123346799999999999988654 577888888899999999999999999999999999999999 Q ss_pred EEEeCC--------eeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCc Q lcl|NC_019418. 135 PYVDGD--------KIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSL 206 (527) Q Consensus 135 ~~~d~~--------~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~ 206 (527) +|.+++ .+.|.+++|.++++++.++.++..+++..... ...+.. +..+.++. .. T Consensus 79 v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~--~~~~~~-~~~~~~~~---------------~~ 140 (434) T protein:vir:98 79 VGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHN--DIDGFG-YARVFFDD---------------TS 140 (434) T ss_pred EecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEe--ccCCce-EEEEEEeC---------------cE Confidence 998642 46799999999999987776776666643322 222221 11111111 00 Q ss_pred eEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHH Q lcl|NC_019418. 207 YRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYD 286 (527) Q Consensus 207 ~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s 286 (527) +. . . ++. ....-+...+...++....+....+++++++++.|+ ||...+. .|+|+|+.+++++|++|+++| T Consensus 141 ~~-~-~-~~~-~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~----N~~~~~~-~g~sd~e~vi~liDa~~~~~s 211 (434) T protein:vir:98 141 FP-Y-R-TRE-RTGARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFA----RMPDLGE-DPEPEFAGVLDIQDRVNLGIL 211 (434) T ss_pred EE-E-E-Eee-ccccccccccccceecccccccccCCCCccceEEec----cCCCcCc-CCcchhhhHHHHHHHHHHHHH Confidence 00 0 0 010 000000000001112222233334567777777664 5544333 699999999999999999999 Q ss_pred HHHHHHHc-CcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHH Q lcl|NC_019418. 287 EFMWEIKM-GQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKL 365 (527) Q Consensus 287 ~~~~e~~~-~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~ 365 (527) +.++..+. +-+..++.-.-+....+.+++.......+.... -.+-..+++...+.+++ ....+.|.+.++.++++ T Consensus 212 ~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~---~~i~~~~~~~~~~~q~~-~~~~~~~~~~l~~~i~~ 287 (434) T protein:vir:98 212 NRMAASRFSGFRQKWIKGHKFAKRTDPATGMTVVDQPFVPSP---SAVWASEGENTQFGQLD-ATDLSGFLKEHASDVRD 287 (434) T ss_pred HHHHHHHHhcchhhhhcCCCcccccccccccchhhhhhhccc---cccccCCCCCceEEEec-CcchHHHHHHHHHHHHH Confidence 99998874 333333321111111122221111111111100 01111222223333332 34568899999999999 Q ss_pred HHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCC Q lcl|NC_019418. 366 FEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDD 445 (527) Q Consensus 366 i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d 445 (527) |+..+++++..||. ..+..||.+++..+..+.+++.++++.|+++|+++++.++.+. +......++.|.|.+ T Consensus 288 ~~~~~~~p~~~~~~-~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~-------g~~~~~~~~~v~w~~ 359 (434) T protein:vir:98 288 MLTISQTPTYLYAT-DLVNISADTIGALDILHVAKVREHIASFSEGLESVLALAAAQA-------GVPEDYTEAEVRWAN 359 (434) T ss_pred HhcccCCCHHHhcc-ccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-------CCChhheeeeEEecC Confidence 99999999999985 3456789999999999999999999999999999999887553 233456679999999 Q ss_pred CccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccc Q lcl|NC_019418. 446 GVFTDRHAELDYWMKMVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDED 525 (527) Q Consensus 446 ~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (527) ..+.+..++++...+++++|+ |.++ +..+.|++++|.+++.++..++........ ... ++.+.++..+.+ +.= T Consensus 360 ~~~~s~~~~ada~~kl~~~g~-~~e~-~~~~lg~~~~e~~r~~~e~~~~~~~~~~~~-~~~---~~~~~g~~~~~~-~~~ 432 (434) T protein:vir:98 360 PAHVTMAVKADAATKLKSIGY-PLDV-IAEELDESPARVRRIVAGAASQALLAASLL-PAP---GAPSAGNVPDSG-GAV 432 (434) T ss_pred CCCCCHHHHHHHHHHHHhcCC-cHHH-HHHhCCCCHHHHHHHHHHHHHHHHHHHhhh-ccC---CCCCCCCCCccc-CCC Confidence 999999999999999998885 6665 567779998877665555433221111111 010 111111111111 101 Q ss_pred cC Q lcl|NC_019418. 526 EA 527 (527) Q Consensus 526 ~~ 527 (527) ++ T Consensus 433 dg 434 (434) T protein:vir:98 433 DG 434 (434) T ss_pred CC Confidence 11 No 65 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=100.00 E-value=4.9e-41 Score=241.59 Aligned_cols=458 Identities=12% Similarity=0.080 Sum_probs=277.1 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHH--------------HHHHHHHHHHHhcCCCccccccccc-Ccc Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQS--------------EFRRIQHNLAYYQSKFDDIEYTNTD-GDR 65 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~--------------~~~~i~~~~~~y~g~~~~l~~~~~~-~~~ 65 (527) |. .+ .++...++.. -..++++ ...|+.+..+||.|+++. .+.... .+. T Consensus 1 ~~--------------~~-~~~~~~~~~~-~~~l~~~e~~~i~~L~~~~~~~~~r~~~l~~YY~G~~~i-~~~~~~~p~~ 63 (504) T protein:vir:99 1 MT--------------EE-TTSASKFTFR-IPELNDDVVDKVNGLYQQLVDRTPRNLLRASFYDGKYAI-RQIGNLIPPE 63 (504) T ss_pred CC--------------cc-CCcccccccc-cCCCCHHHHHHHHHHHHHHHHHhHHHHHHHHHHhccccc-hhccccccHH Confidence 00 00 0010000000 0011111 234677778899999873 332221 222 Q ss_pred ccCceeecchHHHHHHHHhhhhhcccceEeeC-CHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCC---e Q lcl|NC_019418. 66 KRRKMQHLPIARTAAKKIASLVYNEQAEISAE-DETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGD---K 141 (527) Q Consensus 66 ~~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~-d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~---~ 141 (527) .++.+.++|+|+.||+.+|+.+.-+. |.++ ++..++.|+++++.|+|.....+++..|+.+|.+|+.+|-+++ . T Consensus 64 ~~~~~~v~n~~~~iVd~~a~rl~~~G--f~~~d~~~~~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~ 141 (504) T protein:vir:99 64 YLRTATVLGWSAKAVDTLARRCNLES--FVWPDGDYGSIGGPDVWDENFFATKANNAMVSSLIHGPAFLINTEGGAGEPD 141 (504) T ss_pred HHHHhhccCcHHHHHHHHHhhhccce--eeCCCCChhhHHHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecCCCCCce Confidence 22445678999999999999887665 4553 4455677999999999999999999999999999999998643 4 Q ss_pred eEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccc Q lcl|NC_019418. 142 IRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQ 221 (527) Q Consensus 142 ~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~ 221 (527) +.|++++|.+++.++.+..+...+++.+... ..++ .++..+.|. ++.|.+ |+... T Consensus 142 ~~I~~~sP~~~~~iyD~~~~~~~~a~~~~~~--d~~g--~~~~~~~y~----------------~~~~~~--~~~~~--- 196 (504) T protein:vir:99 142 SLIHVKSAMQATGEWNSRRNAMDSLLSITSR--DAEG--HPTGIALYE----------------DGVTVT--ADMDD--- 196 (504) T ss_pred eEEEEeccceeEEEEeCCCCceeEEEEEEEe--cCCC--eEEEEEEEc----------------CCcEEE--EEEcC--- Confidence 6789999999999987777777777654332 2222 233344442 222211 11110 Q ss_pred cCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhh-hhHHHHHHHHHHHHHHHHHHHc-Cccee Q lcl|NC_019418. 222 LGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFD-NAKTTIDFINRTYDEFMWEIKM-GQRRV 299 (527) Q Consensus 222 lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~-~~~~lid~ld~~~s~~~~e~~~-~~~~i 299 (527) -|... . + ..-++...| ++. ..|+.....|+|+|.+. .+++++|++|+++++.++..+. +-+.. T Consensus 197 ~~~~~--~----~----~~~~~~gvP-vV~----~~n~~~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r 261 (504) T protein:vir:99 197 DGDWH--A----D----VRTHKLGVP-VEV----LPYKPREDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQL 261 (504) T ss_pred Cceee--e----c----cccCCCCcc-eEE----ecccccCccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhh Confidence 11110 0 0 001122223 332 35777788899999986 8899999999999999987764 22111 Q ss_pred eechhHhcCCCC---CCCccccccccccc--ccceeeeccCC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhc Q lcl|NC_019418. 300 IVPEQMTQLKVQ---DNQGNIAFKRRFDV--EQNVYMQVGAG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQI 370 (527) Q Consensus 300 ~v~~~~l~~~~~---~~~~~~~~~~~~d~--~~~~~~~~~~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~ 370 (527) ++ ++.... ...++.. ..|.. .+-.+..-+.. .+....+..++- -..+.|.+.++.++++|+..+ T Consensus 262 ~i----~G~~~~~~~~~d~~~~--~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~~~-~~l~~~~~~l~~~i~~~a~~t 334 (504) T protein:vir:99 262 IL----LGADAKNFRNKDGSMK--PAWQIALARVFALPDDEDEPDAARARADVKQFPA-SSPQPHIEMLEQIAMMFSGET 334 (504) T ss_pred hh----ccCCcccccccccccc--chhhhhhhhhhcCCCccccccccCccceeeecCC-CChHHHHHHHHHHHHHHHhhh Confidence 11 111100 0111110 11111 11111110000 011122333322 235689999999999999999 Q ss_pred CCCcccccccc-cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccC Q lcl|NC_019418. 371 GVSSGMFTFDG-QGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFT 449 (527) Q Consensus 371 g~s~~~~~~~~-~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~ 449 (527) +++++.||+.+ .+..||.+|++....+..++.++++.|..+|+++++.++.+... .+.....+..+.|.|.|..+. T Consensus 335 ~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~---~~~~~~~~~~~~v~w~d~~~~ 411 (504) T protein:vir:99 335 SIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSPAFRRSMIRALAIKNG---LDRIPPEWKTIDSKFRSPLYL 411 (504) T ss_pred CCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC---CCccccccccceeEecCCCcc Confidence 99999999765 47789999999999999999999999999999999998877542 223455667899999999999 Q ss_pred CHHHHHHHHHHHHhcCC--CCHHHHHHhcCCCCHHHHHHHHHHHHHhccccccccc-------CCCCCCCCCCCCCCCCC Q lcl|NC_019418. 450 DRHAELDYWMKMVAAGF--ATQKRGIAKTLGITEEEAEKELAEINGELPPESDAEL-------ALYGKGQQNTVGNSKDT 520 (527) Q Consensus 450 d~~~~~~~~~~~~~aGi--~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~-------~~~~~~~~~~~~~~~~~ 520 (527) +..+.++...+++++|. ++..+++..+.|++++|++++.++.+++++....+.+ .-.+...+.+..++-.+ T Consensus 412 s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~e~a~~ 491 (504) T protein:vir:99 412 SKAAQADAGAKMLGAGPEWLKETEVGLELLGLTPQQAKRALAERRRASSVSIIEALNRRQQEAATAGEDQDQGAGEPPAN 491 (504) T ss_pred CHHHHHHHHHHHHhhccccccchHHHHhhcCCCHHHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCCCCCCcCCCCCCCC Confidence 99999999999999985 4556667777799999887766655544432211110 01111111111111111 Q ss_pred CCccccC Q lcl|NC_019418. 521 VDDEDEA 527 (527) Q Consensus 521 ~~~~~~~ 527 (527) ..++-.+ T Consensus 492 ~~~~~~~ 498 (504) T protein:vir:99 492 EPPAALG 498 (504) T ss_pred CCCccCC Confidence 1111111 No 66 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=100.00 E-value=8.8e-36 Score=212.78 Aligned_cols=439 Identities=11% Similarity=0.033 Sum_probs=271.7 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc-CccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD-GDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~-~~~~~~~~~~lnl~~~i 79 (527) +.+++.|.+.+. ....++.+..+||.|+++. .+.... .+..+..+..+|+|+++ T Consensus 17 ~~~~~~L~~~~~------------------------~~~~~~~~~~~Yy~G~~~~-~~~~~~~p~~~r~~~~v~nw~~~~ 71 (474) T protein:vir:81 17 NALINGLLAQIE------------------------NLRWKNLLRTSYYENKRTI-QYVGTLIPPQYFNLGLVLGWTGKA 71 (474) T ss_pred HHHHHHHHHHHH------------------------HHhhHHHHHHHHhccCCCh-hhccccccHHHHHHHhhcChHHHH Confidence 222221111111 1234667778999999873 333221 11112224578999999 Q ss_pred HHHHhhhhhcccceEeeCCH-HHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC---CeeEEEEEcCCceEEE Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAEDE-TLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG---DKIRVAFIQAPVFLPL 155 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~d~-~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~---~~~~i~~v~a~~~~P~ 155 (527) |+.+|+.+.-+. |.+++. ..+..++++++.|+|......++..|+.+|.+|+.++.++ +.+.|..++|.+++.+ T Consensus 72 Vd~~a~rl~~~G--f~~~d~~~~~~~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~~~~~~ 149 (474) T protein:vir:81 72 VDALARRCNLEG--FVWPDGDLDSLGGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDASEATGE 149 (474) T ss_pred HHHHHhhhcccc--eECCCCCccchHHHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceEEEE Confidence 999999886654 445443 3455688999999999999999999999999999999853 3488999999999999 Q ss_pred EEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCc Q lcl|NC_019418. 156 QSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDL 235 (527) Q Consensus 156 ~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l 235 (527) +.+..+...+++..... +.++. .+....|. ++.+. .+++. . .+.. |. T Consensus 150 ~D~~~~~~~~al~~~~~--~~~g~--~~~~~ly~----------------~~~~~-~~~~~---~-~~~~------w~-- 196 (474) T protein:vir:81 150 WNRRRRGLNNLLSIIDK--DKEGK--VLSLALYL----------------DNETV-TAQRD---K-ATLK------WQ-- 196 (474) T ss_pred EeCCCCcceeeeEEEEE--cCCCc--EEEEEEEe----------------CCcEE-EEEEc---C-ccce------ee-- Confidence 87777777766643322 22221 11111111 11111 11111 1 0100 10 Q ss_pred ccceeecCCCcccEEEecCCccccccCCCccCcchh-hhhHHHHHHHHHHHHHHHHHHHc--CcceeeechhHhc-CCCC Q lcl|NC_019418. 236 QPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIF-DNAKTTIDFINRTYDEFMWEIKM--GQRRVIVPEQMTQ-LKVQ 311 (527) Q Consensus 236 ~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~-~~~~~lid~ld~~~s~~~~e~~~--~~~~i~v~~~~l~-~~~~ 311 (527) .....+++..| ++ +..|+.+...|+|+|.+ ..+++++|++|++.++.....+. -+.|++.-.+.-. ...+ T Consensus 197 -~~~~~~~~gvP-vV----~~~n~~~~~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~~~~~~d~d 270 (474) T protein:vir:81 197 -VDRDEHVYGVP-AQ----VLPYKPAPKRPFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLLGADESALKNAD 270 (474) T ss_pred -eccCCCCCCcc-eE----EecccccccCcCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeecCChhhccccc Confidence 00111233333 22 34577788899999998 59999999999999999876653 3333332111000 0011 Q ss_pred CCCcccccccccccc-cceeeeccCC-CCCC-----CcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccc-cc Q lcl|NC_019418. 312 DNQGNIAFKRRFDVE-QNVYMQVGAG-NMDS-----GGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDG-QG 383 (527) Q Consensus 312 ~~~~~~~~~~~~d~~-~~~~~~~~~~-~~~~-----~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~-~g 383 (527) ++. ...|+.. .++. .+..+ +++. ..+..++ .-..+.|.+.+..++++|+..+|++++.||+.+ ++ T Consensus 271 ~~~-----~~~~~~~~~~i~-~~~~d~d~~~~~~~~~~~~q~~-~a~l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~n 343 (474) T protein:vir:81 271 GTI-----KSVWEARLGRIK-GLPDDADADIPQLARADVKQFP-AASPDAHWSDINGLAKLFAREASLPDTAVAISGLSN 343 (474) T ss_pred ccc-----cchhhhhHHHHh-cCCCcccccccccccccccccC-CCChhHHHHHHHHHHHHHHhhhCCCHHHhccccccc Confidence 111 1122210 0111 11111 1111 1223332 234578999999999999999999999999765 67 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHh Q lcl|NC_019418. 384 VKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVA 463 (527) Q Consensus 384 ~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~ 463 (527) ..||.+|++.+..+..++.++++.|..+|+++++.++.+..... .......+..+.+.|.|.-..+..+.++...++++ T Consensus 344 p~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~-~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~ 422 (474) T protein:vir:81 344 PTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVA-IDEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQLA 422 (474) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC-ccccchhhccceeEecCCCccCHHHHHHHHHHHHh Confidence 78999999999999999999999999999999999887753211 11223456789999999999999999999999999 Q ss_pred cCC-CCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCC-CCCCCCCC Q lcl|NC_019418. 464 AGF-ATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELAL-YGKGQQNT 513 (527) Q Consensus 464 aGi-~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~-~~~~~~~~ 513 (527) +|. +..++.+.++.|+|++|++++..+.+++++...-+.... .++++-.. T Consensus 423 a~~~~~~~~~~~~~lg~t~~~i~~~~~~~~~~~~~~~~~~l~~~~~~~~~aq 474 (474) T protein:vir:81 423 AVPWLAETEVGLELIGLTPQQARRAMADKRRVQGRGTLQALIDRSNNGATAQ 474 (474) T ss_pred cccCCCcHHHHHhhcCCCHHHHHHHHHHHHHHhHHHHHHHHHhcCCCCCCCC Confidence 984 444454667789999988776655555444322221111 11111000 No 67 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=100.00 E-value=6.4e-35 Score=208.06 Aligned_cols=397 Identities=13% Similarity=0.134 Sum_probs=265.3 Q ss_pred cccCHHHHHHHHHHHHHhcCCCccccccccc--CccccCceeecchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHH Q lcl|NC_019418. 32 VAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD--GDRKRRKMQHLPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDML 109 (527) Q Consensus 32 i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~--~~~~~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l 109 (527) +++ ...|+.+..+||.|+++. .+.... ...+.+.+..+|+|+.+|+.+|+.+.-+. |+.+|.. +++++ T Consensus 1 l~~---~~~r~~~~~~yY~g~~~~-~~~~~~~p~~~~~~~~~v~nw~~~~Vds~a~rl~~~G--f~~~d~~----l~~i~ 70 (410) T protein:vir:95 1 MNL---YQSRVNLRYKHYAMQHYE-APTGITIPAHIRAKYQAVLGWAAKGVDSLADRLIFRA--FANDDFN----VTEIF 70 (410) T ss_pred CCc---chhhHHHHHHHhcCCCCc-cccchhccHHHHhHHHhhcchhHHHHHHhHhhhcccc--ccCCCch----HHHHH Confidence 222 357888899999999864 222221 11222334667999999999999776543 5566654 56777 Q ss_pred hhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEE Q lcl|NC_019418. 110 SNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFH 188 (527) Q Consensus 110 ~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h 188 (527) +.|+|.....+++..|+.+|.+|+.+|-++ +++.|.+++|.+++.++.+..+...+++.. +..+.++. .+...++ T Consensus 71 ~~N~ld~~~~~~~~~al~~G~sf~~v~~~~d~~~~i~~~sP~~~~~i~Dp~~~~~~~al~~--~~~~~~~~--~~~~~~~ 146 (410) T protein:vir:95 71 DRNNPDIFFDSAILSALIGSCSFVYISKGEDDEVRLQVIESSNATGVIDPITGLLVEGYAV--LARDDYNR--PTLEAYF 146 (410) T ss_pred hhcChHHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEEEeCCCCceEEEEEE--EEecCCCe--EEEEEEE Confidence 889999999999999999999999998864 679999999999999987777777766542 22222221 1222221 Q ss_pred eecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCc Q lcl|NC_019418. 189 EWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGL 268 (527) Q Consensus 189 ~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~ 268 (527) . .+..++ |.. + |. .| ...+++++++++.| +|+.+...|+|+ T Consensus 147 ~-------------~~~~~~-----~~~---~--~~------~~------~~~~~~g~vPvV~f----~n~~~l~~~~G~ 187 (410) T protein:vir:95 147 E-------------PNATHF-----IPK---D--GE------PY------SVTNETGIPLLVPV----IHRPDAVRPFGR 187 (410) T ss_pred e-------------CCcEEE-----Eee---C--Cc------cc------cccCCCCCcceEEe----cccccCCccCCc Confidence 1 111111 111 1 10 01 11245667777666 466777889999 Q ss_pred chh-hhhHHHHHHHHHHHHHHHHHHHc--CcceeeechhHhcCCCCCCCcccccccccccc-cceeeeccC-CCCCCCcc Q lcl|NC_019418. 269 SIF-DNAKTTIDFINRTYDEFMWEIKM--GQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVE-QNVYMQVGA-GNMDSGGI 343 (527) Q Consensus 269 S~~-~~~~~lid~ld~~~s~~~~e~~~--~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~-~~~~~~~~~-~~~~~~~i 343 (527) |.+ ..+++++|++|++.++.+...+. -..+.+. +.+.++.+. ..|+.. .++.. +.. .+++...+ T Consensus 188 s~I~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-----G~d~d~~~~-----~~~~~~~~~i~~-~~~~~~~~~~~v 256 (410) T protein:vir:95 188 SRITRAGMYYQKYAKRTLERADITAEFYSWPQKYIL-----GLDPDAEPM-----EKWKATVSSLLT-ISSSDKGVKPSV 256 (410) T ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHhcchhheee-----ccCCCCCcC-----chhhhhhhhhee-ccCCCCCCcceE Confidence 988 57999999999999999876654 2323222 112222211 112211 11111 111 12223344 Q ss_pred eEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019418. 344 VDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELG 423 (527) Q Consensus 344 ~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~ 423 (527) .+++. -..+.|.+.+..++++|+..++++++.||..+....||.+|.+.++.+..++.++++.|..+|+++++.++.+. T Consensus 257 ~q~~~-~~l~~~~~~l~~l~~~~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~ 335 (410) T protein:vir:95 257 GQFTT-ASMSPFTEQLRTAAAGFAGEMGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLR 335 (410) T ss_pred EecCC-CChHHHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 44433 23468999999999999999999999999988877889999999999999999999999999999999988775 Q ss_pred hhhcccCCcccCccceEEEeC---CCccCCHHHHHHHHHHHHhc--CCCCHHHHHHhcCCCCHHHHHHHHHHHHHhccc Q lcl|NC_019418. 424 KVVGIYRGTIPELDDISVNLD---DGVFTDRHAELDYWMKMVAA--GFATQKRGIAKTLGITEEEAEKELAEINGELPP 497 (527) Q Consensus 424 ~~~~~~~~~~~~~~~v~v~f~---d~i~~d~~~~~~~~~~~~~a--Gi~s~~~~i~~~~~~~deea~~el~ri~~E~~~ 497 (527) .. .........++.|.|. |.-..+..+.++...+++++ |+++.++++..+ |+++++..+.+.+-++.++. T Consensus 336 ~~---~~~~~~~~~~~~v~W~p~~d~~~~s~a~~aDa~~Kl~~a~~g~~~~~~~~~~l-g~~~~~~~~~~~~e~~~~g~ 410 (410) T protein:vir:95 336 DE---FRYTRSQFVRTAVKWEPLFEADANTMTMIGDGVVKLNQALPGYINAETIRDLT-GIAGDMSAKPVVSEGGSNGE 410 (410) T ss_pred cC---CCCcccccceeeEEeeecCCcchhhHHHHHHHHHHHHHhccCCccHHHHHHhc-CCChHHHHHHHHHHHHhCCC Confidence 32 1223456678899998 66666778888999999998 788999966555 99877543332222221111 No 68 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=100.00 E-value=4.2e-34 Score=203.59 Aligned_cols=406 Identities=11% Similarity=0.095 Sum_probs=265.1 Q ss_pred CCh--HHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc--CccccCceeecchH Q lcl|NC_019418. 1 MSL--IQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD--GDRKRRKMQHLPIA 76 (527) Q Consensus 1 m~~--~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~--~~~~~~~~~~lnl~ 76 (527) |.- ++.|.+.+. ....|+.+..+||.|+++.. +.... ...+...+..+|+| T Consensus 1 m~~~~i~~L~~~~~------------------------~~~~r~~~~~~yy~g~~~~~-~~~~~~p~~~~~~~~~v~nw~ 55 (422) T protein:vir:97 1 MNYMGMGYLRRKLA------------------------LFKTGVDKRYRYYAMDDRDD-TRSIVMPNNVREMYRSVLEWT 55 (422) T ss_pred CChHHHHHHHHHHH------------------------HHHHHHHHHHHHHhcCCChh-hcCccccHHHHHHHHhhcchh Confidence 221 111111111 13457888899999988642 22221 11222333456999 Q ss_pred HHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC--CeeEEEEEcCCceEE Q lcl|NC_019418. 77 RTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG--DKIRVAFIQAPVFLP 154 (527) Q Consensus 77 ~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~--~~~~i~~v~a~~~~P 154 (527) +.+|+.+|+.++-+. |+++|.. ++++++.|+|.....+++..|+.+|.+|+.+|.+. +.+.|.+++|.+++. T Consensus 56 ~~~Vd~~a~rl~~~G--f~~~d~~----l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p~i~~~sp~~~~~ 129 (422) T protein:vir:97 56 AKGVDSLADRIIFRE--FTNDDFN----AWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAEDGLPKMQVIEASKATG 129 (422) T ss_pred HHHHHHHHhccccce--eeCCchh----HHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCCCCeeEEEEechhhEEE Confidence 999999999665443 5666654 45677789999999999999999999999999863 678999999999999 Q ss_pred EEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCC Q lcl|NC_019418. 155 LQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPD 234 (527) Q Consensus 155 ~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~ 234 (527) ++.+..+++.+++.+.. .+..+....+ .+.. ...+. .++. + |.. T Consensus 130 i~D~~~~~~~~a~~~~~--~~~~~~~~~~--~~~~----------------~~~~~--~~~~---~--~~~--------- 173 (422) T protein:vir:97 130 ILDPTTFLLTEGYAILE--SDSNGNPTLE--AYFT----------------DKDIW--YYPK---K--GKP--------- 173 (422) T ss_pred EEeCCCCcceeeEEEEE--ecCCCcEEEE--EEEc----------------CceEE--EEcC---C--Ccc--------- Confidence 98777777776665432 2222222111 1110 11111 1111 1 110 Q ss_pred cccceeecCCCcccEEEecCCccccccCCCccCcchh-hhhHHHHHHHHHHHHHHHHHHHc--CcceeeechhHhcCCCC Q lcl|NC_019418. 235 LQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIF-DNAKTTIDFINRTYDEFMWEIKM--GQRRVIVPEQMTQLKVQ 311 (527) Q Consensus 235 l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~-~~~~~lid~ld~~~s~~~~e~~~--~~~~i~v~~~~l~~~~~ 311 (527) ...-++.++|+++.| .|+.....|+|+|.+ ..+++++|++|++.++.....+. -..+.+ ++...+ T Consensus 174 ---~~~~~~~g~vPvv~~----~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i-----~G~d~d 241 (422) T protein:vir:97 174 ---YNIKNPTGHPLLVPI----IHRPDAVRPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYV-----LGMDPD 241 (422) T ss_pred ---ccccCCCCCcceEEe----cccCCCccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhh-----cccCcc Confidence 011245566666655 466677889999998 57999999999999999887664 222222 121222 Q ss_pred CCCcccccccccccc-cceeeeccC-CCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHH Q lcl|NC_019418. 312 DNQGNIAFKRRFDVE-QNVYMQVGA-GNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATE 389 (527) Q Consensus 312 ~~~~~~~~~~~~d~~-~~~~~~~~~-~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAte 389 (527) +... ..|... .++.. +.. .+++...+.+++. -..+.|.+.++.+++++...++++++.||..+.+..||.+ T Consensus 242 ~~~~-----~~~~~~~~~i~~-~~~de~~~~~~v~q~~~-~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~A 314 (422) T protein:vir:97 242 AKPM-----EKWRATVSTLLE-ISKDEDGDKPTVGQFTT-ASMAPFMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVES 314 (422) T ss_pred cccC-----chhhhhhhhhhc-cCCCCCCCcceeeecCC-CChhHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHH Confidence 2111 112211 12211 211 2223334443432 2246899999999999999999999999998887788999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCC---HHHHHHHHHHHHhc-- Q lcl|NC_019418. 390 IVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTD---RHAELDYWMKMVAA-- 464 (527) Q Consensus 390 i~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d---~~~~~~~~~~~~~a-- 464 (527) |+++...+..++.++++.|..+|+++++.++.+... .......+.++.+.|....+.| ..+.++...+++++ T Consensus 315 i~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~---~~~~~~~~~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~ 391 (422) T protein:vir:97 315 IKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDE---FPYLRNQFMDTVIKWEPLFEADANMLTLVGDGAIKLNQAIP 391 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC---CcccchhhccceEEEccCCCCChHHHHHHHHHHHHHHhhcc Confidence 999999999999999999999999999998876532 1223345677999999666666 34556677888888 Q ss_pred CCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccc Q lcl|NC_019418. 465 GFATQKRGIAKTLGITEEEAEKELAEINGELPPE 498 (527) Q Consensus 465 Gi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~ 498 (527) |+++.++.+..+ |+++. +.+.+++.++++.. T Consensus 392 ~~~~~~~~~~~l-g~~~~--~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 392 GFMDADVIRDLT-GVKGA--DKPIPAITEVTTDG 422 (422) T ss_pred ccccHHHHHHHc-CCCch--hHHHHHHHhhhccC Confidence 789999876555 88653 45666766654322 No 69 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=100.00 E-value=3.1e-34 Score=204.31 Aligned_cols=395 Identities=12% Similarity=0.072 Sum_probs=257.5 Q ss_pred CC--hHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccC--ccccCceeecchH Q lcl|NC_019418. 1 MS--LIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDG--DRKRRKMQHLPIA 76 (527) Q Consensus 1 m~--~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~--~~~~~~~~~lnl~ 76 (527) |. ++.+|.+.+. ....|+.+..+||.|+++. .+....- ....+.+..+|+| T Consensus 1 ~~~~~i~~L~~~~~------------------------~~~~r~~~~~~yY~g~~~~-~~~~~~~p~~~~~~~~~v~nw~ 55 (409) T protein:vir:94 1 MTEKGIGYLRFKLS------------------------VHKRRAEMRYDQYAMKYVD-RFKGITIPQALSQQYRSILGWC 55 (409) T ss_pred CCHHHHHHHHHHHH------------------------HHhHHHHHHHHHhcccCch-hhcChhhhHHHHHHHhhhcchh Confidence 21 2222222111 1345788888999999763 3322211 1122334667999 Q ss_pred HHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe-CCeeEEEEEcCCceEEE Q lcl|NC_019418. 77 RTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD-GDKIRVAFIQAPVFLPL 155 (527) Q Consensus 77 ~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d-~~~~~i~~v~a~~~~P~ 155 (527) +.||+.+|+.+.-+. |+.+|.. ++++++.|+|.....+++..|+.+|.+|+.+|-+ .++++|.+++|.+++-+ T Consensus 56 ~~iVds~a~rl~~~G--f~~~d~~----l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~~i~~~sp~~~~~i 129 (409) T protein:vir:94 56 AKGVDSLADRLVFRE--FENDDFT----VNEIFEENNPDIFFDSAVLSSLIASCSFTYISKGENDAVRLQVIEAVNATGI 129 (409) T ss_pred HHHHHHhHhhcccCc--ccCCchH----HHHHHHhcChhHHHHHHHHHHHHhcceeEEEecCCCCceEEEEeccceEEEE Confidence 999999999765443 5566543 5678889999999999999999999999999986 46799999999999988 Q ss_pred EEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCc Q lcl|NC_019418. 156 QSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDL 235 (527) Q Consensus 156 ~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l 235 (527) +.+..++..+++.. ...+..+ +.+....+.. +..+ .+++. +..+. T Consensus 130 ~D~~~~~~~~a~~~--~~~d~~~--~~~~~~~~~~-------------~~~~----~~~~~---~~~~~----------- 174 (409) T protein:vir:94 130 IDPITGLLTEGYAV--LERDENN--NVVLEAHFLP-------------DRTD----YYYRD---SRNNI----------- 174 (409) T ss_pred EecCCCceeeeEEE--EEecCCC--ceEEEEEEec-------------CcEE----EEEec---CceeE----------- Confidence 76666666665432 2222221 1222222211 1111 11111 11111 Q ss_pred ccceeecCCCcccEEEecCCccccccCCCccCcchh-hhhHHHHHHHHHHHHHHHHHHHc--CcceeeechhHhcCCCCC Q lcl|NC_019418. 236 QPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIF-DNAKTTIDFINRTYDEFMWEIKM--GQRRVIVPEQMTQLKVQD 312 (527) Q Consensus 236 ~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~-~~~~~lid~ld~~~s~~~~e~~~--~~~~i~v~~~~l~~~~~~ 312 (527) ..-+++++|+++.| .|+.+...|+|+|.+ +.+++++|++|++.++.+...+. -..+++. +...++ T Consensus 175 ---~~~n~~g~vPvV~f----~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-----G~d~d~ 242 (409) T protein:vir:94 175 ---SIANPTGHPLLVPI----IHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVT-----GLSDDA 242 (409) T ss_pred ---eeeCCCCCcceEEe----ccccccccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeE-----ecCCCC Confidence 11134556666655 466777899999999 57999999999999999987654 2223222 112222 Q ss_pred CCcccccccccccc-cceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHH Q lcl|NC_019418. 313 NQGNIAFKRRFDVE-QNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIV 391 (527) Q Consensus 313 ~~~~~~~~~~~d~~-~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~ 391 (527) ++. ..|... .++..--+..+++...|.+++. -..+.|.+.++.++++++..++++++.||..+.+..||.+|+ T Consensus 243 ~~~-----~~~~~~~~~i~~~~~d~dg~~~~v~q~~~-~~l~~~~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~ 316 (409) T protein:vir:94 243 EPM-----ETWKATVSSMLQFTKDEDGDKPTLGQFTQ-PSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIK 316 (409) T ss_pred ccc-----chhhhhHHHhhcCCCCCCCCCceEEecCC-CChhHHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHH Confidence 211 122211 1121111112233334444432 224689999999999999999999999999888778999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCH---HHHHHHHHHHHhcC--C Q lcl|NC_019418. 392 SENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDR---HAELDYWMKMVAAG--F 466 (527) Q Consensus 392 s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~---~~~~~~~~~~~~aG--i 466 (527) +.++.+..++.++++.|..+|+++++.++.+... .......+..+.+.|.+..+.+. .+.++.+.+++++| + T Consensus 317 a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~---~~~~~~~~~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~ 393 (409) T protein:vir:94 317 ASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDD---APYLREQFRKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEF 393 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC---CCccccccccceEEeccCCCcchHHHHHHHHHHHHHHHhcccc Confidence 9999999999999999999999999988876532 12233455789999996555554 45567789999999 5 Q ss_pred CCHHHHHHhcCCCCHHH Q lcl|NC_019418. 467 ATQKRGIAKTLGITEEE 483 (527) Q Consensus 467 ~s~~~~i~~~~~~~dee 483 (527) ++.++.+ +..|+|+.+ T Consensus 394 ~~~~~~~-~~lG~~~~d 409 (409) T protein:vir:94 394 INKDTIR-DLTGIEGGE 409 (409) T ss_pred cchhHHH-HHcCCCCCC Confidence 6667654 555998776 No 70 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=100.00 E-value=2.9e-34 Score=204.41 Aligned_cols=395 Identities=13% Similarity=0.072 Sum_probs=257.8 Q ss_pred CC--hHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc--CccccCceeecchH Q lcl|NC_019418. 1 MS--LIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD--GDRKRRKMQHLPIA 76 (527) Q Consensus 1 m~--~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~--~~~~~~~~~~lnl~ 76 (527) |. ++++|.+.+ .....|+.+..+||.|+++ +.+.... .....+.+..+|+| T Consensus 1 ~~~~~i~~L~~~~------------------------~~~~~r~~~~~~yY~g~~~-~~~~~~~~p~~~~~~~~~v~nw~ 55 (409) T protein:vir:16 1 MTEKGIGYLRFKL------------------------SVHKRRAEMRYEQYAMKHV-DRFKGITIPQALSQQYRSILGWC 55 (409) T ss_pred CCHHHHHHHHHHH------------------------HHHhHHHHHHHHHHhccCc-hhhcchhhhHHHHHHHhhhcChh Confidence 22 112221111 1134678888999999876 3332221 11112334667999 Q ss_pred HHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe-CCeeEEEEEcCCceEEE Q lcl|NC_019418. 77 RTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD-GDKIRVAFIQAPVFLPL 155 (527) Q Consensus 77 ~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d-~~~~~i~~v~a~~~~P~ 155 (527) +.+|+.+|+.+.-+. |+.+|.. ++++++.|+|.....+++..|+.+|.+|+.+|-+ .+++.|.+++|.+++.+ T Consensus 56 ~~iVds~a~rl~~~G--f~~~d~~----l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~~i~~~sP~~~~~i 129 (409) T protein:vir:16 56 AKGVDSLADRLVFRE--FENDDFT----VNEIFEENNPDIFFDSTVLSALIASCSFTYISKGENDAVRLQVIEATNATGI 129 (409) T ss_pred HHHHHHhHhhccccc--ccCcchH----HHHHHHhcChhHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEE Confidence 999999999765443 5555543 6677888999999999999999999999999986 46799999999999999 Q ss_pred EEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCc Q lcl|NC_019418. 156 QSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDL 235 (527) Q Consensus 156 ~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l 235 (527) +.+..++..+++.... ..+......++ .+. ++.+ ...++. +.. |. T Consensus 130 ~D~~~~~~~~a~~~~~-~d~~~~~~~~~---~~~----------------~~~~-~~~~~~---~~~---------~~-- 174 (409) T protein:vir:16 130 IDPITGLLTEGYAVLE-RDENNNVVLEA---HFL----------------PDRT-DYYYRD---SRN---------NI-- 174 (409) T ss_pred eecccccceeeeEEEE-ecCCCceEEEE---EEe----------------cCcE-EEEEec---Ccc---------cc-- Confidence 8776676666554222 22222111111 111 0000 111211 111 11 Q ss_pred ccceeecCCCcccEEEecCCccccccCCCccCcchh-hhhHHHHHHHHHHHHHHHHHHHc--CcceeeechhHhcCCCCC Q lcl|NC_019418. 236 QPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIF-DNAKTTIDFINRTYDEFMWEIKM--GQRRVIVPEQMTQLKVQD 312 (527) Q Consensus 236 ~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~-~~~~~lid~ld~~~s~~~~e~~~--~~~~i~v~~~~l~~~~~~ 312 (527) ..-+++++++++.| .|+.+...|+|+|.+ ..+++++|++|++.++.+...+. -..+++. +...++ T Consensus 175 ---~~~~~~g~vPvV~f----~n~~~~~~~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-----G~d~d~ 242 (409) T protein:vir:16 175 ---SIANPTGNPLLVPI----IHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVT-----GLSDDA 242 (409) T ss_pred ---ceecCCCCcceEEe----cccccccccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeE-----ecCCCC Confidence 11245566666655 477788899999998 57999999999999999876653 3333322 112222 Q ss_pred CCccccccccccc--ccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHH Q lcl|NC_019418. 313 NQGNIAFKRRFDV--EQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEI 390 (527) Q Consensus 313 ~~~~~~~~~~~d~--~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei 390 (527) ++. ..|+. .+-.+.+ ...+++...|.+++. -..+.|.+.++.++++++..++++++.||..+....||.+| T Consensus 243 ~~~-----~~~~~~~~~i~~~~-~d~~g~~~~v~q~~~-~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai 315 (409) T protein:vir:16 243 EPM-----ETWKATVSSMLQFT-KDEDGDKPTLGQFTQ-PSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAI 315 (409) T ss_pred Ccc-----chhhhhhhHhhccC-CCCCCCCceEEecCC-CChhHHHHHHHHHHHHHhhhcCCCHHHcccccCchhHHHHH Confidence 211 11221 1111111 112233344544433 23468999999999999999999999999988877899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCC---HHHHHHHHHHHHhcCC- Q lcl|NC_019418. 391 VSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTD---RHAELDYWMKMVAAGF- 466 (527) Q Consensus 391 ~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d---~~~~~~~~~~~~~aGi- 466 (527) ++....+..++.++++.|..+|+++++.++.+... .+........+.|.|.+..+.+ ..+.++...+++++|. T Consensus 316 ~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~---~~~~~~~~~~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~a~~~ 392 (409) T protein:vir:16 316 KASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDD---VPYLREQFSKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPE 392 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC---CCccchhhccceEEecCCCCcchhhHHHHHHHHHHHHhhccc Confidence 99999999999999999999999999998877532 1223345577899999766444 5677888999999974 Q ss_pred CCHHHHHHhcCCCCHHH Q lcl|NC_019418. 467 ATQKRGIAKTLGITEEE 483 (527) Q Consensus 467 ~s~~~~i~~~~~~~dee 483 (527) +...+.+.++.|+|+.+ T Consensus 393 ~~~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 393 FINKDTIRDLTGIKGAE 409 (409) T ss_pred ccchhHHHHhccCCCCC Confidence 33333345566998776 No 71 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=100.00 E-value=7.3e-34 Score=202.24 Aligned_cols=474 Identities=10% Similarity=0.095 Sum_probs=276.1 Q ss_pred HHHHh----hcccchhhhcc-CccccCHH---HHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHHHHhh Q lcl|NC_019418. 14 GRYNM----TTSHLSSILDH-PKVAVTQS---EFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIAS 85 (527) Q Consensus 14 ~~~~~----~~~~~~~~~~~-~~i~~~~~---~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~ 85 (527) |.++- ..|++..-.++ |. .+++. ++.+|+..-.||.|++..|...-..+....++.+..+- +. T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~-~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps--------~~ 71 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPN-AVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPN--------GE 71 (527) T ss_pred CCccccccCCCcCcCCccccCcc-cCCHHHHHHHHHHHHHHHHhcCchhheeeecCCccccccceeeehh--------hH Confidence 44432 23333111111 11 14554 45555556678999887775332222211122222232 25 Q ss_pred hhhcccceEee---------CCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-----CeeEEEEEcCCc Q lcl|NC_019418. 86 LVYNEQAEISA---------EDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-----DKIRVAFIQAPV 151 (527) Q Consensus 86 ll~~e~~~i~~---------~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-----~~~~i~~v~a~~ 151 (527) .+++..-+|.+ .++..++.|..+++.+++..++..+-.+|..+|+++|++.||. +++++..++|.+ T Consensus 72 ~~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~~ 151 (527) T protein:vir:10 72 KLIEAKMRFLGQGLKWEFSKKDAKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPST 151 (527) T ss_pred HhhCCcceeeccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCcce Confidence 55555555544 2456788999999999999999999999999999999999995 369999999999 Q ss_pred eEEEEEcCC-ceEEEEEEEE--EEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCc---- Q lcl|NC_019418. 152 FLPLQSNTQ-DVSSAAILTK--TIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGE---- 224 (527) Q Consensus 152 ~~P~~~d~~-~~~~~a~~~~--~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~---- 224 (527) +||++.+.+ +....+.+.. ..+.+.+.+..-.++-+....-. .... --.+|+|.|....+. +|. T Consensus 152 ~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~--~~g~---~~~~G~~~yt~~~w~----lg~w~d~ 222 (527) T protein:vir:10 152 YFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLD--DDGK---PVPGGAIKYTEELYE----PGKWDDR 222 (527) T ss_pred eeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcC--cccc---cccCcceeeeeceee----ccccccc Confidence 999966544 3445554332 22222222221122211100000 0000 002355555333222 221 Q ss_pred -eeecc-cccCCcccceeec----CCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcce Q lcl|NC_019418. 225 -RVNLS-ELYPDLQPVTPIQ----GLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRR 298 (527) Q Consensus 225 -~v~l~-~~~~~l~~~~~~~----g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~ 298 (527) +.|+. +-++.....+++. .+..+++++|+ |-....+.||.|+++++++++++||.+.|+..+.++.+... T Consensus 223 ~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~----t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~P 298 (527) T protein:vir:10 223 PESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFR----GHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLG 298 (527) T ss_pred cccccchhhhhhhcCceeeecccCCCCccceEeec----CCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCc Confidence 11221 1122122222222 33445555553 44445678999999999999999999999999999998877 Q ss_pred eeechhHhcCCCCCCCcccc-----cccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_019418. 299 VIVPEQMTQLKVQDNQGNIA-----FKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVS 373 (527) Q Consensus 299 i~v~~~~l~~~~~~~~~~~~-----~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s 373 (527) |++-..+--. + ..|+.. +...|. . ++++.++.++..--...|...+..+.+.|....+++ T Consensus 299 i~~~tg~~~v--d-~~G~~~~~~VgPG~iwe----------L--~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~P 363 (527) T protein:vir:10 299 FYATDSAPPR--D-SRGNMVPWTISPLGMVE----------H--GQNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIP 363 (527) T ss_pred eeeecccccc--c-ccCCcCccccCCceeEe----------c--CCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCC Confidence 8775554311 2 122211 111111 1 233345554443345678888999999999999999 Q ss_pred ccccc-ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHhhhhc-ccCCcccCccceEEEeCCCccCC Q lcl|NC_019418. 374 SGMFT-FDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVS-MCELGKVVG-IYRGTIPELDDISVNLDDGVFTD 450 (527) Q Consensus 374 ~~~~~-~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~-il~~~~~~~-~~~~~~~~~~~v~v~f~d~i~~d 450 (527) ..+|| .+.++..|+.++.-+.+.++++..+++..|+-..++..+- +..+..++. +..........++|.|.+.+|.| T Consensus 364 avA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D 443 (527) T protein:vir:10 364 DIAVGVVDAAVAESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVN 443 (527) T ss_pred eeeeccccCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCC Confidence 99999 3456677888888899999999999999888888775542 222222222 22222233456899999999999 Q ss_pred HHHHHHHHHHHHhcCCCCHHHHHHhc---CCCCHHHHHHHHHHHHHhcccccccc----cCCCC---CCCCCCCCCCCCC Q lcl|NC_019418. 451 RHAELDYWMKMVAAGFATQKRGIAKT---LGITEEEAEKELAEINGELPPESDAE----LALYG---KGQQNTVGNSKDT 520 (527) Q Consensus 451 ~~~~~~~~~~~~~aGi~s~~~~i~~~---~~~~deea~~el~ri~~E~~~~~~~~----~~~~~---~~~~~~~~~~~~~ 520 (527) ..+.++++.+++++|++|.++|+.++ -|.. .++++++||.++.+.+..+. .+++. +.+--+.++.++- T Consensus 444 ~~avie~v~tL~~aGi~S~~tAv~~L~~~~g~e--D~E~E~~~I~~era~~a~a~a~A~~~~~a~~~~~~g~~~~~~d~~ 521 (527) T protein:vir:10 444 SEKRFNQLLQLWEAGLIPAKKLTEELSKIMGFE--LTEEDFKQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQA 521 (527) T ss_pred HHHHHHHHHHHHHcCchhHHHHHHHHHhccCCC--ChHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccc Confidence 99999999999999999999998776 2332 24566666665554332221 11110 0000011111111 Q ss_pred CCcccc Q lcl|NC_019418. 521 VDDEDE 526 (527) Q Consensus 521 ~~~~~~ 526 (527) ++..-= T Consensus 522 ~~~~~~ 527 (527) T protein:vir:10 522 LNGQPL 527 (527) T ss_pred cCCCCC Confidence 111100 No 72 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=100.00 E-value=8.2e-34 Score=201.97 Aligned_cols=474 Identities=10% Similarity=0.092 Sum_probs=276.3 Q ss_pred HHHHh----hcccchhhhcc-CccccCHH---HHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHHHHhh Q lcl|NC_019418. 14 GRYNM----TTSHLSSILDH-PKVAVTQS---EFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIAS 85 (527) Q Consensus 14 ~~~~~----~~~~~~~~~~~-~~i~~~~~---~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~ 85 (527) |.++- ..|++..-.++ |. .+++. ++.+|+..-.||.|++..|...-..+....++.+..+- +. T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~-~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps--------~~ 71 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPN-AVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPN--------GE 71 (527) T ss_pred CCccccccCCCcCcCCccccCcc-cCCHHHHHHHHHHHHHHHHhcCchhheeeecCCccccccceeeehh--------hH Confidence 44432 23333111111 11 14554 45555556678999887775332222211122222232 25 Q ss_pred hhhcccceEee---------CCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-----CeeEEEEEcCCc Q lcl|NC_019418. 86 LVYNEQAEISA---------EDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-----DKIRVAFIQAPV 151 (527) Q Consensus 86 ll~~e~~~i~~---------~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-----~~~~i~~v~a~~ 151 (527) .+++..-+|.+ .++..++.|..+++.+++..++..+-.+|..+|+++|++.||. +++++..++|.+ T Consensus 72 ~~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~~ 151 (527) T protein:vir:10 72 KLIEAKMRFLGQGLKWEFSKKDAKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPST 151 (527) T ss_pred HhhCCcceeeccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCcce Confidence 55555555544 2456788999999999999999999999999999999999995 369999999999 Q ss_pred eEEEEEcCC-ceEEEEEEEE--EEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCc---- Q lcl|NC_019418. 152 FLPLQSNTQ-DVSSAAILTK--TIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGE---- 224 (527) Q Consensus 152 ~~P~~~d~~-~~~~~a~~~~--~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~---- 224 (527) +||++.+.+ +....+.+.. ..+.+.+.+..-.++-+....-. .... --.+|+|.|....+. +|. T Consensus 152 ~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~--~~g~---~~~~G~~~yt~~~w~----lg~w~d~ 222 (527) T protein:vir:10 152 YFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLD--DDGK---PVPGGAIKYTEELYE----PGKWDDR 222 (527) T ss_pred eeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcC--cccc---cccCcceeeeeceee----ccccccc Confidence 999966544 3445554332 22222222221122211100000 0000 002355555333222 221 Q ss_pred -eeecc-cccCCcccceeec----CCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcce Q lcl|NC_019418. 225 -RVNLS-ELYPDLQPVTPIQ----GLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRR 298 (527) Q Consensus 225 -~v~l~-~~~~~l~~~~~~~----g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~ 298 (527) +.|+. +-++.....+++. .+..+++++|+ |-....+.||.|+++++++++++||.+.|+..+.++.+... T Consensus 223 ~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~----t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~P 298 (527) T protein:vir:10 223 PESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFR----GHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLG 298 (527) T ss_pred cccccchhhhhhhcCceeeecccCCCCccceEeec----CCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCc Confidence 11221 1122122222222 33445555553 44445678999999999999999999999999999998877 Q ss_pred eeechhHhcCCCCCCCcccc-----cccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_019418. 299 VIVPEQMTQLKVQDNQGNIA-----FKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVS 373 (527) Q Consensus 299 i~v~~~~l~~~~~~~~~~~~-----~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s 373 (527) |++-..+--. + ..|+.. +...|. . ++++.++.++..--...|...++.+.+.|....+++ T Consensus 299 i~~~tg~~~v--d-~~G~~~~~~VgPG~iwe----------L--~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~P 363 (527) T protein:vir:10 299 FYATDSAPPR--D-SRGNMVPWTISPLGMVE----------H--GQNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIP 363 (527) T ss_pred eeeecccccc--c-ccCCcCccccCCceeEe----------c--CCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCC Confidence 8775554311 2 122211 111111 1 233345555443345678888999999999999999 Q ss_pred ccccc-ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHhhhhc-ccCCcccCccceEEEeCCCccCC Q lcl|NC_019418. 374 SGMFT-FDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVS-MCELGKVVG-IYRGTIPELDDISVNLDDGVFTD 450 (527) Q Consensus 374 ~~~~~-~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~-il~~~~~~~-~~~~~~~~~~~v~v~f~d~i~~d 450 (527) ..+|| .+.++..|+.++.-+.+.++++..+++..|+-..++..+- +..+..++. +..........++|.|.+.+|.| T Consensus 364 avA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D 443 (527) T protein:vir:10 364 DIAVGVVDAAVAESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVN 443 (527) T ss_pred eeeeccccCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCC Confidence 99999 3456677888888899999999999999888888775542 222222222 22222233456899999999999 Q ss_pred HHHHHHHHHHHHhcCCCCHHHHHHhc---CCCCHHHHHHHHHHHHHhcccccccc----cCCCC---CCCCCCCCCCCCC Q lcl|NC_019418. 451 RHAELDYWMKMVAAGFATQKRGIAKT---LGITEEEAEKELAEINGELPPESDAE----LALYG---KGQQNTVGNSKDT 520 (527) Q Consensus 451 ~~~~~~~~~~~~~aGi~s~~~~i~~~---~~~~deea~~el~ri~~E~~~~~~~~----~~~~~---~~~~~~~~~~~~~ 520 (527) ..+.++++.+++++|++|.++|+.++ -|.. .++++++||.++.+.+..+. .+++. +.+--+.++.++- T Consensus 444 ~~avie~v~tL~~aGiiS~etAv~~L~~~~g~e--D~E~E~~~I~~era~~a~a~a~a~~~~~a~~~~~~g~~~~~~d~~ 521 (527) T protein:vir:10 444 NEKRFAQLLELWEAGLIPAKKLTEELSKIMGFE--LTEEDFRQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQA 521 (527) T ss_pred HHHHHHHHHHHHHcCchhHHHHHHHHHhccCCC--chHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccc Confidence 99999999999999999999998776 2332 25566667766554332221 11110 0000011111111 Q ss_pred CCcccc Q lcl|NC_019418. 521 VDDEDE 526 (527) Q Consensus 521 ~~~~~~ 526 (527) ++..-= T Consensus 522 ~~~~~~ 527 (527) T protein:vir:10 522 LNGQPL 527 (527) T ss_pred cCCCCC Confidence 111100 No 73 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=100.00 E-value=4e-33 Score=198.20 Aligned_cols=479 Identities=13% Similarity=0.085 Sum_probs=265.0 Q ss_pred HHHHh-hcccchhhhccC-ccccCH---HHHHHHHHHHHHhcCCCcccccccccCccccCceeecch--HHHHHHHHhhh Q lcl|NC_019418. 14 GRYNM-TTSHLSSILDHP-KVAVTQ---SEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPI--ARTAAKKIASL 86 (527) Q Consensus 14 ~~~~~-~~~~~~~~~~~~-~i~~~~---~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl--~~~i~~~~A~l 86 (527) |.++. +-.+....+... .--+++ .++.+|+..-.||.|+.-.+.-. ..| +.+..+++ ++.+|++.+++ T Consensus 1 m~~~~~q~~p~~~~fp~~~a~wV~~~D~~RlaaY~ly~d~y~n~~~el~~i-l~G----~dr~~~~~ps~r~~V~~~~~~ 75 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLRGGDDNIVDENDKNRVRAYDLYENIYLNSAETLKLV-LRG----DDSVPILMPSGRKIVEAVHRF 75 (563) T ss_pred CCccccccCCCcccccccccccCCHHHHHHHHHHHHHHHhhcCchhhhhhh-cCC----CceeeeccchHHHHHHHHHHh Confidence 44332 222222222111 112444 45555555667899976554311 122 23556665 56999997765 Q ss_pred hhcccceEeeCC--------HHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-----CeeEEEEEcCCceE Q lcl|NC_019418. 87 VYNEQAEISAED--------ETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-----DKIRVAFIQAPVFL 153 (527) Q Consensus 87 l~~e~~~i~~~d--------~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-----~~~~i~~v~a~~~~ 153 (527) | +.+..+.|.. ...+.+|..+++.+++..++..+..+|..+|+++|++.||. .++++.-|+|.++| T Consensus 76 L-g~~~~~~Ve~~~~de~~~~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~~g~R~rv~~vDP~~~f 154 (563) T protein:vir:74 76 L-GVGFDYLVEPDMGDEGIRQSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKKAGERISVDEVDPRQIF 154 (563) T ss_pred c-CCCcEEecCccccCcchHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccccccCCCceEeecCCceee Confidence 5 9988998753 23567889999999999999999999999999999999984 47899999999999 Q ss_pred EEEEcCCceEEEEEEEE-----EEeeCCCcceEEEE---EEEEeecccccccceeeecCCceEEEEEEEecCCccccCce Q lcl|NC_019418. 154 PLQSNTQDVSSAAILTK-----TIKTENRKNVYYTL---VEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGER 225 (527) Q Consensus 154 P~~~d~~~~~~~a~~~~-----~~~~~~~~~~~yt~---lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~ 225 (527) |.... +. ..+++..+ ..+.+.++..+..+ .++..- .-..+.|.+.+-.+ .+|.- T Consensus 155 p~~dp-d~-v~g~~~v~v~~~~~~pdd~~~~~~r~~~~~~~lnde------------g~~~~~~~~dae~w----~lg~w 216 (563) T protein:vir:74 155 LIEDG-ST-VVGFHMVDIVQDFRSPDDPSKKLARRRTFRRVRNDE------------GMFTGRISSELTHW----TLGNW 216 (563) T ss_pred eccCC-CC-cccceeeecccCCCCCcchhccceeeeeeeeeeCCC------------CCccceeeeccchh----ccccc Confidence 95433 22 22222111 11222223222211 111100 00111222221111 12210 Q ss_pred -------eecccccCCc------ccceee-cCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_019418. 226 -------VNLSELYPDL------QPVTPI-QGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWE 291 (527) Q Consensus 226 -------v~l~~~~~~l------~~~~~~-~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e 291 (527) +......+.+ .++..+ ..+..+++.+|+ |-...+|.||.|+++++++++++||.+.|+..+. T Consensus 217 d~r~~~~~~~~~~~~~~~~~~~d~e~~~LP~pi~~iPiv~~~----tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i 292 (563) T protein:vir:74 217 DDRGAISDEQARRKEQVRSAQHDEEEEELPEPISQLPLYRWR----NKPPQNSSWGTSQLEGMETLAYALNQSLTDEDAT 292 (563) T ss_pred cccCccchhhhcccchhhhhhhhchhhhccccccCccEEEcC----CCCCcccccchhhHHHHHHHHHHHhhhhhHHHHH Confidence 0000001110 000001 112233343332 2234568899999999999999999999999999 Q ss_pred HHcCcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHH-HHHHhc Q lcl|NC_019418. 292 IKMGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLK-LFEMQI 370 (527) Q Consensus 292 ~~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~-~i~~~~ 370 (527) +..+...|+|-++.. ..++..++...... .+-..+.. .++...+.++.++..--++.+...++.+.. .|.... T Consensus 293 ~~~tG~pi~vl~~~~--p~d~~~g~~~~w~v-gpG~i~El---~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~eral~~~s 366 (563) T protein:vir:74 293 IVFQGLGMYVTNASA--PVDPNTGELTDWNI-GPMQIVEI---AGNRNDNYFERVSGVQDVSPFQDHMKWIDEKGIAEGS 366 (563) T ss_pred HHhcCCCeEEecccc--cccccccccccccc-CCceeEec---cCCccccceeeecchhhhHHHHHHHHHHHHHHHHhhc Confidence 999988888866643 34444544321100 01111111 112233456666554444555556666655 456667 Q ss_pred CCCcccccc-cccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHh-hhh------c-ccCCcccCcc Q lcl|NC_019418. 371 GVSSGMFTF-DGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKE----LCVSMCELG-KVV------G-IYRGTIPELD 437 (527) Q Consensus 371 g~s~~~~~~-~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~----li~~il~~~-~~~------~-~~~~~~~~~~ 437 (527) +.+..+||- +.+...|+.++.-+.+.+.+++++++..|..++++ ++..+|..- .++ . +.....+... T Consensus 367 ~tPavA~G~vD~~~~~SGiALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~~ 446 (563) T protein:vir:74 367 GTPEVAIGRVDVTSAESGISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNEC 446 (563) T ss_pred cCcceeecccccccccchhhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCce Confidence 899888982 23335566666668899999999998887777766 554444211 110 1 1112223334 Q ss_pred ceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcC--CCCHHHHHHHHHHHHH--------hcccccccccCC-- Q lcl|NC_019418. 438 DISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTL--GITEEEAEKELAEING--------ELPPESDAELAL-- 505 (527) Q Consensus 438 ~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~--~~~deea~~el~ri~~--------E~~~~~~~~~~~-- 505 (527) .|+|.|.+.+|+|.++.+++...++++|++|.+||+.++- ||...+|+.+.++|+. .++. +++..+. T Consensus 447 ~v~ivf~p~~P~d~~~vv~~~~tl~~aGiiSretAv~~L~~~g~~~pdae~e~~~ie~~~i~~~~~a~a~-ad~~~~~~a 525 (563) T protein:vir:74 447 SVVCIFADPMPVNKTQVTQDTLLLQQAHLILRKMAVAKLRSIGWEYPEVDDQGNALTDDDIADMLLAEAE-ADASLGLSA 525 (563) T ss_pred EEEEEeCCCCCccHHHHHHHHHHHHHcCchhHHHHHHHHHhCCCCCCcHHHHHhhcCHHHHHHHHHHHhh-ccCccccee Confidence 4789999999999999999999999999999999966551 5555445555444422 1221 1222111 Q ss_pred CC-----CCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 506 YG-----KGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 506 ~~-----~~~~~~~~~~~~~~~~~~~~ 527 (527) .+ +++.++.+++.|.-|.--|- T Consensus 526 ~~~~g~~~~~~dd~g~p~~~~~~~~~~ 552 (563) T protein:vir:74 526 MDNGGAGEQQFDDQGNPIDQFGNPVEI 552 (563) T ss_pred cccCCCCcccccccCCchhHcCCcccC Confidence 11 11111111111111111111 No 74 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=99.69 E-value=6.2e-15 Score=98.47 Aligned_cols=429 Identities=10% Similarity=0.035 Sum_probs=208.0 Q ss_pred hhcccchhhhccCccccCHHHHHHHHHHHH---HhcCCCc-------ccccccccCccccCcee----ecchHHHHHHHH Q lcl|NC_019418. 18 MTTSHLSSILDHPKVAVTQSEFRRIQHNLA---YYQSKFD-------DIEYTNTDGDRKRRKMQ----HLPIARTAAKKI 83 (527) Q Consensus 18 ~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~---~y~g~~~-------~l~~~~~~~~~~~~~~~----~lnl~~~i~~~~ 83 (527) |... ...+++......|.. .|.|... .|..+.......-..|+ -.|+.+.+++.+ T Consensus 1 m~V~-----------~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~ 69 (452) T protein:vir:94 1 MPIE-----------TKHPEYLAYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSAL 69 (452) T ss_pred CCCC-----------CcCHHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHH Confidence 1100 125566666667754 4555322 12111111110001111 249999999999 Q ss_pred hhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe--CCeeEEEEEcCCceEEEEEcCCc Q lcl|NC_019418. 84 ASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD--GDKIRVAFIQAPVFLPLQSNTQD 161 (527) Q Consensus 84 A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d--~~~~~i~~v~a~~~~P~~~d~~~ 161 (527) ++++|.++|++++.+. ... +..=..-+++...++..+..++.+|.+++.|=+. +.+|-+..++|.+++=-..+..+ T Consensus 70 ~G~vf~k~p~~~~p~~-l~~-~~~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~~~~~~~~Ii~W~~~~~g 147 (452) T protein:vir:94 70 SGMVLDQPPVITHPDA-MSK-YFEDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYISVYTTENILNWEEDEDG 147 (452) T ss_pred hchhhcCCceecccHH-HHH-HHhcccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEEEEechhhhcCccccccC Confidence 9999999999887643 222 2222456789999999999999999999998774 56799999999998853334445 Q ss_pred eEEEEEEEEEEeeCCCcceE-EEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCccccee Q lcl|NC_019418. 162 VSSAAILTKTIKTENRKNVY-YTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTP 240 (527) Q Consensus 162 ~~~~a~~~~~~~~~~~~~~~-yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~ 240 (527) .+..+.+.+.....+..+.| -...+.|. ..+.+.+.|.+ ++|+..... . |...++... T Consensus 148 ~l~~v~lre~~~~~d~~d~f~~~~~~~yR---------vL~l~~g~~~v--~~~~~~~~~----~------~~~~~~~~~ 206 (452) T protein:vir:94 148 RLLMVVLREFYTVRDTADRYVQNIRVRYR---------CLELVDGLLQI--TVHETQDGK----V------WELAKTSTI 206 (452) T ss_pred CeeEEEEEEEEEEecCCCcccceeEEEEE---------EEEEeCCeEEE--EEEEccCCc----e------eeeccceee Confidence 55555554432221111110 00111111 01122344444 344432211 1 111111111 Q ss_pred ---ecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCccc Q lcl|NC_019418. 241 ---IQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGNI 317 (527) Q Consensus 241 ---~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~~ 317 (527) -+++...+|+++... ++.. ..|.|-|.++-.+--+.-..-|++.+.+......+.+-.. ..+..+-.+ T Consensus 207 ~~~~~~l~~IP~v~~~~~--~~~~---~~~~pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g----~~~~~~i~i 277 (452) T protein:vir:94 207 QNVGVTMDYIPFFCITPS--GLSM---TPAKPPMIDIVDINYSHYRTSADLEHGRHFTGLPTPWITG----AESQSTMHI 277 (452) T ss_pred cCCCcccceeEEEEEcCC--CCCC---CCCccchHHHHHHHHHHhcchhHHHHHHHHcccceeEeec----CcCCCceEe Confidence 124556667666422 2211 2366667777777666666667676666543333333111 111111111 Q ss_pred ccccccccccceeeeccCCCCCCCcceEeccccCh-HHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHH Q lcl|NC_019418. 318 AFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRS-SDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSD 396 (527) Q Consensus 318 ~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~-e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~ 396 (527) ..... +..++ ....+.+++++... +.+.+.|+.+-+++.. .|. ..+...+.+..|+++.....+. T Consensus 278 G~~~~----------~~lpe-~~~~~~yie~~g~~i~~~~~~l~~le~~m~~-~Ga--~ll~~~~~~~~s~ea~~~~~~~ 343 (452) T protein:vir:94 278 GSTKA----------WVIPE-VAAKVGFLEFTGQGLQSLEKALSEKQAQLAS-LSA--RLIDNSTRGSEATETVKLRYMS 343 (452) T ss_pred ccccc----------ccCCC-CCCcceEEccCchhHHHHHHHHHHHHHHHHH-HHH--HhhccCCCcchHHHHHHHHHHH Confidence 11111 12222 12235566665433 4566666666554422 221 2222222333333333333333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc Q lcl|NC_019418. 397 TYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKT 476 (527) Q Consensus 397 ~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~ 476 (527) ..+....+...++.+|.++++.+..+... .......++-+|... ..+ .+.++.+.+++.+|.+|.+|++..+ T Consensus 344 ~~s~L~~~a~~~e~al~~~l~~~a~w~g~------~~~~~v~~n~dF~~~-~~~-~~~~~al~~~~~~G~is~~t~~~~L 415 (452) T protein:vir:94 344 ETASLKSVTRAVEALLNKAYSCIMDMESM------GGTLNIKLNSAFLDS-KLT-AAELKAWVEAYLSGGISKEIYIHAL 415 (452) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHcCC------CCceEEEeccccccc-cCC-HHHHHHHHHHHhcCCCcHHHHHHHH Confidence 34555556666778888888877765321 111122333444332 223 4677888899999999999986543 Q ss_pred --CCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 477 --LGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 477 --~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) .|+-+-+ ++.+++.+|....... ..| ++.+++ +++ T Consensus 416 ~~~gvl~~~--~e~~~i~~E~~~~~~~---~~~-----~~~~~~------~~~ 452 (452) T protein:vir:94 416 KVGKVLPPP--GESMGVIPDPPAPEPS---PSN-----TPPNPS------SKA 452 (452) T ss_pred HhCCCCCCc--cCHHHHHHHhhccCcc---cCC-----CCCCCc------cCC Confidence 3443322 2234455543321111 010 111111 111 No 75 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=99.66 E-value=4.6e-14 Score=93.72 Aligned_cols=441 Identities=9% Similarity=0.052 Sum_probs=208.6 Q ss_pred HHHHhhcccchhhhccCccccCHHHHHHHHHHH---HHhcCCCc-------cccccccc-----CccccCceeecchHHH Q lcl|NC_019418. 14 GRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNL---AYYQSKFD-------DIEYTNTD-----GDRKRRKMQHLPIART 78 (527) Q Consensus 14 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~---~~y~g~~~-------~l~~~~~~-----~~~~~~~~~~lnl~~~ 78 (527) |-.+ -++..... .+++......|+ ..|.|... .|..+... ..+..|- .-.|+++. T Consensus 1 m~~~-~~~~v~~~--------h~~y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA-~~~n~~~~ 70 (513) T protein:vir:97 1 MADK-DPKSPATT--------SGAYDQMLPRWHVIETLLGGTEAMREAGETYLPRHQEETDKGYQERLASA-VLLNMVEQ 70 (513) T ss_pred CCCC-CCCCCCcC--------CHHHHHHHHHHHHHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcc-cCCChHHH Confidence 1111 11111110 222333333332 22333210 11100000 0001111 22399999 Q ss_pred HHHHHhhhhhcccceEeeCCH-HHHHHH-HHH-HhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC--C------------- Q lcl|NC_019418. 79 AAKKIASLVYNEQAEISAEDE-TLNDFL-SDM-LSNDRFNKNFERYLESALALGGLAMRPYVDG--D------------- 140 (527) Q Consensus 79 i~~~~A~ll~~e~~~i~~~d~-~~~~~l-~~~-l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~--~------------- 140 (527) +++.++.++|.++|+++.+.. ...+.| +++ ..-+++...++.++..++..|.+++.|=+.. + T Consensus 71 tl~~l~G~vf~k~p~~~~~~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~~~~T~Ade~ 150 (513) T protein:vir:97 71 TLDTLSGKPFSEPIKLNEDVPKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDGQPRTLADDR 150 (513) T ss_pred HHHHHhhhhhhcCcccCcCchHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccchhHHhHHHHH Confidence 999999999999998764332 333333 221 2245788899999999999999988874421 1 Q ss_pred ----eeEEEEEcCCceEEEEEcC---Cc--eEEEEEEEEEEeeCCC-cceEEEEEEEEeecccccccceeeecCCceEEE Q lcl|NC_019418. 141 ----KIRVAFIQAPVFLPLQSNT---QD--VSSAAILTKTIKTENR-KNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRIT 210 (527) Q Consensus 141 ----~~~i~~v~a~~~~P~~~d~---~~--~~~~a~~~~~~~~~~~-~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~ 210 (527) +|-+..+.|.+++= |.. ++ ++.-+.+.+.+...+. ..+.+ +.+ ..++.+.+. T Consensus 151 ~~~~rPy~~~~~~e~Iin--W~~~~v~G~~~L~~v~l~E~~~~~Dgf~~~~~---~q~-----------rvL~~g~~~-- 212 (513) T protein:vir:97 151 REGLRPYWVMIKPECLLF--ARSEVINGVEVLQHVRIIEHYMEQDGFAEVCK---RRI-----------RVLEPGLVQ-- 212 (513) T ss_pred hhccCceEEEecHhhhcC--cceeccCcceeeeeEEEEEEEeecCCCcceEE---EEE-----------EEEeCceEE-- Confidence 37788999999874 432 22 4444544444332221 11111 111 112223333 Q ss_pred EEEEecCCcccc-CceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHH Q lcl|NC_019418. 211 NELYKSTSDSQL-GERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFM 289 (527) Q Consensus 211 n~ly~~~~~~~l-G~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~ 289 (527) +|+....... +..+.+.+. .-+++..++|+++-.. .|... -|.+-|-++..+--+.=...|.+- T Consensus 213 --v~r~~~~~~~~~~e~~~~~~--------g~~~l~~IP~v~~~~~-~~~~~----~~~pPLl~LA~ln~~hy~~~Sd~~ 277 (513) T protein:vir:97 213 --LWEPVKKSNAQKEEWALADE--------WATGLNYVPLVTFYAD-RQGFM----MGKPPLLDLAHLNVAHWQSASDQR 277 (513) T ss_pred --EEEeecCCCccccceEEecC--------CCCcCCceeEEEEecC-CCCCC----CCccchHHHHHHHHHHHhhhhhHH Confidence 3332111111 111111000 0134566777766432 12111 244445554444444444445555 Q ss_pred HHHHc-CcceeeechhHhcCCCCCCCcccccccccccccceeeecc---CCCCCCCcceEeccccCh-HHHHHHHHHHHH Q lcl|NC_019418. 290 WEIKM-GQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVG---AGNMDSGGIVDLTTPIRS-SDYISAISEGLK 364 (527) Q Consensus 290 ~e~~~-~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~---~~~~~~~~i~~~~~~ir~-e~~~~~~~~~l~ 364 (527) ..+.. +.+..+++ . .+...+ ..+..+-+ ..+.....+.+++++-.. +.+.+.|+.+-. T Consensus 278 ~il~~~~~P~l~~~-G-----~~~~~~-----------~~i~iG~~~~~~lpe~~~~~~yie~~g~~i~~~~~~l~~le~ 340 (513) T protein:vir:97 278 HILTVSRFPILACS-G-----ASGEDS-----------DPVVVGPNKVLYNPDPAGRFYYVEHTGQAIAAGRTDLKDLEE 340 (513) T ss_pred HHHHhcccceeeee-c-----CCcCCC-----------CceEeeccccccCCCCCCcceeeccCchhHHHHHHHHHHHHH Confidence 55543 33333331 1 111100 01111111 111223346666666443 445566666555 Q ss_pred HHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeC Q lcl|NC_019418. 365 LFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLD 444 (527) Q Consensus 365 ~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~ 444 (527) ++ ...|.. +-...++.+|||+.....+...+....+...++.||+++++.+..+... + .......++-+|. T Consensus 341 qm-~~~Ga~---ll~~~~~~~Ta~a~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~wlg~----~-~~~~~v~in~dF~ 411 (513) T protein:vir:97 341 QM-AGYGAE---FLKRKTGGQTATARALDSAEATSDLSAMTGLFEDALAQALDITADWLRL----G-PNGGTVELVKDYD 411 (513) T ss_pred HH-HHHHHH---hhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----C-CCccEEEeccccC Confidence 54 233322 2223456799999998989999999999999999999999988866431 1 1111233444554 Q ss_pred CCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcC--CC-----CHHH-HHHHHHHHHHhccccccc---ccCCCCCCCC-- Q lcl|NC_019418. 445 DGVFTDRHAELDYWMKMVAAGFATQKRGIAKTL--GI-----TEEE-AEKELAEINGELPPESDA---ELALYGKGQQ-- 511 (527) Q Consensus 445 d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~--~~-----~dee-a~~el~ri~~E~~~~~~~---~~~~~~~~~~-- 511 (527) .... + .+.++.+++++.+|.+|.++++..+- |+ ++++ -+++..||.+.......+ ....++++.+ T Consensus 412 ~~~~-~-~~~~~al~~a~~~G~is~~t~~~~L~r~gvl~~d~d~~~~~e~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~ 489 (513) T protein:vir:97 412 LEEM-D-APGLQALQVAREKRDISRKTYLNGLRLRGVLPEDFDEDEDWEELMEEISEAMGRAGLDLDPAQKNPPEGGEGE 489 (513) T ss_pred cccC-C-HHHHHHHHHHHhCCCCCHHHHHHHHHhccCCCccCCHHHHHHHHHHhhhhccCCCCccccccCCCCCCCCCCC Confidence 4322 2 35678889999999999999865532 33 3332 334444554433221111 1111221111 Q ss_pred ----------CCCCCCCCCCCccc Q lcl|NC_019418. 512 ----------NTVGNSKDTVDDED 525 (527) Q Consensus 512 ----------~~~~~~~~~~~~~~ 525 (527) -|-++.+.|.|+|. T Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) T protein:vir:97 490 GEGEGEGGEGGEGGEGGGNPGGES 513 (513) T ss_pred CCCCCCCCCCCCccccCCCCCCCC Confidence 12222223333333 No 76 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.66 E-value=3.6e-14 Score=94.27 Aligned_cols=438 Identities=11% Similarity=0.049 Sum_probs=213.0 Q ss_pred CChHHHHHHHHH--HHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCc-----ccccccc-c----Cc---- Q lcl|NC_019418. 1 MSLIQKVKDFFN--RGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFD-----DIEYTNT-D----GD---- 64 (527) Q Consensus 1 m~~~~~~k~~~~--~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~-----~l~~~~~-~----~~---- 64 (527) ||||+|+-.+|. +...++..... .+=|.+-.. |+..... + .. T Consensus 1 mn~~dr~i~~~sP~~~~~R~~ar~~----------------------~~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~l 58 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKAARLRSRAV----------------------IQAYEAVKTTRTHKARRENRTADQLSQYGAVSL 58 (502) T ss_pred CchHhhHHhhcChHHHHHHHhhHHH----------------------HhhccccCcccccCCCCCCCChHHHHHHHHHHH Confidence 999999999984 22222211111 111222111 0000000 0 00 Q ss_pred --cccCceeecchHHHHHHHHhhhhhcc-cc----eEeeCC----HHHHHHHHHHHh----------hhhHHHHHHHHHH Q lcl|NC_019418. 65 --RKRRKMQHLPIARTAAKKIASLVYNE-QA----EISAED----ETLNDFLSDMLS----------NDRFNKNFERYLE 123 (527) Q Consensus 65 --~~~~~~~~lnl~~~i~~~~A~ll~~e-~~----~i~~~d----~~~~~~l~~~l~----------~n~f~~~~~~~~~ 123 (527) +.++-...-++++.+++.+++.+.|. .. ++...+ ++.++.+++.+. ..+|......++. T Consensus 59 r~RaRdl~rNn~~a~~av~~~~~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r 138 (502) T protein:vir:79 59 REQARYLDNNHDLVIGVFDKLEERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLR 138 (502) T ss_pred HHHHHHHHhcChHHHHHHHHHHHhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHH Confidence 00011123378899999999999985 22 222222 233444444332 2357777777888 Q ss_pred HHHhcCCEEEEEEEeCC---------eeEEEEEcCCceEEEEEcCC-ceEEEEEEEEEEeeCCCcceEEEEEEEEeeccc Q lcl|NC_019418. 124 SALALGGLAMRPYVDGD---------KIRVAFIQAPVFLPLQSNTQ-DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTP 193 (527) Q Consensus 124 ~a~~~G~~~~~~~~d~~---------~~~i~~v~a~~~~P~~~d~~-~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~ 193 (527) ..+..|.++++..++.. ..+|..++|+.+ |...+.+ .+..+|.+ .....-+-|.+.-.|- T Consensus 139 ~~~~dGE~f~~~~~~~~~~~~~g~~~~l~lq~iepd~l-~~~~~~~~~i~~GVe~-----d~~Gr~~aY~i~~~hP---- 208 (502) T protein:vir:79 139 TWLRDGEVFAQMVSGRINSLTPSAGVHFWLEALEPDFI-PMTSDESNRLNQGVFV-----DDWGRPEKYLVYKSRP---- 208 (502) T ss_pred HHHhCCceEEEEeecccCccCCCcccceEEEEecchhc-CCCCCCCCeeEeeeEE-----CCCCceEEEEEeecCC---- Confidence 88899999999988642 258899999885 5433332 33333322 1111122233322231 Q ss_pred ccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhh Q lcl|NC_019418. 194 TGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDN 273 (527) Q Consensus 194 ~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~ 273 (527) +.+. ... =..|| ..-+.|+.. ....+...|+|.|+. T Consensus 209 -----------gd~~----------~~~-~~rvp------------------A~~vlH~f~----~~r~gQ~RGis~lap 244 (502) T protein:vir:79 209 -----------VSGR----------QME-TKEVD------------------AERMLHLKF----VRRLHQMRGTSLLSG 244 (502) T ss_pred -----------CCCc----------ccc-eeEec------------------hhheEEeec----ccCCccccCCchHHH Confidence 0000 000 00111 111233322 223455679999999 Q ss_pred hHHHHHHHHHHHHHHHH-HHHcCcceeeechhHhcCCCCCCCcccccccccccccceee--e--cc-CCCCCCCcceEec Q lcl|NC_019418. 274 AKTTIDFINRTYDEFMW-EIKMGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYM--Q--VG-AGNMDSGGIVDLT 347 (527) Q Consensus 274 ~~~lid~ld~~~s~~~~-e~~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~--~--~~-~~~~~~~~i~~~~ 347 (527) ++..+..|+.-.+.-.. ..-.+--..||-.. ++..........-.....+.. + +. ..+ ...|+.++ T Consensus 245 vl~~l~~l~~~~dael~~a~i~A~~~~fi~~~------~~~~~~~~~~~~~~~~~~~~l~pG~i~~~L~p--Ge~i~~~~ 316 (502) T protein:vir:79 245 VLIRLSALKEYEDSELTAARIAAALGMYIRKG------DGQSYEPDGNGSKENERELTIQPGIIYDDLKP--GEEIGMVK 316 (502) T ss_pred HHHHHHHHhHHHHHHHHHHHHhhhheeeeecC------CCcccccccCCCCCccccccccCCccccccCC--CceeeeeC Confidence 99999999965433322 22223333344211 111110000000000111111 1 11 122 23588888 Q ss_pred cccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhhhh Q lcl|NC_019418. 348 TPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKE-LCVSMCELGKVV 426 (527) Q Consensus 348 ~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~-li~~il~~~~~~ 426 (527) |.-+..+|..-+..+++.|....|+++..++.+-++ |=..+++.....-.+....|..+...+-+ +.+..+..+-+- T Consensus 317 p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~--nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~ 394 (502) T protein:vir:79 317 SDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYNG--TYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVAS 394 (502) T ss_pred CCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccccc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc Confidence 887778888888899999999999999999877543 33333444444444444444444444433 333333322211 Q ss_pred c---ccCCcccCccceEEEe--CCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHH---hcccc Q lcl|NC_019418. 427 G---IYRGTIPELDDISVNL--DDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLGITEEEAEKELAEING---ELPPE 498 (527) Q Consensus 427 ~---~~~~~~~~~~~v~v~f--~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~---E~~~~ 498 (527) + +.+... ...-..+.| .--..+|+.++++....++.+|+.|.++.+.+. |.+-+++.+++++-.+ +.... T Consensus 395 G~i~~p~~~~-~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~~~a~~-G~D~~~v~~q~a~e~~~~~~~Gl~ 472 (502) T protein:vir:79 395 GVIRLPRDLD-RSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESDWVRAG-GRNPDDVKRRRKAEIDENRKLDLV 472 (502) T ss_pred CCCCCCCCCC-chhhcceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHc-CCCHHHHHHHHHHHHHHHHHcCCC Confidence 1 111111 112245666 444457999999999999999999999988776 8776665554443222 11111 Q ss_pred ccccc-CCCCC-CCCCCCCCCCCCCCcccc Q lcl|NC_019418. 499 SDAEL-ALYGK-GQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 499 ~~~~~-~~~~~-~~~~~~~~~~~~~~~~~~ 526 (527) .+..+ ...+. ..+.+..++....++.+| T Consensus 473 ~~~~~~~~~~~~~~~~~~~e~~~~~~~~e~ 502 (502) T protein:vir:79 473 FDTDPASDKGGSSAATKRQEPQHTDDQSEE 502 (502) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 11111 11111 111111112222222222 No 77 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=99.64 E-value=1.6e-13 Score=90.75 Aligned_cols=472 Identities=10% Similarity=0.086 Sum_probs=222.2 Q ss_pred HHHHHHHHHH-HHHHhhcccchhhhcc-----Ccc-ccCHHHHHHHHHHHH---HhcCCCc-------cccccccc---C Q lcl|NC_019418. 4 IQKVKDFFNR-GRYNMTTSHLSSILDH-----PKV-AVTQSEFRRIQHNLA---YYQSKFD-------DIEYTNTD---G 63 (527) Q Consensus 4 ~~~~k~~~~~-~~~~~~~~~~~~~~~~-----~~i-~~~~~~~~~i~~~~~---~y~g~~~-------~l~~~~~~---~ 63 (527) +.+-+.-+++ ......+.+-...+.| ++| ...+++......|.. .|.|... .|..+... . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~dV~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~~ 80 (535) T protein:vir:80 1 MARKRTTIRRDVQSKVLIPPQAPPTSGLGPSLPNVGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRDE 80 (535) T ss_pred CCcchhhhhhhhhhhcccCCCCcCCCCCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCCc Confidence 3333333332 2222233332233322 334 235677777777864 4666421 12221110 0 Q ss_pred c----cccCc--eeecchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHH-HhhhhHHHHHHHHHHHHHhcCCEEEEEE Q lcl|NC_019418. 64 D----RKRRK--MQHLPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDM-LSNDRFNKNFERYLESALALGGLAMRPY 136 (527) Q Consensus 64 ~----~~~~~--~~~lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~-l~~n~f~~~~~~~~~~a~~~G~~~~~~~ 136 (527) + .+.+. -.-.|+.+.+++.++.++|.++|++++. +.+..+++++ ..-+++...++.++..++.+|.+++.|= T Consensus 81 E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~~~~p-~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD 159 (535) T protein:vir:80 81 EQRRRYETYLQRAIFYNVTARTLDGMMGQVFSRDPIRQLP-PALEAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTD 159 (535) T ss_pred CCHHHHHHHHhhccCCChhHHHHHHHhchhhcCCcceecc-HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEe Confidence 0 01111 1223999999999999999999988764 3333333322 1233788899999999999999999885 Q ss_pred EeC--------------CeeEEEEEcCCceEEEEEcCC-----ceEEEEEEEEEEeeCCC-----cceEEEEEEEEeecc Q lcl|NC_019418. 137 VDG--------------DKIRVAFIQAPVFLPLQSNTQ-----DVSSAAILTKTIKTENR-----KNVYYTLVEFHEWVT 192 (527) Q Consensus 137 ~d~--------------~~~~i~~v~a~~~~P~~~d~~-----~~~~~a~~~~~~~~~~~-----~~~~yt~lE~h~~~~ 192 (527) +.. -+|-+..+.|++++= |... +++.-+.+.+.+...++ ....|..|+.. T Consensus 160 ~P~~~~~~t~ade~~~~~rPy~~~y~ae~Iin--W~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~RvL~~~---- 233 (535) T protein:vir:80 160 YPNVGRPVTVLEQKLGLYRPTITLVHPTSIIN--WRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQWRVLQLN---- 233 (535) T ss_pred ecCCCCcccHHHHHhcCCCcEEEEechhhccC--ccccccCCccceeEEEEEEEEEecCCCcccceeEEEEEEEec---- Confidence 521 137799999999874 4422 24555555555443332 12234444442 Q ss_pred cccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhh Q lcl|NC_019418. 193 PTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFD 272 (527) Q Consensus 193 ~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~ 272 (527) .++.|++ ++|+....... .....+. ..+....+.+...+|+++... .|... .|.+-|- T Consensus 234 ----------~~G~y~v--~~~~~~~~~~~--~~~~~~~---~~~~~g~~~l~~IPfv~~~~~-~~~~~----~~~pPLl 291 (535) T protein:vir:80 234 ----------AEGNYQV--ERWRRETQEEM--YYSYSKH---VPTDGNGNPFKEIPFQFIGPL-DNNAD----IDHPPLL 291 (535) T ss_pred ----------CCceEEE--EEEEeecCCcc--cccccee---ecccCCCcccCeeEEEEeecC-CCCCC----CCccchH Confidence 2345555 33433211111 1111110 011111234566677766421 12111 2334344 Q ss_pred hhHHHHHHHHHHHHHHHHHHHc-CcceeeechhHhcCCCCCC--CcccccccccccccceeeeccCCCCCCCcceEeccc Q lcl|NC_019418. 273 NAKTTIDFINRTYDEFMWEIKM-GQRRVIVPEQMTQLKVQDN--QGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTP 349 (527) Q Consensus 273 ~~~~lid~ld~~~s~~~~e~~~-~~~~i~v~~~~l~~~~~~~--~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ 349 (527) ++..+--+.=..-|++.+.+.. +.+..++. .+-....++. ...+..+ ... .+..+.+...++-++++. T Consensus 292 ~LA~lni~Hy~~ssd~~~il~~~~~P~l~i~-G~~~~~~~~~~~~~~i~iG-----~~~---~~~lP~~~~~~~~e~~~~ 362 (535) T protein:vir:80 292 DLCEVNIGHYRNSADYEEMAFVAGQPTAFFT-GLTKDWVEDVFKDFKVHLG-----SRA---IIPLPQGATAGILQITPN 362 (535) T ss_pred HHHHHHHHHhhchhHHHHHHHHhcCceeeee-cCchhhhhcCCCCcceEec-----Ccc---cccCCCCCCcceeeeccc Confidence 4333322221122333333433 34433331 1100000000 0000000 001 122333444445455554 Q ss_pred cChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_019418. 350 IRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIY 429 (527) Q Consensus 350 ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~ 429 (527) --. .++++.+-.++.. ++...+ ....+..||++.+...+...+....+...++.||.++++.+..+... T Consensus 363 ~~a---~~~l~~~e~qM~~---lGa~ll-~~~~~~~Ta~~a~~~~~~~~S~L~~~a~~le~al~~aL~~~A~w~G~---- 431 (535) T protein:vir:80 363 SVP---FEAMTHKESQMIA---MGANLL-VKSGGNRTFGEAQQEEASEQSILSACTKNVSMAFRKALRWANQFQTG---- 431 (535) T ss_pred hhH---HHHHHHHHHHHHH---HHHHhh-ccCcccccHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCC---- Confidence 222 2334333332211 122222 23345788888877777777777888888999999988877755321 Q ss_pred CCcccCcc--ceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCH--HHHHHHHHHHHHhccccccc-- Q lcl|NC_019418. 430 RGTIPELD--DISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKT--LGITE--EEAEKELAEINGELPPESDA-- 501 (527) Q Consensus 430 ~~~~~~~~--~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~--~~~~d--eea~~el~ri~~E~~~~~~~-- 501 (527) ....... .++-+|.+. ..|. ..++.+.+++.+|.||.++++..+ -++-+ .+-++|..||+.|....... T Consensus 432 -~~~~~~~~i~~n~dF~~~-~ld~-~~~~all~~~~~G~Is~et~~~~L~r~gvl~~~~~~eee~~ri~~E~~~~~~~~g 508 (535) T protein:vir:80 432 -IVNDETVEYNLNTDFPAA-RLTP-NERAELILEWQQGAITFKEMRAGLRRAGVASEDDAKAETEGKATVEFIAKTAAAG 508 (535) T ss_pred -ccCCCceEEEeccccccc-cCCH-HHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCcccchHHHHHHHHhhhhhccccCC Confidence 0111222 233444432 2233 467888899999999999986543 24321 12345667777764321111 Q ss_pred ccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 502 ELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 502 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) .....+.++++...-++.++|..--+ T Consensus 509 ~~~d~~~~g~~~~~~~~~~~~~~~~~ 534 (535) T protein:vir:80 509 KVGDAASGGTNKAKLNNGNGGGNQAG 534 (535) T ss_pred CCCCCCCCCCCcCcccCCccccccCC Confidence 11122223333333333333333333 No 78 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=99.59 E-value=6.6e-13 Score=87.36 Aligned_cols=439 Identities=11% Similarity=0.048 Sum_probs=212.0 Q ss_pred cCcc-ccCHHHHHHHHHHHH---HhcCCCc-------cccccccc---Cc----cccCcee--ecchHHHHHHHHhhhhh Q lcl|NC_019418. 29 HPKV-AVTQSEFRRIQHNLA---YYQSKFD-------DIEYTNTD---GD----RKRRKMQ--HLPIARTAAKKIASLVY 88 (527) Q Consensus 29 ~~~i-~~~~~~~~~i~~~~~---~y~g~~~-------~l~~~~~~---~~----~~~~~~~--~lnl~~~i~~~~A~ll~ 88 (527) -++| ...+++......|.. .|.|... .|...... +. .+.+... =.|.++.+++.+.+++| T Consensus 1 m~~V~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~l~G~vf 80 (501) T protein:vir:95 1 MPNVSFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFGLVGQVF 80 (501) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHHHhhhhh Confidence 3444 235677777777754 4666421 22221110 10 1111111 23999999999999999 Q ss_pred cccceEeeCCHHHHHHHHHH-HhhhhHHHHHHHHHHHHHhcCCEEEEEEEe--CC--------------eeEEEEEcCCc Q lcl|NC_019418. 89 NEQAEISAEDETLNDFLSDM-LSNDRFNKNFERYLESALALGGLAMRPYVD--GD--------------KIRVAFIQAPV 151 (527) Q Consensus 89 ~e~~~i~~~d~~~~~~l~~~-l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d--~~--------------~~~i~~v~a~~ 151 (527) .++|++++. +....+++++ ..-+++...++.++..++..|.+++.|=+. ++ +|-+..+.|.+ T Consensus 81 ~k~p~~~~p-~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~~~~~~rPy~~~~~~~~ 159 (501) T protein:vir:95 81 MRDPVVKVP-ALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADLEAGRIRPTLYVYSPTE 159 (501) T ss_pred cCCcceeCc-HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHHHhccCCcEEEEecHhh Confidence 999998744 2333333332 123378899999999999999999987542 11 37789999999 Q ss_pred eEEEEEcCC-----ceEEEEEEEEEEeeCCCc-----ceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccc Q lcl|NC_019418. 152 FLPLQSNTQ-----DVSSAAILTKTIKTENRK-----NVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQ 221 (527) Q Consensus 152 ~~P~~~d~~-----~~~~~a~~~~~~~~~~~~-----~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~ 221 (527) ++= |... .++.-+.+.+.+...+.. ...|..|+. +..+....++|+...... T Consensus 160 Iin--W~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~----------------~~~g~~~~~v~r~~~~~~ 221 (501) T protein:vir:95 160 IIN--WRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRL----------------DEEGYYVHEIWREPQPTK 221 (501) T ss_pred hcC--cceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEee----------------CCCceEEEEEEEecCCcc Confidence 874 4322 234445555544332221 122333322 234444556665432110 Q ss_pred -cCceeecccccCCccccee---ecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHH----HHHHH Q lcl|NC_019418. 222 -LGERVNLSELYPDLQPVTP---IQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEF----MWEIK 293 (527) Q Consensus 222 -lG~~v~l~~~~~~l~~~~~---~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~----~~e~~ 293 (527) -|..++-.......+.... -+++...+|+++-. .++... -|.+-|-++- .+|..+-+. -+-+. T Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~--~~~~~~---~~~pPLl~lA----~lni~hy~~ssd~~~~l~ 292 (501) T protein:vir:95 222 ADGSKIPKGNYQQYVVYKPTDAQGKRLTEIPFMFIGS--ENNDSN---PDNPNFYDLA----SLNMAHYRNSADYEESCY 292 (501) T ss_pred cCcceecCCcccccceeeeeccCCCcCCeeeEEEEec--CCCCCC---CCccchHHHH----HHHHHHHhhhhHHHHHHH Confidence 1111111110000011001 13455666765522 122111 1222222222 223333222 22232 Q ss_pred c-CcceeeechhHhcCCCCCCCccccc--ccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhc Q lcl|NC_019418. 294 M-GQRRVIVPEQMTQLKVQDNQGNIAF--KRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQI 370 (527) Q Consensus 294 ~-~~~~i~v~~~~l~~~~~~~~~~~~~--~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~ 370 (527) . +.+..++. ..+.+...... +-.+.... ++..+.++ .+.+++++-..- ..+.|+.+..++.. . T Consensus 293 ~~~~P~l~i~------G~~~~~~~~~~~~~i~~G~~~----~~~lP~~~--~~~~ie~~~~~i-~~~~l~~l~~~m~~-~ 358 (501) T protein:vir:95 293 IVGQPTPVLI------GLTEEWVTNVLKGSVNFGSRG----GIPLPVGA--DAKLLQASENTM-LKEAMDTKERQMVA-L 358 (501) T ss_pred Hcccceeeee------CCcccccccCCCCceeecccc----cccCCCCC--ceeEEecChhhH-HHHHHHHHHHHHHH-H Confidence 2 33323221 11111000000 00111111 11222222 344444432221 13445555444322 2 Q ss_pred CCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceE--EEeCCCcc Q lcl|NC_019418. 371 GVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDIS--VNLDDGVF 448 (527) Q Consensus 371 g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~--v~f~d~i~ 448 (527) |. ..+ ...++.+|||+.....+...+....+...++.||.++++.++.+... .+....+. -+|... T Consensus 359 Ga--~ll-~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~w~g~-------~~~~~~v~i~~df~~~-- 426 (501) T protein:vir:95 359 GA--KLV-EQKEVQRTATEAELEAASEGSTLSSATKNVSAAFEWALKWAARWVGQ-------ADSGVKFELNTDFDIA-- 426 (501) T ss_pred HH--hhc-cCCccchhHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCC-------CCCceEEEEecccccc-- Confidence 31 122 23446789999888888888888888889999999999988766321 12222333 333321 Q ss_pred CCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccc Q lcl|NC_019418. 449 TDRHAELDYWMKMVAAGFATQKRGIAKT--LGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDED 525 (527) Q Consensus 449 ~d~~~~~~~~~~~~~aGi~s~~~~i~~~--~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (527) .-..+.++.+.+++.+|.+|.++++..+ -++.+++..++.++|.+|..+.... .. +.. .....+.++|.+..+ T Consensus 427 ~~~~~~~~al~~~~~~G~is~~t~~~~L~~~~v~~~~~~~e~e~i~~~~~~~~~~-~~-~~~--~~~~~~gg~~~~~~~ 501 (501) T protein:vir:95 427 RMTPDERRSLVEEWQKGAITFEEMRTGLRKAGVATEDDSKAKEKIAKDTAEAMAL-AT-PAN--VPGDGSGGDNVGNSE 501 (501) T ss_pred cCCHHHHHHHHHHHhCCCCcHHHHHHHHHhCCCCChhHHHHHHHHHhhhcCcccc-cc-cCC--CCCCCcccccccCCC Confidence 1124557888899999999999985443 3665555556666776654321111 01 111 111122444444444 No 79 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.57 E-value=4.7e-13 Score=88.17 Aligned_cols=442 Identities=9% Similarity=0.010 Sum_probs=211.6 Q ss_pred CChHHHHHHHHH--HHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCc-----ccccccc-c------C--- Q lcl|NC_019418. 1 MSLIQKVKDFFN--RGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFD-----DIEYTNT-D------G--- 63 (527) Q Consensus 1 m~~~~~~k~~~~--~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~-----~l~~~~~-~------~--- 63 (527) ||||+++-.+|. +...++.... ..+=|.+-.. ++..... + . T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~ar~----------------------~~~~y~aa~~~r~~~~~~~~~s~~~~i~~~~~~l 58 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAARE----------------------AIQAYEAARPGRTHKAKRQPLGADTSLQKSAVSM 58 (548) T ss_pred CchHHhHhhhcchHHHHHHHHhHH----------------------HhccccccCccccccccCCCCChHHHHHHHHHHH Confidence 999999999984 2222221111 1111222100 0000000 0 0 Q ss_pred -ccccCceeecchHHHHHHHHhhhhhcc-cceEe----eCCHH----HHH----HHHHHHhh------hhHHHHHHHHHH Q lcl|NC_019418. 64 -DRKRRKMQHLPIARTAAKKIASLVYNE-QAEIS----AEDET----LND----FLSDMLSN------DRFNKNFERYLE 123 (527) Q Consensus 64 -~~~~~~~~~lnl~~~i~~~~A~ll~~e-~~~i~----~~d~~----~~~----~l~~~l~~------n~f~~~~~~~~~ 123 (527) .+.++-...-++++.+++.+.+.+.+. ...++ ..+.. .++ .|+.|.++ .+|......++. T Consensus 59 r~RaRdL~rNn~~a~~av~~~~~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R 138 (548) T protein:vir:95 59 REQCRKLDEDHDLVTGLLDRLEERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCR 138 (548) T ss_pred HHHHHHHHhcChHHHHHHHHHHHhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHH Confidence 000001123378899999999988874 22222 12222 222 34444432 347777777888 Q ss_pred HHHhcCCEEEEEEEeCC---------eeEEEEEcCCceEEEEEc--CCceEEEEEEEEEEeeCCCcceEEEEEEEEeecc Q lcl|NC_019418. 124 SALALGGLAMRPYVDGD---------KIRVAFIQAPVFLPLQSN--TQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVT 192 (527) Q Consensus 124 ~a~~~G~~~~~~~~d~~---------~~~i~~v~a~~~~P~~~d--~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~ 192 (527) ..+..|.++++..|+.. ..+|..++|+.+ |...+ ++.+..+|.+- ....-+-|.+...|- + T Consensus 139 ~~~~dGE~f~~~~~~~~~~~~~g~~~~~~lqliepd~l-~~~~~~~~~~i~~GIE~D-----~~Grp~aY~i~~~hP-g- 210 (548) T protein:vir:95 139 TWLRDGEGLAQKLMGRVPNYTFATSVPFALELLEPDYL-PFSYNNLSKGIVQGIERD-----TWRRKRAYHLLKDHP-G- 210 (548) T ss_pred HHHhCCceEEEeeecccccccCCcccceEEEEechhhc-CCCCCCCCCceeeeeEEC-----CCCceEEEEEeecCC-C- Confidence 88999999999998632 258889999875 32222 23333443221 111223343444441 0 Q ss_pred cccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhh Q lcl|NC_019418. 193 PTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFD 272 (527) Q Consensus 193 ~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~ 272 (527) ..... +.. ..|. -++..-+.|+. .....+...|+|.|+ T Consensus 211 -------------d~~~~------------~~~----~~~~---------rvpA~~VlHif----~~~r~gQ~RGvs~la 248 (548) T protein:vir:95 211 -------------NLQTL------------GGS----LAVK---------RVEAERIIHIA----YRKRIGQNRGVPMLH 248 (548) T ss_pred -------------ccccc------------ccc----ccee---------eechhHheecc----cccCCccccCcchHH Confidence 00000 000 0000 11222233332 223345667999999 Q ss_pred hhHHHHHHHHHHHHHHHHH-HHcCcceeeechhHhcCCCCCCCcccccccccccccceee--e--cc-CCCCCCCcceEe Q lcl|NC_019418. 273 NAKTTIDFINRTYDEFMWE-IKMGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYM--Q--VG-AGNMDSGGIVDL 346 (527) Q Consensus 273 ~~~~lid~ld~~~s~~~~e-~~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~--~--~~-~~~~~~~~i~~~ 346 (527) .++..+..|+.-.+.-..- .-.+--..||-. .++.+.... ....+....+-. + +. ..+++ .|+.+ T Consensus 249 pvl~~l~~l~~y~dael~~aki~A~~a~fi~~------~~~~~~~~~-~~~~~~~~~~~~~pG~iv~~L~pGe--~i~~~ 319 (548) T protein:vir:95 249 AVLIRLADLKDYEESERVAARISAALAMYIKK------GNPDSYTVE-PGKDRKNRTIPIAPGMVFDDLEPGE--DVGMI 319 (548) T ss_pred HHHHHHHHHhHHHHHHHHHHHHhhhheeeeec------CCCccccCC-CCcccccccccccCCccccccCCCc--eeeec Confidence 9999999999755433322 222222333311 111111000 000011111111 1 11 12222 47888 Q ss_pred ccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhhh Q lcl|NC_019418. 347 TTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKE-LCVSMCELGKV 425 (527) Q Consensus 347 ~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~-li~~il~~~~~ 425 (527) +|.-+..+|..-+..+++.|....|+++..++.+-++ |=..+++.....-......|..+...+-+ +.+..+..+-+ T Consensus 320 ~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s~--nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l 397 (548) T protein:vir:95 320 ESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYDG--TYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLL 397 (548) T ss_pred CCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccch--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 8887777888888999999999999999999877653 33333444444444444444444433333 44444433322 Q ss_pred hc---ccCCcccCccceEEEe--CCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHh---ccc Q lcl|NC_019418. 426 VG---IYRGTIPELDDISVNL--DDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLGITEEEAEKELAEINGE---LPP 497 (527) Q Consensus 426 ~~---~~~~~~~~~~~v~v~f--~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E---~~~ 497 (527) .+ +.... ....-+.+.| .--..+|+.++++....++.+|+.|.++.+.+. |.+-+++.+++++-.+. ..- T Consensus 398 ~G~i~lP~~~-~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a~~-G~D~~ev~~q~a~E~~~~~~~GL 475 (548) T protein:vir:95 398 ARKERLPADV-DHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGFADEAEVARAR-GRDPRELKKSRETEIKANRAAGL 475 (548) T ss_pred cCCcCCCCCC-CchhheeeeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHh-CCCHHHHHHHHHHHHHHHHHcCC Confidence 11 11111 1122356777 333457999999999999999999999988875 87766655554433221 111 Q ss_pred ccccccC---CCCCCCCCCCCC----CC--CCCCccccC Q lcl|NC_019418. 498 ESDAELA---LYGKGQQNTVGN----SK--DTVDDEDEA 527 (527) Q Consensus 498 ~~~~~~~---~~~~~~~~~~~~----~~--~~~~~~~~~ 527 (527) ..+..+. .....++.++.+ .+ -..|||-|. T Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (548) T protein:vir:95 476 VFSSDAYHQLVKSGMDPVEAVQKVYLGVGKMLTADEARE 514 (548) T ss_pred CCCCcccccccccccCCCCchhhhccccccccccchhHH Confidence 0000000 000011111100 00 012222222 No 80 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.54 E-value=1.1e-12 Score=86.23 Aligned_cols=436 Identities=10% Similarity=0.057 Sum_probs=208.6 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCc------cccccc-ccCc--------- Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFD------DIEYTN-TDGD--------- 64 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~------~l~~~~-~~~~--------- 64 (527) |++++++-.+.-+.. ..+-....+-|.+-.. |..... ...+ T Consensus 8 ~~~~dr~i~~~~~~~-----------------------~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~ 64 (505) T protein:vir:96 8 PSLAQRMVNWAWYRY-----------------------VEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLAS 64 (505) T ss_pred cchhhcccchhhhhh-----------------------HHHHHHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHH Confidence 777776665542111 1111222333443111 100000 0000 Q ss_pred ---cccCceeecchHHHHHHHHhhhhhcc-cceEeeC--------CHHHHHHH----HHHHhh--------hhHHHHHHH Q lcl|NC_019418. 65 ---RKRRKMQHLPIARTAAKKIASLVYNE-QAEISAE--------DETLNDFL----SDMLSN--------DRFNKNFER 120 (527) Q Consensus 65 ---~~~~~~~~lnl~~~i~~~~A~ll~~e-~~~i~~~--------d~~~~~~l----~~~l~~--------n~f~~~~~~ 120 (527) +.++-...-++++.+++.+++.+.|. ..++... ++..++.+ +.|.+. .+|...... T Consensus 65 lr~RaRdL~rNn~~a~~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l 144 (505) T protein:vir:96 65 LVQRAREQSINNPYAKRFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHL 144 (505) T ss_pred HHHHHHHHHhcChHHHHHHHHHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHH Confidence 00001123378899999999999984 3333321 44455544 444322 237777777 Q ss_pred HHHHHHhcCCEEEEEEEeCC---eeEEEEEcCCceEEEEE-----cCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecc Q lcl|NC_019418. 121 YLESALALGGLAMRPYVDGD---KIRVAFIQAPVFLPLQS-----NTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVT 192 (527) Q Consensus 121 ~~~~a~~~G~~~~~~~~d~~---~~~i~~v~a~~~~P~~~-----d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~ 192 (527) ++...+.-|.++++..+..+ ..+|..++|+.+- ... +++.+..+|.+- ....-+-|.+...|- T Consensus 145 ~~r~~~~dGE~f~~~~~~~~~~~~~~lqliepd~l~-~~~n~~~~~~~~i~~GIe~d-----~~Gr~~aY~i~~~hP--- 215 (505) T protein:vir:96 145 WMETLARDGEVLVREHRGYPNKWGYALQILECDRLD-LNYNADLQNGNRIRMSIELD-----AWERPVAYHLLVNHP--- 215 (505) T ss_pred HHHHHhhCCceEEEEeecCCCCcceEEEEechhhcC-CCCCcccCCcCeEEeceEEC-----CCCceEEEEEeecCC--- Confidence 88888889999888877544 3688999999743 211 112233344321 112223344444441 Q ss_pred cccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhh Q lcl|NC_019418. 193 PTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFD 272 (527) Q Consensus 193 ~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~ 272 (527) +..+.. +... . ..|. -++.+-+.|+. .....+...|+|.|+ T Consensus 216 -----------gd~~~~----~~~~--~---------~~~~---------rvpa~~vlH~f----~~~r~gQ~RGis~la 256 (505) T protein:vir:96 216 -----------GDNSYC----YHYA--G---------QTYE---------RVPADEIIHTF----VPWRPHQNRGIPWTH 256 (505) T ss_pred -----------Cccccc----cccc--c---------cccc---------ccCHhHhhhhh----cccCCccccCcchHH Confidence 000000 0000 0 0011 11122222222 222345567999999 Q ss_pred hhHHHHHHHHHHHHHHHHHH-HcCcceeeechhHhcCCCCCCCcccccccccccccceeeec------cCCCCCCCcceE Q lcl|NC_019418. 273 NAKTTIDFINRTYDEFMWEI-KMGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQV------GAGNMDSGGIVD 345 (527) Q Consensus 273 ~~~~lid~ld~~~s~~~~e~-~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~------~~~~~~~~~i~~ 345 (527) .++..+..|+.-.+.-..-- -.+--..||-. ..+ ..+.. ..+........+ ...+| ..|+. T Consensus 257 pvl~~l~~l~~y~dael~~a~i~A~~a~fi~~-----~~~-~~~~~----~~~~~~~~~~~l~pG~i~~L~pG--e~i~~ 324 (505) T protein:vir:96 257 ASMVELHHIGEYRKSEMIAAELGAKKVGFYEQ-----DPE-AYDQP----PEDDQGEIVEEVEAGTYQLLPYG--IRFKE 324 (505) T ss_pred HHHHHHHHHhHHHHHHHHHHHHhhhheeeeec-----CCc-cCCCc----cccccCccccccCCceeeecCCC--Ceeee Confidence 99999999997554433222 22333334411 111 11100 001000000001 11222 35888 Q ss_pred eccccChHHHHHHHHHHHHHHHHhcCCCcccccccccc-cchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHh Q lcl|NC_019418. 346 LTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQG-VKTATEIVSENSDTYQMRNSIVALVEQ-SIKELCVSMCELG 423 (527) Q Consensus 346 ~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g-~~TAtei~s~~~~~~~~~~~~~~~~~~-al~~li~~il~~~ 423 (527) ++|+-+..+|..-+..+++.|....|+++..++.+-++ .-+ .+++.....-......|..+.. .++.+.+..+..+ T Consensus 325 ~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~~nYS--S~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a 402 (505) T protein:vir:96 325 HKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEGVNFS--SLRSGELDERDLYKLLQFFVVTELLERVAGNLISMS 402 (505) T ss_pred eCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 88988888898999999999999999999998766543 221 1222333333333333333433 3333333333322 Q ss_pred hhhc---ccCCcccCccceEEEeCC--CccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccc Q lcl|NC_019418. 424 KVVG---IYRGTIPELDDISVNLDD--GVFTDRHAELDYWMKMVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPE 498 (527) Q Consensus 424 ~~~~---~~~~~~~~~~~v~v~f~d--~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~ 498 (527) -+.+ +.... ...-..+.|-- -..+|+.++++.....+.+|+.|.++.+.+. |.+-+++.+++++-++..... T Consensus 403 ~l~G~i~~p~~~--~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~t~~~~~a~~-G~D~~~v~~q~a~e~~~~~~~ 479 (505) T protein:vir:96 403 LLTQALPLNMVD--IDRLSQYAFQPRGWDWVDPAKDSKAHSESIKNRTRSRSSIIRAA-GDDPEDVFDEIAWEEQLMRDK 479 (505) T ss_pred HHcCCcCCCCcc--chhhceeeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHc-CCCHHHHHHHHHHHHHHHHHc Confidence 2111 11111 11113455633 3447999999999999999999999988886 877666655544432211100 Q ss_pred cccccCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_019418. 499 SDAELALYGKGQQNTVGNSKDTVDDE 524 (527) Q Consensus 499 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (527) .-..........+.+.+++.++.+|+ T Consensus 480 Gl~~~~~~~~~~~~~~~~~~~~~~d~ 505 (505) T protein:vir:96 480 GVNPTPPEQESKDATTDEEDDSASDD 505 (505) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 00001111111122222233333333 No 81 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=99.53 E-value=1.6e-12 Score=85.19 Aligned_cols=436 Identities=12% Similarity=0.066 Sum_probs=207.3 Q ss_pred hhcccchhhhccCccc-cCHHHHHHHHHHHH---HhcCCCc------ccccccc-cCcc-cc-Cc-e-eecchHHHHHHH Q lcl|NC_019418. 18 MTTSHLSSILDHPKVA-VTQSEFRRIQHNLA---YYQSKFD------DIEYTNT-DGDR-KR-RK-M-QHLPIARTAAKK 82 (527) Q Consensus 18 ~~~~~~~~~~~~~~i~-~~~~~~~~i~~~~~---~y~g~~~------~l~~~~~-~~~~-~~-~~-~-~~lnl~~~i~~~ 82 (527) |...+ .+ ..+|+ ..+++......|+. .|.|..- .+.+... .++. .+ +. + .-.|+.+.+++. T Consensus 1 ~~~~~-~~---~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~ 76 (489) T protein:vir:78 1 MLTEN-GQ---GSGVKTKHREWLHYAPKWQKVRHALAGELVSYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSG 76 (489) T ss_pred CccCC-Cc---cCCCCccCHHHHHHHHHHHHHHHHhcCcccccccCCCCCCCCCCCChHHHHHHHhccccCChHHHHHHH Confidence 22111 12 12442 36677787788865 4667321 1111111 1111 11 11 1 124999999999 Q ss_pred HhhhhhcccceEeeCCHHHHHHHHHH-HhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-------------CeeEEEEEc Q lcl|NC_019418. 83 IASLVYNEQAEISAEDETLNDFLSDM-LSNDRFNKNFERYLESALALGGLAMRPYVDG-------------DKIRVAFIQ 148 (527) Q Consensus 83 ~A~ll~~e~~~i~~~d~~~~~~l~~~-l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-------------~~~~i~~v~ 148 (527) ++.++|.++|++++.+ .+..+++++ ..-+++...++.++..++.+|.+++.|=+.. -+|-+..+. T Consensus 77 l~G~vfrk~p~~~~p~-~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ade~~~~~rPy~~~~~ 155 (489) T protein:vir:78 77 MVGSVMRKEPEINIPK-ELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAEQNAGLLNPTIAFYT 155 (489) T ss_pred HhchhhcCCcceeccH-HHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHHHHHhcCCcEEEEec Confidence 9999999999987653 344444322 1235688889999999999999998876632 157899999 Q ss_pred CCceEEEEEcC---Cc--eEEEEEEEEEEeeCC-Cc------ceEEEEEEEEeecccccccceeeecCCceEEEEEEEec Q lcl|NC_019418. 149 APVFLPLQSNT---QD--VSSAAILTKTIKTEN-RK------NVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKS 216 (527) Q Consensus 149 a~~~~P~~~d~---~~--~~~~a~~~~~~~~~~-~~------~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~ 216 (527) |++++= |.. +| ++.-+.+.+.....+ .+ ...|..|+. +..++...++|+. T Consensus 156 ~~~Iin--W~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~----------------~~~g~~~~~~~r~ 217 (489) T protein:vir:78 156 TENIVN--WRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDI----------------DSDGNYRQRLFRF 217 (489) T ss_pred hhhhcC--ceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEec----------------CCCcceEEEEEEe Confidence 999874 331 22 455555555432211 11 112222221 1123334455654 Q ss_pred CCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHH----HHH Q lcl|NC_019418. 217 TSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFM----WEI 292 (527) Q Consensus 217 ~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~----~e~ 292 (527) ..+...+. ....++++ .--+.+...+|+++-.. ++... -|.+-|-++-. ||..+-+.. +.+ T Consensus 218 ~~~g~~~~--~~~~~~~~----~g~~~l~~IPfv~~~~~--~~~~~---~~~pPLl~LA~----lni~Hy~~ssd~~~~l 282 (489) T protein:vir:78 218 DAEGGAQE--DVVEIYPD----LGESLRGVIPFTFIGAT--NNDAT---IDDAPLLPLAE----LNIGHYRNSADNEESS 282 (489) T ss_pred ecCCcccc--eeeEEecc----CCCCccCeeeEEEEecC--CCCCC---CCcCchHHHHH----HHHHHhhhhhHHHHHH Confidence 32221111 00111111 01134556667666432 22111 12233333222 233332222 233 Q ss_pred Hc-CcceeeechhHhcCCCCCCCcccccc----cccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHH Q lcl|NC_019418. 293 KM-GQRRVIVPEQMTQLKVQDNQGNIAFK----RRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFE 367 (527) Q Consensus 293 ~~-~~~~i~v~~~~l~~~~~~~~~~~~~~----~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~ 367 (527) .. +.+..++.. ..+...+..... -.+. ... ++....++ ....+++....-. .+.|+.+-.++. T Consensus 283 ~~~~~P~l~i~G-----~d~~~~~~~~~~~~~~i~~g-~~~---~~~lp~~~--~~~~ie~~~~~~~-r~~l~~le~qm~ 350 (489) T protein:vir:78 283 FVVGQPTLFIYP-----GENLTPQAFKEANPNGIKFG-SRR---GHNLGYGG--SAQLIQAGENNLA-RQNMLDKEQQAI 350 (489) T ss_pred HHcccceeeeec-----CccCCcccccccCccceeeC-Ccc---cccCCCCC--CcceeccCcchHH-HHHHHHHHHHHH Confidence 32 333333311 000000000000 0011 111 11122222 2334444322222 334444333321 Q ss_pred HhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCc Q lcl|NC_019418. 368 MQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGV 447 (527) Q Consensus 368 ~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i 447 (527) ..|. ..+. .++.+|||+.....+...+....+...++.||.++++.++.+.. ...+ ......++.+|.... T Consensus 351 -~lGa--~l~~--~~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w~G---~~~~-~~~~i~~n~dF~~~~ 421 (489) T protein:vir:78 351 -QIGA--QLIT--PTQQITAQSARIQRGADTSVMATIARNVSQAYTDALRWVAVMLG---KPED-TEVEFRLNMDFFLEP 421 (489) T ss_pred -HHhh--hhcc--CCcchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcC---CCCC-CceEEEeecccCccc Confidence 2222 2332 33568999988888888888888889999999999988876632 1111 011233456665532 Q ss_pred cCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCHHHHHHHHHHHHHhcccc-cccccCCCCCCCCCCC Q lcl|NC_019418. 448 FTDRHAELDYWMKMVAAGFATQKRGIAKT--LGITEEEAEKELAEINGELPPE-SDAELALYGKGQQNTV 514 (527) Q Consensus 448 ~~d~~~~~~~~~~~~~aGi~s~~~~i~~~--~~~~deea~~el~ri~~E~~~~-~~~~~~~~~~~~~~~~ 514 (527) .| ...++.+.+++.+|.||.++++..+ -|+-+.+.+++..||..+..+. ....++.+.+.++.+. T Consensus 422 -~d-~~~~~al~~~~~~G~is~~t~~~~L~~~gv~d~~~e~~~~ei~~~~~~~~~~~~g~~~~~~q~~~~ 489 (489) T protein:vir:78 422 -MT-AQDRAAWMADINAGLLPATAYYAALRKAGVTDWTDADIKDAVADQPLPVATEVQGEIPQSAQQQEK 489 (489) T ss_pred -CC-HHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCccHHHHHHHHhhcCCCcccCCcccCCCCcccccC Confidence 34 3457888899999999999987643 2443333334444555543221 1111222222222221 No 82 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.52 E-value=2.2e-14 Score=95.43 Aligned_cols=486 Identities=9% Similarity=0.049 Sum_probs=219.6 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccc-ccCccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTN-TDGDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~-~~~~~~~~~~~~lnl~~~i 79 (527) --++.+++.++++-. .-..+-+....+..+||.|+ .|.... ..-+-..+..++.|+=+.+ T Consensus 44 ~~~~~~l~~~~~~~~-----------------~~~~~~r~~a~~d~~fy~G~--Qw~~~~~~~l~~~g~p~~~~N~i~~~ 104 (776) T protein:vir:93 44 VELHSRLLSYYRQEL-----------------SRQQDNRAEMAVDEDYYDNI--QWSQDEIDELKERGQAPTVYNVISQS 104 (776) T ss_pred HHHHHHHHHHHHHHH-----------------hhchHHHHHHHHHHHHhCCC--CCCHHHHHHHHhcCCceEEecchHHH Confidence 235566666665422 11223344556778899886 232111 1111123456788999999 Q ss_pred HHHHhhhhhcccceEeeC-----CHHH----HHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC----CeeEEEE Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAE-----DETL----NDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG----DKIRVAF 146 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~-----d~~~----~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~----~~~~i~~ 146 (527) ++...++...-.+.+.+. |... +..++.+.+.+++......+..+++..|.+|++++||. +.+.+.+ T Consensus 105 i~~v~g~~~~nr~~~~~~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~af~d~~~~G~G~~~v~~d~~~~~~~~~~~~ 184 (776) T protein:vir:93 105 VNWIIGSEKRGRSDFKVLPRRKDGGKAAERKTALLKYLSDVNHTPFERSMAFEETTKAGIGWLESQVQDENDGEPIYAGA 184 (776) T ss_pred HHHHHHHHHhCCcceEEecCChhHHHHHHHHHHHHHHHHHhhcHHHHHHHHHHHhhhcCcceEEEEeeccCCCCceEeec Confidence 999888877776666652 3333 44556677788999999999999999999999999973 4566778 Q ss_pred EcCCceEEEE-EcCCceEEEEEEEE--EEeeCC-------------------------CcceEEEEEEEEeecccccccc Q lcl|NC_019418. 147 IQAPVFLPLQ-SNTQDVSSAAILTK--TIKTEN-------------------------RKNVYYTLVEFHEWVTPTGQEV 198 (527) Q Consensus 147 v~a~~~~P~~-~d~~~~~~~a~~~~--~~~~~~-------------------------~~~~~yt~lE~h~~~~~~~~~~ 198 (527) +++..||+=. ...-....|-++.+ ++..+. ....+..++..|.....+.... T Consensus 185 ~~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 264 (776) T protein:vir:93 185 ESWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDIDGDDAMDSPEYERSMNSVT 264 (776) T ss_pred cChhheeeccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccchhccccccccccccccccccccc Confidence 8998888511 10001112222111 000000 0000000000000000000000 Q ss_pred eeeecCC-ceEEEEEEEe----------cCCccccC---------ceeecccccCCccc-------ceeecC-------- Q lcl|NC_019418. 199 GSTKDKS-LYRITNELYK----------STSDSQLG---------ERVNLSELYPDLQP-------VTPIQG-------- 243 (527) Q Consensus 199 ~~~~~~~-~~~I~n~ly~----------~~~~~~lG---------~~v~l~~~~~~l~~-------~~~~~g-------- 243 (527) ....... ......++|. +...+.-+ ..+.+..-...+.. ...+.| T Consensus 265 ~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~~g~~~l~~~~ 344 (776) T protein:vir:93 265 AGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAVSPMMRMHCAIMTTRDLMWAGP 344 (776) T ss_pred ccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceeehheeeeeeEEEEEecchhhhccC Confidence 0000000 0000111111 00000000 00000000000000 000111 Q ss_pred ----CCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCccccc Q lcl|NC_019418. 244 ----LSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGNIAF 319 (527) Q Consensus 244 ----~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~~~~ 319 (527) ..+.+|+.|. ......+.+|.|++..+++.++.+|...|++.+-+ +..+++++.+.+...... .. T Consensus 345 ~p~~~~~~Pfv~~~----~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l--~~~~~~~~~gav~~~d~~-----~~ 413 (776) T protein:vir:93 345 SPYRHNRYPFTPIW----GFRRARDGMPYGVIRFMRGMQDDVNKRLSKALYIL--STNKVLMEEGAVDDIDEF-----RR 413 (776) T ss_pred CCCCCCccceEEec----CceecccccccchHHhhhHHHHHHHHHHHHHHHhh--cCCceeeccccccchHHH-----HH Confidence 0122344332 22334566799999999999999999999998865 455688877765321100 00 Q ss_pred ccccccccceeeeccCCCCCCCcceE-eccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHH Q lcl|NC_019418. 320 KRRFDVEQNVYMQVGAGNMDSGGIVD-LTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTY 398 (527) Q Consensus 320 ~~~~d~~~~~~~~~~~~~~~~~~i~~-~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~ 398 (527) ....++ .+.. ..++....+.. ..+.+. ..++..++.....|...+|++...+|..++ ..|+.+|.+...... T Consensus 414 -~~~rp~-~vi~---~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n-~~Sg~ai~~~~~~~~ 486 (776) T protein:vir:93 414 -EAARPD-AVMT---VKNGKLGAVKMDVDRDLA-PAHLELASRSIQMIQQVGGVTDEMLGRTTN-AVSGVAIQARQEQGS 486 (776) T ss_pred -hcccCC-ceee---eCCccccccccccCcCcc-HHHHHHHHHHHHHHHHhhCcChHHhCCCcc-hhhHHHHHHHHHHHH Confidence 000000 1111 11222222221 134454 568888888888899999999999997654 456777877777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhh-c------ccCCc-cc----------------CccceEEEeCCCccCCHHHH Q lcl|NC_019418. 399 QMRNSIVALVEQSIKELCVSMCELGKVV-G------IYRGT-IP----------------ELDDISVNLDDGVFTDRHAE 454 (527) Q Consensus 399 ~~~~~~~~~~~~al~~li~~il~~~~~~-~------~~~~~-~~----------------~~~~v~v~f~d~i~~d~~~~ 454 (527) .....+...+..+++++.+.++.+...+ . +.+.. .. ..++|+|.=+.+.+.-+++. T Consensus 487 ~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v~~~~~~~s~r~~~ 566 (776) T protein:vir:93 487 VATNKLFDNLRLAFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFIIDEAEWRATMRQAA 566 (776) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEEeecccchhHHHHH Confidence 7778888888888888888888775432 0 00100 00 11233333222221112333 Q ss_pred HHHHHHHHhcCCCCHHH------HHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCC-----------------CC Q lcl|NC_019418. 455 LDYWMKMVAAGFATQKR------GIAKTLGITEEEAEKELAEINGELPPESDAELALYGKG-----------------QQ 511 (527) Q Consensus 455 ~~~~~~~~~aGi~s~~~------~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~-----------------~~ 511 (527) .+.++++. +.+.+.. .+.++-++.. +.+.++++++.+.+............ .. T Consensus 567 ~~~l~ql~--~~~~p~~~~~~~~~~~e~~d~p~--~~e~~~~l~~~~~~~~p~q~~~~~e~~~~qq~q~~~~q~q~~~~~ 642 (776) T protein:vir:93 567 VAELMEVI--GKMPPEIALTMLDLLVENMDIPN--RDELVKRIRAVNGQKDPDQDEPTPEEIAREQAQQQQQQYNDALAI 642 (776) T ss_pred HHHHHHHH--hhcChhhHHHHHHHHHHhcCccc--hHHHHHHHHHhhcccccchhhcchhHHHHHHHhhHHHHHHHHHhh Confidence 44444443 2222221 1122212211 11122222221111000000000000 00 Q ss_pred CCCC-CCCCCCCccccC Q lcl|NC_019418. 512 NTVG-NSKDTVDDEDEA 527 (527) Q Consensus 512 ~~~~-~~~~~~~~~~~~ 527 (527) .... ..-.....+-+. T Consensus 643 a~~~~~qa~a~~~~aea 659 (776) T protein:vir:93 643 ATLEEQQAKARKAAAEA 659 (776) T ss_pred hhhhHhhHHHHHHHHHH Confidence 0000 000000000000 No 83 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.52 E-value=1.7e-13 Score=90.65 Aligned_cols=427 Identities=10% Similarity=0.068 Sum_probs=197.4 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcC--CCcccccccccCcccc-------Ccee Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQS--KFDDIEYTNTDGDRKR-------RKMQ 71 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g--~~~~l~~~~~~~~~~~-------~~~~ 71 (527) |.=+..-+ .+++.-... .+. +..--.| ....-..+...+-.+. .-+. T Consensus 1 ~~~~~~a~--------------------~~~~~~~a~--~~~--~~~~~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~ 56 (461) T protein:vir:80 1 MYSIDKAK--------------------QAKIDSKIV--NRN--DFMVGHGKANSRDKLTRQTPGNGQKLDLKACENLYA 56 (461) T ss_pred Cccchhhh--------------------hhhhhhhhh--hhh--HHHhhcCCcchhhhhhccccCcccccCHHHHHHHHH Confidence 21111111 011110000 000 0000111 0000000011110000 0112 Q ss_pred ecchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcCCc Q lcl|NC_019418. 72 HLPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQAPV 151 (527) Q Consensus 72 ~lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a~~ 151 (527) +-.|++.+|+..|..++.+...|+++++...+.+++.+++.+++..+.+++..+..+|++++.+-+..++.+ .+.. T Consensus 57 ~~~l~r~iVd~~a~d~~r~g~~i~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~----~~~~ 132 (461) T protein:vir:80 57 SNSIAMNIVDIISEDMVRAGWSLKTDNKEMKKNIESKWRKLKTKDRFQKLYADKRLYGDGFLSIGVVSSNRE----QADL 132 (461) T ss_pred hCCccchhhccchHHhhcCCeeeecCCHHHHHHHHHHHHHhhHHHHHHHHHHhhcccccEEEEEEeecCCcc----ccCc Confidence 337889999999999999999999999888888888888889999999999999999998888776443321 1222 Q ss_pred eEEEEEcCCceEEEE-EEEEEE------eeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEe--cCCcccc Q lcl|NC_019418. 152 FLPLQSNTQDVSSAA-ILTKTI------KTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYK--STSDSQL 222 (527) Q Consensus 152 ~~P~~~d~~~~~~~a-~~~~~~------~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~--~~~~~~l 222 (527) .-||...+-+.+..+ .+|... ..+.....|+. -..|.|...-.- ....+.. T Consensus 133 ~~pl~~~~~~~~~~l~~~~~~~i~~~~~~~dp~sp~fg~--------------------P~~y~i~~~~~~~~~~~~~~~ 192 (461) T protein:vir:80 133 STAIDPKTIKSIPYINTFNTQKVTQLYLNQDMFSEHFGE--------------------VEFFEVNRVSQLGEEILSGTT 192 (461) T ss_pred cCCcccccccceeEEEeccccccchhhhcccCcCccccc--------------------ceEEEEecccccccccccccc Confidence 233322221111111 111110 00000000110 011111100000 0000001 Q ss_pred CceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHH-cCcceeee Q lcl|NC_019418. 223 GERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIK-MGQRRVIV 301 (527) Q Consensus 223 G~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~-~~~~~i~v 301 (527) |. ....++. .| +.+|. |....+..+|+|++..+.+.+.+++.+.-....-+. ..-..+.+ T Consensus 193 ~~------------~~~~iH~-SR--ii~~~----~~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~ 253 (461) T protein:vir:80 193 AS------------TSEQIHR-SR--IIHEQ----GLRFEGETKGRSIFESLYDIITVMDTSLWSVGQILYDFAFKVYKT 253 (461) T ss_pred Cc------------cceEEcc-cc--EEEec----CCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCCceec Confidence 10 0111111 12 12232 222223457999999999999999988866654443 33333333 Q ss_pred chhHhcCCCCCCCccccccccccc---ccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccc-c Q lcl|NC_019418. 302 PEQMTQLKVQDNQGNIAFKRRFDV---EQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGM-F 377 (527) Q Consensus 302 ~~~~l~~~~~~~~~~~~~~~~~d~---~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~-~ 377 (527) + .+-....+... . ....++. ...++. .+.+ ..++.++.++ .-....++.+...|+..++++... | T Consensus 254 ~-~l~~~~~~~~~-~--~~~~~~~~~~~~g~~~---~d~~--e~~e~~~~~l--sgl~~~l~~~~~~iaa~s~iP~t~L~ 322 (461) T protein:vir:80 254 D-DIDALNKDDKA-N--LTAMLDFMFRTEALAI---IKGD--EQLTKESTNV--SGMKDLLDYGWDYLAGAVRMPKTVLK 322 (461) T ss_pred c-hHHhhhchHHH-H--HHHHHHHhcCCceEEE---EcCC--cceEEEecCc--CCHHHHHHHHHHHHhhhhcCCeeeee Confidence 2 11100111111 0 0111111 111211 1222 2366666543 345677778888899999999865 5 Q ss_pred cccccccchHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHH Q lcl|NC_019418. 378 TFDGQGVKTATEIVSENSDTYQMRNSIV-ALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELD 456 (527) Q Consensus 378 ~~~~~g~~TAtei~s~~~~~~~~~~~~~-~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~ 456 (527) |...++.+|+.+= .+..+..++.+| ..++..|+.|+..|+.-. ..+.....+...+++|.|++-...+..+.++ T Consensus 323 G~s~g~~asge~D---~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~--~~~~~~~~p~~~~~~i~f~~L~~~s~kekAe 397 (461) T protein:vir:80 323 GQEAGTLTGAQYD---VMNYYARVSSIQENRLRPQLEYLTRLLMWAS--DDCGPSIDPDSFEWAIEFNPLWNLDSKTDAE 397 (461) T ss_pred cccCCccccchHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--cccccccCccccceEEEeCCCCCCCHHHHHH Confidence 7666666665532 234556666666 457889999988876532 1223334455578999999988888877665 Q ss_pred H-------HHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCC-CCCCCCCCCCCccccC Q lcl|NC_019418. 457 Y-------WMKMVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQ-NTVGNSKDTVDDEDEA 527 (527) Q Consensus 457 ~-------~~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 527 (527) . +.+++.+|++|.++ +++++.. +..... ...+++...+ .+..+.....+.++++ T Consensus 398 ~~~~~a~a~~~~~~~g~is~~e------------~r~~l~~---~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~e~~ 459 (461) T protein:vir:80 398 VRKLTAEADQIYIVNGVLDPDE------------VKETRFG---RFGLEN--SSKFSGDSAEIDKLAKLVYDAYAKKNA 459 (461) T ss_pred HHHHHHHHHHHHHhcCCCCHHH------------HHHHHHH---hcCCCC--CccCCCCCchhhhhhhhccccccccCC Confidence 5 34445555555554 4333321 000000 0011111111 1111111112222222 No 84 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=99.52 E-value=4.1e-13 Score=88.51 Aligned_cols=473 Identities=14% Similarity=0.119 Sum_probs=199.5 Q ss_pred CChH---HHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHH Q lcl|NC_019418. 1 MSLI---QKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIAR 77 (527) Q Consensus 1 m~~~---~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~ 77 (527) |.-= +.|++.++.. ... .+.+......+|..||.|++..... .| +.....+.-. T Consensus 10 ~~~~~~~~~~~~~~~~a------------~~~----~~~~~~~~~~~~~~~y~g~~~~~~~---~~----~s~~~~~~v~ 66 (705) T protein:vir:88 10 MDDEQVLRHLDQLVNDA------------LDF----NSSELSKQRSEALKYYFGEPFGNER---PG----KSGIVSRDVQ 66 (705) T ss_pred CCHHHHHHHHHHHHHHH------------Hhh----hhhHHHHHHHHHHHHHhCCCCCccc---CC----CCccccHHHH Confidence 2211 1111221110 000 1122233456677899998654321 11 2222334333 Q ss_pred HHHHHHhh----hhhcccceEee-----CCHHHHH----HHHHH-HhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC---- Q lcl|NC_019418. 78 TAAKKIAS----LVYNEQAEISA-----EDETLND----FLSDM-LSNDRFNKNFERYLESALALGGLAMRPYVDG---- 139 (527) Q Consensus 78 ~i~~~~A~----ll~~e~~~i~~-----~d~~~~~----~l~~~-l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~---- 139 (527) ..++.+.. .+|+-...+.+ +|...++ +++-+ .+.|+....+..++.+|+..|.+++++||+. T Consensus 67 ~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~~~ 146 (705) T protein:vir:88 67 ETVDWIMPSLMKVFTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKP 146 (705) T ss_pred HHHHHHHHHHHHhhcCCCceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccccch Confidence 33344333 34443333433 3444444 44443 3345556778899999999999999999943 Q ss_pred ---------------------------------------------CeeEEEEEcCCceEEEEEcCCceEEEEEEE-EEEe Q lcl|NC_019418. 140 ---------------------------------------------DKIRVAFIQAPVFLPLQSNTQDVSSAAILT-KTIK 173 (527) Q Consensus 140 ---------------------------------------------~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~-~~~~ 173 (527) ++++|+.|+|..|++=. +..+.-.+-++. +.+. T Consensus 147 ~~e~~~~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp-~a~~~~d~~~~~~~~~~ 225 (705) T protein:vir:88 147 TFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDR-LATCIDDARFLCHREKY 225 (705) T ss_pred hhhhhccCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceecC-CCCCcccCcEEEEEEec Confidence 45788889999988421 112222332221 1111 Q ss_pred eCCCcc-eEEE-----EEEEEeeccccccc----------------ceeeecCCceEE-EEEEEecCCccccCceeeccc Q lcl|NC_019418. 174 TENRKN-VYYT-----LVEFHEWVTPTGQE----------------VGSTKDKSLYRI-TNELYKSTSDSQLGERVNLSE 230 (527) Q Consensus 174 ~~~~~~-~~yt-----~lE~h~~~~~~~~~----------------~~~~~~~~~~~I-~n~ly~~~~~~~lG~~v~l~~ 230 (527) +..+-. .+|- .+..|++....... ...+.......| .|+.|...+...-|...+..- T Consensus 226 t~~dl~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~~~d~~~d~~~~~~~~ 305 (705) T protein:vir:88 226 TVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLDVDGDGISELRRI 305 (705) T ss_pred cHHHHHhhcCChhHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEEeeeEecccCCcceeeEEE Confidence 100000 0000 01111110000000 000000011112 244443222111121111111 Q ss_pred ccCC--cccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHc-CcceeeechhHhc Q lcl|NC_019418. 231 LYPD--LQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKM-GQRRVIVPEQMTQ 307 (527) Q Consensus 231 ~~~~--l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~-~~~~i~v~~~~l~ 307 (527) .+.+ +. .+..+.+++|+.+++ . ..+++.||.|+++.+.++++.+|..++++++.+.. ..++++++.+++. T Consensus 306 ~~~g~~il---~~~~~~~~PF~~~~~-~---p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~~g~v~ 378 (705) T protein:vir:88 306 LYVGDYII---SNEPWDCRPFADLNA-Y---RIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQVN 378 (705) T ss_pred EEeCcccc---ccccCCCCCEEEecc-e---eecCccccCChHHHHhHHHHHHHHHHHHHHHHHHhccCCceeccccccC Confidence 1111 10 112334566765542 1 13356789999999999999999999999988854 7778888888763 Q ss_pred CCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccc---ccc Q lcl|NC_019418. 308 LKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDG---QGV 384 (527) Q Consensus 308 ~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~---~g~ 384 (527) ..... . ..+..-+.. . ..+.|..+.+.=....+...++.+...+...+|++.-..|.++ ++. T Consensus 379 ~~d~~-~--------~~pg~vv~~--~----~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~ 443 (705) T protein:vir:88 379 LEDLL-T--------NEAAGIVRV--K----SMNSITPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSN 443 (705) T ss_pred ccccc-c--------cCCCeeEEe--c----CCCccccccCCcCcHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccch Confidence 21110 0 011111111 1 1123555543322334566677767778888999988887553 345 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhhhcccCCcc----------------cCccceEEEeCCCc Q lcl|NC_019418. 385 KTATEIVSENSDTYQMRNSIVALVE-QSIKELCVSMCELGKVVGIYRGTI----------------PELDDISVNLDDGV 447 (527) Q Consensus 385 ~TAtei~s~~~~~~~~~~~~~~~~~-~al~~li~~il~~~~~~~~~~~~~----------------~~~~~v~v~f~d~i 447 (527) .||++|....+..-.....+.+.|. .++++|++.++.+...+ ++... ....++.++-+.+. T Consensus 444 ~Ta~~i~~~~~~~~~r~~~~~r~~a~~~~~~l~~~~~~li~~~--~~~~~~~ri~g~~v~v~~~~~~~~~~v~v~v~~~~ 521 (705) T protein:vir:88 444 QAAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKY--QNQEEVFQLRGKWVAVNPANWRERSDLTVTVGIGN 521 (705) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--CCCceEEeeccchhccchHhhccCCceEEeecccc Confidence 7899887777766666777777774 56788888877765432 11110 01122333222111 Q ss_pred cCCHHHHHHHHHHHHhcCCCCHHHHHHhc---CC-CCHHHHHHHHHHHHHhccccccccc-CCCCCCC--CCCCCCCCCC Q lcl|NC_019418. 448 FTDRHAELDYWMKMVAAGFATQKRGIAKT---LG-ITEEEAEKELAEINGELPPESDAEL-ALYGKGQ--QNTVGNSKDT 520 (527) Q Consensus 448 ~~d~~~~~~~~~~~~~aGi~s~~~~i~~~---~~-~~deea~~el~ri~~E~~~~~~~~~-~~~~~~~--~~~~~~~~~~ 520 (527) .+.+...+....+.. ....+... .+ .+.....+.+.++.+.......... ..+...+ +......-.. T Consensus 522 -~~~eq~~a~l~~ll~-----~~q~l~~~~~~~~~~~~~~~~~~~~el~e~~~~k~~~~~~~~~~~~e~~~~~~~~~q~e 595 (705) T protein:vir:88 522 -MNKDQQMLHLMRIWE-----MAQAVVGGGGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKE 595 (705) T ss_pred -chHHHHHHHHHHHHH-----HHHHhhcccchhhhcChHHHHHHHHHHHHhhhhhhHHHHhhhhhhHHHHHHHHhhhhhh Confidence 112222222222211 00000000 00 1111111111111100000000000 0000000 0000000000 Q ss_pred CC-------ccccC Q lcl|NC_019418. 521 VD-------DEDEA 527 (527) Q Consensus 521 ~~-------~~~~~ 527 (527) .. .+-+. T Consensus 596 ~~~~~~~~~~q~e~ 609 (705) T protein:vir:88 596 AQPKPEDIKAQADA 609 (705) T ss_pred hhHHHHHHHHHHHH Confidence 00 00000 No 85 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=99.49 E-value=3.2e-12 Score=83.63 Aligned_cols=435 Identities=13% Similarity=0.081 Sum_probs=209.7 Q ss_pred hhcccchhhhccCccc-cCHHHHHHHHHHHH---HhcCCCc------ccccccc-cCcc--ccCc--eeecchHHHHHHH Q lcl|NC_019418. 18 MTTSHLSSILDHPKVA-VTQSEFRRIQHNLA---YYQSKFD------DIEYTNT-DGDR--KRRK--MQHLPIARTAAKK 82 (527) Q Consensus 18 ~~~~~~~~~~~~~~i~-~~~~~~~~i~~~~~---~y~g~~~------~l~~~~~-~~~~--~~~~--~~~lnl~~~i~~~ 82 (527) |.. .+.+ ..+|+ ..+++......|+. .|.|..- .+.++.. .++. +.|. -.-.|+.+.+++. T Consensus 1 ~~~-~~~~---~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~ 76 (491) T protein:vir:95 1 MLT-ANGQ---GSGVKTKHREWLHYAPKWQKVRHALAGDLVGYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSG 76 (491) T ss_pred Ccc-cCCc---cCCCCccCHHHHHHHHHHHHHHHHhcCcchhhcccCCCcCCCCCCCHHHHHHHHhcccCCChHHHHHHH Confidence 211 1111 12442 36677777777864 4777321 1111111 1111 1111 1223899999999 Q ss_pred HhhhhhcccceEeeCCHHHHHHHHHH-HhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-------------CeeEEEEEc Q lcl|NC_019418. 83 IASLVYNEQAEISAEDETLNDFLSDM-LSNDRFNKNFERYLESALALGGLAMRPYVDG-------------DKIRVAFIQ 148 (527) Q Consensus 83 ~A~ll~~e~~~i~~~d~~~~~~l~~~-l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-------------~~~~i~~v~ 148 (527) ++.++|.++|++++.+ .+..+++++ ..-+++...++.++..++.+|.+++.|=+.. -+|-+..+. T Consensus 77 l~G~vfrk~p~~~~p~-~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~Ade~~~~~rPy~~~~~ 155 (491) T protein:vir:95 77 MVGSVMRKEPEINIPK-ELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAAATAAEQNAGLLNPTIAFYT 155 (491) T ss_pred HhchhhcCCceeeccH-HHHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcccCHHHHHHhcCCcEEEEec Confidence 9999999999987653 344444322 1235688889999999999999998876531 147899999 Q ss_pred CCceEEEE---EcCCceEEEEEEEEEEee-CC------CcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCC Q lcl|NC_019418. 149 APVFLPLQ---SNTQDVSSAAILTKTIKT-EN------RKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTS 218 (527) Q Consensus 149 a~~~~P~~---~d~~~~~~~a~~~~~~~~-~~------~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~ 218 (527) |++++=-. .++.+++.-+.+.+.... +. +....|..|+.- ..+....++|+... T Consensus 156 ~~~IinW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~~~~~qyRvL~l~----------------~~g~~~~~v~r~~~ 219 (491) T protein:vir:95 156 TENIVNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFETKYGEQYRVLDID----------------TDGNYRQRLFRFDA 219 (491) T ss_pred hhhhcCceeeeeCCceeeeEEEEEEeEEeecCCCCcccceEEEEEEEeec----------------CCCceEEEEEEEcC Confidence 99987421 122234555655554322 11 112233334321 12233345565322 Q ss_pred ccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHH----c Q lcl|NC_019418. 219 DSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIK----M 294 (527) Q Consensus 219 ~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~----~ 294 (527) +... .....+..++ .--+.+...+|+++-.. ++... -|.+-|-++-. ||..+-+..-+++ . T Consensus 220 ~g~~--~~~~~~~~~~----~g~~~l~~IPfv~~~~~--~~~~~---~~~pPLl~LA~----lni~Hy~~ssd~~~~l~~ 284 (491) T protein:vir:95 220 EGGA--QEEVVEIYPD----LGESLRGVIPFTFIGAT--NNDAT---IDDAPLLPLAE----LNIGHYRNSADNEESSFV 284 (491) T ss_pred CCcc--eeeeeeeeec----CCCcccCeeEEEEEecC--CCCCC---CCcCchHHHHH----HHHHHhhhhhHHHHHHHH Confidence 1111 0000011110 01134555666666432 22111 12333332222 2433333332332 2 Q ss_pred -CcceeeechhHhcCCCCCCC-c--ccccc--cccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHH Q lcl|NC_019418. 295 -GQRRVIVPEQMTQLKVQDNQ-G--NIAFK--RRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEM 368 (527) Q Consensus 295 -~~~~i~v~~~~l~~~~~~~~-~--~~~~~--~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~ 368 (527) +.+..++.. .+..+ + ....+ -.+.... ++..+.+.. ...+++....-. .+.|+.+-.++ . T Consensus 285 ~~~P~l~~~G------~d~~~~~~~~~~~~~~i~~g~~~----~~~lP~~~~--~~~ie~~~~~~~-~~~l~~~e~qm-~ 350 (491) T protein:vir:95 285 VGQPTLFIYP------GDNLTPQSFKEANPNGIKFGSRC----GHNLGYGGS--AQLIQAGENNLA-RQNMLDKEQQA-I 350 (491) T ss_pred cccceeeeec------CcccCcchhhccCcceeEecCcC----CcCCCCCCc--cceeecCcchHH-HHHHHHHHHHH-H Confidence 333333311 11000 0 00000 0011100 112222222 333333322211 33343333322 1 Q ss_pred hcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCc--cceEEEeCCC Q lcl|NC_019418. 369 QIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPEL--DDISVNLDDG 446 (527) Q Consensus 369 ~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~--~~v~v~f~d~ 446 (527) ..|. ..+. .++.+|||+.....+...+....+...++.||.++++.++.+... ..... ..++.+|... T Consensus 351 ~~Ga--~l~~--~~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w~G~------~~~~~v~i~~n~dF~~~ 420 (491) T protein:vir:95 351 QIGA--QLIT--PSQQITAESARIQRGADTSVMATIARNVSQAYTDALRWVAMMLGK------PEDSEVEFQLNMDFFLQ 420 (491) T ss_pred HHHH--Hhcc--CCcchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCC------CCCCceEEEeecccccc Confidence 1221 2222 234689999888888888888888899999999998888766321 11122 2345566543 Q ss_pred ccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCHHHHHHHHHHHHHhcccc---cccccCCCCCCCCCCC Q lcl|NC_019418. 447 VFTDRHAELDYWMKMVAAGFATQKRGIAKT--LGITEEEAEKELAEINGELPPE---SDAELALYGKGQQNTV 514 (527) Q Consensus 447 i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~--~~~~deea~~el~ri~~E~~~~---~~~~~~~~~~~~~~~~ 514 (527) ..+. ..++...++..+|.+|.++++..+ -++.+.+.+++..+|++|..+. ....+.++...+.... T Consensus 421 -~~~~-~~~~all~~~~~G~is~~t~~~~L~~~~vl~~~~e~~~~~ie~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (491) T protein:vir:95 421 -PMTA-QDRAAWMADINAGLLPATAYYAALRKAGVTDWTDEDILNAIEDAPLPSGAVTQVAGEIPQAAQQQQE 491 (491) T ss_pred -cCCH-HHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCccHHHHHHHHHhcCCCCCccccccccchhhhhhccC Confidence 2343 467888999999999999987643 2555555667777887765321 1111122221121111 No 86 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.45 E-value=3.1e-12 Score=83.70 Aligned_cols=401 Identities=10% Similarity=0.056 Sum_probs=191.9 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) |++.+.+.++.-++|..- .+..... .....++ ...-...| .+-.+++.+| T Consensus 1 ~~~~D~~~~~~~~~g~~~-~~~~~~~--~~~~~~~------~~~l~a~Y---------------------~~~~l~~~~v 50 (437) T protein:vir:52 1 MKFFDGIKSLALKLGSKQ-EQTYYSP--SLSLTDD------LVQLEALW---------------------RDNWIANKVC 50 (437) T ss_pred CchhhhhHhHHhcCCCcc-ccceeec--Ccccccc------HHHHHHHH---------------------HhCchhhHHh Confidence 999999988865444110 0000000 0000001 01111122 1237889999 Q ss_pred HHHhhhhhcccceEeeCC--HHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeE-----------EEEE Q lcl|NC_019418. 81 KKIASLVYNEQAEISAED--ETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIR-----------VAFI 147 (527) Q Consensus 81 ~~~A~ll~~e~~~i~~~d--~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~-----------i~~v 147 (527) +..|.-++.+...|+.++ +...+.+++.+++-+++..+.+++..+-.+|++++.+..|+..+. +.++ T Consensus 51 d~~a~d~~r~~~~i~~~d~~~~~~~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~ 130 (437) T protein:vir:52 51 IKRPEDMVRNWREIYSNDLNSKQLDLFTKFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIIL 130 (437) T ss_pred hcchHHhhcCCceEecCCCCHHHHHHHHHHHHhhcHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEe Confidence 999999999999998764 334457888888878999999999999999999888877753211 1112 Q ss_pred cCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceee Q lcl|NC_019418. 148 QAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVN 227 (527) Q Consensus 148 ~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~ 227 (527) ++.++-|...... ......|+ ....|+|. + ... +..|. T Consensus 131 ~~~~v~~~~~~~~--------------dp~s~~fg--------------------~p~~y~v~-----~--~~~-~~~iH 168 (437) T protein:vir:52 131 PKWKISPTGTKDD--------------DVLSPNFG--------------------RYSEYSIL-----G--GSQ-SITVH 168 (437) T ss_pred chhhccccccccc--------------cccccccC--------------------cceEEEEe-----c--CCc-ceeEc Confidence 2222111100000 00000011 00111221 0 000 00010 Q ss_pred cccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHH-cCcceeeec--hh Q lcl|NC_019418. 228 LSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIK-MGQRRVIVP--EQ 304 (527) Q Consensus 228 l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~-~~~~~i~v~--~~ 304 (527) +. ..+.+.| .+.|.++ ..-+|+|++..+.+.|..++.+--....-+. .....+-++ .+ T Consensus 169 ----~S---Rii~~~~--------~~~~~~~----~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~v~k~~~l~~ 229 (437) T protein:vir:52 169 ----HS---RLIILNA--------NDAPLSD----NDIWGVSDLEKIIDVLKRFDSASVNVGDLIFESKIDIFKIAGLSD 229 (437) T ss_pred ----cc---eeEEecC--------ccCCCcc----ccccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCCceecchHHH Confidence 00 0011111 1122222 2347999999999999999987766554343 333333332 12 Q ss_pred HhcCCCCCCCcccccccccc---cccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccc-cccc Q lcl|NC_019418. 305 MTQLKVQDNQGNIAFKRRFD---VEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGM-FTFD 380 (527) Q Consensus 305 ~l~~~~~~~~~~~~~~~~~d---~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~-~~~~ 380 (527) .+. .+ +.-.....+. ..+.....+-++.++ .++.++.++ ....+.++....+|+..+|++... ||.. T Consensus 230 ~l~---~~--~~~~~~~~~~~~~~~~~~~~~~~~d~~~--~~e~~~~~~--sgl~~~l~~~~~~iaaa~~iP~t~L~G~s 300 (437) T protein:vir:52 230 KIA---AG--MENEVASVISAVQEIKSATNSLLLDAEN--EYDRKELTF--TGLKDLLTEFRNAVAGAADMPVTILFGQS 300 (437) T ss_pred Hhc---CC--cHHHHHHHHHHHHHhcCCCceEEEcCCc--ceEEEecCc--CCHHHHHHHHHHHHHHHhcCchhhhcCcC Confidence 221 11 1100011111 111111111122222 355555442 335566777778889999999765 4666 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHH-- Q lcl|NC_019418. 381 GQGVKTATEIVSENSDTYQMRNSIV-ALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDY-- 457 (527) Q Consensus 381 ~~g~~TAtei~s~~~~~~~~~~~~~-~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~-- 457 (527) .+|.+|+.+=. +..+..++.+| ..++..|+.|+..|+.-+ ++ .. ..+++|.|++-...+..+.++. T Consensus 301 ~~Glasge~D~---~~yyd~i~~~Qe~~l~p~le~l~~~i~~~~-----~g-~~--~~~~~~~f~pL~~~s~kekae~~~ 369 (437) T protein:vir:52 301 VSGLASGDEDI---QNYHEAIRRLQETRLRPIFEIIDPLICNEL-----FG-GL--PADWWFEFVPLTTVKQEQQINMLN 369 (437) T ss_pred cccccccHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----cC-CC--CCcceEEeCCcCCcCHHHHHHHHH Confidence 67776655332 33445555555 457888888888765321 12 22 2368899998877886655544 Q ss_pred -----HHHHHhcCCCCHHHHHHhc-----C-CCCHHHHHHHHHHHHHhcccccccccCCCCCCCC--CCCCCCCCCCCcc Q lcl|NC_019418. 458 -----WMKMVAAGFATQKRGIAKT-----L-GITEEEAEKELAEINGELPPESDAELALYGKGQQ--NTVGNSKDTVDDE 524 (527) Q Consensus 458 -----~~~~~~aGi~s~~~~i~~~-----~-~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 524 (527) +.+++++|+++..+++.++ + ++++++. ++........++.++ ...+.....+.++ T Consensus 370 ~~a~a~~~~~~~g~i~~~e~r~~L~~~g~~~~i~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (437) T protein:vir:52 370 TFATAANTLIQNGVLNEYQIANELRESGLFANISAEHI------------EELKNADEFAGNFEEPEKMEGAQVQNSEDQ 437 (437) T ss_pred HHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCCCCcccc------------ccccCCCCCCCccCCCCCCCCCCCCCCCCC Confidence 4455666777766654332 1 1222211 000000011111111 1111111122222 No 87 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=99.40 E-value=2e-11 Score=79.19 Aligned_cols=489 Identities=12% Similarity=0.074 Sum_probs=194.4 Q ss_pred CChHHHHHHHHHHHHHHhh-cccchhhhccCccccCHHHHHHHHHHH---HHhcCCCcccccc-------cccCccccCc Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMT-TSHLSSILDHPKVAVTQSEFRRIQHNL---AYYQSKFDDIEYT-------NTDGDRKRRK 69 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~-~~~~~~~~~~~~i~~~~~~~~~i~~~~---~~y~g~~~~l~~~-------~~~~~~~~~~ 69 (527) |.| -...+.++...+. ...+....-+.--..-..+......|+ ++|......+.|+ ...++...|+ T Consensus 1 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~rs 77 (651) T protein:vir:80 1 MKL---ATTTTDKNRQTYDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGDVNADWRH 77 (651) T ss_pred Ccc---cccccchhhhhhhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccccccCCCCCCCCc Confidence 111 0000000000000 000000000000000001111122342 2333322222211 1112222344 Q ss_pred eeecchHHHHHHHHhh----hhhcccceEee---CCH----HHHHHHHHHH----hhhhHHHHHHHHHHHHHhcCCEEEE Q lcl|NC_019418. 70 MQHLPIARTAAKKIAS----LVYNEQAEISA---EDE----TLNDFLSDML----SNDRFNKNFERYLESALALGGLAMR 134 (527) Q Consensus 70 ~~~lnl~~~i~~~~A~----ll~~e~~~i~~---~d~----~~~~~l~~~l----~~n~f~~~~~~~~~~a~~~G~~~~~ 134 (527) +...|.....++.+.. .+|+...-+.+ ++. ..++.++.++ .+.+|...+..++.+|+..|.+++| T Consensus 78 ~~~~~~v~~~ve~~~~~l~~~~~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~i~k 157 (651) T protein:vir:80 78 KITTGKAFEAIETIHAYLMSATFPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNSVLA 157 (651) T ss_pred cccChhHHHHHHHHHHHHHHhhcCCCceeEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCceEEE Confidence 4555555555554333 23433322332 122 2334455554 4678998888999999999999999 Q ss_pred EEEeC--------------------------------CeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCC----- Q lcl|NC_019418. 135 PYVDG--------------------------------DKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENR----- 177 (527) Q Consensus 135 ~~~d~--------------------------------~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~----- 177 (527) +|||. +.++|+.|+|..|++= -...++-.|.++.+.+..... T Consensus 158 v~we~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~d-p~a~~~~d~~~v~~~~~t~~~l~~l~ 236 (651) T protein:vir:80 158 LPWRVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFYD-PNVTDPNRGAFIRKLTKTKADILNLL 236 (651) T ss_pred EeecceeeeeehheeccccccccccceeeeccceeeeceeEEEEecHHHeeec-CCCcCccccceeeeeeeeHHHHHHHH Confidence 99962 3478899999999962 223344444443332221110 Q ss_pred cceEEEEE------EE-----Eeeccc--ccccceeeecCCce-EE-EEEEEecCC-ccccCceeecccccCCcccceee Q lcl|NC_019418. 178 KNVYYTLV------EF-----HEWVTP--TGQEVGSTKDKSLY-RI-TNELYKSTS-DSQLGERVNLSELYPDLQPVTPI 241 (527) Q Consensus 178 ~~~~yt~l------E~-----h~~~~~--~~~~~~~~~~~~~~-~I-~n~ly~~~~-~~~lG~~v~l~~~~~~l~~~~~~ 241 (527) ...+|.-+ +. |..... .......+...... .+ .|+.|.-.+ .+.....+-. .+. ....+ T Consensus 237 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~~d~e~~~~~~~~v--~~~---g~~il 311 (651) T protein:vir:80 237 SEGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHLENKTYHDVVV--TIM---GNEVL 311 (651) T ss_pred hcccccchhhHHHHhhhccccccCCccccccccCCCccccccccceEEEEEEEEeeccCCceEEEEE--EEc---CcEEe Confidence 00011000 00 000000 00000000000000 00 111111100 0100000000 000 00000 Q ss_pred --cCCC---cccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHH-cCcceeeechhHhcCCCCCCCc Q lcl|NC_019418. 242 --QGLS---RPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIK-MGQRRVIVPEQMTQLKVQDNQG 315 (527) Q Consensus 242 --~g~~---~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~-~~~~~i~v~~~~l~~~~~~~~~ 315 (527) ...+ ..+|+.++. ...+++.||+|..+.+.+.+..||....++..-.. ...+++.|+++.+....+.. T Consensus 312 ~~~~~~~~~~~Pf~~~~~----~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~~l~-- 385 (651) T protein:vir:80 312 RFEQNPYWCGRPFVIGTY----IPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQPEDVY-- 385 (651) T ss_pred cccccCCCCCCCeeeecc----eecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccHHHhh-- Confidence 0111 113444432 12346789999999999999999999999997775 47777788655432111110 Q ss_pred ccccccccccccceeeeccCCCCCCCcceEecccc-ChHHHHHHHHHHHHHHHHhcCCCcccccccc--cccchHHHHHH Q lcl|NC_019418. 316 NIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPI-RSSDYISAISEGLKLFEMQIGVSSGMFTFDG--QGVKTATEIVS 392 (527) Q Consensus 316 ~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~i-r~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~--~g~~TAtei~s 392 (527) +.+.- +++ ++ ....+..+++.- ....-...++.+...+....|++.-..|.+. .+..|||||.. T Consensus 386 -------~~pg~-vi~-~~----~~~~~~~l~~~~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~ 452 (651) T protein:vir:80 386 -------TEPGK-VFL-VS----DHGDLQPLANQSSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAA 452 (651) T ss_pred -------cCCCc-eEE-ec----CCCCceeeccCcccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHH Confidence 11111 111 11 112244444321 1122335566666677788888876666543 35689999998 Q ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhhhc-------ccCCcc-------cCccceEEEeCCCccCCHH----- Q lcl|NC_019418. 393 ENSDTYQMRNSIVALVEQ-SIKELCVSMCELGKVVG-------IYRGTI-------PELDDISVNLDDGVFTDRH----- 452 (527) Q Consensus 393 ~~~~~~~~~~~~~~~~~~-al~~li~~il~~~~~~~-------~~~~~~-------~~~~~v~v~f~d~i~~d~~----- 452 (527) ..+........+.+.|.. .+..|++.++.+...+. +.+... ....+++++++- ++.... T Consensus 453 ~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~i-v~~g~~~~~~r 531 (651) T protein:vir:80 453 VREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRL-VPIGSDHVIER 531 (651) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCccceeeeeee-eeccHHHHHHH Confidence 888888888888888876 67888888887764321 111100 001123333321 122221 Q ss_pred -HHHHHHHHHHhcCCCCHHHHHHhcCCCCHH-HHHHHHHHHHHhcccccccccCCCCCCCCCCC--CCCCCCCCcc---- Q lcl|NC_019418. 453 -AELDYWMKMVAAGFATQKRGIAKTLGITEE-EAEKELAEINGELPPESDAELALYGKGQQNTV--GNSKDTVDDE---- 524 (527) Q Consensus 453 -~~~~~~~~~~~aGi~s~~~~i~~~~~~~de-ea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~---- 524 (527) ..++.+.++.+ .+...+.+... .....++++-+..+-.+.... +....++.+. +......... T Consensus 532 ~~~~~~l~~~~q--------~~~~~p~~~~~~~~~~~~~~l~~~~g~~~~~~~-l~~~~q~~~~~~~~~~~~q~~~~~~~ 602 (651) T protein:vir:80 532 KQYIEDRLTFIQ--------AVAQVPEMGQLVDYKRILVDLLQHWGFEEPEAY-LKQQDQQAPANPQEALLSQAKDVGGQ 602 (651) T ss_pred HHHHHHHHHHHH--------hhccCCccchhhhHHHHHHHHHHHcCCCCcHHh-cCCCccchhhhhhHHHHhhHHHHHHH Confidence 11222222221 11111111110 111222222222211111100 0000000000 0000000000 Q ss_pred ccC Q lcl|NC_019418. 525 DEA 527 (527) Q Consensus 525 ~~~ 527 (527) -++ T Consensus 603 a~~ 605 (651) T protein:vir:80 603 AMS 605 (651) T ss_pred HHH Confidence 000 No 88 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=99.34 E-value=4.1e-11 Score=77.55 Aligned_cols=467 Identities=12% Similarity=0.069 Sum_probs=199.2 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCcc-ccCHHHHHHHHHHHH---HhcCCCcccccccccCccccCceeecchH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKV-AVTQSEFRRIQHNLA---YYQSKFDDIEYTNTDGDRKRRKMQHLPIA 76 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~i~~~~~---~y~g~~~~l~~~~~~~~~~~~~~~~lnl~ 76 (527) |+ .++|+. .+|.. ..++.... ..++ .....+...+++|+. ||... ...... ..+-+++.++.+| T Consensus 1 ~~--~~~~~~-~~~~~---~~~~~~~v-~~~~~~~~~~r~~~~~~w~el~~y~~a~---~~~~~~--~~~~~~r~~~~~~ 68 (584) T protein:vir:95 1 MS--VKVAEL-NSLLV---RDSSAQWV-AYLWDRFNNQRRQKIEEWKELRNYVFAT---DTTTTS--NQGLPWKNSTTLP 68 (584) T ss_pred CC--cchhhh-hhhcc---ccchHHHH-HHHHHHHHhhhchhhccCHHHHHHHHhh---hhhhhh--hcccccccccchh Confidence 22 222221 11110 00000000 0000 001112222333432 33331 111112 2223344555555 Q ss_pred H------HHHHHHhhhhhcccceEee-----CCHH--HHHHH----HHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC Q lcl|NC_019418. 77 R------TAAKKIASLVYNEQAEISA-----EDET--LNDFL----SDMLSNDRFNKNFERYLESALALGGLAMRPYVDG 139 (527) Q Consensus 77 ~------~i~~~~A~ll~~e~~~i~~-----~d~~--~~~~l----~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~ 139 (527) | .|+..+-+.+|+..-=+++ ++.+ .++.+ ++-|.+.+|...+...+.++..+|.+++|++|.. T Consensus 69 k~~~~~~~i~~~l~~~~Fp~~~w~~~v~~~~~~~~~~~~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~~~~ 148 (584) T protein:vir:95 69 KLCQIRDNLHSNYFSSLFPNDDWLRWVGYGKGDSTKTKAKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVSFEA 148 (584) T ss_pred HHHHHHHHHHHHHHHhhcCccceeeeecCCCchhhHHHHHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEeEee Confidence 4 3333444456654322221 1221 24444 4445677999999999999999999999999975 Q ss_pred C--------------eeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCC--------CcceEEEE----EEEEe---- Q lcl|NC_019418. 140 D--------------KIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTEN--------RKNVYYTL----VEFHE---- 189 (527) Q Consensus 140 ~--------------~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~--------~~~~~yt~----lE~h~---- 189 (527) + +++|+.++|..+||= -.......+.++.+.+.+-. ++..||-. -+.|+ T Consensus 149 ~~~e~~e~~~v~~~~~prieriSP~d~~~D-psa~~i~d~~fivrs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~~ 227 (584) T protein:vir:95 149 KYKEMTDGTLVPDYIGPRLVRISPLDIVFN-PLATSISDTFKIVRSVKTKGELMRLAQDEPEQSYWLEALKRREEICRHL 227 (584) T ss_pred cceeeeccccccccccceEEeeChhheeec-CCCCCccchhhhhhhhhhHHHHHHHHhhcCccccchHHHHHHHHhccCC Confidence 5 589999999998861 11222333443332211000 00011100 00011 Q ss_pred eccccc---ccc--------eeeecCCceEEEEEEEec-CCcccc--CceeecccccCC---cccceeecCCCcccEEEe Q lcl|NC_019418. 190 WVTPTG---QEV--------GSTKDKSLYRITNELYKS-TSDSQL--GERVNLSELYPD---LQPVTPIQGLSRPLFTYL 252 (527) Q Consensus 190 ~~~~~~---~~~--------~~~~~~~~~~I~n~ly~~-~~~~~l--G~~v~l~~~~~~---l~~~~~~~g~~~p~f~~~ 252 (527) -..... ... ...+-+.+++++=..|-+ ..+... +...++..++-+ |--+...+...+++|.+. T Consensus 228 ~~~~~~~~~~~~~~~~d~~~~~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v~~g~~iIR~~~np~~~~~~PF~~~ 307 (584) T protein:vir:95 228 GGYSVEDFDKAAGFDVDGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITVVDRSTEVRNESIPTWFGSAPIYHV 307 (584) T ss_pred CCCcccccccccccccccccccccccCCceeEEEeecccccccccCCCcccceEEEEeccEEEEeeecCCCCCCCCEEEE Confidence 000000 000 000001111121111100 000000 000000000000 000000122345556555 Q ss_pred cCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCcccccccccccccceeee Q lcl|NC_019418. 253 KTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQ 332 (527) Q Consensus 253 ~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~ 332 (527) .+ .|. ..|.||.|+.+.+.++++.+|.+..++.+.+...-.. ++..++.+ ......++..| +. T Consensus 308 ~~-~p~---~~s~yG~gi~~ll~d~Q~~lna~~r~~iDnl~l~~~p--v~k~~~~~----~~~~~~pg~~~------~~- 370 (584) T protein:vir:95 308 GW-RFR---PDNLWAMGPLDNLVGMQYRIDHLENAKADAVDLIIQP--PLKIIGEV----EEFVWGPGAEI------HL- 370 (584) T ss_pred cc-eee---eccccCCCchhhhhhHHHHHhHHHHHHHHHHHHhcCc--ceeecccc----chhcccCCcee------ec- Confidence 43 222 3577999999999999999999999999988653222 22333221 11111111111 11 Q ss_pred ccCCCCCCCcceEeccccChHHHH---HHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019418. 333 VGAGNMDSGGIVDLTTPIRSSDYI---SAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVE 409 (527) Q Consensus 333 ~~~~~~~~~~i~~~~~~ir~e~~~---~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~ 409 (527) ++.+.++.+.|. ...+. ..++.+...++...|+++..-|..+.+.+|||.+...-++.-.-+..+.+.|. T Consensus 371 -----~~~~~~q~~~p~--a~~~~s~~~~lq~~e~~me~~sGvp~~~~G~~~~~~~TAtg~s~l~naa~~~~r~~~~~f~ 443 (584) T protein:vir:95 371 -----DQGGDVQEIAKN--VNYIINADNQIQMLEDRMELYAGAPREAMGIRTPGEKTAFEVQQLGNAAGRIFQEKVTTFE 443 (584) T ss_pred -----CCCCCcceecCc--hhhhhHHHHHHHHHHHHHHhhhCCChhhcccccchhhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 233346666653 23333 33555666788899999999999888899999998877877777788888887 Q ss_pred HHH-HHHHHHHHHHhhhhcc-cCCcc--------------cCccceEEEeCC---Cc--cCCHHHHHHHHHHHHhc--C- Q lcl|NC_019418. 410 QSI-KELCVSMCELGKVVGI-YRGTI--------------PELDDISVNLDD---GV--FTDRHAELDYWMKMVAA--G- 465 (527) Q Consensus 410 ~al-~~li~~il~~~~~~~~-~~~~~--------------~~~~~v~v~f~d---~i--~~d~~~~~~~~~~~~~a--G- 465 (527) ..| ++|+.++..++.. ++ ..+.+ ....++.-+|.= +. ....+...+...+..++ | T Consensus 444 ~~ll~~l~~ll~~~~~~-nmd~~~~vr~~n~e~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~q~l~~ilq~~~~~ 522 (584) T protein:vir:95 444 VELLEPVLNAMLETATR-NMDGSDVIRVMDTDLGVKEFMSVTREDITANGKIRPIGARHFGKQAQDLQNLVGIFNSQIGQ 522 (584) T ss_pred HHHHHHHHHHHHHHHHh-hccccCceeeeccccccccccccChhhhccCeeEEeehhhHHHHHHHHHHHHHHHHHhhhhh Confidence 776 8888888877532 11 11110 001122222211 10 01122223333333321 1 Q ss_pred ----CCCHHH---HH---HhcCC--CCH------HH--HHHHHHHHHHhc--ccccccccCC Q lcl|NC_019418. 466 ----FATQKR---GI---AKTLG--ITE------EE--AEKELAEINGEL--PPESDAELAL 505 (527) Q Consensus 466 ----i~s~~~---~i---~~~~~--~~d------ee--a~~el~ri~~E~--~~~~~~~~~~ 505 (527) -++... .+ +++++ .-. ++ .+..+...|++. ..+..+...+ T Consensus 523 ~i~p~~~~~~l~~~ladl~~~p~~~~~~~~~~~~~Q~~~q~~~~~~q~~~~~~~~~~~~~~~ 584 (584) T protein:vir:95 523 MILPHTSGKALATFVDDVTGLQGYEIFRPNVAVAEQAETQSLVAQAQEDLQLQAQMPAEGAI 584 (584) T ss_pred hccccchHHHHHHHHHHHhCCCcccccCCCcccchhHHHHhhhHHHHHHHHHHHhhhhccCC Confidence 123222 12 22332 111 11 011000111100 0111111122 No 89 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=99.31 E-value=8.1e-11 Score=75.92 Aligned_cols=476 Identities=11% Similarity=0.072 Sum_probs=217.1 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc-CccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD-GDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~-~~~~~~~~~~lnl~~~i 79 (527) =.++.+++.+|+.. ..-..+.+....+..+||.|+ .|...... -+-..+..+..|+=+-+ T Consensus 29 ~~~~~~~~~~~~~~-----------------~~~~~~~r~~a~~d~~fy~G~--Qw~~~~~~~l~~~g~p~~~~N~i~~~ 89 (711) T protein:vir:10 29 RALLATARERARDG-----------------ATYWKDNWEAAEDDLKFLGGE--QWPSQVRTERELEQRPCLVNNVLPTF 89 (711) T ss_pred HHHHHHHHHHHHHH-----------------HhhhHHHHHHHHHHHHHhCCC--CCCHHHHHHHHhcCCCcEEEcchHHH Confidence 12333333333210 111333344455667788774 33211111 11223556788999999 Q ss_pred HHHHhhhhhcccceEeeC---------------------------CHHHHHHHHH----HHhhhhHHHHHHHHHHHHHhc Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAE---------------------------DETLNDFLSD----MLSNDRFNKNFERYLESALAL 128 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~---------------------------d~~~~~~l~~----~l~~n~f~~~~~~~~~~a~~~ 128 (527) |+...++--...+.+.+. |..+++.|+. +.+.++.......+..+++.. T Consensus 90 v~~v~g~~~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~af~d~~~~ 169 (711) T protein:vir:10 90 VDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVES 169 (711) T ss_pred HHHHhhhHhhCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHHhcChhHHHHHHHHHhhhc Confidence 999888877766666543 2345555544 555777888888999999999 Q ss_pred CCEEEEEEEe-------CCeeEEEEE-cCCceEE--EEEcCCceEEE--EEEEEEEeeCCCcceEEEEEEEEeecccccc Q lcl|NC_019418. 129 GGLAMRPYVD-------GDKIRVAFI-QAPVFLP--LQSNTQDVSSA--AILTKTIKTENRKNVYYTLVEFHEWVTPTGQ 196 (527) Q Consensus 129 G~~~~~~~~d-------~~~~~i~~v-~a~~~~P--~~~d~~~~~~~--a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~ 196 (527) |-+|+++++| .+.++|..| +|..++. -... -....| ++..+++..+.-.. .|--...|.+...+.. T Consensus 170 G~G~~ev~~d~~~~d~~~~e~~i~~v~~p~~v~~Dp~a~~-~D~sDar~~~~~~~~~~~~~~~-~yp~~a~~~~~~~~~~ 247 (711) T protein:vir:10 170 GMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKK-RDRSDMNWCLIDDTMSKEKFKA-LYPDATAEPVYEDSVA 247 (711) T ss_pred CcceEEEEecccCCCCCCCCeEEeeecChhheeeCccccc-cChhhhcceeeeecCCHHHHHH-hCCchhhhhhhccccc Confidence 9999999875 256788777 5887663 1110 011122 22222211110000 0000000000000000 Q ss_pred cceeeecCCceEE---------EEEEEecCCccccCceeecc-c----ccCC-------------------cccceeecC Q lcl|NC_019418. 197 EVGSTKDKSLYRI---------TNELYKSTSDSQLGERVNLS-E----LYPD-------------------LQPVTPIQG 243 (527) Q Consensus 197 ~~~~~~~~~~~~I---------~n~ly~~~~~~~lG~~v~l~-~----~~~~-------------------l~~~~~~~g 243 (527) ..........-++ ..+++...+.. +...+-. . .++. +.....+.+ T Consensus 248 ~~~~~~~~~~vrv~E~~~r~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~G~~~L~~ 325 (711) T protein:vir:10 248 DYDTWFTEKSVRVSEYFTREPVIREIALLSDGR--SFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEG 325 (711) T ss_pred ccCcccCcceeeEEEEEeeeeeeeEEEeecCCc--eeccCcchhHHHHHHhcCchhhhhhhhceeeEEEEEEecceeecC Confidence 0000000000000 01111111100 0000000 0 0000 000011111 Q ss_pred C-----CcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHc-CcceeeechhHhcCCCCCCCccc Q lcl|NC_019418. 244 L-----SRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKM-GQRRVIVPEQMTQLKVQDNQGNI 317 (527) Q Consensus 244 ~-----~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~-~~~~i~v~~~~l~~~~~~~~~~~ 317 (527) - .+.+|+.| -..=.....+..+-|++.++++.++.+|...|++++.+-. ++.+++++.+.+... +....+ T Consensus 326 ~~p~~~~~~P~vp~--~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~~gai~~~-~~~~~e- 401 (711) T protein:vir:10 326 PVEIPSTTIPVIPV--WGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGR-EDEWEQ- 401 (711) T ss_pred CCCCCCCcccEEEE--eeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCceeecCcccCCh-HHHHHh- Confidence 1 11122211 0000011233334557999999999999999999999855 777888877776421 110000 Q ss_pred ccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHH Q lcl|NC_019418. 318 AFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDT 397 (527) Q Consensus 318 ~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~ 397 (527) ....+ ..+..++.+......++...+.--..++...++.....|...+|++...+|..++ ..|+.+|.+..... T Consensus 402 ---~~~~~--~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n-~~Sg~ai~~~q~qg 475 (711) T protein:vir:10 402 ---ANTKN--FSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGN-ETSGRAIIARQRQG 475 (711) T ss_pred ---ccccC--CCeeEecccccCcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCcc-chHHHHHHHHHHHH Confidence 00001 1122222222222356655543334567888988888899999999999988754 45788888888887 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhc-------ccCC-ccc-------------------------CccceEEEeC Q lcl|NC_019418. 398 YQMRNSIVALVEQSIKELCVSMCELGKVVG-------IYRG-TIP-------------------------ELDDISVNLD 444 (527) Q Consensus 398 ~~~~~~~~~~~~~al~~li~~il~~~~~~~-------~~~~-~~~-------------------------~~~~v~v~f~ 444 (527) ......+...+..+++.+.+.++.+...+- +.+. ... ..++|+|+=. T Consensus 476 ~~~l~~~~dn~~~~~~~~g~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i~~~ 555 (711) T protein:vir:10 476 DRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTG 555 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccceeeeccceeeeEEEEeec Confidence 777777888888888888877776654321 1110 000 1112333222 Q ss_pred CCccCCHHHHHHHHHHHHhcCCCCHHH-----HHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCC Q lcl|NC_019418. 445 DGVFTDRHAELDYWMKMVAAGFATQKR-----GIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKD 519 (527) Q Consensus 445 d~i~~d~~~~~~~~~~~~~aGi~s~~~-----~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (527) .+.+.-+.+.++.++++. +.++.-. .+.++-++.. +.+..++++.-..+.....+ . . T Consensus 556 p~~~s~r~~~~~~l~ql~--~~~p~~~~~~~~~il~~~d~p~--~~el~e~lr~~~~~~~~~~~-------~-------~ 617 (711) T protein:vir:10 556 PAFATQRIEAAEAMIQFA--QAVPSAAAVMADLIAQNMDWPG--ADVIAERLKKIVPPNVLSKD-------E-------R 617 (711) T ss_pred cCchhHHHHHHHHHHHHH--hhcchhhhHHHHHHHHhcCCCC--HHHHHHHHHhhcCcccCcch-------h-------h Confidence 222222233333333332 3332211 1233333322 22223333332221110000 0 0 Q ss_pred CCCccccC Q lcl|NC_019418. 520 TVDDEDEA 527 (527) Q Consensus 520 ~~~~~~~~ 527 (527) ....+... T Consensus 618 ~~~qq~~~ 625 (711) T protein:vir:10 618 EAIEEDMP 625 (711) T ss_pred hHHHHHHH Confidence 00000000 No 90 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=99.28 E-value=2.2e-10 Score=73.53 Aligned_cols=427 Identities=11% Similarity=0.026 Sum_probs=194.2 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccc-----c---cccc--------cCc Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDI-----E---YTNT--------DGD 64 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l-----~---~~~~--------~~~ 64 (527) |.+-.+=..+....-.+.+. .+ + -...++....-|.-+++-- . |... +.. T Consensus 14 m~V~~~hp~y~a~~~~W~~~------~d----~----g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~~~ 79 (488) T protein:vir:96 14 MLTPIYHPDYLVNAPQWLRN------LD----C----VMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWEDL 79 (488) T ss_pred ecccccCHHHHHHhhhhhHh------hh----h----hhHHHHHhhhhcCCCCCCccccccCcchhhhhhccchhhhHhh Confidence 44211111111100000000 00 0 0112222222222111100 0 0000 000 Q ss_pred cccCceeecchHHHHHHHHhhhhhcccceEeeCC-HHHHHHHHHH-HhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC--- Q lcl|NC_019418. 65 RKRRKMQHLPIARTAAKKIASLVYNEQAEISAED-ETLNDFLSDM-LSNDRFNKNFERYLESALALGGLAMRPYVDG--- 139 (527) Q Consensus 65 ~~~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~d-~~~~~~l~~~-l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~--- 139 (527) +.+|- .=.|+++.+++.++.++|.++|+++.++ +.++.+++++ .+-+++...++.++..++..|.+++.|=+.. T Consensus 80 ~~~rA-~~~n~~~~tl~~l~G~vfrk~p~~~~~~~~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~ 158 (488) T protein:vir:96 80 TWRLA-NYVNIVNPTMNAITGAVMRREPEFDTMDNPVLIGLRDNIDGKGNGIDQECKQALNALQWGSRCGWLVRSHPESA 158 (488) T ss_pred hhhcc-ccCchhHHHHHHhcchhhccCceeccCCcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCcC Confidence 00011 1239999999999999999999998765 4455544432 2245688889999999999999998876532 Q ss_pred ---------CeeEEEEEcCCceEEEEEcC---Cc--eEEEEEEEEEEeeCCCcceEEE--EEEEEeecccccccceeeec Q lcl|NC_019418. 140 ---------DKIRVAFIQAPVFLPLQSNT---QD--VSSAAILTKTIKTENRKNVYYT--LVEFHEWVTPTGQEVGSTKD 203 (527) Q Consensus 140 ---------~~~~i~~v~a~~~~P~~~d~---~~--~~~~a~~~~~~~~~~~~~~~yt--~lE~h~~~~~~~~~~~~~~~ 203 (527) .+|-+..++|++++= |.. +| ++.-+.+.+.+...+..+ ++. +..++. .. T Consensus 159 T~ade~~~~~rPy~~~~~a~~Iin--W~~~~v~G~~~L~~v~lrE~~~~~D~~~-~~~~~~~~~~~------------l~ 223 (488) T protein:vir:96 159 TMADWNKGKKLPTAAFYDALHIID--WEVEYIDGEEKLTYLSLLEDYQERDGGT-YVSKQRLINHR------------LV 223 (488) T ss_pred CHHHHHHhcCCcEEEEechhhhcC--cceeccCCceeeEEEEEEEEEEeccCCC-cccceEEEEEE------------EE Confidence 247899999999884 432 23 344454444443322211 111 111111 12 Q ss_pred CCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHH Q lcl|NC_019418. 204 KSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINR 283 (527) Q Consensus 204 ~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~ 283 (527) .+.|.+. +...+...+..+|.. . --+++...+|+++... .|.... |.+-|-++- .||. T Consensus 224 ~g~~~v~----~~~~~~~~~e~~~~~------~---g~~~l~~IP~v~~~~~-~~~~~~----~~pPLldLA----~lnl 281 (488) T protein:vir:96 224 DGLCEFQ----EVTDDEYSDEWTPVL------I---NSKQSDTIPFFLASSQ-SNEWCI----DSTPLTSLA----EISL 281 (488) T ss_pred CcEEEEE----EEecCCcccceEeec------C---CCcccCeeEEEEEecC-CCCCCC----CCCchHHHH----HHHH Confidence 3334332 222222222222210 0 0124455566666422 122111 233333222 2343 Q ss_pred HHHHHHHHHH----cCcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCC-CCCCcceEeccccChHHHHHH Q lcl|NC_019418. 284 TYDEFMWEIK----MGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGN-MDSGGIVDLTTPIRSSDYISA 358 (527) Q Consensus 284 ~~s~~~~e~~----~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~-~~~~~i~~~~~~ir~e~~~~~ 358 (527) .+-+..-+++ .+...+.+ +.... ...+..... ....+..+-.... ..++.+.+++..+..- ..++ T Consensus 282 ~Hy~~ssd~~~il~~~~~p~lv----~~~~~-~~~~~~~~~----~~~g~~~~~~~~~~~~~g~~~~~e~~~~~l-~~~~ 351 (488) T protein:vir:96 282 SIYVMNAYSNKAMILANEAKWM----VDMGD-MNKTMASEM----NPLGFTLAGRMPYYVKNGDVKVIQAQFSPE-TENK 351 (488) T ss_pred HHHhhhhHHHHHHHhcCCceee----eccCC-CCccccccc----ccceeeecccccccccCCceeecCCchhHH-HHHH Confidence 3333333332 22222222 10000 000000000 0001111111110 1122344555444322 2444 Q ss_pred HHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccc Q lcl|NC_019418. 359 ISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDD 438 (527) Q Consensus 359 ~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~ 438 (527) |+.+..++ ...|. ..+. .++.+|||+.....+...+....+...++.||+++++.+..+..... ++..+.... T Consensus 352 l~~l~~qm-~~~Ga--~l~~--~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~A~w~g~~~--~~~~~~~~~ 424 (488) T protein:vir:96 352 VEKLFEQA-VKVGA--SLFT--QQSNETATGAAIRSGSSTASMATLGNNVEDTVRNMLRFIMRYFEGTN--LYVNPDELV 424 (488) T ss_pred HHHHHHHH-HHHhH--hhcc--CCCcchHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCC--CCcCccceE Confidence 55555543 22222 2332 23458999988888888888888889999999999998886543211 111122222 Q ss_pred --eEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcC--CCC--HHHHHHHHHHHHHhcccc Q lcl|NC_019418. 439 --ISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTL--GIT--EEEAEKELAEINGELPPE 498 (527) Q Consensus 439 --v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~--~~~--deea~~el~ri~~E~~~~ 498 (527) ++-+|... ..| ...++.+.++..+|.||.+|++..+- |+- |-+.+++..||+++--.. T Consensus 425 ~~in~dF~~~-~ld-~~~~~al~~~~~~G~Is~~t~~~~L~~~gvl~~d~~~e~~~~~ie~~g~~~ 488 (488) T protein:vir:96 425 FKLNRDYFDV-EVN-PQMLQVAYAAMMEGNLPQVSWFELLKRARVVRGDMSKEEFDEHIAELGFGM 488 (488) T ss_pred EEeccCCCCc-cCC-HHHHHHHHHHHhcCCCCHHHHHHHHHhCCcCCccCCHHHHHHHHhhcCCCC Confidence 33333332 223 45688899999999999999865432 441 212345666666543222 No 91 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.27 E-value=5e-11 Score=77.05 Aligned_cols=451 Identities=12% Similarity=0.053 Sum_probs=196.7 Q ss_pred CChHH---HHHHHHHHHHHHhhcccchhhh-------c--cCccccCHHHHHHHHHHHHHhcCCC------ccccccccc Q lcl|NC_019418. 1 MSLIQ---KVKDFFNRGRYNMTTSHLSSIL-------D--HPKVAVTQSEFRRIQHNLAYYQSKF------DDIEYTNTD 62 (527) Q Consensus 1 m~~~~---~~k~~~~~~~~~~~~~~~~~~~-------~--~~~i~~~~~~~~~i~~~~~~y~g~~------~~l~~~~~~ 62 (527) |++|. ..+.....+.-.-....+.... + ++-++..... ........|.|-. .+....... T Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 102 (537) T protein:vir:10 25 VGIFGAGDDEKPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLD--VEGGTFSAYANPNLSEGLVLWYAQQAFI 102 (537) T ss_pred cCCCcccchhhHHHHHHhhhhccCCCCCccCcccccccccccchhccccc--cchhhhhhhccccccchhhhhccccCCc Confidence 77774 3444443322110011110100 0 1111111100 0000011111100 000000000 Q ss_pred CccccCceeecchHHHHHHHHhhhhhcccceEeeCCH-----HHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEE Q lcl|NC_019418. 63 GDRKRRKMQHLPIARTAAKKIASLVYNEQAEISAEDE-----TLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYV 137 (527) Q Consensus 63 ~~~~~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~d~-----~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~ 137 (527) +..-.--+..-.+++.+|+..|+.++.+...|++++. ...+.|+..+++.+++..+.+++..+-.+|++++.+.+ T Consensus 103 ~~~l~a~Y~~~~l~r~iVd~~A~d~~r~~~~i~~~~~~~~~~~~~~~l~~~~~~l~~~~~l~~a~~~~rlyG~~~i~i~v 182 (537) T protein:vir:10 103 GHQMCALIATHWLVNKACSQMPRDAMRKGYKIISDDGNELDPKDAKFIDRYDRAFNIKKHAIQFVRKGRIFGIRIALFKV 182 (537) T ss_pred cHHHHHHHHhCchhhhhhhhhhHHhhcCCceeecCCcccccHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEee Confidence 1000001223489999999999999999999988643 34567777787888999999999999999999988776 Q ss_pred eCC--eeEEEEEcCCceEEEEEcCCceEEEEEEEEE-E--------eeCCCcceEEEEEEEEeecccccccceeeecCCc Q lcl|NC_019418. 138 DGD--KIRVAFIQAPVFLPLQSNTQDVSSAAILTKT-I--------KTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSL 206 (527) Q Consensus 138 d~~--~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~-~--------~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~ 206 (527) +.. ...-.-+.++.+-|. +.....++.+. + ........||. -.. T Consensus 183 ~~~D~~~~~~Pl~~~~i~kg-----~~k~l~vidp~~~~~~~~~~~~~dp~sp~fg~--------------------P~~ 237 (537) T protein:vir:10 183 DSPDPYYYEKPFNIDGVMPG-----AYKGIVQIDPYWCAPLLDAQASSNPVSMHFYE--------------------PTY 237 (537) T ss_pred cCcCCccccccccccccccc-----ceeEEEEechhhcccccchhhhccCCccccCC--------------------cee Confidence 421 111111111111110 00000111000 0 00000000110 011 Q ss_pred eEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHH Q lcl|NC_019418. 207 YRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYD 286 (527) Q Consensus 207 ~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s 286 (527) |.|. |..|.-+ ..+.+.|-+-|. +.++ . ...+|+|++..+.+.|..++.+-. T Consensus 238 y~v~------------g~~iH~S-------Rli~f~g~~~p~--~~~~--~-----~~~~G~Svlq~~~~~l~~~~~t~~ 289 (537) T protein:vir:10 238 WLIN------------GKKYHRS-------HLAIYINDEVVD--FLKP--S-----YIYGGVPLPQQIMERVYAAERTAN 289 (537) T ss_pred eeec------------CeEecce-------eEEEecCCCCch--hhhc--c-----cCcccccHHHHHHHHHHHHHHHHH Confidence 1111 1111000 001111211111 1110 1 124699999999999999998877 Q ss_pred HHHHHHHcCcceeeechhHhcCCCCCCCccccccccc---ccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHH Q lcl|NC_019418. 287 EFMWEIKMGQRRVIVPEQMTQLKVQDNQGNIAFKRRF---DVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGL 363 (527) Q Consensus 287 ~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~---d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l 363 (527) ....-+.....+++--. ++....+.++ ....+ ...+..+..+-.+ .+...++.++.+ ..-....+.... T Consensus 290 ~~~~l~~~~~~~v~k~~-~~~~l~~~~~----~~~r~~~~~~~r~n~g~~~id-~e~e~~e~~~~~--lsgl~~~l~~~~ 361 (537) T protein:vir:10 290 EGPMLAMTKRQTVLKVD-AAQVLANKQQ----FDETMSWWTATRDNYQVRVVD-KDNEDVVQIDTT--LNDLDKVIMNQY 361 (537) T ss_pred HHHHHHHhcCCceeeec-hHHhhcCHHH----HHHHHHHHHhhcCCcceeEec-CCCceeEEEecc--CCCHHHHHHHHH Confidence 76655543333333211 1111111110 00000 0001111111111 122235544433 233455666677 Q ss_pred HHHHHhcCCCccc-cccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEE Q lcl|NC_019418. 364 KLFEMQIGVSSGM-FTFDGQGV-KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISV 441 (527) Q Consensus 364 ~~i~~~~g~s~~~-~~~~~~g~-~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v 441 (527) ..|+..+|++..- ||...+|. .|+.+=. +..+..++.+|..++..|+.|+..|+... .+ ...+++| T Consensus 362 ~~iAa~~~IP~t~L~G~sp~GlnatGe~D~---~~yyd~I~~~Qe~l~p~l~~l~~ll~~~~-----~~----~~~~~~i 429 (537) T protein:vir:10 362 QLVCAIARTPAPKMLGTVPTGFNSTGDYEE---ASYHEECESTQDDMRPLIDRHHQLVCRSH-----LR----KRIRVKV 429 (537) T ss_pred HHHHhhhCCCceeeccCCccccccchhHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----CC----CCcceEE Confidence 7788888998764 56655554 4455333 33455556666668889988888776532 11 1346899 Q ss_pred EeCCCccCCHHHHHH-------HHHHHHhcCCCCHHHHHHhc--------CCCCHHHHHHHHHH--HHHhcccccccccC Q lcl|NC_019418. 442 NLDDGVFTDRHAELD-------YWMKMVAAGFATQKRGIAKT--------LGITEEEAEKELAE--INGELPPESDAELA 504 (527) Q Consensus 442 ~f~d~i~~d~~~~~~-------~~~~~~~aGi~s~~~~i~~~--------~~~~deea~~el~r--i~~E~~~~~~~~~~ 504 (527) .|++-...|..+.++ ++.+++.+|+|+..+++.++ .++....-.++.+. ++.|..+ ....+. T Consensus 430 ~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~~~~g~~~l~~~~~~ed~e~~~~~~~~~~-~~~~~~ 508 (537) T protein:vir:10 430 EFPPMDAPKESERADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMDPTLGFTSITPAMRPTDAEDIDVDDEGKP-VRIIED 508 (537) T ss_pred EeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccCccccccccCCCChhhhhcccCCccCCc-CCCCCC Confidence 999887788776654 47788899999998876553 12211100011111 1111111 000000 Q ss_pred CCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 505 LYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 505 ~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) -+...+......++++.++.-++ T Consensus 509 ~~~~~~~~~~~~~~~~~~~~~~~ 531 (537) T protein:vir:10 509 QPAPSEMFGATSSGESANDPRDS 531 (537) T ss_pred CCCccccCCCCccccccCCCccC Confidence 11111111122222222222222 No 92 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=99.25 E-value=1.6e-11 Score=79.78 Aligned_cols=444 Identities=11% Similarity=0.070 Sum_probs=175.5 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCcccc---CHHHHHHHHHHH-HHhcCCC-cccccccccCcc-c-------- Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAV---TQSEFRRIQHNL-AYYQSKF-DDIEYTNTDGDR-K-------- 66 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~---~~~~~~~i~~~~-~~y~g~~-~~l~~~~~~~~~-~-------- 66 (527) |++|++++-. +....-+.++ ....+.-.|.+ +.+.++....-+ +=|.-.. +...+....+.+ . T Consensus 5 ~~~~~~~~~~--~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~~~~~~~~~~r~~~~~~~~l~ 81 (551) T protein:vir:80 5 LGLFESIRLV--GVNKSDAVKH-IEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLH 81 (551) T ss_pred hhhHHHhhhc--cCChhhcccc-cccccceeeecccccHHHHHHhhccCcceeecccccceecCcccccCccccChhHHH Confidence 9998887710 1110111111 11111111222 222222111100 0111000 001000000000 0 Q ss_pred --cCceeecchHHHHHHHHhhhhhc-----------ccceEeeCC---------HHHHHHHHHHHhhh---------hHH Q lcl|NC_019418. 67 --RRKMQHLPIARTAAKKIASLVYN-----------EQAEISAED---------ETLNDFLSDMLSND---------RFN 115 (527) Q Consensus 67 --~~~~~~lnl~~~i~~~~A~ll~~-----------e~~~i~~~d---------~~~~~~l~~~l~~n---------~f~ 115 (527) .+....-++...+++..|+.+.. -+-.+.+.+ ....+.+.+++..- .|. T Consensus 82 ~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~~~s~~ 161 (551) T protein:vir:80 82 GVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFS 161 (551) T ss_pred HHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCCccchHH Confidence 00111224455665666654432 011222221 12223444444422 234 Q ss_pred HHHHHHHHHHHhcCCEEEEEEEeCC-e-eEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeeccc Q lcl|NC_019418. 116 KNFERYLESALALGGLAMRPYVDGD-K-IRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTP 193 (527) Q Consensus 116 ~~~~~~~~~a~~~G~~~~~~~~d~~-~-~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~ 193 (527) ..+..++.+.+..|.+++.+..+.+ + ..+.+++|.++.++..+++.... ...+|. ... T Consensus 162 ~f~~~lv~dlll~Gnay~~i~rd~~G~~~~L~~l~p~~V~v~~~~~g~~~~-------------~~~~y~--~~~----- 221 (551) T protein:vir:80 162 SFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPD-------------NGNRFV--QVI----- 221 (551) T ss_pred HHHHHHHHHHHhcCCEEEEEEECCCCcEEEEEEeCCceeEEEECCcccccc-------------CceEEE--EEe----- Confidence 4556677778888999988888653 3 35777888888775432221110 011110 000 Q ss_pred ccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhh Q lcl|NC_019418. 194 TGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDN 273 (527) Q Consensus 194 ~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~ 273 (527) ++....+ | + +.+ ..|++.+.. ......++|+|-+.. T Consensus 222 ----------~g~~~~~---~------------~--------~~e----------iiH~~~n~~-~~~~~~~~G~spi~~ 257 (551) T protein:vir:80 222 ----------DQKIVAT---F------------N--------ARE----------MAFAVRNPR-SDIYATGYGYPELEI 257 (551) T ss_pred ----------CCcEEEE---E------------c--------ccc----------eEEecccCC-CCcccccccccHHHH Confidence 0000000 0 0 000 234432100 011234679999888 Q ss_pred hHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCccc---ccccccccccceeeecc-CC-----CCCCCcce Q lcl|NC_019418. 274 AKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGNI---AFKRRFDVEQNVYMQVG-AG-----NMDSGGIV 344 (527) Q Consensus 274 ~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~~---~~~~~~d~~~~~~~~~~-~~-----~~~~~~i~ 344 (527) +...|.....+-.-..+-|..|.. |..+|....+.+-... .+...| ...|.+.+ .+ .++...++ T Consensus 258 a~~~i~~~~a~~~~~~~~f~Ng~~----p~giL~~~~~~~lt~e~~~~lk~~~---~~~~~G~~nag~~~vl~~~g~~~~ 330 (551) T protein:vir:80 258 ALKQFIAHENTEAFNDRFFSHGGT----TRGILQIKAAQQQSQHALEIFKREW---KNSLSGINGSWQIPVVSAEDVKFV 330 (551) T ss_pred HHHHHHHHHHHHHHHHHHHHcCCC----cceEEEEcCCCCCCHHHHHHHHHHH---HHHhcCccccCccccccCCCceEE Confidence 887776554433333334565432 2222211111100000 000111 11232221 11 11222455 Q ss_pred EeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHh Q lcl|NC_019418. 345 DLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNS-IVALVEQSIKELCVSMCELG 423 (527) Q Consensus 345 ~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~-~~~~~~~al~~li~~il~~~ 423 (527) .++......++.+..+...+.|+...|++|..+|+...+..++....+.. +.++.. ....++.+|.-++..|-... T Consensus 331 ~l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t---~sn~e~~~~~f~~~tL~P~~~~ie~~l 407 (551) T protein:vir:80 331 NMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLN---EGNSAEKNQASKNKGLQPLLGFIEDFI 407 (551) T ss_pred EccCChhHHHHHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccc---hhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 55666677889999998999999999999999987554322222111111 111111 11233444544444443222 Q ss_pred hhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCH--HHHHHHH-----H----HHH Q lcl|NC_019418. 424 KVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLGITE--EEAEKEL-----A----EIN 492 (527) Q Consensus 424 ~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~d--eea~~el-----~----ri~ 492 (527) +. .|.. .....+.+.|+.....+.. +.....+++.+|+|++-+++.+. |+.. +....-+ . ..+ T Consensus 408 n~-~L~~---~~~~~~~f~f~~~~~~~~~-~~~~~~~~~~~g~lT~NE~R~~~-gl~P~~egGD~~~~~~~~~~~~~~~~ 481 (551) T protein:vir:80 408 NK-HIVA---EFGDKYTFQFVGGDIKSEL-ESVKILAEKAKVAMTVNEVRKEL-NLPGDVIGGDIPLNGVIVQRIGQLMQ 481 (551) T ss_pred Hh-hhcc---ccCCceEEEeeccChhhHH-HHHHHHHHHhcCCcCHHHHHHHh-CCCCCCCCCceeeccccccccccccc Confidence 21 1211 1123467888876666644 34455667778999999977654 5422 1111000 0 000 Q ss_pred -------HhcccccccccCCCCCCCC---CCCCCCCCCCCccccC Q lcl|NC_019418. 493 -------GELPPESDAELALYGKGQQ---NTVGNSKDTVDDEDEA 527 (527) Q Consensus 493 -------~E~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~ 527 (527) .+++..........+..++ .++...+.+++.++++ T Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~ 526 (551) T protein:vir:80 482 QEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDG 526 (551) T ss_pred ccCcchhhhhhccccccCcCCCCCCCCCCCCCCccccCCCccccc Confidence 0000000000011111111 1111111222222222 No 93 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=99.24 E-value=1.8e-10 Score=73.96 Aligned_cols=408 Identities=13% Similarity=0.161 Sum_probs=176.2 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCC-CcccccccccCccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSK-FDDIEYTNTDGDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~-~~~l~~~~~~~~~~~~~~~~lnl~~~i 79 (527) |+||++|+.||.- ..+. ....+.++. ...+...|.|- +..+. ... ...+...--..+ T Consensus 1 M~~~~r~~~~~~~-----~~r~-----~~~~~~~~~-----~~~~~~~~~g~~~~~~~---v~~----~~al~~~~v~~~ 58 (432) T protein:vir:10 1 MKIVDSVKKFFNF-----EKRQ-----TSQVIELNK-----DDEKLLEWLGISPSTIS---VKG----KNALKVATVFAC 58 (432) T ss_pred CChHHHHHHhcCc-----cccC-----cccccccCC-----chHHHHHHhCCCcCccc---cch----hhhhccHHHHHH Confidence 9999999999751 1111 111222222 12222222231 11110 100 111111222234 Q ss_pred HHHHhhhhhcccceEeeCC-----HHHHHHHHHHHhh-----hhHHHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEEE Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAED-----ETLNDFLSDMLSN-----DRFNKNFERYLESALALGGLAMRPYVDG-DK-IRVAFI 147 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~d-----~~~~~~l~~~l~~-----n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~v 147 (527) ++.+|+-+-+-|..+--.+ ......|..+|.. -.....++.++...+..|.+++.+..+. |+ ..+..+ T Consensus 59 i~~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i 138 (432) T protein:vir:10 59 IKILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPI 138 (432) T ss_pred HHHHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEE Confidence 4555555544444331111 1112223333321 1123344556777788899999998875 33 356667 Q ss_pred cCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceee Q lcl|NC_019418. 148 QAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVN 227 (527) Q Consensus 148 ~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~ 227 (527) +|+++-+.. +..+... .+...||.. .. -|.... T Consensus 139 ~~~~v~v~~-d~~~~~~-----------~~~~~~y~~------------------------------~~-----~g~~~~ 171 (432) T protein:vir:10 139 DASKVTVYI-DDVGLLN-----------SKTKMWYVV------------------------------NT-----GGQQRV 171 (432) T ss_pred cCceeEEEE-cCccccc-----------ccceEEEEE------------------------------ec-----CCeEEE Confidence 777766532 2222111 001111110 00 011100 Q ss_pred cccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcC-cceeeechhHh Q lcl|NC_019418. 228 LSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMG-QRRVIVPEQMT 306 (527) Q Consensus 228 l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~-~~~i~v~~~~l 306 (527) + ++ --..||+.+.+ .+...|+|.+..+...+.....+-....+-|+.| +++-++ T Consensus 172 ~-------~~---------~eiih~r~~~~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil----- 226 (432) T protein:vir:10 172 L-------KP---------EEILHFKNGIT----LDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLV----- 226 (432) T ss_pred E-------cc---------ccEEEecCCCC----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEE----- Confidence 0 00 01345654321 1234699999888887776665444444445553 333222 Q ss_pred cCCCCCCCc-ccccccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccc Q lcl|NC_019418. 307 QLKVQDNQG-NIAFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFD 380 (527) Q Consensus 307 ~~~~~~~~~-~~~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~ 380 (527) ......... .-.....| +..|.+.. .+ -+++..++.++......++.+..+...++|+...|++|..+|.. T Consensus 227 ~~~~~l~~e~~~~~~~~~---~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~ 303 (432) T protein:vir:10 227 QYVGDLNEDAKKVFRENF---ESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDL 303 (432) T ss_pred EcCCCCCHHHHHHHHHHH---HHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC Confidence 211111100 00000111 11122111 00 01122355666666677888888888899999999999999875 Q ss_pred cccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc-CCcccCccceEEEeCCCccCCHHHHHHHH Q lcl|NC_019418. 381 GQGV-KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIY-RGTIPELDDISVNLDDGVFTDRHAELDYW 458 (527) Q Consensus 381 ~~g~-~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~-~~~~~~~~~v~v~f~d~i~~d~~~~~~~~ 458 (527) .++. .++.+... ..++.+|..++..|-...+. .++ .........+.++++.-+..|..+.++.. T Consensus 304 ~~~~~s~~e~~~~-------------~~~~~~l~P~~~~ie~~ln~-kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~ 369 (432) T protein:vir:10 304 SKATLNNIEQQQQ-------------QFYTDTLQATLTMYEQEMTY-KLFLDSELDKGFYSKFNVDAILRADIKTRYEAY 369 (432) T ss_pred CCCCcccHHHHHH-------------HHHHHHHHHHHHHHHHHHHH-hhcChhhcCCCcEEEeechhhhcCCHHHHHHHH Confidence 5432 22232211 11233333333333222110 111 11112233456666676778999999999 Q ss_pred HHHHhcCCCCHHHHHHhcCCCCHHH-HHHHHHHHHHhccc-ccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019418. 459 MKMVAAGFATQKRGIAKTLGITEEE-AEKELAEINGELPP-ESDAELALYGKGQQNTVGNSKDTVD 522 (527) Q Consensus 459 ~~~~~aGi~s~~~~i~~~~~~~dee-a~~el~ri~~E~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (527) .+++.+|+|++-+++..+ |+..-+ ..+-+.. ....+ +..+.....+.+.+.+.+...++++ T Consensus 370 ~~~~~~G~~t~NE~R~~~-g~~pi~ggD~~~~~--~n~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 370 RTGIQGGFLKPNEARSKE-DLPPEAGGDRLLVN--GNMLPIDMAGQAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred HHHHhCCCcCHHHHHHHh-CCCCCCCCCeEeec--ccccchhhccccccCCCCCCCCCCCCCCCCC Confidence 999999999999976543 554211 1111000 00000 0000000111111111111111111 No 94 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=99.24 E-value=1.8e-10 Score=73.96 Aligned_cols=408 Identities=13% Similarity=0.161 Sum_probs=176.2 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCC-CcccccccccCccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSK-FDDIEYTNTDGDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~-~~~l~~~~~~~~~~~~~~~~lnl~~~i 79 (527) |+||++|+.||.- ..+. ....+.++. ...+...|.|- +..+. ... ...+...--..+ T Consensus 1 M~~~~r~~~~~~~-----~~r~-----~~~~~~~~~-----~~~~~~~~~g~~~~~~~---v~~----~~al~~~~v~~~ 58 (432) T protein:vir:10 1 MKIVDSVKKFFNF-----EKRQ-----TSQVIELNK-----DDEKLLEWLGISPSTIS---VKG----KNALKVATVFAC 58 (432) T ss_pred CChHHHHHHhcCc-----cccC-----cccccccCC-----chHHHHHHhCCCcCccc---cch----hhhhccHHHHHH Confidence 9999999999751 1111 111222222 12222222231 11110 100 111111222234 Q ss_pred HHHHhhhhhcccceEeeCC-----HHHHHHHHHHHhh-----hhHHHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEEE Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAED-----ETLNDFLSDMLSN-----DRFNKNFERYLESALALGGLAMRPYVDG-DK-IRVAFI 147 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~d-----~~~~~~l~~~l~~-----n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~v 147 (527) ++.+|+-+-+-|..+--.+ ......|..+|.. -.....++.++...+..|.+++.+..+. |+ ..+..+ T Consensus 59 i~~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i 138 (432) T protein:vir:10 59 IKILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPI 138 (432) T ss_pred HHHHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEE Confidence 4555555544444331111 1112223333321 1123344556777788899999998875 33 356667 Q ss_pred cCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceee Q lcl|NC_019418. 148 QAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVN 227 (527) Q Consensus 148 ~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~ 227 (527) +|+++-+.. +..+... .+...||.. .. -|.... T Consensus 139 ~~~~v~v~~-d~~~~~~-----------~~~~~~y~~------------------------------~~-----~g~~~~ 171 (432) T protein:vir:10 139 DASKVTVYI-DDVGLLN-----------SKTKMWYVV------------------------------NT-----GGQQRV 171 (432) T ss_pred cCceeEEEE-cCccccc-----------ccceEEEEE------------------------------ec-----CCeEEE Confidence 777766532 2222111 001111110 00 011100 Q ss_pred cccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcC-cceeeechhHh Q lcl|NC_019418. 228 LSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMG-QRRVIVPEQMT 306 (527) Q Consensus 228 l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~-~~~i~v~~~~l 306 (527) + ++ --..||+.+.+ .+...|+|.+..+...+.....+-....+-|+.| +++-++ T Consensus 172 ~-------~~---------~eiih~r~~~~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil----- 226 (432) T protein:vir:10 172 L-------KP---------EEILHFKNGIT----LDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLV----- 226 (432) T ss_pred E-------cc---------ccEEEecCCCC----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEE----- Confidence 0 00 01345654321 1234699999888887776665444444445553 333222 Q ss_pred cCCCCCCCc-ccccccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccc Q lcl|NC_019418. 307 QLKVQDNQG-NIAFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFD 380 (527) Q Consensus 307 ~~~~~~~~~-~~~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~ 380 (527) ......... .-.....| +..|.+.. .+ -+++..++.++......++.+..+...++|+...|++|..+|.. T Consensus 227 ~~~~~l~~e~~~~~~~~~---~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~ 303 (432) T protein:vir:10 227 QYVGDLNEDAKKVFRENF---ESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDL 303 (432) T ss_pred EcCCCCCHHHHHHHHHHH---HHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC Confidence 211111100 00000111 11122111 00 01122355666666677888888888899999999999999875 Q ss_pred cccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc-CCcccCccceEEEeCCCccCCHHHHHHHH Q lcl|NC_019418. 381 GQGV-KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIY-RGTIPELDDISVNLDDGVFTDRHAELDYW 458 (527) Q Consensus 381 ~~g~-~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~-~~~~~~~~~v~v~f~d~i~~d~~~~~~~~ 458 (527) .++. .++.+... ..++.+|..++..|-...+. .++ .........+.++++.-+..|..+.++.. T Consensus 304 ~~~~~s~~e~~~~-------------~~~~~~l~P~~~~ie~~ln~-kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~ 369 (432) T protein:vir:10 304 SKATLNNIEQQQQ-------------QFYTDTLQATLTMYEQEMTY-KLFLDSELDKGFYSKFNVDAILRADIKTRYEAY 369 (432) T ss_pred CCCCcccHHHHHH-------------HHHHHHHHHHHHHHHHHHHH-hhcChhhcCCCcEEEeechhhhcCCHHHHHHHH Confidence 5432 22232211 11233333333333222110 111 11112233456666676778999999999 Q ss_pred HHHHhcCCCCHHHHHHhcCCCCHHH-HHHHHHHHHHhccc-ccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019418. 459 MKMVAAGFATQKRGIAKTLGITEEE-AEKELAEINGELPP-ESDAELALYGKGQQNTVGNSKDTVD 522 (527) Q Consensus 459 ~~~~~aGi~s~~~~i~~~~~~~dee-a~~el~ri~~E~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (527) .+++.+|+|++-+++..+ |+..-+ ..+-+.. ....+ +..+.....+.+.+.+.+...++++ T Consensus 370 ~~~~~~G~~t~NE~R~~~-g~~pi~ggD~~~~~--~n~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 370 RTGIQGGFLKPNEARSKE-DLPPEAGGDRLLVN--GNMLPIDMAGQAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred HHHHhCCCcCHHHHHHHh-CCCCCCCCCeEeec--ccccchhhccccccCCCCCCCCCCCCCCCCC Confidence 999999999999976543 554211 1111000 00000 0000000111111111111111111 No 95 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=99.24 E-value=1.8e-10 Score=73.96 Aligned_cols=408 Identities=13% Similarity=0.161 Sum_probs=176.2 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCC-CcccccccccCccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSK-FDDIEYTNTDGDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~-~~~l~~~~~~~~~~~~~~~~lnl~~~i 79 (527) |+||++|+.||.- ..+. ....+.++. ...+...|.|- +..+. ... ...+...--..+ T Consensus 1 M~~~~r~~~~~~~-----~~r~-----~~~~~~~~~-----~~~~~~~~~g~~~~~~~---v~~----~~al~~~~v~~~ 58 (432) T protein:vir:10 1 MKIVDSVKKFFNF-----EKRQ-----TSQVIELNK-----DDEKLLEWLGISPSTIS---VKG----KNALKVATVFAC 58 (432) T ss_pred CChHHHHHHhcCc-----cccC-----cccccccCC-----chHHHHHHhCCCcCccc---cch----hhhhccHHHHHH Confidence 9999999999751 1111 111222222 12222222231 11110 100 111111222234 Q ss_pred HHHHhhhhhcccceEeeCC-----HHHHHHHHHHHhh-----hhHHHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEEE Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAED-----ETLNDFLSDMLSN-----DRFNKNFERYLESALALGGLAMRPYVDG-DK-IRVAFI 147 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~d-----~~~~~~l~~~l~~-----n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~v 147 (527) ++.+|+-+-+-|..+--.+ ......|..+|.. -.....++.++...+..|.+++.+..+. |+ ..+..+ T Consensus 59 i~~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i 138 (432) T protein:vir:10 59 IKILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPI 138 (432) T ss_pred HHHHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEE Confidence 4555555544444331111 1112223333321 1123344556777788899999998875 33 356667 Q ss_pred cCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceee Q lcl|NC_019418. 148 QAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVN 227 (527) Q Consensus 148 ~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~ 227 (527) +|+++-+.. +..+... .+...||.. .. -|.... T Consensus 139 ~~~~v~v~~-d~~~~~~-----------~~~~~~y~~------------------------------~~-----~g~~~~ 171 (432) T protein:vir:10 139 DASKVTVYI-DDVGLLN-----------SKTKMWYVV------------------------------NT-----GGQQRV 171 (432) T ss_pred cCceeEEEE-cCccccc-----------ccceEEEEE------------------------------ec-----CCeEEE Confidence 777766532 2222111 001111110 00 011100 Q ss_pred cccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcC-cceeeechhHh Q lcl|NC_019418. 228 LSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMG-QRRVIVPEQMT 306 (527) Q Consensus 228 l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~-~~~i~v~~~~l 306 (527) + ++ --..||+.+.+ .+...|+|.+..+...+.....+-....+-|+.| +++-++ T Consensus 172 ~-------~~---------~eiih~r~~~~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil----- 226 (432) T protein:vir:10 172 L-------KP---------EEILHFKNGIT----LDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLV----- 226 (432) T ss_pred E-------cc---------ccEEEecCCCC----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEE----- Confidence 0 00 01345654321 1234699999888887776665444444445553 333222 Q ss_pred cCCCCCCCc-ccccccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccc Q lcl|NC_019418. 307 QLKVQDNQG-NIAFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFD 380 (527) Q Consensus 307 ~~~~~~~~~-~~~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~ 380 (527) ......... .-.....| +..|.+.. .+ -+++..++.++......++.+..+...++|+...|++|..+|.. T Consensus 227 ~~~~~l~~e~~~~~~~~~---~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~ 303 (432) T protein:vir:10 227 QYVGDLNEDAKKVFRENF---ESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDL 303 (432) T ss_pred EcCCCCCHHHHHHHHHHH---HHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC Confidence 211111100 00000111 11122111 00 01122355666666677888888888899999999999999875 Q ss_pred cccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc-CCcccCccceEEEeCCCccCCHHHHHHHH Q lcl|NC_019418. 381 GQGV-KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIY-RGTIPELDDISVNLDDGVFTDRHAELDYW 458 (527) Q Consensus 381 ~~g~-~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~-~~~~~~~~~v~v~f~d~i~~d~~~~~~~~ 458 (527) .++. .++.+... ..++.+|..++..|-...+. .++ .........+.++++.-+..|..+.++.. T Consensus 304 ~~~~~s~~e~~~~-------------~~~~~~l~P~~~~ie~~ln~-kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~ 369 (432) T protein:vir:10 304 SKATLNNIEQQQQ-------------QFYTDTLQATLTMYEQEMTY-KLFLDSELDKGFYSKFNVDAILRADIKTRYEAY 369 (432) T ss_pred CCCCcccHHHHHH-------------HHHHHHHHHHHHHHHHHHHH-hhcChhhcCCCcEEEeechhhhcCCHHHHHHHH Confidence 5432 22232211 11233333333333222110 111 11112233456666676778999999999 Q ss_pred HHHHhcCCCCHHHHHHhcCCCCHHH-HHHHHHHHHHhccc-ccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019418. 459 MKMVAAGFATQKRGIAKTLGITEEE-AEKELAEINGELPP-ESDAELALYGKGQQNTVGNSKDTVD 522 (527) Q Consensus 459 ~~~~~aGi~s~~~~i~~~~~~~dee-a~~el~ri~~E~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (527) .+++.+|+|++-+++..+ |+..-+ ..+-+.. ....+ +..+.....+.+.+.+.+...++++ T Consensus 370 ~~~~~~G~~t~NE~R~~~-g~~pi~ggD~~~~~--~n~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 370 RTGIQGGFLKPNEARSKE-DLPPEAGGDRLLVN--GNMLPIDMAGQAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred HHHHhCCCcCHHHHHHHh-CCCCCCCCCeEeec--ccccchhhccccccCCCCCCCCCCCCCCCCC Confidence 999999999999976543 554211 1111000 00000 0000000111111111111111111 No 96 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.24 E-value=3.6e-10 Score=72.38 Aligned_cols=450 Identities=10% Similarity=0.065 Sum_probs=203.8 Q ss_pred hcccchhhhccCccccCHHHHHHHHHHHHHhcCCCc---ccccccccC---c------------cccCceeecchHHHHH Q lcl|NC_019418. 19 TTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFD---DIEYTNTDG---D------------RKRRKMQHLPIARTAA 80 (527) Q Consensus 19 ~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~---~l~~~~~~~---~------------~~~~~~~~lnl~~~i~ 80 (527) +..+.....- -.+....-..+..||.+... .+.-+.... + +.++-...-++++.++ T Consensus 1 ~~~p~~~~~~------~~~~~~~~~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av 74 (533) T protein:vir:34 1 MKTPTIPTLL------GPDGMTSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAI 74 (533) T ss_pred CCCchhhhhh------cccccchHHHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Confidence 1111111000 11112233455666654321 111110000 0 0000112337899999 Q ss_pred HHHhhhhhcccceEeeC------------CHHHHH----HHHHHHhh----------hhHHHHHHHHHHHHHhcCCEEEE Q lcl|NC_019418. 81 KKIASLVYNEQAEISAE------------DETLND----FLSDMLSN----------DRFNKNFERYLESALALGGLAMR 134 (527) Q Consensus 81 ~~~A~ll~~e~~~i~~~------------d~~~~~----~l~~~l~~----------n~f~~~~~~~~~~a~~~G~~~~~ 134 (527) +.+++.+.|.-.++... +++.++ .|+.+.++ .+|......++...+.-|.++++ T Consensus 75 ~~~~~nvVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~ 154 (533) T protein:vir:34 75 QLHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQ 154 (533) T ss_pred HHHHHHhhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEE Confidence 99999999975444431 122233 34444322 24777777788888999999999 Q ss_pred EEEeCC-----eeEEEEEcCCceE-EE-EEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCce Q lcl|NC_019418. 135 PYVDGD-----KIRVAFIQAPVFL-PL-QSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLY 207 (527) Q Consensus 135 ~~~d~~-----~~~i~~v~a~~~~-P~-~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~ 207 (527) ..+... ..++..++|+.+- |. ..+++.+..+|.+.. ...-+-|.+...|-. T Consensus 155 ~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~-----~Gr~~aY~i~~~~~~----------------- 212 (533) T protein:vir:34 155 ATWDTSSSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQIND-----SGAALGYYVSEDGYP----------------- 212 (533) T ss_pred eeeccCCCCccceEEEEechhhcCCCCCCCCCCceEeeeEECC-----CCCeEEEEEeecCCC----------------- Confidence 988642 4688999998743 11 112233333332211 111122222222210 Q ss_pred EEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHH Q lcl|NC_019418. 208 RITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDE 287 (527) Q Consensus 208 ~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~ 287 (527) +..+. .++.++ ....++++-+.|+..+ ...+...|+|.|+.++..+..++.-.+. T Consensus 213 ------------~~~~~------~~~~~~---~~~~v~a~~VlH~f~~----~r~gQ~RGis~lapvl~~l~~l~~y~da 267 (533) T protein:vir:34 213 ------------GWMPQ------KWTWIP---RELPGGRASFIHVFEP----VEDGQTRGANVFYSVMEQMKMLDTLQNT 267 (533) T ss_pred ------------Ccccc------ccceee---eeeccChhHeeeeccc----cCCCcccCCchHHHHHHHHHHHHHHHHH Confidence 00000 000000 1122344455555432 2345667999999999999999975543 Q ss_pred HHHH-HHcCcceeeechhHh-----cCC-C-CCCCccccccc------ccccccceee-e---ccCCCCCCCcceEeccc Q lcl|NC_019418. 288 FMWE-IKMGQRRVIVPEQMT-----QLK-V-QDNQGNIAFKR------RFDVEQNVYM-Q---VGAGNMDSGGIVDLTTP 349 (527) Q Consensus 288 ~~~e-~~~~~~~i~v~~~~l-----~~~-~-~~~~~~~~~~~------~~d~~~~~~~-~---~~~~~~~~~~i~~~~~~ 349 (527) -..- .-.+.-..||-...- ... . ..+.+...... .++....+.. + ....+| ..|+.++|. T Consensus 268 el~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pG--e~i~~~~~~ 345 (533) T protein:vir:34 268 QLQSAIVKAMYAATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPG--DSLNLQTAQ 345 (533) T ss_pred HHHHHHHhhhheeeeecCCCcccccccccCCCcccccccccccchhhhhccCcceeeccCceeeecCCC--CeeeecCCC Confidence 3322 222223333311110 000 0 00000000000 0000000000 0 001122 237778888 Q ss_pred cChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHh---hh Q lcl|NC_019418. 350 IRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIK-ELCVSMCELG---KV 425 (527) Q Consensus 350 ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~-~li~~il~~~---~~ 425 (527) -+..+|..-+..+++.|....|+++..++.+-+++ |=..+++.....-......+..+...+- .+.+..+..+ .. T Consensus 346 ~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~-nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~ 424 (533) T protein:vir:34 346 DTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQM-SYSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRV 424 (533) T ss_pred CCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCc Confidence 77788888888899999999999999987775432 1111222333333333333333433332 2323223221 11 Q ss_pred hcccCCcccCcc-----ceEEEe--CCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccc Q lcl|NC_019418. 426 VGIYRGTIPELD-----DISVNL--DDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPE 498 (527) Q Consensus 426 ~~~~~~~~~~~~-----~v~v~f--~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~ 498 (527) ..+..+...+.. -..+.| .--..+|+.++++.....+.+|++|.++.+.+. |.+-+++.+++++-++..... T Consensus 425 i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~-G~D~~ev~~q~a~e~~~~~~~ 503 (533) T protein:vir:34 425 VTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKR-GDDYQEIFAQQVRETMERRAA 503 (533) T ss_pred ccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHc-CCCHHHHHHHHHHHHHHHHhc Confidence 112222111110 123455 445567999999999999999999999988776 877666555544432211111 Q ss_pred cccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 499 SDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 499 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) .-..+..+.. ....+...+..++.+++ T Consensus 504 gl~~~~~~~~--~~~s~~~~~~~~~~~~~ 530 (533) T protein:vir:34 504 GLKPPAWAAA--AFESGLRQSTEEEKSDS 530 (533) T ss_pred CCCCCCCCCc--CccCCCCCCCCCCcccC Confidence 0111111110 00001111111111122 No 97 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=99.20 E-value=1.6e-10 Score=74.33 Aligned_cols=442 Identities=12% Similarity=0.067 Sum_probs=180.7 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHH----HhcC--------------CCccccccccc Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLA----YYQS--------------KFDDIEYTNTD 62 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~----~y~g--------------~~~~l~~~~~~ 62 (527) |++|+++...++-. .-+.++ ....+.-.+++..-+.+.+..... =|.- ++-+..+.... T Consensus 1 ~~~~~~~~~~~~~~--~~~~~~-~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~ 77 (547) T protein:vir:63 1 MGLFESIRLAGVNK--SDAVKH-IEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLH 77 (547) T ss_pred CchhhhhhhhcCCc--cccccc-cccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecccccccCCccCChhHHH Confidence 99999887755410 111222 122222233333333333333211 0100 00000000000 Q ss_pred CccccCceeecchHHHHHHHHhhhhhc--cc---------ceEeeC---------CHHHHHHHHHHHhhh---------h Q lcl|NC_019418. 63 GDRKRRKMQHLPIARTAAKKIASLVYN--EQ---------AEISAE---------DETLNDFLSDMLSND---------R 113 (527) Q Consensus 63 ~~~~~~~~~~lnl~~~i~~~~A~ll~~--e~---------~~i~~~---------d~~~~~~l~~~l~~n---------~ 113 (527) . -.+.....++...+++..|+-+.. -+ -.+.+. +....+.+.+++..- . T Consensus 78 ~--l~~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s 155 (547) T protein:vir:63 78 G--VLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDS 155 (547) T ss_pred H--HHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccch Confidence 0 001112224445555555543321 00 012221 112223455554322 2 Q ss_pred HHHHHHHHHHHHHhcCCEEEEEEEeCC-e-eEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeec Q lcl|NC_019418. 114 FNKNFERYLESALALGGLAMRPYVDGD-K-IRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWV 191 (527) Q Consensus 114 f~~~~~~~~~~a~~~G~~~~~~~~d~~-~-~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~ 191 (527) +...+..++.+.+..|.+++.+..+.+ + ..+.+++|..+.++..+ ++.+. +...+|. ... T Consensus 156 ~~~f~~~lv~d~ll~Gn~~~~i~rd~~G~~~~L~~l~p~~V~~~~~~-~g~~~------------~~~~~y~--~~~--- 217 (547) T protein:vir:63 156 FSSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTA-DGKIP------------DNGNRFV--QVI--- 217 (547) T ss_pred HHHHHHHHHHHHHhhCCEEEEEEECCCCcEEEEEEecCceeEEEECC-ccccc------------cCceEEE--EEc--- Confidence 344556677788888999988888653 3 45777888887775322 22110 0111110 000 Q ss_pred ccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchh Q lcl|NC_019418. 192 TPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIF 271 (527) Q Consensus 192 ~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~ 271 (527) ++..... | + +.+ ..|++.+. .......++|+|.+ T Consensus 218 ------------~~~~~~~---~------------~--------~~e----------iih~r~n~-~~~~~~~~~G~Spi 251 (547) T protein:vir:63 218 ------------DQKIVAT---F------------N--------ARE----------MAFAVRNP-RSDIYATGYGYPEL 251 (547) T ss_pred ------------CCcEEEE---e------------c--------ccc----------EEEecccC-CCCcccccccccHH Confidence 0000000 0 0 000 23343210 01112356799999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCccc---ccccccccccceeeecc-C-----CCCCCCc Q lcl|NC_019418. 272 DNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGNI---AFKRRFDVEQNVYMQVG-A-----GNMDSGG 342 (527) Q Consensus 272 ~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~~---~~~~~~d~~~~~~~~~~-~-----~~~~~~~ 342 (527) ..+...|.....+-.-..+-|..|.. |..+|....+..-... .+...| ...|.+.+ . -.++... T Consensus 252 ~~~~~~i~~~~~a~~~~~~~f~Ng~~----p~giL~~~~~~~ls~e~~~~lk~~~---~~~~~G~~nagk~~vl~~~g~~ 324 (547) T protein:vir:63 252 EIALKQFIAHENTEAFNDRFFSHGGT----TRGILQIKAAQQQSQHALEIFKREW---KNSLSGINGSWQIPVVSAEDVK 324 (547) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCC----cceEEEecCCCCCCHHHHHHHHHHH---HHHhcCcccccccccccCCCce Confidence 88887776555443333334565432 2222211111100000 000111 11122211 1 1122234 Q ss_pred ceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_019418. 343 IVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNS-IVALVEQSIKELCVSMCE 421 (527) Q Consensus 343 i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~-~~~~~~~al~~li~~il~ 421 (527) ++.++......++.+..+...+.|+...|++|..+|+...+..++....+. ++.++.. .+..++.+|.-++..|-. T Consensus 325 ~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~---t~sn~e~~~~~~~~~tL~P~~~~ie~ 401 (547) T protein:vir:63 325 FVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSL---NEGNSAEKNQASKNKGLQPLLGFIED 401 (547) T ss_pred EEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCccccccccccccccc---chhhHHHHHHHHHHHHHHHHHHHHHH Confidence 556666677888999998889999999999999998754432222111111 1111111 112334555555555443 Q ss_pred HhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCH--HHHHHHH-----H----H Q lcl|NC_019418. 422 LGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLGITE--EEAEKEL-----A----E 490 (527) Q Consensus 422 ~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~d--eea~~el-----~----r 490 (527) ..+. .|... ....+.+.|+.....+.. +.....+++.+|+|+.-+++.+. |+.. +....-+ . . T Consensus 402 ~ln~-~L~~~---~~~~~~~~f~~~~~~~~~-~~~~~~~~~~~g~lT~NE~R~~~-gl~P~~egGD~~~~~~~~~~~~~~ 475 (547) T protein:vir:63 402 FINK-HIVAE---FGDKYTFQFVGGDIKSEL-ESVKILAEKAKVAMTVNEVRKEL-NLPGDVIGGDIPLNGVIVQRIGQL 475 (547) T ss_pred HHHh-hcccc---cCCceEEEeeccccccHH-HHHHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCceeeccccccccccc Confidence 3221 12111 123467788776666644 44455677888999999977654 5422 1110000 0 0 Q ss_pred HHHhccc-------ccccccCCCCCCCCCCCCCC---CCCCCccccC Q lcl|NC_019418. 491 INGELPP-------ESDAELALYGKGQQNTVGNS---KDTVDDEDEA 527 (527) Q Consensus 491 i~~E~~~-------~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~ 527 (527) .+.++.. ........++....+++.++ +.+++.++++ T Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 522 (547) T protein:vir:63 476 MQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDG 522 (547) T ss_pred ccccCCccccchhhccccccccCCCCCCCCCCCCCCcccCCCcCccc Confidence 0000000 00000011111111111111 1222223333 No 98 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.20 E-value=6.2e-10 Score=71.07 Aligned_cols=437 Identities=12% Similarity=0.081 Sum_probs=197.9 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcc-----cccccccCc----------c Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDD-----IEYTNTDGD----------R 65 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~-----l~~~~~~~~----------~ 65 (527) |||+.+ .++- +.--.+-..+.+-|.+-... +..+..+.. + T Consensus 1 m~~~~~--~~~a-----------------------~~~~~~~~~~~~~y~aa~~~~~~~~~~~~s~d~~~~~~~~~lr~R 55 (495) T protein:vir:10 1 MNMTPS--GYQS-----------------------LASGLLVPVGASAYEGASGGHRWQDIGDYGPDTAVASGIQTLRAR 55 (495) T ss_pred CCcccc--cccc-----------------------cchhhhhHHHhhhhhccccCcccCCCCCCChhHHHHHHHHHHHHH Confidence 777665 2210 00000111122233331110 000000000 0 Q ss_pred ccCceeecchHHHHHHHHhhhhhcccce--EeeCCHHHHHHHHH----HHh------hhhHHHHHHHHHHHHHhcCCEEE Q lcl|NC_019418. 66 KRRKMQHLPIARTAAKKIASLVYNEQAE--ISAEDETLNDFLSD----MLS------NDRFNKNFERYLESALALGGLAM 133 (527) Q Consensus 66 ~~~~~~~lnl~~~i~~~~A~ll~~e~~~--i~~~d~~~~~~l~~----~l~------~n~f~~~~~~~~~~a~~~G~~~~ 133 (527) .++-...-++++.+++.+.+.+.|...+ ...+++..++.++. +.+ ..+|......++...+.-|.+++ T Consensus 56 aRdl~rNn~~a~~av~~~~~~vVG~Gi~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~ 135 (495) T protein:vir:10 56 SHHNVRNNPWATNAVATWVAAAVGNGLTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFV 135 (495) T ss_pred HHHHHhcChHHHHHHHHHHHhhcCCCcccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEE Confidence 0001123378999999999999887433 33456555555444 433 23577777778888899999998 Q ss_pred EEEEeC--C----eeEEEEEcCCceE-EEEE----cCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeee Q lcl|NC_019418. 134 RPYVDG--D----KIRVAFIQAPVFL-PLQS----NTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTK 202 (527) Q Consensus 134 ~~~~d~--~----~~~i~~v~a~~~~-P~~~----d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~ 202 (527) +..+.. + ..++..++|+.+- |... +++.+..+|.+.. ...-+-|.+...|- T Consensus 136 ~~~~~~~~~g~~~~~~lqliepd~l~~~~~~~~~~~g~~i~~GIe~d~-----~Gr~vaY~i~~~hp------------- 197 (495) T protein:vir:10 136 IKKPRPLSEGLSVPLQLQIIEPDMLASDIPDETLPSGGYVKGGIRFSN-----GGKRKAYCFYRNHP------------- 197 (495) T ss_pred EEeecccCCCCccceEEEEechhhcCCCCCCCCCCCCCEEEeceEECC-----CCceEEEEEeecCC------------- Confidence 877742 1 3689999999852 3211 1223344443311 11223333333331 Q ss_pred cCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHH Q lcl|NC_019418. 203 DKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFIN 282 (527) Q Consensus 203 ~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld 282 (527) +... ..+.... +. -++..-+.|+... ..+...|+|.++-+.. +..+| T Consensus 198 --gd~~------------~~~~~~~----~~---------rvpA~~vlH~f~~-----r~gQ~RGis~la~i~~-l~~l~ 244 (495) T protein:vir:10 198 --AESS------------LIGDPVD----TV---------WIKAEHVLHVTVL-----TVRSDAGAPWFQLLLR-LNELD 244 (495) T ss_pred --Cccc------------ccccccc----ee---------eechhheEecccc-----CCCcccCcchhHHHHH-HHHhh Confidence 0000 0000000 00 0111223344221 2355669999986654 56666 Q ss_pred HHHHHHH-HHHHcCcceeeechhHhcCCCCCCCcccccccccccccceeeec------cCCCCCCCcceEeccccChHHH Q lcl|NC_019418. 283 RTYDEFM-WEIKMGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQV------GAGNMDSGGIVDLTTPIRSSDY 355 (527) Q Consensus 283 ~~~s~~~-~e~~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~------~~~~~~~~~i~~~~~~ir~e~~ 355 (527) .--+.-. ...-.+--..||-... .+...+........+........+ ...+| ..|+.++|.-+..+| T Consensus 245 ~y~dael~~a~i~A~~~~fi~~~~----~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pG--e~i~~~~p~~p~~~~ 318 (495) T protein:vir:10 245 QYEDAELVRKKTAALFAAFIQEAT----ADSTGGPTIGQPKRSKGGKRITGLNPGTLQYLQPG--QEVKFSNPADVGTTY 318 (495) T ss_pred HHHHHHHHHHHHhhhheeeeecCC----CccccccccCccccccCcccceecCCceeeecCCC--CeeeeeCCCCCCCCH Confidence 5333221 2211222223331110 011111000000000000000011 11222 248888888777788 Q ss_pred HHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHH-HHH-HHHHHHHHHHHHHhhhhcccCCcc Q lcl|NC_019418. 356 ISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVA-LVE-QSIKELCVSMCELGKVVGIYRGTI 433 (527) Q Consensus 356 ~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~-~~~-~al~~li~~il~~~~~~~~~~~~~ 433 (527) ..-+..+++.|....|+++..++.+-+++- =..+++............|. .+. ..++.+.+..+..+-+-+. ... T Consensus 319 ~~f~~~~lr~iaaglGi~Ye~ltgD~s~~n-YSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~--i~~ 395 (495) T protein:vir:10 319 EPWLRYQLLSIAKGYGITYEMLTGDLRGVN-YSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGA--VVI 395 (495) T ss_pred HHHHHHHHHHHHhhcCCCHHHHhccccccc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC--CCC Confidence 888889999999999999999876654421 11122222222333333332 122 2223333333333221111 011 Q ss_pred cCcc-----ceEEEe--CCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccc----cccc Q lcl|NC_019418. 434 PELD-----DISVNL--DDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPE----SDAE 502 (527) Q Consensus 434 ~~~~-----~v~v~f--~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~----~~~~ 502 (527) |+.. -+.+.| .--..+|+.++++.....+.+|++|.++.+.+. |.+-+++.+++++-++..... +.+. T Consensus 396 p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~-G~D~~~v~~q~a~e~~~~~~~Gl~~~~~p 474 (495) T protein:vir:10 396 PDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAER-GYDMEELFDMISDANQLIDEYDLRLDSDP 474 (495) T ss_pred CCchhhhHhhhccccccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHc-CCCHHHHHHHHHHHHHHHHHcCCCCCCCC Confidence 1111 134556 333457899999999999999999999988887 887776655554432211111 1111 Q ss_pred cCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 503 LALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 503 ~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) ....+.+... ++.....+++| T Consensus 475 ~~~~~~~~~~---~~~~~~~~~~e 495 (495) T protein:vir:10 475 RYVNGSGAEQ---KSVMEAALNNE 495 (495) T ss_pred CcCCCccCCC---CCCCCCCCCCC Confidence 1111111111 11122222222 No 99 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.17 E-value=4.6e-10 Score=71.79 Aligned_cols=415 Identities=13% Similarity=0.091 Sum_probs=184.2 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHH-----HHHhcCCCcccccccccCccccCceeecch Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHN-----LAYYQSKFDDIEYTNTDGDRKRRKMQHLPI 75 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~-----~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl 75 (527) ||+|-+=|. +.+.+.+.+.+.+.-. ..+|.+. .+.+... ..-+.+-.+ T Consensus 1 ~~~~m~~~~--------------------~~~~~~D~~~~~~~~~~g~~~~~~~~~~--~~~~~~l-----~~~Y~~~~l 53 (435) T protein:vir:79 1 MGVFMSDKV--------------------KAITKEDGYNEIFGSKDGTFRPNAFYMQ--RAAFKAL-----SQFYEEDGM 53 (435) T ss_pred CCccccccc--------------------ccchhhcchhhhhcccccccccCcccCC--cCCHHHH-----HHHHhcCch Confidence 776422110 1111222221111000 0011110 0000000 011233489 Q ss_pred HHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcCCceEEE Q lcl|NC_019418. 76 ARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQAPVFLPL 155 (527) Q Consensus 76 ~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a~~~~P~ 155 (527) ++.+|+..|.-++.+...|+.+++ .+.++..+++-+++..+.+++..+-.+|++++.+-...++.. + =|+ T Consensus 54 ~~~~Vd~~aed~~r~g~~i~g~~~--~~~~~~~~~~l~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~------~--~Pl 123 (435) T protein:vir:79 54 ARRIVDVIPEEMVTPGFKVDGVKN--EKSFKSRWDELRLNAKIIDALSWSRLFGGSAILAVVADNKML------K--SPV 123 (435) T ss_pred hhhhhccchHHhhcCCceecCCCh--HHHHHHHHHHhhHHHHHHHHHHhhhccccEEEEEEecCCCCc------c--ccc Confidence 999999999999999877765433 355667777778889999999999999998887766322211 1 133 Q ss_pred EEcCCceEEEEE-EEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccc-cCceeecccccC Q lcl|NC_019418. 156 QSNTQDVSSAAI-LTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQ-LGERVNLSELYP 233 (527) Q Consensus 156 ~~d~~~~~~~a~-~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~-lG~~v~l~~~~~ 233 (527) .. .+.+..+. +.+. .. +-.++..-. ....+ .....|+|. ..+. -+..|.-+ T Consensus 124 ~~--~g~i~~i~v~d~~-~i--------~~~~~~~dp--~sp~f---g~P~~y~v~-------~~~~~~~~~iH~S---- 176 (435) T protein:vir:79 124 KP--GAQLEDIRVYDRY-QI--------TIHERETNA--RSVRY---GEPKLYKIS-------PGGDIPEFFVHYS---- 176 (435) T ss_pred cc--CCceeeEEeechh-hc--------cchhhccCC--ccccc---CcceEEEEe-------cCCCCCceEEcce---- Confidence 21 22222221 1110 00 000000000 00000 000112221 1110 01111100 Q ss_pred CcccceeecCCCcccEEEecCCccccccCCCccCcchh-hhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCC Q lcl|NC_019418. 234 DLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIF-DNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQD 312 (527) Q Consensus 234 ~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~-~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~ 312 (527) ..+.+.|.+-|. +. ....++||.|++ ..+.+.+..++.+......=+...+.+++--..+-....++ T Consensus 177 ---Rli~~~g~~~p~--~~-------~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~ 244 (435) T protein:vir:79 177 ---RICIIDGERVSN--EK-------RRQNDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQQAVWKARDLALMCDDE 244 (435) T ss_pred ---eEEEecCCcchh--hh-------ccccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccchhHHHhhcCc Confidence 011122221110 00 112467899988 68889999999888776655433333332222221111111 Q ss_pred CCcccccccc---cccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccc-cccccccc-chH Q lcl|NC_019418. 313 NQGNIAFKRR---FDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGM-FTFDGQGV-KTA 387 (527) Q Consensus 313 ~~~~~~~~~~---~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~-~~~~~~g~-~TA 387 (527) .+ ....... +...+.....+-.. +....++.++.++ .-....++....+|+..+|++..- ||...+|. .|+ T Consensus 245 ~~-~~~~~~r~~~~~~~~~~~~~~~i~-~~~e~~e~~~~~l--sgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnstg 320 (435) T protein:vir:79 245 EG-RYAARLRLAQVDDESGVGKAIGID-ATDEEYEVLNSDV--SGVPEFLQEKIDRIVALTGIHEIIIKNKNTGGVSASQ 320 (435) T ss_pred cc-hHHHHHHHHHHHHhcCCCCceeEe-cCCcceEEEeccc--CCHHHHHHHHHHHHHhhhCCCeeeeccCCccccccch Confidence 11 1100000 11111111111111 2222466665443 345677777888899999999765 57766664 455 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHH------- Q lcl|NC_019418. 388 TEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMK------- 460 (527) Q Consensus 388 tei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~------- 460 (527) .+-...+.+....+. +..++..|+.|+..++. ..+++|.|++-...+..+.++...+ T Consensus 321 d~d~~~yyd~i~~~Q--e~~l~p~l~~l~~li~~--------------s~d~~~~f~pL~~~sekEkAei~~~~a~a~~~ 384 (435) T protein:vir:79 321 NTALETFYKLIDRKR--VEDYKPILEFLLPFMIS--------------ETEWSIEFEPLSVPSDKDKAEIMAKNVESVVK 384 (435) T ss_pred hHHHHHHHHHHHHHH--HHHHHHHHHHHHHHhhc--------------CCCCeEEeCCCCCCCHHHHHHHHHHHHHHHHH Confidence 554444444443332 24567777877776542 1367899999888888666554433 Q ss_pred HHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 461 MVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 461 ~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) ++.+|+ ++.+|+++++...-.+..........++ +.++.++. ....|+|++ T Consensus 385 ~~~~g~------------i~~~e~r~~L~~~~~~~~~~~~~~~~~~-~~~d~~~~--~~~e~g~~~ 435 (435) T protein:vir:79 385 LKAEQA------------INLKETRDTLRSICPDLKIMDNDNIELP-EPEDLDPE--PGQEGGLNK 435 (435) T ss_pred HHhcCC------------CCHHHHHHHHHHhccccCCCCcccccCC-ccccCCCC--CCCCCCCCC Confidence 333344 4555555555322222222221111222 11111111 111222222 No 100 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.16 E-value=1.1e-09 Score=69.77 Aligned_cols=445 Identities=12% Similarity=0.088 Sum_probs=205.4 Q ss_pred hhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCc---ccccccccC---c------------cccCceeecchHHHH Q lcl|NC_019418. 18 MTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFD---DIEYTNTDG---D------------RKRRKMQHLPIARTA 79 (527) Q Consensus 18 ~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~---~l~~~~~~~---~------------~~~~~~~~lnl~~~i 79 (527) |....+. .++ -......+..||.+... ...-+.... + +.++-...-++++.+ T Consensus 1 ~~~~~~~--------~~~--~~~~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~a 70 (530) T protein:vir:38 1 MKIPSLV--------GPD--GKTSLREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANA 70 (530) T ss_pred Cccceee--------cCc--cccchHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHH Confidence 1111111 111 12224556667765321 111100000 0 000011233789999 Q ss_pred HHHHhhhhhcccceEeeC------------CHHHHHH----HHHHHh----------hhhHHHHHHHHHHHHHhcCCEEE Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAE------------DETLNDF----LSDMLS----------NDRFNKNFERYLESALALGGLAM 133 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~------------d~~~~~~----l~~~l~----------~n~f~~~~~~~~~~a~~~G~~~~ 133 (527) ++.+++.+.|...++... +.+.++. |+.|.+ ..+|......++...+..|.+++ T Consensus 71 v~~~~~nvVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~ 150 (530) T protein:vir:38 71 VQLHQDHIVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCV 150 (530) T ss_pred HHHHHHHhhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEE Confidence 999999999985444331 2233333 444332 12477777778888899999999 Q ss_pred EEEEeCC-----eeEEEEEcCCceE-EEE-EcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCc Q lcl|NC_019418. 134 RPYVDGD-----KIRVAFIQAPVFL-PLQ-SNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSL 206 (527) Q Consensus 134 ~~~~d~~-----~~~i~~v~a~~~~-P~~-~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~ 206 (527) +..+... ..++..++|+.+- |.. .+++.+..+|.+. ..++ -+-|.+...|-.. T Consensus 151 ~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d----~~Gr-~~aY~i~~~~~~~--------------- 210 (530) T protein:vir:38 151 QATWDSDSTRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKIN----DSGA-ALGYYVSDDGYPG--------------- 210 (530) T ss_pred EeeeccCCCCccceEEEEechhhcCCCCCCCCCCeeEeeeEEC----CCCc-eEEEEEeeccCCC--------------- Confidence 9988643 3688999998743 211 1223333333221 1111 1222222322100 Q ss_pred eEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHH Q lcl|NC_019418. 207 YRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYD 286 (527) Q Consensus 207 ~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s 286 (527) ...+ .|..++. ...++++-+.|+..+ ...+...|+|.|+.++..+..|+.-.+ T Consensus 211 -------------~~~~-------~~~~~~~---~~~v~a~~vlH~f~~----~r~gQ~RGis~lapvl~~l~~l~~y~d 263 (530) T protein:vir:38 211 -------------WMAQ-------NWTYIPR---ELPGGRPSFIHVFEP----MEDGQTRGANAFYSVMEQMKMLDTLQN 263 (530) T ss_pred -------------cccc-------ccceeee---eeccChhHeEeeccc----cCCCcccCCchHHHHHHHHHHHhHHHH Confidence 0000 0111111 123455556666533 234566799999999999999997544 Q ss_pred HHHH-HHHcCcceeeechhHh-----cC-CCCCCCccccccccccccccee---eeccCCCC------CCCcceEecccc Q lcl|NC_019418. 287 EFMW-EIKMGQRRVIVPEQMT-----QL-KVQDNQGNIAFKRRFDVEQNVY---MQVGAGNM------DSGGIVDLTTPI 350 (527) Q Consensus 287 ~~~~-e~~~~~~~i~v~~~~l-----~~-~~~~~~~~~~~~~~~d~~~~~~---~~~~~~~~------~~~~i~~~~~~i 350 (527) .-.. ..-.+--..||-...- .. ...+..+.......+...+.-+ ..+.+.++ ....|+.++|.- T Consensus 264 ael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~ 343 (530) T protein:vir:38 264 TQLQSAIVKAMYAATIESELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLPGDSLNLQSAQD 343 (530) T ss_pred HHHHHHHHhhhheeeeeccCCccccccccccCCcccccccccccchhhhhcccccceeccCceeeecCCCCeeeeeCCCC Confidence 3322 2222222233311110 00 0000000000000000000000 00011111 122488888888 Q ss_pred ChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhh---h Q lcl|NC_019418. 351 RSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQ-SIKELCVSMCELGKV---V 426 (527) Q Consensus 351 r~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~-al~~li~~il~~~~~---~ 426 (527) +..+|..-+..+++.|....|+++..++.+-+++ |=..+++.....-......|..+.. .++.+.+..+..+-+ . T Consensus 344 p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~~-nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i 422 (530) T protein:vir:38 344 TDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQM-SYSTARASANESWAYFMGRRKFVASRQACQMFLCWLEEAIVRRVV 422 (530) T ss_pred CCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccc-cHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCc Confidence 8888888899999999999999999987765432 1111222333333333334443433 223333333322111 1 Q ss_pred cccCCcccCcc-----ceEEEe--CCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHH---hcc Q lcl|NC_019418. 427 GIYRGTIPELD-----DISVNL--DDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLGITEEEAEKELAEING---ELP 496 (527) Q Consensus 427 ~~~~~~~~~~~-----~v~v~f--~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~---E~~ 496 (527) .+.++...+.. -+.+.| .--..+|+.++++.....+.+|+.|.++.+.+. |.+-+++.+++++-.+ +.. T Consensus 423 ~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~-G~D~~~v~~q~a~e~~~~~~~G 501 (530) T protein:vir:38 423 TLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKR-GDDYQEIFAQQVRESMERRAAG 501 (530) T ss_pred cCCCCCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHc-CCCHHHHHHHHHHHHHHHHHcC Confidence 11111111111 123444 445567999999999999999999999988876 8776665555443322 111 Q ss_pred c-ccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 497 P-ESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 497 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) - ...........+..++ ..+.+|.+ T Consensus 502 l~~~~~~~~~~~~~~~~~------~~~~~d~~ 527 (530) T protein:vir:38 502 LNPPAWAAAAFEAGVKKS------NEEEQDGA 527 (530) T ss_pred CCCCCCcccccCCCCCCC------CCCCCCCC Confidence 1 1111111111111111 11111111 No 101 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=99.15 E-value=1.2e-09 Score=69.54 Aligned_cols=460 Identities=11% Similarity=0.038 Sum_probs=188.9 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) =...+-|++.+.. .+. -...++.+...|..||.+... ...+...|+ .....+.-...+ T Consensus 26 ~~~~~~l~~~~~~------~~~-----------~~~~~~~~~~~~~~~~~~~~~-~~~~~~~gr----s~vv~~~v~~~v 83 (763) T protein:vir:95 26 ELSLQALKADLDA------AKP-----------SHTAMMIKVKEWNDLMRIEGK-AKPPKVKGR----SQVQPKLVRRQA 83 (763) T ss_pred hHHHHHHHHHHHh------hhc-----------chhHHHHHHHHHHHhhhcccc-CcccccCCC----ccccCHHHHHHH Confidence 2233344444331 111 123456667778887555432 122223332 223323222222 Q ss_pred HH-Hhhh---hhcccceEee-----CCHHHH----HHHHHHH-hhhhHHHHHHHHHHHHHhcCCEEEEEEEeC------- Q lcl|NC_019418. 81 KK-IASL---VYNEQAEISA-----EDETLN----DFLSDML-SNDRFNKNFERYLESALALGGLAMRPYVDG------- 139 (527) Q Consensus 81 ~~-~A~l---l~~e~~~i~~-----~d~~~~----~~l~~~l-~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~------- 139 (527) +- +++| +++-..-|.+ +|.+.+ .+++-+| ..|+=...+..++..|+..|.+++|+||+. T Consensus 84 e~~~~~l~~~f~~~~~~~~~~P~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k~~W~~~~~~~~~ 163 (763) T protein:vir:95 84 EWRYSALTEPFLGSNKLFKVTPVTWEDVQGARQNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVRVGWNREIRKEKQ 163 (763) T ss_pred HHHHHHHHHhhcCCCcEEEEecCCcchHHHHHHHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEEEeeeeeeeeeee Confidence 22 2222 2232222333 233333 3555433 344555668899999999999999999961 Q ss_pred ------------------------------------------------------------------------CeeEEEEE Q lcl|NC_019418. 140 ------------------------------------------------------------------------DKIRVAFI 147 (527) Q Consensus 140 ------------------------------------------------------------------------~~~~i~~v 147 (527) ++++|+.| T Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~ie~V 243 (763) T protein:vir:95 164 EVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTVEML 243 (763) T ss_pred eehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecCceEEEee Confidence 12355567 Q ss_pred cCCceEEEEEcCC-ceEEEEE-EEEEEeeCCC---cceEEEEEEE-----Eee----ccccc--ccceeeecCCceEEEE Q lcl|NC_019418. 148 QAPVFLPLQSNTQ-DVSSAAI-LTKTIKTENR---KNVYYTLVEF-----HEW----VTPTG--QEVGSTKDKSLYRITN 211 (527) Q Consensus 148 ~a~~~~P~~~d~~-~~~~~a~-~~~~~~~~~~---~~~~yt~lE~-----h~~----~~~~~--~~~~~~~~~~~~~I~n 211 (527) +|..|++= -+.. .+..|-+ +.+.+.+..+ -+..|..++- +.. ...+. ...........-...+ T Consensus 244 ~p~d~~iD-p~a~sD~~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~V~v~ 322 (763) T protein:vir:95 244 NPENIIID-PSCQGDINKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQISDPMRKRVVAY 322 (763) T ss_pred cHHHheec-CCCCCchhhCceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccchhhccCCCcccceEEEE Confidence 77666641 1111 1222222 1111111000 0000111100 000 00000 0000000000111223 Q ss_pred EEEecCCccccCceeecccccCCcccceee------cCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHH Q lcl|NC_019418. 212 ELYKSTSDSQLGERVNLSELYPDLQPVTPI------QGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTY 285 (527) Q Consensus 212 ~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~------~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~ 285 (527) +.|...+-+.=|.....--.+. ....+ ....+.+|+.+++ ++ ..++++|.|+++.++++++.+|..+ T Consensus 323 E~y~~~d~~gdg~~~~~~v~~~---g~~iL~~~~~p~~~~~~PFv~~~~-~p---~~~~~~G~gi~~~~~d~Qr~~N~~~ 395 (763) T protein:vir:95 323 EYWGFWDIEGNGVLEPIVATWI---GSTLIRLEKNPYPDGKLPFVLIPY-MP---VKRDMYGEPDAELLGDNQAVLGAVM 395 (763) T ss_pred EeeeeeccCCcceeEEEEEEEE---cCeeeecccccccCCCcCEEEecc-ee---ecCcccCCchHHHhhHHHHHHHHHH Confidence 4443321111111100000000 00000 1123445665543 11 3467899999999999999999999 Q ss_pred HHHHHHHH-cCcceeeechhHhcCCCCCCCcccccccccccccce-eeeccCCCCCCCcceEec-cccChHHHHHHHHHH Q lcl|NC_019418. 286 DEFMWEIK-MGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNV-YMQVGAGNMDSGGIVDLT-TPIRSSDYISAISEG 362 (527) Q Consensus 286 s~~~~e~~-~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~-~~~~~~~~~~~~~i~~~~-~~ir~e~~~~~~~~~ 362 (527) ++..+.+. ..+.++.|+.+.+... +. .. +.+...+ |.+.. .....++..+ +.+. ..+...++.+ T Consensus 396 ~~~~d~l~~~~~~~~~v~~gav~~~-d~----~~----~~pg~v~~v~~g~---~~~~~~~~~~~p~~~-~~~~~~l~~~ 462 (763) T protein:vir:95 396 RGMIDLLGRSANGQRGMPKGMLDAL-NS----RR----YREGEDYEYNPTQ---NPAQMIIEHKFPELP-QSALTMATLQ 462 (763) T ss_pred HHHHHHHHhhcCCcEEeecccccch-hh----hc----ccCCceEEeeCCC---ChhhhcccccCCCCc-chHHHHHHHH Confidence 99999886 4777899987776321 11 00 1111111 12111 1111222222 2232 2344455555 Q ss_pred HHHHHHhcCCCcccccccccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCc--------- Q lcl|NC_019418. 363 LKLFEMQIGVSSGMFTFDGQG-VKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGT--------- 432 (527) Q Consensus 363 l~~i~~~~g~s~~~~~~~~~g-~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~--------- 432 (527) ...+...+|++....|.++.+ ..||++|....+..-.....+.+.|..+++.+++.++.+...+ ++.. T Consensus 463 ~~~~e~~TGv~~~~~G~~~~~~~~tat~v~~l~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~--~d~~rviRI~g~e 540 (763) T protein:vir:95 463 NQEAESLTGVKAFAGGVTGESYGDVAAGIRGVLDAASKREMAILRRLAKGMSEIGNKIIAMNAVF--LAEHEVVRITNEE 540 (763) T ss_pred HHHHHHhhCcchhhcCcCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh--CCCCcEEEEeCCc Confidence 556677788888777755433 4578877666555555556677788889999999888876542 1110 Q ss_pred c--------cCccceEEEeCCCccCCHH-HHHHHHHHHHh-cC-CCCHHH--HH-HhcCCCCHHHHHHHHHHHHHhcccc Q lcl|NC_019418. 433 I--------PELDDISVNLDDGVFTDRH-AELDYWMKMVA-AG-FATQKR--GI-AKTLGITEEEAEKELAEINGELPPE 498 (527) Q Consensus 433 ~--------~~~~~v~v~f~d~i~~d~~-~~~~~~~~~~~-aG-i~s~~~--~i-~~~~~~~deea~~el~ri~~E~~~~ 498 (527) . ....+|+|.-+- .... +..+..+.+.. .| .+.... .| .+.-...+ ....+..++..+++ T Consensus 541 ~v~v~~~~~~~~~DV~V~~~~---as~~~q~~~~l~~ll~~l~~~~~~~~~~~il~~~~d~~~--~~~~~~~lr~~q~~- 614 (763) T protein:vir:95 541 FVTIKREDLKGNFDLEVDIST---AEVDNQKSQDLGFMLQTIGPNVDQQITLNILAEIADLKR--MPKLAHDLRTWQPQ- 614 (763) T ss_pred cccccHHHhcCCcceEEeccc---chHHHHHHHHHHHHHHHhccccChHHHHHHHHHHHhhhc--hhhhHHHHHhcCCC- Confidence 0 112333333221 1111 11222222211 11 111110 00 00000000 00011111111100 Q ss_pred cccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 499 SDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 499 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) +++... .-.+-+- T Consensus 615 ------------~d~~~q----~qaqle~ 627 (763) T protein:vir:95 615 ------------PDPVQE----QLKQLAV 627 (763) T ss_pred ------------ccchhh----hHHHHHH Confidence 000000 0000000 No 102 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.12 E-value=1.6e-09 Score=68.76 Aligned_cols=456 Identities=10% Similarity=0.024 Sum_probs=203.9 Q ss_pred hHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCC--Cc-----ccccccc-cCc---------- Q lcl|NC_019418. 3 LIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSK--FD-----DIEYTNT-DGD---------- 64 (527) Q Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~--~~-----~l~~~~~-~~~---------- 64 (527) |+..+.+.+. ..... -.. .+......-|.|- .. |...... +.. T Consensus 1 m~~~~~r~~~------------~~a~~----~~~---~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~ 61 (553) T protein:vir:63 1 MTKVTVRKLS------------EVTSG----RPE---QSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADA 61 (553) T ss_pred Ccchhhhhhc------------ccccc----cch---hhhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHH Confidence 2222222221 00000 000 0111112223321 11 1110000 000 Q ss_pred cccCceeecchHHHHHHHHhhhhhcccceEeeC---------C----HHHHH----HHHHHHh----------hhhHHHH Q lcl|NC_019418. 65 RKRRKMQHLPIARTAAKKIASLVYNEQAEISAE---------D----ETLND----FLSDMLS----------NDRFNKN 117 (527) Q Consensus 65 ~~~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~---------d----~~~~~----~l~~~l~----------~n~f~~~ 117 (527) +.++-...-++++.+++.+++.+.|...+.... + +..++ .|+.|.+ ..+|... T Consensus 62 RaRdL~rNn~~a~~av~~~~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~ 141 (553) T protein:vir:63 62 RGRDMADNDGFTNGAVGYQRDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGL 141 (553) T ss_pred HHHHHHhcChHHHHHHHHHHHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHH Confidence 000011233789999999999999985544321 1 12222 3344432 2247777 Q ss_pred HHHHHHHHHhcCCEEEEEEEeC--C---eeEEEEEcCCceE-EE-EEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEee Q lcl|NC_019418. 118 FERYLESALALGGLAMRPYVDG--D---KIRVAFIQAPVFL-PL-QSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEW 190 (527) Q Consensus 118 ~~~~~~~a~~~G~~~~~~~~d~--~---~~~i~~v~a~~~~-P~-~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~ 190 (527) ...++...+..|.+++++.+.. + ..++..++|+.+- |. ..+++.+..+|.+.. ...-+-|.+...|- T Consensus 142 q~l~~r~~~~dGE~~~~~~~~~~~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE~d~-----~Gr~vaY~i~~~hP- 215 (553) T protein:vir:63 142 IRLGVVGYVKTGEVLATAEWDRAANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQYDK-----RGRPQGYWIQVAHP- 215 (553) T ss_pred HHHHHHHHHhCCceEEEeeeccCCCCcccceEEEechhhcCCCCCCCCCCeeEeeeEECC-----CCceEEEEeeccCC- Confidence 7778888899999999998853 2 3688899998753 11 112333444443311 11222333333331 Q ss_pred cccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcch Q lcl|NC_019418. 191 VTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSI 270 (527) Q Consensus 191 ~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~ 270 (527) + ..+... .....|..++. ...++.+-+.|+.-+ ...+...|+|. T Consensus 216 g-------------d~~~~~----------------~~~~~~~r~~~---~~~v~a~~vlH~f~~----~r~gQ~RGis~ 259 (553) T protein:vir:63 216 G-------------DLYQMA----------------PDMYKWKFVQQ---SKPWGRRQVIHILEP----REPDQSRGIAD 259 (553) T ss_pred C-------------cccccc----------------ccccceeeecc---ccccChhHheecccc----cCCCcccCCch Confidence 0 000000 00000111111 122344444444322 23456679999 Q ss_pred hhhhHHHHHHHHHHHHHHHH-HHHcCcceeeechh-----HhcCCCCC--CCcccccc-ccccc------ccc-e-eee- Q lcl|NC_019418. 271 FDNAKTTIDFINRTYDEFMW-EIKMGQRRVIVPEQ-----MTQLKVQD--NQGNIAFK-RRFDV------EQN-V-YMQ- 332 (527) Q Consensus 271 ~~~~~~lid~ld~~~s~~~~-e~~~~~~~i~v~~~-----~l~~~~~~--~~~~~~~~-~~~d~------~~~-~-~~~- 332 (527) |+.++..+..|+.-.+.-.. ..-.+--..||-.. .......+ ++...... ...+. ... + ..+ T Consensus 260 lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG 339 (553) T protein:vir:63 260 IVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESELPPEFIHSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQIDGA 339 (553) T ss_pred HHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCChhhhhhhcccccccccccccccccccccccccccccceeecCc Confidence 99999999999975544332 22223333444111 11000000 00000000 00000 000 0 000 Q ss_pred --ccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccc-c-chHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019418. 333 --VGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQG-V-KTATEIVSENSDTYQMRNSIVALV 408 (527) Q Consensus 333 --~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g-~-~TAtei~s~~~~~~~~~~~~~~~~ 408 (527) +...++ ..|+.++|.-+..+|..-...+++.|....|+++..++.+-++ . .++-....... ......|..| T Consensus 340 ~i~~L~pG--e~i~~~~p~~p~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS~R~~~~e~~---r~~~~~q~~~ 414 (553) T protein:vir:63 340 KIPHLFPG--TKLNLKPMGTPGGVGSEFEASLNRHLASAFGMSYEEFTRDFSKANYSSIQAGIAMTR---RFLEGRKKMC 414 (553) T ss_pred eeeecCCC--CeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHH---HHHHHHHHHH Confidence 001122 2478888888888888889999999999999999998776543 2 12222222223 3333333334 Q ss_pred HHHH-HHHHHHHHHHhhhh---cccCCcccCc--------cceEEEeC--CCccCCHHHHHHHHHHHHhcCCCCHHHHHH Q lcl|NC_019418. 409 EQSI-KELCVSMCELGKVV---GIYRGTIPEL--------DDISVNLD--DGVFTDRHAELDYWMKMVAAGFATQKRGIA 474 (527) Q Consensus 409 ~~al-~~li~~il~~~~~~---~~~~~~~~~~--------~~v~v~f~--d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~ 474 (527) ...+ +.+.+..|..+-+- .+..+..... .-+.+.|- --..+|+.++++.....+.+|+.|.++.+. T Consensus 415 ~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~t~~~~~a 494 (553) T protein:vir:63 415 ADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVMRIDAGLSTYEREIA 494 (553) T ss_pred HHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHH Confidence 3333 33334333322211 1111110000 01234553 334468999999999999999999999988 Q ss_pred hcCCCCHHHHHHHHHHHHH---hcc-cccccccCCCCCC-----CCCCCCCCCCCCCccc Q lcl|NC_019418. 475 KTLGITEEEAEKELAEING---ELP-PESDAELALYGKG-----QQNTVGNSKDTVDDED 525 (527) Q Consensus 475 ~~~~~~deea~~el~ri~~---E~~-~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~ 525 (527) +. |.+-+++.+++++-.+ +.. +.+.+.......+ .+.+.+...++..++| T Consensus 495 ~~-G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (553) T protein:vir:63 495 RL-GGDFRKSFAQRAREDALLKKYGLTFNLSAKRSLGDGRDAATGIAEDPAAAQTSQQGE 553 (553) T ss_pred Hh-CCCHHHHHHHHHHHHHHHHHcCCCCCCCCccccCCCcccCCCCCCCCCCCCcccccC Confidence 88 8776665555443322 111 1111111011111 1111111122222222 No 103 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=99.11 E-value=1.8e-09 Score=68.56 Aligned_cols=470 Identities=12% Similarity=0.057 Sum_probs=201.2 Q ss_pred CChHHHHHHHHHHHHH--HhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHH- Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRY--NMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIAR- 77 (527) Q Consensus 1 m~~~~~~k~~~~~~~~--~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~- 77 (527) |++-.+-.+.++..+. .-++..++.... .....+...+++|+..|.--...-++ .-|+.+..-++|+..|| T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~v~~~~~----~~~~~r~~~~~~w~e~~~yi~~~~tr--~t~~~~~~w~~s~t~~k~ 74 (599) T protein:vir:31 1 MSTDIKTLQKMLEGRDDDRAFIDELVVLFT----NMENARAQKDREDKELMDYIDATDTR--KTSNSKLPFKNSTTINKL 74 (599) T ss_pred CccchHHHHHHhhccCchHHHHHHHHHHHH----hhhhhhhhhhcccHHHHHHHhhhccc--ccccCCCCcccccchHHH Confidence 5542222222111000 000000000000 11223344455565543321111122 22333445566667766 Q ss_pred -HHHHHHhhhhhcc----cceEee-----CCH------HHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-- Q lcl|NC_019418. 78 -TAAKKIASLVYNE----QAEISA-----EDE------TLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-- 139 (527) Q Consensus 78 -~i~~~~A~ll~~e----~~~i~~-----~d~------~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-- 139 (527) .+++.+..++++- .-=+.+ +++ ....+++.-|...+|...+...+-+-+.+|.++.++-+.. T Consensus 75 ~~~~~~l~a~~~~~~fp~~~w~d~~~~~~~~~~~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G~~vat~~~er~~ 154 (599) T protein:vir:31 75 AHLHLMITTSYMEHLLPNRNWVDFVGFDNDSVNAEKREIARSYVRGKVEASNLEGVIERMVDDFAVRGFCVAHTRHVKRM 154 (599) T ss_pred HHHHHHHHHHHHhhhcCCccceEeeecCCchhHHHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccCceeEeeeEEEcc Confidence 3455554443332 111222 111 1234456667777899999999999999998888766531 Q ss_pred ------------CeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCC--------cceEEEEEE-----EEeecc-- Q lcl|NC_019418. 140 ------------DKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENR--------KNVYYTLVE-----FHEWVT-- 192 (527) Q Consensus 140 ------------~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~--------~~~~yt~lE-----~h~~~~-- 192 (527) -+|+++.|+|..+||= -+......++++.+.+.+-.+ ...||. +| .|+... T Consensus 155 ~~~~d~~v~~~~~~P~~ervsP~Di~~D-p~A~si~d~~fivRs~~Tk~~L~~l~~~~~~~~y~-~d~~~~~~~~~~~~~ 232 (599) T protein:vir:31 155 TVTAENQVIKNYSGTVTERLSPSDVFWD-VTADSLPKAAKCIRQLYTLGSLKREIEEGTFPLMS-MEDFQKLREERRTIR 232 (599) T ss_pred eeecccccccccccceEEeecccceeeC-CCCCCCCcceeeeehhhhHHHHHHHhccCCccccc-hHHHHHHHhhccCCC Confidence 1488999999999971 122334445555554332110 001121 01 011110 Q ss_pred --cccccce-eeec----CCceEEEEEEEecCCccccCceeeccccc----C----Cccc--ceeecC------------ Q lcl|NC_019418. 193 --PTGQEVG-STKD----KSLYRITNELYKSTSDSQLGERVNLSELY----P----DLQP--VTPIQG------------ 243 (527) Q Consensus 193 --~~~~~~~-~~~~----~~~~~I~n~ly~~~~~~~lG~~v~l~~~~----~----~l~~--~~~~~g------------ 243 (527) ....... ...+ ++.|.|.. .|. + | +|.+-+.| + .+.. .+++.| T Consensus 233 ~~~~d~~~~~~g~D~~~~d~~~~~~e-Y~~---~---~-~VevLeywGd~ydee~d~~~~~~ViTi~g~~~liR~e~np~ 304 (599) T protein:vir:31 233 EALADGYNGRRKFDSLHKKGYGSMMN-YIN---E---G-VVEVLTFMGDFYDEENDELWNNYEITVIDRKIIGRKQSKDT 304 (599) T ss_pred ccccchhhhhhhccccccccccchhh-hcc---c---c-hhhhhhhhhhhhcccCCccccceEEEEecCcEEeecccCCC Confidence 0000000 0000 11111110 000 0 0 11111111 1 0111 111111 Q ss_pred -CCc-cc--EEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCccccc Q lcl|NC_019418. 244 -LSR-PL--FTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGNIAF 319 (527) Q Consensus 244 -~~~-p~--f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~~~~ 319 (527) .++ |+ ..|.|. ..+.||.++++.+.++++.||.++....+.+.. ++.+ ++....+-.... T Consensus 305 ~~g~~Pyvv~~~~P~-------~~~~yG~G~l~~~~gaQ~~lN~~~Ng~iD~~~~-----~l~p-~l~~~~dl~~eD--- 368 (599) T protein:vir:31 305 WDGSQNLHIAVYEFQ-------KDTLCPIGPLHRLTGMQYKLDKRENFREDLHDR-----FLHP-SLKKVGDVREKG--- 368 (599) T ss_pred CCCCCCeEEEEeeee-------ccccCCCCCchhcchHHHHHHHHHHHhhhhhhh-----hhcc-cccccccccccC--- Confidence 111 22 122222 246789999999999999999998888776543 1111 111111111100 Q ss_pred ccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHH Q lcl|NC_019418. 320 KRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQ 399 (527) Q Consensus 320 ~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~ 399 (527) ..+.++...+. ++.+.++.+.|..+.-+-..-++.+...++..+|.++...|..+.|.+||+++.......-. T Consensus 369 -~~~~P~~v~~~------~d~~~vq~~~p~s~~~~a~~~is~~e~~mee~sGvp~~~~G~~~ag~~TA~~is~l~naa~~ 441 (599) T protein:vir:31 369 -MRGGPNHVFEV------EETGDVQYMTPPAEVLQPDNQLSITLQLMEDLSGAPKESIGQRTAGEKTKFEVQLLDQGQNK 441 (599) T ss_pred -ccCCCCcceee------cCCCccccccCchhhhhHHHHHHHHHHHHHHhhccchhhcCCcccchhhHHHHHHHHhhhhh Confidence 11122211111 23344566666543333333344455556778899999999998888999999888877777 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHHHhhhhcccCCc--c---cC------ccceEEEe----CCCccC------CHHHHHHH Q lcl|NC_019418. 400 MRNSIVALVEQSI-KELCVSMCELGKVVGIYRGT--I---PE------LDDISVNL----DDGVFT------DRHAELDY 457 (527) Q Consensus 400 ~~~~~~~~~~~al-~~li~~il~~~~~~~~~~~~--~---~~------~~~v~v~f----~d~i~~------d~~~~~~~ 457 (527) .+.++.+.|.+.+ +.|++.+++.... +.+.. + .+ ..+|+.+. .+.++. .++.-.+. T Consensus 442 ~~~~~vr~~e~~~lepll~~l~e~~~~--f~D~~~tiri~~~e~~~~~f~~i~redl~~~~~~v~~Ga~~v~ere~~~q~ 519 (599) T protein:vir:31 442 VFRRKVKKFERELLTPVLNDYLEQGRN--HLDASDTIKTFNSELGTATFLDITADDLNLNGQMVAQGATLFAEKANTLQN 519 (599) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHh--hcccccceeeecccccceeeEEeehhhhhCCeeeeechhhHHHHHHHHHHH Confidence 7788888887766 5599988877653 11110 0 00 11111110 001111 11221222 Q ss_pred HHHHHhc--C--C---CCHHH---HHH---hcCC---------CCHHHHHHHHHHHHHhcccc---cccccCCCC-CCCC Q lcl|NC_019418. 458 WMKMVAA--G--F---ATQKR---GIA---KTLG---------ITEEEAEKELAEINGELPPE---SDAELALYG-KGQQ 511 (527) Q Consensus 458 ~~~~~~a--G--i---~s~~~---~i~---~~~~---------~~deea~~el~ri~~E~~~~---~~~~~~~~~-~~~~ 511 (527) +.+...+ | + |+.+. ++. .++. +.+.|.+..+++++-++..+ .+...+.|+ +..+ T Consensus 520 l~~il~~~~~q~~~P~~~~k~l~~~l~~~~~l~~~~~~~~~va~~eqq~~~~m~Q~~lq~~~~~~~~~~~~~~~~~~~~~ 599 (599) T protein:vir:31 520 LNAILGGPLGAALAPHMSRTKLFNAVEYLGDLDAYGIFTFGIGVQEDQQLARMAQKSTQQTEETALTQEEVGGPTTDTGQ 599 (599) T ss_pred HHHHhcccCCCccchhhHHHHHHHHHHHHHhccccccCCCchhHHHHHHHHHHHHHHHHHhHhhhhhhhhcCCCCcccCC Confidence 3333321 1 1 33321 111 2222 22333333333332222211 111111111 1111 No 104 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=99.10 E-value=1.2e-09 Score=69.41 Aligned_cols=481 Identities=11% Similarity=0.067 Sum_probs=216.9 Q ss_pred CC-hH-HHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc-CccccCceeecchHH Q lcl|NC_019418. 1 MS-LI-QKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD-GDRKRRKMQHLPIAR 77 (527) Q Consensus 1 m~-~~-~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~-~~~~~~~~~~lnl~~ 77 (527) +. +- ..+.+|.+. +.-.++......++.+||.|. .|..-... -.-..+..++.|+=+ T Consensus 17 ~~~~~~~~l~~~~~~------------------~~~~~~~r~~a~~d~~fy~G~--Qw~~~~~~~l~~~g~p~~~~N~i~ 76 (714) T protein:vir:10 17 TPRFSQRQLLSLCSD------------------IDSQPLWRDAANKACAYYDGD--QLAPEVIQVLKDRGQPMTIHNLIA 76 (714) T ss_pred hhhhhHHHHHHHHHH------------------HhhhHHHHHHHHHHHHhhcCC--CCCHHHHHHHHhcCCCcEEeccHH Confidence 11 11 112222111 111233445566677899884 34211111 111235668889999 Q ss_pred HHHHHHhhhhhcccceEeeC----CH---HHHHHH----HHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC----Cee Q lcl|NC_019418. 78 TAAKKIASLVYNEQAEISAE----DE---TLNDFL----SDMLSNDRFNKNFERYLESALALGGLAMRPYVDG----DKI 142 (527) Q Consensus 78 ~i~~~~A~ll~~e~~~i~~~----d~---~~~~~l----~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~----~~~ 142 (527) .+|+...++--...+.+.+. ++ ++++.| ..+.+.++.......+...++..|-+|+.+++|. +.+ T Consensus 77 ~~v~~v~g~~~~nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~d~d~~~~~i 156 (714) T protein:vir:10 77 PTVDGVLGMEAKTRTDLIVMSDDPNDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSEPFGPEF 156 (714) T ss_pred HHHHHHHHHHHhCCcceEEecCCCChhhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcccceEEeeeccCCCCCCe Confidence 99999999888777777663 11 244544 4455677888888999999999999999999973 468 Q ss_pred EEEEEcCCceEEEEEcC--CceEEEEEEE--EEEeeC------C------------------------------------ Q lcl|NC_019418. 143 RVAFIQAPVFLPLQSNT--QDVSSAAILT--KTIKTE------N------------------------------------ 176 (527) Q Consensus 143 ~i~~v~a~~~~P~~~d~--~~~~~~a~~~--~~~~~~------~------------------------------------ 176 (527) +|.+|+|..++.= .+. .....|-++. +++..+ + T Consensus 157 ~i~~v~p~~v~~D-p~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (714) T protein:vir:10 157 KVSTVSRNEVFWD-WLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEY 235 (714) T ss_pred EEEecChhheeec-cccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcccchhhhhhcccccccchhh Confidence 9999999998841 111 1111221111 000000 0 Q ss_pred ------------CcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCccc------- Q lcl|NC_019418. 177 ------------RKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQP------- 237 (527) Q Consensus 177 ------------~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~------- 237 (527) ......++.|+. ........+ .....|.+. .| +...++..+.+..-...+.. T Consensus 236 ~~~~~~~~~~~~~~~~rV~v~E~w--~k~~~~~~~--~~~~~g~~~--~~---d~~~~~~~~~~~~g~~~~~~~~~~rv~ 306 (714) T protein:vir:10 236 QSWDRQQNEWLQRERRRVLLQVVY--YRTFERLPV--IELSNGRVV--AF---DKNNLMQAVAVASGRVQVKVGRVSRIR 306 (714) T ss_pred cccccccccccccCcceEEEEEEE--EeEEEEEEe--ecCCCCCee--ee---CccCHHHHHHHHhccceecccceeeEE Confidence 000011122210 000000000 000001000 01 01111000000000000000 Q ss_pred ceeec-------C---CCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhc Q lcl|NC_019418. 238 VTPIQ-------G---LSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQ 307 (527) Q Consensus 238 ~~~~~-------g---~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~ 307 (527) ...++ + ++.-.|.|+|....-....+.|+| .+.++++.++.+|...|+..+-+ +..++++.+..+. T Consensus 307 ~~~~~g~~~L~~~~~p~p~~~fp~vP~~g~~~~~~g~~~G--~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~gav~ 382 (714) T protein:vir:10 307 EAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG--LISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQ 382 (714) T ss_pred EEEEecchhhhcCCCCCCCCceeeEEecceeeeccCccce--ehhhhhhHHHHHHHHHHHHHHHH--hCCceeecccccc Confidence 00011 1 111124444443322222334565 59999999999999999998865 3334555444331 Q ss_pred CCCCCCCccccccccccccc-ceeeecc-CCCCCCCcceEeccc-cChHHHHHHHHHHHHHHHHhcCCCccccccccccc Q lcl|NC_019418. 308 LKVQDNQGNIAFKRRFDVEQ-NVYMQVG-AGNMDSGGIVDLTTP-IRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGV 384 (527) Q Consensus 308 ~~~~~~~~~~~~~~~~d~~~-~~~~~~~-~~~~~~~~i~~~~~~-ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~ 384 (527) .. +...-+ ... .++- -.|.+.. .+......++..++. ++ ..+...++.....|...+|++...+|..++ . T Consensus 383 ~~-d~~~~e--~~~--rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~llq~~~~~i~~~tGv~~~~lG~~~n-a 455 (714) T protein:vir:10 383 LS-DNDLME--QLE--RPDGIIKLNPVRKNQKSVADVFRVEQDFQVA-SQQFQVMQESEKLIQDTMGVYSAFLGQDSG-A 455 (714) T ss_pred cc-HHHHHH--hcc--CCCCeEEecccccccCCccccccccCCCCCc-HHHHHHHHHHHHHHHHhhCCCHHHcCCCcc-h Confidence 10 000000 000 0110 0121111 111112345555533 44 457888988888999999999999887643 4 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh----c---ccC--Cccc--------------------- Q lcl|NC_019418. 385 KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVV----G---IYR--GTIP--------------------- 434 (527) Q Consensus 385 ~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~----~---~~~--~~~~--------------------- 434 (527) .++.+|.+...........+...+..+.+.+.+.++.+...+ . +-+ +... T Consensus 456 ~SGvAI~~r~~qg~~~l~~~~dnl~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~ 535 (714) T protein:vir:10 456 TSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDIS 535 (714) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCCccccccce Confidence 567778877777777777777778888887777777665321 1 100 0000 Q ss_pred -CccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHH------HHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCC Q lcl|NC_019418. 435 -ELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQK------RGIAKTLGITEEEAEKELAEINGELPPESDAELALYG 507 (527) Q Consensus 435 -~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~------~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~ 507 (527) ..++|+|+=..+.+.-+++.++.++++..+ +.+. ..+.++-++.- +.+.+++|++-.....+.. .+.. T Consensus 536 ~~~~dv~i~~~p~~~s~r~~~~~~l~ql~~~--~~p~~~~~~~~~~le~~d~p~--~~ei~~~ir~~~~~~~~~~-~~~~ 610 (714) T protein:vir:10 536 RLNTHIALAPVQQTPAFKAQLAQRMSEVIQG--LPPQVQAVVLDLWVNLLDVPQ--KQEFVERIRAALGTPKSPD-EMTP 610 (714) T ss_pred eeeEEEEEeeccCcHHHHHHHHHHHHHHHhh--cCchhhhhHHHHHHHhcCCcC--HHHHHHHHHHHcCCCCCcc-ccCc Confidence 011222222222223234445555555532 2222 22334434431 3345566665443211100 0000 Q ss_pred CCCCCCCCCCCCCCCccc------cC Q lcl|NC_019418. 508 KGQQNTVGNSKDTVDDED------EA 527 (527) Q Consensus 508 ~~~~~~~~~~~~~~~~~~------~~ 527 (527) ++++-.....--.....+ .+ T Consensus 611 e~q~~q~~~~~~~~~q~~l~~~e~~a 636 (714) T protein:vir:10 611 EEQEVAAQQQALQQQQAELQMREMAG 636 (714) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000000000000000 00 No 105 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=99.10 E-value=1.1e-09 Score=69.78 Aligned_cols=421 Identities=14% Similarity=0.124 Sum_probs=165.4 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchH--HH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIA--RT 78 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~--~~ 78 (527) ||||+++ |.+ ......+...-. .+..+.. ..+..|. ....|..... ...|..| -. T Consensus 1 Mg~~~~l---~~~--------~~~~~~~~~~~~----~~~~~~~-~~~~~~~------~~~~g~~v~~-~~al~~~~v~~ 57 (457) T protein:vir:62 1 MGFWSAL---FGR--------GHSPALDAAEGR----AWEPYDP-SIYNLGA------TASSGERVTP-HDALQVSAVFA 57 (457) T ss_pred Cchhhhh---hcc--------cccccccccccc----ccccchh-hhhhccc------cccCCceech-HHhhccHHHHH Confidence 9998865 321 100100000000 0000100 0111111 1111211100 0111222 12 Q ss_pred HHHHHhhhhhcccceEe---------eCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCee-EEEEEc Q lcl|NC_019418. 79 AAKKIASLVYNEQAEIS---------AEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKI-RVAFIQ 148 (527) Q Consensus 79 i~~~~A~ll~~e~~~i~---------~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~-~i~~v~ 148 (527) .++.+|+-+-+=|..+- ++.......+..--..-.....++..+...+..|.+++.+..+++++ .+..++ T Consensus 58 ~i~~ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~~g~~~~l~~l~ 137 (457) T protein:vir:62 58 SVRLLSETIATLPLSTYSKRGGTRKEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWAGPNIAGLDVLD 137 (457) T ss_pred HHHHHHHhHhhCceEEEEecCCccccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeCCCcEEEEEEEc Confidence 33344443333333321 11111222222110111234445566777788899998887766654 455566 Q ss_pred CCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeec Q lcl|NC_019418. 149 APVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNL 228 (527) Q Consensus 149 a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l 228 (527) |+++-+.....++. ....||. |.+. . -|....+ T Consensus 138 p~~v~v~~~~~~~~--------------~~~~~~~-----------------------y~~~-----~-----~g~~~~~ 170 (457) T protein:vir:62 138 PTKIHVHMVMVDGL--------------RRKVFEA-----------------------YDID-----A-----DGNEVLL 170 (457) T ss_pred CcceEEEEeccCCc--------------cceeEEE-----------------------EEEc-----c-----CCceeEE Confidence 66554422111110 0011110 0000 0 0000000 Q ss_pred ccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcC Q lcl|NC_019418. 229 SELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQL 308 (527) Q Consensus 229 ~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~ 308 (527) .... +. -+.||+.+.++ +...|+|.+.-+...|.....+-.-..+-|..|.. |..++.. T Consensus 171 ~~~~---~~----------eiih~r~~~~~----~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~----p~gil~~ 229 (457) T protein:vir:62 171 GWFT---PR----------DVLHIPGMMLP----GDFVGCSPISYARESIGLALAAQKYGAHFFRNGAM----PGAVVEV 229 (457) T ss_pred EeeC---cc----------ceEEecCCCCC----CceecccHHHHHHHHHHHHHHHHHHHHHHHhccCC----cceEEEc Confidence 0000 00 13466644333 22468998887777766555444433344555332 2222222 Q ss_pred CCCCCCcc-cccccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccccc Q lcl|NC_019418. 309 KVQDNQGN-IAFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQ 382 (527) Q Consensus 309 ~~~~~~~~-~~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~ 382 (527) ........ -.....| +..|.+.+ .+ -.++..++.++......++.+..+....+|+...|++|..+|+..+ T Consensus 230 ~~~ls~e~~~~~~~~~---~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~ 306 (457) T protein:vir:62 230 PGTMSEEGLARAREAW---RAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATN 306 (457) T ss_pred CCCCCHHHHHHHHHHH---HHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC Confidence 11111000 0000011 11122211 00 0112245566666667788888888888999999999999987665 Q ss_pred ccchHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHH Q lcl|NC_019418. 383 GVKTATEIVSENSDTY-QMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKM 461 (527) Q Consensus 383 g~~TAtei~s~~~~~~-~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~ 461 (527) +..+...+.......+ .++.-+...|+.+|... +..........+.++++.-+-.|..+.++...++ T Consensus 307 ~~~~~sn~eq~~~~f~~~~l~P~~~~ie~~ln~~------------L~~~~~~~~~~i~fd~~~l~~~d~~~r~~~~~~~ 374 (457) T protein:vir:62 307 STSWGSGLAEQNIAFTMFSLRPWLERIEAGFNRL------------LFAETADRFRFVKFNLDEIKRGAPKERMELWSLG 374 (457) T ss_pred cccccchHHHHHHHHHHHHHHHHHHHHHHHHHhh------------hcCccccCceEEEeechhhhccCHHHHHHHHHHH Confidence 5432222211111111 12222222222222211 1111112233456666676667889999999999 Q ss_pred HhcCCCCHHHHHHhc--CCCCHHHHHHHH-----HHHHHhcccc-cccccC-CCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 462 VAAGFATQKRGIAKT--LGITEEEAEKEL-----AEINGELPPE-SDAELA-LYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 462 ~~aGi~s~~~~i~~~--~~~~deea~~el-----~ri~~E~~~~-~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) +.+|+|++-+++... .++.+..+.+.+ ..+.....+. .....+ -+..+++++..+..+..++.||+ T Consensus 375 ~~~G~~T~NE~R~~~gl~pi~~g~~D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 449 (457) T protein:vir:62 375 LQNGIYSIDEVRAAEDMTPLPDGLGEKYRVPLNLGEIGEEPEPEPAPAPPAIDPPAEEPADDEEPDNAEGDPDEG 449 (457) T ss_pred HhCCCcCHHHHHHHhCCCCCCCCCcceeeeccccccccccccccccCCCccCCCCccCCCCCCCCCCCCCCCccc Confidence 999999999977654 223322111111 1111100000 000000 01111111111112222222222 No 106 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=99.09 E-value=2.5e-09 Score=67.72 Aligned_cols=405 Identities=12% Similarity=0.130 Sum_probs=174.4 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCC-CcccccccccCccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSK-FDDIEYTNTDGDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~-~~~l~~~~~~~~~~~~~~~~lnl~~~i 79 (527) |++|+++-+|.++.. .+...++. ...|...|.|- ...+. .... .-+...--..+ T Consensus 1 M~~~~~~f~~~~r~~-------------~~~~~~~~-----~~~~~~~~~g~~~~~~~---v~~~----~al~~~~v~~~ 55 (429) T protein:vir:10 1 MDSVKKFFNFEKRQT-------------SQVIELNK-----DDEKLLEWLGISPSTIS---VKGK----NALKVATVFAC 55 (429) T ss_pred CchhhhhhcccccCc-------------ccccccCC-----ChHHHHHHhcCCCCcce---echh----hhhccHHHHHH Confidence 999888888766411 01112221 11122222231 11111 1111 11111222234 Q ss_pred HHHHhhhhhcccceEeeCC-----HHHHHHHHHHHhh-----hhHHHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEEE Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAED-----ETLNDFLSDMLSN-----DRFNKNFERYLESALALGGLAMRPYVDG-DK-IRVAFI 147 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~d-----~~~~~~l~~~l~~-----n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~v 147 (527) ++.+|+-+-+=|..+--.+ ......+..+|.. -....-++.++...+..|.+++.+..+. |+ ..+..+ T Consensus 56 i~~ia~~ia~l~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i 135 (429) T protein:vir:10 56 IKILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPI 135 (429) T ss_pred HHHHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEE Confidence 4455554444333321111 1111223333321 1122334556777788899999988875 33 356677 Q ss_pred cCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceee Q lcl|NC_019418. 148 QAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVN 227 (527) Q Consensus 148 ~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~ 227 (527) +|+++-+.. +..+.... +...||. + ..-|.... T Consensus 136 ~~~~v~v~~-~~~~~~~~-----------~~~~~~~------------------------------~-----~~~g~~~~ 168 (429) T protein:vir:10 136 DASKVTVYI-DDVGLLNS-----------KTKMWYV------------------------------V-----NTGGQQRV 168 (429) T ss_pred cCceeEEEE-cCcccccc-----------cceEEEE------------------------------E-----ccCCeEEE Confidence 887765532 22221110 0001111 0 00011111 Q ss_pred cccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcC-cceeeechhHh Q lcl|NC_019418. 228 LSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMG-QRRVIVPEQMT 306 (527) Q Consensus 228 l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~-~~~i~v~~~~l 306 (527) + ++ --..||+.+.+. +...|+|.+..+...+......-....+-++.| +++.++ T Consensus 169 ~-------~~---------~evih~~~~~~~----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il----- 223 (429) T protein:vir:10 169 L-------KP---------EEILHFKNGITL----DGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLV----- 223 (429) T ss_pred E-------cc---------ccEEEecCCCCC----CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEE----- Confidence 0 00 013456543221 234589999888887776655444444445553 333333 Q ss_pred cCCCCCCCc-ccccccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccc Q lcl|NC_019418. 307 QLKVQDNQG-NIAFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFD 380 (527) Q Consensus 307 ~~~~~~~~~-~~~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~ 380 (527) ......... .-.....| +..|.+.. .+ -.++..++.++......++.+..+....+|+...|++|..+|.. T Consensus 224 ~~~~~l~~e~~~~~~~~~---~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~ 300 (429) T protein:vir:10 224 QYVGDLNEDAKKVFRENF---ESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDL 300 (429) T ss_pred EcCCCCCHHHHHHHHHHH---HHHhccccccCceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC Confidence 111111000 00000111 11122110 00 01122355555555677888888888889999999999999865 Q ss_pred cccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccC-CcccCccceEEEeCCCccCCHHHHHHHH Q lcl|NC_019418. 381 GQGV-KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYR-GTIPELDDISVNLDDGVFTDRHAELDYW 458 (527) Q Consensus 381 ~~g~-~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~-~~~~~~~~v~v~f~d~i~~d~~~~~~~~ 458 (527) .++. .++.+.. ...++.+|..++..|....+. .++. ........+.++++.-+..|..+.++.. T Consensus 301 ~~~~~sn~e~~~-------------~~f~~~~l~P~~~~ie~~ln~-kl~~~~~~~~g~~~~fd~~~ll~~d~~~~~~~~ 366 (429) T protein:vir:10 301 SKATLNNIEQQQ-------------QQFYTDTLQATLTMYEQEMTY-KLFLDSELDKGFYSKFNVDAILRADIKTRYEAY 366 (429) T ss_pred CCCCcccHHHHH-------------HHHHHHHHHHHHHHHHHHHHH-hhcChhhcCCCcEEEeechhhhcCCHHHHHHHH Confidence 5432 2333321 112334444444443322211 1111 1112233456666666677899999999 Q ss_pred HHHHhcCCCCHHHHHHhcCCCCHH-HHHHHHHHHHHhcccccc-cccCCCCCCCCCCCCCCCCCCCccc Q lcl|NC_019418. 459 MKMVAAGFATQKRGIAKTLGITEE-EAEKELAEINGELPPESD-AELALYGKGQQNTVGNSKDTVDDED 525 (527) Q Consensus 459 ~~~~~aGi~s~~~~i~~~~~~~de-ea~~el~ri~~E~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (527) .+++.+|+|++-+++..+ |+..- ...+-+.. ....+-+. +...+. .++++. +..+++++.+ T Consensus 367 ~~~~~~G~~T~NE~R~~~-gl~p~~ggD~~~~~--~n~~~~d~~~~~~~k-~g~~~~--~~~~~~~e~~ 429 (429) T protein:vir:10 367 RTGIQGGFLKPNEARSKE-DLPPEAGGDRLLVN--GNMLPIDMAGQAYLK-GGDTNG--EVSKEGNEGN 429 (429) T ss_pred HHHHhCCCcCHHHHHHHh-CCCCCCCcCeeeec--ccccchhhccccccC-CCCCCC--CCCCCCCCCC Confidence 999999999999977553 54321 11111110 00000000 000011 111111 1111111111 No 107 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=99.07 E-value=6.2e-10 Score=71.05 Aligned_cols=495 Identities=12% Similarity=0.093 Sum_probs=217.3 Q ss_pred CChHHHH-----HHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc-CccccCceeecc Q lcl|NC_019418. 1 MSLIQKV-----KDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD-GDRKRRKMQHLP 74 (527) Q Consensus 1 m~~~~~~-----k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~-~~~~~~~~~~ln 74 (527) +.|=+++ .++.++.+.+.. .++.-.++-.....+..+||.|+ .|+.-... -+...+..++.| T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~R~~a~~d~~fy~G~--Qw~~~~~~~l~~~g~p~~~~N 73 (714) T protein:vir:32 6 NTMATKNDNGATPRFSQRQLQALC----------SDIDSQPKWRDAANKACAYYDGD--QLPPEVLQVLKDRGQPMTIHN 73 (714) T ss_pred ccccCCCCcchhHHHHHHHHHHHH----------HHHHhhHHHHHHHHHHHHhhcCC--CCCHHHHHHHHhcCCCcEEec Confidence 2222221 011111110000 01111222334444556789873 44211111 111235668889 Q ss_pred hHHHHHHHHhhhhhcccceEeeC----C-H--HHHHH----HHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC---- Q lcl|NC_019418. 75 IARTAAKKIASLVYNEQAEISAE----D-E--TLNDF----LSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG---- 139 (527) Q Consensus 75 l~~~i~~~~A~ll~~e~~~i~~~----d-~--~~~~~----l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~---- 139 (527) +=+.+|+...++--...+.+.+. + . ++++. +..+.+.+++......+...++..|-+|+.++++. T Consensus 74 ~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~ 153 (714) T protein:vir:32 74 LIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFG 153 (714) T ss_pred cHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCC Confidence 99999999998887777777652 1 1 24444 44556677888889999999999999999999963 Q ss_pred CeeEEEEEcCCceEEEEEc-CCceEEEEEEE--EEEeeCCCcceE----------------------E-----EEEE-EE Q lcl|NC_019418. 140 DKIRVAFIQAPVFLPLQSN-TQDVSSAAILT--KTIKTENRKNVY----------------------Y-----TLVE-FH 188 (527) Q Consensus 140 ~~~~i~~v~a~~~~P~~~d-~~~~~~~a~~~--~~~~~~~~~~~~----------------------y-----t~lE-~h 188 (527) +.++|.+|+|..+|.=... ......|-++. ++...+.-...| + .+.. +. T Consensus 154 ~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:32 154 PEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred CCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchh Confidence 4689999999998841100 01122222221 111100000000 0 0000 00 Q ss_pred eecccccccceeeecCCceEEEEEEE-ecC-----CccccCceeecccc-----------cCCcc-------cceeecC- Q lcl|NC_019418. 189 EWVTPTGQEVGSTKDKSLYRITNELY-KST-----SDSQLGERVNLSEL-----------YPDLQ-------PVTPIQG- 243 (527) Q Consensus 189 ~~~~~~~~~~~~~~~~~~~~I~n~ly-~~~-----~~~~lG~~v~l~~~-----------~~~l~-------~~~~~~g- 243 (527) ....+......+...+......++.| +.. -...-|..+.+... ...+. ....++| T Consensus 234 ~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~ 313 (714) T protein:vir:32 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGP 313 (714) T ss_pred hhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecC Confidence 00000000000000000000001111 100 00011211111100 00000 0001111 Q ss_pred -------CCcc--cEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCC Q lcl|NC_019418. 244 -------LSRP--LFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQ 314 (527) Q Consensus 244 -------~~~p--~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~ 314 (527) .+-| -|.|+|+-..-..-.+-|+| .+.++++.++.+|...|+..+-+ +..++++.+..+... +.+. T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G--~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~-d~~~ 388 (714) T protein:vir:32 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG--LISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLS-DNDL 388 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCceee--hhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCccccc-HHHH Confidence 0111 13333322221112234665 58999999999999999998865 333444433332110 0000 Q ss_pred cccccccccccccceeeecc-CCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHH Q lcl|NC_019418. 315 GNIAFKRRFDVEQNVYMQVG-AGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSE 393 (527) Q Consensus 315 ~~~~~~~~~d~~~~~~~~~~-~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~ 393 (527) .......+. --.|.+.. .+......|+...+.--..++...++.....|...+|++...+|..++ ..++.+|.++ T Consensus 389 --~e~~arp~~-vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~n-a~SGvAi~~r 464 (714) T protein:vir:32 389 --MEQIERPDG-IIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSG-ATSGVAISNL 464 (714) T ss_pred --HHhccCCCC-ceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCcc-chhHHHHHHH Confidence 000000010 00111110 011111234544433234567889988888899999999999987644 3456678777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh----c---ccC--Cc----------------------ccCccceEEE Q lcl|NC_019418. 394 NSDTYQMRNSIVALVEQSIKELCVSMCELGKVV----G---IYR--GT----------------------IPELDDISVN 442 (527) Q Consensus 394 ~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~----~---~~~--~~----------------------~~~~~~v~v~ 442 (527) ..........+...+..+.+.+.+.+|.+...+ . +-+ +. ....++|+|+ T Consensus 465 q~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~ 544 (714) T protein:vir:32 465 VEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALA 544 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEe Confidence 777666666677777777777777766654221 1 110 00 0112344444 Q ss_pred eCCCccCCHHHHHHHHHHHHhcCCCCHHH------HHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCC Q lcl|NC_019418. 443 LDDGVFTDRHAELDYWMKMVAAGFATQKR------GIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGN 516 (527) Q Consensus 443 f~d~i~~d~~~~~~~~~~~~~aGi~s~~~------~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 516 (527) =..+.+.-+++.++.++++.++ +++.. .+.++-++.. +.+.+++|++-.....+. ++.+.+.+ T Consensus 545 ~~p~~~t~r~~~~~~l~~l~~~--~~p~~~~~~~~~~l~~~d~p~--~~el~~~ir~~~~~~~~~------~~~~~e~q- 613 (714) T protein:vir:32 545 PVQQTPAFKAQLAQRMSEVIQG--LPPQVQAVVLDLWVNLLDVPQ--KQEFVERIRAALGTPKSP------DEMTPEEQ- 613 (714) T ss_pred eccCchHHHHHHHHHHHHHHhh--cCchhhhhHHHHHHHhcCCCC--HHHHHHHHHHHcCCCCCc------cccchhhH- Confidence 3443334445666666666643 34332 2334445532 445666776644322110 00000000 Q ss_pred CCCCCCccccC Q lcl|NC_019418. 517 SKDTVDDEDEA 527 (527) Q Consensus 517 ~~~~~~~~~~~ 527 (527) .-...-.+-+. T Consensus 614 ~~~~~~q~~~~ 624 (714) T protein:vir:32 614 EVAAQQQALQQ 624 (714) T ss_pred HHHHHHHHHHH Confidence 00000000000 No 108 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=99.07 E-value=6.2e-10 Score=71.05 Aligned_cols=495 Identities=12% Similarity=0.093 Sum_probs=217.3 Q ss_pred CChHHHH-----HHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc-CccccCceeecc Q lcl|NC_019418. 1 MSLIQKV-----KDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD-GDRKRRKMQHLP 74 (527) Q Consensus 1 m~~~~~~-----k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~-~~~~~~~~~~ln 74 (527) +.|=+++ .++.++.+.+.. .++.-.++-.....+..+||.|+ .|+.-... -+...+..++.| T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~R~~a~~d~~fy~G~--Qw~~~~~~~l~~~g~p~~~~N 73 (714) T protein:vir:99 6 NTMATKNDNGATPRFSQRQLQALC----------SDIDSQPKWRDAANKACAYYDGD--QLPPEVLQVLKDRGQPMTIHN 73 (714) T ss_pred ccccCCCCcchhHHHHHHHHHHHH----------HHHHhhHHHHHHHHHHHHhhcCC--CCCHHHHHHHHhcCCCcEEec Confidence 2222221 011111110000 01111222334444556789873 44211111 111235668889 Q ss_pred hHHHHHHHHhhhhhcccceEeeC----C-H--HHHHH----HHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC---- Q lcl|NC_019418. 75 IARTAAKKIASLVYNEQAEISAE----D-E--TLNDF----LSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG---- 139 (527) Q Consensus 75 l~~~i~~~~A~ll~~e~~~i~~~----d-~--~~~~~----l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~---- 139 (527) +=+.+|+...++--...+.+.+. + . ++++. +..+.+.+++......+...++..|-+|+.++++. T Consensus 74 ~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~ 153 (714) T protein:vir:99 74 LIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFG 153 (714) T ss_pred cHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCC Confidence 99999999998887777777652 1 1 24444 44556677888889999999999999999999963 Q ss_pred CeeEEEEEcCCceEEEEEc-CCceEEEEEEE--EEEeeCCCcceE----------------------E-----EEEE-EE Q lcl|NC_019418. 140 DKIRVAFIQAPVFLPLQSN-TQDVSSAAILT--KTIKTENRKNVY----------------------Y-----TLVE-FH 188 (527) Q Consensus 140 ~~~~i~~v~a~~~~P~~~d-~~~~~~~a~~~--~~~~~~~~~~~~----------------------y-----t~lE-~h 188 (527) +.++|.+|+|..+|.=... ......|-++. ++...+.-...| + .+.. +. T Consensus 154 ~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:99 154 PEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred CCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchh Confidence 4689999999998841100 01122222221 111100000000 0 0000 00 Q ss_pred eecccccccceeeecCCceEEEEEEE-ecC-----CccccCceeecccc-----------cCCcc-------cceeecC- Q lcl|NC_019418. 189 EWVTPTGQEVGSTKDKSLYRITNELY-KST-----SDSQLGERVNLSEL-----------YPDLQ-------PVTPIQG- 243 (527) Q Consensus 189 ~~~~~~~~~~~~~~~~~~~~I~n~ly-~~~-----~~~~lG~~v~l~~~-----------~~~l~-------~~~~~~g- 243 (527) ....+......+...+......++.| +.. -...-|..+.+... ...+. ....++| T Consensus 234 ~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~ 313 (714) T protein:vir:99 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGP 313 (714) T ss_pred hhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecC Confidence 00000000000000000000001111 100 00011211111100 00000 0001111 Q ss_pred -------CCcc--cEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCC Q lcl|NC_019418. 244 -------LSRP--LFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQ 314 (527) Q Consensus 244 -------~~~p--~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~ 314 (527) .+-| -|.|+|+-..-..-.+-|+| .+.++++.++.+|...|+..+-+ +..++++.+..+... +.+. T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G--~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~-d~~~ 388 (714) T protein:vir:99 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG--LISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLS-DNDL 388 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCceee--hhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCccccc-HHHH Confidence 0111 13333322221112234665 58999999999999999998865 333444433332110 0000 Q ss_pred cccccccccccccceeeecc-CCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHH Q lcl|NC_019418. 315 GNIAFKRRFDVEQNVYMQVG-AGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSE 393 (527) Q Consensus 315 ~~~~~~~~~d~~~~~~~~~~-~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~ 393 (527) .......+. --.|.+.. .+......|+...+.--..++...++.....|...+|++...+|..++ ..++.+|.++ T Consensus 389 --~e~~arp~~-vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~n-a~SGvAi~~r 464 (714) T protein:vir:99 389 --MEQIERPDG-IIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSG-ATSGVAISNL 464 (714) T ss_pred --HHhccCCCC-ceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCcc-chhHHHHHHH Confidence 000000010 00111110 011111234544433234567889988888899999999999987644 3456678777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh----c---ccC--Cc----------------------ccCccceEEE Q lcl|NC_019418. 394 NSDTYQMRNSIVALVEQSIKELCVSMCELGKVV----G---IYR--GT----------------------IPELDDISVN 442 (527) Q Consensus 394 ~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~----~---~~~--~~----------------------~~~~~~v~v~ 442 (527) ..........+...+..+.+.+.+.+|.+...+ . +-+ +. ....++|+|+ T Consensus 465 q~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~ 544 (714) T protein:vir:99 465 VEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALA 544 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEe Confidence 777666666677777777777777766654221 1 110 00 0112344444 Q ss_pred eCCCccCCHHHHHHHHHHHHhcCCCCHHH------HHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCC Q lcl|NC_019418. 443 LDDGVFTDRHAELDYWMKMVAAGFATQKR------GIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGN 516 (527) Q Consensus 443 f~d~i~~d~~~~~~~~~~~~~aGi~s~~~------~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 516 (527) =..+.+.-+++.++.++++.++ +++.. .+.++-++.. +.+.+++|++-.....+. ++.+.+.+ T Consensus 545 ~~p~~~t~r~~~~~~l~~l~~~--~~p~~~~~~~~~~l~~~d~p~--~~el~~~ir~~~~~~~~~------~~~~~e~q- 613 (714) T protein:vir:99 545 PVQQTPAFKAQLAQRMSEVIQG--LPPQVQAVVLDLWVNLLDVPQ--KQEFVERIRAALGTPKSP------DEMTPEEQ- 613 (714) T ss_pred eccCchHHHHHHHHHHHHHHhh--cCchhhhhHHHHHHHhcCCCC--HHHHHHHHHHHcCCCCCc------cccchhhH- Confidence 3443334445666666666643 34332 2334445532 445666776644322110 00000000 Q ss_pred CCCCCCccccC Q lcl|NC_019418. 517 SKDTVDDEDEA 527 (527) Q Consensus 517 ~~~~~~~~~~~ 527 (527) .-...-.+-+. T Consensus 614 ~~~~~~q~~~~ 624 (714) T protein:vir:99 614 EVAAQQQALQQ 624 (714) T ss_pred HHHHHHHHHHH Confidence 00000000000 No 109 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=99.07 E-value=6.2e-10 Score=71.05 Aligned_cols=495 Identities=12% Similarity=0.093 Sum_probs=217.3 Q ss_pred CChHHHH-----HHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc-CccccCceeecc Q lcl|NC_019418. 1 MSLIQKV-----KDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD-GDRKRRKMQHLP 74 (527) Q Consensus 1 m~~~~~~-----k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~-~~~~~~~~~~ln 74 (527) +.|=+++ .++.++.+.+.. .++.-.++-.....+..+||.|+ .|+.-... -+...+..++.| T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~R~~a~~d~~fy~G~--Qw~~~~~~~l~~~g~p~~~~N 73 (714) T protein:vir:27 6 NTMATKNDNGATPRFSQRQLQALC----------SDIDSQPKWRDAANKACAYYDGD--QLPPEVLQVLKDRGQPMTIHN 73 (714) T ss_pred ccccCCCCcchhHHHHHHHHHHHH----------HHHHhhHHHHHHHHHHHHhhcCC--CCCHHHHHHHHhcCCCcEEec Confidence 2222221 011111110000 01111222334444556789873 44211111 111235668889 Q ss_pred hHHHHHHHHhhhhhcccceEeeC----C-H--HHHHH----HHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC---- Q lcl|NC_019418. 75 IARTAAKKIASLVYNEQAEISAE----D-E--TLNDF----LSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG---- 139 (527) Q Consensus 75 l~~~i~~~~A~ll~~e~~~i~~~----d-~--~~~~~----l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~---- 139 (527) +=+.+|+...++--...+.+.+. + . ++++. +..+.+.+++......+...++..|-+|+.++++. T Consensus 74 ~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~ 153 (714) T protein:vir:27 74 LIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFG 153 (714) T ss_pred cHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCC Confidence 99999999998887777777652 1 1 24444 44556677888889999999999999999999963 Q ss_pred CeeEEEEEcCCceEEEEEc-CCceEEEEEEE--EEEeeCCCcceE----------------------E-----EEEE-EE Q lcl|NC_019418. 140 DKIRVAFIQAPVFLPLQSN-TQDVSSAAILT--KTIKTENRKNVY----------------------Y-----TLVE-FH 188 (527) Q Consensus 140 ~~~~i~~v~a~~~~P~~~d-~~~~~~~a~~~--~~~~~~~~~~~~----------------------y-----t~lE-~h 188 (527) +.++|.+|+|..+|.=... ......|-++. ++...+.-...| + .+.. +. T Consensus 154 ~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:27 154 PEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred CCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchh Confidence 4689999999998841100 01122222221 111100000000 0 0000 00 Q ss_pred eecccccccceeeecCCceEEEEEEE-ecC-----CccccCceeecccc-----------cCCcc-------cceeecC- Q lcl|NC_019418. 189 EWVTPTGQEVGSTKDKSLYRITNELY-KST-----SDSQLGERVNLSEL-----------YPDLQ-------PVTPIQG- 243 (527) Q Consensus 189 ~~~~~~~~~~~~~~~~~~~~I~n~ly-~~~-----~~~~lG~~v~l~~~-----------~~~l~-------~~~~~~g- 243 (527) ....+......+...+......++.| +.. -...-|..+.+... ...+. ....++| T Consensus 234 ~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~ 313 (714) T protein:vir:27 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGP 313 (714) T ss_pred hhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecC Confidence 00000000000000000000001111 100 00011211111100 00000 0001111 Q ss_pred -------CCcc--cEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCC Q lcl|NC_019418. 244 -------LSRP--LFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQ 314 (527) Q Consensus 244 -------~~~p--~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~ 314 (527) .+-| -|.|+|+-..-..-.+-|+| .+.++++.++.+|...|+..+-+ +..++++.+..+... +.+. T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G--~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~-d~~~ 388 (714) T protein:vir:27 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG--LISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLS-DNDL 388 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCceee--hhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCccccc-HHHH Confidence 0111 13333322221112234665 58999999999999999998865 333444433332110 0000 Q ss_pred cccccccccccccceeeecc-CCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHH Q lcl|NC_019418. 315 GNIAFKRRFDVEQNVYMQVG-AGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSE 393 (527) Q Consensus 315 ~~~~~~~~~d~~~~~~~~~~-~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~ 393 (527) .......+. --.|.+.. .+......|+...+.--..++...++.....|...+|++...+|..++ ..++.+|.++ T Consensus 389 --~e~~arp~~-vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~n-a~SGvAi~~r 464 (714) T protein:vir:27 389 --MEQIERPDG-IIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSG-ATSGVAISNL 464 (714) T ss_pred --HHhccCCCC-ceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCcc-chhHHHHHHH Confidence 000000010 00111110 011111234544433234567889988888899999999999987644 3456678777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh----c---ccC--Cc----------------------ccCccceEEE Q lcl|NC_019418. 394 NSDTYQMRNSIVALVEQSIKELCVSMCELGKVV----G---IYR--GT----------------------IPELDDISVN 442 (527) Q Consensus 394 ~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~----~---~~~--~~----------------------~~~~~~v~v~ 442 (527) ..........+...+..+.+.+.+.+|.+...+ . +-+ +. ....++|+|+ T Consensus 465 q~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~ 544 (714) T protein:vir:27 465 VEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALA 544 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEe Confidence 777666666677777777777777766654221 1 110 00 0112344444 Q ss_pred eCCCccCCHHHHHHHHHHHHhcCCCCHHH------HHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCC Q lcl|NC_019418. 443 LDDGVFTDRHAELDYWMKMVAAGFATQKR------GIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGN 516 (527) Q Consensus 443 f~d~i~~d~~~~~~~~~~~~~aGi~s~~~------~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 516 (527) =..+.+.-+++.++.++++.++ +++.. .+.++-++.. +.+.+++|++-.....+. ++.+.+.+ T Consensus 545 ~~p~~~t~r~~~~~~l~~l~~~--~~p~~~~~~~~~~l~~~d~p~--~~el~~~ir~~~~~~~~~------~~~~~e~q- 613 (714) T protein:vir:27 545 PVQQTPAFKAQLAQRMSEVIQG--LPPQVQAVVLDLWVNLLDVPQ--KQEFVERIRAALGTPKSP------DEMTPEEQ- 613 (714) T ss_pred eccCchHHHHHHHHHHHHHHhh--cCchhhhhHHHHHHHhcCCCC--HHHHHHHHHHHcCCCCCc------cccchhhH- Confidence 3443334445666666666643 34332 2334445532 445666776644322110 00000000 Q ss_pred CCCCCCccccC Q lcl|NC_019418. 517 SKDTVDDEDEA 527 (527) Q Consensus 517 ~~~~~~~~~~~ 527 (527) .-...-.+-+. T Consensus 614 ~~~~~~q~~~~ 624 (714) T protein:vir:27 614 EVAAQQQALQQ 624 (714) T ss_pred HHHHHHHHHHH Confidence 00000000000 No 110 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=99.07 E-value=6.2e-10 Score=71.05 Aligned_cols=495 Identities=12% Similarity=0.093 Sum_probs=217.3 Q ss_pred CChHHHH-----HHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc-CccccCceeecc Q lcl|NC_019418. 1 MSLIQKV-----KDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD-GDRKRRKMQHLP 74 (527) Q Consensus 1 m~~~~~~-----k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~-~~~~~~~~~~ln 74 (527) +.|=+++ .++.++.+.+.. .++.-.++-.....+..+||.|+ .|+.-... -+...+..++.| T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~R~~a~~d~~fy~G~--Qw~~~~~~~l~~~g~p~~~~N 73 (714) T protein:vir:81 6 NTMATKNDNGATPRFSQRQLQALC----------SDIDSQPKWRDAANKACAYYDGD--QLPPEVLQVLKDRGQPMTIHN 73 (714) T ss_pred ccccCCCCcchhHHHHHHHHHHHH----------HHHHhhHHHHHHHHHHHHhhcCC--CCCHHHHHHHHhcCCCcEEec Confidence 2222221 011111110000 01111222334444556789873 44211111 111235668889 Q ss_pred hHHHHHHHHhhhhhcccceEeeC----C-H--HHHHH----HHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC---- Q lcl|NC_019418. 75 IARTAAKKIASLVYNEQAEISAE----D-E--TLNDF----LSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG---- 139 (527) Q Consensus 75 l~~~i~~~~A~ll~~e~~~i~~~----d-~--~~~~~----l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~---- 139 (527) +=+.+|+...++--...+.+.+. + . ++++. +..+.+.+++......+...++..|-+|+.++++. T Consensus 74 ~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~ 153 (714) T protein:vir:81 74 LIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFG 153 (714) T ss_pred cHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCC Confidence 99999999998887777777652 1 1 24444 44556677888889999999999999999999963 Q ss_pred CeeEEEEEcCCceEEEEEc-CCceEEEEEEE--EEEeeCCCcceE----------------------E-----EEEE-EE Q lcl|NC_019418. 140 DKIRVAFIQAPVFLPLQSN-TQDVSSAAILT--KTIKTENRKNVY----------------------Y-----TLVE-FH 188 (527) Q Consensus 140 ~~~~i~~v~a~~~~P~~~d-~~~~~~~a~~~--~~~~~~~~~~~~----------------------y-----t~lE-~h 188 (527) +.++|.+|+|..+|.=... ......|-++. ++...+.-...| + .+.. +. T Consensus 154 ~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:81 154 PEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred CCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchh Confidence 4689999999998841100 01122222221 111100000000 0 0000 00 Q ss_pred eecccccccceeeecCCceEEEEEEE-ecC-----CccccCceeecccc-----------cCCcc-------cceeecC- Q lcl|NC_019418. 189 EWVTPTGQEVGSTKDKSLYRITNELY-KST-----SDSQLGERVNLSEL-----------YPDLQ-------PVTPIQG- 243 (527) Q Consensus 189 ~~~~~~~~~~~~~~~~~~~~I~n~ly-~~~-----~~~~lG~~v~l~~~-----------~~~l~-------~~~~~~g- 243 (527) ....+......+...+......++.| +.. -...-|..+.+... ...+. ....++| T Consensus 234 ~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~ 313 (714) T protein:vir:81 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGP 313 (714) T ss_pred hhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecC Confidence 00000000000000000000001111 100 00011211111100 00000 0001111 Q ss_pred -------CCcc--cEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCC Q lcl|NC_019418. 244 -------LSRP--LFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQ 314 (527) Q Consensus 244 -------~~~p--~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~ 314 (527) .+-| -|.|+|+-..-..-.+-|+| .+.++++.++.+|...|+..+-+ +..++++.+..+... +.+. T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G--~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~-d~~~ 388 (714) T protein:vir:81 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG--LISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLS-DNDL 388 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCceee--hhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCccccc-HHHH Confidence 0111 13333322221112234665 58999999999999999998865 333444433332110 0000 Q ss_pred cccccccccccccceeeecc-CCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHH Q lcl|NC_019418. 315 GNIAFKRRFDVEQNVYMQVG-AGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSE 393 (527) Q Consensus 315 ~~~~~~~~~d~~~~~~~~~~-~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~ 393 (527) .......+. --.|.+.. .+......|+...+.--..++...++.....|...+|++...+|..++ ..++.+|.++ T Consensus 389 --~e~~arp~~-vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~n-a~SGvAi~~r 464 (714) T protein:vir:81 389 --MEQIERPDG-IIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSG-ATSGVAISNL 464 (714) T ss_pred --HHhccCCCC-ceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCcc-chhHHHHHHH Confidence 000000010 00111110 011111234544433234567889988888899999999999987644 3456678777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh----c---ccC--Cc----------------------ccCccceEEE Q lcl|NC_019418. 394 NSDTYQMRNSIVALVEQSIKELCVSMCELGKVV----G---IYR--GT----------------------IPELDDISVN 442 (527) Q Consensus 394 ~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~----~---~~~--~~----------------------~~~~~~v~v~ 442 (527) ..........+...+..+.+.+.+.+|.+...+ . +-+ +. ....++|+|+ T Consensus 465 q~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~ 544 (714) T protein:vir:81 465 VEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALA 544 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEe Confidence 777666666677777777777777766654221 1 110 00 0112344444 Q ss_pred eCCCccCCHHHHHHHHHHHHhcCCCCHHH------HHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCC Q lcl|NC_019418. 443 LDDGVFTDRHAELDYWMKMVAAGFATQKR------GIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGN 516 (527) Q Consensus 443 f~d~i~~d~~~~~~~~~~~~~aGi~s~~~------~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 516 (527) =..+.+.-+++.++.++++.++ +++.. .+.++-++.. +.+.+++|++-.....+. ++.+.+.+ T Consensus 545 ~~p~~~t~r~~~~~~l~~l~~~--~~p~~~~~~~~~~l~~~d~p~--~~el~~~ir~~~~~~~~~------~~~~~e~q- 613 (714) T protein:vir:81 545 PVQQTPAFKAQLAQRMSEVIQG--LPPQVQAVVLDLWVNLLDVPQ--KQEFVERIRAALGTPKSP------DEMTPEEQ- 613 (714) T ss_pred eccCchHHHHHHHHHHHHHHhh--cCchhhhhHHHHHHHhcCCCC--HHHHHHHHHHHcCCCCCc------cccchhhH- Confidence 3443334445666666666643 34332 2334445532 445666776644322110 00000000 Q ss_pred CCCCCCccccC Q lcl|NC_019418. 517 SKDTVDDEDEA 527 (527) Q Consensus 517 ~~~~~~~~~~~ 527 (527) .-...-.+-+. T Consensus 614 ~~~~~~q~~~~ 624 (714) T protein:vir:81 614 EVAAQQQALQQ 624 (714) T ss_pred HHHHHHHHHHH Confidence 00000000000 No 111 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=99.07 E-value=6.2e-10 Score=71.05 Aligned_cols=495 Identities=12% Similarity=0.093 Sum_probs=217.3 Q ss_pred CChHHHH-----HHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc-CccccCceeecc Q lcl|NC_019418. 1 MSLIQKV-----KDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD-GDRKRRKMQHLP 74 (527) Q Consensus 1 m~~~~~~-----k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~-~~~~~~~~~~ln 74 (527) +.|=+++ .++.++.+.+.. .++.-.++-.....+..+||.|+ .|+.-... -+...+..++.| T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~R~~a~~d~~fy~G~--Qw~~~~~~~l~~~g~p~~~~N 73 (714) T protein:vir:10 6 NTMATKNDNGATPRFSQRQLQALC----------SDIDSQPKWRDAANKACAYYDGD--QLPPEVLQVLKDRGQPMTIHN 73 (714) T ss_pred ccccCCCCcchhHHHHHHHHHHHH----------HHHHhhHHHHHHHHHHHHhhcCC--CCCHHHHHHHHhcCCCcEEec Confidence 2222221 011111110000 01111222334444556789873 44211111 111235668889 Q ss_pred hHHHHHHHHhhhhhcccceEeeC----C-H--HHHHH----HHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC---- Q lcl|NC_019418. 75 IARTAAKKIASLVYNEQAEISAE----D-E--TLNDF----LSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG---- 139 (527) Q Consensus 75 l~~~i~~~~A~ll~~e~~~i~~~----d-~--~~~~~----l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~---- 139 (527) +=+.+|+...++--...+.+.+. + . ++++. +..+.+.+++......+...++..|-+|+.++++. T Consensus 74 ~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~ 153 (714) T protein:vir:10 74 LIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFG 153 (714) T ss_pred cHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCC Confidence 99999999998887777777652 1 1 24444 44556677888889999999999999999999963 Q ss_pred CeeEEEEEcCCceEEEEEc-CCceEEEEEEE--EEEeeCCCcceE----------------------E-----EEEE-EE Q lcl|NC_019418. 140 DKIRVAFIQAPVFLPLQSN-TQDVSSAAILT--KTIKTENRKNVY----------------------Y-----TLVE-FH 188 (527) Q Consensus 140 ~~~~i~~v~a~~~~P~~~d-~~~~~~~a~~~--~~~~~~~~~~~~----------------------y-----t~lE-~h 188 (527) +.++|.+|+|..+|.=... ......|-++. ++...+.-...| + .+.. +. T Consensus 154 ~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:10 154 PEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred CCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchh Confidence 4689999999998841100 01122222221 111100000000 0 0000 00 Q ss_pred eecccccccceeeecCCceEEEEEEE-ecC-----CccccCceeecccc-----------cCCcc-------cceeecC- Q lcl|NC_019418. 189 EWVTPTGQEVGSTKDKSLYRITNELY-KST-----SDSQLGERVNLSEL-----------YPDLQ-------PVTPIQG- 243 (527) Q Consensus 189 ~~~~~~~~~~~~~~~~~~~~I~n~ly-~~~-----~~~~lG~~v~l~~~-----------~~~l~-------~~~~~~g- 243 (527) ....+......+...+......++.| +.. -...-|..+.+... ...+. ....++| T Consensus 234 ~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~ 313 (714) T protein:vir:10 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGP 313 (714) T ss_pred hhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecC Confidence 00000000000000000000001111 100 00011211111100 00000 0001111 Q ss_pred -------CCcc--cEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCC Q lcl|NC_019418. 244 -------LSRP--LFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQ 314 (527) Q Consensus 244 -------~~~p--~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~ 314 (527) .+-| -|.|+|+-..-..-.+-|+| .+.++++.++.+|...|+..+-+ +..++++.+..+... +.+. T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G--~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~-d~~~ 388 (714) T protein:vir:10 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG--LISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLS-DNDL 388 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCceee--hhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCccccc-HHHH Confidence 0111 13333322221112234665 58999999999999999998865 333444433332110 0000 Q ss_pred cccccccccccccceeeecc-CCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHH Q lcl|NC_019418. 315 GNIAFKRRFDVEQNVYMQVG-AGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSE 393 (527) Q Consensus 315 ~~~~~~~~~d~~~~~~~~~~-~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~ 393 (527) .......+. --.|.+.. .+......|+...+.--..++...++.....|...+|++...+|..++ ..++.+|.++ T Consensus 389 --~e~~arp~~-vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~n-a~SGvAi~~r 464 (714) T protein:vir:10 389 --MEQIERPDG-IIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSG-ATSGVAISNL 464 (714) T ss_pred --HHhccCCCC-ceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCcc-chhHHHHHHH Confidence 000000010 00111110 011111234544433234567889988888899999999999987644 3456678777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh----c---ccC--Cc----------------------ccCccceEEE Q lcl|NC_019418. 394 NSDTYQMRNSIVALVEQSIKELCVSMCELGKVV----G---IYR--GT----------------------IPELDDISVN 442 (527) Q Consensus 394 ~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~----~---~~~--~~----------------------~~~~~~v~v~ 442 (527) ..........+...+..+.+.+.+.+|.+...+ . +-+ +. ....++|+|+ T Consensus 465 q~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~ 544 (714) T protein:vir:10 465 VEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALA 544 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEe Confidence 777666666677777777777777766654221 1 110 00 0112344444 Q ss_pred eCCCccCCHHHHHHHHHHHHhcCCCCHHH------HHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCC Q lcl|NC_019418. 443 LDDGVFTDRHAELDYWMKMVAAGFATQKR------GIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGN 516 (527) Q Consensus 443 f~d~i~~d~~~~~~~~~~~~~aGi~s~~~------~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 516 (527) =..+.+.-+++.++.++++.++ +++.. .+.++-++.. +.+.+++|++-.....+. ++.+.+.+ T Consensus 545 ~~p~~~t~r~~~~~~l~~l~~~--~~p~~~~~~~~~~l~~~d~p~--~~el~~~ir~~~~~~~~~------~~~~~e~q- 613 (714) T protein:vir:10 545 PVQQTPAFKAQLAQRMSEVIQG--LPPQVQAVVLDLWVNLLDVPQ--KQEFVERIRAALGTPKSP------DEMTPEEQ- 613 (714) T ss_pred eccCchHHHHHHHHHHHHHHhh--cCchhhhhHHHHHHHhcCCCC--HHHHHHHHHHHcCCCCCc------cccchhhH- Confidence 3443334445666666666643 34332 2334445532 445666776644322110 00000000 Q ss_pred CCCCCCccccC Q lcl|NC_019418. 517 SKDTVDDEDEA 527 (527) Q Consensus 517 ~~~~~~~~~~~ 527 (527) .-...-.+-+. T Consensus 614 ~~~~~~q~~~~ 624 (714) T protein:vir:10 614 EVAAQQQALQQ 624 (714) T ss_pred HHHHHHHHHHH Confidence 00000000000 No 112 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=99.07 E-value=1.3e-09 Score=69.32 Aligned_cols=419 Identities=12% Similarity=0.085 Sum_probs=170.5 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCcc-ccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHH-- Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKV-AVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIAR-- 77 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~-- 77 (527) ||+|++++.+.++-. +...+. ..++ +..+.-.+-+ ....|..-. ....+..|. T Consensus 1 Mg~~~~l~~r~~~~~-----------~~~~~~~~~~~-----~~~~~~~~~~-------~~~~g~~V~-~~~al~~~~V~ 56 (457) T protein:vir:13 1 MGFWSALFGRGHSPA-----------LDGIEARAWEP-----YDPSIYNLGA-------VAASGETVT-PHDALQVSAVF 56 (457) T ss_pred Cchhhhhhccccccc-----------ccccccccccc-----cchHHHhhcc-------cccCCceec-hHHhhccHHHH Confidence 999887766543211 000000 0010 0001000101 011111100 011222222 Q ss_pred HHHHHHhhhhhcccceEee-CC----HHHHHHHHHHHhh----hhHHHHHHHHHHHHHhcCCEEEEEEEeCCee-EEEEE Q lcl|NC_019418. 78 TAAKKIASLVYNEQAEISA-ED----ETLNDFLSDMLSN----DRFNKNFERYLESALALGGLAMRPYVDGDKI-RVAFI 147 (527) Q Consensus 78 ~i~~~~A~ll~~e~~~i~~-~d----~~~~~~l~~~l~~----n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~-~i~~v 147 (527) .+++.+|+-+-+-|..+-- .+ ......|-.++.. -.....++..+...+..|.+++.+..+++++ .+..+ T Consensus 57 ~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~~g~~~~l~~l 136 (457) T protein:vir:13 57 ASVRLLSETIATLPLSTYSKRGGSRKEIVTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQGPNIVGLDVL 136 (457) T ss_pred HHHHHHHHhhccCceEEEEecCCcccccccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEEEEEE Confidence 3344444444443433311 11 1112223333331 1123445556677778899998887776654 45666 Q ss_pred cCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceee Q lcl|NC_019418. 148 QAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVN 227 (527) Q Consensus 148 ~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~ 227 (527) +|+++-+.....++.. ...|+ . |.+. . -|..+- T Consensus 137 ~p~~v~v~~~~~~~~~--------------~~~~~-~----------------------y~~~-----~-----~~~~~~ 169 (457) T protein:vir:13 137 DPTKIHVHMVMVDGLR--------------RKVFE-A----------------------YDID-----A-----DGNEVL 169 (457) T ss_pred ccCceEEEEecCCCcc--------------ceeEE-E----------------------EEEe-----c-----CCceee Confidence 6666554321111110 00111 0 0000 0 000000 Q ss_pred cccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhc Q lcl|NC_019418. 228 LSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQ 307 (527) Q Consensus 228 l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~ 307 (527) +.... +. -+.|++.+.++. ..+|+|.+.-+...|.....+-.-..+-|..|.. |..++. T Consensus 170 ~~~~~---~~----------diih~~~~~~~~----~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~----p~gil~ 228 (457) T protein:vir:13 170 LGWFT---PR----------DVLHIPGMMLPG----DFVGCSPISYARESIGLALAAQKYGSKFFANGAM----PGAVVE 228 (457) T ss_pred EEeeC---cc----------ceEEecCCCCCC----ccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCC----cceEEE Confidence 00000 00 134555443332 2368898887777666544433333333454322 222222 Q ss_pred CCCCCCCcc-cccccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccc Q lcl|NC_019418. 308 LKVQDNQGN-IAFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDG 381 (527) Q Consensus 308 ~~~~~~~~~-~~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~ 381 (527) ......... -.....| +..|.+.+ .+ -.+...++.++......++.+..+....+|+...|++|..+|... T Consensus 229 ~~~~ls~e~~~~~~~~~---~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~ 305 (457) T protein:vir:13 229 VPGTMSEEGLARAREAW---RAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDAT 305 (457) T ss_pred cCCCCCHHHHHHHHHHH---HHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC Confidence 211111000 0000011 11111110 00 012234566666666777888888888899999999999998766 Q ss_pred cccchHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHH Q lcl|NC_019418. 382 QGVKTATEIVSENSDT-YQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMK 460 (527) Q Consensus 382 ~g~~TAtei~s~~~~~-~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~ 460 (527) ++..++..+....... ..++.-+...++.+|.. .+..........+.+++++-+..|..+.++...+ T Consensus 306 ~~~~~~sn~eq~~~~f~~~tl~P~~~~ie~~ln~------------~L~~~~~~~~~~i~fd~~~l~~~D~~~r~~~~~~ 373 (457) T protein:vir:13 306 NSTSWGSGLAEQNIAFTMFSLRPWLERIEAGFNR------------LLFAETADRFRFVKFNLDEIKRGAPKERMELWSL 373 (457) T ss_pred CcccccchHHHHHHHHHHHHHHHHHHHHHHHHHH------------hhcCccccCceeEEeechhhhccCHHHHHHHHHH Confidence 5543322221111111 12222222222222221 1111111123346667777777898999999999 Q ss_pred HHhcCCCCHHHHHHhcCCCC---HHHHHHHHH-----HHHH----hc--ccccccccCCCCC----CCCCCCCCCCCCCC Q lcl|NC_019418. 461 MVAAGFATQKRGIAKTLGIT---EEEAEKELA-----EING----EL--PPESDAELALYGK----GQQNTVGNSKDTVD 522 (527) Q Consensus 461 ~~~aGi~s~~~~i~~~~~~~---deea~~el~-----ri~~----E~--~~~~~~~~~~~~~----~~~~~~~~~~~~~~ 522 (527) ++.+|+|++-+++... |++ +..+.+.+. .+.+ +. .+.+...+.-..+ ....++.+...... T Consensus 374 ~~~~G~~T~NE~R~~~-gl~Pi~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~d~~~~~~~~ 452 (457) T protein:vir:13 374 GLQNGIYSIDEVRAAE-DMTPLPDGLGEKYRVPLNLGEVGEEPEPEPAPAPPAIEPPAEEPDEEPEPEGKPDDEGATEED 452 (457) T ss_pred HHhCCCcCHHHHHHHh-CCCCCCCCcccceeeccccccccccccccccCCCCCCCCCccccCCCCCCCCCCccccCCCCc Confidence 9999999999976553 442 221121111 1111 00 1111111111011 11111111222333 Q ss_pred ccccC Q lcl|NC_019418. 523 DEDEA 527 (527) Q Consensus 523 ~~~~~ 527 (527) ++||| T Consensus 453 ~~~~~ 457 (457) T protein:vir:13 453 DEDDA 457 (457) T ss_pred ccccC Confidence 44444 No 113 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=99.06 E-value=3.2e-09 Score=67.17 Aligned_cols=494 Identities=9% Similarity=0.061 Sum_probs=211.1 Q ss_pred CChHHHHHHHHHHHH----HHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc-CccccCceeecch Q lcl|NC_019418. 1 MSLIQKVKDFFNRGR----YNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD-GDRKRRKMQHLPI 75 (527) Q Consensus 1 m~~~~~~k~~~~~~~----~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~-~~~~~~~~~~lnl 75 (527) |-+-+.....+.+-- .++....+..+. .++.-.++.+....+..+||.|+ .|..-... -+-..+..++.|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~q~~~r~~a~~d~~fy~G~--QW~~~~~~~l~~~g~p~~~~N~ 76 (772) T protein:vir:10 1 MQITENDRQYLNGLPPAGDTPLTVDEYADIN--YEIEDQPAWRAVADKEMDYADGN--QLDTELLRRQQALGIPPAVEDL 76 (772) T ss_pred CCcchhhHHhhccCCcccccccCHHHHHHHH--HHHhccHHHHHHHHHHHHhhcCC--CCCHHHHHHHHhcCCCcEEEcc Confidence 443333333322100 000000000000 01112333444444556688875 34211111 1112355688899 Q ss_pred HHHHHHHHhhhhhcccceEeeC------CHHHHHHH----HHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC----Ce Q lcl|NC_019418. 76 ARTAAKKIASLVYNEQAEISAE------DETLNDFL----SDMLSNDRFNKNFERYLESALALGGLAMRPYVDG----DK 141 (527) Q Consensus 76 ~~~i~~~~A~ll~~e~~~i~~~------d~~~~~~l----~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~----~~ 141 (527) =+.+|+...++--...+.+.+. +..+++.| ..+.+.+++......+...++..|-+|+.++++. +. T Consensus 77 i~~~v~~v~g~~~~nr~d~~v~Pr~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~d~~~~~ 156 (772) T protein:vir:10 77 IGPALLSLQGYEAVTRTDWRVTPNGDVGGQEVADALNYRLNTAERQSGADRACSEAFRPQIACGIGWVEVSRESDPFKFP 156 (772) T ss_pred hHHHHHHHHHHHHhcCcceEEecCCCchHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhcCceeEEeccccCCCCCC Confidence 9999999999888877777652 23344444 4555678888999999999999999999999863 35 Q ss_pred eEEEEEcCCceEEEEEcCC-ceEEE--EEEEEEEeeC-----------------------------------CCcc---- Q lcl|NC_019418. 142 IRVAFIQAPVFLPLQSNTQ-DVSSA--AILTKTIKTE-----------------------------------NRKN---- 179 (527) Q Consensus 142 ~~i~~v~a~~~~P~~~d~~-~~~~~--a~~~~~~~~~-----------------------------------~~~~---- 179 (527) ++|.+|++..+|.= .... ....| +|..+++..+ .... T Consensus 157 i~i~~v~p~~v~~D-p~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (772) T protein:vir:10 157 YRCRPIRRDEIHWD-MKCGDDWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEGGTSTGLHNA 235 (772) T ss_pred eEEEeeCcccceec-CCCCCCHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCcccccccccccccccccc Confidence 88999999988841 1111 11222 1111111000 0000 Q ss_pred -------------------eEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCc-------eeecc---- Q lcl|NC_019418. 180 -------------------VYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGE-------RVNLS---- 229 (527) Q Consensus 180 -------------------~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~-------~v~l~---- 229 (527) ...+++|+ |+......... ..+.+.+. .|.. ..+.+ .+.+. T Consensus 236 ~~~~~~~~~~~~~~~~~~~~rVrv~E~--w~r~~~~~~~~--~~~~g~~~--~~~~---~~~~~~~~l~~g~~~~~~~~~ 306 (772) T protein:vir:10 236 WNEARAWTVQEDHWYNPTSKEICLVEL--WYRRWVQVHVL--KSPDGRVV--EYDP---NNLAHNIALASGRISPKKVTV 306 (772) T ss_pred cchhhccccccccccccCCceEEEEEE--eeeeeeeeeee--ccCCCceE--eeCc---ccHHHHHHHhhcccchheeee Confidence 01112221 11000000000 00011000 0000 00000 00000 Q ss_pred -cc-cCCcccceeec----CCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeech Q lcl|NC_019418. 230 -EL-YPDLQPVTPIQ----GLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPE 303 (527) Q Consensus 230 -~~-~~~l~~~~~~~----g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~ 303 (527) .+ +.-......+. .++.-.|.|+|.-..-....+-|+| .+.++++.++.+|.+.|+.++-+-.. ++.... T Consensus 307 ~rv~~~~~~g~~~L~~~~~p~~~~~fP~vP~~g~r~~~~g~~~G--~vr~~kd~Qr~~N~~~S~~~~~l~~~--~~~~~~ 382 (772) T protein:vir:10 307 SRVRRSYWLGPHCLHDGPTPYTHRHFPYVPFFGFREDATGIPYG--YVRGMKYAQDSLNSGVSKLRWGMSVA--RVERTK 382 (772) T ss_pred eEEEEEEEecceeeccCCCCCCCCccceEEEeeeEeccCCcccc--hhhhhhhHHHHHHHHHHHHHHHHhcc--cccccC Confidence 00 00000001111 0111112233221111111223554 69999999999999999999876432 344433 Q ss_pred hHhcCCCCCCCcccccccccccccceeeeccCCCC--CCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccc Q lcl|NC_019418. 304 QMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNM--DSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDG 381 (527) Q Consensus 304 ~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~--~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~ 381 (527) ..+... +.... ..... ++ .+ ..++.+.. ....++..++.---.+++..++.....|....|++...+|..+ T Consensus 383 gav~~~-d~~~~--e~~ar--p~-~v-i~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~~~ 455 (772) T protein:vir:10 383 GAVAMT-DAQFR--RQIAR--PD-AD-IVLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGRKG 455 (772) T ss_pred CCccch-hHHHH--HhccC--CC-Ce-EEeCCccccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcCCCc Confidence 333210 00000 00000 11 11 11221111 1223554443322356889999888899999999999988654 Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-------cccCCc-c-------------------- Q lcl|NC_019418. 382 QGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVV-------GIYRGT-I-------------------- 433 (527) Q Consensus 382 ~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~-------~~~~~~-~-------------------- 433 (527) ...++.+|..+..........+...+..+.+.+-+.+|.+...+ .+-+.. . T Consensus 456 -na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~ 534 (772) T protein:vir:10 456 -TATSGIQEQQQIEQSNQSIGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAA 534 (772) T ss_pred -chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceeccccccc Confidence 44677788877777777777777777888877777777664321 011100 0 Q ss_pred -------cCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHH------HHhcCCCCHHHHHHHHHHHHHhcccccc Q lcl|NC_019418. 434 -------PELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRG------IAKTLGITEEEAEKELAEINGELPPESD 500 (527) Q Consensus 434 -------~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~------i~~~~~~~deea~~el~ri~~E~~~~~~ 500 (527) ...++|.|+=..+.+.=+++.++.++++. +.+++... +.++-++.- .++.++++++-+.+.+. T Consensus 535 ~~~NDi~~g~yDv~i~~~p~~~t~r~~~~~~m~ql~--~~~~P~~~~~~~~~~le~~D~p~--~~ei~~~ir~~~~~~~p 610 (772) T protein:vir:10 535 YLSNDLLRTRIKVALEDVPSTNSYRGQQLNAMSEAV--KSMPPQYQAAVLPFLVSLMDVPF--KRDVVEAIRAVDQQQTP 610 (772) T ss_pred ceeccceeeeEEEEeeccccchHHHHHHHHHHHHHH--hccChhHHHHHHHHHHhhcCCCC--hHHHHHHHHHHhccCCh Confidence 00011111111111111344555555554 33455432 122223321 22333344432211110 Q ss_pred cccCCCCCCCCCCCCCCCCCCC------------ccccC Q lcl|NC_019418. 501 AELALYGKGQQNTVGNSKDTVD------------DEDEA 527 (527) Q Consensus 501 ~~~~~~~~~~~~~~~~~~~~~~------------~~~~~ 527 (527) .... ....+.. ...-.... ...++ T Consensus 611 eq~~--~~~~q~~-qq~~~~~~~el~~~q~~a~~~~~~A 646 (772) T protein:vir:10 611 EQIQ--QQIDQAV-QDALAKAGNDIKLRELEIKERKADS 646 (772) T ss_pred HHHH--HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000 0000000 00000000 00000 No 114 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.06 E-value=6.5e-10 Score=70.96 Aligned_cols=462 Identities=11% Similarity=0.045 Sum_probs=196.1 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhh-ccCccccCHHHHHHHHHHHHHhc-CC---------CcccccccccCccccCc Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSIL-DHPKVAVTQSEFRRIQHNLAYYQ-SK---------FDDIEYTNTDGDRKRRK 69 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~-~~~~i~~~~~~~~~i~~~~~~y~-g~---------~~~l~~~~~~~~~~~~~ 69 (527) |.=.-++..|..+...-=..+.++..- ..+.++.++-..........-.. +- ..|.......|..-.-- T Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~gyql~al 116 (765) T protein:vir:96 37 MIKLGKIRGWNVEPEKAPVIRSVKDFLEPGLSVAMDSAYGDGPTPAAKAAAGGQNPYVVPTMLQDWYNSQGFIGYQACAI 116 (765) T ss_pred chhHHHHhhcccccccCCCCCCCCcccCcccceeccccccccccchHHHhhhccCccchhhHHHhhhcccCCccHHHHHH Confidence 555555556554432111112221111 11122222211000000000000 00 00000000001000011 Q ss_pred eeecchHHHHHHHHhhhhhcccceEeeCCH----HHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCee--E Q lcl|NC_019418. 70 MQHLPIARTAAKKIASLVYNEQAEISAEDE----TLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKI--R 143 (527) Q Consensus 70 ~~~lnl~~~i~~~~A~ll~~e~~~i~~~d~----~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~--~ 143 (527) +.+-.+++.+|+..|.-++.+..+|+.+++ ...++|++.+++-+++..+.+++..+-.+|++++.+-+++... . T Consensus 117 Y~~~~l~rkiVd~pAeDa~R~g~~I~~~~~e~~~~~~~~l~~~~~rl~v~~~l~ea~~~~RlyGga~i~i~i~~~D~~~l 196 (765) T protein:vir:96 117 ISQHWLVDKACSMSGEDAARNGWELKSDGRKLSDEQSALIARRDMEFRVKDNLVELNRFKNVFGVRIALFVVESDDPDYY 196 (765) T ss_pred HHhCchhhhhhhcchHHhhcCCceeecCccccCHHHHHHHHHHHHHhhHHHHHHHHHHHhhhceeeEEEEEecccCcchh Confidence 234489999999999999999988887543 3445677777777899999999999999999988766643211 0 Q ss_pred EEEEcCCceEEEEEcCCceEEEEE-EEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCcccc Q lcl|NC_019418. 144 VAFIQAPVFLPLQSNTQDVSSAAI-LTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQL 222 (527) Q Consensus 144 i~~v~a~~~~P~~~d~~~~~~~a~-~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~l 222 (527) -.-+.++.+-| +.+.++. +.+..... ++..|+..-. ....+ ..-..|.|. T Consensus 197 ~~PL~~~~I~k------g~~kgl~vldp~~~~~------~~v~e~~~Dp--~sp~f---g~P~~y~i~------------ 247 (765) T protein:vir:96 197 EKPFNPDGIAP------GSYKGISQIDPYWAMP------QLTAESTADP--SAEHF---YEPDFWIIS------------ 247 (765) T ss_pred hcccccccccc------ceeeEEEEechhhccc------ccchhccccc--ccccc---Ccceeeeec------------ Confidence 01111111111 1111111 10000000 0000000000 00000 000111111 Q ss_pred CceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeec Q lcl|NC_019418. 223 GERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVP 302 (527) Q Consensus 223 G~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~ 302 (527) |..|.-+ ..+.+.|.+-|. +.+. . ...||+|++..+.+.|..++.+......=+.....+++-- T Consensus 248 g~~IH~S-------Rli~~~g~~lpd--~lk~--~-----~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~v~k~ 311 (765) T protein:vir:96 248 GKKYHRS-------HLVVVRGPQPPD--ILKP--T-----YIFGGIPLTQRIYERVYAAERTANEAPLLAMSKRTSTIHV 311 (765) T ss_pred Cceeccc-------eEEEecCCCchh--hhcc--c-----cCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeee Confidence 1111100 011122211111 1110 1 1236999999999999999987765554443333333321 Q ss_pred hhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcc-cccccc Q lcl|NC_019418. 303 EQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSG-MFTFDG 381 (527) Q Consensus 303 ~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~-~~~~~~ 381 (527) .++....+.+.-.-.. ..+...+..+-.+-.+.+ ..++.++.++ .-....+...+.+|+..++++.. -||... T Consensus 312 -~~~~~l~~~~~l~~r~-~~~~~~r~n~g~~~id~e--e~~e~~s~~l--sgl~d~l~~~~~~iAaas~IP~t~LfGqsp 385 (765) T protein:vir:96 312 -DVEKAIANEDAFNARL-AFWIANRDNHGVKVIGID--ETMEQFDTNL--SDFDSVIMNQYQLVAAIAKTPATKLLGTSP 385 (765) T ss_pred -chHhhhccHHHHHHHH-HHHHHhcCCceeEEecCC--cceeEEeccc--CCHHHHHHHHHHHHHhhhCCCeeeeccCCc Confidence 2221111111100000 000001111100111222 2366665443 34567777888889999999864 456654 Q ss_pred ccc-chHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHH-- Q lcl|NC_019418. 382 QGV-KTATEIVSENSDTYQMRNSIV-ALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDY-- 457 (527) Q Consensus 382 ~g~-~TAtei~s~~~~~~~~~~~~~-~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~-- 457 (527) +|. +|+.+=. +..+..++.+| ..++..|+.|+..|+... .. ..+++|.|++-...+..+.++. T Consensus 386 ~GlnATGe~D~---~nYyD~I~s~Qe~~l~p~le~L~~li~~s~--------~i--~~d~~i~FnpL~~~sekEkAei~~ 452 (765) T protein:vir:96 386 KGFNATGEHET---ISYHEELESIQEHIFDPLLERHYLLLAKSE--------SI--DVQLEIVWNPVDSTTSQQQAELNN 452 (765) T ss_pred ccccCcchHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--------CC--CCcceEEeCCCCCCCHHHHHHHHH Confidence 553 5555322 33444555454 457888888888876431 11 2368999999888887665544 Q ss_pred -----HHHHHhcCCCCHHHHHHhc--------CCCCHHHHHHHHHHHHHhccccccc----ccCCCCCCCCCCCC-CCCC Q lcl|NC_019418. 458 -----WMKMVAAGFATQKRGIAKT--------LGITEEEAEKELAEINGELPPESDA----ELALYGKGQQNTVG-NSKD 519 (527) Q Consensus 458 -----~~~~~~aGi~s~~~~i~~~--------~~~~deea~~el~ri~~E~~~~~~~----~~~~~~~~~~~~~~-~~~~ 519 (527) +.+++.+|+++..+++.++ ..+++++++.+ .-+..+...+... ....+.++..++.. ...+ T Consensus 453 k~Aea~~~~~~~Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~-~~~~pe~~~~~~~~~~~~~~~~~e~~~~~a~p~~~e 531 (765) T protein:vir:96 453 KKAATDEIYINSGVVSPDEVRERLRDDPRSGYNRLTDDQAETE-PGMSPENLAELEKAGAQSAKAKGEAERAEAQAGAVE 531 (765) T ss_pred HHHHHHHHHHhcCCCCHHHHHHHHhccccCCCCCCCccccccc-cCCCccccccccCCCcccccccCccccccCCCCccC Confidence 6677888999988877654 12444433211 0010000000000 00000000000000 0000 Q ss_pred CCCccccC Q lcl|NC_019418. 520 TVDDEDEA 527 (527) Q Consensus 520 ~~~~~~~~ 527 (527) ..++..++ T Consensus 532 g~~~~~~~ 539 (765) T protein:vir:96 532 GAGDPVPA 539 (765) T ss_pred CCCccccc Confidence 00011110 No 115 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=99.05 E-value=3.6e-09 Score=66.90 Aligned_cols=498 Identities=10% Similarity=0.010 Sum_probs=220.2 Q ss_pred hHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccc-----ccCccccCceeecchHH Q lcl|NC_019418. 3 LIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTN-----TDGDRKRRKMQHLPIAR 77 (527) Q Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~-----~~~~~~~~~~~~lnl~~ 77 (527) |=++.++.+++.+.++- . ...-..+-+....+-++||.+....|.... ..+....+..++.|+=+ T Consensus 1 m~~~~~~~~~~~~~~~~-----~-----~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~ 70 (708) T protein:vir:10 1 MAETLEKKHERIMLRFD-----R-----AYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVA 70 (708) T ss_pred CchhHHHHHHHHHHHHH-----H-----HHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchH Confidence 44555555555443321 0 011123344444445667765444553221 12333346678889999 Q ss_pred HHHHHHhhhhhcccceEeeC------CHHHHHHHH----HHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-------- Q lcl|NC_019418. 78 TAAKKIASLVYNEQAEISAE------DETLNDFLS----DMLSNDRFNKNFERYLESALALGGLAMRPYVDG-------- 139 (527) Q Consensus 78 ~i~~~~A~ll~~e~~~i~~~------d~~~~~~l~----~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-------- 139 (527) .+|+...++--...+.+.+. +..+++.|+ .+.+.++.......+...++..|-+|++++.|. T Consensus 71 ~~v~~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~e~d~~~ 150 (708) T protein:vir:10 71 TELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMD 150 (708) T ss_pred HHHHHHHHHHHhCCcceEEEcCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhcccceeeeeeccccccCCCC Confidence 99999998877777777652 234455444 455577888889999999999999999987541 Q ss_pred --CeeEEEEE--cCCceE--EEEEcCCceEEE--EEEEEEEeeC---------------------------CCcceEEEE Q lcl|NC_019418. 140 --DKIRVAFI--QAPVFL--PLQSNTQDVSSA--AILTKTIKTE---------------------------NRKNVYYTL 184 (527) Q Consensus 140 --~~~~i~~v--~a~~~~--P~~~d~~~~~~~--a~~~~~~~~~---------------------------~~~~~~yt~ 184 (527) .+++|..+ +...++ |-+... ....| ++..+++..+ +.+.++ + T Consensus 151 ~~~~i~i~~~~~p~~~v~~Dp~a~~~-D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~~~~d~v~--v 227 (708) T protein:vir:10 151 DRQRIAIEPIYDPSRSVWFDPDAKKY-DKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIY--I 227 (708) T ss_pred CccccceEEeecchhhcccCcccccc-ChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccccCCCceE--E Confidence 23444443 334444 211111 11111 1111111000 001111 1 Q ss_pred EEEEeecccccccceeeecCCceEEEEEEEecCCccc-------cCceee----ccc--c-cCCcccceeecCCCcccEE Q lcl|NC_019418. 185 VEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQ-------LGERVN----LSE--L-YPDLQPVTPIQGLSRPLFT 250 (527) Q Consensus 185 lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~-------lG~~v~----l~~--~-~~~l~~~~~~~g~~~p~f~ 250 (527) .||.+... .........+...+.|. .|.+..... .|.... ... + +--+.+...+.+-...++. T Consensus 228 ~ey~~r~~-~~~~~~~~~~~~tg~~~--~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~~~p~~ 304 (708) T protein:vir:10 228 AKYYEVRK-ESVDVISYRHPITGEIA--TYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGE 304 (708) T ss_pred EEeeeEEE-EEEEEEEEecCCCCcee--eecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhhccCCCCCCC Confidence 12211000 00000000000001000 010000000 000000 000 0 0000000011111112222 Q ss_pred Eec---CCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeee-chhHhcCCCCCCCcccccccccccc Q lcl|NC_019418. 251 YLK---TPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIV-PEQMTQLKVQDNQGNIAFKRRFDVE 326 (527) Q Consensus 251 ~~~---~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v-~~~~l~~~~~~~~~~~~~~~~~d~~ 326 (527) +|| +-..-..-.+.|.+-+.+.++++.++.+|.+.|...+-+-..+..+++ +...+. +-......++.+ T Consensus 305 ~fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~-------~~~~~~~~~~~~ 377 (708) T protein:vir:10 305 HIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIR-------GLEKHWEARNKK 377 (708) T ss_pred ceeeEEEeeeeeccCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhh-------hHHHHHhhcccc Confidence 232 211111112455344669999999999999999999888654443333 232221 000000111222 Q ss_pred cceeeeccC---CCCC----CCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHH Q lcl|NC_019418. 327 QNVYMQVGA---GNMD----SGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQ 399 (527) Q Consensus 327 ~~~~~~~~~---~~~~----~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~ 399 (527) ...|...+. ..+. ...++.+++.--...+...++.....|....|+++..+|..+ + .++.+|.+....... T Consensus 378 ~~~~~~~~~~~~~~G~~~~~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~~s-n-~SG~aI~~rq~qg~~ 455 (708) T protein:vir:10 378 RPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPS-N-IAQETVNNLMNRADM 455 (708) T ss_pred chhhhccccccccccccccccCCccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccCcc-c-hHHHHHHHHHHHHHH Confidence 222322111 0111 112334444333455788888888899999999999998533 3 478888888877777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhh----c---ccCC-----------c---------------ccCccceEEEeCCC Q lcl|NC_019418. 400 MRNSIVALVEQSIKELCVSMCELGKVV----G---IYRG-----------T---------------IPELDDISVNLDDG 446 (527) Q Consensus 400 ~~~~~~~~~~~al~~li~~il~~~~~~----~---~~~~-----------~---------------~~~~~~v~v~f~d~ 446 (527) ........+..+.+..-+.+|.+...+ . +-+. . ....++|.|+=..+ T Consensus 456 ~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~ 535 (708) T protein:vir:10 456 ASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPS 535 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccC Confidence 777777888888887777776654321 1 1110 0 01123455544444 Q ss_pred ccCCHHHHHHHHHHHHhcCC-CCHHHH-----HHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_019418. 447 VFTDRHAELDYWMKMVAAGF-ATQKRG-----IAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDT 520 (527) Q Consensus 447 i~~d~~~~~~~~~~~~~aGi-~s~~~~-----i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (527) .+.-+++.++.++++..+.. ..+.++ +.++-++.- +++.+++|++...+.....+....+.+.......-.. T Consensus 536 ~~s~r~~~~~~l~qll~~~~p~~~~~~~~~~~~l~~~D~p~--~~ei~erir~~~~~~~~~~~~~~ee~q~~~~~q~~~q 613 (708) T protein:vir:10 536 YTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEG--LDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQ 613 (708) T ss_pred chhHHHHHHHHHHHHHHhcCCCchhhHHHHHHHHHhcCCcC--hHHHHHHHHHhhcccccccccchhhHHHHHHHHHHHH Confidence 44445666777777765432 111221 223323322 3455666666543322111111000000000000000 Q ss_pred C-C--c--cccC Q lcl|NC_019418. 521 V-D--D--EDEA 527 (527) Q Consensus 521 ~-~--~--~~~~ 527 (527) . - . +-.. T Consensus 614 ~q~~~~~~e~qa 625 (708) T protein:vir:10 614 SQPNPEMVLAQA 625 (708) T ss_pred HHHHHHHHHHHH Confidence 0 0 0 0000 No 116 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.05 E-value=9.1e-10 Score=70.15 Aligned_cols=439 Identities=13% Similarity=0.096 Sum_probs=183.9 Q ss_pred CChHHH----------------HHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCc Q lcl|NC_019418. 1 MSLIQK----------------VKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGD 64 (527) Q Consensus 1 m~~~~~----------------~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~ 64 (527) |.+.++ ++...+... -+-.-++..+.......-+...+..=.....||.. ....+- T Consensus 66 ~~~~~~~~~~~~~~~~~a~~~a~~~~~~~~~-~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~-------~~f~gy 137 (862) T protein:vir:99 66 VEISDSVNAKSVSGKNFAMDSAVRSAIKAIT-GFAMDDGGGAPVPIGAEGKQSSYAVPEALQDWYLS-------QGFIGH 137 (862) T ss_pred ccccccccchhhhhhhhcchhhcchhhhhhh-hhhhhcchhhhhhccccccccccccchhccccccc-------cCcccH Confidence 221111 111111000 00000000100000000000000000000112211 111110 Q ss_pred cccCceeecchHHHHHHHHhhhhhcccceEeeCC------HHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe Q lcl|NC_019418. 65 RKRRKMQHLPIARTAAKKIASLVYNEQAEISAED------ETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD 138 (527) Q Consensus 65 ~~~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~d------~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d 138 (527) .-..-+..-.+++.+|+..|+-++.+...|.+.+ +...+.|++.+++.++...+.+++..+-.+|++++.+.++ T Consensus 138 ql~alY~~~~larkiVd~pAeDatR~g~~I~~~~d~~e~~~e~~~~ie~~~~rL~v~~~l~eair~~RLyGga~ililv~ 217 (862) T protein:vir:99 138 QACALIAQHWLVDKACSLAGEDAIRNGWHLKSLGEGEEIDEESLEKFKAIDVEFKVKENLIEFNRFKNVFGIRVAIFVVD 217 (862) T ss_pred HHHHHHHhCchhhhhhhhhhHHHhhCCceEeecCcccccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEEec Confidence 0001123448999999999999999999998742 3445677777777788888999999888899887766554 Q ss_pred CCeeE-E-EEEcCCceEEEEEcCCceEEEE-EEEEEE---------eeCCCcceEEEEEEEEeecccccccceeeecCCc Q lcl|NC_019418. 139 GDKIR-V-AFIQAPVFLPLQSNTQDVSSAA-ILTKTI---------KTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSL 206 (527) Q Consensus 139 ~~~~~-i-~~v~a~~~~P~~~d~~~~~~~a-~~~~~~---------~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~ 206 (527) +..+. + .-+.++.+ ..+.+.++ ++.+.. ..+.....||. -.. T Consensus 218 ~~D~~~LsqPLn~e~I------~kG~lkgl~vlDp~w~~p~~v~~~~~Dp~sp~yGk--------------------P~~ 271 (862) T protein:vir:99 218 SEDPDYYEKPFNPDGI------TPGSYRGISQIDPYWMMPMLTAESTADPSSQFFYE--------------------PEF 271 (862) T ss_pred CcCchhhhcCcCcccc------cccceeEEEEechhhhcccccccccccccccccCC--------------------cee Confidence 32110 0 00111110 00111111 111100 00000000110 011 Q ss_pred eEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHH Q lcl|NC_019418. 207 YRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYD 286 (527) Q Consensus 207 ~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s 286 (527) |.|. |..|.-+. .+.+.|-+-|. +.++ ....||+|++..+.+.|...+.+.. T Consensus 272 y~I~------------g~~IH~SR-------liif~g~~vpd--~lk~-------ay~f~G~SvLe~iyd~L~~~d~t~~ 323 (862) T protein:vir:99 272 WIIS------------GQKYHRSH-------LIIARGPQPAD--ILKP-------TYIFGGIPLVQRIYERVYAAERTAN 323 (862) T ss_pred eeec------------Ceeeccce-------eEEecCCCchh--hhhc-------cCCccCccHHHHHHHHHHHHHHHHH Confidence 1111 11111000 11122211111 1111 1235799999999999999998766 Q ss_pred HHHHHHHcCcceeeechhHhcCCCCCCCc--ccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHH Q lcl|NC_019418. 287 EFMWEIKMGQRRVIVPEQMTQLKVQDNQG--NIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLK 364 (527) Q Consensus 287 ~~~~e~~~~~~~i~v~~~~l~~~~~~~~~--~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~ 364 (527) ....-+.....+++-- ..+....+...- .......+..+..+.. ++.+ ..++.++.++ .-....+..... T Consensus 324 saa~Ll~ka~l~v~kt-d~l~~l~~ed~l~~r~~~~~~~rdN~Gi~l---iD~e--Ee~e~ls~sl--SGL~dll~~~~q 395 (862) T protein:vir:99 324 EAPLLAMNKRTTAIHT-DTAKAIANEDKFIQRLMFWVRYRDNHAVKV---LGTD--ETMEQFDTSL--ADFDAVIMGQYQ 395 (862) T ss_pred HHHHHHHHhccceeec-hhHhhhccHHHHHHHHHHHHhccCcceeEE---ecCC--CceeEEeccc--CChHHHHHHHHH Confidence 5554443333333221 111111111000 0000000101111111 1222 2366555443 345567777788 Q ss_pred HHHHhcCCCccc-cccccccc-chHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEE Q lcl|NC_019418. 365 LFEMQIGVSSGM-FTFDGQGV-KTATEIVSENSDTYQMRNSIV-ALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISV 441 (527) Q Consensus 365 ~i~~~~g~s~~~-~~~~~~g~-~TAtei~s~~~~~~~~~~~~~-~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v 441 (527) +|+..++++..- ||...+|. +|+.+=...+ +..++.+| ..++..|+.|+..|. +. + + . ..+++| T Consensus 396 ~IAaas~IP~tiLfGqspaGlnATGE~D~~nY---yD~I~s~QE~~L~P~LerL~~li~-~~----l--g-~--~~d~~i 462 (862) T protein:vir:99 396 LVASIAKTPATKLLGTAPKGFNSTGEFETISY---HEELESIQEHVYMPFLQRHYLISR-LS----L--G-I--QHEIDV 462 (862) T ss_pred HHHhhhCCCceeecccCcccccCchHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHH-Hh----c--C-C--CCcceE Confidence 899999999774 67665553 5665433333 34444443 457788887765443 21 1 1 1 246899 Q ss_pred EeCCCccCCHHHHHHH-------HHHHHhcCCCCHHHHHHhc--------CCCCHHHHHHHHHHHHHhcccccccccCCC Q lcl|NC_019418. 442 NLDDGVFTDRHAELDY-------WMKMVAAGFATQKRGIAKT--------LGITEEEAEKELAEINGELPPESDAELALY 506 (527) Q Consensus 442 ~f~d~i~~d~~~~~~~-------~~~~~~aGi~s~~~~i~~~--------~~~~deea~~el~ri~~E~~~~~~~~~~~~ 506 (527) .|++-...+..+.++. +.+++.+|+++..+++.++ .+++++++++.-. +..+.. ....-+ T Consensus 463 eFnpL~~~sekEkAEi~kk~Aea~~~lv~sGvispdEvR~~L~~~~~~g~~~l~ded~E~d~~-~~~e~~----~~~e~~ 537 (862) T protein:vir:99 463 VMEPVASMTAQQQADLNKTKAEGGKVLIDGGVISPDEERNRIRDDKRSGYNRLTKEDAEETPG-ASPENL----AAYQKA 537 (862) T ss_pred EeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCcCCCCCCcccccccCC-CCcccc----cccccC Confidence 9999888887766544 4577888999998877652 2355544321100 000000 000000 Q ss_pred CCCCCCCCCCC-------------------------CCCCCc---------cccC Q lcl|NC_019418. 507 GKGQQNTVGNS-------------------------KDTVDD---------EDEA 527 (527) Q Consensus 507 ~~~~~~~~~~~-------------------------~~~~~~---------~~~~ 527 (527) ++...+.+.++ +...+. ++++ T Consensus 538 g~a~~~ap~de~~aga~~~~~e~d~~~~p~~~~~~~g~~~~~t~~~~a~~p~~~~ 592 (862) T protein:vir:99 538 GAAQETASAKETQAGAAVTTAEGDQPNVQMVPSMKPGQMVGPEVGITAPMPEDDA 592 (862) T ss_pred CcccccccccccccccCCccccCCcccccccCCCCCCCccccccccccCCCcccc Confidence 00000000000 000000 0000 No 117 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=99.04 E-value=4.2e-09 Score=66.52 Aligned_cols=391 Identities=13% Similarity=0.066 Sum_probs=175.1 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) ||||++++++|++-. ..+.++.. .+..|+.+.. +. . +.-+...--..++ T Consensus 1 MG~~~~~~~~~~~~~--------------~~~~~~~~------~~~~~~g~~~--~~---~------~~al~~~~V~~~v 49 (411) T protein:vir:81 1 MGWWSRLTRFFRPRN--------------ETVDMTNP------LLLQWLGVDP--DT---P------RNQLSEATYFACL 49 (411) T ss_pred CchHHHHHhhccCcc--------------cccccchH------HHHHHhcCcc--cC---h------hhhhccHHHHHHH Confidence 999999999886411 11222221 1334443321 10 0 0111111122344 Q ss_pred HHHhhhhhcccceE---------eeCCHHHHHHHHH-HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCee-EEEEEcC Q lcl|NC_019418. 81 KKIASLVYNEQAEI---------SAEDETLNDFLSD-MLSNDRFNKNFERYLESALALGGLAMRPYVDGDKI-RVAFIQA 149 (527) Q Consensus 81 ~~~A~ll~~e~~~i---------~~~d~~~~~~l~~-~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~-~i~~v~a 149 (527) +.+|+-+-+-|..+ .+.+..+...|+. --..-.....+..++...+..|.+++.+..+++++ .+..++| T Consensus 50 ~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~g~~~~l~~l~~ 129 (411) T protein:vir:81 50 KILSESLGKLPLKMYQKTERGIVKSDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYSGPQLQALWILPS 129 (411) T ss_pred HHHHHhHhhCceeEEEecCCceeeecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCceEEEEEECC Confidence 55555544434333 2223333333321 11111223334455667777899998888887665 3566777 Q ss_pred CceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecc Q lcl|NC_019418. 150 PVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLS 229 (527) Q Consensus 150 ~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~ 229 (527) +.+-|+. +.++... .+...+|+. .. ..-|..+.+ T Consensus 130 ~~v~~~~-~~~~~~~-----------~~~~~~~~~-~~--------------------------------~~~g~~~~~- 163 (411) T protein:vir:81 130 QYVTIVV-DDRGLLG-----------EKNAIWYRY-ND--------------------------------PYDGKMYVF- 163 (411) T ss_pred ceEEEEE-cCccccc-----------ccceEEEEE-Ee--------------------------------cCCceEEEE- Confidence 7766642 2222110 011111210 00 000111110 Q ss_pred cccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCC Q lcl|NC_019418. 230 ELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLK 309 (527) Q Consensus 230 ~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~ 309 (527) . .--..||+.+.+ .+..+|+|.+.-+...+......-.-..+-|..|.. |..++..+ T Consensus 164 ------------~---~~eiih~k~~~~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~----p~gil~~~ 220 (411) T protein:vir:81 164 ------------R---NDEILHFKTSVT----FDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLT----GKAVLEYT 220 (411) T ss_pred ------------c---cccEEEEcCCCC----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCC----CceEEEeC Confidence 0 001345653211 123469998888887776666554444444555332 22222222 Q ss_pred CCCCCc-ccccccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccc Q lcl|NC_019418. 310 VQDNQG-NIAFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQG 383 (527) Q Consensus 310 ~~~~~~-~~~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g 383 (527) ...... .-.....| ...|.+.+ .+ -.++..++.++......++.+..+....+|+...|++|..+|...++ T Consensus 221 ~~l~~e~~~~~~~~~---~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~ 297 (411) T protein:vir:81 221 GDLNQEARDRLVKGF---EQFANGSKNAGKIIPVPLGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKS 297 (411) T ss_pred CCCCHHHHHHHHHHH---HHHhcCccccCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC Confidence 111100 00000111 11122111 00 01222355555555667788888888889999999999999876544 Q ss_pred c-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc-CCcccCccceEEEeCCCccCCHHHHHHHHHHH Q lcl|NC_019418. 384 V-KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIY-RGTIPELDDISVNLDDGVFTDRHAELDYWMKM 461 (527) Q Consensus 384 ~-~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~-~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~ 461 (527) . .++.+. ....++.+|..++..|...... .+. .........+.++++.-+..|..+.++...++ T Consensus 298 t~~n~e~~-------------~~~f~~~~l~P~~~~ie~~l~~-~ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~ 363 (411) T protein:vir:81 298 SYASAEAQ-------------NLAFYVDTLLYVLKQYEEEITY-KILSNDLISQGHYFKFNVNVILRADIKTQMDSLSTA 363 (411) T ss_pred CchhHHHH-------------HHHHHHHHHHHHHHHHHHHHHh-hcCChhhcCCCcEEEeechhhhccCHHHHHHHHHHH Confidence 2 222222 1122334444444443322211 111 11112334466677776778999999999999 Q ss_pred HhcCCCCHHHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_019418. 462 VAAGFATQKRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDE 524 (527) Q Consensus 462 ~~aGi~s~~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (527) +.+|+|++-+++.. .|+...+ ..+-+ .....-.-+. .++....+||. T Consensus 364 ~~~g~~t~NE~R~~-~gl~p~~ggD~~~--------------~~~n~~pl~~-~~~~~~kgGd~ 411 (411) T protein:vir:81 364 VQNGIMTPNEARDY-LDMPADDYGNNLM--------------ANGNYIPLSM-LGANYGKGGDS 411 (411) T ss_pred HhCCCcCHHHHHHH-hCCCCCCCCCeee--------------eccCccchhh-hhhhhccCCCC Confidence 99999999997654 3654311 00000 0000000000 00000111111 No 118 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.01 E-value=4.4e-10 Score=71.87 Aligned_cols=406 Identities=12% Similarity=0.098 Sum_probs=173.8 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccc---cccCccccCceeecchHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYT---NTDGDRKRRKMQHLPIAR 77 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~---~~~~~~~~~~~~~lnl~~ 77 (527) |.+ |-.-+..+.+ -|..+...+. ...|-.-..-+.+-.+++ T Consensus 1 ~~~------~~~d~~~~~~------------------------------~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~ 44 (427) T protein:vir:10 1 MKI------VKHDGYNDIF------------------------------NGGADGSPKPFFMSDASYHVGSFYNDNATAK 44 (427) T ss_pred CCc------cccchHHHHh------------------------------hcCCCCcccCccccCchHHHHHHHHcCchhh Confidence 222 1111111111 1110000000 000000001122347899 Q ss_pred HHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcCCceEEEEE Q lcl|NC_019418. 78 TAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQAPVFLPLQS 157 (527) Q Consensus 78 ~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a~~~~P~~~ 157 (527) .+|+..|.-++.+..+|+.+++ .+.++..+++-+++..+.+++..+-.+|++++.+-+++++..- -|+ T Consensus 45 ~~Vd~~aed~~r~g~~i~g~~~--~~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~--------~p~-- 112 (427) T protein:vir:10 45 RIVDVIPEEMVTAGFKMSGVKD--EKEFKSLWDSYKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLT--------SQA-- 112 (427) T ss_pred hhhccchHHhhcCCccccCccH--HHHHHHHHHHhhHHHHHHHHHHhccccceeEEEEEecCCCccc--------ccc-- Confidence 9999999999998877776543 2456666777789999999999999999999988776543211 011 Q ss_pred cCCceEEE-EEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCcc-ccCceeecccccCCc Q lcl|NC_019418. 158 NTQDVSSA-AILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDS-QLGERVNLSELYPDL 235 (527) Q Consensus 158 d~~~~~~~-a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~-~lG~~v~l~~~~~~l 235 (527) ...+.+.+ .++++..-. . + +++.- .....-|..+ .|+-.... .-+..|.-+ T Consensus 113 ~~~g~l~~l~v~d~~~~~-~-----~---~~~~d----------p~s~~fg~P~--~y~v~~~~~~~~~~iH~S------ 165 (427) T protein:vir:10 113 KPGAKLEGVRVYDRFAIT-V-----E---KRVTN----------ARSPRYGEPE--IYKVSPGDNMQPYLIHHS------ 165 (427) T ss_pred CCCcceeEEEEechhccc-c-----c---ccccC----------ccccccCcce--EEEEecCCCCcceEEccc------ Confidence 11111222 122211000 0 0 00000 0000000111 11110000 000011100 Q ss_pred ccceeecCCCcccEEEecCCccccccCCCccCcchhhh-hHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCC Q lcl|NC_019418. 236 QPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDN-AKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQ 314 (527) Q Consensus 236 ~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~-~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~ 314 (527) ..+.+.|.+-|. +. ......||.|++.. +.+.+..++.+-.....=+...+.+++--..+-....++.. T Consensus 166 -Rli~~~g~~~p~---~~------~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~k~~~~v~k~~~l~~~~~~~~~ 235 (427) T protein:vir:10 166 -RVFIADGERVAQ---QA------RKQNQGWGASVLNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDA 235 (427) T ss_pred -cEEEecCCCchh---hh------cccCCcccchhhhHHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHHhcCccc Confidence 011122221110 10 01235689999865 66878888877665554443323323221111111111111 Q ss_pred cccccccccc---cccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccc-cccccccc-chHHH Q lcl|NC_019418. 315 GNIAFKRRFD---VEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGM-FTFDGQGV-KTATE 389 (527) Q Consensus 315 ~~~~~~~~~d---~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~-~~~~~~g~-~TAte 389 (527) .......+. ..+.....+... +....++.++.++ .-....++....+|+..+|++..- ||...+|. .|+.+ T Consensus 236 -~~~~~~r~~~~~~~~~~~~~~~l~-~~~e~~e~~~~~l--sgl~~~~~~~~~~iaaa~~IP~t~L~G~sp~Glnstgd~ 311 (427) T protein:vir:10 236 -QYAARLRLAQVDDNSGVGRAIGID-AETEEYDVLNSDI--SGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNT 311 (427) T ss_pred -hHHHHHHHHHHHHhcCcccceeee-cCCCceeEEeccc--CChHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhH Confidence 100000111 111111111111 2223466555443 345666777788899999998764 46666664 44454 Q ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHH-------HHH Q lcl|NC_019418. 390 IVSENSDTYQMRNSIV-ALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYW-------MKM 461 (527) Q Consensus 390 i~s~~~~~~~~~~~~~-~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~-------~~~ 461 (527) -.+.+ +..++.+| ..++..|+.|+..|++ + .++++.|++-...+..+.++.. .++ T Consensus 312 D~~ny---yd~i~~~Qe~~l~p~l~~l~~~i~~--------s------~~~~~~f~pL~~~s~kEkaei~~~~a~a~~~~ 374 (427) T protein:vir:10 312 ALETF---YKLVDRKREEDYRPLLEFLLPFIVD--------E------EEWSIEFEPLSVPSKKEESEITKNNVESVTKA 374 (427) T ss_pred HHHHH---HHHHHHHHHHHHHHHHHHHHHHhhc--------C------CCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHH Confidence 33333 34444444 4577888888776542 1 2678999998888877665443 334 Q ss_pred HhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccC--CCCCCCCCCCCCCCCCCCccc Q lcl|NC_019418. 462 VAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELA--LYGKGQQNTVGNSKDTVDDED 525 (527) Q Consensus 462 ~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~ 525 (527) +.+|++++ +|++++|..+-.+....+..... .+.+..+.++.. ++...|++ T Consensus 375 ~~~gvi~~------------~e~r~~L~~~~~~~~~~~~~~~~~e~~~~~~e~~p~~-~e~~~d~~ 427 (427) T protein:vir:10 375 ITEQIIDL------------EEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGL-GEKLEDEN 427 (427) T ss_pred HhcCCCCH------------HHHHHHHHhhhccccCCCCccccccccchhcCCCCCC-CCCCCCCC Confidence 44454444 44555543332221111000000 011111111110 11111111 No 119 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.01 E-value=1.9e-09 Score=68.37 Aligned_cols=407 Identities=14% Similarity=0.116 Sum_probs=174.5 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccc---cc-cCccccCceeecchH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYT---NT-DGDRKRRKMQHLPIA 76 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~---~~-~~~~~~~~~~~lnl~ 76 (527) |.--+...+.+ .|-.+.-++. .. ....-..-+.+-.++ T Consensus 1 ~~~~D~~~n~~--------------------------------------~gg~~~~~~~~~~~~~~~~~l~a~Y~~~~l~ 42 (422) T protein:vir:10 1 MVKTDSYANIF--------------------------------------LGGSDGSEIYGSLQNQAPTILASLYADNALV 42 (422) T ss_pred CccchhhHHHH--------------------------------------cCCCCCccccCcccccCHHHHHHHHHhChhh Confidence 33333333322 1111000000 00 000000012234789 Q ss_pred HHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcCCceEEEE Q lcl|NC_019418. 77 RTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQAPVFLPLQ 156 (527) Q Consensus 77 ~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a~~~~P~~ 156 (527) +.+|+..|.-++.+..+|+.+++. +.+..-+++-+++..+.+++..+-.+|++++.+-+..++.. + =|+. T Consensus 43 ~~~Vd~~aed~~r~g~~i~~~~~~--~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~------~--~Pl~ 112 (422) T protein:vir:10 43 RRIIDTIPETALAAGFHIDGIDDE--PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRAL------T--SPVR 112 (422) T ss_pred HHHHhhhhHHHhcCCccccCCCHH--HHHHHHHHHhhHHHHHHHHHHhhccccceEEEEEecCCCCc------c--cccc Confidence 999999999999988887765543 33555556668999999999999999999888776332210 0 1221 Q ss_pred EcCCceEEEE-EEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCc Q lcl|NC_019418. 157 SNTQDVSSAA-ILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDL 235 (527) Q Consensus 157 ~d~~~~~~~a-~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l 235 (527) . .+.+.++ ++.+. .... .+++.-. ....+ .....|.|. +. ....+..|. +. T Consensus 113 ~--~g~~~~l~v~d~~-~i~~--------~~~~~dp--~s~~f---g~P~~y~v~-----~~-~~~~~~~iH----~S-- 164 (422) T protein:vir:10 113 E--GAELETVRVYDRT-QVKV--------QTREENP--RNARF---GEPLTYRIT-----TN-ESDMFYDVH----YS-- 164 (422) T ss_pred c--cCceeeEEeeccc-cccc--------hhcccCc--ccccc---CcceEEEEe-----cC-CCCcceeec----cc-- Confidence 1 1222221 11110 0000 0000000 00000 000111111 00 000111111 00 Q ss_pred ccceeecCCCcccEEEecCCccccccCCCccCcchhhh-hHHHHHHHHHHHHHHHHHHH-cCcceeeec--hhHhcCCCC Q lcl|NC_019418. 236 QPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDN-AKTTIDFINRTYDEFMWEIK-MGQRRVIVP--EQMTQLKVQ 311 (527) Q Consensus 236 ~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~-~~~lid~ld~~~s~~~~e~~-~~~~~i~v~--~~~l~~~~~ 311 (527) ..+.+.|.+-|. +.+ .....||.|++.. +.+.+..++.+-.....=+. .....+.++ .+++. + T Consensus 165 -Rli~~~g~~~p~--~~~-------~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~---~ 231 (422) T protein:vir:10 165 -RIHIIDGERIPN--VMR-------RQNDGWGRSVLSSDILDSIKDYTNCERLATQLLKRKQQAVWKAKGLAELCD---D 231 (422) T ss_pred -eeEEeCCCCchh--hhc-------ccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHhcC---C Confidence 011222222111 011 1234689999986 67888888877766554443 333333332 12221 1 Q ss_pred CCCcccccccc---cccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccc-cccccccc-ch Q lcl|NC_019418. 312 DNQGNIAFKRR---FDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGM-FTFDGQGV-KT 386 (527) Q Consensus 312 ~~~~~~~~~~~---~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~-~~~~~~g~-~T 386 (527) +. +....... +...+.....+-.. +....++.++.++ .-....++....+|+..+|++..- ||...+|. .| T Consensus 232 ~~-~~~~~~~r~~~~~~~~~~~~~~~l~-~~~e~~e~~~~~l--sgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~Glnat 307 (422) T protein:vir:10 232 SE-GFGAARLRLAQVDNNSGVGQAIGID-AESEEYSVLNSDI--GGIDAFLDKKFDRIVALSGIHEIILKNKNVGGVSSS 307 (422) T ss_pred cc-chHHHHHHHHHHHHhcCCccceeEe-cCCcceEEEeccc--CChHHHHHHHHHHHHhhhCCCeeeeccCCccccccc Confidence 11 11000000 11111111111111 2223466665443 345677777888899999998764 46666664 34 Q ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcC Q lcl|NC_019418. 387 ATEIVSENSDTYQMRNSIV-ALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAG 465 (527) Q Consensus 387 Atei~s~~~~~~~~~~~~~-~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aG 465 (527) ..+-... .+..++.+| ..++.+|+.|+..|++ ..+++|.|++-...+..+.++...+...+ T Consensus 308 gd~d~~~---yyd~i~~~Qe~~l~p~l~~l~~~i~~--------------s~~~~~~f~pL~~~sekekaei~~~~a~a- 369 (422) T protein:vir:10 308 QNTALET---FHKLVDRKRNAELLPILEFLIPFIVN--------------AEEWSVEFNPLAQESSKDKAEILEKNVNS- 369 (422) T ss_pred chHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhcc--------------cCCcEEEeCCCCCCCHHHHHHHHHHHHHH- Confidence 4444333 444555555 3578888888887652 12578999988888876655543322111 Q ss_pred CCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 466 FATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 466 i~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) ..+++ ..--++.+|++++|.+.-....... +. .+.+.++..+..+...+.++. T Consensus 370 ---~~~~~-~~g~i~~~e~r~~L~~~~~~~~~~~----~~-~~~~~~~~~~~~~~~~~~~~d 422 (422) T protein:vir:10 370 ---IAALI-AAGAMDIDEARDTLRTIAPEVKIND----GS-VETEVTISETSNDPLEVPTDD 422 (422) T ss_pred ---HHHHH-hcCCCCHHHHHHHhhhhcccccCCC----CC-CccccchhhcCCCCCCCCCCC Confidence 11111 1211455566655533211111000 00 000000000000000000000 No 120 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.00 E-value=3.7e-09 Score=66.79 Aligned_cols=452 Identities=14% Similarity=0.080 Sum_probs=186.7 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhh--hc----cCccccCHHHHHHHHHHHHHhcCCC-----cccccccccCccccCc Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSI--LD----HPKVAVTQSEFRRIQHNLAYYQSKF-----DDIEYTNTDGDRKRRK 69 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~--~~----~~~i~~~~~~~~~i~~~~~~y~g~~-----~~l~~~~~~~~~~~~~ 69 (527) ++.=+++++.= +.. ..+.+... .+ .+.+. +...+.+..-.-+..+.. .|.......+-.-.-- T Consensus 17 ~~~~~~~~~~~-~~~---~~~~~~~~~~~~~~~~~~~~~--~~~~~~~a~~~g~~~~~~~~~~~~~~~~~~~~~~~l~a~ 90 (532) T protein:vir:94 17 LQQAQRVDAKR-ATH---TSLGLATAHEIDPTAYSPYER--NAAQNAMAMDYGLQTGRNGRNALSFVEATSWPGFPTLAL 90 (532) T ss_pred hhhHhhhhhhh-hhh---hhhhhhhhhhhcccccccccc--cccccccccccccCcccccccccccccccccchHHHHHH Confidence 33333333221 100 00000000 00 00000 000000000000000000 0000000000000001 Q ss_pred eeecchHHHHHHHHhhhhhcccceEeeCCH-----HHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEE Q lcl|NC_019418. 70 MQHLPIARTAAKKIASLVYNEQAEISAEDE-----TLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRV 144 (527) Q Consensus 70 ~~~lnl~~~i~~~~A~ll~~e~~~i~~~d~-----~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i 144 (527) +..-.+++.+|+..|+-++.+..+|+.+++ ...+.|+..++.-+++..+.+++..+-.+|++++.+-+++..... T Consensus 91 Y~~~~l~r~~Vd~~aed~~r~~~~i~~~~~~~~~~~~~~~i~~~~~~l~v~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~ 170 (532) T protein:vir:94 91 LAQLPEYRTMHETPADECVRAWGKITCSSKDELAADKATRITQKLEQYNVRTLVRTVVIHDQAYGGAHVFPHLKMDGDSV 170 (532) T ss_pred HHcCchhhhhhccchHHHhhCCceEeeCCccccchHHHHHHHHHHHhhhHHHHHHHHHHhhhcccceEEEEEeccCCccc Confidence 123488999999999999999999987532 344566666666678889999999999999999887776433221 Q ss_pred EEEcCCceEEEEEcCCceEEEEEEEEEE-ee-----CC-CcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecC Q lcl|NC_019418. 145 AFIQAPVFLPLQSNTQDVSSAAILTKTI-KT-----EN-RKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKST 217 (527) Q Consensus 145 ~~v~a~~~~P~~~d~~~~~~~a~~~~~~-~~-----~~-~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~ 217 (527) .+-.|-..-|.....+.......+.++. .. .+ ....|+ .- ..|.+. T Consensus 171 ~~~~p~~l~~~~I~~g~~~~l~vld~~~v~p~~~~~~dp~sp~fg-~P-------------------~~y~v~------- 223 (532) T protein:vir:94 171 PADAPLLLSPSFVQRGCLIGFATIEPMWLSPNAYNATDPTLPSFY-KP-------------------DSWIAT------- 223 (532) T ss_pred cccccccccccccccceeeEEEeechheecccccccccccccccC-Cc-------------------eeEEEc------- Confidence 1111100000000000011111111100 00 00 000000 00 011110 Q ss_pred CccccCceeecccccCCcccceeecCCCcccEEEecCCccccccC-CCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCc Q lcl|NC_019418. 218 SDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDI-NSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQ 296 (527) Q Consensus 218 ~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~-~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~ 296 (527) -|..|.-+. .+.+.|-+ .++-... ..-+|+|++..+.+.|..++.+-.....-+.... T Consensus 224 ----~g~~iH~SR-------li~f~g~~----------~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~ 282 (532) T protein:vir:94 224 ----SGKKIHSSR-------IHTVVGRP----------VGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFS 282 (532) T ss_pred ----cCeeeccce-------EEEecCCC----------chhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 011111000 11111111 1111100 1226999999999999999987766654333323 Q ss_pred ceeeechhHhcCCCCCCCcccccccccc---cccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_019418. 297 RRVIVPEQMTQLKVQDNQGNIAFKRRFD---VEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVS 373 (527) Q Consensus 297 ~~i~v~~~~l~~~~~~~~~~~~~~~~~d---~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s 373 (527) ..++.. .+-..... ++.......+. ..+..+..+ .-+++...++.++.+ ..-....++....+|+..+|++ T Consensus 283 ~~v~k~-~~a~~ls~--~~~~~~~~r~~~~~~~~~n~g~~-~id~~~e~~e~~~~~--lsgl~~~l~~~~~~iAaa~~IP 356 (532) T protein:vir:94 283 MTNLAT-DMAQLLAP--GGAQSLDARLQLFNLYRDNRNIG-ALDKGTEEIQQTNTP--LSGLDSLQAQSQEQMAAVSHIP 356 (532) T ss_pred Cceeee-chHHhhcc--hhHHHHHHHHHHHHhhcCCccce-EEcCCCceeEEEecc--cCCHHHHHHHHHHHHHhHhCCC Confidence 333221 11111111 11110001111 001111011 111222346655543 3345667777788888899998 Q ss_pred ccc-cccccccc-chHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCC Q lcl|NC_019418. 374 SGM-FTFDGQGV-KTATEIVSENSDTYQMRNSIV-ALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTD 450 (527) Q Consensus 374 ~~~-~~~~~~g~-~TAtei~s~~~~~~~~~~~~~-~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d 450 (527) ..- ||...+|. .|+.+=+ +..+..++.+| ..++..|+.|+..|+... + + .. ..+++|.|++-...+ T Consensus 357 ~t~LfG~sp~GlnstGe~D~---~~yyd~I~s~Qe~~l~p~le~l~~~l~~s~----~-g-~~--~~d~~~~f~pL~~~s 425 (532) T protein:vir:94 357 LVKLLGITPNGLNASSDGEI---RVWYDFIAGYQATNLTPLMEWIIDLIQLSE----Y-G-QI--DPGLAWEWSPLMELD 425 (532) T ss_pred eeeeecCCcccccccchHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----c-C-CC--CCCceEEeCCCCCCC Confidence 764 67666664 4444332 33455555555 446788888888776432 1 1 22 235889999877777 Q ss_pred HHHHHH-------HHHHHHhcCCCCHHHHHHhcC-----CCCHH----HHHHHHHHHHHhcccccccccCCCCCCCCCCC Q lcl|NC_019418. 451 RHAELD-------YWMKMVAAGFATQKRGIAKTL-----GITEE----EAEKELAEINGELPPESDAELALYGKGQQNTV 514 (527) Q Consensus 451 ~~~~~~-------~~~~~~~aGi~s~~~~i~~~~-----~~~de----ea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~ 514 (527) ..+.++ .+.+++.+|++|..++..++- ++... +--++.+.+.++........+. .+...+++. T Consensus 426 ~kEkAei~~~~a~a~~~~~~~Gvi~~~Evr~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 504 (532) T protein:vir:94 426 DKELAEVRQLNASTDSTLMELGVIDAKMVQQRLAADPTSGYAGALGERDELDDVEEIAKQLMAAALNPPA-TAPQTPNPQ 504 (532) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhcCCccccccccccccccccccchhhhhcccccCCCC-CCCCCCCCC Confidence 665544 346778889999988765431 11111 0001111222222211111111 111111111 Q ss_pred CCCCCCCCccccC Q lcl|NC_019418. 515 GNSKDTVDDEDEA 527 (527) Q Consensus 515 ~~~~~~~~~~~~~ 527 (527) . +..+|+-++ T Consensus 505 ~---~~~~d~~~~ 514 (532) T protein:vir:94 505 P---DSEDDQTDN 514 (532) T ss_pred C---CCCCCCCCC Confidence 1 111111111 No 121 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=98.94 E-value=1.1e-08 Score=64.11 Aligned_cols=481 Identities=10% Similarity=0.048 Sum_probs=196.3 Q ss_pred hHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccc-cccccCc--cccCceeecchHHHH Q lcl|NC_019418. 3 LIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIE-YTNTDGD--RKRRKMQHLPIARTA 79 (527) Q Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~-~~~~~~~--~~~~~~~~lnl~~~i 79 (527) |-+ .+..+.+...-+ .+..++......|+.+|.---|-.. +....+. .....++-=+.+... T Consensus 1 M~~--------------~~~~~~l~~r~~-~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a 65 (555) T protein:vir:10 1 MAE--------------QTERKLLLSRWG-QLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRA 65 (555) T ss_pred CCC--------------cccHHHHHHHHH-HHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHH Confidence 111 111111110000 2233444556667776654323211 1111111 111122333667777 Q ss_pred HHHHhhhhhccc--c-----eEeeCCH------HHHH-------HHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC Q lcl|NC_019418. 80 AKKIASLVYNEQ--A-----EISAEDE------TLND-------FLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG 139 (527) Q Consensus 80 ~~~~A~ll~~e~--~-----~i~~~d~------~~~~-------~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~ 139 (527) ++.+|+-|.+-. | ++.+.+. ..++ .+...|..++|...+.++..+....|.+.+..-.|. T Consensus 66 ~~~LAa~L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~ 145 (555) T protein:vir:10 66 LRVLAAGMMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDF 145 (555) T ss_pred HHHHHHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCC Confidence 777777555431 1 2333322 2233 445678889999999999999999999998765664 Q ss_pred -CeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeeccc---ccccce-eeec-CCceEEEEEE Q lcl|NC_019418. 140 -DKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTP---TGQEVG-STKD-KSLYRITNEL 213 (527) Q Consensus 140 -~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~---~~~~~~-~~~~-~~~~~I~n~l 213 (527) +.+++..++..+++- ..|..|++..++....... ..- .+ +|+.. ...... .... +....|-|.+ T Consensus 146 ~~~~rf~~~pl~~~~v-~~d~~G~vd~i~r~~~~t~-~ql------~~--~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V 215 (555) T protein:vir:10 146 DAVVYHHSLTAGEYAI-AADNQGRVNTLYREFQITV-AQM------VR--EFGKDKCSTTVQSLFDRGALEQWVTVIHAI 215 (555) T ss_pred CceEEEEEeecceeEE-eeCCCCCEEEEEEEEeccH-HHH------HH--hcCcccCCHHHHHHHhcCCCCceEEEEEEE Confidence 456788889988884 5677777655542111000 000 00 00000 000000 0000 0112333333 Q ss_pred EecCCcc--cc-Cceeecccc-cCC-cccc--eeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHH Q lcl|NC_019418. 214 YKSTSDS--QL-GERVNLSEL-YPD-LQPV--TPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYD 286 (527) Q Consensus 214 y~~~~~~--~l-G~~v~l~~~-~~~-l~~~--~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s 286 (527) |-..+.+ .. ++-.|..+. |+. .... ....|+..-+|..++- +...++.||+|-...+.+-+..|+..-- T Consensus 216 ~pr~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw----~~~~ge~YGrgp~~~~lgD~k~L~~l~~ 291 (555) T protein:vir:10 216 EPRADRDPSKRDDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRW----ALVGGDIYGNSPAMEALGDVRQLQHEQL 291 (555) T ss_pred eeccCcCcCCCCccccceEEEEEEeccCCccccccCCcccCCceeeee----eecCCCccccchHHHHHHHHHHHHHHHH Confidence 3221111 00 111222221 111 0111 1112332223333321 2234678999999999999999998555 Q ss_pred HHHHHHH-cCcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHH Q lcl|NC_019418. 287 EFMWEIK-MGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKL 365 (527) Q Consensus 287 ~~~~e~~-~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~ 365 (527) ....-.. ..++.+.||++......+.. +...-|.. .+.++....-.+++......-.+.++.+... T Consensus 292 ~~l~~~~~~~~pp~~v~~~~~~~~~~~~-----------pgg~~~v~--~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~r 358 (555) T protein:vir:10 292 RKAQAIDYKSNPPLQLPVSAKNQDISTV-----------PGGLSYVD--AAAPNGGIRTAFEVNLDLSHLLADIVDVRER 358 (555) T ss_pred HHHHHHHHHhcCceeeccccccccceec-----------cccccccc--cCCCCcceecccccccchHHHHHHHHHHHHH Confidence 5554444 35555556555421111111 11111111 1111111111122222223333445544444 Q ss_pred HHHhcCCCc--ccccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhcccCCccc--CccceE Q lcl|NC_019418. 366 FEMQIGVSS--GMFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKVVGIYRGTIP--ELDDIS 440 (527) Q Consensus 366 i~~~~g~s~--~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~~~~~~~~~~--~~~~v~ 440 (527) |.... +.. ..++...+...|||||....+...+..+-.-..+ ...|.-|+..++.+..-.+.....+. ...++. T Consensus 359 I~~af-~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~ 437 (555) T protein:vir:10 359 IKASF-YADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLN 437 (555) T ss_pred HHHHh-hcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeE Confidence 43322 111 1233344566899999999888888777644444 45666677666665433222211111 123366 Q ss_pred EEeCCCccCCHHHH----HHHHHHHH--hcCC-------CCHHHH---HHhcCCC------CHHHHHHHHHH-HHHhccc Q lcl|NC_019418. 441 VNLDDGVFTDRHAE----LDYWMKMV--AAGF-------ATQKRG---IAKTLGI------TEEEAEKELAE-INGELPP 497 (527) Q Consensus 441 v~f~d~i~~d~~~~----~~~~~~~~--~aGi-------~s~~~~---i~~~~~~------~deea~~el~r-i~~E~~~ 497 (527) |++--++....... +.+..+.+ .+++ +....+ +....|+ |++|++++.++ .+.++.. T Consensus 438 v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~ 517 (555) T protein:vir:10 438 VEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAA 517 (555) T ss_pred EEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHH Confidence 66644442221110 11111111 1222 222332 2334454 44555443322 1111111 Q ss_pred c----cccccC---CCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 498 E----SDAELA---LYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 498 ~----~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) + ..+... .-++.+.+.++ ...+.=..+-+ T Consensus 518 ~~a~~~~q~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 553 (555) T protein:vir:10 518 QQAALLNQGADTAAKLGSVDTSKQN-ALTDVTRAFSG 553 (555) T ss_pred HHHHHHHHHHHHHHHhcccccCcch-hHHHHHhhhcc Confidence 0 000000 00111111000 00000000111 No 122 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=98.94 E-value=1.1e-08 Score=64.11 Aligned_cols=481 Identities=10% Similarity=0.048 Sum_probs=196.3 Q ss_pred hHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccc-cccccCc--cccCceeecchHHHH Q lcl|NC_019418. 3 LIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIE-YTNTDGD--RKRRKMQHLPIARTA 79 (527) Q Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~-~~~~~~~--~~~~~~~~lnl~~~i 79 (527) |-+ .+..+.+...-+ .+..++......|+.+|.---|-.. +....+. .....++-=+.+... T Consensus 1 M~~--------------~~~~~~l~~r~~-~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a 65 (555) T protein:vir:98 1 MAE--------------QTERKLLLSRWG-QLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRA 65 (555) T ss_pred CCC--------------cccHHHHHHHHH-HHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHH Confidence 111 111111110000 2233444556667776654323211 1111111 111122333667777 Q ss_pred HHHHhhhhhccc--c-----eEeeCCH------HHHH-------HHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC Q lcl|NC_019418. 80 AKKIASLVYNEQ--A-----EISAEDE------TLND-------FLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG 139 (527) Q Consensus 80 ~~~~A~ll~~e~--~-----~i~~~d~------~~~~-------~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~ 139 (527) ++.+|+-|.+-. | ++.+.+. ..++ .+...|..++|...+.++..+....|.+.+..-.|. T Consensus 66 ~~~LAa~L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~ 145 (555) T protein:vir:98 66 LRVLAAGMMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDF 145 (555) T ss_pred HHHHHHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCC Confidence 777777555431 1 2333322 2233 445678889999999999999999999998765664 Q ss_pred -CeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeeccc---ccccce-eeec-CCceEEEEEE Q lcl|NC_019418. 140 -DKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTP---TGQEVG-STKD-KSLYRITNEL 213 (527) Q Consensus 140 -~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~---~~~~~~-~~~~-~~~~~I~n~l 213 (527) +.+++..++..+++- ..|..|++..++....... ..- .+ +|+.. ...... .... +....|-|.+ T Consensus 146 ~~~~rf~~~pl~~~~v-~~d~~G~vd~i~r~~~~t~-~ql------~~--~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V 215 (555) T protein:vir:98 146 DAVVYHHSLTAGEYAI-AADNQGRVNTLYREFQITV-AQM------VR--EFGKDKCSTTVQSLFDRGALEQWVTVIHAI 215 (555) T ss_pred CceEEEEEeecceeEE-eeCCCCCEEEEEEEEeccH-HHH------HH--hcCcccCCHHHHHHHhcCCCCceEEEEEEE Confidence 456788889988884 5677777655542111000 000 00 00000 000000 0000 0112333333 Q ss_pred EecCCcc--cc-Cceeecccc-cCC-cccc--eeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHH Q lcl|NC_019418. 214 YKSTSDS--QL-GERVNLSEL-YPD-LQPV--TPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYD 286 (527) Q Consensus 214 y~~~~~~--~l-G~~v~l~~~-~~~-l~~~--~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s 286 (527) |-..+.+ .. ++-.|..+. |+. .... ....|+..-+|..++- +...++.||+|-...+.+-+..|+..-- T Consensus 216 ~pr~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw----~~~~ge~YGrgp~~~~lgD~k~L~~l~~ 291 (555) T protein:vir:98 216 EPRADRDPSKRDDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRW----ALVGGDIYGNSPAMEALGDVRQLQHEQL 291 (555) T ss_pred eeccCcCcCCCCccccceEEEEEEeccCCccccccCCcccCCceeeee----eecCCCccccchHHHHHHHHHHHHHHHH Confidence 3221111 00 111222221 111 0111 1112332223333321 2234678999999999999999998555 Q ss_pred HHHHHHH-cCcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHH Q lcl|NC_019418. 287 EFMWEIK-MGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKL 365 (527) Q Consensus 287 ~~~~e~~-~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~ 365 (527) ....-.. ..++.+.||++......+.. +...-|.. .+.++....-.+++......-.+.++.+... T Consensus 292 ~~l~~~~~~~~pp~~v~~~~~~~~~~~~-----------pgg~~~v~--~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~r 358 (555) T protein:vir:98 292 RKAQAIDYKSNPPLQLPVSAKNQDISTV-----------PGGLSYVD--AAAPNGGIRTAFEVNLDLSHLLADIVDVRER 358 (555) T ss_pred HHHHHHHHHhcCceeeccccccccceec-----------cccccccc--cCCCCcceecccccccchHHHHHHHHHHHHH Confidence 5554444 35555556555421111111 11111111 1111111111122222223333445544444 Q ss_pred HHHhcCCCc--ccccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhcccCCccc--CccceE Q lcl|NC_019418. 366 FEMQIGVSS--GMFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKVVGIYRGTIP--ELDDIS 440 (527) Q Consensus 366 i~~~~g~s~--~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~~~~~~~~~~--~~~~v~ 440 (527) |.... +.. ..++...+...|||||....+...+..+-.-..+ ...|.-|+..++.+..-.+.....+. ...++. T Consensus 359 I~~af-~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~ 437 (555) T protein:vir:98 359 IKASF-YADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLN 437 (555) T ss_pred HHHHh-hcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeE Confidence 43322 111 1233344566899999999888888777644444 45666677666665433222211111 123366 Q ss_pred EEeCCCccCCHHHH----HHHHHHHH--hcCC-------CCHHHH---HHhcCCC------CHHHHHHHHHH-HHHhccc Q lcl|NC_019418. 441 VNLDDGVFTDRHAE----LDYWMKMV--AAGF-------ATQKRG---IAKTLGI------TEEEAEKELAE-INGELPP 497 (527) Q Consensus 441 v~f~d~i~~d~~~~----~~~~~~~~--~aGi-------~s~~~~---i~~~~~~------~deea~~el~r-i~~E~~~ 497 (527) |++--++....... +.+..+.+ .+++ +....+ +....|+ |++|++++.++ .+.++.. T Consensus 438 v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~ 517 (555) T protein:vir:98 438 VEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAA 517 (555) T ss_pred EEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHH Confidence 66644442221110 11111111 1222 222332 2334454 44555443322 1111111 Q ss_pred c----cccccC---CCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 498 E----SDAELA---LYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 498 ~----~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) + ..+... .-++.+.+.++ ...+.=..+-+ T Consensus 518 ~~a~~~~q~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 553 (555) T protein:vir:98 518 QQAALLNQGADTAAKLGSVDTSKQN-ALTDVTRAFSG 553 (555) T ss_pred HHHHHHHHHHHHHHHhcccccCcch-hHHHHHhhhcc Confidence 0 000000 00111111000 00000000111 No 123 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=98.94 E-value=1.1e-08 Score=64.11 Aligned_cols=481 Identities=10% Similarity=0.048 Sum_probs=196.3 Q ss_pred hHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccc-cccccCc--cccCceeecchHHHH Q lcl|NC_019418. 3 LIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIE-YTNTDGD--RKRRKMQHLPIARTA 79 (527) Q Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~-~~~~~~~--~~~~~~~~lnl~~~i 79 (527) |-+ .+..+.+...-+ .+..++......|+.+|.---|-.. +....+. .....++-=+.+... T Consensus 1 M~~--------------~~~~~~l~~r~~-~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a 65 (555) T protein:vir:10 1 MAE--------------QTERKLLLSRWG-QLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRA 65 (555) T ss_pred CCC--------------cccHHHHHHHHH-HHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHH Confidence 111 111111110000 2233444556667776654323211 1111111 111122333667777 Q ss_pred HHHHhhhhhccc--c-----eEeeCCH------HHHH-------HHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC Q lcl|NC_019418. 80 AKKIASLVYNEQ--A-----EISAEDE------TLND-------FLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG 139 (527) Q Consensus 80 ~~~~A~ll~~e~--~-----~i~~~d~------~~~~-------~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~ 139 (527) ++.+|+-|.+-. | ++.+.+. ..++ .+...|..++|...+.++..+....|.+.+..-.|. T Consensus 66 ~~~LAa~L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~ 145 (555) T protein:vir:10 66 LRVLAAGMMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDF 145 (555) T ss_pred HHHHHHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCC Confidence 777777555431 1 2333322 2233 445678889999999999999999999998765664 Q ss_pred -CeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeeccc---ccccce-eeec-CCceEEEEEE Q lcl|NC_019418. 140 -DKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTP---TGQEVG-STKD-KSLYRITNEL 213 (527) Q Consensus 140 -~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~---~~~~~~-~~~~-~~~~~I~n~l 213 (527) +.+++..++..+++- ..|..|++..++....... ..- .+ +|+.. ...... .... +....|-|.+ T Consensus 146 ~~~~rf~~~pl~~~~v-~~d~~G~vd~i~r~~~~t~-~ql------~~--~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V 215 (555) T protein:vir:10 146 DAVVYHHSLTAGEYAI-AADNQGRVNTLYREFQITV-AQM------VR--EFGKDKCSTTVQSLFDRGALEQWVTVIHAI 215 (555) T ss_pred CceEEEEEeecceeEE-eeCCCCCEEEEEEEEeccH-HHH------HH--hcCcccCCHHHHHHHhcCCCCceEEEEEEE Confidence 456788889988884 5677777655542111000 000 00 00000 000000 0000 0112333333 Q ss_pred EecCCcc--cc-Cceeecccc-cCC-cccc--eeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHH Q lcl|NC_019418. 214 YKSTSDS--QL-GERVNLSEL-YPD-LQPV--TPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYD 286 (527) Q Consensus 214 y~~~~~~--~l-G~~v~l~~~-~~~-l~~~--~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s 286 (527) |-..+.+ .. ++-.|..+. |+. .... ....|+..-+|..++- +...++.||+|-...+.+-+..|+..-- T Consensus 216 ~pr~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw----~~~~ge~YGrgp~~~~lgD~k~L~~l~~ 291 (555) T protein:vir:10 216 EPRADRDPSKRDDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRW----ALVGGDIYGNSPAMEALGDVRQLQHEQL 291 (555) T ss_pred eeccCcCcCCCCccccceEEEEEEeccCCccccccCCcccCCceeeee----eecCCCccccchHHHHHHHHHHHHHHHH Confidence 3221111 00 111222221 111 0111 1112332223333321 2234678999999999999999998555 Q ss_pred HHHHHHH-cCcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHH Q lcl|NC_019418. 287 EFMWEIK-MGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKL 365 (527) Q Consensus 287 ~~~~e~~-~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~ 365 (527) ....-.. ..++.+.||++......+.. +...-|.. .+.++....-.+++......-.+.++.+... T Consensus 292 ~~l~~~~~~~~pp~~v~~~~~~~~~~~~-----------pgg~~~v~--~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~r 358 (555) T protein:vir:10 292 RKAQAIDYKSNPPLQLPVSAKNQDISTV-----------PGGLSYVD--AAAPNGGIRTAFEVNLDLSHLLADIVDVRER 358 (555) T ss_pred HHHHHHHHHhcCceeeccccccccceec-----------cccccccc--cCCCCcceecccccccchHHHHHHHHHHHHH Confidence 5554444 35555556555421111111 11111111 1111111111122222223333445544444 Q ss_pred HHHhcCCCc--ccccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhcccCCccc--CccceE Q lcl|NC_019418. 366 FEMQIGVSS--GMFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKVVGIYRGTIP--ELDDIS 440 (527) Q Consensus 366 i~~~~g~s~--~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~~~~~~~~~~--~~~~v~ 440 (527) |.... +.. ..++...+...|||||....+...+..+-.-..+ ...|.-|+..++.+..-.+.....+. ...++. T Consensus 359 I~~af-~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~ 437 (555) T protein:vir:10 359 IKASF-YADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLN 437 (555) T ss_pred HHHHh-hcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeE Confidence 43322 111 1233344566899999999888888777644444 45666677666665433222211111 123366 Q ss_pred EEeCCCccCCHHHH----HHHHHHHH--hcCC-------CCHHHH---HHhcCCC------CHHHHHHHHHH-HHHhccc Q lcl|NC_019418. 441 VNLDDGVFTDRHAE----LDYWMKMV--AAGF-------ATQKRG---IAKTLGI------TEEEAEKELAE-INGELPP 497 (527) Q Consensus 441 v~f~d~i~~d~~~~----~~~~~~~~--~aGi-------~s~~~~---i~~~~~~------~deea~~el~r-i~~E~~~ 497 (527) |++--++....... +.+..+.+ .+++ +....+ +....|+ |++|++++.++ .+.++.. T Consensus 438 v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~ 517 (555) T protein:vir:10 438 VEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAA 517 (555) T ss_pred EEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHH Confidence 66644442221110 11111111 1222 222332 2334454 44555443322 1111111 Q ss_pred c----cccccC---CCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 498 E----SDAELA---LYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 498 ~----~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) + ..+... .-++.+.+.++ ...+.=..+-+ T Consensus 518 ~~a~~~~q~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 553 (555) T protein:vir:10 518 QQAALLNQGADTAAKLGSVDTSKQN-ALTDVTRAFSG 553 (555) T ss_pred HHHHHHHHHHHHHHHhcccccCcch-hHHHHHhhhcc Confidence 0 000000 00111111000 00000000111 No 124 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=98.92 E-value=1.4e-08 Score=63.68 Aligned_cols=458 Identities=9% Similarity=0.010 Sum_probs=198.9 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccc-cc---ccCc--cccCceeecc Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEY-TN---TDGD--RKRRKMQHLP 74 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~-~~---~~~~--~~~~~~~~ln 74 (527) |.- ++|++-+.+ +..++......|+.+|.---|-+.. .. ..+. .+...++--+ T Consensus 1 ~~~-~~l~~r~~~--------------------l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i~ds 59 (547) T protein:vir:10 1 MEN-SKIVKRLDF--------------------LKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDS 59 (547) T ss_pred CCH-HHHHHHHHH--------------------HHHHhhHHHHHHHHHHHHhcccccccccCCCCCcccccccccccccc Confidence 222 112221111 2333445567787776543332211 11 1111 1112222236 Q ss_pred hHHHHHHHHhhhhhcc--cc-----eEeeCCH------HHHH-------HHHHHHhhhhHHHHHHHHHHHHHhcCCEEEE Q lcl|NC_019418. 75 IARTAAKKIASLVYNE--QA-----EISAEDE------TLND-------FLSDMLSNDRFNKNFERYLESALALGGLAMR 134 (527) Q Consensus 75 l~~~i~~~~A~ll~~e--~~-----~i~~~d~------~~~~-------~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~ 134 (527) .+...|+.+|+-|.+- || ++.+.|. ..++ .+...|..++|...+.++..+..+.|++.+. T Consensus 60 t~~~a~~~Las~L~~~ltPp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~ 139 (547) T protein:vir:10 60 TAGDGLETLSSSLHGSLTSPATKWFELAFRDKELNSDDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMV 139 (547) T ss_pred hHHHHHHHHHHHHHHhhcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEE Confidence 6777777777755443 11 1233222 2333 4456788899999999999999999999888 Q ss_pred EEEeC---CeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEe------e--------------CCCcceEEEEEEE-Eee Q lcl|NC_019418. 135 PYVDG---DKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIK------T--------------ENRKNVYYTLVEF-HEW 190 (527) Q Consensus 135 ~~~d~---~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~------~--------------~~~~~~~yt~lE~-h~~ 190 (527) +-.|+ +.+++..++..+++- ..|..|++..++-..... . ..+...+...+|. |-. T Consensus 140 ~~~d~~~~~~~r~~~~pl~~~~v-~~d~~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v 218 (547) T protein:vir:10 140 EEEDEDEEGSVVFQSSPIQDSYF-EEDSRGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCV 218 (547) T ss_pred eccCCCCCCceeEEEeecceEEE-eeCCCcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEE Confidence 76664 467888999998884 556667665554211110 0 0001011111111 110 Q ss_pred cccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCccc---ceeecCCCcccEEEecCCccccccCCCccC Q lcl|NC_019418. 191 VTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQP---VTPIQGLSRPLFTYLKTPGMNNKDINSPLG 267 (527) Q Consensus 191 ~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~---~~~~~g~~~p~f~~~~~~~~N~~~~~splG 267 (527) . ..............+...|. |...+|-.... .....|+..-+|..++- +...++.|| T Consensus 219 ~-~~~~~~~~~~~~~~~~~~~~--------------p~~s~~~e~~~~~~~l~esg~~e~P~~~~Rw----~~~~ge~YG 279 (547) T protein:vir:10 219 F-TRYDKKQNRNAGTVLAPTER--------------PFGKKWILKEGAVQLGEEGGYYEMPAYAIRW----RKSAGSQWG 279 (547) T ss_pred e-eccCCCCCccccceeecccc--------------ceeEEEEEecCceeeeecCCcccCCeeeeee----eecCCcccc Confidence 0 00000000000000111111 22222111010 01112222223333321 223467899 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHc-CcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEe Q lcl|NC_019418. 268 LSIFDNAKTTIDFINRTYDEFMWEIKM-GQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDL 346 (527) Q Consensus 268 ~S~~~~~~~lid~ld~~~s~~~~e~~~-~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~ 346 (527) +|-...+.+-++.|+..--..+...+. .++.+.||++-+....+...| -.++. ++...++-+ T Consensus 280 rgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~~~~pg-----------g~~~~------~~~~~v~pl 342 (547) T protein:vir:10 280 FGPSHLALPDVLTANRYVELVLRSSEKVIDPAIMVTERGLISDIDLGAS-----------GLTVV------RDMESMKPF 342 (547) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceecccccccccceecCC-----------eeeec------CCcccceee Confidence 999999999999999988877776654 555566654432111111111 11111 112233333 Q ss_pred ccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhh Q lcl|NC_019418. 347 TTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKV 425 (527) Q Consensus 347 ~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~ 425 (527) +...+...-...++.+-..|... |=...|....+...|||||....+...+..+-.-..+ ...|..|+..++.+..- T Consensus 343 ~~~~~~~~~~~~i~~~~~rI~~a--f~~d~~~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r 420 (547) T protein:vir:10 343 ESRARFDVSSIQLTDLRSAVRRI--YYVDQLQMKDSPAMTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRFR 420 (547) T ss_pred ecccchHHHHHHHHHHHHHHHHH--hhhhhhhcCCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh Confidence 33222232234444443333321 1111222234566899999999998888877655555 34566676666655433 Q ss_pred hcccCCccc-----CccceEEEeCCCccCCHH----HHHHHHHHHHh--cCC-------CCHHHHHH---hcCCC----- Q lcl|NC_019418. 426 VGIYRGTIP-----ELDDISVNLDDGVFTDRH----AELDYWMKMVA--AGF-------ATQKRGIA---KTLGI----- 479 (527) Q Consensus 426 ~~~~~~~~~-----~~~~v~v~f~d~i~~d~~----~~~~~~~~~~~--aGi-------~s~~~~i~---~~~~~----- 479 (527) .+.....+. ...++.|++-..+..... +.+.+.++.+. +++ +....++. ...|+ T Consensus 421 ~g~lP~~p~~l~~~~~~~~~v~~is~Laraq~~~~~~~i~~~~~~v~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~i 500 (547) T protein:vir:10 421 AGKLGELPSKLLESGKAAMDIVYTGPLSRAQKIDQAASIERWAGSTAQLAEINPEVLDIPDWDEMVRMLGSLLGAPQTLM 500 (547) T ss_pred cCCCCCCchhhhccCcceEEEEeccHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhhcCCHHHHHHHHHHHhCCChhcc Confidence 222221111 123456666544333211 11112222221 122 23333332 33454 Q ss_pred -CHHHHHHHHHHHHH-hc----cc-ccccccCCCCCC-CCCCCCCCCCCC Q lcl|NC_019418. 480 -TEEEAEKELAEING-EL----PP-ESDAELALYGKG-QQNTVGNSKDTV 521 (527) Q Consensus 480 -~deea~~el~ri~~-E~----~~-~~~~~~~~~~~~-~~~~~~~~~~~~ 521 (527) |++|++++.++.++ ++ ++ .......+..-+ .... -..|. T Consensus 501 rs~eev~~~r~qr~~~~q~~~qaa~~~~~g~~m~~~~~~~a~---~~~~~ 547 (547) T protein:vir:10 501 RPKAKVTSIRKNRSQTQQKAEQAAIAEAEGNAMEAQGKGQAA---LKENQ 547 (547) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccc---hhccC Confidence 45555544332111 11 10 000111111000 0000 00111 No 125 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=98.91 E-value=1.6e-08 Score=63.27 Aligned_cols=450 Identities=10% Similarity=0.053 Sum_probs=190.4 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) |-- -..+- .+.+++... .+..++......|+.+|.---|-+-.....+....+.+.-=+.+...+ T Consensus 1 ~~~---~~~~~--------~~~~~~r~~----~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~ 65 (522) T protein:vir:94 1 MAE---REGFA--------AEGAKAVYD----RLKNGRQPYETRAQNCAAVTIPSLFPKESDNSSTEYTTPWQAVGARCL 65 (522) T ss_pred Ccc---cchhh--------HHHHHHHHH----HHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHH Confidence 211 11110 011111110 122333445666777765433332222222222222333336677778 Q ss_pred HHHhhhhhcc--c--ceEe--eCC-------------HHHHHH-------HHHHHhhhhHHHHHHHHHHHHHhcCCEEEE Q lcl|NC_019418. 81 KKIASLVYNE--Q--AEIS--AED-------------ETLNDF-------LSDMLSNDRFNKNFERYLESALALGGLAMR 134 (527) Q Consensus 81 ~~~A~ll~~e--~--~~i~--~~d-------------~~~~~~-------l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~ 134 (527) +.+|+-|.+- | |=|. +.+ ....++ +...|..++|...+.++..+....|.+.+. T Consensus 66 ~~Las~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~ 145 (522) T protein:vir:94 66 NNLAAKLMLALFPQSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLY 145 (522) T ss_pred HHHHHHHHhhcCCCCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEe Confidence 8877755443 2 2122 111 112333 345677789999999999999999998875 Q ss_pred EEEeC-C-eeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeC-----------------CCcceEEEEEEEEeeccccc Q lcl|NC_019418. 135 PYVDG-D-KIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTE-----------------NRKNVYYTLVEFHEWVTPTG 195 (527) Q Consensus 135 ~~~d~-~-~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~-----------------~~~~~~yt~lE~h~~~~~~~ 195 (527) +--+. + ...+.+++-.+++ +..|..|++..++....+... .+...+||.++.+ T Consensus 146 ~~~~~~~~~~~~~~~pl~~y~-v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~v~~~------- 217 (522) T protein:vir:94 146 IPEPEQGTYSPMRMYRLVSYV-VQRDAFGNILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVYTHIYRQ------- 217 (522) T ss_pred eeccCCCceeeEEEEEcceEE-EeeCCCcCeEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEEEEEEee------- Confidence 43232 2 2357778877766 456777777766643322110 0011122222211 Q ss_pred ccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhH Q lcl|NC_019418. 196 QEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAK 275 (527) Q Consensus 196 ~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~ 275 (527) .+.+. .|..-+ |..++.++- . .|...-+|..++- +...++.||+|-...+. T Consensus 218 --------~~~~~----~~~~~~----g~~~~~~~~------~---~~~~e~P~~~~Rw----~~~~ge~YGrgp~~~~l 268 (522) T protein:vir:94 218 --------DDEYL----RYEEVE----GIEVTGTDG------S---YPLTACPYIPVRM----VRLDGEDYGRSYCEEYL 268 (522) T ss_pred --------CCcee----EEeecc----CceecccCC------C---CccccCCceeeee----eecCCCccccchHHHHH Confidence 11111 111111 222222110 0 1111122222221 12346789999999999 Q ss_pred HHHHHHHHHHHHHHHHH-HcCcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEec--cccCh Q lcl|NC_019418. 276 TTIDFINRTYDEFMWEI-KMGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLT--TPIRS 352 (527) Q Consensus 276 ~lid~ld~~~s~~~~e~-~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~--~~ir~ 352 (527) +-++.|+..--....-. ...++.+.||++.+....+...+ ....+.. + ....++.++ ..-+. T Consensus 269 ~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~~~~~----------~~g~~v~---g--~~~~v~~~~~~~~~~~ 333 (522) T protein:vir:94 269 GDLNSLETITEAITKMAKVASKVVGLVNPNGITQPRRLNKA----------ATGEFVA---G--RVEDINFLQLTKGQDF 333 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCceeecccccccchheecc----------CCceeec---C--Ccccceeeecccccch Confidence 99999998776666554 44677777765543221111000 0001111 1 111222222 11122 Q ss_pred HHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhcccCC Q lcl|NC_019418. 353 SDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKVVGIYRG 431 (527) Q Consensus 353 e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~~~~~~~ 431 (527) ..-...++.+...|....-+. .++...+...|||||....+...+...-.-..+ ...|.-|+..++.+..-.++... T Consensus 334 ~~~~~~i~~~~~rI~~af~~~--~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~ 411 (522) T protein:vir:94 334 TIAKSVADAIEQRLGWAFLLN--SAVQRNAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGMIPD 411 (522) T ss_pred hHHHHHHHHHHHHHHHHHhhh--hhccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC Confidence 223344444444443332222 223334456899999999988887776644444 34556666666655432222222 Q ss_pred cccCccceEEEeCCCccCCH-HHHHHHHHHHHh--cCC--------CCHHHHH---HhcCCC-------CHHHHHHHHHH Q lcl|NC_019418. 432 TIPELDDISVNLDDGVFTDR-HAELDYWMKMVA--AGF--------ATQKRGI---AKTLGI-------TEEEAEKELAE 490 (527) Q Consensus 432 ~~~~~~~v~v~f~d~i~~d~-~~~~~~~~~~~~--aGi--------~s~~~~i---~~~~~~-------~deea~~el~r 490 (527) . +...+.+++--++..-. ...++...+..+ +++ +....++ ....|+ +++|+++..++ T Consensus 412 ~--p~~~v~v~~~s~La~~qr~~~~~~l~~~~~~ia~l~P~~~~~~id~d~~~~~~a~~~Gv~~~~ivr~~ee~~~~~~q 489 (522) T protein:vir:94 412 L--PKEAVEPTVSTGLEALGRGQDLEKLTQAVNMMTGLQPLSQDPDINLPTLKLRLLNALGIDTAGLLLTQDEKIQRMAE 489 (522) T ss_pred C--CcccEEeeEecHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHH Confidence 1 22235565543332111 111111111111 111 1222222 223455 24455444443 Q ss_pred HHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 491 INGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 491 i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) .+..++..... ...++.-.... .+.-.+|-+ T Consensus 490 ~~~~~~~~~~~----~~~~~~~~a~~--~~~~~~~~~ 520 (522) T protein:vir:94 490 QSSQQAVVQGA----SAAGANMGAAV--GQGAGEDMA 520 (522) T ss_pred HHHHHHHHHHH----HHHHHHhhhhh--hcccchhhh Confidence 22211111000 00000000000 111111111 No 126 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=98.88 E-value=2.1e-08 Score=62.71 Aligned_cols=467 Identities=10% Similarity=0.065 Sum_probs=203.0 Q ss_pred hHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccc-cccccCc--cccCceeecchHHHH Q lcl|NC_019418. 3 LIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIE-YTNTDGD--RKRRKMQHLPIARTA 79 (527) Q Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~-~~~~~~~--~~~~~~~~lnl~~~i 79 (527) |=+..++.++|.... +..++......|+.+|.---|-.. +...... .+...++--+.+... T Consensus 1 m~~~~~~~l~~r~~~----------------l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a 64 (556) T protein:vir:73 1 MAETEKERLLKQLAQ----------------LKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDDRRNTKIVDPTGSMA 64 (556) T ss_pred CChhhHHHHHHHHHH----------------HHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcchhhcCccccchHHHH Confidence 333444444443221 122344456667776654322211 1111100 111122333567777 Q ss_pred HHHHhhhhhccc--c-----eEeeCCH------HH-------HHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC Q lcl|NC_019418. 80 AKKIASLVYNEQ--A-----EISAEDE------TL-------NDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG 139 (527) Q Consensus 80 ~~~~A~ll~~e~--~-----~i~~~d~------~~-------~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~ 139 (527) |+.+|+-|.+-. | ++.+.++ .. .+.+.+.|..++|...+.+++.+....|.+.+.+-.+. T Consensus 65 ~~~Las~l~~~ltpp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~ 144 (556) T protein:vir:73 65 QRILSSGMMSGITSPARPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVMEDD 144 (556) T ss_pred HHHHHHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeeecC Confidence 777777554431 1 2333332 22 33455678889999999999999999999998766664 Q ss_pred -CeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeeccc---cc--ccceeeecCCceEEEEEE Q lcl|NC_019418. 140 -DKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTP---TG--QEVGSTKDKSLYRITNEL 213 (527) Q Consensus 140 -~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~---~~--~~~~~~~~~~~~~I~n~l 213 (527) +.+++..++..+++- ..|..|++..++-...+.. ..- .+ +|+.. .. .....-..+....|.|.+ T Consensus 145 ~~~~r~~~~~l~~~~~-~~d~~G~vd~i~r~~~~t~-~ql------~~--~fg~~~l~~~v~~~~~~~~~~~~~~v~~~V 214 (556) T protein:vir:73 145 QDVIRTMPFPIGSYYL-ANSPRGSVDTCIRQFSMTV-RQM------VQ--EFGLDNVSTSVKGMWENGTYETWVEVNHCI 214 (556) T ss_pred CceEEEEEeecceeEE-eeCCCCCeEEEEEEEeccH-HHH------HH--HcCcccCCHHHHHHHhcCCccceEEEEEEE Confidence 456788899988884 5566776665542211110 000 00 00000 00 000000001123344444 Q ss_pred EecCCccc---cCceeecccc-cCC-cccc--eeecCCCcccEEEecCCccccccCCCccCcch-hhhhHHHHHHHHHHH Q lcl|NC_019418. 214 YKSTSDSQ---LGERVNLSEL-YPD-LQPV--TPIQGLSRPLFTYLKTPGMNNKDINSPLGLSI-FDNAKTTIDFINRTY 285 (527) Q Consensus 214 y~~~~~~~---lG~~v~l~~~-~~~-l~~~--~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~-~~~~~~lid~ld~~~ 285 (527) |...+.+. -++-.|..+. |+. .... ....|+..-+|..++ =+...++.||+|. ...+.+-++.|+..- T Consensus 215 ~pr~~~~~~~~~~~~~p~~s~~~~~~~~~~~vl~esg~~e~P~~~~R----w~~~~ge~YGrg~P~~~~lgD~k~L~~l~ 290 (556) T protein:vir:73 215 TPNVNRDSGKMDSKNKPYRSVYFESGGDSDKLLRESGFDEFPILAPR----WEVNGEDVYASSCPGMLALGQVKALQVEQ 290 (556) T ss_pred eccccccccccCcccceEEEEEEEecCCCceecccCCcccCCceeee----eeecCCcccccCccHHHhHHHHHHHHHHH Confidence 43222111 1112222222 211 1111 112333222333222 1223467899994 999999999999887 Q ss_pred HHHHHHHHc-CcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEe---ccccChHHHHHHHHH Q lcl|NC_019418. 286 DEFMWEIKM-GQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDL---TTPIRSSDYISAISE 361 (527) Q Consensus 286 s~~~~e~~~-~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~---~~~ir~e~~~~~~~~ 361 (527) -....-.+. .++.+.||++......+.. +.-..|..+. ++...++.+ ++++ ....+.++. T Consensus 291 ~~~l~~~~~~~~pp~~v~~~~~~~~~~~~-----------pgg~~~~~~~---~~~~~i~p~~~~~~d~--~~~~~~i~~ 354 (556) T protein:vir:73 291 KRKAQLIDKATNPPMVAPTSLKNQRVSLL-----------PGDVTYLDVI---SGQDGFKPAYLVNPNT--ADLLADIQD 354 (556) T ss_pred HHHHHHHHHHhcCceeccccccccceeec-----------cCccccccCC---CCccceeeeccccccH--HHHHHHHHH Confidence 777766654 5666666655421111111 1111122111 112223332 3442 222334444 Q ss_pred HHHHHHHhcCCC-cccccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhcccCCccc--Ccc Q lcl|NC_019418. 362 GLKLFEMQIGVS-SGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKVVGIYRGTIP--ELD 437 (527) Q Consensus 362 ~l~~i~~~~g~s-~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~~~~~~~~~~--~~~ 437 (527) +-..|....-.+ ...++...+...|||||....+...+..+-.-..+ ...|.-|+..++.+..-.+.....+. ... T Consensus 355 ~~~rI~~af~~d~~~~l~~~~~~r~TAtEv~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~ 434 (556) T protein:vir:73 355 TRQTINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMARKNMLPEPPDVLQGM 434 (556) T ss_pred HHHHHHHHhhcchhhhhccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCc Confidence 444443222111 01233344556899999999988888876644444 45667777776666543332222111 123 Q ss_pred ceEEEeCCCccCCHHH-H---HHHHHHHHh--cCC-------CCHHHHHH---hcCCCC------HHHHHHHHHH-HHHh Q lcl|NC_019418. 438 DISVNLDDGVFTDRHA-E---LDYWMKMVA--AGF-------ATQKRGIA---KTLGIT------EEEAEKELAE-INGE 494 (527) Q Consensus 438 ~v~v~f~d~i~~d~~~-~---~~~~~~~~~--aGi-------~s~~~~i~---~~~~~~------deea~~el~r-i~~E 494 (527) +++|++--.+...... . +.+.++.+. +++ +....++. ...|++ ++|++++-++ .+.. T Consensus 435 ~i~v~yis~La~aqk~~~~~~i~~~~~~~~~laq~~Pe~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~r~~~q 514 (556) T protein:vir:73 435 PLRIEYISVMAQAQKSIGLTSLSQTVGFIGQLAQFKPEALDKLDVDQAIDAFSEMSGVSPTVIVPQEQVQGIREERAKQA 514 (556) T ss_pred eeEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCChhhcCCHHHHHHHHHHHHHHH Confidence 4666764443221111 1 111111111 121 23333332 234543 4444333211 1111 Q ss_pred ccc----ccccc----cCCCCCC--C-----------CCCCC Q lcl|NC_019418. 495 LPP----ESDAE----LALYGKG--Q-----------QNTVG 515 (527) Q Consensus 495 ~~~----~~~~~----~~~~~~~--~-----------~~~~~ 515 (527) +.. ..... ..+..-+ + -.+++ T Consensus 515 q~~~~~~~~~~a~~~~~~~~~~~~~~~~~l~~~~~~~g~~~~ 556 (556) T protein:vir:73 515 QAAQAMAMGQAAAQGAKTLSETQTSDPSALTAIANAAGAPQQ 556 (556) T ss_pred HHHHHHHHHHHHHHHHHHhhhccCCCHHHHHHHHHhhcCCCC Confidence 100 00000 0000000 0 00111 No 127 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=98.85 E-value=2.8e-08 Score=61.97 Aligned_cols=457 Identities=9% Similarity=0.029 Sum_probs=183.9 Q ss_pred HHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHHHHhhh Q lcl|NC_019418. 7 VKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIASL 86 (527) Q Consensus 7 ~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~l 86 (527) ||.-+++.... +..++......|+.+|.---|-+.............+.-=+.+...++.+|+- T Consensus 1 m~~~~~~r~~~----------------l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~ 64 (555) T protein:vir:17 1 MKHSAQAKYMM----------------LRADREDYLDSGRQSARLTLPYILTDEGHVQGGYLPTPWQSVGSKGVNVLASK 64 (555) T ss_pred ChhHHHHHHHH----------------HHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHH Confidence 22222222211 12223345666777765433322111111111111222236677888888876 Q ss_pred hhcc--cc-----eEeeCCH---------HHH-----------HHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC Q lcl|NC_019418. 87 VYNE--QA-----EISAEDE---------TLN-----------DFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG 139 (527) Q Consensus 87 l~~e--~~-----~i~~~d~---------~~~-----------~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~ 139 (527) |.+- || ++.+.+. ... +.+...|..++|...+.++..+....|.+++ |.+. T Consensus 65 l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--y~~~ 142 (555) T protein:vir:17 65 LMLSLFPVNTSFFKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALL--YQGK 142 (555) T ss_pred HHHhhcCCCCcccccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEE--EecC Confidence 5553 11 2233321 112 2334456678999999999999999999875 5666 Q ss_pred CeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeC------CCc---ceE---EEEEEE------Eeecccccc--cce Q lcl|NC_019418. 140 DKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTE------NRK---NVY---YTLVEF------HEWVTPTGQ--EVG 199 (527) Q Consensus 140 ~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~------~~~---~~~---yt~lE~------h~~~~~~~~--~~~ 199 (527) +.. ++++-.+++ +..|..|++..++....+... .+. ... ....+. |........ ... T Consensus 143 ~~~--~~~pl~~y~-v~~d~~G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 219 (555) T protein:vir:17 143 KNL--KLYPLDRFV-VSRDGEGNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDKGKSNDA 219 (555) T ss_pred Cce--eEEEcCeEE-EeeCCCcCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhcccccCCCcce Confidence 654 346666655 456777777766543221100 000 000 000000 000000000 000 Q ss_pred -----eeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhh Q lcl|NC_019418. 200 -----STKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNA 274 (527) Q Consensus 200 -----~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~ 274 (527) .....+.+ .+|..-+ |..++ ......|...-+|..++- +...++.||+|-...+ T Consensus 220 ~v~t~~~~~~~~~----~~~~e~~----~~~v~---------~~l~e~g~~e~P~i~~Rw----~~~~ge~YGrgp~~~~ 278 (555) T protein:vir:17 220 LVYTYVCRKDGQV----KWHQECD----GKVIP---------GSNSSAPYTHNPWIPLRF----NIVDGEAYGRGRVEEF 278 (555) T ss_pred eEeecccccCCee----EEEEecC----ceecc---------ccccccCcccCCeeeeee----eecCCCccccchHHHH Confidence 00000000 1111111 11111 000011222223333321 2234678999999999 Q ss_pred HHHHHHHHHHHHHHHHHH-HcCcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccc--cC Q lcl|NC_019418. 275 KTTIDFINRTYDEFMWEI-KMGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTP--IR 351 (527) Q Consensus 275 ~~lid~ld~~~s~~~~e~-~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~--ir 351 (527) .+-++.|+..--....-. ...++.+.||++.+....+...+. ...+. .+....++.++.. .+ T Consensus 279 l~D~k~L~~l~~~~l~~~~~~~~pp~lv~~~g~~~~~~l~~~~----------~g~v~-----~g~~~~v~~~~~~~~~~ 343 (555) T protein:vir:17 279 MGDLKSLEALSQAMVEGSAASAKVVFMVSPSATTKPQNLALAA----------NGAII-----QGRPDDVSVVQANKAAD 343 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHhCCceeeccccccCcceeecCC----------Cceee-----cCCcccceeeeccccch Confidence 999999998766666544 346666677555432221111110 00111 0111223222211 11 Q ss_pred hHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhcccC Q lcl|NC_019418. 352 SSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKVVGIYR 430 (527) Q Consensus 352 ~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~~~~~~ 430 (527) ...-.+.++.+...|....-+ .+...+...|||||....+...+...-....+ ...|.-|+..++.+..-.++.. T Consensus 344 ~~~~~~~i~~~~~~I~~aFm~----~~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~g~lP 419 (555) T protein:vir:17 344 FRTVLEMIQKLEQRISDAFLM----LQVRQSERTTATEVQATVQELNEQIGGIYSNLTTELLQPYLARKLHLLQKQRKLP 419 (555) T ss_pred hhHHHHHHHHHHHHHHHHHhh----cCCCCcccchHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCCCC Confidence 111122233332222221111 11223456799999999999888887766666 3566777777777664433332 Q ss_pred CcccCccceEEEeCCCcc-CCHHHHHHHHHHHHh--cCC---------CCHHHHH---HhcCCC-------CHHHHHHHH Q lcl|NC_019418. 431 GTIPELDDISVNLDDGVF-TDRHAELDYWMKMVA--AGF---------ATQKRGI---AKTLGI-------TEEEAEKEL 488 (527) Q Consensus 431 ~~~~~~~~v~v~f~d~i~-~d~~~~~~~~~~~~~--aGi---------~s~~~~i---~~~~~~-------~deea~~el 488 (527) ..+.+...+++. -++. .-..+.++..+...+ +++ +....++ ...+|+ ++||++++- T Consensus 420 ~~p~~~v~~~i~--~~l~~l~r~~~~~~l~~~~~~laq~~~~p~~~d~id~d~~~~~~a~~~Gv~p~~ivrs~eev~~~r 497 (555) T protein:vir:17 420 QLPKDLVQPTVV--AGLWGVGRGQDKQQLMEFITTLAQTMGPEIAMKYINPTEFIKRLAAAQGIDTLQLINSPETMKQLG 497 (555) T ss_pred CCCHhhhcccee--ehHHHHHHHHHHHHHHHHHHHHHhhcCchhHhhcCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHH Confidence 222222333332 1110 011122222222111 011 2223332 334565 555554443 Q ss_pred HHHHHhc-----ccccccccCCCC------CCCCCCCCCCCCCCCccc-----cC Q lcl|NC_019418. 489 AEINGEL-----PPESDAELALYG------KGQQNTVGNSKDTVDDED-----EA 527 (527) Q Consensus 489 ~ri~~E~-----~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~-----~~ 527 (527) ++.+.++ .....+...... ...++.. .....+...- |+ T Consensus 498 q~~~~~~~q~~~~~qa~~~~~~~~~~~~~~~~~~~~~-~a~~~~~a~~~~~~~~~ 551 (555) T protein:vir:17 498 DQQKQDMVQASLINQAGQLAKTPMAEQAMQLIQQQQE-GAQDAGAAESETSSAEA 551 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhhHHhccccchh-hhhHHHHHHhhcCCccc Confidence 2221111 000000000000 0000000 0000010011 11 No 128 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=98.84 E-value=2.9e-08 Score=61.88 Aligned_cols=478 Identities=10% Similarity=0.071 Sum_probs=202.9 Q ss_pred hHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccc-cc---ccCccccCceeecchHHH Q lcl|NC_019418. 3 LIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEY-TN---TDGDRKRRKMQHLPIART 78 (527) Q Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~-~~---~~~~~~~~~~~~lnl~~~ 78 (527) |-++.+..+++.... +..++......|+.+|.---|-+.. .. ..+.. ...++--+.+.. T Consensus 1 m~~~~~~~l~~r~~~----------------l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~-~~~~~~dst~~~ 63 (559) T protein:vir:95 1 MAETTKERLNKQFAQ----------------LESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDR-RNTRIIDSTGTM 63 (559) T ss_pred CChhhHHHHHHHHHH----------------HHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcccc-cccccccchHHH Confidence 444455544433322 2334455666777776543222211 11 11111 122223356777 Q ss_pred HHHHHhhhhhccc--c-----eEeeCCH------HHHHH-------HHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe Q lcl|NC_019418. 79 AAKKIASLVYNEQ--A-----EISAEDE------TLNDF-------LSDMLSNDRFNKNFERYLESALALGGLAMRPYVD 138 (527) Q Consensus 79 i~~~~A~ll~~e~--~-----~i~~~d~------~~~~~-------l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d 138 (527) .|+.+|+-|.+-. | ++.+.++ ..+++ +.+.|..++|...+.++..+....|.+.+.+-.| T Consensus 64 a~~~Las~l~~~ltpp~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~d 143 (559) T protein:vir:95 64 AARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLDD 143 (559) T ss_pred HHHHHHHHHHHhhcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeecC Confidence 7777777554431 1 2333332 23333 4567888999999999999999999999876666 Q ss_pred C-CeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeeccc---cc--ccceeeecCCceEEEEE Q lcl|NC_019418. 139 G-DKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTP---TG--QEVGSTKDKSLYRITNE 212 (527) Q Consensus 139 ~-~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~---~~--~~~~~~~~~~~~~I~n~ 212 (527) . +.+++..++..+++- ..|..|++..++....+.. .. +.+. |+.. .. ...........+.|.|. T Consensus 144 ~~~~~r~~~~~l~~~~v-~~d~~G~vd~i~r~~~~t~-~q------l~~~--fg~~~l~~~~~~~~~~~~~~~~v~v~~~ 213 (559) T protein:vir:95 144 DEDIIRTMPFPIGSYYL-ANSPRGSVDTCFRKFSMTV-RQ------LVQE--FGLNNVSESVKSMWESGTYEKWIEVMHS 213 (559) T ss_pred CCceeEEEEeecCeEEE-eeCCCCCeEEEEEeEecCH-HH------HHHH--cCcccCCHHHHHHHhcCCCCCeEEEEEE Confidence 4 456788899999885 5566776665543211100 00 0000 0000 00 00000000112344444 Q ss_pred EEecCCccc--c-Cceeeccccc-CC-cccc--eeecCCCcccEEEecCCccccccCCCccCcc-hhhhhHHHHHHHHHH Q lcl|NC_019418. 213 LYKSTSDSQ--L-GERVNLSELY-PD-LQPV--TPIQGLSRPLFTYLKTPGMNNKDINSPLGLS-IFDNAKTTIDFINRT 284 (527) Q Consensus 213 ly~~~~~~~--l-G~~v~l~~~~-~~-l~~~--~~~~g~~~p~f~~~~~~~~N~~~~~splG~S-~~~~~~~lid~ld~~ 284 (527) +|...+.+. + .+..|..++| +. .... ....|+..-+|..++ =+...++.||+| .-..+.+-++.|+.. T Consensus 214 V~pr~~~~~~~~~~~~~pf~s~~~e~~~~~~~~l~esg~~e~P~~~~R----w~~~~ge~YGrg~P~~~al~d~k~L~~l 289 (559) T protein:vir:95 214 VYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPR----WEVNGEDVYGSSCPGMLALGPVKALQLL 289 (559) T ss_pred EeccccccccccccccceEEEEEEEecCCCceeeecCCcccCCcccee----eeecCCccccccchHHHhhHHHHHHHHH Confidence 443222111 1 1222333332 11 0111 112333222222222 123346789999 589999999999988 Q ss_pred HHHHHHHHHc-CcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceE---eccccChHHHHHHHH Q lcl|NC_019418. 285 YDEFMWEIKM-GQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVD---LTTPIRSSDYISAIS 360 (527) Q Consensus 285 ~s~~~~e~~~-~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~---~~~~ir~e~~~~~~~ 360 (527) --....-.+. .++.+.+|.+......+.. +....|. +... +...++. .++++- .-...++ T Consensus 290 ~~~~l~~~~~~~~pp~~v~~~~~~~~~~l~-----------pgg~~~~--~~~~-~~~~i~p~~~~~~~~~--~~~~~i~ 353 (559) T protein:vir:95 290 QKRKSQLIDKATNPPMVAPTSLKNQRASLL-----------PGDITYI--DQIT-GQDGFRPAYLVNPSTA--DLVADIQ 353 (559) T ss_pred HHHHHHHHHHHhcCceeccccccccceeee-----------ccceeee--CCCC-CcccceeecccccchH--HHHHHHH Confidence 8777776654 5666666665431111111 1111111 1111 1112332 233321 1122233 Q ss_pred HHHHHHHHhcCCCc-ccccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhcccCCccc--Cc Q lcl|NC_019418. 361 EGLKLFEMQIGVSS-GMFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKVVGIYRGTIP--EL 436 (527) Q Consensus 361 ~~l~~i~~~~g~s~-~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~~~~~~~~~~--~~ 436 (527) .+...|....-.+. ..+....+...|||||....+...+..+-.-..+ ...|.-|+..++.+..-.+.....+. .. T Consensus 354 ~~~~rI~~af~~d~~~~l~~r~~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~ 433 (559) T protein:vir:95 354 DTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKNMLPPPPDVMEG 433 (559) T ss_pred HHHHHHHHHhhhhhHHHhhcCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccC Confidence 33333322221110 1223334566799999999988887776644444 45667777776666543332221111 12 Q ss_pred cceEEEeCCCccCCHHH-H---HHHHHHHHh--cCC-------CCHHHHHH---hcCCCC------HHHHHHHHHH-HHH Q lcl|NC_019418. 437 DDISVNLDDGVFTDRHA-E---LDYWMKMVA--AGF-------ATQKRGIA---KTLGIT------EEEAEKELAE-ING 493 (527) Q Consensus 437 ~~v~v~f~d~i~~d~~~-~---~~~~~~~~~--aGi-------~s~~~~i~---~~~~~~------deea~~el~r-i~~ 493 (527) .++.|++--.+..-... . +.+.++.+. +++ +....++. ...|++ ++|+++.-++ .++ T Consensus 434 ~~i~v~~is~La~aqk~~~~~~i~~~~~~~~~laq~~Pevld~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~rqqr~~~ 513 (559) T protein:vir:95 434 MPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQVKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVEQARQQRAQQ 513 (559) T ss_pred cceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhhhcCCHHHHHHHHHHHhCCchhhcCCHHHHHHHHHHHHHH Confidence 34666664433211100 0 111111111 121 33333332 234544 3443322111 111 Q ss_pred hc-------c-cccccccCCCCCCCCCCC--CCC-CCCCCccccC Q lcl|NC_019418. 494 EL-------P-PESDAELALYGKGQQNTV--GNS-KDTVDDEDEA 527 (527) Q Consensus 494 E~-------~-~~~~~~~~~~~~~~~~~~--~~~-~~~~~~~~~~ 527 (527) .+ + ..++....+..-....+. +.. +.-+|..... T Consensus 514 qq~~q~~~~~~~aa~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 558 (559) T protein:vir:95 514 QQQQQMMAMGMAAAQGVKTLSEAKTSDPSVLSAMANAVSGQGGQS 558 (559) T ss_pred HHHHHHHHHHHHHHHhhhccccccCCChhHHHHHHHhhcCccccC Confidence 00 0 000000000000000000 000 0000000000 No 129 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=98.80 E-value=3.5e-08 Score=61.45 Aligned_cols=489 Identities=7% Similarity=-0.009 Sum_probs=215.0 Q ss_pred CC----hHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchH Q lcl|NC_019418. 1 MS----LIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIA 76 (527) Q Consensus 1 m~----~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~ 76 (527) |. .+.+++.+|+.- +.-.++-.....+..+||.|. .|........ ..+.+..+|+= T Consensus 1 m~d~~~~~~~~~~~~~~~-----------------~~~~~~~r~~a~~d~~fy~G~--Qw~~~~~~~l-~~q~rp~~N~i 60 (725) T protein:vir:77 1 MADNENRLESILSRFDAD-----------------WTASDEARREAKNDLFFSRVS--QWDDWLSQYT-TLQYRGQFDVV 60 (725) T ss_pred CCchHHHHHHHHHHHHHH-----------------HHhhHHHHHHHHHHHHhhCCC--CCCHHHHHHH-HhcCCCccccH Confidence 43 344444444421 112344555666778899984 3321111111 11112245777 Q ss_pred HHHHHHHhhhhhcccceEee-----CCHHHHHHHHH----HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe---C----C Q lcl|NC_019418. 77 RTAAKKIASLVYNEQAEISA-----EDETLNDFLSD----MLSNDRFNKNFERYLESALALGGLAMRPYVD---G----D 140 (527) Q Consensus 77 ~~i~~~~A~ll~~e~~~i~~-----~d~~~~~~l~~----~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d---~----~ 140 (527) +-+|+...++--...+.+.+ ++...++.|.. +.+.+++......+...+++.|-+|+.++.| . + T Consensus 61 ~~~i~~v~g~~~~nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~ 140 (725) T protein:vir:77 61 RPVVRKLVSEMRQNPIDVLYRPKDGARPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSN 140 (725) T ss_pred HHHHHHHHhhHHhCCcceEEecCCccHHHHHHHHHHHHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCCCCCCC Confidence 77788877776666666665 24445555544 4556777888888999999999999999865 1 2 Q ss_pred eeEEEEE----cCCceEEEEEcCCce----EEE--EEEEEEEeeCC-------CcceEEEEEEEEeeccc---ccc-cce Q lcl|NC_019418. 141 KIRVAFI----QAPVFLPLQSNTQDV----SSA--AILTKTIKTEN-------RKNVYYTLVEFHEWVTP---TGQ-EVG 199 (527) Q Consensus 141 ~~~i~~v----~a~~~~P~~~d~~~~----~~~--a~~~~~~~~~~-------~~~~~yt~lE~h~~~~~---~~~-~~~ 199 (527) .++|.++ ++.++|. |..-. ..| ++..+++..+. .+..+....+++....+ |.. ... T Consensus 141 ~~~i~~~~~~~~~~~v~~---Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~v 217 (725) T protein:vir:77 141 NQVIRREPIHSACSHVIW---DSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTI 217 (725) T ss_pred ceeeEEeecccChhhcee---CchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCee Confidence 3444443 3444442 22111 111 11112221110 00000000011000000 000 000 Q ss_pred eeecCCceE---EEEEEEecCCccccCceeecc--c---c-----------------------cCCcccceeecCCC--- Q lcl|NC_019418. 200 STKDKSLYR---ITNELYKSTSDSQLGERVNLS--E---L-----------------------YPDLQPVTPIQGLS--- 245 (527) Q Consensus 200 ~~~~~~~~~---I~n~ly~~~~~~~lG~~v~l~--~---~-----------------------~~~l~~~~~~~g~~--- 245 (527) .. ...|+ +.-.+|...++ ..|..+.+. + + |--+.+...+.+-. T Consensus 218 rv--~E~~~r~~~~~~~~~~~~~-~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~~~ 294 (725) T protein:vir:77 218 QI--AEFYEVVEKKETAFIYQDP-VTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIA 294 (725) T ss_pred EE--EEEEEEEEEeeEEEEecCC-CCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCcCC Confidence 00 00000 00011111111 122211111 0 0 00011111111100 Q ss_pred cccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHc-CcceeeechhHhcCCCCCCCcccccccccc Q lcl|NC_019418. 246 RPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKM-GQRRVIVPEQMTQLKVQDNQGNIAFKRRFD 324 (527) Q Consensus 246 ~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~-~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d 324 (527) .-.|-|+|.-+.-..-.++|++-+.+.++++.++.+|...|...+.+-. .+.+..+..+.+..... .+. T Consensus 295 ~~~~P~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~----------~~~ 364 (725) T protein:vir:77 295 GEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEH----------MYD 364 (725) T ss_pred CCccceEEEeeeeeccCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHH----------HHH Confidence 0012222211111112357787788999999999999999999988844 55555555555421100 000 Q ss_pred -cccceeee---ccCCCCC--CCcceEe-ccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHH Q lcl|NC_019418. 325 -VEQNVYMQ---VGAGNMD--SGGIVDL-TTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDT 397 (527) Q Consensus 325 -~~~~~~~~---~~~~~~~--~~~i~~~-~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~ 397 (527) ++...|.. +...++. ...+..+ +++++. .+...++.....|...+|+....+|..++. .++-.|.+..... T Consensus 365 ~~~~~~~~~~~~~~~~~g~~~~~~i~~~~~~~lp~-~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~-~SG~ai~~rq~qg 442 (725) T protein:vir:77 365 GNDDYPYYLLNRTDENSGDLPTQPLAYYENPEVPQ-ANAYMLEAATSAVKEVATLGVDTEAVNGGQ-VAFDTVNQLNMRA 442 (725) T ss_pred hccCCceecccccccCCCcccccCccccCCCCchH-HHHHHHHHHHHHHHHHhCCCHHHhCCCchh-hHHHHHHHHHHHH Confidence 11111111 1111111 1233333 345544 467788888888999999999988877553 4566777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhh-h------cccCCc-c------------------------cCccceEEEeCC Q lcl|NC_019418. 398 YQMRNSIVALVEQSIKELCVSMCELGKV-V------GIYRGT-I------------------------PELDDISVNLDD 445 (527) Q Consensus 398 ~~~~~~~~~~~~~al~~li~~il~~~~~-~------~~~~~~-~------------------------~~~~~v~v~f~d 445 (527) ......+...+..+.+.+.+.+|.+... + .+.+.. . ...++|+|+=.. T Consensus 443 ~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p 522 (725) T protein:vir:77 443 DLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGP 522 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeecc Confidence 7777777788888888877777765332 1 010100 0 011344444333 Q ss_pred CccCCHHHHHHHHHHHHhcC--CCCH-HHHHHhcCCCCH-HHHHHHHHHHHHhcccccccccCCCCCCCCCCCCC-CCCC Q lcl|NC_019418. 446 GVFTDRHAELDYWMKMVAAG--FATQ-KRGIAKTLGITE-EEAEKELAEINGELPPESDAELALYGKGQQNTVGN-SKDT 520 (527) Q Consensus 446 ~i~~d~~~~~~~~~~~~~aG--i~s~-~~~i~~~~~~~d-eea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~-~~~~ 520 (527) +.+.=+++.++.++++..+. .++. -..+..+....+ +.+++.+++|++...+.....+.-+.+.+...... .-.. T Consensus 523 ~~~s~r~~~~~~l~qll~~~~~~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~q~~~~~e~q~~~~~qq~~~~ 602 (725) T protein:vir:77 523 SFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQG 602 (725) T ss_pred chHHHHHHHHHHHHHHHHhccccchhHHHHHHHhhccccchHHHHHHHHHHhhhhhhhccCCCChhhHHHHHHHHHHHHH Confidence 32222345555555555331 1111 122333334333 33555666776654432211111000000000000 0000 Q ss_pred CCccc----cC Q lcl|NC_019418. 521 VDDED----EA 527 (527) Q Consensus 521 ~~~~~----~~ 527 (527) ..+.+ ++ T Consensus 603 q~~~e~~q~q~ 613 (725) T protein:vir:77 603 QQDPAMVQAQG 613 (725) T ss_pred hHHHHHHHHHH Confidence 00000 00 No 130 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=98.78 E-value=5.1e-08 Score=60.56 Aligned_cols=468 Identities=9% Similarity=0.005 Sum_probs=219.9 Q ss_pred hHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccccc----C-ccccCceeecchHH Q lcl|NC_019418. 3 LIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTD----G-DRKRRKMQHLPIAR 77 (527) Q Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~----~-~~~~~~~~~lnl~~ 77 (527) |=+++++++++...++-. .+.-.++-.....+-++||.+....|...... . ....+..+..|+=+ T Consensus 1 ma~~~~~~l~~~~~~~~~----------~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~ 70 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDR----------AHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKIS 70 (720) T ss_pred CchHHHHHHHHHHHHHHH----------HHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHH Confidence 556667777665433210 11112333444555677887555555322211 1 12245668889999 Q ss_pred HHHHHHhhhhhcccceEeeC------CHHHHHHHHH----HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe---C----- Q lcl|NC_019418. 78 TAAKKIASLVYNEQAEISAE------DETLNDFLSD----MLSNDRFNKNFERYLESALALGGLAMRPYVD---G----- 139 (527) Q Consensus 78 ~i~~~~A~ll~~e~~~i~~~------d~~~~~~l~~----~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d---~----- 139 (527) .+|+...++--...+.+.+. +...++.|.. +.+.++.......+...++..|-+|+++++| + T Consensus 71 ~~v~~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~d~~~ 150 (720) T protein:vir:35 71 TELNRIISEYRHNRITVKFRPGDKTASEALANKLNGLFRADYEETDGGEACDNAFDDGSTGGFGCFRLTTNLVNALDPMD 150 (720) T ss_pred HHHHHHHhHHHhCCCceEEEcCCCcchHHHHHHHHHHHHHHHHhcCchHHHhHHHHHhhhccceeEEeeecccccCCCCc Confidence 99999999887777777652 3344555544 4557778888899999999999999999875 1 Q ss_pred --CeeEEEEE--cCCceEE--EEEcCCceEEE--EEEEEEEeeCC----------------Ccc--------eEEEEEEE Q lcl|NC_019418. 140 --DKIRVAFI--QAPVFLP--LQSNTQDVSSA--AILTKTIKTEN----------------RKN--------VYYTLVEF 187 (527) Q Consensus 140 --~~~~i~~v--~a~~~~P--~~~d~~~~~~~--a~~~~~~~~~~----------------~~~--------~~yt~lE~ 187 (527) +.++|..| ++..++. -+..-+ ...| ++..+++..+. ... ..-++.|+ T Consensus 151 ~~~~i~i~~v~~~~~~v~~Dp~a~~~D-~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~d~~~~~~v~i~E~ 229 (720) T protein:vir:35 151 ERQRICLEPIYDPARSVWFDPDAKKYD-KSDAEWAFCMYSLSAEKYKAEYNKDPATLMSGIERSWDYDWYDVDVVYIAKY 229 (720) T ss_pred ccceeeEecccCchhheeecccccccC-hhhhhhhhhhcCCCHHHHHHhCCCccccccccccccccccccCCCceEEEEe Confidence 23455433 3334442 111101 1111 11111110000 000 00112333 Q ss_pred EeecccccccceeeecCCceEEEEEEEecCCccccCceeecccc----------------------------cCCcccce Q lcl|NC_019418. 188 HEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSEL----------------------------YPDLQPVT 239 (527) Q Consensus 188 h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~----------------------------~~~l~~~~ 239 (527) ..... ..++..+|.. ...|..+...+. |--+-+.. T Consensus 230 ~~~~~--------------~~~~~~~~~~---~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~ 292 (720) T protein:vir:35 230 YEVKK--------------ESVDVVSFQN---PLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEG 292 (720) T ss_pred eEEEE--------------EEEEEEEeec---CCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccch Confidence 21100 0000011111 011221111100 00000111 Q ss_pred eecCCCcccEEEecC-Ccccc--ccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCcc Q lcl|NC_019418. 240 PIQGLSRPLFTYLKT-PGMNN--KDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGN 316 (527) Q Consensus 240 ~~~g~~~p~f~~~~~-~~~N~--~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~ 316 (527) .+.+-..+++.+||. |.--. .-.++|..-+.+.++++.++.+|.+.|.+.+-+-..+..+.. ... ++-.+. T Consensus 293 ~l~~~~~~p~~~fP~vP~~g~r~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~--~a~----~~~~~~ 366 (720) T protein:vir:35 293 FLEKAQRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDAQRLYNLQVSMLADSATQDTGSIPI--VGK----SQIKTL 366 (720) T ss_pred hcccCCCCCCCccceEEEEeeeeccCCCcccceeeecchhHHHHHHHHHHHHHHHHHcCCccccc--cCc----chHHHH Confidence 111111112222221 11111 113455444669999999999999999999987544432221 111 000000 Q ss_pred cccccccccccceeeeccC---CCCC----CCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHH Q lcl|NC_019418. 317 IAFKRRFDVEQNVYMQVGA---GNMD----SGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATE 389 (527) Q Consensus 317 ~~~~~~~d~~~~~~~~~~~---~~~~----~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAte 389 (527) .......+.+...|..++. .+|. +..+...++.-....+...++.-...|....|++...+|..++ .++.+ T Consensus 367 ~~~~a~~~~~~~~~l~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~sn--~SG~A 444 (720) T protein:vir:35 367 EKYWANRNKNRPAFLPLNEIVDKQGNIIAPPTPVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPMPSN--IAKET 444 (720) T ss_pred HHHhhccccccccccccccccccCcccccCCCcccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCcccc--hHHHH Confidence 0000011112222222111 1111 1234455543334567888888888899999999999997543 46777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-hc------ccC--C------------------------cccCc Q lcl|NC_019418. 390 IVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKV-VG------IYR--G------------------------TIPEL 436 (527) Q Consensus 390 i~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~-~~------~~~--~------------------------~~~~~ 436 (527) |.++..............+..+.+.+-+.+|.+... +. +.+ + ..... T Consensus 445 i~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~ 524 (720) T protein:vir:35 445 VNHLMHRSDMSSFIYLDNMAKSLKRAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGR 524 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeee Confidence 888777777777777777777777777766655432 11 111 0 00122 Q ss_pred cceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHH--------HHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCC Q lcl|NC_019418. 437 DDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKR--------GIAKTLGITEEEAEKELAEINGELPPESDAELALYGK 508 (527) Q Consensus 437 ~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~--------~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~ 508 (527) ++|+|+=..+-+.-.++.++.++++.. .|++.. .+.++-+++- +++.+++++....+.....+. T Consensus 525 yDv~v~~~p~~~s~req~~~~m~qll~--~~~p~~~~~~~~~~~ile~~d~p~--~~e~~erirk~~~~~~~~~~~---- 596 (720) T protein:vir:35 525 YDVTVDVGPSYTARRDATVSVLTNLLA--GMLPQDPMRQVLQGIILDNMEGEG--LDEFKEYNRKQLLTQGVVKPR---- 596 (720) T ss_pred eEEEEecccCcccHHHHHHHHHHHHHH--hcCCCchhHHHHHHHHHHhcCchh--HHHHHHHHHhhcchhcccCcc---- Confidence 345555444444445555666666553 233221 1233333432 345556666544322111000 Q ss_pred CCCCCCCCCCCCCCccccC Q lcl|NC_019418. 509 GQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 509 ~~~~~~~~~~~~~~~~~~~ 527 (527) ..+++. T Consensus 597 -------------~~e~qq 602 (720) T protein:vir:35 597 -------------NTEEEQ 602 (720) T ss_pred -------------ChhHHH Confidence 011111 No 131 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=98.77 E-value=5.4e-08 Score=60.43 Aligned_cols=402 Identities=12% Similarity=0.114 Sum_probs=160.5 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccc-cCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRK-RRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~-~~~~~~lnl~~~i 79 (527) ||||+++|++|++-... .... ....++ +......+ |-.. ...|..- ...-+...---.+ T Consensus 7 mg~f~r~~~~~~~~~~~-------~~~~--~~~~~~-----~~~~~~~~-~~~~-----~~~g~~v~~~~al~~~~V~~~ 66 (432) T protein:vir:81 7 LGLFGQLKAMFVPPDPV-------DIGG--GQTFTP-----VNATARDL-GIII-----SDTGAAVNADAIMRLDAVAAC 66 (432) T ss_pred cchhhhhhhhccccccc-------cccc--cccccc-----Cccchhhh-cccc-----cccCcccchHhhhccHHHHHH Confidence 99999999998752100 0000 001110 11000111 1000 0001100 0000111111223 Q ss_pred HHHHhhhhhcccceE---------eeCCHHHHHHHHHHHhhhhH---HHHHHHHHHHHHhcCCEEEEEEEeCCee-EEEE Q lcl|NC_019418. 80 AKKIASLVYNEQAEI---------SAEDETLNDFLSDMLSNDRF---NKNFERYLESALALGGLAMRPYVDGDKI-RVAF 146 (527) Q Consensus 80 ~~~~A~ll~~e~~~i---------~~~d~~~~~~l~~~l~~n~f---~~~~~~~~~~a~~~G~~~~~~~~d~~~~-~i~~ 146 (527) ++.+|+-+-+-|..+ .+.+..+...|.. .-|.. ...++.++...+..|.+++.+..+++++ .+.+ T Consensus 67 i~~Ia~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~--~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~~g~~~~L~~ 144 (432) T protein:vir:81 67 VKLVSQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLD--GPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTDGRIESLQY 144 (432) T ss_pred HHHHHHhhhhCceeeEEecCCcceecccchHHHHHHh--cccccCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEEEEE Confidence 333444333334332 1122223333321 01111 1223445566777899888877766654 4556 Q ss_pred EcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCcee Q lcl|NC_019418. 147 IQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERV 226 (527) Q Consensus 147 v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v 226 (527) ++|+.+-+.. +.++ ..+|.. .. .+ |..+ T Consensus 145 l~~~~v~v~~-~~~g-----------------~~~y~~-~~---------------~~------------------g~~~ 172 (432) T protein:vir:81 145 LANDRLTITT-DPKG-----------------NTAYRY-RR---------------TD------------------GQMI 172 (432) T ss_pred EcCCceEEEE-CCCC-----------------cEEEEE-Ee---------------cC------------------ceEE Confidence 7776655432 2222 112210 00 00 0000 Q ss_pred ecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHh Q lcl|NC_019418. 227 NLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMT 306 (527) Q Consensus 227 ~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l 306 (527) .+ +.. -+.||+.+..| .-.|+|-+.-+...|......-.-..+-|+.|.. |..++ T Consensus 173 ~~-------~~~---------~iih~r~~~~d-----g~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~----~~gil 227 (432) T protein:vir:81 173 DI-------PKQ---------QIWKIMGYSLD-----GENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQL----QSVYY 227 (432) T ss_pred EE-------ccc---------cEEEecCCCCC-----CcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCC----cceEE Confidence 00 000 12244432112 1258888877776665444332222233454332 22222 Q ss_pred cCCCCCCCcccccccccccccceeeec-cCC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccc Q lcl|NC_019418. 307 QLKVQDNQGNIAFKRRFDVEQNVYMQV-GAG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDG 381 (527) Q Consensus 307 ~~~~~~~~~~~~~~~~~d~~~~~~~~~-~~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~ 381 (527) ..+....... ...| +.-|.+. +.+ -.++..++.++......++.+..+...++|+...|++|..+|... T Consensus 228 ~~~~~l~~e~---~~~~---~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~ 301 (432) T protein:vir:81 228 QIDRFLTDDQ---YDSF---AKKVSGSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSS 301 (432) T ss_pred ecCCCCCHHH---HHHH---HHHHhhhhcCCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcC Confidence 2211111000 0001 1111111 100 012224566666667788888888889999999999999998765 Q ss_pred ccc-chHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHH Q lcl|NC_019418. 382 QGV-KTATEIVSENSDTY-QMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWM 459 (527) Q Consensus 382 ~g~-~TAtei~s~~~~~~-~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~ 459 (527) .+. .+++.+.......+ .+..-+...++.+|.. .|+.........+.++++.-+..|..+.++... T Consensus 302 ~~~~~~~sn~eq~~~~f~~~tl~P~~~~ie~~l~~------------kLl~~~~~~~~~~~fd~~~llr~d~~~r~~~~~ 369 (432) T protein:vir:81 302 AGTTSWGSGIESQQLGFLTMTLSPWLRRIEQSIAL------------NLLSPAERRRYFADFDTSALLRADSAARSSYYS 369 (432) T ss_pred CccccccchHHHHHHHHHHHHHHHHHHHHHHHHHh------------hccCccccCceEEEeechhhhccCHHHHHHHHH Confidence 432 22222211111111 1222222222222221 121111122234555555666788899999999 Q ss_pred HHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCC--CCCCCCCCccccC Q lcl|NC_019418. 460 KMVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTV--GNSKDTVDDEDEA 527 (527) Q Consensus 460 ~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 527 (527) +++.+|+|++-+++..+ |+..-+-...+-.+..... .+...+++.++ .....|..++..| T Consensus 370 ~~~~~G~~t~NE~R~~~-glpp~~g~~~~~~~~~~~~-------pl~~~~~~~~~~~~~~~~n~~~~~~~ 431 (432) T protein:vir:81 370 QLVNNGLMTRDEAREIE-GLPKLGGNAAVLTVQSAMV-------PLDSIGLQASPEPASGLGNQQQDKVS 431 (432) T ss_pred HHHhCCCCCHHHHHHHh-CCCCCCCCcceEeecCccc-------chhhhccCCCCCCCCCCCCccccccc Confidence 99999999999976553 5432100000000000000 00000000000 0000111111111 No 132 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=98.76 E-value=6.1e-08 Score=60.15 Aligned_cols=471 Identities=11% Similarity=0.022 Sum_probs=174.1 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhc---CCCcc-------cccccccCccccCce Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQ---SKFDD-------IEYTNTDGDRKRRKM 70 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~---g~~~~-------l~~~~~~~~~~~~~~ 70 (527) =++-+.|+..++... ..+......|+..|. ..+.. ......+.....|++ T Consensus 23 ~~~~~~l~~~~~~~~--------------------~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~k 82 (641) T protein:vir:94 23 DRIGGVVISKWQESR--------------------DKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADWRHR 82 (641) T ss_pred hhHHHHHHHHHHHHH--------------------HhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhccccc Confidence 122222333332211 111122334554432 21111 111111112222445 Q ss_pred eecchHHHHHHHHhhhh----hcccceEee-----CCHH----HHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEE Q lcl|NC_019418. 71 QHLPIARTAAKKIASLV----YNEQAEISA-----EDET----LNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYV 137 (527) Q Consensus 71 ~~lnl~~~i~~~~A~ll----~~e~~~i~~-----~d~~----~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~ 137 (527) +..+-+...++.+++-| +....-+.+ ++.+ ++++++..+.+++|...+...+.+++.+|.++++++| T Consensus 83 i~~~~~~~~~~~l~s~Lm~~~~p~~~wf~~~p~~~ed~~~A~~~~~~~~~~l~~~~~~~~~~~~~~d~~~~g~~iv~~~w 162 (641) T protein:vir:94 83 INTGHTFEVVETLVAYFKGATFPSDDWFDLKGMVPELADAARVVKQLTKTKLEAASIRDIFETYVRNLVLYGVSTYRLGW 162 (641) T ss_pred ccchhHHHHHHHHhhHHhhhhcCCCceEEEecCCCChHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHhhcCceEEEeeh Confidence 55666666666666543 333323433 2333 3456777778888999999999999999999999998 Q ss_pred eC-----------------------------CeeEEEEEcCCceEEEEEcCCceEEEEEEE-EEEeeCC---CcceEEEE Q lcl|NC_019418. 138 DG-----------------------------DKIRVAFIQAPVFLPLQSNTQDVSSAAILT-KTIKTEN---RKNVYYTL 184 (527) Q Consensus 138 d~-----------------------------~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~-~~~~~~~---~~~~~yt~ 184 (527) +. ..+++..|++..+++ ..+.+...++|+. +.++..- ....||-. T Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~~--dps~~~~~~~f~~~r~t~~t~~~l~~eg~~~~ 240 (641) T protein:vir:94 163 DTSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVWL--DTSGGKNTGTFVRLRHTREELHELVTSGYYDL 240 (641) T ss_pred hhHHHHhhhhhcccchhhcccccccceecccceeeEEecchhheee--cCCCCcccccceehhhhHHHHHHHHhcCCCCh Confidence 51 123445555555553 1222333333221 1111000 00011100 Q ss_pred ---EEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceee--cCC---CcccEEEecCCc Q lcl|NC_019418. 185 ---VEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPI--QGL---SRPLFTYLKTPG 256 (527) Q Consensus 185 ---lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~--~g~---~~p~f~~~~~~~ 256 (527) -+.|................+...-.+++|+-.. +-.+...++...+-.......+ .+. ...+|+.++. T Consensus 241 d~v~~~~~~~~~~~~~d~~~d~~~~~~~~~~~~e~~g-d~~~d~~~~~~~~~~~~g~~il~~~~~~~~d~~Pf~~~r~-- 317 (641) T protein:vir:94 241 DLTQVEQYVDYKFADPDTPKDVNGTDTSGWDIIEYYG-PLLVEGVQFWCVHAVFYGKQLIRLSDSKYWCGSPFVTTTL-- 317 (641) T ss_pred hhcchhhcccccccccccccccccccccccceeeeee-eeccCCCceeeEEEEEeCCEEeecccccccCcCCeEEecc-- Confidence 0011110000000000000000000001110000 0001111111110000111111 111 1113433332 Q ss_pred cccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHc-CcceeeechhHhcCCCCCCCcccccccccccccceeeeccC Q lcl|NC_019418. 257 MNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKM-GQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGA 335 (527) Q Consensus 257 ~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~-~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~ 335 (527) ....++.||.|....+.+.++.||...-+....+.. ..+.+.++.+.+-...+- ...++. .++. + T Consensus 318 --~~~~~~~YG~gp~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~~~l---~~~PG~------ii~~--~- 383 (641) T protein:vir:94 318 --LPDRDSVYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKREDV---KAKPGA------VFKV--A- 383 (641) T ss_pred --eecCCcccCCChHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeecccccccccee---eccCCc------ceee--C- Confidence 112357899999999999999999999888877754 444444433322111100 000111 1111 1 Q ss_pred CCCCCCcceEec---cccChHHHHHHHHHHHHHHHHhcCCCc--ccccccccccchHHHHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_019418. 336 GNMDSGGIVDLT---TPIRSSDYISAISEGLKLFEMQIGVSS--GMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVE- 409 (527) Q Consensus 336 ~~~~~~~i~~~~---~~ir~e~~~~~~~~~l~~i~~~~g~s~--~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~- 409 (527) ....++.++ +++... ...++.+-..+....+.+. +......+...|||||....+........+.+.|. T Consensus 384 ---~~~~v~pl~~~~~~~~~~--~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~ 458 (641) T protein:vir:94 384 ---QHGSLQPIDMGRQDFVVT--YQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIED 458 (641) T ss_pred ---CCCcceeecCCccccchh--HHHHHHHHHHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111122222 222221 1222222222333222221 11111122246999999888888888888888886 Q ss_pred HHHHHHHHHHHHHhhhhccc--------------CCcccCccceEEEeCCCccCCHHHH------HHHHHHHHhc-CCCC Q lcl|NC_019418. 410 QSIKELCVSMCELGKVVGIY--------------RGTIPELDDISVNLDDGVFTDRHAE------LDYWMKMVAA-GFAT 468 (527) Q Consensus 410 ~al~~li~~il~~~~~~~~~--------------~~~~~~~~~v~v~f~d~i~~d~~~~------~~~~~~~~~a-Gi~s 468 (527) ..|..|+..++.+....... +..++...++..+|+- ++...... ++...+..+. |..+ T Consensus 459 e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~p~~L~~~~~i-v~l~~~q~~~~~~~i~~l~~~~~~~a~~P 537 (641) T protein:vir:94 459 SSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVSPEYLHYPYKF-LALGANYVVERERMVTDLLQLLDISGRVP 537 (641) T ss_pred HHHHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCCCCccceeeeeeE-eecchhHHHHHHHHHHHHHHHHHHhhcCh Confidence 57788888877765432110 1112222334444432 33332221 1122222211 1111 Q ss_pred HHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCC------------------CCCCCCCCccccC Q lcl|NC_019418. 469 QKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTV------------------GNSKDTVDDEDEA 527 (527) Q Consensus 469 ~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~------------------~~~~~~~~~~~~~ 527 (527) . +....++ +.+.+++.+. ....+... -+ ..++.++ +.......|+-.+ T Consensus 538 -~--v~d~~d~--~~~~~~~~~~---~g~~~p~~-~i--r~~~~~~~~~~~~~~~~q~~~~~~a~~~~~~~~~~a~~ 603 (641) T protein:vir:94 538 -Q--IGQSLDY--ALILEDLLRQ---MRFTDPMR-YI--KKAEAPPAAPPIAPAEPGALPPEMMNSVGGGLNDQAIA 603 (641) T ss_pred -h--hhhcCCH--HHHHHHHHHH---hCCCCchh-hc--cCccCchhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHH Confidence 0 0111011 1111111110 00000000 00 0000000 0000000000000 No 133 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=98.74 E-value=7e-08 Score=59.80 Aligned_cols=424 Identities=11% Similarity=0.095 Sum_probs=168.1 Q ss_pred hh--cccchhhhccCccccCHHHHHHHH---HHHHHhcCCCcccccccccCccccCceeecchHHHHHHHHhhhhhcccc Q lcl|NC_019418. 18 MT--TSHLSSILDHPKVAVTQSEFRRIQ---HNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIASLVYNEQA 92 (527) Q Consensus 18 ~~--~~~~~~~~~~~~i~~~~~~~~~i~---~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~ll~~e~~ 92 (527) || .+++..... +.+|. ..+.++.+....+-.+..+-..-.+-..+......+++.+|+.+.+-+. T Consensus 1 ~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~La~~~~~n~~v~scI~~ia~~ia~~~~ 70 (540) T protein:vir:41 1 MFNYHLSIKSLEK----------YRAIKGDTDSQALKEDRFEEYVEPKVHPLVLLSLLQVNPYHASACSIKANDILRTGY 70 (540) T ss_pred CCCcccChhhccc----------hhhhhccccccccccCCCCccccCCCCHHHHHHHHHhcHHHHHHHHHHHHHHhcCCc Confidence 21 111111100 11111 1122222211111000000000001111224556778888888888777 Q ss_pred eEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEEEcCCceEEEEEcCCceEEEEEEEE Q lcl|NC_019418. 93 EISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DK-IRVAFIQAPVFLPLQSNTQDVSSAAILTK 170 (527) Q Consensus 93 ~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~v~a~~~~P~~~d~~~~~~~a~~~~ 170 (527) .+...+....+++-.- .-.+...+...+.+.+..|.+++.+..+. |+ ..+.+++|+++-+.. +..+.. . T Consensus 71 ~i~~~~~~~~~~lpN~--~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~~L~~i~~~~V~v~~-~~~~~~------~ 141 (540) T protein:vir:41 71 LIDGDDGGVEELLRAC--RPSFEFILLQALEDLQVFNYCTLEVVRDDQGEPVRLDYIPAHTVRVHR-DGSRYM------Q 141 (540) T ss_pred eEecCccchhhhccCC--CCCHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCcceEEeE-cCceeE------e Confidence 7776666555544211 11234445566777888899999887764 44 467778888765432 211100 0 Q ss_pred EEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEE Q lcl|NC_019418. 171 TIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFT 250 (527) Q Consensus 171 ~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~ 250 (527) ..+.....||.. + .....+.. . -| .....++. --.. T Consensus 142 --~~d~~~~~~~~~--~--------------------~~~~~~~~----~-~g------------~~~~~~~~---~eVi 177 (540) T protein:vir:41 142 --TWDGIHVTYFKD--Y--------------------RYEGEVNP----D-NG------------EDQDGVGA---NEII 177 (540) T ss_pred --eecCceeeeeec--c--------------------cccceeec----c-cc------------ccceeecc---cceE Confidence 001111112110 0 00000000 0 00 00001110 0134 Q ss_pred EecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcc-e--eeechhHhcCCCCCCCcccc----ccccc Q lcl|NC_019418. 251 YLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQR-R--VIVPEQMTQLKVQDNQGNIA----FKRRF 323 (527) Q Consensus 251 ~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~-~--i~v~~~~l~~~~~~~~~~~~----~~~~~ 323 (527) ||+.+.++ ...+|+|.+..+...+.....+-.--.+=|+.|.. . |.++..+............. +...+ T Consensus 178 Hir~~~~~----~~~~G~Spi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~ 253 (540) T protein:vir:41 178 FIHLPSPI----CSYYGVPRYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLI 253 (540) T ss_pred EecCCCCC----CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHH Confidence 66544222 34579999887766665443332222233455432 1 22222221100000000000 00000 Q ss_pred c---------cccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccccccc---chHHHHH Q lcl|NC_019418. 324 D---------VEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGV---KTATEIV 391 (527) Q Consensus 324 d---------~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~---~TAtei~ 391 (527) . ...-+.........+...++-++......++.+..+...++|+...|++|..+|...++. .++.+.. T Consensus 254 ~~~~~g~~~nag~~~vLe~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~ 333 (540) T protein:vir:41 254 EDNFKYLKEAPHTPLVFSIPGGDTVEVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVAR 333 (540) T ss_pred HHHhccccccccceEEEecCCCcccceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHHH Confidence 0 000011111111122234455556667778899999899999999999999998754332 2333331 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHH Q lcl|NC_019418. 392 SENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKR 471 (527) Q Consensus 392 s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~ 471 (527) ... ...++.-+...++.+|...+ + . .......|.|+..-..+.+ .+....+++.+|+|++-+ T Consensus 334 ~~f--~~~tL~P~~~~ie~~ln~~L---~---------~---~~~~~~~i~f~~~~ll~~D-~~~~~~~lv~~G~lT~NE 395 (540) T protein:vir:41 334 RTY--YESVVRPQQEIVSSVLTDFI---Q---------L---KLDPGARFVFNEEILMESE-FVHNYALLVQCGVLTPSE 395 (540) T ss_pred HHH--HHHHHHHHHHHHHHHHHHhh---h---------h---ccCCceEEEecchhhcchH-HHHHHHHHHhCCCCCHHH Confidence 111 12233333333444443321 0 0 0112345667665444433 334456788999999999 Q ss_pred HHHhcCCCC---HHH------HHHHHHHHHHhccccc-cc-------ccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 472 GIAKTLGIT---EEE------AEKELAEINGELPPES-DA-------ELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 472 ~i~~~~~~~---dee------a~~el~ri~~E~~~~~-~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) ++.++.|+. |.- ....+...+.+..... .. ..+...+..+++. ..++..++-||. T Consensus 396 ~Re~L~g~e~gdd~~l~p~n~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 467 (540) T protein:vir:41 396 VREKLFGLDGGPDMFMVPSSIGKSAMKRQKRNYEKNQINEIKRTYAKYKPRIQEIISSES-PLEDKKKKIDEV 467 (540) T ss_pred HHHHhCcCcCCCcccccccccccccccccccccCCCCccccccccchhcccccCcccccc-cccccccccccc Confidence 886665543 210 0011111110000000 00 0000000000000 001111111111 No 134 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=98.73 E-value=7.8e-08 Score=59.56 Aligned_cols=466 Identities=8% Similarity=0.033 Sum_probs=189.7 Q ss_pred HHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHHHHhhh Q lcl|NC_019418. 7 VKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIASL 86 (527) Q Consensus 7 ~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~l 86 (527) ||..+++.... +..++......|+.+|.---|-+......+......+.-=+.+...++.+|+- T Consensus 1 mk~~a~~r~~~----------------l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~ 64 (542) T protein:vir:78 1 MKGLAQARYSA----------------MRADREDFLDMARRCAALTLPYLLTEDGHASGGRLQQPYQSLGSKGVNALSSK 64 (542) T ss_pred ChhHHHHHHHH----------------HHHHhhHHHHHHHHHHHHhccccCCCCCCcccccccccccchHHHHHHHHHHH Confidence 33333332211 12233445667777765433322111111111111122225567777787776 Q ss_pred hhccc--c-----eEeeCCH--------------HHH-------HHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe Q lcl|NC_019418. 87 VYNEQ--A-----EISAEDE--------------TLN-------DFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD 138 (527) Q Consensus 87 l~~e~--~-----~i~~~d~--------------~~~-------~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d 138 (527) |.+-. | ++.+++. ... +.+...+..++|...+.++..+..+.|.+.+ |.+ T Consensus 65 l~~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--~~~ 142 (542) T protein:vir:78 65 LMLSLFPIQTSFFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLV--FAG 142 (542) T ss_pred HHHhhcCCCCccccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEE--Eec Confidence 55431 1 1233221 122 2445677788999999999999999999865 456 Q ss_pred CCeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCC Q lcl|NC_019418. 139 GDKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTS 218 (527) Q Consensus 139 ~~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~ 218 (527) .+. ++.++-.+++ +..|..|++..+|....+... .-..-|- +-. .............+..+.|.|..+...+ T Consensus 143 ~~~--~~~~pl~~y~-v~~d~~G~vd~v~r~~~~t~~-ql~~~fg--~~~--l~~~~~~~~~~~~~~~~~v~~~v~pr~~ 214 (542) T protein:vir:78 143 KKT--LKVYPLDRYV-IERDGDGNVIEIITRELVDRS-LLPAEFQ--KQS--LLEGKDSNAVGEDGPKFGVAQGKGGRND 214 (542) T ss_pred CCC--ceEEecceeE-EeeCCCCCeEEEeeeeecCHH-HHHHhhc--ccc--CchHHHhhccccCCCeEEEEEEeecccC Confidence 554 4456666655 456677777666533211100 0000000 000 0000000000011122233333332211 Q ss_pred ccccCcee----ecccccCCcccc-----eeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHH Q lcl|NC_019418. 219 DSQLGERV----NLSELYPDLQPV-----TPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFM 289 (527) Q Consensus 219 ~~~lG~~v----~l~~~~~~l~~~-----~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~ 289 (527) .+ +.+.+ +....+..+... ....|...-+|..++- +...++.||+|-...+.+-+..|+..--... T Consensus 215 ~~-~~~~~~~~~~~~s~~~e~~g~~v~~~~~e~g~~~~P~i~~Rw----~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l 289 (542) T protein:vir:78 215 AE-VFTCCKLVDGQHRWHQECDGKEIKGSRSSSPLKHSPWLPLRF----NVVDGESYGRGRVEEFFGDLSSLDALTRSLI 289 (542) T ss_pred Cc-cccccccCCCeEEEEEEeccccccccccccccccCCceeeee----eecCCCccccchHHHHHHHHHHHHHHHHHHH Confidence 11 11110 000111111111 1111222222322221 2234678999999999999999998766665 Q ss_pred HHH-HcCcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHH Q lcl|NC_019418. 290 WEI-KMGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEM 368 (527) Q Consensus 290 ~e~-~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~ 368 (527) .-. ...++.+.||++.+....+.. ......+.. +..+..++..+....+...-...++.+...|.. T Consensus 290 ~~~~~a~~pp~lv~~~g~~~~~~~~----------~~~~g~iv~---g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~ 356 (542) T protein:vir:78 290 EGSAAAAKVVFMVSPSATTKPQSLA----------RAGTGAIIQ---GRAEDVSVVQANKGADFRTVQEMIRDLSQRISD 356 (542) T ss_pred HHHHHHhcCceeeccccccchhhcc----------cCCCceeec---CCccceeeeecccccchhHHHHHHHHHHHHHHH Confidence 544 346666667554321111110 000111111 111111111111111222233444444444433 Q ss_pred hcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCc Q lcl|NC_019418. 369 QIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGV 447 (527) Q Consensus 369 ~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i 447 (527) ..-+. ....+...|||||....+...+...-.-..+ ...|.-|+..++.+..-.++....+. .-+.+++--++ T Consensus 357 aFl~~----~~~d~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~--~lv~~~~~s~L 430 (542) T protein:vir:78 357 AFLIL----NVRQSERTTATEVREVQMELDRQLSGIYGSLTVELLTPYLNRKLHLMQRSKQLPSLPK--GLVMPTVVAGL 430 (542) T ss_pred Hhccc----ccCCcccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCch--hceeeeeechH Confidence 32221 1122344699999999988888877655555 44555676666665433333222222 22555554443 Q ss_pred cCCH-HHHHHH---HHHHHhc--C------CCCHHHHHH---hcCCCC-------HHHHHHHHHHHHHhcccccc----- Q lcl|NC_019418. 448 FTDR-HAELDY---WMKMVAA--G------FATQKRGIA---KTLGIT-------EEEAEKELAEINGELPPESD----- 500 (527) Q Consensus 448 ~~d~-~~~~~~---~~~~~~a--G------i~s~~~~i~---~~~~~~-------deea~~el~ri~~E~~~~~~----- 500 (527) ..-- ...++. ..+.++. | .+....++. ...|++ +|+++++-++.+..+...+. T Consensus 431 a~~~r~~~~~~l~~~~~~i~~~~~p~~l~~~id~d~~~~~~a~~~Gvp~~~i~~s~e~~~~~~~q~q~~~~~~al~~~a~ 510 (542) T protein:vir:78 431 GGVGRGEDRAALIEFMQTVGQAMGPEALQQFIDPTEFLKRLAAASGIDTLNLVKSPETMANEAQQAQQQQMTASLMGQAG 510 (542) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 1110 011111 1122211 1 122222222 224554 35555554443332211100 Q ss_pred --cccCCC--------CCCCCCCCCCCCCCCCccc Q lcl|NC_019418. 501 --AELALY--------GKGQQNTVGNSKDTVDDED 525 (527) Q Consensus 501 --~~~~~~--------~~~~~~~~~~~~~~~~~~~ 525 (527) +...+. .+++..|. +-..|.+- T Consensus 511 ~~a~~~~~~~~~~~~~a~~~~~~~---~~~~~~~~ 542 (542) T protein:vir:78 511 QLAKSPIGEKMMQQINAPGQEAPA---GPQTGEDL 542 (542) T ss_pred hccccccccchhhhcCCCCcCCCC---CCcccccC Confidence 000000 01111110 01111111 No 135 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=98.71 E-value=9.1e-08 Score=59.19 Aligned_cols=454 Identities=9% Similarity=0.029 Sum_probs=193.5 Q ss_pred hcccchhhhccCc----c-ccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHHHHhhhhhcc--c Q lcl|NC_019418. 19 TTSHLSSILDHPK----V-AVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIASLVYNE--Q 91 (527) Q Consensus 19 ~~~~~~~~~~~~~----i-~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~ll~~e--~ 91 (527) |.+...+-+.... . .+..++......|+.++.---|-+-.....+......+.-=..+...++.+|+-|.+- | T Consensus 1 m~~~~~~~~~~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 80 (535) T protein:vir:15 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLASKLMLALFP 80 (535) T ss_pred CCccchhccchHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcC Confidence 2222111111100 0 1233344456667666554333221111111111111222245667777777754442 2 Q ss_pred c----eEeeCCH-------------HHHHHH-------HHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEEEE Q lcl|NC_019418. 92 A----EISAEDE-------------TLNDFL-------SDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRVAF 146 (527) Q Consensus 92 ~----~i~~~d~-------------~~~~~l-------~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i~~ 146 (527) + ++.+.+. ..+.+| ...+..++|...+.++..+....|.+.+.+-.+. +.+++.. T Consensus 81 ~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~ 160 (535) T protein:vir:15 81 MQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGSYNPMKL 160 (535) T ss_pred CCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCCceeeEE Confidence 2 1222221 233344 4458889999999999999999999987765554 4577888 Q ss_pred EcCCceEEEEEcCCceEEEEEEEEEEee-------------------CCCcceEEEEEEEEeecccccccceeeecCCce Q lcl|NC_019418. 147 IQAPVFLPLQSNTQDVSSAAILTKTIKT-------------------ENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLY 207 (527) Q Consensus 147 v~a~~~~P~~~d~~~~~~~a~~~~~~~~-------------------~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~ 207 (527) ++-.+++- ..|..|++..++....+.. .++...+||.+... .+++.+ T Consensus 161 ~pl~~~~v-~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~-------------~~~~~~ 226 (535) T protein:vir:15 161 YRLSSYVV-QRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLD-------------EESGDY 226 (535) T ss_pred EEcCeeEE-eeCCCCCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEEEEEe-------------cCCCcE Confidence 88888774 5667777776654322210 00011122222111 112223 Q ss_pred EEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHH Q lcl|NC_019418. 208 RITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDE 287 (527) Q Consensus 208 ~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~ 287 (527) ...++++ |..+++.. .-.+...-+|..++- +...++.||+|-...+.+-++.|+..--. T Consensus 227 ~~~~e~~--------g~~~~~~~---------~~~~~~~~P~i~~Rw----~~~~ge~YGrgp~~~~l~D~k~L~~l~~~ 285 (535) T protein:vir:15 227 LKYEEVE--------DVEIDGSD---------ATYPTDAMPYIPVRM----VRIDGESYGRSYCEEYLGDLRSLENLQEA 285 (535) T ss_pred EEEEEee--------Cccccccc---------cccccccCCceeeee----eecCCCccccchHHHHHHHHHHHHHHHHH Confidence 2222221 11222110 001111122222221 22346789999999999999999987666 Q ss_pred HHHHHH-cCcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEec--cccChHHHHHHHHHHHH Q lcl|NC_019418. 288 FMWEIK-MGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLT--TPIRSSDYISAISEGLK 364 (527) Q Consensus 288 ~~~e~~-~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~--~~ir~e~~~~~~~~~l~ 364 (527) ...-.. ..++.+.||++......+...+ ....|.. + ....++.++ ..-+...-.+.++.+.. T Consensus 286 ~l~~~~~~~~p~~lv~~~g~~~~~~l~~~----------~~g~~v~---g--~~~~v~~~~~~~~~~~~~~~~~i~~~~~ 350 (535) T protein:vir:15 286 IVKMSMISAKVIGLVNPAGITQPRRLTKA----------QTGDFVP---G--RREDIDFLQLEKQADFTVAKAVSDQIEA 350 (535) T ss_pred HHHHHHHHhcCceeecccccccchhcccC----------Cceeeec---C--CcccceeeecccccchhHHHHHHHHHHH Confidence 665543 4666666655443222111100 0011111 1 111222222 11223334445555444 Q ss_pred HHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhcccCCcccCccceEEEe Q lcl|NC_019418. 365 LFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKVVGIYRGTIPELDDISVNL 443 (527) Q Consensus 365 ~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f 443 (527) .|....=+. .+....+...|||||....+...+...-.-..+ ...|.-|+..++.+....++.... +...+.++| T Consensus 351 ~I~~af~~~--~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~--p~~~v~~~y 426 (535) T protein:vir:15 351 RLSYAFMLN--SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPEL--PKEAVEPTI 426 (535) T ss_pred HHHHHHhhh--hcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC--CccceeEEE Confidence 443322111 122233455799999999888887776644444 345566676666554333332222 223366666 Q ss_pred CCCccCCHH-HHHHHHHHHHh--cCC--------CCHHHHHH---hcCCC-------CHHHHHHHHHHHHHhcccc---c Q lcl|NC_019418. 444 DDGVFTDRH-AELDYWMKMVA--AGF--------ATQKRGIA---KTLGI-------TEEEAEKELAEINGELPPE---S 499 (527) Q Consensus 444 ~d~i~~d~~-~~~~~~~~~~~--aGi--------~s~~~~i~---~~~~~-------~deea~~el~ri~~E~~~~---~ 499 (527) --++..-.. ..++..++..+ +++ +....++. ...|+ +++|++++.++.++.+... . T Consensus 427 is~La~aqr~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~~~~eev~~~~~q~~~~~~~~~~a~ 506 (535) T protein:vir:15 427 STGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGIENAAA 506 (535) T ss_pred ecHHHHHHHHHHHHHHHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHH Confidence 444322111 11111111110 111 12222222 22343 4455555544433222111 1 Q ss_pred ccccCCCCCCCCCCC--CCCCCCCCccccC Q lcl|NC_019418. 500 DAELALYGKGQQNTV--GNSKDTVDDEDEA 527 (527) Q Consensus 500 ~~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 527 (527) ............+|. ..--+. -+.+-+ T Consensus 507 ~~g~~~~~~~~~~p~~~~~~~~~-~g~~~~ 535 (535) T protein:vir:15 507 TGGAGVGALATSSPEAMQGAAAQ-AGLDAT 535 (535) T ss_pred HHHhhccchhccChHHHHHHHhc-cCCCCC Confidence 000001000000000 000000 000000 No 136 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=98.70 E-value=1e-07 Score=58.95 Aligned_cols=450 Identities=8% Similarity=0.022 Sum_probs=191.5 Q ss_pred hcccchhhhccCcc-----ccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHHHHhhhhhcc--c Q lcl|NC_019418. 19 TTSHLSSILDHPKV-----AVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIASLVYNE--Q 91 (527) Q Consensus 19 ~~~~~~~~~~~~~i-----~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~ll~~e--~ 91 (527) |.+...+-+.+..+ .+..++......|+.++.---|-+-.....+......+.-=..+...++.+|+-|.+- | T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 80 (535) T protein:vir:33 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLASKLMLALFP 80 (535) T ss_pred CChhhhhccChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcC Confidence 22222111111111 1233344456667766654333221111111111111111245667777777754442 2 Q ss_pred ce----EeeCCH-------------HHHHH-------HHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-CeeEEEE Q lcl|NC_019418. 92 AE----ISAEDE-------------TLNDF-------LSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKIRVAF 146 (527) Q Consensus 92 ~~----i~~~d~-------------~~~~~-------l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~~i~~ 146 (527) ++ +.+.+. ..+++ +...+..++|...+.++..+....|.+.+.+-.+. +.+++.. T Consensus 81 ~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~ 160 (535) T protein:vir:33 81 MQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGSYNPMKL 160 (535) T ss_pred CCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCCceeeEE Confidence 21 222221 12333 34558889999999999999999999988876664 4577888 Q ss_pred EcCCceEEEEEcCCceEEEEEEEEEEee-------------------CCCcceEEEEEEEEeecccccccceeeecCCce Q lcl|NC_019418. 147 IQAPVFLPLQSNTQDVSSAAILTKTIKT-------------------ENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLY 207 (527) Q Consensus 147 v~a~~~~P~~~d~~~~~~~a~~~~~~~~-------------------~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~ 207 (527) ++-.+++- ..|..|++..++....+.. .++...+||++-+ ..+++++ T Consensus 161 ~pl~~~~v-~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~v~~-------------~~~~~~~ 226 (535) T protein:vir:33 161 YRLSSYVV-QRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYTHVYL-------------DEESGDY 226 (535) T ss_pred EEcCeeEE-eeCCCCCeeEEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEEEEEe-------------eCCCCcE Confidence 88888774 5667777776654322210 0011112222111 1112222 Q ss_pred EEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHH Q lcl|NC_019418. 208 RITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDE 287 (527) Q Consensus 208 ~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~ 287 (527) ...++++ |..+++... -.+....+|..++- +...++.||+|-...+.+-++.|+..--. T Consensus 227 ~~~~~~~--------~~~~~~~~~---------~~~~~~~P~i~~Rw----~~~~ge~YGrgp~~~~l~D~k~L~~l~~~ 285 (535) T protein:vir:33 227 LKYEEVE--------DVEIDGSDA---------TYPTDAMPYIPVRM----VRIDGESYGRSYCEEYLGDLRSLENLQEA 285 (535) T ss_pred EEEEEEe--------Ccccccccc---------ccccccCCceeeee----eecCCCccccchHHHHHHHHHHHHHHHHH Confidence 2222221 112211110 00111112222221 22346789999999999999999987666 Q ss_pred HHHHHH-cCcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEec--cccChHHHHHHHHHHHH Q lcl|NC_019418. 288 FMWEIK-MGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLT--TPIRSSDYISAISEGLK 364 (527) Q Consensus 288 ~~~e~~-~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~--~~ir~e~~~~~~~~~l~ 364 (527) ...-.. ..++.+.||++......+...+ ....|.. + ....++.++ ..-+...-.+.++.+.. T Consensus 286 ~l~~~~~~~~p~~lv~~~g~~~~~~~~~~----------~~g~~v~---g--~~~~v~~~~~~~~~~~~~~~~~i~~~~~ 350 (535) T protein:vir:33 286 IVKMSMISAKVIGLVNPAGITQPRRLTKA----------QTGDFVP---G--RREDIDFLQLEKQADFTVAKAVSDQIEA 350 (535) T ss_pred HHHHHHHHhcCceeeccccccchhhcccC----------Cceeeec---C--CcccceeeecccccchhHHHHHHHHHHH Confidence 665543 4666666655443222111100 0011111 1 111222222 11223333444554444 Q ss_pred HHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhcccCCcccCccceEEEe Q lcl|NC_019418. 365 LFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKVVGIYRGTIPELDDISVNL 443 (527) Q Consensus 365 ~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f 443 (527) .|....=+. .+....+...|||||....+...+...-.-..+ ...|.-|+..++.+....++.... +...+.++| T Consensus 351 ~I~~af~~~--~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~--p~~~v~~~y 426 (535) T protein:vir:33 351 RLSYAFMLN--SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPEL--PKEAVEPTI 426 (535) T ss_pred HHHHHHhhh--hcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC--CccceeEEE Confidence 443322111 122233455799999999888887776654444 345566676666554333332222 223466666 Q ss_pred CCCccCCHH-HHHHHHHHHHh--cCC--------CCHHHHHH---hcCCCC-------HHHHHHHHHHHHHhcccccccc Q lcl|NC_019418. 444 DDGVFTDRH-AELDYWMKMVA--AGF--------ATQKRGIA---KTLGIT-------EEEAEKELAEINGELPPESDAE 502 (527) Q Consensus 444 ~d~i~~d~~-~~~~~~~~~~~--aGi--------~s~~~~i~---~~~~~~-------deea~~el~ri~~E~~~~~~~~ 502 (527) --++..-.. ..++..++..+ +++ +....++. ...|++ ++|+++..++.++++.... .. T Consensus 427 is~La~aqr~~~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~Gvp~~~i~~~~ee~~~~~~q~~~~~~~~~-~~ 505 (535) T protein:vir:33 427 STGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGVEN-AA 505 (535) T ss_pred ecHHHHHHHHHHHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHHHHcCCCHhHhcCCHHHHHHHHHHHHHHHHHHH-HH Confidence 444322111 11111111110 111 12222222 223543 4444444333222111100 00 Q ss_pred cCCCCCCCCCCCCCCCCCCCccc---------cC Q lcl|NC_019418. 503 LALYGKGQQNTVGNSKDTVDDED---------EA 527 (527) Q Consensus 503 ~~~~~~~~~~~~~~~~~~~~~~~---------~~ 527 (527) ...++..... ...+..+.. -+ T Consensus 506 ~~~g~~~~~~----~~~~~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:33 506 AAGGAGVGAL----ATSSPEAMQGAAAKAGLNAT 535 (535) T ss_pred Hhhhhhhcch----hhcCChhHHHHHHhccCCCC Confidence 0000000000 000000000 00 No 137 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=98.67 E-value=1.3e-07 Score=58.35 Aligned_cols=389 Identities=10% Similarity=0.049 Sum_probs=167.0 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) |++|.+.+.- + ..+.+++ ..|..++.+..... .+.. +..+...--..++ T Consensus 1 M~~f~~~~~~----------~--------~~~~~~~------~~~~~~~~~~~~~~---~v~~----~~al~~~~V~~~v 49 (397) T protein:vir:38 1 MPLLKLNKSH----------S--------QGFSLND------PDWVNFLTGGEAQK---YVSA----DTALKNSDIFSLI 49 (397) T ss_pred Ccchhhhhcc----------c--------CcccCCc------hhhhhhhcCCcCCc---eech----HHhhccHHHHHHH Confidence 9998765422 0 1111221 12444433321100 0111 1111111222345 Q ss_pred HHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCC-e-eEEEEEcCCceEEEEEc Q lcl|NC_019418. 81 KKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGD-K-IRVAFIQAPVFLPLQSN 158 (527) Q Consensus 81 ~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~-~-~~i~~v~a~~~~P~~~d 158 (527) +.+|+-+-.-| +.+++......+.+--..-.....++.++.+.+..|.+++.+..+.. . +.+.+++|..+-+.... T Consensus 50 ~~ia~~ia~~p--~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v~i~~~~ 127 (397) T protein:vir:38 50 MQLSGDLAMVR--YTSESDRSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQVQPMLLQ 127 (397) T ss_pred HHHHHHHhhCc--ccccccHHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcC Confidence 55555554323 44555544433322111112333345566677778999888877753 3 45677788776553222 Q ss_pred CCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccc Q lcl|NC_019418. 159 TQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPV 238 (527) Q Consensus 159 ~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~ 238 (527) ++ +..+|.. .. .....|..+.+ + +. T Consensus 128 ~~-----------------~~~~y~~-~~------------------------------~~~~~~~~~~~---~---~~- 152 (397) T protein:vir:38 128 DG-----------------SGLIYNI-NF------------------------------DEPAIGYMENV---P---AA- 152 (397) T ss_pred CC-----------------ceEEEEE-Ee------------------------------ccccccceeEe---c---Cc- Confidence 11 1112210 00 00000111110 0 00 Q ss_pred eeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCc-ceeeechhHhcCCCCCCCcc- Q lcl|NC_019418. 239 TPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQ-RRVIVPEQMTQLKVQDNQGN- 316 (527) Q Consensus 239 ~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~-~~i~v~~~~l~~~~~~~~~~- 316 (527) -..|++.+.++ +..+|+|.+..+...+......-.-..+-|..|. +..++ .......... T Consensus 153 ---------eiih~~~~~~~----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il-----~~~~~~~~e~~ 214 (397) T protein:vir:38 153 ---------DVIHIRLLSKN----GGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVL-----TIQKGGLLDAE 214 (397) T ss_pred ---------cEEEecCCCCC----CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEE-----EeCCCCCHHHH Confidence 13466544333 2346999998888777655444333334455433 23222 1111111000 Q ss_pred cccccccccccceeeeccCC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHH Q lcl|NC_019418. 317 IAFKRRFDVEQNVYMQVGAG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVS 392 (527) Q Consensus 317 ~~~~~~~d~~~~~~~~~~~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s 392 (527) -.....|. ..+.+-+.+ -..+..++.++....+.++.+..+...++|+...|++|..+|...+...+.++... T Consensus 215 ~~~~~~~~---~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~e~~~~ 291 (397) T protein:vir:38 215 TRIARSKE---ISKQIHNSDGPVVIDALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQSSITQISG 291 (397) T ss_pred HHHHHHHH---HHhcccccCCceecCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHH Confidence 00000110 111110000 01223455566666777888888888999999999999999876543322222211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHH Q lcl|NC_019418. 393 ENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRG 472 (527) Q Consensus 393 ~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~ 472 (527) .+..+|..++..|....+. .++. ..+ +++..-+-.|.++.++...+++.+|+|++-++ T Consensus 292 --------------~~~~~l~P~~~~ie~~ln~-~l~~-----~~~--~~~~~~~~~d~~~~~~~~~~~~~~G~~t~nE~ 349 (397) T protein:vir:38 292 --------------QYAKSLNRYVQAIVGELND-KLHA-----NIS--ANIRFAIDAMGDQYASTISSSVKGGTIAGNQA 349 (397) T ss_pred --------------HHHHHHHHHHHHHHHHHHH-hccC-----hhc--ccccccccCCHHHHHHHHHHHHhCCCcCHHHH Confidence 1223333333333221111 1111 111 22222345678888888899999999999998 Q ss_pred HHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 473 IAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 473 i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) +..+ |+..-+.. ++....... .........++++++.++.++.+++ +| T Consensus 350 R~~l-g~~p~~~~-d~~~~~~~~---~~~~~~~~~~~g~~~~~~~~e~~~~-~~ 397 (397) T protein:vir:38 350 RFIL-QNSGYLAK-DLPDPEKEP---QQAIQLIQQEGGENDGNNSDERGSD-PE 397 (397) T ss_pred HHHh-CCCCCCCC-ccccccccc---cccccccccccCCCCCCCCCCCCCC-CC Confidence 7653 33210000 000000000 0011111222333333333333333 33 No 138 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=98.64 E-value=1.1e-07 Score=58.84 Aligned_cols=489 Identities=7% Similarity=0.001 Sum_probs=211.8 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) =..+.+++.+|+.-. .-.++-.....+..+||.|. .|......... ...+..+|+=+.+| T Consensus 5 ~~~~~~~~~~~~~~~-----------------~~~~~~r~~a~~d~~fy~G~--Qw~~~~~~~l~-~q~rp~~N~i~~~i 64 (725) T protein:vir:92 5 ENRLESILSRFDADW-----------------TASDEARREAKNDLFFSRIS--QWDDWLSQYTT-LQYRGQFDVVRPVV 64 (725) T ss_pred HHHHHHHHHHHHHHH-----------------HhhHHHHHHHHHHHHhhcCC--CCCHHHHHHHH-hcCCCcccchHHHH Confidence 124555555554311 12344555666788899984 33211111111 11122457777777 Q ss_pred HHHhhhhhcccceEee-----CCHHHHHHHHH----HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe---C----CeeEE Q lcl|NC_019418. 81 KKIASLVYNEQAEISA-----EDETLNDFLSD----MLSNDRFNKNFERYLESALALGGLAMRPYVD---G----DKIRV 144 (527) Q Consensus 81 ~~~A~ll~~e~~~i~~-----~d~~~~~~l~~----~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d---~----~~~~i 144 (527) +...++--...+.+.+ ++...++.|.. +.+.+++......+...++..|-+|+.++.| . +.++| T Consensus 65 ~~v~g~e~~nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i 144 (725) T protein:vir:92 65 RKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPTSNNQVI 144 (725) T ss_pred HHHHhhHHhCCcceEEecCCccHHHHHHHHHHHHHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCCCCCCCceee Confidence 7777766555666655 24445555544 4556777888889999999999999999765 1 23444 Q ss_pred EEE----cCCceEEEEEcCCce----EEE--EEEEEEEeeC-------CCcceEEEEEEEEeeccc---cccc-ceeeec Q lcl|NC_019418. 145 AFI----QAPVFLPLQSNTQDV----SSA--AILTKTIKTE-------NRKNVYYTLVEFHEWVTP---TGQE-VGSTKD 203 (527) Q Consensus 145 ~~v----~a~~~~P~~~d~~~~----~~~--a~~~~~~~~~-------~~~~~~yt~lE~h~~~~~---~~~~-~~~~~~ 203 (527) ..+ +..++| ||..-. ..| +|..+++..+ ..+..+....+++....+ |... .... T Consensus 145 ~~~~i~~~~~~V~---~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv-- 219 (725) T protein:vir:92 145 RREPIHSACSHVI---WDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQI-- 219 (725) T ss_pred EEeeccCChhhcc---cCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCCCeEEE-- Confidence 433 233343 222111 111 1111111110 000000000111110000 0000 0000 Q ss_pred CCceEE---EEEEEecCCccccCceeecc--c---cc-----CC------------------cccceeecCCC---cccE Q lcl|NC_019418. 204 KSLYRI---TNELYKSTSDSQLGERVNLS--E---LY-----PD------------------LQPVTPIQGLS---RPLF 249 (527) Q Consensus 204 ~~~~~I---~n~ly~~~~~~~lG~~v~l~--~---~~-----~~------------------l~~~~~~~g~~---~p~f 249 (527) ...|+. .-.+|...+ ...|..+... + +. ++ +.+...+.+-. .-.| T Consensus 220 ~e~~~r~~~~~~~~~~~d-~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~~ 298 (725) T protein:vir:92 220 AEFYEVVEKKETAFIYQD-PVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHI 298 (725) T ss_pred EEEEEEEEEeeeEEeecC-CCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCCCCce Confidence 000000 001121111 1122222111 0 00 00 00111111100 0012 Q ss_pred EEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHH-cCcceeeechhHhcCCCCCCCcccccccccc-ccc Q lcl|NC_019418. 250 TYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIK-MGQRRVIVPEQMTQLKVQDNQGNIAFKRRFD-VEQ 327 (527) Q Consensus 250 ~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~-~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d-~~~ 327 (527) -|+|.-+.=....++|++-+.+.++++.++.+|...|...+-+- ..+.+..++.+.+..... .+. ++. T Consensus 299 P~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~----------~~~~~~~ 368 (725) T protein:vir:92 299 PIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEH----------MYDGNDD 368 (725) T ss_pred eeEEEEeeeeccCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHH----------HHhccCc Confidence 22221111111245777778899999999999999999998884 466666666665521100 000 111 Q ss_pred ceeeecc---CCCC--CCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHH Q lcl|NC_019418. 328 NVYMQVG---AGNM--DSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRN 402 (527) Q Consensus 328 ~~~~~~~---~~~~--~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~ 402 (527) ..|...+ ..++ ....++...+.=-..++...++.....|....|++...+|..++. .++-.|.++......... T Consensus 369 ~~~~~~~~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~-~SG~ai~~rq~qg~~~l~ 447 (725) T protein:vir:92 369 YPYYLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQ-VAYDTVNQLNMRADLETY 447 (725) T ss_pred cceeeccccccccccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCchh-hHHHHHHHHHHHHHHHHH Confidence 1121111 1111 112344443332234577888888888999999999988886543 456677777777776767 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhh-h------cccCC-----------cc--------------cCccceEEEeCCCccCC Q lcl|NC_019418. 403 SIVALVEQSIKELCVSMCELGKV-V------GIYRG-----------TI--------------PELDDISVNLDDGVFTD 450 (527) Q Consensus 403 ~~~~~~~~al~~li~~il~~~~~-~------~~~~~-----------~~--------------~~~~~v~v~f~d~i~~d 450 (527) .....+..+.+.+-+.+|.+... + .+.+. .. ...++|+|+=..+.+.- T Consensus 448 ~~~Dnl~~~~~~~g~~lL~lI~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~ 527 (725) T protein:vir:92 448 VFQDNLATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSM 527 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccChHHH Confidence 77777777777777766665322 1 01000 00 01234444433332222 Q ss_pred HHHHHHHHHHHHhcC--CCCH-HHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccc- Q lcl|NC_019418. 451 RHAELDYWMKMVAAG--FATQ-KRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDED- 525 (527) Q Consensus 451 ~~~~~~~~~~~~~aG--i~s~-~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 525 (527) +++.++.++++..+- ..+. -..+.......+-+ +.+.+++|+....+.....+..+-+.+ ......-......+ T Consensus 528 r~~~~~~l~ql~~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q-~~~~~qqa~~~q~~~ 606 (725) T protein:vir:92 528 KQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQ-WLVEAQQAKQGQQDP 606 (725) T ss_pred HHHHHHHHHHHHHhcccchhHHHHHHHHHhhcccchHHHHHHHHHHhhhchhccCCccchhhhH-HHHHHHHHHHhhhHH Confidence 345555555555331 1111 11233333333322 334455565544332211111000000 00000000000000 Q ss_pred -----cC Q lcl|NC_019418. 526 -----EA 527 (527) Q Consensus 526 -----~~ 527 (527) ++ T Consensus 607 e~~~~qa 613 (725) T protein:vir:92 607 AMVQAQG 613 (725) T ss_pred HHHHHHH Confidence 00 No 139 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=98.63 E-value=1.7e-07 Score=57.73 Aligned_cols=452 Identities=8% Similarity=0.039 Sum_probs=177.5 Q ss_pred CChHHHHHHHHHHHHHH-----h-hcccchhh-hcc------Cc-cccCHHHHHHHHHHHHHhcC---CCcccccccccC Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYN-----M-TTSHLSSI-LDH------PK-VAVTQSEFRRIQHNLAYYQS---KFDDIEYTNTDG 63 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~-----~-~~~~~~~~-~~~------~~-i~~~~~~~~~i~~~~~~y~g---~~~~l~~~~~~~ 63 (527) -|+|+++..++|+--.- + ....|.+. ... .. ..=++.+....+.....-.| ..+++ .+..+- T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~d~~~~~~~r~g~~~~~~~~g~~~~~-epp~d~ 86 (648) T protein:vir:79 8 RGFWSRISLMWRDEDDDKEPLVLEESMQLGEAPGAMPKGGGGGGSAKRDPKMSLVKRIGLAIMDGGGGGRDFE-EPEFDF 86 (648) T ss_pred chhhhhhhhhccCccccccccccccccccCCCccccCCCCcccccccccchhHHHHHhHHHHHhhcCCccccc-cCCcCH Confidence 78999999999831100 0 00000000 000 00 00022222211111111111 11111 111110 Q ss_pred ccccCceeecchHHHHHHHHhhhhhcccceEeeCCHHHHHHH--H-HHHhh---hhHHHHHHHHHHHHHhcCCEEEEEEE Q lcl|NC_019418. 64 DRKRRKMQHLPIARTAAKKIASLVYNEQAEISAEDETLNDFL--S-DMLSN---DRFNKNFERYLESALALGGLAMRPYV 137 (527) Q Consensus 64 ~~~~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l--~-~~l~~---n~f~~~~~~~~~~a~~~G~~~~~~~~ 137 (527) ..-.+-...-+.....++.+|+.+.+-+..+..+++...+.. . .++.. ......+...+.+.+..|.+|+.+.. T Consensus 87 ~~l~~l~~~np~V~~aI~iia~~ia~l~~~i~~~~~~~~~~~~~~~ll~rPn~~~t~~~f~~~l~~~lll~GNAYveiiR 166 (648) T protein:vir:79 87 NEITSAYNTEGYVRQAVDKYIEMMFKADWDFVSKNPNAVEYIRMRFTLMAEATQIPTNQLFIEIAEDLVKYCNVVIAKSR 166 (648) T ss_pred HHHHHHHhcChHHHHHHHHHHHHHhhCcceEEecCCccchhhHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEe Confidence 000111122344566777778777776666655443221111 1 11111 12334566678888899999998887 Q ss_pred eCCeeEEEEE---cC------CceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceE Q lcl|NC_019418. 138 DGDKIRVAFI---QA------PVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYR 208 (527) Q Consensus 138 d~~~~~i~~v---~a------~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~ 208 (527) ++++....++ .. ..++|+.. ..+. +..+. .+. T Consensus 167 d~~G~~~~~l~~~~~~~~~~v~~l~pl~p--~~v~--------v~~d~-----------------------------~g~ 207 (648) T protein:vir:79 167 AKDALPFQGMNVMGVGDSMPVAGYFPLNL--ASMK--------VKRDK-----------------------------FGM 207 (648) T ss_pred cCCCccchhhhhhhhccccceeeeEeecC--ceeE--------EEEcC-----------------------------CCc Confidence 7654221111 11 11222210 0000 00000 011 Q ss_pred EEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHH Q lcl|NC_019418. 209 ITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEF 288 (527) Q Consensus 209 I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~ 288 (527) |....|.... -+..+++ .+. -..||+.. ...+.++|+|.+..+...|.....+-.-- T Consensus 208 ~~~Y~y~~~g---~~~~~~~-------~~~---------dIIHik~~----~~~d~~~GlSpi~~a~~aI~l~~aa~~~~ 264 (648) T protein:vir:79 208 IKGWQQEQEG---QDKPQKF-------KPE---------DIVHIYYK----REKGRAFGTPWLLPALDDIRALRQVEENV 264 (648) T ss_pred eeeeEEEecC---CceeEEe-------cCc---------cEEEEccC----CCCCCceeccHHHHHHHHHHHHHHHHHHH Confidence 1111111100 0111111 000 13456532 11235679999988877775443322222 Q ss_pred HHHHHcCcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceE--ecccc--ChHHHHHHHHHHHH Q lcl|NC_019418. 289 MWEIKMGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVD--LTTPI--RSSDYISAISEGLK 364 (527) Q Consensus 289 ~~e~~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~--~~~~i--r~e~~~~~~~~~l~ 364 (527) .+-|+.|.+ |..++......... ....+..+.-+.-|.++....+. ...+. +++.. .+.++.+..+...+ T Consensus 265 ~~fF~NGa~----P~gil~~~~~~~~~-e~~k~~~e~~~~~~~~~~i~gg~-v~~~~~~i~~~~s~~dlqfle~rk~~~~ 338 (648) T protein:vir:79 265 LRLVYRNLH----PLWHVKVGLEQEGF-GAEEGEVDLVRGEVENMDVEGGM-VTTERVNISSIASNQIIDAKEYLKHFEQ 338 (648) T ss_pred HHHHhccCC----ccEEEEeCCCccch-HHHHHHHHHHHHhcccccccccc-cccceeeccccCCHHHHHHHHHHHHHHH Confidence 223454432 22222211111110 00001001011123332221111 11111 12211 24467777788888 Q ss_pred HHHHhcCCCccccccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEe Q lcl|NC_019418. 365 LFEMQIGVSSGMFTFDGQGV-KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNL 443 (527) Q Consensus 365 ~i~~~~g~s~~~~~~~~~g~-~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f 443 (527) +|+...|++|..+|...++. .|+.+....+ ..++.-.+..+...+...+...+.+... ++.-......+.++| T Consensus 339 eIa~aFgVPP~lLG~~~~ss~stae~~~~~~---~~~i~~l~~~i~~~le~~~~~~ll~e~~---l~~~l~~d~~ieF~~ 412 (648) T protein:vir:79 339 RAFTVLGVSELMMGRGGTASRSTGDNLSSDF---KDRIKALQKVMATFINEFMVKEILMEGG---FDPVLNPDDKVEFRF 412 (648) T ss_pred HHHHHhCCCHhHcccCCCccchHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhhhhh---ccccccccceEEEee Confidence 99999999999998765433 3343332222 2333333444444443322211111111 111222345678889 Q ss_pred CCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCHHHHHHHHHH----HHHhcccccccccCC-CCCCCCCC--- Q lcl|NC_019418. 444 DDGVFTDRHAELDYWMKMVAAGFATQKRGIAKT--LGITEEEAEKELAE----INGELPPESDAELAL-YGKGQQNT--- 513 (527) Q Consensus 444 ~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~--~~~~deea~~el~r----i~~E~~~~~~~~~~~-~~~~~~~~--- 513 (527) ++-...|..+.++...+++++|+||+-+++... .++.+.+-...+.. ...+..+......+. .+....+. T Consensus 413 ~~Llr~D~~~~a~~~~~l~~~GilT~NEaR~~lGlpPi~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~eg~ 492 (648) T protein:vir:79 413 NEIDMDSKIKLENQAVFLYEHNAISEDEMRELIGRDPVDDGEGRAKMHLQMVTIAQATALAALAPTPAGGSSASASGDKK 492 (648) T ss_pred cccchhhHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCccccccccccchhccccccCCCCCCCCCCCCcccccc Confidence 888888888888889999999999999987654 23332211111110 001111111010000 00000000 Q ss_pred CCCCCCCCCccccC Q lcl|NC_019418. 514 VGNSKDTVDDEDEA 527 (527) Q Consensus 514 ~~~~~~~~~~~~~~ 527 (527) ..+.++...+++.+ T Consensus 493 ~~e~~~~~~~~~~~ 506 (648) T protein:vir:79 493 KKATDNKTKPTNQH 506 (648) T ss_pred ccccCCCCCCCCCC Confidence 00001111111111 No 140 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=98.63 E-value=1.7e-07 Score=57.71 Aligned_cols=394 Identities=11% Similarity=0.072 Sum_probs=167.2 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) |++|++ +|++ ++- ..+..+. .|..++.+..+...-..+... ..+...--...+ T Consensus 1 Mg~f~~---lf~r-------~~~------~~~~~~~-------~~~~~~~~~~~~~~g~~v~~~----~al~~~~v~~~i 53 (414) T protein:vir:44 1 MVFFSG---LFQR-------KSD------APVTTPA-------ELADAIGLSYDTYTGKQISSQ----RAMRLTAVFSCV 53 (414) T ss_pred Cchhhh---hhcc-------Ccc------Ccccchh-------hHhHhhccCccccCCceechh----hhhccHHHHHHH Confidence 999764 4553 111 1111111 123333222111110111111 111111223444 Q ss_pred HHHhhhhhcccceEe---------eCCHHHHHHHHH-HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCee-EEEEEcC Q lcl|NC_019418. 81 KKIASLVYNEQAEIS---------AEDETLNDFLSD-MLSNDRFNKNFERYLESALALGGLAMRPYVDGDKI-RVAFIQA 149 (527) Q Consensus 81 ~~~A~ll~~e~~~i~---------~~d~~~~~~l~~-~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~-~i~~v~a 149 (527) +.+|+-+-.-|..+- +.+..+...|.. --........+..++...+..|.+++.+.-+++++ .+..++| T Consensus 54 ~~Ia~~ia~~p~~~~~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~~g~~~~L~~l~~ 133 (414) T protein:vir:44 54 RVLAESVGMLPCNLYHLNGSLKQRATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKAFGEVAELLPVDP 133 (414) T ss_pred HHHHHHhccCceEEEEecCCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeCCCcEEEEEEEcC Confidence 555555544443321 112222333321 10011223334455666777899988876666654 4566677 Q ss_pred CceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecc Q lcl|NC_019418. 150 PVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLS 229 (527) Q Consensus 150 ~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~ 229 (527) ..+-+...+ ++ ..+|. ++. ..+... .| + T Consensus 134 ~~v~~~~~~-~~-----------------~~~y~---~~~---------------~~g~~~--~~------------~-- 161 (414) T protein:vir:44 134 GCVVPKLNS-SW-----------------EPVYQ---VTF---------------PDGSTD--VL------------S-- 161 (414) T ss_pred ceEEEEECC-CC-----------------cEEEE---EEe---------------cCceEE--EE------------c-- Confidence 665543211 11 11121 000 000000 00 0 Q ss_pred cccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCC Q lcl|NC_019418. 230 ELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLK 309 (527) Q Consensus 230 ~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~ 309 (527) +. -+.||+.+.. ....|+|.+.-+...++.....-.-..+-|..|.+ |..++... T Consensus 162 ------~~----------evih~~~~~~-----d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~----p~gil~~~ 216 (414) T protein:vir:44 162 ------QE----------DIWHVRTLTL-----DGLVGLNPIAYAREAISLAAATEEHGARLFSNGAV----TSGVLRTE 216 (414) T ss_pred ------cc----------cEEEecCCCC-----CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCC----CceEEEeC Confidence 00 1245553321 23579999988887776544443333333454332 11222222 Q ss_pred CCCCCc-ccccccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccc Q lcl|NC_019418. 310 VQDNQG-NIAFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQG 383 (527) Q Consensus 310 ~~~~~~-~~~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g 383 (527) ...... .-.....| ...|.+.+ .+ -.++..++.++......++.+..+....+|+...|++|..++...++ T Consensus 217 ~~l~~e~~~~~~~~~---~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~ 293 (414) T protein:vir:44 217 QTLSDQAYERLKKDF---EERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRA 293 (414) T ss_pred CCCCHHHHHHHHHHH---HHHhcCccccCcceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC Confidence 111100 00000111 11122211 00 01222456666666677888888888889999999999999875543 Q ss_pred -cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHH Q lcl|NC_019418. 384 -VKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMV 462 (527) Q Consensus 384 -~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~ 462 (527) -.++.+.. +..++.+|..++..|-...+. .++.........+.++++.-+..|..+.++...+++ T Consensus 294 t~~n~e~~~-------------~~~~~~~l~P~~~~ie~~ln~-~L~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~ 359 (414) T protein:vir:44 294 TFNNIEELG-------------LGFINYSLVPYLTRIEQRINT-GLVRKSKQGVFYAKFNAGALLRGDMKSRFEAYATGI 359 (414) T ss_pred CcccHHHHH-------------HHHHHHHHHHHHHHHHHHHHh-hcCCccccCceEEEEechhhhccCHHHHHHHHHHHH Confidence 23333321 222344444444444322211 122211122334556666666778889899999999 Q ss_pred hcCCCCHHHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCC-CccccC Q lcl|NC_019418. 463 AAGFATQKRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTV-DDEDEA 527 (527) Q Consensus 463 ~aGi~s~~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 527 (527) .+|+|++-+++.. .|++.-+ ..+-+.. ... . ..+. +..+.+.+.++. .|+..| T Consensus 360 ~~G~~t~NE~R~~-~gl~p~~ggD~~~~~--~n~-----~--~~~~--~~~~~~~~~~~~~~d~~~~ 414 (414) T protein:vir:44 360 NWGIYSPNDCRDL-EDMNPRPGGDVYLTP--MNM-----T--TKPS--DGSKAGKQKDNANADETTS 414 (414) T ss_pred hCCCcCHHHHHHH-hCCCCCCCcceeccc--ccc-----c--ccCC--ccccCCCCCCCCCCCCCCC Confidence 9999999997754 3664311 1110000 000 0 0000 011111112222 222222 No 141 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=98.61 E-value=1.9e-07 Score=57.37 Aligned_cols=472 Identities=7% Similarity=0.004 Sum_probs=183.3 Q ss_pred HHHhhcccchhhhccCcc-ccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHHHHhhhhhcc--c Q lcl|NC_019418. 15 RYNMTTSHLSSILDHPKV-AVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIASLVYNE--Q 91 (527) Q Consensus 15 ~~~~~~~~~~~~~~~~~i-~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~ll~~e--~ 91 (527) |..+-...+.+-+.+... .+..++......|+.+|.---|-+-.....+......+.-=..+...++.+|+-|.+- | T Consensus 1 ~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 80 (543) T protein:vir:88 1 MAETKREGLAEEGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDNSSTDYTTPWQAVGARGLNNLSAKVMLALFP 80 (543) T ss_pred CcccccCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccccCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Confidence 111101100000000001 1234445556667776654333221111111111111222245667777777754442 2 Q ss_pred ce----EeeCCH-------------HHHH-------HHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCe-eE--- Q lcl|NC_019418. 92 AE----ISAEDE-------------TLND-------FLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDK-IR--- 143 (527) Q Consensus 92 ~~----i~~~d~-------------~~~~-------~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~-~~--- 143 (527) ++ +.+.+. ..+. .+...+..++|...+.++..+....|.+.+.+--|.++ ++ T Consensus 81 ~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~~~~~~~ 160 (543) T protein:vir:88 81 LQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALIYLPPPDASSNSYNP 160 (543) T ss_pred CCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeccCccccceecc Confidence 21 222221 1222 34456777899999999999999999998654434332 22 Q ss_pred EEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCcccc- Q lcl|NC_019418. 144 VAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQL- 222 (527) Q Consensus 144 i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~l- 222 (527) +..++-.+++ +..|..|++.+++......... |.-+ +.... ........+....|-|.+|.-.+...- T Consensus 161 ~~~~pl~~y~-v~~d~~G~v~~i~r~~~~~~~~--------l~~~-~~~~v-~~~~~~~p~~~~~v~~~V~pr~~~~~~~ 229 (543) T protein:vir:88 161 MKLYTLHNHV-VQRDAFGNVLQIVTLDKVAYAA--------LPED-VRNSL-SGGQEYKPEQELEVYTHIYIDDESGDFL 229 (543) T ss_pred eEEeEcceEE-EeeCCCCCeeeeeeeeeccHHH--------HhHH-hhHHH-HHHhhcCCccceEEEEEEEeecCCCccc Confidence 3334445433 4567778777766543221000 0000 00000 000000001122222333321111100 Q ss_pred ------CceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHH-HcC Q lcl|NC_019418. 223 ------GERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEI-KMG 295 (527) Q Consensus 223 ------G~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~-~~~ 295 (527) |..|+.++ ....+ ..-+|..++ =+...++.||+|-...+.+-++.|+..--....-. ... T Consensus 230 ~~~~~~~~~v~~~~------~~~~~---~e~P~i~~R----w~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~ 296 (543) T protein:vir:88 230 SYQEIEGVEVDGSD------GQYPQ---DALPWIAVR----WTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISS 296 (543) T ss_pred ccccccCeeeecCC------Ccccc---ccCCceeee----eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 11111110 00000 011222222 12234678999999999999999998766665544 346 Q ss_pred cceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcc Q lcl|NC_019418. 296 QRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSG 375 (527) Q Consensus 296 ~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~ 375 (527) ++.+.||++......+...+ ....|.. +..+...+..+....+...-...++.+...|....=+. T Consensus 297 ~pp~~v~~~g~~~~~~~~~~----------~~g~~v~---g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~-- 361 (543) T protein:vir:88 297 KVVGLVNPNGITQVRRLVKA----------QTGDFVA---GRKADIEFLQLEKTADFTVAKSVADAIEARLSYVFMLN-- 361 (543) T ss_pred cCceeeccccccchhhcccC----------CCceeec---CCCCcceeeecccccchhHHHHHHHHHHHHHHHHHhhh-- Confidence 66667755543222111110 0111111 11111111112211123334455555544443322111 Q ss_pred cccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCcc-CCHHH Q lcl|NC_019418. 376 MFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVF-TDRHA 453 (527) Q Consensus 376 ~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~-~d~~~ 453 (527) .+....+...|||||....+...+..+-.-..+ ...|.-|+..++.+....+.....+.+ .+.+++--++. .-+.. T Consensus 362 ~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~--~v~~~~vs~l~~l~r~~ 439 (543) T protein:vir:88 362 SAVQRSGERVTAEEIRYVASELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQIPNLPQE--AVEPTVTTGAEALGRGQ 439 (543) T ss_pred hhccCCCCcccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchh--ceeeeEEecHHHHHHHH Confidence 122234455799999999988887777654444 345566666666554332332222222 34444422211 11112 Q ss_pred HHHHHHHHHh-cCC---------CCHHHHHHh---cCCC-------CHHHHHHHHHHHHHhcccc---cccccCCCCCCC Q lcl|NC_019418. 454 ELDYWMKMVA-AGF---------ATQKRGIAK---TLGI-------TEEEAEKELAEINGELPPE---SDAELALYGKGQ 510 (527) Q Consensus 454 ~~~~~~~~~~-aGi---------~s~~~~i~~---~~~~-------~deea~~el~ri~~E~~~~---~~~~~~~~~~~~ 510 (527) .++.+++..+ .|. +....++.. ..|+ +++|++++-++.+.+++.. .....+...+.. T Consensus 440 ~~~~l~~~~~~v~~~~~p~vld~id~d~~~~~~a~~~Gv~~~~i~r~~~e~~~~~~q~~~q~~~~~~~~~~~~~~~~~~~ 519 (543) T protein:vir:88 440 DLDKLTQFLNAVATVSQLNGDPDLNVNNIKLRLANAIGIDTAGLLLTEAEKAQAQSQEMLKQGGLNAAAGIGSGVAAQAT 519 (543) T ss_pred HHHHHHHHHHHHHhccchhhhccCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhc Confidence 2222222211 011 222333322 2365 2344443322221111100 000000000000 Q ss_pred CCCCC------CCCCCCCccccC Q lcl|NC_019418. 511 QNTVG------NSKDTVDDEDEA 527 (527) Q Consensus 511 ~~~~~------~~~~~~~~~~~~ 527 (527) .++.. +.+--.+...-- T Consensus 520 ~~~~~~~~~~~~~~~~~~p~~~~ 542 (543) T protein:vir:88 520 ASPEAMESAMDTAGVQPGPIATQ 542 (543) T ss_pred cChHHHHHHhhhcCCCCCCCCCC Confidence 00000 000000000000 No 142 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=98.60 E-value=2.1e-07 Score=57.23 Aligned_cols=403 Identities=13% Similarity=0.095 Sum_probs=162.9 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccc-cCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRK-RRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~-~~~~~~lnl~~~i 79 (527) |+||+++|.+|.+-- ..+ +..... .++. .- .++.++.. . ...|..- ...-+...--... T Consensus 7 ~~~~~~~~~~~~~~~----~~~---~~~~~~--~~~~--~~--~~~~~~~~--~-----s~~g~~v~~~~al~~~~V~~~ 66 (432) T protein:vir:10 7 LGLLGQLKAMFVPPD----PVD---IGGGQT--FTPV--NA--TARDLGII--I-----SDTGAAVNADAIMRLDAVAAC 66 (432) T ss_pred cchhhhhHhhcCCcc----ccc---cccccc--cccC--cc--hhhhhccc--c-----cccCcccchhhhhcchHHHHH Confidence 999999999986411 011 111110 1100 00 01111110 0 0011100 0011111111233 Q ss_pred HHHHhhhhhcccceEe---------eCCHHHHHHHHH-HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCee-EEEEEc Q lcl|NC_019418. 80 AKKIASLVYNEQAEIS---------AEDETLNDFLSD-MLSNDRFNKNFERYLESALALGGLAMRPYVDGDKI-RVAFIQ 148 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~---------~~d~~~~~~l~~-~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~-~i~~v~ 148 (527) ++.+|+-+-+-|..+- +.+..+...|.. =-..-.....++.++...+..|.+++.+..+++++ .+.+++ T Consensus 67 i~~Ia~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~~g~~~~L~~l~ 146 (432) T protein:vir:10 67 VKLVSQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTDGRIESLQYLA 146 (432) T ss_pred HHHHHHhhhhCceeEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEEEEEEc Confidence 3444444433343321 112222222211 00001122233456667788899998887776653 466678 Q ss_pred CCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeec Q lcl|NC_019418. 149 APVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNL 228 (527) Q Consensus 149 a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l 228 (527) |+++-++. +.++. .+|.. ..+ ++..+. | + T Consensus 147 ~~~v~v~~-~~~g~-----------------~~y~~-~~~--------------~g~~~~-----~------------~- 175 (432) T protein:vir:10 147 NDRLTITT-DTKGN-----------------TAYRY-RRT--------------DGQMID-----I------------P- 175 (432) T ss_pred CCceEEEE-cCCCc-----------------EEEEE-Eec--------------CceEEE-----E------------c- Confidence 87776643 22221 11210 000 000000 0 0 Q ss_pred ccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHH-HHcCcceeeechhHhc Q lcl|NC_019418. 229 SELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWE-IKMGQRRVIVPEQMTQ 307 (527) Q Consensus 229 ~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e-~~~~~~~i~v~~~~l~ 307 (527) +. -+.|++.+..| ...|+|.+.-+...+.-... -.++... |+.|.. |..++. T Consensus 176 -------~~----------~iih~~~~~~d-----g~~G~spi~~~~~~i~~~~~-~~~~~~~~f~ng~~----~~gil~ 228 (432) T protein:vir:10 176 -------KQ----------QIWKIMGYSLD-----GENGLSAIRYGAQIFGTAIA-AEAQAARAFRNGQL----QSVYYQ 228 (432) T ss_pred -------Cc----------cEEEecCCCCC-----CcccccHHHHHHHHHHHHHH-HHHHHHHHHhcCCC----cceEEe Confidence 00 12344432212 23588888877766654332 2333333 454332 222222 Q ss_pred CCCCCCCcccccccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccccc Q lcl|NC_019418. 308 LKVQDNQGNIAFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQ 382 (527) Q Consensus 308 ~~~~~~~~~~~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~ 382 (527) .+..... .. ++.-+.-|.+.. .+ -.++..++.++......++++..+....+|+...|++|..+|.... T Consensus 229 ~~~~l~~-----e~-~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~ 302 (432) T protein:vir:10 229 IDRFLTD-----DQ-YDSFAKKVSGSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSA 302 (432) T ss_pred cCCCCCH-----HH-HHHHHHHHhhhhhCCCceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccC Confidence 2211110 00 000011121110 00 0122246666666677788888888889999999999999987654 Q ss_pred cc-chHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHH Q lcl|NC_019418. 383 GV-KTATEIVSENSDTY-QMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMK 460 (527) Q Consensus 383 g~-~TAtei~s~~~~~~-~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~ 460 (527) +. .+++.+.......+ .++.-+...++.+|.. .++.........+.++.+.-+..|..+.++...+ T Consensus 303 ~t~~~~sn~e~~~~~f~~~tl~P~~~~ie~~ln~------------kL~~~~~~~~~~~~fd~~~ll~~d~~~r~~~~~~ 370 (432) T protein:vir:10 303 GTTSWGSGIESQQLGFLSMTLSPWLRRIEQSIAL------------NLLSPAERRRYFADFDTSALLRADSAARSSYYSQ 370 (432) T ss_pred CcccccchHHHHHHHHHHHHHHHHHHHHHHHHHh------------hhcCccccCceEEEeechhhhccCHHHHHHHHHH Confidence 32 22222211111111 2333333333333322 1111111122334444445556788888999999 Q ss_pred HHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCC--CCCCCCCCCCccccC Q lcl|NC_019418. 461 MVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQN--TVGNSKDTVDDEDEA 527 (527) Q Consensus 461 ~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~ 527 (527) ++.+|+|++-+++..+ |+..-+-...+-.++. ....+...+++. ++.....|....+.| T Consensus 371 ~~~~G~~T~NE~R~~~-glppi~g~~~~~~~~~-------~~~pl~~~~~~~~~~~~~~~~~~~~~~~~ 431 (432) T protein:vir:10 371 LVNNGLMTRDEAREIE-GLPKLGGNAAVLTVQS-------AMVPLDSIGLQASPEPASGLGNQQQDKVS 431 (432) T ss_pred HHhCCCCCHHHHHHHh-CCCCCCCCcceEeecC-------cccchhhhcccCCCCCCCCCCCccccccc Confidence 9999999999977654 5432100000000000 000000000000 000011111111111 No 143 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=98.60 E-value=2.1e-07 Score=57.17 Aligned_cols=452 Identities=8% Similarity=0.033 Sum_probs=176.3 Q ss_pred HHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHHHHhhh Q lcl|NC_019418. 7 VKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIASL 86 (527) Q Consensus 7 ~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~l 86 (527) ||..+++...++ | .......|+.++.---|-+-.....+......+.-=..+...++.+|+- T Consensus 1 mk~~~~~~~~~l--k----------------r~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~ 62 (510) T protein:vir:78 1 MKSTAAMLWEKL--R----------------DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAK 62 (510) T ss_pred ChhHHHHHHHHH--h----------------ccchHHHHHHHHHhhccccccCCCCcccccccCcccchHHHHHHHHHHH Confidence 444444433222 0 1123556777765433322111111111111111114556677777765 Q ss_pred hhcc--cc-----eEeeCCHH-------------HHHH-------HHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC Q lcl|NC_019418. 87 VYNE--QA-----EISAEDET-------------LNDF-------LSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG 139 (527) Q Consensus 87 l~~e--~~-----~i~~~d~~-------------~~~~-------l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~ 139 (527) |.+- || ++.+++.. ..++ +...|..++|...+.++..+....|.+.+ |.+. T Consensus 63 l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--~~~~ 140 (510) T protein:vir:78 63 LARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNS 140 (510) T ss_pred HHHhhcCCCCcccccCCChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEE--EEeC Confidence 4443 11 13333221 2333 34567788999999999999999998765 4554 Q ss_pred CeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCc Q lcl|NC_019418. 140 DKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSD 219 (527) Q Consensus 140 ~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~ 219 (527) +..++..++-.+++ +..|..|++.+++....+....-... |.............+....|-|.+++..+. T Consensus 141 ~~~~~~~~pl~~y~-v~~d~~G~vd~i~rr~~~t~~~l~~~---------~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~ 210 (510) T protein:vir:78 141 DEATVVAWSLRSYA-VRRDATGRWMDIVLKQRYKSKDLDDV---------YKQDLMRAGRNLSGSGSVDLYTHVQRRKGT 210 (510) T ss_pred CCCeEEEEEcceeE-EeeCCCcCeeEEEeeeeccHHHHHHH---------hhHHhhhhhhccCCCceEEEEEEEEeecCC Confidence 44456667777755 45677777766664432210000000 000000000000001112222222221110 Q ss_pred cccCceeecccccCCcccc--eeecC--CCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHH-HHHc Q lcl|NC_019418. 220 SQLGERVNLSELYPDLQPV--TPIQG--LSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMW-EIKM 294 (527) Q Consensus 220 ~~lG~~v~l~~~~~~l~~~--~~~~g--~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~-e~~~ 294 (527) -.|...+|-.+... ....+ ....+|..++- +...++.||+|--..+.+-+..|+..--.... .... T Consensus 211 -----~~~~~sv~~e~dg~~i~~~~~~~~~e~P~~~~Rw----~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a 281 (510) T protein:vir:78 211 -----AMDYAEMYHEIDGVRVGETGRWPIHLCPYIVPTW----NLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELES 281 (510) T ss_pred -----CCcEEEEEEEecCeeeccccccccccCCeeeeee----eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 01111111100000 00111 11122222221 22346789999999999999999976544443 3344 Q ss_pred CcceeeechhHh-cCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccc--cChHHHHHHHHHHHHHHHHhcC Q lcl|NC_019418. 295 GQRRVIVPEQMT-QLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTP--IRSSDYISAISEGLKLFEMQIG 371 (527) Q Consensus 295 ~~~~i~v~~~~l-~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~--ir~e~~~~~~~~~l~~i~~~~g 371 (527) .+....|+++.+ .+. +.. ......+++ +....++.++.. .....-...++.+...|....= T Consensus 282 ~~~~~lv~p~g~~~~~-~l~----------~~~~g~~v~-----g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF~ 345 (510) T protein:vir:78 282 LEVLNLVDEAKGAVVD-DYQ----------DAEMGDYVP-----GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFM 345 (510) T ss_pred hcCCcccCCccccchh-hhc----------cCCCceeec-----CCcccccccccCcccchHHHHHHHHHHHHHHHHHHh Confidence 444445533221 111 000 000011111 111122222211 1112222333333333322210 Q ss_pred CCcccccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCC Q lcl|NC_019418. 372 VSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTD 450 (527) Q Consensus 372 ~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d 450 (527) + ++....+...|||||....+...+..+-.-..+ ...|.-|++.++.+..-.++....+.......|++-..+-.. T Consensus 346 ~---~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl~p~p~~~~~~~~v~~is~Lara 422 (510) T protein:vir:78 346 Y---GANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRS 422 (510) T ss_pred h---ccccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccceeeecccHHHHH Confidence 1 111123344699999998888887776533333 344455666555544322232221111122233432222221 Q ss_pred HHHH-HHHHHHHHh-cC----C---CCHHHHH---HhcCCC-------CHHHHHHHHHHHHHhcccc---cccccCCCCC Q lcl|NC_019418. 451 RHAE-LDYWMKMVA-AG----F---ATQKRGI---AKTLGI-------TEEEAEKELAEINGELPPE---SDAELALYGK 508 (527) Q Consensus 451 ~~~~-~~~~~~~~~-aG----i---~s~~~~i---~~~~~~-------~deea~~el~ri~~E~~~~---~~~~~~~~~~ 508 (527) .+.+ +....+.++ .| + +....++ ....|+ |+||++++.++.+++.+.+ ..+.+.--++ T Consensus 423 q~~~~l~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~~~~~q~~~~~~~~~a~~~~~~~ 502 (510) T protein:vir:78 423 AAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASD 502 (510) T ss_pred HHHHHHHHHHHHHHHhcChhhhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 1110 111111111 11 1 2223332 334565 3566655544322111101 0111111111 Q ss_pred CCCCCCCC Q lcl|NC_019418. 509 GQQNTVGN 516 (527) Q Consensus 509 ~~~~~~~~ 516 (527) -.....+= T Consensus 503 ~~~~~~g~ 510 (510) T protein:vir:78 503 MTNALAGV 510 (510) T ss_pred hcccCCCC Confidence 11111111 No 144 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=98.58 E-value=2.4e-07 Score=56.86 Aligned_cols=457 Identities=10% Similarity=0.042 Sum_probs=181.7 Q ss_pred HHHHHHH-HHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccc-cccccCccccCceeecchHHHHHH Q lcl|NC_019418. 4 IQKVKDF-FNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIE-YTNTDGDRKRRKMQHLPIARTAAK 81 (527) Q Consensus 4 ~~~~k~~-~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~-~~~~~~~~~~~~~~~lnl~~~i~~ 81 (527) |+.+|+- +. ...+++... .+..++......|+.++.---|-+- .....+.. ...++-=+.+...++ T Consensus 1 m~~~~~~~~~-------~~~~~~r~~----~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~-~~~~~~dst~~~a~~ 68 (532) T protein:vir:99 1 MAEVEKTGFA-------ADGAAAAYN----RLKNDRGAYETRAEDCATYTIPSVFPSATADGST-SYTTPWQSIGARGLN 68 (532) T ss_pred Ccchhhcccc-------HHHHHHHHH----HHHHHhhHHHHHHHHHHHHhhhcccCCCCCcchh-hccccccchHHHHHH Confidence 1111100 00 000000000 1122333345556666544333221 11111111 111222255677777 Q ss_pred HHhhhhhcc--cc-----eEeeCCHH-------------HHHH-------HHHHHhhhhHHHHHHHHHHHHHhcCCEEEE Q lcl|NC_019418. 82 KIASLVYNE--QA-----EISAEDET-------------LNDF-------LSDMLSNDRFNKNFERYLESALALGGLAMR 134 (527) Q Consensus 82 ~~A~ll~~e--~~-----~i~~~d~~-------------~~~~-------l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~ 134 (527) .+|+-|.+- || ++.+++.. ..++ +...|..++|...+.++..+..+.|.+.+. T Consensus 69 ~LAa~L~~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~ 148 (532) T protein:vir:99 69 NLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLY 148 (532) T ss_pred HHHHHHHHhhcCCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEE Confidence 777755443 11 22333221 2333 345677899999999999999999999987 Q ss_pred EEEeC----CeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCC-------------cceEEEEEEEEeeccccccc Q lcl|NC_019418. 135 PYVDG----DKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENR-------------KNVYYTLVEFHEWVTPTGQE 197 (527) Q Consensus 135 ~~~d~----~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~-------------~~~~yt~lE~h~~~~~~~~~ 197 (527) +-.+. ....+..++-.+++ +..|..|++..++.........- ....+..++.++.... T Consensus 149 ~~~~~~~~~~~~~f~~~pl~~y~-v~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~p~~~v~v~~~v~~---- 223 (532) T protein:vir:99 149 IPSTEQVEGQSNAPKLYKLHNFV-VERDAYDNVLQIVTEDKIARAALPEDVRKSLEDAQGDQNPSEEVTIYTHVYR---- 223 (532) T ss_pred ecccccccCcccceEEEEcCeEE-EeeCCCCCeeeEeeeeeecHHhcChHHHHHhhccccccCCCcceEEEEEEEe---- Confidence 65542 34567778887766 45677777776664332210000 0001111111110000 Q ss_pred ceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHH Q lcl|NC_019418. 198 VGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTT 277 (527) Q Consensus 198 ~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~l 277 (527) ..++.++...+.+ + |..+++.+. . .+....+|..++- +...++.||+|-...+.+- T Consensus 224 ---~~~~~~~~~~~~~----~----g~~~~~~~~------~---~~~~e~P~~~~Rw----~~~~ge~YGrgp~~~~l~D 279 (532) T protein:vir:99 224 ---DPEAMVFRSYQEI----D----GEIVAGTEG------E---YPLDSCPWIPVRL----IKMPNEDYGRSFVEEYLGD 279 (532) T ss_pred ---cCCCCeeEEEEee----c----Cceeccccc------c---cccccCCceeeee----eecCCCccccchHHHHHHH Confidence 0011111111111 0 111111110 0 0111112222221 2234678999999999999 Q ss_pred HHHHHHHHHHHHH-HHHcCcceeeechhHhcCCCCC-CCcccccccccccccceeeeccCCCCCCCcceEec--cccChH Q lcl|NC_019418. 278 IDFINRTYDEFMW-EIKMGQRRVIVPEQMTQLKVQD-NQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLT--TPIRSS 353 (527) Q Consensus 278 id~ld~~~s~~~~-e~~~~~~~i~v~~~~l~~~~~~-~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~--~~ir~e 353 (527) ++.|+..--.... .....+....|+++.+....+. .++ ...+++ +....++.++ ...+.. T Consensus 280 ~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~-----------~g~~v~-----g~~~~i~~~~~~~~~~~~ 343 (532) T protein:vir:99 280 LKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKAN-----------TGDFVA-----GRKQDVEVFQLEKYNDFQ 343 (532) T ss_pred HHHHHHHHHHHHHHHHHHcCCCceeccccccchhhhccCC-----------Ccceec-----CCcccceeeecccccchh Confidence 9999976544443 3344555555543332111110 000 001111 1111122221 111222 Q ss_pred HHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhcccCCc Q lcl|NC_019418. 354 DYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKVVGIYRGT 432 (527) Q Consensus 354 ~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~~~~~~~~ 432 (527) .-...++.+...|....=+. .+....+...|||||....+...+..+-.-..+ ...|.-|+..++.+..-.++.... T Consensus 344 ~~~~~i~~~~~rI~~af~~~--~~~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~ 421 (532) T protein:vir:99 344 VAKATADDIEKRLSYAFMLN--SAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNL 421 (532) T ss_pred HHHHHHHHHHHHHHHHHhhh--hcccCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC Confidence 22344444443332222111 122223345799999999888887776544444 344556666666554332332222 Q ss_pred ccCccceEEEeCCCccCCHHHHHHHHHHHHh-----cCC-------CCHHHHH---HhcCCC-------CHHHHHHHHHH Q lcl|NC_019418. 433 IPELDDISVNLDDGVFTDRHAELDYWMKMVA-----AGF-------ATQKRGI---AKTLGI-------TEEEAEKELAE 490 (527) Q Consensus 433 ~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~-----aGi-------~s~~~~i---~~~~~~-------~deea~~el~r 490 (527) +.+...+.+. .-.+..++.+....+.+ +.+ +....++ ....|+ +++|++++.++ T Consensus 422 p~~~~~~~iv----~~is~Laraq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~GV~~~~i~r~~ee~~~~~~q 497 (532) T protein:vir:99 422 PKEAVEPAIA----TGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAE 497 (532) T ss_pred Chhhccccee----ecchHHHHHHHHHHHHHHHHHHHhhcchhhhhCCHHHHHHHHHHHhCCChhhccCCHHHHHHHHHH Confidence 2222233221 11233333332222110 111 2233333 233454 34555555443 Q ss_pred HHHhcccc--cccccCCCCCCCCCCCCCCCCCCCccc Q lcl|NC_019418. 491 INGELPPE--SDAELALYGKGQQNTVGNSKDTVDDED 525 (527) Q Consensus 491 i~~E~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (527) .+.++... ........+...-.+. .-.++-|.+ T Consensus 498 ~~~~~~~~~a~~~~~~~~~~~~~~~~--~~~~~~~~~ 532 (532) T protein:vir:99 498 ASTAAGMVTAGQQMGAAGGQAAAAMM--QQQAGMPTQ 532 (532) T ss_pred HHHHHHHHHHHHHHHHHHHHhcchhH--HhhcCCCCC Confidence 33222111 0000000000000000 001111111 No 145 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=98.56 E-value=2.8e-07 Score=56.48 Aligned_cols=451 Identities=8% Similarity=0.029 Sum_probs=180.8 Q ss_pred HHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHHHHhhh Q lcl|NC_019418. 7 VKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIASL 86 (527) Q Consensus 7 ~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~l 86 (527) ||..+.+...++- .......|+.++.---|-+-.....+......+.-=+.+...++.+|+- T Consensus 1 mk~~~~~~~~~lk------------------R~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~ 62 (510) T protein:vir:63 1 MKTTAAMLWEKLR------------------DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAK 62 (510) T ss_pred ChhHHHHHHHHHh------------------ccchHHHHHHHHHhhccccCCCCCCccccccCCCccchHHHHHHHHHHH Confidence 4444443332220 1123556777665433322111111111111111114556777777775 Q ss_pred hhcc--cc-----eEeeCCH-------------HHHH-------HHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC Q lcl|NC_019418. 87 VYNE--QA-----EISAEDE-------------TLND-------FLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG 139 (527) Q Consensus 87 l~~e--~~-----~i~~~d~-------------~~~~-------~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~ 139 (527) |.+- || ++.+++. ...+ .+...|..++|...+.++..+....|.+.+ |.+. T Consensus 63 l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l--~~~~ 140 (510) T protein:vir:63 63 LARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRDS 140 (510) T ss_pred HHhhhcCCCCcccccCCChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEE--EEcC Confidence 4443 11 1333321 1233 345677788999999999999999998644 4666 Q ss_pred CeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEe-ecccccccceeeecCCceEEEEEEEecCC Q lcl|NC_019418. 140 DKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHE-WVTPTGQEVGSTKDKSLYRITNELYKSTS 218 (527) Q Consensus 140 ~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~-~~~~~~~~~~~~~~~~~~~I~n~ly~~~~ 218 (527) +..++..++-.+++ +..|..|++.+++......... ..+ +.............+....|-+.+++..+ T Consensus 141 ~~~~~~~~pl~~y~-v~~d~~G~vd~i~rr~~~t~~~----------l~e~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~ 209 (510) T protein:vir:63 141 DAATVVAWSLRSYA-VRRDATGRWMDIVLKQRYKSKD----------LDEEYKQDLMRAGRNLSGSGSVDLYTHVQRKKG 209 (510) T ss_pred CCcEEEEEEcceeE-EeeCCCcCeeEEEeeeeccHHH----------HhHHhhhhhhccccccCCCcceEEEEEEEeecC Confidence 66667777777766 4567777777665443221000 000 00000000000000112223333332211 Q ss_pred ccccCceeecccccCCcccc--eeecC--CCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHH-HHH Q lcl|NC_019418. 219 DSQLGERVNLSELYPDLQPV--TPIQG--LSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMW-EIK 293 (527) Q Consensus 219 ~~~lG~~v~l~~~~~~l~~~--~~~~g--~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~-e~~ 293 (527) +-.|...+|-..... ....+ ....+|..++- +...++.||+|--..+.+-+..|+..--.... ... T Consensus 210 -----~~~~~~sv~~e~dg~~~~~~~~~~~~e~P~~~~Rw----~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~ 280 (510) T protein:vir:63 210 -----TAMEYAELYHEIDGVRVGKEGRWPIHLCPYIVPTW----NLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELE 280 (510) T ss_pred -----CCceEEEEEEEecCceeccccccccccCceeeeee----eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111222211111110 01111 11122322221 22346789999999999999999976544443 334 Q ss_pred cCcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEec--cccChHHHHHHHHHHHHHHHHhcC Q lcl|NC_019418. 294 MGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLT--TPIRSSDYISAISEGLKLFEMQIG 371 (527) Q Consensus 294 ~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~--~~ir~e~~~~~~~~~l~~i~~~~g 371 (527) ..+....|+++.+........ .....+++ +....++.++ +..+...-...++.+...|....= T Consensus 281 a~~~~~lv~p~g~~~~~~~~~----------~~~g~~v~-----g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~ 345 (510) T protein:vir:63 281 SLEVLNLVDEAKGAVVDDYQD----------AEMGDYVP-----GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFM 345 (510) T ss_pred hccCCcccCcccccchhhhcc----------CCCceeec-----CCcccceeeecCcccchHHHHHHHHHHHHHHHHHHH Confidence 455555554432211000000 00001111 1112233332 112222223344444333332210 Q ss_pred CCcccccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhcccCCcccCccc-eEEEeCCCccC Q lcl|NC_019418. 372 VSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKVVGIYRGTIPELDD-ISVNLDDGVFT 449 (527) Q Consensus 372 ~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~~~~~~~~~~~~~~-v~v~f~d~i~~ 449 (527) + ++....+...|||||....+...+...-.-..+ ...|.-|++.++.+..-.++... +++... ..|++-..+-. T Consensus 346 ~---~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl~p~-p~~~~~~~~v~~is~Lar 421 (510) T protein:vir:63 346 Y---GANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGL-ITKQHKPAIETGLPALSR 421 (510) T ss_pred h---hcccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCC-CchhcccceecchhHHHH Confidence 1 111123344699999998888887766533333 33445566655544432223221 122222 22333222221 Q ss_pred CHHHH-HHHHHHHHh-cC-C------CCHHHHH---HhcCCC-------CHHHHHHHHHHHHHhcccc---cccccCCCC Q lcl|NC_019418. 450 DRHAE-LDYWMKMVA-AG-F------ATQKRGI---AKTLGI-------TEEEAEKELAEINGELPPE---SDAELALYG 507 (527) Q Consensus 450 d~~~~-~~~~~~~~~-aG-i------~s~~~~i---~~~~~~-------~deea~~el~ri~~E~~~~---~~~~~~~~~ 507 (527) ..+.+ +....+.++ .| + +....++ ....|+ |++|++++.++.+.+.... ......--+ T Consensus 422 aq~~~~l~~~~q~l~~~~~~aq~~~~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~~~~qq~~~~~~~~~~~~~~a~ 501 (510) T protein:vir:63 422 SAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEQQRQQAAQAQAAQETLLEGAS 501 (510) T ss_pred HHHHHHHHHHHHHHHHhcCchhhhccCCHHHHHHHHHHHhCCChhHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111 111111111 11 1 1123333 233465 4566666544322221111 001111011 Q ss_pred CCCCCCCCC Q lcl|NC_019418. 508 KGQQNTVGN 516 (527) Q Consensus 508 ~~~~~~~~~ 516 (527) .-...+.+= T Consensus 502 ~~~~~~~g~ 510 (510) T protein:vir:63 502 DMTNALAGV 510 (510) T ss_pred hhcccccCC Confidence 111222211 No 146 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=98.56 E-value=2.9e-07 Score=56.43 Aligned_cols=404 Identities=12% Similarity=0.095 Sum_probs=162.2 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCcccc-CceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKR-RKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~-~~~~~lnl~~~i 79 (527) |+||+++|.+|.+-- .. .+.....+ ++. .- .++.++... ...|..-. ..-+...---.. T Consensus 7 ~g~~~~~~~~~~~~~----~~---~~~~~~~~--~~~--~~--~~~~~~~~~-------~~~g~~v~~~~a~~~~aV~~~ 66 (432) T protein:vir:97 7 LGLLGQLKAMFVPPD----PV---DIGGGQTF--TPV--NA--TARDLGIII-------SDTGAAVNADAIMRLDAVAAC 66 (432) T ss_pred CchhhhhHhhcCCcc----cc---cccccccc--ccC--ch--hhhhhcccc-------cccCcccchHhhhcchHHHHH Confidence 999999999986411 00 00000111 110 00 011111110 00111000 001111111233 Q ss_pred HHHHhhhhhcccceEe---------eCCHHHHHHHHH-HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCee-EEEEEc Q lcl|NC_019418. 80 AKKIASLVYNEQAEIS---------AEDETLNDFLSD-MLSNDRFNKNFERYLESALALGGLAMRPYVDGDKI-RVAFIQ 148 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~---------~~d~~~~~~l~~-~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~-~i~~v~ 148 (527) ++.+|+-+-+-|..+- +.+..+...|.. =-..-.-..-++..+...+..|.+++.+..+++++ .+.+++ T Consensus 67 v~~Ia~~ia~lp~~~y~~~~~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~~g~~~~L~~l~ 146 (432) T protein:vir:97 67 VKLVSQAVAAMPLMMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTDGRIESLQYLA 146 (432) T ss_pred HHHHHHhhccCceEEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEEEEEEc Confidence 3444444333343321 112222333321 00011112233445667778899998888876664 456677 Q ss_pred CCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeec Q lcl|NC_019418. 149 APVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNL 228 (527) Q Consensus 149 a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l 228 (527) |+.+-|+. +.++. .+|.. .. .++.. ++ | + T Consensus 147 p~~v~v~~-~~~g~-----------------~~y~~-~~---------------~~g~~-~~---~------------~- 175 (432) T protein:vir:97 147 NDRLTITT-DTKGN-----------------TAYRY-RR---------------TDGQM-ID---I------------P- 175 (432) T ss_pred CcceEEEE-cCCCc-----------------EEEEE-Ee---------------cCceE-EE---E------------c- Confidence 77766542 22221 12210 00 00000 00 0 0 Q ss_pred ccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHH-HHHHcCcceeeechhHhc Q lcl|NC_019418. 229 SELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFM-WEIKMGQRRVIVPEQMTQ 307 (527) Q Consensus 229 ~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~-~e~~~~~~~i~v~~~~l~ 307 (527) +. -+.|++.+..+ ...|+|.+.-+...+.-.... .++. +-|+.|.. |..++. T Consensus 176 -------~~----------~iih~r~~~~d-----g~~G~spi~~~~~~i~~~~a~-~~~~~~~f~ng~~----~~gil~ 228 (432) T protein:vir:97 176 -------RQ----------QIWKIMGYSLD-----GENGLSAIRYGAQIFGTAIAA-EAQAARAFRNGQL----QSVYYQ 228 (432) T ss_pred -------cc----------cEEEecCcCCC-----CcccccHHHHHHHHHHHHHHH-HHHHHHHHhccCC----cceeEe Confidence 00 12344432111 235888888776666433322 2333 23454432 222222 Q ss_pred CCCCCCCcccccccccccccceeeec-cCC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccccc Q lcl|NC_019418. 308 LKVQDNQGNIAFKRRFDVEQNVYMQV-GAG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQ 382 (527) Q Consensus 308 ~~~~~~~~~~~~~~~~d~~~~~~~~~-~~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~ 382 (527) .+....... ...| +.-|.+. +.+ -+++..++.++....+.++.+..+....+|+...|++|..+|.... T Consensus 229 ~~~~l~~e~---~~~~---~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~ 302 (432) T protein:vir:97 229 IDRFLTDDQ---YDSF---SKKVSGSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSA 302 (432) T ss_pred cCCCCCHHH---HHHH---HHHHhhhhcCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCC Confidence 221111000 0001 1111111 100 0122245666666677888888888889999999999999987654 Q ss_pred cc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHH Q lcl|NC_019418. 383 GV-KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKM 461 (527) Q Consensus 383 g~-~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~ 461 (527) +. .++..+..... ..++.+|..++..|-...+. .++.........+.++++.-+..|..+.++...++ T Consensus 303 ~t~~~~s~~e~~~~----------~f~~~tl~P~~~~ie~~ln~-kLl~~~e~~~~~~~fd~~~llr~d~~~r~~~~~~~ 371 (432) T protein:vir:97 303 GTTSWGSGIESQQL----------GFLTMTLSPWLRRIEQSIAL-NLLTPAERRRYFADFDTSALLRADSAARSSYYSQL 371 (432) T ss_pred cccccchhHHHHHH----------HHHHHHHHHHHHHHHHHHhh-hccCccccCceEEEeechhhhccCHHHHHHHHHHH Confidence 32 11222211111 11223333333333221111 12211111223345555555667888999999999 Q ss_pred HhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCC--CCCCCCCCCCccccC Q lcl|NC_019418. 462 VAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQN--TVGNSKDTVDDEDEA 527 (527) Q Consensus 462 ~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~ 527 (527) +.+|+|++-+++... |+..-+-...+-.++... ..+...+.+. ++.....+...+..| T Consensus 372 ~~~G~~T~NE~R~~~-glpp~~g~~~~~~~~~~~-------~pl~~~~~~~~~~~~~~~~~~~~~~~~ 431 (432) T protein:vir:97 372 VNNGLMTRDEAREIE-GLPKLGGNAAVLTVQSAM-------VPLDSIGLQASPEPASGLGNQQQDKVS 431 (432) T ss_pred HhCCCCCHHHHHHHh-CCCCCCCCcceEeecccc-------cchhhhcccCCCCCCCCCCCccccccc Confidence 999999999977553 443210000000000000 0000000000 000011111111111 No 147 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=98.55 E-value=3.1e-07 Score=56.30 Aligned_cols=473 Identities=11% Similarity=0.048 Sum_probs=196.7 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccc-c---ccC--ccccCceeecc Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYT-N---TDG--DRKRRKMQHLP 74 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~-~---~~~--~~~~~~~~~ln 74 (527) |.= =..-+.+ .+++... .+..++......|+.+|.---|-+... . ..+ ..+...++-=+ T Consensus 1 m~~---d~~~~~~--------~l~~r~~----~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~~~ds 65 (549) T protein:vir:10 1 MTN---DDAKILQ--------ALNADHG----RMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERSQKMFDS 65 (549) T ss_pred CCc---chHHHHH--------HHHHHHH----HHHHHhhhHHHHHHHHHHHhccccccccccCCCCCCcccccccccccc Confidence 211 0011110 1111110 123344556666777765433322110 0 011 01111122225 Q ss_pred hHHHHHHHHhhhhhcc--cc-----eEeeCCHH------HHHHHH-------HHH--hhhhHHHHHHHHHHHHHhcCCEE Q lcl|NC_019418. 75 IARTAAKKIASLVYNE--QA-----EISAEDET------LNDFLS-------DML--SNDRFNKNFERYLESALALGGLA 132 (527) Q Consensus 75 l~~~i~~~~A~ll~~e--~~-----~i~~~d~~------~~~~l~-------~~l--~~n~f~~~~~~~~~~a~~~G~~~ 132 (527) .+...++.+|+-|.+- || ++.++++. ..++|+ .++ ..++|...+.++..+....|.+. T Consensus 66 tg~~a~~~LAs~l~~~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~ 145 (549) T protein:vir:10 66 TAPLALRNFVAAMDSMITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGA 145 (549) T ss_pred hHHHHHHHHHHHHHhhccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhccee Confidence 6777788877765543 11 23444332 233444 322 35789999999999999999999 Q ss_pred EEEEEeC-CeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEe-eccc----ccccceeeecCCc Q lcl|NC_019418. 133 MRPYVDG-DKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHE-WVTP----TGQEVGSTKDKSL 206 (527) Q Consensus 133 ~~~~~d~-~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~-~~~~----~~~~~~~~~~~~~ 206 (527) +.+-.|. +.+++..++-.+++- ..|..|++..+|-...+. -+ +.++ |+.. ..........+.. T Consensus 146 l~~~~~~~~~~~f~~~pl~~~~v-~~d~~G~vd~i~r~~~~t--~~--------ql~~~fg~~~l~~~v~~~~~~~~~~~ 214 (549) T protein:vir:10 146 LMIEHDVGKGIVYRNVPMQRLWF-AENNSGLIDKTHVQWELT--LR--------QAAQRFGRENLSPSMQSTLEKDPEKS 214 (549) T ss_pred eEEeecCCCeeEEEEEEcCeEEE-eeCCCCCeEEEEEEeecC--HH--------HHHHhcCcccCCHHHHHHhhcCCCce Confidence 8877665 456788899988884 566777776655211100 00 0000 0000 0000000011223 Q ss_pred eEEEEEEEecCCccc---cCceeecccccCCccccee--ecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHH Q lcl|NC_019418. 207 YRITNELYKSTSDSQ---LGERVNLSELYPDLQPVTP--IQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFI 281 (527) Q Consensus 207 ~~I~n~ly~~~~~~~---lG~~v~l~~~~~~l~~~~~--~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~l 281 (527) +.|-|.+|...+.+. -+.-.|...+|-....... ..|+..-+|..++ =+...++.||+|-...+.+-++.| T Consensus 215 ~~v~~~V~pr~~~~~~~~~~~~~pf~sv~~e~~~~~il~esg~~e~P~~~~R----w~~~~ge~YGrgp~~~~l~D~k~L 290 (549) T protein:vir:10 215 AIFYHAVEPRADRDPRKLDGRNMQFASYWLDEGRDRIVQNSGFRTFPFAIGR----FYVGTDDVYGGSPAYDAMPDVRMA 290 (549) T ss_pred EEEEEEeecCCCCCccccccccCceEEEEEEecCCEeeccCCcccCCcceee----eeecCCCccccchHHHHHHHHHHH Confidence 333344443222111 1111232222211111111 1233222222222 122346789999999999999999 Q ss_pred HHHHHHHHHHHH-cCcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHH Q lcl|NC_019418. 282 NRTYDEFMWEIK-MGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAIS 360 (527) Q Consensus 282 d~~~s~~~~e~~-~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~ 360 (527) +..--....-.+ ..++.+.||++......+...|. ..|.. .+.++...+..+....+..--...++ T Consensus 291 ~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~l~pgg-----------~~~~~--~~~~~~~~~~pl~~~~~~~~~~~~i~ 357 (549) T protein:vir:10 291 NDMAKTNIRGAQKLVDPPLLANEDGVLDGFDLRSGA-----------LNWGG--LNDKGEEMVKPLLTGKQAQIGIEFAQ 357 (549) T ss_pred HHHHHHHHHHHHHHhcCceeeccccccccceeccCC-----------ccccc--cCCCCccceeeeccccchhHHHHHHH Confidence 987766665554 46677777665432221211111 01111 11112222332221112222223344 Q ss_pred HHHHHHHHhcCCCcccccc-cccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhcccCCccc---- Q lcl|NC_019418. 361 EGLKLFEMQIGVSSGMFTF-DGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKVVGIYRGTIP---- 434 (527) Q Consensus 361 ~~l~~i~~~~g~s~~~~~~-~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~~~~~~~~~~---- 434 (527) .+-..|....=.. .|+. ..+...|||||....+...+..+-.-..+ ...|.-|+..++.+..-.+.....+. T Consensus 358 ~~~~rI~~af~~d--~~~~~~~~~~~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r~g~lP~~p~~l~~ 435 (549) T protein:vir:10 358 DTRQTINQWFYVT--LFQILVDSGDMTATEVLQRAQEKGVLLAPTLGRTQSELLGPMIAREVDILAEAGQLPDMPQELID 435 (549) T ss_pred HHHHHHHHHHhhh--hhhhhcCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhhc Confidence 4433333322111 1111 13345799999999998888876655444 35566676666666543333222111 Q ss_pred CccceEEEeCCCccCCHH-HHH---HHHHHHHh--cCC-------CCHHHHHH---hcCCC------CHHHHHHHHHHH- Q lcl|NC_019418. 435 ELDDISVNLDDGVFTDRH-AEL---DYWMKMVA--AGF-------ATQKRGIA---KTLGI------TEEEAEKELAEI- 491 (527) Q Consensus 435 ~~~~v~v~f~d~i~~d~~-~~~---~~~~~~~~--aGi-------~s~~~~i~---~~~~~------~deea~~el~ri- 491 (527) ....+.|++--.+..... ..+ .+..+.+. +++ +....++. ...|+ |++|++++.++- T Consensus 436 ~~~~~~i~yis~La~aq~~~~~~~i~~~~~~~~~laq~~Pe~ld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~~~~ 515 (549) T protein:vir:10 436 AGADVDVEYDSPLNKAMRAGEGAAILQWLQQLGIVSQFDPAAAKVPNGARIARLLADYGGVPVEAMSTDEELQAQQAAEA 515 (549) T ss_pred CCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhcCCHHHHHHHHHHhcCCCccccCCHHHHHHHHHHHH Confidence 122455665332222110 111 11111111 121 22233332 23453 455554433211 Q ss_pred ---H-HhcccccccccC---CCCCCCCCCCCCCC Q lcl|NC_019418. 492 ---N-GELPPESDAELA---LYGKGQQNTVGNSK 518 (527) Q Consensus 492 ---~-~E~~~~~~~~~~---~~~~~~~~~~~~~~ 518 (527) + .+.........+ ...+.+........ T Consensus 516 ~qqq~~~~~~~a~~a~~~a~~~~~~~ta~~~~~~ 549 (549) T protein:vir:10 516 QAAQMQQMLAAAPVAAGAIKDLSDAQTAAQTARV 549 (549) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhcCCCcccCC Confidence 1 111111111111 11111111111111 No 148 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=98.54 E-value=3.3e-07 Score=56.08 Aligned_cols=483 Identities=8% Similarity=0.020 Sum_probs=215.0 Q ss_pred CC----hHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchH Q lcl|NC_019418. 1 MS----LIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIA 76 (527) Q Consensus 1 m~----~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~ 76 (527) |. .+.+++.+|+.- +.-.++-.....+..+||.|. .|......... ...+..+|+= T Consensus 1 m~d~~~~~~~~~~~~~~~-----------------~~~~~~~R~~a~~d~~fy~G~--QW~~~~~~~l~-~q~rp~~N~i 60 (725) T protein:vir:10 1 MADNENRLESILSRFDAD-----------------WTASDEARREAKNDLFFSRVS--QWDDWLSQYTT-LQYRGQFDVV 60 (725) T ss_pred CCchHHHHHHHHHHHHHH-----------------HHhhHHHHHHHHHHHHhhcCC--CCCHHHHHHHH-hcCCCcccch Confidence 43 344555555431 112344555667788899984 33211111111 1112246888 Q ss_pred HHHHHHHhhhhhcccceEee-----CCHHHHHHHHH----HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe---C----C Q lcl|NC_019418. 77 RTAAKKIASLVYNEQAEISA-----EDETLNDFLSD----MLSNDRFNKNFERYLESALALGGLAMRPYVD---G----D 140 (527) Q Consensus 77 ~~i~~~~A~ll~~e~~~i~~-----~d~~~~~~l~~----~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d---~----~ 140 (527) +.+|+...++--...+.+.+ ++...++.|.. +.+.+++......+...+++.|-+|+.+.+| . + T Consensus 61 ~~~v~~v~g~e~~nr~d~~v~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~ 140 (725) T protein:vir:10 61 RPVVRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSN 140 (725) T ss_pred HHHHHHHHhhHHhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccCCCCCCC Confidence 88888888776666666665 24455555544 4456777788888999999999999999754 1 2 Q ss_pred eeEEEEE----cCCceEEEEEcCC----ceEEE--EEEEEEEeeC-------CCcceEEEEEEE---Eeeccccccc-ce Q lcl|NC_019418. 141 KIRVAFI----QAPVFLPLQSNTQ----DVSSA--AILTKTIKTE-------NRKNVYYTLVEF---HEWVTPTGQE-VG 199 (527) Q Consensus 141 ~~~i~~v----~a~~~~P~~~d~~----~~~~~--a~~~~~~~~~-------~~~~~~yt~lE~---h~~~~~~~~~-~~ 199 (527) .+.|..+ ++.++| ||.. ....| +|..+++... ..+....+..++ ..+...|... .. T Consensus 141 ~~~i~~~~i~~~~~~v~---~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~v 217 (725) T protein:vir:10 141 NQVIRREPIHSACSHVI---WDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQDTI 217 (725) T ss_pred ceeeeeeecccCHhHcc---cCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCCeE Confidence 2344433 344454 2211 01112 1222222110 000000000000 0000000000 00 Q ss_pred eeecCCceEEE---EEEEecCCccccCceeecc-----ccc-----CC------------------cccceeecCCC--- Q lcl|NC_019418. 200 STKDKSLYRIT---NELYKSTSDSQLGERVNLS-----ELY-----PD------------------LQPVTPIQGLS--- 245 (527) Q Consensus 200 ~~~~~~~~~I~---n~ly~~~~~~~lG~~v~l~-----~~~-----~~------------------l~~~~~~~g~~--- 245 (527) .. ...|+.. -.+|...+ ...|..+... .+. ++ +.+...+.+-. T Consensus 218 rv--~E~~~r~~~~~~~~~~~d-~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~ 294 (725) T protein:vir:10 218 QI--AEFYEVVEKKETAFIYQD-PVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIA 294 (725) T ss_pred EE--EEEEEEEEEeeEEEEecc-CCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCCCC Confidence 00 0001000 11121111 1122222111 000 00 00111111100 Q ss_pred cccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHH-cCcceeeechhHhcCCCCCCCcccccccccc Q lcl|NC_019418. 246 RPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIK-MGQRRVIVPEQMTQLKVQDNQGNIAFKRRFD 324 (527) Q Consensus 246 ~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~-~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d 324 (527) ...|-|+|.-+.=....++|++-+.+.++++.++.+|...|...+-+- .++.+..++...+..... .-.. T Consensus 295 ~~~fP~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~-------~~~~-- 365 (725) T protein:vir:10 295 GEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEH-------MYDG-- 365 (725) T ss_pred CCceeEEEEEeeeeccCCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHH-------HHhc-- Confidence 001222222111111245677778899999999999999999999884 466666666665521100 0000 Q ss_pred cccceeeecc---CCCC--CCCcceEec-cccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHH Q lcl|NC_019418. 325 VEQNVYMQVG---AGNM--DSGGIVDLT-TPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTY 398 (527) Q Consensus 325 ~~~~~~~~~~---~~~~--~~~~i~~~~-~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~ 398 (527) ++...|...+ ..++ ....++... +.++ .++...++.....|...+|++...+|..++ ..++-.|.+...... T Consensus 366 ~~~~~~~~~~~~~~~~g~~~~~~i~~~~~~~~p-~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n-~~SG~ai~~rq~qg~ 443 (725) T protein:vir:10 366 NDDYPYYLLNRTDENNGEMPTQPLAYYENPEVP-QANAYMLEAATAAVKEVATLGVDAEAVNGG-QVAYDTVNQLNMRAD 443 (725) T ss_pred cCCceeeecccccccCcccccccCcccCCCCch-HHHHHHHHHHHHHHHHHhCCCHHHhCcCch-hhHHHHHHHHHHHHH Confidence 1111122111 1111 112344333 3343 457788888888899999999988887654 345666777776666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhh-hc------ccCC-----------c--------------ccCccceEEEeCCC Q lcl|NC_019418. 399 QMRNSIVALVEQSIKELCVSMCELGKV-VG------IYRG-----------T--------------IPELDDISVNLDDG 446 (527) Q Consensus 399 ~~~~~~~~~~~~al~~li~~il~~~~~-~~------~~~~-----------~--------------~~~~~~v~v~f~d~ 446 (527) .....+...+..+.+.+.+.+|.+... +. +.+. . ....++|+|+=..+ T Consensus 444 ~~l~~~~Dnl~~~~~~~g~~lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~ 523 (725) T protein:vir:10 444 LETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPS 523 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeeccC Confidence 666777777777777777776665322 10 1000 0 00123444443333 Q ss_pred ccCCHHHHHHHHHHHHhc-C-CCCH-HHHHHhcCCCCH-HHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019418. 447 VFTDRHAELDYWMKMVAA-G-FATQ-KRGIAKTLGITE-EEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVD 522 (527) Q Consensus 447 i~~d~~~~~~~~~~~~~a-G-i~s~-~~~i~~~~~~~d-eea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (527) .+.=+++.++.++++..+ + ..+. -..+..+....+ +-+++.+++|++...+.....+..+-+.+.... . T Consensus 524 ~~s~r~~~~~~l~qll~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e-------~ 596 (725) T protein:vir:10 524 FQSMKQQNRSEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVE-------A 596 (725) T ss_pred cHHHHHHHHHHHHHHHHhccccchhHHHHHHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccchhHHHHH-------H Confidence 222234555555566533 1 1111 122333333322 223445566665443322111100000000000 0 Q ss_pred ccccC Q lcl|NC_019418. 523 DEDEA 527 (527) Q Consensus 523 ~~~~~ 527 (527) ...+. T Consensus 597 qq~~~ 601 (725) T protein:vir:10 597 QQAKQ 601 (725) T ss_pred HHHHH Confidence 00000 No 149 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=98.53 E-value=3.5e-07 Score=55.99 Aligned_cols=404 Identities=12% Similarity=0.068 Sum_probs=168.4 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) ||||+++ |.+-. .+..........+.++...-.. | ..+... ... ...+. .-+....-...+ T Consensus 1 MG~f~~l---f~~~~----~~~~~~~~~~~~~~~~~~~~~~---~-~~~g~~-~~~---~v~~~----~al~~~~v~~ci 61 (422) T protein:vir:13 1 MGFLRGL---FNKKN----NNDEKRSNYDEDIGIDISDSNF---W-EKFGIK-LNF---SVRGK----RALKENTVYVCT 61 (422) T ss_pred Cchhhhh---hhccC----CccchhhhhhhccccccCcchh---h-hhcccc-CCc---ccchh----hhhccHHHHHHH Confidence 9998866 33200 1111111111111111100000 1 111111 100 01111 112223233445 Q ss_pred HHHhhhhhcccceEe-----eCCHHHHHHHHH-HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEEEcCCce Q lcl|NC_019418. 81 KKIASLVYNEQAEIS-----AEDETLNDFLSD-MLSNDRFNKNFERYLESALALGGLAMRPYVDG-DK-IRVAFIQAPVF 152 (527) Q Consensus 81 ~~~A~ll~~e~~~i~-----~~d~~~~~~l~~-~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~v~a~~~ 152 (527) +.+|+-+-+-|..+- +.+..+...|.. --..-....-++.++.+.+..|.+++.+..+. |+ +.+..++|+++ T Consensus 62 ~~ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v 141 (422) T protein:vir:13 62 KIRAESIGKLSLKIYKDKEEYKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGKIIGLYPINSDNV 141 (422) T ss_pred HHHHHhhhhCceEEEecCcccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcce Confidence 555655555454431 122233333321 00011122344556677788899999887764 33 45777888887 Q ss_pred EEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeeccccc Q lcl|NC_019418. 153 LPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELY 232 (527) Q Consensus 153 ~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~ 232 (527) -++.. .++.... .+..+|. +. . . -|....+ T Consensus 142 ~~~~~-~~~~~~~-----------~~~~~y~---~~---------------------------~--~--~g~~~~~---- 171 (422) T protein:vir:13 142 TKIID-DDNFLSS-----------LSKVWYV---VT---------------------------D--K--NGKEHKL---- 171 (422) T ss_pred EEEEc-CCcceec-----------cceEEEE---EE---------------------------e--C--CCeEEEE---- Confidence 77542 2222110 0111221 00 0 0 0110000 Q ss_pred CCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcC-cceeeechhHhcCCCC Q lcl|NC_019418. 233 PDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMG-QRRVIVPEQMTQLKVQ 311 (527) Q Consensus 233 ~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~-~~~i~v~~~~l~~~~~ 311 (527) .+. -..|++.+. ..+..+|+|.+.-+...|.-...+-....+-|+.| +++-++ ..... T Consensus 172 ---~~~---------eiih~~~~~----~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil-----~~~~~ 230 (422) T protein:vir:13 172 ---LPD---------EMLHFIGDI----TLDGLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIV-----QYVGD 230 (422) T ss_pred ---ccc---------ceEEEcCCC----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEE-----EeCCC Confidence 000 123444221 11235699999988888765443333333335553 333222 22111 Q ss_pred CCCc-ccccccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccccccc- Q lcl|NC_019418. 312 DNQG-NIAFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGV- 384 (527) Q Consensus 312 ~~~~-~~~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~- 384 (527) .... .-.....| +..|.+.+ .+ -.++..++.++....+.++.+..+....+|+...|++|..++...++. T Consensus 231 l~~e~~~~~~~~~---~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~ 307 (422) T protein:vir:13 231 LDEKAKKIFKKEF---ESMSNGLENAHSISLLPFGYQFQPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERATF 307 (422) T ss_pred CCHHHHHHHHHHH---HHHhcCccccCCceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCc Confidence 1000 00000111 11122211 00 012224555666666778888888888899999999999998765432 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCC-cccCccceEEEeCCCccCCHHHHHHHHHHHHh Q lcl|NC_019418. 385 KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRG-TIPELDDISVNLDDGVFTDRHAELDYWMKMVA 463 (527) Q Consensus 385 ~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~-~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~ 463 (527) .++.+. ....++.+|..++..|-...+. .++.. .......+.+++++-+..|..+.++...+++. T Consensus 308 sn~e~~-------------~~~f~~~~l~P~~~~ie~~l~~-~Ll~~~~~~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~ 373 (422) T protein:vir:13 308 NNLTEQ-------------QKDFYVTTLQSSLTVYEQEIQD-KLFSQYETLQDVKAEFNVDTILRSDIKTRYEAYRIGIQ 373 (422) T ss_pred ccHHHH-------------HHHHHHHHHHHHHHHHHHHHHH-hhCChhhhcCCceEEeechhhhcCCHHHHHHHHHHHHh Confidence 222222 1112233444444433322111 11111 11122335555556666788899999999999 Q ss_pred cCCCCHHHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCc Q lcl|NC_019418. 464 AGFATQKRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDD 523 (527) Q Consensus 464 aGi~s~~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 523 (527) +|+|++-+++..+ |+..-+ ..+-+.. .... +-+.........++++|. T Consensus 374 ~G~~T~NE~R~~~-gl~p~~ggD~~~~~--~n~~---------~l~~~~~~~~~~g~~~g~ 422 (422) T protein:vir:13 374 GGFIEANEARRRE-NLPPVEGGDRLLVN--GNMI---------PIEMAGEQYKKGGEKGGK 422 (422) T ss_pred CCCcCHHHHHHHh-CCCCCCCcCeeeec--cCcc---------chhhcccccccCCCcCCC Confidence 9999999976543 654311 1110000 0000 000000000111222222 No 150 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=98.53 E-value=3.5e-07 Score=55.95 Aligned_cols=376 Identities=11% Similarity=0.096 Sum_probs=157.9 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHH--H Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIAR--T 78 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~--~ 78 (527) |+||++++.--. +...+... ....+...+.+ ++ ..|..... ...+..|. . T Consensus 1 M~~f~~~~~~~~------------------~~~~~~~~--~~~~~~~~~~~--~~-----~~~~~v~~-~~al~~~~v~~ 52 (386) T protein:vir:49 1 MPIFNITNLATE------------------SPPINQES--FFDIADSDFLA--SL-----NSSEWVSA-ENALKNSDLFS 52 (386) T ss_pred CchhhhhccCCC------------------Ccccchhh--hhhhhhccccc--cc-----cCCceech-hhhhccHHHHH Confidence 999877643210 01111100 11111111111 00 01110000 01122222 4 Q ss_pred HHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCC-e-eEEEEEcCCceEEEE Q lcl|NC_019418. 79 AAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGD-K-IRVAFIQAPVFLPLQ 156 (527) Q Consensus 79 i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~-~-~~i~~v~a~~~~P~~ 156 (527) +++.+|+-+.+-| +.+........+.+--.......-++.++...+..|.+++.+..+.. + +.+.+++|+++-+.. T Consensus 53 ~i~~ia~~ia~~p--~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i~~~~v~v~~ 130 (386) T protein:vir:49 53 IISQLSNDLATAK--ITTSRKQLQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNR 130 (386) T ss_pred HHHHHHHHhhhCc--eeeccchhhhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEecCceeEEEE Confidence 5566666665544 34444443333322111112233345566677778999988877643 3 456777887765543 Q ss_pred EcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcc Q lcl|NC_019418. 157 SNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQ 236 (527) Q Consensus 157 ~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~ 236 (527) .+.++ ..+|.. .. .+...|..+. T Consensus 131 ~~~~~-----------------~~~y~~-~~------------------------------~~~~~~~~~~--------- 153 (386) T protein:vir:49 131 LDNQN-----------------GLYYNI-TF------------------------------DDPHIAPKQH--------- 153 (386) T ss_pred cCCCc-----------------eEEEEE-EE------------------------------cCccccceeE--------- Confidence 22221 112210 00 0000010000 Q ss_pred cceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHc-CcceeeechhHhcCCCCCCCc Q lcl|NC_019418. 237 PVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKM-GQRRVIVPEQMTQLKVQDNQG 315 (527) Q Consensus 237 ~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~-~~~~i~v~~~~l~~~~~~~~~ 315 (527) +.. --+.||+.+.++ +..+|+|.+..+...++.......-..+-|.. +.++.++ ......... T Consensus 154 ----~~~---~evih~~~~~~~----~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il-----~~~~~~~~~ 217 (386) T protein:vir:49 154 ----VPQ---NDILHFRLLSVD----GGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGIL-----KIKGGGLLD 217 (386) T ss_pred ----Ecc---ccEEEecCCCCC----CccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEE-----EeCCCCChH Confidence 000 013456543222 23469999988888886555443333344554 3333332 221111110 Q ss_pred cc-cccccccc-ccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHH Q lcl|NC_019418. 316 NI-AFKRRFDV-EQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSE 393 (527) Q Consensus 316 ~~-~~~~~~d~-~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~ 393 (527) .. .....|.. .......+-. .++..++.++....+.++.+..+....+|+...|++|..+|...++-.++..+... T Consensus 218 ~~~~~~~~~~~~~~n~g~~~vl--~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~ 295 (386) T protein:vir:49 218 FKTKVSRSRQAMKQMQGGPLVL--DDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDGDQQSSLEMIYNI 295 (386) T ss_pred HHHHHHHHHHHhccCCCCceec--CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccchHHHHHHH Confidence 00 00000000 0000000001 12224666666667778888888889999999999999998655444444333222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHH Q lcl|NC_019418. 394 NSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGI 473 (527) Q Consensus 394 ~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i 473 (527) +. ..+.-....|...|.+. +. ..+.++....+-.|..+.+....+++.+|++++-+++ T Consensus 296 ~~---~~i~~~l~~i~~~~~~~------------l~-------~~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r 353 (386) T protein:vir:49 296 YF---KSVSRYLRPFVSEMSKK------------LS-------CEVDVDISPAVDPTGSNYISLINSMVKSGTLAQNQGL 353 (386) T ss_pred HH---HHHHHHHHHHHHHHHHH------------hc-------chhcccchhhhccCHHHHHHHHHHHHhCCCcCHHHHH Confidence 11 11111111122211110 10 1123334444556667777888888899998887766 Q ss_pred Hhc--CCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 474 AKT--LGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 474 ~~~--~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) ..+ .|+...+. .+ ++....++ ..+||+++- T Consensus 354 ~~l~~~~~~~~~~----~~----------------~~~~~~~~----~~gGd~~~~ 385 (386) T protein:vir:49 354 YILQQAEILPKEL----PD----------------GKNPNRTS----LKGGEINEQ 385 (386) T ss_pred HHHhhCCCCCCcC----cc----------------hhccCCCC----CCCCCCCCC Confidence 432 12221111 00 00000011 111111111 No 151 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=98.52 E-value=3.7e-07 Score=55.85 Aligned_cols=446 Identities=12% Similarity=0.095 Sum_probs=168.5 Q ss_pred CChH---------HHHHHHHHHHHHHhhcccchhhhccCccccCHH-HHHHHHHHHHHhcCCCcc-cccccccCc-cccC Q lcl|NC_019418. 1 MSLI---------QKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQS-EFRRIQHNLAYYQSKFDD-IEYTNTDGD-RKRR 68 (527) Q Consensus 1 m~~~---------~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~~~~i~~~~~~y~g~~~~-l~~~~~~~~-~~~~ 68 (527) |+-| +.|..|++-..+.|=.++..+..-+.+. .+.+ ...+++.-..-|...... +.+....+. ...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (574) T protein:vir:80 1 MPKWLDKALGIEKSSIEETRNMENYKMHLREIDTNVVNNEP-YSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIR 79 (574) T ss_pred CcchhhhhhccchhhHHHHHhhhhhccccchhhhhhhhccC-CCHHHHHHhHhhhcccccchhhhhccccccccCcCccC Confidence 3322 2344444333333323332222211111 0111 111111111111111100 000000000 0000 Q ss_pred c----------eeecchHHHHHHHHhhhhh-----------cccceEeeCC---------HHHHHHHHHHHhh------- Q lcl|NC_019418. 69 K----------MQHLPIARTAAKKIASLVY-----------NEQAEISAED---------ETLNDFLSDMLSN------- 111 (527) Q Consensus 69 ~----------~~~lnl~~~i~~~~A~ll~-----------~e~~~i~~~d---------~~~~~~l~~~l~~------- 111 (527) + ..+-.+...+++..++-++ +=|..|...+ ......|..++.+ T Consensus 80 ~~~~~~~~l~~~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~nP 159 (574) T protein:vir:80 80 NSQDLHKTLKKFGNNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFRDP 159 (574) T ss_pred CcccHHHHHHhhccChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCCCCC Confidence 0 0111222233333333222 1122222111 1222345555532 Q ss_pred --hhHHHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEE Q lcl|NC_019418. 112 --DRFNKNFERYLESALALGGLAMRPYVDG-DK-IRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEF 187 (527) Q Consensus 112 --n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~ 187 (527) ..|...+..++...+..|.+++.+..+. |+ +.+..++|..+.+.. +.++.. ......||... T Consensus 160 ~~~s~~ef~~~lv~~lll~Gnayi~i~r~~~G~~~~L~pl~p~~V~v~~-d~~~~~-----------~~~~~~y~~~~-- 225 (574) T protein:vir:80 160 NRDNFTTFCKKLVRATYMYDQVNFEKVFDKDGNFIKFDTVDPTTIFLAT-NGEGKL-----------IKNGERFVQVI-- 225 (574) T ss_pred ccccHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEEcCceeEEEE-cCcccc-----------ccCceEEEEEe-- Confidence 1244555667777888899999888764 33 456778888877643 222110 01111122100 Q ss_pred EeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccC Q lcl|NC_019418. 188 HEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLG 267 (527) Q Consensus 188 h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG 267 (527) ++. +... | + +.+ +.|++.+ ++......++| T Consensus 226 ----------------~g~--~~~~-~------------~--------~~e----------iih~~~~-~~~~~~~~~~G 255 (574) T protein:vir:80 226 ----------------DNR--IVAK-F------------N--------ERE----------LAFAVRN-PRADIEVGQYG 255 (574) T ss_pred ----------------CCc--eEEE-E------------c--------ccc----------EEEEecc-CCCCccccccc Confidence 000 0000 0 0 000 2344321 11112234679 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCccc---ccccccccccceeeecc-C-----CCC Q lcl|NC_019418. 268 LSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGNI---AFKRRFDVEQNVYMQVG-A-----GNM 338 (527) Q Consensus 268 ~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~~---~~~~~~d~~~~~~~~~~-~-----~~~ 338 (527) +|.+.-+...|.....+-.-..+-|..|.. |..+|....+..-... .+...| +..|.+.. . -.+ T Consensus 256 ~spi~~a~~~i~~~~~a~~~~~~~f~ng~~----p~gil~~~~~~~ls~e~~~~lk~~~---~~~~~G~~n~g~~~vl~~ 328 (574) T protein:vir:80 256 YPELEIALKQFIAHENTEVFNDRFFSHGGT----TRGILHVKTGQQQSQQALDIFRREW---RSSLAGINGSWQIPVVSA 328 (574) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCC----CceEEEeCCCCCCCHHHHHHHHHHH---HHHhccccccccceeecC Confidence 999988887776555444333344565432 2222211111100000 000111 11122211 0 011 Q ss_pred CCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_019418. 339 DSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNS-IVALVEQSIKELCV 417 (527) Q Consensus 339 ~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~-~~~~~~~al~~li~ 417 (527) ++..++.++....+.++.+..+...+.|+...|++|..+|+...+..+++.+.+.+. .++.. ....++.+|.-++. T Consensus 329 ~G~~~~~l~~s~~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~---sn~E~~~~~f~~~tL~P~~~ 405 (574) T protein:vir:80 329 EDVKFVNMTPSANDMQFEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNE---GNSKEKMQASQNKGLQPLLR 405 (574) T ss_pred CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccccccccc---hhHHHHHHHHHHHHHHHHHH Confidence 223455666667788888999888999999999999999876543322221111110 11111 11122333443333 Q ss_pred HHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHH-HHHHHH-----HHH Q lcl|NC_019418. 418 SMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLGITEE-EAEKEL-----AEI 491 (527) Q Consensus 418 ~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~de-ea~~el-----~ri 491 (527) .|-...+. .|+. .....+.+.|+..-..+. .+.....+++.+|+|++-+++..+ |+..= ....-+ ..+ T Consensus 406 ~ie~~ln~-~Ll~---~~~~~~~~~f~~~d~~~~-~~~~~~~~~~~~G~lT~NE~R~~l-gl~Pi~gGD~~~~~~n~~~~ 479 (574) T protein:vir:80 406 FIEDTVNT-YIVA---EFGEKYQFQFRGGDLSAQ-LDKLKIIEQEGKVFRTVNEIRHDK-GLEPIKGGDVILNGVHIQAI 479 (574) T ss_pred HHHHHHHh-hhhh---hcCCceEEEecccchhhH-HHHHHHHHHHhCCccCHHHHHHHh-CCCCCCCCCEeeeccceeec Confidence 33322211 1111 112235667766544443 334445567788999999977654 33210 000000 000 Q ss_pred ----HH---h----cccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 492 ----NG---E----LPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 492 ----~~---E----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) +. + ++...........+...+++.++.++..|.++. T Consensus 480 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~ 526 (574) T protein:vir:80 480 GQALQEEQLEYQRSQDRLNRLLELSGGDVEQPEPEEPKDSQNDTDVS 526 (574) T ss_pred ccccccccCCccchhccccccccccCCCCCCCCCCCCCCccccccch Confidence 00 0 000010111111111111222222333333332 No 152 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=98.51 E-value=4.1e-07 Score=55.62 Aligned_cols=389 Identities=13% Similarity=0.068 Sum_probs=160.7 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCcccc-CceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKR-RKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~-~~~~~lnl~~~i 79 (527) ||+|+++ |++.. -.+...... .++ ..+..|+. .|..-. ..-+...--..+ T Consensus 1 Mgl~~~~---f~~~~---~~~~~~~~~-----~~~-------~~~~~~~~-----------~g~~v~~~~al~~~~v~~~ 51 (409) T protein:vir:84 1 MSLFTRI---FSGPS---EERTLTKIS-----GIP-------SPAEDWAM-----------HGDRPGANSAMTLGAFYAC 51 (409) T ss_pred Cchhhhh---hcCCC---ccccccccc-----ccc-------cccchhhc-----------cCcccchhhhhccHHHHHH Confidence 9998855 44311 000000000 000 00011111 111000 011111112234 Q ss_pred HHHHhhhhhcccceEe-------eCCHHHHHHHHH-HHhhhhHHHHHHHHHHHHHhcCCEEEEEEE-e-CCe-eEEEEEc Q lcl|NC_019418. 80 AKKIASLVYNEQAEIS-------AEDETLNDFLSD-MLSNDRFNKNFERYLESALALGGLAMRPYV-D-GDK-IRVAFIQ 148 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~-------~~d~~~~~~l~~-~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~-d-~~~-~~i~~v~ 148 (527) ++.+|+-+-+-|..+- +.+..+...|.. --..-.....+..++.+.+..|.+++.+.+ + +++ ..+..++ T Consensus 52 v~~ia~~iA~lp~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~ 131 (409) T protein:vir:84 52 VTLLADTVASLSIDAYRKKDNVRIPVSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIH 131 (409) T ss_pred HHHHHHhhhhCceEEEEecCCcccccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEc Confidence 4444444444333221 111222333321 001111223345566677788998877655 3 233 3466666 Q ss_pred CCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeec Q lcl|NC_019418. 149 APVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNL 228 (527) Q Consensus 149 a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l 228 (527) |+.+-+.... +....+|.. .|.. -|+.++- T Consensus 132 p~~v~v~~~~-----------------~~~~~~~~~----------------------------~~~~-----~g~~~~~ 161 (409) T protein:vir:84 132 PDCIHVTDAK-----------------DEDGDWIEP----------------------------VYRI-----DGKVVPN 161 (409) T ss_pred CceeEEEEcC-----------------CCcceEEEE----------------------------EecC-----CceEEch Confidence 6655432111 111111100 0100 0111110 Q ss_pred ccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHc-CcceeeechhHhc Q lcl|NC_019418. 229 SELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKM-GQRRVIVPEQMTQ 307 (527) Q Consensus 229 ~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~-~~~~i~v~~~~l~ 307 (527) . -+.|++.+.++ +..+|+|.+..+...++....+-....+-|.. ++++.++ . T Consensus 162 --------~----------dvih~~~~~~~----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil-----~ 214 (409) T protein:vir:84 162 --------H----------RIMHIKRYPVA----GCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGIL-----S 214 (409) T ss_pred --------h----------hEEEecCCCCC----cccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEE-----e Confidence 0 12455543222 23469998888877776665544444444565 3333333 2 Q ss_pred CCCCCCCcccccccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccccc Q lcl|NC_019418. 308 LKVQDNQGNIAFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQ 382 (527) Q Consensus 308 ~~~~~~~~~~~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~ 382 (527) ......... ... + .+....+.. .+ -.+...++.++......++.+..+....+|+...|++|.-+|...+ T Consensus 215 ~~~~l~~e~--~~~-~--~~~~~~~~~n~g~~~vl~~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~ 289 (409) T protein:vir:84 215 SDADLTPDQ--VKQ-T--QKQWIQSHHNRRLPAVMSAGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEK 289 (409) T ss_pred cCCCCCHHH--HHH-H--HHHHHHHhccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC Confidence 111111000 000 0 000011100 00 0112245666666667788888888899999999999999887654 Q ss_pred ccchHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHH Q lcl|NC_019418. 383 GVKTATEIVSENSD-TYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKM 461 (527) Q Consensus 383 g~~TAtei~s~~~~-~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~ 461 (527) +..++..+...... ...++.-+...++.+|... + .....+.++++.-+..|..+.++...++ T Consensus 290 ~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~------------L-----~~g~~i~fd~~~l~~~d~~~~~~~~~~~ 352 (409) T protein:vir:84 290 STSWGTGIEEQGINFVRHTLLPWLRCIEQALDTF------------L-----PRGQFVKFNVDGLMRGDVTARFTAYQMG 352 (409) T ss_pred cccccchHHHHHHHHHHHHHHHHHHHHHHHHHHh------------c-----cCCCeEEEechhhhccCHHHHHHHHHHH Confidence 43322222111111 1122233333333333321 1 1123466777777778889999999999 Q ss_pred HhcCCCCHHHHHHhcCCCCHH-HHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_019418. 462 VAAGFATQKRGIAKTLGITEE-EAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDE 524 (527) Q Consensus 462 ~~aGi~s~~~~i~~~~~~~de-ea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (527) +.+|+|++-+++.+. |+..- ...+-+.. ....+..+....-+. ++..+ .+.+.|.+ T Consensus 353 ~~~G~~t~NE~R~~~-g~~p~~ggD~~~~~--~n~~~~~~~~~~~~~--~~~~~--~~~~~gn~ 409 (409) T protein:vir:84 353 LQNGIWSVNEVRAWE-DAPPIPEGDIHLQP--MNFVPLGYVPPEEPA--QEPQP--NSATEGNK 409 (409) T ss_pred HhCCCcCHHHHHHHh-CCCCCCCcceeeec--ccccccccCCccccC--cCCCC--CCccCCCC Confidence 999999999977654 55421 11111100 000110110000001 00000 01111111 No 153 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=98.44 E-value=6.3e-07 Score=54.59 Aligned_cols=455 Identities=11% Similarity=0.091 Sum_probs=187.8 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) |.- ++.++=|.+-. .++..+ .+..++......|+.+|.---|-+-.....+......++-=..+...| T Consensus 1 ~~~-~~~~~~~~~~~-------~~~r~~----~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~ 68 (535) T protein:vir:94 1 MAS-SQKREGFAENG-------AKAVYD----ALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARGL 68 (535) T ss_pred CCc-hhhhhhHHHHH-------HHHHHH----HHHHHhhHHHHHHHHHHHHhccccCCCCCCccccccCCcccccHHHHH Confidence 433 22222221110 001000 122333445666777665433322111111111122223335667777 Q ss_pred HHHhhhhhcc--c--ceEe--eCCH-------------HHHHHHH-------HHHhhhhHHHHHHHHHHHHHhcCCEEEE Q lcl|NC_019418. 81 KKIASLVYNE--Q--AEIS--AEDE-------------TLNDFLS-------DMLSNDRFNKNFERYLESALALGGLAMR 134 (527) Q Consensus 81 ~~~A~ll~~e--~--~~i~--~~d~-------------~~~~~l~-------~~l~~n~f~~~~~~~~~~a~~~G~~~~~ 134 (527) +.+|+-|.+- | +=|. +.+. ..+++|. ..+..++|...+.++..+....|.+.+. T Consensus 69 ~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~ 148 (535) T protein:vir:94 69 NNLASKLMLALFPMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALLY 148 (535) T ss_pred HHHHHHHHhhhcCCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEe Confidence 7777754442 2 2122 2221 2344443 4477899999999999999999999886 Q ss_pred EEEeCC-eeEEEEEcCCceEEEEEcCCceEEEEEEEEEEee---------------CCC---cceEEEEEEEEeeccccc Q lcl|NC_019418. 135 PYVDGD-KIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKT---------------ENR---KNVYYTLVEFHEWVTPTG 195 (527) Q Consensus 135 ~~~d~~-~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~---------------~~~---~~~~yt~lE~h~~~~~~~ 195 (527) +-.+.+ .+++..++-.+++ +..|..|++..++....... ..+ ...+||.+.. T Consensus 149 ~~~~~~~~~~f~~~pl~~y~-v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~~~v~~-------- 219 (535) T protein:vir:94 149 IPEPEGTYNPMKLYRLSSYV-VQRDAFGTVLQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTHIYL-------- 219 (535) T ss_pred eccCcCcccceEEEEcCeEE-EeeCCCCCeEEEEeeeeccHHHhhHHHHHHHHhccccCCCceeEEEEEEEe-------- Confidence 655543 3567778877766 45677777766664322110 000 0112222211 Q ss_pred ccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhH Q lcl|NC_019418. 196 QEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAK 275 (527) Q Consensus 196 ~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~ 275 (527) ..++.+|.+.+++ + |..++... ...|+..-+|..++- +...++.||+|-...+. T Consensus 220 -----~~~~~~~~~~~e~----~----g~~~~~~~---------~~~g~~~~P~~~~Rw----~~~~ge~YGrgp~~~~l 273 (535) T protein:vir:94 220 -----DEESGEYLKYEEI----D----GVEVEGTD---------ASYPVDACPYIPVRM----VRIDGESYGRSYCEEYL 273 (535) T ss_pred -----eCCCCcEEEEEEe----c----Ceeecccc---------ccCccccCCceeeee----eecCCCccccchHHHHH Confidence 1122333322211 1 22222110 011222223333321 22346789999999999 Q ss_pred HHHHHHHHHHHHHHHH-HHcCcceeeechh-HhcCCC--CCCCcccccccccccccceeeeccCCCCCCCcceEeccccC Q lcl|NC_019418. 276 TTIDFINRTYDEFMWE-IKMGQRRVIVPEQ-MTQLKV--QDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIR 351 (527) Q Consensus 276 ~lid~ld~~~s~~~~e-~~~~~~~i~v~~~-~l~~~~--~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir 351 (527) +-+..|+..--....- ....+....|+++ .+.+.. ++.+| .+++ +..+..++..+....+ T Consensus 274 ~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g-------------~~v~---g~~~~v~~~~~~~~~~ 337 (535) T protein:vir:94 274 GDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQVRRLTKAQTG-------------DFVS---GRPEDISFLQLEKAAD 337 (535) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCcccccccccchhhcccCCCc-------------eeec---CCcccceeeecccccc Confidence 9999999765444432 2334444444332 221110 01111 1111 1111112222222223 Q ss_pred hHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhcccC Q lcl|NC_019418. 352 SSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKVVGIYR 430 (527) Q Consensus 352 ~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~~~~~~ 430 (527) ...-...++.+...|....=+. .+....+...|||||....+...+..+-.-..+ ...|.-|+..++.+..-.++.. T Consensus 338 ~~~~~~~i~~~~~rI~~af~~~--~~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r~g~lP 415 (535) T protein:vir:94 338 FSVARAVSEQIEGRLSYAFMLN--SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLLKQLQATNQIP 415 (535) T ss_pred hhHHHHHHHHHHHHHHHHHhHh--hhccCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhCCCCC Confidence 3333344444444443222111 122233445799999999888887776544444 3445666666665543323322 Q ss_pred CcccCccceEEEeCCCccC-CHHHHHHHHHHHHh--cCC--------CCHHHHHH---hcCCC-------CHHHHHHHHH Q lcl|NC_019418. 431 GTIPELDDISVNLDDGVFT-DRHAELDYWMKMVA--AGF--------ATQKRGIA---KTLGI-------TEEEAEKELA 489 (527) Q Consensus 431 ~~~~~~~~v~v~f~d~i~~-d~~~~~~~~~~~~~--aGi--------~s~~~~i~---~~~~~-------~deea~~el~ 489 (527) ..+.+. +.+++--++.. -+...++..+...+ +++ +....++. ...|+ |++|++++.+ T Consensus 416 ~~p~~~--v~~~~vs~la~l~r~~~~~~l~~~~~~laq~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~rs~eev~~~~~ 493 (535) T protein:vir:94 416 ELPKEA--VEPTISTGMEALGRGQDLDKLERCIAAWSALAPMQGDPDINIATIKLRIANAIGIDTSGILKTPEEKQQEMA 493 (535) T ss_pred CCChhh--ccceEeehHHHHHHHHHHHHHHHHHHHHHhhChHHhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHH Confidence 222222 33333222210 00111111111111 121 22222222 23343 4556665554 Q ss_pred HHHHhcccc---cc--ccc-CC---CCCCCCCCCCCCCCCCC Q lcl|NC_019418. 490 EINGELPPE---SD--AEL-AL---YGKGQQNTVGNSKDTVD 522 (527) Q Consensus 490 ri~~E~~~~---~~--~~~-~~---~~~~~~~~~~~~~~~~~ 522 (527) +.++.+..+ .. ... ++ .....+.-.+.-+-.++ T Consensus 494 q~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:94 494 EAAQGTAMQNAAASAGAGAGTMATASPENMKAAAAQAGMAPN 535 (535) T ss_pred HHHHHHHHHHHHHHHHHhhhcccccChHHHHHHHHHhccCCC Confidence 433222111 00 100 00 00000000000011111 No 154 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=98.44 E-value=6.4e-07 Score=54.55 Aligned_cols=373 Identities=10% Similarity=0.043 Sum_probs=152.8 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhc-CCC-cccccccccCccccCceeecchHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQ-SKF-DDIEYTNTDGDRKRRKMQHLPIART 78 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~-g~~-~~l~~~~~~~~~~~~~~~~lnl~~~ 78 (527) ||+|++. +++ |...+ ..+..+. ..|..... +.. .++. . +..+...--.. T Consensus 1 Mg~~~~~--~~~--------k~~~~----~~~~~~~------~~~~~~~~~~~~~~~v~-----~----~~~l~~~~v~~ 51 (383) T protein:vir:10 1 MGLLTPK--NFS--------KRNAK----NMVYPSN------PAFFTTTVGGMQLSYVS-----A----LSALQNTNVYS 51 (383) T ss_pred CCccccc--ccc--------ccccc----ccccccc------hhhhhhhccCccccccc-----h----hHhhcchHHHH Confidence 9998762 111 11111 0011110 11111111 111 1111 1 11111122234 Q ss_pred HHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcCCceEEEEEc Q lcl|NC_019418. 79 AAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQAPVFLPLQSN 158 (527) Q Consensus 79 i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a~~~~P~~~d 158 (527) .++.+|+-+-.-| +.+.+......|.+--..-.....+..++.+.+-.|.+++.+. ++.. . ++|+.. T Consensus 52 ~i~~ia~~ia~~~--~~~~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~--~~~~--~------~~p~~~- 118 (383) T protein:vir:10 52 VINRIASDVSSAH--FKTENTATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLV--GQNL--E------HIPNSD- 118 (383) T ss_pred HHHHHHHhhccCc--eeecccchhhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEE--cCce--e------EeecCc- Confidence 4455555544434 4555544444443211111223334445666666788877653 3322 2 233211 Q ss_pred CCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccc Q lcl|NC_019418. 159 TQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPV 238 (527) Q Consensus 159 ~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~ 238 (527) .. + ... . +....+|..... .+...++ | + +. T Consensus 119 -~~----v---~~~-~-~~~~~~~~~~~~----------------~~~~~~~---~------------~--------~~- 148 (383) T protein:vir:10 119 -VQ----I---NYL-P-GNMGIVYTVLES----------------NDRPKMV---L------------R--------QD- 148 (383) T ss_pred -ce----E---EEE-E-cCCceEEEEEEc----------------CCceEEE---E------------c--------cc- Confidence 00 0 001 1 111222211100 0000000 0 0 00 Q ss_pred eeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCc-ceeeechhHhcCCCCC--CCc Q lcl|NC_019418. 239 TPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQ-RRVIVPEQMTQLKVQD--NQG 315 (527) Q Consensus 239 ~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~-~~i~v~~~~l~~~~~~--~~~ 315 (527) -..||+...++. .....|+|.+.-+...++....+-.-..+-|..|. +.-++ ...... +.. T Consensus 149 ---------evih~r~~~~~~--~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il-----~~~~~~~~~e~ 212 (383) T protein:vir:10 149 ---------QMLHFRLMPDPQ--YRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKL-----TISNYLSDGKD 212 (383) T ss_pred ---------ceEEeccCCCCc--ccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEE-----EeCCCCCCHHH Confidence 123555322211 12246999998888888766655544444455533 32222 111110 000 Q ss_pred ccccccccccccceeeeccCCC----CCCCcceEeccccChHHHH-HHHHHHHHHHHHhcCCCcccccccccccchHHHH Q lcl|NC_019418. 316 NIAFKRRFDVEQNVYMQVGAGN----MDSGGIVDLTTPIRSSDYI-SAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEI 390 (527) Q Consensus 316 ~~~~~~~~d~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~-~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei 390 (527) .-.....|. ..+.+-+.+. +++..++.++.+....+++ +..+...++|+...|++|..+|....+..|...+ T Consensus 213 ~~~~~~~~~---~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~ 289 (383) T protein:vir:10 213 LESAREEFE---KANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNI 289 (383) T ss_pred HHHHHHHHH---HHhCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccH Confidence 000001111 1122111110 1223456666666667764 5667778899999999999998654332222212 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHH Q lcl|NC_019418. 391 VSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQK 470 (527) Q Consensus 391 ~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~ 470 (527) ..... .+..+|..++..|-..... .++ ...+.++++.-+..|..+.++...+++.+|+|++- T Consensus 290 eq~~~-----------~~~~~l~P~~~~ie~~l~~-~l~------~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~n 351 (383) T protein:vir:10 290 DQIKA-----------TYLANLNSYVNPIVDELRL-KMN------APDLELDIKDMLDVDDSILINQVSNLAKSGVLGAE 351 (383) T ss_pred HHHHH-----------HHHHHHHHHHHHHHHHHHH-hhC------CceEEeechhhhccCHHHHHHHHHHHHhCCCcCHH Confidence 11111 1122233333332211110 111 12467888888889999999999999999999999 Q ss_pred HHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 471 RGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 471 ~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) +++..+ |+..-+ +++....... .....|+++| T Consensus 352 E~R~~l-g~~p~~----------------------~~d~~~~~~~-~~~~~gGd~e 383 (383) T protein:vir:10 352 QAQFIL-TRSGFL----------------------PDNLPEFKPL-TNETKGGDDK 383 (383) T ss_pred HHHHHh-CCCccc----------------------CCcccccCCC-cccCCCCCCC Confidence 876543 432200 0000000000 0111222333 No 155 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=98.37 E-value=1e-06 Score=53.47 Aligned_cols=399 Identities=9% Similarity=0.019 Sum_probs=166.6 Q ss_pred ChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHH Q lcl|NC_019418. 2 SLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAK 81 (527) Q Consensus 2 ~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~ 81 (527) ++ ++++|+| ........ +. -...+..+|.|...... ..+.. ...+...--...++ T Consensus 1 m~---~~~~f~~------------~~~~~~~~-~~----~~~~~~~~~~~~~~~~~-~~v~~----~~al~~~~v~~~i~ 55 (416) T protein:vir:12 1 ML---LERMFEK------------RSGSSDHE-DG----FNNILLNMFGGRKTASG-ERVSE----SNSLVQPDIFACVN 55 (416) T ss_pred Cc---cchhccc------------ccCccccC-cc----chhHHHHhhcCcccccC-ceech----hhhhccHHHHHHHH Confidence 12 1222221 11100000 10 01223455554322110 00100 11111111123445 Q ss_pred HHhhhhhcccceEee-C--------CHHHHHHHHH-HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEEEcC Q lcl|NC_019418. 82 KIASLVYNEQAEISA-E--------DETLNDFLSD-MLSNDRFNKNFERYLESALALGGLAMRPYVDG-DK-IRVAFIQA 149 (527) Q Consensus 82 ~~A~ll~~e~~~i~~-~--------d~~~~~~l~~-~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~v~a 149 (527) .+|+-+-+-|..+-- + +..+...|.. --..-....-++.++.+.+..|.+++.+..+. +. ..+..++| T Consensus 56 ~Ia~~ia~l~~~~~~~~~~~~~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~ 135 (416) T protein:vir:12 56 VLSDDIAKLPIHTYKRTDGGIERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEALFPLRP 135 (416) T ss_pred HHHHhhhhCceEEEEecCCccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECC Confidence 555555444433211 1 1122222211 00011122334556677778899998888764 33 34566777 Q ss_pred CceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecc Q lcl|NC_019418. 150 PVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLS 229 (527) Q Consensus 150 ~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~ 229 (527) +.+-++..+.+ +..||... .. |..+. T Consensus 136 ~~v~v~~~~~~-----------------~~~~~~~~----------------~~-------------------g~~~~-- 161 (416) T protein:vir:12 136 DYTNAYVHPTT-----------------GMLWYQTV----------------LN-------------------GKAIE-- 161 (416) T ss_pred cceEEEEeCCC-----------------cEEEEEEe----------------cC-------------------CeEEE-- Confidence 66554322211 11222100 00 00000 Q ss_pred cccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcC-cceeeechhHhcC Q lcl|NC_019418. 230 ELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMG-QRRVIVPEQMTQL 308 (527) Q Consensus 230 ~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~-~~~i~v~~~~l~~ 308 (527) +..- -+.|++.+.. +.+.|+|.+.-+...++....+-....+-|+.| .++.++ .. T Consensus 162 -----------~~~~---eiih~~~~~~-----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il-----~~ 217 (416) T protein:vir:12 162 -----------LYDY---EVLHFKGLST-----DGIHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGIL-----KV 217 (416) T ss_pred -----------ecCc---cEEEecCcCC-----CCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEE-----ec Confidence 0000 1235553211 235799999888888876554433334445653 333333 11 Q ss_pred CCCCCCc-ccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccccccc-ch Q lcl|NC_019418. 309 KVQDNQG-NIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGV-KT 386 (527) Q Consensus 309 ~~~~~~~-~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~-~T 386 (527) ....... .-.....|.........+-. .++..++.++......++.+..+....+|+...|++|..+|...++. .+ T Consensus 218 ~~~~~~e~~~~~~~~~~~~~~~~~~~vl--~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn 295 (416) T protein:vir:12 218 PAFLDEKPKENVRKEWKRVNKVENIAII--DYGLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKATFSN 295 (416) T ss_pred CCCCCHHHHHHHHHHHHHHhcCCCeeec--CCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCccc Confidence 1111000 00000111100000000001 12234666666677888999998888999999999999998765432 23 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccC-CcccCccceEEEeCCCccCCHHHHHHHHHHHHhcC Q lcl|NC_019418. 387 ATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYR-GTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAG 465 (527) Q Consensus 387 Atei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~-~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aG 465 (527) +.+... ..++.+|..++..|....+. .++. ........+.+++++-+..|..+.++...+++.+| T Consensus 296 ~e~~~~-------------~f~~~~l~P~~~~ie~~l~~-~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G 361 (416) T protein:vir:12 296 IEHQSI-------------EYVRNTLQPWIVNFEQELNV-KLFLDHDQKSGHYVKFNIDSELRGDSKTQAEYLKTLHETG 361 (416) T ss_pred HHHHHH-------------HHHHHHHHHHHHHHHHHHHH-hhcCchhhcCCceEEeechhhhccCHHHHHHHHHHHHhCC Confidence 333311 12233444444333322111 1111 11122344667777778889999999999999999 Q ss_pred CCCHHHHHHhcCCCCHH-HHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 466 FATQKRGIAKTLGITEE-EAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 466 i~s~~~~i~~~~~~~de-ea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) +|++-+++..+ |+..- ...+-+.. .+....+..++.+....+.....++...|+ T Consensus 362 ~~T~NE~R~~~-gl~Pi~ggd~~~~~-------~n~~~~~~~~~~~~~~~~~~~~gge~~~~g 416 (416) T protein:vir:12 362 VLNKDEIRELL-ERNPIENGDKYISS-------LNYVFLDFLEEYQRLKAGGAMKGGDNKNEG 416 (416) T ss_pred CcCHHHHHHHh-CCCCCCCcceeeec-------cccccccccchhhccccccccCCCCCcCCC Confidence 99999977654 55331 11111100 000000000111111111111111122222 No 156 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=98.31 E-value=1.5e-06 Score=52.56 Aligned_cols=493 Identities=11% Similarity=0.010 Sum_probs=207.0 Q ss_pred hHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHH--HHhcCCCccccc-----ccccCccccCceeecch Q lcl|NC_019418. 3 LIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNL--AYYQSKFDDIEY-----TNTDGDRKRRKMQHLPI 75 (527) Q Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~--~~y~g~~~~l~~-----~~~~~~~~~~~~~~lnl 75 (527) |=+++.+.+++.+.++.. .+.-..+.+.....-. .||.|. .|+. ....|....|..++.|+ T Consensus 1 ma~~~~~~~~~~~~r~~~----------~~~~~~~~r~~~~~d~~f~~y~G~--Qw~~~~~~~l~~~~q~~~rP~~~~N~ 68 (708) T protein:vir:17 1 MAETLEKKHERIMLRFDR----------AYSPQQEVREKCIEATRFARVPGG--QWEGATAAGTKLDEQFEKYPKFEINK 68 (708) T ss_pred CchhHHHHHHHHHHHHHH----------HHhhhHHHHHHHHHHHHhhccCCC--CCCHHHHHHHHhhhhhcCCCceEEcc Confidence 444445544443322110 0111333344444444 356663 3321 11223334467788899 Q ss_pred HHHHHHHHhhhhhcccceEeeC------CHHHHHHHH----HHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe------- Q lcl|NC_019418. 76 ARTAAKKIASLVYNEQAEISAE------DETLNDFLS----DMLSNDRFNKNFERYLESALALGGLAMRPYVD------- 138 (527) Q Consensus 76 ~~~i~~~~A~ll~~e~~~i~~~------d~~~~~~l~----~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d------- 138 (527) =+.+|+...++--...+.+.+. +..+++.|. .+.+.++.......+...++..|-+|++++.| T Consensus 69 i~~~i~~v~g~e~~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~~d~~~e~d~ 148 (708) T protein:vir:17 69 VATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDP 148 (708) T ss_pred hHHHHHHHHhhHhhCCcceEEecCCCcchHHHHHHHHHHHHHHHHhcCchhHHhHHHHHhhhcccceeeeeecccccCCC Confidence 9999999888866666666652 234455544 45557788888899999999999999998643 Q ss_pred ---CCeeEEEEE--cCCceEEEEEcCC----ceEEEE--EEEEEEeeCC---------------------------Ccce Q lcl|NC_019418. 139 ---GDKIRVAFI--QAPVFLPLQSNTQ----DVSSAA--ILTKTIKTEN---------------------------RKNV 180 (527) Q Consensus 139 ---~~~~~i~~v--~a~~~~P~~~d~~----~~~~~a--~~~~~~~~~~---------------------------~~~~ 180 (527) ..+++|..+ ++..+| ||.. ....|- |..+++..+. .+.+ T Consensus 149 ~~~~~~i~i~~~~~~~~~v~---~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~~~~~~~d~v 225 (708) T protein:vir:17 149 MDDRQRIAIEPIYDPSRSVW---FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWEYDWFDADVI 225 (708) T ss_pred CCCccccceEeeccchhhee---cCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhhhhhccccccccCCCeE Confidence 124555443 445555 2211 111121 1111110000 0011 Q ss_pred EEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccc-------cCcee----eccc--c-cCCcccceeecCCCc Q lcl|NC_019418. 181 YYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQ-------LGERV----NLSE--L-YPDLQPVTPIQGLSR 246 (527) Q Consensus 181 ~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~-------lG~~v----~l~~--~-~~~l~~~~~~~g~~~ 246 (527) + +.|+..- .+.........+...+.|. .|.+..... .|... +... + |-.+-+...+.+-.. T Consensus 226 r--v~e~~~r-~~~~~~~~~~~~~~~g~~~--~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l~~~~~ 300 (708) T protein:vir:17 226 Y--IAKYYEV-RKESVDVISYRHPITGEIA--TYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFLEKPRR 300 (708) T ss_pred E--EEEEEEE-eeeeeEEEEEecCccCcee--eeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccccCCCC Confidence 0 1111100 0000000000000011000 011100000 00000 0000 0 000011111112111 Q ss_pred cc---EEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHH-cCcceeeechhHhcCCCCCCCcccccccc Q lcl|NC_019418. 247 PL---FTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIK-MGQRRVIVPEQMTQLKVQDNQGNIAFKRR 322 (527) Q Consensus 247 p~---f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~-~~~~~i~v~~~~l~~~~~~~~~~~~~~~~ 322 (527) .+ |-|+|+-..-..-.+.|---+.+.++++.++.+|...|.+.+-+- .++...+++.+.++.......+. T Consensus 301 ~p~~~fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~------ 374 (708) T protein:vir:17 301 IPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEAR------ 374 (708) T ss_pred CCCCccceEEEecccccccCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhc------ Confidence 12 222222111000122331124599999999999999999998874 45666677777653221111110 Q ss_pred cccccceeeeccC---C----CCCCCcceEe-ccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHH Q lcl|NC_019418. 323 FDVEQNVYMQVGA---G----NMDSGGIVDL-TTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSEN 394 (527) Q Consensus 323 ~d~~~~~~~~~~~---~----~~~~~~i~~~-~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~ 394 (527) ..++..|...+. . ..+...+..+ .+.++ ..+...++.....|....|++...+|..+ ..++.+|.+.. T Consensus 375 -~~~~~~~~~~~~~~~~~g~v~~~a~~~~~~~~~~~~-~~~~~llq~~~~~i~~~tGi~d~~~G~~s--n~SG~Ai~~rq 450 (708) T protein:vir:17 375 -NKKRPAFLPLREVRDKYGNIIAGATPAGYTQPAVMN-QALAALLQQTSADIQEVTGGSQAMQQMPS--NIAQETVNNLM 450 (708) T ss_pred -ccchhhhhhhhccCCcccccccccCCcccCCCcccc-HHHHHHHHHHHHHHHHhcCCChHHccCcc--chHHHHHHHHH Confidence 001111111110 0 0001112222 34454 46788898888889999999999888643 24666777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh----c---ccC-----------Cc---------------ccCccceEE Q lcl|NC_019418. 395 SDTYQMRNSIVALVEQSIKELCVSMCELGKVV----G---IYR-----------GT---------------IPELDDISV 441 (527) Q Consensus 395 ~~~~~~~~~~~~~~~~al~~li~~il~~~~~~----~---~~~-----------~~---------------~~~~~~v~v 441 (527) .............+..+.+...+.+|.+...+ . +.+ .. ....++|+| T Consensus 451 ~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v 530 (708) T protein:vir:17 451 NRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTV 530 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEE Confidence 76666666677777777776666666554321 0 110 00 001123344 Q ss_pred EeCCCccCCHHHHHHHHHHHHhcCC-CCHHH-----HHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCC Q lcl|NC_019418. 442 NLDDGVFTDRHAELDYWMKMVAAGF-ATQKR-----GIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVG 515 (527) Q Consensus 442 ~f~d~i~~d~~~~~~~~~~~~~aGi-~s~~~-----~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~ 515 (527) +=..+.+.-.++..+.++++..+.. ....+ .+.++-++.- +++.+++|+....+.....+..+...+..... T Consensus 531 ~~~p~~~t~r~~~~~~l~qll~~~~~~~~~~~~~~~l~l~~~D~p~--~~ei~e~ir~~~~~~~~~~~~~~e~~q~~~q~ 608 (708) T protein:vir:17 531 DVGPSYTARRDATVSVLTNVLSSMLPADPMRPAIQGIILDNIDGEG--LDDFKEYNRNQLLISGIAKPRNEKEQQIVQQA 608 (708) T ss_pred ecccCchhHHHHHHHHHHHHHHhcCCccchhHHHHHHHHHhcCCCC--hHHHHHHHHHHhhccccccCcchhhHHHHHHH Confidence 3222322223444555555544321 11111 1222222321 23345555554432211111111000000000 Q ss_pred CCCC-CCC--ccccC Q lcl|NC_019418. 516 NSKD-TVD--DEDEA 527 (527) Q Consensus 516 ~~~~-~~~--~~~~~ 527 (527) ..-. ..- ...+. T Consensus 609 qq~~q~q~~~~~~ea 623 (708) T protein:vir:17 609 QMAAQSQPNPEMVLA 623 (708) T ss_pred HHHHHHHHHHHHHHH Confidence 0000 000 00000 No 157 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=98.31 E-value=1.5e-06 Score=52.55 Aligned_cols=373 Identities=11% Similarity=0.115 Sum_probs=156.6 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccc-cCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRK-RRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~-~~~~~~lnl~~~i 79 (527) |+||.+.+.-- .+ +... ...|..... ++++... ..|... .+..+..+--..+ T Consensus 1 M~~f~~~~~~~---------~~-------~~~~--------~~~~~~~~~--~~~~~~~-~~~~~v~~~~~~~~~~v~~~ 53 (386) T protein:vir:48 1 MPIFNITNLAT---------ES-------PPIS--------QGGFFDITD--PDFLSTL-NGSEWVSAESALRNSDLFSI 53 (386) T ss_pred Ccccccccccc---------cc-------cccc--------ccccccccc--chhcccc-cCCceechhhhhcchHHHHH Confidence 99987643320 00 0000 000000000 0111110 111100 0111111222345 Q ss_pred HHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCC-e-eEEEEEcCCceEEEEE Q lcl|NC_019418. 80 AKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGD-K-IRVAFIQAPVFLPLQS 157 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~-~-~~i~~v~a~~~~P~~~ 157 (527) ++.+|+-+-.-| +.+.+......+.+--..-.....++.++.+.+..|.+++.+..|.+ + +.+.+++|+.+-+... T Consensus 54 i~~ia~~ia~~p--~~~~~~~~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~ 131 (386) T protein:vir:48 54 INQLSNDLATVK--LTASRKQLQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRL 131 (386) T ss_pred HHHHHHhhccCc--eeeccchhHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCceeEEEEc Confidence 555666554434 34444444333332221222333445566777888999888877643 3 3566677776554321 Q ss_pred cCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCccc Q lcl|NC_019418. 158 NTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQP 237 (527) Q Consensus 158 d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~ 237 (527) . +....+|.. .. ++...|..+. T Consensus 132 ~-----------------~~~~~~y~~-~~------------------------------~~~~~~~~~~---------- 153 (386) T protein:vir:48 132 D-----------------NKDGIYYNI-TF------------------------------DDPRIPPKQH---------- 153 (386) T ss_pred C-----------------CCceEEEEE-Ee------------------------------cCccccceeE---------- Confidence 1 111122210 00 0000110000 Q ss_pred ceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHc-CcceeeechhHhcCCCCCCCcc Q lcl|NC_019418. 238 VTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKM-GQRRVIVPEQMTQLKVQDNQGN 316 (527) Q Consensus 238 ~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~-~~~~i~v~~~~l~~~~~~~~~~ 316 (527) +. .--+.|++.+.++ +..+|+|.+.-+...+.....+-....+-|.. +.+..++ ......... T Consensus 154 ---~~---~~evih~~~~~~~----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii-----~~~~~~~~e- 217 (386) T protein:vir:48 154 ---VP---QGDVLHFKLLSVD----GGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGIL-----KIKGGGLLD- 217 (386) T ss_pred ---ec---CccEEEecCCCCC----CceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEE-----EeCCCCCHH- Confidence 00 0013566644333 33469998888777776555444444444554 3333333 221111100 Q ss_pred cccccccccccceeeeccCCC------CCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHH Q lcl|NC_019418. 317 IAFKRRFDVEQNVYMQVGAGN------MDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEI 390 (527) Q Consensus 317 ~~~~~~~d~~~~~~~~~~~~~------~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei 390 (527) ....+. +. +....... +++..++.++......++++..+....+|+...|++|..+|..+++. ++.+. T Consensus 218 --~~~~~~--~~-~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~-~~e~~ 291 (386) T protein:vir:48 218 --FKTKLS--RS-RQAMKQMQGGPLVLDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQGDQQ-SSLEM 291 (386) T ss_pred --HHHHHH--HH-HHHhhcCCCCceecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc-cHHHH Confidence 000000 00 11000011 11223555565666677888888888999999999999998654432 22221 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCH Q lcl|NC_019418. 391 -VSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQ 469 (527) Q Consensus 391 -~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~ 469 (527) +. .++.+|..++..|....+. .++. ++.+++...+-.|....+....+++.+|++++ T Consensus 292 ~~~--------------~~~~~l~P~~~~ie~~l~~-~l~~-------~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~ 349 (386) T protein:vir:48 292 SLD--------------LYNKAVSRYLRPFLSELSQ-KLSC-------DVDADILPAVDPTGSNSVSRINSMVKSGTLAQ 349 (386) T ss_pred HHH--------------HHHHHHHHHHHHHHHHHHH-hhcc-------hhhcchhhhhccChHHHHHHHHHHHhCCCcCH Confidence 11 2223333333333211110 0110 11122222334556677777888999999999 Q ss_pred HHHHHhc--CCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_019418. 470 KRGIAKT--LGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDE 524 (527) Q Consensus 470 ~~~i~~~--~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (527) -+++... .|+.+.++.+ .+ .. . ..+.++++++ +.| T Consensus 350 nE~r~~lg~~~~~~~~~~~----~~--~~---~---~~~~~gGd~~--------~~~ 386 (386) T protein:vir:48 350 NQGLYILQQAEILPKELPE----GE--NP---N---KTTLKGGEIN--------GED 386 (386) T ss_pred HHHHHHhhcCCCCCccchh----hc--CC---C---CCccCCCCCC--------CCC Confidence 9887653 3444333211 10 00 0 0011111111 111 No 158 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=98.29 E-value=1.6e-06 Score=52.29 Aligned_cols=381 Identities=10% Similarity=0.073 Sum_probs=159.8 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) |++| +||++.. +.... ..+..-.+. ..-..+..++.+..... +. .+.-+...--..++ T Consensus 3 m~~~----~~~~~~~-----~~~~~--~~~~~~~~~---~~~~~~~~~~~~~~g~~----v~----~~~al~~~~v~~~v 60 (392) T protein:vir:74 3 LPIL----NFINQTN-----DPPEA--GSVQSYFPD---GNDAQIMESLLGDNNEW----VS----ARAALRNSDLFSII 60 (392) T ss_pred chhh----hhhhccc-----Ccccc--ccccccccc---CchhhhhhhccCCCCcc----cc----hhhhhcchHHHHHH Confidence 5554 5665422 11000 000000000 00001122222221110 00 01111222233455 Q ss_pred HHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEEEcCCceEEEEEc Q lcl|NC_019418. 81 KKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DK-IRVAFIQAPVFLPLQSN 158 (527) Q Consensus 81 ~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~v~a~~~~P~~~d 158 (527) +.+|+-+-+=| +.+-.......+++--..-.-..-++..+.+.+..|.+++.+..+. |+ ..+..++|+++-+...+ T Consensus 61 ~~ia~~ia~lp--~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~ 138 (392) T protein:vir:74 61 LQLSSDLAIVK--INAEKKKNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFE 138 (392) T ss_pred HHHHHhhccCc--eeeccchhhhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcC Confidence 55666554433 3444433333333211111123334456667788899988887775 33 35666777765543221 Q ss_pred CCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccc Q lcl|NC_019418. 159 TQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPV 238 (527) Q Consensus 159 ~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~ 238 (527) .+ +..+|.. ... ++. .+..+.+ + +. T Consensus 139 ~~-----------------~~~~y~~-~~~---------------~~~---------------~~~~~~~---~---~~- 163 (392) T protein:vir:74 139 YE-----------------NGMYYNI-TFD---------------DPK---------------IEPILQA---P---QS- 163 (392) T ss_pred CC-----------------ceEEEEE-Eec---------------CCc---------------cceeEEE---c---Cc- Confidence 11 1122211 000 000 0000000 0 00 Q ss_pred eeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCcccc Q lcl|NC_019418. 239 TPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGNIA 318 (527) Q Consensus 239 ~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~~~ 318 (527) -+.||+.+..+ +...|+|.+.-+...|+....+-.-..+-|+.|-. |..++....+... ... T Consensus 164 ---------evih~~~~~~~----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~----p~~il~~~~~~~~-~~~ 225 (392) T protein:vir:74 164 ---------DLIHMKLLSID----GGKTGISPLYSLRRESKIQRASDRLTISSLNSSLN----VPGVLTVKGGGLL-SDK 225 (392) T ss_pred ---------cEEEecCCCCC----CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCC----CceEEEeCCCCCc-hHH Confidence 13455543222 33469999988888885444443333334565432 2222221111000 000 Q ss_pred cccccccccceeeec-cCCC----CCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHH Q lcl|NC_019418. 319 FKRRFDVEQNVYMQV-GAGN----MDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSE 393 (527) Q Consensus 319 ~~~~~d~~~~~~~~~-~~~~----~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~ 393 (527) ....+ ..-|.+. +.+. .++..++.++.+..+.++.+..+....+|+...|++|..+|+.++...++.+..+ T Consensus 226 ~~~~~---~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~e~~~~- 301 (392) T protein:vir:74 226 DKASR---SRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQISG- 301 (392) T ss_pred HHHHH---HHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHH- Confidence 00000 0111111 1100 1223456666666777888888888899999999999999865443322222211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHH Q lcl|NC_019418. 394 NSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGI 473 (527) Q Consensus 394 ~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i 473 (527) .++.+|..+++.|..-... .+.. .+.+++..-+-.|..+.+..+..++.+|++++.+++ T Consensus 302 -------------~~~~~l~p~~~~ie~~l~~-~l~~-------~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near 360 (392) T protein:vir:74 302 -------------MYASALNRYLRPAISELEY-KLSD-------HISVNMRPAIDPLGDNYLSTISTATRWGALAENQAT 360 (392) T ss_pred -------------HHHHHHHHHHHHHHHHHHH-hccc-------hhcccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHH Confidence 1223333333333221111 0110 122222233345667778888899999999999876 Q ss_pred Hhc--CCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCC Q lcl|NC_019418. 474 AKT--LGITEEEAEKELAEINGELPPESDAELALYGKGQQNTV 514 (527) Q Consensus 474 ~~~--~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~ 514 (527) ... -|+...|+.+ .|..+ +..++++.+..| T Consensus 361 ~~~~~~g~~pne~r~------~enl~-----~~~~Gd~~~p~p 392 (392) T protein:vir:74 361 FVLQEAGYIPKDLPA------PENTN-----KKTTGQSNEPVP 392 (392) T ss_pred HHHHhCCCCccccch------hcCCC-----CCCCCCCCCCCC Confidence 543 4665444432 11111 111122222211 No 159 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=98.24 E-value=2.2e-06 Score=51.65 Aligned_cols=391 Identities=12% Similarity=0.071 Sum_probs=167.3 Q ss_pred ChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHH Q lcl|NC_019418. 2 SLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAK 81 (527) Q Consensus 2 ~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~ 81 (527) .| ++.||+| + ....+..+.+ |..++.+..+...-..+.. ...+...---..++ T Consensus 1 ~~---f~~~f~r-------~------~~~~~~~~~~-------~~~~~~~~~~~~~g~~v~~----~~~l~~~~v~~~i~ 53 (413) T protein:vir:48 1 MF---FSGLFQR-------K------SDAPVTTPAE-------LAEAIGLSYDTYTGKRISS----QRAMRLTAVYSCVR 53 (413) T ss_pred Cc---cchhhcc-------C------ccCCccchHH-------HHHhhhcCcccccCceech----hhhhccHHHHHHHH Confidence 11 1222222 1 1112222211 2223322111110000000 11111122234445 Q ss_pred HHhhhhhcccceEe---------eCCHHHHHHHHH-HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCee-EEEEEcCC Q lcl|NC_019418. 82 KIASLVYNEQAEIS---------AEDETLNDFLSD-MLSNDRFNKNFERYLESALALGGLAMRPYVDGDKI-RVAFIQAP 150 (527) Q Consensus 82 ~~A~ll~~e~~~i~---------~~d~~~~~~l~~-~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~-~i~~v~a~ 150 (527) .+|+-+-+=|..+- +.+..+...|.. --..-.....+..++...+..|.+++.+..+.|++ .+..++|+ T Consensus 54 ~Ia~~iA~~p~~~~~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~g~~~~L~~l~~~ 133 (413) T protein:vir:48 54 VLAESVGMLPCSLYKISGTLKTRVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKALGEVVELLPIDPG 133 (413) T ss_pred HHHHhhhhCceEEEEecCCcceeecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeCCCcEEEEEEEcCc Confidence 55555444443321 122233334321 10111223334556677778899988887776654 35556666 Q ss_pred ceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeeccc Q lcl|NC_019418. 151 VFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSE 230 (527) Q Consensus 151 ~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~ 230 (527) .+-+.. +.++ ..+|. +..+ .+... .|. T Consensus 134 ~v~~~~-~~~~-----------------~~~y~-~~~~-----------------~g~~~--~~~--------------- 160 (413) T protein:vir:48 134 CVEPKL-NSQW-----------------QPVYQ-VTFP-----------------DGSVD--VLT--------------- 160 (413) T ss_pred eEEEEE-cCCc-----------------eEEEE-EEec-----------------CceEE--EEc--------------- Confidence 655432 1111 11111 0000 00000 010 Q ss_pred ccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHc-CcceeeechhHhcCC Q lcl|NC_019418. 231 LYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKM-GQRRVIVPEQMTQLK 309 (527) Q Consensus 231 ~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~-~~~~i~v~~~~l~~~ 309 (527) +. -+.|++.+.. ...+|+|.+.-+...++-....-....+-|+. +.++-++ ... T Consensus 161 -----~~----------evih~~~~~~-----d~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil-----~~~ 215 (413) T protein:vir:48 161 -----QD----------EIWHVRTLTL-----DGLVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVL-----RTE 215 (413) T ss_pred -----cc----------cEEEecCcCC-----CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEE-----EeC Confidence 00 1235553321 23579999988888887555443333344555 3333333 222 Q ss_pred CCCCCcc-cccccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccc Q lcl|NC_019418. 310 VQDNQGN-IAFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQG 383 (527) Q Consensus 310 ~~~~~~~-~~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g 383 (527) ....... -.....| +..|.+.+ .+ -.++..++.++......++.+..+....+|+...|++|..+|...++ T Consensus 216 ~~~~~e~~~~~~~~~---~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~ 292 (413) T protein:vir:48 216 QKLTPDAYERLKKDF---EERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRA 292 (413) T ss_pred CCCCHHHHHHHHHHH---HHHhcCccccCcceecCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCC Confidence 1111000 0000111 11122211 00 01122455566666677888888888889999999999999875433 Q ss_pred -cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHH Q lcl|NC_019418. 384 -VKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMV 462 (527) Q Consensus 384 -~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~ 462 (527) -.++.+.. ...++.+|.-++..|....+. .++.........+.++++.-+..|..+.++...+++ T Consensus 293 t~~n~e~~~-------------~~f~~~~i~P~~~~ie~~l~~-~L~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~ 358 (413) T protein:vir:48 293 TFNNIEELG-------------LGFINYSLVPYLTRIEQRINT-GLVRESKQGKFYAKFNAGALLRGDMKSRFEAYATGI 358 (413) T ss_pred CcccHHHHH-------------HHHHHHHHHHHHHHHHHHHHh-hccCccccCCeEEEEechhhhccCHHHHHHHHHHHH Confidence 22333321 122333444444443322111 122111122344666666766778899999999999 Q ss_pred hcCCCCHHHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 463 AAGFATQKRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 463 ~aGi~s~~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) .+|+|++-+++.. .|+..-+ ..+-+. ... ........+.+....+++++||+ T Consensus 359 ~~g~~T~NE~R~~-~g~~p~~ggD~~~~--~~n----------~~~~~~~~~~~~~~~~~~~~~~~ 411 (413) T protein:vir:48 359 NWGIYSPNDCRDL-EDMNPRPGGDVYLT--PMN----------MTTSPSAGDDNGKKKESGDADKT 411 (413) T ss_pred hCCCcCHHHHHHH-hCCCCCCCcceeec--ccc----------ccccccccccCCCCCCCCCcccc Confidence 9999999997654 3654211 100000 000 00000111111223334455555 No 160 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=98.24 E-value=2.2e-06 Score=51.65 Aligned_cols=459 Identities=9% Similarity=0.039 Sum_probs=182.0 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCccccccc--ccCccccCceeecchHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTN--TDGDRKRRKMQHLPIART 78 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~--~~~~~~~~~~~~lnl~~~ 78 (527) |..-++...+ ..++......|+.++.-.-|-+.... ..+......+.-=..+.. T Consensus 1 m~~~~r~~~L------------------------~~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~~~~~~~~~dstg~~ 56 (522) T protein:vir:10 1 MKARERYNQL------------------------TTARQMFLDKAVECSELTLPYLIDDDISSRPNHKSLTVPWQSVGAK 56 (522) T ss_pred CchHHHHHHH------------------------HHHhhHHHHHHHHHHHHhhhcccCCCCCCCcccccccccccchHHH Confidence 4432211111 22233456667776654333221111 111111111122255667 Q ss_pred HHHHHhhhhhcc--cc-----eEeeCCHH------------HHHH-------HHHHHhhhhHHHHHHHHHHHHHhcCCEE Q lcl|NC_019418. 79 AAKKIASLVYNE--QA-----EISAEDET------------LNDF-------LSDMLSNDRFNKNFERYLESALALGGLA 132 (527) Q Consensus 79 i~~~~A~ll~~e--~~-----~i~~~d~~------------~~~~-------l~~~l~~n~f~~~~~~~~~~a~~~G~~~ 132 (527) .++.+|+-|.+- || ++.+.+.. .+++ +...+..++|...+.++..+....|.+. T Consensus 57 a~~~LAa~l~~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ 136 (522) T protein:vir:10 57 CCVTLAAKLMLAVLPPQTSFFKLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNAL 136 (522) T ss_pred HHHHHHHHHHHhhcCCCCccccccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcee Confidence 777777754443 11 22333221 2333 3355778999999999999999999988 Q ss_pred EEEEEeCCeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEE Q lcl|NC_019418. 133 MRPYVDGDKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNE 212 (527) Q Consensus 133 ~~~~~d~~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ 212 (527) + |.+.+.+ +.++-.+++ +..|..|++..++....+...-- ...| -+....... ......+....|-|. T Consensus 137 l--y~~~~~~--~~~pl~~y~-v~~d~~G~vd~i~r~~~~t~~ql-~~~f-----g~~~~~~~~-~~~~~~~~~v~v~~~ 204 (522) T protein:vir:10 137 I--FMGKDGL--KTFPLTRYV-INRDGDGNVLEIVTKELISRKVL-DIEL-----PEPKPNTGI-DESSTTNDDVTIYTY 204 (522) T ss_pred E--EEcCCCc--eEEEcceEE-EeeCCCCCeeEEEeeeeccHHHH-HHhc-----chhccchhh-hcccCCCCceEEEEE Confidence 4 6676654 346666655 45677777776664332210000 0000 000000000 000000111222222 Q ss_pred EEecCCccccCceeecccccCC-cccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_019418. 213 LYKSTSDSQLGERVNLSELYPD-LQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWE 291 (527) Q Consensus 213 ly~~~~~~~lG~~v~l~~~~~~-l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e 291 (527) +|...+. |..+-..+.-.. +.......|....+|..++- +...++.||+|-...+.+-++.|+..--....- T Consensus 205 v~p~~~~---~~~~~~~~~~~~~~~~~~s~~g~~~~P~~~~Rw----~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~ 277 (522) T protein:vir:10 205 VKLDKSS---GRWVWHQEAFDKIIPDSRSTAPKNASPWLPLRF----NTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEG 277 (522) T ss_pred EEeeccC---CceEEEEccCCccccccccccccccCCceeeee----eecCCCccccchHHHHHHHHHHHHHHHHHHHHH Confidence 2211110 000000000000 00000011222222332321 223467899999999999999999876555544 Q ss_pred H-HcCcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEec----cccChHHHHHHHHHHHHHH Q lcl|NC_019418. 292 I-KMGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLT----TPIRSSDYISAISEGLKLF 366 (527) Q Consensus 292 ~-~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~----~~ir~e~~~~~~~~~l~~i 366 (527) . ...++.+.||++.+....+...+ ....+. . +....+..++ .++. .....++.+.+.| T Consensus 278 ~~~a~~p~~lv~~~~~~~~~~l~~~----------~~~~~v---~--g~~~~v~~~~~~~~~d~~--~~~~~i~~~~~ri 340 (522) T protein:vir:10 278 AAAASKVVFLVSPSSTTKPATIAKA----------GNGAIV---Q--GRPEDVAVIQVGKTADFS--TAANMATAIEKRL 340 (522) T ss_pred HHHhcCCceeeccccccccccccCC----------CCccee---c--CCCccceeecccccccch--HHHHHHHHHHHHH Confidence 4 44666667755543222111110 001111 1 1111222222 2222 2233344444434 Q ss_pred HHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhcccCCcccCccc-eEEEeC Q lcl|NC_019418. 367 EMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKVVGIYRGTIPELDD-ISVNLD 444 (527) Q Consensus 367 ~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~~~~~~~~~~~~~~-v~v~f~ 444 (527) ....-+ +....+...|||||....+...+..+-.-..+ ...|.-|+..++.+..-.++....+.+... ..|++- T Consensus 341 ~~aFl~----~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~~v~~i 416 (522) T protein:vir:10 341 LEAFLV----MNVRNAERVTAEEVRLTQLELEQQLGGIFSLLVIEFLIPYLNRTLLVLQRSNQIPKLPKDIVRPTIVAGV 416 (522) T ss_pred HHHHhh----ccCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccccccccccch Confidence 322111 11223456799999999988888776644444 445556666666665433332222221111 123332 Q ss_pred CCccCCHHHHHHHHHHHHhc--CCC---------CHHHHHH---hcCCC-------CHHHHHHHHHHHHHhc-----ccc Q lcl|NC_019418. 445 DGVFTDRHAELDYWMKMVAA--GFA---------TQKRGIA---KTLGI-------TEEEAEKELAEINGEL-----PPE 498 (527) Q Consensus 445 d~i~~d~~~~~~~~~~~~~a--Gi~---------s~~~~i~---~~~~~-------~deea~~el~ri~~E~-----~~~ 498 (527) ..+.. ...++.++...+. .++ ...+++. ...|+ |++|+.++-++.++.+ ++. T Consensus 417 s~Lar--aq~~~~l~~~~~~i~~~~~p~~~~~~id~d~~~~~~a~~~Gvp~~~ivrt~eev~~~~q~~q~~~~~~~~~~~ 494 (522) T protein:vir:10 417 NALGR--GQDRESLTAFVGTIAQTLGPEALMQYLNPLEAIKRLAAAQGIDVLNLVKTEQQLAEEQQAAQQQAAQQSLVDQ 494 (522) T ss_pred hHHHH--HHHHHHHHHHHHHHHHhhCchhhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHH Confidence 22211 1111222111100 111 1222222 23454 4444433322221111 111 Q ss_pred ccc-ccCCCCCCCCCCCCCCCCCCCccc Q lcl|NC_019418. 499 SDA-ELALYGKGQQNTVGNSKDTVDDED 525 (527) Q Consensus 499 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 525 (527) +.. ....-.++..++..-+-.+....+ T Consensus 495 a~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (522) T protein:vir:10 495 AGQMTGSPLMDPTKNPQLMDEEQPPMEE 522 (522) T ss_pred HHHHhcccccCccccHHHHHHhCCCCCC Confidence 111 111111222222111111111111 No 161 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=98.22 E-value=2.4e-06 Score=51.35 Aligned_cols=435 Identities=13% Similarity=0.132 Sum_probs=158.2 Q ss_pred HHHHHHHHH--HHh-hcccchhhhccCccccCHH---HHHHHHHHHHHhc-------CCCcc-----cccccc-cCcccc Q lcl|NC_019418. 7 VKDFFNRGR--YNM-TTSHLSSILDHPKVAVTQS---EFRRIQHNLAYYQ-------SKFDD-----IEYTNT-DGDRKR 67 (527) Q Consensus 7 ~k~~~~~~~--~~~-~~~~~~~~~~~~~i~~~~~---~~~~i~~~~~~y~-------g~~~~-----l~~~~~-~~~~~~ 67 (527) |-..|+... .-. ..|.++ ++.| ++. ....|+..-..|. ++.+. ...... .|-..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~----~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~ 74 (563) T protein:vir:99 1 MADLFKQFRLGKDYGNNSTIA----QVPI--DEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDK 74 (563) T ss_pred Chhhhhhhhcccccccccccc----eeec--cCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhccccccccc Confidence 333333211 001 112211 1222 111 1222222211111 11100 000000 000000 Q ss_pred --------------CceeecchHHHHHHHHhhhhhcccc-----------eEeeC-------CH--HHHHHHHHHHh--- Q lcl|NC_019418. 68 --------------RKMQHLPIARTAAKKIASLVYNEQA-----------EISAE-------DE--TLNDFLSDMLS--- 110 (527) Q Consensus 68 --------------~~~~~lnl~~~i~~~~A~ll~~e~~-----------~i~~~-------d~--~~~~~l~~~l~--- 110 (527) +....-++...+++..|+.+..-.| .+.+. .. .....|...+. T Consensus 75 ~~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~ 154 (563) T protein:vir:99 75 RSYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTG 154 (563) T ss_pred ccCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcC Confidence 0001112334444444443221100 11111 11 11223433332 Q ss_pred -h-----hhHHHHHHHHHHHHHhcCCEEEEEEEeC---Ce-eEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcce Q lcl|NC_019418. 111 -N-----DRFNKNFERYLESALALGGLAMRPYVDG---DK-IRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNV 180 (527) Q Consensus 111 -~-----n~f~~~~~~~~~~a~~~G~~~~~~~~d~---~~-~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~ 180 (527) + ..|...+..++.+.+..|.+++.+.+.. ++ +.+.+++|..+.+...+++.... ... T Consensus 155 ~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~-------------~~~ 221 (563) T protein:vir:99 155 KDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIK-------------GGK 221 (563) T ss_pred CCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceec-------------cce Confidence 1 1355666778888999999998877642 33 45777888887775322211100 000 Q ss_pred EEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccc Q lcl|NC_019418. 181 YYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNK 260 (527) Q Consensus 181 ~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~ 260 (527) .|... . ++....+ | + +.++ +-+++++..+ T Consensus 222 ~y~~~----------------~-~g~~~~~---~------------~--------~~ev---------I~~~~~~~~d-- 250 (563) T protein:vir:99 222 RFVQV----------------V-DKRVVAS---F------------T--------SREL---------AMGIRNPRTE-- 250 (563) T ss_pred eEEEE----------------e-CCceeEE---e------------c--------Ccce---------EEEeccCCCC-- Confidence 01000 0 0000000 0 0 0000 0122222211 Q ss_pred cCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCC---CCcccccccccccccceeeecc-CC Q lcl|NC_019418. 261 DINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQD---NQGNIAFKRRFDVEQNVYMQVG-AG 336 (527) Q Consensus 261 ~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~---~~~~~~~~~~~d~~~~~~~~~~-~~ 336 (527) ....++|+|.+.-+...|.....+-.-..+-|..|.. |..+|....+. +...-.....| +..|.+.+ .+ T Consensus 251 ~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~----p~giL~~~~~~~ls~e~~~~~~~~~---~~~~~G~~nag 323 (563) T protein:vir:99 251 LSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGT----TRGILQIRSDQQQSQHALENFKREW---KSSLSGINGSW 323 (563) T ss_pred cccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCC----CceEEEeCCCCCCCHHHHHHHHHHH---HHHhccccccc Confidence 1124679999988887776544444333344555432 22222211110 00000000111 11122211 00 Q ss_pred -----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHH-HHHHHHHHHHHHH-HHHHHHH Q lcl|NC_019418. 337 -----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATE-IVSENSDTYQMRN-SIVALVE 409 (527) Q Consensus 337 -----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAte-i~s~~~~~~~~~~-~~~~~~~ 409 (527) -+++..++.++......++++......++|+...|++|..+|+...+..+++. -.+..+ .++. ..+..++ T Consensus 324 k~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~---sn~e~~~~~f~~ 400 (563) T protein:vir:99 324 QIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNE---ADPGKKQQQSQN 400 (563) T ss_pred cceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhh---ccHHHHHHHHHH Confidence 01222455566666778889999989999999999999999876543221111 100000 0010 1112233 Q ss_pred HHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHH-HHHHHH Q lcl|NC_019418. 410 QSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLGITEE-EAEKEL 488 (527) Q Consensus 410 ~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~de-ea~~el 488 (527) .+|..++..|-...+. .|.. .....+.+.|...-..+. .++....+++.+|+|++-+++.++ |+..- ....-+ T Consensus 401 ~tL~P~l~~ie~~ln~-~L~~---~~~~~~~~~f~r~D~~~~-~e~~~~~~~~~~G~lT~NE~R~~~-gl~Pi~gGD~~~ 474 (563) T protein:vir:99 401 KGLQPLLRFIEDLVNR-HIIS---EYGDKYTFQFVGGDTKSA-TDKLNILKLETQIFKTVNEAREEQ-GKKPIEGGDIIL 474 (563) T ss_pred HHHHHHHHHHHHHHHh-hhch---hcccccEEEeccCCHHHH-HHHHHHHHHhcCCccCHHHHHHHh-CCCCCCCcceee Confidence 4444444433322111 1111 112345677765433332 223334456788999999976553 44211 000000 Q ss_pred -----HHH-------HHhccc-cccc--ccCCCCCCCCCCCCCCCCCC-CccccC Q lcl|NC_019418. 489 -----AEI-------NGELPP-ESDA--ELALYGKGQQNTVGNSKDTV-DDEDEA 527 (527) Q Consensus 489 -----~ri-------~~E~~~-~~~~--~~~~~~~~~~~~~~~~~~~~-~~~~~~ 527 (527) ..+ ..+... .... ..+....+.++++.+..+++ +++.+. T Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 529 (563) T protein:vir:99 475 DASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKEI 529 (563) T ss_pred cccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCCcccc Confidence 000 000000 0000 00000111111111111111 111111 No 162 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=98.22 E-value=2.4e-06 Score=51.35 Aligned_cols=435 Identities=13% Similarity=0.132 Sum_probs=158.2 Q ss_pred HHHHHHHHH--HHh-hcccchhhhccCccccCHH---HHHHHHHHHHHhc-------CCCcc-----cccccc-cCcccc Q lcl|NC_019418. 7 VKDFFNRGR--YNM-TTSHLSSILDHPKVAVTQS---EFRRIQHNLAYYQ-------SKFDD-----IEYTNT-DGDRKR 67 (527) Q Consensus 7 ~k~~~~~~~--~~~-~~~~~~~~~~~~~i~~~~~---~~~~i~~~~~~y~-------g~~~~-----l~~~~~-~~~~~~ 67 (527) |-..|+... .-. ..|.++ ++.| ++. ....|+..-..|. ++.+. ...... .|-..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~----~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~ 74 (563) T protein:vir:95 1 MADLFKQFRLGKDYGNNSTIA----QVPI--DEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDK 74 (563) T ss_pred Chhhhhhhhcccccccccccc----eeec--cCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhccccccccc Confidence 333333211 001 112211 1222 111 1222222211111 11100 000000 000000 Q ss_pred --------------CceeecchHHHHHHHHhhhhhcccc-----------eEeeC-------CH--HHHHHHHHHHh--- Q lcl|NC_019418. 68 --------------RKMQHLPIARTAAKKIASLVYNEQA-----------EISAE-------DE--TLNDFLSDMLS--- 110 (527) Q Consensus 68 --------------~~~~~lnl~~~i~~~~A~ll~~e~~-----------~i~~~-------d~--~~~~~l~~~l~--- 110 (527) +....-++...+++..|+.+..-.| .+.+. .. .....|...+. T Consensus 75 ~~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~ 154 (563) T protein:vir:95 75 RSYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTG 154 (563) T ss_pred ccCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcC Confidence 0001112334444444443221100 11111 11 11223433332 Q ss_pred -h-----hhHHHHHHHHHHHHHhcCCEEEEEEEeC---Ce-eEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcce Q lcl|NC_019418. 111 -N-----DRFNKNFERYLESALALGGLAMRPYVDG---DK-IRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNV 180 (527) Q Consensus 111 -~-----n~f~~~~~~~~~~a~~~G~~~~~~~~d~---~~-~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~ 180 (527) + ..|...+..++.+.+..|.+++.+.+.. ++ +.+.+++|..+.+...+++.... ... T Consensus 155 ~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~-------------~~~ 221 (563) T protein:vir:95 155 KDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIK-------------GGK 221 (563) T ss_pred CCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceec-------------cce Confidence 1 1355666778888999999998877642 33 45777888887775322211100 000 Q ss_pred EEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccc Q lcl|NC_019418. 181 YYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNK 260 (527) Q Consensus 181 ~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~ 260 (527) .|... . ++....+ | + +.++ +-+++++..+ T Consensus 222 ~y~~~----------------~-~g~~~~~---~------------~--------~~ev---------I~~~~~~~~d-- 250 (563) T protein:vir:95 222 RFVQV----------------V-DKRVVAS---F------------T--------SREL---------AMGIRNPRTE-- 250 (563) T ss_pred eEEEE----------------e-CCceeEE---e------------c--------Ccce---------EEEeccCCCC-- Confidence 01000 0 0000000 0 0 0000 0122222211 Q ss_pred cCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCC---CCcccccccccccccceeeecc-CC Q lcl|NC_019418. 261 DINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQD---NQGNIAFKRRFDVEQNVYMQVG-AG 336 (527) Q Consensus 261 ~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~---~~~~~~~~~~~d~~~~~~~~~~-~~ 336 (527) ....++|+|.+.-+...|.....+-.-..+-|..|.. |..+|....+. +...-.....| +..|.+.+ .+ T Consensus 251 ~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~----p~giL~~~~~~~ls~e~~~~~~~~~---~~~~~G~~nag 323 (563) T protein:vir:95 251 LSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGT----TRGILQIRSDQQQSQHALENFKREW---KSSLSGINGSW 323 (563) T ss_pred cccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCC----CceEEEeCCCCCCCHHHHHHHHHHH---HHHhccccccc Confidence 1124679999988887776544444333344555432 22222211110 00000000111 11122211 00 Q ss_pred -----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHH-HHHHHHHHHHHHH-HHHHHHH Q lcl|NC_019418. 337 -----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATE-IVSENSDTYQMRN-SIVALVE 409 (527) Q Consensus 337 -----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAte-i~s~~~~~~~~~~-~~~~~~~ 409 (527) -+++..++.++......++++......++|+...|++|..+|+...+..+++. -.+..+ .++. ..+..++ T Consensus 324 k~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~---sn~e~~~~~f~~ 400 (563) T protein:vir:95 324 QIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNE---ADPGKKQQQSQN 400 (563) T ss_pred cceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhh---ccHHHHHHHHHH Confidence 01222455566666778889999989999999999999999876543221111 100000 0010 1112233 Q ss_pred HHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHH-HHHHHH Q lcl|NC_019418. 410 QSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLGITEE-EAEKEL 488 (527) Q Consensus 410 ~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~de-ea~~el 488 (527) .+|..++..|-...+. .|.. .....+.+.|...-..+. .++....+++.+|+|++-+++.++ |+..- ....-+ T Consensus 401 ~tL~P~l~~ie~~ln~-~L~~---~~~~~~~~~f~r~D~~~~-~e~~~~~~~~~~G~lT~NE~R~~~-gl~Pi~gGD~~~ 474 (563) T protein:vir:95 401 KGLQPLLRFIEDLVNR-HIIS---EYGDKYTFQFVGGDTKSA-TDKLNILKLETQIFKTVNEAREEQ-GKKPIEGGDIIL 474 (563) T ss_pred HHHHHHHHHHHHHHHh-hhch---hcccccEEEeccCCHHHH-HHHHHHHHHhcCCccCHHHHHHHh-CCCCCCCcceee Confidence 4444444433322111 1111 112345677765433332 223334456788999999976553 44211 000000 Q ss_pred -----HHH-------HHhccc-cccc--ccCCCCCCCCCCCCCCCCCC-CccccC Q lcl|NC_019418. 489 -----AEI-------NGELPP-ESDA--ELALYGKGQQNTVGNSKDTV-DDEDEA 527 (527) Q Consensus 489 -----~ri-------~~E~~~-~~~~--~~~~~~~~~~~~~~~~~~~~-~~~~~~ 527 (527) ..+ ..+... .... ..+....+.++++.+..+++ +++.+. T Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 529 (563) T protein:vir:95 475 DASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKEI 529 (563) T ss_pred cccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCCcccc Confidence 000 000000 0000 00000111111111111111 111111 No 163 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=98.20 E-value=2.7e-06 Score=51.15 Aligned_cols=424 Identities=11% Similarity=0.057 Sum_probs=162.3 Q ss_pred CC-hHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCC-cccccc---ccc--CccccCceeec Q lcl|NC_019418. 1 MS-LIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKF-DDIEYT---NTD--GDRKRRKMQHL 73 (527) Q Consensus 1 m~-~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~-~~l~~~---~~~--~~~~~~~~~~l 73 (527) -+ +|=+.-+.|||-. +..+. .++. ++.+.. + . .+.+.. ..++.. ... .+...+..... T Consensus 60 ~~~~~~~~~~~~kk~~---i~~pf----kkk~---~~~~~d-~---f-~~s~es~s~vtsls~pdaf~~vnVs~~~Alkn 124 (945) T protein:vir:10 60 YSIIIFRKNQVLKKEK---IIVPY----NHQE---PPFKFN-L---F-EYSPESLMYLPSISDPDAFFLINLFRKYRFNN 124 (945) T ss_pred eeeeeehhhhHHHhhc---ccccc----cccc---cchhhh-h---h-hccCccceecccccCccceeeehhhhhhhhcc Confidence 01 1112223333311 11111 0000 000000 0 0 122211 011000 000 00000111111 Q ss_pred chHHHHHHHHhhhhhcccceEe--eCCH---------HHHHHHHHHHhh-------hhHHHHH-HHHHHHHHhcCCEEEE Q lcl|NC_019418. 74 PIARTAAKKIASLVYNEQAEIS--AEDE---------TLNDFLSDMLSN-------DRFNKNF-ERYLESALALGGLAMR 134 (527) Q Consensus 74 nl~~~i~~~~A~ll~~e~~~i~--~~d~---------~~~~~l~~~l~~-------n~f~~~~-~~~~~~a~~~G~~~~~ 134 (527) ..-...++.+|+-+-+-|..+- ..+. .....+..++.+ -.|++.+ +.++.+.+..|.+++. T Consensus 125 saV~scI~~IA~sIAsLPlklYrr~edG~~~~~~kk~~~~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYie 204 (945) T protein:vir:10 125 DSKLIKVSEIPKKLTSKELEIYKHIEDKHVNYYLKRIRDARNILEFLERPDPYFSEVNSWEYLLGMVLDDILTIDRGAIV 204 (945) T ss_pred HHHHHHHHHHHhhhccCceEEEEecccCcccccccccccchHHHHHHhCCCcccChhHHHHHHHHHHHHHHhhcCCeEEE Confidence 1222344555555444443321 1110 112233344432 1244433 4456788889999998 Q ss_pred EEEeC-Cee-EEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEE Q lcl|NC_019418. 135 PYVDG-DKI-RVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNE 212 (527) Q Consensus 135 ~~~d~-~~~-~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ 212 (527) +..+. |++ .+..++|+++-|...++++... +|. +. .++...+. T Consensus 205 IiRd~~G~ii~L~pLdPs~Vti~~ddDG~~~y----------------~Yv----~~-------------idG~~~~~-- 249 (945) T protein:vir:10 205 KIRDEQGNLVAITPVDGTTIKPILSEDTGIVV----------------GYV----QE-------------VDGAIVAH-- 249 (945) T ss_pred EEECCCCcEEEEEEECCcceEEEEcCCCcEEE----------------EEE----Ee-------------cCCceEEE-- Confidence 87754 443 5777888887775433222110 010 00 00000000 Q ss_pred EEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHH-HH Q lcl|NC_019418. 213 LYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFM-WE 291 (527) Q Consensus 213 ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~-~e 291 (527) |. +.. . .-+++++.++. ...++|+|.+..+...+.....+ ..+. +- T Consensus 250 -v~--------------------a~D-------v--Ilhirn~s~DG--~~~GyGlSPIeaa~~aI~~alAa-ek~aar~ 296 (945) T protein:vir:10 250 -FD--------------------KRD-------V--VLFRQNLTPDV--YMYGYSLPPIEILYKVILSDIFI-DKGNLDY 296 (945) T ss_pred -ec--------------------CCc-------e--EEEeccCCCCc--ccccCCchHHHHHHHHHHHHHHH-HHHHHHH Confidence 00 000 0 11233332222 22346888887666655443322 2222 22 Q ss_pred HH-cC-cce--eeechhHhcCCCCCCCccccccc--cc-ccccceeeeccCC----CCCCCcceEeccccChHHHHHHHH Q lcl|NC_019418. 292 IK-MG-QRR--VIVPEQMTQLKVQDNQGNIAFKR--RF-DVEQNVYMQVGAG----NMDSGGIVDLTTPIRSSDYISAIS 360 (527) Q Consensus 292 ~~-~~-~~~--i~v~~~~l~~~~~~~~~~~~~~~--~~-d~~~~~~~~~~~~----~~~~~~i~~~~~~ir~e~~~~~~~ 360 (527) |. .| .++ +.++...... ....+.+.... .+ ..-+..+.+.+.+ -+++..++.++....+.++.+..+ T Consensus 297 FskNGa~PsGILsvkg~~~~d--~k~~~~LseEq~erlKe~wee~~sG~NnG~piVLdeGmef~pLs~s~~DaQfLEsrk 374 (945) T protein:vir:10 297 YRKGGSIPEGILAIEPPSYKE--GDIYPQLSREQLESIQRQLQAIMMGDYTQVPILSGGKFTWIDFKGKRRDMQFKELAE 374 (945) T ss_pred HHhCCCccceEEEecCccccc--cccccccCHHHHHHHHHHHHHHhCCcccccceecCCCceEEEccCChhHHHHHHHHH Confidence 43 33 222 2222111100 00000000000 00 0001111111100 012223555666667888888888 Q ss_pred HHHHHHHHhcCCCccccccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccce Q lcl|NC_019418. 361 EGLKLFEMQIGVSSGMFTFDGQGV-KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDI 439 (527) Q Consensus 361 ~~l~~i~~~~g~s~~~~~~~~~g~-~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v 439 (527) ....+|+...|++|..+|+.++.. .++.+.. .+-...+..-+...++.+|...+ + .......+ T Consensus 375 fs~eeIArAFGVPP~lLG~~e~st~SNiEqq~--~~Fv~~tL~Pil~~IEqeLNrkL---l-----------~~~eg~~i 438 (945) T protein:vir:10 375 FVARKICAVYQVSPQDVGILEGSNKATAEVMA--SLTKAKGLEPLMATISKGFDEVV---S-----------EFRNEKDI 438 (945) T ss_pred HHHHHHHHHhCCCHHHcccCCCCCcchHHHHH--HHHHHHHHHHHHHHHHHHHHHhc---c-----------ccccCcee Confidence 888999999999999998765432 2222221 11111233333333444433221 0 01123457 Q ss_pred EEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHH-HHHHHHHHHHHhccccc----ccccCCCCCCCCCCC Q lcl|NC_019418. 440 SVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLGITEE-EAEKELAEINGELPPES----DAELALYGKGQQNTV 514 (527) Q Consensus 440 ~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~de-ea~~el~ri~~E~~~~~----~~~~~~~~~~~~~~~ 514 (527) .+.|+.....|..+.++...+++.+|+|++-+++++. |+..- .-..-+-.... ..+.+ ......+....+... T Consensus 439 ~fdFd~ldl~D~ksraEal~kli~sGiLTiNEvRe~l-GLpPIeGGD~lli~~nn-~~P~d~~~ka~~ga~p~q~aq~~~ 516 (945) T protein:vir:10 439 KLWFKEDDLEKERDWWNIIQGQLNTGFRSINEARMEK-GLEPVPWGDVPFSGLRN-WKPEDEQAKAQQGAMPPQLAQAMA 516 (945) T ss_pred EEEecchhccCHHHHHHHHHHHHhCCCcCHHHHHHHh-CCCCCCCcceeeecccc-ccccccccccccCCCCcccccCCC Confidence 8889888888889999999999999999999977654 43221 00011100000 00000 000000101000101 Q ss_pred CCCC----------CCCCccccC Q lcl|NC_019418. 515 GNSK----------DTVDDEDEA 527 (527) Q Consensus 515 ~~~~----------~~~~~~~~~ 527 (527) +++. +..++.+++ T Consensus 517 dqp~~kGGe~dEns~~psE~kda 539 (945) T protein:vir:10 517 DQPSQQGGGVDENSSVPSEQKNA 539 (945) T ss_pred CCCCCCCCCCCCCCCCCCcccch Confidence 1111 111111112 No 164 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=98.16 E-value=3.3e-06 Score=50.61 Aligned_cols=397 Identities=13% Similarity=0.104 Sum_probs=155.1 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) |++|.| .+++++....+..+.+.+. . + ....+-.|... .+..... +..+..+--...+ T Consensus 1 m~~~~~-~~~~~~~~~~~~~~~~~~~-----~--~-----~~~~~~~~~~~-----~~~~v~~----~~a~~~~~v~~~i 58 (412) T protein:vir:26 1 MNVIAK-ENIVTRIKKKLIDNWIDQS-----T--S-----KLYDFSPWKNR-----SFWGVIN----NTLETNETIFSAI 58 (412) T ss_pred Cccchh-hhhhhhhhhhHhhhhhccc-----c--c-----ccccccccCCc-----cccccch----hhhhccHHHHHHH Confidence 999966 3343332211111111000 0 0 00000000000 0000100 1111112223334 Q ss_pred HHHhhhhhcccceEe----eCCHHHHHHHHHHHhhhh---HHHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEEEcCCc Q lcl|NC_019418. 81 KKIASLVYNEQAEIS----AEDETLNDFLSDMLSNDR---FNKNFERYLESALALGGLAMRPYVDG-DK-IRVAFIQAPV 151 (527) Q Consensus 81 ~~~A~ll~~e~~~i~----~~d~~~~~~l~~~l~~n~---f~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~v~a~~ 151 (527) +.+|+-+-.-|..+- ..+......|.. .-|. -..-.+..+...+..|.+++.+..+. |+ ..+..++|+. T Consensus 59 ~~ia~~iA~lp~~~~~~~~~~~~~~~~lL~~--~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~ 136 (412) T protein:vir:26 59 TKLSNSMASLPLKMYEDYKVVNTEVSDLLTV--SPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDV 136 (412) T ss_pred HHHHHhHhhCceeEeeccccccchHHHHHHh--hcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCce Confidence 444444444343321 122233333321 1111 22233556677788899988887664 33 3566677776 Q ss_pred eEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccc Q lcl|NC_019418. 152 FLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSEL 231 (527) Q Consensus 152 ~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~ 231 (527) +-+...+.++ .++|.. .. .. |..+.+ T Consensus 137 v~v~~~~~~~-----------------~~~y~~-~~-----------------~~----------------g~~~~~--- 162 (412) T protein:vir:26 137 VEMLIENQSR-----------------ELYYSI-HA-----------------AT----------------GNKLIV--- 162 (412) T ss_pred eEEEEeCCCc-----------------EEEEEE-Ec-----------------CC----------------ceEEEE--- Confidence 6654322111 112210 00 00 110000 Q ss_pred cCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCC Q lcl|NC_019418. 232 YPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQ 311 (527) Q Consensus 232 ~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~ 311 (527) . +. -..||+.+... +..+|+|.+.-+...++-...+ ..+ -+..++.. +.-++..... T Consensus 163 ~---~~----------evih~~~~~~~----~~~~G~s~i~~~~~~i~~~~a~-~~~--~~~~~~~~---~~~i~~~~~~ 219 (412) T protein:vir:26 163 H---NM----------DMLHFKHIVAS----NMVQGISPIDVLKNTTDFDNAV-RTF--NLTEMQKP---DSFMLKYGSN 219 (412) T ss_pred c---cc----------cEEEeCCCCCC----CCcccccHHHHHHHHHHHHHHH-HHH--HHHhcCCC---CceEEecCCC Confidence 0 00 12455543211 2345888887776665543322 222 12222211 1111111111 Q ss_pred CCCcc-cccccccccccceeeeccC--CCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccc-cchH Q lcl|NC_019418. 312 DNQGN-IAFKRRFDVEQNVYMQVGA--GNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQG-VKTA 387 (527) Q Consensus 312 ~~~~~-~~~~~~~d~~~~~~~~~~~--~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g-~~TA 387 (527) ..... -.....|. ..+..-+. --.++..++.++......++.+..+....+|+...|++|.-+|...++ -.++ T Consensus 220 l~~e~~~~~~~~~~---~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~ 296 (412) T protein:vir:26 220 VGKEKRQQVLEDFK---QYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKN 296 (412) T ss_pred CCHHHHHHHHHHHH---HHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccH Confidence 11000 00000110 00110000 001222355555555667888888888889999999999999865443 2233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccC-CcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCC Q lcl|NC_019418. 388 TEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYR-GTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGF 466 (527) Q Consensus 388 tei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~-~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi 466 (527) .+.. +..++.+|.-++..|....+. .+.. ........+.+++++-+..|..+.++...+++.+|+ T Consensus 297 e~~~-------------~~f~~~~l~P~~~~ie~~ln~-kLl~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~ 362 (412) T protein:vir:26 297 EELN-------------RFYLQHTLLPIVKQYEEEFNR-KLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGY 362 (412) T ss_pred HHHH-------------HHHHHHHHHHHHHHHHHHHHh-hcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCC Confidence 3321 111222333333333211110 1111 111122335555555566789999999999999999 Q ss_pred CCHHHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 467 ATQKRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 467 ~s~~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) |++-+++.+. |++.-+ ..+-+- +.+....+.+.+.+.. .....++++|| T Consensus 363 ~t~NE~R~~~-gl~p~~ggD~~~~-------~~n~~~~~~~~~~~~~----~~gG~~n~~e~ 412 (412) T protein:vir:26 363 YTINDIREWE-DLPPVEGGDKPLI-------SGDLYPIDTPLELRKS----LKGGDKNVNES 412 (412) T ss_pred cCHHHHHHHh-CCCCCCCcCeeee-------cccccccccchhhccc----ccCCCCCcCCC Confidence 9999977654 554311 100000 0000000011111111 01111122222 No 165 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=98.14 E-value=3.7e-06 Score=50.34 Aligned_cols=398 Identities=11% Similarity=0.086 Sum_probs=157.3 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhc--cCccccCHHHHHHHHHH--HHHhcCCCcccccc----cccCccc-cCcee Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILD--HPKVAVTQSEFRRIQHN--LAYYQSKFDDIEYT----NTDGDRK-RRKMQ 71 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~--~~~i~~~~~~~~~i~~~--~~~y~g~~~~l~~~----~~~~~~~-~~~~~ 71 (527) ||||+++++.=. +...... .+....+. .--.| +.|+...-+++..+ ...|... ...-+ T Consensus 1 Mgl~d~~r~~~~---------~~~~~~~~~~~~~~~~~----~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al 67 (431) T protein:vir:10 1 MGLFDFIRREKQ---------PEAQARPHVEPSFQAST----PTTSIPGETFEGLDDPRLKEYIRRGELNGGTGRETRAL 67 (431) T ss_pred CcchhhhhcCcc---------ccccccccccccccccc----ccccccccccccccchHHHHhhccCccCcceechhhhh Confidence 999988665211 0000000 01111110 00001 11211111111110 1111110 01111 Q ss_pred ecchHHHHHHHHhhhhhcccceE-eeCCH---HHHHHHHHHHhh--hh---HHHHHHHHHHHHHhcCCEEEEEEEeCCee Q lcl|NC_019418. 72 HLPIARTAAKKIASLVYNEQAEI-SAEDE---TLNDFLSDMLSN--DR---FNKNFERYLESALALGGLAMRPYVDGDKI 142 (527) Q Consensus 72 ~lnl~~~i~~~~A~ll~~e~~~i-~~~d~---~~~~~l~~~l~~--n~---f~~~~~~~~~~a~~~G~~~~~~~~d~~~~ 142 (527) ....-...++.+|+-+-+-|..+ ..++. .....+..+|.. |. -......++...+..|.+++.+..+++++ T Consensus 68 ~~~~V~~ci~~Ia~~iA~lp~~v~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~g~~ 147 (431) T protein:vir:10 68 RNMAVLRCVTLISGTIGMLPMNLISSDDSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSGNRP 147 (431) T ss_pred ccHHHHHHHHHHHHhhccCceEEEEecCceeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCce Confidence 11222344455555554444433 11110 111223333321 11 11223445667777899999988886553 Q ss_pred -EEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccc Q lcl|NC_019418. 143 -RVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQ 221 (527) Q Consensus 143 -~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~ 221 (527) .+-.++|..+-+...+ + +..+|.. .. . T Consensus 148 ~~L~pl~~~~v~~~~~~-~-----------------~~~~y~~-----------------------------~~-~---- 175 (431) T protein:vir:10 148 IRLIPMDRGSAKGRLTS-T-----------------WQIVYDY-----------------------------TT-P---- 175 (431) T ss_pred EEEEEEcCceeEEEEcC-C-----------------CeEEEEE-----------------------------Ee-C---- Confidence 3444555554443211 1 1112210 00 0 Q ss_pred cCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHH-HHcCcceee Q lcl|NC_019418. 222 LGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWE-IKMGQRRVI 300 (527) Q Consensus 222 lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e-~~~~~~~i~ 300 (527) -|..+.+ +. --+.||+.+..| ...|+|.+.-+...+.- ...-..+... |+.|.. T Consensus 176 ~g~~~~~-------~~---------~dViHir~~~~d-----g~~G~spi~~~~~~i~~-~~~~~~~~~~~f~ng~~--- 230 (431) T protein:vir:10 176 TGDKIEL-------PA---------REVFHLRDLSID-----GVSGVSRVKLSGNALEL-AEQAERAASRTFRTGVM--- 230 (431) T ss_pred CceEEEE-------ch---------hhEEEecCcCCC-----CcccccHHHHHHHHHHH-HHHHHHHHHHHHhccCC--- Confidence 0111000 00 012355543222 24688988877776653 3333444433 454333 Q ss_pred echhHhcCCCCCCCcccccccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcc Q lcl|NC_019418. 301 VPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSG 375 (527) Q Consensus 301 v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~ 375 (527) |..++.......... ..+....-+..|.+.+ .+ -++...++.++....+.++++..+....+|+...|++|. T Consensus 231 -p~gil~~~~~ls~e~--~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~ 307 (431) T protein:vir:10 231 -AGGAIEVPKELSDNA--YGRMKASVQENHTGSENAGSWMLLEEGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRP 307 (431) T ss_pred -ccEEEecCCCCCHHH--HHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHH Confidence 222222221111100 0000000011122110 00 012224555666666778888888888899999999999 Q ss_pred ccccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHH Q lcl|NC_019418. 376 MFTFDGQGV-KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAE 454 (527) Q Consensus 376 ~~~~~~~g~-~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~ 454 (527) .+|...++. .+..+. . ...++.+|..++..|-.-.+. .++.........+.++++.-+..|..+. T Consensus 308 ~lg~~~~~t~sn~eq~---~----------~~f~~~tL~P~~~~ie~~ln~-~Ll~~~~~~~~~~~fd~~~llr~d~~~r 373 (431) T protein:vir:10 308 LLMMDDTSWGSGIEQL---A----------IFFIQYGLSHWFVSWEQAAAR-AFLPEKMLGQRQFKFNEGALLRGTLNDQ 373 (431) T ss_pred HhCCCCCCccccHHHH---H----------HHHHHHHHHHHHHHHHHHHHh-hccChhhcCCceEEEechhhhccCHHHH Confidence 998765432 222222 1 111223333333332221110 1111111123345666666677888888 Q ss_pred HHHHHHHHhcCC----CCHHHHHHhcCCCC---HHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCC Q lcl|NC_019418. 455 LDYWMKMVAAGF----ATQKRGIAKTLGIT---EEEAEKELAEINGELPPESDAELALYGKGQQNTVGN 516 (527) Q Consensus 455 ~~~~~~~~~aGi----~s~~~~i~~~~~~~---deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 516 (527) ++...+++.+|+ |++-++++.. |+. ++...+-. .+..... .+++++.|... T Consensus 374 ~~~~~~~~~~G~~~g~lT~NE~R~~~-gl~p~~~~~gD~~~-------~p~n~~~---~~~~~~~p~~~ 431 (431) T protein:vir:10 374 AAFFSKALGAGGQSPWMKQNEVREML-DLPRADDPVADQLR-------NPMTQKQ---KGSGDEPPATT 431 (431) T ss_pred HHHHHHHHhcccccCccCHHHHHHHh-CCCCCCCcccccee-------ccccccc---CCCCCCCCCCC Confidence 888888887776 6777755432 332 21121100 0000000 00111111111 No 166 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=98.14 E-value=3.7e-06 Score=50.33 Aligned_cols=375 Identities=12% Similarity=0.076 Sum_probs=151.9 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) ||||++.. |.++ +. .....+ .-..|...+.+.... ....+ +..++..--..++ T Consensus 1 Mg~~~~~~-~~~~-------~~--------~~~~~~----~~~~~~~~~~~~~~~---~~v~~----~~al~~~~v~~~i 53 (385) T protein:vir:10 1 MGLLTPRN-FNKR-------KA--------KNMVYP----SNPAFFTTTVGGMQL---SYVSA----LSALQNTNVYSVI 53 (385) T ss_pred Cccccchh-cccc-------cc--------cccccc----cchhhhhhhccccCc---cccCH----HHhhccHHHHHHH Confidence 99998632 2111 11 000111 011122222221110 00111 1112223334566 Q ss_pred HHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcCCceEEEEEcCC Q lcl|NC_019418. 81 KKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQAPVFLPLQSNTQ 160 (527) Q Consensus 81 ~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a~~~~P~~~d~~ 160 (527) +.+|+-+.+-| +.+.+......|++--..-......+.++.+.+..|.+++.+..+ .. . .+|+.. . T Consensus 54 ~~ia~~ia~~p--~~v~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~--~~--~------~~p~~~--~ 119 (385) T protein:vir:10 54 NRIASDVASAH--FKTENTATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ--NL--E------HIPNSD--V 119 (385) T ss_pred HHHHHHHhhCc--eeeeccchhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC--ce--e------EeecCC--c Confidence 66777666555 445444444444321111112222333455566678888876533 22 2 233210 0 Q ss_pred ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCccccee Q lcl|NC_019418. 161 DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTP 240 (527) Q Consensus 161 ~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~ 240 (527) . + ... .+....+|...+ ..+...++ | + +.+ T Consensus 120 ~----v---~~~--~~~~~~~~~~~~----------------~~~~~~~~---~------------~--------~~e-- 149 (385) T protein:vir:10 120 Q----I---NYL--PGNMGIVYTVLE----------------SNDRPQMV---L------------R--------QDQ-- 149 (385) T ss_pred e----E---EEE--EcCCceEEEEEE----------------cCCceEEE---E------------c--------ccc-- Confidence 0 0 001 111222221100 00000000 0 0 000 Q ss_pred ecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcC-cceeeechhHhcCCCCCCCc--cc Q lcl|NC_019418. 241 IQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMG-QRRVIVPEQMTQLKVQDNQG--NI 317 (527) Q Consensus 241 ~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~-~~~i~v~~~~l~~~~~~~~~--~~ 317 (527) ..||+...++. .+...|+|.+..+...++.....-.-..+-|..| ++..++ ......... .- T Consensus 150 --------iihik~~~~~~--~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil-----~~~~~~~~~e~~~ 214 (385) T protein:vir:10 150 --------MLHFRLMPDPQ--YRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKL-----TISNYLSDGKDLE 214 (385) T ss_pred --------EEEeccCCCCc--ccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEE-----EeCCCCCCHHHHH Confidence 23454322211 1234699999888888866554444444445654 333333 211111000 00 Q ss_pred ccccccccccceeeeccCC----CCCCCcceEeccccChHHHH-HHHHHHHHHHHHhcCCCcccccccccccchHHHHHH Q lcl|NC_019418. 318 AFKRRFDVEQNVYMQVGAG----NMDSGGIVDLTTPIRSSDYI-SAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVS 392 (527) Q Consensus 318 ~~~~~~d~~~~~~~~~~~~----~~~~~~i~~~~~~ir~e~~~-~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s 392 (527) .....| +..+.+-+.+ -+++..++.++......+++ +..+...++|+...|++|..+|....+..+...+.. T Consensus 215 ~~~~~~---~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq 291 (385) T protein:vir:10 215 SAREEF---EKANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQ 291 (385) T ss_pred HHHHHH---HHHhCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHH Confidence 000111 1112111100 01222355566666666764 566777789999999999999864333222222211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHH Q lcl|NC_019418. 393 ENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRG 472 (527) Q Consensus 393 ~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~ 472 (527) .......+..-..+.++.+|.. .++. ..+.++++.-+..|..+.++...+++.+|+|++-++ T Consensus 292 ~~~~~~~~l~P~~~~ie~~l~~------------~l~~------~~~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~ 353 (385) T protein:vir:10 292 IKATYLANLNSYVNPIVDELRL------------KMNA------PDLELDIKDMLDVDDSALINQVSNLAKSGVLGAEQA 353 (385) T ss_pred HHHHHHHHHHHHHHHHHHHHHH------------hhCC------ceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHH Confidence 1111111222222222222221 1111 235666677777899999999999999999988886 Q ss_pred HHhc--CCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_019418. 473 IAKT--LGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDT 520 (527) Q Consensus 473 i~~~--~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (527) +... .|++++.. .+. ..+ ..+..-++++|| T Consensus 354 R~~~g~~p~p~~~~----~~~------------~~~--~~~~~~g~~~dn 385 (385) T protein:vir:10 354 QFILTRSGFLPDNL----PEF------------KPL--TTQVKGGDEGDN 385 (385) T ss_pred HHHhCCCccCCCCC----ccc------------cCc--ccccCCCCCCCC Confidence 6432 23322110 000 000 000111111111 No 167 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=98.13 E-value=3.9e-06 Score=50.22 Aligned_cols=371 Identities=11% Similarity=0.109 Sum_probs=153.4 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHH---HHHhcCCCcccccccccCccccCceeecchHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHN---LAYYQSKFDDIEYTNTDGDRKRRKMQHLPIAR 77 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~---~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~ 77 (527) ||+|+++++++.+ +.. +. ..+..+ ..++.|.+ ++ .+.-+....-. T Consensus 1 MGl~~~~~~~~~~-------~~~------~~--------~~~~~~~~~~~~~~~~~--vt---------~~~al~~~~v~ 48 (394) T protein:vir:62 1 MGLRDRFSNYLFK-------KAE------KR--------GYLDNVLGKSIRYSGVY--VT---------DSNILQSSDVY 48 (394) T ss_pred CchhhhhhhhccC-------CCC------ch--------hhhhhhhhcccccCccc--cC---------hhhhhccHHHH Confidence 9999998876532 000 00 001111 01111100 00 01111222233 Q ss_pred HHHHHHhhhhhcccceEeeCC-H-HHHHHHHHHHhhh----hHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcCCc Q lcl|NC_019418. 78 TAAKKIASLVYNEQAEISAED-E-TLNDFLSDMLSND----RFNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQAPV 151 (527) Q Consensus 78 ~i~~~~A~ll~~e~~~i~~~d-~-~~~~~l~~~l~~n----~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a~~ 151 (527) .+++.+|+-+-+-|..+.-.+ + .....+-.++.+. ....-....+...+..|.+++.+ +++.+.+ ++. T Consensus 49 ~~i~~Ia~~iA~lp~~v~~~~g~~~~~~~~~~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i--~~~~~~~----~~~ 122 (394) T protein:vir:62 49 ELLQDISNQMVLADIVVEDEFGNEIKDDIALQILRNPNNYLTQSEFIKLMTNTYLLEGETFPIL--NGAQIHL----ASN 122 (394) T ss_pred HHHHHHHHhhcccceEEEcCCCcccchhhHHHHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEE--ecceeec----ccc Confidence 444555555444443332111 1 1111222233221 11122334555666678777653 3332111 122 Q ss_pred eEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccc Q lcl|NC_019418. 152 FLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSEL 231 (527) Q Consensus 152 ~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~ 231 (527) +.|+. ......+|+ .++ ..++- T Consensus 123 ~~~~~------------------~~~~~~~~~--------------------~~~-----------------~~~~~--- 144 (394) T protein:vir:62 123 VFTEL------------------DDNLVEHFN--------------------IGG-----------------HEIPP--- 144 (394) T ss_pred ceEEE------------------CCceEEEEe--------------------eCC-----------------EEech--- Confidence 22211 011111110 000 00110 Q ss_pred cCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcC-cceeeechhHhcCCC Q lcl|NC_019418. 232 YPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMG-QRRVIVPEQMTQLKV 310 (527) Q Consensus 232 ~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~-~~~i~v~~~~l~~~~ 310 (527) . -+.|++.+..+ ..+|+|.+.-+...|......-....+-+..| .++.++ .... T Consensus 145 -----~----------eiih~r~~~~d-----~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il-----~~~~ 199 (394) T protein:vir:62 145 -----C----------MIRHVKNIGAD-----HLRGKGILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFLL-----NLDA 199 (394) T ss_pred -----h----------heEEecCcCCC-----CccccChHHHHHHHHHHHHHHHHHHHHHHHccCCcceEE-----EeCC Confidence 0 12455543222 23689998888777765555444444445653 332222 2221 Q ss_pred CCCCcc---cccccccccccceeeecc-C------CCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccc Q lcl|NC_019418. 311 QDNQGN---IAFKRRFDVEQNVYMQVG-A------GNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFD 380 (527) Q Consensus 311 ~~~~~~---~~~~~~~d~~~~~~~~~~-~------~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~ 380 (527) ...... -.....| ..-|.+.+ . ..+..-.+..++......++.+..+...++|+...|++|..+|.. T Consensus 200 ~~~~~~~~~~~~~~~~---~~~~~g~~n~g~~~vl~~g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~ 276 (394) T protein:vir:62 200 HINPQNGAQSKLINAI---LDQLESIDEARSVKMIPLGKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTEL 276 (394) T ss_pred CCCcCHHHHHHHHHHH---HHHhccccccCceeEeeCCCceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCC Confidence 111100 0000111 11122211 0 111111233455556677888888888899999999999998743 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHH Q lcl|NC_019418. 381 GQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMK 460 (527) Q Consensus 381 ~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~ 460 (527) ... ++.+. .+..++.+|..++..|...... .++.. .....+.|+|+..-..+.++.++...+ T Consensus 277 ~~s--n~e~~-------------~~~~~~~~l~P~~~~ie~~l~~-kll~~--~~~~~~~~~fd~~~~~~~~~~~~~~~~ 338 (394) T protein:vir:62 277 IKE--DIEKA-------------MMYIHNKAVRPIMKNFEDHLSL-LFYAQ--NSGKRIKFKINILDFVTYSNKTNIGYN 338 (394) T ss_pred CCc--CHHHH-------------HHHHHHHHHHHHHHHHHHHHhh-hhcCc--cccCceEEEechhhhcCHHHHHHHHHH Confidence 221 12211 1222334444444444322211 12211 123457788988777888888888889 Q ss_pred HHhcCCCCHHHHHHhcCCCC---HHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 461 MVAAGFATQKRGIAKTLGIT---EEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 461 ~~~aGi~s~~~~i~~~~~~~---deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) ++.+|+|++-+++... |+. +++....... .... .+. .....++...+|+++|. T Consensus 339 ~~~~g~~T~NE~R~~~-gl~p~~~~~gd~~~~~--~n~~-------~~~----~~~~~~~~~kgge~~en 394 (394) T protein:vir:62 339 LVRTAITSPDNVADML-GFPKQNTKESQAIYIS--NDVT-------EIG----KKEATDGSLGGGEENEN 394 (394) T ss_pred HHhCCCcCHHHHHHHh-CCCCCCCCCCCeeecc--cccc-------ccc----ccccccccCCCCCCCCC Confidence 9999999999976543 443 2222111100 0000 000 00001112223333333 No 168 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=98.05 E-value=6.1e-06 Score=49.19 Aligned_cols=452 Identities=14% Similarity=0.151 Sum_probs=204.7 Q ss_pred HHHHhhcccchhhhc---cCccccCHHHHHHHHHHHHHhcCCCcccccccccCcccc-----Cceeec---chHHHHHHH Q lcl|NC_019418. 14 GRYNMTTSHLSSILD---HPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKR-----RKMQHL---PIARTAAKK 82 (527) Q Consensus 14 ~~~~~~~~~~~~~~~---~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~-----~~~~~l---nl~~~i~~~ 82 (527) |-+.+|+=++.+.-. .+.+..+...-..+..-..-|-| .+....|..+. +.+.+| +--...++. T Consensus 1 ~~~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~g-----~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~e 75 (537) T protein:vir:10 1 MAQQLFGFSLQRAKKVPKGPSFVQKDSLDGSQPIVGGGYFG-----YSVDFDGTIRNDHELITRYREMVLNPECDSAVDD 75 (537) T ss_pred CccccccceeecccccccCCcccCCCcccccceeecccccc-----cccccccccchHHHHHHHHHHHhhccchhhHHHH Confidence 555666555444322 22233333322222211111111 11111221111 111111 111122222 Q ss_pred Hhh-hhhc----ccceEeeCCH--------HHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCe-----eEE Q lcl|NC_019418. 83 IAS-LVYN----EQAEISAEDE--------TLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDK-----IRV 144 (527) Q Consensus 83 ~A~-ll~~----e~~~i~~~d~--------~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~-----~~i 144 (527) ..+ -+.. .|+++.+++- ...+.++.++.--+|.+...+.+....+-|..+|+.++|..+ ..+ T Consensus 76 IVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fhKiid~k~pk~GI~EL 155 (537) T protein:vir:10 76 VVNETICGNFDDVPISIDLHNLKQSEKIKKLIRSEFDEILRLLDFDNRAYEIFRRWYVDGRLFFHKVIDPKKPRQGLVEL 155 (537) T ss_pred hhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccccceee Confidence 222 1111 2445655541 244556677776789999999999999999999999998543 468 Q ss_pred EEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCc Q lcl|NC_019418. 145 AFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGE 224 (527) Q Consensus 145 ~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~ 224 (527) .+++|.++-++..--....... ... .....++. . ...|.+-|--+...+ ...|- T Consensus 156 r~lDPr~i~~vR~i~~~~~~~~---~~~---~~~~~v~~--~-----------------~~eyf~ynp~g~~~~-~~~~v 209 (537) T protein:vir:10 156 RYVDPRKIRKVTEYEAKRPEAL---RTQ---DLNQQLTQ--Q-----------------SASYFLYNPKGLKNS-TNQGM 209 (537) T ss_pred eeeCCccceeeEeecccCCccc---eEE---ecceeeee--c-----------------ccceeeecccccccc-CCCce Confidence 8899988876532100000000 000 00000000 0 001111000000000 11122 Q ss_pred eeecccccCCcccceee--cCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHH--HHHHcCcceee Q lcl|NC_019418. 225 RVNLSELYPDLQPVTPI--QGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFM--WEIKMGQRRVI 300 (527) Q Consensus 225 ~v~l~~~~~~l~~~~~~--~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~--~e~~~~~~~i~ 300 (527) .+|-+. +++ +|+- +...+..+|-+..|.-.+..|=..-+.++ +-.|+-.+||| T Consensus 210 kI~~dA--------I~y~hSGl~---------------d~n~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvF 266 (537) T protein:vir:10 210 KIAPDS--------IAYCHSGIQ---------------DLNKNMVLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIF 266 (537) T ss_pred eccHhh--------eeeecccce---------------eCCCCeeeeeehhhhHHHHhhHHHHhhHHHHhhhccccceEE Confidence 222211 111 1211 11234456777777777766655555544 44456667776 Q ss_pred ec----------hhHh---------cCCCCCCCcccccccccccc-cceeeeccCCCCC-CCcceEeccccChHHHHHHH Q lcl|NC_019418. 301 VP----------EQMT---------QLKVQDNQGNIAFKRRFDVE-QNVYMQVGAGNMD-SGGIVDLTTPIRSSDYISAI 359 (527) Q Consensus 301 v~----------~~~l---------~~~~~~~~~~~~~~~~~d~~-~~~~~~~~~~~~~-~~~i~~~~~~ir~e~~~~~~ 359 (527) -- +.++ +..-|..+|++.-.+.+-.- ...+.+- -+|+ ..-|+++..-=...+ ...+ T Consensus 267 YIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLPR--ReGgrgTEItTLpGgqnlge-m~DV 343 (537) T protein:vir:10 267 YIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPR--REGGRGTEISTLPGGQNLGE-LEDV 343 (537) T ss_pred EEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhhhhhcccc--cCCCcccceeeccccCCcCh-HHHH Confidence 41 0111 00113444444333222110 0001110 1122 123555554333333 4556 Q ss_pred HHHHHHHHHhcCCCcccccccccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCc-- Q lcl|NC_019418. 360 SEGLKLFEMQIGVSSGMFTFDGQG-VKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPEL-- 436 (527) Q Consensus 360 ~~~l~~i~~~~g~s~~~~~~~~~g-~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~-- 436 (527) ..+.+.+....+++..-++.+++. ..-++||.-..-.....+.+.+..|..-+.++++.=|.|-.. +...-+.. T Consensus 344 ~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgi---it~eeW~~i~ 420 (537) T protein:vir:10 344 KYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFVDLLKTQLILKGI---CSIEEWEEMK 420 (537) T ss_pred HHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccC---CCHHHHHHHh Confidence 666777777778877766655431 234567777777777788888888888888888876655332 21111111 Q ss_pred cceEEEeCCCccCCHHHHHHHHHH---HHh--c---C-CCCHHHHHHhcCCCCHHHHHHHHHHHHHhcc------ccccc Q lcl|NC_019418. 437 DDISVNLDDGVFTDRHAELDYWMK---MVA--A---G-FATQKRGIAKTLGITEEEAEKELAEINGELP------PESDA 501 (527) Q Consensus 437 ~~v~v~f~d~i~~d~~~~~~~~~~---~~~--a---G-i~s~~~~i~~~~~~~deea~~el~ri~~E~~------~~~~~ 501 (527) ..|.++|...=-..+..+++.... +.+ . | ..|.+++.++....||+|.+++..+|++|.. |++.. T Consensus 421 ~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~tDeeI~~~~k~I~~E~k~~~~~~p~~~~ 500 (537) T protein:vir:10 421 EHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVLKQTESEIKEIDKEIKQEIADGVIMDPQAMQ 500 (537) T ss_pred hcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCccccc Confidence 346677754333333333332221 111 1 2 3588887788889999999999999998874 22222 Q ss_pred ccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 502 ELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 502 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) .+.++ .++..+......+. ..|.+ T Consensus 501 ~~~~~-~~~~~~~~~~~~~~-~~~~~ 524 (537) T protein:vir:10 501 AMEMG-IGDEEPVPEGGEEP-QTDPN 524 (537) T ss_pred ccccC-CCCcccCCCCCCCc-ccCCc Confidence 22222 12222111111111 11111 No 169 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=98.03 E-value=6.6e-06 Score=48.98 Aligned_cols=379 Identities=13% Similarity=0.114 Sum_probs=161.9 Q ss_pred HHHhcCCCcccccccccCccc------------cC----------------------------ceeecchHH--HHHHHH Q lcl|NC_019418. 46 LAYYQSKFDDIEYTNTDGDRK------------RR----------------------------KMQHLPIAR--TAAKKI 83 (527) Q Consensus 46 ~~~y~g~~~~l~~~~~~~~~~------------~~----------------------------~~~~lnl~~--~i~~~~ 83 (527) ++||.-+...+.+......++ +| ....|..|. ..++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~acv~~I 80 (441) T protein:vir:98 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMI 80 (441) T ss_pred CceecCccceeccccccchhhhhhccccccccccccccCCCcchHHHHHHhhcccccCccccchhhhhccHHHHHHHHHH Confidence 555554443333222111100 00 000112221 234555 Q ss_pred hhhhhcccceEeeCCH-HHHHHHHHHHh--hhh---HHHHHHHHHHHHHhcCCEEEEEEEeCC-e-eEEEEEcCCceEEE Q lcl|NC_019418. 84 ASLVYNEQAEISAEDE-TLNDFLSDMLS--NDR---FNKNFERYLESALALGGLAMRPYVDGD-K-IRVAFIQAPVFLPL 155 (527) Q Consensus 84 A~ll~~e~~~i~~~d~-~~~~~l~~~l~--~n~---f~~~~~~~~~~a~~~G~~~~~~~~d~~-~-~~i~~v~a~~~~P~ 155 (527) |+-+-+-|..+.-++. .....+-.+|. -|. -...+..++...+..|.+++.+..+.. + +.+..++|+.+-+. T Consensus 81 a~~iA~lpl~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~ 160 (441) T protein:vir:98 81 ASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELK 160 (441) T ss_pred HHhhccCceEEecCCcccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCceeEEE Confidence 5555444433322211 11122222332 111 112334556667778999988877653 3 45777888877764 Q ss_pred EEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCc Q lcl|NC_019418. 156 QSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDL 235 (527) Q Consensus 156 ~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l 235 (527) . +.++. .+|.. ... ......+. ..|. T Consensus 161 ~-~~~g~-----------------~~~~~-~~~--------------~~~~~~~~-~~~~-------------------- 186 (441) T protein:vir:98 161 L-DARGR-----------------LYYFH-QRI--------------DSNGNNIE-RNVK-------------------- 186 (441) T ss_pred E-CCCCc-----------------EEEEE-EEe--------------ccCcceee-EEEc-------------------- Confidence 3 33332 12210 000 00000000 0010 Q ss_pred ccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHH-HHcCc-ceeeechhHhcCCCCCC Q lcl|NC_019418. 236 QPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWE-IKMGQ-RRVIVPEQMTQLKVQDN 313 (527) Q Consensus 236 ~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e-~~~~~-~~i~v~~~~l~~~~~~~ 313 (527) +. -+.||+.+.. ....|+|.+.-+...++... ...++... |+.|. ++.++ ....... T Consensus 187 ~~----------dviHir~~~~-----dg~~G~spi~~~~~~i~~~~-a~~~~~~~~f~ng~~~~gil-----~~~~~~~ 245 (441) T protein:vir:98 187 FE----------DMLDIKFYSL-----DGINGLSLLDTLSRTIESDN-NGKDFLNNFLRNGTHAGGIL-----KMKGVLD 245 (441) T ss_pred cc----------cEEEeccCCC-----CCccccCHHHHHHHHHHHHH-HHHHHHHHHHhccCCCcEEE-----EeCCCCC Confidence 00 0234543211 12358888887777765433 33334333 45543 22222 2221111 Q ss_pred --Ccccccccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccch Q lcl|NC_019418. 314 --QGNIAFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKT 386 (527) Q Consensus 314 --~~~~~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~T 386 (527) .........|. ..|.+.+ .+ -.++..++.++....+.++.+.......+|+...|++|..+|.+.++. + T Consensus 246 ~~e~~~~~~~~~~---~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~-s 321 (441) T protein:vir:98 246 NKKARDRAREEFH---KSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANM-S 321 (441) T ss_pred CHHHHHHHHHHHH---HHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCc-c Confidence 10000111111 1122211 00 012234666666677778888888888999999999999998654432 2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCC Q lcl|NC_019418. 387 ATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGF 466 (527) Q Consensus 387 Atei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi 466 (527) .++....+ ..+..-+...++.+|... ++.. .....+.++.+.-+..|..+.++...+++.+|+ T Consensus 322 ~~q~~~~y---~~tl~P~~~~ie~~ln~~------------L~~~--~~~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~ 384 (441) T protein:vir:98 322 ITDANLDY---LSTLKPYITCVCAELNFK------------FNDE--YVNREFKFDTTEIRVVDEKTQAEIDKINIDSGK 384 (441) T ss_pred HHHHHHHH---HHHHHHHHHHHHHHHHhh------------cccc--ccCceEEEechhhhccCHHHHHHHHHHHHhCCC Confidence 22221111 122222222222222211 1111 123445566666677888999999999999999 Q ss_pred CCHHHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCCC--CCCCCCCCCcccc Q lcl|NC_019418. 467 ATQKRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQNT--VGNSKDTVDDEDE 526 (527) Q Consensus 467 ~s~~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 526 (527) |++-+++... |+..-+ -.+.+-.+.....+.+. -++.+.+. .++....+|++.| T Consensus 385 ~T~NE~R~~~-gl~pi~gGd~~~~~~~~n~~~~~~-----~~~~q~~~~~~~~~~~kgGe~ne 441 (441) T protein:vir:98 385 MNIDEIRQRD-GLAPIPGGNGSIHRVDLNHVNIEL-----VDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred cCHHHHHHHh-CCCCCCCCCcceEeeccccccccc-----ccccccccccccccccCCCCCCC Confidence 9999976543 543210 00000000000000000 00111111 1112234555555 No 170 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=98.03 E-value=6.6e-06 Score=48.98 Aligned_cols=378 Identities=11% Similarity=0.093 Sum_probs=156.4 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHH--HHHHHHHHHhcCCCcccccccccCccccCceeecchHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEF--RRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIART 78 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~--~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~ 78 (527) |+||+++++-- +. +..+...... .....+..++.+.... .+.. +.-+..+--.. T Consensus 3 m~~f~~~~~~~---------~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~v~~----~~al~~~~v~~ 58 (392) T protein:vir:39 3 LPILNFINQTN---------DP-------PEVGSVQSYFPDGNDAQIMESLLGDNNE----WVSA----RAALRNSDLFS 58 (392) T ss_pred chhhhhhhccc---------cc-------ccccccccccccCchhhhhhhhcCCCCc----eech----HHhhccHHHHH Confidence 77776543211 11 0111000000 0011111122221100 0000 00011122234 Q ss_pred HHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEEEcCCceEEEE Q lcl|NC_019418. 79 AAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DK-IRVAFIQAPVFLPLQ 156 (527) Q Consensus 79 i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~v~a~~~~P~~ 156 (527) .++.+|+-+-+=| +.+.+......+.+--..-....-+..++.+.+..|.+++.+..+. |+ +.+.+++|+.+-+.. T Consensus 59 ~i~~ia~~ia~lp--~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~ 136 (392) T protein:vir:39 59 IILQLSSDLAIVK--INAEKKKNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYY 136 (392) T ss_pred HHHHHHHhhccCc--eeeccchhhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEE Confidence 5555666554433 4444444333332211111123334456667788899998887764 34 356667777665432 Q ss_pred EcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcc Q lcl|NC_019418. 157 SNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQ 236 (527) Q Consensus 157 ~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~ 236 (527) ...++ ..+|.. .. .++. .+..+.+ + + T Consensus 137 ~~~~~-----------------~~~y~~-~~---------------~~~~---------------~~~~~~~---~---~ 162 (392) T protein:vir:39 137 FEYEN-----------------GMYYNI-TF---------------DDPK---------------IEPILQA---P---Q 162 (392) T ss_pred cCCCc-----------------eEEEEE-Ee---------------cCcc---------------cceeEEE---c---c Confidence 21111 122211 00 0000 0000000 0 0 Q ss_pred cceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcC-cceeeechhHhcCCCCCCCc Q lcl|NC_019418. 237 PVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMG-QRRVIVPEQMTQLKVQDNQG 315 (527) Q Consensus 237 ~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~-~~~i~v~~~~l~~~~~~~~~ 315 (527) . -+.|++.+..+ +..+|+|-+.-+...++....+-....+-|+.| .+.-++ ....+.... T Consensus 163 ~----------eiih~~~~~~~----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil-----~~~~~~~~~ 223 (392) T protein:vir:39 163 S----------DLIHMKLLSID----GGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVL-----TVKGGGLLS 223 (392) T ss_pred c----------cEEEecCCCCC----CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEE-----EeCCCCCch Confidence 0 13455543222 334699999888888855444433333345553 333222 211111000 Q ss_pred ccccccccccccceeeec-cCCC----CCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHH Q lcl|NC_019418. 316 NIAFKRRFDVEQNVYMQV-GAGN----MDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEI 390 (527) Q Consensus 316 ~~~~~~~~d~~~~~~~~~-~~~~----~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei 390 (527) + .....+ ..-|.+. +.+. .++..++.++....+.++.+..+...++|+...|++|..+|+..+...+..+. T Consensus 224 ~-~~~~~~---~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~~~ 299 (392) T protein:vir:39 224 D-KDKASR---SRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQI 299 (392) T ss_pred H-HHHHHH---HHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHH Confidence 0 000000 0111111 1100 12224555555556678888888888999999999999998654332211111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHH Q lcl|NC_019418. 391 VSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQK 470 (527) Q Consensus 391 ~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~ 470 (527) . ..++.+|..+++.|..-.+. .+.. .+.++...-+-.|..+.+..+.+++.+|++++- T Consensus 300 ~--------------~f~~~~l~P~~~~ie~~l~~-~L~~-------~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~n 357 (392) T protein:vir:39 300 S--------------GMYASALNRYLRPAISELEY-KLSD-------HISVNMRPAIDPLGDNYLSTISTATRWGALAEN 357 (392) T ss_pred H--------------HHHHHHHHHHHHHHHHHHHH-hccc-------cccccchhhhccCHHHHHHHHHHHHhCCCcCHH Confidence 1 12233333333333221110 0110 111222222334567777888899999999998 Q ss_pred HHHHhc--CCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCC Q lcl|NC_019418. 471 RGIAKT--LGITEEEAEKELAEINGELPPESDAELALYGKGQQNTV 514 (527) Q Consensus 471 ~~i~~~--~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~ 514 (527) +++..+ .|+...|+.+. |..+ +.-++++.+..| T Consensus 358 E~r~~l~~~g~~p~e~r~~------e~l~-----~~~~Gd~~~p~p 392 (392) T protein:vir:39 358 QATFVLQEAGYIPKDLPAP------ENTN-----KKTTGQSNEPVP 392 (392) T ss_pred HHHHHHHhcCCCccccchh------cCCC-----CCCCCCCCCCCC Confidence 876432 46655444321 1111 111122222111 No 171 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=98.03 E-value=6.6e-06 Score=48.98 Aligned_cols=378 Identities=11% Similarity=0.093 Sum_probs=156.4 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHH--HHHHHHHHHhcCCCcccccccccCccccCceeecchHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEF--RRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIART 78 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~--~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~ 78 (527) |+||+++++-- +. +..+...... .....+..++.+.... .+.. +.-+..+--.. T Consensus 3 m~~f~~~~~~~---------~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~v~~----~~al~~~~v~~ 58 (392) T protein:vir:10 3 LPILNFINQTN---------DP-------PEVGSVQSYFPDGNDAQIMESLLGDNNE----WVSA----RAALRNSDLFS 58 (392) T ss_pred chhhhhhhccc---------cc-------ccccccccccccCchhhhhhhhcCCCCc----eech----HHhhccHHHHH Confidence 77776543211 11 0111000000 0011111122221100 0000 00011122234 Q ss_pred HHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEEEcCCceEEEE Q lcl|NC_019418. 79 AAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DK-IRVAFIQAPVFLPLQ 156 (527) Q Consensus 79 i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~v~a~~~~P~~ 156 (527) .++.+|+-+-+=| +.+.+......+.+--..-....-+..++.+.+..|.+++.+..+. |+ +.+.+++|+.+-+.. T Consensus 59 ~i~~ia~~ia~lp--~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~ 136 (392) T protein:vir:10 59 IILQLSSDLAIVK--INAEKKKNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYY 136 (392) T ss_pred HHHHHHHhhccCc--eeeccchhhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEE Confidence 5555666554433 4444444333332211111123334456667788899998887764 34 356667777665432 Q ss_pred EcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcc Q lcl|NC_019418. 157 SNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQ 236 (527) Q Consensus 157 ~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~ 236 (527) ...++ ..+|.. .. .++. .+..+.+ + + T Consensus 137 ~~~~~-----------------~~~y~~-~~---------------~~~~---------------~~~~~~~---~---~ 162 (392) T protein:vir:10 137 FEYEN-----------------GMYYNI-TF---------------DDPK---------------IEPILQA---P---Q 162 (392) T ss_pred cCCCc-----------------eEEEEE-Ee---------------cCcc---------------cceeEEE---c---c Confidence 21111 122211 00 0000 0000000 0 0 Q ss_pred cceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcC-cceeeechhHhcCCCCCCCc Q lcl|NC_019418. 237 PVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMG-QRRVIVPEQMTQLKVQDNQG 315 (527) Q Consensus 237 ~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~-~~~i~v~~~~l~~~~~~~~~ 315 (527) . -+.|++.+..+ +..+|+|-+.-+...++....+-....+-|+.| .+.-++ ....+.... T Consensus 163 ~----------eiih~~~~~~~----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil-----~~~~~~~~~ 223 (392) T protein:vir:10 163 S----------DLIHMKLLSID----GGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVL-----TVKGGGLLS 223 (392) T ss_pred c----------cEEEecCCCCC----CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEE-----EeCCCCCch Confidence 0 13455543222 334699999888888855444433333345553 333222 211111000 Q ss_pred ccccccccccccceeeec-cCCC----CCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHH Q lcl|NC_019418. 316 NIAFKRRFDVEQNVYMQV-GAGN----MDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEI 390 (527) Q Consensus 316 ~~~~~~~~d~~~~~~~~~-~~~~----~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei 390 (527) + .....+ ..-|.+. +.+. .++..++.++....+.++.+..+...++|+...|++|..+|+..+...+..+. T Consensus 224 ~-~~~~~~---~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~~~ 299 (392) T protein:vir:10 224 D-KDKASR---SRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQI 299 (392) T ss_pred H-HHHHHH---HHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHH Confidence 0 000000 0111111 1100 12224555555556678888888888999999999999998654332211111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHH Q lcl|NC_019418. 391 VSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQK 470 (527) Q Consensus 391 ~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~ 470 (527) . ..++.+|..+++.|..-.+. .+.. .+.++...-+-.|..+.+..+.+++.+|++++- T Consensus 300 ~--------------~f~~~~l~P~~~~ie~~l~~-~L~~-------~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~n 357 (392) T protein:vir:10 300 S--------------GMYASALNRYLRPAISELEY-KLSD-------HISVNMRPAIDPLGDNYLSTISTATRWGALAEN 357 (392) T ss_pred H--------------HHHHHHHHHHHHHHHHHHHH-hccc-------cccccchhhhccCHHHHHHHHHHHHhCCCcCHH Confidence 1 12233333333333221110 0110 111222222334567777888899999999998 Q ss_pred HHHHhc--CCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCC Q lcl|NC_019418. 471 RGIAKT--LGITEEEAEKELAEINGELPPESDAELALYGKGQQNTV 514 (527) Q Consensus 471 ~~i~~~--~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~ 514 (527) +++..+ .|+...|+.+. |..+ +.-++++.+..| T Consensus 358 E~r~~l~~~g~~p~e~r~~------e~l~-----~~~~Gd~~~p~p 392 (392) T protein:vir:10 358 QATFVLQEAGYIPKDLPAP------ENTN-----KKTTGQSNEPVP 392 (392) T ss_pred HHHHHHHhcCCCccccchh------cCCC-----CCCCCCCCCCCC Confidence 876432 46655444321 1111 111122222111 No 172 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=98.01 E-value=7.2e-06 Score=48.79 Aligned_cols=430 Identities=13% Similarity=0.079 Sum_probs=180.5 Q ss_pred CChH-----HHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecch Q lcl|NC_019418. 1 MSLI-----QKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPI 75 (527) Q Consensus 1 m~~~-----~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl 75 (527) |-|. ++++..+.++ ..+.......|+.++.---|-+-.....+... .+.==.. T Consensus 1 ~~~~~~~e~~~l~~r~~~L--------------------k~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~--~~~~dst 58 (517) T protein:vir:10 1 MDMRFAGNKSKIPKLYEQL--------------------VGKRSPFLSRAENYSRFTLPYLMADVNDDLSS--QNAWQDD 58 (517) T ss_pred CcccccccHHHHHHHHHHH--------------------HHhhhHHHHHHHHHHHHhccccccCCCCCccc--cccccch Confidence 6554 3333333321 12233445667776654333221111111111 1111245 Q ss_pred HHHHHHHHhhhhhcc--cc-----eEeeCCHH-------------HHH-------HHHHHHhhhhHHHHHHHHHHHHHhc Q lcl|NC_019418. 76 ARTAAKKIASLVYNE--QA-----EISAEDET-------------LND-------FLSDMLSNDRFNKNFERYLESALAL 128 (527) Q Consensus 76 ~~~i~~~~A~ll~~e--~~-----~i~~~d~~-------------~~~-------~l~~~l~~n~f~~~~~~~~~~a~~~ 128 (527) +...++.+|+-|.+- || ++.+++.. ..+ .+...+..++|...+.++..+.... T Consensus 59 g~~a~~~LAa~l~~~ltpp~~~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~ 138 (517) T protein:vir:10 59 GASATNFLSNKLSQVLFPAQRSFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVT 138 (517) T ss_pred HHHHHHHHHHHHHHhhcCCCCccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhH Confidence 667777777755443 11 23333321 222 3345677889999999999999999 Q ss_pred CCEEEEEEEeCCeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeC---------------------CCcceEEEEEEE Q lcl|NC_019418. 129 GGLAMRPYVDGDKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTE---------------------NRKNVYYTLVEF 187 (527) Q Consensus 129 G~~~~~~~~d~~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~---------------------~~~~~~yt~lE~ 187 (527) |.+.+ |.+++...+..++-.+++ +..|..|++..++....+... +....+||.+++ T Consensus 139 G~a~l--y~~~~~~~~~~~pl~~y~-v~~d~~G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~ 215 (517) T protein:vir:10 139 GNVMM--YHPDKTSPIQAVPLHHYC-VRRDNNGTVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHAKR 215 (517) T ss_pred CeEEE--EEeCCCCcEEEEEcCeEE-EeeCCCcCeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEEEEE Confidence 98764 567666667777777766 456777766666543221100 011122222222 Q ss_pred EeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCC--CcccEEEecCCccccccCCCc Q lcl|NC_019418. 188 HEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGL--SRPLFTYLKTPGMNNKDINSP 265 (527) Q Consensus 188 h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~--~~p~f~~~~~~~~N~~~~~sp 265 (527) . .++.+. .|...++..+|. ..+. ..-+|..++- +...++. T Consensus 216 ~--------------~~~~~~----~~~~~d~~~~~~----------------~s~y~~~e~P~~~~Rw----~~~~ge~ 257 (517) T protein:vir:10 216 T--------------KDGKYL----IRQSADDVPVGK----------------ESTVTEDKSPFLILTW----KRSYGED 257 (517) T ss_pred e--------------CCCceE----EEEEeCceeecc----------------ccccccccCCeeeeee----eecCCCC Confidence 1 011111 122111111110 1111 1112222221 2233678 Q ss_pred cCcchhhhhHHHHHHHHHHHHHHH-HHHHcCcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcce Q lcl|NC_019418. 266 LGLSIFDNAKTTIDFINRTYDEFM-WEIKMGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIV 344 (527) Q Consensus 266 lG~S~~~~~~~lid~ld~~~s~~~-~e~~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~ 344 (527) ||+|--..+.+-++.|+..--... ......+....||++......+...+ ....+.+ +....+. T Consensus 258 YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~~~~~~~~~l~~~----------~~g~~~~-----g~~~~v~ 322 (517) T protein:vir:10 258 YGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYLVKPGSYTDINQFVEG----------GSGAVLH-----GVEGDIH 322 (517) T ss_pred cccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhccCC----------Ccccccc-----CCcccce Confidence 999999999999999997644443 34556677777766544221111110 0000111 1112233 Q ss_pred Eeccc--cChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH Q lcl|NC_019418. 345 DLTTP--IRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCE 421 (527) Q Consensus 345 ~~~~~--ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~ 421 (527) .++.. .....-.+.++.+-..|....=+.. +....+...|||||....+...+..+-.-..+ ...|.-|++.++. T Consensus 323 ~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~--l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~ 400 (517) T protein:vir:10 323 IVQLGKYADYTPIQAVLNDYRQRIGRVFMMEA--MTRRDAERVTAYEIQRDAMLVEQSLGGVYSLFATTFQGPLARWFMN 400 (517) T ss_pred eeecccccchhHHHHHHHHHHHHHHHHHhhhh--hhccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHH Confidence 22211 1122223334444333322211111 12222334799999988888887766543333 2333445555543 Q ss_pred HhhhhcccCCcccCccceEEEeCCCccC-CHHHHHHHHHHH---Hh--cCC-------CCHHHHH---HhcCCCC----- Q lcl|NC_019418. 422 LGKVVGIYRGTIPELDDISVNLDDGVFT-DRHAELDYWMKM---VA--AGF-------ATQKRGI---AKTLGIT----- 480 (527) Q Consensus 422 ~~~~~~~~~~~~~~~~~v~v~f~d~i~~-d~~~~~~~~~~~---~~--aGi-------~s~~~~i---~~~~~~~----- 480 (527) ... .. .+.+. +.++.--++.. .+...++...+. ++ +++ +....++ ....|++ T Consensus 401 ~l~--~~---l~~~~--v~~~~~s~la~l~r~~~~~~i~~~~~~i~~~a~~~~~~~~~id~d~~~~~~a~~~Gvp~~~ir 473 (517) T protein:vir:10 401 GIS--SI---LTSKN--VSPTILTGIEALGRMAELDKLGTFNGYVSMTAQWPEPLQQAIKWPDFTDWVQGQISANFPFFK 473 (517) T ss_pred Hhh--hh---cCCCC--ccceeeccHHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHHhCCChhhcC Confidence 321 11 11111 22222111110 001111111111 11 110 1122222 2234544 Q ss_pred -HHHHHHHHHHHHHhccccc------ccccCCCCCCCCCCCCCC Q lcl|NC_019418. 481 -EEEAEKELAEINGELPPES------DAELALYGKGQQNTVGNS 517 (527) Q Consensus 481 -deea~~el~ri~~E~~~~~------~~~~~~~~~~~~~~~~~~ 517 (527) ++|++++.++.+.+++... ...+..-.+++.++.+-. T Consensus 474 s~~ev~~~~~~~~~~~~~~~~~~~ag~~~~~~~~~~~~~~~~~~ 517 (517) T protein:vir:10 474 TQDELNAEAQAQQEQEATKYAAEQAGKAIPDMVKNGQINPQGGQ 517 (517) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCCC Confidence 4565544433322221110 011111123333332211 No 173 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=97.98 E-value=8.3e-06 Score=48.45 Aligned_cols=463 Identities=10% Similarity=0.084 Sum_probs=181.3 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) |.= ++ ..+-. ..+++... .+..++......|+.++.---|-+-.....+......++-=+.+...| T Consensus 1 m~~-~~-~~~~~--------~~~~~r~~----~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~ 66 (536) T protein:vir:21 1 MAE-KR-TGLAE--------DGAKSVYE----RLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGL 66 (536) T ss_pred Ccc-hh-hchhH--------HHHHHHHH----HHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHH Confidence 111 00 00000 01111100 112233344566776665433322111111111122223335677777 Q ss_pred HHHhhhhhcc--c--ceE--eeCCH-------------HH-------HHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEE Q lcl|NC_019418. 81 KKIASLVYNE--Q--AEI--SAEDE-------------TL-------NDFLSDMLSNDRFNKNFERYLESALALGGLAMR 134 (527) Q Consensus 81 ~~~A~ll~~e--~--~~i--~~~d~-------------~~-------~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~ 134 (527) +.+|+-|.+- | +=| .+.+. .. .+.+...+..++|...+.++..+....|.+.+. T Consensus 67 ~~Laa~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly 146 (536) T protein:vir:21 67 NNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLY 146 (536) T ss_pred HHHHHHHHHhhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEE Confidence 7777754442 2 212 22221 12 234556788899999999999999999988875 Q ss_pred EEEeCC-ee-EEEEEcCCceEEEEEcCCceEEEEEEEEEEee-------------CCCcceEEEEEEEEeecccccccce Q lcl|NC_019418. 135 PYVDGD-KI-RVAFIQAPVFLPLQSNTQDVSSAAILTKTIKT-------------ENRKNVYYTLVEFHEWVTPTGQEVG 199 (527) Q Consensus 135 ~~~d~~-~~-~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~-------------~~~~~~~yt~lE~h~~~~~~~~~~~ 199 (527) +--+.+ ++ .+..++-.+++ +..|..|++..+|....+.. ...+..++..+++.+... T Consensus 147 ~~e~~~~~~~~f~~~pl~~~~-v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~------- 218 (536) T protein:vir:21 147 LPEPEGSNYNPMKLYRLSSYV-VQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIY------- 218 (536) T ss_pred EeeCCCCceeeEEEEEcCeEE-EeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEEE------- Confidence 433332 23 36677877777 45667777776663321110 000011111111111000 Q ss_pred eeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHH Q lcl|NC_019418. 200 STKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTID 279 (527) Q Consensus 200 ~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid 279 (527) ...++.++.+ |..-+ |..|+.++-+ .++..-+|..++- +...++.||+|-...+.+-+. T Consensus 219 ~~~~~~~~~~----~~e~~----g~~v~~~~g~---------~~f~~~P~i~~Rw----~~~~ge~YGrgp~~~~l~D~k 277 (536) T protein:vir:21 219 LDEDSGEYLR----YEEVE----GMEVQGSDGT---------YPKEACPYIPIRM----VRLDGESYGRSYIEEYLGDLR 277 (536) T ss_pred EecCCCcEEE----EeccC----CeeeccccCc---------cccccCCeeeeee----eecCCCccccchHHHHHHHHH Confidence 0011122221 11111 1122111100 0111112222221 223467899999999999999 Q ss_pred HHHHHHHHHHH-HHHcCcceeeech-hHhcCCC--CCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHH Q lcl|NC_019418. 280 FINRTYDEFMW-EIKMGQRRVIVPE-QMTQLKV--QDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDY 355 (527) Q Consensus 280 ~ld~~~s~~~~-e~~~~~~~i~v~~-~~l~~~~--~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~ 355 (527) .|+..--.... .....+....|++ .++.+.. ++..|. +++ +..+..++..+....+...- T Consensus 278 ~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~-------------~v~---g~~~~v~~~~~~~~~~~~~~ 341 (536) T protein:vir:21 278 SLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGD-------------FVT---GRPEDISFLQLEKQADFTVA 341 (536) T ss_pred HHHHHHHHHHHHHHHHhcCCcccCcccccchhhhccCCCcc-------------eec---CCcccceeeeccccccchHH Confidence 99976555544 3344444444432 2322110 111111 111 11111112212221222222 Q ss_pred HHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhcccCCccc Q lcl|NC_019418. 356 ISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKVVGIYRGTIP 434 (527) Q Consensus 356 ~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~~~~~~~~~~ 434 (527) .+.++.+-..|....=+. .+....+...|||||....+...+...-.-..+ ...|.-|+..++.+..-.++....+. T Consensus 342 ~~~i~~~~~rI~~af~~~--~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~ 419 (536) T protein:vir:21 342 KAVSDAIEARLSFAFMLN--SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPK 419 (536) T ss_pred HHHHHHHHHHHHHHHhhh--hcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCh Confidence 344444444443322111 122233445799999999988888666544444 23444566655555432222222222 Q ss_pred CccceEEEeCCCccC-CHHHHHHHHHHHHh--cCC--------CCHHHHHH---hcCCC-------CHHHHHHHHHHHHH Q lcl|NC_019418. 435 ELDDISVNLDDGVFT-DRHAELDYWMKMVA--AGF--------ATQKRGIA---KTLGI-------TEEEAEKELAEING 493 (527) Q Consensus 435 ~~~~v~v~f~d~i~~-d~~~~~~~~~~~~~--aGi--------~s~~~~i~---~~~~~-------~deea~~el~ri~~ 493 (527) +. +.+++--++.. .+...++..+...+ +++ +....++. ..+|+ |++|++++.++.+. T Consensus 420 ~~--v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~Pe~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~ 497 (536) T protein:vir:21 420 EA--VEPTISTGLEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSM 497 (536) T ss_pred hh--ccceEEecHHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHH Confidence 22 33333222211 11122222222211 121 22233332 23465 34555544433221 Q ss_pred hcccccccc-----cCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 494 ELPPESDAE-----LALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 494 E~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) ++....... ........+. ....-=+..+.+++ T Consensus 498 ~~~~~~~a~~~~~~~~~~~~~~~~-~~~~~~~~~g~~~~ 535 (536) T protein:vir:21 498 QMGMDNGAAALAQGMAAQATASPE-AMAAAADSVGLQPG 535 (536) T ss_pred HHHHHHHHHHHHHHHHHHHhcChh-hHHhhhhccccCCC Confidence 111100000 0000000000 00000011122222 No 174 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=97.97 E-value=8.6e-06 Score=48.36 Aligned_cols=387 Identities=11% Similarity=0.033 Sum_probs=161.2 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhh----hccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSI----LDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIA 76 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~----~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~ 76 (527) |+.|+.++ .+ ++.++|.+.-... +.......+ ........|..+..+. +. ........- T Consensus 1 ~~~~~~~~-~~--~~m~~F~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~----------~~---~~~~~~~~v 63 (413) T protein:vir:96 1 MPGVSEIR-KD--KNLKFFNNKRSPTEESKAKDEIPKAP-QVVMTLPNFFKELISD----------GY---TKLSDSPEV 63 (413) T ss_pred CCccchhh-hh--hcCCccccCCCcchhhhhhccccccc-cccccchhhHhhhccc----------hh---HHHhhchHH Confidence 88777766 11 2222332221110 000000000 0000111222111111 00 001112333 Q ss_pred HHHHHHHhhhhhcccceEeeC--------CHHHHHHHHH-HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCC--ee-EE Q lcl|NC_019418. 77 RTAAKKIASLVYNEQAEISAE--------DETLNDFLSD-MLSNDRFNKNFERYLESALALGGLAMRPYVDGD--KI-RV 144 (527) Q Consensus 77 ~~i~~~~A~ll~~e~~~i~~~--------d~~~~~~l~~-~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~--~~-~i 144 (527) ..+++.+|+-+..-|..+--. +..+...|.. --..-....-++.++.+.+..|.+++.+..+.. .+ .+ T Consensus 64 ~~cI~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~~~~~L 143 (413) T protein:vir:96 64 RMAVDCIADLVSNMTIQLMQNGETGDKRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGDKIIGL 143 (413) T ss_pred HHHHHHHHHhhccCceEEEEecCCCccccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCceEEE Confidence 455666666665544433111 1122222211 101112234446677788888999998888643 33 56 Q ss_pred EEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCc Q lcl|NC_019418. 145 AFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGE 224 (527) Q Consensus 145 ~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~ 224 (527) ..++|+.+-+.. +. +.++|.. . +.+.-| T Consensus 144 ~~l~~~~v~~~~-~~------------------~~~~y~~-~----------------------~~~~~~---------- 171 (413) T protein:vir:96 144 TPISPYKVTFNV-SD------------------DDLDYSI-T----------------------FDNKEY---------- 171 (413) T ss_pred EEecCceeEEEE-cC------------------CeEEEEE-e----------------------ecCcEE---------- Confidence 677777665532 11 1112210 0 000001 Q ss_pred eeecccccCCcccceeecCCCcccEEEecC-CccccccCCCc-cCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeec Q lcl|NC_019418. 225 RVNLSELYPDLQPVTPIQGLSRPLFTYLKT-PGMNNKDINSP-LGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVP 302 (527) Q Consensus 225 ~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~-~~~N~~~~~sp-lG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~ 302 (527) + +.+ ..||+. +.++ ++ .|.|.+.-+...+......-..-.+-|+.|.. | T Consensus 172 --~--------~~e----------vih~k~~~~~~-----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~----p 222 (413) T protein:vir:96 172 --D--------PST----------LLHFVLNPSIE-----RPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYM----P 222 (413) T ss_pred --c--------hhh----------EEEEeccCCCC-----CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCC----c Confidence 0 000 235542 2122 22 48888887777766555443333334555432 2 Q ss_pred hhHhcCCCCCCCcccccccccccccceeeecc-C-----CCCCCCcceEec-cccChHHHHHHHHHHHHHHHHhcCCCcc Q lcl|NC_019418. 303 EQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVG-A-----GNMDSGGIVDLT-TPIRSSDYISAISEGLKLFEMQIGVSSG 375 (527) Q Consensus 303 ~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~-~-----~~~~~~~i~~~~-~~ir~e~~~~~~~~~l~~i~~~~g~s~~ 375 (527) ..++..+.+..... ..+....-+..|.+.. . -+.+...++.+. ......++++..+...++|+...|++|. T Consensus 223 ~gil~~~~~l~~e~--~~~~~~~~~~~~~g~~n~g~~~vl~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~ 300 (413) T protein:vir:96 223 NLIVSVDSDSDELS--DEEGRENFEEMYLKRKEAGKPWIIPEGMVNVQQIKPLTLNDLAINDAVTLDKKTVAGIFGVPAF 300 (413) T ss_pred cEEEEeCCCCCHHH--HHHHHHHHHHHhcCccccCceeeecCCcccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHH Confidence 22222221111100 0000000011122110 0 011111222222 2334567777777778899999999999 Q ss_pred cccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHH Q lcl|NC_019418. 376 MFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAEL 455 (527) Q Consensus 376 ~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~ 455 (527) .+|.... ++-. . ...++.+|..++..|....+. .++ ++...+.+++++-+..|.++.+ T Consensus 301 ~lg~~~~-----~~~~--~----------~~~~~~~l~P~~~~ie~~ln~-~ll----~~~~~~~fd~~~ll~~d~~~~~ 358 (413) T protein:vir:96 301 LLGVGTY-----NKDE--F----------NNFINTKIMSIAQVIQQTYNK-LIV----EEDMYFSLNPRSLYNYSLTEMV 358 (413) T ss_pred HcCCCcc-----hHHH--H----------HHHHHHHHHHHHHHHHHHHHH-hhC----CCCcEEEEechhhhccCHHHHH Confidence 9875322 1110 0 113344555555554433221 122 1234567777777788989999 Q ss_pred HHHHHHHhcCCCCHHHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_019418. 456 DYWMKMVAAGFATQKRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDE 524 (527) Q Consensus 456 ~~~~~~~~aGi~s~~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (527) +...+++.+|+|++-++++.. |+..-+ ..+-+. +.+....+.-++.+. ..+||. T Consensus 359 ~~~~~~~~~G~~t~NE~R~~~-g~~p~~~gd~~~~-------~~n~~~~~~~~~~~~-------~~~~dt 413 (413) T protein:vir:96 359 SAGAQMTQLNALRRNEFRNWV-GMPPDAEMDDLLV-------LENYLQQKDLVNQKK-------LIQDET 413 (413) T ss_pred HHHHHHHhCCCcCHHHHHHHh-CCCCCCCcceeee-------cccccchhhcccccC-------CCCCCC Confidence 999999999999999976543 654321 111000 000000000000000 011111 No 175 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=97.93 E-value=1e-05 Score=47.94 Aligned_cols=408 Identities=11% Similarity=0.068 Sum_probs=163.9 Q ss_pred CC-hHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccc-cCceeecchHHH Q lcl|NC_019418. 1 MS-LIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRK-RRKMQHLPIART 78 (527) Q Consensus 1 m~-~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~-~~~~~~lnl~~~ 78 (527) |. -++++...++.- +.+.+. +.+.++.. .-|..| .|.... .|... ....+...--.. T Consensus 1 ~~~~~~~~~~~~~~~--------~~~~~g-~~~s~~~~-----~~~~~~-~~~~~~------~g~~v~~~~al~~~~v~~ 59 (437) T protein:vir:10 1 MKQGKQRALGRIKSS--------FLKWLG-VPISLTDG-----SFWSAW-GGMGSS------SGETVTADSALQLSAVWS 59 (437) T ss_pred CCcchhhhhhhhHHh--------hhhhcC-CcccCCch-----hHHHhh-cccccC------CCceechHhhhccHHHHH Confidence 33 122222222211 111111 12222221 112222 221111 11100 011111111123 Q ss_pred HHHHHhhhhhcccceE----------eeCCHHHHHHHH-HHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCee-EEEE Q lcl|NC_019418. 79 AAKKIASLVYNEQAEI----------SAEDETLNDFLS-DMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKI-RVAF 146 (527) Q Consensus 79 i~~~~A~ll~~e~~~i----------~~~d~~~~~~l~-~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~-~i~~ 146 (527) +++.+|+-+.+-|..+ .+.+..+...|. +--..-......+.++...+..|.+++.+..++|++ .+.. T Consensus 60 ci~~Ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~g~~~~L~~ 139 (437) T protein:vir:10 60 CVRLIAETIATLPLNLYQTKPDGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRSAGVLIGLEL 139 (437) T ss_pred HHHHHHHHHhhCceeEEEEcCCCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEEEEE Confidence 3444444443333222 112222333232 111111223334456667778899998887776654 3555 Q ss_pred EcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCcee Q lcl|NC_019418. 147 IQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERV 226 (527) Q Consensus 147 v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v 226 (527) ++|+.+-+.. +.++ ..+|.. .. .. |... T Consensus 140 l~p~~v~i~~-~~~g-----------------~~~y~~-~~-----------------~~----------------g~~~ 167 (437) T protein:vir:10 140 MLPQRTTVKR-LTSG-----------------ALQYTY-RN-----------------VD----------------GTVS 167 (437) T ss_pred EcCcceEEEE-CCCC-----------------eEEEEE-Ee-----------------cC----------------ceEE Confidence 6666655432 1111 112210 00 00 1000 Q ss_pred ecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCc-ceeeechhH Q lcl|NC_019418. 227 NLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQ-RRVIVPEQM 305 (527) Q Consensus 227 ~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~-~~i~v~~~~ 305 (527) .+ ++. -+.||+.+.. +..+|+|.+.-+...+......-.-..+-|..|. +.-++ T Consensus 168 ~~-------~~~---------dIih~r~~~~-----d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil---- 222 (437) T protein:vir:10 168 TL-------AED---------DVFHVRGFSL-----DGLMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVL---- 222 (437) T ss_pred EE-------ccc---------cEEEecCcCC-----CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEE---- Confidence 00 000 1245553311 2357999888777777655544333344455533 33333 Q ss_pred hcCCCCCCCcc-cccccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccc Q lcl|NC_019418. 306 TQLKVQDNQGN-IAFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTF 379 (527) Q Consensus 306 l~~~~~~~~~~-~~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~ 379 (527) .......... -.....| ...|.+.. .+ -+++..++.++......++.+..+...++|+...|++|..+|+ T Consensus 223 -~~~~~l~~e~~~~~~~~~---~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~ 298 (437) T protein:vir:10 223 -STDQILQKEKRAEIRTDL---AEQFGGAMQAGKTMVLEAGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGH 298 (437) T ss_pred -EcCCCCCHHHHHHHHHHH---HHHhcCccccCcceeccCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCC Confidence 2111111000 0000111 11122111 00 0122245566666667788888888888999999999999987 Q ss_pred cccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHH Q lcl|NC_019418. 380 DGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWM 459 (527) Q Consensus 380 ~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~ 459 (527) ...+..+...+..... ..++.+|..++..|-...+. .++.........+.++++.-+..|..+.++... T Consensus 299 ~~~~t~~~sn~e~~~~----------~f~~~tl~P~~~~ie~~l~~-kll~~~e~~~~~~~fd~~~ll~~d~~~r~~~~~ 367 (437) T protein:vir:10 299 SEKSTSWGTGIEQQTL----------GFLTFTLRPWLTRIEQAARR-SLLRPGERDQFYAEFSVEGLLRADSAGRAAFYS 367 (437) T ss_pred CCCcccccchHHHHHH----------HHHHHHHHHHHHHHHHHHHh-hccCccccCceEEEEechhhhccCHHHHHHHHH Confidence 6554332222211111 12233333333333221111 121111122234666666667788899999999 Q ss_pred HHHhcCCCCHHHHHHhcCCCCH---HHH----HHHHHHHHHhcccccccccCCCCC-CCCCCCCCCCCCCCcccc Q lcl|NC_019418. 460 KMVAAGFATQKRGIAKTLGITE---EEA----EKELAEINGELPPESDAELALYGK-GQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 460 ~~~~aGi~s~~~~i~~~~~~~d---eea----~~el~ri~~E~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 526 (527) +++.+|+|++-+++.++ |+.. ... ..-+..+..... .. ++..++ +.........+..++||= T Consensus 368 ~~~~~G~~T~NE~R~~~-gl~pi~gg~~~~~~~~~~~~~~~~~~-~~---~~~~~~~~~~~~~~~~~~~~~~~e~ 437 (437) T protein:vir:10 368 TMTQNGLMTRDECRAKE-NLPPMGGNAAVLTVQSALLPIDKLGE-HT---TATAAQDALKAWLYQEEKTRATQER 437 (437) T ss_pred HHHhCCCcCHHHHHHHh-CCCCCCCCcceEeecCcccchhhccC-cC---CCcchhccccccCCCCCCCCccccC Confidence 99999999999987654 4422 110 000011111000 00 000000 000011111112222222 No 176 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=97.93 E-value=1e-05 Score=47.93 Aligned_cols=488 Identities=11% Similarity=0.025 Sum_probs=215.7 Q ss_pred hHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccc-----cCccccCceeecchHH Q lcl|NC_019418. 3 LIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNT-----DGDRKRRKMQHLPIAR 77 (527) Q Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~-----~~~~~~~~~~~lnl~~ 77 (527) |=++-+.++++...++--. +.-.++-..+..+..+||.+....|..... .+....+..++.|+=+ T Consensus 1 m~e~~~~~~~~~~~~~~~~----------~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~ 70 (706) T protein:vir:10 1 MAESRQKQHERVMLRFDRA----------WSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVA 70 (706) T ss_pred CCcchHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchH Confidence 4445666666544332110 111234455556677888765555543222 2233346678889999 Q ss_pred HHHHHHhhhhhcccceEeeC------CHHHHHHH----HHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-------- Q lcl|NC_019418. 78 TAAKKIASLVYNEQAEISAE------DETLNDFL----SDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-------- 139 (527) Q Consensus 78 ~i~~~~A~ll~~e~~~i~~~------d~~~~~~l----~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-------- 139 (527) .+|+...++.-...+.+.+- +..+++.| ..+.+.++.......+...++..|-+|+++..|- T Consensus 71 ~~v~~v~g~~~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~~~~d~~~ 150 (706) T protein:vir:10 71 TELNRIISEYRNNRISVKFRPGDNAASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTTSFVNEYDPMD 150 (706) T ss_pred HHHHHHhhHHHhCCCceEEecCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEeeeccccccCCCC Confidence 99999888877777666642 23345544 4455577888889999999999999999997641 Q ss_pred --CeeEEEEEc-C-CceEEEEEcCC----ceEEE--EEEEEEEeeCC------C--------cceEE----------EEE Q lcl|NC_019418. 140 --DKIRVAFIQ-A-PVFLPLQSNTQ----DVSSA--AILTKTIKTEN------R--------KNVYY----------TLV 185 (527) Q Consensus 140 --~~~~i~~v~-a-~~~~P~~~d~~----~~~~~--a~~~~~~~~~~------~--------~~~~y----------t~l 185 (527) ..++|..|. | +.++ ||.. ....| ++..+++..+. + ...++ ... T Consensus 151 ~~~~i~i~~v~~p~~~v~---~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~~~d~~~~~ 227 (706) T protein:vir:10 151 ERQRIAVEPIYDPARSVW---FDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWFTPDVVYIA 227 (706) T ss_pred CCccceeeeeccchhcee---cCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhhhccccccccccCCCcceec Confidence 245555542 3 3443 2211 11222 22221111000 0 00000 001 Q ss_pred EEEeeccc--ccccceeeecCCceEEEEEEEecCCcccc---Cce------eecccc-cCCcccceeecC---CCcccEE Q lcl|NC_019418. 186 EFHEWVTP--TGQEVGSTKDKSLYRITNELYKSTSDSQL---GER------VNLSEL-YPDLQPVTPIQG---LSRPLFT 250 (527) Q Consensus 186 E~h~~~~~--~~~~~~~~~~~~~~~I~n~ly~~~~~~~l---G~~------v~l~~~-~~~l~~~~~~~g---~~~p~f~ 250 (527) ||.+.... ....+......+.+.+...-+.. ....+ |.. ++--.+ |..+.+...+.+ ++.-.|- T Consensus 228 eyy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l~~~~p~~~~~~P 306 (706) T protein:vir:10 228 KYYEVRKESVDVISYRQPLTQEIATYDSEQIAD-IQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFLEKPRRIPGEHIP 306 (706) T ss_pred ccccccceeEEEEEeeccccCCceeeccchhhh-hHHHHhhCCchhhhhcccceeeEEEEeeccccccccCCCCCCCccc Confidence 11100000 00000000000000000000000 00000 000 000000 000111111111 1001122 Q ss_pred EecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCccee-eechhHhcCCCCCCCcccccccccccccce Q lcl|NC_019418. 251 YLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRV-IVPEQMTQLKVQDNQGNIAFKRRFDVEQNV 329 (527) Q Consensus 251 ~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i-~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~ 329 (527) |+|+-..-....+++..-+.+.++++.++.+|.+.|.+.+-+-..+... .++.+-+... .. ... .....+.- T Consensus 307 ~vP~~g~r~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~---~~-~~~---~~~~~~~~ 379 (706) T protein:vir:10 307 LIPVYGKRWFIDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTPIVDMEQIRGL---EQ-HWE---GRNRKRPA 379 (706) T ss_pred eEEEeeccccccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCcccccchhHHHHH---HH-Hhh---hccccccc Confidence 2322111110122322335599999999999999999998774433322 1211111000 00 000 00000000 Q ss_pred e---eeccCCCCC----CCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHH Q lcl|NC_019418. 330 Y---MQVGAGNMD----SGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRN 402 (527) Q Consensus 330 ~---~~~~~~~~~----~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~ 402 (527) | ..+...++. ...+..+++.--...+.+.++.....|....|++...+|..++ .++.+|.+.......... T Consensus 380 ~l~~~~~~~~~g~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~sn--~SG~Ai~~rq~qg~~~~~ 457 (706) T protein:vir:10 380 FLPLRTVTDKTGNVVAPANVAGYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQMPSN--VARETVNSLLNRSDMASF 457 (706) T ss_pred chhcccccCCCCcccccccccccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCccc--hHHHHHHHHHHHHHHHHH Confidence 1 111111111 1112222332223457777887788888999999999886543 467788888877777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhh-------cccCC--------------------------cccCccceEEEeCCCccC Q lcl|NC_019418. 403 SIVALVEQSIKELCVSMCELGKVV-------GIYRG--------------------------TIPELDDISVNLDDGVFT 449 (527) Q Consensus 403 ~~~~~~~~al~~li~~il~~~~~~-------~~~~~--------------------------~~~~~~~v~v~f~d~i~~ 449 (527) .+...+..+.+..-+.+|.+...+ .+.+. .....++|+|+=..+.+. T Consensus 458 ~~~Dnl~~~~~~~g~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t 537 (706) T protein:vir:10 458 IYLDNMAKSLKRAGEIWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSA 537 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCcch Confidence 788888888888777777664321 11110 001123444444444444 Q ss_pred CHHHHHHHHHHHHhcCC-CCHHH-----HHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCc Q lcl|NC_019418. 450 DRHAELDYWMKMVAAGF-ATQKR-----GIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDD 523 (527) Q Consensus 450 d~~~~~~~~~~~~~aGi-~s~~~-----~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 523 (527) -+++..+.++++..++. ....+ .+.++-+++- +++.+++|++...+.....+ .+.+. ..- T Consensus 538 ~r~~~~~~m~el~~~~~p~~~~~~~l~~~~~~~~d~p~--~~e~~e~irk~~~~q~~~~~------~~~~e------q~~ 603 (706) T protein:vir:10 538 RRDATVNALTQLLQGMLPQDPMRPALMGIIIDNMEGEG--LDDFKAFNRRQLLTQGIVKP------RNQQE------QAI 603 (706) T ss_pred HHHHHHHHHHHHHHhcCCcchhhHHHHHHHHhhcCccc--hHHHHHHHHHhhcccCCccc------cchhH------HHH Confidence 46677777777775432 22222 1233333321 33445566554432211100 00000 000 Q ss_pred cccC Q lcl|NC_019418. 524 EDEA 527 (527) Q Consensus 524 ~~~~ 527 (527) ..+. T Consensus 604 ~~q~ 607 (706) T protein:vir:10 604 VQQA 607 (706) T ss_pred HHHH Confidence 0000 No 177 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=97.90 E-value=1.2e-05 Score=47.62 Aligned_cols=383 Identities=12% Similarity=0.056 Sum_probs=154.4 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcC-CCcccccccccCccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQS-KFDDIEYTNTDGDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g-~~~~l~~~~~~~~~~~~~~~~lnl~~~i 79 (527) ||||++++.+.++ +. .... ..+..++.+ ........ +. ........-..+ T Consensus 1 Mg~f~~~~~~~~~--------~~--------~~~~-------~~~~~~~~~~~~~~~~~~---~~---~~~~~~~~v~~~ 51 (406) T protein:vir:95 1 MGLFDRWRRTKRK--------SK--------IRAD-------TGYVGLFMSGEDVSFLVP---GY---VRLSDNPEVRMA 51 (406) T ss_pred Ccchhhhcccccc--------cc--------cccc-------chhhhhhccCcccCcccc---CH---HHHhhcHHHHHH Confidence 9999876654332 10 0000 011122221 11100000 00 001112333455 Q ss_pred HHHHhhhhhcccceEe-eC-------CHHHHHHHH-HHHhhhhHHHHHHHHHHHHHhcCCEEE--EEEEeC-Cee-EEEE Q lcl|NC_019418. 80 AKKIASLVYNEQAEIS-AE-------DETLNDFLS-DMLSNDRFNKNFERYLESALALGGLAM--RPYVDG-DKI-RVAF 146 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~-~~-------d~~~~~~l~-~~l~~n~f~~~~~~~~~~a~~~G~~~~--~~~~d~-~~~-~i~~ 146 (527) ++.+|+-+..-+..+- .+ +......|. +--..-.....++..+.+.+..|.++. .+..+. +.+ .+.. T Consensus 52 i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g~~~~l~~ 131 (406) T protein:vir:95 52 VHKIADLISSMTIYLMQNTEDGDIRIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADGLIDELVP 131 (406) T ss_pred HHHHHHhhccCceEEEEecCCcceeecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCCcEEEEEE Confidence 5666665554443321 01 111111111 100011223334445556666665543 333443 232 3444 Q ss_pred EcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCcee Q lcl|NC_019418. 147 IQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERV 226 (527) Q Consensus 147 v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v 226 (527) ++|+.+-++... + + | .+++ . |..+ T Consensus 132 i~~~~v~~~~~~-~------------------~--~-------------------------~~~~---~-------~~~~ 155 (406) T protein:vir:95 132 LTPSKVNFLDTP-D------------------G--Y-------------------------QVLY---G-------GQTF 155 (406) T ss_pred EcCceeEEEEcC-C------------------e--E-------------------------EEEe---c-------cEEE Confidence 556554432111 0 0 1 1110 0 0011 Q ss_pred ecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHh Q lcl|NC_019418. 227 NLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMT 306 (527) Q Consensus 227 ~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l 306 (527) +. . -+.||+.+. +.. ..-.|+|.+.-+...+.....+-....+-+..|... ..++ T Consensus 156 ~~--------~----------evih~~~~~-~~~--~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~----~~il 210 (406) T protein:vir:95 156 NY--------D----------EVLHFIYNP-DPE--RPYIGRGYRVVLKDIADNLKQATATKKSFMSGKYMP----SLIV 210 (406) T ss_pred ch--------h----------HEEEeeccC-CCC--CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCc----ceEE Confidence 10 0 023554211 111 123589998888887776665544444445544331 1112 Q ss_pred cCCCCCCCc-ccccccccccccceeeecc-CC-----CCCCCcceEec-cccChHHHHHHHHHHHHHHHHhcCCCccccc Q lcl|NC_019418. 307 QLKVQDNQG-NIAFKRRFDVEQNVYMQVG-AG-----NMDSGGIVDLT-TPIRSSDYISAISEGLKLFEMQIGVSSGMFT 378 (527) Q Consensus 307 ~~~~~~~~~-~~~~~~~~d~~~~~~~~~~-~~-----~~~~~~i~~~~-~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~ 378 (527) ......... .-.....| ..-|.+.. .+ ..+...++.++ ......++.+..+....+|+...|++|.-+| T Consensus 211 ~~~~~l~~e~~~~~~~~~---~~~~~g~~n~~~~~v~~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg 287 (406) T protein:vir:95 211 KVDAATAELSSEEGRNAV---FKKYLQATEAGQPWIIPAELLEVEQVKPLSLKDIAINEAVELDKRTVAGMFGVPAFLLG 287 (406) T ss_pred EeCCCCCHHHHHHHHHHH---HHHhccccccCCceeecCCCccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcC Confidence 111110000 00000111 11122211 00 11111122222 2334567778888888899999999999886 Q ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHH Q lcl|NC_019418. 379 FDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYW 458 (527) Q Consensus 379 ~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~ 458 (527) ..... +. .. ...++.+|..++..|-...+. .++. .....+.+++++-+..|..+.++.. T Consensus 288 ~~~~~-----~~--~~----------~~~~~~~l~P~~~~ie~~l~~-~l~~---~~~~~~~fd~~~l~~~d~~~~~~~~ 346 (406) T protein:vir:95 288 IGEFN-----RD--EY----------NNFINSTILPIAKGIEQELTR-KLLI---SPDLYFKFNPRSLYAYDLKELAEVG 346 (406) T ss_pred CCCch-----HH--HH----------HHHHHHHHHHHHHHHHHHHHH-hcCC---CCCcEEEeechhhhcCCHHHHHHHH Confidence 43221 11 11 123445555555555433221 1221 2234577777777778889999999 Q ss_pred HHHHhcCCCCHHHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 459 MKMVAAGFATQKRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 459 ~~~~~aGi~s~~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (527) .+++.+|+|++-+++.+ .|++.-+ ..+-+. .....+ ....++.+....+ ++++.+++-| T Consensus 347 ~~l~~~G~~t~NE~R~~-~gl~p~~~gd~~~~--~~n~~~-----~~~~~~~~~~k~g-~~~~~~~~~~ 406 (406) T protein:vir:95 347 SNMYVRGIMEGNEVRDW-LGLSPKEGLSELVI--LENYIP-----LDKIGDQSKLKGG-DNSGADGQTD 406 (406) T ss_pred HHHHhCCCcCHHHHHHH-hCCCCCCCcceeee--ccCccc-----hhhcccccccCCC-CCCCCCCCCC Confidence 99999999999997754 3664321 111100 000000 0000000000111 1111111111 No 178 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=97.88 E-value=1.3e-05 Score=47.42 Aligned_cols=369 Identities=9% Similarity=0.079 Sum_probs=155.0 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcC--CCcccccccccCcc-ccCceeecchHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQS--KFDDIEYTNTDGDR-KRRKMQHLPIAR 77 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g--~~~~l~~~~~~~~~-~~~~~~~lnl~~ 77 (527) |+||+++. + .. +....+ +.++.+ .+..+..+.. |.. ..+..++..--. T Consensus 1 Mglf~~~~---~--------~~-------~~~~~~----------~~~~~~~~~~~~~~~~~~-~~~v~~~~al~~~~V~ 51 (384) T protein:vir:49 1 MPIFNITN---L--------AT-------ESPPSN----------QDSFFDITDPEFLDALNG-SEWVSAETALKNSDLF 51 (384) T ss_pred Cccccccc---c--------Cc-------cccccc----------chhhccccchhhcccccC-CceechhhhhccHHHH Confidence 99987521 0 00 000000 010000 0111111100 100 001111112223 Q ss_pred HHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEEEcCCceEEE Q lcl|NC_019418. 78 TAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DK-IRVAFIQAPVFLPL 155 (527) Q Consensus 78 ~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~v~a~~~~P~ 155 (527) .+++.+|+-+-+-| +.+.+......+.+--..-....-...++.+.+..|.+++.+..+. ++ +.+..++|+.+-++ T Consensus 52 ~~i~~Ia~~ia~l~--~~~~~~~~~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~ 129 (384) T protein:vir:49 52 SIISQLSNDLATAK--ITTSRKQLQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFN 129 (384) T ss_pred HHHHHHHHHHhhCc--eeeecchhhhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEE Confidence 45555666555444 3444443333222211111233344556777788899999888864 33 35666777766554 Q ss_pred EEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCc Q lcl|NC_019418. 156 QSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDL 235 (527) Q Consensus 156 ~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l 235 (527) ..+++ ...+|.. .. ++...|..+.+ T Consensus 130 ~~~~~-----------------~~~~y~~-~~------------------------------~~~~~~~~~~~------- 154 (384) T protein:vir:49 130 RLDNQ-----------------NGLYYNI-TF------------------------------DDPRIPPKQHV------- 154 (384) T ss_pred EcCCC-----------------ceEEEEE-Ee------------------------------cCccccceeEe------- Confidence 32211 1122210 00 00001111110 Q ss_pred ccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcC-cceeeechhHhcCCCCCCC Q lcl|NC_019418. 236 QPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMG-QRRVIVPEQMTQLKVQDNQ 314 (527) Q Consensus 236 ~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~-~~~i~v~~~~l~~~~~~~~ 314 (527) .. --+.|++.+.++ +..+|+|.+.-+...++....+-....+-|..| .++.++ ........ T Consensus 155 ------~~---~eVih~~~~~~~----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il-----~~~~~~~~ 216 (384) T protein:vir:49 155 ------PQ---GDILHFRLLSVD----GGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGIL-----KIKGGGLL 216 (384) T ss_pred ------cC---ccEEEecCCCCC----CceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEE-----EeCCCCCh Confidence 00 113466643322 234689988888877765554444444445553 333332 21111100 Q ss_pred cccccccccccccceeeec-cCC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHH Q lcl|NC_019418. 315 GNIAFKRRFDVEQNVYMQV-GAG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATE 389 (527) Q Consensus 315 ~~~~~~~~~d~~~~~~~~~-~~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAte 389 (527) . ......+.-+.+. +.+ -+++..++.++....+.++.+..+...++|+...|++|..+|...++..|+.. T Consensus 217 ~-----~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~ 291 (384) T protein:vir:49 217 D-----FKTKQSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEM 291 (384) T ss_pred H-----HHHHHHHHHHhcccCCccceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHH Confidence 0 0000000001110 000 01122355556566777888888888999999999999999876555555554 Q ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCC Q lcl|NC_019418. 390 IVSENSDTY-QMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFAT 468 (527) Q Consensus 390 i~s~~~~~~-~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s 468 (527) +...+...+ .+..-+...++.+|..- +. .+.....-.+..........++.+|+++ T Consensus 292 ~~~~~~~~i~~~l~pi~~~i~~~l~~~----l~-------------------~~~~~~~~~~~~~~~~~~~~l~~~~~~t 348 (384) T protein:vir:49 292 IYNIYFKAVSRFLRPFVSELSKKLSCE----VD-------------------ADILPAVDPTGSNYIGLINSMVKTGTLA 348 (384) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhchh----hh-------------------hhhhhhhhccchHHHHHHHHHhhcCccc Confidence 433322221 11222222222222110 00 0000111112222233444677889999 Q ss_pred HHHHHHhc--CCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 469 QKRGIAKT--LGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 469 ~~~~i~~~--~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) +-+++..+ .|+...|+++.. .. +++ .+||++|- T Consensus 349 ~~e~~~~l~~~g~~~ne~r~~~-----~~-------~p~--------------~gGd~~~~ 383 (384) T protein:vir:49 349 QNQGLYVLQQAEILPKDLPEGE-----TD-------STL--------------KGGETNEQ 383 (384) T ss_pred HHHHHHHHhhCCCCChhHHHHc-----CC-------CCC--------------CCCCCCCC Confidence 98876654 366545554331 00 111 12333333 No 179 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=97.83 E-value=1.6e-05 Score=46.85 Aligned_cols=449 Identities=14% Similarity=0.155 Sum_probs=198.4 Q ss_pred HHHhhcccchhhhcc---CccccCHHHHHHHHHHHHHhcCCCcccccccccCcccc-----Cceeec---chHHHHHHHH Q lcl|NC_019418. 15 RYNMTTSHLSSILDH---PKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKR-----RKMQHL---PIARTAAKKI 83 (527) Q Consensus 15 ~~~~~~~~~~~~~~~---~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~-----~~~~~l---nl~~~i~~~~ 83 (527) |..+|+=+++..... +.++.+..+-..+..-.--|-| .+....+..+. +.+.+| +--...++.. T Consensus 1 m~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~i~~~~~~~-----~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eI 75 (533) T protein:vir:10 1 MSQLFGFSLERAKKAPKGPSFVQKDNLDGSQPVSGGGYYG-----YTVDFDGQVRNEYQLISRYREMVLQPECDSAVDDI 75 (533) T ss_pred CccccccccccccccccCCCCCCCCcccccceeecccccc-----eeeecccccchHHHHHHHHHHHhhccchhhHHHHh Confidence 444444444433222 2222222222221111110111 11111221111 011111 1111122222 Q ss_pred hh-h-hh---cccceEeeCCH--------HHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCe-----eEEE Q lcl|NC_019418. 84 AS-L-VY---NEQAEISAEDE--------TLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDK-----IRVA 145 (527) Q Consensus 84 A~-l-l~---~e~~~i~~~d~--------~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~-----~~i~ 145 (527) .+ - ++ ..|+++.+++- ...+.++.++.=-+|.+...+.+....+-|..+|+..+|.++ ..+. T Consensus 76 Vneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~ELr 155 (533) T protein:vir:10 76 VNETICGNFDDVPVSVELSNLKVSDKIKKLIREEFGEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPDNPQGGLIELR 155 (533) T ss_pred hcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCCCccccceeee Confidence 22 1 11 12455555542 234556667776789999999999999999999999998543 4688 Q ss_pred EEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecC---Ccccc Q lcl|NC_019418. 146 FIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKST---SDSQL 222 (527) Q Consensus 146 ~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~---~~~~l 222 (527) +++|.++-|+.. ++.....++.+..+.... -+...++-+|.-. ....- T Consensus 156 ~lDPr~i~~vr~--------------i~~~~~~~~~~~~~~~~v---------------~~~~~eyf~Ynp~g~~~~~~~ 206 (533) T protein:vir:10 156 YIDPRKIRKINE--------------TEQKRPEQLRGLPLNQQL---------------SPKSAEYFLYDPKGLKNSTTQ 206 (533) T ss_pred eccccceeeeee--------------eeccCCCccceeecchhh---------------hccceeeeeeccccccccCCC Confidence 899988877531 111111111000000000 0001111112100 00111 Q ss_pred CceeecccccCCcccceee--cCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHH--HHHHcCcce Q lcl|NC_019418. 223 GERVNLSELYPDLQPVTPI--QGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFM--WEIKMGQRR 298 (527) Q Consensus 223 G~~v~l~~~~~~l~~~~~~--~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~--~e~~~~~~~ 298 (527) |-.+|-. .+++ +|+ + +.....=+|-+..|.-.+..|=..-+.++ +-.|+-.+| T Consensus 207 ~vkI~~d--------AI~y~hSGl-------~--------d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRR 263 (533) T protein:vir:10 207 GLKIAPD--------SICYVHSGI-------M--------DLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERR 263 (533) T ss_pred ceecchh--------heeeeeccc-------e--------eCCCCceeccchHhHHHHHhhHHHHhhHHHHhhhccccce Confidence 2222211 1111 221 0 11111123556666665555554444443 444566677 Q ss_pred eeec----------hhHh---------cCCCCCCCcccccccccccc-cceeeeccCCCCC-CCcceEeccccChHHHHH Q lcl|NC_019418. 299 VIVP----------EQMT---------QLKVQDNQGNIAFKRRFDVE-QNVYMQVGAGNMD-SGGIVDLTTPIRSSDYIS 357 (527) Q Consensus 299 i~v~----------~~~l---------~~~~~~~~~~~~~~~~~d~~-~~~~~~~~~~~~~-~~~i~~~~~~ir~e~~~~ 357 (527) ||-- +.++ +..-|..+|++.-.+.+-.- ...+.+- -+|+ ..-|+++..-=...+ .. T Consensus 264 vFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLPR--ReGgrgTEItTLpGgqnLge-m~ 340 (533) T protein:vir:10 264 IFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPR--REGGRGTEITTLPGGQNLGE-LE 340 (533) T ss_pred EEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhhhcccc--cCCCCccceeeccccCCcCh-HH Confidence 7641 0111 00123444444333222110 0001110 1122 123555554333333 45 Q ss_pred HHHHHHHHHHHhcCCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCc Q lcl|NC_019418. 358 AISEGLKLFEMQIGVSSGMFTFDGQ-GVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPEL 436 (527) Q Consensus 358 ~~~~~l~~i~~~~g~s~~~~~~~~~-g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~ 436 (527) .+..+.+.+....+++..-++.+++ ...-++||.-..-.....+.+.+..|..-+.++++.=|.|-.. +...-+.. T Consensus 341 DV~YF~kKLY~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgi---it~eeW~~ 417 (533) T protein:vir:10 341 DVKYFQKKLYKSLNVPGSRLETETTFNVGRAAEITRDEVKFQKFVARLRKRFSELFTDLLKTQLVLKGV---ISIEEWDQ 417 (533) T ss_pred HHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccC---CCHHHHHH Confidence 5666677777777887776665443 1223567777777777788888888888888888876655332 21111111 Q ss_pred --cceEEEeCCCccCCHHHHHHHHHH---HH-hc----C-CCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccc------c Q lcl|NC_019418. 437 --DDISVNLDDGVFTDRHAELDYWMK---MV-AA----G-FATQKRGIAKTLGITEEEAEKELAEINGELPPE------S 499 (527) Q Consensus 437 --~~v~v~f~d~i~~d~~~~~~~~~~---~~-~a----G-i~s~~~~i~~~~~~~deea~~el~ri~~E~~~~------~ 499 (527) ..|.++|...=-..+..+++.... +. ++ | ..|.+++.++....||+|.+++..+|++|.... + T Consensus 418 i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~ 497 (533) T protein:vir:10 418 MKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLKQTDVEMKEIDKQIESEMESGIIADPAA 497 (533) T ss_pred HhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCCcc Confidence 346677754333333333332221 11 11 2 368888888888999999999999999887421 1 Q ss_pred ccccCCCCCCCCCCCCC-----CCCCCCccccC Q lcl|NC_019418. 500 DAELALYGKGQQNTVGN-----SKDTVDDEDEA 527 (527) Q Consensus 500 ~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~ 527 (527) ..++. .+.+++...+. .-..+..++|. T Consensus 498 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 529 (533) T protein:vir:10 498 EMDPA-MAAGDPDAGGAPAEEVAPEGPDPSDER 529 (533) T ss_pred hhhHH-hcCCCCCcCCcccccCCCCCCCcchhh Confidence 11110 01111110000 01122223333 No 180 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=97.83 E-value=1.6e-05 Score=46.83 Aligned_cols=404 Identities=12% Similarity=0.104 Sum_probs=162.2 Q ss_pred hHHHHHHHHHHHHHHhhcccchhhh--ccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHH--H Q lcl|NC_019418. 3 LIQKVKDFFNRGRYNMTTSHLSSIL--DHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIAR--T 78 (527) Q Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~--~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~--~ 78 (527) |...+.+.+.+... ..-..++ ....+.+++.. -|..|..+... .|..- .....+..|. . T Consensus 1 ~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~-----~~~~~~g~~~~-------~g~~v-~~~~al~~~~V~~ 63 (434) T protein:vir:43 1 MSKSLGKVLSSATS----APRSSLFGWGGKTIRLTDGA-----FWSQFLGRESS-------SGKKV-TVDKAMKLSAVWA 63 (434) T ss_pred Cccchhhhhhhccc----ccchhhhcccccccccCchH-----HHHHHhcCCcc-------CCcee-chhhhhccHHHHH Confidence 44444444332110 0000001 11222222211 13333322111 11110 0001122222 3 Q ss_pred HHHHHhhhhhcccceE-ee--CCH---HHHHHHHHHHh--hhh---HHHHHHHHHHHHHhcCCEEEEEEEeCCee-EEEE Q lcl|NC_019418. 79 AAKKIASLVYNEQAEI-SA--EDE---TLNDFLSDMLS--NDR---FNKNFERYLESALALGGLAMRPYVDGDKI-RVAF 146 (527) Q Consensus 79 i~~~~A~ll~~e~~~i-~~--~d~---~~~~~l~~~l~--~n~---f~~~~~~~~~~a~~~G~~~~~~~~d~~~~-~i~~ 146 (527) +++.+|+-+-+-|..+ .. ++. ..+-.+..+|. -|. -..-.+..+...+..|.+++.+..++|++ .+.. T Consensus 64 ~i~~ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~~G~~~~L~~ 143 (434) T protein:vir:43 64 CVRLISTSVAGLPLGVYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRAAGRPAALDF 143 (434) T ss_pred HHHHHHHhhhhCceEEEEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeCCCcEEEEEE Confidence 4455555554444333 11 110 11112333332 122 22334455667778899988877776664 4566 Q ss_pred EcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCcee Q lcl|NC_019418. 147 IQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERV 226 (527) Q Consensus 147 v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v 226 (527) ++|+.+-|.. +.++. .+|.. .. .+ |..+ T Consensus 144 l~p~~v~~~~-~~~g~-----------------~~y~~-~~---------------~~------------------g~~~ 171 (434) T protein:vir:43 144 LLPSRVDLEC-DENGR-----------------LKYFY-TT---------------KK------------------GARR 171 (434) T ss_pred EcCcceEEEE-cCCCe-----------------EEEEE-Ee---------------cC------------------ceEE Confidence 7777665532 22221 11110 00 00 0000 Q ss_pred ecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHh Q lcl|NC_019418. 227 NLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMT 306 (527) Q Consensus 227 ~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l 306 (527) .+ ++. -+.|++.+..+ ..+|+|.+.-+...+......-.--.+-|+.|.. |..++ T Consensus 172 ~~-------~~~---------eVih~~~~~~d-----g~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~----~~gil 226 (434) T protein:vir:43 172 EI-------ERT---------NMLHIPAFTLD-----GRIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLL----PTVAF 226 (434) T ss_pred EE-------ccc---------cEEEecCcCCC-----CccccCHHHHHHHHHHHHHHHHHHHHHHHhccCC----cceEE Confidence 00 000 12345432111 2468888887776665444322222223444322 12222 Q ss_pred cCCCCCCCc-ccccccccccccceeeeccCCC----CCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccc Q lcl|NC_019418. 307 QLKVQDNQG-NIAFKRRFDVEQNVYMQVGAGN----MDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDG 381 (527) Q Consensus 307 ~~~~~~~~~-~~~~~~~~d~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~ 381 (527) ..+...... .-.....+ +..+.+-+.+. .++..++.++....+.++.+..+....+|+...|++|..+|... T Consensus 227 ~~~~~l~~e~~~~~r~~~---~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~ 303 (434) T protein:vir:43 227 KVDRILQPAQREEFREYV---KSVSGAMNSGRSPVLEQGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTD 303 (434) T ss_pred ecCCCCCHHHHHHHHHHH---HHhcCccccCCccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCc Confidence 222111110 00000111 11111111110 11224555565666778888888888999999999999988755 Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHH Q lcl|NC_019418. 382 QGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKM 461 (527) Q Consensus 382 ~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~ 461 (527) .+..+.+.+.... ...+..+|..++..|-...+. .++.........+.+++++-+..|..+.++...++ T Consensus 304 ~~~~~~s~~e~~~----------~~f~~~~L~P~~~~ie~~ln~-kL~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~ 372 (434) T protein:vir:43 304 KGSNWGTGLEQQM----------LAFLTFSISSITNQIQQCVNK-RLLTAPERIRYYAEFSLEGFLKADSAGRAAWYSTM 372 (434) T ss_pred CCccccchHHHHH----------HHHHHHHHHHHHHHHHHHHHh-hcCChhhhcCceEEEechhhhccCHHHHHHHHHHH Confidence 4332222111111 112233333333333221110 12111111234456666666678999999999999 Q ss_pred HhcCCCCHHHHHHhcCCCCHH-HHHHH--------HHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccc Q lcl|NC_019418. 462 VAAGFATQKRGIAKTLGITEE-EAEKE--------LAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDED 525 (527) Q Consensus 462 ~~aGi~s~~~~i~~~~~~~de-ea~~e--------l~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (527) +.+|+|++-+++... |+..- ...+- +..+.+.+.+.+ ......+..++.++. + T Consensus 373 ~~~G~~T~NE~R~~~-gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~---------~ 434 (434) T protein:vir:43 373 AQNGFMTRNEGRRKE-NLPELPGGDILTVQSNLVPIDQLGQSNKSQA-VRAALMNWFSQPEPQ---------E 434 (434) T ss_pred HhCCCcCHHHHHHHh-CCCCCCCCCeEeeccCccchhhhhccCCCcc-hhhhhhccCCCCCCC---------C Confidence 999999999977653 54321 11100 111111111111 111111111111111 1 No 181 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=97.81 E-value=1.7e-05 Score=46.69 Aligned_cols=417 Identities=13% Similarity=0.116 Sum_probs=157.8 Q ss_pred CChHHHHHHH-----HHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecch Q lcl|NC_019418. 1 MSLIQKVKDF-----FNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPI 75 (527) Q Consensus 1 m~~~~~~k~~-----~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl 75 (527) |+ +|+. + ++.+...-...++.+... .+.. .|-+.|..|+.. .-+++... ..-+.+-.+ T Consensus 1 ~~--~~~~-~~~~~~~~~~~~~~~rd~l~~~~~----glg~---~r~~~~~~~g~~--~~~~~~~l-----~~~Yr~~~i 63 (449) T protein:vir:10 1 MT--DKLT-LAVNHALNDARMARARMGLMVPTM----GLDN---KRHSAWCEYGFP--ELVTYENL-----YSLYRRGGI 63 (449) T ss_pred Cc--hhhH-HHHhhhcchhHHHHHHHHHHHHHh----cCCc---ccchhhhhcCCc--ccCCHHHH-----HHHHhcCch Confidence 11 1111 1 000000000000000000 0000 011112222110 00100000 001123368 Q ss_pred HHHHHHHHhhhhhcccceEeeCCH----H----HHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEE Q lcl|NC_019418. 76 ARTAAKKIASLVYNEQAEISAEDE----T----LNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVAFI 147 (527) Q Consensus 76 ~~~i~~~~A~ll~~e~~~i~~~d~----~----~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v 147 (527) ++.||+..|+-+....+.|.-+++ . ....+++++ ..+++..+.++..++...|++++.+-++.++..- T Consensus 64 a~~iVd~~~d~~~~~~~~i~~g~~~~~~~~~~~~e~~~~~l~-~~~~~~~l~ea~~~~rl~Gga~i~i~v~d~~~l~--- 139 (449) T protein:vir:10 64 AHGAVEKLVGKCWQTNPEIIEGDDADDSEDETSWEKKSKQVF-TNRLWRSFAEADRRRLVGRYAGILLHIRDEKDWN--- 139 (449) T ss_pred hHHHHHhhhhhhhhcCcccccCccccchhhhHHHHHHHHHHH-HHHHHHHHHHHHHhhhccCcEEEEEEecCCCCCC--- Confidence 899999999988766555532211 1 123455544 3478888899999888888888777664443211 Q ss_pred cCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceee Q lcl|NC_019418. 148 QAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVN 227 (527) Q Consensus 148 ~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~ 227 (527) -|+. ...++....++|+. +... ..|++- +.. ..+ ..-..|+|...+. |... T Consensus 140 -----~Pl~-~~~~i~~i~v~~~~-~i~~--~~~~~d-----p~s---p~y---g~P~~y~v~~~~~--------g~~~- 190 (449) T protein:vir:10 140 -----LPAT-KGRGLQKVSVSWAG-SLKV--AEWDTG-----INS---KTY---GQPKLWKYTERLP--------NGSS- 190 (449) T ss_pred -----cccc-cCcceeeEEeeccc-cCCh--hhhhcC-----CCC---CCC---CCceEEEEeeecc--------CCCc- Confidence 1432 12233222233321 0000 001000 000 000 0111222221111 0000 Q ss_pred cccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHH----HH-HHHHcC----cce Q lcl|NC_019418. 228 LSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDE----FM-WEIKMG----QRR 298 (527) Q Consensus 228 l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~----~~-~e~~~~----~~~ 298 (527) ..+.++ -.|.+ .|. - ...-|+|.+..+-+-+-.++.+-.. ++ +..+.. .++ T Consensus 191 ---------~~~~iH-~SRl~--~~~-~-------~~~~g~~~L~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~~ 250 (449) T protein:vir:10 191 ---------RRVDIH-PDRVF--ILG-D-------YSEDAIGFLEPAYNAFVSLEKVEGGSGESFLKNAARQLNVNFEKE 250 (449) T ss_pred ---------cceeec-cceeE--eec-C-------CCCCChhHHHHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhhh Confidence 000111 12211 110 0 0011667776665555444443211 11 111100 001 Q ss_pred eeechhHhcCCCCCCCcccccccccccc-cceeeecc---CCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_019418. 299 VIVPEQMTQLKVQDNQGNIAFKRRFDVE-QNVYMQVG---AGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSS 374 (527) Q Consensus 299 i~v~~~~l~~~~~~~~~~~~~~~~~d~~-~~~~~~~~---~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~ 374 (527) +-+ ..+.... +.+-+ .....+... +.+-.+.+ .+.++ .++.++.. ..-....++....+++..+|++. T Consensus 251 ~~~-~~l~~~~--~~~~e-~~~~~~~~~~~~~~~~~~~~~i~~~~--d~~~~~~~--~sgl~d~l~~~~q~iaaa~~IP~ 322 (449) T protein:vir:10 251 IDF-TNLASLY--GVSID-ELQDKFNEVAGEINRGNDVLMTTQGA--TVTPLVTS--VADPTATYNVNLQTAAAGVDIPT 322 (449) T ss_pred hhh-hhhhHHh--hCCch-HHHHHHHHHHHHHhccchheeecCCc--ceEEEecc--cCChhHHHHHHHHHHHHHhCCCe Confidence 100 0111000 00000 000011000 00001111 12222 24444332 12344557777778888889886 Q ss_pred cc-ccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHH Q lcl|NC_019418. 375 GM-FTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHA 453 (527) Q Consensus 375 ~~-~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~ 453 (527) .- ||...+|..+ |+ ..+-.+..++..|..++..|+.|+..|+... + +.+ ..+++|.|++--..+..+ T Consensus 323 t~L~Gqsp~glns-t~---D~~nyyd~i~~~Q~~l~p~le~l~~~l~~s~----~--g~~--~~d~~i~f~pL~~~t~kE 390 (449) T protein:vir:10 323 RILIGNQQAERSS-TE---DQKYFNARCQSRRVDLSFEIEDFCDKLIELK----I--IDA--VAKKAVIWDDLNEQTGTE 390 (449) T ss_pred eeeeccCcccccc-ch---hHHHHHHHHHHHHHhhhHHHHHHHHHHHHhh----c--CCC--CCceeEEeCCCCCCCHHH Confidence 54 6777777653 32 2344566666677778999999998776542 2 122 236999999988888777 Q ss_pred HHHHHHHHHhcCCCCHHHHHHhc---CCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 454 ELDYWMKMVAAGFATQKRGIAKT---LGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 454 ~~~~~~~~~~aGi~s~~~~i~~~---~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) .++...+...+ ..+ +... .-++.+|+++.+.- + +.++.+.+ .+++++.+.+-++ T Consensus 391 kAei~k~~A~a----~~~-~~~ag~~~~~~~~EiR~~~~~--------~------~~~~~~~~-~e~~de~~~~~d~ 447 (449) T protein:vir:10 391 KLTNAKTMGEI----NQT-MLGSGDNPAFSREEIRTAAGY--------D------NDDEEPLG-EEDGDEEDKATDS 447 (449) T ss_pred HHHHHHHHHHH----HHH-HHHccccCCcCHHHHHHHhcc--------c------CCCCCCCC-CCCCccccccCCc Confidence 65554433221 111 1111 12466666544310 0 00111111 1111222222222 No 182 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=97.77 E-value=2e-05 Score=46.31 Aligned_cols=399 Identities=10% Similarity=0.062 Sum_probs=164.2 Q ss_pred cccccccCccccCceeecchHHHHHHHHhhhhhcccceEeeC-----CH---HHHHHHHH-HHhh-------------hh Q lcl|NC_019418. 56 IEYTNTDGDRKRRKMQHLPIARTAAKKIASLVYNEQAEISAE-----DE---TLNDFLSD-MLSN-------------DR 113 (527) Q Consensus 56 l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~-----d~---~~~~~l~~-~l~~-------------n~ 113 (527) |. .-...-+....+++.+|+-+.+-|..+... +. ..-+.+.. ++.. .- T Consensus 1 l~----------~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t 70 (467) T protein:vir:31 1 MA----------ELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERAT 70 (467) T ss_pred Ch----------hhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhH Confidence 10 000112456677777777777666555321 11 11111111 2211 12 Q ss_pred HHHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeec Q lcl|NC_019418. 114 FNKNFERYLESALALGGLAMRPYVDG-DK-IRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWV 191 (527) Q Consensus 114 f~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~ 191 (527) +...+..++.+.+..|.+++.+..+. |+ +.+.+++|..+-+.. +..+. +........||-. +.. T Consensus 71 ~~~~~~~~~~~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~-d~~~~---------~~~~~~~~~~~~~--~~~-- 136 (467) T protein:vir:31 71 ATNVLQTAWTDYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRM-DERGF---------VQLLEEKEKYFGV--AGD-- 136 (467) T ss_pred HHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeee-eccee---------EeecCCceeeEEe--ccc-- Confidence 34455667888888999999888875 33 567888888777642 22211 1111112222211 000 Q ss_pred ccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchh Q lcl|NC_019418. 192 TPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIF 271 (527) Q Consensus 192 ~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~ 271 (527) ... ....+.+.+..+.. .....|..+. + +.--..|++.+.+. +..+|+|.+ T Consensus 137 -----~~~---~~~~~~~~~~~~~~-~~~~~~~~~~-------------~---~~~diih~r~~~~~----~~~~G~s~~ 187 (467) T protein:vir:31 137 -----RYQ---TNGNGDLDPVFVDA-DDGSTGTSVS-------------N---PANELIFKRNHSPL----YPHYGAPDI 187 (467) T ss_pred -----cce---eecccceeeeeeee-ccccccceeE-------------e---ccccEEEecCCCCC----CCcccccHH Confidence 000 00011111100100 0111111111 1 11124566654322 234699999 Q ss_pred hhhHHHHHHHHHHHHHHHHH-HHcCcc-e--eeechhHhcCCCCCCCcccccccccccc-----cceee---e------- Q lcl|NC_019418. 272 DNAKTTIDFINRTYDEFMWE-IKMGQR-R--VIVPEQMTQLKVQDNQGNIAFKRRFDVE-----QNVYM---Q------- 332 (527) Q Consensus 272 ~~~~~lid~ld~~~s~~~~e-~~~~~~-~--i~v~~~~l~~~~~~~~~~~~~~~~~d~~-----~~~~~---~------- 332 (527) ..+...+.. +..-..+... |+.|.. . +.++..++. .+.. -.....|... +..+. + T Consensus 188 ~~~~~~i~~-~~~~~~~~~~~f~ng~~p~gil~~~~~~l~----~e~~-~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~ 261 (467) T protein:vir:31 188 IPAVKTIRG-DSAAQDYNIDFFENDGVPRIAIIVKGAELT----EKGR-EEMRNLIEDNNEDNHRTAFIETEKIVQNEDY 261 (467) T ss_pred HHHHHHHHH-HHHHHHHHHHHHhccCCCceEEEecCcCCC----HHHH-HHHHHHHHhhhcchhhhhhhhhccccccccc Confidence 888777643 4444455444 355432 2 222322220 0000 0000000000 00000 0 Q ss_pred --ccCC-CCCCCcceEe--cc-ccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHH-HHHHHHHHH Q lcl|NC_019418. 333 --VGAG-NMDSGGIVDL--TT-PIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSD-TYQMRNSIV 405 (527) Q Consensus 333 --~~~~-~~~~~~i~~~--~~-~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~-~~~~~~~~~ 405 (527) +..+ +-...+++.. +. ...+.++.+..+...++|+...|++|..+|...++.. ++.+...... ...++.-+. T Consensus 262 ~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~-~s~~e~~~~~f~~~~l~P~~ 340 (467) T protein:vir:31 262 LNLADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEHDILKVHDVPPVIAGVVESGAF-STDAEEQRKEFAEETIQPKQ 340 (467) T ss_pred ccccCCCcccccceeEEeccccChhhHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCc-ccCHHHHHHHHHHHHHHHHH Confidence 0000 0001122222 11 2235677888888888999999999999887544322 1111111111 112222222 Q ss_pred HHHHHHHHHHHHHHHHHhhhhcccC-CcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCHH Q lcl|NC_019418. 406 ALVEQSIKELCVSMCELGKVVGIYR-GTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKT--LGITEE 482 (527) Q Consensus 406 ~~~~~al~~li~~il~~~~~~~~~~-~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~--~~~~de 482 (527) +.|+.+|... ++. ......+.+.+++..-+..|.++.++....++.+|+++.-+++.+. .++.|+ T Consensus 341 ~~ie~~ln~~------------l~~~~~~~~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~pi~d~ 408 (467) T protein:vir:31 341 HDFGELLYEL------------VHKQGLDAPDWTIEFELAKPDTKLQDVEIASQRVQAMQGLLTVNELRDEFGFEPFPEE 408 (467) T ss_pred HHHHHHHHHh------------hcchhhccCCceEEEecchhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCcc Confidence 2233322221 111 1112345577888888889999999999999999999999987664 233332 Q ss_pred HHHH---HHHHHHHhcccccccccCCCCCCCCCCC-------CCCCCCCCccccC Q lcl|NC_019418. 483 EAEK---ELAEINGELPPESDAELALYGKGQQNTV-------GNSKDTVDDEDEA 527 (527) Q Consensus 483 ea~~---el~ri~~E~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~ 527 (527) +... ..........+..... +-...+.+++. +.+.++.+..|++ T Consensus 409 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 462 (467) T protein:vir:31 409 HVYGGETLVAEVTGGSGPGGGIG-DQIEQLVEDRADEIIDSYQADLETEQLIEIG 462 (467) T ss_pred cccCCcccccccccccCCCCccc-CcCCCCCCCcccchHhhhhhccccchhhhhc Confidence 1100 0000000000000000 00000000000 0011111111111 No 183 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=97.76 E-value=2.2e-05 Score=46.16 Aligned_cols=409 Identities=11% Similarity=0.054 Sum_probs=157.5 Q ss_pred HHHHhhcccchhhhccCccccCHHHHHHHHHHH-HHhcCCCcccccc-cccCccccCceeecchHHHHHHHHhhhhhccc Q lcl|NC_019418. 14 GRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNL-AYYQSKFDDIEYT-NTDGDRKRRKMQHLPIARTAAKKIASLVYNEQ 91 (527) Q Consensus 14 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~-~~y~g~~~~l~~~-~~~~~~~~~~~~~lnl~~~i~~~~A~ll~~e~ 91 (527) |. .-..+.+ .+|++.++ ..|. ..|.+.+. .... ..............+.-..+++.+|+-+-+-| T Consensus 1 ~~-~~~~~~~----------~~p~~~~~-~~~~~~~~~~~~~-~g~~~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~lp 67 (518) T protein:vir:78 1 ML-LANGQTL----------SAPAMAEL-SPQMQDSYYYAPA-VGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLP 67 (518) T ss_pred Cc-ccCceee----------ccchhhhh-hhhhhhcccccce-eceecccccchhhHHhhhhHHHHHHHHHHHHhhccCc Confidence 10 0022222 23333222 1122 11211111 0000 00000000000000111234555555444434 Q ss_pred ceEee-C-CH---HHHHHHHHHHhhhh----HHHHHHHHHHHHHhcCCEEEEEEEeCC-e-eEEEEEcCCceEEEEEcCC Q lcl|NC_019418. 92 AEISA-E-DE---TLNDFLSDMLSNDR----FNKNFERYLESALALGGLAMRPYVDGD-K-IRVAFIQAPVFLPLQSNTQ 160 (527) Q Consensus 92 ~~i~~-~-d~---~~~~~l~~~l~~n~----f~~~~~~~~~~a~~~G~~~~~~~~d~~-~-~~i~~v~a~~~~P~~~d~~ 160 (527) ..+-- + +. .....+..++.+.+ ...-.+.++.+.+..|.+++.+..+.. + ..+..++|+.+-+.....+ T Consensus 68 ~~l~~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~Vtv~~~~~~ 147 (518) T protein:vir:78 68 VKCMFTSGDTETEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT 147 (518) T ss_pred eEEEEEcCCccccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEECCCceEEEEcCCC Confidence 33311 1 10 01112223333221 122234455666677999888877653 3 3466677766655322111 Q ss_pred ceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCccccee Q lcl|NC_019418. 161 DVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTP 240 (527) Q Consensus 161 ~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~ 240 (527) +. .+|. +.. ..+.+ +..+. T Consensus 148 ~~-----------------~~y~-~~~---------------~~~~~---------------~~~~~------------- 166 (518) T protein:vir:78 148 GR-----------------YEYY-FQA---------------GAGVG---------------TQLVS------------- 166 (518) T ss_pred CE-----------------EEEE-EEe---------------cCCcc---------------ceeEE------------- Confidence 11 1110 000 00000 00000 Q ss_pred ecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCc-ceeeechhHhcCCCCCCCc-ccc Q lcl|NC_019418. 241 IQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQ-RRVIVPEQMTQLKVQDNQG-NIA 318 (527) Q Consensus 241 ~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~-~~i~v~~~~l~~~~~~~~~-~~~ 318 (527) +.. --+.||+.+.++. ...|+|.+.-+...|.....+-....+-|+.|. +..++ ......... .-. T Consensus 167 ~~~---~eIiHir~~~~dg----~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl-----~~~~~ls~e~~~~ 234 (518) T protein:vir:78 167 FAD---DEVVPIRFFNPDG----LERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVL-----RHEKRLSPEAQQR 234 (518) T ss_pred ecC---CcEEEecCCCCCc----ccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEE-----ecCCCCCHHHHHH Confidence 000 0124555433332 235888887777666555544433334456543 22222 211111000 000 Q ss_pred cccccccccceeeecc-CCC----CCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccccccc-chHHHHHH Q lcl|NC_019418. 319 FKRRFDVEQNVYMQVG-AGN----MDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGV-KTATEIVS 392 (527) Q Consensus 319 ~~~~~d~~~~~~~~~~-~~~----~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~-~TAtei~s 392 (527) +...| +..|.+.+ .+. .++..++.++......++.+.......+|+...|++|..+|+...+. .++.+. T Consensus 235 ~k~~~---~~~~~G~~nag~~~vL~~G~~~~~l~~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st~sn~e~~-- 309 (518) T protein:vir:78 235 LREQF---DRAHAGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQ-- 309 (518) T ss_pred HHHHH---HHHhcCcccCCceeEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHH-- Confidence 00111 11122110 000 11223555566666778888888888899999999999998765432 222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHH Q lcl|NC_019418. 393 ENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRG 472 (527) Q Consensus 393 ~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~ 472 (527) ....++.+|.-++..|-...+. .+... ......+.++.+.-+..|.++.++...+++.+|+|++-++ T Consensus 310 -----------~~~f~~~tL~P~~~~ie~eln~-~L~~~-~~~~~~~~fd~~~Llr~D~~~r~~~~~~~~~~G~lT~NE~ 376 (518) T protein:vir:78 310 -----------MRAFYRDTMAIPIARIQSAMDK-YVGQY-WVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEG 376 (518) T ss_pred -----------HHHHHHHHHHHHHHHHHHHHHH-hhccc-ccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHH Confidence 1111222333333332221110 11111 1122345566566678899999999999999999999997 Q ss_pred HHhcCCCC---HHHHHHHH-----HHHHH--------hcccccccccCCC-CCCCCC----CCCCCCCCCCccccC Q lcl|NC_019418. 473 IAKTLGIT---EEEAEKEL-----AEING--------ELPPESDAELALY-GKGQQN----TVGNSKDTVDDEDEA 527 (527) Q Consensus 473 i~~~~~~~---deea~~el-----~ri~~--------E~~~~~~~~~~~~-~~~~~~----~~~~~~~~~~~~~~~ 527 (527) +... |+. +....+.+ ..+.. +.++..+.....+ .+.++. ...-..++.++.+|+ T Consensus 377 R~~~-gl~pie~~~gD~~~v~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (518) T protein:vir:78 377 REIM-GLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPASTPVASLDQSPPASVPGLSPTNSDRSTDS 451 (518) T ss_pred HHHh-CCCCCCCCCCceeeecccceecccccccccCCCCCCCCCCCCcccccccccCccccCCCCCcccccccccc Confidence 6553 543 22222111 11100 0000000000000 000010 001111111111111 No 184 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=97.75 E-value=2.2e-05 Score=46.12 Aligned_cols=407 Identities=13% Similarity=0.082 Sum_probs=157.0 Q ss_pred HHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCc---ccc-cccccCccccCceeecchHHHHHHHHhhhhhc Q lcl|NC_019418. 14 GRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFD---DIE-YTNTDGDRKRRKMQHLPIARTAAKKIASLVYN 89 (527) Q Consensus 14 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~---~l~-~~~~~~~~~~~~~~~lnl~~~i~~~~A~ll~~ 89 (527) |. .-..+.+...+.. +-...+.. .|.+.+. .+. .....+... ...+-=..+|+.+|+-+-+ T Consensus 1 ~~-~~~~~~~~~p~~~-------e~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~a----~~~~~V~acV~~IA~~iA~ 65 (518) T protein:vir:10 1 ML-LANGQTLSAPAMA-------ELSPQMQD---SYYYAPAVGMQLERQFSLYGGIY----KNQPWVRTVIAKRAQALAR 65 (518) T ss_pred Cc-ccCceeecCchhh-------hhhhhhhc---ccccccccceecccccchhhHHH----hhhHHHHHHHHHHHHhhcc Confidence 10 1123333222111 11111111 1111110 000 000000000 0001112344444444433 Q ss_pred ccceEe-e--CC--HHHHHHHHHHHhhhh----HHHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEEEcCCceEEEEEc Q lcl|NC_019418. 90 EQAEIS-A--ED--ETLNDFLSDMLSNDR----FNKNFERYLESALALGGLAMRPYVDG-DK-IRVAFIQAPVFLPLQSN 158 (527) Q Consensus 90 e~~~i~-~--~d--~~~~~~l~~~l~~n~----f~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~v~a~~~~P~~~d 158 (527) -|..+- . ++ ......+..++.+.+ .....+.++...+..|.+++.+..+. |+ ..+..++|+.+-+.... T Consensus 66 lpl~l~~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~v~v~~~~ 145 (518) T protein:vir:10 66 LPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNS 145 (518) T ss_pred CceEEEEEcCCCceeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCCceEEEEcC Confidence 232221 0 11 011112222332211 11223445556677899998887765 33 35666777666543221 Q ss_pred CCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccc Q lcl|NC_019418. 159 TQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPV 238 (527) Q Consensus 159 ~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~ 238 (527) .++. .+|+ +. . ....|.. . T Consensus 146 ~~~~-----------------~~y~---~~---------------------------~--~~~~~~~------------~ 164 (518) T protein:vir:10 146 RTGR-----------------YEYY---FQ---------------------------A--GAGVGTQ------------L 164 (518) T ss_pred CCCE-----------------EEEE---EE---------------------------e--cCCccce------------E Confidence 1111 1111 00 0 0000000 0 Q ss_pred eeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCc-ceeeechhHhcCCCCCC-Ccc Q lcl|NC_019418. 239 TPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQ-RRVIVPEQMTQLKVQDN-QGN 316 (527) Q Consensus 239 ~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~-~~i~v~~~~l~~~~~~~-~~~ 316 (527) +++.. --+.||+.+.++. ...|+|.+.-+...|.....+-..-.+-|+.|. ++.++ ....... ... T Consensus 165 ~~~~~---~eViHir~~s~dg----~~~G~spi~~a~~~i~~~~a~~~~~~~~f~ng~~p~gil-----~~~~~ls~e~~ 232 (518) T protein:vir:10 165 VSFAD---DEVVPIRFFNPDG----LERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVL-----RHEKRLSEAAQ 232 (518) T ss_pred EEecC---CcEEEecCCCCCc----ccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEE-----ecCCCCCHHHH Confidence 00000 0134566543332 235888887776666555544444444456543 33333 1111100 000 Q ss_pred cccccccccccceeeec-cCC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccccccc-chHHHH Q lcl|NC_019418. 317 IAFKRRFDVEQNVYMQV-GAG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGV-KTATEI 390 (527) Q Consensus 317 ~~~~~~~d~~~~~~~~~-~~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~-~TAtei 390 (527) -.+...|. ..|.+. +.+ -.++..++.++......++.+..+....+|+...|++|..+|+...+. .++.+. T Consensus 233 ~~~k~~~~---~~~~G~~nag~v~vL~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn~eq~ 309 (518) T protein:vir:10 233 QRLREQFD---RAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQ 309 (518) T ss_pred HHHHHHHH---HHhcCccccCcceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHH Confidence 00111111 112211 000 011223555666666778888888888999999999999998765432 222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHH Q lcl|NC_019418. 391 VSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQK 470 (527) Q Consensus 391 ~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~ 470 (527) . ...+..+|.-++..|-...+. .+... ......+.++.+.-+..|..+.++...+++.+|+|++- T Consensus 310 ---~----------~~f~~~tL~P~l~~ie~~ln~-~L~~~-~~~~~~~~fd~~~llr~D~~~r~~~~~~~~~~G~lT~N 374 (518) T protein:vir:10 310 ---M----------RAFYRDTMAIPIARIQSAMDK-YVGQY-WVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPN 374 (518) T ss_pred ---H----------HHHHHHHHHHHHHHHHHHHHH-hhccc-ccCCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHH Confidence 1 111223333333332211110 01111 11123455555666778999999999999999999999 Q ss_pred HHHHhcCCCC---HHHHHHHH-----HHHHH--------hcccccccccCCC-CCCCCC----CCCCCCCCCCccccC Q lcl|NC_019418. 471 RGIAKTLGIT---EEEAEKEL-----AEING--------ELPPESDAELALY-GKGQQN----TVGNSKDTVDDEDEA 527 (527) Q Consensus 471 ~~i~~~~~~~---deea~~el-----~ri~~--------E~~~~~~~~~~~~-~~~~~~----~~~~~~~~~~~~~~~ 527 (527) +++... |+. ++...+.+ ..+.. +.++..+.....+ .+.++. ...-+.++.++.+|+ T Consensus 375 E~R~~~-Gl~pie~~~gD~~~~~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (518) T protein:vir:10 375 EGREIM-GLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDS 451 (518) T ss_pred HHHHHh-CCCCCCCCCCCeeeecccceecccccccccCCCCCCCCCCCCccccccccccccccCCCCCcccccccccc Confidence 976543 543 22222211 11110 0000000000000 000010 011111122222222 No 185 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=97.73 E-value=2.5e-05 Score=45.85 Aligned_cols=422 Identities=12% Similarity=0.136 Sum_probs=159.9 Q ss_pred hHH---HHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHH--HHHHHHhcCCCcccccccccCccccCceeecchHH Q lcl|NC_019418. 3 LIQ---KVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRI--QHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIAR 77 (527) Q Consensus 3 ~~~---~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i--~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~ 77 (527) ||. .|+.+.+ ...|.....-...+ ..+..||.....+ ..-.+-..+-+... T Consensus 1 ~~~~~~~i~s~~~----------------~~~i~~~~~~s~~~~~~~~~~~~~pp~~~--------~~la~l~~~n~~v~ 56 (542) T protein:vir:41 1 MFNYHLSIRSLEK----------------YKAIKREEVESQALGETRFEEYVEPKVNP--------LVLLSLLQVNPYHA 56 (542) T ss_pred Ccccccccccccc----------------chhhhhccccccccccccCCccccCCCCH--------HHHHHHHhhcHHHH Confidence 222 2222211 11110000000000 0011222211000 00001111224446 Q ss_pred HHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhh--hhHHHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEEEcCCceE Q lcl|NC_019418. 78 TAAKKIASLVYNEQAEISAEDETLNDFLSDMLSN--DRFNKNFERYLESALALGGLAMRPYVDG-DK-IRVAFIQAPVFL 153 (527) Q Consensus 78 ~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~--n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~v~a~~~~ 153 (527) .+++.+|+.+.+-+..+.-++. ..+..++-+ -.+...+...+.+.+..|.+++.+..+. |+ ..+.+++|.++. T Consensus 57 scI~~ia~~IA~l~~~~~~~~~---~~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~~L~~l~~~~v~ 133 (542) T protein:vir:41 57 SACSIKANDIIRTGYILEGDDE---GVVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPIRFEYIPSHTIR 133 (542) T ss_pred HHHHHHHHHHhhCceeeecccc---hhhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcceE Confidence 7777788777766655443322 222222211 1233445667778888899999887775 33 457778887766 Q ss_pred EEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccC Q lcl|NC_019418. 154 PLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYP 233 (527) Q Consensus 154 P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~ 233 (527) +. .+.++... .. .....+||. .+ ...+.+.. ..|.. T Consensus 134 v~-~d~~~~~~------~~--~~~~~~~~~--~y--------------------~~~~~~~~-----~~g~~-------- 169 (542) T protein:vir:41 134 VH-KDGSRYRQ------TW--DGVNITHFK--DY--------------------RYEGEINP-----ETGED-------- 169 (542) T ss_pred EE-EcCCeeEe------ee--cCCcceeEE--ee--------------------cccccccc-----ccccc-------- Confidence 53 22222111 11 011111221 11 00000000 00000 Q ss_pred CcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHH-HHcCc-ce--eeechhHhcCC Q lcl|NC_019418. 234 DLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWE-IKMGQ-RR--VIVPEQMTQLK 309 (527) Q Consensus 234 ~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e-~~~~~-~~--i~v~~~~l~~~ 309 (527) ... ++.--..||+.+.+ ...++|+|.+..+...+... ..-..+.+. |+.|. +. |.++..+.+.. T Consensus 170 ----~~~---~~~~eIiHir~~~~----~~~~~Glspi~~~~~~i~~~-~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~ 237 (542) T protein:vir:41 170 ----QDS---VGANELVFIHIPSP----VCSYYGVPRYVSAAPAILAM-QKIDEYNYAFFDNYTIPSYVITVTGEFEDEL 237 (542) T ss_pred ----ccc---cCcccEEEecCCCC----CCCcccccHHHHHHHHHHHH-HHHHHHHHHHHhccCCccEEEEeCCcccccc Confidence 000 00011346664322 23457999998887766443 334444433 45433 22 23333222100 Q ss_pred CCCCCcccccc------ccccc---------ccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_019418. 310 VQDNQGNIAFK------RRFDV---------EQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSS 374 (527) Q Consensus 310 ~~~~~~~~~~~------~~~d~---------~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~ 374 (527) . ........ ..|.. ..-+......+..++..++-++......++.+..+...++|+...|++| T Consensus 238 ~--~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVPp 315 (542) T protein:vir:41 238 E--EDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSIPGGDTVKVTFTPLNTSQKELSFREYAAEKKYDIAAAHMIDP 315 (542) T ss_pred c--cccccCHHHHHHHHHHHHHHHhhhhcccCceeEeeccCCcccceeEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCH Confidence 0 00000000 00000 0011111111111222344455555677888888888899999999999 Q ss_pred cccccccccc---chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCH Q lcl|NC_019418. 375 GMFTFDGQGV---KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDR 451 (527) Q Consensus 375 ~~~~~~~~g~---~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~ 451 (527) ..+|...++. .++.+. ..+-...++.-+.+.++.+|... +.. .....+.+.|+..-.... T Consensus 316 ~~lG~~~~~t~n~sn~Eq~--~~~f~~~tL~P~~~~ie~~ln~~------------L~~---~~~~~~~~~f~~~~ll~~ 378 (542) T protein:vir:41 316 YRLGIADTGPLGGNFAEVT--RRTYYESVVRPQQNIISSILTDF------------FQV---KFNPKTRFKFNDETLLES 378 (542) T ss_pred HHhCcCCCcccccccHHHH--HHHHHHHHHHHHHHHHHHHHHhh------------ccc---ccCCceEEEecchhhcch Confidence 9998764432 233322 11111222223333333333321 110 011234455654322222 Q ss_pred HHHHHHHHHHHhcCCCCHHHHHHhcCCCCH--HHH-------HHHHHHHHHhcccccccccCC-CCCCCC--C-----C- Q lcl|NC_019418. 452 HAELDYWMKMVAAGFATQKRGIAKTLGITE--EEA-------EKELAEINGELPPESDAELAL-YGKGQQ--N-----T- 513 (527) Q Consensus 452 ~~~~~~~~~~~~aGi~s~~~~i~~~~~~~d--eea-------~~el~ri~~E~~~~~~~~~~~-~~~~~~--~-----~- 513 (527) + ..+....++++|+|++-+++.++.|+.. +.- .+.+...+.+...+....... ..++++ + . T Consensus 379 d-~~~~~~~~v~~GilT~NE~Re~L~g~~pgdd~~l~p~~~~~~~~~~~~~n~~~~~~~~~~k~~~k~~~~~~~~~~~~~ 457 (542) T protein:vir:41 379 D-SVRNCALLVQSGVLTPAEARERLFGLDGGPDIFMVPSKGAAKSVKRQERNYEKNQIREIRKIYAKYRPRFNEIISSKL 457 (542) T ss_pred H-HHHHHHHHHhCCCCCHHHHHHhhCCCCCCCccccccccccccccccCCcCCCCCchhhhhhcccccCccccccccccc Confidence 2 2344556889999999998766655431 110 000000000000000000000 000000 0 0 Q ss_pred CCCCCCCCCccccC Q lcl|NC_019418. 514 VGNSKDTVDDEDEA 527 (527) Q Consensus 514 ~~~~~~~~~~~~~~ 527 (527) .....++.-++.|+ T Consensus 458 ~~~~~~~~~~~~~~ 471 (542) T protein:vir:41 458 SAEEKKKKIDESLA 471 (542) T ss_pred cchhhcccccchhh Confidence 00011111111111 No 186 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=97.70 E-value=2.8e-05 Score=45.57 Aligned_cols=389 Identities=11% Similarity=0.061 Sum_probs=165.8 Q ss_pred HhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHHHHhhhhhcccceE-- Q lcl|NC_019418. 17 NMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIASLVYNEQAEI-- 94 (527) Q Consensus 17 ~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~ll~~e~~~i-- 94 (527) .+|.+...... ..+.+++. .+ ..|..+.+... + ... +.-+....--.+++.+|+-+-+=|..+ T Consensus 1 m~f~~~~~~~~--~~~~~~~~---~~---~~~~g~~~~~~-~--v~~----~~al~~~~v~~~i~~ia~~ia~lp~~~~~ 65 (409) T protein:vir:10 1 MLFRKGFKNQS--QEISIDDK---KI---LEWLGINPSET-Y--VNG----KSCLKQATVFGCIRILSDNISKLPIKIYQ 65 (409) T ss_pred CcccccccCcC--CCCCCChH---HH---HHHhcCCcCcc-e--ech----hhhhccHHHHHHHHHHHHhhhhCceEEEE Confidence 11233322211 12333321 11 12222211111 0 000 111111222234444444443333222 Q ss_pred ------eeCCHHHHHHHHH-HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCC-e-eEEEEEcCCceEEEEEcCCceEEE Q lcl|NC_019418. 95 ------SAEDETLNDFLSD-MLSNDRFNKNFERYLESALALGGLAMRPYVDGD-K-IRVAFIQAPVFLPLQSNTQDVSSA 165 (527) Q Consensus 95 ------~~~d~~~~~~l~~-~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~-~-~~i~~v~a~~~~P~~~d~~~~~~~ 165 (527) .+.+..+...|.. --..-....-++..+.+.+..|.+++.+..+.. + ..+..++|+++-++. +.++.... T Consensus 66 ~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~V~v~~-~~~~~~~~ 144 (409) T protein:vir:10 66 KKDGIKRVPDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYPLKSDGMKIFV-DDTGLLNS 144 (409) T ss_pred ecCCeeeccCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCCceEEEE-cCCccccc Confidence 1112223332321 000111223345566777888999998888653 3 356667777765542 22221110 Q ss_pred EEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCC Q lcl|NC_019418. 166 AILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLS 245 (527) Q Consensus 166 a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~ 245 (527) ...++|. |. ...|....+ + T Consensus 145 -----------~~~~~y~------------------------------~~----~~~g~~~~~-------------~--- 163 (409) T protein:vir:10 145 -----------ENNVWYL------------------------------YT----DDLGQRHKF-------------M--- 163 (409) T ss_pred -----------cceEEEE------------------------------EE----eCCceeEEe-------------c--- Confidence 0111111 00 001111100 0 Q ss_pred cccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCc-ceeeechhHhcCCCCCCC-ccccccccc Q lcl|NC_019418. 246 RPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQ-RRVIVPEQMTQLKVQDNQ-GNIAFKRRF 323 (527) Q Consensus 246 ~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~-~~i~v~~~~l~~~~~~~~-~~~~~~~~~ 323 (527) .--+.|++.+.++ ...|+|.+.-+...+......-....+-|+.|. ++-++ ........ ..-.....| T Consensus 164 ~~evih~r~~~~d-----~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil-----~~~~~l~~e~~~~~~~~~ 233 (409) T protein:vir:10 164 SDEILHFKGLTAD-----GLAGLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGLV-----QYAGDLNPEAEEVFKENF 233 (409) T ss_pred cccEEEecCcCCC-----CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEE-----EcCCCCCHHHHHHHHHHH Confidence 0013456543222 356999988888777665444433334456533 23222 21111110 000000111 Q ss_pred ccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccc-cchHHHHHHHHHHH Q lcl|NC_019418. 324 DVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQG-VKTATEIVSENSDT 397 (527) Q Consensus 324 d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g-~~TAtei~s~~~~~ 397 (527) +..+.+.. .+ -.+...++.++......++.+..+...++|+...|++|..+|...++ -.++.+. T Consensus 234 ---~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~e~~------- 303 (409) T protein:vir:10 234 ---ERMSSGLKNAHRIAMLPIGYKFEPISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRATHSNITEQ------- 303 (409) T ss_pred ---HHHhccccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccccHHHH------- Confidence 11122211 00 01222456666666778888888888999999999999999866543 2333322 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc-CCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc Q lcl|NC_019418. 398 YQMRNSIVALVEQSIKELCVSMCELGKVVGIY-RGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKT 476 (527) Q Consensus 398 ~~~~~~~~~~~~~al~~li~~il~~~~~~~~~-~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~ 476 (527) .+..++.+|.-++..|-...+. .++ .........+.+++++-+-.|..+.++...+++.+|+|++-+++..+ T Consensus 304 ------~~~f~~~~l~P~~~~ie~~ln~-kL~~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~l 376 (409) T protein:vir:10 304 ------NREFYIDTLQSILNMYELEINY-KLFLISEIKNGFYSKFNVDTILRADIKTRYESYKEAIQNGFKTPNEIRELE 376 (409) T ss_pred ------HHHHHHHHHHHHHHHHHHHHHH-hhcCchhccCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh Confidence 1222344444444444322211 111 11122334567777777778999999999999999999999976543 Q ss_pred CCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_019418. 477 LGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDE 524 (527) Q Consensus 477 ~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (527) |+..=+ - .+.-....+.-.-+. .++....+|+. T Consensus 377 -gl~p~~-----------g--gD~~~~~~n~~~~~~-~~~~~~kgGe~ 409 (409) T protein:vir:10 377 -EDEPLE-----------G--GDVLLINGNMIPVKM-AGEQYSKGGEK 409 (409) T ss_pred -CCCCCC-----------C--cCeeeeccCccchhh-ccccccccCCC Confidence 553210 0 000000000000000 01111222222 No 187 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=97.67 E-value=3e-05 Score=45.39 Aligned_cols=392 Identities=13% Similarity=0.085 Sum_probs=154.6 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) |++|.|+|+.+-. +...++..... .|..| .+. .+...... ..+...--...+ T Consensus 4 ~~~~~~~k~~~~~---~~~~~~~~~~~----------------~~~~~-~~~----~~~~v~~~----~a~~~~~V~~ci 55 (409) T protein:vir:96 4 ENIVTRIKKKLID---NWIDQSASKLY----------------DFSPW-KNK----SFWGVINN----TLETNETIFSAI 55 (409) T ss_pred ccchhhhhhHHhh---hhhcccccccc----------------ccccc-cCc----cccccchh----hHhhhHHHHHHH Confidence 8899998887531 11222211110 01111 111 00111000 011111112223 Q ss_pred HHHhhhhhcccceE----eeCCHHHHHHHHH-HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCe--eEEEEEcCCceE Q lcl|NC_019418. 81 KKIASLVYNEQAEI----SAEDETLNDFLSD-MLSNDRFNKNFERYLESALALGGLAMRPYVDGDK--IRVAFIQAPVFL 153 (527) Q Consensus 81 ~~~A~ll~~e~~~i----~~~d~~~~~~l~~-~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~--~~i~~v~a~~~~ 153 (527) +.+|+-+-.-|..+ ...+..+.+.|.. --..-.-..-...++.+.+..|.+++.+..+..+ ..+..++|+.+- T Consensus 56 ~~ia~~ia~lp~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~ 135 (409) T protein:vir:96 56 TKLSNSMASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVE 135 (409) T ss_pred HHHHHhhhhCceEEeecccccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeE Confidence 33333333323222 1222333333321 0000112222345666778889999888776433 345556666554 Q ss_pred EEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccC Q lcl|NC_019418. 154 PLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYP 233 (527) Q Consensus 154 P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~ 233 (527) ++..+. ....+|.. .. .. |..+.+ + T Consensus 136 v~~~~~-----------------~~~~~y~~-~~-----------------~~----------------g~~~~~---~- 160 (409) T protein:vir:96 136 MLIENQ-----------------SRELYYSI-HA-----------------AT----------------GNKLIV---H- 160 (409) T ss_pred EEEeCC-----------------CcEEEEEE-Ec-----------------CC----------------ceEEEE---c- Confidence 432211 11122211 00 00 000000 0 Q ss_pred CcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCC Q lcl|NC_019418. 234 DLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDN 313 (527) Q Consensus 234 ~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~ 313 (527) +. -..||+.+.+ .+..+|+|.+.-+...++-..............+..-|+.+.. ... T Consensus 161 --~~----------evih~r~~~~----~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~------~l~ 218 (409) T protein:vir:96 161 --NM----------DMLHFKHIVA----SNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQKPDSFMLKYGS------NVS 218 (409) T ss_pred --cc----------cEEEeCCCCC----CCccccccHHHHHHHHHHHHHHHHHHHHHhcCCCceeEEecCC------CCC Confidence 00 1235543211 1233588888777766654433322222111111111221111 111 Q ss_pred Cccc-ccccccccccceeee---ccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccc-cchHH Q lcl|NC_019418. 314 QGNI-AFKRRFDVEQNVYMQ---VGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQG-VKTAT 388 (527) Q Consensus 314 ~~~~-~~~~~~d~~~~~~~~---~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g-~~TAt 388 (527) .... .....|. ..|.. +-.- .++..++.++......++.+..+....+|+...|++|.-+|....+ -.++. T Consensus 219 ~e~~~~~~~~~~---~~~~n~g~~~vl-~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e 294 (409) T protein:vir:96 219 TEKRQQVLEDFK---QYYEENGGILFQ-EPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNE 294 (409) T ss_pred HHHHHHHHHHHH---HHhhcCCCeeec-CCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHH Confidence 0000 0000010 00110 0000 1223456666666777888888888889999999999999865432 22333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCC-cccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCC Q lcl|NC_019418. 389 EIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRG-TIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFA 467 (527) Q Consensus 389 ei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~-~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~ 467 (527) +.. ...++.+|.-++..|-...+. .+... .......+.++.++-+-.|..+.++...+++.+|+| T Consensus 295 ~~~-------------~~f~~~~l~P~~~~ie~~l~~-~Ll~~~~~~~g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~ 360 (409) T protein:vir:96 295 ELN-------------RFYLQHTLLPIVKQYEEEFNR-KLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYY 360 (409) T ss_pred HHH-------------HHHHHHHHHHHHHHHHHHHHh-hcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCC Confidence 221 122233333333333221110 11111 111223344444555567888999999999999999 Q ss_pred CHHHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 468 TQKRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 468 s~~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) ++-+++.+. |+..-+ ..+-+. ..... .+....+.......++ +.++|| T Consensus 361 T~NE~R~~~-g~~pi~ggD~~~~--~~n~~-------~~~~~~~~~~~~~gG~--~n~~e~ 409 (409) T protein:vir:96 361 TINDIREWE-DLPPVEGGDKPLI--SGDLY-------PIDTPLELRKSLKGGD--KNVNES 409 (409) T ss_pred CHHHHHHHh-CCCCCCCcceeee--ccccc-------ccccchhhcccccCCC--CCcCCC Confidence 999976554 554211 100000 00000 0000000000011111 111222 No 188 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=97.66 E-value=3.1e-05 Score=45.27 Aligned_cols=392 Identities=12% Similarity=0.066 Sum_probs=154.4 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) -++++++|+.+.. +.+.++....++ |..|.... +...... ..+...--...+ T Consensus 4 ~~~~~~~~~~~~~---~~~~~~~~~~~~----------------~~~~~~~~-----~~~v~~~----~~~~~~~V~~ci 55 (409) T protein:vir:93 4 ENIVTRIKKKLID---NWIDQSTSKLYD----------------FSPWKNRS-----FWGVINN----TLETNETIFSAI 55 (409) T ss_pred cchhhhhhhhhhh---hhhccccccccc----------------cccccCcc-----ccccchh----hhhccHHHHHHH Confidence 4566666664432 112222211111 11111100 0001000 111111112333 Q ss_pred HHHhhhhhcccceEe----eCCHHHHHHHHH-HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCC-e-eEEEEEcCCceE Q lcl|NC_019418. 81 KKIASLVYNEQAEIS----AEDETLNDFLSD-MLSNDRFNKNFERYLESALALGGLAMRPYVDGD-K-IRVAFIQAPVFL 153 (527) Q Consensus 81 ~~~A~ll~~e~~~i~----~~d~~~~~~l~~-~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~-~-~~i~~v~a~~~~ 153 (527) +.+|+-+-.-|..+- ..+..+...|.. --..-.-..-...++...+..|.+++.+..+.. + ..+..++|+.+- T Consensus 56 ~~Ia~~ia~lp~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~ 135 (409) T protein:vir:93 56 TKLSNSMASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVE 135 (409) T ss_pred HHHHHhhhhCceeEeeccccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeE Confidence 444444433332221 122333333321 000111222235566677778999888877643 3 356667776655 Q ss_pred EEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccC Q lcl|NC_019418. 154 PLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYP 233 (527) Q Consensus 154 P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~ 233 (527) ++..+.+ ..++|.. .. .. |..+.+ + T Consensus 136 ~~~~~~~-----------------~~~~y~~-~~-----------------~~----------------g~~~~~---~- 160 (409) T protein:vir:93 136 MLIENQS-----------------RELYYSI-HA-----------------AT----------------GNKLIV---H- 160 (409) T ss_pred EEEeCCC-----------------cEEEEEE-Ec-----------------CC----------------ceEEEE---c- Confidence 4322211 1122210 00 00 111100 0 Q ss_pred CcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCC Q lcl|NC_019418. 234 DLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDN 313 (527) Q Consensus 234 ~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~ 313 (527) +. -..||+.+... +..+|+|.+.-+...++-...+-.........+..-|... +.... T Consensus 161 --~~----------eVih~r~~~~~----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~------~~~l~ 218 (409) T protein:vir:93 161 --NM----------DMLHFKHIVAS----NMVQGISPIDVLKNTTDFDNAVRTFNLTEMQKPDSFMLKY------GSNVG 218 (409) T ss_pred --cc----------cEEEeCCCCCC----CccccccHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEec------CCCCC Confidence 00 12455532111 2335888887776666544332211111112111112211 11111 Q ss_pred Ccc-cccccccccccceeee---ccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccccccc-chHH Q lcl|NC_019418. 314 QGN-IAFKRRFDVEQNVYMQ---VGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGV-KTAT 388 (527) Q Consensus 314 ~~~-~~~~~~~d~~~~~~~~---~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~-~TAt 388 (527) ... -.....|. ..+.. +-.- .++..++.++......++.+..+....+|+...|++|.-+|...++. .++. T Consensus 219 ~e~~~~~~~~~~---~~~~~~g~~~vl-~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e 294 (409) T protein:vir:93 219 KEKRQQVLEDFK---QYYEENGGILFQ-EPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNE 294 (409) T ss_pred HHHHHHHHHHHH---HHhhcCCCeeec-CCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHH Confidence 000 00000110 00110 0000 12223555555566778888888888899999999999998654332 2233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccC-CcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCC Q lcl|NC_019418. 389 EIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYR-GTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFA 467 (527) Q Consensus 389 ei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~-~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~ 467 (527) +.. ...++.+|..++..|....+. .+.. ........+.++++.-+..|..+.++...+++.+|+| T Consensus 295 ~~~-------------~~f~~~~l~P~~~~ie~~l~~-~Ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~ 360 (409) T protein:vir:93 295 ELN-------------RFYLQHTLLPIVKQYEEEFNR-KLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYY 360 (409) T ss_pred HHH-------------HHHHHHHHHHHHHHHHHHHHh-hcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCc Confidence 221 112233344444433221111 1111 1111223345555555567888999999999999999 Q ss_pred CHHHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 468 TQKRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 468 s~~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) ++-+++.++ |++.-+ ..+-+. +.+-...+...+.+.+ .....++++|+ T Consensus 361 T~NE~R~~~-g~~p~~ggD~~~~-------~~n~~~~~~~~~~~~~----~~gG~~n~~e~ 409 (409) T protein:vir:93 361 TINDIREWE-DLPPVEGGDKPLI-------SGDLYPIDTPLELRKS----LKGGDKNVNES 409 (409) T ss_pred CHHHHHHHh-CCCCCCCcCeeee-------cccccccccchhhccc----ccCCCCCcCCC Confidence 999977654 654311 000000 0000000000000000 01111222223 No 189 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=97.65 E-value=3.2e-05 Score=45.21 Aligned_cols=346 Identities=11% Similarity=0.123 Sum_probs=143.0 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHH--H Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIAR--T 78 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~--~ 78 (527) |+||.+.++ + +. . .+... |..+..+- .+. .+..- .....+..+. . T Consensus 1 M~~~~~f~~---r--------~~--~-------~~~~~------~~~~~~~~-~~~-----~~~~v-~~~~al~~~av~~ 47 (359) T protein:vir:10 1 MSILNPFER---R--------SS--I-------TPNNY------YPFMVQNG-SIV-----PNSLV-DATEALKNSDLYA 47 (359) T ss_pred Ccccchhhc---c--------cc--C-------CCCcc------hhhhhccc-ccc-----CCccc-CHHHhhcchHHHH Confidence 999875332 1 00 0 01000 11111111 110 00000 0001122222 3 Q ss_pred HHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCC-e-eEEEEEcCCceEEEE Q lcl|NC_019418. 79 AAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGD-K-IRVAFIQAPVFLPLQ 156 (527) Q Consensus 79 i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~-~-~~i~~v~a~~~~P~~ 156 (527) .++.+|+-+-+-|. +++......+.+--..-.-..-....+...+..|.+++.+..+.+ . ..+..++|+.+-+.. T Consensus 48 cv~~ia~~ia~~p~---~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~~l~~l~~~~v~i~~ 124 (359) T protein:vir:10 48 VTSLISSDIAGTRF---IGNQVFTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDNSLMKELRLIPSNAITIDL 124 (359) T ss_pred HHHHHHHhhhcCcc---ccchHHHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEECCCCeEEEEEEeCCceEEEEE Confidence 45555555544442 233333333222100001111123344555667888887776643 3 345566666554421 Q ss_pred EcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcc Q lcl|NC_019418. 157 SNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQ 236 (527) Q Consensus 157 ~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~ 236 (527) ++ +..+|..-. . ..|. . .-| + + T Consensus 125 -~~------------------~~~~y~~~~-~----------------~~~~-~-~~~------------~--------~ 146 (359) T protein:vir:10 125 -TD------------------DTLTYEVNQ-F----------------DDYP-S-AKY------------N--------A 146 (359) T ss_pred -cC------------------CeEEEEEEe-c----------------CCce-E-EEE------------c--------c Confidence 11 112221000 0 0000 0 000 0 0 Q ss_pred cceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCC-C-CCC Q lcl|NC_019418. 237 PVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKV-Q-DNQ 314 (527) Q Consensus 237 ~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~-~-~~~ 314 (527) . -..||+.+..+....+.-.|.|.+.-+...+.....+-.-..+-|..|.+ |..++..+. . .+. T Consensus 147 ~----------evih~~~~~~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~----~~gil~~~~~~l~~e 212 (359) T protein:vir:10 147 S----------EMIHVKIMAYGVDTLHNLVGHSPLESLTSEIGQQKEANRLSLSTLKGALN----PTSVVKVPQGTLSSE 212 (359) T ss_pred c----------ceEEeccCCCCCCccCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCC----cceEEEeCCCCCCHH Confidence 0 02355543333222233469998888777776655544444445565442 222222111 0 000 Q ss_pred cccccccccccccceeeeccCCC----CCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHH Q lcl|NC_019418. 315 GNIAFKRRFDVEQNVYMQVGAGN----MDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEI 390 (527) Q Consensus 315 ~~~~~~~~~d~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei 390 (527) ........|. ..+.+-+.+. +++..++.++..-.+.++.+..+....+|+...|++|..+|..++...|...+ T Consensus 213 ~~~~~~~~~~---~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~ 289 (359) T protein:vir:10 213 AKDSIRKEFE---KANGGNNSGRVMVLDQSADFSTVSINADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQSSLDQI 289 (359) T ss_pred HHHHHHHHHH---HHhCccccCCceecCCCcceeeecCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCcccccHHHH Confidence 0000111111 1121111110 12223555555555667888888888899999999999998655444455555 Q ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCH Q lcl|NC_019418. 391 VSENSDT-YQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQ 469 (527) Q Consensus 391 ~s~~~~~-~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~ 469 (527) ...+... ..+..-+...++..|.. .+ .++.+.-+..|.+.......+++.+|+|++ T Consensus 290 e~~~~~~l~~~l~p~~~~l~~~l~~---~~--------------------~~~~~~~~~~d~~~~~~~~~~~~~~G~~t~ 346 (359) T protein:vir:10 290 KDLYVNALNRFIEPLISELRIKCDS---SI--------------------GVDMSPITDYSNSVFKADILNWVKEGIIEP 346 (359) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhh---hh--------------------cccchhhhhcCHHHHHHHHHHHHhCCCcCH Confidence 3333222 12223322222222211 00 011111122233444556677888899888 Q ss_pred HHHHHhc--CCCC Q lcl|NC_019418. 470 KRGIAKT--LGIT 480 (527) Q Consensus 470 ~~~i~~~--~~~~ 480 (527) -++++.+ .|+= T Consensus 347 NE~R~~l~~~pv~ 359 (359) T protein:vir:10 347 TEAKTLLESKGII 359 (359) T ss_pred HHHHHHhCCCCCC Confidence 8876553 2322 No 190 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=97.64 E-value=3.4e-05 Score=45.07 Aligned_cols=463 Identities=10% Similarity=0.083 Sum_probs=182.3 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) |.= ++ ..+-. ..+++... .+..++......|+.++.---|-+-.....+......++-=+.+...+ T Consensus 1 m~~-~~-~~~~~--------~~~~~r~~----~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~ 66 (536) T protein:vir:10 1 MAE-KR-TGLAE--------DGAKSVYE----RLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGL 66 (536) T ss_pred Ccc-hh-hchhH--------HHHHHHHH----HHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHH Confidence 111 00 00000 01111100 112233344666776665433322111111111122223335677777 Q ss_pred HHHhhhhhcc--c--ceE--eeCCH-------------HH-------HHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEE Q lcl|NC_019418. 81 KKIASLVYNE--Q--AEI--SAEDE-------------TL-------NDFLSDMLSNDRFNKNFERYLESALALGGLAMR 134 (527) Q Consensus 81 ~~~A~ll~~e--~--~~i--~~~d~-------------~~-------~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~ 134 (527) +.+|+-|.+- | +=| .+.+. .. .+.+...+..++|...+.++..+....|.+.+. T Consensus 67 ~~Laa~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly 146 (536) T protein:vir:10 67 NNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLY 146 (536) T ss_pred HHHHHHHHhhhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEE Confidence 7777754442 2 212 22221 12 234556788899999999999999999988875 Q ss_pred EEEeCC-ee-EEEEEcCCceEEEEEcCCceEEEEEEEEEEe-------------eCCCcceEEEEEEEEeecccccccce Q lcl|NC_019418. 135 PYVDGD-KI-RVAFIQAPVFLPLQSNTQDVSSAAILTKTIK-------------TENRKNVYYTLVEFHEWVTPTGQEVG 199 (527) Q Consensus 135 ~~~d~~-~~-~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~-------------~~~~~~~~yt~lE~h~~~~~~~~~~~ 199 (527) +--+.+ ++ .+..++-.+++ +..|..|++..+|....+. ....+..++..++..+.... T Consensus 147 ~~e~~~~~~~~~~~~pl~~~~-v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V~~------ 219 (536) T protein:vir:10 147 LPEPEGSNYNPMKLYRLSSYV-VQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYL------ 219 (536) T ss_pred EeeCCCCceeeEEEEEcCeEE-EeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEEEE------ Confidence 432332 23 36677877777 4566777777665332211 00001112222222111000 Q ss_pred eeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHH Q lcl|NC_019418. 200 STKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTID 279 (527) Q Consensus 200 ~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid 279 (527) ..++..|...+++ + |..++..+- ..++..-+|..++- +...++.||+|-...+.+-+. T Consensus 220 -~~~~~~~~~~~e~----~----g~~v~~~~g---------~~~f~~~P~i~~Rw----~~~~ge~YGrgp~~~~l~D~k 277 (536) T protein:vir:10 220 -DEASGEYLRYEEV----E----GMEVQGSDG---------TYPKEACPYIPIRM----VRLDGESYGRSYIEEYLGDLR 277 (536) T ss_pred -ecCCCcEEEEEee----c----Ccccccccc---------ccccccCCceeeee----eecCCCccccchHHHHHHHHH Confidence 0111222221111 1 122211110 01111122222221 223467899999999999999 Q ss_pred HHHHHHHHHHH-HHHcCcceeeech-hHhcCCC--CCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHH Q lcl|NC_019418. 280 FINRTYDEFMW-EIKMGQRRVIVPE-QMTQLKV--QDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDY 355 (527) Q Consensus 280 ~ld~~~s~~~~-e~~~~~~~i~v~~-~~l~~~~--~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~ 355 (527) .|+..--.... .....+....|++ .++.+.. ++..|. +.+ +..+..++..+...-+...- T Consensus 278 ~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~-------------~v~---g~~~~v~~~~~~~~~~~~~~ 341 (536) T protein:vir:10 278 SLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGD-------------FVT---GRPEDISFLQLEKQADFTVA 341 (536) T ss_pred HHHHHHHHHHHHHHHHhcCCcccCcccccchhhhccCCCcc-------------eec---CCcccceeeeccccccchHH Confidence 99976555544 3344444444432 2322110 111111 111 11111112112221222222 Q ss_pred HHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhcccCCccc Q lcl|NC_019418. 356 ISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKVVGIYRGTIP 434 (527) Q Consensus 356 ~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~~~~~~~~~~ 434 (527) .+.++.+-..|....=+. .+....+...|||||....+...+...-.-..+ ...|.-|+..++.+..-.++....+. T Consensus 342 ~~~i~~~~~rI~~af~~~--~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~ 419 (536) T protein:vir:10 342 KAVSDAIEARLSFAFMLN--SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPK 419 (536) T ss_pred HHHHHHHHHHHHHHHhhh--hcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCh Confidence 344444444443322111 122233445799999999988888666544444 23444566665555432222222222 Q ss_pred CccceEEEeCCCccC-CHHHHHHHHHHHHh--cCC--------CCHHHHHH---hcCCC-------CHHHHHHHHHHHHH Q lcl|NC_019418. 435 ELDDISVNLDDGVFT-DRHAELDYWMKMVA--AGF--------ATQKRGIA---KTLGI-------TEEEAEKELAEING 493 (527) Q Consensus 435 ~~~~v~v~f~d~i~~-d~~~~~~~~~~~~~--aGi--------~s~~~~i~---~~~~~-------~deea~~el~ri~~ 493 (527) +. +.+++--++.. .+...++..+...+ +++ +....++. ...|+ |++|++++.++.+. T Consensus 420 ~~--v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~ 497 (536) T protein:vir:10 420 EA--VEPTISTGLEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSM 497 (536) T ss_pred hh--ccceEEecHHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCCchhhcCCHHHHHHHHHHHHH Confidence 22 33333222211 01122222222111 121 22233332 23465 34555544433222 Q ss_pred hcccccccc-----cCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 494 ELPPESDAE-----LALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 494 E~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) ++....... ........+... ..-=+..+.+++ T Consensus 498 ~~~~~~~a~~~~~~~~~~~~~~~~~~-~~~~~~~g~~~~ 535 (536) T protein:vir:10 498 QMGMDNGAAALAQGMAAQATASPEAM-AAAADSVGLQPG 535 (536) T ss_pred HHHHHHHHHHHHHHHHHHHhcCchhH-HhhhhccccCCC Confidence 111100000 000000000000 000011122222 No 191 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=97.63 E-value=3.5e-05 Score=45.01 Aligned_cols=375 Identities=10% Similarity=0.053 Sum_probs=151.2 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) |+||+++..- ++ . ...-..+ .+ ...+ +..............+...--..++ T Consensus 1 Mg~f~~~~~~----------~~--~---~~~~~~~-----~~---~~~~------~~~~~~~~~v~~~~~l~~~~v~~~i 51 (382) T protein:vir:48 1 MPIFNLATES----------PP--D---NQGGFFD-----VV---DSDF------LASLKGNEWVSAETALRNSDLFSII 51 (382) T ss_pred CccccccccC----------Cc--c---ccccccc-----ch---hhhc------cccccCCcccchHhhhccHHHHHHH Confidence 9998765321 10 0 0000000 00 0000 0000000000000111111223445 Q ss_pred HHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEEEcCCceEEEEEc Q lcl|NC_019418. 81 KKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DK-IRVAFIQAPVFLPLQSN 158 (527) Q Consensus 81 ~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~v~a~~~~P~~~d 158 (527) +.+|+-+-+-| +.+.+......+.+--..-.....++.++...+..|.+++.+.-|. |+ +.+.+++|+.+-++..+ T Consensus 52 ~~ia~~ia~~~--~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~v~~~~ 129 (382) T protein:vir:48 52 NQLSNDLATVK--LITSRKKLQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLD 129 (382) T ss_pred HHHHHhhccCc--eeeecchhhhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcC Confidence 55565554434 3343333332222211111223344556667778899988887764 33 46677777776553222 Q ss_pred CCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccc Q lcl|NC_019418. 159 TQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPV 238 (527) Q Consensus 159 ~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~ 238 (527) .++ ..+|. +.. ++...|..+.+ T Consensus 130 ~~~-----------------~~~y~---~~~----------------------------~~~~~~~~~~~---------- 151 (382) T protein:vir:48 130 NKD-----------------GIYYN---ITF----------------------------DDPRIPPKQHV---------- 151 (382) T ss_pred CCC-----------------eEEEE---EEe----------------------------cCccccceeEE---------- Confidence 111 11221 000 00011111111 Q ss_pred eeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcC-cceeeechhHhcCCCCCCCccc Q lcl|NC_019418. 239 TPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMG-QRRVIVPEQMTQLKVQDNQGNI 317 (527) Q Consensus 239 ~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~-~~~i~v~~~~l~~~~~~~~~~~ 317 (527) . .--+.||+.+.++ +..+|.|.+..+...++.....-.-..+-|+.| .++.++ ........... T Consensus 152 ---~---~~evih~~~~~~~----~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il-----~~~~~~~~e~~ 216 (382) T protein:vir:48 152 ---P---QNDVLHFRLLSVD----GGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGIL-----KIKGGGLLDFK 216 (382) T ss_pred ---c---CccEEEecCCCCC----CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEE-----EeCCCCChHHH Confidence 0 0013466543322 345799999988888865554444344445653 333333 22111110000 Q ss_pred -cccccccc-ccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHH Q lcl|NC_019418. 318 -AFKRRFDV-EQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENS 395 (527) Q Consensus 318 -~~~~~~d~-~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~ 395 (527) .....|.. .+.....+-. .++..++.++....+.++.+..+...++|+...|++|..+|...++..++.... T Consensus 217 ~~~~~~~~~~~~n~g~~~vl--~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~~~~~---- 290 (382) T protein:vir:48 217 TKLSRSRQAMKQMQGGPLVL--DDLEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGDQQSSLEMSS---- 290 (382) T ss_pred HHHHHHHHhhccCCCCeeEc--CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHH---- Confidence 00000000 0000000001 122246666666677788888888889999999999999986443321111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHh Q lcl|NC_019418. 396 DTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAK 475 (527) Q Consensus 396 ~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~ 475 (527) ..++.+|..++..|..-... .++.. ...++... +-.+.........++..+|++++-+++.. T Consensus 291 ----------~~~~~~l~p~~~~i~~~l~~-~l~~~---~~~~~~~~----~~~~~~~~~~~~~~l~~~g~~t~~e~r~~ 352 (382) T protein:vir:48 291 ----------DLYSKAVSRYLRPFLSELSQ-KLSCD---VDADIFPA----VDPTGSNYISRINSLVKTGTLAQNQGLYI 352 (382) T ss_pred ----------HHHHHHHHHHHHHHHHHHHH-HhcCh---hhhhhhhh----hccchhHHHHHHHHHhhcCccCHHHHHHH Confidence 12223333333332211110 01110 00111111 11233445556677888999999887754 Q ss_pred c--CCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCC Q lcl|NC_019418. 476 T--LGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGN 516 (527) Q Consensus 476 ~--~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 516 (527) + .|+..+++.+. +.. .+. .+++++..++ T Consensus 353 l~~~g~~~~~~~~~------~~~-----~~~--~~GGd~~~~~ 382 (382) T protein:vir:48 353 LQQAEILPKELPNG------ENP-----NST--LKGGEEDGQD 382 (382) T ss_pred HhhCCCCCcchhhh------hcC-----CCC--CCCCCCCCCC Confidence 3 24433322111 110 000 1112211111 No 192 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=97.59 E-value=4e-05 Score=44.70 Aligned_cols=388 Identities=13% Similarity=0.102 Sum_probs=162.7 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) =|||+++|+||++... . ...-...+ .+ + + |..|..|.. ++ .+.-+.+.--...+ T Consensus 14 ~g~~~~~~~~f~~~~~-~-~~~~~~~~-~~-~--~---------~~~~~~~~~--v~---------~~~al~~~~v~~cv 67 (424) T protein:vir:18 14 NGWWARLKSWFVGGRL-V-TPNQGSQT-GP-V--S---------AHGYLGDSS--IN---------DERILQISTVWRCV 67 (424) T ss_pred CchHHHHHhhcccccc-c-cccchhhc-cc-c--c---------ccccccccc--cc---------HHHhhccHHHHHHH Confidence 6899999999864210 0 00000000 00 0 0 111221110 00 01111111122444 Q ss_pred HHHhhhhhcccceEe-eC-CH---H--HHHHHHHHHhh--h---hHHHHHHHHHHHHHhcCCEEEEEEEeCC-e-eEEEE Q lcl|NC_019418. 81 KKIASLVYNEQAEIS-AE-DE---T--LNDFLSDMLSN--D---RFNKNFERYLESALALGGLAMRPYVDGD-K-IRVAF 146 (527) Q Consensus 81 ~~~A~ll~~e~~~i~-~~-d~---~--~~~~l~~~l~~--n---~f~~~~~~~~~~a~~~G~~~~~~~~d~~-~-~~i~~ 146 (527) +.+|+-+-+=|..+- .+ +. . ...-+..+|.. | .-..-.+..+...+..|.+++.+..+.+ + +.+.. T Consensus 68 ~~Ia~~iA~lp~~vy~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~ 147 (424) T protein:vir:18 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) T ss_pred HHHHHhhccCceEEEEeccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEE Confidence 555554444343321 11 10 0 11122233321 1 1122234456677888999988877643 3 34555 Q ss_pred EcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCcee Q lcl|NC_019418. 147 IQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERV 226 (527) Q Consensus 147 v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v 226 (527) ++|..+-+. .++ +..+|. +. . ++ ...+ |. T Consensus 148 l~~~~v~v~-~~~------------------~~~~y~---~~-------------~-~g-~~~~---~~----------- 176 (424) T protein:vir:18 148 LQSANMDVK-LVG------------------KKVVYR---YQ-------------R-DS-EYAD---FS----------- 176 (424) T ss_pred ecCcceEEE-EcC------------------CeEEEE---EE-------------e-CC-eEEE---ec----------- Confidence 666655432 111 111221 00 0 00 0000 10 Q ss_pred ecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHH-HHHHcC-cceeee--c Q lcl|NC_019418. 227 NLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFM-WEIKMG-QRRVIV--P 302 (527) Q Consensus 227 ~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~-~e~~~~-~~~i~v--~ 302 (527) +. -+.|++.+.. +...|+|.+.-+...+.-.. ...++. +-|..| ++.-++ | T Consensus 177 ---------~~----------eVihir~~~~-----dg~~G~spi~~~~~~i~~~~-~~~~~~~~~f~ng~~~~gil~~~ 231 (424) T protein:vir:18 177 ---------QK----------EIFHLKGFGF-----TGLVGLSPIAFACKSAGVAV-AMEDQQRDFFANGAKSPQILSTG 231 (424) T ss_pred ---------cc----------cEEEecCcCC-----CCcccccHHHHHHHHHHHHH-HHHHHHHHHHhccCCcceEEEeC Confidence 00 1235553322 22468888877766665433 223333 334543 333333 2 Q ss_pred hhHhcCCCCCCCcccccccccccccceeeeccCCC----CCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccc Q lcl|NC_019418. 303 EQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGN----MDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFT 378 (527) Q Consensus 303 ~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~ 378 (527) ..++ .+...-.....| +..+.+-+.+. .++..++.++......++.+..+....+|+...|++|..+| T Consensus 232 ~~~l-----~~e~~~~~~~~~---~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg 303 (424) T protein:vir:18 232 EKVL-----TEQQRSQVEENF---KEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVG 303 (424) T ss_pred CcCC-----CHHHHHHHHHHH---HHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhC Confidence 2211 000000000111 11111111000 11224556665666778888888888899999999999998 Q ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHH Q lcl|NC_019418. 379 FDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYW 458 (527) Q Consensus 379 ~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~ 458 (527) +...+..++..+..... ..++.+|..++..|-...+. .++.........+.++++.-+..|..+.++.. T Consensus 304 ~~~~~t~~~sn~eq~~~----------~f~~~tl~P~~~~ie~~ln~-~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~ 372 (424) T protein:vir:18 304 DVEKSTSWGSGIEQQNL----------GFLQYTLQPYISRWENSIQR-WLIPSKDVGRLHAEHNLDGLLRGDSASRAAFM 372 (424) T ss_pred CCCCcccccccHHHHHH----------HHHHHHHHHHHHHHHHHHHh-hcCCccccCCeEEEEechhhhccCHHHHHHHH Confidence 77655432222211111 11233333333333222111 12211112234466777777788999999999 Q ss_pred HHHHhcCCCCHHHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019418. 459 MKMVAAGFATQKRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVD 522 (527) Q Consensus 459 ~~~~~aGi~s~~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (527) .+++.+|+|++-+++... |++.-+ ..+-+ +.....+ +...+. ..++.+|+. T Consensus 373 ~~~~~~G~~T~NE~R~~~-gl~pi~ggD~~~--~~~n~~~-------l~~~~~---~~~~~~n~a 424 (424) T protein:vir:18 373 KAMGESGLRTINEMRRTD-NMPPLPGGDVAM--RQAQYVP-------ITDLGT---NKEPRNNGA 424 (424) T ss_pred HHHHhCCCcCHHHHHHHh-CCCCCCCcCeee--eccCccc-------hhhhhc---cCCccccCC Confidence 999999999999876543 554210 00000 0000000 000000 111222222 No 193 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=97.57 E-value=4.3e-05 Score=44.51 Aligned_cols=379 Identities=13% Similarity=0.130 Sum_probs=160.5 Q ss_pred HHHhcCCCcccccccccCccc------------cC----------------------------ceeecchHH--HHHHHH Q lcl|NC_019418. 46 LAYYQSKFDDIEYTNTDGDRK------------RR----------------------------KMQHLPIAR--TAAKKI 83 (527) Q Consensus 46 ~~~y~g~~~~l~~~~~~~~~~------------~~----------------------------~~~~lnl~~--~i~~~~ 83 (527) ++||.-+...+.+.....+++ +| ....+..+. ..++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~~cv~~I 80 (441) T protein:vir:79 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMI 80 (441) T ss_pred CccccCccccccccccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhhhccHHHHHHHHHH Confidence 333333222221111000000 00 001122221 234445 Q ss_pred hhhhhcccceEeeCCH-HHHHHHHHHHh--hhhH---HHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEEEcCCceEEE Q lcl|NC_019418. 84 ASLVYNEQAEISAEDE-TLNDFLSDMLS--NDRF---NKNFERYLESALALGGLAMRPYVDG-DK-IRVAFIQAPVFLPL 155 (527) Q Consensus 84 A~ll~~e~~~i~~~d~-~~~~~l~~~l~--~n~f---~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~v~a~~~~P~ 155 (527) |+-+-+-|..+.-++. .....+-.+|. -|.+ ....+..+...+..|.+++.+..+. |+ +.+.+++|+++-+. T Consensus 81 a~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~ 160 (441) T protein:vir:79 81 ASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELK 160 (441) T ss_pred HHhhccCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEE Confidence 5444443433321111 11122222332 1211 1223455666778899999888775 33 35777888877664 Q ss_pred EEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCc Q lcl|NC_019418. 156 QSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDL 235 (527) Q Consensus 156 ~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l 235 (527) . +.++. .+|.. ..- .+....+. ..|. T Consensus 161 ~-d~~g~-----------------~~~~~-~~~--------------~~~~~~~~-~~~~-------------------- 186 (441) T protein:vir:79 161 S-DARGR-----------------LYYFH-QRI--------------DSNGNNIE-RNVK-------------------- 186 (441) T ss_pred E-CCCcc-----------------EEEEE-EEe--------------ccCCceeE-EEEc-------------------- Confidence 3 32222 12210 000 00000000 0010 Q ss_pred ccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHH-HHcCc-ceeeechhHhcCCCCCC Q lcl|NC_019418. 236 QPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWE-IKMGQ-RRVIVPEQMTQLKVQDN 313 (527) Q Consensus 236 ~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e-~~~~~-~~i~v~~~~l~~~~~~~ 313 (527) +. -+.||+.+.. ....|+|.+.-+...++ +......+... |+.|. ++.++ ....... T Consensus 187 ~~----------dvih~k~~~~-----dg~~G~spl~~~~~~i~-~~~~~~~~~~~~f~ng~~p~gil-----~~~~~~~ 245 (441) T protein:vir:79 187 FE----------DMLDIKFYSL-----DGINGLSLLDTLSRTIE-SDNNGKDFLNNFLRNGTHAGGIL-----KMKGVLD 245 (441) T ss_pred cc----------cEEEeccCCC-----CCccccCHHHHHHHHHH-HHHHHHHHHHHHHhccCCCcEEE-----EcCCCCC Confidence 00 0234543211 12468999888887775 34444444444 45533 33332 2221111 Q ss_pred --Ccccccccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccch Q lcl|NC_019418. 314 --QGNIAFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKT 386 (527) Q Consensus 314 --~~~~~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~T 386 (527) .........|. ..|.+.. .+ -.++..++.++....+.++.+......++|+...|++|..+|...++. + T Consensus 246 ~~e~~e~~r~~~~---~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~-s 321 (441) T protein:vir:79 246 NKKARDRAREEFH---KSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANM-S 321 (441) T ss_pred CHHHHHHHHHHHH---HHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCc-c Confidence 10000111111 1122211 00 011224566666777778888888888999999999999998654432 2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCC Q lcl|NC_019418. 387 ATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGF 466 (527) Q Consensus 387 Atei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi 466 (527) .++.... ...+..-+...++.+|... +... .....+.++++.-+-.|..+.++...+++.+|+ T Consensus 322 ~~q~~~~---~~~tl~P~~~~ie~eln~k------------l~~~--~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~ 384 (441) T protein:vir:79 322 ITDANLD---YLSTLKPYITCVCAELNFK------------FNDE--YVNREFKFDTTEIRVVDEKTQAEIDKINIDSGK 384 (441) T ss_pred HHHHHHH---HHHHHHHHHHHHHHHHhhh------------cccc--ccCceEEeechhhhccCHHHHHHHHHHHHhCCC Confidence 2222111 1123333333333322221 1111 123446666666677788999999999999999 Q ss_pred CCHHHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCCCC--CCCCCCCCcccc Q lcl|NC_019418. 467 ATQKRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQNTV--GNSKDTVDDEDE 526 (527) Q Consensus 467 ~s~~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~ 526 (527) |++-+++... |+..-+ -.+.+-.......+-+ ..++.+.+.. .+....+|++.| T Consensus 385 ~T~NE~R~~~-gl~Pi~ggd~~~~~~~~n~~~~~-----~~~~~~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:79 385 MNIDEIRQRD-GLAPIPGGNGSIHRVDLNHVNIE-----LVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred cCHHHHHHHh-CCCCCCCCCcceEeecccccccc-----cccccccccccccccccCCCCCCC Confidence 9999977554 553210 0000001111111100 0011111111 112234444444 No 194 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=97.57 E-value=4.3e-05 Score=44.51 Aligned_cols=379 Identities=13% Similarity=0.130 Sum_probs=160.5 Q ss_pred HHHhcCCCcccccccccCccc------------cC----------------------------ceeecchHH--HHHHHH Q lcl|NC_019418. 46 LAYYQSKFDDIEYTNTDGDRK------------RR----------------------------KMQHLPIAR--TAAKKI 83 (527) Q Consensus 46 ~~~y~g~~~~l~~~~~~~~~~------------~~----------------------------~~~~lnl~~--~i~~~~ 83 (527) ++||.-+...+.+.....+++ +| ....+..+. ..++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~~cv~~I 80 (441) T protein:vir:94 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMI 80 (441) T ss_pred CccccCccccccccccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhhhccHHHHHHHHHH Confidence 333333222221111000000 00 001122221 234445 Q ss_pred hhhhhcccceEeeCCH-HHHHHHHHHHh--hhhH---HHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEEEcCCceEEE Q lcl|NC_019418. 84 ASLVYNEQAEISAEDE-TLNDFLSDMLS--NDRF---NKNFERYLESALALGGLAMRPYVDG-DK-IRVAFIQAPVFLPL 155 (527) Q Consensus 84 A~ll~~e~~~i~~~d~-~~~~~l~~~l~--~n~f---~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~v~a~~~~P~ 155 (527) |+-+-+-|..+.-++. .....+-.+|. -|.+ ....+..+...+..|.+++.+..+. |+ +.+.+++|+++-+. T Consensus 81 a~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~ 160 (441) T protein:vir:94 81 ASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELK 160 (441) T ss_pred HHhhccCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEE Confidence 5444443433321111 11122222332 1211 1223455666778899999888775 33 35777888877664 Q ss_pred EEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCc Q lcl|NC_019418. 156 QSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDL 235 (527) Q Consensus 156 ~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l 235 (527) . +.++. .+|.. ..- .+....+. ..|. T Consensus 161 ~-d~~g~-----------------~~~~~-~~~--------------~~~~~~~~-~~~~-------------------- 186 (441) T protein:vir:94 161 S-DARGR-----------------LYYFH-QRI--------------DSNGNNIE-RNVK-------------------- 186 (441) T ss_pred E-CCCcc-----------------EEEEE-EEe--------------ccCCceeE-EEEc-------------------- Confidence 3 32222 12210 000 00000000 0010 Q ss_pred ccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHH-HHcCc-ceeeechhHhcCCCCCC Q lcl|NC_019418. 236 QPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWE-IKMGQ-RRVIVPEQMTQLKVQDN 313 (527) Q Consensus 236 ~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e-~~~~~-~~i~v~~~~l~~~~~~~ 313 (527) +. -+.||+.+.. ....|+|.+.-+...++ +......+... |+.|. ++.++ ....... T Consensus 187 ~~----------dvih~k~~~~-----dg~~G~spl~~~~~~i~-~~~~~~~~~~~~f~ng~~p~gil-----~~~~~~~ 245 (441) T protein:vir:94 187 FE----------DMLDIKFYSL-----DGINGLSLLDTLSRTIE-SDNNGKDFLNNFLRNGTHAGGIL-----KMKGVLD 245 (441) T ss_pred cc----------cEEEeccCCC-----CCccccCHHHHHHHHHH-HHHHHHHHHHHHHhccCCCcEEE-----EcCCCCC Confidence 00 0234543211 12468999888887775 34444444444 45533 33332 2221111 Q ss_pred --Ccccccccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccch Q lcl|NC_019418. 314 --QGNIAFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKT 386 (527) Q Consensus 314 --~~~~~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~T 386 (527) .........|. ..|.+.. .+ -.++..++.++....+.++.+......++|+...|++|..+|...++. + T Consensus 246 ~~e~~e~~r~~~~---~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~-s 321 (441) T protein:vir:94 246 NKKARDRAREEFH---KSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANM-S 321 (441) T ss_pred CHHHHHHHHHHHH---HHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCc-c Confidence 10000111111 1122211 00 011224566666777778888888888999999999999998654432 2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCC Q lcl|NC_019418. 387 ATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGF 466 (527) Q Consensus 387 Atei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi 466 (527) .++.... ...+..-+...++.+|... +... .....+.++++.-+-.|..+.++...+++.+|+ T Consensus 322 ~~q~~~~---~~~tl~P~~~~ie~eln~k------------l~~~--~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~ 384 (441) T protein:vir:94 322 ITDANLD---YLSTLKPYITCVCAELNFK------------FNDE--YVNREFKFDTTEIRVVDEKTQAEIDKINIDSGK 384 (441) T ss_pred HHHHHHH---HHHHHHHHHHHHHHHHhhh------------cccc--ccCceEEeechhhhccCHHHHHHHHHHHHhCCC Confidence 2222111 1123333333333322221 1111 123446666666677788999999999999999 Q ss_pred CCHHHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCCCC--CCCCCCCCcccc Q lcl|NC_019418. 467 ATQKRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQNTV--GNSKDTVDDEDE 526 (527) Q Consensus 467 ~s~~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~ 526 (527) |++-+++... |+..-+ -.+.+-.......+-+ ..++.+.+.. .+....+|++.| T Consensus 385 ~T~NE~R~~~-gl~Pi~ggd~~~~~~~~n~~~~~-----~~~~~~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:94 385 MNIDEIRQRD-GLAPIPGGNGSIHRVDLNHVNIE-----LVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred cCHHHHHHHh-CCCCCCCCCcceEeecccccccc-----cccccccccccccccccCCCCCCC Confidence 9999977554 553210 0000001111111100 0011111111 112234444444 No 195 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=97.50 E-value=5.6e-05 Score=43.90 Aligned_cols=387 Identities=12% Similarity=0.068 Sum_probs=161.8 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) =|||.++|.||++....-..+. ..+. + + + +..++.|.. ++ . +.-+...---..+ T Consensus 14 ~g~~~~~~~~~~~~~~~~~~~~--~~~~-~-~--~---------~~~~~~~~~--v~-----~----~~al~~~~v~~cv 67 (424) T protein:vir:18 14 NGWWARLQSWFVGGRLVTPNQG--SQTG-P-V--S---------AHGHLGDSS--IN-----D----ERILQISTVWRCV 67 (424) T ss_pred CchHHHHHhhhccccccccccc--cccc-c-c--c---------ccccccccc--cc-----H----HHhhccHHHHHHH Confidence 6899999999964221100000 0000 0 0 0 011111110 00 0 0001111111334 Q ss_pred HHHhhhhhcccceE-eeC-CH-----HHHHHHHHHHh-h-h---hHHHHHHHHHHHHHhcCCEEEEEEEeCCe--eEEEE Q lcl|NC_019418. 81 KKIASLVYNEQAEI-SAE-DE-----TLNDFLSDMLS-N-D---RFNKNFERYLESALALGGLAMRPYVDGDK--IRVAF 146 (527) Q Consensus 81 ~~~A~ll~~e~~~i-~~~-d~-----~~~~~l~~~l~-~-n---~f~~~~~~~~~~a~~~G~~~~~~~~d~~~--~~i~~ 146 (527) +.+|+-+-+=|..+ ..+ +. ....-+..+|. . | ....-....+...+-.|.+++.+..+.++ +.+.. T Consensus 68 ~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~p 147 (424) T protein:vir:18 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) T ss_pred HHHHHhhccCceEEEEeecCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEE Confidence 44444443333222 111 00 00111222222 1 1 12223344566777789999888776533 34555 Q ss_pred EcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCcee Q lcl|NC_019418. 147 IQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERV 226 (527) Q Consensus 147 v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v 226 (527) ++|+.+-+. .+++ ..+|+ +.. ++. ..+ | T Consensus 148 l~~~~V~v~-~~~~------------------~~~y~---~~~--------------~g~-~~~---~------------ 175 (424) T protein:vir:18 148 LQSANMDVK-LVGK------------------KVVYR---YQR--------------DSE-YAD---F------------ 175 (424) T ss_pred ecCcceEEE-EcCC------------------eEEEE---EEe--------------CCe-EEE---e------------ Confidence 666655432 1111 11221 100 000 000 1 Q ss_pred ecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHH-HHcC-cceeee--c Q lcl|NC_019418. 227 NLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWE-IKMG-QRRVIV--P 302 (527) Q Consensus 227 ~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e-~~~~-~~~i~v--~ 302 (527) + +. -..||+.+.. +...|+|.+.-+...++.. ....++... |..| ++..++ | T Consensus 176 ~--------~~----------eIih~r~~~~-----dg~~G~spi~~~~~~i~~~-~a~~~~~~~~f~ng~~p~gil~~~ 231 (424) T protein:vir:18 176 S--------QK----------EIFHLKGFGF-----TGLVGLSPIAFACKSAGVA-VAMEDQQRDFFANGAKSPQILSTG 231 (424) T ss_pred c--------cc----------cEEEecCcCC-----CCcccccHHHHHHHHHHHH-HHHHHHHHHHHHccCCcceEEEeC Confidence 0 00 1235553321 2346888887777666443 333333333 4543 333333 2 Q ss_pred hhHhcCCCCCCCcccccccccccccceeeeccCCC----CCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccc Q lcl|NC_019418. 303 EQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGN----MDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFT 378 (527) Q Consensus 303 ~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~ 378 (527) ..++ .+.........| +..+.+-+.+. .++..++.++......++.+..+...++|+...|++|..+| T Consensus 232 ~~~l-----~~e~~~~~~~~~---~~~~~g~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg 303 (424) T protein:vir:18 232 EKVL-----TEQQRSQVEENF---KEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVG 303 (424) T ss_pred CcCC-----CHHHHHHHHHHH---HHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhC Confidence 2211 000000011111 11121111100 12224566666666778888888888999999999999998 Q ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHH Q lcl|NC_019418. 379 FDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYW 458 (527) Q Consensus 379 ~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~ 458 (527) ....+..+.+.+..... ..++.+|..++..|....+. .++.........+.++++.-+..|..+.++.. T Consensus 304 ~~~~~t~~~sn~eq~~~----------~f~~~tl~P~~~~ie~~l~~-~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~ 372 (424) T protein:vir:18 304 DVEKSTSWGSGIEQQNL----------GFLQYTLQPYISRWENSIQR-WLIPAKDVGRIHAEHNLDGLLRGDSASRAAFM 372 (424) T ss_pred CCCCcccccccHHHHHH----------HHHHHHHHHHHHHHHHHHHh-hcCCccccCCeEEEEechhhhccCHHHHHHHH Confidence 76554432222211111 11223333333333221111 12111111234466777777788999999999 Q ss_pred HHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCC--CCCCCCCCCCCCC Q lcl|NC_019418. 459 MKMVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKG--QQNTVGNSKDTVD 522 (527) Q Consensus 459 ~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 522 (527) .+++.+|+|++-+++... |++.-+ - .+.-....+... +-....++.+|+. T Consensus 373 ~~~~~~G~~T~NE~R~~~-gl~pi~-----------g--GD~~~~~~n~~~l~~~~~~~~p~~~ga 424 (424) T protein:vir:18 373 KAMGEAGLRTINEMRRTD-NLPPLP-----------G--GDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) T ss_pred HHHHhCCCcCHHHHHHHh-CCCCCC-----------C--cCeeeeccCccchHhhhccCCCccCCC Confidence 999999999999876543 544210 0 000000000000 0000111222222 No 196 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=97.46 E-value=6.2e-05 Score=43.66 Aligned_cols=390 Identities=11% Similarity=0.052 Sum_probs=149.9 Q ss_pred HHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHH--HHHHHHhhhhhcccce Q lcl|NC_019418. 16 YNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIAR--TAAKKIASLVYNEQAE 93 (527) Q Consensus 16 ~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~--~i~~~~A~ll~~e~~~ 93 (527) ++||..-. . .+. +. |...+... .+ .....|..... ..+..+. .+++.+|+-+-+-|.. T Consensus 1 m~~~~~~~-~-----~~~--~~-------~~~~~~~~--~~-~~~~~g~~~~~--~Al~~~~V~~cv~~ia~~iA~lp~~ 60 (417) T protein:vir:38 1 MKLFRGLA-T-----EVD--PH-------WADHLLDS--GV-IPSFRGGYLGI--SALRNSDVLTAVSIVSGDVSRFPLV 60 (417) T ss_pred Cccccccc-c-----CCC--cc-------chhhhccc--cc-ccccCCceech--hhcccHHHHHHHHHHHHhhccCeeE Confidence 11221110 0 000 00 21111110 00 00011111000 1223232 3445555555544444 Q ss_pred EeeCC--HH-HHHHHHHHHhh--h---hHHHHHHHHHHHHHhcCCEEEEEEEeC--Cee-EEEEEcCCceEEEEEcCCce Q lcl|NC_019418. 94 ISAED--ET-LNDFLSDMLSN--D---RFNKNFERYLESALALGGLAMRPYVDG--DKI-RVAFIQAPVFLPLQSNTQDV 162 (527) Q Consensus 94 i~~~d--~~-~~~~l~~~l~~--n---~f~~~~~~~~~~a~~~G~~~~~~~~d~--~~~-~i~~v~a~~~~P~~~d~~~~ 162 (527) +--.+ .. ....+..+|.. | ....-....+...+..|.+++.+..+. +.+ .+.+++|+++.+...+.+ T Consensus 61 ~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~~~~~~-- 138 (417) T protein:vir:38 61 ITDSSTDEVIDLANIEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVDTSDPD-- 138 (417) T ss_pred EEEcCCcceeccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEEEcCCC-- Confidence 32111 10 01122223321 1 112233445666777899988887763 333 355677777665322211 Q ss_pred EEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeec Q lcl|NC_019418. 163 SSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQ 242 (527) Q Consensus 163 ~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~ 242 (527) ..+|.. .. .++... ..+ + +. T Consensus 139 ----------------~~~y~~---~~-------------~~~~~~---~~~------------~--------~~----- 158 (417) T protein:vir:38 139 ----------------NIIYRF---TP-------------YNSSMQ---KVC------------G--------FE----- 158 (417) T ss_pred ----------------eEEEEE---EE-------------cCCcEE---EEe------------c--------Cc----- Confidence 112210 00 000000 000 0 00 Q ss_pred CCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCc-ceeeechhHhcCCCCCCCc-ccccc Q lcl|NC_019418. 243 GLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQ-RRVIVPEQMTQLKVQDNQG-NIAFK 320 (527) Q Consensus 243 g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~-~~i~v~~~~l~~~~~~~~~-~~~~~ 320 (527) -+.||+.+..| ...|+|.+.-+...|......-.-..+-|+.|. +..++ ......... .-... T Consensus 159 -----dviH~r~~~~d-----~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il-----~~~~~l~~e~~~~~~ 223 (417) T protein:vir:38 159 -----DVIHWKFFSYD-----TIMGRSPLLSLGDEIGLQESGVSTLQKFFKSGLKGSIIK-----AKESRLSAEARQKIR 223 (417) T ss_pred -----ceEEecCCCCC-----CccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEE-----EeCCCCCHHHHHHHH Confidence 02355542211 235899887777766544443333333355533 22222 211111100 00000 Q ss_pred cccccccceeeeccCCC----CCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHH Q lcl|NC_019418. 321 RRFDVEQNVYMQVGAGN----MDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSD 396 (527) Q Consensus 321 ~~~d~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~ 396 (527) ..| +..|.+-+.+. .++..++.++....+.++++......++|+...|++|..+|.... ..++++.. T Consensus 224 ~~~---~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~-~s~~e~~~----- 294 (417) T protein:vir:38 224 EDF---ERAQAGADAGSPIIVDATMDYQPLEVDTNVLNLINSNNYSTAQIAKALRVPAYRLAQNSP-NQSVKQLA----- 294 (417) T ss_pred HHH---HHHhcccccCCceeccCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHhCCCCc-chhHHHHH----- Confidence 111 11122211110 112234555555556677787777788999999999999974322 22232221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc Q lcl|NC_019418. 397 TYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKT 476 (527) Q Consensus 397 ~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~ 476 (527) ...++.+|..++..|....+. .++... ......|.|+..- .+. ...+...+++.+|+|++-+++... T Consensus 295 --------~~~~~~tl~P~~~~ie~~l~~-~Ll~~~--~~~~~~~~fd~~~-l~~-~~~~~~~~~~~~G~~T~NE~R~~~ 361 (417) T protein:vir:38 295 --------DDYIRNDLPFYFEPITSEFEL-KLLDDA--QRHQYCIGFDTKS-VNG-LPIADVNTAVNGGLWTGNEGRAEL 361 (417) T ss_pred --------HHHHHHHHHHHHHHHHHHHHh-hhcChh--hcccceEEechhh-hhH-HHHHHHHHHHhCCCcCHHHHHHHh Confidence 112233444444443322111 122111 1223456675432 222 234456678889999999987654 Q ss_pred --CCCCHHHHHHHH-----HHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 477 --LGITEEEAEKEL-----AEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 477 --~~~~deea~~el-----~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) .++.+.++.+-. ..+..... .+.......++++++ ++.+.++.++++. T Consensus 362 gl~pi~~g~~d~~~~~~n~~~~d~~~~--~~~~~~~~~kgg~~~-~~~~~~~~~~~~~ 416 (417) T protein:vir:38 362 GKKPLKDPNMDRIQSTLNTVFLDQKEA--YQAEHAAELKGGDTN-AKGNQNGSGTNAN 416 (417) T ss_pred CCCCCCCCCCCeeeecccccccccccc--cccccccccCCCCCC-CCCCCcCCCCcCC Confidence 223322221111 01111000 001111222333332 2233333333333 No 197 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=97.46 E-value=6.2e-05 Score=43.65 Aligned_cols=396 Identities=13% Similarity=0.067 Sum_probs=149.1 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCce-eecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKM-QHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~-~~lnl~~~i 79 (527) ||+|++|... + .. .+.++.+ | ..|.. ................ ...+--..+ T Consensus 1 Mg~~~~~~~~--~-------~~---------~~~~~~~------~---~~~~~-~~~~~~~~~~~~~~~~~~~~~~v~~~ 52 (423) T protein:vir:81 1 MGFLQKLGLA--P-------SV---------VATPEPI------E---LVGPI-FESLKLSTKNMTVEQIWEDQPHLRTV 52 (423) T ss_pred CchhHhhccc--c-------cc---------ccCcccc------c---ccccc-ccccccccchhhHHHHHHhhhHHHHH Confidence 9998887411 1 00 0111111 0 11100 0000000000000000 011112244 Q ss_pred HHHHhhhhhcccceE-e-e-CCH---HHHHHHHHHHhhhh----HHHHHHHHHHHHHhcCCEEEEEEEeCCee-EEEEEc Q lcl|NC_019418. 80 AKKIASLVYNEQAEI-S-A-EDE---TLNDFLSDMLSNDR----FNKNFERYLESALALGGLAMRPYVDGDKI-RVAFIQ 148 (527) Q Consensus 80 ~~~~A~ll~~e~~~i-~-~-~d~---~~~~~l~~~l~~n~----f~~~~~~~~~~a~~~G~~~~~~~~d~~~~-~i~~v~ 148 (527) ++.+|+-+-+-|..+ . . ++. ..+..+..++.+.+ ....+..++.+.+..|.+++.+.-|.+.. .+-.+ T Consensus 53 i~~ia~~ia~lp~~~~~~~~dg~~~~~~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~~~~l- 131 (423) T protein:vir:81 53 TTFIARNVASLQLQAFERVEDGGRERVREGHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTPTLDI- 131 (423) T ss_pred HHHHHHhHhhCceEEEEEecCCceeeeccchHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcceEEE- Confidence 555555555544332 1 1 111 11122333333211 22333445566677888887766554321 11111 Q ss_pred CCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeec Q lcl|NC_019418. 149 APVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNL 228 (527) Q Consensus 149 a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l 228 (527) .|+.. . .+.. .... ...+..+|+. +... ..-|..+.+ T Consensus 132 ----~p~~~---~---~v~~-~~~~-~~~~~~~Y~~-----------------------------~~~~--~~~g~~~~~ 168 (423) T protein:vir:81 132 ----RPIPV---S---WVQR-RAYK-DGWGSLDYII-----------------------------IESG--DNDGRSVKV 168 (423) T ss_pred ----eeccc---c---eeee-eecc-CCCcceEEEE-----------------------------EEec--CCCceEEEE Confidence 11100 0 0000 0000 0111122221 1110 011222111 Q ss_pred ccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHH-HHcC-cceeeechhHh Q lcl|NC_019418. 229 SELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWE-IKMG-QRRVIVPEQMT 306 (527) Q Consensus 229 ~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e-~~~~-~~~i~v~~~~l 306 (527) ++ --+.|++.+.++. ...|+|.+.-+...+.-.... .++... |+.| .+..++ T Consensus 169 -------~~---------~evih~r~~~~~~----~~~G~spi~~~~~~i~~~~~~-~~~~~~~f~ng~~p~gvi----- 222 (423) T protein:vir:81 169 -------PG---------ERVIHRHGYNPKT----MKRGKSPVQSLRDILGEQIEA-AIFRAQMWRNGPRPGMVI----- 222 (423) T ss_pred -------cc---------cceEEecCCCCCC----ccccccHHHHHHHHHHHHHHH-HHHHHHHHhccCCCceEE----- Confidence 00 1134666443332 235899888777766544332 233333 4543 332222 Q ss_pred cCCCCCCCccccccc--cc-ccccceeeeccCCC------CCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccc Q lcl|NC_019418. 307 QLKVQDNQGNIAFKR--RF-DVEQNVYMQVGAGN------MDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMF 377 (527) Q Consensus 307 ~~~~~~~~~~~~~~~--~~-d~~~~~~~~~~~~~------~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~ 377 (527) ........+++.... .+ ..-+..|.+..... .+...++.++....+.++.+..+....+|+...|++|..+ T Consensus 223 ~~~~~~~~~~l~~e~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~l 302 (423) T protein:vir:81 223 MRDPESKAGKWDAESRTRFMANLRASFSPKSSDVGGTLLLEDGMKAENFHTTSKDEQTVETTKLSLQTVAQVYGINPTMV 302 (423) T ss_pred EecCcccCccCCHHHHHHHHHHHHHHhccccccCCcceecCCCceEEeccCChhhHHHHHHHHhhHHHHHHHhCCCHHHh Confidence 211111111111100 00 00001111100000 1112355555555566777877777888999999999999 Q ss_pred ccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcc--cCccceEEEeCCCccCCHHHH Q lcl|NC_019418. 378 TFDGQGV-KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTI--PELDDISVNLDDGVFTDRHAE 454 (527) Q Consensus 378 ~~~~~g~-~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~--~~~~~v~v~f~d~i~~d~~~~ 454 (527) |+..++. .+.++..... ...+..-+...++.+|... ++.... .....+.++++.-+..|.++. T Consensus 303 g~~~~~t~sn~e~~~~~f--~~~~L~P~~~~ie~~l~~~------------L~~~~~~~~~~~~~~fd~~~llr~d~~~r 368 (423) T protein:vir:81 303 GQLDNANYSNVREFRKAL--YGDNLGSWIRIIQDVMNLF------------LLPRVGIDNEKFYFEFNLEEKLRASFEEA 368 (423) T ss_pred cCCCCCCcccHHHHHHHH--HHHHHHHHHHHHHHHHhhh------------hcCccccccCccEEEecchhhhccCHHHH Confidence 8754432 1222221111 1112222222233332221 111111 112224444455556688777 Q ss_pred HHHHHHHH-hcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 455 LDYWMKMV-AAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 455 ~~~~~~~~-~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) ++...+++ .+|+|++-++++. .|+..-+ ..+.-..+ .+...++..+.++++.|. T Consensus 369 ~~~~~~~l~~~G~~T~NE~R~~-~gl~p~~-------------gGD~~~~p-----~n~~~~~~~~~~~~~~~t 423 (423) T protein:vir:81 369 AEIKRAAVGNVAWMTINEVRAM-DNLPSID-------------GGDDLARP-----LNTEFGDSEDAPGEEVET 423 (423) T ss_pred HHHHHHHHhCCCCcCHHHHHHH-hCCCCCC-------------Ccceeecc-----cccccCccCCCCCCCCCC Confidence 77777665 4689988886643 3553211 00001111 111223333444555555 No 198 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=97.42 E-value=7.2e-05 Score=43.31 Aligned_cols=338 Identities=14% Similarity=0.111 Sum_probs=132.4 Q ss_pred cCCCcccccccccCccccCceeecchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcC Q lcl|NC_019418. 50 QSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSNDRFNKNFERYLESALALG 129 (527) Q Consensus 50 ~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G 129 (527) -.+-|+.-++ + + +.+-+.++.+|--.|.. .....++ +...+...+..| T Consensus 1 ia~lp~~~~~---~----~--------~~~~~~l~~lL~~~PN~----~~t~~~f-------------~~~~~~~l~l~G 48 (348) T protein:vir:93 1 MASLPLKMYE---D----Y--------KVVNTEVSDLLTVSPNN----SLSSFDF-------------INQIETIRNEKG 48 (348) T ss_pred CcccceEeEe---c----C--------cCcccHHHHHHHhCCCC----CCCHHHH-------------HHHHHHHHhhcC Confidence 1111110000 0 0 00001112222111210 1111222 233455667789 Q ss_pred CEEEEEEEeC-Cee-EEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCce Q lcl|NC_019418. 130 GLAMRPYVDG-DKI-RVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLY 207 (527) Q Consensus 130 ~~~~~~~~d~-~~~-~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~ 207 (527) .+++.+..+. |++ .+..++|+.+-++..+. ....+|+. .. .. T Consensus 49 na~~~i~r~~~G~~~~L~~l~~~~v~~~~~~~-----------------~~~~~y~~-~~-----------------~~- 92 (348) T protein:vir:93 49 NAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ-----------------SRELYYSI-HA-----------------AT- 92 (348) T ss_pred CeEEEEEECCCCcEEEEEEEcCCceEEEEeCC-----------------CcEEEEEE-Ec-----------------CC- Confidence 9988887764 333 44455555544332111 11112210 00 00 Q ss_pred EEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHH Q lcl|NC_019418. 208 RITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDE 287 (527) Q Consensus 208 ~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~ 287 (527) |..+.+ ..- -..||+.+.+. +..+|+|.+.-+...++..+.+-.. T Consensus 93 ---------------g~~~~~-------------~~~---eiih~r~~~~~----~~~~G~s~~~~~~~~i~~~~~~~~~ 137 (348) T protein:vir:93 93 ---------------GNKLIV-------------HNM---DMLHFKHIVAS----NMVQGISPIDVLKNTTDFDNAVRTF 137 (348) T ss_pred ---------------CeEEEE-------------ccc---cEEEecCCCCC----CceeeccHHHHHHHHHHHHHHHHHH Confidence 111000 000 02455543222 2335888887777766644433222 Q ss_pred HHHHHHcCcceeeechhHhcCCCCCCCcccccccccccccceeee---ccCCCCCCCcceEeccccChHHHHHHHHHHHH Q lcl|NC_019418. 288 FMWEIKMGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQ---VGAGNMDSGGIVDLTTPIRSSDYISAISEGLK 364 (527) Q Consensus 288 ~~~e~~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~---~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~ 364 (527) -...+..+..-++.+...+ +.+.- -.....|. ..|.. +-.- +++..++.++......++.+..+...+ T Consensus 138 ~~~~~~~~~~~i~~~~~~l----~~e~~-~~~~~~~~---~~~~n~~~~~vl-~~g~~~~~l~~~~~d~q~~e~~~~~~~ 208 (348) T protein:vir:93 138 NLTEMQKPDSFMLKYGSNV----STEKR-QQVLEDFK---QYYEENGGILFQ-EPGVEIEPLPKKYVSEDIVASENLTRE 208 (348) T ss_pred HHHhcCCCceeEEecCCCC----CHHHH-HHHHHHHH---HHhhcCCCeeec-CCCceEEEcCCChhHHHHHHHHHHHHH Confidence 1112211111111111111 00000 00001111 11110 0000 122245556656666688888888888 Q ss_pred HHHHhcCCCccccccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCC-cccCccceEEE Q lcl|NC_019418. 365 LFEMQIGVSSGMFTFDGQGV-KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRG-TIPELDDISVN 442 (527) Q Consensus 365 ~i~~~~g~s~~~~~~~~~g~-~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~-~~~~~~~v~v~ 442 (527) +|+...|++|.-++...++. .++.+.. +..++.+|.-++..|-...+. .++.. .......+.++ T Consensus 209 ~Ia~~fgVP~~~lg~~~~~~~~~~e~~~-------------~~~~~~~l~P~~~~ie~~l~~-~l~~~~~~~~g~~i~fd 274 (348) T protein:vir:93 209 RVANVFQLPSIFLNARSNTNFAKNEELN-------------RFYLQHTLLPIVKQYEEEFNR-KLLTKTDREKNRYFKFN 274 (348) T ss_pred HHHHHhCCCHHHhCCCCCCCcccHHHHH-------------HHHHHHHHHHHHHHHHHHHHH-hhCCcccccCcceEEee Confidence 99999999999887654332 2233221 111233333333333222111 11111 11122335555 Q ss_pred eCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCC Q lcl|NC_019418. 443 LDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTV 521 (527) Q Consensus 443 f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 521 (527) ++.-+..|..+.++...+++.+|+|++-+++.++ |+..-+ ..+-+ +.....+ .+...+ .. ..-.+ T Consensus 275 ~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~~~-g~~p~~ggD~~~--~~~n~~~-----~~~~~~--~~----~~~~g 340 (348) T protein:vir:93 275 VKSYLRADSATQAEVYFKAVRSGYYTINDIREWE-DLPPVEGGDKPL--ISGDLYP-----IDTPLE--LR----KSLKG 340 (348) T ss_pred chhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHh-CCCCCCCcCeEe--ecccccc-----cccchh--hc----ccccC Confidence 5565667889999999999999999999977654 553210 10100 0000000 000000 00 00111 Q ss_pred Cc--cccC Q lcl|NC_019418. 522 DD--EDEA 527 (527) Q Consensus 522 ~~--~~~~ 527 (527) |+ .+|+ T Consensus 341 g~~n~~~~ 348 (348) T protein:vir:93 341 GDKNVNES 348 (348) T ss_pred CCCCcCCC Confidence 11 1111 No 199 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=97.40 E-value=7.6e-05 Score=43.18 Aligned_cols=381 Identities=12% Similarity=0.086 Sum_probs=139.7 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCcc--ccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKV--AVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIART 78 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i--~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~ 78 (527) ||+++-++..+..+. ++ ...........+. ..+.+-+. .+-..|. =-.. T Consensus 1 mg~~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~t~~~~~---~~~~v~~------------------------cv~~ 51 (403) T protein:vir:10 1 MGFKSWITEKLNPGQ-RI-IRDMEPVSHRTNRKPFTTGQAYS---KIEILNR------------------------TANM 51 (403) T ss_pred Ccchhhhhhccchhh-hh-hhcccccccccCCcccccHHHHH---HHHHHHH------------------------HHHH Confidence 998776666653211 11 0111111111100 00110000 0000000 0123 Q ss_pred HHHHHhhhhhcccceEe---eCCHHHHHHHHHHHhh--hh---HHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcCC Q lcl|NC_019418. 79 AAKKIASLVYNEQAEIS---AEDETLNDFLSDMLSN--DR---FNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQAP 150 (527) Q Consensus 79 i~~~~A~ll~~e~~~i~---~~d~~~~~~l~~~l~~--n~---f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a~ 150 (527) |++..|++-+.-..... -.+.....-+..+|.. |. .....+..+..++..|.+++. .++.. +..++++ T Consensus 52 Ia~~ia~~p~~v~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~--~~~~~--l~~l~~~ 127 (403) T protein:vir:10 52 VIDSAAECSYTVGDKYNIVTYANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIY--WDGTS--LYHVPAA 127 (403) T ss_pred HHHHHhhCceeEeecccccccccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEE--EeCce--eEeecCc Confidence 44444433221100000 0000111112223321 11 112223345566667877654 34332 3334554 Q ss_pred ceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeeccc Q lcl|NC_019418. 151 VFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSE 230 (527) Q Consensus 151 ~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~ 230 (527) .+-. ..+.++ .+|. ..+. +.++ |.. T Consensus 128 ~~~v-~~~~~~------------------~~~~-~~~~------------------~~~~---~~~-------------- 152 (403) T protein:vir:10 128 LMQV-EADANK------------------FIKK-FIFN------------------NQIN---YRV-------------- 152 (403) T ss_pred ceEE-EEcCCc------------------eEEE-EEec------------------Ccee---ecc-------------- Confidence 4322 111111 1110 0000 0000 000 Q ss_pred ccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCC Q lcl|NC_019418. 231 LYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKV 310 (527) Q Consensus 231 ~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~ 310 (527) .+ +.||+....-........|.|.+.-+...++....+..--.+-|..|... ..++..+. T Consensus 153 ------~e----------iih~~~~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~----~gil~~~~ 212 (403) T protein:vir:10 153 ------DE----------IIFIKDNSYVCGTNSQISGQSRVATVIDSLEKRSKMLNFKEKFLDNGTVI----GLILETDE 212 (403) T ss_pred ------cc----------eEEecccccccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCc----ceEEEeCC Confidence 00 11222111000011234688888777777765554443333445654432 22222222 Q ss_pred CCCCccc-ccccccccccceeeecc-CC----CCCCCcceEec--cccChHHHHHHHHHHHHHHHHhcCCCccccccccc Q lcl|NC_019418. 311 QDNQGNI-AFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLT--TPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQ 382 (527) Q Consensus 311 ~~~~~~~-~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~--~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~ 382 (527) ....... .....| ...|.+.. .+ -.++..++.++ ++....++.+..+...++|+...|++|..+|.... T Consensus 213 ~l~~e~~~~~~~~~---~~~~~g~~n~g~~~vl~~g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~ 289 (403) T protein:vir:10 213 ILNKKLRERKQEEL---QLDYNPSTGQSSVLILDGGMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGGNN 289 (403) T ss_pred CCCHHHHHHHHHHH---HHHhCCcccCcceeecCCCceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC Confidence 1111100 000011 11121110 00 00111244444 23446678888888888999999999999874322 Q ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCC--CccCCHHHHHHHHHH Q lcl|NC_019418. 383 GVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDD--GVFTDRHAELDYWMK 460 (527) Q Consensus 383 g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d--~i~~d~~~~~~~~~~ 460 (527) .+..+. ..+-...++.-+...++.+|... + ...+.++++. -+..|..+.++...+ T Consensus 290 --sn~e~~--~~~f~~~tl~P~~~~ie~~l~~~------------L-------~~~~~~d~~~~~~l~~D~~~~~~~~~~ 346 (403) T protein:vir:10 290 --ANIRPN--IELFYYMTIIPMLNKLTSSLTFF------------F-------GYKITPNTKEVAALTPDKEAEAKHLTS 346 (403) T ss_pred --cCHHHH--HHHHHHHHHHHHHHHHHHHHHHh------------c-------CceeeeccchhhhcccCHHHHHHHHHH Confidence 122221 11111122222222222222221 1 1234444543 255688888888889 Q ss_pred HHhcCCCCHHHHHHhc--CCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_019418. 461 MVAAGFATQKRGIAKT--LGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDE 524 (527) Q Consensus 461 ~~~aGi~s~~~~i~~~--~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (527) ++.+|+|++-+++... .+++++.+.+-+-. ...+. ...+..++++++. +.+..+| T Consensus 347 ~~~~G~lT~NE~R~~~gl~pi~~~~~d~~~~p--~n~~~--~~~~~~~~e~~~~-----~~~~~g~ 403 (403) T protein:vir:10 347 LVNNGIITGNEARSELNLEPLDDEQMNKIRIP--ANVAG--SATGVSGQEGGRP-----KGSTEGD 403 (403) T ss_pred HHhCCCcCHHHHHHHhCCCCCCcccccccccc--ccccc--ccccCCCCcCCCC-----CCCcCCC Confidence 9999999999977653 23443333222110 11100 0111111111111 1111111 No 200 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=97.39 E-value=7.7e-05 Score=43.13 Aligned_cols=438 Identities=12% Similarity=0.098 Sum_probs=162.7 Q ss_pred hHHHHHHHHHHHHHHhh---cccchhhhccCccccCHHHHHHHHHH-----HHHhcCCC----cccccccccCc-cccCc Q lcl|NC_019418. 3 LIQKVKDFFNRGRYNMT---TSHLSSILDHPKVAVTQSEFRRIQHN-----LAYYQSKF----DDIEYTNTDGD-RKRRK 69 (527) Q Consensus 3 ~~~~~k~~~~~~~~~~~---~~~~~~~~~~~~i~~~~~~~~~i~~~-----~~~y~g~~----~~l~~~~~~~~-~~~~~ 69 (527) |.+++-+.|.+-..-.+ +.....+.|... ....+|+.. +..+.-.- |.+.. ..|. ...++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~a~~~p~~~~--~~~~~~~~~~ 73 (576) T protein:vir:96 1 MVTRLADIFKRLRLGRDYEDIIDTVPIDDGLQ-----ANIRNIEEKSKELNKSLYGKQQAYAEPFLEV--MDTNPEFRTK 73 (576) T ss_pred ChhhHHHHHHHHhccCccccchhhhhcccChh-----HHHHHhhhhhhhhccccCCccchhhcceeee--eecCCCcccc Confidence 66666666654331111 111222222211 111222211 11111100 00000 0110 00000 Q ss_pred e--e--------------ecchHHHHHHHHhhhhhccc-----------ceEee-------CCHHH--HHHHHHHHh--- Q lcl|NC_019418. 70 M--Q--------------HLPIARTAAKKIASLVYNEQ-----------AEISA-------EDETL--NDFLSDMLS--- 110 (527) Q Consensus 70 ~--~--------------~lnl~~~i~~~~A~ll~~e~-----------~~i~~-------~d~~~--~~~l~~~l~--- 110 (527) . . .-++...+++..|+-+..-. ..|.. .+... ...++..+. T Consensus 74 p~~~~~~~~~~~~l~~~~~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~ 153 (576) T protein:vir:96 74 RSYMKNSDNLHDVLKQFGNNPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTG 153 (576) T ss_pred CcchhhhhhhHHHHHHhhcCHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhcc Confidence 0 0 01223444444444332110 01111 11111 122233221 Q ss_pred -h-----hhHHHHHHHHHHHHHhcCCEEEEEEEeC---Ce-eEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcce Q lcl|NC_019418. 111 -N-----DRFNKNFERYLESALALGGLAMRPYVDG---DK-IRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNV 180 (527) Q Consensus 111 -~-----n~f~~~~~~~~~~a~~~G~~~~~~~~d~---~~-~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~ 180 (527) + ..+...+..++.+.+..|.+++.+.++. ++ +.+..++|.++.++...++.. T Consensus 154 ~~~~p~~~t~~~f~~~lv~dlll~Gna~~~i~~~rd~~g~~~~L~pl~p~~V~v~~~~dg~~------------------ 215 (576) T protein:vir:96 154 RDKDIDRDSFQSFCRKIVRDTYTYDQVNFEKVFNKKNATTMDKFIAVDPSTIFYATDKNGKI------------------ 215 (576) T ss_pred CCCCCccccHHHHHHHHHHHHHhcCCeEEEEEEecCCCCceEEEEEeCCceeEEEECCCCce------------------ Confidence 1 1345566777788889999999888753 22 356678888877653222211 Q ss_pred EEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccc Q lcl|NC_019418. 181 YYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNK 260 (527) Q Consensus 181 ~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~ 260 (527) |....++.. ..++. +... | + +.++ .. +++++..+ T Consensus 216 -~~~~~~~~~-----------~~~~~--~~~~-~------------~--------~~di-------i~--~~~~~~~d-- 249 (576) T protein:vir:96 216 -IKGGKRFVQ-----------VINKK--VVAS-F------------T--------SREM-------AM--GIRNPRTE-- 249 (576) T ss_pred -eeeeeEEEE-----------ecCCc--eEEE-e------------c--------ccce-------EE--EeecCCCC-- Confidence 100001000 00000 0000 0 0 0010 01 12222111 Q ss_pred cCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCcc---cccccccccccceeeecc-CC Q lcl|NC_019418. 261 DINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGN---IAFKRRFDVEQNVYMQVG-AG 336 (527) Q Consensus 261 ~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~---~~~~~~~d~~~~~~~~~~-~~ 336 (527) ....++|+|.+.-+...|.....+-.-..+-|..|.. |..+|....+..-.+ -.+...| +..|.+.+ .+ T Consensus 250 ~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~----p~giL~~~~~~~ls~e~~~~lr~~~---~~~~~G~~nag 322 (576) T protein:vir:96 250 LSSSGYGLSEVEIAMKQFIAYNNTETFNDRFFSHGGT----TRGILQIKSEQQQSQRALENFKREW---KSSFSGINGSW 322 (576) T ss_pred cccCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCC----CceEEEeCCCCCCCHHHHHHHHHHH---HHHhccccccc Confidence 1134579999988877776554443333333455432 222222111110000 0000111 11122211 00 Q ss_pred -----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHH-HHHHHHH Q lcl|NC_019418. 337 -----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNS-IVALVEQ 410 (527) Q Consensus 337 -----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~-~~~~~~~ 410 (527) -.+...++.++......++.+..+...++|+...|++|..+|+...+.++++.-.. +-++..+.. .+..++. T Consensus 323 ~~p~vl~~G~~~~~ls~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~--s~t~sn~e~~~~~f~~~ 400 (576) T protein:vir:96 323 QVPVVMADDIKFVNMTPTANDMQFEKWLTYLINIISALYGIDPAEIGFPNRGGATGGKGGN--TLNEADPGKKQQQSQNK 400 (576) T ss_pred cceeecCCCceEEeccCChhhHHHHHHHHHhHHHHHHHhCCCHHHcccccccccccccccc--ccccccHHHHHHHHHHH Confidence 01223466666677788999999999999999999999999876544332211100 001111111 1122344 Q ss_pred HHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHH-HHHH-- Q lcl|NC_019418. 411 SIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLGITEEE-AEKE-- 487 (527) Q Consensus 411 al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~dee-a~~e-- 487 (527) +|.-++..|....+. .|.. ....++.+.|...-..+..+ .........+|+|++-+++..+ |+..-+ ...- T Consensus 401 tL~P~~~~ie~~ln~-~Ll~---~~~~~~~~~f~r~d~~~~~e-~~~~~~~~~~G~lT~NE~R~~~-gl~piegGD~~~~ 474 (576) T protein:vir:96 401 GLQPLLRFIEDLINT-HIIS---EYSDKYVFQFVGGDTKSELD-KIKILQEEVKTYKTVNEARKEK-GLKPIEGGDVLLD 474 (576) T ss_pred HHHHHHHHHHHHHHh-hhch---hccCceEEEeccCCHHHHHH-HHHHHHHHhcCccCHHHHHHHh-CCCCCCCcceecc Confidence 444444444322211 1111 11234567786643333222 2233344557999999976554 432100 0000 Q ss_pred ---HHHH----HH-------hcccccc----cccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 488 ---LAEI----NG-------ELPPESD----AELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 488 ---l~ri----~~-------E~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) +..+ +. +++.... ...+.+..+.+... ++.+++....|+ T Consensus 475 ~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~s~-~~~~~g~~~~~~ 531 (576) T protein:vir:96 475 GSFIQSMSLNTQKEQYEDTKQKERFDMIQQFLNSPDDEEPQQEST-EDKVDGRESNDP 531 (576) T ss_pred ccccccccccccCCCCCCccccccccccccccCCCCCCCCCCCCC-CCcccccccccC Confidence 0000 00 0000000 00000100111111 111222222211 No 201 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=97.33 E-value=9.3e-05 Score=42.67 Aligned_cols=420 Identities=10% Similarity=0.009 Sum_probs=149.6 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCc-------cccCHHHHHHHHHHHHHhcCCCcccccccccCcc-ccCceee Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPK-------VAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDR-KRRKMQH 72 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~-------i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~-~~~~~~~ 72 (527) |+||++++...+... ++-............ ...+ ..++ ..|..|....+. ...|.. ..+..+. T Consensus 1 M~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~---~~~~~g~~~~~~--~~~g~~v~~~~a~~ 71 (466) T protein:vir:81 1 MRLIDRLLSTRGAAP-RMSIDDYAQMLNEFAFNGIGYGFGGG---VPRI---QQTLAGPSTELA--PDTFVGLATQAYQA 71 (466) T ss_pred CchhHHHhhccCccc-ccchhhhhhhhhhhhccccccccccc---cHHH---HHhhcccccccc--Cccccccchhhhhc Confidence 999999998876311 000000000000000 0000 0011 122222211111 001111 1111222 Q ss_pred cchHHHHHHHHhhhhhcccceEeeCCH-----HHHHHHHHHHhhh----hHHHHHHHHHHHHHhcCCEEEEEEEeCCe-- Q lcl|NC_019418. 73 LPIARTAAKKIASLVYNEQAEISAEDE-----TLNDFLSDMLSND----RFNKNFERYLESALALGGLAMRPYVDGDK-- 141 (527) Q Consensus 73 lnl~~~i~~~~A~ll~~e~~~i~~~d~-----~~~~~l~~~l~~n----~f~~~~~~~~~~a~~~G~~~~~~~~d~~~-- 141 (527) ..--..+++.+|+-+.+-|..+.-.++ .....+-.++.+. ......+.++.+.+..|.+++.+..++.+ T Consensus 72 ~~~v~~~i~~Ia~~ia~lp~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l 151 (466) T protein:vir:81 72 NGPVFACMLVRQLVFSSVRFRWQRLRDGKPSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFVRM 151 (466) T ss_pred cHHHHHHHHHHHHhhccCceEEEEecCCceeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCcccc Confidence 333345566666666555544322111 0111222233221 12233345566777789998888765321 Q ss_pred --------eEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEE Q lcl|NC_019418. 142 --------IRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNEL 213 (527) Q Consensus 142 --------~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~l 213 (527) ..+..++|+++.+.... ++. ...+| .+.. T Consensus 152 ~~~~~g~~~~l~~l~~~~v~~~~~~-~~~---------------~~~~y---~~~~------------------------ 188 (466) T protein:vir:81 152 RPDWVDVVVEERMVRGGRGELGGGQ-LGW---------------RKVGY---LYTE------------------------ 188 (466) T ss_pred ccccCcceeEEEEecCcceEEEEcC-CCc---------------eEEEE---EEEe------------------------ Confidence 22333444433332110 000 00011 0000 Q ss_pred EecCCccccC-ceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019418. 214 YKSTSDSQLG-ERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEI 292 (527) Q Consensus 214 y~~~~~~~lG-~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~ 292 (527) . ....| ..+. +.+. -..||+.. .|. .+...|+|.+.-+...|.....+-.-..+-| T Consensus 189 --~--~~~~~~~~~~-------~~~~---------dviHir~~-~~~--~d~~~G~s~i~~~~~~i~~~~a~~~~~~~~f 245 (466) T protein:vir:81 189 --G--GRQSGNESVG-------FLAE---------DVVHFAPI-PDP--LASYRGMSWLTPILREIRADQAMSKHQAKFF 245 (466) T ss_pred --c--Ccccccceee-------eccc---------cEEEEcCC-CCc--ccccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 00000 0000 0000 12355421 111 1233589988888777754333322222335 Q ss_pred HcCcceeeechhHhcCCCCCCC-cccccccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHH Q lcl|NC_019418. 293 KMGQRRVIVPEQMTQLKVQDNQ-GNIAFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLF 366 (527) Q Consensus 293 ~~~~~~i~v~~~~l~~~~~~~~-~~~~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i 366 (527) +.|.. |..++..+..... ..-.....| ...|.+.. .+ -.+...++.++......++.+..+....+| T Consensus 246 ~ng~~----p~gil~~~~~l~~e~~~~~~~~~---~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~I 318 (466) T protein:vir:81 246 DNGAT----VNLVIKHNPMADPAAVKKWADEV---NSKHAGVDNAWKNLNLYPGADADVVGSNLQEIDFKNVRGGGETRI 318 (466) T ss_pred hcCCC----cceEEecCCCCCHHHHHHHHHHH---HHHhcCccccccceEcCCCceEEEccCChhHHHHHHHHHHHHHHH Confidence 55332 2222222211110 000000111 11122211 00 012234666666667788888888889999 Q ss_pred HHhcCCCccccccccc-ccchHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeC Q lcl|NC_019418. 367 EMQIGVSSGMFTFDGQ-GVKTATEIVSENSD-TYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLD 444 (527) Q Consensus 367 ~~~~g~s~~~~~~~~~-g~~TAtei~s~~~~-~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~ 444 (527) +...|++|..+|...+ +..|...+...... ...++.-+...|+.+|... ++.. .......++|+ T Consensus 319 a~~fgVPp~~lG~~~~~~~st~sn~eq~~~~f~~~tl~P~~~~ie~~l~~~------------L~~~--~~~~~~~~~f~ 384 (466) T protein:vir:81 319 AAAAGVPPVIVGLSEGLAAATYSNYGQARRRLADGTAHPLWQNLSGCIGHV------------MPDM--GPDVRLWYDAD 384 (466) T ss_pred HHHhCCCHHHcccccCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHHhh------------cCCc--ccCcceEEEec Confidence 9999999999987644 22222222111111 1233333333333333221 1111 11223345554 Q ss_pred C--CccCCHHHHHHH-------HHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCC-CCCCCCCCC Q lcl|NC_019418. 445 D--GVFTDRHAELDY-------WMKMVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELAL-YGKGQQNTV 514 (527) Q Consensus 445 d--~i~~d~~~~~~~-------~~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~-~~~~~~~~~ 514 (527) . -+-.|..+..+. ...++.+|+ .+.+++....+-+.. .+.... ......+ +++..+.+. T Consensus 385 ~~~llr~d~~~r~~~~~~~~~~~~~~~~~g~-t~nE~r~~~~~gd~~-------~~~~~~---~~~~~~~~~~~~~~~~~ 453 (466) T protein:vir:81 385 DVPFLREDEKDAADIQKVRAETINTLITAGY-EPESVVAAVNSGDLR-------LLKHTG---LTSVQLLPPGVSASASS 453 (466) T ss_pred chhhhccCHHHHHHHHHHHHHHHHHHHHcCC-ChhhccccccCCccc-------cccCCC---cchhhhcccccccccCC Confidence 3 333454433322 223344443 333333211110000 000000 0000011 111111111 Q ss_pred CCCCCCCCccccC Q lcl|NC_019418. 515 GNSKDTVDDEDEA 527 (527) Q Consensus 515 ~~~~~~~~~~~~~ 527 (527) .+....++++++. T Consensus 454 ~~~~~~Gg~~ngn 466 (466) T protein:vir:81 454 DTPTSGGADDNGN 466 (466) T ss_pred CCcccCCCCcCCC Confidence 1112222222222 No 202 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=97.20 E-value=0.00013 Score=41.84 Aligned_cols=394 Identities=14% Similarity=0.107 Sum_probs=161.8 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchH--HH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIA--RT 78 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~--~~ 78 (527) ||||.+-+. + ++..+..- ...|.....+ +. ...+..... ...+..+ -. T Consensus 1 Mg~f~~~~~---r-----------------~~~~~~~~---~~~~~~~~~~---~~---~~~~~~~~~-~~al~~~~v~~ 50 (416) T protein:vir:45 1 MGIFYKNEK---R-----------------DLQYNEDD---LQMMVQTLPG---FQ---GTKLRQYKD-IEAIRHSDIFT 50 (416) T ss_pred CCccccccc---c-----------------cccCCCcc---hhHHHHHhcc---cc---ccCccccch-hhhhcchHHHH Confidence 999864321 0 01111000 1111111111 00 000000000 0011111 12 Q ss_pred HHHHHhhhhhcccceEeeCCHH-HHHHHHHHHh--hhh---HHHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEEEcCC Q lcl|NC_019418. 79 AAKKIASLVYNEQAEISAEDET-LNDFLSDMLS--NDR---FNKNFERYLESALALGGLAMRPYVDG-DK-IRVAFIQAP 150 (527) Q Consensus 79 i~~~~A~ll~~e~~~i~~~d~~-~~~~l~~~l~--~n~---f~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~v~a~ 150 (527) .++.+|+-+-+=|..+.-++.. ....+..+|. -|. .....+..+...+..|.+++.+..+. |. ..+..++|+ T Consensus 51 cv~~Ia~~iA~~p~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~ 130 (416) T protein:vir:45 51 AVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTS 130 (416) T ss_pred HHHHHHHhhccCceEEecCccccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCc Confidence 3444444444434333222211 1122223332 111 12223445566677899988888775 33 356778888 Q ss_pred ceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeeccc Q lcl|NC_019418. 151 VFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSE 230 (527) Q Consensus 151 ~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~ 230 (527) ++-++. +.++.+ +|.. . +. ....+... +.|. T Consensus 131 ~v~v~~-~~~g~~-----------------~~~~-~-~~-------------~~~~~~~~-~~~~--------------- 161 (416) T protein:vir:45 131 EIELKS-DARGRL-----------------YYFH-Q-RI-------------DSNGNNIE-RNVK--------------- 161 (416) T ss_pred eeEEEE-CCCccE-----------------EEEE-E-Ee-------------cCCCceeE-EEEc--------------- Confidence 776543 333321 1110 0 00 00000000 0010 Q ss_pred ccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHH-HHcC-cceeeechhHhcC Q lcl|NC_019418. 231 LYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWE-IKMG-QRRVIVPEQMTQL 308 (527) Q Consensus 231 ~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e-~~~~-~~~i~v~~~~l~~ 308 (527) +. -+.||+.... +...|.|.+.-+...++.... ...+... |+.| .++.++ .. T Consensus 162 -----~~----------evihir~~~~-----d~~~G~s~i~~~~~~i~~~~~-~~~~~~~~f~ng~~~~gil-----~~ 215 (416) T protein:vir:45 162 -----FE----------DMLDIKFYSL-----DGINGLSLLDTLSRTIESDNN-GKDFLNNFLRNGTHAGGIL-----KM 215 (416) T ss_pred -----cc----------cEEEeccCCC-----CCccccCHHHHHHHHHHHHHH-HHHHHHHHHhccCCCcEEE-----Ee Confidence 00 0234543211 234689988888877764433 3444433 4543 333332 21 Q ss_pred CCCCCC--cccccccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccc Q lcl|NC_019418. 309 KVQDNQ--GNIAFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDG 381 (527) Q Consensus 309 ~~~~~~--~~~~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~ 381 (527) ...... ........|. ..|.+.. .+ -.++..++.++....+.++.+......++|+...|++|..+|.+. T Consensus 216 ~~~~~~~~~~~~~~~~~~---~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~ 292 (416) T protein:vir:45 216 KGVLDNKKARDRAREEFH---KSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIET 292 (416) T ss_pred CCCCCCHHHHHHHHHHHH---HHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC Confidence 111110 0000001111 1111110 00 012224566666677778888888888899999999999998654 Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHH Q lcl|NC_019418. 382 QGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKM 461 (527) Q Consensus 382 ~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~ 461 (527) ++. +.++.... +..+|..++..|....+. .++.. .....+.++++.-...|..+.++...++ T Consensus 293 ~~~-~~~~~~~~--------------~~~~l~P~~~~ie~~ln~-~l~~~--~~~~~~~f~~~~l~~~D~~~~~~~~~~~ 354 (416) T protein:vir:45 293 ANM-SITDANLD--------------YLSTLKPYITCVCAELNF-KFNDE--YVNREFKFDTTEIRVVDEKTQAEIDKIN 354 (416) T ss_pred CCc-cHHHHHHH--------------HHHHHHHHHHHHHHHHhh-hcccc--ccCceEEEechhhhccCHHHHHHHHHHH Confidence 432 11222111 112233333332221110 11111 1234566666666677889999999999 Q ss_pred HhcCCCCHHHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCC--CCCCCCCCCCcccc Q lcl|NC_019418. 462 VAAGFATQKRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQN--TVGNSKDTVDDEDE 526 (527) Q Consensus 462 ~~aGi~s~~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~ 526 (527) +.+|+|++-+++.++ |++.-+ -.+.+-.+.....+- +..++.+.+ ...+....+|++.| T Consensus 355 ~~~G~~T~NE~R~~~-gl~p~~~gd~~~~~~~~n~~~~-----~~~~~~~~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:45 355 IDSGKMNIDEIRQRD-GLAPIPGGNGSIHRVDLNHVNI-----ELVDEYQMNKSRATDKKLKGGEENE 416 (416) T ss_pred HhCCCcCHHHHHHHh-CCCCCCCCCcceEeeccccccc-----ccccccCcccccccccccCCCCCCC Confidence 999999999977654 543210 000000111111110 011111111 11112234444444 No 203 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=97.20 E-value=0.00013 Score=41.84 Aligned_cols=394 Identities=14% Similarity=0.107 Sum_probs=161.8 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchH--HH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIA--RT 78 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~--~~ 78 (527) ||||.+-+. + ++..+..- ...|.....+ +. ...+..... ...+..+ -. T Consensus 1 Mg~f~~~~~---r-----------------~~~~~~~~---~~~~~~~~~~---~~---~~~~~~~~~-~~al~~~~v~~ 50 (416) T protein:vir:81 1 MGIFYKNEK---R-----------------DLQYNEDD---LQMMVQTLPG---FQ---GTKLRQYKD-IEAIRHSDIFT 50 (416) T ss_pred CCccccccc---c-----------------cccCCCcc---hhHHHHHhcc---cc---ccCccccch-hhhhcchHHHH Confidence 999864321 0 01111000 1111111111 00 000000000 0011111 12 Q ss_pred HHHHHhhhhhcccceEeeCCHH-HHHHHHHHHh--hhh---HHHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEEEcCC Q lcl|NC_019418. 79 AAKKIASLVYNEQAEISAEDET-LNDFLSDMLS--NDR---FNKNFERYLESALALGGLAMRPYVDG-DK-IRVAFIQAP 150 (527) Q Consensus 79 i~~~~A~ll~~e~~~i~~~d~~-~~~~l~~~l~--~n~---f~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~v~a~ 150 (527) .++.+|+-+-+=|..+.-++.. ....+..+|. -|. .....+..+...+..|.+++.+..+. |. ..+..++|+ T Consensus 51 cv~~Ia~~iA~~p~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~ 130 (416) T protein:vir:81 51 AVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTS 130 (416) T ss_pred HHHHHHHhhccCceEEecCccccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCc Confidence 3444444444434333222211 1122223332 111 12223445566677899988888775 33 356778888 Q ss_pred ceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeeccc Q lcl|NC_019418. 151 VFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSE 230 (527) Q Consensus 151 ~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~ 230 (527) ++-++. +.++.+ +|.. . +. ....+... +.|. T Consensus 131 ~v~v~~-~~~g~~-----------------~~~~-~-~~-------------~~~~~~~~-~~~~--------------- 161 (416) T protein:vir:81 131 EIELKS-DARGRL-----------------YYFH-Q-RI-------------DSNGNNIE-RNVK--------------- 161 (416) T ss_pred eeEEEE-CCCccE-----------------EEEE-E-Ee-------------cCCCceeE-EEEc--------------- Confidence 776543 333321 1110 0 00 00000000 0010 Q ss_pred ccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHH-HHcC-cceeeechhHhcC Q lcl|NC_019418. 231 LYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWE-IKMG-QRRVIVPEQMTQL 308 (527) Q Consensus 231 ~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e-~~~~-~~~i~v~~~~l~~ 308 (527) +. -+.||+.... +...|.|.+.-+...++.... ...+... |+.| .++.++ .. T Consensus 162 -----~~----------evihir~~~~-----d~~~G~s~i~~~~~~i~~~~~-~~~~~~~~f~ng~~~~gil-----~~ 215 (416) T protein:vir:81 162 -----FE----------DMLDIKFYSL-----DGINGLSLLDTLSRTIESDNN-GKDFLNNFLRNGTHAGGIL-----KM 215 (416) T ss_pred -----cc----------cEEEeccCCC-----CCccccCHHHHHHHHHHHHHH-HHHHHHHHHhccCCCcEEE-----Ee Confidence 00 0234543211 234689988888877764433 3444433 4543 333332 21 Q ss_pred CCCCCC--cccccccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccc Q lcl|NC_019418. 309 KVQDNQ--GNIAFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDG 381 (527) Q Consensus 309 ~~~~~~--~~~~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~ 381 (527) ...... ........|. ..|.+.. .+ -.++..++.++....+.++.+......++|+...|++|..+|.+. T Consensus 216 ~~~~~~~~~~~~~~~~~~---~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~ 292 (416) T protein:vir:81 216 KGVLDNKKARDRAREEFH---KSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIET 292 (416) T ss_pred CCCCCCHHHHHHHHHHHH---HHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC Confidence 111110 0000001111 1111110 00 012224566666677778888888888899999999999998654 Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHH Q lcl|NC_019418. 382 QGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKM 461 (527) Q Consensus 382 ~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~ 461 (527) ++. +.++.... +..+|..++..|....+. .++.. .....+.++++.-...|..+.++...++ T Consensus 293 ~~~-~~~~~~~~--------------~~~~l~P~~~~ie~~ln~-~l~~~--~~~~~~~f~~~~l~~~D~~~~~~~~~~~ 354 (416) T protein:vir:81 293 ANM-SITDANLD--------------YLSTLKPYITCVCAELNF-KFNDE--YVNREFKFDTTEIRVVDEKTQAEIDKIN 354 (416) T ss_pred CCc-cHHHHHHH--------------HHHHHHHHHHHHHHHHhh-hcccc--ccCceEEEechhhhccCHHHHHHHHHHH Confidence 432 11222111 112233333332221110 11111 1234566666666677889999999999 Q ss_pred HhcCCCCHHHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCC--CCCCCCCCCCcccc Q lcl|NC_019418. 462 VAAGFATQKRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQN--TVGNSKDTVDDEDE 526 (527) Q Consensus 462 ~~aGi~s~~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~ 526 (527) +.+|+|++-+++.++ |++.-+ -.+.+-.+.....+- +..++.+.+ ...+....+|++.| T Consensus 355 ~~~G~~T~NE~R~~~-gl~p~~~gd~~~~~~~~n~~~~-----~~~~~~~~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:81 355 IDSGKMNIDEIRQRD-GLAPIPGGNGSIHRVDLNHVNI-----ELVDEYQMNKSRATDKKLKGGEENE 416 (416) T ss_pred HhCCCcCHHHHHHHh-CCCCCCCCCcceEeeccccccc-----ccccccCcccccccccccCCCCCCC Confidence 999999999977654 543210 000000111111110 011111111 11112234444444 No 204 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=97.14 E-value=0.00016 Score=41.45 Aligned_cols=431 Identities=10% Similarity=0.093 Sum_probs=174.5 Q ss_pred hHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHHH Q lcl|NC_019418. 3 LIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKK 82 (527) Q Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~ 82 (527) |=+.++.-.- +-.+.+.+..+ .+..++......|+.++.---|.+ + ...+.....++.==+.+...++. T Consensus 1 ~~~~~~~~~~-----~~~~~l~~r~~----~L~~~R~~~e~~w~e~a~~~lP~~-~-~~~~~~~~~~~~~dstg~~a~~~ 69 (516) T protein:vir:96 1 MKQSIDLEYG-----GKRSKIPKLWE----KFSNKRSSFLDRAKHYSKLTLPYL-M-NDKGDNETSQNGWQGVGAQATNH 69 (516) T ss_pred Ccchhhhhhh-----hhHHHHHHHHH----HHHHHhhHHHHHHHHHHHhhcccc-c-CCCCCccccCCcccchHHHHHHH Confidence 1011100000 00111111111 123334455666777665433322 1 11122111222212456677777 Q ss_pred Hhhhhhcc--cc-----eEeeCCH-------------HHHH-------HHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEE Q lcl|NC_019418. 83 IASLVYNE--QA-----EISAEDE-------------TLND-------FLSDMLSNDRFNKNFERYLESALALGGLAMRP 135 (527) Q Consensus 83 ~A~ll~~e--~~-----~i~~~d~-------------~~~~-------~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~ 135 (527) +|+-|.+- || ++.+++. ...+ .+...|..++|...+.++..+....|.+.+ T Consensus 70 LAa~l~~~ltpp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l-- 147 (516) T protein:vir:96 70 LANKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCML-- 147 (516) T ss_pred HHHHHHhhhcCCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeE-- Confidence 77755443 11 2333221 1233 344577788999999999999999999875 Q ss_pred EEeCCeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEe---------------------eCCCcceEEEEEEEEeecccc Q lcl|NC_019418. 136 YVDGDKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIK---------------------TENRKNVYYTLVEFHEWVTPT 194 (527) Q Consensus 136 ~~d~~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~---------------------~~~~~~~~yt~lE~h~~~~~~ 194 (527) |.|... .+..++-.+++ +..|..|++..++...... ..+.....||+++++. T Consensus 148 ~~d~~~-~~~~~pl~~y~-v~~d~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~----- 220 (516) T protein:vir:96 148 YKPSKG-AISAIPMHHYV-VNRDTNGDLLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHAKYLG----- 220 (516) T ss_pred EecCCC-CEEEEEcCeEE-EeeCCCCCeeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEeeeeeC----- Confidence 455432 25566666655 4567777666655322100 0011122333333321 Q ss_pred cccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCC--CcccEEEecCCccccccCCCccCcchhh Q lcl|NC_019418. 195 GQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGL--SRPLFTYLKTPGMNNKDINSPLGLSIFD 272 (527) Q Consensus 195 ~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~--~~p~f~~~~~~~~N~~~~~splG~S~~~ 272 (527) +.+ +.+|...+...+|. ..+. ..-+|..++- +...++.||+|--. T Consensus 221 ----------~~~---~~~~~~~d~~~~~~----------------es~~~~~e~P~~~~Rw----~~~~ge~YGrgp~~ 267 (516) T protein:vir:96 221 ----------DGF---WELKQSADDIPVGK----------------VSKIKSEKLPFIPLTW----KRSYGEDWGRPLAE 267 (516) T ss_pred ----------Cce---eEEEEEeCceeecc----------------ccccccccCCeeeeee----eecCCCCcccchHH Confidence 111 12222211111111 1111 1112222221 22346789999999 Q ss_pred hhHHHHHHHHHHHHHHHHHH-HcCcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEec--cc Q lcl|NC_019418. 273 NAKTTIDFINRTYDEFMWEI-KMGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLT--TP 349 (527) Q Consensus 273 ~~~~lid~ld~~~s~~~~e~-~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~--~~ 349 (527) .+.+-++.|+..--....-. ...+....||++.+....+...+ ....+.+ +....+..++ +. T Consensus 268 ~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~~----------~~g~i~~-----g~~~~v~~~q~~~~ 332 (516) T protein:vir:96 268 DYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVNS----------GTGEVVT-----GVEEDIHIVQLGKY 332 (516) T ss_pred HhhHHHHHHHHHHHHHHHHHHHhcCCccccCcccccchhhhccC----------CCceeec-----CCcccceeeecCcc Confidence 99999999996555544433 34556666644432111110000 0011111 1111222222 21 Q ss_pred cChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhcc Q lcl|NC_019418. 350 IRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKVVGI 428 (527) Q Consensus 350 ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~~~~ 428 (527) .....-...++.+...|....=+. .+....+...|||||....+...+..+-.-..+ ...|..|+..++.... T Consensus 333 ~d~~~~~~~i~~~~~rI~~af~~~--~l~~r~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~~---- 406 (516) T protein:vir:96 333 ADLTPISAVLEVYTRRIGVVFMME--TMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAG---- 406 (516) T ss_pred cchhHHHHHHHHHHHHHHHHHhhh--hhccCCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhcC---- Confidence 222333344444444443322111 122223344799999988888887766533333 2344555555443221 Q ss_pred cCCcccCccceEEEeCCCccCCHHHHH------HHHHHHHh--cCC-------CCHHHHHH---hcCCC------CHHHH Q lcl|NC_019418. 429 YRGTIPELDDISVNLDDGVFTDRHAEL------DYWMKMVA--AGF-------ATQKRGIA---KTLGI------TEEEA 484 (527) Q Consensus 429 ~~~~~~~~~~v~v~f~d~i~~d~~~~~------~~~~~~~~--aGi-------~s~~~~i~---~~~~~------~deea 484 (527) ...+ ...+.++.-.++ +..... .+..+.++ +++ +....++. ...|+ +++|+ T Consensus 407 --p~lp-~~~v~~~~vs~l--~~l~r~~~~~~i~~~~~~i~~~~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~irs~eev 481 (516) T protein:vir:96 407 --ESFT-SDLVDPVIITGI--EALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEM 481 (516) T ss_pred --CCCc-cccccceeechH--HHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHhCCCccccCCHHHH Confidence 1111 222233221111 111111 11111110 111 11223322 22343 45565 Q ss_pred HHHHHHHHHhcccc------cccccC-CCCCCCCC Q lcl|NC_019418. 485 EKELAEINGELPPE------SDAELA-LYGKGQQN 512 (527) Q Consensus 485 ~~el~ri~~E~~~~------~~~~~~-~~~~~~~~ 512 (527) +++.++.++.+... .+..++ .....++. T Consensus 482 ~~~~~~~~~~q~~~~~a~~~~~~~~~~~~~~~~~~ 516 (516) T protein:vir:96 482 AQEQEAQMQAQQAQMLEEGVAKAVPGVIQQELKEA 516 (516) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhHHhhcccccC Confidence 55443322221111 111111 11111121 No 205 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=96.99 E-value=0.00022 Score=40.64 Aligned_cols=408 Identities=11% Similarity=0.096 Sum_probs=161.2 Q ss_pred hHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHH-HHhcCCCcccccccccCccccCceeecchH--HHH Q lcl|NC_019418. 3 LIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNL-AYYQSKFDDIEYTNTDGDRKRRKMQHLPIA--RTA 79 (527) Q Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~-~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~--~~i 79 (527) ||+..++- +...+ ....+++.-...+-.|. .-|.| .+..|... .....+..+ ... T Consensus 1 ~~~~~~~~----------~~~~~----~~~~~~~~~~~~~~~~~~~~~~g-------~~~~g~~v-~~~~al~~~~V~~~ 58 (454) T protein:vir:93 1 MWNLLRRT----------RKNQK----SGRDVREAGWTSLFQAVAEPFAG-------AWQQGVKA-DPEAVLSFHAVFAC 58 (454) T ss_pred CCCccccC----------ccccc----ccccccchhhhhhhhhhhhhhcc-------hhhcCccc-ChHHhhccHHHHHH Confidence 33333221 11111 11111111111111111 11211 11111110 111112222 224 Q ss_pred HHHHhhhhhcccceEe-eC-C---H-HHHHHHHHHHhhh----hHHHHHHHHHHHHHhcCCEEEEEEEeC-Cee-EEEEE Q lcl|NC_019418. 80 AKKIASLVYNEQAEIS-AE-D---E-TLNDFLSDMLSND----RFNKNFERYLESALALGGLAMRPYVDG-DKI-RVAFI 147 (527) Q Consensus 80 ~~~~A~ll~~e~~~i~-~~-d---~-~~~~~l~~~l~~n----~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~-~i~~v 147 (527) ++.+|+-+-+=|..+- .+ + . .....+..++.+. ....-++.++.+.+..|.+++.+..+. |++ .+..+ T Consensus 59 v~~Ia~~iA~lp~~~~~~~~~g~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~i 138 (454) T protein:vir:93 59 ISLISQDIAKMRLRLMQTDAQGIRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRIL 138 (454) T ss_pred HHHHHHhhccCceEEEEeccCCccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEE Confidence 4444444444344331 11 1 0 1111222233221 122334556667788899999888864 343 56667 Q ss_pred cCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceee Q lcl|NC_019418. 148 QAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVN 227 (527) Q Consensus 148 ~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~ 227 (527) +|+++-++.. .++ ..+|+.. .. .....|..+. T Consensus 139 ~~~~v~v~~~-~~g-----------------~~~y~~~-~~-----------------------------~~~~~~~~~~ 170 (454) T protein:vir:93 139 DWNRVEPLVA-DDG-----------------EVFYRIT-PD-----------------------------RNCGITEAVT 170 (454) T ss_pred cCcceEEEEc-CCC-----------------cEEEEEE-ec-----------------------------cccccceeEE Confidence 7777665422 222 1222110 00 0000000000 Q ss_pred cccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhc Q lcl|NC_019418. 228 LSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQ 307 (527) Q Consensus 228 l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~ 307 (527) + + +. -..||+.+.. .+..+|+|.+..+...+.....+-....+-|..|.. |..++. T Consensus 171 ~---~---~~----------eViH~k~~~~----~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~----p~gil~ 226 (454) T protein:vir:93 171 V---P---AR----------EVIHDRFNCF----FHPLIGLPPVYAAGLAATQGHHIQENSTSFFRNGGR----PSGVIE 226 (454) T ss_pred e---c---Cc----------ceEEeccCCC----CCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCC----ccEEEe Confidence 0 0 00 1245553211 123468998888777776444433333334555332 112222 Q ss_pred CCCCCCCc-ccccccccccccceeeeccCCC----CCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccccc Q lcl|NC_019418. 308 LKVQDNQG-NIAFKRRFDVEQNVYMQVGAGN----MDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQ 382 (527) Q Consensus 308 ~~~~~~~~-~~~~~~~~d~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~ 382 (527) ........ .-.....| +..|.+-+.+. .++..++.++..-...++.+.......+|+...|++|..+|...+ T Consensus 227 ~~~~l~~e~~~~~~~~~---~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~ 303 (454) T protein:vir:93 227 IPGSITEENAKKLKSNW---DSGYTGENAGKTAILSNGAKYNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQP 303 (454) T ss_pred cCCCCCHHHHHHHHHHH---HHHhcccccCCceeccCCceEEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC Confidence 11111000 00000111 11122211110 122245555656667788888888888999999999999987554 Q ss_pred cc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHH Q lcl|NC_019418. 383 GV-KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKM 461 (527) Q Consensus 383 g~-~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~ 461 (527) .. .++++. .+..++.+|.-++..|-...+. .+.. .....+.+++++-+..|..+.++...++ T Consensus 304 ~t~sn~e~~-------------~~~f~~~~l~P~~~~ie~~ln~-~L~~---~~~~~~~f~~~~ll~~D~~~r~~~~~~~ 366 (454) T protein:vir:93 304 PSSDNVEAL-------------EQQYYSQCLQTLIESIELLLDE-ALET---GENESTEFDVTTLLRMDSERRMKTLGDA 366 (454) T ss_pred CcchhHHHH-------------HHHHHHHHHHHHHHHHHHHHHH-hhcC---CCCcEEEeechhhhccCHHHHHHHHHHH Confidence 32 222222 1111223333333332211111 1111 1234466777777778889999999999 Q ss_pred HhcCCCCHHHHHHhcCCCCH----HHH--HH---HHHHHHHhcccccccccCCCCCCCCCCCCC---CCCCCCccccC Q lcl|NC_019418. 462 VAAGFATQKRGIAKTLGITE----EEA--EK---ELAEINGELPPESDAELALYGKGQQNTVGN---SKDTVDDEDEA 527 (527) Q Consensus 462 ~~aGi~s~~~~i~~~~~~~d----eea--~~---el~ri~~E~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~ 527 (527) +.+|+|++-+++..+ |+.. |+. .. -+..+.+....+ ......++....+... +.+....++|. T Consensus 367 ~~~G~~T~NE~R~~~-gl~pi~ggD~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~d~~~~~~e~~~ 441 (454) T protein:vir:93 367 VKNTLLTPNEARKRE-NLPPLAGGDALYLQQQNYSLEALSRRDARE--DPFASSGKTASVPQAVAASDGNKAITETEH 441 (454) T ss_pred HhCCCcCHHHHHHHh-CCCCCCCCCeeeeccCccchHhhhccCccc--CCCCCCccCCCCCCCCCCCCCCCCccCCcc Confidence 999999999976553 4432 110 00 011111111111 1111111111111110 11111111111 No 206 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=96.96 E-value=0.00024 Score=40.46 Aligned_cols=431 Identities=11% Similarity=0.117 Sum_probs=169.7 Q ss_pred hHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHHH Q lcl|NC_019418. 3 LIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKK 82 (527) Q Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~ 82 (527) |-+. .++-++ -.+.+.+... .+..+.......|+.++.---|.+ ....+......+.==+.+...++. T Consensus 1 ~~~~---~~~~~~---~~~~l~~r~~----~Lk~~R~~~e~~w~e~~~~tlP~~--~~~~~~~~~~~~~~dstg~~a~~~ 68 (515) T protein:vir:70 1 MQDT---ILEYGG---QRSKIPKLWE----KFSKKRSPYLDRAKHFAKLTLPYL--MNNKGDNETSQNGWQGVGAQATNH 68 (515) T ss_pred Ccch---hhhhcC---CHHHHHHHHH----HHHHhhhHHHHHHHHHHHHhcccc--cCCCCCcccccccccchHHHHHHH Confidence 1000 000000 0111111110 112233344556766655433322 111121111221112455667777 Q ss_pred Hhhhhhcc--cc-----eEeeCCH-------------HHHHH-------HHHHHhhhhHHHHHHHHHHHHHhcCCEEEEE Q lcl|NC_019418. 83 IASLVYNE--QA-----EISAEDE-------------TLNDF-------LSDMLSNDRFNKNFERYLESALALGGLAMRP 135 (527) Q Consensus 83 ~A~ll~~e--~~-----~i~~~d~-------------~~~~~-------l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~ 135 (527) +|+-|.+- || ++.+++. ..+++ +...+..++|...+.++..+....|.+.+ T Consensus 69 LAa~l~~~ltpp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l-- 146 (515) T protein:vir:70 69 LANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL-- 146 (515) T ss_pred HHHHHHHhhcCCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEEE-- Confidence 77654443 11 1222221 12233 34457788999999999999999999865 Q ss_pred EEeCCeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEee---------------------CCCcceEEEEEEEEeecccc Q lcl|NC_019418. 136 YVDGDKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKT---------------------ENRKNVYYTLVEFHEWVTPT 194 (527) Q Consensus 136 ~~d~~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~---------------------~~~~~~~yt~lE~h~~~~~~ 194 (527) |.|... .+..++-.+++ +..|..|++..++....... .++...+||++++.. T Consensus 147 ~~d~~~-~~~~~pl~~y~-v~~d~~G~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~v~~~~----- 219 (515) T protein:vir:70 147 YKPSKG-AMSAVPMHHYV-VNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAG----- 219 (515) T ss_pred EEeCCC-CeEEEEcCeEE-EeeCCCcCeeEEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEEEEecC----- Confidence 455432 25566667755 45677777776654321110 011122333333321 Q ss_pred cccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCC--CcccEEEecCCccccccCCCccCcchhh Q lcl|NC_019418. 195 GQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGL--SRPLFTYLKTPGMNNKDINSPLGLSIFD 272 (527) Q Consensus 195 ~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~--~~p~f~~~~~~~~N~~~~~splG~S~~~ 272 (527) .++ +..|...++..+|. ..+. ...+|..++ =+...++.||+|--. T Consensus 220 ----------~~~---~~~~~e~d~~~~~~----------------es~y~~~e~P~~~~R----w~~~~ge~YGrgp~~ 266 (515) T protein:vir:70 220 ----------EGF---WKINQSADDIPVGK----------------ESRIKSEKLPFIPLT----WKRSYGEDWGRPLAE 266 (515) T ss_pred ----------CCc---eEEEEecCceeecc----------------ccccccccCCceeee----eeecCCCCcccchHH Confidence 111 11122111111111 1111 111222222 122346779999999 Q ss_pred hhHHHHHHHHHHHHHHHHHH-HcCcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEec--cc Q lcl|NC_019418. 273 NAKTTIDFINRTYDEFMWEI-KMGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLT--TP 349 (527) Q Consensus 273 ~~~~lid~ld~~~s~~~~e~-~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~--~~ 349 (527) .+.+-++.|+..--....-. ...++.+.||++......+...+ ....+.+ +....+..++ +. T Consensus 267 ~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~l~~~----------~~g~iv~-----g~~~~v~~~~~~~~ 331 (515) T protein:vir:70 267 DYSGDLFVIQFLSEAMARGAALMADIKYLIRPGSQTDVDHFVNS----------GTGEVIT-----GVAEDIHIVQLGKY 331 (515) T ss_pred HhhHHHHHHHHHHHHHHHHHHHhcCCCeeeCcccccchhhcccc----------CCceeec-----CCcccceeeecCcc Confidence 99999999997665555433 45666667755543211111000 0011111 1111222222 21 Q ss_pred cChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhhhcc Q lcl|NC_019418. 350 IRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVE-QSIKELCVSMCELGKVVGI 428 (527) Q Consensus 350 ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~-~al~~li~~il~~~~~~~~ 428 (527) .....-...++.+.+.|....=+. ++....+...|||||....+...+..+-.-..+. ..|..|+..++. +. T Consensus 332 ~d~~~~~~~i~~~~~rI~~af~~~--~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~srL~~Ell~Pli~r~~~-----~~ 404 (515) T protein:vir:70 332 ADLTPISAVLEVYTRRIGVIFMME--TMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ-----EA 404 (515) T ss_pred cchhHHHHHHHHHHHHHHHHHhhh--hhhccCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH-----hh Confidence 222333344444444443322111 1111122347999999888887766654333332 223344433321 11 Q ss_pred cCCcccCccceEEEeCCCccCCHHHH---HHHH---HHHHh--cCC-------CCHHHHHH---hcCC----C--CHHHH Q lcl|NC_019418. 429 YRGTIPELDDISVNLDDGVFTDRHAE---LDYW---MKMVA--AGF-------ATQKRGIA---KTLG----I--TEEEA 484 (527) Q Consensus 429 ~~~~~~~~~~v~v~f~d~i~~d~~~~---~~~~---~~~~~--aGi-------~s~~~~i~---~~~~----~--~deea 484 (527) ....+... +.++.-.+ .+.... ++.+ .+.++ +++ +....++. ...| + |++|+ T Consensus 405 ~p~~P~~~--v~~~~vs~--l~~L~r~q~~~~i~~~~q~i~~~~~~~p~~~~~id~d~~~~~~a~~~g~p~~~~rs~eev 480 (515) T protein:vir:70 405 GDSFTSEL--VDPVIVTG--IEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEM 480 (515) T ss_pred CCCCChhh--cccceehh--HHHHHHHHHHHHHHHHHHHHHHHhccChhHHhhCCHHHHHHHHHHHhCCCccccCCHHHH Confidence 11111111 22222111 111111 1111 11111 111 12222222 2222 2 66777 Q ss_pred HHHHHHHHH-hcccc-----cccccCCCCCCCCCCCCCC Q lcl|NC_019418. 485 EKELAEING-ELPPE-----SDAELALYGKGQQNTVGNS 517 (527) Q Consensus 485 ~~el~ri~~-E~~~~-----~~~~~~~~~~~~~~~~~~~ 517 (527) +++.++.++ ++... .+..++.-++ +.-+. T Consensus 481 ~~~r~q~~~~~~~~~~~~~~~~a~~~~~~~----~~~~~ 515 (515) T protein:vir:70 481 QQEMAQQAQAQQEAMLNEGVAKAVPGVIQQ----EMKEG 515 (515) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhcccchhh----hhccC Confidence 665443222 11110 1111111100 00000 No 207 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=96.92 E-value=0.00026 Score=40.27 Aligned_cols=402 Identities=9% Similarity=-0.026 Sum_probs=155.7 Q ss_pred hHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchH--HHHH Q lcl|NC_019418. 3 LIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIA--RTAA 80 (527) Q Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~--~~i~ 80 (527) |...+.+.+++.- ....+ ....|..|.-.. +......+...... ..++.| -.++ T Consensus 1 ~~~~~~~~~~~~~---------------~~~~~-----~~~~~~~~~g~~---~~~~~~~~~~~~~~-~a~~~~~v~~~v 56 (460) T protein:vir:10 1 MANRIIRALRELT---------------GLDNK-----FNDAFIKYIGQT---FTKYDNNGKTYLEQ-GYNINPDVYSCI 56 (460) T ss_pred CchhHHHHHhhhh---------------ccCCC-----chHHHHHhhccc---cCCCccchhhhhHH-HHhcchHHHHHH Confidence 6666666665421 00111 123455544321 11111122111100 112222 2334 Q ss_pred HHHhhhhhcccceEeeC--CHHH-------------------------------HHHHHHHHhhh----hHHHHHHHHHH Q lcl|NC_019418. 81 KKIASLVYNEQAEISAE--DETL-------------------------------NDFLSDMLSND----RFNKNFERYLE 123 (527) Q Consensus 81 ~~~A~ll~~e~~~i~~~--d~~~-------------------------------~~~l~~~l~~n----~f~~~~~~~~~ 123 (527) +.+|+-+.+-|..+--. +... ...+..++... ......+.++. T Consensus 57 ~~ia~~iA~lp~~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~ 136 (460) T protein:vir:10 57 SQMAAKTVAVPYTIKVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKT 136 (460) T ss_pred HHHHHhhhhCceEEEeccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHH Confidence 44554444433322110 0000 01111112110 12233445666 Q ss_pred HHHhcCCEEEEEEEeC-----Cee-EEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeeccccccc Q lcl|NC_019418. 124 SALALGGLAMRPYVDG-----DKI-RVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQE 197 (527) Q Consensus 124 ~a~~~G~~~~~~~~d~-----~~~-~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~ 197 (527) ..+..|.+++.+..+. |.+ .+..++|+++-+...+ ++... +| ++.. T Consensus 137 ~lll~Gnay~~i~r~~~~~~~G~~~~L~~l~~~~v~v~~~~-~~~~~----------------~~---~~~~-------- 188 (460) T protein:vir:10 137 YMRLNGNCYFYLMSPDDGINAGVPSQMYVLPAHLIKIVLKD-DINLL----------------ST---DSPI-------- 188 (460) T ss_pred HHhhcCCeEEEEEecCCCccCceeEEEEEEcCceEEEEEcC-CCcee----------------ee---eeee-------- Confidence 7888899988877642 233 3566777776654222 21110 11 0000 Q ss_pred ceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCC-ccCcchhhhhHH Q lcl|NC_019418. 198 VGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINS-PLGLSIFDNAKT 276 (527) Q Consensus 198 ~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~s-plG~S~~~~~~~ 276 (527) ..|.+. . + |....+ ++. -..||+.+.+++.-.++ .+|+|.+.-+.. T Consensus 189 -------~~~~~~-----~---~--g~~~~~-------~~~---------evih~r~~~~~~~~~~~~~~G~sp~~~~~~ 235 (460) T protein:vir:10 189 -------KSYMLI-----Q---G--DQFIEF-------NED---------EVIHTKYANPNFDLQGSHLYGMSPIRAILR 235 (460) T ss_pred -------eEEEEe-----c---C--ceeEEe-------ccc---------ceEEEecCCCCcccccCccccccHHHHHHH Confidence 000000 0 0 110000 000 13466665555443333 369999888777 Q ss_pred HHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCcc-cccccccccccceeeecc-CC----CCCCCcceEecccc Q lcl|NC_019418. 277 TIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGN-IAFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPI 350 (527) Q Consensus 277 lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~-~~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~i 350 (527) .+.-....-....+-|+.|... ..++.......... -.....| ...|.+.+ .+ -+++..++.++... T Consensus 236 ~i~~~~~~~~~~~~~f~ng~~~----~~i~~~~~~l~~e~~~~~~~~~---~~~~~g~~n~g~~~vl~~g~~~~~l~~~~ 308 (460) T protein:vir:10 236 NINSQNSTIDNNVKTMQNGGVF----GFIHGGSTGLTQPQADSLKQRL---TEMDKSPDRLSQIAGASGEIAFTKISLNT 308 (460) T ss_pred HHHHHHHHHHHHHHHHhcCCCc----ceeeecCCCCCHHHHHHHHHHH---HHHhcCccccCCceecCCCceEEEccCCh Confidence 7766554433333445554322 11121111111000 0000001 11122110 00 01222455556556 Q ss_pred ChHHHHHHHHHHHHHHHHhcCCCcccccccccccc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_019418. 351 RSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVK---TATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVG 427 (527) Q Consensus 351 r~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~---TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~ 427 (527) ...++.+..+....+|+...|++|..+|...++.. ++.+.. .+-...++.-+...++.+|..-+ T Consensus 309 ~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~--~~f~~~~l~P~~~~ie~~ln~kl----------- 375 (460) T protein:vir:10 309 DELKPFDYLKYDQKAICNALGWSDKLLNNNEGGGLNTGNLEEER--KRVVTDNIQPDLVILKQAFDKKF----------- 375 (460) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCCccccHHHHH--HHHHHHHHHHHHHHHHHHHHHhh----------- Confidence 67788888888889999999999999987644322 222221 11111122222222333222210 Q ss_pred ccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCC---HHHHHHHHHHHHHhcccccccccC Q lcl|NC_019418. 428 IYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLGIT---EEEAEKELAEINGELPPESDAELA 504 (527) Q Consensus 428 ~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~---deea~~el~ri~~E~~~~~~~~~~ 504 (527) +..........+.++|+. +.. ..+.......++.+|+|++-+++... |+. ++...+-+.. .... T Consensus 376 ~~~~~~~~~~~i~~d~~~-l~~-l~~d~~~~~~~~~~g~~T~NE~R~~~-g~~pi~~~~gD~~~~~--~n~~-------- 442 (460) T protein:vir:10 376 IKRFKGYENAVIEWDISE-LPE-MQTDMVAMASWLNTIPVTPNEIRIAM-KYETLNQDGMDIVFMP--SNKV-------- 442 (460) T ss_pred cCcccccCCceEEeecch-hhh-HHHHHHHHHHHHhCCCCCHHHHHHHh-CCCCCCCCCCCeeeec--cccc-------- Confidence 011111223334444433 211 12233444567788999988866543 433 2211111000 0000 Q ss_pred CCCCCCCCCCCCCCCCCCc Q lcl|NC_019418. 505 LYGKGQQNTVGNSKDTVDD 523 (527) Q Consensus 505 ~~~~~~~~~~~~~~~~~~~ 523 (527) +-+...+...++.+|.+. T Consensus 443 -~~~~~~~~~~~~~~nq~~ 460 (460) T protein:vir:10 443 -RIDDVSNNLIDSAFNQNQ 460 (460) T ss_pred -chhhcccccCCCcccCCC Confidence 000011111222222222 No 208 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=96.82 E-value=0.00032 Score=39.75 Aligned_cols=392 Identities=13% Similarity=0.085 Sum_probs=149.2 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) =++++++|+.|- .+.+.++.... ..|..|. +.. +...... ..+...--...+ T Consensus 4 ~~~~~~~k~~~~---~~~~~~~~~~~----------------~~~~~~~-~~~----~~~v~~~----~a~~~~~v~~~i 55 (409) T protein:vir:94 4 ENIVTRIKKKLI---DNWIDQSASKL----------------YDFSPWK-NKS----FWGVINN----TLETNETIFSAI 55 (409) T ss_pred cccchhhhhHHh---hhhhcCCcccc----------------ccccccc-Ccc----ccccchh----hhhccHHHHHHH Confidence 234444444331 11112221111 1111111 110 0000000 011111112223 Q ss_pred HHHhhhhhcccceE----eeCCHHHHHHHHH-HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCC-e-eEEEEEcCCceE Q lcl|NC_019418. 81 KKIASLVYNEQAEI----SAEDETLNDFLSD-MLSNDRFNKNFERYLESALALGGLAMRPYVDGD-K-IRVAFIQAPVFL 153 (527) Q Consensus 81 ~~~A~ll~~e~~~i----~~~d~~~~~~l~~-~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~-~-~~i~~v~a~~~~ 153 (527) +.+|+-+-.-|..+ ...+..+...|.. --..-.-..-....+...+..|.+++.+..+.. . ..+.+++|+.+- T Consensus 56 ~~Ia~~ia~lp~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~ 135 (409) T protein:vir:94 56 TKLSNSMASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVE 135 (409) T ss_pred HHHHHhhhhCceeEeecccccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeE Confidence 33333333323222 1223333333321 111111222234456677788999888877643 3 355666777665 Q ss_pred EEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccC Q lcl|NC_019418. 154 PLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYP 233 (527) Q Consensus 154 P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~ 233 (527) ++..+.+ ..++|.. .. .. |..+.+ + T Consensus 136 v~~~~~~-----------------~~~~y~~-~~-----------------~~----------------g~~~~~---~- 160 (409) T protein:vir:94 136 MLIENQS-----------------RELYYSI-HA-----------------AT----------------GNKLIV---H- 160 (409) T ss_pred EEEeCCC-----------------cEEEEEE-Ec-----------------CC----------------ceEEEE---c- Confidence 5422211 1122210 00 00 111000 0 Q ss_pred CcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCC Q lcl|NC_019418. 234 DLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDN 313 (527) Q Consensus 234 ~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~ 313 (527) +. -..||+.+.+. +..+|+|.+.-+...++....+ ..+ +-...+...-+ ++..+.... T Consensus 161 --~~----------dvih~r~~~~~----~~~~G~s~l~~~~~~i~~~~~~-~~~-~~~~~~~~~~~----i~~~~~~l~ 218 (409) T protein:vir:94 161 --NM----------DMLHFKHIVAS----NMVQGISPIDVLKNTTDFDNAV-RTF-NLTEMQKPDSF----MLKYGSNVG 218 (409) T ss_pred --cc----------cEEEecCCCCC----CccccccHHHHHHHHHHHHHHH-HHH-HHHhcCCCCee----EEecCCCCC Confidence 00 13455532221 2345888887776666643332 222 11122221111 111111110 Q ss_pred Ccc-cccccccccccceeee---ccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccccccc-chHH Q lcl|NC_019418. 314 QGN-IAFKRRFDVEQNVYMQ---VGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGV-KTAT 388 (527) Q Consensus 314 ~~~-~~~~~~~d~~~~~~~~---~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~-~TAt 388 (527) ... -.....|. ..|.. +-.- .+...++.++......++.+..+....+|+...|++|.-+|...++. .+.. T Consensus 219 ~e~~~~~~~~~~---~~~~~~g~~~vl-~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e 294 (409) T protein:vir:94 219 KEKRQQVLEDFK---QYYEENGGILFQ-EPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNE 294 (409) T ss_pred HHHHHHHHHHHH---HHhhcCCCeeec-CCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHH Confidence 000 00000110 11110 0000 12223555666666778888888888899999999999988654332 2222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccC-CcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCC Q lcl|NC_019418. 389 EIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYR-GTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFA 467 (527) Q Consensus 389 ei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~-~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~ 467 (527) +. ....++.+|..++..|-...+. .+.. ........+.++.++-+..|..+.++...+++.+|+| T Consensus 295 ~~-------------~~~f~~~~l~P~~~~ie~~ln~-~Ll~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~ 360 (409) T protein:vir:94 295 EL-------------NRFYLQHTLLPIVKQYEEEFNR-KLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYY 360 (409) T ss_pred HH-------------HHHHHHHHHHHHHHHHHHHHHH-hhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCc Confidence 22 1111223333333333211110 0111 1111123344444455567888999999999999999 Q ss_pred CHHHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 468 TQKRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 468 s~~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) ++-+++... |++.-+ ..+-+. .. +-...+...+.+.. ...++ +..+|+ T Consensus 361 T~NE~R~~~-g~~p~~ggD~~~~--~~-----n~~~~~~~~~~~~~--~kGG~--~n~~e~ 409 (409) T protein:vir:94 361 TINDIREWE-DLPPVEGGDKPLI--SG-----DLYPIDTPLELRKS--LKGGD--KNVNES 409 (409) T ss_pred CHHHHHHHh-CCCCCCCcCeEee--cc-----cccccccchhhccc--ccCCC--CCcCCC Confidence 999976543 554321 000000 00 00000000000000 00111 111222 No 209 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=96.74 E-value=0.00037 Score=39.40 Aligned_cols=394 Identities=11% Similarity=0.068 Sum_probs=160.8 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchH--HH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIA--RT 78 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~--~~ 78 (527) |. +.+||++ .+-.++.. .-|..+..+-... ....|..-. ....+..| .. T Consensus 1 m~----~~~~~~~----------------~~~~~s~~-----~~w~~~~~~~~~~---~~~~g~~vt-~~~al~~~~v~~ 51 (421) T protein:vir:10 1 MF----IPQMFEG----------------KKRSVSGG-----GFWEAMLGGVRSS---HSKAGVMIT-PETALALSAVRA 51 (421) T ss_pred CC----Ccchhcc----------------cccccCcc-----hhhHHHhhhhccC---cccCCceec-hHHhhccHHHHH Confidence 22 2233321 11122221 1244443332111 111111100 00112222 23 Q ss_pred HHHHHhhhhhcccceE----------eeCCHHHHHHHHH-HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-Cee-EEE Q lcl|NC_019418. 79 AAKKIASLVYNEQAEI----------SAEDETLNDFLSD-MLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKI-RVA 145 (527) Q Consensus 79 i~~~~A~ll~~e~~~i----------~~~d~~~~~~l~~-~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~-~i~ 145 (527) +++.+|+-+-.-|..+ .+.+..+...|.. --..-......+..+.+.+..|.+++.+..+. |++ .+. T Consensus 52 ~i~~Ia~~iA~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~ 131 (421) T protein:vir:10 52 CVTLLAESVAQLPVELYRRDKNGGRQRATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGYPKELI 131 (421) T ss_pred HHHHHHHhhccCceEEEEEcCCCceeecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEE Confidence 4444444443333332 1122223333321 00011122333455667778899988887765 333 455 Q ss_pred EEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCce Q lcl|NC_019418. 146 FIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGER 225 (527) Q Consensus 146 ~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~ 225 (527) .++|+++-++. +.+ +..||.. . .-|.. T Consensus 132 ~l~~~~v~v~~-~~~-----------------g~~~y~~---~--------------------------------~~g~~ 158 (421) T protein:vir:10 132 PINPKKVIVLK-GPD-----------------GMPYYEI---P--------------------------------EIGET 158 (421) T ss_pred EecCceEEEEE-CCC-----------------ceEEEEE---c--------------------------------CCCcE Confidence 56777665532 111 1122210 0 00111 Q ss_pred eecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcC-cceeeechh Q lcl|NC_019418. 226 VNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMG-QRRVIVPEQ 304 (527) Q Consensus 226 v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~-~~~i~v~~~ 304 (527) +|.. -+.|++.+..| ...|+|.+.-+...++.....-....+-|+.| ++.-++ T Consensus 159 ~~~~------------------eiih~~~~~~d-----~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil--- 212 (421) T protein:vir:10 159 LPMR------------------MMHHVKVFSLD-----GYIGSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVI--- 212 (421) T ss_pred Echh------------------hEEEecCcCCC-----CcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEE--- Confidence 1110 12345433222 34688988887777754333332223334553 333332 Q ss_pred HhcCCCCCCC--cccccccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccc Q lcl|NC_019418. 305 MTQLKVQDNQ--GNIAFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMF 377 (527) Q Consensus 305 ~l~~~~~~~~--~~~~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~ 377 (527) ....+..+ .+....+....-+..|.+.+ .+ -+++..++.++......++.+..+...++|+...|++|..+ T Consensus 213 --~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l 290 (421) T protein:vir:10 213 --ERPKEAPAIKSQEKIDQLLAKWTDRYSGINNMFSVALLQEGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMV 290 (421) T ss_pred --EecCccCccCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHc Confidence 21111110 00000000000011122211 00 01223566677777788888888888899999999999998 Q ss_pred ccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHH Q lcl|NC_019418. 378 TFDGQGV-KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELD 456 (527) Q Consensus 378 ~~~~~g~-~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~ 456 (527) +....+. .++++.. ...++.+|..++..|-...+. .+..........+.++.+.-+..|..+.++ T Consensus 291 g~~~~~t~sn~e~~~-------------~~f~~~tl~P~~~~ie~~ln~-kL~~~~~~~~~~v~fd~~~l~~~d~~~~~~ 356 (421) T protein:vir:10 291 QMLAKATNNNIEHQG-------------LQFVMYTLLAWLKRHEGALQR-DLLLPSERRDLYIEFNVSGLLRGDQKSRYE 356 (421) T ss_pred CCCcCCccccHHHHH-------------HHHHHHHHHHHHHHHHHHHhh-hccCccccCCeEEEEechhhhccCHHHHHH Confidence 8755432 2222221 112233333333333221111 111111112233555555666678899999 Q ss_pred HHHHHHhcCCCCHHHHHHhcCCCCHH-HHHHHHHHHHHhcccccccccCCCCCC--CCCCCCCCCCCCCcc Q lcl|NC_019418. 457 YWMKMVAAGFATQKRGIAKTLGITEE-EAEKELAEINGELPPESDAELALYGKG--QQNTVGNSKDTVDDE 524 (527) Q Consensus 457 ~~~~~~~aGi~s~~~~i~~~~~~~de-ea~~el~ri~~E~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~ 524 (527) ...+++.+|+|++-+++.++ |+..- ...+-+. ..... +.....+++. ......++++-..+. T Consensus 357 ~~~~~~~~G~~T~NE~R~~~-gl~p~~ggD~~~~--~~n~~---~~~~~~~~~~~~~~~~~~e~d~~~~~~ 421 (421) T protein:vir:10 357 SYALGRQWGWLSVNDIRRME-NLPPIAGGDKYLT--PLNMV---DSAQIIPGDKKPTAQQMAEIDTILSRT 421 (421) T ss_pred HHHHHHhCCCcCHHHHHHHh-CCCCCCCcceeee--ccccc---cccccccCCCCcccccCcccccccccC Confidence 99999999999999987654 55421 1111110 00000 0000001111 111111111222222 No 210 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=96.70 E-value=0.0004 Score=39.21 Aligned_cols=381 Identities=11% Similarity=0.066 Sum_probs=156.1 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchH--HH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIA--RT 78 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~--~~ 78 (527) +.+ ++.||++ +++. .+.+.++..... +...+.+ |.. -.....+..| .. T Consensus 16 ~~~---~~~lf~~-------~~~~----~~~~~~~~~~~~----~~~~~~~-----------~~~-vs~~~al~~~~v~~ 65 (424) T protein:vir:45 16 RVL---LDALFRS-------KSLE----NPSTPITGDAVD----TDGLFRA-----------DVY-VSPETAMKLAAVYS 65 (424) T ss_pred hHH---HHhhccc-------cCCC----CCccccchhhhh----hhccccC-----------Cce-echHHhhccHHHHH Confidence 333 3444432 2211 122222221110 0001110 000 0000112222 22 Q ss_pred HHHHHhhhhhcccceE---------eeCCHHHHHHHHH-HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEE Q lcl|NC_019418. 79 AAKKIASLVYNEQAEI---------SAEDETLNDFLSD-MLSNDRFNKNFERYLESALALGGLAMRPYVDG-DK-IRVAF 146 (527) Q Consensus 79 i~~~~A~ll~~e~~~i---------~~~d~~~~~~l~~-~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~ 146 (527) .++.+|+-+-+=|..+ .+.+..+...|.. --..-.........+...+..|.+++.+..+. |+ +.+.+ T Consensus 66 cv~~Ia~~iA~lp~~v~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~ 145 (424) T protein:vir:45 66 CIYVLSSSLAQMPLHVMRRHKGKVEPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEVISLDC 145 (424) T ss_pred HHHHHHHHHhhCceEEEEecCCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEE Confidence 3344444443333332 1222233333321 00011112233446667777899998887764 33 35666 Q ss_pred EcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCcee Q lcl|NC_019418. 147 IQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERV 226 (527) Q Consensus 147 v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v 226 (527) ++|+.+.... ++ +..+|.. . ..++... | T Consensus 146 l~~~~v~i~~-~~------------------~~~~y~~-~---------------~~~~~~~-----~------------ 173 (424) T protein:vir:45 146 CMPWETTLMN-TG------------------GRYTYGL-Y---------------NEYGAFA-----I------------ 173 (424) T ss_pred ecCceEEEEE-cC------------------CeEEEEE-E---------------ecCceEE-----E------------ Confidence 6666554321 11 1111210 0 0000000 0 Q ss_pred ecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHH-HHcCc-ceeeechh Q lcl|NC_019418. 227 NLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWE-IKMGQ-RRVIVPEQ 304 (527) Q Consensus 227 ~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e-~~~~~-~~i~v~~~ 304 (527) + +. -+.||+.+..| ...|+|.+.-+...|...... ..+... |+.|. +..++ T Consensus 174 ~--------~~----------eVih~r~~~~d-----~~~G~spi~~~~~~i~~~~~~-~~~~~~~f~ng~~p~gil--- 226 (424) T protein:vir:45 174 S--------PD----------DMIHIRALGNN-----QKMGLSPIMQHAETIGMGMSG-QKYTESFFSGNARPAGIV--- 226 (424) T ss_pred C--------cc----------cEEEecCcCCC-----CcccccHHHHHHHHHHHHHHH-HHHHHHHHhccCCccEEE--- Confidence 0 00 13455543222 346888888777766544333 233333 45433 23333 Q ss_pred HhcCCCCCCCcc-cccccccccccceeeeccCCC------CCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccc Q lcl|NC_019418. 305 MTQLKVQDNQGN-IAFKRRFDVEQNVYMQVGAGN------MDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMF 377 (527) Q Consensus 305 ~l~~~~~~~~~~-~~~~~~~d~~~~~~~~~~~~~------~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~ 377 (527) .......... -.....| +..|.+..... .++..++.++......++.+.......+|+...|++|..+ T Consensus 227 --~~~~~l~~e~~~~~~~~~---~~~~~g~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l 301 (424) T protein:vir:45 227 --SVKSGLNKESWGWLKDQW---QKASQALRRQENKTMLLPADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMI 301 (424) T ss_pred --EeCCCCCHHHHHHHHHHH---HHHhccccccCCceeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh Confidence 2111111000 0000111 11222211000 1222345555555566788888888889999999999999 Q ss_pred ccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCC-cccCccceEEEeCCCccCCHHHHH Q lcl|NC_019418. 378 TFDGQGV-KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRG-TIPELDDISVNLDDGVFTDRHAEL 455 (527) Q Consensus 378 ~~~~~g~-~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~-~~~~~~~v~v~f~d~i~~d~~~~~ 455 (527) |...++. .++.+. .+..++.+|..++..|-.-.+. .+... .......+.++.+.-+..|..+.+ T Consensus 302 g~~~~~t~sn~eq~-------------~~~f~~~tL~P~~~~ie~~ln~-kLl~~~e~~~g~~i~fd~~~llr~d~~~r~ 367 (424) T protein:vir:45 302 NDLEKATFSNISAQ-------------AIQFVRYTMMPWVTNWEQELNR-RLFTRAELAAGYYVRFNLTGLLRGTPQERA 367 (424) T ss_pred CCCCCCCcccHHHH-------------HHHHHHHHHHHHHHHHHHHHHH-hcCChhhhcCCcEEEeechhhhccCHHHHH Confidence 8754432 222222 1112233333333333221111 11111 111223456666666678888989 Q ss_pred HHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCC---CCCCCCCCCCCCCCCCcccc Q lcl|NC_019418. 456 DYWMKMVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALY---GKGQQNTVGNSKDTVDDEDE 526 (527) Q Consensus 456 ~~~~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~ 526 (527) +...+++.+|+|++-+++.. .|+..-+ . .+.-....+ ..++..++. ...+.++| T Consensus 368 ~~~~~~~~~g~~T~NE~R~~-~gl~pi~-----------g--gD~~~~~~n~~~~~~~~~~~~---~~~~~~~~ 424 (424) T protein:vir:45 368 QFYHFAITDGWMSRNEARAF-EDMNPVE-----------G--LDEMLVSVNAANPAGDFKPPK---NDEGKTNE 424 (424) T ss_pred HHHHHHHhCCCcCHHHHHHH-hCCCCCC-----------C--cceeeecccccccccccCCCC---CCCCCCCC Confidence 99999999999999997654 4654210 0 000001111 112222222 22222222 No 211 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=96.55 E-value=0.00052 Score=38.58 Aligned_cols=433 Identities=12% Similarity=0.089 Sum_probs=172.5 Q ss_pred hHHHHHHHHHHHHHHh--hcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 3 LIQKVKDFFNRGRYNM--TTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 3 ~~~~~k~~~~~~~~~~--~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) |=+ +|..++ -.+.+.+..+ .+..++......|+.++.---|.+ ....+.....++.==+.+...+ T Consensus 1 ~~~-------~~~~~~~~~~~~l~~r~~----~L~~~R~~~e~~w~e~a~~~lP~~--~~~~~~~~~~~~~~dstg~~a~ 67 (516) T protein:vir:10 1 MKQ-------STDLEYGGKRSKIPKLWE----KFSTKRSSFLDRAKHYSKLTLPYL--MNDKGDNETSQNGWQGVGAQAT 67 (516) T ss_pred CCc-------hhhHhhhhHHHHHHHHHH----HHHHhhhHHHHHHHHHHHhhcccc--cCCCCCcccccccccchHHHHH Confidence 111 111111 1112222111 123334445666777665433322 1111221222222124566777 Q ss_pred HHHhhhhhcc--cc-----eEeeCCH-------------HHHHHH-------HHHHhhhhHHHHHHHHHHHHHhcCCEEE Q lcl|NC_019418. 81 KKIASLVYNE--QA-----EISAEDE-------------TLNDFL-------SDMLSNDRFNKNFERYLESALALGGLAM 133 (527) Q Consensus 81 ~~~A~ll~~e--~~-----~i~~~d~-------------~~~~~l-------~~~l~~n~f~~~~~~~~~~a~~~G~~~~ 133 (527) +.+|+-|.+- || ++.+++. ...++| ...+..++|...+.++..+....|.+.+ T Consensus 68 ~~LAa~l~~~ltpp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l 147 (516) T protein:vir:10 68 NHLANKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCML 147 (516) T ss_pred HHHHHHHHhhhcCCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeE Confidence 7777755443 11 2233321 133333 3567788999999999999999999864 Q ss_pred EEEEeCCeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEee---------------------CCCcceEEEEEEEEeecc Q lcl|NC_019418. 134 RPYVDGDKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKT---------------------ENRKNVYYTLVEFHEWVT 192 (527) Q Consensus 134 ~~~~d~~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~---------------------~~~~~~~yt~lE~h~~~~ 192 (527) |.|... .+..++-.+++ +..|..|++..++....+.. .+....+||++++. T Consensus 148 --~~d~~~-~~~~~pl~~y~-v~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t~v~~~---- 219 (516) T protein:vir:10 148 --YKPSKG-AISAIPMHHYV-VNRDTNGDLLDIILLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYTHAKYL---- 219 (516) T ss_pred --EecCCC-CeEEEEcCeEE-EeeCCCCCeEEEeeeecccHHHHHHHhhhhhhhhhhhhccCCCCceEEEEEEEec---- Confidence 556432 24556666655 45677776666654322110 00111233333321 Q ss_pred cccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecC--CCcccEEEecCCccccccCCCccCcch Q lcl|NC_019418. 193 PTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQG--LSRPLFTYLKTPGMNNKDINSPLGLSI 270 (527) Q Consensus 193 ~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g--~~~p~f~~~~~~~~N~~~~~splG~S~ 270 (527) ...+. ..|...+...+| ...+ ...-+|..++- +...++.||+|- T Consensus 220 -----------~~~~~---~~~~~~d~~~~~----------------~~s~~~~~e~P~~~~Rw----~~~~ge~YGrgp 265 (516) T protein:vir:10 220 -----------GEGFW---ELKQSADDIPVG----------------KVSKIKSEKLPFIPLTW----KRSYGEDWGRPL 265 (516) T ss_pred -----------CCCce---EEEEeeCceeec----------------cccccccccCCeeeeee----eecCCCCcccch Confidence 01111 111111111111 0111 11112222221 223467899999 Q ss_pred hhhhHHHHHHHHHHHHHHHHH-HHcCcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEec-- Q lcl|NC_019418. 271 FDNAKTTIDFINRTYDEFMWE-IKMGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLT-- 347 (527) Q Consensus 271 ~~~~~~lid~ld~~~s~~~~e-~~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~-- 347 (527) -..+.+-++.|+..--....- ....+....||++.+....+... .....+.+ +....+..++ T Consensus 266 ~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~----------~~~g~~~~-----g~~~~v~~~q~~ 330 (516) T protein:vir:10 266 AEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVN----------SGTGEVVT-----GVEEDIHIVQLG 330 (516) T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHhcCCCcccCcccccchhhhcc----------CCCceeec-----CCcccceeeecC Confidence 999999999999655555443 34466666665443311111000 01111111 1112222222 Q ss_pred cccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhh Q lcl|NC_019418. 348 TPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKVV 426 (527) Q Consensus 348 ~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~~ 426 (527) +......-...++.+...|....=++ .+....+...|||||....+...+..+-.-..+ ...|.-|+..++. T Consensus 331 ~~~d~~~~~~~i~~~~~rI~~af~~~--~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~----- 403 (516) T protein:vir:10 331 KYADLTPISAVLEVYTRRIGVVFMME--TMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGLL----- 403 (516) T ss_pred cccchHHHHHHHHHHHHHHHHHHhhh--hhhccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHH----- Confidence 22222333344444444443221111 111122335799999988888887665533333 2233444443321 Q ss_pred cccCCcccCccceEEEeCCCccCCHHHHHHHHH------HHHh--cCC-------CCHHHH---HHhcCCC------CHH Q lcl|NC_019418. 427 GIYRGTIPELDDISVNLDDGVFTDRHAELDYWM------KMVA--AGF-------ATQKRG---IAKTLGI------TEE 482 (527) Q Consensus 427 ~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~------~~~~--aGi-------~s~~~~---i~~~~~~------~de 482 (527) ......+.....+++. . ..+.....+... +.++ +++ +....+ +....|+ +++ T Consensus 404 ~~~p~~P~~lv~~~~v--~--~i~~L~raq~~~~i~~~~q~i~~~~q~~p~v~d~id~d~~~~~~a~~~gvp~~~irs~e 479 (516) T protein:vir:10 404 EAGDSFTSDLVDPVII--T--GIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAE 479 (516) T ss_pred hhCCCCChhhcCccee--h--hHHHHHHHHHHHHHHHHHHHHHHHhcCChHHHhhcCHHHHHHHHHHHhCCChhccCCHH Confidence 1111112222222221 1 122222211111 1110 111 111111 2223332 456 Q ss_pred HHHHHHHHHH-Hhccc--ccccccCCCCCCCCCCCCCC Q lcl|NC_019418. 483 EAEKELAEIN-GELPP--ESDAELALYGKGQQNTVGNS 517 (527) Q Consensus 483 ea~~el~ri~-~E~~~--~~~~~~~~~~~~~~~~~~~~ 517 (527) |++++.++.+ .++.. .........+--++ +..+. T Consensus 480 ev~~~r~~~~~~q~~~~~~~~~~~~~~~~~~~-~~~~~ 516 (516) T protein:vir:10 480 EMEQEQEAQMQAQQAQMLEEGVAKAVPGVIQQ-ELKEA 516 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhcccchhhh-hhhcC Confidence 6665544332 22211 11111111111011 11000 No 212 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=96.49 E-value=0.00057 Score=38.36 Aligned_cols=425 Identities=9% Similarity=0.056 Sum_probs=169.3 Q ss_pred HHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccc-cccc-cCccccCceeecchHHHHHHHHh Q lcl|NC_019418. 7 VKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIE-YTNT-DGDRKRRKMQHLPIARTAAKKIA 84 (527) Q Consensus 7 ~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~-~~~~-~~~~~~~~~~~lnl~~~i~~~~A 84 (527) |+.-+-+ ++. ..+.......|+.++.---|.+. +... .+...+..+.-=..+...++.+| T Consensus 1 m~~~~~~----l~~--------------k~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LA 62 (514) T protein:vir:80 1 MRQQASA----MWA--------------EYRDSTAIRKAEDFAKFTIASLMVDPLDKTHQAEVVEYDFQSAGAFLVNNLT 62 (514) T ss_pred CccchHH----HHH--------------HhhcchHHHHHHHHHHHhcccccCCCCCCcccccccccccchhHHHHHHHHH Confidence 1111100 000 11122346667776654333221 1100 01111111111134566667766 Q ss_pred hhhhcc--cc-----eEeeCCH-------------HHHHH-------HHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEE Q lcl|NC_019418. 85 SLVYNE--QA-----EISAEDE-------------TLNDF-------LSDMLSNDRFNKNFERYLESALALGGLAMRPYV 137 (527) Q Consensus 85 ~ll~~e--~~-----~i~~~d~-------------~~~~~-------l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~ 137 (527) +-|.+- || ++.+++. ..+++ +...|..++|...+.++..+....|.+.+. . T Consensus 63 a~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~--~ 140 (514) T protein:vir:80 63 AKLALTLFPPGRPSFQIELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFY--R 140 (514) T ss_pred HHHHhhhcCCCCcccccccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEE--E Confidence 654443 11 2333321 12333 445667889999999999999999997655 4 Q ss_pred eCCeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeC-------------------CCcceEEEEEEEEeecccccccc Q lcl|NC_019418. 138 DGDKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTE-------------------NRKNVYYTLVEFHEWVTPTGQEV 198 (527) Q Consensus 138 d~~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~-------------------~~~~~~yt~lE~h~~~~~~~~~~ 198 (527) +++.-.+..++-.+++ +..|..|++..++....+... ......||++++.. T Consensus 141 ~~~~~~~~~~pl~~y~-v~~d~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~--------- 210 (514) T protein:vir:80 141 EPGTGKMLVWTMQSYT-VRRTSHGDPAVVVLRQQMPFRELTPEIQADAQAKQIAKRDSDKCDLYTVIEWQP--------- 210 (514) T ss_pred ecCCCcEEEEEcCeEE-EeeCCCcCeEEEEeeeeecHHHhhhhhhhhhhhhhccCCCCCceEEEEEEEeec--------- Confidence 5554446667767755 456777777666544322100 01112233332210 Q ss_pred eeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCC--cccEEEecCCccccccCCCccCcchhhhhHH Q lcl|NC_019418. 199 GSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLS--RPLFTYLKTPGMNNKDINSPLGLSIFDNAKT 276 (527) Q Consensus 199 ~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~--~p~f~~~~~~~~N~~~~~splG~S~~~~~~~ 276 (527) ..+.+. +.+|..-++..+|. ..+.. .-+|..++- +...++.||+|--..+.+ T Consensus 211 ---~~~~~~---~sv~~e~~g~~i~~----------------es~y~~~e~P~i~~Rw----~~~~ge~YGrgp~~~al~ 264 (514) T protein:vir:80 211 ---TPNGKR---CAVWHELEGKRVGP----------------ESSYPAHLCPYVPVAW----NVPDGEHYGRGYVEEYSG 264 (514) T ss_pred ---CCCCeE---EEEEEeccceeecc----------------cCccccccCCeeeeee----EecCCCCcccchHHHHHH Confidence 001111 11122111111111 11211 112222221 223467899999999999 Q ss_pred HHHHHHHHHHHHHH-HHHcCcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccc--cChH Q lcl|NC_019418. 277 TIDFINRTYDEFMW-EIKMGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTP--IRSS 353 (527) Q Consensus 277 lid~ld~~~s~~~~-e~~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~--ir~e 353 (527) -++.|+..--.... .....+....|+++......+...+ ....+++ +....+..++.. .... T Consensus 265 D~k~L~~l~~~~l~~~~~a~~~~~~v~~~g~~~~~~l~~~----------~~g~~v~-----g~~~~v~~~~~~~~~d~~ 329 (514) T protein:vir:80 265 DFARLSILSERLGLYEFEALSLLNLVDEAKGGAVDDYRDA----------ETGDFVP-----GQVGSVASYERGDYNKIA 329 (514) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCceeCcccccchhhhccc----------CCceeec-----CCCccceeeecCcccchH Confidence 99999976444433 3344555566644332111110000 0011111 112223333221 1222 Q ss_pred HHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhh--hcccC Q lcl|NC_019418. 354 DYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALV-EQSIKELCVSMCELGKV--VGIYR 430 (527) Q Consensus 354 ~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~-~~al~~li~~il~~~~~--~~~~~ 430 (527) .-...++.+...|....=+. +.. ..+...|||||....+...+..+-.-..+ ...|.-|++.++.+..- .+..- T Consensus 330 ~~~~~i~~~~~rI~~aFml~-~~~--rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~il~r~~~g~lP 406 (514) T protein:vir:80 330 QASASVESIVMRLNRAFMYT-GQV--RDAERVTVEEIRTVAEEAENLLGGVYSLLAETLQAPLAYLTMYEASRGNGGMLL 406 (514) T ss_pred HHHHHHHHHHHHHHHHHhhh-ccC--CCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhccCCCC Confidence 22344444444443221011 111 12234699999998888887765533333 33445555555544321 11111 Q ss_pred CcccCccceEEEeCCCc-cCCHHHHH---HHHHHHHh--cCC-------CCHHHHHHh---cCCCC-------HHHHHHH Q lcl|NC_019418. 431 GTIPELDDISVNLDDGV-FTDRHAEL---DYWMKMVA--AGF-------ATQKRGIAK---TLGIT-------EEEAEKE 487 (527) Q Consensus 431 ~~~~~~~~v~v~f~d~i-~~d~~~~~---~~~~~~~~--aGi-------~s~~~~i~~---~~~~~-------deea~~e 487 (527) ..+.+...+++. -++ ..-+...+ ....+.++ +++ +....++.. ..|++ +|+++.. T Consensus 407 ~~p~~l~~~~~v--s~la~l~r~~~~~~l~~~~~~i~~l~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~i~~~~e~~~~~ 484 (514) T protein:vir:80 407 GIAQGVYRPSII--TGIPALTRNIETANILRATQEASAIVPALVQLSKRFDPEKLVERIFANNSVDLSTLSKDPDVVAAE 484 (514) T ss_pred CCCchhhcceee--ecHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHhCCCHhhccCCHHHHHHH Confidence 122222333322 111 00111111 11111111 011 223333332 34544 3333222 Q ss_pred HHHHHHhcc---------cccccc-cCCCC Q lcl|NC_019418. 488 LAEINGELP---------PESDAE-LALYG 507 (527) Q Consensus 488 l~ri~~E~~---------~~~~~~-~~~~~ 507 (527) -+|.++.+. ..+... .-++. T Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (514) T protein:vir:80 485 AEQEAALAQQQLDVASGALAAETSAGVLTS 514 (514) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhccccCC Confidence 222211110 111111 11222 No 213 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=96.44 E-value=0.00062 Score=38.17 Aligned_cols=372 Identities=11% Similarity=0.083 Sum_probs=120.1 Q ss_pred HHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHHHH Q lcl|NC_019418. 4 IQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKI 83 (527) Q Consensus 4 ~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~ 83 (527) |.-..+ +|.+. .... .|..|..+. .+. .+.......-..+++.+ T Consensus 1 Mg~f~~--------lf~~~-------~~~~----------~~~~~~~~~--~v~---------~~~~~~~~~v~~~i~~I 44 (395) T protein:vir:10 1 MSILEK--------IFKTR-------KDIT----------YMLDLDMIE--DLS---------QQAYVKRLAIDSCIEFV 44 (395) T ss_pred Cchhhh--------hhccC-------cccc----------ccccchhcc--ccc---------hhhhhhhHHHHHHHHHH Confidence 111111 11111 0000 011111110 000 01111112222333444 Q ss_pred hhhhhcccceEeeC----CHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcCCceEEEEEcC Q lcl|NC_019418. 84 ASLVYNEQAEISAE----DETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQAPVFLPLQSNT 159 (527) Q Consensus 84 A~ll~~e~~~i~~~----d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a~~~~P~~~d~ 159 (527) |+-+-.-|..+--. +......|..==...-=...+.+.+...+.+||.++.+..+++++ +|+.. T Consensus 45 a~~iA~~p~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~~~----------~~~~~-- 112 (395) T protein:vir:10 45 ARAVAQSHFKVLEGNRIQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSKEL----------LIADS-- 112 (395) T ss_pred HHhhccceeEeccCCccccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCCCe----------EecCC-- Confidence 44333323222111 222333222100000011123333344455666666544344332 12100 Q ss_pred CceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccce Q lcl|NC_019418. 160 QDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVT 239 (527) Q Consensus 160 ~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~ 239 (527) ..+..... .+.....++.. .|.+.. .++- .++ T Consensus 113 ~~~~~~~~-------~~~~~~~~~~~--------------------~~~~~~-------------~~~~--------~ev 144 (395) T protein:vir:10 113 FYREEYAL-------YDDIFKDVTVK--------------------DYTYQR-------------TFTM--------QEV 144 (395) T ss_pred ccceeEee-------cCcceeEEEEc--------------------Cceeee-------------eecc--------ccE Confidence 00000000 00000001000 000100 0010 011 Q ss_pred eecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeee--chhHhcCCCCCCCccc Q lcl|NC_019418. 240 PIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIV--PEQMTQLKVQDNQGNI 317 (527) Q Consensus 240 ~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v--~~~~l~~~~~~~~~~~ 317 (527) .. |.++.+ .+. .+|.|.++.+..+++... ..+. ..+..+-++ +...+ +.+. .- T Consensus 145 ih-------~~~~~~--~~~-----~~G~spi~~~~~~~~~~~---~~~~---~~~~~~gii~~~~~~~----~~e~-~~ 199 (395) T protein:vir:10 145 IY-------LKYNNN--KVT-----HFVESLFEDYGKIFGRMI---GAQL---KNYQIRGILKSASSAY----DEKN-IE 199 (395) T ss_pred EE-------EccCCC--Ccc-----cccchHHHHHHHHHHHHH---HHHH---hcCCCceEEEeCCCCC----CHHH-HH Confidence 10 111111 111 246676666655554332 2222 233333222 11111 0000 00 Q ss_pred ccccccccccceeeeccCCC------CCCCcceEecc-----ccChHHHHHHHHHHHHHHHHhcCCCcccccccccccch Q lcl|NC_019418. 318 AFKRRFDVEQNVYMQVGAGN------MDSGGIVDLTT-----PIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKT 386 (527) Q Consensus 318 ~~~~~~d~~~~~~~~~~~~~------~~~~~i~~~~~-----~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~T 386 (527) ...+.| +..+.+.+.+. +++..++.++. +....++.+..+...++|+...|++|.-++...+ + T Consensus 200 ~~~~~~---~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~~s---n 273 (395) T protein:vir:10 200 KLQAFT---NKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYGETA---D 273 (395) T ss_pred HHHHHH---HHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcCccc---C Confidence 000111 11122211110 11112222221 2234478888888888999999999998863222 1 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcC Q lcl|NC_019418. 387 ATEIVSENSDTY-QMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAG 465 (527) Q Consensus 387 Atei~s~~~~~~-~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aG 465 (527) +.+. ...++ .++.-+...++..|..- ++... .....+.+++..-+..|..+.++...+++.+| T Consensus 274 ~e~~---~~~~~~~~l~P~~~~ie~~l~~k------------L~~~~-~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G 337 (395) T protein:vir:10 274 LEKN---TLVFEKFCLTPLLKKIQNELNAK------------LITQS-MYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSG 337 (395) T ss_pred HHHH---HHHHHHHHHHHHHHHHHHHHHHh------------hcChh-hhcccceecchhhhccCHHHHHHHHHHHHhCC Confidence 2221 11111 12222222222222211 11110 01112346666666788888888999999999 Q ss_pred CCCHHHHHHhcCCCC---HHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 466 FATQKRGIAKTLGIT---EEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 466 i~s~~~~i~~~~~~~---deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) +|++-+++... |+. +.++.+-+ +.....+.... ..++.+..+....+|++++. T Consensus 338 ~lt~NE~R~~~-g~~p~~~g~~d~~~--~~~n~~~~~~~------~~~~~~~~~~~~kgg~~~~~ 393 (395) T protein:vir:10 338 SFTRNEVRIML-GEEPSDNPELDEYL--ITKNYEKANSG------ENDEKEKDENTLKGGDEDES 393 (395) T ss_pred CcCHHHHHHHh-CCCCCCCCCCceee--ecccccccccc------ccccCcccccccCCCCCCCC Confidence 99999987654 443 22111111 01100000000 00011111111112222222 No 214 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=96.44 E-value=0.00062 Score=38.17 Aligned_cols=372 Identities=11% Similarity=0.083 Sum_probs=120.1 Q ss_pred HHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHHHH Q lcl|NC_019418. 4 IQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKI 83 (527) Q Consensus 4 ~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~ 83 (527) |.-..+ +|.+. .... .|..|..+. .+. .+.......-..+++.+ T Consensus 1 Mg~f~~--------lf~~~-------~~~~----------~~~~~~~~~--~v~---------~~~~~~~~~v~~~i~~I 44 (395) T protein:vir:95 1 MSILEK--------IFKTR-------KDIT----------YMLDLDMIE--DLS---------QQAYVKRLAIDSCIEFV 44 (395) T ss_pred Cchhhh--------hhccC-------cccc----------ccccchhcc--ccc---------hhhhhhhHHHHHHHHHH Confidence 111111 11111 0000 011111110 000 01111112222333444 Q ss_pred hhhhhcccceEeeC----CHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcCCceEEEEEcC Q lcl|NC_019418. 84 ASLVYNEQAEISAE----DETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQAPVFLPLQSNT 159 (527) Q Consensus 84 A~ll~~e~~~i~~~----d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a~~~~P~~~d~ 159 (527) |+-+-.-|..+--. +......|..==...-=...+.+.+...+.+||.++.+..+++++ +|+.. T Consensus 45 a~~iA~~p~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~~~----------~~~~~-- 112 (395) T protein:vir:95 45 ARAVAQSHFKVLEGNRIQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSKEL----------LIADS-- 112 (395) T ss_pred HHhhccceeEeccCCccccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCCCe----------EecCC-- Confidence 44333323222111 222333222100000011123333344455666666544344332 12100 Q ss_pred CceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccce Q lcl|NC_019418. 160 QDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVT 239 (527) Q Consensus 160 ~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~ 239 (527) ..+..... .+.....++.. .|.+.. .++- .++ T Consensus 113 ~~~~~~~~-------~~~~~~~~~~~--------------------~~~~~~-------------~~~~--------~ev 144 (395) T protein:vir:95 113 FYREEYAL-------YDDIFKDVTVK--------------------DYTYQR-------------TFTM--------QEV 144 (395) T ss_pred ccceeEee-------cCcceeEEEEc--------------------Cceeee-------------eecc--------ccE Confidence 00000000 00000001000 000100 0010 011 Q ss_pred eecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeee--chhHhcCCCCCCCccc Q lcl|NC_019418. 240 PIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIV--PEQMTQLKVQDNQGNI 317 (527) Q Consensus 240 ~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v--~~~~l~~~~~~~~~~~ 317 (527) .. |.++.+ .+. .+|.|.++.+..+++... ..+. ..+..+-++ +...+ +.+. .- T Consensus 145 ih-------~~~~~~--~~~-----~~G~spi~~~~~~~~~~~---~~~~---~~~~~~gii~~~~~~~----~~e~-~~ 199 (395) T protein:vir:95 145 IY-------LKYNNN--KVT-----HFVESLFEDYGKIFGRMI---GAQL---KNYQIRGILKSASSAY----DEKN-IE 199 (395) T ss_pred EE-------EccCCC--Ccc-----cccchHHHHHHHHHHHHH---HHHH---hcCCCceEEEeCCCCC----CHHH-HH Confidence 10 111111 111 246676666655554332 2222 233333222 11111 0000 00 Q ss_pred ccccccccccceeeeccCCC------CCCCcceEecc-----ccChHHHHHHHHHHHHHHHHhcCCCcccccccccccch Q lcl|NC_019418. 318 AFKRRFDVEQNVYMQVGAGN------MDSGGIVDLTT-----PIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKT 386 (527) Q Consensus 318 ~~~~~~d~~~~~~~~~~~~~------~~~~~i~~~~~-----~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~T 386 (527) ...+.| +..+.+.+.+. +++..++.++. +....++.+..+...++|+...|++|.-++...+ + T Consensus 200 ~~~~~~---~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~~s---n 273 (395) T protein:vir:95 200 KLQAFT---NKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYGETA---D 273 (395) T ss_pred HHHHHH---HHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcCccc---C Confidence 000111 11122211110 11112222221 2234478888888888999999999998863222 1 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcC Q lcl|NC_019418. 387 ATEIVSENSDTY-QMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAG 465 (527) Q Consensus 387 Atei~s~~~~~~-~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aG 465 (527) +.+. ...++ .++.-+...++..|..- ++... .....+.+++..-+..|..+.++...+++.+| T Consensus 274 ~e~~---~~~~~~~~l~P~~~~ie~~l~~k------------L~~~~-~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G 337 (395) T protein:vir:95 274 LEKN---TLVFEKFCLTPLLKKIQNELNAK------------LITQS-MYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSG 337 (395) T ss_pred HHHH---HHHHHHHHHHHHHHHHHHHHHHh------------hcChh-hhcccceecchhhhccCHHHHHHHHHHHHhCC Confidence 2221 11111 12222222222222211 11110 01112346666666788888888999999999 Q ss_pred CCCHHHHHHhcCCCC---HHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 466 FATQKRGIAKTLGIT---EEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 466 i~s~~~~i~~~~~~~---deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) +|++-+++... |+. +.++.+-+ +.....+.... ..++.+..+....+|++++. T Consensus 338 ~lt~NE~R~~~-g~~p~~~g~~d~~~--~~~n~~~~~~~------~~~~~~~~~~~~kgg~~~~~ 393 (395) T protein:vir:95 338 SFTRNEVRIML-GEEPSDNPELDEYL--ITKNYEKANSG------ENDEKEKDENTLKGGDEDES 393 (395) T ss_pred CcCHHHHHHHh-CCCCCCCCCCceee--ecccccccccc------ccccCcccccccCCCCCCCC Confidence 99999987654 443 22111111 01100000000 00011111111112222222 No 215 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=96.44 E-value=0.00062 Score=38.17 Aligned_cols=372 Identities=11% Similarity=0.083 Sum_probs=120.1 Q ss_pred HHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHHHH Q lcl|NC_019418. 4 IQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKI 83 (527) Q Consensus 4 ~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~ 83 (527) |.-..+ +|.+. .... .|..|..+. .+. .+.......-..+++.+ T Consensus 1 Mg~f~~--------lf~~~-------~~~~----------~~~~~~~~~--~v~---------~~~~~~~~~v~~~i~~I 44 (395) T protein:vir:10 1 MSILEK--------IFKTR-------KDIT----------YMLDLDMIE--DLS---------QQAYVKRLAIDSCIEFV 44 (395) T ss_pred Cchhhh--------hhccC-------cccc----------ccccchhcc--ccc---------hhhhhhhHHHHHHHHHH Confidence 111111 11111 0000 011111110 000 01111112222333444 Q ss_pred hhhhhcccceEeeC----CHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcCCceEEEEEcC Q lcl|NC_019418. 84 ASLVYNEQAEISAE----DETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQAPVFLPLQSNT 159 (527) Q Consensus 84 A~ll~~e~~~i~~~----d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a~~~~P~~~d~ 159 (527) |+-+-.-|..+--. +......|..==...-=...+.+.+...+.+||.++.+..+++++ +|+.. T Consensus 45 a~~iA~~p~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~~~----------~~~~~-- 112 (395) T protein:vir:10 45 ARAVAQSHFKVLEGNRIQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSKEL----------LIADS-- 112 (395) T ss_pred HHhhccceeEeccCCccccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCCCe----------EecCC-- Confidence 44333323222111 222333222100000011123333344455666666544344332 12100 Q ss_pred CceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccce Q lcl|NC_019418. 160 QDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVT 239 (527) Q Consensus 160 ~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~ 239 (527) ..+..... .+.....++.. .|.+.. .++- .++ T Consensus 113 ~~~~~~~~-------~~~~~~~~~~~--------------------~~~~~~-------------~~~~--------~ev 144 (395) T protein:vir:10 113 FYREEYAL-------YDDIFKDVTVK--------------------DYTYQR-------------TFTM--------QEV 144 (395) T ss_pred ccceeEee-------cCcceeEEEEc--------------------Cceeee-------------eecc--------ccE Confidence 00000000 00000001000 000100 0010 011 Q ss_pred eecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeee--chhHhcCCCCCCCccc Q lcl|NC_019418. 240 PIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIV--PEQMTQLKVQDNQGNI 317 (527) Q Consensus 240 ~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v--~~~~l~~~~~~~~~~~ 317 (527) .. |.++.+ .+. .+|.|.++.+..+++... ..+. ..+..+-++ +...+ +.+. .- T Consensus 145 ih-------~~~~~~--~~~-----~~G~spi~~~~~~~~~~~---~~~~---~~~~~~gii~~~~~~~----~~e~-~~ 199 (395) T protein:vir:10 145 IY-------LKYNNN--KVT-----HFVESLFEDYGKIFGRMI---GAQL---KNYQIRGILKSASSAY----DEKN-IE 199 (395) T ss_pred EE-------EccCCC--Ccc-----cccchHHHHHHHHHHHHH---HHHH---hcCCCceEEEeCCCCC----CHHH-HH Confidence 10 111111 111 246676666655554332 2222 233333222 11111 0000 00 Q ss_pred ccccccccccceeeeccCCC------CCCCcceEecc-----ccChHHHHHHHHHHHHHHHHhcCCCcccccccccccch Q lcl|NC_019418. 318 AFKRRFDVEQNVYMQVGAGN------MDSGGIVDLTT-----PIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKT 386 (527) Q Consensus 318 ~~~~~~d~~~~~~~~~~~~~------~~~~~i~~~~~-----~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~T 386 (527) ...+.| +..+.+.+.+. +++..++.++. +....++.+..+...++|+...|++|.-++...+ + T Consensus 200 ~~~~~~---~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~~s---n 273 (395) T protein:vir:10 200 KLQAFT---NKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYGETA---D 273 (395) T ss_pred HHHHHH---HHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcCccc---C Confidence 000111 11122211110 11112222221 2234478888888888999999999998863222 1 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcC Q lcl|NC_019418. 387 ATEIVSENSDTY-QMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAG 465 (527) Q Consensus 387 Atei~s~~~~~~-~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aG 465 (527) +.+. ...++ .++.-+...++..|..- ++... .....+.+++..-+..|..+.++...+++.+| T Consensus 274 ~e~~---~~~~~~~~l~P~~~~ie~~l~~k------------L~~~~-~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G 337 (395) T protein:vir:10 274 LEKN---TLVFEKFCLTPLLKKIQNELNAK------------LITQS-MYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSG 337 (395) T ss_pred HHHH---HHHHHHHHHHHHHHHHHHHHHHh------------hcChh-hhcccceecchhhhccCHHHHHHHHHHHHhCC Confidence 2221 11111 12222222222222211 11110 01112346666666788888888999999999 Q ss_pred CCCHHHHHHhcCCCC---HHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 466 FATQKRGIAKTLGIT---EEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 466 i~s~~~~i~~~~~~~---deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) +|++-+++... |+. +.++.+-+ +.....+.... ..++.+..+....+|++++. T Consensus 338 ~lt~NE~R~~~-g~~p~~~g~~d~~~--~~~n~~~~~~~------~~~~~~~~~~~~kgg~~~~~ 393 (395) T protein:vir:10 338 SFTRNEVRIML-GEEPSDNPELDEYL--ITKNYEKANSG------ENDEKEKDENTLKGGDEDES 393 (395) T ss_pred CcCHHHHHHHh-CCCCCCCCCCceee--ecccccccccc------ccccCcccccccCCCCCCCC Confidence 99999987654 443 22111111 01100000000 00011111111112222222 No 216 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=96.43 E-value=0.00064 Score=38.11 Aligned_cols=451 Identities=12% Similarity=0.140 Sum_probs=191.0 Q ss_pred CCh--HHHHHHHHHHHH---HHhhcccchhhh----cc--CccccCHHHH--HHHHHHHHHhcCCCcccccccccCcccc Q lcl|NC_019418. 1 MSL--IQKVKDFFNRGR---YNMTTSHLSSIL----DH--PKVAVTQSEF--RRIQHNLAYYQSKFDDIEYTNTDGDRKR 67 (527) Q Consensus 1 m~~--~~~~k~~~~~~~---~~~~~~~~~~~~----~~--~~i~~~~~~~--~~i~~~~~~y~g~~~~l~~~~~~~~~~~ 67 (527) |++ .+-.+-|++.-- .....+...+++ ++ ..|.++.-.- ..--...++|-+.-.... +. .-.. T Consensus 1 m~f~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~--~~--~eLI 76 (523) T protein:vir:68 1 MKFNILSLFAPWAKMDERDYKDQEKENLESITSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLK--ST--RELI 76 (523) T ss_pred CCCchhhhhhhhhhhhhhhhhhhhhccCCCccccCCCCcceeeeccccccccccchhhhhhhhccccccc--hH--HHHH Confidence 665 455555554211 111111111111 11 1111110000 000001112222100000 00 0000 Q ss_pred Cceeec---chHHHHHHHHhh-hhhc----ccceEeeCCH--------HHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCE Q lcl|NC_019418. 68 RKMQHL---PIARTAAKKIAS-LVYN----EQAEISAEDE--------TLNDFLSDMLSNDRFNKNFERYLESALALGGL 131 (527) Q Consensus 68 ~~~~~l---nl~~~i~~~~A~-ll~~----e~~~i~~~d~--------~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~ 131 (527) +.+.+| +--...++...+ -+.. +|+++.+++. ...+.++.++.--+|.++..+.+....+-|.. T Consensus 77 ~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi 156 (523) T protein:vir:68 77 DTYRNLMTNYEVDNAVSEIVSDAIVYEDDTEVVSINLDNTKFSPNIKSMMLDEFNEVLNHLSFQRKGSDHFRRWYVDSRI 156 (523) T ss_pred HHHHHHhhccchhhHHHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhheeeeEE Confidence 001111 111122222222 1111 3555666543 23455667777778999999999999999999 Q ss_pred EEEEEEeCCe-----eEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceE-EEE-EEEEeecccccccceeeecC Q lcl|NC_019418. 132 AMRPYVDGDK-----IRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVY-YTL-VEFHEWVTPTGQEVGSTKDK 204 (527) Q Consensus 132 ~~~~~~d~~~-----~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~-yt~-lE~h~~~~~~~~~~~~~~~~ 204 (527) +|+.++|..+ ..+.+++|.++-++. .+..++..++. ++- .|+..+ ...+ T Consensus 157 ~fhKiid~k~pk~GI~Elr~lDPr~i~~vr--------------~i~~~~~~g~~vi~~~~e~f~Y----------~~~~ 212 (523) T protein:vir:68 157 FFHKIIDPKRPKEGIKELRRLDPRQVQYVR--------------EVITTTEAGVKIVKGYKEYFIY----------DTSH 212 (523) T ss_pred EEEEEeeCCCccccceeeeeeCCcceeEEE--------------eecCCCCcchhhhhhhhhheee----------cccc Confidence 9999998543 457888888765542 12222222111 110 011000 0000 Q ss_pred CceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHH Q lcl|NC_019418. 205 SLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRT 284 (527) Q Consensus 205 ~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~ 284 (527) ..|.+.-..| ..|..|-+ ++ ..+++.. .+.++.... .=+|-+..|.-.+..|=.. T Consensus 213 ~~~~~~g~~~------~~~~~ikI---~~---dAI~y~h-----SGL~d~~~~--------~i~gyLhkAiKp~NQLkml 267 (523) T protein:vir:68 213 ESYACDGRIY------EAGTKIKI---PK---AAIVYAH-----SGLVDCCGK--------NIIGYLHRAIKPANQLKLL 267 (523) T ss_pred cccccccccc------CCCcceec---ch---hheeeee-----ccceeCCCC--------ceeccchhhhHHHHhhHHH Confidence 0000000000 01111110 00 1111111 001111100 0134455555555555444 Q ss_pred HHHHH--HHHHcCcceeee-c---------hhHhc---------CCCCCCCcccccccccccc-cceeeeccCCCCC-CC Q lcl|NC_019418. 285 YDEFM--WEIKMGQRRVIV-P---------EQMTQ---------LKVQDNQGNIAFKRRFDVE-QNVYMQVGAGNMD-SG 341 (527) Q Consensus 285 ~s~~~--~e~~~~~~~i~v-~---------~~~l~---------~~~~~~~~~~~~~~~~d~~-~~~~~~~~~~~~~-~~ 341 (527) -+.++ +-.|+-.+|||- . +.+++ ..-|..+|++.-.+.+-.- ...+.+- -+|+ .. T Consensus 268 EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLpR--ReGgrgT 345 (523) T protein:vir:68 268 EDAVVIYRITRAPDRRVWYVDTGNMPSRKAAEHMQHVMNTMKNRIAYDATTGKIKNQQHIMSMTEDYWLQR--RDGKAVT 345 (523) T ss_pred HhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhhcceeEEeccCCeeccchhhhhhHhhhcccc--cCCCccc Confidence 44433 444566667753 1 11110 0113344443222211100 0001110 1122 12 Q ss_pred cceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019418. 342 GIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQG--VKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSM 419 (527) Q Consensus 342 ~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g--~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~i 419 (527) -|+++...=...+ ...+..+.+.+....+++..-+..++++ .--++||.-..-.....+.+.+..|..-+.++++.= T Consensus 346 EItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDEikF~KFI~rLR~rFs~lf~~~Lk~q 424 (523) T protein:vir:68 346 EVDTLPGADNTGN-MEDVRWFRNALYMALRIPITRIPSDQGGIQFDAGTSITRDELSFGKFIRELQHKFEEIFLDPLKTN 424 (523) T ss_pred ceeeccccCCcCh-HHHHHHHHHHHHHHhCCcceeecCCCcceecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 3566554433333 4556666777777777777766433221 112557766666666778888888888888888876 Q ss_pred HHHhhhhcccCCcccCc--cceEEEeCCCccCCHHHHHHHHHH---HH--hcC----CCCHHHHHHhcCCCCHHHHHHHH Q lcl|NC_019418. 420 CELGKVVGIYRGTIPEL--DDISVNLDDGVFTDRHAELDYWMK---MV--AAG----FATQKRGIAKTLGITEEEAEKEL 488 (527) Q Consensus 420 l~~~~~~~~~~~~~~~~--~~v~v~f~d~i~~d~~~~~~~~~~---~~--~aG----i~s~~~~i~~~~~~~deea~~el 488 (527) |.|-.. +...-+.. ..|.++|...=-..+..+++.... +. ..+ .+|.+++.++....||+|.+++. T Consensus 425 LilKgi---it~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~ 501 (523) T protein:vir:68 425 LILKGI---ITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINMLQMAEPFIGKYISHRTAMKDILQMSDEEIEQEA 501 (523) T ss_pred hhhccC---CCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHH Confidence 655332 21111111 346677754333333333332221 11 112 35888888888899999999999 Q ss_pred HHHHHhcccccccccCCCCCCCCCCCCCCC Q lcl|NC_019418. 489 AEINGELPPESDAELALYGKGQQNTVGNSK 518 (527) Q Consensus 489 ~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 518 (527) .+|++|... +.+.++++ +.++- T Consensus 502 kqI~~E~k~------~~~~~p~~--e~~~f 523 (523) T protein:vir:68 502 KQIEEESKE------ARFQDPDQ--EQEDF 523 (523) T ss_pred HHHHHHhhc------CCCCCCch--hhhcC Confidence 999998642 11111111 11111 No 217 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=96.41 E-value=0.00065 Score=38.07 Aligned_cols=436 Identities=11% Similarity=0.072 Sum_probs=167.2 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHH--------HHHhcCCCcccccccccCcccc-Ccee Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHN--------LAYYQSKFDDIEYTNTDGDRKR-RKMQ 71 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~--------~~~y~g~~~~l~~~~~~~~~~~-~~~~ 71 (527) |-++..+++-|. |+.|.+.. .+.++.=.-.+|.+. +..+-|-. +..-+..|-..+ ..+. T Consensus 1 ~~~~~~~~~~~~-----~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~g~~~~~~~~~ 68 (535) T protein:vir:10 1 MAILKDLRNAFS-----LSNKKSTS-----YIELGDYDKDIVNKAIRPGRASARDTVDGID--IADGNVAGQYSVASISD 68 (535) T ss_pred ChhhHHHHHHHH-----hhhhhhhh-----hHHHhhhhHHHHHhhhhhhhhhhhccccccc--cccCCcccccccCcccc Confidence 888777766653 22222221 123333222222211 11222200 000000010000 0001 Q ss_pred ec------------chHHHHHHHHhhhhh-------------cccceEe-----eCC--HHHHHHHHHHHhh--h----- Q lcl|NC_019418. 72 HL------------PIARTAAKKIASLVY-------------NEQAEIS-----AED--ETLNDFLSDMLSN--D----- 112 (527) Q Consensus 72 ~l------------nl~~~i~~~~A~ll~-------------~e~~~i~-----~~d--~~~~~~l~~~l~~--n----- 112 (527) ++ ++...+++..++.+. +-+..+. .+. ......|..+|.. | T Consensus 69 ~~~~~~l~~~~~~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~ 148 (535) T protein:vir:10 69 VLSTKKLLKAYADNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKRAHEIEDFIYNTGSEYYEW 148 (535) T ss_pred ccCHHHHHHHhccChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhhhhHHHHHHHhCCCCCCCh Confidence 11 122233333333221 1111111 011 1122334444431 2 Q ss_pred -hHHH-HHHHHHHHHHhcCC-EEEEEEEeC-Cee-EEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEE Q lcl|NC_019418. 113 -RFNK-NFERYLESALALGG-LAMRPYVDG-DKI-RVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEF 187 (527) Q Consensus 113 -~f~~-~~~~~~~~a~~~G~-~~~~~~~d~-~~~-~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~ 187 (527) .|+. .+..++..++.+|+ +++.+..+. |++ .+..++|+++.+.. +.++. +....||.. T Consensus 149 ~~~~~~~~~~lv~d~l~~~g~ay~~i~r~~~G~~~~L~~l~p~~V~v~~-d~~~~-------------~~~~~~~~~--- 211 (535) T protein:vir:10 149 RDTFPRLLTKIINDMYVQDQINIERIFKNDSNELDHFNAVDASKVVISY-SPRSK-------------DQPRKFEQF--- 211 (535) T ss_pred hHHHHHHHHHHHHHHHhhCCceEEEEEECCCCcEEEEEEeCCceeEEEE-cCccc-------------cCceEEEEE--- Confidence 2333 44556667777775 466666653 444 47778888777642 21110 011111100 Q ss_pred EeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccC Q lcl|NC_019418. 188 HEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLG 267 (527) Q Consensus 188 h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG 267 (527) .++....+ | + +.+ +.||+.+. +......++| T Consensus 212 ---------------~~~~~~~~---~------------~--------~~e----------iih~~~~~-~~~~~~~~~G 242 (535) T protein:vir:10 212 ---------------VSETKSVK---F------------S--------ERN----------LTFINYWN-LSDTDRRGYG 242 (535) T ss_pred ---------------ecCceeEE---E------------C--------ccc----------EEEEeccC-CCCccccccc Confidence 00000000 0 0 000 23444211 1112235679 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCcccccc--cccc-cccceeeecc------CCCC Q lcl|NC_019418. 268 LSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGNIAFK--RRFD-VEQNVYMQVG------AGNM 338 (527) Q Consensus 268 ~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~~~~~--~~~d-~~~~~~~~~~------~~~~ 338 (527) +|.+.-+...|.....+-.-..+-|+.|.. |..+|........ ..... ..+. .-+..|.+.+ +-.+ T Consensus 243 ~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~----p~giL~~~~~~~~-~ls~e~~e~lk~~~~~~~~G~~nag~~~vl~~ 317 (535) T protein:vir:10 243 YSPVEASIPLIRAIYDTEQFNARFFSQGGT----TRGILVIDQDGDA-QANQMMLAGIRRQWTSQGSGLGGAWKIPILAA 317 (535) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCC----ccEEEEecCCCCc-ccCHHHHHHHHHHHHHHhcCcccccccccccC Confidence 999988887776665444333344565443 2222222211100 00000 0000 0011122211 1111 Q ss_pred CCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHH-HHHHH-HHHHHHHHHHHHH Q lcl|NC_019418. 339 DSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTY-QMRNS-IVALVEQSIKELC 416 (527) Q Consensus 339 ~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~-~~~~~-~~~~~~~al~~li 416 (527) +...++.++......++.+..+...++|+...|++|..+|+...+..+.. ....+.++ .++.. .+..++.+|..++ T Consensus 318 ~g~~~~~l~~~~~D~qfle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~--~~~~~~~~~s~~E~~~~~~~~~~L~P~l 395 (535) T protein:vir:10 318 KDAKFVNMTQNSRDMEFDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGK--SGTKSVNEGSTAKAKLESSKDKGLTPLL 395 (535) T ss_pred CCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccccCcccccc--hhhhhhhhhhhHHHHHHHHHHHHHHHHH Confidence 22344555666677889999998999999999999999987654322110 00111111 11111 2222344555555 Q ss_pred HHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHH---HHH-HHHH--H Q lcl|NC_019418. 417 VSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLGITEE---EAE-KELA--E 490 (527) Q Consensus 417 ~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~de---ea~-~el~--r 490 (527) ..|-...+. .++.. ....+.+.|+.....|..+..+ ..++..+|.|++-+++.+. |+..- ++- -.++ . T Consensus 396 ~~ie~~ln~-~Ll~~---~~~~~~f~f~~l~~~d~~~r~~-~~~~~~~g~lT~NE~R~~~-gl~piegGD~~~~~~~~~~ 469 (535) T protein:vir:10 396 SFIEQVIND-KIMRY---VDTDYRFSFTLGDAQDKLQEEQ-VWKLKLANGYFINEYRKDH-GLKTVDGLDVPGFIGSAEN 469 (535) T ss_pred HHHHHHHhh-hcccc---cCCeEEEEeccccccCHHHHHH-HHHHHHcCCCCHHHHHHHh-CCCCCCCccccccccchhh Confidence 544433221 12211 1235778898877777666544 4455567889999976554 44321 110 0000 0 Q ss_pred ---HH-H-hc-cc--ccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 491 ---IN-G-EL-PP--ESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 491 ---i~-~-E~-~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) .. . +. .+ .++.......+..+.+.++..+...+.||. T Consensus 470 ~~~~~~~~~~~~p~~~~~~~~~~~~~~~q~~~~~~~~~~~g~~~~ 514 (535) T protein:vir:10 470 FINATGFGQPNVPDSSDDSGSTLGERERQERIQHSKDYEKGKDDP 514 (535) T ss_pred cccccccccccCCCCCCCccccCCccccCcccccccccccCCCCC Confidence 00 0 00 00 000000010000011111111111111111 No 218 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=96.31 E-value=0.00076 Score=37.70 Aligned_cols=432 Identities=10% Similarity=0.120 Sum_probs=198.7 Q ss_pred CChHHHHHHHHHHHHHH----hhcccchhhhccC-----cccc--------------------CHHHHHHHHHHHHHhcC Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYN----MTTSHLSSILDHP-----KVAV--------------------TQSEFRRIQHNLAYYQS 51 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~----~~~~~~~~~~~~~-----~i~~--------------------~~~~~~~i~~~~~~y~g 51 (527) |+|.+-.+-|++.--.. +..+...-...+. .|.+ ......-|++++.++.. T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~ 80 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINN 80 (516) T ss_pred CCchHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhc Confidence 99988888877621111 1111111111110 0110 00112223333333211 Q ss_pred CCcccccccccCccccCceeecchHHHHHHHHhh-hhh----cccceEeeCCHH--------HHHHHHHHHhhhhHHHHH Q lcl|NC_019418. 52 KFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIAS-LVY----NEQAEISAEDET--------LNDFLSDMLSNDRFNKNF 118 (527) Q Consensus 52 ~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~-ll~----~e~~~i~~~d~~--------~~~~l~~~l~~n~f~~~~ 118 (527) |.+ ...++...+ -+. ..|+++.+++.. ..+.++.++.=-+|.++. T Consensus 81 --pEv--------------------d~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~ 138 (516) T protein:vir:10 81 --PEV--------------------ERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKL 138 (516) T ss_pred --cch--------------------hhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhh Confidence 111 111222221 111 124555555432 455666777777899999 Q ss_pred HHHHHHHHhcCCEEEEEEEeC---CeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcc--eEEEEEEEEeeccc Q lcl|NC_019418. 119 ERYLESALALGGLAMRPYVDG---DKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKN--VYYTLVEFHEWVTP 193 (527) Q Consensus 119 ~~~~~~a~~~G~~~~~~~~d~---~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~--~~yt~lE~h~~~~~ 193 (527) .+.+....+-|..+|+.++|+ |=..+.+++|.++.++.. ++.++.++ ++.-..|+.-+.. T Consensus 139 ~~~fR~WYVDgRi~fhKiid~~k~GI~Elr~lDPr~i~~vR~--------------i~~~~~~~~~v~~~~~e~~~Y~~- 203 (516) T protein:vir:10 139 DTLFRRWYVDSRIFFHKIMPNPKKGIAELRRLDPRFMEYYRE--------------IVTSDIGGTTIVKGYREFFIYTT- 203 (516) T ss_pred hHHHhhhhhcceEEEEEEecCccccceeeeeeCCcceeeEee--------------ecccccccchhhhhhhheeeecc- Confidence 999999999999999988885 235688899988887632 11111111 0000011111000 Q ss_pred ccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhh Q lcl|NC_019418. 194 TGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDN 273 (527) Q Consensus 194 ~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~ 273 (527) ....|.+.-..|.. +.+|-+ ++ ..+++...+ -+.. | ...=+|-+.. T Consensus 204 ---------~~~~~~~~g~~~~~------~~~ikI---~~---dAI~y~hSG-----L~d~---~-----~~~i~syLhk 249 (516) T protein:vir:10 204 ---------GNEGYSYNGRIFEP------NTRIKI---PR---SAVVYASSG-----LMDC---S-----DRGIIGYLHN 249 (516) T ss_pred ---------CccccccccceeCC------Ccceee---ch---hheeeeccc-----ceeC---C-----CCceeeeehh Confidence 00011110011111 111110 00 111211100 0110 1 0111344555 Q ss_pred hHHHHHHHHHHHHHHH--HHHHcCcceeeec----------hhHh---------cCCCCCCCcccccccccccc-cceee Q lcl|NC_019418. 274 AKTTIDFINRTYDEFM--WEIKMGQRRVIVP----------EQMT---------QLKVQDNQGNIAFKRRFDVE-QNVYM 331 (527) Q Consensus 274 ~~~lid~ld~~~s~~~--~e~~~~~~~i~v~----------~~~l---------~~~~~~~~~~~~~~~~~d~~-~~~~~ 331 (527) |.-.+..|=-.-+.++ +-.|+-.+|||-- +.++ +..-|..+|++.-.+.+-.- ...+. T Consensus 250 AiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWL 329 (516) T protein:vir:10 250 AVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWL 329 (516) T ss_pred hhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcc Confidence 5555555544444433 4445556666531 0111 00113344443322211100 00011 Q ss_pred eccCCCCC-CCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccc-c--chHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019418. 332 QVGAGNMD-SGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQG-V--KTATEIVSENSDTYQMRNSIVAL 407 (527) Q Consensus 332 ~~~~~~~~-~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g-~--~TAtei~s~~~~~~~~~~~~~~~ 407 (527) +- -+|+ ..-|+++..-=...+ ...+..+.+.+....+++.+-+..++++ . .-++||.-.+-.....+.+.+.. T Consensus 330 pR--ReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~r 406 (516) T protein:vir:10 330 MR--RDGKSVTEVSSLPGAQTMGD-MDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQHD 406 (516) T ss_pred cc--cCCCCccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHH Confidence 10 1122 123565554333333 4556667777878888887777654432 2 23677766666667788888888 Q ss_pred HHHHHHHHHHHHHHHhhhhcccCCcccCc--cceEEEeCCCccCCHHHHHHHHHH-------H--HhcCCCCHHHHHHhc Q lcl|NC_019418. 408 VEQSIKELCVSMCELGKVVGIYRGTIPEL--DDISVNLDDGVFTDRHAELDYWMK-------M--VAAGFATQKRGIAKT 476 (527) Q Consensus 408 ~~~al~~li~~il~~~~~~~~~~~~~~~~--~~v~v~f~d~i~~d~~~~~~~~~~-------~--~~aGi~s~~~~i~~~ 476 (527) |..-+.++++.=|.|-.. +...-+.. ..|.++|...=-..+..+++.... + .-+...|.+++.++. T Consensus 407 Fs~lf~~~L~~qLilKgi---it~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~I 483 (516) T protein:vir:10 407 FEEIFLDPLKTNLIYKRI---ITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNI 483 (516) T ss_pred HHHHHHHHHHHhhhhccC---CCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHH Confidence 888888888876655332 21111111 346677754333333333322221 1 123468888888888 Q ss_pred CCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCC Q lcl|NC_019418. 477 LGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSK 518 (527) Q Consensus 477 ~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 518 (527) ...||+|.+++-.+|++|.... .+.++++. ++- T Consensus 484 Lr~tDeei~~e~k~I~~E~~~~------~~~~p~~~---~~f 516 (516) T protein:vir:10 484 LQMTEEQIAQEEKQIEQEAGIK------RFQNPENE---DDF 516 (516) T ss_pred hcCCHhhHHHHHHHHHHhhhCC------CCCCCCcc---ccC Confidence 8999999999999999987421 11111100 001 No 219 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=96.31 E-value=0.00076 Score=37.70 Aligned_cols=432 Identities=10% Similarity=0.120 Sum_probs=198.7 Q ss_pred CChHHHHHHHHHHHHHH----hhcccchhhhccC-----cccc--------------------CHHHHHHHHHHHHHhcC Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYN----MTTSHLSSILDHP-----KVAV--------------------TQSEFRRIQHNLAYYQS 51 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~----~~~~~~~~~~~~~-----~i~~--------------------~~~~~~~i~~~~~~y~g 51 (527) |+|.+-.+-|++.--.. +..+...-...+. .|.+ ......-|++++.++.. T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~ 80 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINN 80 (516) T ss_pred CCchHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhc Confidence 99988888877621111 1111111111110 0110 00112223333333211 Q ss_pred CCcccccccccCccccCceeecchHHHHHHHHhh-hhh----cccceEeeCCHH--------HHHHHHHHHhhhhHHHHH Q lcl|NC_019418. 52 KFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIAS-LVY----NEQAEISAEDET--------LNDFLSDMLSNDRFNKNF 118 (527) Q Consensus 52 ~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~-ll~----~e~~~i~~~d~~--------~~~~l~~~l~~n~f~~~~ 118 (527) |.+ ...++...+ -+. ..|+++.+++.. ..+.++.++.=-+|.++. T Consensus 81 --pEv--------------------d~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~ 138 (516) T protein:vir:10 81 --PEV--------------------ERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKL 138 (516) T ss_pred --cch--------------------hhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhh Confidence 111 111222221 111 124555555432 455666777777899999 Q ss_pred HHHHHHHHhcCCEEEEEEEeC---CeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcc--eEEEEEEEEeeccc Q lcl|NC_019418. 119 ERYLESALALGGLAMRPYVDG---DKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKN--VYYTLVEFHEWVTP 193 (527) Q Consensus 119 ~~~~~~a~~~G~~~~~~~~d~---~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~--~~yt~lE~h~~~~~ 193 (527) .+.+....+-|..+|+.++|+ |=..+.+++|.++.++.. ++.++.++ ++.-..|+.-+.. T Consensus 139 ~~~fR~WYVDgRi~fhKiid~~k~GI~Elr~lDPr~i~~vR~--------------i~~~~~~~~~v~~~~~e~~~Y~~- 203 (516) T protein:vir:10 139 DTLFRRWYVDSRIFFHKIMPNPKKGIAELRRLDPRFMEYYRE--------------IVTSDIGGTTIVKGYREFFIYTT- 203 (516) T ss_pred hHHHhhhhhcceEEEEEEecCccccceeeeeeCCcceeeEee--------------ecccccccchhhhhhhheeeecc- Confidence 999999999999999988885 235688899988887632 11111111 0000011111000 Q ss_pred ccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhh Q lcl|NC_019418. 194 TGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDN 273 (527) Q Consensus 194 ~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~ 273 (527) ....|.+.-..|.. +.+|-+ ++ ..+++...+ -+.. | ...=+|-+.. T Consensus 204 ---------~~~~~~~~g~~~~~------~~~ikI---~~---dAI~y~hSG-----L~d~---~-----~~~i~syLhk 249 (516) T protein:vir:10 204 ---------GNEGYSYNGRIFEP------NTRIKI---PR---SAVVYASSG-----LMDC---S-----DRGIIGYLHN 249 (516) T ss_pred ---------CccccccccceeCC------Ccceee---ch---hheeeeccc-----ceeC---C-----CCceeeeehh Confidence 00011110011111 111110 00 111211100 0110 1 0111344555 Q ss_pred hHHHHHHHHHHHHHHH--HHHHcCcceeeec----------hhHh---------cCCCCCCCcccccccccccc-cceee Q lcl|NC_019418. 274 AKTTIDFINRTYDEFM--WEIKMGQRRVIVP----------EQMT---------QLKVQDNQGNIAFKRRFDVE-QNVYM 331 (527) Q Consensus 274 ~~~lid~ld~~~s~~~--~e~~~~~~~i~v~----------~~~l---------~~~~~~~~~~~~~~~~~d~~-~~~~~ 331 (527) |.-.+..|=-.-+.++ +-.|+-.+|||-- +.++ +..-|..+|++.-.+.+-.- ...+. T Consensus 250 AiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWL 329 (516) T protein:vir:10 250 AVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWL 329 (516) T ss_pred hhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcc Confidence 5555555544444433 4445556666531 0111 00113344443322211100 00011 Q ss_pred eccCCCCC-CCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccc-c--chHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019418. 332 QVGAGNMD-SGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQG-V--KTATEIVSENSDTYQMRNSIVAL 407 (527) Q Consensus 332 ~~~~~~~~-~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g-~--~TAtei~s~~~~~~~~~~~~~~~ 407 (527) +- -+|+ ..-|+++..-=...+ ...+..+.+.+....+++.+-+..++++ . .-++||.-.+-.....+.+.+.. T Consensus 330 pR--ReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~r 406 (516) T protein:vir:10 330 MR--RDGKSVTEVSSLPGAQTMGD-MDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQHD 406 (516) T ss_pred cc--cCCCCccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHH Confidence 10 1122 123565554333333 4556667777878888887777654432 2 23677766666667788888888 Q ss_pred HHHHHHHHHHHHHHHhhhhcccCCcccCc--cceEEEeCCCccCCHHHHHHHHHH-------H--HhcCCCCHHHHHHhc Q lcl|NC_019418. 408 VEQSIKELCVSMCELGKVVGIYRGTIPEL--DDISVNLDDGVFTDRHAELDYWMK-------M--VAAGFATQKRGIAKT 476 (527) Q Consensus 408 ~~~al~~li~~il~~~~~~~~~~~~~~~~--~~v~v~f~d~i~~d~~~~~~~~~~-------~--~~aGi~s~~~~i~~~ 476 (527) |..-+.++++.=|.|-.. +...-+.. ..|.++|...=-..+..+++.... + .-+...|.+++.++. T Consensus 407 Fs~lf~~~L~~qLilKgi---it~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~I 483 (516) T protein:vir:10 407 FEEIFLDPLKTNLIYKRI---ITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNI 483 (516) T ss_pred HHHHHHHHHHHhhhhccC---CCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHH Confidence 888888888876655332 21111111 346677754333333333322221 1 123468888888888 Q ss_pred CCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCC Q lcl|NC_019418. 477 LGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSK 518 (527) Q Consensus 477 ~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 518 (527) ...||+|.+++-.+|++|.... .+.++++. ++- T Consensus 484 Lr~tDeei~~e~k~I~~E~~~~------~~~~p~~~---~~f 516 (516) T protein:vir:10 484 LQMTEEQIAQEEKQIEQEAGIK------RFQNPENE---DDF 516 (516) T ss_pred hcCCHhhHHHHHHHHHHhhhCC------CCCCCCcc---ccC Confidence 8999999999999999987421 11111100 001 No 220 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=96.25 E-value=0.00082 Score=37.50 Aligned_cols=443 Identities=13% Similarity=0.143 Sum_probs=188.5 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhh----ccCcccc---------------CHHHHHHHHHHHHHhcCCCcccccccc Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSIL----DHPKVAV---------------TQSEFRRIQHNLAYYQSKFDDIEYTNT 61 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~----~~~~i~~---------------~~~~~~~i~~~~~~y~g~~~~l~~~~~ 61 (527) ||+. |++ -.++..+. .+.+ +..-..+ ......-|++++.+... |.+ T Consensus 5 fgf~------~~~-~~~~~~~~-~s~~~p~~ddg~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~--pEv----- 69 (558) T protein:vir:10 5 FGFS------IEE-TQKKSTSI-ISPVPKNNEDGVDNFISSGFYGQYVDIEGAYRSEYDLIRRYREMALH--PEA----- 69 (558) T ss_pred hcch------hhh-hhhhccCC-ccccCCCccccccceeccceeeeeecccchhhhHHHHHHHHHHHhhc--cch----- Confidence 2221 110 00011110 0100 0000000 01122223333333211 111 Q ss_pred cCccccCceeecchHHHHHHHHhh-hhh----cccceEeeCCH--------HHHHHHHHHHhhhhHHHHHHHHHHHHHhc Q lcl|NC_019418. 62 DGDRKRRKMQHLPIARTAAKKIAS-LVY----NEQAEISAEDE--------TLNDFLSDMLSNDRFNKNFERYLESALAL 128 (527) Q Consensus 62 ~~~~~~~~~~~lnl~~~i~~~~A~-ll~----~e~~~i~~~d~--------~~~~~l~~~l~~n~f~~~~~~~~~~a~~~ 128 (527) ...++...+ -+. ..|+++.+++. ...+.++.++.=-+|.++..+.+....+- T Consensus 70 ---------------d~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVD 134 (558) T protein:vir:10 70 ---------------DGAIEDVVNEAIVSDLYDSPVEVELSNLNASNTLKKKIREEFRYIKEMMDFDKKSHEIFRNWYVD 134 (558) T ss_pred ---------------hhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheee Confidence 111222221 111 12445555432 23455666777678999999999999999 Q ss_pred CCEEEEEEEeCCe-----eEEEEEcCCceEEEEEcCCceE--EEEEEEEEEeeCCCcceEEEEEEEEeecccccccceee Q lcl|NC_019418. 129 GGLAMRPYVDGDK-----IRVAFIQAPVFLPLQSNTQDVS--SAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGST 201 (527) Q Consensus 129 G~~~~~~~~d~~~-----~~i~~v~a~~~~P~~~d~~~~~--~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~ 201 (527) |..+|+.++|..+ ..+.+++|.++-++..--.... ..+. .+..+ .+.+++ -++.+| T Consensus 135 gRiyfHKiid~k~pk~GI~ELr~lDPr~i~~Vr~i~~~~~~~~~~~---~~~~~-~~~~~~--~~~~ey----------- 197 (558) T protein:vir:10 135 GRVFYLKVIDTKNPQEGIQDLRYIDPLKIKFIRQEKRKPGNQDPAI---RVRSE-QDVVPN--PEFEEF----------- 197 (558) T ss_pred eEEEEEEEEeCCCccccceeeeeeCcccceeeeeecccccccccee---eeecc-cceeec--cceeEe----------- Confidence 9999999998542 4688899988766532100000 0010 01100 011111 111111 Q ss_pred ecCCceEEEEEEEecCCc------c--ccCceeecccccCCcccceee--cCCCcccEEEecCCccccccCCCccCcchh Q lcl|NC_019418. 202 KDKSLYRITNELYKSTSD------S--QLGERVNLSELYPDLQPVTPI--QGLSRPLFTYLKTPGMNNKDINSPLGLSIF 271 (527) Q Consensus 202 ~~~~~~~I~n~ly~~~~~------~--~lG~~v~l~~~~~~l~~~~~~--~g~~~p~f~~~~~~~~N~~~~~splG~S~~ 271 (527) -+|..+.. . ..|.+|-+ ++ ..+++ +|+-. .|.. .=+|-+ T Consensus 198 ----------y~Y~~~~~~~~~~~~~~~~~~~vkI---~~---dAI~y~hSGL~d----------~~~~-----~i~syL 246 (558) T protein:vir:10 198 ----------YIYTPKVQHPTGMVGQMGGKNSIKI---AK---DSITMCTSGLVD----------RNKN-----RVLSYL 246 (558) T ss_pred ----------eeecCCcccccccceeecCCCceee---ch---hheeeeccccee----------cCCC-----eeeecc Confidence 11111000 0 00111100 00 11111 11110 0110 113445 Q ss_pred hhhHHHHHHHHHHHHHHH--HHHHcCcceeeec----------hhHh---------cCCCCCCCcccccccccccc-cce Q lcl|NC_019418. 272 DNAKTTIDFINRTYDEFM--WEIKMGQRRVIVP----------EQMT---------QLKVQDNQGNIAFKRRFDVE-QNV 329 (527) Q Consensus 272 ~~~~~lid~ld~~~s~~~--~e~~~~~~~i~v~----------~~~l---------~~~~~~~~~~~~~~~~~d~~-~~~ 329 (527) ..|.-.+..|=-.-+.++ |-.|+-.+|||-- +.++ +..-|..+|++.-.+.+-.- ... T Consensus 247 hkAIKp~NQLkmlEDAlVIYRitRAPERRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDy 326 (558) T protein:vir:10 247 HKAIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLKEVMSRYRNKLVYDANTGEVRDDRKFMSMMEDF 326 (558) T ss_pred hHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhhh Confidence 566555555544444433 4445666677631 0111 00113444444333222110 000 Q ss_pred eeeccCCCCC-CCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019418. 330 YMQVGAGNMD-SGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQ-GVKTATEIVSENSDTYQMRNSIVAL 407 (527) Q Consensus 330 ~~~~~~~~~~-~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~-g~~TAtei~s~~~~~~~~~~~~~~~ 407 (527) +.+- -+|+ ..-|+++..-=...+ +..+..+.+.+....+++..-++.+++ ...-++||.-..-.....+.+.+.. T Consensus 327 WLpR--ReGgrgTEItTLpGgqnLge-m~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~r 403 (558) T protein:vir:10 327 WLPR--REGGRGTEITTLPGGQNLGE-LSDVDYFQKKLYRALGVPESRIAAEGGFNLGRSSEILRDELKFAKFVGRLRKR 403 (558) T ss_pred cccc--cCCCCccceeeccccCCcch-HHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHH Confidence 1110 1122 123555554322332 345666667777777777776655443 2223567776666677788888888 Q ss_pred HHHHHHHHHHHHHHHhhhhcccCCcccCc--cceEEEeCCCccCCHHHHHHHHHH---HH--hcC----CCCHHHHHHhc Q lcl|NC_019418. 408 VEQSIKELCVSMCELGKVVGIYRGTIPEL--DDISVNLDDGVFTDRHAELDYWMK---MV--AAG----FATQKRGIAKT 476 (527) Q Consensus 408 ~~~al~~li~~il~~~~~~~~~~~~~~~~--~~v~v~f~d~i~~d~~~~~~~~~~---~~--~aG----i~s~~~~i~~~ 476 (527) |..-+.++++.=|.|-.. +...-+.. ..|.++|...=-..+..+++.... +. ..+ ..|.+++.++. T Consensus 404 Fs~lF~~~Lk~qLilKgi---it~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~I 480 (558) T protein:vir:10 404 FAAMFNDMLKTQLVLKNI---VTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYSTEYVRKRV 480 (558) T ss_pred HHHHHHHHHHHhhhhccC---CCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHH Confidence 888888888876655332 21111111 346677754333333333332221 11 112 36888888888 Q ss_pred CCCCHHHHHHHHHHHHHhccc------ccccccC---CCC--CCC-CC--CCCCCCCCCCccccC Q lcl|NC_019418. 477 LGITEEEAEKELAEINGELPP------ESDAELA---LYG--KGQ-QN--TVGNSKDTVDDEDEA 527 (527) Q Consensus 477 ~~~~deea~~el~ri~~E~~~------~~~~~~~---~~~--~~~-~~--~~~~~~~~~~~~~~~ 527 (527) ...||+|.+++..+|++|... +...... ++. ++. ++ .+..+.+-.+.+.++ T Consensus 481 Lr~tDeeI~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 545 (558) T protein:vir:10 481 LRQTDMEIEEIDTQIEDEIQKGIIPDPSQIDPITGEPLPQEGDPAMEGMGEQPVDPDLEAQAQAV 545 (558) T ss_pred hccCHHHHHHHHHHHHHHHhCCCCCCccccChhhccccCccCCchhccCCCCCcccccccchhhh Confidence 899999999999999988731 1111111 111 100 00 000000111111111 No 221 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=96.14 E-value=0.00095 Score=37.15 Aligned_cols=463 Identities=11% Similarity=0.061 Sum_probs=185.1 Q ss_pred CCh----------HHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCce Q lcl|NC_019418. 1 MSL----------IQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKM 70 (527) Q Consensus 1 m~~----------~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~ 70 (527) |+= =.--|+|.+.+- .+-.| -.+.-.+++.+.+.|.+.- ++..+...+=+. T Consensus 1 m~~~~~~~~~~tpe~la~~W~~~I~---------~a~~~-----~~~~h~r~~~~~k~y~~~~-----~~~~~~~~r~nl 61 (663) T protein:vir:34 1 MNESQPTDFADTPQGWAQRWQEEMS---------AAREP-----LEKWHTQGKEIVKRYRDER-----DSAHDAETRWNL 61 (663) T ss_pred CCccccccchhcchhHHHHHHHHHH---------HHHhc-----cchHHHHHHHHHHHhhccc-----cCCCccccccch Confidence 110 001223322110 00000 1123455555666676521 122222222233 Q ss_pred eecchHHHHHHHHhhhhhcccceEeeC------CH----HHHHHHHHHH------hhhhHHHHHHHHHHHHHhcCCEEEE Q lcl|NC_019418. 71 QHLPIARTAAKKIASLVYNEQAEISAE------DE----TLNDFLSDML------SNDRFNKNFERYLESALALGGLAMR 134 (527) Q Consensus 71 ~~lnl~~~i~~~~A~ll~~e~~~i~~~------d~----~~~~~l~~~l------~~n~f~~~~~~~~~~a~~~G~~~~~ 134 (527) +|-|+-..+-. |.+.+|.++|. +. ...+.+.+.+ ++++|...+...+.+++..|.+.++ T Consensus 62 ~~sni~~i~P~-----iYar~P~p~V~~rf~d~d~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~ 136 (663) T protein:vir:34 62 FSTNIQTQMAS-----LYGQTPKVSVSRRFADADDDVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCR 136 (663) T ss_pred hhhhHHHHhhh-----hhcCCCcceeeecccCcccchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEE Confidence 44444333333 33456666652 22 2345555555 5667999999999999999999999 Q ss_pred EEEeC--------------------------------CeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEE Q lcl|NC_019418. 135 PYVDG--------------------------------DKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYY 182 (527) Q Consensus 135 ~~~d~--------------------------------~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~y 182 (527) +.+.. .++.|++|+-..|.---...+....-+.+.-..+...-...|- T Consensus 137 v~Ye~~~~~~~~~~~~~D~~~~~~~a~~~~~~e~~a~E~v~id~v~~~dfl~~pAr~W~ev~wva~r~~mtk~e~~~rf~ 216 (663) T protein:vir:34 137 IRYEVEWEEVAGVDAILDEATGAELAAAVPPTQRKAYECVETDYLHWQDVLWSPARVWHEVRWLAFRNLLDMREFNARFD 216 (663) T ss_pred EEeecccchhccccccCCCccccchhcccccchhhcccceeeeeechhhcccchhhccccccceeeeccCCHHHHHHhhc Confidence 88831 1355666655555411011111111111111110000000000 Q ss_pred EEEE-----EEe-ecccccccceeeecC-CceEEEEEEEecCCcccc-----CceeecccccCCcccceeecCCCcccEE Q lcl|NC_019418. 183 TLVE-----FHE-WVTPTGQEVGSTKDK-SLYRITNELYKSTSDSQL-----GERVNLSELYPDLQPVTPIQGLSRPLFT 250 (527) Q Consensus 183 t~lE-----~h~-~~~~~~~~~~~~~~~-~~~~I~n~ly~~~~~~~l-----G~~v~l~~~~~~l~~~~~~~g~~~p~f~ 250 (527) .-+. .+- .+...........+. ....| .+.+.. ....+ |-.+.|.. -+|...+.| T Consensus 217 ~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~V-wEIWdK-~~~~V~w~~eg~~~~L~~----~~p~lgl~~------- 283 (663) T protein:vir:34 217 ADGSRNLWASVPKVGKPKDGKDGQSCHPWDRAEV-WEIWDK-GGRKVDWYVEGYSAVLDT----QPDPLGLES------- 283 (663) T ss_pred CChhhhhhhhccCcCCccccCCCCCcchhcCcce-eEEEec-CCcEEEEEEcCcceeccc----CCCCCCCCC------- Confidence 0000 000 000000000000000 00000 011100 00000 11111111 011122222 Q ss_pred EecCCcc--ccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCcccccccccccccc Q lcl|NC_019418. 251 YLKTPGM--NNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQN 328 (527) Q Consensus 251 ~~~~~~~--N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~ 328 (527) |||+|.| -+...+|-..+++|.-.+++++++|.+-.++.-=.++-+.+-+.|.+.-.. .++... +..+. T Consensus 284 ffPcPrpl~~~~~~ds~ipvpd~~~y~~~~~E~n~~t~Rin~l~d~ikv~gvy~~~~g~~-----i~~~l~----~a~~n 354 (663) T protein:vir:34 284 FFPCPKPLLANWTTDKVVPRPDFVLAQDLYKEIDLVSTRITLLERAIRVVGVYDKSSGLT-----IGRLLS----EAAQN 354 (663) T ss_pred CCCCcccccceecCCCeecCCcHHHHHHHHHHHHHHHHHHHHHHhhhhhceeeccccchh-----HHHHHH----HhhCC Confidence 4444433 111224566889999999999999977776654345566666765332100 000000 00011 Q ss_pred eeeecc--CCCCCC----CcceEeccccChHHHH---HHHHHHHHHHHHhcCCCcccccccc-cccchHHHHHHHHHHHH Q lcl|NC_019418. 329 VYMQVG--AGNMDS----GGIVDLTTPIRSSDYI---SAISEGLKLFEMQIGVSSGMFTFDG-QGVKTATEIVSENSDTY 398 (527) Q Consensus 329 ~~~~~~--~~~~~~----~~i~~~~~~ir~e~~~---~~~~~~l~~i~~~~g~s~~~~~~~~-~g~~TAtei~s~~~~~~ 398 (527) ...++. ..-+++ +.|.++..+--+.... ..-..+...++..+|++-- .... .-.+||||-.-..+-+- T Consensus 355 ~lvpV~~~~~~~~~gg~~k~I~~~pi~~~~~aI~~l~~~r~qir~d~~qITGiaDi--~Rga~~a~ETatAQ~IKsq~gS 432 (663) T protein:vir:34 355 DLIPVENWLTFADKGGLRGVVDWFPLEPVVAALTSLRDYRRELVDALHQVTGMADI--MRGASDPRETAMAQGVKAKFGS 432 (663) T ss_pred CceecchhhhhhhhcCccchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHhHHHH--hhcccCcchhhHHHHHHHHHHh Confidence 122221 011112 2355544432221111 1111222345777888822 2222 23467776666668888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhh-------hcccCCccc-----------------CccceEEEeCCCccCCHHHH Q lcl|NC_019418. 399 QMRNSIVALVEQSIKELCVSMCELGKV-------VGIYRGTIP-----------------ELDDISVNLDDGVFTDRHAE 454 (527) Q Consensus 399 ~~~~~~~~~~~~al~~li~~il~~~~~-------~~~~~~~~~-----------------~~~~v~v~f~d~i~~d~~~~ 454 (527) .++.+++.++++.++++++...++... ..+.+...+ ....+.|.=+-.+..|..++ T Consensus 433 ~RIqe~qdevqR~arDi~ql~AEIl~~~~~~etl~~m~~~elp~~~ei~~~~~~L~n~~~r~~~ldIe~dsT~~~D~~~e 512 (663) T protein:vir:34 433 IRLQRLQDEVARFASDIQRLKAEVIAEHYDVASILAQANAEFTFDKELAPKAAELIKSRFSMYRVEVKPEAVSLQDFAAL 512 (663) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHHhcCCCCcccchhHHHHHHhcCCCcceeeeeccCCCCcCChHHH Confidence 999999999999999999986655321 011111111 33446677777888888777 Q ss_pred HHHHHHHHhcCCCCHHHHH--------------HhcC-----CCCH-HHHHHHHHHHHH--hcccccccccCCCCCCCCC Q lcl|NC_019418. 455 LDYWMKMVAAGFATQKRGI--------------AKTL-----GITE-EEAEKELAEING--ELPPESDAELALYGKGQQN 512 (527) Q Consensus 455 ~~~~~~~~~aGi~s~~~~i--------------~~~~-----~~~d-eea~~el~ri~~--E~~~~~~~~~~~~~~~~~~ 512 (527) .+..+..+++ +-+--..+ .+++ ++.. .+++.-++++.. |.+... +..+++. T Consensus 513 K~~~~E~l~~-i~~~~qq~~pl~~q~p~~~p~l~Ellk~~~~~f~~~~qie~ai~~~~~~~e~aa~~------~~~~~pa 585 (663) T protein:vir:34 513 RNEKMEVLSG-IASFMQGVAPLAQQVPGSAPFLLQMLKWSVSGLRGSSTIEGVLDKAIAAAEEAQKQ------AAQQSPA 585 (663) T ss_pred HHHHHHHHHH-HHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhcCChhhhHHHHHHHHHhhhHHHhhc------cCCCCcc Confidence 7666655432 11111111 1110 1111 111111122111 111000 0000000 Q ss_pred CCCCCCCCCCccccC Q lcl|NC_019418. 513 TVGNSKDTVDDEDEA 527 (527) Q Consensus 513 ~~~~~~~~~~~~~~~ 527 (527) + ....+ T Consensus 586 ~---------~~~~~ 591 (663) T protein:vir:34 586 P---------QQPDP 591 (663) T ss_pred c---------chhhH Confidence 0 11111 No 222 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=95.87 E-value=0.0013 Score=36.35 Aligned_cols=375 Identities=13% Similarity=0.105 Sum_probs=138.3 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHh-cCCCcccccccccCccccCceee-cchHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYY-QSKFDDIEYTNTDGDRKRRKMQH-LPIART 78 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y-~g~~~~l~~~~~~~~~~~~~~~~-lnl~~~ 78 (527) ||||. ||++ |+.... . ++ + .|+ .+.. .......+.. ..+ .+--.. T Consensus 1 Mg~~~----~f~~-------k~~~~~------~-~~-----~----~~~~~~~~--~~~~~~~~~~----~~~~~~~V~~ 47 (403) T protein:vir:80 1 MGLFN----FFRR-------KTRSEP------T-NA-----I----SWFLTQEA--YDTLAIPGYT----RLSDNPEVRM 47 (403) T ss_pred Ccccc----cccc-------cccccc------c-ch-----h----hhhccccc--ccccccchhh----hhhhhHHHHH Confidence 99985 5543 111000 0 00 0 011 0100 0000000000 010 011123 Q ss_pred HHHHHhhhhhcccceEe-e-CC--HHHHHHHHHHHh--hhhH---HHHHHHHHHHHHhc--CCEEEEEEEeCC-e-eEEE Q lcl|NC_019418. 79 AAKKIASLVYNEQAEIS-A-ED--ETLNDFLSDMLS--NDRF---NKNFERYLESALAL--GGLAMRPYVDGD-K-IRVA 145 (527) Q Consensus 79 i~~~~A~ll~~e~~~i~-~-~d--~~~~~~l~~~l~--~n~f---~~~~~~~~~~a~~~--G~~~~~~~~d~~-~-~~i~ 145 (527) .++.+|+-+.+-|..+- - ++ .....-+..+|. -|.+ ..-++..+.+++-. |-+++.+.++.. + ..+. T Consensus 48 ~I~~ia~~iA~~p~~~~~~~~~g~~~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g~~~~L~ 127 (403) T protein:vir:80 48 AVHKIAELISSMTIHLMQNTDNGDIRIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSGLIDELI 127 (403) T ss_pred HHHHHHHhhhhCceEEEEecCCceeecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCCcEEEEE Confidence 34444444444343321 0 11 011111222222 1111 12222334445443 556777766653 3 3455 Q ss_pred EEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCce Q lcl|NC_019418. 146 FIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGER 225 (527) Q Consensus 146 ~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~ 225 (527) .++|+++-++..+ ++ |.+. |.. .. T Consensus 128 ~l~p~~v~~~~~~-~g---------------------------------------------~~~~---y~~-------~~ 151 (403) T protein:vir:80 128 PLAPSKVSFVDTD-TG---------------------------------------------YQIW---YQG-------KA 151 (403) T ss_pred EEcCCeeEEEEcC-Cc---------------------------------------------eEEE---Eee-------cc Confidence 5666655432111 11 1111 000 00 Q ss_pred eecccccCCcccceeecCCCcccEEEecC-CccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcC-cceeee-- Q lcl|NC_019418. 226 VNLSELYPDLQPVTPIQGLSRPLFTYLKT-PGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMG-QRRVIV-- 301 (527) Q Consensus 226 v~l~~~~~~l~~~~~~~g~~~p~f~~~~~-~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~-~~~i~v-- 301 (527) .+ +.+ ..||+. +.+++ .-.|.|.+.-+...+......-.-...-+..| .+..++ T Consensus 152 ~~--------~~e----------iih~~~~~~~~~----~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~ 209 (403) T protein:vir:80 152 YN--------YDE----------VLHFIVNPDPEK----PYMGRGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKV 209 (403) T ss_pred cc--------hhh----------EEEEeccCCCcC----ccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEe Confidence 00 000 123331 11121 11377777666666555443222222223443 323222 Q ss_pred chhHhcCCCCCCCcccccccccccccceeeec-cCC-----CCCCCcceEec-cccChHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_019418. 302 PEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQV-GAG-----NMDSGGIVDLT-TPIRSSDYISAISEGLKLFEMQIGVSS 374 (527) Q Consensus 302 ~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~-~~~-----~~~~~~i~~~~-~~ir~e~~~~~~~~~l~~i~~~~g~s~ 374 (527) +..+- ....+ .....+. .-|.+. +.+ +++....+.++ .+....++.+..+....+|+...|++| T Consensus 210 ~~~~~-----~~~~~-~~~~~~~---~~~~~~~~~g~~~~~~~~~~~~~~~~~l~~~d~q~~e~~~~~~~~Ia~~fgVPp 280 (403) T protein:vir:80 210 DAATA-----ELSSE-EGRNAVF---KKYLEASEAGQPWIIPAELLDVEQVKPLSLKDLAIHETVELDKRTVAGIFGVPA 280 (403) T ss_pred CCCCC-----hHHHH-HHHHHHH---HHHhhhhhcCCeeeecccccccceeccCCHHHHHHHHHHHHhHHHHHHHhCCCH Confidence 22110 00000 0000000 001111 100 01111123333 244556777888777888999999999 Q ss_pred ccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHH Q lcl|NC_019418. 375 GMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAE 454 (527) Q Consensus 375 ~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~ 454 (527) ..+|...... ++. +. .++.+|..++..|...... .+.. .....+.++.+.-+..|..+. T Consensus 281 ~~lg~~~~~~--~~~-----~~----------f~~~~l~P~~~~ie~~l~~-kll~---~~~~~~~f~~~~ll~~d~~~~ 339 (403) T protein:vir:80 281 FLLGVGKYDK--DEY-----NN----------FINSTILPIAKGIEQELTR-KLLI---SPDLYFKFNPRSLYAYDLKEL 339 (403) T ss_pred HHcCCCCccH--HHH-----HH----------HHHHHHHHHHHHHHHHHHH-hccC---CCCcEEEeechhhhccCHHHH Confidence 9887532211 111 11 2233444444443322111 1211 122334444445566788899 Q ss_pred HHHHHHHHhcCCCCHHHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCC-CCCCCCCCCCcccc Q lcl|NC_019418. 455 LDYWMKMVAAGFATQKRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQN-TVGNSKDTVDDEDE 526 (527) Q Consensus 455 ~~~~~~~~~aGi~s~~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 526 (527) ++...+++.+|+|++-+++... |+.+-+ ..+.+. ..... ++-..++++ ..+.+.++.+++-| T Consensus 340 ~~~~~~~~~~Gi~t~NE~R~~~-gl~p~~ggd~~~~--~~n~~-------pl~~~~~~~~~k~ge~~~~~~~~~ 403 (403) T protein:vir:80 340 AEVGSNMYVRGLMEGNEVRDWL-GLSPKEGLSELVI--LENYI-------PLDKIGDQNKLKGGEKGGADGQTD 403 (403) T ss_pred HHHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCeEee--ccccc-------chhhccchhhccCCCCCCCCCCCC Confidence 9999999999999999976554 654321 111000 00000 000001111 11111122222222 No 223 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=95.75 E-value=0.0015 Score=36.03 Aligned_cols=448 Identities=15% Similarity=0.096 Sum_probs=164.0 Q ss_pred CChHHHHHHH-HHHHHHHhhccc-chhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccC--ceeecchH Q lcl|NC_019418. 1 MSLIQKVKDF-FNRGRYNMTTSH-LSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRR--KMQHLPIA 76 (527) Q Consensus 1 m~~~~~~k~~-~~~~~~~~~~~~-~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~--~~~~lnl~ 76 (527) .++ -++| ++-.++-....+ ....+|--.+..+ . ..||.+. .++.|+...-..+.- +..+-=++ T Consensus 67 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~----l~~~~~~-~F~Gy~~la~laQ~~eyr~~~~~ia 133 (695) T protein:vir:36 67 LRL---ARQFEVDVSNYTPRERRAASYALDFNGTSMD-----A----LSFVTSS-GFPGFPTLVLLAQLPEYRAMHEVLA 133 (695) T ss_pred ccc---ceeceecccccCccccchhhhhhcccccccc-----c----chhhhcc-CcchHHHHHHHhhccchhhHHHHHH Confidence 222 1111 000000000000 0000000000000 0 1244432 112222111111100 00111233 Q ss_pred HHHHHHHhhhhhcccce-----Eee-------CCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEE Q lcl|NC_019418. 77 RTAAKKIASLVYNEQAE-----ISA-------EDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRV 144 (527) Q Consensus 77 ~~i~~~~A~ll~~e~~~-----i~~-------~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i 144 (527) ...+++|-..+.++... +++ ++...-+.|+.-+++-+.+..++++++.+-.+|++++.+-++++.... T Consensus 134 ~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l 213 (695) T protein:vir:36 134 DECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIM 213 (695) T ss_pred HHhhcccceecccchhhhhhccccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeccCcccc Confidence 34444443222222111 111 122445677777788889999999999999999999777776532100 Q ss_pred EEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeeccccccccee-----eecCCceEEEEEEEecCCc Q lcl|NC_019418. 145 AFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGS-----TKDKSLYRITNELYKSTSD 219 (527) Q Consensus 145 ~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~-----~~~~~~~~I~n~ly~~~~~ 219 (527) + -|+.-....+..+. -+..+.++++- .......... -...+.|+| T Consensus 214 ---~----~PL~~~~~~I~kGs------------lKGl~ViDp~~-vtP~~~n~~dP~spdfgkP~~y~V---------- 263 (695) T protein:vir:36 214 ---D----TPLVPRPYTVPKGS------------FQGLRVVEPYW-VTPNNYNSINPVADDFYKPSTWWM---------- 263 (695) T ss_pred ---c----cccccccccccCcc------------eeeeEeecccc-cccchhhhccchhhccCCCceEEE---------- Confidence 0 01100000000000 00111112110 0000000000 000111111 Q ss_pred cccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCccee Q lcl|NC_019418. 220 SQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRV 299 (527) Q Consensus 220 ~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i 299 (527) .|+.|.-+.+ ..+.| +|+.-.+|+. ..-+|+|....+.+-+++.+++-.....=+..-.-.+ T Consensus 264 --~G~kIH~SRL-------~~f~g--~plPd~LKp~-------y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~ 325 (695) T protein:vir:36 264 --IGTEVHATRL-------HTIVS--RPVGDMLKPT-------YSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSG 325 (695) T ss_pred --eceEEeeeeE-------EEecC--CCchhhhhcc-------cccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhHHH Confidence 1222211110 11222 1221122221 2346999999999999999877655554332111111 Q ss_pred eechhHhcCCCCCCCcccccccccc---cccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccc Q lcl|NC_019418. 300 IVPEQMTQLKVQDNQGNIAFKRRFD---VEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGM 376 (527) Q Consensus 300 ~v~~~~l~~~~~~~~~~~~~~~~~d---~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~ 376 (527) + -.+|.....++ +.......+. .-+..+..+-++ ++...+++++ +...-..+.+..+..+++..+|++... T Consensus 326 l-k~dla~aL~~g--~~~~l~~R~eli~~~Rsn~G~~llD-k~~Eefeq~s--tslSGLddVi~qf~q~VAgaa~IPltk 399 (695) T protein:vir:36 326 I-LMDLAQALMPG--ANVDLSMRAELINRYRDNRNILFLD-KATEEFFQFN--TPLSGLDALQAQAQEQMSAVSHIPLIK 399 (695) T ss_pred H-HHHHHHhhcCh--hHHHHHHHHHHHHHhcCccceEEEe-cCCcceEEEe--cccCCHHHHHHHHHHHHHhhhcCchhh Confidence 1 11222111122 1111111111 112222211122 2222344443 344455566666677777788887654 Q ss_pred -cccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHH Q lcl|NC_019418. 377 -FTFDGQGV-KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAE 454 (527) Q Consensus 377 -~~~~~~g~-~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~ 454 (527) ||...+|. +|+..=...+.+...... +..++.+|+.|+.+|..- .++.. .+++++.|+.--..++.+. T Consensus 400 LfGqSPkGlNATGE~D~rnYYD~I~s~Q--e~~L~p~L~rl~~ii~rS-----~~G~i---dpdi~~~fnPL~qmtd~Ek 469 (695) T protein:vir:36 400 LLGITPTGLNASSEGEIRVWYDYVRAYQ--RNALQQLMNDVIVMIQLS-----LFGAV---DPSIKWQWNALRELDDLEV 469 (695) T ss_pred hhccCcccccccchhhHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH-----hcCCC---CCcceEEeCCCCCcCHHHH Confidence 68777775 666633333333333332 456788888887766432 12222 2368889986555555443 Q ss_pred HH-------HHHHHHhcCCCCHHHHHHhcCC-----CC-------------HHHHHHHHHHHHHhcccccccccCC---- Q lcl|NC_019418. 455 LD-------YWMKMVAAGFATQKRGIAKTLG-----IT-------------EEEAEKELAEINGELPPESDAELAL---- 505 (527) Q Consensus 455 ~~-------~~~~~~~aGi~s~~~~i~~~~~-----~~-------------deea~~el~ri~~E~~~~~~~~~~~---- 505 (527) ++ .+..++.+|+++..+...++-. .. |++..-++.--+. .++.++..+. T Consensus 470 AeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~ 547 (695) T protein:vir:36 470 AESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQR--LAEGGDTGAPGGAR 547 (695) T ss_pred HHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcC--cccccccCCCCccc Confidence 33 3344556677777775554311 10 0011000000000 0000000000 Q ss_pred CCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 506 YGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 506 ~~~~~~~~~~~~~~~~~~~~~~ 527 (527) .+...+....+..-+.++.+-+ T Consensus 548 ~g~~~~~~v~~~~~~~~~~~ag 569 (695) T protein:vir:36 548 AGATAPPTVANVNANVNPREAG 569 (695) T ss_pred ccccCCCcccccccccCccccC Confidence 0000011111111111111111 No 224 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=95.47 E-value=0.002 Score=35.36 Aligned_cols=386 Identities=11% Similarity=0.006 Sum_probs=160.3 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccc-cCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRK-RRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~-~~~~~~lnl~~~i 79 (527) |.++...|+ ++.... -.|..+..+ +-...+..|..- .+..+...--... T Consensus 1 m~~~~~~~~-----------~~~~~~----------------~~~~~~~~~---~~~~~~~~g~~v~~~~al~~~~v~~~ 50 (419) T protein:vir:57 1 MFIPQFWKG-----------RPSENR----------------VNWQVVPGG---MRSSSSQAGVIITPETALALSAVRAC 50 (419) T ss_pred Ccchhhhcc-----------CCcccc----------------ccccccccc---cccccccCCceechHHhhccHHHHHH Confidence 665333222 111110 001111000 000011111110 0111111112344 Q ss_pred HHHHhhhhhcccceE-e-e--------CCHHHHHHHHH-HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEE Q lcl|NC_019418. 80 AKKIASLVYNEQAEI-S-A--------EDETLNDFLSD-MLSNDRFNKNFERYLESALALGGLAMRPYVDG-DK-IRVAF 146 (527) Q Consensus 80 ~~~~A~ll~~e~~~i-~-~--------~d~~~~~~l~~-~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~ 146 (527) ++.+|+-+-+-|..+ . . .+..+...|.. --..-......+..+...+..|.+++.+..+. |+ +.+.. T Consensus 51 i~~ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~p 130 (419) T protein:vir:57 51 VTLLAESVAQLPCVLYRRTENGGREIAFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITELIP 130 (419) T ss_pred HHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEE Confidence 444444444333332 1 1 12223333321 11111223334456667778899988887775 33 45666 Q ss_pred EcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCcee Q lcl|NC_019418. 147 IQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERV 226 (527) Q Consensus 147 v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v 226 (527) ++|+.+-+.. +.++ ..||.. . + .|..+ T Consensus 131 l~~~~v~v~~-~~~g-----------------~~~y~~-~------------------~----------------~~~~~ 157 (419) T protein:vir:57 131 INPHKVIVLK-GPDG-----------------MPYYDI-P------------------S----------------IGEIL 157 (419) T ss_pred EcCcceEEEE-CCCc-----------------eEEEEE-c------------------C----------------CceEE Confidence 7777665532 1111 112210 0 0 01111 Q ss_pred ecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHc-CcceeeechhH Q lcl|NC_019418. 227 NLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKM-GQRRVIVPEQM 305 (527) Q Consensus 227 ~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~-~~~~i~v~~~~ 305 (527) |.. -+.|++.+.. +..+|.|.+.-+...++.....-....+-|.. +.+.-++ T Consensus 158 ~~~------------------~vih~r~~~~-----d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil---- 210 (419) T protein:vir:57 158 PMR------------------MVHHIKSFSL-----DGYIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVI---- 210 (419) T ss_pred chh------------------hEEEecCcCC-----CCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEE---- Confidence 100 1234543211 23569999888888777555443333334555 3333222 Q ss_pred hcCCCCCCC--ccc---ccccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcc Q lcl|NC_019418. 306 TQLKVQDNQ--GNI---AFKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSG 375 (527) Q Consensus 306 l~~~~~~~~--~~~---~~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~ 375 (527) ......+. .+. .....|. ..|.+.. .+ -.++..++.++......++.+..+...++|+...|++|. T Consensus 211 -~~~~~~~~~~~~e~~~~~~~~~~---~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~ 286 (419) T protein:vir:57 211 -ERPFEAKAIASQAAVDAILAKWT---ERYGGVRNAFSVGMLQEGMTYKQLSQDNEKAQLLQSRQYTVNEVCRLYKVPPH 286 (419) T ss_pred -EecCcCCcccCHHHHHHHHHHHH---HHhccccccccceecCCCceEEEcCCChhhHHHHHHHHHHHHHHHHHhCCCHH Confidence 22111110 000 0001111 1111110 00 012234566666677788888888888999999999999 Q ss_pred ccccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHH Q lcl|NC_019418. 376 MFTFDGQGV-KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAE 454 (527) Q Consensus 376 ~~~~~~~g~-~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~ 454 (527) .++....+. .++.+. .+..++.+|.-++..|....+. .+..........+.+++++-+..|..+. T Consensus 287 ~lg~~~~~t~sn~e~~-------------~~~f~~~~l~P~~~~ie~~l~~-~ll~~~~~~~~~i~fd~~~ll~~d~~~~ 352 (419) T protein:vir:57 287 MIQDLQKSTNNNIEHQ-------------GLQYVIYTMLAILKRHESAMMR-DLLLPSERRDFYIEFNVSSLLRGDQKSR 352 (419) T ss_pred HhCCCCCCccccHHHH-------------HHHHHHHHHHHHHHHHHHHHHh-hccCccccCCeEEEEechhhhccCHHHH Confidence 998655432 222222 1122334444444443322111 1111111223446666667677888999 Q ss_pred HHHHHHHHhcCCCCHHHHHHhcCCCCHH-HHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 455 LDYWMKMVAAGFATQKRGIAKTLGITEE-EAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 455 ~~~~~~~~~aGi~s~~~~i~~~~~~~de-ea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) ++...+++.+|+|++-+++.. .|+..- ...+-+- +..........++++. ..+.-.+.++ T Consensus 353 ~~~~~~~~~~G~~T~NE~R~~-~gl~p~~ggD~~~~-------~~n~~~~~~~~~~~~~-----~~~~~~~~~~ 413 (419) T protein:vir:57 353 YESYALGRQWGWLSVNDIRRM-ENLTPIPGGDKYLT-------PLNMVDSKALTGIGKA-----TPQQLKDIEA 413 (419) T ss_pred HHHHHHHHhCCCcCHHHHHHH-hCCCCCCCcCeeee-------ccccccccccccccCC-----CcccCcchhh Confidence 999999999999999997754 355421 1111000 0000000000000000 0111111111 No 225 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=95.28 E-value=0.0024 Score=34.97 Aligned_cols=388 Identities=12% Similarity=0.024 Sum_probs=163.0 Q ss_pred ChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHH-HHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 2 SLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNL-AYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 2 ~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~-~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) .| |.+....... ....++ .-|. .+..+....-. ..+.. ..-+...--..++ T Consensus 1 ~~---------------~~r~~~~~~~--~~~~~~------~~~~~~~~g~~~s~~~-~~vt~----~~al~~~~v~~~v 52 (419) T protein:vir:14 1 MF---------------FSRQLLSNLG--QTQMSA------GGWVSALLGSSRSDSG-QVVTP----ASALALTVLQNCV 52 (419) T ss_pred Cc---------------cccccccccc--ccccCc------chhhHHhhcCCCccCC-cccch----HHhhccHHHHHHH Confidence 11 2222111111 111111 1122 23322211100 00000 1111112223445 Q ss_pred HHHhhhhhcccceEee-CCH----HHHHHHHHHHhh--h---hHHHHHHHHHHHHHhcCCEEEEEEEeC-Cee-EEEEEc Q lcl|NC_019418. 81 KKIASLVYNEQAEISA-EDE----TLNDFLSDMLSN--D---RFNKNFERYLESALALGGLAMRPYVDG-DKI-RVAFIQ 148 (527) Q Consensus 81 ~~~A~ll~~e~~~i~~-~d~----~~~~~l~~~l~~--n---~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~-~i~~v~ 148 (527) +.+|+-+-+-|..+-- +++ ..+..|..+|.. | ....-....+...+..|.+++.+..+. |++ .+-.++ T Consensus 53 ~~ia~~iA~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~ 132 (419) T protein:vir:14 53 TLLAESIAQLPIELYERSGEDRKPATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLD 132 (419) T ss_pred HHHHHhhccCceEEEEecCCccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEec Confidence 5555555444443311 111 011123333321 1 122233445677777899988887764 343 455666 Q ss_pred CCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeec Q lcl|NC_019418. 149 APVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNL 228 (527) Q Consensus 149 a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l 228 (527) |+.+-+.. +.++ ..+|. + ... . + ++ T Consensus 133 ~~~v~v~~-~~~~-----------------~~~y~-------------------------~-----~~~--~--~--~~- 157 (419) T protein:vir:14 133 NEAVTVMR-GSDL-----------------KPVYR-------------------------V-----RGS--D--P--MP- 157 (419) T ss_pred CceEEEEE-CCCc-----------------eEEEE-------------------------E-----ccC--c--c--cc- Confidence 66655432 2111 11121 0 000 0 0 00 Q ss_pred ccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcC-cceeeechhHhc Q lcl|NC_019418. 229 SELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMG-QRRVIVPEQMTQ 307 (527) Q Consensus 229 ~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~-~~~i~v~~~~l~ 307 (527) .. .+.|++.+.. +..+|+|.+.-+...++.....-....+-|+.| +++.++ . T Consensus 158 --------~~---------~i~h~~~~~~-----dg~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil-----~ 210 (419) T protein:vir:14 158 --------QR---------LVHHVRWMSI-----NGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVI-----E 210 (419) T ss_pred --------hh---------heeEecCcCC-----CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEE-----E Confidence 00 1234443211 224699998888777765554443333445653 333333 1 Q ss_pred CCCCCCCc--ccc---cccccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccc Q lcl|NC_019418. 308 LKVQDNQG--NIA---FKRRFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMF 377 (527) Q Consensus 308 ~~~~~~~~--~~~---~~~~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~ 377 (527) ........ +.. +...|. ..|.+.+ .+ -++...++.++....+.++.+..+....+|+...|++|..+ T Consensus 211 ~~~~~~~~~~~~~~~~~~~~~~---~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~l 287 (419) T protein:vir:14 211 RPKDAPALKDQASVDRITDGWN---AKFGGSGNAKKVALLQEGMTFRPLSMTNVDAALIDALRLSALDIARIYKIPAHMV 287 (419) T ss_pred ecCCCCcccCHHHHHHHHHHHH---HHhcCccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHh Confidence 11111110 000 001110 1111110 00 01122355555555566777878778889999999999999 Q ss_pred ccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHH Q lcl|NC_019418. 378 TFDGQGV-KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELD 456 (527) Q Consensus 378 ~~~~~g~-~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~ 456 (527) +...++. .++.+. .+..++.+|.-++..|-...+. .++.......+.+.+++++-+..|..+.++ T Consensus 288 g~~~~~t~s~~E~~-------------~~~f~~~~L~P~~~~ie~~l~~-kll~~~~~~~~~i~fd~~~l~r~d~~~~~~ 353 (419) T protein:vir:14 288 NELERATFSNIEHQ-------------SLQFVIYTLLPWVKRHEQAKTR-DLLLPSERKQYFIEYNLAGLLRGDQSSRYA 353 (419) T ss_pred cCCCCCCcccHHHH-------------HHHHHHHHHHHHHHHHHHHHhh-hccCccccCCeEEEEechhhhccCHHHHHH Confidence 8654432 222222 1122334444444443222111 122222223344666666666778899999 Q ss_pred HHHHHHhcCCCCHHHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 457 YWMKMVAAGFATQKRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 457 ~~~~~~~aGi~s~~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) ...+++.+|+|++-+++.+. |+..-+ ...-+ .+.+-.. .++.++.+.++.....++++|. T Consensus 354 ~~~~~~~~G~~T~NE~R~~~-gl~p~~gGD~~~-------~~~n~~~---~~~~~~~~~~~~~~~~~~~~e~ 414 (419) T protein:vir:14 354 AYAVGRQWGWLSINDIRRLE-NMPPVKGGDIYL-------SPMNMVD---ASKPQQLPVGKSEPTKAAIDEI 414 (419) T ss_pred HHHHHHhCCCcCHHHHHHHh-CCCCCCCcCeee-------ecccccc---ccccccccCCCCCCccccccch Confidence 99999999999999977543 554211 00000 0000000 0122222333344455555555 No 226 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=95.28 E-value=0.0024 Score=34.97 Aligned_cols=385 Identities=12% Similarity=0.059 Sum_probs=152.7 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhh-ccCccc-c---CHHHHHHHHHHHHH--hcCCCcccccccccCccc-cCceee Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSIL-DHPKVA-V---TQSEFRRIQHNLAY--YQSKFDDIEYTNTDGDRK-RRKMQH 72 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~-~~~~i~-~---~~~~~~~i~~~~~~--y~g~~~~l~~~~~~~~~~-~~~~~~ 72 (527) |++|+.++.+-.--... .-....++. ..|++. . +.....+-..|... +.|-+..+. ...+... .+.... T Consensus 1 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~--~~~~~~~t~~~~~~ 77 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLP-NDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAWSGYPESWA--TPSWGSAQDKLRTL 77 (409) T ss_pred CchhhhhcccccCCCcc-cccccccccCCCCceeeccCCCcchhhhhccccccccccccccccc--ccCccccchhhHhh Confidence 99999888761100000 000000000 011110 0 11111222223221 222221111 0111111 111122 Q ss_pred cchHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHh--hhh--HHHHHHHHHHHHHhcCCEEEEEE-EeC-Ce-eEEE Q lcl|NC_019418. 73 LPIARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLS--NDR--FNKNFERYLESALALGGLAMRPY-VDG-DK-IRVA 145 (527) Q Consensus 73 lnl~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~--~n~--f~~~~~~~~~~a~~~G~~~~~~~-~d~-~~-~~i~ 145 (527) +..-...++.+|+-+-+-|..+--.+... +.+..++. -|. -...+.+.+...+.+|++++.+. .+. +. +.+. T Consensus 78 ~~~v~acV~~Ia~~iA~lpl~~~~~~~~~-~~~~~ll~~~PN~~~t~~~f~~~l~~~lllGnay~~~i~r~~~G~~~~L~ 156 (409) T protein:vir:83 78 IDVAWACIDLNASVLSSMPIYRMRNGRII-DSVAWMSNPDPEVYTSWQEFAKQLFWDFQLGEAFVLPMAHGSDGYPIRFR 156 (409) T ss_pred hHHHHHHHHHHHHhhccCceEEeeCCccc-cchhhhcccCCCCCCCHHHHHHHHHHHHhhCCcEEEEEEECCCCcEEEEE Confidence 23334556666666555444332111111 11111121 111 11223333444455688876544 443 33 3455 Q ss_pred EEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCce Q lcl|NC_019418. 146 FIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGER 225 (527) Q Consensus 146 ~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~ 225 (527) .++|+.+-+... .+ +..+|. |.. .+ + T Consensus 157 pl~p~~v~v~~~-~~-----------------g~~~y~-------------------------~~~-~~--------~-- 182 (409) T protein:vir:83 157 VVPPWLVNVELK-KG-----------------ARREYR-------------------------IGG-LN--------V-- 182 (409) T ss_pred EECCcceEEEEc-CC-----------------ceEEEE-------------------------Ecc-cc--------C-- Confidence 566655433211 11 111111 100 00 0 Q ss_pred eecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHH-HHcCcceeeechh Q lcl|NC_019418. 226 VNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWE-IKMGQRRVIVPEQ 304 (527) Q Consensus 226 v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e-~~~~~~~i~v~~~ 304 (527) +. -+.|+|...++ +..+|+|-+.-+...++-.. ...++... |..|.. |.. T Consensus 183 ----------~~----------eiiHir~~~~~----~~~~G~spi~~~~~~i~~~~-a~~~~~~~~f~nga~----p~g 233 (409) T protein:vir:83 183 ----------TD----------EILHIRYQGNT----ADAHGHGPLESAAPRQVVIG-LLQKYVQNLAETGGV----PLY 233 (409) T ss_pred ----------cc----------ceEEeCCCCCC----CCcccccHHHHHHHHHHHHH-HHHHHHHHHHhcCCC----cce Confidence 00 12355532222 23468888877777765333 34444444 344332 333 Q ss_pred HhcCCCCCCCccc-ccccccccccceeeeccCC-----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccc Q lcl|NC_019418. 305 MTQLKVQDNQGNI-AFKRRFDVEQNVYMQVGAG-----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFT 378 (527) Q Consensus 305 ~l~~~~~~~~~~~-~~~~~~d~~~~~~~~~~~~-----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~ 378 (527) ++..+........ .+...| ...|.+ +.+ .++....+.++..-.+.++.+..+...++|+...|++|..+| T Consensus 234 il~~~~~ls~e~~~~~~~~~---~~~~~~-nag~~~il~~g~~~~~~~~~s~~d~q~le~r~~~~~eIa~~fgVPp~llg 309 (409) T protein:vir:83 234 WLGVERRLSETEAVDLMDRW---IESRSK-YAGHPALVTGGATLNQAKSMSAQDLSLMELTQFNEARIAILLGVPPFLVG 309 (409) T ss_pred EeecCCCCCHHHHHHHHHHH---HHhhCC-ccCccceecCCcccccccCCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcc Confidence 3322221111000 000011 011111 110 011011122333444667888888888899999999999998 Q ss_pred cccccc-chHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHH Q lcl|NC_019418. 379 FDGQGV-KTATEIVSENSDTY-QMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELD 456 (527) Q Consensus 379 ~~~~g~-~TAtei~s~~~~~~-~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~ 456 (527) ....+. .|-..+.......+ .++.-+.+.++.+|..- +. +....+.++++.-+..|..+.++ T Consensus 310 ~~~~~~~~tysn~eq~~~~f~~~tL~P~~~~ie~~l~~~------------Ll----~~~~~~~f~~~~llr~d~~~r~~ 373 (409) T protein:vir:83 310 LPGATGSLTYSNIEQLFSFHDRSSLRPKATAVMAALDRW------------AL----PSPQHLELNRDDYTRPSLVERAT 373 (409) T ss_pred CCCCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHh------------hC----CCCcEEEeehhhhhccCHHHHHH Confidence 654322 12111111111111 23333333333333321 11 11234667777767788888888 Q ss_pred HHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCC Q lcl|NC_019418. 457 YWMKMVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQ 511 (527) Q Consensus 457 ~~~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~ 511 (527) ...+++++|+|++-+++++. |++..+ ....+ ..++- T Consensus 374 ~~~~~~~~G~lT~NE~R~~~-glpp~~-----------------ggd~l-~~~gv 409 (409) T protein:vir:83 374 AYKIMIEAGVMEPNEARAME-RLHSEA-----------------AAVRL-SGGGV 409 (409) T ss_pred HHHHHHhCCCcCHHHHHHHh-CCCCCC-----------------CCccc-CCCCC Confidence 88899999999988865432 443210 00000 00000 No 227 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=95.15 E-value=0.0027 Score=34.70 Aligned_cols=431 Identities=10% Similarity=0.125 Sum_probs=195.3 Q ss_pred CCh--HHHHHHHHHH---HHHHhhcccchhhh-cc-----CccccC--H-------------------HHHHHHHHHHHH Q lcl|NC_019418. 1 MSL--IQKVKDFFNR---GRYNMTTSHLSSIL-DH-----PKVAVT--Q-------------------SEFRRIQHNLAY 48 (527) Q Consensus 1 m~~--~~~~k~~~~~---~~~~~~~~~~~~~~-~~-----~~i~~~--~-------------------~~~~~i~~~~~~ 48 (527) |++ ..-++-|++. .....+.....+.. ++ ..|..+ . ....-|++++.+ T Consensus 1 m~~~~l~lf~f~~k~~e~~~~~~~~~~~~s~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~~~~n~~eLI~~YR~m 80 (521) T protein:vir:10 1 MNPIFLKLLQPWMKDDEKRVQSDLSDRIDSFAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAPKIQNTKDLINQYRSL 80 (521) T ss_pred CCcchhHHhhhhhhhhhhHHhhhhccCccccccccCCCCceeeccCCCccccccchhhhhhccccccchHHHHHHHHHHH Confidence 554 2223333321 11111110111111 11 011110 0 111222222222 Q ss_pred hcCCCcccccccccCccccCceeecchHHHHHHHHhh-hhhc----ccceEeeCCH--------HHHHHHHHHHhhhhHH Q lcl|NC_019418. 49 YQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIAS-LVYN----EQAEISAEDE--------TLNDFLSDMLSNDRFN 115 (527) Q Consensus 49 y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~-ll~~----e~~~i~~~d~--------~~~~~l~~~l~~n~f~ 115 (527) +.. | --...++...+ -+.. .|+++.+++. ...+.++.++.=-+|. T Consensus 81 a~~--p--------------------Evd~Av~eIvneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~ 138 (521) T protein:vir:10 81 SKY--H--------------------EVDNAIDEIINDAIVQEDNRDTVYLDLDKTDWNESVKEMVREEFRTILKLLKFE 138 (521) T ss_pred hhc--c--------------------chhhHHHhhhcceEEecCCCceEEEEecCcccchHHHHHHHHHHHHHHHHhccc Confidence 111 1 11112222222 1111 2555555532 2345566677666899 Q ss_pred HHHHHHHHHHHhcCCEEEEEEEeCCe-----eEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceE-EE-EEEEE Q lcl|NC_019418. 116 KNFERYLESALALGGLAMRPYVDGDK-----IRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVY-YT-LVEFH 188 (527) Q Consensus 116 ~~~~~~~~~a~~~G~~~~~~~~d~~~-----~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~-yt-~lE~h 188 (527) +...+.+....+-|..+|+..+|.++ ..+.+++|.++-++.. +..++.++++ ++ ..|+. T Consensus 139 ~~~~~~fR~WYVDgRi~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~--------------i~k~~~~~~~v~~~~~e~f 204 (521) T protein:vir:10 139 REGKRHFRRWYVDSRIYFHKMIDPARPKDGIKELRLLDPRNVEYYRV--------------NLKSNENGNDVYKGVKEFF 204 (521) T ss_pred hhhhHHHhhheeeeeEEEEEEeeCCCccccceeeeeeCCcceeeeee--------------ecCCCCCcchhhccceeee Confidence 99999999999999999999998533 4578888887655421 1111111110 00 01111 Q ss_pred eecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCc Q lcl|NC_019418. 189 EWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGL 268 (527) Q Consensus 189 ~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~ 268 (527) .+. +..... |........|-.+|- ..+++...+ - .+...+..+ T Consensus 205 ~Y~---------~~~~~~-------~~~~g~~~~~vkI~~--------daI~y~hSG-----L--------~d~~~~~i~ 247 (521) T protein:vir:10 205 TYG---------ATEDNR-------YNISGNSNNLVQIPI--------DAIVYSHSG-----K--------VDIDGKTIV 247 (521) T ss_pred eec---------cCCCce-------ecCCCCCCcceeech--------hheeeeccc-----c--------eeCCCCcee Confidence 000 000000 100000011111111 112221100 0 122345567 Q ss_pred chhhhhHHHHHHHHHHHHHHH--HHHHcCcceeeec----------hhHh---------cCCCCCCCcccccccccccc- Q lcl|NC_019418. 269 SIFDNAKTTIDFINRTYDEFM--WEIKMGQRRVIVP----------EQMT---------QLKVQDNQGNIAFKRRFDVE- 326 (527) Q Consensus 269 S~~~~~~~lid~ld~~~s~~~--~e~~~~~~~i~v~----------~~~l---------~~~~~~~~~~~~~~~~~d~~- 326 (527) |-+..|.-.+..|=-.-+.++ +-.|+-.+|||-- +.++ +..-|..+|++.-.+.+-.- T Consensus 248 syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~msMl 327 (521) T protein:vir:10 248 GYLHNVIKPANQLKMLEDAMVIYRITRAPERRVFYIDVGTMPNKKATQHLNNVMQGLKNRVVYDSSTGKVKNSSNNLAMT 327 (521) T ss_pred ccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchhhhhhH Confidence 778888777777765555544 4446666777641 1111 00113344443322211100 Q ss_pred cceeeeccCCCCC-CCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccc--cchHHHHHHHHHHHHHHHHH Q lcl|NC_019418. 327 QNVYMQVGAGNMD-SGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQG--VKTATEIVSENSDTYQMRNS 403 (527) Q Consensus 327 ~~~~~~~~~~~~~-~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g--~~TAtei~s~~~~~~~~~~~ 403 (527) ...+.+- -+|+ ..-|+++..-=...+ ...+..+.+.+....+++.+-+..+++| .--++||.-..-.....+.+ T Consensus 328 EDyWLpR--ReGgrgTEI~TLpggqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr~~EItRDEikF~KFI~r 404 (521) T protein:vir:10 328 EDYWLMR--RDGKATTEVSTLPGAQSMGE-MDDVRWFNRKLYESMKIPLSRLPQEGAGVTFGAGNDITRDELQFTKYIRG 404 (521) T ss_pred hhhcccc--cCCCCccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCccccCCCCCceecccccchhHHHHHHHHHHHH Confidence 0001110 1122 123566554333333 4556666777777777777766555332 11245676666666677888 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhcccCCcccCc--cceEEEeCCCccCCHHHHHHHHH---HHH--hcC------CCCHH Q lcl|NC_019418. 404 IVALVEQSIKELCVSMCELGKVVGIYRGTIPEL--DDISVNLDDGVFTDRHAELDYWM---KMV--AAG------FATQK 470 (527) Q Consensus 404 ~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~--~~v~v~f~d~i~~d~~~~~~~~~---~~~--~aG------i~s~~ 470 (527) .+..|..-+.++++.=|.|-.. +...-+.. ..|.++|...=-..+..+++... .+. ..+ ..|.+ T Consensus 405 LR~rFs~~f~~~L~~qLilKgi---it~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~yvGky~s~d 481 (521) T protein:vir:10 405 LQQQFEPIFLNPLRTNLMLKGK---MSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEVTGKYLSHE 481 (521) T ss_pred HHHHHHHHHHHHHHHhhhhccC---CCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCccccccccchH Confidence 8888888888888876655332 21111111 34667775433233333322222 111 123 57888 Q ss_pred HHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCC Q lcl|NC_019418. 471 RGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSK 518 (527) Q Consensus 471 ~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 518 (527) ++.++....||+|.+++-.+|++|.... .+.+++++ .++- T Consensus 482 yi~k~ILr~tDeeik~~~k~I~~E~~~~------~~~~p~~e--~~df 521 (521) T protein:vir:10 482 YVMKNILRMSDEDIKTEREKIDGELKDS------VYKNPEDP--MEEF 521 (521) T ss_pred HHHHHHhcCCHhHHHHHHHHHHHhhhCC------CCCCCcch--hhcC Confidence 8878888999999999999999987431 11111110 0011 No 228 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=95.08 E-value=0.0028 Score=34.56 Aligned_cols=383 Identities=12% Similarity=0.093 Sum_probs=145.5 Q ss_pred HHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHH--HHHHHHhhhhhccc Q lcl|NC_019418. 14 GRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIAR--TAAKKIASLVYNEQ 91 (527) Q Consensus 14 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~--~i~~~~A~ll~~e~ 91 (527) |. .|.+. .. .....++ -|..++.|.... .+-.. ..+..|. .+++.+|+-+-.=| T Consensus 1 m~--~f~~~-~~----~~~~~~~-------~~~~~~~~~~~~-~~~~~---------~Al~~~~V~~~i~~Ia~~iA~lp 56 (406) T protein:vir:97 1 MS--FFQPL-GT----SKVSYDD-------YISSVLAGDVSQ-KYLGV---------SALKNSDILTATSIIAGDIARFP 56 (406) T ss_pred Cc--ccccc-CC----CCCCcch-------HHHHHhcCCCCc-ccccc---------hhhccHHHHHHHHHHHHhhhhCe Confidence 11 12111 11 1111111 133344443221 00000 1122221 23344444333323 Q ss_pred ceEeeCCH--HHHHHHHHHHh--hh---hHHHHHHHHHHHHHhcCCEEEEEEEeC--Ce-eEEEEEcCCceEEEEEcCCc Q lcl|NC_019418. 92 AEISAEDE--TLNDFLSDMLS--ND---RFNKNFERYLESALALGGLAMRPYVDG--DK-IRVAFIQAPVFLPLQSNTQD 161 (527) Q Consensus 92 ~~i~~~d~--~~~~~l~~~l~--~n---~f~~~~~~~~~~a~~~G~~~~~~~~d~--~~-~~i~~v~a~~~~P~~~d~~~ 161 (527) ..+.-.+. .....+..+|. -| ....-.+..+...+-.|.+++.+..++ +. ..+..++|+++-+...+ ++ T Consensus 57 ~~~~~~~g~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~v~v~~~~-~~ 135 (406) T protein:vir:97 57 LVKKDVNGDIIHDEDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYRPSETTVEETD-NH 135 (406) T ss_pred eEEEecCccccccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEECCCeeEEEEcC-Cc Confidence 32221111 11122333332 11 122334446666777899988887763 33 35666777766543211 11 Q ss_pred eEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceee Q lcl|NC_019418. 162 VSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPI 241 (527) Q Consensus 162 ~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~ 241 (527) ..+|+.-+ . ..|..+.+ + +.+ T Consensus 136 -----------------~~~y~~~~-----------------------------~----~~~~~~~~---~---~~e--- 156 (406) T protein:vir:97 136 -----------------EIVYTFTD-----------------------------M----LTAKQVKC---F---AHD--- 156 (406) T ss_pred -----------------eEEEEEEe-----------------------------c----CCceEEEE---c---ccc--- Confidence 11221000 0 00111110 0 000 Q ss_pred cCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcC-cceeee-chhHhcCCCCCCCccccc Q lcl|NC_019418. 242 QGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMG-QRRVIV-PEQMTQLKVQDNQGNIAF 319 (527) Q Consensus 242 ~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~-~~~i~v-~~~~l~~~~~~~~~~~~~ 319 (527) +.||+.+.. +...|+|.+.-+...+.....+-.-..+-|+.| .+.+++ +...+ +.+. .-.. T Consensus 157 -------vih~r~~~~-----dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~~i~~~~~~l----~~e~-~~~~ 219 (406) T protein:vir:97 157 -------VIHWKFFSH-----DTILGRSPLLSLGDEIDLQTGGINTLIKFFKDGFSSGILTMKGAQL----SGDA-RQRA 219 (406) T ss_pred -------EEEecCCCC-----CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEecCCCC----CHHH-HHHH Confidence 235543211 123488888777776654333222222224443 222222 11111 0000 0000 Q ss_pred ccccccccceeeeccCCC----CCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHH Q lcl|NC_019418. 320 KRRFDVEQNVYMQVGAGN----MDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENS 395 (527) Q Consensus 320 ~~~~d~~~~~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~ 395 (527) ...|. ..+.+-+.+. .++..++.++....+.++++..+....+|+...|++|..+|....+. +..+.. T Consensus 220 ~~~~~---~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~~-~~e~~~---- 291 (406) T protein:vir:97 220 RQEFE---KMREGSVGGSPLVFDSTMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVNSPNQ-SVAQLM---- 291 (406) T ss_pred HHHHH---HHhcccccCceeecCCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCCCcc-hHHHHH---- Confidence 01111 1111111100 12224555555556667888777778899999999999998543332 111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHh Q lcl|NC_019418. 396 DTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAK 475 (527) Q Consensus 396 ~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~ 475 (527) +..++.+|..++..|-..... .++....... ..|.|+- ..+....++...+++.+|+|++-+++.+ T Consensus 292 ---------~~f~~~~l~P~~~~ie~~l~~-kll~~~~~~~--~~i~fd~--~~~~~~~~~~~~~~~~~g~~T~NE~R~~ 357 (406) T protein:vir:97 292 ---------EDYVTNDLPFYFDAITSELGL-KTLNDKDRRL--YHIEFDT--RSVTGRNVDEIVKLVNNQILTPNQGLVE 357 (406) T ss_pred ---------HHHHHHHHHHHHHHHHHHHhh-hhcChhhccc--eeEEEec--CccchhhHHHHHHHHhCCCcCHHHHHHH Confidence 111223333333332221110 1111111122 2344532 2234455666778889999999997765 Q ss_pred cCCCC---HHHHHHHHHHHHHhcccccccccCCCCCCC--CCCCCCCCCCCCccccC Q lcl|NC_019418. 476 TLGIT---EEEAEKELAEINGELPPESDAELALYGKGQ--QNTVGNSKDTVDDEDEA 527 (527) Q Consensus 476 ~~~~~---deea~~el~ri~~E~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~ 527 (527) + |+. +....+-+.. ....+ .+.-.+.+ .......+++.+.+++| T Consensus 358 ~-g~~p~~~~~gD~~~~~--~n~~~-----~~~~~~~~~~~~~~~~gg~~~~~~~~~ 406 (406) T protein:vir:97 358 L-GKQKSTDPNMDRYQSS--LNYVF-----LDKKEEYQDKVGIKGKGGEVNAEEDKS 406 (406) T ss_pred h-CCCCCCCCCCCeEeec--cCccc-----hhcccccccccccccCCCCCCCCCCCC Confidence 4 433 2111111000 00000 00000000 11122344444555555 No 229 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=94.86 E-value=0.0033 Score=34.17 Aligned_cols=395 Identities=12% Similarity=0.082 Sum_probs=162.0 Q ss_pred hcccchhhhccC-cc-ccCHHHHHHHHHHHHHhcCCCc-ccccccccCc-----cccCc----eeecchHHHHHHHHhhh Q lcl|NC_019418. 19 TTSHLSSILDHP-KV-AVTQSEFRRIQHNLAYYQSKFD-DIEYTNTDGD-----RKRRK----MQHLPIARTAAKKIASL 86 (527) Q Consensus 19 ~~~~~~~~~~~~-~i-~~~~~~~~~i~~~~~~y~g~~~-~l~~~~~~~~-----~~~~~----~~~lnl~~~i~~~~A~l 86 (527) |+..|-..-..+ +. +........|.. ++-+.+.+. .++......- ...+- ....-+...+ .+...- T Consensus 1 m~~~i~~~~g~p~~~~~~~~~~~~~ia~-~~~~~~~~~~~~~~~~~~~iLr~~~~~~~~y~~m~~D~~i~s~l-~~Rk~a 78 (491) T protein:vir:10 1 MSKGLWVSPTEFVTFGEPDKSLSSQIAT-RARSIDFFALGMYLPNPDPVLKALGKDIRVYRELRADAHVGGCV-RRRKAA 78 (491) T ss_pred CCCceeCCCCCccCcccCChHHHHHHHh-hhcccccccccCCccchHHHHHhcCCCHHHHHHHhhChHHHHHH-HHHHHH Confidence 222221111111 11 112222233321 211111110 0000000000 00000 0001122222 222334 Q ss_pred hhcccceEee--CCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe--CCee---EEEEEcCCceEEEEEcC Q lcl|NC_019418. 87 VYNEQAEISA--EDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD--GDKI---RVAFIQAPVFLPLQSNT 159 (527) Q Consensus 87 l~~e~~~i~~--~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d--~~~~---~i~~v~a~~~~P~~~d~ 159 (527) |++.+-+|.. +++...+++.+++++-.|...+..++ +|..+|-+++-+.|. ++.+ ++.++++..|.+ +. T Consensus 79 v~~~~w~i~~~~~~~~~~e~v~e~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~---d~ 154 (491) T protein:vir:10 79 VKALEWGLDRGKAKSRVAKSIADVFADLDLSRIVTEML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFVY---DP 154 (491) T ss_pred HhCCCcEEecCCCCHHHHHHHHHHHhcCCHHHHHHHHH-HhhhhcceeEEEEEeecCCeeEEEEeeeecccceee---cc Confidence 5566656653 34456789999998888888888776 688899999988885 3322 234444433221 11 Q ss_pred CceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccce Q lcl|NC_019418. 160 QDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVT 239 (527) Q Consensus 160 ~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~ 239 (527) ++ ...| +..+...-|.+ +++. T Consensus 155 ~~-----------------~l~~--------------------------------~~~~~~~~g~~---------l~~~- 175 (491) T protein:vir:10 155 EN-----------------QLRF--------------------------------RSKDHWMQGEE---------LPAR- 175 (491) T ss_pred CC-----------------ceEE--------------------------------ecCCCCCCcce---------ecCC- Confidence 11 0001 10011111111 1111 Q ss_pred eecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHc-CcceeeechhHhcCCCCCCCcccc Q lcl|NC_019418. 240 PIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKM-GQRRVIVPEQMTQLKVQDNQGNIA 318 (527) Q Consensus 240 ~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~-~~~~i~v~~~~l~~~~~~~~~~~~ 318 (527) -|.++.. ....++|+|.|.+..|....---+..+..|+.=++. |.+-++. +.+ .+...+ T Consensus 176 --------k~i~~~~----~~~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~ig-----ky~-~~a~~~-- 235 (491) T protein:vir:10 176 --------KFLVPRQ----EATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVG-----KHP-RSASDG-- 235 (491) T ss_pred --------CEEEEEe----cCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEE-----ecC-CCCCHH-- Confidence 1222221 112356899999999988776666666666544443 4433322 111 111100 Q ss_pred cccccccccceeeeccCCC----CCCCcceEeccccC---hHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHH Q lcl|NC_019418. 319 FKRRFDVEQNVYMQVGAGN----MDSGGIVDLTTPIR---SSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIV 391 (527) Q Consensus 319 ~~~~~d~~~~~~~~~~~~~----~~~~~i~~~~~~ir---~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~ 391 (527) ....+ .+.+ ..+..+. .....|+.++.... .+.|.+.++.+-++|+... ++ +|++-+++|...+.++- T Consensus 236 ek~~l--~~al-~~~~~~a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~i-LG-qtlTt~~~gs~a~~~vh 310 (491) T protein:vir:10 236 EKNLL--LDCL-EDMVQDAVAVVPDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIAL-LG-QNQTTEATSTRASAQAG 310 (491) T ss_pred HHHHH--HHHH-HHHhcCcEEEecCCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHH-hh-hhcccCcccchhHHHHH Confidence 00000 0001 0010000 11223555554322 2335555555545543322 22 33433333322222331 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHH Q lcl|NC_019418. 392 SENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKR 471 (527) Q Consensus 392 s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~ 471 (527) .. -...-++.-.+.+...|.+|++.++.+ ++.+ . ..+.+.|... ..+.++.++.+.+++..|+--.+. T Consensus 311 ~~--v~~di~~~D~~~i~~tln~li~~l~~~----N~~~---~--~~p~f~~~~~-~e~~~~~a~~~~~L~~~G~~i~~~ 378 (491) T protein:vir:10 311 LE--VTDDIRDGDKAVVSEAMNMLIRWICDL----NFDG---A--DRPVFDMWEQ-EQVDEIQAGRDQKLTQAGARFTPA 378 (491) T ss_pred HH--HHHHHHHHHHHHHHHHHHHHHHHHHHh----cCCC---C--CcceEEecCc-CchhHHHHHHHHHHHhCCCcCCHH Confidence 11 122333344566778888888877755 2211 1 1245777653 333356788888999999866667 Q ss_pred HHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 472 GIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 472 ~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) ++.+.+|+++.+-.+.....+..+.. +.. ..........++.+. T Consensus 379 ~i~e~~Gip~~~~~~~~~~~~~~~~~-----~~~-------~~~~~~~~~~~~~d~ 422 (491) T protein:vir:10 379 YFKRAYNLQDGDLDERPLPVSAVDTV-----GAA-------SFAEFEAPDQDALDA 422 (491) T ss_pred HHHHHhCCCCCCcCccccccCCCCCc-----ccc-------cccccCCCCCCchHH Confidence 78898998754322221111110000 000 000001111111111 No 230 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=94.62 E-value=0.0039 Score=33.77 Aligned_cols=432 Identities=13% Similarity=0.137 Sum_probs=189.8 Q ss_pred CCh--HHHHHHHHHHHHHHhhcccch----hhh-----c-cCccccCH-----------------------HHHHHHHHH Q lcl|NC_019418. 1 MSL--IQKVKDFFNRGRYNMTTSHLS----SIL-----D-HPKVAVTQ-----------------------SEFRRIQHN 45 (527) Q Consensus 1 m~~--~~~~k~~~~~~~~~~~~~~~~----~~~-----~-~~~i~~~~-----------------------~~~~~i~~~ 45 (527) |++ ...+|-|-+. -..-..+..+ +.+ | ..++.++. ....-|+++ T Consensus 1 m~~~~L~~~~~w~~~-de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~Y 79 (524) T protein:vir:72 1 MKFNVLSLFAPWAKM-DERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTY 79 (524) T ss_pred CCCchhhHhhccccC-cchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHH Confidence 666 4444444321 1000101000 000 0 01111110 111222222 Q ss_pred HHHhcCCCcccccccccCccccCceeecchHHHHHHHHhh-hhh----cccceEeeCCHH--------HHHHHHHHHhhh Q lcl|NC_019418. 46 LAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIAS-LVY----NEQAEISAEDET--------LNDFLSDMLSND 112 (527) Q Consensus 46 ~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~-ll~----~e~~~i~~~d~~--------~~~~l~~~l~~n 112 (527) +.+... |. -...++...+ -+. .+|+++.+++.. ..+.++.++.-- T Consensus 80 R~ma~~--pE--------------------vd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll 137 (524) T protein:vir:72 80 RNLMNN--YE--------------------VDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFSDVLNHL 137 (524) T ss_pred HHHhhc--cc--------------------hhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHHh Confidence 222111 11 1111222221 111 124555555432 455666777777 Q ss_pred hHHHHHHHHHHHHHhcCCEEEEEEEeCCe-----eEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceE-EEEEE Q lcl|NC_019418. 113 RFNKNFERYLESALALGGLAMRPYVDGDK-----IRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVY-YTLVE 186 (527) Q Consensus 113 ~f~~~~~~~~~~a~~~G~~~~~~~~d~~~-----~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~-yt~lE 186 (527) +|.++..+.+....+-|..+|+.++|..+ ..+.+++|.++-++. .+..+...+++ ++ . T Consensus 138 ~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr--------------~i~~~~~~~~~vi~--~ 201 (524) T protein:vir:72 138 SFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVR--------------EIITETEAGTKIVK--G 201 (524) T ss_pred ccchhhhHHHhhheeeeEEEEEEEEeCCCccccceeeeeeCCccceeee--------------eeccCCCccchhhc--c Confidence 89999999999999999999999998543 457888888765542 22222222111 11 1 Q ss_pred EEeecccccccceeeecCCceEEEEEEEecCCc-cccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCc Q lcl|NC_019418. 187 FHEWVTPTGQEVGSTKDKSLYRITNELYKSTSD-SQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSP 265 (527) Q Consensus 187 ~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~-~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~sp 265 (527) +.++.- + .+ .+.-|..... ...|..|-+ ++ ..+++.. .+.++.... T Consensus 202 ~~e~f~-------Y----~~---~~~~y~~~g~~~~~~~~ikI---~~---dAI~y~h-----SGL~d~~~~-------- 248 (524) T protein:vir:72 202 YKEYFI-------Y----DT---AHESYACDGRMYEAGTKIKI---PK---AAVVYAH-----SGLVDCCGK-------- 248 (524) T ss_pred hhhhee-------e----cc---CccccccCccccCCCcceec---ch---hheeeee-----ccceeCCCC-------- Confidence 000000 0 00 0000100000 001111110 00 1111111 011111100 Q ss_pred cCcchhhhhHHHHHHHHHHHHHHH--HHHHcCcceeee-c---------hhHh---------cCCCCCCCcccccccccc Q lcl|NC_019418. 266 LGLSIFDNAKTTIDFINRTYDEFM--WEIKMGQRRVIV-P---------EQMT---------QLKVQDNQGNIAFKRRFD 324 (527) Q Consensus 266 lG~S~~~~~~~lid~ld~~~s~~~--~e~~~~~~~i~v-~---------~~~l---------~~~~~~~~~~~~~~~~~d 324 (527) .=+|-+..|.-.+..|=..-+.++ +-.|+-.+|||- . +.++ +..-|..+|++.-.+.+- T Consensus 249 ~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGev~ddrk~m 328 (524) T protein:vir:72 249 NIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQHNM 328 (524) T ss_pred ceeccchhhhHhHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhh Confidence 013445555555555544444433 444566667753 1 0111 001133444433222111 Q ss_pred cc-cceeeeccCCCCC-CCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccc-cc--cchHHHHHHHHHHHHH Q lcl|NC_019418. 325 VE-QNVYMQVGAGNMD-SGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDG-QG--VKTATEIVSENSDTYQ 399 (527) Q Consensus 325 ~~-~~~~~~~~~~~~~-~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~-~g--~~TAtei~s~~~~~~~ 399 (527) .- ...+.+- -+|+ ..-|+++...=...+ ...+..+.+.+....+++.+-+..++ ++ .--++||.-..-.... T Consensus 329 sMlEDyWLpR--ReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~K 405 (524) T protein:vir:72 329 SMTEDYWLQR--RDGKAVTEVDTLPGADNTGN-MEDIRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAK 405 (524) T ss_pred hhHhhhcccc--cCCCcccceeeccccCCcCh-HHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHH Confidence 00 0001110 1122 223566554433333 45566667777777777777663332 22 1236677666666677 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCc--cceEEEeCCCccCCHHHHHHHHHH---HH--hcC----CCC Q lcl|NC_019418. 400 MRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPEL--DDISVNLDDGVFTDRHAELDYWMK---MV--AAG----FAT 468 (527) Q Consensus 400 ~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~--~~v~v~f~d~i~~d~~~~~~~~~~---~~--~aG----i~s 468 (527) .+.+.+..|..-+.++++.=|.|-.. +...-+.. ..|.++|...=-..+..+++.... +. ..+ ..| T Consensus 406 FI~rLR~rFs~~f~~~Lk~qLilKgi---it~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s 482 (524) T protein:vir:72 406 FIRELQHKFEEVFLDPLKTNLLLKGI---ITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINMLTMAEPFIGKYIS 482 (524) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhccC---CCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccch Confidence 78888888888888888876655332 21111111 346677754333333333332221 11 112 358 Q ss_pred HHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCC Q lcl|NC_019418. 469 QKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSK 518 (527) Q Consensus 469 ~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 518 (527) .+++.++....||+|.+++..+|++|... +.+.++++ +.++- T Consensus 483 ~~yi~k~ILr~tDeei~~~~k~I~~E~k~------~~~~~~~~--~~~~f 524 (524) T protein:vir:72 483 HRTAMKDILQMTDEEIEQEAKQIEEESKE------ARFQDPDQ--EQEDF 524 (524) T ss_pred hHHHHHHHhccCHHHHHHHHHHHHHHhhc------CCCCCCch--hhhcC Confidence 88888888899999999999999998642 11111111 11111 No 231 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=94.55 E-value=0.0041 Score=33.67 Aligned_cols=432 Identities=13% Similarity=0.136 Sum_probs=189.8 Q ss_pred CCh--HHHHHHHHHHHHHHhhcccch----hhh-----c-cCccccCH-----------------------HHHHHHHHH Q lcl|NC_019418. 1 MSL--IQKVKDFFNRGRYNMTTSHLS----SIL-----D-HPKVAVTQ-----------------------SEFRRIQHN 45 (527) Q Consensus 1 m~~--~~~~k~~~~~~~~~~~~~~~~----~~~-----~-~~~i~~~~-----------------------~~~~~i~~~ 45 (527) |++ ...+|-|-+. -..-..+..+ +.+ | ..++.++. ....-|+++ T Consensus 1 m~~~~L~~~~~w~~~-de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~Y 79 (524) T protein:vir:10 1 MKFNVLSLFAPWAKM-DERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTY 79 (524) T ss_pred CCCchhhHhhccccC-cchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHH Confidence 666 4444444321 1000101000 000 0 01111110 111222222 Q ss_pred HHHhcCCCcccccccccCccccCceeecchHHHHHHHHhh-hhh----cccceEeeCCHH--------HHHHHHHHHhhh Q lcl|NC_019418. 46 LAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIAS-LVY----NEQAEISAEDET--------LNDFLSDMLSND 112 (527) Q Consensus 46 ~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~-ll~----~e~~~i~~~d~~--------~~~~l~~~l~~n 112 (527) +.+... |. -...++...+ -+. .+|+++.+++.. ..+.++.++.-- T Consensus 80 R~ma~~--pE--------------------vd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll 137 (524) T protein:vir:10 80 RNLMNN--YE--------------------VDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFNDVLNHL 137 (524) T ss_pred HHHhhc--cc--------------------hhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHHh Confidence 222111 11 1111222221 111 124555555422 455667777777 Q ss_pred hHHHHHHHHHHHHHhcCCEEEEEEEeCCe-----eEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceE-EEEEE Q lcl|NC_019418. 113 RFNKNFERYLESALALGGLAMRPYVDGDK-----IRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVY-YTLVE 186 (527) Q Consensus 113 ~f~~~~~~~~~~a~~~G~~~~~~~~d~~~-----~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~-yt~lE 186 (527) +|.++..+.+....+-|..+|+.++|..+ ..+.+++|.++-++. .+..+...+++ ++ . T Consensus 138 ~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr--------------~i~~~~~~~~~vi~--~ 201 (524) T protein:vir:10 138 SFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVR--------------EIITETEAGTKIVK--G 201 (524) T ss_pred ccchhhhHHHhhheeeeEEEEEEEeeCCCccccceeeeeeCCccceeee--------------eeccCCCccchhhc--c Confidence 89999999999999999999999998543 457888888765542 22222222111 11 1 Q ss_pred EEeecccccccceeeecCCceEEEEEEEecCCc-cccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCc Q lcl|NC_019418. 187 FHEWVTPTGQEVGSTKDKSLYRITNELYKSTSD-SQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSP 265 (527) Q Consensus 187 ~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~-~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~sp 265 (527) +.++.- + .+ .+.-|..... ...|..|-+ ++ ..+++.. .+.++.... T Consensus 202 ~~e~f~-------Y----~~---~~~~y~~~g~~~~~~~~ikI---~~---dAI~y~h-----SGL~d~~~~-------- 248 (524) T protein:vir:10 202 YKEYFI-------Y----DT---AHESYACDGRMYEAGTKIKI---PK---AAIVYAH-----SGLVDCCGK-------- 248 (524) T ss_pred hhhhee-------e----cc---CccccccCccccCCCcceec---ch---hheeeee-----ccceeCCCC-------- Confidence 000000 0 00 0000100000 001111110 00 1111111 011111100 Q ss_pred cCcchhhhhHHHHHHHHHHHHHHH--HHHHcCcceeee-c---------hhHh---------cCCCCCCCcccccccccc Q lcl|NC_019418. 266 LGLSIFDNAKTTIDFINRTYDEFM--WEIKMGQRRVIV-P---------EQMT---------QLKVQDNQGNIAFKRRFD 324 (527) Q Consensus 266 lG~S~~~~~~~lid~ld~~~s~~~--~e~~~~~~~i~v-~---------~~~l---------~~~~~~~~~~~~~~~~~d 324 (527) .=+|-+..|.-.+..|=..-+.++ +-.|+-.+|||- . +.++ +..-|..+|++.-.+.+- T Consensus 249 ~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGev~ddrk~m 328 (524) T protein:vir:10 249 NIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQHNM 328 (524) T ss_pred ceeccchhhhHHHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhh Confidence 013445566555555544444433 444566667753 1 0111 001133444433222111 Q ss_pred cc-cceeeeccCCCCC-CCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccc-cc--cchHHHHHHHHHHHHH Q lcl|NC_019418. 325 VE-QNVYMQVGAGNMD-SGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDG-QG--VKTATEIVSENSDTYQ 399 (527) Q Consensus 325 ~~-~~~~~~~~~~~~~-~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~-~g--~~TAtei~s~~~~~~~ 399 (527) .- ...+.+- -+|+ ..-|+++...=...+ ...+..+.+.+....+++.+-+..++ ++ .--++||.-..-.... T Consensus 329 sMlEDyWLpR--ReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~K 405 (524) T protein:vir:10 329 SMTEDYWLQR--RDGKAVTEVDTLPGADNTGN-MEDVRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAK 405 (524) T ss_pred hhHhhhcccc--cCCCcccceeeccccCCcCh-HHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHH Confidence 00 0001110 1122 223566554433333 45566667777777777777663332 22 1236677666666677 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCc--cceEEEeCCCccCCHHHHHHHHHH---HH--hcC----CCC Q lcl|NC_019418. 400 MRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPEL--DDISVNLDDGVFTDRHAELDYWMK---MV--AAG----FAT 468 (527) Q Consensus 400 ~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~--~~v~v~f~d~i~~d~~~~~~~~~~---~~--~aG----i~s 468 (527) .+.+.+..|..-+.++++.=|.|-.. +...-+.. ..|.++|...=-..+..+++.... +. ..+ .+| T Consensus 406 FI~rLR~rFs~~f~~~Lk~qLilKgi---it~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s 482 (524) T protein:vir:10 406 FIRELQHKFEEVFLDPLKTNLLLKGI---ITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINMLTMAEPFIGKYIS 482 (524) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhccC---CCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccch Confidence 78888888888888888876655332 21111111 346677754333333333332221 11 112 358 Q ss_pred HHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCC Q lcl|NC_019418. 469 QKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSK 518 (527) Q Consensus 469 ~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 518 (527) .+++.++....||+|.+++..+|++|... +.+.++++ +.++- T Consensus 483 ~~yi~k~ILr~tDeei~~~~k~I~~E~k~------~~~~~~~~--~~~~f 524 (524) T protein:vir:10 483 HRTAMKDILQMTDEEIEQEAKQIEEESKE------ARFQDPDQ--EQEDF 524 (524) T ss_pred hHHHHHHHhccCHHHHHHHHHHHHHHhhc------CCCCCCch--hhhcC Confidence 88888888899999999999999998642 11111111 11111 No 232 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=94.54 E-value=0.0037 Score=33.95 Aligned_cols=196 Identities=14% Similarity=0.057 Sum_probs=78.6 Q ss_pred ccCcchhhhhHHH-HHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcc Q lcl|NC_019418. 265 PLGLSIFDNAKTT-IDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGI 343 (527) Q Consensus 265 plG~S~~~~~~~l-id~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i 343 (527) -|.+..++++.+- -.++.+++..+. . .+.++.++..+ ++...+ T Consensus 1 V~k~~~l~~~~~~~~~~~~~r~~~~~-~----------------------------------~~~~~~~~~ld-~~~e~~ 44 (201) T protein:vir:10 1 MWKAKGLADLCDDSDGAARLRLAQVD-N----------------------------------NSGVGQAIGID-ADSEEY 44 (201) T ss_pred CccchHHHHHhcCChHHHHHHHHHHH-H----------------------------------hhhhhhhheee-cCCcce Confidence 1112222211110 011111111110 0 11111111111 111124 Q ss_pred eEeccccChHHHHHHHHHHHHHHHHhcCCCcc-ccccccccc-chHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH Q lcl|NC_019418. 344 VDLTTPIRSSDYISAISEGLKLFEMQIGVSSG-MFTFDGQGV-KTATEIVSENSDTYQMRNSIV-ALVEQSIKELCVSMC 420 (527) Q Consensus 344 ~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~-~~~~~~~g~-~TAtei~s~~~~~~~~~~~~~-~~~~~al~~li~~il 420 (527) +.++.++ .-....+.....+|+..+|++.. -||...+|. .|+..-... .|..++.+| ..++.+|+.|+..+. T Consensus 45 e~~~~~l--sGl~d~l~~~~~~iaa~s~iP~t~LfG~sp~Glnatge~d~~n---yyd~i~~~Qe~~l~p~le~l~~~~~ 119 (201) T protein:vir:10 45 NVLNSDI--GGIDTFLSQKFDRIVALSGIHEIILKGKNVGGVSASQNTALET---FYGYVDRKRKAELLPLLEFLLPFIV 119 (201) T ss_pred eeeecCc--CChHHHHHHHHHHHHhHhcCchhhhcCCCCccccccchhHHHH---HHHHHHHHHHHHHHHHHHHHHHhhc Confidence 4443332 23445566666777888888754 367776665 344433333 344444444 456777777665322 Q ss_pred HHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCC-CCHHHHHHHHHHHHHhccccc Q lcl|NC_019418. 421 ELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLG-ITEEEAEKELAEINGELPPES 499 (527) Q Consensus 421 ~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~-~~deea~~el~ri~~E~~~~~ 499 (527) . ..+++|.|+.-...+..+.++...+...+ ...+ ... | ++.+|++++|.+.-..... T Consensus 120 ------------~--~~~~~~~f~pL~~~s~kekAei~~~~a~a----~~~~-~~~-g~i~~~e~r~~L~~~~~~~~~-- 177 (201) T protein:vir:10 120 ------------T--EQEWSVEFNPLSQVSDKDKSEILEKNVNS----VAAL-IAA-GIIDADEARDTLRAISTEVKI-- 177 (201) T ss_pred ------------C--CCCceEeeCCCCCCCHHHHHHHHHHHHHH----HHHH-HHc-CCCCHHHHHHHHHhcCCcCCC-- Confidence 0 23688999998888877666554433211 1111 112 3 4566666665542111100 Q ss_pred ccccCCCCCCCCCCCCCCCCCCCcc Q lcl|NC_019418. 500 DAELALYGKGQQNTVGNSKDTVDDE 524 (527) Q Consensus 500 ~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (527) .......+...++..++.+.+.++ T Consensus 178 -~~~~~~~~~~~~e~~dp~~~~~~~ 201 (201) T protein:vir:10 178 -GEGSIQTEVVINESEDPLDVSANN 201 (201) T ss_pred -CCCCCCccccccccCCCCCCCCCC Confidence 000111111111111111222222 No 233 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=94.51 E-value=0.0042 Score=33.61 Aligned_cols=463 Identities=13% Similarity=0.156 Sum_probs=188.1 Q ss_pred HHHhhcccchhhhccCccc-cCHHHHHHHHHH-HHHhcCCCcccccccccCcc--c-----cCceeec---chHHHHHHH Q lcl|NC_019418. 15 RYNMTTSHLSSILDHPKVA-VTQSEFRRIQHN-LAYYQSKFDDIEYTNTDGDR--K-----RRKMQHL---PIARTAAKK 82 (527) Q Consensus 15 ~~~~~~~~~~~~~~~~~i~-~~~~~~~~i~~~-~~~y~g~~~~l~~~~~~~~~--~-----~~~~~~l---nl~~~i~~~ 82 (527) |..+|.=++.+.-...... +++.+..-...- ..|| |.. ....|.. + .+.+.+| +--...++. T Consensus 1 m~~lfgf~i~~~~~~~~~S~vpp~~~~~~~~i~~g~~-g~~-----v~~~g~~~~~n~~eLI~~YR~ma~~pEVd~Av~e 74 (564) T protein:vir:10 1 MSQLFGFLINEKEGQKGQSPVPPNDEASVSTVAGGYF-GTY-----VDTSGGQNSRNEYELIRRYRDMSLHPEVDSAIDE 74 (564) T ss_pred CcchhcceeeeeccCCCCCcccCCcCCChhhhhcccc-cee-----eecccccchhhHHHHHHHHHHHhhccchhhHHHH Confidence 3333332222211111111 111111111100 0111 100 0001100 0 0011111 111111111 Q ss_pred Hhh-hhhc----ccceEeeCCH--------HHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCe-----eEE Q lcl|NC_019418. 83 IAS-LVYN----EQAEISAEDE--------TLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDK-----IRV 144 (527) Q Consensus 83 ~A~-ll~~----e~~~i~~~d~--------~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~-----~~i 144 (527) ..+ .++. .|+++.+++. ...+.++.++.=-+|.++..+.+....+-|..+|+..+|.++ ..+ T Consensus 75 IVneaIv~d~~~~pV~vdL~~~~~s~siK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~eL 154 (564) T protein:vir:10 75 IVNEFVVNDGDDKPVEVDLQNLEIGSGVKKKIRDEFNRILRMMNFNVNAHEIIRNWYVDGRSHYHKVIDLDNPKKGILEL 154 (564) T ss_pred hhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeeCCChhhhhhhh Confidence 111 1222 2445555432 245566677776789999999999999999999999998543 357 Q ss_pred EEEcCCceEEEEEc---CCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccc Q lcl|NC_019418. 145 AFIQAPVFLPLQSN---TQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQ 221 (527) Q Consensus 145 ~~v~a~~~~P~~~d---~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~ 221 (527) .+++|-++=++... .+-....++ +.......|....|+..+ .-+-|.+..... T Consensus 155 r~lDPr~i~~vr~i~~~~~~~~~~v~-----k~~~~~~~y~~~~Eyy~Y-------------------np~~~~g~~~~~ 210 (564) T protein:vir:10 155 RYIDSLKIRKVRQKLKDVDPNRKEIE-----KGTALQYDYGDFIEYYIY-------------------NPKGFAGNIPMV 210 (564) T ss_pred hhhcccceeeeeeeccccccccceee-----eeeeeeccccccccceee-------------------ccccccCccccc Confidence 88899877766421 110001110 000000011111111110 000011100001 Q ss_pred cCceeecccccCCcccceeecCCCcc--cEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHH--HHHHcCcc Q lcl|NC_019418. 222 LGERVNLSELYPDLQPVTPIQGLSRP--LFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFM--WEIKMGQR 297 (527) Q Consensus 222 lG~~v~l~~~~~~l~~~~~~~g~~~p--~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~--~e~~~~~~ 297 (527) .|.. .+... +++..| .++|..--.. +.....=+|-+..|.-.+..|=..-+.++ |-.|+-.+ T Consensus 211 ~~~~-----~~~~~------~~ikI~~daI~y~hSGL~---d~~~~~i~gyLhkAIKp~NQLkmlEDAlVIYRitRAPeR 276 (564) T protein:vir:10 211 TGSM-----DWSNQ------EGIKIASDAIAQSTSGLM---DLNKKMTLSFLHKAIKSLNQLRMIEDSLVIYRLSRAPER 276 (564) T ss_pred cccc-----ccccc------cceeechhhcceecccce---eCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccc Confidence 1100 00000 001000 1111110000 00111123445566555555544444433 44456667 Q ss_pred eeeec----------hhHh---------cCCCCCCCcccccccccccc-cceeeeccCCCCC-CCcceEeccccChHHHH Q lcl|NC_019418. 298 RVIVP----------EQMT---------QLKVQDNQGNIAFKRRFDVE-QNVYMQVGAGNMD-SGGIVDLTTPIRSSDYI 356 (527) Q Consensus 298 ~i~v~----------~~~l---------~~~~~~~~~~~~~~~~~d~~-~~~~~~~~~~~~~-~~~i~~~~~~ir~e~~~ 356 (527) |||-- +.++ +..-|..+|++.-.+.+-.- ...+.+- -+|+ ..-|+++..-=...+ + T Consensus 277 RvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGevrddrk~msMlEDyWLPR--ReGgrgTEItTLpGgqnLge-m 353 (564) T protein:vir:10 277 RIFYIDVGNLPKVKAEQYLRDVMSRYRNKLVYDGQTGEIRDDKKHMSMLEDFWLPR--REGGRGTEITTLPGGQNLGE-L 353 (564) T ss_pred eEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceecccchhhhhHhhhcccc--cCCCcccceeeccccCCcch-H Confidence 77641 1111 01123445544333222110 0001110 1122 123555554322332 3 Q ss_pred HHHHHHHHHHHHhcCCCcccccccccc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCccc Q lcl|NC_019418. 357 SAISEGLKLFEMQIGVSSGMFTFDGQG--VKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIP 434 (527) Q Consensus 357 ~~~~~~l~~i~~~~g~s~~~~~~~~~g--~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~ 434 (527) ..+..+.+.+....+++..-+..+++| .--++||.-..-.....+.+.+..|..-+.++++.=|.|-.. +...-+ T Consensus 354 ~DV~YF~kKLY~aLnVP~SRl~~e~~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgi---it~eeW 430 (564) T protein:vir:10 354 KDVEYFKKKLYNSLNLPPSRLTDDNKAFNLGKSTEILRDELKFTKFIGRLRKRFAQLFHDILKTQLILKGI---ITPEDW 430 (564) T ss_pred HHHHHHHHHHHHHhCCCcccccCCCceeecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccC---CCHHHH Confidence 456666677777777777766655322 112456766666666778888888888888888876655332 211111 Q ss_pred Cc--cceEEEeCCCccCCHHHHHHHHHH---HH-hc----C-CCCHHHHHHhcCCCCHHHHHHHHHHHHHhccccccccc Q lcl|NC_019418. 435 EL--DDISVNLDDGVFTDRHAELDYWMK---MV-AA----G-FATQKRGIAKTLGITEEEAEKELAEINGELPPESDAEL 503 (527) Q Consensus 435 ~~--~~v~v~f~d~i~~d~~~~~~~~~~---~~-~a----G-i~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~ 503 (527) .. ..|.++|...=-..+..+++.... +. ++ | ..|.+++.++....||+|.+++..+|++|....--..| T Consensus 431 ~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P 510 (564) T protein:vir:10 431 DDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVGKYFSTEYIRRKILMQTENEFKEIDKQMKSDIESGLAIDP 510 (564) T ss_pred HHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCc Confidence 11 346677754333333333332221 11 11 2 36888888888899999999999999988742111111 Q ss_pred -------CCC--CCC-CCCCCCC-CCCCCCccccC Q lcl|NC_019418. 504 -------ALY--GKG-QQNTVGN-SKDTVDDEDEA 527 (527) Q Consensus 504 -------~~~--~~~-~~~~~~~-~~~~~~~~~~~ 527 (527) ++. +.. ++...+- ++.....+.+. T Consensus 511 ~e~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~ 545 (564) T protein:vir:10 511 IQVNMLDDMEKQNQAFAPELQAAQDDLAAEREIKK 545 (564) T ss_pred hhhhcCCCccCCCCcCCcchhhhccccccccChhh Confidence 110 100 0000000 00000111110 No 234 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=94.51 E-value=0.0042 Score=33.60 Aligned_cols=313 Identities=9% Similarity=0.019 Sum_probs=109.2 Q ss_pred eEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceee Q lcl|NC_019418. 162 VSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPI 241 (527) Q Consensus 162 ~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~ 241 (527) +.+.+ |+ ..+ +......|.+... ..-.......+ ...+.-+.. ..+.. T Consensus 1 v~Eiv--w~---~~~-g~~~~~~l~~r~~---~~~~~f~~~~~-~~l~~~~~~-----------------~~~g~----- 48 (355) T protein:vir:78 1 MFEQV--YR---IEN-GRARLGKLAWRPP---RTISRFDVAPD-GGLVAIEQW-----------------GVFGK----- 48 (355) T ss_pred CeEEE--EE---eeC-CeEEEeeeeecCc---cceeeeeeccC-CceeEEEec-----------------CCCCC----- Confidence 22222 21 111 1111111222110 00000000011 111110000 00000 Q ss_pred cCCCccc--EEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHc---CcceeeechhHhcCCCCCCCcc Q lcl|NC_019418. 242 QGLSRPL--FTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKM---GQRRVIVPEQMTQLKVQDNQGN 316 (527) Q Consensus 242 ~g~~~p~--f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~---~~~~i~v~~~~l~~~~~~~~~~ 316 (527) .++.-|. |..++. ....++|+|.|.+..|.-..--=...+..|+.=++. +-+-...|...-....+.... T Consensus 49 ~~~~lp~~kfi~~~~----~~~~g~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~- 123 (355) T protein:vir:78 49 ATVRIPVDRLVVFVN----EREGANWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARA- 123 (355) T ss_pred CcceeccCCEEEEEe----CCCCCCccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhH- Confidence 0111111 232222 123467999999998877554444444444433331 222222222110000000000 Q ss_pred cccccccccccc----eeeeccCCC------CCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccc---cccc Q lcl|NC_019418. 317 IAFKRRFDVEQN----VYMQVGAGN------MDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTF---DGQG 383 (527) Q Consensus 317 ~~~~~~~d~~~~----~~~~~~~~~------~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~---~~~g 383 (527) .+.....+. +...+.++. .....|+.++.......|...++.+=++|+... ++. |++. +++| T Consensus 124 ---~~~~~~~~~~l~~~~~~i~~g~~a~~iip~g~~ie~~ea~g~~~~~~~~i~~~d~~Isk~i-LGq-tlTs~~~~~gG 198 (355) T protein:vir:78 124 ---EQWLNDQKEEGLQLAKEFRAGEAAGGYIPHGANFTLTGVQGKLPEMDGPIRYHDEQIARAV-LAH-FLTLGGDKSTG 198 (355) T ss_pred ---HHHHHHHHHHHHHHHHHhhCCcceeEeecCCceEEEeecCCCcccHHHHHHHHHHHHHHHH-hhh-hhccccCCccc Confidence 000000000 000000000 011234555444444445555555555554433 332 2221 1222 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHH Q lcl|NC_019418. 384 VKTATEIVSENSDTYQMRNSIVALVEQSIK-ELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMV 462 (527) Q Consensus 384 ~~TAtei~s~~~~~~~~~~~~~~~~~~al~-~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~ 462 (527) ...+.++ -+.-....++.-.+.+...|. +|++.++.+ ++.. ...-+.+.|+. +..+..+.++.+.+++ T Consensus 199 S~Alg~v--h~~v~~~~~~aD~~~i~~~ln~~li~~l~~l----N~~~----~~~~P~~~~~~-~~~~~~~~a~~~~~l~ 267 (355) T protein:vir:78 199 SYALGDT--FASFFTGSLNAVMKHIADVTQQHVVEDLVDQ----NWGP----EEPAPRLVPAQ-LGKEQPVTAEAIRALV 267 (355) T ss_pred hhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCCC----CCCCCEEEecC-cChhHHHHHHHHHHHH Confidence 2112223 112223334445566777774 688877654 2211 12234577764 5566667788899999 Q ss_pred hcCCCCH----HHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 463 AAGFATQ----KRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 463 ~aGi~s~----~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) ..|+... ++++.+.+|+++.+..++...-..+..+.........+.....+......+..+.++- T Consensus 268 ~~G~~~~~~~~~~~~~e~~gip~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~a~~~~~~ 336 (355) T protein:vir:78 268 ECGAFTADPELEKDLRARYGLPAPAERDDGADAAAAKAAGRRRAKRLPGQRQGAALPSRSPRADPPRRR 336 (355) T ss_pred hCCCccccHHHHHHHHHHhCCCCCCCCCcccCCccccccccccccccCCccccccccccCCCCCChhhh Confidence 9998554 4578888888643211111111111111111111111111110100000011111111 No 235 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=94.14 E-value=0.0053 Score=33.08 Aligned_cols=430 Identities=13% Similarity=0.144 Sum_probs=190.6 Q ss_pred CChHHHHHHHHHHHHH---Hhhcccc----hhhh-cc-----CccccCH---------------------HHHHHHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRY---NMTTSHL----SSIL-DH-----PKVAVTQ---------------------SEFRRIQHNL 46 (527) Q Consensus 1 m~~~~~~k~~~~~~~~---~~~~~~~----~~~~-~~-----~~i~~~~---------------------~~~~~i~~~~ 46 (527) |--|.-|-.+|+-... .-..+.+ .+.+ ++ ..|..+. ....-|++++ T Consensus 1 ~~~~~~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR 80 (524) T protein:vir:10 1 MANFNTILSFLKPWANEDEKEYKQQINNNLESVTAPKLDDGAREIETQEQNIPYNALMQQMFGSNEPEVKNTRELIDTYR 80 (524) T ss_pred CCchhhHHHHhhhhhcchhhhhhhhhccCCCccccCCCCCCceeeccCcccccchhhhhhhhhcccchhhhHHHHHHHHH Confidence 6555555555542211 0000111 0111 11 0111110 1122222222 Q ss_pred HHhcCCCcccccccccCccccCceeecchHHHHHHHHhh-hhh----cccceEeeCCHH--------HHHHHHHHHhhhh Q lcl|NC_019418. 47 AYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIAS-LVY----NEQAEISAEDET--------LNDFLSDMLSNDR 113 (527) Q Consensus 47 ~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~-ll~----~e~~~i~~~d~~--------~~~~l~~~l~~n~ 113 (527) .+... |.+ ...++...+ -+. ..|+++.+++.. ..+.++.++.--+ T Consensus 81 ~ma~~--pEv--------------------d~Av~eIVneaiv~d~~~~pV~l~Ld~~~~s~siK~kI~eeF~~Il~ll~ 138 (524) T protein:vir:10 81 NLMNN--YEV--------------------DNAVQEIVSDAIVYEDDKEVVALNLDGTDFSQSIKDKILAEFSEVLNLLN 138 (524) T ss_pred HHhhc--cch--------------------hhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhc Confidence 22111 111 112222222 111 124555555432 4556667777678 Q ss_pred HHHHHHHHHHHHHhcCCEEEEEEEeCCe-----eEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceE-EEE-EE Q lcl|NC_019418. 114 FNKNFERYLESALALGGLAMRPYVDGDK-----IRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVY-YTL-VE 186 (527) Q Consensus 114 f~~~~~~~~~~a~~~G~~~~~~~~d~~~-----~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~-yt~-lE 186 (527) |.++..+.+....+-|..+|+..+|.++ ..+.+++|.++-++. .+..+...+++ ++- .| T Consensus 139 F~~~~~~~fR~WYVDgRi~fHkiid~~~pk~GI~Elr~lDPr~i~~vr--------------~i~~~~~~~~~vi~~~~e 204 (524) T protein:vir:10 139 FQRKGTDHFQRWYVDSRIFFHKIINPKKMKDGVQELRRLDPRQVQYIR--------------EIVTRMEDGVKIVDGYRE 204 (524) T ss_pred cchhhhHHHhhheeeceEEEEEEeeCCCccccceeeeeeCCccceeee--------------eecccCcccchhhcchhh Confidence 9999999999999999999999998533 457888888765542 12222222111 110 01 Q ss_pred EEeecccccccceeeecCCceEEEEE-EEecCCccccCceeecccccCCcccceee--cCCCcccEEEecCCccccccCC Q lcl|NC_019418. 187 FHEWVTPTGQEVGSTKDKSLYRITNE-LYKSTSDSQLGERVNLSELYPDLQPVTPI--QGLSRPLFTYLKTPGMNNKDIN 263 (527) Q Consensus 187 ~h~~~~~~~~~~~~~~~~~~~~I~n~-ly~~~~~~~lG~~v~l~~~~~~l~~~~~~--~g~~~p~f~~~~~~~~N~~~~~ 263 (527) +..+.. +....+.+. .| ..+..|-+ ++ ..+++ +|+ ++. |.. T Consensus 205 ~f~Y~~-----------~~~~~~~~~~~~------~~~~~ikI---~~---dAIvy~~SGL-------~d~---~~~--- 248 (524) T protein:vir:10 205 FFVYDT-----------GHESYCADGRIY------SAGTKVKI---PR---AAVVYAHSGL-------LDC---CGK--- 248 (524) T ss_pred heeecC-----------CCcccccCccee------cCCcceec---ch---hheeeeccCc-------ccC---CCC--- Confidence 100000 000000000 00 01111100 00 01111 111 111 100 Q ss_pred CccCcchhhhhHHHHHHHHHHHHHHH--HHHHcCcceeeec----------hhHh-------c--CCCCCCCcccccccc Q lcl|NC_019418. 264 SPLGLSIFDNAKTTIDFINRTYDEFM--WEIKMGQRRVIVP----------EQMT-------Q--LKVQDNQGNIAFKRR 322 (527) Q Consensus 264 splG~S~~~~~~~lid~ld~~~s~~~--~e~~~~~~~i~v~----------~~~l-------~--~~~~~~~~~~~~~~~ 322 (527) .=+|-+..|.-.+..|=-.-+.++ +-.|+-.+|||-- +.++ + ..-|..+|++.-.+. T Consensus 249 --~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnlPk~KAeqYl~~im~k~kNKlvYDa~TGev~ddrk 326 (524) T protein:vir:10 249 --NIIGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRVFYIDTGNMPSRKAAAQMQHIMNTMKNRVVYDASTGKIKNQQH 326 (524) T ss_pred --ceeccchHhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeccCCeeccchh Confidence 013445566555555544444443 4446666777641 0111 0 011334444322221 Q ss_pred cccc-cceeeeccCCCCC-CCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccc-cc--cchHHHHHHHHHHH Q lcl|NC_019418. 323 FDVE-QNVYMQVGAGNMD-SGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDG-QG--VKTATEIVSENSDT 397 (527) Q Consensus 323 ~d~~-~~~~~~~~~~~~~-~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~-~g--~~TAtei~s~~~~~ 397 (527) +-.- ...+.+- -+|+ ..-|+++..-=...+ ...+..+.+.+....+++..-+..++ ++ .--++||.-..-.. T Consensus 327 ~msMlEDyWLpR--ReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~~f~~gr~~EItRDEiKF 403 (524) T protein:vir:10 327 NMSMTEDYWLQR--RDGKAVTEVDTMPGATGMSD-MDDVLYFRTALYRALRIPESRIPSESNSGVMFDAGTAITRDELKF 403 (524) T ss_pred hhhhHhhhcccc--cCCCCccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCchhccCCCCccccccccchhhHHHHHH Confidence 1100 0001110 1122 123555554333333 45566667777777777777664332 11 22366776666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCc--cceEEEeCCCccCCHHHHHHHHHH---HH--hcC----C Q lcl|NC_019418. 398 YQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPEL--DDISVNLDDGVFTDRHAELDYWMK---MV--AAG----F 466 (527) Q Consensus 398 ~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~--~~v~v~f~d~i~~d~~~~~~~~~~---~~--~aG----i 466 (527) ...+.+.+..|..-+.++++.=|.|-.. +...-+.. ..|.++|...=-..+..+++.... +. ..+ . T Consensus 404 ~KFI~rLR~rFs~lf~~~L~~qLilKgi---it~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky 480 (524) T protein:vir:10 404 AKWIRQLQNKFEEIFLDPLKTNLILKKI---ITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRINMLTMAEPFIGKY 480 (524) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhccC---CCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 7778888888888888888876655332 21111111 346677754333333333332221 11 112 3 Q ss_pred CCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCC Q lcl|NC_019418. 467 ATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSK 518 (527) Q Consensus 467 ~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 518 (527) .|.+++.++....||+|.+++..+|++|... +.+.++++. .++- T Consensus 481 ~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~------~~~~~~~~~--~~~f 524 (524) T protein:vir:10 481 ISHQTAMKDFLQMTDEEINQEAKQIEEESKE------ARFQNPDEE--EEDF 524 (524) T ss_pred chhHHHHHHHhccCHHHHHHHHHHHHHHhhc------CCCCCCChh--hhcC Confidence 5888877888899999999999999998642 112111111 1111 No 236 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=93.82 E-value=0.0063 Score=32.66 Aligned_cols=416 Identities=14% Similarity=0.078 Sum_probs=159.9 Q ss_pred CChHHHHHHHHH-HHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFN-RGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~-~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i 79 (527) |+|+ .-..|+- .+...| +-.+++... +-=++... T Consensus 102 l~~~-~~~~F~Gy~~la~l--------------aQ~~eyr~~------------------------------~~~ia~e~ 136 (695) T protein:vir:78 102 LSFV-TSSGFPGFPTLVLL--------------AQLPEYRAM------------------------------HEVLADEC 136 (695) T ss_pred chhh-hccCcchHHHHHHH--------------hhccchhhH------------------------------HHHHHHHh Confidence 3221 0111111 000000 000111111 11123333 Q ss_pred HHHHhhhhhcccce-----Ee-------eCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEE Q lcl|NC_019418. 80 AKKIASLVYNEQAE-----IS-------AEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVAFI 147 (527) Q Consensus 80 ~~~~A~ll~~e~~~-----i~-------~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v 147 (527) +++|-..+.++... ++ ..+...-+.|+.-+++-+.+..++++++.+-.+|++++.+-++++.... T Consensus 137 ~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l--- 213 (695) T protein:vir:78 137 IRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIM--- 213 (695) T ss_pred hcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeccCcccc--- Confidence 33332222221111 11 1233455677777788889999999999999999999777776532100 Q ss_pred cCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeeccccccccee-----eecCCceEEEEEEEecCCcccc Q lcl|NC_019418. 148 QAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGS-----TKDKSLYRITNELYKSTSDSQL 222 (527) Q Consensus 148 ~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~-----~~~~~~~~I~n~ly~~~~~~~l 222 (527) + -|+.-....+..+. -+..+.++++- .......... -...+.|+| . T Consensus 214 ~----~PL~~~~~~I~kGs------------lKGl~ViDp~~-vtP~~~n~~dP~spdfgkP~~y~V------------~ 264 (695) T protein:vir:78 214 D----TPLVPRPYTVPKGS------------FQGLRVVEPYW-VTPNNYNSINPVADDFYKPSTWWM------------I 264 (695) T ss_pred c----cccccccccccCcc------------eeeeEeecccc-cccchhhhccchhhccCCCceEEE------------e Confidence 0 01100000000000 00111112110 0000000000 000111111 1 Q ss_pred CceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeec Q lcl|NC_019418. 223 GERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVP 302 (527) Q Consensus 223 G~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~ 302 (527) |+.|.-+.+ ..+.| +|+.-.+|+. ..-+|+|....+.+-+++.+++-.....=+..-+-.++ - T Consensus 265 G~kIH~SRL-------~~f~g--~plPd~LKp~-------y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~l-k 327 (695) T protein:vir:78 265 GTEVHATRL-------HTIVS--RPVGDMLKPT-------YSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGI-L 327 (695) T ss_pred ceEEeeeeE-------EEecC--CCchhhhhcc-------cccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhhHHH-H Confidence 222111110 11222 1221122221 23469999999999999998776555543321111111 1 Q ss_pred hhHhcCCCCCCCcccccccccc---cccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccc-cc Q lcl|NC_019418. 303 EQMTQLKVQDNQGNIAFKRRFD---VEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGM-FT 378 (527) Q Consensus 303 ~~~l~~~~~~~~~~~~~~~~~d---~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~-~~ 378 (527) .+|.....++ +......++. .-+..+..+-++ ++...+++++ +...-..+.+..+..+++..++++... || T Consensus 328 ~dla~~L~~g--~~~~l~~R~eli~~~Rsn~G~~llD-k~~Eefeq~s--tslSGLddVi~qf~q~VAgaa~IPltkLfG 402 (695) T protein:vir:78 328 MDLAQALMPG--ANVDLSMRAELINRYRDNRNILFLD-KATEEFFQFN--TPLSGLDALQAQAQEQMSAVSHIPLIKLLG 402 (695) T ss_pred HHHHHhhcCh--hHHHHHHHHHHHHHhcCccceEEEe-cCCcceEEEe--cccCCHHHHHHHHHHHHHhhhcCchhhhhc Confidence 1222111122 1111111111 112222211122 2222344443 344455566666677777788887654 68 Q ss_pred cccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHH- Q lcl|NC_019418. 379 FDGQGV-KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELD- 456 (527) Q Consensus 379 ~~~~g~-~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~- 456 (527) ...+|. +|+..=...+.+...... +..++.+|+.|+.+|..- .++.. .+++++.|+.--..++.+.++ T Consensus 403 qSPkGlNATGE~D~rnYYD~I~s~Q--e~~L~p~L~rl~~ii~rS-----~~G~i---dpdi~~~fnPL~qmtd~EkAeI 472 (695) T protein:vir:78 403 ITPTGLNASSEGEIRVWYDYVRAYQ--RNALQQLMNDVIVMIQLS-----LFGAV---DPSIKWQWNALRELDDLEVAES 472 (695) T ss_pred cCCccccccchhhHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH-----hcCCC---CCcceEEeCCCCCcCHHHHHHH Confidence 777775 666633333333333332 456788888887766432 12222 235888998655555444333 Q ss_pred ------HHHHHHhcCCCCHHHHHHhcCC-----C-------------CHHHHHHHHHHHHHhcccccccccC----CCCC Q lcl|NC_019418. 457 ------YWMKMVAAGFATQKRGIAKTLG-----I-------------TEEEAEKELAEINGELPPESDAELA----LYGK 508 (527) Q Consensus 457 ------~~~~~~~aGi~s~~~~i~~~~~-----~-------------~deea~~el~ri~~E~~~~~~~~~~----~~~~ 508 (527) .+..++.+|+++..+...++-. . +|++..-++.--+. .++.++..+ -.+. T Consensus 473 ~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~g~ 550 (695) T protein:vir:78 473 RYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQR--LAEGGDTGAPGGARAGA 550 (695) T ss_pred HhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcC--cccccccCCCCCCCCCC Confidence 3345556777777775554311 1 00111111000000 000000000 0000 Q ss_pred CCCCCCCCCCCCCCccccC Q lcl|NC_019418. 509 GQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 509 ~~~~~~~~~~~~~~~~~~~ 527 (527) ..+..+.+..-+...++-+ T Consensus 551 ~~~~~~~~~~~~~~~~~ag 569 (695) T protein:vir:78 551 TAPPTVANVNANVKPREAG 569 (695) T ss_pred CCCCceeeeeccccccccC Confidence 1111111111222111111 No 237 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=93.71 E-value=0.0066 Score=32.52 Aligned_cols=391 Identities=12% Similarity=0.008 Sum_probs=153.8 Q ss_pred hh-cccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHHHHhhhhhcccceEee Q lcl|NC_019418. 18 MT-TSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIASLVYNEQAEISA 96 (527) Q Consensus 18 ~~-~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~ 96 (527) || .+....-. .+....+. -|...+.|-.+...-..+.. ..-+...--...++.+|+-+-+-|..+-- T Consensus 1 m~~~~~~~~~~-~~~~~~~~-------~~~~~~~g~~~s~~~~~v~~----~~al~~~~v~~cv~~ia~~ia~lp~~~~~ 68 (419) T protein:vir:80 1 MFFSRQLLSNL-GQTQPGSG-------GWVSALLGSARSEAGQVVTP----ASALSLTVLQNCVTLLAESIAQLPVELYE 68 (419) T ss_pred CCccccccccc-CcCCCCcc-------hhhHHhhcccccccCcccCh----HHhhccHHHHHHHHHHHHhhccCceEEEE Confidence 22 22111111 11111111 22222222111100000000 01111122234455555555444443311 Q ss_pred ---CCHH--HHHHHHHHHhh-----hhHHHHHHHHHHHHHhcCCEEEEEEEeCC-ee-EEEEEcCCceEEEEEcCCceEE Q lcl|NC_019418. 97 ---EDET--LNDFLSDMLSN-----DRFNKNFERYLESALALGGLAMRPYVDGD-KI-RVAFIQAPVFLPLQSNTQDVSS 164 (527) Q Consensus 97 ---~d~~--~~~~l~~~l~~-----n~f~~~~~~~~~~a~~~G~~~~~~~~d~~-~~-~i~~v~a~~~~P~~~d~~~~~~ 164 (527) ++.. .+..+..+|.. -......+..+...+..|.+++.+..+.. ++ .+-.++|+.+-+.. +.+ T Consensus 69 ~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~i~~~~v~i~~-~~~---- 143 (419) T protein:vir:80 69 RSGDDRKPATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLYPLDNEAVTVMK-GPD---- 143 (419) T ss_pred ecCCCcccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEE-CCC---- Confidence 1110 11123333321 11223334556677888999888877653 32 35556666554321 111 Q ss_pred EEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCC Q lcl|NC_019418. 165 AAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGL 244 (527) Q Consensus 165 ~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~ 244 (527) ...+|.. .+ .. .+ +.. T Consensus 144 -------------~~~~y~~------------------------------~~--~~----~~---------~~~------ 159 (419) T protein:vir:80 144 -------------LKPMYRV------------------------------AG--AD----PL---------PQR------ 159 (419) T ss_pred -------------ceEEEEE------------------------------cC--cc----cc---------chh------ Confidence 1111210 00 00 00 000 Q ss_pred CcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCc-ceeeechhHhcCCCCCCC--ccccccc Q lcl|NC_019418. 245 SRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQ-RRVIVPEQMTQLKVQDNQ--GNIAFKR 321 (527) Q Consensus 245 ~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~-~~i~v~~~~l~~~~~~~~--~~~~~~~ 321 (527) .+.|++.+. .+..+|+|.+.-+...|+-....-.-..+-|..|. +.-++ ....+..+ ......+ T Consensus 160 ---~i~h~~~~~-----~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil-----~~~~~~~~~~~~~~~~~ 226 (419) T protein:vir:80 160 ---LVHHVRWMS-----INGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVI-----ERPTDAPALKDQASVDR 226 (419) T ss_pred ---heEEecCCC-----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEE-----EecCCCCcccCHHHHHH Confidence 123444321 12357999888777766544433322223345533 33222 22111111 0000000 Q ss_pred ccccccceeeecc-CC----CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccccccc-chHHHHHHHHH Q lcl|NC_019418. 322 RFDVEQNVYMQVG-AG----NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGV-KTATEIVSENS 395 (527) Q Consensus 322 ~~d~~~~~~~~~~-~~----~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~-~TAtei~s~~~ 395 (527) ..+.-...|.+.+ .+ -.++..++.++......++.+..+....+|+...|++|..+|...++. .++.+.. T Consensus 227 ~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~---- 302 (419) T protein:vir:80 227 ITDGWNAKFGGSGNAKKVALLQEGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQS---- 302 (419) T ss_pred HHHHHHHHhcCccccCCceecCCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHH---- Confidence 0000001111110 00 011223555555556677888888888899999999999998654432 2222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHh Q lcl|NC_019418. 396 DTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAK 475 (527) Q Consensus 396 ~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~ 475 (527) +..++.+|.-++..|-...+. .++.......+.+.++++.-+..|..+.++...+++.+|+|++-+++.. T Consensus 303 ---------~~f~~~~l~P~~~~ie~~l~~-kll~~~~~~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~ 372 (419) T protein:vir:80 303 ---------LQFVIYTLLPWVKRHEQAKTR-DLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRL 372 (419) T ss_pred ---------HHHHHHHHHHHHHHHHHHHhh-hccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 112233333333333221111 1111111223445666666666788899999999999999999997754 Q ss_pred cCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 476 TLGITEEE-AEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 476 ~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) . |+..-+ ..+-+ .+..-.. .+..++.+.++..+....-+|. T Consensus 373 ~-g~~p~~gGD~~~-------~~~n~~~---~~~~~~~~~~~~~~~~~~~~~~ 414 (419) T protein:vir:80 373 E-NMPPVKGGDIYL-------SPMNMVD---ASKPQPIPMGKTEPTKAALDEI 414 (419) T ss_pred h-CCCCCCCcceee-------ecccccc---ccccccccCCCCCchhhhHHHH Confidence 3 554210 10000 0000000 0011111111111111111111 No 238 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=93.49 E-value=0.0074 Score=32.27 Aligned_cols=423 Identities=14% Similarity=0.083 Sum_probs=159.0 Q ss_pred CCh----HHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchH Q lcl|NC_019418. 1 MSL----IQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIA 76 (527) Q Consensus 1 m~~----~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~ 76 (527) |.+ .+.+ .|.. ...+.=+.-=..++-.+++... +-=++ T Consensus 91 ~~~~~~~~~~l-~~~~-------~~~F~Gy~~la~laQ~~eyr~~------------------------------~~~ia 132 (694) T protein:vir:10 91 LDFNGTSMDAL-SFVT-------SSGFPGFPTLVLLAQLPEYRAM------------------------------HEVLA 132 (694) T ss_pred hccCcccccch-hhhh-------ccCcchHHHHHHHhhccchhhH------------------------------HHHHH Confidence 221 0000 1111 1110000000000000011111 11123 Q ss_pred HHHHHHHhhhhhcccce-----Ee-------eCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEE Q lcl|NC_019418. 77 RTAAKKIASLVYNEQAE-----IS-------AEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRV 144 (527) Q Consensus 77 ~~i~~~~A~ll~~e~~~-----i~-------~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i 144 (527) ...+++|-..+.++... ++ ..+...-+.|+.-+++-+.+..++++++.+-.+|++++.+-++++.... T Consensus 133 ~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eaik~aRlfGGa~~~i~I~gdd~~l 212 (694) T protein:vir:10 133 DECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIM 212 (694) T ss_pred HHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeecCcccc Confidence 33333332222221111 11 1233455677777788889999999999999999999777776532100 Q ss_pred EEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeeccccccccee-----eecCCceEEEEEEEecCCc Q lcl|NC_019418. 145 AFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGS-----TKDKSLYRITNELYKSTSD 219 (527) Q Consensus 145 ~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~-----~~~~~~~~I~n~ly~~~~~ 219 (527) + -|+.-....+..+. -+..+.++++- .......... -...+.|+| T Consensus 213 ---~----~PL~~~~~~I~kGs------------lKGl~ViDp~~-vtP~~~n~~dP~spdfgkP~~y~V---------- 262 (694) T protein:vir:10 213 ---D----TPLVPRPYTVPKGS------------FQGLRVVEPYW-VTPNNYNSINPVADDFYKPSTWWM---------- 262 (694) T ss_pred ---c----cccccccccccCcc------------eeeeEeecccc-cccchhhhccchhhccCCCceEEE---------- Confidence 0 01100000000000 00111112110 0000000000 000111111 Q ss_pred cccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCccee Q lcl|NC_019418. 220 SQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRV 299 (527) Q Consensus 220 ~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i 299 (527) .|+.|.-+.+ ..+.| +|+.-.+|+. ..-+|+|....+.+-+++.+++-.....=+..-+-.+ T Consensus 263 --~G~~IH~SRL-------~~f~g--~plPd~LKp~-------y~~~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~ 324 (694) T protein:vir:10 263 --IGTEVHATRL-------HTIVS--RPVGDMLKPT-------YSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSG 324 (694) T ss_pred --eceEEeeeeE-------EEecC--CCchhhhhcc-------cccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhhHH Confidence 1222211110 11222 1221122221 2346999999999999999877655554332111111 Q ss_pred eechhHhcCCCCCCCcccccccccc---cccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccc Q lcl|NC_019418. 300 IVPEQMTQLKVQDNQGNIAFKRRFD---VEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGM 376 (527) Q Consensus 300 ~v~~~~l~~~~~~~~~~~~~~~~~d---~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~ 376 (527) + -.+|.....++ +.......+. .-+..+..+-++ ++...++.++ +...-..+.+..+..+++..+|++... T Consensus 325 l-k~dla~~L~~g--~~~~l~~R~eli~~~Rsn~G~~llD-k~~Eefeq~s--tslSGLddVi~qf~q~VAgaa~IPltk 398 (694) T protein:vir:10 325 I-LMDLAQALMPG--ANVDLSMRAELINRYRDNRNILFLD-KATEEFFQFN--TPLSGLDALQAQAQEQMSAVSHIPLIK 398 (694) T ss_pred H-HHHHHHhhcCh--hHHHHHHHHHHHHHhcCccceEEEe-cCCcceEEEe--cccCCHHHHHHHHHHHHHhhhcCchhh Confidence 1 11222111122 1111111111 112222211122 2222344443 344455566666677777788887654 Q ss_pred -cccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHH Q lcl|NC_019418. 377 -FTFDGQGV-KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAE 454 (527) Q Consensus 377 -~~~~~~g~-~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~ 454 (527) ||...+|. +|+..=...+.+...... +..++.+|+.|+.+|..- .++.. .+++++.|+.--..++.+. T Consensus 399 LfGqSPkGlNATGE~D~rnYYD~I~s~Q--e~~L~p~L~rl~~ii~rS-----~~G~i---dp~i~~~fnPL~qmtd~Ek 468 (694) T protein:vir:10 399 LLGITPTGLNASSEGEIRVWYDYVRAYQ--RNALQQLMNDVIVMIQLS-----LFGAV---DPSIKWQWNALRELDDLEV 468 (694) T ss_pred hhccCcccccccchhhHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH-----hcCCC---CCcceEEeCCCCCcCHHHH Confidence 68777775 666633333333333332 456788888887766432 12222 2368889986555554433 Q ss_pred H-------HHHHHHHhcCCCCHHHHHHhcCC-----C-------------CHHHHHHHHHHHHHhcccccccccCCC--- Q lcl|NC_019418. 455 L-------DYWMKMVAAGFATQKRGIAKTLG-----I-------------TEEEAEKELAEINGELPPESDAELALY--- 506 (527) Q Consensus 455 ~-------~~~~~~~~aGi~s~~~~i~~~~~-----~-------------~deea~~el~ri~~E~~~~~~~~~~~~--- 506 (527) + +.+..++.+|+++..+...++-. . +|++..-++.--+. .++.++..+.+ T Consensus 469 AeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~ 546 (694) T protein:vir:10 469 AESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQR--LAEGGDTGAPGGAR 546 (694) T ss_pred HHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcC--cccccccCCCCccc Confidence 3 33345556677777775554311 1 00011000000000 00000000000 Q ss_pred -CCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 507 -GKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 507 -~~~~~~~~~~~~~~~~~~~~~ 527 (527) +...+....+..-+.++.+-+ T Consensus 547 ~g~~~~~~v~~~~~~~~~~~ag 568 (694) T protein:vir:10 547 AGATAPPTVANVNANVNPREAG 568 (694) T ss_pred ccccCCCcccccccccCccccC Confidence 000011111111111111111 No 239 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=93.41 E-value=0.0077 Score=32.18 Aligned_cols=385 Identities=11% Similarity=0.080 Sum_probs=158.6 Q ss_pred cCccccCHHHHHHHHHHHHHhcCCCcccccccc-------cCcc--ccCceeecchHHHHHHHHhhhhhcccceEeeC-- Q lcl|NC_019418. 29 HPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNT-------DGDR--KRRKMQHLPIARTAAKKIASLVYNEQAEISAE-- 97 (527) Q Consensus 29 ~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~-------~~~~--~~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~-- 97 (527) -.+-+++.+- .-+..++.++.+-+..+...+. .|.. ...=....-+.. +..+.-.-|++-+-+|... T Consensus 1 v~~~~l~~e~-at~~~~~d~~~~~~~~l~~~~~~il~~a~~g~~~~y~~l~~D~~i~s-~l~~rk~av~~~~w~i~p~~~ 78 (488) T protein:vir:99 1 MEKPALGREI-ATSGDGRDITRPFISGLQVPNDSILQRRGGNDLRVYEEILSDAQVKT-VWGQRQLAVVSREWKVEAGGD 78 (488) T ss_pred CCccchhHHH-HHHHhhhhhhccccCCCCCCChHHHHhhccCCHHHHHHHhhChHHHH-HHHHHHHHHhcCCceEEcCCC Confidence 0111122111 1233344444332221211110 0000 000000001111 2222233455655556432 Q ss_pred ---CHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEe--CCee---EEEEEcCCceEEEEEcCCceEEEEEEE Q lcl|NC_019418. 98 ---DETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVD--GDKI---RVAFIQAPVFLPLQSNTQDVSSAAILT 169 (527) Q Consensus 98 ---d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d--~~~~---~i~~v~a~~~~P~~~d~~~~~~~a~~~ 169 (527) +....++++++|++-.|...+..++ +|..+|-+++-..|. ++.+ ++.++++..|.+ +.++. T Consensus 79 ~~~~~~~ae~v~~~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~---d~~~~------- 147 (488) T protein:vir:99 79 RPIDQAAAEHLEQQLQRVGWDRVTSKML-FGVFYGYAVSELIYGRDDRYITLEAIKVRNRRRFRY---DQDGG------- 147 (488) T ss_pred ChHHHHHHHHHHHHHhCCCHHHHHHHHH-hhhhhcceeEEEEEeecCCeeeEeeeeeecccceee---cCCCc------- Confidence 2345688999998878888887776 688899999988885 3332 233344432221 11110 Q ss_pred EEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcc-c Q lcl|NC_019418. 170 KTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRP-L 248 (527) Q Consensus 170 ~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p-~ 248 (527) .++. ..+...-|.++| .| . T Consensus 148 -------------------------------------l~~~-----~~~~~~~g~~lp------------------~~~~ 167 (488) T protein:vir:99 148 -------------------------------------LRLL-----TPNNMFEGEPCP------------------APYF 167 (488) T ss_pred -------------------------------------eEEe-----ccCCCCCccccc------------------cCce Confidence 0000 000000111111 11 1 Q ss_pred EEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHc-CcceeeechhHhcCCCCCCCccccccccccccc Q lcl|NC_019418. 249 FTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKM-GQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQ 327 (527) Q Consensus 249 f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~-~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~ 327 (527) |.+.. . ....++|+|.|.+..|....--=+..+..|+.=++. |.+-.+. +.+..+.+.+ . .+ T Consensus 168 ~i~~~-~---~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~ig-----ky~~~~a~~~--e------k~ 230 (488) T protein:vir:99 168 WHFST-G---ADNDDEPYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVG-----RYDDKTATPE--D------KA 230 (488) T ss_pred EEEEe-e---cCCCCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeee-----ecCCCCCCHH--H------HH Confidence 11111 0 112367899999998877644444444444433333 4432222 1111010000 0 01 Q ss_pred ceeeec-cCCC------CCCCcceEeccc-cChHHHHHHHHHHHHHHHHhcCCCccccc-ccccccchHHHHHHHHHHHH Q lcl|NC_019418. 328 NVYMQV-GAGN------MDSGGIVDLTTP-IRSSDYISAISEGLKLFEMQIGVSSGMFT-FDGQGVKTATEIVSENSDTY 398 (527) Q Consensus 328 ~~~~~~-~~~~------~~~~~i~~~~~~-ir~e~~~~~~~~~l~~i~~~~g~s~~~~~-~~~~g~~TAtei~s~~~~~~ 398 (527) .+...+ ++.. .....|+.++.. -..+.|...++.+-++|+... ++ +|++ .+++|-..+.++. +.-.. T Consensus 231 ~l~~av~~~~~~~~~viP~~~~ie~~ea~~~~~~~~~~li~~~d~~Isk~i-LG-qtlts~~~~Gs~a~~~vh--~~v~~ 306 (488) T protein:vir:99 231 KLLAALHAIQTDSAIIMPAGMQAELLEAGRSGTADYKTLHDTMDATIAKVG-LG-QVASTQGTPGRLGNDDLQ--ADVRL 306 (488) T ss_pred HHHHHHHHHhcCcEEEecCCceeEEeecCCCChHHHHHHHHHHHHHHHHHH-hh-hhhcccccccchhhHHHH--HHHHH Confidence 111111 0000 111235555432 223345666665555554332 22 3332 2223222122221 12233 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhc-CCCCHHHHHHhc Q lcl|NC_019418. 399 QMRNSIVALVEQSIK-ELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAA-GFATQKRGIAKT 476 (527) Q Consensus 399 ~~~~~~~~~~~~al~-~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~a-Gi~s~~~~i~~~ 476 (527) ..+..-.+.+...|. +|++.++.+ ++.+ ...+.+.|+..-++|..+.++.+.+++.. |+--.+.++.+. T Consensus 307 d~~~aDa~~i~~tln~~li~~l~~~----N~~~-----~~~p~~~~~~~e~edl~~~a~~~~~l~~~~G~~i~~~~i~e~ 377 (488) T protein:vir:99 307 DLVKADADLICESFNLGPARWLTEW----NFPG-----AQPPRVYRVIEEPEDITAKAERDEKVFRMSGFRPTRGYVQET 377 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh----CcCC-----cCCceeEecCCCcccHHHHHHHHHHHHhhcCCCCCHHHHHHH Confidence 444455666777884 688877765 2211 11245667777778888889999999885 886667778899 Q ss_pred CCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 477 LGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 477 ~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) +|++.++-.+.+. .+ .....+.+....+ +..+...++.+. T Consensus 378 ~Gip~~~~~~~~~------~~---~~~~~~~~~~~~~--~~~~~~~~~~~~ 417 (488) T protein:vir:99 378 YGVEVESTQAEAT------AP---TPSTEFAEGDQPS--DPAAAMAPQLAE 417 (488) T ss_pred cCCCCcccccccc------cC---CCcccCCCCCCCC--CchHHHHHHHHH Confidence 9987643211110 00 0000011100000 000000000000 No 240 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=92.71 E-value=0.01 Score=31.48 Aligned_cols=345 Identities=15% Similarity=0.104 Sum_probs=114.9 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) ||||.+++.|-+.-. ............ ..+.... -....++ T Consensus 1 Mg~f~~~~~f~~~~~-~~~~~~~~~~~~-~~~~~~~-------------------------------------~~v~~~i 41 (378) T protein:vir:93 1 MNLFGKVVSFSRGKL-NNDTQRVTAWQN-EAVEYTS-------------------------------------AFVTNIH 41 (378) T ss_pred Cccchhhhhhhcccc-CCCcceeeeccc-chhHHHH-------------------------------------HHHHHHH Confidence 999999998643211 111111111000 0000000 0011122 Q ss_pred HHHhhhhhcccceE--------------eeCCHHHHHHHHH-HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEE Q lcl|NC_019418. 81 KKIASLVYNEQAEI--------------SAEDETLNDFLSD-MLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVA 145 (527) Q Consensus 81 ~~~A~ll~~e~~~i--------------~~~d~~~~~~l~~-~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~ 145 (527) +.+|+-+-+-|..+ ...+..+...|+. =-..-....-....+...+..|.+++.+..++..-++ T Consensus 42 ~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~g~~- 120 (378) T protein:vir:93 42 NKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGEL- 120 (378) T ss_pred HHHHhhhhhCceeeEEEcccccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceE- Confidence 22233222222211 0111222222211 0000011222233455666778888766555321111 Q ss_pred EEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCce Q lcl|NC_019418. 146 FIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGER 225 (527) Q Consensus 146 ~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~ 225 (527) +.+.....+ +.|. ... T Consensus 121 -------~~l~~~~~~------------------~~~~----------------------~~d----------------- 136 (378) T protein:vir:93 121 -------LDLLFADDK------------------KEYK----------------------TEE----------------- 136 (378) T ss_pred -------EEEEecCCe------------------eEec----------------------cce----------------- Confidence 111000000 0000 000 Q ss_pred eecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhH Q lcl|NC_019418. 226 VNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQM 305 (527) Q Consensus 226 v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~ 305 (527) ..|++.|. +.--|.|.++.+ .++++. .+..+..+ .+ T Consensus 137 -----------------------iih~r~~~------~~~~~~s~l~~~---~~~i~~-------~~~~~~~~-----g~ 172 (378) T protein:vir:93 137 -----------------------LVRLTSPF------YINEDTSILDNA---LASIQT-------KLEQGKLR-----GL 172 (378) T ss_pred -----------------------eEEecCcc------ccchhhHHHHHH---HHHHHH-------HHhcCccc-----ce Confidence 01111110 000022222211 112211 11112111 11 Q ss_pred hcCCCCCCCcc-cccccccccccceeeeccC-C-------CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccc Q lcl|NC_019418. 306 TQLKVQDNQGN-IAFKRRFDVEQNVYMQVGA-G-------NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGM 376 (527) Q Consensus 306 l~~~~~~~~~~-~~~~~~~d~~~~~~~~~~~-~-------~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~ 376 (527) +.......... -.....|. ..|..... + -+++..++.++.+..+.+. ..++...++|+...|++|.. T Consensus 173 l~~~~~l~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~l~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~ 248 (378) T protein:vir:93 173 LKINAFLDIDNTQEYREKAL---TTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENI 248 (378) T ss_pred eeeCCcCCHHHHHHHHHHHH---HHHHHhhcccccccceEcCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHH Confidence 11110000000 00000000 00000000 0 0111224444444333343 44566677899999999987 Q ss_pred ccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCC-------cccCccceEEEeCCCccC Q lcl|NC_019418. 377 FTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRG-------TIPELDDISVNLDDGVFT 449 (527) Q Consensus 377 ~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~-------~~~~~~~v~v~f~d~i~~ 449 (527) +.. |++|.. . ...++.+|..++..|..-... .++.. .......+.++++.-... T Consensus 249 l~g------~~~e~~--~----------~~f~~~tl~P~~~~ie~~l~~-kLl~~~er~~~~~~~~~~~~~fd~~~l~~~ 309 (378) T protein:vir:93 249 LLG------TATQEQ--Q----------IYFYNSTIIPLLIQLEKELTY-KLISTNRRRVVKGNLYYERIIVDNQLFKFA 309 (378) T ss_pred hcC------CcHHHH--H----------HHHHHHHHHHHHHHHHHHHHh-hcCChhHhhhhhhcccccceeeccchhhhc Confidence 731 122211 1 112233333333333321110 01100 001122356677777788 Q ss_pred CHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 450 DRHAELDYWMKMVAAGFATQKRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 450 d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) |..+.++...+++.+|+|++-+++++. |+..-+ ..+-+ ......+.........++.+ ++.++|+.. T Consensus 310 d~~~~~~~~~~~~~~G~~t~NE~R~~~-gl~p~~ggD~~~--~~~n~~~~~~~~~~~~~~~~--------~~~~~e~~n 377 (378) T protein:vir:93 310 TLKELIDLYHENINGPIFTQNQLLVKM-GEQPIEGGDVYI--ANLNAVAVKNLSDLQGSRKD--------VTSTDETNN 377 (378) T ss_pred CHHHHHHHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCeee--eccccccccchhhhcCccCC--------CCCCCCCCC Confidence 989999999999999999999976654 544211 00000 00000010000000111111 111111111 No 241 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=91.96 E-value=0.013 Score=30.85 Aligned_cols=379 Identities=12% Similarity=0.101 Sum_probs=118.3 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) ||+|++++.||.+-- +.. .. . .|..|-.+.. . .....-++-.....-..|+ T Consensus 1 Mg~~~~~~~~~~~~~-~~~--~~-----------~--------~~~~~~~~~~--~-----~~~~~l~~~~v~~~v~~Ia 51 (395) T protein:vir:40 1 MGFKSWVSGFFNEEQ-RTL--NL-----------T--------DTVWCSIPSE--K-----LKELSIKKWAIDSCANKIA 51 (395) T ss_pred CchHHHHHhhhcccc-ccc--cc-----------c--------cchhhccccc--c-----chhhhhhhHHHHHHHHHHH Confidence 999999999986411 000 00 0 0111100000 0 0000000000001222344 Q ss_pred HHHhhhhhcccceEeeCCHHHHHHHHHHHhh--hh---HHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcCCceEEE Q lcl|NC_019418. 81 KKIASLVYNEQAEISAEDETLNDFLSDMLSN--DR---FNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQAPVFLPL 155 (527) Q Consensus 81 ~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~--n~---f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a~~~~P~ 155 (527) +.+|++ |..+--++.....-+..+|.. |. ........+...+-.|.+++.+ .+++.. + ++.+... T Consensus 52 ~~ia~~----p~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~--~~~~~~---~-~~~~~~~ 121 (395) T protein:vir:40 52 NTLSCA----EVLTYEKGEEVRKKNWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFM--QDEYIY---V-ADSFTKN 121 (395) T ss_pred HHHhhC----ceeeccCCccccchHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEE--ecCcee---e-cCCcccc Confidence 444432 222211222222222223321 10 1111233445555567777544 333321 1 1111111 Q ss_pred EEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCc Q lcl|NC_019418. 156 QSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDL 235 (527) Q Consensus 156 ~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l 235 (527) ....... .+. .+..... .|.. + ...-.|-|--|-.......+.. +......+ T Consensus 122 ~~~~~~~----~~~-~v~~~~~---~~~~-~-----------------~~~~evih~r~~~~~~~~~~~~--l~~~~~~~ 173 (395) T protein:vir:40 122 DKSLYEN----TYT-EVTLKDL---TLKK-E-----------------FKESEVLHLTLNNESIKSIIDG--FYLLYGDL 173 (395) T ss_pred ccccccc----eee-eeeecCc---eeee-e-----------------eccccEEEeecCCCCccccchh--HHHHHHHH Confidence 0000000 000 0000000 0000 0 0000111100111000000000 00000000 Q ss_pred c----cceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCC Q lcl|NC_019418. 236 Q----PVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQ 311 (527) Q Consensus 236 ~----~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~ 311 (527) . ......+-.++. ..++.+ -+.+. +..+.+.+.++..+... .....++++ T Consensus 174 ~~~~~~~~~~~~~~~~~-l~~~~~----------~~~~~-~~~~~~~~~~~~~~~~~----~~~~~~~~v---------- 227 (395) T protein:vir:40 174 LTAAVNKYKKLNSRKII-VKLKAM----------FGQTP-EAEEKLRLMLSERMKKF----LAEGDSALP---------- 227 (395) T ss_pred HHHHHHHHHhcCCCCce-EEEecc----------cCCCH-HHHHHHHHHHHHHHHHh----hccCCceee---------- Confidence 0 000001111110 111000 00000 00011111111111111 000111111 Q ss_pred CCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHH---HHHHHHHHHhcCCCcccccccccccchHH Q lcl|NC_019418. 312 DNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAI---SEGLKLFEMQIGVSSGMFTFDGQGVKTAT 388 (527) Q Consensus 312 ~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~---~~~l~~i~~~~g~s~~~~~~~~~g~~TAt 388 (527) . +++..++.++.+....++.+.- +.+.++|+...|++|+.++...+. .. T Consensus 228 -----------------------l--~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l~~~~sn---~e 279 (395) T protein:vir:40 228 -----------------------V--EDGMEIDELAGDSKIAESRDIKKMIDDVFEMVANSFNIPLGLAKGDTVG---LS 279 (395) T ss_pred -----------------------c--CCCceEEeccCChhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCcC---HH Confidence 0 0111244444444444444322 334568999999999988532221 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccC-CcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCC Q lcl|NC_019418. 389 EIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYR-GTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFA 467 (527) Q Consensus 389 ei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~-~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~ 467 (527) +. ....++.+|..++..|..-.+. .++. ........+.++++.-+..|..+.++...+++.+|+| T Consensus 280 ~~-------------~~~f~~~~L~P~~~~ie~~l~~-kLl~~~~~~~g~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~ 345 (395) T protein:vir:40 280 EQ-------------VNSFLMFSINPIAEMFTDEGNR-KFYGRDSVLERTYMKLDTTRIKVQDIQEIASSMDVLFHIGVN 345 (395) T ss_pred HH-------------HHHHHHHHHHHHHHHHHHHHHH-hcCChhhhcCCceEEEechhhhccCHHHHHHHHHHHHhCCCC Confidence 11 1222334444444444322111 1111 1112234566777777788888888888999999999 Q ss_pred CHHHHHHhcCCCCH---HHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 468 TQKRGIAKTLGITE---EEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 468 s~~~~i~~~~~~~d---eea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) ++-+++... |+.. .+..+-.. .....+. +...+....+++.+++.++ T Consensus 346 t~NE~R~~~-g~~pi~~~~gD~~~~--~~n~~~~----------~~~~~~~kgge~~~~~~~~ 395 (395) T protein:vir:40 346 TIDDNLRMI-GREPVMSPETQERFV--TKNYAPL----------GENEEDLKGGDINENKGDS 395 (395) T ss_pred CHHHHHHHh-CCCCCCCCCCceeee--ccccccc----------cccccccCCCCCCCCcCCC Confidence 999977554 5432 11111100 0000000 0000001111111111111 No 242 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=91.89 E-value=0.014 Score=30.79 Aligned_cols=434 Identities=9% Similarity=-0.005 Sum_probs=140.9 Q ss_pred CChHH---HHHHHHHHHHHHh--hcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccc-cccccCc--c----cc- Q lcl|NC_019418. 1 MSLIQ---KVKDFFNRGRYNM--TTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIE-YTNTDGD--R----KR- 67 (527) Q Consensus 1 m~~~~---~~k~~~~~~~~~~--~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~-~~~~~~~--~----~~- 67 (527) ..|.+ -+.+=++..-..+ +.-.+.. +.++.-++..-.+++.-..++.+.++.+. .....+. . .. T Consensus 55 ~~~~e~~~~~~~~i~~~~~~iag~g~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~ 131 (651) T protein:vir:99 55 AAFLELNETLATGIRKKSRYEVGFGFDLVP---AQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKEL 131 (651) T ss_pred HHHHhcChHHHHHHHHHhhhhhccCceeee---cccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHH Confidence 11111 1111111111111 0000000 01111112222222222222222111110 0000000 0 00 Q ss_pred --Cceeecc------------hHHHHHHHHhhhhhcccceEeeCCHHHHHHHHHHHhh--hh--HHHHHHHHHHHHHhcC Q lcl|NC_019418. 68 --RKMQHLP------------IARTAAKKIASLVYNEQAEISAEDETLNDFLSDMLSN--DR--FNKNFERYLESALALG 129 (527) Q Consensus 68 --~~~~~ln------------l~~~i~~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~~--n~--f~~~~~~~~~~a~~~G 129 (527) ..+...+ -|..++...+..+ ++..++......+..++.. |. ....+...+ +....+ T Consensus 132 ~~~Dle~tGna~ieiIrn~~g~pv~L~~lp~~~~-----Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~-q~~~~~ 205 (651) T protein:vir:99 132 ARQDYHGVGWLALEMLTDIEGRPVGLAYVPARTV-----RVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYV-QIRNGN 205 (651) T ss_pred HHHHHHHHhhHhhhhhhcCccchhhhhhcChhhe-----eeecccccccchhhhhhhcccccccchhHHHHHH-HHHhcC Confidence 0000000 0001111111100 1111111111112222211 00 011111222 233456 Q ss_pred CEEEEEEEeCC-ee-EEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCce Q lcl|NC_019418. 130 GLAMRPYVDGD-KI-RVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLY 207 (527) Q Consensus 130 ~~~~~~~~d~~-~~-~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~ 207 (527) ..++..+-+.. ++ .+....++.+.++...+........+ .... .| T Consensus 206 ~~~~~~~g~~~~~~~~~~~~~~~~v~~~~~~d~~~~~~~~~--------~~~~-------------------------~g 252 (651) T protein:vir:99 206 RRYFGEAGDRYRGQEVVIDESGDEPTIRYREDEESEREPIF--------VDRE-------------------------TG 252 (651) T ss_pred cceEEEeeccccceeeeeccCCcceeEEeccCcceeeeeec--------ccce-------------------------ee Confidence 67777765432 22 22333444433332221111000000 0000 00 Q ss_pred EEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHH Q lcl|NC_019418. 208 RITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDE 287 (527) Q Consensus 208 ~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~ 287 (527) .++ . .+..|. ..+. .--+.|||.+.+ ...++|+|.+..+...+.... .-.. T Consensus 253 ~~~-----~--~~~~~~--------------~~~~---~~eViHir~~~~----~~g~~G~spl~~a~~~i~~a~-~a~~ 303 (651) T protein:vir:99 253 DVT-----T--GDANGL--------------ENRP---ANELIFIPNPSI----LEDDYGVPDWVSAIRTISADE-AAKD 303 (651) T ss_pred eEE-----E--cCCCce--------------eEec---ccceEEecCCCC----CCCcccccHHHHHHHHHHHHH-HHHH Confidence 000 0 000000 0000 001346664321 134579999998888775443 3333 Q ss_pred HH-HHHHcCc-ceeee--chhHhcCCCCCCCccccccccccc-----ccceeeeccCC---CCCCCcceEeccc--c-Ch Q lcl|NC_019418. 288 FM-WEIKMGQ-RRVIV--PEQMTQLKVQDNQGNIAFKRRFDV-----EQNVYMQVGAG---NMDSGGIVDLTTP--I-RS 352 (527) Q Consensus 288 ~~-~e~~~~~-~~i~v--~~~~l~~~~~~~~~~~~~~~~~d~-----~~~~~~~~~~~---~~~~~~i~~~~~~--i-r~ 352 (527) +. +-|..|. +..++ |...+. . ...-.....|.. .+.++...... .....++++...+ . .+ T Consensus 304 ~~~~~f~NG~~p~gil~~~~~~ls----~-e~~~~lr~~~~~~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D 378 (651) T protein:vir:99 304 YNRDFFDNDTIPRMVIKVTGGELS----E-ESKRDLRQMLNGLREESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEE 378 (651) T ss_pred HHHHHHhccCCCceEEEecCCCCC----H-HHHHHHHHHHHHHhccCCceEEeecccccccccccCCceEEEcCcCchhh Confidence 33 3345543 22222 221110 0 000000011100 01111110000 0011234443333 2 35 Q ss_pred HHHHHHHHHHHHHHHHhcCCCccccccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCC Q lcl|NC_019418. 353 SDYISAISEGLKLFEMQIGVSSGMFTFDGQGV-KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRG 431 (527) Q Consensus 353 e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~-~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~ 431 (527) .++.+..+....+|+...|++|..+|+..++. .|+.+.. ...++.+|.-++..|....+..=+... T Consensus 379 ~qfle~r~~~~~eIa~afgVPp~~lG~~~~~~~sn~E~~~-------------~~f~~~tL~P~~~~ie~eln~kLl~~~ 445 (651) T protein:vir:99 379 MDFRQFREKNEHEIAKVLEVPPVKIGVTDSANRSNSDQQD-------------KDFALEVIQPEQHTFAEWLYQIIHQQA 445 (651) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCcccHHHHH-------------HHHHHHHHHHHHHHHHHHHHHhhcCcc Confidence 67888888888899999999999998765432 2333221 111222333333322221111001111 Q ss_pred cccCccceEEEeCC--CccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCHHHHHHHHHHHHHhcccccccccCCCC Q lcl|NC_019418. 432 TIPELDDISVNLDD--GVFTDRHAELDYWMKMVAAGFATQKRGIAKT--LGITEEEAEKELAEINGELPPESDAELALYG 507 (527) Q Consensus 432 ~~~~~~~v~v~f~d--~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~--~~~~deea~~el~ri~~E~~~~~~~~~~~~~ 507 (527) .......+.+.|+. -+..|..+.++....++++|+|++-+++... .++.++....-+..++.....+. ..++ T Consensus 446 e~~~~~~i~~ef~~~~llr~D~~~~~e~~~~~i~~G~~T~NE~R~~lglppi~~~~gd~~l~~~~~~~~g~~----~~gg 521 (651) T protein:vir:99 446 LGVTDWTIEYELRGADQPKQEAQLAEQRVRAMRLAGVGLVDEAREELGLDPLGEPYGEMTLSEFEAEVAGDV----AGGG 521 (651) T ss_pred ccccCceEEEEeccchhhhccHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcccccccccccccccccc----ccCC Confidence 11223345566654 4456888888888899999999999987654 23333333222222221111110 1111 Q ss_pred CCCCC-CCCCCCCCCCccccC Q lcl|NC_019418. 508 KGQQN-TVGNSKDTVDDEDEA 527 (527) Q Consensus 508 ~~~~~-~~~~~~~~~~~~~~~ 527 (527) +...+ ++.+...-++.+-++ T Consensus 522 e~~~~~~~~~~~~~~~~e~~~ 542 (651) T protein:vir:99 522 ETEAVHEPPEENKIGEREWDT 542 (651) T ss_pred CCcccccCccccccccchhhh Confidence 11111 111111111111111 No 243 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=91.41 E-value=0.016 Score=30.43 Aligned_cols=429 Identities=11% Similarity=0.149 Sum_probs=194.1 Q ss_pred CChHHHHHHHHHHHHHHh---hcccchhhh-cc-----Ccccc----------------------CHHHHHHHHHHHHHh Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNM---TTSHLSSIL-DH-----PKVAV----------------------TQSEFRRIQHNLAYY 49 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~---~~~~~~~~~-~~-----~~i~~----------------------~~~~~~~i~~~~~~y 49 (527) .+....++.|++--..+. ......+.+ .+ ..|.+ ......-|++++.+. T Consensus 2 ~~~l~~~~~~~~~~~~~~~~~~~~~~~s~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~~~~~~~eLI~~YR~ma 81 (521) T protein:vir:81 2 FSRLKMLARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNLPASAWNSLTQQFYSTDQKISTTKQLVNTYRGLM 81 (521) T ss_pred cchhhhhHhhcCchhhhHHhhhccCccccccCCCCCCceEecccCCCcceeecceeeeecccccchhhHHHHHHHHHHHh Confidence 333344455543111110 111111111 00 00100 012233344444442 Q ss_pred cCCCcccccccccCccccCceeecchHHHHHHHHhh-hhh----cccceEeeCCH--------HHHHHHHHHHhhhhHHH Q lcl|NC_019418. 50 QSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIAS-LVY----NEQAEISAEDE--------TLNDFLSDMLSNDRFNK 116 (527) Q Consensus 50 ~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~-ll~----~e~~~i~~~d~--------~~~~~l~~~l~~n~f~~ 116 (527) .. |.+ ...++...+ -+. ..|+++.+++. ...+.++.++.=-+|.+ T Consensus 82 ~~--pEv--------------------d~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~ 139 (521) T protein:vir:81 82 NN--HEV--------------------ENAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFDR 139 (521) T ss_pred hc--cch--------------------hhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccch Confidence 21 111 111111111 111 12445555432 23455666776668999 Q ss_pred HHHHHHHHHHhcCCEEEEEEEeC----CeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcce-EEE-EEEEEee Q lcl|NC_019418. 117 NFERYLESALALGGLAMRPYVDG----DKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNV-YYT-LVEFHEW 190 (527) Q Consensus 117 ~~~~~~~~a~~~G~~~~~~~~d~----~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~-~yt-~lE~h~~ 190 (527) +..+.+....+-|..+|+..+|. |=..+.+++|.++-++..... ++..++ .+. ..|+.-+ T Consensus 140 ~~~~~fR~WYVDgRi~fhkiid~~pk~GI~Elr~lDPr~i~~vr~i~k--------------~~~~~~~v~~~~~e~f~Y 205 (521) T protein:vir:81 140 RGQDMFRRWYVDSRIFFHKIIGKNPKDGIVELRQLDPRNLEYVREIIT--------------EDTPEGKIYKATKEYFIY 205 (521) T ss_pred hhhHHHhhhhhcceEEEEEEEcCCccccceeeeeeCCcceeeeeeecc--------------cccCccceecceeeeeee Confidence 99999999999999999999873 224688899988887642211 111110 010 0111100 Q ss_pred cccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceee--cCCCcccEEEecCCccccccCCCccCc Q lcl|NC_019418. 191 VTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPI--QGLSRPLFTYLKTPGMNNKDINSPLGL 268 (527) Q Consensus 191 ~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~--~g~~~p~f~~~~~~~~N~~~~~splG~ 268 (527) ...+..|......|. ++ .|-.+|-+ .+++ +|+ + +.....=+ T Consensus 206 ----------~~~~~~~~~~g~~~~---~~-~~vkI~~d--------AI~y~hSGl-------~--------d~~~~~i~ 248 (521) T protein:vir:81 206 ----------TVGNSSYCAGGQVFS---PN-SRVKIPRS--------AITYAHSGL-------M--------DCDDKYII 248 (521) T ss_pred ----------ecCCccccccceeec---CC-cceeechh--------heeeeeccc-------e--------eCCCCeee Confidence 000111111111111 11 11122211 1111 221 0 11111123 Q ss_pred chhhhhHHHHHHHHHHHHHHH--HHHHcCcceeeec----------hh-----HhcCCC----CCCCcccccccccccc- Q lcl|NC_019418. 269 SIFDNAKTTIDFINRTYDEFM--WEIKMGQRRVIVP----------EQ-----MTQLKV----QDNQGNIAFKRRFDVE- 326 (527) Q Consensus 269 S~~~~~~~lid~ld~~~s~~~--~e~~~~~~~i~v~----------~~-----~l~~~~----~~~~~~~~~~~~~d~~- 326 (527) |-+..|.-.+..|=-.-+.++ +-.|+-.+|||-- +. |.+... |..+|+..-.+.+-.- T Consensus 249 syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMl 328 (521) T protein:vir:81 249 GYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMT 328 (521) T ss_pred ecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccccccccchh Confidence 556666665555554444443 4445666777631 01 111111 4445544322221110 Q ss_pred cceeeeccCCCCC-CCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccccc-c--cchHHHHHHHHHHHHHHHH Q lcl|NC_019418. 327 QNVYMQVGAGNMD-SGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQ-G--VKTATEIVSENSDTYQMRN 402 (527) Q Consensus 327 ~~~~~~~~~~~~~-~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~-g--~~TAtei~s~~~~~~~~~~ 402 (527) ...+.+- -+|+ ..-|+++..-=...+ ...+..+.+.+....+++..-++..++ + .--++||.-..-.....+. T Consensus 329 EDyWLpR--ReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~ 405 (521) T protein:vir:81 329 EDYWLQR--RDGKAITDVTTLPGASGMSD-IDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIR 405 (521) T ss_pred hhhcccc--cCCCcccceeecccCCCCCh-HHHHHHHHHHHHHHhCCccccccCCCCcceeccccchhhHHHHHHHHHHH Confidence 0001110 1122 223666654333333 455666677777777888777743332 2 1236677666666677788 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCc----cceEEEeCCCccCCHHHHHHHHHH---HH--hcC----CCCH Q lcl|NC_019418. 403 SIVALVEQSIKELCVSMCELGKVVGIYRGTIPEL----DDISVNLDDGVFTDRHAELDYWMK---MV--AAG----FATQ 469 (527) Q Consensus 403 ~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~----~~v~v~f~d~i~~d~~~~~~~~~~---~~--~aG----i~s~ 469 (527) +.+..|..-+.++++.=|.|-.. +. ..+| ..|.++|...=-..+..+++.... +. ..+ ..|. T Consensus 406 rLR~rFs~lf~~~L~~qLilKgi---it--~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~ 480 (521) T protein:vir:81 406 TRQSQFSEVLRDPLKYNLILKNV---IT--EDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSN 480 (521) T ss_pred HHHHHHHHHHHHHHHHhhhhhcC---CC--HHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhhhhhccccch Confidence 88888888888888876655322 11 1112 236677754333333333332221 11 112 3588 Q ss_pred HHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCC Q lcl|NC_019418. 470 KRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSK 518 (527) Q Consensus 470 ~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 518 (527) +++.++....||+|.+++..+|++|.... .+.+++++. ++- T Consensus 481 dyi~k~ILr~tDeei~~~~k~I~~E~~~~------~~~~p~~~~--~~f 521 (521) T protein:vir:81 481 QTVMRDILKYTDDQMDTEKKQIEEEANDP------RFKQTPDEI--EDF 521 (521) T ss_pred HHHHHHHhccCHHHHHHHHHHHHHHhhCC------CCCCCcccc--cCC Confidence 88878888999999999999999987431 121111111 001 No 244 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=90.74 E-value=0.019 Score=29.99 Aligned_cols=428 Identities=11% Similarity=0.151 Sum_probs=192.3 Q ss_pred CChHHHHHHHHHH---HH-HHhhcccchhhh-cc-----CccccC----------------------HHHHHHHHHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNR---GR-YNMTTSHLSSIL-DH-----PKVAVT----------------------QSEFRRIQHNLAY 48 (527) Q Consensus 1 m~~~~~~k~~~~~---~~-~~~~~~~~~~~~-~~-----~~i~~~----------------------~~~~~~i~~~~~~ 48 (527) .+....+|.|.+- -. .++..+. .+.+ .+ ..|.++ .....-|++++.+ T Consensus 2 ~~~l~~~~~~~~~d~~~~~e~~~~~~-~s~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~~~~~~~eLI~~YR~m 80 (521) T protein:vir:65 2 FSRLKMLARWADFDNDKYEEQIKDKA-ESIAAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQKISTTKQLVNTYRGL 80 (521) T ss_pred ccchhhhhhccCchhhHHHhhhccCC-CcccCCCCCCCceeecccCCccccccccceeeeccccchhhhHHHHHHHHHHH Confidence 2223334444431 11 1111111 1111 10 111100 0111222222222 Q ss_pred hcCCCcccccccccCccccCceeecchHHHHHHHHhh-hhh----cccceEeeCCH--------HHHHHHHHHHhhhhHH Q lcl|NC_019418. 49 YQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIAS-LVY----NEQAEISAEDE--------TLNDFLSDMLSNDRFN 115 (527) Q Consensus 49 y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~-ll~----~e~~~i~~~d~--------~~~~~l~~~l~~n~f~ 115 (527) ... +--...++...+ -+. ..|+++.+++. ...+.++.++.=-+|. T Consensus 81 a~~----------------------pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~ 138 (521) T protein:vir:65 81 MNN----------------------HEVENAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFD 138 (521) T ss_pred hhc----------------------cchhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccc Confidence 111 111111222222 111 12455555433 2345566677666899 Q ss_pred HHHHHHHHHHHhcCCEEEEEEEeC----CeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcc--eEEEEEEEEe Q lcl|NC_019418. 116 KNFERYLESALALGGLAMRPYVDG----DKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKN--VYYTLVEFHE 189 (527) Q Consensus 116 ~~~~~~~~~a~~~G~~~~~~~~d~----~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~--~~yt~lE~h~ 189 (527) ++..+.+....+-|..+|+..+|. |=..+.+++|.++-++..... ++..+ ++-...|+.- T Consensus 139 ~~~~~~fR~WYVDgRi~fhkiid~~pk~GI~ELr~lDPr~i~~vr~i~k--------------~~~~~~~v~~~~~e~f~ 204 (521) T protein:vir:65 139 RRGQDMFRRWYVDSRIFFHKIIGKNPKDGIVELRQLDPRNLEYVREIIT--------------EDTPEGKIYKATKEYFI 204 (521) T ss_pred hhhhHHHhhhhhcceeEEEEEEcCCccccceeeeeeCCcceeeeeeecc--------------cccCCcceecceeeeee Confidence 999999999999999999999873 224688899988887642111 11111 1100111110 Q ss_pred ecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceee--cCCCcccEEEecCCccccccCCCccC Q lcl|NC_019418. 190 WVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPI--QGLSRPLFTYLKTPGMNNKDINSPLG 267 (527) Q Consensus 190 ~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~--~g~~~p~f~~~~~~~~N~~~~~splG 267 (527) + ...+..|......|.. + .|-.+|-+ .+++ +|+ + +.....= T Consensus 205 Y----------~~~~~~~~~~g~~~~~---~-~~vkI~~d--------AI~y~hSGl-------~--------d~~~~~i 247 (521) T protein:vir:65 205 Y----------TVGNSSYCAGGQVFSP---N-SRVKIPRS--------AITYAHSGL-------M--------DCDDKYI 247 (521) T ss_pred e----------ecCCcceeccceeecC---C-cceeechh--------heeeeeccc-------e--------eCCCCee Confidence 0 0001111111111111 1 11122211 1111 221 0 1111112 Q ss_pred cchhhhhHHHHHHHHHHHHHHH--HHHHcCcceeeec----------hh-----HhcCCC----CCCCcccccccccccc Q lcl|NC_019418. 268 LSIFDNAKTTIDFINRTYDEFM--WEIKMGQRRVIVP----------EQ-----MTQLKV----QDNQGNIAFKRRFDVE 326 (527) Q Consensus 268 ~S~~~~~~~lid~ld~~~s~~~--~e~~~~~~~i~v~----------~~-----~l~~~~----~~~~~~~~~~~~~d~~ 326 (527) +|-+..|.-.+..|=-.-+.++ +-.|+-.+|||-- +. |.+... |..+|+..-.+.+-.- T Consensus 248 ~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msM 327 (521) T protein:vir:65 248 IGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSM 327 (521) T ss_pred eecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccccccccch Confidence 3556666666555554444443 4445666777631 01 111111 4445544322221110 Q ss_pred -cceeeeccCCCCC-CCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccccc-c--cchHHHHHHHHHHHHHHH Q lcl|NC_019418. 327 -QNVYMQVGAGNMD-SGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQ-G--VKTATEIVSENSDTYQMR 401 (527) Q Consensus 327 -~~~~~~~~~~~~~-~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~-g--~~TAtei~s~~~~~~~~~ 401 (527) ...+.+- -+|+ ..-|+++..-=...+ ...+..+.+.+....+++.+-++..++ + .--++||.-..-.....+ T Consensus 328 lEDyWLpR--ReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~EItRDEiKF~KFI 404 (521) T protein:vir:65 328 TEDYWLQR--RDGKAITDVTTLPGASGMSD-IDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFI 404 (521) T ss_pred hhhhcccc--cCCCCccceeecccCCCcCh-HHHHHHHHHHHHHHhCCCceeccCCCCcceeccccchhhHHHHHHHHHH Confidence 0001110 1122 223666654333333 455666677777777888777644332 2 123667766666667778 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCc----cceEEEeCCCccCCHHHHHHHHHH---HH--hcC----CCC Q lcl|NC_019418. 402 NSIVALVEQSIKELCVSMCELGKVVGIYRGTIPEL----DDISVNLDDGVFTDRHAELDYWMK---MV--AAG----FAT 468 (527) Q Consensus 402 ~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~----~~v~v~f~d~i~~d~~~~~~~~~~---~~--~aG----i~s 468 (527) .+.+..|..-+.++++.=|.|-.. +. ..+| ..|.++|...=-..+..+++.... +. ..+ ..| T Consensus 405 ~rLR~rFs~lf~~~L~~qLilKgi---it--~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~S 479 (521) T protein:vir:65 405 RTLQSQFSEVLRDPLKYNLILKNV---IT--EDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFS 479 (521) T ss_pred HHHHHHHHHHHHHHHHHhhhhhcC---CC--HHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 888888888888888876655322 11 1112 236677754333333333332221 11 112 368 Q ss_pred HHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCC Q lcl|NC_019418. 469 QKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSK 518 (527) Q Consensus 469 ~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 518 (527) .+++.++....||+|.+++..+|++|.... .+.+++++. ++- T Consensus 480 ~dyi~k~ILr~tDeei~~~~k~I~~E~~~~------~~~~p~~~~--~~f 521 (521) T protein:vir:65 480 NQTVMRDILKYTDDQMDTEKKQIEEEANDP------RFKQTPDEI--EDF 521 (521) T ss_pred hHHHHHHHhccCHHHHHHHHHHHHHhhhCC------CCCCCcccc--cCC Confidence 888888888999999999999999987421 111111110 001 No 245 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=90.63 E-value=0.02 Score=29.91 Aligned_cols=419 Identities=12% Similarity=0.137 Sum_probs=187.5 Q ss_pred CccccCHHHHHHHHHHHHHhcCCCccc-ccccccCcc--------------------ccCceeecchHHHHHHHHhhh-- Q lcl|NC_019418. 30 PKVAVTQSEFRRIQHNLAYYQSKFDDI-EYTNTDGDR--------------------KRRKMQHLPIARTAAKKIASL-- 86 (527) Q Consensus 30 ~~i~~~~~~~~~i~~~~~~y~g~~~~l-~~~~~~~~~--------------------~~~~~~~lnl~~~i~~~~A~l-- 86 (527) .++-...+... + ..--.++..-+ ...+.+|-. ......+ -+.+++..-++ T Consensus 1 ~~~w~~~de~~-~---~~~~~~~~~S~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~---~~eLI~~YR~ma~ 73 (511) T protein:vir:56 1 MKFWTKEEEQD-I---QKIEKNPVRSFSAPDNVDGAKEIHTNLLAPQLGHAIIPSDAQSEGTIP---VKELIKSYRALAE 73 (511) T ss_pred CCCccchhhhh-h---hhhccCCcccccCCCCCCCceEEecccccceecceeccccccccCccc---hHHHHHHHHHHhh Confidence 00100000000 0 00001110000 000000000 0000000 01222222221 Q ss_pred ----------hhc---------ccceEeeCCHH--------HHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC Q lcl|NC_019418. 87 ----------VYN---------EQAEISAEDET--------LNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG 139 (527) Q Consensus 87 ----------l~~---------e~~~i~~~d~~--------~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~ 139 (527) +.+ .|+++.+++.. ..+.++.++.--+|.+...+.+....+-|..+|+..+|+ T Consensus 74 ~pEvd~Av~eIvne~iv~d~~~~pV~l~ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~ 153 (511) T protein:vir:56 74 YHEVDDAIQEIVDEAIVYENDKEVVWLNLDNTDFSENIKAKINEEFDRVVSLLQMRKHGYKWFRKWYVDSRIYFHKILDK 153 (511) T ss_pred ccchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecc Confidence 112 24455555432 445566677767899999999999999999999999985 Q ss_pred C-e-eEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcc--eEEEEEEEEeecccccccceeeecCCceEEEEEEEe Q lcl|NC_019418. 140 D-K-IRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKN--VYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYK 215 (527) Q Consensus 140 ~-~-~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~--~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~ 215 (527) . + ..+.+++|.++-++.. ++.+...+ ++.-.+|+..+.. +.+.-....+. T Consensus 154 k~GI~eLr~lDPr~i~~vr~--------------i~~~~~~~~~v~~~~~ey~~Y~~------------~~~~~~~~~~~ 207 (511) T protein:vir:56 154 DNNIIELRPLNPMKMELVRE--------------IQKETIDGVEVVKGTLEYYVYKQ------------SDYKMPSWMSA 207 (511) T ss_pred ccceeehhhcCcccchhhhh--------------hhcccccccccccceeeeeEecC------------CCcccCccccc Confidence 3 2 4577788877766521 11111110 1100122211100 00000000000 Q ss_pred cCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHH--HHHH Q lcl|NC_019418. 216 STSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFM--WEIK 293 (527) Q Consensus 216 ~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~--~e~~ 293 (527) .. ....|-.+|- ..+++.. ++.+-.+ ...++.+|-+..|.-.+..|=..-+.++ +-.| T Consensus 208 ~~-~~~~~vkI~~--------daI~y~h-----SGL~d~~------~~~g~i~syLhkAiKp~NQLkm~EDAlVIYRitR 267 (511) T protein:vir:56 208 TN-RAQTSFRIPK--------DAIVFAH-----SGLMRGC------ADDPYIIGYLDRAIKPANQLKMLEDALVIYRLAR 267 (511) T ss_pred cc-ccccceeech--------hheeeec-----ccceecc------CCCCeeeccchhhhHHHHhhHHHHhhHHHHhhhc Confidence 00 0001111111 1111110 0111000 1234567778888777777765555544 4446 Q ss_pred cCcceeeec----------hhHh---------cCCCCCCCcccccccccccc-cceeeeccCCCCC-CCcceEeccccCh Q lcl|NC_019418. 294 MGQRRVIVP----------EQMT---------QLKVQDNQGNIAFKRRFDVE-QNVYMQVGAGNMD-SGGIVDLTTPIRS 352 (527) Q Consensus 294 ~~~~~i~v~----------~~~l---------~~~~~~~~~~~~~~~~~d~~-~~~~~~~~~~~~~-~~~i~~~~~~ir~ 352 (527) +-.+|||-- +.++ +..-|..+|++.-.+.+-.- ...+.+- -+|+ ..-|+++..-=.. T Consensus 268 APeRRvFYIDVGnLPk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~msMlEDyWLpR--ReGgrgTEItTLpGgqnl 345 (511) T protein:vir:56 268 APERRVFYVDVGNLPTQKAQQYVNGIMQNVKNRVVYDTQTGQVKNTTNAMSMLEDYYLPR--REGSKGTEVSTLPGGQSL 345 (511) T ss_pred cccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchhhhhhHhhhcccc--cCCCCccceeeccccCCc Confidence 666777641 1111 00113344443322211100 0001110 1122 1235555543333 Q ss_pred HHHHHHHHHHHHHHHHhcCCCcccccccc--cc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_019418. 353 SDYISAISEGLKLFEMQIGVSSGMFTFDG--QG--VKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGI 428 (527) Q Consensus 353 e~~~~~~~~~l~~i~~~~g~s~~~~~~~~--~g--~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~ 428 (527) .+ ...+..+.+.+....+++.+-+..++ +| .--++||.-..-.....+.+.+..|..-+.++++.=|.|-.. T Consensus 346 ge-m~DV~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgi--- 421 (511) T protein:vir:56 346 GD-IEDVLYFNRKLYKAMRIPTSRAASEDQTGGINFGQGAEITRDELKFTKFVKRLQTKFETVITDPLKHQLIVNNI--- 421 (511) T ss_pred Ch-HHHHHHHHHHHHHHhCCCcccccCCCCccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccC--- Confidence 33 45566667777777788777665332 11 123677776666677788888888888888888876655332 Q ss_pred cCCcccCc--cceEEEeCCCccCCHHHHHHHHHH---HH--hcC----CCCHHHHHHhcCCCCHHHHHHHHHHHHHhccc Q lcl|NC_019418. 429 YRGTIPEL--DDISVNLDDGVFTDRHAELDYWMK---MV--AAG----FATQKRGIAKTLGITEEEAEKELAEINGELPP 497 (527) Q Consensus 429 ~~~~~~~~--~~v~v~f~d~i~~d~~~~~~~~~~---~~--~aG----i~s~~~~i~~~~~~~deea~~el~ri~~E~~~ 497 (527) +...-+.. ..|.++|...=-..+..+++.... +. ..+ ..|.+++.++....||+|.+++..+|++|... T Consensus 422 it~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~~yi~k~ILr~tDeei~~~~k~I~~E~k~ 501 (511) T protein:vir:56 422 ITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGKYYSHKYIQKNILRLSDDQITAMQSEIDEEETN 501 (511) T ss_pred CCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhccccchHHHHHHHhccCHHHHHHHHHHHHHhhcC Confidence 21111111 346677754333333333332221 11 112 35888888888899999999999999998753 Q ss_pred ccccccCCCCCCCCCC Q lcl|NC_019418. 498 ESDAELALYGKGQQNT 513 (527) Q Consensus 498 ~~~~~~~~~~~~~~~~ 513 (527) +.+.+.+++- T Consensus 502 ------~~~~~~e~~f 511 (511) T protein:vir:56 502 ------PRFQQDDQGF 511 (511) T ss_pred ------CCCCCcccCC Confidence 1222111111 No 246 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=90.62 E-value=0.02 Score=29.91 Aligned_cols=446 Identities=11% Similarity=0.133 Sum_probs=194.2 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHH----HHHHhcCCCcccccccccCcccc-----Ccee Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQH----NLAYYQSKFDDIEYTNTDGDRKR-----RKMQ 71 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~----~~~~y~g~~~~l~~~~~~~~~~~-----~~~~ 71 (527) |+|.+-.+-|.+.--.. ..+.+.+ ....++.|...-.+... +-.-|.|-.. .+....+..+. +.+. T Consensus 1 ~~~~~lf~f~~~~d~~~-~~~~~~~--~~~s~~~p~~~DGa~~i~~~~~~~~~~g~~~--~~~d~~~~~~~~~~LI~~YR 75 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNE-YDERLKQ--GHESIATPKKDDGATEIEAREGESSYNALMQ--QFFGIDNNISGTKDLINTYR 75 (516) T ss_pred CCchHhcccccchhhHH-HHhhhcC--CCCcccCCCCccCceeeecCcccccccceee--eeecccCccccHHHHHHHHH Confidence 88887777765521111 1111100 01111111111110000 0000001000 00011111110 0000 Q ss_pred ec-chH--HHHHHHHhh-hhhc----ccceEeeCCH--------HHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEE Q lcl|NC_019418. 72 HL-PIA--RTAAKKIAS-LVYN----EQAEISAEDE--------TLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRP 135 (527) Q Consensus 72 ~l-nl~--~~i~~~~A~-ll~~----e~~~i~~~d~--------~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~ 135 (527) +| .-| ...++...+ -+.. .|+++.+++. ...+.++.++.--+|.+...+.+....+-|..+|+. T Consensus 76 ~ma~~pEvd~Av~eIvneaiv~d~~~~pV~l~l~~~e~s~sik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhK 155 (516) T protein:vir:10 76 QLTNNPEVERAVANIVNEAVVYEKGHKVVSLDLDDTEFSSSIKDKILEEFDEICRLLDASRKLDTLFRRWYIDSRIFFHK 155 (516) T ss_pred HhhhccchhHHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhhhhcceEEEEE Confidence 00 000 011122221 1111 2455555543 234556667776689999999999999999999998 Q ss_pred EEeC---CeeEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcc--eEEEEEEEEeecccccccceeeecCCceEEE Q lcl|NC_019418. 136 YVDG---DKIRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKN--VYYTLVEFHEWVTPTGQEVGSTKDKSLYRIT 210 (527) Q Consensus 136 ~~d~---~~~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~--~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~ 210 (527) ++|+ |=..+.+++|.++.++.. ++.++.++ +.....|+..+. .....|.+. T Consensus 156 iid~~k~GI~elr~lDPr~i~~vR~--------------i~~~~~~~~~v~~~~~e~~~Y~----------~~~~~~~~~ 211 (516) T protein:vir:10 156 IMPNPKEGIVELRRLDPRHVEYYRE--------------IVTSDVGGTSVVKGYREFFVYT----------TGNEGYAYN 211 (516) T ss_pred EecCcccceeeeeeeCCcceeeEEe--------------eecccCcchhhhhceeeeeeee----------cCccceecc Confidence 8885 235688899988887632 11111110 000001111100 000111100 Q ss_pred EEEEecCCccccCceeecccccCCcccceee--cCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHH Q lcl|NC_019418. 211 NELYKSTSDSQLGERVNLSELYPDLQPVTPI--QGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEF 288 (527) Q Consensus 211 n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~--~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~ 288 (527) -..|. +++ +-.+|-+ .+++ +|+- +. | ...=+|-+..|.-.+..|=..-+.+ T Consensus 212 g~~~~---~~~-~ikI~~d--------aI~y~hSGl~-------d~---~-----~~~i~syLhkAiKp~NQLkm~EDAl 264 (516) T protein:vir:10 212 GRLFE---PNT-RIKIPRS--------AIVYAHSGLQ-------DC---S-----DRGIVGYLHNAVKPANQLKLLEDAL 264 (516) T ss_pred ccccC---CCC-ceecchh--------heeeeecCcc-------cC---C-----CCceeceehhhhHhHHhhHHHHhhH Confidence 01111 110 0111111 1111 1211 00 0 0011344555555555554444443 Q ss_pred H--HHHHcCcceeeec----------hhHh---------cCCCCCCCcccccccccccc-cceeeeccCCCCC-CCcceE Q lcl|NC_019418. 289 M--WEIKMGQRRVIVP----------EQMT---------QLKVQDNQGNIAFKRRFDVE-QNVYMQVGAGNMD-SGGIVD 345 (527) Q Consensus 289 ~--~e~~~~~~~i~v~----------~~~l---------~~~~~~~~~~~~~~~~~d~~-~~~~~~~~~~~~~-~~~i~~ 345 (527) + +-.|+-.+|||-- +.++ +..-|..+|++.-.+.+-.- ...+.+- -+|+ ..-|++ T Consensus 265 VIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~KNklvYDa~TGev~ddrk~msMlEDyWLpR--ReGgrgTEItT 342 (516) T protein:vir:10 265 VIYRITRAPERRVFYIDVGNMPNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMR--RDGKSVTEVTS 342 (516) T ss_pred HHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccc--cCCCcccceee Confidence 3 4445666677531 0111 00113344443322211100 0001110 1122 123555 Q ss_pred eccccChHHHHHHHHHHHHHHHHhcCCCcccccccccc-c--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019418. 346 LTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQG-V--KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCEL 422 (527) Q Consensus 346 ~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g-~--~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~ 422 (527) +..-=...+ ...+..+.+.+....+++.+-+..++++ . --++||.-.+-....-+.+.+..|..-+.++++.=|.| T Consensus 343 LpGgqnlge-m~DV~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lF~~~L~~qLil 421 (516) T protein:vir:10 343 LPGAQTMGE-MDDVRWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITRDELDFRKFIVQLQHNFEEIFLDPLKTNLIY 421 (516) T ss_pred ccccCCcCh-HHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 554333333 4556667777878888887777654432 2 23577766666666778888888888888888876654 Q ss_pred hhhhcccCCcccCc--cceEEEeCCCccCCHHHHHHHHHH---H------HhcCCCCHHHHHHhcCCCCHHHHHHHHHHH Q lcl|NC_019418. 423 GKVVGIYRGTIPEL--DDISVNLDDGVFTDRHAELDYWMK---M------VAAGFATQKRGIAKTLGITEEEAEKELAEI 491 (527) Q Consensus 423 ~~~~~~~~~~~~~~--~~v~v~f~d~i~~d~~~~~~~~~~---~------~~aGi~s~~~~i~~~~~~~deea~~el~ri 491 (527) -.. +...-+.. ..|.++|...=-..+..+++.... + .-+...|.+++.++....||+|.+++-.+| T Consensus 422 KgI---it~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I 498 (516) T protein:vir:10 422 KKI---ILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVGKYVSHDYVMKNILQMTDEQIAQEEKQI 498 (516) T ss_pred cCC---CCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhHHHHHHHHH Confidence 322 11111111 236677744332333333322221 1 123468888888888899999999999999 Q ss_pred HHhcccccccccCCCCCC-CCCCC Q lcl|NC_019418. 492 NGELPPESDAELALYGKG-QQNTV 514 (527) Q Consensus 492 ~~E~~~~~~~~~~~~~~~-~~~~~ 514 (527) ++|.... .+.++ ++.+. T Consensus 499 ~~E~~~~------~~~~p~~e~~f 516 (516) T protein:vir:10 499 EKEANVK------RFQNPENEDDF 516 (516) T ss_pred HHhhhCC------CCCCCCccccC Confidence 9987431 11111 11111 No 247 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=89.43 E-value=0.026 Score=29.24 Aligned_cols=266 Identities=13% Similarity=0.074 Sum_probs=109.5 Q ss_pred HhhhhhcccceEe----eCCHHHHHHHHH-HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-Ce-eEEEEEcCCceEEE Q lcl|NC_019418. 83 IASLVYNEQAEIS----AEDETLNDFLSD-MLSNDRFNKNFERYLESALALGGLAMRPYVDG-DK-IRVAFIQAPVFLPL 155 (527) Q Consensus 83 ~A~ll~~e~~~i~----~~d~~~~~~l~~-~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~-~~i~~v~a~~~~P~ 155 (527) .|++ |..+. ..+..+...|.. --........+..++.+.+..|.+++.+..+. |+ +.+..++|+.+-+. T Consensus 1 ia~l----~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~~l~~l~~~~v~v~ 76 (278) T protein:vir:78 1 MASL----PLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEML 76 (278) T ss_pred Cccc----eeEEEecCcccccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEEEEEEECCceeEEE Confidence 1111 11110 011122222210 00011122234455667788899988887764 33 35556667665543 Q ss_pred EEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCc Q lcl|NC_019418. 156 QSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDL 235 (527) Q Consensus 156 ~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l 235 (527) ..+.+ ...+|. ++. .. |..+.+ + T Consensus 77 ~~~~~-----------------~~~~y~-~~~-----------------~~----------------g~~~~~---~--- 99 (278) T protein:vir:78 77 IENQS-----------------RELYYS-IHA-----------------AT----------------GNKLIV---H--- 99 (278) T ss_pred EcCCC-----------------ceEEEE-EEc-----------------CC----------------ceEEEE---c--- Confidence 22111 112221 110 00 111000 0 Q ss_pred ccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCCCCCCCc Q lcl|NC_019418. 236 QPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLKVQDNQG 315 (527) Q Consensus 236 ~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~~~ 315 (527) +. -..|++.+.+. ..++|+|.+..+...++.....-..-...+..+..-|+....-+ +.+.. T Consensus 100 ~~----------evih~~~~~~~----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~l----~~e~~ 161 (278) T protein:vir:78 100 NM----------DMLHFKHIVAS----NMVQGISPIDVLKNTTDFDNAVRTFNLTEMQKPDSFMLKYGSNV----GKEKR 161 (278) T ss_pred cc----------cEEEECCCCCC----CCeeeccHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeCCCC----CHHHH Confidence 00 12455543222 34579999998888887655443322222233332222211111 00000 Q ss_pred ccccccccccccceeee---ccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccc-cchHHHHH Q lcl|NC_019418. 316 NIAFKRRFDVEQNVYMQ---VGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQG-VKTATEIV 391 (527) Q Consensus 316 ~~~~~~~~d~~~~~~~~---~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g-~~TAtei~ 391 (527) + .....|. ..+.. +-.- +++..++.++......++.+..+...++|+...|++|..+|...++ -.|+.+.. T Consensus 162 ~-~~~~~~~---~~~~~~g~~~vl-~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~~ 236 (278) T protein:vir:78 162 Q-QVLEDFK---QYYEENGGILFQ-EPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELN 236 (278) T ss_pred H-HHHHHHH---HHhccCCCceec-CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH Confidence 0 0000110 00000 0000 1223466777777788888888888999999999999998876543 33443321 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCC-cccCccceEEEeCCCcc Q lcl|NC_019418. 392 SENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRG-TIPELDDISVNLDDGVF 448 (527) Q Consensus 392 s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~-~~~~~~~v~v~f~d~i~ 448 (527) ...++.+|..++..|....+. .++.. ..... .-|.|+=+.. T Consensus 237 -------------~~~~~~~l~P~~~~i~~~ln~-~L~~~~e~~~g--~~~~f~~~~l 278 (278) T protein:vir:78 237 -------------RFYLQHTLLPIVKQYEEEFNR-KLLTKTDREKI--GILNLTLNLI 278 (278) T ss_pred -------------HHHHHHHHHHHHHHHHHHHHh-hcCChhHhcCC--ceEEEecccC Confidence 112223333333333222111 11111 01111 2344543333 No 248 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=89.42 E-value=0.026 Score=29.23 Aligned_cols=416 Identities=15% Similarity=0.163 Sum_probs=160.2 Q ss_pred CChHH-HHHHHHH-HHHHH----hhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCc---ccccccccCccccCcee Q lcl|NC_019418. 1 MSLIQ-KVKDFFN-RGRYN----MTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFD---DIEYTNTDGDRKRRKMQ 71 (527) Q Consensus 1 m~~~~-~~k~~~~-~~~~~----~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~---~l~~~~~~~~~~~~~~~ 71 (527) |+=|= .-=+=++ ....+ ...........|+--.+++..+.+|-+-.. .|+.. .|-+.-.. . T Consensus 1 m~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~--~gd~~~~~~L~~dm~~--------~ 70 (512) T protein:vir:19 1 MGRILDISGQPFDFDDEMQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAE--RGDLTAQADLAFDMEE--------K 70 (512) T ss_pred CcceeCCCCCccccccccccccchhcccchhhccccccCCCHHHHHHHHHHhh--CCCHHHHHHHHHHHHh--------h Confidence 21110 0000000 00000 000000112234444455544444433210 01100 00000000 0 Q ss_pred ecchHHHHHHHHhhhhhcccceEeeC------CHHHHHHHHHHHhhh-hHHHHHHHHHHHHHhcCCEEEEEEEe--CCee Q lcl|NC_019418. 72 HLPIARTAAKKIASLVYNEQAEISAE------DETLNDFLSDMLSND-RFNKNFERYLESALALGGLAMRPYVD--GDKI 142 (527) Q Consensus 72 ~lnl~~~i~~~~A~ll~~e~~~i~~~------d~~~~~~l~~~l~~n-~f~~~~~~~~~~a~~~G~~~~~~~~d--~~~~ 142 (527) ..-+...+-+. -.-|.+.+-+|.-. +....++++++|.+. .|...+..++ +|..+|=+++-+.|. ++.. T Consensus 71 D~hi~s~l~~R-k~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~~~~ll-dA~~~G~s~~Ei~w~~~~g~~ 148 (512) T protein:vir:19 71 DTHLFSELSKR-RLAIQALEWRIAPARDASAQEKKDADMLNEYLHDAAWFEDALFDAG-DAILKGYSMQEIEWGWLGKMR 148 (512) T ss_pred ChHHHHHHHHH-HHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcCCCHHHHHHHHH-hhhhhcceeeeeEeeeeCCce Confidence 01122222222 23455555555421 224567888888654 4777666654 688899999988784 3322 Q ss_pred ---EEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCc Q lcl|NC_019418. 143 ---RVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSD 219 (527) Q Consensus 143 ---~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~ 219 (527) ++.++++..|.. +.++ ...+. ++ +. T Consensus 149 ~~~~~~~r~~~~f~~---~~~~--------------------------------------------~~~lr---~~--~~ 176 (512) T protein:vir:19 149 VPVALHHRDPALFCA---NPDN--------------------------------------------LNELR---LR--DA 176 (512) T ss_pred eeeeeeeecccccee---ccCC--------------------------------------------CcEEE---ec--CC Confidence 233444432221 0000 00000 00 00 Q ss_pred cccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHc-Ccce Q lcl|NC_019418. 220 SQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKM-GQRR 298 (527) Q Consensus 220 ~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~-~~~~ 298 (527) ..-|.+ +++. -|.+... +...++|+|.+.+..|.-..--=+..+..|+.=++. |.+- T Consensus 177 ~~~G~~---------l~~~---------k~i~~~~----~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~ 234 (512) T protein:vir:19 177 SYHGLE---------LQPF---------GWFMHRA----KSRTGYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIYGLPM 234 (512) T ss_pred CCCcee---------ecCC---------ceEEEec----cCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHcCCCe Confidence 101111 1111 1222211 112367888888888766655555555555543443 3332 Q ss_pred eeechhHhcCCCCCCCcccccccccccccceeeec-cCCC------CCCCcceEeccc-cChHHHHHHHHHHHHHHHHhc Q lcl|NC_019418. 299 VIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQV-GAGN------MDSGGIVDLTTP-IRSSDYISAISEGLKLFEMQI 370 (527) Q Consensus 299 i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~-~~~~------~~~~~i~~~~~~-ir~e~~~~~~~~~l~~i~~~~ 370 (527) .+. +.+ .+...+ ..+.+...+ ++.. ..+..|+.++.. ...+.|...++.+-++|+... T Consensus 235 ~ig-----ky~-~~a~~~--------ek~~L~~al~~~~~~a~~iiP~~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~i 300 (512) T protein:vir:19 235 RVG-----KYP-TGSTNR--------EKATLMQAVMDIGRRAGGIIPMGMTLDFQSAADGQSDPFMAMIGWAEKAISKAI 300 (512) T ss_pred eEE-----ecC-CCCCHH--------HHHHHHHHHHHHhhCcEEEecCCceEEEeecCCCCHHHHHHHHHHHHHHHHHHH Confidence 222 111 000000 001111110 0000 011234444432 223335555555555554432 Q ss_pred CCCccccccc-c-cccchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhhhcccCCcccCccceEEEeCCCc Q lcl|NC_019418. 371 GVSSGMFTFD-G-QGVKTATEIVSENSDTYQMRNSIVALVEQSIK-ELCVSMCELGKVVGIYRGTIPELDDISVNLDDGV 447 (527) Q Consensus 371 g~s~~~~~~~-~-~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~-~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i 447 (527) ++ +|++.+ + +|.....++ -+.-...-+..-.+.+...|. +|++.++.+ ++ +...+....+.+.|+..= T Consensus 301 -LG-qtlTs~~g~~Gs~a~~~v--h~ev~~di~~aDa~~i~~tln~~li~~l~~~----N~-~~~~~~~~~p~~~f~~~e 371 (512) T protein:vir:19 301 -LG-GTLTTEAGDKGARSLGEV--HDEVRREIRNADVGQLARSINRDLIYPLLAL----NS-DSTIDINRLPGIVFDTSE 371 (512) T ss_pred -hh-hhhcccccccchhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CC-CCCCCccccceEEecCCC Confidence 11 232211 1 221112222 122233344455666778884 688887765 22 222222334667888777 Q ss_pred cCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 448 FTDRHAELDYWMKMVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 448 ~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) ++|..+.++.+.+++ .|+--.+.++.+.+|+++.+-.+.+...+.... ............+..+.++..+...+..+. T Consensus 372 ~eDl~~~a~~~~~l~-~G~~i~~~~i~e~~Gip~~~~~e~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 449 (512) T protein:vir:19 372 AGDITALSDAIPKLA-AGMRIPVSWIQEKLHIPQPVGDEAVFTIQPVVP-DNGSQKEAALSAEDIPQEDDIDRMGVSPED 449 (512) T ss_pred hhhHHHHHHHHHHHh-cCCCCCHHHHHHHhCCCCCCCccccccCCCccc-cccccccccccccCCCchhhHhHHhhhHHH Confidence 888888887777776 688556777888889864322222211111111 110000000011111111111111111111 No 249 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=88.72 E-value=0.031 Score=28.89 Aligned_cols=358 Identities=15% Similarity=0.088 Sum_probs=116.2 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) ||+|.+++.|-+... ....+....... ..++..... .| .--..|| T Consensus 1 Mg~f~~~~~~~~~~~-~~~~~~~~~~~~-~~~~~~~~~---------v~------------------------~~i~~Ia 45 (378) T protein:vir:16 1 MNLFGKVVSFSRGKL-NNDTQRVTAWQN-EAVEYTSAF---------VT------------------------NIHNKIA 45 (378) T ss_pred Cccchhhhhhhcccc-cCCcceeeeccc-chhhHHHHH---------HH------------------------HHHHHHH Confidence 999999998655321 111111111100 001100000 00 0011233 Q ss_pred HHHhhhhhccc----------ceEeeCCHHHHHHHHH-HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcC Q lcl|NC_019418. 81 KKIASLVYNEQ----------AEISAEDETLNDFLSD-MLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQA 149 (527) Q Consensus 81 ~~~A~ll~~e~----------~~i~~~d~~~~~~l~~-~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a 149 (527) +..|++=|.-. ......+..+...|+. --..-....-....+...+..|.+++.+.+|+...++-+. T Consensus 46 ~~iA~l~~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~g~~~~l-- 123 (378) T protein:vir:16 46 NEITKVEFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDL-- 123 (378) T ss_pred hhhhhCceeEEEEcccccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEE-- Confidence 33332211000 0000112222222211 0000112222333455666788888877776532222111 Q ss_pred CceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecc Q lcl|NC_019418. 150 PVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLS 229 (527) Q Consensus 150 ~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~ 229 (527) +| ....+ .+. ...++ | |.+-++-. ..+ -++. T Consensus 124 ---~~---~~~~~--------~~~---~~dii------h--------------------~r~~~~~~---~~~---s~l~ 154 (378) T protein:vir:16 124 ---LF---ADDKK--------EYK---PEELV------R--------------------LTSPFYIN---EDT---SILD 154 (378) T ss_pred ---Ee---cCCee--------Eec---ccceE------E--------------------ecCccCcc---chh---HHHH Confidence 11 11100 000 00111 1 00000000 000 0111 Q ss_pred cccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCC Q lcl|NC_019418. 230 ELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLK 309 (527) Q Consensus 230 ~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~ 309 (527) .....+.... ..| .|- ++++.+. .++. +..+.+.+.+...|......-. ..++.+ T Consensus 155 ~~~~~i~~~~-~~~--~~~-g~l~~~~----------~l~~-~~~~~~~~~~~~~~~~~~~~~~--~g~~~v-------- 209 (378) T protein:vir:16 155 NALASIQTKL-EQG--KLR-GLLKINA----------FLDI-DNTQEYREKALTTIKNMQEGSS--YNGLTP-------- 209 (378) T ss_pred HHHHHHHHHH-hcC--ccc-eeeEeCC----------cCCH-HHHHHHHHHHHHHHHHhhcccc--cccceE-------- Confidence 0000000000 001 010 1121110 0000 0111122222222211110000 000111 Q ss_pred CCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHH Q lcl|NC_019418. 310 VQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATE 389 (527) Q Consensus 310 ~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAte 389 (527) . +++..++.++.+-.+.+. ..++.+..+|+...|++|..++. |..| T Consensus 210 -------------------------l--~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~g------~~~e 255 (378) T protein:vir:16 210 -------------------------V--DNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLG------TASQ 255 (378) T ss_pred -------------------------c--CCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcC------CchH Confidence 0 011123333333233332 34555667899999999987731 1112 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCC-------cccCccceEEEeCCCccCCHHHHHHHHHHHH Q lcl|NC_019418. 390 IVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRG-------TIPELDDISVNLDDGVFTDRHAELDYWMKMV 462 (527) Q Consensus 390 i~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~-------~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~ 462 (527) .. .. ..+..+|..++..|-.-.+. .|+.. .......+.++++.-...|..+.++...+++ T Consensus 256 ~~-~~-----------~f~~~tl~P~~~~ie~~l~~-kLl~~~e~~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~ 322 (378) T protein:vir:16 256 EQ-QI-----------YFYNSTIIPLLIQLEKELTY-KLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENI 322 (378) T ss_pred HH-HH-----------HHHHHHHHHHHHHHHHHHHh-hcCChhhhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHH Confidence 11 11 12233333333333221110 01100 0011233566777778888899999999999 Q ss_pred hcCCCCHHHHHHhcCCCCHH-HHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 463 AAGFATQKRGIAKTLGITEE-EAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 463 ~aGi~s~~~~i~~~~~~~de-ea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) .+|+|++-+++++. |+..- ...+-+. .....+-... .+......++..+||+.. T Consensus 323 ~~G~~T~NE~R~~~-g~~p~~ggD~~~~--~~n~~~~~~~--------~~~~~~~~~~~~~~e~~n 377 (378) T protein:vir:16 323 NGPIFTQNQLLVKM-GEQPIEGGDVYIA--NLNAVAVKNL--------SDLQGSRKDVTSTDETNN 377 (378) T ss_pred hCCCcCHHHHHHHh-CCCCCCCCCeEee--ccccccccch--------hhhcCccCCCCCCCCCCC Confidence 99999999977654 54321 0100000 0000000000 000011111111111111 No 250 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=88.26 E-value=0.033 Score=28.68 Aligned_cols=418 Identities=13% Similarity=0.074 Sum_probs=162.8 Q ss_pred CChHHHHHHHHH-HHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFN-RGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTA 79 (527) Q Consensus 1 m~~~~~~k~~~~-~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i 79 (527) |+|+ .-..|+- .+...| +-.+++... +-=++... T Consensus 102 l~~~-~~~~F~Gy~~la~l--------------aQ~~eyr~~------------------------------~~~ia~e~ 136 (698) T protein:vir:10 102 LSFV-TSSGFPGFPTLVLL--------------AQLPEYRAM------------------------------HEVLADEC 136 (698) T ss_pred chhh-hccCcchHHHHHHH--------------hhccchhhH------------------------------HHHHHHHh Confidence 3221 0111111 000000 000011111 11123333 Q ss_pred HHHHhhhhhcccce-----Ee-------eCCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEE Q lcl|NC_019418. 80 AKKIASLVYNEQAE-----IS-------AEDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVAFI 147 (527) Q Consensus 80 ~~~~A~ll~~e~~~-----i~-------~~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v 147 (527) +++|-..+.++... ++ ..+...-+.|+.-+++-+.+..++++++.+-.+|++++.+-++++.... T Consensus 137 ~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eai~~aRlfGGa~~~i~I~gdd~~l--- 213 (698) T protein:vir:10 137 IRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIM--- 213 (698) T ss_pred hcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEEeecCcccc--- Confidence 33332222221111 11 1233455677777788889999999999999999999777776532100 Q ss_pred cCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeeccccccccee-----eecCCceEEEEEEEecCCcccc Q lcl|NC_019418. 148 QAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGS-----TKDKSLYRITNELYKSTSDSQL 222 (527) Q Consensus 148 ~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~-----~~~~~~~~I~n~ly~~~~~~~l 222 (527) + -|+.-+...+.. ..-+..+.++++- .......... -...+.|.| . T Consensus 214 ~----~PL~~~~~~I~k------------GslKGL~ViDp~~-vtP~~~n~~dP~spdfgkP~~y~V------------~ 264 (698) T protein:vir:10 214 D----TPLVPRPYTVPK------------GSFQGLRVVEPYW-VTPNNYNSINPVADDFYKPSTWWM------------I 264 (698) T ss_pred c----cccccccccccC------------ccceeeeeecccc-cccchhhhccchhhccCCCceEEE------------e Confidence 0 011000000000 0001112222220 0000000000 000111111 1 Q ss_pred CceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeec Q lcl|NC_019418. 223 GERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVP 302 (527) Q Consensus 223 G~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~ 302 (527) |+.|.-+.+ ..+.| +|+--.+|+. ..-+|+|....+.+-+++.+++-.....-+..-....+ - T Consensus 265 G~~IH~SRL-------~~~vg--~pvpd~LKp~-------y~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~~~~l-~ 327 (698) T protein:vir:10 265 GSEVHATRL-------HTIVS--RPVGDMLKPT-------YSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGI-L 327 (698) T ss_pred cceecceeE-------EEecC--CCchhhhcch-------hccCCccHHHHHHHHHHHHHHHhhhHHHHHHHhhHHHH-H Confidence 222211111 11111 1111112221 23469999999999999998776655543321111111 1 Q ss_pred hhHhcCCCCCCCcccccccccc---cccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcc-ccc Q lcl|NC_019418. 303 EQMTQLKVQDNQGNIAFKRRFD---VEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSG-MFT 378 (527) Q Consensus 303 ~~~l~~~~~~~~~~~~~~~~~d---~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~-~~~ 378 (527) .+|.....++ +......++. .-+..+..+-++ ++...+++++ ....-.-+.+..+..+++-.++++.. -|| T Consensus 328 ~dla~aL~~g--~~~~l~~R~eli~~~Rsn~G~~llD-k~~Eefeq~s--t~lSGLddVi~qf~q~VAgaa~IPltkLfG 402 (698) T protein:vir:10 328 MDLAQALTPG--ANVDLSMRAELINRYRDNRNILFLD-KATEEFFQFN--TPLSGLDALQAQAQEQMSAVSHIPLIKLLG 402 (698) T ss_pred HHHHHhcCCh--hhHHHHHHHHHHHHhcCccceEEEe-cCCcceEEEe--cCcCCHHHHHHHHHHHHHhhhcCchhhhhc Confidence 2222211222 2111111111 112222111122 2222344443 33444555566666667777787754 367 Q ss_pred cccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHH- Q lcl|NC_019418. 379 FDGQGV-KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELD- 456 (527) Q Consensus 379 ~~~~g~-~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~- 456 (527) ...+|. +|+..=...+.+...... +..++.+|+.|+.+|..- .++.. ..++++.|+.--..++.+.++ T Consensus 403 qSPkGlNATGE~D~rnYYD~I~s~Q--e~~L~p~L~rl~~ii~rS-----~~G~i---dp~i~~~fnPL~qmtd~EkAeI 472 (698) T protein:vir:10 403 ITPTGLNASSEGEIRVWYDYVRAYQ--RNALQQLMNDVIVMIQLS-----LFGAV---DPSIKWQWNALRELDDLEVAEA 472 (698) T ss_pred cCCcccCccchhhHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH-----hcCCC---CCcceEEeCCCCCcCHHHHHHH Confidence 777775 666632233333333322 456788888877766432 12222 235888998655555544333 Q ss_pred ------HHHHHHhcCCCCHHHHHHhcC--------CC----------CHHHHHHHHHHHHHhcc-----cccccccCCCC Q lcl|NC_019418. 457 ------YWMKMVAAGFATQKRGIAKTL--------GI----------TEEEAEKELAEINGELP-----PESDAELALYG 507 (527) Q Consensus 457 ------~~~~~~~aGi~s~~~~i~~~~--------~~----------~deea~~el~ri~~E~~-----~~~~~~~~~~~ 507 (527) ++..++..|+++..+...++- +. .|++++.++...+.-.+ ........-.+ T Consensus 473 ~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~d~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 552 (698) T protein:vir:10 473 RYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGAPADDDIDGVLTYVQRMAEGGDTGAPTAPGGARAG 552 (698) T ss_pred HhhhhHHHHHHHHhcCCCHHHHHHHHhccCCCccccccCCcccCCCCCCCcchHHHhhhcCCcCCCCcccccccccccCC Confidence 334445667888777655531 11 11122233222211110 00001111111 Q ss_pred CCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 508 KGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 508 ~~~~~~~~~~~~~~~~~~~~ 527 (527) ...+....+..-|...-+-+ T Consensus 553 ~~~~~~~~~~~~~~~~~~~~ 572 (698) T protein:vir:10 553 ATAPPAAANVNANANPREAG 572 (698) T ss_pred CCCCcccccccCCCCccccC Confidence 12222211111111111111 No 251 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=88.07 E-value=0.035 Score=28.59 Aligned_cols=345 Identities=14% Similarity=0.096 Sum_probs=116.6 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) ||+|.+++.|-++-... ..+..... ..-+ +..+. ..-..++ T Consensus 1 Mg~f~~~~~~~~~~~~~-~~~~~~~~-----~~~~------~~~~~---------------------------~~v~~~v 41 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNN-DTQRVTAW-----QNEA------VEYTS---------------------------AFVTNIH 41 (378) T ss_pred CCccccchhcccccccC-Ccceeeee-----ccch------hHHHH---------------------------HHHHHHH Confidence 99999999876531100 01110000 0000 00000 0111233 Q ss_pred HHHhhhhhcccceE--------------eeCCHHHHHHHHH-HHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEE Q lcl|NC_019418. 81 KKIASLVYNEQAEI--------------SAEDETLNDFLSD-MLSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVA 145 (527) Q Consensus 81 ~~~A~ll~~e~~~i--------------~~~d~~~~~~l~~-~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~ 145 (527) +.+|+-+-+-|..+ .+.+..+...|+. --..-....-....+..++..|.+++.+.+++...++- T Consensus 42 ~~IA~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~g~~~ 121 (378) T protein:vir:94 42 NKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNTGELL 121 (378) T ss_pred HHHHhhhhhCceeeEEEcccCcccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCCceEE Confidence 33333333323221 0112223333321 00000112223445566677788888766654321111 Q ss_pred EEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCce Q lcl|NC_019418. 146 FIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGER 225 (527) Q Consensus 146 ~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~ 225 (527) +. +| .++++ .+.. .. T Consensus 122 ~l-----~p---~~~~~--------~~~~---~d---------------------------------------------- 136 (378) T protein:vir:94 122 DL-----LF---ADDKK--------EYKP---EE---------------------------------------------- 136 (378) T ss_pred EE-----Ee---cCCee--------Eeee---ee---------------------------------------------- Confidence 10 11 11110 0000 00 Q ss_pred eecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhH Q lcl|NC_019418. 226 VNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQM 305 (527) Q Consensus 226 v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~ 305 (527) ..||+.|. + .--|.|.++.+.. +++.. +..+..+ .+ T Consensus 137 -----------------------iiH~~~~~-~-----~~~g~s~l~~~~~---~i~~~-------~~~~~~~-----gi 172 (378) T protein:vir:94 137 -----------------------LVRLTSPF-Y-----INEDTSILDNALA---SIQTK-------LEQGKLR-----GL 172 (378) T ss_pred -----------------------eEEecCcC-C-----ccchhHHHHHHHH---HHHHH-------Hhccccc-----ce Confidence 01111110 0 0002222222221 11111 1111111 11 Q ss_pred hcCCCCCCCcc-cccccccccccceeeec----c----CCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccc Q lcl|NC_019418. 306 TQLKVQDNQGN-IAFKRRFDVEQNVYMQV----G----AGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGM 376 (527) Q Consensus 306 l~~~~~~~~~~-~~~~~~~d~~~~~~~~~----~----~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~ 376 (527) +.......... -.....|. .-|... + +--+++..++.++.+..+.+. ...+...++|+...|++|.. T Consensus 173 l~~~~~l~~~~~~~~~~~~~---~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVP~~~ 248 (378) T protein:vir:94 173 LKINAFLDIDNTQEYREKAL---TTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENI 248 (378) T ss_pred eeeCCcCCHHHHHHHHHHHH---HHHHHhhcccccccceecCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHH Confidence 11100000000 00000000 000000 0 000011123333333333443 44555667899999999987 Q ss_pred ccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCC------c-ccCccceEEEeCCCccC Q lcl|NC_019418. 377 FTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRG------T-IPELDDISVNLDDGVFT 449 (527) Q Consensus 377 ~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~------~-~~~~~~v~v~f~d~i~~ 449 (527) +.. |..|... +..+..+|..++..|..-... .|+.. . .....++.++++.-... T Consensus 249 l~~------~~se~~~------------~~f~~~tL~P~~~~ie~~l~~-~Ll~~~er~~g~~~~~~~~~~f~~~~l~~~ 309 (378) T protein:vir:94 249 LLG------TASQEQQ------------IYFYNSTIIPLLIQLEKELTY-KLISTNRRRVVKGNLYYERIIVDNQLFKFA 309 (378) T ss_pred hcC------ChHHHHH------------HHHHHHHHHHHHHHHHHHHHh-hcCChhHhhhhhhcccccceeecchhhhhc Confidence 731 1112111 112333444444333321110 11100 0 01112355666777788 Q ss_pred CHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 450 DRHAELDYWMKMVAAGFATQKRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 450 d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) |..+.++...+++.+|+|++-++++.. |+..-+ ..+-+ +.....+-... .+......++++++|+.. T Consensus 310 d~~~~~~~~~~~~~~G~~T~NE~R~~~-gl~p~~gGD~~~--~~~n~~~~~~~--------~~~~~~~~~~~~~~e~~n 377 (378) T protein:vir:94 310 TLKELIDLYHENINGPIFTQNQLLVKM-GEQPIEGGDVYI--ANLNAVAVKNL--------SDLQGSRKDVTSTDETNN 377 (378) T ss_pred CHHHHHHHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCeee--ecccccccccc--------hhhcCCcCCCCCCCCCCC Confidence 889999999999999999999977653 543211 11000 00000000000 000000111111111111 No 252 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=87.58 E-value=0.038 Score=28.38 Aligned_cols=422 Identities=12% Similarity=0.063 Sum_probs=162.7 Q ss_pred CChH-HHHHHHHHHHH-HHh----hcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecc Q lcl|NC_019418. 1 MSLI-QKVKDFFNRGR-YNM----TTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLP 74 (527) Q Consensus 1 m~~~-~~~k~~~~~~~-~~~----~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~ln 74 (527) |+-| +.-.+-+++-. .+- ....-.....|+--.+++..+.+|-+-. -.|+... +....-..-. ...- T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a--~~gd~~~--~~~L~~~m~e---~D~~ 73 (528) T protein:vir:10 1 MAAIVDIYGNPLRTQQLRKQQTAHLAGLAKEFANHPAKGLTPAKLAHILIEA--EQGHLQA--QAELFMDMEE---RDAH 73 (528) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCCCCCCHHHHHHHHHhh--hCCCHHH--HHHHHHHHHh---hChH Confidence 2211 11111111100 000 0000001112333344444433333211 0111100 0000000000 0001 Q ss_pred hHHHHHHHHhhhhhcccceEeeC------CHHHHHHHHHHHhhh-hHHHHHHHHHHHHHhcCCEEEEEEEeC--Cee--- Q lcl|NC_019418. 75 IARTAAKKIASLVYNEQAEISAE------DETLNDFLSDMLSND-RFNKNFERYLESALALGGLAMRPYVDG--DKI--- 142 (527) Q Consensus 75 l~~~i~~~~A~ll~~e~~~i~~~------d~~~~~~l~~~l~~n-~f~~~~~~~~~~a~~~G~~~~~~~~d~--~~~--- 142 (527) +...+ .+--.-|++.+-+|... +....+++++++.+- .|...+..+ .+|..+|=+++-+.|.. +.. T Consensus 74 i~s~l-~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~i~~~-lda~~~G~s~~Ei~w~~~~g~~~~~ 151 (528) T protein:vir:10 74 LFAEM-SKRKRAVLGLDWTIEPPRNASAAEKADAEYLHELLLDLEGIEDLMLDC-MDGVGHGYSAIELDWSLQGREWLPQ 151 (528) T ss_pred HHHHH-HHHHHHHhcCCceEecCCCCCHHHHHHHHHHHHHHhCCccHHHHHHHH-HhhhhhcceeEEEEEeecCCceeEE Confidence 22222 22233455555555432 234567788888763 466655544 56888999999887753 221 Q ss_pred EEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCcccc Q lcl|NC_019418. 143 RVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQL 222 (527) Q Consensus 143 ~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~l 222 (527) ++.++++..|.. +. .+ ..++ +..+...- T Consensus 152 ~~~~r~~~~f~~---~~-----------------~~---------------------------~~~l-----~~~~~~~~ 179 (528) T protein:vir:10 152 AFDHRPQSWFQL---NP-----------------DD---------------------------QDEL-----RLRDNSIA 179 (528) T ss_pred Eeeeecccceee---cc-----------------CC---------------------------CcEE-----eccCCCCC Confidence 222333321110 00 00 0000 00001111 Q ss_pred CceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHc-Ccceeee Q lcl|NC_019418. 223 GERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKM-GQRRVIV 301 (527) Q Consensus 223 G~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~-~~~~i~v 301 (527) |.++ ++. -|.++.. ....++|+|.|.+..|.-..--=+..+..|+.=++. |.+-.+. T Consensus 180 g~~l---------~~~---------k~iv~~~----~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~ig 237 (528) T protein:vir:10 180 GEVL---------QPF---------GWIMHKP----RSRSGYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLPIRLG 237 (528) T ss_pred ceee---------cCC---------CeEEEee----cCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEE Confidence 2111 111 0222211 112356788888888876665555555555543443 4332222 Q ss_pred chhHhcCCCCCCCcccccccccccccceeeeccCCC----CCCCcceEeccc-cChHHHHHHHHHHHHHHHHhcCCCccc Q lcl|NC_019418. 302 PEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGN----MDSGGIVDLTTP-IRSSDYISAISEGLKLFEMQIGVSSGM 376 (527) Q Consensus 302 ~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~----~~~~~i~~~~~~-ir~e~~~~~~~~~l~~i~~~~g~s~~~ 376 (527) +.+ .+...+ .... ..+.+ ..+..+. .....|+.++.. ...+.|...++.+-++|+... ++ +| T Consensus 238 -----ky~-~~a~~~--ek~~--L~~al-~~i~~~~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i-LG-qt 304 (528) T protein:vir:10 238 -----KYP-PGTPDE--EKVT--LLRAV-TGLGHAAAGIIPESMSIDFQEASKGSAEPFMAMMRWCDDSMSKAI-LG-GT 304 (528) T ss_pred -----ecC-CCCCHH--HHHH--HHHHH-HHHhhCcEEEecCCceeEEeecCCCChhHHHHHHHHHHHHHHHHH-hh-hh Confidence 111 000000 0000 00111 0010000 011235555432 333446666665555554433 23 34 Q ss_pred ccc-cc---cccchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCH Q lcl|NC_019418. 377 FTF-DG---QGVKTATEIVSENSDTYQMRNSIVALVEQSIK-ELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDR 451 (527) Q Consensus 377 ~~~-~~---~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~-~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~ 451 (527) ++- .+ +|...+.++ -+.-....+..-.+.+...|. +|++.++.+. + ++..+...-+.+.|+..-++|. T Consensus 305 lTs~~~~g~~gS~Alg~v--h~~v~~di~~aDa~~i~~tln~~li~~l~~~N----~-~~~~~~~~~p~~~~~~~e~eDl 377 (528) T protein:vir:10 305 LTSQTSESGGGAYALGQV--HNEVRHDLLAADARQLAATLSRDLLWPLLVLN----R-SGNLDARRAPRLVFDLKDRADL 377 (528) T ss_pred hhccccccccchhhhHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC----C-CCCCCccccceEEecCCCcccH Confidence 422 11 121111222 112222334445566778885 6888887652 2 2222233345678888888888 Q ss_pred HHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccc---cC Q lcl|NC_019418. 452 HAELDYWMKMVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDED---EA 527 (527) Q Consensus 452 ~~~~~~~~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~ 527 (527) .+.++.+.+++..|+--.+.++.+.||++..+-.+++.. .+..+........++...............+.+ +. T Consensus 378 ~~~a~~~~~L~~~G~~i~~~~i~e~~gip~p~~~e~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 454 (528) T protein:vir:10 378 AAMATSLPPLVKLGVQVPVNWVQEQLGIPLPANGEAVLG--DQAGAGIAQLSRRPGPRIAALAQVIGPRYRDQEALDQV 454 (528) T ss_pred HHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCCccccc--CCCcccccccCcccccccccccccccccccccchHHHH Confidence 888999999999998445667888889865432222211 111111000000000000000000011111111 11 No 253 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=83.47 E-value=0.068 Score=26.98 Aligned_cols=363 Identities=12% Similarity=0.082 Sum_probs=130.8 Q ss_pred HHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHHHH Q lcl|NC_019418. 4 IQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKI 83 (527) Q Consensus 4 ~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~ 83 (527) |....++|+ +.. .+....+.+.... .... ......--..+++.+ T Consensus 1 Mg~f~~~f~--------~~~-----~~~~~~~~~~~~~-------------------~~~~----~a~~~~~v~~~i~~i 44 (385) T protein:vir:95 1 MGLFDSVFK--------RHS-----ELSWMYDLEFLQD-------------------KSKK----AYLKQIALNTVVEMV 44 (385) T ss_pred Cchhhhhhc--------cCc-----ccccccchhhhhc-------------------cchh----hhhhhHHHHHHHHHH Confidence 232222222 110 0111111110000 0000 001111112444444 Q ss_pred hhhhhcccceEeeCCHHHHHHHHHHHhh--h---hHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcCCceEEEEEc Q lcl|NC_019418. 84 ASLVYNEQAEISAEDETLNDFLSDMLSN--D---RFNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQAPVFLPLQSN 158 (527) Q Consensus 84 A~ll~~e~~~i~~~d~~~~~~l~~~l~~--n---~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a~~~~P~~~d 158 (527) |+-+.+-|..+--.+.....-+..+|.. | .....++.++.+.+-.|.+++.+..+++.+ .+..+.+. T Consensus 45 a~~ia~~p~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~~~~~~-----~~~~~~~~--- 116 (385) T protein:vir:95 45 ARTISQSEFRVMKNNTKEKGTLYYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVKNDEGHFF-----VADDFEKE--- 116 (385) T ss_pred HHHHcccceeeeecCccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEEecCCCee-----eccccccc--- Confidence 4444443433321222222223333321 1 112233445566666777776543333321 11111110 Q ss_pred CCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccc Q lcl|NC_019418. 159 TQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPV 238 (527) Q Consensus 159 ~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~ 238 (527) .. ... ....||. .-. ..+.+. +.| +- . T Consensus 117 -~~----~~~--------~~~~~~~-~~~-----------------~~~~~~-~~~------------~~--------~- 143 (385) T protein:vir:95 117 -DE----LGL--------YSHRFTN-VLV-----------------NDFEFK-RVF------------TM--------D- 143 (385) T ss_pred -cc----ccc--------cccccee-eee-----------------ccccee-eee------------cc--------c- Confidence 00 000 0000110 000 001110 000 00 0 Q ss_pred eeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeee--chhHhcCCCCCCCcc Q lcl|NC_019418. 239 TPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIV--PEQMTQLKVQDNQGN 316 (527) Q Consensus 239 ~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v--~~~~l~~~~~~~~~~ 316 (527) -..|++.+..+... +|.|.+.-+...+. ..++... ..+..+-++ +.... .+.+.. T Consensus 144 ---------eiih~~~~~~~~~~----~G~s~~~~~~~~i~---~~~~~~~---~~~~~~g~l~~~~~~~---~~~e~~- 200 (385) T protein:vir:95 144 ---------DVIYLKYNNQKLDA----FSLGLFEDYGEIFG---RMIDLQM---LNNQIRGILKVDATKF---YNKEKQ- 200 (385) T ss_pred ---------cEEEecCCCCCccc----ccchHHHHHHHHHH---HHHHHHH---hcCCCceEEEeCCccC---CCHHHH- Confidence 12345544444333 36676666655443 2333222 223333222 11100 000000 Q ss_pred cccccccccccceeeeccCC------CCCCCcceEecc------ccChHHHHHHHHHHHHHHHHhcCCCccccccccccc Q lcl|NC_019418. 317 IAFKRRFDVEQNVYMQVGAG------NMDSGGIVDLTT------PIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGV 384 (527) Q Consensus 317 ~~~~~~~d~~~~~~~~~~~~------~~~~~~i~~~~~------~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~ 384 (527) -.....|. ..|.+.... -.+...++.++. ...+.++.+..+...++|+...|++|..++.. - T Consensus 201 ~~~~~~~~---~~~~g~~~~~~~i~~l~~g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~---~ 274 (385) T protein:vir:95 201 KELQAYID---TLFDAFQNNTIAVVPLTEGLAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVLGE---M 274 (385) T ss_pred HHHHHHHH---HHhhhhhhcCCceEEcCCCceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhcCC---C Confidence 00001111 111211000 011112333332 12256788888888889999999999988521 1 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhc Q lcl|NC_019418. 385 KTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAA 464 (527) Q Consensus 385 ~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~a 464 (527) .++.+. ....++.+|..++..|....+. .+..........+.++++.-+..|.++.++...+++.+ T Consensus 275 sn~e~~-------------~~~~~~~~l~P~~~~ie~~l~~-~L~~~~~~~~~~~~fd~~~l~~~D~~~~~~~~~~~~~~ 340 (385) T protein:vir:95 275 ADLEKT-------------IESYLQFCINPLLRKIEAELNS-KFFYQDEYLNDDMHIKVVGIDKRDPLKLSEAIDKLVAS 340 (385) T ss_pred cCHHHH-------------HHHHHHHHHHHHHHHHHHHHHh-hcCChhhcccceEEEechhhhccCHHHHHHHHHHHHhC Confidence 222221 1122333444444433322211 11111111223467777777888989999999999999 Q ss_pred CCCCHHHHHHhcCCCCH---HHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCc Q lcl|NC_019418. 465 GFATQKRGIAKTLGITE---EEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDD 523 (527) Q Consensus 465 Gi~s~~~~i~~~~~~~d---eea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 523 (527) |+|++-+++... |+.. +...+-+ .+.+. ..-+....+++.++ T Consensus 341 g~lt~NE~R~~~-g~~p~~~~~gd~~~--------------~~~n~--~~~~~~kgge~~~e 385 (385) T protein:vir:95 341 GTFTRNQVRIMT-GEEPADDPELDKFI--------------ITKNL--QSADAFKGGESNEE 385 (385) T ss_pred CCcCHHHHHHHh-CCCCCCCCCCceee--------------ecccc--eecccccCCCCCCC Confidence 999999976544 5532 1111100 00000 00000011111111 No 254 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=80.50 E-value=0.094 Score=26.21 Aligned_cols=422 Identities=11% Similarity=0.099 Sum_probs=168.3 Q ss_pred CChHHHHHHHHH---HHHHHhhcccchhhhccCcc------ccCHHHHHHHHHHHHHhcCCCcccccccccCccccCcee Q lcl|NC_019418. 1 MSLIQKVKDFFN---RGRYNMTTSHLSSILDHPKV------AVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQ 71 (527) Q Consensus 1 m~~~~~~k~~~~---~~~~~~~~~~~~~~~~~~~i------~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~ 71 (527) =.+.+++-.|-. ..|..+..+++...+...-+ ..-.....-|++++.+...+|. + T Consensus 17 ~~~~~~~~~~~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~gg~~~n~~eLI~~YR~ma~~~pE-V--------------- 80 (533) T protein:vir:58 17 TNFLSPMYGMGAPHGAGGSSMIPINMYHPFATAGYASRFYGGIEFNRFFLYDMYDRMDYTDPL-I--------------- 80 (533) T ss_pred HHhhchhhcccCccCCCCCccccCCCCcchhhhhhhhhhhccccccHHHHHHHHHHhhccCcc-h--------------- Confidence 222222222210 01110000000000000000 0000112234444444321111 1 Q ss_pred ecchHHHHHHHHhhh-----hhcccceEeeCCH----HHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC--C Q lcl|NC_019418. 72 HLPIARTAAKKIASL-----VYNEQAEISAEDE----TLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG--D 140 (527) Q Consensus 72 ~lnl~~~i~~~~A~l-----l~~e~~~i~~~d~----~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~--~ 140 (527) ...++..++- -...|+++.+++. ...+++..++ +|.++..+.+....+.|..+++.-.++ . T Consensus 81 -----d~AideIvneaiv~d~~~~pV~v~l~~~e~s~~iK~kI~~ll---df~~~~~~~fR~WYVDGriy~Hkiik~~k~ 152 (533) T protein:vir:58 81 -----STVLDIIADECTIPNENGNIVDVVTKDIELAKAILSYLDYVI---NIEKNAYPIIRNMIKYGDMFLHILEKGSDG 152 (533) T ss_pred -----hhHHHhhhceeeEecCCCceeEeecccccccHHHHHHHHHHh---cchhhhhHHHHhhhhcceeEEEeccCCccc Confidence 1112222221 1223455555443 3345555555 599999999999999999999986543 3 Q ss_pred ee-EEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCc Q lcl|NC_019418. 141 KI-RVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSD 219 (527) Q Consensus 141 ~~-~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~ 219 (527) +| .+.+++|-++=++. ...++ ..||. ++ +. |.+... T Consensus 153 GI~elr~lDPr~i~~vr--------------~~~t~---~eyyv---y~-----------------~~------~~~~~s 189 (533) T protein:vir:58 153 TIEKFQVVSPYIFSKRY--------------NPETD---TWYYV---IT-----------------DV------YRNVVS 189 (533) T ss_pred chhhheecCCeeeEEEE--------------eeccc---eEEEe---ec-----------------cc------cccccc Confidence 44 67888887765542 11111 12231 00 00 000000 Q ss_pred cccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHH--HHHHcCcc Q lcl|NC_019418. 220 SQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFM--WEIKMGQR 297 (527) Q Consensus 220 ~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~--~e~~~~~~ 297 (527) ...+..+| + ..++ |+--- ..+..++.|+|-+..|.-.+..|=..-+.++ |-.|.-.+ T Consensus 190 ~~~~~kI~-----~---daI~----------y~~SG---l~d~~~~~iisyLhkAiKp~NQLkmiEDAlVIYRisRAPeR 248 (533) T protein:vir:58 190 GYFNEDIP-----E---EDVI----------HFSHK---IDTNFFPYGRSYLESARAIWNQLRLMEDALMLYRVVRSVDR 248 (533) T ss_pred Cccccccc-----h---hhee----------eeeec---cccCCCCceehhhhHHHHHHHHHHHHHHHHHHHhhcCChhh Confidence 11111111 0 1111 11110 0122466788889888777777766555554 23344445 Q ss_pred eeee-ch---------hHh---------cCCCCCCCcccccccccccc----cceeeeccCCCCCCCcceEeccccChHH Q lcl|NC_019418. 298 RVIV-PE---------QMT---------QLKVQDNQGNIAFKRRFDVE----QNVYMQVGAGNMDSGGIVDLTTPIRSSD 354 (527) Q Consensus 298 ~i~v-~~---------~~l---------~~~~~~~~~~~~~~~~~d~~----~~~~~~~~~~~~~~~~i~~~~~~ir~e~ 354 (527) |||- .- .++ +..-+..+|++.-.+.+-.. ...+.+-. +.+...-|+++... ... T Consensus 249 RvFYIDVGNlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRR-eGgrgTEI~TLpGg-~lg- 325 (533) T protein:vir:58 249 RVFYVDVGNVPPDKINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRR-GDRRAVEIDILQGS-KVD- 325 (533) T ss_pred eEEEEeecCCCccCHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhhhccccc-CCCccceeeecCCC-CCC- Confidence 6653 10 111 00112334433211111000 00011100 11111236666543 333 Q ss_pred HHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCccc Q lcl|NC_019418. 355 YISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIP 434 (527) Q Consensus 355 ~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~ 434 (527) -+..+..+.+.+....+++..-++.+++. --++||.-..-.....+.+.+..|. +++...|.| .+. . T Consensus 326 emeDV~YF~kkLy~ALnVP~sRl~~e~~f-gr~~eItRDEiKF~KFI~rLR~rF~----~ll~~qLil------k~i--i 392 (533) T protein:vir:58 326 LAEDVEYMLNRLISALKVPKAFIGYEGDV-NAKNTLATQDIKFNNTIKRIQGFFV----EELERMVRM------NKE--F 392 (533) T ss_pred cHHHHHHHHHHHHHHhCCCeeecCCCCCC-ccchhhhHHHHHHHHHHHHHHHHHH----HHHhccccc------ccC--c Confidence 34677777888888888888777665442 1255664444444444555554444 444433322 111 2 Q ss_pred CccceEEEeCCCccCCHHHHHHHHHH---H--HhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccC----C Q lcl|NC_019418. 435 ELDDISVNLDDGVFTDRHAELDYWMK---M--VAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELA----L 505 (527) Q Consensus 435 ~~~~v~v~f~d~i~~d~~~~~~~~~~---~--~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~----~ 505 (527) ...++.++|...=-..+..+.+.+.. + ...+.+++..+.++....||| .+++.+.|++|....--..++ + T Consensus 393 t~eew~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~dpyvgk~yi~k~ILr~tde-i~~q~e~ie~E~~~~~~~~~~~~~e~ 471 (533) T protein:vir:58 393 ADQDFRLVMNRSNSIVEGERFAVIEQRIGIAERLKGWVREDWIYSNILQIPYD-LKPQEEVAEAAGGGGLFDTGGFGEET 471 (533) T ss_pred chhheeeeeeccchHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHhcCChh-hhHHHHHHHHhhcCCCCCCCCccccc Confidence 23334566644322222222222221 1 123667777666667788975 444445577664321100000 0 Q ss_pred CCCCCCCCCCCCCCC-----------------------CCcccc-----------C Q lcl|NC_019418. 506 YGKGQQNTVGNSKDT-----------------------VDDEDE-----------A 527 (527) Q Consensus 506 ~~~~~~~~~~~~~~~-----------------------~~~~~~-----------~ 527 (527) .+....-+.+++.++ +|...+ - T Consensus 472 ~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~g~~~~~~ 527 (533) T protein:vir:58 472 TPADFLGERGSPIESPRGRTEFDFGTEGGEELGGELNLGGAFEEFEEETGGGEEEL 527 (533) T ss_pred CCcccCccccCcccCCCChhhHhcccCCcccccccccccccchhhhhhcCCcccCC Confidence 000000001111111 011111 0 No 255 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=79.00 E-value=0.11 Score=25.87 Aligned_cols=408 Identities=12% Similarity=0.077 Sum_probs=154.5 Q ss_pred hhccCccccCHHHHHHHHHH-----HHHhcC--CCcccccccccCccccCceeecchHHHHH----------HHHhhhhh Q lcl|NC_019418. 26 ILDHPKVAVTQSEFRRIQHN-----LAYYQS--KFDDIEYTNTDGDRKRRKMQHLPIARTAA----------KKIASLVY 88 (527) Q Consensus 26 ~~~~~~i~~~~~~~~~i~~~-----~~~y~g--~~~~l~~~~~~~~~~~~~~~~lnl~~~i~----------~~~A~ll~ 88 (527) -+....+..+.-...++-.. .+.|.- ..+.|. .-..+.+...+- .+....|. T Consensus 1 ~~~~~~~~~p~~~~g~~~~~~~~~~~~~~~~~e~~~~lr-----------~~~~~~ly~~m~e~D~~i~s~l~~rk~av~ 69 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFGSGVVDGWTVWDPFEQTPELQ-----------WPQSVAVYSRMDNEDSRVTSLLEAISLPIR 69 (469) T ss_pred CCCcccCCCCccchhhhhhcccccchhhccccccccccc-----------cccchHHHHHHHhhChHHHHHHHHHHHHHh Confidence 11111222222122222110 111210 111110 001122222221 11223355 Q ss_pred cccceEee--CCHHHHHHHHHHHh-----------------hhhHHHHHHHHHHHHHhcCCEEEEEEEeC------CeeE Q lcl|NC_019418. 89 NEQAEISA--EDETLNDFLSDMLS-----------------NDRFNKNFERYLESALALGGLAMRPYVDG------DKIR 143 (527) Q Consensus 89 ~e~~~i~~--~d~~~~~~l~~~l~-----------------~n~f~~~~~~~~~~a~~~G~~~~~~~~d~------~~~~ 143 (527) +-+-+|.- ++++..+++.+.|. ...|...+.+.+..|+.+|-+++-+.|.. |... T Consensus 70 ~~~w~v~p~~~~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~dG~~~ 149 (469) T protein:vir:10 70 STPWRIRANGASDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPDGRFW 149 (469) T ss_pred cCCceEecCCCCHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCCCcee Confidence 55545542 23333344433332 12356667777777888899999888853 2221 Q ss_pred ---EEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCcc Q lcl|NC_019418. 144 ---VAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDS 220 (527) Q Consensus 144 ---i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~ 220 (527) +.++++..+--..++. +.+-..++. . ...+.+....|.. + T Consensus 150 ~~~l~~rp~~~i~~~~~~~----------------~~~l~~~~~--~----------------~~~~~~~~~~~~~---~ 192 (469) T protein:vir:10 150 LRKLAPRPQWTISKFNVAP----------------DGGLESIEQ--I----------------APPARTRGSLYVA---N 192 (469) T ss_pred eeeeeecCcccceeeeecc----------------CCceeeeee--c----------------CcccccccccccC---C Confidence 1122222111000011 111011100 0 0000000000000 0 Q ss_pred ccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHc-Cccee Q lcl|NC_019418. 221 QLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKM-GQRRV 299 (527) Q Consensus 221 ~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~-~~~~i 299 (527) .-|.+ +++. + |.++.. +...++|+|.|.+..|--..--=+..+..|+.=++. |.+-. T Consensus 193 ~~~~~---------lp~~-------k--~i~~~~----~~~~g~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~ 250 (469) T protein:vir:10 193 IAPPE---------IPVN-------R--LVVYTR----NKRPGQWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIP 250 (469) T ss_pred CCccc---------cccC-------c--EEEEEe----cCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcce Confidence 00111 1111 1 222221 123467999999998877654444444444443332 33322 Q ss_pred eechhHhcCCCCCCCcccccccccccccceeeeccCC------CCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_019418. 300 IVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAG------NMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVS 373 (527) Q Consensus 300 ~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~------~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s 373 (527) +. +.. .+.... ....+ .+....+..+ -.....|+.++.......|...++.+-++|+... ++ T Consensus 251 vg-----ky~-~~a~~~--ek~~l---~~a~~~~~~g~~a~~iip~~~~ie~~ea~g~~~~~~~li~~~d~~Isk~i-LG 318 (469) T protein:vir:10 251 VG-----TAS-SATDED--EVRKM---AALARSVRGGINAGVGLAQGQILELLGVSGNLPDIRRAIEGHDRSIALSG-LA 318 (469) T ss_pred EE-----ecC-CCCCHH--HHHHH---HHHHHHHhcCCceEEEccCCceEEEeecCCCchHHHHHHHHHHHHHHHHH-hc Confidence 11 111 110000 00000 0001111000 0112346666665555667777777666664433 22 Q ss_pred ccccccc-ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCH Q lcl|NC_019418. 374 SGMFTFD-GQGVKTATEIVSENSDTYQMRNSIVALVEQSIK-ELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDR 451 (527) Q Consensus 374 ~~~~~~~-~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~-~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~ 451 (527) ++++.+ .+|.....++ -+.-....++.-.+.+...|. +|++.++.+. + + +...-+.+.|+. +-.+. T Consensus 319 -~tlTs~~~gGS~a~~~v--h~ev~~d~~~sDa~~i~~tln~~li~~l~~lN----~-g---~~~~~P~~~~~~-~e~~~ 386 (469) T protein:vir:10 319 -HFLNLDGKGGSYALASV--LEDPFTQAVHAYATSICRIANQHIIEDLVDIN----F-G---VDTPAPVLTFDP-IGSRQ 386 (469) T ss_pred -ccccccCccchhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc----C-C---CCCCccEEEecC-CCCcH Confidence 222222 2222111222 111222334445566778885 6888777652 2 1 112225677865 34555 Q ss_pred HHHHHHHHHHHhcCCCC----HHHHHHhcCCCCHHHHHHHHHHHHHhc-ccccccccCCCCCCCCC--CCCCCCCCCCcc Q lcl|NC_019418. 452 HAELDYWMKMVAAGFAT----QKRGIAKTLGITEEEAEKELAEINGEL-PPESDAELALYGKGQQN--TVGNSKDTVDDE 524 (527) Q Consensus 452 ~~~~~~~~~~~~aGi~s----~~~~i~~~~~~~deea~~el~ri~~E~-~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~ 524 (527) +..++.+.+++.+|++. .+.++.+.+|+++.+-.+.+..-.+.. .+.....+......++. ...+.....+.- T Consensus 387 ~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 466 (469) T protein:vir:10 387 DLTAAAVKLLYDAGVFDDDPAVKRAIRQRFNLPSELNDTPSAEPEEPAAVPNQSAAPARTRSSGNADARARAPKADQGVL 466 (469) T ss_pred HHHHHHHHHHHhcCCccCccccHHHHHHHhCCCCCCCCcccccchhcccCCCCCccccccCCCCCcccccccCCChHHhh Confidence 67788888999999843 456788888886443222222211111 11111111111111111 111111111111 Q ss_pred ccC Q lcl|NC_019418. 525 DEA 527 (527) Q Consensus 525 ~~~ 527 (527) +|+ T Consensus 467 ~da 469 (469) T protein:vir:10 467 FDA 469 (469) T ss_pred ccC Confidence 111 No 256 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=78.85 E-value=0.11 Score=25.83 Aligned_cols=207 Identities=10% Similarity=0.060 Sum_probs=86.6 Q ss_pred EeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEE Q lcl|NC_019418. 172 IKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTY 251 (527) Q Consensus 172 ~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~ 251 (527) +....++..+|+... .....-|..+.+. +.+ ..| T Consensus 1 ~r~~~dg~~~y~~~~------------------------------~~~~~~g~~~~~~------~~e----------ilH 34 (219) T protein:vir:98 1 MRVCKDGNYKYLMKK------------------------------SLYDTKSEIYEYN------KND----------VIF 34 (219) T ss_pred CceeecCeEEEEEec------------------------------ceecCCceeEEec------ccc----------EEE Confidence 222233333332110 0000011111110 011 235 Q ss_pred ecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHH-HcCcceeeechhHhcCC-CCCCC-cccccccccccc-- Q lcl|NC_019418. 252 LKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEI-KMGQRRVIVPEQMTQLK-VQDNQ-GNIAFKRRFDVE-- 326 (527) Q Consensus 252 ~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~-~~~~~~i~v~~~~l~~~-~~~~~-~~~~~~~~~d~~-- 326 (527) ||.+.+.. .-+|+|.+..+...+. ++..-.+|...| +.|.. |..++... ..... ..-.....|... T Consensus 35 ~r~~~~~~----~~~Glspi~~a~~~i~-~~~aa~~~~~~~f~Ng~~----p~gil~~~~~~l~~e~~~~~~~~~~~~~g 105 (219) T protein:vir:98 35 IKLYDPMQ----QVYGSPDYVGGITSAL-LNSDATIFRRRYYSNGAH----MGFILYSTDPDMTEEMEDEIAERIRDSKG 105 (219) T ss_pred ecCCCCCC----CcceecHHHHHHHHHH-HHHHHHHHHHHHHhcCCC----CceEEEeCCCCCCHHHHHHHHHHHHHhcC Confidence 66543221 1259998877766664 344455555443 44332 22222111 00000 000001111100 Q ss_pred ----cceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCccccccccccc---chHHHHHHHHHHHHH Q lcl|NC_019418. 327 ----QNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGV---KTATEIVSENSDTYQ 399 (527) Q Consensus 327 ----~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~---~TAtei~s~~~~~~~ 399 (527) ..+.+....+..+...++-++......++++.-+....+|+..-|++|..+|....+. .++.+... T Consensus 106 ~~n~~~~~l~~~gg~~~G~~~~~~~~~~~d~qfle~rk~~~~eIa~~fgVPp~~lG~~~~~~~~~sn~eq~~~------- 178 (219) T protein:vir:98 106 VGNFRSMFVNIAGGHPDGLKVIPIGDTGQKDEFANIKNISAQDVLTSHRFPPGLSGIIPVNTAGLGDPLKIRE------- 178 (219) T ss_pred cccccceeEecCCCCccceeEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcccccCCCCCccCHHHHHH------- Confidence 0111100011112223445556666788888888888899999999999988654332 23333211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHH Q lcl|NC_019418. 400 MRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRH 452 (527) Q Consensus 400 ~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~ 452 (527) ..++..|.-++..|....+. . ......+.+.|++..+.|.. T Consensus 179 ------~f~~~tL~P~~~~ie~~ln~-~-----~~~~~~~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 179 ------AYQADEVLPLQEIIAESINS-D-----YEIKSALKVNFKQPEKRDKN 219 (219) T ss_pred ------HHHHHHHHHHHHHHHHHhhh-h-----hcCCCccEEeecCcccccCC Confidence 11222333332222221110 0 01122356889998888876 No 257 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=76.41 E-value=0.14 Score=25.34 Aligned_cols=414 Identities=11% Similarity=0.077 Sum_probs=158.4 Q ss_pred CChHH----------HHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCce Q lcl|NC_019418. 1 MSLIQ----------KVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKM 70 (527) Q Consensus 1 m~~~~----------~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~ 70 (527) |+-|= .+++-... ...........|+--.+++..+.+|-+-.. .|+... +....-..-.+ T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~----~~~~~~~~~~~~~~~gltp~~l~~iLr~a~--~gd~~~--~~~L~e~m~e~-- 70 (526) T protein:vir:99 1 MAQIVDVYGNPIRTQQLREPQTS----RLAGLAKEFAQHPAKGLTPAKLARILVEAE--QGNLQA--QAELFMDMEER-- 70 (526) T ss_pred CCeeECCCCCccccccccchhhh----hhhhhhhhhcccCcCCCCHHHHHHHHHhhh--CCCHHH--HHHHHHHHHhh-- Confidence 21110 00000000 000000111223433444444333332110 000000 00000000000 Q ss_pred eecchHHHHHHHHhhhhhcccceEeeC------CHHHHHHHHHHHhhh-hHHHHHHHHHHHHHhcCCEEEEEEEeC--Ce Q lcl|NC_019418. 71 QHLPIARTAAKKIASLVYNEQAEISAE------DETLNDFLSDMLSND-RFNKNFERYLESALALGGLAMRPYVDG--DK 141 (527) Q Consensus 71 ~~lnl~~~i~~~~A~ll~~e~~~i~~~------d~~~~~~l~~~l~~n-~f~~~~~~~~~~a~~~G~~~~~~~~d~--~~ 141 (527) ..-+...+-+. -.-|.+.+-+|.-. +....+++++++.+. +|...+..++ +|..+|=+++-..|+. +. T Consensus 71 -D~~i~s~l~~R-k~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~g~ 147 (526) T protein:vir:99 71 -DAHLFAEMSKR-KRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDAL-DGIGHGYSCIELEWALQGRE 147 (526) T ss_pred -ChHHHHHHHHH-HHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcccCHHHHHHHHH-HhhhhcceeEEEEEeecCCc Confidence 00122222222 22344444455431 234567888888763 4777776665 6888999999888853 22 Q ss_pred e---EEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCC Q lcl|NC_019418. 142 I---RVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTS 218 (527) Q Consensus 142 ~---~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~ 218 (527) . .+.++++..|.- +. . +...+. ++ + T Consensus 148 ~~~~~l~~r~~~~f~~---~~-----------------~---------------------------~~~~l~---~~--~ 175 (526) T protein:vir:99 148 WMPLAFHHRPQSWFQL---NP-----------------E---------------------------DQNELR---LR--D 175 (526) T ss_pred eeEEEeeeecccceee---cc-----------------C---------------------------CCcEEE---ec--C Confidence 1 122333322110 00 0 000000 00 0 Q ss_pred ccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHc-Ccc Q lcl|NC_019418. 219 DSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKM-GQR 297 (527) Q Consensus 219 ~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~-~~~ 297 (527) ...-|.++ ++. -|.++.. +...++|+|.|.+..|.-..--=+..+..|+.=++. |.+ T Consensus 176 ~~~~g~~l---------~~~---------k~i~~~~----~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P 233 (526) T protein:vir:99 176 NSPAGEAL---------QPF---------GWIIHRP----RARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLP 233 (526) T ss_pred CCCCceee---------cCC---------CeEEEee----cCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCc Confidence 11112111 111 0222211 122367888888888766554444444445433333 433 Q ss_pred eeeechhHhcCCCCCCCcccccccccccccceeee---ccCCC----CCCCcceEeccc-cChHHHHHHHHHHHHHHHHh Q lcl|NC_019418. 298 RVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQ---VGAGN----MDSGGIVDLTTP-IRSSDYISAISEGLKLFEMQ 369 (527) Q Consensus 298 ~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~---~~~~~----~~~~~i~~~~~~-ir~e~~~~~~~~~l~~i~~~ 369 (527) -.+. +.+ .+...+ ..+.+... +..+. ..+..|+.++.. ...+.|.+.++.+=++|+.. T Consensus 234 ~~ig-----ky~-~~a~~~--------ek~~L~~av~~i~~d~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~ 299 (526) T protein:vir:99 234 IRLG-----KYP-PGTADE--------EKATLLRAVTGLGHAAAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKA 299 (526) T ss_pred eEEE-----ecC-CCCCHH--------HHHHHHHHHHHHhhCcEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHH Confidence 2222 111 111100 01111111 10000 011235554432 33344666666555566443 Q ss_pred cCCCccccccc-c---cccchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhhhcccCCcccCccceEEEeC Q lcl|NC_019418. 370 IGVSSGMFTFD-G---QGVKTATEIVSENSDTYQMRNSIVALVEQSIK-ELCVSMCELGKVVGIYRGTIPELDDISVNLD 444 (527) Q Consensus 370 ~g~s~~~~~~~-~---~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~-~li~~il~~~~~~~~~~~~~~~~~~v~v~f~ 444 (527) . ++ +|++.+ + +|.....++- +.-...-+..-.+.+...|. +|++.++.+ ++.. ..+...-+.+.|+ T Consensus 300 i-LG-qtlTs~~~~g~~gS~a~g~vh--~~v~~di~~aDa~~i~~tln~~Li~~l~~~----N~~~-~~~~~~~p~~~~~ 370 (526) T protein:vir:99 300 V-LG-GTLTSTTSQSGGGAFALGQVH--NEVRHDLLASDARQLAATLSRDLLWPLLVL----NRPG-SPDVRRAPRLVFD 370 (526) T ss_pred H-hh-hhhccccccCcchhhhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CCCC-cCCccccceEEeC Confidence 2 22 333221 1 1111111221 11122333445566777884 688888765 2222 2222334567888 Q ss_pred CCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccC-CCCCCCCCCCCCCCCCCCc Q lcl|NC_019418. 445 DGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELA-LYGKGQQNTVGNSKDTVDD 523 (527) Q Consensus 445 d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 523 (527) ..-++|..+.++.+.+++..|+--.+.++.+.+|+++.+-.+.+-.-....++....... .........+ .....+ T Consensus 371 ~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~Gip~~~~~e~~l~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~ 447 (526) T protein:vir:99 371 LREQADITSMAQSIPALVNVGLEIPSAWVYDKLGIPQPAKNEPVLRSAAQPAILSRQHGQRVAALATIVGP---RYGDQQ 447 (526) T ss_pred CCCcccHHHHHHHHHHHHhCCCccCHHHHHHHhCCCCCCCcccccCCCCCCcccccccccccccccccccc---cCcchh Confidence 888888888999999999999844556678888986543222221100000000000000 0000000000 000000 Q ss_pred cccC Q lcl|NC_019418. 524 EDEA 527 (527) Q Consensus 524 ~~~~ 527 (527) ..+. T Consensus 448 ~~d~ 451 (526) T protein:vir:99 448 ALDK 451 (526) T ss_pred hHHH Confidence 0000 No 258 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=75.92 E-value=0.14 Score=25.24 Aligned_cols=395 Identities=11% Similarity=0.038 Sum_probs=156.1 Q ss_pred hhcccchhhhccCccc-cCHHHHHHHHHHHHHhcCCCc-ccccccccCccccCceeecc----------hHHHHHHHHhh Q lcl|NC_019418. 18 MTTSHLSSILDHPKVA-VTQSEFRRIQHNLAYYQSKFD-DIEYTNTDGDRKRRKMQHLP----------IARTAAKKIAS 85 (527) Q Consensus 18 ~~~~~~~~~~~~~~i~-~~~~~~~~i~~~~~~y~g~~~-~l~~~~~~~~~~~~~~~~ln----------l~~~i~~~~A~ 85 (527) |...-+.....-.+.. ........|...++-|...+. .+. +....-...+. ..+. +...+- +--. T Consensus 1 ~~~~i~~~~g~~~~~~~~~~~~~~~ia~~~~~~~~~~~~~~~-p~~~~il~~~~-~~~~~y~~m~~D~~i~s~l~-~Rk~ 77 (491) T protein:vir:79 1 MSKGLWVSPTEFVKFGEPDKSLSSQIATRARSIDFFALGMYL-PNPDPVLKALG-KDIRVYRELRADAHVGGCVR-RRKA 77 (491) T ss_pred CCCeeeCCCCCcccccccchhHHHHHhhhccccccccccccC-cchhHHHhhcc-CCHHHHHHHhhChHHHHHHH-HHHH Confidence 1111111110000111 111222334333322221100 000 00000000000 0111 111221 2223 Q ss_pred hhhcccceEee--CCHHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC--Cee---EEEEEcCCceEEEEEc Q lcl|NC_019418. 86 LVYNEQAEISA--EDETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG--DKI---RVAFIQAPVFLPLQSN 158 (527) Q Consensus 86 ll~~e~~~i~~--~d~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~--~~~---~i~~v~a~~~~P~~~d 158 (527) -|.+.+-+|.. +++...+++.+++++-.|...+..++ +|..+|-+++-..|.. +.+ ++.++++..|.. + T Consensus 78 av~~~~w~i~~~~~~~~~a~~i~e~l~~~~~~~~i~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~---d 153 (491) T protein:vir:79 78 AVKALEWGLDRGKAKSRVAKSIADVFADLDLSRIATEML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFVY---D 153 (491) T ss_pred HHhCCCcEEecCCCCHHHHHHHHHHHhcCCHHHHHHHHH-HhhhhcceeEEEEEeecCCeeeEEeeeeecccceee---c Confidence 35555555653 24456789999998888888777664 6888999999888853 332 344444443321 1 Q ss_pred CCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccc Q lcl|NC_019418. 159 TQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPV 238 (527) Q Consensus 159 ~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~ 238 (527) ..+. ..+ +..+...-|.++ ++. T Consensus 154 ~~~~-----------------l~l--------------------------------~~~~~~~~g~~l---------p~~ 175 (491) T protein:vir:79 154 PENQ-----------------LRF--------------------------------RSKEHWVQGEEL---------PAR 175 (491) T ss_pred cCCc-----------------eEE--------------------------------eecCCCCCceee---------cCC Confidence 1110 000 000001011111 111 Q ss_pred eeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHc-CcceeeechhHhcCCCCCCCccc Q lcl|NC_019418. 239 TPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKM-GQRRVIVPEQMTQLKVQDNQGNI 317 (527) Q Consensus 239 ~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~-~~~~i~v~~~~l~~~~~~~~~~~ 317 (527) -|.+++. ....++|+|.|.+..|....---+..+..|+.=++. |.+-.+. +.+ .+...+ T Consensus 176 ---------k~i~~~~----~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~ig-----ky~-~~a~~~- 235 (491) T protein:vir:79 176 ---------KFLVPRQ----EATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVG-----KHP-RSASDA- 235 (491) T ss_pred ---------CeEEEEe----cCCCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEE-----ecC-CCCCHH- Confidence 1222221 112356889999998877655555445555443443 4432222 111 110100 Q ss_pred ccccccccccceeeeccCCC----CCCCcceEecccc---ChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHH Q lcl|NC_019418. 318 AFKRRFDVEQNVYMQVGAGN----MDSGGIVDLTTPI---RSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEI 390 (527) Q Consensus 318 ~~~~~~d~~~~~~~~~~~~~----~~~~~i~~~~~~i---r~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei 390 (527) ..+.+ .+.+. .+..+. .....|+.++... -.+.|.+.++.+=++|+... ++ +|++-+++|...+.++ T Consensus 236 -ek~~l--~~al~-~~~~~a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~i-LG-qtlTt~~~gs~a~~~v 309 (491) T protein:vir:79 236 -ETNLL--LDRLE-DMVQDAVAVIPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIAL-LG-QNQTTEATSTRASAQA 309 (491) T ss_pred -HHHHH--HHHHH-HHhcCeEEEecCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHH-hh-hhhccCcccchhhHHH Confidence 00000 01111 011000 1122355554332 12335555554444443322 22 2232222222212223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHH Q lcl|NC_019418. 391 VSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQK 470 (527) Q Consensus 391 ~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~ 470 (527) - +.-...-+..-.+.+...|.+|++.++.+ ++.+ ...+.+.|.+.- .+.+..++.+.+++..|+--.+ T Consensus 310 h--~~v~~~i~~~D~~~i~~tln~li~~l~~~----N~~~-----~~~p~f~~~e~e-e~~~~~a~~~~~L~~~G~~i~~ 377 (491) T protein:vir:79 310 G--LEVTDDIRDGDKAIVVEAMNMLIRWICDL----NFDG-----AARPVFDMWEQE-QVDEIQAGRDEKLTRAGARFTP 377 (491) T ss_pred H--HHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCCC-----CCcceEeecCcC-chhHHHHHHHHHHHhCCCccCH Confidence 1 11223333445666778888888877755 2211 122456676532 2224567788899999986666 Q ss_pred HHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 471 RGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 471 ~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) .++.+.+|+++.+..++.........+ +...... ......++.++ T Consensus 378 ~~~~e~~Gip~~~~~e~~~~~~~~~~~-----~~~~~~~-------~~~~~~~~~d~ 422 (491) T protein:vir:79 378 AYFKRAYNLQDGDLDERPLPVSAVDAV-----GAASFAE-------FEAPDQDALDA 422 (491) T ss_pred HHHHHHhCCCCCCCCccccCcCccccc-----ccccccc-------cCCCCCcchHH Confidence 778888898754322221111111100 0000000 01111111122 No 259 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=74.03 E-value=0.16 Score=24.90 Aligned_cols=391 Identities=12% Similarity=0.057 Sum_probs=143.8 Q ss_pred cccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHHHHhhhhhcccceEeeCCHH--HHHHHHHHH Q lcl|NC_019418. 32 VAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIASLVYNEQAEISAEDET--LNDFLSDML 109 (527) Q Consensus 32 i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~ll~~e~~~i~~~d~~--~~~~l~~~l 109 (527) ++.-+ -+. |....|......+... ...+....-...++.+|+-+-+-|..+.-.+.. ...-+-.+| T Consensus 1 ~~~~~-------~~~----g~~~~~~~~~~~~~~~-~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~~~~~~~~~l~~lL 68 (723) T protein:vir:94 1 MTTFP-------SGA----GGWNAWSADSVFGNGA-KGWSNSAVAYRCISMLANNAASVDLVVRGPDGELDELHPLSQLW 68 (723) T ss_pred Ccccc-------cCC----CccccccccccccccH-HHHhhhHHHHHHHHHHHHhhccceeEEEcCCCccchhhHHHHHH Confidence 00000 000 0000010000011000 001111222344444555444434333211111 111122233 Q ss_pred hh--hh---HHHHHHHHHHHHHhcCCEEEEEEEeCC----e-eEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcc Q lcl|NC_019418. 110 SN--DR---FNKNFERYLESALALGGLAMRPYVDGD----K-IRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKN 179 (527) Q Consensus 110 ~~--n~---f~~~~~~~~~~a~~~G~~~~~~~~d~~----~-~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~ 179 (527) .. |. ........+...+-.|.+++.+..++. . ..+..++++...++..+ ... T Consensus 69 ~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~------------------~~~ 130 (723) T protein:vir:94 69 NVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATR------------------AAD 130 (723) T ss_pred hhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecC------------------CCc Confidence 21 11 112223344456667888887766532 1 23334444433322111 111 Q ss_pred eEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCcccc Q lcl|NC_019418. 180 VYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNN 259 (527) Q Consensus 180 ~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~ 259 (527) .+|. .+.+.|.++.. -|..+++. +. -..||+.+.+. T Consensus 131 ~~~~----------------------~~~~~y~~~~~-----~G~~~~~~------~~----------dIiHir~~~~~- 166 (723) T protein:vir:94 131 AVPQ----------------------AQIIGYVIERT-----DGVRVPVL------AD----------EMLWLRFSDPY- 166 (723) T ss_pred ccee----------------------eeeeEEEEEec-----CceeEEec------cc----------ceEEecCCCCC- Confidence 1110 00111111100 02222210 00 13456543221 Q ss_pred ccCCCccCcchhhhhHHHHHHHHHHHHHHHH-HHHcCcceeeechhHhcCCCCCCCcccccccccccccceeeec-cC-- Q lcl|NC_019418. 260 KDINSPLGLSIFDNAKTTIDFINRTYDEFMW-EIKMGQRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQV-GA-- 335 (527) Q Consensus 260 ~~~~splG~S~~~~~~~lid~ld~~~s~~~~-e~~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~-~~-- 335 (527) +...|+|.+.-+...|...... ..+.. -|..|.. |..+|..+.-++...-.....|. ..|.+. +. T Consensus 167 ---dg~~G~Spi~~a~~~i~~~~aa-~~~~~~~f~NG~~----p~giL~~~~l~~e~~~~~~~~~~---~~~~G~~Nagk 235 (723) T protein:vir:94 167 ---DPLAVMAPWKAARAAVDADFYA-ATWQRQSFKNGAR----PGGVVNLGDMDEQTFTKTVAAFR---SQVEGVQNAGR 235 (723) T ss_pred ---CCcccccHHHHHHHHHHHHHHH-HHHHHHHHhcCCC----cceEEEcCCCCHHHHHHHHHHHH---HHhhchhhcCc Confidence 2236899888777666543332 33332 3455432 22222211100000000000110 111111 00 Q ss_pred -----CC-------CCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHH Q lcl|NC_019418. 336 -----GN-------MDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNS 403 (527) Q Consensus 336 -----~~-------~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~ 403 (527) ++ +++..++.++.+....++.+.......+|+...|++|..++..... .+..+. ... T Consensus 236 ~~vL~g~~~~~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~~st~-sN~e~~---~~~------- 304 (723) T protein:vir:94 236 HLLIAGQGSDGGAAGKGATFTSLSMSPAEMDYINSRMHSAEEVMLAFGIRKDALLGGSTY-ENQAEA---KAA------- 304 (723) T ss_pred ceeecccccccccccCCceEEEccCCHHHHHHHHHHHHhHHHHHHHhCCChhHcCCCCCc-ccHHHH---HHH------- Confidence 00 1122345556666677888888888899999999999877643321 111111 111 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCC--CccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCC Q lcl|NC_019418. 404 IVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDD--GVFTDRHAELDYWMKMVAAGFATQKRGIAKT--LGI 479 (527) Q Consensus 404 ~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d--~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~--~~~ 479 (527) .+..+|..++..|-...+. .+.. .....+.++|+. -+..|.++.++....++.+|+|++-+++... .++ T Consensus 305 ---f~~~tL~P~~~~ie~~ln~-~Ll~---~~g~~~~~~f~~~~lLr~D~~~r~~~~~~~v~~G~~T~NE~R~~lglpPi 377 (723) T protein:vir:94 305 ---VWTETLIPQMEVMASITDL-QLLP---DIGWTVEWDFNSVPALQEDLEAQAGRNQGYLVNDVLMVDEVRATIGLDPL 377 (723) T ss_pred ---HHHHHHHHHHHHHHHHHhH-hhcc---cccCceEEeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 1223333333333222111 1111 112346778875 3567888999999999999999999987653 122 Q ss_pred CHHHH--------------------HHH-HHHHHH--hcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 480 TEEEA--------------------EKE-LAEING--ELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 480 ~deea--------------------~~e-l~ri~~--E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) ..-.. .++ ..|... +....+...+..+.... ++-..+.+...+++. T Consensus 378 ~gGd~~~~~~p~~~~~a~~~~~~p~~~e~~~~~~~~~~~~~~~~p~~~~~~~~~--~~~~~~~~~~~~~~~ 446 (723) T protein:vir:94 378 PGGIGQMTLTPYRAQFAPAPAPAPAVEEGAARMLALLERVAADRPLPELPVRAT--TVLHHDPGPDPQQTL 446 (723) T ss_pred CCCcccceeccccccccCCCCCCccchhhhHhhhhhccccccccCcCCCCCCCC--CCCCCCcccCCchhH Confidence 11000 000 011111 10011111111111111 111111222222222 No 260 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=71.19 E-value=0.2 Score=24.43 Aligned_cols=433 Identities=12% Similarity=0.152 Sum_probs=187.0 Q ss_pred CCh---HH---HHHHHHHHHHHHh---hcccchhhh-ccC-----ccccC---------------------HHHHHHHHH Q lcl|NC_019418. 1 MSL---IQ---KVKDFFNRGRYNM---TTSHLSSIL-DHP-----KVAVT---------------------QSEFRRIQH 44 (527) Q Consensus 1 m~~---~~---~~k~~~~~~~~~~---~~~~~~~~~-~~~-----~i~~~---------------------~~~~~~i~~ 44 (527) ||| +. .+|.|-+.--.+. ..+...+.+ .+. .|... .....-|++ T Consensus 1 ~~~~~~~~~l~~~~~~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~y~~~e~~~~~~~eLI~~ 80 (524) T protein:vir:98 1 MNFLGFGNVLSFFKNFAREDEIELEQQLKNDTGSVAPPKNNDGAYEIETDLNNQKYAGVFQQFYSGQDPAIQNKEQLINT 80 (524) T ss_pred CCCcchhhHHHHhhhhhhhhhhhHhhhhcCCcccccCCCCCCCceeecCCCCcceecceeeeeccccccccchHHHHHHH Confidence 443 22 2444433211111 111111111 000 01000 001122222 Q ss_pred HHHHhcCCCcccccccccCccccCceeecchHHHHHHHHhh-hhhc----ccceEeeCCHH--------HHHHHHHHHhh Q lcl|NC_019418. 45 NLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIAS-LVYN----EQAEISAEDET--------LNDFLSDMLSN 111 (527) Q Consensus 45 ~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~-ll~~----e~~~i~~~d~~--------~~~~l~~~l~~ 111 (527) ++.+... | --...++...+ -++. .|+++.+++.. ..+.++.++.= T Consensus 81 YR~ma~~--p--------------------Evd~Av~eIVneaIv~~~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~l 138 (524) T protein:vir:98 81 YRGIMSY--P--------------------EVENAVSEIIDDAIVNEQGKDIITMDLAKTNFSKAIQDKIVEEFDNVLNI 138 (524) T ss_pred HHHHhhc--c--------------------chhhHHHhhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHH Confidence 2222111 1 11111111111 1222 25555555432 45556667776 Q ss_pred hhHHHHHHHHHHHHHhcCCEEEEEEEeCCe----eEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEE Q lcl|NC_019418. 112 DRFNKNFERYLESALALGGLAMRPYVDGDK----IRVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEF 187 (527) Q Consensus 112 n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~----~~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~ 187 (527) -+|.+...+.+....+-|..+|+..+|+.. ..+.+++|.++-++.-.-.+...... ..++ .| .|+ T Consensus 139 l~F~~~~~~~fR~WYVDgRi~fhkiid~~~~kGI~ELr~lDPr~i~~vr~~~~~~~~~~~--~v~~-------~~--~e~ 207 (524) T protein:vir:98 139 YDFDNMGARLFRDWYVDSRIYFHKIMHKDESKGIRELRQLDPRCMELIRESITETLDGGV--KVFR-------GY--REF 207 (524) T ss_pred hccchhhhHHHhhhhhcceeEEEEEEcCCCCcceeeeeeeCCccceeeeeccccccccch--hhcc-------ce--eee Confidence 789999999999999999999999997432 45788888887665311000000000 0010 01 122 Q ss_pred EeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccC Q lcl|NC_019418. 188 HEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLG 267 (527) Q Consensus 188 h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG 267 (527) ..+... ...|...-..|. .++ +-.+|-+ .+++.. .+.+..+ ++ . T Consensus 208 f~Y~~~----------~~~~~~~g~~~~---~~~-~ikI~~d--------AIvy~h-----SGL~d~~--~~------i- 251 (524) T protein:vir:98 208 FVYSAP----------KAGYTYNGQIYQ---ANQ-KIKIPRS--------AIVYAH-----SGLEDCS--NN------I- 251 (524) T ss_pred eeeccC----------CCccccccceec---CCC-ceeechh--------heeeec-----cCcccCC--CC------e- Confidence 111100 000100001111 010 1111111 111110 0111110 00 1 Q ss_pred cchhhhhHHHHHHHHHHHHHHH--HHHHcCcceeeec----------hhHh-----c----CCCCCCCcccccccccccc Q lcl|NC_019418. 268 LSIFDNAKTTIDFINRTYDEFM--WEIKMGQRRVIVP----------EQMT-----Q----LKVQDNQGNIAFKRRFDVE 326 (527) Q Consensus 268 ~S~~~~~~~lid~ld~~~s~~~--~e~~~~~~~i~v~----------~~~l-----~----~~~~~~~~~~~~~~~~d~~ 326 (527) +|-+..|.-.+..|=-.-+.++ +-.|+-.+|||-- +.++ + ..-|..+|++.-.+.+-.- T Consensus 252 isyLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGevrddrk~msM 331 (524) T protein:vir:98 252 IGYLHRAVKPANQLRLLEDAMVIYRITRAPERRVFYIDVGQMGGNKATQYVNNIAQGLKNRVVYDARTGTVKNQQNNLSM 331 (524) T ss_pred eeehhHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeeccCceeeccccccch Confidence 3455666555555544444443 4445666777641 0111 0 0113344443322221110 Q ss_pred -cceeeeccCCCCC-CCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccc--cchHHHHHHHHHHHHHHHH Q lcl|NC_019418. 327 -QNVYMQVGAGNMD-SGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQG--VKTATEIVSENSDTYQMRN 402 (527) Q Consensus 327 -~~~~~~~~~~~~~-~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g--~~TAtei~s~~~~~~~~~~ 402 (527) ...+.+- -+|+ ..-|+++..-=...+ ...+..+.+.+....+++.+-+..+.+| .--++||.-..-....-+. T Consensus 332 lEDyWLpR--ReGgrgTEItTLpggqnlge-m~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDEiKF~KFI~ 408 (524) T protein:vir:98 332 TEDYWLMR--RDGKAITEVSTLPGGQNFSD-MDDIKWFNRKLYEALRVPLSRMPRDDGGMQIGGGGEITRDELKFSKFIR 408 (524) T ss_pred hhhhcccc--cCCCCccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCceeccCCCCccccccccchhHHHHHHHHHHH Confidence 0001110 1122 123565554333333 4556666677777777776666533221 1125567666666667788 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCc----cceEEEeCCCccCCHHHHHHHHHH---HH--hcC----CCCH Q lcl|NC_019418. 403 SIVALVEQSIKELCVSMCELGKVVGIYRGTIPEL----DDISVNLDDGVFTDRHAELDYWMK---MV--AAG----FATQ 469 (527) Q Consensus 403 ~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~----~~v~v~f~d~i~~d~~~~~~~~~~---~~--~aG----i~s~ 469 (527) +.+..|..-+.++++.=|.|-.. +. ..+| ..|.++|...=-..+..+++.... +. ..+ ..|. T Consensus 409 rLR~rFs~lf~~~L~~qLilKgi---it--~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~ 483 (524) T protein:vir:98 409 TLQIQFSPVLSDPLKTNLIAKKI---IT--EDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQVEGVVGKYVSH 483 (524) T ss_pred HHHHHHHHHHHHHHHHhhhhhcC---CC--HHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhccccccccch Confidence 88888888888888876655322 11 1112 236677754333333333332221 11 112 5788 Q ss_pred HHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCC Q lcl|NC_019418. 470 KRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSK 518 (527) Q Consensus 470 ~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 518 (527) +++.++....||+|.+++..+|++|... +.+.++ ..+.++- T Consensus 484 dyi~k~ILr~tDeei~~~~k~I~~E~k~------~~~~~p--~~e~~~f 524 (524) T protein:vir:98 484 KYIMKEILRMSDEDIDEQAKLIEEESKE------ERFKNP--EAEEENF 524 (524) T ss_pred HHHHHHHhccCHHHHHHHHHHHHHHHhC------CCCcCC--ccccccC Confidence 8888888899999999999999988642 111111 1111111 No 261 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=69.94 E-value=0.22 Score=24.23 Aligned_cols=417 Identities=12% Similarity=0.091 Sum_probs=158.6 Q ss_pred CChH-HHHHHHH-----HHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecc Q lcl|NC_019418. 1 MSLI-QKVKDFF-----NRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLP 74 (527) Q Consensus 1 m~~~-~~~k~~~-----~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~ln 74 (527) |+-| +.-=+=+ ++--.-.....-.....|+-=.+++..+.+|-+-. =.|+.. .+.......-. ...- T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a--~~gd~~--~~~~L~edm~e---~D~~ 73 (526) T protein:vir:79 1 MAQIVDVYGNPIRPQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEA--EQGNLQ--AQAELFMDMEE---RDAH 73 (526) T ss_pred CCeeeCCCCCccCccccchhhhhhhhhhhhhcccCCCCCcCHHHHHHHHHHh--hCCCHH--HHHHHHHHHHh---hChH Confidence 2110 0000000 00000000000011123444445554444443321 011100 00000000000 0001 Q ss_pred hHHHHHHHHhhhhhcccceEeeC------CHHHHHHHHHHHhhh-hHHHHHHHHHHHHHhcCCEEEEEEEeC--Cee--- Q lcl|NC_019418. 75 IARTAAKKIASLVYNEQAEISAE------DETLNDFLSDMLSND-RFNKNFERYLESALALGGLAMRPYVDG--DKI--- 142 (527) Q Consensus 75 l~~~i~~~~A~ll~~e~~~i~~~------d~~~~~~l~~~l~~n-~f~~~~~~~~~~a~~~G~~~~~~~~d~--~~~--- 142 (527) +...+-+. -.-|.+.+-+|.-. +....+++++++.+. +|...+..++ +|..+|=+++-+.|+. +.. T Consensus 74 i~s~l~~R-k~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~l-dA~~~G~s~~Ei~w~~~~g~~~~~ 151 (526) T protein:vir:79 74 LFAEMSKR-KRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDAL-DGIGHGYSCIELEWALQGREWMPL 151 (526) T ss_pred HHHHHHHH-HHHHhCCCceEecCCCCChHHHHHHHHHHHHHhcccCHHHHHHHHH-hhhhhcceeEEEEEeecCCceeEE Confidence 22222222 22344555455431 235667888888763 4777666554 4888999999888853 221 Q ss_pred EEEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCcccc Q lcl|NC_019418. 143 RVAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQL 222 (527) Q Consensus 143 ~i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~l 222 (527) ++.++++..|.- +.. +...+. ++ ++..- T Consensus 152 ~l~~r~~~~F~~---~~~--------------------------------------------~~~~l~---~~--~~~~~ 179 (526) T protein:vir:79 152 AFHHRPQSWFQL---NPE--------------------------------------------DQNELR---LR--DNSPA 179 (526) T ss_pred EeeeecccceEe---ccC--------------------------------------------CCcEEE---ec--CCCCC Confidence 122222221110 000 000000 00 01111 Q ss_pred CceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHc-Ccceeee Q lcl|NC_019418. 223 GERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKM-GQRRVIV 301 (527) Q Consensus 223 G~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~-~~~~i~v 301 (527) |.++ ++. -|.++.. +...++|+|.+.+..|.-..--=+..+..|+.=++. |.+-.+. T Consensus 180 g~~l---------~~~---------k~iv~~~----~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~yG~P~~ig 237 (526) T protein:vir:79 180 GEAL---------QPF---------GWIIHRP----RARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLG 237 (526) T ss_pred ceee---------cCC---------ceEEEee----cCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEE Confidence 2111 111 0222211 122357788888887765554334344444433332 4332222 Q ss_pred chhHhcCCCCCCCcccccccccccccceeeec-cCCC------CCCCcceEeccc-cChHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_019418. 302 PEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQV-GAGN------MDSGGIVDLTTP-IRSSDYISAISEGLKLFEMQIGVS 373 (527) Q Consensus 302 ~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~-~~~~------~~~~~i~~~~~~-ir~e~~~~~~~~~l~~i~~~~g~s 373 (527) +.+ .+...+ ..+.+...+ ++.. ..+..|+.++.. ...+.|.+.++.+-++|+... ++ T Consensus 238 -----ky~-~~a~~~--------ek~~L~~av~~i~~da~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i-LG 302 (526) T protein:vir:79 238 -----KYP-PGTADE--------EKATLLRAVTGLGHAAAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV-LG 302 (526) T ss_pred -----ecC-CCCCHH--------HHHHHHHHHHHHhcCcEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH-hh Confidence 110 111100 011111111 0000 011235555432 333446666666555664432 23 Q ss_pred ccccccc-c---cccchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhhhcccCCcccCccceEEEeCCCcc Q lcl|NC_019418. 374 SGMFTFD-G---QGVKTATEIVSENSDTYQMRNSIVALVEQSIK-ELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVF 448 (527) Q Consensus 374 ~~~~~~~-~---~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~-~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~ 448 (527) +|++.+ + +|.-...++- +.-...-+..-.+.+...|. +|++.++.+ ++... .+...-+.+.|+..-+ T Consensus 303 -qtlTs~~~~g~~gS~a~g~vh--~~v~~di~~aDa~~i~~tln~~Li~~l~~~----N~~~~-~~~~~~p~~~~~~~e~ 374 (526) T protein:vir:79 303 -GTLTSTTSQSGGGAFALGQVH--NEVRHDILASDARQLAATLSRDLLWPLLVL----NRPGS-PDVRRAPRLVFDLREQ 374 (526) T ss_pred -hhhccccccCcchhhhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CCCCc-CCccccceEEeCCCCc Confidence 333221 1 1111111221 11122333445566777884 688888765 22222 1222335677888788 Q ss_pred CCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCcccc-- Q lcl|NC_019418. 449 TDRHAELDYWMKMVAAGFATQKRGIAKTLGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDE-- 526 (527) Q Consensus 449 ~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 526 (527) +|..+.++.+.+++..|+--.+.++.+.+|+++.+..+.+..-.....+.... .+...........-...+.++ T Consensus 375 eDl~~~a~~~~~L~~~G~~i~~~~i~e~~gip~~~~~e~~l~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~d 450 (526) T protein:vir:79 375 ADITSMAQSIPALVNVGLEIPSAWVYDKLGIPQPAKNEPVLRPAAQPAILSRQ----HGQRVAALATIVGPRYGDQQALD 450 (526) T ss_pred ccHHHHHHHHHHHHhCCCcCCHHHHHHHhCCCCCCCchhhccccCCccccccc----cccccccccccccccCchhhHHH Confidence 88888899999999999855566788888986533222221100000000000 000000000000000111110 Q ss_pred -C Q lcl|NC_019418. 527 -A 527 (527) Q Consensus 527 -~ 527 (527) . T Consensus 451 ~~ 452 (526) T protein:vir:79 451 KA 452 (526) T ss_pred HH Confidence 0 No 262 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=59.38 E-value=0.39 Score=22.80 Aligned_cols=365 Identities=17% Similarity=0.114 Sum_probs=111.9 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) ||+|.|++.|++.-... -..+..... ..+++.... .+ | .--..|| T Consensus 1 M~if~~~~~~~~~~~~~-~~~~~~~~~-~~~~~~~~~---~v------~------------------------~~v~~Ia 45 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNN-DTQRVTAWQ-NEAVEYTSA---FV------T------------------------NIHNKIA 45 (378) T ss_pred CchhHHhHhhhhccccc-Ccceeeeee-cchhhhhhH---HH------H------------------------HHHHHHH Confidence 99999999987631100 000000000 000000000 00 0 0011233 Q ss_pred HHHhhhhhccc----c------eEeeCCHHHHHHHHHH-HhhhhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcC Q lcl|NC_019418. 81 KKIASLVYNEQ----A------EISAEDETLNDFLSDM-LSNDRFNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQA 149 (527) Q Consensus 81 ~~~A~ll~~e~----~------~i~~~d~~~~~~l~~~-l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a 149 (527) +..|++=|.-. . ...+.+..+...|+.- -..-....-....+...+-.|.+++.|.++. T Consensus 46 ~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~---------- 115 (378) T protein:vir:94 46 NEITKVEFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDS---------- 115 (378) T ss_pred HhHhhCceeeeeecccccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeC---------- Confidence 33333211100 0 0011122233333210 0000111222234555566788887654432 Q ss_pred CceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecc Q lcl|NC_019418. 150 PVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLS 229 (527) Q Consensus 150 ~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~ 229 (527) .++.+...+. .. +.+.|. .+ .-.+|.+-.+. +.+.. ++. T Consensus 116 ---------~~g~~~~~~~----~~---~~~~~~---~~----------------dvih~~~~~~~----~~~~~--~~~ 154 (378) T protein:vir:94 116 ---------ETGELLDLLF----AN---DKKEYK---PE----------------ELVRLTSPFYI----NEDTS--ILD 154 (378) T ss_pred ---------CCCcEEEEEE----ec---CcEEec---hh----------------ceeeecCcCCc----ccchh--HHH Confidence 1111100000 00 000000 00 00001000000 00000 000 Q ss_pred cccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeechhHhcCC Q lcl|NC_019418. 230 ELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPEQMTQLK 309 (527) Q Consensus 230 ~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~~~l~~~ 309 (527) .....+... .-.|-.+ ++++.+. ..+. +..+.+.+.+...|... ..+. T Consensus 155 ~~~~~~~~~-~~~~~~~---g~l~~~~----------~l~~-~~~~~~~e~~~~~~~~~----~~~~------------- 202 (378) T protein:vir:94 155 NALASIQTK-LEQGKLR---GLLKINA----------FLDI-DNTQEYREKALATIKNM----QEGS------------- 202 (378) T ss_pred HHHHHHHHH-HhhCCcc---cceeeCC----------cCCH-HHHHHHHHHHHHHHHHh----hccc------------- Confidence 000000000 0001000 1111110 0000 01111111111111111 0000 Q ss_pred CCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHH Q lcl|NC_019418. 310 VQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATE 389 (527) Q Consensus 310 ~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAte 389 (527) ..++-+ .+ +++..++.++.+....+ ...++.+.++|+...|++|..+.. |++| T Consensus 203 --n~~~~~----vl--------------~~g~~~~~l~~~~~~~~-~~~~~~~~~~Ia~~fgvPp~~l~g------~~~e 255 (378) T protein:vir:94 203 --SYNGLT----PV--------------DNKTEIVELKKDYSVLN-KDEIDLIKSELLTGYFMNENILLG------TATQ 255 (378) T ss_pred --ccccce----ec--------------cCCceEEEccCChHHhh-HHHHHHHHHHHHHHhCCCHHHhcC------CchH Confidence 000000 00 01112333333222233 344555667899999999987731 1222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCH Q lcl|NC_019418. 390 IVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQ 469 (527) Q Consensus 390 i~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~ 469 (527) ..+ ..-...++.-+...++.+|..-+-.=.+.... + ......++.++++.-...|..+.++...+++.+|+|++ T Consensus 256 ~~~-~~f~~~tl~P~~~~ie~~l~~~Ll~~~e~~~g--~---~~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~~t~ 329 (378) T protein:vir:94 256 EQQ-IYFYNSTIIPLLIQLEKELTYKLISTNRRRVV--K---GNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQ 329 (378) T ss_pred HHH-HHHHHHHHHHHHHHHHHHHHhhcCChhHhhhh--h---hhcccceeEeecchhhhcCHHHHHHHHHHHHhCCCcCH Confidence 111 11111222222222333332210000000000 0 01112345667777778899999999999999999999 Q ss_pred HHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 470 KRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 470 ~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) -+++.+. |+..=+ -.+-+ +.....+ .. ...+......+.+++||+.. T Consensus 330 NE~R~~~-g~~p~~ggd~~~--~~~n~~~-----~~---~~~~~~~~~~~~~~~~e~~n 377 (378) T protein:vir:94 330 NQLLVKM-GEQPIEGGDVYI--ANLNAVA-----VK---NLSDLQGNRKDVTSTDETNN 377 (378) T ss_pred HHHHHHh-CCCCCCCCCeee--ecccccc-----hh---cchhcccccCCCCCCCCCCC Confidence 9976543 553210 00000 0000000 00 00011111111222222222 No 263 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=53.29 E-value=0.53 Score=22.09 Aligned_cols=439 Identities=11% Similarity=0.081 Sum_probs=151.2 Q ss_pred hcccchhhhccCccccCHHHHHHHHHH------HHHhcCCCcccccccccCccccCceeecchHHHHHHHHhhhhhcccc Q lcl|NC_019418. 19 TTSHLSSILDHPKVAVTQSEFRRIQHN------LAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIASLVYNEQA 92 (527) Q Consensus 19 ~~~~~~~~~~~~~i~~~~~~~~~i~~~------~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~ll~~e~~ 92 (527) |..+ .++---+++..+..|-+- ..+|+..-+.|. +...-....+=....-+...+-+. -.-|.+-.- T Consensus 1 ~~~~-----~~~~~gl~p~rl~~i~~~~~~~~~~~~~~~~~~~Lr-~~~~~~ly~~m~~D~hi~s~l~~R-k~av~~~~w 73 (488) T protein:vir:95 1 MADI-----TETQESLPPFRMGEVGSLGLKVKNGRIYEEPRQALR-FPESIKTFQLMMRDPAVAASVNII-KMFVRKVNW 73 (488) T ss_pred CCCc-----cccCCCCCHHHHHHHHHHhhccccchhhccchhhhc-ccchHHHHHHHhhChHHHHHHHHH-HHHHhcCCc Confidence 1111 112224567666666421 112322212221 000000000000001122222222 223445444 Q ss_pred eEeeC-----C---HHHHHHHHHHHhhhh--HHHHHHHHHHHHHhcCCEEEEEEEeCCee---EEEEEcCCceEEEEEcC Q lcl|NC_019418. 93 EISAE-----D---ETLNDFLSDMLSNDR--FNKNFERYLESALALGGLAMRPYVDGDKI---RVAFIQAPVFLPLQSNT 159 (527) Q Consensus 93 ~i~~~-----d---~~~~~~l~~~l~~n~--f~~~~~~~~~~a~~~G~~~~~~~~d~~~~---~i~~v~a~~~~P~~~d~ 159 (527) .|... + ...+++++.++.+-+ |...+..+ -+|..+|-+++-+.|..+.. .+.++..+..+ T Consensus 74 ~v~p~~~~~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~-lda~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~~------ 146 (488) T protein:vir:95 74 RFVPPKGKEQDPKMLERADFFNSLMDDMEHDWADFINSV-MSFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGLI------ 146 (488) T ss_pred eEecCCCCchhHHHHHHHHHHHHHHhccCccHHHHHHHH-HHhhcccceeeeeeeeccccccccccccccCCee------ Confidence 55422 1 124577888886543 44555555 47888999999888864321 11111111111 Q ss_pred CceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccc-cCCcccc Q lcl|NC_019418. 160 QDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSEL-YPDLQPV 238 (527) Q Consensus 160 ~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~-~~~l~~~ 238 (527) ...-+.++-. ..-.+|. + +...+.+. .. .+..-+ ..++... +-. T Consensus 147 ---~~~~i~~Rpq----~~~~~f~------~------------d~d~~l~~---~~--~~~~~~-~~~~~~~~~~~---- 191 (488) T protein:vir:95 147 ---GWAKLPIRNQ----STLDKWY------F------------DEDFRRVT---GV--RQNLRN-VSHIAGAINLG---- 191 (488) T ss_pred ---eeeeeeecCc----cccccee------e------------ccCCCcee---ec--cccccc-ccccccccccc---- Confidence 1111111100 0000000 0 00000000 00 000000 0000000 000 Q ss_pred eeecCCCccc--EEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHH---cCcceeeechhHhcCCCCCC Q lcl|NC_019418. 239 TPIQGLSRPL--FTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIK---MGQRRVIVPEQMTQLKVQDN 313 (527) Q Consensus 239 ~~~~g~~~p~--f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~---~~~~~i~v~~~~l~~~~~~~ 313 (527) ....++.-|. |..++ .....++|+|.|.+..|--..--=+..+..|+.=++ .+-+.+..|..+.....+.+ T Consensus 192 ~~~~~~~lP~~kfi~~~----~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e 267 (488) T protein:vir:95 192 ERPLTRKLPRAKFMLFK----YDDEYGNPEGRSPLLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPE 267 (488) T ss_pred cccccccccccceEEEe----ecCCCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHH Confidence 0001122222 22222 122346799999998886555332322222322222 23333444433221110000 Q ss_pred Ccccccccccccccce------------eeeccCCCCCCC---cceEeccc-cChHHHHHHHHHHHHHHHHhcCCCc-cc Q lcl|NC_019418. 314 QGNIAFKRRFDVEQNV------------YMQVGAGNMDSG---GIVDLTTP-IRSSDYISAISEGLKLFEMQIGVSS-GM 376 (527) Q Consensus 314 ~~~~~~~~~~d~~~~~------------~~~~~~~~~~~~---~i~~~~~~-ir~e~~~~~~~~~l~~i~~~~g~s~-~~ 376 (527) .. ..+..-..+ -.+.+++.+.+. .++..... .....|.+.++.+=++|+... ++. -| T Consensus 268 ~~-----~l~~a~~~i~~~~~~~~~ag~iiP~g~~~~~k~~~~e~~l~~~~~~~~~~~~~li~~~d~~Isk~i-LGqtLT 341 (488) T protein:vir:95 268 KK-----AFVQYCKTVVNDMIANDRAGLIWPRYIDPDTKEDIFEFSLVSRQGAKAYDTGSIIDRYSKQIMMAF-MSDVLA 341 (488) T ss_pred HH-----HHHHHHHHHHHHhhccchhheeeccccccccchhhhhhhccccccCCchhHHHHHHHHHHHHHHHH-hccccc Confidence 00 000000000 000000000000 01111111 111234454554444543332 221 12 Q ss_pred ccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHH Q lcl|NC_019418. 377 FTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIK-ELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAEL 455 (527) Q Consensus 377 ~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~-~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~ 455 (527) .+...+|.-...++- +.-....+..-.+.+..+|. +|+.-++.+ ++. +...-+.+.|+..-++|..+.+ T Consensus 342 ~~~~~~Gs~Al~~vh--~ev~~~i~~aDa~~i~~tln~~li~~l~~~----Nfg----~~~~~P~~~~~~~e~~Dl~~~a 411 (488) T protein:vir:95 342 MGQSKYGSFSLADSK--TSLLAMSVDILLKQIKNVINRDLVAQTYAL----NMW----DDEEHVQITYDDIETPDLEAIG 411 (488) T ss_pred cccCcchhhhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCC----CCCCccEEEecCcChhhHHHHH Confidence 222222211112221 11122233344455667774 688877654 221 1222356788887788878888 Q ss_pred HHHHHHHhcCCCCH----HHHHHhcCCCCHHHHHHHHHHHH-HhcccccccccCCCCCCCCC-CCCCCCCCCCcccc Q lcl|NC_019418. 456 DYWMKMVAAGFATQ----KRGIAKTLGITEEEAEKELAEIN-GELPPESDAELALYGKGQQN-TVGNSKDTVDDEDE 526 (527) Q Consensus 456 ~~~~~~~~aGi~s~----~~~i~~~~~~~deea~~el~ri~-~E~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 526 (527) +.+.+++.+|+.-+ ++++.+.+|+++.+-.+.+..-. .+..+.........+..... +...+....+.+.+ T Consensus 412 e~~~~L~~~G~~i~~~~~~~~i~e~~gip~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~ 488 (488) T protein:vir:95 412 SYIQKTVAVGALEVDKELSNKLREHIGLPPADESQPVSEKLSPNSQSRSGDGYKTAGEGTAKTPSAKDPSTANKANK 488 (488) T ss_pred HHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCccccccCCCCCCCCCCcccCCCcccCCcccccccchhhhhccC Confidence 89999999998554 67888999987432111111000 00000010000000000000 00000111111111 No 264 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=50.25 E-value=0.62 Score=21.74 Aligned_cols=353 Identities=15% Similarity=0.118 Sum_probs=108.1 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchH--HH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIA--RT 78 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~--~~ 78 (527) ||+|.|++.|.|.-. .+.. ... . .|.|. . ..++.+ .. T Consensus 1 M~~f~k~~~~~~~~~---~~~~-------~~~----------~----~~~~~-----------~------~~~~~~~v~~ 39 (378) T protein:vir:85 1 MNLFGKVVSFSRGKL---NNDT-------QRV----------T----AWQNE-----------A------VEYTSAFVTN 39 (378) T ss_pred Cchhhhhhhhhhccc---ccCC-------cce----------e----eeecc-----------c------hhhhhHHHHH Confidence 999999998876311 0000 000 0 00000 0 000111 12 Q ss_pred HHHHHhhhhhcccceE--------------eeCCHHHHHHHHHHHhh-hhHHHHHHHHHHHHHhcCCEEEEEEEeCCeeE Q lcl|NC_019418. 79 AAKKIASLVYNEQAEI--------------SAEDETLNDFLSDMLSN-DRFNKNFERYLESALALGGLAMRPYVDGDKIR 143 (527) Q Consensus 79 i~~~~A~ll~~e~~~i--------------~~~d~~~~~~l~~~l~~-n~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~ 143 (527) +++.+|+-+-+-|..+ .+.+..+...|+.==.. -.-.......+...+-.|.+++.|.++. T Consensus 40 ~v~~ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~---- 115 (378) T protein:vir:85 40 IHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDS---- 115 (378) T ss_pred HHHHHHHhHhhCceeEEEEeccccccccccccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecC---- Confidence 2333333332222211 11222333222210000 0011112223445556688776554322 Q ss_pred EEEEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccC Q lcl|NC_019418. 144 VAFIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLG 223 (527) Q Consensus 144 i~~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG 223 (527) ..+.+...+ +. .+.+.|..-| -.++..-++ ....+| T Consensus 116 ---------------~~g~~~~~~----~~---~~~~~~~~~d-------------------vih~~~~~~---~~~~~~ 151 (378) T protein:vir:85 116 ---------------ETGELLDLL----FA---NDKKEYKPEE-------------------LVRLVSPFY---INEDTS 151 (378) T ss_pred ---------------CCceEEEEE----ec---CCCEEEcccc-------------------eEEEecCcC---ccchhh Confidence 112111100 00 0111111000 000000000 000000 Q ss_pred ceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHcCcceeeech Q lcl|NC_019418. 224 ERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKMGQRRVIVPE 303 (527) Q Consensus 224 ~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~~~~~i~v~~ 303 (527) .+......+... .-.| .|- ++++.+. . .+. +.. +.+-+.|..+......+ T Consensus 152 ---~~~~a~~~~~~~-~~~~--~~~-g~l~~~~--~--------l~~-~~~----~~~~~~~~~~~~~~~~~-------- 201 (378) T protein:vir:85 152 ---ILDNALASIQTK-LEQG--KLR-GLLKINA--F--------LDI-DNT----QEYREKALATIKNMQEG-------- 201 (378) T ss_pred ---HHHHHHHHHHHH-HhcC--Ccc-eEEEeCC--c--------CCH-HHH----HHHHHHHHHHHHHhhcc-------- Confidence 000000000000 0001 111 1222110 0 000 011 11112221111111000 Q ss_pred hHhcCCCCCCCcccccccccccccceeeeccCCCCCCCcceEeccccChHHHHHHHHHHHHHHHHhcCCCcccccccccc Q lcl|NC_019418. 304 QMTQLKVQDNQGNIAFKRRFDVEQNVYMQVGAGNMDSGGIVDLTTPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQG 383 (527) Q Consensus 304 ~~l~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g 383 (527) ...++-+ .. + ++..++.++.+-...+ .+.++....+|+...|++|+.+.. + T Consensus 202 -------~~~g~~~----vl------------~--~g~~~~~l~~~~~~~~-~~~~~~~~~~Ia~~fgVPp~~l~~--s- 252 (378) T protein:vir:85 202 -------SSYNGLT----PV------------D--NKTEIVELKKDYSVLN-KDEIELIKSELLTGYFMNENILLG--T- 252 (378) T ss_pred -------cccccce----ec------------C--CCceEEeccCChhhhh-HHHHHHHHHHHHHHhCCCHHHhcC--C- Confidence 0000000 00 0 1111222222211222 234455566899999999988732 1 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccC------C-cccCccceEEEeCCCccCCHHHHHH Q lcl|NC_019418. 384 VKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYR------G-TIPELDDISVNLDDGVFTDRHAELD 456 (527) Q Consensus 384 ~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~------~-~~~~~~~v~v~f~d~i~~d~~~~~~ 456 (527) ..|... ...+..+|..++..|-.-... .|+. + ......++.++++.-...|..+.++ T Consensus 253 ---~~e~~~------------~~f~~~tL~P~~~~ie~~l~~-kLl~~~er~~~~~~~~~~~~~f~~~~l~~~d~~~~~~ 316 (378) T protein:vir:85 253 ---ATQEQQ------------IYFYNSTIIPLLIQLEKELTY-KLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELID 316 (378) T ss_pred ---chHHHH------------HHHHHHHHHHHHHHHHHHHHh-hcCChhhhhhhhhccccceeeecchhhhhcCHHHHHH Confidence 112110 112233333333332211110 0110 0 0011123455556667788899999 Q ss_pred HHHHHHhcCCCCHHHHHHhcCCCCHHH-HHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCC Q lcl|NC_019418. 457 YWMKMVAAGFATQKRGIAKTLGITEEE-AEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTV 521 (527) Q Consensus 457 ~~~~~~~aGi~s~~~~i~~~~~~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 521 (527) ...+++.+|+|++-++++++ |+..-+ -..- .+.....+-... ....+..++....++++|- T Consensus 317 ~~~~~~~~G~~T~NE~R~~l-gl~p~~gGD~~--~~~~N~~~~~~~-~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:85 317 LYHENINGPIFTQNQLLVKM-GEQPIEGGDIY--IANLNAVAVKNL-SDLQGSRKDVASTDETNNQ 378 (378) T ss_pred HHHHHHhCCCcCHHHHHHHh-CCCCCCCCCeE--eecccccccccc-hhhcCccCCCCCCCCCCCC Confidence 99999999999999976553 553210 0000 000000000000 0000011111111111111 No 265 >protein:vir:105154 Length: 525 # NCBI annotation: conserved phage-related protein # Family: family:all:6660 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398597;genbank:gi:80159853;genbank:GeneID:3772992 Probab=46.91 E-value=0.72 Score=21.37 Aligned_cols=432 Identities=16% Similarity=0.200 Sum_probs=181.9 Q ss_pred CChHH-HHHHHHHHHHHHhhcccchhhh----ccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecch Q lcl|NC_019418. 1 MSLIQ-KVKDFFNRGRYNMTTSHLSSIL----DHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPI 75 (527) Q Consensus 1 m~~~~-~~k~~~~~~~~~~~~~~~~~~~----~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl 75 (527) -+-.+ .+-.|+..-...|+......-. -.-=++-|+++++.|..-..||.= +. T Consensus 35 y~ty~~~~~~f~~gfv~~~~~ng~i~~v~~~~l~~~f~npd~~~~~i~~l~~y~yi-------------------~~--- 92 (525) T protein:vir:10 35 YNTYDDVVDAFIDGFVMDLCNNGKIKTVNLDTLQLWFNNPDKYINNIVNLLTYYYI-------------------ID--- 92 (525) T ss_pred cchhhhHHHHHHHHHHHHhhcCCceeeeeHHHHHhhhcChHHHHHHHHHHHHHhhh-------------------hc--- Confidence 22222 2344555545555544322100 000112255666666655555421 01 Q ss_pred HHHHHHHHhhhhhcccce---EeeC------C---HHHHHHHHHHHhhhhHHHHHHHHHHHHHhcCCEEEEEEEeC-Cee Q lcl|NC_019418. 76 ARTAAKKIASLVYNEQAE---ISAE------D---ETLNDFLSDMLSNDRFNKNFERYLESALALGGLAMRPYVDG-DKI 142 (527) Q Consensus 76 ~~~i~~~~A~ll~~e~~~---i~~~------d---~~~~~~l~~~l~~n~f~~~~~~~~~~a~~~G~~~~~~~~d~-~~~ 142 (527) .-+.++-+|+|+-||- |.+- + +..+.+|.+-++. .+.-+.++.+.+-.|+ .+-.|.-. ..| T Consensus 93 --~~v~ql~~li~~lp~l~y~i~~~~~~k~~~~~~s~~n~~l~k~i~h---k~ltrdll~q~a~~gt-lig~wlg~~~~p 166 (525) T protein:vir:10 93 --GNVFQLYDLIFSLPPLDYQIKVLKRDKDYKEDLSTINLYLEKKIQH---KQLTRDLLVQLAHSGT-LIGTWLGSKREP 166 (525) T ss_pred --chHHHHHHHHHhcCCcceeehhhhhccchhhHHHHHHHHHHHhHHH---HHHHHHHHHHhhccCc-eeEeeecCCCCc Confidence 1134556777776641 2211 1 2345556554433 2333445555544444 44455532 222 Q ss_pred EEE-EEcCCceEEEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEec----- Q lcl|NC_019418. 143 RVA-FIQAPVFLPLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKS----- 216 (527) Q Consensus 143 ~i~-~v~a~~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~----- 216 (527) -+. |-.-..+||- ...+|...||+-..+.. + |.-+|+.+ +...-.+|.-+|..-++ T Consensus 167 y~~vf~~~kyvfp~-~r~~g~~v~vid~~~f~---~----~~~~~r~~----------~~~~lsp~i~~~~y~~~~~~~~ 228 (525) T protein:vir:10 167 YFNVFNNLKYVFPY-GRAKGKMVAVIDLQWFD---E----MSELERKL----------TFENLSPLITENKYKKWKEYNG 228 (525) T ss_pred chhhhhhhhhhccc-cccCCceEEEEehHHhh---h----hhHHHHHH----------HHHhhchhhhhhhhhHHhhccc Confidence 211 1233446673 44566677765322110 0 11122211 01112233222221111 Q ss_pred CCccccCceeecccccCCcccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHH-cC Q lcl|NC_019418. 217 TSDSQLGERVNLSELYPDLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIK-MG 295 (527) Q Consensus 217 ~~~~~lG~~v~l~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~-~~ 295 (527) .+.+.+ + |..|+ +++.+..-+-.+. -|...+.|||..-|-++.-+.+ .++++ .. T Consensus 229 ~~~~~~-r-------~i~LP-------~e~t~~lr~~tl~-rnqrlG~s~vtp~l~dI~hk~k---------lrd~EqsI 283 (525) T protein:vir:10 229 ENEDAL-R-------YIMLP-------ISKTLVARIHTLS-RNQRLGIPYGTQTLFDIQHKQK---------LRDLEQSI 283 (525) T ss_pred ccchhh-e-------eeecc-------cceeEEeeecccc-cCcccCcchhhhHHHHHHHHHH---------HHHHHHHH Confidence 111100 0 11111 2222211111121 1223356676666655543332 22221 12 Q ss_pred cceeeechhHhcCCCCCCCcccccccccccccceeee----ccCCCCCCCcceEec---------cccCh------HHHH Q lcl|NC_019418. 296 QRRVIVPEQMTQLKVQDNQGNIAFKRRFDVEQNVYMQ----VGAGNMDSGGIVDLT---------TPIRS------SDYI 356 (527) Q Consensus 296 ~~~i~v~~~~l~~~~~~~~~~~~~~~~~d~~~~~~~~----~~~~~~~~~~i~~~~---------~~ir~------e~~~ 356 (527) ..+|+-|-.+|...++... ++..+. ..+|++-.+ ++.+-++++|+..++ |+|.. .+ T Consensus 284 A~kii~a~avLk~gg~~gn-~mk~p~--~~kqkil~gVk~aleK~~kdK~Gi~vi~~Pdfa~~efp~ik~~~~glDg~-- 358 (525) T protein:vir:10 284 ADKIIKAMAVLKFRGKDDN-DSKVKE--SAKRKVLAGVKRALEKGVKDKNGIACIAMPDFATFEFPEIKNGDKTLDPK-- 358 (525) T ss_pred HHHhhhhheeeeeccccCc-cccCch--HHHHHHHHHHHHHHhcccccccCeEEEeccceeecccccccCcccCCCch-- Confidence 2334444445544443322 222111 012333222 223334444555432 22221 22 Q ss_pred HHHHHHHHHHHHhcCCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCc Q lcl|NC_019418. 357 SAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPEL 436 (527) Q Consensus 357 ~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~ 436 (527) -++.+-..|-...|+|..-...+++.-+||. . ..--.|.++.-+...++.+.++|+..++--++ . T Consensus 359 -K~d~I~~DI~~A~GlS~sL~nGdggNyAtas--l-nld~fykkigVm~e~Iee~y~kL~d~Vl~~~k-----------~ 423 (525) T protein:vir:10 359 -KYDSIDNDITNATGISQVLTNGTKGNYASAK--L-NLDVFYKKIGVMLEIIEEIYNQLIDIILGEEK-----------G 423 (525) T ss_pred -hhhhhhhhhhhhhccceeeecCCCCceeeee--e-eHHHHHHHHHHHHHHHHHHHHHHHhhhcCccc-----------C Confidence 3444555677888999665444444443332 1 11235577777777788888888887663222 1 Q ss_pred cceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCC-H---HHHHHHHHHHH--HhcccccccccCCCCCCC Q lcl|NC_019418. 437 DDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKRGIAKTLGIT-E---EEAEKELAEIN--GELPPESDAELALYGKGQ 510 (527) Q Consensus 437 ~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~~i~~~~~~~-d---eea~~el~ri~--~E~~~~~~~~~~~~~~~~ 510 (527) ..--++++.+-|.+.++..+.+.++..-|..++.- +. +.|++ | |++--|.++.+ +.+.+.. ....+.| .+ T Consensus 424 ~nyifnydkd~pi~~kkk~d~LIkL~d~g~s~k~v-ld-l~gis~e~y~E~s~yEtE~lkl~EKi~pp~-~~~v~SG-k~ 499 (525) T protein:vir:10 424 CNYIFQYNKDTPIEREKKLDTLIKLEAQGYSAKYV-LD-ILGISSEEYFEESIYEIEKLKLREKIMPPL-NTNVLSG-KD 499 (525) T ss_pred cceEEecCCCchhhhhhhhhhhhhhhccchhhhhh-hh-hhccCcchHHHHHHHHHHHHHHhhhccccc-cceeeec-cc Confidence 22335567778888999999999999888865543 43 55543 2 23334444443 2222221 1112222 11 Q ss_pred CCCCC----CCCCCCCccccC Q lcl|NC_019418. 511 QNTVG----NSKDTVDDEDEA 527 (527) Q Consensus 511 ~~~~~----~~~~~~~~~~~~ 527 (527) -++.+ ++.++++..-+| T Consensus 500 ~n~iG~P~~dd~~~~dati~s 520 (525) T protein:vir:10 500 GNDIGSPKLDDSDSSDATIES 520 (525) T ss_pred cccccCCccCCCcchhhhhhh Confidence 22222 222333333333 No 266 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=26.95 E-value=1.9 Score=19.06 Aligned_cols=371 Identities=8% Similarity=0.010 Sum_probs=120.8 Q ss_pred HHHHhhcccchhhhccCc-cccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHHHHHhhhhhcccc Q lcl|NC_019418. 14 GRYNMTTSHLSSILDHPK-VAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAAKKIASLVYNEQA 92 (527) Q Consensus 14 ~~~~~~~~~~~~~~~~~~-i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~~~~A~ll~~e~~ 92 (527) || +.+.+..++ ..++. +. .+ ...... ..+..+...--..+++.+|+-+-+-|. T Consensus 1 MG-------lf~~~~~~~~~~~~~-----~~------~~--~~~~~~------~~~~~~~~~~v~~~I~~ia~~iA~lp~ 54 (395) T protein:vir:98 1 MG-------ILDFFSFKKSGTLSD-----DD------SG--STTSEK------LTNVVLKEDALYKCVNYLARIISKSTF 54 (395) T ss_pred Cc-------chhhhcCCCcccccc-----cc------cc--hhhhhh------cchhhhhhHHHHHHHHHHHHHHhhCce Confidence 33 111111110 00000 00 00 000000 000001111112334444544444343 Q ss_pred eEeeCC-HH-HHHHHHHHHhh--hh---HHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcCCceEEEEEcCCceEEE Q lcl|NC_019418. 93 EISAED-ET-LNDFLSDMLSN--DR---FNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQAPVFLPLQSNTQDVSSA 165 (527) Q Consensus 93 ~i~~~d-~~-~~~~l~~~l~~--n~---f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a~~~~P~~~d~~~~~~~ 165 (527) .+--.+ +. ...-+..+|.. |. ...-.+..+...+-.|.+++.|..++.. +.|+.+..+. . T Consensus 55 ~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~~~-----~~~~~~~~~~--------~ 121 (395) T protein:vir:98 55 RLKTPEKLTENQKDWLYWINTKANPNQSASQFWVEVIQKLLVDGETLIFVIPGKGI-----YVADSFTQDK--------K 121 (395) T ss_pred eEEecCCcccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeCCce-----ecCCcccccc--------c Confidence 322111 11 11122233321 11 1222344566666678888776554321 1121111100 0 Q ss_pred EEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCcccceeecCCC Q lcl|NC_019418. 166 AILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDLQPVTPIQGLS 245 (527) Q Consensus 166 a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~g~~ 245 (527) . ....++. +. ...|.+.. .|. +. T Consensus 122 ~----------~~~~~~~-~~-----------------~~~~~~~~-~~~--------------------~~-------- 144 (395) T protein:vir:98 122 I----------SGSQFKV-SR-----------------VQGQTYEK-TFT--------------------FD-------- 144 (395) T ss_pred c----------cCcccce-ee-----------------ecCceeee-Eec--------------------Cc-------- Confidence 0 0000110 00 00111110 010 00 Q ss_pred cccEEEecCCccccccCCCccCcchhhhhHHHHH-HHHHHHHHH-HHHHHcCcceeeechhHhcCCCCCCCccccccccc Q lcl|NC_019418. 246 RPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTID-FINRTYDEF-MWEIKMGQRRVIVPEQMTQLKVQDNQGNIAFKRRF 323 (527) Q Consensus 246 ~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid-~ld~~~s~~-~~e~~~~~~~i~v~~~~l~~~~~~~~~~~~~~~~~ 323 (527) -..|||....+. .+.+.+.+..+..++. .++...... .+-+..+...-.++............ + .....+ T Consensus 145 --evih~k~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~ 216 (395) T protein:vir:98 145 --QVIYLKNDNSDL----MSKVESLWEEYGELLGHVINNQKIANQIRFTMIPPKDKVRERAQENSDGGRQS-K-SDKDFF 216 (395) T ss_pred --cEEEecCCCCCc----cccccchhhhHHHHHHHHHHHHHHHHHHHHhhccccccccccccccCCcHHHH-H-HHHHHH Confidence 023454322221 1222222332222221 112111110 11111111111111111000000000 0 000000 Q ss_pred ccccceeeeccCCC------CCCCcceEec------cccChHHHHHHHHHHHHHHHHhcCCCcccccccccccchHHHHH Q lcl|NC_019418. 324 DVEQNVYMQVGAGN------MDSGGIVDLT------TPIRSSDYISAISEGLKLFEMQIGVSSGMFTFDGQGVKTATEIV 391 (527) Q Consensus 324 d~~~~~~~~~~~~~------~~~~~i~~~~------~~ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~~g~~TAtei~ 391 (527) +..+.+..... ..+..++.++ ....+.++.+.......+|+...|++|..++...+ +..+. T Consensus 217 ---~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l~~~~s---n~e~~- 289 (395) T protein:vir:98 217 ---KRTVEKIRTESVVGIPVTANTNYEEYGSKNTGAVKSYVDDIKKLKDQYMAEFAEMLGIPISLLHGDIA---DNQKN- 289 (395) T ss_pred ---HHHHhhhhcCCcceeecCCCceeEecccccccccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCcc---cHHHH- Confidence 00011111000 0111122222 12345678888888888999999999998863211 11111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHH Q lcl|NC_019418. 392 SENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKMVAAGFATQKR 471 (527) Q Consensus 392 s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~aGi~s~~~ 471 (527) ...-...++.-....++.+|..- ++... .....+.|+|++-+..|..+.++...+++..|+|++-+ T Consensus 290 -~~~f~~~tl~P~~~~ie~~l~~k------------ll~~~-~~~~g~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE 355 (395) T protein:vir:98 290 -YELLLEGPIESLITNIVDGLEYA------------IFDKS-ETLQGSFIKVTGLKNYDLFSISNQADKLISSGFVFIDE 355 (395) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHh------------cCChh-hhcCcceeeehhhhccCHHHHHHHHHHHHhCCCcCHHH Confidence 01111122222222233332210 11100 01122457788878889999999999999999999999 Q ss_pred HHHhc--CCCCHHHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_019418. 472 GIAKT--LGITEEEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDT 520 (527) Q Consensus 472 ~i~~~--~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (527) ++... .++.++...+-+.. ....+ +..++++++. +.+| T Consensus 356 ~R~~~g~~Pi~~~~gD~~~~~--~n~~~-------~~~~gge~~~--~~~~ 395 (395) T protein:vir:98 356 VREEIGLPELPDGLGKVLYMT--KNYES-------VLERGGEVDE--EVET 395 (395) T ss_pred HHHHhCCCCCCCCCCceeeec--cccee-------cccccCCCCC--CCCC Confidence 77553 23333222211110 00000 1111222111 1111 No 267 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=25.86 E-value=2 Score=18.92 Aligned_cols=364 Identities=11% Similarity=0.047 Sum_probs=118.7 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) ||+|++++.. ....+.. . ++.. .+... ..+.. +...--..++ T Consensus 1 Mgl~d~~~~~-----------~~~~~~~------~------------~~~~---~~~~~--~~~~~----l~~~~v~~~i 42 (395) T protein:vir:96 1 MGILDFFSFK-----------KSGTLSD------D------------DSGS---TTSEK--LTNVV----LKEDALYKCV 42 (395) T ss_pred CcchhhhcCC-----------CCccccc------c------------cccc---chhhh--cchhh----hhhHHHHHHH Confidence 9998765331 0000000 0 0000 00000 00000 0000011233 Q ss_pred HHHhhhhhcccceEeeCCH--HHHHHHHHHHhh--h---hHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcCCceE Q lcl|NC_019418. 81 KKIASLVYNEQAEISAEDE--TLNDFLSDMLSN--D---RFNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQAPVFL 153 (527) Q Consensus 81 ~~~A~ll~~e~~~i~~~d~--~~~~~l~~~l~~--n---~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a~~~~ 153 (527) +.+|+-+-.-|..+.-.+. .....+..+|.. | ....-...++...+-.|.+++.+..+.+. +.++.+ T Consensus 43 ~~Ia~~ia~lp~~v~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~~~~-----~~~~~~- 116 (395) T protein:vir:96 43 NYLARIISKSTFRIKAPEKLTENQKDWLYWINTKANPNQSASQFWVEVVQKLLVDGETLIFVIPGKGI-----YVADAF- 116 (395) T ss_pred HHHHHhhccceeEEEeCCccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEcCCce-----ecCCcc- Confidence 3333333332322221111 111122233321 1 11222334555566667777766544321 111111 Q ss_pred EEEEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccC Q lcl|NC_019418. 154 PLQSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYP 233 (527) Q Consensus 154 P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~ 233 (527) +... . + ....|+. +. ...|.+... | +- T Consensus 117 ~~~~---~----~----------~~~~~~~-v~-----------------~~~~~~~~~-~------------~~----- 143 (395) T protein:vir:96 117 TQDK---K----L----------SGNKFKV-SR-----------------VQGQTYEKI-F------------TF----- 143 (395) T ss_pred cccc---c----c----------ccceeee-ee-----------------eccceeeeE-e------------cc----- Confidence 1000 0 0 0001110 00 001111110 0 00 Q ss_pred CcccceeecCCCcccEEEecCCccccccCCCccCcch---hhhhHHHHHHHHHHHH--HHH-HHHHcCcceeeechhHhc Q lcl|NC_019418. 234 DLQPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSI---FDNAKTTIDFINRTYD--EFM-WEIKMGQRRVIVPEQMTQ 307 (527) Q Consensus 234 ~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~---~~~~~~lid~ld~~~s--~~~-~e~~~~~~~i~v~~~~l~ 307 (527) . -..||+.+... ..+.|-+. +..+..+.-++...-+ ++. +-+..+.. +...+. T Consensus 144 ---~----------dvih~k~~~~~----~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~----~~~~~~ 202 (395) T protein:vir:96 144 ---D----------QVIYLKNDNSD----LMLKVESLWEEYGELLGHVINNQKIANQIRFTMTPPKDKVR----ERAQEN 202 (395) T ss_pred ---C----------ceEEecccCCc----cccccccccchHHHHHHHHHHHHHHHHHHHHHhhhcccccc----cceeec Confidence 0 02344432111 11112222 2233333322221111 111 11222211 111111 Q ss_pred CCCCCCCcccccccccccccceeeeccCCCC------CCCcceEeccccChH------HHHHHHHHHHHHHHHhcCCCcc Q lcl|NC_019418. 308 LKVQDNQGNIAFKRRFDVEQNVYMQVGAGNM------DSGGIVDLTTPIRSS------DYISAISEGLKLFEMQIGVSSG 375 (527) Q Consensus 308 ~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~------~~~~i~~~~~~ir~e------~~~~~~~~~l~~i~~~~g~s~~ 375 (527) ........ .....+ +..+...+.+.. ++..++.++..-... ++.+......++|+...|++|. T Consensus 203 ~~~~~~~~--~~~~~~---~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~ 277 (395) T protein:vir:96 203 SDGGRQPK--SDKDFF---KRTIEKIRTESVVGIPVTANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPIS 277 (395) T ss_pred cCchhhHH--HHHHHH---HHHHHHhhcCCcceEEccCCceeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHH Confidence 11111100 000000 111111111110 111223333332222 3333444556789999999999 Q ss_pred cccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHH Q lcl|NC_019418. 376 MFTFDGQGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAEL 455 (527) Q Consensus 376 ~~~~~~~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~ 455 (527) .++...+ ++.+. .+..++.+|..++..|....+. .+..... -.....++|+.-+..|..+.+ T Consensus 278 ~l~~~~s---n~e~~-------------~~~f~~~~L~P~~~~ie~~l~~-~Ll~~~e-~~~~~~f~~~~l~~~d~~~~~ 339 (395) T protein:vir:96 278 LLHGDIA---DNQKN-------------YELLLEGPIESLITNIVDGLEY-AIFDKSE-TLEGSFIKVTGLKNYDLFSIS 339 (395) T ss_pred HhcCCCc---cHHHH-------------HHHHHHHHHHHHHHHHHHHHHh-hcCChhh-hcCceeEeecchhccCHHHHH Confidence 8863222 12221 1122333343333333321110 1111110 012245778777888999999 Q ss_pred HHHHHHHhcCCCCHHHHHHhcCCCCH---HHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCCCccccC Q lcl|NC_019418. 456 DYWMKMVAAGFATQKRGIAKTLGITE---EEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTVDDEDEA 527 (527) Q Consensus 456 ~~~~~~~~aGi~s~~~~i~~~~~~~d---eea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (527) +...+++.+|+|++-+++... |+.. .+..+-+. .....+ +. +.+|++++. T Consensus 340 ~~~~~~~~~G~~T~NE~R~~~-gl~pi~~~~gD~~~~--~~N~~~-------~~------------~~gge~~~~ 392 (395) T protein:vir:96 340 SQADKLISSGFVFIDEVREEI-GLPELPDGLGKVLYM--TKNYES-------VL------------ERGGEVDEE 392 (395) T ss_pred HHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCCceeee--ccccee-------ch------------hccCCCCCC Confidence 999999999999998876543 4432 21111100 000000 00 111111111 No 268 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=25.31 E-value=2.1 Score=18.85 Aligned_cols=354 Identities=10% Similarity=0.053 Sum_probs=109.5 Q ss_pred CChHHHHHHHHHHHHHHhhcccchhhhccCccccCHHHHHHHHHHHHHhcCCCcccccccccCccccCceeecchHHHHH Q lcl|NC_019418. 1 MSLIQKVKDFFNRGRYNMTTSHLSSILDHPKVAVTQSEFRRIQHNLAYYQSKFDDIEYTNTDGDRKRRKMQHLPIARTAA 80 (527) Q Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~y~g~~~~l~~~~~~~~~~~~~~~~lnl~~~i~ 80 (527) ||+|+++ |++ +.-..... +.+. +.. ++ .+ ..+....-..++ T Consensus 1 Mg~f~~l---~~~-------~~~~~~~~------~~~~----------~~~----~~-----~~----~~l~~~~v~~~i 41 (376) T protein:vir:78 1 MGFFSEL---FKR-------NKEIEWMW------DLDF----------LED----KT-----TK----VYLKKMALNTCV 41 (376) T ss_pred Cchhhhh---hcc-------CCcccccc------chhh----------ccc----cc-----hh----hhhhhHHHHHHH Confidence 9998865 332 00000000 0000 000 00 00 000001112233 Q ss_pred HHHhhhhhcccceEeeCCHHHHHHHHHHHh-h-h---hHHHHHHHHHHHHHhcCCEEEEEEEeCCeeEEEEEcCCceEEE Q lcl|NC_019418. 81 KKIASLVYNEQAEISAEDETLNDFLSDMLS-N-D---RFNKNFERYLESALALGGLAMRPYVDGDKIRVAFIQAPVFLPL 155 (527) Q Consensus 81 ~~~A~ll~~e~~~i~~~d~~~~~~l~~~l~-~-n---~f~~~~~~~~~~a~~~G~~~~~~~~d~~~~~i~~v~a~~~~P~ 155 (527) +.+|+-+-.-|..+.-.+.....-+..+|. . | .........+...+-.|.+++.+..++++.... .+|+ T Consensus 42 ~~Ia~~ia~~p~~~~~~~~~~~~~l~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~~~~~~------~~~~ 115 (376) T protein:vir:78 42 KHIARTIAKSDFRLKNGETSVRDKLYYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDDFLIAD------SYVR 115 (376) T ss_pred HHHHHhhcccceeeccccccccchHHHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCCeeecc------ceee Confidence 333333322222221111111111222221 1 1 112222334455555677777665555432111 1222 Q ss_pred EEcCCceEEEEEEEEEEeeCCCcceEEEEEEEEeecccccccceeeecCCceEEEEEEEecCCccccCceeecccccCCc Q lcl|NC_019418. 156 QSNTQDVSSAAILTKTIKTENRKNVYYTLVEFHEWVTPTGQEVGSTKDKSLYRITNELYKSTSDSQLGERVNLSELYPDL 235 (527) Q Consensus 156 ~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l 235 (527) .. .......+ .. ++.-+++. +.......|-|--|-....... T Consensus 116 ~~--~~~~~~~~-~~-----------~~~~~~~~-----------~~~~~~~evih~~~~~~~~~~~------------- 157 (376) T protein:vir:78 116 KE--FAFFPDVF-EG-----------VTVKDYRY-----------NRNFSMDDVIFLEYGNERLSAF------------- 157 (376) T ss_pred cc--cceeeeee-ee-----------eeeeccee-----------eeeeccccEEEeccCCCCchhh------------- Confidence 10 00000000 00 00000000 0000000111100100000000 Q ss_pred ccceeecCCCcccEEEecCCccccccCCCccCcchhhhhHHHHHHHHHHHHHHHHHHHc-Ccceeee--chhHhcCCCCC Q lcl|NC_019418. 236 QPVTPIQGLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRTYDEFMWEIKM-GQRRVIV--PEQMTQLKVQD 312 (527) Q Consensus 236 ~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~ld~~~s~~~~e~~~-~~~~i~v--~~~~l~~~~~~ 312 (527) +.+....+..++.. ....+ +.. +-+.+.+ +...+ +. T Consensus 158 -------------------------------~~~~~~~~~~~~~~---~~~~~---~~~~~~~~~~~~~~~~~~----~~ 196 (376) T protein:vir:78 158 -------------------------------TDGMFEDYGELFGK---MIRAQ---MRNFQIRGAVNFKMAGVA----DK 196 (376) T ss_pred -------------------------------hhHHHHHHHHHHHH---HHHHH---HhcCCCceeEEEccCCCC----CH Confidence 00111111111111 11111 111 1111111 00000 00 Q ss_pred CCcccccccccccccceeeeccCCC------CCCCcceEeccc-----cChHHHHHHHHHHHHHHHHhcCCCcccccccc Q lcl|NC_019418. 313 NQGNIAFKRRFDVEQNVYMQVGAGN------MDSGGIVDLTTP-----IRSSDYISAISEGLKLFEMQIGVSSGMFTFDG 381 (527) Q Consensus 313 ~~~~~~~~~~~d~~~~~~~~~~~~~------~~~~~i~~~~~~-----ir~e~~~~~~~~~l~~i~~~~g~s~~~~~~~~ 381 (527) +. .-.....| +..|.+..... .++..++.++.. ....++.+..+...++|+...|++|..++... T Consensus 197 e~-~~~~~~~~---~~~~~g~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~~~~ 272 (376) T protein:vir:78 197 DK-QTKLQEYI---DKVYASFNNNEIAIVPQLEGFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLHGDM 272 (376) T ss_pred HH-HHHHHHHH---HHHhccccccCcceEEcCCCceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC Confidence 00 00000000 00111100000 001112222211 12346788888888899999999999986432 Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCcccCccceEEEeCCCccCCHHHHHHHHHHH Q lcl|NC_019418. 382 QGVKTATEIVSENSDTYQMRNSIVALVEQSIKELCVSMCELGKVVGIYRGTIPELDDISVNLDDGVFTDRHAELDYWMKM 461 (527) Q Consensus 382 ~g~~TAtei~s~~~~~~~~~~~~~~~~~~al~~li~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~ 461 (527) ++ ..+. . ...+..+|..++..|-...+. .++. +....+.+++..-+..|..+.++...++ T Consensus 273 s~---~e~~---~----------~~f~~~~l~P~~~~ie~~l~~-kll~---~~~~~~~~~~~~ll~~d~~~~~~~~~~~ 332 (376) T protein:vir:78 273 AD---LSNN---M----------KAYMEYCIDPLTKKLEDELNA-KLFT---FSEFLAGEHIKIIHKKDIIENAEAVDKL 332 (376) T ss_pred CC---HHHH---H----------HHHHHHHHHHHHHHHHHHHHh-hhCC---cccceecccchhhcccCHHHHHHHHHHH Confidence 22 1211 1 112223333333333222110 0111 1122233344445567888888888999 Q ss_pred HhcCCCCHHHHHHhcCCCCH---HHHHHHHHHHHHhcccccccccCCCCCCCCCCCCCCCCCC Q lcl|NC_019418. 462 VAAGFATQKRGIAKTLGITE---EEAEKELAEINGELPPESDAELALYGKGQQNTVGNSKDTV 521 (527) Q Consensus 462 ~~aGi~s~~~~i~~~~~~~d---eea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 521 (527) +.+|+|++-+++... |+.. .++.+- . .+. .-.+.++.++|| T Consensus 333 ~~~G~~t~NE~R~~l-g~~p~~~g~~d~~--------------~--~~~--n~~~~~~~~e~g 376 (376) T protein:vir:78 333 VASGSFNRNEVRELL-GAERVDNPELDKY--------------L--ITK--NYQSADEGGEDG 376 (376) T ss_pred HhCCCcCHHHHHHHh-CCCCCCCCCCcee--------------e--ecc--CceehhccccCC Confidence 999999988866443 4432 111000 0 000 011111112222 Done!