Query lcl|NC_015287.1_cdsid_YP_004324197.1 [gene=gp20] [protein=portal vertex protein of head] [protein_id=YP_004324197.1] [location=123925..125616] Match_columns 563 No_of_seqs 76 out of 89 Neff 4.0 Searched_HMMs 1612 Date Thu Nov 7 15:33:53 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_144 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_144_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:104892 Length: 558 100.0 3E-285 2E-288 1580.6 46.7 554 1-562 1-558 (558) 2 protein:vir:106999 Length: 564 100.0 3E-283 2E-286 1569.4 45.3 552 1-563 1-561 (564) 3 protein:vir:104500 Length: 537 100.0 5E-281 3E-284 1557.5 46.7 536 1-563 1-537 (537) 4 protein:vir:103177 Length: 533 100.0 4E-279 2E-282 1546.9 45.6 530 1-557 1-533 (533) 5 protein:vir:81017 Length: 521 100.0 7E-275 5E-278 1523.4 45.6 503 1-522 8-521 (521) 6 protein:vir:6896 Length: 523 # 100.0 9E-275 6E-278 1523.0 43.5 504 1-522 5-523 (523) 7 protein:vir:108049 Length: 524 100.0 2E-273 1E-276 1515.5 42.6 504 1-522 7-524 (524) 8 protein:vir:106282 Length: 521 100.0 3E-273 2E-276 1514.3 42.9 502 1-522 5-521 (521) 9 protein:vir:6596 Length: 521 # 100.0 2E-272 9E-276 1510.8 45.7 503 1-522 8-521 (521) 10 protein:vir:103458 Length: 524 100.0 7E-272 4E-275 1507.3 44.2 504 1-522 9-524 (524) 11 protein:vir:7208 Length: 524 # 100.0 8E-272 5E-275 1506.9 44.2 504 1-522 9-524 (524) 12 protein:vir:101806 Length: 516 100.0 2E-271 1E-274 1504.7 44.6 500 1-521 3-516 (516) 13 protein:vir:101189 Length: 516 100.0 2E-271 1E-274 1504.7 44.6 500 1-521 3-516 (516) 14 protein:vir:100598 Length: 516 100.0 6E-271 4E-274 1502.0 44.4 500 1-521 3-516 (516) 15 protein:vir:98265 Length: 524 100.0 2E-269 1E-272 1493.9 44.3 503 1-522 13-524 (524) 16 protein:vir:5665 Length: 511 # 100.0 3E-266 2E-269 1476.2 43.2 498 1-519 1-511 (511) 17 protein:vir:5839 Length: 533 # 100.0 5E-232 3E-235 1288.7 38.7 495 1-561 4-533 (533) 18 protein:vir:79538 Length: 502 99.5 6.2E-13 3.9E-16 87.5 32.3 474 1-553 1-502 (502) 19 protein:vir:96068 Length: 765 99.5 5.3E-13 3.3E-16 87.9 27.5 469 1-563 37-571 (765) 20 protein:vir:107742 Length: 537 99.4 7.9E-12 4.9E-15 81.4 29.7 458 4-563 1-534 (537) 21 protein:vir:94049 Length: 532 99.4 7.5E-12 4.7E-15 81.6 28.1 463 1-562 1-532 (532) 22 protein:vir:99563 Length: 862 99.3 1.8E-11 1.1E-14 79.4 27.3 463 1-563 39-590 (862) 23 protein:vir:5249 Length: 437 # 99.2 4.1E-10 2.5E-13 72.1 30.0 423 13-556 1-437 (437) 24 protein:vir:107662 Length: 427 99.1 6.9E-10 4.3E-13 70.8 27.5 404 23-559 1-427 (427) 25 protein:vir:95542 Length: 548 99.1 2.1E-09 1.3E-12 68.1 32.9 480 5-563 1-540 (548) 26 protein:vir:104338 Length: 422 99.1 1.9E-09 1.1E-12 68.5 27.7 397 23-552 1-422 (422) 27 protein:vir:96738 Length: 505 99.0 7.1E-09 4.4E-12 65.3 31.3 458 5-562 1-505 (505) 28 protein:vir:6382 Length: 553 # 99.0 8E-09 5E-12 65.0 32.0 495 1-562 1-553 (553) 29 protein:vir:10321 Length: 495 99.0 8.5E-09 5.3E-12 64.8 31.3 463 13-562 1-495 (495) 30 protein:vir:80040 Length: 461 98.9 2.2E-08 1.3E-11 62.6 28.1 436 1-551 1-461 (461) 31 protein:vir:389 Length: 530 # 98.9 2.2E-08 1.4E-11 62.6 32.9 477 1-556 3-530 (530) 32 protein:vir:78227 Length: 480 98.9 1.3E-08 8.3E-12 63.7 24.5 443 40-559 1-480 (480) 33 protein:vir:105782 Length: 449 98.9 1.6E-08 9.8E-12 63.4 24.4 411 1-546 1-449 (449) 34 protein:vir:79703 Length: 505 98.9 2.7E-08 1.7E-11 62.1 30.0 450 5-529 1-505 (505) 35 protein:vir:3420 Length: 533 # 98.8 3E-08 1.9E-11 61.8 32.0 481 9-556 1-533 (533) 36 protein:vir:102727 Length: 945 98.8 3.4E-08 2.1E-11 61.5 27.8 441 1-563 59-539 (945) 37 protein:vir:99916 Length: 504 98.7 7.2E-08 4.5E-11 59.7 26.4 466 13-560 1-504 (504) 38 protein:vir:95599 Length: 563 98.7 1.2E-07 7.7E-11 58.5 33.9 472 1-563 1-540 (563) 39 protein:vir:99312 Length: 563 98.7 1.2E-07 7.7E-11 58.5 33.9 472 1-563 1-540 (563) 40 protein:vir:79772 Length: 648 98.7 1.3E-07 8.2E-11 58.3 35.8 459 1-563 1-521 (648) 41 protein:vir:96240 Length: 511 98.6 1.6E-07 1E-10 57.8 30.2 458 1-553 1-511 (511) 42 protein:vir:9306 Length: 511 # 98.6 1.6E-07 1E-10 57.8 30.4 461 1-554 1-511 (511) 43 protein:vir:2341 Length: 488 # 98.6 1.7E-07 1.1E-10 57.7 25.7 449 23-545 1-488 (488) 44 protein:vir:9922 Length: 489 # 98.6 2.7E-07 1.7E-10 56.6 29.5 455 15-549 1-489 (489) 45 protein:vir:80959 Length: 499 98.6 3E-07 1.9E-10 56.3 28.1 445 1-535 1-499 (499) 46 protein:vir:99781 Length: 511 98.5 3.5E-07 2.2E-10 56.0 29.9 454 1-553 1-511 (511) 47 protein:vir:78537 Length: 480 98.5 3.9E-07 2.4E-10 55.7 24.9 442 40-561 1-480 (480) 48 protein:vir:38 Length: 496 # N 98.5 5.2E-07 3.2E-10 55.0 27.6 445 1-535 1-496 (496) 49 protein:vir:3648 Length: 695 # 98.5 5.2E-07 3.2E-10 55.0 25.0 451 1-563 69-588 (695) 50 protein:vir:106639 Length: 481 98.5 5.8E-07 3.6E-10 54.8 29.4 448 9-540 1-481 (481) 51 protein:vir:96366 Length: 511 98.4 6.1E-07 3.8E-10 54.6 30.3 460 1-554 1-511 (511) 52 protein:vir:78805 Length: 511 98.4 6.1E-07 3.8E-10 54.6 30.3 460 1-554 1-511 (511) 53 protein:vir:106571 Length: 499 98.4 6.4E-07 4E-10 54.5 29.3 455 5-555 1-499 (499) 54 protein:vir:1587 Length: 508 # 98.4 6.5E-07 4.1E-10 54.5 30.0 447 5-534 1-508 (508) 55 protein:vir:2500 Length: 501 # 98.4 7.3E-07 4.5E-10 54.2 28.0 444 13-562 1-501 (501) 56 protein:vir:80796 Length: 574 98.4 8.5E-07 5.3E-10 53.9 30.6 470 1-563 1-539 (574) 57 protein:vir:80644 Length: 551 98.4 9.6E-07 6E-10 53.6 36.9 454 1-563 1-518 (551) 58 protein:vir:96579 Length: 576 98.4 9.8E-07 6.1E-10 53.5 35.9 468 1-563 5-542 (576) 59 protein:vir:103951 Length: 511 98.4 1E-06 6.4E-10 53.4 31.1 459 1-554 1-511 (511) 60 protein:vir:4194 Length: 540 # 98.3 1.2E-06 7.3E-10 53.1 26.3 441 4-563 1-464 (540) 61 protein:vir:4223 Length: 486 # 98.3 1.7E-06 1E-09 52.2 23.2 460 16-554 1-486 (486) 62 protein:vir:98883 Length: 517 98.3 1.9E-06 1.2E-09 52.0 27.7 451 5-535 1-517 (517) 63 protein:vir:101541 Length: 694 98.2 2.1E-06 1.3E-09 51.7 25.4 457 1-563 68-587 (694) 64 protein:vir:3153 Length: 467 # 98.2 2.5E-06 1.5E-09 51.3 30.5 410 60-563 1-465 (467) 65 protein:vir:7768 Length: 484 # 98.2 2.5E-06 1.5E-09 51.3 26.3 458 19-555 1-484 (484) 66 protein:vir:105292 Length: 478 98.2 2.7E-06 1.7E-09 51.1 28.3 440 1-545 1-478 (478) 67 protein:vir:102080 Length: 429 98.2 3E-06 1.8E-09 50.9 26.6 418 1-563 4-428 (429) 68 protein:vir:4828 Length: 382 # 98.2 3.3E-06 2.1E-09 50.6 25.9 375 5-539 1-382 (382) 69 protein:vir:79647 Length: 435 98.2 3.5E-06 2.2E-09 50.5 29.5 411 5-554 1-435 (435) 70 protein:vir:93747 Length: 472 98.1 3.6E-06 2.2E-09 50.4 30.0 434 20-543 1-472 (472) 71 protein:vir:104082 Length: 485 98.1 4E-06 2.5E-09 50.2 26.9 454 16-554 1-485 (485) 72 protein:vir:95806 Length: 440 98.1 4.7E-06 2.9E-09 49.8 27.1 408 42-541 1-440 (440) 73 protein:vir:3028 Length: 500 # 98.1 4.9E-06 3E-09 49.7 33.3 440 1-529 15-500 (500) 74 protein:vir:9815 Length: 500 # 98.1 4.9E-06 3E-09 49.7 33.3 440 1-529 15-500 (500) 75 protein:vir:96179 Length: 468 98.1 5.2E-06 3.2E-09 49.6 28.2 425 1-538 1-468 (468) 76 protein:vir:99522 Length: 470 98.1 5.4E-06 3.3E-09 49.5 31.4 443 1-536 1-470 (470) 77 protein:vir:98444 Length: 434 98.1 5.5E-06 3.4E-09 49.4 31.1 422 43-549 1-434 (434) 78 protein:vir:105002 Length: 432 98.1 5.6E-06 3.5E-09 49.4 25.6 417 1-563 7-431 (432) 79 protein:vir:102855 Length: 432 98.1 5.6E-06 3.5E-09 49.4 25.6 417 1-563 7-431 (432) 80 protein:vir:107605 Length: 432 98.1 5.6E-06 3.5E-09 49.4 25.6 417 1-563 7-431 (432) 81 protein:vir:78907 Length: 518 98.0 6.7E-06 4.2E-09 48.9 30.2 456 5-533 1-518 (518) 82 protein:vir:1326 Length: 457 # 98.0 6.8E-06 4.2E-09 48.9 28.2 430 1-563 4-451 (457) 83 protein:vir:101418 Length: 569 98.0 7.8E-06 4.8E-09 48.6 27.1 496 1-541 15-569 (569) 84 protein:vir:5961 Length: 503 # 98.0 8.4E-06 5.2E-09 48.4 29.5 461 1-555 1-503 (503) 85 protein:vir:7407 Length: 392 # 97.9 1.1E-05 6.8E-09 47.8 22.9 383 1-550 2-392 (392) 86 protein:vir:95113 Length: 474 97.9 1.2E-05 7.4E-09 47.6 27.2 442 1-556 1-474 (474) 87 protein:vir:78589 Length: 695 97.9 1.2E-05 7.7E-09 47.5 25.6 446 1-563 69-588 (695) 88 protein:vir:97336 Length: 492 97.9 1.3E-05 8.3E-09 47.3 28.0 444 5-553 1-492 (492) 89 protein:vir:99072 Length: 479 97.9 1.4E-05 8.4E-09 47.3 27.8 464 1-561 1-479 (479) 90 protein:vir:97171 Length: 512 97.9 1.4E-05 8.5E-09 47.2 30.3 458 1-554 1-512 (512) 91 protein:vir:3843 Length: 397 # 97.8 1.6E-05 9.8E-09 46.9 28.7 395 5-556 1-397 (397) 92 protein:vir:94498 Length: 474 97.8 1.6E-05 9.8E-09 46.9 27.8 436 1-543 1-474 (474) 93 protein:vir:97447 Length: 474 97.8 1.6E-05 9.8E-09 46.9 27.8 436 1-543 1-474 (474) 94 protein:vir:105889 Length: 474 97.8 1.6E-05 1E-08 46.8 27.0 425 2-543 1-474 (474) 95 protein:vir:94101 Length: 474 97.8 1.6E-05 1E-08 46.8 27.0 425 2-543 1-474 (474) 96 protein:vir:106716 Length: 698 97.8 1.7E-05 1E-08 46.8 25.7 443 1-563 69-591 (698) 97 protein:vir:96980 Length: 409 97.8 1.7E-05 1E-08 46.8 26.0 397 1-563 1-404 (409) 98 protein:vir:94426 Length: 409 97.8 1.7E-05 1.1E-08 46.7 27.3 396 1-563 1-404 (409) 99 protein:vir:2427 Length: 485 # 97.8 1.7E-05 1.1E-08 46.7 22.4 457 16-554 1-485 (485) 100 protein:vir:1266 Length: 416 # 97.8 1.8E-05 1.1E-08 46.7 26.2 403 1-563 3-412 (416) 101 protein:vir:94805 Length: 492 97.8 1.8E-05 1.1E-08 46.5 28.6 443 5-543 1-492 (492) 102 protein:vir:9408 Length: 441 # 97.8 2E-05 1.2E-08 46.3 27.4 411 2-563 1-439 (441) 103 protein:vir:79984 Length: 441 97.8 2E-05 1.2E-08 46.3 27.4 411 2-563 1-439 (441) 104 protein:vir:4782 Length: 522 # 97.8 2.2E-05 1.4E-08 46.1 32.9 447 1-536 1-522 (522) 105 protein:vir:96266 Length: 474 97.7 2.3E-05 1.4E-08 46.0 29.0 436 1-553 1-474 (474) 106 protein:vir:95899 Length: 474 97.7 2.3E-05 1.4E-08 46.0 29.0 436 1-553 1-474 (474) 107 protein:vir:107112 Length: 478 97.7 2.6E-05 1.6E-08 45.7 29.0 443 1-562 1-478 (478) 108 protein:vir:100150 Length: 437 97.7 3.1E-05 1.9E-08 45.3 24.1 428 1-554 1-437 (437) 109 protein:vir:2683 Length: 412 # 97.6 3.6E-05 2.2E-08 45.0 28.2 393 5-563 1-407 (412) 110 protein:vir:1236 Length: 483 # 97.6 4E-05 2.5E-08 44.7 31.6 435 1-543 1-483 (483) 111 protein:vir:96494 Length: 501 97.6 4E-05 2.5E-08 44.7 28.8 450 1-552 33-501 (501) 112 protein:vir:7853 Length: 518 # 97.6 4.2E-05 2.6E-08 44.6 31.3 421 1-563 1-468 (518) 113 protein:vir:6240 Length: 457 # 97.6 4.3E-05 2.7E-08 44.5 29.2 431 1-563 4-457 (457) 114 protein:vir:105064 Length: 421 97.6 4.3E-05 2.7E-08 44.5 22.4 406 1-563 3-414 (421) 115 protein:vir:960 Length: 413 # 97.5 4.8E-05 3E-08 44.3 28.3 399 4-563 1-413 (413) 116 protein:vir:100882 Length: 383 97.5 4.9E-05 3.1E-08 44.2 24.5 376 1-553 1-383 (383) 117 protein:vir:1023 Length: 392 # 97.5 5.3E-05 3.3E-08 44.0 26.4 384 1-550 2-392 (392) 118 protein:vir:3989 Length: 392 # 97.5 5.3E-05 3.3E-08 44.0 26.4 384 1-550 2-392 (392) 119 protein:vir:80680 Length: 441 97.5 5.4E-05 3.4E-08 44.0 25.5 407 26-549 1-441 (441) 120 protein:vir:7987 Length: 456 # 97.5 6E-05 3.7E-08 43.7 23.1 412 23-546 1-456 (456) 121 protein:vir:9871 Length: 429 # 97.4 6.8E-05 4.2E-08 43.4 27.9 403 39-541 1-429 (429) 122 protein:vir:81095 Length: 416 97.4 6.8E-05 4.2E-08 43.4 29.2 404 1-563 1-414 (416) 123 protein:vir:4598 Length: 416 # 97.4 6.8E-05 4.2E-08 43.4 29.2 404 1-563 1-414 (416) 124 protein:vir:78641 Length: 278 97.4 7.2E-05 4.4E-08 43.3 21.8 268 87-451 1-278 (278) 125 protein:vir:102950 Length: 471 97.4 7.4E-05 4.6E-08 43.2 31.6 404 44-524 1-471 (471) 126 protein:vir:4952 Length: 386 # 97.4 7.5E-05 4.6E-08 43.2 25.1 380 5-539 1-386 (386) 127 protein:vir:63755 Length: 547 97.4 8.1E-05 5E-08 43.0 36.3 463 1-563 31-535 (547) 128 protein:vir:1380 Length: 422 # 97.4 8.3E-05 5.1E-08 43.0 27.2 415 1-562 1-422 (422) 129 protein:vir:93610 Length: 454 97.3 9.1E-05 5.6E-08 42.8 29.8 422 4-563 1-435 (454) 130 protein:vir:4995 Length: 384 # 97.3 9.2E-05 5.7E-08 42.7 25.3 377 5-539 1-384 (384) 131 protein:vir:3609 Length: 452 # 97.3 0.0001 6.5E-08 42.4 31.0 423 5-541 1-452 (452) 132 protein:vir:79043 Length: 479 97.3 0.00011 6.6E-08 42.4 28.2 432 15-536 1-479 (479) 133 protein:vir:4454 Length: 414 # 97.3 0.00011 7E-08 42.2 26.7 405 5-557 1-414 (414) 134 protein:vir:105461 Length: 470 97.2 0.00012 7.2E-08 42.1 32.1 410 44-535 1-470 (470) 135 protein:vir:98396 Length: 441 97.2 0.00013 8.3E-08 41.8 27.2 412 2-563 1-439 (441) 136 protein:vir:4337 Length: 434 # 97.0 0.00023 1.4E-07 40.5 25.0 421 1-552 1-434 (434) 137 protein:vir:102118 Length: 409 96.9 0.00025 1.6E-07 40.3 26.3 402 5-563 1-409 (409) 138 protein:vir:96839 Length: 474 96.9 0.00026 1.6E-07 40.3 28.0 435 1-547 1-474 (474) 139 protein:vir:8418 Length: 409 # 96.9 0.00029 1.8E-07 40.0 26.6 398 1-562 1-409 (409) 140 protein:vir:97060 Length: 432 96.9 0.00029 1.8E-07 40.0 22.4 414 1-563 1-430 (432) 141 protein:vir:100249 Length: 431 96.8 0.00031 1.9E-07 39.8 26.4 411 1-553 1-431 (431) 142 protein:vir:93943 Length: 409 96.8 0.00033 2E-07 39.7 28.3 398 1-563 1-404 (409) 143 protein:vir:2732 Length: 501 # 96.8 0.00036 2.2E-07 39.5 30.0 448 1-556 37-501 (501) 144 protein:vir:3964 Length: 453 # 96.8 0.00036 2.2E-07 39.5 30.6 424 13-541 1-453 (453) 145 protein:vir:80134 Length: 403 96.7 0.00043 2.7E-07 39.0 28.0 395 1-543 1-403 (403) 146 protein:vir:81072 Length: 432 96.6 0.00046 2.8E-07 38.9 25.3 414 1-563 1-430 (432) 147 protein:vir:4156 Length: 542 # 96.6 0.00046 2.8E-07 38.9 28.0 430 4-563 1-484 (542) 148 protein:vir:101648 Length: 518 96.6 0.00047 2.9E-07 38.8 32.0 431 1-563 1-468 (518) 149 protein:vir:8184 Length: 474 # 96.6 0.00049 3E-07 38.7 27.1 438 15-534 1-474 (474) 150 protein:vir:81152 Length: 411 96.6 0.00049 3E-07 38.7 25.9 402 5-563 1-411 (411) 151 protein:vir:101647 Length: 460 96.6 0.00049 3E-07 38.7 27.2 428 1-562 1-460 (460) 152 protein:vir:80333 Length: 419 96.4 0.00069 4.3E-07 37.9 22.7 404 5-563 1-414 (419) 153 protein:vir:95378 Length: 406 96.3 0.00075 4.6E-07 37.7 28.5 394 5-543 1-406 (406) 154 protein:vir:94666 Length: 723 96.3 0.00076 4.7E-07 37.7 26.4 413 20-563 1-446 (723) 155 protein:vir:10362 Length: 432 96.3 0.00081 5E-07 37.5 26.1 413 1-563 1-430 (432) 156 protein:vir:189 Length: 424 # 96.2 0.00093 5.8E-07 37.2 27.7 403 1-554 10-424 (424) 157 protein:vir:9702 Length: 406 # 96.0 0.0012 7.3E-07 36.6 25.2 390 5-563 1-399 (406) 158 protein:vir:102602 Length: 456 95.9 0.0012 7.7E-07 36.5 24.9 423 20-526 1-456 (456) 159 protein:vir:105819 Length: 456 95.9 0.0012 7.7E-07 36.5 24.9 423 20-526 1-456 (456) 160 protein:vir:483 Length: 413 # 95.6 0.0018 1.1E-06 35.6 25.9 402 1-551 3-413 (413) 161 protein:vir:78083 Length: 537 95.4 0.0021 1.3E-06 35.3 31.5 459 16-558 1-537 (537) 162 protein:vir:8100 Length: 466 # 95.3 0.0024 1.5E-06 35.0 27.8 429 5-563 1-463 (466) 163 protein:vir:5737 Length: 419 # 95.2 0.0025 1.5E-06 34.9 26.0 409 2-559 1-419 (419) 164 protein:vir:100187 Length: 385 95.2 0.0025 1.6E-06 34.8 25.5 376 1-538 1-385 (385) 165 protein:vir:1082 Length: 359 # 95.1 0.0027 1.7E-06 34.6 24.5 337 1-475 1-359 (359) 166 protein:vir:4854 Length: 386 # 95.1 0.0028 1.8E-06 34.5 27.5 379 2-539 1-386 (386) 167 protein:vir:4898 Length: 502 # 94.7 0.0037 2.3E-06 33.9 30.4 469 1-556 1-502 (502) 168 protein:vir:81218 Length: 423 94.6 0.0041 2.5E-06 33.7 27.1 410 5-555 1-423 (423) 169 protein:vir:9568 Length: 410 # 94.5 0.0042 2.6E-06 33.6 29.1 404 20-511 1-410 (410) 170 protein:vir:103219 Length: 201 94.3 0.0047 2.9E-06 33.3 12.2 191 283-540 1-201 (201) 171 protein:vir:94546 Length: 506 94.0 0.0058 3.6E-06 32.8 30.0 445 3-558 1-506 (506) 172 protein:vir:1431 Length: 419 # 93.8 0.0064 4E-06 32.6 23.3 400 3-563 1-414 (419) 173 protein:vir:102330 Length: 451 93.5 0.0072 4.4E-06 32.3 30.5 409 44-528 1-451 (451) 174 protein:vir:9359 Length: 348 # 93.5 0.0072 4.5E-06 32.3 23.4 334 85-563 1-343 (348) 175 protein:vir:1884 Length: 424 # 93.2 0.0086 5.3E-06 31.9 28.7 396 1-561 20-424 (424) 176 protein:vir:733 Length: 453 # 92.5 0.011 6.8E-06 31.3 27.1 416 1-542 5-453 (453) 177 protein:vir:79063 Length: 491 92.5 0.011 6.9E-06 31.3 29.3 425 1-563 1-453 (491) 178 protein:vir:4509 Length: 424 # 92.0 0.013 8.3E-06 30.9 28.0 396 1-563 19-422 (424) 179 protein:vir:100691 Length: 535 91.4 0.016 1E-05 30.4 29.6 459 1-563 1-533 (535) 180 protein:vir:9751 Length: 422 # 90.7 0.02 1.2E-05 29.9 26.9 417 7-510 1-422 (422) 181 protein:vir:1634 Length: 409 # 89.4 0.026 1.6E-05 29.2 25.6 407 7-495 1-409 (409) 182 protein:vir:108215 Length: 469 85.8 0.05 3.1E-05 27.7 30.9 442 13-563 1-466 (469) 183 protein:vir:8317 Length: 409 # 82.8 0.074 4.6E-05 26.8 22.9 373 1-532 4-409 (409) 184 protein:vir:99853 Length: 488 81.7 0.083 5.2E-05 26.5 27.6 421 9-563 1-440 (488) 185 protein:vir:4698 Length: 251 # 80.1 0.098 6.1E-05 26.1 16.8 245 5-331 1-251 (251) 186 protein:vir:107880 Length: 491 79.9 0.1 6.2E-05 26.1 30.9 426 1-563 1-453 (491) 187 protein:vir:94742 Length: 409 79.4 0.1 6.5E-05 26.0 25.8 404 7-495 1-409 (409) 188 protein:vir:99452 Length: 651 67.3 0.25 0.00016 23.8 27.8 485 1-563 1-552 (651) 189 protein:vir:3868 Length: 417 # 61.6 0.35 0.00022 23.1 23.8 395 5-553 1-417 (417) 190 protein:vir:101289 Length: 395 53.2 0.54 0.00033 22.1 23.0 378 5-563 1-390 (395) 191 protein:vir:100650 Length: 395 53.2 0.54 0.00033 22.1 23.0 378 5-563 1-390 (395) 192 protein:vir:9507 Length: 395 # 53.2 0.54 0.00033 22.1 23.0 378 5-563 1-390 (395) 193 protein:vir:104259 Length: 403 52.3 0.56 0.00035 22.0 25.6 390 5-562 1-403 (403) 194 protein:vir:95254 Length: 488 27.4 1.8 0.0011 19.1 28.0 459 1-563 1-488 (488) 195 protein:vir:1661 Length: 378 # 26.1 2 0.0012 18.9 21.1 365 5-563 1-375 (378) 196 protein:vir:6210 Length: 394 # 25.0 2.1 0.0013 18.8 24.8 380 5-563 1-391 (394) 197 protein:vir:94002 Length: 378 24.4 2.2 0.0014 18.7 21.5 367 5-556 1-378 (378) No 1 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=100.00 E-value=2.8e-285 Score=1580.57 Aligned_cols=554 Identities=64% Similarity=1.082 Sum_probs=525.5 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) ||+||||+++..++.+.+..|++||+++||++.++.||++++|++++++++|+++||++||+||+|||||+||++||||| T Consensus 1 m~~lfgf~~~~~~~~~~~~~s~~~p~~ddg~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVnea 80 (558) T protein:vir:10 1 MAKLFGFSIEETQKKSTSIISPVPKNNEDGVDNFISSGFYGQYVDIEGAYRSEYDLIRRYREMALHPEADGAIEDVVNEA 80 (558) T ss_pred CcchhcchhhhhhhhccCCccccCCCccccccceeccceeeeeecccchhhhHHHHHHHHHHHhhccchhhHHHHhhcce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCC Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDP 160 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP 160 (563) ||||++++||+|+|+++++|++||++|+|||++|+++|+|+++||+|||+||||||+|||||||++|||+||+|||+||| T Consensus 81 iv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRiyfHKiid~k~pk~GI~ELr~lDP 160 (558) T protein:vir:10 81 IVSDLYDSPVEVELSNLNASNTLKKKIREEFRYIKEMMDFDKKSHEIFRNWYVDGRVFYLKVIDTKNPQEGIQDLRYIDP 160 (558) T ss_pred eEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccccceeeeeeCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeecc Q lcl|NC_015287. 161 LKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSG 240 (563) Q Consensus 161 ~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSG 240 (563) |+|++||+++++..+.+.+.+++++.. .+.++.+.|||+|+|++..+++.+|.. +.++.||||+||||||||| T Consensus 161 r~i~~Vr~i~~~~~~~~~~~~~~~~~~-~~~~~~~~eyy~Y~~~~~~~~~~~~~~------~~~~~vkI~~dAI~y~hSG 233 (558) T protein:vir:10 161 LKIKFIRQEKRKPGNQDPAIRVRSEQD-VVPNPEFEEFYIYTPKVQHPTGMVGQM------GGKNSIKIAKDSITMCTSG 233 (558) T ss_pred ccceeeeeeccccccccceeeeecccc-eeeccceeEeeeecCCcccccccceee------cCCCceeechhheeeeccc Confidence 999999999999888888888887654 466799999999999988777665543 2345799999999999999 Q ss_pred eecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcc Q lcl|NC_015287. 241 LVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEI 320 (563) Q Consensus 241 l~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev 320 (563) |+|||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+| T Consensus 234 L~d~~~~~i~syLhkAIKp~NQLkmlEDAlVIYRitRAPERRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev 313 (558) T protein:vir:10 234 LVDRNKNRVLSYLHKAIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLKEVMSRYRNKLVYDANTGEV 313 (558) T ss_pred ceecCCCeeeecchHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCcee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHH Q lcl|NC_015287. 321 RDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIK 400 (563) Q Consensus 321 ~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElK 400 (563) +|++||||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++ ||++||++|||||||| T Consensus 314 ~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnLgem~DV~YF~kKLy~aLnVP~SRl~~e~-~f~~Gr~~EItRDEiK 392 (558) T protein:vir:10 314 RDDRKFMSMMEDFWLPRREGGRGTEITTLPGGQNLGELSDVDYFQKKLYRALGVPESRIAAEG-GFNLGRSSEILRDELK 392 (558) T ss_pred cccchhhhhHhhhcccccCCCCccceeeccccCCcchHHHHHHHHHHHHHHhCCCccccCCCC-cccccccchhhHHHHH Confidence 999999999999999999999999999999999999999999999999999999999999886 8999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhh Q lcl|NC_015287. 401 FTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFS 480 (563) Q Consensus 401 F~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S 480 (563) |+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++|||||||| T Consensus 393 F~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S 472 (558) T protein:vir:10 393 FAKFVGRLRKRFAAMFNDMLKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYS 472 (558) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCcccccccccc----CCCCcCCCCCcccccccCCCCcccCCC Q lcl|NC_015287. 481 LDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQ----MPGGAMNPQDPNMGGAPQEGGAEQAPQ 556 (563) Q Consensus 481 ~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~----~~~~~~~p~~~~~~~~~~~~~~~~~~~ 556 (563) ++||||+||+|||+||+++++||++|+++|+|++|+++++.+.++.+.. +.++...|.+|++..+++++.++-.-. T Consensus 473 ~dyi~k~ILr~tDeeI~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 552 (558) T protein:vir:10 473 TEYVRKRVLRQTDMEIEEIDTQIEDEIQKGIIPDPSQIDPITGEPLPQEGDPAMEGMGEQPVDPDLEAQAQAVDAQYSKD 552 (558) T ss_pred hHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCccccChhhccccCccCCchhccCCCCCcccccccchhhhhhhhhhh Confidence 9999999999999999999999999999999999999999988887654 556667788888888888777764433 Q ss_pred CCCCCC Q lcl|NC_015287. 557 MPEGGE 562 (563) Q Consensus 557 ~~~~~~ 562 (563) +.+-.- T Consensus 553 ~~~~~~ 558 (558) T protein:vir:10 553 TKKAEL 558 (558) T ss_pred hhhhcC Confidence 333222 No 2 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=100.00 E-value=3e-283 Score=1569.43 Aligned_cols=552 Identities=55% Similarity=0.933 Sum_probs=508.8 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccc--hhhhHHHHHHHHHHHhhccchhHHHHHhhc Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDG--VFKSEFELIKRYREMALHPEVDSAIEDIIN 78 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~--~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvn 78 (563) ||+||||++++.+ ..+..|++||+++|++++ ++|||+|++++.++ +++|+++||++||+||+|||||+||++||| T Consensus 1 m~~lfgf~i~~~~--~~~~~S~vpp~~~~~~~~-i~~g~~g~~v~~~g~~~~~n~~eLI~~YR~ma~~pEVd~Av~eIVn 77 (564) T protein:vir:10 1 MSQLFGFLINEKE--GQKGQSPVPPNDEASVST-VAGGYFGTYVDTSGGQNSRNEYELIRRYRDMSLHPEVDSAIDEIVN 77 (564) T ss_pred Ccchhcceeeeec--cCCCCCcccCCcCCChhh-hhccccceeeecccccchhhHHHHHHHHHHHhhccchhhHHHHhhc Confidence 9999999998855 445779999999988775 67899999998876 689999999999999999999999999999 Q ss_pred ceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEe Q lcl|NC_015287. 79 EAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYI 158 (563) Q Consensus 79 eaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~l 158 (563) |||||+++++||+|+|+|+++|++||++|+|||++||++|+|+++||+|||+||||||+||||||||+|||+||+|||+| T Consensus 78 eaIv~d~~~~pV~vdL~~~~~s~siK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~eLr~l 157 (564) T protein:vir:10 78 EFVVNDGDDKPVEVDLQNLEIGSGVKKKIRDEFNRILRMMNFNVNAHEIIRNWYVDGRSHYHKVIDLDNPKKGILELRYI 157 (564) T ss_pred ceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeeCCChhhhhhhhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeee Q lcl|NC_015287. 159 DPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCH 238 (563) Q Consensus 159 DP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~h 238 (563) |||+|++||++.++.. .+...+++ .....+.|..+.|||+|||+++..+..++. |+. +.+.+++||||+||||||| T Consensus 158 DPr~i~~vr~i~~~~~-~~~~~v~k-~~~~~~~y~~~~Eyy~Ynp~~~~g~~~~~~-~~~-~~~~~~~ikI~~daI~y~h 233 (564) T protein:vir:10 158 DSLKIRKVRQKLKDVD-PNRKEIEK-GTALQYDYGDFIEYYIYNPKGFAGNIPMVT-GSM-DWSNQEGIKIASDAIAQST 233 (564) T ss_pred cccceeeeeeeccccc-cccceeee-eeeeeccccccccceeeccccccCcccccc-ccc-ccccccceeechhhcceec Confidence 9999999999999853 34555554 466889999999999999998877665553 333 3344678999999999999 Q ss_pred cceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCC Q lcl|NC_015287. 239 SGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTG 318 (563) Q Consensus 239 SGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TG 318 (563) |||+|||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++|| T Consensus 234 SGL~d~~~~~i~gyLhkAIKp~NQLkmlEDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TG 313 (564) T protein:vir:10 234 SGLMDLNKKMTLSFLHKAIKSLNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLRDVMSRYRNKLVYDGQTG 313 (564) T ss_pred ccceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHH Q lcl|NC_015287. 319 EIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDE 398 (563) Q Consensus 319 ev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDE 398 (563) +|+|++||||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++++||++||++|||||| T Consensus 314 evrddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~DV~YF~kKLY~aLnVP~SRl~~e~~~f~~Gr~~EItRDE 393 (564) T protein:vir:10 314 EIRDDKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELKDVEYFKKKLYNSLNLPPSRLTDDNKAFNLGKSTEILRDE 393 (564) T ss_pred eecccchhhhhHhhhcccccCCCcccceeeccccCCcchHHHHHHHHHHHHHHhCCCcccccCCCceeecccccchhHHH Confidence 99999999999999999999999999999999999999999999999999999999999999997799999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchh Q lcl|NC_015287. 399 IKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKY 478 (563) Q Consensus 399 lKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy 478 (563) |||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|||++|||++|++++++++|||||| T Consensus 394 iKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky 473 (564) T protein:vir:10 394 LKFTKFIGRLRKRFAQLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVGKY 473 (564) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCccccc----ccCC---CCc Q lcl|NC_015287. 479 FSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGG----APQE---GGA 551 (563) Q Consensus 479 ~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~----~~~~---~~~ 551 (563) ||++||||+||+|||+||+++++||++|+++|++++|.++++.++.+. .++...|.++.|.. ...+ ..| T Consensus 474 ~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~~~~~~~~----~~~~~~p~~~~~~~~~~~~~~~~~~~~a 549 (564) T protein:vir:10 474 FSTEYIRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVNMLDDMEK----QNQAFAPELQAAQDDLAAEREIKKLNSA 549 (564) T ss_pred cchHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhhcCCCccC----CCCcCCcchhhhccccccccChhhhccC Confidence 999999999999999999999999999999999999999988876544 44444455544422 1111 223 Q ss_pred ccCCCCCCCCCC Q lcl|NC_015287. 552 EQAPQMPEGGEI 563 (563) Q Consensus 552 ~~~~~~~~~~~~ 563 (563) .++|+.++|+|- T Consensus 550 ~~~~~~~~~~~~ 561 (564) T protein:vir:10 550 PKPPPSQQSKSQ 561 (564) T ss_pred CCCCCCCCCcCc Confidence 367778888888 No 3 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=100.00 E-value=4.5e-281 Score=1557.52 Aligned_cols=536 Identities=54% Similarity=0.936 Sum_probs=502.0 Q ss_pred Cc-ccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcc Q lcl|NC_015287. 1 MA-KLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINE 79 (563) Q Consensus 1 m~-~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvne 79 (563) || +||||++++.+...+ ..|++||+++||+.++..|+++|++++++++++|++|||++||+||+|||||+||++|||| T Consensus 1 ~~~~lfg~~i~~~~~~~~-~~s~~~~~~~dg~~~~~~~~~~g~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVne 79 (537) T protein:vir:10 1 MAQQLFGFSLQRAKKVPK-GPSFVQKDSLDGSQPIVGGGYFGYSVDFDGTIRNDHELITRYREMVLNPECDSAVDDVVNE 79 (537) T ss_pred Cccccccceeeccccccc-CCcccCCCcccccceeecccccccccccccccchHHHHHHHHHHHhhccchhhHHHHhhcc Confidence 66 699999999876655 5599999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeC Q lcl|NC_015287. 80 AIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYID 159 (563) Q Consensus 80 aiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lD 159 (563) |||||++++||+|+|++|++|++||++|++||++||++|+|+++||+|||+||||||+|||||||++|||+||+|||+|| T Consensus 80 aiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fhKiid~k~pk~GI~ELr~lD 159 (537) T protein:vir:10 80 TICGNFDDVPISIDLHNLKQSEKIKKLIRSEFDEILRLLDFDNRAYEIFRRWYVDGRLFFHKVIDPKKPRQGLVELRYVD 159 (537) T ss_pred eeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccccceeeeeeC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeec Q lcl|NC_015287. 160 PLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHS 239 (563) Q Consensus 160 P~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hS 239 (563) ||+|++||++.+++++...+.....+ +.....+||+|+|++...++ +++||||+|||||||| T Consensus 160 Pr~i~~vR~i~~~~~~~~~~~~~~~~-----v~~~~~eyf~ynp~g~~~~~-------------~~~vkI~~dAI~y~hS 221 (537) T protein:vir:10 160 PRKIRKVTEYEAKRPEALRTQDLNQQ-----LTQQSASYFLYNPKGLKNST-------------NQGMKIAPDSIAYCHS 221 (537) T ss_pred CccceeeEeecccCCccceEEeccee-----eeecccceeeeccccccccC-------------CCceeccHhheeeecc Confidence 99999999999987643333222221 23346899999998764321 3469999999999999 Q ss_pred ceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCc Q lcl|NC_015287. 240 GLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGE 319 (563) Q Consensus 240 Gl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGe 319 (563) ||+|||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+ T Consensus 222 Gl~d~n~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGe 301 (537) T protein:vir:10 222 GIQDLNKNMVLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGE 301 (537) T ss_pred cceeCCCCeeeeeehhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHH Q lcl|NC_015287. 320 IRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEI 399 (563) Q Consensus 320 v~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDEl 399 (563) |+|++||||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++ ||++||++||||||| T Consensus 302 v~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnlgem~DV~YF~kKLy~aLnVP~SRl~~e~-~f~~Gr~~EItRDEi 380 (537) T protein:vir:10 302 IKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETET-TFNIGRAAEITRDEV 380 (537) T ss_pred ecccchhhhhhhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCC-cccccccchhhHHHH Confidence 9999999999999999999999999999999999999999999999999999999999999885 899999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhh Q lcl|NC_015287. 400 KFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYF 479 (563) Q Consensus 400 KF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~ 479 (563) ||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|||++|||++|++++++++||||||| T Consensus 381 KF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~ 460 (537) T protein:vir:10 381 KFQKFIARLRKRFSELFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYF 460 (537) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccCCCCCC Q lcl|NC_015287. 480 SLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAPQMPE 559 (563) Q Consensus 480 S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 559 (563) |++||||+||+|||+||+++++||++|+++|+|++|.+.++.+.+ .|+.+|..|.-..|..+|.+.++|.+|+ T Consensus 461 s~dyi~k~ILr~tDeeI~~~~k~I~~E~k~~~~~~p~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 533 (537) T protein:vir:10 461 SANYIRTKVLKQTESEIKEIDKEIKQEIADGVIMDPQAMQAMEMG-------IGDEEPVPEGGEEPQTDPNSAVSPADQK 533 (537) T ss_pred chHHHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCcccccccccC-------CCCcccCCCCCCCcccCCccCCCCCCcc Confidence 999999999999999999999999999999999999887776654 3455566665555666777888999999 Q ss_pred CCCC Q lcl|NC_015287. 560 GGEI 563 (563) Q Consensus 560 ~~~~ 563 (563) |||| T Consensus 534 ~~~~ 537 (537) T protein:vir:10 534 RGEL 537 (537) T ss_pred CCCC Confidence 9999 No 4 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=100.00 E-value=4e-279 Score=1546.87 Aligned_cols=530 Identities=55% Similarity=0.940 Sum_probs=487.7 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) ||+||||++++.++. ....|++||+++||+..+.+|+++|+|++++++++|++|||++||+||+|||||+||++||||| T Consensus 1 m~~lfg~~i~~~~~~-~~~~s~~~~~~~dg~~~i~~~~~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVnea 79 (533) T protein:vir:10 1 MSQLFGFSLERAKKA-PKGPSFVQKDNLDGSQPVSGGGYYGYTVDFDGQVRNEYQLISRYREMVLQPECDSAVDDIVNET 79 (533) T ss_pred Ccccccccccccccc-ccCCCCCCCCcccccceeecccccceeeecccccchHHHHHHHHHHHhhccchhhHHHHhhcce Confidence 999999999998665 4466999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCC Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDP 160 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP 160 (563) ||||++++||+|+|++|++|++||++|+|||++||++|+|+++||+|||+||||||+||||||||+|||+||+|||+||| T Consensus 80 iv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~ELr~lDP 159 (533) T protein:vir:10 80 ICGNFDDVPVSVELSNLKVSDKIKKLIREEFGEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPDNPQGGLIELRYIDP 159 (533) T ss_pred eeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCCCccccceeeeeccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeecc Q lcl|NC_015287. 161 LKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSG 240 (563) Q Consensus 161 ~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSG 240 (563) |+|++||++.++.+ ++++.. .....++.++.|||+|+|++...+ .+++||||+||||||||| T Consensus 160 r~i~~vr~i~~~~~--~~~~~~---~~~~~v~~~~~eyf~Ynp~g~~~~-------------~~~~vkI~~dAI~y~hSG 221 (533) T protein:vir:10 160 RKIRKINETEQKRP--EQLRGL---PLNQQLSPKSAEYFLYDPKGLKNS-------------TTQGLKIAPDSICYVHSG 221 (533) T ss_pred cceeeeeeeeccCC--Ccccee---ecchhhhccceeeeeecccccccc-------------CCCceecchhheeeeecc Confidence 99999999998765 333221 122345678899999999876432 134699999999999999 Q ss_pred eecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcc Q lcl|NC_015287. 241 LVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEI 320 (563) Q Consensus 241 l~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev 320 (563) |+|||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+| T Consensus 222 l~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev 301 (533) T protein:vir:10 222 IMDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEI 301 (533) T ss_pred ceeCCCCceeccchHhHHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCcee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHH Q lcl|NC_015287. 321 RDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIK 400 (563) Q Consensus 321 ~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElK 400 (563) +|++||||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++ ||++||++|||||||| T Consensus 302 ~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~DV~YF~kKLY~aLnVP~SRl~~e~-~f~~Gr~~EItRDEiK 380 (533) T protein:vir:10 302 KDDKKFMSMLEDFWLPRREGGRGTEITTLPGGQNLGELEDVKYFQKKLYKSLNVPGSRLETET-TFNVGRAAEITRDEVK 380 (533) T ss_pred cccchhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCC-cccccccchhhHHHHH Confidence 999999999999999999999999999999999999999999999999999999999999885 8999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhh Q lcl|NC_015287. 401 FTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFS 480 (563) Q Consensus 401 F~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S 480 (563) |+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|||++|||++|++++++++|||||||| T Consensus 381 F~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S 460 (533) T protein:vir:10 381 FQKFVARLRKRFSELFTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFS 460 (533) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCcccccccccc---CCCCcCCCCCcccccccCCCCcccCCCC Q lcl|NC_015287. 481 LDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQ---MPGGAMNPQDPNMGGAPQEGGAEQAPQM 557 (563) Q Consensus 481 ~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~---~~~~~~~p~~~~~~~~~~~~~~~~~~~~ 557 (563) ++||||+||+|||+||+++++||++|+++|+|++|.+++++.++.+.-+ .+++++.|..|+....++ .+. T Consensus 461 ~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~ 533 (533) T protein:vir:10 461 VEYMRRQVLKQTDVEMKEIDKQIESEMESGIIADPAAEMDPAMAAGDPDAGGAPAEEVAPEGPDPSDERK-------AEF 533 (533) T ss_pred hHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCCcchhhHHhcCCCCCcCCcccccCCCCCCCcchhhc-------cCC Confidence 9999999999999999999999999999999999999888766543222 222222333332111111 111 No 5 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=100.00 E-value=7.5e-275 Score=1523.44 Aligned_cols=503 Identities=37% Similarity=0.665 Sum_probs=474.4 Q ss_pred CcccccchhhHhhhcccccc-cccCcCCCCcccc---------eeccccccccccccchhhhHHHHHHHHHHHhhccchh Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVV-SPVPQSNEDSSDY---------YVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVD 70 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~-s~~p~~~~d~~~~---------~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd 70 (563) |++||||++++.++++++++ |++||+++||+.. .++|||+++|+|++++++|+++||++||+||+||||| T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~s~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd 87 (521) T protein:vir:81 8 LARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNLPASAWNSLTQQFYSTDQKISTTKQLVNTYRGLMNNHEVE 87 (521) T ss_pred hHhhcCchhhhHHhhhccCccccccCCCCCCceEecccCCCcceeecceeeeecccccchhhHHHHHHHHHHHhhccchh Confidence 99999999999998876655 8889999999853 3679999999999999999999999999999999999 Q ss_pred HHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCc Q lcl|NC_015287. 71 SAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEE 150 (563) Q Consensus 71 ~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~ 150 (563) +||++|||||||||++++||+|+|+++++|++||++|+|||++||++|+|+++||+|||+||||||+||||||| +|||+ T Consensus 88 ~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid-~~pk~ 166 (521) T protein:vir:81 88 NAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKIIG-KNPKD 166 (521) T ss_pred hHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEEc-CCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999 89999 Q ss_pred ceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeec Q lcl|NC_015287. 151 GIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIA 230 (563) Q Consensus 151 GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~ 230 (563) ||+|||+||||+|++||++.++++ +++.+++ ++.|||+|+|+.. .++.+|.+ +.+ ++.|||| T Consensus 167 GI~Elr~lDPr~i~~vr~i~k~~~--~~~~v~~----------~~~e~f~Y~~~~~-~~~~~g~~----~~~-~~~vkI~ 228 (521) T protein:vir:81 167 GIVELRQLDPRNLEYVREIITEDT--PEGKIYK----------ATKEYFIYTVGNS-SYCAGGQV----FSP-NSRVKIP 228 (521) T ss_pred cceeeeeeCCcceeeeeeeccccc--Cccceec----------ceeeeeeeecCCc-ccccccee----ecC-Ccceeec Confidence 999999999999999999999765 5565554 4789999999533 33333322 222 3579999 Q ss_pred ccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcce Q lcl|NC_015287. 231 PDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNK 310 (563) Q Consensus 231 ~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk 310 (563) +||||||||||+|||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++|||| T Consensus 229 ~dAI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~im~k~kNk 308 (521) T protein:vir:81 229 RSAITYAHSGLMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNR 308 (521) T ss_pred hhheeeeeccceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCc-ceecc Q lcl|NC_015287. 311 LVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGD-GFNLG 389 (563) Q Consensus 311 lvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~-g~~~g 389 (563) +|||++||+|+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++++ +|++| T Consensus 309 lvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~G 388 (521) T protein:vir:81 309 VVYDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGG 388 (521) T ss_pred eEeecccccccccccccchhhhhcccccCCCcccceeecccCCCCChHHHHHHHHHHHHHHhCCccccccCCCCcceecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999976543 79999 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 390 RSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVA 469 (563) Q Consensus 390 ~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~ 469 (563) |++|||||||||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|+++++ T Consensus 389 r~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~ 468 (521) T protein:vir:81 389 DGSEITRDELEFSKFIRTRQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIE 468 (521) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccc Q lcl|NC_015287. 470 AVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQ 522 (563) Q Consensus 470 ~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~ 522 (563) +++||||||||++||||+||+|||+||+++++||++|+++|+|++|++++++= T Consensus 469 ~~dpyvGky~s~dyi~k~ILr~tDeei~~~~k~I~~E~~~~~~~~p~~~~~~f 521 (521) T protein:vir:81 469 RITPYIGKYFSNQTVMRDILKYTDDQMDTEKKQIEEEANDPRFKQTPDEIEDF 521 (521) T ss_pred HhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCCcccccCC Confidence 99999999999999999999999999999999999999999999997665443 No 6 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=100.00 E-value=9.1e-275 Score=1522.96 Aligned_cols=504 Identities=38% Similarity=0.679 Sum_probs=474.4 Q ss_pred CcccccchhhHhh----hcccccc-cccCcCCCCcccceecccccc----------ccccccchhhhHHHHHHHHHHHhh Q lcl|NC_015287. 1 MAKLFGFKIQKDD----EQNKNVV-SPVPQSNEDSSDYYVSSGFYG----------QYVDIDGVFKSEFELIKRYREMAL 65 (563) Q Consensus 1 m~~~~~f~~~~~~----~~~~~~~-s~~p~~~~d~~~~~~~~~~~~----------~~~~~~~~~~~~~~LI~~YR~ma~ 65 (563) |++||||..++.+ ++.+..+ |++||++|||++.+..+++++ +|++++++++|+++||++||+||+ T Consensus 5 ~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR~ma~ 84 (523) T protein:vir:68 5 ILSLFAPWAKMDERDYKDQEKENLESITSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLKSTRELIDTYRNLMT 84 (523) T ss_pred hhhhhhhhhhhhhhhhhhhhhccCCCccccCCCCcceeeeccccccccccchhhhhhhhccccccchHHHHHHHHHHHhh Confidence 9999999766544 4455555 889999999988775444433 377899999999999999999999 Q ss_pred ccchhHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecC Q lcl|NC_015287. 66 HPEVDSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDI 145 (563) Q Consensus 66 ~PEvd~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~ 145 (563) |||||+||++|||||||||++++||+|+|+++++|++||++|+|||++||++|+|+++||+|||+||||||+|||||||| T Consensus 85 ~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~ 164 (523) T protein:vir:68 85 NYEVDNAVSEIVSDAIVYEDDTEVVSINLDNTKFSPNIKSMMLDEFNEVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDP 164 (523) T ss_pred ccchhhHHHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhheeeeEEEEEEEeeC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcc Q lcl|NC_015287. 146 KAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQR 225 (563) Q Consensus 146 ~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~ 225 (563) +|||+||+|||+||||+|++||++.+++. +++.+++ ++.|||+|+|++ ..++++|..+. .++ T Consensus 165 k~pk~GI~Elr~lDPr~i~~vr~i~~~~~--~g~~vi~----------~~~e~f~Y~~~~-~~~~~~g~~~~-----~~~ 226 (523) T protein:vir:68 165 KRPKEGIKELRRLDPRQVQYVREVITTTE--AGVKIVK----------GYKEYFIYDTSH-ESYACDGRIYE-----AGT 226 (523) T ss_pred CCccccceeeeeeCCcceeEEEeecCCCC--cchhhhh----------hhhhheeecccc-ccccccccccC-----CCc Confidence 99999999999999999999999999854 6777765 478999999986 44555554332 246 Q ss_pred eeeecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHH Q lcl|NC_015287. 226 PVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMN 305 (563) Q Consensus 226 ~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~ 305 (563) +||||+||||||||||+|||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||+ T Consensus 227 ~ikI~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~ 306 (523) T protein:vir:68 227 KIKIPKAAIVYAHSGLVDCCGKNIIGYLHRAIKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPSRKAAEHMQHVMN 306 (523) T ss_pred ceecchhheeeeeccceeCCCCceeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHH Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCcc Q lcl|NC_015287. 306 RYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDG 385 (563) Q Consensus 306 k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g 385 (563) +||||+|||++||+|+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++++| T Consensus 307 k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~ 386 (523) T protein:vir:68 307 TMKNRIAYDATTGKIKNQQHIMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDVRWFRNALYMALRIPITRIPSDQGG 386 (523) T ss_pred hhcceeEEeccCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCcceeecCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999998778 Q ss_pred eeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHH Q lcl|NC_015287. 386 FNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERL 465 (563) Q Consensus 386 ~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~ 465 (563) |++||++|||||||||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|||++|||++|+ T Consensus 387 f~~Gr~~EItRDEikF~KFI~rLR~rFs~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~ 466 (523) T protein:vir:68 387 IQFDAGTSITRDELSFGKFIRELQHKFEEIFLDPLKTNLILKGIITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRI 466 (523) T ss_pred eecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccc Q lcl|NC_015287. 466 GVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQ 522 (563) Q Consensus 466 ~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~ 522 (563) +++++++||||||||++||||+||+|||+||+++++||++|+++|+|++|++++++= T Consensus 467 ~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~e~~~f 523 (523) T protein:vir:68 467 NMLQMAEPFIGKYISHRTAMKDILQMSDEEIEQEAKQIEEESKEARFQDPDQEQEDF 523 (523) T ss_pred HHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhhcC Confidence 999999999999999999999999999999999999999999999999997776554 No 7 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=100.00 E-value=2.1e-273 Score=1515.55 Aligned_cols=504 Identities=35% Similarity=0.657 Sum_probs=468.5 Q ss_pred CcccccchhhHhhh----ccc-ccccccCcCCCCcccce------eccccc--cccccccchhhhHHHHHHHHHHHhhcc Q lcl|NC_015287. 1 MAKLFGFKIQKDDE----QNK-NVVSPVPQSNEDSSDYY------VSSGFY--GQYVDIDGVFKSEFELIKRYREMALHP 67 (563) Q Consensus 1 m~~~~~f~~~~~~~----~~~-~~~s~~p~~~~d~~~~~------~~~~~~--~~~~~~~~~~~~~~~LI~~YR~ma~~P 67 (563) |++||||..++.+. +.+ ...|++||+++||++.+ ++++++ ++|++++++++|+++||++||+||+|| T Consensus 7 ~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR~ma~~p 86 (524) T protein:vir:10 7 ILSFLKPWANEDEKEYKQQINNNLESVTAPKLDDGAREIETQEQNIPYNALMQQMFGSNEPEVKNTRELIDTYRNLMNNY 86 (524) T ss_pred HHHHhhhhhcchhhhhhhhhccCCCccccCCCCCCceeeccCcccccchhhhhhhhhcccchhhhHHHHHHHHHHHhhcc Confidence 99999996665443 333 34488899999998655 233333 447789999999999999999999999 Q ss_pred chhHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCC Q lcl|NC_015287. 68 EVDSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKA 147 (563) Q Consensus 68 Evd~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~ 147 (563) |||+||++|||||||||++++||+|+|+++++|++||++|+|||++||++|+|+++||+|||+||||||+||||||||+| T Consensus 87 Evd~Av~eIVneaiv~d~~~~pV~l~Ld~~~~s~siK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~~~ 166 (524) T protein:vir:10 87 EVDNAVQEIVSDAIVYEDDKEVVALNLDGTDFSQSIKDKILAEFSEVLNLLNFQRKGTDHFQRWYVDSRIFFHKIINPKK 166 (524) T ss_pred chhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeceEEEEEEeeCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCccee Q lcl|NC_015287. 148 PEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPV 227 (563) Q Consensus 148 ~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~i 227 (563) ||+||+|||+||||+|++||++.+++. +++.+++ +++|||+|+|. ...+++.|. ..+.+++| T Consensus 167 pk~GI~Elr~lDPr~i~~vr~i~~~~~--~~~~vi~----------~~~e~f~Y~~~-~~~~~~~~~-----~~~~~~~i 228 (524) T protein:vir:10 167 MKDGVQELRRLDPRQVQYIREIVTRME--DGVKIVD----------GYREFFVYDTG-HESYCADGR-----IYSAGTKV 228 (524) T ss_pred ccccceeeeeeCCccceeeeeecccCc--ccchhhc----------chhhheeecCC-CcccccCcc-----eecCCcce Confidence 999999999999999999999999854 6666665 47899999993 222232222 22345689 Q ss_pred eecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhh Q lcl|NC_015287. 228 KIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRY 307 (563) Q Consensus 228 ki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~ 307 (563) |||+||||||||||+|||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++| T Consensus 229 kI~~dAIvy~~SGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnlPk~KAeqYl~~im~k~ 308 (524) T protein:vir:10 229 KIPRAAVVYAHSGLLDCCGKNIIGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRVFYIDTGNMPSRKAAAQMQHIMNTM 308 (524) T ss_pred ecchhheeeeccCcccCCCCceeccchHhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCC-cce Q lcl|NC_015287. 308 RNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSG-DGF 386 (563) Q Consensus 308 knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~-~g~ 386 (563) |||+|||++||+|+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++ +|| T Consensus 309 kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~f 388 (524) T protein:vir:10 309 KNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTMPGATGMSDMDDVLYFRTALYRALRIPESRIPSESNSGV 388 (524) T ss_pred CceeEEeccCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCchhccCCCCccc Confidence 9999999999999999999999999999999999999999999999999999999999999999999999998776 489 Q ss_pred eccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHH Q lcl|NC_015287. 387 NLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLG 466 (563) Q Consensus 387 ~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~ 466 (563) ++||++|||||||||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|||++|||++|++ T Consensus 389 ~~gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~ 468 (524) T protein:vir:10 389 MFDAGTAITRDELKFAKWIRQLQNKFEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRIN 468 (524) T ss_pred cccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccc Q lcl|NC_015287. 467 VVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQ 522 (563) Q Consensus 467 ~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~ 522 (563) ++++++||||||||++||||+||+|||+||+++++||++|+++|+|++|++++++= T Consensus 469 ~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 469 MLTMAEPFIGKYISHQTAMKDFLQMTDEEINQEAKQIEEESKEARFQNPDEEEEDF 524 (524) T ss_pred HHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCChhhhcC Confidence 99999999999999999999999999999999999999999999999997766554 No 8 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=100.00 E-value=3.5e-273 Score=1514.29 Aligned_cols=502 Identities=36% Similarity=0.670 Sum_probs=476.1 Q ss_pred CcccccchhhHhh----hccccc-ccccCcCCCCccc--------ceeccccccccccccchhhhHHHHHHHHHHHhhcc Q lcl|NC_015287. 1 MAKLFGFKIQKDD----EQNKNV-VSPVPQSNEDSSD--------YYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHP 67 (563) Q Consensus 1 m~~~~~f~~~~~~----~~~~~~-~s~~p~~~~d~~~--------~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~P 67 (563) |++||||..++.+ +..+++ .|++||+++||++ ..+.||+++++++.++.++|+++||++||+||+|| T Consensus 5 ~l~lf~f~~k~~e~~~~~~~~~~~~s~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~~~~n~~eLI~~YR~ma~~p 84 (521) T protein:vir:10 5 FLKLLQPWMKDDEKRVQSDLSDRIDSFAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAPKIQNTKDLINQYRSLSKYH 84 (521) T ss_pred hhHHhhhhhhhhhhHHhhhhccCccccccccCCCCceeeccCCCccccccchhhhhhccccccchHHHHHHHHHHHhhcc Confidence 9999999766554 334433 3888999999984 33789999999999999999999999999999999 Q ss_pred chhHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCC Q lcl|NC_015287. 68 EVDSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKA 147 (563) Q Consensus 68 Evd~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~ 147 (563) |||+||++|||||||||++++||+|+|+++++|++||++|+|||++||++|+|+++||+|||+||||||+||||||||+| T Consensus 85 Evd~Av~eIvneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~~~ 164 (521) T protein:vir:10 85 EVDNAIDEIINDAIVQEDNRDTVYLDLDKTDWNESVKEMVREEFRTILKLLKFEREGKRHFRRWYVDSRIYFHKMIDPAR 164 (521) T ss_pred chhhHHHhhhcceEEecCCCceEEEEecCcccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeeEEEEEEeeCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCccee Q lcl|NC_015287. 148 PEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPV 227 (563) Q Consensus 148 ~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~i 227 (563) ||+||+|||+||||+|++||++.+++. +++.++++ +.|||+|+|++....+..| + .+++| T Consensus 165 pk~GI~Elr~lDPr~i~~vr~i~k~~~--~~~~v~~~----------~~e~f~Y~~~~~~~~~~~g---~-----~~~~v 224 (521) T protein:vir:10 165 PKDGIKELRLLDPRNVEYYRVNLKSNE--NGNDVYKG----------VKEFFTYGATEDNRYNISG---N-----SNNLV 224 (521) T ss_pred ccccceeeeeeCCcceeeeeeecCCCC--Ccchhhcc----------ceeeeeeccCCCceecCCC---C-----CCcce Confidence 999999999999999999999999864 77777764 6899999997755544433 2 26679 Q ss_pred eecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhh Q lcl|NC_015287. 228 KIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRY 307 (563) Q Consensus 228 ki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~ 307 (563) +||+||||||||||+|||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++| T Consensus 225 kI~~daI~y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~iM~k~ 304 (521) T protein:vir:10 225 QIPIDAIVYSHSGKVDIDGKTIVGYLHNVIKPANQLKMLEDAMVIYRITRAPERRVFYIDVGTMPNKKATQHLNNVMQGL 304 (521) T ss_pred eechhheeeecccceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCccee Q lcl|NC_015287. 308 RNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFN 387 (563) Q Consensus 308 knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~ 387 (563) |||+|||++||+|+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++++||+ T Consensus 305 kNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEI~TLpggqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~f~ 384 (521) T protein:vir:10 305 KNRVVYDSSTGKVKNSSNNLAMTEDYWLMRRDGKATTEVSTLPGAQSMGEMDDVRWFNRKLYESMKIPLSRLPQEGAGVT 384 (521) T ss_pred CceEEEeccCceeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCCcee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999987999 Q ss_pred ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHH Q lcl|NC_015287. 388 LGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGV 467 (563) Q Consensus 388 ~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~ 467 (563) +||++|||||||||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|+++ T Consensus 385 ~Gr~~EItRDEikF~KFI~rLR~rFs~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~ 464 (521) T protein:vir:10 385 FGAGNDITRDELQFTKYIRGLQQQFEPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNL 464 (521) T ss_pred cccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhh--hhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccc Q lcl|NC_015287. 468 VAAVDP--YVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQ 522 (563) Q Consensus 468 ~~~~~p--~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~ 522 (563) ++++|| |||||||++||||+||+|||+||+++++||++|+++|+|++|++++++= T Consensus 465 l~~~dp~~yvGky~s~dyi~k~ILr~tDeeik~~~k~I~~E~~~~~~~~p~~e~~df 521 (521) T protein:vir:10 465 VQTLASAEVTGKYLSHEYVMKNILRMSDEDIKTEREKIDGELKDSVYKNPEDPMEEF 521 (521) T ss_pred HHhhcCccccccccchHHHHHHHhcCCHhHHHHHHHHHHHhhhCCCCCCCcchhhcC Confidence 999999 9999999999999999999999999999999999999999997776554 No 9 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=100.00 E-value=1.5e-272 Score=1510.81 Aligned_cols=503 Identities=37% Similarity=0.668 Sum_probs=473.1 Q ss_pred CcccccchhhHhhhcccccc-cccCcCCCCcccce---------eccccccccccccchhhhHHHHHHHHHHHhhccchh Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVV-SPVPQSNEDSSDYY---------VSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVD 70 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~-s~~p~~~~d~~~~~---------~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd 70 (563) |+.|++++.++.+++.++++ |++||+++||+..+ +.||++++|++++++++|+++||++||+||+||||| T Consensus 8 ~~~~~~~d~~~~~e~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd 87 (521) T protein:vir:65 8 LARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQKISTTKQLVNTYRGLMNNHEVE 87 (521) T ss_pred hhhccCchhhHHHhhhccCCCcccCCCCCCCceeecccCCccccccccceeeeccccchhhhHHHHHHHHHHHhhccchh Confidence 99999999999998888777 55588888888755 358999999999999999999999999999999999 Q ss_pred HHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCc Q lcl|NC_015287. 71 SAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEE 150 (563) Q Consensus 71 ~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~ 150 (563) +||++|||||||||++++||+|+|+++++|++||++|+|||++||++|+|+++||+|||+||||||+||||||| +|||+ T Consensus 88 ~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid-~~pk~ 166 (521) T protein:vir:65 88 NAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKIIG-KNPKD 166 (521) T ss_pred hHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceeEEEEEEc-CCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999 89999 Q ss_pred ceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeec Q lcl|NC_015287. 151 GIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIA 230 (563) Q Consensus 151 GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~ 230 (563) ||+|||+||||+|++||++.+++. +++.++. .+.|||+|+|.... ++.+| + .+.+ ++.|||| T Consensus 167 GI~ELr~lDPr~i~~vr~i~k~~~--~~~~v~~----------~~~e~f~Y~~~~~~-~~~~g---~-~~~~-~~~vkI~ 228 (521) T protein:vir:65 167 GIVELRQLDPRNLEYVREIITEDT--PEGKIYK----------ATKEYFIYTVGNSS-YCAGG---Q-VFSP-NSRVKIP 228 (521) T ss_pred cceeeeeeCCcceeeeeeeccccc--CCcceec----------ceeeeeeeecCCcc-eeccc---e-eecC-Ccceeec Confidence 999999999999999999999865 5555544 47999999883333 22222 2 2333 3469999 Q ss_pred ccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcce Q lcl|NC_015287. 231 PDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNK 310 (563) Q Consensus 231 ~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk 310 (563) +||||||||||+|||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++|||| T Consensus 229 ~dAI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNk 308 (521) T protein:vir:65 229 RSAITYAHSGLMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNR 308 (521) T ss_pred hhheeeeeccceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCC-cceecc Q lcl|NC_015287. 311 LVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSG-DGFNLG 389 (563) Q Consensus 311 lvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~-~g~~~g 389 (563) +|||++||+|+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+.++ ++|++| T Consensus 309 lvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~g 388 (521) T protein:vir:65 309 VVYDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGG 388 (521) T ss_pred eEeecccccccccccccchhhhhcccccCCCCccceeecccCCCcChHHHHHHHHHHHHHHhCCCceeccCCCCcceecc Confidence 9999999999999999999999999999999999999999999999999999999999999999999997654 479999 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 390 RSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVA 469 (563) Q Consensus 390 ~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~ 469 (563) |++|||||||||+|||+|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|+++++ T Consensus 389 r~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~ 468 (521) T protein:vir:65 389 DGSEITRDELEFSKFIRTLQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIE 468 (521) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccc Q lcl|NC_015287. 470 AVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQ 522 (563) Q Consensus 470 ~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~ 522 (563) +++||||||||++||||+||+|||+||+++++||++|+++|+|++|.+++++= T Consensus 469 ~~dpyvGky~S~dyi~k~ILr~tDeei~~~~k~I~~E~~~~~~~~p~~~~~~f 521 (521) T protein:vir:65 469 RITPYIGKYFSNQTVMRDILKYTDDQMDTEKKQIEEEANDPRFKQTPDEIEDF 521 (521) T ss_pred HhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHhhhCCCCCCCcccccCC Confidence 99999999999999999999999999999999999999999999997665443 No 10 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=100.00 E-value=6.7e-272 Score=1507.25 Aligned_cols=504 Identities=37% Similarity=0.675 Sum_probs=470.9 Q ss_pred CcccccchhhHhhhcccccc-cccCcCCCCcccce------ecccccccccc----ccchhhhHHHHHHHHHHHhhccch Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVV-SPVPQSNEDSSDYY------VSSGFYGQYVD----IDGVFKSEFELIKRYREMALHPEV 69 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~-s~~p~~~~d~~~~~------~~~~~~~~~~~----~~~~~~~~~~LI~~YR~ma~~PEv 69 (563) +..|.+.+..+..++.+.++ |++||+++||+..+ ++|+|++++.+ ++++++|+++||++||+||+|||| T Consensus 9 ~~~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~YR~ma~~pEv 88 (524) T protein:vir:10 9 FAPWAKMDERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTYRNLMNNYEV 88 (524) T ss_pred hhccccCcchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHHHHHhhccch Confidence 55666667777776666555 99999999998766 45677777554 666799999999999999999999 Q ss_pred hHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCC Q lcl|NC_015287. 70 DSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPE 149 (563) Q Consensus 70 d~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k 149 (563) |+||++|||||||||++++||+|+|+++++|++||++|+|||++||++|+|+++||+|||+||||||+||||||||+||| T Consensus 89 d~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk 168 (524) T protein:vir:10 89 DNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFNDVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPK 168 (524) T ss_pred hhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEeeCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeee Q lcl|NC_015287. 150 EGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKI 229 (563) Q Consensus 150 ~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki 229 (563) +||+|||+||||+|++||++.+++. ++..+++ ++.|||+|+|.+ ..++++|..+. .+++||| T Consensus 169 ~GI~Elr~lDPr~i~~vr~i~~~~~--~~~~vi~----------~~~e~f~Y~~~~-~~y~~~g~~~~-----~~~~ikI 230 (524) T protein:vir:10 169 EGIKELRRLDPRQVQYVREIITETE--AGTKIVK----------GYKEYFIYDTAH-ESYACDGRMYE-----AGTKIKI 230 (524) T ss_pred ccceeeeeeCCccceeeeeeccCCC--ccchhhc----------chhhheeeccCc-cccccCccccC-----CCcceec Confidence 9999999999999999999999754 5556655 479999999965 44455554332 3568999 Q ss_pred cccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcc Q lcl|NC_015287. 230 APDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRN 309 (563) Q Consensus 230 ~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~kn 309 (563) |+||||||||||+|||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||| T Consensus 231 ~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KN 310 (524) T protein:vir:10 231 PKAAIVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKN 310 (524) T ss_pred chhheeeeeccceeCCCCceeccchhhhHHHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCC-cceec Q lcl|NC_015287. 310 KLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSG-DGFNL 388 (563) Q Consensus 310 klvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~-~g~~~ 388 (563) |+|||++||+|+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++ +||++ T Consensus 311 klvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~ 390 (524) T protein:vir:10 311 RVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDVRWFRQALYMALRVPLSRIPQDQQGGVMF 390 (524) T ss_pred eeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCchhhcCCCCCccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999998876 47999 Q ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 389 GRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVV 468 (563) Q Consensus 389 g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~ 468 (563) ||++|||||||||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|||++|||++|++++ T Consensus 391 gr~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l 470 (524) T protein:vir:10 391 DSGTSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINML 470 (524) T ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccc Q lcl|NC_015287. 469 AAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQ 522 (563) Q Consensus 469 ~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~ 522 (563) ++++||||||||++||||+||+|||+||+++++||++|+++|+|++|++++++= T Consensus 471 ~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 471 TMAEPFIGKYISHRTAMKDILQMTDEEIEQEAKQIEEESKEARFQDPDQEQEDF 524 (524) T ss_pred HHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhhcC Confidence 999999999999999999999999999999999999999999999997776554 No 11 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=100.00 E-value=7.7e-272 Score=1506.93 Aligned_cols=504 Identities=37% Similarity=0.675 Sum_probs=470.8 Q ss_pred CcccccchhhHhhhcccccc-cccCcCCCCcccce------ecccccccccc----ccchhhhHHHHHHHHHHHhhccch Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVV-SPVPQSNEDSSDYY------VSSGFYGQYVD----IDGVFKSEFELIKRYREMALHPEV 69 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~-s~~p~~~~d~~~~~------~~~~~~~~~~~----~~~~~~~~~~LI~~YR~ma~~PEv 69 (563) +..|.+.+..+..++.+.++ |++||+++||+..+ ++|+|++++.+ ++++++|+++||++||+||+|||| T Consensus 9 ~~~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~YR~ma~~pEv 88 (524) T protein:vir:72 9 FAPWAKMDERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTYRNLMNNYEV 88 (524) T ss_pred hhccccCcchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHHHHHhhccch Confidence 55666667777776666555 99999999998766 45677777554 666799999999999999999999 Q ss_pred hHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCC Q lcl|NC_015287. 70 DSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPE 149 (563) Q Consensus 70 d~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k 149 (563) |+||++|||||||||++++||+|+|+++++|++||++|+|||++||++|+|+++||+|||+||||||+|||||||++||| T Consensus 89 d~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk 168 (524) T protein:vir:72 89 DNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFSDVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPK 168 (524) T ss_pred hhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeee Q lcl|NC_015287. 150 EGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKI 229 (563) Q Consensus 150 ~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki 229 (563) +||+|||+||||+|++||++.+++. ++..+++ ++.|||+|+|.+ ..++++|..+. .+++||| T Consensus 169 ~GI~Elr~lDPr~i~~vr~i~~~~~--~~~~vi~----------~~~e~f~Y~~~~-~~y~~~g~~~~-----~~~~ikI 230 (524) T protein:vir:72 169 EGIKELRRLDPRQVQYVREIITETE--AGTKIVK----------GYKEYFIYDTAH-ESYACDGRMYE-----AGTKIKI 230 (524) T ss_pred ccceeeeeeCCccceeeeeeccCCC--ccchhhc----------chhhheeeccCc-cccccCccccC-----CCcceec Confidence 9999999999999999999999754 5556655 479999999965 44455554332 3568999 Q ss_pred cccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcc Q lcl|NC_015287. 230 APDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRN 309 (563) Q Consensus 230 ~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~kn 309 (563) |+||||||||||+|||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||| T Consensus 231 ~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KN 310 (524) T protein:vir:72 231 PKAAVVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKN 310 (524) T ss_pred chhheeeeeccceeCCCCceeccchhhhHhHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCC-cceec Q lcl|NC_015287. 310 KLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSG-DGFNL 388 (563) Q Consensus 310 klvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~-~g~~~ 388 (563) |+|||++||+|+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++ +||++ T Consensus 311 klvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~ 390 (524) T protein:vir:72 311 RVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDIRWFRQALYMALRVPLSRIPQDQQGGVMF 390 (524) T ss_pred eeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCchhhcCCCCCccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999998876 47999 Q ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 389 GRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVV 468 (563) Q Consensus 389 g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~ 468 (563) ||++|||||||||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|||++|||++|++++ T Consensus 391 gr~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l 470 (524) T protein:vir:72 391 DSGTSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINML 470 (524) T ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccc Q lcl|NC_015287. 469 AAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQ 522 (563) Q Consensus 469 ~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~ 522 (563) ++++||||||||++||||+||+|||+||+++++||++|+++|+|++|++++++= T Consensus 471 ~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:72 471 TMAEPFIGKYISHRTAMKDILQMTDEEIEQEAKQIEEESKEARFQDPDQEQEDF 524 (524) T ss_pred HHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhhcC Confidence 999999999999999999999999999999999999999999999997776554 No 12 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=100.00 E-value=2e-271 Score=1504.69 Aligned_cols=500 Identities=37% Similarity=0.695 Sum_probs=469.1 Q ss_pred CcccccchhhHh----hhccc-ccccccCcCCCCcccce-------eccccccccccccchhhhHHHHHHHHHHHhhccc Q lcl|NC_015287. 1 MAKLFGFKIQKD----DEQNK-NVVSPVPQSNEDSSDYY-------VSSGFYGQYVDIDGVFKSEFELIKRYREMALHPE 68 (563) Q Consensus 1 m~~~~~f~~~~~----~~~~~-~~~s~~p~~~~d~~~~~-------~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PE 68 (563) |++||||..++. +++.+ ...|++||+++||++.+ ..||++|++++++++++|+++||++||+||+||| T Consensus 3 ~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pE 82 (516) T protein:vir:10 3 FLDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINNPE 82 (516) T ss_pred chHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhccc Confidence 999999944433 33444 45588999999999766 4599999999999999999999999999999999 Q ss_pred hhHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCC Q lcl|NC_015287. 69 VDSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAP 148 (563) Q Consensus 69 vd~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~ 148 (563) ||+||++|||||||||++++||+|+|+++++|++||++|+|||++|+++|+|+++||+|||+||||||+||||||| || T Consensus 83 vd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid--~~ 160 (516) T protein:vir:10 83 VERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMP--NP 160 (516) T ss_pred hhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEec--Cc Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999998 99 Q ss_pred CcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceee Q lcl|NC_015287. 149 EEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVK 228 (563) Q Consensus 149 k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~ik 228 (563) |+||+|||+||||+|++||++++++. +++.++++ +.|||+|+|.. ..+++.|.+ +.+ ++.|| T Consensus 161 k~GI~Elr~lDPr~i~~vR~i~~~~~--~~~~v~~~----------~~e~~~Y~~~~-~~~~~~g~~----~~~-~~~ik 222 (516) T protein:vir:10 161 KKGIAELRRLDPRFMEYYREIVTSDI--GGTTIVKG----------YREFFIYTTGN-EGYSYNGRI----FEP-NTRIK 222 (516) T ss_pred cccceeeeeeCCcceeeEeeeccccc--ccchhhhh----------hhheeeeccCc-cccccccce----eCC-Cccee Confidence 99999999999999999999999864 66777654 78999999844 444544443 333 34699 Q ss_pred ecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhc Q lcl|NC_015287. 229 IAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYR 308 (563) Q Consensus 229 i~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~k 308 (563) ||+||||||||||+|||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++|| T Consensus 223 I~~dAI~y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~k 302 (516) T protein:vir:10 223 IPRSAVVYASSGLMDCSDRGIIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLK 302 (516) T ss_pred echhheeeecccceeCCCCceeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCcceec Q lcl|NC_015287. 309 NKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNL 388 (563) Q Consensus 309 nklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~ 388 (563) ||+|||++||+|+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++ +|++ T Consensus 303 NklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~-~~~~ 381 (516) T protein:vir:10 303 NRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMDDVRWFNKKLYEALRIPLSRIPRDD-GGMV 381 (516) T ss_pred ceeEEeCCCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCC-Ccee Confidence 999999999999999999999999999999999999999999999999999999999999999999999999886 5555 Q ss_pred --cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHH Q lcl|NC_015287. 389 --GRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLG 466 (563) Q Consensus 389 --g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~ 466 (563) ||++|||||||||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++ T Consensus 382 ~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~ 461 (516) T protein:vir:10 382 IGGQDTAITRDELDFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVD 461 (516) T ss_pred eccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHH Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCcc Q lcl|NC_015287. 467 VVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQL 521 (563) Q Consensus 467 ~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~ 521 (563) ++++++||||||||++||||+||+|||+||+++++||++|+++|+|++|+++++= T Consensus 462 ~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~e~k~I~~E~~~~~~~~p~~~~~f 516 (516) T protein:vir:10 462 ALSQIEPYVGKYVSHDYVMKNILQMTEEQIAQEEKQIEQEAGIKRFQNPENEDDF 516 (516) T ss_pred HHHHhhhhhccccchHHHHHHHhcCCHhhHHHHHHHHHHhhhCCCCCCCCccccC Confidence 9999999999999999999999999999999999999999999999999666443 No 13 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=100.00 E-value=2e-271 Score=1504.69 Aligned_cols=500 Identities=37% Similarity=0.695 Sum_probs=469.1 Q ss_pred CcccccchhhHh----hhccc-ccccccCcCCCCcccce-------eccccccccccccchhhhHHHHHHHHHHHhhccc Q lcl|NC_015287. 1 MAKLFGFKIQKD----DEQNK-NVVSPVPQSNEDSSDYY-------VSSGFYGQYVDIDGVFKSEFELIKRYREMALHPE 68 (563) Q Consensus 1 m~~~~~f~~~~~----~~~~~-~~~s~~p~~~~d~~~~~-------~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PE 68 (563) |++||||..++. +++.+ ...|++||+++||++.+ ..||++|++++++++++|+++||++||+||+||| T Consensus 3 ~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pE 82 (516) T protein:vir:10 3 FLDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINNPE 82 (516) T ss_pred chHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhccc Confidence 999999944433 33444 45588999999999766 4599999999999999999999999999999999 Q ss_pred hhHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCC Q lcl|NC_015287. 69 VDSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAP 148 (563) Q Consensus 69 vd~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~ 148 (563) ||+||++|||||||||++++||+|+|+++++|++||++|+|||++|+++|+|+++||+|||+||||||+||||||| || T Consensus 83 vd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid--~~ 160 (516) T protein:vir:10 83 VERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMP--NP 160 (516) T ss_pred hhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEec--Cc Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999998 99 Q ss_pred CcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceee Q lcl|NC_015287. 149 EEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVK 228 (563) Q Consensus 149 k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~ik 228 (563) |+||+|||+||||+|++||++++++. +++.++++ +.|||+|+|.. ..+++.|.+ +.+ ++.|| T Consensus 161 k~GI~Elr~lDPr~i~~vR~i~~~~~--~~~~v~~~----------~~e~~~Y~~~~-~~~~~~g~~----~~~-~~~ik 222 (516) T protein:vir:10 161 KKGIAELRRLDPRFMEYYREIVTSDI--GGTTIVKG----------YREFFIYTTGN-EGYSYNGRI----FEP-NTRIK 222 (516) T ss_pred cccceeeeeeCCcceeeEeeeccccc--ccchhhhh----------hhheeeeccCc-cccccccce----eCC-Cccee Confidence 99999999999999999999999864 66777654 78999999844 444544443 333 34699 Q ss_pred ecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhc Q lcl|NC_015287. 229 IAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYR 308 (563) Q Consensus 229 i~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~k 308 (563) ||+||||||||||+|||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++|| T Consensus 223 I~~dAI~y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~k 302 (516) T protein:vir:10 223 IPRSAVVYASSGLMDCSDRGIIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLK 302 (516) T ss_pred echhheeeecccceeCCCCceeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCcceec Q lcl|NC_015287. 309 NKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNL 388 (563) Q Consensus 309 nklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~ 388 (563) ||+|||++||+|+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++ +|++ T Consensus 303 NklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~-~~~~ 381 (516) T protein:vir:10 303 NRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMDDVRWFNKKLYEALRIPLSRIPRDD-GGMV 381 (516) T ss_pred ceeEEeCCCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCC-Ccee Confidence 999999999999999999999999999999999999999999999999999999999999999999999999886 5555 Q ss_pred --cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHH Q lcl|NC_015287. 389 --GRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLG 466 (563) Q Consensus 389 --g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~ 466 (563) ||++|||||||||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++ T Consensus 382 ~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~ 461 (516) T protein:vir:10 382 IGGQDTAITRDELDFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVD 461 (516) T ss_pred eccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHH Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCcc Q lcl|NC_015287. 467 VVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQL 521 (563) Q Consensus 467 ~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~ 521 (563) ++++++||||||||++||||+||+|||+||+++++||++|+++|+|++|+++++= T Consensus 462 ~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~e~k~I~~E~~~~~~~~p~~~~~f 516 (516) T protein:vir:10 462 ALSQIEPYVGKYVSHDYVMKNILQMTEEQIAQEEKQIEQEAGIKRFQNPENEDDF 516 (516) T ss_pred HHHHhhhhhccccchHHHHHHHhcCCHhhHHHHHHHHHHhhhCCCCCCCCccccC Confidence 9999999999999999999999999999999999999999999999999666443 No 14 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=100.00 E-value=6.1e-271 Score=1501.99 Aligned_cols=500 Identities=37% Similarity=0.708 Sum_probs=468.7 Q ss_pred Ccccccchh----hHhhhcc-cccccccCcCCCCcccce-------eccccccccccccchhhhHHHHHHHHHHHhhccc Q lcl|NC_015287. 1 MAKLFGFKI----QKDDEQN-KNVVSPVPQSNEDSSDYY-------VSSGFYGQYVDIDGVFKSEFELIKRYREMALHPE 68 (563) Q Consensus 1 m~~~~~f~~----~~~~~~~-~~~~s~~p~~~~d~~~~~-------~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PE 68 (563) |++||||.. .+.+++. ....|++||+++||++.+ +.||++|+|+|++++++|+++||++||+||+||| T Consensus 3 ~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~d~~~~~~~~~~LI~~YR~ma~~pE 82 (516) T protein:vir:10 3 FLDLFKFWDRVDQNEYDERLKQGHESIATPKKDDGATEIEAREGESSYNALMQQFFGIDNNISGTKDLINTYRQLTNNPE 82 (516) T ss_pred chHhcccccchhhHHHHhhhcCCCCcccCCCCccCceeeecCcccccccceeeeeecccCccccHHHHHHHHHHhhhccc Confidence 999999922 2233444 455689999999999877 4589999999999999999999999999999999 Q ss_pred hhHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCC Q lcl|NC_015287. 69 VDSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAP 148 (563) Q Consensus 69 vd~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~ 148 (563) ||+||++|||||||||++++||+|+|+++++|++||++|+|||++|+++|+|+++||+|||+||||||+||||||| || T Consensus 83 vd~Av~eIvneaiv~d~~~~pV~l~l~~~e~s~sik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid--~~ 160 (516) T protein:vir:10 83 VERAVANIVNEAVVYEKGHKVVSLDLDDTEFSSSIKDKILEEFDEICRLLDASRKLDTLFRRWYIDSRIFFHKIMP--NP 160 (516) T ss_pred hhHHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhhhhcceEEEEEEec--Cc Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999998 99 Q ss_pred CcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceee Q lcl|NC_015287. 149 EEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVK 228 (563) Q Consensus 149 k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~ik 228 (563) |+||+|||+||||+|++||++++++ .+++.++++ +.|||+|.+. ...+++.|.+ +.+ ++.|| T Consensus 161 k~GI~elr~lDPr~i~~vR~i~~~~--~~~~~v~~~----------~~e~~~Y~~~-~~~~~~~g~~----~~~-~~~ik 222 (516) T protein:vir:10 161 KEGIVELRRLDPRHVEYYREIVTSD--VGGTSVVKG----------YREFFVYTTG-NEGYAYNGRL----FEP-NTRIK 222 (516) T ss_pred ccceeeeeeeCCcceeeEEeeeccc--Ccchhhhhc----------eeeeeeeecC-ccceeccccc----cCC-CCcee Confidence 9999999999999999999999986 477777764 6899999873 3333433332 222 34699 Q ss_pred ecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhc Q lcl|NC_015287. 229 IAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYR 308 (563) Q Consensus 229 i~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~k 308 (563) ||+||||||||||+|||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++|| T Consensus 223 I~~daI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~K 302 (516) T protein:vir:10 223 IPRSAIVYAHSGLQDCSDRGIVGYLHNAVKPANQLKLLEDALVIYRITRAPERRVFYIDVGNMPNRKATEYVNGIMQSLK 302 (516) T ss_pred cchhheeeeecCcccCCCCceeceehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCcceec Q lcl|NC_015287. 309 NKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNL 388 (563) Q Consensus 309 nklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~ 388 (563) ||+|||++||+|+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++ +|++ T Consensus 303 NklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~SRl~~e~-~~~~ 381 (516) T protein:vir:10 303 NRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVTSLPGAQTMGEMDDVRWFNKKLYEALRIPLSRMPRDD-GGMV 381 (516) T ss_pred ceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCC-Ccee Confidence 999999999999999999999999999999999999999999999999999999999999999999999999986 5555 Q ss_pred --cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHH Q lcl|NC_015287. 389 --GRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLG 466 (563) Q Consensus 389 --g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~ 466 (563) ||++|||||||||+|||.|||+||+.+|.++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++ T Consensus 382 ~~Gr~~EItRDEiKF~KFI~rLR~rFs~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~ 461 (516) T protein:vir:10 382 IGGQDMAITRDELDFRKFIVQLQHNFEEIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVD 461 (516) T ss_pred eccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHH Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCcc Q lcl|NC_015287. 467 VVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQL 521 (563) Q Consensus 467 ~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~ 521 (563) ++++++||||||||++||||+||+|||+||+++++||++|+++|+|++|+++++= T Consensus 462 ~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~~~~~~~~p~~e~~f 516 (516) T protein:vir:10 462 ALSQIEPYVGKYVSHDYVMKNILQMTDEQIAQEEKQIEKEANVKRFQNPENEDDF 516 (516) T ss_pred HHHHhhhhhccccchHHHHHHHhcCCHhHHHHHHHHHHHhhhCCCCCCCCccccC Confidence 9999999999999999999999999999999999999999999999999877554 No 15 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=100.00 E-value=1.8e-269 Score=1493.95 Aligned_cols=503 Identities=38% Similarity=0.680 Sum_probs=465.1 Q ss_pred CcccccchhhHhhhcccccc-cccCcCCCCcccce--------eccccccccccccchhhhHHHHHHHHHHHhhccchhH Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVV-SPVPQSNEDSSDYY--------VSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDS 71 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~-s~~p~~~~d~~~~~--------~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~ 71 (563) +..|.+.+.++.+++.++++ |++||+++||+..+ .+|.++++|+++++.++|+++||++||+||+|||||+ T Consensus 13 ~~~~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~y~~~e~~~~~~~eLI~~YR~ma~~pEvd~ 92 (524) T protein:vir:98 13 FKNFAREDEIELEQQLKNDTGSVAPPKNNDGAYEIETDLNNQKYAGVFQQFYSGQDPAIQNKEQLINTYRGIMSYPEVEN 92 (524) T ss_pred hhhhhhhhhhhHhhhhcCCcccccCCCCCCCceeecCCCCcceecceeeeeccccccccchHHHHHHHHHHHhhccchhh Confidence 44455556666666555444 78899999998544 3455566688999999999999999999999999999 Q ss_pred HHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcc Q lcl|NC_015287. 72 AIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEG 151 (563) Q Consensus 72 Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~G 151 (563) ||++||||||||+++++||+|+|+++++|++||++|+|||++||++|+|+++||+|||+||||||+||||||| +||++| T Consensus 93 Av~eIVneaIv~~~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid-~~~~kG 171 (524) T protein:vir:98 93 AVSEIIDDAIVNEQGKDIITMDLAKTNFSKAIQDKIVEEFDNVLNIYDFDNMGARLFRDWYVDSRIYFHKIMH-KDESKG 171 (524) T ss_pred HHHhhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceeEEEEEEc-CCCCcc Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999 677789 Q ss_pred eeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecc Q lcl|NC_015287. 152 IKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAP 231 (563) Q Consensus 152 I~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~ 231 (563) |+|||+||||+|++||+++++. ..+++.+++ ++.|||+|+|.+.. ++.+|.+.+ ++ ++||||+ T Consensus 172 I~ELr~lDPr~i~~vr~~~~~~-~~~~~~v~~----------~~~e~f~Y~~~~~~-~~~~g~~~~----~~-~~ikI~~ 234 (524) T protein:vir:98 172 IRELRQLDPRCMELIRESITET-LDGGVKVFR----------GYREFFVYSAPKAG-YTYNGQIYQ----AN-QKIKIPR 234 (524) T ss_pred eeeeeeeCCccceeeeeccccc-cccchhhcc----------ceeeeeeeccCCCc-cccccceec----CC-Cceeech Confidence 9999999999999999998875 355666655 47899999997763 444444332 23 4699999 Q ss_pred cceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhccee Q lcl|NC_015287. 232 DAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKL 311 (563) Q Consensus 232 daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knkl 311 (563) ||||||||||+|||++ |+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+ T Consensus 235 dAIvy~hSGL~d~~~~-iisyLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNkl 313 (524) T protein:vir:98 235 SAIVYAHSGLEDCSNN-IIGYLHRAVKPANQLRLLEDAMVIYRITRAPERRVFYIDVGQMGGNKATQYVNNIAQGLKNRV 313 (524) T ss_pred hheeeeccCcccCCCC-eeeehhHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCcee Confidence 9999999999998876 679999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCcceecccc Q lcl|NC_015287. 312 VYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRS 391 (563) Q Consensus 312 vYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~ 391 (563) |||++||+|||++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+.+++||++||| T Consensus 314 vYDa~TGevrddrk~msMlEDyWLpRReGgrgTEItTLpggqnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~ 393 (524) T protein:vir:98 314 VYDARTGTVKNQQNNLSMTEDYWLMRRDGKAITEVSTLPGGQNFSDMDDIKWFNRKLYEALRVPLSRMPRDDGGMQIGGG 393 (524) T ss_pred EeeccCceeeccccccchhhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCceeccCCCCccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999877779999999 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015287. 392 SEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAV 471 (563) Q Consensus 392 ~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~ 471 (563) +|||||||||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|||++|||++|+++++++ T Consensus 394 ~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~ 473 (524) T protein:vir:98 394 GEITRDELKFSKFIRTLQIQFSPVLSDPLKTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQV 473 (524) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccc Q lcl|NC_015287. 472 DPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQ 522 (563) Q Consensus 472 ~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~ 522 (563) +||||||||++||||+||+|||+||+++++||++|+++|+|++|++++++= T Consensus 474 dpyvGky~s~dyi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~p~~e~~~f 524 (524) T protein:vir:98 474 EGVVGKYVSHKYIMKEILRMSDEDIDEQAKLIEEESKEERFKNPEAEEENF 524 (524) T ss_pred ccccccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCcCCccccccC Confidence 999999999999999999999999999999999999999999997776554 No 16 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=100.00 E-value=3.1e-266 Score=1476.18 Aligned_cols=498 Identities=39% Similarity=0.697 Sum_probs=470.8 Q ss_pred CcccccchhhHhhhcccccc-cccCcCCCCcccce--------eccccccccccccchhhhHHHHHHHHHHHhhccchhH Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVV-SPVPQSNEDSSDYY--------VSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDS 71 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~-s~~p~~~~d~~~~~--------~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~ 71 (563) |.-|.+++.++.+++.++++ |++||+++||++.+ ..|+|+|++++.++.++++ |||++||+||+|||||+ T Consensus 1 ~~~w~~~de~~~~~~~~~~~~S~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~-eLI~~YR~ma~~pEvd~ 79 (511) T protein:vir:56 1 MKFWTKEEEQDIQKIEKNPVRSFSAPDNVDGAKEIHTNLLAPQLGHAIIPSDAQSEGTIPVK-ELIKSYRALAEYHEVDD 79 (511) T ss_pred CCCccchhhhhhhhhccCCcccccCCCCCCCceEEecccccceecceeccccccccCccchH-HHHHHHHHHhhccchhh Confidence 88899999999999988887 88899999998766 5677999999999999996 99999999999999999 Q ss_pred HHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcc Q lcl|NC_015287. 72 AIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEG 151 (563) Q Consensus 72 Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~G 151 (563) ||++|||||||||++++||+|+|+++++|++||++|+|||++||++|+|+++||++||+||||||+||||||||+ +| T Consensus 80 Av~eIvne~iv~d~~~~pV~l~ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~k---~G 156 (511) T protein:vir:56 80 AIQEIVDEAIVYENDKEVVWLNLDNTDFSENIKAKINEEFDRVVSLLQMRKHGYKWFRKWYVDSRIYFHKILDKD---NN 156 (511) T ss_pred HHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeccc---cc Confidence 999999999999999999999999999999999999999999999999999999999999999999999999974 59 Q ss_pred eeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecc Q lcl|NC_015287. 152 IKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAP 231 (563) Q Consensus 152 I~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~ 231 (563) |+|||+||||+|++||++++++ .+++.++++ +.|||+|+|++....+..++.|. .+++|+||+ T Consensus 157 I~eLr~lDPr~i~~vr~i~~~~--~~~~~v~~~----------~~ey~~Y~~~~~~~~~~~~~~~~-----~~~~vkI~~ 219 (511) T protein:vir:56 157 IIELRPLNPMKMELVREIQKET--IDGVEVVKG----------TLEYYVYKQSDYKMPSWMSATNR-----AQTSFRIPK 219 (511) T ss_pred eeehhhcCcccchhhhhhhccc--ccccccccc----------eeeeeEecCCCcccCcccccccc-----cccceeech Confidence 9999999999999999999975 467777664 78999999987655555444332 356799999 Q ss_pred cceeeeecceecCC--CCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcc Q lcl|NC_015287. 232 DAISFCHSGLVDRN--KQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRN 309 (563) Q Consensus 232 daI~y~hSGl~d~~--~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~kn 309 (563) ||||||||||+||| +++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||| T Consensus 220 daI~y~hSGL~d~~~~~g~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~kN 299 (511) T protein:vir:56 220 DAIVFAHSGLMRGCADDPYIIGYLDRAIKPANQLKMLEDALVIYRLARAPERRVFYVDVGNLPTQKAQQYVNGIMQNVKN 299 (511) T ss_pred hheeeecccceeccCCCCeeeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCc Confidence 99999999999965 5579999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCC--ccee Q lcl|NC_015287. 310 KLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSG--DGFN 387 (563) Q Consensus 310 klvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~--~g~~ 387 (563) |+|||++||+|+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++ +||+ T Consensus 300 klVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kKLy~aLnVP~SRl~~e~q~~~f~ 379 (511) T protein:vir:56 300 RVVYDTQTGQVKNTTNAMSMLEDYYLPRREGSKGTEVSTLPGGQSLGDIEDVLYFNRKLYKAMRIPTSRAASEDQTGGIN 379 (511) T ss_pred eEEEeccCceeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999775 4799 Q ss_pred ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHH Q lcl|NC_015287. 388 LGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGV 467 (563) Q Consensus 388 ~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~ 467 (563) +|||+|||||||||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|||++|||++|+++ T Consensus 380 ~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~ 459 (511) T protein:vir:56 380 FGQGAEITRDELKFTKFVKRLQTKFETVITDPLKHQLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNA 459 (511) T ss_pred cccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccC Q lcl|NC_015287. 468 VAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQ 519 (563) Q Consensus 468 ~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~ 519 (563) +++++||||||||++||||+||+|||+||+++++||++|+++++|+.|++.= T Consensus 460 l~~~dpyvGky~S~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~e~~f 511 (511) T protein:vir:56 460 MRDIQDYAGKYYSHKYIQKNILRLSDDQITAMQSEIDEEETNPRFQQDDQGF 511 (511) T ss_pred HHHhcchhccccchHHHHHHHhccCHHHHHHHHHHHHHhhcCCCCCCcccCC Confidence 9999999999999999999999999999999999999999999999875543 No 17 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=100.00 E-value=4.8e-232 Score=1288.73 Aligned_cols=495 Identities=20% Similarity=0.325 Sum_probs=441.4 Q ss_pred CcccccchhhHhhhccccccccc-CcCCCCccccee--------ccccccccccccchhhhHHHHHHHHHHHh-hccchh Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPV-PQSNEDSSDYYV--------SSGFYGQYVDIDGVFKSEFELIKRYREMA-LHPEVD 70 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~-p~~~~d~~~~~~--------~~~~~~~~~~~~~~~~~~~~LI~~YR~ma-~~PEvd 70 (563) |..|.+.+.+...++..+++-++ +|.++||++.+. .++.++++++ +.++|+++||++||+|| +||||| T Consensus 4 ~~~w~~~de~~~~~~~~~~~~~~~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~g--g~~~n~~eLI~~YR~ma~~~pEVd 81 (533) T protein:vir:58 4 LEKYKKLNEAVNFTNFLSPMYGMGAPHGAGGSSMIPINMYHPFATAGYASRFYG--GIEFNRFFLYDMYDRMDYTDPLIS 81 (533) T ss_pred cchhhhhhHHHHHHHhhchhhcccCccCCCCCccccCCCCcchhhhhhhhhhhc--cccccHHHHHHHHHHhhccCcchh Confidence 77888888888888877777444 666676664332 1233333332 56799999999999997 689999 Q ss_pred HHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCc Q lcl|NC_015287. 71 SAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEE 150 (563) Q Consensus 71 ~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~ 150 (563) +||++||||||||+++++||+|+|+++++|++||++|+ ++|+|+++||+|||+||||||+||||++ ++||+ T Consensus 82 ~AideIvneaiv~d~~~~pV~v~l~~~e~s~~iK~kI~-------~lldf~~~~~~~fR~WYVDGriy~Hkii--k~~k~ 152 (533) T protein:vir:58 82 TVLDIIADECTIPNENGNIVDVVTKDIELAKAILSYLD-------YVINIEKNAYPIIRNMIKYGDMFLHILE--KGSDG 152 (533) T ss_pred hHHHhhhceeeEecCCCceeEeecccccccHHHHHHHH-------HHhcchhhhhHHHHhhhhcceeEEEecc--CCccc Confidence 99999999999999999999999999999999998764 7999999999999999999999999977 46899 Q ss_pred ceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeec Q lcl|NC_015287. 151 GIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIA 230 (563) Q Consensus 151 GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~ 230 (563) ||+|||+||||+|++||++.++ .+||+|++.+..+ + ++ ...++|| T Consensus 153 GI~elr~lDPr~i~~vr~~~t~-----------------------~eyyvy~~~~~~~-------~----s~-~~~~kI~ 197 (533) T protein:vir:58 153 TIEKFQVVSPYIFSKRYNPETD-----------------------TWYYVITDVYRNV-------V----SG-YFNEDIP 197 (533) T ss_pred chhhheecCCeeeEEEEeeccc-----------------------eEEEeeccccccc-------c----cC-ccccccc Confidence 9999999999999999999886 2899999965321 1 11 1238999 Q ss_pred ccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcce Q lcl|NC_015287. 231 PDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNK 310 (563) Q Consensus 231 ~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk 310 (563) +|+|+||||||+|||+++++||||+|||||||||||||||||||+||||+|||||||||||||.||+|||++||++|||| T Consensus 198 ~daI~y~~SGl~d~~~~~iisyLhkAiKp~NQLkmiEDAlVIYRisRAPeRRvFYIDVGNlpk~KAeqYl~~im~k~kNk 277 (533) T protein:vir:58 198 EEDVIHFSHKIDTNFFPYGRSYLESARAIWNQLRLMEDALMLYRVVRSVDRRVFYVDVGNVPPDKINEYLTNIAMQYKRD 277 (533) T ss_pred hhheeeeeeccccCCCCceehhhhHHHHHHHHHHHHHHHHHHHhhcCChhheEEEEeecCCCccCHHHHHHHHHHhcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEeeCCCCccccccccc---chhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCccee Q lcl|NC_015287. 311 LVYDASTGEIRDDRKHM---SMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFN 387 (563) Q Consensus 311 lvYd~~TGev~d~~~~m---sMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~ 387 (563) +|||++||+|+|++|+| |||||||||||+||||||||||||| |||+|+||+||++|||+|||||+|||++++ +| T Consensus 278 lvYDa~TGev~ddrk~m~~~sMlEDyWLpRReGgrgTEI~TLpGg-~lgemeDV~YF~kkLy~ALnVP~sRl~~e~-~f- 354 (533) T protein:vir:58 278 YWVRNNQNQFLGIDNYFSIESILKDYFIPRRGDRRAVEIDILQGS-KVDLAEDVEYMLNRLISALKVPKAFIGYEG-DV- 354 (533) T ss_pred eEEeccCCeEeeccchhhhhhhHhhhcccccCCCccceeeecCCC-CCCcHHHHHHHHHHHHHHhCCCeeecCCCC-CC- Confidence 99999999999999998 9999999999999999999999987 599999999999999999999999999987 45 Q ss_pred ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHH Q lcl|NC_015287. 388 LGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGV 467 (563) Q Consensus 388 ~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~ 467 (563) ||++|||||||||+|||+|||+||+++| ++||+||||||++|| +|+|++||||+|+|++|||++|+++ T Consensus 355 -gr~~eItRDEiKF~KFI~rLR~rF~~ll----~~qLilk~iit~eew-------~~~f~~Dn~f~ElKe~Eil~~Ri~~ 422 (533) T protein:vir:58 355 -NAKNTLATQDIKFNNTIKRIQGFFVEEL----ERMVRMNKEFADQDF-------RLVMNRSNSIVEGERFAVIEQRIGI 422 (533) T ss_pred -ccchhhhHHHHHHHHHHHHHHHHHHHHH----hcccccccCcchhhe-------eeeeeccchHHHHHHHHHHHHHHHH Confidence 9999999999999999999999998776 559999999999999 6999999999999999999999999 Q ss_pred HHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccC Q lcl|NC_015287. 468 VAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQ 547 (563) Q Consensus 468 ~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~ 547 (563) +++++||||| +||||+||+||| ||++++++||+|+++|+|++|+++++..++++.++.+.+..+|..+++.+-.+ T Consensus 423 l~~~dpyvgk----~yi~k~ILr~td-ei~~q~e~ie~E~~~~~~~~~~~~~e~~~~~~~~~~~~p~~~~~~~~~~~~~~ 497 (533) T protein:vir:58 423 AERLKGWVRE----DWIYSNILQIPY-DLKPQEEVAEAAGGGGLFDTGGFGEETTPADFLGERGSPIESPRGRTEFDFGT 497 (533) T ss_pred HHHhcchhhH----HHHHHHHhcCCh-hhhHHHHHHHHhhcCCCCCCCCcccccCCcccCccccCcccCCCChhhHhccc Confidence 9999999999 689999999999 78888899999999999999999999999999999888888877655443333 Q ss_pred C----------------------CCcccCCCCCCCC Q lcl|NC_015287. 548 E----------------------GGAEQAPQMPEGG 561 (563) Q Consensus 548 ~----------------------~~~~~~~~~~~~~ 561 (563) + .+|+++|+||+-. T Consensus 498 ~~~~~~~~~~~~~~a~~~~~~~~g~~~~~~~~p~~~ 533 (533) T protein:vir:58 498 EGGEELGGELNLGGAFEEFEEETGGGEEELPFPEEE 533 (533) T ss_pred CCcccccccccccccchhhhhhcCCcccCCCCCCCC Confidence 2 4455666666655 No 18 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.52 E-value=6.2e-13 Score=87.50 Aligned_cols=474 Identities=12% Similarity=0.058 Sum_probs=244.7 Q ss_pred Cc---ccccc-h----hhHhhhcccccccccCcCCCCcccceeccccccccccccc-hhhhHHHHHHHHHHH-hhccchh Q lcl|NC_015287. 1 MA---KLFGF-K----IQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDG-VFKSEFELIKRYREM-ALHPEVD 70 (563) Q Consensus 1 m~---~~~~f-~----~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~-~~~~~~~LI~~YR~m-a~~PEvd 70 (563) |. +.-++ + .++.. .+..+. .-+.+.+.--.+.+. ...+.+. ...+...|..+=|.| .++|-+. T Consensus 1 mn~~dr~i~~~sP~~~~~R~~--ar~~~~----~y~aa~~~r~~~~~~-~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~ 73 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKAARLR--SRAVIQ----AYEAVKTTRTHKARR-ENRTADQLSQYGAVSLREQARYLDNNHDLVI 73 (502) T ss_pred CchHhhHHhhcChHHHHHHHh--hHHHHh----hccccCcccccCCCC-CCCChHHHHHHHHHHHHHHHHHHHhcChHHH Confidence 11 11111 1 11110 111000 001111111111111 1112222 223567788888999 6999999 Q ss_pred HHHHHhhcceEEecCCCc-EEEEEeeccccchHHHHHHHHHHHHHHH------HhcchhhhHHHHHHHhhhhhhheeeee Q lcl|NC_015287. 71 SAIEDIINEAIVSDQNDS-PVEIDLENLPASKQLKDLIREEFKTVKE------ILNFDDKCHEILRNWYVDGRIYYHKVI 143 (563) Q Consensus 71 ~Ai~eIvneaiv~d~~~~-pV~l~l~~~~~s~sik~~I~eeF~~i~~------ll~F~~~~~~~~R~WYvDGri~fhkvi 143 (563) .||+.+++-+|=+. .-. ...+...+....+.+.++|..+|+.|++ .++|..--.-.+|.|.+||..|-.++. T Consensus 74 ~av~~~~~nvVG~g-gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~ 152 (502) T protein:vir:79 74 GVFDKLEERVVGKN-GIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVS 152 (502) T ss_pred HHHHHHHHhhccCC-ceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEee Confidence 99999998665322 111 1223345556778888999999999986 455666667799999999999999888 Q ss_pred cCCCC-Ccce---eEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCc Q lcl|NC_015287. 144 DIKAP-EEGI---KEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTP 219 (563) Q Consensus 144 D~~~~-k~GI---~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~ 219 (563) ++... +.|. ..|..|||..|.--+ .......++|+... +..-.-||++.... .. T Consensus 153 ~~~~~~~~g~~~~l~lq~iepd~l~~~~--~~~~~i~~GVe~d~--------~Gr~~aY~i~~~hP---gd--------- 210 (502) T protein:vir:79 153 GRINSLTPSAGVHFWLEALEPDFIPMTS--DESNRLNQGVFVDD--------WGRPEKYLVYKSRP---VS--------- 210 (502) T ss_pred cccCccCCCcccceEEEEecchhcCCCC--CCCCeeEeeeEECC--------CCceEEEEEeecCC---CC--------- Confidence 76532 3444 368889997764321 11222234454422 23344577764311 11 Q ss_pred cccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHH Q lcl|NC_015287. 220 FSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQY 299 (563) Q Consensus 220 ~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqY 299 (563) ......++||.+.|.+.+... ...-.-++|.|..+++.+.+|.-.+||..+-...-|-.=-+..-+.|.-....+ T Consensus 211 -~~~~~~~rvpA~~vlH~f~~~-r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~--- 285 (502) T protein:vir:79 211 -GRQMETKEVDAERMLHLKFVR-RLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDG--- 285 (502) T ss_pred -CcccceeEechhheEEeeccc-CCccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccccc--- Confidence 011345789999997776643 223334589999999999999999999999988888765444444333111000 Q ss_pred HHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCChhh Q lcl|NC_015287. 300 LKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSD-IEYFQKKLYRSLGVPESR 378 (563) Q Consensus 300 l~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sR 378 (563) .+.. +.....+|-.-==+ ..-.-|.+|..+.....-+..++ ++...+.+=.+|+||-.- T Consensus 286 -------------~~~~-----~~~~~~~l~pG~i~--~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~ 345 (502) T protein:vir:79 286 -------------NGSK-----ENERELTIQPGIIY--DDLKPGEEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSS 345 (502) T ss_pred -------------CCCC-----CccccccccCCccc--cccCCCceeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHH Confidence 0000 11111111000000 00122445555554443333333 334444466789999999 Q ss_pred ccCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHhcCCCChhHHHHhhhceEEEe--eccchH Q lcl|NC_015287. 379 IAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLK----TQLILKNIVTPEDWEVLGDHIQYDF--VYDNHF 452 (563) Q Consensus 379 l~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk----~qLiLkgI~t~eew~~~~~~i~~~f--~~Dn~f 452 (563) |..| |+ +.-+.+.---+.|-+.+.++|+.|..-|..++- ...+|.|.++.-.|..-.......| -.--+. T Consensus 346 lt~D---~s-~nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~i 421 (502) T protein:vir:79 346 TARN---YN-GTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWI 421 (502) T ss_pred Hhcc---cc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCcccc Confidence 9765 33 233445555577999999999988877766533 3567888876433433223333333 333345 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCC Q lcl|NC_015287. 453 AELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPG 532 (563) Q Consensus 453 ~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~ 532 (563) -.+||+.-...+++. -+-|.+-+..+--..-++.++|..+..+...+.|+-.+.+ +...++. T Consensus 422 DP~Ke~~a~~~~i~~---------Gl~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~------~~~~~~~--- 483 (502) T protein:vir:79 422 DPVKEAEAWKIQIRG---------GAATESDWVRAGGRNPDDVKRRRKAEIDENRKLDLVFDTD------PASDKGG--- 483 (502) T ss_pred ChHHHHHHHHHHHHc---------CCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCC------CCCCCCC--- Confidence 566766655444442 3445555555544444444444444444334445533211 1100000 Q ss_pred CcCCCCCcccccccCCCCccc Q lcl|NC_015287. 533 GAMNPQDPNMGGAPQEGGAEQ 553 (563) Q Consensus 533 ~~~~p~~~~~~~~~~~~~~~~ 553 (563) ...|..+ ..++.+....|+ T Consensus 484 -~~~~~~~-~e~~~~~~~~e~ 502 (502) T protein:vir:79 484 -SSAATKR-QEPQHTDDQSEE 502 (502) T ss_pred -CCCCCCC-CCCCCCCCCCCC Confidence 0011111 111111122222 No 19 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.46 E-value=5.3e-13 Score=87.88 Aligned_cols=469 Identities=15% Similarity=0.137 Sum_probs=210.2 Q ss_pred Cccccc-chhhHhhhccc---cccccc-Cc--CCCCcc-----c---ceecccccccccc------ccchhhhHHHHHHH Q lcl|NC_015287. 1 MAKLFG-FKIQKDDEQNK---NVVSPV-PQ--SNEDSS-----D---YYVSSGFYGQYVD------IDGVFKSEFELIKR 59 (563) Q Consensus 1 m~~~~~-f~~~~~~~~~~---~~~s~~-p~--~~~d~~-----~---~~~~~~~~~~~~~------~~~~~~~~~~LI~~ 59 (563) |+++-. ++|-...++.. ....+. |. .-+|++ . ....|+...+-.. .......-++|... T Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~gyql~al 116 (765) T protein:vir:96 37 MIKLGKIRGWNVEPEKAPVIRSVKDFLEPGLSVAMDSAYGDGPTPAAKAAAGGQNPYVVPTMLQDWYNSQGFIGYQACAI 116 (765) T ss_pred chhHHHHhhcccccccCCCCCCCCcccCcccceeccccccccccchHHHhhhccCccchhhHHHhhhcccCCccHHHHHH Confidence 444432 23322222211 111111 11 011111 0 0011111111000 11112333577777 Q ss_pred HHHHhhccchhHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHH--HHhhhhhh Q lcl|NC_015287. 60 YREMALHPEVDSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILR--NWYVDGRI 137 (563) Q Consensus 60 YR~ma~~PEvd~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R--~WYvDGri 137 (563) |++ |+.+..||+.+++||+- ..+.|.-++-+.++...++|.. .++-|++..+..+.+| |-|-.|-+ T Consensus 117 Y~~---~~l~rkiVd~pAeDa~R-----~g~~I~~~~~e~~~~~~~~l~~----~~~rl~v~~~l~ea~~~~RlyGga~i 184 (765) T protein:vir:96 117 ISQ---HWLVDKACSMSGEDAAR-----NGWELKSDGRKLSDEQSALIAR----RDMEFRVKDNLVELNRFKNVFGVRIA 184 (765) T ss_pred HHh---CchhhhhhhcchHHhhc-----CCceeecCccccCHHHHHHHHH----HHHHhhHHHHHHHHHHHhhhceeeEE Confidence 765 99999999999999964 3355555544455544444444 4444566777777777 66666665 Q ss_pred heeeeecCCCC-------------CcceeEEEEeCCccceee--eeeeccCCccccceeeecccccccccccccceeeec Q lcl|NC_015287. 138 YYHKVIDIKAP-------------EEGIKEVRYIDPLKIKLV--RKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYD 202 (563) Q Consensus 138 ~fhkviD~~~~-------------k~GI~eLr~lDP~~i~~v--R~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~ 202 (563) ++ .++-.++ +.+++.|..|||..+... -++..+ +..-.|.. .++|..+ T Consensus 185 ~i--~i~~~D~~~l~~PL~~~~I~kg~~kgl~vldp~~~~~~~v~e~~~D--------------p~sp~fg~-P~~y~i~ 247 (765) T protein:vir:96 185 LF--VVESDDPDYYEKPFNPDGIAPGSYKGISQIDPYWAMPQLTAESTAD--------------PSAEHFYE-PDFWIIS 247 (765) T ss_pred EE--EecccCcchhhccccccccccceeeEEEEechhhcccccchhcccc--------------ccccccCc-ceeeeec Confidence 54 2221121 225566777776443321 000000 00001111 1222221 Q ss_pred CccccccccccccCcCccccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHHHH--HHHHHHHHHhcCcc Q lcl|NC_015287. 203 PSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMI--EDSLVIYRLSRAPE 280 (563) Q Consensus 203 ~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~--EDalVIyRi~RAPe 280 (563) ++ ..|.++.|.+....+-+.. -+..+..++|-|+++...+..+... +.+.++++-. . T Consensus 248 -------------g~--~IH~SRli~~~g~~lpd~l---k~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~---~ 306 (765) T protein:vir:96 248 -------------GK--KYHRSHLVVVRGPQPPDIL---KPTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSKR---T 306 (765) T ss_pred -------------Cc--eeccceEEEecCCCchhhh---ccccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhc---c Confidence 11 2344555555555443332 2334455678888876555554322 3445555422 2 Q ss_pred ceEEEccCCCC-chHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHH Q lcl|NC_015287. 281 RRIFYIDVGNL-PKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELS 359 (563) Q Consensus 281 RRvFyIDvGnl-pk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~ 359 (563) +++.+|.... ...++...--+.++++|+ +.| .+-+ +++-+++++. .+|+-++ T Consensus 307 -~v~k~~~~~~l~~~~~l~~r~~~~~~~r~------n~g-------~~~i-----------d~ee~~e~~s--~~lsgl~ 359 (765) T protein:vir:96 307 -STIHVDVEKAIANEDAFNARLAFWIANRD------NHG-------VKVI-----------GIDETMEQFD--TNLSDFD 359 (765) T ss_pred -ceeeechHhhhccHHHHHHHHHHHHHhcC------Cce-------eEEe-----------cCCcceeEEe--cccCCHH Confidence 3566665532 111111111122333332 111 1100 0112344443 3566677 Q ss_pred HH-HHHHHHHHHhcCCChhhccCCC-cceeccccchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCChhHHH Q lcl|NC_015287. 360 DI-EYFQKKLYRSLGVPESRIAGSG-DGFNLGRSSEILRDEIKFTKFVGRMRKR-FSHLFNDMLKTQLILKNIVTPEDWE 436 (563) Q Consensus 360 DV-~YF~~kLy~aL~VP~sRl~~~~-~g~~~g~~~eItRDElKF~KfI~rLr~r-Fs~if~d~Lk~qLiLkgI~t~eew~ 436 (563) |+ ..|...+=-+.+||+.||-+.+ .|+|-+.-. |.-.|+.+|+++|.. +..++..+++. |++-+.+. T Consensus 360 d~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe~----D~~nYyD~I~s~Qe~~l~p~le~L~~l-i~~s~~i~----- 429 (765) T protein:vir:96 360 SVIMNQYQLVAAIAKTPATKLLGTSPKGFNATGEH----ETISYHEELESIQEHIFDPLLERHYLL-LAKSESID----- 429 (765) T ss_pred HHHHHHHHHHHhhhCCCeeeeccCCcccccCcchH----HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcCCC----- Confidence 74 6789999999999999997654 566532211 334499999999965 45555544444 55555432 Q ss_pred HhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhc--------cChhhHHHHHHHHHHHhc Q lcl|NC_015287. 437 VLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILK--------QKDEEVIEIDKQMEKEIK 508 (563) Q Consensus 437 ~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~--------~tdeei~e~~~qie~E~~ 508 (563) ..+.|.|..=..=+|...+||...+.++++.+-.- -.+|.+-+++.+.. +++++++.+. .+..|.. T Consensus 430 ---~d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~--Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~~-~~~pe~~ 503 (765) T protein:vir:96 430 ---VQLEIVWNPVDSTTSQQQAELNNKKAATDEIYINS--GVVSPDEVRERLRDDPRSGYNRLTDDQAETEP-GMSPENL 503 (765) T ss_pred ---CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhc--CCCCHHHHHHHHhccccCCCCCCCcccccccc-CCCcccc Confidence 35888898777778888999999999998876333 25777777775432 2333332110 1110000 Q ss_pred CCCCCCCcc--------cCccccccccccCCCCcCC--C-CCcccccccCCCCcccCCCCCCCC---CC Q lcl|NC_015287. 509 DGKLADPME--------VQQLQLGVHPEQMPGGAMN--P-QDPNMGGAPQEGGAEQAPQMPEGG---EI 563 (563) Q Consensus 509 ~~~~~~p~~--------~~~~~~~~~~~~~~~~~~~--p-~~~~~~~~~~~~~~~~~~~~~~~~---~~ 563 (563) . -+..+++ +...+..+...+..+..++ | .....+.+.+.+.++.+|+.++.. |- T Consensus 504 ~-~~~~~~~~~~~~~~e~~~~~a~p~~~eg~~~~~~~~p~~~~p~~~~~~~~~g~~~~~p~~~~p~~~~ 571 (765) T protein:vir:96 504 A-ELEKAGAQSAKAKGEAERAEAQAGAVEGAGDPVPAAPRGTKPLAKAAEEGAGEAATPPSRPNPRAEL 571 (765) T ss_pred c-cccCCCcccccccCccccccCCCCccCCCCcccccCCcccCCccccccccCccccCccccccccccc Confidence 0 0000000 0000000000000001110 1 001111111122222222211111 11 No 20 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.39 E-value=7.9e-12 Score=81.44 Aligned_cols=458 Identities=10% Similarity=0.048 Sum_probs=195.1 Q ss_pred cccchhhHhhhccccc----------------------------cccc---CcCCC-----Ccc-cceec---------- Q lcl|NC_015287. 4 LFGFKIQKDDEQNKNV----------------------------VSPV---PQSNE-----DSS-DYYVS---------- 36 (563) Q Consensus 4 ~~~f~~~~~~~~~~~~----------------------------~s~~---p~~~~-----d~~-~~~~~---------- 36 (563) +|+|--|++.+-.+.+ .+.+ +|.+. ++. ...+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~ 80 (537) T protein:vir:10 1 MFKFWRKKTVEAVQSSIAERIEPRVGIFGAGDDEKPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTF 80 (537) T ss_pred CCCccccccccccccccccccccccCCCcccchhhHHHHHHhhhhccCCCCCccCcccccccccccchhccccccchhhh Confidence 6666222221110000 0011 11110 110 00110 Q ss_pred cccccccc------cccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEecCCCcEEEEEeecc-ccchHHHHHHHH Q lcl|NC_015287. 37 SGFYGQYV------DIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSDQNDSPVEIDLENL-PASKQLKDLIRE 109 (563) Q Consensus 37 ~~~~~~~~------~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d~~~~pV~l~l~~~-~~s~sik~~I~e 109 (563) |.+.++.. .....-..-++|...|+ .|+++..||+.++++|+. ..+.|..++. +.+....++|.. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~---~~~l~r~iVd~~A~d~~r-----~~~~i~~~~~~~~~~~~~~~l~~ 152 (537) T protein:vir:10 81 SAYANPNLSEGLVLWYAQQAFIGHQMCALIA---THWLVNKACSQMPRDAMR-----KGYKIISDDGNELDPKDAKFIDR 152 (537) T ss_pred hhhccccccchhhhhccccCCccHHHHHHHH---hCchhhhhhhhhhHHhhc-----CCceeecCCcccccHHHHHHHHH Confidence 00001000 00000112256666665 699999999999998843 3344444332 244444444444 Q ss_pred HHHHHHHHhcchhhhHHHHHHHhhhhhhheeeee---cCCC----------CCcceeEEEEeCCccceee--eeeeccCC Q lcl|NC_015287. 110 EFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVI---DIKA----------PEEGIKEVRYIDPLKIKLV--RKLKTDNS 174 (563) Q Consensus 110 eF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkvi---D~~~----------~k~GI~eLr~lDP~~i~~v--R~~~~~~~ 174 (563) + ++-|++..+..+.+|.=-+.|.=+.-..+ |+.. .+.+++.|+.|||..+... .++..+.. T Consensus 153 ~----~~~l~~~~~l~~a~~~~rlyG~~~i~i~v~~~D~~~~~~Pl~~~~i~kg~~k~l~vidp~~~~~~~~~~~~~dp~ 228 (537) T protein:vir:10 153 Y----DRAFNIKKHAIQFVRKGRIFGIRIALFKVDSPDPYYYEKPFNIDGVMPGAYKGIVQIDPYWCAPLLDAQASSNPV 228 (537) T ss_pred H----HHHhhHHHHHHHHHHhcccccceEEEEeecCcCCcccccccccccccccceeEEEEechhhcccccchhhhccCC Confidence 4 44455566655555543344432221122 3221 1335778888888655442 12211110 Q ss_pred ccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeecceecCCCCccchhHH Q lcl|NC_015287. 175 LRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLH 254 (563) Q Consensus 175 ~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~ 254 (563) .-.|.. .++|..+ |. ..|.++.|.+....+-+.+ -.+.+.-++|-|+ T Consensus 229 --------------sp~fg~-P~~y~v~-------------g~--~iH~SRli~f~g~~~p~~~---~~~~~~~G~Svlq 275 (537) T protein:vir:10 229 --------------SMHFYE-PTYWLIN-------------GK--KYHRSHLAIYINDEVVDFL---KPSYIYGGVPLPQ 275 (537) T ss_pred --------------ccccCC-ceeeeec-------------Ce--EecceeEEEecCCCCchhh---hcccCcccccHHH Confidence 011111 1233221 11 2233444444444433322 2344455678888 Q ss_pred HHHHHHHHHHHHHH--HHHHHHHhcCccceEEEccCCC-CchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhh Q lcl|NC_015287. 255 KSIKALNQLRMIED--SLVIYRLSRAPERRIFYIDVGN-LPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLE 331 (563) Q Consensus 255 ~Aik~~NqL~m~ED--alVIyRi~RAPeRRvFyIDvGn-lpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlE 331 (563) ++...+.++..... +.++|+-. =+++.+|... |....+..-.-+.++++|+ +.|- |-+- T Consensus 276 ~~~~~l~~~~~t~~~~~~l~~~~~----~~v~k~~~~~~l~~~~~~~~r~~~~~~~r~------n~g~-------~~id- 337 (537) T protein:vir:10 276 QIMERVYAAERTANEGPMLAMTKR----QTVLKVDAAQVLANKQQFDETMSWWTATRD------NYQV-------RVVD- 337 (537) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcC----CceeeechHHhhcCHHHHHHHHHHHHhhcC------Ccce-------eEec- Confidence 87665554433222 23333322 2366665311 1111111111122333332 1111 1000 Q ss_pred hhcccccCCCCccceeecCCCCCcchHHHH-HHHHHHHHHhcCCChhhccCCC-cceeccccchhhHHHHHHHHHHHHHH Q lcl|NC_015287. 332 DFWLPRREGGRGTEITTLPGGQNLGELSDI-EYFQKKLYRSLGVPESRIAGSG-DGFNLGRSSEILRDEIKFTKFVGRMR 409 (563) Q Consensus 332 DywLpRReGgrgTEIsTLpGg~nLgei~DV-~YF~~kLy~aL~VP~sRl~~~~-~g~~~g~~~eItRDElKF~KfI~rLr 409 (563) .+ +-+++++- .+|+-++|+ ..|...+=.+++||+.||-+.+ .||+- ++ .-|.-.|+.||+++| T Consensus 338 ------~e---~e~~e~~~--~~lsgl~~~l~~~~~~iAa~~~IP~t~L~G~sp~Glna--tG--e~D~~~yyd~I~~~Q 402 (537) T protein:vir:10 338 ------KD---NEDVVQID--TTLNDLDKVIMNQYQLVCAIARTPAPKMLGTVPTGFNS--TG--DYEEASYHEECESTQ 402 (537) T ss_pred ------CC---CceeEEEe--ccCCCHHHHHHHHHHHHHhhhCCCceeeccCCcccccc--ch--hHHHHHHHHHHHHHH Confidence 00 11122221 245556664 5677778888899999995443 34542 11 223344999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHh Q lcl|NC_015287. 410 KRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKIL 489 (563) Q Consensus 410 ~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il 489 (563) .++..++..+++. |+...+..+ ..++|.|..=...++...+|+...+.++++.+-.- -.+|.+-+++.+- T Consensus 403 e~l~p~l~~l~~l-l~~~~~~~~-------~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~--G~i~~~Evr~~L~ 472 (537) T protein:vir:10 403 DDMRPLIDRHHQL-VCRSHLRKR-------IRVKVEFPPMDAPKESERADTFLKKMQAAKLAFEM--GAVDGVDVNEYLR 472 (537) T ss_pred HHHHHHHHHHHHH-HHHhcCCCC-------cceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHc--CCCCHHHHHHHHh Confidence 9888877776643 222222222 35788888777788988999999999888776432 1455555544433 Q ss_pred ccChhhHHHHHHHHHHHhcCCCCCCCcccCccc-cccccccC-CCCcCCCCCcccccccCCCCcccCCCCCCCCCC Q lcl|NC_015287. 490 KQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQ-LGVHPEQM-PGGAMNPQDPNMGGAPQEGGAEQAPQMPEGGEI 563 (563) Q Consensus 490 ~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~-~~~~~~~~-~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 563 (563) ...+. .-.++.+.++.++..+ .......+ +.....|+..+++++... +++++.-..+|-- T Consensus 473 ~~~~~------------g~~~l~~~~~~ed~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~a~ 534 (537) T protein:vir:10 473 MDPTL------------GFTSITPAMRPTDAEDIDVDDEGKPVRIIEDQPAPSEMFGATSS--GESANDPRDSGAA 534 (537) T ss_pred ccCcc------------ccccccCCCChhhhhcccCCccCCcCCCCCCCCCccccCCCCcc--ccccCCCccCccc Confidence 21110 0112222211111111 11111111 111111111222222211 1111111111111 No 21 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.37 E-value=7.5e-12 Score=81.57 Aligned_cols=463 Identities=11% Similarity=0.135 Sum_probs=197.4 Q ss_pred Cc----------------ccccchhhHhhhccccccc-------c-cCcCCCCcccce-----eccccccc-ccc-ccch Q lcl|NC_015287. 1 MA----------------KLFGFKIQKDDEQNKNVVS-------P-VPQSNEDSSDYY-----VSSGFYGQ-YVD-IDGV 49 (563) Q Consensus 1 m~----------------~~~~f~~~~~~~~~~~~~s-------~-~p~~~~d~~~~~-----~~~~~~~~-~~~-~~~~ 49 (563) || +---..-|+..-+.+..+. . +|+..++..... ..++-... .+. .... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~~~ 80 (532) T protein:vir:94 1 MADTDPTPRPEITYATLQQAQRVDAKRATHTSLGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSFVEAT 80 (532) T ss_pred CCCCCCCCCcceehhhhhhHhhhhhhhhhhhhhhhhhhhhhcccccccccccccccccccccccCccccccccccccccc Confidence 11 0000111222111111110 1 122222221111 11111000 000 1222 Q ss_pred hhhHHHHHHHHHHHhhccchhHHHHHhhcceEEecCCCcEEEEEeecc-ccchHHHHHHHHHHHHHHHHhcchhhhHHHH Q lcl|NC_015287. 50 FKSEFELIKRYREMALHPEVDSAIEDIINEAIVSDQNDSPVEIDLENL-PASKQLKDLIREEFKTVKEILNFDDKCHEIL 128 (563) Q Consensus 50 ~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d~~~~pV~l~l~~~-~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~ 128 (563) ....++|...|+ .|+++..||+.|+++|+. ..+.|..++- +..+....+|..+++. |++..+..+.+ T Consensus 81 ~~~~~~l~a~Y~---~~~l~r~~Vd~~aed~~r-----~~~~i~~~~~~~~~~~~~~~i~~~~~~----l~v~~~l~~a~ 148 (532) T protein:vir:94 81 SWPGFPTLALLA---QLPEYRTMHETPADECVR-----AWGKITCSSKDELAADKATRITQKLEQ----YNVRTLVRTVV 148 (532) T ss_pred ccchHHHHHHHH---cCchhhhhhccchHHHhh-----CCceEeeCCccccchHHHHHHHHHHHh----hhHHHHHHHHH Confidence 345667888886 599999999999998864 2334433321 2344455555555443 44455555544 Q ss_pred HHHhhhh--hhheeee-------------ecCCC-CCcceeEEEEeCCccceeeeeeeccCCccccceeeeccccccccc Q lcl|NC_015287. 129 RNWYVDG--RIYYHKV-------------IDIKA-PEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIEN 192 (563) Q Consensus 129 R~WYvDG--ri~fhkv-------------iD~~~-~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~ 192 (563) |.=-+.| -++++.- ++++. .+.++++|..|||..|..--. . . .++..-.| T Consensus 149 ~~~rlyG~a~i~i~v~~~~~~~~~~~p~~l~~~~I~~g~~~~l~vld~~~v~p~~~--~-------~-----~dp~sp~f 214 (532) T protein:vir:94 149 IHDQAYGGAHVFPHLKMDGDSVPADAPLLLSPSFVQRGCLIGFATIEPMWLSPNAY--N-------A-----TDPTLPSF 214 (532) T ss_pred HhhhcccceEEEEEeccCCccccccccccccccccccceeeEEEeechheeccccc--c-------c-----cccccccc Confidence 4322333 2344321 11111 112356677777654322100 0 0 00111111 Q ss_pred ccccceeeecCccccccccccccCcCccccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHH--H Q lcl|NC_015287. 193 PEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDS--L 270 (563) Q Consensus 193 ~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDa--l 270 (563) ..-.-|++. .|. ..|.++.|.+....+-+. +-...+.-++|.|+++...+.+......+ . T Consensus 215 g~P~~y~v~-------------~g~--~iH~SRli~f~g~~~p~~---~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~ 276 (532) T protein:vir:94 215 YKPDSWIAT-------------SGK--KIHSSRIHTVVGRPVGDM---LKAAYSFRGVSISQLAMPYVDNWLRTRQSVSD 276 (532) T ss_pred CCceeEEEc-------------cCe--eeccceEEEecCCCchhh---hccccccccccHHHHHHHHHHHHHHHHHHHHH Confidence 111111111 000 123333333333332221 12223344688898887777766554433 3 Q ss_pred HHHHHhcCccceEEEccCCCCc-hHHHHHHHHH--HHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCcc-ce Q lcl|NC_015287. 271 VIYRLSRAPERRIFYIDVGNLP-KMKAEQYLKE--VMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGT-EI 346 (563) Q Consensus 271 VIyRi~RAPeRRvFyIDvGnlp-k~KAeqYl~~--im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgT-EI 346 (563) ++++.. =.|+.++..++- ....++..+. +++++|. +.|- +-| .+++ ++ T Consensus 277 l~~~~~----~~v~k~~~a~~ls~~~~~~~~~r~~~~~~~~~------n~g~-------~~i-----------d~~~e~~ 328 (532) T protein:vir:94 277 TVKQFS----MTNLATDMAQLLAPGGAQSLDARLQLFNLYRD------NRNI-------GAL-----------DKGTEEI 328 (532) T ss_pred HHHhcC----CceeeechHHhhcchhHHHHHHHHHHHHhhcC------Cccc-------eEE-----------cCCCcee Confidence 355422 234445432221 1111222111 1122221 1121 000 0111 23 Q ss_pred eecCCCCCcchHHHH-HHHHHHHHHhcCCChhhccCCC-cceeccccchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHH Q lcl|NC_015287. 347 TTLPGGQNLGELSDI-EYFQKKLYRSLGVPESRIAGSG-DGFNLGRSSEILRDEIKFTKFVGRMRKR-FSHLFNDMLKTQ 423 (563) Q Consensus 347 sTLpGg~nLgei~DV-~YF~~kLy~aL~VP~sRl~~~~-~g~~~g~~~eItRDElKF~KfI~rLr~r-Fs~if~d~Lk~q 423 (563) +++. .+|+-++|+ ..|...+-.+++||+.||-+.+ +||+-+.-. |.-.|+.||+++|.. +..++..+++.- T Consensus 329 e~~~--~~lsgl~~~l~~~~~~iAaa~~IP~t~LfG~sp~GlnstGe~----D~~~yyd~I~s~Qe~~l~p~le~l~~~l 402 (532) T protein:vir:94 329 QQTN--TPLSGLDSLQAQSQEQMAAVSHIPLVKLLGITPNGLNASSDG----EIRVWYDFIAGYQATNLTPLMEWIIDLI 402 (532) T ss_pred EEEe--cccCCHHHHHHHHHHHHHhHhCCCeeeeecCCcccccccchH----HHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3332 345555554 7889999999999999995442 345431111 333499999999955 566666666433 Q ss_pred HHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccCh---------- Q lcl|NC_015287. 424 LILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKD---------- 493 (563) Q Consensus 424 LiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~td---------- 493 (563) +.......+ ..++|.|..=...++...+|+...+.++++.+-.- -.+|.+-+++.+-...+ T Consensus 403 ~~s~~g~~~-------~d~~~~f~pL~~~s~kEkAei~~~~a~a~~~~~~~--Gvi~~~Evr~~l~~~~~~~~~~~~~~~ 473 (532) T protein:vir:94 403 QLSEYGQID-------PGLAWEWSPLMELDDKELAEVRQLNASTDSTLMEL--GVIDAKMVQQRLAADPTSGYAGALGER 473 (532) T ss_pred HHHhcCCCC-------CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhc--CCCCHHHHHHHHhcCCccccccccccc Confidence 222222112 35778898766677888899999999998776432 36777777775532221 Q ss_pred hhHHHHHHHHHHHhcCCCCCCCcccCcccccccc-ccCCCCcCCCCCcccccccC-CCCcccCCCCCCCCC Q lcl|NC_015287. 494 EEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHP-EQMPGGAMNPQDPNMGGAPQ-EGGAEQAPQMPEGGE 562 (563) Q Consensus 494 eei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~-~~~~~~~~~p~~~~~~~~~~-~~~~~~~~~~~~~~~ 562 (563) +++.+.+.+..+. +.. ..++++++... .+.+.++.+|.+- .++.+ +|++++. |-|.. T Consensus 474 ~~~~~~~~~~~~~-~~~------~~~~~~~~~~~~~~~~~~~~d~~~~--~~~~~~~~~~~~~---~~~~~ 532 (532) T protein:vir:94 474 DELDDVEEIAKQL-MAA------ALNPPATAPQTPNPQPDSEDDQTDN--QPDAQADPAQNDQ---PVGNR 532 (532) T ss_pred cccccccchhhhh-ccc------ccCCCCCCCCCCCCCCCCCCCCCCC--ccCCCccccccCC---CcCCC Confidence 1222222211111 111 11111111110 1112222222221 12222 2333333 33444 No 22 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.31 E-value=1.8e-11 Score=79.44 Aligned_cols=463 Identities=12% Similarity=0.056 Sum_probs=190.6 Q ss_pred Ccccc---------------------------cchhhHhhhccccccccc---C----cCCCCcc-ccee----cccccc Q lcl|NC_015287. 1 MAKLF---------------------------GFKIQKDDEQNKNVVSPV---P----QSNEDSS-DYYV----SSGFYG 41 (563) Q Consensus 1 m~~~~---------------------------~f~~~~~~~~~~~~~s~~---p----~~~~d~~-~~~~----~~~~~~ 41 (563) |+++= ++.-++.........+++ + ....|+- .... .+.... T Consensus 39 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~ 118 (862) T protein:vir:99 39 LARTRQNWPVQKEKPNPIIRSVKDFPFVEISDSVNAKSVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEGKQSS 118 (862) T ss_pred HHhhcccCCcccccCCCCCCcccccccccccccccchhhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhccccccccc Confidence 22221 111111100000000111 0 0000110 0000 000011 Q ss_pred cc------ccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEecCCCcEEEEEe--eccccchHHHHHHHHHHHH Q lcl|NC_015287. 42 QY------VDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSDQNDSPVEIDL--ENLPASKQLKDLIREEFKT 113 (563) Q Consensus 42 ~~------~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d~~~~pV~l~l--~~~~~s~sik~~I~eeF~~ 113 (563) ++ ..........++|...|+ .|+.+..||+.++++|+.. .+.|.- ++-+..+...++|.++++. T Consensus 119 y~~~~~~~~~~~~~~f~gyql~alY~---~~~larkiVd~pAeDatR~-----g~~I~~~~d~~e~~~e~~~~ie~~~~r 190 (862) T protein:vir:99 119 YAVPEALQDWYLSQGFIGHQACALIA---QHWLVDKACSLAGEDAIRN-----GWHLKSLGEGEEIDEESLEKFKAIDVE 190 (862) T ss_pred cccchhccccccccCcccHHHHHHHH---hCchhhhhhhhhhHHHhhC-----CceEeecCcccccCHHHHHHHHHHHHH Confidence 10 011122233456666655 5999999999999998642 222222 1113344444555555544 Q ss_pred HHHHhcchhhhHHHHHHHhhhhhhheeee--ecCCCC-------------CcceeEEEEeCCccceee--eeeeccCCcc Q lcl|NC_015287. 114 VKEILNFDDKCHEILRNWYVDGRIYYHKV--IDIKAP-------------EEGIKEVRYIDPLKIKLV--RKLKTDNSLR 176 (563) Q Consensus 114 i~~ll~F~~~~~~~~R~WYvDGri~fhkv--iD~~~~-------------k~GI~eLr~lDP~~i~~v--R~~~~~~~~~ 176 (563) |+...+..+.+|-=-+.|+ ...+ ++-.++ +.+++.|+.|||..+... .++..+.. T Consensus 191 ----L~v~~~l~eair~~RLyGg--a~ililv~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp~w~~p~~v~~~~~Dp~-- 262 (862) T protein:vir:99 191 ----FKVKENLIEFNRFKNVFGI--RVAIFVVDSEDPDYYEKPFNPDGITPGSYRGISQIDPYWMMPMLTAESTADPS-- 262 (862) T ss_pred ----hhHHHHHHHHHHhcccccc--eEEEEEecCcCchhhhcCcCcccccccceeEEEEechhhhccccccccccccc-- Confidence 4444444444432222333 2233 222222 224677888887554432 11111110 Q ss_pred ccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeecceecCCCCccchhHHHH Q lcl|NC_015287. 177 GAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKS 256 (563) Q Consensus 177 ~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~A 256 (563) +. .|.. .++|..+. . ..+.++.|.+..+.+-+.++ ...+.-++|-|+++ T Consensus 263 -----------sp-~yGk-P~~y~I~g-------------~--~IH~SRliif~g~~vpd~lk---~ay~f~G~SvLe~i 311 (862) T protein:vir:99 263 -----------SQ-FFYE-PEFWIISG-------------Q--KYHRSHLIIARGPQPADILK---PTYIFGGIPLVQRI 311 (862) T ss_pred -----------cc-ccCC-ceeeeecC-------------e--eeccceeEEecCCCchhhhh---ccCCccCccHHHHH Confidence 00 0111 12222211 1 24556666666666655433 34445568888876 Q ss_pred HHHHHHHHHH--HHHHHHHHHhcCccceEEEccCCC-CchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhh Q lcl|NC_015287. 257 IKALNQLRMI--EDSLVIYRLSRAPERRIFYIDVGN-LPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDF 333 (563) Q Consensus 257 ik~~NqL~m~--EDalVIyRi~RAPeRRvFyIDvGn-lpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDy 333 (563) ...+...... .-+.++++.. -+++.+|... |....+-..=.++++++|+- .| .+-+ T Consensus 312 yd~L~~~d~t~~saa~Ll~ka~----l~v~ktd~l~~l~~ed~l~~r~~~~~~~rdN------~G-------i~li---- 370 (862) T protein:vir:99 312 YERVYAAERTANEAPLLAMNKR----TTAIHTDTAKAIANEDKFIQRLMFWVRYRDN------HA-------VKVL---- 370 (862) T ss_pred HHHHHHHHHHHHHHHHHHHHhc----cceeechhHhhhccHHHHHHHHHHHHhccCc------ce-------eEEe---- Confidence 5444333221 2334455432 2344554432 22111111111224444431 11 0000 Q ss_pred cccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCChhhccCCC-cceeccccchhhHHHHHHHHHHHHHHHH Q lcl|NC_015287. 334 WLPRREGGRGTEITTLPGGQNLGELSD-IEYFQKKLYRSLGVPESRIAGSG-DGFNLGRSSEILRDEIKFTKFVGRMRKR 411 (563) Q Consensus 334 wLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl~~~~-~g~~~g~~~eItRDElKF~KfI~rLr~r 411 (563) +++-+++++- .+|+-++| +..|...+=.+.+||+.||-+.+ .|++-+.-. |.-.|..+|+++|.. T Consensus 371 -------D~eEe~e~ls--~slSGL~dll~~~~q~IAaas~IP~tiLfGqspaGlnATGE~----D~~nYyD~I~s~QE~ 437 (862) T protein:vir:99 371 -------GTDETMEQFD--TSLADFDAVIMGQYQLVASIAKTPATKLLGTAPKGFNSTGEF----ETISYHEELESIQEH 437 (862) T ss_pred -------cCCCceeEEe--cccCChHHHHHHHHHHHHhhhCCCceeecccCcccccCchHH----HHHHHHHHHHHHHHH Confidence 0112234333 34444555 57888899999999999996554 455432222 333499999999964 Q ss_pred -HHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhc Q lcl|NC_015287. 412 -FSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILK 490 (563) Q Consensus 412 -Fs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~ 490 (563) +..++..++ .|+......+ ..+.|.|..=..=+|...+|+.....++++.+-.- -.+|.+-++..+.. T Consensus 438 ~L~P~LerL~--~li~~~lg~~-------~d~~ieFnpL~~~sekEkAEi~kk~Aea~~~lv~s--GvispdEvR~~L~~ 506 (862) T protein:vir:99 438 VYMPFLQRHY--LISRLSLGIQ-------HEIDVVMEPVASMTAQQQADLNKTKAEGGKVLIDG--GVISPDEERNRIRD 506 (862) T ss_pred HHHHHHHHHH--HHHHHhcCCC-------CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhc--CCCCHHHHHHHHHh Confidence 444444433 2332222112 35788888777778888899998888888766332 25666666665321 Q ss_pred --------cChhhHHHHHHHHHHHhcCCCCCCCcccCcccc------ccccccCCCCcCC--CCCcccccccCCCCcccC Q lcl|NC_015287. 491 --------QKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQL------GVHPEQMPGGAMN--PQDPNMGGAPQEGGAEQA 554 (563) Q Consensus 491 --------~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~------~~~~~~~~~~~~~--p~~~~~~~~~~~~~~~~~ 554 (563) ++++++++....-.++..+ ...+++....++ +...+ ...+..+ |..+ ...|++..+ T Consensus 507 ~~~~g~~~l~ded~E~d~~~~~e~~~~--~e~~g~a~~~ap~de~~aga~~~-~~e~d~~~~p~~~-----~~~~g~~~~ 578 (862) T protein:vir:99 507 DKRSGYNRLTKEDAEETPGASPENLAA--YQKAGAAQETASAKETQAGAAVT-TAEGDQPNVQMVP-----SMKPGQMVG 578 (862) T ss_pred cCCcCCCCCCcccccccCCCCcccccc--cccCCcccccccccccccccCCc-cccCCcccccccC-----CCCCCCccc Confidence 2333333111000000000 000111000000 00000 0001110 1111 111111111 Q ss_pred CC---CCCCCCC Q lcl|NC_015287. 555 PQ---MPEGGEI 563 (563) Q Consensus 555 ~~---~~~~~~~ 563 (563) +. -..+-++ T Consensus 579 ~t~~~~a~~p~~ 590 (862) T protein:vir:99 579 PEVGITAPMPED 590 (862) T ss_pred cccccccCCCcc Confidence 11 1111111 No 23 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.22 E-value=4.1e-10 Score=72.07 Aligned_cols=423 Identities=13% Similarity=0.115 Sum_probs=198.6 Q ss_pred hhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEecCCCcEEEE Q lcl|NC_015287. 13 DEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSDQNDSPVEI 92 (563) Q Consensus 13 ~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d~~~~pV~l 92 (563) +....+..+++.- .++ ......+.+++... .+-.+|...| +.|+.+..||+.++.+|+-+- +.| T Consensus 1 ~~~~D~~~~~~~~----~g~-~~~~~~~~~~~~~~---~~~~~l~a~Y---~~~~l~~~~vd~~a~d~~r~~-----~~i 64 (437) T protein:vir:52 1 MKFFDGIKSLALK----LGS-KQEQTYYSPSLSLT---DDLVQLEALW---RDNWIANKVCIKRPEDMVRNW-----REI 64 (437) T ss_pred CchhhhhHhHHhc----CCC-ccccceeecCcccc---ccHHHHHHHH---HhCchhhHHhhcchHHhhcCC-----ceE Confidence 0000011111110 000 01111222222221 3445666666 579999999999999887543 333 Q ss_pred EeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecC------CCCCcceeEEEEeCCccceee Q lcl|NC_015287. 93 DLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDI------KAPEEGIKEVRYIDPLKIKLV 166 (563) Q Consensus 93 ~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~------~~~k~GI~eLr~lDP~~i~~v 166 (563) .-++ ... .-.+.|...++-|++..+..+.+|-=-+.|.=+.-.++|- .+++.+++.++.|||.+|.++ T Consensus 65 ~~~d--~~~----~~~~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~ 138 (437) T protein:vir:52 65 YSND--LNS----KQLDLFTKFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPT 138 (437) T ss_pred ecCC--CCH----HHHHHHHHHHHhhcHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechhhcccc Confidence 2222 222 2223455555566677777766653334555333335553 235678899999998877655 Q ss_pred eeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeecceecCCC Q lcl|NC_015287. 167 RKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNK 246 (563) Q Consensus 167 R~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~ 246 (563) -....+. . + -.|. -.++|.+++. +.+...|.++.|.+....+. .+.+. T Consensus 139 ~~~~~dp-----~----s-----~~fg-~p~~y~v~~~-----------~~~~~iH~SRii~~~~~~~~------~~~~~ 186 (437) T protein:vir:52 139 GTKDDDV-----L----S-----PNFG-RYSEYSILGG-----------SQSITVHHSRLIILNANDAP------LSDND 186 (437) T ss_pred ccccccc-----c----c-----cccC-cceEEEEecC-----------CcceeEccceeEEecCccCC------Ccccc Confidence 2221111 1 0 0111 1234444220 11112333333333222221 23345 Q ss_pred CccchhHHHHHHHHHHHHHHHHH--HHHHHHhcCccceEEEccCC--CCch--HHHHHHHHHHHHhhcceeEeeCCCCcc Q lcl|NC_015287. 247 QTILSYLHKSIKALNQLRMIEDS--LVIYRLSRAPERRIFYIDVG--NLPK--MKAEQYLKEVMNRYRNKLVYDASTGEI 320 (563) Q Consensus 247 ~~~~syL~~Aik~~NqL~m~EDa--lVIyRi~RAPeRRvFyIDvG--nlpk--~KAeqYl~~im~k~knklvYd~~TGev 320 (563) ..+.|.|+++...+..+...+.+ .++++ +..+ ++.++.- .|.. ..+..-..+.++++|+ T Consensus 187 ~~G~s~le~~~~~i~~~~~~~~~~~~l~~~---~~~~-v~k~~~l~~~l~~~~~~~~~~~~~~~~~~~~----------- 251 (437) T protein:vir:52 187 IWGVSDLEKIIDVLKRFDSASVNVGDLIFE---SKID-IFKIAGLSDKIAAGMENEVASVISAVQEIKS----------- 251 (437) T ss_pred ccCCchHHHHHHHHHHHHHHHHHHHHHHHH---cCCC-ceecchHHHHhcCCcHHHHHHHHHHHHHhcC----------- Confidence 55789999987777555444433 33443 4333 4555420 1111 1111111222333321 Q ss_pred cccccccchhhhhcccccCCCCccceeecCCCCCcchHHHH-HHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHH Q lcl|NC_015287. 321 RDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDI-EYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEI 399 (563) Q Consensus 321 ~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV-~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDEl 399 (563) ....+-| ..+-+++++. .+|+-++|+ ..|+..+=.+.+||+.||-+.+.+ |.++ =.-|.- T Consensus 252 --~~~~~~~-----------d~~~~~e~~~--~~~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s~~---Glas-ge~D~~ 312 (437) T protein:vir:52 252 --ATNSLLL-----------DAENEYDRKE--LTFTGLKDLLTEFRNAVAGAADMPVTILFGQSVS---GLAS-GDEDIQ 312 (437) T ss_pred --CCceEEE-----------cCCcceEEEe--cCcCCHHHHHHHHHHHHHHHhcCchhhhcCcCcc---cccc-cHHHHH Confidence 1111111 0112334432 245556664 688999999999999999654321 2222 123444 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchh Q lcl|NC_015287. 400 KFTKFVGRMRKR-FSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKY 478 (563) Q Consensus 400 KF~KfI~rLr~r-Fs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy 478 (563) .|..+|+++|.. +..+...+++. |++. +|-.+-+.++|.|..=..-++...+|+...+.++++.+-.- | . T Consensus 313 ~yyd~i~~~Qe~~l~p~le~l~~~-i~~~------~~g~~~~~~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~-g-~ 383 (437) T protein:vir:52 313 NYHEAIRRLQETRLRPIFEIIDPL-ICNE------LFGGLPADWWFEFVPLTTVKQEQQINMLNTFATAANTLIQN-G-V 383 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-HHHH------hcCCCCCcceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhc-C-C Confidence 499999999964 56666665552 3222 12222245888998777778888889988888887654322 1 2 Q ss_pred hhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccCCC Q lcl|NC_015287. 479 FSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAPQ 556 (563) Q Consensus 479 ~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~ 556 (563) ++++|+. +.+.+ .|.|+.=.++ +........+..+..+.|.+. ..+++++++|+ T Consensus 384 ------------i~~~e~r---~~L~~---~g~~~~i~~~-~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~ 437 (437) T protein:vir:52 384 ------------LNEYQIA---NELRE---SGLFANISAE-HIEELKNADEFAGNFEEPEKM-----EGAQVQNSEDQ 437 (437) T ss_pred ------------CCHHHHH---HHHHh---cCCCCCCCcc-ccccccCCCCCCCccCCCCCC-----CCCCCCCCCCC Confidence 3333332 22222 2555411111 111111112222222222222 11334445555 No 24 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.14 E-value=6.9e-10 Score=70.80 Aligned_cols=404 Identities=15% Similarity=0.165 Sum_probs=184.4 Q ss_pred cCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEecCCCcEEEEEeeccccchH Q lcl|NC_015287. 23 VPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSDQNDSPVEIDLENLPASKQ 102 (563) Q Consensus 23 ~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~s 102 (563) |+..+.|+-.....|+.-+.+.... .-+...+|...| +.|+.+..||+.++++|+-+ .+.|.- +. T Consensus 1 ~~~~~~d~~~~~~~~~~~~~~~~~~-~~~~~~~l~a~Y---~~~~l~~~~Vd~~aed~~r~-----g~~i~g------~~ 65 (427) T protein:vir:10 1 MKIVKHDGYNDIFNGGADGSPKPFF-MSDASYHVGSFY---NDNATAKRIVDVIPEEMVTA-----GFKMSG------VK 65 (427) T ss_pred CCccccchHHHHhhcCCCCcccCcc-ccCchHHHHHHH---HcCchhhhhhccchHHhhcC-----CccccC------cc Confidence 6777777644333333322222111 112344666555 46999999999999999843 233321 11 Q ss_pred HHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhh--hheeeeec------CCCCCcceeEEEEeCCccceeeeeeeccCC Q lcl|NC_015287. 103 LKDLIREEFKTVKEILNFDDKCHEILRNWYVDGR--IYYHKVID------IKAPEEGIKEVRYIDPLKIKLVRKLKTDNS 174 (563) Q Consensus 103 ik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGr--i~fhkviD------~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~ 174 (563) -+++ |+..++.|++..+..+.+|.=-+.|. ++++ |-| |.+++.+++.|+.+||..+..- ++..+ T Consensus 66 ~~~~----~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~-v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~-~~~~d-- 137 (427) T protein:vir:10 66 DEKE----FKSLWDSYKLDSSLVDLLCWARLYGGAAMVAI-IKDNRMLTSQAKPGAKLEGVRVYDRFAITVE-KRVTN-- 137 (427) T ss_pred HHHH----HHHHHHHhhHHHHHHHHHHhccccceeEEEEE-ecCCCccccccCCCcceeEEEEechhccccc-ccccC-- Confidence 2234 44444556777777777776556664 3333 223 4467889999999998665332 11111 Q ss_pred ccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeecceecCCCCccchhHH Q lcl|NC_015287. 175 LRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLH 254 (563) Q Consensus 175 ~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~ 254 (563) +..-.|.. .++|..++.. ++..-..|.++.+.+.-..+.+. +-..++..+.|.|. T Consensus 138 ------------p~s~~fg~-P~~y~v~~~~---------~~~~~~iH~SRli~~~g~~~p~~---~~~~~~~~G~S~l~ 192 (427) T protein:vir:10 138 ------------ARSPRYGE-PEIYKVSPGD---------NMQPYLIHHSRVFIADGERVAQQ---ARKQNQGWGASVLN 192 (427) T ss_pred ------------ccccccCc-ceEEEEecCC---------CCcceEEccccEEEecCCCchhh---hcccCCcccchhhh Confidence 11111111 2344443210 00111222333333332222221 11123333456666 Q ss_pred HHHHHHHHHHHHHH-----HHHHHHHhcCccceEEEc-cCCCC---chH--HHHHHHHHHHHhhcceeEeeCCCCccccc Q lcl|NC_015287. 255 KSIKALNQLRMIED-----SLVIYRLSRAPERRIFYI-DVGNL---PKM--KAEQYLKEVMNRYRNKLVYDASTGEIRDD 323 (563) Q Consensus 255 ~Aik~~NqL~m~ED-----alVIyRi~RAPeRRvFyI-DvGnl---pk~--KAeqYl~~im~k~knklvYd~~TGev~d~ 323 (563) +++ ++.|+-+|- +.++++- -.+ |+.+ +++++ +.. .+..-+.. +.+.|+ .+|-+ T Consensus 193 ~~~--~~~i~~~~~~~~~~~~l~~k~---~~~-v~k~~~l~~~~~~~~~~~~~~~r~~~-~~~~~~------~~~~~--- 256 (427) T protein:vir:10 193 KSL--IDAICDYDYCESLATQILRRK---QQA-VWKVKGLAEMCDDDDAQYAARLRLAQ-VDDNSG------VGRAI--- 256 (427) T ss_pred HHH--HHHHHHHHHHHHHHHHHHHHh---ccc-cccchhHHHHhcCccchHHHHHHHHH-HHHhcC------cccce--- Confidence 542 454544443 3345543 222 3344 33221 111 11111111 111111 11111 Q ss_pred ccccchhhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCChhhccCCC-cceeccccchhhHHHHHH Q lcl|NC_015287. 324 RKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSD-IEYFQKKLYRSLGVPESRIAGSG-DGFNLGRSSEILRDEIKF 401 (563) Q Consensus 324 ~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl~~~~-~g~~~g~~~eItRDElKF 401 (563) .+ | +.+-+++++. .+|+-++| +..|...+=.+.+||+.||-+.+ +|+|- -+ .-|.-.| T Consensus 257 ----~l--~--------~~~e~~e~~~--~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~sp~Glns-tg---d~D~~ny 316 (427) T protein:vir:10 257 ----GI--D--------AETEEYDVLN--SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSA-SQ---NTALETF 316 (427) T ss_pred ----ee--e--------cCCCceeEEe--cccCChHHHHHHHHHHHHhhhCCCeeeeccCCcccccc-ch---hHHHHHH Confidence 00 0 1112233331 23444555 47899999999999999995442 24431 11 1244459 Q ss_pred HHHHHHHHHH-HHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhh Q lcl|NC_015287. 402 TKFVGRMRKR-FSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFS 480 (563) Q Consensus 402 ~KfI~rLr~r-Fs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S 480 (563) ..+|+++|.. +..+...+++ ||.. + +.++|.|..-..=+|...+||...+.++++.+-.- | .++ T Consensus 317 yd~i~~~Qe~~l~p~l~~l~~--~i~~---s--------~~~~~~f~pL~~~s~kEkaei~~~~a~a~~~~~~~-g-vi~ 381 (427) T protein:vir:10 317 YKLVDRKREEDYRPLLEFLLP--FIVD---E--------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITE-Q-IID 381 (427) T ss_pred HHHHHHHHHHHHHHHHHHHHH--Hhhc---C--------CCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhc-C-CCC Confidence 9999999954 4444433333 2221 1 35778899888888989999999999888765433 1 233 Q ss_pred hHHHHHHHhccChhhHHHHHHHHHHHhc-CCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccCCCCCC Q lcl|NC_015287. 481 LDYVRRKILKQKDEEVIEIDKQMEKEIK-DGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAPQMPE 559 (563) Q Consensus 481 ~~~i~k~Il~~tdeei~e~~~qie~E~~-~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 559 (563) .+-+++. +..... .|.. +.++...++.+ .+..+ +|+.++-..-.| T Consensus 382 ~~e~r~~---------------L~~~~~~~~~~-~~~~~~~e~~~-----------------~~~e~-~p~~~e~~~d~~ 427 (427) T protein:vir:10 382 LEEARDT---------------LRSIAPEFKLK-DGNNINIREPE-----------------ETTEP-EPGLGEKLEDEN 427 (427) T ss_pred HHHHHHH---------------HHhhhccccCC-CCccccccccc-----------------hhcCC-CCCCCCCCCCCC Confidence 3333321 111111 1111 11111111110 00001 111111111111 No 25 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.10 E-value=2.1e-09 Score=68.14 Aligned_cols=480 Identities=12% Similarity=0.082 Sum_probs=237.5 Q ss_pred ccchhhHhhhcccccccccCc-C------------CCCcccce-eccccccccccc-cchhhhHHHHHHHHHHH-hhccc Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQ-S------------NEDSSDYY-VSSGFYGQYVDI-DGVFKSEFELIKRYREM-ALHPE 68 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~-~------------~~d~~~~~-~~~~~~~~~~~~-~~~~~~~~~LI~~YR~m-a~~PE 68 (563) |++ ....++++.| - .=+++... -..++.. ..+. .....+...|..+=|.| .++|- T Consensus 1 Mn~--------iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r~~~~~~~-~~s~~~~i~~~~~~lr~RaRdL~rNn~~ 71 (548) T protein:vir:95 1 MNL--------IDRLLEPLAPELVARRLAAREAIQAYEAARPGRTHKAKRQ-PLGADTSLQKSAVSMREQCRKLDEDHDL 71 (548) T ss_pred Cch--------HHhHhhhcchHHHHHHHHhHHHhccccccCccccccccCC-CCChHHHHHHHHHHHHHHHHHHHhcChH Confidence 222 2222222211 0 00111110 1111111 1121 22224567788888888 88999 Q ss_pred hhHHHHHhhcceEEec-CCCcEEEEEeeccccchHHHHHHHHHHHHHHHH------hcchhhhHHHHHHHhhhhhhheee Q lcl|NC_015287. 69 VDSAIEDIINEAIVSD-QNDSPVEIDLENLPASKQLKDLIREEFKTVKEI------LNFDDKCHEILRNWYVDGRIYYHK 141 (563) Q Consensus 69 vd~Ai~eIvneaiv~d-~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l------l~F~~~~~~~~R~WYvDGri~fhk 141 (563) +..||+-+++-+|=.. -.-.|-.+..+. +..+.+.+.|...|..+++- ++|-.--.-.+|.|.+||..|-.+ T Consensus 72 a~~av~~~~~nvVG~~G~~i~p~~l~~d~-~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~ 150 (548) T protein:vir:95 72 VTGLLDRLEERVVGGSGIGVEPLPLRLDG-SVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQK 150 (548) T ss_pred HHHHHHHHHHhccCccccceeeeecCCCH-HHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEe Confidence 9999999988655321 122232333332 34566778899999999863 456555667999999999999888 Q ss_pred eecCCCC-Ccce---eEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCc Q lcl|NC_015287. 142 VIDIKAP-EEGI---KEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQ 217 (563) Q Consensus 142 viD~~~~-k~GI---~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~ 217 (563) ..++... ..|. ..|..|+|..|.-=..-. .....++|+.... ..-.-||++... +...... T Consensus 151 ~~~~~~~~~~g~~~~~~lqliepd~l~~~~~~~-~~~i~~GIE~D~~--------Grp~aY~i~~~h---Pgd~~~~--- 215 (548) T protein:vir:95 151 LMGRVPNYTFATSVPFALELLEPDYLPFSYNNL-SKGIVQGIERDTW--------RRKRAYHLLKDH---PGNLQTL--- 215 (548) T ss_pred eecccccccCCcccceEEEEechhhcCCCCCCC-CCceeeeeEECCC--------CceEEEEEeecC---CCccccc--- Confidence 7765421 2233 467889997764211111 1122334444222 223346666431 1111110 Q ss_pred CccccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHH Q lcl|NC_015287. 218 TPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAE 297 (563) Q Consensus 218 ~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAe 297 (563) ......++++.+.|.+.+.-. ...-.-++|.|..+++.+.+|.-.+||..+-...-|-.=-+..=+-+. ... T Consensus 216 ---~~~~~~~rvpA~~VlHif~~~-r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~---~~~- 287 (548) T protein:vir:95 216 ---GGSLAVKRVEAERIIHIAYRK-RIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPD---SYT- 287 (548) T ss_pred ---ccccceeeechhHheeccccc-CCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCc---ccc- Confidence 112345789999997775532 222334579999999999999999999999988888764333322221 100 Q ss_pred HHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCc---------chHHHH-HHHHHH Q lcl|NC_015287. 298 QYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNL---------GELSDI-EYFQKK 367 (563) Q Consensus 298 qYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nL---------gei~DV-~YF~~k 367 (563) ...+ -.+.....+| .-|+=|.+|+.|+.+ +..++. +-..+. T Consensus 288 -----------------~~~~-~~~~~~~~~~-----------~pG~iv~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~ 338 (548) T protein:vir:95 288 -----------------VEPG-KDRKNRTIPI-----------APGMVFDDLEPGEDVGMIESNRPNPFLEGFRNGQLRM 338 (548) T ss_pred -----------------CCCC-cccccccccc-----------cCCccccccCCCceeeecCCCCCCCCHHHHHHHHHHH Confidence 0000 0011111111 123435555554433 232332 223333 Q ss_pred HHHhcCCChhhccCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHhcCCCChhHHHHhhhceE Q lcl|NC_015287. 368 LYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKT----QLILKNIVTPEDWEVLGDHIQ 443 (563) Q Consensus 368 Ly~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~----qLiLkgI~t~eew~~~~~~i~ 443 (563) +=.+|+||-+-|..| |+ +.=|.+.-.-+.|-+.+.++|..|..-|..++-. ..+|+|.++.-.|..-..... T Consensus 339 IAaglGipYe~ltgD---~s-~nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~ 414 (548) T protein:vir:95 339 IGAGTRSTYSSVSRA---YD-GTYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYA 414 (548) T ss_pred HHhhcCCCHHHHhcc---cc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhhee Confidence 456799999999765 33 2334455556779999999999988777665433 467888775322322233445 Q ss_pred EEeec--cchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHh----cCCCCCC--C Q lcl|NC_015287. 444 YDFVY--DNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEI----KDGKLAD--P 515 (563) Q Consensus 444 ~~f~~--Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~----~~~~~~~--p 515 (563) ..|.. --+.-.+||+.-...+++. -+-|.+-+..+ ...|.++..+|+..|. ..|+..+ | T Consensus 415 ~~W~~P~~~~iDP~Kea~A~~~~i~~---------Gl~T~~~~~a~----~G~D~~ev~~q~a~E~~~~~~~GL~~~~~~ 481 (548) T protein:vir:95 415 AVYQGPVMPWINPMHEANAWELLVKA---------GFADEAEVARA----RGRDPRELKKSRETEIKANRAAGLVFSSDA 481 (548) T ss_pred eeeecCCccccChHHHHHHHHHHHHc---------CCCCHHHHHHH----hCCCHHHHHHHHHHHHHHHHHcCCCCCCcc Confidence 55543 3345667877666555543 34455544444 3334444444444443 2343221 1 Q ss_pred c-----ccCccccccccccCCCCcCCCCCc-----ccccccCCCCcccCCCCCC-CCCC Q lcl|NC_015287. 516 M-----EVQQLQLGVHPEQMPGGAMNPQDP-----NMGGAPQEGGAEQAPQMPE-GGEI 563 (563) Q Consensus 516 ~-----~~~~~~~~~~~~~~~~~~~~p~~~-----~~~~~~~~~~~~~~~~~~~-~~~~ 563 (563) - ....+...+.....+++..-|.+- +--+|--|.++..-|.-.| ||-= T Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 540 (548) T protein:vir:95 482 YHQLVKSGMDPVEAVQKVYLGVGKMLTADEARELVNRYGAGLPVPGPDFPNESNNGGAD 540 (548) T ss_pred cccccccccCCCCchhhhccccccccccchhHHhhccCCCCCcCCCCCCCcccccCCCC Confidence 0 001111111111111111111110 0011111122221111111 1111 No 26 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.09 E-value=1.9e-09 Score=68.46 Aligned_cols=397 Identities=14% Similarity=0.135 Sum_probs=179.1 Q ss_pred cCcCCCCcccceec-----cccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEecCCCcEEEEEeecc Q lcl|NC_015287. 23 VPQSNEDSSDYYVS-----SGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSDQNDSPVEIDLENL 97 (563) Q Consensus 23 ~p~~~~d~~~~~~~-----~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d~~~~pV~l~l~~~ 97 (563) |. ..|+-..... +.+++++.. ++..+|...| +.|+.+..||+.++++|+ ...+.|. T Consensus 1 ~~--~~D~~~n~~~gg~~~~~~~~~~~~-----~~~~~l~a~Y---~~~~l~~~~Vd~~aed~~-----r~g~~i~---- 61 (422) T protein:vir:10 1 MV--KTDSYANIFLGGSDGSEIYGSLQN-----QAPTILASLY---ADNALVRRIIDTIPETAL-----AAGFHID---- 61 (422) T ss_pred Cc--cchhhHHHHcCCCCCccccCcccc-----cCHHHHHHHH---HhChhhHHHHhhhhHHHh-----cCCcccc---- Confidence 11 1122111111 112222211 3446666666 579999999999999996 2233332 Q ss_pred ccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhh--hheeeeecCC------CCCcceeEEEEeCCccceeeeee Q lcl|NC_015287. 98 PASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGR--IYYHKVIDIK------APEEGIKEVRYIDPLKIKLVRKL 169 (563) Q Consensus 98 ~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGr--i~fhkviD~~------~~k~GI~eLr~lDP~~i~~vR~~ 169 (563) ++.-++++.++++ .|++..+..+.+|.=-+.|. ++++ |-|.+ +++..|+.|+.+||..+...... T Consensus 62 --~~~~~~~~~~~~~----~l~~~~~l~~a~~~~rl~G~a~i~i~-v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~~~~ 134 (422) T protein:vir:10 62 --GIDDEPAFWSRWD----DLEMTQNINDAWSWARLFGGAAIVAI-VKDNRALTSPVREGAELETVRVYDRTQVKVQTRE 134 (422) T ss_pred --CCCHHHHHHHHHH----HhhHHHHHHHHHHhhccccceEEEEE-ecCCCCccccccccCceeeEEeeccccccchhcc Confidence 2333456666665 35667777776665545554 4443 22432 45677889999998765542110 Q ss_pred eccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeecceecCCCCcc Q lcl|NC_015287. 170 KTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTI 249 (563) Q Consensus 170 ~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~ 249 (563) . ++..-.|.. .++|.+++.. .+..-..|.++.+.+..+.+-+. +-..+..-+ T Consensus 135 -~--------------dp~s~~fg~-P~~y~v~~~~---------~~~~~~iH~SRli~~~g~~~p~~---~~~~~~~~G 186 (422) T protein:vir:10 135 -E--------------NPRNARFGE-PLTYRITTNE---------SDMFYDVHYSRIHIIDGERIPNV---MRRQNDGWG 186 (422) T ss_pred -c--------------CccccccCc-ceEEEEecCC---------CCcceeeccceeEEeCCCCchhh---hcccCCccc Confidence 0 111111112 2344443210 01111223333333333333221 222233344 Q ss_pred chhHHHH-HHHHHHHH-HHHH-HHHHHHHhcCccceEEEccC-CCCc--hHHHHHHHH--HHHHhhcceeEeeCCCCccc Q lcl|NC_015287. 250 LSYLHKS-IKALNQLR-MIED-SLVIYRLSRAPERRIFYIDV-GNLP--KMKAEQYLK--EVMNRYRNKLVYDASTGEIR 321 (563) Q Consensus 250 ~syL~~A-ik~~NqL~-m~ED-alVIyRi~RAPeRRvFyIDv-Gnlp--k~KAeqYl~--~im~k~knklvYd~~TGev~ 321 (563) .|-|.++ ..-+..+. .++- +.++++-. =+|+.++. .++- ..+..+-+. ..+.++|+ .+|-+. T Consensus 187 ~S~l~~~~~~~i~~~~~~~~~~~~l~~~~~----~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~------~~~~~~ 256 (422) T protein:vir:10 187 RSVLSSDILDSIKDYTNCERLATQLLKRKQ----QAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSG------VGQAIG 256 (422) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHhc----cccccchhHHHhcCCccchHHHHHHHHHHHHhcC------Ccccee Confidence 6767654 23222222 2222 34455542 23455552 2210 011111111 11222221 112111 Q ss_pred ccccccchhhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCChhhccCCC-cceeccccchhhHHHH Q lcl|NC_015287. 322 DDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSD-IEYFQKKLYRSLGVPESRIAGSG-DGFNLGRSSEILRDEI 399 (563) Q Consensus 322 d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl~~~~-~g~~~g~~~eItRDEl 399 (563) +. +.+-+++++. .+|+-++| +..|...+-.+.+||+.||-+.+ +||| .+++ -|-- T Consensus 257 -------l~----------~~~e~~e~~~--~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~Gln--atgd--~d~~ 313 (422) T protein:vir:10 257 -------ID----------AESEEYSVLN--SDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGVS--SSQN--TALE 313 (422) T ss_pred -------Ee----------cCCcceEEEe--cccCChHHHHHHHHHHHHhhhCCCeeeeccCCccccc--ccch--HHHH Confidence 10 0111233331 23444455 57899999999999999995443 2443 2111 2223 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchh Q lcl|NC_015287. 400 KFTKFVGRMRKR-FSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKY 478 (563) Q Consensus 400 KF~KfI~rLr~r-Fs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy 478 (563) .|+.+|+++|.. +..+...+++. |.. . +.++|.|..-..=+|...+|+...+.++++.+-.- | . T Consensus 314 ~yyd~i~~~Qe~~l~p~l~~l~~~--i~~----s-------~~~~~~f~pL~~~sekekaei~~~~a~a~~~~~~~-g-~ 378 (422) T protein:vir:10 314 TFHKLVDRKRNAELLPILEFLIPF--IVN----A-------EEWSVEFNPLAQESSKDKAEILEKNVNSIAALIAA-G-A 378 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH--hcc----c-------CCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhc-C-C Confidence 499999999964 45554444443 221 1 35778898888788888899998888888765332 1 2 Q ss_pred hhhHHHHHHHhccChhhHHHHHHHHHHHhc-CCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcc Q lcl|NC_015287. 479 FSLDYVRRKILKQKDEEVIEIDKQMEKEIK-DGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAE 552 (563) Q Consensus 479 ~S~~~i~k~Il~~tdeei~e~~~qie~E~~-~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 552 (563) +|.+-+++.+ .+... .+.+. ...+++.+.... ..+|.++|++. T Consensus 379 i~~~e~r~~L---------------~~~~~~~~~~~-~~~~~~~~~~~~---------------~~~~~~~~~~d 422 (422) T protein:vir:10 379 MDIDEARDTL---------------RTIAPEVKIND-GSVETEVTISET---------------SNDPLEVPTDD 422 (422) T ss_pred CCHHHHHHHh---------------hhhcccccCCC-CCCccccchhhc---------------CCCCCCCCCCC Confidence 3333333222 11111 11111 111111111000 00001111111 No 27 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=98.99 E-value=7.1e-09 Score=65.26 Aligned_cols=458 Identities=12% Similarity=0.096 Sum_probs=230.5 Q ss_pred ccchhhHhhhccccccccc-CcC---------CCCcccce-eccccc--ccccc-ccchhhhHHHHHHHHHHH-hhccch Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPV-PQS---------NEDSSDYY-VSSGFY--GQYVD-IDGVFKSEFELIKRYREM-ALHPEV 69 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~-p~~---------~~d~~~~~-~~~~~~--~~~~~-~~~~~~~~~~LI~~YR~m-a~~PEv 69 (563) |+-.+++-.-..+ .+++. ++. .=+++... -..++. ....+ ...-..+...|..+=|.| .++|-+ T Consensus 1 ~~r~~~~~~~~dr-~i~~~~~~~~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a 79 (505) T protein:vir:96 1 MKRAEKKPSLAQR-MVNWAWYRYVEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSINNPYA 79 (505) T ss_pred CCCCccccchhhc-ccchhhhhhHHHHHHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHH Confidence 2221111110000 11111 000 00111111 111221 11111 122234677889999999 699999 Q ss_pred hHHHHHhhcceEEecCCCcEEE-EEeeccccchHHHHHHHHHHHHHHHH--------hcchhhhHHHHHHHhhhhhhhee Q lcl|NC_015287. 70 DSAIEDIINEAIVSDQNDSPVE-IDLENLPASKQLKDLIREEFKTVKEI--------LNFDDKCHEILRNWYVDGRIYYH 140 (563) Q Consensus 70 d~Ai~eIvneaiv~d~~~~pV~-l~l~~~~~s~sik~~I~eeF~~i~~l--------l~F~~~~~~~~R~WYvDGri~fh 140 (563) ..||+.+++-+| -..+-.|.. +.-.+...++.+.++|..+|..+++- ++|..--...+|.|.+||..|-. T Consensus 80 ~~av~~~~~nvV-G~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~ 158 (505) T protein:vir:96 80 KRFYQLLKNNVI-GPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGEVLVR 158 (505) T ss_pred HHHHHHHHHHhc-CCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCceEEE Confidence 999999998654 222322222 22234457788999999999999863 23444456689999999999877 Q ss_pred eeecCCCCCcceeEEEEeCCccceeeeeeec--cCCccccceeeecccccccccccccceeeecCc-cccccccccccCc Q lcl|NC_015287. 141 KVIDIKAPEEGIKEVRYIDPLKIKLVRKLKT--DNSLRGAINQVNSKNPADIENPEIEEYYQYDPS-QTQGKNALGAIGQ 217 (563) Q Consensus 141 kviD~~~~k~GI~eLr~lDP~~i~~vR~~~~--~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~-~~~~~~~~g~~g~ 217 (563) .+..+. ..-|. .|..|||..|.--..-.. .....++|+... +..-.-||++... +....+. T Consensus 159 ~~~~~~-~~~~~-~lqliepd~l~~~~n~~~~~~~~i~~GIe~d~--------~Gr~~aY~i~~~hPgd~~~~~------ 222 (505) T protein:vir:96 159 EHRGYP-NKWGY-ALQILECDRLDLNYNADLQNGNRIRMSIELDA--------WERPVAYHLLVNHPGDNSYCY------ 222 (505) T ss_pred EeecCC-CCcce-EEEEechhhcCCCCCcccCCcCeEEeceEECC--------CCceEEEEEeecCCCcccccc------ Confidence 665432 22334 488899987654322211 112345665533 2334457777431 1111110 Q ss_pred CccccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHH Q lcl|NC_015287. 218 TPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAE 297 (563) Q Consensus 218 ~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAe 297 (563) ........+|+.+-|.+.+.-. ...-.-++|.|+.+++.+.+|.-.+||..+-...-|-.=-+..=|.+.+.... T Consensus 223 --~~~~~~~~rvpa~~vlH~f~~~-r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~-- 297 (505) T protein:vir:96 223 --HYAGQTYERVPADEIIHTFVPW-RPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYDQPP-- 297 (505) T ss_pred --ccccccccccCHhHhhhhhccc-CCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCCCcc-- Confidence 0111334678888886554322 12223347999999999999999999999988887765444433444332110 Q ss_pred HHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCC---------cchHHHH-HHHHHH Q lcl|NC_015287. 298 QYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQN---------LGELSDI-EYFQKK 367 (563) Q Consensus 298 qYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~n---------Lgei~DV-~YF~~k 367 (563) .| ..|+ ...+ -+-|| |.+|+.|+. -+..++. +-..+. T Consensus 298 ---------------~~-~~~~-----~~~~-----------l~pG~-i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~ 344 (505) T protein:vir:96 298 ---------------ED-DQGE-----IVEE-----------VEAGT-YQLLPYGIRFKEHKIDHPHTNFGAFVKSSLRG 344 (505) T ss_pred ---------------cc-ccCc-----cccc-----------cCCce-eeecCCCCeeeeeCCCCCCCCHHHHHHHHHHH Confidence 00 0110 0000 11233 555555553 3333333 333344 Q ss_pred HHHhcCCChhhccCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHhcCCCChhHHHHh-hhc Q lcl|NC_015287. 368 LYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDM-----LKTQLILKNIVTPEDWEVL-GDH 441 (563) Q Consensus 368 Ly~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~-----Lk~qLiLkgI~t~eew~~~-~~~ 441 (563) +=.+|+||-+-|..|=++.|. |.+.-.-+.|-+.+.++|..|..-|..+ |+ ..+|.|.++.-.+..- .-. T Consensus 345 iaaglgi~ye~lt~D~s~~nY---SS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~-~a~l~G~i~~p~~~~~~~~~ 420 (505) T protein:vir:96 345 VAAGMGPAYNRLAHDLEGVNF---SSLRSGELDERDLYKLLQFFVVTELLERVAGNLIS-MSLLTQALPLNMVDIDRLSQ 420 (505) T ss_pred HHhhcCCCHHHHhcccccccH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHcCCcCCCCccchhhce Confidence 556899999988654222222 2333344669999999999988755554 43 5678887764333311 012 Q ss_pred eEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHh----cCCCCCCCcc Q lcl|NC_015287. 442 IQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEI----KDGKLADPME 517 (563) Q Consensus 442 i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~----~~~~~~~p~~ 517 (563) ..|..-.--+.-.+||+.-...+++. -.-|.+-+..+- -.|.++..+||..|. +.|+-+.+.. T Consensus 421 ~~w~~p~~~~iDP~Ke~~a~~~~i~~---------G~~t~~~~~a~~----G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~ 487 (505) T protein:vir:96 421 YAFQPRGWDWVDPAKDSKAHSESIKN---------RTRSRSSIIRAA----GDDPEDVFDEIAWEEQLMRDKGVNPTPPE 487 (505) T ss_pred eeeccCCccccChHHHHHHHHHHHHc---------CCCCHHHHHHHc----CCCHHHHHHHHHHHHHHHHHcCCCCCCCC Confidence 33333333455667777666555543 234555444443 333444444444433 3444221100 Q ss_pred cCccccccccccCCCCcCCCCCcccccccCCCCcccCCCCCCCCC Q lcl|NC_015287. 518 VQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAPQMPEGGE 562 (563) Q Consensus 518 ~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~ 562 (563) ....++.+++.++. .|-| T Consensus 488 --------------~~~~~~~~~~~~~~-------------~~d~ 505 (505) T protein:vir:96 488 --------------QESKDATTDEEDDS-------------ASDD 505 (505) T ss_pred --------------CCCCCCCCCCCCCC-------------CCCC Confidence 00001111100000 0111 No 28 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=98.98 E-value=8e-09 Score=64.97 Aligned_cols=495 Identities=12% Similarity=0.055 Sum_probs=236.0 Q ss_pred CcccccchhhHhhhccccccccc-Cc---C----CCCc-ccce-eccccccccccccch-hhhHHHHHHHHHHH-hhccc Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPV-PQ---S----NEDS-SDYY-VSSGFYGQYVDIDGV-FKSEFELIKRYREM-ALHPE 68 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~-p~---~----~~d~-~~~~-~~~~~~~~~~~~~~~-~~~~~~LI~~YR~m-a~~PE 68 (563) |-+-.+- .....++- |. - .-++ +... ..+++.....+.... ..+...|..+=|.| .++|- T Consensus 1 m~~~~~r--------~~~~~a~~~~~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~ 72 (553) T protein:vir:63 1 MTKVTVR--------KLSEVTSGRPEQSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGF 72 (553) T ss_pred Ccchhhh--------hhcccccccchhhhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChH Confidence 1111110 00000111 00 0 0011 1100 112222223222222 34567788888888 88999 Q ss_pred hhHHHHHhhcceEEecCCCcEE-EEE---e--eccccchHHHHHHHHHHHHHHH----------HhcchhhhHHHHHHHh Q lcl|NC_015287. 69 VDSAIEDIINEAIVSDQNDSPV-EID---L--ENLPASKQLKDLIREEFKTVKE----------ILNFDDKCHEILRNWY 132 (563) Q Consensus 69 vd~Ai~eIvneaiv~d~~~~pV-~l~---l--~~~~~s~sik~~I~eeF~~i~~----------ll~F~~~~~~~~R~WY 132 (563) +..||+.+++-+|=. +-.|- ..+ | -+-+..+.+.++|..+|..+++ .++|..--.-.+|.|+ T Consensus 73 a~~av~~~~~nvVG~--Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~ 150 (553) T protein:vir:63 73 TNGAVGYQRDSIVGA--QYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYV 150 (553) T ss_pred HHHHHHHHHHhhccC--CceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHH Confidence 999999999866532 33321 111 1 1114678888899999999874 3345555567889999 Q ss_pred hhhhhheeeeecCCCCCcce--eEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeec--Cccccc Q lcl|NC_015287. 133 VDGRIYYHKVIDIKAPEEGI--KEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYD--PSQTQG 208 (563) Q Consensus 133 vDGri~fhkviD~~~~k~GI--~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~--~~~~~~ 208 (563) +||..|-..+.++. ..+- ..|..|||-.|.--+.........++|++.... .-.-|+++. |-... T Consensus 151 ~dGE~~~~~~~~~~--~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE~d~~G--------r~vaY~i~~~hPgd~~- 219 (553) T protein:vir:63 151 KTGEVLATAEWDRA--ANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQYDKRG--------RPQGYWIQVAHPGDLY- 219 (553) T ss_pred hCCceEEEeeeccC--CCCcccceEEEechhhcCCCCCCCCCCeeEeeeEECCCC--------ceEEEEeeccCCCccc- Confidence 99999987777643 1222 357888997765543333333344566653322 233466663 31111 Q ss_pred cccccccCcCccccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccC Q lcl|NC_015287. 209 KNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDV 288 (563) Q Consensus 209 ~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDv 288 (563) ...+........ ....+++.+.|.+.+...- ..-.-++|.|+.+++.+.+|.-.+||...-...-|-. +.+|-. T Consensus 220 -~~~~~~~~~~r~--~~~~~v~a~~vlH~f~~~r-~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~--a~fi~~ 293 (553) T protein:vir:63 220 -QMAPDMYKWKFV--QQSKPWGRRQVIHILEPRE-PDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASY--AAAIES 293 (553) T ss_pred -cccccccceeee--ccccccChhHheecccccC-CCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhh--eeeeec Confidence 100100000000 1235788888876655432 2233458999999999999999999999998888866 333333 Q ss_pred CCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhccccc----CCC------CccceeecCCCCCcch- Q lcl|NC_015287. 289 GNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRR----EGG------RGTEITTLPGGQNLGE- 357 (563) Q Consensus 289 Gnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRR----eGg------rgTEIsTLpGg~nLge- 357 (563) + +|...+.+.+..--.. +..+|- ....+.-+..+.-..+ ++| -|.+|+.+.....-+. T Consensus 294 ~-~~~~~~~~~~~~~~~~-------~~~~~~---~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~ 362 (553) T protein:vir:63 294 E-LPPEFIHSQMSGGSPN-------ADMVGI---FGKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGTPGGVG 362 (553) T ss_pred C-CChhhhhhhccccccc-------cccccc---ccccccccccccccccceeecCceeeecCCCCeeeecCCCCCCCCH Confidence 2 2443333322210000 000000 0000000001110000 111 1333444433332222 Q ss_pred HHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHhcCCCChh Q lcl|NC_015287. 358 LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLK----TQLILKNIVTPE 433 (563) Q Consensus 358 i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk----~qLiLkgI~t~e 433 (563) -+=++...+.+=.+|+||-.-|..|=.+.|. |.+.-.-+.|-+.+.++|..|..-|..++- ...+|.|-++-- T Consensus 363 ~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nY---SS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p 439 (553) T protein:vir:63 363 SEFEASLNRHLASAFGMSYEEFTRDFSKANY---SSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMP 439 (553) T ss_pred HHHHHHHHHHHHhhcCCCHHHHhhhcccccH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCC Confidence 2344555666667899999999765222333 233444567999999999988877776543 355788876422 Q ss_pred HHH-H--------hh--hceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHH Q lcl|NC_015287. 434 DWE-V--------LG--DHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQ 502 (563) Q Consensus 434 ew~-~--------~~--~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~q 502 (563) .+. . .. -...+..-.-.+.--+||+.-...+++. -+-|.+-+..+--..-++-+.|..+. T Consensus 440 ~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~---------G~~t~~~~~a~~G~D~~~v~~q~a~e 510 (553) T protein:vir:63 440 PGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVMRIDA---------GLSTYEREIARLGGDFRKSFAQRARE 510 (553) T ss_pred CcccchhhcchhhhhhhhceeeecCCccccChHHHHHHHHHHHHc---------CCCCHHHHHHHhCCCHHHHHHHHHHH Confidence 111 0 00 1133334444455667777666555543 23455555444443333333333333 Q ss_pred HHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccCCCCCCCCC Q lcl|NC_015287. 503 MEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAPQMPEGGE 562 (563) Q Consensus 503 ie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~ 562 (563) .+...+.|+..+......... +....++... .+ +++.-.+-|| T Consensus 511 ~~~~~~~Gl~~~~~~~~~~~~-------~~~~~~~~~~--~~--------~~~~~~~~~e 553 (553) T protein:vir:63 511 DALLKKYGLTFNLSAKRSLGD-------GRDAATGIAE--DP--------AAAQTSQQGE 553 (553) T ss_pred HHHHHHcCCCCCCCCccccCC-------CcccCCCCCC--CC--------CCCCcccccC Confidence 333333455321111000000 0000000000 00 1111111122 No 29 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=98.97 E-value=8.5e-09 Score=64.83 Aligned_cols=463 Identities=13% Similarity=0.086 Sum_probs=227.0 Q ss_pred hhcccccc-cccCcCCCCcccceec----c----ccccccccccchhhhHHHHHHHHHHH-hhccchhHHHHHhhcceEE Q lcl|NC_015287. 13 DEQNKNVV-SPVPQSNEDSSDYYVS----S----GFYGQYVDIDGVFKSEFELIKRYREM-ALHPEVDSAIEDIINEAIV 82 (563) Q Consensus 13 ~~~~~~~~-s~~p~~~~d~~~~~~~----~----~~~~~~~~~~~~~~~~~~LI~~YR~m-a~~PEvd~Ai~eIvneaiv 82 (563) +......+ |+.+.--...+...+. | ++-+...+ .....+...|..+=|.| .++|-+..||+-+++-+|= T Consensus 1 m~~~~~~~~a~~~~~~~~~~~~~y~aa~~~~~~~~~~~~s~d-~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~~vVG 79 (495) T protein:vir:10 1 MNMTPSGYQSLASGLLVPVGASAYEGASGGHRWQDIGDYGPD-TAVASGIQTLRARSHHNVRNNPWATNAVATWVAAAVG 79 (495) T ss_pred CCcccccccccchhhhhHHHhhhhhccccCcccCCCCCCChh-HHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcC Confidence 22222222 2222111111100110 1 11111112 12224567788888888 7899999999999986653 Q ss_pred ecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHH------hcchhhhHHHHHHHhhhhhhheeeeecCCCCCcce---e Q lcl|NC_015287. 83 SDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEI------LNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGI---K 153 (563) Q Consensus 83 ~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l------l~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI---~ 153 (563) . + +.+.. ..-.+.+.++|..+|..+++- ++|..--...+|.|.+||..|--+..++. ..|. . T Consensus 80 ~--G---i~p~~--~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~--~~g~~~~~ 150 (495) T protein:vir:10 80 N--G---LTPRW--RMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPL--SEGLSVPL 150 (495) T ss_pred C--C---ccccc--CCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeeccc--CCCCccce Confidence 2 2 12221 123578889999999999973 45555566789999999999976656543 3333 4 Q ss_pred EEEEeCCccceeeeeee-cc--CCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeec Q lcl|NC_015287. 154 EVRYIDPLKIKLVRKLK-TD--NSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIA 230 (563) Q Consensus 154 eLr~lDP~~i~~vR~~~-~~--~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~ 230 (563) .|..|||..|.--.... .. ....++|+.... ..-.-|+++....... . ........++|| T Consensus 151 ~lqliepd~l~~~~~~~~~~~g~~i~~GIe~d~~--------Gr~vaY~i~~~hpgd~-~--------~~~~~~~~~rvp 213 (495) T protein:vir:10 151 QLQIIEPDMLASDIPDETLPSGGYVKGGIRFSNG--------GKRKAYCFYRNHPAES-S--------LIGDPVDTVWIK 213 (495) T ss_pred EEEEechhhcCCCCCCCCCCCCCEEEeceEECCC--------CceEEEEEeecCCCcc-c--------ccccccceeeec Confidence 78899998764221111 11 112345555332 2233466663311110 0 011123457899 Q ss_pred ccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcce Q lcl|NC_015287. 231 PDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNK 310 (563) Q Consensus 231 ~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk 310 (563) .+-|.|.+- . ...-.-++|.|+..+ .+++|.-.+||.+.-...-|-. +.+|-..+ |..-+.+-+. T Consensus 214 A~~vlH~f~-~-r~gQ~RGis~la~i~-~l~~l~~y~dael~~a~i~A~~--~~fi~~~~-~~~~~~~~~~--------- 278 (495) T protein:vir:10 214 AEHVLHVTV-L-TVRSDAGAPWFQLLL-RLNELDQYEDAELVRKKTAALF--AAFIQEAT-ADSTGGPTIG--------- 278 (495) T ss_pred hhheEeccc-c-CCCcccCcchhHHHH-HHHHhhHHHHHHHHHHHHhhhh--eeeeecCC-CccccccccC--------- Confidence 888865532 2 222223368998655 5899999999999999888855 33332211 1111100000 Q ss_pred eEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCChhhccCCCcceecc Q lcl|NC_015287. 311 LVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSD-IEYFQKKLYRSLGVPESRIAGSGDGFNLG 389 (563) Q Consensus 311 lvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl~~~~~g~~~g 389 (563) ..++-.+.....+| +.==++. =.-|.+|+.+.....-+..++ ++...+.+=.+|+||-+-|..|-.+.|. T Consensus 279 -----~~~~~~~~~~~~~l-~pG~i~~--L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nY- 349 (495) T protein:vir:10 279 -----QPKRSKGGKRITGL-NPGTLQY--LQPGQEVKFSNPADVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNY- 349 (495) T ss_pred -----ccccccCcccceec-CCceeee--cCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccH- Confidence 00111111111111 0000000 013445555554443344444 5566666677899999999765333333 Q ss_pred ccchhhHHHHHHHHHHHHHHHH-HHHHHHH-----HHHHHHHhcCCCC-hhHHHHhhhceEEEe--eccchHHHHHHHHH Q lcl|NC_015287. 390 RSSEILRDEIKFTKFVGRMRKR-FSHLFND-----MLKTQLILKNIVT-PEDWEVLGDHIQYDF--VYDNHFAELKETEL 460 (563) Q Consensus 390 ~~~eItRDElKF~KfI~rLr~r-Fs~if~d-----~Lk~qLiLkgI~t-~eew~~~~~~i~~~f--~~Dn~f~ElKe~Ei 460 (563) +.+.-.-+.|-+.+.++|.+ |..-|.. .|+ ..+|.|.++ +.-|+.........| -.--+.-.+||+.- T Consensus 350 --SS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~-~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A 426 (495) T protein:vir:10 350 --SSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMD-FAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLA 426 (495) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHcCCCCCCCchhhhHhhhccccccCCccccChHHHHHH Confidence 33444456799999999875 5544444 344 456677664 444443323333344 44445567787766 Q ss_pred HHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCc Q lcl|NC_015287. 461 LNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDP 540 (563) Q Consensus 461 ~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~ 540 (563) ...+++. -.-|.+.+..+--..-++-.+|..+..+...+.|+..+. +|... T Consensus 427 ~~~~i~~---------G~~s~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~--------------------~p~~~ 477 (495) T protein:vir:10 427 DLGDVRA---------GFAPISDKQAERGYDMEELFDMISDANQLIDEYDLRLDS--------------------DPRYV 477 (495) T ss_pred HHHHHHc---------CCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCC--------------------CCCcC Confidence 6555542 445666555554333333333333333333334442211 11110 Q ss_pred ccccccCCCCcccCCCCCCCCC Q lcl|NC_015287. 541 NMGGAPQEGGAEQAPQMPEGGE 562 (563) Q Consensus 541 ~~~~~~~~~~~~~~~~~~~~~~ 562 (563) ....+.+ +..++...+.| T Consensus 478 ~~~~~~~----~~~~~~~~~~e 495 (495) T protein:vir:10 478 NGSGAEQ----KSVMEAALNNE 495 (495) T ss_pred CCccCCC----CCCCCCCCCCC Confidence 0000100 11111111222 No 30 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=98.88 E-value=2.2e-08 Score=62.60 Aligned_cols=436 Identities=11% Similarity=0.103 Sum_probs=184.2 Q ss_pred CcccccchhhHhhhccc---ccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhh Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNK---NVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDII 77 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~---~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIv 77 (563) |++- ...+...... +.-+.+. ..+.++ .-.+..++.|.. +..-+-.+|...|| .|+-+..||+.++ T Consensus 1 ~~~~---~~a~~~~~~~~a~~~~~~~~--~~g~~~-~~d~~~~~~~~~--~~~~~~~~l~~lY~---~~~l~r~iVd~~a 69 (461) T protein:vir:80 1 MYSI---DKAKQAKIDSKIVNRNDFMV--GHGKAN-SRDKLTRQTPGN--GQKLDLKACENLYA---SNSIAMNIVDIIS 69 (461) T ss_pred Cccc---hhhhhhhhhhhhhhhhHHHh--hcCCcc-hhhhhhccccCc--ccccCHHHHHHHHH---hCCccchhhccch Confidence 3321 0000000000 0001110 001011 111111121111 11123445556665 7999999999999 Q ss_pred cceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhh--hheeeeecCCC-------- Q lcl|NC_015287. 78 NEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGR--IYYHKVIDIKA-------- 147 (563) Q Consensus 78 neaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGr--i~fhkviD~~~-------- 147 (563) .+|+-. .+.|.-+ +++.+++|.++++. |++..+..+.+|.=-+.|. ++.+ |.|.+. T Consensus 70 ~d~~r~-----g~~i~~~----~~~~~~~~~~~~~~----l~~~~~l~~~~~~~rl~G~a~i~i~-v~d~~~~~~~~~~p 135 (461) T protein:vir:80 70 EDMVRA-----GWSLKTD----NKEMKKNIESKWRK----LKTKDRFQKLYADKRLYGDGFLSIG-VVSSNREQADLSTA 135 (461) T ss_pred HHhhcC-----CeeeecC----CHHHHHHHHHHHHH----hhHHHHHHHHHHhhcccccEEEEEE-eecCCccccCccCC Confidence 999642 3444333 35566666666554 4556677777766666665 4333 223211 Q ss_pred -CCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccc-cccccccCcCccccCcc Q lcl|NC_015287. 148 -PEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQG-KNALGAIGQTPFSTKQR 225 (563) Q Consensus 148 -~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~-~~~~g~~g~~~~~~~~~ 225 (563) .+.++.++.+|+|.....| + ... .. .++..-.|.. .++|.++...... ....+ ..+.. T Consensus 136 l~~~~~~~~~~l~~~~~~~i-------~---~~~-~~-~dp~sp~fg~-P~~y~i~~~~~~~~~~~~~-------~~~~~ 195 (461) T protein:vir:80 136 IDPKTIKSIPYINTFNTQKV-------T---QLY-LN-QDMFSEHFGE-VEFFEVNRVSQLGEEILSG-------TTAST 195 (461) T ss_pred cccccccceeEEEecccccc-------c---hhh-hc-ccCcCccccc-ceEEEEecccccccccccc-------ccCcc Confidence 0233344444443211111 0 000 00 1111111222 3445443322111 11111 11223 Q ss_pred eeeecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHH--HHHHHHHhcCccceEEEccCC-CCchHHHHHHHHH Q lcl|NC_015287. 226 PVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIED--SLVIYRLSRAPERRIFYIDVG-NLPKMKAEQYLKE 302 (563) Q Consensus 226 ~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~ED--alVIyRi~RAPeRRvFyIDvG-nlpk~KAeqYl~~ 302 (563) ..+|+++-|+....+-+. +..-+.|.|+++...+..+..... +.++++ +-.+ +|.+|.- .+......+. .. T Consensus 196 ~~~iH~SRii~~~~~~~~-~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~---~~~~-v~k~~~l~~~~~~~~~~~-~~ 269 (461) T protein:vir:80 196 SEQIHRSRIIHEQGLRFE-GETKGRSIFESLYDIITVMDTSLWSVGQILYD---FAFK-VYKTDDIDALNKDDKANL-TA 269 (461) T ss_pred ceEEccccEEEecCCCCC-ccccCcchHHHHHHHHHHHHHHHHHHHHHHHH---hCCC-ceecchHHhhhchHHHHH-HH Confidence 467888876555333333 223368899887766655543332 223333 4333 4555421 1111122222 22 Q ss_pred HHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCChhhccC Q lcl|NC_015287. 303 VMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSD-IEYFQKKLYRSLGVPESRIAG 381 (563) Q Consensus 303 im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl~~ 381 (563) .+..++ ..+|-+- + + .+-+++++- .+|+-++| +..|...+--+.+||+.+|-+ T Consensus 270 ~~~~~~------~~~g~~~-------------~----d-~~e~~e~~~--~~lsgl~~~l~~~~~~iaa~s~iP~t~L~G 323 (461) T protein:vir:80 270 MLDFMF------RTEALAI-------------I----K-GDEQLTKES--TNVSGMKDLLDYGWDYLAGAVRMPKTVLKG 323 (461) T ss_pred HHHHhc------CCceEEE-------------E----c-CCcceEEEe--cCcCCHHHHHHHHHHHHhhhhcCCeeeeec Confidence 232222 1222110 0 0 011122222 24555555 468888999999999999966 Q ss_pred CCcceeccccchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH-h--cC-CCChhHHHHhhhceEEEeeccchHHHHH Q lcl|NC_015287. 382 SGDGFNLGRSSEILRDEIKFTKFVGRMRKR-FSHLFNDMLKTQLI-L--KN-IVTPEDWEVLGDHIQYDFVYDNHFAELK 456 (563) Q Consensus 382 ~~~g~~~g~~~eItRDElKF~KfI~rLr~r-Fs~if~d~Lk~qLi-L--kg-I~t~eew~~~~~~i~~~f~~Dn~f~ElK 456 (563) ++.|.+ |+ =.-|.-.|..+|.++|.. +..+...+++.=+. . .+ ++.++ ...++|.|..=..-+|.. T Consensus 324 ~s~g~~---as-ge~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~-----~~~~~i~f~~L~~~s~ke 394 (461) T protein:vir:80 324 QEAGTL---TG-AQYDVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPD-----SFEWAIEFNPLWNLDSKT 394 (461) T ss_pred ccCCcc---cc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCcc-----ccceEEEeCCCCCCCHHH Confidence 544433 22 122333499999999964 55555555543211 1 11 22221 134678888777778888 Q ss_pred HHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhc-CCCCCCCcccCccccccccccCCCCcC Q lcl|NC_015287. 457 ETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIK-DGKLADPMEVQQLQLGVHPEQMPGGAM 535 (563) Q Consensus 457 e~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~-~~~~~~p~~~~~~~~~~~~~~~~~~~~ 535 (563) .+|++..+.++++.+-.- -.+|.+-++........ .. .+.++ +...+.+.- ...... T Consensus 395 kAe~~~~~a~a~~~~~~~--g~is~~e~r~~l~~~~~-------------~~~~~~~~--~~~~~~~~~-----~~~~~~ 452 (461) T protein:vir:80 395 DAEVRKLTAEADQIYIVN--GVLDPDEVKETRFGRFG-------------LENSSKFS--GDSAEIDKL-----AKLVYD 452 (461) T ss_pred HHHHHHHHHHHHHHHHhc--CCCCHHHHHHHHHHhcC-------------CCCCccCC--CCCchhhhh-----hhhccc Confidence 899999999888665332 13444444332211000 00 00010 000000000 000000 Q ss_pred CCCCcccccccCCCCc Q lcl|NC_015287. 536 NPQDPNMGGAPQEGGA 551 (563) Q Consensus 536 ~p~~~~~~~~~~~~~~ 551 (563) .|.+- .+.+ T Consensus 453 ~~~~e-------~~~g 461 (461) T protein:vir:80 453 AYAKK-------NADG 461 (461) T ss_pred ccccc-------CCCC Confidence 00000 0000 No 31 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=98.88 E-value=2.2e-08 Score=62.59 Aligned_cols=477 Identities=10% Similarity=0.044 Sum_probs=237.0 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCccc-ceeccccccccccccc-hhhhHHHHHHHHHHH-hhccchhHHHHHhh Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSD-YYVSSGFYGQYVDIDG-VFKSEFELIKRYREM-ALHPEVDSAIEDII 77 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~-~~~~~~~~~~~~~~~~-~~~~~~~LI~~YR~m-a~~PEvd~Ai~eIv 77 (563) |..++|-+-.. ....+.--..+++. .-..+++.....+... -..+...|..+=|.| .++|-+..||+.++ T Consensus 3 ~~~~~~~~~~~-------~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 75 (530) T protein:vir:38 3 IPSLVGPDGKT-------SLREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQ 75 (530) T ss_pred cceeecCcccc-------chHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 22222221000 00000000001111 0112223222223222 234677888889999 59999999999999 Q ss_pred cceEEecCCCcEEE------EEeeccccchHHHHHHHHHHHHHHHH----------hcchhhhHHHHHHHhhhhhhheee Q lcl|NC_015287. 78 NEAIVSDQNDSPVE------IDLENLPASKQLKDLIREEFKTVKEI----------LNFDDKCHEILRNWYVDGRIYYHK 141 (563) Q Consensus 78 neaiv~d~~~~pV~------l~l~~~~~s~sik~~I~eeF~~i~~l----------l~F~~~~~~~~R~WYvDGri~fhk 141 (563) +-+|=. +-.|-. |.++. +..+.+.+.|..+|..+++- ++|..--.-.+|.|.+||..|--+ T Consensus 76 ~nvVG~--Gi~~~~~p~~~~l~~~~-~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~ 152 (530) T protein:vir:38 76 DHIVGS--FFRLSYRPSWRYLGINE-EDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQA 152 (530) T ss_pred HHhhCC--CceeeeccchhhcCCCH-hHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEe Confidence 865532 333322 22232 45778888999999999853 445555566889999999999887 Q ss_pred eecCCCCCcce---eEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeec--CccccccccccccC Q lcl|NC_015287. 142 VIDIKAPEEGI---KEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYD--PSQTQGKNALGAIG 216 (563) Q Consensus 142 viD~~~~k~GI---~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~--~~~~~~~~~~g~~g 216 (563) ..++. .|. ..|..|+|-.|.--..........++|++... ..-.-||++. |.+....+ T Consensus 153 ~~~~~---~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~--------Gr~~aY~i~~~~~~~~~~~~------ 215 (530) T protein:vir:38 153 TWDSD---STRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKINDS--------GAALGYYVSDDGYPGWMAQN------ 215 (530) T ss_pred eeccC---CCCccceEEEEechhhcCCCCCCCCCCeeEeeeEECCC--------CceEEEEEeeccCCCccccc------ Confidence 77753 332 45888999776654433333334456665332 2234566663 32211110 Q ss_pred cCccccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHH Q lcl|NC_015287. 217 QTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKA 296 (563) Q Consensus 217 ~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KA 296 (563) ...-...+.++.+.|.+...-. ...-.-++|.|..+++.+++|.-.+||...-...-|-.=-++.=+.+......+ T Consensus 216 ---~~~~~~~~~v~a~~vlH~f~~~-r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~ 291 (530) T protein:vir:38 216 ---WTYIPRELPGGRPSFIHVFEPM-EDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDF 291 (530) T ss_pred ---cceeeeeeccChhHeEeecccc-CCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccccccc Confidence 0000124667777886664432 122334479999999999999999999999888777653333323322111000 Q ss_pred -------------HHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch-HHHHH Q lcl|NC_015287. 297 -------------EQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDIE 362 (563) Q Consensus 297 -------------eqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~ 362 (563) ..+... +..+.+..+..-.-|.|. .-.-|-+|+.+..+.--+. -+=++ T Consensus 292 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~l~pG~i~-----------------~L~pGe~i~~~~p~~p~~~~~~f~~ 353 (530) T protein:vir:38 292 ILGADNKEQQSKLTGWLGE-MAAYYSAAPVRLGGARVP-----------------HLLPGDSLNLQSAQDTDNGYSTFEQ 353 (530) T ss_pred cccCCcccccccccccchh-hhhcccccceeccCceee-----------------ecCCCCeeeeeCCCCCCCCHHHHHH Confidence 000000 111111111111222111 0012444555544332222 23445 Q ss_pred HHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHhcCCCCh----- Q lcl|NC_015287. 363 YFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLF-----NDMLKTQLILKNIVTP----- 432 (563) Q Consensus 363 YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if-----~d~Lk~qLiLkgI~t~----- 432 (563) ...+.+=.+|+||-+-|..|=.+.|. +.+.-.-+.|-+.+.++|..|..=| ..-|+ ..+|.|.++. T Consensus 354 ~~lr~iaaglGi~ye~lt~D~s~~nY---SS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~-~av~~G~i~~p~~~~ 429 (530) T protein:vir:38 354 SLLRYIAAGLGVSYEQLSRNYSQMSY---STARASANESWAYFMGRRKFVASRQACQMFLCWLE-EAIVRRVVTLPSKAR 429 (530) T ss_pred HHHHHHHhhcCCCHHHHhcccccccH---HHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHH-HHHHcCCccCCCCCC Confidence 55566667899999999655222222 2333344669999999999887533 44444 4578887763 Q ss_pred -hHHHHhhhceEEEe--eccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcC Q lcl|NC_015287. 433 -EDWEVLGDHIQYDF--VYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKD 509 (563) Q Consensus 433 -eew~~~~~~i~~~f--~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~ 509 (563) +.|..........| -.--+.-.+||+.-...+++. -+-|.+-+..+--..-++-++|..+..+...+. T Consensus 430 ~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~---------G~~s~~~~~a~~G~D~~~v~~q~a~e~~~~~~~ 500 (530) T protein:vir:38 430 FSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEA---------GLSTYEKECAKRGDDYQEIFAQQVRESMERRAA 500 (530) T ss_pred CCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHc---------CCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHc Confidence 23333223333333 344445667877766555553 345666555554333333333333333333333 Q ss_pred CCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccCCC Q lcl|NC_015287. 510 GKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAPQ 556 (563) Q Consensus 510 ~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~ 556 (563) |+-. |..... .++ .+...++..|.+. .+++ T Consensus 501 Gl~~-~~~~~~-~~~---~~~~~~~~~~~d~------------~~~a 530 (530) T protein:vir:38 501 GLNP-PAWAAA-AFE---AGVKKSNEEEQDG------------ARAA 530 (530) T ss_pred CCCC-CCCccc-ccC---CCCCCCCCCCCCC------------CCCC Confidence 4321 111000 000 0001111111111 1111 No 32 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=98.87 E-value=1.3e-08 Score=63.75 Aligned_cols=443 Identities=12% Similarity=0.101 Sum_probs=196.2 Q ss_pred ccccccccchhhhH-HHHHHHHHHHhhccchh-------------------------HHHHHhhcceEEecCCCcEEEEE Q lcl|NC_015287. 40 YGQYVDIDGVFKSE-FELIKRYREMALHPEVD-------------------------SAIEDIINEAIVSDQNDSPVEID 93 (563) Q Consensus 40 ~~~~~~~~~~~~~~-~~LI~~YR~ma~~PEvd-------------------------~Ai~eIvneaiv~d~~~~pV~l~ 93 (563) ..+.-+.-..+... ..-..+|+.+..|.+=+ -+|+..++=. . T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l------------~ 68 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL------------D 68 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhh------------c Confidence 11110100000000 00011222222222211 2222221100 0 Q ss_pred eeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeec--CCCCCcceeEEEEeCCccceeeeeeec Q lcl|NC_015287. 94 LENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVID--IKAPEEGIKEVRYIDPLKIKLVRKLKT 171 (563) Q Consensus 94 l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD--~~~~k~GI~eLr~lDP~~i~~vR~~~~ 171 (563) ++....++ -+...+.+..|++..+|+....++++.-.+.||-|.+.-.. .....+|..-++.++|+.+-.+.--.. T Consensus 69 ~~g~~~~~--d~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~i~~~~p~~~~~~~D~~~ 146 (480) T protein:vir:78 69 IEGFRISE--DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRN 146 (480) T ss_pred cCceecCC--CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeEEEEEcccceEEEEcCCC Confidence 11111111 12345667778888899999999999999999988764221 112357888999999999877642111 Q ss_pred cCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeeccc--c-eeeeecceecCCCCc Q lcl|NC_015287. 172 DNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPD--A-ISFCHSGLVDRNKQT 248 (563) Q Consensus 172 ~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~d--a-I~y~hSGl~d~~~~~ 248 (563) .......+.+.+.. ........+.+|.+.........+....... .....+.-+.. + |.|++---.+ +.. T Consensus 147 ~~~~~~~i~~~~~~----~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~g~vPvv~f~n~~~~~--~~~ 219 (480) T protein:vir:78 147 TRRVTRAVRLYTTR----DDVAVPDRATLYLPDETVPLRRNGGLNDQWV-VDGDVIKHGLGVVPVVPLTNDPRLG--NRY 219 (480) T ss_pred ccceEEEEEEEEee----cCCCceEEEEEEeCCeEEEEEecCCCccccc-cccccccCCCCCcceEEeecccccC--Ccc Confidence 11111122221111 1111222333443322222211111000000 00000000011 1 2233221111 112 Q ss_pred cchhHHHHHHH-HHH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccc Q lcl|NC_015287. 249 ILSYLHKSIKA-LNQ-LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKH 326 (563) Q Consensus 249 ~~syL~~Aik~-~Nq-L~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~ 326 (563) +.|=|.+.+++ ... =+++-+.+++-...-.|.|-|.=.+....+..+. +. .. T Consensus 220 G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~---------------------~~-----~~ 273 (480) T protein:vir:78 220 GRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGE---------------------NT-----TL 273 (480) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhhcCCccccccccc---------------------cc-----hh Confidence 23444443322 121 2355566777777777777664222222111100 00 00 Q ss_pred cchhh-hhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHHHH Q lcl|NC_015287. 327 MSMLE-DFWLPRREGGRGTEITTLPGGQNLGE-LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTKF 404 (563) Q Consensus 327 msMlE-DywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~Kf 404 (563) -.+.. ..|++ |..+++.++++.. +.- ++-++-....++..-++|..-|... +-|-..+..|---+.....- T Consensus 274 ~~~~~~~~~~~----~~~~~~~~~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~~g~~--~~n~~Sg~Alk~~~~~l~~k 346 (480) T protein:vir:78 274 DIYYGRILTLA----SEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSS--SENPASAEAIIATDSRIVKM 346 (480) T ss_pred hhhhhhhccCC----CCCceEEecCccC-HHHHHHHHHHHHHHHhcccCCChHHhccc--cCcchHHHHHHHHHHHHHHH Confidence 01111 12332 3346677777643 332 3446666677777788888777422 11111222344445557778 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHH Q lcl|NC_015287. 405 VGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYV 484 (563) Q Consensus 405 I~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i 484 (563) +.++|+.|..-+.+.++.-+.+.|.--..+|. .|.+.|..-..=+. .+.++.+.++-.-.+-.+|.+++ T Consensus 347 a~~~~~~f~~~l~~~~~l~~~~~g~~~~~~~~----~i~v~f~~~~~~s~-------~~~ad~~~kl~~~g~~~~s~et~ 415 (480) T protein:vir:78 347 AERKGRIFGGAWERAMRIAMQIMGREVTEEYT----RLETVWRDPSTPTV-------AAKADAVSKLYANGQGPIPKEQA 415 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCccccce----eeeEEecCCCCCCH-------HHHHHHHHHHHHhccccCCHHHH Confidence 89999999999999999888888865455554 36677753221122 24455555554443456788888 Q ss_pred HHHHhccChhhHHHHHHHHHHHhcCC--CCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccCCCCCC Q lcl|NC_015287. 485 RRKILKQKDEEVIEIDKQMEKEIKDG--KLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAPQMPE 559 (563) Q Consensus 485 ~k~Il~~tdeei~e~~~qie~E~~~~--~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 559 (563) ... |.+++.+++++++..+++.... ....+...+ +.+..+|+.+++.++++..++..--..-. T Consensus 416 ~~~-lg~~~d~~~~~~~~~~e~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 416 RID-LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQ-----------ADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) T ss_pred Hhc-CCCCHhHHHHHHHHHHHHHHHHHHHhhcccccc-----------CCCCCCCCCCCCCCccccccCCCCcccCC Confidence 776 6899988888776544443221 111111111 11111233343333332222221111111 No 33 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=98.85 E-value=1.6e-08 Score=63.36 Aligned_cols=411 Identities=15% Similarity=0.131 Sum_probs=185.8 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCccc-------ceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHH Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSD-------YYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAI 73 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~-------~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai 73 (563) |++=.-|-++-.-.+ ..+.+ -. |+-. +.-...++++++. ...+-.+|...|| .+.....|| T Consensus 1 ~~~~~~~~~~~~~~~--~~~~~---~r-d~l~~~~~glg~~r~~~~~~~g~~---~~~~~~~l~~~Yr---~~~ia~~iV 68 (449) T protein:vir:10 1 MTDKLTLAVNHALND--ARMAR---AR-MGLMVPTMGLDNKRHSAWCEYGFP---ELVTYENLYSLYR---RGGIAHGAV 68 (449) T ss_pred CchhhHHHHhhhcch--hHHHH---HH-HHHHHHHhcCCcccchhhhhcCCc---ccCCHHHHHHHHh---cCchhHHHH Confidence 544322211100000 00000 00 1100 0111234444332 3366778999998 577788899 Q ss_pred HHhhcceEEecCCCcEEE-EEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHH--HHhhhhhhheeeeecC----- Q lcl|NC_015287. 74 EDIINEAIVSDQNDSPVE-IDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILR--NWYVDGRIYYHKVIDI----- 145 (563) Q Consensus 74 ~eIvneaiv~d~~~~pV~-l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R--~WYvDGri~fhkviD~----- 145 (563) +-++++|. ......++ .+.++......++.++.+.|.. .+-.+..+..| +-|-.+-++.+. -|. T Consensus 69 d~~~d~~~--~~~~~i~~g~~~~~~~~~~~~e~~~~~l~~~-----~~~~~l~ea~~~~rl~Gga~i~i~v-~d~~~l~~ 140 (449) T protein:vir:10 69 EKLVGKCW--QTNPEIIEGDDADDSEDETSWEKKSKQVFTN-----RLWRSFAEADRRRLVGRYAGILLHI-RDEKDWNL 140 (449) T ss_pred Hhhhhhhh--hcCcccccCccccchhhhHHHHHHHHHHHHH-----HHHHHHHHHHHhhhccCcEEEEEEe-cCCCCCCc Confidence 99988662 22111111 1122222233344444433321 12234444444 334444466552 232 Q ss_pred -CCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCc Q lcl|NC_015287. 146 -KAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQ 224 (563) Q Consensus 146 -~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~ 224 (563) -++..||..|+.++...|..- ++.. ++..-.| .-.+||.+++.. .|.. . T Consensus 141 Pl~~~~~i~~i~v~~~~~i~~~-~~~~--------------dp~sp~y-g~P~~y~v~~~~------~g~~--------~ 190 (449) T protein:vir:10 141 PATKGRGLQKVSVSWAGSLKVA-EWDT--------------GINSKTY-GQPKLWKYTERL------PNGS--------S 190 (449) T ss_pred ccccCcceeeEEeeccccCChh-hhhc--------------CCCCCCC-CCceEEEEeeec------cCCC--------c Confidence 233445555555553322211 1101 0111111 134555554311 0100 1 Q ss_pred ceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHH------HHHHhcC----ccceEEEccCCCCchH Q lcl|NC_015287. 225 RPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLV------IYRLSRA----PERRIFYIDVGNLPKM 294 (563) Q Consensus 225 ~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalV------IyRi~RA----PeRRvFyIDvGnlpk~ 294 (563) +.++|+.+=+. .|....-.+.|+|+++ ||.|.-+|-+.. +-...|. -++ .+|+.+|... T Consensus 191 ~~~~iH~SRl~-----~~~~~~~~g~~~L~~~---yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~---~~~~~~l~~~ 259 (449) T protein:vir:10 191 RRVDIHPDRVF-----ILGDYSEDAIGFLEPA---YNAFVSLEKVEGGSGESFLKNAARQLNVNFEK---EIDFTNLASL 259 (449) T ss_pred cceeeccceeE-----eecCCCCCChhHHHHH---HHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhh---hhhhhhhhHH Confidence 12445544431 1111112245888886 455554444321 1111121 112 2455555443 Q ss_pred HHH------HHHHHHH---HhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHH-HHHH Q lcl|NC_015287. 295 KAE------QYLKEVM---NRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSD-IEYF 364 (563) Q Consensus 295 KAe------qYl~~im---~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF 364 (563) ++. +-+++.+ ++-.+.+..|.. +|| +..+| +++-++| +..| T Consensus 260 ~~~~~e~~~~~~~~~~~~~~~~~~~~~i~~~--------------~d~----------~~~~~-----~~sgl~d~l~~~ 310 (449) T protein:vir:10 260 YGVSIDELQDKFNEVAGEINRGNDVLMTTQG--------------ATV----------TPLVT-----SVADPTATYNVN 310 (449) T ss_pred hhCCchHHHHHHHHHHHHHhccchheeecCC--------------cce----------EEEec-----ccCChhHHHHHH Confidence 321 1112212 221111111100 111 22222 4555556 5668 Q ss_pred HHHHHHhcCCChhhccCCC-cceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceE Q lcl|NC_015287. 365 QKKLYRSLGVPESRIAGSG-DGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQ 443 (563) Q Consensus 365 ~~kLy~aL~VP~sRl~~~~-~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~ 443 (563) ...+=-+.+||+.||-+.+ +|+| |+ -|.-.|...|...|.++......++.. |+.-++..+. ..+. T Consensus 311 ~q~iaaa~~IP~t~L~Gqsp~gln---st---~D~~nyyd~i~~~Q~~l~p~le~l~~~-l~~s~~g~~~------~d~~ 377 (449) T protein:vir:10 311 LQTAAAGVDIPTRILIGNQQAERS---ST---EDQKYFNARCQSRRVDLSFEIEDFCDK-LIELKIIDAV------AKKA 377 (449) T ss_pred HHHHHHHhCCCeeeeeccCccccc---cc---hhHHHHHHHHHHHHHhhhHHHHHHHHH-HHHhhcCCCC------Ccee Confidence 8889999999999995432 3565 22 355559999999999988888887764 6777776653 4588 Q ss_pred EEeeccchHHHHHHHHHHHHHHHHHHHhhhhhch-hhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccc Q lcl|NC_015287. 444 YDFVYDNHFAELKETELLNERLGVVAAVDPYVGK-YFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQ 522 (563) Q Consensus 444 ~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGK-y~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~ 522 (563) |.|..=..=+|...+||.....++++.+-..++. .||. +|+. +.... +++-..|. T Consensus 378 i~f~pL~~~t~kEkAei~k~~A~a~~~~~~ag~~~~~~~------------~EiR---~~~~~---~~~~~~~~------ 433 (449) T protein:vir:10 378 VIWDDLNEQTGTEKLTNAKTMGEINQTMLGSGDNPAFSR------------EEIR---TAAGY---DNDDEEPL------ 433 (449) T ss_pred EEeCCCCCCCHHHHHHHHHHHHHHHHHHHHccccCCcCH------------HHHH---HHhcc---cCCCCCCC------ Confidence 9999988889999999999999998776665322 2332 3333 21111 11111010 Q ss_pred cccccccCCCCcCCCCCccccccc Q lcl|NC_015287. 523 LGVHPEQMPGGAMNPQDPNMGGAP 546 (563) Q Consensus 523 ~~~~~~~~~~~~~~p~~~~~~~~~ 546 (563) .....++++++.+.|+ T Consensus 434 --------~~e~~de~~~~~d~~a 449 (449) T protein:vir:10 434 --------GEEDGDEEDKATDSAA 449 (449) T ss_pred --------CCCCCccccccCCcCC Confidence 0000111222222222 No 34 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=98.85 E-value=2.7e-08 Score=62.06 Aligned_cols=450 Identities=12% Similarity=0.119 Sum_probs=211.6 Q ss_pred ccchhhHhhhcccccc--cccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhh--ccchh---------- Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVV--SPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMAL--HPEVD---------- 70 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~--s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~--~PEvd---------- 70 (563) ||. +.+-+...+..+ ..+.+..-.-.+ ...+.- .....+.|+.||+|.. +|.+. T Consensus 1 m~~-~~~ik~~~~~~~~~~~~~~~~~~i~d----------~~~i~~-~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~ 68 (505) T protein:vir:79 1 MAF-WDTLKNLFRKGSAAVGMTKSLGQIID----------DPRINL-PADEVERIARDKRYYMDDFKQVTHKNSYGDTQK 68 (505) T ss_pred Cch-HHHHHHHHHHhhhhhcchhhhhhhhc----------ccCCCC-CHHHHHHHHHHHHHhcCCCccccccccCCCccc Confidence 666 444443333210 001110000000 001110 1234556777777742 44442 Q ss_pred ----------HHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhhee Q lcl|NC_015287. 71 ----------SAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYH 140 (563) Q Consensus 71 ----------~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fh 140 (563) .++++.++ +|+. .|+.+.+++ +.-++.++.+++-.+|.....+.+..+.+-|..+|+ T Consensus 69 ~~~~slnl~~~i~~~~A~--ll~~---e~~~i~~~d--------~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k 135 (505) T protein:vir:79 69 HELQSVNVTKLASAKLAS--LIFN---EQCQVTVSD--------ETANDFLDDVFQQNDFYTTFEEKLEEWIALGSGCVR 135 (505) T ss_pred cceeecchHHHHHHHHHh--hhcC---CCceeecCC--------hHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCeEEE Confidence 22333333 2221 255666654 345667788888889999999999999999999999 Q ss_pred eeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeee----cccccccccccccceeeecC-ccccccc----- Q lcl|NC_015287. 141 KVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVN----SKNPADIENPEIEEYYQYDP-SQTQGKN----- 210 (563) Q Consensus 141 kviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~----~~~~~~~~~~~~~eyf~y~~-~~~~~~~----- 210 (563) ..+|.. =+.+.+++|.++-++..--. . ...+-.+. .......+|- .-||+..+. ++...+. T Consensus 136 ~~~D~~-----~~~i~~v~ad~~~P~~~d~~--~-~~~~a~~~~~~~~~~~~~~~yt-~lE~h~~~~~~~~I~n~ly~~~ 206 (505) T protein:vir:79 136 PYVDSG-----KIKLAWATADQVYPLQADTN--Q-VNELAIASRTTEVENHRTIYYT-LLEFHQWDHGDYVITNELYRSE 206 (505) T ss_pred EEEeCC-----ceEEEEEcCCeeEEEEEcCC--C-eEEEEEEEEEEEecCCcceEEE-EEEEEEecCceEEEEEEEEecC Confidence 999842 35688999998877632111 0 00111110 0011111111 112222111 1111111 Q ss_pred cccccCcCccccCcceeeecccceeeeecc-----ee--------cCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015287. 211 ALGAIGQTPFSTKQRPVKIAPDAISFCHSG-----LV--------DRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSR 277 (563) Q Consensus 211 ~~g~~g~~~~~~~~~~iki~~daI~y~hSG-----l~--------d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~R 277 (563) .....|..-.......-.--.+.++|.+.. .| +..+..++|-++.|.-.+..|-..=+ -+.|.+| T Consensus 207 ~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s--~~~~e~~ 284 (505) T protein:vir:79 207 AAETVGINVPLNSLEQYEGLEPQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHD--QFVDEVK 284 (505) T ss_pred CCCccCcccchhhcccccccCcceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHH--HHHHHHH Confidence 111111100000000000001222222211 11 22234567999999888877765444 3557778 Q ss_pred CccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch Q lcl|NC_015287. 278 APERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE 357 (563) Q Consensus 278 APeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge 357 (563) .=.+|||. +-.=|++...-....... ..-++| .+.+.+.++.-| +| +.-|+++.+.---.+ T Consensus 285 ~g~~~i~v-~~~~l~~~~~~~~~~~~~----~~~~fd------~~~~~y~~~~~~------~~--~~~i~~~~~~ir~e~ 345 (505) T protein:vir:79 285 KGQRRLIV-PAEWLKTGSSYGGQASET----HPPMFD------PDETVYQAMYGD------AS--EVGFHDATSPIRVAD 345 (505) T ss_pred hcccceee-chHHhcccCCCCcccccc----cccCCC------ccceeeeeccCC------CC--CCceEEecccCCHHH Confidence 77777765 211110000000000000 000001 022223332211 12 223777776422222 Q ss_pred -HHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC---C-- Q lcl|NC_015287. 358 -LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIV---T-- 431 (563) Q Consensus 358 -i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~---t-- 431 (563) .+-+..+.+.+....+++-+-|..+++| ..-|+||....-.-..-+.+.|+.|...+.++++.=|-+..+. + T Consensus 346 ~~~~l~~~l~~i~~~~g~s~~~~~~~~~~--~~TAtei~s~~~~l~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g 423 (505) T protein:vir:79 346 YQATMDFFLREFENQTGLSQGTFTTSPSG--IQTATEVVTNNSQTYQTRSSYITQVEKTIKALTYAILELASVPSFYADG 423 (505) T ss_pred HHHHHHHHHHHHHHHhCCChhhcCCCccc--cchHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 3457777888888999999988654332 3346676655555566677777777777777777655433221 0 Q ss_pred --hhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcC Q lcl|NC_015287. 432 --PEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKD 509 (563) Q Consensus 432 --~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~ 509 (563) ...++.-...+.++|...- +.. +++++- ..+.+++ +-.+|..+++++....+|+|.+++-++|++|... T Consensus 424 ~~~~~~~~~~~~i~v~f~d~i-~~d-~~~~~~-~~~~~v~------~Gi~s~e~~l~~~~~~~eeea~~el~ri~~E~~~ 494 (505) T protein:vir:79 424 QARWTGDVDSLDITINFNDGV-FVD-QESKRA-ADLQAVQ------AQVMPKKQFLMRNYGLDEEEADEWLAQIDAENST 494 (505) T ss_pred cccccCCCCceeEEEEeCCCC-CCC-HHHHHH-HHHHHHH------cCCCCHHHHHHhcCCCChHHHHHHHHHHHHhccc Confidence 0011111134667776432 222 222221 1222221 1258888888888889999999999999888432 Q ss_pred CCCCCCcccCcccccccccc Q lcl|NC_015287. 510 GKLADPMEVQQLQLGVHPEQ 529 (563) Q Consensus 510 ~~~~~p~~~~~~~~~~~~~~ 529 (563) ..|++ ..+.|. T Consensus 495 -~~p~~--------~~~gg~ 505 (505) T protein:vir:79 495 -AEPEF--------NQFGGD 505 (505) T ss_pred -cCCCc--------hhccCC Confidence 11222 222222 No 35 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=98.84 E-value=3e-08 Score=61.80 Aligned_cols=481 Identities=9% Similarity=0.048 Sum_probs=231.3 Q ss_pred hhHhhhcccccccccCc-----CCCCcc-cc-eeccccccccccccc-hhhhHHHHHHHHHHH-hhccchhHHHHHhhcc Q lcl|NC_015287. 9 IQKDDEQNKNVVSPVPQ-----SNEDSS-DY-YVSSGFYGQYVDIDG-VFKSEFELIKRYREM-ALHPEVDSAIEDIINE 79 (563) Q Consensus 9 ~~~~~~~~~~~~s~~p~-----~~~d~~-~~-~~~~~~~~~~~~~~~-~~~~~~~LI~~YR~m-a~~PEvd~Ai~eIvne 79 (563) ++.---......+.+.+ .-..++ .. -...++.....+... ...+...|..+=|.| .++|-+..||+-+++- T Consensus 1 ~~~p~~~~~~~~~~~~~~~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~n 80 (533) T protein:vir:34 1 MKTPTIPTLLGPDGMTSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQDH 80 (533) T ss_pred CCCchhhhhhcccccchHHHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHH Confidence 00000000000000000 000110 00 011122222222222 234667788888899 5999999999999986 Q ss_pred eEEecCCCcEE------EEEeeccccchHHHHHHHHHHHHHHHH----------hcchhhhHHHHHHHhhhhhhheeeee Q lcl|NC_015287. 80 AIVSDQNDSPV------EIDLENLPASKQLKDLIREEFKTVKEI----------LNFDDKCHEILRNWYVDGRIYYHKVI 143 (563) Q Consensus 80 aiv~d~~~~pV------~l~l~~~~~s~sik~~I~eeF~~i~~l----------l~F~~~~~~~~R~WYvDGri~fhkvi 143 (563) +|=. +-.|- .|.++. +..+.+.+.|..+|..+++- ++|-.--.-.+|.|.+||..|-.+.. T Consensus 81 vVG~--Gi~~~~~p~~~~lg~~~-~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~ 157 (533) T protein:vir:34 81 IVGS--FFRLSHRPSWRYLGIGE-EEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATW 157 (533) T ss_pred hhCC--CceeeeccchhhcCCCh-hHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEeee Confidence 5432 33222 233443 45678888999999999854 33444445789999999999998877 Q ss_pred cCCCCCcce--eEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeec--CccccccccccccCcCc Q lcl|NC_015287. 144 DIKAPEEGI--KEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYD--PSQTQGKNALGAIGQTP 219 (563) Q Consensus 144 D~~~~k~GI--~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~--~~~~~~~~~~g~~g~~~ 219 (563) ++. +.+- ..|..|||-.|.--+.........++|+.... ..-.-|+++. +.+..... T Consensus 158 ~~~--~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~--------Gr~~aY~i~~~~~~~~~~~~--------- 218 (533) T protein:vir:34 158 DTS--SSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQINDS--------GAALGYYVSEDGYPGWMPQK--------- 218 (533) T ss_pred ccC--CCCccceEEEEechhhcCCCCCCCCCCceEeeeEECCC--------CCeEEEEEeecCCCCccccc--------- Confidence 753 2221 35888999776543322222223345554222 2334577763 32211110 Q ss_pred cccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHH Q lcl|NC_015287. 220 FSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQY 299 (563) Q Consensus 220 ~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqY 299 (563) ...-...+.++.+.|.+.+... ...-.-++|.|..++..+.+|.-.+||...-...-|-.=-++.=+.+......+ T Consensus 219 ~~~~~~~~~v~a~~VlH~f~~~-r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~--- 294 (533) T protein:vir:34 219 WTWIPRELPGGRASFIHVFEPV-EDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDF--- 294 (533) T ss_pred cceeeeeeccChhHeeeecccc-CCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCccccccc--- Confidence 0111224567788887665543 223344589999999999999999999999988888764444333332111000 Q ss_pred HHHHHHhhcceeEeeCCCCcc-cccccccchhhhhcccc---cCCC------CccceeecCCCCCcch-HHHHHHHHHHH Q lcl|NC_015287. 300 LKEVMNRYRNKLVYDASTGEI-RDDRKHMSMLEDFWLPR---REGG------RGTEITTLPGGQNLGE-LSDIEYFQKKL 368 (563) Q Consensus 300 l~~im~k~knklvYd~~TGev-~d~~~~msMlEDywLpR---ReGg------rgTEIsTLpGg~nLge-i~DV~YF~~kL 368 (563) + .....+.- ..-....+...++.-.+ =++| -|.+|+.+..+..-+. -+-++...+.+ T Consensus 295 ~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~i 362 (533) T protein:vir:34 295 I------------LGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYI 362 (533) T ss_pred c------------cCCCcccccccccccchhhhhccCcceeeccCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHH Confidence 0 00000000 00000111111111000 0111 1233333333222222 22444455566 Q ss_pred HHhcCCChhhccCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHhcCCCCh------hHHHHh Q lcl|NC_015287. 369 YRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKT----QLILKNIVTP------EDWEVL 438 (563) Q Consensus 369 y~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~----qLiLkgI~t~------eew~~~ 438 (563) =.+|+||-+-|..|=.+.|. |.+.-.-+.|.+++.++|..|..=|..++-. ..+|.|.++. +-|..- T Consensus 363 AaglGi~ye~lt~D~s~~nY---SS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~ 439 (533) T protein:vir:34 363 AAGLGVSYEQLSRNYAQMSY---STARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEAR 439 (533) T ss_pred HhhcCCCHHHHhhhcccccH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhH Confidence 67899999999655212222 3344445679999999999888655443332 4567887752 223322 Q ss_pred hh--ceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCc Q lcl|NC_015287. 439 GD--HIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPM 516 (563) Q Consensus 439 ~~--~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~ 516 (563) .. ...|..-.--+.--+||+.-...+++. -.-|.+-+..+--..-++-+.|..+..+...+.|+-. |. T Consensus 440 ~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~---------G~~s~~~~~a~~G~D~~ev~~q~a~e~~~~~~~gl~~-~~ 509 (533) T protein:vir:34 440 SAWGNCDWIGSGRMAIDGLKEVQEAVMLIEA---------GLSTYEKECAKRGDDYQEIFAQQVRETMERRAAGLKP-PA 509 (533) T ss_pred HhhhceeeccCCccccChHHHHHHHHHHHHc---------CCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHhcCCCC-CC Confidence 22 234434444555667877666555543 3455555555443333333333333322223333321 11 Q ss_pred ccCccccccccccCCCCcCCCCCcccccccCCCCcccCCC Q lcl|NC_015287. 517 EVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAPQ 556 (563) Q Consensus 517 ~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~ 556 (563) .. ... .. +.. .+...++.++.+.+ T Consensus 510 ~~---~~~-----~~----s~~----~~~~~~~~~~~~~~ 533 (533) T protein:vir:34 510 WA---AAA-----FE----SGL----RQSTEEEKSDSRAA 533 (533) T ss_pred CC---CcC-----cc----CCC----CCCCCCCcccCCCC Confidence 00 000 00 000 00001111111111 No 36 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=98.83 E-value=3.4e-08 Score=61.50 Aligned_cols=441 Identities=14% Similarity=0.099 Sum_probs=184.1 Q ss_pred CcccccchhhHhhhccccccccc----CcCCCCcccceeccccccccccccchhhhHHHHHHHHHHH-hhccchhHHHHH Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPV----PQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREM-ALHPEVDSAIED 75 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~----p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m-a~~PEvd~Ai~e 75 (563) .-...=|-..+..+..+ .++|. ||-.-+.-+.-..+..+-... .. ...+-+++-++.+ ++++-|..||+. T Consensus 59 ~~~~~~~~~~~~~kk~~-i~~pfkkk~~~~~~d~f~~s~es~s~vtsl-s~---pdaf~~vnVs~~~AlknsaV~scI~~ 133 (945) T protein:vir:10 59 VYSIIIFRKNQVLKKEK-IIVPYNHQEPPFKFNLFEYSPESLMYLPSI-SD---PDAFFLINLFRKYRFNNDSKLIKVSE 133 (945) T ss_pred eeeeeeehhhhHHHhhc-ccccccccccchhhhhhhccCccceecccc-cC---ccceeeehhhhhhhhccHHHHHHHHH Confidence 00011111111111111 01111 110000000000000000000 00 0011133344444 578899999999 Q ss_pred hhcceEEecCCCcEEEE--EeeccccchHHHHHHHHHHHHHHHHh---cchhhhH--------HHHHHHhhhhhhheeee Q lcl|NC_015287. 76 IINEAIVSDQNDSPVEI--DLENLPASKQLKDLIREEFKTVKEIL---NFDDKCH--------EILRNWYVDGRIYYHKV 142 (563) Q Consensus 76 Ivneaiv~d~~~~pV~l--~l~~~~~s~sik~~I~eeF~~i~~ll---~F~~~~~--------~~~R~WYvDGri~fhkv 142 (563) |.+.+.- -|+.+ ..++.......+.. .....+.++| |=...++ .+++.+++.|.-|..++ T Consensus 134 IA~sIAs-----LPlklYrr~edG~~~~~~kk~--~~~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIi 206 (945) T protein:vir:10 134 IPKKLTS-----KELEIYKHIEDKHVNYYLKRI--RDARNILEFLERPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKI 206 (945) T ss_pred HHhhhcc-----CceEEEEecccCccccccccc--ccchHHHHHHhCCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEE Confidence 9986542 24433 12222222222211 1222333444 2222233 35677888999999987 Q ss_pred ecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCcccc Q lcl|NC_015287. 143 IDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFST 222 (563) Q Consensus 143 iD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~ 222 (563) .|.. ..+.+|++|||..|+.++. ++. +. ..+|.+.. + T Consensus 207 Rd~~---G~ii~L~pLdPs~Vti~~d---dDG---~~----------------~y~Yv~~i------------------d 243 (945) T protein:vir:10 207 RDEQ---GNLVAITPVDGTTIKPILS---EDT---GI----------------VVGYVQEV------------------D 243 (945) T ss_pred ECCC---CcEEEEEEECCcceEEEEc---CCC---cE----------------EEEEEEec------------------C Confidence 7632 3488999999988766522 111 11 11222211 0 Q ss_pred Ccceeeecccc-eeeeecceecC-CCCccchhHHHHHHHHHHHHHHHHH-HHHHHHhcCccceEEEccCCCC-------- Q lcl|NC_015287. 223 KQRPVKIAPDA-ISFCHSGLVDR-NKQTILSYLHKSIKALNQLRMIEDS-LVIYRLSRAPERRIFYIDVGNL-------- 291 (563) Q Consensus 223 ~~~~iki~~da-I~y~hSGl~d~-~~~~~~syL~~Aik~~NqL~m~EDa-lVIyRi~RAPeRRvFyIDvGnl-------- 291 (563) +.....++.+. |.+.+.-..|. ..+.++|-|..|++.+.....++.. .-.|+--.|.-+-+..++.++. T Consensus 244 G~~~~~v~a~DvIlhirn~s~DG~~~GyGlSPIeaa~~aI~~alAaek~aar~FskNGa~PsGILsvkg~~~~d~k~~~~ 323 (945) T protein:vir:10 244 GAIVAHFDKRDVVLFRQNLTPDVYMYGYSLPPIEILYKVILSDIFIDKGNLDYYRKGGSIPEGILAIEPPSYKEGDIYPQ 323 (945) T ss_pred CceEEEecCCceEEEeccCCCCcccccCCchHHHHHHHHHHHHHHHHHHHHHHHHhCCCccceEEEecCccccccccccc Confidence 11123444444 54443322221 1334688899999988776666654 4445545677788998886643 Q ss_pred -chHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCC-CcchHHHHHHHHHHHH Q lcl|NC_015287. 292 -PKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQ-NLGELSDIEYFQKKLY 369 (563) Q Consensus 292 -pk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~-nLgei~DV~YF~~kLy 369 (563) .+..+++ +++.+.+... |. +....+ ++ ..|.+++.|.... .+.-++-.+|..+.+. T Consensus 324 LseEq~er-lKe~wee~~s--------G~--NnG~pi-VL----------deGmef~pLs~s~~DaQfLEsrkfs~eeIA 381 (945) T protein:vir:10 324 LSREQLES-IQRQLQAIMM--------GD--YTQVPI-LS----------GGKFTWIDFKGKRRDMQFKELAEFVARKIC 381 (945) T ss_pred cCHHHHHH-HHHHHHHHhC--------Cc--ccccce-ec----------CCCceEEEccCChhHHHHHHHHHHHHHHHH Confidence 3322222 3333332211 20 111111 12 2367777775432 2222344567778899 Q ss_pred HhcCCChhhccCCCcceeccccchhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeec Q lcl|NC_015287. 370 RSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTKF-VGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVY 448 (563) Q Consensus 370 ~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~Kf-I~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~ 448 (563) ++++||...|...+ +- ..+.+.....-|..+ +..+..++...+...|... . ....+.++|.. T Consensus 382 rAFGVPP~lLG~~e-~s---t~SNiEqq~~~Fv~~tL~Pil~~IEqeLNrkLl~~---------~----eg~~i~fdFd~ 444 (945) T protein:vir:10 382 AVYQVSPQDVGILE-GS---NKATAEVMASLTKAKGLEPLMATISKGFDEVVSEF---------R----NEKDIKLWFKE 444 (945) T ss_pred HHhCCCHHHcccCC-CC---CcchHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------c----cCceeEEEecc Confidence 99999999995322 11 112233333446544 7777777776666544211 1 13567888877 Q ss_pred cchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCc------cc Q lcl|NC_015287. 449 DNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQ------LQ 522 (563) Q Consensus 449 Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~------~~ 522 (563) +..-.. ..|+++++.+-.- -++|.+-+|+. +.+. +++..+.--. +. T Consensus 445 ldl~D~-------ksraEal~kli~s--GiLTiNEvRe~-lGLp------------------PIeGGD~lli~~nn~~P~ 496 (945) T protein:vir:10 445 DDLEKE-------RDWWNIIQGQLNT--GFRSINEARME-KGLE------------------PVPWGDVPFSGLRNWKPE 496 (945) T ss_pred hhccCH-------HHHHHHHHHHHhC--CCcCHHHHHHH-hCCC------------------CCCCcceeeecccccccc Confidence 664322 3455555443221 24444444432 2222 2211100000 00 Q ss_pred cccccccCCCCcCCCCCcccccccCCCCcc--cCCCCCCCCCC Q lcl|NC_015287. 523 LGVHPEQMPGGAMNPQDPNMGGAPQEGGAE--QAPQMPEGGEI 563 (563) Q Consensus 523 ~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~--~~~~~~~~~~~ 563 (563) .....+..++....+..+...++.+.+++. +.+...+.... T Consensus 497 d~~~ka~~ga~p~q~aq~~~dqp~~kGGe~dEns~~psE~kda 539 (945) T protein:vir:10 497 DEQAKAQQGAMPPQLAQAMADQPSQQGGGVDENSSVPSEQKNA 539 (945) T ss_pred ccccccccCCCCcccccCCCCCCCCCCCCCCCCCCCCCcccch Confidence 000000000000011111111111111111 11111111111 No 37 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=98.74 E-value=7.2e-08 Score=59.74 Aligned_cols=466 Identities=11% Similarity=0.041 Sum_probs=213.2 Q ss_pred hhcccccccccCcCCCCcccceeccccccccccccchhhhH-HHHHHHHHHHhhccchhH-------------------- Q lcl|NC_015287. 13 DEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSE-FELIKRYREMALHPEVDS-------------------- 71 (563) Q Consensus 13 ~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~-~~LI~~YR~ma~~PEvd~-------------------- 71 (563) +.......|..|. +-++- ..--.+.-..+..+ ..-..+|+.+..|++-+. T Consensus 1 ~~~~~~~~~~~~~-~~~~l--------~~~e~~~i~~L~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~ 71 (504) T protein:vir:99 1 MTEETTSASKFTF-RIPEL--------NDDVVDKVNGLYQQLVDRTPRNLLRASFYDGKYAIRQIGNLIPPEYLRTATVL 71 (504) T ss_pred CCccCCccccccc-ccCCC--------CHHHHHHHHHHHHHHHHHhHHHHHHHHHHhccccchhccccccHHHHHHhhcc Confidence 2222212122111 00000 00000000000000 001122233333332222 Q ss_pred -----HHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCC Q lcl|NC_015287. 72 -----AIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIK 146 (563) Q Consensus 72 -----Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~ 146 (563) +|+..++= |.++....++.- .-.+....|....+|+....++++.=+++||-|. .|. +. T Consensus 72 n~~~~iVd~~a~r------------l~~~Gf~~~d~~--~~~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~-~v~-~~ 135 (504) T protein:vir:99 72 GWSAKAVDTLARR------------CNLESFVWPDGD--YGSIGGPDVWDENFFATKANNAMVSSLIHGPAFL-INT-EG 135 (504) T ss_pred CcHHHHHHHHHhh------------hccceeeCCCCC--hhhHHHHHHHHhcChhhHHHHHHHHHHhhCceeE-EEe-cC Confidence 22222220 001111111110 1134577788889999999999999999999775 333 33 Q ss_pred CCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCcccccccc--ccccCcCccccCc Q lcl|NC_015287. 147 APEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNA--LGAIGQTPFSTKQ 224 (563) Q Consensus 147 ~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~--~g~~g~~~~~~~~ 224 (563) ...++..-++.++|+.+.-|.- .........+.+... .. .......-+|.+........ .+..- ....++. T Consensus 136 ~d~~~~~~I~~~sP~~~~~iyD-~~~~~~~~a~~~~~~--d~---~g~~~~~~~y~~~~~~~~~~~~~~~~~-~~~~~~~ 208 (504) T protein:vir:99 136 GAGEPDSLIHVKSAMQATGEWN-SRRNAMDSLLSITSR--DA---EGHPTGIALYEDGVTVTADMDDDGDWH-ADVRTHK 208 (504) T ss_pred CCCCceeEEEEeccceeEEEEe-CCCCceeEEEEEEEe--cC---CCeEEEEEEEcCCcEEEEEEcCCceee-eccccCC Confidence 3334455678889998766631 111111111111100 00 01111222343322221110 00000 0000111 Q ss_pred ceeeecccceeeeecceecCCCCcc-c-hhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHH Q lcl|NC_015287. 225 RPVKIAPDAISFCHSGLVDRNKQTI-L-SYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKE 302 (563) Q Consensus 225 ~~iki~~daI~y~hSGl~d~~~~~~-~-syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~ 302 (563) .-+| .|.|+|..-.+...|.. + ..|-..+..+| +.|-+.++.=...=.|.|-|+=.+-..++...++. .. T Consensus 209 --~gvP--vV~~~n~~~~~~~~G~sei~~~v~~l~Da~~--~~~~~~~~~~e~~a~p~r~i~G~~~~~~~~~d~~~--~~ 280 (504) T protein:vir:99 209 --LGVP--VEVLPYKPREDRPLGSSRITRPVMSLQQRAL--KGCIRMDGHADVYSFPQLILLGADAKNFRNKDGSM--KP 280 (504) T ss_pred --CCcc--eEEecccccCccccCcccchhhHHHHHHHHH--HHHHHHHHHHHHhcchhhhhccCCccccccccccc--cc Confidence 1123 56677664433332211 1 12222233332 45566677777777777766533322211111100 00 Q ss_pred HHHhhcceeEeeCCCCcccccccccchhhhhcccccC-----CCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCh Q lcl|NC_015287. 303 VMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRRE-----GGRGTEITTLPGGQNLGE-LSDIEYFQKKLYRSLGVPE 376 (563) Q Consensus 303 im~k~knklvYd~~TGev~d~~~~msMlEDywLpRRe-----GgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~ 376 (563) ...-+.+|+ +.||.-+ ++-.+++..++++. |.- ++-++-.-..+...-++|. T Consensus 281 ~~~~~~~~i---------------------~~~~~~~~~~~~~~~~~~~~q~~~~~-l~~~~~~l~~~i~~~a~~t~~P~ 338 (504) T protein:vir:99 281 AWQIALARV---------------------FALPDDEDEPDAARARADVKQFPASS-PQPHIEMLEQIAMMFSGETSIPV 338 (504) T ss_pred hhhhhhhhh---------------------hcCCCccccccccCccceeeecCCCC-hHHHHHHHHHHHHHHHhhhCCCH Confidence 000000111 2233221 22346788888764 433 2335555556666689998 Q ss_pred hhccCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC--hhHHHHhhhceEEEeeccchHHH Q lcl|NC_015287. 377 SRIAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVT--PEDWEVLGDHIQYDFVYDNHFAE 454 (563) Q Consensus 377 sRl~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t--~eew~~~~~~i~~~f~~Dn~f~E 454 (563) .-|.-.+. -|-..+..|.-.+....+-+.+.|++|..-+.+++|.-|.+.+... ..+| ..+.+.|..-. ..- T Consensus 339 ~~lG~~~~-~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~~~~----~~~~v~w~d~~-~~s 412 (504) T protein:vir:99 339 ESLGFSNR-ANPTSADAYIASREDLIAEAEGATDDWSPAFRRSMIRALAIKNGLDRIPPEW----KTIDSKFRSPL-YLS 412 (504) T ss_pred HHhccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccc----ccceeEecCCC-ccC Confidence 88831100 0111233566677778889999999999999999999887766543 2333 34677775322 221 Q ss_pred HHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCc Q lcl|NC_015287. 455 LKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGA 534 (563) Q Consensus 455 lKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~ 534 (563) +.++.+++..+..-+..+++...+....|.++++||+++.+..+++...+.+....+.+. ..+++. T Consensus 413 ------~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~~~~ei~r~~~e~~~~~~~~~~~~l~~~~~--------~~~~~~ 478 (504) T protein:vir:99 413 ------KAAQADAGAKMLGAGPEWLKETEVGLELLGLTPQQAKRALAERRRASSVSIIEALNRRQQ--------EAATAG 478 (504) T ss_pred ------HHHHHHHHHHHHhhccccccchHHHHhhcCCCHHHHHHHHHHHHHHhhHHHHHHHhcccC--------CCCCCC Confidence 246677777776655566766666677779999999987777766654433322111111 111222 Q ss_pred CCCCCcccccccCCCCcccCCCCCCC Q lcl|NC_015287. 535 MNPQDPNMGGAPQEGGAEQAPQMPEG 560 (563) Q Consensus 535 ~~p~~~~~~~~~~~~~~~~~~~~~~~ 560 (563) .++..|...+|...|++..++|--.| T Consensus 479 ~~~~~~~~e~a~~~~~~~~~~p~~~~ 504 (504) T protein:vir:99 479 EDQDQGAGEPPANEPPAALGRPTLVG 504 (504) T ss_pred CCCCcCCCCCCCCCCCccCCCcccCC Confidence 23333444555555566666666667 No 38 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=98.67 E-value=1.2e-07 Score=58.46 Aligned_cols=472 Identities=14% Similarity=0.138 Sum_probs=196.5 Q ss_pred Ccccccc-hhhHhhhcccccccccCcCCCCcc-ccee---------------------cccccc----c--cccccchhh Q lcl|NC_015287. 1 MAKLFGF-KIQKDDEQNKNVVSPVPQSNEDSS-DYYV---------------------SSGFYG----Q--YVDIDGVFK 51 (563) Q Consensus 1 m~~~~~f-~~~~~~~~~~~~~s~~p~~~~d~~-~~~~---------------------~~~~~~----~--~~~~~~~~~ 51 (563) ||.||+- -.-+ +-..++.|.-||-+++..+ ...+ ..-|.. . +..-.+-+. T Consensus 1 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~ 79 (563) T protein:vir:95 1 MADLFKQFRLGK-DYGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMK 79 (563) T ss_pred Chhhhhhhhccc-ccccccccceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCC Confidence 8888864 1111 1111222344444222111 0000 000000 0 111111233 Q ss_pred hHHHHHHHHHHHhhccchhHHHHHhhcceEEe------cCCCcEEEEEeecccc--chHHHHHHHHHHHHHHHHhcch-- Q lcl|NC_015287. 52 SEFELIKRYREMALHPEVDSAIEDIINEAIVS------DQNDSPVEIDLENLPA--SKQLKDLIREEFKTVKEILNFD-- 121 (563) Q Consensus 52 ~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~------d~~~~pV~l~l~~~~~--s~sik~~I~eeF~~i~~ll~F~-- 121 (563) +...|-+.=|.++.+|-|..||+.+.+.+.++ .++.-...|.|.+... ++..+..+. ..+..+..+..+ T Consensus 80 ~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~-~l~~~l~~~~~~~~ 158 (563) T protein:vir:95 80 NEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMK-RIEDFIVNTGKDKD 158 (563) T ss_pred CcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhH-HHHHHhhhcCCCCC Confidence 44444444556667899999999999986643 1222223555554332 222222221 222223222222 Q ss_pred ---hhhHHHH----HHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeeccccccccccc Q lcl|NC_015287. 122 ---DKCHEIL----RNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPE 194 (563) Q Consensus 122 ---~~~~~~~----R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~ 194 (563) ....+++ +..++.|.-|+.++++ .+...-+.+|.+|||..|+.++.... .+ +.. T Consensus 159 p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~-rd~~G~~~~L~pl~p~~V~v~~~~~g------~~------------~~~ 219 (563) T protein:vir:95 159 VDRDSFQTFCKKIVRDTYIYDQVNFEKVFN-KNNKTKLEKFIAVDPSTIFYATDKKG------KI------------IKG 219 (563) T ss_pred CCcchHHHHHHHHHHHHHhcCCeEEEEEEE-ecCCCceEEEEEeCCceeEEEECCCC------ce------------ecc Confidence 1344544 4455667778887775 33344489999999998877632111 11 000 Q ss_pred ccceeeecCccccccccccccCcCccccCcceeeecccceeeeecc-eecCC-CCccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 195 IEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSG-LVDRN-KQTILSYLHKSIKALNQLRMIEDSLVI 272 (563) Q Consensus 195 ~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSG-l~d~~-~~~~~syL~~Aik~~NqL~m~EDalVI 272 (563) . ..|.|-.. +.....++.+.++|-.-+ ..|.. +..++|-|+.|++.+.....+|+...= T Consensus 220 ~-~~y~~~~~------------------g~~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~ 280 (563) T protein:vir:95 220 G-KRFVQVVD------------------KRVVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDR 280 (563) T ss_pred c-eeEEEEeC------------------CceeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHH Confidence 1 11222110 011234555555432112 12222 445689999999999998888887665 Q ss_pred HHHhcCccceEEEccCCC-CchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCC Q lcl|NC_015287. 273 YRLSRAPERRIFYIDVGN-LPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPG 351 (563) Q Consensus 273 yRi~RAPeRRvFyIDvGn-lpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpG 351 (563) +=---+--+-|.-+..+. |.+..+++.-+.+-..|+.- .+.| +..-.+ ..|.+++.|-- T Consensus 281 ~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~----~nag------k~~~vl----------~~G~~~~~l~~ 340 (563) T protein:vir:95 281 FFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGI----NGSW------QIPVVM----------ADDIKFVNMTP 340 (563) T ss_pred HHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccc----cccc------cceEEc----------CCCceEEeccC Confidence 544456677777787664 55554444444433334320 0111 111011 12455666544 Q ss_pred CCCcch-HHHHHHHHHHHHHhcCCChhhccC-CCcceec-cccchhhHH---H--HHHHHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 352 GQNLGE-LSDIEYFQKKLYRSLGVPESRIAG-SGDGFNL-GRSSEILRD---E--IKFTKF-VGRMRKRFSHLFNDMLKT 422 (563) Q Consensus 352 g~nLge-i~DV~YF~~kLy~aL~VP~sRl~~-~~~g~~~-g~~~eItRD---E--lKF~Kf-I~rLr~rFs~if~d~Lk~ 422 (563) ...-.+ ++-.+|..+.+.++.+||...|.- +.+++.- ..++.+++. + +-|... +.-+..++...|.. T Consensus 341 ~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~---- 416 (563) T protein:vir:95 341 TANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNR---- 416 (563) T ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHh---- Confidence 332223 445567889999999999999953 2222211 112223322 2 224333 34444444444433 Q ss_pred HHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHH Q lcl|NC_015287. 423 QLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQ 502 (563) Q Consensus 423 qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~q 502 (563) .|+ ++ + ..++.++|.+... ..|.+.++...-...-++|..-+|.. +.+.+.+ --+.. T Consensus 417 ~L~-----~~--~---~~~~~~~f~r~D~----------~~~~e~~~~~~~~~~G~lT~NE~R~~-~gl~Pi~--gGD~~ 473 (563) T protein:vir:95 417 HII-----SE--Y---GDKYTFQFVGGDT----------KSATDKLNILKLETQIFKTVNEAREE-QGKKPIE--GGDII 473 (563) T ss_pred hhc-----hh--c---ccccEEEeccCCH----------HHHHHHHHHHHHhcCCccCHHHHHHH-hCCCCCC--Cccee Confidence 222 22 1 1457778876543 12333332222122235666666553 3344321 00000 Q ss_pred HHHHhcCCCCC----C-CcccCccccccccccCCCCcCCCCC-----cccccccCCCCcccCCCCCCCCCC Q lcl|NC_015287. 503 MEKEIKDGKLA----D-PMEVQQLQLGVHPEQMPGGAMNPQD-----PNMGGAPQEGGAEQAPQMPEGGEI 563 (563) Q Consensus 503 ie~E~~~~~~~----~-p~~~~~~~~~~~~~~~~~~~~~p~~-----~~~~~~~~~~~~~~~~~~~~~~~~ 563 (563) + ......+ + +...+.....+......++..+|.. +...++..+..++.+ ..+++..- T Consensus 474 ~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 540 (563) T protein:vir:95 474 L---DASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKEIGTD-AQIKGDDN 540 (563) T ss_pred e---cccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCCccccccc-cccccccc Confidence 0 0000000 0 0000000000000000011111111 100110000000000 01111111 No 39 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=98.67 E-value=1.2e-07 Score=58.46 Aligned_cols=472 Identities=14% Similarity=0.138 Sum_probs=196.5 Q ss_pred Ccccccc-hhhHhhhcccccccccCcCCCCcc-ccee---------------------cccccc----c--cccccchhh Q lcl|NC_015287. 1 MAKLFGF-KIQKDDEQNKNVVSPVPQSNEDSS-DYYV---------------------SSGFYG----Q--YVDIDGVFK 51 (563) Q Consensus 1 m~~~~~f-~~~~~~~~~~~~~s~~p~~~~d~~-~~~~---------------------~~~~~~----~--~~~~~~~~~ 51 (563) ||.||+- -.-+ +-..++.|.-||-+++..+ ...+ ..-|.. . +..-.+-+. T Consensus 1 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~ 79 (563) T protein:vir:99 1 MADLFKQFRLGK-DYGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMK 79 (563) T ss_pred Chhhhhhhhccc-ccccccccceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCC Confidence 8888864 1111 1111222344444222111 0000 000000 0 111111233 Q ss_pred hHHHHHHHHHHHhhccchhHHHHHhhcceEEe------cCCCcEEEEEeecccc--chHHHHHHHHHHHHHHHHhcch-- Q lcl|NC_015287. 52 SEFELIKRYREMALHPEVDSAIEDIINEAIVS------DQNDSPVEIDLENLPA--SKQLKDLIREEFKTVKEILNFD-- 121 (563) Q Consensus 52 ~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~------d~~~~pV~l~l~~~~~--s~sik~~I~eeF~~i~~ll~F~-- 121 (563) +...|-+.=|.++.+|-|..||+.+.+.+.++ .++.-...|.|.+... ++..+..+. ..+..+..+..+ T Consensus 80 ~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~-~l~~~l~~~~~~~~ 158 (563) T protein:vir:99 80 NEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMK-RIEDFIVNTGKDKD 158 (563) T ss_pred CcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhH-HHHHHhhhcCCCCC Confidence 44444444556667899999999999986643 1222223555554332 222222221 222223222222 Q ss_pred ---hhhHHHH----HHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeeccccccccccc Q lcl|NC_015287. 122 ---DKCHEIL----RNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPE 194 (563) Q Consensus 122 ---~~~~~~~----R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~ 194 (563) ....+++ +..++.|.-|+.++++ .+...-+.+|.+|||..|+.++.... .+ +.. T Consensus 159 p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~-rd~~G~~~~L~pl~p~~V~v~~~~~g------~~------------~~~ 219 (563) T protein:vir:99 159 VDRDSFQTFCKKIVRDTYIYDQVNFEKVFN-KNNKTKLEKFIAVDPSTIFYATDKKG------KI------------IKG 219 (563) T ss_pred CCcchHHHHHHHHHHHHHhcCCeEEEEEEE-ecCCCceEEEEEeCCceeEEEECCCC------ce------------ecc Confidence 1344544 4455667778887775 33344489999999998877632111 11 000 Q ss_pred ccceeeecCccccccccccccCcCccccCcceeeecccceeeeecc-eecCC-CCccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 195 IEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSG-LVDRN-KQTILSYLHKSIKALNQLRMIEDSLVI 272 (563) Q Consensus 195 ~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSG-l~d~~-~~~~~syL~~Aik~~NqL~m~EDalVI 272 (563) . ..|.|-.. +.....++.+.++|-.-+ ..|.. +..++|-|+.|++.+.....+|+...= T Consensus 220 ~-~~y~~~~~------------------g~~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~ 280 (563) T protein:vir:99 220 G-KRFVQVVD------------------KRVVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDR 280 (563) T ss_pred c-eeEEEEeC------------------CceeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHH Confidence 1 11222110 011234555555432112 12222 445689999999999998888887665 Q ss_pred HHHhcCccceEEEccCCC-CchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCC Q lcl|NC_015287. 273 YRLSRAPERRIFYIDVGN-LPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPG 351 (563) Q Consensus 273 yRi~RAPeRRvFyIDvGn-lpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpG 351 (563) +=---+--+-|.-+..+. |.+..+++.-+.+-..|+.- .+.| +..-.+ ..|.+++.|-- T Consensus 281 ~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~----~nag------k~~~vl----------~~G~~~~~l~~ 340 (563) T protein:vir:99 281 FFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGI----NGSW------QIPVVM----------ADDIKFVNMTP 340 (563) T ss_pred HHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccc----cccc------cceEEc----------CCCceEEeccC Confidence 544456677777787664 55554444444433334320 0111 111011 12455666544 Q ss_pred CCCcch-HHHHHHHHHHHHHhcCCChhhccC-CCcceec-cccchhhHH---H--HHHHHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 352 GQNLGE-LSDIEYFQKKLYRSLGVPESRIAG-SGDGFNL-GRSSEILRD---E--IKFTKF-VGRMRKRFSHLFNDMLKT 422 (563) Q Consensus 352 g~nLge-i~DV~YF~~kLy~aL~VP~sRl~~-~~~g~~~-g~~~eItRD---E--lKF~Kf-I~rLr~rFs~if~d~Lk~ 422 (563) ...-.+ ++-.+|..+.+.++.+||...|.- +.+++.- ..++.+++. + +-|... +.-+..++...|.. T Consensus 341 ~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~---- 416 (563) T protein:vir:99 341 TANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNR---- 416 (563) T ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHh---- Confidence 332223 445567889999999999999953 2222211 112223322 2 224333 34444444444433 Q ss_pred HHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHH Q lcl|NC_015287. 423 QLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQ 502 (563) Q Consensus 423 qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~q 502 (563) .|+ ++ + ..++.++|.+... ..|.+.++...-...-++|..-+|.. +.+.+.+ --+.. T Consensus 417 ~L~-----~~--~---~~~~~~~f~r~D~----------~~~~e~~~~~~~~~~G~lT~NE~R~~-~gl~Pi~--gGD~~ 473 (563) T protein:vir:99 417 HII-----SE--Y---GDKYTFQFVGGDT----------KSATDKLNILKLETQIFKTVNEAREE-QGKKPIE--GGDII 473 (563) T ss_pred hhc-----hh--c---ccccEEEeccCCH----------HHHHHHHHHHHHhcCCccCHHHHHHH-hCCCCCC--Cccee Confidence 222 22 1 1457778876543 12333332222122235666666553 3344321 00000 Q ss_pred HHHHhcCCCCC----C-CcccCccccccccccCCCCcCCCCC-----cccccccCCCCcccCCCCCCCCCC Q lcl|NC_015287. 503 MEKEIKDGKLA----D-PMEVQQLQLGVHPEQMPGGAMNPQD-----PNMGGAPQEGGAEQAPQMPEGGEI 563 (563) Q Consensus 503 ie~E~~~~~~~----~-p~~~~~~~~~~~~~~~~~~~~~p~~-----~~~~~~~~~~~~~~~~~~~~~~~~ 563 (563) + ......+ + +...+.....+......++..+|.. +...++..+..++.+ ..+++..- T Consensus 474 ~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 540 (563) T protein:vir:99 474 L---DASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKEIGTD-AQIKGDDN 540 (563) T ss_pred e---cccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCCccccccc-cccccccc Confidence 0 0000000 0 0000000000000000011111111 100110000000000 01111111 No 40 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=98.66 E-value=1.3e-07 Score=58.29 Aligned_cols=459 Identities=13% Similarity=0.133 Sum_probs=191.9 Q ss_pred Cccc---ccchhhHh------------------hhcccccccccCcCCCCcccce--------------ecccccccccc Q lcl|NC_015287. 1 MAKL---FGFKIQKD------------------DEQNKNVVSPVPQSNEDSSDYY--------------VSSGFYGQYVD 45 (563) Q Consensus 1 m~~~---~~f~~~~~------------------~~~~~~~~s~~p~~~~d~~~~~--------------~~~~~~~~~~~ 45 (563) ||+- -|| |.+. .-++....+.+||....+++.. +..|.++ ..+ T Consensus 1 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~d~~~~~~~r~g~~~~~~~~g-~~~ 78 (648) T protein:vir:79 1 MARKVWGRGF-WSRISLMWRDEDDDKEPLVLEESMQLGEAPGAMPKGGGGGGSAKRDPKMSLVKRIGLAIMDGGGG-GRD 78 (648) T ss_pred Cccchhcchh-hhhhhhhccCccccccccccccccccCCCccccCCCCcccccccccchhHHHHHhHHHHHhhcCC-ccc Confidence 5542 133 2221 1122333344455443222111 1112221 111 Q ss_pred ccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhH Q lcl|NC_015287. 46 IDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCH 125 (563) Q Consensus 46 ~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~ 125 (563) +.....+-..|-+- ...+|-|..||+.|.+.+.-. +..+.-++-.-.+..+. ..++..-+...+.+ T Consensus 79 ~~epp~d~~~l~~l---~~~np~V~~aI~iia~~ia~l-----~~~i~~~~~~~~~~~~~------~~ll~rPn~~~t~~ 144 (648) T protein:vir:79 79 FEEPEFDFNEITSA---YNTEGYVRQAVDKYIEMMFKA-----DWDFVSKNPNAVEYIRM------RFTLMAEATQIPTN 144 (648) T ss_pred cccCCcCHHHHHHH---HhcChHHHHHHHHHHHHHhhC-----cceEEecCCccchhhHH------HHHhhccCCCCCHH Confidence 22122222333232 246899999999998876543 23333322111111111 11222233444555 Q ss_pred HHHHH----HhhhhhhheeeeecCCCC------------CcceeEEEEeCCccceeeeeeeccCCccccceeeecccccc Q lcl|NC_015287. 126 EILRN----WYVDGRIYYHKVIDIKAP------------EEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPAD 189 (563) Q Consensus 126 ~~~R~----WYvDGri~fhkviD~~~~------------k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~ 189 (563) ++++. +++.|--|..++.|.+.- ..-+.+|..|+|.+|+.++. + ++ T Consensus 145 ~f~~~l~~~lll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~d---~----~g----------- 206 (648) T protein:vir:79 145 QLFIEIAEDLVKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKRD---K----FG----------- 206 (648) T ss_pred HHHHHHHHHHHhcCCeEEEEEecCCCccchhhhhhhhccccceeeeEeecCceeEEEEc---C----CC----------- Confidence 55544 457888888888774320 11257788888877665431 1 00 Q ss_pred cccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 190 IENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDS 269 (563) Q Consensus 190 ~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDa 269 (563) ...+|.|.+.+ +...+.++.+.|.|..-+ -+.++-.++|-|..|+..+....-+++. T Consensus 207 -----~~~~Y~y~~~g-----------------~~~~~~~~~~dIIHik~~-~~~d~~~GlSpi~~a~~aI~l~~aa~~~ 263 (648) T protein:vir:79 207 -----MIKGWQQEQEG-----------------QDKPQKFKPEDIVHIYYK-REKGRAFGTPWLLPALDDIRALRQVEEN 263 (648) T ss_pred -----ceeeeEEEecC-----------------CceeEEecCccEEEEccC-CCCCCceeccHHHHHHHHHHHHHHHHHH Confidence 12355664321 122345666666443311 1334555689999999888776666654 Q ss_pred HH-HHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceee Q lcl|NC_015287. 270 LV-IYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITT 348 (563) Q Consensus 270 lV-IyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsT 348 (563) .- .++=---| .-|+.+..+......+++.++.+-..|++..+ . +|+...+... T Consensus 264 ~~~fF~NGa~P-~gil~~~~~~~~~e~~k~~~e~~~~~~~~~~i---~----------------------gg~v~~~~~~ 317 (648) T protein:vir:79 264 VLRLVYRNLHP-LWHVKVGLEQEGFGAEEGEVDLVRGEVENMDV---E----------------------GGMVTTERVN 317 (648) T ss_pred HHHHHhccCCc-cEEEEeCCCccchHHHHHHHHHHHHhcccccc---c----------------------ccccccceee Confidence 43 22222223 45555555555555566666666666655322 1 1111222222 Q ss_pred cCC-C--CCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 349 LPG-G--QNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLI 425 (563) Q Consensus 349 LpG-g--~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLi 425 (563) ++- + +.+.=++-.++..+.+.++.+||...|...++ -+...+ +.. ..-|...|..|+..+..++...+-+.++ T Consensus 318 i~~~~s~~dlqfle~rk~~~~eIa~aFgVPP~lLG~~~~-ss~sta-e~~--~~~~~~~i~~l~~~i~~~le~~~~~~ll 393 (648) T protein:vir:79 318 ISSIASNQIIDAKEYLKHFEQRAFTVLGVSELMMGRGGT-ASRSTG-DNL--SSDFKDRIKALQKVMATFINEFMVKEIL 393 (648) T ss_pred ccccCCHHHHHHHHHHHHHHHHHHHHhCCCHhHcccCCC-ccchHH-HHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 221 1 12222333477889999999999999943221 111111 111 2237778888888777777766555555 Q ss_pred hcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhh-HHHHHHHHH Q lcl|NC_015287. 426 LKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEE-VIEIDKQME 504 (563) Q Consensus 426 LkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdee-i~e~~~qie 504 (563) +..-+. +|-.....++|+|..-.--.+. .|.+.+.++ +-+-++|.+-+|.. +.+.+.+ -.. . T Consensus 394 ~e~~l~--~~l~~d~~ieF~~~~Llr~D~~-------~~a~~~~~l--~~~GilT~NEaR~~-lGlpPi~~g~~-----~ 456 (648) T protein:vir:79 394 MEGGFD--PVLNPDDKVEFRFNEIDMDSKI-------KLENQAVFL--YEHNAISEDEMREL-IGRDPVDDGEG-----R 456 (648) T ss_pred hhhhcc--ccccccceEEEeecccchhhHH-------HHHHHHHHH--HhCCCcCHHHHHHH-hCCCCCCCCCC-----c Confidence 544332 2322234567777531111111 222333222 22346666666653 3444411 000 0 Q ss_pred HHhcCCCCCCCcccCcccccccccc----CCCCcCCCCC-cccccccCCCCcccCC-CCCCCCCC Q lcl|NC_015287. 505 KEIKDGKLADPMEVQQLQLGVHPEQ----MPGGAMNPQD-PNMGGAPQEGGAEQAP-QMPEGGEI 563 (563) Q Consensus 505 ~E~~~~~~~~p~~~~~~~~~~~~~~----~~~~~~~p~~-~~~~~~~~~~~~~~~~-~~~~~~~~ 563 (563) .......++..++..+..+.+.+-. ..+++..+.. .+.+.+.++.+-+..| --+|+... T Consensus 457 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~eg~~~e~~~~~~~~~~~g~~~~~~~~~~~~~~ 521 (648) T protein:vir:79 457 AKMHLQMVTIAQATALAALAPTPAGGSSASASGDKKKKATDNKTKPTNQHGTKTSPKKQTNGRHV 521 (648) T ss_pred cccccccccchhccccccCCCCCCCCCCCCccccccccccCCCCCCCCCCCcCCCCccccchhhh Confidence 0000011110000000000000000 0000000000 0000000000000000 01111111 No 41 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=98.64 E-value=1.6e-07 Score=57.82 Aligned_cols=458 Identities=10% Similarity=0.029 Sum_probs=180.5 Q ss_pred CcccccchhhHhhhcccccc--cccCcCCCCcccceeccccccccccccc---hhhhHHHHHH--------HHHHHhhcc Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVV--SPVPQSNEDSSDYYVSSGFYGQYVDIDG---VFKSEFELIK--------RYREMALHP 67 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~--s~~p~~~~d~~~~~~~~~~~~~~~~~~~---~~~~~~~LI~--------~YR~ma~~P 67 (563) |.+.=.| ++...+.+.+ .+=+. +.+.|. ++..... ...-...+|+ +|+.+..+. T Consensus 1 ~~~~~~~---~~~~~~~~~~~~~~~~~---------~n~~~~-~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy 67 (511) T protein:vir:96 1 MLKVNEF---ETDTDLRGNINYLFNDE---------ANVVYT-YDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYY 67 (511) T ss_pred Cccccch---hhhhhhhhhhhhhhhhh---------hCCccc-cchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHh Confidence 4444444 1111111111 01011 112111 1111110 1111122222 344444333 Q ss_pred chhHH----------------------HHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhH Q lcl|NC_015287. 68 EVDSA----------------------IEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCH 125 (563) Q Consensus 68 Evd~A----------------------i~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~ 125 (563) +=+.. ..-||+- .+.=--..||.+..++ +...+.+..|++..+|+.... T Consensus 68 ~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~-~~~yl~g~p~~~~~~~--------~~~~~~l~~~~~~n~~~~~~~ 138 (511) T protein:vir:96 68 EGKTKNLVELTRRKEEYMADNRVAHDYASYISDF-INGYFLGNPIQYQDDD--------KDVLEAIEAFNDLNDVESHNR 138 (511) T ss_pred cccCccccccCcCcccccCcceeecchHHHHHHH-HHhhhccCCceeecCc--------hHHHHHHHHHHhhcCHHHHHH Confidence 22111 1122221 1111134566666554 234567888888889999999 Q ss_pred HHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCcc Q lcl|NC_015287. 126 EILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQ 205 (563) Q Consensus 126 ~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~ 205 (563) ++.+...+.|+-|.+.-+| ++|-+.+..+||+.+..|..-..+...-..|.........+.....+..+-+|.+.. T Consensus 139 ~~~~~~~i~G~a~~~vy~d----ed~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~ 214 (511) T protein:vir:96 139 SLGLDLSIYGKAYELMIRN----QDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHG 214 (511) T ss_pred HHHHHHHhcCeeEEEEEeC----CCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCc Confidence 9999999999999887665 357789999999999888532222111122222111111111111122222455544 Q ss_pred ccccccccccCcCccccCcceeeeccc--ceeeeecceecCCCCccchhHHHHHHHHHHHHHHH-HHHHHHHHhcCccce Q lcl|NC_015287. 206 TQGKNALGAIGQTPFSTKQRPVKIAPD--AISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIE-DSLVIYRLSRAPERR 282 (563) Q Consensus 206 ~~~~~~~g~~g~~~~~~~~~~iki~~d--aI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~E-DalVIyRi~RAPeRR 282 (563) .......+................+.. +|++. .|+....|-++..+.....+..+- +....-+-++.|-+- T Consensus 215 i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~------~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv 288 (511) T protein:vir:96 215 VYRYLTSRTNGLKLTPRENGFESHSFERMPITEF------SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLL 288 (511) T ss_pred EEEEEecCCCcccccccccccccccCCceeeEEe------cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceee Confidence 333221111100000000000000011 22221 122334566666665555443322 222222334444333 Q ss_pred EEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCccc-ccccccchhhhhccc---ccCCCCccceeecCCCCCcchH Q lcl|NC_015287. 283 IFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIR-DDRKHMSMLEDFWLP---RREGGRGTEITTLPGGQNLGEL 358 (563) Q Consensus 283 vFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~-d~~~~msMlEDywLp---RReGgrgTEIsTLpGg~nLgei 358 (563) +.-... ..++++. +....+-.+...... ....+.|..+..|-...+...+ T Consensus 289 ~~g~~~--------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 342 (511) T protein:vir:96 289 IKGNLN--------------------------LDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGT 342 (511) T ss_pred eecCcc--------------------------CCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHH Confidence 221000 0011111 111111111111100 1112224456666554444433 Q ss_pred -HHHHHHHHHHHHhcCCChhhccCCCcceeccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHH Q lcl|NC_015287. 359 -SDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEIL--RDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDW 435 (563) Q Consensus 359 -~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eIt--RDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew 435 (563) .-+.-+.+.+|.-.++|---. + +|. |..|.+. .-...-..-+.+.++.|..-+...++.=+-+-++....++ T Consensus 343 e~~~~~L~~~I~~~s~~p~~~~--~--~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~ 417 (511) T protein:vir:96 343 EAYKDRLNSDIHMFTNTPNMKD--D--NFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDA 417 (511) T ss_pred HHHHHHHHHHHHHHhCCccccc--c--ccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccc Confidence 334566778888888885322 1 222 2222221 1112233334455555555555555442222222222222 Q ss_pred HHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCC----- Q lcl|NC_015287. 436 EVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDG----- 510 (563) Q Consensus 436 ~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~----- 510 (563) +.-...+++.|...---.+ .+.++++..+ .| .+|.+++++.+-..+| .+++.++|++|..+. T Consensus 418 ~~d~~~i~~~f~~~~p~n~-------~e~~~~~~kl---~G-~iS~et~l~~l~~v~D--~~~E~~ri~~E~~~~~~~~~ 484 (511) T protein:vir:96 418 NKDFNTVRYVYNRNLPKSL-------IEELKAYIDS---GG-KISQTTLMSLFSFFQD--PELEVKKIEEDEKESIKKAQ 484 (511) T ss_pred ccccccceEEeCCCCCCCH-------HHHHHHHHHH---hc-cCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHh Confidence 2222356777864222222 2344555555 34 4899999976654443 334444455553321 Q ss_pred --CCCCCcccCccccccccccCCCCcCCCCCcccccccC-CCCccc Q lcl|NC_015287. 511 --KLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQ-EGGAEQ 553 (563) Q Consensus 511 --~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~-~~~~~~ 553 (563) .+..|+..++ .+...... .+.+++ T Consensus 485 ~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~~ 511 (511) T protein:vir:96 485 KGIYKDPRDIND-------------------DEQDDDTKDTVDKKE 511 (511) T ss_pred hccccCCCCCCC-------------------CCCCCcccccccccC Confidence 1111111111 11111111 111112 No 42 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=98.63 E-value=1.6e-07 Score=57.78 Aligned_cols=461 Identities=11% Similarity=0.043 Sum_probs=179.6 Q ss_pred CcccccchhhHhhhcccccc--cccCcCCCCcccceeccccccccccccc---hhhhHHHHHH--------HHHHHhhcc Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVV--SPVPQSNEDSSDYYVSSGFYGQYVDIDG---VFKSEFELIK--------RYREMALHP 67 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~--s~~p~~~~d~~~~~~~~~~~~~~~~~~~---~~~~~~~LI~--------~YR~ma~~P 67 (563) |.+.=.| ++...+.+.+ .+=+. +.+.|. ++..... ...-...+|+ +|+.+..+. T Consensus 1 ~~~~~~~---~~~~~~~~~~~~~~~~~---------~n~~~~-~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy 67 (511) T protein:vir:93 1 MLKVNEF---ETDTDLRGNINYLFNDE---------ANVVYT-YDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYY 67 (511) T ss_pred Cccccch---hhhhhhhhhhhhhhhhh---------hCCccc-ccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHh Confidence 4444444 1111111111 01011 111111 1111110 1111122222 344444443 Q ss_pred chhHHH----------------------HHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhH Q lcl|NC_015287. 68 EVDSAI----------------------EDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCH 125 (563) Q Consensus 68 Evd~Ai----------------------~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~ 125 (563) +-...| .-||+ ..+.=-...||.+..++ +...+.+..|++..+|+.... T Consensus 68 ~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~-~~~~yl~g~p~~~~~~d--------~~~~~~l~~~~~~n~~~~~~~ 138 (511) T protein:vir:93 68 EGKTKNLVELTRRKEEYMADNRVAHDYASYISD-FINGYFLGNPIQYQDDD--------KDVLEVIEAFNDLNDVESHNR 138 (511) T ss_pred cccCccccccCcCcccccCcceeecchHHHHHH-HHhhhhcccCeeeccCC--------hHHHHHHHHHHhhcCHhHHHH Confidence 322221 11222 11111234566665544 234566778888889999999 Q ss_pred HHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCcc Q lcl|NC_015287. 126 EILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQ 205 (563) Q Consensus 126 ~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~ 205 (563) ++.+..++.|+-|.+.-+| ++|-..+..+||+.+..|.--.........|.+.......+.....+..+=+|.+.. T Consensus 139 ~~~~~~~~~G~ay~~vy~d----e~~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~ 214 (511) T protein:vir:93 139 SLGLDLSIYGKAYELMIRN----QDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHG 214 (511) T ss_pred HHHHHHHhcCeeEEEEEeC----CCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCc Confidence 9999999999999987766 356788999999999888432221111122221111100001111111122454443 Q ss_pred ccccccccccCcCccccCcceee--ecccceeeeecceecCCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccce Q lcl|NC_015287. 206 TQGKNALGAIGQTPFSTKQRPVK--IAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRM-IEDSLVIYRLSRAPERR 282 (563) Q Consensus 206 ~~~~~~~g~~g~~~~~~~~~~ik--i~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m-~EDalVIyRi~RAPeRR 282 (563) .......+..+............ +..=+|++. .|+....|-++..+.....+.. +=+....-+-++.|-+- T Consensus 215 i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~------~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv 288 (511) T protein:vir:93 215 VYRYLTSRTNGLKLTPRENGFESHSFERMPITEF------SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLL 288 (511) T ss_pred EEEEEecCCCccccccccccccccCCCccceEEe------cCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCccee Confidence 33221111110000000000000 001122211 1223345667766666555542 22333333445555443 Q ss_pred EEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCccc-ccccccchhhh-hc--ccccCCCCccceeecCCCCCcch- Q lcl|NC_015287. 283 IFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIR-DDRKHMSMLED-FW--LPRREGGRGTEITTLPGGQNLGE- 357 (563) Q Consensus 283 vFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~-d~~~~msMlED-yw--LpRReGgrgTEIsTLpGg~nLge- 357 (563) +.=..... .++++ +....+-.+.. .| ...-....|..+..|-...+... T Consensus 289 ~~G~~~~~--------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 342 (511) T protein:vir:93 289 IKGNLNLD--------------------------PVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGT 342 (511) T ss_pred eecCcccC--------------------------chhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHH Confidence 33211000 01111 11111111111 11 11112223344555544333332 Q ss_pred HHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHH Q lcl|NC_015287. 358 LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEIL--RDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDW 435 (563) Q Consensus 358 i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eIt--RDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew 435 (563) ..-+.-..+.+|+-.++|---. + +|. |..|.+. .-......-+.+.++.|..-+.+.++.=+-+-++....++ T Consensus 343 ~~~~~~L~~~I~~~s~~P~~~~--~--~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~ 417 (511) T protein:vir:93 343 EAYKDRLNSDIHMFTNTPNMKD--D--NFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDA 417 (511) T ss_pred HHHHHHHHHHHHHHhCCccccc--c--ccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc Confidence 2233445677778888884322 1 222 2322222 1111123345555555555555555442222122222222 Q ss_pred HHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccCh--hhHHHHHHHHHHHhcC---C Q lcl|NC_015287. 436 EVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKD--EEVIEIDKQMEKEIKD---G 510 (563) Q Consensus 436 ~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~td--eei~e~~~qie~E~~~---~ 510 (563) ..-...+.+.|...---.+ .+.++++..+. | .+|.++++..+-..+| +|+++++++-+++.+. + T Consensus 418 ~~d~~~i~~~f~~~~p~n~-------~e~~~~~~kl~---g-~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~ 486 (511) T protein:vir:93 418 NKDFNTVRYVYNRNLPKSL-------IEELKAYIDSG---G-KISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKG 486 (511) T ss_pred ccccccceEEeCCCCCCCH-------HHHHHHHHHHh---c-cCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhh Confidence 2222456777854222222 23445555553 4 4899999987654333 4444444433322221 1 Q ss_pred CCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccC Q lcl|NC_015287. 511 KLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQA 554 (563) Q Consensus 511 ~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~ 554 (563) .+..|+..++++ + +...+.++.++- T Consensus 487 ~~~~~~~~~~~~--------------~-----~~~~~~~~~~~~ 511 (511) T protein:vir:93 487 IYKDPRDINDDE--------------Q-----DDDTKDTVDKKE 511 (511) T ss_pred cccCCCCCCCCC--------------C-----CCcccccccccC Confidence 111111111110 0 111111111111 No 43 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=98.63 E-value=1.7e-07 Score=57.70 Aligned_cols=449 Identities=14% Similarity=0.118 Sum_probs=194.9 Q ss_pred cCcCCC-Ccccceeccccccccccccchhh---hHHHHHHHHH------------------HH-hhccchhHHHHHhhcc Q lcl|NC_015287. 23 VPQSNE-DSSDYYVSSGFYGQYVDIDGVFK---SEFELIKRYR------------------EM-ALHPEVDSAIEDIINE 79 (563) Q Consensus 23 ~p~~~~-d~~~~~~~~~~~~~~~~~~~~~~---~~~~LI~~YR------------------~m-a~~PEvd~Ai~eIvne 79 (563) ++.... |... .+. . +..... .+.+.+.+|= .. ..++=+.-+|+..++ T Consensus 1 ~~~~~~~d~~~-~i~-~-------L~~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~a~- 70 (488) T protein:vir:23 1 MAETESIDPEK-LRD-Q-------LLDAFENKQNELKSSKAYYDAERRPDAIGLAVPLDMRKYLAHVGYPRTYVDAIAE- 70 (488) T ss_pred CCcccCCCHHH-HHH-H-------HHHHHHHHHHHHHHHHHHHhcccchhhcCcccchhhhhhhhhcchHHHHHHHHHH- Confidence 322221 1111 100 0 000111 1111111111 00 111222233333332 Q ss_pred eEEecCCCcEEEEEe-eccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeec----CCCCCcceeE Q lcl|NC_015287. 80 AIVSDQNDSPVEIDL-ENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVID----IKAPEEGIKE 154 (563) Q Consensus 80 aiv~d~~~~pV~l~l-~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD----~~~~k~GI~e 154 (563) ..+.+- +.+.. .....+..-.+...+++..|++..+|+....++.+.+++.|+-|...-.+ .-...+|... T Consensus 71 ~l~~~G----f~~~~~~~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~~ 146 (488) T protein:vir:23 71 RQELEG----FRIPSANGEEPESGGENDPASELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVPL 146 (488) T ss_pred hhhccc----eeccCCcccccccccchhHHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcce Confidence 000000 00000 01111222233556778888888999999999999999999987743221 1224567778 Q ss_pred EEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCc---CccccCcceeeecc Q lcl|NC_015287. 155 VRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQ---TPFSTKQRPVKIAP 231 (563) Q Consensus 155 Lr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~---~~~~~~~~~iki~~ 231 (563) ++.++|+.+-.+.. .........+.+... .....+..+.+|.+......-..+ ++- ....+.- -++| T Consensus 147 i~~~~p~~~~~~~d-~~~~~~~~~~~~~~~-----~~~~~~~~~~~y~~~~~~~~~~~~-~~~~~~~~~~h~~--g~vP- 216 (488) T protein:vir:23 147 IRVEPPTALYAEVD-PRTRKVLYAIRAIYG-----ADGNEIVSATLYLPDTTMTWLRAE-GEWEAPTSTPHGL--EMVP- 216 (488) T ss_pred EEEeccceeEEEEe-cCCCceEEEEEEEEe-----cCCCcEEEEEEEecCcEEEEEecC-CceEeccccccCC--CCcc- Confidence 88999998877743 111111112222111 111223334455443322211100 000 0000000 0111 Q ss_pred cceeeeecceecCCCCccchhHHHHHHHH-H-HHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcc Q lcl|NC_015287. 232 DAISFCHSGLVDRNKQTILSYLHKSIKAL-N-QLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRN 309 (563) Q Consensus 232 daI~y~hSGl~d~~~~~~~syL~~Aik~~-N-qL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~kn 309 (563) .|.|+|..-.. +.-+.|-|.+.++++ . -=+++-+..+.-....-|.|-|.=.+....+.... +. T Consensus 217 -vv~f~n~~~~~--~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~-----------~~ 282 (488) T protein:vir:23 217 -VIPISNRTRLS--DLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGINAE-----------TG 282 (488) T ss_pred -eEEeccccccC--CcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCccccccccc-----------cc Confidence 13355442221 122346565554433 2 23455566666676667777554222111111000 00 Q ss_pred eeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCChhhccCCCcceec Q lcl|NC_015287. 310 KLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDIEYFQKKLYRSLGVPESRIAGSGDGFNL 388 (563) Q Consensus 310 klvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~ 388 (563) +.+.++..| ..|+- ++|-..++-++++.. ++. ++-++=.-..++...++|..-|... +-|- T Consensus 283 ~~~~~~~~~-------------~v~~~--~~g~~~~~~q~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~~g~~--~~n~ 344 (488) T protein:vir:23 283 QRMFDAYMA-------------RILAF--EGGEGAHAEQFSAAE-LRNFVDALDALDRKAASYSGLPPQYLSSS--SDNP 344 (488) T ss_pred chhhhhhhh-------------hhccC--CCCCCceeEecCCCC-hHHHHHHHHHHHHHHhcccCCCHHHhccc--cCcc Confidence 011111111 22322 233445677777643 432 2234444455566778887777322 1111 Q ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-ChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHH Q lcl|NC_015287. 389 GRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIV-TPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGV 467 (563) Q Consensus 389 g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~-t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~ 467 (563) ..+..|..-+..+-.-+.+.++.|..-+.+.++.-+.+.|.. ...+| ..|.+.|..-..-+. .+.+++ T Consensus 345 ~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~----~~i~v~f~~~~~~s~-------~~~ada 413 (488) T protein:vir:23 345 ASAEAIKAAESRLVKKVERKNKIFGGAWEQAMRLAYKMVKGGDIPTEY----YRMETVWRDPSTPTY-------AAKADA 413 (488) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcchhh----ccceEEecCCCCCCH-------HHHHHH Confidence 223345555555777788888999999999888877655543 22333 357888864332222 345555 Q ss_pred HHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcC--CCCCCC--cccCccccccccccCCCCcCCCCCcccc Q lcl|NC_015287. 468 VAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKD--GKLADP--MEVQQLQLGVHPEQMPGGAMNPQDPNMG 543 (563) Q Consensus 468 ~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~--~~~~~p--~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 543 (563) +..+..-....+|.++++..+ .+++.+++++++..++|... +.+... ...++..++ ....+ .+.+|+.+ T Consensus 414 ~~kl~~~g~~~~s~et~~~~l-~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~---~~~~~e~~ 486 (488) T protein:vir:23 414 AAKLFANGAGLIPRERGWVDM-GYTIVEREQMRQWLEQDQKQGLGLIGSLYGASTPEGKPG---EAPVG---EPPAPEPD 486 (488) T ss_pred HHHHHhcccccCCHHHHHHhC-CCCchHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccCC---CCCCC---CCCCCCCC Confidence 555544323578889888775 67887777766544433221 111000 000000000 00111 12222222 Q ss_pred cc Q lcl|NC_015287. 544 GA 545 (563) Q Consensus 544 ~~ 545 (563) .| T Consensus 487 ~a 488 (488) T protein:vir:23 487 AA 488 (488) T ss_pred CC Confidence 22 No 44 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=98.57 E-value=2.7e-07 Score=56.59 Aligned_cols=455 Identities=10% Similarity=0.000 Sum_probs=183.7 Q ss_pred cccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhH---------------------HH Q lcl|NC_015287. 15 QNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDS---------------------AI 73 (563) Q Consensus 15 ~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~---------------------Ai 73 (563) +++.-.+..|....-.... ....-... ......+|+.+..+.+=+. -. T Consensus 1 ~~~~~~~~~~~~~~~~~~~---------~~~~i~~~--~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~~ 69 (489) T protein:vir:99 1 MLQEDFEAIDYESKLWIDQ---------LKNYISRF--KAEQLERLKELKRYYLGDNNIKYRPAKTDKYAADNRIASDFA 69 (489) T ss_pred CCccceeeeCCCCCCCHHH---------HHHHHHHH--HHHHHHHHHHHHHHhcccCccccccccccccCCcceeecchH Confidence 2222222222111100000 00000000 1112233333333332111 11 Q ss_pred HHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCccee Q lcl|NC_015287. 74 EDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIK 153 (563) Q Consensus 74 ~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~ 153 (563) ..||+-. +.=-..+||.+..++ +.+.+.+..|.+..+|+....++.+.+++.|+-|....+++....+|-. T Consensus 70 ~~iv~~~-~~~l~g~~~~~~~~d--------~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~~ 140 (489) T protein:vir:99 70 KYITVFE-QGYMLGVPVEYKNEN--------KDLQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTEV 140 (489) T ss_pred HHHHHHH-hhhhccCCceeecCC--------hhHHHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcce Confidence 1112110 011124566655543 2345566677878899999999999999999999887777766678899 Q ss_pred EEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCcccccccccc--ccCcC---ccccCcceee Q lcl|NC_015287. 154 EVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALG--AIGQT---PFSTKQRPVK 228 (563) Q Consensus 154 eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g--~~g~~---~~~~~~~~ik 228 (563) .+..+||+++..|..-. .....+-.++--............+.+|.+.........+ ..+.. ...+.- -+ T Consensus 141 ~i~~~~p~~~~~v~dd~---~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~--g~ 215 (489) T protein:vir:99 141 KLYQLPAEQTFVIYDDT---YQRNSLMAVHFYDIDYGSGKRKQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFF--KG 215 (489) T ss_pred EEEEEcccceEEEEcCC---CCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEecCCCcccceecccccccC--Cc Confidence 99999999988774211 1122222221100011111223344556554332211111 00000 000000 01 Q ss_pred ecccceeeeecceecCCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhh Q lcl|NC_015287. 229 IAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRY 307 (563) Q Consensus 229 i~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m-~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~ 307 (563) +| .|.|. |.....|-++..+.....+.. +-+....-+..+.|-+-+.-..... ...-++....+.. T Consensus 216 vP--vv~~~-------n~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~---~~~~~~~~~~~~~- 282 (489) T protein:vir:99 216 VP--VNEYA-------NNEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTG---ADENDYLDDGRLN- 282 (489) T ss_pred ee--EEEee-------cCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCccc---ccchhhhhhcccc- Confidence 11 11122 222334555555544444422 2233333344445554443322111 1111111111100 Q ss_pred cceeEeeCCCCccc-ccccccchhhhhcccccCC--CCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCChhhccCCC Q lcl|NC_015287. 308 RNKLVYDASTGEIR-DDRKHMSMLEDFWLPRREG--GRGTEITTLPGGQNLGEL-SDIEYFQKKLYRSLGVPESRIAGSG 383 (563) Q Consensus 308 knklvYd~~TGev~-d~~~~msMlEDywLpRReG--grgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~VP~sRl~~~~ 383 (563) -++..+... .....+ +++..... +.+..+.-|.-..+.+.. .-+.-+.+.+|+-.++|- +..+ T Consensus 283 -----~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~--~~~~- 349 (489) T protein:vir:99 283 -----PNGRLAISIGFKKAQV-----LILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPD--TQDM- 349 (489) T ss_pred -----ccccccccccccccee-----eeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCcc--cccc- Confidence 001111000 011111 11111111 122234434333232221 234566778888889983 2222 Q ss_pred cceeccccchhhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHH-hhhceEEEeeccchHHHHHHHHH Q lcl|NC_015287. 384 DGFNLGRSSEILR--DEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEV-LGDHIQYDFVYDNHFAELKETEL 460 (563) Q Consensus 384 ~g~~~g~~~eItR--DElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~-~~~~i~~~f~~Dn~f~ElKe~Ei 460 (563) +|. |.+|...- -+.--..-+.+-|+.|...+.++++.=+-+-++.....|.. ....|.+.|....--.+. T Consensus 350 -~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~----- 422 (489) T protein:vir:99 350 -KFS-GVQSGESMKYKLMASDNYREKQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDN----- 422 (489) T ss_pred -ccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHH----- Confidence 221 22222221 11112333555566666666666654322223221121211 224577888543332232 Q ss_pred HHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCc Q lcl|NC_015287. 461 LNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDP 540 (563) Q Consensus 461 ~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~ 540 (563) +-++++.++ +| .+|.+++++.+-..++++.+++-++|++|.....- ..++..+. . + +. T Consensus 423 --~~~~~~~kl---~g-iis~et~~~~l~~v~~~d~~~E~~ri~~E~~~~~~-------~~~~~~~~-~-------~-~~ 480 (489) T protein:vir:99 423 --EIVTAAQNL---YG-IVSDQTIFEILNTVTGVDAEAELKRLKEEADKKQS-------LPEPRLVG-D-------A-SG 480 (489) T ss_pred --HHHHHHHHH---hc-cCCHHHHHHhcCCCCchhHHHHHHHHHHHHHHHhc-------cccccccC-C-------C-CC Confidence 234444444 35 38999999988777877888777778777543111 00110000 0 0 00 Q ss_pred ccccccCCC Q lcl|NC_015287. 541 NMGGAPQEG 549 (563) Q Consensus 541 ~~~~~~~~~ 549 (563) +..++...| T Consensus 481 ~~~~~~~~p 489 (489) T protein:vir:99 481 QEEPTAEKP 489 (489) T ss_pred CcCCCCCCC Confidence 000011111 No 45 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=98.55 E-value=3e-07 Score=56.31 Aligned_cols=445 Identities=11% Similarity=0.107 Sum_probs=200.8 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhh--------------- Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMAL--------------- 65 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~--------------- 65 (563) |-+=++.-++..-.+..- ..++.+..+.. .+.++ ......|+.+|++.. T Consensus 1 m~~~~~~~~~~~~~~~~~-~~~~~~~~~~~-~i~~~--------------~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~ 64 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGL-LKSLKDVTDHK-KVNAN--------------DEDYKYIDMWKRLYQGNYAEWHNLNYEHNG 64 (499) T ss_pred ChhHHHHHHHHHHHHhcc-ccchhhhhcCC-CCcCC--------------HHHHHHHHHHHHHhcCCcchhhccccccCC Confidence 433222222222222110 01111111100 00000 111122333333321 Q ss_pred ---------ccchhHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhh Q lcl|NC_015287. 66 ---------HPEVDSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGR 136 (563) Q Consensus 66 ---------~PEvd~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGr 136 (563) .+-...+++..++ .+. ..||.+.+++ +...+.++.+++-.+|.....+++..-.+-|. T Consensus 65 ~~~~~~~~s~n~~~~iv~~~a~--~l~---~ep~~i~~~d--------~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~ 131 (499) T protein:vir:80 65 NPVNRRQLSMNLPKVTAKYMSK--LLF---NEKVKINIDD--------ETAEEFVLNVLKTNGFTKNMERYIEYGEAMGG 131 (499) T ss_pred CccccceeecchHHHHHHHHHH--hhh---CCcceEeeCC--------HHHHHHHHHHHhhccHHHHHHHHHHHHhhcCc Confidence 1222333344333 222 2477777765 35566677788778899999999999999999 Q ss_pred hheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCc----ccccccc- Q lcl|NC_015287. 137 IYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPS----QTQGKNA- 211 (563) Q Consensus 137 i~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~----~~~~~~~- 211 (563) .|||..+|.. |=+.+.+++|.++-++..-- .... .+-.+..-...+..|-- -||..+... +...+.. T Consensus 132 ~~~~~~~D~~----~~~~i~~v~a~~~~Pi~~d~--~~~~-~~~f~~~~~~~~~~y~~-lE~h~~~~~~~~~y~I~n~~~ 203 (499) T protein:vir:80 132 FVIKVYHDGN----KNVKVSFATADCMYPLSNDS--ENVD-ECLIANSFHKNNKYYKL-LEWNEWKGEKEEVYTVTTELY 203 (499) T ss_pred EEEEEEECCC----CcEEEEEEcCCceEEEEecC--CCeE-EEEEEEEEeecCeEEEE-EEEEEecccceeeEEEEEEEE Confidence 9999999953 55678999999988874321 1111 11111111111111111 122111110 0000000 Q ss_pred ----ccccCc-Ccc---ccCc-ceeee-cccceeeeecc-----eecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015287. 212 ----LGAIGQ-TPF---STKQ-RPVKI-APDAISFCHSG-----LVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLS 276 (563) Q Consensus 212 ----~g~~g~-~~~---~~~~-~~iki-~~daI~y~hSG-----l~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~ 276 (563) ....|. .+. ..+. ....+ ..+...|++.- -.+.....++|-|+.|...+..|-..-+.++ |.. T Consensus 204 ~~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~--~e~ 281 (499) T protein:vir:80 204 QSDDPNELGGKVSLKLLFNDIEPVVPLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYY--QEF 281 (499) T ss_pred eccCccccCcccchhhhccCcCCceeecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHH--HHH Confidence 000011 000 0000 00011 11222222210 0123334567889999988888887776654 778 Q ss_pred cCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCC-Cccc-ccccccchhhhhcccccCCCCccceeecCCCCC Q lcl|NC_015287. 277 RAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDAST-GEIR-DDRKHMSMLEDFWLPRREGGRGTEITTLPGGQN 354 (563) Q Consensus 277 RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~T-Gev~-d~~~~msMlEDywLpRReGgrgTEIsTLpGg~n 354 (563) |.=.+|+|. +..=+.. ..=++... +... +++.+..+ +=..++.|--|+++.+.-. T Consensus 282 ~~~~~~i~v-~~~~l~~----------------~~~~~g~~~~~~~~~~~~~~~~------~~~~~~~~~~i~~~~~~ir 338 (499) T protein:vir:80 282 KLGKKKVLV-PSSFVKT----------------AVNLDGSTTQYFDSTDEAFFLY------QGEQDDNGKAIKDISVEIR 338 (499) T ss_pred Hhcccceec-chhhhhc----------------cCCCCCCcccCCCcccceeeEe------eccCCCCcCceeEecCcCC Confidence 887777774 2111100 00011110 1111 11111111 0001112223666665443 Q ss_pred cch-HHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------Hh Q lcl|NC_015287. 355 LGE-LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQL-------IL 426 (563) Q Consensus 355 Lge-i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qL-------iL 426 (563) -.+ ..-+..+.+.+....|+|-+-|..+++| .-.|+||.-....-..-+...++.|..-+.++++.=| .+ T Consensus 339 ~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g--~~TAtei~s~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~ 416 (499) T protein:vir:80 339 STEFIESINAMLRIYAMQVGLSAGTFTFDENG--LKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKLIKAY 416 (499) T ss_pred hHHHHHHHHHHHHHHHHhcCCChhhcCCCccc--chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc Confidence 232 3567788889999999998888644322 1235565444333222344445555544555444422 23 Q ss_pred cCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHH Q lcl|NC_015287. 427 KNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKE 506 (563) Q Consensus 427 kgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E 506 (563) .|. .|+. ..+.++|...-.-.+..+ ++.+.++-- .| .+|..+++.+....+|+|.+++.++|++| T Consensus 417 ~~~----~~~~--~~v~v~f~d~i~~d~~~~-------~~~~~~~~~-~G-i~S~et~l~~~~~~~d~ea~~el~~i~~E 481 (499) T protein:vir:80 417 DGD----TVEL--DTITVDFDDSIAQDEDTT-------INRYTTAKN-QG-MIPLKIALQRAWNITEAEADEWAEMLAKE 481 (499) T ss_pred cCC----CCCc--cceEEEeCCCCCCCHHHH-------HHHHHHHHH-cC-CCCHHHHHhhcCCCChHHHHHHHHHHHHH Confidence 332 2322 578888854332222221 111111110 13 58888888888889999999898999888 Q ss_pred hcCCCCCCCcccCccccccccccCCCCcC Q lcl|NC_015287. 507 IKDGKLADPMEVQQLQLGVHPEQMPGGAM 535 (563) Q Consensus 507 ~~~~~~~~p~~~~~~~~~~~~~~~~~~~~ 535 (563) .... .++| +++ +..+-++ T Consensus 482 ~~~~-~~~~------d~~----g~~ge~e 499 (499) T protein:vir:80 482 KQAE-IPNN------DMT----GIFGEEE 499 (499) T ss_pred hhcC-CCCC------Ccc----ccCCCCC Confidence 6532 2111 111 1111111 No 46 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=98.53 E-value=3.5e-07 Score=55.99 Aligned_cols=454 Identities=11% Similarity=0.038 Sum_probs=178.3 Q ss_pred CcccccchhhHhhhcccccc--cccCcCCCCcccceeccccccccccccc---hhhhHHHHHH--------HHHHHhhcc Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVV--SPVPQSNEDSSDYYVSSGFYGQYVDIDG---VFKSEFELIK--------RYREMALHP 67 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~--s~~p~~~~d~~~~~~~~~~~~~~~~~~~---~~~~~~~LI~--------~YR~ma~~P 67 (563) |.+.-+| ++-..+.+.+ .+=+. +.+.|. ++..-+. ...-...+|+ +|+.+..+. T Consensus 1 ~~~~~~~---~~~~~~~~~~~~~~~~~---------~n~~~~-~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy 67 (511) T protein:vir:99 1 MLKVNEF---ETDTDLRGNINYLFNDE---------ANVVYT-YDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYY 67 (511) T ss_pred Cccccch---hhhhhhhhhhhhhhhhh---------hCCccc-cchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHh Confidence 4333333 1111111111 01011 112221 1111110 0111112222 344444333 Q ss_pred chhHHH----------------------HHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhH Q lcl|NC_015287. 68 EVDSAI----------------------EDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCH 125 (563) Q Consensus 68 Evd~Ai----------------------~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~ 125 (563) +=...| .-||+- .+.=--..||.+..++ +...+.+..|++..+|+.... T Consensus 68 ~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~-~~~yl~g~p~~~~~~d--------~~~~~~l~~~~~~n~~~~~~~ 138 (511) T protein:vir:99 68 EGKTKNLVELTRRKEEYMADNRVAHDYASYISDF-INGYFLGNPIQYQDDD--------KDVLEAIEAFNDLNDVESHNR 138 (511) T ss_pred cccCccccccCcccccccCcceeecchHHHHHHH-HHhhhcccCceeecCc--------hHHHHHHHHHHhhcCHhHHHH Confidence 322111 112221 0011124566665543 235567788888889999999 Q ss_pred HHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCcc Q lcl|NC_015287. 126 EILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQ 205 (563) Q Consensus 126 ~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~ 205 (563) ++.+...+.|+-|.+.-+| ++|-..+..+||+.+..|+.-.........|....-....+.....+..+-+|.+.. T Consensus 139 ~~~~~~~i~G~a~~~vy~d----ed~~~~i~~~~p~~~~~vyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~ 214 (511) T protein:vir:99 139 SLGLDLSIYGKAYELMIRN----QDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHG 214 (511) T ss_pred HHHHHHHhcCeeEEEEEeC----CCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCc Confidence 9999999999999987776 356788999999999888532221111112221111000111111111122454433 Q ss_pred ccccccccccCcCccccCcceee--ecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHH-HHHHHHHHhcCccce Q lcl|NC_015287. 206 TQGKNALGAIGQTPFSTKQRPVK--IAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIE-DSLVIYRLSRAPERR 282 (563) Q Consensus 206 ~~~~~~~g~~g~~~~~~~~~~ik--i~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~E-DalVIyRi~RAPeRR 282 (563) .......+..+............ +..=+|++. .|+....|-++..+.....|..+- +....-+-++.|-+. T Consensus 215 i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~------~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv 288 (511) T protein:vir:99 215 VYRYLTSRTNGLKLTPRENGFESHSFERMPITEF------SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLL 288 (511) T ss_pred EEEEEecCCccccccccccccccCCCCccceEEe------cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhh Confidence 32221111110000000000000 000122221 122233455555555544443322 222222334444443 Q ss_pred EEE---ccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhccccc--------CCCCccceeecCC Q lcl|NC_015287. 283 IFY---IDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRR--------EGGRGTEITTLPG 351 (563) Q Consensus 283 vFy---IDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRR--------eGgrgTEIsTLpG 351 (563) +.- .|.+.+++.+. ++ ..|++.. ..+.|..+..|-. T Consensus 289 ~~G~~~~~~~~~~~~~~------------~~---------------------~~~~~~~~~~~~~~~~~~~~~d~~~l~~ 335 (511) T protein:vir:99 289 IKGNLNLDPVEVRKQKE------------AN---------------------VLFLEPTVYADSEGRETEGSVDGGYIYK 335 (511) T ss_pred hccCcccCchhhccccc------------cc---------------------ceecccccccccccccCCCCcceeEEee Confidence 322 11111111000 00 1222221 1122344555544 Q ss_pred CCCcchHH-HHHHHHHHHHHhcCCChhhccCCCcceeccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015287. 352 GQNLGELS-DIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSE--ILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKN 428 (563) Q Consensus 352 g~nLgei~-DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~e--ItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkg 428 (563) ..+...+. -+.-+.+.+|+...+|---.. +|. |..|. |..-+.....-+.+.++.|..-+.+.++.=+-+-+ T Consensus 336 ~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~----~~~-gn~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~ 410 (511) T protein:vir:99 336 QYDVQGTEAYKDRLNSDIHMFTNTPNMKDD----NFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILK 410 (511) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCcccccc----ccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44433332 356667778888888853221 232 22222 22222223344556666666666666654222212 Q ss_pred C----CChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHH Q lcl|NC_015287. 429 I----VTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQME 504 (563) Q Consensus 429 I----~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie 504 (563) + -...+|. .+.+.|....--.+ .+.++++..+. | .+|.+++++.+-..+| .+++.++|+ T Consensus 411 ~~~~~~~~~~~~----~i~i~f~~~~p~n~-------~e~~~~~~kl~---G-iiS~et~l~~l~~v~D--~~~E~~ri~ 473 (511) T protein:vir:99 411 NTRSIDVSKDFN----TVRYVYNRNLPKSL-------IEELKAYIDSG---G-KISQTTLMSLFSFFQD--PELEVKKIE 473 (511) T ss_pred hcCCcccccccc----cceEEeCCCCCcCH-------HHHHHHHHHHh---c-cCCHHHHHHhCCCCCC--HHHHHHHHH Confidence 2 2333443 46777764322222 23444555553 4 4899999998654443 344444555 Q ss_pred HHhcCCCC-CCCcccCccccccccccCCCCcCCCCCcccccccCCCCccc Q lcl|NC_015287. 505 KEIKDGKL-ADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQ 553 (563) Q Consensus 505 ~E~~~~~~-~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~ 553 (563) +|..+..- .......++++ +.+..++.....+...+ | T Consensus 474 ~E~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~d~~-e 511 (511) T protein:vir:99 474 EDEKESIKKAQKNMYQDPRN-----------INDDEQDDSTKDSIDKK-E 511 (511) T ss_pred HHHHHHHHHHhhcccccCCC-----------CCCCCCCCCCcCccccc-C Confidence 55432110 00000000000 00001111111111111 1 No 47 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=98.51 E-value=3.9e-07 Score=55.70 Aligned_cols=442 Identities=12% Similarity=0.100 Sum_probs=192.1 Q ss_pred ccccccccch----h---hhHHHHHHHHHHHh-------------------hccchhHHHHHhhcceEEecCCCcEEEEE Q lcl|NC_015287. 40 YGQYVDIDGV----F---KSEFELIKRYREMA-------------------LHPEVDSAIEDIINEAIVSDQNDSPVEID 93 (563) Q Consensus 40 ~~~~~~~~~~----~---~~~~~LI~~YR~ma-------------------~~PEvd~Ai~eIvneaiv~d~~~~pV~l~ 93 (563) ....-+.-.. . ..+..+++.|=.-- .+.=+.-+|+..++=. +. +.+. T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l-~~----~g~~-- 73 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL-DI----EGFR-- 73 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHhhh-cc----Ccee-- Confidence 0000010000 0 01111122221100 0111111222221100 00 0111 Q ss_pred eeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeec--CCCCCcceeEEEEeCCccceeeeeeec Q lcl|NC_015287. 94 LENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVID--IKAPEEGIKEVRYIDPLKIKLVRKLKT 171 (563) Q Consensus 94 l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD--~~~~k~GI~eLr~lDP~~i~~vR~~~~ 171 (563) .++ -+...+....|++..+|+....++++.-.+.||-|++.-.. .....+|-.-++.++|+.+-.+.--.. T Consensus 74 -----~~~--d~~~~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~i~~~~p~~~~~i~D~~~ 146 (480) T protein:vir:78 74 -----ISE--DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRN 146 (480) T ss_pred -----cCC--CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeEEEEEcccceEEEEcCCC Confidence 111 11234556777888899999999999999999988763211 111356788899999999877642111 Q ss_pred cCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeeccc--c-eeeeecceecCCCCc Q lcl|NC_015287. 172 DNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPD--A-ISFCHSGLVDRNKQT 248 (563) Q Consensus 172 ~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~d--a-I~y~hSGl~d~~~~~ 248 (563) +......+.+.+.. .....+..+.+|.+.........+..... .......+.-+.. + |.|++.--.+. .. T Consensus 147 ~~~~~~~i~~~~~~----d~~~~~~~~~~y~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~g~vPvv~f~n~~~~~~--~~ 219 (480) T protein:vir:78 147 TRRVTRAVRLYTTR----DDVAVPDRATLYLPDETVPLRRNGGLNDQ-WVVDGDVIKHGLGVVPVVPLTNDPRLGN--RY 219 (480) T ss_pred ccceEEEEEEEEee----cCCcceEEEEEEeCCeEEEEEecCCCccc-ccccccccccCCCCcceEEeecccccCC--cc Confidence 11112222221111 01111222333433222222111110000 0000000000000 1 22333211111 12 Q ss_pred cchhHHHHH----HHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccc Q lcl|NC_015287. 249 ILSYLHKSI----KALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDR 324 (563) Q Consensus 249 ~~syL~~Ai----k~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~ 324 (563) +.|=|.+.+ ..+| ++|-+.+++-....-|.|-+.=.+....+..+ .+.+ T Consensus 220 G~sdi~~~i~~l~Da~~--~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~---------------------~~~~---- 272 (480) T protein:vir:78 220 GRSEISPELRKVTDAAS--RTLMNLQSASQILGTPLRVISGVTTDELTNDG---------------------ENTT---- 272 (480) T ss_pred CccchhHHHHHHHHHHH--HHHHHHHHHHHhhcchhhhhhCCCcccccccc---------------------ccch---- Confidence 344454433 3333 35667777777777787766422222111100 0000 Q ss_pred cccchhhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHHH Q lcl|NC_015287. 325 KHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTK 403 (563) Q Consensus 325 ~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~K 403 (563) -...+-...|++ +-+.++-++++.. +.- ++-++-....++..-++|..-|... +-|-..+..|.--+...-. T Consensus 273 ~~~~~~~~~~~~----~~~~~~~~~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~fg~~--~~n~~Sg~Al~~~~~~l~~ 345 (480) T protein:vir:78 273 LDIYYGRILTLA----SEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSS--SENPASAEAIIATDSRIVK 345 (480) T ss_pred hhhhhhhhccCC----CCCceEEecCccC-HHHHHHHHHHHHHHHhcccCCCHHHhccc--cCchhHHHHHHHHHHHHHH Confidence 000111223443 2346777777653 332 3335566666777778887777422 1111122234444455677 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHH Q lcl|NC_015287. 404 FVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDY 483 (563) Q Consensus 404 fI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~ 483 (563) -+.+.|+.|..-+.+.++.-+.+.|.--..+|. .|.+.|..-..-+. .+.++.+.++-.-.+-.+|.++ T Consensus 346 k~~~~~~~f~~~l~~~~rl~~~~~~~~~~~~~~----~i~v~w~~~~~~s~-------~~~ad~~~kl~~~g~~~~s~et 414 (480) T protein:vir:78 346 MAERKGRIFGGAWERAMRIAMQIMGREVTEEYT----RLETVWRDPSTPTV-------AAKADAVSKLYANGQGPIPKEQ 414 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCCccccce----eeeEEecCCCCCCH-------HHHHHHHHHHHHhcccCCCHHH Confidence 789999999999999999887777754334443 46788854322222 2455556655544345678777 Q ss_pred HHHHHhccChhhHHHHHHHHHHHhcC--CCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccCCCCCCCC Q lcl|NC_015287. 484 VRRKILKQKDEEVIEIDKQMEKEIKD--GKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAPQMPEGG 561 (563) Q Consensus 484 i~k~Il~~tdeei~e~~~qie~E~~~--~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~ 561 (563) ++. +|.+++++++++++..+++... +.+..|...+.++ ..+|..++.+.+++.++... --.++. T Consensus 415 ~~~-~lg~~~d~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~--~~~~~~ 480 (480) T protein:vir:78 415 ARI-DLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADA-----------TPKPTVTETKTETQTSPSGF--NRTKTR 480 (480) T ss_pred HHh-cCCCCHhHHHHHHHHHHHHHHHHHHHhhccccCCCcc-----------ccCCCCCCCCCccCCCcccC--CCcCCC Confidence 665 5689998888877554444321 1111221111111 11222222222222221110 011112 No 48 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=98.47 E-value=5.2e-07 Score=55.04 Aligned_cols=445 Identities=11% Similarity=0.121 Sum_probs=193.3 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHh---------------- Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMA---------------- 64 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma---------------- 64 (563) |-+-++--++..-.+.- -.+.+-...+. ..+.+ -.+..+.|++++.+. T Consensus 1 m~~~~~~~~~~~~~~~~-~~~~~~~~~~~-~~~~~--------------~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~ 64 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMG-LLKALKDVKDH-KKVNA--------------NDEDYKYIDMWKRLYQGHYAEWHNLNYEHNG 64 (496) T ss_pred ChhHHHHHHHHHHHHhc-cchhhHHHHhc-CCCcC--------------CHHHHHHHHHHHHHhcCCCchhhcchhccCC Confidence 22211111222211110 00111000000 00000 011112222222221 Q ss_pred --------hccchhHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhh Q lcl|NC_015287. 65 --------LHPEVDSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGR 136 (563) Q Consensus 65 --------~~PEvd~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGr 136 (563) ..+-...+++..++ .+ -..||.+.+++ +...+.++.|++-.+|.+...+++....+.|. T Consensus 65 ~~~~~~~~~~n~~k~i~~~~a~--~l---~~~p~~i~~~d--------~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~ 131 (496) T protein:vir:38 65 NPVNRRQLSMNLPKVTAKYMSK--LL---FNEKVKINIDD--------KAAEEFVLNVLKTNGFTKNMERYIEYGEAMGG 131 (496) T ss_pred CccccceeecchHHHHHHHHhh--hh---hCCcceEeeCC--------hHHHHHHHHHHhccCHHHHHHHHHHHHhhhCc Confidence 11222223333332 11 24577777765 35566778888888999999999999999999 Q ss_pred hheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeeccccccccccccc-----------ceeeecCcc Q lcl|NC_015287. 137 IYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIE-----------EYYQYDPSQ 205 (563) Q Consensus 137 i~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~-----------eyf~y~~~~ 205 (563) .|+|..+|. .|-+.+.+++|.++-+++.--. .....+ .+..-...+..|--++ +|.+|.... T Consensus 132 ~~~~~~~D~----~~~~~i~~v~~~~~~P~~~~~~--~~~~~~-f~~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~ 204 (496) T protein:vir:38 132 FVIKVYHDG----NKNVKVSFATADCMYPLSNDSE--NVDECV-IANSFHKNNKYYTLLEWNEWQGDVYTVTTELYQSDD 204 (496) T ss_pred EEEEEEEcC----CCcEEEEEEcccceEEEEecCC--cEEEEE-EEEEEEeCCeEEEEEEEEEEeCceEEEEEEEEecCC Confidence 999999985 3557799999999887743211 111111 1110001111111111 111121100 Q ss_pred ccccccccccCcCccccCcc-eee---ecccceeeeecc---eecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015287. 206 TQGKNALGAIGQTPFSTKQR-PVK---IAPDAISFCHSG---LVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRA 278 (563) Q Consensus 206 ~~~~~~~g~~g~~~~~~~~~-~ik---i~~daI~y~hSG---l~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RA 278 (563) ....+....-.....+.. ... ++.-.++|---- -.+.....++|-|+.++.....|-..=..+ .+-+|. T Consensus 205 --~~~~g~~v~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~--~~~~~~ 280 (496) T protein:vir:38 205 --PNELGTKVSLTLLFDDIEPVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSY--YQEFKL 280 (496) T ss_pred --ccccCccccccccccccccceeecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHH--HHHHhh Confidence 000000000000000000 000 011112111000 012233456788999888877776554443 366777 Q ss_pred ccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhccccc-CCCCccceeecCCCCCcch Q lcl|NC_015287. 279 PERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRR-EGGRGTEITTLPGGQNLGE 357 (563) Q Consensus 279 PeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRR-eGgrgTEIsTLpGg~nLge 357 (563) -.+|+|. +.. +++ . ..|..++... -++.-.+.|....- +++.|.-|+++.+.-.-.+ T Consensus 281 ~~~~i~v-~~~---------~l~----~-----~~~~~g~~~~---~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~ 338 (496) T protein:vir:38 281 GKKKVLV-PSS---------FVK----T-----AVNLDGSTTQ---YFDSTDEAFFLYQGDQDDNGKAIKDISVEIRSTE 338 (496) T ss_pred cccceec-chH---------Hhh----c-----cCCCCCcccc---CCCCccceEEEeecCCCcccccceeeccccCHHH Confidence 6666664 211 110 0 0112222111 01111111111110 1222233565555322111 Q ss_pred -HHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHhcCC Q lcl|NC_015287. 358 -LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKT-------QLILKNI 429 (563) Q Consensus 358 -i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~-------qLiLkgI 429 (563) ..-++.+.+.+....++|-+-|..+++|. -.+++|.-....-..-+.+.++.|...+.++++. .+.++|. T Consensus 339 ~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~--~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~ 416 (496) T protein:vir:38 339 FIESINAMLRIYAMQVGLSAGTFTFDENGL--KTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKFIEAYSGE 416 (496) T ss_pred HHHHHHHHHHHHHHhhCCChhhcCCCcccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 23467777889999999999886443221 1245654443333333444444444444444433 3445553 Q ss_pred CChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcC Q lcl|NC_015287. 430 VTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKD 509 (563) Q Consensus 430 ~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~ 509 (563) . |+ ...+.++|.. +..... . +.++.+.++-. .| .+|.++++++....+|+|.+++-++|++|... T Consensus 417 ~----~~--~~~i~v~f~d-~i~~d~--~----~~~~~~~~~~~-~G-iiS~et~l~~~~~~~d~ea~~el~ri~~E~~~ 481 (496) T protein:vir:38 417 V----VE--LDTITVDFDD-SIAQDE--D----TTINRYTNAKN-QG-MIPLKIALQRAWNITEAEADEWAEMLAKEKQA 481 (496) T ss_pred C----CC--ccceEEEeCC-CCCCCH--H----HHHHHHHHHHh-cC-CCCHHHHHHhcCCCChHHHHHHHHHHHHhhhc Confidence 3 22 2457888874 333211 1 12222222211 14 58999998888889999999888888888643 Q ss_pred CCCCCCcccCccccccccccCCCCcC Q lcl|NC_015287. 510 GKLADPMEVQQLQLGVHPEQMPGGAM 535 (563) Q Consensus 510 ~~~~~p~~~~~~~~~~~~~~~~~~~~ 535 (563) . . | +++. +...+.++ T Consensus 482 ~-~--~----~~d~----~~~~~~~e 496 (496) T protein:vir:38 482 E-M--P----NNDM----NGIFGEEE 496 (496) T ss_pred c-C--c----cccc----cCCCCCCC Confidence 1 1 1 1111 11111111 No 49 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=98.47 E-value=5.2e-07 Score=55.02 Aligned_cols=451 Identities=15% Similarity=0.185 Sum_probs=188.3 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCc-ccce------eccccccccccccchhhhHHHHHHHHHHHhhccchhHHH Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDS-SDYY------VSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAI 73 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~-~~~~------~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai 73 (563) +++.|-.+..-+..-.++..+- -.|+ ++.. ..+||-| +-.--.|||+||.++++ T Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~l~~~~~~~F~G---------------y~~la~laQ~~eyr~~~ 129 (695) T protein:vir:36 69 LARQFEVDVSNYTPRERRAASY----ALDFNGTSMDALSFVTSSGFPG---------------FPTLVLLAQLPEYRAMH 129 (695) T ss_pred cceeceecccccCccccchhhh----hhcccccccccchhhhccCcch---------------HHHHHHHhhccchhhHH Confidence 4444443333222222221111 0010 0000 0111111 12234689999999999 Q ss_pred HHhhcceE------EecCCCcE----EEEEeeccccch-HHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeee Q lcl|NC_015287. 74 EDIINEAI------VSDQNDSP----VEIDLENLPASK-QLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKV 142 (563) Q Consensus 74 ~eIvneai------v~d~~~~p----V~l~l~~~~~s~-sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkv 142 (563) +-|..||+ ++...++. +.+.-+..+.++ .-.++|..|++. |+...+.-+.++-=-+.|.=.-... T Consensus 130 ~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~er----L~V~~~l~eaik~aRlfGGa~~~i~ 205 (695) T protein:vir:36 130 EVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIER----LRIRDAVRTTVIHDQAFGRAHPYFK 205 (695) T ss_pred HHHHHHhhcccceecccchhhhhhccccccccccccCchHHHHHHHHHHHH----HHHHHHHHHHHHhhccccceEEEEE Confidence 99999993 33322221 344333333333 345577777763 3334444444444344444221123 Q ss_pred ecCC--------------CCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccc Q lcl|NC_015287. 143 IDIK--------------APEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQG 208 (563) Q Consensus 143 iD~~--------------~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~ 208 (563) |+-. -.|.+++.|+.|||..+-+= . .+. .++..-.|-.-..|++.. T Consensus 206 i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~-------~-~n~------~dP~spdfgkP~~y~V~G------ 265 (695) T protein:vir:36 206 IKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPN-------N-YNS------INPVADDFYKPSTWWMIG------ 265 (695) T ss_pred eccCccccccccccccccccCcceeeeEeecccccccc-------h-hhh------ccchhhccCCCceEEEec------ Confidence 3322 23457788999997543331 1 110 011111222222233311 Q ss_pred cccccccCcCccccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHH-HHHHHHHHHHHHHhcCccceEEEcc Q lcl|NC_015287. 209 KNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQ-LRMIEDSLVIYRLSRAPERRIFYID 287 (563) Q Consensus 209 ~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~Nq-L~m~EDalVIyRi~RAPeRRvFyID 287 (563) . ..|.|+.+.+...++.+. |-......+.|.+..+..-..+ +++...+.=+- .++.-+ ++-.| T Consensus 266 --------~--kIH~SRL~~f~g~plPd~---LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li--~~~~v~-~lk~d 329 (695) T protein:vir:36 266 --------T--EVHATRLHTIVSRPVGDM---LKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIV--KQFSVS-GILMD 329 (695) T ss_pred --------e--EEeeeeEEEecCCCchhh---hhcccccCcccHHHHHHHHHHHHHHHHhHHHHHH--HhhhHH-HHHHH Confidence 0 133344444443333222 1111222345666665544333 34333332221 111111 00112 Q ss_pred CC-CCchHHHHHHH--HHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHH-H Q lcl|NC_015287. 288 VG-NLPKMKAEQYL--KEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIE-Y 363 (563) Q Consensus 288 vG-nlpk~KAeqYl--~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~-Y 363 (563) .. -|.....++.. -+++++||.- -|-+-=|+ =.|||- +. ..+|+-++||. = T Consensus 330 la~aL~~g~~~~l~~R~eli~~~Rsn------~G~~llDk----~~Eefe-------------q~--stslSGLddVi~q 384 (695) T protein:vir:36 330 LAQALMPGANVDLSMRAELINRYRDN------RNILFLDK----ATEEFF-------------QF--NTPLSGLDALQAQ 384 (695) T ss_pred HHHhhcChhHHHHHHHHHHHHHhcCc------cceEEEec----CCcceE-------------EE--ecccCCHHHHHHH Confidence 10 00111112222 2455666521 12110000 013342 11 24788899975 4 Q ss_pred HHHHHHHhcCCChhhccC-CCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHhcCCCChhHHHH Q lcl|NC_015287. 364 FQKKLYRSLGVPESRIAG-SGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKT-----QLILKNIVTPEDWEV 437 (563) Q Consensus 364 F~~kLy~aL~VP~sRl~~-~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~-----qLiLkgI~t~eew~~ 437 (563) |..-+=-+.+||+.||=+ ...|||--.-+ |.-.|...|..+|. ..+...|++ |+-+-|.+. T Consensus 385 f~q~VAgaa~IPltkLfGqSPkGlNATGE~----D~rnYYD~I~s~Qe---~~L~p~L~rl~~ii~rS~~G~id------ 451 (695) T protein:vir:36 385 AQEQMSAVSHIPLIKLLGITPTGLNASSEG----EIRVWYDYVRAYQR---NALQQLMNDVIVMIQLSLFGAVD------ 451 (695) T ss_pred HHHHHHhhhcCchhhhhccCcccccccchh----hHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHhcCCCC------ Confidence 888888899999999943 34477653222 23458899988875 334444433 444445443 Q ss_pred hhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhcc------------------ChhhHHHH Q lcl|NC_015287. 438 LGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQ------------------KDEEVIEI 499 (563) Q Consensus 438 ~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~------------------tdeei~e~ 499 (563) ..|.|+|+.=..-+|..-+||...+.+....+-.- | -++.+-|+..+... +|.||.-+ T Consensus 452 --pdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~-g-vI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~ 527 (695) T protein:vir:36 452 --PSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQE-Q-VIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGV 527 (695) T ss_pred --CcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHh-c-CCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhh Confidence 45889999777778888888888887765433211 1 12222222221111 01111100 Q ss_pred HHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccCCCC-CC-------CCCC Q lcl|NC_015287. 500 DKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAPQM-PE-------GGEI 563 (563) Q Consensus 500 ~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~-~~-------~~~~ 563 (563) ...- + + ..+.++.++..+...+.--.|.-.++....+++.++..+++ .- +|.| T Consensus 528 ~~~~-----~-----~-~~~~~~~~~~~~~~~g~~~~~~v~~~~~~~~~~~ag~~~~~~~aag~v~~~~g~v 588 (695) T protein:vir:36 528 LTYV-----Q-----R-LAEGGDTGAPGGARAGATAPPTVANVNANVNPREAGAQDAAMRAAGAVYVVDGKV 588 (695) T ss_pred Hhhh-----c-----C-cccccccCCCCcccccccCCCcccccccccCccccCCCCccceeeEEEEEeCCEE Confidence 0000 0 0 01111111111212222222222333333344333322222 21 2222 No 50 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=98.46 E-value=5.8e-07 Score=54.78 Aligned_cols=448 Identities=12% Similarity=0.061 Sum_probs=181.9 Q ss_pred hhHhhhcccccccccCcCCCCcccceeccccccccccccchhhh-HHHHHHHHHHHhhccchhHH--------------- Q lcl|NC_015287. 9 IQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKS-EFELIKRYREMALHPEVDSA--------------- 72 (563) Q Consensus 9 ~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~-~~~LI~~YR~ma~~PEvd~A--------------- 72 (563) .+.+ .|.....-+.+..|.++-......-+..- .+..-+.. ..+.+.+|+.+..+.+-... T Consensus 1 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~ 77 (481) T protein:vir:10 1 MTVY-TINNINTKFSPLANDDFVVSDLAELLKEE--NLRNFISRHQTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDK 77 (481) T ss_pred CeeE-eeehhchhcccccCceeeeecchhhcCHH--HHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCcccccccccc Confidence 0000 11111112222223322111000000000 00000010 12233344444333322210 Q ss_pred ---------HHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeee Q lcl|NC_015287. 73 ---------IEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVI 143 (563) Q Consensus 73 ---------i~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkvi 143 (563) ...||+ ..+.=-...||.+..++ +...+.+..+++-.+|+....++.+..++.|+-|.|.-+ T Consensus 78 ~~~ki~~n~~~~ivd-~~~~~l~g~~~~~~~~d--------~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~ 148 (481) T protein:vir:10 78 ADHRAVHNYAKYVSR-FIVGYLTGNPITITHQD--------NQTNDKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYR 148 (481) T ss_pred ccceeecchHHHHHH-HHHhhhccCCceEecCC--------hhHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEe Confidence 111221 11111123455555543 244556777888889999999999999999999998777 Q ss_pred cCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCc--Cccc Q lcl|NC_015287. 144 DIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQ--TPFS 221 (563) Q Consensus 144 D~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~--~~~~ 221 (563) | ++|-..+..+||+++..|..-.........+.+.+... .....+..+-+|.+.........+.... .... T Consensus 149 d----~dg~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~---~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~ 221 (481) T protein:vir:10 149 D----FEDRDTFKVLDPKSTFVVYDQTLDKKVVAGVRYFEKQD---KDKVPVQHVEVYTTDKIYYIEIKGGTYHRVEEVE 221 (481) T ss_pred C----CCCeEEEEEEcccceEEEEcCCCCCceEEEEEEEEEee---CCCceEEEEEEEecCeEEEEEecCCceeeccccc Confidence 6 35788899999999887732221111111222211110 0011122222443333222111110000 0000 Q ss_pred cCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHH Q lcl|NC_015287. 222 TKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYL 300 (563) Q Consensus 222 ~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~-m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl 300 (563) +. +..=+|++.. |.....|-++..+....-+. ++=+....-+.++.|-+-+. |..+. T Consensus 222 ~~-----~g~vPvv~~~------n~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~----g~~~~------- 279 (481) T protein:vir:10 222 HY-----YNDVPIIEYL------NDQFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAII----GNVDL------- 279 (481) T ss_pred cc-----CCceeEEEee------cCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEee----cCcCC------- Confidence 00 1111232211 22223455554444433332 22333334444555544332 11110 Q ss_pred HHHHHhhcceeEeeCCCCc-ccccccccchhhhhcccccCCCCccceeecCCCCCcchHH-HHHHHHHHHHHhcCCChhh Q lcl|NC_015287. 301 KEVMNRYRNKLVYDASTGE-IRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELS-DIEYFQKKLYRSLGVPESR 378 (563) Q Consensus 301 ~~im~k~knklvYd~~TGe-v~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~-DV~YF~~kLy~aL~VP~sR 378 (563) |..+|. ++.++........-+.+ .+.+.++..|....+.+.+. -+.-.++.+|....+|--- T Consensus 280 -------------~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~ 343 (481) T protein:vir:10 280 -------------DSEDAKAFRDANMIHLEPGTNANG---SEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDLN 343 (481) T ss_pred -------------CccchhhhhhccceeccccccccC---CCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccc Confidence 111111 11111111111111111 12223455554444443333 3566677788888888432 Q ss_pred ccCCCcceeccccch-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHH Q lcl|NC_015287. 379 IAGSGDGFNLGRSSE-ILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKE 457 (563) Q Consensus 379 l~~~~~g~~~g~~~e-ItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe 457 (563) +. .+....|++ |.........-+.+.|..|...+.++++.=+-+-++--..+++ ...+.+.|.....-.+. T Consensus 344 ~~----~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~--~~~i~v~f~~~~~~~~~-- 415 (481) T protein:vir:10 344 DE----QFSGVQSGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLKQHN--YAELTITFTPNLPKSMM-- 415 (481) T ss_pred cc----ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccc--cceeeEEeCCCCCcCHH-- Confidence 21 111112222 2222333556678888888888888776543332333222333 24578888755444443 Q ss_pred HHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccCh--hhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcC Q lcl|NC_015287. 458 TELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKD--EEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAM 535 (563) Q Consensus 458 ~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~td--eei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~ 535 (563) +.++++.++. | .+|.+++++.+-..+| +|++.++++-+++.+.... . ...+.++ .+++. T Consensus 416 -----~~a~~~~kl~---g-~is~et~~~~l~~i~d~~~E~~ri~~E~~~~~~~~~~--~---~~~~~~~-----~~~~~ 476 (481) T protein:vir:10 416 -----ESINAFNALS---G-GVSESTRLSLLDFIDNPKEELEKMQEEEAQREKQADK--R---GYGEAFE-----NHLNV 476 (481) T ss_pred -----HHHHHHHHHh---c-cCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhhh--c---cCCccCC-----CCCCC Confidence 3444455553 4 4899999987765443 4555444433333221100 0 0000000 00000 Q ss_pred CCCCc Q lcl|NC_015287. 536 NPQDP 540 (563) Q Consensus 536 ~p~~~ 540 (563) +..++ T Consensus 477 dd~~g 481 (481) T protein:vir:10 477 DDSNG 481 (481) T ss_pred CCCCC Confidence 00000 No 51 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=98.45 E-value=6.1e-07 Score=54.64 Aligned_cols=460 Identities=10% Similarity=0.041 Sum_probs=179.8 Q ss_pred CcccccchhhHhhhcccccc--cccCcCCCCcccceeccccccccccccc---hhhhHHHHHH--------HHHHHhhcc Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVV--SPVPQSNEDSSDYYVSSGFYGQYVDIDG---VFKSEFELIK--------RYREMALHP 67 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~--s~~p~~~~d~~~~~~~~~~~~~~~~~~~---~~~~~~~LI~--------~YR~ma~~P 67 (563) |.+.=.|. +.....+.+ .+=.. +.+.|. ++..... ...-..++|+ +|+.+..+. T Consensus 1 ~~~~~~~~---~~~~~~~~~~~~~~~~---------~n~~~~-~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy 67 (511) T protein:vir:96 1 MLKVNEFE---TDTDLRGNINYLFNDE---------ANVVYT-YDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYY 67 (511) T ss_pred Cccccchh---hhhhhhhhhhhhhhhh---------hCCccc-ccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHh Confidence 44444441 111111111 01011 122221 1111100 1111122222 344444333 Q ss_pred chhHHH----------------------HHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhH Q lcl|NC_015287. 68 EVDSAI----------------------EDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCH 125 (563) Q Consensus 68 Evd~Ai----------------------~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~ 125 (563) +=...| .-||+- .+.=--..||.+..++ +...+.+..|++..+|+..-. T Consensus 68 ~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~-~~~yl~g~p~~~~~~d--------~~~~~~l~~~~~~n~~~~~~~ 138 (511) T protein:vir:96 68 EGKTKNLVELTRRKEEYMADNRVAHDYASYISDF-INGYFLGNPIQYQDDD--------KDVLEAIEAFNDLNDVESHNR 138 (511) T ss_pred hccCccccccCcccccccCcceeecchHHHHHHH-HhhhhcccCceeecCc--------hHHHHHHHHHHhhcChhHHHH Confidence 222111 122221 1111234566666553 235567888888889999999 Q ss_pred HHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCcc Q lcl|NC_015287. 126 EILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQ 205 (563) Q Consensus 126 ~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~ 205 (563) ++.+.+++.|+-|.+.-+| ++|-..+..+||+.+..|..-.........+.+.......+.....+..+-+|.+.. T Consensus 139 ~~~~~~~~~G~a~~~vy~d----~dg~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~ 214 (511) T protein:vir:96 139 SLGLDLSIYGKAYELMIRN----QDDETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHG 214 (511) T ss_pred HHHHHHHhcCeeEEEEEeC----CCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCc Confidence 9999999999999887666 357788999999999888532222111122222111111111111122223555544 Q ss_pred ccccccccccCcCccccC-cceee--ecccceeeeecceecCCCCccchhHHHHHHHHHHHHHH-HHHHHHHHHhcCccc Q lcl|NC_015287. 206 TQGKNALGAIGQTPFSTK-QRPVK--IAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMI-EDSLVIYRLSRAPER 281 (563) Q Consensus 206 ~~~~~~~g~~g~~~~~~~-~~~ik--i~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~-EDalVIyRi~RAPeR 281 (563) .......+..+.. .... ..... +..=+|++. +|.....|=++..+.....+..+ -+....-+.++.|-+ T Consensus 215 i~~~~~~~~~~~~-~~~~~~~~~~~~~g~vPvv~~------~n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~l 287 (511) T protein:vir:96 215 VYRYLTNRTNGLK-LTPRENSFESHSFERMPITEF------SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAML 287 (511) T ss_pred EEEEEecCCCccc-ccccccccccCcCcccceEEe------cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchh Confidence 3322111110000 0000 00000 001122221 11222345566555544443322 122222233444444 Q ss_pred eEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccc-cccccchhhh---hcccccCCCCccceeecCCCCCcch Q lcl|NC_015287. 282 RIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRD-DRKHMSMLED---FWLPRREGGRGTEITTLPGGQNLGE 357 (563) Q Consensus 282 RvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d-~~~~msMlED---ywLpRReGgrgTEIsTLpGg~nLge 357 (563) -+.-....+ .++++. ....+-.+.. +.......+.|..+..|-...+... T Consensus 288 v~~G~~~~~--------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 341 (511) T protein:vir:96 288 LIKGNLNLD--------------------------PVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQG 341 (511) T ss_pred heecCccCC--------------------------chhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHH Confidence 332211110 001110 0011100000 0011112233445555554444433 Q ss_pred H-HHHHHHHHHHHHhcCCChhhccCCCcceeccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC----CC Q lcl|NC_015287. 358 L-SDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSE--ILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKN----IV 430 (563) Q Consensus 358 i-~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~e--ItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkg----I~ 430 (563) + .-+.-+.+.+|.-.++|---. + .|. |..|. |..-......-+.+.++.|..-+.+.++.=+-+-+ +- T Consensus 342 ~e~~~~~L~~~I~~~s~~P~~~~--~--~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~ 416 (511) T protein:vir:96 342 TEAYKDRLNSDIHMFTNTPNMKD--D--NFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSID 416 (511) T ss_pred HHHHHHHHHHHHHHHhCCccccc--c--ccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Confidence 2 344556777888888885322 1 222 22222 22112222344555566666666665554222222 22 Q ss_pred ChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCC Q lcl|NC_015287. 431 TPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDG 510 (563) Q Consensus 431 t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~ 510 (563) ...+| ..+.+.|...---.+ .+.++++..+. | .+|.++++..+-..+ +.+++.++|++|.+.. T Consensus 417 ~~~~~----~~i~~~f~~~~p~n~-------~e~~d~~~kl~---G-~iS~et~l~~l~~v~--d~~~El~ri~~E~~~~ 479 (511) T protein:vir:96 417 ANKDF----NTVRYVYNRNLPKSL-------IEELKAYIDSG---G-KISQTTLMSLFSFFQ--DPELEVKKIEEDEKES 479 (511) T ss_pred ccccc----ccceEEeCCCCCcCH-------HHHHHHHHHHh---c-cCChHHHHHhCCCCC--CHHHHHHHHHHHHHHH Confidence 33334 356778865332222 23455555553 4 489999998754333 3444445555554321 Q ss_pred C-CCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccC Q lcl|NC_015287. 511 K-LADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQA 554 (563) Q Consensus 511 ~-~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~ 554 (563) . .........++. . .+.+......+.+.++. T Consensus 480 ~~~~~~~~~~~~~~------~-------~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:96 480 IKKAQKGIYKDPRD------I-------NDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHHhhccccCCCC------C-------CCCCCCCCccCcccccC Confidence 1 000000000000 0 00000111111111111 No 52 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=98.45 E-value=6.1e-07 Score=54.64 Aligned_cols=460 Identities=10% Similarity=0.041 Sum_probs=179.8 Q ss_pred CcccccchhhHhhhcccccc--cccCcCCCCcccceeccccccccccccc---hhhhHHHHHH--------HHHHHhhcc Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVV--SPVPQSNEDSSDYYVSSGFYGQYVDIDG---VFKSEFELIK--------RYREMALHP 67 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~--s~~p~~~~d~~~~~~~~~~~~~~~~~~~---~~~~~~~LI~--------~YR~ma~~P 67 (563) |.+.=.|. +.....+.+ .+=.. +.+.|. ++..... ...-..++|+ +|+.+..+. T Consensus 1 ~~~~~~~~---~~~~~~~~~~~~~~~~---------~n~~~~-~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy 67 (511) T protein:vir:78 1 MLKVNEFE---TDTDLRGNINYLFNDE---------ANVVYT-YDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYY 67 (511) T ss_pred Cccccchh---hhhhhhhhhhhhhhhh---------hCCccc-ccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHh Confidence 44444441 111111111 01011 122221 1111100 1111122222 344444333 Q ss_pred chhHHH----------------------HHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhH Q lcl|NC_015287. 68 EVDSAI----------------------EDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCH 125 (563) Q Consensus 68 Evd~Ai----------------------~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~ 125 (563) +=...| .-||+- .+.=--..||.+..++ +...+.+..|++..+|+..-. T Consensus 68 ~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~-~~~yl~g~p~~~~~~d--------~~~~~~l~~~~~~n~~~~~~~ 138 (511) T protein:vir:78 68 EGKTKNLVELTRRKEEYMADNRVAHDYASYISDF-INGYFLGNPIQYQDDD--------KDVLEAIEAFNDLNDVESHNR 138 (511) T ss_pred hccCccccccCcccccccCcceeecchHHHHHHH-HhhhhcccCceeecCc--------hHHHHHHHHHHhhcChhHHHH Confidence 222111 122221 1111234566666553 235567888888889999999 Q ss_pred HHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCcc Q lcl|NC_015287. 126 EILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQ 205 (563) Q Consensus 126 ~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~ 205 (563) ++.+.+++.|+-|.+.-+| ++|-..+..+||+.+..|..-.........+.+.......+.....+..+-+|.+.. T Consensus 139 ~~~~~~~~~G~a~~~vy~d----~dg~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~ 214 (511) T protein:vir:78 139 SLGLDLSIYGKAYELMIRN----QDDETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHG 214 (511) T ss_pred HHHHHHHhcCeeEEEEEeC----CCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCc Confidence 9999999999999887666 357788999999999888532222111122222111111111111122223555544 Q ss_pred ccccccccccCcCccccC-cceee--ecccceeeeecceecCCCCccchhHHHHHHHHHHHHHH-HHHHHHHHHhcCccc Q lcl|NC_015287. 206 TQGKNALGAIGQTPFSTK-QRPVK--IAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMI-EDSLVIYRLSRAPER 281 (563) Q Consensus 206 ~~~~~~~g~~g~~~~~~~-~~~ik--i~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~-EDalVIyRi~RAPeR 281 (563) .......+..+.. .... ..... +..=+|++. +|.....|=++..+.....+..+ -+....-+.++.|-+ T Consensus 215 i~~~~~~~~~~~~-~~~~~~~~~~~~~g~vPvv~~------~n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~l 287 (511) T protein:vir:78 215 VYRYLTNRTNGLK-LTPRENSFESHSFERMPITEF------SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAML 287 (511) T ss_pred EEEEEecCCCccc-ccccccccccCcCcccceEEe------cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchh Confidence 3322111110000 0000 00000 001122221 11222345566555544443322 122222233444444 Q ss_pred eEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccc-cccccchhhh---hcccccCCCCccceeecCCCCCcch Q lcl|NC_015287. 282 RIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRD-DRKHMSMLED---FWLPRREGGRGTEITTLPGGQNLGE 357 (563) Q Consensus 282 RvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d-~~~~msMlED---ywLpRReGgrgTEIsTLpGg~nLge 357 (563) -+.-....+ .++++. ....+-.+.. +.......+.|..+..|-...+... T Consensus 288 v~~G~~~~~--------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 341 (511) T protein:vir:78 288 LIKGNLNLD--------------------------PVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQG 341 (511) T ss_pred heecCccCC--------------------------chhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHH Confidence 332211110 001110 0011100000 0011112233445555554444433 Q ss_pred H-HHHHHHHHHHHHhcCCChhhccCCCcceeccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC----CC Q lcl|NC_015287. 358 L-SDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSE--ILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKN----IV 430 (563) Q Consensus 358 i-~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~e--ItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkg----I~ 430 (563) + .-+.-+.+.+|.-.++|---. + .|. |..|. |..-......-+.+.++.|..-+.+.++.=+-+-+ +- T Consensus 342 ~e~~~~~L~~~I~~~s~~P~~~~--~--~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~ 416 (511) T protein:vir:78 342 TEAYKDRLNSDIHMFTNTPNMKD--D--NFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSID 416 (511) T ss_pred HHHHHHHHHHHHHHHhCCccccc--c--ccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Confidence 2 344556777888888885322 1 222 22222 22112222344555566666666665554222222 22 Q ss_pred ChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCC Q lcl|NC_015287. 431 TPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDG 510 (563) Q Consensus 431 t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~ 510 (563) ...+| ..+.+.|...---.+ .+.++++..+. | .+|.++++..+-..+ +.+++.++|++|.+.. T Consensus 417 ~~~~~----~~i~~~f~~~~p~n~-------~e~~d~~~kl~---G-~iS~et~l~~l~~v~--d~~~El~ri~~E~~~~ 479 (511) T protein:vir:78 417 ANKDF----NTVRYVYNRNLPKSL-------IEELKAYIDSG---G-KISQTTLMSLFSFFQ--DPELEVKKIEEDEKES 479 (511) T ss_pred ccccc----ccceEEeCCCCCcCH-------HHHHHHHHHHh---c-cCChHHHHHhCCCCC--CHHHHHHHHHHHHHHH Confidence 33334 356778865332222 23455555553 4 489999998754333 3444445555554321 Q ss_pred C-CCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccC Q lcl|NC_015287. 511 K-LADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQA 554 (563) Q Consensus 511 ~-~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~ 554 (563) . .........++. . .+.+......+.+.++. T Consensus 480 ~~~~~~~~~~~~~~------~-------~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:78 480 IKKAQKGIYKDPRD------I-------NDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHHhhccccCCCC------C-------CCCCCCCCccCcccccC Confidence 1 000000000000 0 00000111111111111 No 53 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=98.44 E-value=6.4e-07 Score=54.54 Aligned_cols=455 Identities=13% Similarity=0.085 Sum_probs=186.6 Q ss_pred ccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHH------------ Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSA------------ 72 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~A------------ 72 (563) +.....+.-......++ .+ .+. .+. ... ...+.+|+.+..+.+-... T Consensus 1 ~~~~~~~~~~~~~~~~~---------~~-~i~-~~i-------~~~---~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~ 59 (499) T protein:vir:10 1 MAVVIDKDLLDDVNEPN---------IE-AIN-YAI-------REL---QNRKKRLDKLSDYYNGKQEIEKHEFDNATVE 59 (499) T ss_pred CccchhhhHHhhhhcCC---------HH-HHH-HHH-------HHH---HHHHHHHHHHHHHhccccchhcCCcCcCCCC Confidence 22222222111100000 00 000 000 000 0111222222222222111 Q ss_pred --------HHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeec Q lcl|NC_015287. 73 --------IEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVID 144 (563) Q Consensus 73 --------i~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD 144 (563) ...||+..+ .=-..+||.+..++ +...++|..+++-.+|+....++.+.+++.|+-|.+.-+| T Consensus 60 ~~ki~~n~~~~Iv~~~~-~~l~g~p~~~~~~~--------~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~ 130 (499) T protein:vir:10 60 AANVMVNHAKYITDMNV-GFMTGNPVKYVAEK--------GKNIDDILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLK 130 (499) T ss_pred cceeecchHHHHHHHHh-hhhcccCceeecCC--------hhHHHHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEec Confidence 112222111 11124566665543 2345667888888899999999999999999998887666 Q ss_pred CCCC-------------CcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCcccccccc Q lcl|NC_015287. 145 IKAP-------------EEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNA 211 (563) Q Consensus 145 ~~~~-------------k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~ 211 (563) .+.. ...-..+..+||+.+..|+....+...-..+......... ....+..+-+|.+........ T Consensus 131 ~~g~~~~~~~~~~~~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~--~~~~~~~~~iyt~~~i~~~~~ 208 (499) T protein:vir:10 131 KTDPISVRDELGNEKLTPNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLE--GNTNGYSITVYMPQRIVEYRT 208 (499) T ss_pred ccccccccccccccccccccceEEEEEcccceEEEecCCCCcceEEEEEEEEEeecC--CCceEEEEEEEeCCeEEEEEe Confidence 4321 2223457889999987774432211111111111000000 001122223444433222111 Q ss_pred ccc---cCcCccccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEcc Q lcl|NC_015287. 212 LGA---IGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLR-MIEDSLVIYRLSRAPERRIFYID 287 (563) Q Consensus 212 ~g~---~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~-m~EDalVIyRi~RAPeRRvFyID 287 (563) .+. .+........ .--+..=+|++. .|.....|=++..+.....+. ++=+....-+.+..|-+-+.-.+ T Consensus 209 ~~~~~~~~~~~~~~~~-~~~~g~vPvv~~------~n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~ 281 (499) T protein:vir:10 209 KTTMEVSANDPIVYDG-ENLFGAVPIIEF------RNNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFG 281 (499) T ss_pred cCCccccCcceecccc-cCCCCccceEEe------cCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCc Confidence 110 0000000000 000111122221 122233465666555555554 33555555566777776665333 Q ss_pred CCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchH-HHHHHHHH Q lcl|NC_015287. 288 VGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGEL-SDIEYFQK 366 (563) Q Consensus 288 vGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei-~DV~YF~~ 366 (563) .+... .....+..+.+.--.+..|..+++|-...+.... .-+.-+.+ T Consensus 282 ~~~~~--------------------------------~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~ 329 (499) T protein:vir:10 282 LGDDK--------------------------------DDIQRLKRGAIEAPPREEGADIEWLTKSFDETQVNLLSQSIEN 329 (499) T ss_pred ccccc--------------------------------chhhhhhhcceeccCCCCCCcceEEeccCCHHHHHHHHHHHHH Confidence 32211 1111111111111122223446666554444332 34566677 Q ss_pred HHHHhcCCChhhccCCCcceeccccchhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhcCC-CChhHHHHhhhce Q lcl|NC_015287. 367 KLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFT---KFVGRMRKRFSHLFNDMLKTQLILKNI-VTPEDWEVLGDHI 442 (563) Q Consensus 367 kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~---KfI~rLr~rFs~if~d~Lk~qLiLkgI-~t~eew~~~~~~i 442 (563) .+|+-..+|- +..+ .| .|..|.+... .+|. .-+.+.++.|...+.++++.-+-+-++ -...+| ..+ T Consensus 330 ~I~~~s~~p~--~~~~--~~-~gn~Sg~Al~-~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~d~----~~i 399 (499) T protein:vir:10 330 DIHKISYVPN--MNDE--KF-MGNVSGEAMK-FKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIKGANDDA----SGC 399 (499) T ss_pred HHHHHhCccc--CCch--hh-cccchHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccc----ccc Confidence 7788888883 2111 12 2232322222 2333 335555666666666666654433222 122334 356 Q ss_pred EEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccCh--hhHHHHHHHHHHHhcCCCCCCCcccCc Q lcl|NC_015287. 443 QYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKD--EEVIEIDKQMEKEIKDGKLADPMEVQQ 520 (563) Q Consensus 443 ~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~td--eei~e~~~qie~E~~~~~~~~p~~~~~ 520 (563) .+.|....--.+ .+.+++++.+ .| .+|.+++++.+-..+| +|++++.++-+++... ...+...+. T Consensus 400 ~i~f~~~~p~n~-------~e~~~~~~kl---~g-~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~--~~~~~~~~~ 466 (499) T protein:vir:10 400 KISLVANIPSNL-------SDVVNNVKNA---DG-IIPRKYTYSWLPDVDNPQDVIDEMNQQDAETIKK--NQEALRGQD 466 (499) T ss_pred eEEeCCCCCCCH-------HHHHHHHHHH---hc-cCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHH--HHhhhccCC Confidence 777765443333 3445555555 34 4899999988654443 4444444332222211 001111111 Q ss_pred cccccccccCCCCcCCCCCcccccccCCCCcccCC Q lcl|NC_015287. 521 LQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAP 555 (563) Q Consensus 521 ~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~ 555 (563) ++.+ .........+|.+++..+..+.+++.-|- T Consensus 467 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (499) T protein:vir:10 467 PDRL--ELEDKQDDSSENDKEAGSNHNQSHRTRAV 499 (499) T ss_pred CCCC--CCCCCCcccCCCCCCCccccccCCCCCCC Confidence 1111 11111122234444444444455444444 No 54 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=98.44 E-value=6.5e-07 Score=54.49 Aligned_cols=447 Identities=13% Similarity=0.088 Sum_probs=209.2 Q ss_pred ccchhhHhhhccccccccc-CcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhh--ccch------------ Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPV-PQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMAL--HPEV------------ 69 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~-p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~--~PEv------------ 69 (563) ||. +++-+...+..+.-+ +..+--. +. . ...+. --....+.|+++++|.. +|-+ T Consensus 1 m~~-~~~~k~~~~~~~~~~~~~~~~~~----~~-~----~~~i~-~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~ 69 (508) T protein:vir:15 1 MGL-IQRIKDLFWKGAAATGVTGSLSK----IT-D----DPRIS-IDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKR 69 (508) T ss_pred CCh-HHHHHHHHHHHHHHhccccchHH----hh-c----ccccc-cCHHHHHHHHHHHHHhcCCCcccccccCCCCcccc Confidence 666 455444433222111 0000000 00 0 00000 00122344555555532 2222 Q ss_pred --------hHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheee Q lcl|NC_015287. 70 --------DSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHK 141 (563) Q Consensus 70 --------d~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhk 141 (563) ..++++.++ +|+. .++.+.+.+ + +.-.+..+.++.-.+|.....+.+....+-|..+|+. T Consensus 70 ~~~sln~~~~i~~~~A~--lv~~---e~~~i~v~~---~----~~~~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~ 137 (508) T protein:vir:15 70 LKNTINMAKTAARRIAS--VVFN---EKAEIHVKD---N----NEADKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRP 137 (508) T ss_pred ceeecchHHHHHHHHHh--hhhC---CCceEEeCC---c----hHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEE Confidence 222222222 2211 255555532 1 1223345778888889999999999999999999999 Q ss_pred eecCCCCCcceeEEEEeCCccceeeeeeeccCCccccce---eeecccccccccccccceeeecC--cccccccccccc- Q lcl|NC_015287. 142 VIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAIN---QVNSKNPADIENPEIEEYYQYDP--SQTQGKNALGAI- 215 (563) Q Consensus 142 viD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~---~~~~~~~~~~~~~~~~eyf~y~~--~~~~~~~~~g~~- 215 (563) .+|.. + +.+.+++|.++-+++.-.... ...+- +.+........|- .-||+.... +++..+...... T Consensus 138 ~~d~~----~-~~i~~v~ad~~~P~~~d~~~~--~~~af~~~~~~~~~~~~~~yt-~lE~h~~~~~~~~~I~n~ly~~~~ 209 (508) T protein:vir:15 138 YIDGN----H-IKIAWVRADQFYPLQSNTNDI--SEAAIASRTQRTESNQTKYYT-LLEFHQWQDNGSYQITNELYKSDS 209 (508) T ss_pred EEeCC----e-eEEEEEcCCeeEEEEEcCCCe--EEEEEEEEEEeecCCCceEEE-EEEEEEEecCcceEEEEEEEecCC Confidence 99843 2 468999999887764321110 00000 0111110111111 111221111 111111111111 Q ss_pred ----CcCccccCcceee-ec--ccceeeeecc-----ee--------cCCCCccchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 216 ----GQTPFSTKQRPVK-IA--PDAISFCHSG-----LV--------DRNKQTILSYLHKSIKALNQLRMIEDSLVIYRL 275 (563) Q Consensus 216 ----g~~~~~~~~~~ik-i~--~daI~y~hSG-----l~--------d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi 275 (563) |..-.. ..+. +. .+.+++.+.. .| +..+..++|-++.|+-.+..|-..-+. +.|. T Consensus 210 ~~~lG~~v~l---~~~~e~~~l~~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~--~~~e 284 (508) T protein:vir:15 210 PDIVGNQVPL---STLPVYKELAPQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQ--FIWE 284 (508) T ss_pred chhcCcccch---hhcccccCCCcceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHH--HHHH Confidence 110000 0000 00 1223332211 11 223445689999999888888766665 5588 Q ss_pred hcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCccc--ccccccchhhhhcccccCCCCccceeecCCCC Q lcl|NC_015287. 276 SRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIR--DDRKHMSMLEDFWLPRREGGRGTEITTLPGGQ 353 (563) Q Consensus 276 ~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~--d~~~~msMlEDywLpRReGgrgTEIsTLpGg~ 353 (563) +|.=.+|||.-+.- + -+|..+|.+- +++-+..|-.| ...|.-|+.+...- T Consensus 285 ~~~~~~~i~v~~~~-l--------------------~~d~~~~~~~~~~~~~~~~~~~~-------~~~~~~i~~~~~~i 336 (508) T protein:vir:15 285 IRLGQKHIAVQPGM-L--------------------RFDDEHKPTFDTEQNVYVGVLSD-------DNNGLGVKDMTTPI 336 (508) T ss_pred HHhcccceeechHH-h--------------------cCCCCCccccCCCCeeEEeccCC-------CCCCCceeEeeccc Confidence 89888898873210 0 0234443321 22223222111 12223355555332 Q ss_pred Ccc-hHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh------ Q lcl|NC_015287. 354 NLG-ELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLIL------ 426 (563) Q Consensus 354 nLg-ei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiL------ 426 (563) --+ -.+-+..+.+.+....+++.+-|..+++ ...-|+||...+-.-..-+.+.|+.|...+.++++.=|-+ T Consensus 337 r~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~--~~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~ 414 (508) T protein:vir:15 337 RTVQYKDAIDHFIKEFEVQIGLSTGTFSYSND--GVKTATEVVSNNSMTYQTRSSYLTMVEKAIDELCQSIFELANAGAL 414 (508) T ss_pred ChHHHHHHHHHHHHHHHHHhCCCchhcccccC--ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 111 2346777888899999999888864432 2334667665555555666666666666666666654322 Q ss_pred -cCCCChh--HHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHH Q lcl|NC_015287. 427 -KNIVTPE--DWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQM 503 (563) Q Consensus 427 -kgI~t~e--ew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qi 503 (563) ++..... .+...-..+.++|...- .+-+++++-+ +.++..- | .+|+.+.+++....+|||.+++-++| T Consensus 415 ~~~g~~~~~~~~~~~~~~v~v~f~D~i--~~d~~~~~~~-----~~~~v~a-G-i~s~e~~i~~~~g~~deea~~el~ri 485 (508) T protein:vir:15 415 FDDGKPLFTLDSASQPLDIECHFDDGV--FVNKDKQLEE-----DAKVLAI-G-ALSKQTFLQRNYGMTDEQAAEELAKI 485 (508) T ss_pred ccccccccccccccCCcceEEEeCCCC--CCCHHHHHHH-----HHHHHhc-C-CCCHHHHHHhcCCCChHHHHHHHHHH Confidence 2221111 11111234666665422 2223332222 2222111 3 58888887778889999999999999 Q ss_pred HHHhcCCCCCCCcccCccccccccccCCCCc Q lcl|NC_015287. 504 EKEIKDGKLADPMEVQQLQLGVHPEQMPGGA 534 (563) Q Consensus 504 e~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~ 534 (563) ++|..... +.++...+..++.|| T Consensus 486 ~~E~~~~~--------~~~~~~~~~~g~~ge 508 (508) T protein:vir:15 486 QSEAPTDT--------FEGGRSAILNGGDGE 508 (508) T ss_pred HHhccccC--------ccccccccCCCCCCC Confidence 99954311 111111222222333 No 55 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=98.42 E-value=7.3e-07 Score=54.24 Aligned_cols=444 Identities=12% Similarity=0.090 Sum_probs=193.5 Q ss_pred hhcccccccccCcCCCCcccceeccccc-cc-----cccccchhhhHHHHHHHHHHHhhccch----------------- Q lcl|NC_015287. 13 DEQNKNVVSPVPQSNEDSSDYYVSSGFY-GQ-----YVDIDGVFKSEFELIKRYREMALHPEV----------------- 69 (563) Q Consensus 13 ~~~~~~~~s~~p~~~~d~~~~~~~~~~~-~~-----~~~~~~~~~~~~~LI~~YR~ma~~PEv----------------- 69 (563) ++.....|.--|+ +.+....-.. .. -.........+. .+|+.+..+++- T Consensus 1 ~~~~~~~~~~~~~-----~~~~~p~~~~~~~~~~~l~~~l~~~~~~~~---~rl~~l~~YY~G~~~~~~~~~~~~~~~~~ 72 (501) T protein:vir:25 1 MTVPVDVIADAPA-----ADVEFPEDSMSREQLGALVADMWRLHISER---QWLDRIYEYTKGLRGRPEVPEGASDEVKE 72 (501) T ss_pred CcccchhhhccCc-----ccccCCcccCChHHHHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCchhccccCChhhhh Confidence 2222222211111 1111100000 00 000000011111 112222222111 Q ss_pred ----------hHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhhe Q lcl|NC_015287. 70 ----------DSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYY 139 (563) Q Consensus 70 ----------d~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~f 139 (563) .-+|+-.++= ++ +....+.+.. -.+....|.+..+|+...+++++.-++.||-|. T Consensus 73 ~~~~~v~n~~~~ivd~~a~~-l~------~~gf~~~d~~--------~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~ 137 (501) T protein:vir:25 73 LAKLSVKNVLSLVRDSFAQN-LS------VVGYRNALAK--------ENDPAWEMWQRNRMDARQAEVHRPALTYGASYV 137 (501) T ss_pred hHhhhhcChHHHHHHHHHhh-hc------ccceecCCcc--------chHHHHHHHHhcChhHHHHHHHHHHhhcCceEE Confidence 1122221110 00 1111111211 123456677888899999999999999999887 Q ss_pred eeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccc-cceeee---cccccc-----cccccccceeeecCccccccc Q lcl|NC_015287. 140 HKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRG-AINQVN---SKNPAD-----IENPEIEEYYQYDPSQTQGKN 210 (563) Q Consensus 140 hkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~-~i~~~~---~~~~~~-----~~~~~~~eyf~y~~~~~~~~~ 210 (563) +.-.|. +| ..++.++|+.+..|- .+++.+. .+..++ .....+ ..|.+ ...|.|..+...... T Consensus 138 ~v~~de----~~-~~i~~~sp~~~~~iy---~D~~~~~~~~~ai~~~~~~~~~~~~~~~~~y~~-~~~~~~~~~~~~~~~ 208 (501) T protein:vir:25 138 TVTPTD----EG-PVFRTRSPRQILAVY---ADPSVDAWPQYALETWVAQKDAKPHRRGVLYDD-TYMYELDLGEVVLGD 208 (501) T ss_pred EEecCC----CC-CeEEEeccccEEEEE---ecCCCCcceeEEEEEEeeccccCcceeEEEecC-eeEEEEecCceeeee Confidence 755542 23 357778998876551 2222222 222211 100000 00111 111222211111100 Q ss_pred cccccCcCccccCcceeeec----------cc---ceeeeecceecCCCCccchhHHHHHHHHHHH-HHHHHHHHHHHHh Q lcl|NC_015287. 211 ALGAIGQTPFSTKQRPVKIA----------PD---AISFCHSGLVDRNKQTILSYLHKSIKALNQL-RMIEDSLVIYRLS 276 (563) Q Consensus 211 ~~g~~g~~~~~~~~~~iki~----------~d---aI~y~hSGl~d~~~~~~~syL~~Aik~~NqL-~m~EDalVIyRi~ 276 (563) .... .....+.......+ .. .|.|+..-. . .+.+.|-++..+...+-+ +++-+.+++-..+ T Consensus 209 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~~--~-~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~ 283 (501) T protein:vir:25 209 AGGG--QATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGRD--A-DDMIVGEVAPLILLQQAINSVNFDRLIVSRFG 283 (501) T ss_pred cccc--ccccccccccccccccccccccCCccceeeEeccCccc--c-CccccchhhhhHHHHHHHHHHHHHHHHHHHhh Confidence 0000 00000000000000 00 111222111 0 122344444433222221 3455777888888 Q ss_pred cCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcc Q lcl|NC_015287. 277 RAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLG 356 (563) Q Consensus 277 RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLg 356 (563) -.|.|-+.=.+....+..+ .+.++ .|+. +| -+++|.++|+..-=+ T Consensus 284 a~p~~~i~G~~~~~~~~~~----------~~~~~----------------------i~~~--~~-~~~~~~q~~~~~~~~ 328 (501) T protein:vir:25 284 ANPQRVISGWTGSKAEVLK----------ASALR----------------------VWTF--ED-PEVKAQAFPPASVEP 328 (501) T ss_pred ccHHHHHhCCCCCccchhh----------hcccc----------------------eecc--CC-CCceEEEecccChHH Confidence 8888877655433332211 11111 2332 12 235677787653212 Q ss_pred hHHHHHHHHHHHHHhcCCChhhccCCCcceecccc-chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHH Q lcl|NC_015287. 357 ELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRS-SEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDW 435 (563) Q Consensus 357 ei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~-~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew 435 (563) -++-++-.-..+...-++|..-|. ++.-..| ..|.--+....+-+.+.|+.|..-+.+++|.-+.++|.....+| T Consensus 329 ~~~~l~~~i~~i~~~s~~P~~~~~----~~~~N~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~~~~~~~~ 404 (501) T protein:vir:25 329 YNLILEEMLQHVAMVAQISPAQVT----GKMINVSAEALAAAEANQQRKLAAKRESFGESWEQLLRLAAEMDDDPDTAAD 404 (501) T ss_pred HHHHHHHHHHHHHhhcCCChhhhc----cccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccc Confidence 233455555566667788987773 2212223 34444455577888999999999999999999989986543333 Q ss_pred HHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCC Q lcl|NC_015287. 436 EVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADP 515 (563) Q Consensus 436 ~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p 515 (563) ..|.+.|..-..= -+.+.++++..+..- | +|.+++...++.+++.|++++.++.+++...+.+..+ T Consensus 405 ----~~i~v~w~~~~~~-------s~~~~ada~~kl~~~-g--is~et~~~~~~g~~~~~ie~~~~~~~e~~~~~~~~~~ 470 (501) T protein:vir:25 405 ----SGAEVLWRDTEAR-------SFGAVVDGITKLASA-G--IPIEHLLSMVPGMTQQTIQAIKDSLRGGEVKSLVDKL 470 (501) T ss_pred ----eeeeEEecCCCCC-------CHHHHHHHHHHHHhc-C--CCHHHHHHHcCCCCHHHHHHHHHHHHHHhHHHHHHHh Confidence 2467777543211 124566666666543 3 6999999999999999999888877777655444322 Q ss_pred cccCccccccccccCCCCcCCCCCcccccccCCCCcccCCCCCCCCC Q lcl|NC_015287. 516 MEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAPQMPEGGE 562 (563) Q Consensus 516 ~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~ 562 (563) ...+ +++. .|..++..+++ ...++.+ |+||- T Consensus 471 ~~~~-~~~~-----------~~~~~~~~~~~-~~~~~~~---~~~g~ 501 (501) T protein:vir:25 471 LSNE-PAPV-----------PPPPPQAAAQA-LNEGGVN---GNGGA 501 (501) T ss_pred hccC-cCCC-----------CCCCCCCCccc-cccccCC---CCCCC Confidence 1111 1110 01111000000 1111111 11222 No 56 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=98.40 E-value=8.5e-07 Score=53.85 Aligned_cols=470 Identities=11% Similarity=0.130 Sum_probs=197.9 Q ss_pred Ccccccc-------hhhHhhhcccc--ccccc---CcCCCCc-----------ccceecccccc-----ccccccchhhh Q lcl|NC_015287. 1 MAKLFGF-------KIQKDDEQNKN--VVSPV---PQSNEDS-----------SDYYVSSGFYG-----QYVDIDGVFKS 52 (563) Q Consensus 1 m~~~~~f-------~~~~~~~~~~~--~~s~~---p~~~~d~-----------~~~~~~~~~~~-----~~~~~~~~~~~ 52 (563) |.+|..- ++++..+...- ...++ ..++..+ ....++..+.+ ..+..-+...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (574) T protein:vir:80 1 MPKWLDKALGIEKSSIEETRNMENYKMHLREIDTNVVNNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIRN 80 (574) T ss_pred CcchhhhhhccchhhHHHHHhhhhhccccchhhhhhhhccCCCHHHHHHhHhhhcccccchhhhhccccccccCcCccCC Confidence 8777642 22211111000 00111 0011111 11111111111 11122233445 Q ss_pred HHHHHHHHHHHhhccchhHHHHHhhcceEEe------cCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhc-----ch Q lcl|NC_015287. 53 EFELIKRYREMALHPEVDSAIEDIINEAIVS------DQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILN-----FD 121 (563) Q Consensus 53 ~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~------d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~-----F~ 121 (563) ...|-+.-+..+..|-|..||+.+++-+..| ....=|..|.+++..-... .+...+...+.++|. ++ T Consensus 81 ~~~~~~~l~~~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~--~~~~~~~~~l~~ll~~~~~~~n 158 (574) T protein:vir:80 81 SQDLHKTLKKFGNNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPT--SHDIANIKRIESFLENTAQFRD 158 (574) T ss_pred cccHHHHHHhhccChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCcc--chhhhhhhHHHHHHhccCCCCC Confidence 5555555666678899999999999876554 2234466776665332111 122223333444442 11 Q ss_pred -------hhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeeccccccccccc Q lcl|NC_015287. 122 -------DKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPE 194 (563) Q Consensus 122 -------~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~ 194 (563) .=.+.+++.+++.|--|+.++.|.. .-+.+|++|||..|+.++....- + ... T Consensus 159 P~~~s~~ef~~~lv~~lll~Gnayi~i~r~~~---G~~~~L~pl~p~~V~v~~d~~~~------~------------~~~ 217 (574) T protein:vir:80 159 PNRDNFTTFCKKLVRATYMYDQVNFEKVFDKD---GNFIKFDTVDPTTIFLATNGEGK------L------------IKN 217 (574) T ss_pred CccccHHHHHHHHHHHHHhcCCeEEEEEECCC---CcEEEEEEEcCceeEEEEcCccc------c------------ccC Confidence 1223455566778999999888754 34999999999998876422110 0 000 Q ss_pred ccceeeecCccccccccccccCcCccccCcceeeecccceeeeecce-ec-CCCCccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 195 IEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSGL-VD-RNKQTILSYLHKSIKALNQLRMIEDSLVI 272 (563) Q Consensus 195 ~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl-~d-~~~~~~~syL~~Aik~~NqL~m~EDalVI 272 (563) -..||.+.. +.....++.+.|.|..-.. .+ .++..++|-|+.|+.++.....+++...= T Consensus 218 ~~~y~~~~~-------------------g~~~~~~~~~eiih~~~~~~~~~~~~~~G~spi~~a~~~i~~~~~a~~~~~~ 278 (574) T protein:vir:80 218 GERFVQVID-------------------NRIVAKFNERELAFAVRNPRADIEVGQYGYPELEIALKQFIAHENTEVFNDR 278 (574) T ss_pred ceEEEEEeC-------------------CceEEEEccccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHH Confidence 112333311 0112345556665543211 11 12445678899999999988888886654 Q ss_pred HHHhcCccceEEEccCCC-CchHHHHHHHHHHHHh-hcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecC Q lcl|NC_015287. 273 YRLSRAPERRIFYIDVGN-LPKMKAEQYLKEVMNR-YRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLP 350 (563) Q Consensus 273 yRi~RAPeRRvFyIDvGn-lpk~KAeqYl~~im~k-~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLp 350 (563) +=---+--+-|..++.+. |-+..++. +++-+.. |.- . .+.| +. .++++ .|.++.-|. T Consensus 279 ~f~ng~~p~gil~~~~~~~ls~e~~~~-lk~~~~~~~~G-~---~n~g------~~-~vl~~---------~G~~~~~l~ 337 (574) T protein:vir:80 279 FFSHGGTTRGILHVKTGQQQSQQALDI-FRREWRSSLAG-I---NGSW------QI-PVVSA---------EDVKFVNMT 337 (574) T ss_pred HHhccCCCceEEEeCCCCCCCHHHHHH-HHHHHHHHhcc-c---cccc------cc-eeecC---------CCceEEEcc Confidence 444445667777777654 44443333 3333322 221 0 0112 11 12211 245566664 Q ss_pred CCCCcchH---HHHHHHHHHHHHhcCCChhhccCCCcc-eeccccchhhH---HH--HHHHHH-HHHHHHHHHHHHHHHH Q lcl|NC_015287. 351 GGQNLGEL---SDIEYFQKKLYRSLGVPESRIAGSGDG-FNLGRSSEILR---DE--IKFTKF-VGRMRKRFSHLFNDML 420 (563) Q Consensus 351 Gg~nLgei---~DV~YF~~kLy~aL~VP~sRl~~~~~g-~~~g~~~eItR---DE--lKF~Kf-I~rLr~rFs~if~d~L 420 (563) . +..++ +--+|..+...++.+||...|.-.+.+ +....+...++ ++ +-|..+ +.-+..++...|...| T Consensus 338 ~--s~~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~~L 415 (574) T protein:vir:80 338 P--SANDMQFEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEKMQASQNKGLQPLLRFIEDTVNTYI 415 (574) T ss_pred C--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 3 33333 334568888999999999999532221 21111111111 11 224443 4444444444444332 Q ss_pred HHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHH Q lcl|NC_015287. 421 KTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEID 500 (563) Q Consensus 421 k~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~ 500 (563) + ++ ++ ..+.+.|.....= .|.+....+.....-++|..-+|.. +.+.+.+ --+ T Consensus 416 ----l-----~~--~~---~~~~~~f~~~d~~----------~~~~~~~~~~~~~~G~lT~NE~R~~-lgl~Pi~--gGD 468 (574) T protein:vir:80 416 ----V-----AE--FG---EKYQFQFRGGDLS----------AQLDKLKIIEQEGKVFRTVNEIRHD-KGLEPIK--GGD 468 (574) T ss_pred ----h-----hh--cC---CceEEEecccchh----------hHHHHHHHHHHHhCCccCHHHHHHH-hCCCCCC--CCC Confidence 2 22 11 3466777765421 1222221111111235666666653 3343311 000 Q ss_pred HHHHHHhcCCCCCCCcccC--------ccccccccccCCCCcCCCC-CcccccccCCCCcccCCCCCCCCCC Q lcl|NC_015287. 501 KQMEKEIKDGKLADPMEVQ--------QLQLGVHPEQMPGGAMNPQ-DPNMGGAPQEGGAEQAPQMPEGGEI 563 (563) Q Consensus 501 ~qie~E~~~~~~~~p~~~~--------~~~~~~~~~~~~~~~~~p~-~~~~~~~~~~~~~~~~~~~~~~~~~ 563 (563) +-+- -..-.+...+.+.. +..........+.++.+|. .|.....-......+++..-+|++. T Consensus 469 ~~~~-~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~~~~~~~~~ 539 (574) T protein:vir:80 469 VILN-GVHIQAIGQALQEEQLEYQRSQDRLNRLLELSGGDVEQPEPEEPKDSQNDTDVSFQDEQQGLNGKSK 539 (574) T ss_pred Eeee-ccceeecccccccccCCccchhccccccccccCCCCCCCCCCCCCCccccccchhhhhhhhhccchh Confidence 0000 00000000000000 0000000000000000000 0000000000112233333344444 No 57 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=98.38 E-value=9.6e-07 Score=53.57 Aligned_cols=454 Identities=14% Similarity=0.172 Sum_probs=192.9 Q ss_pred Ccccccc--------------------------hhhHhhhc-------ccccccccCcCCCCcccceecccccccccccc Q lcl|NC_015287. 1 MAKLFGF--------------------------KIQKDDEQ-------NKNVVSPVPQSNEDSSDYYVSSGFYGQYVDID 47 (563) Q Consensus 1 m~~~~~f--------------------------~~~~~~~~-------~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~ 47 (563) |-.-+|. ..+..++. .+..+.+.|.. ++. -+.++|. .- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~---~~~-~~~~~~~-----~r 71 (551) T protein:vir:80 1 MKNKLGLFESIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVI---GSM-SANPGFK-----TK 71 (551) T ss_pred CchhhhhHHHhhhccCChhhcccccccccceeeecccccHHHHHHhhccCcceeecccc---cce-ecCcccc-----cC Confidence 2211111 00000000 01111111111 000 0111211 12 Q ss_pred chhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEec------CCCcEEEEEeecc--ccchHHHHHHHHHHHHHHHHhc Q lcl|NC_015287. 48 GVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSD------QNDSPVEIDLENL--PASKQLKDLIREEFKTVKEILN 119 (563) Q Consensus 48 ~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d------~~~~pV~l~l~~~--~~s~sik~~I~eeF~~i~~ll~ 119 (563) ..+.+..+|-+..+.++.+|-|..||+.|+|.+.++- .++-+..+.+.+- +.++..+..+. ....+++-.+ T Consensus 72 ~~~~~~~~l~~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~-~i~~~l~~pn 150 (551) T protein:vir:80 72 PSIRNNQDLHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIK-RIESFIEKTG 150 (551) T ss_pred ccccChhHHHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHH-HHHHHHHhcC Confidence 2334556666667777889999999999999766432 2333455666542 23332222211 1222333333 Q ss_pred ch-----hhhHHHHHHHh----hhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeeccccccc Q lcl|NC_015287. 120 FD-----DKCHEILRNWY----VDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADI 190 (563) Q Consensus 120 F~-----~~~~~~~R~WY----vDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~ 190 (563) .. ....++++.|. +.|.-|+.++.|... -+.+|.+|||.+|+.++-... .+ . T Consensus 151 ~~~~p~~~s~~~f~~~lv~dlll~Gnay~~i~rd~~G---~~~~L~~l~p~~V~v~~~~~g------~~-~--------- 211 (551) T protein:vir:80 151 VDNDINRDSFSSFVKKIVRDTYMYDQVNFEKVFNRNQ---SMVRFVAKDPTTIFFATTADG------KI-P--------- 211 (551) T ss_pred CCCCCccchHHHHHHHHHHHHHhcCCEEEEEEECCCC---cEEEEEEeCCceeEEEECCcc------cc-c--------- Confidence 22 24456666654 569899998887532 399999999998876621111 00 0 Q ss_pred ccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeecc-eecC-CCCccchhHHHHHHHHHHHHHHHH Q lcl|NC_015287. 191 ENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSG-LVDR-NKQTILSYLHKSIKALNQLRMIED 268 (563) Q Consensus 191 ~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSG-l~d~-~~~~~~syL~~Aik~~NqL~m~ED 268 (563) .-..+|.+... +...+.++.+.|.|.+-. +.+. .+..++|-|+.|+..+.....++. T Consensus 212 ---~~~~~y~~~~~------------------g~~~~~~~~~eiiH~~~n~~~~~~~~~~G~spi~~a~~~i~~~~a~~~ 270 (551) T protein:vir:80 212 ---DNGNRFVQVID------------------QKIVATFNAREMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEA 270 (551) T ss_pred ---cCceEEEEEeC------------------CcEEEEEcccceEEecccCCCCcccccccccHHHHHHHHHHHHHHHHH Confidence 00112222110 112345667767655422 2222 234467889999999999888888 Q ss_pred HHHHHHHhcCccceEEEccC-CCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCcccee Q lcl|NC_015287. 269 SLVIYRLSRAPERRIFYIDV-GNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEIT 347 (563) Q Consensus 269 alVIyRi~RAPeRRvFyIDv-Gnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIs 347 (563) ...=+----+--+-|..+.. ++|.+..+++.-+.+...|.-- .+.|.+ .++. +.|.++. T Consensus 271 ~~~~~f~Ng~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~----~nag~~-------~vl~---------~~g~~~~ 330 (551) T protein:vir:80 271 FNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGI----NGSWQI-------PVVS---------AEDVKFV 330 (551) T ss_pred HHHHHHHcCCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCc----cccCcc-------cccc---------CCCceEE Confidence 76543333344555666654 3465555544444443333210 011211 1221 1134555 Q ss_pred ecCCCCCcchHH---HHHHHHHHHHHhcCCChhhccCCCcc--e-ecccc---chhhHHHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_015287. 348 TLPGGQNLGELS---DIEYFQKKLYRSLGVPESRIAGSGDG--F-NLGRS---SEILRDEIKFTK-FVGRMRKRFSHLFN 417 (563) Q Consensus 348 TLpGg~nLgei~---DV~YF~~kLy~aL~VP~sRl~~~~~g--~-~~g~~---~eItRDElKF~K-fI~rLr~rFs~if~ 417 (563) .|. .+..++. -.+|..+.+-++.+||...|.-.+.+ + +.+.+ +-+.-....|.. -+.-+..++...|. T Consensus 331 ~l~--~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln 408 (551) T protein:vir:80 331 NMT--PSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFIN 408 (551) T ss_pred Ecc--CChhHHHHHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 553 3333333 34567788999999999999532111 1 11111 112222223433 35556666655555 Q ss_pred HHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccCh-hhH Q lcl|NC_015287. 418 DMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKD-EEV 496 (563) Q Consensus 418 d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~td-eei 496 (563) ..|-+ . + ...+.++|.....-.+ .+|..+...+. .-++|..-+|... .+.+ .+ T Consensus 409 ~~L~~---------~--~---~~~~~f~f~~~~~~~~-------~~~~~~~~~~~---~g~lT~NE~R~~~-gl~P~~e- 462 (551) T protein:vir:80 409 KHIVA---------E--F---GDKYTFQFVGGDIKSE-------LESVKILAEKA---KVAMTVNEVRKEL-NLPGDVI- 462 (551) T ss_pred hhhcc---------c--c---CCceEEEeeccChhhH-------HHHHHHHHHHh---cCCcCHHHHHHHh-CCCCCCC- Confidence 44322 1 1 1457777776543222 23433333222 1246666666433 3322 11 Q ss_pred HHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccCCCCCCCCCC Q lcl|NC_015287. 497 IEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAPQMPEGGEI 563 (563) Q Consensus 497 ~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 563 (563) ..+-++ .|...............+.........+..++...+.+.+.|..|.+.|- T Consensus 463 ----------gGD~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 518 (551) T protein:vir:80 463 ----------GGDIPL-NGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDT 518 (551) T ss_pred ----------CCceee-cccccccccccccccCcchhhhhhccccccCcCCCCCCCCCCCCCCcccc Confidence 000000 01000000000000000000000000000011111111111111111111 No 58 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=98.37 E-value=9.8e-07 Score=53.53 Aligned_cols=468 Identities=12% Similarity=0.104 Sum_probs=186.6 Q ss_pred CcccccchhhHhhhcccccccccCcCC--------------------CCcccce--------eccccccccccccchhhh Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSN--------------------EDSSDYY--------VSSGFYGQYVDIDGVFKS 52 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~--------------------~d~~~~~--------~~~~~~~~~~~~~~~~~~ 52 (563) +|-+|.- -| ..-.++.+..+-|.+ +.+.+.. +.|+ .++.. -...+.. T Consensus 5 ~~~~~~~--~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~p~~~~~~~~-~~~~~-~p~~~~~ 79 (576) T protein:vir:96 5 LADIFKR--LR-LGRDYEDIIDTVPIDDGLQANIRNIEEKSKELNKSLYGKQQAYAEPFLEVMDTN-PEFRT-KRSYMKN 79 (576) T ss_pred HHHHHHH--Hh-ccCccccchhhhhcccChhHHHHHhhhhhhhhccccCCccchhhcceeeeeecC-CCccc-cCcchhh Confidence 1211110 00 000111111111111 1111110 1111 01100 0111111 Q ss_pred HHHHHHHHHHHhhccchhHHHHHhhcceEEe------cCCCcEEEEEeeccc--cchHHHHHHHHHHHHHHHHhcch--- Q lcl|NC_015287. 53 EFELIKRYREMALHPEVDSAIEDIINEAIVS------DQNDSPVEIDLENLP--ASKQLKDLIREEFKTVKEILNFD--- 121 (563) Q Consensus 53 ~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~------d~~~~pV~l~l~~~~--~s~sik~~I~eeF~~i~~ll~F~--- 121 (563) -..+-+.-+.++.+|=|..+|+.|.+.+.++ ..+.-...|.+.+.. .++.-+..+...-..+.+++... T Consensus 80 ~~~~~~~l~~~~~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~ 159 (576) T protein:vir:96 80 SDNLHDVLKQFGNNPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDID 159 (576) T ss_pred hhhhHHHHHHhhcCHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCc Confidence 1222223345567899999999999876542 223333444554333 33333333332222233333221 Q ss_pred -hhhHHHH----HHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeeccccccccccccc Q lcl|NC_015287. 122 -DKCHEIL----RNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIE 196 (563) Q Consensus 122 -~~~~~~~----R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 196 (563) ....+++ +.+++.|.-|+.++++. +...-+.+|.+|||..|+.++.. ++..... .. T Consensus 160 ~~t~~~f~~~lv~dlll~Gna~~~i~~~r-d~~g~~~~L~pl~p~~V~v~~~~-------dg~~~~~-----------~~ 220 (576) T protein:vir:96 160 RDSFQSFCRKIVRDTYTYDQVNFEKVFNK-KNATTMDKFIAVDPSTIFYATDK-------NGKIIKG-----------GK 220 (576) T ss_pred cccHHHHHHHHHHHHHhcCCeEEEEEEec-CCCCceEEEEEeCCceeEEEECC-------CCceeee-----------ee Confidence 1344544 44567799999888774 33333899999999988876321 1110000 01 Q ss_pred ceeeecCccccccccccccCcCccccCcceeeecccceeeeecc-eecCC-CCccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 197 EYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSG-LVDRN-KQTILSYLHKSIKALNQLRMIEDSLVIYR 274 (563) Q Consensus 197 eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSG-l~d~~-~~~~~syL~~Aik~~NqL~m~EDalVIyR 274 (563) .|+.+.. +.....++.+.|+|--.+ ..|.. +..++|-|+.|.+.+.....++....=+= T Consensus 221 ~~~~~~~-------------------~~~~~~~~~~dii~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f 281 (576) T protein:vir:96 221 RFVQVIN-------------------KKVVASFTSREMAMGIRNPRTELSSSGYGLSEVEIAMKQFIAYNNTETFNDRFF 281 (576) T ss_pred EEEEecC-------------------CceEEEecccceEEEeecCCCCcccCcccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111110 111244556665442111 12222 44568889999999988887777654443 Q ss_pred HhcCccceEEEccCC-CCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCC Q lcl|NC_015287. 275 LSRAPERRIFYIDVG-NLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQ 353 (563) Q Consensus 275 i~RAPeRRvFyIDvG-nlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~ 353 (563) ---|--+-|..+..+ .|.+..+++-.+.+-..|+-- .+.| +..-.++ -|.+++.|.-.. T Consensus 282 ~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~----~nag------~~p~vl~----------~G~~~~~ls~~~ 341 (576) T protein:vir:96 282 SHGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGI----NGSW------QVPVVMA----------DDIKFVNMTPTA 341 (576) T ss_pred hccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccc----cccc------cceeecC----------CCceEEeccCCh Confidence 344566777777764 455555544444443334311 0111 1111122 156677774322 Q ss_pred Ccch-HHHHHHHHHHHHHhcCCChhhccCCCcceecc--ccchhhH-----HHHHHHHH-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 354 NLGE-LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLG--RSSEILR-----DEIKFTKF-VGRMRKRFSHLFNDMLKTQL 424 (563) Q Consensus 354 nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g--~~~eItR-----DElKF~Kf-I~rLr~rFs~if~d~Lk~qL 424 (563) .-.+ ++-.+|..+.+.++++||...|.-...+-..| .++.+|+ .-+.|.++ +.-+..++...|.. .| T Consensus 342 ~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~----~L 417 (576) T protein:vir:96 342 NDMQFEKWLTYLINIISALYGIDPAEIGFPNRGGATGGKGGNTLNEADPGKKQQQSQNKGLQPLLRFIEDLINT----HI 417 (576) T ss_pred hhHHHHHHHHHhHHHHHHHhCCCHHHccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHh----hh Confidence 2222 44556788999999999999995322111111 1112222 22224443 44444444444443 22 Q ss_pred HhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHH Q lcl|NC_015287. 425 ILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQME 504 (563) Q Consensus 425 iLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie 504 (563) + +. + ...+.++|.+...= .|.+.++.......-++|..-+|.. +.+.+.+ --++.+ T Consensus 418 l-----~~--~---~~~~~~~f~r~d~~----------~~~e~~~~~~~~~~G~lT~NE~R~~-~gl~pie--gGD~~~- 473 (576) T protein:vir:96 418 I-----SE--Y---SDKYVFQFVGGDTK----------SELDKIKILQEEVKTYKTVNEARKE-KGLKPIE--GGDVLL- 473 (576) T ss_pred c-----hh--c---cCceEEEeccCCHH----------HHHHHHHHHHHHhcCccCHHHHHHH-hCCCCCC--Ccceec- Confidence 2 11 1 24567778765431 2223222222111125566665543 3343311 000000 Q ss_pred HHhcCCCC-CCCcccCccccccc----------cccCCCCcCCCCCcccccccCCCCcccCCCCCCC---CCC Q lcl|NC_015287. 505 KEIKDGKL-ADPMEVQQLQLGVH----------PEQMPGGAMNPQDPNMGGAPQEGGAEQAPQMPEG---GEI 563 (563) Q Consensus 505 ~E~~~~~~-~~p~~~~~~~~~~~----------~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~---~~~ 563 (563) .+.. +..++....+.... .....++...|..++.....+.....+.+....| ++- T Consensus 474 ----~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~g~~~~~~~~~~~~~~~~~ 542 (576) T protein:vir:96 474 ----DGSFIQSMSLNTQKEQYEDTKQKERFDMIQQFLNSPDDEEPQQESTEDKVDGRESNDPTKIDSPVGTDG 542 (576) T ss_pred ----cccccccccccccCCCCCCccccccccccccccCCCCCCCCCCCCCCCcccccccccCCCCCCcccccc Confidence 0000 00000000000000 0000000001111111111111111111111111 000 No 59 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=98.36 E-value=1e-06 Score=53.39 Aligned_cols=459 Identities=11% Similarity=0.041 Sum_probs=176.5 Q ss_pred CcccccchhhHhhhcccccc--cccCcCCCCcccceeccccccccc---cccchhhhHHHHH--------HHHHHHhhcc Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVV--SPVPQSNEDSSDYYVSSGFYGQYV---DIDGVFKSEFELI--------KRYREMALHP 67 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~--s~~p~~~~d~~~~~~~~~~~~~~~---~~~~~~~~~~~LI--------~~YR~ma~~P 67 (563) |.+.=.|. +...+.+.+ .+=+. +.+.|- ++. +......-...+| .+|+.+..+. T Consensus 1 ~~~~~~~~---~~~~~~~~~~~~~~~~---------~n~~~~-~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy 67 (511) T protein:vir:10 1 MLKVNEFE---TDTDLRGNINYLFNDE---------ANVVYT-YDGTESDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYY 67 (511) T ss_pred Cccccchh---hhhhhhhhhhhhhhhh---------hcCCcc-CchhhhhcccCHHHHHHHHHHHHHhhHHHHHHHHHHh Confidence 44444441 111111111 11011 111111 110 0000000011111 2333333322 Q ss_pred chhHHH----------------------HHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhH Q lcl|NC_015287. 68 EVDSAI----------------------EDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCH 125 (563) Q Consensus 68 Evd~Ai----------------------~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~ 125 (563) +=+..| .-||+- .+.=--..||.+..++ +...+.+..|.+..+|+.... T Consensus 68 ~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~-~~~yl~g~p~~~~~~d--------~~~~~~l~~~~~~n~~~~~~~ 138 (511) T protein:vir:10 68 EGKTKNLVELTRRKEEYMADNRVAHDYASYISDF-INGYFLGNPIQYQDDD--------KDVLEAIEAFNDLNDVESHNR 138 (511) T ss_pred cccCccccccCcccccccCcceeecchHHHHHHH-HhhhhcccCceeecCc--------hHHHHHHHHHHhhcCHHHHHH Confidence 211111 112221 1111134566665543 234566788888889999999 Q ss_pred HHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCcc Q lcl|NC_015287. 126 EILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQ 205 (563) Q Consensus 126 ~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~ 205 (563) ++.+.+++.|+-|.+.-+| ++|-..+..+||+.+..|..-..+......|.........+........+-+|.+.. T Consensus 139 ~~~~~~~i~G~ay~~vy~d----edg~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~ 214 (511) T protein:vir:10 139 SLGLDLSIYGKAYEIMIRN----QDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHG 214 (511) T ss_pred HHHHHHHhcCeeEEEEEeC----CCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCc Confidence 9999999999998887665 457789999999999888533222211222222111110111111111122455543 Q ss_pred ccccccccccCcCccccCcceeeeccc--ceeeeecceecCCCCccchhHHHHHHHHHHHHHH-HHHHHHHHHhcCccce Q lcl|NC_015287. 206 TQGKNALGAIGQTPFSTKQRPVKIAPD--AISFCHSGLVDRNKQTILSYLHKSIKALNQLRMI-EDSLVIYRLSRAPERR 282 (563) Q Consensus 206 ~~~~~~~g~~g~~~~~~~~~~iki~~d--aI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~-EDalVIyRi~RAPeRR 282 (563) .......+..+.............+.. +|++. .|+....|-++..+....-+..+ =+....-+-++.|-+- T Consensus 215 i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f------~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv 288 (511) T protein:vir:10 215 VYRYLTSRTNGLKLTPRENGFESHSFERMPITEF------SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLL 288 (511) T ss_pred EEEEEecCCCcccccccccccccccCcceeEEEe------cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceee Confidence 332221111100000000000000011 22221 12233456666666555444322 2222222334444333 Q ss_pred EEE---ccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhh-cccc--cCCCCccceeecCCCCCcc Q lcl|NC_015287. 283 IFY---IDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDF-WLPR--REGGRGTEITTLPGGQNLG 356 (563) Q Consensus 283 vFy---IDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDy-wLpR--ReGgrgTEIsTLpGg~nLg 356 (563) +.- .|.+.+++.+ ...+-.+..- +.+. ...+.|..+..|-...+.. T Consensus 289 ~~g~~~~~~~~~~~~~----------------------------~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~ 340 (511) T protein:vir:10 289 IKGNLNLDPVEVRKQK----------------------------EANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQ 340 (511) T ss_pred eeccccCCchhhccch----------------------------hccceecccccccccccccCCCCcceeEEeecCCHH Confidence 221 1111111100 0111111110 0000 0111223344444333333 Q ss_pred hH-HHHHHHHHHHHHhcCCChhhccCCCcceeccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCC Q lcl|NC_015287. 357 EL-SDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEIL--RDEIKFTKFVGRMRKRFSHLFNDMLKTQLIL----KNI 429 (563) Q Consensus 357 ei-~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eIt--RDElKF~KfI~rLr~rFs~if~d~Lk~qLiL----kgI 429 (563) .+ .-+.-+.+.+|.-.++|---. + +|. |..|.+. .-......-+.+.++.|..-+.+.++.=+-+ .++ T Consensus 341 ~~e~~~~~L~~~I~~~s~~P~~~~--~--~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~ 415 (511) T protein:vir:10 341 GTEAYKDRLNSDIHMFTNTPNMKD--D--NFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSI 415 (511) T ss_pred HHHHHHHHHHHHHHHHhCCccccc--c--ccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCc Confidence 22 345556777888888885222 1 222 2222222 1112233345556666666666655542222 122 Q ss_pred CChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcC Q lcl|NC_015287. 430 VTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKD 509 (563) Q Consensus 430 ~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~ 509 (563) -...+|. .|.+.|...--=.+ .+.++++..+ +| .+|.+++++.+-..+| .+++-++|++|..+ T Consensus 416 ~~~~d~~----~i~i~f~~~~p~d~-------~~~~~~~~kl---~G-~iS~et~~~~l~~v~d--~~~E~~ri~~E~~~ 478 (511) T protein:vir:10 416 DANKDFN----TVRYVYNRNLPKSL-------IEELKAYIDS---GG-KISQTTLMSLFSFFQD--PELEVKKIEEDEKE 478 (511) T ss_pred ccccccc----eeeEEeCCCCCcCH-------HHHHHHHHHH---hc-cCcHHHHHHhCCCCCC--HHHHHHHHHHHHHH Confidence 2334443 57788865322222 2345556655 35 4899999988754443 33444445555332 Q ss_pred CCCC-CCcccCccccccccccCCCCcCCCCCcccccccCCCCcccC Q lcl|NC_015287. 510 GKLA-DPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQA 554 (563) Q Consensus 510 ~~~~-~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~ 554 (563) ..-. .......++. ..+.+.+...++.+.++- T Consensus 479 ~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:10 479 SIKKAQKGIYKDPRD-------------INDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHHHhhhcccCCCC-------------CCCCCCCCcccCcccccC Confidence 1000 0000000000 000111111111111111 No 60 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=98.34 E-value=1.2e-06 Score=53.10 Aligned_cols=441 Identities=11% Similarity=0.072 Sum_probs=188.7 Q ss_pred cccc--hhhHhhhcccccccccCcCCCCcccceeccc---cccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhc Q lcl|NC_015287. 4 LFGF--KIQKDDEQNKNVVSPVPQSNEDSSDYYVSSG---FYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIIN 78 (563) Q Consensus 4 ~~~f--~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~---~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvn 78 (563) +|.+ |+-.-.. .+. .+...+.. ....++ ++...++ -..|- +.+..+|-|.+||+-|.+ T Consensus 1 ~~~~~~~~~~~~~-----~~~-~~~~~~~~-~~~~~~~~~~~~pp~~-------~~~La---~~~~~n~~v~scI~~ia~ 63 (540) T protein:vir:41 1 MFNYHLSIKSLEK-----YRA-IKGDTDSQ-ALKEDRFEEYVEPKVH-------PLVLL---SLLQVNPYHASACSIKAN 63 (540) T ss_pred CCCcccChhhccc-----hhh-hhcccccc-ccccCCCCccccCCCC-------HHHHH---HHHHhcHHHHHHHHHHHH Confidence 5655 2211110 000 01011100 011122 2222222 11222 223567888999999998 Q ss_pred ceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHH----HHHHHhhhhhhheeeeecCCCCCcceeE Q lcl|NC_015287. 79 EAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHE----ILRNWYVDGRIYYHKVIDIKAPEEGIKE 154 (563) Q Consensus 79 eaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~----~~R~WYvDGri~fhkviD~~~~k~GI~e 154 (563) .+.-+ +..+.-++-.+ .. | +-+....+++ ++..+++.|.-|++++.|.. .-+.+ T Consensus 64 ~ia~~-----~~~i~~~~~~~----~~-----~-----lpN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~---G~~~~ 121 (540) T protein:vir:41 64 DILRT-----GYLIDGDDGGV----EE-----L-----LRACRPSFEFILLQALEDLQVFNYCTLEVVRDDQ---GEPVR 121 (540) T ss_pred HHhcC-----CceEecCccch----hh-----h-----ccCCCCCHHHHHHHHHHHHHhcCCeEEEEEECCC---CcEEE Confidence 66432 22333222111 10 0 1233334444 44556778999999988743 34899 Q ss_pred EEEeCCccceeeeeeeccCCccccceeeeccccccccccccccee-eecCccccccccccccCcCccccCcceeeecccc Q lcl|NC_015287. 155 VRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYY-QYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDA 233 (563) Q Consensus 155 Lr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf-~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~da 233 (563) |.+|+|..|+..+--. .-+.+..+. ..-|| .|..... .. ...+.....++.+. T Consensus 122 L~~i~~~~V~v~~~~~------~~~~~~d~~---------~~~~~~~~~~~~~-----~~------~~~g~~~~~~~~~e 175 (540) T protein:vir:41 122 LDYIPAHTVRVHRDGS------RYMQTWDGI---------HVTYFKDYRYEGE-----VN------PDNGEDQDGVGANE 175 (540) T ss_pred EEEeCCcceEEeEcCc------eeEeeecCc---------eeeeeecccccce-----ee------ccccccceeecccc Confidence 9999999887543110 011111110 00011 1111000 00 01112234566666 Q ss_pred eeeeecceec-CCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCch----HHHHHHHHHHHHhhc Q lcl|NC_015287. 234 ISFCHSGLVD-RNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPK----MKAEQYLKEVMNRYR 308 (563) Q Consensus 234 I~y~hSGl~d-~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk----~KAeqYl~~im~k~k 308 (563) |.|.+- .+ .++-.++|-|..|++.+.....+++...=+----|--.-|..++.+-.++ .++.+-+++.+.++- T Consensus 176 ViHir~--~~~~~~~~G~Spi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~ 253 (540) T protein:vir:41 176 IIFIHL--PSPICSYYGVPRYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLI 253 (540) T ss_pred eEEecC--CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHH Confidence 654432 22 34446689999999988888777776543322234445566666433332 222333333222211 Q ss_pred ceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchH---HHHHHHHHHHHHhcCCChhhccC-CCc Q lcl|NC_015287. 309 NKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGEL---SDIEYFQKKLYRSLGVPESRIAG-SGD 384 (563) Q Consensus 309 nklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei---~DV~YF~~kLy~aL~VP~sRl~~-~~~ 384 (563) .. .-.|-..+..+.| .|+.=. .+..|.+++.|. .+..++ +-.++..+.+.++++||...+.- +++ T Consensus 254 ~~----~~~g~~~nag~~~-vLe~~~----~~~~g~~~~pl~--~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~ 322 (540) T protein:vir:41 254 ED----NFKYLKEAPHTPL-VFSIPG----GDTVEVTFTPLN--TSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVG 322 (540) T ss_pred HH----HhccccccccceE-EEecCC----CcccceeEEecc--cchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCC Confidence 11 0111111122222 222110 012345555553 333333 34567788899999999999942 322 Q ss_pred ceeccccchhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHH Q lcl|NC_015287. 385 GFNLGRSSEILRDEIKFTKF-VGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNE 463 (563) Q Consensus 385 g~~~g~~~eItRDElKF~Kf-I~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~ 463 (563) +++.....+.. ..|.+. +.-+..++...+...|.+++ + ..+.+.|..+. +...++ .. T Consensus 323 ~~n~sn~eq~~---~~f~~~tL~P~~~~ie~~ln~~L~~~~---------~-----~~~~i~f~~~~----ll~~D~-~~ 380 (540) T protein:vir:41 323 PLGGNFAEVAR---RTYYESVVRPQQEIVSSVLTDFIQLKL---------D-----PGARFVFNEEI----LMESEF-VH 380 (540) T ss_pred CCCcccHHHHH---HHHHHHHHHHHHHHHHHHHHHhhhhcc---------C-----CceEEEecchh----hcchHH-HH Confidence 33333322222 335444 66777777776665543321 1 23556666543 222221 22 Q ss_pred HHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCC---CcCCCCCc Q lcl|NC_015287. 464 RLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPG---GAMNPQDP 540 (563) Q Consensus 464 R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~---~~~~p~~~ 540 (563) +++.+- -.-++|..-+|.+++.+-+-+ ..+-.|......+......+..+ .+..+... T Consensus 381 ~~~~lv-----~~G~lT~NE~Re~L~g~e~gd--------------d~~l~p~n~~~~~~~~~~~~~~~~~~~~~~k~~~ 441 (540) T protein:vir:41 381 NYALLV-----QCGVLTPSEVREKLFGLDGGP--------------DMFMVPSSIGKSAMKRQKRNYEKNQINEIKRTYA 441 (540) T ss_pred HHHHHH-----hCCCCCHHHHHHHhCcCcCCC--------------cccccccccccccccccccccCCCCccccccccc Confidence 332221 123566666654432222110 11111111000000000000000 11122223 Q ss_pred ccccccCCCCcccCCCCCCCCCC Q lcl|NC_015287. 541 NMGGAPQEGGAEQAPQMPEGGEI 563 (563) Q Consensus 541 ~~~~~~~~~~~~~~~~~~~~~~~ 563 (563) ...++.++...++.|.-..+++| T Consensus 442 ~~~~~~~~~~~~~~~~~~~~~~~ 464 (540) T protein:vir:41 442 KYKPRIQEIISSESPLEDKKKKI 464 (540) T ss_pred hhcccccCccccccccccccccc Confidence 34455555555566666666777 No 61 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=98.28 E-value=1.7e-06 Score=52.25 Aligned_cols=460 Identities=14% Similarity=0.074 Sum_probs=191.9 Q ss_pred ccccccccCcCCCCcccceeccc--------------cccccccccchhhhHH-HHHHHHHHH-hhccchhHHHHHhhcc Q lcl|NC_015287. 16 NKNVVSPVPQSNEDSSDYYVSSG--------------FYGQYVDIDGVFKSEF-ELIKRYREM-ALHPEVDSAIEDIINE 79 (563) Q Consensus 16 ~~~~~s~~p~~~~d~~~~~~~~~--------------~~~~~~~~~~~~~~~~-~LI~~YR~m-a~~PEvd~Ai~eIvne 79 (563) .. +++|-..+.......-.. +..||-+- ..+.... ..=+.+|.+ +-..=+.-+|+..+.- T Consensus 1 ~~---~~~~~~~e~~~~~~~~~~l~~~~~~~~~r~~~l~~YY~G~-~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~~ 76 (486) T protein:vir:42 1 MT---APLPGMEEIEDPAVVREEMISAFEDASKDLASNTSYYDAE-RRPEAIGVTVPREMQQLLAHVGYPRLYVDSVAER 76 (486) T ss_pred CC---CCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhccc-CcchhcccccchhHhhhhhccchHHHHHHHHHhh Confidence 22 333333222211111000 11111110 0110000 000111111 1112223334333321 Q ss_pred eEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCC----CCCcceeEE Q lcl|NC_015287. 80 AIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIK----APEEGIKEV 155 (563) Q Consensus 80 aiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~----~~k~GI~eL 155 (563) .+ ++...+.+ .+...+++..|++-.+|+....++++.-++.||-|.+.-.|.. ...+|...+ T Consensus 77 l~-------~~g~~~~~-------~~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i 142 (486) T protein:vir:42 77 QA-------VEGFRLGD-------ADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPII 142 (486) T ss_pred hc-------ccceecCC-------CchhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEE Confidence 10 11111111 1123456778888889999999999999999999887655431 245778888 Q ss_pred EEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccc-e Q lcl|NC_015287. 156 RYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDA-I 234 (563) Q Consensus 156 r~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~da-I 234 (563) +.+||+.+-.+.-- ........+.+... . .......+-+|.+..+...-.. .|+.... ....-.+..=+ | T Consensus 143 ~~~~p~~~~~i~d~-~~~~~~~~~~~~~~---~--~~~~~~~~~~y~~~~~~~~~~~--~~~~~~~-~~~~h~~g~vPvv 213 (486) T protein:vir:42 143 RVEPPTRMHAEIDP-RINRVSKAIRVAYD---K--EGNEIQAATLYTPMETIGWFRA--DGEWAEW-FNVPHGLGVVPVV 213 (486) T ss_pred EEecccceEEEEeC-CCCCeEEEEEEEEe---c--CCCeEEEEEEEcCCcEEEEEec--CCcEEee-cceecCCCCceEE Confidence 99999987776431 11111111111110 0 0111122223433322222110 0110000 00000011112 2 Q ss_pred eeeecceecCCCCccchhHHH----HHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcce Q lcl|NC_015287. 235 SFCHSGLVDRNKQTILSYLHK----SIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNK 310 (563) Q Consensus 235 ~y~hSGl~d~~~~~~~syL~~----Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk 310 (563) .|++-- +..+.-+.|-|.+ .+..+|. .+-+..++-..+-.|.|-|.-.+....+.... +.+ T Consensus 214 ~~~n~~--~~~~~~G~s~i~~~v~~liDa~~~--~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~-----------~~~ 278 (486) T protein:vir:42 214 PLPNRT--RLSDLYGTSEITPELRSMTDAAAR--ILMLMQATAELMGVPQRLIFGIKPEEIGVDSE-----------TGQ 278 (486) T ss_pred Eecccc--ccCCCCCcccchhhHHHHHHHHHH--HHHHHHHHHHhhcchHHHhhcCCccccccccc-----------ccc Confidence 244321 1111112333333 2333332 34455566666666666555333222111000 011 Q ss_pred eEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccc Q lcl|NC_015287. 311 LVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGR 390 (563) Q Consensus 311 lvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~ 390 (563) .+.++..| ..|..-.+ ..++..+|+..-=.-++-++=.-.++...-++|..-|... +-|-.. T Consensus 279 ~~~~~~~~-------------~~~~~~~~---~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~--~~n~~S 340 (486) T protein:vir:42 279 TLFDAYLA-------------RILAFEDA---EGKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYLSTA--ADNPAS 340 (486) T ss_pred chhhhhhc-------------hhcccCCC---CceEEeecccCHHHHHHHHHHHHHHHhcccCCCHHHhccc--cCchhH Confidence 11122222 22322111 2345566654311223334444455566678887777322 112112 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC-hhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 391 SSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVT-PEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVA 469 (563) Q Consensus 391 ~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t-~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~ 469 (563) +..|.--+..+..-+.+.|..|..-+...++.-+.+.|... +.+| ..|++.|.....=+. .+.++.+. T Consensus 341 g~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~d~----~~i~v~w~~~~~~s~-------~~~ad~~~ 409 (486) T protein:vir:42 341 AEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMKGGDVPPDM----LRMETVWRDPSTPTY-------AAKADAAT 409 (486) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc----eeeeEEecCCCCCCH-------HHHHHHHH Confidence 33455566667788899999999999999998777766532 2233 358888865433222 34455555 Q ss_pred HhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCC Q lcl|NC_015287. 470 AVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEG 549 (563) Q Consensus 470 ~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~ 549 (563) .+..-...+.|.++++ .++..++.+++++++..+++...+... -+......+ ...+...|..++..++.... T Consensus 410 kl~~~~~g~~s~et~~-~~lg~~~d~~~e~~~~~~e~~~~~~~~-~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~ 481 (486) T protein:vir:42 410 KLYGNGQGVIPRERAR-IDMGYSVKEREEMRRWDEEEAAMGLGL-LGTMVDADP------TVPGSPSPTAPPKPQPAIES 481 (486) T ss_pred HHHhcccCCCCHHHHH-hcCCCChhHHHHHHHHHHHHHHHHHHH-HHHhhcCCC------CCCCCCCCCCCCCCCcccCC Confidence 5544333578888887 578899988887776544443222111 000000000 00011111111111111111 Q ss_pred CcccC Q lcl|NC_015287. 550 GAEQA 554 (563) Q Consensus 550 ~~~~~ 554 (563) .++.+ T Consensus 482 ~~~~~ 486 (486) T protein:vir:42 482 SGGDA 486 (486) T ss_pred CCCCC Confidence 11111 No 62 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=98.26 E-value=1.9e-06 Score=51.96 Aligned_cols=451 Identities=14% Similarity=0.142 Sum_probs=217.0 Q ss_pred ccchhhHhhhccccccc-----ccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHh--hccchh------- Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVS-----PVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMA--LHPEVD------- 70 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s-----~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma--~~PEvd------- 70 (563) ||. +.+-+.-.+.... ++-+..+. ..+ .-..+..+-|++|+.|. .+|+|. T Consensus 1 m~~-~~~ik~~~~~~~~~~~~~~~~~~~~~--------------~~i-~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~ 64 (517) T protein:vir:98 1 MKV-IQRIKNFFKRGGYALSGQTLKSINDH--------------EKI-NIDPNELARIERNLRQYEGDYPQVEYINSQGK 64 (517) T ss_pred Cch-HHHHHHHHHHHHHHhcccchhHhhcC--------------Cce-ecCHHHHHHHHHHHHHhcCCCccccccccccc Confidence 555 3333333322111 00000110 011 11233456677777773 345442 Q ss_pred -------------HHHHHhhcceEEecCCCcEEEEEeeccccc---hHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhh Q lcl|NC_015287. 71 -------------SAIEDIINEAIVSDQNDSPVEIDLENLPAS---KQLKDLIREEFKTVKEILNFDDKCHEILRNWYVD 134 (563) Q Consensus 71 -------------~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s---~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvD 134 (563) .+..++.+ +| -..+|+|.+++.... ......-.+..+.++.-.+|.....+.+.....- T Consensus 65 ~~~~~~~sl~~~~~i~~~~A~--Ll---~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~ 139 (517) T protein:vir:98 65 IQERDYMTLNLRKLSADVLSG--LV---FNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFAL 139 (517) T ss_pred ccccceeecCcHHHHHHHhhh--hh---cCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhh Confidence 12222222 11 224666666654321 1222335677889999999999999999999999 Q ss_pred hhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccce-e-eecccccccccccccceeeecC------ccc Q lcl|NC_015287. 135 GRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAIN-Q-VNSKNPADIENPEIEEYYQYDP------SQT 206 (563) Q Consensus 135 Gri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~-~-~~~~~~~~~~~~~~~eyf~y~~------~~~ 206 (563) |..+|-..+|. |-+.+.+++|.++-+++......+ ...+- . .........+|-- -|+..... +.. T Consensus 140 G~~a~k~~~d~-----~~~~I~~v~ad~~~Pl~~~~~~v~-~~ai~~~~~~~~~~~~~~Yt~-lE~H~~~~~~~~~~~y~ 212 (517) T protein:vir:98 140 GGLTVRPYVDN-----GEIEFSWALANAFYPLRSNSNGIS-EGVMKSVTTKVIGNKTVYYTL-LEFHEWEKTEEGESLYV 212 (517) T ss_pred CCEEEEEEEeC-----CeeEEEEEcCCeeEEEEecCCCeE-EEEEEEEEEEeecCCceEEEE-EEEEecCceeccCCcEE Confidence 99999888984 345589999999887654211100 00000 0 0000001111110 11111100 000 Q ss_pred ccccc-----ccccCcCccccCcceeeec--ccceeeeec-------------ceecCCCCccchhHHHHHHHHHHHHHH Q lcl|NC_015287. 207 QGKNA-----LGAIGQTPFSTKQRPVKIA--PDAISFCHS-------------GLVDRNKQTILSYLHKSIKALNQLRMI 266 (563) Q Consensus 207 ~~~~~-----~g~~g~~~~~~~~~~iki~--~daI~y~hS-------------Gl~d~~~~~~~syL~~Aik~~NqL~m~ 266 (563) ..|-. ....|.. .. -..+ +. .+.+++.+. --.+..+..++|-++.|+-.+.-|-.. T Consensus 213 I~n~ly~s~~~~~lG~~-v~--L~~~-~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~ 288 (517) T protein:vir:98 213 ITNELYKSDNEGEIGKR-IP--LEEL-YEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDT 288 (517) T ss_pred EEEEEEecCCCcccccc-cc--cccc-ccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHH Confidence 00000 0001110 00 0000 00 112222211 012223456789999999888887755 Q ss_pred HHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccce Q lcl|NC_015287. 267 EDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEI 346 (563) Q Consensus 267 EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEI 346 (563) -+. +.|..|.=.+|||. +..=|++.... +......++| .+++.++.+.-+ .|+. -| T Consensus 289 ~s~--~~~e~~~g~~~i~v-p~~~l~~~~~~-------~g~~~~~~~d------~~~~~y~~~~~~------~~~~--~i 344 (517) T protein:vir:98 289 YDQ--FWWEIKMGQRTVFV-SDVMLRTVPDE-------SGMPPPQVFD------PDVNVYKSIRMG------TDEE--FV 344 (517) T ss_pred HHH--HHHHHHhCCcceec-ChhhhccccCC-------CCcccCCCCC------cccceeeeccCC------CCCC--ce Confidence 554 45888887777775 22211110000 0000000111 122333332211 1111 14 Q ss_pred eecCCCCCc-chHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 347 TTLPGGQNL-GELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLI 425 (563) Q Consensus 347 sTLpGg~nL-gei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLi 425 (563) +++.+.==- .-..-+.++.+.+-...+++-+-|..++.| ..-|+||...+-.-..-+.+.|+.+...+.++++.-|. T Consensus 345 ~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~--~kTATEi~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~ 422 (517) T protein:vir:98 345 KDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRS--MKTATEIVSENDLTYRTRNDHVYEVEQFIKGLVISVLE 422 (517) T ss_pred eeeccccchHHHHHHHHHHHHHHHHHhCCCcccccccccc--cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 444432100 123456778888888999999999765432 34577887776777778888888888888888887654 Q ss_pred hc-------CCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHH Q lcl|NC_015287. 426 LK-------NIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIE 498 (563) Q Consensus 426 Lk-------gI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e 498 (563) |. |.+.+. ..+.++|.. +.+.. ++++. +.+.++--- | .+|....+++...++|+|.++ T Consensus 423 l~~~~~~~~~~~~~~------~~v~v~f~D-~i~~D-~~~~~-----~~~~~~v~a-G-~ms~~~~i~~~~g~~eeeA~~ 487 (517) T protein:vir:98 423 LAKTYKLFGGEIPSA------EHIGVDFDD-GVFQD-RSALL-----RFYGQAKTF-G-FIPTVEAIQRIFKVPKKTAEQ 487 (517) T ss_pred HHHHHhhcCCCCCCC------cceEEEcCC-CCCCC-HHHHH-----HHHHHHHhc-C-CCCHHHHHHHhCCCChHHHHH Confidence 32 222221 346777753 33333 22221 111121111 3 478888777788999999999 Q ss_pred HHHHHHHHhcCCCCCCCcccCccccccccccCCCCcC Q lcl|NC_015287. 499 IDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAM 535 (563) Q Consensus 499 ~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~ 535 (563) +-.+|++|... .+|....+.. .+...+.++ T Consensus 488 e~~~i~~E~~~---~~~~~~~~~~----~~~~~gd~e 517 (517) T protein:vir:98 488 WLEEIRKDQIE---LDPVTISQRA----QKRMFGDEE 517 (517) T ss_pred HHHHHHHhccc---cCCCCccccc----cCCCCCCCC Confidence 99999888542 1121111111 111111111 No 63 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=98.25 E-value=2.1e-06 Score=51.72 Aligned_cols=457 Identities=15% Similarity=0.180 Sum_probs=187.2 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhh-HHHHHHHHHHHhhccchhHHHHHhhcc Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKS-EFELIKRYREMALHPEVDSAIEDIINE 79 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~-~~~LI~~YR~ma~~PEvd~Ai~eIvne 79 (563) ++++|-.+..-+..-.++..+- .-+|++..++-.+=..+ -+==+-.--.|||+||.+++++-|..| T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e 134 (694) T protein:vir:10 68 LARQFEVDVSNYTPRERRAASY-------------ALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADE 134 (694) T ss_pred hhhhccccccCCCccccchhhh-------------hhccCcccccchhhhhccCcchHHHHHHHhhccchhhHHHHHHHH Confidence 4444433322222111111110 00111111110000000 000012234689999999999999999 Q ss_pred eE------EecCCCcE----EEEEeeccccch-HHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCC-- Q lcl|NC_015287. 80 AI------VSDQNDSP----VEIDLENLPASK-QLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIK-- 146 (563) Q Consensus 80 ai------v~d~~~~p----V~l~l~~~~~s~-sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~-- 146 (563) |+ ++...++. +.+.-+....++ .-.++|..|++. |+...+.-+.++-=-+.|.=.-...|+-. T Consensus 135 ~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~er----l~V~~~l~eaik~aRlfGGa~~~i~I~gdd~ 210 (694) T protein:vir:10 135 CIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIER----LRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQ 210 (694) T ss_pred hhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHH----HHHHHHHHHHHHhhccccceEEEEEeecCcc Confidence 93 33322221 333333333344 344567777664 33344444444433344442111233222 Q ss_pred ------------CCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccc Q lcl|NC_015287. 147 ------------APEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGA 214 (563) Q Consensus 147 ------------~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~ 214 (563) -.|.+++.|+.|||..+-+= . .+. .++..-.|-.-..|++.. T Consensus 211 ~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~-------~-~n~------~dP~spdfgkP~~y~V~G------------ 264 (694) T protein:vir:10 211 IMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPN-------N-YNS------INPVADDFYKPSTWWMIG------------ 264 (694) T ss_pred ccccccccccccccCcceeeeEeecccccccc-------h-hhh------ccchhhccCCCceEEEec------------ Confidence 23457788999997543331 1 110 011111222222233311 Q ss_pred cCcCccccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHH-HHHHHHHHHHHHHhcCccceEEEccCC-CCc Q lcl|NC_015287. 215 IGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQ-LRMIEDSLVIYRLSRAPERRIFYIDVG-NLP 292 (563) Q Consensus 215 ~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~Nq-L~m~EDalVIyRi~RAPeRRvFyIDvG-nlp 292 (563) . ..|.++.+.+...++.+. |-......+.|.+..+..-..+ +++...+.=+- +.+.-+ ++-.|.. -|. T Consensus 265 --~--~IH~SRL~~f~g~plPd~---LKp~y~~~G~Sv~q~~~e~V~~~~rT~~~v~~Li--~~~~v~-~lk~dla~~L~ 334 (694) T protein:vir:10 265 --T--EVHATRLHTIVSRPVGDM---LKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIV--KQFSVS-GILMDLAQALM 334 (694) T ss_pred --e--EEeeeeEEEecCCCchhh---hhcccccCcccHHHHHHHHHHHHHHHHhHHHHHH--HhhhhH-HHHHHHHHhhc Confidence 0 133344444443333222 1111223345666665544333 34333332221 111111 1111211 011 Q ss_pred hHHHHHHH--HHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHH-HHHHHHH Q lcl|NC_015287. 293 KMKAEQYL--KEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIE-YFQKKLY 369 (563) Q Consensus 293 k~KAeqYl--~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~-YF~~kLy 369 (563) ....++.. -+++++||.- -|-+-=|+ =.|||- +. ..+|+-++||. =|..-+= T Consensus 335 ~g~~~~l~~R~eli~~~Rsn------~G~~llDk----~~Eefe-------------q~--stslSGLddVi~qf~q~VA 389 (694) T protein:vir:10 335 PGANVDLSMRAELINRYRDN------RNILFLDK----ATEEFF-------------QF--NTPLSGLDALQAQAQEQMS 389 (694) T ss_pred ChhHHHHHHHHHHHHHhcCc------cceEEEec----CCcceE-------------EE--ecccCCHHHHHHHHHHHHH Confidence 11112222 2455666521 12110000 013442 11 24788899975 4888888 Q ss_pred HhcCCChhhccC-CCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHhcCCCChhHHHHhhhceE Q lcl|NC_015287. 370 RSLGVPESRIAG-SGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKT-----QLILKNIVTPEDWEVLGDHIQ 443 (563) Q Consensus 370 ~aL~VP~sRl~~-~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~-----qLiLkgI~t~eew~~~~~~i~ 443 (563) -+.+||+.||=+ ...|||--.-+ |.-.|...|..+|. ..+...|++ |+-+-|.+. ..|. T Consensus 390 gaa~IPltkLfGqSPkGlNATGE~----D~rnYYD~I~s~Qe---~~L~p~L~rl~~ii~rS~~G~id--------p~i~ 454 (694) T protein:vir:10 390 AVSHIPLIKLLGITPTGLNASSEG----EIRVWYDYVRAYQR---NALQQLMNDVIVMIQLSLFGAVD--------PSIK 454 (694) T ss_pred hhhcCchhhhhccCcccccccchh----hHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHhcCCCC--------Ccce Confidence 899999999943 34477653222 23458889988875 334444433 444445443 4588 Q ss_pred EEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhcc------------------ChhhHHHHHHHHHH Q lcl|NC_015287. 444 YDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQ------------------KDEEVIEIDKQMEK 505 (563) Q Consensus 444 ~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~------------------tdeei~e~~~qie~ 505 (563) |+|+.=..-+|..-+||...+.+....+-.- | -++.+-|+..+... +|.||.-+...- T Consensus 455 ~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~-g-vI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~-- 530 (694) T protein:vir:10 455 WQWNALRELDDLEVAESRYKQAQSDVLYVQE-Q-VIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYV-- 530 (694) T ss_pred EEeCCCCCcCHHHHHHHHhhhhHHHHHHHHh-c-CCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhh-- Confidence 9999777778888888888887765433211 1 12222222221111 111111110000 Q ss_pred HhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccCCCC-CC-------CCCC Q lcl|NC_015287. 506 EIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAPQM-PE-------GGEI 563 (563) Q Consensus 506 E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~-~~-------~~~~ 563 (563) + + ..+.++.++..+...+.--.|.-.++....+++.++..+++ .- .|.| T Consensus 531 ---~-----~-~~~~~~~~~~~~~~~g~~~~~~v~~~~~~~~~~~ag~~~~~~~~ag~v~~~~g~v 587 (694) T protein:vir:10 531 ---Q-----R-LAEGGDTGAPGGARAGATAPPTVANVNANVNPREAGAQDAAMRAAGAVYVVDGKV 587 (694) T ss_pred ---c-----C-cccccccCCCCcccccccCCCcccccccccCccccCCCCccceeeEEEEEeCCEE Confidence 0 0 01111111111212222222222333333344333322222 21 2222 No 64 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=98.22 E-value=2.5e-06 Score=51.33 Aligned_cols=410 Identities=8% Similarity=0.061 Sum_probs=163.9 Q ss_pred HHHHhh-ccchhHHHHHhhcceEEecCCCcEEEEEeec-cccchHHHHHHHHHHHHHHHHhc-------------chhhh Q lcl|NC_015287. 60 YREMAL-HPEVDSAIEDIINEAIVSDQNDSPVEIDLEN-LPASKQLKDLIREEFKTVKEILN-------------FDDKC 124 (563) Q Consensus 60 YR~ma~-~PEvd~Ai~eIvneaiv~d~~~~pV~l~l~~-~~~s~sik~~I~eeF~~i~~ll~-------------F~~~~ 124 (563) -|+|+. +|-|..||+.|.+.+. .-|+.|.... ..... +.....+.+.+.|. +.... T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia-----~~p~~i~~~~~~~~~~----~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~ 71 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVA-----GFGINIIPHPEAEDPD----RDGEQYERVWDFWFGDDSNWQVGPMESERATA 71 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhh-----cCCeEEEEccCccccc----chhhhhhhHHHHhhccCCCccccchhhHhhHH Confidence 677755 6999999999999764 3355544322 11112 12222232222111 12234 Q ss_pred HHHHHHH----hhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceee Q lcl|NC_015287. 125 HEILRNW----YVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQ 200 (563) Q Consensus 125 ~~~~R~W----YvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~ 200 (563) .++++.+ ++.|.-|.+++.|.. ..+++|.+|||.+|+..+-... -+..+ ....-||. T Consensus 72 ~~~~~~~~~~l~l~Gn~~i~~~r~~~---G~~~~l~~l~~~~v~~~~d~~~------~~~~~----------~~~~~~~~ 132 (467) T protein:vir:31 72 TNVLQTAWTDYEAIGWLTIEILTQTD---GTPTGLAYVPGHTIRKRMDERG------FVQLL----------EEKEKYFG 132 (467) T ss_pred HHHHHHHHHHHHhcCCeEEEEEECCC---CcEEEEEEeCCceeEeeeecce------eEeec----------CCceeeEE Confidence 4444444 445888888887643 3489999999988876532111 00000 00112222 Q ss_pred ecCccccccc---cccccCcCccccCcceeeecccceeeeecceec-CCCCccchhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015287. 201 YDPSQTQGKN---ALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVD-RNKQTILSYLHKSIKALNQLRMIEDSLVIYRLS 276 (563) Q Consensus 201 y~~~~~~~~~---~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d-~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~ 276 (563) +......... ...........+....+.++.+.|.|.. ..+ .++-.++|-+..|.+.+..-..++....=+ .. T Consensus 133 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~diih~r--~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~-f~ 209 (467) T protein:vir:31 133 VAGDRYQTNGNGDLDPVFVDADDGSTGTSVSNPANELIFKR--NHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDF-FE 209 (467) T ss_pred eccccceeecccceeeeeeeeccccccceeEeccccEEEec--CCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHH-Hh Confidence 2111100000 0000111111223445778888885442 233 344566899999988876665555443211 11 Q ss_pred cC-ccceEEEccCCCCchHHHHHHHHHHHH-hhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCC Q lcl|NC_015287. 277 RA-PERRIFYIDVGNLPKMKAEQYLKEVMN-RYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQN 354 (563) Q Consensus 277 RA-PeRRvFyIDvGnlpk~KAeqYl~~im~-k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~n 354 (563) ++ --+-|..+.-+.+.+..+ +-+++.+. .|++..- -.++.-.+...|-.+..|++|.. T Consensus 210 ng~~p~gil~~~~~~l~~e~~-~~~~~~~~~~~~~~~~-------------------~~~~~~~g~~n~~~~~~l~~g~~ 269 (467) T protein:vir:31 210 NDGVPRIAIIVKGAELTEKGR-EEMRNLIEDNNEDNHR-------------------TAFIETEKIVQNEDYLNLADGAD 269 (467) T ss_pred ccCCCceEEEecCcCCCHHHH-HHHHHHHHhhhcchhh-------------------hhhhhhcccccccccccccCCCc Confidence 22 223445555455555444 44444443 3332110 00111111111222333333321 Q ss_pred -----------------cchHHHH-HHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHHHH-HHHHHHHHHHH Q lcl|NC_015287. 355 -----------------LGELSDI-EYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTKF-VGRMRKRFSHL 415 (563) Q Consensus 355 -----------------Lgei~DV-~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~Kf-I~rLr~rFs~i 415 (563) =.|+-+. ++..+...++.+||.+.|...+ +-+.+ +.+...-..|.++ +.-+..++... T Consensus 270 ~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~-~~~~~--s~~e~~~~~f~~~~l~P~~~~ie~~ 346 (467) T protein:vir:31 270 RSDVEIRLEPLTVGIDEEASFLEFRGRNEHDILKVHDVPPVIAGVVE-SGAFS--TDAEEQRKEFAEETIQPKQHDFGEL 346 (467) T ss_pred ccccceeEEeccccChhhHHHHHHHHHHHHHHHHHhCCCHHHcccCC-CCCcc--cCHHHHHHHHHHHHHHHHHHHHHHH Confidence 1222222 3455669999999999884221 11222 2233333345444 45555555555 Q ss_pred HHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhh Q lcl|NC_015287. 416 FNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEE 495 (563) Q Consensus 416 f~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdee 495 (563) |...|- +..+. .....|+|++..--.-. +..|++.+..+-. .-++|..-+++.. .+. T Consensus 347 ln~~l~---------~~~~~-~~~~~i~f~~~~l~~~d-------~~~~~~~~~~~~~--~G~~T~NE~R~~~-Gl~--- 403 (467) T protein:vir:31 347 LYELVH---------KQGLD-APDWTIEFELAKPDTKL-------QDVEIASQRVQAM--QGLLTVNELRDEF-GFE--- 403 (467) T ss_pred HHHhhc---------chhhc-cCCceEEEecchhhccC-------HHHHHHHHHHHHh--CCCcCHHHHHHHh-CCC--- Confidence 444332 21111 11134666555422111 2344444433311 1245555444332 221 Q ss_pred HHHHHHHHHHHhcCCCCCCCcc----c--CccccccccccCCCCcCCCCCcccccccCC-----CCcccCCCCCCCCCC Q lcl|NC_015287. 496 VIEIDKQMEKEIKDGKLADPME----V--QQLQLGVHPEQMPGGAMNPQDPNMGGAPQE-----GGAEQAPQMPEGGEI 563 (563) Q Consensus 496 i~e~~~qie~E~~~~~~~~p~~----~--~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~-----~~~~~~~~~~~~~~~ 563 (563) +++++.- . .....++.+....+ ..|..+....+.+. ..-+..-.+-.|..- T Consensus 404 ---------------pi~d~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 465 (467) T protein:vir:31 404 ---------------PFPEEHVYGGETLVAEVTGGSGPGGGIG--DQIEQLVEDRADEIIDSYQADLETEQLIEIGANA 465 (467) T ss_pred ---------------CCCcccccCCcccccccccccCCCCccc--CcCCCCCCCcccchHhhhhhccccchhhhhcccc Confidence 2211000 0 00000000000000 00000000000000 000000000001111 No 65 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=98.22 E-value=2.5e-06 Score=51.32 Aligned_cols=458 Identities=13% Similarity=0.078 Sum_probs=196.7 Q ss_pred cccccCcCCCCcccceec---c----------ccccccccccchhhh-HHHHHHHHHHH-hhccchhHHHHHhhcceEEe Q lcl|NC_015287. 19 VVSPVPQSNEDSSDYYVS---S----------GFYGQYVDIDGVFKS-EFELIKRYREM-ALHPEVDSAIEDIINEAIVS 83 (563) Q Consensus 19 ~~s~~p~~~~d~~~~~~~---~----------~~~~~~~~~~~~~~~-~~~LI~~YR~m-a~~PEvd~Ai~eIvneaiv~ 83 (563) ..+++|-......+-.+. . ....||-|- ..+.. ...+-+.+|.+ +.+.=+.-+|+..+.-.. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~G~-~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~-~ 78 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTERTQDLGDNTAYYESE-RRPDAVGVTVPQQMQKLLAHVGYPRLYIDAIAARQE-L 78 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-ccchhcccccchhHHhhhhhcCcHHHHHHHHHhhhc-c Confidence 111112111111000000 0 000111110 00000 00001111111 112222334444433111 1 Q ss_pred cCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCC----CCcceeEEEEeC Q lcl|NC_015287. 84 DQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKA----PEEGIKEVRYID 159 (563) Q Consensus 84 d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~----~k~GI~eLr~lD 159 (563) + .+. .. + .+...+.+..|.+-.+|+....++++.-++.|+-|++.-.|... ...+...++.++ T Consensus 79 ~----g~~--~~-----~--~~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~ 145 (484) T protein:vir:77 79 E----GFR--LG-----G--ADKADEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEP 145 (484) T ss_pred C----cee--cC-----C--cchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEec Confidence 1 111 11 1 11234667888888999999999999999999988876555332 123455688999 Q ss_pred CccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCcc---c-cCcceeeeccccee Q lcl|NC_015287. 160 PLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPF---S-TKQRPVKIAPDAIS 235 (563) Q Consensus 160 P~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~---~-~~~~~iki~~daI~ 235 (563) |+.+-.+.. +..+..+..++--.. ........+.+|.+......... .|+-.. . +.- -++| .|. T Consensus 146 p~~~~~~~D----~~~~~~~~a~~~~~~--~~~~~~~~~~~y~~~~~~~~~~~--~~~~~~~~~~~~~~--g~vP--vv~ 213 (484) T protein:vir:77 146 PTNLYAQID----PRTRQVMRAIRAIED--EEGNEVIGATLYLPNNTVIWNRE--DGQWVQVANVAHNL--EMVP--VIP 213 (484) T ss_pred cceeEEEec----CCCCceEEEEEEEEe--ecCCcEEEEEEEecCeEEEEEec--CCceEeeccccCCC--CCcc--eEE Confidence 998876642 121221111110000 01112233334444322221110 000000 0 000 0122 233 Q ss_pred eeecceecCCCCccchhHHHHHHHH-HHH-HHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEe Q lcl|NC_015287. 236 FCHSGLVDRNKQTILSYLHKSIKAL-NQL-RMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVY 313 (563) Q Consensus 236 y~hSGl~d~~~~~~~syL~~Aik~~-NqL-~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvY 313 (563) |+|.--.. ..-+.|-+.+.++++ ..+ +.+-+.+++-+..-.|.|-|.-.+...++..+. +...++ T Consensus 214 f~N~~~~~--~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~-----------~~~~~~ 280 (484) T protein:vir:77 214 IPNRTRLS--DLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVKGEELGVDPE-----------TGQTLF 280 (484) T ss_pred eccccccC--ccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCCcchhccccc-----------ccchhh Confidence 44432111 112244455444333 332 445566667676767777654333333222111 001111 Q ss_pred eCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccch Q lcl|NC_015287. 314 DASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSE 393 (563) Q Consensus 314 d~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~e 393 (563) +...| .+|..-- -++.+.+++..+-=+-++-++-.-.++....++|.+-|... +-|...+.. T Consensus 281 ~~~~~-------------~~~~~~~---~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~--~~n~~Sg~A 342 (484) T protein:vir:77 281 DAYLA-------------RILAFED---HESKAQQFSAAELRNFVDALDALDRKAAAYTGLPPYYLSFS--SENPASAEA 342 (484) T ss_pred hhhhh-------------hhcccCC---CCceeEeecCCChHHHHHHHHHHHHHHhcccCCCHHHhccc--cCcchHHHH Confidence 11111 3454321 23556677754311223345555566666778888888422 122222334 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-ChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_015287. 394 ILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIV-TPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVD 472 (563) Q Consensus 394 ItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~-t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~ 472 (563) |.--+..+-.-+.+.|+.|..-+.+.++.-+.+.|.. ...+| ..|.+.|..-..-+. .+.++.+.++. T Consensus 343 l~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~----~~i~v~w~~~~~~s~-------~~~ad~~~kl~ 411 (484) T protein:vir:77 343 IRSSESRLVKTVERKNKIFGGAWEQAMRVAYKVMNGGDIPPEY----YRMESIWRDPSTPTY-------AAKADAATKLY 411 (484) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccc----ccceEEecCCCCCCH-------HHHHHHHHHHH Confidence 5555556777788999999999999998877666542 12222 347788854332222 34555555554 Q ss_pred hhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcc Q lcl|NC_015287. 473 PYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAE 552 (563) Q Consensus 473 p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 552 (563) .-.-..+|.++++.. |.+++.+++++++..++|...+.. ..+.....+ +..+ -+|..++..++..++..+ T Consensus 412 ~~g~gi~s~et~~~~-l~~~~~~~~e~~~~~~ee~~~~~~-~~~~~~~~~----~~~~----~~~~~~~~~~~~~~~~~~ 481 (484) T protein:vir:77 412 NNGQGVIPKERARID-MGYSITEREEMRKWDEEEQAQGLG-LMGTMFGTD----PSGG----GNPDNPETPEPQPNPAEE 481 (484) T ss_pred hccCCCCCHHHHHhc-CCCChhHHHHHHHHHHHHHHHHHH-HHhhhcccc----ccCC----CCCCCCCcccccCCCccc Confidence 432246787776655 689998888876655544322110 000000000 0000 012222222222222222 Q ss_pred cCC Q lcl|NC_015287. 553 QAP 555 (563) Q Consensus 553 ~~~ 555 (563) ++- T Consensus 482 ~~~ 484 (484) T protein:vir:77 482 AAA 484 (484) T ss_pred cCC Confidence 221 No 66 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=98.20 E-value=2.7e-06 Score=51.08 Aligned_cols=440 Identities=12% Similarity=0.099 Sum_probs=173.8 Q ss_pred Cccc-ccchhhHhhhccccccccc-CcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchh-------- Q lcl|NC_015287. 1 MAKL-FGFKIQKDDEQNKNVVSPV-PQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVD-------- 70 (563) Q Consensus 1 m~~~-~~f~~~~~~~~~~~~~s~~-p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd-------- 70 (563) |+.. -+.+.....+... ++ |+..... +.+ ..-+..-..-+.+|+.+..+.+-. T Consensus 1 ~~~~~~~~~~~~~~e~~~----~~~~~~~~~~-~~i------------~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~ 63 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVE----QIKPKYETQE-EMI------------LRLVREHKENIDNITMGERYYNHHPDILDAPP 63 (478) T ss_pred CccccCCCCchhHHHHHH----HHhhccCCcH-HHH------------HHHHHHHHHHHHHHHHHHHHhcCCCchhcccc Confidence 4443 2222222222211 11 1111110 000 000000011112222222222111 Q ss_pred -------------------HHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHH Q lcl|NC_015287. 71 -------------------SAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNW 131 (563) Q Consensus 71 -------------------~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~W 131 (563) +-...||+-. +.=-...||.+..++.+ ..+.+..+++ .+|+....++.+.+ T Consensus 64 ~~~~~~~~~~~~~~~ki~~n~~~~ivd~~-~~~l~g~~~~~~~~~d~--------~~~~l~~~~~-n~~~~~~~~~~~~~ 133 (478) T protein:vir:10 64 KRDVNGDYDETKPDWRMYTNYHQNLVDQK-VAYAVANPVTFGVDNDK--------ALKQIQHTLN-HKWDDKLVDILTAA 133 (478) T ss_pred ccccccccccccccceeccchHHHHHHHH-HhhhccCCeeeecCChH--------HHHHHHHHHh-cCHHHHHHHHHHHH Confidence 1112222211 11123567777665422 2333444554 47999999999999 Q ss_pred hhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCcccccccc Q lcl|NC_015287. 132 YVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNA 211 (563) Q Consensus 132 YvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~ 211 (563) ++.|+-|++.-+| ++|-..+..+||+.+..|..-.........+..... .+...+.+|.+........ T Consensus 134 ~~~G~~~~~~~~d----~~g~~~~~~~~p~~~~~i~d~~~~~~~~~~v~~~~~--------~~~~~~~~y~~~~i~~~~~ 201 (478) T protein:vir:10 134 SNKGIEWVQPYVD----EEGEFKTFRVPAEQAVPIWTNKERDELQAFIRVYEL--------DGAERVEYWTKDDVTYYEL 201 (478) T ss_pred HhcCeEEEEEEec----CCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEe--------cCceEEEEEeCCeEEEEEE Confidence 9999999987776 357788999999998877322111111122211111 1112222332222111111 Q ss_pred ccccCcCcc---ccC-cceeeecccceeeeecceec-CCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEE Q lcl|NC_015287. 212 LGAIGQTPF---STK-QRPVKIAPDAISFCHSGLVD-RNKQTILSYLHKSIKALNQLR-MIEDSLVIYRLSRAPERRIFY 285 (563) Q Consensus 212 ~g~~g~~~~---~~~-~~~iki~~daI~y~hSGl~d-~~~~~~~syL~~Aik~~NqL~-m~EDalVIyRi~RAPeRRvFy 285 (563) .+....... ..+ .........+..|..-=++. .|.....|=++..+....-+. ++=+....-+.++.|-+-+.- T Consensus 202 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g 281 (478) T protein:vir:10 202 KEGQLIPDFYRSDDHIQPHYYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKG 281 (478) T ss_pred cCCeeeccccccccccccceecccccccCCccceEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeec Confidence 100000000 000 00000001100000000000 122333455555444444443 444555555666777554433 Q ss_pred ccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchH-HHHHHH Q lcl|NC_015287. 286 IDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGEL-SDIEYF 364 (563) Q Consensus 286 IDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei-~DV~YF 364 (563) .+..+.. +....+....=++++.-+|| ++.+|-...+...+ .-+.-+ T Consensus 282 ~~~~~~~-----------------------------~~~~~~~~~~~~~~~~~~~~---~~~~l~~~~~~~~~~~~~~~l 329 (478) T protein:vir:10 282 YEGEDMK-----------------------------DFMHNLKYYKAISVAGESGS---GVDTIKVEVPIDSVKEYTKML 329 (478) T ss_pred CCccccc-----------------------------hhhhhhhhcceEEecCCCCC---cceEEeecCChHHHHHHHHHH Confidence 2222110 00001110111234433433 35555444444433 456677 Q ss_pred HHHHHHhcCCChhhccCCCcceeccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhce Q lcl|NC_015287. 365 QKKLYRSLGVPESRIAGSGDGFNLGRSSE--ILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHI 442 (563) Q Consensus 365 ~~kLy~aL~VP~sRl~~~~~g~~~g~~~e--ItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i 442 (563) .+.+|+-.++|---. + +|. |..|. |..-+..-..-+.+.+..|...+..+++.=+-+.|+ ..+|. .| T Consensus 330 ~~~i~~~s~~p~~~~--~--~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~g~--~~~~~----~i 398 (478) T protein:vir:10 330 RDYIIEFGQGVDFQQ--D--KFG-NSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRL--DVKVQ----DI 398 (478) T ss_pred HHHHHHHhCccccCc--c--ccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Ccccc----cc Confidence 888999999884222 1 221 22222 222112223345566666666666666554444453 22333 46 Q ss_pred EEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccc Q lcl|NC_015287. 443 QYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQ 522 (563) Q Consensus 443 ~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~ 522 (563) .+.|..-.--.+. +.+++++.+. | .+|.+++++.+-..+| .+++-++|++|..+-.-..++.. T Consensus 399 ~i~f~~~~p~d~~-------e~a~~~~kl~---g-~iS~et~~~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~---- 461 (478) T protein:vir:10 399 EITFNFNVMVNEL-------ENSQIAMNST---G-LLSKETILSNHAWVED--PVAEMERIEQENIELNQQLPDIE---- 461 (478) T ss_pred eEEecCCCCCCHH-------HHHHHHHHHh---C-CCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhccccc---- Confidence 7777543322232 3344555553 3 4899999987753333 33444445444321000000000 Q ss_pred cccccccCCCCcCCCCCcccccc Q lcl|NC_015287. 523 LGVHPEQMPGGAMNPQDPNMGGA 545 (563) Q Consensus 523 ~~~~~~~~~~~~~~p~~~~~~~~ 545 (563) .+ ..+++.+..++..+. T Consensus 462 ~~------~~~~~~~~~~~~~~~ 478 (478) T protein:vir:10 462 EG------LNGEQQRQSENNQPE 478 (478) T ss_pred cc------cCCCCCCCCCCCCCC Confidence 00 000111111100000 No 67 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=98.18 E-value=3e-06 Score=50.89 Aligned_cols=418 Identities=11% Similarity=0.134 Sum_probs=184.5 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) |.+||+|. ++.. ..++....++......- |.....+.+ +. +..+++|-|.+||+-|.+.+ T Consensus 4 ~~~~f~~~-~r~~------~~~~~~~~~~~~~~~~~-g~~~~~~~v-----~~-------~~al~~~~v~~~i~~ia~~i 63 (429) T protein:vir:10 4 VKKFFNFE-KRQT------SQVIELNKDDEKLLEWL-GISPSTISV-----KG-------KNALKVATVFACIKILSESV 63 (429) T ss_pred hhhhhccc-ccCc------ccccccCCChHHHHHHh-cCCCCccee-----ch-------hhhhccHHHHHHHHHHHHhh Confidence 66778772 2221 11222212211111111 111111111 00 12356899999999999866 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-----hcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEE Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-----LNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEV 155 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-----l~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eL 155 (563) ... |+.+--++ + +..+.....-..++|+. +.-..=-+.++..+++.|.-|..++.|.. ..+.+| T Consensus 64 a~l-----~~~~~~~~-~--~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~---G~~~~L 132 (429) T protein:vir:10 64 SKL-----PLKIYQED-E--YGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRK---GKVQAL 132 (429) T ss_pred ccC-----ceEEEEec-C--CceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC---CcEEEE Confidence 543 34432221 1 11111222223333332 11111234556667788999998877643 238899 Q ss_pred EEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeeccccee Q lcl|NC_015287. 156 RYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAIS 235 (563) Q Consensus 156 r~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~ 235 (563) .+|+|..|..++.-.. ... ....-||.++..+ ....++.+.|. T Consensus 133 ~~i~~~~v~v~~~~~~-------~~~-----------~~~~~~~~~~~~g-------------------~~~~~~~~evi 175 (429) T protein:vir:10 133 WPIDASKVTVYIDDVG-------LLN-----------SKTKMWYVVNTGG-------------------QQRVLKPEEIL 175 (429) T ss_pred EEEcCceeEEEEcCcc-------ccc-----------ccceEEEEEccCC-------------------eEEEEccccEE Confidence 9999998876532111 000 0112244443211 12456666665 Q ss_pred eeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeC Q lcl|NC_015287. 236 FCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDA 315 (563) Q Consensus 236 y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~ 315 (563) |..-+. ..++-.+.|.|..|.+.+.....++....=+----+.-+-+..++ +.|.+.++++..+.+...|.. T Consensus 176 h~~~~~-~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~-~~l~~e~~~~~~~~~~~~~~g------ 247 (429) T protein:vir:10 176 HFKNGI-TLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-GDLNEDAKKVFRENFESMSSG------ 247 (429) T ss_pred EecCCC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhcc------ Confidence 553322 334455689999999999999888887766655545556777776 567777776665555444422 Q ss_pred CCCcccccccccchhhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchh Q lcl|NC_015287. 316 STGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG-QNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEI 394 (563) Q Consensus 316 ~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eI 394 (563) .++..+.+ .++ .|.+++.|.-. ..+.-++-.++..+.+.++++||.+.|...+++ +. +.+ T Consensus 248 ----~~n~~~~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~-~~---sn~ 308 (429) T protein:vir:10 248 ----LQNSHRIA-LMP----------VGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKA-TL---NNI 308 (429) T ss_pred ----ccccCcee-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC-Cc---ccH Confidence 11111211 221 24556665321 122223344577889999999999999532211 11 112 Q ss_pred hHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_015287. 395 LRDEIKFTKF-VGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDP 473 (563) Q Consensus 395 tRDElKF~Kf-I~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p 473 (563) ...-..|.++ |.-+-..+... +-+.++++.+|. ..+.+.|.-+.. ... -+..|++.++.+-. T Consensus 309 e~~~~~f~~~~l~P~~~~ie~~---------ln~kl~~~~~~~---~g~~~~fd~~~l----l~~-d~~~~~~~~~~~~~ 371 (429) T protein:vir:10 309 EQQQQQFYTDTLQATLTMYEQE---------MTYKLFLDSELD---KGFYSKFNVDAI----LRA-DIKTRYEAYRTGIQ 371 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHHH---------HHHhhcChhhcC---CCcEEEeechhh----hcC-CHHHHHHHHHHHHh Confidence 2222224332 22222222222 223344555554 233455544332 111 12345555544432 Q ss_pred hhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCccc Q lcl|NC_015287. 474 YVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQ 553 (563) Q Consensus 474 ~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~ 553 (563) - -++|.+-++.. +.+.+.+ ..+-++ .|..-...+. .......+|+.+. + .+. T Consensus 372 ~--G~~T~NE~R~~-~gl~p~~-----------ggD~~~-~~~n~~~~d~-~~~~~~k~g~~~~-~------~~~----- 423 (429) T protein:vir:10 372 G--GFLKPNEARSK-EDLPPEA-----------GGDRLL-VNGNMLPIDM-AGQAYLKGGDTNG-E------VSK----- 423 (429) T ss_pred C--CCcCHHHHHHH-hCCCCCC-----------CcCeee-ecccccchhh-ccccccCCCCCCC-C------CCC----- Confidence 2 24444444432 2222100 000000 0000000000 0000111111110 0 000 Q ss_pred CCCCCCCCCC Q lcl|NC_015287. 554 APQMPEGGEI 563 (563) Q Consensus 554 ~~~~~~~~~~ 563 (563) +|+|= T Consensus 424 -----~~~e~ 428 (429) T protein:vir:10 424 -----EGNEG 428 (429) T ss_pred -----CCCCC Confidence 11111 No 68 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=98.16 E-value=3.3e-06 Score=50.63 Aligned_cols=375 Identities=13% Similarity=0.096 Sum_probs=164.9 Q ss_pred ccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEec Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSD 84 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d 84 (563) ||| +++..+. ++.+.+.........+.++...-.. + + -+..+++|-|..||+-|.+.+.-++ T Consensus 1 Mg~-f~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~-v-~-------~~~~l~~~~v~~~i~~ia~~ia~~~ 62 (382) T protein:vir:48 1 MPI-FNLATES--------PPDNQGGFFDVVDSDFLASLKGNEW-V-S-------AETALRNSDLFSIINQLSNDLATVK 62 (382) T ss_pred Ccc-ccccccC--------CcccccccccchhhhccccccCCcc-c-c-------hHhhhccHHHHHHHHHHHHhhccCc Confidence 444 2221110 0111111111111111111111000 0 0 0122578999999999999664332 Q ss_pred CCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhH----HHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCC Q lcl|NC_015287. 85 QNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCH----EILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDP 160 (563) Q Consensus 85 ~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~----~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP 160 (563) +. +.+ .....+.++ =+-...++ .++..+++.|.-|..++.|.. .-+++|++|+| T Consensus 63 -----~~--~~~-----~~~~~L~~~-------PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~---G~~~~l~~i~~ 120 (382) T protein:vir:48 63 -----LI--TSR-----KKLQGIVDN-------PSNNANRFNFYQSIFAQMLLGGEAFAYRWRNEN---GRDMKWEYLRP 120 (382) T ss_pred -----ee--eec-----chhhhhhhh-------cCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCC---CcEEEEEEEcC Confidence 22 221 111223222 22333344 455566788999998877632 23889999999 Q ss_pred ccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeecc Q lcl|NC_015287. 161 LKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSG 240 (563) Q Consensus 161 ~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSG 240 (563) ..|..++.... + ..+|.|...+ .. ....+.++.+.|.|.. T Consensus 121 ~~v~v~~~~~~-----~------------------~~~y~~~~~~--------~~-------~~~~~~~~~~evih~~-- 160 (382) T protein:vir:48 121 SQVSFNRLDNK-----D------------------GIYYNITFDD--------PR-------IPPKQHVPQNDVLHFR-- 160 (382) T ss_pred ceeEEEEcCCC-----C------------------eEEEEEEecC--------cc-------ccceeEEcCccEEEec-- Confidence 88876532111 0 1123221110 00 0112445566664432 Q ss_pred eecCCC-CccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCc Q lcl|NC_015287. 241 LVDRNK-QTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGE 319 (563) Q Consensus 241 l~d~~~-~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGe 319 (563) ..+.++ -.+.|.|..|++++.....+++...-+----+--+-+..++.. +.+..+++..+..-..+ .+.|. T Consensus 161 ~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~-~~~e~~~~~~~~~~~~~-------~n~g~ 232 (382) T protein:vir:48 161 LLSVDGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGG-GLLDFKTKLSRSRQAMK-------QMQGG 232 (382) T ss_pred CCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC-CChHHHHHHHHHHHhhc-------cCCCC Confidence 233444 3457999999999999999988877766666777888888754 44445544444322211 12232 Q ss_pred ccccccccchhhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHH Q lcl|NC_015287. 320 IRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG-QNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDE 398 (563) Q Consensus 320 v~d~~~~msMlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDE 398 (563) + + .++ + |.+++.|.-. ..+.-++-.+|..+.+.++++||...|...++ ++.+...- T Consensus 233 ~------~-vl~-------~---g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~------~~~~~~~~ 289 (382) T protein:vir:48 233 P------L-VLD-------D---LEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGD------QQSSLEMS 289 (382) T ss_pred e------e-EcC-------C---CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC------cccHHHHH Confidence 1 1 111 2 4555555422 22223456678889999999999999953221 11111111 Q ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhch Q lcl|NC_015287. 399 IKFTKF-VGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGK 477 (563) Q Consensus 399 lKF~Kf-I~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGK 477 (563) ..|.++ |.-+-+.+...|...| +++.+++. ...+..|.+ -...+++.+-. +- T Consensus 290 ~~~~~~~l~p~~~~i~~~l~~~l---------~~~~~~~~-----~~~~~~~~~--------~~~~~~~~l~~-----~g 342 (382) T protein:vir:48 290 SDLYSKAVSRYLRPFLSELSQKL---------SCDVDADI-----FPAVDPTGS--------NYISRINSLVK-----TG 342 (382) T ss_pred HHHHHHHHHHHHHHHHHHHHHHh---------cChhhhhh-----hhhhccchh--------HHHHHHHHHhh-----cC Confidence 224332 3333333333332222 12222211 000111110 01111111100 01 Q ss_pred hhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCC Q lcl|NC_015287. 478 YFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQD 539 (563) Q Consensus 478 y~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~ 539 (563) .++. .|.++ .-.+.|..++|-.. .++.. ....||++++++ T Consensus 343 ~~t~---------------~e~r~---~l~~~g~~~~~~~~-~~~~~---~~~~GGd~~~~~ 382 (382) T protein:vir:48 343 TLAQ---------------NQGLY---ILQQAEILPKELPN-GENPN---STLKGGEEDGQD 382 (382) T ss_pred ccCH---------------HHHHH---HHhhCCCCCcchhh-hhcCC---CCCCCCCCCCCC Confidence 1121 11112 12233443322111 11110 123455555555 No 69 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=98.15 E-value=3.5e-06 Score=50.50 Aligned_cols=411 Identities=15% Similarity=0.187 Sum_probs=171.3 Q ss_pred ccc-hhhHhhhcccccccccCc-CCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEE Q lcl|NC_015287. 5 FGF-KIQKDDEQNKNVVSPVPQ-SNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIV 82 (563) Q Consensus 5 ~~f-~~~~~~~~~~~~~s~~p~-~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv 82 (563) +|. -.++++...+. .+..- .....++ .....++++. .+-.+|...| +.|+.+..||+.++++|+. T Consensus 1 ~~~~m~~~~~~~~~~--D~~~~~~~~~~g~-~~~~~~~~~~-------~~~~~l~~~Y---~~~~l~~~~Vd~~aed~~r 67 (435) T protein:vir:79 1 MGVFMSDKVKAITKE--DGYNEIFGSKDGT-FRPNAFYMQR-------AAFKALSQFY---EEDGMARRIVDVIPEEMVT 67 (435) T ss_pred CCcccccccccchhh--cchhhhhcccccc-cccCcccCCc-------CCHHHHHHHH---hcCchhhhhhccchHHhhc Confidence 332 11111111100 00000 0000001 1111122211 2233555555 4689999999999999986 Q ss_pred ecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhh--hheeeeecC------CCCCcceeE Q lcl|NC_015287. 83 SDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGR--IYYHKVIDI------KAPEEGIKE 154 (563) Q Consensus 83 ~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGr--i~fhkviD~------~~~k~GI~e 154 (563) .- +.| + ++.-+++ |+..++-|+...+..+.+|.=-+.|. ++.+ +.|. -+++..|+. T Consensus 68 ~g-----~~i--~----g~~~~~~----~~~~~~~l~~~~~l~~a~~~~rl~G~~~i~i~-~~d~~~~~~Pl~~~g~i~~ 131 (435) T protein:vir:79 68 PG-----FKV--D----GVKNEKS----FKSRWDELRLNAKIIDALSWSRLFGGSAILAV-VADNKMLKSPVKPGAQLED 131 (435) T ss_pred CC-----cee--c----CCChHHH----HHHHHHHhhHHHHHHHHHHhhhccccEEEEEE-ecCCCCcccccccCCceee Confidence 54 222 2 1112234 44444455667788887775556665 4443 2243 356667888 Q ss_pred EEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccce Q lcl|NC_015287. 155 VRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAI 234 (563) Q Consensus 155 Lr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI 234 (563) ++.|||.++..- ++.. + +..-.|.. .++|..++.+ .+.+-..|.++.+.+.-..+ T Consensus 132 i~v~d~~~i~~~-~~~~-----d---------p~sp~fg~-P~~y~v~~~~---------~~~~~~iH~SRli~~~g~~~ 186 (435) T protein:vir:79 132 IRVYDRYQITIH-ERET-----N---------ARSVRYGE-PKLYKISPGG---------DIPEFFVHYSRICIIDGERV 186 (435) T ss_pred EEeechhhccch-hhcc-----C---------CcccccCc-ceEEEEecCC---------CCCceEEcceeEEEecCCcc Confidence 899998665431 1100 1 11111222 2344443311 01111223333333332222 Q ss_pred eeeecceecCCCCccchhH-HHHHHHHHHHHHHHH--HHHHHHHhcCccceEEEcc-CCCC-----chHHHHHHHHHHHH Q lcl|NC_015287. 235 SFCHSGLVDRNKQTILSYL-HKSIKALNQLRMIED--SLVIYRLSRAPERRIFYID-VGNL-----PKMKAEQYLKEVMN 305 (563) Q Consensus 235 ~y~hSGl~d~~~~~~~syL-~~Aik~~NqL~m~ED--alVIyRi~RAPeRRvFyID-vGnl-----pk~KAeqYl~~im~ 305 (563) - |-.. ...+..+.|.| +++..-+.++..... +.++++-. . +|+.++ +..+ ....+..-+. .++ T Consensus 187 p--~~~~-~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~---~-~v~~~~~l~~~~~~~~~~~~~~~r~~-~~~ 258 (435) T protein:vir:79 187 S--NEKR-RQNDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQ---Q-AVWKARDLALMCDDEEGRYAARLRLA-QVD 258 (435) T ss_pred h--hhhc-cccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhc---C-ccccchhHHHhhcCccchHHHHHHHH-HHH Confidence 1 1111 11223334544 455443333322222 23444431 2 234442 2221 1111111111 122 Q ss_pred hhcce---eEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCChhhccC Q lcl|NC_015287. 306 RYRNK---LVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSD-IEYFQKKLYRSLGVPESRIAG 381 (563) Q Consensus 306 k~knk---lvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl~~ 381 (563) ++|+. ++-|+.+ -+++++. .+|+-++| +.+|...+-.+.+||+.||-+ T Consensus 259 ~~~~~~~~~~i~~~~--------------------------e~~e~~~--~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G 310 (435) T protein:vir:79 259 DESGVGKAIGIDATD--------------------------EEYEVLN--SDVSGVPEFLQEKIDRIVALTGIHEIIIKN 310 (435) T ss_pred HhcCCCCceeEecCC--------------------------cceEEEe--cccCCHHHHHHHHHHHHHhhhCCCeeeecc Confidence 33221 2222111 1233332 24555555 478999999999999999954 Q ss_pred CC-cceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHH Q lcl|NC_015287. 382 SG-DGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETEL 460 (563) Q Consensus 382 ~~-~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei 460 (563) .+ +|++ .++ .-|--.|..+|+++|.. .+..+|++- +.-.+.+ +.+.|.|..=..-+|...+|+ T Consensus 311 ~s~~gln--stg--d~d~~~yyd~i~~~Qe~---~l~p~l~~l-~~li~~s--------~d~~~~f~pL~~~sekEkAei 374 (435) T protein:vir:79 311 KNTGGVS--ASQ--NTALETFYKLIDRKRVE---DYKPILEFL-LPFMISE--------TEWSIEFEPLSVPSDKDKAEI 374 (435) T ss_pred CCccccc--cch--hHHHHHHHHHHHHHHHH---HHHHHHHHH-HHHhhcC--------CCCeEEeCCCCCCCHHHHHHH Confidence 33 2343 111 12334499999999853 333333321 1111122 356788877666678888888 Q ss_pred HHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCc Q lcl|NC_015287. 461 LNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDP 540 (563) Q Consensus 461 ~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~ 540 (563) ...++++++.+-.- | .++.+-+++.+ ..+-.+. |.......+ -|... T Consensus 375 ~~~~a~a~~~~~~~-g-~i~~~e~r~~L------------~~~~~~~--~~~~~~~~~-----------------~~~~~ 421 (435) T protein:vir:79 375 MAKNVESVVKLKAE-Q-AINLKETRDTL------------RSICPDL--KIMDNDNIE-----------------LPEPE 421 (435) T ss_pred HHHHHHHHHHHHhc-C-CCCHHHHHHHH------------HHhcccc--CCCCccccc-----------------CCccc Confidence 88888888765332 1 23333333222 1111111 111100000 00000 Q ss_pred ccccccCCCCcccC Q lcl|NC_015287. 541 NMGGAPQEGGAEQA 554 (563) Q Consensus 541 ~~~~~~~~~~~~~~ 554 (563) +..++..+..++.. T Consensus 422 d~~~~~~~e~g~~~ 435 (435) T protein:vir:79 422 DLDPEPGQEGGLNK 435 (435) T ss_pred cCCCCCCCCCCCCC Confidence 00111111111111 No 70 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=98.15 E-value=3.6e-06 Score=50.44 Aligned_cols=434 Identities=12% Similarity=0.079 Sum_probs=177.5 Q ss_pred ccccCcCCCCcccceeccccccccc-cccc----h-hhhHHHHHHHHHHHhhccchhHH--------------------- Q lcl|NC_015287. 20 VSPVPQSNEDSSDYYVSSGFYGQYV-DIDG----V-FKSEFELIKRYREMALHPEVDSA--------------------- 72 (563) Q Consensus 20 ~s~~p~~~~d~~~~~~~~~~~~~~~-~~~~----~-~~~~~~LI~~YR~ma~~PEvd~A--------------------- 72 (563) .+ |. ....+......|...+- .+.. . +..-...+.+|+.+..+.+-... T Consensus 1 ~~--~~--~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ 76 (472) T protein:vir:93 1 MY--PS--QPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDD 76 (472) T ss_pred CC--CC--CCcchhhhhceeeecCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhcccccccccccc Confidence 11 11 11111111222222110 0000 0 01112233455555444333211 Q ss_pred ------HHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCC Q lcl|NC_015287. 73 ------IEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIK 146 (563) Q Consensus 73 ------i~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~ 146 (563) ...||+- .+.--..+||.+..++.+ ..+..+.+++ .+|+....++.+.+++.|+-|.+.-+| T Consensus 77 ri~~n~~~~ivd~-~~~~l~g~~~~~~~~d~~--------~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d-- 144 (472) T protein:vir:93 77 RMITNFHANLVDQ-KVSYIVGKPIAFKHTDDE--------VVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLD-- 144 (472) T ss_pred ccccchHHHHHHH-HhhhhcccCeeeccCChH--------HHHHHHHHHh-ccHHHHHHHHHHHHhhcCeEEEEEEEC-- Confidence 1112221 111113456666665522 3334444554 479999999999999999988876655 Q ss_pred CCCcceeEEEEeCCccceeeeeeeccCCccccceeeeccccc--ccccccccceeeecCccccccccccccCcCccccCc Q lcl|NC_015287. 147 APEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPA--DIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQ 224 (563) Q Consensus 147 ~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~--~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~ 224 (563) ++|-..+..+||+++..|..-.........+.+....... .++.+.-..+|.+................ .... T Consensus 145 --~d~~~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~ 219 (472) T protein:vir:93 145 --EEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENS---KTHF 219 (472) T ss_pred --CCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeecceeEEEEecCeEEEEEEecCeeeeccccccccc---cccc Confidence 3567889999999988884322221122222222111111 01111111223332211110000000000 0000 Q ss_pred ceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHH Q lcl|NC_015287. 225 RPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEV 303 (563) Q Consensus 225 ~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~-m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~i 303 (563) ..--+..=+|++.. |+....|=++..+....-+. ++=+....-+.+.-|-+-+.-.+.-.. .+..+. T Consensus 220 ~~~~~~~vPvv~~~------nn~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~-----~~~~~~- 287 (472) T protein:vir:93 220 STGSWGKIPFIPFK------NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQEL-----PEFKRL- 287 (472) T ss_pred ccCCCCCcceEEec------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcccc-----hhhHHH- Confidence 00001111333321 12223455555444443333 445555556667777544432221111 111111 Q ss_pred HHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCChhhccCC Q lcl|NC_015287. 304 MNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDIEYFQKKLYRSLGVPESRIAGS 382 (563) Q Consensus 304 m~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~ 382 (563) +..++ -..++ + +..+.+|-...+.+. ..-+.-+.+.+|+..++|---+ + T Consensus 288 ~~~~~-----------------------~~~~~---~--~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~--~ 337 (472) T protein:vir:93 288 LRYYG-----------------------AIKVS---D--NGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSS--D 337 (472) T ss_pred Hhhcc-----------------------ccccC---C--CCcceeEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCc--c Confidence 11110 01111 1 123555533333332 2345666778888888884222 2 Q ss_pred Ccceeccccc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHH Q lcl|NC_015287. 383 GDGFNLGRSS-EILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELL 461 (563) Q Consensus 383 ~~g~~~g~~~-eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~ 461 (563) .|.-..|+ .|.--+.....-+.+.|+.|...+.++++.=+-+-|+ ..+|. .|.+.|.. +.-..+ T Consensus 338 --~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~--~~~~~----~i~v~f~~-~~p~~~------ 402 (472) T protein:vir:93 338 --KFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI--KGEHK----DVDISFNY-NKVANT------ 402 (472) T ss_pred --ccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Ccccc----eeeEEeCC-CCCCCH------ Confidence 12111222 2333333455567788888888888888764444443 34554 46677743 322211 Q ss_pred HHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcc Q lcl|NC_015287. 462 NERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPN 541 (563) Q Consensus 462 ~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~ 541 (563) .+.++++..+ +| .+|.+++++.+-..+| .+++-++|++|..+.....+.. ...+ +. .+.+.+++.+.+ T Consensus 403 ~~~~~~~~k~---~g-iis~et~l~~l~~~~d--~~~E~~ri~~E~~~~~~~~~~~---~~~~--~d-~~~~~~~~~~~~ 470 (472) T protein:vir:93 403 ELQVQTAQQS---MG-IVSHETVLENHPFVED--LQAELERIEQEQMEYNKQLPNL---DDGG--AD-GAQQQERSNNKE 470 (472) T ss_pred HHHHHHHHHH---hc-cCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHhccCc---Cccc--CC-CCCCCCCCCccc Confidence 1234455554 35 4899999998765554 3333344444432211110100 0000 00 011111111111 Q ss_pred cc Q lcl|NC_015287. 542 MG 543 (563) Q Consensus 542 ~~ 543 (563) .. T Consensus 471 ~e 472 (472) T protein:vir:93 471 SE 472 (472) T ss_pred CC Confidence 11 No 71 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=98.13 E-value=4e-06 Score=50.19 Aligned_cols=454 Identities=14% Similarity=0.097 Sum_probs=191.4 Q ss_pred ccccccccCcCCCCcccce---e---cc------ccccccccccchhhh-HHHHHHHHHHH-hhccchhHHHHHhhcceE Q lcl|NC_015287. 16 NKNVVSPVPQSNEDSSDYY---V---SS------GFYGQYVDIDGVFKS-EFELIKRYREM-ALHPEVDSAIEDIINEAI 81 (563) Q Consensus 16 ~~~~~s~~p~~~~d~~~~~---~---~~------~~~~~~~~~~~~~~~-~~~LI~~YR~m-a~~PEvd~Ai~eIvneai 81 (563) -...|+- +..+++..... + .. .+..+|-+- ..+.. ....=+.+|.+ +.+.=+..+|+..++=. T Consensus 1 ~~~~i~~-~~~~~~~~~~~~~l~~~~~~~~~r~~~~~~Yy~G~-~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l- 77 (485) T protein:vir:10 1 MTAPLPG-QEEIEDPAIARDEMVSAFEDSTQNLKTNTSYYEAE-RRPEAIGVTVPIQMQSLLAHVGYPRLYVDSIAERQ- 77 (485) T ss_pred CCCCCCC-CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CcchhcCCCCChhhhhhhhhcCcHHHHHHHHHhhh- Confidence 1111111 11122211111 0 00 000111110 00000 00000111111 11122233444333311 Q ss_pred EecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCC----CCCcceeEEEE Q lcl|NC_015287. 82 VSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIK----APEEGIKEVRY 157 (563) Q Consensus 82 v~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~----~~k~GI~eLr~ 157 (563) +.+ .+.+. ++ +...+.+..|++..+|+....++++.-++.|+-|...-.|.. .+.+|-..++. T Consensus 78 ~~~----g~~~~-~~--------~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~ 144 (485) T protein:vir:10 78 AVE----GFRFG-DA--------DEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRV 144 (485) T ss_pred ccc----ceecC-CC--------chhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEE Confidence 000 11110 11 122345666777888999999999999999999886544432 23456677889 Q ss_pred eCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcC----ccccCcceeeecccc Q lcl|NC_015287. 158 IDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQT----PFSTKQRPVKIAPDA 233 (563) Q Consensus 158 lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~----~~~~~~~~iki~~da 233 (563) ++|+.+..++--.. ......+.+.... ....+..+.+|.+......-. ..++- ...+.- -++| . T Consensus 145 ~~p~~~~~~~D~~~-~~~~~~~~~~~~~-----~~~~~~~~~~y~~~~~~~~~~--~~~~~~~~~~~~~~~--g~vP--v 212 (485) T protein:vir:10 145 EPPTRMYAEIDPRI-GRVSKAIRVAYDA-----EGNEIQAATLYTPNDIFGWYR--VENEWQEWFNNPHGL--GVVP--V 212 (485) T ss_pred EccceeEEEEcCCC-CceeEEEEEEEee-----CCCeEEEEEEEeCCeEEEEEE--cCCceEEeccccCCC--Cccc--E Confidence 99988776642111 1111111111100 011122233343332221110 00000 000000 0111 2 Q ss_pred eeeeecceecCCCCccchhHHHHHHH-HHH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhccee Q lcl|NC_015287. 234 ISFCHSGLVDRNKQTILSYLHKSIKA-LNQ-LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKL 311 (563) Q Consensus 234 I~y~hSGl~d~~~~~~~syL~~Aik~-~Nq-L~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knkl 311 (563) |.|+|..-.. +.-+.|-+...+.. ... -+++-+..++-..+-.|.|-|.=.+..+++...-. ... T Consensus 213 v~~~n~~~~~--~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~-----------~~~ 279 (485) T protein:vir:10 213 VPIPNRTRLS--DLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPET-----------GQT 279 (485) T ss_pred EEeccccccC--CCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcCCcccccccccc-----------cch Confidence 4455442211 11223444433222 121 23566777777777777776653332222211000 000 Q ss_pred EeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCChhhccCCCcceeccc Q lcl|NC_015287. 312 VYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGR 390 (563) Q Consensus 312 vYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~ 390 (563) +.+...| ..|+.-. -+.++-.++... ++- ++-++=.-..++..-++|.+-|... +-|-.. T Consensus 280 ~~~~~~~-------------~i~~~~~---~d~k~~q~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~fg~~--~~n~~S 340 (485) T protein:vir:10 280 LFDAYLA-------------RILAFED---AEGKIQQFSAAE-LANFTNALDQIAKQVAAYTGLPPQYLSTA--ADNPAS 340 (485) T ss_pred hhhhccc-------------ceeccCC---CCceEEeecccc-hHHHHHHHHHHHHHHhcccCCCHHHhccc--cCchhH Confidence 0011111 2233211 123344555432 221 1223333344555577787877432 122222 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC-hhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 391 SSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVT-PEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVA 469 (563) Q Consensus 391 ~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t-~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~ 469 (563) +..|..-+..+..-+.+.|+.|..-+...++.-+.+.|... ..+| ..|.+.|..-..-+. .+..+++. T Consensus 341 g~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~----~~i~v~w~~~~~~~~-------~~~ada~~ 409 (485) T protein:vir:10 341 AEAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYRMMKGGDVPPDM----LRMETVWRDPSTPTY-------AAKADAAS 409 (485) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCcccc----eeeeEEecCCCCCCH-------HHHHHHHH Confidence 33466666668888899999999999999987666655321 2222 457888865433222 24455555 Q ss_pred HhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCC-----CCCCcccCccccccccccCCCCcCCCCCccccc Q lcl|NC_015287. 470 AVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGK-----LADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGG 544 (563) Q Consensus 470 ~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~-----~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 544 (563) .+..-+-..+|.++++. +|.+++.+++++++..+++...+. +..|....+++. . ..+...+ T Consensus 410 kl~~ag~~~~s~et~~~-~lg~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~----------~---~~~~~~~ 475 (485) T protein:vir:10 410 KLYNGGTGVIPRERARK-DMGYSIAEREEMRRWDEEEAAMGLGLIGTMVDPNPTVPGSP----------S---PAPAPKP 475 (485) T ss_pred HHHhccccCCCHHHHHH-hCCCCHhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCC----------C---ccccccC Confidence 55443225788899985 588999999888776655543221 111111111000 0 0011112 Q ss_pred ccCCCCcccC Q lcl|NC_015287. 545 APQEGGAEQA 554 (563) Q Consensus 545 ~~~~~~~~~~ 554 (563) +++..++.-| T Consensus 476 ~~~~~~~~~~ 485 (485) T protein:vir:10 476 AALESGGDAA 485 (485) T ss_pred cCCCCCCCCC Confidence 2222222222 No 72 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=98.10 E-value=4.7e-06 Score=49.79 Aligned_cols=408 Identities=12% Similarity=0.064 Sum_probs=184.6 Q ss_pred ccccccchhhhHHHHHHHHHHHhhccchhHH----------------------HHHhhcceEEecCCCcEEEEEeecccc Q lcl|NC_015287. 42 QYVDIDGVFKSEFELIKRYREMALHPEVDSA----------------------IEDIINEAIVSDQNDSPVEIDLENLPA 99 (563) Q Consensus 42 ~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~A----------------------i~eIvneaiv~d~~~~pV~l~l~~~~~ 99 (563) ...+.-. .. ..+|+.+..+.+-+.. ...||+- .+.=--..||.+...+... T Consensus 1 ~~~~~~~---~~---~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~-~~~~l~g~~~~~~~~~~~~ 73 (440) T protein:vir:95 1 MLAAFLG---SQ---KQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSF-ATGYVIGNPVSIGVMEGGS 73 (440) T ss_pred ChhhHHH---HH---HHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHh-hhhheeccCceEeeCCCcc Confidence 1112211 11 2223333323222211 1112221 0111134456665544222 Q ss_pred chHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccc Q lcl|NC_015287. 100 SKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAI 179 (563) Q Consensus 100 s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i 179 (563) +...+.+..++...+|+....++.|.+++.|+-|.+.-+| ++|-..+..++|+.+..|+--. +.+..+ T Consensus 74 -----~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d----~~~~~~i~~~~p~~~~~~~d~~---~~~~~~ 141 (440) T protein:vir:95 74 -----ADQLSTIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRD----KDKVDRVVLISPLEMFVIRDLT---VEQNII 141 (440) T ss_pred -----HHHHHHHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEec----CCCceEEEEEcccceEEEEcCC---CCCceE Confidence 2234556777878899999999999999999999887666 4566788999999988874211 111111 Q ss_pred eeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeeccc--ceeeeecceecCCCCccchhHHHHH Q lcl|NC_015287. 180 NQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPD--AISFCHSGLVDRNKQTILSYLHKSI 257 (563) Q Consensus 180 ~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~d--aI~y~hSGl~d~~~~~~~syL~~Ai 257 (563) -.++ .+...+...+-+|.+.........+.........+ ...-+.. +|++. +|.....|=++..+ T Consensus 142 ~~i~-----~~~~~~~~~~~vyt~~~~~~~~~~~~~~~~~~~~~--~~~~~~g~vPvv~~------~n~~~g~sd~e~v~ 208 (440) T protein:vir:95 142 AAVH-----LPIYADKVNMTVYTKDKVITYKPYSNNSVRLVVDD--VKKHSYNDVPVVEW------WNNRFRMGDYESEI 208 (440) T ss_pred EEEE-----EEEecCceEEEEEeCCeEEEEEEecCCccceeecc--eeeccCceeeEEEe------eCCCCCCCchhhhH Confidence 1111 11112222233454444332211111000000000 0001111 33321 12222345555555 Q ss_pred HHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhccc Q lcl|NC_015287. 258 KALNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLP 336 (563) Q Consensus 258 k~~NqL~-m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLp 336 (563) .....+. ++-+.+..-+..+.|-+-+.-.+.+.-... ...+.+++.+. .|++ T Consensus 209 ~lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~--------------------e~~~~~~~~~~-------~~~~ 261 (440) T protein:vir:95 209 SLIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKLSP--------------------EDAAKMKDANM-------LFLK 261 (440) T ss_pred HHHHHHHHHHHHHHHHHHHhhcceeeeecccccCCCCc--------------------cchhhhhhccc-------eecc Confidence 5444443 334445555666666655543211110000 00011111111 2322 Q ss_pred cc----CCCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCChhhccCCCcceeccccch--hhHHHHHHHHHHHHHH Q lcl|NC_015287. 337 RR----EGGRGTEITTLPGGQNLGEL-SDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSE--ILRDEIKFTKFVGRMR 409 (563) Q Consensus 337 RR----eGgrgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~e--ItRDElKF~KfI~rLr 409 (563) .. .++.+..++.|-...+++.. .-++-+.+.+|...++|- +..+ +|. |..|. |.--+.....-+.+.| T Consensus 262 ~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p~--~~~~--~~~-~n~Sg~Al~~~~~~l~~k~~~k~ 336 (440) T protein:vir:95 262 TGISTTGQQTTADASYIYKQYDVNGTEAYKNRLANDIHRFSRIPN--LDDD--RFN-STSSGIALLYKMIGLEQVRKDKE 336 (440) T ss_pred cccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcc--cccc--ccc-ccchHHHHHHHHHHHHHHHHHHH Confidence 22 22334457777666665543 447778888999999884 2111 221 22222 2222222444577777 Q ss_pred HHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHh Q lcl|NC_015287. 410 KRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKIL 489 (563) Q Consensus 410 ~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il 489 (563) ..|..-+.++|+.=+-+-+++...+|+. ..+.+.|..--.-.+. +.++++..+ +| .+|.++++..+- T Consensus 337 ~~~~~~l~~~~~li~~~~~~~~~~~~~~--~~v~i~f~~~~p~~~~-------~~ad~~~kl---~g-~iS~et~~~~l~ 403 (440) T protein:vir:95 337 TYFTKALRRRYELISNIHKAINGPVIEA--NKLTFTFHPNIPQDVW-------TEIKAYIEA---GG-EISQETLMENAS 403 (440) T ss_pred HHHHHHHHHHHHHHHHHHhhcCCccccc--ccceEEeCCCCCCCHH-------HHHHHHHHH---hc-cCcHHHHHHhCC Confidence 8888888888876544545555555654 4577777654433332 344445554 34 489999998776 Q ss_pred ccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcc Q lcl|NC_015287. 490 KQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPN 541 (563) Q Consensus 490 ~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~ 541 (563) ..++ .++.++|++|....-.. ..+. .+ ...++ ..++| T Consensus 404 ~~d~---~~E~~ri~~E~~~~~~~---~~~~--~~----~~~~~---~~~~e 440 (440) T protein:vir:95 404 FTDY---KTEHSRILKQGGSSDLE---IGQI--VG----DADVG---QADTE 440 (440) T ss_pred CCCc---HHHHHHHHHHHHHhhhh---HHhh--cc----CCCCC---CcCCC Confidence 4432 12334444443321110 0000 00 00011 11111 No 73 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=98.09 E-value=4.9e-06 Score=49.70 Aligned_cols=440 Identities=16% Similarity=0.217 Sum_probs=196.6 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCccc------ceeccccccc-cccccchhhhHHHHHHHHHHHhhccchhHHH Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSD------YYVSSGFYGQ-YVDIDGVFKSEFELIKRYREMALHPEVDSAI 73 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~------~~~~~~~~~~-~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai 73 (563) .+++|+.+.+......+ |+ + +++-.. -.+.|-+... +.+..+... .+.+.+| +--..++ T Consensus 15 ~~~~~~~~~~~~~~~~~--i~-~---~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~-----~~~~~sl---nl~~~i~ 80 (500) T protein:vir:30 15 KYVMTTQSLTNITDHPK--IA-I---SKLEYDRITTNLKYYKSDWDSVLYLNTDGETK-----KRDLNHL---PIARTAA 80 (500) T ss_pred HHHhhcchhhhhhcccc--cc-C---CHHHHHHHHHHHHHhcCCCCCcccccCCCCcc-----cCceeec---chHHHHH Confidence 11122221111111000 00 0 000000 0022221111 111111100 0111111 1112233 Q ss_pred HHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCccee Q lcl|NC_015287. 74 EDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIK 153 (563) Q Consensus 74 ~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~ 153 (563) ++.++ +|+. .|+.+.+++ +...+.++.+++-.+|.....+.+....+-|..+|+..+|..+ . T Consensus 81 ~~~A~--lv~~---e~~~i~~~d--------~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~-----~ 142 (500) T protein:vir:30 81 KKIAS--LVFN---EQAEIKVDD--------DAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDGDK-----V 142 (500) T ss_pred HHHhh--hhcC---CcceEecCC--------hHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCc-----e Confidence 33333 2221 344555543 3677788889999999999999999999999999999998532 4 Q ss_pred EEEEeCCccceeeeeeeccCCccccce--eeecccccccccccccceeeecCcc--cccccccc-----ccCc-CccccC Q lcl|NC_015287. 154 EVRYIDPLKIKLVRKLKTDNSLRGAIN--QVNSKNPADIENPEIEEYYQYDPSQ--TQGKNALG-----AIGQ-TPFSTK 223 (563) Q Consensus 154 eLr~lDP~~i~~vR~~~~~~~~~~~i~--~~~~~~~~~~~~~~~~eyf~y~~~~--~~~~~~~g-----~~g~-~~~~~~ 223 (563) .+.+++|.++-+++.--.... ...+- +....+..+..|-- -||+...+.. +..+...- ..|. .+. T Consensus 143 ~I~~v~ad~~~P~~~d~~~~~-~~a~~~~~~~~~~~~~~~yt~-lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l--- 217 (500) T protein:vir:30 143 RVAFVQAPVFLPLQSNTQDVS-SAAVVIKSVKTINGKEVYYTL-IEFHEWQSSDDYVISNELYRSDDKAKVGSRVPL--- 217 (500) T ss_pred EEEEEcCCeeEEEEEcCCCeE-EEEEEEEEeeeecCCceEEEE-EEEEEEeCCceeEEEEEEEecccccccCccccc--- Confidence 588999999888744111100 00000 01110001111111 1222211111 11111100 0011 000 Q ss_pred ccee-eecccceeeeecc--e-----------ecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCC Q lcl|NC_015287. 224 QRPV-KIAPDAISFCHSG--L-----------VDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVG 289 (563) Q Consensus 224 ~~~i-ki~~daI~y~hSG--l-----------~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvG 289 (563) ..+ .--.+.+++.+.. | .+.....++|-++.|.-.+..|-..-+.+. |..|.=++|||. +.. T Consensus 218 -~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~--~e~~~g~~~i~v-~~~ 293 (500) T protein:vir:30 218 -SEVYKDLKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFM--WEVKMGQRRVAV-PES 293 (500) T ss_pred -ccccCCcCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHH--HHHHhCcceeee-chH Confidence 001 0001222222111 0 122334467999999998888887777665 888887777765 111 Q ss_pred CCchHHHHHHHHHHHHhhcceeEeeCCCCccc-------ccccccchhhhhcccccCCCCccceeecCCCCCcch-HHHH Q lcl|NC_015287. 290 NLPKMKAEQYLKEVMNRYRNKLVYDASTGEIR-------DDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDI 361 (563) Q Consensus 290 nlpk~KAeqYl~~im~k~knklvYd~~TGev~-------d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV 361 (563) =+. ...+..+|+.. +++.+..|-.+ -++ +.-|+.+...---.+ ..-+ T Consensus 294 ~l~------------------~~~~~~~g~~~~~~~~d~~~~~~~~~~~~-----~~~--~~~i~~~~~~ir~e~~~~~l 348 (500) T protein:vir:30 294 LTA------------------LTVRTTDGDVVPRPRFESDQNVYIRMGGR-----DLD--SSAIQDLTTPIRADDYIKAI 348 (500) T ss_pred Hhc------------------ccCCCCCccccCCcccCCCcceEEEcCCC-----CCc--CcceeEeccccChHHHHHHH Confidence 100 01111222110 11122211100 011 122555443221111 2345 Q ss_pred HHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-------cCCCChhH Q lcl|NC_015287. 362 EYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLIL-------KNIVTPED 434 (563) Q Consensus 362 ~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiL-------kgI~t~ee 434 (563) .++.+.+=.+.+++-+.|..+++| .--|+||.-.+-.-..-+.+.|+.|...+.++++.=|-+ .|.... T Consensus 349 ~~~l~~i~~~~gls~~~~~~~~~g--~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~-- 424 (500) T protein:vir:30 349 NEGLSLFEMQIGVSAGLFSFDGKS--MKTATEIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPS-- 424 (500) T ss_pred HHHHHHHHHHhCCCccccccCcCc--cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCC-- Confidence 567777777788888888654322 223566654444455567777777777777776665432 232222 Q ss_pred HHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCC Q lcl|NC_015287. 435 WEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLAD 514 (563) Q Consensus 435 w~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~ 514 (563) ...+.++|. |+.+.. +++++ +..+.+++ - | .+|..+.+.+....+|+|.+++.++|++|... .. T Consensus 425 ----~~~v~v~f~-d~i~~d-~~~~~-~~~~~~v~----a-G-i~s~~~~i~~~~g~~eeea~~~l~~i~~E~~~-~~-- 488 (500) T protein:vir:30 425 ----MDNISISLD-DGVFTD-RDAEL-DYWIKVVN----A-G-FGTREMAIQKVLNVTEEKAQEIAAEINTGIVD-EI-- 488 (500) T ss_pred ----CcceEEEeC-CCCCCC-HHHHH-HHHHHHHH----c-C-CCCHHHHHHhcCCCCHHHHHHHHHHHHHhccc-cC-- Confidence 134788885 444433 22222 11112211 1 3 47888877777789999999999999887421 11 Q ss_pred CcccCcccccccccc Q lcl|NC_015287. 515 PMEVQQLQLGVHPEQ 529 (563) Q Consensus 515 p~~~~~~~~~~~~~~ 529 (563) ...++.++..|+ T Consensus 489 ---~~~~~~~~~~g~ 500 (500) T protein:vir:30 489 ---NQQRTDTHLYGE 500 (500) T ss_pred ---CCCCccccccCC Confidence 111111111111 No 74 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=98.09 E-value=4.9e-06 Score=49.70 Aligned_cols=440 Identities=16% Similarity=0.217 Sum_probs=196.6 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCccc------ceeccccccc-cccccchhhhHHHHHHHHHHHhhccchhHHH Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSD------YYVSSGFYGQ-YVDIDGVFKSEFELIKRYREMALHPEVDSAI 73 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~------~~~~~~~~~~-~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai 73 (563) .+++|+.+.+......+ |+ + +++-.. -.+.|-+... +.+..+... .+.+.+| +--..++ T Consensus 15 ~~~~~~~~~~~~~~~~~--i~-~---~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~-----~~~~~sl---nl~~~i~ 80 (500) T protein:vir:98 15 KYVMTTQSLTNITDHPK--IA-I---SKLEYDRITTNLKYYKSDWDSVLYLNTDGETK-----KRDLNHL---PIARTAA 80 (500) T ss_pred HHHhhcchhhhhhcccc--cc-C---CHHHHHHHHHHHHHhcCCCCCcccccCCCCcc-----cCceeec---chHHHHH Confidence 11122221111111000 00 0 000000 0022221111 111111100 0111111 1112233 Q ss_pred HHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCccee Q lcl|NC_015287. 74 EDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIK 153 (563) Q Consensus 74 ~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~ 153 (563) ++.++ +|+. .|+.+.+++ +...+.++.+++-.+|.....+.+....+-|..+|+..+|..+ . T Consensus 81 ~~~A~--lv~~---e~~~i~~~d--------~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~-----~ 142 (500) T protein:vir:98 81 KKIAS--LVFN---EQAEIKVDD--------DAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDGDK-----V 142 (500) T ss_pred HHHhh--hhcC---CcceEecCC--------hHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCc-----e Confidence 33333 2221 344555543 3677788889999999999999999999999999999998532 4 Q ss_pred EEEEeCCccceeeeeeeccCCccccce--eeecccccccccccccceeeecCcc--cccccccc-----ccCc-CccccC Q lcl|NC_015287. 154 EVRYIDPLKIKLVRKLKTDNSLRGAIN--QVNSKNPADIENPEIEEYYQYDPSQ--TQGKNALG-----AIGQ-TPFSTK 223 (563) Q Consensus 154 eLr~lDP~~i~~vR~~~~~~~~~~~i~--~~~~~~~~~~~~~~~~eyf~y~~~~--~~~~~~~g-----~~g~-~~~~~~ 223 (563) .+.+++|.++-+++.--.... ...+- +....+..+..|-- -||+...+.. +..+...- ..|. .+. T Consensus 143 ~I~~v~ad~~~P~~~d~~~~~-~~a~~~~~~~~~~~~~~~yt~-lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l--- 217 (500) T protein:vir:98 143 RVAFVQAPVFLPLQSNTQDVS-SAAVVIKSVKTINGKEVYYTL-IEFHEWQSSDDYVISNELYRSDDKAKVGSRVPL--- 217 (500) T ss_pred EEEEEcCCeeEEEEEcCCCeE-EEEEEEEEeeeecCCceEEEE-EEEEEEeCCceeEEEEEEEecccccccCccccc--- Confidence 588999999888744111100 00000 01110001111111 1222211111 11111100 0011 000 Q ss_pred ccee-eecccceeeeecc--e-----------ecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCC Q lcl|NC_015287. 224 QRPV-KIAPDAISFCHSG--L-----------VDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVG 289 (563) Q Consensus 224 ~~~i-ki~~daI~y~hSG--l-----------~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvG 289 (563) ..+ .--.+.+++.+.. | .+.....++|-++.|.-.+..|-..-+.+. |..|.=++|||. +.. T Consensus 218 -~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~--~e~~~g~~~i~v-~~~ 293 (500) T protein:vir:98 218 -SEVYKDLKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFM--WEVKMGQRRVAV-PES 293 (500) T ss_pred -ccccCCcCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHH--HHHHhCcceeee-chH Confidence 001 0001222222111 0 122334467999999998888887777665 888887777765 111 Q ss_pred CCchHHHHHHHHHHHHhhcceeEeeCCCCccc-------ccccccchhhhhcccccCCCCccceeecCCCCCcch-HHHH Q lcl|NC_015287. 290 NLPKMKAEQYLKEVMNRYRNKLVYDASTGEIR-------DDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDI 361 (563) Q Consensus 290 nlpk~KAeqYl~~im~k~knklvYd~~TGev~-------d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV 361 (563) =+. ...+..+|+.. +++.+..|-.+ -++ +.-|+.+...---.+ ..-+ T Consensus 294 ~l~------------------~~~~~~~g~~~~~~~~d~~~~~~~~~~~~-----~~~--~~~i~~~~~~ir~e~~~~~l 348 (500) T protein:vir:98 294 LTA------------------LTVRTTDGDVVPRPRFESDQNVYIRMGGR-----DLD--SSAIQDLTTPIRADDYIKAI 348 (500) T ss_pred Hhc------------------ccCCCCCccccCCcccCCCcceEEEcCCC-----CCc--CcceeEeccccChHHHHHHH Confidence 100 01111222110 11122211100 011 122555443221111 2345 Q ss_pred HHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-------cCCCChhH Q lcl|NC_015287. 362 EYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLIL-------KNIVTPED 434 (563) Q Consensus 362 ~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiL-------kgI~t~ee 434 (563) .++.+.+=.+.+++-+.|..+++| .--|+||.-.+-.-..-+.+.|+.|...+.++++.=|-+ .|.... T Consensus 349 ~~~l~~i~~~~gls~~~~~~~~~g--~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~-- 424 (500) T protein:vir:98 349 NEGLSLFEMQIGVSAGLFSFDGKS--MKTATEIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPS-- 424 (500) T ss_pred HHHHHHHHHHhCCCccccccCcCc--cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCC-- Confidence 567777777788888888654322 223566654444455567777777777777776665432 232222 Q ss_pred HHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCC Q lcl|NC_015287. 435 WEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLAD 514 (563) Q Consensus 435 w~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~ 514 (563) ...+.++|. |+.+.. +++++ +..+.+++ - | .+|..+.+.+....+|+|.+++.++|++|... .. T Consensus 425 ----~~~v~v~f~-d~i~~d-~~~~~-~~~~~~v~----a-G-i~s~~~~i~~~~g~~eeea~~~l~~i~~E~~~-~~-- 488 (500) T protein:vir:98 425 ----MDNISISLD-DGVFTD-RDAEL-DYWIKVVN----A-G-FGTREMAIQKVLNVTEEKAQEIAAEINTGIVD-EI-- 488 (500) T ss_pred ----CcceEEEeC-CCCCCC-HHHHH-HHHHHHHH----c-C-CCCHHHHHHhcCCCCHHHHHHHHHHHHHhccc-cC-- Confidence 134788885 444433 22222 11112211 1 3 47888877777789999999999999887421 11 Q ss_pred CcccCcccccccccc Q lcl|NC_015287. 515 PMEVQQLQLGVHPEQ 529 (563) Q Consensus 515 p~~~~~~~~~~~~~~ 529 (563) ...++.++..|+ T Consensus 489 ---~~~~~~~~~~g~ 500 (500) T protein:vir:98 489 ---NQQRTDTHLYGE 500 (500) T ss_pred ---CCCCccccccCC Confidence 111111111111 No 75 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=98.08 E-value=5.2e-06 Score=49.57 Aligned_cols=425 Identities=10% Similarity=0.071 Sum_probs=170.8 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchh-hhHHHHHHHHHHHhhccchh--------- Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVF-KSEFELIKRYREMALHPEVD--------- 70 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~-~~~~~LI~~YR~ma~~PEvd--------- 70 (563) |++.|=- ..+. ..+..+-.......... .....+ ..-..-+++|+.+..+.+-. T Consensus 1 ~~~~~~~------------~~~~-~~~~~~~~~~~~~~~~~---~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~ 64 (468) T protein:vir:96 1 MIDIFWP------------NEKP-YHERVVEQIKPQYETQE---EMILRLITKHKENVEDITVGERYYNHQPDVLFNAPK 64 (468) T ss_pred CccccCC------------cCce-eehheeecccccccCcH---HHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccc Confidence 3322110 0000 00000000000000000 000000 00111223333332222211 Q ss_pred ------------------HHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHh Q lcl|NC_015287. 71 ------------------SAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWY 132 (563) Q Consensus 71 ------------------~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WY 132 (563) +-...||+-. +.=-...||.+..++. ...+.+..+++ -+|+....++.+.++ T Consensus 65 ~~~~~~~~~~~~~~ki~~n~~~~Iv~~~-~~~l~g~p~~~~~~d~--------~~~~~l~~~~~-n~~~~~~~~~~~~~~ 134 (468) T protein:vir:96 65 RNVKGEIDPFKPDWRMYTNYHQNLVDQK-VAYAVANPVTYGTEDE--------KSLKTIQEVLN-HKWDDKLVDILTAAS 134 (468) T ss_pred ccccccccccccccccccchHHHHHHHH-HhhhccCCceeccCCh--------HHHHHHHHHHh-cCHHHHHHHHHHHHh Confidence 1111122211 1111245666665542 23344455553 478888999999999 Q ss_pred hhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceee---ecccc--cccccccccceeeecCcccc Q lcl|NC_015287. 133 VDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQV---NSKNP--ADIENPEIEEYYQYDPSQTQ 207 (563) Q Consensus 133 vDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~---~~~~~--~~~~~~~~~eyf~y~~~~~~ 207 (563) +.|+-|.+.-+|. +|-..+..+||+.+..|.. +...+..+-.+ ..... .....++...+|.+...... T Consensus 135 ~~G~~~~~v~~d~----~~~~~i~~~~p~~~~~v~~---~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 207 (468) T protein:vir:96 135 NKGVEWIQPYVDE----QGEFKTFRVPAEQAIPIWT---NKERDELKAFIRLYELDGGERVEYWTANDVTFYELKDGQLI 207 (468) T ss_pred hcCeEEEEEEEcC----CCceEEEEEcccceEEEEc---CCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCcee Confidence 9999999876663 4678899999999877732 11111211111 11000 00111111122222211100 Q ss_pred ccccccccCcCccccCcceeeeccc-------ceeeeecceecCCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCc Q lcl|NC_015287. 208 GKNALGAIGQTPFSTKQRPVKIAPD-------AISFCHSGLVDRNKQTILSYLHKSIKALNQLRM-IEDSLVIYRLSRAP 279 (563) Q Consensus 208 ~~~~~g~~g~~~~~~~~~~iki~~d-------aI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m-~EDalVIyRi~RAP 279 (563) ... ....... ......... +|++. .|.....|-++..+.....|.+ +-+..-.-+.++.| T Consensus 208 ~~~---~~~~~~~---~~~~~~~~~~~~~~~iPvv~~------~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p 275 (468) T protein:vir:96 208 PDY---YQGEEHV---QAHYYVGNKSMSWNRVPFIPF------KNNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATEL 275 (468) T ss_pred ecc---ccccccc---ccceeeccccccCCcccEEEe------cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCc Confidence 000 0000000 000011111 22221 1233345667766666555543 33334445666666 Q ss_pred cceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchH- Q lcl|NC_015287. 280 ERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGEL- 358 (563) Q Consensus 280 eRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei- 358 (563) -+-+.-.+... .++.+.. |..+ +. ++++- +++.+ +..|....+.... T Consensus 276 ~lv~~g~~~~~-----~~~~~~~-~~~~--~~---------------------i~~~~-d~~~~--~~~l~~~~~~~~~~ 323 (468) T protein:vir:96 276 IYVLKGYEGED-----LEEFMYN-LKYY--KA---------------------INVDG-DGSGG--VDTIQIDVPVQSAK 323 (468) T ss_pred eeeeecCCccc-----cchhhhh-hhcC--ce---------------------EEecC-CCCCc--ceEEeecCChHHHH Confidence 54433221111 1111111 1111 11 12222 22222 4444333333332 Q ss_pred HHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHH Q lcl|NC_015287. 359 SDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILR-DEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEV 437 (563) Q Consensus 359 ~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItR-DElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~ 437 (563) .-++-+.+.+|+..++|- +..+ +|....|++-.+ .......-+.+.+..|...+.++++.=+-+.|+. -+| T Consensus 324 ~~~~~l~~~I~~~s~~p~--~~~~--~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~--~d~-- 395 (468) T protein:vir:96 324 EYLDMLRDYVIEFGQGVD--FQQD--KFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKLS--IKV-- 395 (468) T ss_pred HHHHHHHHHHHHHhCccc--cccc--ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--ccc-- Confidence 346778888999999983 3222 222112222111 1111334467777777777777776655555542 233 Q ss_pred hhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcc Q lcl|NC_015287. 438 LGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPME 517 (563) Q Consensus 438 ~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~ 517 (563) ..|.+.|...---.+...++ ++.++ | .+|.+++++.+-..+| .+++.++|++|...-.-.+++. T Consensus 396 --~~i~i~f~~~~p~d~~e~a~-------~~~~~----g-~iS~et~i~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~ 459 (468) T protein:vir:96 396 --QDVEITFNFNVMVNELEQSQ-------IGVNS----Q-YLSKETVVTNHPWVDD--PVAEMERIDQEELALPSIEEGL 459 (468) T ss_pred --ceeeEEecCCCCcCHHHHHH-------HHHhc----C-CCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHhhcc Confidence 34677776544333332222 23332 3 6899999987644443 4455555655543211111111 Q ss_pred cCccccccccccCCCCcCCCC Q lcl|NC_015287. 518 VQQLQLGVHPEQMPGGAMNPQ 538 (563) Q Consensus 518 ~~~~~~~~~~~~~~~~~~~p~ 538 (563) .+..+. .|- T Consensus 460 ~~~~~~------------~~~ 468 (468) T protein:vir:96 460 NGKENN------------EPT 468 (468) T ss_pred CCCCCC------------CCC Confidence 111111 111 No 76 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=98.07 E-value=5.4e-06 Score=49.47 Aligned_cols=443 Identities=12% Similarity=0.032 Sum_probs=184.3 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhH--HHHHHHHHHHhhccchhHH------ Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSE--FELIKRYREMALHPEVDSA------ 72 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~--~~LI~~YR~ma~~PEvd~A------ 72 (563) |+-|-+=.+.... .. .++-|.+..-.. ..-.++-+. .....+|+.+..+.+-+.. T Consensus 1 ~~~~~~~~~~~~~---~~--~~~~~~~~~~~~------------~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~~~~~ 63 (470) T protein:vir:99 1 MKDINYGRDKVTG---NS--SFIFPKGEKLTS------------NELLGFIAYNETVLKPRYRENMKLYLGKHKILTAPE 63 (470) T ss_pred CccccCCcccccC---Cc--eEEeCCCCCcCH------------HHHHHHHHHHHHhhHHHHHHHHHHhccccccccCcc Confidence 3322221111100 00 111110100000 000001011 0111234444443332211 Q ss_pred -------------HHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhhe Q lcl|NC_015287. 73 -------------IEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYY 139 (563) Q Consensus 73 -------------i~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~f 139 (563) ...||+. .+.=--..||.+...+ .+.-.+.+..+..-.+|+....++.+..++.|+-|. T Consensus 64 ~~~~~~~ki~~n~~~~Ivd~-~~~~l~g~p~~~~~~~-------d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~ 135 (470) T protein:vir:99 64 KETGADNRIVVNSAKYVVDV-YNGYFCGIEPKLALLN-------DSSKIDEIARWNRQENFFDTINEISKQCDIFGRSIA 135 (470) T ss_pred cccCCcceeecchHHHHHHH-HhhhhccCCeeEeeCC-------chhHHHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEE Confidence 1122221 1111133455555432 112345667777778999999999999999999988 Q ss_pred eeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCc Q lcl|NC_015287. 140 HKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTP 219 (563) Q Consensus 140 hkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~ 219 (563) +.-+| ++|-..+..+||+.+..+..-.........+...+..... .....+.+|.+.........+...... T Consensus 136 ~v~~d----~dg~~~i~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~ 207 (470) T protein:vir:99 136 SIYQG----EDARPHLMYSSPNHAFIIYDDTVQRQPLAFVHYQIDNSNN----WTDAYGVIQYADKFYKFKGYDIEEDTN 207 (470) T ss_pred EEEeC----CCCeEEEEEEccceeEEEEcCCCCcceEEEEEEEEEecCC----eeEEEEEEEecCeEEEEEecccccccc Confidence 87665 3577889999999988773221111111111111110000 001112233222211111100000000 Q ss_pred cccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHH Q lcl|NC_015287. 220 FSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQ 298 (563) Q Consensus 220 ~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~-m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeq 298 (563) ..... .--+..=+|++. .|.....|=++..+....-+. ++=+....-+..+.|.+-+.-. .++..+.-+ T Consensus 208 ~~~~~-~~~~g~vPvv~~------~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~---~~~~~~~g~ 277 (470) T protein:vir:99 208 AAGYA-INPYGLVPAVEF------FENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGF---KLPEDDEGN 277 (470) T ss_pred ccccc-ccCCCccceEee------cCCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecC---Ccccccccc Confidence 00000 000111122221 122223455555444444443 4555566667777776655432 222211101 Q ss_pred HHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHH-HHHHHHHHHHHhcCCChh Q lcl|NC_015287. 299 YLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELS-DIEYFQKKLYRSLGVPES 377 (563) Q Consensus 299 Yl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~-DV~YF~~kLy~aL~VP~s 377 (563) -+..+ -.+++++ +|=.+++.|..+.+|....+..... -+.-+.+.+|...++|-. T Consensus 278 ~~~~~---~~~~~~~---------------------~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~ 333 (470) T protein:vir:99 278 PKFDF---KNNRVLY---------------------VSQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNI 333 (470) T ss_pred hhhhh---hhcceee---------------------ecCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCccc Confidence 11110 0122221 2222334455677787666666554 377888999999999942 Q ss_pred hccCCCcceecccc--chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHH Q lcl|NC_015287. 378 RIAGSGDGFNLGRS--SEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAEL 455 (563) Q Consensus 378 Rl~~~~~g~~~g~~--~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~El 455 (563) -+ + +|. |.. ..|...+.....-+.+.+..|...+.++++.-+-+-+.....+++. ..|.+.|...-.-.+. T Consensus 334 ~~--~--~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~--~~i~v~f~~~~p~~~~ 406 (470) T protein:vir:99 334 QD--K--NFA-GNSSGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNNKQDQELW--SELDFKFTRNLPEDMA 406 (470) T ss_pred cc--c--ccc-cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccc--ccceEEeCCCCCcCHH Confidence 22 2 221 222 2333333334455677777777777777765443333333232322 3588888544333332 Q ss_pred HHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccC-hhhHHHHHHHHHHHhcC-CCCCCCcccCccccccccccCCCC Q lcl|NC_015287. 456 KETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQK-DEEVIEIDKQMEKEIKD-GKLADPMEVQQLQLGVHPEQMPGG 533 (563) Q Consensus 456 Ke~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~t-deei~e~~~qie~E~~~-~~~~~p~~~~~~~~~~~~~~~~~~ 533 (563) +.++++..+. | .+|.++++..+-... ++|+++++++-++.... ..+-.|.... +..+.+ T Consensus 407 -------e~a~~~~kl~---g-iis~et~l~~l~~vd~~~E~eri~~E~~~~~~~~~~~~~~~d~~--------~~d~~~ 467 (470) T protein:vir:99 407 -------SAIDNAKNAE---G-IVSKKTQLGMIPDIEPDAEMKQIAKEKADAIKQTQQLSMPIDIL--------KRDNNA 467 (470) T ss_pred -------HHHHHHHHHh---c-cCCHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCCcC--------CCCCCc Confidence 3444455543 5 389999998765432 24444443332221110 0000000000 000111 Q ss_pred cCC Q lcl|NC_015287. 534 AMN 536 (563) Q Consensus 534 ~~~ 536 (563) +++ T Consensus 468 ee~ 470 (470) T protein:vir:99 468 EEE 470 (470) T ss_pred cCC Confidence 111 No 77 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=98.07 E-value=5.5e-06 Score=49.42 Aligned_cols=422 Identities=11% Similarity=0.058 Sum_probs=188.4 Q ss_pred cccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchh Q lcl|NC_015287. 43 YVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDD 122 (563) Q Consensus 43 ~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~ 122 (563) |+- ++..+-.+.-++++-..=+.-+|+..++=+ +.+- +...|.+. .+....|.+-.+|+. T Consensus 1 ~l~-----~~~~~~~~~~~~~~v~n~~~~ivd~~~~~l-~~~g------f~~~d~~~--------~~~~~~i~~~N~~d~ 60 (434) T protein:vir:98 1 MLP-----KNAEQAFLDFQRKARTNFCGLIANASVHRL-LALG------VTGPDGEP--------DTRASRWWQANRLDS 60 (434) T ss_pred CCC-----CCccHHHHHhhhhhhccchHHHHHHHHhhh-ccCc------eecCCCch--------HHHHHHHHHhcChhH Confidence 111 111111222222233345566677666521 1111 12233222 233455677788999 Q ss_pred hhHHHHHHHhhhhhhheeeeecCCC---CCcceeEEEEeCCccceeeeeeeccCCccccceeeecc---cc-cc-ccccc Q lcl|NC_015287. 123 KCHEILRNWYVDGRIYYHKVIDIKA---PEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSK---NP-AD-IENPE 194 (563) Q Consensus 123 ~~~~~~R~WYvDGri~fhkviD~~~---~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~---~~-~~-~~~~~ 194 (563) ...++++.-++.||-|+..-.|+.. ..++-..++.+||+.+..++--... .....+.+.... .. .. +.+.. T Consensus 61 ~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~-~~~~ai~~~~~~~~~~~~~~~~~~~~ 139 (434) T protein:vir:98 61 RQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETG-EPLVGLKVWHNDIDGFGYARVFFDDT 139 (434) T ss_pred HHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCC-ceEEEEEEEEeccCCceEEEEEEeCc Confidence 9999999999999999976555321 1122344778999887666421111 111112111100 00 00 00111 Q ss_pred ccceeeecCccccccccccccCcCccccCcceeeeccc-ceeeeecceecCCCCccchhHHHHHHHHHHH-HHHHHHHHH Q lcl|NC_015287. 195 IEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPD-AISFCHSGLVDRNKQTILSYLHKSIKALNQL-RMIEDSLVI 272 (563) Q Consensus 195 ~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~d-aI~y~hSGl~d~~~~~~~syL~~Aik~~NqL-~m~EDalVI 272 (563) ..-|+......................+....-.+..= .|.|++.--.+. -+.|=++..+....-+ +++-+.+++ T Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~---~g~sd~e~vi~liDa~~~~~s~~~~~ 216 (434) T protein:vir:98 140 SFPYRTRERTGARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGE---DPEPEFAGVLDIQDRVNLGILNRMAA 216 (434) T ss_pred EEEEEEeeccccccccccccceecccccccccCCCCccceEEeccCCCcCc---CCcchhhhHHHHHHHHHHHHHHHHHH Confidence 11111111100000000000000000000000001111 222454422211 1234444433333322 345567777 Q ss_pred HHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCC Q lcl|NC_015287. 273 YRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG 352 (563) Q Consensus 273 yRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg 352 (563) -+.+-.|.|-+. |--+.. .-|...+.+.....++.-....|+.- +-++++..+++. T Consensus 217 ~~~~a~p~~~i~----G~~~~~-----------------~~~~~~~~~~~~~~~~~~~~~i~~~~---~~~~~~~q~~~~ 272 (434) T protein:vir:98 217 SRFSGFRQKWIK----GHKFAK-----------------RTDPATGMTVVDQPFVPSPSAVWASE---GENTQFGQLDAT 272 (434) T ss_pred HHHhcchhhhhc----CCCccc-----------------ccccccccchhhhhhhccccccccCC---CCCceEEEecCc Confidence 777777766553 211110 00122222222222222222345542 224567777764 Q ss_pred CCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccc--cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_015287. 353 QNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGR--SSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIV 430 (563) Q Consensus 353 ~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~--~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~ 430 (563) .-=+=++-++-.-..+....++|.+-|..+ .+. +..|.-.+...-.-+.+.|+.|..-+.++++.-+.+.|+ T Consensus 273 ~~~~~~~~l~~~i~~~~~~~~~p~~~~~~~-----~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~g~- 346 (434) T protein:vir:98 273 DLSGFLKEHASDVRDMLTISQTPTYLYATD-----LVNISADTIGALDILHVAKVREHIASFSEGLESVLALAAAQAGV- 346 (434) T ss_pred chHHHHHHHHHHHHHHhcccCCCHHHhccc-----cCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC- Confidence 322223345666777888889998888422 222 223444444567778888999999999999887777775 Q ss_pred ChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCC Q lcl|NC_015287. 431 TPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDG 510 (563) Q Consensus 431 t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~ 510 (563) +. +| ..+++.|..-..=+ +.+.++++..+..- | +|.++++ +.|.++++|++++.++.+++.... T Consensus 347 ~~-~~----~~~~v~w~~~~~~s-------~~~~ada~~kl~~~-g--~~~e~~~-~~lg~~~~e~~r~~~e~~~~~~~~ 410 (434) T protein:vir:98 347 PE-DY----TEAEVRWANPAHVT-------MAVKADAATKLKSI-G--YPLDVIA-EELDESPARVRRIVAGAASQALLA 410 (434) T ss_pred Ch-hh----eeeeEEecCCCCCC-------HHHHHHHHHHHHhc-C--CcHHHHH-HhCCCCHHHHHHHHHHHHHHHHHH Confidence 22 23 24778886533222 23455666665442 2 5777766 557899999887776655543221 Q ss_pred CCCCCcccCccccccccccCCCCcCCCCCcccccccCCC Q lcl|NC_015287. 511 KLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEG 549 (563) Q Consensus 511 ~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~ 549 (563) ....+ ...+ +.+| ++|+. .+++++ T Consensus 411 ~~~~~-~~~~---------~~~g----~~~~~-~~~~dg 434 (434) T protein:vir:98 411 ASLLP-APGA---------PSAG----NVPDS-GGAVDG 434 (434) T ss_pred Hhhhc-cCCC---------CCCC----CCCcc-cCCCCC Confidence 11111 0000 0111 11111 111111 No 78 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=98.06 E-value=5.6e-06 Score=49.36 Aligned_cols=417 Identities=11% Similarity=0.131 Sum_probs=182.1 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) |..||+|. ++...+ ++.. +.++.....-.|.....+.+ | -+..+++|.|.+||+-|.+.+ T Consensus 7 ~~~~~~~~-~r~~~~------~~~~-~~~~~~~~~~~g~~~~~~~v-----~-------~~~al~~~~v~~~i~~ia~~i 66 (432) T protein:vir:10 7 VKKFFNFE-KRQTSQ------VIEL-NKDDEKLLEWLGISPSTISV-----K-------GKNALKVATVFACIKILSESV 66 (432) T ss_pred HHHhcCcc-ccCccc------cccc-CCchHHHHHHhCCCcCcccc-----c-------hhhhhccHHHHHHHHHHHHhh Confidence 55566662 221111 1111 11111111001111111111 1 123467899999999999866 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcchhhhH----HHHHHHhhhhhhheeeeecCCCCCcceeEE Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFDDKCH----EILRNWYVDGRIYYHKVIDIKAPEEGIKEV 155 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~~~~~----~~~R~WYvDGri~fhkviD~~~~k~GI~eL 155 (563) .- -|+.+--++ + ...+.........+|+. =+-...++ .++..+.+.|.-|..++.|.. .-+++| T Consensus 67 a~-----lp~~~~~~~-~--~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~---G~~~~L 135 (432) T protein:vir:10 67 SK-----LPLKIYQED-E--YGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRK---GKVQAL 135 (432) T ss_pred cc-----CceEEEEec-C--CceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC---CcEEEE Confidence 43 244432221 1 11111222333334432 11223333 445555667988888877643 238999 Q ss_pred EEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeeccccee Q lcl|NC_015287. 156 RYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAIS 235 (563) Q Consensus 156 r~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~ 235 (563) .+|+|..|..++.-. ++.. ....-||.+... .....++.+.|. T Consensus 136 ~~i~~~~v~v~~d~~-------~~~~-----------~~~~~~y~~~~~-------------------g~~~~~~~~eii 178 (432) T protein:vir:10 136 WPIDASKVTVYIDDV-------GLLN-----------SKTKMWYVVNTG-------------------GQQRVLKPEEIL 178 (432) T ss_pred EEEcCceeEEEEcCc-------cccc-----------ccceEEEEEecC-------------------CeEEEEccccEE Confidence 999998887553210 1100 011224444321 112457777775 Q ss_pred eeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeC Q lcl|NC_015287. 236 FCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDA 315 (563) Q Consensus 236 y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~ 315 (563) |..-+ ...++-.+.|.|..|++++.....+++...=+----+.-+-|..++ +.|.+..+++..+.+...|.. T Consensus 179 h~r~~-~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g------ 250 (432) T protein:vir:10 179 HFKNG-ITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-GDLNEDAKKVFRENFESMSSG------ 250 (432) T ss_pred EecCC-CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhcc------ Confidence 55433 2345556789999999999999888887666544445556777776 467776666655554444432 Q ss_pred CCCcccccccccchhhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchh Q lcl|NC_015287. 316 STGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG-QNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEI 394 (563) Q Consensus 316 ~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eI 394 (563) +++..+.+ .++ .|.++..|.-. ..+.-++-.++..+.+.++++||...|...+++ +. +.+ T Consensus 251 ----~~n~~~~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~-~~---s~~ 311 (432) T protein:vir:10 251 ----LQNSHRIA-LMP----------VGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKA-TL---NNI 311 (432) T ss_pred ----cccCCcce-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC-Cc---ccH Confidence 11111211 221 24556665421 122223445677899999999999999532111 11 112 Q ss_pred hHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_015287. 395 LRDEIKFTKF-VGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDP 473 (563) Q Consensus 395 tRDElKF~Kf-I~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p 473 (563) ...-+.|.++ |.-+-.++.. .|- +.++++.+|. ..+.+.|..+. |... -+..|++++..+-. T Consensus 312 e~~~~~~~~~~l~P~~~~ie~----~ln-----~kLl~~~~~~---~g~~~~fd~~~----l~~~-d~~~~~~~~~~~~~ 374 (432) T protein:vir:10 312 EQQQQQFYTDTLQATLTMYEQ----EMT-----YKLFLDSELD---KGFYSKFNVDA----ILRA-DIKTRYEAYRTGIQ 374 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHH----HHH-----HhhcChhhcC---CCcEEEeechh----hhcC-CHHHHHHHHHHHHh Confidence 2222224332 2223222222 222 2334555554 23445554322 2111 12345555544332 Q ss_pred hhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCccc Q lcl|NC_015287. 474 YVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQ 553 (563) Q Consensus 474 ~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~ 553 (563) - -++|.+-++. ++.+.+ ++..+.--.+. +- .|.+- .......++++ T Consensus 375 ~--G~~t~NE~R~-~~g~~p------------------i~ggD~~~~~~-----n~------~~~~~--~~~~~~k~~~~ 420 (432) T protein:vir:10 375 G--GFLKPNEARS-KEDLPP------------------EAGGDRLLVNG-----NM------LPIDM--AGQAYLKGGDT 420 (432) T ss_pred C--CCcCHHHHHH-HhCCCC------------------CCCCCeEeecc-----cc------cchhh--ccccccCCCCC Confidence 2 2444444433 222222 21100000000 00 01100 00000000111 Q ss_pred CCCCC-CCCCC Q lcl|NC_015287. 554 APQMP-EGGEI 563 (563) Q Consensus 554 ~~~~~-~~~~~ 563 (563) ..+.. +|+|= T Consensus 421 ~~~~~~~~~~~ 431 (432) T protein:vir:10 421 NGEVSKEGNEG 431 (432) T ss_pred CCCCCCCCCCC Confidence 11100 11111 No 79 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=98.06 E-value=5.6e-06 Score=49.36 Aligned_cols=417 Identities=11% Similarity=0.131 Sum_probs=182.1 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) |..||+|. ++...+ ++.. +.++.....-.|.....+.+ | -+..+++|.|.+||+-|.+.+ T Consensus 7 ~~~~~~~~-~r~~~~------~~~~-~~~~~~~~~~~g~~~~~~~v-----~-------~~~al~~~~v~~~i~~ia~~i 66 (432) T protein:vir:10 7 VKKFFNFE-KRQTSQ------VIEL-NKDDEKLLEWLGISPSTISV-----K-------GKNALKVATVFACIKILSESV 66 (432) T ss_pred HHHhcCcc-ccCccc------cccc-CCchHHHHHHhCCCcCcccc-----c-------hhhhhccHHHHHHHHHHHHhh Confidence 55566662 221111 1111 11111111001111111111 1 123467899999999999866 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcchhhhH----HHHHHHhhhhhhheeeeecCCCCCcceeEE Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFDDKCH----EILRNWYVDGRIYYHKVIDIKAPEEGIKEV 155 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~~~~~----~~~R~WYvDGri~fhkviD~~~~k~GI~eL 155 (563) .- -|+.+--++ + ...+.........+|+. =+-...++ .++..+.+.|.-|..++.|.. .-+++| T Consensus 67 a~-----lp~~~~~~~-~--~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~---G~~~~L 135 (432) T protein:vir:10 67 SK-----LPLKIYQED-E--YGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRK---GKVQAL 135 (432) T ss_pred cc-----CceEEEEec-C--CceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC---CcEEEE Confidence 43 244432221 1 11111222333334432 11223333 445555667988888877643 238999 Q ss_pred EEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeeccccee Q lcl|NC_015287. 156 RYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAIS 235 (563) Q Consensus 156 r~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~ 235 (563) .+|+|..|..++.-. ++.. ....-||.+... .....++.+.|. T Consensus 136 ~~i~~~~v~v~~d~~-------~~~~-----------~~~~~~y~~~~~-------------------g~~~~~~~~eii 178 (432) T protein:vir:10 136 WPIDASKVTVYIDDV-------GLLN-----------SKTKMWYVVNTG-------------------GQQRVLKPEEIL 178 (432) T ss_pred EEEcCceeEEEEcCc-------cccc-----------ccceEEEEEecC-------------------CeEEEEccccEE Confidence 999998887553210 1100 011224444321 112457777775 Q ss_pred eeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeC Q lcl|NC_015287. 236 FCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDA 315 (563) Q Consensus 236 y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~ 315 (563) |..-+ ...++-.+.|.|..|++++.....+++...=+----+.-+-|..++ +.|.+..+++..+.+...|.. T Consensus 179 h~r~~-~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g------ 250 (432) T protein:vir:10 179 HFKNG-ITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-GDLNEDAKKVFRENFESMSSG------ 250 (432) T ss_pred EecCC-CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhcc------ Confidence 55433 2345556789999999999999888887666544445556777776 467776666655554444432 Q ss_pred CCCcccccccccchhhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchh Q lcl|NC_015287. 316 STGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG-QNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEI 394 (563) Q Consensus 316 ~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eI 394 (563) +++..+.+ .++ .|.++..|.-. ..+.-++-.++..+.+.++++||...|...+++ +. +.+ T Consensus 251 ----~~n~~~~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~-~~---s~~ 311 (432) T protein:vir:10 251 ----LQNSHRIA-LMP----------VGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKA-TL---NNI 311 (432) T ss_pred ----cccCCcce-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC-Cc---ccH Confidence 11111211 221 24556665421 122223445677899999999999999532111 11 112 Q ss_pred hHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_015287. 395 LRDEIKFTKF-VGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDP 473 (563) Q Consensus 395 tRDElKF~Kf-I~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p 473 (563) ...-+.|.++ |.-+-.++.. .|- +.++++.+|. ..+.+.|..+. |... -+..|++++..+-. T Consensus 312 e~~~~~~~~~~l~P~~~~ie~----~ln-----~kLl~~~~~~---~g~~~~fd~~~----l~~~-d~~~~~~~~~~~~~ 374 (432) T protein:vir:10 312 EQQQQQFYTDTLQATLTMYEQ----EMT-----YKLFLDSELD---KGFYSKFNVDA----ILRA-DIKTRYEAYRTGIQ 374 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHH----HHH-----HhhcChhhcC---CCcEEEeechh----hhcC-CHHHHHHHHHHHHh Confidence 2222224332 2223222222 222 2334555554 23445554322 2111 12345555544332 Q ss_pred hhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCccc Q lcl|NC_015287. 474 YVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQ 553 (563) Q Consensus 474 ~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~ 553 (563) - -++|.+-++. ++.+.+ ++..+.--.+. +- .|.+- .......++++ T Consensus 375 ~--G~~t~NE~R~-~~g~~p------------------i~ggD~~~~~~-----n~------~~~~~--~~~~~~k~~~~ 420 (432) T protein:vir:10 375 G--GFLKPNEARS-KEDLPP------------------EAGGDRLLVNG-----NM------LPIDM--AGQAYLKGGDT 420 (432) T ss_pred C--CCcCHHHHHH-HhCCCC------------------CCCCCeEeecc-----cc------cchhh--ccccccCCCCC Confidence 2 2444444433 222222 21100000000 00 01100 00000000111 Q ss_pred CCCCC-CCCCC Q lcl|NC_015287. 554 APQMP-EGGEI 563 (563) Q Consensus 554 ~~~~~-~~~~~ 563 (563) ..+.. +|+|= T Consensus 421 ~~~~~~~~~~~ 431 (432) T protein:vir:10 421 NGEVSKEGNEG 431 (432) T ss_pred CCCCCCCCCCC Confidence 11100 11111 No 80 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=98.06 E-value=5.6e-06 Score=49.36 Aligned_cols=417 Identities=11% Similarity=0.131 Sum_probs=182.1 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) |..||+|. ++...+ ++.. +.++.....-.|.....+.+ | -+..+++|.|.+||+-|.+.+ T Consensus 7 ~~~~~~~~-~r~~~~------~~~~-~~~~~~~~~~~g~~~~~~~v-----~-------~~~al~~~~v~~~i~~ia~~i 66 (432) T protein:vir:10 7 VKKFFNFE-KRQTSQ------VIEL-NKDDEKLLEWLGISPSTISV-----K-------GKNALKVATVFACIKILSESV 66 (432) T ss_pred HHHhcCcc-ccCccc------cccc-CCchHHHHHHhCCCcCcccc-----c-------hhhhhccHHHHHHHHHHHHhh Confidence 55566662 221111 1111 11111111001111111111 1 123467899999999999866 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcchhhhH----HHHHHHhhhhhhheeeeecCCCCCcceeEE Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFDDKCH----EILRNWYVDGRIYYHKVIDIKAPEEGIKEV 155 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~~~~~----~~~R~WYvDGri~fhkviD~~~~k~GI~eL 155 (563) .- -|+.+--++ + ...+.........+|+. =+-...++ .++..+.+.|.-|..++.|.. .-+++| T Consensus 67 a~-----lp~~~~~~~-~--~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~---G~~~~L 135 (432) T protein:vir:10 67 SK-----LPLKIYQED-E--YGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRK---GKVQAL 135 (432) T ss_pred cc-----CceEEEEec-C--CceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC---CcEEEE Confidence 43 244432221 1 11111222333334432 11223333 445555667988888877643 238999 Q ss_pred EEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeeccccee Q lcl|NC_015287. 156 RYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAIS 235 (563) Q Consensus 156 r~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~ 235 (563) .+|+|..|..++.-. ++.. ....-||.+... .....++.+.|. T Consensus 136 ~~i~~~~v~v~~d~~-------~~~~-----------~~~~~~y~~~~~-------------------g~~~~~~~~eii 178 (432) T protein:vir:10 136 WPIDASKVTVYIDDV-------GLLN-----------SKTKMWYVVNTG-------------------GQQRVLKPEEIL 178 (432) T ss_pred EEEcCceeEEEEcCc-------cccc-----------ccceEEEEEecC-------------------CeEEEEccccEE Confidence 999998887553210 1100 011224444321 112457777775 Q ss_pred eeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeC Q lcl|NC_015287. 236 FCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDA 315 (563) Q Consensus 236 y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~ 315 (563) |..-+ ...++-.+.|.|..|++++.....+++...=+----+.-+-|..++ +.|.+..+++..+.+...|.. T Consensus 179 h~r~~-~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g------ 250 (432) T protein:vir:10 179 HFKNG-ITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-GDLNEDAKKVFRENFESMSSG------ 250 (432) T ss_pred EecCC-CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhcc------ Confidence 55433 2345556789999999999999888887666544445556777776 467776666655554444432 Q ss_pred CCCcccccccccchhhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchh Q lcl|NC_015287. 316 STGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG-QNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEI 394 (563) Q Consensus 316 ~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eI 394 (563) +++..+.+ .++ .|.++..|.-. ..+.-++-.++..+.+.++++||...|...+++ +. +.+ T Consensus 251 ----~~n~~~~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~-~~---s~~ 311 (432) T protein:vir:10 251 ----LQNSHRIA-LMP----------VGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKA-TL---NNI 311 (432) T ss_pred ----cccCCcce-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC-Cc---ccH Confidence 11111211 221 24556665421 122223445677899999999999999532111 11 112 Q ss_pred hHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_015287. 395 LRDEIKFTKF-VGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDP 473 (563) Q Consensus 395 tRDElKF~Kf-I~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p 473 (563) ...-+.|.++ |.-+-.++.. .|- +.++++.+|. ..+.+.|..+. |... -+..|++++..+-. T Consensus 312 e~~~~~~~~~~l~P~~~~ie~----~ln-----~kLl~~~~~~---~g~~~~fd~~~----l~~~-d~~~~~~~~~~~~~ 374 (432) T protein:vir:10 312 EQQQQQFYTDTLQATLTMYEQ----EMT-----YKLFLDSELD---KGFYSKFNVDA----ILRA-DIKTRYEAYRTGIQ 374 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHH----HHH-----HhhcChhhcC---CCcEEEeechh----hhcC-CHHHHHHHHHHHHh Confidence 2222224332 2223222222 222 2334555554 23445554322 2111 12345555544332 Q ss_pred hhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCccc Q lcl|NC_015287. 474 YVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQ 553 (563) Q Consensus 474 ~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~ 553 (563) - -++|.+-++. ++.+.+ ++..+.--.+. +- .|.+- .......++++ T Consensus 375 ~--G~~t~NE~R~-~~g~~p------------------i~ggD~~~~~~-----n~------~~~~~--~~~~~~k~~~~ 420 (432) T protein:vir:10 375 G--GFLKPNEARS-KEDLPP------------------EAGGDRLLVNG-----NM------LPIDM--AGQAYLKGGDT 420 (432) T ss_pred C--CCcCHHHHHH-HhCCCC------------------CCCCCeEeecc-----cc------cchhh--ccccccCCCCC Confidence 2 2444444433 222222 21100000000 00 01100 00000000111 Q ss_pred CCCCC-CCCCC Q lcl|NC_015287. 554 APQMP-EGGEI 563 (563) Q Consensus 554 ~~~~~-~~~~~ 563 (563) ..+.. +|+|= T Consensus 421 ~~~~~~~~~~~ 431 (432) T protein:vir:10 421 NGEVSKEGNEG 431 (432) T ss_pred CCCCCCCCCCC Confidence 11100 11111 No 81 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=98.02 E-value=6.7e-06 Score=48.94 Aligned_cols=456 Identities=15% Similarity=0.113 Sum_probs=210.3 Q ss_pred ccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHH-----------HHHhhccchhHHH Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRY-----------REMALHPEVDSAI 73 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~Y-----------R~ma~~PEvd~Ai 73 (563) ||. |.+-++-.+.-..--|+..+......+.-.+ .+.......... +..+ |.+ ..|--..++ T Consensus 1 ~~~-~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~-~~~~w~~~~~~~~~~~~~-~~~l~~~i~ 73 (518) T protein:vir:78 1 MGV-WSVMTRFIKGWLNGKPNGSEPELIPKYLPLV----PDNQKEWSKDSY-LTSLWAQGYVPTVHDKLM-NSGTGNEIV 73 (518) T ss_pred Ccc-hhhHHHHHHHhhcCCCCccchhccHHHhhhc----ccchhhhhhhhh-hhhhcccCCCCccccccc-cCChHHHHH Confidence 555 3333333332222222222111111110000 000000000000 0000 111 223334455 Q ss_pred HHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCccee Q lcl|NC_015287. 74 EDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIK 153 (563) Q Consensus 74 ~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~ 153 (563) .++++ +|+. .+|+|.+.....+++ +..++..+.+++-.+|.....+.+..+.+-|..+|+..+|.. =. T Consensus 74 ~~~A~--ll~~---e~~~i~v~~~~~~d~--e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~-----~~ 141 (518) T protein:vir:78 74 VVAAE--YISG---KPLSIDVTGVNGSKD--ENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINILNG-----RP 141 (518) T ss_pred HHHHH--hhcC---CCceEEecCccccCc--HHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEECC-----ee Confidence 66665 4433 356666654443332 356778889999999999999999999999999999999842 34 Q ss_pred EEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceee----------------ecCccccccccc----- Q lcl|NC_015287. 154 EVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQ----------------YDPSQTQGKNAL----- 212 (563) Q Consensus 154 eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~----------------y~~~~~~~~~~~----- 212 (563) .+.+++|-++-++... . ..+.++.-... ...+...||. |.+ +...+... T Consensus 142 ~i~~v~ad~~~P~~~~---g---~~~~~~f~~~~---~~~~k~~~y~~lE~he~~~~~~~~~~~~~-~~I~n~ly~~~~~ 211 (518) T protein:vir:78 142 SISVHSSSQFWIDFKN---N---EPFRFNFFEEI---PTSNKADIYYLVESREIKQWDKEGKKLSG-GFVTYSVIKIDGD 211 (518) T ss_pred EEEEEcCCeeEEEeec---C---cEEEEEEEEEe---ecCCcceeEEEEEeeccccccceeecccc-eeEEEEEeeecCc Confidence 7889999998776321 1 11222111000 0000011111 110 00001000 Q ss_pred -cccCc-CccccCcceeeecc---------------cceeeeec---ceecCCCCccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 213 -GAIGQ-TPFSTKQRPVKIAP---------------DAISFCHS---GLVDRNKQTILSYLHKSIKALNQLRMIEDSLVI 272 (563) Q Consensus 213 -g~~g~-~~~~~~~~~iki~~---------------daI~y~hS---Gl~d~~~~~~~syL~~Aik~~NqL~m~EDalVI 272 (563) +..++ -+...... ...++ -.+.|..- -..+..+..++|-|+.|.-++..|-..=+ -+ T Consensus 212 ~~v~~~~~~~~~~l~-~~~~~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s--~~ 288 (518) T protein:vir:78 212 KTTPISAERLPEQIT-SYLHTNDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFT--VY 288 (518) T ss_pred ccccccccccccccc-cccccccCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHH--HH Confidence 00000 00000000 00000 01222111 11222334578999999988888876666 45 Q ss_pred HHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhccccc----CCCC-cccee Q lcl|NC_015287. 273 YRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRR----EGGR-GTEIT 347 (563) Q Consensus 273 yRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRR----eGgr-gTEIs 347 (563) .|..|.=++|||.-+ .=|++ ..+..++.-. ....--.++|.+.. +|+. .+.|+ T Consensus 289 ~~e~~~g~~~i~v~~-~~l~~------------------~~~~~~~~~~---~~fd~~~~~y~~i~~~~~~~~~~~~~i~ 346 (518) T protein:vir:78 289 MREGEKTKTKIAASE-RMFRK------------------KVNKSTDKEE---WSMNVDEDYFMQFKGTLDAGAKLNDMIQ 346 (518) T ss_pred HHHHHhCCceeeech-hHhcc------------------CCCCCCCccc---cccCCCCceEEEecCcCCCCCcccccee Confidence 688888777777621 11110 0011111000 00000112232221 1222 23466 Q ss_pred ecCCCCCcch-HHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015287. 348 TLPGGQNLGE-LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLIL 426 (563) Q Consensus 348 TLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiL 426 (563) ++...=.-.+ ..-+..+.+.+....+++-+-|..++ + . =-++||.+..-+-...+.+.|+.+...+.++++.-|-| T Consensus 347 ~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~-~-~-~TATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l 423 (518) T protein:vir:78 347 FMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGN-R-E-VKATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYL 423 (518) T ss_pred eeecccChHHHHHHHHHHHHHHHHhhCCChhhcCccc-c-c-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6544321111 23467778888888899887774321 1 1 12455555544555678888888888887777764332 Q ss_pred -cCCCChhHHHHhh--hceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHh-ccChhhHHHHHHH Q lcl|NC_015287. 427 -KNIVTPEDWEVLG--DHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKIL-KQKDEEVIEIDKQ 502 (563) Q Consensus 427 -kgI~t~eew~~~~--~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il-~~tdeei~e~~~q 502 (563) +..+-...|.... ..+.++|.. +.+.... ..++.++++.- +-.+|+++.++++. ..+|+|.+++-++ T Consensus 424 ~~~~~~~~~~~~~~~~~~v~i~f~D-~i~~D~~------~~~~~~~~~v~--aGimS~e~~i~~~~~~~~deea~~e~~r 494 (518) T protein:vir:78 424 LTGGTNNKEKAIMRDEIRVIIEFPD-PMSVNLN------ELSSTLNNMNS--ALAMSVEEKVKLIHPKWEDEEIQAEVKR 494 (518) T ss_pred HHhhcCccccccCCCceeEEEEeCC-CCCCCHH------HHHHHHHHHHh--cCCCCHHHHHHHhCCCCCHHHHHHHHHH Confidence 2221111111111 347777764 3333211 22222222221 13688888776653 6899999999999 Q ss_pred HHHHhcCCCCCCCcccCccccccccccCCCC Q lcl|NC_015287. 503 MEKEIKDGKLADPMEVQQLQLGVHPEQMPGG 533 (563) Q Consensus 503 ie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~ 533 (563) |++|......++|.+...-++ .+| T Consensus 495 i~~E~~~~~~~~p~~~~g~~~-------~~g 518 (518) T protein:vir:78 495 IYLENAIGEVPDPEAIGGMET-------KGG 518 (518) T ss_pred HHHHhcccCCCCCccccCCCC-------CCC Confidence 999976655554433322111 111 No 82 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=98.02 E-value=6.8e-06 Score=48.92 Aligned_cols=430 Identities=13% Similarity=0.069 Sum_probs=176.6 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) +..|||+.-+.......+. ...|. +++..........+..+ . -+..+++|.|..||+-|.+.+ T Consensus 4 ~~~l~~r~~~~~~~~~~~~--~~~~~-~~~~~~~~~~~~~g~~V-------------~-~~~al~~~~V~~~v~~Ia~~i 66 (457) T protein:vir:13 4 WSALFGRGHSPALDGIEAR--AWEPY-DPSIYNLGAVAASGETV-------------T-PHDALQVSAVFASVRLLSETI 66 (457) T ss_pred hhhhhcccccccccccccc--ccccc-chHHHhhcccccCCcee-------------c-hHHhhccHHHHHHHHHHHHhh Confidence 3334443222111111110 00110 11100000000011111 1 134567899999999999976 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcc---hhhhHHHHHHHh----hhhhhheeeeecCCCCCccee Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNF---DDKCHEILRNWY----VDGRIYYHKVIDIKAPEEGIK 153 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F---~~~~~~~~R~WY----vDGri~fhkviD~~~~k~GI~ 153 (563) ..+ |+.+--++ . ...++.... .++.+|+- ...++++++.+. +.|--|..+ .+. ...++ T Consensus 67 A~l-----p~~~~~~~-~--~~~~~~~~~---~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i-~~~---~g~~~ 131 (457) T protein:vir:13 67 ATL-----PLSTYSKR-G--GSRKEIVTP---EWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAV-RWQ---GPNIV 131 (457) T ss_pred ccC-----ceEEEEec-C--Ccccccccc---hHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEE-Eec---CCcEE Confidence 543 33332221 1 111112122 23344443 234566666554 457766654 332 35789 Q ss_pred EEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccc Q lcl|NC_015287. 154 EVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDA 233 (563) Q Consensus 154 eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~da 233 (563) +|.+|+|..|..++....... ...|+.|..... | . ......++.+. T Consensus 132 ~l~~l~p~~v~v~~~~~~~~~--------------------~~~~~~y~~~~~------~---~-----~~~~~~~~~~d 177 (457) T protein:vir:13 132 GLDVLDPTKIHVHMVMVDGLR--------------------RKVFEAYDIDAD------G---N-----EVLLGWFTPRD 177 (457) T ss_pred EEEEEccCceEEEEecCCCcc--------------------ceeEEEEEEecC------C---c-----eeeEEeeCccc Confidence 999999988877643332110 122333422110 0 0 01113355566 Q ss_pred eeeeecceecCCC-CccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeE Q lcl|NC_015287. 234 ISFCHSGLVDRNK-QTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLV 312 (563) Q Consensus 234 I~y~hSGl~d~~~-~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklv 312 (563) |.|. =..+.++ -.+.|-|..|++.+.....+++...=+=---+--+-|..++ +.|-+...++..+.+...|+.. T Consensus 178 iih~--~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~ls~e~~~~~~~~~~~~~~g~-- 252 (457) T protein:vir:13 178 VLHI--PGMMLPGDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVP-GTMSEEGLARAREAWRAANSGV-- 252 (457) T ss_pred eEEe--cCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcC-CCCCHHHHHHHHHHHHHHhcCc-- Confidence 6333 2344444 35689999999999888888876554444444555666665 5666655555444444433320 Q ss_pred eeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchH---HHHHHHHHHHHHhcCCChhhccCCCcceecc Q lcl|NC_015287. 313 YDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGEL---SDIEYFQKKLYRSLGVPESRIAGSGDGFNLG 389 (563) Q Consensus 313 Yd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei---~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g 389 (563) .+.|. .+ .|+ .|.+++.|. .+..++ +=-+|....+.++++||...|..-. +-+.. T Consensus 253 --~nag~------~~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~~~~ 310 (457) T protein:vir:13 253 --DNAHR------VA-LLT----------EGAKFSKVA--MSPDEAQFLQTRQFQVPEIARIFGVPPHLISDAT-NSTSW 310 (457) T ss_pred --cccCc------ce-ecC----------CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-Ccccc Confidence 01121 11 121 244555552 223332 3334778889999999999994211 11111 Q ss_pred ccchhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 390 RSSEILRDEIKFTKFV-GRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVV 468 (563) Q Consensus 390 ~~~eItRDElKF~KfI-~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~ 468 (563) . +.+...-+.|.+++ .-+..++ .+.|-..|+ ++.+. ....+.|.-+. +... -+..|++++ T Consensus 311 ~-sn~eq~~~~f~~~tl~P~~~~i----e~~ln~~L~-----~~~~~----~~~~i~fd~~~----l~~~-D~~~r~~~~ 371 (457) T protein:vir:13 311 G-SGLAEQNIAFTMFSLRPWLERI----EAGFNRLLF-----AETAD----RFRFVKFNLDE----IKRG-APKERMELW 371 (457) T ss_pred c-chHHHHHHHHHHHHHHHHHHHH----HHHHHHhhc-----Ccccc----CceeEEeechh----hhcc-CHHHHHHHH Confidence 1 22334444576653 3333333 333333333 33332 22334454332 2111 123556665 Q ss_pred HHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCcccccc--cccc---CCCCcCCCCC-ccc Q lcl|NC_015287. 469 AAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGV--HPEQ---MPGGAMNPQD-PNM 542 (563) Q Consensus 469 ~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~--~~~~---~~~~~~~p~~-~~~ 542 (563) ..+-.- -++|.+-+|.. +.+.+ ++.+..+.-.-+.. ..+. ...+...|.. +.. T Consensus 372 ~~~~~~--G~~T~NE~R~~-~gl~P------------------i~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~ 430 (457) T protein:vir:13 372 SLGLQN--GIYSIDEVRAA-EDMTP------------------LPDGLGEKYRVPLNLGEVGEEPEPEPAPAPPAIEPPA 430 (457) T ss_pred HHHHhC--CCcCHHHHHHH-hCCCC------------------CCCCcccceeeccccccccccccccccCCCCCCCCCc Confidence 544322 24555544432 23333 22221110000000 0000 0000000000 000 Q ss_pred ccccCCCCcccCCCCCCCCCC Q lcl|NC_015287. 543 GGAPQEGGAEQAPQMPEGGEI 563 (563) Q Consensus 543 ~~~~~~~~~~~~~~~~~~~~~ 563 (563) ..+.+++-.+.++.--+++|= T Consensus 431 ~~~~~~~~~~g~~d~~~~~~~ 451 (457) T protein:vir:13 431 EEPDEEPEPEGKPDDEGATEE 451 (457) T ss_pred cccCCCCCCCCCCccccCCCC Confidence 000000100111111111111 No 83 >protein:vir:101418 Length: 569 # NCBI annotation: Prt # Family: family:all:9458 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006480;genbank:gi:46401636;genbank:GeneID:2777482 Probab=97.99 E-value=7.8e-06 Score=48.58 Aligned_cols=496 Identities=15% Similarity=0.133 Sum_probs=254.9 Q ss_pred CcccccchhhHhhhccccc--------c-ccc-CcCCCC-------cc-cceeccccccc-c-ccccchhhhHHHHHHHH Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNV--------V-SPV-PQSNED-------SS-DYYVSSGFYGQ-Y-VDIDGVFKSEFELIKRY 60 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~--------~-s~~-p~~~~d-------~~-~~~~~~~~~~~-~-~~~~~~~~~~~~LI~~Y 60 (563) ++-.|+..-.+..-...+. + |-. .|.-.+ |. |+...+.+-+. + .+--.-.+++.|++-.| T Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~p~~~~~~~~~~~~~~t~~~D~~~~g~~~~~~~~~~pr~R~qiY~~~ 94 (569) T protein:vir:10 15 LAGVFKDNGERDNILLSALAVHGGSGYLFSRAGAPVQLSGFLGGKPGDSGMAGDGLVDGSRFIFDEVQLPEDRLQRYPLL 94 (569) T ss_pred HhhhhhcCCccchhhhhhheeecCcceEEeecCcchhhhhhhccCccccchhhhhHHHHHHHHhhhccCchhHHHHHHHH Confidence 3333333111111000000 0 000 000000 00 00000111111 1 11112348999999999 Q ss_pred HHHhhccchhHHHHHhhcceEEecC-CCcEEEEEe-ecc-ccchHHHHHHHHHHHH-HHHHhcchhhhHHHHHHHhhhhh Q lcl|NC_015287. 61 REMALHPEVDSAIEDIINEAIVSDQ-NDSPVEIDL-ENL-PASKQLKDLIREEFKT-VKEILNFDDKCHEILRNWYVDGR 136 (563) Q Consensus 61 R~ma~~PEvd~Ai~eIvneaiv~d~-~~~pV~l~l-~~~-~~s~sik~~I~eeF~~-i~~ll~F~~~~~~~~R~WYvDGr 136 (563) -+|+.+|-|..|+..=|.-|.-.++ ..+.|.|.= .+. ......+++|..|... +..+ +++.++.+-|.--+-|. T Consensus 95 eeM~~~p~Ia~AlniHVtaALggde~TGd~vfI~p~~~~~~a~~daakai~~el~~dl~~~--iNr~~~~lA~~~~aFGd 172 (569) T protein:vir:10 95 EEMAVYSTIATALNIHITHALSFDKKTGQTFSIVPVHNGNDSDYDAAQALCGELMNDIGRT--INKEVAGWAFIMSVFGV 172 (569) T ss_pred HHHhcCchhhhhhhhhhheeecccccccceEEEEeecCCCCCcchHHHHHHHHHHHHHHHH--HHHHhhHHHHHHHhhhh Confidence 9999999999999999999988887 677777754 222 2444555588888777 4444 68899999888888887 Q ss_pred hheeeeecCCCCCcceeEEE---EeCCccceeeeeeeccCC-----ccccceeeecccccccccccccceeeecCccccc Q lcl|NC_015287. 137 IYYHKVIDIKAPEEGIKEVR---YIDPLKIKLVRKLKTDNS-----LRGAINQVNSKNPADIENPEIEEYYQYDPSQTQG 208 (563) Q Consensus 137 i~fhkviD~~~~k~GI~eLr---~lDP~~i~~vR~~~~~~~-----~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~ 208 (563) -|=.+ + -+...||..|. +-=|.-|+....--+... ..+--..+.-.++-+..- ...|.+... T Consensus 173 sYaRi-Y--~~~~~GV~dl~~s~yt~PsfIqpFE~g~~tvGF~~~~~~~~~~ti~~l~p~qm~r-------mKmPrm~~i 242 (569) T protein:vir:10 173 AYVRP-Y--AKEGIGITSFECSYYTLPSFIKEFEVSGNLAGFSGDYLKDASGKMVFADPWAIIP-------MKIPYWRPK 242 (569) T ss_pred hheee-e--ccCCceeEEEEecccccccccchhhhcCceEEeecccCCccccceeeechhhhhh-------hcccceeec Confidence 66442 2 45578898774 333433333311100000 000000000000000000 011111111 Q ss_pred cccccccCcCccccCcceee---ecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEE Q lcl|NC_015287. 209 KNALGAIGQTPFSTKQRPVK---IAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFY 285 (563) Q Consensus 209 ~~~~g~~g~~~~~~~~~~ik---i~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFy 285 (563) .+.+-+..+.. .-.+. .++.+|-=+|.| -|||+.|-+||-+|..-=-+|--=|+--|.--|+.= T Consensus 243 ~q~~~v~~g~~----~~~L~~d~~~~~Pi~psn~G---------gSFL~~ae~pf~~l~~Al~sL~~qri~dSv~~~~It 309 (569) T protein:vir:10 243 SNLMPVHTGHK----AYSLLDNPEERTPIETQNYG---------TSLLEYAYEPYMNLRSAIRSLKATRFNASKIDRIIG 309 (569) T ss_pred cccchhhhhhh----heeecccccccccccchhhh---------hHHHHHHHhHHHHHHHHHHhccchhhHHHHHhHHhh Confidence 11111100000 00011 111222222222 399999999999999988888888999999999999 Q ss_pred ccCCCCchHHHHHHHHHHHHhhcce---eEeeCCCCcccccccccchhhhhcccccCCCCc-cceeecCCCCCcchHHHH Q lcl|NC_015287. 286 IDVGNLPKMKAEQYLKEVMNRYRNK---LVYDASTGEIRDDRKHMSMLEDFWLPRREGGRG-TEITTLPGGQNLGELSDI 361 (563) Q Consensus 286 IDvGnlpk~KAeqYl~~im~k~knk---lvYd~~TGev~d~~~~msMlEDywLpRReGgrg-TEIsTLpGg~nLgei~DV 361 (563) +-.-.|||+++.+|+|.+-.-.|+. +.--+..|+-- -=.----||-=+.|+| --|+|=.+-.++--|+|| T Consensus 310 lnm~gM~p~qr~~y~r~lt~~LKr~~d~ie~a~~gg~~~------~~~~~H~LPv~gekq~~~tvDt~~~~A~~~gIEdv 383 (569) T protein:vir:10 310 LAMNSLDPVKAADYSRTITQTLKRAADLMERRARGANNM------PTVTNTLLPIMGDGKGQMTIDTQTIQADINGIEDI 383 (569) T ss_pred ccccCCCHHHHhHHHHHHHHHHHHHHHHHHHHhccCccc------cccceeeeeeecCccccccccccccccCcccHHHH Confidence 9999999999999999876655543 22122223211 1111122666666663 366666666677799999 Q ss_pred HHHHHHHHHhcCCChhhcc-CCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCCChhHHHHh Q lcl|NC_015287. 362 EYFQKKLYRSLGVPESRIA-GSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILK--NIVTPEDWEVL 438 (563) Q Consensus 362 ~YF~~kLy~aL~VP~sRl~-~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLk--gI~t~eew~~~ 438 (563) ...-|.|--||++-+|-|. +|.=+--+|-+.-+.. -+.=..=-+-||.-.++.|..++-..+-.| ++-+++|+ T Consensus 384 M~~~R~LagaLGlD~SMlGwAD~LsGGLGeGG~frt-SaQaa~RS~~iRqa~~e~in~iidiH~~fKYgevf~~~dr--- 459 (569) T protein:vir:10 384 LTYMRQLAAALGLDYTLLGWADQMSGGLGEGGFLRT-AIQAAMRASWIQQGVEEFIQRAIDIHLAFKYGKVYPEGDR--- 459 (569) T ss_pred HHHHHHHHhhhccchhHhhHHHHhcccccccHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCcccCCCCc--- Confidence 9999999999999999984 1100000222221110 011111124578888888888888877665 55555554 Q ss_pred hhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhch------h-h---hhHHHHHHHhccChhhHHHHHHHHHHHhc Q lcl|NC_015287. 439 GDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGK------Y-F---SLDYVRRKILKQKDEEVIEIDKQMEKEIK 508 (563) Q Consensus 439 ~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGK------y-~---S~~~i~k~Il~~tdeei~e~~~qie~E~~ 508 (563) .-+++|..++-=.+-.+.+-..+|++.++-+.-..-- + | ...|+.+.++.+.. +..++ +-.|. T Consensus 460 --P~~V~F~s~~tAl~~E~~~n~~~raN~a~i~~Q~la~l~e~n~Lg~de~~m~y~l~d~~~~De---~~~e~-l~ae~- 532 (569) T protein:vir:10 460 --PYKIEFHSVNTALQQEHNDNRDSQANYATIVTQILDAVSNNSVLANSDAFKRYLFSDVLEIDE---KISEA-LVNEL- 532 (569) T ss_pred --ceEEEeccchHHHHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccHHHHHHHHHHHhhcch---hHHHH-HHhhc- Confidence 4789999999888888888888998876544211110 0 1 11344444444322 11111 22232 Q ss_pred CCCCCCCcccCccccccccccC-------CCCcCCCCCcc Q lcl|NC_015287. 509 DGKLADPMEVQQLQLGVHPEQM-------PGGAMNPQDPN 541 (563) Q Consensus 509 ~~~~~~p~~~~~~~~~~~~~~~-------~~~~~~p~~~~ 541 (563) -+.|.+++-.-.+...-++ ++--+..++-+ T Consensus 533 ---~akp~DEe~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 569 (569) T protein:vir:10 533 ---KAKSEDDDHLMDSIIKTPPQELAQILESVFKEGNDND 569 (569) T ss_pred ---CCCcchhHHHHHHHhcCChHHHHHHHHHHhhccCCCC Confidence 2223332211111110000 00000011111 No 84 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=97.98 E-value=8.4e-06 Score=48.40 Aligned_cols=461 Identities=14% Similarity=0.145 Sum_probs=182.1 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHH------- Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAI------- 73 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai------- 73 (563) ||-+|-.+- .+..+....+...+.....-....+. .| .. . .-+++|+.+..+++-+..| T Consensus 1 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~i~-~~----i~-------~-~~~~~~~~~~~YY~g~~~i~~~~~~~ 66 (503) T protein:vir:59 1 MADIYPLGK-THTEELNEIIVESAKEIAEPDTTMIQ-KL----ID-------E-HNPEPLLKGVRYYMCENDIEKKRRTY 66 (503) T ss_pred CcccccCCh-hhHHhHHHhhhhhhhhccchhHHHHH-HH----HH-------h-hcHHHHHHHHHHhccccchhhccchh Confidence 777665421 12222222221111111110000000 00 00 0 0112233333333222211 Q ss_pred -----------------------HHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHH Q lcl|NC_015287. 74 -----------------------EDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRN 130 (563) Q Consensus 74 -----------------------~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~ 130 (563) ..||+- .+.=--.+||.+..++ +...+.+ +.+.+ .+|+....++.+. T Consensus 67 ~~~~~~~~~~~~~~~~ri~~n~~~~ivd~-~~~yl~g~~~~~~~~d----~~~~~~l----~~~~~-n~~~~~~~~~~~~ 136 (503) T protein:vir:59 67 YDAAGQQLVDDTKTNNRTSHAWHKLFVDQ-KTQYLVGEPVTFTSDN----KTLLEYV----NELAD-DDFDDILNETVKN 136 (503) T ss_pred cccccccccccccccceeecchHHHHHHH-HHhhhhcCCeeeccCc----HHHHHHH----HHHHh-cCHHHHHHHHHHH Confidence 112211 1111134566666554 2222233 33333 4799999999999 Q ss_pred HhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccc Q lcl|NC_015287. 131 WYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKN 210 (563) Q Consensus 131 WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~ 210 (563) .++.|+-|.+.-+|. +|-..+..+||+.+..++.-.........|...+..... ...+...-+|.+......- T Consensus 137 ~~~~G~~~~~v~~d~----dg~~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~---~~~~~~~evy~~~~i~~~~ 209 (503) T protein:vir:59 137 MSNKGIEYWHPFVDE----EGEFDYVIFPAEEMIVVYKDNTRRDILFALRYYSYKGIM---GEETQKAELYTDTHVYYYE 209 (503) T ss_pred HhhCCeEEEEEeecC----CCceEEEEEccceeEEEEeCCCCCceEEEEEEEEEecCC---CceEEEEEEEeCCcEEEEE Confidence 999999999877763 477889999999988875432222211222221111000 0111111133332221111 Q ss_pred cccccC----cCccccCcceeeecccceeeeecceec-CCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEE Q lcl|NC_015287. 211 ALGAIG----QTPFSTKQRPVKIAPDAISFCHSGLVD-RNKQTILSYLHKSIKALNQLR-MIEDSLVIYRLSRAPERRIF 284 (563) Q Consensus 211 ~~g~~g----~~~~~~~~~~iki~~daI~y~hSGl~d-~~~~~~~syL~~Aik~~NqL~-m~EDalVIyRi~RAPeRRvF 284 (563) ..+..- ..........+.....+..|..--++- .++....|-|+.++.....+. ++-+.+..-+.++.|-+.+- T Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~~~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~ 289 (503) T protein:vir:59 210 KIDGVYQMDYSYGENNPRPHMTKGGQAIGWGRVPIIPFKNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLK 289 (503) T ss_pred EcCCcccccccccccccccceeecceeccCCccceEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEee Confidence 111000 000000000011111111111000111 123334566666666555554 33555556677777755443 Q ss_pred EccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchH-HHHHH Q lcl|NC_015287. 285 YIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGEL-SDIEY 363 (563) Q Consensus 285 yIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei-~DV~Y 363 (563) -.+.-+.+ +.... |..+ +++ .+| ++| .+..|-...+.+.. .-+.- T Consensus 290 g~~~~~~~-----~~~~~-~~~~--~~~---------------------~~~--~~~---~~~~l~~~~~~~~~~~~~~~ 335 (503) T protein:vir:59 290 NYDGENPK-----EFTAN-LRYH--SVI---------------------KVS--GDG---GVDTLRAEIPVDSAAKELER 335 (503) T ss_pred cCCccccc-----hhhhh-hhcc--cce---------------------ecc--CCC---cceeEeccCCHHHHHHHHHH Confidence 33222211 11111 1111 111 011 111 24444443343332 24466 Q ss_pred HHHHHHHhcCCChhhccCCCcceeccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhc Q lcl|NC_015287. 364 FQKKLYRSLGVPESRIAGSGDGFNLGRSSE--ILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDH 441 (563) Q Consensus 364 F~~kLy~aL~VP~sRl~~~~~g~~~g~~~e--ItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~ 441 (563) +++.+|+...+|---. + .| .|.+|. |.........-+.+.++.|...+.++++.=+-+-++....++... .. T Consensus 336 l~~~i~~~s~~p~~~~--~--~~-~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~-~~ 409 (503) T protein:vir:59 336 IQDELYKSAQAVDNSP--E--TI-GGGATGPALENLYALLDLKANMAERKIRAGLRLFFWFFAEYLRNTGKGDFNPD-KE 409 (503) T ss_pred HHHHHHHHhcccCCCc--c--cc-cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccc-cc Confidence 6777888888883221 1 11 122222 211112233446666677777776666653333344444444433 34 Q ss_pred eEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccCh--hhHHHHHHHHHHHhc-CCCCCCCccc Q lcl|NC_015287. 442 IQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKD--EEVIEIDKQMEKEIK-DGKLADPMEV 518 (563) Q Consensus 442 i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~td--eei~e~~~qie~E~~-~~~~~~p~~~ 518 (563) |.+.|...---.+. +.++++..+-.- | .+|.+++++.+-.-+| +|++.++++.+++.. ...+.++ T Consensus 410 i~i~f~~~~p~d~~-------~~~~~~~kl~~~-G-iiS~et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~--- 477 (503) T protein:vir:59 410 LTMTFTRTRIQNDS-------EIVQSLVQGVTG-G-IMSKETAVARNPFVQDPEEELARIEEEMNQYAEMQGNLLDD--- 477 (503) T ss_pred eeEEeCCCCCCCHH-------HHHHHHHHHHhC-C-CCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhccccCc--- Confidence 88888644333332 344444443221 3 5899999988644333 444444333222221 1111111 Q ss_pred CccccccccccCCCCcCCCCCcccccccCCCCcccCC Q lcl|NC_015287. 519 QQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAP 555 (563) Q Consensus 519 ~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~ 555 (563) .++...+.+.+|..+ ++. ..+.++|. T Consensus 478 -------~~~~~~~~~~~~~~~---~~~-~~~~g~~~ 503 (503) T protein:vir:59 478 -------EGGDDDLEEDDPNAG---AAE-SGGAGQVS 503 (503) T ss_pred -------cCCCCCCCcCCCCCC---ccc-CCCCCCcC Confidence 111111111112221 111 22222333 No 85 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=97.92 E-value=1.1e-05 Score=47.79 Aligned_cols=383 Identities=13% Similarity=0.141 Sum_probs=163.6 Q ss_pred CcccccchhhHhhhc-ccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcc Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQ-NKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINE 79 (563) Q Consensus 1 m~~~~~f~~~~~~~~-~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvne 79 (563) |-.+|+| +++.... ..+..++......+..-...-++..+.++ . -+..+++|-|..||+-|.+. T Consensus 2 ~m~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v-------------~-~~~al~~~~v~~~v~~ia~~ 66 (392) T protein:vir:74 2 ILPILNF-INQTNDPPEAGSVQSYFPDGNDAQIMESLLGDNNEWV-------------S-ARAALRNSDLFSIILQLSSD 66 (392) T ss_pred cchhhhh-hhcccCcccccccccccccCchhhhhhhccCCCCccc-------------c-hhhhhcchHHHHHHHHHHHh Confidence 3344443 2222111 11111111111111100000011111111 1 12346789999999999997 Q ss_pred eEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhh----hHHHHHHHhhhhhhheeeeecCCCCCcceeEE Q lcl|NC_015287. 80 AIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDK----CHEILRNWYVDGRIYYHKVIDIKAPEEGIKEV 155 (563) Q Consensus 80 aiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~----~~~~~R~WYvDGri~fhkviD~~~~k~GI~eL 155 (563) +.-. |+.+ .+ .....+.+ -=+-... -+.++..+++.|.-|..++.|.+ ..+.+| T Consensus 67 ia~l-----p~~~--~~-----~~~~~l~~-------~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~---G~~~~L 124 (392) T protein:vir:74 67 LAIV-----KINA--EK-----KKNQGIID-------NPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNAN---GADMKW 124 (392) T ss_pred hccC-----ceee--cc-----chhhhhhh-------hcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCC---CcEEEE Confidence 6432 2222 11 01111221 1122222 34456678888998888777643 348999 Q ss_pred EEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeeccccee Q lcl|NC_015287. 156 RYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAIS 235 (563) Q Consensus 156 r~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~ 235 (563) .+|+|.+|..++.-. ++ .-+|.|...+ +. ....+.++.+.|. T Consensus 125 ~~i~~~~v~v~~~~~------~~-----------------~~~y~~~~~~-------~~--------~~~~~~~~~~evi 166 (392) T protein:vir:74 125 EYLRPSQVNTYYFEY------EN-----------------GMYYNITFDD-------PK--------IEPILQAPQSDLI 166 (392) T ss_pred EEEcCceeEEEEcCC------Cc-----------------eEEEEEEecC-------Cc--------cceeEEEcCccEE Confidence 999999886553110 01 1122221100 00 0112456677775 Q ss_pred eeecceecCCCC-ccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEee Q lcl|NC_015287. 236 FCHSGLVDRNKQ-TILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYD 314 (563) Q Consensus 236 y~hSGl~d~~~~-~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd 314 (563) |.+ ..+.++. .+.|-|..|+..+.....+++...-+=---+--+-+..++-+..+..++.+.++ ..|... T Consensus 167 h~~--~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~~~~~~~---~~~~~~---- 237 (392) T protein:vir:74 167 HMK--LLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRS---RSFMKR---- 237 (392) T ss_pred Eec--CCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHH---HHHhcc---- Confidence 554 3445553 468999999999999988887766555555556677777766555544433332 233221 Q ss_pred CCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCChhhccCCCcceeccccch Q lcl|NC_015287. 315 ASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSE 393 (563) Q Consensus 315 ~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~e 393 (563) .+.| +.+ .|+ .|++++.|.-...-.| ++=.+|..+.+.++++||...|...+. ..+. T Consensus 238 ~n~g------~~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-----~~~~ 295 (392) T protein:vir:74 238 SRSG------GPV-VLD----------DLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGD-----QQSS 295 (392) T ss_pred ccCC------Cee-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-----cccH Confidence 1122 111 221 2566776643322223 455678889999999999999943211 1111 Q ss_pred hhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_015287. 394 ILRDEIKFTK-FVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVD 472 (563) Q Consensus 394 ItRDElKF~K-fI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~ 472 (563) +... ..|.. .+.-+.+++..-+. ..|. +.-+++. ...+..|. ..+...+..+- T Consensus 296 ~e~~-~~~~~~~l~p~~~~ie~~l~----~~l~-----~~~~~~~-----~~~~~~d~-----------~~~~~~~~~l~ 349 (392) T protein:vir:74 296 IQQI-SGMYASALNRYLRPAISELE----YKLS-----DHISVNM-----RPAIDPLG-----------DNYLSTISTAT 349 (392) T ss_pred HHHH-HHHHHHHHHHHHHHHHHHHH----Hhcc-----chhcccc-----hhhhcCCH-----------HHHHHHHHHHH Confidence 1111 22332 23333333333332 2221 1111110 00000000 01111111110 Q ss_pred hhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCC Q lcl|NC_015287. 473 PYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGG 550 (563) Q Consensus 473 p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~ 550 (563) . +.. +|..|.-+ .. .+.|.. |++.-..++- +...+| .+.+|+| T Consensus 350 ~--~g~------------~t~near~---~~---~~~g~~--pne~r~~enl--~~~~~G-----------d~~~p~p 392 (392) T protein:vir:74 350 R--WGA------------LAENQATF---VL---QEAGYI--PKDLPAPENT--NKKTTG-----------QSNEPVP 392 (392) T ss_pred h--CCC------------cCHHHHHH---HH---HhCCCC--ccccchhcCC--CCCCCC-----------CCCCCCC Confidence 0 112 22222221 11 123432 4433222211 001111 1111222 No 86 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=97.90 E-value=1.2e-05 Score=47.59 Aligned_cols=442 Identities=12% Similarity=0.093 Sum_probs=177.0 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchh---------- Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVD---------- 70 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd---------- 70 (563) |-..|...+++.-... -++..-+........ + . ....... .-+.+|+.+..+.+-. T Consensus 1 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~-i-----~---~~i~~~~---~~~~~~~~~~~Yy~g~~~i~~r~~~~ 66 (474) T protein:vir:95 1 MFNIIRMPWDKPYGEE--VVEQLKPQFETQEEM-I-----I---RLIDDHR---KQLDKITVGQRYYDKDNDIVKQMKKV 66 (474) T ss_pred CcceeecCCCCchhhH--HHHhhhhccCChHHH-H-----H---HHHHHHH---HHHHHHHHHHHHhcccCchhcccccc Confidence 6666665544321100 011110000000000 0 0 0000001 1112222222221111 Q ss_pred -----------------HHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhh Q lcl|NC_015287. 71 -----------------SAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYV 133 (563) Q Consensus 71 -----------------~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYv 133 (563) +-...||+- .+.=-...||.+..++ + ...+.++.+++ .+|+....++.+.+++ T Consensus 67 ~~~~~~~~~~~~~ki~~n~~~~Ivd~-~~~~l~g~p~~~~~~d----~----~~~~~l~~~~~-n~~~~~~~e~~~~~~~ 136 (474) T protein:vir:95 67 DVYGNIDYDKPDWRITTNFHQNLVDQ-KVSYVASKPVTYSCED----E----SVLKIIHDVLD-TRWDNKLIDILTATSN 136 (474) T ss_pred ccccccccccccceeccchHHHHHHH-HHhhhccCCceeccCc----h----HHHHHHHHHHh-ccHHHHHHHHHHHHhh Confidence 111122221 1111235677776654 2 23334444444 4799999999999999 Q ss_pred hhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccc--cccccccceeeecCcccccccc Q lcl|NC_015287. 134 DGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPAD--IENPEIEEYYQYDPSQTQGKNA 211 (563) Q Consensus 134 DGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~--~~~~~~~eyf~y~~~~~~~~~~ 211 (563) .|+-|.+.-+| ++|-..+..++|+++-.|..-.........+.......... .+.++...+|.+...+...... T Consensus 137 ~G~~~~~v~~d----~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~ 212 (474) T protein:vir:95 137 KGIDWLQVYIN----ENGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYY 212 (474) T ss_pred cCcEEEEEEec----CCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEEcCeeEEEEEeCCeEEEEEEcCCccccccc Confidence 99998886565 35678899999999887732211111111222211111110 0111111122322222111111 Q ss_pred ccccCcCccccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCC Q lcl|NC_015287. 212 LGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLR-MIEDSLVIYRLSRAPERRIFYIDVGN 290 (563) Q Consensus 212 ~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~-m~EDalVIyRi~RAPeRRvFyIDvGn 290 (563) ....+ .......--+..=+|++. .|+....|=++..+.....+. ++-+....-+.++.|-+-+.-.+... T Consensus 213 ~~~~~---~~~~~~~~~~g~iPvv~~------~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~ 283 (474) T protein:vir:95 213 YGANH---IQSHFSNGNWGRVPFIAF------KNNPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQD 283 (474) T ss_pred cCccc---ccccccccCCCccceEee------cCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccc Confidence 11000 000000000001122221 122234566666666666554 45555556677777765544333221 Q ss_pred CchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchH-HHHHHHHHHHH Q lcl|NC_015287. 291 LPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGEL-SDIEYFQKKLY 369 (563) Q Consensus 291 lpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei-~DV~YF~~kLy 369 (563) ... ...-+.. .+. +++ +++ .+++.|-...+++.. .-+.-+.+.+| T Consensus 284 ~~~------~~~~~~~--~~~---------------------i~~---~~~--~~~~~l~~~~~~~~~~~~~~~l~~~i~ 329 (474) T protein:vir:95 284 LEE------FMRGLKY--YKA---------------------INV---DGD--GGVETIQVEVPVSSTKEYIDLMRAYIM 329 (474) T ss_pred chh------hhhhhhc--cce---------------------eec---cCC--CceeEEeecCCHHHHHHHHHHHHHHHH Confidence 110 0000100 001 111 122 224444443454433 23466778888 Q ss_pred HhcCCChhhccCCCcceeccccc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeec Q lcl|NC_015287. 370 RSLGVPESRIAGSGDGFNLGRSS-EILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVY 448 (563) Q Consensus 370 ~aL~VP~sRl~~~~~g~~~g~~~-eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~ 448 (563) ....+|- +..+ +|.-+.|+ .|..-+..-..-+.+.+..|..-+.++++.=+-+-|+ ..+| ..|.+.|.. T Consensus 330 ~~s~~p~--~~~~--~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~--~~d~----~~i~v~f~~ 399 (474) T protein:vir:95 330 EFGQGVD--FQTD--KFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNL--KMDV----KDIEISFNF 399 (474) T ss_pred HHhCCcc--cccc--cccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Cccc----ceeeEEecc Confidence 8899983 3222 22211121 2222222233346677777777777777654444443 2334 456777754 Q ss_pred cchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccc Q lcl|NC_015287. 449 DNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPE 528 (563) Q Consensus 449 Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~ 528 (563) ..--.+. +.++++.++ | .+|.+++++.+-..+| .+++-++|++|..+.....+.... . .. T Consensus 400 ~~p~d~~-------e~a~~~~~~----g-~iS~et~i~~l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~---~---~~ 459 (474) T protein:vir:95 400 NRMMNDA-------EQSQIIAQS----Q-YLSRETLVKSSPLVDD--YKAELERIEQEQMEYNKQLPNLDD---G---GA 459 (474) T ss_pred CCCcCHH-------HHHHHHHhc----C-CCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhccccccc---c---cC Confidence 3332222 233344442 4 5899999987644443 333444555554321111110000 0 00 Q ss_pred cCCCCcCCCCCcccccccCCCCcccCCC Q lcl|NC_015287. 529 QMPGGAMNPQDPNMGGAPQEGGAEQAPQ 556 (563) Q Consensus 529 ~~~~~~~~p~~~~~~~~~~~~~~~~~~~ 556 (563) ......+.+ +++.|. T Consensus 460 d~~~~~~~~-------------~~~~~~ 474 (474) T protein:vir:95 460 DGAQQQERS-------------NDKESE 474 (474) T ss_pred CCCcCCCCC-------------ccCCCC Confidence 000011111 111111 No 87 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=97.89 E-value=1.2e-05 Score=47.50 Aligned_cols=446 Identities=17% Similarity=0.206 Sum_probs=183.2 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCc-ccce------eccccccccccccchhhhHHHHHHHHHHHhhccchhHHH Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDS-SDYY------VSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAI 73 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~-~~~~------~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai 73 (563) +++.|-.+..-+....++..+- -.|+ ++.. ..+||-| +-.--.|||+||.++++ T Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~l~~~~~~~F~G---------------y~~la~laQ~~eyr~~~ 129 (695) T protein:vir:78 69 LARQFEVDVSNYTPRERRAASY----ALDFNGTSMDALSFVTSSGFPG---------------FPTLVLLAQLPEYRAMH 129 (695) T ss_pred cceeceeccccCCccccchhhh----hhcccccccccchhhhccCcch---------------HHHHHHHhhccchhhHH Confidence 4555543333222222221111 0011 0000 1111111 12234689999999999 Q ss_pred HHhhcceE------EecCCCcE----EEEEeeccccch-HHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeee Q lcl|NC_015287. 74 EDIINEAI------VSDQNDSP----VEIDLENLPASK-QLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKV 142 (563) Q Consensus 74 ~eIvneai------v~d~~~~p----V~l~l~~~~~s~-sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkv 142 (563) +-|..||+ ++...++. +.+.-+....++ .-.++|..|++. |+...+.-+.++-=-+.|.=.-... T Consensus 130 ~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~er----L~V~~~l~eaik~aRlfGGa~~~i~ 205 (695) T protein:vir:78 130 EVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIER----LRIRDAVRTTVIHDQAFGRAHPYFK 205 (695) T ss_pred HHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHH----HHHHHHHHHHHHhhccccceEEEEE Confidence 99999993 33322221 333333333444 344567777664 3334444444443334444211123 Q ss_pred ecCC--------------CCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccc Q lcl|NC_015287. 143 IDIK--------------APEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQG 208 (563) Q Consensus 143 iD~~--------------~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~ 208 (563) |+-. -.|.+++.|+.|||..+-+= . .+. .++..-.|-.-..|++.. T Consensus 206 i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~-------~-~n~------~dP~spdfgkP~~y~V~G------ 265 (695) T protein:vir:78 206 IKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPN-------N-YNS------INPVADDFYKPSTWWMIG------ 265 (695) T ss_pred eccCccccccccccccccccCcceeeeEeecccccccc-------h-hhh------ccchhhccCCCceEEEec------ Confidence 3222 23457788999997543331 1 110 011111222222233311 Q ss_pred cccccccCcCccccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHH-HHHHHHHHHHHHHhcCccceEEEcc Q lcl|NC_015287. 209 KNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQ-LRMIEDSLVIYRLSRAPERRIFYID 287 (563) Q Consensus 209 ~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~Nq-L~m~EDalVIyRi~RAPeRRvFyID 287 (563) . ..+.|+.+.+...++.+. |-......+.|.+..+..-..+ +++...+.=+- +.+.-+ ++-.| T Consensus 266 --------~--kIH~SRL~~f~g~plPd~---LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li--~~~~v~-~lk~d 329 (695) T protein:vir:78 266 --------T--EVHATRLHTIVSRPVGDM---LKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIV--KQFSVS-GILMD 329 (695) T ss_pred --------e--EEeeeeEEEecCCCchhh---hhcccccCcccHHHHHHHHHHHHHHHHhHHHHHH--HhhhhH-HHHHH Confidence 0 133344444443333222 1112223345666665544333 44433332221 111111 11112 Q ss_pred CC-CCchHHHHHHH--HHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHH-H Q lcl|NC_015287. 288 VG-NLPKMKAEQYL--KEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIE-Y 363 (563) Q Consensus 288 vG-nlpk~KAeqYl--~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~-Y 363 (563) .. -|.....++.. -+++++||.- -|-+-=|+ =.|||- +. ..+|+-++||. = T Consensus 330 la~~L~~g~~~~l~~R~eli~~~Rsn------~G~~llDk----~~Eefe-------------q~--stslSGLddVi~q 384 (695) T protein:vir:78 330 LAQALMPGANVDLSMRAELINRYRDN------RNILFLDK----ATEEFF-------------QF--NTPLSGLDALQAQ 384 (695) T ss_pred HHHhhcChhHHHHHHHHHHHHHhcCc------cceEEEec----CCcceE-------------EE--ecccCCHHHHHHH Confidence 11 01111122222 2455666521 12110000 013442 11 24788899975 4 Q ss_pred HHHHHHHhcCCChhhccC-CCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHhcCCCChhHHHH Q lcl|NC_015287. 364 FQKKLYRSLGVPESRIAG-SGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKT-----QLILKNIVTPEDWEV 437 (563) Q Consensus 364 F~~kLy~aL~VP~sRl~~-~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~-----qLiLkgI~t~eew~~ 437 (563) |..-+=-+.+||+.||=+ ...|||--.-+ |.-.|...|..+|. ..+...|++ |+-+-|.+. T Consensus 385 f~q~VAgaa~IPltkLfGqSPkGlNATGE~----D~rnYYD~I~s~Qe---~~L~p~L~rl~~ii~rS~~G~id------ 451 (695) T protein:vir:78 385 AQEQMSAVSHIPLIKLLGITPTGLNASSEG----EIRVWYDYVRAYQR---NALQQLMNDVIVMIQLSLFGAVD------ 451 (695) T ss_pred HHHHHHhhhcCchhhhhccCCccccccchh----hHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHhcCCCC------ Confidence 888888899999999943 34477653222 23458889988875 334444433 444445443 Q ss_pred hhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCC---- Q lcl|NC_015287. 438 LGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLA---- 513 (563) Q Consensus 438 ~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~---- 513 (563) ..|.|+|+.=..-+|..-+||...+.+....+-.- | -++.+-|+ ..+..+. .+.|- T Consensus 452 --pdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~-g-vI~~~evr---------------~rL~~d~-~s~Y~~~~D 511 (695) T protein:vir:78 452 --PSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQE-Q-VIRPDQVA---------------ARLNTEP-DGPYAGKLD 511 (695) T ss_pred --CcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHh-c-CCCHHHHH---------------HHHhcCC-Ccccccccc Confidence 45889999777778888888888887765433211 1 11222222 2222211 11111 Q ss_pred ------CCccc-------------CccccccccccCCCCcCCCCCcccccccCCC-CcccCCCCCC-------CCCC Q lcl|NC_015287. 514 ------DPMEV-------------QQLQLGVHPEQMPGGAMNPQDPNMGGAPQEG-GAEQAPQMPE-------GGEI 563 (563) Q Consensus 514 ------~p~~~-------------~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~-~~~~~~~~~~-------~~~~ 563 (563) .|.+. +.++.++..+...|.--.|.-.+...-+-+. ++.+.+.+.- .|.| T Consensus 512 ~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ag~~~~~~~aag~v~~~~g~v 588 (695) T protein:vir:78 512 ANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARAGATAPPTVANVNANVKPREAGAQDAAMRAAGAVYVVDGKV 588 (695) T ss_pred cccCCCcCccchhhhhHhhhcCcccccccCCCCCCCCCCCCCCceeeeeccccccccCCCCcccceeEEEEEeCCEE Confidence 00000 0011111111111111111111111111110 1112222221 2222 No 88 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=97.87 E-value=1.3e-05 Score=47.30 Aligned_cols=444 Identities=13% Similarity=0.075 Sum_probs=178.4 Q ss_pred ccchhhHhhhcccccccccCcCCCCcccceecc------cccccc-ccccchhh---------hHHHHHHHHHHHhhccc Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSS------GFYGQY-VDIDGVFK---------SEFELIKRYREMALHPE 68 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~------~~~~~~-~~~~~~~~---------~~~~LI~~YR~ma~~PE 68 (563) +. -.+.||.|.+.---|+.++.+- -|...+ .+.+.... .-...+.+|+.+..+.+ T Consensus 1 ~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~ 71 (492) T protein:vir:97 1 MQ---------FIQLISQVAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYE 71 (492) T ss_pred Ch---------HHHHHHHHHHHHhcCCceeeccchhhhhHhhhcccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 22 2233444433222233222211 111111 11111111 11223445555555544 Q ss_pred hhHHH---------------------------HHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcch Q lcl|NC_015287. 69 VDSAI---------------------------EDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFD 121 (563) Q Consensus 69 vd~Ai---------------------------~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~ 121 (563) -...| ..||+- .+.=--..||.+..+|. ...+..+.+++ .+|+ T Consensus 72 g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~-~~~yl~g~p~~~~~~d~--------~~~~~l~~~~~-n~~~ 141 (492) T protein:vir:97 72 QRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQ-KVSYIVGKPIAFKHTDD--------EVVKRIDEVLG-NRFD 141 (492) T ss_pred ccCccccccccccccccccccccccccccchHHHHHHH-HhhhhcccCceeccCch--------HHHHHHHHHHh-ccHH Confidence 43211 111111 11111344555555442 23444555554 5789 Q ss_pred hhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeeccccc--cccccccccee Q lcl|NC_015287. 122 DKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPA--DIENPEIEEYY 199 (563) Q Consensus 122 ~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~--~~~~~~~~eyf 199 (563) ....++.+.+++.|+-|.+.-+| ++|-..++.++|+.+..|..-.........+......... .++.+....+| T Consensus 142 ~~~~~~~~~~~~~G~a~~~v~~d----~dg~~~~~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~y~~~~v~~~ 217 (492) T protein:vir:97 142 DKLHSVLTGASNKGIEWLHPYLD----EEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYY 217 (492) T ss_pred HHHHHHHHHHhhcCeEEEEEEec----CCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeeccceeEEEEecCeEEEE Confidence 99999999999999999887665 3577889999999988884322211112222221111100 01111112233 Q ss_pred eecCccccccccccccCcCccccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcC Q lcl|NC_015287. 200 QYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLR-MIEDSLVIYRLSRA 278 (563) Q Consensus 200 ~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~-m~EDalVIyRi~RA 278 (563) .+................ . .....--+..=+|++.. |+....|=++..+....-+. ++=+....-+-++- T Consensus 218 ~~~~~~~~~~~~~~~~~~--~-~~~~~~~~g~vPvv~~~------nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~ 288 (492) T protein:vir:97 218 VYENGSLIPDYSNNLENS--K-THFSTGSWGKIPFIPFK------NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNE 288 (492) T ss_pred EEecCeeeeccccccccc--c-cccccCCCCCcceEEec------CCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcc Confidence 332211110000000000 0 00000001111233221 11223444544443333332 23344444455555 Q ss_pred ccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch- Q lcl|NC_015287. 279 PERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE- 357 (563) Q Consensus 279 PeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge- 357 (563) |-+-+.-.+.-+.+ +... .+..++...-++ +..+.+|-...+... T Consensus 289 ~~l~~~g~~~~~~~-----~~~~---------------------------~~~~~~~~~~~~--~~~~~~l~~~~~~~~~ 334 (492) T protein:vir:97 289 LTYVLKNYDDQELP-----EFKR---------------------------LLRYYGAIKVSD--NGGVDTIQVEVPVENS 334 (492) T ss_pred ceeeeecCCcccch-----hHHH---------------------------HHhhccceecCC--CCcceeEeccCCHHHH Confidence 64444322211111 1111 111111111111 223555544333332 Q ss_pred HHHHHHHHHHHHHhcCCChhhccCCCcceeccccch-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHH Q lcl|NC_015287. 358 LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSE-ILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWE 436 (563) Q Consensus 358 i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~e-ItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~ 436 (563) ..-+.-+++.+|+-..+|- +..+ .|.-..|++ |.--+.....-+.+.++.|..-+.++++.=+-+-|+ ..+|. T Consensus 335 ~~~~~~L~~~I~~~s~~p~--~~~~--~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~--~~~~~ 408 (492) T protein:vir:97 335 KKYLDELYQKIMLFGQAVD--FSSD--KFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI--KGEHK 408 (492) T ss_pred HHHHHHHHHHHHHHhCCCC--CCcc--ccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--Ccccc Confidence 2334556677788888884 2222 222112222 222223344556777777888777777753333343 33555 Q ss_pred HhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCc Q lcl|NC_015287. 437 VLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPM 516 (563) Q Consensus 437 ~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~ 516 (563) . |.+.|....--.+. +.++++..+. | .+|.+++++.+-..+| .+++-++|++|..+..-..++ T Consensus 409 ~----i~v~f~~~~p~~~~-------e~a~~~~kl~---G-~iS~et~l~~l~~v~d--~~~Eleri~~E~~~~~~~~~~ 471 (492) T protein:vir:97 409 D----VDISFNYNKVANTE-------LQVQTAQQSM---G-IVSHETVLENHPFVED--LQAELERIEQEQTEYNKQLPN 471 (492) T ss_pred e----eeEEecCCCCCCHH-------HHHHHHHHHh---c-cCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHhhhc Confidence 4 56777543322222 3345555553 4 4899999998765554 333444455443321111111 Q ss_pred ccCccccccccccCCCCcCCCCCcccccccCCCCccc Q lcl|NC_015287. 517 EVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQ 553 (563) Q Consensus 517 ~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~ 553 (563) .. ..+ ...+...+.|++ ..+| T Consensus 472 ~~---~~~---~~~~~~~~~~~~----------~~~e 492 (492) T protein:vir:97 472 LD---DGG---ADSAQQQERSNN----------KESE 492 (492) T ss_pred cc---cCC---CCCCcccccccc----------cccC Confidence 00 000 000000111111 1111 No 89 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=97.87 E-value=1.4e-05 Score=47.28 Aligned_cols=464 Identities=10% Similarity=0.033 Sum_probs=191.0 Q ss_pred CcccccchhhHhh--hccc----ccc-cccCcCCCCcccceeccccccccccc-cchhhhHHHHHHHHHHHhhccchhHH Q lcl|NC_015287. 1 MAKLFGFKIQKDD--EQNK----NVV-SPVPQSNEDSSDYYVSSGFYGQYVDI-DGVFKSEFELIKRYREMALHPEVDSA 72 (563) Q Consensus 1 m~~~~~f~~~~~~--~~~~----~~~-s~~p~~~~d~~~~~~~~~~~~~~~~~-~~~~~~~~~LI~~YR~ma~~PEvd~A 72 (563) |-+|=.-+++..+ +... ..+ ...|- . +--.-++.|.. ++ .-...+.....+..+.++.+.=+.-+ T Consensus 1 ~~~~p~~~l~~~~~~~~~~~~l~~~~~~~~~r-~-~~~~~YY~g~~-----~i~~~~~~~~~~~~~~~~~~~~~n~~~~i 73 (479) T protein:vir:99 1 MIDLPDEDLSSEGLAKYLETKVFPKMNTECER-L-DDFEAWTKNGQ-----EVPDLATRHKNKEREVLQQLSRKPWMGLM 73 (479) T ss_pred CccCCcccCChhHHHHHHHHHHHHHHHHHhHH-H-HHHHHHHhcCC-----cccccccccCChhHHHHHHHhhcCcHHHH Confidence 1111111010000 0000 000 00000 0 00000011110 00 00011222233333333333334445 Q ss_pred HHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecC--CCCCc Q lcl|NC_015287. 73 IEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDI--KAPEE 150 (563) Q Consensus 73 i~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~--~~~k~ 150 (563) |+..++=. .+..+...+. ...+++..|++..+|+....++++.-++.|+-|.. |.-. ....+ T Consensus 74 Vd~~~~~l-------~~~gf~~~d~--------~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~af~~-v~~~~~~~d~~ 137 (479) T protein:vir:99 74 VNSFAQQL-------IVDGYRKTGT--------NENAKGWDTWRLNQMDKQQFWLNRAVLTFGYAFIK-VTSGISPLDGT 137 (479) T ss_pred HHHHHhhc-------ccccccCCCc--------hhhHHHHHHHHhcChhHHHHHHHHHHhhcCceEEE-EecCCCCcCCC Confidence 55444411 1111222221 12345667888888999999999999999996653 3210 12357 Q ss_pred ceeEEEEeCCccceeeeeeeccCCccc-cceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeee Q lcl|NC_015287. 151 GIKEVRYIDPLKIKLVRKLKTDNSLRG-AINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKI 229 (563) Q Consensus 151 GI~eLr~lDP~~i~~vR~~~~~~~~~~-~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki 229 (563) |...++.+||+.+-.|. .+.+... .+..++... .....||.+..........++.--.....+.- -++ T Consensus 138 g~~~i~~~~p~~~~~iy---dd~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~--g~v 206 (479) T protein:vir:99 138 TVARIKCIDPRDAFAIW---EDPYWDEWPKYLLERQP------NGQYWWWTEEDYSIFEFKQGKFIYRETVSHDY--GHI 206 (479) T ss_pred CceEEEEechhheEEEe---cCCcccceeeEEEeecC------ceeEEEEecceEEEEEecCCceeeccccccCC--CCc Confidence 88899999999988772 2222222 222222211 11122222211100000000000000000000 011 Q ss_pred cccceeeeecceecCCCCccchhHHHHHHHHHHH-HHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhc Q lcl|NC_015287. 230 APDAISFCHSGLVDRNKQTILSYLHKSIKALNQL-RMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYR 308 (563) Q Consensus 230 ~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL-~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~k 308 (563) | .|.|+|..-.+. -+.|=++..+.....+ +.+-+..++-.....|.|-++-. .++.... .-...+.-+. T Consensus 207 P--vv~f~n~~~~~~---~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~---~~~~~~~--~~~~~~~~~~ 276 (479) T protein:vir:99 207 P--FVRYVNVMDLRG---VCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATGL---MLPEGAN--ADQEKMRFAQ 276 (479) T ss_pred c--eEEeecCCCcCc---CCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCC---Ccccccc--cchhcccccc Confidence 1 244555432221 1234344333333222 34556666667777777765522 1111000 0000000000 Q ss_pred ceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCChhhccCCCccee Q lcl|NC_015287. 309 NKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDIEYFQKKLYRSLGVPESRIAGSGDGFN 387 (563) Q Consensus 309 nklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~ 387 (563) ++ .|.. .+-+.++-++++.. +.. ++-++=.-..+...-++|..-|. + T Consensus 277 ~~----------------------i~~~---~~~~~~~~q~~~~~-~~~~~~~l~~~i~~i~~~t~~p~~~~g-----~- 324 (479) T protein:vir:99 277 ES----------------------MLIS---QNEKASFGAIPAAP-LDGLLNAYKESLLEFLALAQLPPHIAG-----Q- 324 (479) T ss_pred cc----------------------ceee---cCCCceEEEecccc-hHHHHHHHHHHHHHHhccCCCCHHHcc-----c- Confidence 11 1211 12245666777543 322 22233233344444567765552 2 Q ss_pred ccccc--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHH Q lcl|NC_015287. 388 LGRSS--EILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERL 465 (563) Q Consensus 388 ~g~~~--eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~ 465 (563) .|.+| .|.--+...-.-+.+.|+.|..-+.+.|+.-+.+.|..-..++ -.|.+.|.. ....- +.+.+ T Consensus 325 ~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~~~~~~~~~----~~i~~~w~~-~~~~s------~~~~a 393 (479) T protein:vir:99 325 IVNVAADALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKIEGRTEEATD----LDFTITWQD-VTIQS------LAQFA 393 (479) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccc----eeeeEEecC-CCCCC------HHHHH Confidence 12222 3333333356778899999999999999988888886433222 236777743 21111 12455 Q ss_pred HHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccc Q lcl|NC_015287. 466 GVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGA 545 (563) Q Consensus 466 ~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~ 545 (563) +.+.++..- | .+|.+++++.+...++.+++++.+..+++...+.+.+. ...+..+....+ .|.....+++ T Consensus 394 d~~~kl~~a-g-~is~et~l~~l~gv~~~~~e~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~---~~~~~~~~~~ 463 (479) T protein:vir:99 394 DAWAKMVES-L-KIPAEGVWDMIPNLDQSTVNGWKEIYDREGDFGKYMRK-----LQNGPDPAEQRG---GPNGATNMQQ 463 (479) T ss_pred HHHHHHHhc-C-CCCHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHH-----HhcccCcccccC---CCCCCCCCCC Confidence 555554332 3 49999999999889998888776655544222211110 000100000000 0111111222 Q ss_pred cCCCCcccCCCCCCCC Q lcl|NC_015287. 546 PQEGGAEQAPQMPEGG 561 (563) Q Consensus 546 ~~~~~~~~~~~~~~~~ 561 (563) +.+.+++.|---..|| T Consensus 464 ~~~~~~~~~~~~~~~~ 479 (479) T protein:vir:99 464 ANNKTGEPASLNKSGA 479 (479) T ss_pred CCCCCcchhccCCCCC Confidence 2222333333333444 No 90 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=97.87 E-value=1.4e-05 Score=47.23 Aligned_cols=458 Identities=11% Similarity=0.056 Sum_probs=174.0 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhh----------hH--HHHHHHHHHHhhccc Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFK----------SE--FELIKRYREMALHPE 68 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~----------~~--~~LI~~YR~ma~~PE 68 (563) |.+.=-|.-+-.-......+ -|+.. ..-|. +..++.... +. .....+|+.+..+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~--~~~~~---------~~~~~-~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~ 68 (512) T protein:vir:97 1 MLKANEFETDTDLRENRNYL--FNDEA---------NVVYT-YDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYE 68 (512) T ss_pred CccceeccCceeeeeCceee--ecccc---------ccccc-cCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhc Confidence 33322221100000000000 01110 11111 111111111 11 111223444444433 Q ss_pred hhHHH----------------------HHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHH Q lcl|NC_015287. 69 VDSAI----------------------EDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHE 126 (563) Q Consensus 69 vd~Ai----------------------~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~ 126 (563) =...| .-||+ ..+.=--..||.+..++ +...+.++.|++-.+|+....+ T Consensus 69 g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Ivd-~~~~yl~g~p~~~~~~d--------~~~~~~l~~~~~~n~~~~~~~~ 139 (512) T protein:vir:97 69 GKTKNLVELTRRKEEYMADNRVAHDYASYISD-FINGYFLGNPIQCQDDD--------KDVLEAIEAFNDLNDVESHNRS 139 (512) T ss_pred ccCccccccCcccccccCcceeecchHHHHHH-HHhhhhcccCceeccCC--------hHHHHHHHHHHhhcCHHHHHHH Confidence 22221 11111 01111123455554443 2345667778888899999999 Q ss_pred HHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccc Q lcl|NC_015287. 127 ILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQT 206 (563) Q Consensus 127 ~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~ 206 (563) +.+..++.|+-|.+.-+|. +|-..+..++|+.+..|..-.........+.+.......+.....+..+-+|.+... T Consensus 140 ~~~~~~i~G~ay~~vy~de----d~~~~i~~~~p~~~~~iyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i 215 (512) T protein:vir:97 140 LGLDLSIYGKAYELMIRNQ----DDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGV 215 (512) T ss_pred HHHHHHhcCeEEEEEEeCC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcE Confidence 9999999999999877763 567889999999988884222211111122211111001111111121224444332 Q ss_pred cccccccccCcCccccCcceeeeccc--ceeeeecceecCCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceE Q lcl|NC_015287. 207 QGKNALGAIGQTPFSTKQRPVKIAPD--AISFCHSGLVDRNKQTILSYLHKSIKALNQLRM-IEDSLVIYRLSRAPERRI 283 (563) Q Consensus 207 ~~~~~~g~~g~~~~~~~~~~iki~~d--aI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m-~EDalVIyRi~RAPeRRv 283 (563) ......+..+.....-.......+.. +|++. .|+....|-++.++.....|.. +=+....-+-++.|-+-+ T Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~------~nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~ 289 (512) T protein:vir:97 216 YRYLTSRTNGLKLTPRENGFESHSFERMPITEF------SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI 289 (512) T ss_pred EEEEecCCCcccccccccccccccCcccceEee------cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeee Confidence 22211111000000000000000011 22221 1223345666766665555443 222233334444444433 Q ss_pred EEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccc-cccccchhhhh----cccccCCCCccceeecCCCCCcch- Q lcl|NC_015287. 284 FYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRD-DRKHMSMLEDF----WLPRREGGRGTEITTLPGGQNLGE- 357 (563) Q Consensus 284 FyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d-~~~~msMlEDy----wLpRReGgrgTEIsTLpGg~nLge- 357 (563) .-....+ ..++.. ....+..+++. ..+.-+++.|..+..|-...+... T Consensus 290 ~G~~~~~--------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~ 343 (512) T protein:vir:97 290 KGNLNLD--------------------------PVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGT 343 (512) T ss_pred ecCccCC--------------------------chhhhhhhhcccccccccchhhcccccCCCCCcceEEEeecCCHHHH Confidence 2211111 000000 00111111111 111111222333444433333322 Q ss_pred HHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---cCC-CC Q lcl|NC_015287. 358 LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEIL--RDEIKFTKFVGRMRKRFSHLFNDMLKTQLIL---KNI-VT 431 (563) Q Consensus 358 i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eIt--RDElKF~KfI~rLr~rFs~if~d~Lk~qLiL---kgI-~t 431 (563) -.-+.-+.+.+|.-.++|---.+ .|. |.+|.+. .-......-+.+.++.|..-+.+.++.=+-+ ++. -. T Consensus 344 e~~~~~L~~~I~~~s~~p~~~~~----~~~-gn~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~ 418 (512) T protein:vir:97 344 EAYKDRLNSDIHMFTNTPNMKDD----NFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA 418 (512) T ss_pred HHHHHHHHHHHHHHhCCcccCcc----ccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccc Confidence 23355566777888888863331 222 2322222 1111123335555555665555555442222 222 22 Q ss_pred hhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccCh--hhHHHHHHHHHHHhcC Q lcl|NC_015287. 432 PEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKD--EEVIEIDKQMEKEIKD 509 (563) Q Consensus 432 ~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~td--eei~e~~~qie~E~~~ 509 (563) ..+|. .|++.|...---.+ .+.++++..+ +| .+|.+++++.+-..+| +|+++++++-+++.+. T Consensus 419 ~~d~~----~i~~~f~~~~p~~~-------~e~~~~~~kl---~g-iiS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~ 483 (512) T protein:vir:97 419 NKDFN----TVRYVYNRNLPKSL-------IEELKAYIDS---GG-KISQTTLMSLFSFFQDPELEVKKIEEDEKESIKK 483 (512) T ss_pred ccccc----cceEEeCCCCCcCH-------HHHHHHHHHH---hc-cCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Confidence 33343 47788864222222 2344555555 35 4899999988754443 4444444433332221 Q ss_pred ---CCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccC Q lcl|NC_015287. 510 ---GKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQA 554 (563) Q Consensus 510 ---~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~ 554 (563) ..+.+|+..++. +.+...++.+.++- T Consensus 484 ~~~~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~ 512 (512) T protein:vir:97 484 AQKGIYKDPRDINDD-------------------EQDDDTKDTVDKKE 512 (512) T ss_pred HhhcccCCCCCCCCC-------------------CCCCCccccccccC Confidence 111111111100 00111111111111 No 91 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=97.83 E-value=1.6e-05 Score=46.91 Aligned_cols=395 Identities=14% Similarity=0.152 Sum_probs=173.1 Q ss_pred ccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEec Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSD 84 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d 84 (563) ||| +++.. ..++..|.. +++-.....++-.+.++. .+...++|.|..||.-|.+.+.-++ T Consensus 1 M~~-f~~~~-~~~~~~~~~----~~~~~~~~~~~~~~~~v~--------------~~~al~~~~V~~~v~~ia~~ia~~p 60 (397) T protein:vir:38 1 MPL-LKLNK-SHSQGFSLN----DPDWVNFLTGGEAQKYVS--------------ADTALKNSDIFSLIMQLSGDLAMVR 60 (397) T ss_pred Ccc-hhhhh-cccCcccCC----chhhhhhhcCCcCCceec--------------hHHhhccHHHHHHHHHHHHHHhhCc Confidence 555 22211 111111211 111111112222222221 1233569999999999998775443 Q ss_pred CCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccce Q lcl|NC_015287. 85 QNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIK 164 (563) Q Consensus 85 ~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~ 164 (563) - ..++ ..-.++...-. ..+....=-+.+++.++++|.-|..++.|.. ..+++|.+|+|..|+ T Consensus 61 ~-------~~~~-----~~~~~l~~~PN---~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~---g~~~~l~~l~~~~v~ 122 (397) T protein:vir:38 61 Y-------TSES-----DRSQSIISNPS---VTANGYSFWQGMFAQLLLDGNCYAYRHKNTN---GVDLSWEYLRPSQVQ 122 (397) T ss_pred c-------cccc-----cHHHHHHhcCC---CCCCHHHHHHHHHHHhhhcCCEEEEEEECCC---CcEEEEEEEcCceeE Confidence 2 1111 11111111111 1112222345667788899999998877642 348899999998876 Q ss_pred eeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeecceecC Q lcl|NC_015287. 165 LVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDR 244 (563) Q Consensus 165 ~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~ 244 (563) .++... . +. .+|.+.- .+.. ......++.+.|.|.. ..+. T Consensus 123 i~~~~~---~--~~------------------~~y~~~~--------~~~~-------~~~~~~~~~~eiih~~--~~~~ 162 (397) T protein:vir:38 123 PMLLQD---G--SG------------------LIYNINF--------DEPA-------IGYMENVPAADVIHIR--LLSK 162 (397) T ss_pred EEEcCC---C--ce------------------EEEEEEe--------cccc-------ccceeEecCccEEEec--CCCC Confidence 542211 1 00 1222210 0000 0112456677775443 3444 Q ss_pred CCC-ccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCccccc Q lcl|NC_015287. 245 NKQ-TILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDD 323 (563) Q Consensus 245 ~~~-~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~ 323 (563) ++. .+.|-|..|.+.+.....+++...-+---.+.-+-++.++.+ +.+. +.+-+++.+..++.- .+.| T Consensus 163 ~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~-~~~e-~~~~~~~~~~~~~~~----~n~~----- 231 (397) T protein:vir:38 163 NGGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKG-GLLD-AETRIARSKEISKQI----HNSD----- 231 (397) T ss_pred CCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC-CCHH-HHHHHHHHHHHHhcc----cccC----- Confidence 554 358999999999999999988877655556666777777765 3443 334444444332210 1111 Q ss_pred ccccchhhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHH Q lcl|NC_015287. 324 RKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFT 402 (563) Q Consensus 324 ~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~ 402 (563) +.+ .+ ..|.+++.|.-..+-.+ ++=.++..+.+.++++||...|....+ ..+.+.....-|. T Consensus 232 -~~~-vl----------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~-----~~~~~e~~~~~~~ 294 (397) T protein:vir:38 232 -GPV-VI----------DALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGD-----QQSSITQISGQYA 294 (397) T ss_pred -Cce-ec----------CCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-----cccHHHHHHHHHH Confidence 111 11 12566666654333333 455678899999999999999954322 1122322222232 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhH Q lcl|NC_015287. 403 KFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLD 482 (563) Q Consensus 403 KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~ 482 (563) +-+.-+...+ .+.|-..| .++.+|+ +.|.-+- -+..|.+.++.+-. +.++|.+ T Consensus 295 ~~l~P~~~~i----e~~ln~~l-----~~~~~~~-------~~~~~~~---------d~~~~~~~~~~~~~--~G~~t~n 347 (397) T protein:vir:38 295 KSLNRYVQAI----VGELNDKL-----HANISAN-------IRFAIDA---------MGDQYASTISSSVK--GGTIAGN 347 (397) T ss_pred HHHHHHHHHH----HHHHHHhc-----cChhccc-------ccccccC---------CHHHHHHHHHHHHh--CCCcCHH Confidence 2233333332 33333332 3333332 2222111 12345555544321 1244444 Q ss_pred HHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccCCC Q lcl|NC_015287. 483 YVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAPQ 556 (563) Q Consensus 483 ~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~ 556 (563) -+++. +.+.+.+ .+-.+.+.. ............+|+.++.+. +++. +.|+ T Consensus 348 E~R~~-lg~~p~~-------------~~d~~~~~~--~~~~~~~~~~~~~g~~~~~~~------~e~~--~~~~ 397 (397) T protein:vir:38 348 QARFI-LQNSGYL-------------AKDLPDPEK--EPQQAIQLIQQEGGENDGNNS------DERG--SDPE 397 (397) T ss_pred HHHHH-hCCCCCC-------------CCccccccc--cccccccccccccCCCCCCCC------CCCC--CCCC Confidence 44432 2222200 000000100 000000001111111111111 0100 0111 No 92 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=97.83 E-value=1.6e-05 Score=46.90 Aligned_cols=436 Identities=12% Similarity=0.077 Sum_probs=177.7 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchh---------- Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVD---------- 70 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd---------- 70 (563) |-..|...+.+.. ....+++.-+.+......+. ...... ..-+.+|+.+..+.+-+ T Consensus 1 ~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~i~---------~~i~~~---~~~~~~~~~~~~YY~g~~~i~~~~~~~ 66 (474) T protein:vir:94 1 MFNIIRMPWDKPY--GEEVVEQLKPQFETQEEMIV---------RLIDDH---RKQLDKITVGQRYYDKDNDIVKQMKKV 66 (474) T ss_pred CcccccccCCCch--hhHHHHhhhhcccCHHHHHH---------HHHHHH---HHHHHHHHHHHHHhccccchhcccchh Confidence 4444444333222 11122333222221111110 000000 11112222222222211 Q ss_pred -----------------HHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhh Q lcl|NC_015287. 71 -----------------SAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYV 133 (563) Q Consensus 71 -----------------~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYv 133 (563) +-...||+- .+.=--..||.+..++- ...+..+.+++ .+|+....++.+.+++ T Consensus 67 ~~~~~~~~~~~~~ki~~n~~k~Ivd~-~~~~l~g~p~~~~~~d~--------~~~~~l~~~~~-n~~~~~~~e~~~~~~~ 136 (474) T protein:vir:94 67 DVHGNIDYDKPDWRITTNFHQNLVDQ-KVSYVASKPVTYSCEDE--------NVLKVIHDVLD-TRWDNKLIDILTATSN 136 (474) T ss_pred ccccccccccCcceeecchHHHHHHH-HHhhhhcCCceeccCcH--------HHHHHHHHHHh-ccHHHHHHHHHHHHhh Confidence 112222221 11122345666666542 22333334444 5799999999999999 Q ss_pred hhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCcccccccccc Q lcl|NC_015287. 134 DGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALG 213 (563) Q Consensus 134 DGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g 213 (563) .|+-|.+.-+| ++|-..+..+||+.+..|.--.........+...+. .+...+-+|.+.........+ T Consensus 137 ~G~~~~~~~~d----~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~--------~~~~~~~~yt~~~~~~y~~~~ 204 (474) T protein:vir:94 137 KGIDWLQVYIN----ENGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKF--------NNEEKVEFWTDTTVTYYVLEN 204 (474) T ss_pred cCceEEEEEec----CCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEe--------cCeEEEEEEeCCeEEEEEEcC Confidence 99998886665 357788999999998887321111111112222111 111222234333222211111 Q ss_pred ccCcCccccCccee-------eecccceeeeecceecCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEE Q lcl|NC_015287. 214 AIGQTPFSTKQRPV-------KIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLR-MIEDSLVIYRLSRAPERRIFY 285 (563) Q Consensus 214 ~~g~~~~~~~~~~i-------ki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~-m~EDalVIyRi~RAPeRRvFy 285 (563) .............+ -+..=+|++. .|+....|=++..+.....+. ++-+....-+.++.|-+-+.- T Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~------~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g 278 (474) T protein:vir:94 205 GGLIPDYYYGANHVQSHFSNGNWGRVPFIAF------KNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKG 278 (474) T ss_pred CccccccccCcCcccccccccCCCccceEEe------cCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec Confidence 00000000000000 0111123321 122234566666666665554 334444444556666554332 Q ss_pred ccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHH-HHHHH Q lcl|NC_015287. 286 IDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELS-DIEYF 364 (563) Q Consensus 286 IDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~-DV~YF 364 (563) .+.-. ...++..+..+++-.=+++.+ ++.|-.-.+.+... -+.-+ T Consensus 279 ~~~~~--------------------------------~~~~~~~~~~~~~i~~~~~~~--~~~l~~~~~~~~~~~~~~~l 324 (474) T protein:vir:94 279 YEGED--------------------------------LEEFMRGLKYYKAINVDGDGG--VETIQVEVPVSSTKEYIDLM 324 (474) T ss_pred CCccc--------------------------------chhhhhhhhccceeeccCCCc--eeEEeecCCHHHHHHHHHHH Confidence 21100 011111112222211223222 44444434444333 45666 Q ss_pred HHHHHHhcCCChhhccCCCcceeccccc--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhce Q lcl|NC_015287. 365 QKKLYRSLGVPESRIAGSGDGFNLGRSS--EILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHI 442 (563) Q Consensus 365 ~~kLy~aL~VP~sRl~~~~~g~~~g~~~--eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i 442 (563) .+.+|+...+|- +..+ +|. |..| .|..-......-+.+.+..|...+.++++.=+-+-|+ ..+|. .| T Consensus 325 ~~~I~~~s~~p~--~~~~--~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~--~~d~~----~i 393 (474) T protein:vir:94 325 RVYIMEFGQGVD--FQTD--KFG-SAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNL--KTDVK----DI 393 (474) T ss_pred HHHHHHHhCccc--cCcc--ccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Ccccc----ee Confidence 777888889883 2222 221 2222 2222222244445666677777777766654444444 23444 46 Q ss_pred EEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccc Q lcl|NC_015287. 443 QYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQ 522 (563) Q Consensus 443 ~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~ 522 (563) .+.|....--.+. +-++++.++ | .+|.+++++.+-..+| .+++.++|++|..+-.-..|.... T Consensus 394 ~v~f~~~~p~~~~-------e~a~~~~~~----g-~iS~et~l~~l~~v~D--~~~E~eri~~E~~~~~~~~~~~~~--- 456 (474) T protein:vir:94 394 EISFNFNRMMNDA-------EQSQIIAQS----Q-YLSRETLVKSSPLVDD--YKAELERIEQEQMEYNKQLPNLDD--- 456 (474) T ss_pred eEEeccCcccCHH-------HHHHHHHHc----C-CCCHHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhccccCC--- Confidence 7778543322232 222334442 2 5899999988755444 344455555554321101111000 Q ss_pred cccccccCCCCcCCCCCcccc Q lcl|NC_015287. 523 LGVHPEQMPGGAMNPQDPNMG 543 (563) Q Consensus 523 ~~~~~~~~~~~~~~p~~~~~~ 543 (563) .+ ...+...+.|.+.+.. T Consensus 457 ~~---~~~~~~~~~~~~~~~e 474 (474) T protein:vir:94 457 GG---ADGAQQQEGSNNKESE 474 (474) T ss_pred CC---CCCcccCCCCcccccC Confidence 00 0000111112221111 No 93 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=97.83 E-value=1.6e-05 Score=46.90 Aligned_cols=436 Identities=12% Similarity=0.077 Sum_probs=177.7 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchh---------- Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVD---------- 70 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd---------- 70 (563) |-..|...+.+.. ....+++.-+.+......+. ...... ..-+.+|+.+..+.+-+ T Consensus 1 ~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~i~---------~~i~~~---~~~~~~~~~~~~YY~g~~~i~~~~~~~ 66 (474) T protein:vir:97 1 MFNIIRMPWDKPY--GEEVVEQLKPQFETQEEMIV---------RLIDDH---RKQLDKITVGQRYYDKDNDIVKQMKKV 66 (474) T ss_pred CcccccccCCCch--hhHHHHhhhhcccCHHHHHH---------HHHHHH---HHHHHHHHHHHHHhccccchhcccchh Confidence 4444444333222 11122333222221111110 000000 11112222222222211 Q ss_pred -----------------HHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhh Q lcl|NC_015287. 71 -----------------SAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYV 133 (563) Q Consensus 71 -----------------~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYv 133 (563) +-...||+- .+.=--..||.+..++- ...+..+.+++ .+|+....++.+.+++ T Consensus 67 ~~~~~~~~~~~~~ki~~n~~k~Ivd~-~~~~l~g~p~~~~~~d~--------~~~~~l~~~~~-n~~~~~~~e~~~~~~~ 136 (474) T protein:vir:97 67 DVHGNIDYDKPDWRITTNFHQNLVDQ-KVSYVASKPVTYSCEDE--------NVLKVIHDVLD-TRWDNKLIDILTATSN 136 (474) T ss_pred ccccccccccCcceeecchHHHHHHH-HHhhhhcCCceeccCcH--------HHHHHHHHHHh-ccHHHHHHHHHHHHhh Confidence 112222221 11122345666666542 22333334444 5799999999999999 Q ss_pred hhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCcccccccccc Q lcl|NC_015287. 134 DGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALG 213 (563) Q Consensus 134 DGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g 213 (563) .|+-|.+.-+| ++|-..+..+||+.+..|.--.........+...+. .+...+-+|.+.........+ T Consensus 137 ~G~~~~~~~~d----~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~--------~~~~~~~~yt~~~~~~y~~~~ 204 (474) T protein:vir:97 137 KGIDWLQVYIN----ENGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKF--------NNEEKVEFWTDTTVTYYVLEN 204 (474) T ss_pred cCceEEEEEec----CCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEe--------cCeEEEEEEeCCeEEEEEEcC Confidence 99998886665 357788999999998887321111111112222111 111222234333222211111 Q ss_pred ccCcCccccCccee-------eecccceeeeecceecCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEE Q lcl|NC_015287. 214 AIGQTPFSTKQRPV-------KIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLR-MIEDSLVIYRLSRAPERRIFY 285 (563) Q Consensus 214 ~~g~~~~~~~~~~i-------ki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~-m~EDalVIyRi~RAPeRRvFy 285 (563) .............+ -+..=+|++. .|+....|=++..+.....+. ++-+....-+.++.|-+-+.- T Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~------~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g 278 (474) T protein:vir:97 205 GGLIPDYYYGANHVQSHFSNGNWGRVPFIAF------KNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKG 278 (474) T ss_pred CccccccccCcCcccccccccCCCccceEEe------cCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec Confidence 00000000000000 0111123321 122234566666666665554 334444444556666554332 Q ss_pred ccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHH-HHHHH Q lcl|NC_015287. 286 IDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELS-DIEYF 364 (563) Q Consensus 286 IDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~-DV~YF 364 (563) .+.-. ...++..+..+++-.=+++.+ ++.|-.-.+.+... -+.-+ T Consensus 279 ~~~~~--------------------------------~~~~~~~~~~~~~i~~~~~~~--~~~l~~~~~~~~~~~~~~~l 324 (474) T protein:vir:97 279 YEGED--------------------------------LEEFMRGLKYYKAINVDGDGG--VETIQVEVPVSSTKEYIDLM 324 (474) T ss_pred CCccc--------------------------------chhhhhhhhccceeeccCCCc--eeEEeecCCHHHHHHHHHHH Confidence 21100 011111112222211223222 44444434444333 45666 Q ss_pred HHHHHHhcCCChhhccCCCcceeccccc--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhce Q lcl|NC_015287. 365 QKKLYRSLGVPESRIAGSGDGFNLGRSS--EILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHI 442 (563) Q Consensus 365 ~~kLy~aL~VP~sRl~~~~~g~~~g~~~--eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i 442 (563) .+.+|+...+|- +..+ +|. |..| .|..-......-+.+.+..|...+.++++.=+-+-|+ ..+|. .| T Consensus 325 ~~~I~~~s~~p~--~~~~--~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~--~~d~~----~i 393 (474) T protein:vir:97 325 RVYIMEFGQGVD--FQTD--KFG-SAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNL--KTDVK----DI 393 (474) T ss_pred HHHHHHHhCccc--cCcc--ccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Ccccc----ee Confidence 777888889883 2222 221 2222 2222222244445666677777777766654444444 23444 46 Q ss_pred EEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccc Q lcl|NC_015287. 443 QYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQ 522 (563) Q Consensus 443 ~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~ 522 (563) .+.|....--.+. +-++++.++ | .+|.+++++.+-..+| .+++.++|++|..+-.-..|.... T Consensus 394 ~v~f~~~~p~~~~-------e~a~~~~~~----g-~iS~et~l~~l~~v~D--~~~E~eri~~E~~~~~~~~~~~~~--- 456 (474) T protein:vir:97 394 EISFNFNRMMNDA-------EQSQIIAQS----Q-YLSRETLVKSSPLVDD--YKAELERIEQEQMEYNKQLPNLDD--- 456 (474) T ss_pred eEEeccCcccCHH-------HHHHHHHHc----C-CCCHHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhccccCC--- Confidence 7778543322232 222334442 2 5899999988755444 344455555554321101111000 Q ss_pred cccccccCCCCcCCCCCcccc Q lcl|NC_015287. 523 LGVHPEQMPGGAMNPQDPNMG 543 (563) Q Consensus 523 ~~~~~~~~~~~~~~p~~~~~~ 543 (563) .+ ...+...+.|.+.+.. T Consensus 457 ~~---~~~~~~~~~~~~~~~e 474 (474) T protein:vir:97 457 GG---ADGAQQQEGSNNKESE 474 (474) T ss_pred CC---CCCcccCCCCcccccC Confidence 00 0000111112221111 No 94 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=97.83 E-value=1.6e-05 Score=46.81 Aligned_cols=425 Identities=14% Similarity=0.107 Sum_probs=178.0 Q ss_pred cccccchhhHhhhc-----ccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhh----------- Q lcl|NC_015287. 2 AKLFGFKIQKDDEQ-----NKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMAL----------- 65 (563) Q Consensus 2 ~~~~~f~~~~~~~~-----~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~----------- 65 (563) --|.++-..-.++. ++..|....+ ....-..+.+.|+.... T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~-----------------------~~~r~~~~~~~y~~~~~~~~~~~~~~~~ 57 (474) T protein:vir:10 1 MTLYKLIDDIEAQGILPKHIEALIESHKD-----------------------DRERMVNLYNRYKTHIDYVPIFKRRPIE 57 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhh-----------------------hhHHHHHHHHHHhhhcchhhhhcchhhh Confidence 11222211000000 0000110000 00001122223322211 Q ss_pred -----------------------ccchhHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchh Q lcl|NC_015287. 66 -----------------------HPEVDSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDD 122 (563) Q Consensus 66 -----------------------~PEvd~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~ 122 (563) ++-....|+..+. . --..||.+..++ +++..+.+.+.+..+++-.+|+. T Consensus 58 ~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~--y---l~g~pv~~~~~~---~~~~~e~~~~~l~~~~~~n~~~~ 129 (474) T protein:vir:10 58 EKEDFETGGNVRRLDVSVNNKLNNSFDSEIVDTRVG--Y---LHGVPVTYDLDE---NAEKNEKLKKFITNFAIRNSVDD 129 (474) T ss_pred hhhhhhhcccccccccCcccccccchHHHHHHhHhh--h---eeccceeEeeCC---CCcchHHHHHHHHHHHhhcCHhH Confidence 1222222222222 1 135688887753 22334577777888888889999 Q ss_pred hhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecc-cccccccccccceeee Q lcl|NC_015287. 123 KCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSK-NPADIENPEIEEYYQY 201 (563) Q Consensus 123 ~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~-~~~~~~~~~~~eyf~y 201 (563) ...++.+...+.|+-|.+.-+| ++|-..++.+||+.+-.|.. ++. ..+..++.- .........+....+| T Consensus 130 ~~~~~~~~~~~~G~a~~~~~~d----~~~~~~~~~i~p~~~~~v~d----~~~-~~~~~i~~~~~~~~~~~~~~~~~~~y 200 (474) T protein:vir:10 130 EDSEIGKMAAICGYGARLAYID----TNGDIRIKNIDPYNVIFVGD----NIL-EPTYSLRYFYEKDDDNGTDYVYAEFY 200 (474) T ss_pred HHHHHHHHHhhcCeEEEEEEeC----CCCeeEEEEEcccceEEEEc----CCC-ceEEEEEEEEEeeCCCceEEEEEEEE Confidence 9999999999999988876565 45678899999999887742 221 222221110 0000000111223444 Q ss_pred cCccccccccccccCcCccccCcce-eeecccceeeeecceecCCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCc Q lcl|NC_015287. 202 DPSQTQGKNALGAIGQTPFSTKQRP-VKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRM-IEDSLVIYRLSRAP 279 (563) Q Consensus 202 ~~~~~~~~~~~g~~g~~~~~~~~~~-iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m-~EDalVIyRi~RAP 279 (563) .+.........+.++........+. -++| .|.|+ |.....|-|+..+.....+.. +-+....-+-++.| T Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP--vv~~~-------n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~ 271 (474) T protein:vir:10 201 DNAYYYVFRGEGIDALQEVGRYEHLFDYNP--LFGVP-------NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLA 271 (474) T ss_pred cCceEEEEeecCCCcccccccccCCCCccc--eEEec-------CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 4433322221111110000000000 0111 01122 333446777776666655543 33333444555555 Q ss_pred cceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcc-hH Q lcl|NC_015287. 280 ERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLG-EL 358 (563) Q Consensus 280 eRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLg-ei 358 (563) -+-+.-. .++. +-+..+ .. .+. .|++ + .+..+..|--..+.. .. T Consensus 272 ~l~i~g~---~~~~----~~~~~~-~~--~~~---------------------i~~~--~--~~~~~~~l~~~~~~~~~~ 316 (474) T protein:vir:10 272 YLVLRGM---GMSE----EMIQET-QK--SGA---------------------FELF--D--KDMDVKYLTKDVNDTMIE 316 (474) T ss_pred hhhhccC---CCCc----hhhhhh-hh--cce---------------------eEec--C--CCCceeEEeccCCHHHHH Confidence 5443322 1121 111111 00 111 1111 0 112344443333322 22 Q ss_pred HHHHHHHHHHHHhcCCChhhccCCCcceeccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---cCC-CCh Q lcl|NC_015287. 359 SDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEIL--RDEIKFTKFVGRMRKRFSHLFNDMLKTQLIL---KNI-VTP 432 (563) Q Consensus 359 ~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eIt--RDElKF~KfI~rLr~rFs~if~d~Lk~qLiL---kgI-~t~ 432 (563) .-+.-+.+.+|....+|-- ..+ +|. |..|... .-......-+.+.+..|..-+.+.++.=+-+ +|. ..+ T Consensus 317 ~~~~~l~~~I~~~s~~p~~--~~~--~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~ 391 (474) T protein:vir:10 317 NHLDRIEKNIMRFAKSVNF--NSD--EFN-GNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDD 391 (474) T ss_pred HHHHHHHHHHHHHhCCccc--ccc--ccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCc Confidence 3456677888888888842 112 222 2222222 1111123345566666666666666553322 221 123 Q ss_pred hHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCC Q lcl|NC_015287. 433 EDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKL 512 (563) Q Consensus 433 eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~ 512 (563) .+| ..|.+.|....--.+.. .+++++.+. | .+|.+++++..-..+| .+++-++|++|..+-.- T Consensus 392 ~~~----~~i~~~f~~~~p~d~~e-------~a~~~~kl~---g-~iS~et~~~~l~~v~d--~~~E~eri~~E~~e~~~ 454 (474) T protein:vir:10 392 DSY----LNLIFKFTRNIPVNKLE-------ESQVLINLK---G-QVSERTRLGQSQLVDD--VDYELDEMEKESLEFND 454 (474) T ss_pred ccc----ccceEEeCCCCCCCHHH-------HHHHHHHHh---c-cCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHh Confidence 334 35778887654444433 334444442 4 4899999998754443 44444455544432111 Q ss_pred CCCcccCccccccccccCCCCcCCCCCcccc Q lcl|NC_015287. 513 ADPMEVQQLQLGVHPEQMPGGAMNPQDPNMG 543 (563) Q Consensus 513 ~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 543 (563) ..|+....... ..+++-+.+ T Consensus 455 ~~~~~~~~~~~-----------~~~~~~~s~ 474 (474) T protein:vir:10 455 KLPDIDEGDAN-----------DKSQNNQSE 474 (474) T ss_pred hcccccCCCcC-----------CCCccccCC Confidence 11111100000 001111011 No 95 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=97.83 E-value=1.6e-05 Score=46.81 Aligned_cols=425 Identities=14% Similarity=0.107 Sum_probs=178.0 Q ss_pred cccccchhhHhhhc-----ccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhh----------- Q lcl|NC_015287. 2 AKLFGFKIQKDDEQ-----NKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMAL----------- 65 (563) Q Consensus 2 ~~~~~f~~~~~~~~-----~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~----------- 65 (563) --|.++-..-.++. ++..|....+ ....-..+.+.|+.... T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~-----------------------~~~r~~~~~~~y~~~~~~~~~~~~~~~~ 57 (474) T protein:vir:94 1 MTLYKLIDDIEAQGILPKHIEALIESHKD-----------------------DRERMVNLYNRYKTHIDYVPIFKRRPIE 57 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhh-----------------------hhHHHHHHHHHHhhhcchhhhhcchhhh Confidence 11222211000000 0000110000 00001122223322211 Q ss_pred -----------------------ccchhHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchh Q lcl|NC_015287. 66 -----------------------HPEVDSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDD 122 (563) Q Consensus 66 -----------------------~PEvd~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~ 122 (563) ++-....|+..+. . --..||.+..++ +++..+.+.+.+..+++-.+|+. T Consensus 58 ~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~--y---l~g~pv~~~~~~---~~~~~e~~~~~l~~~~~~n~~~~ 129 (474) T protein:vir:94 58 EKEDFETGGNVRRLDVSVNNKLNNSFDSEIVDTRVG--Y---LHGVPVTYDLDE---NAEKNEKLKKFITNFAIRNSVDD 129 (474) T ss_pred hhhhhhhcccccccccCcccccccchHHHHHHhHhh--h---eeccceeEeeCC---CCcchHHHHHHHHHHHhhcCHhH Confidence 1222222222222 1 135688887753 22334577777888888889999 Q ss_pred hhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecc-cccccccccccceeee Q lcl|NC_015287. 123 KCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSK-NPADIENPEIEEYYQY 201 (563) Q Consensus 123 ~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~-~~~~~~~~~~~eyf~y 201 (563) ...++.+...+.|+-|.+.-+| ++|-..++.+||+.+-.|.. ++. ..+..++.- .........+....+| T Consensus 130 ~~~~~~~~~~~~G~a~~~~~~d----~~~~~~~~~i~p~~~~~v~d----~~~-~~~~~i~~~~~~~~~~~~~~~~~~~y 200 (474) T protein:vir:94 130 EDSEIGKMAAICGYGARLAYID----TNGDIRIKNIDPYNVIFVGD----NIL-EPTYSLRYFYEKDDDNGTDYVYAEFY 200 (474) T ss_pred HHHHHHHHHhhcCeEEEEEEeC----CCCeeEEEEEcccceEEEEc----CCC-ceEEEEEEEEEeeCCCceEEEEEEEE Confidence 9999999999999988876565 45678899999999887742 221 222221110 0000000111223444 Q ss_pred cCccccccccccccCcCccccCcce-eeecccceeeeecceecCCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCc Q lcl|NC_015287. 202 DPSQTQGKNALGAIGQTPFSTKQRP-VKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRM-IEDSLVIYRLSRAP 279 (563) Q Consensus 202 ~~~~~~~~~~~g~~g~~~~~~~~~~-iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m-~EDalVIyRi~RAP 279 (563) .+.........+.++........+. -++| .|.|+ |.....|-|+..+.....+.. +-+....-+-++.| T Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP--vv~~~-------n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~ 271 (474) T protein:vir:94 201 DNAYYYVFRGEGIDALQEVGRYEHLFDYNP--LFGVP-------NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLA 271 (474) T ss_pred cCceEEEEeecCCCcccccccccCCCCccc--eEEec-------CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 4433322221111110000000000 0111 01122 333446777776666655543 33333444555555 Q ss_pred cceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcc-hH Q lcl|NC_015287. 280 ERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLG-EL 358 (563) Q Consensus 280 eRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLg-ei 358 (563) -+-+.-. .++. +-+..+ .. .+. .|++ + .+..+..|--..+.. .. T Consensus 272 ~l~i~g~---~~~~----~~~~~~-~~--~~~---------------------i~~~--~--~~~~~~~l~~~~~~~~~~ 316 (474) T protein:vir:94 272 YLVLRGM---GMSE----EMIQET-QK--SGA---------------------FELF--D--KDMDVKYLTKDVNDTMIE 316 (474) T ss_pred hhhhccC---CCCc----hhhhhh-hh--cce---------------------eEec--C--CCCceeEEeccCCHHHHH Confidence 5443322 1121 111111 00 111 1111 0 112344443333322 22 Q ss_pred HHHHHHHHHHHHhcCCChhhccCCCcceeccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---cCC-CCh Q lcl|NC_015287. 359 SDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEIL--RDEIKFTKFVGRMRKRFSHLFNDMLKTQLIL---KNI-VTP 432 (563) Q Consensus 359 ~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eIt--RDElKF~KfI~rLr~rFs~if~d~Lk~qLiL---kgI-~t~ 432 (563) .-+.-+.+.+|....+|-- ..+ +|. |..|... .-......-+.+.+..|..-+.+.++.=+-+ +|. ..+ T Consensus 317 ~~~~~l~~~I~~~s~~p~~--~~~--~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~ 391 (474) T protein:vir:94 317 NHLDRIEKNIMRFAKSVNF--NSD--EFN-GNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDD 391 (474) T ss_pred HHHHHHHHHHHHHhCCccc--ccc--ccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCc Confidence 3456677888888888842 112 222 2222222 1111123345566666666666666553322 221 123 Q ss_pred hHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCC Q lcl|NC_015287. 433 EDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKL 512 (563) Q Consensus 433 eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~ 512 (563) .+| ..|.+.|....--.+.. .+++++.+. | .+|.+++++..-..+| .+++-++|++|..+-.- T Consensus 392 ~~~----~~i~~~f~~~~p~d~~e-------~a~~~~kl~---g-~iS~et~~~~l~~v~d--~~~E~eri~~E~~e~~~ 454 (474) T protein:vir:94 392 DSY----LNLIFKFTRNIPVNKLE-------ESQVLINLK---G-QVSERTRLGQSQLVDD--VDYELDEMEKESLEFND 454 (474) T ss_pred ccc----ccceEEeCCCCCCCHHH-------HHHHHHHHh---c-cCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHh Confidence 334 35778887654444433 334444442 4 4899999998754443 44444455544432111 Q ss_pred CCCcccCccccccccccCCCCcCCCCCcccc Q lcl|NC_015287. 513 ADPMEVQQLQLGVHPEQMPGGAMNPQDPNMG 543 (563) Q Consensus 513 ~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 543 (563) ..|+....... ..+++-+.+ T Consensus 455 ~~~~~~~~~~~-----------~~~~~~~s~ 474 (474) T protein:vir:94 455 KLPDIDEGDAN-----------DKSQNNQSE 474 (474) T ss_pred hcccccCCCcC-----------CCCccccCC Confidence 11111100000 001111011 No 96 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=97.82 E-value=1.7e-05 Score=46.79 Aligned_cols=443 Identities=17% Similarity=0.252 Sum_probs=182.6 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCc-ccce------eccccccccccccchhhhHHHHHHHHHHHhhccchhHHH Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDS-SDYY------VSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAI 73 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~-~~~~------~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai 73 (563) +++.|-.+..-+....++..+- -.|+ ++.. ..+||-| +-.--.|||+||.++++ T Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~l~~~~~~~F~G---------------y~~la~laQ~~eyr~~~ 129 (698) T protein:vir:10 69 LARQFEVDVSNYTPRERRAASY----ALDFNGTSMDALSFVTSSGFPG---------------FPTLVLLAQLPEYRAMH 129 (698) T ss_pred ccccceeccccCCccccchhhh----hhcccccccccchhhhccCcch---------------HHHHHHHhhccchhhHH Confidence 4555543333222222221111 0011 0000 1111111 12234689999999999 Q ss_pred HHhhcceE------EecCCCcE----EEEEeeccccch-HHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhh--hhee Q lcl|NC_015287. 74 EDIINEAI------VSDQNDSP----VEIDLENLPASK-QLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGR--IYYH 140 (563) Q Consensus 74 ~eIvneai------v~d~~~~p----V~l~l~~~~~s~-sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGr--i~fh 140 (563) +-|..||+ ++...++. +.+.-+....++ .-.++|..|++. |+...+.-+.++-=-+.|. +|+ T Consensus 130 ~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~er----l~V~~~l~eai~~aRlfGGa~~~i- 204 (698) T protein:vir:10 130 EVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIER----LRIRDAVRTTVIHDQAFGRAHPYF- 204 (698) T ss_pred HHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHH----HHHHHHHHHHHHhcccccceEEEE- Confidence 99999993 33322221 333333334444 344567777654 3334444444444344444 233 Q ss_pred eeecCC--------------CCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccc Q lcl|NC_015287. 141 KVIDIK--------------APEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQT 206 (563) Q Consensus 141 kviD~~--------------~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~ 206 (563) .|+-+ -.|.+++.|+.|||..+-+= . .+.+ ++..-.|-.-..|++. T Consensus 205 -~I~gdd~~l~~PL~~~~~~I~kGslKGL~ViDp~~vtP~-------~-~n~~------dP~spdfgkP~~y~V~----- 264 (698) T protein:vir:10 205 -KIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPN-------N-YNSI------NPVADDFYKPSTWWMI----- 264 (698) T ss_pred -EeecCccccccccccccccccCccceeeeeecccccccc-------h-hhhc------cchhhccCCCceEEEe----- Confidence 33322 22457777888887543321 1 1100 1111112222222221 Q ss_pred cccccccccCcCccccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHH-HHHHHHHH-HHHHHhcCccceEE Q lcl|NC_015287. 207 QGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQ-LRMIEDSL-VIYRLSRAPERRIF 284 (563) Q Consensus 207 ~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~Nq-L~m~EDal-VIyRi~RAPeRRvF 284 (563) |. ..+.++.+.+...++... |-......+.|.+.++..-..+ ++....+. ++.+..... + T Consensus 265 ---------G~--~IH~SRL~~~vg~pvpd~---LKp~y~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~~~~----l 326 (698) T protein:vir:10 265 ---------GS--EVHATRLHTIVSRPVGDM---LKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSG----I 326 (698) T ss_pred ---------cc--eecceeEEEecCCCchhh---hcchhccCCccHHHHHHHHHHHHHHHhhhHHHHHHHhhHHH----H Confidence 11 134444444444433222 1122222345666665554433 33333322 222211111 0 Q ss_pred EccCCC-CchHHHHHHHH--HHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHH Q lcl|NC_015287. 285 YIDVGN-LPKMKAEQYLK--EVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDI 361 (563) Q Consensus 285 yIDvGn-lpk~KAeqYl~--~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV 361 (563) ..|... |.....++... +++++||.- -|-+-=|+ =.|||- +. ..+|+-++|| T Consensus 327 ~~dla~aL~~g~~~~l~~R~eli~~~Rsn------~G~~llDk----~~Eefe-------------q~--st~lSGLddV 381 (698) T protein:vir:10 327 LMDLAQALTPGANVDLSMRAELINRYRDN------RNILFLDK----ATEEFF-------------QF--NTPLSGLDAL 381 (698) T ss_pred HHHHHHhcCChhhHHHHHHHHHHHHhcCc------cceEEEec----CCcceE-------------EE--ecCcCCHHHH Confidence 111110 00011111111 344455421 12110000 013331 11 2478888887 Q ss_pred H-HHHHHHHHhcCCChhhccC-CCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHhcCCCChhH Q lcl|NC_015287. 362 E-YFQKKLYRSLGVPESRIAG-SGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKT-----QLILKNIVTPED 434 (563) Q Consensus 362 ~-YF~~kLy~aL~VP~sRl~~-~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~-----qLiLkgI~t~ee 434 (563) . =|..-+=-+.+||+.||=+ ...|||--.-++ .-.|...|..+|. ..+...|++ |+-+-|-+. T Consensus 382 i~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D----~rnYYD~I~s~Qe---~~L~p~L~rl~~ii~rS~~G~id--- 451 (698) T protein:vir:10 382 QAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGE----IRVWYDYVRAYQR---NALQQLMNDVIVMIQLSLFGAVD--- 451 (698) T ss_pred HHHHHHHHHhhhcCchhhhhccCCcccCccchhh----HHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHhcCCCC--- Confidence 4 5888888899999999943 344776522222 3458999988775 333343433 444445443 Q ss_pred HHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCC Q lcl|NC_015287. 435 WEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLAD 514 (563) Q Consensus 435 w~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~ 514 (563) ..|.|.|+.=..-+|..-+||...+.+.-..+-.- -++ +..+.+..+..+. .+.|.. T Consensus 452 -----p~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~------------gvI-----~~~evr~rL~~d~-~s~Y~~ 508 (698) T protein:vir:10 452 -----PSIKWQWNALRELDDLEVAEARYKQAQSDVLYVQE------------QVI-----RPDQVAARLNTEP-DGPYAG 508 (698) T ss_pred -----CcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHh------------cCC-----CHHHHHHHHhccC-CCcccc Confidence 45889999877778888888888888765433211 111 1122223333221 123311 Q ss_pred -CcccCc------ccc-cccc---ccCC-CCcCCCCCc--ccccccCCCCcccCCCCCC--------------------C Q lcl|NC_015287. 515 -PMEVQQ------LQL-GVHP---EQMP-GGAMNPQDP--NMGGAPQEGGAEQAPQMPE--------------------G 560 (563) Q Consensus 515 -p~~~~~------~~~-~~~~---~~~~-~~~~~p~~~--~~~~~~~~~~~~~~~~~~~--------------------~ 560 (563) -++.++ ++. +..+ +... ++...|..| -...+.-||++.+-+.--+ | T Consensus 509 ~~d~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~giv~~~g 588 (698) T protein:vir:10 509 KLDANDDPGAPADDDIDGVLTYVQRMAEGGDTGAPTAPGGARAGATAPPAAANVNANANPREAGAQDAAMRAAGIVFRAG 588 (698) T ss_pred ccCCcccCCCCCCCcchHHHhhhcCCcCCCCcccccccccccCCCCCCcccccccCCCCccccCcccceeeEEEEEEEcC Confidence 000000 000 0000 0000 000111111 1111222222222111111 1 Q ss_pred CCC Q lcl|NC_015287. 561 GEI 563 (563) Q Consensus 561 ~~~ 563 (563) ++| T Consensus 589 ~~v 591 (698) T protein:vir:10 589 DKV 591 (698) T ss_pred CeE Confidence 122 No 97 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=97.82 E-value=1.7e-05 Score=46.78 Aligned_cols=397 Identities=13% Similarity=0.140 Sum_probs=168.9 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) |++--=|+..+.. ..++..- ++... .. ...+.....+.++- .+...++|-|..||+-|.+.+ T Consensus 1 ~~~~~~~~~~k~~--~~~~~~~-~~~~~-~~--~~~~~~~~~~~~v~------------~~~a~~~~~V~~ci~~ia~~i 62 (409) T protein:vir:96 1 MAKENIVTRIKKK--LIDNWID-QSASK-LY--DFSPWKNKSFWGVI------------NNTLETNETIFSAITKLSNSM 62 (409) T ss_pred CccccchhhhhhH--Hhhhhhc-ccccc-cc--ccccccCccccccc------------hhhHhhhHHHHHHHHHHHHhh Confidence 4432222221111 0000000 00000 00 00000001111111 112346788999999999876 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcchhhhH----HHHHHHhhhhhhheeeeecCCCCCcceeEE Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFDDKCH----EILRNWYVDGRIYYHKVIDIKAPEEGIKEV 155 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~~~~~----~~~R~WYvDGri~fhkviD~~~~k~GI~eL 155 (563) .-++ +.+ +.+++..+ ....++|+. =+-...++ .++..++++|.-|..++.|.. .-+++| T Consensus 63 a~lp-----~~~-~~~~~~~~-------~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~---G~~~~L 126 (409) T protein:vir:96 63 ASLP-----LKM-YEDYKVVN-------TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY---HQPSKL 126 (409) T ss_pred hhCc-----eEE-eecccccc-------hhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCC---CcEEEE Confidence 6432 332 12222211 112222221 12223343 445566778998887766643 227899 Q ss_pred EEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeeccccee Q lcl|NC_015287. 156 RYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAIS 235 (563) Q Consensus 156 r~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~ 235 (563) .+|+|..|..++. ++. ..-||.|.... .+.+.++.+.|. T Consensus 127 ~~l~~~~v~v~~~---~~~--------------------~~~~y~~~~~~------------------g~~~~~~~~evi 165 (409) T protein:vir:96 127 FLLNPDVVEMLIE---NQS--------------------RELYYSIHAAT------------------GNKLIVHNMDML 165 (409) T ss_pred EEEcCceeEEEEe---CCC--------------------cEEEEEEEcCC------------------ceEEEEccccEE Confidence 9999988876632 110 01233332210 112345555553 Q ss_pred eeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeC Q lcl|NC_015287. 236 FCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDA 315 (563) Q Consensus 236 y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~ 315 (563) |-. +....++-.+.|.|..|.........++.. ..+...+.| . +...-.+.|.+.+++...+.+.+.|.| T Consensus 166 h~r-~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~-~~~~~~~~~-~-~i~~~~~~l~~e~~~~~~~~~~~~~~n------ 235 (409) T protein:vir:96 166 HFK-HIVASNMVQGISPIDVLKNTTDFDNAVRTF-NLTEMQKPD-S-FMLKYGSNVSTEKRQQVLEDFKQYYEE------ 235 (409) T ss_pred EeC-CCCCCCccccccHHHHHHHHHHHHHHHHHH-HHHhcCCCc-e-eEEecCCCCCHHHHHHHHHHHHHHhhc------ Confidence 321 222344445678888888777766666554 355555544 2 344445777777776666665554432 Q ss_pred CCCcccccccccchhhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchh Q lcl|NC_015287. 316 STGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG-QNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEI 394 (563) Q Consensus 316 ~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eI 394 (563) .|.+ + .++ -|.+++.|.-. +.+.-++-..|..+.+.++++||.+.|...++ -+.+.. T Consensus 236 -~g~~------~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~~~s~~--- 293 (409) T protein:vir:96 236 -NGGI------L-FQE----------PGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSN-TNFAKN--- 293 (409) T ss_pred -CCCe------e-ecC----------CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-CCcccH--- Confidence 2321 1 121 25677777422 22222334446678899999999999954321 122222 Q ss_pred hHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_015287. 395 LRDEIKFTKF-VGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDP 473 (563) Q Consensus 395 tRDElKF~Kf-I~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p 473 (563) .-.-+.|.++ |.-+-.++. +.|-. .++++.++. ....+.|.-+. +.... +..|++.+..+-. T Consensus 294 e~~~~~f~~~~l~P~~~~ie----~~l~~-----~Ll~~~~~~---~g~~i~fd~~~----ll~~d-~~~~~e~~~~~~~ 356 (409) T protein:vir:96 294 EELNRFYLQHTLLPIVKQYE----EEFNR-----KLLTKTDRE---KNRYFKFNVKS----YLRAD-SATQAEVYFKAVR 356 (409) T ss_pred HHHHHHHHHHHHHHHHHHHH----HHHHh-----hcCCccccc---CcceEEeechh----hhccC-HHHHHHHHHHHHh Confidence 1122235555 333333222 22333 334455554 23445554332 22222 2456666544422 Q ss_pred hhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCccc Q lcl|NC_015287. 474 YVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQ 553 (563) Q Consensus 474 ~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~ 553 (563) - -+++..-++.. ..-+++ |+-+ .. -..+. ..|.+. + ++ T Consensus 357 ~--G~~T~NE~R~~-------------------~g~~pi--~ggD---~~-----~~~~n-~~~~~~-------~---~~ 394 (409) T protein:vir:96 357 S--GYYTINDIREW-------------------EDLPPV--EGGD---KP-----LISGD-LYPIDT-------P---LE 394 (409) T ss_pred C--CCCCHHHHHHH-------------------hCCCCC--CCcc---ee-----eeccc-cccccc-------c---hh Confidence 2 13333333221 122222 1100 00 00000 001110 0 00 Q ss_pred CCCCCCCCCC Q lcl|NC_015287. 554 APQMPEGGEI 563 (563) Q Consensus 554 ~~~~~~~~~~ 563 (563) .-+-.+|||- T Consensus 395 ~~~~~~gG~~ 404 (409) T protein:vir:96 395 LRKSLKGGDK 404 (409) T ss_pred hcccccCCCC Confidence 0011233333 No 98 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=97.82 E-value=1.7e-05 Score=46.74 Aligned_cols=396 Identities=14% Similarity=0.125 Sum_probs=168.4 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) |++=-=+++.+. .+.........+... ++..........+ ..+...++|.|..||+-|.+.+ T Consensus 1 ~~~~~~~~~~k~---------~~~~~~~~~~~~~~~-~~~~~~~~~~~~v--------~~~~a~~~~~v~~~i~~Ia~~i 62 (409) T protein:vir:94 1 MAKENIVTRIKK---------KLIDNWIDQSASKLY-DFSPWKNKSFWGV--------INNTLETNETIFSAITKLSNSM 62 (409) T ss_pred Ccccccchhhhh---------HHhhhhhcCCccccc-ccccccCcccccc--------chhhhhccHHHHHHHHHHHHhh Confidence 332211111111 111000000000000 0100000110001 1122456889999999999876 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcchhh----hHHHHHHHhhhhhhheeeeecCCCCCcceeEE Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFDDK----CHEILRNWYVDGRIYYHKVIDIKAPEEGIKEV 155 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~~~----~~~~~R~WYvDGri~fhkviD~~~~k~GI~eL 155 (563) .-++ +.+- +.++..+ ....++|+. =+-... -+.++..++++|.-|..++.|... =+++| T Consensus 63 a~lp-----~~~~-~~~~~~~-------~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G---~~~~L 126 (409) T protein:vir:94 63 ASLP-----LKMY-EDYKVVN-------TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYH---QPSKL 126 (409) T ss_pred hhCc-----eeEe-ecccccc-------hhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC---cEEEE Confidence 5332 2221 2222111 112222321 122222 345566778889988877666432 27899 Q ss_pred EEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeeccccee Q lcl|NC_015287. 156 RYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAIS 235 (563) Q Consensus 156 r~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~ 235 (563) .+|+|..|..++.- +. ..-||.|... . .+.+.++.+.|. T Consensus 127 ~~l~~~~v~v~~~~---~~--------------------~~~~y~~~~~----------~--------g~~~~~~~~dvi 165 (409) T protein:vir:94 127 FLLNPDVVEMLIEN---QS--------------------RELYYSIHAA----------T--------GNKLIVHNMDML 165 (409) T ss_pred EEEcCceeEEEEeC---CC--------------------cEEEEEEEcC----------C--------ceEEEEccccEE Confidence 99999887655321 00 0123333221 0 112456666663 Q ss_pred eeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeC Q lcl|NC_015287. 236 FCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDA 315 (563) Q Consensus 236 y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~ 315 (563) |.. +....++-.+.|-|..|.+.......++.. -+....+.| .++..--+.+.+.+++...+.+.+.|. T Consensus 166 h~r-~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~-~~~~~~~~~--~~i~~~~~~l~~e~~~~~~~~~~~~~~------- 234 (409) T protein:vir:94 166 HFK-HIVASNMVQGISPIDVLKNTTDFDNAVRTF-NLTEMQKPD--SFMLKYGSNVGKEKRQQVLEDFKQYYE------- 234 (409) T ss_pred Eec-CCCCCCccccccHHHHHHHHHHHHHHHHHH-HHHhcCCCC--eeEEecCCCCCHHHHHHHHHHHHHHhh------- Confidence 331 222334445577788887777766666554 344555544 233334556666666555555444332 Q ss_pred CCCcccccccccchhhhhcccccCCCCccceeecCCCC-CcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchh Q lcl|NC_015287. 316 STGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQ-NLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEI 394 (563) Q Consensus 316 ~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~-nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eI 394 (563) ++|.+ + .++ -|.+++.|.-.. .+.-++-..|-.+.+.++++||...|...++ -+. +.+ T Consensus 235 ~~g~~------~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~~~---sn~ 293 (409) T protein:vir:94 235 ENGGI------L-FQE----------PGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSN-TNF---AKN 293 (409) T ss_pred cCCCe------e-ecC----------CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-CCc---ccH Confidence 23322 1 121 257777775332 2222444556678899999999999964321 111 122 Q ss_pred hHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_015287. 395 LRDEIKFTKFV-GRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDP 473 (563) Q Consensus 395 tRDElKF~KfI-~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p 473 (563) .-.-+.|.+++ .-+-.++ .+.|-. .++++.+|.. ...+.|..+. +.... +..|++.+..+-. T Consensus 294 e~~~~~f~~~~l~P~~~~i----e~~ln~-----~Ll~~~~~~~---~~~i~fd~~~----ll~~d-~~~~~~~~~~~~~ 356 (409) T protein:vir:94 294 EELNRFYLQHTLLPIVKQY----EEEFNR-----KLLTKTDREK---NRYFKFNVKS----YLRAD-SATQAEVYFKAVR 356 (409) T ss_pred HHHHHHHHHHHHHHHHHHH----HHHHHH-----hhCCcccccC---cceEEeechh----hhccC-HHHHHHHHHHHHh Confidence 22223355542 2222222 222333 3344555542 2344554322 22222 2455665544422 Q ss_pred hhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCC-cccccccCCCCcc Q lcl|NC_015287. 474 YVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQD-PNMGGAPQEGGAE 552 (563) Q Consensus 474 ~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~-~~~~~~~~~~~~~ 552 (563) - -+++..-+|. ++.+ +++ |+-+ ..- ..+. ..|.+ ++..+ T Consensus 357 ~--G~~T~NE~R~-~~g~------------------~p~--~ggD---~~~-----~~~n-~~~~~~~~~~~-------- 396 (409) T protein:vir:94 357 S--GYYTINDIRE-WEDL------------------PPV--EGGD---KPL-----ISGD-LYPIDTPLELR-------- 396 (409) T ss_pred C--CCcCHHHHHH-HhCC------------------CCC--CCcC---eEe-----eccc-ccccccchhhc-------- Confidence 2 2344443332 1222 222 1100 000 0000 00111 10000 Q ss_pred cCCCCCCCCCC Q lcl|NC_015287. 553 QAPQMPEGGEI 563 (563) Q Consensus 553 ~~~~~~~~~~~ 563 (563) .-.+|||= T Consensus 397 ---~~~kGG~~ 404 (409) T protein:vir:94 397 ---KSLKGGDK 404 (409) T ss_pred ---ccccCCCC Confidence 01233322 No 99 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=97.82 E-value=1.7e-05 Score=46.73 Aligned_cols=457 Identities=14% Similarity=0.107 Sum_probs=186.8 Q ss_pred ccccccccCcCCCCcccceeccccc--------------cccccccchhh-----hHHHHHHHHHHHhhccchhHHHHHh Q lcl|NC_015287. 16 NKNVVSPVPQSNEDSSDYYVSSGFY--------------GQYVDIDGVFK-----SEFELIKRYREMALHPEVDSAIEDI 76 (563) Q Consensus 16 ~~~~~s~~p~~~~d~~~~~~~~~~~--------------~~~~~~~~~~~-----~~~~LI~~YR~ma~~PEvd~Ai~eI 76 (563) .. +++|-..+...+...-..+. .||-+- ..+. -..++ +.+|... .=+.-+|+.. T Consensus 1 ~~---~~i~~~~~~~~~~~~~~~L~~~~~~~~~r~~~~~~YY~G~-~~i~~~~~~~~~~~-~~~~~~~--n~~~~ivd~~ 73 (485) T protein:vir:24 1 MT---APLPGQEEIADPAIARDEMVSAFEDQNQNLRSNTSYYEAE-RRPEAIGVTVPVQM-QSLLAHV--GYPRLYVDSI 73 (485) T ss_pred CC---CCCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc-CchhhcCcccchhh-hhhhhcc--chHHHHHHHH Confidence 22 33332222211111100000 001000 0000 00010 1111111 1122333332 Q ss_pred hcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCC----CCCcce Q lcl|NC_015287. 77 INEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIK----APEEGI 152 (563) Q Consensus 77 vneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~----~~k~GI 152 (563) +.=. ..+.+.+. + .+.-.+.+..|.+..+|+....++++.-++.||-|.+.-.|.+ -..+|- T Consensus 74 ~~~l-----~~~g~~~~--~-------~~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~ 139 (485) T protein:vir:24 74 AERQ-----AVEGFRLG--D-------ADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNV 139 (485) T ss_pred hhhh-----ccCceecC--C-------CchhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCc Confidence 2200 01112111 1 1123345677777888999999999999999998886544432 123455 Q ss_pred eEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeeccc Q lcl|NC_015287. 153 KEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPD 232 (563) Q Consensus 153 ~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~d 232 (563) ..++.++|+.+-.++- ++.+.....++.... -.......+-+|.+..+...... .|.- .......-.+..= T Consensus 140 ~~i~~~~p~~~~~i~D----~~~~~~~~~~~~~~~--~~~~~~~~~~~y~~~~~~~~~~~--~~~~-~~~~~~~h~~g~v 210 (485) T protein:vir:24 140 PLIRVEPPTRMYAEID----PRIGRPAKAIRVAYD--AEGNEIQAATLYTPNETFGWFRA--EGEW-VEWFSDPHGLGAV 210 (485) T ss_pred ceEEEeccceeEEEee----CCcCceeEEEEEEEe--ecCCeEEEEEEEcCCcEEEEEec--CCce-EeecccccCCCcc Confidence 6788899988765532 111111111110000 00111222233443322221110 0100 0000000001111 Q ss_pred c-eeeeecceecCCCCccchhHHHHHHHH-HH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcc Q lcl|NC_015287. 233 A-ISFCHSGLVDRNKQTILSYLHKSIKAL-NQ-LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRN 309 (563) Q Consensus 233 a-I~y~hSGl~d~~~~~~~syL~~Aik~~-Nq-L~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~kn 309 (563) + |.|.|..-. .+--+.|-|.+.++++ .. -+++-+..++-..+-.|.|-+.=.+....+...- +. T Consensus 211 Pvv~f~n~~~~--~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~-----------~~ 277 (485) T protein:vir:24 211 PVVPLPNRTRL--SDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPE-----------TG 277 (485) T ss_pred cEEEeccCccc--CCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCccccccccc-----------cc Confidence 2 224433211 1112244455444333 22 2556677777777777777554222111111000 00 Q ss_pred eeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCChhhccCCCcceec Q lcl|NC_015287. 310 KLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLG-ELSDIEYFQKKLYRSLGVPESRIAGSGDGFNL 388 (563) Q Consensus 310 klvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLg-ei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~ 388 (563) ..+.+...| ..|+.--+ +.++..++... +. -++-++-.-..+...-++|..-|...+ -|- T Consensus 278 ~~~~~~~~~-------------~i~~~~~~---~~~~~q~~~~~-~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~--~n~ 338 (485) T protein:vir:24 278 QTLFDAYLA-------------RILAFEDA---EGKIQQFSAAE-LANFTNALDQIAKQVAAYTGLPPQYLSTAA--DNP 338 (485) T ss_pred cchhhhccc-------------ceeccCCC---CceEEeecccc-hHHHHHHHHHHHHHHhcccCCCHHHhcccc--Ccc Confidence 111112222 23443222 23355555432 22 122233333344444577777773221 111 Q ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-ChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHH Q lcl|NC_015287. 389 GRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIV-TPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGV 467 (563) Q Consensus 389 g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~-t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~ 467 (563) ..+..|.--+...-.-+.+.|+.|..-+...++.-+.+.+-. ...+| ..|.+.|.....=+. .+.++. T Consensus 339 ~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~d~----~~i~v~f~~~~~~s~-------~~~ad~ 407 (485) T protein:vir:24 339 ASAEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLMKGGDVPPDM----LRMETVWRDPSTPTY-------AAKADA 407 (485) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccc----ceeeEEecCCCCCCH-------HHHHHH Confidence 122334445556777788999999999999988765554422 22233 467888864332222 345555 Q ss_pred HHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccC Q lcl|NC_015287. 468 VAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQ 547 (563) Q Consensus 468 ~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~ 547 (563) +..+..-.-..+|.+++++ .|.+++++++++++..+++...+.... +.. ....+..++.+.. ...++..++++ T Consensus 408 ~~kl~~~g~~~~s~et~~~-~l~~~~d~~~e~~~~~ee~~~~~~~~~----~~~-~~~~~~~~~~~~~-~e~~~~~~~~~ 480 (485) T protein:vir:24 408 ATKLYGNGQGVIPRERARK-DMGYSIAEREEMRRWDEEEAAMGLGLL----GTM-VDADPTVPGSPNP-TPAPKPQPAIE 480 (485) T ss_pred HHHHHhcccccCCHHHHHh-hCCCCHhHHHHHHHHHHHHhhhhhhHH----Hhh-cccCCCCCCCCCC-CCCCCCccCCC Confidence 5555444225789999885 588999998887766555533221110 000 0000001111110 11111111111 Q ss_pred CCCcccC Q lcl|NC_015287. 548 EGGAEQA 554 (563) Q Consensus 548 ~~~~~~~ 554 (563) +++.| T Consensus 481 --~~~~a 485 (485) T protein:vir:24 481 --GGDSA 485 (485) T ss_pred --CCCCC Confidence 11122 No 100 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=97.81 E-value=1.8e-05 Score=46.66 Aligned_cols=403 Identities=9% Similarity=0.032 Sum_probs=171.5 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) ..+||+- .... + .+.+..++ .......|+.+.....+ .....+++|.|..||+-|.+.+ T Consensus 3 ~~~~f~~----~~~~-~---~~~~~~~~-~~~~~~~~~~~~~~~~v------------~~~~al~~~~v~~~i~~Ia~~i 61 (416) T protein:vir:12 3 LERMFEK----RSGS-S---DHEDGFNN-ILLNMFGGRKTASGERV------------SESNSLVQPDIFACVNVLSDDI 61 (416) T ss_pred cchhccc----ccCc-c---ccCccchh-HHHHhhcCcccccCcee------------chhhhhccHHHHHHHHHHHHhh Confidence 3344432 1100 0 00011010 00001111111111111 1234567899999999999876 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHH-H----hcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEE Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKE-I----LNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEV 155 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~-l----l~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eL 155 (563) .-+ |+.+- ..++ ...+.....-.-.+++ - +....=.+.++..+++.|.-|..++.|.. .-+.+| T Consensus 62 a~l-----~~~~~-~~~~--~~~~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~---G~~~~L 130 (416) T protein:vir:12 62 AKL-----PIHTY-KRTD--GGIERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSH---GYPEAL 130 (416) T ss_pred hhC-----ceEEE-EecC--CccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC---CcEEEE Confidence 533 22221 1111 1111111111111111 1 11122234556666778998888776632 228899 Q ss_pred EEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeeccccee Q lcl|NC_015287. 156 RYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAIS 235 (563) Q Consensus 156 r~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~ 235 (563) .+|+|..|..+.. ++. + .-||.+... ...+.++.+.|. T Consensus 131 ~~l~~~~v~v~~~---~~~---~-----------------~~~~~~~~~-------------------g~~~~~~~~eii 168 (416) T protein:vir:12 131 FPLRPDYTNAYVH---PTT---G-----------------MLWYQTVLN-------------------GKAIELYDYEVL 168 (416) T ss_pred EEECCcceEEEEe---CCC---c-----------------EEEEEEecC-------------------CeEEEecCccEE Confidence 9999988765421 110 0 113333211 113556677665 Q ss_pred eeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeC Q lcl|NC_015287. 236 FCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDA 315 (563) Q Consensus 236 y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~ 315 (563) |.. .+..++-.+.|.|+.|.+++......+....=+=-..+.-+-|..++ +.+.+..+++. ++-.++.. T Consensus 169 h~~--~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~-~~~~~~~~------- 237 (416) T protein:vir:12 169 HFK--GLSTDGIHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVP-AFLDEKPKENV-RKEWKRVN------- 237 (416) T ss_pred Eec--CcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecC-CCCCHHHHHHH-HHHHHHHh------- Confidence 554 34556666789999999999988888887665544556667777776 35555544443 33333221 Q ss_pred CCCcccccccccchhhhhcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchh Q lcl|NC_015287. 316 STGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPG-GQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEI 394 (563) Q Consensus 316 ~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpG-g~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eI 394 (563) +.|. .+ .++ .|++++.|.= ...+.-++-.+|....+.++++||.+-|...+++ +. +.+ T Consensus 238 ~~~~------~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~-t~---sn~ 296 (416) T protein:vir:12 238 KVEN------IA-IID----------YGLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKA-TF---SNI 296 (416) T ss_pred cCCC------ee-ecC----------CCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCC-Cc---ccH Confidence 1121 11 221 1455555532 1223334556677889999999999999532211 11 111 Q ss_pred hHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_015287. 395 LRDEIKFTKF-VGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDP 473 (563) Q Consensus 395 tRDElKF~Kf-I~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p 473 (563) .-.-..|..+ |.-+-.++...|. ..| +++.++. ....+.|.-+. +... -...|++.+..+-. T Consensus 297 e~~~~~f~~~~l~P~~~~ie~~l~----~~l-----~~~~~~~---~g~~i~fd~~~----l~~~-d~~~~~~~~~~~~~ 359 (416) T protein:vir:12 297 EHQSIEYVRNTLQPWIVNFEQELN----VKL-----FLDHDQK---SGHYVKFNIDS----ELRG-DSKTQAEYLKTLHE 359 (416) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH----Hhh-----cCchhhc---CCceEEeechh----hhcc-CHHHHHHHHHHHHh Confidence 1121234333 3334333333333 222 3333332 22334444332 2111 12345555544432 Q ss_pred hhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCccc Q lcl|NC_015287. 474 YVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQ 553 (563) Q Consensus 474 ~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~ 553 (563) - -+++.+-+|.. +.+.+ ++..+.--.+ .-..|.+. ....+.+.++. T Consensus 360 ~--G~~T~NE~R~~-~gl~P------------------i~ggd~~~~~-----------~n~~~~~~--~~~~~~~~~~~ 405 (416) T protein:vir:12 360 T--GVLNKDEIREL-LERNP------------------IENGDKYISS-----------LNYVFLDF--LEEYQRLKAGG 405 (416) T ss_pred C--CCcCHHHHHHH-hCCCC------------------CCCcceeeec-----------cccccccc--cchhhcccccc Confidence 2 24444444442 22222 2110000000 00001000 00000000000 Q ss_pred CCCCCCCCCC Q lcl|NC_015287. 554 APQMPEGGEI 563 (563) Q Consensus 554 ~~~~~~~~~~ 563 (563) -.+|||= T Consensus 406 ---~~~gge~ 412 (416) T protein:vir:12 406 ---AMKGGDN 412 (416) T ss_pred ---ccCCCCC Confidence 0111111 No 101 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=97.80 E-value=1.8e-05 Score=46.53 Aligned_cols=443 Identities=13% Similarity=0.088 Sum_probs=180.0 Q ss_pred ccchhhHhhhcccccccccCcCCCCcccceeccc------ccccc-ccccchh---------hhHHHHHHHHHHHhhccc Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSG------FYGQY-VDIDGVF---------KSEFELIKRYREMALHPE 68 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~------~~~~~-~~~~~~~---------~~~~~LI~~YR~ma~~PE 68 (563) +.| .+.||.|.+.---|+.+.++-| +...+ .+.+... ..-.+.+.+|+.+..+.+ T Consensus 1 ~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~ 71 (492) T protein:vir:94 1 MQF---------IQLISQVAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYE 71 (492) T ss_pred ChH---------HHHHHHHHHHHhcCCceeecCccchhhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 112 2233444332222222222111 11111 0111110 011223345555555544 Q ss_pred hhHHH---------------------------HHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcch Q lcl|NC_015287. 69 VDSAI---------------------------EDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFD 121 (563) Q Consensus 69 vd~Ai---------------------------~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~ 121 (563) -+..| ..||+- .+.=--..||.+..++. ...+.++.+++ .+|+ T Consensus 72 g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~-~~~yl~G~p~~~~~~d~--------~~~~~l~~~~~-n~~~ 141 (492) T protein:vir:94 72 QRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQ-KVSYIVGKPIAFKHTDD--------EVVKRIDEVLG-NRFD 141 (492) T ss_pred cccccccccccccccccccccccccccccchHHHHHHH-HHhhhcccCceeccCch--------HHHHHHHHHHh-ccHH Confidence 43211 111221 11111345666655542 23444455554 4789 Q ss_pred hhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccc--ccccccccccee Q lcl|NC_015287. 122 DKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNP--ADIENPEIEEYY 199 (563) Q Consensus 122 ~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~--~~~~~~~~~eyf 199 (563) ....++.+..++.|+-|.+.-+| ++|-..++.+||+.+..|..-.........+........ ..++.+.-..+| T Consensus 142 ~~~~~~~~~a~~~G~a~~~v~~d----~dg~~~~~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~~y~~~~v~~~ 217 (492) T protein:vir:94 142 DKLHSVLTGASNKGIEWLHPYLD----EEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYY 217 (492) T ss_pred HHHHHHHHHHhhCCeEEEEEEec----CCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeeccceeEEEEecCeEEEE Confidence 99999999999999999987665 357788999999998777432111111222222111110 011111111233 Q ss_pred eecCccccccccccccCcCccccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcC Q lcl|NC_015287. 200 QYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLR-MIEDSLVIYRLSRA 278 (563) Q Consensus 200 ~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~-m~EDalVIyRi~RA 278 (563) .+...+..........+ ...+...--+..=+|++.- |+....|=++..+....-+. ++=+....-+.+.. T Consensus 218 ~~~~~~~~~~~~~~~~~---~~~~~~~~~~g~vPvv~~~------nn~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~ 288 (492) T protein:vir:94 218 VYENGSLIPDYSNNLEN---SKTHFSTGSWGKIPFIPFK------NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNE 288 (492) T ss_pred EEecCeeeecccccccc---ccccccccCCCccceEEec------CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 33221111100000000 0000000001111233221 12223455555444444333 23444445566666 Q ss_pred ccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchH Q lcl|NC_015287. 279 PERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGEL 358 (563) Q Consensus 279 PeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei 358 (563) |-+-+.-.+..+.+. . ...+..++..--++ +..+.+|-...+.+.+ T Consensus 289 p~lv~~g~~~~~~~~-----~---------------------------~~~~~~~~~~~~~~--~~~~~~l~~~~~~~~~ 334 (492) T protein:vir:94 289 LTYVLKNYDDQELPE-----F---------------------------KRLLRYYGAIKVSD--NGGVDTIQVEVPVENS 334 (492) T ss_pred ceeeeecCCcccchh-----h---------------------------HHHHhhccceecCC--CCcceeEeccCCHHHH Confidence 655443333222111 0 11111111111112 2235555443333332 Q ss_pred -HHHHHHHHHHHHhcCCChhhccCCCcceeccccch-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHH Q lcl|NC_015287. 359 -SDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSE-ILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWE 436 (563) Q Consensus 359 -~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~e-ItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~ 436 (563) .-+.-+.+.+|+-.++|- +..+ .|.-+.|++ |..-+.....-+.+.++.|..-+.++++.=+-+-|+ ..+|. T Consensus 335 ~~~~~~l~~~I~~~s~~p~--~~~~--~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~--~~~~~ 408 (492) T protein:vir:94 335 KKYLDELYQKIMLFGQAVD--FSSD--KFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI--KGEHK 408 (492) T ss_pred HHHHHHHHHHHHHHhCCcC--CCcc--ccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--Ccccc Confidence 344667778888889884 2222 222112222 222223344556777777777777776653333343 33454 Q ss_pred HhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCc Q lcl|NC_015287. 437 VLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPM 516 (563) Q Consensus 437 ~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~ 516 (563) .|.+.|....--.+ .+.++++..+ +| .+|.+++++.+-..+| .+++-++|++|..+..-..++ T Consensus 409 ----~i~v~f~~~~p~~~-------~e~~~~~~kl---~g-iiS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~~ 471 (492) T protein:vir:94 409 ----DVDISFNYNKVANT-------ELQVQTAQQS---MG-IVSHETVLENHPFVED--LQAELERIEQEQMEYNKQLPN 471 (492) T ss_pred ----eeeEEecCCCCCCH-------HHHHHHHHHH---hc-cCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhccc Confidence 46677754333222 2334555555 35 4899999987754443 333444444443221111111 Q ss_pred cc-CccccccccccCCCCcCCCCCcccc Q lcl|NC_015287. 517 EV-QQLQLGVHPEQMPGGAMNPQDPNMG 543 (563) Q Consensus 517 ~~-~~~~~~~~~~~~~~~~~~p~~~~~~ 543 (563) .. ..++ .+.....+++.+.. T Consensus 472 ~~~~~~~-------~~~~~~~~~~~e~e 492 (492) T protein:vir:94 472 LDDGGAD-------SAQQQERSNNKESE 492 (492) T ss_pred cccccCC-------CCccccCCccccCC Confidence 10 0000 00111112222111 No 102 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=97.78 E-value=2e-05 Score=46.35 Aligned_cols=411 Identities=12% Similarity=0.101 Sum_probs=176.8 Q ss_pred ccccc-----chhhHhhhcccccccccCcC-----------CCCccc-ceeccccccccccccchhhhHHHHHHHHH--H Q lcl|NC_015287. 2 AKLFG-----FKIQKDDEQNKNVVSPVPQS-----------NEDSSD-YYVSSGFYGQYVDIDGVFKSEFELIKRYR--E 62 (563) Q Consensus 2 ~~~~~-----f~~~~~~~~~~~~~s~~p~~-----------~~d~~~-~~~~~~~~~~~~~~~~~~~~~~~LI~~YR--~ 62 (563) -+|.. .+||-.++-.+ ..-++... ..+.+. ....+++.+... ..|. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~ 66 (441) T protein:vir:94 1 MHWYNTDCYFVDFKSRKQSRK-ELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKL-------------RQYKDIE 66 (441) T ss_pred CccccCccccccccccccchh-hhhccccccccccccccCCCcchHHHHHHhcccCcccc-------------cccchhh Confidence 12221 12222211111 00011111 111100 001111111110 1121 2 Q ss_pred HhhccchhHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchh----h----hHHHHHHHhhh Q lcl|NC_015287. 63 MALHPEVDSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDD----K----CHEILRNWYVD 134 (563) Q Consensus 63 ma~~PEvd~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~----~----~~~~~R~WYvD 134 (563) -+++|.|..||+-|.+.+.-. |+.+- ++.+.. .-+.++.+|+-+- . -+.++..+... T Consensus 67 al~~~~V~~cv~~Ia~~iA~l-----p~~~~-~~~~~~---------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~ 131 (441) T protein:vir:94 67 AIRHSDIFTAVMMIASDLARM-----PIRVT-VNGQIN---------YSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLT 131 (441) T ss_pred hhccHHHHHHHHHHHHhhccC-----ceeee-cCcccc---------ccchHHHHHhcccCcCCCHHHHHHHHHHHHhhc Confidence 356888999999998866543 44432 222211 1233445554322 2 34455566788 Q ss_pred hhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccc Q lcl|NC_015287. 135 GRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGA 214 (563) Q Consensus 135 Gri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~ 214 (563) |.-|..++.|.. .=+++|.+|+|..|..++ +.. +. ..|+.+...+ . T Consensus 132 Gnay~~i~r~~~---G~~~~L~~i~~~~v~v~~----d~~--g~-----------------~~~~~~~~~~--------~ 177 (441) T protein:vir:94 132 SHGYIEITRDKT---GEPMNLTFRKTSEIELKS----DAR--GR-----------------LYYFHQRIDS--------N 177 (441) T ss_pred CCeEEEEEECCC---CcEEEEEEEcCceeEEEE----CCC--cc-----------------EEEEEEEecc--------C Confidence 999998877632 117899999998876442 111 11 1122221100 0 Q ss_pred cCcCccccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchH Q lcl|NC_015287. 215 IGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKM 294 (563) Q Consensus 215 ~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~ 294 (563) + ......++.+.|.|.. .++.++-.+.|-|+.|++++.....+++...=+=---|--+-|..++ |.+... T Consensus 178 ~-------~~~~~~~~~~dvih~k--~~~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~ 247 (441) T protein:vir:94 178 G-------NNIERNVKFEDMLDIK--FYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNK 247 (441) T ss_pred C-------ceeEEEEccccEEEec--cCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC-CCCCCH Confidence 0 0112345555553332 24556666789999999999888888777654433445566777777 455434 Q ss_pred HHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCC-CcchHHHHHHHHHHHHHhcC Q lcl|NC_015287. 295 KAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQ-NLGELSDIEYFQKKLYRSLG 373 (563) Q Consensus 295 KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~-nLgei~DV~YF~~kLy~aL~ 373 (563) +|.+=+++-+++.- +| ..+..+.+ .++ .|.+++.|.=.. .+.=++-.++..+.+.++++ T Consensus 248 e~~e~~r~~~~~~~--------~G-~~nag~~~-vl~----------~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fg 307 (441) T protein:vir:94 248 KARDRAREEFHKSF--------SG-TKQAGKVV-VLD----------ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFG 307 (441) T ss_pred HHHHHHHHHHHHHh--------cC-ccccCcce-ecC----------CCceEEEccCChhHHHHHHHHHHhHHHHHHHhC Confidence 44333333222211 12 22222222 222 145555553211 22223445677888999999 Q ss_pred CChhhccCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHH Q lcl|NC_015287. 374 VPESRIAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFA 453 (563) Q Consensus 374 VP~sRl~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ 453 (563) ||.+.|..+..++ .+.-..+-|..-+.-+-.++...|.. .|. ++ +. ...+.|.-+. T Consensus 308 VPp~~lg~~~~~~------s~~q~~~~~~~tl~P~~~~ie~eln~----kl~-----~~--~~----~~~~~fd~~~--- 363 (441) T protein:vir:94 308 IPLHKFGIETANM------SITDANLDYLSTLKPYITCVCAELNF----KFN-----DE--YV----NREFKFDTTE--- 363 (441) T ss_pred CCHHHcCCCCCCc------cHHHHHHHHHHHHHHHHHHHHHHHhh----hcc-----cc--cc----CceEEeechh--- Confidence 9999995432222 12333343444444444444333332 221 21 21 2334443322 Q ss_pred HHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCC Q lcl|NC_015287. 454 ELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGG 533 (563) Q Consensus 454 ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~ 533 (563) +.... ...|++.++.+-.- -+ +|..|+- +...-++++.++...-.-+. T Consensus 364 -llr~D-~~~~~~~~~~~i~~--G~------------~T~NE~R-------~~~gl~Pi~ggd~~~~~~~~--------- 411 (441) T protein:vir:94 364 -IRVVD-EKTQAEIDKINIDS--GK------------MNIDEIR-------QRDGLAPIPGGNGSIHRVDL--------- 411 (441) T ss_pred -hhccC-HHHHHHHHHHHHhC--CC------------cCHHHHH-------HHhCCCCCCCCCcceEeecc--------- Confidence 21111 23444444333211 12 4443332 12333444333221100000 Q ss_pred cCCCCCcccccccCCCCcccCCCCCCCCCC Q lcl|NC_015287. 534 AMNPQDPNMGGAPQEGGAEQAPQMPEGGEI 563 (563) Q Consensus 534 ~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 563 (563) -.-|.+. ....+.....+...-.+|||- T Consensus 412 n~~~~~~--~~~~~~~~~~~~~~~~kgGe~ 439 (441) T protein:vir:94 412 NHVNIEL--VDEYQMNKSRATDKKLKGGEE 439 (441) T ss_pred ccccccc--ccccccccccccccccCCCCC Confidence 0001111 111112222233344667776 No 103 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=97.78 E-value=2e-05 Score=46.35 Aligned_cols=411 Identities=12% Similarity=0.101 Sum_probs=176.8 Q ss_pred ccccc-----chhhHhhhcccccccccCcC-----------CCCccc-ceeccccccccccccchhhhHHHHHHHHH--H Q lcl|NC_015287. 2 AKLFG-----FKIQKDDEQNKNVVSPVPQS-----------NEDSSD-YYVSSGFYGQYVDIDGVFKSEFELIKRYR--E 62 (563) Q Consensus 2 ~~~~~-----f~~~~~~~~~~~~~s~~p~~-----------~~d~~~-~~~~~~~~~~~~~~~~~~~~~~~LI~~YR--~ 62 (563) -+|.. .+||-.++-.+ ..-++... ..+.+. ....+++.+... ..|. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~ 66 (441) T protein:vir:79 1 MHWYNTDCYFVDFKSRKQSRK-ELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKL-------------RQYKDIE 66 (441) T ss_pred CccccCccccccccccccchh-hhhccccccccccccccCCCcchHHHHHHhcccCcccc-------------cccchhh Confidence 12221 12222211111 00011111 111100 001111111110 1121 2 Q ss_pred HhhccchhHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchh----h----hHHHHHHHhhh Q lcl|NC_015287. 63 MALHPEVDSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDD----K----CHEILRNWYVD 134 (563) Q Consensus 63 ma~~PEvd~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~----~----~~~~~R~WYvD 134 (563) -+++|.|..||+-|.+.+.-. |+.+- ++.+.. .-+.++.+|+-+- . -+.++..+... T Consensus 67 al~~~~V~~cv~~Ia~~iA~l-----p~~~~-~~~~~~---------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~ 131 (441) T protein:vir:79 67 AIRHSDIFTAVMMIASDLARM-----PIRVT-VNGQIN---------YSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLT 131 (441) T ss_pred hhccHHHHHHHHHHHHhhccC-----ceeee-cCcccc---------ccchHHHHHhcccCcCCCHHHHHHHHHHHHhhc Confidence 356888999999998866543 44432 222211 1233445554322 2 34455566788 Q ss_pred hhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccc Q lcl|NC_015287. 135 GRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGA 214 (563) Q Consensus 135 Gri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~ 214 (563) |.-|..++.|.. .=+++|.+|+|..|..++ +.. +. ..|+.+...+ . T Consensus 132 Gnay~~i~r~~~---G~~~~L~~i~~~~v~v~~----d~~--g~-----------------~~~~~~~~~~--------~ 177 (441) T protein:vir:79 132 SHGYIEITRDKT---GEPMNLTFRKTSEIELKS----DAR--GR-----------------LYYFHQRIDS--------N 177 (441) T ss_pred CCeEEEEEECCC---CcEEEEEEEcCceeEEEE----CCC--cc-----------------EEEEEEEecc--------C Confidence 999998877632 117899999998876442 111 11 1122221100 0 Q ss_pred cCcCccccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchH Q lcl|NC_015287. 215 IGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKM 294 (563) Q Consensus 215 ~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~ 294 (563) + ......++.+.|.|.. .++.++-.+.|-|+.|++++.....+++...=+=---|--+-|..++ |.+... T Consensus 178 ~-------~~~~~~~~~~dvih~k--~~~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~ 247 (441) T protein:vir:79 178 G-------NNIERNVKFEDMLDIK--FYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNK 247 (441) T ss_pred C-------ceeEEEEccccEEEec--cCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC-CCCCCH Confidence 0 0112345555553332 24556666789999999999888888777654433445566777777 455434 Q ss_pred HHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCC-CcchHHHHHHHHHHHHHhcC Q lcl|NC_015287. 295 KAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQ-NLGELSDIEYFQKKLYRSLG 373 (563) Q Consensus 295 KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~-nLgei~DV~YF~~kLy~aL~ 373 (563) +|.+=+++-+++.- +| ..+..+.+ .++ .|.+++.|.=.. .+.=++-.++..+.+.++++ T Consensus 248 e~~e~~r~~~~~~~--------~G-~~nag~~~-vl~----------~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fg 307 (441) T protein:vir:79 248 KARDRAREEFHKSF--------SG-TKQAGKVV-VLD----------ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFG 307 (441) T ss_pred HHHHHHHHHHHHHh--------cC-ccccCcce-ecC----------CCceEEEccCChhHHHHHHHHHHhHHHHHHHhC Confidence 44333333222211 12 22222222 222 145555553211 22223445677888999999 Q ss_pred CChhhccCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHH Q lcl|NC_015287. 374 VPESRIAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFA 453 (563) Q Consensus 374 VP~sRl~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ 453 (563) ||.+.|..+..++ .+.-..+-|..-+.-+-.++...|.. .|. ++ +. ...+.|.-+. T Consensus 308 VPp~~lg~~~~~~------s~~q~~~~~~~tl~P~~~~ie~eln~----kl~-----~~--~~----~~~~~fd~~~--- 363 (441) T protein:vir:79 308 IPLHKFGIETANM------SITDANLDYLSTLKPYITCVCAELNF----KFN-----DE--YV----NREFKFDTTE--- 363 (441) T ss_pred CCHHHcCCCCCCc------cHHHHHHHHHHHHHHHHHHHHHHHhh----hcc-----cc--cc----CceEEeechh--- Confidence 9999995432222 12333343444444444444333332 221 21 21 2334443322 Q ss_pred HHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCC Q lcl|NC_015287. 454 ELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGG 533 (563) Q Consensus 454 ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~ 533 (563) +.... ...|++.++.+-.- -+ +|..|+- +...-++++.++...-.-+. T Consensus 364 -llr~D-~~~~~~~~~~~i~~--G~------------~T~NE~R-------~~~gl~Pi~ggd~~~~~~~~--------- 411 (441) T protein:vir:79 364 -IRVVD-EKTQAEIDKINIDS--GK------------MNIDEIR-------QRDGLAPIPGGNGSIHRVDL--------- 411 (441) T ss_pred -hhccC-HHHHHHHHHHHHhC--CC------------cCHHHHH-------HHhCCCCCCCCCcceEeecc--------- Confidence 21111 23444444333211 12 4443332 12333444333221100000 Q ss_pred cCCCCCcccccccCCCCcccCCCCCCCCCC Q lcl|NC_015287. 534 AMNPQDPNMGGAPQEGGAEQAPQMPEGGEI 563 (563) Q Consensus 534 ~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 563 (563) -.-|.+. ....+.....+...-.+|||- T Consensus 412 n~~~~~~--~~~~~~~~~~~~~~~~kgGe~ 439 (441) T protein:vir:79 412 NHVNIEL--VDEYQMNKSRATDKKLKGGEE 439 (441) T ss_pred ccccccc--ccccccccccccccccCCCCC Confidence 0001111 111112222233344667776 No 104 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=97.75 E-value=2.2e-05 Score=46.11 Aligned_cols=447 Identities=15% Similarity=0.137 Sum_probs=199.1 Q ss_pred Ccc------cccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhh--ccc---- Q lcl|NC_015287. 1 MAK------LFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMAL--HPE---- 68 (563) Q Consensus 1 m~~------~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~--~PE---- 68 (563) |.= ||+--..+...+....|.--|+... -......|+..+.|.. +|+ T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~---------------------~~~~~~~i~~~~~~y~g~~~~~~~~ 59 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAV---------------------TQEEYDRIKRNLVYYQSKWDDVQYK 59 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCC---------------------CHHHHHHHHHHHHHhcCCccccccc Confidence 211 1111111111111111110010000 0122223333333321 111 Q ss_pred ----------------hhHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHh Q lcl|NC_015287. 69 ----------------VDSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWY 132 (563) Q Consensus 69 ----------------vd~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WY 132 (563) -..++.+.++ +|+. .|+++.+++ +...+.++.++.-.+|.....+.+-.+. T Consensus 60 ~~~~~~~~~~~~slnl~~~i~~~~A~--lv~~---e~~~i~v~d--------~~~~~~l~~~l~~n~f~~~~~~~~e~a~ 126 (522) T protein:vir:47 60 NTDGDIKSRPMNHLPIARTASKKIAS--LVYN---EQATITTKN--------EILQKFLDDMLTNDRFNKNFERYLESCL 126 (522) T ss_pred ccCcchhcccceecchHHHHHHHHhh--hhcC---CcceeecCC--------hHHHHHHHHHHhhcchHHHHHHHHHHhh Confidence 1233333333 2222 355555553 3677788888888899999999999999 Q ss_pred hhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCcccccee---eecccccccccccccceeeecCc----- Q lcl|NC_015287. 133 VDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQ---VNSKNPADIENPEIEEYYQYDPS----- 204 (563) Q Consensus 133 vDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~---~~~~~~~~~~~~~~~eyf~y~~~----- 204 (563) .-|-.+|...+|. |-+.+.+++|.++-+++....... ..+.+ +.+......+|- .-||-.+... T Consensus 127 a~G~~a~k~~~d~-----~~~~i~~v~ad~~~P~~~~~~~~~--e~a~~~~~~~~~~~~~~~yt-~lE~he~~~~~~~~~ 198 (522) T protein:vir:47 127 ALGGLAMRPYIDG-----DKVRVAFIQAPVFFPLESNTQDVS--SAAILTKTIKSEGRKNVYYT-LVEFHEWVTADGQET 198 (522) T ss_pred ccCCEEEEEEEcC-----CceEEEEEcCCceEEEEEcCCceE--EEEEEEEEEeecccceeEEE-EEEEeeecccccccc Confidence 9999999999983 345789999999888743211110 01111 111111111111 0111111000 Q ss_pred --------ccccccc-----ccccCc-CccccCccee-eec--ccceeeeec-----ce--------ecCCCCccchhHH Q lcl|NC_015287. 205 --------QTQGKNA-----LGAIGQ-TPFSTKQRPV-KIA--PDAISFCHS-----GL--------VDRNKQTILSYLH 254 (563) Q Consensus 205 --------~~~~~~~-----~g~~g~-~~~~~~~~~i-ki~--~daI~y~hS-----Gl--------~d~~~~~~~syL~ 254 (563) +...+.. ....|. .+.. .+ .|+ .+.+++.+- +. .++.+..++|-++ T Consensus 199 ~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~----~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~ 274 (522) T protein:vir:47 199 GSTNDKKYYRITNELYRSDVNDVLGQRVNLS----ELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFD 274 (522) T ss_pred cccccCCceEEEEEEeecCCCcccCcccccc----ccccccCCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhh Confidence 0000000 000010 0000 00 000 122222211 01 1223445678888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhc Q lcl|NC_015287. 255 KSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFW 334 (563) Q Consensus 255 ~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDyw 334 (563) .|+-.+--|-..=+ .+.|.+|.=++|||. |-.=++ ...+..+|+...-..+- --+.+| T Consensus 275 ~~~~~id~lD~~~s--~~~~e~~~g~~~i~v-~~~~l~------------------~~~~~~~g~~~~~~~fd-~~~~~f 332 (522) T protein:vir:47 275 NAKTTIDFINRSYD--EFMWEVRMGQRRVIV-PEHLTQ------------------RQYQRPDGTIDFRPRFD-VEQNVY 332 (522) T ss_pred hhHHHHHHHHHHHH--HHHHHHHhccceeec-chHHhc------------------cCCCCCCcccccccccC-cccceE Confidence 88887777664444 456778887888775 111110 11122333211000000 001112 Q ss_pred ccccCC-CCccceeecCCCCCcchHH-HHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 335 LPRREG-GRGTEITTLPGGQNLGELS-DIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRF 412 (563) Q Consensus 335 LpRReG-grgTEIsTLpGg~nLgei~-DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rF 412 (563) .+-..+ +-|--|+++...---++.. =+..+.+.+=...+++-+-|..+++| .--++||...+-.-..-+.+.|+.+ T Consensus 333 ~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~--~kTAtEi~s~~~~~~~t~~~~~~~~ 410 (522) T protein:vir:47 333 MQIGGSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQG--MKTATEIVSENSDTYQMRSSIVALV 410 (522) T ss_pred eecCCCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccccc--cccHHHHHHHHHHHHHHHHHHHHHH Confidence 111110 0111255554432222211 24555566666677777777554332 2335676665555555677777777 Q ss_pred HHHHHHHHHHHH-------HhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHH Q lcl|NC_015287. 413 SHLFNDMLKTQL-------ILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVR 485 (563) Q Consensus 413 s~if~d~Lk~qL-------iLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~ 485 (563) ...+.++++.=| ++.|.... + ..|.++|. |+.+.. +++++-+.+ .+++ .| .+|..+.+ T Consensus 411 ~~al~~lv~~i~~l~~~~~~~~~~~~~-~-----~~i~v~f~-D~i~~D-~~~~~~~~~-~~v~-----aG-~~s~e~~i 475 (522) T protein:vir:47 411 EQSIKELCVSMCELGKAVGVYSGEIPE-L-----DDISVNLD-DGVFTD-RHAELDYWA-KMVA-----AG-FSTKKRAI 475 (522) T ss_pred HHHHHHHHHHHHHHHhhhhhccCCCCC-c-----ceeEEEcC-CCCCCC-HHHHHHHHH-HHHh-----cC-CCCHHHHH Confidence 777777766644 23332222 1 34778887 555554 222222211 2111 13 68888876 Q ss_pred HHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCC Q lcl|NC_015287. 486 RKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMN 536 (563) Q Consensus 486 k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~ 536 (563) .+....+|+|.+++-++|++|.... .|. +..--++..+++..++++- T Consensus 476 ~~~~g~~eeea~~el~ri~~E~~~~---~~~-~~~~~~~~~~~~~~~d~~~ 522 (522) T protein:vir:47 476 GKTLNISGVEAEKELNAINSELLPM---NDA-ELAIYGMHDQNEEKADDKG 522 (522) T ss_pred HhcCCCChHHHHHHHHHHHHhhccC---CCC-CCCCCCCCCcccccCCCCC Confidence 7778899999999999998885431 110 1001111111222222221 No 105 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=97.74 E-value=2.3e-05 Score=46.03 Aligned_cols=436 Identities=12% Similarity=0.104 Sum_probs=171.0 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceec-cccccccc-c-ccchhhhHHHHHHHHHHHhhccchhHHH---- Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVS-SGFYGQYV-D-IDGVFKSEFELIKRYREMALHPEVDSAI---- 73 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~-~~~~~~~~-~-~~~~~~~~~~LI~~YR~ma~~PEvd~Ai---- 73 (563) |-..--..+ +..-+..... -.+-.... . +..-+..-.+.+.+|..+..+.+-+..| T Consensus 1 ~~~~~~~~~-----------------~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~ 63 (474) T protein:vir:96 1 MINIIRMPW-----------------DKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQA 63 (474) T ss_pred CcccccCCC-----------------CCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccc Confidence 221111111 1111111100 00000000 0 0011111123344555555444433211 Q ss_pred -----------------------HHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHH Q lcl|NC_015287. 74 -----------------------EDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRN 130 (563) Q Consensus 74 -----------------------~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~ 130 (563) .-||+- .+.=--..||.+..++. ...+.++.+++ -+|+....++.+. T Consensus 64 ~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~-~~~yl~g~p~~~~~~~~--------~~~~~l~~~~~-n~~~~~~~~l~~~ 133 (474) T protein:vir:96 64 YKQDLHGNIDYTKPDWRITTNFHQNLVDQ-KVSYVAGKPVTYAHDDD--------KVLDVIHQVLD-TRWDNKLIDILTA 133 (474) T ss_pred chhhhcccccccccccccccchHHHHHHh-hhhhhcccCceeccCCh--------HHHHHHHHHHh-ccHHHHHHHHHHH Confidence 122221 11112345666665542 23344455554 4799999999999 Q ss_pred HhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeeccccc--ccccccccceeeecCccccc Q lcl|NC_015287. 131 WYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPA--DIENPEIEEYYQYDPSQTQG 208 (563) Q Consensus 131 WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~--~~~~~~~~eyf~y~~~~~~~ 208 (563) +++.|+-|.+.-+| ++|-..+..+||+.+..|..-.........+...+.+... ..+.+.-..+|.+...+... T Consensus 134 ~~~~G~~~~~~~~d----~~~~~~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~ 209 (474) T protein:vir:96 134 ASNKGIDWLQVYIN----EDGELKLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIP 209 (474) T ss_pred HhhCCeEEEEeeeC----CCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceee Confidence 99999999987666 3577889999999988874322211111122221111100 01111112222222211111 Q ss_pred cccccccCcCccccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEcc Q lcl|NC_015287. 209 KNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLR-MIEDSLVIYRLSRAPERRIFYID 287 (563) Q Consensus 209 ~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~-m~EDalVIyRi~RAPeRRvFyID 287 (563) ....+..+ .......--+..=+|++. . ++....|=|+..+.....+. ++-+....-+-++.|-+-+.-.+ T Consensus 210 ~~~~~~~~---~~~~~~~~~~~~vPvv~~-----~-nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~ 280 (474) T protein:vir:96 210 DFYYGDEH---IQTHFSTGSWERVPFIAF-----K-NNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYE 280 (474) T ss_pred cccccccc---ccCcccccCCCccceEEe-----c-CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCC Confidence 00000000 000000000111122211 0 12223455666555555544 33333434455566654333221 Q ss_pred CCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHH Q lcl|NC_015287. 288 VGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDIEYFQK 366 (563) Q Consensus 288 vGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~ 366 (563) .-++. +.+.. |..+ +++ |++ ++ -.+..|-...+.+. -.-+.-+.+ T Consensus 281 ~~~~~-----~~~~~-~~~~--~~i---------------------~~~---~~--~~~~~l~~~~~~~~~~~~~~~l~~ 326 (474) T protein:vir:96 281 GEDLS-----EFMEG-LKYY--KAI---------------------NVS---SD--GGVETIQVEVPVASTKEYLDMMRA 326 (474) T ss_pred ccccc-----chhhh-hhcc--cee---------------------ecc---CC--CceeEEeccCCHHHHHHHHHHHHH Confidence 11111 11111 1111 111 121 11 12444433223222 133555667 Q ss_pred HHHHhcCCChhhccCCCcceeccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEE Q lcl|NC_015287. 367 KLYRSLGVPESRIAGSGDGFNLGRSSEIL--RDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQY 444 (563) Q Consensus 367 kLy~aL~VP~sRl~~~~~g~~~g~~~eIt--RDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~ 444 (563) .+|...++|---. + +|. |..|... .-......-+.+.+..|...+.++|+.=+-+-|+. .+| ..|.+ T Consensus 327 ~I~~~s~~p~~~~--~--~~~-~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~--~d~----~~i~i 395 (474) T protein:vir:96 327 YIVEFGQGVDFQT--D--KFG-SATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIK--LDA----KEIEI 395 (474) T ss_pred HHHHHhCCcCccc--c--ccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--ccc----ceeeE Confidence 7899999984222 2 222 2222222 11111334566677777777777776544444542 233 45778 Q ss_pred EeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcC--CCCCCCcccCccc Q lcl|NC_015287. 445 DFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKD--GKLADPMEVQQLQ 522 (563) Q Consensus 445 ~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~--~~~~~p~~~~~~~ 522 (563) .|....--.++. .++++.++ | .+|.++++..+-..+| .+++-++|++|..+ ...+.+ .. T Consensus 396 ~f~~~~p~~~~e-------~a~~~~~~----g-iiS~et~~~~lp~v~D--~~~E~eri~~E~~~~~~~~~~~-----~~ 456 (474) T protein:vir:96 396 TFNFNVMVNDLE-------QSQIGAQS----Q-YLSKETLVRHHPWVDD--PKAELERLDEEQLELNKQLPNL-----DD 456 (474) T ss_pred EecCCCccCHHH-------HHHHHHHc----C-CCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhcccc-----cc Confidence 886544333322 23334332 3 5899999977654443 33444555555332 111100 00 Q ss_pred cccccccCCCCcCCCCCcccccccCCCCccc Q lcl|NC_015287. 523 LGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQ 553 (563) Q Consensus 523 ~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~ 553 (563) . ..+...++..+..++ .+ T Consensus 457 ~----------~~~~~~~~~~~~~~e---~~ 474 (474) T protein:vir:96 457 G----------GADGAQQQQQSENNQ---SK 474 (474) T ss_pred c----------cCCCCCCcCCCCccc---cC Confidence 0 000000000000000 00 No 106 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=97.74 E-value=2.3e-05 Score=46.03 Aligned_cols=436 Identities=12% Similarity=0.104 Sum_probs=171.0 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceec-cccccccc-c-ccchhhhHHHHHHHHHHHhhccchhHHH---- Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVS-SGFYGQYV-D-IDGVFKSEFELIKRYREMALHPEVDSAI---- 73 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~-~~~~~~~~-~-~~~~~~~~~~LI~~YR~ma~~PEvd~Ai---- 73 (563) |-..--..+ +..-+..... -.+-.... . +..-+..-.+.+.+|..+..+.+-+..| T Consensus 1 ~~~~~~~~~-----------------~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~ 63 (474) T protein:vir:95 1 MINIIRMPW-----------------DKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQA 63 (474) T ss_pred CcccccCCC-----------------CCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccc Confidence 221111111 1111111100 00000000 0 0011111123344555555444433211 Q ss_pred -----------------------HHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHH Q lcl|NC_015287. 74 -----------------------EDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRN 130 (563) Q Consensus 74 -----------------------~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~ 130 (563) .-||+- .+.=--..||.+..++. ...+.++.+++ -+|+....++.+. T Consensus 64 ~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~-~~~yl~g~p~~~~~~~~--------~~~~~l~~~~~-n~~~~~~~~l~~~ 133 (474) T protein:vir:95 64 YKQDLHGNIDYTKPDWRITTNFHQNLVDQ-KVSYVAGKPVTYAHDDD--------KVLDVIHQVLD-TRWDNKLIDILTA 133 (474) T ss_pred chhhhcccccccccccccccchHHHHHHh-hhhhhcccCceeccCCh--------HHHHHHHHHHh-ccHHHHHHHHHHH Confidence 122221 11112345666665542 23344455554 4799999999999 Q ss_pred HhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeeccccc--ccccccccceeeecCccccc Q lcl|NC_015287. 131 WYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPA--DIENPEIEEYYQYDPSQTQG 208 (563) Q Consensus 131 WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~--~~~~~~~~eyf~y~~~~~~~ 208 (563) +++.|+-|.+.-+| ++|-..+..+||+.+..|..-.........+...+.+... ..+.+.-..+|.+...+... T Consensus 134 ~~~~G~~~~~~~~d----~~~~~~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~ 209 (474) T protein:vir:95 134 ASNKGIDWLQVYIN----EDGELKLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIP 209 (474) T ss_pred HhhCCeEEEEeeeC----CCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceee Confidence 99999999987666 3577889999999988874322211111122221111100 01111112222222211111 Q ss_pred cccccccCcCccccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEcc Q lcl|NC_015287. 209 KNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLR-MIEDSLVIYRLSRAPERRIFYID 287 (563) Q Consensus 209 ~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~-m~EDalVIyRi~RAPeRRvFyID 287 (563) ....+..+ .......--+..=+|++. . ++....|=|+..+.....+. ++-+....-+-++.|-+-+.-.+ T Consensus 210 ~~~~~~~~---~~~~~~~~~~~~vPvv~~-----~-nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~ 280 (474) T protein:vir:95 210 DFYYGDEH---IQTHFSTGSWERVPFIAF-----K-NNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYE 280 (474) T ss_pred cccccccc---ccCcccccCCCccceEEe-----c-CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCC Confidence 00000000 000000000111122211 0 12223455666555555544 33333434455566654333221 Q ss_pred CCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHH Q lcl|NC_015287. 288 VGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDIEYFQK 366 (563) Q Consensus 288 vGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~ 366 (563) .-++. +.+.. |..+ +++ |++ ++ -.+..|-...+.+. -.-+.-+.+ T Consensus 281 ~~~~~-----~~~~~-~~~~--~~i---------------------~~~---~~--~~~~~l~~~~~~~~~~~~~~~l~~ 326 (474) T protein:vir:95 281 GEDLS-----EFMEG-LKYY--KAI---------------------NVS---SD--GGVETIQVEVPVASTKEYLDMMRA 326 (474) T ss_pred ccccc-----chhhh-hhcc--cee---------------------ecc---CC--CceeEEeccCCHHHHHHHHHHHHH Confidence 11111 11111 1111 111 121 11 12444433223222 133555667 Q ss_pred HHHHhcCCChhhccCCCcceeccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEE Q lcl|NC_015287. 367 KLYRSLGVPESRIAGSGDGFNLGRSSEIL--RDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQY 444 (563) Q Consensus 367 kLy~aL~VP~sRl~~~~~g~~~g~~~eIt--RDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~ 444 (563) .+|...++|---. + +|. |..|... .-......-+.+.+..|...+.++|+.=+-+-|+. .+| ..|.+ T Consensus 327 ~I~~~s~~p~~~~--~--~~~-~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~--~d~----~~i~i 395 (474) T protein:vir:95 327 YIVEFGQGVDFQT--D--KFG-SATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIK--LDA----KEIEI 395 (474) T ss_pred HHHHHhCCcCccc--c--ccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--ccc----ceeeE Confidence 7899999984222 2 222 2222222 11111334566677777777777776544444542 233 45778 Q ss_pred EeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcC--CCCCCCcccCccc Q lcl|NC_015287. 445 DFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKD--GKLADPMEVQQLQ 522 (563) Q Consensus 445 ~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~--~~~~~p~~~~~~~ 522 (563) .|....--.++. .++++.++ | .+|.++++..+-..+| .+++-++|++|..+ ...+.+ .. T Consensus 396 ~f~~~~p~~~~e-------~a~~~~~~----g-iiS~et~~~~lp~v~D--~~~E~eri~~E~~~~~~~~~~~-----~~ 456 (474) T protein:vir:95 396 TFNFNVMVNDLE-------QSQIGAQS----Q-YLSKETLVRHHPWVDD--PKAELERLDEEQLELNKQLPNL-----DD 456 (474) T ss_pred EecCCCccCHHH-------HHHHHHHc----C-CCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhcccc-----cc Confidence 886544333322 23334332 3 5899999977654443 33444555555332 111100 00 Q ss_pred cccccccCCCCcCCCCCcccccccCCCCccc Q lcl|NC_015287. 523 LGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQ 553 (563) Q Consensus 523 ~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~ 553 (563) . ..+...++..+..++ .+ T Consensus 457 ~----------~~~~~~~~~~~~~~e---~~ 474 (474) T protein:vir:95 457 G----------GADGAQQQQQSENNQ---SK 474 (474) T ss_pred c----------cCCCCCCcCCCCccc---cC Confidence 0 000000000000000 00 No 107 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=97.71 E-value=2.6e-05 Score=45.72 Aligned_cols=443 Identities=11% Similarity=0.105 Sum_probs=171.2 Q ss_pred Cccc-ccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchh--------- Q lcl|NC_015287. 1 MAKL-FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVD--------- 70 (563) Q Consensus 1 m~~~-~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd--------- 70 (563) |+.. -+.+..-..+..++. . ++..... +.+ . --+..-.+-+.+|+.+..+.+-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~-~--~~~~~~~-~~i------~------~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~ 64 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQI-K--PKYETQE-EMI------L------RLVREHKENIDNITMGERYYNHHPDILDAPFK 64 (478) T ss_pred CccccccCCchhhhHHHHHh-h--hccCChH-HHH------H------HHHHHHHHHHHHHHHHHHHhcccccccccchh Confidence 5444 222222222222110 0 1100000 000 0 00000011122233332222211 Q ss_pred ------------------HHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHh Q lcl|NC_015287. 71 ------------------SAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWY 132 (563) Q Consensus 71 ------------------~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WY 132 (563) +-...||+- .+.=-..+||.+..++.+ ..+....+++ -+|+..-.++.+.+. T Consensus 65 ~~~~~~~~~~~~~~ki~~n~~k~ivd~-~~~yl~g~p~~~~~~~~~--------~~~~l~~~~~-n~~~~~~~~~~~~~~ 134 (478) T protein:vir:10 65 RDVNGDYDETKPDWRMYTNYHQNLVDQ-KVAYAVANPVTFGVDNDK--------ALKQIQHTLN-HKWDDKLVDILTAAS 134 (478) T ss_pred hhcccccccccccceeccchHHHHHHH-HhhhhcccCceeecCChH--------HHHHHHHHHh-ccHHHHHHHHHHHHh Confidence 111222221 111113466676665422 2333344443 478999999999999 Q ss_pred hhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeeccccccc--ccccccceeeecCccccccc Q lcl|NC_015287. 133 VDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADI--ENPEIEEYYQYDPSQTQGKN 210 (563) Q Consensus 133 vDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~--~~~~~~eyf~y~~~~~~~~~ 210 (563) +.|+-|.+.-+|. +|-..+..+||+.+..|..-.........+..-.......+ +.++--.+|.+......... T Consensus 135 ~~G~~~~~v~~d~----~~~~~~~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~ 210 (478) T protein:vir:10 135 NKGIEWVQPYVDE----EGEFKTFRVPAEQAVPIWTNKERDELQAFIRVYELDGAERVEYWTKDDVTFYELKEGQLIPDF 210 (478) T ss_pred hCCeEEEEEEecC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEeeeCceEEEEEeCCcEEEEEecCCeeeccc Confidence 9999998876764 46788999999998877432111111111111111110000 00111112222111111000 Q ss_pred cccccCcCcc-ccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccC Q lcl|NC_015287. 211 ALGAIGQTPF-STKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLR-MIEDSLVIYRLSRAPERRIFYIDV 288 (563) Q Consensus 211 ~~g~~g~~~~-~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~-m~EDalVIyRi~RAPeRRvFyIDv 288 (563) ..+..+.... ..+...--+..=+|+.. .|.....|-|+..+....-|. ++-+....-+-++.|-+-+.-.+. T Consensus 211 ~~~~~~~~~~~~~~~~~~~~g~vPvv~~------~n~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~ 284 (478) T protein:vir:10 211 YRSEDHIQPHYYQGNKLMSWGRVPFIPF------KNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEG 284 (478) T ss_pred cccccccccceecccccccCCcceEEEe------ccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCc Confidence 0000000000 00000000000011111 123334566666555555554 334444445666677554432222 Q ss_pred CCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHH-HHHHHHHH Q lcl|NC_015287. 289 GNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELS-DIEYFQKK 367 (563) Q Consensus 289 Gnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~-DV~YF~~k 367 (563) .+... .... |..++ =+|++.-+| .+++.|-...+...+. -+.-+.+. T Consensus 285 ~~~~~-----~~~~-~~~~~-----------------------~~~~~~~~~---~~~~~l~~~~~~~~~~~~~~~l~~~ 332 (478) T protein:vir:10 285 EDMKD-----FMHN-LKYYK-----------------------AISVAGESG---SGVDTIKVEVPIDSVKEYTKMLRDY 332 (478) T ss_pred ccccc-----hhhh-hhhCc-----------------------eeEecCCCC---CcceEEeecCCHHHHHHHHHHHHHH Confidence 11110 0000 11111 112322222 3356555444544433 36677888 Q ss_pred HHHhcCCChhhccCCCcceeccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEE Q lcl|NC_015287. 368 LYRSLGVPESRIAGSGDGFNLGRSSE--ILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYD 445 (563) Q Consensus 368 Ly~aL~VP~sRl~~~~~g~~~g~~~e--ItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~ 445 (563) +|...++|- +..+ +|. |..|. |..-..-...-+.+.+..|...+.++|+.=+-+.|+ ..+|. .|.+. T Consensus 333 I~~~s~~p~--~~~~--~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~--~~d~~----~i~i~ 401 (478) T protein:vir:10 333 IIEFGQGVD--FQQD--KFG-NSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRL--DVRVQ----DIEIT 401 (478) T ss_pred HHHHhCCcC--cCcc--ccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Ccccc----cceEE Confidence 999999984 2222 222 22222 221111233335556666666666665543333443 33443 46777 Q ss_pred eeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCcccccc Q lcl|NC_015287. 446 FVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGV 525 (563) Q Consensus 446 f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~ 525 (563) |...---.+... +++++.+. | .+|.+++++.+-...| .+++.++|++|..+..-..|+.. T Consensus 402 f~~~~p~~~~e~-------~~~~~~~~---g-~iS~et~i~~~~~v~d--~~~E~~ri~~E~~~~~~~~~~~~------- 461 (478) T protein:vir:10 402 FNFNVMVNELEN-------SQIAMNST---G-LLSKETILGNHSWVQD--PVAEMERIEQENIELNQQLPDIE------- 461 (478) T ss_pred eCCCCCCCHHHH-------HHHHHHHh---C-CCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHhccccC------- Confidence 754332223222 33444443 4 4899999976654333 33444445554332110001000 Q ss_pred ccccCCCCcCCCCCcccccccCCCCcccCCCCCCCCC Q lcl|NC_015287. 526 HPEQMPGGAMNPQDPNMGGAPQEGGAEQAPQMPEGGE 562 (563) Q Consensus 526 ~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~ 562 (563) ++..++.. .++-.+|.| T Consensus 462 ------~~~~d~~~--------------~~~~d~~~e 478 (478) T protein:vir:10 462 ------EGLNDEQQ--------------RQSEDNQSE 478 (478) T ss_pred ------CCCccccc--------------ccCcCCCCC Confidence 00000000 001111111 No 108 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=97.66 E-value=3.1e-05 Score=45.30 Aligned_cols=428 Identities=12% Similarity=0.102 Sum_probs=178.2 Q ss_pred CcccccchhhHhhhcccccccccCcCCC-CcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcc Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNE-DSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINE 79 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~-d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvne 79 (563) |-+.+..-..+-++...+-. -+|.+. ++.-...-||.... .+.. + .....+.+|-|..||+.|.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~--g~~~s~~~~~~~~~~~~~~~~-~g~~---------v-~~~~al~~~~v~~ci~~Ia~~ 67 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKWL--GVPISLTDGSFWSAWGGMGSS-SGET---------V-TADSALQLSAVWSCVRLIAET 67 (437) T ss_pred CCcchhhhhhhhHHhhhhhc--CCcccCCchhHHHhhcccccC-CCce---------e-chHhhhccHHHHHHHHHHHHH Confidence 22211111111111111100 012221 11110111111111 0100 0 023346889999999999996 Q ss_pred eEEecCCCcEEEEEeeccccchHHHHH-HHHHHHHHHH-HhcchhhhHH----HHHHHhhhhhhheeeeecCCCCCccee Q lcl|NC_015287. 80 AIVSDQNDSPVEIDLENLPASKQLKDL-IREEFKTVKE-ILNFDDKCHE----ILRNWYVDGRIYYHKVIDIKAPEEGIK 153 (563) Q Consensus 80 aiv~d~~~~pV~l~l~~~~~s~sik~~-I~eeF~~i~~-ll~F~~~~~~----~~R~WYvDGri~fhkviD~~~~k~GI~ 153 (563) +.-. |+.+-=.+ . +..++. ......++|+ --+-...+++ ++..+.+.|--|..++.| ...++ T Consensus 68 ia~l-----p~~~~~~~-~--~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~----~g~~~ 135 (437) T protein:vir:10 68 IATL-----PLNLYQTK-P--DGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRS----AGVLI 135 (437) T ss_pred HhhC-----ceeEEEEc-C--CCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEec----CCcEE Confidence 5422 22221110 0 000000 1112222332 1233334444 445556778888776554 24689 Q ss_pred EEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccc Q lcl|NC_015287. 154 EVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDA 233 (563) Q Consensus 154 eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~da 233 (563) +|.+|+|..|..++. + ++ .-+|.|.. .+ .....++.+. T Consensus 136 ~L~~l~p~~v~i~~~---~----~g-----------------~~~y~~~~----------~~--------g~~~~~~~~d 173 (437) T protein:vir:10 136 GLELMLPQRTTVKRL---T----SG-----------------ALQYTYRN----------VD--------GTVSTLAEDD 173 (437) T ss_pred EEEEEcCcceEEEEC---C----CC-----------------eEEEEEEe----------cC--------ceEEEEcccc Confidence 999999988765421 0 00 01222311 00 1124566676 Q ss_pred eeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEe Q lcl|NC_015287. 234 ISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVY 313 (563) Q Consensus 234 I~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvY 313 (563) |.|.. .+..++-.+.|-|..|.+.+.....+++...=+----+--+-|..++ +.|.+.++++..+.+-.+|+.- T Consensus 174 Iih~r--~~~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g~--- 247 (437) T protein:vir:10 174 VFHVR--GFSLDGLMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTD-QILQKEKRAEIRTDLAEQFGGA--- 247 (437) T ss_pred EEEec--CcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhcCc--- Confidence 65543 23445666789999999999988888877665555556667777776 6788888777666655544320 Q ss_pred eCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCChhhccCCCcceeccccc Q lcl|NC_015287. 314 DASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSS 392 (563) Q Consensus 314 d~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~ 392 (563) .+.|.+ + .++ .|++++.|.-...-.+ ++=-++-.+.+.++++||...|..... -+... + T Consensus 248 -~nag~~------~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~t~~~-s 307 (437) T protein:vir:10 248 -MQAGKT------M-VLE----------AGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEK-STSWG-T 307 (437) T ss_pred -cccCcc------e-ecc----------CCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-ccccc-c Confidence 011211 1 121 2456666642222222 233346678899999999999943211 11111 1 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_015287. 393 EILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVD 472 (563) Q Consensus 393 eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~ 472 (563) .+...-+.|..+ .|+-.+. .+.+.|-+- ++++.++.. -.| .|.-+. +.... +..|.+.++.+- T Consensus 308 n~e~~~~~f~~~--tl~P~~~-~ie~~l~~k-----ll~~~e~~~--~~~--~fd~~~----ll~~d-~~~r~~~~~~~~ 370 (437) T protein:vir:10 308 GIEQQTLGFLTF--TLRPWLT-RIEQAARRS-----LLRPGERDQ--FYA--EFSVEG----LLRAD-SAGRAAFYSTMT 370 (437) T ss_pred hHHHHHHHHHHH--HHHHHHH-HHHHHHHhh-----ccCccccCc--eEE--EEechh----hhccC-HHHHHHHHHHHH Confidence 222222335443 2332222 233333333 345556642 234 444322 22211 356666665542 Q ss_pred hhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCcccccccccc-CCCCcCCCCCcccccccCCCCc Q lcl|NC_015287. 473 PYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQ-MPGGAMNPQDPNMGGAPQEGGA 551 (563) Q Consensus 473 p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~-~~~~~~~p~~~~~~~~~~~~~~ 551 (563) .- -++|.+-+|. ++.+.+.+ .-+..+. ...+.++- +..++..+...... ..+++..+++ .-+. T Consensus 371 ~~--G~~T~NE~R~-~~gl~pi~--gg~~~~~--~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~ 434 (437) T protein:vir:10 371 QN--GLMTRDECRA-KENLPPMG--GNAAVLT--VQSALLPI-DKLGEHTTATAAQDALKAWLYQEEK--------TRAT 434 (437) T ss_pred hC--CCcCHHHHHH-HhCCCCCC--CCcceEe--ecCcccch-hhccCcCCCcchhccccccCCCCCC--------CCcc Confidence 22 2445555543 22333211 0000000 00000100 00000000000000 0000000000 0011 Q ss_pred ccC Q lcl|NC_015287. 552 EQA 554 (563) Q Consensus 552 ~~~ 554 (563) ++. T Consensus 435 ~e~ 437 (437) T protein:vir:10 435 QER 437 (437) T ss_pred ccC Confidence 111 No 109 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=97.63 E-value=3.6e-05 Score=44.96 Aligned_cols=393 Identities=14% Similarity=0.168 Sum_probs=168.9 Q ss_pred ccchhhH--hhhcccccc--cccCcCCCCcccceecccccc-ccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcc Q lcl|NC_015287. 5 FGFKIQK--DDEQNKNVV--SPVPQSNEDSSDYYVSSGFYG-QYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINE 79 (563) Q Consensus 5 ~~f~~~~--~~~~~~~~~--s~~p~~~~d~~~~~~~~~~~~-~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvne 79 (563) |.| +.+ .-+..++.+ ..+ ..+......-+++.+ .+.++ ..+..+++|-|..||+-|.+. T Consensus 1 m~~-~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~v------------~~~~a~~~~~v~~~i~~ia~~ 64 (412) T protein:vir:26 1 MNV-IAKENIVTRIKKKLIDNWI---DQSTSKLYDFSPWKNRSFWGV------------INNTLETNETIFSAITKLSNS 64 (412) T ss_pred Ccc-chhhhhhhhhhhhHhhhhh---cccccccccccccCCcccccc------------chhhhhccHHHHHHHHHHHHh Confidence 444 211 111111111 000 000100000011111 11111 123457889999999999997 Q ss_pred eEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcchhhhH----HHHHHHhhhhhhheeeeecCCCCCcceeE Q lcl|NC_015287. 80 AIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFDDKCH----EILRNWYVDGRIYYHKVIDIKAPEEGIKE 154 (563) Q Consensus 80 aiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~~~~~----~~~R~WYvDGri~fhkviD~~~~k~GI~e 154 (563) +.-. |+.+- +..+. .......+|+. =+-...++ .++..++++|.-|..++.|.. .-+++ T Consensus 65 iA~l-----p~~~~-~~~~~-------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~---G~~~~ 128 (412) T protein:vir:26 65 MASL-----PLKMY-EDYKV-------VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY---HQPSK 128 (412) T ss_pred HhhC-----ceeEe-ecccc-------ccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCC---CcEEE Confidence 7643 23221 11111 11222233321 12223344 455577788998888776643 23789 Q ss_pred EEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccce Q lcl|NC_015287. 155 VRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAI 234 (563) Q Consensus 155 Lr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI 234 (563) |.+|+|..|..++.-. . ..-+|.|.... ...+.++.+.| T Consensus 129 L~~l~~~~v~v~~~~~---~--------------------~~~~y~~~~~~------------------g~~~~~~~~ev 167 (412) T protein:vir:26 129 LFLLNPDVVEMLIENQ---S--------------------RELYYSIHAAT------------------GNKLIVHNMDM 167 (412) T ss_pred EEEEcCceeEEEEeCC---C--------------------cEEEEEEEcCC------------------ceEEEEccccE Confidence 9999998887653211 0 01233332210 11245666766 Q ss_pred eeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEee Q lcl|NC_015287. 235 SFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYD 314 (563) Q Consensus 235 ~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd 314 (563) .|.. +....++-.+.|-|..|.+.......+++. -++.-.+.| . +....-+.|-+.++++..+.+.+.+. T Consensus 168 ih~~-~~~~~~~~~G~s~i~~~~~~i~~~~a~~~~-~~~~~~~~~-~-~i~~~~~~l~~e~~~~~~~~~~~~~~------ 237 (412) T protein:vir:26 168 LHFK-HIVASNMVQGISPIDVLKNTTDFDNAVRTF-NLTEMQKPD-S-FMLKYGSNVGKEKRQQVLEDFKQYYE------ 237 (412) T ss_pred EEeC-CCCCCCCcccccHHHHHHHHHHHHHHHHHH-HHHhcCCCC-c-eEEecCCCCCHHHHHHHHHHHHHHhh------ Confidence 4331 112334445577788887777766666665 344444443 3 33334567777777666665544332 Q ss_pred CCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHH---HHHHHHHHHHHhcCCChhhccCCCcceecccc Q lcl|NC_015287. 315 ASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELS---DIEYFQKKLYRSLGVPESRIAGSGDGFNLGRS 391 (563) Q Consensus 315 ~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~---DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~ 391 (563) +.|. .+ .+ ..|.+++.|. .+..+++ -..|-...+.++++||...|...++ -+.+.+ T Consensus 238 -~~g~------~~-vl----------~~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~-~~~sn~ 296 (412) T protein:vir:26 238 -ENGG------IL-FQ----------EPGVEIEPLP--KKYVSEDIVASENLTRERVANVFQLPSVFLNARSN-TNFAKN 296 (412) T ss_pred -cCCC------ee-ec----------CCCceEEEcC--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-CCcccH Confidence 2232 11 11 1356777774 2333332 3335668899999999999964321 222222 Q ss_pred chhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 392 SEILRDEIKFTKF-VGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAA 470 (563) Q Consensus 392 ~eItRDElKF~Kf-I~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~ 470 (563) .+.. .-|.++ |.-+..+ +.+.|-.. ++++.+|.. ...|.|.-+. +.... +.+|++.+.. T Consensus 297 e~~~---~~f~~~~l~P~~~~----ie~~ln~k-----Ll~~~~~~~---~~~~~fd~~~----l~~~d-~~~~~~~~~~ 356 (412) T protein:vir:26 297 EELN---RFYLQHTLLPIVKQ----YEEEFNRK-----LLTKTDREK---NRYFKFNVKS----YLRAD-SATQAEVYFK 356 (412) T ss_pred HHHH---HHHHHHHHHHHHHH----HHHHHHhh-----cCCcccccC---cceEEeechh----hhccC-HHHHHHHHHH Confidence 2222 125554 3333222 23333333 344445542 2334454322 22111 2345555543 Q ss_pred hhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCC Q lcl|NC_015287. 471 VDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGG 550 (563) Q Consensus 471 ~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~ 550 (563) +-.- -+ +|..|+-++ ..-+++ |+ .+..- ..+ -..|.+...+ T Consensus 357 ~~~~--G~------------~t~NE~R~~-------~gl~p~--~g---gD~~~-----~~~-n~~~~~~~~~------- 397 (412) T protein:vir:26 357 AVRS--GY------------YTINDIREW-------EDLPPV--EG---GDKPL-----ISG-DLYPIDTPLE------- 397 (412) T ss_pred HHhC--CC------------cCHHHHHHH-------hCCCCC--CC---cCeee-----ecc-cccccccchh------- Confidence 3221 12 333333211 122222 11 00000 000 0011111000 Q ss_pred cccCCCCCCCCCC Q lcl|NC_015287. 551 AEQAPQMPEGGEI 563 (563) Q Consensus 551 ~~~~~~~~~~~~~ 563 (563) .-+-.+|||- T Consensus 398 ---~~~~~~gG~~ 407 (412) T protein:vir:26 398 ---LRKSLKGGDK 407 (412) T ss_pred ---hcccccCCCC Confidence 0011233333 No 110 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=97.60 E-value=4e-05 Score=44.72 Aligned_cols=435 Identities=12% Similarity=0.061 Sum_probs=176.2 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceecc------cccccc-ccccchhh---------hHHHHHHHHHHHh Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSS------GFYGQY-VDIDGVFK---------SEFELIKRYREMA 64 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~------~~~~~~-~~~~~~~~---------~~~~LI~~YR~ma 64 (563) ||+-. --|+.+.+.. -++..+ .+.....- .-...+.+|+.+. T Consensus 1 ~~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~i~~~i~~~~~~~~r~~~l~ 58 (483) T protein:vir:12 1 MAQAL----------------------IKGGNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQ 58 (483) T ss_pred Cccch----------------------hcCCceeecCcchhhhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 33211 1111111110 011111 11111110 1122234565555 Q ss_pred hccchhHHH---------------------------HHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHH Q lcl|NC_015287. 65 LHPEVDSAI---------------------------EDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEI 117 (563) Q Consensus 65 ~~PEvd~Ai---------------------------~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l 117 (563) .+.+-...| ..||+- .+.=--..||.+..++. ...+..+.+++ T Consensus 59 ~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~-~~~~l~G~p~~~~~~d~--------~~~~~l~~~~~- 128 (483) T protein:vir:12 59 EYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQ-KVSYIVGKPIAFKHTDD--------EVVKRIDEVLG- 128 (483) T ss_pred HHhccccccccccccccccccccccccccccccchHHHHHHH-HhhhhcccCceeccCCh--------HHHHHHHHHHh- Confidence 554443211 111211 11111345666655542 23344444544 Q ss_pred hcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeeccccc--ccccccc Q lcl|NC_015287. 118 LNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPA--DIENPEI 195 (563) Q Consensus 118 l~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~--~~~~~~~ 195 (563) .+|+....++.+..++.|+-|.+.-+|. +|-..++.+||+.+..|..-.........+......... .++.+.. T Consensus 129 n~~~~~~~~~~~~~~~~G~~y~~v~~d~----d~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~ 204 (483) T protein:vir:12 129 NRFDDKLHSVLTGASNKGIEWLHPYLDE----EGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVT 204 (483) T ss_pred ccHHHHHHHHHHHHhhCCeEEEEEEEcC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeecceEEEEEecCe Confidence 4789999999999999999999877763 466889999999988874322211112222221111100 0111111 Q ss_pred cceeeecCccccccccccccCcCccccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHH-HHHHHHHHHH Q lcl|NC_015287. 196 EEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLR-MIEDSLVIYR 274 (563) Q Consensus 196 ~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~-m~EDalVIyR 274 (563) ..+|.+................ . .....--+..=+|++.. |+....|=++..+.....+. ++=+....-+ T Consensus 205 v~~~~~~~~~~~~~~~~~~~~~--~-~~~~~~~~g~vPvv~~~------nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~ 275 (483) T protein:vir:12 205 VNYYVYENGSLIPDYSNNLENS--K-THFSTGSWGKIPFIPFK------NNDLEISDIFMYKTLIDAYNRRLSDLSNTFK 275 (483) T ss_pred EEEEEEeCCeeeeccccccccc--c-cccccCCCCccceEEec------CCCCCCCchhhHHHHHHHHHHHHHHHHHHHH Confidence 2233332211110000000000 0 00000001111233321 12223444554444443333 3455555557 Q ss_pred HhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCC Q lcl|NC_015287. 275 LSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQN 354 (563) Q Consensus 275 i~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~n 354 (563) .++.|-+-+.-.+.-++. +..+ .+..++...-++ |.++.+|-...+ T Consensus 276 ~~~~~~lv~~g~~~~~~~-----~~~~---------------------------~~~~~~~~~~~~--~~~~~~l~~~~~ 321 (483) T protein:vir:12 276 DSNELTYVLTNYDDQELP-----EFKR---------------------------LLRYYGAIKVSD--NGGVDTIQVEVP 321 (483) T ss_pred HhcCceeeeecCCcccch-----hHHH---------------------------hhhhccccccCC--CCcceEEeecCC Confidence 777775544322222211 1111 111111111111 123555544334 Q ss_pred cchH-HHHHHHHHHHHHhcCCChhhccCCCcceeccccc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCh Q lcl|NC_015287. 355 LGEL-SDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSS-EILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTP 432 (563) Q Consensus 355 Lgei-~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~-eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~ 432 (563) .+.. .-+.-+.+.+|+..++|---+ + .|.-+.|+ .|.--+.....-+.+.++.|...+.++++.=+-+-|+ . T Consensus 322 ~~~~~~~~~~l~~~I~~~s~~p~~~~--~--~~~~n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~--~ 395 (483) T protein:vir:12 322 VENSKKYLDELYQKIMLFGQAVDFSS--D--KFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI--K 395 (483) T ss_pred HHHHHHHHHHHHHHHHHHhCCCCCCc--c--ccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--C Confidence 4332 334566677888888884222 2 22211222 2222223344556777788888888877753333343 3 Q ss_pred hHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCC Q lcl|NC_015287. 433 EDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKL 512 (563) Q Consensus 433 eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~ 512 (563) .+|. .|.+.|....--.+. +-+++++.+. | .+|.+++++.+-..+| .+++-++|++|..+..- T Consensus 396 ~~~~----~i~v~f~~~~p~~~~-------~~a~~~~kl~---G-iiS~et~~~~~~~v~d--~~~E~~ri~~E~~~~~~ 458 (483) T protein:vir:12 396 GEHK----DVDISFNYNKVANTE-------LQVQTAQQSM---G-IVSHETVLENHPFVED--LQAELERIEQEQMEYNK 458 (483) T ss_pred Cccc----eeeEEeCCCCCCCHH-------HHHHHHHHHh---c-cCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHh Confidence 4555 456777543322222 2244555553 4 4899999998765443 33333444444322111 Q ss_pred CCCcccCccccccccccCCCCcCCCCCcccc Q lcl|NC_015287. 513 ADPMEVQQLQLGVHPEQMPGGAMNPQDPNMG 543 (563) Q Consensus 513 ~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 543 (563) ..+.. .+.. ....+...++.+.+.. T Consensus 459 ~~~~~-----~~~~-~d~~~~~~~~~~~e~e 483 (483) T protein:vir:12 459 QLPNL-----DDGG-ADGAQQQERSNNKESE 483 (483) T ss_pred hcccc-----cccc-cCCcccCCCCCcccCC Confidence 11100 0000 0001111111111111 No 111 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=97.59 E-value=4e-05 Score=44.68 Aligned_cols=450 Identities=10% Similarity=0.066 Sum_probs=188.7 Q ss_pred Ccc--------cccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHH--HHhhccchh Q lcl|NC_015287. 1 MAK--------LFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYR--EMALHPEVD 70 (563) Q Consensus 1 m~~--------~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR--~ma~~PEvd 70 (563) .-. +-+| ++++.......+.- ..-.+.|. ...+.+........+ .-..++=.. T Consensus 33 ~~~~~~~~~~~i~~~-i~~~~~~~~~r~~~--------~~~yY~g~--------~~~i~~~~~~~~~~~~~~ri~~n~~k 95 (501) T protein:vir:96 33 LEELMVNNWELLKNF-INHHKLRQAPRIQE--------LLDYARGE--------NHDVLKSGRRKDNEMADKRAVHNYGR 95 (501) T ss_pred cccccCChHHHHHHH-HHHHHHHHHHHHHH--------HHHHhcCC--------CCcccCccccCccccccceeecchHH Confidence 000 1111 11111000000000 00000000 000000000000000 002233344 Q ss_pred HHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCc Q lcl|NC_015287. 71 SAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEE 150 (563) Q Consensus 71 ~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~ 150 (563) -+|+..+. . --.+||.+..++.+-++ .+.+.+..+++..+|+....++.+..++.|+-|.+.-.|. + T Consensus 96 ~Ivd~~~~--y---l~g~p~~~~~~~~~~~~----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~de----d 162 (501) T protein:vir:96 96 MISKFKTG--Y---LAGNPIRVEYDDNDDNS----QNDDAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSE----Y 162 (501) T ss_pred HHHHHHhh--h---hcccCeeEeeCCccchh----HHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcC----C Confidence 44554443 1 12578888887644444 5666677788889999999999999999999999877763 4 Q ss_pred ceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeec Q lcl|NC_015287. 151 GIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIA 230 (563) Q Consensus 151 GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~ 230 (563) |-..+..+||+.+..|..-.........|.+-..... -.....+-+|.+.........+...-....++ -+. T Consensus 163 g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~----~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~----~~g 234 (501) T protein:vir:96 163 DETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTL----QSAKDVVEIYTDEHIYTLDASDDFNEISVTTH----AFG 234 (501) T ss_pred CceEEEEEccceeEEEEcCCCCCceEEEEEEEEeecC----CCcEEEEEEEcCCcEEEEeeCCCceecccccc----CCC Confidence 6788999999999887322111111122222110000 01122233454443332221111000000000 011 Q ss_pred ccceeeeecceecCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcc Q lcl|NC_015287. 231 PDAISFCHSGLVDRNKQTILSYLHKSIKALNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRN 309 (563) Q Consensus 231 ~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~-m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~kn 309 (563) .=+|++. .|.....|=++..+.....+. ++=+....-+.++.|-+-+.-.+....+... .+ |.. + T Consensus 235 ~vPvv~~------~nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~-----~~-~~~--~ 300 (501) T protein:vir:96 235 TVPITEY------LNNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQA-----SD-MKR--T 300 (501) T ss_pred ccceEEe------cCCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccch-----hh-hhh--c Confidence 1123221 122234566666555544443 4444455556667776666543322211100 00 000 1 Q ss_pred eeEeeCCCCcccccccccchhhhhcccccCC----CCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCChhhccCCCc Q lcl|NC_015287. 310 KLVYDASTGEIRDDRKHMSMLEDFWLPRREG----GRGTEITTLPGGQNLGEL-SDIEYFQKKLYRSLGVPESRIAGSGD 384 (563) Q Consensus 310 klvYd~~TGev~d~~~~msMlEDywLpRReG----grgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~VP~sRl~~~~~ 384 (563) + .++++-..+ +.+..+..|-...+...+ .-++-+++.+|...++|---+. T Consensus 301 ~---------------------~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~---- 355 (501) T protein:vir:96 301 R---------------------LMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDT---- 355 (501) T ss_pred C---------------------eeeecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCcc---- Confidence 1 122222221 122345555443333222 2234556777888888843332 Q ss_pred ceeccccc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCh-hHHHHhhhceEEEeeccchHHHHHHHHHHH Q lcl|NC_015287. 385 GFNLGRSS-EILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTP-EDWEVLGDHIQYDFVYDNHFAELKETELLN 462 (563) Q Consensus 385 g~~~g~~~-eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~-eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~ 462 (563) +|....|+ .|.--.......+.+.++.|..-+.++++.=+-+-++... .+++ ...|.+.|...-.-.+ . T Consensus 356 ~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d--~~~i~i~f~~~~p~n~-------~ 426 (501) T protein:vir:96 356 NFSGNTSGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKDFD--ESLLKITFTPNLPKSL-------N 426 (501) T ss_pred cccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc--cccceEEeCCCCCcCH-------H Confidence 22111122 2222222345667777788888887777654333232211 1222 1347788865333222 3 Q ss_pred HHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCC-CCCcccCccccccccccCCCCcCCCCCcc Q lcl|NC_015287. 463 ERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKL-ADPMEVQQLQLGVHPEQMPGGAMNPQDPN 541 (563) Q Consensus 463 ~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~-~~p~~~~~~~~~~~~~~~~~~~~~p~~~~ 541 (563) +.++++..+. | .+|.+++++.+-..+| .+++.++|++|...--. ..+++- .+..+ +.+-...+ T Consensus 427 e~ad~~~kl~---g-~iS~et~~~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~-~~~~~---------~~~~~~~e 490 (501) T protein:vir:96 427 EQVSILTGLG---G-QVSQETALSLSGLVES--PNEELDKINKEMSEIDFKGYSNDF-NEHVG---------KYTDEVKE 490 (501) T ss_pred HHHHHHHHHh---c-cCchHHHHHhCCCCCC--HHHHHHHHHHHHHHhhccccccch-hhccc---------ccCCcCCC Confidence 4455666664 4 4899999998764443 34444555555432111 011111 01110 00000000 Q ss_pred cccccCCCCcc Q lcl|NC_015287. 542 MGGAPQEGGAE 552 (563) Q Consensus 542 ~~~~~~~~~~~ 552 (563) ..+...+-..| T Consensus 491 ~~~d~~e~~~~ 501 (501) T protein:vir:96 491 THTDDFEREYE 501 (501) T ss_pred CCCCccccccC Confidence 00000010111 No 112 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=97.58 E-value=4.2e-05 Score=44.61 Aligned_cols=421 Identities=14% Similarity=0.126 Sum_probs=179.8 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) |.--=|+. ..+|. ..+-...+..+.+.+...+.. +.+...+.- +-.+++|-|..||+-|.+.+ T Consensus 1 ~~~~~~~~----------~~~p~---~~~~~~~~~~~~~~~~~~g~~--~~~~~~~~~--~~~~~~~~V~acV~~IA~~i 63 (518) T protein:vir:78 1 MLLANGQT----------LSAPA---MAELSPQMQDSYYYAPAVGMQ--LERQFSLYG--GIYKNQPWVRTVIAKRAQAL 63 (518) T ss_pred CcccCcee----------eccch---hhhhhhhhhhcccccceecee--cccccchhh--HHhhhhHHHHHHHHHHHHhh Confidence 11111110 11111 100011111111111111111 111111111 01257899999999999965 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHh----cchhhhHHHHHHHh----hhhhhheeeeecCCCCCcce Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEIL----NFDDKCHEILRNWY----VDGRIYYHKVIDIKAPEEGI 152 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll----~F~~~~~~~~R~WY----vDGri~fhkviD~~~~k~GI 152 (563) - .-|+.+--.+ . + ...++.+..+.+| |-...++++++.|. +.|.-|..++.|.. .-+ T Consensus 64 A-----~lp~~l~~~~-~--~----~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~---G~~ 128 (518) T protein:vir:78 64 A-----RLPVKCMFTS-G--D----TETEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKS---GTP 128 (518) T ss_pred c-----cCceEEEEEc-C--C----ccccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC---CcE Confidence 4 2355443322 1 1 1122333333333 23345556555555 66888888776543 348 Q ss_pred eEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeeccc Q lcl|NC_015287. 153 KEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPD 232 (563) Q Consensus 153 ~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~d 232 (563) .+|.+|+|..|..++.. .. ...+|.|.... +. +...+.++.+ T Consensus 129 ~~L~~l~p~~Vtv~~~~---~~--------------------~~~~y~~~~~~-------~~--------~~~~~~~~~~ 170 (518) T protein:vir:78 129 EKLMPMHPSRVAIKRNS---RT--------------------GRYEYYFQAGA-------GV--------GTQLVSFADD 170 (518) T ss_pred EEEEEECCCceEEEEcC---CC--------------------CEEEEEEEecC-------Cc--------cceeEEecCC Confidence 89999999876644211 00 11233332211 11 1223567777 Q ss_pred ceeeeecceecCCCC-ccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhccee Q lcl|NC_015287. 233 AISFCHSGLVDRNKQ-TILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKL 311 (563) Q Consensus 233 aI~y~hSGl~d~~~~-~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knkl 311 (563) .|.|.. .+..++. .+.|-|..|++++.....+++...=+----+.-+-|...+ +.|.+..+++.-+.+...|+-- T Consensus 171 eIiHir--~~~~dg~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~-~~ls~e~~~~~k~~~~~~~~G~- 246 (518) T protein:vir:78 171 EVVPIR--FFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSPEAQQRLREQFDRAHAGS- 246 (518) T ss_pred cEEEec--CCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecC-CCCCHHHHHHHHHHHHHHhcCc- Confidence 775543 3334443 3578999999998888888877544433345556677776 6677666655444444444311 Q ss_pred EeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCChhhccCCCcceec Q lcl|NC_015287. 312 VYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE---LSDIEYFQKKLYRSLGVPESRIAGSGDGFNL 388 (563) Q Consensus 312 vYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~ 388 (563) .+.|.+ + .|+ .|.+++.|. .+.-+ ++-.+|....+.++++||...|...++ -+. T Consensus 247 ---~nag~~------~-vL~----------~G~~~~~l~--~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~-st~ 303 (518) T protein:vir:78 247 ---SNTGKT------M-VVE----------EGMEPIPLQ--LTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDR-ATF 303 (518) T ss_pred ---ccCCce------e-EcC----------CCceEEecc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCC-CCc Confidence 112221 1 222 144555553 23333 334457789999999999999942211 111 Q ss_pred cccchhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHH Q lcl|NC_015287. 389 GRSSEILRDEIKFTKF-VGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGV 467 (563) Q Consensus 389 g~~~eItRDElKF~Kf-I~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~ 467 (563) +.+.-.-+-|.++ |.-+-.++...|...|-++ ++. ...+ .|.-+ ++... -+..|++. T Consensus 304 ---sn~e~~~~~f~~~tL~P~~~~ie~eln~~L~~~-----------~~~-~~~~--~fd~~----~Llr~-D~~~r~~~ 361 (518) T protein:vir:78 304 ---SNISAQMRAFYRDTMAIPIARIQSAMDKYVGQY-----------WVR-KNRM--KFDID----DVIQP-DWEAKSES 361 (518) T ss_pred ---hhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-----------ccC-cceE--Eeech----hhhcc-CHHHHHHH Confidence 1122222335554 4445555554444443222 111 1233 34322 22111 12456666 Q ss_pred HHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCcc---cc---ccccccCCCCcCCCCCcc Q lcl|NC_015287. 468 VAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQL---QL---GVHPEQMPGGAMNPQDPN 541 (563) Q Consensus 468 ~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~---~~---~~~~~~~~~~~~~p~~~~ 541 (563) +..+-.. -+++.+-+|. .+.+.+ .++|.-..-- +. +..+.....|.+.|..+. T Consensus 362 ~~~~~~~--G~lT~NE~R~-~~gl~p------------------ie~~~gD~~~v~~n~~pl~~~~~~~~~g~~~~~~~~ 420 (518) T protein:vir:78 362 TQKMVNS--GVATPNEGRE-IMGLPR------------------SDDPKADELYANSALQPLGATPDGAVEGEEAPAPKR 420 (518) T ss_pred HHHHHhC--CCcCHHHHHH-HhCCCC------------------CCCCCCceeeecccceecccccccccCCCCCCCCCC Confidence 6554332 2455555543 223333 2111111000 00 000000001111111110 Q ss_pred cccccCCC-CcccC---------------------------CCCCCCCCC Q lcl|NC_015287. 542 MGGAPQEG-GAEQA---------------------------PQMPEGGEI 563 (563) Q Consensus 542 ~~~~~~~~-~~~~~---------------------------~~~~~~~~~ 563 (563) ++.++. .+++. -++|.|.|. T Consensus 421 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 468 (518) T protein:vir:78 421 --PASTPVASLDQSPPASVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKES 468 (518) T ss_pred --CCcccccccccCccccCCCCCcccccccccccccchhcccCCCCcccc Confidence 011000 00011 122333333 No 113 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=97.57 E-value=4.3e-05 Score=44.51 Aligned_cols=431 Identities=12% Similarity=0.076 Sum_probs=173.3 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) ++.|||...+.... ...-.+..+...... ..|+ ....+ . -+. -+..+++|-|-.||+-|.+.+ T Consensus 4 ~~~l~~~~~~~~~~--~~~~~~~~~~~~~~~---~~~~--~~~~g---~------~v~-~~~al~~~~v~~~i~~ia~~i 66 (457) T protein:vir:62 4 WSALFGRGHSPALD--AAEGRAWEPYDPSIY---NLGA--TASSG---E------RVT-PHDALQVSAVFASVRLLSETI 66 (457) T ss_pred hhhhhccccccccc--cccccccccchhhhh---hccc--cccCC---c------eec-hHHhhccHHHHHHHHHHHHhH Confidence 33334321110000 000000000000000 0000 00000 0 000 134567899999999998855 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHH----HhhhhhhheeeeecCCCCCcceeEEE Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRN----WYVDGRIYYHKVIDIKAPEEGIKEVR 156 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~----WYvDGri~fhkviD~~~~k~GI~eLr 156 (563) .- -|+.|--++ . ...+..-.-.+..+++.-+=...++++++. +.+.|--|.-+ .+. ..++++|. T Consensus 67 A~-----lp~~~~~~~-~--~~~~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i-~~~---~g~~~~l~ 134 (457) T protein:vir:62 67 AT-----LPLSTYSKR-G--GTRKEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAV-RWA---GPNIAGLD 134 (457) T ss_pred hh-----CceEEEEec-C--CccccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEE-EeC---CCcEEEEE Confidence 42 244432221 1 111111111122222111111235555544 45567766543 332 46799999 Q ss_pred EeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceee Q lcl|NC_015287. 157 YIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISF 236 (563) Q Consensus 157 ~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y 236 (563) +|+|..|...+....... .+.|+.|.-.. .| . ......++.+.|.| T Consensus 135 ~l~p~~v~v~~~~~~~~~--------------------~~~~~~y~~~~------~g---~-----~~~~~~~~~~eiih 180 (457) T protein:vir:62 135 VLDPTKIHVHMVMVDGLR--------------------RKVFEAYDIDA------DG---N-----EVLLGWFTPRDVLH 180 (457) T ss_pred EEcCcceEEEEeccCCcc--------------------ceeEEEEEEcc------CC---c-----eeEEEeeCccceEE Confidence 999998876643222110 12233332210 00 0 01113456666644 Q ss_pred eecceecCCC-CccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeC Q lcl|NC_015287. 237 CHSGLVDRNK-QTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDA 315 (563) Q Consensus 237 ~hSGl~d~~~-~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~ 315 (563) .. ..+.++ -.+.|-++.|++.+.....+++...-+=---+--+-|..++ |.|-+..+++..+.+...|+.. . T Consensus 181 ~r--~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~ls~e~~~~~~~~~~~~~~G~----~ 253 (457) T protein:vir:62 181 IP--GMMLPGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVP-GTMSEEGLARAREAWRAANSGV----D 253 (457) T ss_pred ec--CCCCCCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcC-CCCCHHHHHHHHHHHHHHhcCc----c Confidence 43 344444 35689999999999988888877654433334455677776 5776665555444443333311 0 Q ss_pred CCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchH---HHHHHHHHHHHHhcCCChhhccCCCcceeccccc Q lcl|NC_015287. 316 STGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGEL---SDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSS 392 (563) Q Consensus 316 ~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei---~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~ 392 (563) +.|. .+ .|+ .|.+++.|. .+..++ +=-+|....+.++++||...|..-++ -+... + T Consensus 254 nag~------~~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~~~~~-s 312 (457) T protein:vir:62 254 NAHR------VA-LLT----------EGAKFSKVA--MSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATN-STSWG-S 312 (457) T ss_pred ccCc------ce-ecC----------CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC-ccccc-c Confidence 1121 11 121 245555552 333332 33346778899999999999942211 11111 2 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_015287. 393 EILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVD 472 (563) Q Consensus 393 eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~ 472 (563) .+...-+.|.+++ |+--+ ..+.+.|-..|+ ++.+. ....+.|.-+.. .. .-+..|++++..+- T Consensus 313 n~eq~~~~f~~~~--l~P~~-~~ie~~ln~~L~-----~~~~~----~~~~i~fd~~~l----~~-~d~~~r~~~~~~~~ 375 (457) T protein:vir:62 313 GLAEQNIAFTMFS--LRPWL-ERIEAGFNRLLF-----AETAD----RFRFVKFNLDEI----KR-GAPKERMELWSLGL 375 (457) T ss_pred hHHHHHHHHHHHH--HHHHH-HHHHHHHHhhhc-----Ccccc----CceEEEeechhh----hc-cCHHHHHHHHHHHH Confidence 2333334466653 32211 233333444433 33332 223445544331 11 12245666665443 Q ss_pred hhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccC--------------ccccccccccCCCCcCCCC Q lcl|NC_015287. 473 PYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQ--------------QLQLGVHPEQMPGGAMNPQ 538 (563) Q Consensus 473 p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~--------------~~~~~~~~~~~~~~~~~p~ 538 (563) .- -+++..-+|+.. .+.+ ++.+.... +..+.+.+.....+...|+ T Consensus 376 ~~--G~~T~NE~R~~~-gl~p------------------i~~g~~D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (457) T protein:vir:62 376 QN--GIYSIDEVRAAE-DMTP------------------LPDGLGEKYRVPLNLGEIGEEPEPEPAPAPPAIDPPAEEPA 434 (457) T ss_pred hC--CCcCHHHHHHHh-CCCC------------------CCCCCcceeeeccccccccccccccccCCCccCCCCccCCC Confidence 22 245555554422 3332 22210000 0000000001111111111 Q ss_pred Cc-ccccccCCCCcccCCCCCCCCCC Q lcl|NC_015287. 539 DP-NMGGAPQEGGAEQAPQMPEGGEI 563 (563) Q Consensus 539 ~~-~~~~~~~~~~~~~~~~~~~~~~~ 563 (563) ++ +.+...+.|..++. -.|.+- T Consensus 435 ~~~~~~~~~~~~d~~~~---~~~~~~ 457 (457) T protein:vir:62 435 DDEEPDNAEGDPDEGET---EDDDDA 457 (457) T ss_pred CCCCCCCCCCCCccccc---cccccC Confidence 11 00000001110000 001111 No 114 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=97.57 E-value=4.3e-05 Score=44.50 Aligned_cols=406 Identities=13% Similarity=0.119 Sum_probs=170.1 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) .++||+- +++. +|....-+. -..+..++.....+.+- . +...++|-|..||+-|.+.+ T Consensus 3 ~~~~~~~-----~~~~---~s~~~~w~~--~~~~~~~~~~~~g~~vt-----~-------~~al~~~~v~~~i~~Ia~~i 60 (421) T protein:vir:10 3 IPQMFEG-----KKRS---VSGGGFWEA--MLGGVRSSHSKAGVMIT-----P-------ETALALSAVRACVTLLAESV 60 (421) T ss_pred Ccchhcc-----cccc---cCcchhhHH--HhhhhccCcccCCceec-----h-------HHhhccHHHHHHHHHHHHhh Confidence 2333322 2211 111111000 00011111111111111 0 13468899999999999975 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcchhhhHHH----HHHHhhhhhhheeeeecCCCCCcceeEE Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFDDKCHEI----LRNWYVDGRIYYHKVIDIKAPEEGIKEV 155 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~~~~~~~----~R~WYvDGri~fhkviD~~~~k~GI~eL 155 (563) .-+ |+.|--.+-. .+.+.........+|+. -|-...++++ +..++++|.-|..++.|.. .-+++| T Consensus 61 A~l-----p~~~~~~~~~--g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~---G~~~~L 130 (421) T protein:vir:10 61 AQL-----PVELYRRDKN--GGRQRATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGK---GYPKEL 130 (421) T ss_pred ccC-----ceEEEEEcCC--CceeecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC---CcEEEE Confidence 532 3333111100 00000111112222321 2333445444 4456688998888777643 338899 Q ss_pred EEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeeccccee Q lcl|NC_015287. 156 RYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAIS 235 (563) Q Consensus 156 r~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~ 235 (563) .+|+|..|..++. + ++ ..||.+... |. .++.+.|. T Consensus 131 ~~l~~~~v~v~~~---~----~g-----------------~~~y~~~~~-----------g~----------~~~~~eii 165 (421) T protein:vir:10 131 IPINPKKVIVLKG---P----DG-----------------MPYYEIPEI-----------GE----------TLPMRMMH 165 (421) T ss_pred EEecCceEEEEEC---C----Cc-----------------eEEEEEcCC-----------Cc----------EEchhhEE Confidence 9999988765421 1 11 113333111 00 24455664 Q ss_pred eeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeC Q lcl|NC_015287. 236 FCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDA 315 (563) Q Consensus 236 y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~ 315 (563) |.+ .++.++-.+.|.|+.|.+++.....+++...=+=---+--+-+...+- +++..+.++-...+..+++++.-=-. T Consensus 166 h~~--~~~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-~~~~~~~~e~~~~~~~~~~~~~~g~~ 242 (421) T protein:vir:10 166 HVK--VFSLDGYIGSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPK-EAPAIKSQEKIDQLLAKWTDRYSGIN 242 (421) T ss_pred Eec--CcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecC-ccCccCCHHHHHHHHHHHHHHhcCcc Confidence 443 245566667899999999998888777665433333344555666664 23222233333334444444321001 Q ss_pred CCCcccccccccchhhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchh Q lcl|NC_015287. 316 STGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG-QNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEI 394 (563) Q Consensus 316 ~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eI 394 (563) +.|.+ + .++ .|++++.|--. ..+.-++-.++..+.+.++.+||..-|...+.+ +. +-+ T Consensus 243 n~~~~------~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~-t~---sn~ 301 (421) T protein:vir:10 243 NMFSV------A-LLQ----------EGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKA-TN---NNI 301 (421) T ss_pred ccCcc------e-ecC----------CCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCC-cc---ccH Confidence 11211 1 111 35666666432 122223344578888999999999999532211 11 112 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_015287. 395 LRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPY 474 (563) Q Consensus 395 tRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~ 474 (563) .-.-+-|.++ .|+-.+ ..+.+.|-. .+++++++. ...++|..+.... .-+..|++.+..+-.- T Consensus 302 e~~~~~f~~~--tl~P~~-~~ie~~ln~-----kL~~~~~~~----~~~v~fd~~~l~~-----~d~~~~~~~~~~~~~~ 364 (421) T protein:vir:10 302 EHQGLQFVMY--TLLAWL-KRHEGALQR-----DLLLPSERR----DLYIEFNVSGLLR-----GDQKSRYESYALGRQW 364 (421) T ss_pred HHHHHHHHHH--HHHHHH-HHHHHHHhh-----hccCccccC----CeEEEEechhhhc-----cCHHHHHHHHHHHHhC Confidence 2222335443 232211 122333333 344555443 3455565433211 1123445544332211 Q ss_pred hchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccC Q lcl|NC_015287. 475 VGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQA 554 (563) Q Consensus 475 vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~ 554 (563) -+ ||..|+- + ....++++ +-..-. .|.+ +.+..+..+++.. T Consensus 365 --G~------------~T~NE~R---~----~~gl~p~~--ggD~~~--------------~~~n--~~~~~~~~~~~~~ 405 (421) T protein:vir:10 365 --GW------------LSVNDIR---R----MENLPPIA--GGDKYL--------------TPLN--MVDSAQIIPGDKK 405 (421) T ss_pred --CC------------cCHHHHH---H----HhCCCCCC--Ccceee--------------eccc--cccccccccCCCC Confidence 12 3443333 1 12222322 100000 0111 1111111111222 Q ss_pred CCCCCCCCC Q lcl|NC_015287. 555 PQMPEGGEI 563 (563) Q Consensus 555 ~~~~~~~~~ 563 (563) |.-..|+|. T Consensus 406 ~~~~~~~e~ 414 (421) T protein:vir:10 406 PTAQQMAEI 414 (421) T ss_pred cccccCccc Confidence 222233333 No 115 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=97.54 E-value=4.8e-05 Score=44.25 Aligned_cols=399 Identities=13% Similarity=0.102 Sum_probs=168.7 Q ss_pred cccchhhHhhhcc---cccccccC----cCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHh Q lcl|NC_015287. 4 LFGFKIQKDDEQN---KNVVSPVP----QSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDI 76 (563) Q Consensus 4 ~~~f~~~~~~~~~---~~~~s~~p----~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eI 76 (563) +-|+|.+++.... +.+-++.. ........... ....+.+.++ ....|..++++|-|..||+-| T Consensus 1 ~~~~~~~~~~~~m~~F~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~---------~~~~~~~~~~~~~v~~cI~~i 70 (413) T protein:vir:96 1 MPGVSEIRKDKNLKFFNNKRSPTEESKAKDEIPKAPQVV-MTLPNFFKEL---------ISDGYTKLSDSPEVRMAVDCI 70 (413) T ss_pred CCccchhhhhhcCCccccCCCcchhhhhhcccccccccc-ccchhhHhhh---------ccchhHHHhhchHHHHHHHHH Confidence 2344444432110 00000000 00000000000 0000111111 122345678899999999999 Q ss_pred hcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcchhhhHH----HHHHHhhhhhhheeeeecCCCCCcc Q lcl|NC_015287. 77 INEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFDDKCHE----ILRNWYVDGRIYYHKVIDIKAPEEG 151 (563) Q Consensus 77 vneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~~~~~~----~~R~WYvDGri~fhkviD~~~~k~G 151 (563) .+.+.-+ |+.+--.+ ++-++.+.....++++. =+=...+++ ++..+...|.-|.-++.|.+ ..- T Consensus 71 a~~ia~~-----~~~~~~~~----~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~--g~~ 139 (413) T protein:vir:96 71 ADLVSNM-----TIQLMQNG----ETGDKRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVS--GDK 139 (413) T ss_pred HHhhccC-----ceEEEEec----CCCccccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC--CCc Confidence 9976432 34332121 11111222222223221 111223344 45556677888887666532 123 Q ss_pred eeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecc Q lcl|NC_015287. 152 IKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAP 231 (563) Q Consensus 152 I~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~ 231 (563) +++|.+|+|..|+..+. + +. -+|.|...+ ..++. T Consensus 140 ~~~L~~l~~~~v~~~~~---~----~~------------------~~y~~~~~~---------------------~~~~~ 173 (413) T protein:vir:96 140 IIGLTPISPYKVTFNVS---D----DD------------------LDYSITFDN---------------------KEYDP 173 (413) T ss_pred eEEEEEecCceeEEEEc---C----Ce------------------EEEEEeecC---------------------cEEch Confidence 78999999988765421 1 11 122332110 01223 Q ss_pred cce-eeeecceecCC-CCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcc Q lcl|NC_015287. 232 DAI-SFCHSGLVDRN-KQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRN 309 (563) Q Consensus 232 daI-~y~hSGl~d~~-~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~kn 309 (563) +.| ++.+. .+.+ +-.+.|.|..|.+++.....+++...=+----+.-+-++.++ ++|.+..+++..+.+...|.. T Consensus 174 ~evih~k~~--~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g 250 (413) T protein:vir:96 174 STLLHFVLN--PSIERPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVD-SDSDELSDEEGRENFEEMYLK 250 (413) T ss_pred hhEEEEecc--CCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeC-CCCCHHHHHHHHHHHHHHhcC Confidence 334 33321 2222 334689999999999988888887665555556667778877 567777776655554444432 Q ss_pred eeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCcceecc Q lcl|NC_015287. 310 KLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLG 389 (563) Q Consensus 310 klvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g 389 (563) .++..+.+ .+++ +|...+++..+. -..+.-++-..|-.+.+.++++||...|... T Consensus 251 ----------~~n~g~~~-vl~~------~~~~~~~~~~~~-~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~------- 305 (413) T protein:vir:96 251 ----------RKEAGKPW-IIPE------GMVNVQQIKPLT-LNDLAINDAVTLDKKTVAGIFGVPAFLLGVG------- 305 (413) T ss_pred ----------ccccCcee-eecC------CcccccccccCC-hhHHHHHHHHHHHHHHHHHHhCCCHHHcCCC------- Confidence 11111111 1110 111112332221 1233334455677889999999999999321 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 390 RSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVA 469 (563) Q Consensus 390 ~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~ 469 (563) .+++-+ -..|.++ .|+- +...+.+.|-..|+-+| ..+.|.- .++.... +.+|.+.+. T Consensus 306 ~~~~~~--~~~~~~~--~l~P-~~~~ie~~ln~~ll~~~-------------~~~~fd~----~~ll~~d-~~~~~~~~~ 362 (413) T protein:vir:96 306 TYNKDE--FNNFINT--KIMS-IAQVIQQTYNKLIVEED-------------MYFSLNP----RSLYNYS-LTEMVSAGA 362 (413) T ss_pred cchHHH--HHHHHHH--HHHH-HHHHHHHHHHHhhCCCC-------------cEEEEec----hhhhccC-HHHHHHHHH Confidence 111111 1224332 2333 22334455555544322 3344433 2232221 235555554 Q ss_pred HhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCC Q lcl|NC_015287. 470 AVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEG 549 (563) Q Consensus 470 ~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~ 549 (563) .+-.- -+ +|..|+- + . ..-+++ |+.+ ..- .. .-..|.+ +....++ T Consensus 363 ~~~~~--G~------------~t~NE~R---~---~-~g~~p~--~~gd---~~~-----~~-~n~~~~~--~~~~~~~- 407 (413) T protein:vir:96 363 QMTQL--NA------------LRRNEFR---N---W-VGMPPD--AEMD---DLL-----VL-ENYLQQK--DLVNQKK- 407 (413) T ss_pred HHHhC--CC------------cCHHHHH---H---H-hCCCCC--CCcc---eee-----ec-ccccchh--hcccccC- Confidence 43221 12 3443332 1 1 122222 2100 000 00 0001111 1111111 Q ss_pred CcccCCCCCCCCCC Q lcl|NC_015287. 550 GAEQAPQMPEGGEI 563 (563) Q Consensus 550 ~~~~~~~~~~~~~~ 563 (563) .+|||- T Consensus 408 --------~~~~dt 413 (413) T protein:vir:96 408 --------LIQDET 413 (413) T ss_pred --------CCCCCC Confidence 122222 No 116 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=97.53 E-value=4.9e-05 Score=44.19 Aligned_cols=376 Identities=15% Similarity=0.076 Sum_probs=159.7 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) |.=|....+++.+.+ +..++.++ +......|+..+.++.- +.-.++|-|..||+-|.+.+ T Consensus 1 Mg~~~~~~~~k~~~~-----~~~~~~~~-~~~~~~~~~~~~~~v~~--------------~~~l~~~~v~~~i~~ia~~i 60 (383) T protein:vir:10 1 MGLLTPKNFSKRNAK-----NMVYPSNP-AFFTTTVGGMQLSYVSA--------------LSALQNTNVYSVINRIASDV 60 (383) T ss_pred CCccccccccccccc-----ccccccch-hhhhhhccCccccccch--------------hHhhcchHHHHHHHHHHHhh Confidence 444332222222222 22233222 11112222222222211 12345788999999999866 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhh----HHHHHHHhhhhhhheeeeecCCCCCcceeEEE Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKC----HEILRNWYVDGRIYYHKVIDIKAPEEGIKEVR 156 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~----~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr 156 (563) .-. |+. +.+... ..+++-=+=...+ ..+++.+..+|.-|..++-| ..++. T Consensus 61 a~~-----~~~--~~~~~~------------~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~-------~~~~~ 114 (383) T protein:vir:10 61 SSA-----HFK--TENTAT------------LNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ-------NLEHI 114 (383) T ss_pred ccC-----cee--ecccch------------hhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC-------ceeEe Confidence 543 332 222111 1223222222233 34455566788888876544 35677 Q ss_pred EeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceee Q lcl|NC_015287. 157 YIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISF 236 (563) Q Consensus 157 ~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y 236 (563) .++|-.|+.++. . ++ .+|.|... .+...+.++.+.|.| T Consensus 115 p~~~~~v~~~~~-----~--~~------------------~~~~~~~~-----------------~~~~~~~~~~~evih 152 (383) T protein:vir:10 115 PNSDVQINYLPG-----N--MG------------------IVYTVLES-----------------NDRPKMVLRQDQMLH 152 (383) T ss_pred ecCcceEEEEEc-----C--Cc------------------eEEEEEEc-----------------CCceEEEEcccceEE Confidence 888766554411 0 00 11211110 011235677777754 Q ss_pred eecceec-CCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeC Q lcl|NC_015287. 237 CHSGLVD-RNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDA 315 (563) Q Consensus 237 ~hSGl~d-~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~ 315 (563) ...--.+ .++..++|.|..|++.+.....++....=+----+--+-+..++-+ +-..++.+=+++.++++... . T Consensus 153 ~r~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~-~~~~e~~~~~~~~~~~~~~~----~ 227 (383) T protein:vir:10 153 FRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNY-LSDGKDLESAREEFEKANTG----D 227 (383) T ss_pred eccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCC-CCCHHHHHHHHHHHHHHhCc----c Confidence 4211111 1233468999999999999888888765443333555566666644 33334444444444444321 1 Q ss_pred CCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch-HHHH-HHHHHHHHHhcCCChhhccCCCcceeccccch Q lcl|NC_015287. 316 STGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDI-EYFQKKLYRSLGVPESRIAGSGDGFNLGRSSE 393 (563) Q Consensus 316 ~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV-~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~e 393 (563) +.|.+ + .+ ..|.+++.|.-...-.+ +.+. ++-.+.+.++++||.+.|.....+ +... +. T Consensus 228 n~~~~------~-vl----------~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~-~~~~-sn 288 (383) T protein:vir:10 228 NSGRL------M-VL----------PDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTST-ESQH-SN 288 (383) T ss_pred ccCCc------c-cc----------CCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCC-CCcc-cc Confidence 22322 1 22 12677777754322222 2333 344688999999999999532211 0111 11 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_015287. 394 ILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDP 473 (563) Q Consensus 394 ItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p 473 (563) +.....-|.+-+.-+-+.+ .+.|...|. + ..|+|++. .. ... -+..|++.+..+-. T Consensus 289 ~eq~~~~~~~~l~P~~~~i----e~~l~~~l~-----~--------~~~~f~~~--~l----~~~-d~~~~~~~~~~~~~ 344 (383) T protein:vir:10 289 IDQIKATYLANLNSYVNPI----VDELRLKMN-----A--------PDLELDIK--DM----LDV-DDSILINQVSNLAK 344 (383) T ss_pred HHHHHHHHHHHHHHHHHHH----HHHHHHhhC-----C--------ceEEeech--hh----hcc-CHHHHHHHHHHHHh Confidence 2211122433333333333 333433332 1 22443332 21 111 11234444332221 Q ss_pred hhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCccc Q lcl|NC_015287. 474 YVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQ 553 (563) Q Consensus 474 ~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~ 553 (563) - -+ +|..|+-+ .....+++.++...... ..... +.+.++ T Consensus 345 ~--G~------------~t~nE~R~-------~lg~~p~~~~d~~~~~~---~~~~~-----------------~gGd~e 383 (383) T protein:vir:10 345 S--GV------------LGAEQAQF-------ILTRSGFLPDNLPEFKP---LTNET-----------------KGGDDK 383 (383) T ss_pred C--CC------------cCHHHHHH-------HhCCCcccCCcccccCC---CcccC-----------------CCCCCC Confidence 1 12 34433321 12222333221111000 00011 111111 No 117 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=97.51 E-value=5.3e-05 Score=44.05 Aligned_cols=384 Identities=13% Similarity=0.142 Sum_probs=164.0 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) |-.+|+| +.+...... ..++..... ++......+.+.+. .+. -+.. +..+++|-|..||+-|.+.+ T Consensus 2 ~m~~f~~-~~~~~~~~~-~~~~~~~~~-~~~~~~~~~~~~~~----~~~------~v~~-~~al~~~~v~~~i~~ia~~i 67 (392) T protein:vir:10 2 ILPILNF-INQTNDPPE-VGSVQSYFP-DGNDAQIMESLLGD----NNE------WVSA-RAALRNSDLFSIILQLSSDL 67 (392) T ss_pred cchhhhh-hhccccccc-ccccccccc-cCchhhhhhhhcCC----CCc------eech-HHhhccHHHHHHHHHHHHhh Confidence 4455554 222211111 111111111 11111111222111 000 0111 23356899999999999965 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhh----HHHHHHHhhhhhhheeeeecCCCCCcceeEEE Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKC----HEILRNWYVDGRIYYHKVIDIKAPEEGIKEVR 156 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~----~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr 156 (563) .-+ |+ .+.+. .... +++-=+-...+ +.++..+++.|.-|..++.|.. ..+++|. T Consensus 68 a~l-----p~--~~~~~-----~~~~-------l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~---g~~~~L~ 125 (392) T protein:vir:10 68 AIV-----KI--NAEKK-----KNQG-------IIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNAN---GADMKWE 125 (392) T ss_pred ccC-----ce--eeccc-----hhhh-------HhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCC---CcEEEEE Confidence 432 22 22111 0111 22222223333 3455577888999988887743 3489999 Q ss_pred EeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceee Q lcl|NC_015287. 157 YIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISF 236 (563) Q Consensus 157 ~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y 236 (563) +|+|..|..++... ++ .-+|.|...+ +. ......++.+.|.| T Consensus 126 ~l~~~~v~~~~~~~------~~-----------------~~~y~~~~~~-------~~--------~~~~~~~~~~eiih 167 (392) T protein:vir:10 126 YLRPSQVNTYYFEY------EN-----------------GMYYNITFDD-------PK--------IEPILQAPQSDLIH 167 (392) T ss_pred EEcCceeEEEEcCC------Cc-----------------eEEEEEEecC-------cc--------cceeEEEccccEEE Confidence 99999887553110 01 1122221110 00 01124566677755 Q ss_pred eecceecCCCC-ccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeC Q lcl|NC_015287. 237 CHSGLVDRNKQ-TILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDA 315 (563) Q Consensus 237 ~hSGl~d~~~~-~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~ 315 (563) .. ..+.++. .+.|.|..|...+.....+++...-+=---+--+-+..++.+..+..++.+.++ ..|+.. . T Consensus 168 ~~--~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~---~~~~~~----~ 238 (392) T protein:vir:10 168 MK--LLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRS---RSFMKR----S 238 (392) T ss_pred ec--CCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHH---HHHhcc----c Confidence 43 2344553 468999999999999988887765433333455667777665545444433322 233321 1 Q ss_pred CCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCChhhccCCCcceeccccchh Q lcl|NC_015287. 316 STGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEI 394 (563) Q Consensus 316 ~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eI 394 (563) +.|. .+ .+ ..|++++.|.-...-.| ++=.+|..+.+.++++||...|...+ .++.. T Consensus 239 ~~g~------~~-vl----------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~------~~~~~ 295 (392) T protein:vir:10 239 RSGG------PV-VL----------DDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQG------DQQSS 295 (392) T ss_pred cCCC------ee-ec----------CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC------CcccH Confidence 1221 11 11 12567777754333333 45567888999999999999994321 11111 Q ss_pred hHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_015287. 395 LRDEIKFTKFV-GRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDP 473 (563) Q Consensus 395 tRDElKF~KfI-~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p 473 (563) ...-..|..++ .-+-.++... |...|. +.-+++. ...+..|. ..+...+..+-. T Consensus 296 ~~~~~~f~~~~l~P~~~~ie~~----l~~~L~-----~~~~~d~-----~~~~~~d~-----------~~~~~~~~~l~~ 350 (392) T protein:vir:10 296 IQQISGMYASALNRYLRPAISE----LEYKLS-----DHISVNM-----RPAIDPLG-----------DNYLSTISTATR 350 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHH----HHHhcc-----ccccccc-----hhhhccCH-----------HHHHHHHHHHHh Confidence 11112244322 2233333222 222221 1111110 00010110 011111111100 Q ss_pred hhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCC Q lcl|NC_015287. 474 YVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGG 550 (563) Q Consensus 474 ~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~ 550 (563) +-. +|..|.- +.. ...|.. |++.-+.++- .|..+ ..+++|+| T Consensus 351 --~g~------------~t~nE~r---~~l---~~~g~~--p~e~r~~e~l-----------~~~~~--Gd~~~p~p 392 (392) T protein:vir:10 351 --WGA------------LAENQAT---FVL---QEAGYI--PKDLPAPENT-----------NKKTT--GQSNEPVP 392 (392) T ss_pred --CCC------------cCHHHHH---HHH---HhcCCC--ccccchhcCC-----------CCCCC--CCCCCCCC Confidence 011 2332222 211 122332 4433222211 11111 11122222 No 118 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=97.51 E-value=5.3e-05 Score=44.05 Aligned_cols=384 Identities=13% Similarity=0.142 Sum_probs=164.0 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) |-.+|+| +.+...... ..++..... ++......+.+.+. .+. -+.. +..+++|-|..||+-|.+.+ T Consensus 2 ~m~~f~~-~~~~~~~~~-~~~~~~~~~-~~~~~~~~~~~~~~----~~~------~v~~-~~al~~~~v~~~i~~ia~~i 67 (392) T protein:vir:39 2 ILPILNF-INQTNDPPE-VGSVQSYFP-DGNDAQIMESLLGD----NNE------WVSA-RAALRNSDLFSIILQLSSDL 67 (392) T ss_pred cchhhhh-hhccccccc-ccccccccc-cCchhhhhhhhcCC----CCc------eech-HHhhccHHHHHHHHHHHHhh Confidence 4455554 222211111 111111111 11111111222111 000 0111 23356899999999999965 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhh----HHHHHHHhhhhhhheeeeecCCCCCcceeEEE Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKC----HEILRNWYVDGRIYYHKVIDIKAPEEGIKEVR 156 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~----~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr 156 (563) .-+ |+ .+.+. .... +++-=+-...+ +.++..+++.|.-|..++.|.. ..+++|. T Consensus 68 a~l-----p~--~~~~~-----~~~~-------l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~---g~~~~L~ 125 (392) T protein:vir:39 68 AIV-----KI--NAEKK-----KNQG-------IIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNAN---GADMKWE 125 (392) T ss_pred ccC-----ce--eeccc-----hhhh-------HhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCC---CcEEEEE Confidence 432 22 22111 0111 22222223333 3455577888999988887743 3489999 Q ss_pred EeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceee Q lcl|NC_015287. 157 YIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISF 236 (563) Q Consensus 157 ~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y 236 (563) +|+|..|..++... ++ .-+|.|...+ +. ......++.+.|.| T Consensus 126 ~l~~~~v~~~~~~~------~~-----------------~~~y~~~~~~-------~~--------~~~~~~~~~~eiih 167 (392) T protein:vir:39 126 YLRPSQVNTYYFEY------EN-----------------GMYYNITFDD-------PK--------IEPILQAPQSDLIH 167 (392) T ss_pred EEcCceeEEEEcCC------Cc-----------------eEEEEEEecC-------cc--------cceeEEEccccEEE Confidence 99999887553110 01 1122221110 00 01124566677755 Q ss_pred eecceecCCCC-ccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeC Q lcl|NC_015287. 237 CHSGLVDRNKQ-TILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDA 315 (563) Q Consensus 237 ~hSGl~d~~~~-~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~ 315 (563) .. ..+.++. .+.|.|..|...+.....+++...-+=---+--+-+..++.+..+..++.+.++ ..|+.. . T Consensus 168 ~~--~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~---~~~~~~----~ 238 (392) T protein:vir:39 168 MK--LLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRS---RSFMKR----S 238 (392) T ss_pred ec--CCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHH---HHHhcc----c Confidence 43 2344553 468999999999999988887765433333455667777665545444433322 233321 1 Q ss_pred CCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCChhhccCCCcceeccccchh Q lcl|NC_015287. 316 STGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEI 394 (563) Q Consensus 316 ~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eI 394 (563) +.|. .+ .+ ..|++++.|.-...-.| ++=.+|..+.+.++++||...|...+ .++.. T Consensus 239 ~~g~------~~-vl----------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~------~~~~~ 295 (392) T protein:vir:39 239 RSGG------PV-VL----------DDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQG------DQQSS 295 (392) T ss_pred cCCC------ee-ec----------CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC------CcccH Confidence 1221 11 11 12567777754333333 45567888999999999999994321 11111 Q ss_pred hHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_015287. 395 LRDEIKFTKFV-GRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDP 473 (563) Q Consensus 395 tRDElKF~KfI-~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p 473 (563) ...-..|..++ .-+-.++... |...|. +.-+++. ...+..|. ..+...+..+-. T Consensus 296 ~~~~~~f~~~~l~P~~~~ie~~----l~~~L~-----~~~~~d~-----~~~~~~d~-----------~~~~~~~~~l~~ 350 (392) T protein:vir:39 296 IQQISGMYASALNRYLRPAISE----LEYKLS-----DHISVNM-----RPAIDPLG-----------DNYLSTISTATR 350 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHH----HHHhcc-----ccccccc-----hhhhccCH-----------HHHHHHHHHHHh Confidence 11112244322 2233333222 222221 1111110 00010110 011111111100 Q ss_pred hhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCC Q lcl|NC_015287. 474 YVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGG 550 (563) Q Consensus 474 ~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~ 550 (563) +-. +|..|.- +.. ...|.. |++.-+.++- .|..+ ..+++|+| T Consensus 351 --~g~------------~t~nE~r---~~l---~~~g~~--p~e~r~~e~l-----------~~~~~--Gd~~~p~p 392 (392) T protein:vir:39 351 --WGA------------LAENQAT---FVL---QEAGYI--PKDLPAPENT-----------NKKTT--GQSNEPVP 392 (392) T ss_pred --CCC------------cCHHHHH---HHH---HhcCCC--ccccchhcCC-----------CCCCC--CCCCCCCC Confidence 011 2332222 211 122332 4433222211 11111 11122222 No 119 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=97.51 E-value=5.4e-05 Score=43.98 Aligned_cols=407 Identities=12% Similarity=0.019 Sum_probs=182.2 Q ss_pred CCCCcccceeccccccccccccchhhhH-HHHHHHHHHHhhccchh-------------------------HHHHHhhcc Q lcl|NC_015287. 26 SNEDSSDYYVSSGFYGQYVDIDGVFKSE-FELIKRYREMALHPEVD-------------------------SAIEDIINE 79 (563) Q Consensus 26 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~-~~LI~~YR~ma~~PEvd-------------------------~Ai~eIvne 79 (563) .++|....+ ..+.++ .....+|+.+..|.+=. -+|+..++= T Consensus 1 ~~~~~~~~i-------------~~l~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~ 67 (441) T protein:vir:80 1 MNSDELALI-------------EGMYDRIQRLSSWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEER 67 (441) T ss_pred CCccHHHHH-------------HHHHHHHHHHHHHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHHHhh Confidence 222221111 000000 01111222222222221 222222210 Q ss_pred eEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeC Q lcl|NC_015287. 80 AIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYID 159 (563) Q Consensus 80 aiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lD 159 (563) . + +..+.. .+ .+..+.|.+-.+|+....++++.-++.|+-|.+.-.| ++|...++.++ T Consensus 68 l-~------~~g~~~-----~d------~~~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d----~~g~~~i~~~~ 125 (441) T protein:vir:80 68 L-D------WLGWTN-----GD------GYGLDGVYAANRLATASCDVHLDALIFGLSFVAIIPH----GDGTVSVRPQS 125 (441) T ss_pred h-c------cccccC-----CC------hHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeC----CCCceEEEEEc Confidence 0 0 001110 11 1345667777889999999999999999998876554 46778899999 Q ss_pred CccceeeeeeeccCCccccce--eeecccccccccccccceeeecCccccccccccccCcCccccCcceeeeccc---ce Q lcl|NC_015287. 160 PLKIKLVRKLKTDNSLRGAIN--QVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPD---AI 234 (563) Q Consensus 160 P~~i~~vR~~~~~~~~~~~i~--~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~d---aI 234 (563) |+++..|.- +.....+. ++... ..++.....+|.+......-..+.+.-. . .....-+.. .| T Consensus 126 p~~~~~i~d----~~~~~~~~~~~~~~~-----~~~~~~~~~vy~~~~~~~~~~~~~~~~~-~---~~~~~~~~g~vPvv 192 (441) T protein:vir:80 126 PKNCTGKFS----ADGSRLDAGLVVQQT-----CDPEVVEAELLLPDVIVQVERRGSREWV-E---VDRIPNVLGAVPLV 192 (441) T ss_pred cceEEEEEe----CCCCceeEEEEEEEE-----ecCceEEEEEEecCeEEEEEEcCCccee-e---ccccccCCCceeEE Confidence 999887742 11112211 11111 1112222334444333222111110000 0 000000111 12 Q ss_pred eeeecceecCCCCccchhHHHHHHHHH-H-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeE Q lcl|NC_015287. 235 SFCHSGLVDRNKQTILSYLHKSIKALN-Q-LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLV 312 (563) Q Consensus 235 ~y~hSGl~d~~~~~~~syL~~Aik~~N-q-L~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklv 312 (563) .|.+.--.++. -+.|-|...++++- . -+++-+..++-+.+..|.|-+.=.+.+..+. +.. + T Consensus 193 ~~~n~~~~~~~--~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~G~~~~~~~~--------~~~---~---- 255 (441) T protein:vir:80 193 PIVNRRRTSRI--DGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVTGVSADEFSQ--------PGW---V---- 255 (441) T ss_pred EeeccccCCcc--CCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeeecCCcccccc--------chh---h---- Confidence 23322111111 12343433333321 1 2456677778888888877554222111110 000 0 Q ss_pred eeCCCCcccccccccchhhhhc-ccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCcceecccc Q lcl|NC_015287. 313 YDASTGEIRDDRKHMSMLEDFW-LPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRS 391 (563) Q Consensus 313 Yd~~TGev~d~~~~msMlEDyw-LpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~ 391 (563) ...| -+| +|.-++|.+.++..+|++.-=.-++-++=....++...++|.+-|...+ -+...+ T Consensus 256 --~~~~-------------~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~--~~~~Sg 318 (441) T protein:vir:80 256 --LSMA-------------SVWAVDKDDDGDTPNVGSFPVNSPTPYSDQMRLLAQLTAGEAAVPERYFGFIT--SNPPSG 318 (441) T ss_pred --hccc-------------ccccCCCCCCCCcceeEecCccchHHHHHHHHHHHHHHhcccCCCHHHhccCC--CcchHH Confidence 0001 122 3444445556777777643112233344445677777888877773221 111112 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015287. 392 SEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAV 471 (563) Q Consensus 392 ~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~ 471 (563) ..|.--+...-.-+.+.|+.|..-+...++.-+-+.|.. .++......+.+.|.....=. +.+.++.+.++ T Consensus 319 ~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~--~~~~~~~~~i~~~f~~~~~~~-------~~e~ad~~~kl 389 (441) T protein:vir:80 319 EALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSR--VDEADFFGDVGLRWRDASTPT-------RAATADAVTKL 389 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CcccccceeeeEEeCCCCCcC-------HHHHHHHHHHH Confidence 223333344666678888888888888887655554543 334443467888887633212 23555555555 Q ss_pred hhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCC Q lcl|NC_015287. 472 DPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEG 549 (563) Q Consensus 472 ~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~ 549 (563) ..-+-...|.++++ ..+..++.|++++++..+++.. .+..- .+.....+.++ T Consensus 390 ~~~g~~~~s~~~~~-~~l~~~~~e~~~~~~e~~e~~~--~~~~~-----------------------~~~~~~~~~~~ 441 (441) T protein:vir:80 390 VGAGILPADSRTVL-EMLGLDDVQVEAVMRHRAESSD--PLAVL-----------------------AGAISRQTNEV 441 (441) T ss_pred HhcCcccccHHHHH-HhCCCCHHHHHHHHHHHHHHHH--HHHHH-----------------------hhhhhcccccC Confidence 44422345767766 5667899888877664333211 11100 00011111111 No 120 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=97.47 E-value=6e-05 Score=43.71 Aligned_cols=412 Identities=12% Similarity=0.054 Sum_probs=170.5 Q ss_pred cCcCCCCcccceeccccccccccccchhh-hHHHHHHHHHHHhhccch---------------------------hHHHH Q lcl|NC_015287. 23 VPQSNEDSSDYYVSSGFYGQYVDIDGVFK-SEFELIKRYREMALHPEV---------------------------DSAIE 74 (563) Q Consensus 23 ~p~~~~d~~~~~~~~~~~~~~~~~~~~~~-~~~~LI~~YR~ma~~PEv---------------------------d~Ai~ 74 (563) +-+.+.+. +-..+- .-.....+|+.+..+.+- ..+|+ T Consensus 1 ~~~~t~~~---------------~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd 65 (456) T protein:vir:79 1 MTASTPAE---------------WLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRD 65 (456) T ss_pred CCCCCHHH---------------HHHHHHHHHHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHH Confidence 11111100 000000 001112223333333333 33333 Q ss_pred HhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeE Q lcl|NC_015287. 75 DIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKE 154 (563) Q Consensus 75 eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~e 154 (563) ..+.=+ ..+||.+...+ ...+.+.+..+.+-.+|+....++++.-++.|+-|.+.-.| .+|-.. T Consensus 66 ~~~~~l-----~~~g~~~~~~~-------d~~~~~~~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~----edg~~~ 129 (456) T protein:vir:79 66 SVADRI-----IPNGITVGGSA-------DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR----DDGTAT 129 (456) T ss_pred HHHhhh-----ccCCeecCCCC-------CccHHHHHHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeC----CCCceE Confidence 333211 12244432211 01223345566677789999999999999999988875444 346667 Q ss_pred EEEeCCccceeeeeeeccCCcccccee-eecccccccccccccceeeecCcccccccccc----ccCcCccccCccee-- Q lcl|NC_015287. 155 VRYIDPLKIKLVRKLKTDNSLRGAINQ-VNSKNPADIENPEIEEYYQYDPSQTQGKNALG----AIGQTPFSTKQRPV-- 227 (563) Q Consensus 155 Lr~lDP~~i~~vR~~~~~~~~~~~i~~-~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g----~~g~~~~~~~~~~i-- 227 (563) ++.++|+.+-.+.- +.....+.. ++--... ......+.+|.+.+....-..+ -............. T Consensus 130 i~~~~p~~~~~i~d----~~~~~~~~~~~~~~~~~---d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 202 (456) T protein:vir:79 130 ITADSPETMVVSVD----PLQPWRIRSAMRWWRDL---DAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVP 202 (456) T ss_pred EEEeccceeEEEEc----CCCCCceEEEEEEEEec---CCceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceee Confidence 88999998876631 111111111 1100000 0000111122222211100000 00000000000000 Q ss_pred ------eecccceeeeecceecCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHH Q lcl|NC_015287. 228 ------KIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYL 300 (563) Q Consensus 228 ------ki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~-m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl 300 (563) .+..=+|++. ++.-..|=++..+.....+. ++-+.++.-..+-.|.|-+.=.+- .+|.. T Consensus 203 ~~~~~~~~~~~pvv~~-------~N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~-~~~~~------ 268 (456) T protein:vir:79 203 VGDAVVTGSPPPVVVY-------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEH-RLPKV------ 268 (456) T ss_pred cccccCCCCceeEEEe-------cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCc-ccccc------ Confidence 0111122222 12223455555444333322 222333333333333332211110 01110 Q ss_pred HHHHHhhcceeEeeCCCCccccc-ccccchhhhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCChhh Q lcl|NC_015287. 301 KEVMNRYRNKLVYDASTGEIRDD-RKHMSMLEDFWLPRREGGRGTEITTLPGGQNLG-ELSDIEYFQKKLYRSLGVPESR 378 (563) Q Consensus 301 ~~im~k~knklvYd~~TGev~d~-~~~msMlEDywLpRReGgrgTEIsTLpGg~nLg-ei~DV~YF~~kLy~aL~VP~sR 378 (563) | .+|+.-+. ..+....--.|+ +..|.+|..++.. +++ -.+-++-+-..++...++|..- T Consensus 269 -------------d-~~g~~i~~~~~~~~~~~~~~~----~~~~~~~~q~~~~-~~~~~~~~l~~~i~~i~~~t~~p~~~ 329 (456) T protein:vir:79 269 -------------D-ENGNAIDYASIFEAAPGALWE----LPPGVDIWESQTN-DFTPMLSAIKEHIRQLSSATKTPLPM 329 (456) T ss_pred -------------c-ccccccchhhhhhhhcccccc----CCCCcceeeeccc-ChHHHHHHHHHHHHHHHhhcCCChhH Confidence 0 11111000 000000001122 1123444444433 222 2334667777788888999998 Q ss_pred ccCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHH Q lcl|NC_015287. 379 IAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKET 458 (563) Q Consensus 379 l~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~ 458 (563) |.... -|. .+..|---+..+-.-+.+.|+.|..-+.+.++.-+.+.|.. + + ..|++.|..-..=+. T Consensus 330 ~~~~~--~N~-Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~g~~--~-~----~~i~v~w~~~~~~s~---- 395 (456) T protein:vir:79 330 LMPDS--ANQ-SAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES--V-E----DTVDVSFESPDRVTL---- 395 (456) T ss_pred hcccc--cCc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--c-c----ccceEEeCCCCCcCH---- Confidence 85321 122 22234444455677788899999999999999888888852 2 1 347888865332222 Q ss_pred HHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCC Q lcl|NC_015287. 459 ELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQ 538 (563) Q Consensus 459 Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~ 538 (563) .+.++++..+..- .+.|.+. ...+|.++++|+++.+.+...+...+....|-+..+++.+ T Consensus 396 ---~~~ada~~kl~~~--G~~~~~~-~~~~lg~~~~~i~~~e~~r~~~e~~~~~~~~~~~~~~~~~-------------- 455 (456) T protein:vir:79 396 ---GEKYSAASLAKAA--GESWASI-RRNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQEDGS-------------- 455 (456) T ss_pred ---HHHHHHHHHHHhc--CCChHHH-HHhcCCCCHHHHHHHHHHHHHHHHHHHhhhHhhcCCCCCC-------------- Confidence 2445555554321 2455554 4578999999887654433333222221122111111111 Q ss_pred Cccccccc Q lcl|NC_015287. 539 DPNMGGAP 546 (563) Q Consensus 539 ~~~~~~~~ 546 (563) . T Consensus 456 -------~ 456 (456) T protein:vir:79 456 -------R 456 (456) T ss_pred -------C Confidence 0 No 121 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=97.44 E-value=6.8e-05 Score=43.44 Aligned_cols=403 Identities=14% Similarity=0.112 Sum_probs=181.8 Q ss_pred cccccccccchhhhH-HHHHHHHHHHhhccchhHHH--------------------HHhhcceEEecCCCcEEEEEeecc Q lcl|NC_015287. 39 FYGQYVDIDGVFKSE-FELIKRYREMALHPEVDSAI--------------------EDIINEAIVSDQNDSPVEIDLENL 97 (563) Q Consensus 39 ~~~~~~~~~~~~~~~-~~LI~~YR~ma~~PEvd~Ai--------------------~eIvneaiv~d~~~~pV~l~l~~~ 97 (563) +.- ..-..+-+. ...+.+|+.+..+.+-+..| .-||+ ..+.=--..||.+..++ T Consensus 1 l~~---~~l~~~i~~~~~~~~r~~~l~~yy~g~~~il~~~~~~~~~~~~ki~~n~~~~ivd-~~~~~l~g~~~~~~~~~- 75 (429) T protein:vir:98 1 MTK---DLLSELIQKHRSFNLSYSAYKQLYEGDHAILQQKQKEQYKPDNRLVVNFAKYIVD-TFNGYFIGVPVQTSHEN- 75 (429) T ss_pred CCH---HHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccCCCcceeecchHHHHHH-HHhhhhcccCceeecCC- Confidence 000 111111111 12234444444444433211 11111 01111123455555443 Q ss_pred ccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccc Q lcl|NC_015287. 98 PASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRG 177 (563) Q Consensus 98 ~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~ 177 (563) +...+.++.+.+-.+|+....++.|.+++.|+-|.+..+| ++|-..++.++|+.+-.|..-..+ .. T Consensus 76 -------~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d----~~g~~~~~~~~p~~~~~v~dd~~~---~~ 141 (429) T protein:vir:98 76 -------KQVSNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFND----ENAEAGITYLTPLEAFIVYDDSIR---QK 141 (429) T ss_pred -------hHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEec----CCCcEEEEEEcccceEEEEeCCCC---Cc Confidence 2455667778888899999999999999999998876665 468899999999998777432111 12 Q ss_pred cceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeecceecCCCCccchhHHHHH Q lcl|NC_015287. 178 AINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSI 257 (563) Q Consensus 178 ~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Ai 257 (563) .+-.++--. .-.....+..|...........+.+.. . .+...--+..=+|++. .|+....|-++..+ T Consensus 142 ~~~~i~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~--~-~~~~~~~~g~vPvv~~------~n~~~g~sd~e~v~ 208 (429) T protein:vir:98 142 PLFAVRYFY----NKGGVLEGSYSDASNITYFKDGEKGIE--I-GESEPHPFDGVPMIEY------VENEERQSLLASVV 208 (429) T ss_pred eEEEEEEEE----ecCceEEEEEEeCceEEEEEecCCceE--e-cccccccCCccceEEe------cCCCCCCCcHHHHH Confidence 221111000 001111111222211111110000000 0 0000000111123221 12334456677666 Q ss_pred HHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhccc Q lcl|NC_015287. 258 KALNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLP 336 (563) Q Consensus 258 k~~NqL~m-~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLp 336 (563) .....+.. +-+....-+..+.|.+-+.-.+. .. +-+++++ . ++++ .+| T Consensus 209 ~liD~~d~~~s~~~~~~~~~~~p~~~i~g~~~---~~----~~~~~~~-~--~~~~---------------------~~~ 257 (429) T protein:vir:98 209 TLINAFNKAISEKANDVEYFADAYLKILGAEL---DD----ETLKSLR-D--TRII---------------------NLK 257 (429) T ss_pred HHHHHHHHHHHHHHHHHHHhcCceeeeecCCC---Cc----chhhhHh-h--Ccee---------------------ecc Confidence 66665543 44555556777888776653322 11 1112111 1 1111 111 Q ss_pred ccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCChhhccCCCcceeccccchhhH--HHHHHHHHHHHHHHHHH Q lcl|NC_015287. 337 RREGGRGTEITTLPGGQNLGELSD-IEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILR--DEIKFTKFVGRMRKRFS 413 (563) Q Consensus 337 RReGgrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItR--DElKF~KfI~rLr~rFs 413 (563) - +||.+..+..|--..+.+.... ++-+.+.+|+...+|- +..+ + +|.+|.... -+.....-+.+.|+.|. T Consensus 258 ~-~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~--~~~~--~--~gn~Sg~Al~~~~~~l~~k~~~~~~~~~ 330 (429) T protein:vir:98 258 D-TDAQQLTVEFLQKPDADATQEHLLDRLENLIFRTAMVAN--ISDE--S--FGTASGIALRYRLQAMDNLAKTKERKFM 330 (429) T ss_pred C-CCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCccc--cCcc--c--cccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 2344555665544445544433 6778888899999993 3322 2 244333222 22223445566666666 Q ss_pred HHHHHHHHHHHHhcCCCC-hhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccC Q lcl|NC_015287. 414 HLFNDMLKTQLILKNIVT-PEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQK 492 (563) Q Consensus 414 ~if~d~Lk~qLiLkgI~t-~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~t 492 (563) .-+.++++.=+-+-++.. ..+| ..|.+.|.....-.+. +.++++.++ +| .+|.++++..+-.-+ T Consensus 331 ~~l~~~~~li~~~~~~~~~~~d~----~~i~v~f~~~~p~~~~-------~~a~~~~kl---~g-~is~et~~~~l~~v~ 395 (429) T protein:vir:98 331 SGMNRRYKLIASYPTSKIGPKDW----IGIKYKFTRNLPANLL-------EESQIAGNL---AG-IVSEETQVGVLSIVE 395 (429) T ss_pred HHHHHHHHHHHHHhccCCCcccc----ccceEEeCCCCCcCHH-------HHHHHHHHH---hc-cCchHHHHHhCCCCC Confidence 666665544333323222 2222 3477888654333332 334455555 24 589999998775444 Q ss_pred hhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcc Q lcl|NC_015287. 493 DEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPN 541 (563) Q Consensus 493 deei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~ 541 (563) | .+++.++|++|.... ++ + + ...-.+..++.+.+ T Consensus 396 d--~~~E~~ri~~E~~~~-~~-~-~----------~~~~~~~~~~~~~~ 429 (429) T protein:vir:98 396 N--PQKEIERKNSDKSTL-IS-R-Q----------AGGLNGQNTTTILE 429 (429) T ss_pred C--HHHHHHHHHHHHHHH-HH-H-H----------HhhhcCCCCCCCCC Confidence 4 334444555554321 00 0 0 00111111222222 No 122 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=97.43 E-value=6.8e-05 Score=43.42 Aligned_cols=404 Identities=12% Similarity=0.080 Sum_probs=179.7 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) |.=|+++ .++ . +..+..+-...... -.++.+..... +.. ..-..++-|..||+-|.+.+ T Consensus 1 Mg~f~~~-~~r-------~-~~~~~~~~~~~~~~-~~~~~~~~~~~---~~~--------~~al~~~~v~~cv~~Ia~~i 59 (416) T protein:vir:81 1 MGIFYKN-EKR-------D-LQYNEDDLQMMVQT-LPGFQGTKLRQ---YKD--------IEAIRHSDIFTAVMMIASDL 59 (416) T ss_pred CCccccc-ccc-------c-ccCCCcchhHHHHH-hccccccCccc---cch--------hhhhcchHHHHHHHHHHHhh Confidence 5443333 111 0 11111111111101 11111110000 000 12356788999999998866 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcch--------hhhHHHHHHHhhhhhhheeeeecCCCCCcce Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFD--------DKCHEILRNWYVDGRIYYHKVIDIKAPEEGI 152 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~--------~~~~~~~R~WYvDGri~fhkviD~~~~k~GI 152 (563) .-+ |+.+- ++.+... -+.++.+|+-+ .=-+.++..+..+|.-|..++.|.. .=+ T Consensus 60 A~~-----p~~~~-~~~~~~~---------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~---G~~ 121 (416) T protein:vir:81 60 ARM-----PIRVT-VNGQINY---------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT---GEP 121 (416) T ss_pred ccC-----ceEEe-cCccccc---------cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC---CcE Confidence 543 34332 2222111 12234444322 1135666777889998888877643 117 Q ss_pred eEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeeccc Q lcl|NC_015287. 153 KEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPD 232 (563) Q Consensus 153 ~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~d 232 (563) ++|.+|+|..|..++ +.. +. ..|+.+.. .+.++ .....++.+ T Consensus 122 ~~L~~i~~~~v~v~~----~~~--g~-----------------~~~~~~~~--------~~~~~-------~~~~~~~~~ 163 (416) T protein:vir:81 122 MNLTFRKTSEIELKS----DAR--GR-----------------LYYFHQRI--------DSNGN-------NIERNVKFE 163 (416) T ss_pred EEEEEEcCceeEEEE----CCC--cc-----------------EEEEEEEe--------cCCCc-------eeEEEEccc Confidence 899999998876542 111 11 01222111 00000 111345566 Q ss_pred ceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHH-HHHhhccee Q lcl|NC_015287. 233 AISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKE-VMNRYRNKL 311 (563) Q Consensus 233 aI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~-im~k~knkl 311 (563) .|.|.. .++.++-.+.|-|+.|++++......++..-=+=-.-+--+-|..++ |.+...+|.+=+++ +...|. T Consensus 164 evihir--~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~~~~~~~~~~~~~~~~~~~~--- 237 (416) T protein:vir:81 164 DMLDIK--FYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFHKSFS--- 237 (416) T ss_pred cEEEec--cCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC-CCCCCHHHHHHHHHHHHHHhc--- Confidence 664433 24555666789999999999888888887664444445566677777 45543444333333 222221 Q ss_pred EeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCChhhccCCCcceeccc Q lcl|NC_015287. 312 VYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGR 390 (563) Q Consensus 312 vYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~ 390 (563) | .++..+.+ .++ .|.+++.|.-...-.| ++-.++.++.+.++++||.+.|..+..++ T Consensus 238 ------g-~~nag~~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~---- 295 (416) T protein:vir:81 238 ------G-TKQAGKVV-VLD----------ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANM---- 295 (416) T ss_pred ------C-ccccCcee-ecC----------CCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCc---- Confidence 1 11111222 121 1455665532222222 34456678899999999999995433222 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 391 SSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAA 470 (563) Q Consensus 391 ~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~ 470 (563) .+.-..+-|..-|.-+-.++...|...| .++ +. ...| .|.-+ ++.... ...|++.++. T Consensus 296 --~~~~~~~~~~~~l~P~~~~ie~~ln~~l---------~~~--~~--~~~~--~f~~~----~l~~~D-~~~~~~~~~~ 353 (416) T protein:vir:81 296 --SITDANLDYLSTLKPYITCVCAELNFKF---------NDE--YV--NREF--KFDTT----EIRVVD-EKTQAEIDKI 353 (416) T ss_pred --cHHHHHHHHHHHHHHHHHHHHHHHhhhc---------ccc--cc--CceE--EEech----hhhccC-HHHHHHHHHH Confidence 2233334454444444444444444332 121 21 1234 44322 221111 2344444433 Q ss_pred hhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCC Q lcl|NC_015287. 471 VDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGG 550 (563) Q Consensus 471 ~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~ 550 (563) +-.- --||..|+- ++..-++++.++...-.-+. ..-|.+ +....+... T Consensus 354 ~~~~--------------G~~T~NE~R-------~~~gl~p~~~gd~~~~~~~~---------n~~~~~--~~~~~~~~~ 401 (416) T protein:vir:81 354 NIDS--------------GKMNIDEIR-------QRDGLAPIPGGNGSIHRVDL---------NHVNIE--LVDEYQMNK 401 (416) T ss_pred HHhC--------------CCcCHHHHH-------HHhCCCCCCCCCcceEeecc---------cccccc--cccccCccc Confidence 2211 124554433 12344455444322111110 000111 112222233 Q ss_pred cccCCCCCCCCCC Q lcl|NC_015287. 551 AEQAPQMPEGGEI 563 (563) Q Consensus 551 ~~~~~~~~~~~~~ 563 (563) +.....-.+|||- T Consensus 402 ~~~~~~~~kgGe~ 414 (416) T protein:vir:81 402 SRATDKKLKGGEE 414 (416) T ss_pred ccccccccCCCCC Confidence 3334455677777 No 123 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=97.43 E-value=6.8e-05 Score=43.42 Aligned_cols=404 Identities=12% Similarity=0.080 Sum_probs=179.7 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) |.=|+++ .++ . +..+..+-...... -.++.+..... +.. ..-..++-|..||+-|.+.+ T Consensus 1 Mg~f~~~-~~r-------~-~~~~~~~~~~~~~~-~~~~~~~~~~~---~~~--------~~al~~~~v~~cv~~Ia~~i 59 (416) T protein:vir:45 1 MGIFYKN-EKR-------D-LQYNEDDLQMMVQT-LPGFQGTKLRQ---YKD--------IEAIRHSDIFTAVMMIASDL 59 (416) T ss_pred CCccccc-ccc-------c-ccCCCcchhHHHHH-hccccccCccc---cch--------hhhhcchHHHHHHHHHHHhh Confidence 5443333 111 0 11111111111101 11111110000 000 12356788999999998866 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcch--------hhhHHHHHHHhhhhhhheeeeecCCCCCcce Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFD--------DKCHEILRNWYVDGRIYYHKVIDIKAPEEGI 152 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~--------~~~~~~~R~WYvDGri~fhkviD~~~~k~GI 152 (563) .-+ |+.+- ++.+... -+.++.+|+-+ .=-+.++..+..+|.-|..++.|.. .=+ T Consensus 60 A~~-----p~~~~-~~~~~~~---------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~---G~~ 121 (416) T protein:vir:45 60 ARM-----PIRVT-VNGQINY---------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT---GEP 121 (416) T ss_pred ccC-----ceEEe-cCccccc---------cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC---CcE Confidence 543 34332 2222111 12234444322 1135666777889998888877643 117 Q ss_pred eEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeeccc Q lcl|NC_015287. 153 KEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPD 232 (563) Q Consensus 153 ~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~d 232 (563) ++|.+|+|..|..++ +.. +. ..|+.+.. .+.++ .....++.+ T Consensus 122 ~~L~~i~~~~v~v~~----~~~--g~-----------------~~~~~~~~--------~~~~~-------~~~~~~~~~ 163 (416) T protein:vir:45 122 MNLTFRKTSEIELKS----DAR--GR-----------------LYYFHQRI--------DSNGN-------NIERNVKFE 163 (416) T ss_pred EEEEEEcCceeEEEE----CCC--cc-----------------EEEEEEEe--------cCCCc-------eeEEEEccc Confidence 899999998876542 111 11 01222111 00000 111345566 Q ss_pred ceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHH-HHHhhccee Q lcl|NC_015287. 233 AISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKE-VMNRYRNKL 311 (563) Q Consensus 233 aI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~-im~k~knkl 311 (563) .|.|.. .++.++-.+.|-|+.|++++......++..-=+=-.-+--+-|..++ |.+...+|.+=+++ +...|. T Consensus 164 evihir--~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~~~~~~~~~~~~~~~~~~~~--- 237 (416) T protein:vir:45 164 DMLDIK--FYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFHKSFS--- 237 (416) T ss_pred cEEEec--cCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC-CCCCCHHHHHHHHHHHHHHhc--- Confidence 664433 24555666789999999999888888887664444445566677777 45543444333333 222221 Q ss_pred EeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCChhhccCCCcceeccc Q lcl|NC_015287. 312 VYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGR 390 (563) Q Consensus 312 vYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~ 390 (563) | .++..+.+ .++ .|.+++.|.-...-.| ++-.++.++.+.++++||.+.|..+..++ T Consensus 238 ------g-~~nag~~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~---- 295 (416) T protein:vir:45 238 ------G-TKQAGKVV-VLD----------ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANM---- 295 (416) T ss_pred ------C-ccccCcee-ecC----------CCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCc---- Confidence 1 11111222 121 1455665532222222 34456678899999999999995433222 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 391 SSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAA 470 (563) Q Consensus 391 ~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~ 470 (563) .+.-..+-|..-|.-+-.++...|...| .++ +. ...| .|.-+ ++.... ...|++.++. T Consensus 296 --~~~~~~~~~~~~l~P~~~~ie~~ln~~l---------~~~--~~--~~~~--~f~~~----~l~~~D-~~~~~~~~~~ 353 (416) T protein:vir:45 296 --SITDANLDYLSTLKPYITCVCAELNFKF---------NDE--YV--NREF--KFDTT----EIRVVD-EKTQAEIDKI 353 (416) T ss_pred --cHHHHHHHHHHHHHHHHHHHHHHHhhhc---------ccc--cc--CceE--EEech----hhhccC-HHHHHHHHHH Confidence 2233334454444444444444444332 121 21 1234 44322 221111 2344444433 Q ss_pred hhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCC Q lcl|NC_015287. 471 VDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGG 550 (563) Q Consensus 471 ~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~ 550 (563) +-.- --||..|+- ++..-++++.++...-.-+. ..-|.+ +....+... T Consensus 354 ~~~~--------------G~~T~NE~R-------~~~gl~p~~~gd~~~~~~~~---------n~~~~~--~~~~~~~~~ 401 (416) T protein:vir:45 354 NIDS--------------GKMNIDEIR-------QRDGLAPIPGGNGSIHRVDL---------NHVNIE--LVDEYQMNK 401 (416) T ss_pred HHhC--------------CCcCHHHHH-------HHhCCCCCCCCCcceEeecc---------cccccc--cccccCccc Confidence 2211 124554433 12344455444322111110 000111 112222233 Q ss_pred cccCCCCCCCCCC Q lcl|NC_015287. 551 AEQAPQMPEGGEI 563 (563) Q Consensus 551 ~~~~~~~~~~~~~ 563 (563) +.....-.+|||- T Consensus 402 ~~~~~~~~kgGe~ 414 (416) T protein:vir:45 402 SRATDKKLKGGEE 414 (416) T ss_pred ccccccccCCCCC Confidence 3334455677777 No 124 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=97.42 E-value=7.2e-05 Score=43.31 Aligned_cols=268 Identities=14% Similarity=0.164 Sum_probs=136.9 Q ss_pred CcEEEEEe-eccccchHHHHHHHHHHHHHHHH-----hcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCC Q lcl|NC_015287. 87 DSPVEIDL-ENLPASKQLKDLIREEFKTVKEI-----LNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDP 160 (563) Q Consensus 87 ~~pV~l~l-~~~~~s~sik~~I~eeF~~i~~l-----l~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP 160 (563) -.-+.+.+ ++++-. .....++|+. +....=-+.+++.++++|.-|..++.|.+ ..+++|.+|+| T Consensus 1 ia~l~~~~~~~~~~~-------~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~---G~~~~l~~l~~ 70 (278) T protein:vir:78 1 MASLPLKMYEDYKVV-------NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY---HQPSKLFLLNP 70 (278) T ss_pred CccceeEEEecCccc-------ccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCC---CcEEEEEEECC Confidence 11122222 222111 1223333321 11222245566777888999998887632 23899999999 Q ss_pred ccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeecc Q lcl|NC_015287. 161 LKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSG 240 (563) Q Consensus 161 ~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSG 240 (563) -.|...+. ++. ...||.|... +...+.++.+.|.|.. . T Consensus 71 ~~v~v~~~---~~~--------------------~~~~y~~~~~------------------~g~~~~~~~~evih~~-~ 108 (278) T protein:vir:78 71 DVVEMLIE---NQS--------------------RELYYSIHAA------------------TGNKLIVHNMDMLHFK-H 108 (278) T ss_pred ceeEEEEc---CCC--------------------ceEEEEEEcC------------------CceEEEEccccEEEEC-C Confidence 88764321 110 1124444321 0123556677775442 1 Q ss_pred eecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcc Q lcl|NC_015287. 241 LVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEI 320 (563) Q Consensus 241 l~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev 320 (563) ....++-.+.|.+..|.+++.....++..- .+...+.| .-+++ .-+.|.+..+++..+. +++.. + +.| T Consensus 109 ~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~-~~~~~~~~-~~i~~-~~~~l~~e~~~~~~~~----~~~~~--~-~~g-- 176 (278) T protein:vir:78 109 IVASNMVQGISPIDVLKNTTDFDNAVRTFN-LTEMQKPD-SFMLK-YGSNVGKEKRQQVLED----FKQYY--E-ENG-- 176 (278) T ss_pred CCCCCCeeeccHHHHHHHHHHHHHHHHHHH-HHHhcCCC-cEEEE-eCCCCCHHHHHHHHHH----HHHHh--c-cCC-- Confidence 112334456899999999999888887764 45555554 44444 4467777666554443 33221 1 223 Q ss_pred cccccccchhhhhcccccCCCCccceeecCCCCCcchHHH---HHHHHHHHHHhcCCChhhccCCCcceeccccchhhHH Q lcl|NC_015287. 321 RDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSD---IEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRD 397 (563) Q Consensus 321 ~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~D---V~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRD 397 (563) +.+ .++ .|+++..|. .+.-+++- .++..+.+.++++||.+.+...+ +-+. +.+.-. T Consensus 177 ----~~~-vl~----------~g~~~~~l~--~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~-~~~~---sn~~~~ 235 (278) T protein:vir:78 177 ----GIL-FQE----------PGVEIEPLP--KKYVSEDIVASENLTRERVANVFQLPSVFLNARS-NTNF---AKNEEL 235 (278) T ss_pred ----Cce-ecC----------CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC-CCCc---ccHHHH Confidence 222 221 256777774 33334333 35788999999999999985322 1121 112222 Q ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccch Q lcl|NC_015287. 398 EIKFTKF-VGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNH 451 (563) Q Consensus 398 ElKF~Kf-I~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~ 451 (563) ...|..+ |..+..++...|. .. ++++.|+.. ..+|+|+... - T Consensus 236 ~~~~~~~~l~P~~~~i~~~ln----~~-----L~~~~e~~~-g~~~~f~~~~--l 278 (278) T protein:vir:78 236 NRFYLQHTLLPIVKQYEEEFN----RK-----LLTKTDREK-IGILNLTLNL--I 278 (278) T ss_pred HHHHHHHHHHHHHHHHHHHHH----hh-----cCChhHhcC-CceEEEeccc--C Confidence 2235443 4444444444433 33 255555552 2455555443 2 No 125 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=97.41 E-value=7.4e-05 Score=43.22 Aligned_cols=404 Identities=11% Similarity=0.064 Sum_probs=170.4 Q ss_pred ccccchhhhHHHH-------HHHHHHHhhccchhH-----------------------------------HHHHhhcceE Q lcl|NC_015287. 44 VDIDGVFKSEFEL-------IKRYREMALHPEVDS-----------------------------------AIEDIINEAI 81 (563) Q Consensus 44 ~~~~~~~~~~~~L-------I~~YR~ma~~PEvd~-----------------------------------Ai~eIvneai 81 (563) ++++.-.+-...+ +.+|..+..+.+-.. =...||+-.+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 80 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKK 80 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhh Confidence 3333222211222 222333332222221 1222333111 Q ss_pred EecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCc Q lcl|NC_015287. 82 VSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPL 161 (563) Q Consensus 82 v~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~ 161 (563) .=--..||.+..+|. ...+.++.+.+ .+|+....++.+.+++.|+-|.+.-+|.+ +|-..+..+||+ T Consensus 81 -~yl~G~p~~~~~~~~--------~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~---~g~~~~~~~~p~ 147 (471) T protein:vir:10 81 -AYALTYPPTFDVDDK--------KVNDMIVDVLG-DDYERISKQLCVNAGNAGIAWLHVWKDAS---DNSFRYACVDSK 147 (471) T ss_pred -hhhcccCceeccCCh--------HHHHHHHHHHh-cCHHHHHHHHHHHHhhCCeEEEEEEeeCC---CCeeEEEEEccc Confidence 111346677666542 34445555655 57999999999999999999999888754 567889999999 Q ss_pred cceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCcc------------ccCc---ce Q lcl|NC_015287. 162 KIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPF------------STKQ---RP 226 (563) Q Consensus 162 ~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~------------~~~~---~~ 226 (563) .+..|..-..+......+...... ..........+-+|.+.+.......+.+..... .+.. .. T Consensus 148 ~~~~i~d~~~~~~~~~~ir~~~~~--~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (471) T protein:vir:10 148 EVIPIYSKSLDKKSIGVLRVYSSI--DETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNS 225 (471) T ss_pred ceEEEEcCCCCCceEEEEEEEEee--ccCCCceeEEEEEEeCCcEEEEEecCCccccccccccccccccccccccccccc Confidence 987764322111111112111000 000001111122232222111111000000000 0000 00 Q ss_pred ee--ecccceeeeecceecCCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHH Q lcl|NC_015287. 227 VK--IAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEV 303 (563) Q Consensus 227 ik--i~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m-~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~i 303 (563) .. +..=+|++. .|+....|-|+..+.....+.+ +=+..-.-+-+..|-+-+.-.+.-.++ +.+.. T Consensus 226 ~~~~~g~iPvv~~------~n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~-----~~~~~- 293 (471) T protein:vir:10 226 FKHDFGLVPFIPF------KNNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQ-----EFLED- 293 (471) T ss_pred ccCCCCceeEEEe------ccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccc-----hhHHH- Confidence 00 000022221 1222334556554444443332 222222334445553333222211111 11111 Q ss_pred HHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCChhhccCC Q lcl|NC_015287. 304 MNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDIEYFQKKLYRSLGVPESRIAGS 382 (563) Q Consensus 304 m~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~ 382 (563) +.+ .+.+.-..+| .+.|-.++.|--..+... -.-+.-..+.+|...++|- +..+ T Consensus 294 ~~~--~~~i~~~~~~---------------------~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~--~~~~ 348 (471) T protein:vir:10 294 LKR--YKMIKMDNDG---------------------MGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVN--PETD 348 (471) T ss_pred hhc--CCeEEecCCC---------------------CccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcC--CCcc Confidence 111 1122111111 122233444443333332 2345667778888888883 2222 Q ss_pred CcceeccccchhhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHH Q lcl|NC_015287. 383 GDGFNLGRSSEILRDEIKFTK---FVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETE 459 (563) Q Consensus 383 ~~g~~~g~~~eItRDElKF~K---fI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~E 459 (563) + +|.+|..... .++.- -+.+.|+.|...|.++++.=+-+-|+ .+|. .|.+.|...---.+...++ T Consensus 349 --~--~gn~Sg~Alk-~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~---~d~~----~i~i~f~~~~p~n~~e~~~ 416 (471) T protein:vir:10 349 --K--LGNSSGVALK-FLYSLLELKAGNMETQFRSGYATLVKMILKHLGL---SDKL----KIKQTWTRNSINNDTEMAQ 416 (471) T ss_pred --c--ccCccHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc---CCCc----eeEEEeCCCCCCCHHHHHH Confidence 2 2444443332 33333 46777777777777776543322232 3443 4677777655444433333 Q ss_pred HHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCC---CCCCCcccCccccc Q lcl|NC_015287. 460 LLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDG---KLADPMEVQQLQLG 524 (563) Q Consensus 460 i~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~---~~~~p~~~~~~~~~ 524 (563) +++.+ .| .+|.+++++.+-..+| .+++.++|++|..+- ...-++..++++.. T Consensus 417 -------~~~kl---~g-~iS~et~~~~~p~v~D--~~~E~eri~~E~~~~~~~~~~~~~~~~~~e~~ 471 (471) T protein:vir:10 417 -------VVSTL---AT-ITSRENVAKSNPIVED--WQDELRLQKAEQEGRSEKLYDMEEVEHESEVE 471 (471) T ss_pred -------HHHHH---hc-cCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhcccccCCCCCccccC Confidence 33333 34 4899999998876664 233344454443221 11111111111111 No 126 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=97.40 E-value=7.5e-05 Score=43.21 Aligned_cols=380 Identities=12% Similarity=0.136 Sum_probs=161.1 Q ss_pred ccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEec Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSD 84 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d 84 (563) ||| +.+.... +. + ++.+.+.-.....+++.+... .+.. +.. +..+++|-|..||+-|.+.+.-+ T Consensus 1 M~~-f~~~~~~-~~---~-~~~~~~~~~~~~~~~~~~~~~--~~~~------v~~-~~al~~~~v~~~i~~ia~~ia~~- 64 (386) T protein:vir:49 1 MPI-FNITNLA-TE---S-PPINQESFFDIADSDFLASLN--SSEW------VSA-ENALKNSDLFSIISQLSNDLATA- 64 (386) T ss_pred Cch-hhhhccC-CC---C-cccchhhhhhhhhcccccccc--CCce------ech-hhhhccHHHHHHHHHHHHHhhhC- Confidence 444 2221111 11 1 111111111111111111100 0000 011 12346899999999999865432 Q ss_pred CCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhh----hHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCC Q lcl|NC_015287. 85 QNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDK----CHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDP 160 (563) Q Consensus 85 ~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~----~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP 160 (563) |+. +.+ ... +.++.--+-... -+.++..++..|.-|..++.|... -+++|.+|+| T Consensus 65 ----p~~--~~~-----~~~-------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g---~~~~l~~i~~ 123 (386) T protein:vir:49 65 ----KIT--TSR-----KQL-------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNG---RDMKWEYLRP 123 (386) T ss_pred ----cee--ecc-----chh-------hhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCC---cEEEEEEecC Confidence 222 111 111 122222222223 345667788899999887776432 3789999999 Q ss_pred ccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeecc Q lcl|NC_015287. 161 LKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSG 240 (563) Q Consensus 161 ~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSG 240 (563) ..|..++.-.. + .-+|.|... +. +......++.+.|.|.. T Consensus 124 ~~v~v~~~~~~-----~------------------~~~y~~~~~--------~~-------~~~~~~~~~~~evih~~-- 163 (386) T protein:vir:49 124 SQVSFNRLDNQ-----N------------------GLYYNITFD--------DP-------HIAPKQHVPQNDILHFR-- 163 (386) T ss_pred ceeEEEEcCCC-----c------------------eEEEEEEEc--------Cc-------cccceeEEccccEEEec-- Confidence 88765532100 0 112222110 00 01122456666664432 Q ss_pred eecCCCC-ccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCc Q lcl|NC_015287. 241 LVDRNKQ-TILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGE 319 (563) Q Consensus 241 l~d~~~~-~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGe 319 (563) ..+.++. .+.|.|..|++.+.....+++...-+--..+--+-+..++....+.. ++ .++..+... + .+.|. T Consensus 164 ~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~-~~-~~~~~~~~~-----~-~n~g~ 235 (386) T protein:vir:49 164 LLSVDGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGLLDF-KT-KVSRSRQAM-----K-QMQGG 235 (386) T ss_pred CCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChHH-HH-HHHHHHHHh-----c-cCCCC Confidence 3444443 45899999999999999998888766666677788888886554433 33 333323221 1 22332 Q ss_pred ccccccccchhhhhcccccCCCCccceeecCCCC-CcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHH Q lcl|NC_015287. 320 IRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQ-NLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDE 398 (563) Q Consensus 320 v~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~-nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDE 398 (563) + + .++ .|.+++.|.-.. .+.-++-.++....+.++++||.+.|..++++..-+ ...+ T Consensus 236 ~------~-vl~----------~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~-----~~~~ 293 (386) T protein:vir:49 236 P------L-VLD----------DLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDGDQQSSL-----EMIY 293 (386) T ss_pred c------e-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccchH-----HHHH Confidence 2 1 111 245666663221 222234457888999999999999996432211111 1112 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchh Q lcl|NC_015287. 399 IKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKY 478 (563) Q Consensus 399 lKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy 478 (563) ..+..| |+..+..+.. .|...|. . .+. |..+ ++.+... ..+...+..+- -+-. T Consensus 294 ~~~~~~---i~~~l~~i~~-~~~~~l~-~-------------~~~--~~~~----~~~~~d~-~~~~~~~~~l~--~~g~ 346 (386) T protein:vir:49 294 NIYFKS---VSRYLRPFVS-EMSKKLS-C-------------EVD--VDIS----PAVDPTG-SNYISLINSMV--KSGT 346 (386) T ss_pred HHHHHH---HHHHHHHHHH-HHHHHhc-c-------------hhc--ccch----hhhccCH-HHHHHHHHHHH--hCCC Confidence 222222 3333333222 1222221 1 111 1110 0000000 01111111110 0011 Q ss_pred hhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCC Q lcl|NC_015287. 479 FSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQD 539 (563) Q Consensus 479 ~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~ 539 (563) +|..|+-++ + ...|..+.+-... .........+|+.+..| T Consensus 347 ------------~t~nE~r~~---l---~~~~~~~~~~~~~---~~~~~~~~~gGd~~~~~ 386 (386) T protein:vir:49 347 ------------LAQNQGLYI---L---QQAEILPKELPDG---KNPNRTSLKGGEINEQD 386 (386) T ss_pred ------------cCHHHHHHH---H---hhCCCCCCcCcch---hccCCCCCCCCCCCCCC Confidence 333333221 1 1223322111110 01111122333333333 No 127 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=97.38 E-value=8.1e-05 Score=43.00 Aligned_cols=463 Identities=12% Similarity=0.111 Sum_probs=183.7 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) |......+..+.. .. .++ ...+..+..-.|++.| ..-+...+..+|-+.=+..+.+|-|..||+.|+|.+ T Consensus 31 ~~~~~~~~~~k~~------~~-~~~--~~~~~~~~~~~~~~g~-~~~~~~~~~~~l~~l~~~~~~npiv~~~I~~~a~~i 100 (547) T protein:vir:63 31 IQQREQEQISKAM------NN-KEV--AYSQPVIGSMSANPGF-KTKPSIRNNQDLHGVLKKFGGNIILNAIINTRSNQV 100 (547) T ss_pred hhhhhHHHHHHhh------cc-cch--hhhchhhheeeccccc-ccCCccCChhHHHHHHHHhhcCHHHHHHHHHHHHHH Confidence 1111111111110 00 000 0001111111111111 112223344444333344467899999999999976 Q ss_pred EEec------CCCcEEEEEeec--cccchHHHHHHHHHHHHHHHHhcch-----hhhHHHHHHH----hhhhhhheeeee Q lcl|NC_015287. 81 IVSD------QNDSPVEIDLEN--LPASKQLKDLIREEFKTVKEILNFD-----DKCHEILRNW----YVDGRIYYHKVI 143 (563) Q Consensus 81 iv~d------~~~~pV~l~l~~--~~~s~sik~~I~eeF~~i~~ll~F~-----~~~~~~~R~W----YvDGri~fhkvi 143 (563) .++- .+.-...+.+++ .+..+..+..+. +.+.+++-.++. ....++++.| ++.|.-|+.++. T Consensus 101 a~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~-~l~~~l~~pn~~~~p~~~s~~~f~~~lv~d~ll~Gn~~~~i~r 179 (547) T protein:vir:63 101 SMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIK-RIESFIEKTGVDNDINRDSFSSFVKKIVRDTYMYDQVNFEKVF 179 (547) T ss_pred hhhhhhhhhhccCCCceeEecccccccChhhHHHHH-HHHHHHHhhCCCCCCccchHHHHHHHHHHHHHhhCCEEEEEEE Confidence 6431 122223444443 223333333322 233444434433 1445555555 466888998888 Q ss_pred cCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccC Q lcl|NC_015287. 144 DIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTK 223 (563) Q Consensus 144 D~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~ 223 (563) |... -+.+|.+|||..|+.+... ++... .-..+|.+... + T Consensus 180 d~~G---~~~~L~~l~p~~V~~~~~~-------~g~~~------------~~~~~y~~~~~------------------~ 219 (547) T protein:vir:63 180 NRNQ---SMVRFVAKDPTTIFFATTA-------DGKIP------------DNGNRFVQVID------------------Q 219 (547) T ss_pred CCCC---cEEEEEEecCceeEEEECC-------ccccc------------cCceEEEEEcC------------------C Confidence 7532 3999999999988766211 11000 00112222110 0 Q ss_pred cceeeecccceeeeecc-eecCC-CCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCC-CCchHHHHHHH Q lcl|NC_015287. 224 QRPVKIAPDAISFCHSG-LVDRN-KQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVG-NLPKMKAEQYL 300 (563) Q Consensus 224 ~~~iki~~daI~y~hSG-l~d~~-~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvG-nlpk~KAeqYl 300 (563) ...+.++.+.|.|.+.. +.+.. +..++|-|..|++++.....++....=+=---+--+-|..+... +|.+..+++.- T Consensus 220 ~~~~~~~~~eiih~r~n~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lk 299 (547) T protein:vir:63 220 KIVATFNAREMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFK 299 (547) T ss_pred cEEEEeccccEEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCCCHHHHHHHH Confidence 11245666666665432 22222 34467889999999998888877665444334555666666644 35444443333 Q ss_pred HHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHH---HHHHHHHHHHhcCCChh Q lcl|NC_015287. 301 KEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSD---IEYFQKKLYRSLGVPES 377 (563) Q Consensus 301 ~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~D---V~YF~~kLy~aL~VP~s 377 (563) +.+...|. | ..+..+. .++.+ .|.++..|- .+..++.- .+|..+.+-++.+||.. T Consensus 300 ~~~~~~~~---------G-~~nagk~-~vl~~---------~g~~~~~l~--~~~~d~qfle~~~~~~~~Ia~afgVPP~ 357 (547) T protein:vir:63 300 REWKNSLS---------G-INGSWQI-PVVSA---------EDVKFVNMT--PSARDMEFEKWLNYLINVISALYGIDPA 357 (547) T ss_pred HHHHHHhc---------C-ccccccc-ccccC---------CCceEEEcC--CChhHHHHHHHHHHHHHHHHHHhCCCHH Confidence 33333332 1 1111111 12211 134455553 33344433 34567889999999999 Q ss_pred hccCCCcc-eeccccchh-----hHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccc Q lcl|NC_015287. 378 RIAGSGDG-FNLGRSSEI-----LRDEIKFTK-FVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDN 450 (563) Q Consensus 378 Rl~~~~~g-~~~g~~~eI-----tRDElKF~K-fI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn 450 (563) .|.-.+.+ .....+..+ .-....|.. -+.-+..++...|...|- +. + ...+.++|.... T Consensus 358 ~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~tL~P~~~~ie~~ln~~L~---------~~--~---~~~~~~~f~~~~ 423 (547) T protein:vir:63 358 EINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHIV---------AE--F---GDKYTFQFVGGD 423 (547) T ss_pred HcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------cc--c---CCceEEEeeccc Confidence 99532111 111111111 212222333 355555555555544332 11 1 134677776544 Q ss_pred hHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccCh-hhHHHHHHHH-----HHHhcCCCCCCCcccCccccc Q lcl|NC_015287. 451 HFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKD-EEVIEIDKQM-----EKEIKDGKLADPMEVQQLQLG 524 (563) Q Consensus 451 ~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~td-eei~e~~~qi-----e~E~~~~~~~~p~~~~~~~~~ 524 (563) .-.+ .+|..+...+. .-++|..-+|.. +.+.+ .+= -+.-+ ..-........+..+...+.- T Consensus 424 ~~~~-------~~~~~~~~~~~---~g~lT~NE~R~~-~gl~P~~eg--GD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (547) T protein:vir:63 424 IKSE-------LESVKILAEKA---KVAMTVNEVRKE-LNLPGDVIG--GDIPLNGVIVQRIGQLMQQEQFEHEKQQSNL 490 (547) T ss_pred cccH-------HHHHHHHHHHh---CCCcCHHHHHHH-hCCCCCCCC--CceeecccccccccccccccCCccccchhhc Confidence 3222 23333322221 124566666653 33432 100 00000 000000000000000000000 Q ss_pred cccccCCCCcC---CCCCcc---cccccCCCCcccCCCCCCCCCC Q lcl|NC_015287. 525 VHPEQMPGGAM---NPQDPN---MGGAPQEGGAEQAPQMPEGGEI 563 (563) Q Consensus 525 ~~~~~~~~~~~---~p~~~~---~~~~~~~~~~~~~~~~~~~~~~ 563 (563) ..+....++.. .|..|. ....+.+-.....++..|-|+- T Consensus 491 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 535 (547) T protein:vir:63 491 QMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDGQRKDKDNANAGKQ 535 (547) T ss_pred cccccccCCCCCCCCCCCCCCcccCCCcCccccccCccccchhhh Confidence 00000000000 011110 0000000000011111111111 No 128 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=97.37 E-value=8.3e-05 Score=42.96 Aligned_cols=415 Identities=12% Similarity=0.110 Sum_probs=170.2 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) |.=|-++ +++....... .+.+. . +.......+++...+ +...... +.. +..+++|-|..||+.|.+.+ T Consensus 1 MG~f~~l-f~~~~~~~~~--~~~~~-~-~~~~~~~~~~~~~~~-g~~~~~~-----v~~-~~al~~~~v~~ci~~ia~~i 68 (422) T protein:vir:13 1 MGFLRGL-FNKKNNNDEK--RSNYD-E-DIGIDISDSNFWEKF-GIKLNFS-----VRG-KRALKENTVYVCTKIRAESI 68 (422) T ss_pred Cchhhhh-hhccCCccch--hhhhh-h-ccccccCcchhhhhc-cccCCcc-----cch-hhhhccHHHHHHHHHHHHhh Confidence 3321121 1111111000 00000 0 000111111222211 1110000 000 12246788999999888854 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcch----hhhHHHHHHHhhhhhhheeeeecCCCCCcceeEE Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFD----DKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEV 155 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~----~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eL 155 (563) .- -|+.+- ++.+ .++ ......+|+. =|=. .=-+.++..+++.|.-|..++.|.. .-+++| T Consensus 69 A~-----lp~~~~-~~~~---~~~---~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~---G~~~~L 133 (422) T protein:vir:13 69 GK-----LSLKIY-KDKE---EYK---EHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRK---GKIIGL 133 (422) T ss_pred hh-----CceEEE-ecCc---ccc---cchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC---CcEEEE Confidence 32 233331 1111 111 1112222221 0111 2234456667888999998887753 238999 Q ss_pred EEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeeccccee Q lcl|NC_015287. 156 RYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAIS 235 (563) Q Consensus 156 r~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~ 235 (563) .+|+|..|..++. ++ +..... -.-+|.|... + .....++.+.|. T Consensus 134 ~~i~~~~v~~~~~---~~----~~~~~~-----------~~~~y~~~~~----------~--------g~~~~~~~~eii 177 (422) T protein:vir:13 134 YPINSDNVTKIID---DD----NFLSSL-----------SKVWYVVTDK----------N--------GKEHKLLPDEML 177 (422) T ss_pred EEECCcceEEEEc---CC----cceecc-----------ceEEEEEEeC----------C--------CeEEEEcccceE Confidence 9999998776632 11 111100 0123444221 0 112456666664 Q ss_pred eeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeC Q lcl|NC_015287. 236 FCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDA 315 (563) Q Consensus 236 y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~ 315 (563) |... .+..++-.+.|-|..|.+++.....+++...=+----+--+-|...+ ++|-+..+++..+.+...|.- T Consensus 178 h~~~-~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g------ 249 (422) T protein:vir:13 178 HFIG-DITLDGLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYV-GDLDEKAKKIFKKEFESMSNG------ 249 (422) T ss_pred EEcC-CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeC-CCCCHHHHHHHHHHHHHHhcC------ Confidence 4432 23445556689999999999888888776543333334566677776 466666555555444433321 Q ss_pred CCCcccccccccchhhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchh Q lcl|NC_015287. 316 STGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG-QNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEI 394 (563) Q Consensus 316 ~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eI 394 (563) ..+..+.+ .++ + |++++.|.=. ..+.-++-.++....+.++++||...|...+++ +. +.+ T Consensus 250 ----~~n~~~~~-vl~-------~---g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~-~~---sn~ 310 (422) T protein:vir:13 250 ----LENAHSIS-LLP-------F---GYQFQPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERA-TF---NNL 310 (422) T ss_pred ----ccccCCce-ecC-------C---CceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC-Cc---ccH Confidence 11111121 111 1 3444443211 122224455678889999999999999643221 11 112 Q ss_pred hHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_015287. 395 LRDEIKFTKF-VGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDP 473 (563) Q Consensus 395 tRDElKF~Kf-I~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p 473 (563) .-.-..|..+ |.-+-.++... |-. .++++.+. .....+.|..+. +... -+..|.++++.+-. T Consensus 311 e~~~~~f~~~~l~P~~~~ie~~----l~~-----~Ll~~~~~---~~g~~i~fd~~~----l~r~-d~~~~~~~~~~~~~ 373 (422) T protein:vir:13 311 TEQQKDFYVTTLQSSLTVYEQE----IQD-----KLFSQYET---LQDVKAEFNVDT----ILRS-DIKTRYEAYRIGIQ 373 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHH----HHH-----hhCChhhh---cCCceEEeechh----hhcC-CHHHHHHHHHHHHh Confidence 2222234332 22222222222 222 23333322 123444554322 1110 11234544433322 Q ss_pred hhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCccc Q lcl|NC_015287. 474 YVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQ 553 (563) Q Consensus 474 ~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~ 553 (563) - -+ +|..|+- + ...-+++ |+ . +.- -.+ ...-|.+ ..++.+ T Consensus 374 ~--G~------------~T~NE~R---~----~~gl~p~--~g--g-D~~-----~~~-~n~~~l~--------~~~~~~ 413 (422) T protein:vir:13 374 G--GF------------IEANEAR---R----RENLPPV--EG--G-DRL-----LVN-GNMIPIE--------MAGEQY 413 (422) T ss_pred C--CC------------cCHHHHH---H----HhCCCCC--CC--c-Cee-----eec-cCccchh--------hccccc Confidence 1 12 3333322 1 1122222 11 0 000 000 0001111 111122 Q ss_pred CCCCCCCCC Q lcl|NC_015287. 554 APQMPEGGE 562 (563) Q Consensus 554 ~~~~~~~~~ 562 (563) ...--+||+ T Consensus 414 ~~~g~~~g~ 422 (422) T protein:vir:13 414 KKGGEKGGK 422 (422) T ss_pred ccCCCcCCC Confidence 222344555 No 129 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=97.34 E-value=9.1e-05 Score=42.75 Aligned_cols=422 Identities=13% Similarity=0.137 Sum_probs=175.1 Q ss_pred cccchhhHhhhcccccccccCcCCCCcccce---eccccccccc-cccchhhhHHHHHHHHHHHhhccchhHHHHHhhcc Q lcl|NC_015287. 4 LFGFKIQKDDEQNKNVVSPVPQSNEDSSDYY---VSSGFYGQYV-DIDGVFKSEFELIKRYREMALHPEVDSAIEDIINE 79 (563) Q Consensus 4 ~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~---~~~~~~~~~~-~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvne 79 (563) +|.| +.+..+..+. .++....+-... ....|.+... ++. + .-...+++|.|..||+.|.+. T Consensus 1 ~~~~-~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~g~~~~g~~--v--------~~~~al~~~~V~~~v~~Ia~~ 65 (454) T protein:vir:93 1 MWNL-LRRTRKNQKS----GRDVREAGWTSLFQAVAEPFAGAWQQGVK--A--------DPEAVLSFHAVFACISLISQD 65 (454) T ss_pred CCCc-cccCcccccc----cccccchhhhhhhhhhhhhhcchhhcCcc--c--------ChHHhhccHHHHHHHHHHHHh Confidence 5555 2222211111 111111111111 1111222100 110 0 012456788999999999985 Q ss_pred eEEecCCCcEEEEEeeccccchHHHHHHHHH-HHHHHHHhcchhhh----HHHHHHHhhhhhhheeeeecCCCCCcceeE Q lcl|NC_015287. 80 AIVSDQNDSPVEIDLENLPASKQLKDLIREE-FKTVKEILNFDDKC----HEILRNWYVDGRIYYHKVIDIKAPEEGIKE 154 (563) Q Consensus 80 aiv~d~~~~pV~l~l~~~~~s~sik~~I~ee-F~~i~~ll~F~~~~----~~~~R~WYvDGri~fhkviD~~~~k~GI~e 154 (563) +.-. |+.|-=.+ . +..++.+.+- +..+++-=+-...+ +.++..++++|.-|.-++.|.. .-+.+ T Consensus 66 iA~l-----p~~~~~~~-~--~g~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~---G~~~~ 134 (454) T protein:vir:93 66 IAKM-----RLRLMQTD-A--QGIRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNAR---GQIKE 134 (454) T ss_pred hccC-----ceEEEEec-c--CCccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCC---CcEEE Confidence 5432 44442111 1 1112222211 11111111222233 4455667788998888877643 22889 Q ss_pred EEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccce Q lcl|NC_015287. 155 VRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAI 234 (563) Q Consensus 155 Lr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI 234 (563) |.+|+|..|+.++. ++ + .-+|.|..... .+ ....+.++.+.| T Consensus 135 L~~i~~~~v~v~~~---~~---g------------------~~~y~~~~~~~-----~~---------~~~~~~~~~~eV 176 (454) T protein:vir:93 135 LRILDWNRVEPLVA---DD---G------------------EVFYRITPDRN-----CG---------ITEAVTVPAREV 176 (454) T ss_pred EEEEcCcceEEEEc---CC---C------------------cEEEEEEeccc-----cc---------cceeEEecCcce Confidence 99999998876521 11 0 12333322110 01 122356777777 Q ss_pred eeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEee Q lcl|NC_015287. 235 SFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYD 314 (563) Q Consensus 235 ~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd 314 (563) .|...+. ..++-.+.|-+..|++.+.....+++...=+=---+--+-|..++ +.|-+..+++--+ .++...- - T Consensus 177 iH~k~~~-~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~----~~~~~~~-g 249 (454) T protein:vir:93 177 IHDRFNC-FFHPLIGLPPVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIP-GSITEENAKKLKS----NWDSGYT-G 249 (454) T ss_pred EEeccCC-CCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecC-CCCCHHHHHHHHH----HHHHHhc-c Confidence 6554332 334556789999999999999999987653222223345666666 5666555443222 2222210 0 Q ss_pred CCCCcccccccccchhhhhcccccCCCCccceeecCCCC-CcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccch Q lcl|NC_015287. 315 ASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQ-NLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSE 393 (563) Q Consensus 315 ~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~-nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~e 393 (563) .++|.+ + +++ .|.+++.|.=.. .+.-++-.+|..+.+.++++||...|...+++ +. +. T Consensus 250 ~n~g~~------~-vl~----------~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~-t~---sn 308 (454) T protein:vir:93 250 ENAGKT------A-ILS----------NGAKYNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPP-SS---DN 308 (454) T ss_pred cccCCc------e-ecc----------CCceEEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCC-cc---hh Confidence 122321 1 221 134454443211 22223444577789999999999999532211 11 11 Q ss_pred hhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_015287. 394 ILRDEIKFTK-FVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVD 472 (563) Q Consensus 394 ItRDElKF~K-fI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~ 472 (563) +.-....|.+ -|.-+..++...+...| + +..+ ..+.|.-+. +... -+..|++.+..+- T Consensus 309 ~e~~~~~f~~~~l~P~~~~ie~~ln~~L----~-----~~~~-------~~~~f~~~~----ll~~-D~~~r~~~~~~~~ 367 (454) T protein:vir:93 309 VEALEQQYYSQCLQTLIESIELLLDEAL----E-----TGEN-------ESTEFDVTT----LLRM-DSERRMKTLGDAV 367 (454) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhh----c-----CCCC-------cEEEeechh----hhcc-CHHHHHHHHHHHH Confidence 2222223443 34445555554444333 2 2221 234444322 2111 1235566554442 Q ss_pred hhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccC-ccccccccc-cCCCCcCCCCCcccccccCCCC Q lcl|NC_015287. 473 PYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQ-QLQLGVHPE-QMPGGAMNPQDPNMGGAPQEGG 550 (563) Q Consensus 473 p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~-~~~~~~~~~-~~~~~~~~p~~~~~~~~~~~~~ 550 (563) .- -++|.+-+|. ++.+ ++++..++-- ....-+... ....+...|..+...++.++ T Consensus 368 ~~--G~~T~NE~R~-~~gl------------------~pi~ggD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 424 (454) T protein:vir:93 368 KN--TLLTPNEARK-RENL------------------PPLAGGDALYLQQQNYSLEALSRRDAREDPFASSGKTASVP-- 424 (454) T ss_pred hC--CCcCHHHHHH-HhCC------------------CCCCCCCeeeeccCccchHhhhccCcccCCCCCCccCCCCC-- Confidence 22 2444444443 1222 2222110000 000000000 00111111222111111111 Q ss_pred cccCCCCCCCCCC Q lcl|NC_015287. 551 AEQAPQMPEGGEI 563 (563) Q Consensus 551 ~~~~~~~~~~~~~ 563 (563) +.++-.+|++= T Consensus 425 --~~~~~~d~~~~ 435 (454) T protein:vir:93 425 --QAVAASDGNKA 435 (454) T ss_pred --CCCCCCCCCCC Confidence 01111111111 No 130 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=97.33 E-value=9.2e-05 Score=42.72 Aligned_cols=377 Identities=16% Similarity=0.146 Sum_probs=161.0 Q ss_pred ccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEec Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSD 84 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d 84 (563) ||| +++.... +..|+.+.+.......+++-+...+-.. . + -+...++|-|..||+-|.+.+.-+ T Consensus 1 Mgl-f~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~-v-~-------~~~al~~~~V~~~i~~Ia~~ia~l- 64 (384) T protein:vir:49 1 MPI-FNITNLA-----TESPPSNQDSFFDITDPEFLDALNGSEW-V-S-------AETALKNSDLFSIISQLSNDLATA- 64 (384) T ss_pred Ccc-ccccccC-----cccccccchhhccccchhhcccccCCce-e-c-------hhhhhccHHHHHHHHHHHHHHhhC- Confidence 555 3221111 1122222221111111111111100000 0 0 122357899999999999866543 Q ss_pred CCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhh----HHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCC Q lcl|NC_015287. 85 QNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKC----HEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDP 160 (563) Q Consensus 85 ~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~----~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP 160 (563) |+.+ .+ ... +.+++--+-...+ +.++..+++.|.-|.-++.|.. .-+++|.+|+| T Consensus 65 ----~~~~--~~-----~~~-------~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~---g~~~~L~~l~~ 123 (384) T protein:vir:49 65 ----KITT--SR-----KQL-------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNEN---GRDMKWEYLRP 123 (384) T ss_pred ----ceee--ec-----chh-------hhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCC---CcEEEEEEEcC Confidence 2222 11 111 1222222222233 3456677788998887776643 23889999999 Q ss_pred ccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeecc Q lcl|NC_015287. 161 LKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSG 240 (563) Q Consensus 161 ~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSG 240 (563) ..|+.++. ++. + .-+|.|...+ +.. ...+.++.+.|.|.. T Consensus 124 ~~v~v~~~---~~~--~------------------~~~y~~~~~~-------~~~--------~~~~~~~~~eVih~~-- 163 (384) T protein:vir:49 124 SQVSFNRL---DNQ--N------------------GLYYNITFDD-------PRI--------PPKQHVPQGDILHFR-- 163 (384) T ss_pred ceeEEEEc---CCC--c------------------eEEEEEEecC-------ccc--------cceeEecCccEEEec-- Confidence 88776532 110 0 1133332211 000 112456666664443 Q ss_pred eecCCCC-ccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCc Q lcl|NC_015287. 241 LVDRNKQ-TILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGE 319 (563) Q Consensus 241 l~d~~~~-~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGe 319 (563) ....++. .+.|.|..|+..++....++....-+--.-+--+-+..++.+..+..++.+ ...++... .+.|. T Consensus 164 ~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~----~~~~~~~~----~n~~~ 235 (384) T protein:vir:49 164 LLSVDGGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKTKQ----SRSRQAMK----QMQGG 235 (384) T ss_pred CCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHH----HHHHHhcc----cCCcc Confidence 2333442 457999999999998888888776544444566777777766655554433 22332211 12221 Q ss_pred ccccccccchhhhhcccccCCCCccceeecCCCC-CcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHH Q lcl|NC_015287. 320 IRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQ-NLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDE 398 (563) Q Consensus 320 v~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~-nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDE 398 (563) .+ .=. .|.++..|.-.. .+.-++-.++..+.+.++++||.+.|...+. +.++. +.-| T Consensus 236 ------~~--------vl~---~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~----~~~~~-~~~~ 293 (384) T protein:vir:49 236 ------PL--------VLD---DLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGD----KQSSL-EMIY 293 (384) T ss_pred ------ce--------ecC---CCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC----ccccH-HHHH Confidence 11 111 245666664322 2222455678889999999999999954322 11110 1111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchh Q lcl|NC_015287. 399 IKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKY 478 (563) Q Consensus 399 lKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy 478 (563) --+..|+.-.-.-+...+...|-..|.+. +.. .+..+. . T Consensus 294 ~~~~~~i~~~l~pi~~~i~~~l~~~l~~~-~~~-------------~~~~~~--------------------------~- 332 (384) T protein:vir:49 294 NIYFKAVSRFLRPFVSELSKKLSCEVDAD-ILP-------------AVDPTG--------------------------S- 332 (384) T ss_pred HHHHHHHHHHHHHHHHHHHHHhchhhhhh-hhh-------------hhhccc--------------------------h- Confidence 22333433322222222222221111000 000 000011 1 Q ss_pred hhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCc-cccccccccCCCCcCCCCC Q lcl|NC_015287. 479 FSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQ-LQLGVHPEQMPGGAMNPQD 539 (563) Q Consensus 479 ~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~-~~~~~~~~~~~~~~~~p~~ 539 (563) ...+....+++..-..+.+. .+.-...|.. |++.-+ .+.. +.++|+.+=.- T Consensus 333 -~~~~~~~~l~~~~~~t~~e~---~~~l~~~g~~--~ne~r~~~~~~----p~~gGd~~~~~ 384 (384) T protein:vir:49 333 -NYIGLINSMVKTGTLAQNQG---LYVLQQAEIL--PKDLPEGETDS----TLKGGETNEQY 384 (384) T ss_pred -HHHHHHHHHhhcCcccHHHH---HHHHhhCCCC--ChhHHHHcCCC----CCCCCCCCCCC Confidence 11222333333222222222 2222333433 222111 1111 11111111111 No 131 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=97.29 E-value=0.0001 Score=42.41 Aligned_cols=423 Identities=14% Similarity=0.157 Sum_probs=181.8 Q ss_pred ccchhhHhhhccccccccc----CcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHH-------- Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPV----PQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSA-------- 72 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~----p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~A-------- 72 (563) +++ .|+ -+.+.+-....+.. + +..-..-+.+|+.+..+.+=... T Consensus 1 ~~~-------------~~~~~~~~~~~~~~~~~~i~~-----~------i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~ 56 (452) T protein:vir:36 1 MKY-------------KPPKLMTFSKDEPITVEVVTK-----F------MEKHKLEVARYEYLKNMYLGIMAIDDEPAKD 56 (452) T ss_pred Ccc-------------cCceeEEcCCccCCCHHHHHH-----H------HHHHHHHHHHHHHHHHHhccccccccCcccc Confidence 111 110 00000000000000 0 00001112233333333332211 Q ss_pred ------------HHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhhee Q lcl|NC_015287. 73 ------------IEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYH 140 (563) Q Consensus 73 ------------i~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fh 140 (563) ...||+ ..+.=-..+||.+..++ +...+.+..+++-.+|+....++.+.+++.|+-|+| T Consensus 57 ~~~~~~ki~~n~~~~ivd-~~~~~l~g~~~~~~~~d--------~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~ 127 (452) T protein:vir:36 57 SWKPDNRLAVNFTKYIVD-TFTGYFNGIPVKKSHSD--------KEILTKLQEFDNLNDMEDEESELAKMACIYGRAFEF 127 (452) T ss_pred ccCccceeecchHHHHHH-HHhhhhcccCceeecCC--------hhHHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEE Confidence 111111 01111123455555443 234566778888889999999999999999999999 Q ss_pred eeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCcc Q lcl|NC_015287. 141 KVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPF 220 (563) Q Consensus 141 kviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~ 220 (563) .-+|. +|-..+..+||+.+..|..-. .....+-.++-- ..........+|.+.........+ ++... T Consensus 128 v~~d~----~g~~~i~~~~p~~~~~v~d~~---~~~~~~~~i~~~----~~~~~~~~~~vyt~~~i~~~~~~~--~~~~~ 194 (452) T protein:vir:36 128 LYQDE----DTQTNVVYNSPENMFMVYDDT---VKQEPLFAVRYG----VDEDKKLQGEVYTLLETIKISGEN--DEISF 194 (452) T ss_pred EEecC----CCeeEEEEEcccceEEEEcCC---CCCceEEEEEEE----EecCceEEEEEEecCeEEEEEEcC--CceEE Confidence 87763 577889999999988774321 112222111100 000112223344443322211111 00000 Q ss_pred ccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHH Q lcl|NC_015287. 221 STKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQY 299 (563) Q Consensus 221 ~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m-~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqY 299 (563) .... .--+..=+|++. .|...+.|-++..+.....+.. +=+....-+..+.|-+-+.- +.++... T Consensus 195 ~~~~-~~~~g~iPvv~~------~n~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g---~~~~~~~---- 260 (452) T protein:vir:36 195 GEGT-YNPYPDLPVVEF------YFNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLG---AAVEEED---- 260 (452) T ss_pred ecce-eccCCcccEEEe------cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeec---CCcCchh---- Confidence 0000 000111123322 1223345666655555554433 34444555667777665542 2222211 Q ss_pred HHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHH-HHHHHHHHHHHhcCCChhh Q lcl|NC_015287. 300 LKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELS-DIEYFQKKLYRSLGVPESR 378 (563) Q Consensus 300 l~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~-DV~YF~~kLy~aL~VP~sR 378 (563) +.++ .. ++++.=..+| .+.+..+.+|....+.+.+. -+.-+.+.+|.-..+|- T Consensus 261 ~~~~-~~--~~~~~~~~~~---------------------~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~-- 314 (452) T protein:vir:36 261 LKNI-RS--NRVINYYADG---------------------EGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVAN-- 314 (452) T ss_pred hhhh-hh--cceEEecCCC---------------------CccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCccc-- Confidence 1111 11 1222111111 22334566666555544433 35667788888889984 Q ss_pred ccCCCcceeccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CCChhHHHHhhhceEEEeeccchHHHH Q lcl|NC_015287. 379 IAGSGDGFNLGRSSE--ILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKN-IVTPEDWEVLGDHIQYDFVYDNHFAEL 455 (563) Q Consensus 379 l~~~~~g~~~g~~~e--ItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkg-I~t~eew~~~~~~i~~~f~~Dn~f~El 455 (563) +..+ . +|.+|. |..-+.....-+.+.++.|..-+..+++.=+-+.+ .-...+|. .|.+.|...---.+ T Consensus 315 ~~~~--~--~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~----~i~i~f~~~~p~d~- 385 (452) T protein:vir:36 315 ISDE--S--FGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNVSNKDSWK----DIEYTFTRNEPKDI- 385 (452) T ss_pred cCcc--c--ccCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccc----cceEEeCCCCCcCH- Confidence 3222 1 233322 22222234455666777777777777765433322 22333454 46777764332223 Q ss_pred HHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcC Q lcl|NC_015287. 456 KETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAM 535 (563) Q Consensus 456 Ke~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~ 535 (563) .+.+++++.+ +| .+|.+++++.+-..+ +.+++-++|++|........++ .+..+.+ ...+. T Consensus 386 ------~~~a~~~~k~---~g-~iS~et~~~~~~~~~--d~~~E~~ri~~E~~~~~~~~~~-~~~~~~~------~~~~~ 446 (452) T protein:vir:36 386 ------KEQAETANIL---MG-ITSQETALSVISVIP--DVQAEMEKIKKEEASTAIFDKD-KQPSEKG------TDTVV 446 (452) T ss_pred ------HHHHHHHHHH---hc-cCChHHHHHhCCCCC--CHHHHHHHHHHHHHHHHHHHhh-ccCCCCc------ccccC Confidence 2334455555 34 489999998775433 2444445555554321111110 0000000 00111 Q ss_pred CCCCcc Q lcl|NC_015287. 536 NPQDPN 541 (563) Q Consensus 536 ~p~~~~ 541 (563) +..+.| T Consensus 447 ~~~~~e 452 (452) T protein:vir:36 447 SETNEE 452 (452) T ss_pred ccccCC Confidence 111111 No 132 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=97.28 E-value=0.00011 Score=42.37 Aligned_cols=432 Identities=11% Similarity=0.051 Sum_probs=169.3 Q ss_pred cccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHH---------------------- Q lcl|NC_015287. 15 QNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSA---------------------- 72 (563) Q Consensus 15 ~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~A---------------------- 72 (563) ++.- =-+|++- -.....+.........-....+ +..+.+|+.+..+.+=... T Consensus 1 ~~~~--~~~~~~~---~~~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~ 74 (479) T protein:vir:79 1 MLNI--YISETDL---IKVQLKKESTINLVKVIEHYIL-KHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVN 74 (479) T ss_pred CCCc--eecccce---EeeccccCChhHHHHHHHHHHh-hhhHHHHHHHHHHhccCCcccccccccccccccccccccCc Confidence 1110 0001100 0000011111111111111100 1123334444333331111 Q ss_pred -------HHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecC Q lcl|NC_015287. 73 -------IEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDI 145 (563) Q Consensus 73 -------i~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~ 145 (563) ...||+..+ .=--.+||.+..++ + .+.+.++.+.+ .+|+....++.+...+.|+-|.+.-+| T Consensus 75 ~ki~~~~~~~Ivd~~~-~~l~g~p~~~~~~~----~----~~~~~~~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d- 143 (479) T protein:vir:79 75 NKAINNYHKLLVDQKV-GYSVGNPIVFNADD----D----NLTKLLNDLLG-EEFDDTITELYLNASNKGVEWLHPYIN- 143 (479) T ss_pred ceeecchHHHHHHHHH-hhhhcCCceeccCC----H----HHHHHHHHHHh-cCHHHHHHHHHHHHHhcCeEEEEEEeC- Confidence 112222111 11124566666554 2 22233334443 589999999999999999999987665 Q ss_pred CCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccc-------cccccccccceeeecCccccccccccccCcC Q lcl|NC_015287. 146 KAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNP-------ADIENPEIEEYYQYDPSQTQGKNALGAIGQT 218 (563) Q Consensus 146 ~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~-------~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~ 218 (563) ++|-..+..+||+.+..|..-.........+..-+.... ...+.++...||.+...++............ T Consensus 144 ---~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~y~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~ 220 (479) T protein:vir:79 144 ---RKGEFKYVIIPAEEAIPIWDSKRQRELVAFIRFYYIEDIDGNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKM 220 (479) T ss_pred ---CCCceEEEEEccceeEEEEeCCCCCceEEEEEEEEEeecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccc Confidence 357788999999998887321111111111111000000 0001111122333322221111110000000 Q ss_pred ccccCcceeeeccc-------ceeeeecceecCCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCC Q lcl|NC_015287. 219 PFSTKQRPVKIAPD-------AISFCHSGLVDRNKQTILSYLHKSIKALNQLRM-IEDSLVIYRLSRAPERRIFYIDVGN 290 (563) Q Consensus 219 ~~~~~~~~iki~~d-------aI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m-~EDalVIyRi~RAPeRRvFyIDvGn 290 (563) ...... ...+... +|++. .|+....|-++..+....-+.. +-+....-+.++-|-.-+--.+ |. T Consensus 221 ~~~~~~-~~~~~~~~~~~~~vPvv~~------~nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~-~~ 292 (479) T protein:vir:79 221 TDIQEG-HFRINNKEQGWGKVPFIPF------KNNEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYP-GT 292 (479) T ss_pred cccccc-cccccccccCCCcccEEEe------cCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCC-cc Confidence 000000 0001110 22221 1333345667766666555543 3344445555565543322111 11 Q ss_pred CchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHH Q lcl|NC_015287. 291 LPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDIEYFQKKLY 369 (563) Q Consensus 291 lpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy 369 (563) ...+... +.+.. .|+ .+ +++-+ ++.|....+... -.-++-+++.+| T Consensus 293 ----~~~~~~~--------------------~~~~~-~~i---~~---~~~~~--~~~l~~~~~~~~~~~~~~~l~~~i~ 339 (479) T protein:vir:79 293 ----SLQEFID--------------------NIRYY-KSI---KV---DGGGG--VDKLEINIPVEAKKELLDRLEKNII 339 (479) T ss_pred ----ccccchh--------------------hhhhc-cce---ec---CCCCc--ceEEeccCCHHHHHHHHHHHHHHHH Confidence 0000000 00000 011 01 12222 333333333322 234556677788 Q ss_pred HhcCCChhhccCCCcceeccccchhhHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEee Q lcl|NC_015287. 370 RSLGVPESRIAGSGDGFNLGRSSEILRD--EIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFV 447 (563) Q Consensus 370 ~aL~VP~sRl~~~~~g~~~g~~~eItRD--ElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~ 447 (563) +...+|-. ..+ + +|.+|.+... ......-+.+.+..|...+.++++.=+-+-++....+++. ..+.+.|. T Consensus 340 ~~s~~p~~--~~~--~--~gn~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~--~~i~i~f~ 411 (479) T protein:vir:79 340 IFGQGVNP--ESQ--N--TGDKSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISGNKSYDY--KTVQITFN 411 (479) T ss_pred HHhCcccc--ccc--c--ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccc--ccceEEeC Confidence 88888843 222 2 2444433322 1123334666777777777777765443334443334442 45777886 Q ss_pred ccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCcccccccc Q lcl|NC_015287. 448 YDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHP 527 (563) Q Consensus 448 ~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~ 527 (563) ..---.+.. -+++++.+ +| .+|++++++.+-..+| .+++-++|++|.....-..... +. T Consensus 412 ~~~p~~~~~-------~a~~~~kl---~g-~iS~et~l~~l~~v~d--~~~E~~ri~~E~~~~~~~~~~~--------~~ 470 (479) T protein:vir:79 412 HSMIINEAE-------KIDMAAKS---TG-IVSDETIVSNHPWVED--VNDELERLKKQEDTQKEYDDLI--------PN 470 (479) T ss_pred CCCCcCHHH-------HHHHHHHH---hc-cCcHHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHhcc--------Cc Confidence 543333332 23444444 35 4899999987654333 3444445555533211000000 00 Q ss_pred ccCCCCcCC Q lcl|NC_015287. 528 EQMPGGAMN 536 (563) Q Consensus 528 ~~~~~~~~~ 536 (563) ...++++++ T Consensus 471 ~~~~~~~e~ 479 (479) T protein:vir:79 471 NQDGVIDET 479 (479) T ss_pred ccCCCcCcC Confidence 011111111 No 133 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=97.26 E-value=0.00011 Score=42.24 Aligned_cols=405 Identities=13% Similarity=0.107 Sum_probs=173.8 Q ss_pred ccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEec Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSD 84 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d 84 (563) ||| +++--.. + ++.+......-.....+++. ...+.. =+. ....++|-|..||+-|.+.+. T Consensus 1 Mg~-f~~lf~r-~---~~~~~~~~~~~~~~~~~~~~-~~~g~~---v~~-------~~al~~~~v~~~i~~Ia~~ia--- 61 (414) T protein:vir:44 1 MVF-FSGLFQR-K---SDAPVTTPAELADAIGLSYD-TYTGKQ---ISS-------QRAMRLTAVFSCVRVLAESVG--- 61 (414) T ss_pred Cch-hhhhhcc-C---ccCcccchhhHhHhhccCcc-ccCCce---ech-------hhhhccHHHHHHHHHHHHHhc--- Confidence 454 2211110 1 11111111100001111111 111100 000 122468889999999988654 Q ss_pred CCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhc--------chhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEE Q lcl|NC_015287. 85 QNDSPVEIDLENLPASKQLKDLIREEFKTVKEILN--------FDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVR 156 (563) Q Consensus 85 ~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~--------F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr 156 (563) .-|+.|--.+ +..++.. .-..++.+|+ ...=-+.++..+++.|.-|..++.+ ...+.+|. T Consensus 62 --~~p~~~~~~~----~~~~~~~--~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~----~g~~~~L~ 129 (414) T protein:vir:44 62 --MLPCNLYHLN----GSLKQRA--TGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA----FGEVAELL 129 (414) T ss_pred --cCceEEEEec----CCceeec--ccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC----CCcEEEEE Confidence 1222221111 0001000 0111222221 2222455677778889888765433 35688999 Q ss_pred EeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceee Q lcl|NC_015287. 157 YIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISF 236 (563) Q Consensus 157 ~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y 236 (563) +|+|.+|..++. .. + .-+|.+... + .....++.+.|.| T Consensus 130 ~l~~~~v~~~~~----~~--~------------------~~~y~~~~~----------~--------g~~~~~~~~evih 167 (414) T protein:vir:44 130 PVDPGCVVPKLN----SS--W------------------EPVYQVTFP----------D--------GSTDVLSQEDIWH 167 (414) T ss_pred EEcCceEEEEEC----CC--C------------------cEEEEEEec----------C--------ceEEEEccccEEE Confidence 999988764421 11 0 012222110 0 1124566777755 Q ss_pred eecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCC Q lcl|NC_015287. 237 CHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDAS 316 (563) Q Consensus 237 ~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~ 316 (563) .. .++.++-.+.|-+..|..++.....+++...-+----+--+-++.++ ++|.+..+++..+.+...|+.- .+ T Consensus 168 ~~--~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g~----~n 240 (414) T protein:vir:44 168 VR--TLTLDGLVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTE-QTLSDQAYERLKKDFEERHTGL----GN 240 (414) T ss_pred ec--CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC-CCCCHHHHHHHHHHHHHHhcCc----cc Confidence 43 23556666789999999999888888877665554445567788887 5677776666666555555420 11 Q ss_pred CCcccccccccchhhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhh Q lcl|NC_015287. 317 TGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG-QNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEIL 395 (563) Q Consensus 317 TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eIt 395 (563) .| +.+ .++ .|++++.|.-. ..+.=++-.++..+.+.++++||.+.|...++ -+... +. T Consensus 241 ~~------~~~-vl~----------~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~-~t~~n---~e 299 (414) T protein:vir:44 241 AH------RPM-ILE----------MGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDR-ATFNN---IE 299 (414) T ss_pred cC------cce-ecC----------CCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-CCccc---HH Confidence 12 111 111 24566666321 12222345667788899999999999954321 11111 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_015287. 396 RDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYV 475 (563) Q Consensus 396 RDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~v 475 (563) -.-..|.+++ |+- +...+.+.|-.. ++++.++. .+.+.|..+. +...+ +..|.+.++.+-.- T Consensus 300 ~~~~~~~~~~--l~P-~~~~ie~~ln~~-----L~~~~~~~----~~~i~fd~~~----ll~~d-~~~~~~~~~~~~~~- 361 (414) T protein:vir:44 300 ELGLGFINYS--LVP-YLTRIEQRINTG-----LVRKSKQG----VFYAKFNAGA----LLRGD-MKSRFEAYATGINW- 361 (414) T ss_pred HHHHHHHHHH--HHH-HHHHHHHHHHhh-----cCCccccC----ceEEEEechh----hhccC-HHHHHHHHHHHHhC- Confidence 1112344432 322 222333444333 35555553 3345555432 22111 22445444333211 Q ss_pred chhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccCC Q lcl|NC_015287. 476 GKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAP 555 (563) Q Consensus 476 GKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~ 555 (563) -+++.+-+|. ++.+ ++++..+.--.+.+ ..-.|.++.......+...++.| T Consensus 362 -G~~t~NE~R~-~~gl------------------~p~~ggD~~~~~~n---------~~~~~~~~~~~~~~~~~~~~d~~ 412 (414) T protein:vir:44 362 -GIYSPNDCRD-LEDM------------------NPRPGGDVYLTPMN---------MTTKPSDGSKAGKQKDNANADET 412 (414) T ss_pred -CCcCHHHHHH-HhCC------------------CCCCCcceeccccc---------ccccCCccccCCCCCCCCCCCCC Confidence 2344444442 1112 22211100000000 00001111101111111111111 Q ss_pred CC Q lcl|NC_015287. 556 QM 557 (563) Q Consensus 556 ~~ 557 (563) +- T Consensus 413 ~~ 414 (414) T protein:vir:44 413 TS 414 (414) T ss_pred CC Confidence 11 No 134 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=97.25 E-value=0.00012 Score=42.14 Aligned_cols=410 Identities=14% Similarity=0.115 Sum_probs=176.9 Q ss_pred cccc-------chhhhHHHHHHHHHHHhhccchhHHH-------------------------------HHhhcceEEecC Q lcl|NC_015287. 44 VDID-------GVFKSEFELIKRYREMALHPEVDSAI-------------------------------EDIINEAIVSDQ 85 (563) Q Consensus 44 ~~~~-------~~~~~~~~LI~~YR~ma~~PEvd~Ai-------------------------------~eIvneaiv~d~ 85 (563) +.++ -......+.+.+|+.+..+.+-..+| ..||+-.+-+ - T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~y-l 79 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGY-V 79 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhh-e Confidence 2211 11122345666777776655544322 1222211110 1 Q ss_pred CCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCcccee Q lcl|NC_015287. 86 NDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKL 165 (563) Q Consensus 86 ~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~ 165 (563) -..||.+..++. ...++|.+ +++ -+|+..-.++.+.|.+.|+-|.+.-+| ++|...+..+||..+.. T Consensus 80 ~G~p~~~~~~d~----~~~~~l~~----~~~-~~~~~~~~~l~~~~~~~G~a~~~~y~d----~~~~~~~~~~~p~~~~~ 146 (470) T protein:vir:10 80 ASVFPDIDVGKD----ADNKKIID----VLG-DDRALTLNGLLVDSSNAGRAWLHYWID----EDGNFRYGIIQPDQITP 146 (470) T ss_pred eccceeeecCch----HHHHHHHH----HHh-hhHHHHHHHHHHHHhhcCeeEEEEEec----CCCceEEEEEcccceEE Confidence 245677666542 22233333 333 257777788999999999999998777 45788899999999888 Q ss_pred eeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCc-----Ccc-cc------C---cceeeec Q lcl|NC_015287. 166 VRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQ-----TPF-ST------K---QRPVKIA 230 (563) Q Consensus 166 vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~-----~~~-~~------~---~~~iki~ 230 (563) |..-.........|..-......+ ...+..+-+|.+.........+.... ... .. . .....-. T Consensus 147 v~d~~~~~~~~a~ir~y~~~~~~~--~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (470) T protein:vir:10 147 IYATTLDNKLLGILRSYKQLDPDS--GKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKHN 224 (470) T ss_pred EEcCCCCCceEEEEEEEEeeecCC--ceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccccccccccccC Confidence 843222111112221111000000 00011112222222211111110000 000 00 0 0000000 Q ss_pred cc--ceeeeecceecCCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhh Q lcl|NC_015287. 231 PD--AISFCHSGLVDRNKQTILSYLHKSIKALNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRY 307 (563) Q Consensus 231 ~d--aI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m-~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~ 307 (563) .. +|++. .|+....|=|+..+.....+.. +=+....-+.++.|-.-+.-.+.-+++. .+. -+.++ T Consensus 225 ~g~vPvv~~------~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~-----~~~-~~~~~ 292 (470) T protein:vir:10 225 FGRVPFIEF------SKNKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQ-----FMN-DLRKY 292 (470) T ss_pred CCeeeEEEe------ecCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccch-----hhh-hhhhc Confidence 00 11111 1223345667766665555533 3333444455555555444322212111 111 12222 Q ss_pred cceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCChhhccCCCcce Q lcl|NC_015287. 308 RNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGEL-SDIEYFQKKLYRSLGVPESRIAGSGDGF 386 (563) Q Consensus 308 knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~VP~sRl~~~~~g~ 386 (563) +-. +++-.+.|.|..+..|--..+.... .-++-+.+.+|+-..+|- +..+ + T Consensus 293 ~~i-----------------------~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~--~~~~--~- 344 (470) T protein:vir:10 293 KSI-----------------------KINNTGNGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGID--PANF--E- 344 (470) T ss_pred CeE-----------------------eccCCCCCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCC--CCcc--c- Confidence 211 2222233344445665555554432 344566778888889884 3222 2 Q ss_pred eccccchhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHH Q lcl|NC_015287. 387 NLGRSSEILRDEIKFT---KFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNE 463 (563) Q Consensus 387 ~~g~~~eItRDElKF~---KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~ 463 (563) +|.+|..... .+|. .-+.+.|+.|...|.++++.=+-+-|+. .-+| ..|.+.|.+.--=.++..+ T Consensus 345 -~gn~Sg~Alk-~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~~~-~~d~----~~i~i~f~~~~p~d~~e~~----- 412 (470) T protein:vir:10 345 -SSNASGVAIK-MLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS-DADK----RHISQHWTRTKVEDSLTKA----- 412 (470) T ss_pred -cccchHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-Cccc----ceeeEEeccCCCCCHHHHH----- Confidence 2444433322 2222 3366677777777777666433222432 2233 3577777664433333333 Q ss_pred HHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcC Q lcl|NC_015287. 464 RLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAM 535 (563) Q Consensus 464 R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~ 535 (563) ++++.+ +| .+|.+++++..-..+ +.+++-++|++|..+. .+ ...+.++. .+....+++ T Consensus 413 --~~~~~~---~g-~iS~et~l~~~p~v~--D~~~E~eri~~E~~e~-~~--~~~~~~~~---~~~~~dde~ 470 (470) T protein:vir:10 413 --QIVSTV---AN-YSSKEAVAKANPIVD--DWQQELKDLAKDKEEN-DP--YSNQADEL---NGKGVNDEQ 470 (470) T ss_pred --HHHHHH---hc-cCcHHHHHHhCCCCC--CHHHHHHHHHHHHHHH-HH--hhcccccc---CCCCCCCCC Confidence 334443 35 489999998754333 3444555555554321 10 00000000 000000111 No 135 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=97.20 E-value=0.00013 Score=41.82 Aligned_cols=412 Identities=12% Similarity=0.102 Sum_probs=171.9 Q ss_pred ccccc-----chhhHhhhcccccccccCcC-----------CCCcccc-eeccccccccccccchhhhHHHHHHHHHHHh Q lcl|NC_015287. 2 AKLFG-----FKIQKDDEQNKNVVSPVPQS-----------NEDSSDY-YVSSGFYGQYVDIDGVFKSEFELIKRYREMA 64 (563) Q Consensus 2 ~~~~~-----f~~~~~~~~~~~~~s~~p~~-----------~~d~~~~-~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma 64 (563) -+|.. .+|+-.++-.+ ...++++. .++...- ...+++.+... . ... -..-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~--~-~~~--------~~~al 68 (441) T protein:vir:98 1 MHWYNTDCYFVDFKSRKQSRK-ELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKL--R-QYK--------DIEAI 68 (441) T ss_pred CceecCccceeccccccchhh-hhhccccccccccccccCCCcchHHHHHHhhcccccCc--c-ccc--------hhhhh Confidence 12211 12222111111 00111111 0110000 01111211110 0 000 01235 Q ss_pred hccchhHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcch----hh----hHHHHHHHhhhhh Q lcl|NC_015287. 65 LHPEVDSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFD----DK----CHEILRNWYVDGR 136 (563) Q Consensus 65 ~~PEvd~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~----~~----~~~~~R~WYvDGr 136 (563) ++|-|..||+-|.+.+.-. |+.+- ++.+.. .-+.++.+|+-+ .. -+.++......|. T Consensus 69 ~~~~V~acv~~Ia~~iA~l-----pl~~~-~~~~~~---------~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gn 133 (441) T protein:vir:98 69 RHSDIFTAVMMIASDLARM-----PIRVT-VNGQIN---------YSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSH 133 (441) T ss_pred ccHHHHHHHHHHHHhhccC-----ceEEe-cCCccc---------ccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCC Confidence 6889999999999876643 33331 221111 112234444322 12 2445556677899 Q ss_pred hheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccC Q lcl|NC_015287. 137 IYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIG 216 (563) Q Consensus 137 i~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g 216 (563) -|.+++.|.. .-+++|.+|+|..|..++. + . +.+ .|+.+... +.+ T Consensus 134 ay~~i~r~~~---G~~~~L~~i~~~~v~v~~~---~-~--g~~-----------------~~~~~~~~--------~~~- 178 (441) T protein:vir:98 134 GYIEITRDKT---GEPMNLTFRKTSEIELKLD---A-R--GRL-----------------YYFHQRID--------SNG- 178 (441) T ss_pred eEEEEEEcCC---CcEEEEEEEcCceeEEEEC---C-C--CcE-----------------EEEEEEec--------cCc- Confidence 9998877742 2378999999988776531 1 1 111 12222110 000 Q ss_pred cCccccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHH Q lcl|NC_015287. 217 QTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKA 296 (563) Q Consensus 217 ~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KA 296 (563) ......++.+.|.+.. .++.++-.++|-|+.|.+++.....+++...=+=.--+--+-|..++ |.+...+| T Consensus 179 ------~~~~~~~~~~dviHir--~~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~-~~~~~~e~ 249 (441) T protein:vir:98 179 ------NNIERNVKFEDMLDIK--FYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKA 249 (441) T ss_pred ------ceeeEEEccccEEEec--cCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC-CCCCCHHH Confidence 0112345555553332 24455556789999999888887777776542222223445566666 44433344 Q ss_pred HHHHH-HHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCC-CcchHHHHHHHHHHHHHhcCC Q lcl|NC_015287. 297 EQYLK-EVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQ-NLGELSDIEYFQKKLYRSLGV 374 (563) Q Consensus 297 eqYl~-~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~-nLgei~DV~YF~~kLy~aL~V 374 (563) .+=++ .+...|. | ..+..+.+ .|+ .|.+++.|.-.. .+.=++--+|..+.+.++++| T Consensus 250 ~~~~~~~~~~~~~---------G-~~nag~~~-vl~----------~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgV 308 (441) T protein:vir:98 250 RDRAREEFHKSFS---------G-TKQAGKVV-VLD----------ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGI 308 (441) T ss_pred HHHHHHHHHHHhc---------C-ccccCcce-ecC----------CCceEEEccCChhHHHHHHHHHHhHHHHHHHhCC Confidence 33233 2323332 2 22222222 222 245666663222 222234456677889999999 Q ss_pred ChhhccCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHH Q lcl|NC_015287. 375 PESRIAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAE 454 (563) Q Consensus 375 P~sRl~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~E 454 (563) |...|..+.+++ .+....+-|. ..|+--+. .+.+.|-..|. ++. ....+.|.-+. T Consensus 309 Pp~~lg~~~~~~------s~~q~~~~y~---~tl~P~~~-~ie~~ln~~L~-----~~~------~~~~~~fd~~~---- 363 (441) T protein:vir:98 309 PLHKFGIETANM------SITDANLDYL---STLKPYIT-CVCAELNFKFN-----DEY------VNREFKFDTTE---- 363 (441) T ss_pred CHHHcCCCCCCc------cHHHHHHHHH---HHHHHHHH-HHHHHHHhhcc-----ccc------cCceEEEechh---- Confidence 999995433222 1222233343 44543222 22333333332 221 12234443322 Q ss_pred HHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCc Q lcl|NC_015287. 455 LKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGA 534 (563) Q Consensus 455 lKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~ 534 (563) |.... +..|++.++.+-.- | + +|..|+-+ ...-++++.++...-.-+. - T Consensus 364 llr~d-~~~~~~~~~~~~~~-G-~------------~T~NE~R~-------~~gl~pi~gGd~~~~~~~~---------n 412 (441) T protein:vir:98 364 IRVVD-EKTQAEIDKINIDS-G-K------------MNIDEIRQ-------RDGLAPIPGGNGSIHRVDL---------N 412 (441) T ss_pred hhccC-HHHHHHHHHHHHhC-C-C------------cCHHHHHH-------HhCCCCCCCCCcceEeecc---------c Confidence 21111 23455554433222 1 2 45444331 1233344322211100000 0 Q ss_pred CCCCCcccccccCCCCcccCCCCCCCCCC Q lcl|NC_015287. 535 MNPQDPNMGGAPQEGGAEQAPQMPEGGEI 563 (563) Q Consensus 535 ~~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 563 (563) ..|.+. ....+.-...+...-.+|||- T Consensus 413 ~~~~~~--~~~~q~~~~~~~~~~~kgGe~ 439 (441) T protein:vir:98 413 HVNIEL--VDEYQMNKSRATDKKLKGGEE 439 (441) T ss_pred cccccc--ccccccccccccccccCCCCC Confidence 011111 111222222333445677777 No 136 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=96.97 E-value=0.00023 Score=40.54 Aligned_cols=421 Identities=14% Similarity=0.110 Sum_probs=178.5 Q ss_pred CcccccchhhHhhhcccccccccCcCCC--Ccccce--eccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHh Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNE--DSSDYY--VSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDI 76 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~--d~~~~~--~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eI 76 (563) |++-++-=. ++..+.|++.- -++.+. ..+.+...+.+..+. +.. -+ ..+..+++|-|..||+-| T Consensus 1 ~~~~l~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~--~g~-~v-~~~~al~~~~V~~~i~~i 68 (434) T protein:vir:43 1 MSKSLGKVL--------SSATSAPRSSLFGWGGKTIRLTDGAFWSQFLGRESS--SGK-KV-TVDKAMKLSAVWACVRLI 68 (434) T ss_pred Cccchhhhh--------hhcccccchhhhcccccccccCchHHHHHHhcCCcc--CCc-ee-chhhhhccHHHHHHHHHH Confidence 777664311 11222222210 001111 011111111111100 000 01 234567789999999999 Q ss_pred hcceEEecCCCcEEEEEeeccccchHHHHH-HHHHHHHHHHH-hcchhhhHHH----HHHHhhhhhhheeeeecCCCCCc Q lcl|NC_015287. 77 INEAIVSDQNDSPVEIDLENLPASKQLKDL-IREEFKTVKEI-LNFDDKCHEI----LRNWYVDGRIYYHKVIDIKAPEE 150 (563) Q Consensus 77 vneaiv~d~~~~pV~l~l~~~~~s~sik~~-I~eeF~~i~~l-l~F~~~~~~~----~R~WYvDGri~fhkviD~~~~k~ 150 (563) .+.+-.. |+.+--.+- +..++. .....-++|+. =+-...++++ +....++|--|..+.-| .. T Consensus 69 a~~ia~l-----p~~~~~~~~---~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~----~G 136 (434) T protein:vir:43 69 STSVAGL-----PLGVYERKA---DGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA----AG 136 (434) T ss_pred HHhhhhC-----ceEEEEEcC---CCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC----CC Confidence 9865532 333311110 011111 11122222221 2333455555 44556678876654332 34 Q ss_pred ceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeec Q lcl|NC_015287. 151 GIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIA 230 (563) Q Consensus 151 GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~ 230 (563) -+++|.+|+|-.|..++. + ++ ...|+.+... ...+.++ T Consensus 137 ~~~~L~~l~p~~v~~~~~---~----~g----------------~~~y~~~~~~-------------------g~~~~~~ 174 (434) T protein:vir:43 137 RPAALDFLLPSRVDLECD---E----NG----------------RLKYFYTTKK-------------------GARREIE 174 (434) T ss_pred cEEEEEEEcCcceEEEEc---C----CC----------------eEEEEEEecC-------------------ceEEEEc Confidence 588999999988775531 1 01 0123333221 1236778 Q ss_pred ccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcce Q lcl|NC_015287. 231 PDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNK 310 (563) Q Consensus 231 ~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk 310 (563) .+.|.|.+. ++.++-.++|-+..|+..+.....+++...-+----+--.-|..++ +.|.+.+++ =+++.++++..- T Consensus 175 ~~eVih~~~--~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~l~~e~~~-~~r~~~~~~~g~ 250 (434) T protein:vir:43 175 RTNMLHIPA--FTLDGRIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVD-RILQPAQRE-EFREYVKSVSGA 250 (434) T ss_pred cccEEEecC--cCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecC-CCCCHHHHH-HHHHHHHHhcCc Confidence 888866652 3666667789999999998888888776543332223334555555 456654444 356655554211 Q ss_pred eEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCChhhccCCCcceecc Q lcl|NC_015287. 311 LVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG-QNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLG 389 (563) Q Consensus 311 lvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g 389 (563) .+.|.+ + +++ .|.+++.|.-. +.+.-++-.++..+.+.++++||..-|...+.+=+.+ T Consensus 251 ----~nag~~------~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~ 309 (434) T protein:vir:43 251 ----MNSGRS------P-VLE----------QGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWG 309 (434) T ss_pred ----cccCCc------c-ccC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCcccc Confidence 112221 1 221 25666666321 1222233456778899999999999885322111111 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 390 RSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVA 469 (563) Q Consensus 390 ~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~ 469 (563) +-+...-+-|.+++ |+--+.. +.+.| -+.+.+.+++. ...+.|.-+. +... -+..|.+.+. T Consensus 310 --s~~e~~~~~f~~~~--L~P~~~~-ie~~l-----n~kL~~~~~~~----~~~~~fd~~~----llr~-d~~~r~~~~~ 370 (434) T protein:vir:43 310 --TGLEQQMLAFLTFS--ISSITNQ-IQQCV-----NKRLLTAPERI----RYYAEFSLEG----FLKA-DSAGRAAWYS 370 (434) T ss_pred --chHHHHHHHHHHHH--HHHHHHH-HHHHH-----HhhcCChhhhc----CceEEEechh----hhcc-CHHHHHHHHH Confidence 11211112244332 3332221 12222 22345666654 2344554332 2111 1234555555 Q ss_pred HhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCC--CCCcccccccC Q lcl|NC_015287. 470 AVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMN--PQDPNMGGAPQ 547 (563) Q Consensus 470 ~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~--p~~~~~~~~~~ 547 (563) .+-.- -++|.+-++.. +.+ ++++..+.--.+-+-......+....+ +..++..+..+ T Consensus 371 ~~~~~--G~~T~NE~R~~-~gl------------------~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~ 429 (434) T protein:vir:43 371 TMAQN--GFMTRNEGRRK-ENL------------------PELPGGDILTVQSNLVPIDQLGQSNKSQAVRAALMNWFSQ 429 (434) T ss_pred HHHhC--CCcCHHHHHHH-hCC------------------CCCCCCCeEeeccCccchhhhhccCCCcchhhhhhccCCC Confidence 44222 23444444331 222 222111000000000000000000000 11111122222 Q ss_pred CCCcc Q lcl|NC_015287. 548 EGGAE 552 (563) Q Consensus 548 ~~~~~ 552 (563) |.++| T Consensus 430 ~~~~~ 434 (434) T protein:vir:43 430 PEPQE 434 (434) T ss_pred CCCCC Confidence 22222 No 137 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=96.93 E-value=0.00025 Score=40.31 Aligned_cols=402 Identities=13% Similarity=0.132 Sum_probs=172.1 Q ss_pred ccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEec Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSD 84 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d 84 (563) |-|+.....+ . +++.. ++........+.-.+.++. -+..+++|-|..||+-|.+.+.-+ T Consensus 1 m~f~~~~~~~--~---~~~~~-~~~~~~~~~g~~~~~~~v~--------------~~~al~~~~v~~~i~~ia~~ia~l- 59 (409) T protein:vir:10 1 MLFRKGFKNQ--S---QEISI-DDKKILEWLGINPSETYVN--------------GKSCLKQATVFGCIRILSDNISKL- 59 (409) T ss_pred CcccccccCc--C---CCCCC-ChHHHHHHhcCCcCcceec--------------hhhhhccHHHHHHHHHHHHhhhhC- Confidence 2243221110 0 11111 1111111111111111111 123457899999999998876522 Q ss_pred CCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcchhh----hHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeC Q lcl|NC_015287. 85 QNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFDDK----CHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYID 159 (563) Q Consensus 85 ~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~~~----~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lD 159 (563) |+.|--++ . ..+.....-..++|+. =+=... -+.++..+++.|.-|..++.|.. .-+++|.+|+ T Consensus 60 ----p~~~~~~~-~---~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~---G~~~~L~~i~ 128 (409) T protein:vir:10 60 ----PIKIYQKK-D---GIKRVPDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKN---GEIKGLYPLK 128 (409) T ss_pred ----ceEEEEec-C---CeeeccCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC---CcEEEEEEEc Confidence 33331111 1 0000001111222221 111223 34455566778988888777643 2388999999 Q ss_pred CccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeec Q lcl|NC_015287. 160 PLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHS 239 (563) Q Consensus 160 P~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hS 239 (563) |..|+.+.. +. +... ....-+|.|.+. + .....++.+.|.|... T Consensus 129 ~~~V~v~~~---~~----~~~~-----------~~~~~~y~~~~~----------~--------g~~~~~~~~evih~r~ 172 (409) T protein:vir:10 129 SDGMKIFVD---DT----GLLN-----------SENNVWYLYTDD----------L--------GQRHKFMSDEILHFKG 172 (409) T ss_pred CCceEEEEc---CC----cccc-----------ccceEEEEEEeC----------C--------ceeEEeccccEEEecC Confidence 988765421 11 1100 011123333221 0 1134677777766542 Q ss_pred ceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCc Q lcl|NC_015287. 240 GLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGE 319 (563) Q Consensus 240 Gl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGe 319 (563) ++.++-.+.|-|+.|..++.....+++...=+=--.+.-+-|..++ +.|.+..+++ +++.+++...- T Consensus 173 --~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~l~~e~~~~-~~~~~~~~~~g--------- 239 (409) T protein:vir:10 173 --LTADGLAGLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYA-GDLNPEAEEV-FKENFERMSSG--------- 239 (409) T ss_pred --cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC-CCCCHHHHHH-HHHHHHHHhcc--------- Confidence 3445556789999999999888888776554433344556777776 4566655544 33333322111 Q ss_pred ccccccccchhhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHH Q lcl|NC_015287. 320 IRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG-QNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDE 398 (563) Q Consensus 320 v~d~~~~msMlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDE 398 (563) .++..+.+ +++ .|++++.|.-. ..+.-++-.++..+.+.++++||.+.|...+++ + .+.+.-.- T Consensus 240 ~~n~~~~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~-~---~~~~e~~~ 304 (409) T protein:vir:10 240 LKNAHRIA-MLP----------IGYKFEPISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRA-T---HSNITEQN 304 (409) T ss_pred ccccCCce-ecC----------CCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCC-c---cccHHHHH Confidence 11111222 221 25666666431 233334556789999999999999999532221 1 11122222 Q ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhch Q lcl|NC_015287. 399 IKFTKF-VGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGK 477 (563) Q Consensus 399 lKF~Kf-I~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGK 477 (563) +.|..+ |.-+..+ +.+.|-..| +++.+.. ....+.|.-+. +... -+..|.+.+..+-.- - T Consensus 305 ~~f~~~~l~P~~~~----ie~~ln~kL-----~~~~~~~---~~~~~~fd~~~----ll~~-d~~~~~~~~~~~~~~--G 365 (409) T protein:vir:10 305 REFYIDTLQSILNM----YELEINYKL-----FLISEIK---NGFYSKFNVDT----ILRA-DIKTRYESYKEAIQN--G 365 (409) T ss_pred HHHHHHHHHHHHHH----HHHHHHHhh-----cCchhcc---CCcEEEEechh----hhcc-CHHHHHHHHHHHHhC--C Confidence 335443 2222222 233333333 3333322 33344444332 1111 112344444332211 1 Q ss_pred hhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccCCCC Q lcl|NC_015287. 478 YFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAPQM 557 (563) Q Consensus 478 y~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~ 557 (563) + +|..|+- + ....++++.. +.. - -|.+ +.+. +.. ..+. T Consensus 366 ~------------~T~NE~R---~----~lgl~p~~gg-----D~~-----~------~~~n--~~~~--~~~---~~~~ 403 (409) T protein:vir:10 366 F------------KTPNEIR---E----LEEDEPLEGG-----DVL-----L------INGN--MIPV--KMA---GEQY 403 (409) T ss_pred C------------cCHHHHH---H----HhCCCCCCCc-----Cee-----e------eccC--ccch--hhc---cccc Confidence 2 3443332 1 1222222110 000 0 0111 0100 000 1122 Q ss_pred CCCCCC Q lcl|NC_015287. 558 PEGGEI 563 (563) Q Consensus 558 ~~~~~~ 563 (563) .+|||= T Consensus 404 ~kgGe~ 409 (409) T protein:vir:10 404 SKGGEK 409 (409) T ss_pred cccCCC Confidence 344444 No 138 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=96.92 E-value=0.00026 Score=40.26 Aligned_cols=435 Identities=11% Similarity=0.105 Sum_probs=174.6 Q ss_pred Ccccccc-hhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHH------- Q lcl|NC_015287. 1 MAKLFGF-KIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSA------- 72 (563) Q Consensus 1 m~~~~~f-~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~A------- 72 (563) |+..+=- .-+++++.. .++-....+-.+.. ..-+..-...+.+|+.+..+.+-+.. T Consensus 1 ~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~i------------~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~ 64 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVV----EQIKPKYETQEEMI------------IRLINDHKPKIDDITVGERYYNHDPDVLRLAPK 64 (474) T ss_pred CeeeccCCCchhhhhHH----HHhhhccCChHHHH------------HHHHHHHHHHHHHHHHHHHHhccCCcchhccch Confidence 6655322 111111211 11111111111100 00001111223344444333222211 Q ss_pred --------------------HHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHh Q lcl|NC_015287. 73 --------------------IEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWY 132 (563) Q Consensus 73 --------------------i~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WY 132 (563) ...||+-. +.=--..||.+..++. ...+.++.+++ -+|...-.++.+..+ T Consensus 65 ~~~~~~~~~~~~~~ki~~n~~~~Ivd~~-~~~l~g~p~~~~~~d~--------~~~~~l~~~~~-n~~~~~~~~~~~~~~ 134 (474) T protein:vir:96 65 LDNKGEIDPLKPDWRMFTNYHQNLVDQK-VAYAVANPVTFSSDDD--------KSLKTIQEVLN-HKWDDKLVDILTAAS 134 (474) T ss_pred hcccccccccccchhcccchHHHHHHhh-hhhhcccCceeecCch--------HHHHHHHHHHh-cCHHHHHHHHHHHHH Confidence 11222211 1112356777776652 23344455554 368888999999999 Q ss_pred hhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccc--cccccccceeeecCccccccc Q lcl|NC_015287. 133 VDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPAD--IENPEIEEYYQYDPSQTQGKN 210 (563) Q Consensus 133 vDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~--~~~~~~~eyf~y~~~~~~~~~ 210 (563) +.|+-|.+.-+| .+|-..+..+||+.+..|..-.........+...+...... ++.++...+|.+......... T Consensus 135 ~~G~~~~~~y~d----~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~vr~~~~~~~~~~~~yt~~~v~~~~~~~~~~~~~~ 210 (474) T protein:vir:96 135 NKGIEWLQPYID----ENGEFKTFRVPAEQAIPIWTNKERDTLKAFIRYYRLDGAERVEYWTDSDVTYYEYQDGILIPDY 210 (474) T ss_pred hcCeeEEEEEec----CCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeecCceEEEEEeCCeEEEEEecCCceeecc Confidence 999999887665 35677899999999877733211111112222211111110 011111122222211111000 Q ss_pred cccccCcCccccCc--ceeeeccc--ceeeeecceecCCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEE Q lcl|NC_015287. 211 ALGAIGQTPFSTKQ--RPVKIAPD--AISFCHSGLVDRNKQTILSYLHKSIKALNQLRM-IEDSLVIYRLSRAPERRIFY 285 (563) Q Consensus 211 ~~g~~g~~~~~~~~--~~iki~~d--aI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m-~EDalVIyRi~RAPeRRvFy 285 (563) . ....+..... ....-+.. +|++. .|.....|=++..+.....+.. +-+..-.-+.++.|-+-+.- T Consensus 211 ~---~~~~~~~~~~~~~~~~~~~g~iPvv~~------~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g 281 (474) T protein:vir:96 211 Y---HGEEHIQSHYYVGNKRVSWGRVPFIPF------KNNPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKG 281 (474) T ss_pred c---cccccccccccccccccCCCceeEEEe------ccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeec Confidence 0 0000000000 00000000 22211 1223345666666666655543 33444445666666544332 Q ss_pred ccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchH-HHHHHH Q lcl|NC_015287. 286 IDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGEL-SDIEYF 364 (563) Q Consensus 286 IDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei-~DV~YF 364 (563) .+.... .+.+.. |.. ++++ +++ +.|.+++.|-...++... .-+.-. T Consensus 282 ~~~~~~-----~~~~~~-~~~--~~~i---------------------~~~----~~~~~~~~l~~~~~~~~~~~~~~~l 328 (474) T protein:vir:96 282 YEGQDL-----DEFMRN-LKY--YKAI---------------------NVD----GDGSGVDTIQIEVPVQSSKEYLDML 328 (474) T ss_pred CCcccc-----cchhhh-hhc--CceE---------------------Eec----CCCCceeEEeecCChHHHHHHHHHH Confidence 221110 000000 111 1111 111 123345555544444332 234556 Q ss_pred HHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhc Q lcl|NC_015287. 365 QKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFT---KFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDH 441 (563) Q Consensus 365 ~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~---KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~ 441 (563) .+.+|+..++|-.-. + +|. |..|.+... .++. .-+.+.+..|..-+.++|+.=|-+.|+ ..+| .. T Consensus 329 ~~~i~~~s~~p~~~~--~--~~~-~n~Sg~Al~-~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~--~~~~----~~ 396 (474) T protein:vir:96 329 RDYVIEFGQGVDFQQ--D--KFG-NSPSGIALK-FMYSNLDLKANKLKNKTLTALQELLQYIIDFYKL--NIKV----QD 396 (474) T ss_pred HHHHHHHhCCccccc--c--ccc-cccHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Cccc----ce Confidence 778999999994322 2 221 222222211 2222 345666677777777776654444443 2233 34 Q ss_pred eEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCcc Q lcl|NC_015287. 442 IQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQL 521 (563) Q Consensus 442 i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~ 521 (563) |.+.|...--..+.. .++++.++ | .+|.+++++.+-.-+| .+++.++|++|..+..-..+....+. T Consensus 397 i~i~f~~~~p~~~~e-------~~~~~~~a----g-~iS~et~~~~~~~v~d--~~~E~~ri~~E~~e~~~~~~~~~~~~ 462 (474) T protein:vir:96 397 VEITFNFNVMVNELE-------QSQIGVQS----Q-YLSKETVVTNHPWVDD--PVAELERIEQDNIDFNKQLPPLEGDA 462 (474) T ss_pred eeEEeccCCCcCHHH-------HHHHHHhc----C-CCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhccccccccc Confidence 667775544333322 22233332 3 5899999987654443 44555566655443111111111111 Q ss_pred ccccccccCCCCcCCCCCcccccccC Q lcl|NC_015287. 522 QLGVHPEQMPGGAMNPQDPNMGGAPQ 547 (563) Q Consensus 522 ~~~~~~~~~~~~~~~p~~~~~~~~~~ 547 (563) ..+ .+.. ...++ T Consensus 463 ~~~-----~~d~---------~~e~~ 474 (474) T protein:vir:96 463 NGR-----AQDN---------ESETN 474 (474) T ss_pred ccc-----cCCC---------cccCC Confidence 100 0000 11111 No 139 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=96.87 E-value=0.00029 Score=40.00 Aligned_cols=398 Identities=11% Similarity=0.058 Sum_probs=168.1 Q ss_pred CcccccchhhHh-hhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcc Q lcl|NC_015287. 1 MAKLFGFKIQKD-DEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINE 79 (563) Q Consensus 1 m~~~~~f~~~~~-~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvne 79 (563) |.=|-++ .++. ++......+..+ ..+. ...+.+.. -.+....++|-|..||+.|.+. T Consensus 1 Mgl~~~~-f~~~~~~~~~~~~~~~~---~~~~----~~~~~g~~--------------v~~~~al~~~~v~~~v~~ia~~ 58 (409) T protein:vir:84 1 MSLFTRI-FSGPSEERTLTKISGIP---SPAE----DWAMHGDR--------------PGANSAMTLGAFYACVTLLADT 58 (409) T ss_pred Cchhhhh-hcCCCcccccccccccc---cccc----hhhccCcc--------------cchhhhhccHHHHHHHHHHHHh Confidence 4332221 1111 111111111110 0000 00011101 1233445689999999999997 Q ss_pred eEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhc----chhhhHH----HHHHHhhhhhhheeeeecCCCCCcc Q lcl|NC_015287. 80 AIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILN----FDDKCHE----ILRNWYVDGRIYYHKVIDIKAPEEG 151 (563) Q Consensus 80 aiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~----F~~~~~~----~~R~WYvDGri~fhkviD~~~~k~G 151 (563) +.-+ |+.+--++ +..+. +...++.+|+ -...+++ ++..+++.|.-|-.+.+ .+...- T Consensus 59 iA~l-----p~~~~~~~----~~~~~----~~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~--~~~~g~ 123 (409) T protein:vir:84 59 VASL-----SIDAYRKK----DNVRI----PVSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISA--RDEANR 123 (409) T ss_pred hhhC-----ceEEEEec----CCccc----ccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEE--ECCCCc Confidence 7533 23221111 11110 1122333332 2333444 45566677776654332 233445 Q ss_pred eeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecc Q lcl|NC_015287. 152 IKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAP 231 (563) Q Consensus 152 I~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~ 231 (563) +++|++|+|..|...+.-.. .+. .-++.|.- +| -.++. T Consensus 124 ~~~L~~l~p~~v~v~~~~~~-----~~~----------------~~~~~~~~-----------~g----------~~~~~ 161 (409) T protein:vir:84 124 PTAIMPIHPDCIHVTDAKDE-----DGD----------------WIEPVYRI-----------DG----------KVVPN 161 (409) T ss_pred eEEEEEEcCceeEEEEcCCC-----cce----------------EEEEEecC-----------Cc----------eEEch Confidence 89999999987664421111 000 00111211 01 12445 Q ss_pred cceeeeecceecCCCC-ccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcce Q lcl|NC_015287. 232 DAISFCHSGLVDRNKQ-TILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNK 310 (563) Q Consensus 232 daI~y~hSGl~d~~~~-~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk 310 (563) +.|. |--.+..++. .++|-|+.|.+.+.....+++...-+----+--+-+.-++ |+|.+..+++..+.....+.| T Consensus 162 ~dvi--h~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~n- 237 (409) T protein:vir:84 162 HRIM--HIKRYPVAGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSD-ADLTPDQVKQTQKQWIQSHHN- 237 (409) T ss_pred hhEE--EecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecC-CCCCHHHHHHHHHHHHHHhcc- Confidence 5553 3323444443 4678999999888888777776654333333345555554 678777777766666554432 Q ss_pred eEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCChhhccCCCcceecc Q lcl|NC_015287. 311 LVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG-QNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLG 389 (563) Q Consensus 311 lvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g 389 (563) .|. .+ .++ .|.+++.|--. ..+.-++-.++..+.+.++++||.+.|...+.+ +.. T Consensus 238 ------~g~------~~-vl~----------~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~-~~~ 293 (409) T protein:vir:84 238 ------RRL------PA-VMS----------AGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKS-TSW 293 (409) T ss_pred ------CCC------ee-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc-ccc Confidence 232 11 121 24455555321 122234455677899999999999999532111 111 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 390 RSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVA 469 (563) Q Consensus 390 ~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~ 469 (563) ++-|...-+.|..++ |+--+. .+.+.|...| .+| ..|+|++. .|.... +..|++.+. T Consensus 294 -~sn~e~~~~~f~~~~--l~P~~~-~ie~~l~~~L-~~g-----------~~i~fd~~------~l~~~d-~~~~~~~~~ 350 (409) T protein:vir:84 294 -GTGIEEQGINFVRHT--LLPWLR-CIEQALDTFL-PRG-----------QFVKFNVD------GLMRGD-VTARFTAYQ 350 (409) T ss_pred -cchHHHHHHHHHHHH--HHHHHH-HHHHHHHHhc-cCC-----------CeEEEech------hhhccC-HHHHHHHHH Confidence 122333333455442 332222 2233333332 222 23444432 222221 234555544 Q ss_pred HhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCC Q lcl|NC_015287. 470 AVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEG 549 (563) Q Consensus 470 ~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~ 549 (563) .+-.- -. +|..|+- + +...++++..+ ..-...+- .|.+ ..++.++. T Consensus 351 ~~~~~--G~------------~t~NE~R---~----~~g~~p~~ggD-----~~~~~~n~------~~~~--~~~~~~~~ 396 (409) T protein:vir:84 351 MGLQN--GI------------WSVNEVR---A----WEDAPPIPEGD-----IHLQPMNF------VPLG--YVPPEEPA 396 (409) T ss_pred HHHhC--CC------------cCHHHHH---H----HhCCCCCCCcc-----eeeecccc------cccc--cCCccccC Confidence 33222 12 3433322 1 12223332110 00000000 1111 01122222 Q ss_pred CcccCCCCCCCCC Q lcl|NC_015287. 550 GAEQAPQMPEGGE 562 (563) Q Consensus 550 ~~~~~~~~~~~~~ 562 (563) .+.+.+...+|++ T Consensus 397 ~~~~~~~~~~gn~ 409 (409) T protein:vir:84 397 QEPQPNSATEGNK 409 (409) T ss_pred cCCCCCCccCCCC Confidence 2223344455666 No 140 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=96.86 E-value=0.00029 Score=39.97 Aligned_cols=414 Identities=16% Similarity=0.170 Sum_probs=165.9 Q ss_pred Cc--ccccchhhHhhhcccccccccCcC--CCCccccee--ccccccccccc-cchhhhHHHHHHHHHHHhhccchhHHH Q lcl|NC_015287. 1 MA--KLFGFKIQKDDEQNKNVVSPVPQS--NEDSSDYYV--SSGFYGQYVDI-DGVFKSEFELIKRYREMALHPEVDSAI 73 (563) Q Consensus 1 m~--~~~~f~~~~~~~~~~~~~s~~p~~--~~d~~~~~~--~~~~~~~~~~~-~~~~~~~~~LI~~YR~ma~~PEvd~Ai 73 (563) |. +-+|| +.+. +..+.|+. +..+..+.. .+.....+... .... -+ .-...+++|-|..|| T Consensus 1 ~~~~~~~g~-~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~-----~v-~~~~a~~~~aV~~~v 67 (432) T protein:vir:97 1 MPDEKKLGL-LGQL------KAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGA-----AV-NADAIMRLDAVAACV 67 (432) T ss_pred CCCcccCch-hhhh------HhhcCCccccccccccccccCchhhhhhcccccccCc-----cc-chHhhhcchHHHHHH Confidence 22 12222 1111 11111111 111111000 00000000000 0000 01 112345789999999 Q ss_pred HHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcchhhhHH----HHHHHhhhhhhheeeeecCCCC Q lcl|NC_015287. 74 EDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFDDKCHE----ILRNWYVDGRIYYHKVIDIKAP 148 (563) Q Consensus 74 ~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~~~~~~----~~R~WYvDGri~fhkviD~~~~ 148 (563) +-|.+.+.-+ |+.|--++ .+..++.+..-..++++. =|=...+++ ++..++++|.-|..++.+. T Consensus 68 ~~Ia~~ia~l-----p~~~y~~~---~~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~~--- 136 (432) T protein:vir:97 68 KLVSQAVAAM-----PLMMYMRT---PDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTD--- 136 (432) T ss_pred HHHHHhhccC-----ceEEEEec---CCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecC--- Confidence 9999865533 44442222 112222222223333321 112234444 4445667788877776642 Q ss_pred CcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceee Q lcl|NC_015287. 149 EEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVK 228 (563) Q Consensus 149 k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~ik 228 (563) ..+++|.+|+|..|..++.. . +. -+|.|... + ...+. T Consensus 137 -g~~~~L~~l~p~~v~v~~~~----~--g~------------------~~y~~~~~----------~--------g~~~~ 173 (432) T protein:vir:97 137 -GRIESLQYLANDRLTITTDT----K--GN------------------TAYRYRRT----------D--------GQMID 173 (432) T ss_pred -CcEEEEEEEcCcceEEEEcC----C--Cc------------------EEEEEEec----------C--------ceEEE Confidence 35899999999887655221 0 11 12222110 0 12355 Q ss_pred ecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhc Q lcl|NC_015287. 229 IAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYR 308 (563) Q Consensus 229 i~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~k 308 (563) ++.+.|.+.. .++.++-.++|.|..|++++.....+++...=+=---+--.-|..+| +.|-+..++.. + .+|. T Consensus 174 ~~~~~iih~r--~~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~-~~l~~e~~~~~-~---~~~~ 246 (432) T protein:vir:97 174 IPRQQIWKIM--GYSLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQID-RFLTDDQYDSF-S---KKVS 246 (432) T ss_pred EccccEEEec--CcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecC-CCCCHHHHHHH-H---HHHh Confidence 7777775443 23556667789999999999877777765432221122334455555 44544443332 2 2221 Q ss_pred ceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCChhhccCCC-c Q lcl|NC_015287. 309 NKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE---LSDIEYFQKKLYRSLGVPESRIAGSG-D 384 (563) Q Consensus 309 nklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~~~-~ 384 (563) . -.+.|. .+ .|+ .|.+++.|. .+..+ ++-.+|....+.++++||...|...+ + T Consensus 247 ~----~~nag~------~~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~ 303 (432) T protein:vir:97 247 G----SVEAGR------AP-LLE----------GGMDVKSLG--LNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAG 303 (432) T ss_pred h----hhcCCC------ce-ecC----------CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCc Confidence 0 001121 11 221 234455552 22223 33456888999999999999995322 1 Q ss_pred ceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHH Q lcl|NC_015287. 385 GFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNER 464 (563) Q Consensus 385 g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R 464 (563) .+..| +.|.-.-+.|.++ .|+--+. .+.+.|-. +++++.++. .+.++|..+. +... -...| T Consensus 304 t~~~~--s~~e~~~~~f~~~--tl~P~~~-~ie~~ln~-----kLl~~~e~~----~~~~~fd~~~----llr~-d~~~r 364 (432) T protein:vir:97 304 TTSWG--SGIESQQLGFLTM--TLSPWLR-RIEQSIAL-----NLLTPAERR----RYFADFDTSA----LLRA-DSAAR 364 (432) T ss_pred ccccc--hhHHHHHHHHHHH--HHHHHHH-HHHHHHhh-----hccCccccC----ceEEEeechh----hhcc-CHHHH Confidence 22222 2233333345543 3333221 23333333 334445442 2445554432 2111 13356 Q ss_pred HHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCccccc Q lcl|NC_015287. 465 LGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGG 544 (563) Q Consensus 465 ~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 544 (563) .+.++.+-.- -++|.+-++.. +.+ ++++ ++..--.-++ ...|.+....+ T Consensus 365 ~~~~~~~~~~--G~~T~NE~R~~-~gl------------------pp~~-g~~~~~~~~~---------~~~pl~~~~~~ 413 (432) T protein:vir:97 365 SSYYSQLVNN--GLMTRDEAREI-EGL------------------PKLG-GNAAVLTVQS---------AMVPLDSIGLQ 413 (432) T ss_pred HHHHHHHHhC--CCCCHHHHHHH-hCC------------------CCCC-CCcceEeecc---------cccchhhhccc Confidence 6665544221 23444333321 112 2221 0000000000 00011100000 Q ss_pred ccCCCCcccCCCCCCCCCC Q lcl|NC_015287. 545 APQEGGAEQAPQMPEGGEI 563 (563) Q Consensus 545 ~~~~~~~~~~~~~~~~~~~ 563 (563) +. +....+.+-..-.++ T Consensus 414 ~~--~~~~~~~~~~~~~~~ 430 (432) T protein:vir:97 414 AS--PEPASGLGNQQQDKV 430 (432) T ss_pred CC--CCCCCCCCCcccccc Confidence 00 000000000000011 No 141 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=96.83 E-value=0.00031 Score=39.83 Aligned_cols=411 Identities=15% Similarity=0.131 Sum_probs=179.0 Q ss_pred CcccccchhhHhhh---cccccccccCc----CCCCccccee--ccccccccccccchhhhHHHHHHHHHHHhhccchhH Q lcl|NC_015287. 1 MAKLFGFKIQKDDE---QNKNVVSPVPQ----SNEDSSDYYV--SSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDS 71 (563) Q Consensus 1 m~~~~~f~~~~~~~---~~~~~~s~~p~----~~~d~~~~~~--~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~ 71 (563) |. ||.+ +.+... ..+..+.|..+ .....+.+.. ....-..+.+... .+.. -+ ......++|-|.. T Consensus 1 Mg-l~d~-~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~--~~g~-~v-~~~~al~~~~V~~ 74 (431) T protein:vir:10 1 MG-LFDF-IRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGE--LNGG-TG-RETRALRNMAVLR 74 (431) T ss_pred Cc-chhh-hhcCcccccccccccccccccccccccccccccccccchHHHHhhccCc--cCcc-ee-chhhhhccHHHHH Confidence 44 4444 222111 11111111111 0110011010 0000000000000 0000 01 1234467899999 Q ss_pred HHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcc----hhhh----HHHHHHHhhhhhhheeeee Q lcl|NC_015287. 72 AIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNF----DDKC----HEILRNWYVDGRIYYHKVI 143 (563) Q Consensus 72 Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F----~~~~----~~~~R~WYvDGri~fhkvi 143 (563) ||+.|.+.+-- -|+.|- +..+-. +.... ..+..+|+- ...+ +.++..+.++|.-|..++. T Consensus 75 ci~~Ia~~iA~-----lp~~v~-~~~~~~---~~~~~---~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r 142 (431) T protein:vir:10 75 CVTLISGTIGM-----LPMNLI-SSDDSK---QVLTD---DPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVW 142 (431) T ss_pred HHHHHHHhhcc-----CceEEE-EecCce---eeecc---chHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEE Confidence 99999886642 244431 111100 11111 223333332 1223 3445666788998888776 Q ss_pred cCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccC Q lcl|NC_015287. 144 DIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTK 223 (563) Q Consensus 144 D~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~ 223 (563) | ..++++|.+|+|..|..++. ++ + .-+|.|... + T Consensus 143 ~----~g~~~~L~pl~~~~v~~~~~---~~---~------------------~~~y~~~~~------------------~ 176 (431) T protein:vir:10 143 S----GNRPIRLIPMDRGSAKGRLT---ST---W------------------QIVYDYTTP------------------T 176 (431) T ss_pred c----CCceEEEEEEcCceeEEEEc---CC---C------------------eEEEEEEeC------------------C Confidence 5 24688999999988775521 10 0 113333210 0 Q ss_pred cceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHH Q lcl|NC_015287. 224 QRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEV 303 (563) Q Consensus 224 ~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~i 303 (563) ...+.++.+.|.|.. .++.++-.+.|.|+.|.+.+.....+++...=+----|--+-|.-.+ ++|.+.++++.-+.+ T Consensus 177 g~~~~~~~~dViHir--~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~ls~e~~~~~~~~~ 253 (431) T protein:vir:10 177 GDKIELPAREVFHLR--DLSIDGVSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIEVP-KELSDNAYGRMKASV 253 (431) T ss_pred ceEEEEchhhEEEec--CcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecC-CCCCHHHHHHHHHHH Confidence 123567777774442 24556667789999999999988888887765555555556666666 467766665555554 Q ss_pred HHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCChhhccCC Q lcl|NC_015287. 304 MNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG-QNLGELSDIEYFQKKLYRSLGVPESRIAGS 382 (563) Q Consensus 304 m~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~ 382 (563) ...|..- .+.|. .+ .+ + .|.+++.|.-. ..+.-++--+|-...+.++++||..-|... T Consensus 254 ~~~~~g~----~n~g~------~~-vl--------~--~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~ 312 (431) T protein:vir:10 254 QENHTGS----ENAGS------WM-LL--------E--EGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMD 312 (431) T ss_pred HHHhcCc----cccCC------ce-ec--------C--CCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCC Confidence 4444320 11221 11 11 1 14455555321 122222334455678999999999999643 Q ss_pred CcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHH Q lcl|NC_015287. 383 GDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLN 462 (563) Q Consensus 383 ~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~ 462 (563) +++ + .+.+.-.-+-|.+++ |+--+. .+.+.|-.. +++++++. ...+.|.-+. |.... +. T Consensus 313 ~~~-t---~sn~eq~~~~f~~~t--L~P~~~-~ie~~ln~~-----Ll~~~~~~----~~~~~fd~~~----llr~d-~~ 371 (431) T protein:vir:10 313 DTS-W---GSGIEQLAIFFIQYG--LSHWFV-SWEQAAARA-----FLPEKMLG----QRQFKFNEGA----LLRGT-LN 371 (431) T ss_pred CCC-c---cccHHHHHHHHHHHH--HHHHHH-HHHHHHHhh-----ccChhhcC----CceEEEechh----hhccC-HH Confidence 221 1 111222223354442 333222 223333333 34555543 2344454322 22111 34 Q ss_pred HHHHHHHHhhhh--hchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCc Q lcl|NC_015287. 463 ERLGVVAAVDPY--VGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDP 540 (563) Q Consensus 463 ~R~~~~~~~~p~--vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~ 540 (563) .|.+.++.+-.- -..+ ||.-|+-+ ...-++++.|..+.- ..|.+. T Consensus 372 ~r~~~~~~~~~~G~~~g~------------lT~NE~R~-------~~gl~p~~~~~gD~~--------------~~p~n~ 418 (431) T protein:vir:10 372 DQAAFFSKALGAGGQSPW------------MKQNEVRE-------MLDLPRADDPVADQL--------------RNPMTQ 418 (431) T ss_pred HHHHHHHHHHhcccccCc------------cCHHHHHH-------HhCCCCCCCccccce--------------eccccc Confidence 555555443221 1122 34333321 123334433322110 013332 Q ss_pred ccccccCCCCccc Q lcl|NC_015287. 541 NMGGAPQEGGAEQ 553 (563) Q Consensus 541 ~~~~~~~~~~~~~ 553 (563) ..+++.++||+.- T Consensus 419 ~~~~~~~~~p~~~ 431 (431) T protein:vir:10 419 KQKGSGDEPPATT 431 (431) T ss_pred ccCCCCCCCCCCC Confidence 2222222222211 No 142 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=96.80 E-value=0.00033 Score=39.68 Aligned_cols=398 Identities=14% Similarity=0.134 Sum_probs=165.8 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) |++-==++..+.. .. +.. ............+.....+.++ ..+..+++|-|..||+-|.+.+ T Consensus 1 ~~~~~~~~~~~~~--~~---~~~-~~~~~~~~~~~~~~~~~~~~~v------------~~~~~~~~~~V~~ci~~Ia~~i 62 (409) T protein:vir:93 1 MAKENIVTRIKKK--LI---DNW-IDQSTSKLYDFSPWKNRSFWGV------------INNTLETNETIFSAITKLSNSM 62 (409) T ss_pred CCccchhhhhhhh--hh---hhh-hccccccccccccccCcccccc------------chhhhhccHHHHHHHHHHHHhh Confidence 3332111111000 00 000 0000000000111011111111 1123468899999999999976 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcchhhh----HHHHHHHhhhhhhheeeeecCCCCCcceeEE Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFDDKC----HEILRNWYVDGRIYYHKVIDIKAPEEGIKEV 155 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~~~~----~~~~R~WYvDGri~fhkviD~~~~k~GI~eL 155 (563) .-. |+.+- +.++.. .....++|+. =+=...+ +.++..++++|.-|..++.|.. .-+++| T Consensus 63 a~l-----p~~~~-~~~~~~-------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~---G~~~~L 126 (409) T protein:vir:93 63 ASL-----PLKMY-EDYKVV-------NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY---HQPSKL 126 (409) T ss_pred hhC-----ceeEe-eccccc-------cchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCC---CcEEEE Confidence 532 22221 221111 1122223321 1222233 4455666788988887776632 227899 Q ss_pred EEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeeccccee Q lcl|NC_015287. 156 RYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAIS 235 (563) Q Consensus 156 r~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~ 235 (563) .+|+|..|..++.- + . ..-+|.+.... ...+.++.+.|. T Consensus 127 ~~l~~~~v~~~~~~--~-~--------------------~~~~y~~~~~~------------------g~~~~~~~~eVi 165 (409) T protein:vir:93 127 FLLNPDVVEMLIEN--Q-S--------------------RELYYSIHAAT------------------GNKLIVHNMDML 165 (409) T ss_pred EEEcCceeEEEEeC--C-C--------------------cEEEEEEEcCC------------------ceEEEEccccEE Confidence 99999887665321 0 0 01233332210 123456677774 Q ss_pred eeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeC Q lcl|NC_015287. 236 FCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDA 315 (563) Q Consensus 236 y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~ 315 (563) |.. +....++-.+.|-|..|.+.......++..- +. ...+|-.-+ ..--+.|-+.+++...+.+.+.|. T Consensus 166 h~r-~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~-~~-~~~~~~~~i-~~~~~~l~~e~~~~~~~~~~~~~~------- 234 (409) T protein:vir:93 166 HFK-HIVASNMVQGISPIDVLKNTTDFDNAVRTFN-LT-EMQKPDSFM-LKYGSNVGKEKRQQVLEDFKQYYE------- 234 (409) T ss_pred EeC-CCCCCCccccccHHHHHHHHHHHHHHHHHHH-HH-hcCCCCceE-EecCCCCCHHHHHHHHHHHHHHhh------- Confidence 442 2223344455677777777766655555542 33 333343333 333456666666655555444332 Q ss_pred CCCcccccccccchhhhhcccccCCCCccceeecCCCC-CcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchh Q lcl|NC_015287. 316 STGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQ-NLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEI 394 (563) Q Consensus 316 ~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~-nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eI 394 (563) +.|.+ + .+ + .|.+++.|.-.. .+.-++-..|-...+.++++||...|...++ -+.+. + T Consensus 235 ~~g~~------~-vl--------~--~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~-~~~sn---~ 293 (409) T protein:vir:93 235 ENGGI------L-FQ--------E--PGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSN-TNFAK---N 293 (409) T ss_pred cCCCe------e-ec--------C--CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-CCccc---H Confidence 22321 1 11 1 256777774221 2222333446778899999999999954321 12222 1 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_015287. 395 LRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPY 474 (563) Q Consensus 395 tRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~ 474 (563) .-.-..|..++ |+-.+. .+.+.|-.. ++++.+|.. ...|.|.-+ ++.... +..|++.+..+-.- T Consensus 294 e~~~~~f~~~~--l~P~~~-~ie~~l~~~-----Ll~~~~~~~---~~~~~fd~~----~ll~~d-~~~~~~~~~~~~~~ 357 (409) T protein:vir:93 294 EELNRFYLQHT--LLPIVK-QYEEEFNRK-----LLTKTDREK---NRYFKFNVK----SYLRAD-SATQAEVYFKAVRS 357 (409) T ss_pred HHHHHHHHHHH--HHHHHH-HHHHHHHhh-----cCCcccccC---cceEEeech----hhhccC-HHHHHHHHHHHHhC Confidence 11112355543 333221 133333333 345555542 234555432 222222 34556655444322 Q ss_pred hchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccC Q lcl|NC_015287. 475 VGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQA 554 (563) Q Consensus 475 vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~ 554 (563) -+++..-++.. ..-+++ |+- +..- . ..-..|.+. + .+. T Consensus 358 --G~~T~NE~R~~-------------------~g~~p~--~gg---D~~~-----~-~~n~~~~~~--------~--~~~ 395 (409) T protein:vir:93 358 --GYYTINDIREW-------------------EDLPPV--EGG---DKPL-----I-SGDLYPIDT--------P--LEL 395 (409) T ss_pred --CCcCHHHHHHH-------------------hCCCCC--CCc---Ceee-----e-ccccccccc--------c--hhh Confidence 13333333321 122222 110 0000 0 000011110 0 000 Q ss_pred CCCCCCCCC Q lcl|NC_015287. 555 PQMPEGGEI 563 (563) Q Consensus 555 ~~~~~~~~~ 563 (563) -+-.+|||- T Consensus 396 ~~~~~gG~~ 404 (409) T protein:vir:93 396 RKSLKGGDK 404 (409) T ss_pred cccccCCCC Confidence 011233333 No 143 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=96.76 E-value=0.00036 Score=39.47 Aligned_cols=448 Identities=10% Similarity=0.048 Sum_probs=180.5 Q ss_pred Cc---c-cccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHH---HHHhhccchhHHH Q lcl|NC_015287. 1 MA---K-LFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRY---REMALHPEVDSAI 73 (563) Q Consensus 1 m~---~-~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~Y---R~ma~~PEvd~Ai 73 (563) +. + +-++ +..+.......+.- ..-.+.|.. ..+.+.......+ .. ..++-+.-+| T Consensus 37 ~~~~~~~l~~~-i~~~~~~~~~r~~~--------l~~yY~g~~--------~~i~~~~~~~~~~~~~~k-i~~n~~k~Iv 98 (501) T protein:vir:27 37 MVNNWELLKNF-INHHKLRQAPRIQE--------LLDYARGEN--------HDVLQFGRRKDREMADKR-AVHNYGRMIS 98 (501) T ss_pred ccccHHHHHHH-HHHHHHHHHHHHHH--------HHHHhcCCC--------ccccccCccCccccccce-eccchHHHHH Confidence 00 0 1111 11111110000000 000001100 0000000000000 01 1234444455 Q ss_pred HHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCccee Q lcl|NC_015287. 74 EDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIK 153 (563) Q Consensus 74 ~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~ 153 (563) +..+. .. -.+||.+..++.+..+ .+.+.+..|++..+|+....++.+..++.|+-|.+.-+|. +|=. T Consensus 99 d~~~~--yl---~g~p~~~~~~d~~~~~----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~de----d~~~ 165 (501) T protein:vir:27 99 KFKTG--YL---AGNPIRVEYDDNDNNS----QNDDTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRNE----YDET 165 (501) T ss_pred HHHhh--hh---cccCeeEecCCccchH----HHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeCC----CCce Confidence 55443 11 3568888887654444 4555567788889999999999999999999998876663 4556 Q ss_pred EEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccc Q lcl|NC_015287. 154 EVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDA 233 (563) Q Consensus 154 eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~da 233 (563) .+..+||+.+..|..-.........|.+....... .....+-+|.+.........+. ........+ -+..=+ T Consensus 166 ~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~----~~~~~~~vyt~~~v~~~~~~~~---~~~~~~~~~-~~g~vP 237 (501) T protein:vir:27 166 RIKRLNPLETFVIYDNSLEDNSIAAVRYYNRGTLQ----NAKDVVEIYTNEHIYTLDASDD---FNEISVTTH-AFGTVP 237 (501) T ss_pred EEEEEccceeEEEecCCCCCceEEEEEEEEeeecC----CcEEEEEEEeCCeEEEEEeCCc---eeecccccc-CCCccc Confidence 78999999998874322221111222221111000 1122233444433222111110 000000000 011113 Q ss_pred eeeeecceecCCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeE Q lcl|NC_015287. 234 ISFCHSGLVDRNKQTILSYLHKSIKALNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLV 312 (563) Q Consensus 234 I~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m-~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklv 312 (563) |++. .|.....|=++..+.....+.. +=+....-+.++.|-+-+.-.+..+.+.. T Consensus 238 vv~~------~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~------------------ 293 (501) T protein:vir:27 238 ITEF------LNNVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQ------------------ 293 (501) T ss_pred EEEe------cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccc------------------ Confidence 3332 1223345666665555544433 23333344556666555543222221110 Q ss_pred eeCCCCcccccccccchhhhhcccccC----CCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCChhhccCCCccee Q lcl|NC_015287. 313 YDASTGEIRDDRKHMSMLEDFWLPRRE----GGRGTEITTLPGGQNLGEL-SDIEYFQKKLYRSLGVPESRIAGSGDGFN 387 (563) Q Consensus 313 Yd~~TGev~d~~~~msMlEDywLpRRe----GgrgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~VP~sRl~~~~~g~~ 387 (563) ...++.. -..++.--+ ++.+..+..|-...+...+ .-++-+.+.+|.-.++|---. + .|. T Consensus 294 ----~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~--~--~~~ 358 (501) T protein:vir:27 294 ----ASDMKRT-------RLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSD--T--NFS 358 (501) T ss_pred ----hhhhhhc-------CceeecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCc--c--ccc Confidence 0000000 011221111 1112234444322222211 234555677888888884222 1 232 Q ss_pred ccccchhhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCh-hHHHHhhhceEEEeeccchHHHHHHHHHHHHH Q lcl|NC_015287. 388 LGRSSEILR--DEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTP-EDWEVLGDHIQYDFVYDNHFAELKETELLNER 464 (563) Q Consensus 388 ~g~~~eItR--DElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~-eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R 464 (563) |..|.... -......-+.+.++.|..-+.++++.=+-+-+++.. .+++ ...|.+.|...-.-.+ .+. T Consensus 359 -~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d--~~~i~v~f~~~~p~n~-------~e~ 428 (501) T protein:vir:27 359 -GNTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFD--ESLLKITFTPNLPKSL-------NEQ 428 (501) T ss_pred -cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--cccceEEeCCCCCcCH-------HHH Confidence 22222221 111234556666777777777766553322222221 1222 1357888865333222 233 Q ss_pred HHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCcccccccccc-CCCCcCCCCCcccc Q lcl|NC_015287. 465 LGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQ-MPGGAMNPQDPNMG 543 (563) Q Consensus 465 ~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~-~~~~~~~p~~~~~~ 543 (563) ++++..+. | .+|.+++++.+-..+| .+++.++|++|..+.. +..... +.... ...+...++.. T Consensus 429 ad~~~kl~---g-~iS~et~l~~l~~v~D--~~~E~eri~~E~~e~~---~~~~~~----~~~~~~~~~~d~~~~~~--- 492 (501) T protein:vir:27 429 VSILTGLG---G-QVSQETALSLSGLVES--PNEELDKINKEVSEID---FKGYSN----DFNEHVGKYTDEVKETH--- 492 (501) T ss_pred HHHHHHHh---c-cCcHHHHHHhCCCCCC--HHHHHHHHHHHHHhhh---HhhhcC----ccccccccccCCCCCCc--- Confidence 44555553 4 3899999988754443 3344444555533211 000000 00001 11111111111 Q ss_pred cccCCCCcccCCC Q lcl|NC_015287. 544 GAPQEGGAEQAPQ 556 (563) Q Consensus 544 ~~~~~~~~~~~~~ 556 (563) +-..|.|++ T Consensus 493 ----~d~~e~~~~ 501 (501) T protein:vir:27 493 ----TDDFERAYE 501 (501) T ss_pred ----cccccccCC Confidence 111222333 No 144 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=96.76 E-value=0.00036 Score=39.46 Aligned_cols=424 Identities=14% Similarity=0.132 Sum_probs=183.3 Q ss_pred hhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHH-------------------- Q lcl|NC_015287. 13 DEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSA-------------------- 72 (563) Q Consensus 13 ~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~A-------------------- 72 (563) ++..++..=-+|++..-... .+. .+.. ... .-+.+|+.+..+.+=+.. T Consensus 1 ~~~~~~~~~~~p~d~~~~~~-~l~-~~i~---~~~-------~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~ 68 (453) T protein:vir:39 1 MKYKPPKLMTFPKDEPITNE-VVT-KFME---KHR-------LEVARYEYLKNMYRGIMAIDAEPTKDLWKPDNRLTVNF 68 (453) T ss_pred CeecCCcceEcCCCCCCCHH-HHH-HHHH---HHH-------HHHHHHHHHHHHhhccCchhcCCCccccCccceeecch Confidence 22222221111332211100 000 0000 000 111223323222221111 Q ss_pred HHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcce Q lcl|NC_015287. 73 IEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGI 152 (563) Q Consensus 73 i~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI 152 (563) ...||+ ..+.=--.+||.+..++ +...+.+..|++-.+|+....++.+.+++.|+-|.+.-.| ++|- T Consensus 69 ~~~ivd-~~~~~l~g~~~~~~~~d--------~~~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d----~~g~ 135 (453) T protein:vir:39 69 TKYIVD-TFTGYFNGIPVKKSHSD--------KETLSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQN----EETQ 135 (453) T ss_pred HHHHHH-HHhhhhcccCceeccCC--------hHHHHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEec----CCCc Confidence 111111 00001123444444432 2455678888888999999999999999999999886665 3577 Q ss_pred eEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeeccc Q lcl|NC_015287. 153 KEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPD 232 (563) Q Consensus 153 ~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~d 232 (563) ..++.++|+.+..|..-.........+..-... .....+-+|.+.........+ +..... + ...-+.. T Consensus 136 ~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~-------~~~~~~~~yt~~~i~~~~~~~--~~~~~~-~--~~~~~~g 203 (453) T protein:vir:39 136 TNVIYNTPENMFMVYDDTIKQEPLFAVRYGYDD-------DYKLYGEVYTKETTYALNGTM--GFYNMT-E--QAPNPFD 203 (453) T ss_pred eEEEEEcccceEEEecCCCCCeEEEEEEEEEeC-------CeEEEEEEEeCCeEEEEEecC--Cceeee-c--ccccCCC Confidence 889999999998885322221111122221111 011112233332222111100 000000 0 0000111 Q ss_pred --ceeeeecceecCCCCccchhHHHHHHHHHHH-HHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhc- Q lcl|NC_015287. 233 --AISFCHSGLVDRNKQTILSYLHKSIKALNQL-RMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYR- 308 (563) Q Consensus 233 --aI~y~hSGl~d~~~~~~~syL~~Aik~~NqL-~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~k- 308 (563) +|++.. |...+.|=++..+....-+ +++-+....-+.++.|-+-+.-. .++.... .+.+ T Consensus 204 ~vPvv~~~------n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~---~~~~~~~--------~~~~~ 266 (453) T protein:vir:39 204 DLPVVEFY------FNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGA---AVEEEDL--------KNIRS 266 (453) T ss_pred ceeEEEec------CCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecC---CCCchhh--------hhhhh Confidence 222221 2223445555544444333 34555666667778886655432 2222111 1111 Q ss_pred ceeEeeCCCCcccccccccchhhhhccccc-CCCCccceeecCCCCCcchHH-HHHHHHHHHHHhcCCChhhccCCCcce Q lcl|NC_015287. 309 NKLVYDASTGEIRDDRKHMSMLEDFWLPRR-EGGRGTEITTLPGGQNLGELS-DIEYFQKKLYRSLGVPESRIAGSGDGF 386 (563) Q Consensus 309 nklvYd~~TGev~d~~~~msMlEDywLpRR-eGgrgTEIsTLpGg~nLgei~-DV~YF~~kLy~aL~VP~sRl~~~~~g~ 386 (563) +++. . ++-. ..+.+.++.+|....+.+.+. -++-+.+.+|....+|- +..+ +| T Consensus 267 ~~~~-~--------------------~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~--~~ 321 (453) T protein:vir:39 267 NRVI-N--------------------YYGESSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVAN--ISDE--SF 321 (453) T ss_pred ccee-e--------------------ecCCCCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcc--cccc--cc Confidence 1211 0 1100 112344577776555555444 35667777788888884 3222 22 Q ss_pred eccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-ChhHHHHhhhceEEEeeccchHHHHHHHHHHHH Q lcl|NC_015287. 387 NLGRSSEIL--RDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIV-TPEDWEVLGDHIQYDFVYDNHFAELKETELLNE 463 (563) Q Consensus 387 ~~g~~~eIt--RDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~-t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~ 463 (563) |.+|... --+.....-+.+.|+.|..-+..+++.=+-+-+.. ...+|. .|.+.|...--=.+ .+ T Consensus 322 --gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~----~i~v~f~~~~p~~~-------~~ 388 (453) T protein:vir:39 322 --GSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVSNKEAWK----DIEYTFTRNEPKDI-------KE 388 (453) T ss_pred --cCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccc----cceEEeCCCCCcCH-------HH Confidence 4433222 22222345567777778877777777533332322 233333 46788864332222 23 Q ss_pred HHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcc Q lcl|NC_015287. 464 RLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPN 541 (563) Q Consensus 464 R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~ 541 (563) .++++..+ +| .+|.+++++.+-..+| .+++-++|++|........+.. .++.. ...++.+|.+.| T Consensus 389 ~a~~~~kl---~g-~is~et~l~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~-~~~~~------~~~~~~~~~~~e 453 (453) T protein:vir:39 389 QAETANIL---MG-ITSQETALSVISVIPD--VQAEMEKIKKEEASTAIFDKDK-QPSEK------GTDTVVPETNEE 453 (453) T ss_pred HHHHHHHH---hc-cCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHHhc-cCCCC------CCCCCCCCcCCC Confidence 45555555 34 4899999987653333 3444455555544322111100 01110 111111222222 No 145 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=96.65 E-value=0.00043 Score=39.01 Aligned_cols=395 Identities=16% Similarity=0.180 Sum_probs=162.1 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) |.= |+| ++++. ..++ +...+. ...+..+... -...|..++.+|.|..||+-|.+.+ T Consensus 1 Mg~-~~~-f~~k~-----~~~~--~~~~~~---~~~~~~~~~~------------~~~~~~~~~~~~~V~~~I~~ia~~i 56 (403) T protein:vir:80 1 MGL-FNF-FRRKT-----RSEP--TNAISW---FLTQEAYDTL------------AIPGYTRLSDNPEVRMAVHKIAELI 56 (403) T ss_pred Ccc-ccc-ccccc-----cccc--cchhhh---hccccccccc------------ccchhhhhhhhHHHHHHHHHHHHhh Confidence 432 232 22211 1111 110000 0111111100 0223556888999999999998855 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcchhhhHHHHHHH----hhhhh--hheeeeecCCCCCccee Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFDDKCHEILRNW----YVDGR--IYYHKVIDIKAPEEGIK 153 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~~~~~~~~R~W----YvDGr--i~fhkviD~~~~k~GI~ 153 (563) .-. |+.+- ++.+ +. ...+......+++. =+=...++++++.+ +.+|. -|..++.| .+.-++ T Consensus 57 A~~-----p~~~~-~~~~--~g-~~~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~---~~g~~~ 124 (403) T protein:vir:80 57 SSM-----TIHLM-QNTD--NG-DIRIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYT---TSGLID 124 (403) T ss_pred hhC-----ceEEE-EecC--Cc-eeecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEc---CCCcEE Confidence 422 33331 1111 11 11122222233221 11122455555543 44443 34444333 223388 Q ss_pred EEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccc Q lcl|NC_015287. 154 EVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDA 233 (563) Q Consensus 154 eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~da 233 (563) +|.+|+|..|..++. + + +. .+| |+. ..++.+. T Consensus 125 ~L~~l~p~~v~~~~~---~-~---g~-----------------~~~-y~~-----------------------~~~~~~e 156 (403) T protein:vir:80 125 ELIPLAPSKVSFVDT---D-T---GY-----------------QIW-YQG-----------------------KAYNYDE 156 (403) T ss_pred EEEEEcCCeeEEEEc---C-C---ce-----------------EEE-Eee-----------------------cccchhh Confidence 999999988765421 1 1 10 111 211 0123344 Q ss_pred eeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEe Q lcl|NC_015287. 234 ISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVY 313 (563) Q Consensus 234 I~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvY 313 (563) |.|-.-.....++-.+.|.+..+..+++.....++...-+----+--+-|..++. .+....+++-.+.+..+|..- T Consensus 157 iih~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-~~~~~~~~~~~~~~~~~~~~~--- 232 (403) T protein:vir:80 157 VLHFIVNPDPEKPYMGRGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDA-ATAELSSEEGRNAVFKKYLEA--- 232 (403) T ss_pred EEEEeccCCCcCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCChHHHHHHHHHHHHHHhhh--- Confidence 4332222233344456899999999999999888877666554455666777764 455555555555544443321 Q ss_pred eCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccch Q lcl|NC_015287. 314 DASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSE 393 (563) Q Consensus 314 d~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~e 393 (563) .++|.+- + +| -++..+++++.|. -.++.=++-.++-...+.++++||...|.- |.. T Consensus 233 -~~~g~~~-------~-----~~-~~~~~~~~~~~l~-~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~-------~~~-- 288 (403) T protein:vir:80 233 -SEAGQPW-------I-----IP-AELLDVEQVKPLS-LKDLAIHETVELDKRTVAGIFGVPAFLLGV-------GKY-- 288 (403) T ss_pred -hhcCCee-------e-----ec-ccccccceeccCC-HHHHHHHHHHHHhHHHHHHHhCCCHHHcCC-------CCc-- Confidence 0122111 0 11 0112234444342 234444455667778899999999999831 111 Q ss_pred hhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_015287. 394 ILRDEIKFTKFVG-RMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVD 472 (563) Q Consensus 394 ItRDElKF~KfI~-rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~ 472 (563) ++. .|..|+. .|+ -+...+.+.|-..|+ ++.| +.+.|.-+.. -... ..+|++.+..+- T Consensus 289 -~~~--~~~~f~~~~l~-P~~~~ie~~l~~kll-----~~~~-------~~~~f~~~~l----l~~d-~~~~~~~~~~~~ 347 (403) T protein:vir:80 289 -DKD--EYNNFINSTIL-PIAKGIEQELTRKLL-----ISPD-------LYFKFNPRSL----YAYD-LKELAEVGSNMY 347 (403) T ss_pred -cHH--HHHHHHHHHHH-HHHHHHHHHHHHhcc-----CCCC-------cEEEeechhh----hccC-HHHHHHHHHHHH Confidence 111 1233332 233 233334444444443 4443 2445543321 1111 234555554442 Q ss_pred hhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccc Q lcl|NC_015287. 473 PYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMG 543 (563) Q Consensus 473 p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 543 (563) .- -++|..-+|.. +.+.+.+ ..+-.+. +..-...+...-.....+|+..+.+.+.+ T Consensus 348 ~~--Gi~t~NE~R~~-~gl~p~~-----------ggd~~~~-~~n~~pl~~~~~~~~~k~ge~~~~~~~~~ 403 (403) T protein:vir:80 348 VR--GLMEGNEVRDW-LGLSPKE-----------GLSELVI-LENYIPLDKIGDQNKLKGGEKGGADGQTD 403 (403) T ss_pred hC--CCcCHHHHHHH-hCCCCCC-----------CCCeEee-cccccchhhccchhhccCCCCCCCCCCCC Confidence 21 23444443332 2222210 0000000 00000000000000011111111111111 No 146 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=96.63 E-value=0.00046 Score=38.90 Aligned_cols=414 Identities=16% Similarity=0.191 Sum_probs=167.5 Q ss_pred Ccc--cccchhhHhhhcccccccccCcCCCCcccce--ec--cccccccc-cccchhhhHHHHHHHHHHHhhccchhHHH Q lcl|NC_015287. 1 MAK--LFGFKIQKDDEQNKNVVSPVPQSNEDSSDYY--VS--SGFYGQYV-DIDGVFKSEFELIKRYREMALHPEVDSAI 73 (563) Q Consensus 1 m~~--~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~--~~--~~~~~~~~-~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai 73 (563) |+. -||| +.+...... +.+++ ..++.... +. +...+... +....+ ......++|-|..|| T Consensus 1 ~~~~~~mg~-f~r~~~~~~-~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~g~~v--------~~~~al~~~~V~~~i 67 (432) T protein:vir:81 1 MPDEKKLGL-FGQLKAMFV-PPDPV---DIGGGQTFTPVNATARDLGIIISDTGAAV--------NADAIMRLDAVAACV 67 (432) T ss_pred CCchhhcch-hhhhhhhcc-ccccc---ccccccccccCccchhhhcccccccCccc--------chHhhhccHHHHHHH Confidence 442 2444 332211110 00111 11111111 00 00000000 000001 123445789999999 Q ss_pred HHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcchhhhH----HHHHHHhhhhhhheeeeecCCCC Q lcl|NC_015287. 74 EDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFDDKCH----EILRNWYVDGRIYYHKVIDIKAP 148 (563) Q Consensus 74 ~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~~~~~----~~~R~WYvDGri~fhkviD~~~~ 148 (563) ..|.+.+.-+ |+.|--+. .+..++.+..-..++|+. =|-...++ .++..++.+|.-|..++.+. T Consensus 68 ~~Ia~~ia~l-----p~~~y~~~---~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~~--- 136 (432) T protein:vir:81 68 KLVSQAIAAM-----PLTMYMRT---PDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTD--- 136 (432) T ss_pred HHHHHhhhhC-----ceeeEEec---CCcceecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecC--- Confidence 9999966533 33331111 111122222222222221 11122333 34556677888877765532 Q ss_pred CcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceee Q lcl|NC_015287. 149 EEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVK 228 (563) Q Consensus 149 k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~ik 228 (563) ..+++|.+|+|..|..++. + . + ..+|.|... +...+. T Consensus 137 -g~~~~L~~l~~~~v~v~~~---~-~---g-----------------~~~y~~~~~------------------~g~~~~ 173 (432) T protein:vir:81 137 -GRIESLQYLANDRLTITTD---P-K---G-----------------NTAYRYRRT------------------DGQMID 173 (432) T ss_pred -CcEEEEEEEcCCceEEEEC---C-C---C-----------------cEEEEEEec------------------CceEEE Confidence 4589999999987765521 1 0 0 112333210 112356 Q ss_pred ecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhc--CccceEEEccCCCCchHHHHHHHHHHHHh Q lcl|NC_015287. 229 IAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSR--APERRIFYIDVGNLPKMKAEQYLKEVMNR 306 (563) Q Consensus 229 i~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~R--APeRRvFyIDvGnlpk~KAeqYl~~im~k 306 (563) ++.+.|.+. -.++.++-.++|-|+.|.+.+..-..+++... +.++ +--.-|..+| +.|-+..++...+ + T Consensus 174 ~~~~~iih~--r~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~--~~f~ng~~~~gil~~~-~~l~~e~~~~~~~----~ 244 (432) T protein:vir:81 174 IPKQQIWKI--MGYSLDGENGLSAIRYGAQIFGTAIAAEAQAA--RAFRNGQLQSVYYQID-RFLTDDQYDSFAK----K 244 (432) T ss_pred EccccEEEe--cCCCCCCcccccHHHHHHHHHHHHHHHHHHHH--HHHhcCCCcceEEecC-CCCCHHHHHHHHH----H Confidence 777777443 23455666778999999998887777766543 2232 2223455555 5565544444332 2 Q ss_pred hcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCChhhccCCC-c Q lcl|NC_015287. 307 YRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG-QNLGELSDIEYFQKKLYRSLGVPESRIAGSG-D 384 (563) Q Consensus 307 ~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~-~ 384 (563) |..- .+.| +.+ .|+ .|++++.|.=. +.+.=++-.+|....+.++++||...|...+ + T Consensus 245 ~~~~----~nag------~~~-vl~----------~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~ 303 (432) T protein:vir:81 245 VSGS----VEAG------RAP-LLE----------GGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAG 303 (432) T ss_pred Hhhh----hcCC------Cce-ecC----------CCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCc Confidence 2110 0112 111 222 24556555321 1222233456788899999999999995332 1 Q ss_pred ceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHH Q lcl|NC_015287. 385 GFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNER 464 (563) Q Consensus 385 g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R 464 (563) .+..| +.+...-+-|.++ .|+--+. .+.+.|-.. +.++.++. ...+.|..+. |... -+.+| T Consensus 304 ~~~~~--sn~eq~~~~f~~~--tl~P~~~-~ie~~l~~k-----Ll~~~~~~----~~~~~fd~~~----llr~-d~~~r 364 (432) T protein:vir:81 304 TTSWG--SGIESQQLGFLTM--TLSPWLR-RIEQSIALN-----LLSPAERR----RYFADFDTSA----LLRA-DSAAR 364 (432) T ss_pred ccccc--chHHHHHHHHHHH--HHHHHHH-HHHHHHHhh-----ccCccccC----ceEEEeechh----hhcc-CHHHH Confidence 22222 2222222335443 3333222 233334433 34455543 3455555432 2111 13456 Q ss_pred HHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCccccc Q lcl|NC_015287. 465 LGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGG 544 (563) Q Consensus 465 ~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 544 (563) .+.++.+-.- -+ ||..|+-++ ..-++++ +...-.. .. ....|.+....+ T Consensus 365 ~~~~~~~~~~--G~------------~t~NE~R~~-------~glpp~~--g~~~~~~-------~~-~~~~pl~~~~~~ 413 (432) T protein:vir:81 365 SSYYSQLVNN--GL------------MTRDEAREI-------EGLPKLG--GNAAVLT-------VQ-SAMVPLDSIGLQ 413 (432) T ss_pred HHHHHHHHhC--CC------------CCHHHHHHH-------hCCCCCC--CCcceEe-------ec-CcccchhhhccC Confidence 6666544221 23 344333211 1222221 1000000 00 000011100000 Q ss_pred ccCCCCcccCCCCCCCCCC Q lcl|NC_015287. 545 APQEGGAEQAPQMPEGGEI 563 (563) Q Consensus 545 ~~~~~~~~~~~~~~~~~~~ 563 (563) + ++....+.+-...-++ T Consensus 414 ~--~~~~~~~~~n~~~~~~ 430 (432) T protein:vir:81 414 A--SPEPASGLGNQQQDKV 430 (432) T ss_pred C--CCCCCCCCCCcccccc Confidence 0 0000000000000111 No 147 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=96.62 E-value=0.00046 Score=38.89 Aligned_cols=430 Identities=9% Similarity=0.072 Sum_probs=172.8 Q ss_pred ccc--chhhHhhh---cccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHH-hhccchhHHHHHhh Q lcl|NC_015287. 4 LFG--FKIQKDDE---QNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREM-ALHPEVDSAIEDII 77 (563) Q Consensus 4 ~~~--f~~~~~~~---~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m-a~~PEvd~Ai~eIv 77 (563) +|. ||++.-.. ..++..++ ++ .+ ....++++...++ ... +.+| ..+|-|.+||+-|. T Consensus 1 ~~~~~~~i~s~~~~~~i~~~~~~s-~~----~~-~~~~~~~~~pp~~-------~~~----la~l~~~n~~v~scI~~ia 63 (542) T protein:vir:41 1 MFNYHLSIRSLEKYKAIKREEVES-QA----LG-ETRFEEYVEPKVN-------PLV----LLSLLQVNPYHASACSIKA 63 (542) T ss_pred Cccccccccccccchhhhhccccc-cc----cc-cccCCccccCCCC-------HHH----HHHHHhhcHHHHHHHHHHH Confidence 666 33332221 11111111 11 00 0111233332222 222 2233 45789999999999 Q ss_pred cceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhh----hHHHHHHHhhhhhhheeeeecCCCCCccee Q lcl|NC_015287. 78 NEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDK----CHEILRNWYVDGRIYYHKVIDIKAPEEGIK 153 (563) Q Consensus 78 neaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~----~~~~~R~WYvDGri~fhkviD~~~~k~GI~ 153 (563) +.+.-+ |+.+.-++. .....| + -+-.-. ...+++.+++.|.-|.+++.|.+ .-+. T Consensus 64 ~~IA~l-----~~~~~~~~~--------~~l~~~---l--pN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~---G~~~ 122 (542) T protein:vir:41 64 NDIIRT-----GYILEGDDE--------GVVDEF---I--RACKPSFEYVLLRALEDLQVFNYCTLEVVRDDR---GDPI 122 (542) T ss_pred HHHhhC-----ceeeecccc--------hhhhhh---c--CCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC---CcEE Confidence 866543 333322111 111111 1 122223 34456677788999999888754 3499 Q ss_pred EEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccc Q lcl|NC_015287. 154 EVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDA 233 (563) Q Consensus 154 eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~da 233 (563) +|++|||.+|+..+-. ...+....+. ....|-.|... .......| .....++.+. T Consensus 123 ~L~~l~~~~v~v~~d~------~~~~~~~~~~--------~~~~~~~y~~~----~~~~~~~g-------~~~~~~~~~e 177 (542) T protein:vir:41 123 RFEYIPSHTIRVHKDG------SRYRQTWDGV--------NITHFKDYRYE----GEINPETG-------EDQDSVGANE 177 (542) T ss_pred EEEEEcCcceEEEEcC------CeeEeeecCC--------cceeEEeeccc----cccccccc-------ccccccCccc Confidence 9999999988755310 0111111110 00011111110 00000111 1123455555 Q ss_pred eeeeecceec-CCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCC---------CCchHHHHHHHHHH Q lcl|NC_015287. 234 ISFCHSGLVD-RNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVG---------NLPKMKAEQYLKEV 303 (563) Q Consensus 234 I~y~hSGl~d-~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvG---------nlpk~KAeqYl~~i 303 (563) |.|.. ..+ .++-.++|-+..|+..+.....+++...=+=--.+--+-|.++..+ .+-+...+..-+.+ T Consensus 178 IiHir--~~~~~~~~~Glspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~ 255 (542) T protein:vir:41 178 LVFIH--IPSPVCSYYGVPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALI 255 (542) T ss_pred EEEec--CCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHH Confidence 53332 223 2344558999999998888777776654433333556667777543 22233333333333 Q ss_pred HHhhcceeEeeCCCCcccccccccchhhhhcccccCC-CCccceeecCCCCCcchH---HHHHHHHHHHHHhcCCChhhc Q lcl|NC_015287. 304 MNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREG-GRGTEITTLPGGQNLGEL---SDIEYFQKKLYRSLGVPESRI 379 (563) Q Consensus 304 m~k~knklvYd~~TGev~d~~~~msMlEDywLpRReG-grgTEIsTLpGg~nLgei---~DV~YF~~kLy~aL~VP~sRl 379 (563) ...|+.-. .+.| +.+ .++ .-.| ..|.+++.|. .+..++ +-..+..+.+.++++||...| T Consensus 256 ~~~~~g~~---~n~g------k~~-vL~-----~~~~~~~g~~~~pl~--~~~~d~qfle~~~~~~~~Ia~afgVPp~~l 318 (542) T protein:vir:41 256 EDNFKHLK---EAPH------TPL-VFS-----IPGGDTVKVTFTPLN--TSQKELSFREYAAEKKYDIAAAHMIDPYRL 318 (542) T ss_pred HHHHhhhh---cccC------cee-Eee-----ccCCcccceeEEEcC--CChhHHHHHHHHHHHHHHHHHHhCCCHHHh Confidence 33332100 0111 111 111 1111 1344555543 333333 233566788999999999999 Q ss_pred cCC-CcceeccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHH Q lcl|NC_015287. 380 AGS-GDGFNLGRSSEILRDEIKF-TKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKE 457 (563) Q Consensus 380 ~~~-~~g~~~g~~~eItRDElKF-~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe 457 (563) ... +++++ +++ +...-..| ..-|.-+++++...+...|-+ +.+ ..+.+.|..+.....- T Consensus 319 G~~~~~t~n--~sn-~Eq~~~~f~~~tL~P~~~~ie~~ln~~L~~---------~~~-----~~~~~~f~~~~ll~~d-- 379 (542) T protein:vir:41 319 GIADTGPLG--GNF-AEVTRRTYYESVVRPQQNIISSILTDFFQV---------KFN-----PKTRFKFNDETLLESD-- 379 (542) T ss_pred CcCCCcccc--ccc-HHHHHHHHHHHHHHHHHHHHHHHHHhhccc---------ccC-----CceEEEecchhhcchH-- Confidence 532 22222 221 22222334 344566777776666644332 222 2345666654433221 Q ss_pred HHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcc-----------------cCc Q lcl|NC_015287. 458 TELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPME-----------------VQQ 520 (563) Q Consensus 458 ~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~-----------------~~~ 520 (563) +...++.+ .-.-++|.+-+|.++..+-+- ..++..|.. ..+ T Consensus 380 ------~~~~~~~~--v~~GilT~NE~Re~L~g~~pg--------------dd~~l~p~~~~~~~~~~~~~n~~~~~~~~ 437 (542) T protein:vir:41 380 ------SVRNCALL--VQSGVLTPAEARERLFGLDGG--------------PDIFMVPSKGAAKSVKRQERNYEKNQIRE 437 (542) T ss_pred ------HHHHHHHH--HhCCCCCHHHHHHhhCCCCCC--------------CccccccccccccccccCCcCCCCCchhh Confidence 11111111 112455555555433222210 001101100 000 Q ss_pred cccccccccCCCCcCCCCCcccccccCCC----CcccCC-------CCCCCCCC Q lcl|NC_015287. 521 LQLGVHPEQMPGGAMNPQDPNMGGAPQEG----GAEQAP-------QMPEGGEI 563 (563) Q Consensus 521 ~~~~~~~~~~~~~~~~p~~~~~~~~~~~~----~~~~~~-------~~~~~~~~ 563 (563) .......+ .|..-+.....+++ +..+.| ..+-|+.. T Consensus 438 ~~k~~~k~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (542) T protein:vir:41 438 IRKIYAKY-------RPRFNEIISSKLSAEEKKKKIDESLAEFRAEAYEAGKKM 484 (542) T ss_pred hhhccccc-------CccccccccccccchhhcccccchhhhhHHhHHhcCceE Confidence 00000000 01000000000000 000111 11122221 No 148 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=96.60 E-value=0.00047 Score=38.80 Aligned_cols=431 Identities=12% Similarity=0.103 Sum_probs=172.7 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceecccccc-ccccccchhhhHHHHHHHHHH-HhhccchhHHHHHhhc Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYG-QYVDIDGVFKSEFELIKRYRE-MALHPEVDSAIEDIIN 78 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~-~~~~~~~~~~~~~~LI~~YR~-ma~~PEvd~Ai~eIvn 78 (563) |.- .| +.+..+|.. -+-. .. ....|.. ...+.. +.+ ....|-. .+++|-|..||+.|.+ T Consensus 1 ~~~-~~---------~~~~~~p~~--~e~~-~~-~~~~~~~~~~~~~~--~~~---~~~~~~~~a~~~~~V~acV~~IA~ 61 (518) T protein:vir:10 1 MLL-AN---------GQTLSAPAM--AELS-PQ-MQDSYYYAPAVGMQ--LER---QFSLYGGIYKNQPWVRTVIAKRAQ 61 (518) T ss_pred Ccc-cC---------ceeecCchh--hhhh-hh-hhccccccccccee--ccc---ccchhhHHHhhhHHHHHHHHHHHH Confidence 110 00 111111110 0000 00 0011111 001111 000 1112222 3578999999999999 Q ss_pred ceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHH----hhhhhhheeeeecCCCCCcceeE Q lcl|NC_015287. 79 EAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNW----YVDGRIYYHKVIDIKAPEEGIKE 154 (563) Q Consensus 79 eaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~W----YvDGri~fhkviD~~~~k~GI~e 154 (563) .+- .-|+.+--.+- +..++.....+..+++-=|-...++++++.| .+.|.-|..++.|.+ ..+++ T Consensus 62 ~iA-----~lpl~l~~~~~---~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~---G~~~~ 130 (518) T protein:vir:10 62 ALA-----RLPVKCMFTSG---DTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKS---GTPEK 130 (518) T ss_pred hhc-----cCceEEEEEcC---CCceeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC---CcEEE Confidence 664 22444322211 1111111111222222223344555555554 466888888776542 34889 Q ss_pred EEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccce Q lcl|NC_015287. 155 VRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAI 234 (563) Q Consensus 155 Lr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI 234 (563) |.+|+|..|...+.- .. ...+|.|.... +. +...+.++.+.| T Consensus 131 L~~l~p~~v~v~~~~---~~--------------------~~~~y~~~~~~-------~~--------~~~~~~~~~~eV 172 (518) T protein:vir:10 131 LMPMHPSRVAIKRNS---RT--------------------GRYEYYFQAGA-------GV--------GTQLVSFADDEV 172 (518) T ss_pred EEEECCCceEEEEcC---CC--------------------CEEEEEEEecC-------Cc--------cceEEEecCCcE Confidence 999999876644211 00 11233342211 00 122356777777 Q ss_pred eeeecceecCCCC-ccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEe Q lcl|NC_015287. 235 SFCHSGLVDRNKQ-TILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVY 313 (563) Q Consensus 235 ~y~hSGl~d~~~~-~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvY 313 (563) .|.+ .+..++. .+.|-|..|++++.....+++...=+----+.-+-|.-++ +.|.+..+++--+.+-..|+- T Consensus 173 iHir--~~s~dg~~~G~spi~~a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~-~~ls~e~~~~~k~~~~~~~~G---- 245 (518) T protein:vir:10 173 VPIR--FFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQFDRAHSG---- 245 (518) T ss_pred EEec--CCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecC-CCCCHHHHHHHHHHHHHHhcC---- Confidence 5543 3444443 4679999999988888888877433332234455566665 445555444433333233321 Q ss_pred eCCCCcccccccccchhhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccc Q lcl|NC_015287. 314 DASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG-QNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSS 392 (563) Q Consensus 314 d~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~ 392 (563) ..+.|.+ + .|+ + |.+++.|.=. ..+.-++-.+|..+.+.++++||...|...++ -+. + T Consensus 246 ~~nag~v------~-vL~-------~---G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~-~t~---s 304 (518) T protein:vir:10 246 SSNTGKT------M-VVE-------E---GMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDR-ATF---S 304 (518) T ss_pred ccccCcc------e-EcC-------C---CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCC-CCc---h Confidence 0112211 1 222 2 4445544321 12223445568889999999999999942211 111 1 Q ss_pred hhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015287. 393 EILRDEIKFTKF-VGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAV 471 (563) Q Consensus 393 eItRDElKF~Kf-I~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~ 471 (563) .+.-.-+.|.++ |.-+..++...|...| ++.. +. ...|+|+. +. +... -+..|.+.+..+ T Consensus 305 n~eq~~~~f~~~tL~P~l~~ie~~ln~~L---------~~~~--~~-~~~~~fd~--~~----llr~-D~~~r~~~~~~~ 365 (518) T protein:vir:10 305 NISAQMRAFYRDTMAIPIARIQSAMDKYV---------GQYW--VR-KNRMKFDI--DD----VIQP-DWEAKSESTQKM 365 (518) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhh---------cccc--cC-CceEEEec--hh----hhcc-CHHHHHHHHHHH Confidence 122222335443 4444444444444333 2221 11 23344332 22 2111 123566666555 Q ss_pred hhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCC-C Q lcl|NC_015287. 472 DPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEG-G 550 (563) Q Consensus 472 ~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~-~ 550 (563) -.- -++|..-+|+ ++.+.+.+ .+..+-.|...+-. .. +..+.....|..+|..+ .++.++. . T Consensus 366 ~~~--G~lT~NE~R~-~~Gl~pie---------~~~gD~~~~~~n~~-pl--~~~~~~~~~g~~~~~~~--~~~~~~~~~ 428 (518) T protein:vir:10 366 VNS--GVATPNEGRE-IMGLPRSD---------DPKADELYANSALQ-PL--GATPDGAVEGEEAPAPK--RPASTPVAS 428 (518) T ss_pred HhC--CCcCHHHHHH-HhCCCCCC---------CCCCCeeeecccce-ec--ccccccccCCCCCCCCC--CCCcccccc Confidence 332 3555555543 22333311 00011111100000 00 00000111111111111 0111100 0 Q ss_pred cccCC---------------------------CCCCCCCC Q lcl|NC_015287. 551 AEQAP---------------------------QMPEGGEI 563 (563) Q Consensus 551 ~~~~~---------------------------~~~~~~~~ 563 (563) +++.| ++|-|.|- T Consensus 429 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 468 (518) T protein:vir:10 429 LDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKES 468 (518) T ss_pred ccccccccCCCCCcccccccccccccchhccccCCCcccc Confidence 00111 12222222 No 149 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=96.59 E-value=0.00049 Score=38.73 Aligned_cols=438 Identities=11% Similarity=0.032 Sum_probs=207.0 Q ss_pred cccccccccCcCCCCcccce------ec------cccccccccccc---hhhhHHHHHHHHHHHhhccchhHHHHHhhc- Q lcl|NC_015287. 15 QNKNVVSPVPQSNEDSSDYY------VS------SGFYGQYVDIDG---VFKSEFELIKRYREMALHPEVDSAIEDIIN- 78 (563) Q Consensus 15 ~~~~~~s~~p~~~~d~~~~~------~~------~~~~~~~~~~~~---~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvn- 78 (563) +....--.+|-.+++..... .. -....||-+-.. -..+..+-.+.+|.+.+. +.-||+-.++ T Consensus 1 ~~~~~~~~~~gl~~~~~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~nw--~~~~Vd~~a~r 78 (474) T protein:vir:81 1 MIQQQTVRIPSLSNDENALINGLLAQIENLRWKNLLRTSYYENKRTIQYVGTLIPPQYFNLGLVLGW--TGKAVDALARR 78 (474) T ss_pred CcCCCcCcCCCCChhHHHHHHHHHHHHHHHhhHHHHHHHHhccCCChhhccccccHHHHHHHhhcCh--HHHHHHHHHhh Confidence 11111111121122111110 00 000111111100 000111111233333333 3446665544 Q ss_pred ---ceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEE Q lcl|NC_015287. 79 ---EAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEV 155 (563) Q Consensus 79 ---eaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eL 155 (563) +-+++.+++ +-+ .....|....+|+....++++.=++.||-|. +|-+....++...+ T Consensus 79 l~~~Gf~~~d~~----------~~~--------~~l~~iw~~N~ld~~~~~~~~~al~~G~sf~--~V~~~~d~~~~~~i 138 (474) T protein:vir:81 79 CNLEGFVWPDGD----------LDS--------LGGTEVVDDNHLLSEIDSAIVAAMQHGPAFL--INTVGEDDEPEALI 138 (474) T ss_pred hcccceECCCCC----------ccc--------hHHHHHHHhcChhHHHHHHHHHHHhhCceeE--EEecCCCCCceeEE Confidence 122211100 001 2245677888999999999999999999986 44444556667888 Q ss_pred EEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCc--CccccCcceeeecccc Q lcl|NC_015287. 156 RYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQ--TPFSTKQRPVKIAPDA 233 (563) Q Consensus 156 r~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~--~~~~~~~~~iki~~da 233 (563) +.++|+.+--+.- .........+.+.... . .+....+.+|.|..+......+..+. ....++ ..-+| . T Consensus 139 ~~~sp~~~~~~~D-~~~~~~~~al~~~~~~--~---~g~~~~~~ly~~~~~~~~~~~~~~~~w~~~~~~~--~~gvP--v 208 (474) T protein:vir:81 139 HVKDASEATGEWN-RRRRGLNNLLSIIDKD--K---EGKVLSLALYLDNETVTAQRDKATLKWQVDRDEH--VYGVP--A 208 (474) T ss_pred EEeccceEEEEEe-CCCCcceeeeEEEEEc--C---CCcEEEEEEEeCCcEEEEEEcCccceeeeccCCC--CCCcc--e Confidence 9999998775531 1111111111111100 0 11122333444433222111110000 000011 11133 5 Q ss_pred eeeeecceecCCCCccchhH-HHHHHHHHH-HHHHHHHHHHHHHhcCccceEEEccCCCC------chHHHHHHHHHHHH Q lcl|NC_015287. 234 ISFCHSGLVDRNKQTILSYL-HKSIKALNQ-LRMIEDSLVIYRLSRAPERRIFYIDVGNL------PKMKAEQYLKEVMN 305 (563) Q Consensus 234 I~y~hSGl~d~~~~~~~syL-~~Aik~~Nq-L~m~EDalVIyRi~RAPeRRvFyIDvGnl------pk~KAeqYl~~im~ 305 (563) |.|+|.--.+...| .|-+ +..+...+- -|.+.+.++.=...=.|+|-|+=++-... |..+-.+|+- T Consensus 209 V~~~n~~~~~~~~G--~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~~~~~~d~d~~~~~~~~~~~~---- 282 (474) T protein:vir:81 209 QVLPYKPAPKRPFG--QSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLLGADESALKNADGTIKSVWEARLG---- 282 (474) T ss_pred EEecccccccCcCC--ccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeecCChhhcccccccccchhhhhHH---- Confidence 67777643333222 2222 122222121 25678888888888899888864332111 1111112221 Q ss_pred hhcceeEeeCCCCcccccccccchhhhhc-ccccCC-C----CccceeecCCCCCcch-HHHHHHHHHHHHHhcCCChhh Q lcl|NC_015287. 306 RYRNKLVYDASTGEIRDDRKHMSMLEDFW-LPRREG-G----RGTEITTLPGGQNLGE-LSDIEYFQKKLYRSLGVPESR 378 (563) Q Consensus 306 k~knklvYd~~TGev~d~~~~msMlEDyw-LpRReG-g----rgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sR 378 (563) ..| +|.-+. . .+++|..+|+.. |.- ++-++=.-..+-...++|.+- T Consensus 283 --------------------------~i~~~~~d~d~~~~~~~~~~~~q~~~a~-l~~~~~~l~~~~~~~a~~t~iP~~~ 335 (474) T protein:vir:81 283 --------------------------RIKGLPDDADADIPQLARADVKQFPAAS-PDAHWSDINGLAKLFAREASLPDTA 335 (474) T ss_pred --------------------------HHhcCCCcccccccccccccccccCCCC-hhHHHHHHHHHHHHHHhhhCCCHHH Confidence 122 333222 1 235667777653 332 222333334455566999888 Q ss_pred ccCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHH Q lcl|NC_015287. 379 IAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKET 458 (563) Q Consensus 379 l~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~ 458 (563) |.-.+ .-|-..+..|.-.|....+-+.+.|+.|..=+.+++|.-+.+.|-...++|..--..+.+.|..-.. T Consensus 336 lG~~~-~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~------- 407 (474) T protein:vir:81 336 VAISG-LSNPTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVAIDEIPDEWKSIDAKWRDPRY------- 407 (474) T ss_pred hcccc-cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccchhhccceeEecCCCc------- Confidence 83100 0111223467777788889999999999999999999999998876666555444567777743221 Q ss_pred HHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCc Q lcl|NC_015287. 459 ELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGA 534 (563) Q Consensus 459 Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~ 534 (563) --+.++.+.+..+..- |.-+.-..+..++|.+|+.||++....++++...+.+..-...+.+.. .+. T Consensus 408 ~s~a~~aDa~~Kl~~a-~~~~~~~~~~~~~lg~t~~~i~~~~~~~~~~~~~~~~~~l~~~~~~~~--------~aq 474 (474) T protein:vir:81 408 LSKSAQADAGMKQLAA-VPWLAETEVGLELIGLTPQQARRAMADKRRVQGRGTLQALIDRSNNGA--------TAQ 474 (474) T ss_pred cCHHHHHHHHHHHHhc-ccCCCcHHHHHhhcCCCHHHHHHHHHHHHHHhHHHHHHHHHhcCCCCC--------CCC Confidence 1224556665555432 444555667788899999999887766666544433321000000000 000 No 150 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=96.59 E-value=0.00049 Score=38.73 Aligned_cols=402 Identities=10% Similarity=0.085 Sum_probs=170.5 Q ss_pred ccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEec Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSD 84 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d 84 (563) ||| +.+-....+.....+. ..++. . +.+. +.... .-+....+|-|..||+-|.+.+.- T Consensus 1 MG~-~~~~~~~~~~~~~~~~-~~~~~----~-~~~~----g~~~~---------~~~~al~~~~V~~~v~~Ia~~iA~-- 58 (411) T protein:vir:81 1 MGW-WSRLTRFFRPRNETVD-MTNPL----L-LQWL----GVDPD---------TPRNQLSEATYFACLKILSESLGK-- 58 (411) T ss_pred Cch-HHHHHhhccCcccccc-cchHH----H-HHHh----cCccc---------ChhhhhccHHHHHHHHHHHHhHhh-- Confidence 554 2221111111000000 00110 0 1111 11100 113446789999999999996542 Q ss_pred CCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcc----hhhh----HHHHHHHhhhhhhheeeeecCCCCCcceeEEE Q lcl|NC_015287. 85 QNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNF----DDKC----HEILRNWYVDGRIYYHKVIDIKAPEEGIKEVR 156 (563) Q Consensus 85 ~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F----~~~~----~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr 156 (563) -|+.+--++- +..++.. -..++++|+- ...+ +.++..+.++|.-|..++.| ...+.+|. T Consensus 59 ---lp~~~~~~~~---~~~~~~~---~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~----~g~~~~l~ 125 (411) T protein:vir:81 59 ---LPLKMYQKTE---RGIVKSD---REELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYS----GPQLQALW 125 (411) T ss_pred ---CceeEEEecC---Cceeeec---ccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec----CCceEEEE Confidence 2333211110 0000011 1122333322 1233 34455566778888877765 25689999 Q ss_pred EeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceee Q lcl|NC_015287. 157 YIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISF 236 (563) Q Consensus 157 ~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y 236 (563) +|+|..|..++.-.. ... ....-||.|.... +...+.++.+.|.| T Consensus 126 ~l~~~~v~~~~~~~~-------~~~-----------~~~~~~~~~~~~~-----------------~g~~~~~~~~eiih 170 (411) T protein:vir:81 126 ILPSQYVTIVVDDRG-------LLG-----------EKNAIWYRYNDPY-----------------DGKMYVFRNDEILH 170 (411) T ss_pred EECCceEEEEEcCcc-------ccc-----------ccceEEEEEEecC-----------------CceEEEEccccEEE Confidence 999998876632111 100 0112233342100 11235677777765 Q ss_pred eecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCC Q lcl|NC_015287. 237 CHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDAS 316 (563) Q Consensus 237 ~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~ 316 (563) ..-+ .+.++-.+.|-+..|...+.....+++...-+--.-+--+-+..++ +.|.+..+++..+.+...|.-- .+ T Consensus 171 ~k~~-~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g~----~n 244 (411) T protein:vir:81 171 FKTS-VTFDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYT-GDLNQEARDRLVKGFEQFANGS----KN 244 (411) T ss_pred EcCC-CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC-CCCCHHHHHHHHHHHHHHhcCc----cc Confidence 5422 3445666789999999988888888776654443434445555665 4666665555444433333210 11 Q ss_pred CCcccccccccchhhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhh Q lcl|NC_015287. 317 TGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG-QNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEIL 395 (563) Q Consensus 317 TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eIt 395 (563) .| +.+ .++ .|.+++.|.-. ..+.-++-.++..+.+.++++||...|...++ -+.+.+.+ T Consensus 245 ~g------~~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~t~~n~e~-- 304 (411) T protein:vir:81 245 AG------KII-PVP----------LGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEK-SSYASAEA-- 304 (411) T ss_pred cC------Cce-ecC----------CCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-CCchhHHH-- Confidence 12 111 111 24556666321 12222344568899999999999999943211 12222211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_015287. 396 RDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYV 475 (563) Q Consensus 396 RDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~v 475 (563) .-+.|.+++ |+-- ...+.+.|-.. ++++.++. ....+.|.-+. +...+ ...|++.++.+-.- T Consensus 305 -~~~~f~~~~--l~P~-~~~ie~~l~~~-----ll~~~~~~---~~~~~~fd~~~----ll~~d-~~~~~~~~~~~~~~- 366 (411) T protein:vir:81 305 -QNLAFYVDT--LLYV-LKQYEEEITYK-----ILSNDLIS---QGHYFKFNVNV----ILRAD-IKTQMDSLSTAVQN- 366 (411) T ss_pred -HHHHHHHHH--HHHH-HHHHHHHHHhh-----cCChhhcC---CCcEEEeechh----hhccC-HHHHHHHHHHHHhC- Confidence 112344442 3221 12223333333 34444443 23344454332 11111 12344443322211 Q ss_pred chhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccCC Q lcl|NC_015287. 476 GKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAP 555 (563) Q Consensus 476 GKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~ 555 (563) -+ +|..|+- +. + .-++. |+ . +.. .-.....|.+. ... T Consensus 367 -g~------------~t~NE~R---~~---~-gl~p~--~g--g-D~~------~~~~n~~pl~~--~~~---------- 403 (411) T protein:vir:81 367 -GI------------MTPNEAR---DY---L-DMPAD--DY--G-NNL------MANGNYIPLSM--LGA---------- 403 (411) T ss_pred -CC------------cCHHHHH---HH---h-CCCCC--CC--C-Cee------eeccCccchhh--hhh---------- Confidence 12 3443332 11 1 11111 11 0 000 00000112211 111 Q ss_pred CCCCCCCC Q lcl|NC_015287. 556 QMPEGGEI 563 (563) Q Consensus 556 ~~~~~~~~ 563 (563) +-.+|||= T Consensus 404 ~~~kgGd~ 411 (411) T protein:vir:81 404 NYGKGGDS 411 (411) T ss_pred hhccCCCC Confidence 12244444 No 151 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=96.58 E-value=0.00049 Score=38.72 Aligned_cols=428 Identities=13% Similarity=0.110 Sum_probs=182.2 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) ||..++-=.++...+..+ ..+.-.. .....+.++......... ...+.+|.|..||+-|.+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~-------~~~~~~~-~~g~~~~~~~~~~~~~~~---------~~a~~~~~v~~~v~~ia~~i 63 (460) T protein:vir:10 1 MANRIIRALRELTGLDNK-------FNDAFIK-YIGQTFTKYDNNGKTYLE---------QGYNINPDVYSCISQMAAKT 63 (460) T ss_pred CchhHHHHHhhhhccCCC-------chHHHHH-hhccccCCCccchhhhhH---------HHHhcchHHHHHHHHHHHhh Confidence 666554322222111111 1110000 011111111111111111 12467799999999999865 Q ss_pred EE-----ecCCCcEEEEEeecc-ccchHHHHHHHHHHHHHHH--------------HhcchhhhH----HHHHHHhhhhh Q lcl|NC_015287. 81 IV-----SDQNDSPVEIDLENL-PASKQLKDLIREEFKTVKE--------------ILNFDDKCH----EILRNWYVDGR 136 (563) Q Consensus 81 iv-----~d~~~~pV~l~l~~~-~~s~sik~~I~eeF~~i~~--------------ll~F~~~~~----~~~R~WYvDGr 136 (563) .- +..+.+...-..... ...+.+-..++..+++.+. -=+-...+. .++..+++.|. T Consensus 64 A~lp~~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gn 143 (460) T protein:vir:10 64 VAVPYTIKVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGN 143 (460) T ss_pred hhCceEEEeccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCC Confidence 42 222222211111110 1122233334444433321 112223333 44556678899 Q ss_pred hheeeeecCCCCCcc-eeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCcccccccccccc Q lcl|NC_015287. 137 IYYHKVIDIKAPEEG-IKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAI 215 (563) Q Consensus 137 i~fhkviD~~~~k~G-I~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~ 215 (563) -|..++.+...-..| +.+|.+|+|..|...+. ++ +..+.+ .....+|.|.. T Consensus 144 ay~~i~r~~~~~~~G~~~~L~~l~~~~v~v~~~---~~----~~~~~~---------~~~~~~~~~~~------------ 195 (460) T protein:vir:10 144 CYFYLMSPDDGINAGVPSQMYVLPAHLIKIVLK---DD----INLLST---------DSPIKSYMLIQ------------ 195 (460) T ss_pred eEEEEEecCCCccCceeEEEEEEcCceEEEEEc---CC----Cceeee---------eeeeeEEEEec------------ Confidence 898888776544556 67899999988765521 11 111100 00011222211 Q ss_pred CcCccccCcceeeecccceeeee--cceecCC--CCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCC Q lcl|NC_015287. 216 GQTPFSTKQRPVKIAPDAISFCH--SGLVDRN--KQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNL 291 (563) Q Consensus 216 g~~~~~~~~~~iki~~daI~y~h--SGl~d~~--~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnl 291 (563) +.....++.+.|.|.. +-..+.+ +-.++|.+..|.+.+.....+++...-+--.-++-. ..+..-+.| T Consensus 196 -------~g~~~~~~~~evih~r~~~~~~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~~~-~i~~~~~~l 267 (460) T protein:vir:10 196 -------GDQFIEFNEDEVIHTKYANPNFDLQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQNGGVFG-FIHGGSTGL 267 (460) T ss_pred -------CceeEEecccceEEEecCCCCcccccCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc-eeeecCCCC Confidence 1123456666664432 2222222 224578999999998888888888776655556554 456666777 Q ss_pred chHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHH Q lcl|NC_015287. 292 PKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG-QNLGELSDIEYFQKKLYR 370 (563) Q Consensus 292 pk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~ 370 (563) .+..+++..+.+...|+.. .+.|.+ + .++ .|.+++.|.-. ..+.-++-.+|..+.+.+ T Consensus 268 ~~e~~~~~~~~~~~~~~g~----~n~g~~------~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~ 326 (460) T protein:vir:10 268 TQPQADSLKQRLTEMDKSP----DRLSQI------A-GAS----------GEIAFTKISLNTDELKPFDYLKYDQKAICN 326 (460) T ss_pred CHHHHHHHHHHHHHHhcCc----cccCCc------e-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHH Confidence 7777766666655555421 112222 1 121 25666666432 122224556688899999 Q ss_pred hcCCChhhccCC-CcceeccccchhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeec Q lcl|NC_015287. 371 SLGVPESRIAGS-GDGFNLGRSSEILRDEIKFTKF-VGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVY 448 (563) Q Consensus 371 aL~VP~sRl~~~-~~g~~~g~~~eItRDElKF~Kf-I~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~ 448 (563) +++||...|... +++.+.. .+.-.-+.|..+ |.-+..++...|. ..|+ ++.+.. ....|.++|.. T Consensus 327 ~fgVPp~~lg~~~~~t~~~s---n~e~~~~~f~~~~l~P~~~~ie~~ln----~kl~-----~~~~~~-~~~~i~~d~~~ 393 (460) T protein:vir:10 327 ALGWSDKLLNNNEGGGLNTG---NLEEERKRVVTDNIQPDLVILKQAFD----KKFI-----KRFKGY-ENAVIEWDISE 393 (460) T ss_pred HhCCCHHHhCCCCCCCCccc---cHHHHHHHHHHHHHHHHHHHHHHHHH----Hhhc-----Cccccc-CCceEEeecch Confidence 999999998532 2111111 122222234443 3333333333333 3322 222211 11334444432 Q ss_pred cchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccc Q lcl|NC_015287. 449 DNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPE 528 (563) Q Consensus 449 Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~ 528 (563) + ..++.+...+..+ + + ..-+|..|+- + ...-++.++|..+.---+ .+ T Consensus 394 ------l---~~l~~d~~~~~~~-------~------~-~g~~T~NE~R---~----~~g~~pi~~~~gD~~~~~---~n 440 (460) T protein:vir:10 394 ------L---PEMQTDMVAMASW-------L------N-TIPVTPNEIR---I----AMKYETLNQDGMDIVFMP---SN 440 (460) T ss_pred ------h---hhHHHHHHHHHHH-------H------h-CCCCCHHHHH---H----HhCCCCCCCCCCCeeeec---cc Confidence 2 1122222222111 1 1 1225654443 1 123334432221110000 00 Q ss_pred cCCCCcCCCCCcccccccCCCCcccCCCCCCCCC Q lcl|NC_015287. 529 QMPGGAMNPQDPNMGGAPQEGGAEQAPQMPEGGE 562 (563) Q Consensus 529 ~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~ 562 (563) -. |.+ .......+..-|++. T Consensus 441 ~~------~~~--------~~~~~~~~~~~nq~~ 460 (460) T protein:vir:10 441 KV------RID--------DVSNNLIDSAFNQNQ 460 (460) T ss_pred cc------chh--------hcccccCCCcccCCC Confidence 00 110 000011111111222 No 152 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=96.37 E-value=0.00069 Score=37.90 Aligned_cols=404 Identities=10% Similarity=0.073 Sum_probs=164.2 Q ss_pred ccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEec Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSD 84 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d 84 (563) |-|+ +.... .+++..+..++--. ..-|+... ..+.. + | -...+++|-|..||+.|.+.+.-. T Consensus 1 m~~~-~~~~~----~~~~~~~~~~~~~~-~~~g~~~s-~~~~~--v-~-------~~~al~~~~v~~cv~~ia~~ia~l- 62 (419) T protein:vir:80 1 MFFS-RQLLS----NLGQTQPGSGGWVS-ALLGSARS-EAGQV--V-T-------PASALSLTVLQNCVTLLAESIAQL- 62 (419) T ss_pred CCcc-ccccc----ccCcCCCCcchhhH-Hhhccccc-ccCcc--c-C-------hHHhhccHHHHHHHHHHHHhhccC- Confidence 2222 00000 01111111111000 01111111 00000 0 0 123457899999999999965432 Q ss_pred CCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcc----hhhh----HHHHHHHhhhhhhheeeeecCCCCCcceeEEE Q lcl|NC_015287. 85 QNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNF----DDKC----HEILRNWYVDGRIYYHKVIDIKAPEEGIKEVR 156 (563) Q Consensus 85 ~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F----~~~~----~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr 156 (563) |+.+-=.+ .+..+.... ..+..+|+. ...+ +.++..+++.|.-|.-++.|.. .-+.+|. T Consensus 63 ----p~~~~~~~---~~~~~~~~~---~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~---G~~~~L~ 129 (419) T protein:vir:80 63 ----PVELYERS---GDDRKPATD---HPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQD---GVIQGLY 129 (419) T ss_pred ----ceEEEEec---CCCcccccc---cHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC---CcEEEEE Confidence 33332111 011111111 122233322 2333 3445566777988887766643 2278999 Q ss_pred EeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceee Q lcl|NC_015287. 157 YIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISF 236 (563) Q Consensus 157 ~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y 236 (563) +|+|..|..++. + . + .-+|.+.. . ..++.+.|.+ T Consensus 130 ~i~~~~v~i~~~---~-~---~-----------------~~~y~~~~----------~------------~~~~~~~i~h 163 (419) T protein:vir:80 130 PLDNEAVTVMKG---P-D---L-----------------KPMYRVAG----------A------------DPLPQRLVHH 163 (419) T ss_pred EecCceEEEEEC---C-C---c-----------------eEEEEEcC----------c------------cccchhheEE Confidence 999988765421 1 0 0 01121110 0 0133444432 Q ss_pred eecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCC Q lcl|NC_015287. 237 CHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDAS 316 (563) Q Consensus 237 ~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~ 316 (563) --.++.++-.+.|-++.|...+.....+++...=+---.+--+-++.++... +..+.++-+..+...+....-=..+ T Consensus 164 --~~~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~n 240 (419) T protein:vir:80 164 --VRWMSINGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDA-PALKDQASVDRITDGWNAKFGGSGN 240 (419) T ss_pred --ecCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCC-CcccCHHHHHHHHHHHHHHhcCccc Confidence 2234556667789999999999988888877665544556566677776422 2111122223333333322210011 Q ss_pred CCcccccccccchhhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhh Q lcl|NC_015287. 317 TGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG-QNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEIL 395 (563) Q Consensus 317 TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eIt 395 (563) .|.+ + .++ .|.+++-|.-. +.+.-++-.++..+.+.++++||...|...+. -+... +. T Consensus 241 ~g~~------~-vl~----------~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~-~t~~n---~e 299 (419) T protein:vir:80 241 AKKV------A-LLQ----------EGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELER-ATFSN---IE 299 (419) T ss_pred cCCc------e-ecC----------CCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCC-CCccc---HH Confidence 2222 1 221 24566655321 22223344567789999999999999953221 11211 22 Q ss_pred HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_015287. 396 RDEIKFTKFV-GRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPY 474 (563) Q Consensus 396 RDElKF~KfI-~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~ 474 (563) -.-+.|..++ .-+..+ +.+.|-. .++++.++.. ..+.|..+. +... -+..|++.++.+-.- T Consensus 300 ~~~~~f~~~~l~P~~~~----ie~~l~~-----kll~~~~~~~----~~i~fd~~~----l~~~-d~~~~~~~~~~~~~~ 361 (419) T protein:vir:80 300 HQSLQFVIYTLLPWVKR----HEQAKTR-----DLLLPSERKQ----YFIEYNLAG----LLRG-DQSSRYAAYAVGRQW 361 (419) T ss_pred HHHHHHHHHHHHHHHHH----HHHHHhh-----hccCccccCC----eEEEEechh----hhcc-CHHHHHHHHHHHHhC Confidence 2223455552 222222 2233333 3345555432 334444322 2111 123555555443211 Q ss_pred hchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccC Q lcl|NC_015287. 475 VGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQA 554 (563) Q Consensus 475 vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~ 554 (563) -+ +|..|+-+ . ...++++..+.--.+ ..-.++..| .+.+.++.. T Consensus 362 --G~------------~T~NE~R~------~-~g~~p~~gGD~~~~~----------~n~~~~~~~-----~~~~~~~~~ 405 (419) T protein:vir:80 362 --GW------------LSINDIRR------L-ENMPPVKGGDIYLSP----------MNMVDASKP-----QPIPMGKTE 405 (419) T ss_pred --CC------------cCHHHHHH------H-hCCCCCCCcceeeec----------ccccccccc-----ccccCCCCC Confidence 12 34444331 1 122222211000000 000000000 000011111 Q ss_pred CCCCCCCCC Q lcl|NC_015287. 555 PQMPEGGEI 563 (563) Q Consensus 555 ~~~~~~~~~ 563 (563) |....=.|+ T Consensus 406 ~~~~~~~~~ 414 (419) T protein:vir:80 406 PTKAALDEI 414 (419) T ss_pred chhhhHHHH Confidence 111111222 No 153 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=96.32 E-value=0.00075 Score=37.72 Aligned_cols=394 Identities=13% Similarity=0.115 Sum_probs=166.3 Q ss_pred ccchhhHhh-hcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEe Q lcl|NC_015287. 5 FGFKIQKDD-EQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVS 83 (563) Q Consensus 5 ~~f~~~~~~-~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~ 83 (563) ||| +.+-. ...++..++ ...... ...++.+.....+ .|....++|.|.+||+-|.+.+.-+ T Consensus 1 Mg~-f~~~~~~~~~~~~~~----~~~~~~-~~~~~~~~~~~~~------------~~~~~~~~~~v~~~i~~ia~~ia~~ 62 (406) T protein:vir:95 1 MGL-FDRWRRTKRKSKIRA----DTGYVG-LFMSGEDVSFLVP------------GYVRLSDNPEVRMAVHKIADLISSM 62 (406) T ss_pred Ccc-hhhhccccccccccc----cchhhh-hhccCcccCcccc------------CHHHHhhcHHHHHHHHHHHHhhccC Confidence 333 11110 001111111 111111 1111111111111 1334568999999999999976432 Q ss_pred cCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhc----chhhhHH----HHHHHhhhhh--hheeeeecCCCCCccee Q lcl|NC_015287. 84 DQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILN----FDDKCHE----ILRNWYVDGR--IYYHKVIDIKAPEEGIK 153 (563) Q Consensus 84 d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~----F~~~~~~----~~R~WYvDGr--i~fhkviD~~~~k~GI~ 153 (563) |+.+--.+ + .-++.+. ..+..+|. =...+++ +++.++.+|. .|.+++.|. +.-++ T Consensus 63 -----~~~~~~~~-~---~~~~~~~---~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~---~g~~~ 127 (406) T protein:vir:95 63 -----TIYLMQNT-E---DGDIRIR---NELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTA---DGLID 127 (406) T ss_pred -----ceEEEEec-C---Ccceeec---chHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECC---CCcEE Confidence 33321111 0 0011111 12222221 2233444 4445566665 444554443 33489 Q ss_pred EEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccc Q lcl|NC_015287. 154 EVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDA 233 (563) Q Consensus 154 eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~da 233 (563) +|++|+|..|+.++.- ++ ..| .|+. ..++.+. T Consensus 128 ~l~~i~~~~v~~~~~~-------~~-----------------~~~-~~~~-----------------------~~~~~~e 159 (406) T protein:vir:95 128 ELVPLTPSKVNFLDTP-------DG-----------------YQV-LYGG-----------------------QTFNYDE 159 (406) T ss_pred EEEEEcCceeEEEEcC-------Ce-----------------EEE-Eecc-----------------------EEEchhH Confidence 9999999887654221 00 011 1110 0133333 Q ss_pred e-eeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeE Q lcl|NC_015287. 234 I-SFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLV 312 (563) Q Consensus 234 I-~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklv 312 (563) | ++.+.+ .+.++-.+.|.|..|..++.....+++...-+.---+.-+-+..++. .+.+..+++..+.+..+|+.-. T Consensus 160 vih~~~~~-~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~-~l~~e~~~~~~~~~~~~~~g~~- 236 (406) T protein:vir:95 160 VLHFIYNP-DPERPYIGRGYRVVLKDIADNLKQATATKKSFMSGKYMPSLIVKVDA-ATAELSSEEGRNAVFKKYLQAT- 236 (406) T ss_pred EEEeeccC-CCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCCHHHHHHHHHHHHHHhcccc- Confidence 4 333222 23344456899999999999998888888777666677777777774 5777777777777666664211 Q ss_pred eeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccc Q lcl|NC_015287. 313 YDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSS 392 (563) Q Consensus 313 Yd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~ 392 (563) +.| +.+=+..+ |..-++++.+.. ..+.=++-.++..+.++++++||...|... ..+ T Consensus 237 ---n~~------~~~v~~~~-------~~~~~~~~~~~~-~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~-------~~~ 292 (406) T protein:vir:95 237 ---EAG------QPWIIPAE-------LLEVEQVKPLSL-KDIAINEAVELDKRTVAGMFGVPAFLLGIG-------EFN 292 (406) T ss_pred ---ccC------CceeecCC-------CccccccccCCh-hHHHHHHHHHHHHHHHHHHhCCCHHHcCCC-------Cch Confidence 001 11100011 111233333321 223334456788899999999999888311 111 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_015287. 393 EILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVD 472 (563) Q Consensus 393 eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~ 472 (563) |-.+..|+..-=.-+...+.+.|-..|+ ++.+ +.+.|..+. +... -...|++.+..+- T Consensus 293 -----~~~~~~~~~~~l~P~~~~ie~~l~~~l~-----~~~~-------~~~~fd~~~----l~~~-d~~~~~~~~~~l~ 350 (406) T protein:vir:95 293 -----RDEYNNFINSTILPIAKGIEQELTRKLL-----ISPD-------LYFKFNPRS----LYAY-DLKELAEVGSNMY 350 (406) T ss_pred -----HHHHHHHHHHHHHHHHHHHHHHHHHhcC-----CCCC-------cEEEeechh----hhcC-CHHHHHHHHHHHH Confidence 1222333322111222334444444443 3333 234454332 2111 1224555554443 Q ss_pred hhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccc Q lcl|NC_015287. 473 PYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMG 543 (563) Q Consensus 473 p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 543 (563) .- -+++.+-++. ++.+.+.+ ..+-.+. |..-...+.........+|+.+.++.+.+ T Consensus 351 ~~--G~~t~NE~R~-~~gl~p~~-----------~gd~~~~-~~n~~~~~~~~~~~~~k~g~~~~~~~~~~ 406 (406) T protein:vir:95 351 VR--GIMEGNEVRD-WLGLSPKE-----------GLSELVI-LENYIPLDKIGDQSKLKGGDNSGADGQTD 406 (406) T ss_pred hC--CCcCHHHHHH-HhCCCCCC-----------Ccceeee-ccCccchhhcccccccCCCCCCCCCCCCC Confidence 22 2444444443 22222210 0000110 00000000000001111222222111111 No 154 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=96.31 E-value=0.00076 Score=37.69 Aligned_cols=413 Identities=12% Similarity=0.104 Sum_probs=171.4 Q ss_pred ccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEecCCCcEEEEEeecccc Q lcl|NC_015287. 20 VSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSDQNDSPVEIDLENLPA 99 (563) Q Consensus 20 ~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d~~~~pV~l~l~~~~~ 99 (563) +|++|....+- .. .++. .+.++. . ...+++|-|..+|.-|.+.+.- -|+.+-=.+.+. T Consensus 1 ~~~~~~~~g~~-~~-~~~~---~~~~~~----~--------~~~~~~~~V~acV~~Ia~~iA~-----lpl~l~~~~~~~ 58 (723) T protein:vir:94 1 MTTFPSGAGGW-NA-WSAD---SVFGNG----A--------KGWSNSAVAYRCISMLANNAAS-----VDLVVRGPDGEL 58 (723) T ss_pred CcccccCCCcc-cc-cccc---cccccc----H--------HHHhhhHHHHHHHHHHHHhhcc-----ceeEEEcCCCcc Confidence 56655532221 11 0000 111111 1 1136789999999998885542 244332111111 Q ss_pred chHHHHHHHHHHHHHHHHhc----chhhhHHHH----HHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeec Q lcl|NC_015287. 100 SKQLKDLIREEFKTVKEILN----FDDKCHEIL----RNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKT 171 (563) Q Consensus 100 s~sik~~I~eeF~~i~~ll~----F~~~~~~~~----R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~ 171 (563) . +...++.+|+ -..++.++. ..+..+|.-|..++.+-.+...-+.+|.+|+|+.+..+..... T Consensus 59 ~---------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~ 129 (723) T protein:vir:94 59 D---------ELHPLSQLWNVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAA 129 (723) T ss_pred c---------hhhHHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCC Confidence 1 1123444443 233444444 4455789999988876544545578999999876544421111 Q ss_pred cCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeecceec-CCCCccc Q lcl|NC_015287. 172 DNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVD-RNKQTIL 250 (563) Q Consensus 172 ~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d-~~~~~~~ 250 (563) + .. +....-+|.|... . ...+.++.+.|.|.. .++ .++-.++ T Consensus 130 ~-----~~------------~~~~~~~y~~~~~----------~--------G~~~~~~~~dIiHir--~~~~~dg~~G~ 172 (723) T protein:vir:94 130 D-----AV------------PQAQIIGYVIERT----------D--------GVRVPVLADEMLWLR--FSDPYDPLAVM 172 (723) T ss_pred c-----cc------------eeeeeeEEEEEec----------C--------ceeEEecccceEEec--CCCCCCCcccc Confidence 1 00 0111112333210 1 112445566664432 122 3444668 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchh Q lcl|NC_015287. 251 SYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSML 330 (563) Q Consensus 251 syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMl 330 (563) |-|..|.+++.....+++.. .++++-=-+--..|-+++|.+..+++-.+.+...|.-- .+.| +.+ +| T Consensus 173 Spi~~a~~~i~~~~aa~~~~--~~~f~NG~~p~giL~~~~l~~e~~~~~~~~~~~~~~G~----~Nag------k~~-vL 239 (723) T protein:vir:94 173 APWKAARAAVDADFYAATWQ--RQSFKNGARPGGVVNLGDMDEQTFTKTVAAFRSQVEGV----QNAG------RHL-LI 239 (723) T ss_pred cHHHHHHHHHHHHHHHHHHH--HHHHhcCCCcceEEEcCCCCHHHHHHHHHHHHHHhhch----hhcC------cce-ee Confidence 99999999999888887754 33443322222233345677666655554444333210 1112 111 11 Q ss_pred h---hhcccccCCCCccceeecCCCCCcchHHHH---HHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHHHH Q lcl|NC_015287. 331 E---DFWLPRREGGRGTEITTLPGGQNLGELSDI---EYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTKF 404 (563) Q Consensus 331 E---DywLpRReGgrgTEIsTLpGg~nLgei~DV---~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~Kf 404 (563) + .-+... +.|++++.|- .+.-++.-+ +|-.+..-++.+||...|..+.+ + +.+.-..+.|..+ T Consensus 240 ~g~~~~~~vl---~~G~~~~~l~--~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~~st-~-----sN~e~~~~~f~~~ 308 (723) T protein:vir:94 240 AGQGSDGGAA---GKGATFTSLS--MSPAEMDYINSRMHSAEEVMLAFGIRKDALLGGST-Y-----ENQAEAKAAVWTE 308 (723) T ss_pred cccccccccc---cCCceEEEcc--CCHHHHHHHHHHHHhHHHHHHHhCCChhHcCCCCC-c-----ccHHHHHHHHHHH Confidence 1 111111 2345555553 333333333 34456689999999988853321 1 1122223345433 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHH Q lcl|NC_015287. 405 VGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYV 484 (563) Q Consensus 405 I~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i 484 (563) + |+-. ...+.+.|-..|+- ... ..+.++|..+-.... =+..|.+.+..+-.- -++|.+-+ T Consensus 309 t--L~P~-~~~ie~~ln~~Ll~-----~~g-----~~~~~~f~~~~lLr~-----D~~~r~~~~~~~v~~--G~~T~NE~ 368 (723) T protein:vir:94 309 T--LIPQ-MEVMASITDLQLLP-----DIG-----WTVEWDFNSVPALQE-----DLEAQAGRNQGYLVN--DVLMVDEV 368 (723) T ss_pred H--HHHH-HHHHHHHHhHhhcc-----ccc-----CceEEeecchhhhhc-----CHHHHHHHHHHHHhC--CCcCHHHH Confidence 2 3322 23344445445432 222 246677764322211 112444444332221 24444444 Q ss_pred HHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccC---------ccccccccccCCCCcC-----CCC---Cccc-cccc Q lcl|NC_015287. 485 RRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQ---------QLQLGVHPEQMPGGAM-----NPQ---DPNM-GGAP 546 (563) Q Consensus 485 ~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~---------~~~~~~~~~~~~~~~~-----~p~---~~~~-~~~~ 546 (563) +.. +.+. +++..+... .+.+.+.|...++..- .+. .|.. .++. T Consensus 369 R~~-lglp------------------Pi~gGd~~~~~~p~~~~~a~~~~~~p~~~e~~~~~~~~~~~~~~~~p~~~~~~~ 429 (723) T protein:vir:94 369 RAT-IGLD------------------PLPGGIGQMTLTPYRAQFAPAPAPAPAVEEGAARMLALLERVAADRPLPELPVR 429 (723) T ss_pred HHH-hCCC------------------CCCCCcccceeccccccccCCCCCCccchhhhHhhhhhccccccccCcCCCCCC Confidence 332 2232 222111110 0001101111111000 000 0000 0111 Q ss_pred CCCCcccCCCCCCCCCC Q lcl|NC_015287. 547 QEGGAEQAPQMPEGGEI 563 (563) Q Consensus 547 ~~~~~~~~~~~~~~~~~ 563 (563) ..++.+.+|.-+.+.+. T Consensus 430 ~~~~~~~~~~~~~~~~~ 446 (723) T protein:vir:94 430 ATTVLHHDPGPDPQQTL 446 (723) T ss_pred CCCCCCCCcccCCchhH Confidence 11111222222222222 No 155 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=96.26 E-value=0.00081 Score=37.52 Aligned_cols=413 Identities=16% Similarity=0.191 Sum_probs=168.9 Q ss_pred Cc--ccccchhhHhhhcccccccccCcC--CCCcccce--ecccccccc--ccccchhhhHHHHHHHHHHHhhccchhHH Q lcl|NC_015287. 1 MA--KLFGFKIQKDDEQNKNVVSPVPQS--NEDSSDYY--VSSGFYGQY--VDIDGVFKSEFELIKRYREMALHPEVDSA 72 (563) Q Consensus 1 m~--~~~~f~~~~~~~~~~~~~s~~p~~--~~d~~~~~--~~~~~~~~~--~~~~~~~~~~~~LI~~YR~ma~~PEvd~A 72 (563) |. +-+|| +.+. +..+.|+. +..+..+. +.+.....+ ..-.+.. + .-...+++|-|..| T Consensus 1 ~~~~~~~~~-~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~------v-~~~~al~~~~V~~~ 66 (432) T protein:vir:10 1 MPDEKKLGL-LGQL------KAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAA------V-NADAIMRLDAVAAC 66 (432) T ss_pred CCCCcccch-hhhh------HhhcCCccccccccccccccCcchhhhhcccccccCcc------c-chhhhhcchHHHHH Confidence 22 11122 0010 11111111 11111100 000000000 0000000 1 11334678999999 Q ss_pred HHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcchhhhHH----HHHHHhhhhhhheeeeecCCC Q lcl|NC_015287. 73 IEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFDDKCHE----ILRNWYVDGRIYYHKVIDIKA 147 (563) Q Consensus 73 i~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~~~~~~----~~R~WYvDGri~fhkviD~~~ 147 (563) |+-|.+.+.-+ |+.|--++ ++..++.+..-...+|+. =+=...+++ ++..++++|--|..++.+ T Consensus 67 i~~Ia~~ia~l-----p~~~y~~~---~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~--- 135 (432) T protein:vir:10 67 VKLVSQAIAAM-----PLTMYMRT---PDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT--- 135 (432) T ss_pred HHHHHHhhhhC-----ceeEEEec---CCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec--- Confidence 99999865432 44442221 111122222222222221 112233333 455667778887776553 Q ss_pred CCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCccee Q lcl|NC_015287. 148 PEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPV 227 (563) Q Consensus 148 ~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~i 227 (563) ...+++|.+|+|..|..++. . . +. .+|.|... +...+ T Consensus 136 -~g~~~~L~~l~~~~v~v~~~---~-~--g~------------------~~y~~~~~------------------~g~~~ 172 (432) T protein:vir:10 136 -DGRIESLQYLANDRLTITTD---T-K--GN------------------TAYRYRRT------------------DGQMI 172 (432) T ss_pred -CCcEEEEEEEcCCceEEEEc---C-C--Cc------------------EEEEEEec------------------CceEE Confidence 24589999999988776532 1 1 11 12222110 01235 Q ss_pred eecccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhh Q lcl|NC_015287. 228 KIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRY 307 (563) Q Consensus 228 ki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~ 307 (563) .++.+.|.+.. .++.++-.++|.|..|++++......++...=+=---+.-.-|..+| +.|-+...++..+ +| T Consensus 173 ~~~~~~iih~~--~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~l~~e~~~~~~~----~~ 245 (432) T protein:vir:10 173 DIPKQQIWKIM--GYSLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQID-RFLTDDQYDSFAK----KV 245 (432) T ss_pred EEcCccEEEec--CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecC-CCCCHHHHHHHHH----HH Confidence 67777775443 34556667789999999999888777775443222223445566665 4565554444322 22 Q ss_pred cceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCChhhccCCC- Q lcl|NC_015287. 308 RNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE---LSDIEYFQKKLYRSLGVPESRIAGSG- 383 (563) Q Consensus 308 knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~~~- 383 (563) . | ..+..+.+ +|+ .|++++.|. .+..+ ++-.+|....+.++++||...|...+ T Consensus 246 ~---------~-~~nag~~~-vl~----------~g~~~~~l~--~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~ 302 (432) T protein:vir:10 246 S---------G-SVEAGRAP-LLE----------GGMDVKSLG--LNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSA 302 (432) T ss_pred h---------h-hhhCCCce-ecC----------CCceEEEcc--CChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccC Confidence 1 1 11111111 222 244555553 23333 33446888899999999999995332 Q ss_pred cceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHH Q lcl|NC_015287. 384 DGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNE 463 (563) Q Consensus 384 ~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~ 463 (563) +.+..|.. +...-+-|..+ .|+--+. .+.+.|-+.| +++.++. ...++|..+. +... -+.. T Consensus 303 ~t~~~~sn--~e~~~~~f~~~--tl~P~~~-~ie~~ln~kL-----~~~~~~~----~~~~~fd~~~----ll~~-d~~~ 363 (432) T protein:vir:10 303 GTTSWGSG--IESQQLGFLSM--TLSPWLR-RIEQSIALNL-----LSPAERR----RYFADFDTSA----LLRA-DSAA 363 (432) T ss_pred Ccccccch--HHHHHHHHHHH--HHHHHHH-HHHHHHHhhh-----cCccccC----ceEEEeechh----hhcc-CHHH Confidence 12322222 22222335443 3333222 2333344333 3444432 3456665433 2111 1345 Q ss_pred HHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccc Q lcl|NC_015287. 464 RLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMG 543 (563) Q Consensus 464 R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 543 (563) |.+.++.+-.- -++|.+-++.. ..-++++ +...-.. ..+ ...|.+.... T Consensus 364 r~~~~~~~~~~--G~~T~NE~R~~-------------------~glppi~--g~~~~~~-------~~~-~~~pl~~~~~ 412 (432) T protein:vir:10 364 RSSYYSQLVNN--GLMTRDEAREI-------------------EGLPKLG--GNAAVLT-------VQS-AMVPLDSIGL 412 (432) T ss_pred HHHHHHHHHhC--CCCCHHHHHHH-------------------hCCCCCC--CCcceEe-------ecC-cccchhhhcc Confidence 66666544211 23333333321 1222222 1000000 000 0001111000 Q ss_pred cccCCCCcccCCCCCCCCCC Q lcl|NC_015287. 544 GAPQEGGAEQAPQMPEGGEI 563 (563) Q Consensus 544 ~~~~~~~~~~~~~~~~~~~~ 563 (563) ++. |....+.+-....++ T Consensus 413 ~~~--~~~~~~~~~~~~~~~ 430 (432) T protein:vir:10 413 QAS--PEPASGLGNQQQDKV 430 (432) T ss_pred cCC--CCCCCCCCCcccccc Confidence 000 000000000001111 No 156 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=96.16 E-value=0.00093 Score=37.20 Aligned_cols=403 Identities=12% Similarity=0.113 Sum_probs=170.8 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcc-c-ceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhc Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSS-D-YYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIIN 78 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~-~-~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvn 78 (563) |+-=-|+ |++-....+.. +...+ .++. + ....+++.+ +.. + |. +..+++|-|..||+-|.+ T Consensus 10 ~~~~~g~-~~~~~~~f~~~-~~~~~--~~~~~~~~~~~~~~~~---~~~--v-~~-------~~al~~~~v~~cv~~Ia~ 72 (424) T protein:vir:18 10 LRTNNGW-WARLKSWFVGG-RLVTP--NQGSQTGPVSAHGYLG---DSS--I-ND-------ERILQISTVWRCVSLIST 72 (424) T ss_pred cCCCCch-HHHHHhhcccc-ccccc--cchhhccccccccccc---ccc--c-cH-------HHhhccHHHHHHHHHHHH Confidence 3333344 33322221111 11111 1111 1 011111111 110 0 11 345778889999999999 Q ss_pred ceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhc----chhhhHHH----HHHHhhhhhhheeeeecCCCCCc Q lcl|NC_015287. 79 EAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILN----FDDKCHEI----LRNWYVDGRIYYHKVIDIKAPEE 150 (563) Q Consensus 79 eaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~----F~~~~~~~----~R~WYvDGri~fhkviD~~~~k~ 150 (563) .+... |+.|- +...-+ .++++.. -..++.+|+ -...++++ +..++..|.-|.-++.|.. . T Consensus 73 ~iA~l-----p~~vy-~~~~~~--~~~~~~~-~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~---G 140 (424) T protein:vir:18 73 LTACL-----PLDVF-ETDQND--NRKKVDL-SNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA---G 140 (424) T ss_pred hhccC-----ceEEE-EeccCC--ceeeecc-ccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC---C Confidence 76543 33321 100000 0001000 011223332 22344443 4455667888876655432 2 Q ss_pred ceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeec Q lcl|NC_015287. 151 GIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIA 230 (563) Q Consensus 151 GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~ 230 (563) -+++|.+|+|..|...+ .. + .-+|.|.. + ...+.++ T Consensus 141 ~~~~L~~l~~~~v~v~~---~~----~------------------~~~y~~~~-----------~--------g~~~~~~ 176 (424) T protein:vir:18 141 DVISLLPLQSANMDVKL---VG----K------------------KVVYRYQR-----------D--------SEYADFS 176 (424) T ss_pred cEEEEEEecCcceEEEE---cC----C------------------eEEEEEEe-----------C--------CeEEEec Confidence 28899999998876421 11 0 11233321 0 1234567 Q ss_pred ccceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHH-HHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcc Q lcl|NC_015287. 231 PDAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVI-YRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRN 309 (563) Q Consensus 231 ~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVI-yRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~kn 309 (563) .+.|.|... ++.++-.++|-|..|+.++.....+++...= |.-...| +-++.++-+.+.+..+++ +++.+.++.. T Consensus 177 ~~eVihir~--~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~-~gil~~~~~~l~~e~~~~-~~~~~~~~~~ 252 (424) T protein:vir:18 177 QKEIFHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKS-PQILSTGEKVLTEQQRSQ-VEENFKEIAG 252 (424) T ss_pred cccEEEecC--cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCc-ceEEEeCCcCCCHHHHHH-HHHHHHHHhC Confidence 777755432 3445666789999999999887777775543 3322233 456666666566655443 4444444321 Q ss_pred eeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCChhhccCCCcceec Q lcl|NC_015287. 310 KLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG-QNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNL 388 (563) Q Consensus 310 klvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~ 388 (563) - .+.| +.+ .|+ .|.+++.|.=. +.+.=++--+|....+.++++||.+.|...+.+ +. T Consensus 253 ~----~nag------~~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~-t~ 310 (424) T protein:vir:18 253 G----PVKK------RLW-ILE----------AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKS-TS 310 (424) T ss_pred C----cccC------Cce-ecc----------CCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCc-cc Confidence 0 1112 111 221 14566655321 222223344577788999999999999432111 11 Q ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 389 GRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVV 468 (563) Q Consensus 389 g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~ 468 (563) .. +.+...-+.|.+++ |+- +...+.+.|-.. +++++++. ...+.|.-+. +... -...|.+.+ T Consensus 311 ~~-sn~eq~~~~f~~~t--l~P-~~~~ie~~ln~~-----L~~~~~~~----~~~~~fd~~~----llr~-d~~~r~~~~ 372 (424) T protein:vir:18 311 WG-SGIEQQNLGFLQYT--LQP-YISRWENSIQRW-----LIPSKDVG----RLHAEHNLDG----LLRG-DSASRAAFM 372 (424) T ss_pred cc-ccHHHHHHHHHHHH--HHH-HHHHHHHHHHhh-----cCCccccC----CeEEEEechh----hhcc-CHHHHHHHH Confidence 11 22333334455443 322 122234444333 45555553 2444554332 2111 123566655 Q ss_pred HHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCC Q lcl|NC_015287. 469 AAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQE 548 (563) Q Consensus 469 ~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~ 548 (563) ..+-.- -+ ||..|+-+ . ..-+++ |+-++-.-++ ...|.+ ......+ T Consensus 373 ~~~~~~--G~------------~T~NE~R~------~-~gl~pi--~ggD~~~~~~---------n~~~l~--~~~~~~~ 418 (424) T protein:vir:18 373 KAMGES--GL------------RTINEMRR------T-DNMPPL--PGGDVAMRQA---------QYVPIT--DLGTNKE 418 (424) T ss_pred HHHHhC--CC------------cCHHHHHH------H-hCCCCC--CCcCeeeecc---------Cccchh--hhhccCC Confidence 544211 12 44433331 1 122222 2110000000 001111 0111111 Q ss_pred CCcccC Q lcl|NC_015287. 549 GGAEQA 554 (563) Q Consensus 549 ~~~~~~ 554 (563) |....| T Consensus 419 ~~~n~a 424 (424) T protein:vir:18 419 PRNNGA 424 (424) T ss_pred ccccCC Confidence 111112 No 157 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=95.98 E-value=0.0012 Score=36.65 Aligned_cols=390 Identities=14% Similarity=0.150 Sum_probs=169.2 Q ss_pred ccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEec Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSD 84 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d 84 (563) |+| |+.. .+.....++.-.....|+....++.. .-+++|.|..||+-|.+.+..+ T Consensus 1 m~~-f~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~Al~~~~V~~~i~~Ia~~iA~l- 55 (406) T protein:vir:97 1 MSF-FQPL--------GTSKVSYDDYISSVLAGDVSQKYLGV---------------SALKNSDILTATSIIAGDIARF- 55 (406) T ss_pred Ccc-cccc--------CCCCCCcchHHHHHhcCCCCcccccc---------------hhhccHHHHHHHHHHHHhhhhC- Confidence 444 2210 11111122222222233322222221 0246789999999999976644 Q ss_pred CCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcch----hhh----HHHHHHHhhhhhhheeeeecCCCCCcceeEEE Q lcl|NC_015287. 85 QNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFD----DKC----HEILRNWYVDGRIYYHKVIDIKAPEEGIKEVR 156 (563) Q Consensus 85 ~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~----~~~----~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr 156 (563) |+.+.=++-+ .+.+ ..+..+|+.. ..+ +.++...+++|--|.-++.|.+ ...+.+|. T Consensus 56 ----p~~~~~~~g~-------~~~~--~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~--~g~~~~L~ 120 (406) T protein:vir:97 56 ----PLVKKDVNGD-------IIHD--EDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPK--TNQALQFQ 120 (406) T ss_pred ----eeEEEecCcc-------cccc--chHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCC--CCeEEEEE Confidence 3322211111 1111 1233444321 233 3344566777887876665532 22378999 Q ss_pred EeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceee Q lcl|NC_015287. 157 YIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISF 236 (563) Q Consensus 157 ~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y 236 (563) +|+|..|...+. ++ + .-+|.|.... +...+.++.+.|. T Consensus 121 ~i~p~~v~v~~~---~~---~------------------~~~y~~~~~~-----------------~~~~~~~~~~evi- 158 (406) T protein:vir:97 121 FYRPSETTVEET---DN---H------------------EIVYTFTDML-----------------TAKQVKCFAHDVI- 158 (406) T ss_pred EECCCeeEEEEc---CC---c------------------eEEEEEEecC-----------------CceEEEEccccEE- Confidence 999988764421 10 1 1122232100 0123456666663 Q ss_pred eecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCC Q lcl|NC_015287. 237 CHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDAS 316 (563) Q Consensus 237 ~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~ 316 (563) |--+++.++-.++|-|..|.+++.....+++...=+----++- .++...-+.|-+..+++. ++-+.++.. ..+ T Consensus 159 -h~r~~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~-~~i~~~~~~l~~e~~~~~-~~~~~~~~~----g~n 231 (406) T protein:vir:97 159 -HWKFFSHDTILGRSPLLSLGDEIDLQTGGINTLIKFFKDGFSS-GILTMKGAQLSGDARQRA-RQEFEKMRE----GSV 231 (406) T ss_pred -EecCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCC-ceEEecCCCCCHHHHHHH-HHHHHHHhc----ccc Confidence 3334556665678999999999988777887655443344553 455555566665554443 332333211 122 Q ss_pred CCcccccccccchhhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhh Q lcl|NC_015287. 317 TGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEIL 395 (563) Q Consensus 317 TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eIt 395 (563) .|.+ + .|+ .|.+++.|.-..+-.| ++--+|-.+.+-++++||...|...++ +. .+. T Consensus 232 ~g~~------~-vl~----------~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~----~~--~~e 288 (406) T protein:vir:97 232 GGSP------L-VFD----------STMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVNSP----NQ--SVA 288 (406) T ss_pred cCce------e-ecC----------CCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCCC----cc--hHH Confidence 2322 1 121 2556666642222112 222233467788899999999953221 11 112 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_015287. 396 RDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYV 475 (563) Q Consensus 396 RDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~v 475 (563) -....|.+++ |+-.+. .+.+.|-.. +.++.+|.. -.|++++.. +++.|++.+..+ T Consensus 289 ~~~~~f~~~~--l~P~~~-~ie~~l~~k-----ll~~~~~~~--~~i~fd~~~-----------~~~~~~~~~~~~---- 343 (406) T protein:vir:97 289 QLMEDYVTND--LPFYFD-AITSELGLK-----TLNDKDRRL--YHIEFDTRS-----------VTGRNVDEIVKL---- 343 (406) T ss_pred HHHHHHHHHH--HHHHHH-HHHHHHhhh-----hcChhhccc--eeEEEecCc-----------cchhhHHHHHHH---- Confidence 2222355542 333222 223333333 345565542 234444322 234454444322 Q ss_pred chhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccCC Q lcl|NC_015287. 476 GKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAP 555 (563) Q Consensus 476 GKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~ 555 (563) .+ .--||..|+-+ +...++++.|+-+.---+. + - .|.+. ....+.. -. T Consensus 344 ---------~~-~g~~T~NE~R~-------~~g~~p~~~~~gD~~~~~~----n----~-~~~~~--~~~~~~~----~~ 391 (406) T protein:vir:97 344 ---------VN-NQILTPNQGLV-------ELGKQKSTDPNMDRYQSSL----N----Y-VFLDK--KEEYQDK----VG 391 (406) T ss_pred ---------Hh-CCCcCHHHHHH-------HhCCCCCCCCCCCeEeecc----C----c-cchhc--ccccccc----cc Confidence 11 11255544431 1233444433211110000 0 0 01110 0000000 01 Q ss_pred CCCCCCCC Q lcl|NC_015287. 556 QMPEGGEI 563 (563) Q Consensus 556 ~~~~~~~~ 563 (563) ..++|||. T Consensus 392 ~~~~gg~~ 399 (406) T protein:vir:97 392 IKGKGGEV 399 (406) T ss_pred cccCCCCC Confidence 12345554 No 158 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=95.93 E-value=0.0012 Score=36.53 Aligned_cols=423 Identities=11% Similarity=0.072 Sum_probs=176.3 Q ss_pred ccccCcCCCCcccce---ecc------cccccccc---c-cchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEecCC Q lcl|NC_015287. 20 VSPVPQSNEDSSDYY---VSS------GFYGQYVD---I-DGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSDQN 86 (563) Q Consensus 20 ~s~~p~~~~d~~~~~---~~~------~~~~~~~~---~-~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d~~ 86 (563) .++..| ++-..-. ..+ .+..||-+ + .-......++-+..+ ...+.=+.-+|+..+.= -. T Consensus 1 ~~~~t~--~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~-k~~~n~~~~ivd~~~~~-----l~ 72 (456) T protein:vir:10 1 MTASTP--AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQR-EARTNWGLMVRDSVADR-----II 72 (456) T ss_pred CCCCCH--HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhh-hhhcchHHHHHHHHHhh-----hc Confidence 111111 0000000 000 00111111 0 000000111111111 12233334444444431 12 Q ss_pred CcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceee Q lcl|NC_015287. 87 DSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLV 166 (563) Q Consensus 87 ~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~v 166 (563) .+||.+...+ .. ...+++..|.+..+|+....++++.-++.||-|.+.-.| .+|-..++.+||+.+-.| T Consensus 73 ~~~~~~~~~~---d~----~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d----~~g~~~i~~~~p~~~~~i 141 (456) T protein:vir:10 73 PNGITVGGSA---DS----DLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR----DDGTATITADSPETMVVS 141 (456) T ss_pred cCCeecCCCC---Cc----chHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeC----CCCceEEEEEccceeEEE Confidence 3466554321 01 122445666777889999999999999999988764333 467778889999987666 Q ss_pred eeeeccCCcccccee-ee--ccccccc------ccccccce----eeecCccccccccccccCcCccccCcceeeecccc Q lcl|NC_015287. 167 RKLKTDNSLRGAINQ-VN--SKNPADI------ENPEIEEY----YQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDA 233 (563) Q Consensus 167 R~~~~~~~~~~~i~~-~~--~~~~~~~------~~~~~~ey----f~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~da 233 (563) .- ++.+..+.. ++ ....... .-..+..| +.|........... ++....-+..+-.+..=+ T Consensus 142 ~d----~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~p 214 (456) T protein:vir:10 142 VD----PLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRI---SDSWVPVGDAVVTGSPPP 214 (456) T ss_pred Ec----CCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeec---CCceeeccccCCCCCcee Confidence 32 111111111 11 0000000 00001111 11111100000000 000000000000111113 Q ss_pred eeeeecceecCCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeE Q lcl|NC_015287. 234 ISFCHSGLVDRNKQTILSYLHKSIKALNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLV 312 (563) Q Consensus 234 I~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m-~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklv 312 (563) |+|.+ +....|=++..+.....+.. +=|.++.-...--|.|-+.-.+.+. +.. T Consensus 215 vv~~~-------N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~-~~~------------------ 268 (456) T protein:vir:10 215 VVVYQ-------NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGL-PNV------------------ 268 (456) T ss_pred EEEec-------CCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccc-ccc------------------ Confidence 33332 22234555555444443332 2233444444444444443222211 110 Q ss_pred eeCCCCcccccccccchhhh-hcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCChhhccCCCcceeccc Q lcl|NC_015287. 313 YDASTGEIRDDRKHMSMLED-FWLPRREGGRGTEITTLPGGQNLGE-LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGR 390 (563) Q Consensus 313 Yd~~TGev~d~~~~msMlED-ywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~ 390 (563) | .+|..-+..+....-.+ .|. +..|..|..++.. ++.. ++-++-.-..+...-++|..-|.... -|. . T Consensus 269 -d-~~g~~~~~~~~~~~~~~~~~~----~~~~~~~~q~~~~-~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~--~N~-S 338 (456) T protein:vir:10 269 -D-ENGNAIDYASIFEAAPGALWE----LPPGVDIWESQAN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDS--ANQ-S 338 (456) T ss_pred -c-ccccccchhhhhhhhcccccc----CCCCcceEEeccc-ChhHHHHHHHHHHHHHHhccCCChHHhcccc--cCh-H Confidence 0 01110000000000000 131 1134556667653 3333 33466777777888899988884321 122 2 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 391 SSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAA 470 (563) Q Consensus 391 ~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~ 470 (563) +..|.--+..+-.-+.+.|+.|..-+.+.+|.-+.+.|.. ++ ..+++.|..-..=+. .+.++++.. T Consensus 339 g~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~---~~----~~~~v~w~~~~~~~~-------~~~ada~~k 404 (456) T protein:vir:10 339 AEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES---VE----DTVDVSFESPDRVTL-------GEKYSAASL 404 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC---cc----cceeEEecCCCCcCH-------HHHHHHHHH Confidence 3344555555777788899999999999999888888843 12 357888855332222 234555555 Q ss_pred hhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcC----CCCCCCcccCccccccc Q lcl|NC_015287. 471 VDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKD----GKLADPMEVQQLQLGVH 526 (563) Q Consensus 471 ~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~----~~~~~p~~~~~~~~~~~ 526 (563) +..- | ..|.. ....+|.++++++++++.+..+|... ++.+.|. ++.. - T Consensus 405 l~~~-g-i~~~~-~~~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~~~----~~~~-~ 456 (456) T protein:vir:10 405 AKAA-G-ESWAS-IRRNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQ----EDGS-R 456 (456) T ss_pred HHHc-C-CChHH-HHHhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhcCC----CCCC-C Confidence 4322 2 34444 55678999999887544333322221 1111110 0000 0 No 159 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=95.93 E-value=0.0012 Score=36.53 Aligned_cols=423 Identities=11% Similarity=0.072 Sum_probs=176.3 Q ss_pred ccccCcCCCCcccce---ecc------cccccccc---c-cchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEecCC Q lcl|NC_015287. 20 VSPVPQSNEDSSDYY---VSS------GFYGQYVD---I-DGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSDQN 86 (563) Q Consensus 20 ~s~~p~~~~d~~~~~---~~~------~~~~~~~~---~-~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d~~ 86 (563) .++..| ++-..-. ..+ .+..||-+ + .-......++-+..+ ...+.=+.-+|+..+.= -. T Consensus 1 ~~~~t~--~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~-k~~~n~~~~ivd~~~~~-----l~ 72 (456) T protein:vir:10 1 MTASTP--AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQR-EARTNWGLMVRDSVADR-----II 72 (456) T ss_pred CCCCCH--HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhh-hhhcchHHHHHHHHHhh-----hc Confidence 111111 0000000 000 00111111 0 000000111111111 12233334444444431 12 Q ss_pred CcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceee Q lcl|NC_015287. 87 DSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLV 166 (563) Q Consensus 87 ~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~v 166 (563) .+||.+...+ .. ...+++..|.+..+|+....++++.-++.||-|.+.-.| .+|-..++.+||+.+-.| T Consensus 73 ~~~~~~~~~~---d~----~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d----~~g~~~i~~~~p~~~~~i 141 (456) T protein:vir:10 73 PNGITVGGSA---DS----DLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR----DDGTATITADSPETMVVS 141 (456) T ss_pred cCCeecCCCC---Cc----chHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeC----CCCceEEEEEccceeEEE Confidence 3466554321 01 122445666777889999999999999999988764333 467778889999987666 Q ss_pred eeeeccCCcccccee-ee--ccccccc------ccccccce----eeecCccccccccccccCcCccccCcceeeecccc Q lcl|NC_015287. 167 RKLKTDNSLRGAINQ-VN--SKNPADI------ENPEIEEY----YQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDA 233 (563) Q Consensus 167 R~~~~~~~~~~~i~~-~~--~~~~~~~------~~~~~~ey----f~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~da 233 (563) .- ++.+..+.. ++ ....... .-..+..| +.|........... ++....-+..+-.+..=+ T Consensus 142 ~d----~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~p 214 (456) T protein:vir:10 142 VD----PLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRI---SDSWVPVGDAVVTGSPPP 214 (456) T ss_pred Ec----CCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeec---CCceeeccccCCCCCcee Confidence 32 111111111 11 0000000 00001111 11111100000000 000000000000111113 Q ss_pred eeeeecceecCCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeE Q lcl|NC_015287. 234 ISFCHSGLVDRNKQTILSYLHKSIKALNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLV 312 (563) Q Consensus 234 I~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m-~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklv 312 (563) |+|.+ +....|=++..+.....+.. +=|.++.-...--|.|-+.-.+.+. +.. T Consensus 215 vv~~~-------N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~-~~~------------------ 268 (456) T protein:vir:10 215 VVVYQ-------NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGL-PNV------------------ 268 (456) T ss_pred EEEec-------CCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccc-ccc------------------ Confidence 33332 22234555555444443332 2233444444444444443222211 110 Q ss_pred eeCCCCcccccccccchhhh-hcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCChhhccCCCcceeccc Q lcl|NC_015287. 313 YDASTGEIRDDRKHMSMLED-FWLPRREGGRGTEITTLPGGQNLGE-LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGR 390 (563) Q Consensus 313 Yd~~TGev~d~~~~msMlED-ywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~ 390 (563) | .+|..-+..+....-.+ .|. +..|..|..++.. ++.. ++-++-.-..+...-++|..-|.... -|. . T Consensus 269 -d-~~g~~~~~~~~~~~~~~~~~~----~~~~~~~~q~~~~-~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~--~N~-S 338 (456) T protein:vir:10 269 -D-ENGNAIDYASIFEAAPGALWE----LPPGVDIWESQAN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDS--ANQ-S 338 (456) T ss_pred -c-ccccccchhhhhhhhcccccc----CCCCcceEEeccc-ChhHHHHHHHHHHHHHHhccCCChHHhcccc--cCh-H Confidence 0 01110000000000000 131 1134556667653 3333 33466777777888899988884321 122 2 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 391 SSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAA 470 (563) Q Consensus 391 ~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~ 470 (563) +..|.--+..+-.-+.+.|+.|..-+.+.+|.-+.+.|.. ++ ..+++.|..-..=+. .+.++++.. T Consensus 339 g~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~---~~----~~~~v~w~~~~~~~~-------~~~ada~~k 404 (456) T protein:vir:10 339 AEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES---VE----DTVDVSFESPDRVTL-------GEKYSAASL 404 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC---cc----cceeEEecCCCCcCH-------HHHHHHHHH Confidence 3344555555777788899999999999999888888843 12 357888855332222 234555555 Q ss_pred hhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcC----CCCCCCcccCccccccc Q lcl|NC_015287. 471 VDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKD----GKLADPMEVQQLQLGVH 526 (563) Q Consensus 471 ~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~----~~~~~p~~~~~~~~~~~ 526 (563) +..- | ..|.. ....+|.++++++++++.+..+|... ++.+.|. ++.. - T Consensus 405 l~~~-g-i~~~~-~~~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~~~----~~~~-~ 456 (456) T protein:vir:10 405 AKAA-G-ESWAS-IRRNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQ----EDGS-R 456 (456) T ss_pred HHHc-C-CChHH-HHHhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhcCC----CCCC-C Confidence 4322 2 34444 55678999999887544333322221 1111110 0000 0 No 160 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=95.58 E-value=0.0018 Score=35.62 Aligned_cols=402 Identities=13% Similarity=0.095 Sum_probs=171.9 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceecccccccc-ccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcc Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQY-VDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINE 79 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~-~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvne 79 (563) ...||+- .. ... +..+....+ ...+++.... ..+. . ....++|-|..||+-|.+. T Consensus 3 f~~~f~r----~~---~~~-~~~~~~~~~----~~~~~~~~~~g~~v~-----~-------~~~l~~~~v~~~i~~Ia~~ 58 (413) T protein:vir:48 3 FSGLFQR----KS---DAP-VTTPAELAE----AIGLSYDTYTGKRIS-----S-------QRAMRLTAVYSCVRVLAES 58 (413) T ss_pred cchhhcc----Cc---cCC-ccchHHHHH----hhhcCcccccCceec-----h-------hhhhccHHHHHHHHHHHHh Confidence 2334332 11 101 111111111 1112221111 1111 0 2235689999999999996 Q ss_pred eEEecCCCcEEEEEeeccccchHHHHHH-HHHHHHHHHH-hcchhh----hHHHHHHHhhhhhhheeeeecCCCCCccee Q lcl|NC_015287. 80 AIVSDQNDSPVEIDLENLPASKQLKDLI-REEFKTVKEI-LNFDDK----CHEILRNWYVDGRIYYHKVIDIKAPEEGIK 153 (563) Q Consensus 80 aiv~d~~~~pV~l~l~~~~~s~sik~~I-~eeF~~i~~l-l~F~~~----~~~~~R~WYvDGri~fhkviD~~~~k~GI~ 153 (563) +.-+ |+.+-=.+ +..++.+ .....++|+. =+-... -+.++..++++|.-|..++-+ ...+. T Consensus 59 iA~~-----p~~~~~~~----~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~----~g~~~ 125 (413) T protein:vir:48 59 VGML-----PCSLYKIS----GTLKTRVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA----LGEVV 125 (413) T ss_pred hhhC-----ceEEEEec----CCcceeecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC----CCcEE Confidence 5422 23321111 1111111 1122233321 112223 345566667778887765443 24589 Q ss_pred EEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeee-cCccccccccccccCcCccccCcceeeeccc Q lcl|NC_015287. 154 EVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQY-DPSQTQGKNALGAIGQTPFSTKQRPVKIAPD 232 (563) Q Consensus 154 eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y-~~~~~~~~~~~g~~g~~~~~~~~~~iki~~d 232 (563) +|.+|+|..|..... .+ + ..+|.+ .+. .....++.+ T Consensus 126 ~L~~l~~~~v~~~~~---~~---~------------------~~~y~~~~~~-------------------g~~~~~~~~ 162 (413) T protein:vir:48 126 ELLPIDPGCVEPKLN---SQ---W------------------QPVYQVTFPD-------------------GSVDVLTQD 162 (413) T ss_pred EEEEEcCceEEEEEc---CC---c------------------eEEEEEEecC-------------------ceEEEEccc Confidence 999999987764311 10 0 011222 110 112345566 Q ss_pred ceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeE Q lcl|NC_015287. 233 AISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLV 312 (563) Q Consensus 233 aI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklv 312 (563) .|.|... +..++-.+.|-|+.|.+.+.....+++...-+---.+.-+-++.++ +.+.+..+++-.+.+...|+.- T Consensus 163 evih~~~--~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~~~~e~~~~~~~~~~~~~~g~-- 237 (413) T protein:vir:48 163 EIWHVRT--LTLDGLVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTE-QKLTPDAYERLKKDFEERHTGL-- 237 (413) T ss_pred cEEEecC--cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeC-CCCCHHHHHHHHHHHHHHhcCc-- Confidence 6655432 3445556789999999999988888877765555556667888887 5677766666666555555421 Q ss_pred eeCCCCcccccccccchhhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCChhhccCCCcceecccc Q lcl|NC_015287. 313 YDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG-QNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRS 391 (563) Q Consensus 313 Yd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~ 391 (563) .+.|. .| .+ ..|.++..|.-. +.+.-++-.++....+.++++||..-|...+++ +. T Consensus 238 --~n~g~------~~-vl----------~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~-t~--- 294 (413) T protein:vir:48 238 --GNAHR------PM-IL----------EMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRA-TF--- 294 (413) T ss_pred --cccCc------ce-ec----------CCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCC-Cc--- Confidence 11221 11 11 134566666321 222224455688899999999999999532211 11 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015287. 392 SEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAV 471 (563) Q Consensus 392 ~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~ 471 (563) +.+.-..+.|.++ .|+- +...+.+. +-++++++.++.. .+|+|++ + +|... -+..|.++++.+ T Consensus 295 ~n~e~~~~~f~~~--~i~P-~~~~ie~~-----l~~~L~~~~~~~~--~~~~fd~--~----~l~~~-d~~~~~~~~~~~ 357 (413) T protein:vir:48 295 NNIEELGLGFINY--SLVP-YLTRIEQR-----INTGLVRESKQGK--FYAKFNA--G----ALLRG-DMKSRFEAYATG 357 (413) T ss_pred ccHHHHHHHHHHH--HHHH-HHHHHHHH-----HHhhccCccccCC--eEEEEec--h----hhhcc-CHHHHHHHHHHH Confidence 1122222335543 2221 11112222 3334555555532 3344433 2 22211 123445444332 Q ss_pred hhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCc Q lcl|NC_015287. 472 DPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGA 551 (563) Q Consensus 472 ~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~ 551 (563) -.- -+++.+-++. ++ ..++++..+.--.+.+. ......++.....++ .+..++..+ T Consensus 358 ~~~--g~~T~NE~R~-~~------------------g~~p~~ggD~~~~~~n~--~~~~~~~~~~~~~~~-~~~~~~~~~ 413 (413) T protein:vir:48 358 INW--GIYSPNDCRD-LE------------------DMNPRPGGDVYLTPMNM--TTSPSAGDDNGKKKE-SGDADKTAS 413 (413) T ss_pred HhC--CCcCHHHHHH-Hh------------------CCCCCCCcceeeccccc--cccccccccCCCCCC-CCCccccCC Confidence 111 2333333332 11 22222110000000000 000000000000000 000000000 No 161 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=95.43 E-value=0.0021 Score=35.28 Aligned_cols=459 Identities=12% Similarity=0.084 Sum_probs=158.9 Q ss_pred ccccccccCcCCCCcccceeccccccccccccchhhhH--HHHHHHHHHHhhccchhHHH-------------------- Q lcl|NC_015287. 16 NKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSE--FELIKRYREMALHPEVDSAI-------------------- 73 (563) Q Consensus 16 ~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~--~~LI~~YR~ma~~PEvd~Ai-------------------- 73 (563) -.+.+.++++..- +.+-. ..+.+- ...+++|+.+..+++=...| T Consensus 1 ~~~~~~~~~~~~~--------~~~~~------~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~ 66 (537) T protein:vir:78 1 MTSPLLNKPIDQL--------GGLLN------TEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYA 66 (537) T ss_pred CCcccccccHHHH--------HHHHH------HHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccc Confidence 1111111111000 00000 000000 12244555554444443322 Q ss_pred ----------HHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeee Q lcl|NC_015287. 74 ----------EDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVI 143 (563) Q Consensus 74 ----------~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkvi 143 (563) ..||+-.+-+ --..||.+..++- --+.+.+.+..+++ -+|++.-+++.+.+.+.|+-|.|.-+ T Consensus 67 ~nnki~~nf~k~Ivd~~~~y-l~G~Pv~~~~~d~-----~~~e~~~~l~~~~~-~~~~~~~~el~~~~s~~G~ay~~~y~ 139 (537) T protein:vir:78 67 SNVKISHGFFTELVDQLAQY-LLSNGVEVKVKDE-----DNTQLDEILQEYFD-EDFQATIDTLVTNASKKGFEGIFART 139 (537) T ss_pred cccccccchHHHHHHHHhhh-hcccCceeecCcc-----hhHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeeEEEeee Confidence 1122211111 2355777766541 11234455555554 47889999999999999999998777 Q ss_pred cCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeee--cccc----------cccccccccceeeecCcccccccc Q lcl|NC_015287. 144 DIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVN--SKNP----------ADIENPEIEEYYQYDPSQTQGKNA 211 (563) Q Consensus 144 D~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~--~~~~----------~~~~~~~~~eyf~y~~~~~~~~~~ 211 (563) |. +|-.....+||+.+..|.. .......-+.+-. .... ..++.++--.||.+...+...... T Consensus 140 de----~~~~~~~~i~p~~~~pv~d--~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~ 213 (537) T protein:vir:78 140 TS----EGKLKFQTVDGLTLIPVFD--DYGVLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYK 213 (537) T ss_pred cC----CCceEEEEEccceeEEEEc--CCCCceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCccccccc Confidence 63 5778899999999877732 1111011011000 0000 000111111222222211111100 Q ss_pred cc-ccCcCccccCcceeeec----ccc------eeeee-cceecC----CCCccchhHHHHHHHHHHHHHHH-HHHHHHH Q lcl|NC_015287. 212 LG-AIGQTPFSTKQRPVKIA----PDA------ISFCH-SGLVDR----NKQTILSYLHKSIKALNQLRMIE-DSLVIYR 274 (563) Q Consensus 212 ~g-~~g~~~~~~~~~~iki~----~da------I~y~h-SGl~d~----~~~~~~syL~~Aik~~NqL~m~E-DalVIyR 274 (563) .. .....+. .....+. .+. ....| .|-+.. |.....|=|+..+.....+.++= +..-..+ T Consensus 214 ~~~~~~~~~i---~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~ 290 (537) T protein:vir:78 214 LDEAYNPNPA---PHVLAIEESTDADFEDTDGYQVLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQ 290 (537) T ss_pred cccccccccc---ceeeeccccccccccccccccccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHH Confidence 00 0000000 0000000 000 00001 011110 12223455665555544443211 1111112 Q ss_pred HhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCC Q lcl|NC_015287. 275 LSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQN 354 (563) Q Consensus 275 i~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~n 354 (563) -++-|---+.-. +|+ +...++.-+..+.+..-.| .|..+..|--..+ T Consensus 291 ~~~~~ilvi~g~------------------------------~~~--~~~~~~~~l~~~~~i~v~~-d~~~v~~l~~~~~ 337 (537) T protein:vir:78 291 DFSEAIYVVKGF------------------------------SGD--STDKLRQNIKAKKMIGVNG-DNAGMEIQTVSIP 337 (537) T ss_pred HhcCceeeeecC------------------------------CCc--cchhHHHHHhhcCceeecC-CCCceeEEEecCC Confidence 222222111111 111 1111111122222111111 1122333332222 Q ss_pred cch-HHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_015287. 355 LGE-LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFT---KFVGRMRKRFSHLFNDMLKTQLILKNIV 430 (563) Q Consensus 355 Lge-i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~---KfI~rLr~rFs~if~d~Lk~qLiLkgI~ 430 (563) ..- -.-+.-..+.+|+-..+|-. . + . .+|.+|.+... .+|. .-+.+.++.|...|...|+.=+-+-++. T Consensus 338 ~~~~e~~ld~L~~~I~~~s~~~~~--~-~--~-~~gn~SGvAlk-~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~ 410 (537) T protein:vir:78 338 YEARKAKMDIDVENIYRSGMGFNS--T-A--V-GDGNVTNVVIK-SRYTLLAMKARKMETSLRKVLRWCADMVVSDIALR 410 (537) T ss_pred HHHHHHHHHHHHHHHHHhcCCCCC--c-c--c-cccCCcHHHHH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 211 11244445556665555532 1 2 1 13444443322 2222 2244455555555554444322221222 Q ss_pred ChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhc-- Q lcl|NC_015287. 431 TPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIK-- 508 (563) Q Consensus 431 t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~-- 508 (563) ...+|+ ...|.+.|.+.---.+...++++. .+. .+-.+|.+++++..-.-+|.|.+ +++++|.. T Consensus 411 ~~~~~d--~~~i~i~f~~~~P~n~~e~a~~~~-------~l~--~~giiS~eT~l~~~p~vdd~e~e---k~~~ee~~~~ 476 (537) T protein:vir:78 411 GLGEYD--SNDICFEIEPHVLANELDIATTRK-------TEA--ETEALKIGNIMTVAPRIGDDETL---KLIAEELDLD 476 (537) T ss_pred CCcccc--cceeeEEeccCCCCCHHHHHHHHH-------HHH--hcCcchHHHHHHhCCCCCCHHHH---HHHHHHHHhh Confidence 222333 246778887755444433333322 111 12468899998875444443322 22222211 Q ss_pred CCCCCCCcccC---cccccccccc-----CCCCc---CCCCCcccccccCCCCcccCCCCC Q lcl|NC_015287. 509 DGKLADPMEVQ---QLQLGVHPEQ-----MPGGA---MNPQDPNMGGAPQEGGAEQAPQMP 558 (563) Q Consensus 509 ~~~~~~p~~~~---~~~~~~~~~~-----~~~~~---~~p~~~~~~~~~~~~~~~~~~~~~ 558 (563) ..-+++...++ ..+..+.... ...+. .+|.+|.-++.+.||.-..|-|.. T Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 537 (537) T protein:vir:78 477 YNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVDPNQPVADPNVVPPTDPNAVPQT 537 (537) T ss_pred hhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCCCCCccCCCCCCCCCCCCCCccCCCC Confidence 00111000000 0001111111 11111 223333333333333222222221 No 162 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=95.27 E-value=0.0024 Score=34.95 Aligned_cols=429 Identities=9% Similarity=-0.025 Sum_probs=174.1 Q ss_pred ccchhhHhhhccccc----cccc--CcCC-------------CCcccceeccccccccccccchhhhHHHHHHHHHHHhh Q lcl|NC_015287. 5 FGFKIQKDDEQNKNV----VSPV--PQSN-------------EDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMAL 65 (563) Q Consensus 5 ~~f~~~~~~~~~~~~----~s~~--p~~~-------------~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~ 65 (563) ||+ +.+--...+.. +... +.+. ++.......|+-...... .+..- .=+..++ T Consensus 1 M~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~-~g~~v-------~~~~a~~ 71 (466) T protein:vir:81 1 MRL-IDRLLSTRGAAPRMSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTELAPD-TFVGL-------ATQAYQA 71 (466) T ss_pred Cch-hHHHhhccCcccccchhhhhhhhhhhhccccccccccccHHHHHhhccccccccCc-ccccc-------chhhhhc Confidence 333 11111111111 0000 0000 000000011111110000 00000 1123466 Q ss_pred ccchhHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHH-HHHHHHHhcchhhhHHHH----HHHhhhhhhhee Q lcl|NC_015287. 66 HPEVDSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREE-FKTVKEILNFDDKCHEIL----RNWYVDGRIYYH 140 (563) Q Consensus 66 ~PEvd~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~ee-F~~i~~ll~F~~~~~~~~----R~WYvDGri~fh 140 (563) +|-|..||+-|.+.+.. -|+.|-=++ +.-++.+.+. ...+++-=+-...+++++ ..+...|-=|.. T Consensus 72 ~~~v~~~i~~Ia~~ia~-----lp~~~~~~~----~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~ 142 (466) T protein:vir:81 72 NGPVFACMLVRQLVFSS-----VRFRWQRLR----DGKPSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWT 142 (466) T ss_pred cHHHHHHHHHHHHhhcc-----CceEEEEec----CCceeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEE Confidence 89999999999986543 233332111 1111111111 111222222334555554 444566887888 Q ss_pred eeecCC-----CCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCcccccccccccc Q lcl|NC_015287. 141 KVIDIK-----APEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAI 215 (563) Q Consensus 141 kviD~~-----~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~ 215 (563) ++.+.. ....-+++|..|+|-.+..++.. + . . ..-+|.|+..+. T Consensus 143 i~r~~~g~l~~~~~g~~~~l~~l~~~~v~~~~~~--~-~----~---------------~~~~y~~~~~~~--------- 191 (466) T protein:vir:81 143 IVDGEFVRMRPDWVDVVVEERMVRGGRGELGGGQ--L-G----W---------------RKVGYLYTEGGR--------- 191 (466) T ss_pred EEecCccccccccCcceeEEEEecCcceEEEEcC--C-C----c---------------eEEEEEEEecCc--------- Confidence 766532 11122689999999776655211 1 0 0 011233322110 Q ss_pred CcCccccCcceeeecccceeeeecceec-CCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchH Q lcl|NC_015287. 216 GQTPFSTKQRPVKIAPDAISFCHSGLVD-RNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKM 294 (563) Q Consensus 216 g~~~~~~~~~~iki~~daI~y~hSGl~d-~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~ 294 (563) ........++.+.|.|.. ++.+ .++-.+.|.+..|++++.....+++...=+=---+--.-|+..+ +.|.+. T Consensus 192 -----~~~~~~~~~~~~dviHir-~~~~~~d~~~G~s~i~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~-~~l~~e 264 (466) T protein:vir:81 192 -----QSGNESVGFLAEDVVHFA-PIPDPLASYRGMSWLTPILREIRADQAMSKHQAKFFDNGATVNLVIKHN-PMADPA 264 (466) T ss_pred -----ccccceeeeccccEEEEc-CCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecC-CCCCHH Confidence 011223456777774432 2222 34445689999999999888877766543333334445566666 457776 Q ss_pred HHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCC-CcchHHHHHHHHHHHHHhcC Q lcl|NC_015287. 295 KAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQ-NLGELSDIEYFQKKLYRSLG 373 (563) Q Consensus 295 KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~-nLgei~DV~YF~~kLy~aL~ 373 (563) .+++..+.+...|+-- .+.|.+ + .++ -|.+++.|.-.. .+.-++-.++..+.+.++.+ T Consensus 265 ~~~~~~~~~~~~~~g~----~n~g~~------~-vl~----------~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fg 323 (466) T protein:vir:81 265 AVKKWADEVNSKHAGV----DNAWKN------L-NLY----------PGADADVVGSNLQEIDFKNVRGGGETRIAAAAG 323 (466) T ss_pred HHHHHHHHHHHHhcCc----cccccc------e-EcC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhC Confidence 6666555555555420 112211 1 221 245666664322 22224455688999999999 Q ss_pred CChhhccCCCcceeccccc--hhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccc Q lcl|NC_015287. 374 VPESRIAGSGDGFNLGRSS--EILRDEIKFTKFV-GRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDN 450 (563) Q Consensus 374 VP~sRl~~~~~g~~~g~~~--eItRDElKF~KfI-~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn 450 (563) ||...|.-.. ..++++ .|...-+.|.+++ .-+-.++.. .|-..|+. ..+ ...+.++|..+. T Consensus 324 VPp~~lG~~~---~~~~st~sn~eq~~~~f~~~tl~P~~~~ie~----~l~~~L~~-----~~~----~~~~~~~f~~~~ 387 (466) T protein:vir:81 324 VPPVIVGLSE---GLAAATYSNYGQARRRLADGTAHPLWQNLSG----CIGHVMPD-----MGP----DVRLWYDADDVP 387 (466) T ss_pred CCHHHccccc---CCCccccccHHHHHHHHHHHHHHHHHHHHHH----HHHhhcCC-----ccc----CcceEEEecchh Confidence 9999994211 122222 1222223355543 444444433 33333332 222 123455665433 Q ss_pred hHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccC Q lcl|NC_015287. 451 HFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQM 530 (563) Q Consensus 451 ~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~ 530 (563) .-.. -++.|.++......+ ...++... .|..|+-+ + .+.++..-.. T Consensus 388 llr~-----d~~~r~~~~~~~~~~------~~~~~~~g--~t~nE~r~----~-----------------~~~gd~~~~~ 433 (466) T protein:vir:81 388 FLRE-----DEKDAADIQKVRAET------INTLITAG--YEPESVVA----A-----------------VNSGDLRLLK 433 (466) T ss_pred hhcc-----CHHHHHHHHHHHHHH------HHHHHHcC--CChhhccc----c-----------------ccCCcccccc Confidence 2111 123344332111111 11222221 13322221 0 0001110001 Q ss_pred CCCcCCCCCcccccccCCCCcccCCCCCCCCCC Q lcl|NC_015287. 531 PGGAMNPQDPNMGGAPQEGGAEQAPQMPEGGEI 563 (563) Q Consensus 531 ~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 563 (563) +.+ +.|.+. .++.+..+++...+-.+||+- T Consensus 434 ~~~-~~~~~~--~~~~~~~~~~~~~~~~~Gg~~ 463 (466) T protein:vir:81 434 HTG-LTSVQL--LPPGVSASASSDTPTSGGADD 463 (466) T ss_pred CCC-cchhhh--cccccccccCCCCcccCCCCc Confidence 111 112211 111111112222223445555 No 163 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=95.24 E-value=0.0025 Score=34.89 Aligned_cols=409 Identities=11% Similarity=0.097 Sum_probs=167.2 Q ss_pred cccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceE Q lcl|NC_015287. 2 AKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAI 81 (563) Q Consensus 2 ~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneai 81 (563) --|++| |++........ .. ..+++. .......|..+.. +..+++|.|..||+.|.+.+- T Consensus 1 m~~~~~-~~~~~~~~~~~----~~-~~~~~~-~~~~~~~g~~v~~--------------~~al~~~~v~~~i~~ia~~ia 59 (419) T protein:vir:57 1 MFIPQF-WKGRPSENRVN----WQ-VVPGGM-RSSSSQAGVIITP--------------ETALALSAVRACVTLLAESVA 59 (419) T ss_pred Ccchhh-hccCCcccccc----cc-cccccc-ccccccCCceech--------------HHhhccHHHHHHHHHHHHhhc Confidence 112222 22221111110 00 000100 0001111111111 223567889999999999654 Q ss_pred EecCCCcEEEEEeeccccchHHHHHHH-HHHHHHHH-Hhcchhhh----HHHHHHHhhhhhhheeeeecCCCCCcceeEE Q lcl|NC_015287. 82 VSDQNDSPVEIDLENLPASKQLKDLIR-EEFKTVKE-ILNFDDKC----HEILRNWYVDGRIYYHKVIDIKAPEEGIKEV 155 (563) Q Consensus 82 v~d~~~~pV~l~l~~~~~s~sik~~I~-eeF~~i~~-ll~F~~~~----~~~~R~WYvDGri~fhkviD~~~~k~GI~eL 155 (563) -. |+.+- ..+.=. -++.+. .-...+|+ --+-...+ +.++..+++.|.-|..++.|.. .-+++| T Consensus 60 ~l-----p~~~~-~~~~~g--~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~---G~~~~L 128 (419) T protein:vir:57 60 QL-----PCVLY-RRTENG--GREIAFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGR---GDITEL 128 (419) T ss_pred cC-----ceEEE-EEcCCC--ceeccccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC---CcEEEE Confidence 22 33331 110000 011111 11222222 12223333 4455667788998888776643 338999 Q ss_pred EEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeeccccee Q lcl|NC_015287. 156 RYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAIS 235 (563) Q Consensus 156 r~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~ 235 (563) .+|+|..|...+ .+ ++ ..||.+.-. + -.++.+.|. T Consensus 129 ~pl~~~~v~v~~---~~----~g-----------------~~~y~~~~~-----------~----------~~~~~~~vi 163 (419) T protein:vir:57 129 IPINPHKVIVLK---GP----DG-----------------MPYYDIPSI-----------G----------EILPMRMVH 163 (419) T ss_pred EEEcCcceEEEE---CC----Cc-----------------eEEEEEcCC-----------c----------eEEchhhEE Confidence 999998876542 11 01 124444210 0 125566664 Q ss_pred eeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeC Q lcl|NC_015287. 236 FCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDA 315 (563) Q Consensus 236 y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~ 315 (563) |.. .+..++-.+.|.+..|.+.+....-+++...=+----+--+-+...+.. +.....++-...+..++.++.- T Consensus 164 h~r--~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-~~~~~~~e~~~~~~~~~~~~~~--- 237 (419) T protein:vir:57 164 HIK--SFSLDGYIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFE-AKAIASQAAVDAILAKWTERYG--- 237 (419) T ss_pred Eec--CcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCc-CCcccCHHHHHHHHHHHHHHhc--- Confidence 443 3445566678999999999888887777665443333444455555421 1111222223344444433321 Q ss_pred CCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHH---HHHHHHHHHHHhcCCChhhccCCCcceeccccc Q lcl|NC_015287. 316 STGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELS---DIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSS 392 (563) Q Consensus 316 ~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~---DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~ 392 (563) | ..+..+.+ .++ .|+++..|- .+.-++. =.++..+.++++++||...|...+.+ + .+ T Consensus 238 --g-~~nag~~~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~-t---~s 297 (419) T protein:vir:57 238 --G-VRNAFSVG-MLQ----------EGMTYKQLS--QDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKS-T---NN 297 (419) T ss_pred --c-ccccccce-ecC----------CCceEEEcC--CChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC-c---cc Confidence 1 11111111 111 245666554 2333333 33566688999999999999532211 1 11 Q ss_pred hhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015287. 393 EILRDEIKFTKF-VGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAV 471 (563) Q Consensus 393 eItRDElKF~Kf-I~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~ 471 (563) -+.-.-+-|.++ |.-+..++... +-+.++++.++. ...+.|..+... . .-+..|++.++.+ T Consensus 298 n~e~~~~~f~~~~l~P~~~~ie~~---------l~~~ll~~~~~~----~~~i~fd~~~ll----~-~d~~~~~~~~~~~ 359 (419) T protein:vir:57 298 NIEHQGLQYVIYTMLAILKRHESA---------MMRDLLLPSERR----DFYIEFNVSSLL----R-GDQKSRYESYALG 359 (419) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHH---------HHhhccCccccC----CeEEEEechhhh----c-cCHHHHHHHHHHH Confidence 122222335444 23333322222 333444555443 344555543321 0 1123455555443 Q ss_pred hhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCc Q lcl|NC_015287. 472 DPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGA 551 (563) Q Consensus 472 ~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~ 551 (563) -.- -+ ||..|+-+ ...-+++ |+-+.-..+. +-.+ ...+.....+++.+.+.. T Consensus 360 ~~~--G~------------~T~NE~R~-------~~gl~p~--~ggD~~~~~~---n~~~--~~~~~~~~~~~~~~~~~~ 411 (419) T protein:vir:57 360 RQW--GW------------LSVNDIRR-------MENLTPI--PGGDKYLTPL---NMVD--SKALTGIGKATPQQLKDI 411 (419) T ss_pred HhC--CC------------cCHHHHHH-------HhCCCCC--CCcCeeeecc---cccc--ccccccccCCCcccCcch Confidence 211 12 44433331 1222222 2111000000 0000 000111111112222233 Q ss_pred ccCCCCCC Q lcl|NC_015287. 552 EQAPQMPE 559 (563) Q Consensus 552 ~~~~~~~~ 559 (563) +.....-| T Consensus 412 ~~~~~~~~ 419 (419) T protein:vir:57 412 EAILCTRN 419 (419) T ss_pred hhhhhccC Confidence 33333333 No 164 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=95.22 E-value=0.0025 Score=34.84 Aligned_cols=376 Identities=16% Similarity=0.124 Sum_probs=162.7 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) |.=|-.+.+.+.+.. +.+...+ ........|+..+.++. . +...++|-|..||+-|.+.+ T Consensus 1 Mg~~~~~~~~~~~~~-----~~~~~~~-~~~~~~~~~~~~~~~v~-------~-------~~al~~~~v~~~i~~ia~~i 60 (385) T protein:vir:10 1 MGLLTPRNFNKRKAK-----NMVYPSN-PAFFTTTVGGMQLSYVS-------A-------LSALQNTNVYSVINRIASDV 60 (385) T ss_pred Cccccchhccccccc-----ccccccc-hhhhhhhccccCccccC-------H-------HHhhccHHHHHHHHHHHHHH Confidence 553322322222111 1111111 11111112222221111 0 12356788999999999865 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHH----HHHHhhhhhhheeeeecCCCCCcceeEEE Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEI----LRNWYVDGRIYYHKVIDIKAPEEGIKEVR 156 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~----~R~WYvDGri~fhkviD~~~~k~GI~eLr 156 (563) .-. |+.+ .+-. ...+++-=+-...++++ +..+.++|.-|..++-| ..++. T Consensus 61 a~~-----p~~v--~~~~------------~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~-------~~~~~ 114 (385) T protein:vir:10 61 ASA-----HFKT--ENTA------------TLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ-------NLEHI 114 (385) T ss_pred hhC-----ceee--eccc------------hhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC-------ceeEe Confidence 543 3333 1111 12233322233344444 44666889988876654 36788 Q ss_pred EeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceee Q lcl|NC_015287. 157 YIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISF 236 (563) Q Consensus 157 ~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y 236 (563) ++||-.|+.++. . ++. -|+.+... +...+.++.+.|.| T Consensus 115 p~~~~~v~~~~~-----~--~~~-----------------~~~~~~~~------------------~~~~~~~~~~eiih 152 (385) T protein:vir:10 115 PNSDVQINYLPG-----N--MGI-----------------VYTVLESN------------------DRPQMVLRQDQMLH 152 (385) T ss_pred ecCCceEEEEEc-----C--Cce-----------------EEEEEEcC------------------CceEEEEccccEEE Confidence 888877665421 0 010 11111110 11235677777755 Q ss_pred eecceecC---CCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEe Q lcl|NC_015287. 237 CHSGLVDR---NKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVY 313 (563) Q Consensus 237 ~hSGl~d~---~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvY 313 (563) .. .+.+ ++-.+.|.|..|+++++....+++...=+----+--+-+..++.+-..+..++ =+++-+.+.... T Consensus 153 ik--~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~-~~~~~~~~~~~~--- 226 (385) T protein:vir:10 153 FR--LMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLE-SAREEFEKANTG--- 226 (385) T ss_pred ec--cCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHH-HHHHHHHHHhCc--- Confidence 43 2222 23346799999999999999998876655445556677777774444444433 334334433221 Q ss_pred eCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch-HHHH-HHHHHHHHHhcCCChhhccCCCcceecccc Q lcl|NC_015287. 314 DASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDI-EYFQKKLYRSLGVPESRIAGSGDGFNLGRS 391 (563) Q Consensus 314 d~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV-~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~ 391 (563) .+.|.+ + .++ .|.+++.|.-...-.+ +.+. +|-.+.+.++++||...|.....+ +. .. T Consensus 227 -~n~~~~------~-vl~----------~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~-~~-~~ 286 (385) T protein:vir:10 227 -DNSGRL------M-VLP----------DGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTST-ES-QH 286 (385) T ss_pred -cccCCc------c-ccC----------CCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCC-Cc-cc Confidence 122322 1 221 2566777643322222 3233 555788999999999999532211 11 11 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015287. 392 SEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAV 471 (563) Q Consensus 392 ~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~ 471 (563) +.+.....-|. ..|+--+. .+.+.|-..|+ ++ .|+|++. ++.... +..|++.++.+ T Consensus 287 sn~eq~~~~~~---~~l~P~~~-~ie~~l~~~l~-----~~--------~~~f~~~------~ll~~d-~~~~~~~~~~~ 342 (385) T protein:vir:10 287 SNIDQIKATYL---ANLNSYVN-PIVDELRLKMN-----AP--------DLELDIK------DMLDVD-DSALINQVSNL 342 (385) T ss_pred ccHHHHHHHHH---HHHHHHHH-HHHHHHHHhhC-----Cc--------eEEeech------hhhccC-HHHHHHHHHHH Confidence 22322222233 34543222 33333444332 21 2333322 222121 23444444332 Q ss_pred hhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCC Q lcl|NC_015287. 472 DPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQ 538 (563) Q Consensus 472 ~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~ 538 (563) -.- -+ +|..|+-++ .+..++|..+ .+......+...+|+.+=+ T Consensus 343 ~~~--G~------------~T~NE~R~~-------~g~~p~p~~~---~~~~~~~~~~~~~g~~~dn 385 (385) T protein:vir:10 343 AKS--GV------------LGAEQAQFI-------LTRSGFLPDN---LPEFKPLTTQVKGGDEGDN 385 (385) T ss_pred HhC--CC------------cCHHHHHHH-------hCCCccCCCC---CccccCcccccCCCCCCCC Confidence 211 12 344333211 1112332111 1111000111111111111 No 165 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=95.12 E-value=0.0027 Score=34.65 Aligned_cols=337 Identities=13% Similarity=0.196 Sum_probs=144.0 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceecccc--ccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhc Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGF--YGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIIN 78 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~--~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvn 78 (563) |+=|=.| ++ ++..+ | ++.-.....+|. .+.++.- +.-+++|-|..||+-|.+ T Consensus 1 M~~~~~f--~~-----r~~~~--~---~~~~~~~~~~~~~~~~~~v~~--------------~~al~~~av~~cv~~ia~ 54 (359) T protein:vir:10 1 MSILNPF--ER-----RSSIT--P---NNYYPFMVQNGSIVPNSLVDA--------------TEALKNSDLYAVTSLISS 54 (359) T ss_pred Ccccchh--hc-----cccCC--C---CcchhhhhccccccCCcccCH--------------HHhhcchHHHHHHHHHHH Confidence 6544224 11 11111 1 111111122221 1212211 113467889999999988 Q ss_pred ceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHH----HHhhhhhhheeeeecCCCCCcceeE Q lcl|NC_015287. 79 EAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILR----NWYVDGRIYYHKVIDIKAPEEGIKE 154 (563) Q Consensus 79 eaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R----~WYvDGri~fhkviD~~~~k~GI~e 154 (563) .+.=. .+.+ .. .+..+++-=|-...+.++.+ .++.+|--|..++.|.. .-+++ T Consensus 55 ~ia~~---------p~~~----~~-------~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~---g~~~~ 111 (359) T protein:vir:10 55 DIAGT---------RFIG----NQ-------VFTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDN---SLMKE 111 (359) T ss_pred hhhcC---------cccc----ch-------HHHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEECCC---CeEEE Confidence 54321 1111 11 12222322233345555444 44566888887766532 22689 Q ss_pred EEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccce Q lcl|NC_015287. 155 VRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAI 234 (563) Q Consensus 155 Lr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI 234 (563) |.+|+|-.+... ..+ +.+ +|.|... . +.....++.+.| T Consensus 112 l~~l~~~~v~i~---~~~----~~~------------------~y~~~~~----~-------------~~~~~~~~~~ev 149 (359) T protein:vir:10 112 LRLIPSNAITID---LTD----DTL------------------TYEVNQF----D-------------DYPSAKYNASEM 149 (359) T ss_pred EEEeCCceEEEE---EcC----CeE------------------EEEEEec----C-------------CceEEEEcccce Confidence 999998776532 111 111 1222110 0 011244566666 Q ss_pred eee-ec--ceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhccee Q lcl|NC_015287. 235 SFC-HS--GLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKL 311 (563) Q Consensus 235 ~y~-hS--Gl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knkl 311 (563) .|. +. +....++-.++|.|+.|++++.....+++...-+=---+--+-+..++-|.+.+..+++.-+ -..++.. - T Consensus 150 ih~~~~~~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~-~~~~~~~-~ 227 (359) T protein:vir:10 150 IHVKIMAYGVDTLHNLVGHSPLESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQGTLSSEAKDSIRK-EFEKANG-G 227 (359) T ss_pred EEeccCCCCCCccCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHH-HHHHHhC-c Confidence 433 11 11123455678999999999998888887654332222334567777777776655544333 2333321 0 Q ss_pred EeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCChhhccCCCcceec Q lcl|NC_015287. 312 VYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE---LSDIEYFQKKLYRSLGVPESRIAGSGDGFNL 388 (563) Q Consensus 312 vYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~ 388 (563) .+.|++- .|+ .|.+++.|. .+.-+ ++-.+|-...+.++++||...|..-+. T Consensus 228 ---~n~g~~~-------vl~----------~g~~~~~l~--~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~---- 281 (359) T protein:vir:10 228 ---NNSGRVM-------VLD----------QSADFSTVS--INADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGD---- 281 (359) T ss_pred ---cccCCce-------ecC----------CCcceeeec--CCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCc---- Confidence 1222221 221 245666653 33333 234566678899999999999942111 Q ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHH-- Q lcl|NC_015287. 389 GRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLG-- 466 (563) Q Consensus 389 g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~-- 466 (563) ..+ ..+.-|-.+..|+...-..+..-+...|-+++.+ +. ...+. |.. ++....+. ..++ T Consensus 282 ~~~-~~~~~e~~~~~~l~~~l~p~~~~l~~~l~~~~~~---------~~-~~~~~--~d~-----~~~~~~~~-~~~~~G 342 (359) T protein:vir:10 282 QQS-SLDQIKDLYVNALNRFIEPLISELRIKCDSSIGV---------DM-SPITD--YSN-----SVFKADIL-NWVKEG 342 (359) T ss_pred ccc-cHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcc---------cc-hhhhh--cCH-----HHHHHHHH-HHHhCC Confidence 111 1111122233332221111111122212111111 00 00111 111 11111111 1111 Q ss_pred --H------HHHhhhhh Q lcl|NC_015287. 467 --V------VAAVDPYV 475 (563) Q Consensus 467 --~------~~~~~p~v 475 (563) + +-.+.|+. T Consensus 343 ~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 343 IIEPTEAKTLLESKGII 359 (359) T ss_pred CcCHHHHHHHhCCCCCC Confidence 1 11234443 No 166 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=95.06 E-value=0.0028 Score=34.54 Aligned_cols=379 Identities=13% Similarity=0.140 Sum_probs=154.2 Q ss_pred cccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceE Q lcl|NC_015287. 2 AKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAI 81 (563) Q Consensus 2 ~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneai 81 (563) -.||++ .... ...|+....+......+.+.+...+-.. + .-+..+++|-|..||+.|.+.+. T Consensus 1 M~~f~~----~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~-v--------~~~~~~~~~~v~~~i~~ia~~ia 62 (386) T protein:vir:48 1 MPIFNI----TNLA-----TESPPISQGGFFDITDPDFLSTLNGSEW-V--------SAESALRNSDLFSIINQLSNDLA 62 (386) T ss_pred Cccccc----cccc-----ccccccccccccccccchhcccccCCce-e--------chhhhhcchHHHHHHHHHHHhhc Confidence 123332 1111 1112222211111111111111111000 0 11223578999999999999765 Q ss_pred EecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHH----HHHHhhhhhhheeeeecCCCCCcceeEEEE Q lcl|NC_015287. 82 VSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEI----LRNWYVDGRIYYHKVIDIKAPEEGIKEVRY 157 (563) Q Consensus 82 v~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~----~R~WYvDGri~fhkviD~~~~k~GI~eLr~ 157 (563) -++ + .+.+ ...+.+++--+-...++++ ++.++++|.-|.-++.|.. .-+++|.+ T Consensus 63 ~~p-----~--~~~~------------~~~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~---g~~~~L~~ 120 (386) T protein:vir:48 63 TVK-----L--TASR------------KQLQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNEN---GRDMKWEY 120 (386) T ss_pred cCc-----e--eecc------------chhHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCC---CcEEEEEE Confidence 432 2 2221 1122333333444455554 5567788998888777642 23889999 Q ss_pred eCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeee Q lcl|NC_015287. 158 IDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFC 237 (563) Q Consensus 158 lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~ 237 (563) |+|..|..++.... + ..||.|.-.+. .. .....++.+.|.|. T Consensus 121 l~~~~v~v~~~~~~-----~------------------~~~y~~~~~~~-------~~--------~~~~~~~~~evih~ 162 (386) T protein:vir:48 121 LRPSQVSFNRLDNK-----D------------------GIYYNITFDDP-------RI--------PPKQHVPQGDVLHF 162 (386) T ss_pred ecCceeEEEEcCCC-----c------------------eEEEEEEecCc-------cc--------cceeEecCccEEEe Confidence 99988765532110 0 12333321110 00 11234555666433 Q ss_pred ecceecCCCC-ccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCC Q lcl|NC_015287. 238 HSGLVDRNKQ-TILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDAS 316 (563) Q Consensus 238 hSGl~d~~~~-~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~ 316 (563) =..+.++. .+.|.|..|++++.....+++...=+----+--+-+.-.+- .+.+...++ +++.... .. .+ T Consensus 163 --~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~-~~~~e~~~~-~~~~~~~----~~--~n 232 (386) T protein:vir:48 163 --KLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKG-GGLLDFKTK-LSRSRQA----MK--QM 232 (386) T ss_pred --cCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCCHHHHHH-HHHHHHH----hh--cC Confidence 23344443 45799999999999988888875544333333444555543 333333332 3322221 11 11 Q ss_pred CCcccccccccchhhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhh Q lcl|NC_015287. 317 TGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE-LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEIL 395 (563) Q Consensus 317 TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eIt 395 (563) .|. .+ .++ .|.++..|.-...-.| ++=.++....+.++++||..-|...++ + +.+. T Consensus 233 ~g~------~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~-----~~~e 289 (386) T protein:vir:48 233 QGG------PL-VLD----------DLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQGD-Q-----QSSL 289 (386) T ss_pred CCC------ce-ecC----------CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-c-----ccHH Confidence 222 11 111 2556666643222222 233466678999999999999953221 1 1122 Q ss_pred HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_015287. 396 RDEIKFTKFV-GRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPY 474 (563) Q Consensus 396 RDElKF~KfI-~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~ 474 (563) -..+.|.+++ .-+-..+...|..-| + +.-+.+ +...+..|. + T Consensus 290 ~~~~~~~~~~l~P~~~~ie~~l~~~l----~-----~~~~~~-----~~~~~~~d~-----------------------~ 332 (386) T protein:vir:48 290 EMSLDLYNKAVSRYLRPFLSELSQKL----S-----CDVDAD-----ILPAVDPTG-----------------------S 332 (386) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhh----c-----chhhcc-----hhhhhccCh-----------------------H Confidence 2223454443 333333333332222 1 110000 000011111 1 Q ss_pred hchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCC Q lcl|NC_015287. 475 VGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQD 539 (563) Q Consensus 475 vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~ 539 (563) .........++- .+....+..+.-...|.. |++....+. .......+|+.+.++ T Consensus 333 -----~~~~~~~~l~~~---g~~t~nE~r~~lg~~~~~--~~~~~~~~~-~~~~~~~gGd~~~~~ 386 (386) T protein:vir:48 333 -----NSVSRINSMVKS---GTLAQNQGLYILQQAEIL--PKELPEGEN-PNKTTLKGGEINGED 386 (386) T ss_pred -----HHHHHHHHHHhC---CCcCHHHHHHHhhcCCCC--CccchhhcC-CCCCccCCCCCCCCC Confidence 001111111110 111111212222222322 222111000 000011112211111 No 167 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=94.72 E-value=0.0037 Score=33.94 Aligned_cols=469 Identities=10% Similarity=0.060 Sum_probs=177.2 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccc---cchhhh--HHHHHHHHHHHhhccchh----- Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDI---DGVFKS--EFELIKRYREMALHPEVD----- 70 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~---~~~~~~--~~~LI~~YR~ma~~PEvd----- 70 (563) |+.---|-.-.-..... ...+-+..|+ .+....+-...... -.++-+ ......+|+.+..+.+-+ T Consensus 1 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~----~~~~~~~~~~~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~ 75 (502) T protein:vir:48 1 MMEQTLFTDSTGQDLVL-NLRFHRESRI----RYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVL 75 (502) T ss_pred CceeEEEEecchhHHHh-hcccChhHHh----hhcccchhhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc Confidence 44332221111000000 0000011110 00000000000000 000000 001112233322222211 Q ss_pred HHH-----------------HHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhh Q lcl|NC_015287. 71 SAI-----------------EDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYV 133 (563) Q Consensus 71 ~Ai-----------------~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYv 133 (563) ..- ..||+ ..+.=--..||.+...+.+.. +.+.+.+..+++..+|+....++.+...+ T Consensus 76 ~~~~~~~~~~~~~ki~~n~~k~Ivd-~~~~yl~g~p~~~~~~d~~~~----~~~~~~l~~~~~~N~~~~~~~~~~~~~~~ 150 (502) T protein:vir:48 76 KSGRRKDNEMADKRAVHNYGRMISK-FKTGYLAGNPIRVEYDDNEDN----SQNDDAIKRIGRINDIDTHNRNLIRDLSQ 150 (502) T ss_pred ccccccccccccceeecchHHHHHH-HHhhhhcccCeeEecCCccch----hHHHHHHHHHHhhcCHhHHHHHHHHHHhh Confidence 100 11111 111112345666666553333 45667778888889999999999999999 Q ss_pred hhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCcccccccccc Q lcl|NC_015287. 134 DGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALG 213 (563) Q Consensus 134 DGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g 213 (563) .|+-|.+.-+|. +|-..++.+||+.+..|..-.........+.+-...... .....+=+|.+.........+ T Consensus 151 ~G~a~~~v~~de----dg~~~i~~~~p~~~~~vydd~~~~~~~~~ir~~~~~~~~----~~~~~~~iyt~~~i~~~~~~~ 222 (502) T protein:vir:48 151 TGRAYEVIYRSE----YDETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQ----NAKDVVEIYTNQHIYTLDASD 222 (502) T ss_pred cCeEEEEEEeCC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEEeecC----CcEEEEEEEeCCeEEEEEeCC Confidence 999999877763 566789999999998874221111111112111100000 111111134333222221111 Q ss_pred ccCcCccccCcceeeecccceeeeecceecCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCc Q lcl|NC_015287. 214 AIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYLHKSIKALNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLP 292 (563) Q Consensus 214 ~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~-m~EDalVIyRi~RAPeRRvFyIDvGnlp 292 (563) ...-....++ -+..=+|++. .|..-..|=++.++....-+. ++=+....-+-++.|-+-+.=......+ T Consensus 223 ~~~~~~~~~~----~~g~vPvv~~------~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~ 292 (502) T protein:vir:48 223 SFNEISVTPH----AFGTVPITEF------LNNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQG 292 (502) T ss_pred ceeeccceec----CCCccceEEe------cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccc Confidence 0000000000 0111123221 122334455655444443332 2233333444555555444322111111 Q ss_pred hHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHH-HHHHHHHHHHHh Q lcl|NC_015287. 293 KMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELS-DIEYFQKKLYRS 371 (563) Q Consensus 293 k~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~-DV~YF~~kLy~a 371 (563) ... .- +.+ .++++ +...... -..+.+..+.+|-...+...+. -++-+.+.+|+. T Consensus 293 ~~~--~~----~~~--~~~~~-------------~~~~~~~----~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~ 347 (502) T protein:vir:48 293 MQA--SD----MKR--TRLMQ-------------LKPPKSA----DGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVF 347 (502) T ss_pred cch--hh----hhh--cceee-------------ccccccc----cccccCcceeEeeecCCHHHHHHHHHHHHHHHHHH Confidence 000 00 000 11110 0000000 0112233455554433333333 356667778888 Q ss_pred cCCChhhccCCCcceeccccc-h-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC-hhHHHHhhhceEEEeec Q lcl|NC_015287. 372 LGVPESRIAGSGDGFNLGRSS-E-ILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVT-PEDWEVLGDHIQYDFVY 448 (563) Q Consensus 372 L~VP~sRl~~~~~g~~~g~~~-e-ItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t-~eew~~~~~~i~~~f~~ 448 (563) .++|---+ + .|. |..| + |..-......-+.+.++.|..-+.+.++.=+-+-++.. ..+++ ...|.+.|.. T Consensus 348 s~~p~~~~--~--~~~-~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d--~~~i~i~f~~ 420 (502) T protein:vir:48 348 TNTPDMSD--N--HFS-GNASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFD--ESRLKITFTP 420 (502) T ss_pred hCCCCcCc--c--ccc-cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--cccceEEeCC Confidence 88884222 1 222 2222 2 22122223344566666677666666655322222221 11222 2457888854 Q ss_pred cchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCC-CCCCcccCcccccccc Q lcl|NC_015287. 449 DNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGK-LADPMEVQQLQLGVHP 527 (563) Q Consensus 449 Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~-~~~p~~~~~~~~~~~~ 527 (563) -..-.+ .+.++++.++. | .+|.+++++.+-..+| .+++.++|++|...-. -..|....+ ..+ T Consensus 421 ~~p~d~-------~e~a~~~~kl~---g-~iS~et~l~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~-~~~--- 483 (502) T protein:vir:48 421 NLPKSL-------YEQVSILNDLG---G-QVSQETALSLSGLVEN--PTEELDKINEESSKIDFKGYPSYFYD-NVG--- 483 (502) T ss_pred CCCcCH-------HHHHHHHHHHh---c-cCcHHHHHHhCCCCCC--HHHHHHHHHHHHHhhhhhcccccccc-ccc--- Confidence 332222 23445555553 4 4899999998654343 3344445555533211 111111110 110 Q ss_pred ccCCCCcCCCCCcccccccCCCCcccCCC Q lcl|NC_015287. 528 EQMPGGAMNPQDPNMGGAPQEGGAEQAPQ 556 (563) Q Consensus 528 ~~~~~~~~~p~~~~~~~~~~~~~~~~~~~ 556 (563) .+....++.+ +-..|..|+ T Consensus 484 ---~~~d~~~e~~-------~~~~~~~~~ 502 (502) T protein:vir:48 484 ---KYTDEVKETH-------TDDFERVYE 502 (502) T ss_pred ---ccCCCccCCC-------CcCcCCCCC Confidence 0000001111 111112222 No 168 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=94.57 E-value=0.0041 Score=33.70 Aligned_cols=410 Identities=11% Similarity=0.091 Sum_probs=163.5 Q ss_pred ccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHH-hhccchhHHHHHhhcceEEe Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREM-ALHPEVDSAIEDIINEAIVS 83 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m-a~~PEvd~Ai~eIvneaiv~ 83 (563) ||| +++-- ..+++.+. |+... ..|.....+.+.-... ..... .++|-|..||+-|.+.+--. T Consensus 1 Mg~-~~~~~-~~~~~~~~-~~~~~------~~~~~~~~~~~~~~~~--------~~~~~~~~~~~v~~~i~~ia~~ia~l 63 (423) T protein:vir:81 1 MGF-LQKLG-LAPSVVAT-PEPIE------LVGPIFESLKLSTKNM--------TVEQIWEDQPHLRTVTTFIARNVASL 63 (423) T ss_pred Cch-hHhhc-cccccccC-ccccc------cccccccccccccchh--------hHHHHHHhhhHHHHHHHHHHHhHhhC Confidence 555 22210 11111111 11110 1111111111110000 11222 57899999999999965421 Q ss_pred cCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhc---chhhhH----HHHHHHhhhhhhheeeeecCCCCCcceeEEE Q lcl|NC_015287. 84 DQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILN---FDDKCH----EILRNWYVDGRIYYHKVIDIKAPEEGIKEVR 156 (563) Q Consensus 84 d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~---F~~~~~----~~~R~WYvDGri~fhkviD~~~~k~GI~eLr 156 (563) |+.|--.+ .+..++.+.+ ..+.++|. =...+. .++..+.+.|.-|..++.|.. ....+.+|+ T Consensus 64 -----p~~~~~~~---~dg~~~~~~~--~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~-~~~~~~~l~ 132 (423) T protein:vir:81 64 -----QLQAFERV---EDGGRERVRE--GHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLG-VDTPTLDIR 132 (423) T ss_pred -----ceEEEEEe---cCCceeeecc--chHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCC-cCcceEEEe Confidence 22221000 1111112211 12223332 111233 345566788988876655532 223345555 Q ss_pred EeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceee Q lcl|NC_015287. 157 YIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISF 236 (563) Q Consensus 157 ~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y 236 (563) ++.|..+.. ...++. .+. -+|.+... . ..+...+.++.+.|.| T Consensus 133 p~~~~~v~~---~~~~~~-~~~------------------~~Y~~~~~----~-----------~~~g~~~~~~~~evih 175 (423) T protein:vir:81 133 PIPVSWVQR---RAYKDG-WGS------------------LDYIIIES----G-----------DNDGRSVKVPGERVIH 175 (423) T ss_pred ecccceeee---eeccCC-Ccc------------------eEEEEEEe----c-----------CCCceEEEEcccceEE Confidence 554433221 111111 000 11221000 0 0012246678888865 Q ss_pred eecceecCCCC-ccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeE-ee Q lcl|NC_015287. 237 CHSGLVDRNKQ-TILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLV-YD 314 (563) Q Consensus 237 ~hSGl~d~~~~-~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklv-Yd 314 (563) .+ .++.++. .+.|-+..|..++.....+++...=+=---+.-+-|+..|-...|.+-.++-.+.+..+++.... -- T Consensus 176 ~r--~~~~~~~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~~~~~~~~~~~~~~ 253 (423) T protein:vir:81 176 RH--GYNPKTMKRGKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAESRTRFMANLRASFSPKS 253 (423) T ss_pred ec--CCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHHHHHHHHHHHHHHhcccc Confidence 54 2444444 46788999999888877777764433222345666777775443322122222333333333221 11 Q ss_pred CCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHH---HHHHHHHHHhcCCChhhccCCCcceecccc Q lcl|NC_015287. 315 ASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDI---EYFQKKLYRSLGVPESRIAGSGDGFNLGRS 391 (563) Q Consensus 315 ~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV---~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~ 391 (563) .++|.+- .|+ .|.+++.|. .+.-++.-+ ++-...+.++++||...+..-+++ +. T Consensus 254 ~n~g~~~-------vl~----------~g~~~~~l~--~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~-t~--- 310 (423) T protein:vir:81 254 SDVGGTL-------LLE----------DGMKAENFH--TTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNA-NY--- 310 (423) T ss_pred ccCCcce-------ecC----------CCceEEecc--CChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCC-Cc--- Confidence 2344332 221 245566663 333344333 355677999999999888421111 11 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015287. 392 SEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAV 471 (563) Q Consensus 392 ~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~ 471 (563) +-+...-..|..++ |+-.+ ..+.+.|-..| ++..+|+.-...|+|++. . |.... ++.|.+.++.+ T Consensus 311 sn~e~~~~~f~~~~--L~P~~-~~ie~~l~~~L-----~~~~~~~~~~~~~~fd~~--~----llr~d-~~~r~~~~~~~ 375 (423) T protein:vir:81 311 SNVREFRKALYGDN--LGSWI-RIIQDVMNLFL-----LPRVGIDNEKFYFEFNLE--E----KLRAS-FEEAAEIKRAA 375 (423) T ss_pred ccHHHHHHHHHHHH--HHHHH-HHHHHHHhhhh-----cCccccccCccEEEecch--h----hhccC-HHHHHHHHHHH Confidence 11222223355542 33211 12334444443 344444433334444332 2 21111 13445544332 Q ss_pred hhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCc Q lcl|NC_015287. 472 DPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGA 551 (563) Q Consensus 472 ~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~ 551 (563) -.-.| + ||.-|+-+. ..-+++ |+ + +.. ..|.|-....+..+|+. T Consensus 376 l~~~G-~------------~T~NE~R~~-------~gl~p~--~g--G-D~~-----------~~p~n~~~~~~~~~~~~ 419 (423) T protein:vir:81 376 VGNVA-W------------MTINEVRAM-------DNLPSI--DG--G-DDL-----------ARPLNTEFGDSEDAPGE 419 (423) T ss_pred HhCCC-C------------cCHHHHHHH-------hCCCCC--CC--c-cee-----------ecccccccCccCCCCCC Confidence 11111 2 444443321 122222 11 0 000 01333222222223333 Q ss_pred ccCC Q lcl|NC_015287. 552 EQAP 555 (563) Q Consensus 552 ~~~~ 555 (563) ++.+ T Consensus 420 ~~~t 423 (423) T protein:vir:81 420 EVET 423 (423) T ss_pred CCCC Confidence 3333 No 169 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=94.52 E-value=0.0042 Score=33.63 Aligned_cols=404 Identities=8% Similarity=-0.009 Sum_probs=193.6 Q ss_pred ccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEecCCCcEEEEEeecccc Q lcl|NC_015287. 20 VSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSDQNDSPVEIDLENLPA 99 (563) Q Consensus 20 ~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d~~~~pV~l~l~~~~~ 99 (563) ++--.+-+ +--..++.|-.--.++++ .-..++-..||... +=+.-||+..++=. +.+-.. ..| T Consensus 1 l~~~~~r~-~~~~~yY~g~~~~~~~~~----~~p~~~~~~~~~v~--nw~~~~Vds~a~rl-~~~Gf~------~~d--- 63 (410) T protein:vir:95 1 MNLYQSRV-NLRYKHYAMQHYEAPTGI----TIPAHIRAKYQAVL--GWAAKGVDSLADRL-IFRAFA------NDD--- 63 (410) T ss_pred CCcchhhH-HHHHHHhcCCCCccccch----hccHHHHhHHHhhc--chhHHHHHHhHhhh-cccccc------CCC--- Confidence 00000000 000001111111001111 11122223344333 33344666654411 111100 011 Q ss_pred chHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccc Q lcl|NC_015287. 100 SKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAI 179 (563) Q Consensus 100 s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i 179 (563) .++..|....+|+.+..+.++.=++.||-|. +|-+. .+|-..++.++|+.+.-+.- +..+... T Consensus 64 ---------~~l~~i~~~N~ld~~~~~~~~~al~~G~sf~--~v~~~--~d~~~~i~~~sP~~~~~i~D----p~~~~~~ 126 (410) T protein:vir:95 64 ---------FNVTEIFDRNNPDIFFDSAILSALIGSCSFV--YISKG--EDDEVRLQVIESSNATGVID----PITGLLV 126 (410) T ss_pred ---------chHHHHHhhcChHHHHHHHHHHHHHhCceeE--EEecC--CCCceEEEEEcccceEEEEe----CCCCceE Confidence 1356778888999999999999999999665 23222 35667889999999876642 2112221 Q ss_pred eeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeecceecCCCCccchhH-HHHHH Q lcl|NC_015287. 180 NQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNKQTILSYL-HKSIK 258 (563) Q Consensus 180 ~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~~~~~syL-~~Aik 258 (563) ..++.. ..+ ..+......+|.|..+....-.+ +.-...+.. -.+ =.|.|+|..=.+.. -+.|-+ +..+. T Consensus 127 ~al~~~-~~~-~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~--g~v--PvV~f~n~~~l~~~--~G~s~I~~~v~~ 196 (410) T protein:vir:95 127 EGYAVL-ARD-DYNRPTLEAYFEPNATHFIPKDG--EPYSVTNET--GIP--LLVPVIHRPDAVRP--FGRSRITRAGMY 196 (410) T ss_pred EEEEEE-Eec-CCCeEEEEEEEeCCcEEEEeeCC--ccccccCCC--CCc--ceEEecccccCCcc--CCccccchhHHH Confidence 111100 000 01112223334332222111111 110011110 001 13444544222222 123323 22222 Q ss_pred HHHH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccc Q lcl|NC_015287. 259 ALNQ-LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPR 337 (563) Q Consensus 259 ~~Nq-L~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpR 337 (563) ..+- -+.|.++++.=...=.|.|-++=+|.-.-|..+-..+ +--=..+|. T Consensus 197 l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~-----------------------------~~~i~~~~~ 247 (410) T protein:vir:95 197 YQKYAKRTLERADITAEFYSWPQKYILGLDPDAEPMEKWKAT-----------------------------VSSLLTISS 247 (410) T ss_pred HHHHHHHHHHHHHHHHHHhcchhheeeccCCCCCcCchhhhh-----------------------------hhhheeccC Confidence 2222 2577888888889999999887665422122111111 111144566 Q ss_pred cCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 338 REGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFN 417 (563) Q Consensus 338 ReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~ 417 (563) -++|-+.+|..++++.==+=++-++=.-..+....++|..-|.... -|-..+..|.-.|....+-+.+.|+.|..-+. T Consensus 248 ~~~~~~~~v~q~~~~~l~~~~~~l~~l~~~~a~~s~lP~~~lg~~~--~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~ 325 (410) T protein:vir:95 248 SDKGVKPSVGQFTTASMSPFTEQLRTAAAGFAGEMGLTLDDLGFVS--DNPSSVEAIKASHENLRLAGRKAQRSLGAGLL 325 (410) T ss_pred CCCCCcceEEecCCCChHHHHHHHHHHHHHHhhhcCCCHHHhcccc--CchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6777778888888765222244555556667777799998884221 11122335777788899999999999999999 Q ss_pred HHHHHHHHhcCCCC--hhHHHHhhhceEEEeec--cchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccCh Q lcl|NC_015287. 418 DMLKTQLILKNIVT--PEDWEVLGDHIQYDFVY--DNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKD 493 (563) Q Consensus 418 d~Lk~qLiLkgI~t--~eew~~~~~~i~~~f~~--Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~td 493 (563) +++|.-|.+.+-.. +.+|. .+.+.|.. |- + .--+.++.+.+..+..- |+-+...-+..+.|+++| T Consensus 326 ~~~rla~~i~~~~~~~~~~~~----~~~v~W~p~~d~---~---~~s~a~~aDa~~Kl~~a-~~g~~~~~~~~~~lg~~~ 394 (410) T protein:vir:95 326 NVAYVAACLRDEFRYTRSQFV----RTAVKWEPLFEA---D---ANTMTMIGDGVVKLNQA-LPGYINAETIRDLTGIAG 394 (410) T ss_pred HHHHHHHHHhcCCCCcccccc----eeeEEeeecCCc---c---hhhHHHHHHHHHHHHHh-ccCCccHHHHHHhcCCCh Confidence 99999888866543 33443 35555542 21 1 11234555554433332 222222345667778999 Q ss_pred hhHHHHHHHHHHHhcCCC Q lcl|NC_015287. 494 EEVIEIDKQMEKEIKDGK 511 (563) Q Consensus 494 eei~e~~~qie~E~~~~~ 511 (563) .++..+.. ++....|- T Consensus 395 ~~~~~~~~--~e~~~~g~ 410 (410) T protein:vir:95 395 DMSAKPVV--SEGGSNGE 410 (410) T ss_pred HHHHHHHH--HHHHhCCC Confidence 87764332 22222222 No 170 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=94.32 E-value=0.0047 Score=33.32 Aligned_cols=191 Identities=15% Similarity=0.201 Sum_probs=86.2 Q ss_pred EEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccC--CC----Ccc-ceeecCCCCCc Q lcl|NC_015287. 283 IFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRRE--GG----RGT-EITTLPGGQNL 355 (563) Q Consensus 283 vFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRRe--Gg----rgT-EIsTLpGg~nL 355 (563) ||.++. +.+-+ +...+++ ++.|.+..++ |+ ++ +.+ +..++ ..+| T Consensus 1 V~k~~~------------------l~~~~--~~~~~~~---~~r~~~~~~~----~~~~~~~~ld~~~e~~e~~--~~~l 51 (201) T protein:vir:10 1 MWKAKG------------------LADLC--DDSDGAA---RLRLAQVDNN----SGVGQAIGIDADSEEYNVL--NSDI 51 (201) T ss_pred CccchH------------------HHHHh--cCChHHH---HHHHHHHHHh----hhhhhhheeecCCcceeee--ecCc Confidence 333211 00000 0000111 1222222111 10 00 000 11111 2366 Q ss_pred chHHH-HHHHHHHHHHhcCCChhhccCC-CcceeccccchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCCh Q lcl|NC_015287. 356 GELSD-IEYFQKKLYRSLGVPESRIAGS-GDGFNLGRSSEILRDEIKFTKFVGRMRKR-FSHLFNDMLKTQLILKNIVTP 432 (563) Q Consensus 356 gei~D-V~YF~~kLy~aL~VP~sRl~~~-~~g~~~g~~~eItRDElKF~KfI~rLr~r-Fs~if~d~Lk~qLiLkgI~t~ 432 (563) +-++| +..|...+=-+.++|+.||-+. .+|+|- +++ -|.-.|..+|..+|.+ +..+...+++ -+..+ T Consensus 52 sGl~d~l~~~~~~iaa~s~iP~t~LfG~sp~Glna--tge--~d~~nyyd~i~~~Qe~~l~p~le~l~~-----~~~~~- 121 (201) T protein:vir:10 52 GGIDTFLSQKFDRIVALSGIHEIILKGKNVGGVSA--SQN--TALETFYGYVDRKRKAELLPLLEFLLP-----FIVTE- 121 (201) T ss_pred CChHHHHHHHHHHHHhHhcCchhhhcCCCCccccc--cch--hHHHHHHHHHHHHHHHHHHHHHHHHHH-----hhcCC- Confidence 77777 4578888999999999999543 335542 111 1223599999999953 3444443333 23332 Q ss_pred hHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCC Q lcl|NC_015287. 433 EDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKL 512 (563) Q Consensus 433 eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~ 512 (563) +.++|.|..=..-+|...+||.....++++.+-.- | . ++.+|+. +.+......+-. T Consensus 122 -------~~~~~~f~pL~~~s~kekAei~~~~a~a~~~~~~~-g-~------------i~~~e~r---~~L~~~~~~~~~ 177 (201) T protein:vir:10 122 -------QEWSVEFNPLSQVSDKDKSEILEKNVNSVAALIAA-G-I------------IDADEAR---DTLRAISTEVKI 177 (201) T ss_pred -------CCceEeeCCCCCCCHHHHHHHHHHHHHHHHHHHHc-C-C------------CCHHHHH---HHHHhcCCcCCC Confidence 35788899888889999999999999888665332 1 2 3333332 222222222222 Q ss_pred CCCcccCccccccccccCCCCcCCCCCc Q lcl|NC_015287. 513 ADPMEVQQLQLGVHPEQMPGGAMNPQDP 540 (563) Q Consensus 513 ~~p~~~~~~~~~~~~~~~~~~~~~p~~~ 540 (563) ++-.-+.+.... ...-++..|.|- T Consensus 178 ~~~~~~~~~~~~----e~~dp~~~~~~~ 201 (201) T protein:vir:10 178 GEGSIQTEVVIN----ESEDPLDVSANN 201 (201) T ss_pred CCCCCCcccccc----ccCCCCCCCCCC Confidence 211111111110 000111111111 No 171 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=93.96 E-value=0.0058 Score=32.83 Aligned_cols=445 Identities=12% Similarity=0.092 Sum_probs=169.3 Q ss_pred ccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhH----------- Q lcl|NC_015287. 3 KLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDS----------- 71 (563) Q Consensus 3 ~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~----------- 71 (563) =++.+ .++.+- ..+-|.+-..-+.. ....+-.... .....+|+.+..+.+-+. T Consensus 1 ~~~~~--~~~~~~-----~~~~~~~~~~l~~~-------~i~~li~~~~--~~~~~r~~~l~~YY~g~~~~i~~~~~~~~ 64 (506) T protein:vir:94 1 MDYDL--TEHKQA-----NLIYQESLENLTPN-------KIMKFITHHF--NYQRPRLEMLDDYYQGYNLKILDKQSRRH 64 (506) T ss_pred CCcch--hhhhcc-----eeecccchhcCCHH-------HHHHHHHHHH--HHHHHHHHHHHHHhcCCCccccccccccc Confidence 12222 111110 01111110111100 0000000000 111123333333322221 Q ss_pred ------------HHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhhe Q lcl|NC_015287. 72 ------------AIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYY 139 (563) Q Consensus 72 ------------Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~f 139 (563) -...||+- .+.=--..||.+..++ +...+.++.|.+-.+|+....++.+.+.+.|+-|. T Consensus 65 ~~~~~~~ki~~n~~~~Iv~~-~~~~l~G~p~~~~~~d--------~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~ 135 (506) T protein:vir:94 65 EDGKADHRATHSFAKYIADF-QTSYSVGNPINVKLPD--------DGSNSGFDTFNKANDVDAENYDLFLDMSRYGRAYE 135 (506) T ss_pred cccCCcceeecchHHHHHHH-hhhhhcccCceeecCc--------chHHHHHHHHHhccCHhHHHHHHHHHHHhcCeEEE Confidence 11122221 0011124566665543 23456677788888999999999999999999998 Q ss_pred eeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeec---ccccccccccccce-eeecCcccccccccccc Q lcl|NC_015287. 140 HKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNS---KNPADIENPEIEEY-YQYDPSQTQGKNALGAI 215 (563) Q Consensus 140 hkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~---~~~~~~~~~~~~ey-f~y~~~~~~~~~~~g~~ 215 (563) +.-+| ++|-..+..+||+.+..|..-... ...+-.|+. ....+.....+..| ..|.+.........+.+ T Consensus 136 ~v~~d----ed~~~~i~~~~p~~~~~v~dd~~~---~~~~~~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~ 208 (506) T protein:vir:94 136 YVYRG----EDNEEHLAKLDPLDTFVIYSTDVD---PKPIMAVRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTPIM 208 (506) T ss_pred EEEec----CCCeeEEEEEcccceEEEecCCCC---CceEEEEEEEeeeeccCCceeEEEEEEEEEeCceEEEeccccCc Confidence 87666 357888999999999888532222 122222211 00000000001111 12322222111110000 Q ss_pred CcCccccCcceeeeccccee-eeecceecCCCCccchhHHHHHHHHHH--------HHHHHHHHHHHHHhcCcc-ceEEE Q lcl|NC_015287. 216 GQTPFSTKQRPVKIAPDAIS-FCHSGLVDRNKQTILSYLHKSIKALNQ--------LRMIEDSLVIYRLSRAPE-RRIFY 285 (563) Q Consensus 216 g~~~~~~~~~~iki~~daI~-y~hSGl~d~~~~~~~syL~~Aik~~Nq--------L~m~EDalVIyRi~RAPe-RRvFy 285 (563) +. ......+ -+..=+|+ |+ |..-..|-++..+..... ...+++..-.+++...-. ....- T Consensus 209 ~~--~~~~~~~-~~g~vPvv~~~-------n~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~ 278 (506) T protein:vir:94 209 GK--MQVDTTK-PITTFPVVEFK-------NSNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEG 278 (506) T ss_pred cc--eeccccc-cCCccceEEec-------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccc Confidence 00 0000000 00000121 21 222233444444443322 334455444444443311 11111 Q ss_pred ccCCCC-----------chHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCcc----ceeecC Q lcl|NC_015287. 286 IDVGNL-----------PKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGT----EITTLP 350 (563) Q Consensus 286 IDvGnl-----------pk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgT----EIsTLp 350 (563) .+.... ......+++.+ |.+ ++. ++++-..+..|+ .+..|- T Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~--~~~---------------------~~~~~~~~~~~~~~~~d~~~l~ 334 (506) T protein:vir:94 279 SDMMNTIDPNDEDAMAKLAKDKLELIKE-MKD--ANM---------------------LLLKSGMTVNGTQTSVDAKYIN 334 (506) T ss_pred hhccccccccccccccccccchhHHHhh-hhh--cCe---------------------eeecccccccCccccccceeee Confidence 111110 00111111111 100 111 222222221121 222222 Q ss_pred CCCCcch-HHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 351 GGQNLGE-LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKF-----TKFVGRMRKRFSHLFNDMLKTQL 424 (563) Q Consensus 351 Gg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF-----~KfI~rLr~rFs~if~d~Lk~qL 424 (563) -..++.. -.-+.-+.+.+|...++|- +..+ +|. |.+| -.-+|+ ..-+.+.|..|..-+.++++.=+ T Consensus 335 ~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~--~~~-~n~S---g~Aik~~~~~l~~k~~~k~~~~~~~l~~~~~li~ 406 (506) T protein:vir:94 335 KTYDVVGSEAYKKRVAGDIHKFSHTPD--LTDE--NFA-SNSS---GVAMQYKVLGTVELASTKRRMFERGLYARYQIIS 406 (506) T ss_pred ecCCHHHHHHHHHHHHHHHHHHhCccc--cccc--ccc-ccch---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222221 1235556778888899984 2222 222 2222 223443 35667777777777777776533 Q ss_pred Hh---cCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHH Q lcl|NC_015287. 425 IL---KNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDK 501 (563) Q Consensus 425 iL---kgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~ 501 (563) -+ ++....-+| ..+++.|...---.+... ++++..+ +| .+|.++++..+- .++ +.+++.+ T Consensus 407 ~~~~~~~~~~~~d~----~~i~i~f~~~~p~d~~e~-------a~~~~kl---~g-~iS~et~~~~lp-~v~-d~~~E~~ 469 (506) T protein:vir:94 407 DIENSIHGDWTFDP----QELTFTFRDNLPADNISQ-------IKALVQA---GA-TLPQKYLYQQLP-GVT-NPQDIVD 469 (506) T ss_pred HHHHhcCCcccccc----ccceEEeCCCCCcCHHHH-------HHHHHHH---hc-cCChHHHHHhCC-CCC-CHHHHHH Confidence 22 222222233 357788865444334333 3344444 24 599999998754 433 2344445 Q ss_pred HHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccCCCCC Q lcl|NC_015287. 502 QMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAPQMP 558 (563) Q Consensus 502 qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~ 558 (563) +|++|.... ++.. +..+ .++.+.+ .++...++..++- T Consensus 470 ri~~E~~~~---~~~~---~~~~----------~~~~~~~----~~~~~~~~~~e~~ 506 (506) T protein:vir:94 470 MMKEQSANG---DYSF---DQNG----------VISNDGQ----TNTTATQTDEEVR 506 (506) T ss_pred HHHHHHHHH---hhcc---hhhc----------CCCcccC----ccccccccccCCC Confidence 555554321 0100 0000 0011110 1111111111111 No 172 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=93.79 E-value=0.0064 Score=32.62 Aligned_cols=400 Identities=11% Similarity=0.076 Sum_probs=161.2 Q ss_pred ccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEE Q lcl|NC_015287. 3 KLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIV 82 (563) Q Consensus 3 ~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv 82 (563) =||+....... .......++--.....++.....+-+ |. ...+++|-|..||+.|.+.+-- T Consensus 1 ~~~~r~~~~~~-------~~~~~~~~~~~~~~~g~~~s~~~~~v-----t~-------~~al~~~~v~~~v~~ia~~iA~ 61 (419) T protein:vir:14 1 MFFSRQLLSNL-------GQTQMSAGGWVSALLGSSRSDSGQVV-----TP-------ASALALTVLQNCVTLLAESIAQ 61 (419) T ss_pred Ccccccccccc-------cccccCcchhhHHhhcCCCccCCccc-----ch-------HHhhccHHHHHHHHHHHHhhcc Confidence 23332111111 11111111111111111111110001 00 1345788899999999996542 Q ss_pred ecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhc----chhhhHHH----HHHHhhhhhhheeeeecCCCCCcceeE Q lcl|NC_015287. 83 SDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILN----FDDKCHEI----LRNWYVDGRIYYHKVIDIKAPEEGIKE 154 (563) Q Consensus 83 ~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~----F~~~~~~~----~R~WYvDGri~fhkviD~~~~k~GI~e 154 (563) -|+.+-=.+ . +.-+... ...+.++|. -...+.++ +..+.+.|.-|..++.|.. .-+++ T Consensus 62 -----lp~~~~~~~-~--~~~~~~~---~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~---G~~~~ 127 (419) T protein:vir:14 62 -----LPIELYERS-G--EDRKPAT---DHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSD---GVIQG 127 (419) T ss_pred -----CceEEEEec-C--Ccccccc---ccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC---CcEEE Confidence 233331111 0 1101111 122333332 22344444 4455667888887766642 23789 Q ss_pred EEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccce Q lcl|NC_015287. 155 VRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAI 234 (563) Q Consensus 155 Lr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI 234 (563) |.+|+|..|..++. . . + .-+|.+... . .++.+.| T Consensus 128 l~pl~~~~v~v~~~---~-~---~-----------------~~~y~~~~~----------~------------~~~~~~i 161 (419) T protein:vir:14 128 LYPLDNEAVTVMRG---S-D---L-----------------KPVYRVRGS----------D------------PMPQRLV 161 (419) T ss_pred EEEecCceEEEEEC---C-C---c-----------------eEEEEEccC----------c------------ccchhhe Confidence 99999988765421 1 0 0 112222110 0 1223444 Q ss_pred eeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCc--hHHHHHHHHHHHHhhcceeE Q lcl|NC_015287. 235 SFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLP--KMKAEQYLKEVMNRYRNKLV 312 (563) Q Consensus 235 ~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlp--k~KAeqYl~~im~k~knklv 312 (563) .|. -.++.++-.+.|-|..|...+.....+++...=+----+--+-++..+...-+ ..++.+-+++. ++.+. T Consensus 162 ~h~--~~~~~dg~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~---~~~~~- 235 (419) T protein:vir:14 162 HHV--RWMSINGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDRITDG---WNAKF- 235 (419) T ss_pred eEe--cCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHH---HHHHh- Confidence 322 23445555678999999999888888877655544445666777877642211 22222223322 22211 Q ss_pred eeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCChhhccCCCcceecc Q lcl|NC_015287. 313 YDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE---LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLG 389 (563) Q Consensus 313 Yd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g 389 (563) +| ..+..+.+ .++ .|.+++.|. .+..+ ++--++..+.+.++++||...+...+.+ +.. T Consensus 236 ----~g-~~nag~~~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~-t~s 296 (419) T protein:vir:14 236 ----GG-SGNAKKVA-LLQ----------EGMTFRPLS--MTNVDAALIDALRLSALDIARIYKIPAHMVNELERA-TFS 296 (419) T ss_pred ----cC-ccccCCce-ecC----------CCceEEEcc--CChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCC-Ccc Confidence 11 11111111 221 245666664 23333 3334577899999999999999532211 111 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 390 RSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVA 469 (563) Q Consensus 390 ~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~ 469 (563) . +.-.-+.|..++ |+-.+. .+. +-+-++++++.++.. ..+.|..+. |-. .-+..|++.++ T Consensus 297 ~---~E~~~~~f~~~~--L~P~~~-~ie-----~~l~~kll~~~~~~~----~~i~fd~~~----l~r-~d~~~~~~~~~ 356 (419) T protein:vir:14 297 N---IEHQSLQFVIYT--LLPWVK-RHE-----QAKTRDLLLPSERKQ----YFIEYNLAG----LLR-GDQSSRYAAYA 356 (419) T ss_pred c---HHHHHHHHHHHH--HHHHHH-HHH-----HHHhhhccCccccCC----eEEEEechh----hhc-cCHHHHHHHHH Confidence 1 111112243332 222111 111 222334456666542 334444322 111 11234555554 Q ss_pred HhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCccc-ccccCC Q lcl|NC_015287. 470 AVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNM-GGAPQE 548 (563) Q Consensus 470 ~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~-~~~~~~ 548 (563) .+-.- -+ +|..|+-+ . ...++++..+.- . .|.+-.. ..+.+. T Consensus 357 ~~~~~--G~------------~T~NE~R~------~-~gl~p~~gGD~~--~--------------~~~n~~~~~~~~~~ 399 (419) T protein:vir:14 357 VGRQW--GW------------LSINDIRR------L-ENMPPVKGGDIY--L--------------SPMNMVDASKPQQL 399 (419) T ss_pred HHHhC--CC------------cCHHHHHH------H-hCCCCCCCcCee--e--------------eccccccccccccc Confidence 43211 12 44434331 1 122233211000 0 0111000 000000 Q ss_pred CCcccCCCCCCCCCC Q lcl|NC_015287. 549 GGAEQAPQMPEGGEI 563 (563) Q Consensus 549 ~~~~~~~~~~~~~~~ 563 (563) +.++..|....=.|+ T Consensus 400 ~~~~~~~~~~~~~e~ 414 (419) T protein:vir:14 400 PVGKSEPTKAAIDEI 414 (419) T ss_pred cCCCCCCccccccch Confidence 111111111111112 No 173 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=93.55 E-value=0.0072 Score=32.34 Aligned_cols=409 Identities=15% Similarity=0.108 Sum_probs=176.1 Q ss_pred ccc--cchh-hhHHHHHHHHHHHhhccchhHHH---------------------------HHhhcceEEecCCCcEEEEE Q lcl|NC_015287. 44 VDI--DGVF-KSEFELIKRYREMALHPEVDSAI---------------------------EDIINEAIVSDQNDSPVEID 93 (563) Q Consensus 44 ~~~--~~~~-~~~~~LI~~YR~ma~~PEvd~Ai---------------------------~eIvneaiv~d~~~~pV~l~ 93 (563) ..+ -..+ ..-...+.+|+.+..+.+-+..| ..|||-.+ .=--+.||.+. T Consensus 1 l~~~~i~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~-~yl~G~p~~~~ 79 (451) T protein:vir:10 1 MELEKIRAIISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKA-SYMFTYPVLFD 79 (451) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhh-hheecccceee Confidence 111 1111 11122334455554444443211 11222111 11124455555 Q ss_pred eeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCC----CCCcceeEEEEeCCccceeeeee Q lcl|NC_015287. 94 LENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIK----APEEGIKEVRYIDPLKIKLVRKL 169 (563) Q Consensus 94 l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~----~~k~GI~eLr~lDP~~i~~vR~~ 169 (563) .++. + ...+..+.+++ .+|+....++.+.+.+.|+-|.+.-+|-+ .+..|-..+..++|+.+-.|..- T Consensus 80 ~~~~---~----~~~~~~~~~~~-n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~vydd 151 (451) T protein:vir:10 80 IDNN---K----ELNEKVTDVLG-NEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIPIYRN 151 (451) T ss_pred cCCc---H----HHHHHHHHHhc-cCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcccceEEEEcC Confidence 4431 1 22233333443 57999999999999999999998877743 23357778899999998777432 Q ss_pred eccCCccccceeeeccccc--ccccccccceeeecCccccccccccccC-cCccccCcceeeecccceeeeecceecCCC Q lcl|NC_015287. 170 KTDNSLRGAINQVNSKNPA--DIENPEIEEYYQYDPSQTQGKNALGAIG-QTPFSTKQRPVKIAPDAISFCHSGLVDRNK 246 (563) Q Consensus 170 ~~~~~~~~~i~~~~~~~~~--~~~~~~~~eyf~y~~~~~~~~~~~g~~g-~~~~~~~~~~iki~~daI~y~hSGl~d~~~ 246 (563) ..+......|......... ...+..+..+-+|.+.........+... +.....+...--+..=+|++. . |+ T Consensus 152 ~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~-----~-nn 225 (451) T protein:vir:10 152 GIERELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQHRFNSVPFVEF-----S-NN 225 (451) T ss_pred CCCCceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccccCCCCeeeEEEe-----c-cC Confidence 2111111111111100000 0000111112233332222211111000 000000000000111122222 1 23 Q ss_pred CccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCccccccc Q lcl|NC_015287. 247 QTILSYLHKSIKALNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRK 325 (563) Q Consensus 247 ~~~~syL~~Aik~~NqL~m-~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~ 325 (563) ....|-++..+.....+.+ +=+..-.-+-+.-|-+-+.-.+... .+ +.+.. |..++--.+... T Consensus 226 ~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~-~~----~~~~~-~~~~~~i~~~~~---------- 289 (451) T protein:vir:10 226 IKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGED-TS----EFLKE-LKRYKTIKTETD---------- 289 (451) T ss_pred CCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCccc-ch----hhHHH-HhhCCeEEecCc---------- Confidence 3345667766666555543 3333334456666655443332211 11 11111 222221111111 Q ss_pred ccchhhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHH-- Q lcl|NC_015287. 326 HMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSD-IEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFT-- 402 (563) Q Consensus 326 ~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~-- 402 (563) +.+.|-.+..|....+...... +.-+.+.+|+..++|- +..+ + +|.+|.+.-. .+|. T Consensus 290 -------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~--~--~gn~Sg~Alk-~~~~~l 349 (451) T protein:vir:10 290 -------------SEGDSGGLKTMQIEIPTEARKIILEILKKQIYESGQGLQ--QDTE--N--FGNASGVALK-FFYRKL 349 (451) T ss_pred -------------CCccCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCccc--cccc--c--cccccHHHHH-HHHHHH Confidence 1122233555555455555444 7788889999999994 3222 2 3555443332 3333 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhh Q lcl|NC_015287. 403 -KFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSL 481 (563) Q Consensus 403 -KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~ 481 (563) .-+.+.++.|..-+.++|+.=+-+-|+ .+|. .|.+.|...---.+ .+.++++..+ +| .+|. T Consensus 350 ~~k~~~k~~~f~~~l~~~~~li~~~~~~---~d~~----~i~i~f~~~~p~n~-------~e~~~~~~kl---~g-~iS~ 411 (451) T protein:vir:10 350 ELKSGLLETEFRTSFDKLIKAILYFLGV---TDYK----KIQQTYTRNMMSND-------LEDADIATKS---VG-IIPT 411 (451) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhCC---CCcc----ceeEEecCCCCCCH-------HHHHHHHHHH---hc-cCch Confidence 345566666666666666544433333 3454 46677755332222 2344555555 35 4999 Q ss_pred HHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccc Q lcl|NC_015287. 482 DYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPE 528 (563) Q Consensus 482 ~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~ 528 (563) +++++.+-..+| .+++.+++++|...-.- +...+.+++.. T Consensus 412 et~~~~~p~v~d--~~~e~~~~~ee~~~~~~-----~~~~~~~~~~~ 451 (451) T protein:vir:10 412 KIILRHHPWVDD--VEEAEKLYLEEKKIQAS-----KVSDDYNNFTE 451 (451) T ss_pred HHHHHhCCCCCC--HHHHHHHHHHHHHHHHH-----HHHhhcCCCCC Confidence 999988765554 33444444444332100 11111111111 No 174 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=93.54 E-value=0.0072 Score=32.33 Aligned_cols=334 Identities=13% Similarity=0.170 Sum_probs=144.0 Q ss_pred CCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcchhhhHHH----HHHHhhhhhhheeeeecCCCCCcceeEEEEeC Q lcl|NC_015287. 85 QNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFDDKCHEI----LRNWYVDGRIYYHKVIDIKAPEEGIKEVRYID 159 (563) Q Consensus 85 ~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~~~~~~~----~R~WYvDGri~fhkviD~~~~k~GI~eLr~lD 159 (563) --.-|+.|- ++.+ .+......+|+. =+-...+.++ +..|.++|--|..++.|.. .-+++|..|+ T Consensus 1 ia~lp~~~~-~~~~-------~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~---G~~~~L~~l~ 69 (348) T protein:vir:93 1 MASLPLKMY-EDYK-------VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY---HQPSKLFLLN 69 (348) T ss_pred CcccceEeE-ecCc-------CcccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC---CcEEEEEEEc Confidence 122244331 2222 112223333332 2233345444 4456677988887776643 2278999999 Q ss_pred CccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeec Q lcl|NC_015287. 160 PLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHS 239 (563) Q Consensus 160 P~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hS 239 (563) |..+..+.. ++. ..-+|.|... +.+.+.++.+.|.|.- T Consensus 70 ~~~v~~~~~---~~~--------------------~~~~y~~~~~------------------~g~~~~~~~~eiih~r- 107 (348) T protein:vir:93 70 PDVVEMLIE---NQS--------------------RELYYSIHAA------------------TGNKLIVHNMDMLHFK- 107 (348) T ss_pred CCceEEEEe---CCC--------------------cEEEEEEEcC------------------CCeEEEEccccEEEec- Confidence 988775522 110 0123333221 0123556777774331 Q ss_pred ceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCc Q lcl|NC_015287. 240 GLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGE 319 (563) Q Consensus 240 Gl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGe 319 (563) +....++-.+.|.|..|...+.....++... +....+.| . +...-.+++-+.++++..+.+...|. +.|. T Consensus 108 ~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~-~~~~~~~~-~-~i~~~~~~l~~e~~~~~~~~~~~~~~-------n~~~ 177 (348) T protein:vir:93 108 HIVASNMVQGISPIDVLKNTTDFDNAVRTFN-LTEMQKPD-S-FMLKYGSNVSTEKRQQVLEDFKQYYE-------ENGG 177 (348) T ss_pred CCCCCCceeeccHHHHHHHHHHHHHHHHHHH-HHhcCCCc-e-eEEecCCCCCHHHHHHHHHHHHHHhh-------cCCC Confidence 2222344445788888888877666666653 33443333 2 33334456666666655555444332 2232 Q ss_pred ccccccccchhhhhcccccCCCCccceeecCCCCCcchHHH---HHHHHHHHHHhcCCChhhccCCCcceeccccchhhH Q lcl|NC_015287. 320 IRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSD---IEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILR 396 (563) Q Consensus 320 v~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~D---V~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItR 396 (563) .+ .+ ..|.+++.|. .+.-+++= .+|..+.+.++++||...|...++ -+.+...+.. T Consensus 178 ------~~-vl----------~~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~-~~~~~~e~~~- 236 (348) T protein:vir:93 178 ------IL-FQ----------EPGVEIEPLP--KKYVSEDIVASENLTRERVANVFQLPSIFLNARSN-TNFAKNEELN- 236 (348) T ss_pred ------ee-ec----------CCCceEEEcC--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-CCcccHHHHH- Confidence 11 11 1356677664 33444333 446788899999999999964322 2222222211 Q ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_015287. 397 DEIKFTKFV-GRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYV 475 (563) Q Consensus 397 DElKF~KfI-~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~v 475 (563) ..|.+++ .-+-.++... +-..++++.+|.. ..+|+|++. . +.... +..|++++..+-.- T Consensus 237 --~~~~~~~l~P~~~~ie~~---------l~~~l~~~~~~~~-g~~i~fd~~--~----l~~~d-~~~~a~~~~~~~~~- 296 (348) T protein:vir:93 237 --RFYLQHTLLPIVKQYEEE---------FNRKLLTKTDREK-NRYFKFNVK--S----YLRAD-SATQAEVYFKAVRS- 296 (348) T ss_pred --HHHHHHHHHHHHHHHHHH---------HHHhhCCcccccC-cceEEeech--h----hhccC-HHHHHHHHHHHHhC- Confidence 1243322 2222222222 2334455666652 233444332 2 21111 34566655444221 Q ss_pred chhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccCC Q lcl|NC_015287. 476 GKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAP 555 (563) Q Consensus 476 GKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~ 555 (563) -+++.+-+|.. ..-++++ +- +.. -..+. ..|.+.. ++.. T Consensus 297 -G~~T~NE~R~~-------------------~g~~p~~--gg---D~~-----~~~~n-~~~~~~~---------~~~~- 335 (348) T protein:vir:93 297 -GYYTINDIREW-------------------EDLPPVE--GG---DKP-----LISGD-LYPIDTP---------LELR- 335 (348) T ss_pred -CCCCHHHHHHH-------------------hCCCCCC--Cc---CeE-----eeccc-ccccccc---------hhhc- Confidence 23333333321 1222221 10 000 00000 0011100 0000 Q ss_pred CCCCCCCC Q lcl|NC_015287. 556 QMPEGGEI 563 (563) Q Consensus 556 ~~~~~~~~ 563 (563) +..+||+- T Consensus 336 ~~~~gg~~ 343 (348) T protein:vir:93 336 KSLKGGDK 343 (348) T ss_pred ccccCCCC Confidence 00122222 No 175 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=93.16 E-value=0.0086 Score=31.92 Aligned_cols=396 Identities=13% Similarity=0.100 Sum_probs=168.0 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEA 80 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvnea 80 (563) +.-||.... ..++.+... . +.....+-+.+..+. -+..+++|-|..||+-|.+.+ T Consensus 20 ~~~~~~~~~---------~~~~~~~~~-~-~~~~~~~~~~~~~v~--------------~~~al~~~~v~~cv~~Ia~~i 74 (424) T protein:vir:18 20 LQSWFVGGR---------LVTPNQGSQ-T-GPVSAHGHLGDSSIN--------------DERILQISTVWRCVSLISTLT 74 (424) T ss_pred HHhhhcccc---------ccccccccc-c-ccccccccccccccc--------------HHHhhccHHHHHHHHHHHHhh Confidence 222332210 111111100 0 000111111111111 134467888999999999866 Q ss_pred EEecCCCcEEEEEeeccccchHHHHHH--HHHHHHHHH-HhcchhhhHHHHH----HHhhhhhhheeeeecCCCCCccee Q lcl|NC_015287. 81 IVSDQNDSPVEIDLENLPASKQLKDLI--REEFKTVKE-ILNFDDKCHEILR----NWYVDGRIYYHKVIDIKAPEEGIK 153 (563) Q Consensus 81 iv~d~~~~pV~l~l~~~~~s~sik~~I--~eeF~~i~~-ll~F~~~~~~~~R----~WYvDGri~fhkviD~~~~k~GI~ 153 (563) .-+ |+.|- .... +..+.++ ......+|+ --+-...++++.+ .++..|.-|.-++.|.. .-++ T Consensus 75 A~l-----p~~~~-~~~~--~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~---G~~~ 143 (424) T protein:vir:18 75 ACL-----PLDVF-ETDQ--NDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA---GDVI 143 (424) T ss_pred ccC-----ceEEE-Eeec--CCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC---CcEE Confidence 432 33321 1000 0011111 111222222 1233444555444 44556887776665532 1289 Q ss_pred EEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccc Q lcl|NC_015287. 154 EVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDA 233 (563) Q Consensus 154 eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~da 233 (563) +|.+|+|-.|...+ + . + .-+|.|.. + ...+.++.+. T Consensus 144 ~L~pl~~~~V~v~~----~-~--~------------------~~~y~~~~-----------~--------g~~~~~~~~e 179 (424) T protein:vir:18 144 SLLPLQSANMDVKL----V-G--K------------------KVVYRYQR-----------D--------SEYADFSQKE 179 (424) T ss_pred EEEEecCcceEEEE----c-C--C------------------eEEEEEEe-----------C--------CeEEEecccc Confidence 99999998875321 1 0 0 11333321 0 1235677777 Q ss_pred eeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEe Q lcl|NC_015287. 234 ISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVY 313 (563) Q Consensus 234 I~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvY 313 (563) |.|.. .++.++-.++|-++.|++.+.....+++..-=+----+--+-|...+-+.+.+..+++. ++.++++.. T Consensus 180 Iih~r--~~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~-~~~~~~~~~---- 252 (424) T protein:vir:18 180 IFHLK--GFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV-EENFKEIAG---- 252 (424) T ss_pred EEEec--CcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCCHHHHHHH-HHHHHHHhC---- Confidence 75443 24456666789999999999887777776543322323334566676666666554443 332332211 Q ss_pred eCCCCcccccccccchhhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCChhhccC-CCcceecccc Q lcl|NC_015287. 314 DASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG-QNLGELSDIEYFQKKLYRSLGVPESRIAG-SGDGFNLGRS 391 (563) Q Consensus 314 d~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~-~~~g~~~g~~ 391 (563) -.+.|. .+ .|+ .|++++.|.=. +.+.=++-.+|..+.+.++++||...|.. +++++ .| T Consensus 253 g~nag~------~~-vl~----------~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~-~~-- 312 (424) T protein:vir:18 253 GPVKKR------LW-ILE----------AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTS-WG-- 312 (424) T ss_pred CcccCC------ce-ecc----------CCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccc-cc-- Confidence 011221 11 221 25666666322 12222444567788999999999999942 22111 11 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015287. 392 SEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAV 471 (563) Q Consensus 392 ~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~ 471 (563) +-+....+.|.+++ |+- +...+.+.|-.. ++++.++. ...+.|.-+.. .. .-+..|.+.+..+ T Consensus 313 sn~eq~~~~f~~~t--l~P-~~~~ie~~l~~~-----L~~~~~~~----~~~~~fd~~~l----lr-~d~~~r~~~~~~~ 375 (424) T protein:vir:18 313 SGIEQQNLGFLQYT--LQP-YISRWENSIQRW-----LIPAKDVG----RIHAEHNLDGL----LR-GDSASRAAFMKAM 375 (424) T ss_pred ccHHHHHHHHHHHH--HHH-HHHHHHHHHHhh-----cCCccccC----CeEEEEechhh----hc-cCHHHHHHHHHHH Confidence 22333334565542 322 122233333333 34555443 24455544331 11 1123555555443 Q ss_pred hhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCc Q lcl|NC_015287. 472 DPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGA 551 (563) Q Consensus 472 ~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~ 551 (563) -.- -+ ||..|+-+ . ..-+++ |+-+ .. .-.....|.+ ......+|. T Consensus 376 ~~~--G~------------~T~NE~R~------~-~gl~pi--~gGD---~~------~~~~n~~~l~--~~~~~~~p~- 420 (424) T protein:vir:18 376 GEA--GL------------RTINEMRR------T-DNLPPL--PGGD---VA------MRQSQYVPIT--DLGTNKEPR- 420 (424) T ss_pred HhC--CC------------cCHHHHHH------H-hCCCCC--CCcC---ee------eeccCccchH--hhhccCCCc- Confidence 211 12 44433331 1 122222 1100 00 0000001111 011111111 Q ss_pred ccCCCCCCCC Q lcl|NC_015287. 552 EQAPQMPEGG 561 (563) Q Consensus 552 ~~~~~~~~~~ 561 (563) .||- T Consensus 421 ------~~ga 424 (424) T protein:vir:18 421 ------NNGA 424 (424) T ss_pred ------cCCC Confidence 1122 No 176 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=92.53 E-value=0.011 Score=31.31 Aligned_cols=416 Identities=13% Similarity=0.134 Sum_probs=175.9 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHH-------- Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSA-------- 72 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~A-------- 72 (563) =-++|-|+.. ... .+..-. ....... .-+.+|+.+..+.+=... T Consensus 5 ~~~~~~~~~~---------------------~~~-~~~~i~---~~i~~~~---~~~~r~~~~~~yy~g~~~i~~~~~~~ 56 (453) T protein:vir:73 5 PIKLMTYSRD---------------------EEI-TDKVVN---DFMKKHQ---EEVERYEYLGNMYKGIMEISSQKAKD 56 (453) T ss_pred cceeeecccc---------------------ccC-CHHHHH---HHHHHHH---HHHHHHHHHHHHhccccchhcCCCCC Confidence 0112222110 000 000000 0000001 111233333322222211 Q ss_pred ------------HHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhhee Q lcl|NC_015287. 73 ------------IEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYH 140 (563) Q Consensus 73 ------------i~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fh 140 (563) ..-||+- .+.=-..+||.+..++ +...+.+..|++-.+|+....++.+..++.|+-|.+ T Consensus 57 ~~~~~~ki~~n~~~~ivd~-~~~~l~g~~~~~~~~d--------~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~ 127 (453) T protein:vir:73 57 SWKPDNRLTNNFAKYIVDT-FVGYFNGIPIKKTHDD--------KSVLEAMQLFDNLNDMEDEESELAKIACVYGRAYEL 127 (453) T ss_pred ccCccceeecchHHHHHHH-hhhhhcccCceeecCC--------hHHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEE Confidence 1112211 1111123455544432 344566777777789999999999999999999998 Q ss_pred eeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCc--C Q lcl|NC_015287. 141 KVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQ--T 218 (563) Q Consensus 141 kviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~--~ 218 (563) .-.|. +|-..+..++|+.+-.|..-. ..+..+-.++-. ..-.+.....+|.+.........+.... . T Consensus 128 v~~d~----~~~~~i~~~~p~~~~~v~dd~---~~~~~~~~i~~~----~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~ 196 (453) T protein:vir:73 128 MYQNE----STESEVIYCSPLNVFMVYDDS---IKQKPLFAVYYG----FDEEGNLSGTVYTLLETISITGKAGEVKFGE 196 (453) T ss_pred EEeCC----CCceEEEEEcccceEEEEeCC---CCceeEEEEEEE----EecCceEEEEEEeCCeEEEEEecCCceEEcc Confidence 77763 577889999999988875322 112222222110 0011122233444332222111000000 0 Q ss_pred ccccCcceeeeccccee-eeecceecCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHH Q lcl|NC_015287. 219 PFSTKQRPVKIAPDAIS-FCHSGLVDRNKQTILSYLHKSIKALNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKMKA 296 (563) Q Consensus 219 ~~~~~~~~iki~~daI~-y~hSGl~d~~~~~~~syL~~Aik~~NqL~-m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KA 296 (563) +..+.-. ++ +|+ |++ ..-..|-++..+.....+. ++=+....-+.++.|.+-+.-. .++. T Consensus 197 ~~~~~~g--~v---Pvv~~~n-------~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~---~~~~--- 258 (453) T protein:vir:73 197 STYNVYS--DL---PIVEYNF-------NEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGA---EVDE--- 258 (453) T ss_pred ceeccCC--ce---eEEEecC-------CCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecC---CCCc--- Confidence 0000000 11 222 222 2223455555554444433 2334444556677777765422 1111 Q ss_pred HHHHHHHHHhhc-ceeEeeCCCCcccccccccchhhhhcccccCC--CCccceeecCCCCCcchH-HHHHHHHHHHHHhc Q lcl|NC_015287. 297 EQYLKEVMNRYR-NKLVYDASTGEIRDDRKHMSMLEDFWLPRREG--GRGTEITTLPGGQNLGEL-SDIEYFQKKLYRSL 372 (563) Q Consensus 297 eqYl~~im~k~k-nklvYd~~TGev~d~~~~msMlEDywLpRReG--grgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL 372 (563) +- ++..+ ++.. -+.+.. +...| +.+.++.+|-...+.+.+ .-+.-+++.+|... T Consensus 259 -~~----~~~~~~~~~~----------------~~~~~~-~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s 316 (453) T protein:vir:73 259 -ED----AKNIKDNRLI----------------NFFDKN-SNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFT 316 (453) T ss_pred -hh----hhcccccccc----------------cccccc-cccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHh Confidence 11 11111 1110 000100 11111 122346666544344432 23566778888998 Q ss_pred CCChhhccCCCcceeccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CCChhHHHHhhhceEEEeecc Q lcl|NC_015287. 373 GVPESRIAGSGDGFNLGRSSE--ILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKN-IVTPEDWEVLGDHIQYDFVYD 449 (563) Q Consensus 373 ~VP~sRl~~~~~g~~~g~~~e--ItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkg-I~t~eew~~~~~~i~~~f~~D 449 (563) ++|- +..+ . +|.+|. |..-+.....-+.+.++.|..-+.++++.=+-+.+ .-...+| ..|.+.|... T Consensus 317 ~~p~--~~~~--~--~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~----~~i~v~f~~~ 386 (453) T protein:vir:73 317 MAAN--ISDE--N--FGNSSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNASNKDAW----KDIEYTFTRN 386 (453) T ss_pred CCcc--cCcc--c--ccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccc----ccceEEeCCC Confidence 9984 3222 2 244433 22222234455667777777777777665333322 2223334 3577888654 Q ss_pred chHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccCh--hhHHHHHHHHHHHhcCCCCCCCcccCcccccccc Q lcl|NC_015287. 450 NHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKD--EEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHP 527 (563) Q Consensus 450 n~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~td--eei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~ 527 (563) .--.+ .+.+++++.+. | .+|.++++..+-..+| +|++.++++-+++.+...-..+ T Consensus 387 ~p~~~-------~~~a~~~~k~~---g-iis~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~------------ 443 (453) T protein:vir:73 387 EPKDI-------KEQAETANILK---G-ITSEETALSVISVIPDVQAEMEKIKKKKLLQLSLTRTSNL------------ 443 (453) T ss_pred CCCCH-------HHHHHHHHHHh---c-cCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccC------------ Confidence 33223 23444555553 4 5999999987765553 4554444432222111000000 Q ss_pred ccCCCCcCCCCCccc Q lcl|NC_015287. 528 EQMPGGAMNPQDPNM 542 (563) Q Consensus 528 ~~~~~~~~~p~~~~~ 542 (563) -..+..+.++ T Consensus 444 -----~~~~~~~~~~ 453 (453) T protein:vir:73 444 -----VRMKQMRGNL 453 (453) T ss_pred -----CcchhhhcCC Confidence 0001112222 No 177 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=92.51 E-value=0.011 Score=31.30 Aligned_cols=425 Identities=11% Similarity=0.021 Sum_probs=184.4 Q ss_pred Cc--------ccccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHH Q lcl|NC_015287. 1 MA--------KLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSA 72 (563) Q Consensus 1 m~--------~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~A 72 (563) |+ ++.+++..+. -....+.... .+-.....++ .+...+.-++.....++.|++|..++.|.++ T Consensus 1 ~~~~i~~~~g~~~~~~~~~~--~~~~~ia~~~----~~~~~~~~~~---~~p~~~~il~~~~~~~~~y~~m~~D~~i~s~ 71 (491) T protein:vir:79 1 MSKGLWVSPTEFVKFGEPDK--SLSSQIATRA----RSIDFFALGM---YLPNPDPVLKALGKDIRVYRELRADAHVGGC 71 (491) T ss_pred CCCeeeCCCCCcccccccch--hHHHHHhhhc----cccccccccc---cCcchhHHHhhccCCHHHHHHHhhChHHHHH Confidence 32 2222211100 0000011000 0000001111 1222333334333457899999999999999 Q ss_pred HHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcce Q lcl|NC_015287. 73 IEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGI 152 (563) Q Consensus 73 i~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI 152 (563) ++.+..-+.-. +..|.-.+ -++ ++.++...+++-++|+.-..+++.-. -.|--.++++.+.++..-.+ T Consensus 72 l~~Rk~av~~~-----~w~i~~~~--~~~----~~a~~i~e~l~~~~~~~~i~~~lda~-~~G~s~~Ei~w~~~~g~~~~ 139 (491) T protein:vir:79 72 VRRRKAAVKAL-----EWGLDRGK--AKS----RVAKSIADVFADLDLSRIATEMLDAV-LYGYQPMEITWGKVGNYIVP 139 (491) T ss_pred HHHHHHHHhCC-----CcEEecCC--CCH----HHHHHHHHHHhcCCHHHHHHHHHHhh-hhcceeEEEEEeecCCeeeE Confidence 99998854422 22222211 122 34445555555568887777776543 36777888887765433334 Q ss_pred eEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeec-c Q lcl|NC_015287. 153 KEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIA-P 231 (563) Q Consensus 153 ~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~-~ 231 (563) ..|.+.+|+. |.|++.+.......+. ...++.++ . T Consensus 140 ~~l~~r~~~~------------------------------------f~~d~~~~l~l~~~~~--------~~~g~~lp~~ 175 (491) T protein:vir:79 140 IDVVGKPADW------------------------------------FVYDPENQLRFRSKEH--------WVQGEELPAR 175 (491) T ss_pred Eeeeeecccc------------------------------------eeeccCCceEEeecCC--------CCCceeecCC Confidence 4555555533 3332221111000000 01112332 3 Q ss_pred cceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHh-cCccceEEEccCCCCchHHHHHHHHHHHHhhcce Q lcl|NC_015287. 232 DAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLS-RAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNK 310 (563) Q Consensus 232 daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~-RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk 310 (563) .-|+|+|.-- .......|-|+.|..+|-=.+......+.+=-. =.|- |+...|-|.-.+.|. ..++.+. ...+. T Consensus 176 k~i~~~~~~~--~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~-~igky~~~a~~~ek~-~l~~al~-~~~~~ 250 (491) T protein:vir:79 176 KFLVPRQEAT--YLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPM-LVGKHPRSASDAETN-LLLDRLE-DMVQD 250 (491) T ss_pred CeEEEEecCC--CCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCe-EEEecCCCCCHHHHH-HHHHHHH-HHhcC Confidence 4677776522 112233688999988888777777666665443 3454 455558776555443 2333322 22221 Q ss_pred eEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcch----HHHHHHHHHHHHHhcCCChhh-ccCCCc- Q lcl|NC_015287. 311 LVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE----LSDIEYFQKKLYRSLGVPESR-IAGSGD- 384 (563) Q Consensus 311 lvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge----i~DV~YF~~kLy~aL~VP~sR-l~~~~~- 384 (563) .| -. +| .|++|+.+.-+..-|. ..=++|..++.-+++- +- |.++++ T Consensus 251 ------a~------~v--------iP-----~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iL---GqtlTt~~~g 302 (491) T protein:vir:79 251 ------AV------AV--------IP-----DDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIALL---GQNQTTEATS 302 (491) T ss_pred ------eE------EE--------ec-----CCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHHh---hhhhccCccc Confidence 11 11 12 4688888753222222 2338899988877762 21 223321 Q ss_pred ceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHH Q lcl|NC_015287. 385 GFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNER 464 (563) Q Consensus 385 g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R 464 (563) ++.+ +++-.+ -+...++..++..+..|.++++--+.+.+-.. ..+.|.|.... ...+...++ T Consensus 303 s~a~---~~vh~~--v~~~i~~~D~~~i~~tln~li~~l~~~N~~~~--------~~p~f~~~e~e-----e~~~~~a~~ 364 (491) T protein:vir:79 303 TRAS---AQAGLE--VTDDIRDGDKAIVVEAMNMLIRWICDLNFDGA--------ARPVFDMWEQE-----QVDEIQAGR 364 (491) T ss_pred chhh---HHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC--------CcceEeecCcC-----chhHHHHHH Confidence 2222 222222 26677777788888888886666555665322 23455554321 111233444 Q ss_pred HHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCccccc Q lcl|NC_015287. 465 LGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGG 544 (563) Q Consensus 465 ~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 544 (563) +..+. +. |-=++.+|+++++- +.+.+..+. +.+.+ . ++.. +........+|.+...+. T Consensus 365 ~~~L~---~~-G~~i~~~~~~e~~G-ip~~~~~e~-----------~~~~~--~--~~~~--~~~~~~~~~~~~~~~~d~ 422 (491) T protein:vir:79 365 DEKLT---RA-GARFTPAYFKRAYN-LQDGDLDER-----------PLPVS--A--VDAV--GAASFAEFEAPDQDALDA 422 (491) T ss_pred HHHHH---hC-CCccCHHHHHHHhC-CCCCCCCcc-----------ccCcC--c--cccc--ccccccccCCCCCcchHH Confidence 44443 33 43477788776553 332111100 01000 0 0000 000000000111111111 Q ss_pred ccCCCCcccCCCC------------CCCCCC Q lcl|NC_015287. 545 APQEGGAEQAPQM------------PEGGEI 563 (563) Q Consensus 545 ~~~~~~~~~~~~~------------~~~~~~ 563 (563) +.....+.++.+. .+++.. T Consensus 423 ~~~~~~~~~~~~~~~~~~~~i~~~l~~~~s~ 453 (491) T protein:vir:79 423 ALNALSARDLNADAQALVAPLLKRIANGASA 453 (491) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCH Confidence 1111111000000 000000 No 178 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=91.99 E-value=0.013 Score=30.87 Aligned_cols=396 Identities=13% Similarity=0.108 Sum_probs=170.0 Q ss_pred CcccccchhhHhhhcccccccc-cCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcc Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSP-VPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINE 79 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~-~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvne 79 (563) ...||+ + ++..++ +|.. . .....++..+. +.. + .-+..+++|-|..||+.|.+. T Consensus 19 ~~~lf~----~-----~~~~~~~~~~~-~---~~~~~~~~~~~--~~~---------v-s~~~al~~~~v~~cv~~Ia~~ 73 (424) T protein:vir:45 19 LDALFR----S-----KSLENPSTPIT-G---DAVDTDGLFRA--DVY---------V-SPETAMKLAAVYSCIYVLSSS 73 (424) T ss_pred HHhhcc----c-----cCCCCCccccc-h---hhhhhhccccC--Cce---------e-chHHhhccHHHHHHHHHHHHH Confidence 112221 1 111111 1211 1 11111111111 110 1 113345688899999999997 Q ss_pred eEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcchhhhHH----HHHHHhhhhhhheeeeecCCCCCcceeE Q lcl|NC_015287. 80 AIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFDDKCHE----ILRNWYVDGRIYYHKVIDIKAPEEGIKE 154 (563) Q Consensus 80 aiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~~~~~~----~~R~WYvDGri~fhkviD~~~~k~GI~e 154 (563) +.-. |+.|-=++ ....++.......++|+. =|=...+++ ++..+.+.|.-|..++.|.. .-+++ T Consensus 74 iA~l-----p~~v~~~~---~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~---G~~~~ 142 (424) T protein:vir:45 74 LAQM-----PLHVMRRH---KGKVEPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRR---GEVIS 142 (424) T ss_pred HhhC-----ceEEEEec---CCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC---CcEEE Confidence 6533 34332111 111111111222233321 122234444 44566777988887766542 33789 Q ss_pred EEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccce Q lcl|NC_015287. 155 VRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAI 234 (563) Q Consensus 155 Lr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI 234 (563) |.+|+|..+...+. . + .-+|.+... ++ ...++.+.| T Consensus 143 L~~l~~~~v~i~~~-----~--~------------------~~~y~~~~~----------~~---------~~~~~~~eV 178 (424) T protein:vir:45 143 LDCCMPWETTLMNT-----G--G------------------RYTYGLYNE----------YG---------AFAISPDDM 178 (424) T ss_pred EEEecCceEEEEEc-----C--C------------------eEEEEEEec----------Cc---------eEEECcccE Confidence 99999987754421 0 0 011222110 00 124556666 Q ss_pred eeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEee Q lcl|NC_015287. 235 SFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYD 314 (563) Q Consensus 235 ~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd 314 (563) .|.. .++.++-.+.|.++.|...+.....+++...=+----|--+-|+.++- .|.+.++++--+.+-..|+- . . T Consensus 179 ih~r--~~~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~~g-~-~- 252 (424) T protein:vir:45 179 IHIR--ALGNNQKMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKS-GLNKESWGWLKDQWQKASQA-L-R- 252 (424) T ss_pred EEec--CcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC-CCCHHHHHHHHHHHHHHhcc-c-c- Confidence 4433 355677778999999999999988888876644444455567777774 46665554444433333321 0 0 Q ss_pred CCCCcccccccccchhhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccch Q lcl|NC_015287. 315 ASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGG-QNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSE 393 (563) Q Consensus 315 ~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~e 393 (563) .+.|. .+ .+ ..|.++..|.=. ..+.=++-.++-.+.+.++++||...|...+.+ +. +. T Consensus 253 ~n~g~------~~-vl----------~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~-t~---sn 311 (424) T protein:vir:45 253 RQENK------TM-LL----------PADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKA-TF---SN 311 (424) T ss_pred ccCCc------ee-Ec----------CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC-Cc---cc Confidence 01121 11 11 124555555321 122224455688889999999999999532211 11 11 Q ss_pred hhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_015287. 394 ILRDEIKFTKF-VGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVD 472 (563) Q Consensus 394 ItRDElKF~Kf-I~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~ 472 (563) +...-+.|.++ +.-+-.++. +.|-.. +.+..|+. ....+.|.-+.. ... -+..|.+.++.+- T Consensus 312 ~eq~~~~f~~~tL~P~~~~ie----~~ln~k-----Ll~~~e~~---~g~~i~fd~~~l----lr~-d~~~r~~~~~~~~ 374 (424) T protein:vir:45 312 ISAQAIQFVRYTMMPWVTNWE----QELNRR-----LFTRAELA---AGYYVRFNLTGL----LRG-TPQERAQFYHFAI 374 (424) T ss_pred HHHHHHHHHHHHHHHHHHHHH----HHHHHh-----cCChhhhc---CCcEEEeechhh----hcc-CHHHHHHHHHHHH Confidence 22222234443 223322222 222222 33444443 223455554331 111 1335555554433 Q ss_pred hhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcc Q lcl|NC_015287. 473 PYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAE 552 (563) Q Consensus 473 p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 552 (563) .- -+ ||..|+-++ ..-++++..++--. |.+. .+ +.+. T Consensus 375 ~~--g~------------~T~NE~R~~-------~gl~pi~ggD~~~~----------------~~n~--~~----~~~~ 411 (424) T protein:vir:45 375 TD--GW------------MSRNEARAF-------EDMNPVEGLDEMLV----------------SVNA--AN----PAGD 411 (424) T ss_pred hC--CC------------cCHHHHHHH-------hCCCCCCCcceeee----------------cccc--cc----cccc Confidence 22 23 343333211 12222211100000 1110 00 0011 Q ss_pred cCCCCCCCCCC Q lcl|NC_015287. 553 QAPQMPEGGEI 563 (563) Q Consensus 553 ~~~~~~~~~~~ 563 (563) ..|+...+|+= T Consensus 412 ~~~~~~~~~~~ 422 (424) T protein:vir:45 412 FKPPKNDEGKT 422 (424) T ss_pred cCCCCCCCCCC Confidence 11111111111 No 179 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=91.38 E-value=0.016 Score=30.41 Aligned_cols=459 Identities=14% Similarity=0.161 Sum_probs=170.2 Q ss_pred Ccccc------cchhhHhh---------hcc-ccccccc---CcCCCCcccceeccccccccccccchhhhHHHHHHHHH Q lcl|NC_015287. 1 MAKLF------GFKIQKDD---------EQN-KNVVSPV---PQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYR 61 (563) Q Consensus 1 m~~~~------~f~~~~~~---------~~~-~~~~s~~---p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR 61 (563) ||-|- ..+.|+.. +.. ...|.+- -.+.-+|-. +..|.-.++. .-...++...+ ..++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~g~~--~~~~~~~~~~~-~~l~ 76 (535) T protein:vir:10 1 MAILKDLRNAFSLSNKKSTSYIELGDYDKDIVNKAIRPGRASARDTVDGID-IADGNVAGQY--SVASISDVLST-KKLL 76 (535) T ss_pred ChhhHHHHHHHHhhhhhhhhhHHHhhhhHHHHHhhhhhhhhhhhccccccc-cccCCccccc--ccCccccccCH-HHHH Confidence 44332 22222210 000 0000000 000001100 0011111111 11111221111 2344 Q ss_pred HH-hhccchhHHHHHhhcceEEe------cCCCcEEEEEeeccc--cchHHHHHHHHHHHHHHHHhc----chhhhHH-- Q lcl|NC_015287. 62 EM-ALHPEVDSAIEDIINEAIVS------DQNDSPVEIDLENLP--ASKQLKDLIREEFKTVKEILN----FDDKCHE-- 126 (563) Q Consensus 62 ~m-a~~PEvd~Ai~eIvneaiv~------d~~~~pV~l~l~~~~--~s~sik~~I~eeF~~i~~ll~----F~~~~~~-- 126 (563) .+ ...|-|..||+.+.+.+.|+ ....-.+.|.|.+.. -++..+. +...+.++|. -....++ T Consensus 77 ~~~~~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~----~~~~l~~lL~~~PN~~~~~~~~~ 152 (535) T protein:vir:10 77 KAYADNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIK----RAHEIEDFIYNTGSEYYEWRDTF 152 (535) T ss_pred HHhccChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhh----hhhHHHHHHHhCCCCCCChhHHH Confidence 44 46788999999988876542 122223444554332 2222222 2223333332 1112222 Q ss_pred --HHHH----Hhhhhhh-heeeeecCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeeccccccccccccccee Q lcl|NC_015287. 127 --ILRN----WYVDGRI-YYHKVIDIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYY 199 (563) Q Consensus 127 --~~R~----WYvDGri-~fhkviD~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf 199 (563) +++. -++-|.- |..++.|. ..-+.+|.+|||.+|+........ .....|+ T Consensus 153 ~~~~~~lv~d~l~~~g~ay~~i~r~~---~G~~~~L~~l~p~~V~v~~d~~~~--------------------~~~~~~~ 209 (535) T protein:vir:10 153 PRLLTKIINDMYVQDQINIERIFKND---SNELDHFNAVDASKVVISYSPRSK--------------------DQPRKFE 209 (535) T ss_pred HHHHHHHHHHHHhhCCceEEEEEECC---CCcEEEEEEeCCceeEEEEcCccc--------------------cCceEEE Confidence 2222 2333444 44444342 334899999999988755321111 0112344 Q ss_pred eecCccccccccccccCcCccccCcceeeecccceeeeecc-eecC-CCCccchhHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015287. 200 QYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSG-LVDR-NKQTILSYLHKSIKALNQLRMIEDSLVIYRLSR 277 (563) Q Consensus 200 ~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSG-l~d~-~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~R 277 (563) .|... .....++.+.|.|.+-. +-+. .+..++|-|+.|++.+.....+++...=+---- T Consensus 210 ~~~~~-------------------~~~~~~~~~eiih~~~~~~~~~~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng 270 (535) T protein:vir:10 210 QFVSE-------------------TKSVKFSERNLTFINYWNLSDTDRRGYGYSPVEASIPLIRAIYDTEQFNARFFSQG 270 (535) T ss_pred EEecC-------------------ceeEEECcccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 33220 11245666666554321 1122 244568889999999999988887655443333 Q ss_pred CccceEEEccCCC---CchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCC Q lcl|NC_015287. 278 APERRIFYIDVGN---LPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQN 354 (563) Q Consensus 278 APeRRvFyIDvGn---lpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~n 354 (563) +--+-|..++... +.+..++. + ...+++..- | ..+..+. .+++. .|.++.-|-- + T Consensus 271 ~~p~giL~~~~~~~~~ls~e~~e~-l---k~~~~~~~~-----G-~~nag~~-~vl~~---------~g~~~~~l~~--~ 328 (535) T protein:vir:10 271 GTTRGILVIDQDGDAQANQMMLAG-I---RRQWTSQGS-----G-LGGAWKI-PILAA---------KDAKFVNMTQ--N 328 (535) T ss_pred CCccEEEEecCCCCcccCHHHHHH-H---HHHHHHHhc-----C-ccccccc-ccccC---------CCceEEecCC--C Confidence 4445667766432 33322222 2 222222110 1 1111111 13321 1333333332 2 Q ss_pred cch---HHHHHHHHHHHHHhcCCChhhccC-CCcceeccc----cchhhHHHHHHHHH----HHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 355 LGE---LSDIEYFQKKLYRSLGVPESRIAG-SGDGFNLGR----SSEILRDEIKFTKF----VGRMRKRFSHLFNDMLKT 422 (563) Q Consensus 355 Lge---i~DV~YF~~kLy~aL~VP~sRl~~-~~~g~~~g~----~~eItRDElKF~Kf----I~rLr~rFs~if~d~Lk~ 422 (563) ..+ ++-.++..+..-++.+||...|.- +.++|+-.. +..-+--|-.+..| +.-+..++...|.. T Consensus 329 ~~D~qfle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie~~ln~---- 404 (535) T protein:vir:10 329 SRDMEFDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLSFIEQVIND---- 404 (535) T ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHhccccCcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhh---- Confidence 223 233456788899999999999852 222332211 11111112222223 33333333333332 Q ss_pred HHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChh---hHHHH Q lcl|NC_015287. 423 QLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDE---EVIEI 499 (563) Q Consensus 423 qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tde---ei~e~ 499 (563) .| ++..+ ..+.|+|... .... ...|.++.+.+. .-.+|..-+|.. +.+.+. |+--+ T Consensus 405 ~L-----l~~~~-----~~~~f~f~~l------~~~d-~~~r~~~~~~~~---~g~lT~NE~R~~-~gl~piegGD~~~~ 463 (535) T protein:vir:10 405 KI-----MRYVD-----TDYRFSFTLG------DAQD-KLQEEQVWKLKL---ANGYFINEYRKD-HGLKTVDGLDVPGF 463 (535) T ss_pred hc-----ccccC-----CeEEEEeccc------cccC-HHHHHHHHHHHH---cCCCCHHHHHHH-hCCCCCCCcccccc Confidence 22 22221 2356666431 1111 234555443332 123677777664 344442 11000 Q ss_pred HHHHH---HHhcCCCCCCCcccCcccccccccc----------CCCCcCCCCCcccccccCCCCcccCCCCCCCCCC Q lcl|NC_015287. 500 DKQME---KEIKDGKLADPMEVQQLQLGVHPEQ----------MPGGAMNPQDPNMGGAPQEGGAEQAPQMPEGGEI 563 (563) Q Consensus 500 ~~qie---~E~~~~~~~~p~~~~~~~~~~~~~~----------~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 563 (563) .-+.+ .....+.-..|+..+.......+.. ..+|..+|..|.. +..+.-+.+.-.+- T Consensus 464 ~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~q~~~~~~~~~~~g~~~~~~~~~-------~~~~~~~~~~~~~~ 533 (535) T protein:vir:10 464 IGSAENFINATGFGQPNVPDSSDDSGSTLGERERQERIQHSKDYEKGKDDPKSPLP-------KPSESDDVSNNEDA 533 (535) T ss_pred ccchhhcccccccccccCCCCCCCccccCCccccCcccccccccccCCCCCCCCCC-------cCCCCCcccccccc Confidence 00000 0000111111111110000000000 0111111111110 00001111111111 No 180 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=90.67 E-value=0.02 Score=29.94 Aligned_cols=417 Identities=10% Similarity=0.067 Sum_probs=193.7 Q ss_pred chhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEecCC Q lcl|NC_015287. 7 FKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSDQN 86 (563) Q Consensus 7 f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d~~ 86 (563) .+.+-- +.+...++.-.+-++ --...+.|..--.+.++ .-..++-..||... +=+.-||+.+++=..+ +-. T Consensus 1 m~~~~i-~~L~~~~~~~~~r~~-~~~~yy~g~~~~~~~~~----~~p~~~~~~~~~v~--nw~~~~Vd~~a~rl~~-~Gf 71 (422) T protein:vir:97 1 MNYMGM-GYLRRKLALFKTGVD-KRYRYYAMDDRDDTRSI----VMPNNVREMYRSVL--EWTAKGVDSLADRIIF-REF 71 (422) T ss_pred CChHHH-HHHHHHHHHHHHHHH-HHHHHHhcCCChhhcCc----cccHHHHHHHHhhc--chhHHHHHHHHhcccc-cee Confidence 000000 000000000000000 00001112111101111 00122222223222 2334566655541111 110 Q ss_pred CcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceee Q lcl|NC_015287. 87 DSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLV 166 (563) Q Consensus 87 ~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~v 166 (563) ...| .++..|.+..+|+....+.++.=++.||-|.-.- .++.+|...++.++|+.+.-+ T Consensus 72 ------~~~d------------~~l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~---~~~~~~~p~i~~~sp~~~~~i 130 (422) T protein:vir:97 72 ------TNDD------------FNAWEIFKANNPDIFFDTAIQSALIASCCFVYIM---PGAEDGLPKMQVIEASKATGI 130 (422) T ss_pred ------eCCc------------hhHHHHHHhcChHHHHHHHHHHHHHhcceeEEEe---eCCCCCeeEEEEechhhEEEE Confidence 1111 1356678888999999999999999999776432 223457788999999998776 Q ss_pred eeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeecceecCCC Q lcl|NC_015287. 167 RKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNK 246 (563) Q Consensus 167 R~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~ 246 (563) +- .........+.+. ... -.+....+.|-+.........+ +......+.-. .+| .|.|+|..-..... T Consensus 131 ~D-~~~~~~~~a~~~~-~~~-----~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~g--~vP--vv~~~n~~~~~~~~ 198 (422) T protein:vir:97 131 LD-PTTFLLTEGYAIL-ESD-----SNGNPTLEAYFTDKDIWYYPKK-GKPYNIKNPTG--HPL--LVPIIHRPDAVRPF 198 (422) T ss_pred Ee-CCCCcceeeEEEE-Eec-----CCCcEEEEEEEcCceEEEEcCC-CccccccCCCC--Ccc--eEEecccCCCcccc Confidence 41 1111111111111 000 0111122222221111111000 00000000000 011 34444443222221 Q ss_pred CccchhH-HHHHHHHHH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccc Q lcl|NC_015287. 247 QTILSYL-HKSIKALNQ-LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDR 324 (563) Q Consensus 247 ~~~~syL-~~Aik~~Nq-L~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~ 324 (563) | .|-+ +..+...+- -+.|.++++.=...=.|.|-++=+|--.-|..+-.. T Consensus 199 G--~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~-------------------------- 250 (422) T protein:vir:97 199 G--RSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYVLGMDPDAKPMEKWRA-------------------------- 250 (422) T ss_pred C--ccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcccCcccccCchhhh-------------------------- Confidence 1 2222 222222221 256788888889999999988765532212211111 Q ss_pred cccchhhhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHHH Q lcl|NC_015287. 325 KHMSMLEDFWLPRREGGRGTEITTLPGGQNLG-ELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTK 403 (563) Q Consensus 325 ~~msMlEDywLpRReGgrgTEIsTLpGg~nLg-ei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~K 403 (563) .|-.=..+|.-+.|-+.+|..++++. |+ =++-++-.-..+....++|.+-|..... |--.+..|.-.|....+ T Consensus 251 ---~~~~i~~~~~de~~~~~~v~q~~~~~-l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~--NpsSa~Ai~a~~~~L~~ 324 (422) T protein:vir:97 251 ---TVSTLLEISKDEDGDKPTVGQFTTAS-MAPFMEHLKMYASLFAGGSGLTLDDLGFPSD--NPSSVESIKAAHENLRA 324 (422) T ss_pred ---hhhhhhccCCCCCCCcceeeecCCCC-hhHHHHHHHHHHHHHhcccCCCHHHhccccC--chhHHHHHHHHHHHHHH Confidence 11123456776777778898888765 33 2334444445555566999998843211 10123367777788999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCC--hhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhh Q lcl|NC_015287. 404 FVGRMRKRFSHLFNDMLKTQLILKNIVT--PEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSL 481 (563) Q Consensus 404 fI~rLr~rFs~if~d~Lk~qLiLkgI~t--~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~ 481 (563) -+.+.|+.|..-+..++|.-+.+.|-.. +++|. .+.+.|. .++= .++..+.+..+.+..+..-+=.+.+. T Consensus 325 ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~----~~~~~w~-p~~~---~~~~s~a~~aDa~~Kl~~a~~~~~~~ 396 (422) T protein:vir:97 325 AGRKAQRSFSSGFLNVAYIAVCLRDEFPYLRNQFM----DTVIKWE-PLFE---ADANMLTLVGDGAIKLNQAIPGFMDA 396 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCcccchhhc----cceEEEc-cCCC---CChHHHHHHHHHHHHHHhhccccccH Confidence 9999999999999999999887766443 33443 3566665 1211 12333455666555443322234444 Q ss_pred HHHHHHHhccChhhHHHHHHHHHHHhcCC Q lcl|NC_015287. 482 DYVRRKILKQKDEEVIEIDKQMEKEIKDG 510 (563) Q Consensus 482 ~~i~k~Il~~tdeei~e~~~qie~E~~~~ 510 (563) +.+++ .|..++.+++ ...++++..+| T Consensus 397 ~~~~~-~lg~~~~~~~--~~~~~~~~~d~ 422 (422) T protein:vir:97 397 DVIRD-LTGVKGADKP--IPAITEVTTDG 422 (422) T ss_pred HHHHH-HcCCCchhHH--HHHHHhhhccC Confidence 65544 4567664433 34455555555 No 181 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=89.43 E-value=0.026 Score=29.24 Aligned_cols=407 Identities=9% Similarity=-0.005 Sum_probs=198.7 Q ss_pred chhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEecCC Q lcl|NC_015287. 7 FKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSDQN 86 (563) Q Consensus 7 f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d~~ 86 (563) .+..-- +.+...++.-.+-+. --...+.|..--.+.+ ..-..++-..||... +=+.-||+.+++=..+ +-. T Consensus 1 ~~~~~i-~~L~~~~~~~~~r~~-~~~~yY~g~~~~~~~~----~~~p~~~~~~~~~v~--nw~~~iVds~a~rl~~-~Gf 71 (409) T protein:vir:16 1 MTEKGI-GYLRFKLSVHKRRAE-MRYEQYAMKHVDRFKG----ITIPQALSQQYRSIL--GWCAKGVDSLADRLVF-REF 71 (409) T ss_pred CCHHHH-HHHHHHHHHHhHHHH-HHHHHHhccCchhhcc----hhhhHHHHHHHhhhc--ChhHHHHHHhHhhccc-ccc Confidence 000000 000000000000000 0000111111000111 111123434455433 3334456655442111 111 Q ss_pred CcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceee Q lcl|NC_015287. 87 DSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLV 166 (563) Q Consensus 87 ~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~v 166 (563) . .+| +++..|....+|+.+..+.++.=++.||-|- +|-+. .+|-..++.++|+.+.-+ T Consensus 72 ~------~~d------------~~l~~i~~~N~ld~~~~~~~~~al~yG~sf~--~v~~~--~dg~~~i~~~sP~~~~~i 129 (409) T protein:vir:16 72 E------NDD------------FTVNEIFEENNPDIFFDSTVLSALIASCSFT--YISKG--ENDAVRLQVIEATNATGI 129 (409) T ss_pred c------Ccc------------hHHHHHHHhcChhHHHHHHHHHHHHhCceeE--EEecC--CCCceEEEEEcccceEEE Confidence 0 011 2466788889999999999999999999665 33332 356788999999988766 Q ss_pred eeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeecceecCCC Q lcl|NC_015287. 167 RKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNK 246 (563) Q Consensus 167 R~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~ 246 (563) .- .........+.+.... ..+....+.+|-|..+....-.+ +......+... .+| .|.|++.--.+.. T Consensus 130 ~D-~~~~~~~~a~~~~~~d-----~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~g--~vP--vV~f~n~~~~~~~- 197 (409) T protein:vir:16 130 ID-PITGLLTEGYAVLERD-----ENNNVVLEAHFLPDRTDYYYRDS-RNNISIANPTG--NPL--LVPIIHRPDAVRP- 197 (409) T ss_pred ee-cccccceeeeEEEEec-----CCCceEEEEEEecCcEEEEEecC-ccccceecCCC--Ccc--eEEeccccccccc- Confidence 31 1111111111111100 00111223334332222111100 00000111110 111 4445544222211 Q ss_pred CccchhHHH-HHHHHHH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccc Q lcl|NC_015287. 247 QTILSYLHK-SIKALNQ-LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDR 324 (563) Q Consensus 247 ~~~~syL~~-Aik~~Nq-L~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~ 324 (563) .+.|-+-. .+...+- -+.+.+.++.=...=.|.|-++=+|...-|..+=..+ T Consensus 198 -~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~------------------------- 251 (409) T protein:vir:16 198 -FGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEPMETWKAT------------------------- 251 (409) T ss_pred -CCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCCccchhhhh------------------------- Confidence 12333322 1111121 2567888888889999999888765432222211001 Q ss_pred cccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHHHH Q lcl|NC_015287. 325 KHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTKF 404 (563) Q Consensus 325 ~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~Kf 404 (563) +-.=..+|.-+.|-+.+|..++++.=-+=++-++-.-..+....++|.+-|.... -|-..+..|.-.|....+- T Consensus 252 ----~~~i~~~~~d~~g~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~--~NpsSa~Ai~a~~~~L~~k 325 (409) T protein:vir:16 252 ----VSSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVS--DNPSSVEAIKASHENLRLA 325 (409) T ss_pred ----hhHhhccCCCCCCCCceEEecCCCChhHHHHHHHHHHHHHhhhcCCCHHHccccc--CchhHHHHHHHHHHHHHHH Confidence 1111235666667778898898864222255677777788888899999984221 1222344677778889999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHH Q lcl|NC_015287. 405 VGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYV 484 (563) Q Consensus 405 I~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i 484 (563) +.|.|+.|..-+..++|.-+.+.|-... +......+.+.|.. +. -.+..-+.++.+.+..+..- |+.+.-.-+ T Consensus 326 a~~k~~~fg~~l~~~~rla~~~~~~~~~--~~~~~~~~~v~W~~-~~---~~~~~s~a~~aDa~~Kl~~a-~~~~~~~~v 398 (409) T protein:vir:16 326 GRKAQRSLGAGLLNVAYLAACLRDDVPY--LREQFSKTKPKWEP-LF---EADASMLSLIGDGAIKLNQA-IPEFINKDT 398 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCc--cchhhccceEEecC-CC---CcchhhHHHHHHHHHHHHhh-cccccchhH Confidence 9999999999999999998887664321 22222456777752 21 12233345666666665553 544544444 Q ss_pred HHHHhccChhh Q lcl|NC_015287. 485 RRKILKQKDEE 495 (563) Q Consensus 485 ~k~Il~~tdee 495 (563) -.+.|.+++.| T Consensus 399 ~~~~~g~~~~d 409 (409) T protein:vir:16 399 IRDLTGIKGAE 409 (409) T ss_pred HHHhccCCCCC Confidence 56667888877 No 182 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=85.75 E-value=0.05 Score=27.69 Aligned_cols=442 Identities=13% Similarity=0.110 Sum_probs=181.6 Q ss_pred hhcccccccccCcCCCCcccceeccccccccccc---c--chhhhHHHHHHHHHHHh-hccchhHHHHHhhcceEEecCC Q lcl|NC_015287. 13 DEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDI---D--GVFKSEFELIKRYREMA-LHPEVDSAIEDIINEAIVSDQN 86 (563) Q Consensus 13 ~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~---~--~~~~~~~~LI~~YR~ma-~~PEvd~Ai~eIvneaiv~d~~ 86 (563) +..-.. -+.|....+ . +.+.|....+.-. + ..++ ....++.|++|. .++.|.++++.+..-+.-. T Consensus 1 ~~~~~~--~~~p~~~~g--~-~~~~~~~~~~~~~~~~e~~~~lr-~~~~~~ly~~m~e~D~~i~s~l~~rk~av~~~--- 71 (469) T protein:vir:10 1 MTERVK--TAAPVSEAG--Y-VFGSGVVDGWTVWDPFEQTPELQ-WPQSVAVYSRMDNEDSRVTSLLEAISLPIRST--- 71 (469) T ss_pred CCCccc--CCCCccchh--h-hhhcccccchhhccccccccccc-cccchHHHHHHHhhChHHHHHHHHHHHHHhcC--- Confidence 111111 122221111 1 2222222111111 1 1122 135688999995 6999999999998743322 Q ss_pred CcEEEEEeeccccchHHHHHHHHHHHHHHHH-------------hcchhhhHHHHHHHhhhhhhheeeeecCCC-CCcce Q lcl|NC_015287. 87 DSPVEIDLENLPASKQLKDLIREEFKTVKEI-------------LNFDDKCHEILRNWYVDGRIYYHKVIDIKA-PEEGI 152 (563) Q Consensus 87 ~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-------------l~F~~~~~~~~R~WYvDGri~fhkviD~~~-~k~GI 152 (563) +..|.-- .-++.+.+.+.+.+...+.. .+|.....+++-..+-.|--++++|..... ..+|- T Consensus 72 --~w~v~p~--~~~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~dG~ 147 (469) T protein:vir:10 72 --PWRIRAN--GASDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPDGR 147 (469) T ss_pred --CceEecC--CCCHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCCCc Confidence 1222211 22455666666665544321 124455555566667788888999887543 23565 Q ss_pred eEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeeccc Q lcl|NC_015287. 153 KEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPD 232 (563) Q Consensus 153 ~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~d 232 (563) ..+..|-||.=+.+..+..... ++....+.. .+.+ ....... ..+...+.++.. T Consensus 148 ~~~~~l~~rp~~~i~~~~~~~~--~~l~~~~~~----------------~~~~---~~~~~~~-----~~~~~~~~lp~~ 201 (469) T protein:vir:10 148 FWLRKLAPRPQWTISKFNVAPD--GGLESIEQI----------------APPA---RTRGSLY-----VANIAPPEIPVN 201 (469) T ss_pred eeeeeeeecCcccceeeeeccC--Cceeeeeec----------------Cccc---ccccccc-----cCCCCccccccC Confidence 5555555554333333333222 111111100 0000 0000000 000112334333 Q ss_pred -ceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHh-cCccceEEEccCCCCchHHHHHHHHHHHHhhcce Q lcl|NC_015287. 233 -AISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLS-RAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNK 310 (563) Q Consensus 233 -aI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~-RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk 310 (563) -|+|.|.-- ...-...|-|.+|..+|-=.+......+.+=-. =.|-| |...+.|.-...| +-|.+++..+++- T Consensus 202 k~i~~~~~~~--~g~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~-vgky~~~a~~~ek--~~l~~a~~~~~~g 276 (469) T protein:vir:10 202 RLVVYTRNKR--PGQWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIP-VGTASSATDEDEV--RKMAALARSVRGG 276 (469) T ss_pred cEEEEEecCC--CCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcce-EEecCCCCCHHHH--HHHHHHHHHHhcC Confidence 577887732 112223688999988887666666555554332 34554 5566666544433 3344444444331 Q ss_pred eEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCc-chHHHHHHHHHHHHHhcCCChhhccCCCcceecc Q lcl|NC_015287. 311 LVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNL-GELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLG 389 (563) Q Consensus 311 lvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nL-gei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g 389 (563) .+.|-|- ..|++|..+..+.+. .-..=++|..+++-+++--..=-.+++++++.+| T Consensus 277 ----~~a~~ii-------------------p~~~~ie~~ea~g~~~~~~~li~~~d~~Isk~iLG~tlTs~~~gGS~a~~ 333 (469) T protein:vir:10 277 ----INAGVGL-------------------AQGQILELLGVSGNLPDIRRAIEGHDRSIALSGLAHFLNLDGKGGSYALA 333 (469) T ss_pred ----CceEEEc-------------------cCCceEEEeecCCCchHHHHHHHHHHHHHHHHHhcccccccCccchhhHH Confidence 0111111 146788877653322 1223367888887776643331112222223222 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_015287. 390 RSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVA 469 (563) Q Consensus 390 ~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~ 469 (563) ..-. | -|...++...+.++..+..-|-..|+.=| |......-++.|..- | .+.+...+++..+. T Consensus 334 ~vh~----e-v~~d~~~sDa~~i~~tln~~li~~l~~lN------~g~~~~~P~~~~~~~----e-~~~~~~a~~i~~l~ 397 (469) T protein:vir:10 334 SVLE----D-PFTQAVHAYATSICRIANQHIIEDLVDIN------FGVDTPAPVLTFDPI----G-SRQDLTAAAVKLLY 397 (469) T ss_pred HHHH----H-HHHHHHHHHHHHHHHHHHHHHHHHHHHhc------CCCCCCccEEEecCC----C-CcHHHHHHHHHHHH Confidence 2111 1 14445566666666666543333333332 221123345555321 1 22333444444443 Q ss_pred Hhhhhhch-hhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCC Q lcl|NC_015287. 470 AVDPYVGK-YFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQE 548 (563) Q Consensus 470 ~~~p~vGK-y~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~ 548 (563) .+ ++... -++.+|+++.+- +.. |.+.++.....-++..+.....|.......+. T Consensus 398 ~~-G~~~~~~~~~~~~~e~~g-ip~---------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 452 (469) T protein:vir:10 398 DA-GVFDDDPAVKRAIRQRFN-LPS---------------------ELNDTPSAEPEEPAAVPNQSAAPARTRSSGNA-- 452 (469) T ss_pred hc-CCccCccccHHHHHHHhC-CCC---------------------CCCCcccccchhcccCCCCCccccccCCCCCc-- Confidence 22 11001 244566664443 221 11111110000000000000011111100000 Q ss_pred CCcccCCCCCCCCCC Q lcl|NC_015287. 549 GGAEQAPQMPEGGEI 563 (563) Q Consensus 549 ~~~~~~~~~~~~~~~ 563 (563) ..+ +.-+.+.-+++ T Consensus 453 ~~~-~~~~~~~~~~l 466 (469) T protein:vir:10 453 DAR-ARAPKADQGVL 466 (469) T ss_pred ccc-cccCCChHHhh Confidence 000 11111122333 No 183 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=82.82 E-value=0.074 Score=26.80 Aligned_cols=373 Identities=11% Similarity=0.087 Sum_probs=153.6 Q ss_pred Ccccccc-------------hhhHhhhcccccccccCcCCC---Ccccceeccccccc---cccccchhhhHHHHHHHHH Q lcl|NC_015287. 1 MAKLFGF-------------KIQKDDEQNKNVVSPVPQSNE---DSSDYYVSSGFYGQ---YVDIDGVFKSEFELIKRYR 61 (563) Q Consensus 1 m~~~~~f-------------~~~~~~~~~~~~~s~~p~~~~---d~~~~~~~~~~~~~---~~~~~~~~~~~~~LI~~YR 61 (563) .+.|||+ |.+.-+.. .+++-.|... -++.....++..++ +..-.+..-+. + T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~t~-------~ 73 (409) T protein:vir:83 4 WSNLFGIPSIPDLPNDNGPVDYNPGDPD---MVEFRGPEEEPEARALPWIRPTAWSGYPESWATPSWGSAQD-------K 73 (409) T ss_pred hhhhcccccCCCcccccccccccCCCCc---eeeccCCCcchhhhhcccccccccccccccccccCccccch-------h Confidence 5677775 11111111 1111111111 11111111111111 11001111111 4 Q ss_pred HHhhccchhHHHHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcchhhhH----HHHHHHhhhhh Q lcl|NC_015287. 62 EMALHPEVDSAIEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFDDKCH----EILRNWYVDGR 136 (563) Q Consensus 62 ~ma~~PEvd~Ai~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~~~~~----~~~R~WYvDGr 136 (563) .++++|-|..||+-|.+.+--. |+.+- ++.... ++..++++. =|-..++. .++..+.+ |- T Consensus 74 ~~~~~~~v~acV~~Ia~~iA~l-----pl~~~-~~~~~~--------~~~~~ll~~~PN~~~t~~~f~~~l~~~lll-Gn 138 (409) T protein:vir:83 74 LRTLIDVAWACIDLNASVLSSM-----PIYRM-RNGRII--------DSVAWMSNPDPEVYTSWQEFAKQLFWDFQL-GE 138 (409) T ss_pred hHhhhHHHHHHHHHHHHhhccC-----ceEEe-eCCccc--------cchhhhcccCCCCCCCHHHHHHHHHHHHhh-CC Confidence 5677899999999999854332 33322 222211 222222221 00112333 34455555 77 Q ss_pred hheeeee-cCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCcccccccccccc Q lcl|NC_015287. 137 IYYHKVI-DIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAI 215 (563) Q Consensus 137 i~fhkvi-D~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~ 215 (563) -|..+|. |.. .=+++|..|+|..|...+ .++ + ..+|.++. .. T Consensus 139 ay~~~i~r~~~---G~~~~L~pl~p~~v~v~~---~~~----g-----------------~~~y~~~~----------~~ 181 (409) T protein:vir:83 139 AFVLPMAHGSD---GYPIRFRVVPPWLVNVEL---KKG----A-----------------RREYRIGG----------LN 181 (409) T ss_pred cEEEEEEECCC---CcEEEEEEECCcceEEEE---cCC----c-----------------eEEEEEcc----------cc Confidence 7776653 322 127899999998776431 111 1 01222211 00 Q ss_pred CcCccccCcceeeecccceeeeecceec-CCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhc--CccceEEEccCCCCc Q lcl|NC_015287. 216 GQTPFSTKQRPVKIAPDAISFCHSGLVD-RNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSR--APERRIFYIDVGNLP 292 (563) Q Consensus 216 g~~~~~~~~~~iki~~daI~y~hSGl~d-~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~R--APeRRvFyIDvGnlp 292 (563) . .+.|. |--.+. .++..+.|-|+.|...+..-...++... +.+. |--.-|...| +.|- T Consensus 182 -------------~-~~eii--Hir~~~~~~~~~G~spi~~~~~~i~~~~a~~~~~~--~~f~nga~p~gil~~~-~~ls 242 (409) T protein:vir:83 182 -------------V-TDEIL--HIRYQGNTADAHGHGPLESAAPRQVVIGLLQKYVQ--NLAETGGVPLYWLGVE-RRLS 242 (409) T ss_pred -------------C-ccceE--EeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHH--HHHhcCCCcceEeecC-CCCC Confidence 0 12332 322222 2334567889999888887777777543 3333 2223444444 4567 Q ss_pred hHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchH---HHHHHHHHHHH Q lcl|NC_015287. 293 KMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGEL---SDIEYFQKKLY 369 (563) Q Consensus 293 k~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei---~DV~YF~~kLy 369 (563) +.++++..+.....|. .+.|. .+ ++++ |+.-++..++ +..++ +-.+|-.+.+- T Consensus 243 ~e~~~~~~~~~~~~~~------~nag~------~~-il~~-------g~~~~~~~~~----s~~d~q~le~r~~~~~eIa 298 (409) T protein:vir:83 243 ETEAVDLMDRWIESRS------KYAGH------PA-LVTG-------GATLNQAKSM----SAQDLSLMELTQFNEARIA 298 (409) T ss_pred HHHHHHHHHHHHHhhC------CccCc------cc-eecC-------CcccccccCC----CHHHHHHHHHHHhhHHHHH Confidence 7677665555443332 12232 11 2221 2221222122 22222 22334456688 Q ss_pred HhcCCChhhccCCC--cceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEee Q lcl|NC_015287. 370 RSLGVPESRIAGSG--DGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFV 447 (563) Q Consensus 370 ~aL~VP~sRl~~~~--~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~ 447 (563) ++.+||...|.-.+ ++.+... |...-+-|.++ .|+--+. .+.+.|-..|+..| +.|+ |. T Consensus 299 ~~fgVPp~llg~~~~~~~~tysn---~eq~~~~f~~~--tL~P~~~-~ie~~l~~~Ll~~~-----------~~~~--f~ 359 (409) T protein:vir:83 299 ILLGVPPFLVGLPGATGSLTYSN---IEQLFSFHDRS--SLRPKAT-AVMAALDRWALPSP-----------QHLE--LN 359 (409) T ss_pred HHhCCCHHHccCCCCcccccccc---HHHHHHHHHHH--HHHHHHH-HHHHHHHHhhCCCC-----------cEEE--ee Confidence 99999999985221 1111111 22222334432 3333222 23444444554332 2344 43 Q ss_pred ccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCcccccccc Q lcl|NC_015287. 448 YDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHP 527 (563) Q Consensus 448 ~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~ 527 (563) -| ++....+ ..|.+.++.+-.- -+ ||..|+-+ .| |.= |.+..++-. T Consensus 360 ~~----~llr~d~-~~r~~~~~~~~~~--G~------------lT~NE~R~------~~---glp--p~~ggd~l~---- 405 (409) T protein:vir:83 360 RD----DYTRPSL-VERATAYKIMIEA--GV------------MEPNEARA------ME---RLH--SEAAAVRLS---- 405 (409) T ss_pred hh----hhhccCH-HHHHHHHHHHHhC--CC------------cCHHHHHH------Hh---CCC--CCCCCcccC---- Confidence 32 2332222 4566666544322 12 44444431 11 221 212111111 Q ss_pred ccCCC Q lcl|NC_015287. 528 EQMPG 532 (563) Q Consensus 528 ~~~~~ 532 (563) +.++ T Consensus 406 -~~gv 409 (409) T protein:vir:83 406 -GGGV 409 (409) T ss_pred -CCCC Confidence 1111 No 184 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=81.70 E-value=0.083 Score=26.50 Aligned_cols=421 Identities=12% Similarity=0.073 Sum_probs=173.4 Q ss_pred hhHhhhcccccccccCcCCC-Ccccceecccccc-ccccccchhhhHHH--HHHHHHHHhhccchhHHHHHhhcceEEec Q lcl|NC_015287. 9 IQKDDEQNKNVVSPVPQSNE-DSSDYYVSSGFYG-QYVDIDGVFKSEFE--LIKRYREMALHPEVDSAIEDIINEAIVSD 84 (563) Q Consensus 9 ~~~~~~~~~~~~s~~p~~~~-d~~~~~~~~~~~~-~~~~~~~~~~~~~~--LI~~YR~ma~~PEvd~Ai~eIvneaiv~d 84 (563) +++.. ++. +.... ++.. +.+.|.. -..... .+-..++ -++.|++|..++.|.++++.+..-+.-. T Consensus 1 v~~~~------l~~-e~at~~~~~d--~~~~~~~~l~~~~~-~il~~a~~g~~~~y~~l~~D~~i~s~l~~rk~av~~~- 69 (488) T protein:vir:99 1 MEKPA------LGR-EIATSGDGRD--ITRPFISGLQVPND-SILQRRGGNDLRVYEEILSDAQVKTVWGQRQLAVVSR- 69 (488) T ss_pred CCccc------hhH-HHHHHHhhhh--hhccccCCCCCCCh-HHHHhhccCCHHHHHHHhhChHHHHHHHHHHHHHhcC- Confidence 11110 000 00000 0000 0011111 011111 1111111 1689999999999999999998844322 Q ss_pred CCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccce Q lcl|NC_015287. 85 QNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIK 164 (563) Q Consensus 85 ~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~ 164 (563) +..|.--+ . + ..-+++.++...+++-++|+.-..+++.- ...|--++++|...++..-.+..|.+.+|+ T Consensus 70 ----~w~i~p~~-~-~-~~~~~~ae~v~~~l~~~~~~~~l~~~lda-~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~--- 138 (488) T protein:vir:99 70 ----EWKVEAGG-D-R-PIDQAAAEHLEQQLQRVGWDRVTSKMLFG-VFYGYAVSELIYGRDDRYITLEAIKVRNRR--- 138 (488) T ss_pred ----CceEEcCC-C-C-hHHHHHHHHHHHHHhCCCHHHHHHHHHhh-hhhcceeEEEEEeecCCeeeEeeeeeeccc--- Confidence 22222111 1 1 12245666667777777888887777754 347888888888765433333444444443 Q ss_pred eeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeecceecC Q lcl|NC_015287. 165 LVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDR 244 (563) Q Consensus 165 ~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~ 244 (563) +|.|++.+....-..+.. .....+..+..-|++.|. -.. T Consensus 139 ---------------------------------~f~~d~~~~l~~~~~~~~------~~g~~lp~~~~~i~~~~~--~~~ 177 (488) T protein:vir:99 139 ---------------------------------RFRYDQDGGLRLLTPNNM------FEGEPCPAPYFWHFSTGA--DND 177 (488) T ss_pred ---------------------------------ceeecCCCceEEeccCCC------CCccccccCceEEEEeec--CCC Confidence 333322111100000000 000112223333444443 111 Q ss_pred CCCccchhHHHHHHHHHHHHHHHHHHHHHHHh-cCccceEEEcc-CCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccc Q lcl|NC_015287. 245 NKQTILSYLHKSIKALNQLRMIEDSLVIYRLS-RAPERRIFYID-VGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRD 322 (563) Q Consensus 245 ~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~-RAPeRRvFyID-vGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d 322 (563) ..-...|-|.+|..+|--.+......+.+--. =.|-| +...| .|.=+..| .+.++. +..+.+.- | T Consensus 178 g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~-igky~~~~a~~~ek-~~l~~a-v~~~~~~~------~---- 244 (488) T protein:vir:99 178 DEPYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTA-VGRYDDKTATPEDK-AKLLAA-LHAIQTDS------A---- 244 (488) T ss_pred CCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCcee-eeecCCCCCCHHHH-HHHHHH-HHHHhcCc------E---- Confidence 12234689999999988888877777776443 35655 44444 33323322 233333 22222221 1 Q ss_pred cccccchhhhhcccccCCCCccceeecCCCCCcch--HHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHH Q lcl|NC_015287. 323 DRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE--LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIK 400 (563) Q Consensus 323 ~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge--i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElK 400 (563) -.+ | .|++|+.+..+..-+. ..=++|..++.-+++-= .=|.+++++- +.|..=+-+|+ T Consensus 245 --~vi--------P-----~~~~ie~~ea~~~~~~~~~~li~~~d~~Isk~iLG--qtlts~~~~G--s~a~~~vh~~v- 304 (488) T protein:vir:99 245 --IIM--------P-----AGMQAELLEAGRSGTADYKTLHDTMDATIAKVGLG--QVASTQGTPG--RLGNDDLQADV- 304 (488) T ss_pred --EEe--------c-----CCceeEEeecCCCChHHHHHHHHHHHHHHHHHHhh--hhhccccccc--chhhHHHHHHH- Confidence 111 1 3688888865332222 23477888888776421 1133333221 12222233333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhh Q lcl|NC_015287. 401 FTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFS 480 (563) Q Consensus 401 F~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S 480 (563) |...++.-.+.++..|..-|-..|+.=|. .... -..+.+++.+ .|-++.+.+.+..+-+..|-=++ T Consensus 305 ~~d~~~aDa~~i~~tln~~li~~l~~~N~-~~~~----~p~~~~~~~e---------~edl~~~a~~~~~l~~~~G~~i~ 370 (488) T protein:vir:99 305 RLDLVKADADLICESFNLGPARWLTEWNF-PGAQ----PPRVYRVIEE---------PEDITAKAERDEKVFRMSGFRPT 370 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCc-CCcC----CceeEecCCC---------cccHHHHHHHHHHHHhhcCCCCC Confidence 44555555666666665423233333231 1111 1334444443 23334455555555444454467 Q ss_pred hHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccCCCC--- Q lcl|NC_015287. 481 LDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAPQM--- 557 (563) Q Consensus 481 ~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~--- 557 (563) .+|+++++- +..++-.+ + ..-+.|... +..+.. |.++....+. ....+-++.+ T Consensus 371 ~~~i~e~~G-ip~~~~~~-------~---~~~~~~~~~-----------~~~~~~-~~~~~~~~~~-~~~~~~~~~~~~~ 426 (488) T protein:vir:99 371 RGYVQETYG-VEVESTQA-------E---ATAPTPSTE-----------FAEGDQ-PSDPAAAMAP-QLAEAMQPVVGNW 426 (488) T ss_pred HHHHHHHcC-CCCccccc-------c---cccCCCccc-----------CCCCCC-CCCchHHHHH-HHHHHHHHHHHHH Confidence 777775553 33211100 0 000000000 000000 0000000000 0000000000 Q ss_pred --------CCCCCC Q lcl|NC_015287. 558 --------PEGGEI 563 (563) Q Consensus 558 --------~~~~~~ 563 (563) -+++-. T Consensus 427 ~~~i~~~l~~a~s~ 440 (488) T protein:vir:99 427 TTQLRTLIEQASSL 440 (488) T ss_pred HHHHHHHHHhcCCH Confidence 000000 No 185 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=80.05 E-value=0.098 Score=26.10 Aligned_cols=245 Identities=13% Similarity=0.099 Sum_probs=117.3 Q ss_pred ccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEec Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSD 84 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d 84 (563) ||+ +++.++. +..++...........+++.+.... .+. -..-.++|-|..||+.|.+.+.-. T Consensus 1 Mgl-F~~~~~r-----~~~~~~~~~~~~~~~~~~~~~~~~~---~v~--------~~~al~~~~v~~~i~~ia~~iA~l- 62 (251) T protein:vir:46 1 MGI-FYKNEKR-----DLQYNEDDLQMMVQTLPSFQGTKLR---QYK--------DIEAIRHSDIFTAVMMIASDLARM- 62 (251) T ss_pred CCc-ccccccc-----ccCCCccchhhhhhhhccccCcCcc---eec--------hhhhhccHHHHHHHHHHHHhHhhC- Confidence 444 2221111 1122222211111122222221100 010 122356888999999998865332 Q ss_pred CCCcEEEEEeeccccchHHHHHHHHHHHHHH-HHhcchhhhHHHH----HHHhhhhhhheeeeecCCCCCcceeEEEEeC Q lcl|NC_015287. 85 QNDSPVEIDLENLPASKQLKDLIREEFKTVK-EILNFDDKCHEIL----RNWYVDGRIYYHKVIDIKAPEEGIKEVRYID 159 (563) Q Consensus 85 ~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~-~ll~F~~~~~~~~----R~WYvDGri~fhkviD~~~~k~GI~eLr~lD 159 (563) |+.+- .+.+... ....-.++ .--+-...+++++ ..++..|.-|.-++.|.. .-+++|.+|+ T Consensus 63 ----p~~~~-~~~~~~~------~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~---G~~~~L~~i~ 128 (251) T protein:vir:46 63 ----PIRVT-VNGQINY------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT---GEPMNLTFRK 128 (251) T ss_pred ----ceEEe-eCccccc------cchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC---CcEEEEEEEC Confidence 33331 1111110 11112222 1223444455554 455666988888877643 2389999999 Q ss_pred CccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeec Q lcl|NC_015287. 160 PLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHS 239 (563) Q Consensus 160 P~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hS 239 (563) |..|..++. ++ +.+ .|+.+.... + .......++.+.|.+.. T Consensus 129 ~~~v~v~~~---~~---g~~-----------------~~~~~~~~~----~-----------~~g~~~~~~~~diiH~r- 169 (251) T protein:vir:46 129 TSEIELKSD---AR---GRL-----------------YYFHQRIDS----N-----------GNNIERNVKFEDMLDIK- 169 (251) T ss_pred CceEEEEEC---CC---CcE-----------------EEEEEEecc----C-----------CcceeEEECCccEEEec- Confidence 988765521 11 111 121111100 0 00113566777774443 Q ss_pred ceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHH-HHHHHHHHHhhcceeEeeCCCC Q lcl|NC_015287. 240 GLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKA-EQYLKEVMNRYRNKLVYDASTG 318 (563) Q Consensus 240 Gl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KA-eqYl~~im~k~knklvYd~~TG 318 (563) .++.++-.+.|-|+.|..++......++...-+=---+--+-+..++ |.|...+| ++..+.....|.-- .+.| T Consensus 170 -~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~~e~~~~~~~~~~~~~~g~----~n~g 243 (251) T protein:vir:46 170 -FYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFPKVLVEL----NKLG 243 (251) T ss_pred -CcCCCCeeecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeC-CCCCCHHHHHHHHHHHHHHhcCc----cccc Confidence 35666667789999999999888888876554433334456677777 56644444 44444444444311 2334 Q ss_pred cccccccccchhh Q lcl|NC_015287. 319 EIRDDRKHMSMLE 331 (563) Q Consensus 319 ev~d~~~~msMlE 331 (563) .+-. .|-| T Consensus 244 ~~~~-----gm~~ 251 (251) T protein:vir:46 244 KLSY-----SMNQ 251 (251) T ss_pred cccc-----ccCC Confidence 4332 1222 No 186 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=79.88 E-value=0.1 Score=26.06 Aligned_cols=426 Identities=12% Similarity=0.037 Sum_probs=179.5 Q ss_pred Ccc--------cccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHH Q lcl|NC_015287. 1 MAK--------LFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSA 72 (563) Q Consensus 1 m~~--------~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~A 72 (563) |+. ...++..+. .....+.. ...+.++ .++|. .....++-++..-..++.|++|..++.|.++ T Consensus 1 m~~~i~~~~g~p~~~~~~~~--~~~~~ia~----~~~~~~~-~~~~~--~~~~~~~iLr~~~~~~~~y~~m~~D~~i~s~ 71 (491) T protein:vir:10 1 MSKGLWVSPTEFVTFGEPDK--SLSSQIAT----RARSIDF-FALGM--YLPNPDPVLKALGKDIRVYRELRADAHVGGC 71 (491) T ss_pred CCCceeCCCCCccCcccCCh--HHHHHHHh----hhccccc-ccccC--CccchHHHHHhcCCCHHHHHHHhhChHHHHH Confidence 221 011100000 00000000 0000000 01111 1223333333333347899999999999999 Q ss_pred HHHhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcce Q lcl|NC_015287. 73 IEDIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGI 152 (563) Q Consensus 73 i~eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI 152 (563) ++.+..-+.-. +..|.--+ -++ ++.++-..+++-++|+.-..+++.- ...|--.+++|...++..-.+ T Consensus 72 l~~Rk~av~~~-----~w~i~~~~--~~~----~~~e~v~e~l~~~~~~~~l~~~lda-~~~G~s~~Ei~w~~~~g~~~~ 139 (491) T protein:vir:10 72 VRRRKAAVKAL-----EWGLDRGK--AKS----RVAKSIADVFADLDLSRIVTEMLDA-VLYGYQPMEITWGKVGNYIVP 139 (491) T ss_pred HHHHHHHHhCC-----CcEEecCC--CCH----HHHHHHHHHHhcCCHHHHHHHHHHh-hhhcceeEEEEEeecCCeeEE Confidence 99998744322 22222111 122 3334444455556788777777653 347777788877754333334 Q ss_pred eEEEEeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeee-cc Q lcl|NC_015287. 153 KEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKI-AP 231 (563) Q Consensus 153 ~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki-~~ 231 (563) ..+.+++|+ +|.|++.+.......+.+ ..++.+ +. T Consensus 140 ~~l~~r~~~------------------------------------~f~~d~~~~l~~~~~~~~--------~~g~~l~~~ 175 (491) T protein:vir:10 140 IDVVGKPAD------------------------------------WFVYDPENQLRFRSKDHW--------MQGEELPAR 175 (491) T ss_pred EEeeeeccc------------------------------------ceeeccCCceEEecCCCC--------CCcceecCC Confidence 444455543 333332211111100000 011222 33 Q ss_pred cceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHh-cCccceEEEccCCCCchHHHHHHHHHHHHhhcce Q lcl|NC_015287. 232 DAISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLS-RAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNK 310 (563) Q Consensus 232 daI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~-RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk 310 (563) .-|+|+|.-- ...-...|-|..|..+|--.+......+.+=-. =.|-|- ...|.|.-.+.|. +.++.+ ....+. T Consensus 176 k~i~~~~~~~--~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~i-gky~~~a~~~ek~-~l~~al-~~~~~~ 250 (491) T protein:vir:10 176 KFLVPRQEAT--YLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLV-GKHPRSASDGEKN-LLLDCL-EDMVQD 250 (491) T ss_pred CEEEEEecCC--CCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEE-EecCCCCCHHHHH-HHHHHH-HHHhcC Confidence 4677777632 112234688999999888888766665555433 445544 4447776444432 233332 223222 Q ss_pred eEeeCCCCcccccccccchhhhhcccccCCCCccceeecCCCCCcchH----HHHHHHHHHHHHhcCCChhhccCCCc-c Q lcl|NC_015287. 311 LVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGEL----SDIEYFQKKLYRSLGVPESRIAGSGD-G 385 (563) Q Consensus 311 lvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei----~DV~YF~~kLy~aL~VP~sRl~~~~~-g 385 (563) - | -. +| .|++|+.+.-+.+-|.. .=++|..++.-+++-= .=|.++++ + T Consensus 251 a------~------~v--------iP-----~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLG--qtlTt~~~gs 303 (491) T protein:vir:10 251 A------V------AV--------VP-----DDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIALLG--QNQTTEATST 303 (491) T ss_pred c------E------EE--------ec-----CCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHHhh--hhcccCcccc Confidence 1 1 11 12 46889888543333332 2388998887776531 11333322 1 Q ss_pred eeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHH Q lcl|NC_015287. 386 FNLGRSSEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERL 465 (563) Q Consensus 386 ~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~ 465 (563) ..+ +++-.+ -+...++..++..+..+..+++--+.+++... ...+|.|.... |..+.+. T Consensus 304 ~a~---~~vh~~--v~~di~~~D~~~i~~tln~li~~l~~~N~~~~--------~~p~f~~~~~~--------e~~~~~a 362 (491) T protein:vir:10 304 RAS---AQAGLE--VTDDIRDGDKAVVSEAMNMLIRWICDLNFDGA--------DRPVFDMWEQE--------QVDEIQA 362 (491) T ss_pred hhH---HHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC--------CcceEEecCcC--------chhHHHH Confidence 222 232222 25666677777777777776655555554321 23455554321 2223344 Q ss_pred HHHHHhhhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccc Q lcl|NC_015287. 466 GVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGA 545 (563) Q Consensus 466 ~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~ 545 (563) +.+..+.+. |==++.+|+++++- +...+..+ .+.+.+ .....+. .......+|.++..+.+ T Consensus 363 ~~~~~L~~~-G~~i~~~~i~e~~G-ip~~~~~~-----------~~~~~~--~~~~~~~----~~~~~~~~~~~~~~d~~ 423 (491) T protein:vir:10 363 GRDQKLTQA-GARFTPAYFKRAYN-LQDGDLDE-----------RPLPVS--AVDTVGA----ASFAEFEAPDQDALDAA 423 (491) T ss_pred HHHHHHHhC-CCcCCHHHHHHHhC-CCCCCcCc-----------cccccC--CCCCccc----ccccccCCCCCCchHHH Confidence 444444443 43477788776653 22211110 001000 0000000 00000000111100000 Q ss_pred cCCCCcccCCCC------------CCCCCC Q lcl|NC_015287. 546 PQEGGAEQAPQM------------PEGGEI 563 (563) Q Consensus 546 ~~~~~~~~~~~~------------~~~~~~ 563 (563) .....++++-+. .+++.. T Consensus 424 ~~~~~~~~~~~~~~~~~~~i~~~l~~~~s~ 453 (491) T protein:vir:10 424 LNTLSARDLNADAQALVAPLLKRIANGASA 453 (491) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCH Confidence 000000000000 000000 No 187 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=79.44 E-value=0.1 Score=25.96 Aligned_cols=404 Identities=8% Similarity=0.009 Sum_probs=194.1 Q ss_pred chhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEecCC Q lcl|NC_015287. 7 FKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSDQN 86 (563) Q Consensus 7 f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d~~ 86 (563) .+..-- ..+.+.++.-.+-+. --...+.|-.--.+.+ ..-..++-..||... +=+.-||+.+++=..+ +-. T Consensus 1 ~~~~~i-~~L~~~~~~~~~r~~-~~~~yY~g~~~~~~~~----~~~p~~~~~~~~~v~--nw~~~iVds~a~rl~~-~Gf 71 (409) T protein:vir:94 1 MTEKGI-GYLRFKLSVHKRRAE-MRYDQYAMKYVDRFKG----ITIPQALSQQYRSIL--GWCAKGVDSLADRLVF-REF 71 (409) T ss_pred CCHHHH-HHHHHHHHHHhHHHH-HHHHHhcccCchhhcC----hhhhHHHHHHHhhhc--chhHHHHHHhHhhccc-Ccc Confidence 000000 000000000000000 0000011111000111 111233444555443 3344466655542111 111 Q ss_pred CcEEEEEeeccccchHHHHHHHHHHHHHHHHhcchhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEEEeCCccceee Q lcl|NC_015287. 87 DSPVEIDLENLPASKQLKDLIREEFKTVKEILNFDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYIDPLKIKLV 166 (563) Q Consensus 87 ~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lDP~~i~~v 166 (563) . .+| +++..|....+|+.+..+.++.=++.||-|.- |. +. .+|-..++.++|+.+--+ T Consensus 72 ~------~~d------------~~l~~i~~~N~ld~~~~~~~~~aliyG~sf~~-v~-~~--~dg~~~i~~~sp~~~~~i 129 (409) T protein:vir:94 72 E------NDD------------FTVNEIFEENNPDIFFDSAVLSSLIASCSFTY-IS-KG--ENDAVRLQVIEAVNATGI 129 (409) T ss_pred c------CCc------------hHHHHHHHhcChhHHHHHHHHHHHHhcceeEE-Ee-cC--CCCceEEEEeccceEEEE Confidence 0 011 24667888889999999999999999996652 33 22 356678899999887655 Q ss_pred eeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeecceecCCC Q lcl|NC_015287. 167 RKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHSGLVDRNK 246 (563) Q Consensus 167 R~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hSGl~d~~~ 246 (563) .- +..+.....++-.... .........+|-|..+......+ ++.....+... .+| .|.|++..-.+... T Consensus 130 ~D----~~~~~~~~a~~~~~~d--~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~n~~g--~vP--vV~f~n~~~~~~~~ 198 (409) T protein:vir:94 130 ID----PITGLLTEGYAVLERD--ENNNVVLEAHFLPDRTDYYYRDS-RNNISIANPTG--HPL--LVPIIHRPDAVRPF 198 (409) T ss_pred Ee----cCCCceeeeEEEEEec--CCCceEEEEEEecCcEEEEEecC-ceeEeeeCCCC--Ccc--eEEecccccccccc Confidence 32 1111111111100000 01111222333332222111000 00000000000 111 34455442222221 Q ss_pred CccchhHHH-HHHHHHH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccc Q lcl|NC_015287. 247 QTILSYLHK-SIKALNQ-LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDR 324 (563) Q Consensus 247 ~~~~syL~~-Aik~~Nq-L~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~ 324 (563) +.|-+-. .+...+- -+.+.+.++.=...=.|.|-++=+|...-|..+-..++. T Consensus 199 --G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~----------------------- 253 (409) T protein:vir:94 199 --GRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEPMETWKATVS----------------------- 253 (409) T ss_pred --CccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCcccchhhhhHH----------------------- Confidence 1233322 1111121 256788899999999999998866543222221111111 Q ss_pred cccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHHHH Q lcl|NC_015287. 325 KHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTKF 404 (563) Q Consensus 325 ~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~Kf 404 (563) .=..+|.-+.|-+.+|..++++.=-+=++-++=.-..+...-++|.+-|.... -|-..+..|.-.|....+- T Consensus 254 ------~i~~~~~d~dg~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~t~lP~~~lg~~~--~NpsSa~Al~a~~~~L~~~ 325 (409) T protein:vir:94 254 ------SMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVS--DNPSSVEAIKASHENLRLA 325 (409) T ss_pred ------HhhcCCCCCCCCCceEEecCCCChhHHHHHHHHHHHHHhhhcCCCHHHhcccc--CchhHHHHHHHHHHHHHHH Confidence 11335666666778898898865222245566666777788899998884321 1212233566677778889 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCC--hhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhch-hhhh Q lcl|NC_015287. 405 VGRMRKRFSHLFNDMLKTQLILKNIVT--PEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGK-YFSL 481 (563) Q Consensus 405 I~rLr~rFs~if~d~Lk~qLiLkgI~t--~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGK-y~S~ 481 (563) +.|.|+.|..-+.+++|.-+.+.|-.. +++| ..+.+.|. ++.-++ +--+.++.+.+..+..- |. +.+. T Consensus 326 a~~k~~~fg~~~~~~~rla~~i~~~~~~~~~~~----~~~~v~W~-p~~~~~---~~~~a~~aDa~~Kl~~a-g~~~~~~ 396 (409) T protein:vir:94 326 GRKAQRSLGAGLLNVAYLAACLRDDAPYLREQF----RKTKPKWE-PLFEAD---ASMLSLIGDGAIKLNQA-IPEFINK 396 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCcccccc----ccceEEec-cCCCcc---hHHHHHHHHHHHHHHHh-cccccch Confidence 999999999999999998777665443 3444 34677776 332222 23345666666655553 33 3333 Q ss_pred HHHHHHHhccChhh Q lcl|NC_015287. 482 DYVRRKILKQKDEE 495 (563) Q Consensus 482 ~~i~k~Il~~tdee 495 (563) + +-.+.|++++.| T Consensus 397 ~-~~~~~lG~~~~d 409 (409) T protein:vir:94 397 D-TIRDLTGIEGGE 409 (409) T ss_pred h-HHHHHcCCCCCC Confidence 4 555667788877 No 188 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=67.32 E-value=0.25 Score=23.85 Aligned_cols=485 Identities=11% Similarity=0.090 Sum_probs=184.4 Q ss_pred CcccccchhhHhhhcccc-c-ccccCcCC---CCcccceeccccccccccccchhhhHHHHHHHHHHHhh-ccchhHHHH Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKN-V-VSPVPQSN---EDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMAL-HPEVDSAIE 74 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~-~-~s~~p~~~---~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~-~PEvd~Ai~ 74 (563) |+ .++.+.+... + ..+..+.+ .+.+.....-+++..-..+++-. +-.. -+.|+. .|=+..||+ T Consensus 1 ~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~-~~~~----L~~~~e~~~~~~~~i~ 69 (651) T protein:vir:99 1 MT------DTTGETQETKVHVEGLGGEADLAKSPNSTQIPDHRIQSHNVGVNPPY-NPDR----LAAFLELNETLATGIR 69 (651) T ss_pred CC------CccceeeeeEEEeecccccccccccccccccchhhhcccCCCCCCCC-CHHH----HHHHHhcChHHHHHHH Confidence 21 1111100000 0 00000000 11111111223322222233321 3333 456666 888999999 Q ss_pred HhhcceEEecCCCcEEEEEeeccccchHHHHHHHHHHHH-------HHHHhcchhhhHHHHHHHhhh----hhhheeeee Q lcl|NC_015287. 75 DIINEAIVSDQNDSPVEIDLENLPASKQLKDLIREEFKT-------VKEILNFDDKCHEILRNWYVD----GRIYYHKVI 143 (563) Q Consensus 75 eIvneaiv~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~-------i~~ll~F~~~~~~~~R~WYvD----Gri~fhkvi 143 (563) .+.+......-+-.|- .+++....++..+++..+..+. ..+-++|+....+++++-.-| |--|.+.+. T Consensus 70 ~~~~~iag~g~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIr 148 (651) T protein:vir:99 70 KKSRYEVGFGFDLVPA-QGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLT 148 (651) T ss_pred HHhhhhhccCceeeec-ccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhh Confidence 9888654332221121 2223334555556555444322 223356777777777654433 556677666 Q ss_pred cCCCCCcceeEEEEeCCccceeeeeeeccCCccccceeeeccccccccc---------cccccee-eecCcc-ccc---- Q lcl|NC_015287. 144 DIKAPEEGIKEVRYIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIEN---------PEIEEYY-QYDPSQ-TQG---- 208 (563) Q Consensus 144 D~~~~k~GI~eLr~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~---------~~~~eyf-~y~~~~-~~~---- 208 (563) +... + +..|.+++|..++.-+.....+.....+...+- +.....+ ....-|| +|.... ... T Consensus 149 n~~g--~-pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~p-n~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~ 224 (651) T protein:vir:99 149 DIEG--R-PVGLAYVPARTVRVRRPQNRFDQPRHPEEGRYV-DGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVID 224 (651) T ss_pred cCcc--c-hhhhhhcChhheeeecccccccchhhhhhhccc-ccccchhHHHHHHHHHhcCcceEEEeeccccceeeeec Confidence 5332 2 567888888766543221110000000000000 0000000 0000011 110000 000 Q ss_pred ----------cccc----------cccCcCccccCcceeeecccceeeeecceec-CCCCccchhHHHHHHHHHHHHHHH Q lcl|NC_015287. 209 ----------KNAL----------GAIGQTPFSTKQRPVKIAPDAISFCHSGLVD-RNKQTILSYLHKSIKALNQLRMIE 267 (563) Q Consensus 209 ----------~~~~----------g~~g~~~~~~~~~~iki~~daI~y~hSGl~d-~~~~~~~syL~~Aik~~NqL~m~E 267 (563) .... ...|............++.+.|.|. =..+ .++-.++|-|..|+..+.....++ T Consensus 225 ~~~~~v~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~eViHi--r~~~~~~g~~G~spl~~a~~~i~~a~~a~ 302 (651) T protein:vir:99 225 ESGDEPTIRYREDEESEREPIFVDRETGDVTTGDANGLENRPANELIFI--PNPSILEDDYGVPDWVSAIRTISADEAAK 302 (651) T ss_pred cCCcceeEEeccCcceeeeeecccceeeeEEEcCCCceeEecccceEEe--cCCCCCCCcccccHHHHHHHHHHHHHHHH Confidence 0000 0000000011122344555666333 2223 344466899999999999988888 Q ss_pred HHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCCcccccccccchhhh---h-cccccCCCCc Q lcl|NC_015287. 268 DSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTGEIRDDRKHMSMLED---F-WLPRREGGRG 343 (563) Q Consensus 268 DalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TGev~d~~~~msMlED---y-wLpRReGgrg 343 (563) +...-+=---+--+-|+.+.-+.|-+..++.. ++-+++.. . +.| +.| .|+- . +++. +.| T Consensus 303 ~~~~~~f~NG~~p~gil~~~~~~ls~e~~~~l-r~~~~~~~-----~-nag------k~~-vL~~~~~~~~~~~---~~g 365 (651) T protein:vir:99 303 DYNRDFFDNDTIPRMVIKVTGGELSEESKRDL-RQMLNGLR-----E-ESH------RAV-VLEVEKFQSQLDE---DVE 365 (651) T ss_pred HHHHHHHhccCCCceEEEecCCCCCHHHHHHH-HHHHHHHh-----c-cCC------ceE-Eeecccccccccc---cCC Confidence 87664444446667788876566655554443 44343211 1 112 111 1110 0 0000 113 Q ss_pred cceeecCCCCCcc---h-HHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHHHHHHHH-HHHHHHHHHHHHHHH Q lcl|NC_015287. 344 TEITTLPGGQNLG---E-LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRDEIKFTK-FVGRMRKRFSHLFND 418 (563) Q Consensus 344 TEIsTLpGg~nLg---e-i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRDElKF~K-fI~rLr~rFs~if~d 418 (563) .++..|- ..+. | ++=-+|....+-++.+||.+.|...+ ..+ .|+ +.-.-+.|.. -+.-+..++...+.. T Consensus 366 ~~~~pls--~~~~~D~qfle~r~~~~~eIa~afgVPp~~lG~~~-~~~--~sn-~E~~~~~f~~~tL~P~~~~ie~eln~ 439 (651) T protein:vir:99 366 IELEPMG--QGISEEMDFRQFREKNEHEIAKVLEVPPVKIGVTD-SAN--RSN-SDQQDKDFALEVIQPEQHTFAEWLYQ 439 (651) T ss_pred ceEEEcC--cCchhhHHHHHHHHHHHHHHHHHhCCCHHHhccCC-CCC--ccc-HHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3333331 2111 1 12225667779999999999995322 112 111 2222233433 345555555444433 Q ss_pred HHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhhchhhhhHHHHHHHhccChhhHHH Q lcl|NC_015287. 419 MLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYVGKYFSLDYVRRKILKQKDEEVIE 498 (563) Q Consensus 419 ~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vGKy~S~~~i~k~Il~~tdeei~e 498 (563) . |+ +..+... ...|.++|..+. +... -...|.+.++.+-. .-++|..-+++. +.+. T Consensus 440 k----Ll-----~~~e~~~-~~~i~~ef~~~~----llr~-D~~~~~e~~~~~i~--~G~~T~NE~R~~-lglp------ 495 (651) T protein:vir:99 440 I----IH-----QQALGVT-DWTIEYELRGAD----QPKQ-EAQLAEQRVRAMRL--AGVGLVDEAREE-LGLD------ 495 (651) T ss_pred h----hc-----Ccccccc-CceEEEEeccch----hhhc-cHHHHHHHHHHHHh--CCCcCHHHHHHH-hCCC------ Confidence 2 22 2221111 134666666433 1111 11234444433221 134555544432 2222 Q ss_pred HHHHHHHHhcCCCCCCCcccCccc--cccccccCCCCcCCCCCcccccccCCCCcccCCCCCC---CCCC Q lcl|NC_015287. 499 IDKQMEKEIKDGKLADPMEVQQLQ--LGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQAPQMPE---GGEI 563 (563) Q Consensus 499 ~~~qie~E~~~~~~~~p~~~~~~~--~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~---~~~~ 563 (563) ++++|....-.. +-...+....|.+++..++ .++.++.+..++..++. =.|. T Consensus 496 ------------pi~~~~gd~~l~~~~~~~~g~~~~gge~~~~~~-~~~~~~~~~~e~~~~~~~~~~~e~ 552 (651) T protein:vir:99 496 ------------PLGEPYGEMTLSEFEAEVAGDVAGGGETEAVHE-PPEENKIGEREWDTVKSELTTKDP 552 (651) T ss_pred ------------CCCCccccccccccccccccccccCCCCccccc-Cccccccccchhhhhhhhhcccch Confidence 222211100000 0000011111111222211 11112222222222211 1222 No 189 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=61.62 E-value=0.35 Score=23.08 Aligned_cols=395 Identities=11% Similarity=0.103 Sum_probs=156.8 Q ss_pred ccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEec Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSD 84 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d 84 (563) ||| +...-...... ..... .+++......|++ ++.. -+++|.|-.||+.|.+.+.-. T Consensus 1 m~~-~~~~~~~~~~~-~~~~~-~~~~~~~~~~g~~----~~~~---------------Al~~~~V~~cv~~ia~~iA~l- 57 (417) T protein:vir:38 1 MKL-FRGLATEVDPH-WADHL-LDSGVIPSFRGGY----LGIS---------------ALRNSDVLTAVSIVSGDVSRF- 57 (417) T ss_pred Ccc-ccccccCCCcc-chhhh-cccccccccCCce----echh---------------hcccHHHHHHHHHHHHhhccC- Confidence 333 21100000000 00000 0111111122332 2210 147889999999999866532 Q ss_pred CCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcchhhhHHH----HHHHhhhhhhheeeeecCCCCCcceeEEEEeC Q lcl|NC_015287. 85 QNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFDDKCHEI----LRNWYVDGRIYYHKVIDIKAPEEGIKEVRYID 159 (563) Q Consensus 85 ~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~~~~~~~----~R~WYvDGri~fhkviD~~~~k~GI~eLr~lD 159 (563) |+.+--.+.+ ..++ ......+|+. =+=...+.++ +....+.|.-|..++.|... .-+.+|.+|. T Consensus 58 ----p~~~~~~~~~--~~~~---~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g--~~~~~l~~l~ 126 (417) T protein:vir:38 58 ----PLVITDSSTD--EVID---LANIEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPIT--NEPAMFEFYA 126 (417) T ss_pred ----eeEEEEcCCc--ceec---cchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCC--CEEEEEEEeC Confidence 4443221111 1011 1112222221 1222334443 45556778888877665431 2277889999 Q ss_pred CccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccce-eeee Q lcl|NC_015287. 160 PLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAI-SFCH 238 (563) Q Consensus 160 P~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI-~y~h 238 (563) |..+...+. + . +. -+|.|.... + .....++.+.| ++. T Consensus 127 p~~v~v~~~---~-~--~~------------------~~y~~~~~~----------~-------~~~~~~~~~dviH~r- 164 (417) T protein:vir:38 127 PSQTQVDTS---D-P--DN------------------IIYRFTPYN----------S-------SMQKVCGFEDVIHWK- 164 (417) T ss_pred CceEEEEEc---C-C--Ce------------------EEEEEEEcC----------C-------cEEEEecCcceEEec- Confidence 988764421 1 0 11 122232210 0 11123444444 333 Q ss_pred cceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCC Q lcl|NC_015287. 239 SGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTG 318 (563) Q Consensus 239 SGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TG 318 (563) .++.++-.++|-|..|.+++.....++....=+=-.=+.-+-|+..+ |.|.+.++++..+.+-..|.- .+.| T Consensus 165 --~~~~d~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~-~~l~~e~~~~~~~~~~~~~~g-----~n~g 236 (417) T protein:vir:38 165 --FFSYDTIMGRSPLLSLGDEIGLQESGVSTLQKFFKSGLKGSIIKAKE-SRLSAEARQKIREDFERAQAG-----ADAG 236 (417) T ss_pred --CCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC-CCCCHHHHHHHHHHHHHHhcc-----cccC Confidence 34556666789999999998887777775543222224445566655 556666655543333222211 1223 Q ss_pred cccccccccchhhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhh Q lcl|NC_015287. 319 EIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE---LSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEIL 395 (563) Q Consensus 319 ev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eIt 395 (563) .+ + .+ + .|.+.+.|. -+..+ ++--+|-...+.++++||.+.|...++ .+.+. T Consensus 237 ~~------~-vl-----~-----~g~~~~~l~--~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~------~s~~e 291 (417) T protein:vir:38 237 SP------I-IV-----D-----ATMDYQPLE--VDTNVLNLINSNNYSTAQIAKALRVPAYRLAQNSP------NQSVK 291 (417) T ss_pred Cc------e-ec-----c-----CCceEEEcc--CCHHHHHHHHHHHhhHHHHHHHhCCCHHHhCCCCc------chhHH Confidence 22 1 11 1 144555542 22222 233344467788999999999953211 11122 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_015287. 396 RDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDPYV 475 (563) Q Consensus 396 RDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~v 475 (563) ..-+.|.+. .|+-.+.. +.+.|-.. ++++.++.. +.+.|..++ +.... +.++...++ T Consensus 292 ~~~~~~~~~--tl~P~~~~-ie~~l~~~-----Ll~~~~~~~----~~~~fd~~~----l~~~~----~~~~~~~~~--- 348 (417) T protein:vir:38 292 QLADDYIRN--DLPFYFEP-ITSEFELK-----LLDDAQRHQ----YCIGFDTKS----VNGLP----IADVNTAVN--- 348 (417) T ss_pred HHHHHHHHH--HHHHHHHH-HHHHHHhh-----hcChhhccc----ceEEechhh----hhHHH----HHHHHHHHh--- Confidence 222223332 34332222 22333333 345555532 344453222 11110 111111111 Q ss_pred chhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccC--cc-----------ccccccccCCCCcCCCCCccc Q lcl|NC_015287. 476 GKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQ--QL-----------QLGVHPEQMPGGAMNPQDPNM 542 (563) Q Consensus 476 GKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~--~~-----------~~~~~~~~~~~~~~~p~~~~~ 542 (563) .-+++.+-+|+ + ..-++++.++.+. .+ ...+......+|+. +.+. T Consensus 349 ~G~~T~NE~R~-~------------------~gl~pi~~g~~d~~~~~~n~~~~d~~~~~~~~~~~~~kgg~~---~~~~ 406 (417) T protein:vir:38 349 GGLWTGNEGRA-E------------------LGKKPLKDPNMDRIQSTLNTVFLDQKEAYQAEHAAELKGGDT---NAKG 406 (417) T ss_pred CCCcCHHHHHH-H------------------hCCCCCCCCCCCeeeecccccccccccccccccccccCCCCC---CCCC Confidence 12333333332 1 2233443332111 00 00000111111111 1111 Q ss_pred ccccCCCCccc Q lcl|NC_015287. 543 GGAPQEGGAEQ 553 (563) Q Consensus 543 ~~~~~~~~~~~ 553 (563) .+..+...+-+ T Consensus 407 ~~~~~~~~~~~ 417 (417) T protein:vir:38 407 NQNGSGTNANS 417 (417) T ss_pred CCcCCCCcCCC Confidence 11111111111 No 190 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=53.16 E-value=0.54 Score=22.07 Aligned_cols=378 Identities=13% Similarity=0.091 Sum_probs=133.7 Q ss_pred ccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEec Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSD 84 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d 84 (563) ||| +++--+..+ ++..- .+.-.+ .++- . +...++|-|.+||+-|.+.+..+ T Consensus 1 Mg~-f~~lf~~~~---~~~~~----------~~~~~~--~~v~----~--------~~~~~~~~v~~~i~~Ia~~iA~~- 51 (395) T protein:vir:10 1 MSI-LEKIFKTRK---DITYM----------LDLDMI--EDLS----Q--------QAYVKRLAIDSCIEFVARAVAQS- 51 (395) T ss_pred Cch-hhhhhccCc---ccccc----------ccchhc--cccc----h--------hhhhhhHHHHHHHHHHHHhhccc- Confidence 777 333211110 11000 000000 0110 0 12357789999999999977654 Q ss_pred CCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcchhhhHH----HHHHHhhhhhhheeeeecCCCCCcceeEEEEeC Q lcl|NC_015287. 85 QNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFDDKCHE----ILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYID 159 (563) Q Consensus 85 ~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~~~~~~----~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lD 159 (563) |+.+--++ . .+......+++. =+=.-.+++ ++...+.+|. ||-.+.+. .| +..++ T Consensus 52 ----p~~~~~~~-~-------~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~-~~~~~~~~----~~---~~~~~ 111 (395) T protein:vir:10 52 ----HFKVLEGN-R-------IQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNE-VLIVVSDS----KE---LLIAD 111 (395) T ss_pred ----eeEeccCC-c-------cccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCc-eEEEEecC----CC---eEecC Confidence 23222111 1 111222222221 112223333 3344445554 55443332 12 44566 Q ss_pred CccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeec Q lcl|NC_015287. 160 PLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHS 239 (563) Q Consensus 160 P~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hS 239 (563) |..+...+.. +.. +..+..+ . ++. ...++.+.|.|.. T Consensus 112 ~~~~~~~~~~--~~~--------------------~~~~~~~-~-----------~~~--------~~~~~~~evih~~- 148 (395) T protein:vir:10 112 SFYREEYALY--DDI--------------------FKDVTVK-D-----------YTY--------QRTFTMQEVIYLK- 148 (395) T ss_pred CccceeEeec--Ccc--------------------eeEEEEc-C-----------cee--------eeeeccccEEEEc- Confidence 5444333211 100 0001111 0 000 1235566664442 Q ss_pred ceecCC-CCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCC Q lcl|NC_015287. 240 GLVDRN-KQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTG 318 (563) Q Consensus 240 Gl~d~~-~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TG 318 (563) +.... ...+.|-|+.|.+...... . -++- -+--+-+..+..+. ...++.+=+++.+.++.+-. ++..+ T Consensus 149 -~~~~~~~~~G~spi~~~~~~~~~~~---~---~~~~-~~~~~gii~~~~~~-~~~e~~~~~~~~~~~~~~~~--~~~~~ 217 (395) T protein:vir:10 149 -YNNNKVTHFVESLFEDYGKIFGRMI---G---AQLK-NYQIRGILKSASSA-YDEKNIEKLQAFTNKLFNTF--NKNQL 217 (395) T ss_pred -cCCCCcccccchHHHHHHHHHHHHH---H---HHHh-cCCCceEEEeCCCC-CCHHHHHHHHHHHHHHhccc--cccCc Confidence 12222 2234565666555443211 1 1221 12223444455443 34444444554444433211 00000 Q ss_pred cccccccccchhhhhcccccCCCCccceeec---CCCCCcchHHHH---HHHHHHHHHhcCCChhhccCCCcceeccccc Q lcl|NC_015287. 319 EIRDDRKHMSMLEDFWLPRREGGRGTEITTL---PGGQNLGELSDI---EYFQKKLYRSLGVPESRIAGSGDGFNLGRSS 392 (563) Q Consensus 319 ev~d~~~~msMlEDywLpRReGgrgTEIsTL---pGg~nLgei~DV---~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~ 392 (563) ..+. ++ .|.+++.| |-..++.+++=+ ++-.+.+.++.+||.+.|.+. .+ T Consensus 218 ------~v~~-l~----------~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~--------~s 272 (395) T protein:vir:10 218 ------AIAP-LI----------EGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYGE--------TA 272 (395) T ss_pred ------ceEE-cC----------CCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcCc--------cc Confidence 0000 11 12333322 222233222222 356677999999999999422 11 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_015287. 393 EILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVD 472 (563) Q Consensus 393 eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~ 472 (563) .+...-..|.+. .|+- +...+.+.|-..| .++.+|.. .++|++ ..+-... +.+|.++++.+- T Consensus 273 n~e~~~~~~~~~--~l~P-~~~~ie~~l~~kL-----~~~~~~~~---~~~f~~------~~l~~~D-~~~~~~~~~~~~ 334 (395) T protein:vir:10 273 DLEKNTLVFEKF--CLTP-LLKKIQNELNAKL-----ITQSMYLK---DTRIEI------VGVNKKD-PLQYAEAIDKLV 334 (395) T ss_pred CHHHHHHHHHHH--HHHH-HHHHHHHHHHHhh-----cChhhhcc---cceecc------hhhhccC-HHHHHHHHHHHH Confidence 122222335542 2322 1222333333333 44444432 233333 2222111 235566554432 Q ss_pred hhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcc Q lcl|NC_015287. 473 PYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAE 552 (563) Q Consensus 473 p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 552 (563) .- -+ +|..|+- + ...-++++.|..+.---+. +.. ...++.+.+ .. T Consensus 335 ~~--G~------------lt~NE~R---~----~~g~~p~~~g~~d~~~~~~---n~~--~~~~~~~~~---------~~ 379 (395) T protein:vir:10 335 SS--GS------------FTRNEVR---I----MLGEEPSDNPELDEYLITK---NYE--KANSGENDE---------KE 379 (395) T ss_pred hC--CC------------cCHHHHH---H----HhCCCCCCCCCCceeeecc---ccc--ccccccccc---------Cc Confidence 21 22 3443332 1 1233344333111100000 000 000111111 11 Q ss_pred cCCCCCCCCCC Q lcl|NC_015287. 553 QAPQMPEGGEI 563 (563) Q Consensus 553 ~~~~~~~~~~~ 563 (563) .+...++||+- T Consensus 380 ~~~~~~kgg~~ 390 (395) T protein:vir:10 380 KDENTLKGGDE 390 (395) T ss_pred ccccccCCCCC Confidence 11223344544 No 191 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=53.16 E-value=0.54 Score=22.07 Aligned_cols=378 Identities=13% Similarity=0.091 Sum_probs=133.7 Q ss_pred ccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEec Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSD 84 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d 84 (563) ||| +++--+..+ ++..- .+.-.+ .++- . +...++|-|.+||+-|.+.+..+ T Consensus 1 Mg~-f~~lf~~~~---~~~~~----------~~~~~~--~~v~----~--------~~~~~~~~v~~~i~~Ia~~iA~~- 51 (395) T protein:vir:10 1 MSI-LEKIFKTRK---DITYM----------LDLDMI--EDLS----Q--------QAYVKRLAIDSCIEFVARAVAQS- 51 (395) T ss_pred Cch-hhhhhccCc---ccccc----------ccchhc--cccc----h--------hhhhhhHHHHHHHHHHHHhhccc- Confidence 777 333211110 11000 000000 0110 0 12357789999999999977654 Q ss_pred CCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcchhhhHH----HHHHHhhhhhhheeeeecCCCCCcceeEEEEeC Q lcl|NC_015287. 85 QNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFDDKCHE----ILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYID 159 (563) Q Consensus 85 ~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~~~~~~----~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lD 159 (563) |+.+--++ . .+......+++. =+=.-.+++ ++...+.+|. ||-.+.+. .| +..++ T Consensus 52 ----p~~~~~~~-~-------~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~-~~~~~~~~----~~---~~~~~ 111 (395) T protein:vir:10 52 ----HFKVLEGN-R-------IQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNE-VLIVVSDS----KE---LLIAD 111 (395) T ss_pred ----eeEeccCC-c-------cccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCc-eEEEEecC----CC---eEecC Confidence 23222111 1 111222222221 112223333 3344445554 55443332 12 44566 Q ss_pred CccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeec Q lcl|NC_015287. 160 PLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHS 239 (563) Q Consensus 160 P~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hS 239 (563) |..+...+.. +.. +..+..+ . ++. ...++.+.|.|.. T Consensus 112 ~~~~~~~~~~--~~~--------------------~~~~~~~-~-----------~~~--------~~~~~~~evih~~- 148 (395) T protein:vir:10 112 SFYREEYALY--DDI--------------------FKDVTVK-D-----------YTY--------QRTFTMQEVIYLK- 148 (395) T ss_pred CccceeEeec--Ccc--------------------eeEEEEc-C-----------cee--------eeeeccccEEEEc- Confidence 5444333211 100 0001111 0 000 1235566664442 Q ss_pred ceecCC-CCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCC Q lcl|NC_015287. 240 GLVDRN-KQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTG 318 (563) Q Consensus 240 Gl~d~~-~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TG 318 (563) +.... ...+.|-|+.|.+...... . -++- -+--+-+..+..+. ...++.+=+++.+.++.+-. ++..+ T Consensus 149 -~~~~~~~~~G~spi~~~~~~~~~~~---~---~~~~-~~~~~gii~~~~~~-~~~e~~~~~~~~~~~~~~~~--~~~~~ 217 (395) T protein:vir:10 149 -YNNNKVTHFVESLFEDYGKIFGRMI---G---AQLK-NYQIRGILKSASSA-YDEKNIEKLQAFTNKLFNTF--NKNQL 217 (395) T ss_pred -cCCCCcccccchHHHHHHHHHHHHH---H---HHHh-cCCCceEEEeCCCC-CCHHHHHHHHHHHHHHhccc--cccCc Confidence 12222 2234565666555443211 1 1221 12223444455443 34444444554444433211 00000 Q ss_pred cccccccccchhhhhcccccCCCCccceeec---CCCCCcchHHHH---HHHHHHHHHhcCCChhhccCCCcceeccccc Q lcl|NC_015287. 319 EIRDDRKHMSMLEDFWLPRREGGRGTEITTL---PGGQNLGELSDI---EYFQKKLYRSLGVPESRIAGSGDGFNLGRSS 392 (563) Q Consensus 319 ev~d~~~~msMlEDywLpRReGgrgTEIsTL---pGg~nLgei~DV---~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~ 392 (563) ..+. ++ .|.+++.| |-..++.+++=+ ++-.+.+.++.+||.+.|.+. .+ T Consensus 218 ------~v~~-l~----------~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~--------~s 272 (395) T protein:vir:10 218 ------AIAP-LI----------EGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYGE--------TA 272 (395) T ss_pred ------ceEE-cC----------CCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcCc--------cc Confidence 0000 11 12333322 222233222222 356677999999999999422 11 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_015287. 393 EILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVD 472 (563) Q Consensus 393 eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~ 472 (563) .+...-..|.+. .|+- +...+.+.|-..| .++.+|.. .++|++ ..+-... +.+|.++++.+- T Consensus 273 n~e~~~~~~~~~--~l~P-~~~~ie~~l~~kL-----~~~~~~~~---~~~f~~------~~l~~~D-~~~~~~~~~~~~ 334 (395) T protein:vir:10 273 DLEKNTLVFEKF--CLTP-LLKKIQNELNAKL-----ITQSMYLK---DTRIEI------VGVNKKD-PLQYAEAIDKLV 334 (395) T ss_pred CHHHHHHHHHHH--HHHH-HHHHHHHHHHHhh-----cChhhhcc---cceecc------hhhhccC-HHHHHHHHHHHH Confidence 122222335542 2322 1222333333333 44444432 233333 2222111 235566554432 Q ss_pred hhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcc Q lcl|NC_015287. 473 PYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAE 552 (563) Q Consensus 473 p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 552 (563) .- -+ +|..|+- + ...-++++.|..+.---+. +.. ...++.+.+ .. T Consensus 335 ~~--G~------------lt~NE~R---~----~~g~~p~~~g~~d~~~~~~---n~~--~~~~~~~~~---------~~ 379 (395) T protein:vir:10 335 SS--GS------------FTRNEVR---I----MLGEEPSDNPELDEYLITK---NYE--KANSGENDE---------KE 379 (395) T ss_pred hC--CC------------cCHHHHH---H----HhCCCCCCCCCCceeeecc---ccc--ccccccccc---------Cc Confidence 21 22 3443332 1 1233344333111100000 000 000111111 11 Q ss_pred cCCCCCCCCCC Q lcl|NC_015287. 553 QAPQMPEGGEI 563 (563) Q Consensus 553 ~~~~~~~~~~~ 563 (563) .+...++||+- T Consensus 380 ~~~~~~kgg~~ 390 (395) T protein:vir:10 380 KDENTLKGGDE 390 (395) T ss_pred ccccccCCCCC Confidence 11223344544 No 192 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=53.16 E-value=0.54 Score=22.07 Aligned_cols=378 Identities=13% Similarity=0.091 Sum_probs=133.7 Q ss_pred ccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEec Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSD 84 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d 84 (563) ||| +++--+..+ ++..- .+.-.+ .++- . +...++|-|.+||+-|.+.+..+ T Consensus 1 Mg~-f~~lf~~~~---~~~~~----------~~~~~~--~~v~----~--------~~~~~~~~v~~~i~~Ia~~iA~~- 51 (395) T protein:vir:95 1 MSI-LEKIFKTRK---DITYM----------LDLDMI--EDLS----Q--------QAYVKRLAIDSCIEFVARAVAQS- 51 (395) T ss_pred Cch-hhhhhccCc---ccccc----------ccchhc--cccc----h--------hhhhhhHHHHHHHHHHHHhhccc- Confidence 777 333211110 11000 000000 0110 0 12357789999999999977654 Q ss_pred CCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcchhhhHH----HHHHHhhhhhhheeeeecCCCCCcceeEEEEeC Q lcl|NC_015287. 85 QNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFDDKCHE----ILRNWYVDGRIYYHKVIDIKAPEEGIKEVRYID 159 (563) Q Consensus 85 ~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~~~~~~----~~R~WYvDGri~fhkviD~~~~k~GI~eLr~lD 159 (563) |+.+--++ . .+......+++. =+=.-.+++ ++...+.+|. ||-.+.+. .| +..++ T Consensus 52 ----p~~~~~~~-~-------~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~-~~~~~~~~----~~---~~~~~ 111 (395) T protein:vir:95 52 ----HFKVLEGN-R-------IQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNE-VLIVVSDS----KE---LLIAD 111 (395) T ss_pred ----eeEeccCC-c-------cccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCc-eEEEEecC----CC---eEecC Confidence 23222111 1 111222222221 112223333 3344445554 55443332 12 44566 Q ss_pred CccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeeeec Q lcl|NC_015287. 160 PLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFCHS 239 (563) Q Consensus 160 P~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~hS 239 (563) |..+...+.. +.. +..+..+ . ++. ...++.+.|.|.. T Consensus 112 ~~~~~~~~~~--~~~--------------------~~~~~~~-~-----------~~~--------~~~~~~~evih~~- 148 (395) T protein:vir:95 112 SFYREEYALY--DDI--------------------FKDVTVK-D-----------YTY--------QRTFTMQEVIYLK- 148 (395) T ss_pred CccceeEeec--Ccc--------------------eeEEEEc-C-----------cee--------eeeeccccEEEEc- Confidence 5444333211 100 0001111 0 000 1235566664442 Q ss_pred ceecCC-CCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCCC Q lcl|NC_015287. 240 GLVDRN-KQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDASTG 318 (563) Q Consensus 240 Gl~d~~-~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~TG 318 (563) +.... ...+.|-|+.|.+...... . -++- -+--+-+..+..+. ...++.+=+++.+.++.+-. ++..+ T Consensus 149 -~~~~~~~~~G~spi~~~~~~~~~~~---~---~~~~-~~~~~gii~~~~~~-~~~e~~~~~~~~~~~~~~~~--~~~~~ 217 (395) T protein:vir:95 149 -YNNNKVTHFVESLFEDYGKIFGRMI---G---AQLK-NYQIRGILKSASSA-YDEKNIEKLQAFTNKLFNTF--NKNQL 217 (395) T ss_pred -cCCCCcccccchHHHHHHHHHHHHH---H---HHHh-cCCCceEEEeCCCC-CCHHHHHHHHHHHHHHhccc--cccCc Confidence 12222 2234565666555443211 1 1221 12223444455443 34444444554444433211 00000 Q ss_pred cccccccccchhhhhcccccCCCCccceeec---CCCCCcchHHHH---HHHHHHHHHhcCCChhhccCCCcceeccccc Q lcl|NC_015287. 319 EIRDDRKHMSMLEDFWLPRREGGRGTEITTL---PGGQNLGELSDI---EYFQKKLYRSLGVPESRIAGSGDGFNLGRSS 392 (563) Q Consensus 319 ev~d~~~~msMlEDywLpRReGgrgTEIsTL---pGg~nLgei~DV---~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~ 392 (563) ..+. ++ .|.+++.| |-..++.+++=+ ++-.+.+.++.+||.+.|.+. .+ T Consensus 218 ------~v~~-l~----------~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~--------~s 272 (395) T protein:vir:95 218 ------AIAP-LI----------EGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYGE--------TA 272 (395) T ss_pred ------ceEE-cC----------CCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcCc--------cc Confidence 0000 11 12333322 222233222222 356677999999999999422 11 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_015287. 393 EILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVD 472 (563) Q Consensus 393 eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~ 472 (563) .+...-..|.+. .|+- +...+.+.|-..| .++.+|.. .++|++ ..+-... +.+|.++++.+- T Consensus 273 n~e~~~~~~~~~--~l~P-~~~~ie~~l~~kL-----~~~~~~~~---~~~f~~------~~l~~~D-~~~~~~~~~~~~ 334 (395) T protein:vir:95 273 DLEKNTLVFEKF--CLTP-LLKKIQNELNAKL-----ITQSMYLK---DTRIEI------VGVNKKD-PLQYAEAIDKLV 334 (395) T ss_pred CHHHHHHHHHHH--HHHH-HHHHHHHHHHHhh-----cChhhhcc---cceecc------hhhhccC-HHHHHHHHHHHH Confidence 122222335542 2322 1222333333333 44444432 233333 2222111 235566554432 Q ss_pred hhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcc Q lcl|NC_015287. 473 PYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAE 552 (563) Q Consensus 473 p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 552 (563) .- -+ +|..|+- + ...-++++.|..+.---+. +.. ...++.+.+ .. T Consensus 335 ~~--G~------------lt~NE~R---~----~~g~~p~~~g~~d~~~~~~---n~~--~~~~~~~~~---------~~ 379 (395) T protein:vir:95 335 SS--GS------------FTRNEVR---I----MLGEEPSDNPELDEYLITK---NYE--KANSGENDE---------KE 379 (395) T ss_pred hC--CC------------cCHHHHH---H----HhCCCCCCCCCCceeeecc---ccc--ccccccccc---------Cc Confidence 21 22 3443332 1 1233344333111100000 000 000111111 11 Q ss_pred cCCCCCCCCCC Q lcl|NC_015287. 553 QAPQMPEGGEI 563 (563) Q Consensus 553 ~~~~~~~~~~~ 563 (563) .+...++||+- T Consensus 380 ~~~~~~kgg~~ 390 (395) T protein:vir:95 380 KDENTLKGGDE 390 (395) T ss_pred ccccccCCCCC Confidence 11223344544 No 193 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=52.28 E-value=0.56 Score=21.97 Aligned_cols=390 Identities=13% Similarity=0.141 Sum_probs=147.4 Q ss_pred ccc-hhhHhhhc-ccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEE Q lcl|NC_015287. 5 FGF-KIQKDDEQ-NKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIV 82 (563) Q Consensus 5 ~~f-~~~~~~~~-~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv 82 (563) ||| ||-...-. .+..+....+. +.+.. ..+ ..+. +..+.+|.|..||..|.+.+.- T Consensus 1 mg~~~~~~~~~~~~~~~~~~~~~~----------~~~~~----~~~--~~t~------~~~~~~~~v~~cv~~Ia~~ia~ 58 (403) T protein:vir:10 1 MGFKSWITEKLNPGQRIIRDMEPV----------SHRTN----RKP--FTTG------QAYSKIEILNRTANMVIDSAAE 58 (403) T ss_pred Ccchhhhhhccchhhhhhhccccc----------ccccC----Ccc--cccH------HHHHHHHHHHHHHHHHHHHHhh Confidence 666 44432211 11111111111 00010 000 0000 2234678899999999885543 Q ss_pred ecCCCcEEEEEeeccccchHHHHHHHHHHHHHHHH-hcchhhhHH----HHHHHhhhhhhheeeeecCCCCCcceeEEEE Q lcl|NC_015287. 83 SDQNDSPVEIDLENLPASKQLKDLIREEFKTVKEI-LNFDDKCHE----ILRNWYVDGRIYYHKVIDIKAPEEGIKEVRY 157 (563) Q Consensus 83 ~d~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~l-l~F~~~~~~----~~R~WYvDGri~fhkviD~~~~k~GI~eLr~ 157 (563) + |+.|- +.......-+..+...+.++|+. =+=...+.+ ++-.++..|.-|.++ + | .+|.. T Consensus 59 ~-----p~~v~-~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~--~------~-~~l~~ 123 (403) T protein:vir:10 59 C-----SYTVG-DKYNIVTYANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYW--D------G-TSLYH 123 (403) T ss_pred C-----ceeEe-ecccccccccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEE--e------C-ceeEe Confidence 2 22221 11111111111222344555543 222233333 344455778776543 2 1 25677 Q ss_pred eCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecc-cceee Q lcl|NC_015287. 158 IDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAP-DAISF 236 (563) Q Consensus 158 lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~-daI~y 236 (563) |.|..+... . + . ++ ...+|.|... +.+.. +.|++ T Consensus 124 l~~~~~~v~---~-~-~--~~----------------~~~~~~~~~~----------------------~~~~~~eiih~ 158 (403) T protein:vir:10 124 VPAALMQVE---A-D-A--NK----------------FIKKFIFNNQ----------------------INYRVDEIIFI 158 (403) T ss_pred ecCcceEEE---E-c-C--Cc----------------eEEEEEecCc----------------------eeecccceEEe Confidence 776554321 1 1 0 11 1112333110 01112 22333 Q ss_pred eecceec--CCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEee Q lcl|NC_015287. 237 CHSGLVD--RNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYD 314 (563) Q Consensus 237 ~hSGl~d--~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd 314 (563) .+..++. .++-.+.|-+..|++.+..-..++....=+--.-+.-.-|...+ +.|-+..+++.-+.+-..|... T Consensus 159 ~~~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~l~~e~~~~~~~~~~~~~~g~---- 233 (403) T protein:vir:10 159 KDNSYVCGTNSQISGQSRVATVIDSLEKRSKMLNFKEKFLDNGTVIGLILETD-EILNKKLRERKQEELQLDYNPS---- 233 (403) T ss_pred cccccccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeC-CCCCHHHHHHHHHHHHHHhCCc---- Confidence 2222211 12334467788888877777666665432222223334455554 4566555544443333333210 Q ss_pred CCCCcccccccccchhhhhcccccCCCCccceeecC---CCCCcchHHHHHHHHHHHHHhcCCChhhccCCCcceecccc Q lcl|NC_015287. 315 ASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLP---GGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRS 391 (563) Q Consensus 315 ~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTLp---Gg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~ 391 (563) .+.|.+ + .|+ | |.+++.|. --..+.-++..+|..+.+.++++||..-|... . . T Consensus 234 ~n~g~~------~-vl~--------~--g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~-~------~ 289 (403) T protein:vir:10 234 TGQSSV------L-ILD--------G--GMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGG-N------N 289 (403) T ss_pred ccCcce------e-ecC--------C--CceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCC-C------C Confidence 112221 1 121 1 33444332 12244457778889999999999999998421 1 1 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015287. 392 SEILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAV 471 (563) Q Consensus 392 ~eItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~ 471 (563) +.+...-+.|.+++ |+-.+.. +.+.|-..| ...+.+++........ + ...|++.++.+ T Consensus 290 sn~e~~~~~f~~~t--l~P~~~~-ie~~l~~~L--------------~~~~~~d~~~~~~l~~--D---~~~~~~~~~~~ 347 (403) T protein:vir:10 290 ANIRPNIELFYYMT--IIPMLNK-LTSSLTFFF--------------GYKITPNTKEVAALTP--D---KEAEAKHLTSL 347 (403) T ss_pred cCHHHHHHHHHHHH--HHHHHHH-HHHHHHHhc--------------Cceeeeccchhhhccc--C---HHHHHHHHHHH Confidence 22333334466543 3322222 222232222 1234444432111100 0 12344433322 Q ss_pred hhhhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCc Q lcl|NC_015287. 472 DPYVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGA 551 (563) Q Consensus 472 ~p~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~ 551 (563) -.- -+ +|..|+- + ...-++.++|... . -..|.+-...+....++. T Consensus 348 ~~~--G~------------lT~NE~R---~----~~gl~pi~~~~~d---~-----------~~~p~n~~~~~~~~~~~e 392 (403) T protein:vir:10 348 VNN--GI------------ITGNEAR---S----ELNLEPLDDEQMN---K-----------IRIPANVAGSATGVSGQE 392 (403) T ss_pred HhC--CC------------cCHHHHH---H----HhCCCCCCccccc---c-----------cccccccccccccCCCCc Confidence 111 12 3432222 1 1122233221110 0 011222211111111111 Q ss_pred ccCCCCCCCCC Q lcl|NC_015287. 552 EQAPQMPEGGE 562 (563) Q Consensus 552 ~~~~~~~~~~~ 562 (563) ...|.-..=|| T Consensus 393 ~~~~~~~~~g~ 403 (403) T protein:vir:10 393 GGRPKGSTEGD 403 (403) T ss_pred CCCCCCCcCCC Confidence 12222222223 No 194 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=27.37 E-value=1.8 Score=19.12 Aligned_cols=459 Identities=15% Similarity=0.102 Sum_probs=174.4 Q ss_pred CcccccchhhHhhhcccccccccCcCCCCcccceeccccccccc-cccchhhhHHHHHHHHHHHhhccchhHHHHHhhcc Q lcl|NC_015287. 1 MAKLFGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYV-DIDGVFKSEFELIKRYREMALHPEVDSAIEDIINE 79 (563) Q Consensus 1 m~~~~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~-~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvne 79 (563) ||+ .....|.++|...-.-...+..|..+.+. .....++ ..++++.|+.|..++.|.++++.+..- T Consensus 1 ~~~------------~~~~~~gl~p~rl~~i~~~~~~~~~~~~~~~~~~~Lr-~~~~~~ly~~m~~D~hi~s~l~~Rk~a 67 (488) T protein:vir:95 1 MAD------------ITETQESLPPFRMGEVGSLGLKVKNGRIYEEPRQALR-FPESIKTFQLMMRDPAVAASVNIIKMF 67 (488) T ss_pred CCC------------ccccCCCCCHHHHHHHHHHhhccccchhhccchhhhc-ccchHHHHHHHhhChHHHHHHHHHHHH Confidence 221 11122334442211111112222333221 2222333 346788999999999999999999885 Q ss_pred eEEecCCCcEEEEEeeccccchHHHH-HHHHHHHHHHHHhc--chhhhHHHHHHHhhhhhhheeeeecCCCCCcceeEEE Q lcl|NC_015287. 80 AIVSDQNDSPVEIDLENLPASKQLKD-LIREEFKTVKEILN--FDDKCHEILRNWYVDGRIYYHKVIDIKAPEEGIKEVR 156 (563) Q Consensus 80 aiv~d~~~~pV~l~l~~~~~s~sik~-~I~eeF~~i~~ll~--F~~~~~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr 156 (563) +.-.+ -.|.-. ..-++..|. .+.+.++++++-+. |..-..+++. .+-.|--.+++|.+......|...+. T Consensus 68 v~~~~-----w~v~p~-~~~~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~ld-a~~~G~s~~Eivw~~~~~~~~~~~~~ 140 (488) T protein:vir:95 68 VRKVN-----WRFVPP-KGKEQDPKMLERADFFNSLMDDMEHDWADFINSVMS-FCTYGFCVNEKVYKKRQGKKGKYQSK 140 (488) T ss_pred HhcCC-----ceEecC-CCCchhHHHHHHHHHHHHHHhccCccHHHHHHHHHH-hhcccceeeeeeeecccccccccccc Confidence 44322 112111 123344443 35677888887665 4454555542 45567777788777654444444333 Q ss_pred EeCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccc---------cCcCccccCccee Q lcl|NC_015287. 157 YIDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGA---------IGQTPFSTKQRPV 227 (563) Q Consensus 157 ~lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~---------~g~~~~~~~~~~i 227 (563) +-|.+- ......++. +..-.+|.|++.+..--..-+. ....+. ...++ T Consensus 141 ~~dg~~-~~~~i~~Rp--------------------q~~~~~f~~d~d~~l~~~~~~~~~~~~~~~~~~~~~~--~~~~~ 197 (488) T protein:vir:95 141 FDDGLI-GWAKLPIRN--------------------QSTLDKWYFDEDFRRVTGVRQNLRNVSHIAGAINLGE--RPLTR 197 (488) T ss_pred ccCCee-eeeeeeecC--------------------cccccceeeccCCCceeeccccccccccccccccccc--ccccc Confidence 333211 000000110 0011244444332111000000 000000 11234 Q ss_pred eeccc-ceeeeecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHH-hcCccceEEE----ccCC-CCchHHHHHHH Q lcl|NC_015287. 228 KIAPD-AISFCHSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRL-SRAPERRIFY----IDVG-NLPKMKAEQYL 300 (563) Q Consensus 228 ki~~d-aI~y~hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi-~RAPeRRvFy----IDvG-nlpk~KAeqYl 300 (563) .+|.. -|+|.|.-- ...-...|-|.+|..+|-=.+......+.+=- .=.|-.-..| .+.- .--+.+..+.+ T Consensus 198 ~lP~~kfi~~~~~~~--~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~ 275 (488) T protein:vir:95 198 KLPRAKFMLFKYDDE--YGNPEGRSPLLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYC 275 (488) T ss_pred cccccceEEEeecCC--CCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHH Confidence 45544 477777632 11222368888888888655555444443311 0022222222 1111 11122234445 Q ss_pred HHHHHhhcceeEeeCCCCcccccccccchhhhhcccccCCCCccceeec--CCCCCcchHHHHHHHHHHHHHhcCCChhh Q lcl|NC_015287. 301 KEVMNRYRNKLVYDASTGEIRDDRKHMSMLEDFWLPRREGGRGTEITTL--PGGQNLGELSDIEYFQKKLYRSLGVPESR 378 (563) Q Consensus 301 ~~im~k~knklvYd~~TGev~d~~~~msMlEDywLpRReGgrgTEIsTL--pGg~nLgei~DV~YF~~kLy~aL~VP~sR 378 (563) +.|...++. +...|-|-...--...-+++ .|+..+ .|++...-..=++|..+..-+++--..-- T Consensus 276 ~~i~~~~~~----~~~ag~iiP~g~~~~~k~~~----------~e~~l~~~~~~~~~~~~~li~~~d~~Isk~iLGqtLT 341 (488) T protein:vir:95 276 KTVVNDMIA----NDRAGLIWPRYIDPDTKEDI----------FEFSLVSRQGAKAYDTGSIIDRYSKQIMMAFMSDVLA 341 (488) T ss_pred HHHHHHhhc----cchhheeeccccccccchhh----------hhhhccccccCCchhHHHHHHHHHHHHHHHHhccccc Confidence 555444433 12233222111000000111 122222 23333333345899999988877433211 Q ss_pred ccCC-CcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHH Q lcl|NC_015287. 379 IAGS-GDGFNLGRSSEILRDEIKFTKFVGRMRKRFSHLFND-MLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELK 456 (563) Q Consensus 379 l~~~-~~g~~~g~~~eItRDElKF~KfI~rLr~rFs~if~d-~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElK 456 (563) +..+ ++++.+| + +-.|+ |...++...+.++..|.. +++.-+.++. ......-++.|-. .|-. T Consensus 342 ~~~~~~Gs~Al~---~-vh~ev-~~~i~~aDa~~i~~tln~~li~~l~~~Nf-------g~~~~~P~~~~~~----~e~~ 405 (488) T protein:vir:95 342 MGQSKYGSFSLA---D-SKTSL-LAMSVDILLKQIKNVINRDLVAQTYALNM-------WDDEEHVQITYDD----IETP 405 (488) T ss_pred cccCcchhhhHH---H-HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcC-------CCCCCccEEEecC----cChh Confidence 2211 1112221 1 11112 444555566666666653 3333333432 1112223344432 1223 Q ss_pred HHHHHHHHHHHHHHhhhhhchhhh----hHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccc-cCC Q lcl|NC_015287. 457 ETELLNERLGVVAAVDPYVGKYFS----LDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPE-QMP 531 (563) Q Consensus 457 e~Ei~~~R~~~~~~~~p~vGKy~S----~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~-~~~ 531 (563) +.+-+.+++..|..+ |-.++ .+|+++++- +.. |.+.++......+. .+. T Consensus 406 Dl~~~ae~~~~L~~~----G~~i~~~~~~~~i~e~~g-ip~---------------------~~~~e~~~~~~~~~~~~~ 459 (488) T protein:vir:95 406 DLEAIGSYIQKTVAV----GALEVDKELSNKLREHIG-LPP---------------------ADESQPVSEKLSPNSQSR 459 (488) T ss_pred hHHHHHHHHHHHHhC----CCccccHHHHHHHHHHhC-CCC---------------------CCCCccccccCCCCCCCC Confidence 333344444444332 43333 344543332 211 11111110000000 000 Q ss_pred CCcCCCCCcccccccCCCCcccCCCCCCCCCC Q lcl|NC_015287. 532 GGAMNPQDPNMGGAPQEGGAEQAPQMPEGGEI 563 (563) Q Consensus 532 ~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 563 (563) .+.-.+ .+ ..++...++.++|+..|--+= T Consensus 460 ~~~~~~-~~--~~~~~~~~~~~~~~~a~~~~~ 488 (488) T protein:vir:95 460 SGDGYK-TA--GEGTAKTPSAKDPSTANKANK 488 (488) T ss_pred CCcccC-CC--cccCCcccccccchhhhhccC Confidence 000000 00 001111111222222221111 No 195 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=26.07 E-value=2 Score=18.95 Aligned_cols=365 Identities=11% Similarity=0.072 Sum_probs=143.4 Q ss_pred ccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEec Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSD 84 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d 84 (563) ||| |.+...-... ...++.+... .+.+ + . + .| +++-|..||+-|.+.+.-+ T Consensus 1 Mg~-f~~~~~~~~~-----~~~~~~~~~~----~~~~-----~------~--~-~~----~~~~v~~~i~~Ia~~iA~l- 51 (378) T protein:vir:16 1 MNL-FGKVVSFSRG-----KLNNDTQRVT----AWQN-----E------A--V-EY----TSAFVTNIHNKIANEITKV- 51 (378) T ss_pred Ccc-chhhhhhhcc-----cccCCcceee----eccc-----c------h--h-hH----HHHHHHHHHHHHHhhhhhC- Confidence 777 5443321111 1111111110 0100 0 0 0 12 3344889999999866533 Q ss_pred CCCcEEEEEeec--cccchHHHHHHHHHHHHHHHH-hcchhhhHH----HHHHHhhhhhhheeeeecCCCCCcceeEEEE Q lcl|NC_015287. 85 QNDSPVEIDLEN--LPASKQLKDLIREEFKTVKEI-LNFDDKCHE----ILRNWYVDGRIYYHKVIDIKAPEEGIKEVRY 157 (563) Q Consensus 85 ~~~~pV~l~l~~--~~~s~sik~~I~eeF~~i~~l-l~F~~~~~~----~~R~WYvDGri~fhkviD~~~~k~GI~eLr~ 157 (563) |+++--.. -...+...+.+.....++|+. =+=...+++ ++..+...|.-|-.++.|.. ..++.+ T Consensus 52 ----~~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~-----~g~~~~ 122 (378) T protein:vir:16 52 ----EFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDN-----TGELLD 122 (378) T ss_pred ----ceeEEEEcccccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecC-----CceEEE Confidence 33331111 012222223333444444442 111223444 44466777887766665521 122222 Q ss_pred eCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeee Q lcl|NC_015287. 158 IDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFC 237 (563) Q Consensus 158 lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~ 237 (563) +-|.. ....++.+.|.|. T Consensus 123 l~~~~--------------------------------------------------------------~~~~~~~~diih~ 140 (378) T protein:vir:16 123 LLFAD--------------------------------------------------------------DKKEYKPEELVRL 140 (378) T ss_pred EEecC--------------------------------------------------------------CeeEecccceEEe Confidence 22210 0123334444443 Q ss_pred ecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCC Q lcl|NC_015287. 238 HSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDAST 317 (563) Q Consensus 238 hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~T 317 (563) .+- ..+..+.|.|+.|++.++. + ..-+--|-+..++ +.|....+.+....+...|++..--+. . T Consensus 141 r~~---~~~~~~~s~l~~~~~~i~~------~-----~~~~~~~g~l~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~-~ 204 (378) T protein:vir:16 141 TSP---FYINEDTSILDNALASIQT------K-----LEQGKLRGLLKIN-AFLDIDNTQEYREKALTTIKNMQEGSS-Y 204 (378) T ss_pred cCc---cCccchhHHHHHHHHHHHH------H-----HhcCccceeeEeC-CcCCHHHHHHHHHHHHHHHHHhhcccc-c Confidence 321 2233456888887766432 1 1122223344443 445555555555555555544322111 1 Q ss_pred CcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHH Q lcl|NC_015287. 318 GEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRD 397 (563) Q Consensus 318 Gev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRD 397 (563) | +.+. + ..|.+++.|.-.....++...+|-++.+.++++||.+.|.+ ..+|-. T Consensus 205 g------~~~v-l----------~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~g--------~~~e~~-- 257 (378) T protein:vir:16 205 N------GLTP-V----------DNKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILLG--------TASQEQ-- 257 (378) T ss_pred c------cceE-c----------CCCceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcC--------CchHHH-- Confidence 1 1121 1 12455666654445556778899999999999999999832 112211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhh---hceEEEeeccchHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_015287. 398 EIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLG---DHIQYDFVYDNHFAELKETELLNERLGVVAAVDPY 474 (563) Q Consensus 398 ElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~---~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~ 474 (563) -+-|..+ .|+-.+ ..+.+.|.. .+++++|+.... ....++|.-+. +... -+..|++.+..+-.- T Consensus 258 ~~~f~~~--tl~P~~-~~ie~~l~~-----kLl~~~e~~~~~~~~~~~~~~f~~~~----l~~~-d~~~~~~~~~~~~~~ 324 (378) T protein:vir:16 258 QIYFYNS--TIIPLL-IQLEKELTY-----KLISTNRRRVVKGNLYYERIIVDNQL----FKFA-TLKELIDLYHENING 324 (378) T ss_pred HHHHHHH--HHHHHH-HHHHHHHHh-----hcCChhhhhhhhhcccccceeeccch----hhhc-CHHHHHHHHHHHHhC Confidence 1123332 132211 122333333 345666666432 22334444322 1111 122555554433322 Q ss_pred hchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCcccC Q lcl|NC_015287. 475 VGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQA 554 (563) Q Consensus 475 vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~ 554 (563) | + +|.-|+-+ ....++++..+.--.+. + ..|.+. ...++++ . T Consensus 325 -G-~------------~T~NE~R~-------~~g~~p~~ggD~~~~~~-----n------~~~~~~---~~~~~~~---~ 366 (378) T protein:vir:16 325 -P-I------------FTQNQLLV-------KMGEQPIEGGDVYIANL-----N------AVAVKN---LSDLQGS---R 366 (378) T ss_pred -C-C------------cCHHHHHH-------HhCCCCCCCCCeEeecc-----c------cccccc---hhhhcCc---c Confidence 1 2 34433321 12222222110000000 0 001110 0000000 0 Q ss_pred CCCCCCCCC Q lcl|NC_015287. 555 PQMPEGGEI 563 (563) Q Consensus 555 ~~~~~~~~~ 563 (563) .+..+|+|- T Consensus 367 ~~~~~~~e~ 375 (378) T protein:vir:16 367 KDVTSTDET 375 (378) T ss_pred CCCCCCCCC Confidence 011122222 No 196 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=25.00 E-value=2.1 Score=18.81 Aligned_cols=380 Identities=13% Similarity=0.136 Sum_probs=150.7 Q ss_pred ccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEec Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSD 84 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d 84 (563) ||+ +.+-........+...-.++..+. .....+..+..+ ...++|-|..||+-|.+.+.-+ T Consensus 1 MGl-~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~vt~~--------------~al~~~~v~~~i~~Ia~~iA~l- 61 (394) T protein:vir:62 1 MGL-RDRFSNYLFKKAEKRGYLDNVLGK---SIRYSGVYVTDS--------------NILQSSDVYELLQDISNQMVLA- 61 (394) T ss_pred Cch-hhhhhhhccCCCCchhhhhhhhhc---ccccCccccChh--------------hhhccHHHHHHHHHHHHhhccc- Confidence 333 111100011110110000110000 111111111111 1246788999999999976543 Q ss_pred CCCcEEEEEeeccccchHHHHHHHHHHHHHHHHhc---chhhhH----HHHHHHhhhhhhheeeeecCCCCCcceeEEEE Q lcl|NC_015287. 85 QNDSPVEIDLENLPASKQLKDLIREEFKTVKEILN---FDDKCH----EILRNWYVDGRIYYHKVIDIKAPEEGIKEVRY 157 (563) Q Consensus 85 ~~~~pV~l~l~~~~~s~sik~~I~eeF~~i~~ll~---F~~~~~----~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~ 157 (563) |+.|-=++ .+ .+.+.- .+.||. =...+. .++..++++|--|..+ + .++ -| . T Consensus 62 ----p~~v~~~~---g~----~~~~~~--~~~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i--~-~~~-~~-----~ 119 (394) T protein:vir:62 62 ----DIVVEDEF---GN----EIKDDI--ALQILRNPNNYLTQSEFIKLMTNTYLLEGETFPIL--N-GAQ-IH-----L 119 (394) T ss_pred ----ceEEEcCC---Cc----ccchhh--HHHHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEE--e-cce-ee-----c Confidence 33332111 11 122211 122221 112233 3445566667765543 2 111 11 1 Q ss_pred eCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeee Q lcl|NC_015287. 158 IDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFC 237 (563) Q Consensus 158 lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~ 237 (563) +++ +.+.... ....+|.++ ..+++.+.|.|. T Consensus 120 ~~~------------------~~~~~~~--------~~~~~~~~~-----------------------~~~~~~~eiih~ 150 (394) T protein:vir:62 120 ASN------------------VFTELDD--------NLVEHFNIG-----------------------GHEIPPCMIRHV 150 (394) T ss_pred ccc------------------ceEEECC--------ceEEEEeeC-----------------------CEEechhheEEe Confidence 111 1111100 001122211 134667777443 Q ss_pred ecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHH-HHHHHHHhhcceeEeeCC Q lcl|NC_015287. 238 HSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQ-YLKEVMNRYRNKLVYDAS 316 (563) Q Consensus 238 hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeq-Yl~~im~k~knklvYd~~ 316 (563) -.++.++-.+.|.|+.|.+++.....++....=+----+--+-+..++..--|...+.+ -.+.+..+|+.. .+ T Consensus 151 --r~~~~d~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~~~g~----~n 224 (394) T protein:vir:62 151 --KNIGADHLRGKGILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDAHINPQNGAQSKLINAILDQLESI----DE 224 (394) T ss_pred --cCcCCCCccccChHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCCCCcCHHHHHHHHHHHHHHhccc----cc Confidence 33455666678999999999988777777655444444666777777655323333322 222222333211 11 Q ss_pred CCcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHH---HHHHHHHHHhcCCChhhccCCCcceeccccch Q lcl|NC_015287. 317 TGEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDI---EYFQKKLYRSLGVPESRIAGSGDGFNLGRSSE 393 (563) Q Consensus 317 TGev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV---~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~e 393 (563) .|.+ + . | ++|.|.++..|. .+.-+++-+ ++-.+.+.++++||..-|.... .+ . T Consensus 225 ~g~~------~-v-----l---~~g~~~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~------~s-n 280 (394) T protein:vir:62 225 ARSV------K-M-----I---PLGKGYSIDTLK--SPLDDEKTLAYLNVYKKDLGKFLGINVDTYTELI------KE-D 280 (394) T ss_pred cCce------e-E-----e---eCCCceeEEecC--CCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCCC------Cc-C Confidence 1211 0 1 1 123333444333 233344444 3567889999999999994211 11 1 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_015287. 394 ILRDEIKFTKFVGRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLGDHIQYDFVYDNHFAELKETELLNERLGVVAAVDP 473 (563) Q Consensus 394 ItRDElKF~KfI~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p 473 (563) +...-..|.++ .|+-. ...+.+.|-..| +++++ ...+.++|..+..... ..|.+++..+-. T Consensus 281 ~e~~~~~~~~~--~l~P~-~~~ie~~l~~kl-----l~~~~----~~~~~~~fd~~~~~~~-------~~~~~~~~~~~~ 341 (394) T protein:vir:62 281 IEKAMMYIHNK--AVRPI-MKNFEDHLSLLF-----YAQNS----GKRIKFKINILDFVTY-------SNKTNIGYNLVR 341 (394) T ss_pred HHHHHHHHHHH--HHHHH-HHHHHHHHhhhh-----cCccc----cCceEEEechhhhcCH-------HHHHHHHHHHHh Confidence 22222334433 23321 122233333333 34443 2457777876443222 123333322211 Q ss_pred hhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCccc Q lcl|NC_015287. 474 YVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQ 553 (563) Q Consensus 474 ~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~ 553 (563) .--||..|+-+ ..+-+++++|....---+ ....|.+. .. . T Consensus 342 --------------~g~~T~NE~R~-------~~gl~p~~~~~gd~~~~~---------~n~~~~~~--------~~--~ 381 (394) T protein:vir:62 342 --------------TAITSPDNVAD-------MLGFPKQNTKESQAIYIS---------NDVTEIGK--------KE--A 381 (394) T ss_pred --------------CCCcCHHHHHH-------HhCCCCCCCCCCCeeecc---------cccccccc--------cc--c Confidence 11255544431 223344433211110000 01112211 11 1 Q ss_pred CCCCCCCCCC Q lcl|NC_015287. 554 APQMPEGGEI 563 (563) Q Consensus 554 ~~~~~~~~~~ 563 (563) ...-++|||- T Consensus 382 ~~~~~kgge~ 391 (394) T protein:vir:62 382 TDGSLGGGEE 391 (394) T ss_pred ccccCCCCCC Confidence 1223466666 No 197 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=24.36 E-value=2.2 Score=18.72 Aligned_cols=367 Identities=11% Similarity=0.060 Sum_probs=135.0 Q ss_pred ccchhhHhhhcccccccccCcCCCCcccceeccccccccccccchhhhHHHHHHHHHHHhhccchhHHHHHhhcceEEec Q lcl|NC_015287. 5 FGFKIQKDDEQNKNVVSPVPQSNEDSSDYYVSSGFYGQYVDIDGVFKSEFELIKRYREMALHPEVDSAIEDIINEAIVSD 84 (563) Q Consensus 5 ~~f~~~~~~~~~~~~~s~~p~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~ma~~PEvd~Ai~eIvneaiv~d 84 (563) ||| |.+...-.+ + +-.++........++ .+ .| +.+-|..||+-|.+.+. T Consensus 1 Mg~-f~~~~~~~~---~--~~~~~~~~~~~~~~~----------~~--------~~----~~~~v~~~v~~IA~~iA--- 49 (378) T protein:vir:94 1 MNL-FGKVVSFSR---G--KLNNDTQRVTAWQNE----------AV--------EY----TSAFVTNIHNKIANEIT--- 49 (378) T ss_pred CCc-cccchhccc---c--cccCCcceeeeeccc----------hh--------HH----HHHHHHHHHHHHHhhhh--- Confidence 565 322111111 1 111111110000000 00 12 22338899999998654 Q ss_pred CCCcEEEEEeecc--ccchHHHHHHHHHHHHHHHH-hcchhhh----HHHHHHHhhhhhhheeeeecCCCCCcceeEEEE Q lcl|NC_015287. 85 QNDSPVEIDLENL--PASKQLKDLIREEFKTVKEI-LNFDDKC----HEILRNWYVDGRIYYHKVIDIKAPEEGIKEVRY 157 (563) Q Consensus 85 ~~~~pV~l~l~~~--~~s~sik~~I~eeF~~i~~l-l~F~~~~----~~~~R~WYvDGri~fhkviD~~~~k~GI~eLr~ 157 (563) .-|+++--.+. ...+.....+..-..++|+. =+=...+ +.++..+..+|.-|-+++.|.. ..++.+ T Consensus 50 --~lp~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~-----~g~~~~ 122 (378) T protein:vir:94 50 --KVEFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDN-----TGELLD 122 (378) T ss_pred --hCceeeEEEcccCcccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCC-----CceEEE Confidence 23443321110 01111122222223333332 0111222 4455566777888877776532 122333 Q ss_pred eCCccceeeeeeeccCCccccceeeecccccccccccccceeeecCccccccccccccCcCccccCcceeeecccceeee Q lcl|NC_015287. 158 IDPLKIKLVRKLKTDNSLRGAINQVNSKNPADIENPEIEEYYQYDPSQTQGKNALGAIGQTPFSTKQRPVKIAPDAISFC 237 (563) Q Consensus 158 lDP~~i~~vR~~~~~~~~~~~i~~~~~~~~~~~~~~~~~eyf~y~~~~~~~~~~~g~~g~~~~~~~~~~iki~~daI~y~ 237 (563) +.|.. ..+.++.+.|.|. T Consensus 123 l~p~~--------------------------------------------------------------~~~~~~~~diiH~ 140 (378) T protein:vir:94 123 LLFAD--------------------------------------------------------------DKKEYKPEELVRL 140 (378) T ss_pred EEecC--------------------------------------------------------------CeeEeeeeeeEEe Confidence 33211 0122333444444 Q ss_pred ecceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeeCCC Q lcl|NC_015287. 238 HSGLVDRNKQTILSYLHKSIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKMKAEQYLKEVMNRYRNKLVYDAST 317 (563) Q Consensus 238 hSGl~d~~~~~~~syL~~Aik~~NqL~m~EDalVIyRi~RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knklvYd~~T 317 (563) ++. + ++..+.|-|+.|.+.+.. + +..+--+-+..++ |.|....+.+....+...|+++.-.+. . T Consensus 141 ~~~-~--~~~~g~s~l~~~~~~i~~------~-----~~~~~~~gil~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~-~ 204 (378) T protein:vir:94 141 TSP-F--YINEDTSILDNALASIQT------K-----LEQGKLRGLLKIN-AFLDIDNTQEYREKALTTIKNMQEGSS-Y 204 (378) T ss_pred cCc-C--CccchhHHHHHHHHHHHH------H-----HhcccccceeeeC-CcCCHHHHHHHHHHHHHHHHHhhcccc-c Confidence 332 1 233456777777765432 1 1122223444443 455555554444444444444322111 1 Q ss_pred CcccccccccchhhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCChhhccCCCcceeccccchhhHH Q lcl|NC_015287. 318 GEIRDDRKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELSDIEYFQKKLYRSLGVPESRIAGSGDGFNLGRSSEILRD 397 (563) Q Consensus 318 Gev~d~~~~msMlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~g~~~g~~~eItRD 397 (563) | ..+ .++ .|.+++-|.=.....++...+|-.+.+.++++||...|.+ ..+| + T Consensus 205 g------~~~-vl~----------~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVP~~~l~~--------~~se---~ 256 (378) T protein:vir:94 205 N------GLT-PVD----------NKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILLG--------TASQ---E 256 (378) T ss_pred c------cce-ecC----------CCceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcC--------ChHH---H Confidence 1 111 111 1333443332233445677888899999999999999932 1111 1 Q ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHhh---hceEEEeeccchHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_015287. 398 EIKFTKFV-GRMRKRFSHLFNDMLKTQLILKNIVTPEDWEVLG---DHIQYDFVYDNHFAELKETELLNERLGVVAAVDP 473 (563) Q Consensus 398 ElKF~KfI-~rLr~rFs~if~d~Lk~qLiLkgI~t~eew~~~~---~~i~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p 473 (563) ....|+ ..|+-. ...+.+. +-..+++++|+.... ..+.+.|..+. +.. .-+..|++.+..+-. T Consensus 257 --~~~~f~~~tL~P~-~~~ie~~-----l~~~Ll~~~er~~g~~~~~~~~~~f~~~~----l~~-~d~~~~~~~~~~~~~ 323 (378) T protein:vir:94 257 --QQIYFYNSTIIPL-LIQLEKE-----LTYKLISTNRRRVVKGNLYYERIIVDNQL----FKF-ATLKELIDLYHENIN 323 (378) T ss_pred --HHHHHHHHHHHHH-HHHHHHH-----HHhhcCChhHhhhhhhcccccceeecchh----hhh-cCHHHHHHHHHHHHh Confidence 112222 122221 1112233 333456666665421 22334444322 111 112355555544333 Q ss_pred hhchhhhhHHHHHHHhccChhhHHHHHHHHHHHhcCCCCCCCcccCccccccccccCCCCcCCCCCcccccccCCCCccc Q lcl|NC_015287. 474 YVGKYFSLDYVRRKILKQKDEEVIEIDKQMEKEIKDGKLADPMEVQQLQLGVHPEQMPGGAMNPQDPNMGGAPQEGGAEQ 553 (563) Q Consensus 474 ~vGKy~S~~~i~k~Il~~tdeei~e~~~qie~E~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~ 553 (563) - -+ ||.-|+-++ ..-++++.++.--.+.+ -. +..++.+.+.......++.|. T Consensus 324 ~--G~------------~T~NE~R~~-------~gl~p~~gGD~~~~~~n-----~~--~~~~~~~~~~~~~~~~~~~e~ 375 (378) T protein:vir:94 324 G--PI------------FTQNQLLVK-------MGEQPIEGGDVYIANLN-----AV--AVKNLSDLQGSRKDVTSTDET 375 (378) T ss_pred C--CC------------cCHHHHHHH-------hCCCCCCCCCeeeeccc-----cc--ccccchhhcCCcCCCCCCCCC Confidence 2 12 343333211 12223321110000000 00 000011100011111111111 Q ss_pred CCC Q lcl|NC_015287. 554 APQ 556 (563) Q Consensus 554 ~~~ 556 (563) .-+ T Consensus 376 ~n~ 378 (378) T protein:vir:94 376 NNQ 378 (378) T ss_pred CCC Confidence 101 Done!