Query lcl|NC_019530.1_cdsid_YP_007008105.1 [gene=F416_gp125] [protein=portal vertex protein of the head] [protein_id=YP_007008105.1] [location=complement(105102..106784)] Match_columns 560 No_of_seqs 77 out of 89 Neff 4.0 Searched_HMMs 1612 Date Thu Nov 7 19:13:25 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_125 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_125_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:108049 Length: 524 100.0 2E-274 1E-277 1520.8 43.1 509 15-534 1-524 (524) 2 protein:vir:104892 Length: 558 100.0 6E-274 4E-277 1518.6 43.5 530 1-560 1-553 (558) 3 protein:vir:6896 Length: 523 # 100.0 5E-273 3E-276 1513.7 42.5 507 15-534 1-523 (523) 4 protein:vir:81017 Length: 521 100.0 1E-272 9E-276 1510.8 43.6 510 11-534 1-521 (521) 5 protein:vir:98265 Length: 524 100.0 2E-272 1E-275 1510.0 43.6 515 6-535 1-524 (524) 6 protein:vir:6596 Length: 521 # 100.0 7E-272 4E-275 1507.3 43.4 510 11-534 1-521 (521) 7 protein:vir:103458 Length: 524 100.0 6E-272 4E-275 1507.4 42.8 512 1-534 1-524 (524) 8 protein:vir:7208 Length: 524 # 100.0 7E-272 4E-275 1507.1 42.8 512 1-534 1-524 (524) 9 protein:vir:104500 Length: 537 100.0 3E-271 2E-274 1503.7 44.0 523 1-560 1-537 (537) 10 protein:vir:103177 Length: 533 100.0 4E-271 2E-274 1503.2 44.2 517 1-555 1-533 (533) 11 protein:vir:101189 Length: 516 100.0 5E-271 3E-274 1502.6 42.9 509 12-535 1-516 (516) 12 protein:vir:101806 Length: 516 100.0 5E-271 3E-274 1502.6 42.9 509 12-535 1-516 (516) 13 protein:vir:100598 Length: 516 100.0 1E-270 8E-274 1500.2 42.8 507 12-533 1-516 (516) 14 protein:vir:106999 Length: 564 100.0 8E-271 5E-274 1501.4 41.3 531 8-560 1-554 (564) 15 protein:vir:106282 Length: 521 100.0 1E-268 6E-272 1489.9 42.2 509 1-535 1-521 (521) 16 protein:vir:5665 Length: 511 # 100.0 3E-265 2E-268 1470.8 43.0 493 25-531 1-511 (511) 17 protein:vir:5839 Length: 533 # 100.0 4E-227 3E-230 1261.7 35.4 476 22-560 1-512 (533) 18 protein:vir:94049 Length: 532 99.3 2.4E-11 1.5E-14 78.8 27.2 456 1-560 32-529 (532) 19 protein:vir:96068 Length: 765 99.3 3E-11 1.9E-14 78.3 25.7 461 1-560 37-560 (765) 20 protein:vir:107742 Length: 537 99.2 2.3E-10 1.4E-13 73.5 28.5 452 15-558 1-537 (537) 21 protein:vir:104338 Length: 422 99.2 2.7E-11 1.7E-14 78.5 23.3 395 45-534 1-422 (422) 22 protein:vir:5249 Length: 437 # 99.0 1.3E-09 7.8E-13 69.4 25.2 411 45-558 1-437 (437) 23 protein:vir:107662 Length: 427 99.0 8.9E-10 5.5E-13 70.2 21.2 403 43-559 1-427 (427) 24 protein:vir:99563 Length: 862 98.9 5E-09 3.1E-12 66.1 24.9 464 1-560 66-584 (862) 25 protein:vir:79772 Length: 648 98.9 1.2E-08 7.5E-12 64.0 30.5 459 1-560 1-518 (648) 26 protein:vir:63755 Length: 547 98.9 1E-08 6.4E-12 64.4 25.4 466 6-560 1-524 (547) 27 protein:vir:79538 Length: 502 98.9 1.6E-08 1E-11 63.3 30.2 470 15-558 1-502 (502) 28 protein:vir:80040 Length: 461 98.8 3.1E-08 1.9E-11 61.7 24.5 423 28-535 1-461 (461) 29 protein:vir:105782 Length: 449 98.8 7.9E-09 4.9E-12 65.0 21.0 418 27-553 1-449 (449) 30 protein:vir:80796 Length: 574 98.8 5.1E-08 3.2E-11 60.6 25.4 464 1-560 1-525 (574) 31 protein:vir:101418 Length: 569 98.7 1.8E-08 1.1E-11 63.1 21.5 503 1-559 1-569 (569) 32 protein:vir:2341 Length: 488 # 98.7 4.8E-08 3E-11 60.7 23.2 456 42-554 1-488 (488) 33 protein:vir:78227 Length: 480 98.7 2.2E-08 1.4E-11 62.5 20.2 439 64-560 1-478 (480) 34 protein:vir:2500 Length: 501 # 98.6 8.1E-08 5E-11 59.5 21.9 448 23-555 1-501 (501) 35 protein:vir:10321 Length: 495 98.6 2.4E-07 1.5E-10 56.9 28.8 455 1-560 1-487 (495) 36 protein:vir:80644 Length: 551 98.5 3.7E-07 2.3E-10 55.8 30.9 460 8-560 1-528 (551) 37 protein:vir:9922 Length: 489 # 98.5 3.9E-07 2.4E-10 55.7 23.4 452 34-547 1-489 (489) 38 protein:vir:95542 Length: 548 98.5 4.8E-07 3E-10 55.2 29.7 477 15-560 1-543 (548) 39 protein:vir:6382 Length: 553 # 98.4 8.4E-07 5.2E-10 53.9 31.0 479 27-559 1-553 (553) 40 protein:vir:99916 Length: 504 98.4 8.4E-07 5.2E-10 53.9 23.2 453 51-560 1-497 (504) 41 protein:vir:96738 Length: 505 98.4 8.9E-07 5.5E-10 53.8 28.1 457 4-558 1-505 (505) 42 protein:vir:7853 Length: 518 # 98.4 9E-07 5.6E-10 53.7 24.3 423 34-560 1-452 (518) 43 protein:vir:78537 Length: 480 98.4 1.1E-06 6.8E-10 53.3 21.8 435 64-560 1-478 (480) 44 protein:vir:80959 Length: 499 98.4 1.1E-06 7E-10 53.2 25.0 443 23-539 1-499 (499) 45 protein:vir:79647 Length: 435 98.3 1.2E-06 7.1E-10 53.1 26.3 405 9-540 1-435 (435) 46 protein:vir:7768 Length: 484 # 98.3 3.1E-07 1.9E-10 56.2 18.1 452 38-557 1-484 (484) 47 protein:vir:102080 Length: 429 98.3 1.3E-06 7.8E-10 52.9 23.2 413 15-560 1-421 (429) 48 protein:vir:107605 Length: 432 98.3 1.9E-06 1.1E-09 52.0 24.4 413 15-537 1-432 (432) 49 protein:vir:105002 Length: 432 98.3 1.9E-06 1.1E-09 52.0 24.4 413 15-537 1-432 (432) 50 protein:vir:102855 Length: 432 98.3 1.9E-06 1.1E-09 52.0 24.4 413 15-537 1-432 (432) 51 protein:vir:98444 Length: 434 98.2 2.2E-06 1.4E-09 51.6 27.0 404 61-558 1-434 (434) 52 protein:vir:79703 Length: 505 98.2 1.5E-06 9.2E-10 52.5 19.7 441 9-535 1-505 (505) 53 protein:vir:389 Length: 530 # 98.2 2.3E-06 1.4E-09 51.5 30.9 460 37-559 1-530 (530) 54 protein:vir:99312 Length: 563 98.2 2.4E-06 1.5E-09 51.4 27.2 467 1-560 1-552 (563) 55 protein:vir:95599 Length: 563 98.2 2.4E-06 1.5E-09 51.4 27.2 467 1-560 1-552 (563) 56 protein:vir:102727 Length: 945 98.2 2.6E-06 1.6E-09 51.2 22.3 451 1-560 33-540 (945) 57 protein:vir:4223 Length: 486 # 98.2 1.9E-06 1.2E-09 51.9 19.9 450 39-556 1-486 (486) 58 protein:vir:2427 Length: 485 # 98.2 2.8E-06 1.7E-09 51.1 21.1 444 39-559 1-485 (485) 59 protein:vir:3843 Length: 397 # 98.2 2.8E-06 1.8E-09 51.0 23.2 395 15-548 1-397 (397) 60 protein:vir:3153 Length: 467 # 98.2 2.9E-06 1.8E-09 50.9 26.8 403 80-560 1-445 (467) 61 protein:vir:99072 Length: 479 98.2 3E-06 1.9E-09 50.8 22.8 441 39-555 1-479 (479) 62 protein:vir:101647 Length: 460 98.2 3.3E-06 2.1E-09 50.6 24.4 422 27-560 1-456 (460) 63 protein:vir:96579 Length: 576 98.2 3.4E-06 2.1E-09 50.5 29.9 481 1-560 1-538 (576) 64 protein:vir:104082 Length: 485 98.1 3.6E-06 2.2E-09 50.4 23.6 450 38-559 1-485 (485) 65 protein:vir:78641 Length: 278 98.1 4.2E-06 2.6E-09 50.1 21.0 269 107-463 1-278 (278) 66 protein:vir:1587 Length: 508 # 98.1 4.1E-06 2.5E-09 50.1 19.7 429 12-542 1-508 (508) 67 protein:vir:38 Length: 496 # N 98.0 6.8E-06 4.2E-09 48.9 26.3 422 53-539 1-496 (496) 68 protein:vir:3420 Length: 533 # 98.0 8.2E-06 5.1E-09 48.5 30.5 477 27-558 1-533 (533) 69 protein:vir:4194 Length: 540 # 98.0 9.3E-06 5.8E-09 48.2 25.3 439 15-560 1-457 (540) 70 protein:vir:99522 Length: 470 97.9 1.1E-05 7.1E-09 47.7 27.6 425 29-558 1-470 (470) 71 protein:vir:4156 Length: 542 # 97.9 1.2E-05 7.4E-09 47.6 24.5 437 11-560 1-494 (542) 72 protein:vir:101541 Length: 694 97.9 1.3E-05 8E-09 47.4 24.2 443 1-560 60-574 (694) 73 protein:vir:3648 Length: 695 # 97.9 1.4E-05 8.4E-09 47.3 26.2 462 1-560 12-575 (695) 74 protein:vir:1326 Length: 457 # 97.9 1.4E-05 8.4E-09 47.3 25.6 435 15-559 1-457 (457) 75 protein:vir:9815 Length: 500 # 97.9 1.4E-05 9E-09 47.1 20.4 429 9-538 1-500 (500) 76 protein:vir:3028 Length: 500 # 97.9 1.4E-05 9E-09 47.1 20.4 429 9-538 1-500 (500) 77 protein:vir:78589 Length: 695 97.8 1.6E-05 9.9E-09 46.9 21.9 444 1-560 60-575 (695) 78 protein:vir:6240 Length: 457 # 97.8 1.8E-05 1.1E-08 46.6 22.6 440 15-560 1-450 (457) 79 protein:vir:94426 Length: 409 97.7 2.2E-05 1.4E-08 46.1 21.1 398 22-560 1-407 (409) 80 protein:vir:81152 Length: 411 97.7 3.1E-05 1.9E-08 45.3 18.5 400 6-557 1-411 (411) 81 protein:vir:101648 Length: 518 97.6 3.3E-05 2E-08 45.2 23.2 428 34-560 1-452 (518) 82 protein:vir:1380 Length: 422 # 97.6 4.1E-05 2.5E-08 44.7 21.1 418 15-559 1-422 (422) 83 protein:vir:105292 Length: 478 97.5 5.3E-05 3.3E-08 44.0 25.3 431 21-548 1-478 (478) 84 protein:vir:102118 Length: 409 97.5 5.5E-05 3.4E-08 43.9 20.4 400 15-557 1-409 (409) 85 protein:vir:2683 Length: 412 # 97.5 5.9E-05 3.7E-08 43.8 25.1 398 6-560 1-409 (412) 86 protein:vir:98883 Length: 517 97.4 7E-05 4.4E-08 43.4 20.8 451 12-548 1-517 (517) 87 protein:vir:93747 Length: 472 97.4 7.3E-05 4.5E-08 43.3 27.2 419 46-548 1-472 (472) 88 protein:vir:4337 Length: 434 # 97.4 7.3E-05 4.5E-08 43.3 21.0 423 14-553 1-434 (434) 89 protein:vir:99781 Length: 511 97.4 7.7E-05 4.8E-08 43.1 23.6 436 12-560 1-507 (511) 90 protein:vir:105889 Length: 474 97.4 8.2E-05 5.1E-08 43.0 24.4 418 22-548 1-474 (474) 91 protein:vir:94101 Length: 474 97.4 8.2E-05 5.1E-08 43.0 24.4 418 22-548 1-474 (474) 92 protein:vir:9871 Length: 429 # 97.4 8.2E-05 5.1E-08 43.0 22.2 402 27-548 1-429 (429) 93 protein:vir:106639 Length: 481 97.4 8.6E-05 5.3E-08 42.9 24.8 459 1-560 1-480 (481) 94 protein:vir:7987 Length: 456 # 97.2 0.00012 7.3E-08 42.1 23.3 406 58-537 1-456 (456) 95 protein:vir:8184 Length: 474 # 97.2 0.00014 8.4E-08 41.8 23.3 432 35-538 1-474 (474) 96 protein:vir:3609 Length: 452 # 97.1 0.00015 9.5E-08 41.5 28.2 419 29-542 1-452 (452) 97 protein:vir:96179 Length: 468 97.1 0.00016 9.7E-08 41.4 24.3 417 17-544 1-468 (468) 98 protein:vir:100882 Length: 383 97.1 0.00016 9.8E-08 41.4 21.5 374 15-558 1-383 (383) 99 protein:vir:9306 Length: 511 # 97.1 0.00017 1E-07 41.3 25.9 439 21-560 1-508 (511) 100 protein:vir:3964 Length: 453 # 97.1 0.00017 1.1E-07 41.3 27.7 420 34-548 1-453 (453) 101 protein:vir:960 Length: 413 # 97.1 0.00017 1.1E-07 41.2 24.1 402 1-559 1-413 (413) 102 protein:vir:100249 Length: 431 97.1 0.00019 1.1E-07 41.0 23.4 409 9-556 1-431 (431) 103 protein:vir:102330 Length: 451 97.1 0.00019 1.2E-07 41.0 24.6 395 66-542 1-451 (451) 104 protein:vir:96980 Length: 409 97.0 0.00021 1.3E-07 40.8 25.0 396 22-560 1-406 (409) 105 protein:vir:107112 Length: 478 97.0 0.00024 1.5E-07 40.5 25.4 431 15-548 1-478 (478) 106 protein:vir:78907 Length: 518 96.9 0.00026 1.6E-07 40.3 25.9 461 12-542 1-518 (518) 107 protein:vir:95378 Length: 406 96.8 0.00032 2E-07 39.7 23.5 397 15-558 1-406 (406) 108 protein:vir:1082 Length: 359 # 96.8 0.00033 2E-07 39.7 20.9 325 6-487 1-359 (359) 109 protein:vir:95806 Length: 440 96.8 0.00035 2.2E-07 39.5 25.9 406 66-548 1-440 (440) 110 protein:vir:106716 Length: 698 96.8 0.00036 2.2E-07 39.5 23.0 449 1-560 60-578 (698) 111 protein:vir:93943 Length: 409 96.8 0.00036 2.2E-07 39.5 24.5 396 1-560 1-406 (409) 112 protein:vir:100150 Length: 437 96.7 0.00037 2.3E-07 39.4 24.8 411 27-559 1-437 (437) 113 protein:vir:1236 Length: 483 # 96.7 0.00039 2.4E-07 39.3 27.2 423 42-548 1-483 (483) 114 protein:vir:96240 Length: 511 96.7 0.00042 2.6E-07 39.1 25.9 454 1-557 1-511 (511) 115 protein:vir:4995 Length: 384 # 96.6 0.00047 2.9E-07 38.8 18.7 382 15-524 1-384 (384) 116 protein:vir:9359 Length: 348 # 96.5 0.00059 3.7E-07 38.3 24.2 334 107-560 1-345 (348) 117 protein:vir:483 Length: 413 # 96.3 0.00074 4.6E-07 37.8 23.8 401 22-560 1-412 (413) 118 protein:vir:102950 Length: 471 96.3 0.00076 4.7E-07 37.7 28.2 397 64-534 1-471 (471) 119 protein:vir:4828 Length: 382 # 96.3 0.00076 4.7E-07 37.7 20.4 372 15-540 1-382 (382) 120 protein:vir:4782 Length: 522 # 96.3 0.00078 4.9E-07 37.6 22.5 442 12-542 1-522 (522) 121 protein:vir:78805 Length: 511 96.3 0.00079 4.9E-07 37.6 26.4 447 21-560 1-508 (511) 122 protein:vir:96366 Length: 511 96.3 0.00079 4.9E-07 37.6 26.4 447 21-560 1-508 (511) 123 protein:vir:106571 Length: 499 96.2 0.00083 5.2E-07 37.5 26.8 436 50-559 1-499 (499) 124 protein:vir:96839 Length: 474 96.2 0.00086 5.3E-07 37.4 24.8 409 58-557 1-474 (474) 125 protein:vir:97171 Length: 512 96.2 0.00091 5.6E-07 37.3 27.8 443 42-560 1-509 (512) 126 protein:vir:80680 Length: 441 96.2 0.00093 5.8E-07 37.2 19.0 415 27-550 1-441 (441) 127 protein:vir:93610 Length: 454 96.2 0.00094 5.8E-07 37.2 24.4 425 18-560 1-445 (454) 128 protein:vir:97336 Length: 492 96.2 0.00094 5.8E-07 37.2 23.2 442 6-548 1-492 (492) 129 protein:vir:9702 Length: 406 # 96.1 0.001 6.3E-07 37.0 24.3 389 15-559 1-406 (406) 130 protein:vir:1266 Length: 416 # 96.1 0.0011 6.6E-07 36.9 22.7 405 22-560 1-411 (416) 131 protein:vir:105461 Length: 470 95.9 0.0013 8.1E-07 36.4 26.4 405 66-539 1-470 (470) 132 protein:vir:100187 Length: 385 95.8 0.0015 9E-07 36.1 22.6 376 6-559 1-385 (385) 133 protein:vir:5961 Length: 503 # 95.7 0.0015 9.6E-07 36.0 26.0 448 8-559 1-503 (503) 134 protein:vir:103951 Length: 511 95.6 0.0017 1.1E-06 35.8 26.5 444 12-560 1-508 (511) 135 protein:vir:7407 Length: 392 # 95.4 0.0021 1.3E-06 35.2 23.9 386 1-551 1-392 (392) 136 protein:vir:8418 Length: 409 # 95.4 0.0022 1.3E-06 35.2 23.4 395 15-560 1-409 (409) 137 protein:vir:95113 Length: 474 95.3 0.0023 1.4E-06 35.1 26.6 436 23-548 1-474 (474) 138 protein:vir:4454 Length: 414 # 95.3 0.0024 1.5E-06 34.9 26.2 407 15-553 1-414 (414) 139 protein:vir:2732 Length: 501 # 95.2 0.0026 1.6E-06 34.8 27.0 435 42-560 1-498 (501) 140 protein:vir:80134 Length: 403 95.2 0.0026 1.6E-06 34.7 21.4 387 6-560 1-398 (403) 141 protein:vir:95899 Length: 474 95.1 0.0027 1.7E-06 34.6 23.6 433 36-548 1-474 (474) 142 protein:vir:96266 Length: 474 95.1 0.0027 1.7E-06 34.6 23.6 433 36-548 1-474 (474) 143 protein:vir:79043 Length: 479 95.1 0.0029 1.8E-06 34.5 25.4 419 38-559 1-479 (479) 144 protein:vir:79984 Length: 441 94.7 0.0036 2.2E-06 34.0 22.8 417 1-559 1-441 (441) 145 protein:vir:9408 Length: 441 # 94.7 0.0036 2.2E-06 34.0 22.8 417 1-559 1-441 (441) 146 protein:vir:80333 Length: 419 94.6 0.004 2.5E-06 33.7 21.6 401 15-560 1-417 (419) 147 protein:vir:4952 Length: 386 # 94.4 0.0044 2.7E-06 33.5 22.4 381 15-560 1-385 (386) 148 protein:vir:96494 Length: 501 94.4 0.0045 2.8E-06 33.4 25.6 449 12-560 1-497 (501) 149 protein:vir:100691 Length: 535 94.3 0.0049 3.1E-06 33.2 26.1 471 1-560 1-527 (535) 150 protein:vir:4898 Length: 502 # 94.2 0.005 3.1E-06 33.2 26.1 446 36-552 1-502 (502) 151 protein:vir:1023 Length: 392 # 94.2 0.0052 3.2E-06 33.1 28.0 382 1-551 1-392 (392) 152 protein:vir:3989 Length: 392 # 94.2 0.0052 3.2E-06 33.1 28.0 382 1-551 1-392 (392) 153 protein:vir:5737 Length: 419 # 94.1 0.0053 3.3E-06 33.1 24.3 403 22-560 1-418 (419) 154 protein:vir:81072 Length: 432 94.0 0.0057 3.5E-06 32.9 26.3 413 1-560 1-429 (432) 155 protein:vir:4598 Length: 416 # 94.0 0.0058 3.6E-06 32.8 23.2 400 9-559 1-416 (416) 156 protein:vir:81095 Length: 416 94.0 0.0058 3.6E-06 32.8 23.2 400 9-559 1-416 (416) 157 protein:vir:94805 Length: 492 93.6 0.007 4.4E-06 32.4 25.3 439 6-548 1-492 (492) 158 protein:vir:94498 Length: 474 93.4 0.0077 4.8E-06 32.2 23.8 425 11-548 1-474 (474) 159 protein:vir:97447 Length: 474 93.4 0.0077 4.8E-06 32.2 23.8 425 11-548 1-474 (474) 160 protein:vir:105064 Length: 421 93.2 0.0084 5.2E-06 32.0 24.6 404 9-560 1-417 (421) 161 protein:vir:9568 Length: 410 # 93.2 0.0085 5.2E-06 31.9 26.1 390 52-523 1-410 (410) 162 protein:vir:8100 Length: 466 # 92.9 0.0095 5.9E-06 31.7 21.2 436 15-559 1-466 (466) 163 protein:vir:733 Length: 453 # 92.8 0.0097 6E-06 31.6 26.5 416 15-535 1-453 (453) 164 protein:vir:9751 Length: 422 # 92.6 0.011 6.7E-06 31.4 23.8 408 27-522 1-422 (422) 165 protein:vir:102602 Length: 456 92.6 0.011 6.8E-06 31.3 23.8 427 27-543 1-456 (456) 166 protein:vir:105819 Length: 456 92.6 0.011 6.8E-06 31.3 23.8 427 27-543 1-456 (456) 167 protein:vir:103219 Length: 201 92.5 0.0024 1.5E-06 35.0 7.4 193 293-558 1-201 (201) 168 protein:vir:81218 Length: 423 91.6 0.015 9.5E-06 30.5 23.7 407 6-559 1-423 (423) 169 protein:vir:97060 Length: 432 91.5 0.016 9.8E-06 30.5 25.4 419 1-560 2-430 (432) 170 protein:vir:104259 Length: 403 90.7 0.019 1.2E-05 30.0 19.7 387 15-559 1-403 (403) 171 protein:vir:10362 Length: 432 90.7 0.019 1.2E-05 30.0 26.0 419 1-560 2-430 (432) 172 protein:vir:4854 Length: 386 # 89.6 0.026 1.6E-05 29.3 23.1 376 15-560 1-385 (386) 173 protein:vir:100328 Length: 346 89.0 0.029 1.8E-05 29.0 19.5 326 27-466 1-346 (346) 174 protein:vir:189 Length: 424 # 87.8 0.036 2.2E-05 28.5 25.1 402 1-556 10-424 (424) 175 protein:vir:79063 Length: 491 87.3 0.04 2.5E-05 28.3 29.9 413 1-560 1-420 (491) 176 protein:vir:99853 Length: 488 86.3 0.046 2.9E-05 27.9 27.8 414 27-560 1-427 (488) 177 protein:vir:94742 Length: 409 86.1 0.048 3E-05 27.8 25.4 395 27-507 1-409 (409) 178 protein:vir:3868 Length: 417 # 85.3 0.054 3.3E-05 27.5 21.3 409 15-559 1-417 (417) 179 protein:vir:94546 Length: 506 83.3 0.07 4.3E-05 26.9 27.5 439 10-560 1-505 (506) 180 protein:vir:94666 Length: 723 81.9 0.081 5.1E-05 26.6 26.6 415 39-560 1-440 (723) 181 protein:vir:1634 Length: 409 # 77.0 0.13 8.1E-05 25.4 21.5 395 27-507 1-409 (409) 182 protein:vir:99452 Length: 651 76.4 0.14 8.4E-05 25.3 25.2 483 15-560 1-566 (651) 183 protein:vir:1884 Length: 424 # 73.7 0.17 0.0001 24.8 23.9 405 1-556 8-424 (424) 184 protein:vir:98396 Length: 441 72.0 0.19 0.00012 24.6 21.6 415 1-559 1-441 (441) 185 protein:vir:108215 Length: 469 64.4 0.3 0.00019 23.4 29.6 440 39-559 1-469 (469) 186 protein:vir:4698 Length: 251 # 61.4 0.35 0.00022 23.1 13.7 246 15-341 1-251 (251) 187 protein:vir:4509 Length: 424 # 54.2 0.51 0.00032 22.2 28.7 408 1-558 8-424 (424) 188 protein:vir:95254 Length: 488 44.0 0.82 0.00051 21.1 27.3 455 29-560 1-482 (488) 189 protein:vir:1431 Length: 419 # 41.2 0.94 0.00058 20.7 25.1 399 22-560 1-417 (419) 190 protein:vir:107880 Length: 491 40.7 0.96 0.00059 20.7 31.1 414 1-560 1-420 (491) 191 protein:vir:78083 Length: 537 38.8 1.1 0.00065 20.5 27.3 431 42-560 1-525 (537) 192 protein:vir:8317 Length: 409 # 36.9 1.1 0.00071 20.3 20.7 391 6-540 1-409 (409) 193 protein:vir:1661 Length: 378 # 28.1 1.8 0.0011 19.2 25.9 369 9-559 1-378 (378) 194 protein:vir:98816 Length: 446 21.4 2.6 0.0016 18.3 26.1 416 29-531 1-446 (446) No 1 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=100.00 E-value=2.2e-274 Score=1520.83 Aligned_cols=509 Identities=31% Similarity=0.562 Sum_probs=485.3 Q ss_pred hhhhhhhhhhcccccchhhh-----ccccccccCCCCCCCCceeEecccccee--------ccccccchhhHHHHHHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDEL-----LLTKATVVALDDAQDGSIILQGGANTYN--------YVGVESELLSVKTVVEEYQ 81 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~-----~~~k~~s~~~~~~~dG~~~i~~~~~~~~--------y~~~e~~~~~~~~LI~~YR 81 (560) +.+|++-+..-++|.+++|+ .++++.|+++|+++||+++|+++.|+.+ |++++++++|++|||++|| T Consensus 1 ~~~~~~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR 80 (524) T protein:vir:10 1 MANFNTILSFLKPWANEDEKEYKQQINNNLESVTAPKLDDGAREIETQEQNIPYNALMQQMFGSNEPEVKNTRELIDTYR 80 (524) T ss_pred CCchhhHHHHhhhhhcchhhhhhhhhccCCCccccCCCCCCceeeccCcccccchhhhhhhhhcccchhhhHHHHHHHHH Confidence 55566666666777776665 5789999999999999999999877643 6678999999999999999 Q ss_pred HHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhh Q lcl|NC_019530. 82 SMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYH 161 (560) Q Consensus 82 ~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~h 161 (560) +||+|||||+||++|||||||||++++||+|+|++++ +|+++|++|+|||++||+||+|+++||++||+||||||+||| T Consensus 81 ~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~Ld~~~-~s~siK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fH 159 (524) T protein:vir:10 81 NLMNNYEVDNAVQEIVSDAIVYEDDKEVVALNLDGTD-FSQSIKDKILAEFSEVLNLLNFQRKGTDHFQRWYVDSRIFFH 159 (524) T ss_pred HHhhccchhhHHHHhhcceeEecCCCceEEEEecccC-cchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeceEEEE Confidence 9999999999999999999999999999999999998 899999999999999999999999999999999999999999 Q ss_pred heecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecch Q lcl|NC_019530. 162 VIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDD 241 (560) Q Consensus 162 kiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~ 241 (560) ||||++|||+||+|||+||||+|++||++.++. .++++++++++|||+|+|+..+++++|+ ++++++.++||. T Consensus 160 kiid~~~pk~GI~Elr~lDPr~i~~vr~i~~~~-~~~~~vi~~~~e~f~Y~~~~~~~~~~~~------~~~~~~~ikI~~ 232 (524) T protein:vir:10 160 KIINPKKMKDGVQELRRLDPRQVQYIREIVTRM-EDGVKIVDGYREFFVYDTGHESYCADGR------IYSAGTKVKIPR 232 (524) T ss_pred EEeeCCCccccceeeeeeCCccceeeeeecccC-cccchhhcchhhheeecCCCcccccCcc------eecCCcceecch Confidence 999999999999999999999999999999986 4668999999999999999888888886 567788999999 Q ss_pred hceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhhe Q lcl|NC_019530. 242 ESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRN 321 (560) Q Consensus 242 daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~ 321 (560) |||+||||||+|||+|+|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+ T Consensus 233 dAIvy~~SGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnlPk~KAeqYl~~im~k~kNKl 312 (524) T protein:vir:10 233 AAVVYAHSGLLDCCGKNIIGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRVFYIDTGNMPSRKAAAQMQHIMNTMKNRV 312 (524) T ss_pred hheeeeccCcccCCCCceeccchHhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCcee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCC--Ccccccc Q lcl|NC_019530. 322 AYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQEE--GSINIGG 399 (560) Q Consensus 322 vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~--~~~~l~~ 399 (560) |||++||||+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++ ++|++|| T Consensus 313 vYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~f~~gr 392 (524) T protein:vir:10 313 VYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTMPGATGMSDMDDVLYFRTALYRALRIPESRIPSESNSGVMFDA 392 (524) T ss_pred EEeccCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCchhccCCCCccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999766 4799998 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHH Q lcl|NC_019530. 400 SNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLAS 479 (560) Q Consensus 400 ~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~ 479 (560) + +||||||+||+|||+|||+|||.+|.++||+|||||||||++||++ |+++|+|+|++||||+|+||+|||++|+++ T Consensus 393 ~--~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~-i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~ 469 (524) T protein:vir:10 393 G--TAITRDELKFAKWIRQLQNKFEEIFLDPLKTNLILKKIITEDEWER-EINNIKVTFNRDSYFSEMKDAEIMERRINM 469 (524) T ss_pred c--chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHH-HhhcceEEeeecchHHHHHHHHHHHHHHHH Confidence 6 8999999999999999999999999999999999999999999987 999999999999999999999999999999 Q ss_pred HHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccc Q lcl|NC_019530. 480 LNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGN 534 (560) Q Consensus 480 l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~ 534 (560) |+++|||||||||++||||+||||||+||++++|||++|+++|+|++|+++++-= T Consensus 470 l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 470 LTMAEPFIGKYISHQTAMKDFLQMTDEEINQEAKQIEEESKEARFQNPDEEEEDF 524 (524) T ss_pred HHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCChhhhcC Confidence 9999999999999999999999999999999999999999999999998875422 No 2 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=100.00 E-value=5.7e-274 Score=1518.62 Aligned_cols=530 Identities=32% Similarity=0.566 Sum_probs=489.4 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHH Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEY 80 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~Y 80 (560) |+ ..=.+|-+++++..++++|+++|+++||++.+.+|+++++|+++++.++|++|||++| T Consensus 1 m~--------------------~lfgf~~~~~~~~~~~~~s~~~p~~ddg~~~~~~~g~~~~~~~~~~~~~~~~eLI~~Y 60 (558) T protein:vir:10 1 MA--------------------KLFGFSIEETQKKSTSIISPVPKNNEDGVDNFISSGFYGQYVDIEGAYRSEYDLIRRY 60 (558) T ss_pred Cc--------------------chhcchhhhhhhhccCCccccCCCccccccceeccceeeeeecccchhhhHHHHHHHH Confidence 21 1112445678899999999999999999999999999999999999999999999999 Q ss_pred HHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhh Q lcl|NC_019530. 81 QSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAY 160 (560) Q Consensus 81 R~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~ 160 (560) |+||+|||||+||++|||||||||++++||+|+|++++ +|+++|++|+|||++||+||+|+++||++||+||||||+|| T Consensus 61 R~ma~~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~-~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRiyf 139 (558) T protein:vir:10 61 REMALHPEADGAIEDVVNEAIVSDLYDSPVEVELSNLN-ASNTLKKKIREEFRYIKEMMDFDKKSHEIFRNWYVDGRVFY 139 (558) T ss_pred HHHhhccchhhHHHHhhcceeEecCCCceEEEEecccC-cchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEE Confidence 99999999999999999999999999999999999998 79999999999999999999999999999999999999999 Q ss_pred hheecCCCCccCeeEeEEeCccccchhhhhcccCCcc----------ccccccccchheeecCccccccccccccccccc Q lcl|NC_019530. 161 HVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREG----------GIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNF 230 (560) Q Consensus 161 hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g----------~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~ 230 (560) |||||++|||+||+|||+||||+|++||+++++..++ ++..+.++.+||+|+|+..+++..++ . T Consensus 140 HKiid~k~pk~GI~ELr~lDPr~i~~Vr~i~~~~~~~~~~~~~~~~~~~~~~~~~~eyy~Y~~~~~~~~~~~~------~ 213 (558) T protein:vir:10 140 LKVIDTKNPQEGIQDLRYIDPLKIKFIRQEKRKPGNQDPAIRVRSEQDVVPNPEFEEFYIYTPKVQHPTGMVG------Q 213 (558) T ss_pred EEEEeCCCccccceeeeeeCcccceeeeeeccccccccceeeeecccceeeccceeEeeeecCCcccccccce------e Confidence 9999999999999999999999999999998874322 34556788999999998665543322 3 Q ss_pred cccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHH Q lcl|NC_019530. 231 QPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYM 310 (560) Q Consensus 231 ~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl 310 (560) .+.++.++||.|||+||||||+|||+|+|+||||+|||||||||||||||||||+||||||||||||||||||.|||||| T Consensus 214 ~~~~~~vkI~~dAI~y~hSGL~d~~~~~i~syLhkAIKp~NQLkmlEDAlVIYRitRAPERRvFYIDVGnLPk~KAeqYl 293 (558) T protein:vir:10 214 MGGKNSIKIAKDSITMCTSGLVDRNKNRVLSYLHKAIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYL 293 (558) T ss_pred ecCCCceeechhheeeecccceecCCCeeeecchHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHH Confidence 45677899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccC Q lcl|NC_019530. 311 TMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQ 390 (560) Q Consensus 311 ~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~ 390 (560) ++||++||||+|||++||+|+|++||||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+ T Consensus 294 r~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnLgem~DV~YF~kKLy~aLnVP~SRl~ 373 (558) T protein:vir:10 294 KEVMSRYRNKLVYDANTGEVRDDRKFMSMMEDFWLPRREGGRGTEITTLPGGQNLGELSDVDYFQKKLYRALGVPESRIA 373 (558) T ss_pred HHHHHhccceEEEeccCceecccchhhhhHhhhcccccCCCCccceeeccccCCcchHHHHHHHHHHHHHHhCCCccccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHH Q lcl|NC_019530. 391 EEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQEN 470 (560) Q Consensus 391 ~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~ 470 (560) ++++|++||+ +||||||+||+|||.|||+|||.+|.++||+|||||||||++||++ |+++|+|+|++||||+|+||+ T Consensus 374 ~e~~f~~Gr~--~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~-i~~~I~~~f~~Dn~f~ElKe~ 450 (558) T protein:vir:10 374 AEGGFNLGRS--SEILRDELKFAKFVGRLRKRFAAMFNDMLKTQLVLKNIVTPEDWKT-MEDHIQYDFLYDNQFAELKES 450 (558) T ss_pred CCCccccccc--chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHH-HhhcceEEeeecchHHHHHHH Confidence 9999999997 8999999999999999999999999999999999999999999986 999999999999999999999 Q ss_pred HHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccc--c-------ccccccccC Q lcl|NC_019530. 471 AILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADE--T-------GNYSGSDVS 541 (560) Q Consensus 471 Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e--~-------g~~~~~~~~ 541 (560) |||++|+++|+++|||||||||++||||+||||||+||++++|||++|+++|+|++|++++ . |+.++++.+ T Consensus 451 Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeeI~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~ 530 (558) T protein:vir:10 451 ELMEGRLGMLATIEPYIGKYYSTEYVRKRVLRQTDMEIEEIDTQIEDEIQKGIIPDPSQIDPITGEPLPQEGDPAMEGMG 530 (558) T ss_pred HHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCccccChhhccccCccCCchhccCC Confidence 9999999999999999999999999999999999999999999999999999999999763 2 344456666 Q ss_pred CCCCCccccccc---CC-cccCC Q lcl|NC_019530. 542 PLKFKPETIPFS---GS-TDDSI 560 (560) Q Consensus 542 ~~~~~p~~~~~~---~~-~~~~~ 560 (560) ..|+.|++--.+ ++ +|..| T Consensus 531 ~~~~~~~~~~~~~~~~~~~~~~~ 553 (558) T protein:vir:10 531 EQPVDPDLEAQAQAVDAQYSKDT 553 (558) T ss_pred CCCcccccccchhhhhhhhhhhh Confidence 667777654222 11 23333 No 3 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=100.00 E-value=4.5e-273 Score=1513.68 Aligned_cols=507 Identities=33% Similarity=0.567 Sum_probs=482.0 Q ss_pred hhhhhhhhhhcccccchhh-----hccccccccCCCCCCCCceeEecccccee----------ccccccchhhHHHHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDE-----LLLTKATVVALDDAQDGSIILQGGANTYN----------YVGVESELLSVKTVVEE 79 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e-----~~~~k~~s~~~~~~~dG~~~i~~~~~~~~----------y~~~e~~~~~~~~LI~~ 79 (560) |.+ +.+..-++|.+++| +.++++.|+++|+++||+++++++++++. |++++++++|++|||++ T Consensus 1 m~f--~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~ 78 (523) T protein:vir:68 1 MKF--NILSLFAPWAKMDERDYKDQEKENLESITSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLKSTRELIDT 78 (523) T ss_pred CCC--chhhhhhhhhhhhhhhhhhhhhccCCCccccCCCCcceeeeccccccccccchhhhhhhhccccccchHHHHHHH Confidence 222 23444455666655 46788899999999999999998766553 88999999999999999 Q ss_pred HHHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhh Q lcl|NC_019530. 80 YQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQA 159 (560) Q Consensus 80 YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~ 159 (560) ||+||+|||||+||++|||||||||++++||+|+|++++ +|+++|++|+|||++||++|+|+++||+|||+||||||+| T Consensus 79 YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~-~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~ 157 (523) T protein:vir:68 79 YRNLMTNYEVDNAVSEIVSDAIVYEDDTEVVSINLDNTK-FSPNIKSMMLDEFNEVLNHLSFQRKGSDHFRRWYVDSRIF 157 (523) T ss_pred HHHHhhccchhhHHHHhhcceeeecCCCceEEEEecccc-cchHHHHHHHHHHHHHHHHhccchhhhHHHHhheeeeEEE Confidence 999999999999999999999999999999999999997 8999999999999999999999999999999999999999 Q ss_pred hhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceec Q lcl|NC_019530. 160 YHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVF 239 (560) Q Consensus 160 ~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i 239 (560) ||||||++|||+||+|||+||||+|++||++.+++. +|++++++++|||+|+|+...++++|+ ++++++.++| T Consensus 158 fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~-~g~~vi~~~~e~f~Y~~~~~~~~~~g~------~~~~~~~ikI 230 (523) T protein:vir:68 158 FHKIIDPKRPKEGIKELRRLDPRQVQYVREVITTTE-AGVKIVKGYKEYFIYDTSHESYACDGR------IYEAGTKIKI 230 (523) T ss_pred EEEEeeCCCccccceeeeeeCCcceeEEEeecCCCC-cchhhhhhhhhheeecccccccccccc------ccCCCcceec Confidence 999999999999999999999999999999999875 778999999999999999999999997 5667789999 Q ss_pred chhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhh Q lcl|NC_019530. 240 DDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKN 319 (560) Q Consensus 240 ~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~kn 319 (560) |.||||||||||+|||+|+|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||| T Consensus 231 ~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kN 310 (523) T protein:vir:68 231 PKAAIVYAHSGLVDCCGKNIIGYLHRAIKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPSRKAAEHMQHVMNTMKN 310 (523) T ss_pred chhheeeeeccceeCCCCceeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhhcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred heeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCC-ccccc Q lcl|NC_019530. 320 RNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQEEG-SINIG 398 (560) Q Consensus 320 k~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~-~~~l~ 398 (560) |+|||++||||+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++ +|++| T Consensus 311 KlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~G 390 (523) T protein:vir:68 311 RIAYDATTGKIKNQQHIMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDVRWFRNALYMALRIPITRIPSDQGGIQFD 390 (523) T ss_pred eeEEeccCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCcceeecCCCcceecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999998774 79999 Q ss_pred ccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHH Q lcl|NC_019530. 399 GSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLA 478 (560) Q Consensus 399 ~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~ 478 (560) |+ +||||||+||+|||+|||+||+.+|.++||+|||||||||++||++ |+++|+|+|++||||+|+||+|||++|++ T Consensus 391 r~--~EItRDEikF~KFI~rLR~rFs~lf~~~Lk~qLilKgiit~eew~~-i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~ 467 (523) T protein:vir:68 391 AG--TSITRDELSFGKFIRELQHKFEEIFLDPLKTNLILKGIITEDEWND-EINNIKIKFHRDSYFSELKDAEILERRIN 467 (523) T ss_pred cc--cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHH-HhhcceEeeeecchHHHHHHHHHHHHHHH Confidence 96 8999999999999999999999999999999999999999999987 89999999999999999999999999999 Q ss_pred HHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccc Q lcl|NC_019530. 479 SLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGN 534 (560) Q Consensus 479 ~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~ 534 (560) +|+++|||||||||++||||+||+|||+||++++|||++|+++|+|++|+++++-= T Consensus 468 ~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~e~~~f 523 (523) T protein:vir:68 468 MLQMAEPFIGKYISHRTAMKDILQMSDEEIEQEAKQIEEESKEARFQDPDQEQEDF 523 (523) T ss_pred HHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhhcC Confidence 99999999999999999999999999999999999999999999999999875432 No 4 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=100.00 E-value=1.5e-272 Score=1510.84 Aligned_cols=510 Identities=31% Similarity=0.525 Sum_probs=485.8 Q ss_pred hhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEec---------cccceeccccccchhhHHHHHHHHH Q lcl|NC_019530. 11 LFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQG---------GANTYNYVGVESELLSVKTVVEEYQ 81 (560) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~---------~~~~~~y~~~e~~~~~~~~LI~~YR 81 (560) .|-.+.++..|...+.- +.+++++++++|+++|+++||++++++ |+++++|++++++++|++|||++|| T Consensus 1 ~~~~l~~~~~~~~~~~~--~~~~~~~~~~~s~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~~~~~~~eLI~~YR 78 (521) T protein:vir:81 1 MFSRLKMLARWADFDND--KYEEQIKDKAESIAAPKNNDGATEVEINDNLPASAWNSLTQQFYSTDQKISTTKQLVNTYR 78 (521) T ss_pred CcchhhhhHhhcCchhh--hHHhhhccCccccccCCCCCCceEecccCCCcceeecceeeeecccccchhhHHHHHHHHH Confidence 67777777666555422 457788999999999999999998865 5678999999999999999999999 Q ss_pred HHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhh Q lcl|NC_019530. 82 SMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYH 161 (560) Q Consensus 82 ~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~h 161 (560) +||+|||||+||++|||||||||++++||+|+|++++ +|+++|++|+|||++||+||+|+++||++||+||||||+||| T Consensus 79 ~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~-~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fh 157 (521) T protein:vir:81 79 GLMNNHEVENAVQNIVNDAIVFEEGHEVVSLNLEATG-FSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFH 157 (521) T ss_pred HHhhccchhhHHHHhhcceeEecCCCceEEEEecccc-cchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEE Confidence 9999999999999999999999999999999999997 899999999999999999999999999999999999999999 Q ss_pred heecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecch Q lcl|NC_019530. 162 VIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDD 241 (560) Q Consensus 162 kiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~ 241 (560) |||| +|||+||+|||+||||+|++||++++.+..+ ++++++++|||+|+|+..+++++|+. .++++.++||. T Consensus 158 kiid-~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~-~~v~~~~~e~f~Y~~~~~~~~~~g~~------~~~~~~vkI~~ 229 (521) T protein:vir:81 158 KIIG-KNPKDGIVELRQLDPRNLEYVREIITEDTPE-GKIYKATKEYFIYTVGNSSYCAGGQV------FSPNSRVKIPR 229 (521) T ss_pred EEEc-CCccccceeeeeeCCcceeeeeeecccccCc-cceecceeeeeeeecCCcccccccee------ecCCcceeech Confidence 9999 8999999999999999999999999988755 79999999999999999999888873 34567899999 Q ss_pred hceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhhe Q lcl|NC_019530. 242 ESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRN 321 (560) Q Consensus 242 daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~ 321 (560) |||+||||||+|||+|+|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+ T Consensus 230 dAI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~im~k~kNkl 309 (521) T protein:vir:81 230 SAITYAHSGLMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRV 309 (521) T ss_pred hheeeeeccceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCcee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCC--CCcccccc Q lcl|NC_019530. 322 AYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQE--EGSINIGG 399 (560) Q Consensus 322 vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~--~~~~~l~~ 399 (560) |||++||||+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++ +++|++|| T Consensus 310 vYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr 389 (521) T protein:vir:81 310 VYDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGD 389 (521) T ss_pred EeecccccccccccccchhhhhcccccCCCcccceeecccCCCCChHHHHHHHHHHHHHHhCCccccccCCCCcceeccc Confidence 9999999999999999999999999999999999999999999999999999999999999999999954 45788888 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHH Q lcl|NC_019530. 400 SNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLAS 479 (560) Q Consensus 400 ~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~ 479 (560) + +||||||+||+|||+|||+|||.+|.++||+|||||||||+|||++ |+++|+|+|++||||+|+||+|||++|+++ T Consensus 390 ~--~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~-i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~ 466 (521) T protein:vir:81 390 G--SEITRDELEFSKFIRTRQSQFSEVLRDPLKYNLILKNVITEDDWDR-EINNIKVVFHRDSYYTEVKDAEILERRIGL 466 (521) T ss_pred c--chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHH-HhhcceEEEeecchHHHHHHHHHHHHHHHH Confidence 6 8999999999999999999999999999999999999999999987 999999999999999999999999999999 Q ss_pred HHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccc Q lcl|NC_019530. 480 LNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGN 534 (560) Q Consensus 480 l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~ 534 (560) |+++|||||||||++||||+||||||+||++++|||++|+++|+|++|+++++|= T Consensus 467 l~~~dpyvGky~s~dyi~k~ILr~tDeei~~~~k~I~~E~~~~~~~~p~~~~~~f 521 (521) T protein:vir:81 467 IERITPYIGKYFSNQTVMRDILKYTDDQMDTEKKQIEEEANDPRFKQTPDEIEDF 521 (521) T ss_pred HHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCCcccccCC Confidence 9999999999999999999999999999999999999999999999999887665 No 5 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=100.00 E-value=2.1e-272 Score=1509.96 Aligned_cols=515 Identities=33% Similarity=0.608 Sum_probs=486.1 Q ss_pred chhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEec--------cccceeccccccchhhHHHHH Q lcl|NC_019530. 6 RGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQG--------GANTYNYVGVESELLSVKTVV 77 (560) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~--------~~~~~~y~~~e~~~~~~~~LI 77 (560) -.|.|+|++++.++-|.+.++. ..+++++++++|+++|+++||++++++ |+++..|++.+++++|++||| T Consensus 1 ~~~~~~~~~l~~~~~~~~~d~~--~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~y~~~e~~~~~~~eLI 78 (524) T protein:vir:98 1 MNFLGFGNVLSFFKNFAREDEI--ELEQQLKNDTGSVAPPKNNDGAYEIETDLNNQKYAGVFQQFYSGQDPAIQNKEQLI 78 (524) T ss_pred CCCcchhhHHHHhhhhhhhhhh--hHhhhhcCCcccccCCCCCCCceeecCCCCcceecceeeeeccccccccchHHHHH Confidence 4455666777777766666655 556679999999999999999999974 566677889999999999999 Q ss_pred HHHHHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhh Q lcl|NC_019530. 78 EEYQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGR 157 (560) Q Consensus 78 ~~YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr 157 (560) ++||+||+|||||+||++||||||||+++++||+|+|++++ +|+++|++|+|||++||+||+|+++||++||+|||||| T Consensus 79 ~~YR~ma~~pEvd~Av~eIVneaIv~~~~~~pV~l~L~~~~-~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgR 157 (524) T protein:vir:98 79 NTYRGIMSYPEVENAVSEIIDDAIVNEQGKDIITMDLAKTN-FSKAIQDKIVEEFDNVLNIYDFDNMGARLFRDWYVDSR 157 (524) T ss_pred HHHHHHhhccchhhHHHhhhcceeEecCCCceEEEEecccc-cchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcce Confidence 99999999999999999999999999999999999999997 89999999999999999999999999999999999999 Q ss_pred hhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccce Q lcl|NC_019530. 158 QAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQEL 237 (560) Q Consensus 158 ~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~ 237 (560) +||||||| ++|++||+|||+||||+|++||++.++..++++.++++++|||+|+|.+.+++++|+ +.++++.+ T Consensus 158 i~fhkiid-~~~~kGI~ELr~lDPr~i~~vr~~~~~~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~------~~~~~~~i 230 (524) T protein:vir:98 158 IYFHKIMH-KDESKGIRELRQLDPRCMELIRESITETLDGGVKVFRGYREFFVYSAPKAGYTYNGQ------IYQANQKI 230 (524) T ss_pred eEEEEEEc-CCCCcceeeeeeeCCccceeeeeccccccccchhhccceeeeeeeccCCCccccccc------eecCCCce Confidence 99999999 666789999999999999999999999888889999999999999999988888776 44556789 Q ss_pred ecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhh Q lcl|NC_019530. 238 VFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKF 317 (560) Q Consensus 238 ~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~ 317 (560) +||.|||+||||||+||+++ |+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++| T Consensus 231 kI~~dAIvy~hSGL~d~~~~-iisyLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~ 309 (524) T protein:vir:98 231 KIPRSAIVYAHSGLEDCSNN-IIGYLHRAVKPANQLRLLEDAMVIYRITRAPERRVFYIDVGQMGGNKATQYVNNIAQGL 309 (524) T ss_pred eechhheeeeccCcccCCCC-eeeehhHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhc Confidence 99999999999999998876 67999999999999999999999999999999999999999999999999999999999 Q ss_pred hhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccC-CCCccc Q lcl|NC_019530. 318 KNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQ-EEGSIN 396 (560) Q Consensus 318 knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~-~~~~~~ 396 (560) |||+||||+||+|+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+ ++++|+ T Consensus 310 kNklvYDa~TGevrddrk~msMlEDyWLpRReGgrgTEItTLpggqnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~ 389 (524) T protein:vir:98 310 KNRVVYDARTGTVKNQQNNLSMTEDYWLMRRDGKAITEVSTLPGGQNFSDMDDIKWFNRKLYEALRVPLSRMPRDDGGMQ 389 (524) T ss_pred CceeEeeccCceeeccccccchhhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCceeccCCCCccc Confidence 9999999999999999999999999999999999999999999999999999999999999999999999997 467999 Q ss_pred ccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHH Q lcl|NC_019530. 397 IGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDR 476 (560) Q Consensus 397 l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~R 476 (560) +||+ +||||||+||+|||.|||+|||.+|.++||+|||||||||+|||++ |+++|+|+|++||||+|+||+|||++| T Consensus 390 ~Gr~--~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~-i~~~I~~~f~~Dn~f~ElKe~Eil~~R 466 (524) T protein:vir:98 390 IGGG--GEITRDELKFSKFIRTLQIQFSPVLSDPLKTNLIAKKIITEDEWEE-NVSKISFVFQQDSYYAEVKDIEILERR 466 (524) T ss_pred cccc--cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHH-HhhcceEEEeecchHHHHHHHHHHHHH Confidence 9996 8999999999999999999999999999999999999999999987 899999999999999999999999999 Q ss_pred HHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccc Q lcl|NC_019530. 477 LASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNY 535 (560) Q Consensus 477 l~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~ 535 (560) +++|+++|||||||||++||||+||||||+||++++|||++|+++|+|++|+++++- + T Consensus 467 ~~~l~~~dpyvGky~s~dyi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~p~~e~~~-f 524 (524) T protein:vir:98 467 LNLMSQVEGVVGKYVSHKYIMKEILRMSDEDIDEQAKLIEEESKEERFKNPEAEEEN-F 524 (524) T ss_pred HHHHHHhccccccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCcCCcccccc-C Confidence 999999999999999999999999999999999999999999999999999987532 2 No 6 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=100.00 E-value=6.6e-272 Score=1507.30 Aligned_cols=510 Identities=31% Similarity=0.529 Sum_probs=483.6 Q ss_pred hhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEec---------cccceeccccccchhhHHHHHHHHH Q lcl|NC_019530. 11 LFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQG---------GANTYNYVGVESELLSVKTVVEEYQ 81 (560) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~---------~~~~~~y~~~e~~~~~~~~LI~~YR 81 (560) .|-.+.++..|..-+.- +.+|+++++++|+++|+++||+++++. |+++++|++.+++++|++|||++|| T Consensus 1 ~~~~l~~~~~~~~~d~~--~~~e~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~~~~~~~eLI~~YR 78 (521) T protein:vir:65 1 MFSRLKMLARWADFDND--KYEEQIKDKAESIAAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQKISTTKQLVNTYR 78 (521) T ss_pred CccchhhhhhccCchhh--HHHhhhccCCCcccCCCCCCCceeecccCCccccccccceeeeccccchhhhHHHHHHHHH Confidence 55556555555444321 457889999999999999999999963 6788889999999999999999999 Q ss_pred HHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhh Q lcl|NC_019530. 82 SMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYH 161 (560) Q Consensus 82 ~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~h 161 (560) +||+|||||+||++|||||||||++++||+|+|++++ +|+++|++|+|||++||+||+|+++||++||+||||||+||| T Consensus 79 ~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~-~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fh 157 (521) T protein:vir:65 79 GLMNNHEVENAVQNIVNDAIVFEEGHEVVSLNLEATG-FSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFH 157 (521) T ss_pred HHhhccchhhHHHHhhcceeEecCCCceEEEEecccc-cchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceeEEE Confidence 9999999999999999999999999999999999997 899999999999999999999999999999999999999999 Q ss_pred heecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecch Q lcl|NC_019530. 162 VIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDD 241 (560) Q Consensus 162 kiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~ 241 (560) |||| +|||+||+|||+||||+|++||++++.+..+ ++++++++|||+|+|+..+++++|+. .++++.++||. T Consensus 158 kiid-~~pk~GI~ELr~lDPr~i~~vr~i~k~~~~~-~~v~~~~~e~f~Y~~~~~~~~~~g~~------~~~~~~vkI~~ 229 (521) T protein:vir:65 158 KIIG-KNPKDGIVELRQLDPRNLEYVREIITEDTPE-GKIYKATKEYFIYTVGNSSYCAGGQV------FSPNSRVKIPR 229 (521) T ss_pred EEEc-CCccccceeeeeeCCcceeeeeeecccccCC-cceecceeeeeeeecCCcceecccee------ecCCcceeech Confidence 9999 8999999999999999999999999988755 69999999999999999999988873 33467899999 Q ss_pred hceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhhe Q lcl|NC_019530. 242 ESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRN 321 (560) Q Consensus 242 daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~ 321 (560) |||+||||||+|||+|+|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+ T Consensus 230 dAI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNkl 309 (521) T protein:vir:65 230 SAITYAHSGLMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRV 309 (521) T ss_pred hheeeeeccceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCcee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCC--CCcccccc Q lcl|NC_019530. 322 AYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQE--EGSINIGG 399 (560) Q Consensus 322 vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~--~~~~~l~~ 399 (560) |||++||||+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+. +++|++|| T Consensus 310 vYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr 389 (521) T protein:vir:65 310 VYDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGD 389 (521) T ss_pred EeecccccccccccccchhhhhcccccCCCCccceeecccCCCcChHHHHHHHHHHHHHHhCCCceeccCCCCcceeccc Confidence 9999999999999999999999999999999999999999999999999999999999999999999854 45788888 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHH Q lcl|NC_019530. 400 SNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLAS 479 (560) Q Consensus 400 ~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~ 479 (560) + +||||||+||+|||+|||+|||.+|.++||+|||||||||+|||++ |+++|+|+|++||||+|+||+|||++|+++ T Consensus 390 ~--~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~-i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~ 466 (521) T protein:vir:65 390 G--SEITRDELEFSKFIRTLQSQFSEVLRDPLKYNLILKNVITEDDWDR-EINNIKVVFHRDSYYTEVKDAEILERRIGL 466 (521) T ss_pred c--chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHH-HhhcceEEeeecchHHHHHHHHHHHHHHHH Confidence 6 8999999999999999999999999999999999999999999987 899999999999999999999999999999 Q ss_pred HHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccc Q lcl|NC_019530. 480 LNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGN 534 (560) Q Consensus 480 l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~ 534 (560) |+++|||||||||++||||+||||||+||++++|||++|+++|+|++|+++++|= T Consensus 467 l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~k~I~~E~~~~~~~~p~~~~~~f 521 (521) T protein:vir:65 467 IERITPYIGKYFSNQTVMRDILKYTDDQMDTEKKQIEEEANDPRFKQTPDEIEDF 521 (521) T ss_pred HHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHhhhCCCCCCCcccccCC Confidence 9999999999999999999999999999999999999999999999999887665 No 7 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=100.00 E-value=6.2e-272 Score=1507.43 Aligned_cols=512 Identities=31% Similarity=0.565 Sum_probs=477.3 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEec------cccceecccc----ccch Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQG------GANTYNYVGV----ESEL 70 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~------~~~~~~y~~~----e~~~ 70 (560) |+ |..|++++.|.+-|.. ...++.+++..|+++|+++||+++++. +++++.|+++ ++++ T Consensus 1 m~---------~~~L~~~~~w~~~de~--~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~ 69 (524) T protein:vir:10 1 MK---------FNVLSLFAPWAKMDER--NFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGM 69 (524) T ss_pred CC---------CchhhHhhccccCcch--hhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhccccccc Confidence 22 2334444333333322 223446789999999999999999965 4566666665 5568 Q ss_pred hhHHHHHHHHHHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHH Q lcl|NC_019530. 71 LSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIR 150 (560) Q Consensus 71 ~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R 150 (560) +|++|||++||+||+|||||+||++|||||||||++++||+|+|++++ +|+++|++|+|||++||++|+|+++||+||| T Consensus 70 ~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~-~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR 148 (524) T protein:vir:10 70 KTTRELIDTYRNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSK-FSPKIKNMMLDEFNDVLNHLSFQRKGSDHFR 148 (524) T ss_pred chHHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEecCcC-cchHHHHHHHHHHHHHHHHhccchhhhHHHh Confidence 999999999999999999999999999999999999999999999997 8999999999999999999999999999999 Q ss_pred HhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccc Q lcl|NC_019530. 151 KWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNF 230 (560) Q Consensus 151 ~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~ 230 (560) +||||||+|||||||++|||+||+|||+||||+|++||++.++.. ++.+++++++|||+|+|+..+++++|+ . T Consensus 149 ~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~-~~~~vi~~~~e~f~Y~~~~~~y~~~g~------~ 221 (524) T protein:vir:10 149 RWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREIITETE-AGTKIVKGYKEYFIYDTAHESYACDGR------M 221 (524) T ss_pred hheeeeEEEEEEEeeCCCccccceeeeeeCCccceeeeeeccCCC-ccchhhcchhhheeeccCccccccCcc------c Confidence 999999999999999999999999999999999999999999865 668999999999999999999999987 4 Q ss_pred cccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHH Q lcl|NC_019530. 231 QPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYM 310 (560) Q Consensus 231 ~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl 310 (560) +++++.++||.||||||||||+|||+|+|+||||+|||||||||||||||||||+||||||||||||||||||.|||||| T Consensus 222 ~~~~~~ikI~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl 301 (524) T protein:vir:10 222 YEAGTKIKIPKAAIVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHM 301 (524) T ss_pred cCCCcceecchhheeeeeccceeCCCCceeccchhhhHHHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHH Confidence 56678999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccC Q lcl|NC_019530. 311 TMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQ 390 (560) Q Consensus 311 ~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~ 390 (560) ++||++||||+|||++||+|+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+ T Consensus 302 ~~im~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~ 381 (524) T protein:vir:10 302 QHVMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDVRWFRQALYMALRVPLSRIP 381 (524) T ss_pred HHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCchhhcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999997 Q ss_pred CC--CcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHH Q lcl|NC_019530. 391 EE--GSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQ 468 (560) Q Consensus 391 ~~--~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elk 468 (560) ++ ++|++||+ +||||||+||+|||.|||+|||.+|.++||+|||||||||++||++ |+++|+|+|++||||+|+| T Consensus 382 ~d~~~~f~~gr~--~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~-i~~~I~~~f~~Dn~f~ElK 458 (524) T protein:vir:10 382 QDQQGGVMFDSG--TSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWND-EINNIKIEFHRDSYFTELK 458 (524) T ss_pred CCCCcccccccc--chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHH-HhhcceEEeeecchHHHHH Confidence 76 56999986 8999999999999999999999999999999999999999999987 9999999999999999999 Q ss_pred HHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccc Q lcl|NC_019530. 469 ENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGN 534 (560) Q Consensus 469 e~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~ 534 (560) |+|||++|+++|+++|||||||||++||||+||+|||+||++++|||++|+++|+|++|+++++-= T Consensus 459 e~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 459 EAEILERRINMLTMAEPFIGKYISHRTAMKDILQMTDEEIEQEAKQIEEESKEARFQDPDQEQEDF 524 (524) T ss_pred HHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhhcC Confidence 999999999999999999999999999999999999999999999999999999999999875432 No 8 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=100.00 E-value=7.1e-272 Score=1507.11 Aligned_cols=512 Identities=31% Similarity=0.564 Sum_probs=477.2 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEec------cccceecccc----ccch Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQG------GANTYNYVGV----ESEL 70 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~------~~~~~~y~~~----e~~~ 70 (560) |+ |..|++++.|.+-|.. ...++.+++..|+++|+++||+++++. +++++.|+++ ++++ T Consensus 1 m~---------~~~L~~~~~w~~~de~--~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~ 69 (524) T protein:vir:72 1 MK---------FNVLSLFAPWAKMDER--NFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGM 69 (524) T ss_pred CC---------CchhhHhhccccCcch--hhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhccccccc Confidence 22 2334444333333322 223446789999999999999999965 4566666665 5568 Q ss_pred hhHHHHHHHHHHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHH Q lcl|NC_019530. 71 LSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIR 150 (560) Q Consensus 71 ~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R 150 (560) +|++|||++||+||+|||||+||++|||||||||++++||+|+|++++ +|+++|++|+|||++||++|+|+++||+||| T Consensus 70 ~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~-~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR 148 (524) T protein:vir:72 70 KTTRELIDTYRNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSK-FSPKIKNMMLDEFSDVLNHLSFQRKGSDHFR 148 (524) T ss_pred chHHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEecCcC-cchHHHHHHHHHHHHHHHHhccchhhhHHHh Confidence 999999999999999999999999999999999999999999999997 8999999999999999999999999999999 Q ss_pred HhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccc Q lcl|NC_019530. 151 KWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNF 230 (560) Q Consensus 151 ~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~ 230 (560) +||||||+|||||||++|||+||+|||+||||+|++||++.++.. ++.+++++++|||+|+|+..+++++|+ . T Consensus 149 ~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~-~~~~vi~~~~e~f~Y~~~~~~y~~~g~------~ 221 (524) T protein:vir:72 149 RWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREIITETE-AGTKIVKGYKEYFIYDTAHESYACDGR------M 221 (524) T ss_pred hheeeeEEEEEEEEeCCCccccceeeeeeCCccceeeeeeccCCC-ccchhhcchhhheeeccCccccccCcc------c Confidence 999999999999999999999999999999999999999999865 668999999999999999999999987 4 Q ss_pred cccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHH Q lcl|NC_019530. 231 QPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYM 310 (560) Q Consensus 231 ~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl 310 (560) +++++.++||.||||||||||+|||+|+|+||||+|||||||||||||||||||+||||||||||||||||||.|||||| T Consensus 222 ~~~~~~ikI~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl 301 (524) T protein:vir:72 222 YEAGTKIKIPKAAVVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHM 301 (524) T ss_pred cCCCcceecchhheeeeeccceeCCCCceeccchhhhHhHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHH Confidence 56678999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccC Q lcl|NC_019530. 311 TMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQ 390 (560) Q Consensus 311 ~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~ 390 (560) ++||++||||+|||++||+|+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+ T Consensus 302 ~~im~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~ 381 (524) T protein:vir:72 302 QHVMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDIRWFRQALYMALRVPLSRIP 381 (524) T ss_pred HHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCchhhcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999997 Q ss_pred CC--CcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHH Q lcl|NC_019530. 391 EE--GSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQ 468 (560) Q Consensus 391 ~~--~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elk 468 (560) ++ ++|++||+ +||||||+||+|||.|||+|||.+|.++||+|||||||||++||++ |+++|+|+|++||||+|+| T Consensus 382 ~d~~~~f~~gr~--~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~-i~~~I~~~f~~Dn~f~ElK 458 (524) T protein:vir:72 382 QDQQGGVMFDSG--TSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWND-EINNIKIEFHRDSYFAELK 458 (524) T ss_pred CCCCcccccccc--chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHH-HhhcceEEeeecchHHHHH Confidence 76 56999986 8999999999999999999999999999999999999999999987 9999999999999999999 Q ss_pred HHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccc Q lcl|NC_019530. 469 ENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGN 534 (560) Q Consensus 469 e~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~ 534 (560) |+|||++|+++|+++|||||||||++||||+||+|||+||++++|||++|+++|+|++|+++++-= T Consensus 459 e~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:72 459 EAEILERRINMLTMAEPFIGKYISHRTAMKDILQMTDEEIEQEAKQIEEESKEARFQDPDQEQEDF 524 (524) T ss_pred HHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhhcC Confidence 999999999999999999999999999999999999999999999999999999999999875422 No 9 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=100.00 E-value=3e-271 Score=1503.68 Aligned_cols=523 Identities=36% Similarity=0.651 Sum_probs=486.8 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHH Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEY 80 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~Y 80 (560) ||- .+.+-.+ ++++.+++..|+++|+++||++.+.+++++++|++++++++|++|||++| T Consensus 1 ~~~-----------------~lfg~~i---~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~g~~~~~e~~~~~~~eLI~~Y 60 (537) T protein:vir:10 1 MAQ-----------------QLFGFSL---QRAKKVPKGPSFVQKDSLDGSQPIVGGGYFGYSVDFDGTIRNDHELITRY 60 (537) T ss_pred Ccc-----------------cccccee---ecccccccCCcccCCCcccccceeecccccccccccccccchHHHHHHHH Confidence 321 1222222 23356789999999999999999999999999999999999999999999 Q ss_pred HHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhh Q lcl|NC_019530. 81 QSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAY 160 (560) Q Consensus 81 R~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~ 160 (560) |+||+|||||+||++|||||||||++++||+|+|++++ +|+++|++|+|||++||+||+|+++||++||+||||||+|| T Consensus 61 R~ma~~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~-~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~f 139 (537) T protein:vir:10 61 REMVLNPECDSAVDDVVNETICGNFDDVPISIDLHNLK-QSEKIKKLIRSEFDEILRLLDFDNRAYEIFRRWYVDGRLFF 139 (537) T ss_pred HHHhhccchhhHHHHhhcceeEecCCCceEEEEecccc-cchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEE Confidence 99999999999999999999999999999999999997 79999999999999999999999999999999999999999 Q ss_pred hheecCCCCccCeeEeEEeCccccchhhhhcccCCcc------ccccccccchheeecCccccccccccccccccccccc Q lcl|NC_019530. 161 HVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREG------GIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQ 234 (560) Q Consensus 161 hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g------~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~ 234 (560) |||||++|||+||+|||+||||+|++||++++++..+ +.+++++..+||+|||+... .+++ T Consensus 140 hKiid~k~pk~GI~ELr~lDPr~i~~vR~i~~~~~~~~~~~~~~~~v~~~~~eyf~ynp~g~~-------------~~~~ 206 (537) T protein:vir:10 140 HKVIDPKKPRQGLVELRYVDPRKIRKVTEYEAKRPEALRTQDLNQQLTQQSASYFLYNPKGLK-------------NSTN 206 (537) T ss_pred EEEEeCCCccccceeeeeeCCccceeeEeecccCCccceEEecceeeeecccceeeecccccc-------------ccCC Confidence 9999999999999999999999999999998876533 34577788999999996432 2346 Q ss_pred cceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHH Q lcl|NC_019530. 235 QELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMM 314 (560) Q Consensus 235 ~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im 314 (560) ++++||.|||+||||||+|||+|+|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++|| T Consensus 207 ~~vkI~~dAI~y~hSGl~d~n~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM 286 (537) T protein:vir:10 207 QGMKIAPDSIAYCHSGIQDLNKNMVLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVM 286 (537) T ss_pred CceeccHhheeeecccceeCCCCeeeeeehhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHH Confidence 78999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCc Q lcl|NC_019530. 315 GKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQEEGS 394 (560) Q Consensus 315 ~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~ 394 (560) ++||||+|||++||+|+|++||||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++++ T Consensus 287 ~k~KNklVYDa~TGev~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnlgem~DV~YF~kKLy~aLnVP~SRl~~e~~ 366 (537) T protein:vir:10 287 GRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETT 366 (537) T ss_pred HhccceEEEeccCceecccchhhhhhhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHH Q lcl|NC_019530. 395 INIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILN 474 (560) Q Consensus 395 ~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~ 474 (560) |++||+ +||||||+||+|||.|||+|||.+|.++||+|||||||||++||++ |+++|+|+|++||||+|+||+|||+ T Consensus 367 f~~Gr~--~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~-i~~~I~~~f~~Dn~f~ElKe~Eil~ 443 (537) T protein:vir:10 367 FNIGRA--AEITRDEVKFQKFIARLRKRFSELFVDLLKTQLILKGICSIEEWEE-MKEHIQFDFIADNYFTELKEIEIRN 443 (537) T ss_pred cccccc--chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHH-HhhcceEEeeecchHHHHHHHHHHH Confidence 999997 8999999999999999999999999999999999999999999986 9999999999999999999999999 Q ss_pred HHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCC---CCCccccc Q lcl|NC_019530. 475 DRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPL---KFKPETIP 551 (560) Q Consensus 475 ~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~---~~~p~~~~ 551 (560) +|+++|+++|||||||||++||||+||||||+||++++|||++|+++|+|++|+++++|+.++++++++ .+.|+.-| T Consensus 444 ~Rl~~l~~~dpyvGky~s~dyi~k~ILr~tDeeI~~~~k~I~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~ 523 (537) T protein:vir:10 444 ERMNEVAQMDPYVGKYFSANYIRTKVLKQTESEIKEIDKEIKQEIADGVIMDPQAMQAMEMGIGDEEPVPEGGEEPQTDP 523 (537) T ss_pred HHHHHHHHhhhhhhcccchHHHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCcccccccccCCCCcccCCCCCCCcccCC Confidence 999999999999999999999999999999999999999999999999999999999999998777644 35566556 Q ss_pred ccCCcc-----cCC Q lcl|NC_019530. 552 FSGSTD-----DSI 560 (560) Q Consensus 552 ~~~~~~-----~~~ 560 (560) .+|... +.| T Consensus 524 ~~~~~~~~~~~~~~ 537 (537) T protein:vir:10 524 NSAVSPADQKRGEL 537 (537) T ss_pred ccCCCCCCccCCCC Confidence 444321 122 No 10 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=100.00 E-value=3.7e-271 Score=1503.16 Aligned_cols=517 Identities=37% Similarity=0.675 Sum_probs=478.0 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHH Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEY 80 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~Y 80 (560) |+ -||| -.+.++ +..++..|+++|+++||++++++++++++|++++++++|++|||++| T Consensus 1 m~-------~lfg-----------~~i~~~---~~~~~~~s~~~~~~~dg~~~i~~~~~~~~~~~~e~~~~~~~eLI~~Y 59 (533) T protein:vir:10 1 MS-------QLFG-----------FSLERA---KKAPKGPSFVQKDNLDGSQPVSGGGYYGYTVDFDGQVRNEYQLISRY 59 (533) T ss_pred Cc-------cccc-----------cccccc---cccccCCCCCCCCcccccceeecccccceeeecccccchHHHHHHHH Confidence 22 2222 233222 22578889999999999999999999999999999999999999999 Q ss_pred HHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhh Q lcl|NC_019530. 81 QSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAY 160 (560) Q Consensus 81 R~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~ 160 (560) |+||+|||||+||++|||||||||++++||+|+|++++ +|+++|++|+|||++||+||+|+++||++||+||||||+|| T Consensus 60 R~ma~~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~-~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~f 138 (533) T protein:vir:10 60 REMVLQPECDSAVDDIVNETICGNFDDVPVSVELSNLK-VSDKIKKLIREEFGEILRLLDFENRSYEIFRRWYVDGRLFY 138 (533) T ss_pred HHHhhccchhhHHHHhhcceeeecCCCceEEEEecccc-cchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEE Confidence 99999999999999999999999999999999999997 89999999999999999999999999999999999999999 Q ss_pred hheecCCCCccCeeEeEEeCccccchhhhhcccCCcc------ccccccccchheeecCccccccccccccccccccccc Q lcl|NC_019530. 161 HVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREG------GIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQ 234 (560) Q Consensus 161 hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g------~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~ 234 (560) |||||++|||+||+|||+||||+|++||++.++..++ +.+++.++.|||+|||+...+ +++ T Consensus 139 Hkiid~~~pk~GI~ELr~lDPr~i~~vr~i~~~~~~~~~~~~~~~~v~~~~~eyf~Ynp~g~~~-------------~~~ 205 (533) T protein:vir:10 139 HKVIDPDNPQGGLIELRYIDPRKIRKINETEQKRPEQLRGLPLNQQLSPKSAEYFLYDPKGLKN-------------STT 205 (533) T ss_pred EEEecCCCccccceeeeeccccceeeeeeeeccCCCccceeecchhhhccceeeeeeccccccc-------------cCC Confidence 9999999999999999999999999999998886655 356899999999999974432 235 Q ss_pred cceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHH Q lcl|NC_019530. 235 QELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMM 314 (560) Q Consensus 235 ~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im 314 (560) ++++||.|||+||||||+|||+|+|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++|| T Consensus 206 ~~vkI~~dAI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM 285 (533) T protein:vir:10 206 QGLKIAPDSICYVHSGIMDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVM 285 (533) T ss_pred CceecchhheeeeeccceeCCCCceeccchHhHHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHH Confidence 68999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCc Q lcl|NC_019530. 315 GKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQEEGS 394 (560) Q Consensus 315 ~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~ 394 (560) ++||||+|||++||+|+|++||||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++++ T Consensus 286 ~k~KNklVYDa~TGev~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~DV~YF~kKLY~aLnVP~SRl~~e~~ 365 (533) T protein:vir:10 286 GRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEITTLPGGQNLGELEDVKYFQKKLYKSLNVPGSRLETETT 365 (533) T ss_pred HhccceEEEeccCceecccchhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHH Q lcl|NC_019530. 395 INIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILN 474 (560) Q Consensus 395 ~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~ 474 (560) |++||+ +||||||+||+|||.|||+|||.+|.++||+|||||||||++||++ |+++|+|+|++||||+|+||+|||+ T Consensus 366 f~~Gr~--~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~-i~~~I~~~f~~Dn~f~ElKe~Eil~ 442 (533) T protein:vir:10 366 FNVGRA--AEITRDEVKFQKFVARLRKRFSELFTDLLKTQLVLKGVISIEEWDQ-MKEHIQYDYIADNYFAELKEIEIRN 442 (533) T ss_pred cccccc--chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHH-HhhcceEeeeecchHHHHHHHHHHH Confidence 999997 8999999999999999999999999999999999999999999987 9999999999999999999999999 Q ss_pred HHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccc----------ccccccccCCCC Q lcl|NC_019530. 475 DRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADET----------GNYSGSDVSPLK 544 (560) Q Consensus 475 ~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~----------g~~~~~~~~~~~ 544 (560) +|+++|+++|||||||||++||||+||||||+||++++|||++|+++|+|++|+++++ |+.+.++++|.. T Consensus 443 ~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (533) T protein:vir:10 443 ERMNQVATMDPFVGKYFSVEYMRRQVLKQTDVEMKEIDKQIESEMESGIIADPAAEMDPAMAAGDPDAGGAPAEEVAPEG 522 (533) T ss_pred HHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCCcchhhHHhcCCCCCcCCcccccCCCCC Confidence 9999999999999999999999999999999999999999999999999999998843 444444444444 Q ss_pred CCcccccccCC Q lcl|NC_019530. 545 FKPETIPFSGS 555 (560) Q Consensus 545 ~~p~~~~~~~~ 555 (560) -.|+.-.-|-- T Consensus 523 ~~~~~~~~~~~ 533 (533) T protein:vir:10 523 PDPSDERKAEF 533 (533) T ss_pred CCcchhhccCC Confidence 44433211111 No 11 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=100.00 E-value=4.7e-271 Score=1502.59 Aligned_cols=509 Identities=31% Similarity=0.560 Sum_probs=478.7 Q ss_pred hhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEec-------cccceeccccccchhhHHHHHHHHHHHh Q lcl|NC_019530. 12 FGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQG-------GANTYNYVGVESELLSVKTVVEEYQSMA 84 (560) Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~-------~~~~~~y~~~e~~~~~~~~LI~~YR~ma 84 (560) .+.++.|.-|.+-++. +.+|+++++++|+++|+++||++++++ ++++++|.+.+++++|+++||++||+|| T Consensus 1 ~~~~~lf~f~~~~d~~--~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma 78 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQN--EYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLI 78 (516) T ss_pred CCchHhcccccchhhh--HHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHh Confidence 2223332222222211 236668999999999999999999986 7778888899999999999999999999 Q ss_pred hccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhhee Q lcl|NC_019530. 85 QQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIV 164 (560) Q Consensus 85 ~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkii 164 (560) +|||||+||++|||||||||++++||+|+|++++ +|+++|++|+|||++||+||+|+++||++||+||||||+|||||| T Consensus 79 ~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~-~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKii 157 (516) T protein:vir:10 79 NNPEVERAVANIVNEAIVYERGHKVVSLDLDDTD-FGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIM 157 (516) T ss_pred hccchhhHHHHhhcceeEecCCCceEEEEecccC-cchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEe Confidence 9999999999999999999999999999999997 899999999999999999999999999999999999999999999 Q ss_pred cCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhce Q lcl|NC_019530. 165 DPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESI 244 (560) Q Consensus 165 D~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI 244 (560) | |||+||+|||+||||+|++||+|++++. +++++++++++||+|+|+..+++++|+ ++++++.++||.||| T Consensus 158 d--~~k~GI~Elr~lDPr~i~~vR~i~~~~~-~~~~v~~~~~e~~~Y~~~~~~~~~~g~------~~~~~~~ikI~~dAI 228 (516) T protein:vir:10 158 P--NPKKGIAELRRLDPRFMEYYREIVTSDI-GGTTIVKGYREFFIYTTGNEGYSYNGR------IFEPNTRIKIPRSAV 228 (516) T ss_pred c--CccccceeeeeeCCcceeeEeeeccccc-ccchhhhhhhheeeeccCccccccccc------eeCCCcceeechhhe Confidence 7 9999999999999999999999999885 558999999999999999999999887 355667899999999 Q ss_pred eeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeec Q lcl|NC_019530. 245 VYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYD 324 (560) Q Consensus 245 ~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd 324 (560) |||||||+|||+|.|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+||| T Consensus 229 ~y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYD 308 (516) T protein:vir:10 229 VYASSGLMDCSDRGIIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYD 308 (516) T ss_pred eeecccceeCCCCceeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhh Q lcl|NC_019530. 325 RTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAE 404 (560) Q Consensus 325 ~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~e 404 (560) ++||||+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++++|++..+.++| T Consensus 309 a~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~E 388 (516) T protein:vir:10 309 SNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTA 388 (516) T ss_pred CCCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccch Confidence 99999999999999999999999999999999999999999999999999999999999999999999999884444599 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019530. 405 ITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVE 484 (560) Q Consensus 405 ItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~ 484 (560) |||||+||+|||+|||+|||.+|.++||+|||||||||++||++ |+++|+|+|++||||+|+||+|||++|+++|+++| T Consensus 389 ItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~-i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~d 467 (516) T protein:vir:10 389 ITRDELDFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDE-QINNIKVNFHQDSYYTELKDIETLRLRVDALSQIE 467 (516) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHH-HhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhh Confidence 99999999999999999999999999999999999999999987 99999999999999999999999999999999999 Q ss_pred hhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccc Q lcl|NC_019530. 485 PFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNY 535 (560) Q Consensus 485 p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~ 535 (560) ||||||||++||||+||+|||+||++++|||++|+++|+|++|++++ ++ T Consensus 468 pyvGky~s~~yi~k~ILr~tDeei~~e~k~I~~E~~~~~~~~p~~~~--~f 516 (516) T protein:vir:10 468 PYVGKYVSHDYVMKNILQMTEEQIAQEEKQIEQEAGIKRFQNPENED--DF 516 (516) T ss_pred hhhccccchHHHHHHHhcCCHhhHHHHHHHHHHhhhCCCCCCCCccc--cC Confidence 99999999999999999999999999999999999999999997763 22 No 12 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=100.00 E-value=4.7e-271 Score=1502.59 Aligned_cols=509 Identities=31% Similarity=0.560 Sum_probs=478.7 Q ss_pred hhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEec-------cccceeccccccchhhHHHHHHHHHHHh Q lcl|NC_019530. 12 FGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQG-------GANTYNYVGVESELLSVKTVVEEYQSMA 84 (560) Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~-------~~~~~~y~~~e~~~~~~~~LI~~YR~ma 84 (560) .+.++.|.-|.+-++. +.+|+++++++|+++|+++||++++++ ++++++|.+.+++++|+++||++||+|| T Consensus 1 ~~~~~lf~f~~~~d~~--~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma 78 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQN--EYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLI 78 (516) T ss_pred CCchHhcccccchhhh--HHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHh Confidence 2223332222222211 236668999999999999999999986 7778888899999999999999999999 Q ss_pred hccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhhee Q lcl|NC_019530. 85 QQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIV 164 (560) Q Consensus 85 ~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkii 164 (560) +|||||+||++|||||||||++++||+|+|++++ +|+++|++|+|||++||+||+|+++||++||+||||||+|||||| T Consensus 79 ~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~-~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKii 157 (516) T protein:vir:10 79 NNPEVERAVANIVNEAIVYERGHKVVSLDLDDTD-FGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIM 157 (516) T ss_pred hccchhhHHHHhhcceeEecCCCceEEEEecccC-cchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEe Confidence 9999999999999999999999999999999997 899999999999999999999999999999999999999999999 Q ss_pred cCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhce Q lcl|NC_019530. 165 DPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESI 244 (560) Q Consensus 165 D~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI 244 (560) | |||+||+|||+||||+|++||+|++++. +++++++++++||+|+|+..+++++|+ ++++++.++||.||| T Consensus 158 d--~~k~GI~Elr~lDPr~i~~vR~i~~~~~-~~~~v~~~~~e~~~Y~~~~~~~~~~g~------~~~~~~~ikI~~dAI 228 (516) T protein:vir:10 158 P--NPKKGIAELRRLDPRFMEYYREIVTSDI-GGTTIVKGYREFFIYTTGNEGYSYNGR------IFEPNTRIKIPRSAV 228 (516) T ss_pred c--CccccceeeeeeCCcceeeEeeeccccc-ccchhhhhhhheeeeccCccccccccc------eeCCCcceeechhhe Confidence 7 9999999999999999999999999885 558999999999999999999999887 355667899999999 Q ss_pred eeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeec Q lcl|NC_019530. 245 VYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYD 324 (560) Q Consensus 245 ~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd 324 (560) |||||||+|||+|.|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+||| T Consensus 229 ~y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYD 308 (516) T protein:vir:10 229 VYASSGLMDCSDRGIIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYD 308 (516) T ss_pred eeecccceeCCCCceeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhh Q lcl|NC_019530. 325 RTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAE 404 (560) Q Consensus 325 ~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~e 404 (560) ++||||+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++++|++..+.++| T Consensus 309 a~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~E 388 (516) T protein:vir:10 309 SNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTA 388 (516) T ss_pred CCCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccch Confidence 99999999999999999999999999999999999999999999999999999999999999999999999884444599 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019530. 405 ITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVE 484 (560) Q Consensus 405 ItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~ 484 (560) |||||+||+|||+|||+|||.+|.++||+|||||||||++||++ |+++|+|+|++||||+|+||+|||++|+++|+++| T Consensus 389 ItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~-i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~d 467 (516) T protein:vir:10 389 ITRDELDFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDE-QINNIKVNFHQDSYYTELKDIETLRLRVDALSQIE 467 (516) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHH-HhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhh Confidence 99999999999999999999999999999999999999999987 99999999999999999999999999999999999 Q ss_pred hhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccc Q lcl|NC_019530. 485 PFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNY 535 (560) Q Consensus 485 p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~ 535 (560) ||||||||++||||+||+|||+||++++|||++|+++|+|++|++++ ++ T Consensus 468 pyvGky~s~~yi~k~ILr~tDeei~~e~k~I~~E~~~~~~~~p~~~~--~f 516 (516) T protein:vir:10 468 PYVGKYVSHDYVMKNILQMTEEQIAQEEKQIEQEAGIKRFQNPENED--DF 516 (516) T ss_pred hhhccccchHHHHHHHhcCCHhhHHHHHHHHHHhhhCCCCCCCCccc--cC Confidence 99999999999999999999999999999999999999999997763 22 No 13 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=100.00 E-value=1.3e-270 Score=1500.20 Aligned_cols=507 Identities=32% Similarity=0.571 Sum_probs=478.4 Q ss_pred hhhhhhh--hhhhhhcccccchhhhccccccccCCCCCCCCceeEec-------cccceeccccccchhhHHHHHHHHHH Q lcl|NC_019530. 12 FGGGGLV--NAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQG-------GANTYNYVGVESELLSVKTVVEEYQS 82 (560) Q Consensus 12 ~~~~~~~--~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~-------~~~~~~y~~~e~~~~~~~~LI~~YR~ 82 (560) .+.++.| +.++++. +.+++++++++|+++|+++||++++++ +++.++|++.+++++++++||++||+ T Consensus 1 ~~~~~lf~f~~~~d~~----~~~~~~~~~~~s~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~d~~~~~~~~~~LI~~YR~ 76 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQN----EYDERLKQGHESIATPKKDDGATEIEAREGESSYNALMQQFFGIDNNISGTKDLINTYRQ 76 (516) T ss_pred CCchHhcccccchhhH----HHHhhhcCCCCcccCCCCccCceeeecCcccccccceeeeeecccCccccHHHHHHHHHH Confidence 2222222 2222232 246678899999999999999999987 57778888999999999999999999 Q ss_pred HhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhh Q lcl|NC_019530. 83 MAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHV 162 (560) Q Consensus 83 ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hk 162 (560) ||+|||||+||++|||||||||++++||+|+|++++ +|+++|++|+|||++||+||+|+++||++||+||||||+|||| T Consensus 77 ma~~pEvd~Av~eIvneaiv~d~~~~pV~l~l~~~e-~s~sik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhK 155 (516) T protein:vir:10 77 LTNNPEVERAVANIVNEAVVYEKGHKVVSLDLDDTE-FSSSIKDKILEEFDEICRLLDASRKLDTLFRRWYIDSRIFFHK 155 (516) T ss_pred hhhccchhHHHHHhhcceeEecCCCceEEEEecccc-cchHHHHHHHHHHHHHHHHhccchhhhHHHHhhhhcceEEEEE Confidence 999999999999999999999999999999999997 8999999999999999999999999999999999999999999 Q ss_pred eecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchh Q lcl|NC_019530. 163 IVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDE 242 (560) Q Consensus 163 iiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~d 242 (560) ||| |||+||+|||+||||+|++||++++++. ++++++++++|||+|+|+..+++++|+ ++++++.++||.| T Consensus 156 iid--~~k~GI~elr~lDPr~i~~vR~i~~~~~-~~~~v~~~~~e~~~Y~~~~~~~~~~g~------~~~~~~~ikI~~d 226 (516) T protein:vir:10 156 IMP--NPKEGIVELRRLDPRHVEYYREIVTSDV-GGTSVVKGYREFFVYTTGNEGYAYNGR------LFEPNTRIKIPRS 226 (516) T ss_pred Eec--CcccceeeeeeeCCcceeeEEeeecccC-cchhhhhceeeeeeeecCccceecccc------ccCCCCceecchh Confidence 997 9999999999999999999999999984 668999999999999999999998887 3456778999999 Q ss_pred ceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhhee Q lcl|NC_019530. 243 SIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNA 322 (560) Q Consensus 243 aI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~v 322 (560) ||+||||||+|||++.|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+| T Consensus 227 aI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~KNklv 306 (516) T protein:vir:10 227 AIVYAHSGLQDCSDRGIVGYLHNAVKPANQLKLLEDALVIYRITRAPERRVFYIDVGNMPNRKATEYVNGIMQSLKNRVV 306 (516) T ss_pred heeeeecCcccCCCCceeceehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccch Q lcl|NC_019530. 323 YDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNL 402 (560) Q Consensus 323 Yd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~ 402 (560) ||++||||+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++++|+++.+.+ T Consensus 307 YDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~ 386 (516) T protein:vir:10 307 YDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVTSLPGAQTMGEMDDVRWFNKKLYEALRIPLSRMPRDDGGMVIGGQD 386 (516) T ss_pred EeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeecccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998844445 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 403 AEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNT 482 (560) Q Consensus 403 ~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~ 482 (560) +||||||+||+|||.|||+|||.+|.++||+|||||||||++||++ |+++|+|+|++||||+|+||+|||++|+++|++ T Consensus 387 ~EItRDEiKF~KFI~rLR~rFs~lF~~~L~~qLilKgIit~eeW~~-i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~ 465 (516) T protein:vir:10 387 MAITRDELDFRKFIVQLQHNFEEIFLDPLKTNLIYKKIILESEWEE-QINNIKVNFHQDSYYTELKDIETLRQRVDALSQ 465 (516) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCHHHHHH-HhhcceEEeeecchHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999999999987 899999999999999999999999999999999 Q ss_pred hhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccc Q lcl|NC_019530. 483 VEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETG 533 (560) Q Consensus 483 ~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g 533 (560) +|||||||||++||||+||+|||+||++++|||++|+++|+|++|+++++= T Consensus 466 ~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~~~~~~~~p~~e~~f 516 (516) T protein:vir:10 466 IEPYVGKYVSHDYVMKNILQMTDEQIAQEEKQIEKEANVKRFQNPENEDDF 516 (516) T ss_pred hhhhhccccchHHHHHHHhcCCHhHHHHHHHHHHHhhhCCCCCCCCccccC Confidence 999999999999999999999999999999999999999999999887433 No 14 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=100.00 E-value=7.9e-271 Score=1501.37 Aligned_cols=531 Identities=33% Similarity=0.559 Sum_probs=485.0 Q ss_pred hhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceecccccc--chhhHHHHHHHHHHHhh Q lcl|NC_019530. 8 FFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVES--ELLSVKTVVEEYQSMAQ 85 (560) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~--~~~~~~~LI~~YR~ma~ 85 (560) ..-|||. ++ +++..+++.|++||+++||+..+ .++++++|+++++ +++|++|||++||+||+ T Consensus 1 m~~lfgf-----------~i----~~~~~~~~~S~vpp~~~~~~~~i-~~g~~g~~v~~~g~~~~~n~~eLI~~YR~ma~ 64 (564) T protein:vir:10 1 MSQLFGF-----------LI----NEKEGQKGQSPVPPNDEASVSTV-AGGYFGTYVDTSGGQNSRNEYELIRRYRDMSL 64 (564) T ss_pred Ccchhcc-----------ee----eeeccCCCCCcccCCcCCChhhh-hccccceeeecccccchhhHHHHHHHHHHHhh Confidence 2223331 12 34456789999999999999987 5578899999986 68999999999999999 Q ss_pred ccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheec Q lcl|NC_019530. 86 QPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVD 165 (560) Q Consensus 86 ~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD 165 (560) |||||+||+|||||||||+++++||+|+|++++ +|+++|++|+|||++||+||+|+++||++||+||||||+||||||| T Consensus 65 ~pEVd~Av~eIVneaIv~d~~~~pV~vdL~~~~-~s~siK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid 143 (564) T protein:vir:10 65 HPEVDSAIDEIVNEFVVNDGDDKPVEVDLQNLE-IGSGVKKKIRDEFNRILRMMNFNVNAHEIIRNWYVDGRSHYHKVID 143 (564) T ss_pred ccchhhHHHHhhcceeEecCCCceEEEEecccC-cchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEee Confidence 999999999999999999999999999999997 8999999999999999999999999999999999999999999999 Q ss_pred CCCCccCeeEeEEeCccccchhhhhcccCCccccccccccc---------hheeecCccccccccccccccccccccccc Q lcl|NC_019530. 166 PTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVT---------LKYYYNPNYNRNQFTGQSGTSQNFQPSQQE 236 (560) Q Consensus 166 ~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~---------e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~ 236 (560) ++|||+||+|||+||||+|++||++.++...++.+++++++ |||+|||.....+.++. +...++++++. T Consensus 144 ~~~pk~GI~eLr~lDPr~i~~vr~i~~~~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~~~~g~~~~~--~~~~~~~~~~~ 221 (564) T protein:vir:10 144 LDNPKKGILELRYIDSLKIRKVRQKLKDVDPNRKEIEKGTALQYDYGDFIEYYIYNPKGFAGNIPMV--TGSMDWSNQEG 221 (564) T ss_pred CCChhhhhhhhhhhcccceeeeeeeccccccccceeeeeeeeeccccccccceeeccccccCccccc--ccccccccccc Confidence 99999999999999999999999999987777889998876 78888887555555444 33346788999 Q ss_pred eecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHh Q lcl|NC_019530. 237 LVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGK 316 (560) Q Consensus 237 ~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k 316 (560) ++||.+||+||||||+|||+|+|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++ T Consensus 222 ikI~~daI~y~hSGL~d~~~~~i~gyLhkAIKp~NQLkmlEDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k 301 (564) T protein:vir:10 222 IKIASDAIAQSTSGLMDLNKKMTLSFLHKAIKSLNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLRDVMSR 301 (564) T ss_pred eeechhhcceecccceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCC-cc Q lcl|NC_019530. 317 FKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQEEG-SI 395 (560) Q Consensus 317 ~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~-~~ 395 (560) ||||+|||++||+|+|++||||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++ +| T Consensus 302 ~KNklVYDa~TGevrddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~DV~YF~kKLY~aLnVP~SRl~~e~~~f 381 (564) T protein:vir:10 302 YRNKLVYDGQTGEIRDDKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELKDVEYFKKKLYNSLNLPPSRLTDDNKAF 381 (564) T ss_pred cCceEEEeccCceecccchhhhhHhhhcccccCCCcccceeeccccCCcchHHHHHHHHHHHHHHhCCCcccccCCCcee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999985 79 Q ss_pred cccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHH Q lcl|NC_019530. 396 NIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILND 475 (560) Q Consensus 396 ~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~ 475 (560) ++||+ +||||||+||+|||.|||+||+.+|.++||+|||||||||++||++ |+++|+|+|++||||+||||+|||++ T Consensus 382 ~~Gr~--~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~-i~~~I~~~f~~Dn~f~ElKe~Eil~~ 458 (564) T protein:vir:10 382 NLGKS--TEILRDELKFTKFIGRLRKRFAQLFHDILKTQLILKGIITPEDWDD-MEEHIQYDFLFDNHFNELKEQEMQLQ 458 (564) T ss_pred ecccc--cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHH-HhhcceEEeeecchHHHHHHHHHHHH Confidence 99997 8999999999999999999999999999999999999999999987 99999999999999999999999999 Q ss_pred HHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccccccccc-----------CCCC Q lcl|NC_019530. 476 RLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDV-----------SPLK 544 (560) Q Consensus 476 Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~-----------~~~~ 544 (560) |+++|+++|||||||||++||||+||+|||+||++++|||++|+++|++++|.++++|+.++.++ ++.+ T Consensus 459 Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~~~~~~~~~~~~~~p~~~~~~~~~~ 538 (564) T protein:vir:10 459 RVNLATQMDPFVGKYFSTEYIRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVNMLDDMEKQNQAFAPELQAAQDDLA 538 (564) T ss_pred HHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhhcCCCccCCCCcCCcchhhhccccc Confidence 99999999999999999999999999999999999999999999999999998887776443333 2233 Q ss_pred CCcccccccCCcccCC Q lcl|NC_019530. 545 FKPETIPFSGSTDDSI 560 (560) Q Consensus 545 ~~p~~~~~~~~~~~~~ 560 (560) ..|+.-+..+.++-.- T Consensus 539 ~~~~~~~~~~a~~~~~ 554 (564) T protein:vir:10 539 AEREIKKLNSAPKPPP 554 (564) T ss_pred cccChhhhccCCCCCC Confidence 3444444333333222 No 15 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=100.00 E-value=9.8e-269 Score=1489.91 Aligned_cols=509 Identities=34% Similarity=0.603 Sum_probs=472.3 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEecc--------ccceeccccccchhh Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGG--------ANTYNYVGVESELLS 72 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~--------~~~~~y~~~e~~~~~ 72 (560) |+ |-.++++.-|.+-++. +..++.++++.|+++|+++||+++|+++ +++++|++.++.++| T Consensus 1 m~---------~~~l~lf~f~~k~~e~--~~~~~~~~~~~s~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~~~~n 69 (521) T protein:vir:10 1 MN---------PIFLKLLQPWMKDDEK--RVQSDLSDRIDSFAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAPKIQN 69 (521) T ss_pred CC---------cchhHHhhhhhhhhhh--HHhhhhccCccccccccCCCCceeeccCCCccccccchhhhhhccccccch Confidence 32 1222222222222211 2244578899999999999999988765 455789999999999 Q ss_pred HHHHHHHHHHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHh Q lcl|NC_019530. 73 VKTVVEEYQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKW 152 (560) Q Consensus 73 ~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~w 152 (560) +++||++||+||+|||||+||++|||||||||++++||+|+|++++ +|+++|++|+|||++||+||+|+++||++||+| T Consensus 70 ~~eLI~~YR~ma~~pEvd~Av~eIvneaiv~d~~~~pV~i~Ld~~~-~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~W 148 (521) T protein:vir:10 70 TKDLINQYRSLSKYHEVDNAIDEIINDAIVQEDNRDTVYLDLDKTD-WNESVKEMVREEFRTILKLLKFEREGKRHFRRW 148 (521) T ss_pred HHHHHHHHHHHhhccchhhHHHhhhcceEEecCCCceEEEEecCcc-cchHHHHHHHHHHHHHHHHhccchhhhHHHhhh Confidence 9999999999999999999999999999999999999999999997 699999999999999999999999999999999 Q ss_pred hhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCcccc-cccccccccccccc Q lcl|NC_019530. 153 YVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNR-NQFTGQSGTSQNFQ 231 (560) Q Consensus 153 YvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~-~~~~g~~~~~~~~~ 231 (560) |||||+|||||||++|||+||+|||+||||+|++||++.+++. +++.++++++|||+|+|.... ++.+| T Consensus 149 YVDgRi~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i~k~~~-~~~~v~~~~~e~f~Y~~~~~~~~~~~g--------- 218 (521) T protein:vir:10 149 YVDSRIYFHKMIDPARPKDGIKELRLLDPRNVEYYRVNLKSNE-NGNDVYKGVKEFFTYGATEDNRYNISG--------- 218 (521) T ss_pred eeeeeEEEEEEeeCCCccccceeeeeeCCcceeeeeeecCCCC-CcchhhccceeeeeeccCCCceecCCC--------- Confidence 9999999999999999999999999999999999999999885 778999999999999975432 22222 Q ss_pred ccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHH Q lcl|NC_019530. 232 PSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMT 311 (560) Q Consensus 232 ~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~ 311 (560) .+++.++||+||||||||||+|||+|+|+||||+|||||||||||||||||||+||||||||||||||||||.||||||+ T Consensus 219 ~~~~~vkI~~daI~y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~ 298 (521) T protein:vir:10 219 NSNNLVQIPIDAIVYSHSGKVDIDGKTIVGYLHNVIKPANQLKMLEDAMVIYRITRAPERRVFYIDVGTMPNKKATQHLN 298 (521) T ss_pred CCCcceeechhheeeecccceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHH Confidence 35789999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCC Q lcl|NC_019530. 312 MMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQE 391 (560) Q Consensus 312 ~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~ 391 (560) +||++||||+|||++||+|+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++ T Consensus 299 ~iM~k~kNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEI~TLpggqnlgem~DV~YF~kkLy~aLnVP~sRl~~ 378 (521) T protein:vir:10 299 NVMQGLKNRVVYDSSTGKVKNSSNNLAMTEDYWLMRRDGKATTEVSTLPGAQSMGEMDDVRWFNRKLYESMKIPLSRLPQ 378 (521) T ss_pred HHHHhcCceEEEeccCceeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CC-cccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHH Q lcl|NC_019530. 392 EG-SINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQEN 470 (560) Q Consensus 392 ~~-~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~ 470 (560) ++ +|++||+ +||||||+||+|||+|||+|||.+|.++||+|||||||||++||++ |+++|+|+|++||||+|+||+ T Consensus 379 e~~~f~~Gr~--~EItRDEikF~KFI~rLR~rFs~~f~~~L~~qLilKgiit~eew~~-i~~~I~~~f~~Dn~f~ElKe~ 455 (521) T protein:vir:10 379 EGAGVTFGAG--NDITRDELQFTKYIRGLQQQFEPIFLNPLRTNLMLKGKMSVSEWEE-QAENIKVVFSKDSYYEEIKDV 455 (521) T ss_pred CCCceecccc--cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHH-HhhcceEEeeecchHHHHHHH Confidence 85 7999997 7999999999999999999999999999999999999999999987 999999999999999999999 Q ss_pred HHHHHHHHHHHHhhh--hhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccc Q lcl|NC_019530. 471 AILNDRLASLNTVEP--FVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNY 535 (560) Q Consensus 471 Ei~~~Rl~~l~~~~p--~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~ 535 (560) |||++|+++|+++|| |||||||++||||+||||||+||++++|||++|+++|+|++|+++++- + T Consensus 456 eil~~R~~~l~~~dp~~yvGky~s~dyi~k~ILr~tDeeik~~~k~I~~E~~~~~~~~p~~e~~d-f 521 (521) T protein:vir:10 456 EILERRVNLVQTLASAEVTGKYLSHEYVMKNILRMSDEDIKTEREKIDGELKDSVYKNPEDPMEE-F 521 (521) T ss_pred HHHHHHHHHHHhhcCccccccccchHHHHHHHhcCCHhHHHHHHHHHHHhhhCCCCCCCcchhhc-C Confidence 999999999999999 999999999999999999999999999999999999999999987432 2 No 16 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=100.00 E-value=3e-265 Score=1470.79 Aligned_cols=493 Identities=32% Similarity=0.560 Sum_probs=463.7 Q ss_pred cccccchhhh-----ccccccccCCCCCCCCceeEec--------cccceeccccccchhhHHHHHHHHHHHhhccchhH Q lcl|NC_019530. 25 DKLAQKQDEL-----LLTKATVVALDDAQDGSIILQG--------GANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRK 91 (560) Q Consensus 25 ~~~~~~~~e~-----~~~k~~s~~~~~~~dG~~~i~~--------~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~ 91 (560) -++|+++||+ +++++.|+++|+++||++++++ |+++++|++.++.++++ |||++||+||+|||||+ T Consensus 1 ~~~w~~~de~~~~~~~~~~~~S~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~-eLI~~YR~ma~~pEvd~ 79 (511) T protein:vir:56 1 MKFWTKEEEQDIQKIEKNPVRSFSAPDNVDGAKEIHTNLLAPQLGHAIIPSDAQSEGTIPVK-ELIKSYRALAEYHEVDD 79 (511) T ss_pred CCCccchhhhhhhhhccCCcccccCCCCCCCceEEecccccceecceeccccccccCccchH-HHHHHHHHHhhccchhh Confidence 6788888776 6888999999999999999964 56788888999888885 99999999999999999 Q ss_pred HHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCcc Q lcl|NC_019530. 92 AVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKG 171 (560) Q Consensus 92 AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~ 171 (560) ||++|||||||||++++||+|+|++++ +|+++|++|+|||++||+||+|+++||++||+||||||+|||||||| |+ T Consensus 80 Av~eIvne~iv~d~~~~pV~l~ld~~~-~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~---k~ 155 (511) T protein:vir:56 80 AIQEIVDEAIVYENDKEVVWLNLDNTD-FSENIKAKINEEFDRVVSLLQMRKHGYKWFRKWYVDSRIYFHKILDK---DN 155 (511) T ss_pred HHHHhhcceeEecCCCceEEEEecccC-cchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecc---cc Confidence 999999999999999999999999998 89999999999999999999999999999999999999999999987 45 Q ss_pred CeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeeccc Q lcl|NC_019530. 172 GIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGE 251 (560) Q Consensus 172 GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl 251 (560) ||+|||+||||+|++||++++++.. ++++++++.+||+|+|.+ ....++. +++ ..+++.++||.|||||||||| T Consensus 156 GI~eLr~lDPr~i~~vr~i~~~~~~-~~~v~~~~~ey~~Y~~~~-~~~~~~~-~~~---~~~~~~vkI~~daI~y~hSGL 229 (511) T protein:vir:56 156 NIIELRPLNPMKMELVREIQKETID-GVEVVKGTLEYYVYKQSD-YKMPSWM-SAT---NRAQTSFRIPKDAIVFAHSGL 229 (511) T ss_pred ceeehhhcCcccchhhhhhhccccc-ccccccceeeeeEecCCC-cccCccc-ccc---cccccceeechhheeeecccc Confidence 9999999999999999999997764 599999999999999864 2222222 222 224678999999999999999 Q ss_pred ccC--CCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCc Q lcl|NC_019530. 252 EPL--ANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGK 329 (560) Q Consensus 252 ~~~--~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGe 329 (560) +|| ++|+|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+ T Consensus 230 ~d~~~~~g~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~kNklVYDa~TGe 309 (511) T protein:vir:56 230 MRGCADDPYIIGYLDRAIKPANQLKMLEDALVIYRLARAPERRVFYVDVGNLPTQKAQQYVNGIMQNVKNRVVYDTQTGQ 309 (511) T ss_pred eeccCCCCeeeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCce Confidence 995 666899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCC---Ccccccccchhhhh Q lcl|NC_019530. 330 ITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQEE---GSINIGGSNLAEIT 406 (560) Q Consensus 330 v~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~---~~~~l~~~~~~eIt 406 (560) |+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++ ++|++||+ +||| T Consensus 310 v~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~--~EIt 387 (511) T protein:vir:56 310 VKNTTNAMSMLEDYYLPRREGSKGTEVSTLPGGQSLGDIEDVLYFNRKLYKAMRIPTSRAASEDQTGGINFGQG--AEIT 387 (511) T ss_pred eccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCccccccccc--hhhh Confidence 999999999999999999999999999999999999999999999999999999999999977 47999986 8999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_019530. 407 QEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPF 486 (560) Q Consensus 407 RDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~ 486 (560) |||+||+|||.|||+|||.+|.++||+|||||||||++||++ |+++|+|+|++||||+|+||+|||++|+++|+++||| T Consensus 388 RDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~-i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpy 466 (511) T protein:vir:56 388 RDELKFTKFVKRLQTKFETVITDPLKHQLIVNNIITEEEWDA-NHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDY 466 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHH-HhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcch Confidence 999999999999999999999999999999999999999987 9999999999999999999999999999999999999 Q ss_pred hcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccc Q lcl|NC_019530. 487 VGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADE 531 (560) Q Consensus 487 vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e 531 (560) ||||||++||||+||+|||+||++++|||++|+++|+|++|+++= T Consensus 467 vGky~S~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~e~~f 511 (511) T protein:vir:56 467 AGKYYSHKYIQKNILRLSDDQITAMQSEIDEEETNPRFQQDDQGF 511 (511) T ss_pred hccccchHHHHHHHhccCHHHHHHHHHHHHHhhcCCCCCCcccCC Confidence 999999999999999999999999999999999999999988773 No 17 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=100.00 E-value=4.2e-227 Score=1261.67 Aligned_cols=476 Identities=19% Similarity=0.314 Sum_probs=421.7 Q ss_pred hhhcccccchhhh-----ccccccccCCCCCCCCceeEeccccce-ecc-----ccccchhhHHHHHHHHHHHh-hccch Q lcl|NC_019530. 22 VDTDKLAQKQDEL-----LLTKATVVALDDAQDGSIILQGGANTY-NYV-----GVESELLSVKTVVEEYQSMA-QQPEI 89 (560) Q Consensus 22 ~~~~~~~~~~~e~-----~~~k~~s~~~~~~~dG~~~i~~~~~~~-~y~-----~~e~~~~~~~~LI~~YR~ma-~~PEv 89 (560) .-+.++|++.+|+ ++++..|+++|+.+||+++|..+.+.. .+. -+.+.++|++|||++||+|| +|||| T Consensus 1 ~~~~~~w~~~de~~~~~~~~~~~~~~~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~gg~~~n~~eLI~~YR~ma~~~pEV 80 (533) T protein:vir:58 1 MPSLEKYKKLNEAVNFTNFLSPMYGMGAPHGAGGSSMIPINMYHPFATAGYASRFYGGIEFNRFFLYDMYDRMDYTDPLI 80 (533) T ss_pred CCCcchhhhhhHHHHHHHhhchhhcccCccCCCCCccccCCCCcchhhhhhhhhhhccccccHHHHHHHHHHhhccCcch Confidence 5566777777665 688999999999999999998654321 111 12467899999999999997 68999 Q ss_pred hHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCC Q lcl|NC_019530. 90 RKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNK 169 (560) Q Consensus 90 d~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~ 169 (560) |+||++||||||||+++++||+|+|++++ +|+++|++| +++|+|+++||++||+||||||+||||++ ++| T Consensus 81 d~AideIvneaiv~d~~~~pV~v~l~~~e-~s~~iK~kI-------~~lldf~~~~~~~fR~WYVDGriy~Hkii--k~~ 150 (533) T protein:vir:58 81 STVLDIIADECTIPNENGNIVDVVTKDIE-LAKAILSYL-------DYVINIEKNAYPIIRNMIKYGDMFLHILE--KGS 150 (533) T ss_pred hhHHHhhhceeeEecCCCceeEeeccccc-ccHHHHHHH-------HHHhcchhhhhHHHHhhhhcceeEEEecc--CCc Confidence 99999999999999999999999999997 899999876 47999999999999999999999999976 578 Q ss_pred ccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeec Q lcl|NC_019530. 170 KGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDS 249 (560) Q Consensus 170 k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hs 249 (560) |+||+|||+||||+|++||++.++ .+||+|+|.+.+ ..++.+.++||+++|+|||| T Consensus 151 k~GI~elr~lDPr~i~~vr~~~t~------------~eyyvy~~~~~~------------~~s~~~~~kI~~daI~y~~S 206 (533) T protein:vir:58 151 DGTIEKFQVVSPYIFSKRYNPETD------------TWYYVITDVYRN------------VVSGYFNEDIPEEDVIHFSH 206 (533) T ss_pred ccchhhheecCCeeeEEEEeeccc------------eEEEeecccccc------------cccCccccccchhheeeeee Confidence 999999999999999999988775 489999986332 11233568999999999999 Q ss_pred ccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCc Q lcl|NC_019530. 250 GEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGK 329 (560) Q Consensus 250 Gl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGe 329 (560) ||++||+++++||||+|||||||||||||||||||+||||||||||||||||||.||+|||++||++||||+|||++||+ T Consensus 207 Gl~d~~~~~iisyLhkAiKp~NQLkmiEDAlVIYRisRAPeRRvFYIDVGNlpk~KAeqYl~~im~k~kNklvYDa~TGe 286 (533) T protein:vir:58 207 KIDTNFFPYGRSYLESARAIWNQLRLMEDALMLYRVVRSVDRRVFYVDVGNVPPDKINEYLTNIAMQYKRDYWVRNNQNQ 286 (533) T ss_pred ccccCCCCceehhhhHHHHHHHHHHHHHHHHHHHhhcCChhheEEEEeecCCCccCHHHHHHHHHHhcccceEEeccCCe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCcccch---hhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhh Q lcl|NC_019530. 330 ITGNAHLM---GIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEIT 406 (560) Q Consensus 330 v~d~~~~m---smlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eIt 406 (560) |+|++|+| ||||||||||||||||||||||||| |||+|+||+||++|||+|||||+|||+++++| ||+ +||| T Consensus 287 v~ddrk~m~~~sMlEDyWLpRReGgrgTEI~TLpGg-~lgemeDV~YF~kkLy~ALnVP~sRl~~e~~f--gr~--~eIt 361 (533) T protein:vir:58 287 FLGIDNYFSIESILKDYFIPRRGDRRAVEIDILQGS-KVDLAEDVEYMLNRLISALKVPKAFIGYEGDV--NAK--NTLA 361 (533) T ss_pred EeeccchhhhhhhHhhhcccccCCCccceeeecCCC-CCCcHHHHHHHHHHHHHHhCCCeeecCCCCCC--ccc--hhhh Confidence 99999999 9999999999999999999999987 59999999999999999999999999999986 675 8999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_019530. 407 QEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPF 486 (560) Q Consensus 407 RDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~ 486 (560) ||||||+|||+|||+||+.+| ++||+||||||+||| +|+|++||||+|+|++|||++|+++|+++||| T Consensus 362 RDEiKF~KFI~rLR~rF~~ll----~~qLilk~iit~eew--------~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~dpy 429 (533) T protein:vir:58 362 TQDIKFNNTIKRIQGFFVEEL----ERMVRMNKEFADQDF--------RLVMNRSNSIVEGERFAVIEQRIGIAERLKGW 429 (533) T ss_pred HHHHHHHHHHHHHHHHHHHHH----hcccccccCcchhhe--------eeeeeccchHHHHHHHHHHHHHHHHHHHhcch Confidence 999999999999999988776 569999999999999 59999999999999999999999999999999 Q ss_pred hcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcccc-ccc------------ Q lcl|NC_019530. 487 VGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETI-PFS------------ 553 (560) Q Consensus 487 vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~-~~~------------ 553 (560) ||| +||||+||+||| ||++++++|++|+++|+|++|+.++ +++|.++.||.. |.- T Consensus 430 vgk----~yi~k~ILr~td-ei~~q~e~ie~E~~~~~~~~~~~~~-------e~~~~~~~~~~~~p~~~~~~~~~~~~~~ 497 (533) T protein:vir:58 430 VRE----DWIYSNILQIPY-DLKPQEEVAEAAGGGGLFDTGGFGE-------ETTPADFLGERGSPIESPRGRTEFDFGT 497 (533) T ss_pred hhH----HHHHHHHhcCCh-hhhHHHHHHHHhhcCCCCCCCCccc-------ccCCcccCccccCcccCCCChhhHhccc Confidence 999 689999999998 6777788999999999999987652 334444444444 221 Q ss_pred ------CCccc--CC Q lcl|NC_019530. 554 ------GSTDD--SI 560 (560) Q Consensus 554 ------~~~~~--~~ 560 (560) ||..+ +- T Consensus 498 ~~~~~~~~~~~~~~a 512 (533) T protein:vir:58 498 EGGEELGGELNLGGA 512 (533) T ss_pred CCccccccccccccc Confidence 11110 00 No 18 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.30 E-value=2.4e-11 Score=78.78 Aligned_cols=456 Identities=12% Similarity=0.115 Sum_probs=195.8 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHH Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEY 80 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~Y 80 (560) --+--|++. ++.... -....+...+....+..-|.-.. +.....| .....-...+|...| T Consensus 32 ~~~~~~~~~------------~~~~~~----~~~~~~~~~~~~a~~~g~~~~~~--~~~~~~~--~~~~~~~~~~l~a~Y 91 (532) T protein:vir:94 32 SLGLATAHE------------IDPTAY----SPYERNAAQNAMAMDYGLQTGRN--GRNALSF--VEATSWPGFPTLALL 91 (532) T ss_pred hhhhhhhhh------------hccccc----ccccccccccccccccccCcccc--ccccccc--ccccccchHHHHHHH Confidence 000111110 000000 00000000000000000000000 0000111 111222455777777 Q ss_pred HHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhh--hh Q lcl|NC_019530. 81 QSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDG--RQ 158 (560) Q Consensus 81 R~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDG--r~ 158 (560) + .||+++.||+.|.++|+. ..+.|..++.+++.+..+++|..+++. |++..+..+.+|.=-+.| -+ T Consensus 92 ~---~~~l~r~~Vd~~aed~~r-----~~~~i~~~~~~~~~~~~~~~i~~~~~~----l~v~~~l~~a~~~~rlyG~a~i 159 (532) T protein:vir:94 92 A---QLPEYRTMHETPADECVR-----AWGKITCSSKDELAADKATRITQKLEQ----YNVRTLVRTVVIHDQAYGGAHV 159 (532) T ss_pred H---cCchhhhhhccchHHHhh-----CCceEeeCCccccchHHHHHHHHHHHh----hhHHHHHHHHHHhhhcccceEE Confidence 6 599999999999998864 344454554444445555566655444 444555555554322333 23 Q ss_pred hhhhe-------------ecCCC-CccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccc Q lcl|NC_019530. 159 AYHVI-------------VDPTN-KKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQS 224 (560) Q Consensus 159 ~~hki-------------iD~~~-~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~ 224 (560) +++.- ++++. .+.+++.|+.|||..+.+- .....+..+ + .++-|. +|.. T Consensus 160 ~i~v~~~~~~~~~~~p~~l~~~~I~~g~~~~l~vld~~~v~p~-~~~~~dp~s---------p-~fg~P~--~y~v---- 222 (532) T protein:vir:94 160 FPHLKMDGDSVPADAPLLLSPSFVQRGCLIGFATIEPMWLSPN-AYNATDPTL---------P-SFYKPD--SWIA---- 222 (532) T ss_pred EEEeccCCccccccccccccccccccceeeEEEeechheeccc-ccccccccc---------c-ccCCce--eEEE---- Confidence 33211 11111 1224567778887665441 000000000 0 111111 0100 Q ss_pred cccccccccccceecchhceeeeecccc-----cCCCCcchhhhHHHHHHHHHHHHHHHH--HHHHHHhcCccceEEEcc Q lcl|NC_019530. 225 GTSQNFQPSQQELVFDDESIVYIDSGEE-----PLANGIVPGLLNPAIRPLNNLVTTEDA--TVIYAITRAPEKRAFYLD 297 (560) Q Consensus 225 ~~~~~~~~~~~~~~i~~daI~y~hsGl~-----~~~~~~i~syL~~AiK~~NqL~m~EDa--lVIyRitRAPeRRvFyID 297 (560) .+..+||++-+++....-+ ........|.|+++...+.+......+ .++|+-. =.|+.++ T Consensus 223 ---------~~g~~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~----~~v~k~~ 289 (532) T protein:vir:94 223 ---------TSGKKIHSSRIHTVVGRPVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFS----MTNLATD 289 (532) T ss_pred ---------ccCeeeccceEEEecCCCchhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcC----Cceeeec Confidence 0123688886655431111 111223467888887777766554443 3355422 2344444 Q ss_pred CCCCc-hHHHHHHHHH--HHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCcc-ceeecCCCCCcchHHHH-H Q lcl|NC_019530. 298 VGTLG-KKSAEEYMTM--MMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNAT-EIATVGGGNQLGEMDHV-N 372 (560) Q Consensus 298 vGnlp-k~KAeqYl~~--im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgT-EIsTLpGg~nLgei~DV-~ 372 (560) ..++- ....++..+. +++++|. ..| .|-| .+++ +++++. .+|+-++|+ . T Consensus 290 ~a~~ls~~~~~~~~~r~~~~~~~~~------n~g-------~~~i-----------d~~~e~~e~~~--~~lsgl~~~l~ 343 (532) T protein:vir:94 290 MAQLLAPGGAQSLDARLQLFNLYRD------NRN-------IGAL-----------DKGTEEIQQTN--TPLSGLDSLQA 343 (532) T ss_pred hHHhhcchhHHHHHHHHHHHHhhcC------Ccc-------ceEE-----------cCCCceeEEEe--cccCCHHHHHH Confidence 32221 1111111111 1122211 112 1111 0111 244442 345555554 7 Q ss_pred HHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhccCCChhHHHHhhh Q lcl|NC_019530. 373 YFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRR-YSHFFMEFLRRQLILKGVTDEKDWNEKIK 451 (560) Q Consensus 373 YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~r-fs~if~d~Lk~qLiLkgiit~eew~~~i~ 451 (560) .|...+--+++||+.||-.++.-.++.+ +.+ |.-.|+.||+++|.. +..+...+++. |++.-+.+. . T Consensus 344 ~~~~~iAaa~~IP~t~LfG~sp~Glnst-Ge~---D~~~yyd~I~s~Qe~~l~p~le~l~~~-l~~s~~g~~-------~ 411 (532) T protein:vir:94 344 QSQEQMAAVSHIPLVKLLGITPNGLNAS-SDG---EIRVWYDFIAGYQATNLTPLMEWIIDL-IQLSEYGQI-------D 411 (532) T ss_pred HHHHHHHhHhCCCeeeeecCCccccccc-chH---HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHhcCCC-------C Confidence 8899999999999999865543223222 111 333499999999955 45555555533 222222221 1 Q ss_pred hhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCH-----------HHHHHHHHHHHHhhh Q lcl|NC_019530. 452 PFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSD-----------EEVKEQQTKIAEEKK 520 (560) Q Consensus 452 ~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tD-----------eEI~e~~kqIe~E~~ 520 (560) +.+.|.|..=...++...+|+...+.++++.+-. .-.+|.+-+++. |++.. +++++...++++.+. T Consensus 412 ~d~~~~f~pL~~~s~kEkAei~~~~a~a~~~~~~--~Gvi~~~Evr~~-l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 488 (532) T protein:vir:94 412 PGLAWEWSPLMELDDKELAEVRQLNASTDSTLME--LGVIDAKMVQQR-LAADPTSGYAGALGERDELDDVEEIAKQLMA 488 (532) T ss_pred CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--cCCCCHHHHHHH-HhcCCccccccccccccccccccchhhhhcc Confidence 2356667655556676778888888888877632 237888888864 54332 333333333333332 Q ss_pred cCCCCCCCccccccccccccC--CCCCCcccccccCCcccCC Q lcl|NC_019530. 521 KGLYPKVQADETGNYSGSDVS--PLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 521 ~~~~~~p~~~e~g~~~~~~~~--~~~~~p~~~~~~~~~~~~~ 560 (560) +. ..+|...+++...+++.. +-.-.|.-....|+.+.-+ T Consensus 489 ~~-~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 529 (532) T protein:vir:94 489 AA-LNPPATAPQTPNPQPDSEDDQTDNQPDAQADPAQNDQPV 529 (532) T ss_pred cc-cCCCCCCCCCCCCCCCCCCCCCCCccCCCccccccCCCc Confidence 22 222222222221111111 1112233333344444444 No 19 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.26 E-value=3e-11 Score=78.28 Aligned_cols=461 Identities=14% Similarity=0.092 Sum_probs=200.9 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhccc--c--cchhhhcc--ccccccCCCCCCCCceeEeccccceeccc-----cccc Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKL--A--QKQDELLL--TKATVVALDDAQDGSIILQGGANTYNYVG-----VESE 69 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~--~~~~e~~~--~k~~s~~~~~~~dG~~~i~~~~~~~~y~~-----~e~~ 69 (560) |+-.| .+.. ..+...+. . .++-++.. --.+|..++...|+-....++........ +... T Consensus 37 ~~~~~--------~~~~--~~~~~~~~~~~~~~~~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 106 (765) T protein:vir:96 37 MIKLG--------KIRG--WNVEPEKAPVIRSVKDFLEPGLSVAMDSAYGDGPTPAAKAAAGGQNPYVVPTMLQDWYNSQ 106 (765) T ss_pred chhHH--------HHhh--cccccccCCCCCCCCcccCcccceeccccccccccchHHHhhhccCccchhhHHHhhhccc Confidence 32222 1111 00000000 0 00000000 00111111111111111111111000000 0111 Q ss_pred hhhHHHHHHHHHHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHH Q lcl|NC_019530. 70 LLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKI 149 (560) Q Consensus 70 ~~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~ 149 (560) .-.-+||...|++ |+.++.||+.+..||+- ..+.|.-+.. +..+...++ |+..++-|++..+..+.+ T Consensus 107 ~f~gyql~alY~~---~~l~rkiVd~pAeDa~R-----~g~~I~~~~~-e~~~~~~~~----l~~~~~rl~v~~~l~ea~ 173 (765) T protein:vir:96 107 GFIGYQACAIISQ---HWLVDKACSMSGEDAAR-----NGWELKSDGR-KLSDEQSAL----IARRDMEFRVKDNLVELN 173 (765) T ss_pred CCccHHHHHHHHh---CchhhhhhhcchHHhhc-----CCceeecCcc-ccCHHHHHH----HHHHHHHhhHHHHHHHHH Confidence 1234577776665 99999999999999864 2355554442 234444444 444455556678888887 Q ss_pred H--HhhhhhhhhhhheecCCC-------------CccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCc Q lcl|NC_019530. 150 R--KWYVDGRQAYHVIVDPTN-------------KKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPN 214 (560) Q Consensus 150 R--~wYvDGr~~~hkiiD~~~-------------~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~ 214 (560) | |.|-.|-+++ +++..+ .+.+++.|+.|||..+.....- +.......-.+|.| T Consensus 174 ~~~RlyGga~i~i--~i~~~D~~~l~~PL~~~~I~kg~~kgl~vldp~~~~~~~v~---------e~~~Dp~sp~fg~P- 241 (765) T protein:vir:96 174 RFKNVFGVRIALF--VVESDDPDYYEKPFNPDGIAPGSYKGISQIDPYWAMPQLTA---------ESTADPSAEHFYEP- 241 (765) T ss_pred HHhhhceeeEEEE--EecccCcchhhccccccccccceeeEEEEechhhcccccch---------hccccccccccCcc- Confidence 7 7777665554 333222 1235677888888666553210 00000001112222 Q ss_pred cccccccccccccccccccccceecchhceeeeecccc-----cCCCCcchhhhHHHHHHHHHHHHH--HHHHHHHHHhc Q lcl|NC_019530. 215 YNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEE-----PLANGIVPGLLNPAIRPLNNLVTT--EDATVIYAITR 287 (560) Q Consensus 215 ~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~-----~~~~~~i~syL~~AiK~~NqL~m~--EDalVIyRitR 287 (560) ..|..+| -+||++-++.....-+ +.......|-|+++...+..+... +-+.++++- T Consensus 242 -~~y~i~g--------------~~IH~SRli~~~g~~lpd~lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~-- 304 (765) T protein:vir:96 242 -DFWIISG--------------KKYHRSHLVVVRGPQPPDILKPTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSK-- 304 (765) T ss_pred -eeeeecC--------------ceeccceEEEecCCCchhhhccccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHh-- Confidence 1111111 2567775544332211 112223557787776555544322 344555542 Q ss_pred CccceEEEccCCCCc-hHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcc Q lcl|NC_019530. 288 APEKRAFYLDVGTLG-KKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLG 366 (560) Q Consensus 288 APeRRvFyIDvGnlp-k~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLg 366 (560) -. +++.+|....- ..++...--+.++++|+ ..| .+-+ +++-+++++. .+|+ T Consensus 305 -~~-~v~k~~~~~~l~~~~~l~~r~~~~~~~r~------n~g-------~~~i-----------d~ee~~e~~s--~~ls 356 (765) T protein:vir:96 305 -RT-STIHVDVEKAIANEDAFNARLAFWIANRD------NHG-------VKVI-----------GIDETMEQFD--TNLS 356 (765) T ss_pred -cc-ceeeechHhhhccHHHHHHHHHHHHHhcC------Cce-------eEEe-----------cCCcceeEEe--cccC Confidence 22 35666655321 11111111112222221 111 1111 0112344444 3666 Q ss_pred hHHHH-HHHHHHHHHhcCCCccccCCCC--cccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCh Q lcl|NC_019530. 367 EMDHV-NYFREKLYDALMIPKSRLQEEG--SINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDE 443 (560) Q Consensus 367 ei~DV-~YF~~kLy~aL~VP~sRl~~~~--~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~ 443 (560) -++|+ ..|...+=-+.+||+.||-.++ |+|-.+. . |.-.|..+|+++|...-.-+++-|=.-|++.+.+. T Consensus 357 gl~d~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe--~----D~~nYyD~I~s~Qe~~l~p~le~L~~li~~s~~i~- 429 (765) T protein:vir:96 357 DFDSVIMNQYQLVAAIAKTPATKLLGTSPKGFNATGE--H----ETISYHEELESIQEHIFDPLLERHYLLLAKSESID- 429 (765) T ss_pred CHHHHHHHHHHHHHhhhCCCeeeeccCCcccccCcch--H----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC- Confidence 67774 6789999999999999996654 5543322 1 34449999999996543333333333455555432 Q ss_pred hHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHh--------CCCHHHHHHHHHHH Q lcl|NC_019530. 444 KDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVL--------RMSDEEVKEQQTKI 515 (560) Q Consensus 444 eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL--------~~tDeEI~e~~kqI 515 (560) +.+.|.|..=..=+|...+|+...+.++++.+-.- -.+|.+-+++.+- .++|+|++.+. .+ T Consensus 430 --------~d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~--Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~~-~~ 498 (765) T protein:vir:96 430 --------VQLEIVWNPVDSTTSQQQAELNNKKAATDEIYINS--GVVSPDEVRERLRDDPRSGYNRLTDDQAETEP-GM 498 (765) T ss_pred --------CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhc--CCCCHHHHHHHHhccccCCCCCCCcccccccc-CC Confidence 33566676655567777888888888888775222 3788888887532 24565554221 12 Q ss_pred HHhhhcCCCCCCC--------ccccccccccccCCCCCCccccc----------ccCCcccCC Q lcl|NC_019530. 516 AEEKKKGLYPKVQ--------ADETGNYSGSDVSPLKFKPETIP----------FSGSTDDSI 560 (560) Q Consensus 516 e~E~~~~~~~~p~--------~~e~g~~~~~~~~~~~~~p~~~~----------~~~~~~~~~ 560 (560) ..|-. +-+..+. ++++....+....+.+-+-...| -.|++..+- T Consensus 499 ~pe~~-~~~~~~~~~~~~~~~e~~~~~a~p~~~eg~~~~~~~~p~~~~p~~~~~~~~~g~~~~ 560 (765) T protein:vir:96 499 SPENL-AELEKAGAQSAKAKGEAERAEAQAGAVEGAGDPVPAAPRGTKPLAKAAEEGAGEAAT 560 (765) T ss_pred Ccccc-ccccCCCcccccccCccccccCCCCccCCCCcccccCCcccCCccccccccCccccC Confidence 11110 0011111 11111100000000000000001 001111110 No 20 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.22 E-value=2.3e-10 Score=73.46 Aligned_cols=452 Identities=12% Similarity=0.102 Sum_probs=195.0 Q ss_pred hhhhhhhhhhcccccchhhhcccc----------------------ccccCCC-----CCCCC---ceeEeccc----cc Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTK----------------------ATVVALD-----DAQDG---SIILQGGA----NT 60 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k----------------------~~s~~~~-----~~~dG---~~~i~~~~----~~ 60 (560) |+-|+.+...+-+..++.++..++ ..++.+| ...|| +...++.. +. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~ 80 (537) T protein:vir:10 1 MFKFWRKKTVEAVQSSIAERIEPRVGIFGAGDDEKPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTF 80 (537) T ss_pred CCCccccccccccccccccccccccCCCcccchhhHHHHHHhhhhccCCCCCccCcccccccccccchhccccccchhhh Confidence 222222211111111112221110 0011111 11121 11111100 00 Q ss_pred eecc----------ccc-cchhhHHHHHHHHHHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHH Q lcl|NC_019530. 61 YNYV----------GVE-SELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESIT 129 (560) Q Consensus 61 ~~y~----------~~e-~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~ 129 (560) +.+. .+. +.+ .-++|...|+ .|++++.||+.+..+|+- ..+.+..++.++.++...++|. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~-~~~~l~a~Y~---~~~l~r~iVd~~A~d~~r-----~~~~i~~~~~~~~~~~~~~~l~ 151 (537) T protein:vir:10 81 SAYANPNLSEGLVLWYAQQAF-IGHQMCALIA---THWLVNKACSQMPRDAMR-----KGYKIISDDGNELDPKDAKFID 151 (537) T ss_pred hhhccccccchhhhhccccCC-ccHHHHHHHH---hCchhhhhhhhhhHHhhc-----CCceeecCCcccccHHHHHHHH Confidence 0000 011 112 2356776665 699999999999998853 3455555554444555555555 Q ss_pred HHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhhee---cCCC----------CccCeeEeEEeCccccchhh--hhcccC Q lcl|NC_019530. 130 ECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIV---DPTN----------KKGGIKKLVMLDSRCIRPVY--IVEKAM 194 (560) Q Consensus 130 eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkii---D~~~----------~k~GI~eLr~LDPr~i~~vr--~i~~~~ 194 (560) .+.+ -|++..+..+.+|.=-+.|.=+.-..+ |+.. .+.+|+.|+.|||..+.+.. .+. .+ T Consensus 152 ~~~~----~l~~~~~l~~a~~~~rlyG~~~i~i~v~~~D~~~~~~Pl~~~~i~kg~~k~l~vidp~~~~~~~~~~~~-~d 226 (537) T protein:vir:10 152 RYDR----AFNIKKHAIQFVRKGRIFGIRIALFKVDSPDPYYYEKPFNIDGVMPGAYKGIVQIDPYWCAPLLDAQAS-SN 226 (537) T ss_pred HHHH----HhhHHHHHHHHHHhcccccceEEEEeecCcCCcccccccccccccccceeEEEEechhhcccccchhhh-cc Confidence 5544 445566666665532333432221122 3221 23467889999997776532 111 11 Q ss_pred CccccccccccchheeecCccccccccccccccccccccccceecchhceeeeecc-----cccCCCCcchhhhHHHHHH Q lcl|NC_019530. 195 REGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSG-----EEPLANGIVPGLLNPAIRP 269 (560) Q Consensus 195 ~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsG-----l~~~~~~~i~syL~~AiK~ 269 (560) ... -.+|-|. .|.. +..+||++-++..+.. +-........|.|+++... T Consensus 227 p~s----------p~fg~P~--~y~v--------------~g~~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~ 280 (537) T protein:vir:10 227 PVS----------MHFYEPT--YWLI--------------NGKKYHRSHLAIYINDEVVDFLKPSYIYGGVPLPQQIMER 280 (537) T ss_pred CCc----------cccCCce--eeee--------------cCeEecceeEEEecCCCCchhhhcccCcccccHHHHHHHH Confidence 000 0112221 1111 1236777766543311 1111223356778776655 Q ss_pred HHHHHHHHH--HHHHHHHhcCccceEEEccCCC-CchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhccc Q lcl|NC_019530. 270 LNNLVTTED--ATVIYAITRAPEKRAFYLDVGT-LGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLP 346 (560) Q Consensus 270 ~NqL~m~ED--alVIyRitRAPeRRvFyIDvGn-lpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLp 346 (560) +.+...... +.++|+-. =+++.+|.-. |....+..-.-++++++|+ ..| .|-+- T Consensus 281 l~~~~~t~~~~~~l~~~~~----~~v~k~~~~~~l~~~~~~~~r~~~~~~~r~------n~g-------~~~id------ 337 (537) T protein:vir:10 281 VYAAERTANEGPMLAMTKR----QTVLKVDAAQVLANKQQFDETMSWWTATRD------NYQ-------VRVVD------ 337 (537) T ss_pred HHHHHHHHHHHHHHHHhcC----CceeeechHHhhcCHHHHHHHHHHHHhhcC------Ccc-------eeEec------ Confidence 544433222 23333322 2356665311 1111111111112223222 111 11100 Q ss_pred ccCCCCccceeecCCCCCcchHHHH-HHHHHHHHHhcCCCccccCCCC--cccccccchhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 347 RREGQNATEIATVGGGNQLGEMDHV-NYFREKLYDALMIPKSRLQEEG--SINIGGSNLAEITQEELRFSKFCAGLRRRY 423 (560) Q Consensus 347 RReGgrgTEIsTLpGg~nLgei~DV-~YF~~kLy~aL~VP~sRl~~~~--~~~l~~~~~~eItRDElkF~Kfi~rLr~rf 423 (560) .+ +-++.++- .+|+-++|+ ..|...+=.+++||+.||-.++ |+|-.+ .+ |.-.|+.+|+++|.++ T Consensus 338 -~e---~e~~e~~~--~~lsgl~~~l~~~~~~iAa~~~IP~t~L~G~sp~GlnatG---e~---D~~~yyd~I~~~Qe~l 405 (537) T protein:vir:10 338 -KD---NEDVVQID--TTLNDLDKVIMNQYQLVCAIARTPAPKMLGTVPTGFNSTG---DY---EEASYHEECESTQDDM 405 (537) T ss_pred -CC---CceeEEEe--ccCCCHHHHHHHHHHHHHhhhCCCceeeccCCccccccch---hH---HHHHHHHHHHHHHHHH Confidence 00 11222222 345556664 5777778888899999985543 444222 21 3344999999999988 Q ss_pred HHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCC Q lcl|NC_019530. 424 SHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRM 503 (560) Q Consensus 424 s~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~ 503 (560) ..++..+++. |+...+..+. .+.|.|..=...+|...+|+...+.++++.+-.- -.+|.+-+++. |+. T Consensus 406 ~p~l~~l~~l-l~~~~~~~~~--------~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~--G~i~~~Evr~~-L~~ 473 (537) T protein:vir:10 406 RPLIDRHHQL-VCRSHLRKRI--------RVKVEFPPMDAPKESERADTFLKKMQAAKLAFEM--GAVDGVDVNEY-LRM 473 (537) T ss_pred HHHHHHHHHH-HHHhcCCCCc--------ceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHc--CCCCHHHHHHH-Hhc Confidence 7776665533 2222332322 2566676656677888899988888888776432 27788877755 443 Q ss_pred C-------------HHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccccC-Cccc Q lcl|NC_019530. 504 S-------------DEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSG-STDD 558 (560) Q Consensus 504 t-------------DeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~-~~~~ 558 (560) . ++++++. .++.|.+..- -.|.+..+++.++....+. .|+--+.+| +|+| T Consensus 474 ~~~~g~~~l~~~~~~ed~e~~--~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~--~~~~~~~~~a~~~~ 537 (537) T protein:vir:10 474 DPTLGFTSITPAMRPTDAEDI--DVDDEGKPVR-IIEDQPAPSEMFGATSSGE--SANDPRDSGAAFED 537 (537) T ss_pred cCccccccccCCCChhhhhcc--cCCccCCcCC-CCCCCCCccccCCCCcccc--ccCCCccCccccCC Confidence 2 1222111 1111111100 0011111111111111111 111112222 2333 No 21 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.22 E-value=2.7e-11 Score=78.48 Aligned_cols=395 Identities=13% Similarity=0.125 Sum_probs=179.0 Q ss_pred CCCCCCceeEecc-ccceeccccccchhhHHHHHHHHHHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHH Q lcl|NC_019530. 45 DDAQDGSIILQGG-ANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDT 123 (560) Q Consensus 45 ~~~~dG~~~i~~~-~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~s 123 (560) .+..||=...-.| .....+.+.- .-++-.+|...| +.|+.++.||+.+..||+-. .+.| +. +. T Consensus 1 ~~~~D~~~n~~~gg~~~~~~~~~~-~~~~~~~l~a~Y---~~~~l~~~~Vd~~aed~~r~-----g~~i--~~-----~~ 64 (422) T protein:vir:10 1 MVKTDSYANIFLGGSDGSEIYGSL-QNQAPTILASLY---ADNALVRRIIDTIPETALAA-----GFHI--DG-----ID 64 (422) T ss_pred CccchhhHHHHcCCCCCccccCcc-cccCHHHHHHHH---HhChhhHHHHhhhhHHHhcC-----Cccc--cC-----CC Confidence 3333332222111 1111111111 122446666666 46999999999999999632 2223 22 12 Q ss_pred HHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhhee-cC------CCCccCeeEeEEeCccccchhhhhcccCCc Q lcl|NC_019530. 124 VKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIV-DP------TNKKGGIKKLVMLDSRCIRPVYIVEKAMRE 196 (560) Q Consensus 124 ik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkii-D~------~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~ 196 (560) -++++.++++. |++..+..+.+|.=-+.|.=+.-..+ |. -+++..|+.|+.|||..+.+.... .+.. T Consensus 65 ~~~~~~~~~~~----l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~~~~--~dp~ 138 (422) T protein:vir:10 65 DEPAFWSRWDD----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQTRE--ENPR 138 (422) T ss_pred HHHHHHHHHHH----hhHHHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccccchhcc--cCcc Confidence 23456666553 56677777766644444433222222 43 346778999999999877653111 0100 Q ss_pred cccccccccchheeecCccccccccccccccccccccccceecchhceeeeecc-----cccCCCCcchhhhHHH-HHHH Q lcl|NC_019530. 197 GGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSG-----EEPLANGIVPGLLNPA-IRPL 270 (560) Q Consensus 197 g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsG-----l~~~~~~~i~syL~~A-iK~~ 270 (560) . -.++.|. .|...+. ..+...+||++-++..+.- +-........|-|.++ ...+ T Consensus 139 s----------~~fg~P~--~y~v~~~--------~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i 198 (422) T protein:vir:10 139 N----------ARFGEPL--TYRITTN--------ESDMFYDVHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILDSI 198 (422) T ss_pred c----------cccCcce--EEEEecC--------CCCcceeeccceeEEeCCCCchhhhcccCCcccchhHHHHHHHHH Confidence 0 0111111 1111111 1223467888865543210 1111222234666653 2322 Q ss_pred HHHHH-HHH-HHHHHHHhcCccceEEEccC-CCCc--hHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcc Q lcl|NC_019530. 271 NNLVT-TED-ATVIYAITRAPEKRAFYLDV-GTLG--KKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWL 345 (560) Q Consensus 271 NqL~m-~ED-alVIyRitRAPeRRvFyIDv-Gnlp--k~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywL 345 (560) ..+.. ++- +.++++-. =+|+.++. .++- ..+..+-+..++..-++| ..+|- |.+. T Consensus 199 ~~~~~~~~~~~~l~~~~~----~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~----~~~~~-------~~l~----- 258 (422) T protein:vir:10 199 KDYTNCERLATQLLKRKQ----QAVWKAKGLAELCDDSEGFGAARLRLAQVDNNS----GVGQA-------IGID----- 258 (422) T ss_pred HHHHHHHHHHHHHHHHhc----cccccchhHHHhcCCccchHHHHHHHHHHHHhc----CCccc-------eeEe----- Confidence 22222 222 34455542 23455552 2210 011111111111111112 11121 1110 Q ss_pred cccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHH-H Q lcl|NC_019530. 346 PRREGQNATEIATVGGGNQLGEMDH-VNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRR-Y 423 (560) Q Consensus 346 pRReGgrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~r-f 423 (560) +.+-+++++. .+|+-++| +..|...+-.+.+||+.||-.++.-.++.++ .+ |--.|+.+|+++|.. + T Consensus 259 -----~~~e~~e~~~--~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~Glnatg-d~---d~~~yyd~i~~~Qe~~l 327 (422) T protein:vir:10 259 -----AESEEYSVLN--SDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGVSSSQ-NT---ALETFHKLVDRKRNAEL 327 (422) T ss_pred -----cCCcceEEEe--cccCChHHHHHHHHHHHHhhhCCCeeeeccCCcccccccc-hH---HHHHHHHHHHHHHHHHH Confidence 1112233332 23444555 4789999999999999999655543333221 22 223499999999964 3 Q ss_pred HHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCC Q lcl|NC_019530. 424 SHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRM 503 (560) Q Consensus 424 s~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~ 503 (560) ..+...+++. |+-+++ +.|.|..-..=+|...+|+...+.++++.+-.- -.+|.+-+++. |+. T Consensus 328 ~p~l~~l~~~------i~~s~~--------~~~~f~pL~~~sekekaei~~~~a~a~~~~~~~--g~i~~~e~r~~-L~~ 390 (422) T protein:vir:10 328 LPILEFLIPF------IVNAEE--------WSVEFNPLAQESSKDKAEILEKNVNSIAALIAA--GAMDIDEARDT-LRT 390 (422) T ss_pred HHHHHHHHHH------hcccCC--------cEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhc--CCCCHHHHHHH-hhh Confidence 3333333222 222334 456676666667777788888888887775332 25677777644 322 Q ss_pred CHHH------HHHHHHHHHHhhhcCCCCCCCcccccc Q lcl|NC_019530. 504 SDEE------VKEQQTKIAEEKKKGLYPKVQADETGN 534 (560) Q Consensus 504 tDeE------I~e~~kqIe~E~~~~~~~~p~~~e~g~ 534 (560) .-.+ +..++-..+++. ++|.++.+++ T Consensus 391 ~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~d 422 (422) T protein:vir:10 391 IAPEVKINDGSVETEVTISETS-----NDPLEVPTDD 422 (422) T ss_pred hcccccCCCCCCccccchhhcC-----CCCCCCCCCC Confidence 1000 001000111110 2222222222 No 22 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.05 E-value=1.3e-09 Score=69.38 Aligned_cols=411 Identities=12% Similarity=0.092 Sum_probs=193.2 Q ss_pred CCCCCCceeEecccc----ceeccccccchhhHHHHHHHHHHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccch Q lcl|NC_019530. 45 DDAQDGSIILQGGAN----TYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGI 120 (560) Q Consensus 45 ~~~~dG~~~i~~~~~----~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~ 120 (560) .+..||=.....|.. ...|.-......+-.+|...| +.|+.++.||+.+..||+-+- +.+.-++. T Consensus 1 ~~~~D~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~a~Y---~~~~l~~~~vd~~a~d~~r~~-----~~i~~~d~--- 69 (437) T protein:vir:52 1 MKFFDGIKSLALKLGSKQEQTYYSPSLSLTDDLVQLEALW---RDNWIANKVCIKRPEDMVRNW-----REIYSNDL--- 69 (437) T ss_pred CchhhhhHhHHhcCCCccccceeecCccccccHHHHHHHH---HhCchhhHHhhcchHHhhcCC-----ceEecCCC--- Confidence 222222111111100 011111111223445666666 579999999988888877543 22322221 Q ss_pred hHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecC------CCCccCeeEeEEeCccccchhhhhcccC Q lcl|NC_019530. 121 SDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDP------TNKKGGIKKLVMLDSRCIRPVYIVEKAM 194 (560) Q Consensus 121 s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~------~~~k~GI~eLr~LDPr~i~~vr~i~~~~ 194 (560) .+...+++ +..++-|++..+..+.+|-==..|.=+.-.++|. -+++++++.|+.|||.++.++-. ++ T Consensus 70 ~~~~~~~~----~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~~---~~ 142 (437) T protein:vir:52 70 NSKQLDLF----TKFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPTGT---KD 142 (437) T ss_pred CHHHHHHH----HHHHHhhcHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechhhcccccc---cc Confidence 22333344 4445555667777776552223444333334443 33567899999999988876421 11 Q ss_pred CccccccccccchheeecCccccccccccccccccccccccceecchhceeeeeccc--ccCCCCcchhhhHHHHHHHHH Q lcl|NC_019530. 195 REGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGE--EPLANGIVPGLLNPAIRPLNN 272 (560) Q Consensus 195 ~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl--~~~~~~~i~syL~~AiK~~Nq 272 (560) .+ ...--+|.|. .|...+ .+...+||++-+++....- .+.......|.|+++...+.. T Consensus 143 ~d--------p~s~~fg~p~--~y~v~~----------~~~~~~iH~SRii~~~~~~~~~~~~~~~G~s~le~~~~~i~~ 202 (437) T protein:vir:52 143 DD--------VLSPNFGRYS--EYSILG----------GSQSITVHHSRLIILNANDAPLSDNDIWGVSDLEKIIDVLKR 202 (437) T ss_pred cc--------ccccccCcce--EEEEec----------CCcceeEccceeEEecCccCCCccccccCCchHHHHHHHHHH Confidence 00 0000112221 111111 1234678888766553211 222444567899998777765 Q ss_pred HHHHHHH--HHHHHHhcCccceEEEccC--CCCchHHHHHHHH---HHHHhhhhheeecccCCcccCcccchhhHhhhcc Q lcl|NC_019530. 273 LVTTEDA--TVIYAITRAPEKRAFYLDV--GTLGKKSAEEYMT---MMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWL 345 (560) Q Consensus 273 L~m~EDa--lVIyRitRAPeRRvFyIDv--Gnlpk~KAeqYl~---~im~k~knk~vYd~~TGev~d~~~~msmlEDywL 345 (560) +...+.+ .++++ +... ++.++. ..|.. ..++-++ +.++++|+ ....+-| T Consensus 203 ~~~~~~~~~~l~~~---~~~~-v~k~~~l~~~l~~-~~~~~~~~~~~~~~~~~~-------------~~~~~~~------ 258 (437) T protein:vir:52 203 FDSASVNVGDLIFE---SKID-IFKIAGLSDKIAA-GMENEVASVISAVQEIKS-------------ATNSLLL------ 258 (437) T ss_pred HHHHHHHHHHHHHH---cCCC-ceecchHHHHhcC-CcHHHHHHHHHHHHHhcC-------------CCceEEE------ Confidence 5444433 33443 4333 455541 01111 0111122 22222221 1112211 Q ss_pred cccCCCCccceeecCCCCCcchHHHH-HHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHH-H Q lcl|NC_019530. 346 PRREGQNATEIATVGGGNQLGEMDHV-NYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRR-Y 423 (560) Q Consensus 346 pRReGgrgTEIsTLpGg~nLgei~DV-~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~r-f 423 (560) ..+-+++++. .+|+-++|+ ..|+..+=.+.+||+.||-.++.-.++ + +. -|.-.|..+|+++|.. + T Consensus 259 -----d~~~~~e~~~--~~~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s~~Gla-s-ge---~D~~~yyd~i~~~Qe~~l 326 (437) T protein:vir:52 259 -----DAENEYDRKE--LTFTGLKDLLTEFRNAVAGAADMPVTILFGQSVSGLA-S-GD---EDIQNYHEAIRRLQETRL 326 (437) T ss_pred -----cCCcceEEEe--cCcCCHHHHHHHHHHHHHHHhcCchhhhcCcCccccc-c-cH---HHHHHHHHHHHHHHHHHH Confidence 0122344443 345556665 688999999999999999666543332 2 12 2334499999999964 4 Q ss_pred HHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCC Q lcl|NC_019530. 424 SHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRM 503 (560) Q Consensus 424 s~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~ 503 (560) ..+...+++ .|++...-. +-+.+.|.|..=..-+|...+|+...+.++++.+-.- -.+|.+-+++. |+ T Consensus 327 ~p~le~l~~-~i~~~~~g~-------~~~~~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~--g~i~~~e~r~~-L~- 394 (437) T protein:vir:52 327 RPIFEIIDP-LICNELFGG-------LPADWWFEFVPLTTVKQEQQINMLNTFATAANTLIQN--GVLNEYQIANE-LR- 394 (437) T ss_pred HHHHHHHHH-HHHHHhcCC-------CCCcceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhc--CCCCHHHHHHH-HH- Confidence 554444444 233331111 1123566776555567777788888888777765322 14555555433 22 Q ss_pred CHHHHHHHHHHHHHhhhcCCCCC-CCcc---ccccccccccCCCCCC-cccccccCCccc Q lcl|NC_019530. 504 SDEEVKEQQTKIAEEKKKGLYPK-VQAD---ETGNYSGSDVSPLKFK-PETIPFSGSTDD 558 (560) Q Consensus 504 tDeEI~e~~kqIe~E~~~~~~~~-p~~~---e~g~~~~~~~~~~~~~-p~~~~~~~~~~~ 558 (560) + .|.|+. |+++ .++.+.+.++...+-. ++..|..++ +- T Consensus 395 -------------~---~g~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 437 (437) T protein:vir:52 395 -------------E---SGLFANISAEHIEELKNADEFAGNFEEPEKMEGAQVQNSE-DQ 437 (437) T ss_pred -------------h---cCCCCCCCccccccccCCCCCCCccCCCCCCCCCCCCCCC-CC Confidence 1 244442 1111 1111111222222222 222222222 22 No 23 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=98.96 E-value=8.9e-10 Score=70.21 Aligned_cols=403 Identities=14% Similarity=0.151 Sum_probs=178.0 Q ss_pred CCCCCCCCceeEecccc-ceeccccccchhhHHHHHHHHHHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchh Q lcl|NC_019530. 43 ALDDAQDGSIILQGGAN-TYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGIS 121 (560) Q Consensus 43 ~~~~~~dG~~~i~~~~~-~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s 121 (560) ...-+.||=..+-+++. ...|+... -....+|...| +.|+.++.||+.+..||+-+- +.+ +. ++ T Consensus 1 ~~~~~~d~~~~~~~~~~~~~~~~~~~--~~~~~~l~a~Y---~~~~l~~~~Vd~~aed~~r~g-----~~i--~g-~~-- 65 (427) T protein:vir:10 1 MKIVKHDGYNDIFNGGADGSPKPFFM--SDASYHVGSFY---NDNATAKRIVDVIPEEMVTAG-----FKM--SG-VK-- 65 (427) T ss_pred CCccccchHHHHhhcCCCCcccCccc--cCchHHHHHHH---HcCchhhhhhccchHHhhcCC-----ccc--cC-cc-- Confidence 11122222111111111 11222221 11234665555 569999999999999988432 223 22 11 Q ss_pred HHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhh-eec------CCCCccCeeEeEEeCccccchhhhhcccC Q lcl|NC_019530. 122 DTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHV-IVD------PTNKKGGIKKLVMLDSRCIRPVYIVEKAM 194 (560) Q Consensus 122 ~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hk-iiD------~~~~k~GI~eLr~LDPr~i~~vr~i~~~~ 194 (560) -+++| +..++.|++..+..+.+|.=-+.|.=+.-. |-| |.+++.+|+.|+.|||..+.+-... .+ T Consensus 66 --~~~~~----~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~~~~--~d 137 (427) T protein:vir:10 66 --DEKEF----KSLWDSYKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFAITVEKRV--TN 137 (427) T ss_pred --HHHHH----HHHHHHhhHHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhcccccccc--cC Confidence 12334 444445566777777776544555333211 223 4477889999999999776552111 01 Q ss_pred CccccccccccchheeecCccccccccccccccccccccccceecchhceeeeec-cc---c-cCCCCcchhhhHHHHHH Q lcl|NC_019530. 195 REGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDS-GE---E-PLANGIVPGLLNPAIRP 269 (560) Q Consensus 195 ~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hs-Gl---~-~~~~~~i~syL~~AiK~ 269 (560) ... --+|.|. .|...+. ......+||++-++..+. -+ . ........|.|.+++ T Consensus 138 p~s----------~~fg~P~--~y~v~~~--------~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~-- 195 (427) T protein:vir:10 138 ARS----------PRYGEPE--IYKVSPG--------DNMQPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSL-- 195 (427) T ss_pred ccc----------cccCcce--EEEEecC--------CCCcceEEccccEEEecCCCchhhhcccCCcccchhhhHHH-- Confidence 000 0011111 1111111 112346888886654421 00 0 012222345565542 Q ss_pred HHHHHHHHH-----HHHHHHHhcCccceEEEc-cCCCC-chHHHH-HHHHHHHHhhhhheeecccCCcccCcccchhhHh Q lcl|NC_019530. 270 LNNLVTTED-----ATVIYAITRAPEKRAFYL-DVGTL-GKKSAE-EYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAE 341 (560) Q Consensus 270 ~NqL~m~ED-----alVIyRitRAPeRRvFyI-DvGnl-pk~KAe-qYl~~im~k~knk~vYd~~TGev~d~~~~msmlE 341 (560) ++.|+-+|- +.++++- -.+ |+.+ +++++ ...+++ +-...++.--++| ..+|- +.+ T Consensus 196 ~~~i~~~~~~~~~~~~l~~k~---~~~-v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~----~~~~~-------~~l-- 258 (427) T protein:vir:10 196 IDAICDYDYCESLATQILRRK---QQA-VWKVKGLAEMCDDDDAQYAARLRLAQVDDNS----GVGRA-------IGI-- 258 (427) T ss_pred HHHHHHHHHHHHHHHHHHHHh---ccc-cccchhHHHHhcCccchHHHHHHHHHHHHhc----Ccccc-------eee-- Confidence 454544443 3345543 222 3334 33221 001111 1111111111111 11111 111 Q ss_pred hhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHH Q lcl|NC_019530. 342 DYWLPRREGQNATEIATVGGGNQLGEMDH-VNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLR 420 (560) Q Consensus 342 DywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr 420 (560) | +.+-+++++. .+|+-++| +..|...+=.+.+||+.||-.++.-.++.++ .+ |.-.|..+|+++| T Consensus 259 ~--------~~~e~~e~~~--~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~sp~Glnstg-d~---D~~nyyd~i~~~Q 324 (427) T protein:vir:10 259 D--------AETEEYDVLN--SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQ-NT---ALETFYKLVDRKR 324 (427) T ss_pred e--------cCCCceeEEe--cccCChHHHHHHHHHHHHhhhCCCeeeeccCCccccccch-hH---HHHHHHHHHHHHH Confidence 0 1112233332 34444555 4789999999999999999655543333221 22 3344999999999 Q ss_pred HHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHH Q lcl|NC_019530. 421 RRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNV 500 (560) Q Consensus 421 ~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~I 500 (560) ... +...|++-+-+ |+-.++ +.|.|..-..=+|...+|+...+.++++.+-.- -.++.+-+++. T Consensus 325 e~~---l~p~l~~l~~~--i~~s~~--------~~~~f~pL~~~s~kEkaei~~~~a~a~~~~~~~--gvi~~~e~r~~- 388 (427) T protein:vir:10 325 EED---YRPLLEFLLPF--IVDEEE--------WSIEFEPLSVPSKKEESEITKNNVESVTKAITE--QIIDLEEARDT- 388 (427) T ss_pred HHH---HHHHHHHHHHH--hhcCCC--------cEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhc--CCCCHHHHHHH- Confidence 542 22333321111 222233 466777777777888888988888888775332 14566655543 Q ss_pred hCCCHHHHHHHHHHHHHhhhcCCC--CCCCccccccccccccCCCCCCcccccccCCcccC Q lcl|NC_019530. 501 LRMSDEEVKEQQTKIAEEKKKGLY--PKVQADETGNYSGSDVSPLKFKPETIPFSGSTDDS 559 (560) Q Consensus 501 L~~tDeEI~e~~kqIe~E~~~~~~--~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~ 559 (560) |+ .+-.+ .|+. .+...++.+. ...|+.-+--|.+|.- T Consensus 389 L~-----------~~~~~--~~~~~~~~~~~e~~~~---------~~e~~p~~~e~~~d~~ 427 (427) T protein:vir:10 389 LR-----------SIAPE--FKLKDGNNINIREPEE---------TTEPEPGLGEKLEDEN 427 (427) T ss_pred HH-----------hhhcc--ccCCCCccccccccch---------hcCCCCCCCCCCCCCC Confidence 21 11111 1110 1111111111 0111111112222222 No 24 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=98.95 E-value=5e-09 Score=66.09 Aligned_cols=464 Identities=12% Similarity=0.103 Sum_probs=185.8 Q ss_pred CCcccchhhh--hhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccce-------eccccccchh Q lcl|NC_019530. 1 MAGYGRGFFG--LFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTY-------NYVGVESELL 71 (560) Q Consensus 1 ~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~-------~y~~~e~~~~ 71 (560) |+- -.|+-| +.|.....++.+..- .. ....++.+.-..-....-+++... .+..+....- T Consensus 66 ~~~-~~~~~~~~~~~~~~a~~~a~~~~-------~~---~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~~~f 134 (862) T protein:vir:99 66 VEI-SDSVNAKSVSGKNFAMDSAVRSA-------IK---AITGFAMDDGGGAPVPIGAEGKQSSYAVPEALQDWYLSQGF 134 (862) T ss_pred ccc-cccccchhhhhhhhcchhhcchh-------hh---hhhhhhhhcchhhhhhccccccccccccchhccccccccCc Confidence 100 001111 111111112111000 00 000111110000000000010111 1122222222 Q ss_pred hHHHHHHHHHHHhhccchhHHHHHhhcceeeecCCCceEEEeeec-ccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHH Q lcl|NC_019530. 72 SVKTVVEEYQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDK-VEGISDTVKESITECFKEVMHLMDFDNTAYQKIR 150 (560) Q Consensus 72 ~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~-~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R 150 (560) ..++|...|+ .|+.++.||+.+..||+-. .+.|.-.. .+...+...++|.++++. |+...+..+.+| T Consensus 135 ~gyql~alY~---~~~larkiVd~pAeDatR~-----g~~I~~~~d~~e~~~e~~~~ie~~~~r----L~v~~~l~eair 202 (862) T protein:vir:99 135 IGHQACALIA---QHWLVDKACSLAGEDAIRN-----GWHLKSLGEGEEIDEESLEKFKAIDVE----FKVKENLIEFNR 202 (862) T ss_pred ccHHHHHHHH---hCchhhhhhhhhhHHHhhC-----CceEeecCcccccCHHHHHHHHHHHHH----hhHHHHHHHHHH Confidence 3456766555 5999999999999988642 22232211 112233344455555443 444555555544 Q ss_pred HhhhhhhhhhhheecCCCC-------------ccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCcccc Q lcl|NC_019530. 151 KWYVDGRQAYHVIVDPTNK-------------KGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNR 217 (560) Q Consensus 151 ~wYvDGr~~~hkiiD~~~~-------------k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~ 217 (560) -=-+.|+-+.-.+++-.++ +.+|+.|+.|||..+.++...... .....-.+|.| .. T Consensus 203 ~~RLyGga~ililv~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp~w~~p~~v~~~~---------~Dp~sp~yGkP--~~ 271 (862) T protein:vir:99 203 FKNVFGIRVAIFVVDSEDPDYYEKPFNPDGITPGSYRGISQIDPYWMMPMLTAEST---------ADPSSQFFYEP--EF 271 (862) T ss_pred hcccccceEEEEEecCcCchhhhcCcCcccccccceeEEEEechhhhccccccccc---------ccccccccCCc--ee Confidence 2222333211112222222 235678888888766653211000 00000112222 11 Q ss_pred ccccccccccccccccccceecchhceeeeecccc-----cCCCCcchhhhHHHHHHHHHHHHH--HHHHHHHHHhcCcc Q lcl|NC_019530. 218 NQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEE-----PLANGIVPGLLNPAIRPLNNLVTT--EDATVIYAITRAPE 290 (560) Q Consensus 218 ~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~-----~~~~~~i~syL~~AiK~~NqL~m~--EDalVIyRitRAPe 290 (560) |...| -+||++-++...-.-+ ...+....|-|+++...+...... .-+.++++.. T Consensus 272 y~I~g--------------~~IH~SRliif~g~~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~---- 333 (862) T protein:vir:99 272 WIISG--------------QKYHRSHLIIARGPQPADILKPTYIFGGIPLVQRIYERVYAAERTANEAPLLAMNKR---- 333 (862) T ss_pred eeecC--------------eeeccceeEEecCCCchhhhhccCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhc---- Confidence 11111 2466654443321111 112223467777654443333211 2233455432 Q ss_pred ceEEEccCCC-CchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHH Q lcl|NC_019530. 291 KRAFYLDVGT-LGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMD 369 (560) Q Consensus 291 RRvFyIDvGn-lpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~ 369 (560) -+++.+|... |....+-..=.++++++|+- .| .+-| +++-+++++- .+|+-++ T Consensus 334 l~v~ktd~l~~l~~ed~l~~r~~~~~~~rdN------~G-------i~li-----------D~eEe~e~ls--~slSGL~ 387 (862) T protein:vir:99 334 TTAIHTDTAKAIANEDKFIQRLMFWVRYRDN------HA-------VKVL-----------GTDETMEQFD--TSLADFD 387 (862) T ss_pred cceeechhHhhhccHHHHHHHHHHHHhccCc------ce-------eEEe-----------cCCCceeEEe--cccCChH Confidence 2344454432 22111110001223333321 11 1100 1112344443 3455555 Q ss_pred H-HHHHHHHHHHhcCCCccccCCCC--cccccccchhhhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhccCCChhH Q lcl|NC_019530. 370 H-VNYFREKLYDALMIPKSRLQEEG--SINIGGSNLAEITQEELRFSKFCAGLRRR-YSHFFMEFLRRQLILKGVTDEKD 445 (560) Q Consensus 370 D-V~YF~~kLy~aL~VP~sRl~~~~--~~~l~~~~~~eItRDElkF~Kfi~rLr~r-fs~if~d~Lk~qLiLkgiit~ee 445 (560) | +..|...+=.+.+||+.||-.++ |+|-.+. + |.-.|..+|+++|.+ +..++ +.|. .|+....-.+++ T Consensus 388 dll~~~~q~IAaas~IP~tiLfGqspaGlnATGE--~----D~~nYyD~I~s~QE~~L~P~L-erL~-~li~~~lg~~~d 459 (862) T protein:vir:99 388 AVIMGQYQLVASIAKTPATKLLGTAPKGFNSTGE--F----ETISYHEELESIQEHVYMPFL-QRHY-LISRLSLGIQHE 459 (862) T ss_pred HHHHHHHHHHHhhhCCCceeecccCcccccCchH--H----HHHHHHHHHHHHHHHHHHHHH-HHHH-HHHHHhcCCCCc Confidence 5 57888899999999999986554 5443221 1 334499999999964 33333 3332 233332222333 Q ss_pred HHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHh--------CCCHHHHHHHHHHHHH Q lcl|NC_019530. 446 WNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVL--------RMSDEEVKEQQTKIAE 517 (560) Q Consensus 446 w~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL--------~~tDeEI~e~~kqIe~ 517 (560) +.|.|..=..=+|...+|+.....++++.+-. .-.+|.+-++.... .++|+++++......+ T Consensus 460 --------~~ieFnpL~~~sekEkAEi~kk~Aea~~~lv~--sGvispdEvR~~L~~~~~~g~~~l~ded~E~d~~~~~e 529 (862) T protein:vir:99 460 --------IDVVMEPVASMTAQQQADLNKTKAEGGKVLID--GGVISPDEERNRIRDDKRSGYNRLTKEDAEETPGASPE 529 (862) T ss_pred --------ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--cCCCCHHHHHHHHHhcCCcCCCCCCcccccccCCCCcc Confidence 45666555556677778888888888776532 13788888887621 2556666532211111 Q ss_pred hhhcC------CCCCCCccccc-cc---cccccCCCCCCcccccccCC--cccCC Q lcl|NC_019530. 518 EKKKG------LYPKVQADETG-NY---SGSDVSPLKFKPETIPFSGS--TDDSI 560 (560) Q Consensus 518 E~~~~------~~~~p~~~e~g-~~---~~~~~~~~~~~p~~~~~~~~--~~~~~ 560 (560) +..+. .-..|..++++ .. .-++.+..+..|..-|-+++ |+..- T Consensus 530 ~~~~~e~~g~a~~~ap~de~~aga~~~~~e~d~~~~p~~~~~~~g~~~~~t~~~~ 584 (862) T protein:vir:99 530 NLAAYQKAGAAQETASAKETQAGAAVTTAEGDQPNVQMVPSMKPGQMVGPEVGIT 584 (862) T ss_pred cccccccCCcccccccccccccccCCccccCCcccccccCCCCCCCccccccccc Confidence 11110 01111111111 11 11122222222322222111 11111 No 25 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=98.94 E-value=1.2e-08 Score=63.99 Aligned_cols=459 Identities=13% Similarity=0.138 Sum_probs=197.5 Q ss_pred CC--cccchhhhhhhhhhhhhhhhhhcccccchhhhc----cc----cccccCCCCCCCCc----------eeEec---c Q lcl|NC_019530. 1 MA--GYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELL----LT----KATVVALDDAQDGS----------IILQG---G 57 (560) Q Consensus 1 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~----~~----k~~s~~~~~~~dG~----------~~i~~---~ 57 (560) || -+||||..+.- -+|++++... +. --+.+.+.+-.+|. ..+.. . T Consensus 1 ~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~d~~~~~~~r~g~~ 68 (648) T protein:vir:79 1 MARKVWGRGFWSRIS------------LMWRDEDDDKEPLVLEESMQLGEAPGAMPKGGGGGGSAKRDPKMSLVKRIGLA 68 (648) T ss_pred Cccchhcchhhhhhh------------hhccCccccccccccccccccCCCccccCCCCcccccccccchhHHHHHhHHH Confidence 66 48999976543 3455443221 00 01112221122222 11100 0 Q ss_pred ccce--eccccccchhhHHHHHHHHHHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHH Q lcl|NC_019530. 58 ANTY--NYVGVESELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEV 135 (560) Q Consensus 58 ~~~~--~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~i 135 (560) .-.+ ...++...-.+-.+|-+.| ..+|-|..||+.|.+.+.-. +..+.-++-.+ .+..+ -.+ . T Consensus 69 ~~~~~~g~~~~~epp~d~~~l~~l~---~~np~V~~aI~iia~~ia~l-----~~~i~~~~~~~-~~~~~----~~~--l 133 (648) T protein:vir:79 69 IMDGGGGGRDFEEPEFDFNEITSAY---NTEGYVRQAVDKYIEMMFKA-----DWDFVSKNPNA-VEYIR----MRF--T 133 (648) T ss_pred HHhhcCCccccccCCcCHHHHHHHH---hcChHHHHHHHHHHHHHhhC-----cceEEecCCcc-chhhH----HHH--H Confidence 0001 1112222222333343322 46899999997666554332 22222222111 11111 111 1 Q ss_pred HHHhCCchhhHHH----HHHhhhhhhhhhhheecCCCC------------ccCeeEeEEeCccccchhhhhcccCCcccc Q lcl|NC_019530. 136 MHLMDFDNTAYQK----IRKWYVDGRQAYHVIVDPTNK------------KGGIKKLVMLDSRCIRPVYIVEKAMREGGI 199 (560) Q Consensus 136 l~ll~f~~~~~~~----~R~wYvDGr~~~hkiiD~~~~------------k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~ 199 (560) +..-+.....+++ +..+++.|.-|..++.|.+.- ..-+++|..|+|.++...+ +..|. T Consensus 134 l~rPn~~~t~~~f~~~l~~~lll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~-----d~~g~- 207 (648) T protein:vir:79 134 LMAEATQIPTNQLFIEIAEDLVKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKR-----DKFGM- 207 (648) T ss_pred hhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCCCccchhhhhhhhccccceeeeEeecCceeEEEE-----cCCCc- Confidence 2222233444444 444557888888777655320 1125788889997776543 22221 Q ss_pred ccccccchheeecCccccccccccccccccccccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHH Q lcl|NC_019530. 200 EAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDA 279 (560) Q Consensus 200 ~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDa 279 (560) -.+|.|.+. | .+..+.++.+.|.|...+. +.++-+.+|-|..|+..+.....+++. T Consensus 208 ------~~~Y~y~~~-------g----------~~~~~~~~~~dIIHik~~~-~~d~~~GlSpi~~a~~aI~l~~aa~~~ 263 (648) T protein:vir:79 208 ------IKGWQQEQE-------G----------QDKPQKFKPEDIVHIYYKR-EKGRAFGTPWLLPALDDIRALRQVEEN 263 (648) T ss_pred ------eeeeEEEec-------C----------CceeEEecCccEEEEccCC-CCCCceeccHHHHHHHHHHHHHHHHHH Confidence 123445421 1 1234557778787776432 335556778999998888776666654 Q ss_pred HH-HHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceee Q lcl|NC_019530. 280 TV-IYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIAT 358 (560) Q Consensus 280 lV-IyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsT 358 (560) .- .++=---| .-|+.+..+......+++.++.+-..|++-.+ . +|+...+... T Consensus 264 ~~~fF~NGa~P-~gil~~~~~~~~~e~~k~~~e~~~~~~~~~~i---~----------------------gg~v~~~~~~ 317 (648) T protein:vir:79 264 VLRLVYRNLHP-LWHVKVGLEQEGFGAEEGEVDLVRGEVENMDV---E----------------------GGMVTTERVN 317 (648) T ss_pred HHHHHhccCCc-cEEEEeCCCccchHHHHHHHHHHHHhcccccc---c----------------------ccccccceee Confidence 43 22222223 45555555554445555566655555544321 1 1111222222 Q ss_pred cCCC---CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 359 VGGG---NQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQL 435 (560) Q Consensus 359 LpGg---~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qL 435 (560) ++-. +.+.-++-.++..+...++.+||...|+..++ ..+++++...+ -|...|..|+..+..++...+-+.+ T Consensus 318 i~~~~s~~dlqfle~rk~~~~eIa~aFgVPP~lLG~~~~--ss~stae~~~~---~~~~~i~~l~~~i~~~le~~~~~~l 392 (648) T protein:vir:79 318 ISSIASNQIIDAKEYLKHFEQRAFTVLGVSELMMGRGGT--ASRSTGDNLSS---DFKDRIKALQKVMATFINEFMVKEI 392 (648) T ss_pred ccccCCHHHHHHHHHHHHHHHHHHHHhCCCHhHcccCCC--ccchHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2211 11222333477889999999999999975443 23333332222 3777888888777777776665555 Q ss_pred HhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHH----HH Q lcl|NC_019530. 436 ILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVK----EQ 511 (560) Q Consensus 436 iLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~----e~ 511 (560) ++...++ +|-. ....|.|+|..=.--.+. .|.+.+.++ +-+-++|.+-++. .+.|..-+=. .. T Consensus 393 l~e~~l~--~~l~-~d~~ieF~~~~Llr~D~~-------~~a~~~~~l--~~~GilT~NEaR~-~lGlpPi~~g~~~~~l 459 (648) T protein:vir:79 393 LMEGGFD--PVLN-PDDKVEFRFNEIDMDSKI-------KLENQAVFL--YEHNAISEDEMRE-LIGRDPVDDGEGRAKM 459 (648) T ss_pred hhhhhcc--cccc-ccceEEEeecccchhhHH-------HHHHHHHHH--HhCCCcCHHHHHH-HhCCCCCCCCCCcccc Confidence 5554433 2211 223456665421111111 122222221 2235778888774 4676542100 00 Q ss_pred HHH---HHHh-hhcCCCCCCCcc---ccccccccccCCCCCCccc---ccccCCcccCC Q lcl|NC_019530. 512 QTK---IAEE-KKKGLYPKVQAD---ETGNYSGSDVSPLKFKPET---IPFSGSTDDSI 560 (560) Q Consensus 512 ~kq---Ie~E-~~~~~~~~p~~~---e~g~~~~~~~~~~~~~p~~---~~~~~~~~~~~ 560 (560) ..+ ...+ ...+..+.|-.. +.+.++...-++-.-.|+- +..+.++..+- T Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~eg~~~e~~~~~~~~~~~g~~~~~~~~~~~ 518 (648) T protein:vir:79 460 HLQMVTIAQATALAALAPTPAGGSSASASGDKKKKATDNKTKPTNQHGTKTSPKKQTNG 518 (648) T ss_pred ccccccchhccccccCCCCCCCCCCCCccccccccccCCCCCCCCCCCcCCCCccccch Confidence 000 0000 000000111000 0000000000000000000 00000000000 No 26 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=98.91 E-value=1e-08 Score=64.36 Aligned_cols=466 Identities=11% Similarity=0.122 Sum_probs=198.3 Q ss_pred chhhhhhhhhhh-hhhhhhhccccc-----------chhhhccccccccCCCCCCCCceeEeccccceeccccccchhhH Q lcl|NC_019530. 6 RGFFGLFGGGGL-VNAKVDTDKLAQ-----------KQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSV 73 (560) Q Consensus 6 ~~~~~~~~~~~~-~~~~~~~~~~~~-----------~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~ 73 (560) -|+|-.++..+- -.+++...+... ...+|-++++. .+ - -+..+. +.-........+...+- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~-~~--~---~~~~~~-~~~~~~g~~~~~~~~~~ 73 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKE-VA--Y---SQPVIG-SMSANPGFKTKPSIRNN 73 (547) T ss_pred CchhhhhhhhcCCccccccccccccccchhhhhhhHHHHHHhhcccc-hh--h---hchhhh-eeecccccccCCccCCh Confidence 566666655331 112222222211 11112111111 00 0 001110 11111111112223344 Q ss_pred HHHHHHHHHHhhccchhHHHHHhhcceeeec------CCCceEEEeeecccc-hhHHHHHHHHHHHHHHHHHhCCc---- Q lcl|NC_019530. 74 KTVVEEYQSMAQQPEIRKAVDIIVNDVVTCE------EDETPVTVNLDKVEG-ISDTVKESITECFKEVMHLMDFD---- 142 (560) Q Consensus 74 ~~LI~~YR~ma~~PEvd~AI~eIvneaiv~d------~~~~~V~l~Ld~~~~-~s~sik~~I~eeF~~il~ll~f~---- 142 (560) .+|-+.=+..+.+|-|..||+.|.+.+..+- .+.-...+.+.+.+. .....+..+ .+.+.+++-.++. T Consensus 74 ~~l~~l~~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~-~~l~~~l~~pn~~~~p~ 152 (547) T protein:vir:63 74 QDLHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATI-KRIESFIEKTGVDNDIN 152 (547) T ss_pred hHHHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHH-HHHHHHHHhhCCCCCCc Confidence 4443333444678999999988887655321 122222333333211 111122222 2233333333332 Q ss_pred -hhhHHHHHHh----hhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCcccc Q lcl|NC_019530. 143 -NTAYQKIRKW----YVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNR 217 (560) Q Consensus 143 -~~~~~~~R~w----YvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~ 217 (560) ....++++.| ++.|.-|+.++.|.. .-+.+|++|||.+|+.+. +.++ .+.....-|+++. T Consensus 153 ~~s~~~f~~~lv~d~ll~Gn~~~~i~rd~~---G~~~~L~~l~p~~V~~~~-----~~~g--~~~~~~~~y~~~~----- 217 (547) T protein:vir:63 153 RDSFSSFVKKIVRDTYMYDQVNFEKVFNRN---QSMVRFVAKDPTTIFFAT-----TADG--KIPDNGNRFVQVI----- 217 (547) T ss_pred cchHHHHHHHHHHHHHhhCCEEEEEEECCC---CcEEEEEEecCceeEEEE-----CCcc--ccccCceEEEEEc----- Confidence 1344555554 566888888777653 238999999999887752 2222 1111111111111 Q ss_pred ccccccccccccccccccceecchhceeeeecccc-c-CCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEE Q lcl|NC_019530. 218 NQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEE-P-LANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFY 295 (560) Q Consensus 218 ~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~-~-~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFy 295 (560) ..+....++.+.|.|.+.... + ..+++.+|-|..|++++.....++....=+=---|--+-|.. T Consensus 218 --------------~~~~~~~~~~~eiih~r~n~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~ 283 (547) T protein:vir:63 218 --------------DQKIVATFNAREMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQ 283 (547) T ss_pred --------------CCcEEEEeccccEEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEE Confidence 012234677887887763222 1 124567788999999999888887766544333354555666 Q ss_pred ccCC-CCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHH--- Q lcl|NC_019530. 296 LDVG-TLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHV--- 371 (560) Q Consensus 296 IDvG-nlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV--- 371 (560) +... +|.+..+++.-+.+...|. | ..+..+.+ ++.+ .|.++..|- .+..++.-+ T Consensus 284 ~~~~~~ls~e~~~~lk~~~~~~~~---------G-~~nagk~~-vl~~---------~g~~~~~l~--~~~~d~qfle~~ 341 (547) T protein:vir:63 284 IKAAQQQSQHALEIFKREWKNSLS---------G-INGSWQIP-VVSA---------EDVKFVNMT--PSARDMEFEKWL 341 (547) T ss_pred ecCCCCCCHHHHHHHHHHHHHHhc---------C-cccccccc-cccC---------CCceEEEcC--CChhHHHHHHHH Confidence 6644 3444333333222222332 2 12222222 2211 134555553 344444444 Q ss_pred HHHHHHHHHhcCCCccccCCCC--------cccccccchhhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhccCCC Q lcl|NC_019530. 372 NYFREKLYDALMIPKSRLQEEG--------SINIGGSNLAEITQEELRFSK-FCAGLRRRYSHFFMEFLRRQLILKGVTD 442 (560) Q Consensus 372 ~YF~~kLy~aL~VP~sRl~~~~--------~~~l~~~~~~eItRDElkF~K-fi~rLr~rfs~if~d~Lk~qLiLkgiit 442 (560) +|..+.+-++.+||...|+..+ +-++.++++.+..+. |.. -+.-+..++...|...| + + T Consensus 342 ~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~---~~~~tL~P~~~~ie~~ln~~L----~-----~ 409 (547) T protein:vir:63 342 NYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQA---SKNKGLQPLLGFIEDFINKHI----V-----A 409 (547) T ss_pred HHHHHHHHHHhCCCHHHcCcccccccccccccccchhhHHHHHHH---HHHHHHHHHHHHHHHHHHhhc----c-----c Confidence 4466889999999999996422 223344444333222 433 24445544444443332 2 2 Q ss_pred hhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHH-H----------HHHH Q lcl|NC_019530. 443 EKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDE-E----------VKEQ 511 (560) Q Consensus 443 ~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDe-E----------I~e~ 511 (560) . + ...+.|+|.....-.+ .+|..+.+.+. .-++|..-++. .+.|... + +... T Consensus 410 ~--~----~~~~~~~f~~~~~~~~-------~~~~~~~~~~~---~g~lT~NE~R~-~~gl~P~~egGD~~~~~~~~~~~ 472 (547) T protein:vir:63 410 E--F----GDKYTFQFVGGDIKSE-------LESVKILAEKA---KVAMTVNEVRK-ELNLPGDVIGGDIPLNGVIVQRI 472 (547) T ss_pred c--c----CCceEEEeeccccccH-------HHHHHHHHHHh---CCCcCHHHHHH-HhCCCCCCCCCceeecccccccc Confidence 1 1 1235666654433222 22333222221 23578888884 4777541 1 0000 Q ss_pred HHHHHHhhhcCCCCCCCcccccc--ccccccCCCCCCcccccccC--CcccCC Q lcl|NC_019530. 512 QTKIAEEKKKGLYPKVQADETGN--YSGSDVSPLKFKPETIPFSG--STDDSI 560 (560) Q Consensus 512 ~kqIe~E~~~~~~~~p~~~e~g~--~~~~~~~~~~~~p~~~~~~~--~~~~~~ 560 (560) ..+..++..+.- +++....+.. .+..+.++....|.--...| +.|+++ T Consensus 473 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 524 (547) T protein:vir:63 473 GQLMQQEQFEHE-KQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDGQR 524 (547) T ss_pred cccccccCCccc-cchhhccccccccCCCCCCCCCCCCCCcccCCCcCccccc Confidence 011111110000 0000000000 01111122222222222222 223333 No 27 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=98.91 E-value=1.6e-08 Score=63.31 Aligned_cols=470 Identities=13% Similarity=0.068 Sum_probs=231.8 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccch-hhHHHHHHHHHHH-hhccchhHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESEL-LSVKTVVEEYQSM-AQQPEIRKA 92 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~-~~~~~LI~~YR~m-a~~PEvd~A 92 (560) |..+|..+-.-.+...- ....... .. ...+|+-............+.+..+ .+...|..+=|.| .++|=+..| T Consensus 1 mn~~dr~i~~~sP~~~~-~R~~ar~-~~---~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~a 75 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKA-ARLRSRA-VI---QAYEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGV 75 (502) T ss_pred CchHhhHHhhcChHHHH-HHHhhHH-HH---hhccccCcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHH Confidence 33333332222221110 0000000 00 1122221111111111111222222 2456677777888 599999999 Q ss_pred HHHhhcceeeecCCCce-EEEeeecccchhHHHHHHHHHHHHHHHH------HhCCchhhHHHHHHhhhhhhhhhhheec Q lcl|NC_019530. 93 VDIIVNDVVTCEEDETP-VTVNLDKVEGISDTVKESITECFKEVMH------LMDFDNTAYQKIRKWYVDGRQAYHVIVD 165 (560) Q Consensus 93 I~eIvneaiv~d~~~~~-V~l~Ld~~~~~s~sik~~I~eeF~~il~------ll~f~~~~~~~~R~wYvDGr~~~hkiiD 165 (560) |+.+++-+|=.. .-.+ ....-.+. ...+.+.++|..+|+.+++ .++|..--.-.+|.|.+||..|-.++.+ T Consensus 76 v~~~~~nvVG~g-gi~~~~~~~~~~~-~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~ 153 (502) T protein:vir:79 76 FDKLEERVVGKN-GIIVEPHPVLRNG-AIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSG 153 (502) T ss_pred HHHHHHhhccCC-ceeeeeccCCCCh-hHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeec Confidence 999998777332 1000 01112222 2356688999999999986 4566666677999999999999988887 Q ss_pred CCCC-ccCe---eEeEEeCccccchhhhhcccCCccccccccccc--------hheeecCcccccccccccccccccccc Q lcl|NC_019530. 166 PTNK-KGGI---KKLVMLDSRCIRPVYIVEKAMREGGIEAIESVT--------LKYYYNPNYNRNQFTGQSGTSQNFQPS 233 (560) Q Consensus 166 ~~~~-k~GI---~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~--------e~~~Y~p~~~~~~~~g~~~~~~~~~~~ 233 (560) +... +.|. ..|..|||.+|.--+ .++ -.+..|++ .|+++...+ |. ... T Consensus 154 ~~~~~~~g~~~~l~lq~iepd~l~~~~------~~~-~~i~~GVe~d~~Gr~~aY~i~~~hP------gd-------~~~ 213 (502) T protein:vir:79 154 RINSLTPSAGVHFWLEALEPDFIPMTS------DES-NRLNQGVFVDDWGRPEKYLVYKSRP------VS-------GRQ 213 (502) T ss_pred ccCccCCCcccceEEEEecchhcCCCC------CCC-CeeEeeeEECCCCceEEEEEeecCC------CC-------Ccc Confidence 6542 3343 368899998875321 112 23333332 444443211 11 112 Q ss_pred ccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHH Q lcl|NC_019530. 234 QQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMM 313 (560) Q Consensus 234 ~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~i 313 (560) +...++|.+.|.|+.... ...---.+|.|..+++.+.+|.-.+||..+-...-|--=-+..-+.|.-....+ T Consensus 214 ~~~~rvpA~~vlH~f~~~-r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~------- 285 (502) T protein:vir:79 214 METKEVDAERMLHLKFVR-RLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDG------- 285 (502) T ss_pred cceeEechhheEEeeccc-CCccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccccc------- Confidence 345789999999888753 334455679999999999999999999999988888765444444333111000 Q ss_pred HHhhhhheeecccCCcccCcccchhhHhhhccc--c-cCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCcccc Q lcl|NC_019530. 314 MGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLP--R-REGQNATEIATVGGGNQLGEMDH-VNYFREKLYDALMIPKSRL 389 (560) Q Consensus 314 m~k~knk~vYd~~TGev~d~~~~msmlEDywLp--R-ReGgrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl 389 (560) .+...+ ....+| | | + ..-.-|.+|..+.....-+..++ ++...+.+=.+|+||-.-| T Consensus 286 ---------~~~~~~-----~~~~~l-~----pG~i~~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~l 346 (502) T protein:vir:79 286 ---------NGSKEN-----ERELTI-Q----PGIIYDDLKPGEEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSST 346 (502) T ss_pred ---------CCCCCc-----cccccc-c----CCccccccCCCceeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHH Confidence 000001 111111 1 0 0 00122455666555444444333 3444444667899999988 Q ss_pred CCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHhccCCChhHHHHhhhhhc--eeeeeecch Q lcl|NC_019530. 390 QEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLR----RQLILKGVTDEKDWNEKIKPFI--KFEFTSDSY 463 (560) Q Consensus 390 ~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk----~qLiLkgiit~eew~~~i~~~I--~~~f~~Dn~ 463 (560) ..+-+.| - |.+.---+.|-+.+.++|+.|..-|..++- ...+|.|.+..-.|.. -.... .|.-..=-+ T Consensus 347 t~D~s~n--y---Ss~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~-~~~~~~~~W~~p~~~~ 420 (502) T protein:vir:79 347 ARNYNGT--Y---SAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLD-RSSLYTAVYSGPVMPW 420 (502) T ss_pred hccccch--H---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCC-chhhcceeeecCCccc Confidence 7663322 2 233334566999999999988877766533 3566778775322211 11111 222223233 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHH-HHHHHHHHHhhhcCCCCCCCccccccccccccCC Q lcl|NC_019530. 464 IREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEV-KEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSP 542 (560) Q Consensus 464 f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI-~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~ 542 (560) .-.+||+.-...+++ .-.-|.+-+..+ .+..-+|. +|.++..+...+.|+..+..... ..+.++.+. T Consensus 421 iDP~Ke~~a~~~~i~---------~Gl~t~~~~~a~-~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~--~~~~~~~~~ 488 (502) T protein:vir:79 421 IDPVKEAEAWKIQIR---------GGAATESDWVRA-GGRNPDDVKRRRKAEIDENRKLDLVFDTDPAS--DKGGSSAAT 488 (502) T ss_pred cChHHHHHHHHHHHH---------cCCCCHHHHHHH-cCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCC--CCCCCCCCC Confidence 445665554444333 123344434333 23322221 12222222223446533221110 001111111 Q ss_pred CCCCcccccccCCccc Q lcl|NC_019530. 543 LKFKPETIPFSGSTDD 558 (560) Q Consensus 543 ~~~~p~~~~~~~~~~~ 558 (560) ..-.|+-. .+...+ T Consensus 489 ~~~e~~~~--~~~~e~ 502 (502) T protein:vir:79 489 KRQEPQHT--DDQSEE 502 (502) T ss_pred CCCCCCCC--CCCCCC Confidence 11111111 111111 No 28 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=98.80 E-value=3.1e-08 Score=61.75 Aligned_cols=423 Identities=11% Similarity=0.091 Sum_probs=186.0 Q ss_pred ccchhhhccccccccCCCCC----CCC-ceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHHHhhcceee Q lcl|NC_019530. 28 AQKQDELLLTKATVVALDDA----QDG-SIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVT 102 (560) Q Consensus 28 ~~~~~e~~~~k~~s~~~~~~----~dG-~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv 102 (560) -.+-++....+..+-+-... .-| ++.-++... .+... +...+-.+|-..|| .|+-+..||+.++.+|+- T Consensus 1 ~~~~~~a~~~~~~~~a~~~~~~~~~~g~~~~~d~~~~--~~~~~-~~~~~~~~l~~lY~---~~~l~r~iVd~~a~d~~r 74 (461) T protein:vir:80 1 MYSIDKAKQAKIDSKIVNRNDFMVGHGKANSRDKLTR--QTPGN-GQKLDLKACENLYA---SNSIAMNIVDIISEDMVR 74 (461) T ss_pred CccchhhhhhhhhhhhhhhhHHHhhcCCcchhhhhhc--cccCc-ccccCHHHHHHHHH---hCCccchhhccchHHhhc Confidence 11122222222221111100 000 011111010 11111 11224445556665 789999999999888863 Q ss_pred ecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhh-eecCCC---------CccC Q lcl|NC_019530. 103 CEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHV-IVDPTN---------KKGG 172 (560) Q Consensus 103 ~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hk-iiD~~~---------~k~G 172 (560) . .+.+.-+ .+..+++|.++++. |+...+..+.+|.=-+.|.=+.-. +.|.+. .+.+ T Consensus 75 ~-----g~~i~~~-----~~~~~~~~~~~~~~----l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~~pl~~~~ 140 (461) T protein:vir:80 75 A-----GWSLKTD-----NKEMKKNIESKWRK----LKTKDRFQKLYADKRLYGDGFLSIGVVSSNREQADLSTAIDPKT 140 (461) T ss_pred C-----CeeeecC-----CHHHHHHHHHHHHH----hhHHHHHHHHHHhhcccccEEEEEEeecCCccccCccCCccccc Confidence 2 3444322 34456666666554 345667777766655666432211 222211 1233 Q ss_pred eeEeEEeCc---cccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeec Q lcl|NC_019530. 173 IKKLVMLDS---RCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDS 249 (560) Q Consensus 173 I~eLr~LDP---r~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hs 249 (560) ++.+.+|+| ..+... .. ..+..+ -.=+.-++|.+++.....-.. ..........+||++-+++... T Consensus 141 ~~~~~~l~~~~~~~i~~~-~~-~~dp~s---p~fg~P~~y~i~~~~~~~~~~------~~~~~~~~~~~iH~SRii~~~~ 209 (461) T protein:vir:80 141 IKSIPYINTFNTQKVTQL-YL-NQDMFS---EHFGEVEFFEVNRVSQLGEEI------LSGTTASTSEQIHRSRIIHEQG 209 (461) T ss_pred ccceeEEEeccccccchh-hh-cccCcC---cccccceEEEEeccccccccc------cccccCccceEEccccEEEecC Confidence 344444444 333321 11 111101 000111233333211111000 0112234457899998887765 Q ss_pred ccccCCCCcchhhhHHHHHHHHHHHHHHH--HHHHHHHhcCccceEEEccC-CCCchHHHHHHHHHHHHhhhhheeeccc Q lcl|NC_019530. 250 GEEPLANGIVPGLLNPAIRPLNNLVTTED--ATVIYAITRAPEKRAFYLDV-GTLGKKSAEEYMTMMMGKFKNRNAYDRT 326 (560) Q Consensus 250 Gl~~~~~~~i~syL~~AiK~~NqL~m~ED--alVIyRitRAPeRRvFyIDv-Gnlpk~KAeqYl~~im~k~knk~vYd~~ 326 (560) +-++ +.....|.|+++...+..+..... +.++++ +-.+ +|.+|. ..+......+. ...+..++ .. T Consensus 210 ~~~~-~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~---~~~~-v~k~~~l~~~~~~~~~~~-~~~~~~~~------~~ 277 (461) T protein:vir:80 210 LRFE-GETKGRSIFESLYDIITVMDTSLWSVGQILYD---FAFK-VYKTDDIDALNKDDKANL-TAMLDFMF------RT 277 (461) T ss_pred CCCC-ccccCcchHHHHHHHHHHHHHHHHHHHHHHHH---hCCC-ceecchHHhhhchHHHHH-HHHHHHhc------CC Confidence 4332 333457888877766655543332 223333 4333 455542 11111111122 22222222 12 Q ss_pred CCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCCCCcccccccchhhh Q lcl|NC_019530. 327 TGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDH-VNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEI 405 (560) Q Consensus 327 TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eI 405 (560) +|- + + + + ..-+++++- .+|+-++| +..|...+--+.+||+.+|-.++...+. + +. T Consensus 278 ~g~-------~--~----~----d-~~e~~e~~~--~~lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~a-s-ge-- 333 (461) T protein:vir:80 278 EAL-------A--I----I----K-GDEQLTKES--TNVSGMKDLLDYGWDYLAGAVRMPKTVLKGQEAGTLT-G-AQ-- 333 (461) T ss_pred ceE-------E--E----E----c-CCcceEEEe--cCcCCHHHHHHHHHHHHhhhhcCCeeeeecccCCccc-c-ch-- Confidence 221 1 0 0 0 111222222 24555555 4688889999999999998555432222 1 12 Q ss_pred hHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhc----cCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 406 TQEELRFSKFCAGLRRR-YSHFFMEFLRRQLILK----GVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASL 480 (560) Q Consensus 406 tRDElkF~Kfi~rLr~r-fs~if~d~Lk~qLiLk----giit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l 480 (560) -|.-.|..+|+++|.. +..+...+++.-+.-. ..+.++. ..+.|.|..=..-+|...+|++..+.+++ T Consensus 334 -~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~------~~~~i~f~~L~~~s~kekAe~~~~~a~a~ 406 (461) T protein:vir:80 334 -YDVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDS------FEWAIEFNPLWNLDSKTDAEVRKLTAEAD 406 (461) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccc------cceEEEeCCCCCCCHHHHHHHHHHHHHHH Confidence 1333499999999965 4545444444322211 1233331 12455565544556777788888888887 Q ss_pred HHhhhhhcchhhHHHHHHHHhC---CC--------HHHHHHHHHHHHHhhhcCCCCCCCccccccc Q lcl|NC_019530. 481 NTVEPFVGSIFSIDYVMRNVLR---MS--------DEEVKEQQTKIAEEKKKGLYPKVQADETGNY 535 (560) Q Consensus 481 ~~~~p~vGky~S~~~i~k~IL~---~t--------DeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~ 535 (560) +.+-.- -.+|.+-++..... ++ +.|+++.+++.. ++.++|+|+- T Consensus 407 ~~~~~~--g~is~~e~r~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~e~~~g 461 (461) T protein:vir:80 407 QIYIVN--GVLDPDEVKETRFGRFGLENSSKFSGDSAEIDKLAKLVY---------DAYAKKNADG 461 (461) T ss_pred HHHHhc--CCCCHHHHHHHHHHhcCCCCCccCCCCCchhhhhhhhcc---------ccccccCCCC Confidence 765322 26777777654321 11 112222222211 1122222221 No 29 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=98.79 E-value=7.9e-09 Score=65.00 Aligned_cols=418 Identities=14% Similarity=0.112 Sum_probs=185.8 Q ss_pred cccchhhh-cccccc---------ccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHHHh Q lcl|NC_019530. 27 LAQKQDEL-LLTKAT---------VVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVDII 96 (560) Q Consensus 27 ~~~~~~e~-~~~k~~---------s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~eI 96 (560) +..| .+. .+..+. +.+.+-..=|+-. ....++..| ....+-.+|...|| .+.....||+-+ T Consensus 1 ~~~~-~~~~~~~~~~~~~~~~~rd~l~~~~~glg~~r-~~~~~~~g~----~~~~~~~~l~~~Yr---~~~ia~~iVd~~ 71 (449) T protein:vir:10 1 MTDK-LTLAVNHALNDARMARARMGLMVPTMGLDNKR-HSAWCEYGF----PELVTYENLYSLYR---RGGIAHGAVEKL 71 (449) T ss_pred Cchh-hHHHHhhhcchhHHHHHHHHHHHHHhcCCccc-chhhhhcCC----cccCCHHHHHHHHh---cCchhHHHHHhh Confidence 1111 111 000011 1111111111111 111222222 23457778998998 577788889887 Q ss_pred hcceeeecCCCceEE-EeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHH--Hhhhhhhhhhhh-----eecCCC Q lcl|NC_019530. 97 VNDVVTCEEDETPVT-VNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIR--KWYVDGRQAYHV-----IVDPTN 168 (560) Q Consensus 97 vneaiv~d~~~~~V~-l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R--~wYvDGr~~~hk-----iiD~~~ 168 (560) +++|. ......++ .+.+..+ ....++.++.+.|.. .+-.+..+..| +.|-.+-++++. .-.|-+ T Consensus 72 ~d~~~--~~~~~i~~g~~~~~~~-~~~~~e~~~~~l~~~-----~~~~~l~ea~~~~rl~Gga~i~i~v~d~~~l~~Pl~ 143 (449) T protein:vir:10 72 VGKCW--QTNPEIIEGDDADDSE-DETSWEKKSKQVFTN-----RLWRSFAEADRRRLVGRYAGILLHIRDEKDWNLPAT 143 (449) T ss_pred hhhhh--hcCcccccCccccchh-hhHHHHHHHHHHHHH-----HHHHHHHHHHHhhhccCcEEEEEEecCCCCCCcccc Confidence 76551 11111111 1111211 122233333333321 12234444444 234333344321 112334 Q ss_pred CccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeee Q lcl|NC_019530. 169 KKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYID 248 (560) Q Consensus 169 ~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~h 248 (560) +.+||+.|+.++...+.+-. ... +. ..-.+|.| .+|..+++.. .++.+.++||++-+...- T Consensus 144 ~~~~i~~i~v~~~~~i~~~~-~~~-dp----------~sp~yg~P--~~y~v~~~~~-----g~~~~~~~iH~SRl~~~~ 204 (449) T protein:vir:10 144 KGRGLQKVSVSWAGSLKVAE-WDT-GI----------NSKTYGQP--KLWKYTERLP-----NGSSRRVDIHPDRVFILG 204 (449) T ss_pred cCcceeeEEeeccccCChhh-hhc-CC----------CCCCCCCc--eEEEEeeecc-----CCCccceeeccceeEeec Confidence 56678877777765554421 111 11 00111222 1222222211 123345678888644321 Q ss_pred cccccCCCCcchhhhHHHHHHHHHHHHHHHHHH------HHHHhcC----ccceEEEccCCCCchHHHHHHHHHHHHhhh Q lcl|NC_019530. 249 SGEEPLANGIVPGLLNPAIRPLNNLVTTEDATV------IYAITRA----PEKRAFYLDVGTLGKKSAEEYMTMMMGKFK 318 (560) Q Consensus 249 sGl~~~~~~~i~syL~~AiK~~NqL~m~EDalV------IyRitRA----PeRRvFyIDvGnlpk~KAeqYl~~im~k~k 318 (560) .+ ..+ ..|+|+++ ||.|.-+|-+.. .-...|. -++ .+|+.+|...++.- +.++..++. T Consensus 205 ~~--~~~---g~~~L~~~---yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~---~~~~~~l~~~~~~~-~e~~~~~~~ 272 (449) T protein:vir:10 205 DY--SED---AIGFLEPA---YNAFVSLEKVEGGSGESFLKNAARQLNVNFEK---EIDFTNLASLYGVS-IDELQDKFN 272 (449) T ss_pred CC--CCC---ChhHHHHH---HHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhh---hhhhhhhhHHhhCC-chHHHHHHH Confidence 11 111 24788876 455554444321 1111111 122 24555555433211 111111111 Q ss_pred hheee-cccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCCCCccc Q lcl|NC_019530. 319 NRNAY-DRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDH-VNYFREKLYDALMIPKSRLQEEGSIN 396 (560) Q Consensus 319 nk~vY-d~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl~~~~~~~ 396 (560) .++.+ +..++ ..|-+- +-+.+++. .+++-++| +..|...+=-+.+||+.||=..+... T Consensus 273 ~~~~~~~~~~~------~~~i~~------------~~d~~~~~--~~~sgl~d~l~~~~q~iaaa~~IP~t~L~Gqsp~g 332 (449) T protein:vir:10 273 EVAGEINRGND------VLMTTQ------------GATVTPLV--TSVADPTATYNVNLQTAAAGVDIPTRILIGNQQAE 332 (449) T ss_pred HHHHHHhccch------heeecC------------CcceEEEe--cccCChhHHHHHHHHHHHHHhCCCeeeeeccCccc Confidence 11100 01111 011111 11122222 24455566 56788889999999999995554322 Q ss_pred ccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHH Q lcl|NC_019530. 397 IGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDR 476 (560) Q Consensus 397 l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~R 476 (560) +. ++ -|.-.|...|...|.++......++.. |+..++..+. +.+.|.|..=..=+|...+||.... T Consensus 333 ln---st---~D~~nyyd~i~~~Q~~l~p~le~l~~~-l~~s~~g~~~-------~d~~i~f~pL~~~t~kEkAei~k~~ 398 (449) T protein:vir:10 333 RS---ST---EDQKYFNARCQSRRVDLSFEIEDFCDK-LIELKIIDAV-------AKKAVIWDDLNEQTGTEKLTNAKTM 398 (449) T ss_pred cc---cc---hhHHHHHHHHHHHHHhhhHHHHHHHHH-HHHhhcCCCC-------CceeEEeCCCCCCCHHHHHHHHHHH Confidence 21 22 255569999999999988887777764 6677776653 1257778777778898999999999 Q ss_pred HHHHHHhhhhhcc-hhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCccccccc Q lcl|NC_019530. 477 LASLNTVEPFVGS-IFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFS 553 (560) Q Consensus 477 l~~l~~~~p~vGk-y~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~ 553 (560) .++++.+-..++. .||.+-|+ ..+++ |-..+ .+.|+++++.+. ++.=|+| T Consensus 399 A~a~~~~~~ag~~~~~~~~EiR-~~~~~--------------~~~~~-~~~~~e~~de~~-----------~~~d~~a 449 (449) T protein:vir:10 399 GEINQTMLGSGDNPAFSREEIR-TAAGY--------------DNDDE-EPLGEEDGDEED-----------KATDSAA 449 (449) T ss_pred HHHHHHHHHccccCCcCHHHHH-HHhcc--------------cCCCC-CCCCCCCCcccc-----------ccCCcCC Confidence 9998887666443 55555444 23333 21011 111111111111 1111111 No 30 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=98.78 E-value=5.1e-08 Score=60.57 Aligned_cols=464 Identities=11% Similarity=0.098 Sum_probs=203.6 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhccc----------------ccch-hhhccccccccCCCCCCCCceeEeccccceec Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKL----------------AQKQ-DELLLTKATVVALDDAQDGSIILQGGANTYNY 63 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~-~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y 63 (560) |.-+-.-+.|.-- .++..+-++... -.++ .++-.+.+. .+-...-.|...+ ... T Consensus 1 ~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~------~~~ 71 (574) T protein:vir:80 1 MPKWLDKALGIEK--SSIEETRNMENYKMHLREIDTNVVNNEPYSMESIEKGMNGKT-TAYMQPIIGEMSV------NPG 71 (574) T ss_pred Ccchhhhhhccch--hhHHHHHhhhhhccccchhhhhhhhccCCCHHHHHHhHhhhc-ccccchhhhhccc------ccc Confidence 4444333333211 112222222211 1111 111111111 1111111111111 111 Q ss_pred cccccchhhHHHHHHHHHHHhhccchhHHHHHhhcceeee------cCCCceEEEeeecccchhHHHHHHHHHHHHHHHH Q lcl|NC_019530. 64 VGVESELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVTC------EEDETPVTVNLDKVEGISDTVKESITECFKEVMH 137 (560) Q Consensus 64 ~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~------d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ 137 (560) ...-+...+...|-+.-+..+..|-|..||..+++-+..| +-..-|..|.+.+.++ ... .+...+...+.+ T Consensus 72 ~~~~~~~~~~~~~~~~l~~~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~-~~~--~~~~~~~~~l~~ 148 (574) T protein:vir:80 72 YKTKPSIRNSQDLHKTLKKFGNNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEA-EPT--SHDIANIKRIES 148 (574) T ss_pred ccCcCccCCcccHHHHHHhhccChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCC-Ccc--chhhhhhhHHHH Confidence 1112223333344444456677899999998888765443 2333445555544322 111 122223333444 Q ss_pred HhC-----C-------chhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCcccccccccc Q lcl|NC_019530. 138 LMD-----F-------DNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESV 205 (560) Q Consensus 138 ll~-----f-------~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~ 205 (560) +|. + ..=.+.+++.+++.|.-|+.++.|.. .-+.+|++|||.+|..++. ..+ .+.... T Consensus 149 ll~~~~~~~nP~~~s~~ef~~~lv~~lll~Gnayi~i~r~~~---G~~~~L~pl~p~~V~v~~d-----~~~--~~~~~~ 218 (574) T protein:vir:80 149 FLENTAQFRDPNRDNFTTFCKKLVRATYMYDQVNFEKVFDKD---GNFIKFDTVDPTTIFLATN-----GEG--KLIKNG 218 (574) T ss_pred HHhccCCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEECCC---CcEEEEEEEcCceeEEEEc-----Ccc--ccccCc Confidence 442 1 11233455566778999998887653 3589999999999987531 112 111111 Q ss_pred chheeecCccccccccccccccccccccccceecchhceeeeecccc--cCCCCcchhhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 206 TLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEE--PLANGIVPGLLNPAIRPLNNLVTTEDATVIY 283 (560) Q Consensus 206 ~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~--~~~~~~i~syL~~AiK~~NqL~m~EDalVIy 283 (560) .-|+.+.. + +....++.+.|.|..-... ...+++.+|-|+.|+.++.....+++...=+ T Consensus 219 ~~y~~~~~--------g-----------~~~~~~~~~eiih~~~~~~~~~~~~~~G~spi~~a~~~i~~~~~a~~~~~~~ 279 (574) T protein:vir:80 219 ERFVQVID--------N-----------RIVAKFNERELAFAVRNPRADIEVGQYGYPELEIALKQFIAHENTEVFNDRF 279 (574) T ss_pred eEEEEEeC--------C-----------ceEEEEccccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 11222221 1 1233566777777642211 1235567788999999999888888876544 Q ss_pred HHhcCccceEEEccCCC-CchHHHHHHHHHHHHh-hhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCC Q lcl|NC_019530. 284 AITRAPEKRAFYLDVGT-LGKKSAEEYMTMMMGK-FKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGG 361 (560) Q Consensus 284 RitRAPeRRvFyIDvGn-lpk~KAeqYl~~im~k-~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpG 361 (560) =---+--+-|..++.+. |-+..++ =+++-+.. |. | ..+.-+.+ ++++ .|.++.-|. T Consensus 280 f~ng~~p~gil~~~~~~~ls~e~~~-~lk~~~~~~~~---------G-~~n~g~~~-vl~~---------~G~~~~~l~- 337 (574) T protein:vir:80 280 FSHGGTTRGILHVKTGQQQSQQALD-IFRREWRSSLA---------G-INGSWQIP-VVSA---------EDVKFVNMT- 337 (574) T ss_pred HhccCCCceEEEeCCCCCCCHHHHH-HHHHHHHHHhc---------c-ccccccce-eecC---------CCceEEEcc- Confidence 44445567777777654 4443333 23333322 22 1 11111221 2221 245666664 Q ss_pred CCCcchHHH---HHHHHHHHHHhcCCCccccCCCCccccc--------ccchhhhhHHHHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_019530. 362 GNQLGEMDH---VNYFREKLYDALMIPKSRLQEEGSINIG--------GSNLAEITQEELRFSKF-CAGLRRRYSHFFME 429 (560) Q Consensus 362 g~nLgei~D---V~YF~~kLy~aL~VP~sRl~~~~~~~l~--------~~~~~eItRDElkF~Kf-i~rLr~rfs~if~d 429 (560) .+..++.- -+|..+...++.+||...|+..+....+ ++++.+.. +.|..+ +.-+..++...| T Consensus 338 -~s~~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~~---~~f~~~tL~P~~~~ie~~l-- 411 (574) T protein:vir:80 338 -PSANDMQFEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEKM---QASQNKGLQPLLRFIEDTV-- 411 (574) T ss_pred -CChhHHHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHHHHH---HHHHHHHHHHHHHHHHHHH-- Confidence 33444433 4558888999999999999754332222 22222211 124443 333443333333 Q ss_pred HHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHH- Q lcl|NC_019530. 430 FLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEV- 508 (560) Q Consensus 430 ~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI- 508 (560) -+.|+ ++.+ ..+.+.|.....=+ |.+....+.-...-++|.+-++. .|.|.+-+= T Consensus 412 --n~~Ll-----~~~~------~~~~~~f~~~d~~~----------~~~~~~~~~~~~~G~lT~NE~R~-~lgl~Pi~gG 467 (574) T protein:vir:80 412 --NTYIV-----AEFG------EKYQFQFRGGDLSA----------QLDKLKIIEQEGKVFRTVNEIRH-DKGLEPIKGG 467 (574) T ss_pred --Hhhhh-----hhcC------CceEEEecccchhh----------HHHHHHHHHHHhCCccCHHHHHH-HhCCCCCCCC Confidence 33332 2222 23566666553311 22222111112224788888885 477754320 Q ss_pred ---------HHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 509 ---------KEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 509 ---------~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) .....+.++...+.--.....++...... +...-+|+..|..++.|++- T Consensus 468 D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~p~~~~~d~~~ 525 (574) T protein:vir:80 468 DVILNGVHIQAIGQALQEEQLEYQRSQDRLNRLLELSG---GDVEQPEPEEPKDSQNDTDV 525 (574) T ss_pred CEeeeccceeecccccccccCCccchhccccccccccC---CCCCCCCCCCCCCccccccc Confidence 00011111110000000000000111111 11122333455555555555 No 31 >protein:vir:101418 Length: 569 # NCBI annotation: Prt # Family: family:all:9458 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006480;genbank:gi:46401636;genbank:GeneID:2777482 Probab=98.74 E-value=1.8e-08 Score=63.06 Aligned_cols=503 Identities=15% Similarity=0.130 Sum_probs=268.3 Q ss_pred CCc-------ccchhhhhhhhhhhhhhhhhhcccccchhhhccccccc-cCCCCCCCCceeEeccccc-----e-e--cc Q lcl|NC_019530. 1 MAG-------YGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATV-VALDDAQDGSIILQGGANT-----Y-N--YV 64 (560) Q Consensus 1 ~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s-~~~~~~~dG~~~i~~~~~~-----~-~--y~ 64 (560) ||- .-..|.|.|---.-.|+.+-+---|..-.--+.+.+-. ++.+-.. |....+.+.+. + . +. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~p~~~~~~~-~~~~~~~t~~~D~~~~g~~~~~~ 79 (569) T protein:vir:10 1 MADNKITLSSVRKALAGVFKDNGERDNILLSALAVHGGSGYLFSRAGAPVQLSGFL-GGKPGDSGMAGDGLVDGSRFIFD 79 (569) T ss_pred CCcchhHHHHHHHHHhhhhhcCCccchhhhhhheeecCcceEEeecCcchhhhhhh-ccCccccchhhhhHHHHHHHHhh Confidence 332 11222333322222222222222222211111111111 1111111 11111111111 1 1 11 Q ss_pred ccccchhhHHHHHHHHHHHhhccchhHHHHHhhcceeeec-CCCceEEEeee-cccchhHHHHHHHHHHHHH-HHHHhCC Q lcl|NC_019530. 65 GVESELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVTCE-EDETPVTVNLD-KVEGISDTVKESITECFKE-VMHLMDF 141 (560) Q Consensus 65 ~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~d-~~~~~V~l~Ld-~~~~~s~sik~~I~eeF~~-il~ll~f 141 (560) ..+.. ++.+|++-.|-+|+.+|-|..|+-.=|.-|.-.+ .-.+.|.+.=- +.+.-....+++|..|... +..+ + T Consensus 80 ~~~~p-r~R~qiY~~~eeM~~~p~Ia~AlniHVtaALggde~TGd~vfI~p~~~~~~a~~daakai~~el~~dl~~~--i 156 (569) T protein:vir:10 80 EVQLP-EDRLQRYPLLEEMAVYSTIATALNIHITHALSFDKKTGQTFSIVPVHNGNDSDYDAAQALCGELMNDIGRT--I 156 (569) T ss_pred hccCc-hhHHHHHHHHHHHhcCchhhhhhhhhhheeecccccccceEEEEeecCCCCCcchHHHHHHHHHHHHHHHH--H Confidence 22333 7899999999999999999999988787777666 33445544221 1111122344577777766 4455 5 Q ss_pred chhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeE---EeCccccchhhhhcccCCccccccccccchhe-eecCcccc Q lcl|NC_019530. 142 DNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLV---MLDSRCIRPVYIVEKAMREGGIEAIESVTLKY-YYNPNYNR 217 (560) Q Consensus 142 ~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr---~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~-~Y~p~~~~ 217 (560) ++.++.+-|.--+-|.-|= .++ .+...||..|. +-=|--|.+. +..++ ..-|+ .|.++... T Consensus 157 Nr~~~~lA~~~~aFGdsYa-RiY--~~~~~GV~dl~~s~yt~PsfIqpF---E~g~~---------tvGF~~~~~~~~~~ 221 (569) T protein:vir:10 157 NKEVAGWAFIMSVFGVAYV-RPY--AKEGIGITSFECSYYTLPSFIKEF---EVSGN---------LAGFSGDYLKDASG 221 (569) T ss_pred HHHhhHHHHHHHhhhhhhe-eee--ccCCceeEEEEecccccccccchh---hhcCc---------eEEeecccCCcccc Confidence 7888988888888885554 333 44577888774 2223222221 11111 00000 02211110 Q ss_pred cccccccccccccccccc--ceecchh----ceeeeecccc------------c-CCCCcchhhhHHHHHHHHHHHHHHH Q lcl|NC_019530. 218 NQFTGQSGTSQNFQPSQQ--ELVFDDE----SIVYIDSGEE------------P-LANGIVPGLLNPAIRPLNNLVTTED 278 (560) Q Consensus 218 ~~~~g~~~~~~~~~~~~~--~~~i~~d----aI~y~hsGl~------------~-~~~~~i~syL~~AiK~~NqL~m~ED 278 (560) +.. .-.+++ ++|+|.= ++.=.|+|.. . .+.++.-|||+.|-+||-+|..-=- T Consensus 222 ---ti~------~l~p~qm~rmKmPrm~~i~q~~~v~~g~~~~~L~~d~~~~~Pi~psn~GgSFL~~ae~pf~~l~~Al~ 292 (569) T protein:vir:10 222 ---KMV------FADPWAIIPMKIPYWRPKSNLMPVHTGHKAYSLLDNPEERTPIETQNYGTSLLEYAYEPYMNLRSAIR 292 (569) T ss_pred ---cee------eechhhhhhhcccceeeccccchhhhhhhheeecccccccccccchhhhhHHHHHHHhHHHHHHHHHH Confidence 000 000111 1122211 1111222221 1 1445556999999999999999888 Q ss_pred HHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhh---eeecccCCcccCcccchhhHhhhcccccCCCC-cc Q lcl|NC_019530. 279 ATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNR---NAYDRTTGKITGNAHLMGIAEDYWLPRREGQN-AT 354 (560) Q Consensus 279 alVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk---~vYd~~TGev~d~~~~msmlEDywLpRReGgr-gT 354 (560) +|--=|+--|.--|+.=+-.-.|||+++.+|+|.+-.-.|+. +.==+..|+- ..+ .----||-=+.++ +- T Consensus 293 sL~~qri~dSv~~~~Itlnm~gM~p~qr~~y~r~lt~~LKr~~d~ie~a~~gg~~-----~~~-~~~H~LPv~gekq~~~ 366 (569) T protein:vir:10 293 SLKATRFNASKIDRIIGLAMNSLDPVKAADYSRTITQTLKRAADLMERRARGANN-----MPT-VTNTLLPIMGDGKGQM 366 (569) T ss_pred hccchhhHHHHHhHHhhccccCCCHHHHhHHHHHHHHHHHHHHHHHHHHhccCcc-----ccc-cceeeeeeecCccccc Confidence 888889999999999999999999999999999887766652 2211223321 111 0112356656666 34 Q ss_pred ceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCC--cccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 355 EIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQEEG--SINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLR 432 (560) Q Consensus 355 EIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~--~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk 432 (560) -|+|=.+-.++--|+||...-|.|--||++-.|-|+--. +..||-++.= |.-+.=..=-+-||.-.++.|..++- T Consensus 367 tvDt~~~~A~~~gIEdvM~~~R~LagaLGlD~SMlGwAD~LsGGLGeGG~f---rtSaQaa~RS~~iRqa~~e~in~iid 443 (569) T protein:vir:10 367 TIDTQTIQADINGIEDILTYMRQLAAALGLDYTLLGWADQMSGGLGEGGFL---RTAIQAAMRASWIQQGVEEFIQRAID 443 (569) T ss_pred cccccccccCcccHHHHHHHHHHHHhhhccchhHhhHHHHhcccccccHHH---HHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 666666666677899999999999999999999984211 1123322110 11111111124588889999999999 Q ss_pred HHHHhc--cCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHh----hhh-----hc-chhhHHHHHHHH Q lcl|NC_019530. 433 RQLILK--GVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTV----EPF-----VG-SIFSIDYVMRNV 500 (560) Q Consensus 433 ~qLiLk--giit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~----~p~-----vG-ky~S~~~i~k~I 500 (560) ..+-.| ++..++|.. -+++|.+++-=.|-.+.+-..+|++.+.-+ +.. .| .==-..|+.+++ T Consensus 444 iH~~fKYgevf~~~drP------~~V~F~s~~tAl~~E~~~n~~~raN~a~i~~Q~la~l~e~n~Lg~de~~m~y~l~d~ 517 (569) T protein:vir:10 444 IHLAFKYGKVYPEGDRP------YKIEFHSVNTALQQEHNDNRDSQANYATIVTQILDAVSNNSVLANSDAFKRYLFSDV 517 (569) T ss_pred HHhhhhcCcccCCCCcc------eEEEeccchHHHHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccHHHHHHHHHHH Confidence 998888 788888764 588999999999988888888887765433 221 11 101124566677 Q ss_pred hCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcccc------cccCCcccC Q lcl|NC_019530. 501 LRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETI------PFSGSTDDS 559 (560) Q Consensus 501 L~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~------~~~~~~~~~ 559 (560) |.| |+.+. +.+-.|. .++|..||-.-++.. +-+|+-+ -|.-+.|.. T Consensus 518 ~~~-De~~~---e~l~ae~----~akp~DEe~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~ 569 (569) T protein:vir:10 518 LEI-DEKIS---EALVNEL----KAKSEDDDHLMDSII-----KTPPQELAQILESVFKEGNDND 569 (569) T ss_pred hhc-chhHH---HHHHhhc----CCCcchhHHHHHHHh-----cCChHHHHHHHHHHhhccCCCC Confidence 777 43332 2234443 455655543333221 2222211 122222222 No 32 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=98.72 E-value=4.8e-08 Score=60.69 Aligned_cols=456 Identities=11% Similarity=0.067 Sum_probs=199.8 Q ss_pred cCCCCCCCCceeEec--c----------ccceeccccccchhh-----HHHHHHHHHHHhhccchhHHHHHhhcceeeec Q lcl|NC_019530. 42 VALDDAQDGSIILQG--G----------ANTYNYVGVESELLS-----VKTVVEEYQSMAQQPEIRKAVDIIVNDVVTCE 104 (560) Q Consensus 42 ~~~~~~~dG~~~i~~--~----------~~~~~y~~~e~~~~~-----~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~d 104 (560) ++...+.|-...+.. . -...+|.+- ..+.. ..++ +.+| ..++=+.-+|+..++ .++.+ T Consensus 1 ~~~~~~~d~~~~i~~L~~~~~~~~~r~~~~~~Yy~g~-~~i~~~~~~~~~~~-~~~~--~~~n~~~~ivd~~a~-~l~~~ 75 (488) T protein:vir:23 1 MAETESIDPEKLRDQLLDAFENKQNELKSSKAYYDAE-RRPDAIGLAVPLDM-RKYL--AHVGYPRTYVDAIAE-RQELE 75 (488) T ss_pred CCcccCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-cchhhcCcccchhh-hhhh--hhcchHHHHHHHHHH-hhhcc Confidence 555555544322210 0 001111111 11100 0000 0111 112222233333221 11111 Q ss_pred CCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheec----CCCCccCeeEeEEeC Q lcl|NC_019530. 105 EDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVD----PTNKKGGIKKLVMLD 180 (560) Q Consensus 105 ~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD----~~~~k~GI~eLr~LD 180 (560) - ..+........+..-.+...+.+..|++..+|+.+..++.+.+++.|+-|...-.+ +-....|...++.++ T Consensus 76 G----f~~~~~~~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~ 151 (488) T protein:vir:23 76 G----FRIPSANGEEPESGGENDPASELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVPLIRVEP 151 (488) T ss_pred c----eeccCCcccccccccchhHHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcceEEEec Confidence 1 11111110111111223456677788888899999999999999999987732211 112455666788888 Q ss_pred ccccchhhhhcccCCcccccccc-----ccchheeecCccccccccccccccccccccccceecchhceeeeecccccCC Q lcl|NC_019530. 181 SRCIRPVYIVEKAMREGGIEAIE-----SVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLA 255 (560) Q Consensus 181 Pr~i~~vr~i~~~~~~g~~~~~~-----~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~ 255 (560) |+.+=.+..=.+....-++.... .+....+|.|..-+........-......+..--++|. |.|++..-. . T Consensus 152 p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPv--v~f~n~~~~--~ 227 (488) T protein:vir:23 152 PTALYAEVDPRTRKVLYAIRAIYGADGNEIVSATLYLPDTTMTWLRAEGEWEAPTSTPHGLEMVPV--IPISNRTRL--S 227 (488) T ss_pred cceeEEEEecCCCceEEEEEEEEecCCCcEEEEEEEecCcEEEEEecCCceEeccccccCCCCcce--EEecccccc--C Confidence 88776553210000000010000 01112234443222221111000000000000112222 345553221 2 Q ss_pred CCcchhhhHHHHHHH-H-HHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCc Q lcl|NC_019530. 256 NGIVPGLLNPAIRPL-N-NLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGN 333 (560) Q Consensus 256 ~~~i~syL~~AiK~~-N-qL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~ 333 (560) .....|-|.+.++++ . -=+++-+..+.-....-|.|-|.=.+....+.... +.+.+.++..| T Consensus 228 ~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~-----------~~~~~~~~~~~----- 291 (488) T protein:vir:23 228 DLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGINAE-----------TGQRMFDAYMA----- 291 (488) T ss_pred CcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCccccccccc-----------ccchhhhhhhh----- Confidence 223345565554443 1 23455566666666666776554222111111000 00111111122 Q ss_pred ccchhhHhhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHH Q lcl|NC_019530. 334 AHLMGIAEDYWLPRREGQNATEIATVGGGNQLG-EMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRF 412 (560) Q Consensus 334 ~~~msmlEDywLpRReGgrgTEIsTLpGg~nLg-ei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF 412 (560) ..|.- ++|-..++-++++.+ ++ -++-++=.-..++..-++|..-|...+. | .+.+..|.--+..+ T Consensus 292 --------~v~~~--~~g~~~~~~q~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~-n--~~Sg~Al~~~~~~l 357 (488) T protein:vir:23 292 --------RILAF--EGGEGAHAEQFSAAE-LRNFVDALDALDRKAASYSGLPPQYLSSSSD-N--PASAEAIKAAESRL 357 (488) T ss_pred --------hhccC--CCCCCceeEecCCCC-hHHHHHHHHHHHHHHhcccCCCHHHhccccC-c--chHHHHHHHHHHHH Confidence 22322 234445677787643 43 2333444455566678888777754332 1 11234566666677 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCCC-hhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchh Q lcl|NC_019530. 413 SKFCAGLRRRYSHFFMEFLRRQLILKGVTD-EKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIF 491 (560) Q Consensus 413 ~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit-~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~ 491 (560) -.-+.+.++.|..-+...++.-+.+.|... +.+|. .|.+.|..-..= -+.+.++.+..+-.-+...+ T Consensus 358 ~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~-----~i~v~f~~~~~~-------s~~~~ada~~kl~~~g~~~~ 425 (488) T protein:vir:23 358 VKKVERKNKIFGGAWEQAMRLAYKMVKGGDIPTEYY-----RMETVWRDPSTP-------TYAAKADAAAKLFANGAGLI 425 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcchhhc-----cceEEecCCCCC-------CHHHHHHHHHHHHhcccccC Confidence 778888888888888888887776655432 23332 366777422111 12244555555543323578 Q ss_pred hHHHHHHHHhCCCHHHHHHHHHHHHHhhhc--CCCCCCCccccccccccccCCCCCCcccccccC Q lcl|NC_019530. 492 SIDYVMRNVLRMSDEEVKEQQTKIAEEKKK--GLYPKVQADETGNYSGSDVSPLKFKPETIPFSG 554 (560) Q Consensus 492 S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~--~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~ 554 (560) |.++++.. |.+++++++++++..++|... +.+........+.+.+++. +..-.|+..|.+- T Consensus 426 s~et~~~~-l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~e~~~a 488 (488) T protein:vir:23 426 PRERGWVD-MGYTIVEREQMRQWLEQDQKQGLGLIGSLYGASTPEGKPGEA-PVGEPPAPEPDAA 488 (488) T ss_pred CHHHHHHh-CCCCchHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccCCCC-CCCCCCCCCCCCC Confidence 99999966 799999998776554444311 1111111111111111111 1222344444443 No 33 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=98.68 E-value=2.2e-08 Score=62.52 Aligned_cols=439 Identities=10% Similarity=0.046 Sum_probs=192.0 Q ss_pred cccccchhhHHHHHH-------HHHHHhhccchhHHHHHhhcc--------eeeecCCCceE-----EEeeecccchhHH Q lcl|NC_019530. 64 VGVESELLSVKTVVE-------EYQSMAQQPEIRKAVDIIVND--------VVTCEEDETPV-----TVNLDKVEGISDT 123 (560) Q Consensus 64 ~~~e~~~~~~~~LI~-------~YR~ma~~PEvd~AI~eIvne--------aiv~d~~~~~V-----~l~Ld~~~~~s~s 123 (560) ...+... ...|++ +|+.+..|.+=+..|...-.+ -++..--...| .+.++ +|.-+ T Consensus 1 ~~t~~~~--i~~L~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~---g~~~~ 75 (480) T protein:vir:78 1 MTTYHEH--VERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIE---GFRIS 75 (480) T ss_pred CCCHHHH--HHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhhccC---ceecC Confidence 1111110 011111 222222222222111110000 00000000000 00000 01000 Q ss_pred HHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheec--CCCCccCeeEeEEeCccccchhhhh--cccCCcccc Q lcl|NC_019530. 124 VKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVD--PTNKKGGIKKLVMLDSRCIRPVYIV--EKAMREGGI 199 (560) Q Consensus 124 ik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD--~~~~k~GI~eLr~LDPr~i~~vr~i--~~~~~~g~~ 199 (560) --+...+.+..|++..+|+.+..++++.-.+.|+-|.+.-.. .....+|...++.++|..+-++..- ..... ..+ T Consensus 76 ~d~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~i~~~~p~~~~~~~D~~~~~~~~-~~i 154 (480) T protein:vir:78 76 EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVT-RAV 154 (480) T ss_pred CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeEEEEEcccceEEEEcCCCccceE-EEE Confidence 011334556677777889999999999999999987643211 1113568888999999888766421 11110 011 Q ss_pred cccc------ccchheeecCccccccc-cccccccccccccc--c-ceecchhceeeeecccccCCCCcchhhhHHHHHH Q lcl|NC_019530. 200 EAIE------SVTLKYYYNPNYNRNQF-TGQSGTSQNFQPSQ--Q-ELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRP 269 (560) Q Consensus 200 ~~~~------~~~e~~~Y~p~~~~~~~-~g~~~~~~~~~~~~--~-~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~ 269 (560) .... .+...-+|.|..-++.. .+........-... . .-++|. |.|++---++.+ ...|=|.+.+++ T Consensus 155 ~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv--v~f~n~~~~~~~--~G~s~i~~~v~~ 230 (480) T protein:vir:78 155 RLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRLGNR--YGRSEISPELRK 230 (480) T ss_pred EEEEeecCCCceEEEEEEeCCeEEEEEecCCCccccccccccccCCCCCcce--EEeecccccCCc--cCcccchhhHHH Confidence 1110 11112234443222111 01100000000000 0 001111 123321111111 222334433322 Q ss_pred H-HH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHh-hhccc Q lcl|NC_019530. 270 L-NN-LVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAE-DYWLP 346 (560) Q Consensus 270 ~-Nq-L~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlE-DywLp 346 (560) + .. =+++-+.+++-...-.|.|-|.=.+....+..+. + .++-.+.. ..|++ T Consensus 231 l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~---------------------~-----~~~~~~~~~~~~~~ 284 (480) T protein:vir:78 231 VTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGE---------------------N-----TTLDIYYGRILTLA 284 (480) T ss_pred HHHHHHHHHHHHHHHHHhhcchhhhhhcCCccccccccc---------------------c-----chhhhhhhhhccCC Confidence 1 11 1355566777777777777664222222111100 0 00111111 12332 Q ss_pred ccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 347 RREGQNATEIATVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSH 425 (560) Q Consensus 347 RReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~ 425 (560) |..+++.++++. ++.- ++-++-....++..-++|..=|+..+. | -+.+..|.--+.....-+.++|+.|.. T Consensus 285 ----~~~~~~~~~~~~-~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~-n--~~Sg~Alk~~~~~l~~ka~~~~~~f~~ 356 (480) T protein:vir:78 285 ----SEAAKISEFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSE-N--PASAEAIIATDSRIVKMAERKGRIFGG 356 (480) T ss_pred ----CCCceEEecCcc-CHHHHHHHHHHHHHHHhcccCCChHHhccccC-c--chHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 334678888864 3433 444666667777778888877765432 1 112344555566677788999999999 Q ss_pred HHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCH Q lcl|NC_019530. 426 FFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSD 505 (560) Q Consensus 426 if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tD 505 (560) -+...++.-+.+.|.-...+|. .|.+.|. |+.-. -+.+.++.+.++-.-++-.+|.+++... |.+++ T Consensus 357 ~l~~~~~l~~~~~g~~~~~~~~-----~i~v~f~-~~~~~------s~~~~ad~~~kl~~~g~~~~s~et~~~~-lg~~~ 423 (480) T protein:vir:78 357 AWERAMRIAMQIMGREVTEEYT-----RLETVWR-DPSTP------TVAAKADAVSKLYANGQGPIPKEQARID-LGYTA 423 (480) T ss_pred HHHHHHHHHHHHcCCCccccce-----eeeEEec-CCCCC------CHHHHHHHHHHHHHhccccCCHHHHHhc-CCCCH Confidence 9999999877777764455553 2556663 21111 1123455555554433456899999965 89999 Q ss_pred HHHHHHHHHHHHhhhcCCCCCCCc-cccccccccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 506 EEVKEQQTKIAEEKKKGLYPKVQA-DETGNYSGSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 506 eEI~e~~kqIe~E~~~~~~~~p~~-~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) ++++++++..+++..+.+-..+.. +..++..+..+.+. ..++.-.+.|+.-.+- T Consensus 424 d~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 478 (480) T protein:vir:78 424 TQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTE-TKTETQTSPSGFNRTK 478 (480) T ss_pred hHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCCCCCC-CCCccccccCCCCccc Confidence 999988765444432221111111 11222222222211 1222222222222222 No 34 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=98.63 E-value=8.1e-08 Score=59.47 Aligned_cols=448 Identities=13% Similarity=0.078 Sum_probs=192.8 Q ss_pred hhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH-------- Q lcl|NC_019530. 23 DTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD-------- 94 (560) Q Consensus 23 ~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~-------- 94 (560) -+..+ ++-.-...--+.-|+.+. .+....-. -..+......+. .+|+.+..+++-+..+- T Consensus 1 ~~~~~----~~~~~~~~~~~~~p~~~~-~~~~~~~l----~~~l~~~~~~~~---~rl~~l~~YY~G~~~~~~~~~~~~~ 68 (501) T protein:vir:25 1 MTVPV----DVIADAPAADVEFPEDSM-SREQLGAL----VADMWRLHISER---QWLDRIYEYTKGLRGRPEVPEGASD 68 (501) T ss_pred Ccccc----hhhhccCcccccCCcccC-ChHHHHHH----HHHHHHHHHHHH---HHHHHHHHHHhcCCCchhccccCCh Confidence 01000 100000000111111110 00000000 000000000011 12222222222111110 Q ss_pred -------Hhh-c------ceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhh Q lcl|NC_019530. 95 -------IIV-N------DVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAY 160 (560) Q Consensus 95 -------eIv-n------eaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~ 160 (560) .+| | ++.+..-. +..+.+.+.+ . .+....|.+..+|+...+++++.-++.||-|. T Consensus 69 ~~~~~~~~~v~n~~~~ivd~~a~~l~--~~gf~~~d~~--~-------~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~ 137 (501) T protein:vir:25 69 EVKELAKLSVKNVLSLVRDSFAQNLS--VVGYRNALAK--E-------NDPAWEMWQRNRMDARQAEVHRPALTYGASYV 137 (501) T ss_pred hhhhhHhhhhcChHHHHHHHHHhhhc--ccceecCCcc--c-------hHHHHHHHHhcChhHHHHHHHHHHhhcCceEE Confidence 011 1 01000000 1112222221 1 12345677788899999999999999999887 Q ss_pred hheecCCCCccCeeEeEEeCccccchhhhhcccCC--ccccccc------cccchheeecCcccccccccc----c-ccc Q lcl|NC_019530. 161 HVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMR--EGGIEAI------ESVTLKYYYNPNYNRNQFTGQ----S-GTS 227 (560) Q Consensus 161 hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~--~g~~~~~------~~~~e~~~Y~p~~~~~~~~g~----~-~~~ 227 (560) +.-.|. + | ..++.++|+.+-.|-.=....+ .-.+... ......-+|+|..-+...... . ... T Consensus 138 ~v~~de--~--~-~~i~~~sp~~~~~iy~D~~~~~~~~~ai~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~ 212 (501) T protein:vir:25 138 TVTPTD--E--G-PVFRTRSPRQILAVYADPSVDAWPQYALETWVAQKDAKPHRRGVLYDDTYMYELDLGEVVLGDAGGG 212 (501) T ss_pred EEecCC--C--C-CeEEEeccccEEEEEecCCCCcceeEEEEEEeeccccCcceeEEEecCeeEEEEecCceeeeecccc Confidence 544432 2 3 3577888877754410000000 0000000 000011123321111000000 0 000 Q ss_pred ccccccccce-ecc----------hhc---eeeeecccccCCCCcchhhhHHHHHHHHHH-HHHHHHHHHHHHhcCccce Q lcl|NC_019530. 228 QNFQPSQQEL-VFD----------DES---IVYIDSGEEPLANGIVPGLLNPAIRPLNNL-VTTEDATVIYAITRAPEKR 292 (560) Q Consensus 228 ~~~~~~~~~~-~i~----------~da---I~y~hsGl~~~~~~~i~syL~~AiK~~NqL-~m~EDalVIyRitRAPeRR 292 (560) .......... ..+ -.. |.|+..- ...+...|-++..+...+-+ +++-+.+++-...-.|.|- T Consensus 213 ~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~---~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~ 289 (501) T protein:vir:25 213 QATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGR---DADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRV 289 (501) T ss_pred ccccccccccccccccccccccCCccceeeEeccCcc---ccCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHH Confidence 0000000000 000 001 1222211 01233445444433222221 3555777887888888877 Q ss_pred EEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHH Q lcl|NC_019530. 293 AFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVN 372 (560) Q Consensus 293 vFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~ 372 (560) +.=.+....+..++ ..| ..|+. +| -+++|.++|+.+-=+-++-++ T Consensus 290 i~G~~~~~~~~~~~-------------------~~~-------------~i~~~--~~-~~~~~~q~~~~~~~~~~~~l~ 334 (501) T protein:vir:25 290 ISGWTGSKAEVLKA-------------------SAL-------------RVWTF--ED-PEVKAQAFPPASVEPYNLILE 334 (501) T ss_pred HhCCCCCccchhhh-------------------ccc-------------ceecc--CC-CCceEEEecccChHHHHHHHH Confidence 76444333221110 111 12332 12 235677888743222334455 Q ss_pred HHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhh Q lcl|NC_019530. 373 YFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKP 452 (560) Q Consensus 373 YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~ 452 (560) -.-..+...-++|..-|+..++ |. | +..|.--+....+-+.+.|+.|..-+...++.-+.++|.....+|. T Consensus 335 ~~i~~i~~~s~~P~~~~~~~~~-N~--S-g~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~~~~~~~~~----- 405 (501) T protein:vir:25 335 EMLQHVAMVAQISPAQVTGKMI-NV--S-AEALAAAEANQQRKLAAKRESFGESWEQLLRLAAEMDDDPDTAADS----- 405 (501) T ss_pred HHHHHHHhhcCCChhhhccccC-Ch--H-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccce----- Confidence 5555666677889776663322 22 2 4556666777888899999999999999999988888864433331 Q ss_pred hceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccc- Q lcl|NC_019530. 453 FIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADE- 531 (560) Q Consensus 453 ~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e- 531 (560) .|.+.| .++.- .-+.+.++++..+..- | +|.+++...++.+|++||+++.++.++|...+....+...+ T Consensus 406 ~i~v~w-~~~~~------~s~~~~ada~~kl~~~-g--is~et~~~~~~g~~~~~ie~~~~~~~e~~~~~~~~~~~~~~~ 475 (501) T protein:vir:25 406 GAEVLW-RDTEA------RSFGAVVDGITKLASA-G--IPIEHLLSMVPGMTQQTIQAIKDSLRGGEVKSLVDKLLSNEP 475 (501) T ss_pred eeeEEe-cCCCC------CCHHHHHHHHHHHHhc-C--CCHHHHHHHcCCCCHHHHHHHHHHHHHHhHHHHHHHhhccCc Confidence 256666 33321 1223556666665543 3 79999999999999999998888887776554443332211 Q ss_pred -cccccccccC-CCCCCcccccccCC Q lcl|NC_019530. 532 -TGNYSGSDVS-PLKFKPETIPFSGS 555 (560) Q Consensus 532 -~g~~~~~~~~-~~~~~p~~~~~~~~ 555 (560) ++.+.++... ...-.-+..|-+|| T Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~g~ 501 (501) T protein:vir:25 476 APVPPPPPQAAAQALNEGGVNGNGGA 501 (501) T ss_pred CCCCCCCCCCCccccccccCCCCCCC Confidence 1111111111 01111223344444 No 35 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=98.58 E-value=2.4e-07 Score=56.89 Aligned_cols=455 Identities=13% Similarity=0.108 Sum_probs=219.3 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceec-cccccchhhHHHHHHH Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNY-VGVESELLSVKTVVEE 79 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y-~~~e~~~~~~~~LI~~ 79 (560) |-=.=+|++.+-..... + ..--..+|+-.......-..+ .+.+. ..+...|..+ T Consensus 1 m~~~~~~~~a~~~~~~~--------------------~----~~~~~y~aa~~~~~~~~~~~~s~d~~~-~~~~~~lr~R 55 (495) T protein:vir:10 1 MNMTPSGYQSLASGLLV--------------------P----VGASAYEGASGGHRWQDIGDYGPDTAV-ASGIQTLRAR 55 (495) T ss_pred CCcccccccccchhhhh--------------------H----HHhhhhhccccCcccCCCCCCChhHHH-HHHHHHHHHH Confidence 11111111110000000 0 000012222111111110111 11111 2345667888 Q ss_pred HHHH-hhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHH------HhCCchhhHHHHHHh Q lcl|NC_019530. 80 YQSM-AQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMH------LMDFDNTAYQKIRKW 152 (560) Q Consensus 80 YR~m-a~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~------ll~f~~~~~~~~R~w 152 (560) =|.| .++|=+..||+-+++-+|=. .+.+.... -.+.+.++|..+|+.+++ .++|.---...+|.| T Consensus 56 aRdl~rNn~~a~~av~~~~~~vVG~-----Gi~p~~~~---~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~ 127 (495) T protein:vir:10 56 SHHNVRNNPWATNAVATWVAAAVGN-----GLTPRWRM---KEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTV 127 (495) T ss_pred HHHHHhcChHHHHHHHHHHHhhcCC-----CcccccCC---chHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHH Confidence 8888 78999999999999887643 23333322 246789999999999997 355655567789999 Q ss_pred hhhhhhhhhheecCCCCccCe-eEeEEeCccccchhhhhcccCCccccccccccc--------hheeecCcccccccccc Q lcl|NC_019530. 153 YVDGRQAYHVIVDPTNKKGGI-KKLVMLDSRCIRPVYIVEKAMREGGIEAIESVT--------LKYYYNPNYNRNQFTGQ 223 (560) Q Consensus 153 YvDGr~~~hkiiD~~~~k~GI-~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~--------e~~~Y~p~~~~~~~~g~ 223 (560) .+||..|--+..++..+.... ..|..|||.+|.--.. .....++..+..+++ .|.++... .|- T Consensus 128 ~~dGE~f~~~~~~~~~~g~~~~~~lqliepd~l~~~~~--~~~~~~g~~i~~GIe~d~~Gr~vaY~i~~~h------pgd 199 (495) T protein:vir:10 128 INSGEAFVIKKPRPLSEGLSVPLQLQIIEPDMLASDIP--DETLPSGGYVKGGIRFSNGGKRKAYCFYRNH------PAE 199 (495) T ss_pred HhCCceEEEEeecccCCCCccceEEEEechhhcCCCCC--CCCCCCCCEEEeceEECCCCceEEEEEeecC------CCc Confidence 999999876666554322111 4789999988753210 111122223333322 44444321 111 Q ss_pred ccccccccccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCch Q lcl|NC_019530. 224 SGTSQNFQPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGK 303 (560) Q Consensus 224 ~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk 303 (560) ...........++|.+.|.|.. -. ...---.+|.|+..+ .+++|.-.+||..+-...-|-. +.+|-..+ |. T Consensus 200 ---~~~~~~~~~~~rvpA~~vlH~f-~~-r~gQ~RGis~la~i~-~l~~l~~y~dael~~a~i~A~~--~~fi~~~~-~~ 270 (495) T protein:vir:10 200 ---SSLIGDPVDTVWIKAEHVLHVT-VL-TVRSDAGAPWFQLLL-RLNELDQYEDAELVRKKTAALF--AAFIQEAT-AD 270 (495) T ss_pred ---ccccccccceeeechhheEecc-cc-CCCcccCcchhHHHH-HHHHhhHHHHHHHHHHHHhhhh--eeeeecCC-Cc Confidence 1111223345678988888775 32 333333458998655 5899999999999999888855 33332211 11 Q ss_pred HHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhc Q lcl|NC_019530. 304 KSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDH-VNYFREKLYDAL 382 (560) Q Consensus 304 ~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL 382 (560) .-+.+-.. ..++-.+.....+ ++.==++. =.-|.+|+.+.....-+..++ ++...+.+=.+| T Consensus 271 ~~~~~~~~--------------~~~~~~~~~~~~~-l~pG~i~~--L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaagl 333 (495) T protein:vir:10 271 STGGPTIG--------------QPKRSKGGKRITG-LNPGTLQY--LQPGQEVKFSNPADVGTTYEPWLRYQLLSIAKGY 333 (495) T ss_pred cccccccC--------------ccccccCccccee-cCCceeee--cCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhc Confidence 11100000 0011111111111 10000000 013455655555444444444 566666777889 Q ss_pred CCCccccCCCC-cccccccchhhhhHHHHHHHHHHHHHHHH-HHHHHHHH-----HHHHHHhccCCC-hhHHHHhhhhhc Q lcl|NC_019530. 383 MIPKSRLQEEG-SINIGGSNLAEITQEELRFSKFCAGLRRR-YSHFFMEF-----LRRQLILKGVTD-EKDWNEKIKPFI 454 (560) Q Consensus 383 ~VP~sRl~~~~-~~~l~~~~~~eItRDElkF~Kfi~rLr~r-fs~if~d~-----Lk~qLiLkgiit-~eew~~~i~~~I 454 (560) +||-+-|..+- +.|. |-+.-.-+.|-+.+.++|.+ |..-|..+ |+ ..+|.|.++ ++.|+. ..... T Consensus 334 Gi~Ye~ltgD~s~~nY-----SS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~-~a~l~G~i~~p~~~~~-~~~~~ 406 (495) T protein:vir:10 334 GITYEMLTGDLRGVNY-----SSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMD-FAVASGAVVIPDYLQR-RRYYN 406 (495) T ss_pred CCCHHHHhcccccccH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHcCCCCCCCchhh-hHhhh Confidence 99999886553 2221 22323455799999999875 44444443 33 455677654 443332 11112 Q ss_pred --eeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhh----hcCCCCCCC Q lcl|NC_019530. 455 --KFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEK----KKGLYPKVQ 528 (560) Q Consensus 455 --~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~----~~~~~~~p~ 528 (560) .|..-.=-+.-.+||+.-...+++ +-.-|.+-+..+ .+.. .++..+||++|. +.|+.-+. T Consensus 407 ~~~w~~p~~~~vDP~Ke~~A~~~~i~---------~G~~s~~~~~a~-~G~D---~~~v~~q~a~e~~~~~~~Gl~~~~- 472 (495) T protein:vir:10 407 RVSWRTPRWEEVDPLKKHLADLGDVR---------AGFAPISDKQAE-RGYD---MEELFDMISDANQLIDEYDLRLDS- 472 (495) T ss_pred ccccccCCccccChHHHHHHHHHHHH---------cCCCCHHHHHHH-cCCC---HHHHHHHHHHHHHHHHHcCCCCCC- Confidence 232333334556676665554443 233455555444 3333 333333333333 34543221 Q ss_pred ccccccccccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 529 ADETGNYSGSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 529 ~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) .|..++.+|++.++- T Consensus 473 -----------------~p~~~~~~~~~~~~~ 487 (495) T protein:vir:10 473 -----------------DPRYVNGSGAEQKSV 487 (495) T ss_pred -----------------CCCcCCCccCCCCCC Confidence 111111111111111 No 36 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=98.52 E-value=3.7e-07 Score=55.82 Aligned_cols=460 Identities=11% Similarity=0.106 Sum_probs=202.1 Q ss_pred hhhhhhhhhhhh-------hhhhhcc-----------cccchhhhcccccc-ccCCCCCCCCceeEeccccceecccccc Q lcl|NC_019530. 8 FFGLFGGGGLVN-------AKVDTDK-----------LAQKQDELLLTKAT-VVALDDAQDGSIILQGGANTYNYVGVES 68 (560) Q Consensus 8 ~~~~~~~~~~~~-------~~~~~~~-----------~~~~~~e~~~~k~~-s~~~~~~~dG~~~i~~~~~~~~y~~~e~ 68 (560) .-.-.|.++++- +++...+ +.....+|-+++.. ....|- .|...+. .....-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~--~~~~~~~------~~~~~r~ 72 (551) T protein:vir:80 1 MKNKLGLFESIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPV--IGSMSAN------PGFKTKP 72 (551) T ss_pred CchhhhhHHHhhhccCChhhcccccccccceeeecccccHHHHHHhhccCcceeeccc--ccceecC------cccccCc Confidence 111111112111 2222221 11111222222111 111111 1222221 1111223 Q ss_pred chhhHHHHHHHHHHHhhccchhHHHHHhhcceeeec------CCCceEEEeeeccc-chhHHHHHHHHHHHHHHHHHhCC Q lcl|NC_019530. 69 ELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVTCE------EDETPVTVNLDKVE-GISDTVKESITECFKEVMHLMDF 141 (560) Q Consensus 69 ~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~d------~~~~~V~l~Ld~~~-~~s~sik~~I~eeF~~il~ll~f 141 (560) ...+..+|-+..+.++.+|-|..||..|.+.+..+. .+.-+-.+.+.+.+ ..+...+..+ +....+++-.+. T Consensus 73 ~~~~~~~l~~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~-~~i~~~l~~pn~ 151 (551) T protein:vir:80 73 SIRNNQDLHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATI-KRIESFIEKTGV 151 (551) T ss_pred cccChhHHHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHH-HHHHHHHHhcCC Confidence 344556676667777889999999988888654322 12223334443321 1111111111 112222322222 Q ss_pred c-----hhhHHHHHHh----hhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeec Q lcl|NC_019530. 142 D-----NTAYQKIRKW----YVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYN 212 (560) Q Consensus 142 ~-----~~~~~~~R~w----YvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~ 212 (560) . ....++++.| ++.|.-|+.++.|.. .-+.+|++|||.+|..+. +.+|. ......-|+++. T Consensus 152 ~~~p~~~s~~~f~~~lv~dlll~Gnay~~i~rd~~---G~~~~L~~l~p~~V~v~~-----~~~g~--~~~~~~~y~~~~ 221 (551) T protein:vir:80 152 DNDINRDSFSSFVKKIVRDTYMYDQVNFEKVFNRN---QSMVRFVAKDPTTIFFAT-----TADGK--IPDNGNRFVQVI 221 (551) T ss_pred CCCCccchHHHHHHHHHHHHHhcCCEEEEEEECCC---CcEEEEEEeCCceeEEEE-----CCccc--cccCceEEEEEe Confidence 2 2444555555 466988988887653 348999999999988752 22231 111111111221 Q ss_pred Cccccccccccccccccccccccceecchhceeeeecc-ccc-CCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCcc Q lcl|NC_019530. 213 PNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSG-EEP-LANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPE 290 (560) Q Consensus 213 p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsG-l~~-~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPe 290 (560) + .+..+.++.+.|.|.+-. +.+ ..+++.+|-|+.|+..+.....++....=+----+-- T Consensus 222 ~-------------------g~~~~~~~~~eiiH~~~n~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p 282 (551) T protein:vir:80 222 D-------------------QKIVATFNAREMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTT 282 (551) T ss_pred C-------------------CcEEEEEcccceEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCc Confidence 1 122446777778777522 111 2345677889999999999888888765333333445 Q ss_pred ceEEEccC-CCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHH Q lcl|NC_019530. 291 KRAFYLDV-GTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMD 369 (560) Q Consensus 291 RRvFyIDv-Gnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~ 369 (560) +-|..+.. ++|.+..+++.-+.+...|. | ..+.-+.+ ++. +.|.++..|. .+..++. T Consensus 283 ~giL~~~~~~~lt~e~~~~lk~~~~~~~~---------G-~~nag~~~-vl~---------~~g~~~~~l~--~~~~D~q 340 (551) T protein:vir:80 283 RGILQIKAAQQQSQHALEIFKREWKNSLS---------G-INGSWQIP-VVS---------AEDVKFVNMT--PSARDME 340 (551) T ss_pred ceEEEEcCCCCCCHHHHHHHHHHHHHHhc---------C-ccccCccc-ccc---------CCCceEEEcc--CChhHHH Confidence 55555554 34555444444443333332 2 11111111 221 1234555553 3444433 Q ss_pred H---HHHHHHHHHHhcCCCccccCCCC--------cccccccchhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019530. 370 H---VNYFREKLYDALMIPKSRLQEEG--------SINIGGSNLAEITQEELRFSKF-CAGLRRRYSHFFMEFLRRQLIL 437 (560) Q Consensus 370 D---V~YF~~kLy~aL~VP~sRl~~~~--------~~~l~~~~~~eItRDElkF~Kf-i~rLr~rfs~if~d~Lk~qLiL 437 (560) - .+|..+..-++.+||...|+-.+ +-++.++++.+..+ .|... +.-+..++...|...| T Consensus 341 fle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~---~f~~~tL~P~~~~ie~~ln~~L------ 411 (551) T protein:vir:80 341 FEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQ---ASKNKGLQPLLGFIEDFINKHI------ 411 (551) T ss_pred HHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccchhhHHHHHH---HHHHHHHHHHHHHHHHHHHhhh------ Confidence 3 45577889999999999996422 12234444433222 24433 5555555544444332 Q ss_pred ccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCH-HHH-------- Q lcl|NC_019530. 438 KGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSD-EEV-------- 508 (560) Q Consensus 438 kgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tD-eEI-------- 508 (560) +++ + ...+.|+|.....-.+ .+|..+.+.+. .-++|..-+++ .+.+.. .|- T Consensus 412 ---~~~--~----~~~~~f~f~~~~~~~~-------~~~~~~~~~~~---~g~lT~NE~R~-~~gl~P~~egGD~~~~~~ 471 (551) T protein:vir:80 412 ---VAE--F----GDKYTFQFVGGDIKSE-------LESVKILAEKA---KVAMTVNEVRK-ELNLPGDVIGGDIPLNGV 471 (551) T ss_pred ---ccc--c----CCceEEEeeccChhhH-------HHHHHHHHHHh---cCCcCHHHHHH-HhCCCCCCCCCceeeccc Confidence 221 1 2346777765543222 23333333322 23678888884 477754 110 Q ss_pred --HHHH-----HHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccccCCcc--cCC Q lcl|NC_019530. 509 --KEQQ-----TKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTD--DSI 560 (560) Q Consensus 509 --~e~~-----kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~--~~~ 560 (560) .-.. ++.+.|.......++... .+..+..+.+..|.--+..|..+ +++ T Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 528 (551) T protein:vir:80 472 IVQRIGQLMQQEQFEHEKQQSNLQMLQEQ----TGNRVSTDVEDIPDGKDTTGDIGKDGQR 528 (551) T ss_pred ccccccccccccCcchhhhhhccccccCc----CCCCCCCCCCCCCCccccCCCccccccc Confidence 0000 000000000000000000 11111122222333333333222 222 No 37 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=98.51 E-value=3.9e-07 Score=55.71 Aligned_cols=452 Identities=10% Similarity=0.002 Sum_probs=179.4 Q ss_pred hccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH------------------- Q lcl|NC_019530. 34 LLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD------------------- 94 (560) Q Consensus 34 ~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~------------------- 94 (560) -..+....+..+...+... ...+-... ......+|+.+..+.+=+..|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~----------~~~~i~~~--~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~ 68 (489) T protein:vir:99 1 MLQEDFEAIDYESKLWIDQ----------LKNYISRF--KAEQLERLKELKRYYLGDNNIKYRPAKTDKYAADNRIASDF 68 (489) T ss_pred CCccceeeeCCCCCCCHHH----------HHHHHHHH--HHHHHHHHHHHHHHhcccCccccccccccccCCcceeecch Confidence 0000000000000000000 00000000 1122334444444443222111 Q ss_pred --HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccC Q lcl|NC_019530. 95 --IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGG 172 (560) Q Consensus 95 --eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~G 172 (560) -||+ ..+.---..||++..++. .+.+....+.+..+|+....++.+.+++.|+-|....+.+....+| T Consensus 69 ~~~iv~-~~~~~l~g~~~~~~~~d~---------~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~ 138 (489) T protein:vir:99 69 AKYITV-FEQGYMLGVPVEYKNENK---------DLQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKT 138 (489) T ss_pred HHHHHH-HHhhhhccCCceeecCCh---------hHHHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCc Confidence 1111 011111234444444332 2455666777778999999999999999999998777776666778 Q ss_pred eeEeEEeCccccchhhhhcccC-Ccccccccc-------ccchheeecCccccccccccccccc--cc-cccccceecch Q lcl|NC_019530. 173 IKKLVMLDSRCIRPVYIVEKAM-REGGIEAIE-------SVTLKYYYNPNYNRNQFTGQSGTSQ--NF-QPSQQELVFDD 241 (560) Q Consensus 173 I~eLr~LDPr~i~~vr~i~~~~-~~g~~~~~~-------~~~e~~~Y~p~~~~~~~~g~~~~~~--~~-~~~~~~~~i~~ 241 (560) -..+..+||+++-++..-.... ..-.+.... .+.-..+|.|..-+........... .. .-+...-++|. T Consensus 139 ~~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~vPv 218 (489) T protein:vir:99 139 EVKLYQLPAEQTFVIYDDTYQRNSLMAVHFYDIDYGSGKRKQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFFKGVPV 218 (489) T ss_pred ceEEEEEcccceEEEEcCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEecCCCcccceecccccccCCceeE Confidence 8899999999987764211110 001111110 1122334655433221111000000 00 00000001111 Q ss_pred hceeeeecccccCCCCcchhhhHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhh Q lcl|NC_019530. 242 ESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVT-TEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNR 320 (560) Q Consensus 242 daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m-~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk 320 (560) |.|. ++....|-++..+.....+.. +-+....-+..+.|-+-+.=..... ...-++....+ T Consensus 219 --v~~~-------n~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~---~~~~~~~~~~~------ 280 (489) T protein:vir:99 219 --NEYA-------NNEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTG---ADENDYLDDGR------ 280 (489) T ss_pred --EEee-------cCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCccc---ccchhhhhhcc------ Confidence 1111 222333455554444443322 2232222333444444443222111 11111111110 Q ss_pred eeecccCC-cccCcccchhhHhhhcccccCC--CCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCccc Q lcl|NC_019530. 321 NAYDRTTG-KITGNAHLMGIAEDYWLPRREG--QNATEIATVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGSIN 396 (560) Q Consensus 321 ~vYd~~TG-ev~d~~~~msmlEDywLpRReG--grgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~ 396 (560) ..-+...+ .+......+ +++..... +.+..+.-|.-..+.+. ..-+.-+.+.+|+-.++|- +..++ +. T Consensus 281 ~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~--~~~~~-~~ 352 (489) T protein:vir:99 281 LNPNGRLAISIGFKKAQV-----LILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPD--TQDMK-FS 352 (489) T ss_pred ccccccccccccccccee-----eeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCcc--ccccc-cc Confidence 00000001 000000111 11111111 11223443433233222 2234566778888889983 22221 11 Q ss_pred ccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHH Q lcl|NC_019530. 397 IGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDR 476 (560) Q Consensus 397 l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~R 476 (560) |...+..|..-+..-..-+.+-|+.|...+.++++.-+-+-++.....|...-...|.+.|...---.+. +- T Consensus 353 -~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~-------~~ 424 (489) T protein:vir:99 353 -GVQSGESMKYKLMASDNYREKQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDN-------EI 424 (489) T ss_pred -ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHH-------HH Confidence 1222233333322233345555666666666666533322233221111111112366777433221121 22 Q ss_pred HHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCc Q lcl|NC_019530. 477 LASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKP 547 (560) Q Consensus 477 l~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p 547 (560) ++++.++ +| .+|.+++++.+=..|+++-+++-++|++|.... ...++....|+.. ++-.+.+-.| T Consensus 425 ~~~~~kl---~g-iis~et~~~~l~~v~~~d~~~E~~ri~~E~~~~-~~~~~~~~~~~~~-~~~~~~~~~p 489 (489) T protein:vir:99 425 VTAAQNL---YG-IVSDQTIFEILNTVTGVDAEAELKRLKEEADKK-QSLPEPRLVGDAS-GQEEPTAEKP 489 (489) T ss_pred HHHHHHH---hc-cCCHHHHHHhcCCCCchhHHHHHHHHHHHHHHH-hccccccccCCCC-CCcCCCCCCC Confidence 3444444 35 489999998866777778888788888875332 1112222222221 2222333344 No 38 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=98.48 E-value=4.8e-07 Score=55.22 Aligned_cols=477 Identities=13% Similarity=0.059 Sum_probs=235.5 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccch-hhHHHHHHHHHHH-hhccchhHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESEL-LSVKTVVEEYQSM-AQQPEIRKA 92 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~-~~~~~LI~~YR~m-a~~PEvd~A 92 (560) |..+|..+..--+.+- -....... .-...+|+-............+.+..+ .+-..|..+=|.| .++|=+..| T Consensus 1 Mn~iDr~i~~~sP~~a-~~R~~ar~----~~~~y~aa~~~r~~~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~a 75 (548) T protein:vir:95 1 MNLIDRLLEPLAPELV-ARRLAARE----AIQAYEAARPGRTHKAKRQPLGADTSLQKSAVSMREQCRKLDEDHDLVTGL 75 (548) T ss_pred CchHHhHhhhcchHHH-HHHHHhHH----HhccccccCccccccccCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHH Confidence 4444433332211000 00000000 001233332211111111112222222 2355677788888 789999999 Q ss_pred HHHhhcceeeec-CCCceEEEeeecccchhHHHHHHHHHHHHHHHH------HhCCchhhHHHHHHhhhhhhhhhhheec Q lcl|NC_019530. 93 VDIIVNDVVTCE-EDETPVTVNLDKVEGISDTVKESITECFKEVMH------LMDFDNTAYQKIRKWYVDGRQAYHVIVD 165 (560) Q Consensus 93 I~eIvneaiv~d-~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~------ll~f~~~~~~~~R~wYvDGr~~~hkiiD 165 (560) |+-+++-+|=.. ..-.|-.+..+.. ..+.+.+.|...|+.+++ .++|..--.-.+|.|.+||..|..+..+ T Consensus 76 v~~~~~nvVG~~G~~i~p~~l~~d~~--~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~ 153 (548) T protein:vir:95 76 LDRLEERVVGGSGIGVEPLPLRLDGS--VHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMG 153 (548) T ss_pred HHHHHHhccCccccceeeeecCCCHH--HHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeec Confidence 999988777422 2333334444442 356678889999999986 4556666677999999999999877776 Q ss_pred CCCC-ccCe---eEeEEeCccccchhhhhcccCCcccccccccc--chheeecCccccccccccccccccccccccceec Q lcl|NC_019530. 166 PTNK-KGGI---KKLVMLDSRCIRPVYIVEKAMREGGIEAIESV--TLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVF 239 (560) Q Consensus 166 ~~~~-k~GI---~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~--~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i 239 (560) +... ..|. ..|..|+|.+|.--..-....-..||++..-. .-|.++... .|.. ......+...++ T Consensus 154 ~~~~~~~g~~~~~~lqliepd~l~~~~~~~~~~i~~GIE~D~~Grp~aY~i~~~h------Pgd~---~~~~~~~~~~rv 224 (548) T protein:vir:95 154 RVPNYTFATSVPFALELLEPDYLPFSYNNLSKGIVQGIERDTWRRKRAYHLLKDH------PGNL---QTLGGSLAVKRV 224 (548) T ss_pred ccccccCCcccceEEEEechhhcCCCCCCCCCceeeeeEECCCCceEEEEEeecC------CCcc---cccccccceeee Confidence 5321 2233 46889999887531100000001233332211 145555421 1110 011223456789 Q ss_pred chhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhh Q lcl|NC_019530. 240 DDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKN 319 (560) Q Consensus 240 ~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~kn 319 (560) |.+.|.|+..-. ...---.+|.|+.+++.+.+|.-.+||..+-...-|-.=-+..=+-+. ... T Consensus 225 pA~~VlHif~~~-r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~---~~~------------- 287 (548) T protein:vir:95 225 EAERIIHIAYRK-RIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPD---SYT------------- 287 (548) T ss_pred chhHheeccccc-CCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCc---ccc------------- Confidence 999998888643 333445679999999999999999999999988888763333322221 100 Q ss_pred heeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCc---------chHHHHH-HHHHHHHHhcCCCcccc Q lcl|NC_019530. 320 RNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQL---------GEMDHVN-YFREKLYDALMIPKSRL 389 (560) Q Consensus 320 k~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nL---------gei~DV~-YF~~kLy~aL~VP~sRl 389 (560) ...++ .+.....+| + -|+=|.+|+.|+.+ +..++.. -..+.+=.+|+||-+-| T Consensus 288 -----~~~~~-~~~~~~~~~-~----------pG~iv~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~l 350 (548) T protein:vir:95 288 -----VEPGK-DRKNRTIPI-A----------PGMVFDDLEPGEDVGMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSV 350 (548) T ss_pred -----CCCCc-ccccccccc-c----------CCccccccCCCceeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHH Confidence 00000 011111111 1 13334444444333 3333332 33334556799999998 Q ss_pred CCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHhccCCChhHHHHhhhhhceeeee--ecch Q lcl|NC_019530. 390 QEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRR----QLILKGVTDEKDWNEKIKPFIKFEFT--SDSY 463 (560) Q Consensus 390 ~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~----qLiLkgiit~eew~~~i~~~I~~~f~--~Dn~ 463 (560) ..+-+.| - |-+--.-+.|-+.+.++|..|..-|..++-. ..+|+|.+..-.|.. -...+...|. .=-+ T Consensus 351 tgD~s~n--Y---SS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~-~~~~~~~~W~~P~~~~ 424 (548) T protein:vir:95 351 SRAYDGT--Y---SAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVD-HRTLYAAVYQGPVMPW 424 (548) T ss_pred hcccchh--H---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCC-chhheeeeeecCCccc Confidence 7664322 2 2233345669999999999988777766433 466778764221111 1222333332 2334 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhh----hcCCCCC------------- Q lcl|NC_019530. 464 IREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEK----KKGLYPK------------- 526 (560) Q Consensus 464 f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~----~~~~~~~------------- 526 (560) .-.+||+.-...+++. -.-|.+-+..+ .+.. .++..+||..|. +.|+..+ T Consensus 425 iDP~Kea~A~~~~i~~---------Gl~T~~~~~a~-~G~D---~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~~~~~~ 491 (548) T protein:vir:95 425 INPMHEANAWELLVKA---------GFADEAEVARA-RGRD---PRELKKSRETEIKANRAAGLVFSSDAYHQLVKSGMD 491 (548) T ss_pred cChHHHHHHHHHHHHc---------CCCCHHHHHHH-hCCC---HHHHHHHHHHHHHHHHHcCCCCCCcccccccccccC Confidence 4566766655544431 23444444443 3333 333333333332 3343221 Q ss_pred CCccc-----------ccccccccc----CCCCCC-ccccc--ccCCcccCC Q lcl|NC_019530. 527 VQADE-----------TGNYSGSDV----SPLKFK-PETIP--FSGSTDDSI 560 (560) Q Consensus 527 p~~~e-----------~g~~~~~~~----~~~~~~-p~~~~--~~~~~~~~~ 560 (560) |.++. .|+++-+.+ ++++.+ |++.- -.|+.|+.- T Consensus 492 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 543 (548) T protein:vir:95 492 PVEAVQKVYLGVGKMLTADEARELVNRYGAGLPVPGPDFPNESNNGGADGQP 543 (548) T ss_pred CCCchhhhccccccccccchhHHhhccCCCCCcCCCCCCCcccccCCCCCCC Confidence 11110 111111111 111111 11110 145555544 No 39 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=98.40 E-value=8.4e-07 Score=53.90 Aligned_cols=479 Identities=11% Similarity=0.019 Sum_probs=228.7 Q ss_pred cccchhhhcccccc------ccCCCCCCCCceeEec--cccceeccccccch-hhHHHHHHHHHHH-hhccchhHHHHHh Q lcl|NC_019530. 27 LAQKQDELLLTKAT------VVALDDAQDGSIILQG--GANTYNYVGVESEL-LSVKTVVEEYQSM-AQQPEIRKAVDII 96 (560) Q Consensus 27 ~~~~~~e~~~~k~~------s~~~~~~~dG~~~i~~--~~~~~~y~~~e~~~-~~~~~LI~~YR~m-a~~PEvd~AI~eI 96 (560) +-+.-.......+. +....-..+|+..... ++...+..+.+..+ .+...|..+=|.| .++|=+..||+.+ T Consensus 1 m~~~~~r~~~~~a~~~~~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~~~ 80 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGRPEQSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVGYQ 80 (553) T ss_pred CcchhhhhhcccccccchhhhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Confidence 11110100000000 0011112333321111 11112222223332 2455577888888 7899999999999 Q ss_pred hcceeeecCCCceEEEeee--------cccchhHHHHHHHHHHHHHHHH----------HhCCchhhHHHHHHhhhhhhh Q lcl|NC_019530. 97 VNDVVTCEEDETPVTVNLD--------KVEGISDTVKESITECFKEVMH----------LMDFDNTAYQKIRKWYVDGRQ 158 (560) Q Consensus 97 vneaiv~d~~~~~V~l~Ld--------~~~~~s~sik~~I~eeF~~il~----------ll~f~~~~~~~~R~wYvDGr~ 158 (560) ++-+|=. + +.+... .+....+.+.++|..+|..+++ .++|..--.-.+|.|++||.. T Consensus 81 ~~nvVG~--G---i~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~ 155 (553) T protein:vir:63 81 RDSIVGA--Q---YRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEV 155 (553) T ss_pred HHhhccC--C---ceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCce Confidence 8877631 2 222211 1122467788999999999874 345555667788999999999 Q ss_pred hhhheecCCCCccCe--eEeEEeCccccchhhhhcccCCccccccccccc--------hheeec--Cccccccccccccc Q lcl|NC_019530. 159 AYHVIVDPTNKKGGI--KKLVMLDSRCIRPVYIVEKAMREGGIEAIESVT--------LKYYYN--PNYNRNQFTGQSGT 226 (560) Q Consensus 159 ~~hkiiD~~~~k~GI--~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~--------e~~~Y~--p~~~~~~~~g~~~~ 226 (560) |-.....+. ..+- ..|..|||-+|.--.. .. .+..+..+++ .|.+++ |........+... T Consensus 156 ~~~~~~~~~--~~~~~~~~lq~ie~drl~~~~~----~~-~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~~~~~~~~~- 227 (553) T protein:vir:63 156 LATAEWDRA--ANRPYATCFQMVSTDRLSNPYQ----QL-DTPTLRRGVQYDKRGRPQGYWIQVAHPGDLYQMAPDMYK- 227 (553) T ss_pred EEEeeeccC--CCCcccceEEEechhhcCCCCC----CC-CCCeeEeeeEECCCCceEEEEeeccCCCccccccccccc- Confidence 887666542 1221 3578999977765321 11 2223444432 455554 3321111111100 Q ss_pred cccccccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHH Q lcl|NC_019530. 227 SQNFQPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSA 306 (560) Q Consensus 227 ~~~~~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KA 306 (560) ........+++.+.|.|+.... ...---.+|.|+.+++.+.+|.-.+||...-...-|-. +.+|-.+ .|...+ T Consensus 228 ---~~r~~~~~~v~a~~vlH~f~~~-r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~--a~fi~~~-~~~~~~ 300 (553) T protein:vir:63 228 ---WKFVQQSKPWGRRQVIHILEPR-EPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASY--AAAIESE-LPPEFI 300 (553) T ss_pred ---eeeeccccccChhHheeccccc-CCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhh--eeeeecC-CChhhh Confidence 0011123468888888888653 22334457999999999999999999999998888766 3333332 243333 Q ss_pred HHHHHHH-----------------HHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch-H Q lcl|NC_019530. 307 EEYMTMM-----------------MGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE-M 368 (560) Q Consensus 307 eqYl~~i-----------------m~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i 368 (560) .+.+..- ...+..+.+..-..|.|.. =.-|.+|..+.....-+. . T Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~-----------------L~pGe~i~~~~p~~p~~~~~ 363 (553) T protein:vir:63 301 HSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPH-----------------LFPGTKLNLKPMGTPGGVGS 363 (553) T ss_pred hhhcccccccccccccccccccccccccccccceeecCceeee-----------------cCCCCeeeecCCCCCCCCHH Confidence 3222210 0000011111111111100 012334444444333333 2 Q ss_pred HHHHHHHHHHHHhcCCCccccCCC-CcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHhccCCCh Q lcl|NC_019530. 369 DHVNYFREKLYDALMIPKSRLQEE-GSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLR----RQLILKGVTDE 443 (560) Q Consensus 369 ~DV~YF~~kLy~aL~VP~sRl~~~-~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk----~qLiLkgiit~ 443 (560) +=++...+.+=.+|+||-+-|..+ ++.|.. -+--.-+.|-+.+.++|..|..-|..++- ...+|.|.|.. T Consensus 364 ~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYS-----S~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~ 438 (553) T protein:vir:63 364 EFEASLNRHLASAFGMSYEEFTRDFSKANYS-----SIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPM 438 (553) T ss_pred HHHHHHHHHHHhhcCCCHHHHhhhcccccHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccC Confidence 344555666667899999988665 233322 12234566999999999988777776643 35577777642 Q ss_pred hHH-H-------Hhhhhh--ceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHH-HHH Q lcl|NC_019530. 444 KDW-N-------EKIKPF--IKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVK-EQQ 512 (560) Q Consensus 444 eew-~-------~~i~~~--I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~-e~~ 512 (560) -.+ . ...... ..|..-.=.+.--+||+.-...+++. -+-|.+-+..+ ++..-+|.. |.. T Consensus 439 p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~---------G~~t~~~~~a~-~G~D~~~v~~q~a 508 (553) T protein:vir:63 439 PPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVMRIDA---------GLSTYEREIAR-LGGDFRKSFAQRA 508 (553) T ss_pred CCcccchhhcchhhhhhhhceeeecCCccccChHHHHHHHHHHHHc---------CCCCHHHHHHH-hCCCHHHHHHHHH Confidence 111 0 000000 22222333344556666554444432 22344444433 233332222 222 Q ss_pred HHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccccCCcccC Q lcl|NC_019530. 513 TKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDDS 559 (560) Q Consensus 513 kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~ 559 (560) +..+.-.+.|+..+.........+ +..+..-.+..-+..++.++- T Consensus 509 ~e~~~~~~~Gl~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~e 553 (553) T protein:vir:63 509 REDALLKKYGLTFNLSAKRSLGDG--RDAATGIAEDPAAAQTSQQGE 553 (553) T ss_pred HHHHHHHHcCCCCCCCCccccCCC--cccCCCCCCCCCCCCcccccC Confidence 222222245654332211111111 111111111112223333333 No 40 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=98.40 E-value=8.4e-07 Score=53.89 Aligned_cols=453 Identities=9% Similarity=0.019 Sum_probs=204.9 Q ss_pred ceeEeccccceec--cccccc-hhhHHHHH-------HHHHHHhhccchhHHHHHhhc--------ceeeecCCCceE-- Q lcl|NC_019530. 51 SIILQGGANTYNY--VGVESE-LLSVKTVV-------EEYQSMAQQPEIRKAVDIIVN--------DVVTCEEDETPV-- 110 (560) Q Consensus 51 ~~~i~~~~~~~~y--~~~e~~-~~~~~~LI-------~~YR~ma~~PEvd~AI~eIvn--------eaiv~d~~~~~V-- 110 (560) -|.|.+|....+. .++... .....+|+ .+|+.+..|++-+..|..+-. =.+|+.=-...| T Consensus 1 ~~~~~~~~~~~~~~~~~l~~~e~~~i~~L~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n~~~~iVd~ 80 (504) T protein:vir:99 1 MTEETTSASKFTFRIPELNDDVVDKVNGLYQQLVDRTPRNLLRASFYDGKYAIRQIGNLIPPEYLRTATVLGWSAKAVDT 80 (504) T ss_pred CCccCCcccccccccCCCCHHHHHHHHHHHHHHHHHhHHHHHHHHHHhccccchhccccccHHHHHHhhccCcHHHHHHH Confidence 3444443333222 111111 00111122 233344444433333221110 001111000000 Q ss_pred ---EE---eeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCcccc Q lcl|NC_019530. 111 ---TV---NLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCI 184 (560) Q Consensus 111 ---~l---~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i 184 (560) .+ .+...+ -+ .-.+....|....+|+....++++.-++.||-|. +|-+....++..-++.++|+.+ T Consensus 81 ~a~rl~~~Gf~~~d-~~-----~~~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~--~v~~~~d~~~~~~I~~~sP~~~ 152 (504) T protein:vir:99 81 LARRCNLESFVWPD-GD-----YGSIGGPDVWDENFFATKANNAMVSSLIHGPAFL--INTEGGAGEPDSLIHVKSAMQA 152 (504) T ss_pred HHhhhccceeeCCC-CC-----hhhHHHHHHHHhcChhhHHHHHHHHHHhhCceeE--EEecCCCCCceeEEEEecccee Confidence 00 011111 01 1124466778888999999999999999999775 4434433445556778888876 Q ss_pred chhhhhcccCCccccccc----c-ccchheeecCcccccc-ccccccccccccccccceecchhceeeeecccccCCCCc Q lcl|NC_019530. 185 RPVYIVEKAMREGGIEAI----E-SVTLKYYYNPNYNRNQ-FTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLANGI 258 (560) Q Consensus 185 ~~vr~i~~~~~~g~~~~~----~-~~~e~~~Y~p~~~~~~-~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~~~ 258 (560) -.+..=......-++... + ......+|.|..-++. ..+.........+..-+ +| .|.|++..-++.+.|. T Consensus 153 ~~iyD~~~~~~~~a~~~~~~d~~g~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g--vP--vV~~~n~~~~~~~~G~ 228 (504) T protein:vir:99 153 TGEWNSRRNAMDSLLSITSRDAEGHPTGIALYEDGVTVTADMDDDGDWHADVRTHKLG--VP--VEVLPYKPREDRPLGS 228 (504) T ss_pred EEEEeCCCCceeEEEEEEEecCCCeEEEEEEEcCCcEEEEEEcCCceeeeccccCCCC--cc--eEEecccccCccccCc Confidence 544310000000000000 0 0111223444422211 11110000111111111 33 5677765433333221 Q ss_pred c--hhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccc Q lcl|NC_019530. 259 V--PGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHL 336 (560) Q Consensus 259 i--~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~ 336 (560) . .-.|-..+..+| +.|-+.++.=...=.|.|-|+=.+-..++...++ .+..++ +..|.| T Consensus 229 sei~~~v~~l~Da~~--~~~~~~~~~~e~~a~p~r~i~G~~~~~~~~~d~~-----~~~~~~------~~~~~i------ 289 (504) T protein:vir:99 229 SRITRPVMSLQQRAL--KGCIRMDGHADVYSFPQLILLGADAKNFRNKDGS-----MKPAWQ------IALARV------ 289 (504) T ss_pred ccchhhHHHHHHHHH--HHHHHHHHHHHHhcchhhhhccCCcccccccccc-----ccchhh------hhhhhh------ Confidence 1 112223333333 4566667777777777776653332221111110 000000 000110 Q ss_pred hhhHhhhcccccC-----CCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHH Q lcl|NC_019530. 337 MGIAEDYWLPRRE-----GQNATEIATVGGGNQLGEM-DHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEEL 410 (560) Q Consensus 337 msmlEDywLpRRe-----GgrgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDEl 410 (560) +.||.-+ ++-.+++..++++ +|.-. +-++-.-..+...-++|..-|+-.+. .+.+.+..|.-.+. T Consensus 290 ------~~~~~~~~~~~~~~~~~~~~q~~~~-~l~~~~~~l~~~i~~~a~~t~~P~~~lG~~~~--~n~sSa~Ai~~~~~ 360 (504) T protein:vir:99 290 ------FALPDDEDEPDAARARADVKQFPAS-SPQPHIEMLEQIAMMFSGETSIPVESLGFSNR--ANPTSADAYIASRE 360 (504) T ss_pred ------hcCCCccccccccCccceeeecCCC-ChHHHHHHHHHHHHHHHhhhCCCHHHhccccc--ccccHHHHHHHHHH Confidence 2244321 2234678888885 34432 33555555666668999888864332 11223456777888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC--hhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhc Q lcl|NC_019530. 411 RFSKFCAGLRRRYSHFFMEFLRRQLILKGVTD--EKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVG 488 (560) Q Consensus 411 kF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit--~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vG 488 (560) ...+-+.+.|++|..-+...+|.-+.+.+... +++|. .+.+.|. |.-.. .+.++.+.+..+-.-+. T Consensus 361 ~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~~~~~-----~~~v~w~-d~~~~------s~a~~aDa~~Kl~~ag~ 428 (504) T protein:vir:99 361 DLIAEAEGATDDWSPAFRRSMIRALAIKNGLDRIPPEWK-----TIDSKFR-SPLYL------SKAAQADAGAKMLGAGP 428 (504) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccc-----cceeEec-CCCcc------CHHHHHHHHHHHHhhcc Confidence 88999999999999999999998877765443 34442 2555563 32222 23456777777655544 Q ss_pred chhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcc--ccccccccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 489 SIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQAD--ETGNYSGSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 489 ky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~--e~g~~~~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) .+++...++-..|.+|++||+.+..+.+++...++.....+. ++++. +..+-.|..-|+...++... T Consensus 429 ~l~~~~~~l~~~lg~~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~-----~~~~~~~~~e~a~~~~~~~~ 497 (504) T protein:vir:99 429 EWLKETEVGLELLGLTPQQAKRALAERRRASSVSIIEALNRRQQEAATA-----GEDQDQGAGEPPANEPPAAL 497 (504) T ss_pred ccccchHHHHhhcCCCHHHHHHHHHHHHHHhhHHHHHHHhcccCCCCCC-----CCCCCcCCCCCCCCCCCccC Confidence 566654444466799999999777766665433321111110 11111 11122222223222221111 No 41 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=98.39 E-value=8.9e-07 Score=53.75 Aligned_cols=457 Identities=13% Similarity=0.047 Sum_probs=223.6 Q ss_pred ccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCC----------CCCCCceeEec-cccc--eeccccccc- Q lcl|NC_019530. 4 YGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALD----------DAQDGSIILQG-GANT--YNYVGVESE- 69 (560) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~----------~~~dG~~~i~~-~~~~--~~y~~~e~~- 69 (560) .||-- ..-.+.++..+.+.+ ..-+++..... .+.. .+..+.+.. T Consensus 1 ~~r~~----------------------~~~~~~dr~i~~~~~~~~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i 58 (505) T protein:vir:96 1 MKRAE----------------------KKPSLAQRMVNWAWYRYVEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEI 58 (505) T ss_pred CCCCc----------------------cccchhhcccchhhhhhHHHHHHhhhhcccccCCCccccccCCCCCCChHHHH Confidence 00000 011122222222211 11222221111 1111 111111222 Q ss_pred hhhHHHHHHHHHHH-hhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHH--------hC Q lcl|NC_019530. 70 LLSVKTVVEEYQSM-AQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHL--------MD 140 (560) Q Consensus 70 ~~~~~~LI~~YR~m-a~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l--------l~ 140 (560) ..+-..|..+=|.| .++|=+..||+.+++-+|= .++-.+..--....-...+.+.++|..+|+.+++- ++ T Consensus 59 ~~~~~~lr~RaRdL~rNn~~a~~av~~~~~nvVG-~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~ 137 (505) T protein:vir:96 59 YADLASLVQRAREQSINNPYAKRFYQLLKNNVIG-PKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYH 137 (505) T ss_pred HHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhcC-CCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCC Confidence 22456688888888 5899999999999887762 11211211111111124567999999999999853 33 Q ss_pred CchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhccc-CC--cccccccccc--chheeec--C Q lcl|NC_019530. 141 FDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKA-MR--EGGIEAIESV--TLKYYYN--P 213 (560) Q Consensus 141 f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~-~~--~g~~~~~~~~--~e~~~Y~--p 213 (560) |..--...+|.|.+||..|..+...+.. .-|. .|..|||.+|.--..-... ++ ..||++..-. ..|+++. | T Consensus 138 f~~lq~l~~r~~~~dGE~f~~~~~~~~~-~~~~-~lqliepd~l~~~~n~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~hP 215 (505) T protein:vir:96 138 FVTLLHLWMETLARDGEVLVREHRGYPN-KWGY-ALQILECDRLDLNYNADLQNGNRIRMSIELDAWERPVAYHLLVNHP 215 (505) T ss_pred HHHHHHHHHHHHhhCCceEEEEeecCCC-Ccce-EEEEechhhcCCCCCcccCCcCeEEeceEECCCCceEEEEEeecCC Confidence 4444567899999999988766654322 2333 4889999887643211111 10 1244443322 2455554 2 Q ss_pred ccccccccccccccccccccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceE Q lcl|NC_019530. 214 NYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRA 293 (560) Q Consensus 214 ~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRv 293 (560) ...+... ........++|.+.|.|+..-. ...---.+|.|+.+++.+.+|.-.+||..+-...-|--=-+ T Consensus 216 gd~~~~~---------~~~~~~~~rvpa~~vlH~f~~~-r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~f 285 (505) T protein:vir:96 216 GDNSYCY---------HYAGQTYERVPADEIIHTFVPW-RPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGF 285 (505) T ss_pred Ccccccc---------ccccccccccCHhHhhhhhccc-CCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheee Confidence 2111111 1122345678888888777532 22333457999999999999999999999988887765444 Q ss_pred EEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCC--------- Q lcl|NC_019530. 294 FYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQ--------- 364 (560) Q Consensus 294 FyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~n--------- 364 (560) ..=|.+.+.... .| ..|+ ...+ -+.|| |.+|+.|+. T Consensus 286 i~~~~~~~~~~~-----------------~~-~~~~-----~~~~-----------l~pG~-i~~L~pGe~i~~~~~~~p 330 (505) T protein:vir:96 286 YEQDPEAYDQPP-----------------ED-DQGE-----IVEE-----------VEAGT-YQLLPYGIRFKEHKIDHP 330 (505) T ss_pred eecCCccCCCcc-----------------cc-ccCc-----cccc-----------cCCce-eeecCCCCeeeeeCCCCC Confidence 333444332110 00 0111 0000 01233 555555543 Q ss_pred cchHHHH-HHHHHHHHHhcCCCccccCCC-CcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHhc Q lcl|NC_019530. 365 LGEMDHV-NYFREKLYDALMIPKSRLQEE-GSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLR----RQLILK 438 (560) Q Consensus 365 Lgei~DV-~YF~~kLy~aL~VP~sRl~~~-~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk----~qLiLk 438 (560) -+..++. +-..+.+=.+|+||-+-|..+ ++.|.. .+--.-+.|-+.+.++|..|..-|..++- ...+|. T Consensus 331 ~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~~nYS-----S~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~ 405 (505) T protein:vir:96 331 HTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEGVNFS-----SLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLT 405 (505) T ss_pred CCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc Confidence 3343333 333344567899998888654 232211 12223556999999999998875555432 256677 Q ss_pred cCCChhHHHHhhhhh--ceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHH-HHHHHH Q lcl|NC_019530. 439 GVTDEKDWNEKIKPF--IKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVK-EQQTKI 515 (560) Q Consensus 439 giit~eew~~~i~~~--I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~-e~~kqI 515 (560) |.+..-.+.. ... ..|....=-+.-.+||+.-...+++. -.-|.+-+..+ .+..-+|+- |.+... T Consensus 406 G~i~~p~~~~--~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~---------G~~t~~~~~a~-~G~D~~~v~~q~a~e~ 473 (505) T protein:vir:96 406 QALPLNMVDI--DRLSQYAFQPRGWDWVDPAKDSKAHSESIKN---------RTRSRSSIIRA-AGDDPEDVFDEIAWEE 473 (505) T ss_pred CCcCCCCccc--hhhceeeeccCCccccChHHHHHHHHHHHHc---------CCCCHHHHHHH-cCCCHHHHHHHHHHHH Confidence 8775332211 111 22323333344566766655554431 23344444444 333322222 222222 Q ss_pred HHhhhcCCCCCCCccccccccccccCCCCCCcccccccCCccc Q lcl|NC_019530. 516 AEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDD 558 (560) Q Consensus 516 e~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~ 558 (560) +.-.+.|+-+.+.... .+-.|.--+.++..|+ T Consensus 474 ~~~~~~Gl~~~~~~~~-----------~~~~~~~~~~~~~~d~ 505 (505) T protein:vir:96 474 QLMRDKGVNPTPPEQE-----------SKDATTDEEDDSASDD 505 (505) T ss_pred HHHHHcCCCCCCCCCC-----------CCCCCCCCCCCCCCCC Confidence 2222456533221110 0111111222333344 No 42 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=98.39 E-value=9e-07 Score=53.73 Aligned_cols=423 Identities=14% Similarity=0.081 Sum_probs=187.3 Q ss_pred hccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHHHhhcceeeecCCCceEEEe Q lcl|NC_019530. 34 LLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVN 113 (560) Q Consensus 34 ~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~ 113 (560) -++..-.+++.|-.-+=...+..+.+..+..+. .+.+...+.- .-.+++|-|.+||+-|-+.+.- -|+.+- T Consensus 1 ~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~--~~~~~~~~~~--~~~~~~~~V~acV~~IA~~iA~-----lp~~l~ 71 (518) T protein:vir:78 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGM--QLERQFSLYG--GIYKNQPWVRTVIAKRAQALAR-----LPVKCM 71 (518) T ss_pred CcccCceeeccchhhhhhhhhhhcccccceece--ecccccchhh--HHhhhhHHHHHHHHHHHHhhcc-----CceEEE Confidence 344444444444322222222222222222221 1111111111 1135789999999877776542 244443 Q ss_pred eecccchhHHHHHHHHHHHHHHHHHh----CCchhhHHHHHHh----hhhhhhhhhheecCCCCccCeeEeEEeCccccc Q lcl|NC_019530. 114 LDKVEGISDTVKESITECFKEVMHLM----DFDNTAYQKIRKW----YVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIR 185 (560) Q Consensus 114 Ld~~~~~s~sik~~I~eeF~~il~ll----~f~~~~~~~~R~w----YvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~ 185 (560) -.+.++ ..++.+..+.+| +-...++++++.| .+.|.-|..++.|. +.-+.+|++|+|.+|. T Consensus 72 ~~~~~~--------~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~---~G~~~~L~~l~p~~Vt 140 (518) T protein:vir:78 72 FTSGDT--------ETEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK---SGTPEKLMPMHPSRVA 140 (518) T ss_pred EEcCCc--------cccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC---CCcEEEEEEECCCceE Confidence 322221 112223333222 2333455555544 46688888766544 3458999999998776 Q ss_pred hhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeecccccCCCC-cchhhhH Q lcl|NC_019530. 186 PVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLANG-IVPGLLN 264 (560) Q Consensus 186 ~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~~-~i~syL~ 264 (560) ..+ +.++ ...+|.|... .+ ..+..+.++.+.|.|... ...++. ..+|-|. T Consensus 141 v~~-----~~~~-------~~~~y~~~~~------~~---------~~~~~~~~~~~eIiHir~--~~~dg~~~G~Spi~ 191 (518) T protein:vir:78 141 IKR-----NSRT-------GRYEYYFQAG------AG---------VGTQLVSFADDEVVPIRF--FNPDGLERGLSLME 191 (518) T ss_pred EEE-----cCCC-------CEEEEEEEec------CC---------ccceeEEecCCcEEEecC--CCCCcccccccHHH Confidence 532 1111 1223444321 00 112345688888888762 222322 3578899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhc Q lcl|NC_019530. 265 PAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYW 344 (560) Q Consensus 265 ~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDyw 344 (560) .|.+++.....+++...=+----+.-+-|...+ +.|.+..+++.-+.+...|+-- ...|.+ + .++ T Consensus 192 ~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~-~~ls~e~~~~~k~~~~~~~~G~----~nag~~------~-vL~--- 256 (518) T protein:vir:78 192 SLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSPEAQQRLREQFDRAHAGS----SNTGKT------M-VVE--- 256 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCccEEEecC-CCCCHHHHHHHHHHHHHHhcCc----ccCCce------e-EcC--- Confidence 999988888888877444333335556666776 6666665555444433333311 112321 1 222 Q ss_pred ccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHH-HHHHH Q lcl|NC_019530. 345 LPRREGQNATEIATVGGGNQLGE---MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKF-CAGLR 420 (560) Q Consensus 345 LpRReGgrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kf-i~rLr 420 (560) .|.+++.|. .+.-+ ++-.+|..+.+.++++||...|+..++.+ .+++.+..+. |.++ |.-+- T Consensus 257 -------~G~~~~~l~--~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st--~sn~e~~~~~---f~~~tL~P~~ 322 (518) T protein:vir:78 257 -------EGMEPIPLQ--LTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRAT--FSNISAQMRA---FYRDTMAIPI 322 (518) T ss_pred -------CCceEEecc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCC--chhHHHHHHH---HHHHHHHHHH Confidence 244555553 33333 34445778999999999999996443322 2333333333 5554 33333 Q ss_pred HHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHH Q lcl|NC_019530. 421 RRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNV 500 (560) Q Consensus 421 ~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~I 500 (560) .++...|...| +++.+ -..++. |..+ ++... -+..|++.++.+-.- -++|.+-+++ . T Consensus 323 ~~ie~eln~~L---------~~~~~----~~~~~~--fd~~----~Llr~-D~~~r~~~~~~~~~~--G~lT~NE~R~-~ 379 (518) T protein:vir:78 323 ARIQSAMDKYV---------GQYWV----RKNRMK--FDID----DVIQP-DWEAKSESTQKMVNS--GVATPNEGRE-I 379 (518) T ss_pred HHHHHHHHHhh---------ccccc----CcceEE--eech----hhhcc-CHHHHHHHHHHHHhC--CCcCHHHHHH-H Confidence 33333333222 23211 122233 3222 22222 123555555554322 3788888874 5 Q ss_pred hCCCHHHHHHHHH--------HHH-------HhhhcCCCCCCCccccccccccccCCCCCCcccccccCCc-ccCC Q lcl|NC_019530. 501 LRMSDEEVKEQQT--------KIA-------EEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGST-DDSI 560 (560) Q Consensus 501 L~~tDeEI~e~~k--------qIe-------~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~-~~~~ 560 (560) ++|...|=..-++ -+. +....+.-++|.+. ++.++ +-.+.+-.|..-|-+-+. +++. T Consensus 380 ~gl~pie~~~gD~~~v~~n~~pl~~~~~~~~~g~~~~~~~~~~~~-~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 452 (518) T protein:vir:78 380 MGLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPAST-PVASL--DQSPPASVPGLSPTNSDRSTDSG 452 (518) T ss_pred hCCCCCCCCCCceeeecccceecccccccccCCCCCCCCCCCCcc-ccccc--ccCccccCCCCCccccccccccc Confidence 7775533000000 000 00000100111110 00000 111111111111111100 1111 No 43 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=98.36 E-value=1.1e-06 Score=53.26 Aligned_cols=435 Identities=10% Similarity=0.039 Sum_probs=187.4 Q ss_pred cccccchhhHHHHH-------HHHHHHhhccchhHHHHHhh--------cceeeecCCCceE-----EEeeecccchhHH Q lcl|NC_019530. 64 VGVESELLSVKTVV-------EEYQSMAQQPEIRKAVDIIV--------NDVVTCEEDETPV-----TVNLDKVEGISDT 123 (560) Q Consensus 64 ~~~e~~~~~~~~LI-------~~YR~ma~~PEvd~AI~eIv--------neaiv~d~~~~~V-----~l~Ld~~~~~s~s 123 (560) ...+.. -...|+ .+|..+..+.+=+..|...- +=-++..--...| .|.++. ++-. T Consensus 1 ~~t~~d--~i~~L~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g---~~~~ 75 (480) T protein:vir:78 1 MTTYHE--HVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEG---FRIS 75 (480) T ss_pred CCCHHH--HHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHhhhccCc---eecC Confidence 111111 011111 11222222222111110000 0000000000000 000000 0000 Q ss_pred HHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCC----CCccCeeEeEEeCccccchhhhh--cccCCcc Q lcl|NC_019530. 124 VKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPT----NKKGGIKKLVMLDSRCIRPVYIV--EKAMREG 197 (560) Q Consensus 124 ik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~----~~k~GI~eLr~LDPr~i~~vr~i--~~~~~~g 197 (560) --+...+....|++..+|+.+..++++.-.+.|+-|.+. .+. ...+|-..++.++|+.+=.+..= ..... . T Consensus 76 ~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v--~~~~~~~~d~~~~~~i~~~~p~~~~~i~D~~~~~~~~-~ 152 (480) T protein:vir:78 76 EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITV--SHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVT-R 152 (480) T ss_pred CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEe--ecCccccCCCCCeeEEEEEcccceEEEEcCCCccceE-E Confidence 001224455667777889999999999999999988743 221 12467778889999887665321 11111 1 Q ss_pred cccccc------ccchheeecCccccccc-cccccccccc--ccccc-ceecchhceeeeecccccCCCCcchhhhHHHH Q lcl|NC_019530. 198 GIEAIE------SVTLKYYYNPNYNRNQF-TGQSGTSQNF--QPSQQ-ELVFDDESIVYIDSGEEPLANGIVPGLLNPAI 267 (560) Q Consensus 198 ~~~~~~------~~~e~~~Y~p~~~~~~~-~g~~~~~~~~--~~~~~-~~~i~~daI~y~hsGl~~~~~~~i~syL~~Ai 267 (560) .+.... .+...-+|.|..-++.. .+........ -+... --++|. |.|++---.+. -...|=|.+.+ T Consensus 153 ~i~~~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv--v~f~n~~~~~~--~~G~sdi~~~i 228 (480) T protein:vir:78 153 AVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRLGN--RYGRSEISPEL 228 (480) T ss_pred EEEEEEeecCCcceEEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcce--EEeecccccCC--ccCccchhHHH Confidence 111110 01112334443222211 1110000000 00000 001222 22333211111 12234444433 Q ss_pred ----HHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhh Q lcl|NC_019530. 268 ----RPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDY 343 (560) Q Consensus 268 ----K~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDy 343 (560) ..+| ++|-+.+++-....-|.|-+.=.+....+..+- .++ ..+. +-... T Consensus 229 ~~l~Da~~--~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~-----------~~~--~~~~------------~~~~~ 281 (480) T protein:vir:78 229 RKVTDAAS--RTLMNLQSASQILGTPLRVISGVTTDELTNDGE-----------NTT--LDIY------------YGRIL 281 (480) T ss_pred HHHHHHHH--HHHHHHHHHHHhhcchhhhhhCCCccccccccc-----------cch--hhhh------------hhhhc Confidence 3333 355677777777777877664222221111000 000 0011 11123 Q ss_pred cccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHH Q lcl|NC_019530. 344 WLPRREGQNATEIATVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRR 422 (560) Q Consensus 344 wLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~r 422 (560) |++ +-+.++-++++. ++.- ++-++-....++..-++|..-|...+. | -+.+..|.--+...-.-+.+.|+. T Consensus 282 ~~~----~~~~~~~~~~~~-~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~-n--~~Sg~Al~~~~~~l~~k~~~~~~~ 353 (480) T protein:vir:78 282 TLA----SEAAKISEFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSE-N--PASAEAIIATDSRIVKMAERKGRI 353 (480) T ss_pred cCC----CCCceEEecCcc-CHHHHHHHHHHHHHHHhcccCCCHHHhccccC-c--hhHHHHHHHHHHHHHHHHHHHHHH Confidence 443 234678888874 3433 333566666677778888777764332 1 111234555566677788999999 Q ss_pred HHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhC Q lcl|NC_019530. 423 YSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLR 502 (560) Q Consensus 423 fs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~ 502 (560) |..-+...++.-+.+.|.-...+|. .|.+.|..-..=. +.+.++.+.++-.-+.-.+|.++++ .+|. T Consensus 354 f~~~l~~~~rl~~~~~~~~~~~~~~-----~i~v~w~~~~~~s-------~~~~ad~~~kl~~~g~~~~s~et~~-~~lg 420 (480) T protein:vir:78 354 FGGAWERAMRIAMQIMGREVTEEYT-----RLETVWRDPSTPT-------VAAKADAVSKLYANGQGPIPKEQAR-IDLG 420 (480) T ss_pred HHHHHHHHHHHHHHHcCCCccccce-----eeeEEecCCCCCC-------HHHHHHHHHHHHHhcccCCCHHHHH-hcCC Confidence 9999999999877777754444553 2566663221111 1245555665554434578989988 5689 Q ss_pred CCHHHHHHHHHHHHHhhhcCC--CCCCCccccccccccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 503 MSDEEVKEQQTKIAEEKKKGL--YPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 503 ~tDeEI~e~~kqIe~E~~~~~--~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) +++++++++++..++|..... ...|.. .++++.+....+.. +++.=++.++.-.+- T Consensus 421 ~~~d~~~e~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 478 (480) T protein:vir:78 421 YTATQREQMRDWDKQETEDMIDTLYSTTK-AQADATPKPTVTET-KTETQTSPSGFNRTK 478 (480) T ss_pred CCHhHHHHHHHHHHHHHHHHHHHhhcccc-CCCccccCCCCCCC-CCccCCCcccCCCcC Confidence 999999987755544432211 111110 11111111111111 111111111111111 No 44 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=98.35 E-value=1.1e-06 Score=53.20 Aligned_cols=443 Identities=12% Similarity=0.112 Sum_probs=197.8 Q ss_pred hhcccccchhhhccccccc-cCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhh--ccchhHH------- Q lcl|NC_019530. 23 DTDKLAQKQDELLLTKATV-VALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQ--QPEIRKA------- 92 (560) Q Consensus 23 ~~~~~~~~~~e~~~~k~~s-~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~--~PEvd~A------- 92 (560) =.+++++.-.+-...-... .+.+-.+. .++.. -....+-|+.+|.|.. ||.+..- T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~i~~-~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~ 65 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLKSLKDVTDH--------------KKVNA-NDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGN 65 (499) T ss_pred ChhHHHHHHHHHHHHhccccchhhhhcC--------------CCCcC-CHHHHHHHHHHHHHhcCCcchhhccccccCCC Confidence 3333333322222110000 00000000 00000 1123345666666643 3322210 Q ss_pred -----------HHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhh Q lcl|NC_019530. 93 -----------VDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYH 161 (560) Q Consensus 93 -----------I~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~h 161 (560) =..||++..=+ --..||++.+++. ...+.++.+++-.+|.....+++..-..-|..||| T Consensus 66 ~~~~~~~s~n~~~~iv~~~a~~-l~~ep~~i~~~d~---------~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~ 135 (499) T protein:vir:80 66 PVNRRQLSMNLPKVTAKYMSKL-LFNEKVKINIDDE---------TAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIK 135 (499) T ss_pred ccccceeecchHHHHHHHHHHh-hhCCcceEeeCCH---------HHHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEE Confidence 01112211111 0113555555542 45566677777778999999999999999999999 Q ss_pred heecCCCCccCeeEeEEeCccccchhhhhcccC-Cccccccccc---cc------hheeecCcccc-c--------ccc- Q lcl|NC_019530. 162 VIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAM-REGGIEAIES---VT------LKYYYNPNYNR-N--------QFT- 221 (560) Q Consensus 162 kiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~-~~g~~~~~~~---~~------e~~~Y~p~~~~-~--------~~~- 221 (560) ..+|.+ |=..+.+++|.++=++. .+. .-..+-++.. .. |+..++..... + ..+ T Consensus 136 ~~~D~~----~~~~i~~v~a~~~~Pi~---~d~~~~~~~~f~~~~~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~ 208 (499) T protein:vir:80 136 VYHDGN----KNVKVSFATADCMYPLS---NDSENVDECLIANSFHKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDP 208 (499) T ss_pred EEECCC----CcEEEEEEcCCceEEEE---ecCCCeEEEEEEEEEeecCeEEEEEEEEEecccceeeEEEEEEEEeccCc Confidence 999864 32568999998877652 111 1111111110 00 11111110000 0 000 Q ss_pred ---ccccccccccccc-cceecc-hhceeeeeccc-----ccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccc Q lcl|NC_019530. 222 ---GQSGTSQNFQPSQ-QELVFD-DESIVYIDSGE-----EPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEK 291 (560) Q Consensus 222 ---g~~~~~~~~~~~~-~~~~i~-~daI~y~hsGl-----~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeR 291 (560) |.......+.+.. ....++ .+...|++.-. .+.......|-++.|...+..|-..-+.+ .|..|.=.+ T Consensus 209 ~~lG~~v~l~~~~~~~~~~~~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~--~~e~~~~~~ 286 (499) T protein:vir:80 209 NELGGKVSLKLLFNDIEPVVPLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSY--YQEFKLGKK 286 (499) T ss_pred cccCcccchhhhccCcCCceeecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHH--HHHHHhccc Confidence 1000000000000 000000 01111121100 01122234678888888888887776664 477888777 Q ss_pred eEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcc-cccCCCCccceeecCCCCCcch-HH Q lcl|NC_019530. 292 RAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWL-PRREGQNATEIATVGGGNQLGE-MD 369 (560) Q Consensus 292 RvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywL-pRReGgrgTEIsTLpGg~nLge-i~ 369 (560) |+|. +..=++ ...-++ |+.. .-++.-.+.|-. +=..++.|--|+++.+.-.-.+ .. T Consensus 287 ~i~v-~~~~l~----------------~~~~~~---g~~~--~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~ 344 (499) T protein:vir:80 287 KVLV-PSSFVK----------------TAVNLD---GSTT--QYFDSTDEAFFLYQGEQDDNGKAIKDISVEIRSTEFIE 344 (499) T ss_pred ceec-chhhhh----------------ccCCCC---CCcc--cCCCcccceeeEeeccCCCCcCceeEecCcCChHHHHH Confidence 7764 211110 000011 1100 000000111111 0011122223677666333332 35 Q ss_pred HHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---ccCCChhHH Q lcl|NC_019530. 370 HVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLIL---KGVTDEKDW 446 (560) Q Consensus 370 DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiL---kgiit~eew 446 (560) -++.+.+.+....++|-+-|+.+++ |-..|+||.-....-..-+...++.|-.-+.++++.-|-+ -+..+-..| T Consensus 345 ~l~~~l~~i~~~~g~s~~~fg~~~~---g~~TAtei~s~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~ 421 (499) T protein:vir:80 345 SINAMLRIYAMQVGLSAGTFTFDEN---GLKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKLIKAYDGDTV 421 (499) T ss_pred HHHHHHHHHHHhcCCChhhcCCCcc---cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCC Confidence 6778889999999999998887654 2234677754443333345555555555555555443321 122222223 Q ss_pred HHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCC Q lcl|NC_019530. 447 NEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPK 526 (560) Q Consensus 447 ~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~ 526 (560) +. ..+.++|... ...- .+. .++.+.++-. .| .+|.++++.+....||+|.+++.++|++|.... . T Consensus 422 ~~---~~v~v~f~d~-i~~d-~~~-----~~~~~~~~~~-~G-i~S~et~l~~~~~~~d~ea~~el~~i~~E~~~~-~-- 486 (499) T protein:vir:80 422 EL---DTITVDFDDS-IAQD-EDT-----TINRYTTAKN-QG-MIPLKIALQRAWNITEAEADEWAEMLAKEKQAE-I-- 486 (499) T ss_pred Cc---cceEEEeCCC-CCCC-HHH-----HHHHHHHHHH-cC-CCCHHHHHhhcCCCChHHHHHHHHHHHHHhhcC-C-- Confidence 21 2467777433 2221 111 1111111111 13 588899988888999999999999999996432 2 Q ss_pred CCccccccccccc Q lcl|NC_019530. 527 VQADETGNYSGSD 539 (560) Q Consensus 527 p~~~e~g~~~~~~ 539 (560) |+.+..|-++... T Consensus 487 ~~~d~~g~~ge~e 499 (499) T protein:vir:80 487 PNNDMTGIFGEEE 499 (499) T ss_pred CCCCccccCCCCC Confidence 2222222223222 No 45 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=98.35 E-value=1.2e-06 Score=53.14 Aligned_cols=405 Identities=14% Similarity=0.177 Sum_probs=173.1 Q ss_pred hhhhhhhhhhhhhhhhcccccchhhhccccccccCCC-CCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhcc Q lcl|NC_019530. 9 FGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALD-DAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQP 87 (560) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~-~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~P 87 (560) .||| ..++ .+.....++.... ...+|. .....|.+.. .+-.+|-..| +.|+ T Consensus 1 ~~~~----m~~~------------~~~~~~~D~~~~~~~~~~g~------~~~~~~~~~~---~~~~~l~~~Y---~~~~ 52 (435) T protein:vir:79 1 MGVF----MSDK------------VKAITKEDGYNEIFGSKDGT------FRPNAFYMQR---AAFKALSQFY---EEDG 52 (435) T ss_pred CCcc----cccc------------cccchhhcchhhhhcccccc------cccCcccCCc---CCHHHHHHHH---hcCc Confidence 2222 1111 1111122222111 111121 1111111111 2334565444 4689 Q ss_pred chhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhh--hhhhheec Q lcl|NC_019530. 88 EIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGR--QAYHVIVD 165 (560) Q Consensus 88 Evd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr--~~~hkiiD 165 (560) .++.||+.+..||+-+- +. ++..+ -+++ |+..++-|+...+..+.+|.=-+.|. ++.+ +.| T Consensus 53 l~~~~Vd~~aed~~r~g-----~~--i~g~~-----~~~~----~~~~~~~l~~~~~l~~a~~~~rl~G~~~i~i~-~~d 115 (435) T protein:vir:79 53 MARRIVDVIPEEMVTPG-----FK--VDGVK-----NEKS----FKSRWDELRLNAKIIDALSWSRLFGGSAILAV-VAD 115 (435) T ss_pred hhhhhhccchHHhhcCC-----ce--ecCCC-----hHHH----HHHHHHHhhHHHHHHHHHHhhhccccEEEEEE-ecC Confidence 99999999998887654 22 22211 1223 44444445667778887775555554 3332 123 Q ss_pred ------CCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceec Q lcl|NC_019530. 166 ------PTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVF 239 (560) Q Consensus 166 ------~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i 239 (560) |-+++..|+.++.|||.++.+-- .. .+... -.+|.|. .|...+. ......+| T Consensus 116 ~~~~~~Pl~~~g~i~~i~v~d~~~i~~~~-~~-~dp~s----------p~fg~P~--~y~v~~~--------~~~~~~~i 173 (435) T protein:vir:79 116 NKMLKSPVKPGAQLEDIRVYDRYQITIHE-RE-TNARS----------VRYGEPK--LYKISPG--------GDIPEFFV 173 (435) T ss_pred CCCcccccccCCceeeEEeechhhccchh-hc-cCCcc----------cccCcce--EEEEecC--------CCCCceEE Confidence 33567788899999997775421 10 01000 0112221 1111111 11224578 Q ss_pred chhceeeeecc-cc--cC--CCCcchhhh-HHHHHHHHHHHHHHH--HHHHHHHhcCccceEEEc-cCCCC-----chHH Q lcl|NC_019530. 240 DDESIVYIDSG-EE--PL--ANGIVPGLL-NPAIRPLNNLVTTED--ATVIYAITRAPEKRAFYL-DVGTL-----GKKS 305 (560) Q Consensus 240 ~~daI~y~hsG-l~--~~--~~~~i~syL-~~AiK~~NqL~m~ED--alVIyRitRAPeRRvFyI-DvGnl-----pk~K 305 (560) |++-++..+.- +- .. ......|.| +++...+.+...... +.++++-. . +|+.+ ++..+ .... T Consensus 174 H~SRli~~~g~~~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~---~-~v~~~~~l~~~~~~~~~~~~ 249 (435) T protein:vir:79 174 HYSRICIIDGERVSNEKRRQNDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQ---Q-AVWKARDLALMCDDEEGRYA 249 (435) T ss_pred cceeEEEecCCcchhhhccccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhc---C-ccccchhHHHhhcCccchHH Confidence 88876544211 10 01 112223434 444443333322222 23444431 2 23444 22221 1111 Q ss_pred HHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHH-HHHHHHHHHhcCC Q lcl|NC_019530. 306 AEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHV-NYFREKLYDALMI 384 (560) Q Consensus 306 AeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV-~YF~~kLy~aL~V 384 (560) +..-+. .++++|+. +| .|.+. | ..-+++++. .+|+-++|+ .+|...+-.+.+| T Consensus 250 ~~~r~~-~~~~~~~~------~~-------~~~i~---------~-~~e~~e~~~--~~lsgl~~~~~~~~~~iaaa~~I 303 (435) T protein:vir:79 250 ARLRLA-QVDDESGV------GK-------AIGID---------A-TDEEYEVLN--SDVSGVPEFLQEKIDRIVALTGI 303 (435) T ss_pred HHHHHH-HHHHhcCC------CC-------ceeEe---------c-CCcceEEEe--cccCCHHHHHHHHHHHHHhhhCC Confidence 111111 12233321 11 11110 1 111233332 245555554 7899999999999 Q ss_pred CccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchH Q lcl|NC_019530. 385 PKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYI 464 (560) Q Consensus 385 P~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f 464 (560) |+.||-.++.-.++.+ +.+ |--.|..+|+++|. ..+..+|++-+-+ .+.+ ++ +.|.|..=..- T Consensus 304 P~t~L~G~s~~glnst-gd~---d~~~yyd~i~~~Qe---~~l~p~l~~l~~l-i~~s-~d--------~~~~f~pL~~~ 366 (435) T protein:vir:79 304 HEIIIKNKNTGGVSAS-QNT---ALETFYKLIDRKRV---EDYKPILEFLLPF-MISE-TE--------WSIEFEPLSVP 366 (435) T ss_pred CeeeeccCCccccccc-hhH---HHHHHHHHHHHHHH---HHHHHHHHHHHHH-hhcC-CC--------CeEEeCCCCCC Confidence 9999965554333332 122 33349999999985 3444444432111 1222 33 45556544445 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHh-hhcCC---CCCCCc--ccccccccc Q lcl|NC_019530. 465 REQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEE-KKKGL---YPKVQA--DETGNYSGS 538 (560) Q Consensus 465 ~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E-~~~~~---~~~p~~--~e~g~~~~~ 538 (560) +|...+|+...++++++.+-.- -.+|.+-+++.+ + ...... ..... ++++.. ++++.++++ T Consensus 367 sekEkAei~~~~a~a~~~~~~~--g~i~~~e~r~~L-~----------~~~~~~~~~~~~~~~~~~~~d~~~~~~~e~g~ 433 (435) T protein:vir:79 367 SDKDKAEIMAKNVESVVKLKAE--QAINLKETRDTL-R----------SICPDLKIMDNDNIELPEPEDLDPEPGQEGGL 433 (435) T ss_pred CHHHHHHHHHHHHHHHHHHHhc--CCCCHHHHHHHH-H----------HhccccCCCCcccccCCccccCCCCCCCCCCC Confidence 6666788888888877765332 245555555432 1 111111 01111 111111 111222211 Q ss_pred cc Q lcl|NC_019530. 539 DV 540 (560) Q Consensus 539 ~~ 540 (560) .- T Consensus 434 ~~ 435 (435) T protein:vir:79 434 NK 435 (435) T ss_pred CC Confidence 11 No 46 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=98.34 E-value=3.1e-07 Score=56.24 Aligned_cols=452 Identities=13% Similarity=0.061 Sum_probs=192.6 Q ss_pred cccccCCCCCCCCceeEec--c----------ccceeccccccchhh-HHHHHHHHHHH-hhccchhHHHHHhhcceeee Q lcl|NC_019530. 38 KATVVALDDAQDGSIILQG--G----------ANTYNYVGVESELLS-VKTVVEEYQSM-AQQPEIRKAVDIIVNDVVTC 103 (560) Q Consensus 38 k~~s~~~~~~~dG~~~i~~--~----------~~~~~y~~~e~~~~~-~~~LI~~YR~m-a~~PEvd~AI~eIvneaiv~ 103 (560) -++-.+...+.|....++. . -...+|.+- ..+.. ...+-+.+|.. +.+.=+.-+|+..+.-. +. T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~G~-~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l-~~ 78 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTERTQDLGDNTAYYESE-RRPDAVGVTVPQQMQKLLAHVGYPRLYIDAIAARQ-EL 78 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-ccchhcccccchhHHhhhhhcCcHHHHHHHHHhhh-cc Confidence 0000111111111111110 0 001111111 11100 00000111111 11222233333332211 11 Q ss_pred cCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCC----ccCeeEeEEe Q lcl|NC_019530. 104 EEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNK----KGGIKKLVML 179 (560) Q Consensus 104 d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~----k~GI~eLr~L 179 (560) + .+. ....+ ...+.+..+.+-.+|+....++++.-++.|+-|.+.-.|.... ..+...++.+ T Consensus 79 ~----g~~--~~~~~--------~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~ 144 (484) T protein:vir:77 79 E----GFR--LGGAD--------KADEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVE 144 (484) T ss_pred C----cee--cCCcc--------hhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEe Confidence 1 111 11111 2345577788888999999999999999999888655443321 2344568889 Q ss_pred CccccchhhhhcccCCccccccc-----cccchheeecCcccccccccccccccc--ccccccceecchhceeeeecccc Q lcl|NC_019530. 180 DSRCIRPVYIVEKAMREGGIEAI-----ESVTLKYYYNPNYNRNQFTGQSGTSQN--FQPSQQELVFDDESIVYIDSGEE 252 (560) Q Consensus 180 DPr~i~~vr~i~~~~~~g~~~~~-----~~~~e~~~Y~p~~~~~~~~g~~~~~~~--~~~~~~~~~i~~daI~y~hsGl~ 252 (560) +|+.+-.+..=.+....-.+... ..+....+|.|..-+............ ..-... ++|. |.|++.--+ T Consensus 145 ~p~~~~~~~D~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g--~vPv--v~f~N~~~~ 220 (484) T protein:vir:77 145 PPTNLYAQIDPRTRQVMRAIRAIEDEEGNEVIGATLYLPNNTVIWNREDGQWVQVANVAHNLE--MVPV--IPIPNRTRL 220 (484) T ss_pred ccceeEEEecCCCCceEEEEEEEEeecCCcEEEEEEEecCeEEEEEecCCceEeeccccCCCC--Ccce--EEecccccc Confidence 99887655321000000000000 011122234443222111110000000 000011 2221 345442111 Q ss_pred cCCCCcchhhhHHHHHHH-HHH-HHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcc Q lcl|NC_019530. 253 PLANGIVPGLLNPAIRPL-NNL-VTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKI 330 (560) Q Consensus 253 ~~~~~~i~syL~~AiK~~-NqL-~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev 330 (560) ..-...|-+.+.++++ ..+ +++-+.+++-+..-.|.|-|.-.+....+..+. +...+++...| T Consensus 221 --~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~-----------~~~~~~~~~~~-- 285 (484) T protein:vir:77 221 --SDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVKGEELGVDPE-----------TGQTLFDAYLA-- 285 (484) T ss_pred --CccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCCcchhccccc-----------ccchhhhhhhh-- Confidence 1112234454444433 232 445566666666667777554333333221110 00111111112 Q ss_pred cCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHH Q lcl|NC_019530. 331 TGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEEL 410 (560) Q Consensus 331 ~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDEl 410 (560) .+|..-- -++.+.+++..+-=+-++-++-.-.++...-++|.+-|...+. |. +.+..|.--+. T Consensus 286 -----------~~~~~~~---~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~-n~--~Sg~Al~~~~~ 348 (484) T protein:vir:77 286 -----------RILAFED---HESKAQQFSAAELRNFVDALDALDRKAAAYTGLPPYYLSFSSE-NP--ASAEAIRSSES 348 (484) T ss_pred -----------hhcccCC---CCceeEeecCCChHHHHHHHHHHHHHHhcccCCCHHHhccccC-cc--hHHHHHHHHHH Confidence 3554322 2355777776432123344555556666677888888865432 21 22345666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-ChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcc Q lcl|NC_019530. 411 RFSKFCAGLRRRYSHFFMEFLRRQLILKGVT-DEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGS 489 (560) Q Consensus 411 kF~Kfi~rLr~rfs~if~d~Lk~qLiLkgii-t~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGk 489 (560) .+-.-+.+.|+.|..-+...++.-+.+.|.. .+.+|. .|.+.|.. +...- +.+.++.+.++-.-+-. T Consensus 349 ~l~~ka~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~~-----~i~v~w~~-~~~~s------~~~~ad~~~kl~~~g~g 416 (484) T protein:vir:77 349 RLVKTVERKNKIFGGAWEQAMRVAYKVMNGGDIPPEYY-----RMESIWRD-PSTPT------YAAKADAATKLYNNGQG 416 (484) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccccc-----cceEEecC-CCCCC------HHHHHHHHHHHHhccCC Confidence 6777888899999998888888777666542 222332 36666733 22111 12444555555333224 Q ss_pred hhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCC-CCCCCc---cccccccccccCCCCCCcccccccCCcc Q lcl|NC_019530. 490 IFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGL-YPKVQA---DETGNYSGSDVSPLKFKPETIPFSGSTD 557 (560) Q Consensus 490 y~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~-~~~p~~---~e~g~~~~~~~~~~~~~p~~~~~~~~~~ 557 (560) .+|.++++ ..|.+++.+++++++..++|.-.+. ..++.. .+.|+.+..+..+. ++.-|..+++- T Consensus 417 i~s~et~~-~~l~~~~~~~~e~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~ 484 (484) T protein:vir:77 417 VIPKERAR-IDMGYSITEREEMRKWDEEEQAQGLGLMGTMFGTDPSGGGNPDNPETPE---PQPNPAEEAAA 484 (484) T ss_pred CCCHHHHH-hcCCCChhHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCCCCccc---ccCCCccccCC Confidence 78999988 5599999999987665555532110 001111 11122221111111 22222222222 No 47 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=98.33 E-value=1.3e-06 Score=52.94 Aligned_cols=413 Identities=13% Similarity=0.129 Sum_probs=185.9 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) |+.++.+.+..+ ++ +..+.... .++....+-.+...+...+ +. +..+++|-|.+||. T Consensus 1 M~~~~~~f~~~~--r~-------~~~~~~~~--~~~~~~~~~~g~~~~~~~v-----~~-------~~al~~~~v~~~i~ 57 (429) T protein:vir:10 1 MDSVKKFFNFEK--RQ-------TSQVIELN--KDDEKLLEWLGISPSTISV-----KG-------KNALKVATVFACIK 57 (429) T ss_pred Cchhhhhhcccc--cC-------cccccccC--CChHHHHHHhcCCCCccee-----ch-------hhhhccHHHHHHHH Confidence 444444333221 00 11111111 1111111111111111111 11 12356899999997 Q ss_pred HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHH-----hCCchhhHHHHHHhhhhhhhhhhheecCCCC Q lcl|NC_019530. 95 IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHL-----MDFDNTAYQKIRKWYVDGRQAYHVIVDPTNK 169 (560) Q Consensus 95 eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l-----l~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~ 169 (560) -|-+.+.-. |+.+--+..++ .++....-..++|+. +....--+.++..+++.|.-|..++.|. T Consensus 58 ~ia~~ia~l-----~~~~~~~~~~~----~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~--- 125 (429) T protein:vir:10 58 ILSESVSKL-----PLKIYQEDEYG----IQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDR--- 125 (429) T ss_pred HHHHhhccC-----ceEEEEecCCc----eeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECC--- Confidence 766655432 33332222221 111222223333332 1112223455666788898888877654 Q ss_pred ccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeec Q lcl|NC_019530. 170 KGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDS 249 (560) Q Consensus 170 k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hs 249 (560) ...+.+|.+|+|..+..++ +..+. ......-||.++. .+....++.+.|.|..- T Consensus 126 ~G~~~~L~~i~~~~v~v~~-----~~~~~--~~~~~~~~~~~~~-------------------~g~~~~~~~~evih~~~ 179 (429) T protein:vir:10 126 KGKVQALWPIDASKVTVYI-----DDVGL--LNSKTKMWYVVNT-------------------GGQQRVLKPEEILHFKN 179 (429) T ss_pred CCcEEEEEEEcCceeEEEE-----cCccc--ccccceEEEEEcc-------------------CCeEEEEccccEEEecC Confidence 3348899999999887643 11111 1111222333331 12345688888888764 Q ss_pred ccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCc Q lcl|NC_019530. 250 GEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGK 329 (560) Q Consensus 250 Gl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGe 329 (560) +. ..++-...|.|..|.+++.....++.+..=+----+.-+-+..++ +.|.+.++++..+.+...|.. T Consensus 180 ~~-~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~-~~l~~e~~~~~~~~~~~~~~g---------- 247 (429) T protein:vir:10 180 GI-TLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-GDLNEDAKKVFRENFESMSSG---------- 247 (429) T ss_pred CC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhcc---------- Confidence 32 334445678999999999998888887766644445556777776 567777776655554433321 Q ss_pred ccCcccchhhHhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhh Q lcl|NC_019530. 330 ITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE---MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEIT 406 (560) Q Consensus 330 v~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eIt 406 (560) +++.-+.+ .++ .|.+++.|. .+..+ ++-.++..+.+.++++||.+-|....+.+ .+++.+.. T Consensus 248 ~~n~~~~~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~--~sn~e~~~ 312 (429) T protein:vir:10 248 LQNSHRIA-LMP----------VGYQFQPIS--LNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT--LNNIEQQQ 312 (429) T ss_pred ccccCcee-ecC----------CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHH Confidence 11111222 222 245566653 23333 33345778899999999999996433322 23333332 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_019530. 407 QEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPF 486 (560) Q Consensus 407 RDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~ 486 (560) +. |.++ .|+- +...+. +-+-+.++++.+|.. -..+.|..+. +... -+..|++.++.+-.- T Consensus 313 ~~---f~~~--~l~P-~~~~ie-----~~ln~kl~~~~~~~~----g~~~~fd~~~----ll~~-d~~~~~~~~~~~~~~ 372 (429) T protein:vir:10 313 QQ---FYTD--TLQA-TLTMYE-----QEMTYKLFLDSELDK----GFYSKFNVDA----ILRA-DIKTRYEAYRTGIQG 372 (429) T ss_pred HH---HHHH--HHHH-HHHHHH-----HHHHHhhcChhhcCC----CcEEEeechh----hhcC-CHHHHHHHHHHHHhC Confidence 22 4332 1221 111112 222334567666642 2334443332 2111 122455554443222 Q ss_pred hcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 487 VGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 487 vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) -++|.+-++. +|.+...+ .-++.+ .+++..+-+. -++..+-.|.+++.. T Consensus 373 --G~~T~NE~R~-~~gl~p~~--ggD~~~---------------~~~n~~~~d~-----~~~~~~k~g~~~~~~ 421 (429) T protein:vir:10 373 --GFLKPNEARS-KEDLPPEA--GGDRLL---------------VNGNMLPIDM-----AGQAYLKGGDTNGEV 421 (429) T ss_pred --CCcCHHHHHH-HhCCCCCC--CcCeee---------------ecccccchhh-----ccccccCCCCCCCCC Confidence 3677777763 45553210 000000 0111100000 000000011111111 No 48 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=98.27 E-value=1.9e-06 Score=52.01 Aligned_cols=413 Identities=12% Similarity=0.101 Sum_probs=184.6 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) |+.++...+.-.+.++. +...+... .++...++-.+...+.+.+ + -+..+++|.|.+||. T Consensus 1 M~~~~r~~~~~~~~~r~------~~~~~~~~--~~~~~~~~~~g~~~~~~~v-----~-------~~~al~~~~v~~~i~ 60 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQ------TSQVIELN--KDDEKLLEWLGISPSTISV-----K-------GKNALKVATVFACIK 60 (432) T ss_pred CChHHHHHHhcCccccC------cccccccC--CchHHHHHHhCCCcCcccc-----c-------hhhhhccHHHHHHHH Confidence 33322221111111111 00111111 1111111101101111111 1 123467899999997 Q ss_pred HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHH-hCCchh----hHHHHHHhhhhhhhhhhheecCCCC Q lcl|NC_019530. 95 IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHL-MDFDNT----AYQKIRKWYVDGRQAYHVIVDPTNK 169 (560) Q Consensus 95 eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l-l~f~~~----~~~~~R~wYvDGr~~~hkiiD~~~~ 169 (560) -|.+.+.- -|+.+.-+..++ .++........+|+. =+-... .+.++..+++.|.-|..++.|. T Consensus 61 ~ia~~ia~-----lp~~~~~~~~~~----~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~--- 128 (432) T protein:vir:10 61 ILSESVSK-----LPLKIYQEDEYG----IQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDR--- 128 (432) T ss_pred HHHHhhcc-----CceEEEEecCCc----eeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECC--- Confidence 77665542 233332222111 111222233333332 112223 3445555667798888777654 Q ss_pred ccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeec Q lcl|NC_019530. 170 KGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDS 249 (560) Q Consensus 170 k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hs 249 (560) ..-+.+|.+|+|..|..++ +..+. ......-||.+++ .+....++.+.|.|..- T Consensus 129 ~G~~~~L~~i~~~~v~v~~-----d~~~~--~~~~~~~~y~~~~-------------------~g~~~~~~~~eiih~r~ 182 (432) T protein:vir:10 129 KGKVQALWPIDASKVTVYI-----DDVGL--LNSKTKMWYVVNT-------------------GGQQRVLKPEEILHFKN 182 (432) T ss_pred CCcEEEEEEEcCceeEEEE-----cCccc--ccccceEEEEEec-------------------CCeEEEEccccEEEecC Confidence 3338999999998887643 11111 1111122233321 12345688888888864 Q ss_pred ccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCc Q lcl|NC_019530. 250 GEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGK 329 (560) Q Consensus 250 Gl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGe 329 (560) +. ..+.-..+|.|..|++++.....+++...=+----+.-+-|..++ +.|.+..+++..+.+...|.. T Consensus 183 ~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g---------- 250 (432) T protein:vir:10 183 GI-TLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-GDLNEDAKKVFRENFESMSSG---------- 250 (432) T ss_pred CC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhcc---------- Confidence 42 334455779999999999999888887665544445556777776 467666665554444433321 Q ss_pred ccCcccchhhHhhhcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHH Q lcl|NC_019530. 330 ITGNAHLMGIAEDYWLPRREGQNATEIATVGG-GNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQE 408 (560) Q Consensus 330 v~d~~~~msmlEDywLpRReGgrgTEIsTLpG-g~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRD 408 (560) +++..+.+ .++ .|.++..|.- ...+.-++-.++..+.+.++++||...|+..+..+ .+++.+..+. T Consensus 251 ~~n~~~~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~--~s~~e~~~~~ 317 (432) T protein:vir:10 251 LQNSHRIA-LMP----------VGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT--LNNIEQQQQQ 317 (432) T ss_pred cccCCcce-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHHHH Confidence 11221222 222 2455666532 11222234456778999999999999996433322 2233333222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhc Q lcl|NC_019530. 409 ELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVG 488 (560) Q Consensus 409 ElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vG 488 (560) |.++ .|+- +...+.+.| -+.++++.+|.. -+.+.|..+ +|... -+..|++.++.+-.- T Consensus 318 ---~~~~--~l~P-~~~~ie~~l-----n~kLl~~~~~~~----g~~~~fd~~----~l~~~-d~~~~~~~~~~~~~~-- 375 (432) T protein:vir:10 318 ---FYTD--TLQA-TLTMYEQEM-----TYKLFLDSELDK----GFYSKFNVD----AILRA-DIKTRYEAYRTGIQG-- 375 (432) T ss_pred ---HHHH--HHHH-HHHHHHHHH-----HHhhcChhhcCC----CcEEEeech----hhhcC-CHHHHHHHHHHHHhC-- Confidence 4433 1221 111122222 234456666542 233444322 22211 123455554443221 Q ss_pred chhhHHHHHHHHhCCCHHH-------------HHHHHHHHHHhhhcCCCCCCCccccccccc Q lcl|NC_019530. 489 SIFSIDYVMRNVLRMSDEE-------------VKEQQTKIAEEKKKGLYPKVQADETGNYSG 537 (560) Q Consensus 489 ky~S~~~i~k~IL~~tDeE-------------I~e~~kqIe~E~~~~~~~~p~~~e~g~~~~ 537 (560) -++|.+-+++ ++.|.+.+ +++..+ ...|+| -...+...+|+++- T Consensus 376 G~~t~NE~R~-~~g~~pi~ggD~~~~~~n~~~~~~~~~---~~~k~~-~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 376 GFLKPNEARS-KEDLPPEAGGDRLLVNGNMLPIDMAGQ---AYLKGG-DTNGEVSKEGNEGN 432 (432) T ss_pred CCcCHHHHHH-HhCCCCCCCCCeEeecccccchhhccc---cccCCC-CCCCCCCCCCCCCC Confidence 3777777764 46664321 000000 000111 00011111111111 No 49 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=98.27 E-value=1.9e-06 Score=52.01 Aligned_cols=413 Identities=12% Similarity=0.101 Sum_probs=184.6 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) |+.++...+.-.+.++. +...+... .++...++-.+...+.+.+ + -+..+++|.|.+||. T Consensus 1 M~~~~r~~~~~~~~~r~------~~~~~~~~--~~~~~~~~~~g~~~~~~~v-----~-------~~~al~~~~v~~~i~ 60 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQ------TSQVIELN--KDDEKLLEWLGISPSTISV-----K-------GKNALKVATVFACIK 60 (432) T ss_pred CChHHHHHHhcCccccC------cccccccC--CchHHHHHHhCCCcCcccc-----c-------hhhhhccHHHHHHHH Confidence 33322221111111111 00111111 1111111101101111111 1 123467899999997 Q ss_pred HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHH-hCCchh----hHHHHHHhhhhhhhhhhheecCCCC Q lcl|NC_019530. 95 IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHL-MDFDNT----AYQKIRKWYVDGRQAYHVIVDPTNK 169 (560) Q Consensus 95 eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l-l~f~~~----~~~~~R~wYvDGr~~~hkiiD~~~~ 169 (560) -|.+.+.- -|+.+.-+..++ .++........+|+. =+-... .+.++..+++.|.-|..++.|. T Consensus 61 ~ia~~ia~-----lp~~~~~~~~~~----~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~--- 128 (432) T protein:vir:10 61 ILSESVSK-----LPLKIYQEDEYG----IQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDR--- 128 (432) T ss_pred HHHHhhcc-----CceEEEEecCCc----eeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECC--- Confidence 77665542 233332222111 111222233333332 112223 3445555667798888777654 Q ss_pred ccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeec Q lcl|NC_019530. 170 KGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDS 249 (560) Q Consensus 170 k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hs 249 (560) ..-+.+|.+|+|..|..++ +..+. ......-||.+++ .+....++.+.|.|..- T Consensus 129 ~G~~~~L~~i~~~~v~v~~-----d~~~~--~~~~~~~~y~~~~-------------------~g~~~~~~~~eiih~r~ 182 (432) T protein:vir:10 129 KGKVQALWPIDASKVTVYI-----DDVGL--LNSKTKMWYVVNT-------------------GGQQRVLKPEEILHFKN 182 (432) T ss_pred CCcEEEEEEEcCceeEEEE-----cCccc--ccccceEEEEEec-------------------CCeEEEEccccEEEecC Confidence 3338999999998887643 11111 1111122233321 12345688888888864 Q ss_pred ccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCc Q lcl|NC_019530. 250 GEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGK 329 (560) Q Consensus 250 Gl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGe 329 (560) +. ..+.-..+|.|..|++++.....+++...=+----+.-+-|..++ +.|.+..+++..+.+...|.. T Consensus 183 ~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g---------- 250 (432) T protein:vir:10 183 GI-TLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-GDLNEDAKKVFRENFESMSSG---------- 250 (432) T ss_pred CC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhcc---------- Confidence 42 334455779999999999999888887665544445556777776 467666665554444433321 Q ss_pred ccCcccchhhHhhhcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHH Q lcl|NC_019530. 330 ITGNAHLMGIAEDYWLPRREGQNATEIATVGG-GNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQE 408 (560) Q Consensus 330 v~d~~~~msmlEDywLpRReGgrgTEIsTLpG-g~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRD 408 (560) +++..+.+ .++ .|.++..|.- ...+.-++-.++..+.+.++++||...|+..+..+ .+++.+..+. T Consensus 251 ~~n~~~~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~--~s~~e~~~~~ 317 (432) T protein:vir:10 251 LQNSHRIA-LMP----------VGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT--LNNIEQQQQQ 317 (432) T ss_pred cccCCcce-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHHHH Confidence 11221222 222 2455666532 11222234456778999999999999996433322 2233333222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhc Q lcl|NC_019530. 409 ELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVG 488 (560) Q Consensus 409 ElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vG 488 (560) |.++ .|+- +...+.+.| -+.++++.+|.. -+.+.|..+ +|... -+..|++.++.+-.- T Consensus 318 ---~~~~--~l~P-~~~~ie~~l-----n~kLl~~~~~~~----g~~~~fd~~----~l~~~-d~~~~~~~~~~~~~~-- 375 (432) T protein:vir:10 318 ---FYTD--TLQA-TLTMYEQEM-----TYKLFLDSELDK----GFYSKFNVD----AILRA-DIKTRYEAYRTGIQG-- 375 (432) T ss_pred ---HHHH--HHHH-HHHHHHHHH-----HHhhcChhhcCC----CcEEEeech----hhhcC-CHHHHHHHHHHHHhC-- Confidence 4433 1221 111122222 234456666542 233444322 22211 123455554443221 Q ss_pred chhhHHHHHHHHhCCCHHH-------------HHHHHHHHHHhhhcCCCCCCCccccccccc Q lcl|NC_019530. 489 SIFSIDYVMRNVLRMSDEE-------------VKEQQTKIAEEKKKGLYPKVQADETGNYSG 537 (560) Q Consensus 489 ky~S~~~i~k~IL~~tDeE-------------I~e~~kqIe~E~~~~~~~~p~~~e~g~~~~ 537 (560) -++|.+-+++ ++.|.+.+ +++..+ ...|+| -...+...+|+++- T Consensus 376 G~~t~NE~R~-~~g~~pi~ggD~~~~~~n~~~~~~~~~---~~~k~~-~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 376 GFLKPNEARS-KEDLPPEAGGDRLLVNGNMLPIDMAGQ---AYLKGG-DTNGEVSKEGNEGN 432 (432) T ss_pred CCcCHHHHHH-HhCCCCCCCCCeEeecccccchhhccc---cccCCC-CCCCCCCCCCCCCC Confidence 3777777764 46664321 000000 000111 00011111111111 No 50 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=98.27 E-value=1.9e-06 Score=52.01 Aligned_cols=413 Identities=12% Similarity=0.101 Sum_probs=184.6 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) |+.++...+.-.+.++. +...+... .++...++-.+...+.+.+ + -+..+++|.|.+||. T Consensus 1 M~~~~r~~~~~~~~~r~------~~~~~~~~--~~~~~~~~~~g~~~~~~~v-----~-------~~~al~~~~v~~~i~ 60 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQ------TSQVIELN--KDDEKLLEWLGISPSTISV-----K-------GKNALKVATVFACIK 60 (432) T ss_pred CChHHHHHHhcCccccC------cccccccC--CchHHHHHHhCCCcCcccc-----c-------hhhhhccHHHHHHHH Confidence 33322221111111111 00111111 1111111101101111111 1 123467899999997 Q ss_pred HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHH-hCCchh----hHHHHHHhhhhhhhhhhheecCCCC Q lcl|NC_019530. 95 IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHL-MDFDNT----AYQKIRKWYVDGRQAYHVIVDPTNK 169 (560) Q Consensus 95 eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l-l~f~~~----~~~~~R~wYvDGr~~~hkiiD~~~~ 169 (560) -|.+.+.- -|+.+.-+..++ .++........+|+. =+-... .+.++..+++.|.-|..++.|. T Consensus 61 ~ia~~ia~-----lp~~~~~~~~~~----~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~--- 128 (432) T protein:vir:10 61 ILSESVSK-----LPLKIYQEDEYG----IQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDR--- 128 (432) T ss_pred HHHHhhcc-----CceEEEEecCCc----eeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECC--- Confidence 77665542 233332222111 111222233333332 112223 3445555667798888777654 Q ss_pred ccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeec Q lcl|NC_019530. 170 KGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDS 249 (560) Q Consensus 170 k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hs 249 (560) ..-+.+|.+|+|..|..++ +..+. ......-||.+++ .+....++.+.|.|..- T Consensus 129 ~G~~~~L~~i~~~~v~v~~-----d~~~~--~~~~~~~~y~~~~-------------------~g~~~~~~~~eiih~r~ 182 (432) T protein:vir:10 129 KGKVQALWPIDASKVTVYI-----DDVGL--LNSKTKMWYVVNT-------------------GGQQRVLKPEEILHFKN 182 (432) T ss_pred CCcEEEEEEEcCceeEEEE-----cCccc--ccccceEEEEEec-------------------CCeEEEEccccEEEecC Confidence 3338999999998887643 11111 1111122233321 12345688888888864 Q ss_pred ccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCc Q lcl|NC_019530. 250 GEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGK 329 (560) Q Consensus 250 Gl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGe 329 (560) +. ..+.-..+|.|..|++++.....+++...=+----+.-+-|..++ +.|.+..+++..+.+...|.. T Consensus 183 ~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g---------- 250 (432) T protein:vir:10 183 GI-TLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-GDLNEDAKKVFRENFESMSSG---------- 250 (432) T ss_pred CC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhcc---------- Confidence 42 334455779999999999999888887665544445556777776 467666665554444433321 Q ss_pred ccCcccchhhHhhhcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHH Q lcl|NC_019530. 330 ITGNAHLMGIAEDYWLPRREGQNATEIATVGG-GNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQE 408 (560) Q Consensus 330 v~d~~~~msmlEDywLpRReGgrgTEIsTLpG-g~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRD 408 (560) +++..+.+ .++ .|.++..|.- ...+.-++-.++..+.+.++++||...|+..+..+ .+++.+..+. T Consensus 251 ~~n~~~~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~--~s~~e~~~~~ 317 (432) T protein:vir:10 251 LQNSHRIA-LMP----------VGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT--LNNIEQQQQQ 317 (432) T ss_pred cccCCcce-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHHHH Confidence 11221222 222 2455666532 11222234456778999999999999996433322 2233333222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhc Q lcl|NC_019530. 409 ELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVG 488 (560) Q Consensus 409 ElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vG 488 (560) |.++ .|+- +...+.+.| -+.++++.+|.. -+.+.|..+ +|... -+..|++.++.+-.- T Consensus 318 ---~~~~--~l~P-~~~~ie~~l-----n~kLl~~~~~~~----g~~~~fd~~----~l~~~-d~~~~~~~~~~~~~~-- 375 (432) T protein:vir:10 318 ---FYTD--TLQA-TLTMYEQEM-----TYKLFLDSELDK----GFYSKFNVD----AILRA-DIKTRYEAYRTGIQG-- 375 (432) T ss_pred ---HHHH--HHHH-HHHHHHHHH-----HHhhcChhhcCC----CcEEEeech----hhhcC-CHHHHHHHHHHHHhC-- Confidence 4433 1221 111122222 234456666542 233444322 22211 123455554443221 Q ss_pred chhhHHHHHHHHhCCCHHH-------------HHHHHHHHHHhhhcCCCCCCCccccccccc Q lcl|NC_019530. 489 SIFSIDYVMRNVLRMSDEE-------------VKEQQTKIAEEKKKGLYPKVQADETGNYSG 537 (560) Q Consensus 489 ky~S~~~i~k~IL~~tDeE-------------I~e~~kqIe~E~~~~~~~~p~~~e~g~~~~ 537 (560) -++|.+-+++ ++.|.+.+ +++..+ ...|+| -...+...+|+++- T Consensus 376 G~~t~NE~R~-~~g~~pi~ggD~~~~~~n~~~~~~~~~---~~~k~~-~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 376 GFLKPNEARS-KEDLPPEAGGDRLLVNGNMLPIDMAGQ---AYLKGG-DTNGEVSKEGNEGN 432 (432) T ss_pred CCcCHHHHHH-HhCCCCCCCCCeEeecccccchhhccc---cccCCC-CCCCCCCCCCCCCC Confidence 3777777764 46664321 000000 000111 00011111111111 No 51 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=98.24 E-value=2.2e-06 Score=51.61 Aligned_cols=404 Identities=10% Similarity=0.024 Sum_probs=189.5 Q ss_pred eeccccccchhhHHHHHHHHHHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhC Q lcl|NC_019530. 61 YNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMD 140 (560) Q Consensus 61 ~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~ 140 (560) +-+.......+ .-++++...=+.-+|+..++ .++.+- +...+.+ ..+....|++-.+ T Consensus 1 ~l~~~~~~~~~-------~~~~~~v~n~~~~ivd~~~~-~l~~~g------f~~~d~~---------~~~~~~~i~~~N~ 57 (434) T protein:vir:98 1 MLPKNAEQAFL-------DFQRKARTNFCGLIANASVH-RLLALG------VTGPDGE---------PDTRASRWWQANR 57 (434) T ss_pred CCCCCccHHHH-------HhhhhhhccchHHHHHHHHh-hhccCc------eecCCCc---------hHHHHHHHHHhcC Confidence 11112221111 11123333455666665554 222221 1222222 1223344666678 Q ss_pred CchhhHHHHHHhhhhhhhhhhheecCCC---CccCeeEeEEeCccccchhhhhcccCCcccccccc------ccchheee Q lcl|NC_019530. 141 FDNTAYQKIRKWYVDGRQAYHVIVDPTN---KKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIE------SVTLKYYY 211 (560) Q Consensus 141 f~~~~~~~~R~wYvDGr~~~hkiiD~~~---~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~------~~~e~~~Y 211 (560) |+....++++.-++.||-|+..-.|+.. ..++-..++.+||+.+=.++.-......-++.... .....++| T Consensus 58 ~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~~~~~~~~~~~~ 137 (434) T protein:vir:98 58 LDSRQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDIDGFGYARVFFD 137 (434) T ss_pred hhHHHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEeccCCceEEEEEEe Confidence 9999999999999999999865554321 11223346778887765443211000001111110 01111111 Q ss_pred cCcccccccccccccccccccccc--------c------eecchhceeeeecccccCCCCcchhhhHHHHHHHHHH-HHH Q lcl|NC_019530. 212 NPNYNRNQFTGQSGTSQNFQPSQQ--------E------LVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNL-VTT 276 (560) Q Consensus 212 ~p~~~~~~~~g~~~~~~~~~~~~~--------~------~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL-~m~ 276 (560) + .....+.....+......... . =++| .|.|++.-..+ +...|=++..+....-+ +++ T Consensus 138 ~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vP--vv~f~N~~~~~---~~g~sd~e~vi~liDa~~~~~ 210 (434) T protein:vir:98 138 D--TSFPYRTRERTGARLPWGPDSWVYTGTADSGDVHDLGGMQ--LVEFARMPDLG---EDPEPEFAGVLDIQDRVNLGI 210 (434) T ss_pred C--cEEEEEEeeccccccccccccceecccccccccCCCCccc--eEEeccCCCcC---cCCcchhhhHHHHHHHHHHHH Confidence 1 111111111000000000000 0 0111 12244431111 12334444433333322 345 Q ss_pred HHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccce Q lcl|NC_019530. 277 EDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEI 356 (560) Q Consensus 277 EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEI 356 (560) -+.+++-+..-.|.|-+. |--+... -|...+.+.....+..-....|+.- +-++++ T Consensus 211 s~~~~~~~~~a~p~~~i~----G~~~~~~-----------------~~~~~~~~~~~~~~~~~~~~i~~~~---~~~~~~ 266 (434) T protein:vir:98 211 LNRMAASRFSGFRQKWIK----GHKFAKR-----------------TDPATGMTVVDQPFVPSPSAVWASE---GENTQF 266 (434) T ss_pred HHHHHHHHHhcchhhhhc----CCCcccc-----------------cccccccchhhhhhhccccccccCC---CCCceE Confidence 567777777777766553 2111100 0112222222222222222345542 234667 Q ss_pred eecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 357 ATVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQL 435 (560) Q Consensus 357 sTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qL 435 (560) ..+++.. ++. ++-++-.-..+....++|.+-|..+. +.+.+..|.-.+...-.-+.+.|+.|..-+..+++.-+ T Consensus 267 ~q~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~~~~~~----~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~ 341 (434) T protein:vir:98 267 GQLDATD-LSGFLKEHASDVRDMLTISQTPTYLYATDL----VNISADTIGALDILHVAKVREHIASFSEGLESVLALAA 341 (434) T ss_pred EEecCcc-hHHHHHHHHHHHHHHhcccCCCHHHhcccc----CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7787643 333 33456667788888999988886431 22234566666677778888899999999999988777 Q ss_pred HhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHH Q lcl|NC_019530. 436 ILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKI 515 (560) Q Consensus 436 iLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqI 515 (560) .+.|+ +.+ |. .+.+.|..-. .. -+.+.++++..+..- | +|.++++ +.|.++++||+++.++. T Consensus 342 ~~~g~-~~~-~~-----~~~v~w~~~~-~~------s~~~~ada~~kl~~~-g--~~~e~~~-~~lg~~~~e~~r~~~e~ 403 (434) T protein:vir:98 342 AQAGV-PED-YT-----EAEVRWANPA-HV------TMAVKADAATKLKSI-G--YPLDVIA-EELDESPARVRRIVAGA 403 (434) T ss_pred HhcCC-Chh-he-----eeeEEecCCC-CC------CHHHHHHHHHHHHhc-C--CcHHHHH-HhCCCCHHHHHHHHHHH Confidence 77786 433 31 2556664422 11 123455666665432 2 6888877 55899999999877766 Q ss_pred HHhhhcC-----CCCCCCccccccccccccCCCCCCcccccccCCccc Q lcl|NC_019530. 516 AEEKKKG-----LYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDD 558 (560) Q Consensus 516 e~E~~~~-----~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~ 558 (560) +++.... ...+|.+.+.+++ -|.||+ T Consensus 404 ~~~~~~~~~~~~~~~~~~~g~~~~~-----------------~~~~dg 434 (434) T protein:vir:98 404 ASQALLAASLLPAPGAPSAGNVPDS-----------------GGAVDG 434 (434) T ss_pred HHHHHHHHhhhccCCCCCCCCCCcc-----------------cCCCCC Confidence 6653221 1111111111111 122333 No 52 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=98.23 E-value=1.5e-06 Score=52.53 Aligned_cols=441 Identities=10% Similarity=0.083 Sum_probs=200.3 Q ss_pred hhhhhhhhhhhhhhhhcccccchhhhcccccc--ccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhh- Q lcl|NC_019530. 9 FGLFGGGGLVNAKVDTDKLAQKQDELLLTKAT--VVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQ- 85 (560) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~--s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~- 85 (560) .|+|..++ +++..-.......+-+++-.+ -+..| ....+-|+.||+|.. T Consensus 1 m~~~~~ik---~~~~~~~~~~~~~~~~~~i~d~~~i~~~-------------------------~~~~~~i~~~~~~Y~g 52 (505) T protein:vir:79 1 MAFWDTLK---NLFRKGSAAVGMTKSLGQIIDDPRINLP-------------------------ADEVERIARDKRYYMD 52 (505) T ss_pred CchHHHHH---HHHHHhhhhhcchhhhhhhhcccCCCCC-------------------------HHHHHHHHHHHHHhcC Confidence 34444333 333211000000111110000 01111 012233444444432 Q ss_pred -ccchhH-HH---------------HHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHH Q lcl|NC_019530. 86 -QPEIRK-AV---------------DIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQK 148 (560) Q Consensus 86 -~PEvd~-AI---------------~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~ 148 (560) +|.|.. ++ ..|+++..=+=.+ .|+++.+++. ...+..+.+++-.+|.....+. T Consensus 53 ~~~~l~~~~~~~~~~~~~~~slnl~~~i~~~~A~ll~~-e~~~i~~~d~---------~~~e~l~~i~~~n~f~~~~~~~ 122 (505) T protein:vir:79 53 DFKQVTHKNSYGDTQKHELQSVNVTKLASAKLASLIFN-EQCQVTVSDE---------TANDFLDDVFQQNDFYTTFEEK 122 (505) T ss_pred CCccccccccCCCccccceeecchHHHHHHHHHhhhcC-CCceeecCCh---------HHHHHHHHHHHhccHHHHHHHH Confidence 232210 00 0111110000011 1444555432 4466678888888899999999 Q ss_pred HHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCcccccc--cc-------ccchhe------eec- Q lcl|NC_019530. 149 IRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEA--IE-------SVTLKY------YYN- 212 (560) Q Consensus 149 ~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~--~~-------~~~e~~------~Y~- 212 (560) +..+.+-|..+|+..+|..+ ..+.+++|.++=++. .+..+..+. +. ....|| ..+ T Consensus 123 ~e~a~a~G~~~~k~~~D~~~-----~~i~~v~ad~~~P~~----~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~ 193 (505) T protein:vir:79 123 LEEWIALGSGCVRPYVDSGK-----IKLAWATADQVYPLQ----ADTNQVNELAIASRTTEVENHRTIYYTLLEFHQWDH 193 (505) T ss_pred HHHHhhcCCeEEEEEEeCCc-----eEEEEEcCCeeEEEE----EcCCCeEEEEEEEEEEEecCCcceEEEEEEEEEecC Confidence 99999999999999998532 458889998866642 111110011 10 000011 110 Q ss_pred -Cc----ccccccccccccccc---ccccccceecchhceeeeecc--c---c--------cCCCCcchhhhHHHHHHHH Q lcl|NC_019530. 213 -PN----YNRNQFTGQSGTSQN---FQPSQQELVFDDESIVYIDSG--E---E--------PLANGIVPGLLNPAIRPLN 271 (560) Q Consensus 213 -p~----~~~~~~~g~~~~~~~---~~~~~~~~~i~~daI~y~hsG--l---~--------~~~~~~i~syL~~AiK~~N 271 (560) +. ..+...++.+.+..+ .-+.+. .+. +.++|.+.. + + +......+|-++.|+-.+. T Consensus 194 ~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~--~l~-~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id 270 (505) T protein:vir:79 194 GDYVITNELYRSEAAETVGINVPLNSLEQYE--GLE-PQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVID 270 (505) T ss_pred ceEEEEEEEEecCCCCccCcccchhhccccc--ccC-cceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHH Confidence 00 000000000000000 000000 000 111221110 0 0 1112234688888887777 Q ss_pred HHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCC Q lcl|NC_019530. 272 NLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQ 351 (560) Q Consensus 272 qL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGg 351 (560) .|-..=+ -+.|..|.=.+|||. +-.=|++...-....... ..-++| .+.+.+.++.- +++ T Consensus 271 ~lD~~~s--~~~~e~~~g~~~i~v-~~~~l~~~~~~~~~~~~~----~~~~fd------~~~~~y~~~~~-------~~~ 330 (505) T protein:vir:79 271 AINRTHD--QFVDEVKKGQRRLIV-PAEWLKTGSSYGGQASET----HPPMFD------PDETVYQAMYG-------DAS 330 (505) T ss_pred HHHHHHH--HHHHHHHhcccceee-chHHhcccCCCCcccccc----cccCCC------ccceeeeeccC-------CCC Confidence 7764443 355777777777665 211110000000000000 000000 11122222221 122 Q ss_pred CccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 352 NATEIATVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEF 430 (560) Q Consensus 352 rgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~ 430 (560) +.-|+++.+.---.+ .+-+..+.+.+....+++-+-|..++.. -..|+||....-.-..-+.+.|+.|...+.++ T Consensus 331 -~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~---~~TAtei~s~~~~l~~t~~~~~~~~~~al~~l 406 (505) T protein:vir:79 331 -EVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSG---IQTATEVVTNNSQTYQTRSSYITQVEKTIKAL 406 (505) T ss_pred -CCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccc---cchHHHHHHHHhHHHHHHHHHHHHHHHHHHHH Confidence 223778776322222 3457778888899999999999876542 23468887666566667778888888888887 Q ss_pred HHHHHHhccC---CChhHHHH--hhh-hhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCC Q lcl|NC_019530. 431 LRRQLILKGV---TDEKDWNE--KIK-PFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMS 504 (560) Q Consensus 431 Lk~qLiLkgi---it~eew~~--~i~-~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~t 504 (560) ++.-|-+..+ .+...|.. .+. ..+.++|... .+.. +++++- ..+.+++. | ..|.++++++-...| T Consensus 407 i~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~-i~~d-~~~~~~-~~~~~v~~-----G-i~s~e~~l~~~~~~~ 477 (505) T protein:vir:79 407 TYAILELASVPSFYADGQARWTGDVDSLDITINFNDG-VFVD-QESKRA-ADLQAVQA-----Q-VMPKKQFLMRNYGLD 477 (505) T ss_pred HHHHHHHHHHhcccccccccccCCCCceeEEEEeCCC-CCCC-HHHHHH-HHHHHHHc-----C-CCCHHHHHHhcCCCC Confidence 7776654432 11111100 011 1345555432 2221 222221 12222211 3 588899888889999 Q ss_pred HHHHHHHHHHHHHhhhcCCCCCCCccccccc Q lcl|NC_019530. 505 DEEVKEQQTKIAEEKKKGLYPKVQADETGNY 535 (560) Q Consensus 505 DeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~ 535 (560) |+|.+++-++|++|... ..|+....|++ T Consensus 478 eeea~~el~ri~~E~~~---~~p~~~~~gg~ 505 (505) T protein:vir:79 478 EEEADEWLAQIDAENST---AEPEFNQFGGD 505 (505) T ss_pred hHHHHHHHHHHHHhccc---cCCCchhccCC Confidence 99999999999999532 23444444444 No 53 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=98.23 E-value=2.3e-06 Score=51.52 Aligned_cols=460 Identities=9% Similarity=0.020 Sum_probs=224.6 Q ss_pred ccccccCCCC----------CCCCceeEe--ccccceeccccccc-hhhHHHHHHHHHHH-hhccchhHHHHHhhcceee Q lcl|NC_019530. 37 TKATVVALDD----------AQDGSIILQ--GGANTYNYVGVESE-LLSVKTVVEEYQSM-AQQPEIRKAVDIIVNDVVT 102 (560) Q Consensus 37 ~k~~s~~~~~----------~~dG~~~i~--~~~~~~~y~~~e~~-~~~~~~LI~~YR~m-a~~PEvd~AI~eIvneaiv 102 (560) -|..++..|+ ...|+..-. ..+...+..+.++. ..+-..|..+=|.| .++|-+..||+.+++-+|= T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG 80 (530) T protein:vir:38 1 MKIPSLVGPDGKTSLREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQDHIVG 80 (530) T ss_pred CccceeecCccccchHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhhC Confidence 1111111111 111111100 11111122222333 22456688888888 5899999999988887664 Q ss_pred ecCCCceEE------EeeecccchhHHHHHHHHHHHHHHHH----------HhCCchhhHHHHHHhhhhhhhhhhheecC Q lcl|NC_019530. 103 CEEDETPVT------VNLDKVEGISDTVKESITECFKEVMH----------LMDFDNTAYQKIRKWYVDGRQAYHVIVDP 166 (560) Q Consensus 103 ~d~~~~~V~------l~Ld~~~~~s~sik~~I~eeF~~il~----------ll~f~~~~~~~~R~wYvDGr~~~hkiiD~ 166 (560) . +-.|-. |.++.. -.+.+.+.|..+|..+++ .++|..--.-.+|.|.+||..|..+..++ T Consensus 81 ~--Gi~~~~~p~~~~l~~~~~--~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~ 156 (530) T protein:vir:38 81 S--FFRLSYRPSWRYLGINEE--DSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQATWDS 156 (530) T ss_pred C--CceeeeccchhhcCCCHh--HHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEeeecc Confidence 2 322221 333443 367788999999999985 34555666678899999999998777665 Q ss_pred CCCccCe---eEeEEeCccccchhhhhccc-CCcccccccccc--chheeec--Ccccccccccccccccccccccccee Q lcl|NC_019530. 167 TNKKGGI---KKLVMLDSRCIRPVYIVEKA-MREGGIEAIESV--TLKYYYN--PNYNRNQFTGQSGTSQNFQPSQQELV 238 (560) Q Consensus 167 ~~~k~GI---~eLr~LDPr~i~~vr~i~~~-~~~g~~~~~~~~--~e~~~Y~--p~~~~~~~~g~~~~~~~~~~~~~~~~ 238 (560) ..|. ..|..|+|-+|.--...... .=..||++.+.. ..|++|. |....... .......+. T Consensus 157 ---~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~---------~~~~~~~~~ 224 (530) T protein:vir:38 157 ---DSTRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKINDSGAALGYYVSDDGYPGWMAQN---------WTYIPRELP 224 (530) T ss_pred ---CCCCccceEEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEEeeccCCCccccc---------cceeeeeec Confidence 3343 46889999887643221100 001234432222 2455553 22111110 000112355 Q ss_pred cchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHH------------ Q lcl|NC_019530. 239 FDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSA------------ 306 (560) Q Consensus 239 i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KA------------ 306 (560) ++.+.|.|+..-. ...---.+|.|..+++.+.+|.-.+||...-...-|--=-++.=+.+......+ T Consensus 225 v~a~~vlH~f~~~-r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~ 303 (530) T protein:vir:38 225 GGRPSFIHVFEPM-EDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGADNKEQQSK 303 (530) T ss_pred cChhHeEeecccc-CCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccccccccccCCccccccc Confidence 7777788777542 223445679999999999999999999998888777553333222222110000 Q ss_pred -HHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCC Q lcl|NC_019530. 307 -EEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE-MDHVNYFREKLYDALMI 384 (560) Q Consensus 307 -eqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~V 384 (560) ..+... +..+.+..+..-..|.| . .-.-|-+|+.+..+.--+. -+=++...+.+=.+|+| T Consensus 304 ~~~~~~~-~~~~~~~~~~~l~pG~i----------~-------~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi 365 (530) T protein:vir:38 304 LTGWLGE-MAAYYSAAPVRLGGARV----------P-------HLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGV 365 (530) T ss_pred ccccchh-hhhcccccceeccCcee----------e-------ecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCC Confidence 001110 11111211222222211 0 0112445555554433333 34455556666778999 Q ss_pred CccccCCC-CcccccccchhhhhHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHhccCCCh------hHHHHhhhh Q lcl|NC_019530. 385 PKSRLQEE-GSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFF-----MEFLRRQLILKGVTDE------KDWNEKIKP 452 (560) Q Consensus 385 P~sRl~~~-~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if-----~d~Lk~qLiLkgiit~------eew~~~i~~ 452 (560) |-+-|..+ ++.|.. -+--.-+.|-+.+.++|..|..=| .-.|+ ..++.|.+.. +.|. .... T Consensus 366 ~ye~lt~D~s~~nYS-----S~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~-~av~~G~i~~p~~~~~~~~~-~~~a 438 (530) T protein:vir:38 366 SYEQLSRNYSQMSYS-----TARASANESWAYFMGRRKFVASRQACQMFLCWLE-EAIVRRVVTLPSKARFSFQE-ARTA 438 (530) T ss_pred CHHHHhcccccccHH-----HHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHH-HHHHcCCccCCCCCCCCchh-hHHh Confidence 99988655 232321 122235569999999999886544 33344 4577887762 2221 1111 Q ss_pred hc--eeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhh----hcCCCCC Q lcl|NC_019530. 453 FI--KFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEK----KKGLYPK 526 (560) Q Consensus 453 ~I--~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~----~~~~~~~ 526 (560) .. .|..-.=-+.-.+||+.-...+++. -.-|.+-+..+ .+..-+ +..+||++|. +.|+-.+ T Consensus 439 ~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~---------G~~s~~~~~a~-~G~D~~---~v~~q~a~e~~~~~~~Gl~~~ 505 (530) T protein:vir:38 439 WGNANWIGSGRMAIDGLKEVQEAVMLIEA---------GLSTYEKECAK-RGDDYQ---EIFAQQVRESMERRAAGLNPP 505 (530) T ss_pred hhceeeecCCccccChHHHHHHHHHHHHc---------CCCCHHHHHHH-cCCCHH---HHHHHHHHHHHHHHHcCCCCC Confidence 11 2222222334556666655544432 23444444444 333322 3333333332 3454322 Q ss_pred CCccccccccccccCCCCCCcccccccCCcccC Q lcl|NC_019530. 527 VQADETGNYSGSDVSPLKFKPETIPFSGSTDDS 559 (560) Q Consensus 527 p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~ 559 (560) ... ......+...+ +.-|..|+ +.+ T Consensus 506 -~~~-~~~~~~~~~~~-----~~~~~d~~-~~a 530 (530) T protein:vir:38 506 -AWA-AAAFEAGVKKS-----NEEEQDGA-RAA 530 (530) T ss_pred -CCc-ccccCCCCCCC-----CCCCCCCC-CCC Confidence 111 01111111111 11121111 122 No 54 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=98.22 E-value=2.4e-06 Score=51.37 Aligned_cols=467 Identities=11% Similarity=0.082 Sum_probs=191.2 Q ss_pred CCcccchhhhhhhhhhh---hhh--hhhhccc-----------ccc-----hhhhccccccc-cCCCCCCCCceeEeccc Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGL---VNA--KVDTDKL-----------AQK-----QDELLLTKATV-VALDDAQDGSIILQGGA 58 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~---~~~--~~~~~~~-----------~~~-----~~e~~~~k~~s-~~~~~~~dG~~~i~~~~ 58 (560) || -||-++.. ... .+++-++ ..+ +-+|-++++.. -+.| +...+ T Consensus 1 ~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~--------~~~~~ 65 (563) T protein:vir:99 1 MA-------DLFKQFRLGKDYGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEP--------FIEMM 65 (563) T ss_pred Ch-------hhhhhhhcccccccccccceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhh--------hHhhh Confidence 33 22222111 110 1111111 000 01111111111 0111 11111 Q ss_pred c--ceeccccccchhhHHHHHHHHHHHhhccchhHHHHHhhcceeee----cCCCce--EEEeeecccc-hhHHHHHHHH Q lcl|NC_019530. 59 N--TYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVTC----EEDETP--VTVNLDKVEG-ISDTVKESIT 129 (560) Q Consensus 59 ~--~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~----d~~~~~--V~l~Ld~~~~-~s~sik~~I~ 129 (560) . .+.|.. .+.+.+...|-+.=|.++..|-|..||+.+.+.+.++ ....+. ..+.|...+. -++..+..+ T Consensus 66 ~~~~~~~~~-~~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~- 143 (563) T protein:vir:99 66 DTNPEFRDK-RSYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEM- 143 (563) T ss_pred ccccccccc-ccCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhh- Confidence 1 111111 1122233333333456667899999999888875542 122222 3444443321 011111111 Q ss_pred HHHHHHHHHhCCc-----hhhHHH----HHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccc Q lcl|NC_019530. 130 ECFKEVMHLMDFD-----NTAYQK----IRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIE 200 (560) Q Consensus 130 eeF~~il~ll~f~-----~~~~~~----~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~ 200 (560) ...+..+..+..+ ....++ ++..++.|.-|+.+++..+ ...-+.+|++|||..|..++ +..+ . T Consensus 144 ~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd-~~G~~~~L~pl~p~~V~v~~-----~~~g--~ 215 (563) T protein:vir:99 144 KRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKN-NKTKLEKFIAVDPSTIFYAT-----DKKG--K 215 (563) T ss_pred HHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEec-CCCceEEEEEeCCceeEEEE-----CCCC--c Confidence 1222223222221 134444 4445666777777776433 24448999999999888753 1222 1 Q ss_pred cccccchheeecCccccccccccccccccccccccceecchhceee-eeccccc-CCCCcchhhhHHHHHHHHHHHHHHH Q lcl|NC_019530. 201 AIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVY-IDSGEEP-LANGIVPGLLNPAIRPLNNLVTTED 278 (560) Q Consensus 201 ~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y-~hsGl~~-~~~~~i~syL~~AiK~~NqL~m~ED 278 (560) ++....-|+++. .| +....++.+.++| ...=..+ ..+++.+|-|+.|++++.....+|+ T Consensus 216 ~~~~~~~y~~~~--------~g-----------~~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~ 276 (563) T protein:vir:99 216 IIKGGKRFVQVV--------DK-----------RVVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTES 276 (563) T ss_pred eeccceeEEEEe--------CC-----------ceeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHH Confidence 111111111111 11 1223455555443 3221112 1255678899999999999888888 Q ss_pred HHHHHHHhcCccceEEEccCCC-CchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCcccee Q lcl|NC_019530. 279 ATVIYAITRAPEKRAFYLDVGT-LGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIA 357 (560) Q Consensus 279 alVIyRitRAPeRRvFyIDvGn-lpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIs 357 (560) ...=+=---+--+-|.-+..+. |.+..+++.-+.+-..|+. ..+.-+..-.++ .|.+++ T Consensus 277 ~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G----------~~nagk~~~vl~----------~G~~~~ 336 (563) T protein:vir:99 277 FNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSG----------INGSWQIPVVMA----------DDIKFV 336 (563) T ss_pred HHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcc----------ccccccceEEcC----------CCceEE Confidence 7665544446667777777664 4544444333333333331 111112211221 245566 Q ss_pred ecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCc---------ccccccchhhhhHHHHHHHHH-HHHHHHHHHHH Q lcl|NC_019530. 358 TVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGS---------INIGGSNLAEITQEELRFSKF-CAGLRRRYSHF 426 (560) Q Consensus 358 TLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~---------~~l~~~~~~eItRDElkF~Kf-i~rLr~rfs~i 426 (560) .|--...-.+ ++-.+|..+.+.++.+||...|+.... .++.++++.+..+. |... +.-+..++... T Consensus 337 ~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~---f~~~tL~P~l~~ie~~ 413 (563) T protein:vir:99 337 NMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQ---SQNKGLQPLLRFIEDL 413 (563) T ss_pred eccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHHH---HHHHHHHHHHHHHHHH Confidence 6643222222 455567889999999999999964322 12222332222222 4433 23333333333 Q ss_pred HHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHH Q lcl|NC_019530. 427 FMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDE 506 (560) Q Consensus 427 f~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDe 506 (560) | -+.|+ +..+ .++.++|.+...= .|.+.++...-...-++|..-+++ .+.|..- T Consensus 414 l----n~~L~-----~~~~------~~~~~~f~r~D~~----------~~~e~~~~~~~~~~G~lT~NE~R~-~~gl~Pi 467 (563) T protein:vir:99 414 V----NRHII-----SEYG------DKYTFQFVGGDTK----------SATDKLNILKLETQIFKTVNEARE-EQGKKPI 467 (563) T ss_pred H----Hhhhc-----hhcc------cccEEEeccCCHH----------HHHHHHHHHHHhcCCccCHHHHHH-HhCCCCC Confidence 3 22222 2211 2467777766432 233333222222224677777774 4676543 Q ss_pred H---HH------------HHHHHHHHhh------------hcCC-CCCCCcccccccc---ccccCCCCCCcccccccCC Q lcl|NC_019530. 507 E---VK------------EQQTKIAEEK------------KKGL-YPKVQADETGNYS---GSDVSPLKFKPETIPFSGS 555 (560) Q Consensus 507 E---I~------------e~~kqIe~E~------------~~~~-~~~p~~~e~g~~~---~~~~~~~~~~p~~~~~~~~ 555 (560) + +- +..++.+.+. .++. -+++..+..+... .+..+++.-.|..-|--.| T Consensus 468 ~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 547 (563) T protein:vir:99 468 EGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTS 547 (563) T ss_pred CCcceeecccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCCccccccccccccccccccccCc Confidence 2 00 0000000000 0000 0000000000000 1111222222222222222 Q ss_pred cccCC Q lcl|NC_019530. 556 TDDSI 560 (560) Q Consensus 556 ~~~~~ 560 (560) ++.-- T Consensus 548 ~~~~~ 552 (563) T protein:vir:99 548 NKGQG 552 (563) T ss_pred ccccc Confidence 22211 No 55 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=98.22 E-value=2.4e-06 Score=51.37 Aligned_cols=467 Identities=11% Similarity=0.082 Sum_probs=191.2 Q ss_pred CCcccchhhhhhhhhhh---hhh--hhhhccc-----------ccc-----hhhhccccccc-cCCCCCCCCceeEeccc Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGL---VNA--KVDTDKL-----------AQK-----QDELLLTKATV-VALDDAQDGSIILQGGA 58 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~---~~~--~~~~~~~-----------~~~-----~~e~~~~k~~s-~~~~~~~dG~~~i~~~~ 58 (560) || -||-++.. ... .+++-++ ..+ +-+|-++++.. -+.| +...+ T Consensus 1 ~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~--------~~~~~ 65 (563) T protein:vir:95 1 MA-------DLFKQFRLGKDYGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEP--------FIEMM 65 (563) T ss_pred Ch-------hhhhhhhcccccccccccceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhh--------hHhhh Confidence 33 22222111 110 1111111 000 01111111111 0111 11111 Q ss_pred c--ceeccccccchhhHHHHHHHHHHHhhccchhHHHHHhhcceeee----cCCCce--EEEeeecccc-hhHHHHHHHH Q lcl|NC_019530. 59 N--TYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVTC----EEDETP--VTVNLDKVEG-ISDTVKESIT 129 (560) Q Consensus 59 ~--~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~----d~~~~~--V~l~Ld~~~~-~s~sik~~I~ 129 (560) . .+.|.. .+.+.+...|-+.=|.++..|-|..||+.+.+.+.++ ....+. ..+.|...+. -++..+..+ T Consensus 66 ~~~~~~~~~-~~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~- 143 (563) T protein:vir:95 66 DTNPEFRDK-RSYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEM- 143 (563) T ss_pred ccccccccc-ccCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhh- Confidence 1 111111 1122233333333456667899999999888875542 122222 3444443321 011111111 Q ss_pred HHHHHHHHHhCCc-----hhhHHH----HHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccc Q lcl|NC_019530. 130 ECFKEVMHLMDFD-----NTAYQK----IRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIE 200 (560) Q Consensus 130 eeF~~il~ll~f~-----~~~~~~----~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~ 200 (560) ...+..+..+..+ ....++ ++..++.|.-|+.+++..+ ...-+.+|++|||..|..++ +..+ . T Consensus 144 ~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd-~~G~~~~L~pl~p~~V~v~~-----~~~g--~ 215 (563) T protein:vir:95 144 KRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKN-NKTKLEKFIAVDPSTIFYAT-----DKKG--K 215 (563) T ss_pred HHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEec-CCCceEEEEEeCCceeEEEE-----CCCC--c Confidence 1222223222221 134444 4445666777777776433 24448999999999888753 1222 1 Q ss_pred cccccchheeecCccccccccccccccccccccccceecchhceee-eeccccc-CCCCcchhhhHHHHHHHHHHHHHHH Q lcl|NC_019530. 201 AIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVY-IDSGEEP-LANGIVPGLLNPAIRPLNNLVTTED 278 (560) Q Consensus 201 ~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y-~hsGl~~-~~~~~i~syL~~AiK~~NqL~m~ED 278 (560) ++....-|+++. .| +....++.+.++| ...=..+ ..+++.+|-|+.|++++.....+|+ T Consensus 216 ~~~~~~~y~~~~--------~g-----------~~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~ 276 (563) T protein:vir:95 216 IIKGGKRFVQVV--------DK-----------RVVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTES 276 (563) T ss_pred eeccceeEEEEe--------CC-----------ceeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHH Confidence 111111111111 11 1223455555443 3221112 1255678899999999999888888 Q ss_pred HHHHHHHhcCccceEEEccCCC-CchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCcccee Q lcl|NC_019530. 279 ATVIYAITRAPEKRAFYLDVGT-LGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIA 357 (560) Q Consensus 279 alVIyRitRAPeRRvFyIDvGn-lpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIs 357 (560) ...=+=---+--+-|.-+..+. |.+..+++.-+.+-..|+. ..+.-+..-.++ .|.+++ T Consensus 277 ~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G----------~~nagk~~~vl~----------~G~~~~ 336 (563) T protein:vir:95 277 FNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSG----------INGSWQIPVVMA----------DDIKFV 336 (563) T ss_pred HHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcc----------ccccccceEEcC----------CCceEE Confidence 7665544446667777777664 4544444333333333331 111112211221 245566 Q ss_pred ecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCc---------ccccccchhhhhHHHHHHHHH-HHHHHHHHHHH Q lcl|NC_019530. 358 TVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGS---------INIGGSNLAEITQEELRFSKF-CAGLRRRYSHF 426 (560) Q Consensus 358 TLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~---------~~l~~~~~~eItRDElkF~Kf-i~rLr~rfs~i 426 (560) .|--...-.+ ++-.+|..+.+.++.+||...|+.... .++.++++.+..+. |... +.-+..++... T Consensus 337 ~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~---f~~~tL~P~l~~ie~~ 413 (563) T protein:vir:95 337 NMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQ---SQNKGLQPLLRFIEDL 413 (563) T ss_pred eccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHHH---HHHHHHHHHHHHHHHH Confidence 6643222222 455567889999999999999964322 12222332222222 4433 23333333333 Q ss_pred HHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHH Q lcl|NC_019530. 427 FMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDE 506 (560) Q Consensus 427 f~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDe 506 (560) | -+.|+ +..+ .++.++|.+...= .|.+.++...-...-++|..-+++ .+.|..- T Consensus 414 l----n~~L~-----~~~~------~~~~~~f~r~D~~----------~~~e~~~~~~~~~~G~lT~NE~R~-~~gl~Pi 467 (563) T protein:vir:95 414 V----NRHII-----SEYG------DKYTFQFVGGDTK----------SATDKLNILKLETQIFKTVNEARE-EQGKKPI 467 (563) T ss_pred H----Hhhhc-----hhcc------cccEEEeccCCHH----------HHHHHHHHHHHhcCCccCHHHHHH-HhCCCCC Confidence 3 22222 2211 2467777766432 233333222222224677777774 4676543 Q ss_pred H---HH------------HHHHHHHHhh------------hcCC-CCCCCcccccccc---ccccCCCCCCcccccccCC Q lcl|NC_019530. 507 E---VK------------EQQTKIAEEK------------KKGL-YPKVQADETGNYS---GSDVSPLKFKPETIPFSGS 555 (560) Q Consensus 507 E---I~------------e~~kqIe~E~------------~~~~-~~~p~~~e~g~~~---~~~~~~~~~~p~~~~~~~~ 555 (560) + +- +..++.+.+. .++. -+++..+..+... .+..+++.-.|..-|--.| T Consensus 468 ~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 547 (563) T protein:vir:95 468 EGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTS 547 (563) T ss_pred CCcceeecccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCCccccccccccccccccccccCc Confidence 2 00 0000000000 0000 0000000000000 1111222222222222222 Q ss_pred cccCC Q lcl|NC_019530. 556 TDDSI 560 (560) Q Consensus 556 ~~~~~ 560 (560) ++.-- T Consensus 548 ~~~~~ 552 (563) T protein:vir:95 548 NKGQG 552 (563) T ss_pred ccccc Confidence 22211 No 56 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=98.21 E-value=2.6e-06 Score=51.24 Aligned_cols=451 Identities=10% Similarity=0.075 Sum_probs=191.8 Q ss_pred CCcccc---hhhhhhh--------------hhhhhhhhhhhcccccchhhhcccccc-ccCCCCCCCCceeEecccccee Q lcl|NC_019530. 1 MAGYGR---GFFGLFG--------------GGGLVNAKVDTDKLAQKQDELLLTKAT-VVALDDAQDGSIILQGGANTYN 62 (560) Q Consensus 1 ~~~~~~---~~~~~~~--------------~~~~~~~~~~~~~~~~~~~e~~~~k~~-s~~~~~~~dG~~~i~~~~~~~~ 62 (560) -..-|. ||--|.- ..+..++-++.+++..+...+..+... ++ ...+.....-.+ T Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~kk~~i~~pfkkk~~~~~~d~f---~~s~es~s~vts----- 104 (945) T protein:vir:10 33 SLSRGKDYPGFKPLLTYRALAWNSTVVYSIIIFRKNQVLKKEKIIVPYNHQEPPFKFNLF---EYSPESLMYLPS----- 104 (945) T ss_pred hhhcccCCCCcchhhhhhhhhccceeeeeeeeehhhhHHHhhcccccccccccchhhhhh---hccCccceeccc----- Confidence 111121 2222211 112223333555554444333221000 00 000000000000 Q ss_pred ccccccchhhHHHHHHHHHHH-hhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhC- Q lcl|NC_019530. 63 YVGVESELLSVKTVVEEYQSM-AQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMD- 140 (560) Q Consensus 63 y~~~e~~~~~~~~LI~~YR~m-a~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~- 140 (560) ..+-+ ..-+++.++.+ +++|-|..||+-|-+.+.- -|+.+-=+..++-.+...+++ .....+.++|. T Consensus 105 ls~pd-----af~~vnVs~~~AlknsaV~scI~~IA~sIAs-----LPlklYrr~edG~~~~~~kk~-~~~hpL~~LL~r 173 (945) T protein:vir:10 105 ISDPD-----AFFLINLFRKYRFNNDSKLIKVSEIPKKLTS-----KELEIYKHIEDKHVNYYLKRI-RDARNILEFLER 173 (945) T ss_pred ccCcc-----ceeeehhhhhhhhccHHHHHHHHHHHhhhcc-----CceEEEEecccCccccccccc-ccchHHHHHHhC Confidence 00000 00122333333 5788899999777665532 233331111111111111111 12223334442 Q ss_pred --Cchhh--------HHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchhee Q lcl|NC_019530. 141 --FDNTA--------YQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYY 210 (560) Q Consensus 141 --f~~~~--------~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~ 210 (560) =.-.+ +.+++.+++.|.-|..++.|. ...+.+|++|||.+|..++ +.+|+. ..+|. T Consensus 174 PNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd~---~G~ii~L~pLdPs~Vti~~-----ddDG~~------~y~Yv 239 (945) T protein:vir:10 174 PDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRDE---QGNLVAITPVDGTTIKPIL-----SEDTGI------VVGYV 239 (945) T ss_pred CCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEECC---CCcEEEEEEECCcceEEEE-----cCCCcE------EEEEE Confidence 11222 235577888999999877653 3347899999999887642 222321 01111 Q ss_pred ecCccccccccccccccccccccccceecchhc-eeeeeccccc-CCCCcchhhhHHHHHHHHHHHHHHHH-HHHHHHhc Q lcl|NC_019530. 211 YNPNYNRNQFTGQSGTSQNFQPSQQELVFDDES-IVYIDSGEEP-LANGIVPGLLNPAIRPLNNLVTTEDA-TVIYAITR 287 (560) Q Consensus 211 Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~da-I~y~hsGl~~-~~~~~i~syL~~AiK~~NqL~m~EDa-lVIyRitR 287 (560) +. .+| +....++.+. |+|.+.-..+ ...|+.+|-|..|.+.+.....++.. .-.|+--. T Consensus 240 ~~-------idG-----------~~~~~v~a~DvIlhirn~s~DG~~~GyGlSPIeaa~~aI~~alAaek~aar~FskNG 301 (945) T protein:vir:10 240 QE-------VDG-----------AIVAHFDKRDVVLFRQNLTPDVYMYGYSLPPIEILYKVILSDIFIDKGNLDYYRKGG 301 (945) T ss_pred Ee-------cCC-----------ceEEEecCCceEEEeccCCCCcccccCCchHHHHHHHHHHHHHHHHHHHHHHHHhCC Confidence 11 011 1223444444 5555432111 12345678899999988776666654 44444455 Q ss_pred CccceEEEccCCCCchHH--------HHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeec Q lcl|NC_019530. 288 APEKRAFYLDVGTLGKKS--------AEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATV 359 (560) Q Consensus 288 APeRRvFyIDvGnlpk~K--------AeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTL 359 (560) |.-+-+..++.++....| +.+-+++.+.+.. .|. .+ ..-+ ++ ..|.+++.| T Consensus 302 a~PsGILsvkg~~~~d~k~~~~LseEq~erlKe~wee~~--------sG~-Nn-G~pi-VL----------deGmef~pL 360 (945) T protein:vir:10 302 SIPEGILAIEPPSYKEGDIYPQLSREQLESIQRQLQAIM--------MGD-YT-QVPI-LS----------GGKFTWIDF 360 (945) T ss_pred CccceEEEecCccccccccccccCHHHHHHHHHHHHHHh--------CCc-cc-ccce-ec----------CCCceEEEc Confidence 777888988876543222 2222222222221 121 11 1111 22 236777777 Q ss_pred CCCCCcchH---HHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 360 GGGNQLGEM---DHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKF-CAGLRRRYSHFFMEFLRRQL 435 (560) Q Consensus 360 pGg~nLgei---~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kf-i~rLr~rfs~if~d~Lk~qL 435 (560) .. +..++ +-.+|..+...++.+||...|+..++. .++++.+..+ -|..+ +..+..++...+...| T Consensus 361 s~--s~~DaQfLEsrkfs~eeIArAFGVPP~lLG~~e~s--t~SNiEqq~~---~Fv~~tL~Pil~~IEqeLNrkL---- 429 (945) T protein:vir:10 361 KG--KRRDMQFKELAEFVARKICAVYQVSPQDVGILEGS--NKATAEVMAS---LTKAKGLEPLMATISKGFDEVV---- 429 (945) T ss_pred cC--ChhHHHHHHHHHHHHHHHHHHhCCCHHHcccCCCC--CcchHHHHHH---HHHHHHHHHHHHHHHHHHHHhc---- Confidence 54 33333 445667788999999999999755442 3344343333 36544 6666666555544332 Q ss_pred HhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHH Q lcl|NC_019530. 436 ILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKI 515 (560) Q Consensus 436 iLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqI 515 (560) .+.. -...+.++|..+..-.. ..|++.++.+-. .-++|.+-+++ .+++.+-+ --++.+ T Consensus 430 -----l~~~-----eg~~i~fdFd~ldl~D~-------ksraEal~kli~--sGiLTiNEvRe-~lGLpPIe--GGD~ll 487 (945) T protein:vir:10 430 -----SEFR-----NEKDIKLWFKEDDLEKE-------RDWWNIIQGQLN--TGFRSINEARM-EKGLEPVP--WGDVPF 487 (945) T ss_pred -----cccc-----cCceeEEEecchhccCH-------HHHHHHHHHHHh--CCCcCHHHHHH-HhCCCCCC--Ccceee Confidence 1211 12346777776653221 245555544321 23778888774 46664421 000000 Q ss_pred HHhhhcCCCCCCCcc-cccccc-ccccCCCCCCcccccccCC----------cccCC Q lcl|NC_019530. 516 AEEKKKGLYPKVQAD-ETGNYS-GSDVSPLKFKPETIPFSGS----------TDDSI 560 (560) Q Consensus 516 e~E~~~~~~~~p~~~-e~g~~~-~~~~~~~~~~p~~~~~~~~----------~~~~~ 560 (560) -. . +. + .|..+ ..+..+ ..+....+..++..|-.|+ ++++. T Consensus 488 i~-~-nn-~-~P~d~~~ka~~ga~p~q~aq~~~dqp~~kGGe~dEns~~psE~kda~ 540 (945) T protein:vir:10 488 SG-L-RN-W-KPEDEQAKAQQGAMPPQLAQAMADQPSQQGGGVDENSSVPSEQKNAG 540 (945) T ss_pred ec-c-cc-c-cccccccccccCCCCcccccCCCCCCCCCCCCCCCCCCCCCcccchH Confidence 00 0 00 0 00000 000000 0000111111222222222 22222 No 57 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=98.21 E-value=1.9e-06 Score=51.89 Aligned_cols=450 Identities=13% Similarity=0.075 Sum_probs=191.6 Q ss_pred ccccCCCCC--CCCceeEec--c----------ccceeccccccchhhHHH-HHHHHHHH-hhccchhHHHHHhhcceee Q lcl|NC_019530. 39 ATVVALDDA--QDGSIILQG--G----------ANTYNYVGVESELLSVKT-VVEEYQSM-AQQPEIRKAVDIIVNDVVT 102 (560) Q Consensus 39 ~~s~~~~~~--~dG~~~i~~--~----------~~~~~y~~~e~~~~~~~~-LI~~YR~m-a~~PEvd~AI~eIvneaiv 102 (560) .+++++-.. ++....+.. . -...+|.+ ...+..... .=+.+|.+ ....=+.-+|+..+.-. + T Consensus 1 ~~~~~~~~~e~~~~~~~~~~l~~~~~~~~~r~~~l~~YY~G-~~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~~l-~ 78 (486) T protein:vir:42 1 MTAPLPGMEEIEDPAVVREEMISAFEDASKDLASNTSYYDA-ERRPEAIGVTVPREMQQLLAHVGYPRLYVDSVAERQ-A 78 (486) T ss_pred CCCCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhcc-cCcchhcccccchhHhhhhhccchHHHHHHHHHhhh-c Confidence 222221111 111101110 0 00111111 111110000 00011111 11122233333222211 1 Q ss_pred ecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCC----CCccCeeEeEE Q lcl|NC_019530. 103 CEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPT----NKKGGIKKLVM 178 (560) Q Consensus 103 ~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~----~~k~GI~eLr~ 178 (560) ++.+.+.+.+ ...+++..|++-.+|+....++++.-++.||-|.+.-.|.. ...+|...++. T Consensus 79 ------~~g~~~~~~~--------~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~ 144 (486) T protein:vir:42 79 ------VEGFRLGDAD--------EADEELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRV 144 (486) T ss_pred ------ccceecCCCc--------hhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEE Confidence 1111122211 22355677777788999999999999999998876554431 23467778888 Q ss_pred eCccccchhhhhcccCCccccccc-----cccchheeecCcccccccccccccccc-ccccccceecchhceeeeecccc Q lcl|NC_019530. 179 LDSRCIRPVYIVEKAMREGGIEAI-----ESVTLKYYYNPNYNRNQFTGQSGTSQN-FQPSQQELVFDDESIVYIDSGEE 252 (560) Q Consensus 179 LDPr~i~~vr~i~~~~~~g~~~~~-----~~~~e~~~Y~p~~~~~~~~g~~~~~~~-~~~~~~~~~i~~daI~y~hsGl~ 252 (560) +||+.+=.+..-......-.+... +.+...-+|.|..-++.. ..++.... ...+..--++|. |.|++-=-. T Consensus 145 ~~p~~~~~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~-~~~~~~~~~~~~~h~~g~vPv--v~~~n~~~~ 221 (486) T protein:vir:42 145 EPPTRMHAEIDPRINRVSKAIRVAYDKEGNEIQAATLYTPMETIGWF-RADGEWAEWFNVPHGLGVVPV--VPLPNRTRL 221 (486) T ss_pred ecccceEEEEeCCCCCeEEEEEEEEecCCCeEEEEEEEcCCcEEEEE-ecCCcEEeecceecCCCCceE--EEecccccc Confidence 998876665421110000000000 011112235554222211 11110000 000000001222 234432111 Q ss_pred cCCCCcchhhhHH----HHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCC Q lcl|NC_019530. 253 PLANGIVPGLLNP----AIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTG 328 (560) Q Consensus 253 ~~~~~~i~syL~~----AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TG 328 (560) .++ ...|-|.+ .+..+|. .+-+..++-...-.|.|-|.-.+....+.... +.+.+.++..| T Consensus 222 ~~~--~G~s~i~~~v~~liDa~~~--~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~-----------~~~~~~~~~~~ 286 (486) T protein:vir:42 222 SDL--YGTSEITPELRSMTDAAAR--ILMLMQATAELMGVPQRLIFGIKPEEIGVDSE-----------TGQTLFDAYLA 286 (486) T ss_pred CCC--CCcccchhhHHHHHHHHHH--HHHHHHHHHHhhcchHHHhhcCCccccccccc-----------cccchhhhhhc Confidence 111 12233332 2233332 34455555566666666554333222111000 01111222222 Q ss_pred cccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHH Q lcl|NC_019530. 329 KITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQE 408 (560) Q Consensus 329 ev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRD 408 (560) ..|..-.+ ..++..+|+.+-=.-++-++=.-.++...-++|..-|...+.-+ +.+..|.-- T Consensus 287 -------------~~~~~~~~---~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~---~Sg~Al~~~ 347 (486) T protein:vir:42 287 -------------RILAFEDA---EGKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNP---ASAEAIRAA 347 (486) T ss_pred -------------hhcccCCC---CceEEeecccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCch---hHHHHHHHH Confidence 22332111 23455666643222333344444555666788877775443211 123456667 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC-hhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_019530. 409 ELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTD-EKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFV 487 (560) Q Consensus 409 ElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit-~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~v 487 (560) +.....-+.+.|..|..-+...++.-+.+.|... +.+|. .|.+.|..-..=. +.+.++.+..+..-+ T Consensus 348 ~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~d~~-----~i~v~w~~~~~~s-------~~~~ad~~~kl~~~~ 415 (486) T protein:vir:42 348 ESRLIKKVERKNLMFGGAWEEAMRIAYRIMKGGDVPPDML-----RMETVWRDPSTPT-------YAAKADAATKLYGNG 415 (486) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccce-----eeeEEecCCCCCC-------HHHHHHHHHHHHhcc Confidence 7778888899999999999999987777766532 23332 3677774332211 223445555544332 Q ss_pred cchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCC-----CCCCccccccccccccCCCCCCcccccccCCc Q lcl|NC_019530. 488 GSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLY-----PKVQADETGNYSGSDVSPLKFKPETIPFSGST 556 (560) Q Consensus 488 Gky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~-----~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~ 556 (560) --++|.++++ ..|.+++++++++++.-++|...+.- .......+|.+.+. .+..=.|...++.|.. T Consensus 416 ~g~~s~et~~-~~lg~~~d~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~ 486 (486) T protein:vir:42 416 QGVIPRERAR-IDMGYSVKEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPSPT--APPKPQPAIESSGGDA 486 (486) T ss_pred cCCCCHHHHH-hcCCCChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCCC--CCCCCCcccCCCCCCC Confidence 2478999998 67999999999877654444322211 11111112222111 1122334444444433 No 58 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=98.20 E-value=2.8e-06 Score=51.06 Aligned_cols=444 Identities=12% Similarity=0.059 Sum_probs=185.3 Q ss_pred ccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccc-------------------------hhHHH Q lcl|NC_019530. 39 ATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPE-------------------------IRKAV 93 (560) Q Consensus 39 ~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PE-------------------------vd~AI 93 (560) .+++++-..+.-.+...-. .+-... ..-..+|+.+..+.+ +.-+| T Consensus 1 ~~~~i~~~~~~~~~~~~~~-------~L~~~~---~~~~~r~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iv 70 (485) T protein:vir:24 1 MTAPLPGQEEIADPAIARD-------EMVSAF---EDQNQNLRSNTSYYEAERRPEAIGVTVPVQMQSLLAHVGYPRLYV 70 (485) T ss_pred CCCCCCCCCcccchHHHHH-------HHHHHH---HHHHHHHHHHHHHHhccCchhhcCcccchhhhhhhhccchHHHHH Confidence 2222221111111100000 000000 000111222222211 12222 Q ss_pred HHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCC----C Q lcl|NC_019530. 94 DIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTN----K 169 (560) Q Consensus 94 ~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~----~ 169 (560) +..+. .++ -.++. +.+.+ ...+.+..|.+..+|+....++++.-++.||-|.+.-.|.+. . T Consensus 71 d~~~~-~l~----~~g~~--~~~~~--------~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~ 135 (485) T protein:vir:24 71 DSIAE-RQA----VEGFR--LGDAD--------EADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGW 135 (485) T ss_pred HHHhh-hhc----cCcee--cCCCc--------hhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCccccccc Confidence 21111 011 01111 11211 223445667777889999999999999999988754443321 1 Q ss_pred ccCeeEeEEeCccccchhhhhcccCCccccccc-----cccchheeecCccccccccccccccccccc-cccceecchhc Q lcl|NC_019530. 170 KGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAI-----ESVTLKYYYNPNYNRNQFTGQSGTSQNFQP-SQQELVFDDES 243 (560) Q Consensus 170 k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~-----~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~-~~~~~~i~~da 243 (560) .+|-..++.++|+.+-.++.-........+... ..+...-+|.+..-++ +...++.-..... +-.--.+|. T Consensus 136 ~~~~~~i~~~~p~~~~~i~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~-~~~~~~~~~~~~~~~h~~g~vPv-- 212 (485) T protein:vir:24 136 DPNVPLIRVEPPTRMYAEIDPRIGRPAKAIRVAYDAEGNEIQAATLYTPNETFG-WFRAEGEWVEWFSDPHGLGAVPV-- 212 (485) T ss_pred CCCcceEEEeccceeEEEeeCCcCceeEEEEEEEeecCCeEEEEEEEcCCcEEE-EEecCCceEeecccccCCCcccE-- Confidence 234456788888877555321100000000000 0111122344432221 1111100000000 000001222 Q ss_pred eeeeecccccCCCCcchhhhHHHHHHH-HH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhhe Q lcl|NC_019530. 244 IVYIDSGEEPLANGIVPGLLNPAIRPL-NN-LVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRN 321 (560) Q Consensus 244 I~y~hsGl~~~~~~~i~syL~~AiK~~-Nq-L~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~ 321 (560) |.|.+-.- +..-+..|-|.+.++++ .. -+++-|..++-...-.|.|-+.=.+....+...- +... T Consensus 213 v~f~n~~~--~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~-----------~~~~ 279 (485) T protein:vir:24 213 VPLPNRTR--LSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPE-----------TGQT 279 (485) T ss_pred EEeccCcc--cCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCccccccccc-----------cccc Confidence 23444211 11123334454444433 22 2556677777777777777554222111110000 0011 Q ss_pred eecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCccccCCCCccccccc Q lcl|NC_019530. 322 AYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLG-EMDHVNYFREKLYDALMIPKSRLQEEGSINIGGS 400 (560) Q Consensus 322 vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLg-ei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~ 400 (560) +.++..| ..|+.--+ +.++..++..+ +. -++-++-.-..+...-++|..-|...+.-+ + T Consensus 280 ~~~~~~~-------------~i~~~~~~---~~~~~q~~~~~-~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~---~ 339 (485) T protein:vir:24 280 LFDAYLA-------------RILAFEDA---EGKIQQFSAAE-LANFTNALDQIAKQVAAYTGLPPQYLSTAADNP---A 339 (485) T ss_pred hhhhccc-------------ceeccCCC---CceEEeecccc-hHHHHHHHHHHHHHHhcccCCCHHHhccccCcc---h Confidence 1122222 23443222 23455566522 22 222233333344445577777775443211 1 Q ss_pred chhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-ChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHH Q lcl|NC_019530. 401 NLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVT-DEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLAS 479 (560) Q Consensus 401 ~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgii-t~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~ 479 (560) .+..|.--+...-.-+.+.|+.|..-+...++.-+.+.+-. .+.+|. .|.+.|..-..=. +.+.++. T Consensus 340 Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~d~~-----~i~v~f~~~~~~s-------~~~~ad~ 407 (485) T protein:vir:24 340 SAEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLMKGGDVPPDML-----RMETVWRDPSTPT-------YAAKADA 407 (485) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCccccc-----eeeEEecCCCCCC-------HHHHHHH Confidence 23456666777888889999999999998888766554432 223332 3677774332111 2245555 Q ss_pred HHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCC-CCCcc-ccccccccccCCCCCCcccccccCCcc Q lcl|NC_019530. 480 LNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYP-KVQAD-ETGNYSGSDVSPLKFKPETIPFSGSTD 557 (560) Q Consensus 480 l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~-~p~~~-e~g~~~~~~~~~~~~~p~~~~~~~~~~ 557 (560) +..+-.-+--.+|.+++++ .|.+++++++++++..++|...+.-. +...+ ..++.+ ...++ -.++-.|+.-+.| T Consensus 408 ~~kl~~~g~~~~s~et~~~-~l~~~~d~~~e~~~~~ee~~~~~~~~~~~~~~~~~~~~~-~~~~~--e~~~~~~~~~~~~ 483 (485) T protein:vir:24 408 ATKLYGNGQGVIPRERARK-DMGYSIAEREEMRRWDEEEAAMGLGLLGTMVDADPTVPG-SPNPT--PAPKPQPAIEGGD 483 (485) T ss_pred HHHHHhcccccCCHHHHHh-hCCCCHhHHHHHHHHHHHHhhhhhhHHHhhcccCCCCCC-CCCCC--CCCCCccCCCCCC Confidence 5555443224799999995 59999999998877666654222100 00000 000011 01111 1222233332333 Q ss_pred cC Q lcl|NC_019530. 558 DS 559 (560) Q Consensus 558 ~~ 559 (560) .. T Consensus 484 ~a 485 (485) T protein:vir:24 484 SA 485 (485) T ss_pred CC Confidence 33 No 59 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=98.19 E-value=2.8e-06 Score=51.00 Aligned_cols=395 Identities=14% Similarity=0.112 Sum_probs=182.2 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) |++|+.+ +...+..++..+. -...+.+ +..+.++. . +....+|.|.+||. T Consensus 1 M~~f~~~------------~~~~~~~~~~~~~---~~~~~~~-~~~~~~v~-------~-------~~al~~~~V~~~v~ 50 (397) T protein:vir:38 1 MPLLKLN------------KSHSQGFSLNDPD---WVNFLTG-GEAQKYVS-------A-------DTALKNSDIFSLIM 50 (397) T ss_pred Ccchhhh------------hcccCcccCCchh---hhhhhcC-CcCCceec-------h-------HHhhccHHHHHHHH Confidence 3333211 0001112222111 0011111 11111111 1 13356899999998 Q ss_pred HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCee Q lcl|NC_019530. 95 IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIK 174 (560) Q Consensus 95 eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~ 174 (560) -|.+.+..++ +..+. ..-.++..+-. ..+....=.+.+++.++++|.-|..++.|. ...+. T Consensus 51 ~ia~~ia~~p-------~~~~~------~~~~~l~~~PN---~~~s~~~f~~~~~~~lll~Gna~~~i~r~~---~g~~~ 111 (397) T protein:vir:38 51 QLSGDLAMVR-------YTSES------DRSQSIISNPS---VTANGYSFWQGMFAQLLLDGNCYAYRHKNT---NGVDL 111 (397) T ss_pred HHHHHHhhCc-------ccccc------cHHHHHHhcCC---CCCCHHHHHHHHHHHhhhcCCEEEEEEECC---CCcEE Confidence 8877665433 21111 01111211111 122233334667778899999888777654 34589 Q ss_pred EeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeecccccC Q lcl|NC_019530. 175 KLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPL 254 (560) Q Consensus 175 eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~ 254 (560) +|++|+|..+...+ +.+++ ..+|.|.- . ....+....++.+.|.|..- .+. T Consensus 112 ~l~~l~~~~v~i~~-----~~~~~-------~~~y~~~~-------~--------~~~~~~~~~~~~~eiih~~~--~~~ 162 (397) T protein:vir:38 112 SWEYLRPSQVQPML-----LQDGS-------GLIYNINF-------D--------EPAIGYMENVPAADVIHIRL--LSK 162 (397) T ss_pred EEEEEcCceeEEEE-----cCCCc-------eEEEEEEe-------c--------cccccceeEecCccEEEecC--CCC Confidence 99999998887532 11221 11122210 0 11123345688888887763 333 Q ss_pred CCC-cchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCc Q lcl|NC_019530. 255 ANG-IVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGN 333 (560) Q Consensus 255 ~~~-~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~ 333 (560) ++. +..|-|..|.+.+.....+++...-+---.+.-+-++.++.+ +.+. +.+-+++.+..++.- ...| T Consensus 163 ~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~-~~~e-~~~~~~~~~~~~~~~----~n~~----- 231 (397) T protein:vir:38 163 NGGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKG-GLLD-AETRIARSKEISKQI----HNSD----- 231 (397) T ss_pred CCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC-CCHH-HHHHHHHHHHHHhcc----cccC----- Confidence 443 467889999999999999988877655555666777777755 3433 333344434322210 1112 Q ss_pred ccchhhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHH Q lcl|NC_019530. 334 AHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRF 412 (560) Q Consensus 334 ~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF 412 (560) +.+ .++ .|.+++.|.-..+-.+ ++=.++..+.+.++++||...|....+-+ ++ +..... | T Consensus 232 -~~~-vl~----------~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~---~~---~e~~~~-~ 292 (397) T protein:vir:38 232 -GPV-VID----------ALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQ---SS---ITQISG-Q 292 (397) T ss_pred -Cce-ecC----------CCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc---cH---HHHHHH-H Confidence 111 121 2566666654333223 45567889999999999999997544321 22 221222 3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhh Q lcl|NC_019530. 413 SKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFS 492 (560) Q Consensus 413 ~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S 492 (560) .. ..|+- +...+.+.|-+.| +++.+|+ +.+.+..| +..|.+.++.+- -+-++| T Consensus 293 ~~--~~l~P-~~~~ie~~ln~~l-----~~~~~~~------~~~~~~~d-----------~~~~~~~~~~~~--~~G~~t 345 (397) T protein:vir:38 293 YA--KSLNR-YVQAIVGELNDKL-----HANISAN------IRFAIDAM-----------GDQYASTISSSV--KGGTIA 345 (397) T ss_pred HH--HHHHH-HHHHHHHHHHHhc-----cChhccc------ccccccCC-----------HHHHHHHHHHHH--hCCCcC Confidence 22 23432 1122233333333 3433432 22222122 234555554432 234778 Q ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcc Q lcl|NC_019530. 493 IDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPE 548 (560) Q Consensus 493 ~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~ 548 (560) .+-+++ +|++..-+=. ....-+.......+.....+|+..+.+.++..=+|| T Consensus 346 ~nE~R~-~lg~~p~~~~---d~~~~~~~~~~~~~~~~~~~g~~~~~~~~e~~~~~~ 397 (397) T protein:vir:38 346 GNQARF-ILQNSGYLAK---DLPDPEKEPQQAIQLIQQEGGENDGNNSDERGSDPE 397 (397) T ss_pred HHHHHH-HhCCCCCCCC---ccccccccccccccccccccCCCCCCCCCCCCCCCC Confidence 888875 4666431100 000000000000001111122222222222333344 No 60 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=98.19 E-value=2.9e-06 Score=50.93 Aligned_cols=403 Identities=11% Similarity=0.094 Sum_probs=168.8 Q ss_pred HHHHhh-ccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhC-------------Cchhh Q lcl|NC_019530. 80 YQSMAQ-QPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMD-------------FDNTA 145 (560) Q Consensus 80 YR~ma~-~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~-------------f~~~~ 145 (560) -|+|+. +|-|..||.-|.+.+. .-|+.|....-..... +.....+.+.+.+. +.... T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia-----~~p~~i~~~~~~~~~~----~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~ 71 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVA-----GFGINIIPHPEAEDPD----RDGEQYERVWDFWFGDDSNWQVGPMESERATA 71 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhh-----cCCeEEEEccCccccc----chhhhhhhHHHHhhccCCCccccchhhHhhHH Confidence 567765 6999999977666653 2344443221111111 12222222222111 11233 Q ss_pred HHHHHH----hhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCcccccccc Q lcl|NC_019530. 146 YQKIRK----WYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFT 221 (560) Q Consensus 146 ~~~~R~----wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~ 221 (560) .++++. .++.|.-|.+++.|. +..+.+|.+|||.+|+..+.- ...+....+..-||.+.......... T Consensus 72 ~~~~~~~~~~l~l~Gn~~i~~~r~~---~G~~~~l~~l~~~~v~~~~d~-----~~~~~~~~~~~~~~~~~~~~~~~~~~ 143 (467) T protein:vir:31 72 TNVLQTAWTDYEAIGWLTIEILTQT---DGTPTGLAYVPGHTIRKRMDE-----RGFVQLLEEKEKYFGVAGDRYQTNGN 143 (467) T ss_pred HHHHHHHHHHHHhcCCeEEEEEECC---CCcEEEEEEeCCceeEeeeec-----ceeEeecCCceeeEEeccccceeecc Confidence 344333 345588888888654 345889999999888764311 11111222222222221110000000 Q ss_pred ccc---cccccccccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcC-ccceEEEcc Q lcl|NC_019530. 222 GQS---GTSQNFQPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRA-PEKRAFYLD 297 (560) Q Consensus 222 g~~---~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRA-PeRRvFyID 297 (560) +.- .........+..+.++.+.|.|...- -..+.-+.+|-+..|.+.+..-..++....=+ ..++ --+-|..+. T Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~diih~r~~-~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~-f~ng~~p~gil~~~ 221 (467) T protein:vir:31 144 GDLDPVFVDADDGSTGTSVSNPANELIFKRNH-SPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDF-FENDGVPRIAIIVK 221 (467) T ss_pred cceeeeeeeeccccccceeEeccccEEEecCC-CCCCCcccccHHHHHHHHHHHHHHHHHHHHHH-HhccCCCceEEEec Confidence 000 00001223456778899988887521 12344466788898888876665555443211 1122 223344555 Q ss_pred CCCCchHHHHHHHHHHH-HhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCC------------ Q lcl|NC_019530. 298 VGTLGKKSAEEYMTMMM-GKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQ------------ 364 (560) Q Consensus 298 vGnlpk~KAeqYl~~im-~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~n------------ 364 (560) -+.+.+ ++.+-+++.+ ..|++. ++ . .++.--+...|-.+..|++|.. T Consensus 222 ~~~l~~-e~~~~~~~~~~~~~~~~-------~~------~------~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls 281 (467) T protein:vir:31 222 GAELTE-KGREEMRNLIEDNNEDN-------HR------T------AFIETEKIVQNEDYLNLADGADRSDVEIRLEPLT 281 (467) T ss_pred CcCCCH-HHHHHHHHHHHhhhcch-------hh------h------hhhhhcccccccccccccCCCcccccceeEEecc Confidence 455544 4444444444 333321 10 0 0111111112222333333321 Q ss_pred -----cchHHHH-HHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019530. 365 -----LGEMDHV-NYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKF-CAGLRRRYSHFFMEFLRRQLIL 437 (560) Q Consensus 365 -----Lgei~DV-~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kf-i~rLr~rfs~if~d~Lk~qLiL 437 (560) =.++-+. ++..+...++.+||.+.|+...+.+.+ +++++... .|.++ +.-+.+++...|. ..| T Consensus 282 ~~~~~d~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~~-s~~e~~~~---~f~~~~l~P~~~~ie~~ln----~~l-- 351 (467) T protein:vir:31 282 VGIDEEASFLEFRGRNEHDILKVHDVPPVIAGVVESGAFS-TDAEEQRK---EFAEETIQPKQHDFGELLY----ELV-- 351 (467) T ss_pred ccChhhHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCcc-cCHHHHHH---HHHHHHHHHHHHHHHHHHH----Hhh-- Confidence 1122222 345566999999999998654333322 33333333 35444 3444444333333 222 Q ss_pred ccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHH Q lcl|NC_019530. 438 KGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAE 517 (560) Q Consensus 438 kgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~ 517 (560) ++..+.. -..+|.|++..-- ..-+..|++.+..+ +-.-++|.+-++ ..+++..- . + T Consensus 352 ---~~~~~~~--~~~~i~f~~~~l~-------~~d~~~~~~~~~~~--~~~G~~T~NE~R-~~~Gl~pi--~-------d 407 (467) T protein:vir:31 352 ---HKQGLDA--PDWTIEFELAKPD-------TKLQDVEIASQRVQ--AMQGLLTVNELR-DEFGFEPF--P-------E 407 (467) T ss_pred ---cchhhcc--CCceEEEecchhh-------ccCHHHHHHHHHHH--HhCCCcCHHHHH-HHhCCCCC--C-------c Confidence 3322211 1223555443321 11223444444443 112377777777 34666321 0 0 Q ss_pred hhhcCCCCCCCccccccccccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 518 EKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 518 E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) |.-.+. ..+.....|+..+++.. .++..|......+++ T Consensus 408 ~~~~~~-~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~ 445 (467) T protein:vir:31 408 EHVYGG-ETLVAEVTGGSGPGGGI----GDQIEQLVEDRADEI 445 (467) T ss_pred ccccCC-cccccccccccCCCCcc----cCcCCCCCCCcccch Confidence 000000 00111122222222111 112222111111111 No 61 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=98.18 E-value=3e-06 Score=50.83 Aligned_cols=441 Identities=10% Similarity=0.027 Sum_probs=186.4 Q ss_pred ccccCCCCCCCCcee--Eecc-------------ccceeccccccch-----hhHHHHHHHHHHHhhccchhHHHHHhhc Q lcl|NC_019530. 39 ATVVALDDAQDGSII--LQGG-------------ANTYNYVGVESEL-----LSVKTVVEEYQSMAQQPEIRKAVDIIVN 98 (560) Q Consensus 39 ~~s~~~~~~~dG~~~--i~~~-------------~~~~~y~~~e~~~-----~~~~~LI~~YR~ma~~PEvd~AI~eIvn 98 (560) -+-++..+-+..... +... -...+|.+- ..+ .+.....+..+.++.+.=+.-+|+..++ T Consensus 1 ~~~~p~~~l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~-~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~ 79 (479) T protein:vir:99 1 MIDLPDEDLSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNG-QEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSFAQ 79 (479) T ss_pred CccCCcccCChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcC-CcccccccccCChhHHHHHHHhhcCcHHHHHHHHHh Confidence 011110000000000 0000 000111110 000 0111111222222222222223322221 Q ss_pred ceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCC---CCccCeeE Q lcl|NC_019530. 99 DVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPT---NKKGGIKK 175 (560) Q Consensus 99 eaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~---~~k~GI~e 175 (560) = . .+..+...+.+ ..+++..|++..+|+....++++.-++.|+-|. ++.+. ...+|... T Consensus 80 ~-l------~~~gf~~~d~~---------~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~af~--~v~~~~~~~d~~g~~~ 141 (479) T protein:vir:99 80 Q-L------IVDGYRKTGTN---------ENAKGWDTWRLNQMDKQQFWLNRAVLTFGYAFI--KVTSGISPLDGTTVAR 141 (479) T ss_pred h-c------ccccccCCCch---------hhHHHHHHHHhcChhHHHHHHHHHHhhcCceEE--EEecCCCCcCCCCceE Confidence 0 0 11122222211 234556677778899999999999999998665 33321 23578889 Q ss_pred eEEeCccccchhhhhcccCCcccccccc--ccchheeecCccccccccccccc-cc-cccccccceecchhceeeeeccc Q lcl|NC_019530. 176 LVMLDSRCIRPVYIVEKAMREGGIEAIE--SVTLKYYYNPNYNRNQFTGQSGT-SQ-NFQPSQQELVFDDESIVYIDSGE 251 (560) Q Consensus 176 Lr~LDPr~i~~vr~i~~~~~~g~~~~~~--~~~e~~~Y~p~~~~~~~~g~~~~-~~-~~~~~~~~~~i~~daI~y~hsGl 251 (560) ++.+||+.+-.+..=..... -.+..+. .....-+|.+. ....+.-.++. .. ...+. .--++| .|.|+|..- T Consensus 142 i~~~~p~~~~~iydd~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~h-~~g~vP--vv~f~n~~~ 216 (479) T protein:vir:99 142 IKCIDPRDAFAIWEDPYWDE-WPKYLLERQPNGQYWWWTEE-DYSIFEFKQGKFIYRETVSH-DYGHIP--FVRYVNVMD 216 (479) T ss_pred EEEechhheEEEecCCcccc-eeeEEEeecCceeEEEEecc-eEEEEEecCCceeecccccc-CCCCcc--eEEeecCCC Confidence 99999998876631111110 0001110 11111112111 11111100000 00 00000 001122 233455321 Q ss_pred ccCCCCcchhhhHHHHHHHHHH-HHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcc Q lcl|NC_019530. 252 EPLANGIVPGLLNPAIRPLNNL-VTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKI 330 (560) Q Consensus 252 ~~~~~~~i~syL~~AiK~~NqL-~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev 330 (560) .+ ....|=++..+.....+ +.+-+..++-.....|.|-++=. .++.... .. +++ -+.. .+ T Consensus 217 ~~---~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~---~~~~~~~--~~---~~~--~~~~----~~-- 277 (479) T protein:vir:99 217 LR---GVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATGL---MLPEGAN--AD---QEK--MRFA----QE-- 277 (479) T ss_pred cC---cCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCC---Ccccccc--cc---hhc--cccc----cc-- Confidence 11 12233333333222222 34556666666777777765521 1111000 00 000 0000 01 Q ss_pred cCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHH Q lcl|NC_019530. 331 TGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEEL 410 (560) Q Consensus 331 ~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDEl 410 (560) -.|.. .+-+.++-++|+..--.-++-++=.-..+...-++|..-|+.. +.+.+..|.--+. T Consensus 278 -----------~i~~~---~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~g~~-----~n~Sg~Al~~~~~ 338 (479) T protein:vir:99 278 -----------SMLIS---QNEKASFGAIPAAPLDGLLNAYKESLLEFLALAQLPPHIAGQI-----VNVAADALAAGTR 338 (479) T ss_pred -----------cceee---cCCCceEEEecccchHHHHHHHHHHHHHHhccCCCCHHHcccc-----cchHHHHHHHHHH Confidence 11211 1224567777764322222323333334444557776655321 1122345555555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcch Q lcl|NC_019530. 411 RFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSI 490 (560) Q Consensus 411 kF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky 490 (560) ..-.-+.+.|+.|..-+.+.|+.-+.+.|.--+.++ . .|.+.|. |.... -+.+.++.+.++-.- | . T Consensus 339 ~l~~ka~~~~~~f~~al~~~~~l~~~~~~~~~~~~~---~--~i~~~w~-~~~~~------s~~~~ad~~~kl~~a-g-~ 404 (479) T protein:vir:99 339 QTMQKLFEKQATWKASHNQTMRLVNKIEGRTEEATD---L--DFTITWQ-DVTIQ------SLAQFADAWAKMVES-L-K 404 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccc---e--eeeEEec-CCCCC------CHHHHHHHHHHHHhc-C-C Confidence 567788899999999999999988888886433322 1 2555552 32111 122355555554332 3 4 Q ss_pred hhHHHHHHHHhCCCHHHHHHHHHHHHHhh-----hcCCCC--CCCccc---cccccccccCCCCCCcccccccCC Q lcl|NC_019530. 491 FSIDYVMRNVLRMSDEEVKEQQTKIAEEK-----KKGLYP--KVQADE---TGNYSGSDVSPLKFKPETIPFSGS 555 (560) Q Consensus 491 ~S~~~i~k~IL~~tDeEI~e~~kqIe~E~-----~~~~~~--~p~~~e---~g~~~~~~~~~~~~~p~~~~~~~~ 555 (560) +|.+++++.+...|+.+++.+.+..+++. .+.+.. +|.++. .|.....+-++.+=.|.-+|-+|+ T Consensus 405 is~et~l~~l~gv~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (479) T protein:vir:99 405 IPAEGVWDMIPNLDQSTVNGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNGATNMQQANNKTGEPASLNKSGA 479 (479) T ss_pred CCHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCCCCCCCCCCCCCCCCcchhccCCCCC Confidence 99999999988999999987655544442 111222 222111 111112223333455777777787 No 62 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=98.16 E-value=3.3e-06 Score=50.60 Aligned_cols=422 Identities=12% Similarity=0.090 Sum_probs=183.0 Q ss_pred cccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHHHhhccee----- Q lcl|NC_019530. 27 LAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVV----- 101 (560) Q Consensus 27 ~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneai----- 101 (560) ++..-.. ...++.+...+ ..+.-...-+..+.. +. . +...... ...+.+|.|..||+-|-+.+. T Consensus 1 ~~~~~~~-~~~~~~~~~~~-~~~~~~~~~g~~~~~-~~-~-----~~~~~~~--~~a~~~~~v~~~v~~ia~~iA~lp~~ 69 (460) T protein:vir:10 1 MANRIIR-ALRELTGLDNK-FNDAFIKYIGQTFTK-YD-N-----NGKTYLE--QGYNINPDVYSCISQMAAKTVAVPYT 69 (460) T ss_pred CchhHHH-HHhhhhccCCC-chHHHHHhhccccCC-Cc-c-----chhhhhH--HHHhcchHHHHHHHHHHHhhhhCceE Confidence 1111111 11111111111 111111110111111 11 0 0011110 024677999999977776533 Q ss_pred eecCCCceEEEeeecccchhHHHHHHHHHHHHHHHH-----------Hh---CCchhh----HHHHHHhhhhhhhhhhhe Q lcl|NC_019530. 102 TCEEDETPVTVNLDKVEGISDTVKESITECFKEVMH-----------LM---DFDNTA----YQKIRKWYVDGRQAYHVI 163 (560) Q Consensus 102 v~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~-----------ll---~f~~~~----~~~~R~wYvDGr~~~hki 163 (560) |++...+...-.........+.+...++..+++.+. |+ +-...+ +.++..+++.|.-|..++ T Consensus 70 v~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~ 149 (460) T protein:vir:10 70 IKVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLM 149 (460) T ss_pred EEeccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEE Confidence 333333332222222212222333334444433321 11 222233 344456678888888777 Q ss_pred ecCCCCccC-eeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchh Q lcl|NC_019530. 164 VDPTNKKGG-IKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDE 242 (560) Q Consensus 164 iD~~~~k~G-I~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~d 242 (560) .+...-..| +.+|++|+|..+.... +.++.........-+|.| ...+....++.+ T Consensus 150 r~~~~~~~G~~~~L~~l~~~~v~v~~-----~~~~~~~~~~~~~~~~~~-------------------~~~g~~~~~~~~ 205 (460) T protein:vir:10 150 SPDDGINAGVPSQMYVLPAHLIKIVL-----KDDINLLSTDSPIKSYML-------------------IQGDQFIEFNED 205 (460) T ss_pred ecCCCccCceeEEEEEEcCceEEEEE-----cCCCceeeeeeeeeEEEE-------------------ecCceeEEeccc Confidence 666544455 6789999998887642 222211111100011111 112345678888 Q ss_pred ceeeeecc--cccCCCC--cchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhh Q lcl|NC_019530. 243 SIVYIDSG--EEPLANG--IVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFK 318 (560) Q Consensus 243 aI~y~hsG--l~~~~~~--~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~k 318 (560) .|.|+... ..+.+++ +.+|.+..|.+.+.....+++...-+--.-++-. ..+..-+.|.+..+++..+.+...|+ T Consensus 206 evih~r~~~~~~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~~~-~i~~~~~~l~~e~~~~~~~~~~~~~~ 284 (460) T protein:vir:10 206 EVIHTKYANPNFDLQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQNGGVFG-FIHGGSTGLTQPQADSLKQRLTEMDK 284 (460) T ss_pred ceEEEecCCCCcccccCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc-eeeecCCCCCHHHHHHHHHHHHHHhc Confidence 88777421 1122222 4567888888888888888887776655556554 45666677777766665555555544 Q ss_pred hheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccc Q lcl|NC_019530. 319 NRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGG-NQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINI 397 (560) Q Consensus 319 nk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l 397 (560) .. ...|.+ + .++ .|.+++.|.-. ..+.-++-.+|..+.+.++++||...|+...+.+. T Consensus 285 g~----~n~g~~------~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~ 343 (460) T protein:vir:10 285 SP----DRLSQI------A-GAS----------GEIAFTKISLNTDELKPFDYLKYDQKAICNALGWSDKLLNNNEGGGL 343 (460) T ss_pred Cc----cccCCc------e-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCC Confidence 22 112322 2 222 25666666431 22222455668889999999999999976544334 Q ss_pred cccchhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHH Q lcl|NC_019530. 398 GGSNLAEITQEELRFSKF-CAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDR 476 (560) Q Consensus 398 ~~~~~~eItRDElkF~Kf-i~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~R 476 (560) .++++.+..+. |..+ |.-+-.++ .+.|-..| +++.+.. ...+|.|+|. ++. .++.+ T Consensus 344 ~~sn~e~~~~~---f~~~~l~P~~~~i----e~~ln~kl-----~~~~~~~--~~~~i~~d~~------~l~---~l~~d 400 (460) T protein:vir:10 344 NTGNLEEERKR---VVTDNIQPDLVIL----KQAFDKKF-----IKRFKGY--ENAVIEWDIS------ELP---EMQTD 400 (460) T ss_pred ccccHHHHHHH---HHHHHHHHHHHHH----HHHHHHhh-----cCccccc--CCceEEeecc------hhh---hHHHH Confidence 45544444333 5554 23333222 33333333 3333321 2233444432 221 12222 Q ss_pred HHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCccccccc--- Q lcl|NC_019530. 477 LASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFS--- 553 (560) Q Consensus 477 l~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~--- 553 (560) +..+.. -+-.-++|.+-+++ ++.| | |-.++-|+.- -++.+.+|.. T Consensus 401 ~~~~~~--~~~~g~~T~NE~R~-~~g~---------------------~-pi~~~~gD~~-------~~~~n~~~~~~~~ 448 (460) T protein:vir:10 401 MVAMAS--WLNTIPVTPNEIRI-AMKY---------------------E-TLNQDGMDIV-------FMPSNKVRIDDVS 448 (460) T ss_pred HHHHHH--HHhCCCCCHHHHHH-HhCC---------------------C-CCCCCCCCee-------eecccccchhhcc Confidence 222211 11112344444332 2222 1 1112222211 1111111111 Q ss_pred -CCcccCC Q lcl|NC_019530. 554 -GSTDDSI 560 (560) Q Consensus 554 -~~~~~~~ 560 (560) ..+|+.. T Consensus 449 ~~~~~~~~ 456 (460) T protein:vir:10 449 NNLIDSAF 456 (460) T ss_pred cccCCCcc Confidence 0111222 No 63 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=98.16 E-value=3.4e-06 Score=50.53 Aligned_cols=481 Identities=11% Similarity=0.086 Sum_probs=191.2 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccch-------h--hhcccccc-ccCCCCCCCCceeEeccccceeccccccch Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQ-------D--ELLLTKAT-VVALDDAQDGSIILQGGANTYNYVGVESEL 70 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~--e~~~~k~~-s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~ 70 (560) |..--.-.|-+|.+.-.-.....+-.+...- + +..++|.. +....-.++..-++.++-....-+..-... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~p~~~~~~~~~~~~~~p~~~~~~ 80 (576) T protein:vir:96 1 MVTRLADIFKRLRLGRDYEDIIDTVPIDDGLQANIRNIEEKSKELNKSLYGKQQAYAEPFLEVMDTNPEFRTKRSYMKNS 80 (576) T ss_pred ChhhHHHHHHHHhccCccccchhhhhcccChhHHHHHhhhhhhhhccccCCccchhhcceeeeeecCCCccccCcchhhh Confidence 2221112222222111101111111110000 0 01111211 111111111111122211111111111111 Q ss_pred hhHHHHHHHHHHHhhccchhHHHHHhhcceee----ecCCCc--eEEEeeecccc-hhHHHHHHHHHHHHHHHHHhCCc- Q lcl|NC_019530. 71 LSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVT----CEEDET--PVTVNLDKVEG-ISDTVKESITECFKEVMHLMDFD- 142 (560) Q Consensus 71 ~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv----~d~~~~--~V~l~Ld~~~~-~s~sik~~I~eeF~~il~ll~f~- 142 (560) .+...+ -+.++.+|=|..||..|.+.+.+ +..... ...+.|...+. .++.-+..+...-..+.+++... T Consensus 81 ~~~~~~---l~~~~~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~ 157 (576) T protein:vir:96 81 DNLHDV---LKQFGNNPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKD 157 (576) T ss_pred hhhHHH---HHHhhcCHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCC Confidence 122233 34566789999999888776543 222222 33444433321 12222222222112233333221 Q ss_pred ---hhhHHH----HHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCcc Q lcl|NC_019530. 143 ---NTAYQK----IRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNY 215 (560) Q Consensus 143 ---~~~~~~----~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~ 215 (560) ....++ ++.+++.|.-|+.++++.+. ..-+.+|.+|||.+|..++ +.++. ++....-|+.+. T Consensus 158 p~~~t~~~f~~~lv~dlll~Gna~~~i~~~rd~-~g~~~~L~pl~p~~V~v~~-----~~dg~--~~~~~~~~~~~~--- 226 (576) T protein:vir:96 158 IDRDSFQSFCRKIVRDTYTYDQVNFEKVFNKKN-ATTMDKFIAVDPSTIFYAT-----DKNGK--IIKGGKRFVQVI--- 226 (576) T ss_pred CccccHHHHHHHHHHHHHhcCCeEEEEEEecCC-CCceEEEEEeCCceeEEEE-----CCCCc--eeeeeeEEEEec--- Confidence 134444 44456778888888876432 2338999999999988753 22231 111111111111 Q ss_pred ccccccccccccccccccccceecchhceee-eeccccc-CCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceE Q lcl|NC_019530. 216 NRNQFTGQSGTSQNFQPSQQELVFDDESIVY-IDSGEEP-LANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRA 293 (560) Q Consensus 216 ~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y-~hsGl~~-~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRv 293 (560) + ......++.+.|+| .+.=..+ ..+++.+|-|+.|.+++.....++....=+=---|--+-| T Consensus 227 ------~----------~~~~~~~~~~dii~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~gi 290 (576) T protein:vir:96 227 ------N----------KKVVASFTSREMAMGIRNPRTELSSSGYGLSEVEIAMKQFIAYNNTETFNDRFFSHGGTTRGI 290 (576) T ss_pred ------C----------CceEEEecccceEEEeecCCCCcccCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE Confidence 0 11233455665544 3221111 1245677889999999988887777654443334556677 Q ss_pred EEccCC-CCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCC-cchHHHH Q lcl|NC_019530. 294 FYLDVG-TLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQ-LGEMDHV 371 (560) Q Consensus 294 FyIDvG-nlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~n-Lgei~DV 371 (560) ..+..+ .+.+..+++-.+.+-..|+- ..+..+.+-+++ .|.+++.|.--.. +.-++-. T Consensus 291 L~~~~~~~ls~e~~~~lr~~~~~~~~G----------~~nag~~p~vl~----------~G~~~~~ls~~~~d~qfle~~ 350 (576) T protein:vir:96 291 LQIKSEQQQSQRALENFKREWKSSFSG----------INGSWQVPVVMA----------DDIKFVNMTPTANDMQFEKWL 350 (576) T ss_pred EEeCCCCCCCHHHHHHHHHHHHHHhcc----------ccccccceeecC----------CCceEEeccCChhhHHHHHHH Confidence 777754 45555554444433333332 111222222233 2566777743222 2224445 Q ss_pred HHHHHHHHHhcCCCccccCCCCc-c--------cccccchhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccCC Q lcl|NC_019530. 372 NYFREKLYDALMIPKSRLQEEGS-I--------NIGGSNLAEITQEELRFSKF-CAGLRRRYSHFFMEFLRRQLILKGVT 441 (560) Q Consensus 372 ~YF~~kLy~aL~VP~sRl~~~~~-~--------~l~~~~~~eItRDElkF~Kf-i~rLr~rfs~if~d~Lk~qLiLkgii 441 (560) +|..+.+.++++||...|+.... . .+.++++.+..+. |.++ +.-+..++...|. ..|+ T Consensus 351 ~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~sn~e~~~~~---f~~~tL~P~~~~ie~~ln----~~Ll----- 418 (576) T protein:vir:96 351 TYLINIISALYGIDPAEIGFPNRGGATGGKGGNTLNEADPGKKQQQ---SQNKGLQPLLRFIEDLIN----THII----- 418 (576) T ss_pred HHhHHHHHHHhCCCHHHccccccccccccccccccccccHHHHHHH---HHHHHHHHHHHHHHHHHH----hhhc----- Confidence 67889999999999999964322 1 1223333333333 4443 3333333333333 3332 Q ss_pred ChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHH---H-------HHH Q lcl|NC_019530. 442 DEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEE---V-------KEQ 511 (560) Q Consensus 442 t~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeE---I-------~e~ 511 (560) +. + ...+.++|.+...= .|.+.++.......-++|..-++. .+.+..-+ . ... T Consensus 419 ~~--~----~~~~~~~f~r~d~~----------~~~e~~~~~~~~~~G~lT~NE~R~-~~gl~piegGD~~~~~~~~~~~ 481 (576) T protein:vir:96 419 SE--Y----SDKYVFQFVGGDTK----------SELDKIKILQEEVKTYKTVNEARK-EKGLKPIEGGDVLLDGSFIQSM 481 (576) T ss_pred hh--c----cCceEEEeccCCHH----------HHHHHHHHHHHHhcCccCHHHHHH-HhCCCCCCCcceeccccccccc Confidence 21 1 12366777766432 222332222222223677777764 46664321 0 000 Q ss_pred HHHHHHhhhcC-----CCCCCCcccccccc---ccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 512 QTKIAEEKKKG-----LYPKVQADETGNYS---GSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 512 ~kqIe~E~~~~-----~~~~p~~~e~g~~~---~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) ..+..++..+. ...++.+..+++.. ...-+..+...+.-.-.|+.|+.+ T Consensus 482 ~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~g~~~~~~~~~~~~~ 538 (576) T protein:vir:96 482 SLNTQKEQYEDTKQKERFDMIQQFLNSPDDEEPQQESTEDKVDGRESNDPTKIDSPV 538 (576) T ss_pred cccccCCCCCCccccccccccccccCCCCCCCCCCCCCCCcccccccccCCCCCCcc Confidence 00000000000 00000000000000 000000001111111122222222 No 64 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=98.15 E-value=3.6e-06 Score=50.42 Aligned_cols=450 Identities=13% Similarity=0.056 Sum_probs=188.2 Q ss_pred cccccCCCC-CCCCceeEec--c----------ccceeccccccchhh-HHHHHHHHHHH-hhccchhHHHHHhhcceee Q lcl|NC_019530. 38 KATVVALDD-AQDGSIILQG--G----------ANTYNYVGVESELLS-VKTVVEEYQSM-AQQPEIRKAVDIIVNDVVT 102 (560) Q Consensus 38 k~~s~~~~~-~~dG~~~i~~--~----------~~~~~y~~~e~~~~~-~~~LI~~YR~m-a~~PEvd~AI~eIvneaiv 102 (560) -.-.++.+. .++....+.. . ...-+|.+ ...+.. ....=+.+|.+ +...=+..+|+..++= ++ T Consensus 1 ~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~r~~~~~~Yy~G-~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~-l~ 78 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIARDEMVSAFEDSTQNLKTNTSYYEA-ERRPEAIGVTVPIQMQSLLAHVGYPRLYVDSIAER-QA 78 (485) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCcchhcCCCCChhhhhhhhhcCcHHHHHHHHHhh-hc Confidence 111122221 1111111110 0 00011111 011100 00000011111 1112223333322221 11 Q ss_pred ecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCC----CCccCeeEeEE Q lcl|NC_019530. 103 CEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPT----NKKGGIKKLVM 178 (560) Q Consensus 103 ~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~----~~k~GI~eLr~ 178 (560) . ..+...+.+ ...+.+..|++..+|+....++++.-++.|+-|...-.|.. .+.+|-..++. T Consensus 79 ~------~g~~~~~~~--------~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~ 144 (485) T protein:vir:10 79 V------EGFRFGDAD--------EADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRV 144 (485) T ss_pred c------cceecCCCc--------hhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEE Confidence 1 111111111 12344566777788999999999999999998875443322 22445567888 Q ss_pred eCccccchhhhhcccCCcccccccc----c-cchheeecCccccccccccccccccccccccceecchhceeeeeccccc Q lcl|NC_019530. 179 LDSRCIRPVYIVEKAMREGGIEAIE----S-VTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEP 253 (560) Q Consensus 179 LDPr~i~~vr~i~~~~~~g~~~~~~----~-~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~ 253 (560) ++|+.+-.++.-......-.+-... + ..-.-+|.|..-+..............-+..--++|. |.|++..-.. T Consensus 145 ~~p~~~~~~~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv--v~~~n~~~~~ 222 (485) T protein:vir:10 145 EPPTRMYAEIDPRIGRVSKAIRVAYDAEGNEIQAATLYTPNDIFGWYRVENEWQEWFNNPHGLGVVPV--VPIPNRTRLS 222 (485) T ss_pred EccceeEEEEcCCCCceeEEEEEEEeeCCCeEEEEEEEeCCeEEEEEEcCCceEEeccccCCCCcccE--EEeccccccC Confidence 9998876654211110000000000 0 1112234443222211110000000000000012232 4455542211 Q ss_pred CCCCcchhhhHHH----HHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCc Q lcl|NC_019530. 254 LANGIVPGLLNPA----IRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGK 329 (560) Q Consensus 254 ~~~~~i~syL~~A----iK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGe 329 (560) .+ ...|-+.+. +..+| +++-+..++-..+-.|.|-|.=.+..+++...- +...+.++..| T Consensus 223 ~~--~G~s~i~~~v~~liDa~~--~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~-----------~~~~~~~~~~~- 286 (485) T protein:vir:10 223 DL--YGTSEITPELRSMTDAAA--RILMLMQATAELMGVPQRLIFGIKPEEIGVDPE-----------TGQTLFDAYLA- 286 (485) T ss_pred CC--CCccchhHHHHHHHHHHH--HHHHHHHHHHHhhcchHHHHhcCCccccccccc-----------ccchhhhhccc- Confidence 11 223333332 22333 356677777777777777655333222221100 00000111112 Q ss_pred ccCcccchhhHhhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHH Q lcl|NC_019530. 330 ITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLG-EMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQE 408 (560) Q Consensus 330 v~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLg-ei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRD 408 (560) ..|+.- +-+.++-.++..+ ++ -++-++=.-..++..-++|.+-|...+. | .+.+..|..- T Consensus 287 ------------~i~~~~---~~d~k~~q~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~-n--~~Sg~Al~~~ 347 (485) T protein:vir:10 287 ------------RILAFE---DAEGKIQQFSAAE-LANFTNALDQIAKQVAAYTGLPPQYLSTAAD-N--PASAEAIRAA 347 (485) T ss_pred ------------ceeccC---CCCceEEeecccc-hHHHHHHHHHHHHHHhcccCCCHHHhccccC-c--hhHHHHHHHH Confidence 234331 1223455555532 22 1222333334455557778777754432 1 1223457777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC-hhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_019530. 409 ELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTD-EKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFV 487 (560) Q Consensus 409 ElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit-~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~v 487 (560) +..+..-+.+.|+.|..-+...++.-+.+.|... ..+| ..|.+.|..- ...- +.+.++++..+..-+ T Consensus 348 ~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~-----~~i~v~w~~~-~~~~------~~~~ada~~kl~~ag 415 (485) T protein:vir:10 348 ESRLIKKVERKNSIFGGAWEEAMRLAYRMMKGGDVPPDM-----LRMETVWRDP-STPT------YAAKADAASKLYNGG 415 (485) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCcccc-----eeeeEEecCC-CCCC------HHHHHHHHHHHHhcc Confidence 7778888899999999888888887666655422 2233 1366777433 2221 123455555554432 Q ss_pred cchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCC-----CCCCCccccccccccccCCCCCCcccccccCCcccC Q lcl|NC_019530. 488 GSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGL-----YPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDDS 559 (560) Q Consensus 488 Gky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~-----~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~ 559 (560) -..+|.+++++ .|.+++++++++++..++|...+. ...|.. +..+.++..+.+-+|+ .-+|+ |++ T Consensus 416 ~~~~s~et~~~-~lg~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~--~~~~~-~~~ 485 (485) T protein:vir:10 416 TGVIPRERARK-DMGYSIAEREEMRRWDEEEAAMGLGLIGTMVDPNP---TVPGSPSPAPAPKPAA--LESGG-DAA 485 (485) T ss_pred ccCCCHHHHHH-hCCCCHhHHHHHHHHHHHHHHHHHHHHHHhhccCC---CCCCCCCccccccCcC--CCCCC-CCC Confidence 24789999995 599999999988776666643221 111111 1111111111111111 11222 222 No 65 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=98.12 E-value=4.2e-06 Score=50.07 Aligned_cols=269 Identities=10% Similarity=0.117 Sum_probs=142.2 Q ss_pred CceEEEeeecccchhHHHHHHHHHHHHHHHHH-----hCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCc Q lcl|NC_019530. 107 ETPVTVNLDKVEGISDTVKESITECFKEVMHL-----MDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDS 181 (560) Q Consensus 107 ~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l-----l~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDP 181 (560) -.-+.+.+-..++.. .....++|+. +....-.+.+++.++++|.-|..++.|. ...+.+|++|+| T Consensus 1 ia~l~~~~~~~~~~~-------~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~---~G~~~~l~~l~~ 70 (278) T protein:vir:78 1 MASLPLKMYEDYKVV-------NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDI---YHQPSKLFLLNP 70 (278) T ss_pred CccceeEEEecCccc-------ccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECC---CCcEEEEEEECC Confidence 333344332221111 1222233321 2222334566777788898888777654 234899999999 Q ss_pred cccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeecccccCCCCcchh Q lcl|NC_019530. 182 RCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLANGIVPG 261 (560) Q Consensus 182 r~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~~~i~s 261 (560) -.|...+ +.+++ ..+|.|... .+..+.++.+.|.|..- ....+.-+..| T Consensus 71 ~~v~v~~-----~~~~~-------~~~y~~~~~------------------~g~~~~~~~~evih~~~-~~~~~~~~G~s 119 (278) T protein:vir:78 71 DVVEMLI-----ENQSR-------ELYYSIHAA------------------TGNKLIVHNMDMLHFKH-IVASNMVQGIS 119 (278) T ss_pred ceeEEEE-----cCCCc-------eEEEEEEcC------------------CceEEEEccccEEEECC-CCCCCCeeecc Confidence 8886532 11221 123333311 12345678888877651 11223335668 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHh Q lcl|NC_019530. 262 LLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAE 341 (560) Q Consensus 262 yL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlE 341 (560) .+..|.+++.....++... .+...+.| .-+++ .-++|.+..+++..+ ++++.. ...|. .+ .++ T Consensus 120 ~~~~~~~~i~~~~~~~~~~-~~~~~~~~-~~i~~-~~~~l~~e~~~~~~~----~~~~~~---~~~g~------~~-vl~ 182 (278) T protein:vir:78 120 PIDVLKNTTDFDNAVRTFN-LTEMQKPD-SFMLK-YGSNVGKEKRQQVLE----DFKQYY---EENGG------IL-FQE 182 (278) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHhcCCC-cEEEE-eCCCCCHHHHHHHHH----HHHHHh---ccCCC------ce-ecC Confidence 9999999999888887764 45555555 44444 446777766655433 333321 12232 22 221 Q ss_pred hhcccccCCCCccceeecCCCCCcchHHH---HHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHH-HH Q lcl|NC_019530. 342 DYWLPRREGQNATEIATVGGGNQLGEMDH---VNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKF-CA 417 (560) Q Consensus 342 DywLpRReGgrgTEIsTLpGg~nLgei~D---V~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kf-i~ 417 (560) .|+++..|. .+.-+++- .++..+.+.++++||.+.++...+.+ .+++++..+. |..+ |. T Consensus 183 ----------~g~~~~~l~--~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~--~sn~~~~~~~---~~~~~l~ 245 (278) T protein:vir:78 183 ----------PGVEIEPLP--KKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTN--FAKNEELNRF---YLQHTLL 245 (278) T ss_pred ----------CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHHHH---HHHHHHH Confidence 256777774 34444443 35788999999999999997544422 3333333322 5444 33 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecch Q lcl|NC_019530. 418 GLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSY 463 (560) Q Consensus 418 rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~ 463 (560) .+.. .+.+.|-+. +++++++.. ..+|+|+. |.- T Consensus 246 P~~~----~i~~~ln~~-----L~~~~e~~~--g~~~~f~~--~~l 278 (278) T protein:vir:78 246 PIVK----QYEEEFNRK-----LLTKTDREK--IGILNLTL--NLI 278 (278) T ss_pred HHHH----HHHHHHHhh-----cCChhHhcC--CceEEEec--ccC Confidence 3333 333444433 367776642 23455443 322 No 66 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=98.10 E-value=4.1e-06 Score=50.12 Aligned_cols=429 Identities=12% Similarity=0.093 Sum_probs=194.9 Q ss_pred hhhhhhhhhhhhhc--cc-ccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhh--c Q lcl|NC_019530. 12 FGGGGLVNAKVDTD--KL-AQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQ--Q 86 (560) Q Consensus 12 ~~~~~~~~~~~~~~--~~-~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~--~ 86 (560) .|....+++++.-- ++ -.+--++..++. -++.++ ...+.|++++.|.. + T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~-~i~~~~-------------------------~~~~ri~~~~~~y~g~~ 54 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDP-RISIDP-------------------------DEYVRIQTDLDYYSDKL 54 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhccc-ccccCH-------------------------HHHHHHHHHHHHhcCCC Confidence 22233333332110 00 000011111100 011110 01112222223311 1 Q ss_pred cc------------------------hhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_019530. 87 PE------------------------IRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFD 142 (560) Q Consensus 87 PE------------------------vd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~ 142 (560) |- |+..-.-+.+| ++++.+++.+ ...+..+.++.-.+|. T Consensus 55 ~~~~~~~~~~~~~~~~~~sln~~~~i~~~~A~lv~~e---------~~~i~v~~~~--------~~~e~l~~il~~n~f~ 117 (508) T protein:vir:15 55 QYIHYQASDGIKKKRLKNTINMAKTAARRIASVVFNE---------KAEIHVKDNN--------EADKFLNDVLEDNDFK 117 (508) T ss_pred cccccccCCCCccccceeecchHHHHHHHHHhhhhCC---------CceEEeCCch--------HHHHHHHHHHHhccHH Confidence 11 11111112222 4444443321 1223347788888899 Q ss_pred hhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCcccc-c--cccc-------cchhee-- Q lcl|NC_019530. 143 NTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGI-E--AIES-------VTLKYY-- 210 (560) Q Consensus 143 ~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~-~--~~~~-------~~e~~~-- 210 (560) ....+.+....+-|..+|+..+|.. + ..+.+++|.++-+++ .+. +++ + ++.. -..||+ T Consensus 118 ~~~~~~~e~a~a~G~~~~k~~~d~~----~-~~i~~v~ad~~~P~~----~d~-~~~~~~af~~~~~~~~~~~~~~yt~l 187 (508) T protein:vir:15 118 NKFEEALEKGVALGGFAMRPYIDGN----H-IKIAWVRADQFYPLQ----SNT-NDISEAAIASRTQRTESNQTKYYTLL 187 (508) T ss_pred HHHHHHHHHHhhcCceEEEEEEeCC----e-eEEEEEcCCeeEEEE----EcC-CCeEEEEEEEEEEeecCCCceEEEEE Confidence 9999999999999999999999853 2 458889998865542 111 111 0 1100 011221 Q ss_pred ----ec---Ccccc-ccccccccccccccccccce----ecchhceeeeeccc-----c--------cCCCCcchhhhHH Q lcl|NC_019530. 211 ----YN---PNYNR-NQFTGQSGTSQNFQPSQQEL----VFDDESIVYIDSGE-----E--------PLANGIVPGLLNP 265 (560) Q Consensus 211 ----Y~---p~~~~-~~~~g~~~~~~~~~~~~~~~----~i~~daI~y~hsGl-----~--------~~~~~~i~syL~~ 265 (560) .. |..-. ..|-..+..+.....+...+ -++ +.+++.+..- + +......+|-++. T Consensus 188 E~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~-~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~ 266 (508) T protein:vir:15 188 EFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELA-PQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDN 266 (508) T ss_pred EEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccCCC-cceEecCCCcceeEEecCCccccccCCCCcCCchHhh Confidence 10 00000 00111000000000000000 011 1122222110 0 1122335688888 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcc--cCcccchhhHhhh Q lcl|NC_019530. 266 AIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKI--TGNAHLMGIAEDY 343 (560) Q Consensus 266 AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev--~d~~~~msmlEDy 343 (560) |+-.+..|-..-+. +.|.+|.=.+|||.-+.= +-+|..+|.. .+++-+..|-. T Consensus 267 ~~~lid~lD~~~s~--~~~e~~~~~~~i~v~~~~---------------------l~~d~~~~~~~~~~~~~~~~~~~-- 321 (508) T protein:vir:15 267 AKHVLDDINDTHDQ--FIWEIRLGQKHIAVQPGM---------------------LRFDDEHKPTFDTEQNVYVGVLS-- 321 (508) T ss_pred hHHHHHHHHHHHHH--HHHHHHhcccceeechHH---------------------hcCCCCCccccCCCCeeEEeccC-- Confidence 88888777766655 558889888888862210 0123333322 11121222111 Q ss_pred cccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHH Q lcl|NC_019530. 344 WLPRREGQNATEIATVGGGNQLG-EMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRR 422 (560) Q Consensus 344 wLpRReGgrgTEIsTLpGg~nLg-ei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~r 422 (560) +...|.-|+.+...--.+ -.+-+..+.+.+....+++.+-|+.+++ +-..|+||...+-.-..-+.+.|+. T Consensus 322 -----~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~---~~~TAtei~s~~~~~~~t~~~~~~~ 393 (508) T protein:vir:15 322 -----DDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSND---GVKTATEVVSNNSMTYQTRSSYLTM 393 (508) T ss_pred -----CCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccC---ccccHHHHHHHHHHHHHHHHHHHHH Confidence 112223355555422222 2455777888999999999998886654 2234788876666666667777777 Q ss_pred HHHHHHHHHHHHHHhcc-------C---CChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhh Q lcl|NC_019530. 423 YSHFFMEFLRRQLILKG-------V---TDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFS 492 (560) Q Consensus 423 fs~if~d~Lk~qLiLkg-------i---it~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S 492 (560) |...+.++++.-|-+-. . .+..... ....|.++|... ..+-+++++-+ +.++..- | ..| T Consensus 394 ~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~--~~~~v~v~f~D~--i~~d~~~~~~~-----~~~~v~a-G-i~s 462 (508) T protein:vir:15 394 VEKAIDELCQSIFELANAGALFDDGKPLFTLDSAS--QPLDIECHFDDG--VFVNKDKQLEE-----DAKVLAI-G-ALS 462 (508) T ss_pred HHHHHHHHHHHHHHHHHHhcccccccccccccccc--CCcceEEEeCCC--CCCCHHHHHHH-----HHHHHhc-C-CCC Confidence 77777776666543321 1 1111100 011244444322 22223333222 2222211 3 588 Q ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCC Q lcl|NC_019530. 493 IDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSP 542 (560) Q Consensus 493 ~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~ 542 (560) +++.+++....||||.+++-++|++|.... .|....+ +...|+.++ T Consensus 463 ~e~~i~~~~g~~deea~~el~ri~~E~~~~---~~~~~~~-~~~~g~~ge 508 (508) T protein:vir:15 463 KQTFLQRNYGMTDEQAAEELAKIQSEAPTD---TFEGGRS-AILNGGDGE 508 (508) T ss_pred HHHHHHhcCCCChHHHHHHHHHHHHhcccc---Ccccccc-ccCCCCCCC Confidence 888877778999999999999999996332 1111111 122233332 No 67 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=98.02 E-value=6.8e-06 Score=48.92 Aligned_cols=422 Identities=12% Similarity=0.138 Sum_probs=188.4 Q ss_pred eEec------cccce-ecccc------ccch---hhHHHHHHHHHHHhh--ccchhHHHHH------------------h Q lcl|NC_019530. 53 ILQG------GANTY-NYVGV------ESEL---LSVKTVVEEYQSMAQ--QPEIRKAVDI------------------I 96 (560) Q Consensus 53 ~i~~------~~~~~-~y~~~------e~~~---~~~~~LI~~YR~ma~--~PEvd~AI~e------------------I 96 (560) .++. +...- .+... ...+ .+..+.|+.++.+.. |+.+...-.. | T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~~k~i 80 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSMNLPKVT 80 (496) T ss_pred ChhHHHHHHHHHHHHhccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeecchHHHH Confidence 3331 00000 01000 0110 112234555555532 3332211100 1 Q ss_pred hcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEe Q lcl|NC_019530. 97 VNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKL 176 (560) Q Consensus 97 vneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eL 176 (560) +++..-+ --..||++.+++ +...+.++.+++-.+|.+...+++....+.|..|+|..+|.+ |=..+ T Consensus 81 ~~~~a~~-l~~~p~~i~~~d---------~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~----~~~~i 146 (496) T protein:vir:38 81 AKYMSKL-LFNEKVKINIDD---------KAAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGN----KNVKV 146 (496) T ss_pred HHHHhhh-hhCCcceEeeCC---------hHHHHHHHHHHhccCHHHHHHHHHHHHhhhCcEEEEEEEcCC----CcEEE Confidence 1110000 012344444443 245566778888888999999999999999999999999863 33568 Q ss_pred EEeCccccchhhhhcccCCccccc---cccc---cchhee-----e-c--C----cccccccccccccccc----ccccc Q lcl|NC_019530. 177 VMLDSRCIRPVYIVEKAMREGGIE---AIES---VTLKYY-----Y-N--P----NYNRNQFTGQSGTSQN----FQPSQ 234 (560) Q Consensus 177 r~LDPr~i~~vr~i~~~~~~g~~~---~~~~---~~e~~~-----Y-~--p----~~~~~~~~g~~~~~~~----~~~~~ 234 (560) .+++|.++-+++- +. +.+. ++.. -..+|. + . + +..+...++.+.+..+ ..... T Consensus 147 ~~v~~~~~~P~~~----~~-~~~~~~~f~~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~ 221 (496) T protein:vir:38 147 SFATADCMYPLSN----DS-ENVDECVIANSFHKNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDDI 221 (496) T ss_pred EEEcccceEEEEe----cC-CcEEEEEEEEEEEeCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCcccccccccccc Confidence 9999998766521 11 2111 1100 000110 0 0 0 0000000110000000 00000 Q ss_pred c-cee---cchhceeeeecc---cccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHH Q lcl|NC_019530. 235 Q-ELV---FDDESIVYIDSG---EEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAE 307 (560) Q Consensus 235 ~-~~~---i~~daI~y~hsG---l~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAe 307 (560) . ... ++.-.++|.--- -.+.......|-|+.++..+..|-..=..+ .+-++.-.+|+|. +.. T Consensus 222 ~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~--~~~~~~~~~~i~v-~~~-------- 290 (496) T protein:vir:38 222 EPVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSY--YQEFKLGKKKVLV-PSS-------- 290 (496) T ss_pred ccceeecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHH--HHHHhhcccceec-chH-------- Confidence 0 000 011111111000 011223345678888887777775554443 3667776666654 211 Q ss_pred HHHHHHHHhhhhheeecccCCcccCcccchhhHhhhccccc-CCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCC Q lcl|NC_019530. 308 EYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRR-EGQNATEIATVGGGNQLGE-MDHVNYFREKLYDALMIP 385 (560) Q Consensus 308 qYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRR-eGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP 385 (560) +++ . . -|..++.. .-++.-.+.|-...- +++.+.-|+++.+.-...+ ..-++.+.+.+....++| T Consensus 291 -~l~----~---~--~~~~g~~~---~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~ 357 (496) T protein:vir:38 291 -FVK----T---A--VNLDGSTT---QYFDSTDEAFFLYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLS 357 (496) T ss_pred -Hhh----c---c--CCCCCccc---cCCCCccceEEEeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCC Confidence 110 0 0 01111111 011111111211111 1222234666655322221 344677788999999999 Q ss_pred ccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHhccCCChhHHHHhhhhhceeee Q lcl|NC_019530. 386 KSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRR-------QLILKGVTDEKDWNEKIKPFIKFEF 458 (560) Q Consensus 386 ~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~-------qLiLkgiit~eew~~~i~~~I~~~f 458 (560) -+-|+.+++ |-..|+||.-....-..-+.+.++.|...+.++++. +..++|.. .+. ..+.+.| T Consensus 358 ~~~f~~~~~---g~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~~-~~~------~~i~v~f 427 (496) T protein:vir:38 358 AGTFTFDEN---GLKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKFIEAYSGEV-VEL------DTITVDF 427 (496) T ss_pred hhhcCCCcc---ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC-CCc------cceEEEe Confidence 999887654 223467775444443334444454454544444443 34445542 121 2366777 Q ss_pred eecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccccccc Q lcl|NC_019530. 459 TSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGS 538 (560) Q Consensus 459 ~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~ 538 (560) .. +...- . .+. ++.+.++-. .| .+|.++++++....||+|.+++.++|++|.... . |+.+..+-.+.. T Consensus 428 ~d-~i~~d-~-~~~----~~~~~~~~~-~G-iiS~et~l~~~~~~~d~ea~~el~ri~~E~~~~-~--~~~d~~~~~~~~ 495 (496) T protein:vir:38 428 DD-SIAQD-E-DTT----INRYTNAKN-QG-MIPLKIALQRAWNITEAEADEWAEMLAKEKQAE-M--PNNDMNGIFGEE 495 (496) T ss_pred CC-CCCCC-H-HHH----HHHHHHHHh-cC-CCCHHHHHHhcCCCChHHHHHHHHHHHHhhhcc-C--ccccccCCCCCC Confidence 63 22221 0 111 222222211 14 589999988888999999999999999996433 1 222211111111 Q ss_pred c Q lcl|NC_019530. 539 D 539 (560) Q Consensus 539 ~ 539 (560) + T Consensus 496 e 496 (496) T protein:vir:38 496 E 496 (496) T ss_pred C Confidence 1 No 68 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=97.98 E-value=8.2e-06 Score=48.47 Aligned_cols=477 Identities=10% Similarity=0.021 Sum_probs=221.9 Q ss_pred cccchhhhc--cccccccCC-CCCCCCceeEe--ccccceeccccccc-hhhHHHHHHHHHHH-hhccchhHHHHHhhcc Q lcl|NC_019530. 27 LAQKQDELL--LTKATVVAL-DDAQDGSIILQ--GGANTYNYVGVESE-LLSVKTVVEEYQSM-AQQPEIRKAVDIIVND 99 (560) Q Consensus 27 ~~~~~~e~~--~~k~~s~~~-~~~~dG~~~i~--~~~~~~~y~~~e~~-~~~~~~LI~~YR~m-a~~PEvd~AI~eIvne 99 (560) ++.+--... .....+... .....|+.... ..+......+.+.. ..+-..|..+=|.| .++|-+..||+.+++- T Consensus 1 ~~~p~~~~~~~~~~~~~~~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~n 80 (533) T protein:vir:34 1 MKTPTIPTLLGPDGMTSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQDH 80 (533) T ss_pred CCCchhhhhhcccccchHHHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHH Confidence 111100000 000000000 00111110000 01111111122222 22455577788888 5999999999998887 Q ss_pred eeee----cCCCceEEEeeecccchhHHHHHHHHHHHHHHHHH----------hCCchhhHHHHHHhhhhhhhhhhheec Q lcl|NC_019530. 100 VVTC----EEDETPVTVNLDKVEGISDTVKESITECFKEVMHL----------MDFDNTAYQKIRKWYVDGRQAYHVIVD 165 (560) Q Consensus 100 aiv~----d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l----------l~f~~~~~~~~R~wYvDGr~~~hkiiD 165 (560) +|=. .-.-+-..|.++.. -.+.+.+.|..+|..+++- ++|..--...+|.|.+||..|-.+..+ T Consensus 81 vVG~Gi~~~~~p~~~~lg~~~~--~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~ 158 (533) T protein:vir:34 81 IVGSFFRLSHRPSWRYLGIGEE--EARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWD 158 (533) T ss_pred hhCCCceeeeccchhhcCCChh--HHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEeeec Confidence 7532 11111122445553 3678899999999998753 344445567899999999999988776 Q ss_pred CCCCccCe--eEeEEeCccccchhhhhcccCC--cccccccccc--chheeec--Cccccccccccccccccccccccce Q lcl|NC_019530. 166 PTNKKGGI--KKLVMLDSRCIRPVYIVEKAMR--EGGIEAIESV--TLKYYYN--PNYNRNQFTGQSGTSQNFQPSQQEL 237 (560) Q Consensus 166 ~~~~k~GI--~eLr~LDPr~i~~vr~i~~~~~--~g~~~~~~~~--~e~~~Y~--p~~~~~~~~g~~~~~~~~~~~~~~~ 237 (560) +. +.+- ..|..|||-+|.--+.-. +.. ..||++.+.. ..|.+|. |....... .......+ T Consensus 159 ~~--~g~~~~~~lq~ie~d~l~~~~~~~-~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~---------~~~~~~~~ 226 (533) T protein:vir:34 159 TS--SSRLFRTQFRMVSPKRISNPNNTG-DSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQK---------WTWIPREL 226 (533) T ss_pred cC--CCCccceEEEEechhhcCCCCCCC-CCCceEeeeEECCCCCeEEEEEeecCCCCccccc---------cceeeeee Confidence 53 1211 368899998776432111 000 1334432222 2455553 22111100 00011244 Q ss_pred ecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhh Q lcl|NC_019530. 238 VFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKF 317 (560) Q Consensus 238 ~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~ 317 (560) .++.+.|.|..... ...---.+|.|..+++.+.+|.-.+||...-...-|-.=-++.=+.+......+ +- T Consensus 227 ~v~a~~VlH~f~~~-r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~---~~------ 296 (533) T protein:vir:34 227 PGGRASFIHVFEPV-EDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDF---IL------ 296 (533) T ss_pred ccChhHeeeecccc-CCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCccccccc---cc------ Confidence 57778888888653 333445679999999999999999999999888888664443333322111000 00 Q ss_pred hhheeecccCC-cccCcccchhhHhhhcccc---cCCC------CccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCc Q lcl|NC_019530. 318 KNRNAYDRTTG-KITGNAHLMGIAEDYWLPR---REGQ------NATEIATVGGGNQLGE-MDHVNYFREKLYDALMIPK 386 (560) Q Consensus 318 knk~vYd~~TG-ev~d~~~~msmlEDywLpR---ReGg------rgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~ 386 (560) ....+ .-..-...++...++.-.+ =++| -|.+|+.+.....-+. .+-++-..+.+=.+|+||- T Consensus 297 ------~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~y 370 (533) T protein:vir:34 297 ------GANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSY 370 (533) T ss_pred ------CCCcccccccccccchhhhhccCcceeeccCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCH Confidence 00000 0000000111111111000 0111 1333443333322223 2344455566667899999 Q ss_pred cccCCC-CcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHhccCCChh-----HHHHhhhhh--c Q lcl|NC_019530. 387 SRLQEE-GSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRR----QLILKGVTDEK-----DWNEKIKPF--I 454 (560) Q Consensus 387 sRl~~~-~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~----qLiLkgiit~e-----ew~~~i~~~--I 454 (560) +-|..+ ++.|. |-+--.-+.|-+++.++|+.|..=|..++-. ..+|.|.+..- +|....... . T Consensus 371 e~lt~D~s~~nY-----SS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~ 445 (533) T protein:vir:34 371 EQLSRNYAQMSY-----STARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNC 445 (533) T ss_pred HHHhhhcccccH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhce Confidence 988665 33332 1222345669999999999887655444333 45678877521 111111111 2 Q ss_pred eeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhh----hcCCCCCCCcc Q lcl|NC_019530. 455 KFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEK----KKGLYPKVQAD 530 (560) Q Consensus 455 ~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~----~~~~~~~p~~~ 530 (560) .|..-.=-+.--+||+.-...+++. -.-|.+-+..+ .+..- ++...||+.|+ +.|+-.+-... T Consensus 446 ~w~~p~~~~iDP~Ke~~a~~~~i~~---------G~~s~~~~~a~-~G~D~---~ev~~q~a~e~~~~~~~gl~~~~~~~ 512 (533) T protein:vir:34 446 DWIGSGRMAIDGLKEVQEAVMLIEA---------GLSTYEKECAK-RGDDY---QEIFAQQVRETMERRAAGLKPPAWAA 512 (533) T ss_pred eeccCCccccChHHHHHHHHHHHHc---------CCCCHHHHHHH-cCCCH---HHHHHHHHHHHHHHHhcCCCCCCCCC Confidence 2222333344556666555444432 23444544444 33332 33333333332 44543221110 Q ss_pred ccccccccccCCCCCCcccccccCCccc Q lcl|NC_019530. 531 ETGNYSGSDVSPLKFKPETIPFSGSTDD 558 (560) Q Consensus 531 e~g~~~~~~~~~~~~~p~~~~~~~~~~~ 558 (560) ....... -+++--|.++++-- T Consensus 513 ----~~~~s~~---~~~~~~~~~~~~~~ 533 (533) T protein:vir:34 513 ----AAFESGL---RQSTEEEKSDSRAA 533 (533) T ss_pred ----cCccCCC---CCCCCCCcccCCCC Confidence 0000000 01111122222211 No 69 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=97.95 E-value=9.3e-06 Score=48.16 Aligned_cols=439 Identities=12% Similarity=0.103 Sum_probs=182.2 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) +++..-++.+..=|. ..++...+.+.....-+ . | +++.+ +-..|- +.+..+|-|.+||+ T Consensus 1 ~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~-------~----~--~~pp~-~~~~La---~~~~~n~~v~scI~ 59 (540) T protein:vir:41 1 MFNYHLSIKSLEKYR----AIKGDTDSQALKEDRFE-------E----Y--VEPKV-HPLVLL---SLLQVNPYHASACS 59 (540) T ss_pred CCCcccChhhccchh----hhhccccccccccCCCC-------c----c--ccCCC-CHHHHH---HHHHhcHHHHHHHH Confidence 444444444432111 11111222111111000 0 0 01111 222233 22356788899997 Q ss_pred HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhH----HHHHHhhhhhhhhhhheecCCCCc Q lcl|NC_019530. 95 IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAY----QKIRKWYVDGRQAYHVIVDPTNKK 170 (560) Q Consensus 95 eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~----~~~R~wYvDGr~~~hkiiD~~~~k 170 (560) -|.+.+.-+. ..+.-+... +.+. +-+....+. .++..+++.|.-|+.++.|. . T Consensus 60 ~ia~~ia~~~-----~~i~~~~~~-----~~~~----------lpN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~---~ 116 (540) T protein:vir:41 60 IKANDILRTG-----YLIDGDDGG-----VEEL----------LRACRPSFEFILLQALEDLQVFNYCTLEVVRDD---Q 116 (540) T ss_pred HHHHHHhcCC-----ceEecCccc-----hhhh----------ccCCCCCHHHHHHHHHHHHHhcCCeEEEEEECC---C Confidence 7766654322 222211111 1111 112233333 44556777899999888754 3 Q ss_pred cCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeecc Q lcl|NC_019530. 171 GGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSG 250 (560) Q Consensus 171 ~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsG 250 (560) .-+.+|.+|+|..|+..+ +..+.+....+...+|+-. +.+.+. ...........++.+.|.|.+.. T Consensus 117 G~~~~L~~i~~~~V~v~~-----~~~~~~~~~d~~~~~~~~~-----~~~~~~----~~~~~g~~~~~~~~~eViHir~~ 182 (540) T protein:vir:41 117 GEPVRLDYIPAHTVRVHR-----DGSRYMQTWDGIHVTYFKD-----YRYEGE----VNPDNGEDQDGVGANEIIFIHLP 182 (540) T ss_pred CcEEEEEEeCCcceEEeE-----cCceeEeeecCceeeeeec-----ccccce----eeccccccceeecccceEEecCC Confidence 458999999999887543 1111111111111111100 000110 01112233456788888887643 Q ss_pred cccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCch----HHHHHHHHHHHHhhhhheeeccc Q lcl|NC_019530. 251 EEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGK----KSAEEYMTMMMGKFKNRNAYDRT 326 (560) Q Consensus 251 l~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk----~KAeqYl~~im~k~knk~vYd~~ 326 (560) -..++-+.+|-|..|.+++.....+++...=+----|--.-|..++.+-.++ .++.+-+++.+.++-.. .. T Consensus 183 -~~~~~~~G~Spi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~----~~ 257 (540) T protein:vir:41 183 -SPICSYYGVPRYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIED----NF 257 (540) T ss_pred -CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHH----Hh Confidence 1234456778999999988888777776533322233445566666333222 22333333333222111 11 Q ss_pred CCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchH---HHHHHHHHHHHHhcCCCccccCCCCcccccccchh Q lcl|NC_019530. 327 TGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEM---DHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLA 403 (560) Q Consensus 327 TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei---~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~ 403 (560) .| +.++.+-.-.+|.=. .+..|.+++.|. .+..++ +-.++..+.+.++++||...++...+....++++. T Consensus 258 ~g-~~~nag~~~vLe~~~----~~~~g~~~~pl~--~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~e 330 (540) T protein:vir:41 258 KY-LKEAPHTPLVFSIPG----GDTVEVTFTPLN--TSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAE 330 (540) T ss_pred cc-ccccccceEEEecCC----CcccceeEEecc--cchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHH Confidence 12 111111111233110 012345555553 333333 34457788899999999999964332234455555 Q ss_pred hhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 404 EITQEELRFSKF-CAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNT 482 (560) Q Consensus 404 eItRDElkF~Kf-i~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~ 482 (560) +..+. |.+. +.-+.+++...+...|.++ .+ ..+.+.|..+. +...++ ..+++.+. T Consensus 331 q~~~~---f~~~tL~P~~~~ie~~ln~~L~~~---------~~------~~~~i~f~~~~----ll~~D~-~~~~~~lv- 386 (540) T protein:vir:41 331 VARRT---YYESVVRPQQEIVSSVLTDFIQLK---------LD------PGARFVFNEEI----LMESEF-VHNYALLV- 386 (540) T ss_pred HHHHH---HHHHHHHHHHHHHHHHHHHhhhhc---------cC------CceEEEecchh----hcchHH-HHHHHHHH- Confidence 55444 6554 5666666665555443221 11 12344454442 222221 22333221 Q ss_pred hhhhhcchhhHHHHHHHHhCCCHHHHHHHH------HHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccccCCc Q lcl|NC_019530. 483 VEPFVGSIFSIDYVMRNVLRMSDEEVKEQQ------TKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGST 556 (560) Q Consensus 483 ~~p~vGky~S~~~i~k~IL~~tDeEI~e~~------kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~ 556 (560) -+-++|.+-++.+++.+..-.-.-+. ..+....+++-.++|.+.+ -..-+.+|.+-+- -++ T Consensus 387 ----~~G~lT~NE~Re~L~g~e~gdd~~l~p~n~~~~~~~~~~~~~~~~~~~~~~--------k~~~~~~~~~~~~-~~~ 453 (540) T protein:vir:41 387 ----QCGVLTPSEVREKLFGLDGGPDMFMVPSSIGKSAMKRQKRNYEKNQINEIK--------RTYAKYKPRIQEI-ISS 453 (540) T ss_pred ----hCCCCCHHHHHHHhCcCcCCCcccccccccccccccccccccCCCCccccc--------cccchhcccccCc-ccc Confidence 12367777777554344211100000 0000000000000000000 0000111111110 011 Q ss_pred ccCC Q lcl|NC_019530. 557 DDSI 560 (560) Q Consensus 557 ~~~~ 560 (560) +.++ T Consensus 454 ~~~~ 457 (540) T protein:vir:41 454 ESPL 457 (540) T ss_pred cccc Confidence 1111 No 70 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=97.91 E-value=1.1e-05 Score=47.68 Aligned_cols=425 Identities=11% Similarity=0.038 Sum_probs=180.6 Q ss_pred cchhhhccccccccCCCCCCCCceeEeccccceeccccccch--hhHHHH--------HHHHHHHhhccchhHHHH---- Q lcl|NC_019530. 29 QKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESEL--LSVKTV--------VEEYQSMAQQPEIRKAVD---- 94 (560) Q Consensus 29 ~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~--~~~~~L--------I~~YR~ma~~PEvd~AI~---- 94 (560) .+ .++ -|.-.++. |...-...+.++ ....++ ..+|+.+..+.+-+..|. T Consensus 1 ~~----~~~-----------~~~~~~~~--~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~~~~~ 63 (470) T protein:vir:99 1 MK----DIN-----------YGRDKVTG--NSSFIFPKGEKLTSNELLGFIAYNETVLKPRYRENMKLYLGKHKILTAPE 63 (470) T ss_pred Cc----ccc-----------CCcccccC--CceEEeCCCCCcCHHHHHHHHHHHHHhhHHHHHHHHHHhccccccccCcc Confidence 00 000 00000000 000000011111 111111 224555555444332211 Q ss_pred ---------------HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhh Q lcl|NC_019530. 95 ---------------IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQA 159 (560) Q Consensus 95 ---------------eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~ 159 (560) .||+ ..+.---..||++..++.+ +..+.+..+..-.+|+....++.+..++.|+-| T Consensus 64 ~~~~~~~ki~~n~~~~Ivd-~~~~~l~g~p~~~~~~~d~--------~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~ 134 (470) T protein:vir:99 64 KETGADNRIVVNSAKYVVD-VYNGYFCGIEPKLALLNDS--------SKIDEIARWNRQENFFDTINEISKQCDIFGRSI 134 (470) T ss_pred cccCCcceeecchHHHHHH-HHhhhhccCCeeEeeCCch--------hHHHHHHHHHHhcCHhHHHHHHHHHHHhcCeeE Confidence 1111 1111112334555443321 234456667777889999999999999999988 Q ss_pred hhheecCCCCccCeeEeEEeCccccchhhhhcccC-Ccccccccc--c----cchheeecCcccccccccccccc--ccc Q lcl|NC_019530. 160 YHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAM-REGGIEAIE--S----VTLKYYYNPNYNRNQFTGQSGTS--QNF 230 (560) Q Consensus 160 ~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~-~~g~~~~~~--~----~~e~~~Y~p~~~~~~~~g~~~~~--~~~ 230 (560) .+.-+|. +|-..+..+||+.+-++..-.... ....+.... . ..-+.+|.+..-+....+..... ... T Consensus 135 ~~v~~d~----dg~~~i~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (470) T protein:vir:99 135 ASIYQGE----DARPHLMYSSPNHAFIIYDDTVQRQPLAFVHYQIDNSNNWTDAYGVIQYADKFYKFKGYDIEEDTNAAG 210 (470) T ss_pred EEEEeCC----CCeEEEEEEccceeEEEEcCCCCcceEEEEEEEEEecCCeeEEEEEEEecCeEEEEEeccccccccccc Confidence 7666543 466789999999987763211000 000011100 0 00112344332221111110100 000 Q ss_pred cccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHH Q lcl|NC_019530. 231 QPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEY 309 (560) Q Consensus 231 ~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqY 309 (560) .....--++|. |.|. ++....|=++..+....-+. ++=+....-+..+.|.+-+.- ..++..+.-+- T Consensus 211 ~~~~~~g~vPv--v~~~-------n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g---~~~~~~~~g~~ 278 (470) T protein:vir:99 211 YAINPYGLVPA--VEFF-------ENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIG---FKLPEDDEGNP 278 (470) T ss_pred ccccCCCccce--Eeec-------CCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeec---CCcccccccch Confidence 00000001221 1111 22223344444444433332 455556666677777665543 22221111011 Q ss_pred HHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHH-HHHHHHHHHHHhcCCCccc Q lcl|NC_019530. 310 MTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMD-HVNYFREKLYDALMIPKSR 388 (560) Q Consensus 310 l~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~-DV~YF~~kLy~aL~VP~sR 388 (560) +..+ -.+++. -+|=-+++.+..+.+|....+..... -+.-+.+.+|...++|-.- T Consensus 279 ~~~~---~~~~~~---------------------~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~ 334 (470) T protein:vir:99 279 KFDF---KNNRVL---------------------YVSQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQ 334 (470) T ss_pred hhhh---hhccee---------------------eecCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCcccc Confidence 1110 001211 12222334455677887766666554 4788889999999999422 Q ss_pred cCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC--ChhHHHHhhhhhceeeeeecchHHH Q lcl|NC_019530. 389 LQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVT--DEKDWNEKIKPFIKFEFTSDSYIRE 466 (560) Q Consensus 389 l~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgii--t~eew~~~i~~~I~~~f~~Dn~f~E 466 (560) ++.. + |...+..|...+.....-+.+.++.|...+.+.++.-+-+-+.. ...+|. .|.+.|. ++.-.- T Consensus 335 ~~~~-~---~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~-----~i~v~f~-~~~p~~ 404 (470) T protein:vir:99 335 DKNF-A---GNSSGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNNKQDQELWS-----ELDFKFT-RNLPED 404 (470) T ss_pred cccc-c---cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccc-----cceEEeC-CCCCcC Confidence 2221 1 22234556555555566677777777777777766543332332 233332 3677774 333321 Q ss_pred HHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCH--HHHHHHHHHHHHhhhcC-CCCCCCccccccccccccCCC Q lcl|NC_019530. 467 QQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSD--EEVKEQQTKIAEEKKKG-LYPKVQADETGNYSGSDVSPL 543 (560) Q Consensus 467 lke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tD--eEI~e~~kqIe~E~~~~-~~~~p~~~e~g~~~~~~~~~~ 543 (560) ..+.++.+..+ +| .+|.++++.. |...| +|++.++++-++..... ..-.|.....++..+++ T Consensus 405 ------~~e~a~~~~kl---~g-iis~et~l~~-l~~vd~~~E~eri~~E~~~~~~~~~~~~~~~d~~~~d~~~ee---- 469 (470) T protein:vir:99 405 ------MASAIDNAKNA---EG-IVSKKTQLGM-IPDIEPDAEMKQIAKEKADAIKQTQQLSMPIDILKRDNNAEE---- 469 (470) T ss_pred ------HHHHHHHHHHH---hc-cCCHHHHHHh-CCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCCcCCCCCCccC---- Confidence 11334445544 35 4899999977 55554 44443333322221100 00011111111111111 Q ss_pred CCCcccccccCCccc Q lcl|NC_019530. 544 KFKPETIPFSGSTDD 558 (560) Q Consensus 544 ~~~p~~~~~~~~~~~ 558 (560) + T Consensus 470 --------------~ 470 (470) T protein:vir:99 470 --------------E 470 (470) T ss_pred --------------C Confidence 1 No 71 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=97.90 E-value=1.2e-05 Score=47.59 Aligned_cols=437 Identities=11% Similarity=0.100 Sum_probs=183.0 Q ss_pred hhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchh Q lcl|NC_019530. 11 LFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIR 90 (560) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd 90 (560) +|.-..|+....+..++..++ ..+...+ .+.+ +.+. ++.+ +-..|-+ -+..+|-|. T Consensus 1 ~~~~~~~i~s~~~~~~i~~~~-------~~s~~~~-~~~~------~~~~------~pp~-~~~~la~---l~~~n~~v~ 56 (542) T protein:vir:41 1 MFNYHLSIRSLEKYKAIKREE-------VESQALG-ETRF------EEYV------EPKV-NPLVLLS---LLQVNPYHA 56 (542) T ss_pred Cccccccccccccchhhhhcc-------ccccccc-cccC------Cccc------cCCC-CHHHHHH---HHhhcHHHH Confidence 555555555544333332111 1110000 0000 0111 1111 2233332 224578899 Q ss_pred HHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHH-HhCCchhhHHHHHHhhhhhhhhhhheecCCCC Q lcl|NC_019530. 91 KAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMH-LMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNK 169 (560) Q Consensus 91 ~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~-ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~ 169 (560) +||+-|.+.+.-+ |+ .++..+. .....|- -+ .+.+..-...+++.+++.|.-|.+++.|.+ T Consensus 57 scI~~ia~~IA~l-----~~--~~~~~~~-------~~l~~~l--pN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~-- 118 (542) T protein:vir:41 57 SACSIKANDIIRT-----GY--ILEGDDE-------GVVDEFI--RACKPSFEYVLLRALEDLQVFNYCTLEVVRDDR-- 118 (542) T ss_pred HHHHHHHHHHhhC-----ce--eeecccc-------hhhhhhc--CCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC-- Confidence 9998887766543 22 2322211 1111110 01 112222344566677888999998887663 Q ss_pred ccCeeEeEEeCccccchhhhhcccCCccccccccccch--heeecCccccccccccccccccccccccceecchhceeee Q lcl|NC_019530. 170 KGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTL--KYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYI 247 (560) Q Consensus 170 k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e--~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~ 247 (560) .-+.+|++|||.+|+..+ +..+.......... +..|.-. ..... ........++.+.|.|. T Consensus 119 -G~~~~L~~l~~~~v~v~~-----d~~~~~~~~~~~~~~~~~~y~~~---~~~~~--------~~g~~~~~~~~~eIiHi 181 (542) T protein:vir:41 119 -GDPIRFEYIPSHTIRVHK-----DGSRYRQTWDGVNITHFKDYRYE---GEINP--------ETGEDQDSVGANELVFI 181 (542) T ss_pred -CcEEEEEEEcCcceEEEE-----cCCeeEeeecCCcceeEEeeccc---ccccc--------cccccccccCcccEEEe Confidence 458999999999887643 11111111111111 1112210 00111 11112345677777776 Q ss_pred ecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCC---------CCchHHHHHHHHHHHHhhh Q lcl|NC_019530. 248 DSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVG---------TLGKKSAEEYMTMMMGKFK 318 (560) Q Consensus 248 hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvG---------nlpk~KAeqYl~~im~k~k 318 (560) ... -..++-+.+|-+..|+..+.....+++...=+=--.+--+-|.++..+ .+-+...+..-+.+...|+ T Consensus 182 r~~-~~~~~~~Glspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~ 260 (542) T protein:vir:41 182 HIP-SPVCSYYGVPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFK 260 (542) T ss_pred cCC-CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHh Confidence 532 113444567899999998888877776654432233555667777543 2222223222222223332 Q ss_pred hheeecccCCcccCcccchhhHhhhcccccCC-CCccceeecCCCCCcchHH---HHHHHHHHHHHhcCCCccccCCCCc Q lcl|NC_019530. 319 NRNAYDRTTGKITGNAHLMGIAEDYWLPRREG-QNATEIATVGGGNQLGEMD---HVNYFREKLYDALMIPKSRLQEEGS 394 (560) Q Consensus 319 nk~vYd~~TGev~d~~~~msmlEDywLpRReG-grgTEIsTLpGg~nLgei~---DV~YF~~kLy~aL~VP~sRl~~~~~ 394 (560) .-. ...|+ .+ .++ .-.| ..|.+++.| +.+..+++ -..+..+.+.++++||...|+...+ T Consensus 261 g~~---~n~gk------~~-vL~-----~~~~~~~g~~~~pl--~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~ 323 (542) T protein:vir:41 261 HLK---EAPHT------PL-VFS-----IPGGDTVKVTFTPL--NTSQKELSFREYAAEKKYDIAAAHMIDPYRLGIADT 323 (542) T ss_pred hhh---cccCc------ee-Eee-----ccCCcccceeEEEc--CCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCC Confidence 100 01111 11 111 1111 134455554 33333333 3355678899999999999976555 Q ss_pred ccccccchhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHH Q lcl|NC_019530. 395 INIGGSNLAEITQEELRFSKF-CAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAIL 473 (560) Q Consensus 395 ~~l~~~~~~eItRDElkF~Kf-i~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~ 473 (560) ....++++.+..+. |.++ |.-+++++...+...| +++.++ .+.|.|..+.....-+ T Consensus 324 ~t~n~sn~Eq~~~~---f~~~tL~P~~~~ie~~ln~~L---------~~~~~~------~~~~~f~~~~ll~~d~----- 380 (542) T protein:vir:41 324 GPLGGNFAEVTRRT---YYESVVRPQQNIISSILTDFF---------QVKFNP------KTRFKFNDETLLESDS----- 380 (542) T ss_pred cccccccHHHHHHH---HHHHHHHHHHHHHHHHHHhhc---------ccccCC------ceEEEecchhhcchHH----- Confidence 44566655444333 5443 4556665555555332 222221 2455565443322211 Q ss_pred HHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHH--H-HHH---HHHHHHhhhcCCCCCCCccccccccccccCCCCCCc Q lcl|NC_019530. 474 NDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEE--V-KEQ---QTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKP 547 (560) Q Consensus 474 ~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeE--I-~e~---~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p 547 (560) ..++..+ .-+-++|.+-++.++..+..-+ . ... .++++...++. -++..++-+.. -.+..| T Consensus 381 ~~~~~~~-----v~~GilT~NE~Re~L~g~~pgdd~~l~p~~~~~~~~~~~~~n~-~~~~~~~~~k~-------~~k~~~ 447 (542) T protein:vir:41 381 VRNCALL-----VQSGVLTPAEARERLFGLDGGPDIFMVPSKGAAKSVKRQERNY-EKNQIREIRKI-------YAKYRP 447 (542) T ss_pred HHHHHHH-----HhCCCCCHHHHHHhhCCCCCCCccccccccccccccccCCcCC-CCCchhhhhhc-------ccccCc Confidence 1111111 1224677777765433332110 0 000 00000000000 00000000000 001111 Q ss_pred cccc----------------------------------ccCCcccCC Q lcl|NC_019530. 548 ETIP----------------------------------FSGSTDDSI 560 (560) Q Consensus 548 ~~~~----------------------------------~~~~~~~~~ 560 (560) ++-+ .-||.-+|. T Consensus 448 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (542) T protein:vir:41 448 RFNEIISSKLSAEEKKKKIDESLAEFRAEAYEAGKKMLIIGGDMGSM 494 (542) T ss_pred cccccccccccchhhcccccchhhhhHHhHHhcCceEEEeecCchhh Confidence 1111 011111111 No 72 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=97.88 E-value=1.3e-05 Score=47.40 Aligned_cols=443 Identities=14% Similarity=0.194 Sum_probs=181.7 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCC----CCC-CceeEeccccceeccccccchhhHHH Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDD----AQD-GSIILQGGANTYNYVGVESELLSVKT 75 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~----~~d-G~~~i~~~~~~~~y~~~e~~~~~~~~ 75 (560) .--|- +.++.+.++-..... ++.... .-| +.+..++-+++ ..+++.-=.- T Consensus 60 ~~~~~------------------~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~l~~~-----~~~~F~Gy~~ 114 (694) T protein:vir:10 60 AEPSP------------------SLRLARQFEVDVSNY--TPRERRAASYALDFNGTSMDALSFV-----TSSGFPGFPT 114 (694) T ss_pred CCCCc------------------chhhhhhccccccCC--CccccchhhhhhccCcccccchhhh-----hccCcchHHH Confidence 11110 111111111000000 000000 000 01111111000 0111211122 Q ss_pred HHHHHHHHhhccchhHHHHHhhcceeee------cCCCc----eEEEeeecccchhH-HHHHHHHHHHHHHHHHhCCchh Q lcl|NC_019530. 76 VVEEYQSMAQQPEIRKAVDIIVNDVVTC------EEDET----PVTVNLDKVEGISD-TVKESITECFKEVMHLMDFDNT 144 (560) Q Consensus 76 LI~~YR~ma~~PEvd~AI~eIvneaiv~------d~~~~----~V~l~Ld~~~~~s~-sik~~I~eeF~~il~ll~f~~~ 144 (560) |- .|||+||.+++++-|..||+-. ...++ .+++.-+..+ .++ .-.++|..|++. |+...+ T Consensus 115 la----~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~-~~d~dqi~~L~~e~er----l~V~~~ 185 (694) T protein:vir:10 115 LV----LLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAAS-TSDGDQLKQINDEIER----LRIRDA 185 (694) T ss_pred HH----HHhhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccc-cccHHHHHHHHHHHHH----HHHHHH Confidence 33 7899999999999999888432 22222 1333322222 122 334566666654 222344 Q ss_pred hHHHHHHhhhhhhhhhhheecCC--------------CCccCeeEeEEeCccccchhhhhcccCCccccccccccchhee Q lcl|NC_019530. 145 AYQKIRKWYVDGRQAYHVIVDPT--------------NKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYY 210 (560) Q Consensus 145 ~~~~~R~wYvDGr~~~hkiiD~~--------------~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~ 210 (560) ..+.++-=-+.|.=.-...|+.. -.|.+++.|+.|||..+-+- ....++-... .+ T Consensus 186 l~eaik~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~----------~~n~~dP~sp-df 254 (694) T protein:vir:10 186 VRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPN----------NYNSINPVAD-DF 254 (694) T ss_pred HHHHHHhhccccceEEEEEeecCccccccccccccccccCcceeeeEeecccccccc----------hhhhccchhh-cc Confidence 44443322233322211223222 23557888999999776662 1111111111 23 Q ss_pred ecCccccccccccccccccccccccceecchhceeeeecccccCCC-------CcchhhhHHHHHHHHH-HHHHHHHHHH Q lcl|NC_019530. 211 YNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLAN-------GIVPGLLNPAIRPLNN-LVTTEDATVI 282 (560) Q Consensus 211 Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~-------~~i~syL~~AiK~~Nq-L~m~EDalVI 282 (560) |.|. +|...| .+||.+-++....- ..+. ...+|.+..+..-..+ +++...+.=+ T Consensus 255 gkP~--~y~V~G--------------~~IH~SRL~~f~g~--plPd~LKp~y~~~G~Sv~q~~~e~V~~~~rT~~~v~~L 316 (694) T protein:vir:10 255 YKPS--TWWMIG--------------TEVHATRLHTIVSR--PVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDI 316 (694) T ss_pred CCCc--eEEEec--------------eEEeeeeEEEecCC--CchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHH Confidence 3332 221111 13443332211110 1111 1112444444433222 3333322211 Q ss_pred HHHhcCccceEEEccCC-CCchHHHHHHH--HHHHHhhhhheeecccCCcccCcccchh-hHhhhcccccCCCCccceee Q lcl|NC_019530. 283 YAITRAPEKRAFYLDVG-TLGKKSAEEYM--TMMMGKFKNRNAYDRTTGKITGNAHLMG-IAEDYWLPRREGQNATEIAT 358 (560) Q Consensus 283 yRitRAPeRRvFyIDvG-nlpk~KAeqYl--~~im~k~knk~vYd~~TGev~d~~~~ms-mlEDywLpRReGgrgTEIsT 358 (560) +..+.-+ ++-.|.. -|.....++.. -+++++||.- -|-+ .+. =.|+|- + T Consensus 317 --i~~~~v~-~lk~dla~~L~~g~~~~l~~R~eli~~~Rsn------~G~~-----llDk~~Eefe-------------q 369 (694) T protein:vir:10 317 --VKQFSVS-GILMDLAQALMPGANVDLSMRAELINRYRDN------RNIL-----FLDKATEEFF-------------Q 369 (694) T ss_pred --HHhhhhH-HHHHHHHHhhcChhHHHHHHHHHHHHHhcCc------cceE-----EEecCCcceE-------------E Confidence 1111110 0011211 00111112222 2455566531 1100 000 013332 2 Q ss_pred cCCCCCcchHHHHH-HHHHHHHHhcCCCccccCCCC--cccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH-- Q lcl|NC_019530. 359 VGGGNQLGEMDHVN-YFREKLYDALMIPKSRLQEEG--SINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRR-- 433 (560) Q Consensus 359 LpGg~nLgei~DV~-YF~~kLy~aL~VP~sRl~~~~--~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~-- 433 (560) . ..+|+-++||- =|..-+=-+.+||+.||=..+ |||--+. + |.-.|...|..+|. ..+...|++ T Consensus 370 ~--stslSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE--~----D~rnYYD~I~s~Qe---~~L~p~L~rl~ 438 (694) T protein:vir:10 370 F--NTPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSE--G----EIRVWYDYVRAYQR---NALQQLMNDVI 438 (694) T ss_pred E--ecccCCHHHHHHHHHHHHHhhhcCchhhhhccCcccccccch--h----hHHHHHHHHHHHHH---HHHHHHHHHHH Confidence 2 25788889975 488888889999999985444 6654443 2 45559999988885 445555554 Q ss_pred ---HHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHH---------- Q lcl|NC_019530. 434 ---QLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNV---------- 500 (560) Q Consensus 434 ---qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~I---------- 500 (560) |+-+.|.+ -+.|.|.|..=..-+|..-+||...+.+....+-.- --++.+-|+..+ T Consensus 439 ~ii~rS~~G~i---------dp~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~--gvI~~~evr~rL~~d~~s~Y~~ 507 (694) T protein:vir:10 439 VMIQLSLFGAV---------DPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQE--QVIRPDQVAARLNTEPDGPYAG 507 (694) T ss_pred HHHHHHhcCCC---------CCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHh--cCCCHHHHHHHHhcCCCccccc Confidence 33333443 345778887666677777788877776654333111 122333333221 Q ss_pred --------hCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccc----cCCcccCC Q lcl|NC_019530. 501 --------LRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPF----SGSTDDSI 560 (560) Q Consensus 501 --------L~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~----~~~~~~~~ 560 (560) ..-+|+||.-+...-+.+- --++.-+.|+...|+.+|.-+.+..++. +|..+-.| T Consensus 508 ~~D~~d~p~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~g~~~~~~v~~~~~~~~~~~ag~~~~~~ 574 (694) T protein:vir:10 508 KLDANDDPGVPADDDIDGVLTYVQRLA-----EGGDTGAPGGARAGATAPPTVANVNANVNPREAGAQDAAM 574 (694) T ss_pred ccccccCCCcCccchhhhhHhhhcCcc-----cccccCCCCcccccccCCCcccccccccCccccCCCCccc Confidence 1123334433333322221 1122223333444555544444444444 55555444 No 73 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=97.87 E-value=1.4e-05 Score=47.28 Aligned_cols=462 Identities=15% Similarity=0.201 Sum_probs=186.3 Q ss_pred CCcccc--------------------------hhhhhhhhhhhhhhh-----hhhcccccchhhhccccccccCCCCCCC Q lcl|NC_019530. 1 MAGYGR--------------------------GFFGLFGGGGLVNAK-----VDTDKLAQKQDELLLTKATVVALDDAQD 49 (560) Q Consensus 1 ~~~~~~--------------------------~~~~~~~~~~~~~~~-----~~~~~~~~~~~e~~~~k~~s~~~~~~~d 49 (560) +|--|| .-|.+-|-+..+|.- --+.++.+.++-..... ++. .. T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~----~~ 85 (695) T protein:vir:36 12 LARTGRRPEVAKAATLAAAATIATAAAAQPVPADFARRGALNALDAAPVVEPSPSLRLARQFEVDVSNY--TPR----ER 85 (695) T ss_pred HhhcCCCcchhhhhhhhhhhhhhhhccccccchhhhhcccccccccccccCCCcccccceeceeccccc--Ccc----cc Confidence 111111 011111222111110 01222222222111000 000 00 Q ss_pred CceeEeccccceecccc----ccchhhHHHHHHHHHHHhhccchhHHHHHhhcceeee------cCCCc----eEEEeee Q lcl|NC_019530. 50 GSIILQGGANTYNYVGV----ESELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVTC------EEDET----PVTVNLD 115 (560) Q Consensus 50 G~~~i~~~~~~~~y~~~----e~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~------d~~~~----~V~l~Ld 115 (560) -+....-..+...|.++ .+++.-=.-|- +|||+||.+++++-|..||+-. ...++ .+++.-+ T Consensus 86 ~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la----~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~ 161 (695) T protein:vir:36 86 RAASYALDFNGTSMDALSFVTSSGFPGFPTLV----LLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGN 161 (695) T ss_pred chhhhhhcccccccccchhhhccCcchHHHHH----HHhhccchhhHHHHHHHHhhcccceecccchhhhhhcccccccc Confidence 00000000001111111 11121122233 7899999999999999988432 22222 1333333 Q ss_pred cccchhH-HHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCC--------------CCccCeeEeEEeC Q lcl|NC_019530. 116 KVEGISD-TVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPT--------------NKKGGIKKLVMLD 180 (560) Q Consensus 116 ~~~~~s~-sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~--------------~~k~GI~eLr~LD 180 (560) ..+. ++ .-.++|..|++. +++ ..+..+.++-=-+.|.=.-...|+.. -.|.+++.|+.|| T Consensus 162 ~~~~-~d~dqik~L~~e~er-L~V---~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViD 236 (695) T protein:vir:36 162 AAST-SDGDQLKQINDEIER-LRI---RDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVE 236 (695) T ss_pred cccc-CchHHHHHHHHHHHH-HHH---HHHHHHHHHhhccccceEEEEEeccCccccccccccccccccCcceeeeEeec Confidence 3221 22 345577777763 333 44444443332333332211233222 2355788899999 Q ss_pred ccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeecccccCCC---- Q lcl|NC_019530. 181 SRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLAN---- 256 (560) Q Consensus 181 Pr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~---- 256 (560) |..+-+- ....++-... .+|.|. +|...| .+||.+-++....- ..+. T Consensus 237 p~~vtP~----------~~n~~dP~sp-dfgkP~--~y~V~G--------------~kIH~SRL~~f~g~--plPd~LKp 287 (695) T protein:vir:36 237 PYWVTPN----------NYNSINPVAD-DFYKPS--TWWMIG--------------TEVHATRLHTIVSR--PVGDMLKP 287 (695) T ss_pred ccccccc----------hhhhccchhh-ccCCCc--eEEEec--------------eEEeeeeEEEecCC--Cchhhhhc Confidence 9776662 1111111111 233332 221111 13443332211110 1111 Q ss_pred ---CcchhhhHHHHHHHHH-HHHHHHHHHHHHHhcCccceEEEccCC-CCchHHHHHHH--HHHHHhhhhheeecccCCc Q lcl|NC_019530. 257 ---GIVPGLLNPAIRPLNN-LVTTEDATVIYAITRAPEKRAFYLDVG-TLGKKSAEEYM--TMMMGKFKNRNAYDRTTGK 329 (560) Q Consensus 257 ---~~i~syL~~AiK~~Nq-L~m~EDalVIyRitRAPeRRvFyIDvG-nlpk~KAeqYl--~~im~k~knk~vYd~~TGe 329 (560) ...+|.+..+..-..+ +++...+.=+ +.++.-+ ++-.|.. -|.....++.. -+++++||.- -|- T Consensus 288 ~y~~~GiSv~q~~~e~V~~~~rT~~~v~~L--i~~~~v~-~lk~dla~aL~~g~~~~l~~R~eli~~~Rsn------~G~ 358 (695) T protein:vir:36 288 TYSFAGISMTQLAMPYIDNWLRTRQSVSDI--VKQFSVS-GILMDLAQALMPGANVDLSMRAELINRYRDN------RNI 358 (695) T ss_pred ccccCcccHHHHHHHHHHHHHHHHhHHHHH--HHhhhHH-HHHHHHHHhhcChhHHHHHHHHHHHHHhcCc------cce Confidence 1112344444333222 3333222211 1111100 0011110 00011112222 2455566531 110 Q ss_pred ccCcccchh-hHhhhcccccCCCCccceeecCCCCCcchHHHHH-HHHHHHHHhcCCCccccCCCC--cccccccchhhh Q lcl|NC_019530. 330 ITGNAHLMG-IAEDYWLPRREGQNATEIATVGGGNQLGEMDHVN-YFREKLYDALMIPKSRLQEEG--SINIGGSNLAEI 405 (560) Q Consensus 330 v~d~~~~ms-mlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~-YF~~kLy~aL~VP~sRl~~~~--~~~l~~~~~~eI 405 (560) + .+. =.|+|- +. ..+|+-++||- =|..-+=-+.+||+.||=..+ |||--+. + T Consensus 359 ~-----llDk~~Eefe-------------q~--stslSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE--~-- 414 (695) T protein:vir:36 359 L-----FLDKATEEFF-------------QF--NTPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSE--G-- 414 (695) T ss_pred E-----EEecCCcceE-------------EE--ecccCCHHHHHHHHHHHHHhhhcCchhhhhccCcccccccch--h-- Confidence 0 000 013332 22 25788889975 488888889999999985444 6654443 2 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 406 TQEELRFSKFCAGLRRRYSHFFMEFLRR-----QLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASL 480 (560) Q Consensus 406 tRDElkF~Kfi~rLr~rfs~if~d~Lk~-----qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l 480 (560) |.-.|...|..+|. ..+...|++ |+-..|.+ -+.|.|.|..=..-+|..-+||...+.+.. T Consensus 415 --D~rnYYD~I~s~Qe---~~L~p~L~rl~~ii~rS~~G~i---------dpdi~~~fnPL~qmtd~EkAeI~~k~A~~d 480 (695) T protein:vir:36 415 --EIRVWYDYVRAYQR---NALQQLMNDVIVMIQLSLFGAV---------DPSIKWQWNALRELDDLEVAESRYKQAQSD 480 (695) T ss_pred --hHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHhcCCC---------CCcceEEeCCCCCcCHHHHHHHHhhhhHHH Confidence 45559999988885 445555554 33333443 345778887666677777788877776654 Q ss_pred HHhhhhhcchhhHHHHHHHH------------------hCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCC Q lcl|NC_019530. 481 NTVEPFVGSIFSIDYVMRNV------------------LRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSP 542 (560) Q Consensus 481 ~~~~p~vGky~S~~~i~k~I------------------L~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~ 542 (560) ..+-.- --++.+-|+..+ ..-+|+||.-+...-+.+- --++.-+.|+...|+.+| T Consensus 481 ~~~~~~--gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~g~~~~ 553 (695) T protein:vir:36 481 VLYVQE--QVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLA-----EGGDTGAPGGARAGATAP 553 (695) T ss_pred HHHHHh--cCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcCcc-----cccccCCCCcccccccCC Confidence 333111 122333333221 1122333333333322221 112222333344445554 Q ss_pred CCCCcccccc----cCCcccCC Q lcl|NC_019530. 543 LKFKPETIPF----SGSTDDSI 560 (560) Q Consensus 543 ~~~~p~~~~~----~~~~~~~~ 560 (560) .-+.+..++. +|..+-.| T Consensus 554 ~~v~~~~~~~~~~~ag~~~~~~ 575 (695) T protein:vir:36 554 PTVANVNANVNPREAGAQDAAM 575 (695) T ss_pred CcccccccccCccccCCCCccc Confidence 4444444444 55555444 No 74 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=97.87 E-value=1.4e-05 Score=47.26 Aligned_cols=435 Identities=15% Similarity=0.092 Sum_probs=181.5 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) |++++.+... ..+...+..++-.-.+.++....+-+....+.++. -+..+++|.|..||. T Consensus 1 Mg~~~~l~~r------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~V~--------------~~~al~~~~V~~~v~ 60 (457) T protein:vir:13 1 MGFWSALFGR------GHSPALDGIEARAWEPYDPSIYNLGAVAASGETVT--------------PHDALQVSAVFASVR 60 (457) T ss_pred Cchhhhhhcc------cccccccccccccccccchHHHhhcccccCCceec--------------hHHhhccHHHHHHHH Confidence 3333332221 11111121221111111111111100000011111 124467899999997 Q ss_pred HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCC---chhhHHHHHHhh----hhhhhhhhheecCC Q lcl|NC_019530. 95 IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDF---DNTAYQKIRKWY----VDGRQAYHVIVDPT 167 (560) Q Consensus 95 eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f---~~~~~~~~R~wY----vDGr~~~hkiiD~~ 167 (560) -|.+.+.-+ |+.+--+... ..++.... .++.+++- ...++++++.+. ..|.-|..+ .+. T Consensus 61 ~Ia~~iA~l-----p~~~~~~~~~----~~~~~~~~---~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i-~~~- 126 (457) T protein:vir:13 61 LLSETIATL-----PLSTYSKRGG----SRKEIVTP---EWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAV-RWQ- 126 (457) T ss_pred HHHHhhccC-----ceEEEEecCC----cccccccc---hHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEE-Eec- Confidence 777765433 3333222211 11111112 23344442 234556655444 457666543 322 Q ss_pred CCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeee Q lcl|NC_019530. 168 NKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYI 247 (560) Q Consensus 168 ~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~ 247 (560) ...+++|.+|+|.++..++...... ....|+.|... ..|.. .....++.+.|.|. T Consensus 127 --~g~~~~l~~l~p~~v~v~~~~~~~~---------~~~~~~~y~~~-----~~~~~---------~~~~~~~~~diih~ 181 (457) T protein:vir:13 127 --GPNIVGLDVLDPTKIHVHMVMVDGL---------RRKVFEAYDID-----ADGNE---------VLLGWFTPRDVLHI 181 (457) T ss_pred --CCcEEEEEEEccCceEEEEecCCCc---------cceeEEEEEEe-----cCCce---------eeEEeeCccceEEe Confidence 3578999999998887754221111 11223334311 11110 11234677777777 Q ss_pred ecccccCCC-CcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeeccc Q lcl|NC_019530. 248 DSGEEPLAN-GIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRT 326 (560) Q Consensus 248 hsGl~~~~~-~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~ 326 (560) .. ...++ -+..|-+..|.+.+.....+++...=+=---+--+-|..++ +.|-+..+++..+.+...|+.. .. T Consensus 182 ~~--~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~ls~e~~~~~~~~~~~~~~g~----~n 254 (457) T protein:vir:13 182 PG--MMLPGDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVP-GTMSEEGLARAREAWRAANSGV----DN 254 (457) T ss_pred cC--CCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcC-CCCCHHHHHHHHHHHHHHhcCc----cc Confidence 63 23333 35678899999998888888876554433344445666665 5666555544444433333221 01 Q ss_pred CCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHH---HHHHHHHHHHHhcCCCccccCCCCcccccccchh Q lcl|NC_019530. 327 TGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMD---HVNYFREKLYDALMIPKSRLQEEGSINIGGSNLA 403 (560) Q Consensus 327 TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~---DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~ 403 (560) .|. .+ .++ .|.+++.|. .+..++. =-+|-...+.++++||...|+.-.+.+...++.+ T Consensus 255 ag~------~~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e 315 (457) T protein:vir:13 255 AHR------VA-LLT----------EGAKFSKVA--MSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLA 315 (457) T ss_pred cCc------ce-ecC----------CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHH Confidence 122 12 222 245555552 2333333 3347788899999999999965444344444443 Q ss_pred hhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 404 EITQEELRFSKFC-AGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNT 482 (560) Q Consensus 404 eItRDElkF~Kfi-~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~ 482 (560) +.. +.|.+++ .-+.. .+.+.|-..| +++.+.. ...+.|.-+ ++... -+..|++.+.. T Consensus 316 q~~---~~f~~~tl~P~~~----~ie~~ln~~L-----~~~~~~~-----~~~i~fd~~----~l~~~-D~~~r~~~~~~ 373 (457) T protein:vir:13 316 EQN---IAFTMFSLRPWLE----RIEAGFNRLL-----FAETADR-----FRFVKFNLD----EIKRG-APKERMELWSL 373 (457) T ss_pred HHH---HHHHHHHHHHHHH----HHHHHHHHhh-----cCccccC-----ceeEEeech----hhhcc-CHHHHHHHHHH Confidence 333 3366653 33332 2333333333 3433321 122334322 22211 22345555554 Q ss_pred hhhhhcchhhHHHHHHHHhCCCHHH-----HHHHHH---HHHHhhhcCCCCCCCcccccccccccc--CCCCCCcccccc Q lcl|NC_019530. 483 VEPFVGSIFSIDYVMRNVLRMSDEE-----VKEQQT---KIAEEKKKGLYPKVQADETGNYSGSDV--SPLKFKPETIPF 552 (560) Q Consensus 483 ~~p~vGky~S~~~i~k~IL~~tDeE-----I~e~~k---qIe~E~~~~~~~~p~~~e~g~~~~~~~--~~~~~~p~~~~~ 552 (560) +-. .-++|.+-++. .+.|..-+ .--+.. .+.+..+...-+.|.+.++..+.+... ...+-+++---- T Consensus 374 ~~~--~G~~T~NE~R~-~~gl~Pi~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~d~~~~~~ 450 (457) T protein:vir:13 374 GLQ--NGIYSIDEVRA-AEDMTPLPDGLGEKYRVPLNLGEVGEEPEPEPAPAPPAIEPPAEEPDEEPEPEGKPDDEGATE 450 (457) T ss_pred HHh--CCCcCHHHHHH-HhCCCCCCCCcccceeeccccccccccccccccCCCCCCCCCccccCCCCCCCCCCccccCCC Confidence 322 23788888874 47775421 000000 000000000000111111000000000 000000000000 Q ss_pred cCCcccC Q lcl|NC_019530. 553 SGSTDDS 559 (560) Q Consensus 553 ~~~~~~~ 559 (560) +--.||| T Consensus 451 ~~~~~~~ 457 (457) T protein:vir:13 451 EDDEDDA 457 (457) T ss_pred CcccccC Confidence 0111333 No 75 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=97.86 E-value=1.4e-05 Score=47.11 Aligned_cols=429 Identities=10% Similarity=0.118 Sum_probs=194.4 Q ss_pred hhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhh--c Q lcl|NC_019530. 9 FGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQ--Q 86 (560) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~--~ 86 (560) .|+|..++.+-.+... ++-.+.-.+.... .-++.++ ...+.|++++.|.. + T Consensus 1 m~~~~~~k~~~~~~~~-~~~~~~~~~~~~~-~~i~~~~-------------------------~~~~~i~~~~~~Y~g~~ 53 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKY-VMTTQSLTNITDH-PKIAISK-------------------------LEYDRITTNLKYYKSDW 53 (500) T ss_pred CchHHHHHHHHHHHHH-Hhhcchhhhhhcc-ccccCCH-------------------------HHHHHHHHHHHHhcCCC Confidence 3344333333221110 1111111111110 0011111 11122223333321 1 Q ss_pred cch--------------------hHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhH Q lcl|NC_019530. 87 PEI--------------------RKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAY 146 (560) Q Consensus 87 PEv--------------------d~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~ 146 (560) |.+ ..++++. ...++. .++++.+++ +...+.++.+++-.+|..... T Consensus 54 ~~~~~~~~~~~~~~~~~~slnl~~~i~~~~-A~lv~~----e~~~i~~~d---------~~~~~~l~~il~~n~f~~~~~ 119 (500) T protein:vir:98 54 DSVLYLNTDGETKKRDLNHLPIARTAAKKI-ASLVFN----EQAEIKVDD---------DAANEFISETLKNDRFNKNFE 119 (500) T ss_pred CCcccccCCCCcccCceeecchHHHHHHHH-hhhhcC----CcceEecCC---------hHHHHHHHHHHhhccHHHHHH Confidence 111 1111111 111111 233444433 256677888999899999999 Q ss_pred HHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCcccccc------cc---ccchhe------ee Q lcl|NC_019530. 147 QKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEA------IE---SVTLKY------YY 211 (560) Q Consensus 147 ~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~------~~---~~~e~~------~Y 211 (560) +.+....+-|..+|+..+|... ..+.+++|.++-+++ .+..+.... .+ +...|| .+ T Consensus 120 ~~~e~a~a~G~~~~k~~~d~~~-----~~I~~v~ad~~~P~~----~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~ 190 (500) T protein:vir:98 120 RYLESCLALGGLAMRPYVDGDK-----VRVAFVQAPVFLPLQ----SNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEW 190 (500) T ss_pred HHHHHHhhcCCEEEEEEEeCCc-----eEEEEEcCCeeEEEE----EcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEE Confidence 9999999999999999998643 347889998876652 111111100 00 011111 11 Q ss_pred cCcccc----ccccccccccccccccccce--ecchhcee--------eeecc-----cccCCCCcchhhhHHHHHHHHH Q lcl|NC_019530. 212 NPNYNR----NQFTGQSGTSQNFQPSQQEL--VFDDESIV--------YIDSG-----EEPLANGIVPGLLNPAIRPLNN 272 (560) Q Consensus 212 ~p~~~~----~~~~g~~~~~~~~~~~~~~~--~i~~daI~--------y~hsG-----l~~~~~~~i~syL~~AiK~~Nq 272 (560) .+...+ ..+-..+..+....-+-+.+ .++ +.++ |++.= -.+......+|-++.|+-.+.. T Consensus 191 ~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~-~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~ 269 (500) T protein:vir:98 191 QSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLK-DEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDF 269 (500) T ss_pred eCCceeEEEEEEEecccccccCcccccccccCCcC-cceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHH Confidence 111000 00000000000000000000 011 1111 11110 0011122356889999988888 Q ss_pred HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCccc-------CcccchhhHhhhcc Q lcl|NC_019530. 273 LVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKIT-------GNAHLMGIAEDYWL 345 (560) Q Consensus 273 L~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~-------d~~~~msmlEDywL 345 (560) |-..-+.+. |..|.=++|||. +..=++ ...+..+|+.. +++-+..|-. T Consensus 270 lD~~~s~~~--~e~~~g~~~i~v-~~~~l~------------------~~~~~~~g~~~~~~~~d~~~~~~~~~~~---- 324 (500) T protein:vir:98 270 INTTYDEFM--WEVKMGQRRVAV-PESLTA------------------LTVRTTDGDVVPRPRFESDQNVYIRMGG---- 324 (500) T ss_pred HHHHHHHHH--HHHHhCcceeee-chHHhc------------------ccCCCCCccccCCcccCCCcceEEEcCC---- Confidence 887777665 788887777765 211100 01111122110 1111111100 Q ss_pred cccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 346 PRREGQNATEIATVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYS 424 (560) Q Consensus 346 pRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs 424 (560) -+++ +.-|+.+...--..+ ..-+.++.+.+=.+.+++-+.|..+++. -..|+||.-.+-.-..-+.+.|+.|. T Consensus 325 --~~~~-~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g---~~TAtei~s~~~~~~~t~~~~~~~~~ 398 (500) T protein:vir:98 325 --RDLD-SSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKS---MKTATEIVSENSDTYQMRNSIVALVE 398 (500) T ss_pred --CCCc-CcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCc---cccHHHHHHHHHHHHHHHHHHHHHHH Confidence 0111 122555543221222 2345667777777888999988876652 23478886555555566777777777 Q ss_pred HHHHHHHHHHHHh-------ccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHH Q lcl|NC_019530. 425 HFFMEFLRRQLIL-------KGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVM 497 (560) Q Consensus 425 ~if~d~Lk~qLiL-------kgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~ 497 (560) ..+.++++.-|-+ +|... .+ ..+.++|. |+.+.. ++++.- ..+.+++ - | .+|.++.+ T Consensus 399 ~al~~lv~~il~~~~~~~~~~~~~~-~~------~~v~v~f~-d~i~~d-~~~~~~-~~~~~v~----a-G-i~s~~~~i 462 (500) T protein:vir:98 399 QSLKELVISIFEIAKAYDLYQSEVP-SM------DNISISLD-DGVFTD-RDAELD-YWIKVVN----A-G-FGTREMAI 462 (500) T ss_pred HHHHHHHHHHHHHHHHHhhcCCCCC-CC------cceEEEeC-CCCCCC-HHHHHH-HHHHHHH----c-C-CCCHHHHH Confidence 7777777765533 22222 22 23566663 443332 222211 1111111 1 3 58888877 Q ss_pred HHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccccccc Q lcl|NC_019530. 498 RNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGS 538 (560) Q Consensus 498 k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~ 538 (560) ++...+||+|.+++.++|++|.... ...+ +.+++..++ T Consensus 463 ~~~~g~~eeea~~~l~~i~~E~~~~-~~~~--~~~~~~~g~ 500 (500) T protein:vir:98 463 QKVLNVTEEKAQEIAAEINTGIVDE-INQQ--RTDTHLYGE 500 (500) T ss_pred HhcCCCCHHHHHHHHHHHHHhcccc-CCCC--CccccccCC Confidence 6778999999999999999995222 1111 112222222 No 76 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=97.86 E-value=1.4e-05 Score=47.11 Aligned_cols=429 Identities=10% Similarity=0.118 Sum_probs=194.4 Q ss_pred hhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhh--c Q lcl|NC_019530. 9 FGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQ--Q 86 (560) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~--~ 86 (560) .|+|..++.+-.+... ++-.+.-.+.... .-++.++ ...+.|++++.|.. + T Consensus 1 m~~~~~~k~~~~~~~~-~~~~~~~~~~~~~-~~i~~~~-------------------------~~~~~i~~~~~~Y~g~~ 53 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKY-VMTTQSLTNITDH-PKIAISK-------------------------LEYDRITTNLKYYKSDW 53 (500) T ss_pred CchHHHHHHHHHHHHH-Hhhcchhhhhhcc-ccccCCH-------------------------HHHHHHHHHHHHhcCCC Confidence 3344333333221110 1111111111110 0011111 11122223333321 1 Q ss_pred cch--------------------hHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhH Q lcl|NC_019530. 87 PEI--------------------RKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAY 146 (560) Q Consensus 87 PEv--------------------d~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~ 146 (560) |.+ ..++++. ...++. .++++.+++ +...+.++.+++-.+|..... T Consensus 54 ~~~~~~~~~~~~~~~~~~slnl~~~i~~~~-A~lv~~----e~~~i~~~d---------~~~~~~l~~il~~n~f~~~~~ 119 (500) T protein:vir:30 54 DSVLYLNTDGETKKRDLNHLPIARTAAKKI-ASLVFN----EQAEIKVDD---------DAANEFISETLKNDRFNKNFE 119 (500) T ss_pred CCcccccCCCCcccCceeecchHHHHHHHH-hhhhcC----CcceEecCC---------hHHHHHHHHHHhhccHHHHHH Confidence 111 1111111 111111 233444433 256677888999899999999 Q ss_pred HHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCcccccc------cc---ccchhe------ee Q lcl|NC_019530. 147 QKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEA------IE---SVTLKY------YY 211 (560) Q Consensus 147 ~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~------~~---~~~e~~------~Y 211 (560) +.+....+-|..+|+..+|... ..+.+++|.++-+++ .+..+.... .+ +...|| .+ T Consensus 120 ~~~e~a~a~G~~~~k~~~d~~~-----~~I~~v~ad~~~P~~----~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~ 190 (500) T protein:vir:30 120 RYLESCLALGGLAMRPYVDGDK-----VRVAFVQAPVFLPLQ----SNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEW 190 (500) T ss_pred HHHHHHhhcCCEEEEEEEeCCc-----eEEEEEcCCeeEEEE----EcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEE Confidence 9999999999999999998643 347889998876652 111111100 00 011111 11 Q ss_pred cCcccc----ccccccccccccccccccce--ecchhcee--------eeecc-----cccCCCCcchhhhHHHHHHHHH Q lcl|NC_019530. 212 NPNYNR----NQFTGQSGTSQNFQPSQQEL--VFDDESIV--------YIDSG-----EEPLANGIVPGLLNPAIRPLNN 272 (560) Q Consensus 212 ~p~~~~----~~~~g~~~~~~~~~~~~~~~--~i~~daI~--------y~hsG-----l~~~~~~~i~syL~~AiK~~Nq 272 (560) .+...+ ..+-..+..+....-+-+.+ .++ +.++ |++.= -.+......+|-++.|+-.+.. T Consensus 191 ~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~-~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~ 269 (500) T protein:vir:30 191 QSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLK-DEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDF 269 (500) T ss_pred eCCceeEEEEEEEecccccccCcccccccccCCcC-cceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHH Confidence 111000 00000000000000000000 011 1111 11110 0011122356889999988888 Q ss_pred HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCccc-------CcccchhhHhhhcc Q lcl|NC_019530. 273 LVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKIT-------GNAHLMGIAEDYWL 345 (560) Q Consensus 273 L~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~-------d~~~~msmlEDywL 345 (560) |-..-+.+. |..|.=++|||. +..=++ ...+..+|+.. +++-+..|-. T Consensus 270 lD~~~s~~~--~e~~~g~~~i~v-~~~~l~------------------~~~~~~~g~~~~~~~~d~~~~~~~~~~~---- 324 (500) T protein:vir:30 270 INTTYDEFM--WEVKMGQRRVAV-PESLTA------------------LTVRTTDGDVVPRPRFESDQNVYIRMGG---- 324 (500) T ss_pred HHHHHHHHH--HHHHhCcceeee-chHHhc------------------ccCCCCCccccCCcccCCCcceEEEcCC---- Confidence 887777665 788887777765 211100 01111122110 1111111100 Q ss_pred cccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 346 PRREGQNATEIATVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYS 424 (560) Q Consensus 346 pRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs 424 (560) -+++ +.-|+.+...--..+ ..-+.++.+.+=.+.+++-+.|..+++. -..|+||.-.+-.-..-+.+.|+.|. T Consensus 325 --~~~~-~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g---~~TAtei~s~~~~~~~t~~~~~~~~~ 398 (500) T protein:vir:30 325 --RDLD-SSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKS---MKTATEIVSENSDTYQMRNSIVALVE 398 (500) T ss_pred --CCCc-CcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCc---cccHHHHHHHHHHHHHHHHHHHHHHH Confidence 0111 122555543221222 2345667777777888999988876652 23478886555555566777777777 Q ss_pred HHHHHHHHHHHHh-------ccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHH Q lcl|NC_019530. 425 HFFMEFLRRQLIL-------KGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVM 497 (560) Q Consensus 425 ~if~d~Lk~qLiL-------kgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~ 497 (560) ..+.++++.-|-+ +|... .+ ..+.++|. |+.+.. ++++.- ..+.+++ - | .+|.++.+ T Consensus 399 ~al~~lv~~il~~~~~~~~~~~~~~-~~------~~v~v~f~-d~i~~d-~~~~~~-~~~~~v~----a-G-i~s~~~~i 462 (500) T protein:vir:30 399 QSLKELVISIFEIAKAYDLYQSEVP-SM------DNISISLD-DGVFTD-RDAELD-YWIKVVN----A-G-FGTREMAI 462 (500) T ss_pred HHHHHHHHHHHHHHHHHhhcCCCCC-CC------cceEEEeC-CCCCCC-HHHHHH-HHHHHHH----c-C-CCCHHHHH Confidence 7777777765533 22222 22 23566663 443332 222211 1111111 1 3 58888877 Q ss_pred HHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccccccc Q lcl|NC_019530. 498 RNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGS 538 (560) Q Consensus 498 k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~ 538 (560) ++...+||+|.+++.++|++|.... ...+ +.+++..++ T Consensus 463 ~~~~g~~eeea~~~l~~i~~E~~~~-~~~~--~~~~~~~g~ 500 (500) T protein:vir:30 463 QKVLNVTEEKAQEIAAEINTGIVDE-INQQ--RTDTHLYGE 500 (500) T ss_pred HhcCCCCHHHHHHHHHHHHHhcccc-CCCC--CccccccCC Confidence 6778999999999999999995222 1111 112222222 No 77 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=97.83 E-value=1.6e-05 Score=46.88 Aligned_cols=444 Identities=15% Similarity=0.194 Sum_probs=177.1 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCC-ceeEeccccceeccccccchhhHHHHHHH Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDG-SIILQGGANTYNYVGVESELLSVKTVVEE 79 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG-~~~i~~~~~~~~y~~~e~~~~~~~~LI~~ 79 (560) .|--.- -++..+.+-++...-+-. ++ -+..-.-|+ .+..++-+++ .-+++.-=.-|- T Consensus 60 ~~~~~~--~~~~~~~~~~~~~~~~~~--~~-----------~~~~~~~~~~~~~~~~l~~~-----~~~~F~Gy~~la-- 117 (695) T protein:vir:78 60 VAEPSP--SLRLARQFEVDVSNYTPR--ER-----------RAASYALDFNGTSMDALSFV-----TSSGFPGFPTLV-- 117 (695) T ss_pred ccCCCc--ccccceeceeccccCCcc--cc-----------chhhhhhcccccccccchhh-----hccCcchHHHHH-- Confidence 111110 111111111111111110 00 000000110 0111111010 011121112233 Q ss_pred HHHHhhccchhHHHHHhhcceeee------cCCCc----eEEEeeecccchhH-HHHHHHHHHHHHHHHHhCCchhhHHH Q lcl|NC_019530. 80 YQSMAQQPEIRKAVDIIVNDVVTC------EEDET----PVTVNLDKVEGISD-TVKESITECFKEVMHLMDFDNTAYQK 148 (560) Q Consensus 80 YR~ma~~PEvd~AI~eIvneaiv~------d~~~~----~V~l~Ld~~~~~s~-sik~~I~eeF~~il~ll~f~~~~~~~ 148 (560) +|||+||.+++++-|..||+-. ...++ .+++.-+..+ .++ .-.++|..|++. |+...+..+. T Consensus 118 --~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~-~~d~dqi~~L~~e~er----L~V~~~l~ea 190 (695) T protein:vir:78 118 --LLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAAS-TSDGDQLKQINDEIER----LRIRDAVRTT 190 (695) T ss_pred --HHhhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccc-cccHHHHHHHHHHHHH----HHHHHHHHHH Confidence 7899999999999999988432 22222 1333322222 122 334566666654 2224444444 Q ss_pred HHHhhhhhhhhhhheecCC--------------CCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCc Q lcl|NC_019530. 149 IRKWYVDGRQAYHVIVDPT--------------NKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPN 214 (560) Q Consensus 149 ~R~wYvDGr~~~hkiiD~~--------------~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~ 214 (560) ++-=-+.|.=.-...|+.. -.|.+++.|+.|||..+-+- ....++-... .+|.|. T Consensus 191 ik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~----------~~n~~dP~sp-dfgkP~ 259 (695) T protein:vir:78 191 VIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPN----------NYNSINPVAD-DFYKPS 259 (695) T ss_pred HHhhccccceEEEEEeccCccccccccccccccccCcceeeeEeecccccccc----------hhhhccchhh-ccCCCc Confidence 3322233322211223222 23557888999999776662 1111111111 233332 Q ss_pred cccccccccccccccccccccceecchhceeeeecccccCCC-------CcchhhhHHHHHHHHH-HHHHHHHHHHHHHh Q lcl|NC_019530. 215 YNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLAN-------GIVPGLLNPAIRPLNN-LVTTEDATVIYAIT 286 (560) Q Consensus 215 ~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~-------~~i~syL~~AiK~~Nq-L~m~EDalVIyRit 286 (560) +|...| .+||.+-++....- ..+. ...+|.+..+..-..+ +++...+.=+ +. T Consensus 260 --~y~V~G--------------~kIH~SRL~~f~g~--plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~L--i~ 319 (695) T protein:vir:78 260 --TWWMIG--------------TEVHATRLHTIVSR--PVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDI--VK 319 (695) T ss_pred --eEEEec--------------eEEeeeeEEEecCC--CchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHH--HH Confidence 221111 13443332211110 1111 1112444444433222 3333322211 11 Q ss_pred cCccceEEEccCC-CCchHHHHHHH--HHHHHhhhhheeecccCCcccCcccchh-hHhhhcccccCCCCccceeecCCC Q lcl|NC_019530. 287 RAPEKRAFYLDVG-TLGKKSAEEYM--TMMMGKFKNRNAYDRTTGKITGNAHLMG-IAEDYWLPRREGQNATEIATVGGG 362 (560) Q Consensus 287 RAPeRRvFyIDvG-nlpk~KAeqYl--~~im~k~knk~vYd~~TGev~d~~~~ms-mlEDywLpRReGgrgTEIsTLpGg 362 (560) .+.-+ ++-.|.. -|.....++.. -+++++||.- -|-+ .+. =.|+|- +. . T Consensus 320 ~~~v~-~lk~dla~~L~~g~~~~l~~R~eli~~~Rsn------~G~~-----llDk~~Eefe-------------q~--s 372 (695) T protein:vir:78 320 QFSVS-GILMDLAQALMPGANVDLSMRAELINRYRDN------RNIL-----FLDKATEEFF-------------QF--N 372 (695) T ss_pred hhhhH-HHHHHHHHhhcChhHHHHHHHHHHHHHhcCc------cceE-----EEecCCcceE-------------EE--e Confidence 11111 0111211 00111112222 2455566531 1100 000 013332 22 2 Q ss_pred CCcchHHHHH-HHHHHHHHhcCCCccccCCCC--cccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH-----H Q lcl|NC_019530. 363 NQLGEMDHVN-YFREKLYDALMIPKSRLQEEG--SINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRR-----Q 434 (560) Q Consensus 363 ~nLgei~DV~-YF~~kLy~aL~VP~sRl~~~~--~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~-----q 434 (560) .+|+-++||- =|..-+=-+.+||+.||=..+ |||--+. + |.-.|...|..+|. ..+...|++ | T Consensus 373 tslSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE--~----D~rnYYD~I~s~Qe---~~L~p~L~rl~~ii~ 443 (695) T protein:vir:78 373 TPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSE--G----EIRVWYDYVRAYQR---NALQQLMNDVIVMIQ 443 (695) T ss_pred cccCCHHHHHHHHHHHHHhhhcCchhhhhccCCccccccch--h----hHHHHHHHHHHHHH---HHHHHHHHHHHHHHH Confidence 5788889975 488888889999999985444 6654443 2 45559999988885 445555554 3 Q ss_pred HHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHH-------------- Q lcl|NC_019530. 435 LILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNV-------------- 500 (560) Q Consensus 435 LiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~I-------------- 500 (560) +-..|.+ -+.|.|.|..=..-+|..-+||...+.+....+-.- --++.+-|+..+ T Consensus 444 rS~~G~i---------dpdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~--gvI~~~evr~rL~~d~~s~Y~~~~D~ 512 (695) T protein:vir:78 444 LSLFGAV---------DPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQE--QVIRPDQVAARLNTEPDGPYAGKLDA 512 (695) T ss_pred HHhcCCC---------CCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHh--cCCCHHHHHHHHhcCCCccccccccc Confidence 3333443 345778887666677777788877776654333211 123333333221 Q ss_pred ----hCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccc--------cCCcccCC Q lcl|NC_019530. 501 ----LRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPF--------SGSTDDSI 560 (560) Q Consensus 501 ----L~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~--------~~~~~~~~ 560 (560) ..-+|+||.-+...-+.+ +++++.|+.+ +.-.|---+|.+.+. +|..+-.| T Consensus 513 ~d~p~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~-~~~~g~~~~~~~~~~~~~~~~~~ag~~~~~~ 575 (695) T protein:vir:78 513 NDDPGVPADDDIDGVLTYVQRL--------AEGGDTGAPG-GARAGATAPPTVANVNANVKPREAGAQDAAM 575 (695) T ss_pred ccCCCcCccchhhhhHhhhcCc--------ccccccCCCC-CCCCCCCCCCceeeeeccccccccCCCCccc Confidence 111222333332222222 2222222222 112222233333322 44444443 No 78 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=97.80 E-value=1.8e-05 Score=46.60 Aligned_cols=440 Identities=15% Similarity=0.099 Sum_probs=183.3 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) |+.++.+.. ++.... .+......-...+...... + ..+..+ ..-+ -+..+++|-|.+||. T Consensus 1 Mg~~~~l~~-----~~~~~~-~~~~~~~~~~~~~~~~~~~-~---~~~~~g---~~v~-------~~~al~~~~v~~~i~ 60 (457) T protein:vir:62 1 MGFWSALFG-----RGHSPA-LDAAEGRAWEPYDPSIYNL-G---ATASSG---ERVT-------PHDALQVSAVFASVR 60 (457) T ss_pred Cchhhhhhc-----cccccc-cccccccccccchhhhhhc-c---ccccCC---ceec-------hHHhhccHHHHHHHH Confidence 444443221 111111 1111000000000000000 0 000000 0001 124566899999996 Q ss_pred HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHH----HhhhhhhhhhhheecCCCCc Q lcl|NC_019530. 95 IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIR----KWYVDGRQAYHVIVDPTNKK 170 (560) Q Consensus 95 eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R----~wYvDGr~~~hkiiD~~~~k 170 (560) -|.+.+.-+ |+.|--.. +. ..+..-...+..+++.-+=.-.++++++ .+.+.|.-|.. |.+. . T Consensus 61 ~ia~~iA~l-----p~~~~~~~-~~---~~~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~-i~~~---~ 127 (457) T protein:vir:62 61 LLSETIATL-----PLSTYSKR-GG---TRKEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLA-VRWA---G 127 (457) T ss_pred HHHHhHhhC-----ceEEEEec-CC---ccccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEE-EEeC---C Confidence 666544322 33332211 11 0111111112222211110123444444 45566876643 3332 4 Q ss_pred cCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeecc Q lcl|NC_019530. 171 GGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSG 250 (560) Q Consensus 171 ~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsG 250 (560) .++.+|.+|+|.++...+.... + .....|+.|.-. ..|. ......++.+.|.|+.- T Consensus 128 g~~~~l~~l~p~~v~v~~~~~~----~-----~~~~~~~~y~~~-----~~g~---------~~~~~~~~~~eiih~r~- 183 (457) T protein:vir:62 128 PNIAGLDVLDPTKIHVHMVMVD----G-----LRRKVFEAYDID-----ADGN---------EVLLGWFTPRDVLHIPG- 183 (457) T ss_pred CcEEEEEEEcCcceEEEEeccC----C-----ccceeEEEEEEc-----cCCc---------eeEEEeeCccceEEecC- Confidence 6799999999988876542111 1 011122333311 0111 01123567777877762 Q ss_pred cccCCC-CcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCc Q lcl|NC_019530. 251 EEPLAN-GIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGK 329 (560) Q Consensus 251 l~~~~~-~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGe 329 (560) ...++ -+.+|-++.|++.+.....+++...=+=---+--+-|..++ |.|-+..+++..+.+...|+. T Consensus 184 -~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~ls~e~~~~~~~~~~~~~~G---------- 251 (457) T protein:vir:62 184 -MMLPGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVP-GTMSEEGLARAREAWRAANSG---------- 251 (457) T ss_pred -CCCCCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcC-CCCCHHHHHHHHHHHHHHhcC---------- Confidence 33333 35678899999999888888876654433334445666776 566655554444433333321 Q ss_pred ccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHH---HHHHHHHHHHHhcCCCccccCCCCcccccccchhhhh Q lcl|NC_019530. 330 ITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMD---HVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEIT 406 (560) Q Consensus 330 v~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~---DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eIt 406 (560) .++.-+.+ .++ .|.+++.|. .+..++. =-+|-...+.++++||...|+.-.+.+...++.++.. T Consensus 252 ~~nag~~~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~ 318 (457) T protein:vir:62 252 VDNAHRVA-LLT----------EGAKFSKVA--MSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQN 318 (457) T ss_pred ccccCcce-ecC----------CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHH Confidence 11111122 222 245555552 3433333 3346778899999999999976555444555544444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_019530. 407 QEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPF 486 (560) Q Consensus 407 RDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~ 486 (560) +. |.+++ |+--+ ..+.+.|-..| +++.+.. ...+.|..+. +.. .-+..|++.+..+-. T Consensus 319 ~~---f~~~~--l~P~~-~~ie~~ln~~L-----~~~~~~~-----~~~i~fd~~~----l~~-~d~~~r~~~~~~~~~- 376 (457) T protein:vir:62 319 IA---FTMFS--LRPWL-ERIEAGFNRLL-----FAETADR-----FRFVKFNLDE----IKR-GAPKERMELWSLGLQ- 376 (457) T ss_pred HH---HHHHH--HHHHH-HHHHHHHHhhh-----cCccccC-----ceEEEeechh----hhc-cCHHHHHHHHHHHHh- Confidence 43 66653 33211 23333444444 3443321 1233443332 211 123356666555432 Q ss_pred hcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhh--hcCCCCCCCccccccccccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 487 VGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEK--KKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 487 vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~--~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) +-++|.+-+++ ++.|..-+=-..++-+..=. ..+..+++.....+....+.....+-+++..--.|++|+.. T Consensus 377 -~G~~T~NE~R~-~~gl~pi~~g~~D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 450 (457) T protein:vir:62 377 -NGIYSIDEVRA-AEDMTPLPDGLGEKYRVPLNLGEIGEEPEPEPAPAPPAIDPPAEEPADDEEPDNAEGDPDEGE 450 (457) T ss_pred -CCCcCHHHHHH-HhCCCCCCCCCcceeeeccccccccccccccccCCCccCCCCccCCCCCCCCCCCCCCCcccc Confidence 24788888884 46775421000000000000 00000000000000000000000111122222235555444 No 79 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=97.75 E-value=2.2e-05 Score=46.08 Aligned_cols=398 Identities=10% Similarity=0.100 Sum_probs=171.3 Q ss_pred hhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHHHhhccee Q lcl|NC_019530. 22 VDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVV 101 (560) Q Consensus 22 ~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneai 101 (560) .+.+++..+.-..+..+... .+- .+.... +.....+|.++ ..+...++|.|.+||+-|.+.+. T Consensus 1 ~~~~~~~~~~k~~~~~~~~~--~~~--~~~~~~-~~~~~~~~~~v------------~~~~a~~~~~v~~~i~~Ia~~ia 63 (409) T protein:vir:94 1 MAKENIVTRIKKKLIDNWID--QSA--SKLYDF-SPWKNKSFWGV------------INNTLETNETIFSAITKLSNSMA 63 (409) T ss_pred CcccccchhhhhHHhhhhhc--CCc--cccccc-ccccCcccccc------------chhhhhccHHHHHHHHHHHHhhh Confidence 11122211111111111100 000 000000 00001111111 11123568899999987777655 Q ss_pred eecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHH-hCCchh----hHHHHHHhhhhhhhhhhheecCCCCccCeeEe Q lcl|NC_019530. 102 TCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHL-MDFDNT----AYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKL 176 (560) Q Consensus 102 v~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l-l~f~~~----~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eL 176 (560) -++ +.+- ..++. .+ .. ..++|+. =+-... -+.++..+++.|.-|..++.|... =+.+| T Consensus 64 ~lp-----~~~~-~~~~~-~~---~~----~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G---~~~~L 126 (409) T protein:vir:94 64 SLP-----LKMY-EDYKV-VN---TE----VSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYH---QPSKL 126 (409) T ss_pred hCc-----eeEe-ecccc-cc---hh----HHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC---cEEEE Confidence 332 2221 11111 11 11 1122221 111222 345566778889877766655432 27899 Q ss_pred EEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeecccccCCC Q lcl|NC_019530. 177 VMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLAN 256 (560) Q Consensus 177 r~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~ 256 (560) ++|+|.++..++. .+++ ..+|.|.. ..+..+.++.+.|.|.. +.-..+. T Consensus 127 ~~l~~~~v~v~~~-----~~~~-------~~~y~~~~------------------~~g~~~~~~~~dvih~r-~~~~~~~ 175 (409) T protein:vir:94 127 FLLNPDVVEMLIE-----NQSR-------ELYYSIHA------------------ATGNKLIVHNMDMLHFK-HIVASNM 175 (409) T ss_pred EEEcCceeEEEEe-----CCCc-------EEEEEEEc------------------CCceEEEEccccEEEec-CCCCCCc Confidence 9999988876421 1111 11223321 11234567777777764 1112233 Q ss_pred CcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccc Q lcl|NC_019530. 257 GIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHL 336 (560) Q Consensus 257 ~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~ 336 (560) -+..|-|..|.+.+.....++.. -+....+.| .++..--+.+.+.+++...+.+.+.|. ..|.+ T Consensus 176 ~~G~s~l~~~~~~i~~~~~~~~~-~~~~~~~~~--~~i~~~~~~l~~e~~~~~~~~~~~~~~-------~~g~~------ 239 (409) T protein:vir:94 176 VQGISPIDVLKNTTDFDNAVRTF-NLTEMQKPD--SFMLKYGSNVGKEKRQQVLEDFKQYYE-------ENGGI------ 239 (409) T ss_pred cccccHHHHHHHHHHHHHHHHHH-HHHhcCCCC--eeEEecCCCCCHHHHHHHHHHHHHHhh-------cCCCe------ Confidence 33456677777766666556554 344555544 233334456666655554444443332 23422 Q ss_pred hhhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHH Q lcl|NC_019530. 337 MGIAEDYWLPRREGQNATEIATVGGG-NQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKF 415 (560) Q Consensus 337 msmlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kf 415 (560) + .++ .|.+++.|.-. +.+.-++-..|-.+.+.++++||...|+..++. ..++..+..+. |.++ T Consensus 240 ~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~--~~sn~e~~~~~---f~~~ 303 (409) T protein:vir:94 240 L-FQE----------PGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNT--NFAKNEELNRF---YLQH 303 (409) T ss_pred e-ecC----------CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC--CcccHHHHHHH---HHHH Confidence 1 222 35777777532 222224444566788999999999999765442 23333333333 6555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHH Q lcl|NC_019530. 416 CAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDY 495 (560) Q Consensus 416 i~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~ 495 (560) + |+-.+ ..+.+.|- ..++++.+|.. . ..|.|..+ ++...+ +..|++.++.+-. .-+++..- T Consensus 304 ~--l~P~~-~~ie~~ln-----~~Ll~~~~~~~--~--~~i~fd~~----~ll~~d-~~~~~~~~~~~~~--~G~~T~NE 364 (409) T protein:vir:94 304 T--LLPIV-KQYEEEFN-----RKLLTKTDREK--N--RYFKFNVK----SYLRAD-SATQAEVYFKAVR--SGYYTIND 364 (409) T ss_pred H--HHHHH-HHHHHHHH-----HhhCCcccccC--c--ceEEeech----hhhccC-HHHHHHHHHHHHh--CCCcCHHH Confidence 3 33211 11222333 33456666532 2 23444332 333222 3455655554422 24677777 Q ss_pred HHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccc--cccCCCCCCcccc-cccCCcccCC Q lcl|NC_019530. 496 VMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSG--SDVSPLKFKPETI-PFSGSTDDSI 560 (560) Q Consensus 496 i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~--~~~~~~~~~p~~~-~~~~~~~~~~ 560 (560) ++. ++.+.+- |. ||.-. ....++....+.- +..||.++.= T Consensus 365 ~R~-~~g~~p~--------------------~g----gD~~~~~~n~~~~~~~~~~~~~~kGG~~n~~ 407 (409) T protein:vir:94 365 IRE-WEDLPPV--------------------EG----GDKPLISGDLYPIDTPLELRKSLKGGDKNVN 407 (409) T ss_pred HHH-HhCCCCC--------------------CC----cCeEeecccccccccchhhcccccCCCCCcC Confidence 763 4555322 11 11110 0111111100000 1122211111 No 80 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=97.66 E-value=3.1e-05 Score=45.30 Aligned_cols=400 Identities=12% Similarity=0.093 Sum_probs=171.0 Q ss_pred chhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhh Q lcl|NC_019530. 6 RGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQ 85 (560) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~ 85 (560) -|+|.. +.. +. ++..+ ..+...| ... ....+..+ + -+.... T Consensus 1 MG~~~~---~~~---~~------~~~~~-----~~~~~~~------~~~--~~~g~~~~-------~-------~~~al~ 41 (411) T protein:vir:81 1 MGWWSR---LTR---FF------RPRNE-----TVDMTNP------LLL--QWLGVDPD-------T-------PRNQLS 41 (411) T ss_pred CchHHH---HHh---hc------cCccc-----ccccchH------HHH--HHhcCccc-------C-------hhhhhc Confidence 223221 110 00 00000 0010000 000 00000000 0 123457 Q ss_pred ccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhC--------CchhhHHHHHHhhhhhh Q lcl|NC_019530. 86 QPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMD--------FDNTAYQKIRKWYVDGR 157 (560) Q Consensus 86 ~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~--------f~~~~~~~~R~wYvDGr 157 (560) +|-|..||.-|.+.+.- -|+.+--.+.++..+ .. -..++++|+ ...-.+.++..+..+|. T Consensus 42 ~~~V~~~v~~Ia~~iA~-----lp~~~~~~~~~~~~~----~~---~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gn 109 (411) T protein:vir:81 42 EATYFACLKILSESLGK-----LPLKMYQKTERGIVK----SD---REELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGN 109 (411) T ss_pred cHHHHHHHHHHHHhHhh-----CceeEEEecCCceee----ec---ccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCC Confidence 89999999777765432 122221111111000 00 111222222 12223445556677787 Q ss_pred hhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccce Q lcl|NC_019530. 158 QAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQEL 237 (560) Q Consensus 158 ~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~ 237 (560) -|..++.| ...+.+|++|+|..+..++. + .+ .......-+|.|. . ...+... T Consensus 110 a~~~i~r~----~g~~~~l~~l~~~~v~~~~~---~--~~--~~~~~~~~~~~~~---------~--------~~~g~~~ 161 (411) T protein:vir:81 110 AYVWCQYS----GPQLQALWILPSQYVTIVVD---D--RG--LLGEKNAIWYRYN---------D--------PYDGKMY 161 (411) T ss_pred eEEEEEec----CCceEEEEEECCceEEEEEc---C--cc--cccccceEEEEEE---------e--------cCCceEE Confidence 77766654 35689999999999887531 1 11 0011111122222 1 1123455 Q ss_pred ecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhh Q lcl|NC_019530. 238 VFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKF 317 (560) Q Consensus 238 ~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~ 317 (560) .++.+.|.|..-+. +.++-+..|-+..|...+.....+++...=+----+--+-+..++ +.|.+..+++..+.+...| T Consensus 162 ~~~~~eiih~k~~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~ 239 (411) T protein:vir:81 162 VFRNDEILHFKTSV-TFDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYT-GDLNQEARDRLVKGFEQFA 239 (411) T ss_pred EEccccEEEEcCCC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC-CCCCHHHHHHHHHHHHHHh Confidence 78889888886442 335556678888888888888877776654443334345555554 4566655544443333322 Q ss_pred hhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCccc Q lcl|NC_019530. 318 KNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGG-NQLGEMDHVNYFREKLYDALMIPKSRLQEEGSIN 396 (560) Q Consensus 318 knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~ 396 (560) .-- ...| +.+ .++ .|.+++.|.-. ..+.-++-.++..+.+.++++||...|+...+.+ T Consensus 240 ~g~----~n~g------~~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t 298 (411) T protein:vir:81 240 NGS----KNAG------KII-PVP----------LGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSS 298 (411) T ss_pred cCc----cccC------Cce-ecC----------CCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC Confidence 210 0112 122 221 24566666321 1222234456889999999999999996444323 Q ss_pred ccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHH Q lcl|NC_019530. 397 IGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDR 476 (560) Q Consensus 397 l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~R 476 (560) . +++.+..+ .|.+++ |+- +...+.+.|- .++++++++.. . ..+.|..+ ++...+ ...| T Consensus 299 ~--~n~e~~~~---~f~~~~--l~P-~~~~ie~~l~-----~~ll~~~~~~~--~--~~~~fd~~----~ll~~d-~~~~ 356 (411) T protein:vir:81 299 Y--ASAEAQNL---AFYVDT--LLY-VLKQYEEEIT-----YKILSNDLISQ--G--HYFKFNVN----VILRAD-IKTQ 356 (411) T ss_pred c--hhHHHHHH---HHHHHH--HHH-HHHHHHHHHH-----hhcCChhhcCC--C--cEEEeech----hhhccC-HHHH Confidence 2 22232222 254442 332 1122223333 34456655432 2 33444333 222111 1233 Q ss_pred HHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccccc--ccccCCCCCCcccccccC Q lcl|NC_019530. 477 LASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYS--GSDVSPLKFKPETIPFSG 554 (560) Q Consensus 477 l~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~--~~~~~~~~~~p~~~~~~~ 554 (560) ++.++.+-. .-++|.+-++ +++.|. |.|. |+.. +....|+..-++-.. .| T Consensus 357 ~~~~~~~~~--~g~~t~NE~R-~~~gl~--------------------p~~g----gD~~~~~~n~~pl~~~~~~~~-kg 408 (411) T protein:vir:81 357 MDSLSTAVQ--NGIMTPNEAR-DYLDMP--------------------ADDY----GNNLMANGNYIPLSMLGANYG-KG 408 (411) T ss_pred HHHHHHHHh--CCCcCHHHHH-HHhCCC--------------------CCCC----CCeeeeccCccchhhhhhhhc-cC Confidence 443333211 1255555554 233332 2221 2221 111111111111000 12 Q ss_pred Ccc Q lcl|NC_019530. 555 STD 557 (560) Q Consensus 555 ~~~ 557 (560) |.. T Consensus 409 Gd~ 411 (411) T protein:vir:81 409 GDS 411 (411) T ss_pred CCC Confidence 211 No 81 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=97.65 E-value=3.3e-05 Score=45.17 Aligned_cols=428 Identities=13% Similarity=0.091 Sum_probs=177.8 Q ss_pred hccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHH-HhhccchhHHHHHhhcceeeecCCCceEEE Q lcl|NC_019530. 34 LLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQS-MAQQPEIRKAVDIIVNDVVTCEEDETPVTV 112 (560) Q Consensus 34 ~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~-ma~~PEvd~AI~eIvneaiv~d~~~~~V~l 112 (560) -++..-.+++.|-.-+=...+..+.......+.. +.+ ....|-. .+++|-|.+||+-|.+.+- .-|+.+ T Consensus 1 ~~~~~~~~~~~p~~~e~~~~~~~~~~~~~~~~~~--~~~---~~~~~~~~a~~~~~V~acV~~IA~~iA-----~lpl~l 70 (518) T protein:vir:10 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQ--LER---QFSLYGGIYKNQPWVRTVIAKRAQALA-----RLPVKC 70 (518) T ss_pred CcccCceeecCchhhhhhhhhhccccccccccee--ccc---ccchhhHHHhhhHHHHHHHHHHHHhhc-----cCceEE Confidence 3333333333332111111111111111111111 100 1111222 3578999999988777663 223333 Q ss_pred eeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHH----HhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhh Q lcl|NC_019530. 113 NLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIR----KWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVY 188 (560) Q Consensus 113 ~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R----~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr 188 (560) --...+. .++.....+..+++-=+-...++++.+ .+.+.|.-|..++.|. ..-+.+|++|+|.+|...+ T Consensus 71 ~~~~~~~----~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~---~G~~~~L~~l~p~~v~v~~ 143 (518) T protein:vir:10 71 MFTSGDT----ETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK---SGTPEKLMPMHPSRVAIKR 143 (518) T ss_pred EEEcCCC----ceeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECC---CCcEEEEEEECCCceEEEE Confidence 2222111 111111112222222223344445444 4456788888766544 3348899999998776532 Q ss_pred hhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeecccccCCCC-cchhhhHHHH Q lcl|NC_019530. 189 IVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLANG-IVPGLLNPAI 267 (560) Q Consensus 189 ~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~~-~i~syL~~Ai 267 (560) .. .+ ...+|.|... .+ ..+..+.++.+.|.|.+. ...++. ..+|-|..|. T Consensus 144 ----~~-~~-------~~~~y~~~~~------~~---------~~~~~~~~~~~eViHir~--~s~dg~~~G~spi~~a~ 194 (518) T protein:vir:10 144 ----NS-RT-------GRYEYYFQAG------AG---------VGTQLVSFADDEVVPIRF--FNPDGLERGLSLMESLK 194 (518) T ss_pred ----cC-CC-------CEEEEEEEec------CC---------ccceEEEecCCcEEEecC--CCCCcccccccHHHHHH Confidence 11 11 1223444321 00 112345688888888863 232332 4678899999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccc Q lcl|NC_019530. 268 RPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPR 347 (560) Q Consensus 268 K~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpR 347 (560) +++.....+++...=+----+.-+-|.-++ +.|.+..+++--+.+-..|+-- ...|. .+ .++ T Consensus 195 ~~i~~~~a~~~~~~~~f~ng~~p~gil~~~-~~ls~e~~~~~k~~~~~~~~G~----~nag~------v~-vL~------ 256 (518) T protein:vir:10 195 STIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQFDRAHSGS----SNTGK------TM-VVE------ 256 (518) T ss_pred HHHHHHHHHHHHHHHHHhcCCCccEEEecC-CCCCHHHHHHHHHHHHHHhcCc----cccCc------ce-EcC------ Confidence 888888888777433322224445555555 4455544433222222222210 11121 11 232 Q ss_pred cCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHH-HHHHHHHHHH Q lcl|NC_019530. 348 REGQNATEIATVGG-GNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKF-CAGLRRRYSH 425 (560) Q Consensus 348 ReGgrgTEIsTLpG-g~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kf-i~rLr~rfs~ 425 (560) + |.+++.|.= ...+.-++-.+|..+.+.++++||...|+..++.+ .+++.+..+. |.++ |.-+-.++.. T Consensus 257 -~---G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t--~sn~eq~~~~---f~~~tL~P~l~~ie~ 327 (518) T protein:vir:10 257 -E---GMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRAT--FSNISAQMRA---FYRDTMAIPIARIQS 327 (518) T ss_pred -C---CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCC--chhHHHHHHH---HHHHHHHHHHHHHHH Confidence 2 445555431 12222344456888999999999999996433322 2334443333 5544 3333333333 Q ss_pred HHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCH Q lcl|NC_019530. 426 FFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSD 505 (560) Q Consensus 426 if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tD 505 (560) -|... ++++.+ -..+++|+ . .++... -+..|++.++.+-.- -++|.+-++ ..++|.. T Consensus 328 ~ln~~---------L~~~~~----~~~~~~fd--~----~~llr~-D~~~r~~~~~~~~~~--G~lT~NE~R-~~~Gl~p 384 (518) T protein:vir:10 328 AMDKY---------VGQYWV----RKNRMKFD--I----DDVIQP-DWEAKSESTQKMVNS--GVATPNEGR-EIMGLPR 384 (518) T ss_pred HHHHh---------hccccc----CCceEEEe--c----hhhhcc-CHHHHHHHHHHHHhC--CCcCHHHHH-HHhCCCC Confidence 22222 233221 12223332 2 222221 123556555554322 478888887 4577754 Q ss_pred HHHHHHHHH--------H----HHhhh---cCCCCCCCccccccccccccCCCCCCcccccccCC-cccCC Q lcl|NC_019530. 506 EEVKEQQTK--------I----AEEKK---KGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGS-TDDSI 560 (560) Q Consensus 506 eEI~e~~kq--------I----e~E~~---~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~-~~~~~ 560 (560) .|-..-++. + +...+ .+.-++|.+. ++.++ +-.+.+-.|..-|-+-+ .+++. T Consensus 385 ie~~~gD~~~~~~n~~pl~~~~~~~~~g~~~~~~~~~~~~-~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 452 (518) T protein:vir:10 385 SDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPAST-PVASL--DQSPPTSVPGLSPTNSDRSTDSG 452 (518) T ss_pred CCCCCCCeeeecccceecccccccccCCCCCCCCCCCCcc-ccccc--cccccccCCCCCccccccccccc Confidence 331000000 0 00000 0000111110 00000 00111111111111100 01111 No 82 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=97.59 E-value=4.1e-05 Score=44.65 Aligned_cols=418 Identities=11% Similarity=0.042 Sum_probs=177.3 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) |+.++....--+ +..+.......+...+..+++....-+. . .+.. . +. +..+++|-|..||. T Consensus 1 MG~f~~lf~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~g~-~----~~~~--v-~~-------~~al~~~~v~~ci~ 62 (422) T protein:vir:13 1 MGFLRGLFNKKN---NNDEKRSNYDEDIGIDISDSNFWEKFGI-K----LNFS--V-RG-------KRALKENTVYVCTK 62 (422) T ss_pred CchhhhhhhccC---CccchhhhhhhccccccCcchhhhhccc-c----CCcc--c-ch-------hhhhccHHHHHHHH Confidence 333332211000 0011111111112222222222211110 0 0000 0 00 02245788999995 Q ss_pred HhhcceeeecCCCceEEEeeecccch-hHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCe Q lcl|NC_019530. 95 IIVNDVVTCEEDETPVTVNLDKVEGI-SDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGI 173 (560) Q Consensus 95 eIvneaiv~d~~~~~V~l~Ld~~~~~-s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI 173 (560) -|.+.+.- -|+.+- ...+.. ...+-..+..+=. ..+....=.+.++..+++.|.-|..++.|.. .-+ T Consensus 63 ~ia~~iA~-----lp~~~~-~~~~~~~~~~~~~lL~~~PN---~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~---G~~ 130 (422) T protein:vir:13 63 IRAESIGK-----LSLKIY-KDKEEYKEHELYYLLRYKPN---PLMSSINFWKCLETQRTLKGNAYAYIERDRK---GKI 130 (422) T ss_pred HHHHhhhh-----CceEEE-ecCcccccchHHHHHhhhcc---cCCCHHHHHHHHHHHHhhcCCeEEEEEECCC---CcE Confidence 55544321 222221 111110 1111111111110 1111112344556667888988888776653 348 Q ss_pred eEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeeccccc Q lcl|NC_019530. 174 KKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEP 253 (560) Q Consensus 174 ~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~ 253 (560) .+|++|+|.+|..+. +.++. .......+|.|.+ ..+....++.+.|.|.... .. T Consensus 131 ~~L~~i~~~~v~~~~-----~~~~~--~~~~~~~~y~~~~------------------~~g~~~~~~~~eiih~~~~-~~ 184 (422) T protein:vir:13 131 IGLYPINSDNVTKII-----DDDNF--LSSLSKVWYVVTD------------------KNGKEHKLLPDEMLHFIGD-IT 184 (422) T ss_pred EEEEEECCcceEEEE-----cCCcc--eeccceEEEEEEe------------------CCCeEEEEcccceEEEcCC-CC Confidence 999999999888653 12221 1111122333321 1223456778888777632 23 Q ss_pred CCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCc Q lcl|NC_019530. 254 LANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGN 333 (560) Q Consensus 254 ~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~ 333 (560) .+.-+..|-+..|.+++.....+++...=+----|--+-|...+ ++|-+..+++..+.+...|.- +.+. T Consensus 185 ~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g----------~~n~ 253 (422) T protein:vir:13 185 LDGLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYV-GDLDEKAKKIFKKEFESMSNG----------LENA 253 (422) T ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeC-CCCCHHHHHHHHHHHHHHhcC----------cccc Confidence 34445678999999999888888776543333334566677776 466655555444444333321 1111 Q ss_pred ccchhhHhhhcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHH Q lcl|NC_019530. 334 AHLMGIAEDYWLPRREGQNATEIATVGG-GNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRF 412 (560) Q Consensus 334 ~~~msmlEDywLpRReGgrgTEIsTLpG-g~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF 412 (560) .+.+ .++ + |++++.|.= ...+.-++-.++....+.++++||...|...++.+ .++.++.. ..| T Consensus 254 ~~~~-vl~-------~---g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~--~sn~e~~~---~~f 317 (422) T protein:vir:13 254 HSIS-LLP-------F---GYQFQPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERAT--FNNLTEQQ---KDF 317 (422) T ss_pred CCce-ecC-------C---CceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHH---HHH Confidence 1222 221 2 344444421 12222344556788899999999999997544322 23323222 224 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhh Q lcl|NC_019530. 413 SKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFS 492 (560) Q Consensus 413 ~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S 492 (560) ..+ .|+--+. .+.+.| -..++++.+.. ....+.|..+ ++... -+..|++.++.+-.- -++| T Consensus 318 ~~~--~l~P~~~-~ie~~l-----~~~Ll~~~~~~----~g~~i~fd~~----~l~r~-d~~~~~~~~~~~~~~--G~~T 378 (422) T protein:vir:13 318 YVT--TLQSSLT-VYEQEI-----QDKLFSQYETL----QDVKAEFNVD----TILRS-DIKTRYEAYRIGIQG--GFIE 378 (422) T ss_pred HHH--HHHHHHH-HHHHHH-----HHhhCChhhhc----CCceEEeech----hhhcC-CHHHHHHHHHHHHhC--CCcC Confidence 332 1221111 111222 22345554421 2233344322 22111 122345544433221 3666 Q ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccccc--ccccCCCCCCcccccccCCcccC Q lcl|NC_019530. 493 IDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYS--GSDVSPLKFKPETIPFSGSTDDS 559 (560) Q Consensus 493 ~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~--~~~~~~~~~~p~~~~~~~~~~~~ 559 (560) .+-+++ .|+|..- |. ||.- +....++..-++..+-.|.+.+- T Consensus 379 ~NE~R~-~~gl~p~--------------------~g----gD~~~~~~n~~~l~~~~~~~~~~g~~~g~ 422 (422) T protein:vir:13 379 ANEARR-RENLPPV--------------------EG----GDRLLVNGNMIPIEMAGEQYKKGGEKGGK 422 (422) T ss_pred HHHHHH-HhCCCCC--------------------CC----cCeeeeccCccchhhcccccccCCCcCCC Confidence 666663 3555331 11 1111 11111111112222222222222 No 83 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=97.51 E-value=5.3e-05 Score=44.04 Aligned_cols=431 Identities=10% Similarity=0.052 Sum_probs=166.7 Q ss_pred hhhhcccccch-hhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHH------ Q lcl|NC_019530. 21 KVDTDKLAQKQ-DELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAV------ 93 (560) Q Consensus 21 ~~~~~~~~~~~-~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI------ 93 (560) -++-..++.|. -++. ..++-+..+.+.. . +..-+..-..-+.+|+.+..+.+-...| T Consensus 1 ~~~~~~~~~~~~~~e~---~~~~~~~~~~~~~-~------------i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~ 64 (478) T protein:vir:10 1 MISINWPWDKPYHEQV---VEQIKPKYETQEE-M------------ILRLVREHKENIDNITMGERYYNHHPDILDAPPK 64 (478) T ss_pred CccccCCCCchhHHHH---HHHHhhccCCcHH-H------------HHHHHHHHHHHHHHHHHHHHHhcCCCchhccccc Confidence 11111111110 0000 0000000000000 0 0000000111223344443333322211 Q ss_pred ---------------------HHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHh Q lcl|NC_019530. 94 ---------------------DIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKW 152 (560) Q Consensus 94 ---------------------~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~w 152 (560) ..||+ ..+.=--..||++..++. ... +.+..+++ .+|+....++.+.+ T Consensus 65 ~~~~~~~~~~~~~~ki~~n~~~~ivd-~~~~~l~g~~~~~~~~~d-----~~~----~~l~~~~~-n~~~~~~~~~~~~~ 133 (478) T protein:vir:10 65 RDVNGDYDETKPDWRMYTNYHQNLVD-QKVAYAVANPVTFGVDND-----KAL----KQIQHTLN-HKWDDKLVDILTAA 133 (478) T ss_pred cccccccccccccceeccchHHHHHH-HHHhhhccCCeeeecCCh-----HHH----HHHHHHHh-cCHHHHHHHHHHHH Confidence 11111 111112245555544432 122 33344444 47899999999999 Q ss_pred hhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccC-Ccccccc--ccccchheeecCcccc-ccccccccccc Q lcl|NC_019530. 153 YVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAM-REGGIEA--IESVTLKYYYNPNYNR-NQFTGQSGTSQ 228 (560) Q Consensus 153 YvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~-~~g~~~~--~~~~~e~~~Y~p~~~~-~~~~g~~~~~~ 228 (560) ++.|+-|.+.-+|. +|-..+..+||+.+-++..-.... ....+.. .+....+-+|.+..-. +...+...... T Consensus 134 ~~~G~~~~~~~~d~----~g~~~~~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~ 209 (478) T protein:vir:10 134 SNKGIEWVQPYVDE----EGEFKTFRVPAEQAVPIWTNKERDELQAFIRVYELDGAERVEYWTKDDVTYYELKEGQLIPD 209 (478) T ss_pred HhcCeEEEEEEecC----CCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCeeecc Confidence 99999998776654 466789999999887763211000 0011111 1111112233332111 10111000000 Q ss_pred cc-------------cccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEE Q lcl|NC_019530. 229 NF-------------QPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTEDATVIYAITRAPEKRAF 294 (560) Q Consensus 229 ~~-------------~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~EDalVIyRitRAPeRRvF 294 (560) .. ..+-.--++|. |.|. ++....|=++..+....-+. ++=+....-+-++.|-+-+. T Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~~vPv--v~~~-------n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~ 280 (478) T protein:vir:10 210 FYRSDDHIQPHYYQGNKLMSWGRVPF--IPFK-------NNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILK 280 (478) T ss_pred ccccccccccceecccccccCCccce--EEec-------cCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeee Confidence 00 00000001121 1111 22233344444433333333 44455555566677755443 Q ss_pred EccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchH-HHHHH Q lcl|NC_019530. 295 YLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEM-DHVNY 373 (560) Q Consensus 295 yIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei-~DV~Y 373 (560) =.+..+.. +... .+....=++++.-+|+ ++.+|-...+...+ ..+.- T Consensus 281 g~~~~~~~-----~~~~------------------------~~~~~~~~~~~~~~~~---~~~~l~~~~~~~~~~~~~~~ 328 (478) T protein:vir:10 281 GYEGEDMK-----DFMH------------------------NLKYYKAISVAGESGS---GVDTIKVEVPIDSVKEYTKM 328 (478) T ss_pred cCCccccc-----hhhh------------------------hhhhcceEEecCCCCC---cceEEeecCChHHHHHHHHH Confidence 22221110 0000 0000001244443443 35555444444433 55677 Q ss_pred HHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhh Q lcl|NC_019530. 374 FREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPF 453 (560) Q Consensus 374 F~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~ 453 (560) +.+.+|+-.++|---. +++. |...+..|..-+..-..-+.+.+..|...+...++.-+-+.|+ + .+|. . T Consensus 329 l~~~i~~~s~~p~~~~---~~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~g~-~-~~~~-----~ 397 (478) T protein:vir:10 329 LRDYIIEFGQGVDFQQ---DKFG-NSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRL-D-VKVQ-----D 397 (478) T ss_pred HHHHHHHHhCccccCc---cccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-C-cccc-----c Confidence 7888999999984221 1111 1222233443333334445666666666666666554444554 2 2332 2 Q ss_pred ceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccc Q lcl|NC_019530. 454 IKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETG 533 (560) Q Consensus 454 I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g 533 (560) |.+.|..----.+. +.+++++.+ .| .+|.+++++. |...++ .+++.++|++|..+- .+....-..| T Consensus 398 i~i~f~~~~p~d~~-------e~a~~~~kl---~g-~iS~et~~~~-l~~v~D-~~~E~~ri~~E~~~~-~~~~~~~~~~ 463 (478) T protein:vir:10 398 IEITFNFNVMVNEL-------ENSQIAMNS---TG-LLSKETILSN-HAWVED-PVAEMERIEQENIEL-NQQLPDIEEG 463 (478) T ss_pred ceEEecCCCCCCHH-------HHHHHHHHH---hC-CCChHHHHHh-CCCCCC-HHHHHHHHHHHHHHH-Hhhccccccc Confidence 55666432211222 234455555 34 4899999976 566432 333344444443211 1110000111 Q ss_pred cccccccCCCCCCcc Q lcl|NC_019530. 534 NYSGSDVSPLKFKPE 548 (560) Q Consensus 534 ~~~~~~~~~~~~~p~ 548 (560) ....+.....--+|| T Consensus 464 ~~~~~~~~~~~~~~~ 478 (478) T protein:vir:10 464 LNGEQQRQSENNQPE 478 (478) T ss_pred cCCCCCCCCCCCCCC Confidence 111111111111222 No 84 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=97.50 E-value=5.5e-05 Score=43.94 Aligned_cols=400 Identities=12% Similarity=0.109 Sum_probs=176.6 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) |.+ ......+..++..++...+. .-++...+.++.. ++.+++|-|..||. T Consensus 1 m~f--------------~~~~~~~~~~~~~~~~~~~~--~~g~~~~~~~v~~--------------~~al~~~~v~~~i~ 50 (409) T protein:vir:10 1 MLF--------------RKGFKNQSQEISIDDKKILE--WLGINPSETYVNG--------------KSCLKQATVFGCIR 50 (409) T ss_pred Ccc--------------cccccCcCCCCCCChHHHHH--HhcCCcCcceech--------------hhhhccHHHHHHHH Confidence 221 11111122222222211111 1011111112211 23457899999998 Q ss_pred HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHH-----hCCchhhHHHHHHhhhhhhhhhhheecCCCC Q lcl|NC_019530. 95 IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHL-----MDFDNTAYQKIRKWYVDGRQAYHVIVDPTNK 169 (560) Q Consensus 95 eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l-----l~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~ 169 (560) -|-+.+.-+ |+.|--+ .++ .+..+ ..-..++|+. +....=.+.++..+++.|.-|..++.|.. T Consensus 51 ~ia~~ia~l-----p~~~~~~-~~~-~~~~~---~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~-- 118 (409) T protein:vir:10 51 ILSDNISKL-----PIKIYQK-KDG-IKRVP---DHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKN-- 118 (409) T ss_pred HHHHhhhhC-----ceEEEEe-cCC-eeecc---CchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC-- Confidence 776655422 2222111 111 00000 0111122221 11112234455667778988887666543 Q ss_pred ccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeec Q lcl|NC_019530. 170 KGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDS 249 (560) Q Consensus 170 k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hs 249 (560) .-+.+|.+|+|..|..+. +..+. ......-+|.|.+ ..+....++.+.|.|.+. T Consensus 119 -G~~~~L~~i~~~~V~v~~-----~~~~~--~~~~~~~~y~~~~------------------~~g~~~~~~~~evih~r~ 172 (409) T protein:vir:10 119 -GEIKGLYPLKSDGMKIFV-----DDTGL--LNSENNVWYLYTD------------------DLGQRHKFMSDEILHFKG 172 (409) T ss_pred -CcEEEEEEEcCCceEEEE-----cCCcc--ccccceEEEEEEe------------------CCceeEEeccccEEEecC Confidence 348899999998877542 11121 1111112223321 123355788888888873 Q ss_pred ccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCc Q lcl|NC_019530. 250 GEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGK 329 (560) Q Consensus 250 Gl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGe 329 (560) ...++-...|-|+.|..++.....+++...=+=--.+.-+-|..++ +.|.+..+++ +++.+++... | T Consensus 173 --~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~l~~e~~~~-~~~~~~~~~~--------g- 239 (409) T protein:vir:10 173 --LTADGLAGLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYA-GDLNPEAEEV-FKENFERMSS--------G- 239 (409) T ss_pred --cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC-CCCCHHHHHH-HHHHHHHHhc--------c- Confidence 2445556678999999999888888776554433334456677766 4566554433 3333332211 1 Q ss_pred ccCcccchhhHhhhcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHH Q lcl|NC_019530. 330 ITGNAHLMGIAEDYWLPRREGQNATEIATVGG-GNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQE 408 (560) Q Consensus 330 v~d~~~~msmlEDywLpRReGgrgTEIsTLpG-g~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRD 408 (560) .++..+.+ +++ .|++++.|.- ...+.-++-.++..+.+.++++||.+-|+..++.+ .+++++..+. T Consensus 240 ~~n~~~~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~--~~~~e~~~~~ 306 (409) T protein:vir:10 240 LKNAHRIA-MLP----------IGYKFEPISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRAT--HSNITEQNRE 306 (409) T ss_pred ccccCCce-ecC----------CCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCc--cccHHHHHHH Confidence 11111222 222 2566666632 12333345567889999999999999997544322 3333333333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhc Q lcl|NC_019530. 409 ELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVG 488 (560) Q Consensus 409 ElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vG 488 (560) |..++ |+- +...+.+.|-. .++++.+.. .. ..+.|.-+. +... -+..|.+.++.+ ++ T Consensus 307 ---f~~~~--l~P-~~~~ie~~ln~-----kL~~~~~~~--~~--~~~~fd~~~----ll~~-d~~~~~~~~~~~---~~ 363 (409) T protein:vir:10 307 ---FYIDT--LQS-ILNMYELEINY-----KLFLISEIK--NG--FYSKFNVDT----ILRA-DIKTRYESYKEA---IQ 363 (409) T ss_pred ---HHHHH--HHH-HHHHHHHHHHH-----hhcCchhcc--CC--cEEEEechh----hhcc-CHHHHHHHHHHH---Hh Confidence 54432 322 11222333333 334555432 12 233343332 2211 122344443332 22 Q ss_pred -chhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccccc--ccccCCCCCCcccccccCCcc Q lcl|NC_019530. 489 -SIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYS--GSDVSPLKFKPETIPFSGSTD 557 (560) Q Consensus 489 -ky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~--~~~~~~~~~~p~~~~~~~~~~ 557 (560) -+++.+-++. +|++.. -|. ||.. +....++..-++-.+ -||-- T Consensus 364 ~G~~T~NE~R~-~lgl~p--------------------~~g----gD~~~~~~n~~~~~~~~~~~~-kgGe~ 409 (409) T protein:vir:10 364 NGFKTPNEIRE-LEEDEP--------------------LEG----GDVLLINGNMIPVKMAGEQYS-KGGEK 409 (409) T ss_pred CCCcCHHHHHH-HhCCCC--------------------CCC----cCeeeeccCccchhhcccccc-ccCCC Confidence 2666666653 344421 111 2211 111111111111111 11111 No 85 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=97.48 E-value=5.9e-05 Score=43.76 Aligned_cols=398 Identities=11% Similarity=0.090 Sum_probs=180.3 Q ss_pred chhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhh Q lcl|NC_019530. 6 RGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQ 85 (560) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~ 85 (560) --||+.-|++..++.++...-+. +.......+.+-.. .++..+ ..+..++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~-------------~~~~~v------------~~~~a~~ 50 (412) T protein:vir:26 1 MNVIAKENIVTRIKKKLIDNWID-----QSTSKLYDFSPWKN-------------RSFWGV------------INNTLET 50 (412) T ss_pred CccchhhhhhhhhhhhHhhhhhc-----ccccccccccccCC-------------cccccc------------chhhhhc Confidence 34555555555555444333110 00000000110000 011111 1124568 Q ss_pred ccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHH-hCCchhh----HHHHHHhhhhhhhhh Q lcl|NC_019530. 86 QPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHL-MDFDNTA----YQKIRKWYVDGRQAY 160 (560) Q Consensus 86 ~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l-l~f~~~~----~~~~R~wYvDGr~~~ 160 (560) +|-|..||+-|.+.+.-. |+.+- ...+. .+ .....+|+. =+-...+ +.++..++++|.-|. T Consensus 51 ~~~v~~~i~~ia~~iA~l-----p~~~~-~~~~~-~~-------~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~ 116 (412) T protein:vir:26 51 NETIFSAITKLSNSMASL-----PLKMY-EDYKV-VN-------TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYV 116 (412) T ss_pred cHHHHHHHHHHHHhHhhC-----ceeEe-ecccc-cc-------chHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEE Confidence 899999998777766543 22221 11111 11 112223321 1112233 445667788898887 Q ss_pred hheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecc Q lcl|NC_019530. 161 HVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFD 240 (560) Q Consensus 161 hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~ 240 (560) .++.|. ..-+.+|.+|+|..|...+. .+++ .-+|.|.. ..+....++ T Consensus 117 ~i~r~~---~G~~~~L~~l~~~~v~v~~~-----~~~~-------~~~y~~~~------------------~~g~~~~~~ 163 (412) T protein:vir:26 117 LIERDI---YHQPSKLFLLNPDVVEMLIE-----NQSR-------ELYYSIHA------------------ATGNKLIVH 163 (412) T ss_pred EEEECC---CCcEEEEEEEcCceeEEEEe-----CCCc-------EEEEEEEc------------------CCceEEEEc Confidence 666554 33378999999988876431 1121 11233321 112345677 Q ss_pred hhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhh Q lcl|NC_019530. 241 DESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNR 320 (560) Q Consensus 241 ~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk 320 (560) .+.|.|..- .-..+.-+..|-|..|.+.......+++. .++...+.| . +....-+.|-+.++++..+.+.+.+. T Consensus 164 ~~evih~~~-~~~~~~~~G~s~i~~~~~~i~~~~a~~~~-~~~~~~~~~-~-~i~~~~~~l~~e~~~~~~~~~~~~~~-- 237 (412) T protein:vir:26 164 NMDMLHFKH-IVASNMVQGISPIDVLKNTTDFDNAVRTF-NLTEMQKPD-S-FMLKYGSNVGKEKRQQVLEDFKQYYE-- 237 (412) T ss_pred cccEEEeCC-CCCCCCcccccHHHHHHHHHHHHHHHHHH-HHHhcCCCC-c-eEEecCCCCCHHHHHHHHHHHHHHhh-- Confidence 787877741 11223333456677777766666556655 344444443 3 33334566776666665555443332 Q ss_pred eeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHH---HHHHHHHHHHhcCCCccccCCCCcccc Q lcl|NC_019530. 321 NAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDH---VNYFREKLYDALMIPKSRLQEEGSINI 397 (560) Q Consensus 321 ~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~D---V~YF~~kLy~aL~VP~sRl~~~~~~~l 397 (560) ..|.+ + .++ .|.+++.|. .+.-+++- ..|-.+.+.++++||...|+..++. T Consensus 238 -----~~g~~------~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~-- 291 (412) T protein:vir:26 238 -----ENGGI------L-FQE----------PGVEIEPLP--KKYVSEDIVASENLTRERVANVFQLPSVFLNARSNT-- 291 (412) T ss_pred -----cCCCe------e-ecC----------CCceEEEcC--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC-- Confidence 22321 1 121 356777774 33333333 3356688999999999999765442 Q ss_pred cccchhhhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHH Q lcl|NC_019530. 398 GGSNLAEITQEELRFSKFC-AGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDR 476 (560) Q Consensus 398 ~~~~~~eItRDElkF~Kfi-~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~R 476 (560) ..+++++..+. |.+++ .-+.. .+.+.|-. +++++.+|.. . ..|.|..+ ++.... +.+| T Consensus 292 ~~sn~e~~~~~---f~~~~l~P~~~----~ie~~ln~-----kLl~~~~~~~--~--~~~~fd~~----~l~~~d-~~~~ 350 (412) T protein:vir:26 292 NFAKNEELNRF---YLQHTLLPIVK----QYEEEFNR-----KLLTKTDREK--N--RYFKFNVK----SYLRAD-SATQ 350 (412) T ss_pred CcccHHHHHHH---HHHHHHHHHHH----HHHHHHHh-----hcCCcccccC--c--ceEEeech----hhhccC-HHHH Confidence 33334444444 55552 22221 12233333 3455666532 2 23444332 222222 3345 Q ss_pred HHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccc--cccCCCCCCcccccccC Q lcl|NC_019530. 477 LASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSG--SDVSPLKFKPETIPFSG 554 (560) Q Consensus 477 l~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~--~~~~~~~~~p~~~~~~~ 554 (560) ++.++.+-.- -+++.+-+++ +|+|.+ -|. ||... ....++...++.-+..+ T Consensus 351 ~~~~~~~~~~--G~~t~NE~R~-~~gl~p--------------------~~g----gD~~~~~~n~~~~~~~~~~~~~~~ 403 (412) T protein:vir:26 351 AEVYFKAVRS--GYYTINDIRE-WEDLPP--------------------VEG----GDKPLISGDLYPIDTPLELRKSLK 403 (412) T ss_pred HHHHHHHHhC--CCcCHHHHHH-HhCCCC--------------------CCC----cCeeeecccccccccchhhccccc Confidence 5554433221 3566666663 344422 122 22211 11122222222222222 Q ss_pred CcccCC Q lcl|NC_019530. 555 STDDSI 560 (560) Q Consensus 555 ~~~~~~ 560 (560) |-+..- T Consensus 404 gG~~n~ 409 (412) T protein:vir:26 404 GGDKNV 409 (412) T ss_pred CCCCCc Confidence 211111 No 86 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=97.42 E-value=7e-05 Score=43.36 Aligned_cols=451 Identities=10% Similarity=0.070 Sum_probs=207.5 Q ss_pred hhhhhhhhhhhhhc--ccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHh--hcc Q lcl|NC_019530. 12 FGGGGLVNAKVDTD--KLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMA--QQP 87 (560) Q Consensus 12 ~~~~~~~~~~~~~~--~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma--~~P 87 (560) .|...-+++|+..- ++-.+.-.+..+ ...+..|+. ..+-|++|+.|. .+| T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~-~~~i~~~~~-------------------------~~~~I~~w~~~Y~g~~~ 54 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSIND-HEKINIDPN-------------------------ELARIERNLRQYEGDYP 54 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhc-CCceecCHH-------------------------HHHHHHHHHHHhcCCCc Confidence 33333444443221 111111111111 111222211 222333333332 122 Q ss_pred chh-------------------HHHHHhhcceeeecCCCceEEEeeecccchh--HHHHHHHHHHHHHHHHHhCCchhhH Q lcl|NC_019530. 88 EIR-------------------KAVDIIVNDVVTCEEDETPVTVNLDKVEGIS--DTVKESITECFKEVMHLMDFDNTAY 146 (560) Q Consensus 88 Evd-------------------~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s--~sik~~I~eeF~~il~ll~f~~~~~ 146 (560) +|. .+|-.=....+ -..+|++.+++.+..+ ........+..+.+++-.+|..... T Consensus 55 ~~~~~~~~~~~~~~~~~sl~~~~~i~~~~A~Ll----~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~ 130 (517) T protein:vir:98 55 QVEYINSQGKIQERDYMTLNLRKLSADVLSGLV----FNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLS 130 (517) T ss_pred ccccccccccccccceeecCcHHHHHHHhhhhh----cCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHH Confidence 221 01100001111 1134556665533211 1122345677889999999999999 Q ss_pred HHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCcccccc------cc---ccchhee----ecC Q lcl|NC_019530. 147 QKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEA------IE---SVTLKYY----YNP 213 (560) Q Consensus 147 ~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~------~~---~~~e~~~----Y~p 213 (560) +.+.....-|..+|-..+|.. -..+.+++|-++-+++ .+..+.+.. .+ .-..||+ +.+ T Consensus 131 ~~~e~a~a~G~~a~k~~~d~~-----~~~I~~v~ad~~~Pl~----~~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~ 201 (517) T protein:vir:98 131 DYLEPTFALGGLTVRPYVDNG-----EIEFSWALANAFYPLR----SNSNGISEGVMKSVTTKVIGNKTVYYTLLEFHEW 201 (517) T ss_pred HHHHHHhhhCCEEEEEEEeCC-----eeEEEEEcCCeeEEEE----ecCCCeEEEEEEEEEEEeecCCceEEEEEEEEec Confidence 999999999999998888843 3458899998886642 122121111 00 0111221 111 Q ss_pred cccc-----c--------ccccccccccc----ccccc-cceecc-hhceeeeec-----ccccCCCCcchhhhHHHHHH Q lcl|NC_019530. 214 NYNR-----N--------QFTGQSGTSQN----FQPSQ-QELVFD-DESIVYIDS-----GEEPLANGIVPGLLNPAIRP 269 (560) Q Consensus 214 ~~~~-----~--------~~~g~~~~~~~----~~~~~-~~~~i~-~daI~y~hs-----Gl~~~~~~~i~syL~~AiK~ 269 (560) .... + ..+..+.+..+ .++.. ..+.++ .+--.|+|. --.+......+|-++.|+-. T Consensus 202 ~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L~~~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~ 281 (517) T protein:vir:98 202 EKTEEGESLYVITNELYKSDNEGEIGKRIPLEELYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVST 281 (517) T ss_pred CceeccCCcEEEEEEEEecCCCccccccccccccccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHH Confidence 1100 0 00000000000 00000 000110 000112221 00011234466888888877 Q ss_pred HHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccC Q lcl|NC_019530. 270 LNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRRE 349 (560) Q Consensus 270 ~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRRe 349 (560) +--|-..-+. +.|..|.=.+|||. +..=|++.... +......++| .+++.++.+..+ . T Consensus 282 ~d~lD~~~s~--~~~e~~~g~~~i~v-p~~~l~~~~~~-------~g~~~~~~~d------~~~~~y~~~~~~------~ 339 (517) T protein:vir:98 282 LKKINDTYDQ--FWWEIKMGQRTVFV-SDVMLRTVPDE-------SGMPPPQVFD------PDVNVYKSIRMG------T 339 (517) T ss_pred HHHHHHHHHH--HHHHHHhCCcceec-ChhhhccccCC-------CCcccCCCCC------cccceeeeccCC------C Confidence 7777755444 45888887777765 22211100000 0000001111 122233322111 1 Q ss_pred CCCccceeecCCCCCc-chHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 350 GQNATEIATVGGGNQL-GEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFM 428 (560) Q Consensus 350 GgrgTEIsTLpGg~nL-gei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~ 428 (560) |+. -|+++.+.==. .-..-+.++.+.+-...++|-+-|..++. +-..|+||...+-.-..-+.+.|+.+...+. T Consensus 340 ~~~--~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~---~~kTATEi~s~~~~~~~t~~~~~~~~~~aL~ 414 (517) T protein:vir:98 340 DEE--FVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGR---SMKTATEIVSENDLTYRTRNDHVYEVEQFIK 414 (517) T ss_pred CCC--ceeeeccccchHHHHHHHHHHHHHHHHHhCCCccccccccc---ccccHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111 24444441111 12445677888888899999999987765 2245889987777777788888888888888 Q ss_pred HHHHHHHHhc---cCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCH Q lcl|NC_019530. 429 EFLRRQLILK---GVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSD 505 (560) Q Consensus 429 d~Lk~qLiLk---giit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tD 505 (560) ++++..|.|. ++++..-+. ...+.++|- |+.+.. ++++.- .+.++-.- | .+|....+++...+|| T Consensus 415 ~lv~~i~~l~~~~~~~~~~~~~---~~~v~v~f~-D~i~~D-~~~~~~-----~~~~~v~a-G-~ms~~~~i~~~~g~~e 482 (517) T protein:vir:98 415 GLVISVLELAKTYKLFGGEIPS---AEHIGVDFD-DGVFQD-RSALLR-----FYGQAKTF-G-FIPTVEAIQRIFKVPK 482 (517) T ss_pred HHHHHHHHHHHHHhhcCCCCCC---CcceEEEcC-CCCCCC-HHHHHH-----HHHHHHhc-C-CCCHHHHHHHhCCCCh Confidence 8888776443 222221111 012455553 323322 222221 11121111 3 4788887777889999 Q ss_pred HHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcc Q lcl|NC_019530. 506 EEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPE 548 (560) Q Consensus 506 eEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~ 548 (560) +|.+++-.+|++|.... -|-+..+.+.+.. +-+.| T Consensus 483 eeA~~e~~~i~~E~~~~-~~~~~~~~~~~~~-------~gd~e 517 (517) T protein:vir:98 483 KTAEQWLEEIRKDQIEL-DPVTISQRAQKRM-------FGDEE 517 (517) T ss_pred HHHHHHHHHHHHhcccc-CCCCccccccCCC-------CCCCC Confidence 99999999999997422 1112222222222 11222 No 87 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=97.41 E-value=7.3e-05 Score=43.26 Aligned_cols=419 Identities=11% Similarity=0.038 Sum_probs=172.1 Q ss_pred CCCCCceeEeccccceeccccccchh---------hHHHHHHHHHHHhhccchhHHH----------------------- Q lcl|NC_019530. 46 DAQDGSIILQGGANTYNYVGVESELL---------SVKTVVEEYQSMAQQPEIRKAV----------------------- 93 (560) Q Consensus 46 ~~~dG~~~i~~~~~~~~y~~~e~~~~---------~~~~LI~~YR~ma~~PEvd~AI----------------------- 93 (560) -...|++.-+- .+...+...+.... ...+.+.+|+.+..+.+-...| T Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~ 79 (472) T protein:vir:93 1 MYPSQPTQTEI-FDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMI 79 (472) T ss_pred CCCCCCcchhh-hhceeeecCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhccccccccccccccc Confidence 22223322211 11111222222110 1122334555555554333211 Q ss_pred ----HHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCC Q lcl|NC_019530. 94 ----DIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNK 169 (560) Q Consensus 94 ----~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~ 169 (560) -.||+ ..+.---..||++..++.+ ..+..+.+++ .+|+....++.+.+++.|+-|.+.-+|. T Consensus 80 ~n~~~~ivd-~~~~~l~g~~~~~~~~d~~---------~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~--- 145 (472) T protein:vir:93 80 TNFHANLVD-QKVSYIVGKPIAFKHTDDE---------VVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDE--- 145 (472) T ss_pred cchHHHHHH-HHhhhhcccCeeeccCChH---------HHHHHHHHHh-ccHHHHHHHHHHHHhhcCeEEEEEEECC--- Confidence 01111 1111123455555444432 2233344443 4789999999999999999887655543 Q ss_pred ccCeeEeEEeCccccchhhhhcccCC-cccccccc--ccchheeecCcccc-ccccccccc---------cccccccccc Q lcl|NC_019530. 170 KGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEAIE--SVTLKYYYNPNYNR-NQFTGQSGT---------SQNFQPSQQE 236 (560) Q Consensus 170 k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~~~--~~~e~~~Y~p~~~~-~~~~g~~~~---------~~~~~~~~~~ 236 (560) +|-..+..+||+.+-++..-....+ ...+.... .....-+|++..-+ +.+.+.... ......+..- T Consensus 146 -d~~~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (472) T protein:vir:93 146 -EGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSW 224 (472) T ss_pred -CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeecceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCC Confidence 4557899999999887742110000 01111111 00001112221111 000110000 0000000000 Q ss_pred eecchhceeeeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHH Q lcl|NC_019530. 237 LVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMG 315 (560) Q Consensus 237 ~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~ 315 (560) -.+| |++.. ++....|-++..+....-+. ++=+....-+....|-+-+.-.+.-. ..+..+. +. T Consensus 225 ~~vP---vv~~~------nn~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~-----~~~~~~~-~~ 289 (472) T protein:vir:93 225 GKIP---FIPFK------NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQE-----LPEFKRL-LR 289 (472) T ss_pred CCcc---eEEec------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCccc-----chhhHHH-Hh Confidence 0111 22221 11122344444433333332 45555555666777754443222111 1111111 11 Q ss_pred hhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCc Q lcl|NC_019530. 316 KFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGS 394 (560) Q Consensus 316 k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~ 394 (560) .++ -..++ + +..+.+|-...+.+. ..-+.-+++.+|+..++|---++.-+ T Consensus 290 ~~~-----------------------~~~~~---~--~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~- 340 (472) T protein:vir:93 290 YYG-----------------------AIKVS---D--NGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFG- 340 (472) T ss_pred hcc-----------------------ccccC---C--CCcceeEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccc- Confidence 000 01111 1 123555533333332 23466677788889998843222111 Q ss_pred ccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHH Q lcl|NC_019530. 395 INIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILN 474 (560) Q Consensus 395 ~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~ 474 (560) |...+..|.--+.....-+.+.++.|...+.++++.-+-+-|+ ..+|. .|.+.|.. +.-.- +. T Consensus 341 ---~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~--~~~~~-----~i~v~f~~-~~p~~------~~ 403 (472) T protein:vir:93 341 ---SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI--KGEHK-----DVDISFNY-NKVAN------TE 403 (472) T ss_pred ---cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Ccccc-----eeeEEeCC-CCCCC------HH Confidence 1222344555555566667888888888888887765544454 34553 25566633 22221 11 Q ss_pred HHHHHHHHhhhhhcchhhHHHHHHHHhCCCH--HHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcc Q lcl|NC_019530. 475 DRLASLNTVEPFVGSIFSIDYVMRNVLRMSD--EEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPE 548 (560) Q Consensus 475 ~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tD--eEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~ 548 (560) +.++++..+ +| .+|.+++++.+-..+| +|+++++++-++.++. .++..+...+.+.++..+..=.+| T Consensus 404 ~~~~~~~k~---~g-iis~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~---~~~~~~~~~d~~~~~~~~~~~~~e 472 (472) T protein:vir:93 404 LQVQTAQQS---MG-IVSHETVLENHPFVEDLQAELERIEQEQMEYNKQ---LPNLDDGGADGAQQQERSNNKESE 472 (472) T ss_pred HHHHHHHHH---hc-cCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHh---ccCcCcccCCCCCCCCCCCcccCC Confidence 224445554 35 4999999987433443 4444444433332211 111111111111111111122222 No 88 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=97.41 E-value=7.3e-05 Score=43.26 Aligned_cols=423 Identities=14% Similarity=0.103 Sum_probs=179.9 Q ss_pred hhhhhhhhhhhcccccchhhhccccccccCCCC-CCCCceeEec--cccceeccccccchhhHHHHHHHHHHHhhccchh Q lcl|NC_019530. 14 GGGLVNAKVDTDKLAQKQDELLLTKATVVALDD-AQDGSIILQG--GANTYNYVGVESELLSVKTVVEEYQSMAQQPEIR 90 (560) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~-~~dG~~~i~~--~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd 90 (560) ...++. +...++.+.+... ..-|...+.. +.+-..+.+-.+. +.. -+ ..+..+++|-|. T Consensus 1 ~~~~l~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~--~g~-~v-~~~~al~~~~V~ 62 (434) T protein:vir:43 1 MSKSLG--------------KVLSSATSAPRSSLFGWGGKTIRLTDGAFWSQFLGRESS--SGK-KV-TVDKAMKLSAVW 62 (434) T ss_pred Cccchh--------------hhhhhcccccchhhhcccccccccCchHHHHHHhcCCcc--CCc-ee-chhhhhccHHHH Confidence 111111 1111111111100 0001111100 0000000000000 000 00 133556789999 Q ss_pred HHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHH-HHHHHHHHHH-hCCchhhHHH----HHHhhhhhhhhhhhee Q lcl|NC_019530. 91 KAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESI-TECFKEVMHL-MDFDNTAYQK----IRKWYVDGRQAYHVIV 164 (560) Q Consensus 91 ~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I-~eeF~~il~l-l~f~~~~~~~----~R~wYvDGr~~~hkii 164 (560) .||.-|-+.+-. -|+.+--...++ .++.+ ....-++|+. =+-...++++ +..++.+|.-|..+.- T Consensus 63 ~~i~~ia~~ia~-----lp~~~~~~~~~g----~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~ 133 (434) T protein:vir:43 63 ACVRLISTSVAG-----LPLGVYERKADG----SRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRR 133 (434) T ss_pred HHHHHHHHhhhh-----CceEEEEEcCCC----ccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEe Confidence 999666655432 233332111111 11111 1112222221 1233344444 4556677876654332 Q ss_pred cCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhce Q lcl|NC_019530. 165 DPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESI 244 (560) Q Consensus 165 D~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI 244 (560) | ..-+.+|.+|+|-.|..++ +..|. ..|+++.+ .+....++.+.| T Consensus 134 ~----~G~~~~L~~l~p~~v~~~~-----~~~g~-------~~y~~~~~-------------------~g~~~~~~~~eV 178 (434) T protein:vir:43 134 A----AGRPAALDFLLPSRVDLEC-----DENGR-------LKYFYTTK-------------------KGARREIERTNM 178 (434) T ss_pred C----CCcEEEEEEEcCcceEEEE-----cCCCe-------EEEEEEec-------------------CceEEEEccccE Confidence 2 3457899999998887542 11221 11222221 134567889999 Q ss_pred eeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeec Q lcl|NC_019530. 245 VYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYD 324 (560) Q Consensus 245 ~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd 324 (560) .|.+. .+.++-..+|-+..|+..+.....+++...-+----+--.-|..++ +.|.+.+++ =+++.++++..- T Consensus 179 ih~~~--~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~l~~e~~~-~~r~~~~~~~g~---- 250 (434) T protein:vir:43 179 LHIPA--FTLDGRIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVD-RILQPAQRE-EFREYVKSVSGA---- 250 (434) T ss_pred EEecC--cCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecC-CCCCHHHHH-HHHHHHHHhcCc---- Confidence 88873 3556666778899999998888888776543332223334455554 456554443 356655544211 Q ss_pred ccCCcccCcccchhhHhhhcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchh Q lcl|NC_019530. 325 RTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGG-GNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLA 403 (560) Q Consensus 325 ~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpG-g~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~ 403 (560) ...|. .+ +++ .|.+++.|.- .+.+.-++-.++..+.+.++++||..-|+...+.+..+++.+ T Consensus 251 ~nag~------~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e 313 (434) T protein:vir:43 251 MNSGR------SP-VLE----------QGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLE 313 (434) T ss_pred cccCC------cc-ccC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHH Confidence 11232 22 232 2566666632 122222344567788899999999999975444334455544 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019530. 404 EITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTV 483 (560) Q Consensus 404 eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~ 483 (560) +..+. |.+++ |+--+.. +.+. +-.++++++++. + ..++|.-+ .+... -...|.+.++.+ T Consensus 314 ~~~~~---f~~~~--L~P~~~~-ie~~-----ln~kL~~~~~~~---~--~~~~fd~~----~llr~-d~~~r~~~~~~~ 372 (434) T protein:vir:43 314 QQMLA---FLTFS--ISSITNQ-IQQC-----VNKRLLTAPERI---R--YYAEFSLE----GFLKA-DSAGRAAWYSTM 372 (434) T ss_pred HHHHH---HHHHH--HHHHHHH-HHHH-----HHhhcCChhhhc---C--ceEEEech----hhhcc-CHHHHHHHHHHH Confidence 44444 55443 4322111 1222 223456777652 2 23444333 22211 123455555544 Q ss_pred hhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccccc-ccccCCCCCCccccccc Q lcl|NC_019530. 484 EPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYS-GSDVSPLKFKPETIPFS 553 (560) Q Consensus 484 ~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~-~~~~~~~~~~p~~~~~~ 553 (560) -. +-++|.+-+++ .+++..- +.-++-.-. .. +.+-+.-.+.+.. ....+.....|+..|-- T Consensus 373 ~~--~G~~T~NE~R~-~~gl~p~--~ggD~~~~~---~n-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 373 AQ--NGFMTRNEGRR-KENLPEL--PGGDILTVQ---SN-LVPIDQLGQSNKSQAVRAALMNWFSQPEPQE 434 (434) T ss_pred Hh--CCCcCHHHHHH-HhCCCCC--CCCCeEeec---cC-ccchhhhhccCCCcchhhhhhccCCCCCCCC Confidence 21 23778888874 4666542 100000000 00 0000000000000 00000111111112211 No 89 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=97.39 E-value=7.7e-05 Score=43.12 Aligned_cols=436 Identities=12% Similarity=0.071 Sum_probs=173.2 Q ss_pred hhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCc--eeEec--cccceeccccccc----hhhHHHHHH----- Q lcl|NC_019530. 12 FGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGS--IILQG--GANTYNYVGVESE----LLSVKTVVE----- 78 (560) Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~--~~i~~--~~~~~~y~~~e~~----~~~~~~LI~----- 78 (560) .--|--|..+++.. |. ..+.. +..+ .|...+-. ..-...+|+ T Consensus 1 ~~~~~~~~~~~~~~------------------------~~~~~~~~~~~n~~~-~~~~~e~~~~~~~~~i~~~i~~~~~~ 55 (511) T protein:vir:99 1 MLKVNEFETDTDLR------------------------GNINYLFNDEANVVY-TYDGTESDLLQNVNEVSKYIEHHMDY 55 (511) T ss_pred Cccccchhhhhhhh------------------------hhhhhhhhhhhCCcc-ccchhhhhhhccHHHHHHHHHHHHHh Confidence 00000111111110 11 01111 1111 11111111 111222332 Q ss_pred ---HHHHHhhccchhHHHH----------------------HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHH Q lcl|NC_019530. 79 ---EYQSMAQQPEIRKAVD----------------------IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFK 133 (560) Q Consensus 79 ---~YR~ma~~PEvd~AI~----------------------eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~ 133 (560) +|+.+..+.+=...|- -||+- .+.=--..||++..++. ...+.+. T Consensus 56 ~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~-~~~yl~g~p~~~~~~d~---------~~~~~l~ 125 (511) T protein:vir:99 56 QRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDF-INGYFLGNPIQYQDDDK---------DVLEAIE 125 (511) T ss_pred hHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHH-HHhhhcccCceeecCch---------HHHHHHH Confidence 3555554443222211 11110 01111234445443332 3456777 Q ss_pred HHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCC-cccccccc--------- Q lcl|NC_019530. 134 EVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEAIE--------- 203 (560) Q Consensus 134 ~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~~~--------- 203 (560) .+++..+|+....++.+...+.|+-|.+.-+|. +|-..+..+||+.+-+|..-....+ ...+.... T Consensus 126 ~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~de----d~~~~i~~~~p~~~~~vyd~~~~~~~~~~vr~~~~~~~~~~~~ 201 (511) T protein:vir:99 126 AFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ----DDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDE 201 (511) T ss_pred HHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCC----CCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecccCcc Confidence 888888999999999999999999998777654 3567899999999988753211111 11111110 Q ss_pred ccchh-eeecCcccccccccccccccccc-----ccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHH- Q lcl|NC_019530. 204 SVTLK-YYYNPNYNRNQFTGQSGTSQNFQ-----PSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTT- 276 (560) Q Consensus 204 ~~~e~-~~Y~p~~~~~~~~g~~~~~~~~~-----~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~- 276 (560) ....+ -+|++..-+.............. .+-.--.+|. |.|+. +....|-++..+.....+..+ T Consensus 202 ~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv--v~~~n-------n~~g~sd~e~v~~liDa~d~~~ 272 (511) T protein:vir:99 202 DEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPI--TEFSN-------NERRKGDYEKVITLIDLYDNAE 272 (511) T ss_pred ceEEEEEEEeCCcEEEEEecCCccccccccccccccCCCCccce--EEecC-------CCCCCCchhhhHHHHHHHHHHH Confidence 00011 14555433322221111000000 0000001221 22221 122234455444444433322 Q ss_pred HHHHHHHHHhcCccceEEE---ccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhccccc----- Q lcl|NC_019530. 277 EDATVIYAITRAPEKRAFY---LDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRR----- 348 (560) Q Consensus 277 EDalVIyRitRAPeRRvFy---IDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRR----- 348 (560) =+....-+-++.|-+.+.- .|.+.+++.+. ++ ..|++.. T Consensus 273 S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~------------~~---------------------~~~~~~~~~~~~ 319 (511) T protein:vir:99 273 SDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKE------------AN---------------------VLFLEPTVYADS 319 (511) T ss_pred HHHHHHHHHhhchhhhhccCcccCchhhccccc------------cc---------------------ceeccccccccc Confidence 1222222333444433321 11111111000 00 1222221 Q ss_pred ---CCCCccceeecCCCCCcchHH-HHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 349 ---EGQNATEIATVGGGNQLGEMD-HVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYS 424 (560) Q Consensus 349 ---eGgrgTEIsTLpGg~nLgei~-DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs 424 (560) ..+.|..+..|-...+...+. -+.-+.+.+|+...+|---.+. |. |...+..|..-+..-..-+.+.++.|. T Consensus 320 ~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~---~~-gn~Sg~Alk~~~~~l~~ka~~k~~~~~ 395 (511) T protein:vir:99 320 EGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDN---FS-GTQSGEAMKYKLFGLEQRTKTKEGLFT 395 (511) T ss_pred ccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc---cc-ccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 112234455554444443333 3566677888888888532221 11 222234454444444455666666666 Q ss_pred HHHHHHHHHHHHhcc----CCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHH Q lcl|NC_019530. 425 HFFMEFLRRQLILKG----VTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNV 500 (560) Q Consensus 425 ~if~d~Lk~qLiLkg----iit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~I 500 (560) .-+.+.++.-+-+-+ +-.+.+|. .|.+.|...---.+ .+.++.+..+ +| .+|.+++++. T Consensus 396 ~~l~~~~~li~~~~~~~~~~~~~~~~~-----~i~i~f~~~~p~n~-------~e~~~~~~kl---~G-iiS~et~l~~- 458 (511) T protein:vir:99 396 KGLRRRAKLLETILKNTRSIDVSKDFN-----TVRYVYNRNLPKSL-------IEELKAYIDS---GG-KISQTTLMSL- 458 (511) T ss_pred HHHHHHHHHHHHHHHhcCCcccccccc-----cceEEeCCCCCcCH-------HHHHHHHHHH---hc-cCCHHHHHHh- Confidence 666666654322212 22344553 25666754322112 2344455555 35 5999999977 Q ss_pred hCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 501 LRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 501 L~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) |...+ +.+++.++|++|..+. .++......++..+.+.. ...+++.++. T Consensus 459 l~~v~-D~~~E~~ri~~E~~~~-~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~ 507 (511) T protein:vir:99 459 FSFFQ-DPELEVKKIEEDEKES-IKKAQKNMYQDPRNINDD---------EQDDSTKDSI 507 (511) T ss_pred CCCCC-CHHHHHHHHHHHHHHH-HHHHhhcccccCCCCCCC---------CCCCCCcCcc Confidence 55543 2334444555554321 111111111111111110 0011222222 No 90 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=97.37 E-value=8.2e-05 Score=43.00 Aligned_cols=418 Identities=13% Similarity=0.109 Sum_probs=171.6 Q ss_pred hhhcccccchhhhcccc--ccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccch---------- Q lcl|NC_019530. 22 VDTDKLAQKQDELLLTK--ATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEI---------- 89 (560) Q Consensus 22 ~~~~~~~~~~~e~~~~k--~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEv---------- 89 (560) .+..|+-...+++..+. +...+.- ......--..+.+.|+....++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~--------------------~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~ 60 (474) T protein:vir:10 1 MTLYKLIDDIEAQGILPKHIEALIES--------------------HKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKE 60 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHH--------------------hhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhh Confidence 11111100000000000 0000000 000000011122333332111100 Q ss_pred --------------------hHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHH Q lcl|NC_019530. 90 --------------------RKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKI 149 (560) Q Consensus 90 --------------------d~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~ 149 (560) -+=...||+-.+-+ --..||++..+... ..-+.+.+.+..+++-.+|+....++. T Consensus 61 ~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~y-l~g~pv~~~~~~~~----~~~e~~~~~l~~~~~~n~~~~~~~~~~ 135 (474) T protein:vir:10 61 DFETGGNVRRLDVSVNNKLNNSFDSEIVDTRVGY-LHGVPVTYDLDENA----EKNEKLKKFITNFAIRNSVDDEDSEIG 135 (474) T ss_pred hhhhcccccccccCcccccccchHHHHHHhHhhh-eeccceeEeeCCCC----cchHHHHHHHHHHHhhcCHhHHHHHHH Confidence 01111222211111 22457777775432 233567777888888889999999999 Q ss_pred HHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccc-----------cchheeecCccccc Q lcl|NC_019530. 150 RKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIES-----------VTLKYYYNPNYNRN 218 (560) Q Consensus 150 R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~-----------~~e~~~Y~p~~~~~ 218 (560) +...+.|+-|.+.-+|. +|-..++.+||+.+-+|.. +....+..+.- +....+|++..- + T Consensus 136 ~~~~~~G~a~~~~~~d~----~~~~~~~~i~p~~~~~v~d----~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~-~ 206 (474) T protein:vir:10 136 KMAAICGYGARLAYIDT----NGDIRIKNIDPYNVIFVGD----NILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYY-Y 206 (474) T ss_pred HHHhhcCeEEEEEEeCC----CCeeEEEEEcccceEEEEc----CCCceEEEEEEEEEeeCCCceEEEEEEEEcCceE-E Confidence 99999999887665554 4557899999999876641 11011111100 001123433221 1 Q ss_pred ccccccc-ccccccccccce-ecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEE Q lcl|NC_019530. 219 QFTGQSG-TSQNFQPSQQEL-VFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVT-TEDATVIYAITRAPEKRAFY 295 (560) Q Consensus 219 ~~~g~~~-~~~~~~~~~~~~-~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m-~EDalVIyRitRAPeRRvFy 295 (560) .+.+... .-....+..... .+| +++. +++....|-++..+.....+.. +-+....-+-++.|-+-+.- T Consensus 207 ~~~~~~~~~~~~~~~~~~~~g~vP-----vv~~----~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g 277 (474) T protein:vir:10 207 VFRGEGIDALQEVGRYEHLFDYNP-----LFGV----PNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRG 277 (474) T ss_pred EEeecCCCcccccccccCCCCccc-----eEEe----cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcc Confidence 1111110 000000000000 111 1111 1333345666666655554443 33333344445555443332 Q ss_pred ccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcc-hHHHHHHH Q lcl|NC_019530. 296 LDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLG-EMDHVNYF 374 (560) Q Consensus 296 IDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLg-ei~DV~YF 374 (560) . .++. +-+..+ +. .|. .|++- .+..+..|--..+.. ...-++-+ T Consensus 278 ~---~~~~----~~~~~~----~~-------~~~-------------i~~~~----~~~~~~~l~~~~~~~~~~~~~~~l 322 (474) T protein:vir:10 278 M---GMSE----EMIQET----QK-------SGA-------------FELFD----KDMDVKYLTKDVNDTMIENHLDRI 322 (474) T ss_pred C---CCCc----hhhhhh----hh-------cce-------------eEecC----CCCceeEEeccCCHHHHHHHHHHH Confidence 1 1121 111111 00 111 12211 112344443333332 23445667 Q ss_pred HHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC----CChhHHHHhh Q lcl|NC_019530. 375 REKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGV----TDEKDWNEKI 450 (560) Q Consensus 375 ~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgi----it~eew~~~i 450 (560) .+.+|....+|-- ..+ ++. |...+..|..-......-+.+.++.|..-+...++.-+-+-++ ..+.+|. T Consensus 323 ~~~I~~~s~~p~~--~~~-~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~--- 395 (474) T protein:vir:10 323 EKNIMRFAKSVNF--NSD-EFN-GNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYL--- 395 (474) T ss_pred HHHHHHHhCCccc--ccc-ccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccc--- Confidence 7888898888842 111 111 2222333433333334445666666666666665554432222 2344442 Q ss_pred hhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCC-----CC Q lcl|NC_019530. 451 KPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGL-----YP 525 (560) Q Consensus 451 ~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~-----~~ 525 (560) .|.+.|..----.+.. .+++++.+ +| .+|.+++++. |...+ +.+++-++|++|..+.. .. T Consensus 396 --~i~~~f~~~~p~d~~e-------~a~~~~kl---~g-~iS~et~~~~-l~~v~-d~~~E~eri~~E~~e~~~~~~~~~ 460 (474) T protein:vir:10 396 --NLIFKFTRNIPVNKLE-------ESQVLINL---KG-QVSERTRLGQ-SQLVD-DVDYELDEMEKESLEFNDKLPDID 460 (474) T ss_pred --cceEEeCCCCCCCHHH-------HHHHHHHH---hc-cCchHHHHHh-CCCCC-CHHHHHHHHHHHHHHHHhhccccc Confidence 3677776543333333 33444444 25 4999999976 65543 24445555555542221 11 Q ss_pred CCCccccccccccccCCCCCCcc Q lcl|NC_019530. 526 KVQADETGNYSGSDVSPLKFKPE 548 (560) Q Consensus 526 ~p~~~e~g~~~~~~~~~~~~~p~ 548 (560) .++++++.++. .+| T Consensus 461 ~~~~~~~~~~~---------~s~ 474 (474) T protein:vir:10 461 EGDANDKSQNN---------QSE 474 (474) T ss_pred CCCcCCCCccc---------cCC Confidence 12222111111 111 No 91 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=97.37 E-value=8.2e-05 Score=43.00 Aligned_cols=418 Identities=13% Similarity=0.109 Sum_probs=171.6 Q ss_pred hhhcccccchhhhcccc--ccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccch---------- Q lcl|NC_019530. 22 VDTDKLAQKQDELLLTK--ATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEI---------- 89 (560) Q Consensus 22 ~~~~~~~~~~~e~~~~k--~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEv---------- 89 (560) .+..|+-...+++..+. +...+.- ......--..+.+.|+....++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~--------------------~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~ 60 (474) T protein:vir:94 1 MTLYKLIDDIEAQGILPKHIEALIES--------------------HKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKE 60 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHH--------------------hhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhh Confidence 11111100000000000 0000000 000000011122333332111100 Q ss_pred --------------------hHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHH Q lcl|NC_019530. 90 --------------------RKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKI 149 (560) Q Consensus 90 --------------------d~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~ 149 (560) -+=...||+-.+-+ --..||++..+... ..-+.+.+.+..+++-.+|+....++. T Consensus 61 ~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~y-l~g~pv~~~~~~~~----~~~e~~~~~l~~~~~~n~~~~~~~~~~ 135 (474) T protein:vir:94 61 DFETGGNVRRLDVSVNNKLNNSFDSEIVDTRVGY-LHGVPVTYDLDENA----EKNEKLKKFITNFAIRNSVDDEDSEIG 135 (474) T ss_pred hhhhcccccccccCcccccccchHHHHHHhHhhh-eeccceeEeeCCCC----cchHHHHHHHHHHHhhcCHhHHHHHHH Confidence 01111222211111 22457777775432 233567777888888889999999999 Q ss_pred HHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccc-----------cchheeecCccccc Q lcl|NC_019530. 150 RKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIES-----------VTLKYYYNPNYNRN 218 (560) Q Consensus 150 R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~-----------~~e~~~Y~p~~~~~ 218 (560) +...+.|+-|.+.-+|. +|-..++.+||+.+-+|.. +....+..+.- +....+|++..- + T Consensus 136 ~~~~~~G~a~~~~~~d~----~~~~~~~~i~p~~~~~v~d----~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~-~ 206 (474) T protein:vir:94 136 KMAAICGYGARLAYIDT----NGDIRIKNIDPYNVIFVGD----NILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYY-Y 206 (474) T ss_pred HHHhhcCeEEEEEEeCC----CCeeEEEEEcccceEEEEc----CCCceEEEEEEEEEeeCCCceEEEEEEEEcCceE-E Confidence 99999999887665554 4557899999999876641 11011111100 001123433221 1 Q ss_pred ccccccc-ccccccccccce-ecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEE Q lcl|NC_019530. 219 QFTGQSG-TSQNFQPSQQEL-VFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVT-TEDATVIYAITRAPEKRAFY 295 (560) Q Consensus 219 ~~~g~~~-~~~~~~~~~~~~-~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m-~EDalVIyRitRAPeRRvFy 295 (560) .+.+... .-....+..... .+| +++. +++....|-++..+.....+.. +-+....-+-++.|-+-+.- T Consensus 207 ~~~~~~~~~~~~~~~~~~~~g~vP-----vv~~----~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g 277 (474) T protein:vir:94 207 VFRGEGIDALQEVGRYEHLFDYNP-----LFGV----PNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRG 277 (474) T ss_pred EEeecCCCcccccccccCCCCccc-----eEEe----cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcc Confidence 1111110 000000000000 111 1111 1333345666666655554443 33333344445555443332 Q ss_pred ccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcc-hHHHHHHH Q lcl|NC_019530. 296 LDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLG-EMDHVNYF 374 (560) Q Consensus 296 IDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLg-ei~DV~YF 374 (560) . .++. +-+..+ +. .|. .|++- .+..+..|--..+.. ...-++-+ T Consensus 278 ~---~~~~----~~~~~~----~~-------~~~-------------i~~~~----~~~~~~~l~~~~~~~~~~~~~~~l 322 (474) T protein:vir:94 278 M---GMSE----EMIQET----QK-------SGA-------------FELFD----KDMDVKYLTKDVNDTMIENHLDRI 322 (474) T ss_pred C---CCCc----hhhhhh----hh-------cce-------------eEecC----CCCceeEEeccCCHHHHHHHHHHH Confidence 1 1121 111111 00 111 12211 112344443333332 23445667 Q ss_pred HHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC----CChhHHHHhh Q lcl|NC_019530. 375 REKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGV----TDEKDWNEKI 450 (560) Q Consensus 375 ~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgi----it~eew~~~i 450 (560) .+.+|....+|-- ..+ ++. |...+..|..-......-+.+.++.|..-+...++.-+-+-++ ..+.+|. T Consensus 323 ~~~I~~~s~~p~~--~~~-~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~--- 395 (474) T protein:vir:94 323 EKNIMRFAKSVNF--NSD-EFN-GNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYL--- 395 (474) T ss_pred HHHHHHHhCCccc--ccc-ccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccc--- Confidence 7888898888842 111 111 2222333433333334445666666666666665554432222 2344442 Q ss_pred hhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCC-----CC Q lcl|NC_019530. 451 KPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGL-----YP 525 (560) Q Consensus 451 ~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~-----~~ 525 (560) .|.+.|..----.+.. .+++++.+ +| .+|.+++++. |...+ +.+++-++|++|..+.. .. T Consensus 396 --~i~~~f~~~~p~d~~e-------~a~~~~kl---~g-~iS~et~~~~-l~~v~-d~~~E~eri~~E~~e~~~~~~~~~ 460 (474) T protein:vir:94 396 --NLIFKFTRNIPVNKLE-------ESQVLINL---KG-QVSERTRLGQ-SQLVD-DVDYELDEMEKESLEFNDKLPDID 460 (474) T ss_pred --cceEEeCCCCCCCHHH-------HHHHHHHH---hc-cCchHHHHHh-CCCCC-CHHHHHHHHHHHHHHHHhhccccc Confidence 3677776543333333 33444444 25 4999999976 65543 24445555555542221 11 Q ss_pred CCCccccccccccccCCCCCCcc Q lcl|NC_019530. 526 KVQADETGNYSGSDVSPLKFKPE 548 (560) Q Consensus 526 ~p~~~e~g~~~~~~~~~~~~~p~ 548 (560) .++++++.++. .+| T Consensus 461 ~~~~~~~~~~~---------~s~ 474 (474) T protein:vir:94 461 EGDANDKSQNN---------QSE 474 (474) T ss_pred CCCcCCCCccc---------cCC Confidence 12222111111 111 No 92 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=97.37 E-value=8.2e-05 Score=42.98 Aligned_cols=402 Identities=12% Similarity=0.090 Sum_probs=177.8 Q ss_pred cccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH------------ Q lcl|NC_019530. 27 LAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD------------ 94 (560) Q Consensus 27 ~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~------------ 94 (560) +..+.-.+..+ .-..-+.+|+.+..+.+-+..|- T Consensus 1 l~~~~l~~~i~----------------------------------~~~~~~~r~~~l~~yy~g~~~il~~~~~~~~~~~~ 46 (429) T protein:vir:98 1 MTKDLLSELIQ----------------------------------KHRSFNLSYSAYKQLYEGDHAILQQKQKEQYKPDN 46 (429) T ss_pred CCHHHHHHHHH----------------------------------HHHHHHHHHHHHHHHhccccccccccccccCCCcc Confidence 10000000000 00112344555554544433221 Q ss_pred --------HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecC Q lcl|NC_019530. 95 --------IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDP 166 (560) Q Consensus 95 --------eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~ 166 (560) -||+ ..+.---..||++..++ +...+.++.+.+-.+|+....++.|.+++.|+-|.+..+| T Consensus 47 ki~~n~~~~ivd-~~~~~l~g~~~~~~~~~---------~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d- 115 (429) T protein:vir:98 47 RLVVNFAKYIVD-TFNGYFIGVPVQTSHEN---------KQVSNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFND- 115 (429) T ss_pred eeecchHHHHHH-HHhhhhcccCceeecCC---------hHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEec- Confidence 0111 11111122334443332 2345567777777889999999999999999988865554 Q ss_pred CCCccCeeEeEEeCccccchhhhhcccC-Cccccccc--c-ccchheeecCccccccccccccccccccccccceecchh Q lcl|NC_019530. 167 TNKKGGIKKLVMLDSRCIRPVYIVEKAM-REGGIEAI--E-SVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDE 242 (560) Q Consensus 167 ~~~k~GI~eLr~LDPr~i~~vr~i~~~~-~~g~~~~~--~-~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~d 242 (560) ++|-..++.++|+.+-++..-.... ....+... . ....+.+|.+....+...+.........-+...-++| T Consensus 116 ---~~g~~~~~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP-- 190 (429) T protein:vir:98 116 ---ENAEAGITYLTPLEAFIVYDDSIRQKPLFAVRYFYNKGGVLEGSYSDASNITYFKDGEKGIEIGESEPHPFDGVP-- 190 (429) T ss_pred ---CCCcEEEEEEcccceEEEEeCCCCCceEEEEEEEEecCceEEEEEEeCceEEEEEecCCceEecccccccCCccc-- Confidence 3578899999999987663211100 01111110 1 1111112222211111111100000000000000111 Q ss_pred ceeeeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhhe Q lcl|NC_019530. 243 SIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRN 321 (560) Q Consensus 243 aI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~ 321 (560) |++. .++....|-++..+.....+. ++-+....-+..+.|-+-+.=.+. .. +-+++++. +++ T Consensus 191 -vv~~------~n~~~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~~~---~~----~~~~~~~~---~~~ 253 (429) T protein:vir:98 191 -MIEY------VENEERQSLLASVVTLINAFNKAISEKANDVEYFADAYLKILGAEL---DD----ETLKSLRD---TRI 253 (429) T ss_pred -eEEe------cCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCC---Cc----chhhhHhh---Cce Confidence 1211 122334455666555555553 344555556777778776653222 11 11111110 111 Q ss_pred eecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCCCCccccccc Q lcl|NC_019530. 322 AYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDH-VNYFREKLYDALMIPKSRLQEEGSINIGGS 400 (560) Q Consensus 322 vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl~~~~~~~l~~~ 400 (560) . .+|- +||.+..+..|--..+.+.... ++-+.+.+|+...+|- +..++. |.. T Consensus 254 ~---------------------~~~~-~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~--~~~~~~---gn~ 306 (429) T protein:vir:98 254 I---------------------NLKD-TDAQQLTVEFLQKPDADATQEHLLDRLENLIFRTAMVAN--ISDESF---GTA 306 (429) T ss_pred e---------------------eccC-CCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCccc--cCcccc---ccc Confidence 1 1222 3344555665544445544443 6788888899999993 333332 223 Q ss_pred chhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC-hhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHH Q lcl|NC_019530. 401 NLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTD-EKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLAS 479 (560) Q Consensus 401 ~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit-~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~ 479 (560) .+..|.--+.....-+.+.|+.|..-+...++.-+-+-++.. +.+|. .|.+.|..--.-.+. +.+++ T Consensus 307 Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~d~~-----~i~v~f~~~~p~~~~-------~~a~~ 374 (429) T protein:vir:98 307 SGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKIGPKDWI-----GIKYKFTRNLPANLL-------EESQI 374 (429) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccc-----cceEEeCCCCCcCHH-------HHHHH Confidence 334454444445555666666666666665544433333322 22332 366777543222221 23445 Q ss_pred HHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcc Q lcl|NC_019530. 480 LNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPE 548 (560) Q Consensus 480 l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~ 548 (560) +..+ +| .+|.+++++. |...++ -+++.++|++|..+- .. .++++.. ++.++. .+| T Consensus 375 ~~kl---~g-~is~et~~~~-l~~v~d-~~~E~~ri~~E~~~~-~~----~~~~~~~-~~~~~~--~~~ 429 (429) T protein:vir:98 375 AGNL---AG-IVSEETQVGV-LSIVEN-PQKEIERKNSDKSTL-IS----RQAGGLN-GQNTTT--ILE 429 (429) T ss_pred HHHH---hc-cCchHHHHHh-CCCCCC-HHHHHHHHHHHHHHH-HH----HHHhhhc-CCCCCC--CCC Confidence 5554 34 5999999966 565432 234455666664321 01 1111111 111111 111 No 93 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=97.36 E-value=8.6e-05 Score=42.88 Aligned_cols=459 Identities=14% Similarity=0.064 Sum_probs=184.2 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHH Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEY 80 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~Y 80 (560) |.-|--+-+++..+..+.+.++-.+ +...-+.....+..+ .....--.... ....+|.+-...+.........+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~i~~~i~---~~~~~~~~~~~--~~~~yY~g~~~~i~~~~~~~~~~ 74 (481) T protein:vir:10 1 MTVYTINNINTKFSPLANDDFVVSD-LAELLKEENLRNFIS---RHQTEQVPRLE--MLESYYLNRNTDILAGERRLQKY 74 (481) T ss_pred CeeEeeehhchhcccccCceeeeec-chhhcCHHHHHHHHH---HHHHHHHHHHH--HHHHHhcCCCcccccCccccccc Confidence 7777666666555544444332211 000000000000000 00000000000 00001111100000000000000 Q ss_pred HHH----hhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhh Q lcl|NC_019530. 81 QSM----AQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDG 156 (560) Q Consensus 81 R~m----a~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDG 156 (560) ..- ...+=+.-+|+..+ .---..||++..++. ...+.+..+++-.+|+....++.+..++-| T Consensus 75 ~~~~~~ki~~n~~~~ivd~~~-----~~l~g~~~~~~~~d~---------~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G 140 (481) T protein:vir:10 75 GDKADHRAVHNYAKYVSRFIV-----GYLTGNPITITHQDN---------QTNDKIIELNDLNDADEVNSDLALNLSIYG 140 (481) T ss_pred cccccceeecchHHHHHHHHH-----hhhccCCceEecCCh---------hHHHHHHHHHHhcChhHHHHHHHHHHHhcC Confidence 000 01122222222222 112245666665543 334466677777899999999999999999 Q ss_pred hhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCC-cccccccc-------ccchheeecCccccccccccccccc Q lcl|NC_019530. 157 RQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEAIE-------SVTLKYYYNPNYNRNQFTGQSGTSQ 228 (560) Q Consensus 157 r~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~~~-------~~~e~~~Y~p~~~~~~~~g~~~~~~ 228 (560) +-|.+.-+|. +|-..+..+||+++-++..-....+ ...+..+. .+...-+|.+..-+....+.+.... T Consensus 141 ~~~~~~~~d~----dg~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~ 216 (481) T protein:vir:10 141 RAYEIVYRDF----EDRDTFKVLDPKSTFVVYDQTLDKKVVAGVRYFEKQDKDKVPVQHVEVYTTDKIYYIEIKGGTYHR 216 (481) T ss_pred eEEEEEEeCC----CCeEEEEEEcccceEEEEcCCCCCceEEEEEEEEEeeCCCceEEEEEEEecCeEEEEEecCCceee Confidence 9998777653 4778899999999877632111000 00011111 0111123443322211111111000 Q ss_pred cccccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHH Q lcl|NC_019530. 229 NFQPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTEDATVIYAITRAPEKRAFYLDVGTLGKKSAE 307 (560) Q Consensus 229 ~~~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAe 307 (560) ...-+-.--++| |++.. ++....|-++..+....-+. ++-+....-+.++.|-+-+. |..+ T Consensus 217 ~~~~~~~~g~vP---vv~~~------n~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~----g~~~----- 278 (481) T protein:vir:10 217 VEEVEHYYNDVP---IIEYL------NDQFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAII----GNVD----- 278 (481) T ss_pred cccccccCCcee---EEEee------cCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEee----cCcC----- Confidence 000000000122 12211 12222344444333333221 23333334445555554432 2111 Q ss_pred HHHHHHHHhhhhheeecccCC-cccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHH-HHHHHHHHHHHhcCCC Q lcl|NC_019530. 308 EYMTMMMGKFKNRNAYDRTTG-KITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMD-HVNYFREKLYDALMIP 385 (560) Q Consensus 308 qYl~~im~k~knk~vYd~~TG-ev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~-DV~YF~~kLy~aL~VP 385 (560) .|..+| .++.++.+......-+.+ .+.+.++..|....+...+. .++-.++.+|....+| T Consensus 279 ---------------~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p 340 (481) T protein:vir:10 279 ---------------LDSEDAKAFRDANMIHLEPGTNANG---SEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTP 340 (481) T ss_pred ---------------CCccchhhhhhccceeccccccccC---CCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCc Confidence 011111 111111111111111111 12223455554434443333 3566777888998998 Q ss_pred ccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHH Q lcl|NC_019530. 386 KSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIR 465 (560) Q Consensus 386 ~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~ 465 (560) ---++.- +.+ ..+..|.........-+.+.|+.|...+.++++.-+-+-++-...+++. ..|.+.|...-.-. T Consensus 341 ~~~~~~~-~~n---~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~---~~i~v~f~~~~~~~ 413 (481) T protein:vir:10 341 DLNDEQF-SGV---QSGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLKQHNY---AELTITFTPNLPKS 413 (481) T ss_pred ccccccc-ccc---cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccc---ceeeEEeCCCCCcC Confidence 4322221 112 2234454444556667888888888888887765444434433333321 23677775443222 Q ss_pred HHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCH---HHHHHHHHHHHHhhhcC---CCCCCCccccccccccc Q lcl|NC_019530. 466 EQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSD---EEVKEQQTKIAEEKKKG---LYPKVQADETGNYSGSD 539 (560) Q Consensus 466 Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tD---eEI~e~~kqIe~E~~~~---~~~~p~~~e~g~~~~~~ 539 (560) +. +.++++..+ +| .+|.+++++. |...+ +|++.++++-+++.+.. .++++.++ + T Consensus 414 ~~-------~~a~~~~kl---~g-~is~et~~~~-l~~i~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~--~------ 473 (481) T protein:vir:10 414 MM-------ESINAFNAL---SG-GVSESTRLSL-LDFIDNPKEELEKMQEEEAQREKQADKRGYGEAFEN--H------ 473 (481) T ss_pred HH-------HHHHHHHHH---hc-cCChHHHHHh-CCCCCCHHHHHHHHHHHHHHHHhhhhhccCCccCCC--C------ Confidence 32 344445555 34 4899999977 55543 45554444443333211 11111111 1 Q ss_pred cCCCCCCcccccccCCcccCC Q lcl|NC_019530. 540 VSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 540 ~~~~~~~p~~~~~~~~~~~~~ 560 (560) ++.||+= T Consensus 474 --------------~~~dd~~ 480 (481) T protein:vir:10 474 --------------LNVDDSN 480 (481) T ss_pred --------------CCCCCCC Confidence 1111111 No 94 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=97.24 E-value=0.00012 Score=42.12 Aligned_cols=406 Identities=11% Similarity=0.034 Sum_probs=171.0 Q ss_pred ccceeccccccchhhH-HHHHHHHHHHhhccchhHHHHHhhc----------------------ceeeecCCCceEEEee Q lcl|NC_019530. 58 ANTYNYVGVESELLSV-KTVVEEYQSMAQQPEIRKAVDIIVN----------------------DVVTCEEDETPVTVNL 114 (560) Q Consensus 58 ~~~~~y~~~e~~~~~~-~~LI~~YR~ma~~PEvd~AI~eIvn----------------------eaiv~d~~~~~V~l~L 114 (560) +..-.-.+.-+.+..+ .....+|+.+..+.+-+..|..+-. ++.+..-..+||++.. T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~ 80 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhccCCeecCC Confidence 0000000000000000 1122234444444444433322111 1111111122333322 Q ss_pred ecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccC Q lcl|NC_019530. 115 DKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAM 194 (560) Q Consensus 115 d~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~ 194 (560) ..-. .+.+.+..+.+-.+|+....++++.-++.|+-|.+.-.|. +|-..++.++|+.+-.+..=.... T Consensus 81 ~~d~--------~~~~~~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~e----dg~~~i~~~~p~~~~~i~d~~~~~ 148 (456) T protein:vir:79 81 SADS--------DLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD----DGTATITADSPETMVVSVDPLQPW 148 (456) T ss_pred CCCc--------cHHHHHHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeCC----CCceEEEEeccceeEEEEcCCCCC Confidence 1111 1223445566667899999999999999999877544332 344578899998876653210000 Q ss_pred C-cccccccc---ccc-hheeecCcccc-cc-----ccccccccccccccccce-------ecchhceeeeecccccCCC Q lcl|NC_019530. 195 R-EGGIEAIE---SVT-LKYYYNPNYNR-NQ-----FTGQSGTSQNFQPSQQEL-------VFDDESIVYIDSGEEPLAN 256 (560) Q Consensus 195 ~-~g~~~~~~---~~~-e~~~Y~p~~~~-~~-----~~g~~~~~~~~~~~~~~~-------~i~~daI~y~hsGl~~~~~ 256 (560) + ...+.... ... ...+|.+.... +. +...+.... .....+.. .+..=.|++. ++ T Consensus 149 ~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~pvv~~-------~N 220 (456) T protein:vir:79 149 RIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLV-TRISDSWVPVGDAVVTGSPPPVVVY-------QN 220 (456) T ss_pred ceEEEEEEEEecCCceeEEEEEcCCceEEEEEEEEeeccccceee-eccCCceeecccccCCCCceeEEEe-------cC Confidence 0 00011110 000 01112211110 00 000000000 00000000 0000111111 11 Q ss_pred CcchhhhHHHHHHHHH--------HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCC Q lcl|NC_019530. 257 GIVPGLLNPAIRPLNN--------LVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTG 328 (560) Q Consensus 257 ~~i~syL~~AiK~~Nq--------L~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TG 328 (560) ....|=++..+..... +..+|.....+|+.-....-.+-.|..+-+- . ..+.++ +..| T Consensus 221 ~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i----~----~~~~~~------~~~~ 286 (456) T protein:vir:79 221 PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAI----D----YASIFE------AAPG 286 (456) T ss_pred CCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCccccccccccccc----c----hhhhhh------hhcc Confidence 1223344444333322 2333443444444433222122222111000 0 001111 0111 Q ss_pred cccCcccchhhHhhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhH Q lcl|NC_019530. 329 KITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLG-EMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQ 407 (560) Q Consensus 329 ev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLg-ei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItR 407 (560) . .|+ +..|.+|..++.. +++ -.+-++-+-..++...++|..-|...++ |. | +..|.- T Consensus 287 ~-------------~~~----~~~~~~~~q~~~~-~~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~-N~--S-g~Al~~ 344 (456) T protein:vir:79 287 A-------------LWE----LPPGVDIWESQTN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQ--S-AEGAHN 344 (456) T ss_pred c-------------ccc----CCCCcceeeeccc-ChHHHHHHHHHHHHHHHhhcCCChhHhccccc-Cc--H-HHHHHH Confidence 0 122 1123445445542 222 2344677777888888999988875433 22 2 345666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_019530. 408 EELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFV 487 (560) Q Consensus 408 DElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~v 487 (560) -+..+-.-+.+.|+.|..-+...++.-+.+.|.. +++ .|.+.|. ++-+..+ .+.++++..+.. T Consensus 345 ~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~g~~--~~~------~i~v~w~-~~~~~s~------~~~ada~~kl~~-- 407 (456) T protein:vir:79 345 IEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES--VED------TVDVSFE-SPDRVTL------GEKYSAASLAKA-- 407 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--ccc------cceEEeC-CCCCcCH------HHHHHHHHHHHh-- Confidence 6666778888999999999999999888888852 222 2556664 3333221 244555555432 Q ss_pred cchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccc Q lcl|NC_019530. 488 GSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSG 537 (560) Q Consensus 488 Gky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~ 537 (560) ..+.|.++++ .+|.+|++||++.+.+-.+|..++...+|.+..+-+.+- T Consensus 408 ~G~~~~~~~~-~~lg~~~~~i~~~e~~r~~~e~~~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:79 408 AGESWASIRR-NILNYNADQIKQDDLDRAREQITLFAGNPVQRPQEDGSR 456 (456) T ss_pred cCCChHHHHH-hcCCCCHHHHHHHHHHHHHHHHHHHhhhHhhcCCCCCCC Confidence 1356666654 789999999975444333333333333332221111111 No 95 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=97.19 E-value=0.00014 Score=41.80 Aligned_cols=432 Identities=12% Similarity=0.040 Sum_probs=200.5 Q ss_pred ccccccccCCCCCCCCceeEec---------cc---cceeccccccch-----hhHHHHHHHHHHHhhccchhHHHHHhh Q lcl|NC_019530. 35 LLTKATVVALDDAQDGSIILQG---------GA---NTYNYVGVESEL-----LSVKTVVEEYQSMAQQPEIRKAVDIIV 97 (560) Q Consensus 35 ~~~k~~s~~~~~~~dG~~~i~~---------~~---~~~~y~~~e~~~-----~~~~~LI~~YR~ma~~PEvd~AI~eIv 97 (560) .++-.+.-++-..+|....+.. .- ...+|.+- ..+ .-..++ +.+|.+. +=+.-||+-.+ T Consensus 1 ~~~~~~~~~~gl~~~~~~~~~~L~~~~~~~~~~~~~~~~Yy~G~-~~~~~~~~~~p~~~-r~~~~v~--nw~~~~Vd~~a 76 (474) T protein:vir:81 1 MIQQQTVRIPSLSNDENALINGLLAQIENLRWKNLLRTSYYENK-RTIQYVGTLIPPQY-FNLGLVL--GWTGKAVDALA 76 (474) T ss_pred CcCCCcCcCCCCChhHHHHHHHHHHHHHHHhhHHHHHHHHhccC-CChhhccccccHHH-HHHHhhc--ChHHHHHHHHH Confidence 1111111111111111100100 00 01111111 111 001111 2223222 22334554433 Q ss_pred cceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeE Q lcl|NC_019530. 98 NDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLV 177 (560) Q Consensus 98 neaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr 177 (560) + -++.+-. . +.+.+..+ .....|....+|+....++++.=++.||-|. +|-+..+.++-..++ T Consensus 77 ~-rl~~~Gf----~--~~d~~~~~--------~~l~~iw~~N~ld~~~~~~~~~al~~G~sf~--~V~~~~d~~~~~~i~ 139 (474) T protein:vir:81 77 R-RCNLEGF----V--WPDGDLDS--------LGGTEVVDDNHLLSEIDSAIVAAMQHGPAFL--INTVGEDDEPEALIH 139 (474) T ss_pred h-hhcccce----E--CCCCCccc--------hHHHHHHHhcChhHHHHHHHHHHHhhCceeE--EEecCCCCCceeEEE Confidence 2 1111110 0 11111001 1234566788899999999999999999986 555555566677788 Q ss_pred EeCccccchhhhhcccCCccccccc----c-ccchheeecCccccccccccccccccccccccceecchhceeeeecccc Q lcl|NC_019530. 178 MLDSRCIRPVYIVEKAMREGGIEAI----E-SVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEE 252 (560) Q Consensus 178 ~LDPr~i~~vr~i~~~~~~g~~~~~----~-~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~ 252 (560) .++|+.+-.+..=.+....-.+... + ......+|.|...+..........-..-..-...-+| .+.|+|.--+ T Consensus 140 ~~sp~~~~~~~D~~~~~~~~al~~~~~~~~g~~~~~~ly~~~~~~~~~~~~~~~~w~~~~~~~~~gvP--vV~~~n~~~~ 217 (474) T protein:vir:81 140 VKDASEATGEWNRRRRGLNNLLSIIDKDKEGKVLSLALYLDNETVTAQRDKATLKWQVDRDEHVYGVP--AQVLPYKPAP 217 (474) T ss_pred EeccceEEEEEeCCCCcceeeeEEEEEcCCCcEEEEEEEeCCcEEEEEEcCccceeeeccCCCCCCcc--eEEecccccc Confidence 8888877655311100000001100 0 1112334555433322111111000000001111133 5777776333 Q ss_pred cCCCCc------chhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCC------chHHHHHHHHHHHHhhhhh Q lcl|NC_019530. 253 PLANGI------VPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTL------GKKSAEEYMTMMMGKFKNR 320 (560) Q Consensus 253 ~~~~~~------i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnl------pk~KAeqYl~~im~k~knk 320 (560) +.+.|. +.+..+++ -|++.+.++.=...=.|.|-|+=.+-... |..+-.+|+- T Consensus 218 ~~~~G~s~i~e~v~~l~da~------~r~~~~~~~~~e~~a~pqr~i~G~~~~~~~d~d~~~~~~~~~~~~--------- 282 (474) T protein:vir:81 218 KRPFGQSRITKPMMGLQDAG------VRELARREGHMDVFSYPEFWLLGADESALKNADGTIKSVWEARLG--------- 282 (474) T ss_pred cCcCCccccchhHHHHHHHH------HHHHHHHHHHHHHhcchhheeecCChhhcccccccccchhhhhHH--------- Confidence 332221 12233333 25678888888899899988864332111 1111111111 Q ss_pred eeecccCCcccCcccchhhHhhhc-ccccC-CC----CccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCC Q lcl|NC_019530. 321 NAYDRTTGKITGNAHLMGIAEDYW-LPRRE-GQ----NATEIATVGGGNQLGEM-DHVNYFREKLYDALMIPKSRLQEEG 393 (560) Q Consensus 321 ~vYd~~TGev~d~~~~msmlEDyw-LpRRe-Gg----rgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~VP~sRl~~~~ 393 (560) ..| +|.-+ |. .+++|..+|+. +|.-. +-++=.-..+-...++|.+-|+-.+ T Consensus 283 ---------------------~i~~~~~d~d~~~~~~~~~~~~q~~~a-~l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~ 340 (474) T protein:vir:81 283 ---------------------RIKGLPDDADADIPQLARADVKQFPAA-SPDAHWSDINGLAKLFAREASLPDTAVAISG 340 (474) T ss_pred ---------------------HHhcCCCcccccccccccccccccCCC-ChhHHHHHHHHHHHHHHhhhCCCHHHhcccc Confidence 122 33322 21 23567777764 34432 2233334445556799988876322 Q ss_pred cccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHH Q lcl|NC_019530. 394 SINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAIL 473 (560) Q Consensus 394 ~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~ 473 (560) .-|. +.+..|.-.|....+-+.+.|+.|..=+...++.-+.+.|-...++|..... .+.+.| +|- +.-.+ T Consensus 341 ~~np--~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~-~~~v~W-~d~------~~~s~ 410 (474) T protein:vir:81 341 LSNP--TSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVAIDEIPDEWK-SIDAKW-RDP------RYLSK 410 (474) T ss_pred cccc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccchhhc-cceeEe-cCC------CccCH Confidence 1111 2245677778888999999999999999999999999987655444322111 244445 221 12233 Q ss_pred HHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCC-CCcccccccccc Q lcl|NC_019530. 474 NDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPK-VQADETGNYSGS 538 (560) Q Consensus 474 ~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~-p~~~e~g~~~~~ 538 (560) .++.+.+..+..- |.-+....+..++|++|+.||+.....++++.-.+.+.. .....+|..+ + T Consensus 411 a~~aDa~~Kl~~a-~~~~~~~~~~~~~lg~t~~~i~~~~~~~~~~~~~~~~~~l~~~~~~~~~a-q 474 (474) T protein:vir:81 411 SAQADAGMKQLAA-VPWLAETEVGLELIGLTPQQARRAMADKRRVQGRGTLQALIDRSNNGATA-Q 474 (474) T ss_pred HHHHHHHHHHHhc-ccCCCcHHHHHhhcCCCHHHHHHHHHHHHHHhHHHHHHHHHhcCCCCCCC-C Confidence 4556665555432 434444555668899999999987766666543222221 1111111111 1 No 96 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=97.14 E-value=0.00015 Score=41.49 Aligned_cols=419 Identities=12% Similarity=0.107 Sum_probs=180.7 Q ss_pred cchhhhccccccccCCCCCCCCceeEeccccceeccccccchhh-HHHHHHHHHHHhhccchhHHHH------------- Q lcl|NC_019530. 29 QKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLS-VKTVVEEYQSMAQQPEIRKAVD------------- 94 (560) Q Consensus 29 ~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~-~~~LI~~YR~ma~~PEvd~AI~------------- 94 (560) .|... -.-+..|...+=.. ..-..+.+ -..-+.+|+.+..+.+=+..|. T Consensus 1 ~~~~~-----~~~~~~~~~~~~~~------------~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~k 63 (452) T protein:vir:36 1 MKYKP-----PKLMTFSKDEPITV------------EVVTKFMEKHKLEVARYEYLKNMYLGIMAIDDEPAKDSWKPDNR 63 (452) T ss_pred CcccC-----ceeEEcCCccCCCH------------HHHHHHHHHHHHHHHHHHHHHHHhccccccccCccccccCccce Confidence 00000 00011111111000 00001101 0111234444444443322110 Q ss_pred -------HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCC Q lcl|NC_019530. 95 -------IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPT 167 (560) Q Consensus 95 -------eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~ 167 (560) -|| +..+.---..||++..++ +...+.+..+++-.+|+...+++.+..++.|+-|.+.-+|. T Consensus 64 i~~n~~~~iv-d~~~~~l~g~~~~~~~~d---------~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~- 132 (452) T protein:vir:36 64 LAVNFTKYIV-DTFTGYFNGIPVKKSHSD---------KEILTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDE- 132 (452) T ss_pred eecchHHHHH-HHHhhhhcccCceeecCC---------hhHHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecC- Confidence 111 111111123444444332 23455677788888999999999999999999999877664 Q ss_pred CCccCeeEeEEeCccccchhhhhcccCC-cccccccc---ccchheeecCccccccccccccccccccccccceecchhc Q lcl|NC_019530. 168 NKKGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEAIE---SVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDES 243 (560) Q Consensus 168 ~~k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~~~---~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~da 243 (560) +|-..+..+||+.+-+|..-....+ ...+.... ...-.-+|.+..-+......+........+..--++| T Consensus 133 ---~g~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~iP--- 206 (452) T protein:vir:36 133 ---DTQTNVVYNSPENMFMVYDDTVKQEPLFAVRYGVDEDKKLQGEVYTLLETIKISGENDEISFGEGTYNPYPDLP--- 206 (452) T ss_pred ---CCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEEecCeEEEEEEcCCceEEecceeccCCccc--- Confidence 4567899999999877642111100 00010100 1111123544322221111111000000000000122 Q ss_pred eeeeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhh-hhe Q lcl|NC_019530. 244 IVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFK-NRN 321 (560) Q Consensus 244 I~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~k-nk~ 321 (560) +++. .++....|-++..+.....+. ++-+....-+..+.|-+-+.- +.++... +.+.+ +++ T Consensus 207 vv~~------~n~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g---~~~~~~~--------~~~~~~~~~ 269 (452) T protein:vir:36 207 VVEF------YFNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLG---AAVEEED--------LKNIRSNRV 269 (452) T ss_pred EEEe------cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeec---CCcCchh--------hhhhhhcce Confidence 2221 123334455555444444443 334445555667777665542 2222111 11111 121 Q ss_pred eecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHH-HHHHHHHHHHHhcCCCccccCCCCccccccc Q lcl|NC_019530. 322 AYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMD-HVNYFREKLYDALMIPKSRLQEEGSINIGGS 400 (560) Q Consensus 322 vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~-DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~ 400 (560) ..=..+ +.+.+..+.+|....+.+.+. -+.-+.+.+|.-..+|- +..++. |.. T Consensus 270 ~~~~~~---------------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~~---gn~ 323 (452) T protein:vir:36 270 INYYAD---------------------GEGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVAN--ISDESF---GSS 323 (452) T ss_pred EEecCC---------------------CCccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCccc--cCcccc---cCC Confidence 111111 122334566666555544433 35667788888899994 333322 222 Q ss_pred chhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-ChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHH Q lcl|NC_019530. 401 NLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVT-DEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLAS 479 (560) Q Consensus 401 ~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgii-t~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~ 479 (560) .+..|..-+.....-+.+.++.|...+...++.-+-+.+.. ...+|. .|.+.|...---.+ .+.+++ T Consensus 324 Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~-----~i~i~f~~~~p~d~-------~~~a~~ 391 (452) T protein:vir:36 324 SGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNVSNKDSWK-----DIEYTFTRNEPKDI-------KEQAET 391 (452) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccc-----cceEEeCCCCCcCH-------HHHHHH Confidence 33455544445556677777777777777776544333332 334553 35666653322122 233445 Q ss_pred HHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCC----CCCCCccccccccccccCC Q lcl|NC_019530. 480 LNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGL----YPKVQADETGNYSGSDVSP 542 (560) Q Consensus 480 l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~----~~~p~~~e~g~~~~~~~~~ 542 (560) ++.+ +| .+|.+++++. |..+++ .+++-++|++|..... ...+..+...+....+..+ T Consensus 392 ~~k~---~g-~iS~et~~~~-~~~~~d-~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~e 452 (452) T protein:vir:36 392 ANIL---MG-ITSQETALSV-ISVIPD-VQAEMEKIKKEEASTAIFDKDKQPSEKGTDTVVSETNEE 452 (452) T ss_pred HHHH---hc-cCChHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHHHhhccCCCCcccccCccccCC Confidence 5554 35 4899999976 565532 3344455555542211 1111111111111112221 No 97 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=97.14 E-value=0.00016 Score=41.45 Aligned_cols=417 Identities=11% Similarity=0.053 Sum_probs=167.1 Q ss_pred hhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhh-------HHHHHHHHHHHhhccch Q lcl|NC_019530. 17 LVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLS-------VKTVVEEYQSMAQQPEI 89 (560) Q Consensus 17 ~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~-------~~~LI~~YR~ma~~PEv 89 (560) -++.+-..++.|-+++ +...+...+....- -.+-+++|+.+..+.+- T Consensus 1 ~~~~~~~~~~~~~~~~--------------------------~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g 54 (468) T protein:vir:96 1 MIDIFWPNEKPYHERV--------------------------VEQIKPQYETQEEMILRLITKHKENVEDITVGERYYNH 54 (468) T ss_pred CccccCCcCceeehhe--------------------------eecccccccCcHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 1111122222221111 11111111110000 11122333333333222 Q ss_pred hHHH---------------------------HHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_019530. 90 RKAV---------------------------DIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFD 142 (560) Q Consensus 90 d~AI---------------------------~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~ 142 (560) ...| ..||+. .+.=--..||++..++. ...+.+..+++ -+|+ T Consensus 55 ~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~-~~~~l~g~p~~~~~~d~---------~~~~~l~~~~~-n~~~ 123 (468) T protein:vir:96 55 QPDVLFNAPKRNVKGEIDPFKPDWRMYTNYHQNLVDQ-KVAYAVANPVTYGTEDE---------KSLKTIQEVLN-HKWD 123 (468) T ss_pred CCccccccccccccccccccccccccccchHHHHHHH-HHhhhccCCceeccCCh---------HHHHHHHHHHh-cCHH Confidence 2111 111111 11111234555544432 22334444443 4788 Q ss_pred hhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCcccc-ccc-----cccchheeecCccc Q lcl|NC_019530. 143 NTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGI-EAI-----ESVTLKYYYNPNYN 216 (560) Q Consensus 143 ~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~-~~~-----~~~~e~~~Y~p~~~ 216 (560) ....++.+..++.|+-|.+.-+|. +|-..+..+||+.+-+|.. ....+.+ .++ +.....-+|.+..- T Consensus 124 ~~~~~~~~~~~~~G~~~~~v~~d~----~~~~~i~~~~p~~~~~v~~---~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~ 196 (468) T protein:vir:96 124 DKLVDILTAASNKGVEWIQPYVDE----QGEFKTFRVPAEQAIPIWT---NKERDELKAFIRLYELDGGERVEYWTANDV 196 (468) T ss_pred HHHHHHHHHHhhcCeEEEEEEEcC----CCceEEEEEcccceEEEEc---CCCCCceEEEEEEEEecCceEEEEEeCCeE Confidence 899999999999999998766654 3557789999999877631 1111110 011 11111112222211 Q ss_pred ccc-cccccccc-----ccccccccce---ecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHH-HHHHHHHHHHh Q lcl|NC_019530. 217 RNQ-FTGQSGTS-----QNFQPSQQEL---VFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVT-TEDATVIYAIT 286 (560) Q Consensus 217 ~~~-~~g~~~~~-----~~~~~~~~~~---~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m-~EDalVIyRit 286 (560) ... ..+..... ......+..+ ..+...|-.++. .++....|-++..+.....|.+ +-+..-.-+-. T Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~----~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~ 272 (468) T protein:vir:96 197 TFYELKDGQLIPDYYQGEEHVQAHYYVGNKSMSWNRVPFIPF----KNNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEA 272 (468) T ss_pred EEEEEcCCceeecccccccccccceeeccccccCCcccEEEe----cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHh Confidence 100 00000000 0000000000 000000111110 1223334566665555555542 33334445566 Q ss_pred cCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcc Q lcl|NC_019530. 287 RAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLG 366 (560) Q Consensus 287 RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLg 366 (560) +.|-+-+.-.+... .++.+.. |.. ++ -++++= +++.+ +..|....+.. T Consensus 273 ~~p~lv~~g~~~~~-----~~~~~~~-~~~--~~---------------------~i~~~~-d~~~~--~~~l~~~~~~~ 320 (468) T protein:vir:96 273 TELIYVLKGYEGED-----LEEFMYN-LKY--YK---------------------AINVDG-DGSGG--VDTIQIDVPVQ 320 (468) T ss_pred cCceeeeecCCccc-----cchhhhh-hhc--Cc---------------------eEEecC-CCCCc--ceEEeecCChH Confidence 66654433211111 1111110 100 01 123322 22222 44444333333 Q ss_pred hH-HHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhH Q lcl|NC_019530. 367 EM-DHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKD 445 (560) Q Consensus 367 ei-~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~ee 445 (560) .. .-++-+.+.+|+..++|- +..++ +. |...+..|.........-+.+.+..|...+.++++.-+-+.|+ + -+ T Consensus 321 ~~~~~~~~l~~~I~~~s~~p~--~~~~~-~~-~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~-~-~d 394 (468) T protein:vir:96 321 SAKEYLDMLRDYVIEFGQGVD--FQQDK-FG-NSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKL-S-IK 394 (468) T ss_pred HHHHHHHHHHHHHHHHhCccc--ccccc-cc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-C-cc Confidence 33 347778888999999983 32222 11 2222333433333344556777777777777777665555564 2 23 Q ss_pred HHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCC Q lcl|NC_019530. 446 WNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYP 525 (560) Q Consensus 446 w~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~ 525 (560) |. .|.+.|...---.+...+ +++.++ | .+|.+++++. |...++ -+++.++|++|...-. T Consensus 395 ~~-----~i~i~f~~~~p~d~~e~a-------~~~~~~----g-~iS~et~i~~-l~~v~D-~~~E~~ri~~E~~~~~-- 453 (468) T protein:vir:96 395 VQ-----DVEITFNFNVMVNELEQS-------QIGVNS----Q-YLSKETVVTN-HPWVDD-PVAEMERIDQEELALP-- 453 (468) T ss_pred cc-----eeeEEecCCCCcCHHHHH-------HHHHhc----C-CCchHHHHHh-CCCCCC-HHHHHHHHHHHHHHHH-- Confidence 32 356666443322232222 233332 3 6899999976 555432 4455666666653221 Q ss_pred CCCccccccccccccCCCC Q lcl|NC_019530. 526 KVQADETGNYSGSDVSPLK 544 (560) Q Consensus 526 ~p~~~e~g~~~~~~~~~~~ 544 (560) + .+-+.+++.+..+- T Consensus 454 ~----~~~~~~~~~~~~~~ 468 (468) T protein:vir:96 454 S----IEEGLNGKENNEPT 468 (468) T ss_pred H----HhhccCCCCCCCCC Confidence 1 11111111111111 No 98 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=97.13 E-value=0.00016 Score=41.42 Aligned_cols=374 Identities=13% Similarity=0.123 Sum_probs=165.2 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) |+.++.+ .+.|.+.+ +...+. +++......+.....++.- +.-.++|-|.+||. T Consensus 1 Mg~~~~~-----~~~k~~~~------~~~~~~-~~~~~~~~~~~~~~~~v~~--------------~~~l~~~~v~~~i~ 54 (383) T protein:vir:10 1 MGLLTPK-----NFSKRNAK------NMVYPS-NPAFFTTTVGGMQLSYVSA--------------LSALQNTNVYSVIN 54 (383) T ss_pred CCccccc-----cccccccc------cccccc-chhhhhhhccCccccccch--------------hHhhcchHHHHHHH Confidence 2222211 11111111 111111 1111111111112222111 12345788999997 Q ss_pred HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchh----hHHHHHHhhhhhhhhhhheecCCCCc Q lcl|NC_019530. 95 IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNT----AYQKIRKWYVDGRQAYHVIVDPTNKK 170 (560) Q Consensus 95 eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~----~~~~~R~wYvDGr~~~hkiiD~~~~k 170 (560) -|.+.+.-. | +.+.+.. . ..+++-=+=... -+.+++.+..+|.-|..++-| T Consensus 55 ~ia~~ia~~-----~--~~~~~~~-----~--------~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~----- 109 (383) T protein:vir:10 55 RIASDVSSA-----H--FKTENTA-----T--------LNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ----- 109 (383) T ss_pred HHHHhhccC-----c--eeecccc-----h--------hhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC----- Confidence 777655433 2 2333321 0 112221111122 344556666788877755432 Q ss_pred cCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeec- Q lcl|NC_019530. 171 GGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDS- 249 (560) Q Consensus 171 ~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hs- 249 (560) ..++..++|.++.+++ +..+ .++.|.. ...+..+.++.+.|.|..- T Consensus 110 --~~~~~p~~~~~v~~~~-----~~~~---------~~~~~~~-----------------~~~~~~~~~~~~evih~r~~ 156 (383) T protein:vir:10 110 --NLEHIPNSDVQINYLP-----GNMG---------IVYTVLE-----------------SNDRPKMVLRQDQMLHFRLM 156 (383) T ss_pred --ceeEeecCcceEEEEE-----cCCc---------eEEEEEE-----------------cCCceEEEEcccceEEeccC Confidence 2578888887766532 1111 1122210 0112356688888877641 Q ss_pred ccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCc Q lcl|NC_019530. 250 GEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGK 329 (560) Q Consensus 250 Gl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGe 329 (560) ..-..+..+.+|.|..|++++.....++....=+----+--+-+..++-+ +-..++.+=+++.++++..- ...|. T Consensus 157 ~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~-~~~~e~~~~~~~~~~~~~~~----~n~~~ 231 (383) T protein:vir:10 157 PDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNY-LSDGKDLESAREEFEKANTG----DNSGR 231 (383) T ss_pred CCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCC-CCCHHHHHHHHHHHHHHhCc----cccCC Confidence 11112233567999999999999988888765443333445556666643 33344444444444444221 11222 Q ss_pred ccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHH---H-HHHHHHHHHhcCCCccccCCCCcccccccchhhh Q lcl|NC_019530. 330 ITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDH---V-NYFREKLYDALMIPKSRLQEEGSINIGGSNLAEI 405 (560) Q Consensus 330 v~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~D---V-~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eI 405 (560) .+ .++ .|.+++.|.- +..+++. . ++-.+.+.++++||.+.|+...+.+...+++.++ T Consensus 232 ------~~-vl~----------~g~~~~~l~~--~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~eq~ 292 (383) T protein:vir:10 232 ------LM-VLP----------DGFDYTQLEM--KTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQI 292 (383) T ss_pred ------cc-ccC----------CCceEEecCC--ChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHHHH Confidence 22 222 3677877753 3334433 3 3446889999999999997544333344433322 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019530. 406 TQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEP 485 (560) Q Consensus 406 tRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p 485 (560) ..-|.+ .|+--+. .+.+.|...|. + +.+.|+ .+ .+... -+..|++.++.+-. T Consensus 293 ---~~~~~~---~l~P~~~-~ie~~l~~~l~-----~---------~~~~f~--~~----~l~~~-d~~~~~~~~~~~~~ 344 (383) T protein:vir:10 293 ---KATYLA---NLNSYVN-PIVDELRLKMN-----A---------PDLELD--IK----DMLDV-DDSILINQVSNLAK 344 (383) T ss_pred ---HHHHHH---HHHHHHH-HHHHHHHHhhC-----C---------ceEEee--ch----hhhcc-CHHHHHHHHHHHHh Confidence 122433 3443222 23333443332 2 123332 22 22111 12234443332221 Q ss_pred hhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccccCCccc Q lcl|NC_019530. 486 FVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDD 558 (560) Q Consensus 486 ~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~ 558 (560) +-++|.+-+++ +|.+.. ++. . +.. ...+..-|.-||.+- T Consensus 345 --~G~~t~nE~R~-~lg~~p------------------~~~--~---------d~~--~~~~~~~~~~gGd~e 383 (383) T protein:vir:10 345 --SGVLGAEQAQF-ILTRSG------------------FLP--D---------NLP--EFKPLTNETKGGDDK 383 (383) T ss_pred --CCCcCHHHHHH-HhCCCc------------------ccC--C---------ccc--ccCCCcccCCCCCCC Confidence 12455555542 233322 111 1 111 122233455566555 No 99 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=97.11 E-value=0.00017 Score=41.27 Aligned_cols=439 Identities=11% Similarity=0.027 Sum_probs=171.7 Q ss_pred hhhhcccccchhhhccccccccCCCCCCCCc--eeEec--cccceeccccccc----hhhHHHHHH--------HHHHHh Q lcl|NC_019530. 21 KVDTDKLAQKQDELLLTKATVVALDDAQDGS--IILQG--GANTYNYVGVESE----LLSVKTVVE--------EYQSMA 84 (560) Q Consensus 21 ~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~--~~i~~--~~~~~~y~~~e~~----~~~~~~LI~--------~YR~ma 84 (560) -++-.++ +.++- .-|. ..+.. ++.+ .|...+-. ..-...+|+ +|+.+. T Consensus 1 ~~~~~~~------------~~~~~---~~~~~~~~~~~~~n~~~-~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~ 64 (511) T protein:vir:93 1 MLKVNEF------------ETDTD---LRGNINYLFNDEANVVY-TYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLS 64 (511) T ss_pred Cccccch------------hhhhh---hhhhhhhhhhhhhCCcc-cccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHH Confidence 1111111 00000 0010 01111 1111 11111111 111222333 455555 Q ss_pred hccchhHHHH----------------------HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_019530. 85 QQPEIRKAVD----------------------IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFD 142 (560) Q Consensus 85 ~~PEvd~AI~----------------------eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~ 142 (560) .+.+-...|- -||+ ..+.=--..||++..++. ...+.+..+++..+|+ T Consensus 65 ~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~-~~~~yl~g~p~~~~~~d~---------~~~~~l~~~~~~n~~~ 134 (511) T protein:vir:93 65 DYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISD-FINGYFLGNPIQYQDDDK---------DVLEVIEAFNDLNDVE 134 (511) T ss_pred HHhcccCccccccCcCcccccCcceeecchHHHHHH-HHhhhhcccCeeeccCCh---------HHHHHHHHHHhhcCHh Confidence 4444333221 1111 111112234555543332 3456677788888999 Q ss_pred hhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCC-cccccccc---------ccchh-eee Q lcl|NC_019530. 143 NTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEAIE---------SVTLK-YYY 211 (560) Q Consensus 143 ~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~~~---------~~~e~-~~Y 211 (560) ....++.+..++.|+-|.+.-+|. +|-..+..+||+.+-+|..-....+ ...+.... ....+ -+| T Consensus 135 ~~~~~~~~~~~~~G~ay~~vy~de----~~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~iy 210 (511) T protein:vir:93 135 SHNRSLGLDLSIYGKAYELMIRNQ----DDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLF 210 (511) T ss_pred HHHHHHHHHHHhcCeeEEEEEeCC----CCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEE Confidence 999999999999999998766653 4567799999999987742111111 11111110 00011 145 Q ss_pred cCccccccccccccccccc-----cccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHH-HHHHHHHHHH Q lcl|NC_019530. 212 NPNYNRNQFTGQSGTSQNF-----QPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVT-TEDATVIYAI 285 (560) Q Consensus 212 ~p~~~~~~~~g~~~~~~~~-----~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m-~EDalVIyRi 285 (560) ++..-+............. ..+..--++|. +.|. ++....|-++..+.....+.. +=+....-+- T Consensus 211 t~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv--v~~~-------nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~ 281 (511) T protein:vir:93 211 TSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPI--TEFS-------NNERRKGDYEKVITLIDLYDNAESDTANYMSD 281 (511) T ss_pred eCCcEEEEEecCCCccccccccccccccCCCccce--EEec-------CCCCCCCchhhHHHHHHHHHHHHHHHHHHHHH Confidence 5543332222111100000 00000001221 1222 222334556665555554432 2233333344 Q ss_pred hcCccceEEEc---cCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchh-hHhhhc--ccccCCCCccceeec Q lcl|NC_019530. 286 TRAPEKRAFYL---DVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMG-IAEDYW--LPRREGQNATEIATV 359 (560) Q Consensus 286 tRAPeRRvFyI---DvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~ms-mlEDyw--LpRReGgrgTEIsTL 359 (560) ++.|-+-+.=. |.+.+++ +....+. +....| ...-....|..+..| T Consensus 282 ~~~~~lv~~G~~~~~~~~~~~----------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 333 (511) T protein:vir:93 282 LNDAMLLIKGNLNLDPVEVRK----------------------------QKEANVLFLEPTVYADSEGRETEGSVDGGYI 333 (511) T ss_pred hhCcceeeecCcccCchhhcc----------------------------cccccceecccccccccccccCCCCcceeEE Confidence 55554443321 1111110 0000110 001111 111122233445555 Q ss_pred CCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh- Q lcl|NC_019530. 360 GGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLIL- 437 (560) Q Consensus 360 pGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiL- 437 (560) -...+... -.-+.-..+.+|+-.++|---.+.-+ .|+ .+..|..-...-..-+.+.++.|..-+...++.-+-+ T Consensus 334 ~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~-~n~---Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l 409 (511) T protein:vir:93 334 YKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQ---SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETIL 409 (511) T ss_pred eecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44333322 23344456777888888853222111 122 2233433333334445555555555555554442222 Q ss_pred --ccCC-ChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCH---HHHHHH Q lcl|NC_019530. 438 --KGVT-DEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSD---EEVKEQ 511 (560) Q Consensus 438 --kgii-t~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tD---eEI~e~ 511 (560) ++.. .+.+|. .+.+.|..----. +.+.++++..+ .| .+|.+++++. |..++ +||+.+ T Consensus 410 ~~~~~~~~~~d~~-----~i~~~f~~~~p~n-------~~e~~~~~~kl---~g-~iS~et~~~~-l~~v~d~~~E~~ri 472 (511) T protein:vir:93 410 KNTWSIDANKDFN-----TVRYVYNRNLPKS-------LIEELKAYIDS---GG-KISQTTLMSL-FSFFQDPELEVKKI 472 (511) T ss_pred HhccCcccccccc-----cceEEeCCCCCCC-------HHHHHHHHHHH---hc-cCchHHHHHh-CCCCCCHHHHHHHH Confidence 2222 233442 2666674321111 22344555555 35 4999999976 55543 455444 Q ss_pred HHHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 512 QTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 512 ~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) +++-+++.+. ......++..+.... -+.-+|.+.+. T Consensus 473 ~~E~~~~~~~-----~~~~~~~~~~~~~~~--------~~~~~~~~~~~ 508 (511) T protein:vir:93 473 EEDEKESIKK-----AQKGIYKDPRDINDD--------EQDDDTKDTVD 508 (511) T ss_pred HHHHHHHHHH-----HhhhcccCCCCCCCC--------CCCCccccccc Confidence 4443333211 111111111100000 01122222222 No 100 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=97.10 E-value=0.00017 Score=41.25 Aligned_cols=420 Identities=11% Similarity=0.123 Sum_probs=183.0 Q ss_pred hccccccccCCCCCCCCceeEeccccceeccccccchhhH-HHHHHHHHHHhhccchhHHHHHhhcce------------ Q lcl|NC_019530. 34 LLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSV-KTVVEEYQSMAQQPEIRKAVDIIVNDV------------ 100 (560) Q Consensus 34 ~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~-~~LI~~YR~ma~~PEvd~AI~eIvnea------------ 100 (560) -.-.+-..+.-|++..-.. ..-..+.+. ..-+.+|+.+..+.+=+..| .+.- T Consensus 1 ~~~~~~~~~~~p~d~~~~~------------~~l~~~i~~~~~~~~r~~~~~~yy~g~~~i---~~~~~~~~~~~~~ki~ 65 (453) T protein:vir:39 1 MKYKPPKLMTFPKDEPITN------------EVVTKFMEKHRLEVARYEYLKNMYRGIMAI---DAEPTKDLWKPDNRLT 65 (453) T ss_pred CeecCCcceEcCCCCCCCH------------HHHHHHHHHHHHHHHHHHHHHHHhhccCch---hcCCCccccCccceee Confidence 0001111222222221000 000111111 11123444444444332211 1110 Q ss_pred ----------eeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCc Q lcl|NC_019530. 101 ----------VTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKK 170 (560) Q Consensus 101 ----------iv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k 170 (560) .+.---..||.+..++ +...+.+..+++-.+|+....++.+..++.|+-|.+.-.|. T Consensus 66 ~n~~~~ivd~~~~~l~g~~~~~~~~d---------~~~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~---- 132 (453) T protein:vir:39 66 VNFTKYIVDTFTGYFNGIPVKKSHSD---------KETLSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNE---- 132 (453) T ss_pred cchHHHHHHHHhhhhcccCceeccCC---------hHHHHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecC---- Confidence 0000112333333222 23456788888888999999999999999999888665543 Q ss_pred cCeeEeEEeCccccchhhhhcccC-Ccccccccc--c-cchheeecCccccccccccccccccccc-cccceecchhcee Q lcl|NC_019530. 171 GGIKKLVMLDSRCIRPVYIVEKAM-REGGIEAIE--S-VTLKYYYNPNYNRNQFTGQSGTSQNFQP-SQQELVFDDESIV 245 (560) Q Consensus 171 ~GI~eLr~LDPr~i~~vr~i~~~~-~~g~~~~~~--~-~~e~~~Y~p~~~~~~~~g~~~~~~~~~~-~~~~~~i~~daI~ 245 (560) +|-..++.++|+.+-++..=.... ....+.... . ..-.-+|.|..-+. +.+.++.-....+ +..--.+|. |. T Consensus 133 ~g~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~~yt~~~i~~-~~~~~~~~~~~~~~~~~~g~vPv--v~ 209 (453) T protein:vir:39 133 ETQTNVIYNTPENMFMVYDDTIKQEPLFAVRYGYDDDYKLYGEVYTKETTYA-LNGTMGFYNMTEQAPNPFDDLPV--VE 209 (453) T ss_pred CCceEEEEEcccceEEEecCCCCCeEEEEEEEEEeCCeEEEEEEEeCCeEEE-EEecCCceeeecccccCCCceeE--EE Confidence 466889999999998874211110 001111111 0 11112244433222 2221111111000 000001221 11 Q ss_pred eeecccccCCCCcchhhhHHHHHHHHHH-HHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhh-hheee Q lcl|NC_019530. 246 YIDSGEEPLANGIVPGLLNPAIRPLNNL-VTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFK-NRNAY 323 (560) Q Consensus 246 y~hsGl~~~~~~~i~syL~~AiK~~NqL-~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~k-nk~vY 323 (560) |. ++....|-++..+....-+ +++-+....-+..+.|-+-+.=. .++... +.+.+ +++. T Consensus 210 ~~-------n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~---~~~~~~--------~~~~~~~~~~- 270 (453) T protein:vir:39 210 FY-------FNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGA---AVEEED--------LKNIRSNRVI- 270 (453) T ss_pred ec-------CCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecC---CCCchh--------hhhhhhccee- Confidence 21 2223334454444433332 34556666667777786655432 222211 11111 1111 Q ss_pred cccCCcccCcccchhhHhhhccccc-CCCCccceeecCCCCCcchHH-HHHHHHHHHHHhcCCCccccCCCCcccccccc Q lcl|NC_019530. 324 DRTTGKITGNAHLMGIAEDYWLPRR-EGQNATEIATVGGGNQLGEMD-HVNYFREKLYDALMIPKSRLQEEGSINIGGSN 401 (560) Q Consensus 324 d~~TGev~d~~~~msmlEDywLpRR-eGgrgTEIsTLpGg~nLgei~-DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~ 401 (560) .++-. ..+.+.++.+|-...+.+.+. .++-+.+.+|....+|- +..+ + +|... T Consensus 271 --------------------~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~-~--~gn~S 325 (453) T protein:vir:39 271 --------------------NYYGESSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVAN--ISDE-S--FGSSS 325 (453) T ss_pred --------------------eecCCCCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcc--cccc-c--ccCCh Confidence 11111 112344577776555555444 35667777888888984 3222 2 23333 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-ChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 402 LAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVT-DEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASL 480 (560) Q Consensus 402 ~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgii-t~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l 480 (560) +..|.--+.....-+.+.|+.|..-+...|+.-+-+-+.. ...+|. .|.+.|... --.- +.+.++++ T Consensus 326 g~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~-----~i~v~f~~~-~p~~------~~~~a~~~ 393 (453) T protein:vir:39 326 GVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVSNKEAWK-----DIEYTFTRN-EPKD------IKEQAETA 393 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccc-----cceEEeCCC-CCcC------HHHHHHHH Confidence 3445444444556677777777777777776543333332 233342 366777433 2211 22344555 Q ss_pred HHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcc Q lcl|NC_019530. 481 NTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPE 548 (560) Q Consensus 481 ~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~ 548 (560) ..+ +| .+|.+++++. |...++ .+++-++|++|........ .+.+.+..+..+..+ +-.-| T Consensus 394 ~kl---~g-~is~et~l~~-l~~v~D-~~~E~~ri~~E~~~~~~~~-~~~~~~~~~~~~~~~-~~~~e 453 (453) T protein:vir:39 394 NIL---MG-ITSQETALSV-ISVIPD-VQAEMEKIKKEEASTAIFD-KDKQPSEKGTDTVVP-ETNEE 453 (453) T ss_pred HHH---hc-cCChHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHHH-HhccCCCCCCCCCCC-CcCCC Confidence 555 35 4999999976 566542 3344455666543322111 111111111111111 11111 No 101 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=97.10 E-value=0.00017 Score=41.25 Aligned_cols=402 Identities=13% Similarity=0.096 Sum_probs=170.9 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhh---ccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHH Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDEL---LLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVV 77 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~---~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI 77 (560) |.| +.... +...-+++.+.... .+.+.+.... .+........ +.+. .. T Consensus 1 ~~~-----------~~~~~-~~~~m~~F~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~----~~~~---------~~ 51 (413) T protein:vir:96 1 MPG-----------VSEIR-KDKNLKFFNNKRSPTEESKAKDEIPKA----PQVVMTLPNF----FKEL---------IS 51 (413) T ss_pred CCc-----------cchhh-hhhcCCccccCCCcchhhhhhcccccc----ccccccchhh----Hhhh---------cc Confidence 322 11111 22222332221111 1111111111 1110000000 0011 11 Q ss_pred HHHHHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHH-hCCchhh----HHHHHHh Q lcl|NC_019530. 78 EEYQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHL-MDFDNTA----YQKIRKW 152 (560) Q Consensus 78 ~~YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l-l~f~~~~----~~~~R~w 152 (560) ..|..++.+|-|.+||+-|.+.+.-+ |+.+--...++ ++.+.....++++. =+=...+ +.++..+ T Consensus 52 ~~~~~~~~~~~v~~cI~~ia~~ia~~-----~~~~~~~~~~~-----~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~l 121 (413) T protein:vir:96 52 DGYTKLSDSPEVRMAVDCIADLVSNM-----TIQLMQNGETG-----DKRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSM 121 (413) T ss_pred chhHHHhhchHHHHHHHHHHHhhccC-----ceEEEEecCCC-----ccccccHHHHHHHhccccCCCHHHHHHHHHHHH Confidence 22445788999999998887776432 33332222111 11122222222221 0111223 4445566 Q ss_pred hhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccc Q lcl|NC_019530. 153 YVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQP 232 (560) Q Consensus 153 YvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~ 232 (560) ...|.-|.-++.|.+ ..-+.+|.+|+|.+|...+ +. ++ -+|.|... + T Consensus 122 ll~Gn~~~~i~r~~~--g~~~~~L~~l~~~~v~~~~-----~~-~~--------~~y~~~~~-------~---------- 168 (413) T protein:vir:96 122 LLEGNGNAVVKPQVS--GDKIIGLTPISPYKVTFNV-----SD-DD--------LDYSITFD-------N---------- 168 (413) T ss_pred hhcCCeEEEEEEcCC--CCceEEEEEecCceeEEEE-----cC-Ce--------EEEEEeec-------C---------- Confidence 677877776554432 1237899999998887632 11 11 12233211 1 Q ss_pred cccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHH Q lcl|NC_019530. 233 SQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTM 312 (560) Q Consensus 233 ~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~ 312 (560) ..++.+.|.|.-..-...+.-...|.|..|.+++.....+++...=+----+.-+-+..++ ++|.+..+++..+. T Consensus 169 ----~~~~~~evih~k~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~l~~e~~~~~~~~ 243 (413) T protein:vir:96 169 ----KEYDPSTLLHFVLNPSIERPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVD-SDSDELSDEEGREN 243 (413) T ss_pred ----cEEchhhEEEEeccCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeC-CCCCHHHHHHHHHH Confidence 1244566665532111112235789999999999998888887665555556666777777 56777766655554 Q ss_pred HHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCC Q lcl|NC_019530. 313 MMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQEE 392 (560) Q Consensus 313 im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~ 392 (560) +...|.. .++.-+.+ +++ -+|...+++..+.- ..+.-++-.+|-.+.+.++++||...|+.. T Consensus 244 ~~~~~~g----------~~n~g~~~-vl~------~~~~~~~~~~~~~~-~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~ 305 (413) T protein:vir:96 244 FEEMYLK----------RKEAGKPW-IIP------EGMVNVQQIKPLTL-NDLAINDAVTLDKKTVAGIFGVPAFLLGVG 305 (413) T ss_pred HHHHhcC----------ccccCcee-eec------CCcccccccccCCh-hHHHHHHHHHHHHHHHHHHhCCCHHHcCCC Confidence 4433322 11111122 121 11111123333321 233334555677899999999999999522 Q ss_pred CcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHH Q lcl|NC_019530. 393 GSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAI 472 (560) Q Consensus 393 ~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei 472 (560) ++ ++-+ -..|.++ .|+- +...+.+.|-..|+-+| ..+.|. +.++.... T Consensus 306 ~~--------~~~~--~~~~~~~--~l~P-~~~~ie~~ln~~ll~~~--------------~~~~fd----~~~ll~~d- 353 (413) T protein:vir:96 306 TY--------NKDE--FNNFINT--KIMS-IAQVIQQTYNKLIVEED--------------MYFSLN----PRSLYNYS- 353 (413) T ss_pred cc--------hHHH--HHHHHHH--HHHH-HHHHHHHHHHHhhCCCC--------------cEEEEe----chhhhccC- Confidence 11 1111 1124333 2332 22334555555543221 223332 33333222 Q ss_pred HHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccc---cccccccccCCCCCCccc Q lcl|NC_019530. 473 LNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADE---TGNYSGSDVSPLKFKPET 549 (560) Q Consensus 473 ~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e---~g~~~~~~~~~~~~~p~~ 549 (560) +.+|++.++.+-. .-++|.+-++. .+++. |-|..++ +++..+-++. .+. T Consensus 354 ~~~~~~~~~~~~~--~G~~t~NE~R~-~~g~~--------------------p~~~gd~~~~~~n~~~~~~~-----~~~ 405 (413) T protein:vir:96 354 LTEMVSAGAQMTQ--LNALRRNEFRN-WVGMP--------------------PDAEMDDLLVLENYLQQKDL-----VNQ 405 (413) T ss_pred HHHHHHHHHHHHh--CCCcCHHHHHH-HhCCC--------------------CCCCcceeeecccccchhhc-----ccc Confidence 3355555544322 12555555542 23332 2222111 1111111110 000 Q ss_pred ccccCCcccC Q lcl|NC_019530. 550 IPFSGSTDDS 559 (560) Q Consensus 550 ~~~~~~~~~~ 559 (560) -+ .++ +++ T Consensus 406 ~~-~~~-~dt 413 (413) T protein:vir:96 406 KK-LIQ-DET 413 (413) T ss_pred cC-CCC-CCC Confidence 00 011 111 No 102 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=97.07 E-value=0.00019 Score=41.04 Aligned_cols=409 Identities=13% Similarity=0.124 Sum_probs=181.7 Q ss_pred hhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCC-CCCceeEecccc--ceecc---ccccchhhHHHHHHHHHH Q lcl|NC_019530. 9 FGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDA-QDGSIILQGGAN--TYNYV---GVESELLSVKTVVEEYQS 82 (560) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~-~dG~~~i~~~~~--~~~y~---~~e~~~~~~~~LI~~YR~ 82 (560) -|||.-+...++....-++. -| ... ++..|.. ..|. .+.+... -..|. ...+.. + .... T Consensus 1 Mgl~d~~r~~~~~~~~~~~~---~~-~~~---~~~~~~~~~~g~-~~~~~~~~~~~~~~~~~~~~g~~------v-~~~~ 65 (431) T protein:vir:10 1 MGLFDFIRREKQPEAQARPH---VE-PSF---QASTPTTSIPGE-TFEGLDDPRLKEYIRRGELNGGT------G-RETR 65 (431) T ss_pred CcchhhhhcCcccccccccc---cc-ccc---cccccccccccc-ccccccchHHHHhhccCccCcce------e-chhh Confidence 23333222222111111110 00 111 1111111 1121 1111000 00000 000000 0 1134 Q ss_pred HhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCC----chh----hHHHHHHhhh Q lcl|NC_019530. 83 MAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDF----DNT----AYQKIRKWYV 154 (560) Q Consensus 83 ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f----~~~----~~~~~R~wYv 154 (560) .+++|-|..||.-|-+.+--. |+.|-=. .+. ++.+.+ ..+..+|+- ... -+.++..+++ T Consensus 66 al~~~~V~~ci~~Ia~~iA~l-----p~~v~~~-~~~-----~~~~~~--~~~~~lL~~~PN~~~t~~~f~~~l~~~lll 132 (431) T protein:vir:10 66 ALRNMAVLRCVTLISGTIGML-----PMNLISS-DDS-----KQVLTD--DPAHRLLKYKPNDWQTPMEFKSLMQLRALL 132 (431) T ss_pred hhccHHHHHHHHHHHHhhccC-----ceEEEEe-cCc-----eeeecc--chHHHHHhhccCCCCCHHHHHHHHHHHHhh Confidence 467899999997776655422 3333111 110 011111 122333321 112 3445566778 Q ss_pred hhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccc Q lcl|NC_019530. 155 DGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQ 234 (560) Q Consensus 155 DGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~ 234 (560) +|.-|..++.|. .++.+|.+|+|.++..++ .. .+. -+|.|.. ..+ T Consensus 133 ~Gna~~~i~r~~----g~~~~L~pl~~~~v~~~~---~~--~~~--------~~y~~~~------------------~~g 177 (431) T protein:vir:10 133 DGESMARIVWSG----NRPIRLIPMDRGSAKGRL---TS--TWQ--------IVYDYTT------------------PTG 177 (431) T ss_pred cCCeEEEEEEcC----CceEEEEEEcCceeEEEE---cC--CCe--------EEEEEEe------------------CCc Confidence 898887766652 467899999998887642 11 121 1232321 122 Q ss_pred cceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHH Q lcl|NC_019530. 235 QELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMM 314 (560) Q Consensus 235 ~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im 314 (560) ....++.+.|.|... ...++-...|.|+.|.+++.....+++...=+----|--+-|.-.+ ++|.+.++++.-+.+. T Consensus 178 ~~~~~~~~dViHir~--~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~ls~e~~~~~~~~~~ 254 (431) T protein:vir:10 178 DKIELPAREVFHLRD--LSIDGVSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIEVP-KELSDNAYGRMKASVQ 254 (431) T ss_pred eEEEEchhhEEEecC--cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecC-CCCCHHHHHHHHHHHH Confidence 345688888877752 2445555678999999999888888887765555555555666665 4666665554444444 Q ss_pred HhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCccccCC Q lcl|NC_019530. 315 GKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE---MDHVNYFREKLYDALMIPKSRLQE 391 (560) Q Consensus 315 ~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~ 391 (560) ..|..- ...|. .+ .++ .|.+++.|. .+..+ ++--+|-...+.++++||..-|.. T Consensus 255 ~~~~g~----~n~g~------~~-vl~----------~g~~~~~l~--~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~ 311 (431) T protein:vir:10 255 ENHTGS----ENAGS------WM-LLE----------EGATAKQFS--NTAASAQQIENRNHQIEEVARMYGVPRPLLMM 311 (431) T ss_pred HHhcCc----cccCC------ce-ecC----------CCceEEEcc--CChhHHHHHHHHHHhHHHHHHHhCCCHHHhCC Confidence 444320 11122 22 121 245555553 22223 333345567899999999999976 Q ss_pred CCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHH Q lcl|NC_019530. 392 EGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENA 471 (560) Q Consensus 392 ~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~E 471 (560) ..+- ..++..+..+. |.+++ |+--+. .+.+.|- +.+++++++. ...+.|. +.+|...+ T Consensus 312 ~~~~--t~sn~eq~~~~---f~~~t--L~P~~~-~ie~~ln-----~~Ll~~~~~~-----~~~~~fd----~~~llr~d 369 (431) T protein:vir:10 312 DDTS--WGSGIEQLAIF---FIQYG--LSHWFV-SWEQAAA-----RAFLPEKMLG-----QRQFKFN----EGALLRGT 369 (431) T ss_pred CCCC--ccccHHHHHHH---HHHHH--HHHHHH-HHHHHHH-----hhccChhhcC-----CceEEEe----chhhhccC Confidence 5431 23444444333 55442 332211 2233333 3345655542 1334443 33333222 Q ss_pred HHHHHHHHHHHhhhhh--cchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccc---ccccccccCCCCCC Q lcl|NC_019530. 472 ILNDRLASLNTVEPFV--GSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADET---GNYSGSDVSPLKFK 546 (560) Q Consensus 472 i~~~Rl~~l~~~~p~v--Gky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~---g~~~~~~~~~~~~~ 546 (560) +..|.+.++.+=.-+ .-++|.+-+++ ++.|.. ++.|..|+- .+..+.+.+. -+ T Consensus 370 -~~~r~~~~~~~~~~G~~~g~lT~NE~R~-~~gl~p------------------~~~~~gD~~~~p~n~~~~~~~~--~~ 427 (431) T protein:vir:10 370 -LNDQAAFFSKALGAGGQSPWMKQNEVRE-MLDLPR------------------ADDPVADQLRNPMTQKQKGSGD--EP 427 (431) T ss_pred -HHHHHHHHHHHHhcccccCccCHHHHHH-HhCCCC------------------CCCccccceecccccccCCCCC--CC Confidence 445666655543221 23566666663 344311 122222211 1111000000 01 Q ss_pred cccccccCCc Q lcl|NC_019530. 547 PETIPFSGST 556 (560) Q Consensus 547 p~~~~~~~~~ 556 (560) |++ | T Consensus 428 ----p~~--~ 431 (431) T protein:vir:10 428 ----PAT--T 431 (431) T ss_pred ----CCC--C Confidence 221 2 No 103 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=97.07 E-value=0.00019 Score=41.04 Aligned_cols=395 Identities=14% Similarity=0.103 Sum_probs=169.9 Q ss_pred cccchh-----hHHHHHHHHHHHhhccchhHHHH---------------------------HhhcceeeecCCCceEEEe Q lcl|NC_019530. 66 VESELL-----SVKTVVEEYQSMAQQPEIRKAVD---------------------------IIVNDVVTCEEDETPVTVN 113 (560) Q Consensus 66 ~e~~~~-----~~~~LI~~YR~ma~~PEvd~AI~---------------------------eIvneaiv~d~~~~~V~l~ 113 (560) +..... ....-+.+|+.+..+.+-+..|. .||+-. +.=--..||++. T Consensus 1 l~~~~i~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~-~~yl~G~p~~~~ 79 (451) T protein:vir:10 1 MELEKIRAIISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEK-ASYMFTYPVLFD 79 (451) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhh-hhheecccceee Confidence 211111 11123344555555555443221 111111 111123444444 Q ss_pred eecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCC----CccCeeEeEEeCccccchhhh Q lcl|NC_019530. 114 LDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTN----KKGGIKKLVMLDSRCIRPVYI 189 (560) Q Consensus 114 Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~----~k~GI~eLr~LDPr~i~~vr~ 189 (560) .++.+ ...+..+.+++ -+|+....++.+.+.+.|+-|.+.-+|.+. +..|-..+..++|+.+-+|.. T Consensus 80 ~~~~~--------~~~~~~~~~~~-n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~vyd 150 (451) T protein:vir:10 80 IDNNK--------ELNEKVTDVLG-NEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIPIYR 150 (451) T ss_pred cCCcH--------HHHHHHHHHhc-cCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcccceEEEEc Confidence 33321 12222333333 579999999999999999999987776542 234667788999998866532 Q ss_pred hcccCC-ccccccc------------cccchheeecCcccccccc--ccccccc--cccccccceecchhceeeeecccc Q lcl|NC_019530. 190 VEKAMR-EGGIEAI------------ESVTLKYYYNPNYNRNQFT--GQSGTSQ--NFQPSQQELVFDDESIVYIDSGEE 252 (560) Q Consensus 190 i~~~~~-~g~~~~~------------~~~~e~~~Y~p~~~~~~~~--g~~~~~~--~~~~~~~~~~i~~daI~y~hsGl~ 252 (560) =....+ .-.+... +.+.-.-+|.+..-+.... +...... ...-+-.--++| |++.- T Consensus 151 d~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP---vv~~~---- 223 (451) T protein:vir:10 151 NGIERELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQHRFNSVP---FVEFS---- 223 (451) T ss_pred CCCCCceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccccCCCCeee---EEEec---- Confidence 100000 0001000 0011111233322111110 1000000 000000000111 12111 Q ss_pred cCCCCcchhhhHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhh-heeecccCCcc Q lcl|NC_019530. 253 PLANGIVPGLLNPAIRPLNNLVT-TEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKN-RNAYDRTTGKI 330 (560) Q Consensus 253 ~~~~~~i~syL~~AiK~~NqL~m-~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~kn-k~vYd~~TGev 330 (560) ++....|-++..+.....+.+ +=+..-.-+-+.-|-+-+.-.+... . .+ .+.+.+. +++- T Consensus 224 --nn~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~-~----~~----~~~~~~~~~~i~------- 285 (451) T protein:vir:10 224 --NNIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGED-T----SE----FLKELKRYKTIK------- 285 (451) T ss_pred --cCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCccc-c----hh----hHHHHhhCCeEE------- Confidence 233345667666665555542 3333334455666655443322211 1 11 1222222 2110 Q ss_pred cCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHH Q lcl|NC_019530. 331 TGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDH-VNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEE 409 (560) Q Consensus 331 ~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDE 409 (560) ++-.+.+.|-.+..|....+...... +.-+.+.+|+..++|- +..+ + +|...+..|.--. T Consensus 286 --------------~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~-~--~gn~Sg~Alk~~~ 346 (451) T protein:vir:10 286 --------------TETDSEGDSGGLKTMQIEIPTEARKIILEILKKQIYESGQGLQ--QDTE-N--FGNASGVALKFFY 346 (451) T ss_pred --------------ecCcCCccCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCccc--cccc-c--cccccHHHHHHHH Confidence 00001122233555555455555544 7888899999999994 2222 2 2333233444333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcc Q lcl|NC_019530. 410 LRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGS 489 (560) Q Consensus 410 lkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGk 489 (560) ...-.-+.+.++.|...+...|+.-+-+-|+ .+|. .|.+.|...---.+ .+.++++..+ +| T Consensus 347 ~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~---~d~~-----~i~i~f~~~~p~n~-------~e~~~~~~kl---~g- 407 (451) T protein:vir:10 347 RKLELKSGLLETEFRTSFDKLIKAILYFLGV---TDYK-----KIQQTYTRNMMSND-------LEDADIATKS---VG- 407 (451) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC---CCcc-----ceeEEecCCCCCCH-------HHHHHHHHHH---hc- Confidence 3344456666666666666666544433343 3453 25556643321112 2344555555 35 Q ss_pred hhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCC Q lcl|NC_019530. 490 IFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSP 542 (560) Q Consensus 490 y~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~ 542 (560) .+|.+++++. |...|. .+++.++|++|.++- ++...++.++. +. T Consensus 408 ~iS~et~~~~-~p~v~d-~~~e~~~~~ee~~~~--~~~~~~~~~~~-----~~ 451 (451) T protein:vir:10 408 IIPTKIILRH-HPWVDD-VEEAEKLYLEEKKIQ--ASKVSDDYNNF-----TE 451 (451) T ss_pred cCchHHHHHh-CCCCCC-HHHHHHHHHHHHHHH--HHHHHhhcCCC-----CC Confidence 4999999977 555442 344445554443221 11111111111 11 No 104 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=97.01 E-value=0.00021 Score=40.75 Aligned_cols=396 Identities=10% Similarity=0.098 Sum_probs=173.5 Q ss_pred hhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHHHhhccee Q lcl|NC_019530. 22 VDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVV 101 (560) Q Consensus 22 ~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneai 101 (560) .+.++++.+.-....++...... . +.... ++.....|.++. .....++|-|.+||.-|.+.+. T Consensus 1 ~~~~~~~~~~k~~~~~~~~~~~~--~--~~~~~-~~~~~~~~~~v~------------~~~a~~~~~V~~ci~~ia~~ia 63 (409) T protein:vir:96 1 MAKENIVTRIKKKLIDNWIDQSA--S--KLYDF-SPWKNKSFWGVI------------NNTLETNETIFSAITKLSNSMA 63 (409) T ss_pred CccccchhhhhhHHhhhhhcccc--c--ccccc-ccccCccccccc------------hhhHhhhHHHHHHHHHHHHhhh Confidence 33333333322222221111100 0 00000 001111111111 0123467889999987777655 Q ss_pred eecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHH-hCCchhh----HHHHHHhhhhhhhhhhheecCCCCccCeeEe Q lcl|NC_019530. 102 TCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHL-MDFDNTA----YQKIRKWYVDGRQAYHVIVDPTNKKGGIKKL 176 (560) Q Consensus 102 v~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l-l~f~~~~----~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eL 176 (560) -++ +.+- ...+. .+ .. ..++|+. =+-...+ +.++..++++|.-|..++.|.. .-+.+| T Consensus 64 ~lp-----~~~~-~~~~~-~~---~~----l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~---G~~~~L 126 (409) T protein:vir:96 64 SLP-----LKMY-EDYKV-VN---TE----VSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY---HQPSKL 126 (409) T ss_pred hCc-----eEEe-ecccc-cc---hh----HHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCC---CcEEEE Confidence 432 2221 11111 11 11 1122221 1112233 4455566778987776655432 237899 Q ss_pred EEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeecccccCCC Q lcl|NC_019530. 177 VMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLAN 256 (560) Q Consensus 177 r~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~ 256 (560) ++|+|..|..++ . .+++ ..+|.|.. ..++...++.+.|.|..- .-..+. T Consensus 127 ~~l~~~~v~v~~----~-~~~~-------~~~y~~~~------------------~~g~~~~~~~~evih~r~-~~~~~~ 175 (409) T protein:vir:96 127 FLLNPDVVEMLI----E-NQSR-------ELYYSIHA------------------ATGNKLIVHNMDMLHFKH-IVASNM 175 (409) T ss_pred EEEcCceeEEEE----e-CCCc-------EEEEEEEc------------------CCceEEEEccccEEEeCC-CCCCCc Confidence 999998887653 1 1111 11222221 112344567777766531 112233 Q ss_pred CcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccc Q lcl|NC_019530. 257 GIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHL 336 (560) Q Consensus 257 ~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~ 336 (560) -+..|.|..|...+.....++.. ..+...+.| . +...-.+.|.+.+++...+.+.+.|.| .|.+ T Consensus 176 ~~G~s~l~~~~~~i~~~~~~~~~-~~~~~~~~~-~-~i~~~~~~l~~e~~~~~~~~~~~~~~n-------~g~~------ 239 (409) T protein:vir:96 176 VQGISPIDVLKNTTDFDNAVRTF-NLTEMQKPD-S-FMLKYGSNVSTEKRQQVLEDFKQYYEE-------NGGI------ 239 (409) T ss_pred cccccHHHHHHHHHHHHHHHHHH-HHHhcCCCc-e-eEEecCCCCCHHHHHHHHHHHHHHhhc-------CCCe------ Confidence 34567788877777766666554 355555544 2 334445677777666655555444432 2322 Q ss_pred hhhHhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHH Q lcl|NC_019530. 337 MGIAEDYWLPRREGQNATEIATVGGGNQLGE---MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFS 413 (560) Q Consensus 337 msmlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~ 413 (560) + .++ -|.+++.|. .+..+ ++-..|..+.+.++++||.+.|+..++.+ .++..+..+. |. T Consensus 240 ~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~--~s~~e~~~~~---f~ 301 (409) T protein:vir:96 240 L-FQE----------PGVEIEPLP--KKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTN--FAKNEELNRF---YL 301 (409) T ss_pred e-ecC----------CCceEEEcC--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHHHH---HH Confidence 1 221 256777774 33333 33344667889999999999997654422 3333433333 55 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhH Q lcl|NC_019530. 414 KFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSI 493 (560) Q Consensus 414 Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~ 493 (560) +++ |+-.+ ..+.+.|- ..++++.++.. . ..|+|..+ ++...+ +..|++.++.+-.- -+++. T Consensus 302 ~~~--l~P~~-~~ie~~l~-----~~Ll~~~~~~~--g--~~i~fd~~----~ll~~d-~~~~~e~~~~~~~~--G~~T~ 362 (409) T protein:vir:96 302 QHT--LLPIV-KQYEEEFN-----RKLLTKTDREK--N--RYFKFNVK----SYLRAD-SATQAEVYFKAVRS--GYYTI 362 (409) T ss_pred HHH--HHHHH-HHHHHHHH-----hhcCCcccccC--c--ceEEeech----hhhccC-HHHHHHHHHHHHhC--CCCCH Confidence 553 32211 11222233 34456666532 2 33444333 333222 34555555544221 36777 Q ss_pred HHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccc--cccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 494 DYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSG--SDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 494 ~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~--~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) .-++. .|.+.+- |. ||.-. ....++....+.-...+|-|.+- T Consensus 363 NE~R~-~~g~~pi--------------------~g----gD~~~~~~n~~~~~~~~~~~~~~~gG~~n~ 406 (409) T protein:vir:96 363 NDIRE-WEDLPPV--------------------EG----GDKPLISGDLYPIDTPLELRKSLKGGDKNV 406 (409) T ss_pred HHHHH-HhCCCCC--------------------CC----cceeeecccccccccchhhcccccCCCCCc Confidence 77764 3555321 11 11110 00001000000000111111111 No 105 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=96.96 E-value=0.00024 Score=40.45 Aligned_cols=431 Identities=10% Similarity=0.064 Sum_probs=163.9 Q ss_pred hhhhhhhhhhcccccch-hhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQ-DELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAV 93 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~-~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI 93 (560) |..++ +++++. .++.... +-+....... . +.--+..-.+-+.+|+.+..+.+-...| T Consensus 1 ~~~~~------~~~~~~~~~~~~~~---~~~~~~~~~~-~------------i~~~i~~~~~~~~r~~~~~~Yy~g~~~i 58 (478) T protein:vir:10 1 MISIN------WPWDKPYHEQVVEQ---IKPKYETQEE-M------------ILRLVREHKENIDNITMGERYYNHHPDI 58 (478) T ss_pred Ccccc------ccCCchhhhHHHHH---hhhccCChHH-H------------HHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 11111 111100 0000000 0000000000 0 0000000111223344444333322111 Q ss_pred ---------------------------HHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhH Q lcl|NC_019530. 94 ---------------------------DIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAY 146 (560) Q Consensus 94 ---------------------------~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~ 146 (560) ..||+ ..+.---..||++..++.+ .. +....+++ -+|+.... T Consensus 59 ~~~~~~~~~~~~~~~~~~~~ki~~n~~k~ivd-~~~~yl~g~p~~~~~~~~~-----~~----~~l~~~~~-n~~~~~~~ 127 (478) T protein:vir:10 59 LDAPFKRDVNGDYDETKPDWRMYTNYHQNLVD-QKVAYAVANPVTFGVDNDK-----AL----KQIQHTLN-HKWDDKLV 127 (478) T ss_pred cccchhhhcccccccccccceeccchHHHHHH-HHhhhhcccCceeecCChH-----HH----HHHHHHHh-ccHHHHHH Confidence 11111 1111122345555444422 22 33333443 47899999 Q ss_pred HHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCc-ccccc--ccccchheeecCcccc-ccccc Q lcl|NC_019530. 147 QKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMRE-GGIEA--IESVTLKYYYNPNYNR-NQFTG 222 (560) Q Consensus 147 ~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~-g~~~~--~~~~~e~~~Y~p~~~~-~~~~g 222 (560) ++.+.+.+.|+-|.+.-+|.+ |-..+..+||+.+-+|..-....+. -.+.. .+.....-+|.+..-. +...+ T Consensus 128 ~~~~~~~~~G~~~~~v~~d~~----~~~~~~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~i~~~~~~~ 203 (478) T protein:vir:10 128 DILTAASNKGIEWVQPYVDEE----GEFKTFRVPAEQAVPIWTNKERDELQAFIRVYELDGAERVEYWTKDDVTFYELKE 203 (478) T ss_pred HHHHHHhhCCeEEEEEEecCC----CceEEEEEcccceEEEEcCCCCCceEEEEEEEeeeCceEEEEEeCCcEEEEEecC Confidence 999999999999887666653 4578999999998776421111100 00000 1111111122221110 10111 Q ss_pred ccccccccc-ccc-cce--e----cchhceeeeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceE Q lcl|NC_019530. 223 QSGTSQNFQ-PSQ-QEL--V----FDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTEDATVIYAITRAPEKRA 293 (560) Q Consensus 223 ~~~~~~~~~-~~~-~~~--~----i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~EDalVIyRitRAPeRRv 293 (560) ......... ..+ ... . -+...|-.++. .++....|-|+..+....-+. ++-+....-+-.+.|-+-+ T Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~----~n~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~ 279 (478) T protein:vir:10 204 GQLIPDFYRSEDHIQPHYYQGNKLMSWGRVPFIPF----KNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYIL 279 (478) T ss_pred CeeeccccccccccccceecccccccCCcceEEEe----ccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceee Confidence 000000000 000 000 0 00000001110 123334466666555555444 3344444456666665544 Q ss_pred EEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchH-HHHH Q lcl|NC_019530. 294 FYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEM-DHVN 372 (560) Q Consensus 294 FyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei-~DV~ 372 (560) .-.+..+.. + .... |..+ + =+|++.-+| .++..|-...+...+ .-+. T Consensus 280 ~g~~~~~~~--~---~~~~-~~~~--~---------------------~~~~~~~~~---~~~~~l~~~~~~~~~~~~~~ 327 (478) T protein:vir:10 280 KGYEGEDMK--D---FMHN-LKYY--K---------------------AISVAGESG---SGVDTIKVEVPIDSVKEYTK 327 (478) T ss_pred ecCCccccc--c---hhhh-hhhC--c---------------------eeEecCCCC---CcceEEeecCCHHHHHHHHH Confidence 322221111 0 0000 1111 1 123332232 335655554455443 3367 Q ss_pred HHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhh Q lcl|NC_019530. 373 YFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKP 452 (560) Q Consensus 373 YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~ 452 (560) -+.+.+|+..++|- +.. .++. |...+..|..-......-+.+.++.|...+..+|+.-+-+.|+ + .+|. T Consensus 328 ~l~~~I~~~s~~p~--~~~-~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~-~-~d~~----- 396 (478) T protein:vir:10 328 MLRDYIIEFGQGVD--FQQ-DKFG-NSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRL-D-VRVQ----- 396 (478) T ss_pred HHHHHHHHHhCCcC--cCc-cccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-C-cccc----- Confidence 78888999999994 211 2211 1222333443333334445566666666666665544444454 2 3342 Q ss_pred hceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcC-----CCCCC Q lcl|NC_019530. 453 FIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKG-----LYPKV 527 (560) Q Consensus 453 ~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~-----~~~~p 527 (560) .|.+.|.. +...-+. +-+++++.+ .| .+|.+++++. |...++ -+++.++|++|..+. -+.++ T Consensus 397 ~i~i~f~~-~~p~~~~------e~~~~~~~~---~g-~iS~et~i~~-~~~v~d-~~~E~~ri~~E~~~~~~~~~~~~~~ 463 (478) T protein:vir:10 397 DIEITFNF-NVMVNEL------ENSQIAMNS---TG-LLSKETILGN-HSWVQD-PVAEMERIEQENIELNQQLPDIEEG 463 (478) T ss_pred cceEEeCC-CCCCCHH------HHHHHHHHH---hC-CCChHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHhccccCCC Confidence 25666643 3322111 123344444 24 4899999976 554432 233334444443221 11111 Q ss_pred CccccccccccccCCCCCCcc Q lcl|NC_019530. 528 QADETGNYSGSDVSPLKFKPE 548 (560) Q Consensus 528 ~~~e~g~~~~~~~~~~~~~p~ 548 (560) ..+++-.. +.+.. +| T Consensus 464 ~~d~~~~~-~~d~~-----~e 478 (478) T protein:vir:10 464 LNDEQQRQ-SEDNQ-----SE 478 (478) T ss_pred Cccccccc-CcCCC-----CC Confidence 11111111 11111 12 No 106 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=96.92 E-value=0.00026 Score=40.27 Aligned_cols=461 Identities=14% Similarity=0.131 Sum_probs=202.9 Q ss_pred hhhhhhhhhhhhhc-ccccchhh-hccccccccCCCCCCCCceeE-ecccccee-ccccccchhhHHHHHHHHHHHhhcc Q lcl|NC_019530. 12 FGGGGLVNAKVDTD-KLAQKQDE-LLLTKATVVALDDAQDGSIIL-QGGANTYN-YVGVESELLSVKTVVEEYQSMAQQP 87 (560) Q Consensus 12 ~~~~~~~~~~~~~~-~~~~~~~e-~~~~k~~s~~~~~~~dG~~~i-~~~~~~~~-y~~~e~~~~~~~~LI~~YR~ma~~P 87 (560) .|....++.+++.- +......| .+.++.-...+.+..+..... .+...... ++.... |.+ ..| T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~------------~~~-~~~ 67 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVHD------------KLM-NSG 67 (518) T ss_pred CcchhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCcccc------------ccc-cCC Confidence 33333344443321 11111111 111111111111111110000 00000000 111110 111 111 Q ss_pred chhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCC Q lcl|NC_019530. 88 EIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPT 167 (560) Q Consensus 88 Evd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~ 167 (560) --..++.++. ..++ ..+|++.++..+...+ +..++..+.+++-.+|..+..+.+..+..-|..+|+..+|.. T Consensus 68 l~~~i~~~~A-~ll~----~e~~~i~v~~~~~~d~---e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~ 139 (518) T protein:vir:78 68 TGNEIVVVAA-EYIS----GKPLSIDVTGVNGSKD---ENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINILNG 139 (518) T ss_pred hHHHHHHHHH-Hhhc----CCCceEEecCccccCc---HHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEECC Confidence 1111222211 1121 2244566655443222 256777888999999999999999999999999999999853 Q ss_pred CCccCeeEeEEeCccccchhhhhcccCCccccccccc-----cchhee----------------ecCcccccccc----c Q lcl|NC_019530. 168 NKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIES-----VTLKYY----------------YNPNYNRNQFT----G 222 (560) Q Consensus 168 ~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~-----~~e~~~----------------Y~p~~~~~~~~----g 222 (560) . ..+.+++|-++-++. .+.+...+-++.. -..+|+ |++..-.+... + T Consensus 140 ~-----~~i~~v~ad~~~P~~---~~g~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~ 211 (518) T protein:vir:78 140 R-----PSISVHSSSQFWIDF---KNNEPFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGD 211 (518) T ss_pred e-----eEEEEEcCCeeEEEe---ecCcEEEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCc Confidence 3 368889998887752 1111111111110 001111 11111000000 0 Q ss_pred c-cccccc---------ccccc--ccee----cchhceeeeec-c--cccCCCCcchhhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 223 Q-SGTSQN---------FQPSQ--QELV----FDDESIVYIDS-G--EEPLANGIVPGLLNPAIRPLNNLVTTEDATVIY 283 (560) Q Consensus 223 ~-~~~~~~---------~~~~~--~~~~----i~~daI~y~hs-G--l~~~~~~~i~syL~~AiK~~NqL~m~EDalVIy 283 (560) . ...+.. ....+ .... .++-.+.|..- + -.+......+|-|+.|.-++..|-..=+ -+. T Consensus 212 ~~v~~~~~~~~~~l~~~~~~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s--~~~ 289 (518) T protein:vir:78 212 KTTPISAERLPEQITSYLHTNDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFT--VYM 289 (518) T ss_pred ccccccccccccccccccccccCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHH--HHH Confidence 0 000000 00000 0000 00111122110 0 0011223356888888888777776665 466 Q ss_pred HHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhccccc----CCCC-ccceee Q lcl|NC_019530. 284 AITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRR----EGQN-ATEIAT 358 (560) Q Consensus 284 RitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRR----eGgr-gTEIsT 358 (560) |..|.=++|||.-+ .=|+ ...|..++. ....+- --.+++.+.. +|+. .+.|++ T Consensus 290 ~e~~~g~~~i~v~~-~~l~------------------~~~~~~~~~--~~~~fd-~~~~~y~~i~~~~~~~~~~~~~i~~ 347 (518) T protein:vir:78 290 REGEKTKTKIAASE-RMFR------------------KKVNKSTDK--EEWSMN-VDEDYFMQFKGTLDAGAKLNDMIQF 347 (518) T ss_pred HHHHhCCceeeech-hHhc------------------cCCCCCCCc--cccccC-CCCceEEEecCcCCCCCccccceee Confidence 88888777777621 1111 000111110 000000 0113333321 1222 234666 Q ss_pred cCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019530. 359 VGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLIL 437 (560) Q Consensus 359 LpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiL 437 (560) +...=.-.+ ..-+..+.+.+....+++-+-|+.+++ ..+ |+||.+..-+-...+.+.|+.+...+.++++.-|-| T Consensus 348 ~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~---~~T-ATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l 423 (518) T protein:vir:78 348 MQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNR---EVK-ATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYL 423 (518) T ss_pred eecccChHHHHHHHHHHHHHHHHhhCCChhhcCcccc---ccc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 654322222 333677788888899999888865432 223 678877766666788888888777777777764433 Q ss_pred -ccCCChhHHHHhh-hhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHH-hCCCHHHHHHHHHH Q lcl|NC_019530. 438 -KGVTDEKDWNEKI-KPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNV-LRMSDEEVKEQQTK 514 (560) Q Consensus 438 -kgiit~eew~~~i-~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~I-L~~tDeEI~e~~kq 514 (560) +.......|.... ...+.++|-. +.+.. ++ +.++.++++-.- | ..|+++.+++. -..||+|.+++-++ T Consensus 424 ~~~~~~~~~~~~~~~~~~v~i~f~D-~i~~D-~~-----~~~~~~~~~v~a-G-imS~e~~i~~~~~~~~deea~~e~~r 494 (518) T protein:vir:78 424 LTGGTNNKEKAIMRDEIRVIIEFPD-PMSVN-LN-----ELSSTLNNMNSA-L-AMSVEEKVKLIHPKWEDEEIQAEVKR 494 (518) T ss_pred HHhhcCccccccCCCceeEEEEeCC-CCCCC-HH-----HHHHHHHHHHhc-C-CCCHHHHHHHhCCCCCHHHHHHHHHH Confidence 2221111110000 0125555542 22222 11 112222222211 3 68999877664 36899999999999 Q ss_pred HHHhhhcCCCCCCCccccccccccccCC Q lcl|NC_019530. 515 IAEEKKKGLYPKVQADETGNYSGSDVSP 542 (560) Q Consensus 515 Ie~E~~~~~~~~p~~~e~g~~~~~~~~~ 542 (560) |++|.... ..|+.++.++..+.. | T Consensus 495 i~~E~~~~--~~~~p~~~~g~~~~~--g 518 (518) T protein:vir:78 495 IYLENAIG--EVPDPEAIGGMETKG--G 518 (518) T ss_pred HHHHhccc--CCCCCccccCCCCCC--C Confidence 99997554 333333333322111 1 No 107 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=96.81 E-value=0.00032 Score=39.73 Aligned_cols=397 Identities=16% Similarity=0.146 Sum_probs=170.0 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) |+.++.+ +..+.+ ..+... +.......++...+...+ .|....++|.|.+||+ T Consensus 1 Mg~f~~~------~~~~~~-----~~~~~~----~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~v~~~i~ 53 (406) T protein:vir:95 1 MGLFDRW------RRTKRK-----SKIRAD----TGYVGLFMSGEDVSFLVP------------GYVRLSDNPEVRMAVH 53 (406) T ss_pred Ccchhhh------cccccc-----cccccc----chhhhhhccCcccCcccc------------CHHHHhhcHHHHHHHH Confidence 3333221 111100 000000 000000011111111110 1234567899999998 Q ss_pred HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHH-----HhCCchhhHHHHHHhhhhhh--hhhhheecCC Q lcl|NC_019530. 95 IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMH-----LMDFDNTAYQKIRKWYVDGR--QAYHVIVDPT 167 (560) Q Consensus 95 eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~-----ll~f~~~~~~~~R~wYvDGr--~~~hkiiD~~ 167 (560) -|.+.+.-+ |+.+--...++ .. .+.....+.++ .+....-.+.+++.++.+|. .|..++.|. T Consensus 54 ~ia~~ia~~-----~~~~~~~~~~~-~~----~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~- 122 (406) T protein:vir:95 54 KIADLISSM-----TIYLMQNTEDG-DI----RIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTA- 122 (406) T ss_pred HHHHhhccC-----ceEEEEecCCc-ce----eecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECC- Confidence 877766322 22221111111 11 11111111121 11222233444555566654 344343333 Q ss_pred CCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeee Q lcl|NC_019530. 168 NKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYI 247 (560) Q Consensus 168 ~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~ 247 (560) +.-+.+|++|+|.++..++ ..+ + |.|. +.+ ..++.+.|.|. T Consensus 123 --~g~~~~l~~i~~~~v~~~~---~~~--~-----------~~~~-------~~~--------------~~~~~~evih~ 163 (406) T protein:vir:95 123 --DGLIDELVPLTPSKVNFLD---TPD--G-----------YQVL-------YGG--------------QTFNYDEVLHF 163 (406) T ss_pred --CCcEEEEEEEcCceeEEEE---cCC--e-----------EEEE-------ecc--------------EEEchhHEEEe Confidence 3448999999998887642 111 1 1121 011 13566666655 Q ss_pred ecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccC Q lcl|NC_019530. 248 DSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTT 327 (560) Q Consensus 248 hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~T 327 (560) .-.-...+.-+..|.+..|..++.....+++...-+.---+.-+-+..++. .+.+..+++..+.+..+|+.-. .. T Consensus 164 ~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~-~l~~e~~~~~~~~~~~~~~g~~----n~ 238 (406) T protein:vir:95 164 IYNPDPERPYIGRGYRVVLKDIADNLKQATATKKSFMSGKYMPSLIVKVDA-ATAELSSEEGRNAVFKKYLQAT----EA 238 (406) T ss_pred eccCCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCCHHHHHHHHHHHHHHhcccc----cc Confidence 311123344457799999999999998888888766666667777777764 5777777777666666554310 01 Q ss_pred CcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhH Q lcl|NC_019530. 328 GKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQ 407 (560) Q Consensus 328 Gev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItR 407 (560) | +.+ +++ -+|..-++++.+.. ..+.-++-.++..+.++++++||...|+..+ + . T Consensus 239 ~------~~~-v~~------~~~~~~~~~~~~~~-~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~-------~-~---- 292 (406) T protein:vir:95 239 G------QPW-IIP------AELLEVEQVKPLSL-KDIAINEAVELDKRTVAGMFGVPAFLLGIGE-------F-N---- 292 (406) T ss_pred C------Cce-eec------CCCccccccccCCh-hHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-------c-h---- Confidence 1 111 111 01111233333322 2333345567888999999999998884211 1 1 Q ss_pred HHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_019530. 408 EELRFSKFCAG-LRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPF 486 (560) Q Consensus 408 DElkF~Kfi~r-Lr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~ 486 (560) |-.+..|+.. |+ -+...+.+.|-..| +++.++ .+.|. +.++.... ...|++.+..+-. T Consensus 293 -~~~~~~~~~~~l~-P~~~~ie~~l~~~l-----~~~~~~--------~~~fd----~~~l~~~d-~~~~~~~~~~l~~- 351 (406) T protein:vir:95 293 -RDEYNNFINSTIL-PIAKGIEQELTRKL-----LISPDL--------YFKFN----PRSLYAYD-LKELAEVGSNMYV- 351 (406) T ss_pred -HHHHHHHHHHHHH-HHHHHHHHHHHHhc-----CCCCCc--------EEEee----chhhhcCC-HHHHHHHHHHHHh- Confidence 1122233221 22 12223333443333 343332 23343 23332221 2245555544422 Q ss_pred hcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCC-CccccccccccccCCCCCCcccccccCCccc Q lcl|NC_019530. 487 VGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKV-QADETGNYSGSDVSPLKFKPETIPFSGSTDD 558 (560) Q Consensus 487 vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p-~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~ 558 (560) .-+++.+-+++ .|++...+ ..++-+-.-.-.++-..+ .....|++..++.+. -+ T Consensus 352 -~G~~t~NE~R~-~~gl~p~~--~gd~~~~~~n~~~~~~~~~~~~~k~g~~~~~~~~--------------~~ 406 (406) T protein:vir:95 352 -RGIMEGNEVRD-WLGLSPKE--GLSELVILENYIPLDKIGDQSKLKGGDNSGADGQ--------------TD 406 (406) T ss_pred -CCCcCHHHHHH-HhCCCCCC--CcceeeeccCccchhhcccccccCCCCCCCCCCC--------------CC Confidence 23777777774 46665321 111000000000000000 000111111111111 11 No 108 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=96.81 E-value=0.00033 Score=39.70 Aligned_cols=325 Identities=11% Similarity=0.167 Sum_probs=143.3 Q ss_pred chhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccc--cceeccccccchhhHHHHHHHHHHH Q lcl|NC_019530. 6 RGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGA--NTYNYVGVESELLSVKTVVEEYQSM 83 (560) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~--~~~~y~~~e~~~~~~~~LI~~YR~m 83 (560) -|+|+.| +++ .+.. |+ +.-..+.+++ ..+.++..+ .- T Consensus 1 M~~~~~f---------------~~r---------~~~~-~~--~~~~~~~~~~~~~~~~~v~~~--------------~a 39 (359) T protein:vir:10 1 MSILNPF---------------ERR---------SSIT-PN--NYYPFMVQNGSIVPNSLVDAT--------------EA 39 (359) T ss_pred Ccccchh---------------hcc---------ccCC-CC--cchhhhhccccccCCcccCHH--------------Hh Confidence 2333322 111 1111 11 1111111111 111122111 12 Q ss_pred hhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHH----HHHHhhhhhhhh Q lcl|NC_019530. 84 AQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQ----KIRKWYVDGRQA 159 (560) Q Consensus 84 a~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~----~~R~wYvDGr~~ 159 (560) +++|-|..||.-|.+.+.-. .+.. .... ..+++-=+-...+.+ ++..++.+|.-| T Consensus 40 l~~~av~~cv~~ia~~ia~~---------p~~~-----~~~~-------~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay 98 (359) T protein:vir:10 40 LKNSDLYAVTSLISSDIAGT---------RFIG-----NQVF-------TSVLNNPSHLTNAFSFWQTAILNLLLNGNVF 98 (359) T ss_pred hcchHHHHHHHHHHHhhhcC---------cccc-----chHH-------HHHhhcccccCCHHHHHHHHHHhccccCceE Confidence 46788889997766544321 1111 1111 112222222234444 444555678877 Q ss_pred hhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceec Q lcl|NC_019530. 160 YHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVF 239 (560) Q Consensus 160 ~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i 239 (560) ..++.|.. .-+++|++|+|-.+.... .+ ++ -+|.|... ..+....+ T Consensus 99 ~~i~r~~~---g~~~~l~~l~~~~v~i~~----~~--~~--------~~y~~~~~-----------------~~~~~~~~ 144 (359) T protein:vir:10 99 LAILKGDN---SLMKELRLIPSNAITIDL----TD--DT--------LTYEVNQF-----------------DDYPSAKY 144 (359) T ss_pred EEEEECCC---CeEEEEEEeCCceEEEEE----cC--Ce--------EEEEEEec-----------------CCceEEEE Confidence 76665442 236899999997776421 11 11 11222110 11234557 Q ss_pred chhceeeeec---ccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHh Q lcl|NC_019530. 240 DDESIVYIDS---GEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGK 316 (560) Q Consensus 240 ~~daI~y~hs---Gl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k 316 (560) +.+.|.|..- +.-..++-..+|.|+.|.+++.....+++...-+=---+--+-+..++-|.+.+..+++ +++...+ T Consensus 145 ~~~evih~~~~~~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~-~~~~~~~ 223 (359) T protein:vir:10 145 NASEMIHVKIMAYGVDTLHNLVGHSPLESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQGTLSSEAKDS-IRKEFEK 223 (359) T ss_pred cccceEEeccCCCCCCccCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHH-HHHHHHH Confidence 7777776642 11122344567899999999998888887654332222223567777777766655443 3332233 Q ss_pred hhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchH---HHHHHHHHHHHHhcCCCccccCCCC Q lcl|NC_019530. 317 FKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEM---DHVNYFREKLYDALMIPKSRLQEEG 393 (560) Q Consensus 317 ~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei---~DV~YF~~kLy~aL~VP~sRl~~~~ 393 (560) +..- ...|++ + .++ .|.+++.|. .+.-++ +-.+|-...+.++++||...|+..+ T Consensus 224 ~~~~----~n~g~~------~-vl~----------~g~~~~~l~--~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~ 280 (359) T protein:vir:10 224 ANGG----NNSGRV------M-VLD----------QSADFSTVS--INADVANYLNSMNWGRTQIAKAFGVSDSYLNGTG 280 (359) T ss_pred HhCc----cccCCc------e-ecC----------CCcceeeec--CCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC Confidence 3210 112221 1 222 255666663 333333 3445667889999999999996433 Q ss_pred cccccccchhhhhHHHHHHHHHH----HHHHHHHHHHHH---------------HHHHH---HHHhccCCChhHHHHhhh Q lcl|NC_019530. 394 SINIGGSNLAEITQEELRFSKFC----AGLRRRYSHFFM---------------EFLRR---QLILKGVTDEKDWNEKIK 451 (560) Q Consensus 394 ~~~l~~~~~~eItRDElkF~Kfi----~rLr~rfs~if~---------------d~Lk~---qLiLkgiit~eew~~~i~ 451 (560) +.+-.. +.+ |-.+..|+ ..+-.+++.-|. ..++. .++-.|++|+.|+.+.+ T Consensus 281 ~~~~~~---~~~---e~~~~~~l~~~l~p~~~~l~~~l~~~~~~~~~~~~~~d~~~~~~~~~~~~~~G~~t~NE~R~~l- 353 (359) T protein:vir:10 281 DQQSSL---DQI---KDLYVNALNRFIEPLISELRIKCDSSIGVDMSPITDYSNSVFKADILNWVKEGIIEPTEAKTLL- 353 (359) T ss_pred cccccH---HHH---HHHHHHHHHHHHHHHHHHHHHHhhhhhcccchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh- Confidence 322112 222 11122221 111122211111 11111 12223445555444321 Q ss_pred hhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_019530. 452 PFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFV 487 (560) Q Consensus 452 ~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~v 487 (560) .+.|+. T Consensus 354 ------------------------------~~~pv~ 359 (359) T protein:vir:10 354 ------------------------------ESKGII 359 (359) T ss_pred ------------------------------CCCCCC Confidence 223333 No 109 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=96.77 E-value=0.00035 Score=39.55 Aligned_cols=406 Identities=10% Similarity=-0.001 Sum_probs=180.6 Q ss_pred cccchhhHHHHHHHHHHHhhccchhHHH----------------------HHhhcceeeecCCCceEEEeeecccchhHH Q lcl|NC_019530. 66 VESELLSVKTVVEEYQSMAQQPEIRKAV----------------------DIIVNDVVTCEEDETPVTVNLDKVEGISDT 123 (560) Q Consensus 66 ~e~~~~~~~~LI~~YR~ma~~PEvd~AI----------------------~eIvneaiv~d~~~~~V~l~Ld~~~~~s~s 123 (560) +-..+.+. -..+|+.+..+.+-+..+ ..||+- .+.=--..||++..++.. + T Consensus 1 ~~~~~~~~--~~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~-~~~~l~g~~~~~~~~~~~--~-- 73 (440) T protein:vir:95 1 MLAAFLGS--QKQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSF-ATGYVIGNPVSIGVMEGG--S-- 73 (440) T ss_pred ChhhHHHH--HHHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHh-hhhheeccCceEeeCCCc--c-- Confidence 22222221 123344444444333221 011111 111113334444433321 1 Q ss_pred HHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCC-cccccc- Q lcl|NC_019530. 124 VKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEA- 201 (560) Q Consensus 124 ik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~- 201 (560) +...+.+..++...+|+....++.|..++.|+-|.+.-+|. +|=..+..++|+.+-+++.=....+ ...+.. T Consensus 74 --~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~----~~~~~i~~~~p~~~~~~~d~~~~~~~~~~i~~~ 147 (440) T protein:vir:95 74 --ADQLSTIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDK----DKVDRVVLISPLEMFVIRDLTVEQNIIAAVHLP 147 (440) T ss_pred --HHHHHHHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEE Confidence 12445667778888999999999999999999888766653 3556788999998877642111000 000110 Q ss_pred ccccch-heeecCccccccc--cccccccccccc-cccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHH-HH Q lcl|NC_019530. 202 IESVTL-KYYYNPNYNRNQF--TGQSGTSQNFQP-SQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TT 276 (560) Q Consensus 202 ~~~~~e-~~~Y~p~~~~~~~--~g~~~~~~~~~~-~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~ 276 (560) ...... .-+|.+..-+... +..........+ +..--++|.- .|+ ++....|=++..+.....+. ++ T Consensus 148 ~~~~~~~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv--~~~-------n~~~g~sd~e~v~~lida~~~~~ 218 (440) T protein:vir:95 148 IYADKVNMTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVPVV--EWW-------NNRFRMGDYESEISLIDAYDAGQ 218 (440) T ss_pred EecCceEEEEEeCCeEEEEEEecCCccceeecceeeccCceeeEE--Eee-------CCCCCCCchhhhHHHHHHHHHHH Confidence 000011 1134433221100 000000000000 0000012211 111 11222344444444433332 33 Q ss_pred HHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhccccc----CCCC Q lcl|NC_019530. 277 EDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRR----EGQN 352 (560) Q Consensus 277 EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRR----eGgr 352 (560) -+.+..-+..+.|-+.+.-.+.+.-..... .. ++++.+. .|++.. .++. T Consensus 219 s~~~~~~~~~~~~~~v~~g~~~~~~~~~e~-------~~-------------~~~~~~~-------~~~~~~~~~~~~~~ 271 (440) T protein:vir:95 219 SDTANYMSDLNDAMLLVKGDLDGIKLSPED-------AA-------------KMKDANM-------LFLKTGISTTGQQT 271 (440) T ss_pred HHHHHHHHHhhcceeeeecccccCCCCccc-------hh-------------hhhhccc-------eecccccccccCCC Confidence 444555566666665554321111000000 11 1111111 333322 2233 Q ss_pred ccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 353 ATEIATVGGGNQLGEM-DHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFL 431 (560) Q Consensus 353 gTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~L 431 (560) +..++.|-...++... .-++-+.+.+|....+|-=-++.- +.|+ .+..|.--+.....-+.+.|..|..-+.++| T Consensus 272 ~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~---Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~ 347 (440) T protein:vir:95 272 TADASYIYKQYDVNGTEAYKNRLANDIHRFSRIPNLDDDRF-NSTS---SGIALLYKMIGLEQVRKDKETYFTKALRRRY 347 (440) T ss_pred CcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc-cccc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4457777665665543 457788889999999984111111 1122 2333443344445557777788888888777 Q ss_pred HHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHH Q lcl|NC_019530. 432 RRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQ 511 (560) Q Consensus 432 k~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~ 511 (560) +.-+-+-+++...+|+. ..+.+.|..----.+. +.++++..+ +| .+|.++++.. |..+|.+ ++ T Consensus 348 ~li~~~~~~~~~~~~~~---~~v~i~f~~~~p~~~~-------~~ad~~~kl---~g-~iS~et~~~~-l~~~d~~--~E 410 (440) T protein:vir:95 348 ELISNIHKAINGPVIEA---NKLTFTFHPNIPQDVW-------TEIKAYIEA---GG-EISQETLMEN-ASFTDYK--TE 410 (440) T ss_pred HHHHHHHhhcCCccccc---ccceEEeCCCCCCCHH-------HHHHHHHHH---hc-cCcHHHHHHh-CCCCCcH--HH Confidence 76544445555555543 2366666544322222 344445554 35 5999999987 5665543 33 Q ss_pred HHHHHHhhhcCCCCCCCccccccccccccCCCCCCcc Q lcl|NC_019530. 512 QTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPE 548 (560) Q Consensus 512 ~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~ 548 (560) .++|++|...- . .+..+.. ++..+..-++| T Consensus 411 ~~ri~~E~~~~-~-----~~~~~~~-~~~~~~~~~~e 440 (440) T protein:vir:95 411 HSRILKQGGSS-D-----LEIGQIV-GDADVGQADTE 440 (440) T ss_pred HHHHHHHHHHh-h-----hhHHhhc-cCCCCCCcCCC Confidence 45555554211 1 1111111 11122233333 No 110 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=96.75 E-value=0.00036 Score=39.46 Aligned_cols=449 Identities=15% Similarity=0.217 Sum_probs=176.2 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCC-ceeEeccccceeccccccchhhHHHHHHH Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDG-SIILQGGANTYNYVGVESELLSVKTVVEE 79 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG-~~~i~~~~~~~~y~~~e~~~~~~~~LI~~ 79 (560) .|--.- -++..+.+-++...-+-. ++ . +..-.-|+ .+..++-+++ .-+++.-=.-|- T Consensus 60 ~~~~~~--~~~~~~~~~~~~~~~~~~--~~-------~----~~~~~~~~~~~~~~~l~~~-----~~~~F~Gy~~la-- 117 (698) T protein:vir:10 60 VAEPSP--SLRLARQFEVDVSNYTPR--ER-------R----AASYALDFNGTSMDALSFV-----TSSGFPGFPTLV-- 117 (698) T ss_pred ccCCCc--cccccccceeccccCCcc--cc-------c----hhhhhhcccccccccchhh-----hccCcchHHHHH-- Confidence 111110 111111111111111110 00 0 00000110 0111111010 011121112233 Q ss_pred HHHHhhccchhHHHHHhhcceeee------cCCCc----eEEEeeecccchhH-HHHHHHHHHHHHHHHHhCCchhhHHH Q lcl|NC_019530. 80 YQSMAQQPEIRKAVDIIVNDVVTC------EEDET----PVTVNLDKVEGISD-TVKESITECFKEVMHLMDFDNTAYQK 148 (560) Q Consensus 80 YR~ma~~PEvd~AI~eIvneaiv~------d~~~~----~V~l~Ld~~~~~s~-sik~~I~eeF~~il~ll~f~~~~~~~ 148 (560) +|||+||.+++++-|..||+-. ...++ .+++.-+..+ .++ .-.++|..|++. |+...+..+. T Consensus 118 --~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~-~~d~dqi~~L~~e~er----l~V~~~l~ea 190 (698) T protein:vir:10 118 --LLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAAS-TSDGDQLKQINDEIER----LRIRDAVRTT 190 (698) T ss_pred --HHhhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccc-cccHHHHHHHHHHHHH----HHHHHHHHHH Confidence 7899999999999999988432 22222 1333322222 122 334566666654 2334444444 Q ss_pred HHHhhhhhhhhhhheecCC--------------CCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCc Q lcl|NC_019530. 149 IRKWYVDGRQAYHVIVDPT--------------NKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPN 214 (560) Q Consensus 149 ~R~wYvDGr~~~hkiiD~~--------------~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~ 214 (560) ++-=-+.|.=.....|+.. -.|.+++.|+.|||..+-+- ....++-... .+|.|. T Consensus 191 i~~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGL~ViDp~~vtP~----------~~n~~dP~sp-dfgkP~ 259 (698) T protein:vir:10 191 VIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPN----------NYNSINPVAD-DFYKPS 259 (698) T ss_pred HHhcccccceEEEEEeecCccccccccccccccccCccceeeeeecccccccc----------hhhhccchhh-ccCCCc Confidence 3332233322111123222 23557888999999766652 1111111111 233332 Q ss_pred cccccccccccccccccccccceecchhceeeeecccccCCCCc-------chhhhHHHHHHHHH-HHHHHHHH-HHHHH Q lcl|NC_019530. 215 YNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLANGI-------VPGLLNPAIRPLNN-LVTTEDAT-VIYAI 285 (560) Q Consensus 215 ~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~~~-------i~syL~~AiK~~Nq-L~m~EDal-VIyRi 285 (560) +|...|+ +||.+-++....- ..+... ..|.+..+..-+.+ +++...+. ++... T Consensus 260 --~y~V~G~--------------~IH~SRL~~~vg~--pvpd~LKp~y~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~ 321 (698) T protein:vir:10 260 --TWWMIGS--------------EVHATRLHTIVSR--PVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQF 321 (698) T ss_pred --eEEEecc--------------eecceeEEEecCC--CchhhhcchhccCCccHHHHHHHHHHHHHHHhhhHHHHHHHh Confidence 2222221 3444433211110 001111 22444444433222 23222222 11111 Q ss_pred hcCccceEEEccCCC-CchHHHHHHHH--HHHHhhhhheeecccCCcccCcccchh-hHhhhcccccCCCCccceeecCC Q lcl|NC_019530. 286 TRAPEKRAFYLDVGT-LGKKSAEEYMT--MMMGKFKNRNAYDRTTGKITGNAHLMG-IAEDYWLPRREGQNATEIATVGG 361 (560) Q Consensus 286 tRAPeRRvFyIDvGn-lpk~KAeqYl~--~im~k~knk~vYd~~TGev~d~~~~ms-mlEDywLpRReGgrgTEIsTLpG 361 (560) .... +-.|... |.....++... +++++||.- -|-+ .+. =.|+| .+. T Consensus 322 ~~~~----l~~dla~aL~~g~~~~l~~R~eli~~~Rsn------~G~~-----llDk~~Eef-------------eq~-- 371 (698) T protein:vir:10 322 SVSG----ILMDLAQALTPGANVDLSMRAELINRYRDN------RNIL-----FLDKATEEF-------------FQF-- 371 (698) T ss_pred hHHH----HHHHHHHhcCChhhHHHHHHHHHHHHhcCc------cceE-----EEecCCcce-------------EEE-- Confidence 1100 0111100 00001111111 344455431 1100 000 01233 122 Q ss_pred CCCcchHHHHH-HHHHHHHHhcCCCccccCCCC--cccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----- Q lcl|NC_019530. 362 GNQLGEMDHVN-YFREKLYDALMIPKSRLQEEG--SINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRR----- 433 (560) Q Consensus 362 g~nLgei~DV~-YF~~kLy~aL~VP~sRl~~~~--~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~----- 433 (560) ..+|+-++||. =|..-+=-+.+||+.||=..+ |||--+. + |.-.|...|..+|. ..+...|++ T Consensus 372 st~lSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE--~----D~rnYYD~I~s~Qe---~~L~p~L~rl~~ii 442 (698) T protein:vir:10 372 NTPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSE--G----EIRVWYDYVRAYQR---NALQQLMNDVIVMI 442 (698) T ss_pred ecCcCCHHHHHHHHHHHHHhhhcCchhhhhccCCcccCccch--h----hHHHHHHHHHHHHH---HHHHHHHHHHHHHH Confidence 25788888874 588888889999999985444 6654443 2 45559999998885 445555554 Q ss_pred HHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHH-----------hC Q lcl|NC_019530. 434 QLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNV-----------LR 502 (560) Q Consensus 434 qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~I-----------L~ 502 (560) |+-+.|-+ -+.|.|.|..=..-+|..-+||...+.+.-+.+-.- --++.+-|+... +. T Consensus 443 ~rS~~G~i---------dp~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~--gvI~~~evr~rL~~d~~s~Y~~~~d 511 (698) T protein:vir:10 443 QLSLFGAV---------DPSIKWQWNALRELDDLEVAEARYKQAQSDVLYVQE--QVIRPDQVAARLNTEPDGPYAGKLD 511 (698) T ss_pred HHHhcCCC---------CCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHh--cCCCHHHHHHHHhccCCCccccccC Confidence 33333443 345788888666777877888888777654433221 123333333221 11 Q ss_pred CC-------HHHHHHHHHHHHHhhhcCCCCCCCc-cccccccccccCCCC---CCccccc-ccCCcccCC Q lcl|NC_019530. 503 MS-------DEEVKEQQTKIAEEKKKGLYPKVQA-DETGNYSGSDVSPLK---FKPETIP-FSGSTDDSI 560 (560) Q Consensus 503 ~t-------DeEI~e~~kqIe~E~~~~~~~~p~~-~e~g~~~~~~~~~~~---~~p~~~~-~~~~~~~~~ 560 (560) .+ |.+|+.+.++...-...| +..+ ..+++.-+|+-.|.+ .+++..| ++|+.+-+| T Consensus 512 ~~d~p~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 578 (698) T protein:vir:10 512 ANDDPGAPADDDIDGVLTYVQRMAEGG---DTGAPTAPGGARAGATAPPAAANVNANANPREAGAQDAAM 578 (698) T ss_pred CcccCCCCCCCcchHHHhhhcCCcCCC---CcccccccccccCCCCCCcccccccCCCCccccCccccee Confidence 11 123333333322221111 1111 112222222222222 2223332 333333333 No 111 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=96.75 E-value=0.00036 Score=39.46 Aligned_cols=396 Identities=12% Similarity=0.109 Sum_probs=170.4 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHH Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEY 80 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~Y 80 (560) |.- -++++.+++.+.+.-+ .....+-+.. +......+.++ .. T Consensus 1 ~~~--------~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~-~~~~~~~~~~v------------~~ 42 (409) T protein:vir:93 1 MAK--------ENIVTRIKKKLIDNWI-----------------DQSTSKLYDF-SPWKNRSFWGV------------IN 42 (409) T ss_pred CCc--------cchhhhhhhhhhhhhh-----------------cccccccccc-ccccCcccccc------------ch Confidence 111 0111112222111100 0000011111 00000111111 11 Q ss_pred HHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHH-hCCchh----hHHHHHHhhhh Q lcl|NC_019530. 81 QSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHL-MDFDNT----AYQKIRKWYVD 155 (560) Q Consensus 81 R~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l-l~f~~~----~~~~~R~wYvD 155 (560) +..+++|-|..||+-|.+.+.-. | +.+-......+ ....++|+. =+=... -+.++..++++ T Consensus 43 ~~~~~~~~V~~ci~~Ia~~ia~l-----p--~~~~~~~~~~~-------~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~ 108 (409) T protein:vir:93 43 NTLETNETIFSAITKLSNSMASL-----P--LKMYEDYKVVN-------TEVSDLLTVSPNNSLSSFDFINQIETIRNEK 108 (409) T ss_pred hhhhccHHHHHHHHHHHHhhhhC-----c--eeEeecccccc-------chHHHHHhhhcccCCCHHHHHHHHHHHHhhc Confidence 23468889999998777766532 2 22211111111 111222221 111122 34556667788 Q ss_pred hhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCcccccccccccccccccccccc Q lcl|NC_019530. 156 GRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQ 235 (560) Q Consensus 156 Gr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~ 235 (560) |.-|..++.|.. .-+.+|++|+|..+...+ +.+++ ..+|.|.+ ..+. T Consensus 109 Gnay~~i~r~~~---G~~~~L~~l~~~~v~~~~-----~~~~~-------~~~y~~~~------------------~~g~ 155 (409) T protein:vir:93 109 GNAYVLIERDIY---HQPSKLFLLNPDVVEMLI-----ENQSR-------ELYYSIHA------------------ATGN 155 (409) T ss_pred CceEEEEEECCC---CcEEEEEEEcCceeEEEE-----eCCCc-------EEEEEEEc------------------CCce Confidence 987776665532 237899999998887643 11121 11233321 1123 Q ss_pred ceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHH Q lcl|NC_019530. 236 ELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMG 315 (560) Q Consensus 236 ~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~ 315 (560) .+.++.+.|.|.. +.-..+.-+..|-|..|.+++.....++..- +. -..+|-.-+ ..--+.|-+.+++...+.+.+ T Consensus 156 ~~~~~~~eVih~r-~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~-~~-~~~~~~~~i-~~~~~~l~~e~~~~~~~~~~~ 231 (409) T protein:vir:93 156 KLIVHNMDMLHFK-HIVASNMVQGISPIDVLKNTTDFDNAVRTFN-LT-EMQKPDSFM-LKYGSNVGKEKRQQVLEDFKQ 231 (409) T ss_pred EEEEccccEEEeC-CCCCCCccccccHHHHHHHHHHHHHHHHHHH-HH-hcCCCCceE-EecCCCCCHHHHHHHHHHHHH Confidence 4568888887775 1112233334566777776666555555542 33 333343333 233456666665555444433 Q ss_pred hhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCccccCCC Q lcl|NC_019530. 316 KFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE---MDHVNYFREKLYDALMIPKSRLQEE 392 (560) Q Consensus 316 k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~~ 392 (560) .|. ..|.+ + .++ .|.+++.|. .+.-+ ++-..|-...+.++++||...|+.. T Consensus 232 ~~~-------~~g~~------~-vl~----------~g~~~~~l~--~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~ 285 (409) T protein:vir:93 232 YYE-------ENGGI------L-FQE----------PGVEIEPLP--KKYVSEDIVASENLTRERVANVFQLPSVFLNAR 285 (409) T ss_pred Hhh-------cCCCe------e-ecC----------CCceEEEcC--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC Confidence 332 22321 1 121 256777774 33333 3333467788999999999999765 Q ss_pred CcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHH Q lcl|NC_019530. 393 GSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAI 472 (560) Q Consensus 393 ~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei 472 (560) ++.+ .++..+..+. |..++ |+-.+ ..+.+.|-+ .++++.++.. . ..|+|..+ ++...+ T Consensus 286 ~~~~--~sn~e~~~~~---f~~~~--l~P~~-~~ie~~l~~-----~Ll~~~~~~~--~--~~~~fd~~----~ll~~d- 343 (409) T protein:vir:93 286 SNTN--FAKNEELNRF---YLQHT--LLPIV-KQYEEEFNR-----KLLTKTDREK--N--RYFKFNVK----SYLRAD- 343 (409) T ss_pred CCCC--cccHHHHHHH---HHHHH--HHHHH-HHHHHHHHh-----hcCCcccccC--c--ceEEeech----hhhccC- Confidence 4422 3333333333 65553 33211 122333333 3456666532 2 33444332 333222 Q ss_pred HHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccccc--ccccCCCCCCcccc Q lcl|NC_019530. 473 LNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYS--GSDVSPLKFKPETI 550 (560) Q Consensus 473 ~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~--~~~~~~~~~~p~~~ 550 (560) +.+|++.++.+-.- -+++.+-+++ +|++.+- |. ||.- +..+.++.-..+.- T Consensus 344 ~~~~~~~~~~~~~~--G~~T~NE~R~-~~g~~p~--------------------~g----gD~~~~~~n~~~~~~~~~~~ 396 (409) T protein:vir:93 344 SATQAEVYFKAVRS--GYYTINDIRE-WEDLPPV--------------------EG----GDKPLISGDLYPIDTPLELR 396 (409) T ss_pred HHHHHHHHHHHHhC--CCcCHHHHHH-HhCCCCC--------------------CC----cCeeeecccccccccchhhc Confidence 34556555544221 3666666664 3454221 11 2211 11111111111111 Q ss_pred cccCCcccCC Q lcl|NC_019530. 551 PFSGSTDDSI 560 (560) Q Consensus 551 ~~~~~~~~~~ 560 (560) +..+|-+... T Consensus 397 ~~~~gG~~n~ 406 (409) T protein:vir:93 397 KSLKGGDKNV 406 (409) T ss_pred ccccCCCCCc Confidence 1122211111 No 112 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=96.74 E-value=0.00037 Score=39.39 Aligned_cols=411 Identities=14% Similarity=0.151 Sum_probs=186.5 Q ss_pred cccchhhh---cccccc-ccCCCCCC-CCce-eEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_019530. 27 LAQKQDEL---LLTKAT-VVALDDAQ-DGSI-ILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDV 100 (560) Q Consensus 27 ~~~~~~e~---~~~k~~-s~~~~~~~-dG~~-~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvnea 100 (560) +++.++.. .+.+.. ++-+|-+. ++.. ..-++.... .+... + ..+.+.+|-|..||+-|.+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~g~~~s~~~~~~~~~~~~~~~~--~g~~v---~-------~~~al~~~~v~~ci~~Ia~~i 68 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKWLGVPISLTDGSFWSAWGGMGSS--SGETV---T-------ADSALQLSAVWSCVRLIAETI 68 (437) T ss_pred CCcchhhhhhhhHHhhhhhcCCcccCCchhHHHhhcccccC--CCcee---c-------hHhhhccHHHHHHHHHHHHHH Confidence 22222222 111111 12222221 1111 111111111 11100 0 124467899999998777755 Q ss_pred eeecCCCceEEEeeecccchhHHHHHHH-HHHHHHHHH-HhCCchhh----HHHHHHhhhhhhhhhhheecCCCCccCee Q lcl|NC_019530. 101 VTCEEDETPVTVNLDKVEGISDTVKESI-TECFKEVMH-LMDFDNTA----YQKIRKWYVDGRQAYHVIVDPTNKKGGIK 174 (560) Q Consensus 101 iv~d~~~~~V~l~Ld~~~~~s~sik~~I-~eeF~~il~-ll~f~~~~----~~~~R~wYvDGr~~~hkiiD~~~~k~GI~ 174 (560) .-. |+.+-=...++ .++.+ .....++|+ --+-...+ +.++..+.+.|.-|..++.| ...++ T Consensus 69 a~l-----p~~~~~~~~~g----~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~----~g~~~ 135 (437) T protein:vir:10 69 ATL-----PLNLYQTKPDG----TRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRS----AGVLI 135 (437) T ss_pred hhC-----ceeEEEEcCCC----ceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEec----CCcEE Confidence 422 22221111111 00000 111222222 12222233 34455566778777755443 25689 Q ss_pred EeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeecccccC Q lcl|NC_019530. 175 KLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPL 254 (560) Q Consensus 175 eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~ 254 (560) +|.+|+|-++...+ +..+. .+|.|. ...+....++.+.|.|... ... T Consensus 136 ~L~~l~p~~v~i~~-----~~~g~--------~~y~~~------------------~~~g~~~~~~~~dIih~r~--~~~ 182 (437) T protein:vir:10 136 GLELMLPQRTTVKR-----LTSGA--------LQYTYR------------------NVDGTVSTLAEDDVFHVRG--FSL 182 (437) T ss_pred EEEEEcCcceEEEE-----CCCCe--------EEEEEE------------------ecCceEEEEccccEEEecC--cCC Confidence 99999998876542 11121 122221 0112345678888887752 334 Q ss_pred CCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcc Q lcl|NC_019530. 255 ANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNA 334 (560) Q Consensus 255 ~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~ 334 (560) ++-+.+|-|..|.+++.....+++...=+----+--+-|..++ +.|.+.++++..+.+-.+|+.- ...|.+ T Consensus 183 d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g~----~nag~~---- 253 (437) T protein:vir:10 183 DGLMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTD-QILQKEKRAEIRTDLAEQFGGA----MQAGKT---- 253 (437) T ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhcCc----cccCcc---- Confidence 5556678899999999988888877665555555667777776 6688877766665554444321 011221 Q ss_pred cchhhHhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHH Q lcl|NC_019530. 335 HLMGIAEDYWLPRREGQNATEIATVGGGNQLGE---MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELR 411 (560) Q Consensus 335 ~~msmlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElk 411 (560) + .++ .|++++.|. .+..+ ++=-++-.+.+.++++||...|+...+.+..+++.++..+. T Consensus 254 --~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~--- 315 (437) T protein:vir:10 254 --M-VLE----------AGMKYQAIT--MNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQTLG--- 315 (437) T ss_pred --e-ecc----------CCceEEecc--CChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHH--- Confidence 1 222 256666663 23333 33334667889999999999997655544555554444443 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchh Q lcl|NC_019530. 412 FSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIF 491 (560) Q Consensus 412 F~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~ 491 (560) |..++ |+--+. .+.+.|-+ +++++.++.. -+|.|+ .+ ++.... +.+|.+.++.+-. .-++ T Consensus 316 f~~~t--l~P~~~-~ie~~l~~-----kll~~~e~~~---~~~~fd--~~----~ll~~d-~~~r~~~~~~~~~--~G~~ 375 (437) T protein:vir:10 316 FLTFT--LRPWLT-RIEQAARR-----SLLRPGERDQ---FYAEFS--VE----GLLRAD-SAGRAAFYSTMTQ--NGLM 375 (437) T ss_pred HHHHH--HHHHHH-HHHHHHHh-----hccCccccCc---eEEEEe--ch----hhhccC-HHHHHHHHHHHHh--CCCc Confidence 55442 332111 22333333 4456666532 233333 22 332222 3466666555421 1367 Q ss_pred hHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCccccc-----------ccCCcccC Q lcl|NC_019530. 492 SIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIP-----------FSGSTDDS 559 (560) Q Consensus 492 S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~-----------~~~~~~~~ 559 (560) |.+-+++ +|.|.+-+ --+..+ + .+++..+.++.+.+.+|.--. -.-+|+.- T Consensus 376 T~NE~R~-~~gl~pi~--gg~~~~--------~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 437 (437) T protein:vir:10 376 TRDECRA-KENLPPMG--GNAAVL--------T------VQSALLPIDKLGEHTTATAAQDALKAWLYQEEKTRATQER 437 (437) T ss_pred CHHHHHH-HhCCCCCC--CCcceE--------e------ecCcccchhhccCcCCCcchhccccccCCCCCCCCccccC Confidence 7777774 46664322 000000 0 011111111111000000000 00011111 No 113 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=96.71 E-value=0.00039 Score=39.26 Aligned_cols=423 Identities=13% Similarity=0.067 Sum_probs=172.8 Q ss_pred cCCCCCCCCceeEecc-------ccceeccccccchhh--H-------HHHHHHHHHHhhccchhHHHH----------- Q lcl|NC_019530. 42 VALDDAQDGSIILQGG-------ANTYNYVGVESELLS--V-------KTVVEEYQSMAQQPEIRKAVD----------- 94 (560) Q Consensus 42 ~~~~~~~dG~~~i~~~-------~~~~~y~~~e~~~~~--~-------~~LI~~YR~ma~~PEvd~AI~----------- 94 (560) +++..---|. +.... .+..-+.+.+..... - ...+.+|+.+..+.+-...|. T Consensus 1 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~e~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~ 79 (483) T protein:vir:12 1 MAQALIKGGN-ILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGA 79 (483) T ss_pred CccchhcCCc-eeecCcchhhhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc Confidence 1111111111 11110 111222223322211 1 122345666666554442210 Q ss_pred ----------------HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhh Q lcl|NC_019530. 95 ----------------IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQ 158 (560) Q Consensus 95 ----------------eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~ 158 (560) -|| +..+.---..||++..++. ...+..+.+++ .+|+....++.+..++.|+- T Consensus 80 ~~~~~~~~ki~~n~~k~Iv-d~~~~~l~G~p~~~~~~d~---------~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~ 148 (483) T protein:vir:12 80 VDPLKPDDRMITNFHANLV-DQKVSYIVGKPIAFKHTDD---------EVVKRIDEVLG-NRFDDKLHSVLTGASNKGIE 148 (483) T ss_pred ccccccccccccchHHHHH-HHHhhhhcccCceeccCCh---------HHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeE Confidence 011 1111112234555544432 22333444443 47889999999999999999 Q ss_pred hhhheecCCCCccCeeEeEEeCccccchhhhhcccCC-cccccccc--ccchheeecCcccccc-cccccccc------- Q lcl|NC_019530. 159 AYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEAIE--SVTLKYYYNPNYNRNQ-FTGQSGTS------- 227 (560) Q Consensus 159 ~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~~~--~~~e~~~Y~p~~~~~~-~~g~~~~~------- 227 (560) |.+.-+|. +|-..++.+||+.+-++..-....+ ...+.... .....-+|+|..-.+. +.+..... T Consensus 149 y~~v~~d~----d~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~ 224 (483) T protein:vir:12 149 WLHPYLDE----EGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLE 224 (483) T ss_pred EEEEEEcC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeecceEEEEEecCeEEEEEEeCCeeeeccccccc Confidence 98777664 3557899999999877632110000 00111111 0001112322211110 01110000 Q ss_pred --ccccccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchH Q lcl|NC_019530. 228 --QNFQPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTEDATVIYAITRAPEKRAFYLDVGTLGKK 304 (560) Q Consensus 228 --~~~~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~EDalVIyRitRAPeRRvFyIDvGnlpk~ 304 (560) .....+..--++| |++.. ++....|-++..+.....+. ++=+....-+.++.|-+-+.-.+.-+++ T Consensus 225 ~~~~~~~~~~~g~vP---vv~~~------nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~-- 293 (483) T protein:vir:12 225 NSKTHFSTGSWGKIP---FIPFK------NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELP-- 293 (483) T ss_pred ccccccccCCCCccc---eEEec------CCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccch-- Confidence 0000000000111 22221 12222344444443333333 3455555557777775544322222211 Q ss_pred HHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchH-HHHHHHHHHHHHhcC Q lcl|NC_019530. 305 SAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEM-DHVNYFREKLYDALM 383 (560) Q Consensus 305 KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~ 383 (560) +..+ ++..++...-++ |.++.+|-...+.+.. .-+.-+.+.+|+..+ T Consensus 294 ---~~~~---------------------------~~~~~~~~~~~~--~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~ 341 (483) T protein:vir:12 294 ---EFKR---------------------------LLRYYGAIKVSD--NGGVDTIQVEVPVENSKKYLDELYQKIMLFGQ 341 (483) T ss_pred ---hHHH---------------------------hhhhccccccCC--CCcceEEeecCCHHHHHHHHHHHHHHHHHHhC Confidence 1111 111111111111 1235555443444332 344566777888889 Q ss_pred CCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecch Q lcl|NC_019530. 384 IPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSY 463 (560) Q Consensus 384 VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~ 463 (560) +|---++ .+. |...+..|.--+.....-+.+.++.|...+...++.-+-+-|+ ..+|.. |.+.|...-- T Consensus 342 ~p~~~~~---~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~--~~~~~~-----i~v~f~~~~p 410 (483) T protein:vir:12 342 AVDFSSD---KFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI--KGEHKD-----VDISFNYNKV 410 (483) T ss_pred CCCCCcc---ccc-cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CCccce-----eeEEeCCCCC Confidence 8842222 111 1122344544444455567777877777777777754444453 345532 5555643222 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCH--HHHHHHHHHHHHhhhcCCCCCCCccccccccccccC Q lcl|NC_019530. 464 IREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSD--EEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVS 541 (560) Q Consensus 464 f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tD--eEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~ 541 (560) -.+. +-+++++.+ +| .+|.+++++.+-..+| +|+++++++-+++.+.- ++ ......+....+.. T Consensus 411 ~~~~-------~~a~~~~kl---~G-iiS~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~--~~-~~~~~~d~~~~~~~ 476 (483) T protein:vir:12 411 ANTE-------LQVQTAQQS---MG-IVSHETVLENHPFVEDLQAELERIEQEQMEYNKQL--PN-LDDGGADGAQQQER 476 (483) T ss_pred CCHH-------HHHHHHHHH---hc-cCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhc--cc-ccccccCCcccCCC Confidence 1121 224445555 35 4899999987433332 45544444433333211 11 11111111112222 Q ss_pred CCCCCcc Q lcl|NC_019530. 542 PLKFKPE 548 (560) Q Consensus 542 ~~~~~p~ 548 (560) +..-..| T Consensus 477 ~~~~e~e 483 (483) T protein:vir:12 477 SNNKESE 483 (483) T ss_pred CCcccCC Confidence 2222333 No 114 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=96.67 E-value=0.00042 Score=39.07 Aligned_cols=454 Identities=11% Similarity=0.050 Sum_probs=169.6 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEec--cccceeccccccc----hhhHH Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQG--GANTYNYVGVESE----LLSVK 74 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~--~~~~~~y~~~e~~----~~~~~ 74 (560) |. -|--|..+++.. .. -...+.. ++.+ .|...+-. ..-.. T Consensus 1 ~~-----------~~~~~~~~~~~~---~~-------------------~~~~~~~~~n~~~-~~~~~e~~~~~~~~~i~ 46 (511) T protein:vir:96 1 ML-----------KVNEFETDTDLR---GN-------------------INYLFNDEANVVY-TYDGTESDLLQNVNEVS 46 (511) T ss_pred Cc-----------cccchhhhhhhh---hh-------------------hhhhhhhhhCCcc-ccchhhhhhhccHHHHH Confidence 00 000011111100 00 0001111 1111 11111111 11122 Q ss_pred HHHH--------HHHHHhhccchhHHH----------------------HHhhcceeeecCCCceEEEeeecccchhHHH Q lcl|NC_019530. 75 TVVE--------EYQSMAQQPEIRKAV----------------------DIIVNDVVTCEEDETPVTVNLDKVEGISDTV 124 (560) Q Consensus 75 ~LI~--------~YR~ma~~PEvd~AI----------------------~eIvneaiv~d~~~~~V~l~Ld~~~~~s~si 124 (560) .+|+ +|+.+..+.+=+..| .-||+ ..+.=--..||++..++. T Consensus 47 ~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~-~~~~yl~g~p~~~~~~~~------- 118 (511) T protein:vir:96 47 KYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISD-FINGYFLGNPIQYQDDDK------- 118 (511) T ss_pred HHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcceeecchHHHHHH-HHHhhhccCCceeecCch------- Confidence 2332 455554433322211 11111 111111245555544332 Q ss_pred HHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCC-cccccccc Q lcl|NC_019530. 125 KESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEAIE 203 (560) Q Consensus 125 k~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~~~ 203 (560) ...+.+..+++..+|+....++.+...+.|+-|.+.-+|. +|-..+..+||+.+-+|..-....+ ...+.... T Consensus 119 --~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~de----d~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~ 192 (511) T protein:vir:96 119 --DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ----DDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLR 192 (511) T ss_pred --HHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCC----CCceEEEEEccceeEEEEcCCCCCceEEEEEEEE Confidence 3456678888888999999999999999999888766543 4668899999999988753111111 01111110 Q ss_pred ---------ccchh-eeecCccccccccccccccccccccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHH Q lcl|NC_019530. 204 ---------SVTLK-YYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNL 273 (560) Q Consensus 204 ---------~~~e~-~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL 273 (560) ....+ -+|++..-+......+..............-+-..|-.++. +++....|-++..+.....+ T Consensus 193 ~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~----~nn~~g~gd~e~v~~liDa~ 268 (511) T protein:vir:96 193 TKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEF----SNNERRKGDYEKVITLIDLY 268 (511) T ss_pred eeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCCceeeEEe----cCCCCCCCchhhhHHHHHHH Confidence 00011 13555433322211111000000000000000011111111 12223345566555554443 Q ss_pred HHHH-HHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCccc-CcccchhhHhhhc---cccc Q lcl|NC_019530. 274 VTTE-DATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKIT-GNAHLMGIAEDYW---LPRR 348 (560) Q Consensus 274 ~m~E-DalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~-d~~~~msmlEDyw---LpRR 348 (560) ..+- +....-+-++.|-+-+.= .+.. .++++. +....+-.+.... .... T Consensus 269 d~~~S~~~~~~~~~~~~~lv~~g----~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 322 (511) T protein:vir:96 269 DNAESDTANYMSDLNDAMLLIKG----NLNL----------------------DPVEVRKQKEANVLFLEPTVYADSEGR 322 (511) T ss_pred HHHHHHHHHHHHHhhCceeeeec----CccC----------------------Cchhhcccccccceecccccccccccc Confidence 3322 222222334444333221 1000 000000 0000110111100 0011 Q ss_pred CCCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 349 EGQNATEIATVGGGNQLGEM-DHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFF 427 (560) Q Consensus 349 eGgrgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if 427 (560) ..+.|..+..|-...+...+ .-+.-+.+.+|.-.++|---. ++|. |...+..|.--...-..-+.+.++.|...+ T Consensus 323 ~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~---~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l 398 (511) T protein:vir:96 323 ETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKD---DNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGL 398 (511) T ss_pred cCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccc---cccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12224456666554444433 344566788888888885322 2211 122233444333334444555555555555 Q ss_pred HHHHHHHHHhccC----CChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCC Q lcl|NC_019530. 428 MEFLRRQLILKGV----TDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRM 503 (560) Q Consensus 428 ~d~Lk~qLiLkgi----it~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~ 503 (560) ...++.-+-+-++ -.+.+|. .|.+.|... --.- +.+.++++..+ .| .+|.+++++. |.. T Consensus 399 ~~~~~li~~~~~~~~~~~~~~d~~-----~i~~~f~~~-~p~n------~~e~~~~~~kl---~G-~iS~et~l~~-l~~ 461 (511) T protein:vir:96 399 RRRAKLLETILKNTWSIDANKDFN-----TVRYVYNRN-LPKS------LIEELKAYIDS---GG-KISQTTLMSL-FSF 461 (511) T ss_pred HHHHHHHHHHHHhhcCcccccccc-----cceEEeCCC-CCCC------HHHHHHHHHHH---hc-cCChHHHHHh-CCC Confidence 5554442222122 2334443 366777532 2221 12334555555 35 4999999965 565 Q ss_pred CHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccccCCcc Q lcl|NC_019530. 504 SDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTD 557 (560) Q Consensus 504 tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~ 557 (560) +++ .+++.++|++|..+. .+.......++..+..... -.++--+-+.=.. T Consensus 462 v~D-~~~E~~ri~~E~~~~-~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 511 (511) T protein:vir:96 462 FQD-PELEVKKIEEDEKES-IKKAQKGIYKDPRDINDDE--QDDDTKDTVDKKE 511 (511) T ss_pred CCC-HHHHHHHHHHHHHHH-HHHHhhccccCCCCCCCCC--CCCcccccccccC Confidence 432 233444444443221 1111111111111110000 0000000000000 No 115 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=96.61 E-value=0.00047 Score=38.85 Aligned_cols=382 Identities=12% Similarity=0.040 Sum_probs=159.8 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) |+.|+- .. +..+.......++. ...+++- + +.-..+.|+. . +..+++|-|..||+ T Consensus 1 Mglf~~-~~------~~~~~~~~~~~~~~-~~~~~~~--~-~~~~~~~~v~-------~-------~~al~~~~V~~~i~ 55 (384) T protein:vir:49 1 MPIFNI-TN------LATESPPSNQDSFF-DITDPEF--L-DALNGSEWVS-------A-------ETALKNSDLFSIIS 55 (384) T ss_pred Cccccc-cc------cCcccccccchhhc-cccchhh--c-ccccCCceec-------h-------hhhhccHHHHHHHH Confidence 333321 00 00000000011111 0001110 0 0001111111 1 12357889999998 Q ss_pred HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCee Q lcl|NC_019530. 95 IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIK 174 (560) Q Consensus 95 eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~ 174 (560) -|.+.+.-++ +.+.... ...+.++=. ..+....=-+.++..+++.|.-|.-++.|.. .-++ T Consensus 56 ~Ia~~ia~l~-------~~~~~~~------~~~l~~~PN---~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~---g~~~ 116 (384) T protein:vir:49 56 QLSNDLATAK-------ITTSRKQ------LQGIVDNPS---NNANRFNFYQSIFAQMLLGGEAFAYRWRNEN---GRDM 116 (384) T ss_pred HHHHHHhhCc-------eeeecch------hhhhhhccC---CCCCHHHHHHHHHHHhhhcCCeEEEEEECCC---CcEE Confidence 8877654332 2222211 111111111 1112222334566677888988886665543 2378 Q ss_pred EeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeecccccC Q lcl|NC_019530. 175 KLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPL 254 (560) Q Consensus 175 eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~ 254 (560) +|.+|+|-.+..++ .. +++ ..+|.|... +. ..+....++.+.|.|... ... T Consensus 117 ~L~~l~~~~v~v~~---~~--~~~-------~~~y~~~~~-------~~--------~~~~~~~~~~~eVih~~~--~~~ 167 (384) T protein:vir:49 117 KWEYLRPSQVSFNR---LD--NQN-------GLYYNITFD-------DP--------RIPPKQHVPQGDILHFRL--LSV 167 (384) T ss_pred EEEEEcCceeEEEE---cC--CCc-------eEEEEEEec-------Cc--------cccceeEecCccEEEecC--CCC Confidence 99999998887642 11 121 123333311 10 112345677888887762 233 Q ss_pred CCC-cchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCc Q lcl|NC_019530. 255 ANG-IVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGN 333 (560) Q Consensus 255 ~~~-~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~ 333 (560) +++ ..+|.|..|+..++....++....=+--.-+--+-+..++.+..+..++.+ ...++... ...|.+ T Consensus 168 ~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~----~~~~~~~~----~n~~~~--- 236 (384) T protein:vir:49 168 DGGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKTKQ----SRSRQAMK----QMQGGP--- 236 (384) T ss_pred CCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHH----HHHHHhcc----cCCccc--- Confidence 332 356899999999998888888776544444556677777766555544433 22222211 122221 Q ss_pred ccchhhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHH Q lcl|NC_019530. 334 AHLMGIAEDYWLPRREGQNATEIATVGGG-NQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRF 412 (560) Q Consensus 334 ~~~msmlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF 412 (560) + +=.+ |.++..|.-. ..+.-++-.++..+.+.++++||.+.|...++-.... +.+ |-.+ T Consensus 237 ---~--------vl~~---g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~---~~~---~~~~ 296 (384) T protein:vir:49 237 ---L--------VLDD---LEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDKQSSL---EMI---YNIY 296 (384) T ss_pred ---e--------ecCC---CceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccH---HHH---HHHH Confidence 1 1122 4566666422 2222245567888999999999999997654322222 222 1123 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhh Q lcl|NC_019530. 413 SKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFS 492 (560) Q Consensus 413 ~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S 492 (560) ..|+.-.-.-+..-+...|-+.|.+. +....+.+ ...+. | .+.++-.+.+ ..|.++.+.+-.. | +.+ T Consensus 297 ~~~i~~~l~pi~~~i~~~l~~~l~~~-~~~~~~~~---~~~~~--~----~~~~l~~~~~-~t~~e~~~~l~~~-g-~~~ 363 (384) T protein:vir:49 297 FKAVSRFLRPFVSELSKKLSCEVDAD-ILPAVDPT---GSNYI--G----LINSMVKTGT-LAQNQGLYVLQQA-E-ILP 363 (384) T ss_pred HHHHHHHHHHHHHHHHHHhchhhhhh-hhhhhhcc---chHHH--H----HHHHHhhcCc-ccHHHHHHHHhhC-C-CCC Confidence 33433322222222222222211110 00000000 00000 0 1122221111 1222222222111 1 223 Q ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCC Q lcl|NC_019530. 493 IDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLY 524 (560) Q Consensus 493 ~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~ 524 (560) .+ ++ +.+.+... +-.-.+-.| T Consensus 364 ne-~r-~~~~~~p~---------~gGd~~~~~ 384 (384) T protein:vir:49 364 KD-LP-EGETDSTL---------KGGETNEQY 384 (384) T ss_pred hh-HH-HHcCCCCC---------CCCCCCCCC Confidence 22 22 22333221 111111122 No 116 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=96.47 E-value=0.00059 Score=38.28 Aligned_cols=334 Identities=9% Similarity=0.107 Sum_probs=151.6 Q ss_pred CceEEEeeecccchhHHHHHHHHHHHHHHHHH-hCCchhhHHH----HHHhhhhhhhhhhheecCCCCccCeeEeEEeCc Q lcl|NC_019530. 107 ETPVTVNLDKVEGISDTVKESITECFKEVMHL-MDFDNTAYQK----IRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDS 181 (560) Q Consensus 107 ~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l-l~f~~~~~~~----~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDP 181 (560) -.-+.+.|-.... .+......+|+. =+-...+.++ +..|.++|.-|..++.|.. .-+++|.+|+| T Consensus 1 ia~lp~~~~~~~~-------~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~---G~~~~L~~l~~ 70 (348) T protein:vir:93 1 MASLPLKMYEDYK-------VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY---HQPSKLFLLNP 70 (348) T ss_pred CcccceEeEecCc-------CcccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC---CcEEEEEEEcC Confidence 2222333322111 111122233321 1222344444 4456677887776665442 33789999999 Q ss_pred cccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeecccccCCCCcchh Q lcl|NC_019530. 182 RCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLANGIVPG 261 (560) Q Consensus 182 r~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~~~i~s 261 (560) ..+..+. +..++ ..+|.|.. ..++.+.++.+.|.|.. +....+.-+..| T Consensus 71 ~~v~~~~-----~~~~~-------~~~y~~~~------------------~~g~~~~~~~~eiih~r-~~~~~~~~~G~s 119 (348) T protein:vir:93 71 DVVEMLI-----ENQSR-------ELYYSIHA------------------ATGNKLIVHNMDMLHFK-HIVASNMVQGIS 119 (348) T ss_pred CceEEEE-----eCCCc-------EEEEEEEc------------------CCCeEEEEccccEEEec-CCCCCCceeecc Confidence 8887642 11121 11222321 11235568888887773 111223333457 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHh Q lcl|NC_019530. 262 LLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAE 341 (560) Q Consensus 262 yL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlE 341 (560) .+..|...+.....++... +....+.| . +...-.+++-+.++++..+.+...|. ..|.+ + . T Consensus 120 ~~~~~~~~i~~~~~~~~~~-~~~~~~~~-~-~i~~~~~~l~~e~~~~~~~~~~~~~~-------n~~~~------~-v-- 180 (348) T protein:vir:93 120 PIDVLKNTTDFDNAVRTFN-LTEMQKPD-S-FMLKYGSNVSTEKRQQVLEDFKQYYE-------ENGGI------L-F-- 180 (348) T ss_pred HHHHHHHHHHHHHHHHHHH-HHhcCCCc-e-eEEecCCCCCHHHHHHHHHHHHHHhh-------cCCCe------e-e-- Confidence 7888888777666666653 33433333 2 33333455666665555444433332 22321 1 1 Q ss_pred hhcccccCCCCccceeecCCCCCcchHHH---HHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHH-H Q lcl|NC_019530. 342 DYWLPRREGQNATEIATVGGGNQLGEMDH---VNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFC-A 417 (560) Q Consensus 342 DywLpRReGgrgTEIsTLpGg~nLgei~D---V~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi-~ 417 (560) |+ .|.+++.|. .+.-+++= -+|..+.+.++++||...|...++.+ .+++.+..+. |.+++ . T Consensus 181 ---l~-----~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~--~~~~e~~~~~---~~~~~l~ 245 (348) T protein:vir:93 181 ---QE-----PGVEIEPLP--KKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTN--FAKNEELNRF---YLQHTLL 245 (348) T ss_pred ---cC-----CCceEEEcC--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHHHH---HHHHHHH Confidence 11 356677664 34444433 34678889999999999997655433 2233333332 44433 2 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHH Q lcl|NC_019530. 418 GLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVM 497 (560) Q Consensus 418 rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~ 497 (560) -+-++ +.+. +-..++++.+|.. ..+|+|+ .+ ++.... +..|+++++.+-. .-+++.+-++ T Consensus 246 P~~~~----ie~~-----l~~~l~~~~~~~~--g~~i~fd--~~----~l~~~d-~~~~a~~~~~~~~--~G~~T~NE~R 305 (348) T protein:vir:93 246 PIVKQ----YEEE-----FNRKLLTKTDREK--NRYFKFN--VK----SYLRAD-SATQAEVYFKAVR--SGYYTINDIR 305 (348) T ss_pred HHHHH----HHHH-----HHHhhCCcccccC--cceEEee--ch----hhhccC-HHHHHHHHHHHHh--CCCCCHHHHH Confidence 22111 1111 2335567777642 3334443 22 222211 3455555544422 1355555555 Q ss_pred HHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccccc--ccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 498 RNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYS--GSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 498 k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~--~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) . .|.+. |-|. |+.. .....+....++.-+..+|-|.+- T Consensus 306 ~-~~g~~--------------------p~~g----gD~~~~~~n~~~~~~~~~~~~~~~gg~~n~ 345 (348) T protein:vir:93 306 E-WEDLP--------------------PVEG----GDKPLISGDLYPIDTPLELRKSLKGGDKNV 345 (348) T ss_pred H-HhCCC--------------------CCCC----cCeEeecccccccccchhhcccccCCCCCc Confidence 3 23432 2222 2221 111222222222222222222222 No 117 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=96.33 E-value=0.00074 Score=37.76 Aligned_cols=401 Identities=15% Similarity=0.144 Sum_probs=179.8 Q ss_pred hhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHHHhhccee Q lcl|NC_019530. 22 VDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVV 101 (560) Q Consensus 22 ~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneai 101 (560) ...+++-++..+.. +..+....+ .+..+....+...+ +. ....++|-|.+||+-|.+.+. T Consensus 1 ~~f~~~f~r~~~~~-----~~~~~~~~~---~~~~~~~~~~g~~v-----~~-------~~~l~~~~v~~~i~~Ia~~iA 60 (413) T protein:vir:48 1 MFFSGLFQRKSDAP-----VTTPAELAE---AIGLSYDTYTGKRI-----SS-------QRAMRLTAVYSCVRVLAESVG 60 (413) T ss_pred CccchhhccCccCC-----ccchHHHHH---hhhcCcccccCcee-----ch-------hhhhccHHHHHHHHHHHHhhh Confidence 33333322211111 111111111 01111111111111 11 223568899999988887654 Q ss_pred eecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHH-----hCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEe Q lcl|NC_019530. 102 TCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHL-----MDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKL 176 (560) Q Consensus 102 v~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l-----l~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eL 176 (560) -+ | +.+-..++ +..++.......++|+. +....--+.++..+++.|.-|..++.+ ...+.+| T Consensus 61 ~~-----p--~~~~~~~~--~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~----~g~~~~L 127 (413) T protein:vir:48 61 ML-----P--CSLYKISG--TLKTRVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA----LGEVVEL 127 (413) T ss_pred hC-----c--eEEEEecC--CcceeecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC----CCcEEEE Confidence 22 2 22211110 00011111122233321 222233455666677788777654432 3458999 Q ss_pred EEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeecccccCCC Q lcl|NC_019530. 177 VMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLAN 256 (560) Q Consensus 177 r~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~ 256 (560) ++|+|.+|.... +..+. .+|.|. .+.+....++.+.|.|... ...+. T Consensus 128 ~~l~~~~v~~~~-----~~~~~--------~~y~~~------------------~~~g~~~~~~~~evih~~~--~~~d~ 174 (413) T protein:vir:48 128 LPIDPGCVEPKL-----NSQWQ--------PVYQVT------------------FPDGSVDVLTQDEIWHVRT--LTLDG 174 (413) T ss_pred EEEcCceEEEEE-----cCCce--------EEEEEE------------------ecCceEEEEccccEEEecC--cCCCC Confidence 999998876531 11121 112221 0112234577888887763 24455 Q ss_pred CcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccc Q lcl|NC_019530. 257 GIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHL 336 (560) Q Consensus 257 ~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~ 336 (560) -+..|-|+.|.+++.....+++...-+---.+.-+-++.++ +.+.+..+++-.+.+...|+.- ...|. . T Consensus 175 ~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~~~~e~~~~~~~~~~~~~~g~----~n~g~------~ 243 (413) T protein:vir:48 175 LVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTE-QKLTPDAYERLKKDFEERHTGL----GNAHR------P 243 (413) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeC-CCCCHHHHHHHHHHHHHHhcCc----cccCc------c Confidence 56778999999999888888877765544456667888887 5677766665555555444321 11222 1 Q ss_pred hhhHhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHH Q lcl|NC_019530. 337 MGIAEDYWLPRREGQNATEIATVGGGNQLGE---MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFS 413 (560) Q Consensus 337 msmlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~ 413 (560) | .++ .|.++..|. .+..+ ++-.++....+.++++||..-|+..++.+ .++..+. .+.|. T Consensus 244 ~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~~n~e~~---~~~f~ 305 (413) T protein:vir:48 244 M-ILE----------MGLDWKSMA--LNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRAT--FNNIEEL---GLGFI 305 (413) T ss_pred e-ecC----------CCceEEecc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCC--cccHHHH---HHHHH Confidence 1 121 356666663 33333 44556788999999999999997543322 2222222 22254 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhH Q lcl|NC_019530. 414 KFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSI 493 (560) Q Consensus 414 Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~ 493 (560) ++ .|+- +...+.+. +-++++++.++. ..+|.|+ .+ ++... -+..|++.++.+- -+-+++. T Consensus 306 ~~--~i~P-~~~~ie~~-----l~~~L~~~~~~~---~~~~~fd--~~----~l~~~-d~~~~~~~~~~~~--~~g~~T~ 365 (413) T protein:vir:48 306 NY--SLVP-YLTRIEQR-----INTGLVRESKQG---KFYAKFN--AG----ALLRG-DMKSRFEAYATGI--NWGIYSP 365 (413) T ss_pred HH--HHHH-HHHHHHHH-----HHhhccCccccC---CeEEEEe--ch----hhhcc-CHHHHHHHHHHHH--hCCCcCH Confidence 43 2321 11112222 334556666652 2234433 22 33222 1234444444321 1235666 Q ss_pred HHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccc---cccccccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 494 DYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADE---TGNYSGSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 494 ~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e---~g~~~~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) +-++. ++++.+- |..+. +.+..+.+..+.+-.++. -+|-.|.+- T Consensus 366 NE~R~-~~g~~p~--------------------~ggD~~~~~~n~~~~~~~~~~~~~~~--~~~~~~~~~ 412 (413) T protein:vir:48 366 NDCRD-LEDMNPR--------------------PGGDVYLTPMNMTTSPSAGDDNGKKK--ESGDADKTA 412 (413) T ss_pred HHHHH-HhCCCCC--------------------CCcceeeccccccccccccccCCCCC--CCCCccccC Confidence 66663 3555321 11111 011000000000000000 011111111 No 118 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=96.31 E-value=0.00076 Score=37.68 Aligned_cols=397 Identities=11% Similarity=0.064 Sum_probs=158.3 Q ss_pred cccccchhhHHHH-------HHHHHHHhhccchhHH-----------------------------------HHHhhccee Q lcl|NC_019530. 64 VGVESELLSVKTV-------VEEYQSMAQQPEIRKA-----------------------------------VDIIVNDVV 101 (560) Q Consensus 64 ~~~e~~~~~~~~L-------I~~YR~ma~~PEvd~A-----------------------------------I~eIvneai 101 (560) .+++.-.+-...+ +.+|..+..+.+-... ...||+-.+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 80 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKK 80 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhh Confidence 2222211111111 2223333333322211 112222111 Q ss_pred eecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCc Q lcl|NC_019530. 102 TCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDS 181 (560) Q Consensus 102 v~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDP 181 (560) -+ --..||++..++. ...+.++.+.+ .+|+....++.+.+++.|+-|.+.-+|.+ +|=..+..+|| T Consensus 81 ~y-l~G~p~~~~~~~~---------~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~---~g~~~~~~~~p 146 (471) T protein:vir:10 81 AY-ALTYPPTFDVDDK---------KVNDMIVDVLG-DDYERISKQLCVNAGNAGIAWLHVWKDAS---DNSFRYACVDS 146 (471) T ss_pred hh-hcccCceeccCCh---------HHHHHHHHHHh-cCHHHHHHHHHHHHhhCCeEEEEEEeeCC---CCeeEEEEEcc Confidence 11 1235556554442 23344555555 57999999999999999999998888764 45577889999 Q ss_pred cccchhhhhcccCC-ccccccc--------cccchheeecCccccccccccccc-ccccccc----------ccceec-- Q lcl|NC_019530. 182 RCIRPVYIVEKAMR-EGGIEAI--------ESVTLKYYYNPNYNRNQFTGQSGT-SQNFQPS----------QQELVF-- 239 (560) Q Consensus 182 r~i~~vr~i~~~~~-~g~~~~~--------~~~~e~~~Y~p~~~~~~~~g~~~~-~~~~~~~----------~~~~~i-- 239 (560) +.+-+|..=....+ ...+... +.+.-+-+|.+..-+. +.+.... ...+... +..... T Consensus 147 ~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~-y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (471) T protein:vir:10 147 KEVIPIYSKSLDKKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSF-YRHEKEKPLEELETFQAISLIDTMNGDRSSDNS 225 (471) T ss_pred cceEEEEcCCCCCceEEEEEEEEeeccCCCceeEEEEEEeCCcEEE-EEecCCccccccccccccccccccccccccccc Confidence 88766531110000 0011110 0011111222211111 1100000 0000000 000000 Q ss_pred ---chhceeeeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHH Q lcl|NC_019530. 240 ---DDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMG 315 (560) Q Consensus 240 ---~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~ 315 (560) .-..|-+++. .++....|-|+..+.....+. ++=+..-.-+-+..|-+-+.-.+.-.++ +.+.. +. T Consensus 226 ~~~~~g~iPvv~~----~n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~-----~~~~~-~~ 295 (471) T protein:vir:10 226 FKHDFGLVPFIPF----KNNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQ-----EFLED-LK 295 (471) T ss_pred ccCCCCceeEEEe----ccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccc-----hhHHH-hh Confidence 0001111111 022223345554444433333 1222222334444553333222211111 11111 11 Q ss_pred hhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCc Q lcl|NC_019530. 316 KFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGS 394 (560) Q Consensus 316 k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~ 394 (560) + .+...=..+| .+.|-.++.|--..+... -.-++-..+.+|...++|- +..++. T Consensus 296 ~--~~~i~~~~~~---------------------~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~--~~~~~~ 350 (471) T protein:vir:10 296 R--YKMIKMDNDG---------------------MGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVN--PETDKL 350 (471) T ss_pred c--CCeEEecCCC---------------------CccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcC--CCcccc Confidence 1 1111111111 122233444443333332 2445667788888999883 222222 Q ss_pred ccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHH Q lcl|NC_019530. 395 INIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILN 474 (560) Q Consensus 395 ~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~ 474 (560) |...+..|.--..--..-+.+.|+.|...+...++.-+-+-|. .+|. .|.+.|...---.+... T Consensus 351 ---gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~---~d~~-----~i~i~f~~~~p~n~~e~----- 414 (471) T protein:vir:10 351 ---GNSSGVALKFLYSLLELKAGNMETQFRSGYATLVKMILKHLGL---SDKL-----KIKQTWTRNSINNDTEM----- 414 (471) T ss_pred ---cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc---CCCc-----eeEEEeCCCCCCCHHHH----- Confidence 2222233422222233346777777777776666533322233 2442 25566654433333222 Q ss_pred HHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcC-----CCCCCCcccccc Q lcl|NC_019530. 475 DRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKG-----LYPKVQADETGN 534 (560) Q Consensus 475 ~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~-----~~~~p~~~e~g~ 534 (560) +++++.+ .| .+|.+++++.+-..+|. +++.++|++|...- .+....++++-+ T Consensus 415 --~~~~~kl---~g-~iS~et~~~~~p~v~D~--~~E~eri~~E~~~~~~~~~~~~~~~~~~e~~ 471 (471) T protein:vir:10 415 --AQVVSTL---AT-ITSRENVAKSNPIVEDW--QDELRLQKAEQEGRSEKLYDMEEVEHESEVE 471 (471) T ss_pred --HHHHHHH---hc-cCchHHHHHhCCCCCCH--HHHHHHHHHHHHHHHhcccccCCCCCccccC Confidence 2233333 35 49999999886555543 23345555554221 111111111111 No 119 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=96.31 E-value=0.00076 Score=37.68 Aligned_cols=372 Identities=10% Similarity=0.059 Sum_probs=163.2 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) |+.|+..... +..+ .++........+..+... .... +. +..+++|-|.+||+ T Consensus 1 Mg~f~~~~~~---------~~~~----------~~~~~~~~~~~~~~~~~~-~~~v-~~-------~~~l~~~~v~~~i~ 52 (382) T protein:vir:48 1 MPIFNLATES---------PPDN----------QGGFFDVVDSDFLASLKG-NEWV-SA-------ETALRNSDLFSIIN 52 (382) T ss_pred CccccccccC---------Cccc----------ccccccchhhhccccccC-Cccc-ch-------HhhhccHHHHHHHH Confidence 3333221110 0000 011111111111000000 0000 11 12257899999998 Q ss_pred HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCee Q lcl|NC_019530. 95 IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIK 174 (560) Q Consensus 95 eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~ 174 (560) -|.+.+.-++ +.+.... .+.+.++-... +....-.+.++..+++.|.-|..++.|.+ .-+. T Consensus 53 ~ia~~ia~~~-------~~~~~~~------~~~L~~~PN~~---~t~~~f~~~l~~~l~l~Gna~~~i~rd~~---G~~~ 113 (382) T protein:vir:48 53 QLSNDLATVK-------LITSRKK------LQGIVDNPSNN---ANRFNFYQSIFAQMLLGGEAFAYRWRNEN---GRDM 113 (382) T ss_pred HHHHhhccCc-------eeeecch------hhhhhhhcCCC---CCHHHHHHHHHHHhhhcCCEEEEEEECCC---CcEE Confidence 8887664332 2222221 11222222211 11122233455567788988887776542 3378 Q ss_pred EeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeecccccC Q lcl|NC_019530. 175 KLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPL 254 (560) Q Consensus 175 eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~ 254 (560) +|++|+|..+..++ .. +++ ..+|.|... +. ..+....++.+.|.|... ... T Consensus 114 ~l~~i~~~~v~v~~---~~--~~~-------~~~y~~~~~-------~~--------~~~~~~~~~~~evih~~~--~~~ 164 (382) T protein:vir:48 114 KWEYLRPSQVSFNR---LD--NKD-------GIYYNITFD-------DP--------RIPPKQHVPQNDVLHFRL--LSV 164 (382) T ss_pred EEEEEcCceeEEEE---cC--CCC-------eEEEEEEec-------Cc--------cccceeEEcCccEEEecC--CCC Confidence 99999998887643 11 121 112333211 10 112345677777777752 233 Q ss_pred CC-CcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCc Q lcl|NC_019530. 255 AN-GIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGN 333 (560) Q Consensus 255 ~~-~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~ 333 (560) ++ -...|.|..|++++.....++....=+----+--+-+..++-. +.+..+++..+..-..+ ...|.+ T Consensus 165 ~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~-~~~e~~~~~~~~~~~~~-------~n~g~~--- 233 (382) T protein:vir:48 165 DGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGG-GLLDFKTKLSRSRQAMK-------QMQGGP--- 233 (382) T ss_pred CCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC-CChHHHHHHHHHHHhhc-------cCCCCe--- Confidence 33 2456889999999999999988877666666777888888754 44445544444322211 112322 Q ss_pred ccchhhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHH Q lcl|NC_019530. 334 AHLMGIAEDYWLPRREGQNATEIATVGGG-NQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRF 412 (560) Q Consensus 334 ~~~msmlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF 412 (560) + .++ + |.+++.|.-- ..+.-++-.+|..+.+.++++||...|+..+.. ++.++-.+. | T Consensus 234 ---~-vl~-------~---g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~~----~~~~~~~~~---~ 292 (382) T protein:vir:48 234 ---L-VLD-------D---LEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGDQ----QSSLEMSSD---L 292 (382) T ss_pred ---e-EcC-------C---CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc----ccHHHHHHH---H Confidence 1 121 2 4455555421 222234556788899999999999999643321 112222333 4 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchh Q lcl|NC_019530. 413 SKF-CAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIF 491 (560) Q Consensus 413 ~Kf-i~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~ 491 (560) .++ |.-+-+++. +.|...| +++.+.+. .. . +..|.+ -...+++.+-+ +-.+ T Consensus 293 ~~~~l~p~~~~i~----~~l~~~l-----~~~~~~~~--~~--~--~~~~~~--------~~~~~~~~l~~-----~g~~ 344 (382) T protein:vir:48 293 YSKAVSRYLRPFL----SELSQKL-----SCDVDADI--FP--A--VDPTGS--------NYISRINSLVK-----TGTL 344 (382) T ss_pred HHHHHHHHHHHHH----HHHHHHh-----cChhhhhh--hh--h--hccchh--------HHHHHHHHHhh-----cCcc Confidence 332 222222222 2222222 22222211 10 1 111111 11122222211 1234 Q ss_pred hHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCc-------ccccccccccc Q lcl|NC_019530. 492 SIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQA-------DETGNYSGSDV 540 (560) Q Consensus 492 S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~-------~e~g~~~~~~~ 540 (560) +..-++ ++|. ..|..+++.. ..+|++..+.+ T Consensus 345 t~~e~r-~~l~-----------------~~g~~~~~~~~~~~~~~~~~GGd~~~~~ 382 (382) T protein:vir:48 345 AQNQGL-YILQ-----------------QAEILPKELPNGENPNSTLKGGEEDGQD 382 (382) T ss_pred CHHHHH-HHHh-----------------hCCCCCcchhhhhcCCCCCCCCCCCCCC Confidence 444443 2221 1222222111 11222221111 No 120 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=96.29 E-value=0.00078 Score=37.62 Aligned_cols=442 Identities=10% Similarity=0.044 Sum_probs=195.2 Q ss_pred hhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhh--ccch Q lcl|NC_019530. 12 FGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQ--QPEI 89 (560) Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~--~PEv 89 (560) .|....+++++.--. .+-..+...+++.-+ .+.. ......-|+..+.|.. +|++ T Consensus 1 m~~~~~~k~~~~k~~--~~~~~~~~~~i~~~~---------------------~i~~-~~~~~~~i~~~~~~y~g~~~~~ 56 (522) T protein:vir:47 1 MSLFQKVKDFFSRGR--YYMQTSNLNSILEHP---------------------KIAV-TQEEYDRIKRNLVYYQSKWDDV 56 (522) T ss_pred CchHHHHHHHHHHHH--HHhhcccchhccccC---------------------CCCC-CHHHHHHHHHHHHHhcCCcccc Confidence 444445555544211 111111111111000 0000 0122223334444422 1221 Q ss_pred h-------------------HHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHH Q lcl|NC_019530. 90 R-------------------KAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIR 150 (560) Q Consensus 90 d-------------------~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R 150 (560) . .+|-.-..+.++. .++++.+++ +...+..+.+++-.+|.....+.+- T Consensus 57 ~~~~~~~~~~~~~~~slnl~~~i~~~~A~lv~~----e~~~i~v~d---------~~~~~~l~~~l~~n~f~~~~~~~~e 123 (522) T protein:vir:47 57 QYKNTDGDIKSRPMNHLPIARTASKKIASLVYN----EQATITTKN---------EILQKFLDDMLTNDRFNKNFERYLE 123 (522) T ss_pred cccccCcchhcccceecchHHHHHHHHhhhhcC----CcceeecCC---------hHHHHHHHHHHhhcchHHHHHHHHH Confidence 0 0010001111111 233444433 2566777888888899999999999 Q ss_pred HhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccc---cccc----------c--chheeec--- Q lcl|NC_019530. 151 KWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIE---AIES----------V--TLKYYYN--- 212 (560) Q Consensus 151 ~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~---~~~~----------~--~e~~~Y~--- 212 (560) .+..-|-.+|...+|. |-..+.+++|-++-|++ ++..+.++ +... + -||-.|. T Consensus 124 ~a~a~G~~a~k~~~d~-----~~~~i~~v~ad~~~P~~----~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~ 194 (522) T protein:vir:47 124 SCLALGGLAMRPYIDG-----DKVRVAFIQAPVFFPLE----SNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTAD 194 (522) T ss_pred HhhccCCEEEEEEEcC-----CceEEEEEcCCceEEEE----EcCCceEEEEEEEEEEeecccceeEEEEEEEeeecccc Confidence 9999999999999974 23458888988877752 11111111 1100 0 0111010 Q ss_pred -----------Ccccc-ccccccccccccccccccce----ecchhceee--------eec--c---cccCCCCcchhhh Q lcl|NC_019530. 213 -----------PNYNR-NQFTGQSGTSQNFQPSQQEL----VFDDESIVY--------IDS--G---EEPLANGIVPGLL 263 (560) Q Consensus 213 -----------p~~~~-~~~~g~~~~~~~~~~~~~~~----~i~~daI~y--------~hs--G---l~~~~~~~i~syL 263 (560) |..=. ..|-+.++.+....-+-+.+ .++ +.+++ ++. . -.+......+|-+ T Consensus 195 ~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~-~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~ 273 (522) T protein:vir:47 195 GQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLSELDKYKNLE-PVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIF 273 (522) T ss_pred cccccccccCCceEEEEEEeecCCCcccCccccccccccccCCC-CceEeCCCCcceEEEecCCcccccccCCCcCCchh Confidence 00000 00000000000000000000 011 11111 110 0 0011223345778 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhh Q lcl|NC_019530. 264 NPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDY 343 (560) Q Consensus 264 ~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDy 343 (560) +.|+-.+--|-..=+ .+.|.+|.=++|||. |-.=++ ..-+..+|+...-..+. --+.+ T Consensus 274 ~~~~~~id~lD~~~s--~~~~e~~~g~~~i~v-~~~~l~------------------~~~~~~~g~~~~~~~fd-~~~~~ 331 (522) T protein:vir:47 274 DNAKTTIDFINRSYD--EFMWEVRMGQRRVIV-PEHLTQ------------------RQYQRPDGTIDFRPRFD-VEQNV 331 (522) T ss_pred hhhHHHHHHHHHHHH--HHHHHHHhccceeec-chHHhc------------------cCCCCCCcccccccccC-cccce Confidence 887777666654333 456778887788765 111110 11122233211100000 01112 Q ss_pred cccccCC-CCccceeecCCCCCcchHH-HHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHH Q lcl|NC_019530. 344 WLPRREG-QNATEIATVGGGNQLGEMD-HVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRR 421 (560) Q Consensus 344 wLpRReG-grgTEIsTLpGg~nLgei~-DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~ 421 (560) |.+-..+ +-|--|+++...---++.. =+..+.+.+=...+++-+-|..+++. -..|+||...+-.-..-+.+.|+ T Consensus 332 f~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~---~kTAtEi~s~~~~~~~t~~~~~~ 408 (522) T protein:vir:47 332 YMQIGGSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQG---MKTATEIVSENSDTYQMRSSIVA 408 (522) T ss_pred EeecCCCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccccc---cccHHHHHHHHHHHHHHHHHHHH Confidence 2211110 0111255554432222221 24555666666778888888776552 23478887666666666788888 Q ss_pred HHHHHHHHHHHHHHHhc---cCC---ChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHH Q lcl|NC_019530. 422 RYSHFFMEFLRRQLILK---GVT---DEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDY 495 (560) Q Consensus 422 rfs~if~d~Lk~qLiLk---gii---t~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~ 495 (560) .+...+.++++.-|-|- +.+ .++++ .|.++|- |+.+.. +++++-+.+ .+++ .| .+|.++ T Consensus 409 ~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~------~i~v~f~-D~i~~D-~~~~~~~~~-~~v~-----aG-~~s~e~ 473 (522) T protein:vir:47 409 LVEQSIKELCVSMCELGKAVGVYSGEIPELD------DISVNLD-DGVFTD-RHAELDYWA-KMVA-----AG-FSTKKR 473 (522) T ss_pred HHHHHHHHHHHHHHHHHhhhhhccCCCCCcc------eeEEEcC-CCCCCC-HHHHHHHHH-HHHh-----cC-CCCHHH Confidence 77777777777765332 122 22322 2556665 444443 222222211 1111 13 688888 Q ss_pred HHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccc----ccccccCC Q lcl|NC_019530. 496 VMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGN----YSGSDVSP 542 (560) Q Consensus 496 i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~----~~~~~~~~ 542 (560) .+++...+||+|.+++-++|++|.... .++..+-.|. +..+|..| T Consensus 474 ~i~~~~g~~eeea~~el~ri~~E~~~~--~~~~~~~~~~~~~~~~~~d~~~ 522 (522) T protein:vir:47 474 AIGKTLNISGVEAEKELNAINSELLPM--NDAELAIYGMHDQNEEKADDKG 522 (522) T ss_pred HHHhcCCCChHHHHHHHHHHHHhhccC--CCCCCCCCCCCCcccccCCCCC Confidence 766778999999999999999996432 1111111110 11112222 No 121 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=96.28 E-value=0.00079 Score=37.59 Aligned_cols=447 Identities=11% Similarity=0.041 Sum_probs=171.2 Q ss_pred hhhhcccccchhhhccccccccCCCCCCCCceeEecccc-ceeccccccc----hhhHHHHHH--------HHHHHhhcc Q lcl|NC_019530. 21 KVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGAN-TYNYVGVESE----LLSVKTVVE--------EYQSMAQQP 87 (560) Q Consensus 21 ~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~-~~~y~~~e~~----~~~~~~LI~--------~YR~ma~~P 87 (560) -++-.++ |.-.+.. .--...+..-.| ...|...+.. .....++|+ +|+.+..+. T Consensus 1 ~~~~~~~-----~~~~~~~--------~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy 67 (511) T protein:vir:78 1 MLKVNEF-----ETDTDLR--------GNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYY 67 (511) T ss_pred Cccccch-----hhhhhhh--------hhhhhhhhhhhCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHh Confidence 1111111 0000000 000001111000 1112222111 111223333 344444443 Q ss_pred chhHHHH----------------------HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhh Q lcl|NC_019530. 88 EIRKAVD----------------------IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTA 145 (560) Q Consensus 88 Evd~AI~----------------------eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~ 145 (560) +=...|- -||+ ..+.=--..||++..++. ...+.+..+++..+|+... T Consensus 68 ~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~-~~~~yl~g~p~~~~~~d~---------~~~~~l~~~~~~n~~~~~~ 137 (511) T protein:vir:78 68 EGKTKNLVELTRRKEEYMADNRVAHDYASYISD-FINGYFLGNPIQYQDDDK---------DVLEAIEAFNDLNDVESHN 137 (511) T ss_pred hccCccccccCcccccccCcceeecchHHHHHH-HHhhhhcccCceeecCch---------HHHHHHHHHHhhcChhHHH Confidence 3222210 1111 111111234555544332 3456778888888999999 Q ss_pred HHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCC-cccccccc-----c-----cchheeecCc Q lcl|NC_019530. 146 YQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEAIE-----S-----VTLKYYYNPN 214 (560) Q Consensus 146 ~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~~~-----~-----~~e~~~Y~p~ 214 (560) .++.+.+++.|+-|.+.-+|. +|-..+..+||+.+-+|..-....+ ..++.... . +.-.-+|++. T Consensus 138 ~~~~~~~~~~G~a~~~vy~d~----dg~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~ 213 (511) T protein:vir:78 138 RSLGLDLSIYGKAYELMIRNQ----DDETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSH 213 (511) T ss_pred HHHHHHHHhcCeeEEEEEeCC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCC Confidence 999999999999888766553 4668899999999988753111111 11111110 0 0011246655 Q ss_pred ccccccccccccccccccccc-----ceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHH-HHHHHHHHHHhcC Q lcl|NC_019530. 215 YNRNQFTGQSGTSQNFQPSQQ-----ELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVT-TEDATVIYAITRA 288 (560) Q Consensus 215 ~~~~~~~g~~~~~~~~~~~~~-----~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m-~EDalVIyRitRA 288 (560) .-+....+.+........-.. --.+|. +.|+. +....|-++..+.....+.. +-+....-+..+. T Consensus 214 ~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv--v~~~n-------~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~ 284 (511) T protein:vir:78 214 GVYRYLTNRTNGLKLTPRENSFESHSFERMPI--TEFSN-------NERRKGDYEKVITLIDLYDNAESDTANYMSDLND 284 (511) T ss_pred cEEEEEecCCCcccccccccccccCcCcccce--EEecC-------CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhc Confidence 433322221110000000000 001111 11211 12223445544444333332 1122222233344 Q ss_pred ccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCc-ccchhhHhh---hcccccCCCCccceeecCCCCC Q lcl|NC_019530. 289 PEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGN-AHLMGIAED---YWLPRREGQNATEIATVGGGNQ 364 (560) Q Consensus 289 PeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~-~~~msmlED---ywLpRReGgrgTEIsTLpGg~n 364 (560) |-+-+.=....+ .++++.. ...+-.+.. +.......+.|..+..|-...+ T Consensus 285 ~~lv~~G~~~~~--------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 338 (511) T protein:vir:78 285 AMLLIKGNLNLD--------------------------PVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYD 338 (511) T ss_pred chhheecCccCC--------------------------chhhcccccccceeccccceeccccccCCCCcceeEEeecCC Confidence 443332211000 0000000 000000000 0111112233445555554444 Q ss_pred cchH-HHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc----c Q lcl|NC_019530. 365 LGEM-DHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILK----G 439 (560) Q Consensus 365 Lgei-~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLk----g 439 (560) ...+ .-+.-+.+.+|.-.++|---.+.-+ |...+..|.--+..-..-+.+.++.|..-+...++.-+-+- + T Consensus 339 ~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~----~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~ 414 (511) T protein:vir:78 339 VQGTEAYKDRLNSDIHMFTNTPNMKDDNFS----GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRS 414 (511) T ss_pred HHHHHHHHHHHHHHHHHHhCCccccccccc----cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 4333 3455667788888888853222211 11223334433333344455566666666655555422222 2 Q ss_pred CCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhh Q lcl|NC_019530. 440 VTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEK 519 (560) Q Consensus 440 iit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~ 519 (560) +-.+.+|. .|.+.|...---.+ .+.++++..+ +| .+|.+++++. |..+++ .+++.++|++|. T Consensus 415 ~~~~~~~~-----~i~~~f~~~~p~n~-------~e~~d~~~kl---~G-~iS~et~l~~-l~~v~d-~~~El~ri~~E~ 476 (511) T protein:vir:78 415 IDANKDFN-----TVRYVYNRNLPKSL-------IEELKAYIDS---GG-KISQTTLMSL-FSFFQD-PELEVKKIEEDE 476 (511) T ss_pred Cccccccc-----cceEEeCCCCCcCH-------HHHHHHHHHH---hc-cCChHHHHHh-CCCCCC-HHHHHHHHHHHH Confidence 22344443 36677754322112 2344555555 35 4999999976 666542 334444555554 Q ss_pred hcCCCCCCCccccccccccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 520 KKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 520 ~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) ++.. +.......++....+.. .-.++ +...+. T Consensus 477 ~~~~-~~~~~~~~~~~~~~~~~--~~~~~------~~~~~~ 508 (511) T protein:vir:78 477 KESI-KKAQKGIYKDPRDINDD--EQDDD------TKDTVD 508 (511) T ss_pred HHHH-HHHhhccccCCCCCCCC--CCCCC------ccCccc Confidence 3211 11111111111111110 00011 111111 No 122 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=96.28 E-value=0.00079 Score=37.59 Aligned_cols=447 Identities=11% Similarity=0.041 Sum_probs=171.2 Q ss_pred hhhhcccccchhhhccccccccCCCCCCCCceeEecccc-ceeccccccc----hhhHHHHHH--------HHHHHhhcc Q lcl|NC_019530. 21 KVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGAN-TYNYVGVESE----LLSVKTVVE--------EYQSMAQQP 87 (560) Q Consensus 21 ~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~-~~~y~~~e~~----~~~~~~LI~--------~YR~ma~~P 87 (560) -++-.++ |.-.+.. .--...+..-.| ...|...+.. .....++|+ +|+.+..+. T Consensus 1 ~~~~~~~-----~~~~~~~--------~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy 67 (511) T protein:vir:96 1 MLKVNEF-----ETDTDLR--------GNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYY 67 (511) T ss_pred Cccccch-----hhhhhhh--------hhhhhhhhhhhCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHh Confidence 1111111 0000000 000001111000 1112222111 111223333 344444443 Q ss_pred chhHHHH----------------------HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhh Q lcl|NC_019530. 88 EIRKAVD----------------------IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTA 145 (560) Q Consensus 88 Evd~AI~----------------------eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~ 145 (560) +=...|- -||+ ..+.=--..||++..++. ...+.+..+++..+|+... T Consensus 68 ~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~-~~~~yl~g~p~~~~~~d~---------~~~~~l~~~~~~n~~~~~~ 137 (511) T protein:vir:96 68 EGKTKNLVELTRRKEEYMADNRVAHDYASYISD-FINGYFLGNPIQYQDDDK---------DVLEAIEAFNDLNDVESHN 137 (511) T ss_pred hccCccccccCcccccccCcceeecchHHHHHH-HHhhhhcccCceeecCch---------HHHHHHHHHHhhcChhHHH Confidence 3222210 1111 111111234555544332 3456778888888999999 Q ss_pred HHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCC-cccccccc-----c-----cchheeecCc Q lcl|NC_019530. 146 YQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEAIE-----S-----VTLKYYYNPN 214 (560) Q Consensus 146 ~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~~~-----~-----~~e~~~Y~p~ 214 (560) .++.+.+++.|+-|.+.-+|. +|-..+..+||+.+-+|..-....+ ..++.... . +.-.-+|++. T Consensus 138 ~~~~~~~~~~G~a~~~vy~d~----dg~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~ 213 (511) T protein:vir:96 138 RSLGLDLSIYGKAYELMIRNQ----DDETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSH 213 (511) T ss_pred HHHHHHHHhcCeeEEEEEeCC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCC Confidence 999999999999888766553 4668899999999988753111111 11111110 0 0011246655 Q ss_pred ccccccccccccccccccccc-----ceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHH-HHHHHHHHHHhcC Q lcl|NC_019530. 215 YNRNQFTGQSGTSQNFQPSQQ-----ELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVT-TEDATVIYAITRA 288 (560) Q Consensus 215 ~~~~~~~g~~~~~~~~~~~~~-----~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m-~EDalVIyRitRA 288 (560) .-+....+.+........-.. --.+|. +.|+. +....|-++..+.....+.. +-+....-+..+. T Consensus 214 ~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv--v~~~n-------~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~ 284 (511) T protein:vir:96 214 GVYRYLTNRTNGLKLTPRENSFESHSFERMPI--TEFSN-------NERRKGDYEKVITLIDLYDNAESDTANYMSDLND 284 (511) T ss_pred cEEEEEecCCCcccccccccccccCcCcccce--EEecC-------CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhc Confidence 433322221110000000000 001111 11211 12223445544444333332 1122222233344 Q ss_pred ccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCc-ccchhhHhh---hcccccCCCCccceeecCCCCC Q lcl|NC_019530. 289 PEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGN-AHLMGIAED---YWLPRREGQNATEIATVGGGNQ 364 (560) Q Consensus 289 PeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~-~~~msmlED---ywLpRReGgrgTEIsTLpGg~n 364 (560) |-+-+.=....+ .++++.. ...+-.+.. +.......+.|..+..|-...+ T Consensus 285 ~~lv~~G~~~~~--------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 338 (511) T protein:vir:96 285 AMLLIKGNLNLD--------------------------PVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYD 338 (511) T ss_pred chhheecCccCC--------------------------chhhcccccccceeccccceeccccccCCCCcceeEEeecCC Confidence 443332211000 0000000 000000000 0111112233445555554444 Q ss_pred cchH-HHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc----c Q lcl|NC_019530. 365 LGEM-DHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILK----G 439 (560) Q Consensus 365 Lgei-~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLk----g 439 (560) ...+ .-+.-+.+.+|.-.++|---.+.-+ |...+..|.--+..-..-+.+.++.|..-+...++.-+-+- + T Consensus 339 ~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~----~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~ 414 (511) T protein:vir:96 339 VQGTEAYKDRLNSDIHMFTNTPNMKDDNFS----GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRS 414 (511) T ss_pred HHHHHHHHHHHHHHHHHHhCCccccccccc----cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 4333 3455667788888888853222211 11223334433333344455566666666655555422222 2 Q ss_pred CCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhh Q lcl|NC_019530. 440 VTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEK 519 (560) Q Consensus 440 iit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~ 519 (560) +-.+.+|. .|.+.|...---.+ .+.++++..+ +| .+|.+++++. |..+++ .+++.++|++|. T Consensus 415 ~~~~~~~~-----~i~~~f~~~~p~n~-------~e~~d~~~kl---~G-~iS~et~l~~-l~~v~d-~~~El~ri~~E~ 476 (511) T protein:vir:96 415 IDANKDFN-----TVRYVYNRNLPKSL-------IEELKAYIDS---GG-KISQTTLMSL-FSFFQD-PELEVKKIEEDE 476 (511) T ss_pred Cccccccc-----cceEEeCCCCCcCH-------HHHHHHHHHH---hc-cCChHHHHHh-CCCCCC-HHHHHHHHHHHH Confidence 22344443 36677754322112 2344555555 35 4999999976 666542 334444555554 Q ss_pred hcCCCCCCCccccccccccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 520 KKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 520 ~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) ++.. +.......++....+.. .-.++ +...+. T Consensus 477 ~~~~-~~~~~~~~~~~~~~~~~--~~~~~------~~~~~~ 508 (511) T protein:vir:96 477 KESI-KKAQKGIYKDPRDINDD--EQDDD------TKDTVD 508 (511) T ss_pred HHHH-HHHhhccccCCCCCCCC--CCCCC------ccCccc Confidence 3211 11111111111111110 00011 111111 No 123 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=96.24 E-value=0.00083 Score=37.47 Aligned_cols=436 Identities=11% Similarity=0.036 Sum_probs=171.3 Q ss_pred CceeEeccccc----eeccccccchhhHHHHHHHHHHHhhccchhHHHH--------------------HhhcceeeecC Q lcl|NC_019530. 50 GSIILQGGANT----YNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD--------------------IIVNDVVTCEE 105 (560) Q Consensus 50 G~~~i~~~~~~----~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~--------------------eIvneaiv~d~ 105 (560) -++.++...-. ....-+.--+.--..-+.+|+.+..+.+-...|. .||+..+ .=- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~Iv~~~~-~~l 79 (499) T protein:vir:10 1 MAVVIDKDLLDDVNEPNIEAINYAIRELQNRKKRLDKLSDYYNGKQEIEKHEFDNATVEAANVMVNHAKYITDMNV-GFM 79 (499) T ss_pred CccchhhhHHhhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCCcCcCCCCcceeecchHHHHHHHHh-hhh Confidence 11111111000 0000000000000112234444444443332221 1111111 111 Q ss_pred CCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCC-------------ccC Q lcl|NC_019530. 106 DETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNK-------------KGG 172 (560) Q Consensus 106 ~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~-------------k~G 172 (560) -..||.+..++. ...++|..+++-.+|+....++.+..++.|+-|.+.-+|.+.. ... T Consensus 80 ~g~p~~~~~~~~---------~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~~~ 150 (499) T protein:vir:10 80 TGNPVKYVAEKG---------KNIDDILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLTPNT 150 (499) T ss_pred cccCceeecCCh---------hHHHHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEeccccccccccccccccccccc Confidence 233444444332 3455677788888999999999999999999888776665432 111 Q ss_pred eeEeEEeCccccchhhhhcccCC-cccccccc--------ccchheeecCcccccccccccc-cc----cccccccccee Q lcl|NC_019530. 173 IKKLVMLDSRCIRPVYIVEKAMR-EGGIEAIE--------SVTLKYYYNPNYNRNQFTGQSG-TS----QNFQPSQQELV 238 (560) Q Consensus 173 I~eLr~LDPr~i~~vr~i~~~~~-~g~~~~~~--------~~~e~~~Y~p~~~~~~~~g~~~-~~----~~~~~~~~~~~ 238 (560) =..+..+||+.+-+|+.-..... ...+.... .+...-+|.|..-+........ .. .....+..--+ T Consensus 151 ~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~ 230 (499) T protein:vir:10 151 ELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEVSANDPIVYDGENLFGA 230 (499) T ss_pred ceEEEEEcccceEEEecCCCCcceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccccCcceecccccCCCCc Confidence 24477899988766642111100 00010000 0111123544322211110000 00 00000000001 Q ss_pred cchhceeeeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhh Q lcl|NC_019530. 239 FDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKF 317 (560) Q Consensus 239 i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~ 317 (560) +|. |.|. ++....|-++..+.....+. ++-+....-+-...|-+-+.-.+.+... T Consensus 231 vPv--v~~~-------n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~--------------- 286 (499) T protein:vir:10 231 VPI--IEFR-------NNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGLGDDK--------------- 286 (499) T ss_pred cce--EEec-------CCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccc--------------- Confidence 221 1121 22233455555555555444 3345555556677776666533322211 Q ss_pred hhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCccc Q lcl|NC_019530. 318 KNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGSIN 396 (560) Q Consensus 318 knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~ 396 (560) .....+..+.+.--.+..|..+.+|-...+... ..-+.-+.+.+|+-..+|- +..+ .+ T Consensus 287 -----------------~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~-~~- 345 (499) T protein:vir:10 287 -----------------DDIQRLKRGAIEAPPREEGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPN--MNDE-KF- 345 (499) T ss_pred -----------------chhhhhhhcceeccCCCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCccc--CCch-hh- Confidence 111111111111112222334666655444433 2345667778888888883 1111 11 Q ss_pred ccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-ChhHHHHhhhhhceeeeeecchHHHHHHHHHHHH Q lcl|NC_019530. 397 IGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVT-DEKDWNEKIKPFIKFEFTSDSYIREQQENAILND 475 (560) Q Consensus 397 l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgii-t~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~ 475 (560) .|...+..|..-......-+.+.++.|...+.+.++.-+-+-++. ...+|. .|.+.|...---.+ .+ T Consensus 346 ~gn~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~d~~-----~i~i~f~~~~p~n~-------~e 413 (499) T protein:vir:10 346 MGNVSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIKGANDDAS-----GCKISLVANIPSNL-------SD 413 (499) T ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccc-----cceEEeCCCCCCCH-------HH Confidence 122223334333333334455666666666666666544433321 223342 35666644432222 24 Q ss_pred HHHHHHHhhhhhcchhhHHHHHHHHhCCC---HHHHHHHHHHHHHhhhcCCCCCCCcc---ccccccccccCCCCCCccc Q lcl|NC_019530. 476 RLASLNTVEPFVGSIFSIDYVMRNVLRMS---DEEVKEQQTKIAEEKKKGLYPKVQAD---ETGNYSGSDVSPLKFKPET 549 (560) Q Consensus 476 Rl~~l~~~~p~vGky~S~~~i~k~IL~~t---DeEI~e~~kqIe~E~~~~~~~~p~~~---e~g~~~~~~~~~~~~~p~~ 549 (560) .+++++.+ .| .+|.+++++. |... ++|++.+.++-+++.+.. .++... +.+++......+.+-.++- T Consensus 414 ~~~~~~kl---~g-~iS~et~~~~-l~~v~d~~~E~~ri~~E~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (499) T protein:vir:10 414 VVNNVKNA---DG-IIPRKYTYSW-LPDVDNPQDVIDEMNQQDAETIKKN--QEALRGQDPDRLELEDKQDDSSENDKEA 486 (499) T ss_pred HHHHHHHH---hc-cCChHHHHHh-CCCCCCHHHHHHHHHHHHHHHHHHH--HhhhccCCCCCCCCCCCCcccCCCCCCC Confidence 45555555 35 4999999977 5553 344444433332221110 111000 1111000000000000000 Q ss_pred -cc--ccCCcccC Q lcl|NC_019530. 550 -IP--FSGSTDDS 559 (560) Q Consensus 550 -~~--~~~~~~~~ 559 (560) .. .+|-|--. T Consensus 487 ~~~~~~~~~~~~~ 499 (499) T protein:vir:10 487 GSNHNQSHRTRAV 499 (499) T ss_pred ccccccCCCCCCC Confidence 00 01111111 No 124 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=96.22 E-value=0.00086 Score=37.39 Aligned_cols=409 Identities=10% Similarity=0.046 Sum_probs=167.3 Q ss_pred ccceeccccccc----------------------hhhHHHHHHHHHHHhhccchhHHHH--------------------- Q lcl|NC_019530. 58 ANTYNYVGVESE----------------------LLSVKTVVEEYQSMAQQPEIRKAVD--------------------- 94 (560) Q Consensus 58 ~~~~~y~~~e~~----------------------~~~~~~LI~~YR~ma~~PEvd~AI~--------------------- 94 (560) +-..+|+..... +..-..-+.+|+.+..+.+-+..|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki 80 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRLINDHKPKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKPDWRM 80 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhccCCcchhccchhcccccccccccchhc Confidence 222222211111 0011123345666554433332111 Q ss_pred ------HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCC Q lcl|NC_019530. 95 ------IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTN 168 (560) Q Consensus 95 ------eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~ 168 (560) .||+ .-+.=--..||++..++.+ ..+.++.+++ -+|+....++.+..++.|+-|.+.-+|. T Consensus 81 ~~n~~~~Ivd-~~~~~l~g~p~~~~~~d~~---------~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~y~d~-- 147 (474) T protein:vir:96 81 FTNYHQNLVD-QKVAYAVANPVTFSSDDDK---------SLKTIQEVLN-HKWDDKLVDILTAASNKGIEWLQPYIDE-- 147 (474) T ss_pred ccchHHHHHH-hhhhhhcccCceeecCchH---------HHHHHHHHHh-cCHHHHHHHHHHHHHhcCeeEEEEEecC-- Confidence 1111 1111123455666555432 2334444543 4788889999999999999888766654 Q ss_pred CccCeeEeEEeCccccchhhhhcccCC-cccccc--ccccchheeecCcccc-ccccccccccccc----ccccc----c Q lcl|NC_019530. 169 KKGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEA--IESVTLKYYYNPNYNR-NQFTGQSGTSQNF----QPSQQ----E 236 (560) Q Consensus 169 ~k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~--~~~~~e~~~Y~p~~~~-~~~~g~~~~~~~~----~~~~~----~ 236 (560) +|-..+..+||+.+-++..-....+ ...+.. .+....+-+|.+..-. +...+........ ..... . T Consensus 148 --~~~~~i~~~~p~~~~~v~d~~~~~~~~~~vr~~~~~~~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (474) T protein:vir:96 148 --NGEFKTFRVPAEQAIPIWTNKERDTLKAFIRYYRLDGAERVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYYVGN 225 (474) T ss_pred --CCceEEEEEcccceEEEEcCCCCCceEEEEEEEeecCceEEEEEeCCeEEEEEecCCceeeccccccccccccccccc Confidence 3557799999999877642110000 011111 1111112223222111 0010100000000 00000 0 Q ss_pred eecchhceeeeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHH Q lcl|NC_019530. 237 LVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMG 315 (560) Q Consensus 237 ~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~ 315 (560) ..-+...|-.++. .++....|=++..+...+.+. ++-+..-.-+..+.|-+-+.-.+.... .+.+. -|. T Consensus 226 ~~~~~g~iPvv~~----~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~-----~~~~~-~~~ 295 (474) T protein:vir:96 226 KRVSWGRVPFIPF----KNNPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYEGQDL-----DEFMR-NLK 295 (474) T ss_pred cccCCCceeEEEe----ccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccc-----cchhh-hhh Confidence 0001111111111 123334455666555555554 333444444666666544332221110 00000 011 Q ss_pred hhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCc Q lcl|NC_019530. 316 KFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEM-DHVNYFREKLYDALMIPKSRLQEEGS 394 (560) Q Consensus 316 k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~VP~sRl~~~~~ 394 (560) . ++++ ++| +.|.+++.|-...++... ..+.-..+.+|+..++|-.-. + + T Consensus 296 ~--~~~i---------------------~~~----~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~--~-~ 345 (474) T protein:vir:96 296 Y--YKAI---------------------NVD----GDGSGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQ--D-K 345 (474) T ss_pred c--CceE---------------------Eec----CCCCceeEEeecCChHHHHHHHHHHHHHHHHHhCCccccc--c-c Confidence 1 1111 111 123345555544444332 334566788999999994322 1 1 Q ss_pred ccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHH Q lcl|NC_019530. 395 INIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILN 474 (560) Q Consensus 395 ~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~ 474 (560) +. +...+..|..-...--.-+.+.+..|..-+.+.|+.-+-+.|+ . .+|. .|.+.|...--..+.. T Consensus 346 ~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~-~-~~~~-----~i~i~f~~~~p~~~~e------ 411 (474) T protein:vir:96 346 FG-NSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKL-N-IKVQ-----DVEITFNFNVMVNELE------ 411 (474) T ss_pred cc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-C-cccc-----eeeEEeccCCCcCHHH------ Confidence 11 1111223322222222345666666666666666654444454 2 2332 2556664443332321 Q ss_pred HHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccccc--ccccCCCCCCcccccc Q lcl|NC_019530. 475 DRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYS--GSDVSPLKFKPETIPF 552 (560) Q Consensus 475 ~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~--~~~~~~~~~~p~~~~~ 552 (560) -++++.++ | .+|.+++++. |...++ -+++.++|++|..+.. +...+.++++. .++... T Consensus 412 -~~~~~~~a----g-~iS~et~~~~-~~~v~d-~~~E~~ri~~E~~e~~--~~~~~~~~~~~~~~~d~~~---------- 471 (474) T protein:vir:96 412 -QSQIGVQS----Q-YLSKETVVTN-HPWVDD-PVAELERIEQDNIDFN--KQLPPLEGDANGRAQDNES---------- 471 (474) T ss_pred -HHHHHHhc----C-CCchHHHHHh-CCCCCC-HHHHHHHHHHHHHHHH--hcccccccccccccCCCcc---------- Confidence 22333332 3 6899999976 554432 3456666666653321 11111111111 111110 Q ss_pred cCCcc Q lcl|NC_019530. 553 SGSTD 557 (560) Q Consensus 553 ~~~~~ 557 (560) .|| T Consensus 472 --e~~ 474 (474) T protein:vir:96 472 --ETN 474 (474) T ss_pred --cCC Confidence 011 No 125 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=96.18 E-value=0.00091 Score=37.25 Aligned_cols=443 Identities=12% Similarity=0.032 Sum_probs=170.2 Q ss_pred cCCCCCCCCceeE--------ecccc-ceeccccccchhhHHH----H--------HHHHHHHhhccchhHHHHHhhc-- Q lcl|NC_019530. 42 VALDDAQDGSIIL--------QGGAN-TYNYVGVESELLSVKT----V--------VEEYQSMAQQPEIRKAVDIIVN-- 98 (560) Q Consensus 42 ~~~~~~~dG~~~i--------~~~~~-~~~y~~~e~~~~~~~~----L--------I~~YR~ma~~PEvd~AI~eIvn-- 98 (560) +.--+.=.|...+ ....+ .+.|..++.......+ + ..+|+.+..+.+=...|-.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~ 80 (512) T protein:vir:97 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) T ss_pred CccceeccCceeeeeCceeeeccccccccccCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcc Confidence 0000111111111 11111 1223344333322111 1 2245555444443332211000 Q ss_pred -------------------ceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhh Q lcl|NC_019530. 99 -------------------DVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQA 159 (560) Q Consensus 99 -------------------eaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~ 159 (560) +..+.---..||++..++ +...+.+..+++-.+|+....++.+..++.|+-| T Consensus 81 ~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d---------~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay 151 (512) T protein:vir:97 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDD---------KDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAY 151 (512) T ss_pred cccccCcceeecchHHHHHHHHhhhhcccCceeccCC---------hHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeEE Confidence 011111122333333322 1344567777777889999999999999999999 Q ss_pred hhheecCCCCccCeeEeEEeCccccchhhhhcccCC-cccccccc---------ccchh-eeecCccccccccccccccc Q lcl|NC_019530. 160 YHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEAIE---------SVTLK-YYYNPNYNRNQFTGQSGTSQ 228 (560) Q Consensus 160 ~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~~~---------~~~e~-~~Y~p~~~~~~~~g~~~~~~ 228 (560) .+.-+|. +|-..+..++|+.+-+|..-....+ ..++.... ....+ -+|++..-+........... T Consensus 152 ~~vy~de----d~~~~i~~~~p~~~~~iyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~ 227 (512) T protein:vir:97 152 ELMIRNQ----DDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLK 227 (512) T ss_pred EEEEeCC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccc Confidence 8777654 3557799999999877642111000 01111110 00111 24555433322211111000 Q ss_pred cccccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHH Q lcl|NC_019530. 229 NFQPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVT-TEDATVIYAITRAPEKRAFYLDVGTLGKKSAE 307 (560) Q Consensus 229 ~~~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m-~EDalVIyRitRAPeRRvFyIDvGnlpk~KAe 307 (560) ...-......-+-..|-.++. +++....|-++.++.....+.. +=+....-+-++.|-+-+.=....+ .. T Consensus 228 ~~~~~~~~~~~~~g~vPvv~~----~nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~-----~~ 298 (512) T protein:vir:97 228 LTPRENGFESHSFERMPITEF----SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLD-----PV 298 (512) T ss_pred ccccccccccccCcccceEee----cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCC-----ch Confidence 000000000000111111111 1222334556665555544442 2222333344444444332211111 00 Q ss_pred HHHHHHHHhhhhheeecccCCcccCcccchhhHhh----hcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhc Q lcl|NC_019530. 308 EYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAED----YWLPRREGQNATEIATVGGGNQLGE-MDHVNYFREKLYDAL 382 (560) Q Consensus 308 qYl~~im~k~knk~vYd~~TGev~d~~~~msmlED----ywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL 382 (560) +. .+.+. ..+..+++ +..+.-+++.|..+..|-...+... -.-+.-+.+.+|.-. T Consensus 299 ~~-----~~~~~---------------~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s 358 (512) T protein:vir:97 299 EV-----RKQKE---------------ANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFT 358 (512) T ss_pred hh-----hhhhh---------------cccccccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHh Confidence 00 00000 00000111 1111111222333444433333322 234556677788888 Q ss_pred CCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---ccC-CChhHHHHhhhhhceeee Q lcl|NC_019530. 383 MIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLIL---KGV-TDEKDWNEKIKPFIKFEF 458 (560) Q Consensus 383 ~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiL---kgi-it~eew~~~i~~~I~~~f 458 (560) ++|---.+.-+ |...+..|..-...-..-+.+.++.|..-+...++.-+-+ ++. -.+.+|. .|.+.| T Consensus 359 ~~p~~~~~~~~----gn~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~-----~i~~~f 429 (512) T protein:vir:97 359 NTPNMKDDNFS----GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFN-----TVRYVY 429 (512) T ss_pred CCcccCccccc----ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccc-----cceEEe Confidence 88864322211 2222333443333334445555655555555554442221 222 2344553 366777 Q ss_pred eecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCH---HHHHHHHHHHHHhhhcCCCCCCCccccccc Q lcl|NC_019530. 459 TSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSD---EEVKEQQTKIAEEKKKGLYPKVQADETGNY 535 (560) Q Consensus 459 ~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tD---eEI~e~~kqIe~E~~~~~~~~p~~~e~g~~ 535 (560) ... .-.- +.+.++++..+ +| .+|.+++++. |...+ +|++.+.++-+++++. ......++. T Consensus 430 ~~~-~p~~------~~e~~~~~~kl---~g-iiS~et~~~~-l~~v~d~~~E~eri~~E~~~~~~~-----~~~~~~~~~ 492 (512) T protein:vir:97 430 NRN-LPKS------LIEELKAYIDS---GG-KISQTTLMSL-FSFFQDPELEVKKIEEDEKESIKK-----AQKGIYKDP 492 (512) T ss_pred CCC-CCcC------HHHHHHHHHHH---hc-cCchHHHHHh-CCCCCCHHHHHHHHHHHHHHHHHH-----HhhcccCCC Confidence 542 2111 12334555555 35 4999999976 56543 4444444443333211 111111111 Q ss_pred cccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 536 SGSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 536 ~~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) .+.+... |...+...+. T Consensus 493 ~~~~~~~--------~~~~~~~~~~ 509 (512) T protein:vir:97 493 RDINDDE--------QDDDTKDTVD 509 (512) T ss_pred CCCCCCC--------CCCCcccccc Confidence 1111110 0111112222 No 126 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=96.16 E-value=0.00093 Score=37.20 Aligned_cols=415 Identities=8% Similarity=0.024 Sum_probs=175.5 Q ss_pred cccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhH-HHHHHHHHHHhhccchhHHHHHh--------h Q lcl|NC_019530. 27 LAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSV-KTVVEEYQSMAQQPEIRKAVDII--------V 97 (560) Q Consensus 27 ~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~-~~LI~~YR~ma~~PEvd~AI~eI--------v 97 (560) +...+.+-+ ..+..+ .....+|+.+..+.+=...|... - T Consensus 1 ~~~~~~~~i--------------------------------~~l~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~ 48 (441) T protein:vir:80 1 MNSDELALI--------------------------------EGMYDRIQRLSSWHCCIEGYYEGSNRVRDLGVAIPPELQ 48 (441) T ss_pred CCccHHHHH--------------------------------HHHHHHHHHHHHHHHHHHHHHhcCCcchhcCcccchhhh Confidence 100000000 000000 00112233333322222111110 0 Q ss_pred cceeeecCCCceE-----EE---eeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCC Q lcl|NC_019530. 98 NDVVTCEEDETPV-----TV---NLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNK 169 (560) Q Consensus 98 neaiv~d~~~~~V-----~l---~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~ 169 (560) |--+|..--...| .+ .++..+ . +..+.|.+-.+|+...+++++.-++.|+-|.+.-.| T Consensus 49 ~~k~~~n~~~~ivd~~~~~l~~~g~~~~d--~--------~~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d---- 114 (441) T protein:vir:80 49 RVQTVVSWPGIAVDALEERLDWLGWTNGD--G--------YGLDGVYAANRLATASCDVHLDALIFGLSFVAIIPH---- 114 (441) T ss_pred hhhhhcchHHHHHHHHHhhhccccccCCC--h--------HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeC---- Confidence 0000000000000 00 111111 1 234556666789999999999999999988765443 Q ss_pred ccCeeEeEEeCccccchhhhhcccCCcccccccc----ccchheeecCcccccccc-ccccccccccccccceecchhce Q lcl|NC_019530. 170 KGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIE----SVTLKYYYNPNYNRNQFT-GQSGTSQNFQPSQQELVFDDESI 244 (560) Q Consensus 170 k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~----~~~e~~~Y~p~~~~~~~~-g~~~~~~~~~~~~~~~~i~~daI 244 (560) ++|-..++.++|+++-.+..=......-...... .....-+|.|..-..... +......+-.-+..--++|. | T Consensus 115 ~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~~~~~~~~vy~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv--v 192 (441) T protein:vir:80 115 GDGTVSVRPQSPKNCTGKFSADGSRLDAGLVVQQTCDPEVVEAELLLPDVIVQVERRGSREWVEVDRIPNVLGAVPL--V 192 (441) T ss_pred CCCceEEEEEccceEEEEEeCCCCceeEEEEEEEEecCceEEEEEEecCeEEEEEEcCCcceeeccccccCCCceeE--E Confidence 3577789999999987764211000000000000 011112244332221111 10000000000000001221 2 Q ss_pred eeeecccccCCCCcchhhhHHHHHHHH-H-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhhee Q lcl|NC_019530. 245 VYIDSGEEPLANGIVPGLLNPAIRPLN-N-LVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNA 322 (560) Q Consensus 245 ~y~hsGl~~~~~~~i~syL~~AiK~~N-q-L~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~v 322 (560) .|.+---.++. ...|-|...++++- - -+++-+..++-+.+..|.|-+.=.+.+..+. + ..+ T Consensus 193 ~~~n~~~~~~~--~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~G~~~~~~~~--------~---~~~---- 255 (441) T protein:vir:80 193 PIVNRRRTSRI--DGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVTGVSADEFSQ--------P---GWV---- 255 (441) T ss_pred EeeccccCCcc--CCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeeecCCcccccc--------c---hhh---- Confidence 23322111111 11233333333321 1 2456677778888888877554222111110 0 000 Q ss_pred ecccCCcccCcccchhhHhhhc-ccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccc Q lcl|NC_019530. 323 YDRTTGKITGNAHLMGIAEDYW-LPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSN 401 (560) Q Consensus 323 Yd~~TGev~d~~~~msmlEDyw-LpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~ 401 (560) ...|. +| +|--++|.+.++..+|++.-=.-++-++=....++...++|.+-|...+..+ . | T Consensus 256 --~~~~~-------------i~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~-~-S- 317 (441) T protein:vir:80 256 --LSMAS-------------VWAVDKDDDGDTPNVGSFPVNSPTPYSDQMRLLAQLTAGEAAVPERYFGFITSNP-P-S- 317 (441) T ss_pred --hcccc-------------cccCCCCCCCCcceeEecCccchHHHHHHHHHHHHHHhcccCCCHHHhccCCCcc-h-H- Confidence 01121 23 3444455556777788642212233344455667777888877775443211 1 1 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 402 LAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLN 481 (560) Q Consensus 402 ~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~ 481 (560) +..|.--+...-.-+.+.++.|..-+...++.-+-+.|..... .... ..|.+.|..-- ..-+.+.++.+. T Consensus 318 g~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~--~~~~-~~i~~~f~~~~-------~~~~~e~ad~~~ 387 (441) T protein:vir:80 318 GEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVDE--ADFF-GDVGLRWRDAS-------TPTRAATADAVT 387 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc--cccc-eeeeEEeCCCC-------CcCHHHHHHHHH Confidence 2335444445666677888888888888777655555542221 1111 23666665432 122234555555 Q ss_pred Hhhhhhcc-hhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcccc Q lcl|NC_019530. 482 TVEPFVGS-IFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETI 550 (560) Q Consensus 482 ~~~p~vGk-y~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~ 550 (560) ++..- |. ..|.++++ ..|.++++|++++.+.-+++. ++.+.. ++..+=.|+-+ T Consensus 388 kl~~~-g~~~~s~~~~~-~~l~~~~~e~~~~~~e~~e~~------~~~~~~--------~~~~~~~~~~~ 441 (441) T protein:vir:80 388 KLVGA-GILPADSRTVL-EMLGLDDVQVEAVMRHRAESS------DPLAVL--------AGAISRQTNEV 441 (441) T ss_pred HHHhc-CcccccHHHHH-HhCCCCHHHHHHHHHHHHHHH------HHHHHH--------hhhhhcccccC Confidence 55443 44 46888887 568999999997765433331 111111 11111112222 No 127 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=96.15 E-value=0.00094 Score=37.18 Aligned_cols=425 Identities=10% Similarity=0.084 Sum_probs=178.2 Q ss_pred hhhhhhhcccccchhhhcc--ccccccCCCCCCCCceeEeccccce-eccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 18 VNAKVDTDKLAQKQDELLL--TKATVVALDDAQDGSIILQGGANTY-NYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 18 ~~~~~~~~~~~~~~~e~~~--~k~~s~~~~~~~dG~~~i~~~~~~~-~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) ..++.... +..++.. .+..+... .. +. + +..+.+ +..++. . + -...+++|.|..||+ T Consensus 1 ~~~~~~~~----~~~~~~~~~~~~~~~~~-~~--~~--~-~~~~~g~~~~g~~--v-~-------~~~al~~~~V~~~v~ 60 (454) T protein:vir:93 1 MWNLLRRT----RKNQKSGRDVREAGWTS-LF--QA--V-AEPFAGAWQQGVK--A-D-------PEAVLSFHAVFACIS 60 (454) T ss_pred CCCccccC----cccccccccccchhhhh-hh--hh--h-hhhhcchhhcCcc--c-C-------hHHhhccHHHHHHHH Confidence 11111111 1111111 01111100 00 00 0 000111 001110 0 0 124467889999997 Q ss_pred HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHH-HHHHHHHhCCc----hhhHHHHHHhhhhhhhhhhheecCCCC Q lcl|NC_019530. 95 IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITEC-FKEVMHLMDFD----NTAYQKIRKWYVDGRQAYHVIVDPTNK 169 (560) Q Consensus 95 eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~ee-F~~il~ll~f~----~~~~~~~R~wYvDGr~~~hkiiD~~~~ 169 (560) -|.+.+.-. |+.|-=...++ .++.+.+- +..+++-=+-. .=.+.++..++++|.-|..++.|.. T Consensus 61 ~Ia~~iA~l-----p~~~~~~~~~g----~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~-- 129 (454) T protein:vir:93 61 LISQDIAKM-----RLRLMQTDAQG----IRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNAR-- 129 (454) T ss_pred HHHHhhccC-----ceEEEEeccCC----ccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCC-- Confidence 776654422 33332111111 22222221 11111111222 2234455667888988887776543 Q ss_pred ccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeec Q lcl|NC_019530. 170 KGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDS 249 (560) Q Consensus 170 k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hs 249 (560) .-+.+|.+|+|.+|..++ +.+| .-+|.|.+... ...+....++.+.|.|+.. T Consensus 130 -G~~~~L~~i~~~~v~v~~-----~~~g--------~~~y~~~~~~~--------------~~~~~~~~~~~~eViH~k~ 181 (454) T protein:vir:93 130 -GQIKELRILDWNRVEPLV-----ADDG--------EVFYRITPDRN--------------CGITEAVTVPAREVIHDRF 181 (454) T ss_pred -CcEEEEEEEcCcceEEEE-----cCCC--------cEEEEEEeccc--------------cccceeEEecCcceEEecc Confidence 238899999999887642 1122 12344432111 1122355688888888865 Q ss_pred ccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCc Q lcl|NC_019530. 250 GEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGK 329 (560) Q Consensus 250 Gl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGe 329 (560) +. ..++-+.+|-+..|.+.+.....+++...=+=---+--+-|..++ +.|-+..+++ +...++...-= ...|. T Consensus 182 ~~-~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~----~~~~~~~~~~g-~n~g~ 254 (454) T protein:vir:93 182 NC-FFHPLIGLPPVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIP-GSITEENAKK----LKSNWDSGYTG-ENAGK 254 (454) T ss_pred CC-CCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecC-CCCCHHHHHH----HHHHHHHHhcc-cccCC Confidence 43 334456789999999999999999987653211113335566665 5665554433 22333322100 11222 Q ss_pred ccCcccchhhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHH Q lcl|NC_019530. 330 ITGNAHLMGIAEDYWLPRREGQNATEIATVGGG-NQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQE 408 (560) Q Consensus 330 v~d~~~~msmlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRD 408 (560) .+ +++ .|.+++.|.=. ..+.-++-.+|..+.+.++++||...|+..++-+ .+++++..+. T Consensus 255 ------~~-vl~----------~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~sn~e~~~~~ 315 (454) T protein:vir:93 255 ------TA-ILS----------NGAKYNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPS--SDNVEALEQQ 315 (454) T ss_pred ------ce-ecc----------CCceEEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCc--chhHHHHHHH Confidence 11 232 24445444321 1222234445777899999999999997544322 3444444333 Q ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_019530. 409 ELRFSKF-CAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFV 487 (560) Q Consensus 409 ElkF~Kf-i~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~v 487 (560) |.++ |.-+..++...+.. .| .+.++ ..+.|..+ ++... -+..|++.+..+-. T Consensus 316 ---f~~~~l~P~~~~ie~~ln~----~L-----~~~~~--------~~~~f~~~----~ll~~-D~~~r~~~~~~~~~-- 368 (454) T protein:vir:93 316 ---YYSQCLQTLIESIELLLDE----AL-----ETGEN--------ESTEFDVT----TLLRM-DSERRMKTLGDAVK-- 368 (454) T ss_pred ---HHHHHHHHHHHHHHHHHHH----hh-----cCCCC--------cEEEeech----hhhcc-CHHHHHHHHHHHHh-- Confidence 5443 34444444433332 22 23222 12334322 22221 12356665555422 Q ss_pred cchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcC--CCCCCCccccccccccccCCCCCCcccc--------cccCCcc Q lcl|NC_019530. 488 GSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKG--LYPKVQADETGNYSGSDVSPLKFKPETI--------PFSGSTD 557 (560) Q Consensus 488 Gky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~--~~~~p~~~e~g~~~~~~~~~~~~~p~~~--------~~~~~~~ 557 (560) .-++|.+-++. .+.|..-+ --++-+-.-..-+ -..++++.++. ..+.+..+-.|+.. +....+| T Consensus 369 ~G~~T~NE~R~-~~gl~pi~--ggD~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~d~~~~~~e~~~d 442 (454) T protein:vir:93 369 NTLLTPNEARK-RENLPPLA--GGDALYLQQQNYSLEALSRRDAREDP---FASSGKTASVPQAVAASDGNKAITETEHD 442 (454) T ss_pred CCCcCHHHHHH-HhCCCCCC--CCCeeeeccCccchHhhhccCcccCC---CCCCccCCCCCCCCCCCCCCCCccCCccc Confidence 24778888874 47775422 0000000000000 00000000000 00111111111111 1111111 Q ss_pred cCC Q lcl|NC_019530. 558 DSI 560 (560) Q Consensus 558 ~~~ 560 (560) .+- T Consensus 443 ~~~ 445 (454) T protein:vir:93 443 AVK 445 (454) T ss_pred hhh Confidence 111 No 128 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=96.15 E-value=0.00094 Score=37.17 Aligned_cols=442 Identities=11% Similarity=0.055 Sum_probs=166.9 Q ss_pred chhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchh---------hHHHH Q lcl|NC_019530. 6 RGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELL---------SVKTV 76 (560) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~---------~~~~L 76 (560) --|.-|..|+.- +.++ ..+-...+.+...++-.. .+ +.+.+.... .-.+. T Consensus 1 ~~~~~~~~~~~~--~~~~-----------~~~~~~~~~~~~~~~~~~-----~~---~~~~~~~~~~~~i~~~i~~~~~~ 59 (492) T protein:vir:97 1 MQFIQLISQVAQ--ALIK-----------GGNILYPSQPTQTEIFDA-----IV---RTNNKPETLEEMIVRYIKQHLEK 59 (492) T ss_pred ChHHHHHHHHHH--HHhc-----------CCceeeccchhhhhHhhh-----cc---cCCCchhhHHHHHHHHHHHHHHH Confidence 112222222110 0011 111111121111111110 00 111111110 11123 Q ss_pred HHHHHHHhhccchhHHHH---------------------------HhhcceeeecCCCceEEEeeecccchhHHHHHHHH Q lcl|NC_019530. 77 VEEYQSMAQQPEIRKAVD---------------------------IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESIT 129 (560) Q Consensus 77 I~~YR~ma~~PEvd~AI~---------------------------eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~ 129 (560) +.+|+.+..+.+-...|. -||+ ..+.---..||++..++. ... T Consensus 60 ~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd-~~~~yl~g~p~~~~~~d~---------~~~ 129 (492) T protein:vir:97 60 LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVD-QKVSYIVGKPIAFKHTDD---------EVV 129 (492) T ss_pred HHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHH-HHhhhhcccCceeccCch---------HHH Confidence 344555555444432210 1111 111111234455544432 223 Q ss_pred HHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccC-Ccccccccc--ccc Q lcl|NC_019530. 130 ECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAM-REGGIEAIE--SVT 206 (560) Q Consensus 130 eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~-~~g~~~~~~--~~~ 206 (560) +..+.+++ .+|+....++.+.+++.|+-|.+.-+|. +|-..++.++|+.+-++..-.... ....+.... ... T Consensus 130 ~~l~~~~~-n~~~~~~~~~~~~~~~~G~a~~~v~~d~----dg~~~~~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~ 204 (492) T protein:vir:97 130 KRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDE----EGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENET 204 (492) T ss_pred HHHHHHHh-ccHHHHHHHHHHHHhhcCeEEEEEEecC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEeeccce Confidence 34444544 5789999999999999999888766543 466789999999987764211000 001111100 000 Q ss_pred hheeecCcccc-cccccccccccccccccccee-------cchhceeeeecccccCCCCcchhhhHHHHHHHHHHH-HHH Q lcl|NC_019530. 207 LKYYYNPNYNR-NQFTGQSGTSQNFQPSQQELV-------FDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTE 277 (560) Q Consensus 207 e~~~Y~p~~~~-~~~~g~~~~~~~~~~~~~~~~-------i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~E 277 (560) ..-+|+|..-. +.+.+... ........+... +..=.|++.. ++....|-++..+....-+. ++= T Consensus 205 ~~~~y~~~~v~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~g~vPvv~~~------nn~~g~sd~e~v~~liDa~d~~~S 277 (492) T protein:vir:97 205 KVEYWDKVTVNYYVYENGSL-IPDYSNNLENSKTHFSTGSWGKIPFIPFK------NNDLEISDIFMYKTLIDAYNRRLS 277 (492) T ss_pred eEEEEecCeEEEEEEecCee-eecccccccccccccccCCCCCcceEEec------CCCCCCCchHhHHHHHHHHHHHHH Confidence 11122322111 10111100 000000000000 0000112211 12223344444333333332 333 Q ss_pred HHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCcccee Q lcl|NC_019530. 278 DATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIA 357 (560) Q Consensus 278 DalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIs 357 (560) +....-+-++-|-+-+.-.+.-..+ +... ++..++...-++ +..+. T Consensus 278 ~~~~~~~~~~~~~l~~~g~~~~~~~-----~~~~---------------------------~~~~~~~~~~~~--~~~~~ 323 (492) T protein:vir:97 278 DLSNTFKDSNELTYVLKNYDDQELP-----EFKR---------------------------LLRYYGAIKVSD--NGGVD 323 (492) T ss_pred HHHHHHHHhccceeeeecCCcccch-----hHHH---------------------------HHhhccceecCC--CCcce Confidence 4444445555664444322211111 1111 111111111111 22355 Q ss_pred ecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 358 TVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLI 436 (560) Q Consensus 358 TLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLi 436 (560) +|-...+... ..-+.-.++.+|+-..+|- +.. +.|. |...+..|.--+.....-+.+.++.|...+...++.-+- T Consensus 324 ~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~--~~~-~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~ 399 (492) T protein:vir:97 324 TIQVEVPVENSKKYLDELYQKIMLFGQAVD--FSS-DKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFE 399 (492) T ss_pred eEeccCCHHHHHHHHHHHHHHHHHHhCCCC--CCc-cccc-cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5544334332 2334556677788888884 211 1111 122233454444445556677777777777777775444 Q ss_pred hccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHH Q lcl|NC_019530. 437 LKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIA 516 (560) Q Consensus 437 Lkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe 516 (560) +-|+ ..+|.. |.+.|..----.+. +.++++..+ +| .+|.+++++. |...++ .+++-++|+ T Consensus 400 ~~~~--~~~~~~-----i~v~f~~~~p~~~~-------e~a~~~~kl---~G-~iS~et~l~~-l~~v~d-~~~Eleri~ 459 (492) T protein:vir:97 400 HFDI--KGEHKD-----VDISFNYNKVANTE-------LQVQTAQQS---MG-IVSHETVLEN-HPFVED-LQAELERIE 459 (492) T ss_pred HhcC--Ccccce-----eeEEecCCCCCCHH-------HHHHHHHHH---hc-cCchHHHHHh-CCCCCC-HHHHHHHHH Confidence 4454 335532 55666432221121 234555555 35 4999999987 444331 223334444 Q ss_pred HhhhcCCCCCCCccccc-cccccccCCCCCCcc Q lcl|NC_019530. 517 EEKKKGLYPKVQADETG-NYSGSDVSPLKFKPE 548 (560) Q Consensus 517 ~E~~~~~~~~p~~~e~g-~~~~~~~~~~~~~p~ 548 (560) +|..+..-..++....| +.+.++..+..=.-| T Consensus 460 ~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 492 (492) T protein:vir:97 460 QEQTEYNKQLPNLDDGGADSAQQQERSNNKESE 492 (492) T ss_pred HHHHHHHHhhhccccCCCCCCcccccccccccC Confidence 44321100111100000 000000000000111 No 129 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=96.09 E-value=0.001 Score=37.00 Aligned_cols=389 Identities=13% Similarity=0.143 Sum_probs=167.6 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) |.+|+ +. .. .+.. .++.-..+-++.....|+.. .-+++|.|..||. T Consensus 1 m~~f~----------~~----~~--~~~~---~~~~~~~~~~~~~~~~~~~~---------------~Al~~~~V~~~i~ 46 (406) T protein:vir:97 1 MSFFQ----------PL----GT--SKVS---YDDYISSVLAGDVSQKYLGV---------------SALKNSDILTATS 46 (406) T ss_pred Ccccc----------cc----CC--CCCC---cchHHHHHhcCCCCcccccc---------------hhhccHHHHHHHH Confidence 22221 10 00 0000 01111111112112222221 1245788999998 Q ss_pred HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCC--------chhhHHHHHHhhhhhhhhhhheecC Q lcl|NC_019530. 95 IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDF--------DNTAYQKIRKWYVDGRQAYHVIVDP 166 (560) Q Consensus 95 eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f--------~~~~~~~~R~wYvDGr~~~hkiiD~ 166 (560) -|.+.+.-+ |+. ....++ + .+.+ ..+..||+. ..=-+.++..++.+|.-|.-++.|. T Consensus 47 ~Ia~~iA~l-----p~~--~~~~~g--~----~~~~--~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~ 111 (406) T protein:vir:97 47 IIAGDIARF-----PLV--KKDVNG--D----IIHD--EDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDP 111 (406) T ss_pred HHHHhhhhC-----eeE--EEecCc--c----cccc--chHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC Confidence 877766543 222 222221 1 1111 123333332 1223344556677787776555443 Q ss_pred CCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceee Q lcl|NC_019530. 167 TNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVY 246 (560) Q Consensus 167 ~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y 246 (560) + ..-+.+|.+|+|.++...+. ..+ ..+|.|.. ...+..+.++.+.|.| T Consensus 112 ~--~g~~~~L~~i~p~~v~v~~~-----~~~--------~~~y~~~~-----------------~~~~~~~~~~~~evih 159 (406) T protein:vir:97 112 K--TNQALQFQFYRPSETTVEET-----DNH--------EIVYTFTD-----------------MLTAKQVKCFAHDVIH 159 (406) T ss_pred C--CCeEEEEEEECCCeeEEEEc-----CCc--------eEEEEEEe-----------------cCCceEEEEccccEEE Confidence 2 23478999999988865321 112 11233321 0112345677787776 Q ss_pred eecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeeccc Q lcl|NC_019530. 247 IDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRT 326 (560) Q Consensus 247 ~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~ 326 (560) .. ..+.++=..+|-|..|.+++.....+++...=+----++- .++...-+.|-+..+++. ++-+.++.. ... T Consensus 160 ~r--~~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~-~~i~~~~~~l~~e~~~~~-~~~~~~~~~----g~n 231 (406) T protein:vir:97 160 WK--FFSHDTILGRSPLLSLGDEIDLQTGGINTLIKFFKDGFSS-GILTMKGAQLSGDARQRA-RQEFEKMRE----GSV 231 (406) T ss_pred ec--CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCC-ceEEecCCCCCHHHHHHH-HHHHHHHhc----ccc Confidence 65 3444443467889999998888777887655443344553 455555566655444433 332322211 112 Q ss_pred CCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhh Q lcl|NC_019530. 327 TGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEI 405 (560) Q Consensus 327 TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eI 405 (560) .|.+ + .++ .|.+++.|.-..+-.+ ++--+|-.+.+-++++||...|+..+. .++.++. T Consensus 232 ~g~~------~-vl~----------~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~----~~~~e~~ 290 (406) T protein:vir:97 232 GGSP------L-VFD----------STMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVNSP----NQSVAQL 290 (406) T ss_pred cCce------e-ecC----------CCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCCC----cchHHHH Confidence 2322 1 222 2566666642111111 122233367788899999999964333 1233444 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019530. 406 TQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEP 485 (560) Q Consensus 406 tRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p 485 (560) .+. |.+++ |+-.+. .+.+.|-+. ++++.+|.. -+|.|++.. .+..|++.+..+ T Consensus 291 ~~~---f~~~~--l~P~~~-~ie~~l~~k-----ll~~~~~~~---~~i~fd~~~-----------~~~~~~~~~~~~-- 343 (406) T protein:vir:97 291 MED---YVTND--LPFYFD-AITSELGLK-----TLNDKDRRL---YHIEFDTRS-----------VTGRNVDEIVKL-- 343 (406) T ss_pred HHH---HHHHH--HHHHHH-HHHHHHhhh-----hcChhhccc---eeEEEecCc-----------cchhhHHHHHHH-- Confidence 443 65543 433222 223333333 356666532 235554432 233444444332 Q ss_pred hhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcc---ccccccccccCCCCCCcccccccCCc-----c Q lcl|NC_019530. 486 FVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQAD---ETGNYSGSDVSPLKFKPETIPFSGST-----D 557 (560) Q Consensus 486 ~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~---e~g~~~~~~~~~~~~~p~~~~~~~~~-----~ 557 (560) +-+-.++.+-++. +|++. -.+.|..+ .+++..+.+..+..-.+.-.+-.||+ | T Consensus 344 ~~~g~~T~NE~R~-~~g~~------------------p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~gg~~~~~~~ 404 (406) T protein:vir:97 344 VNNQILTPNQGLV-ELGKQ------------------KSTDPNMDRYQSSLNYVFLDKKEEYQDKVGIKGKGGEVNAEED 404 (406) T ss_pred HhCCCcCHHHHHH-HhCCC------------------CCCCCCCCeEeeccCccchhcccccccccccccCCCCCCCCCC Confidence 1112344444432 22321 11222222 12222222221111111111222333 2 Q ss_pred cC Q lcl|NC_019530. 558 DS 559 (560) Q Consensus 558 ~~ 559 (560) +| T Consensus 405 ~~ 406 (406) T protein:vir:97 405 KS 406 (406) T ss_pred CC Confidence 33 No 130 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=96.06 E-value=0.0011 Score=36.89 Aligned_cols=405 Identities=13% Similarity=0.103 Sum_probs=177.9 Q ss_pred hhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHHHhhccee Q lcl|NC_019530. 22 VDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVV 101 (560) Q Consensus 22 ~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneai 101 (560) ...+++-.+.. ..+.+ .+...+-....-++..+...+.+ .....+++|.|..||+-|-+.+. T Consensus 1 m~~~~~f~~~~----~~~~~--~~~~~~~~~~~~~~~~~~~~~~v------------~~~~al~~~~v~~~i~~Ia~~ia 62 (416) T protein:vir:12 1 MLLERMFEKRS----GSSDH--EDGFNNILLNMFGGRKTASGERV------------SESNSLVQPDIFACVNVLSDDIA 62 (416) T ss_pred Cccchhccccc----Ccccc--CccchhHHHHhhcCcccccCcee------------chhhhhccHHHHHHHHHHHHhhh Confidence 22222211111 00000 00000000001111111111111 11245678999999987766554 Q ss_pred eecCCCceEEEeeecccch---h-HHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeE Q lcl|NC_019530. 102 TCEEDETPVTVNLDKVEGI---S-DTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLV 177 (560) Q Consensus 102 v~d~~~~~V~l~Ld~~~~~---s-~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr 177 (560) -+ |+.+--...++. . ..+-..+.++-... +....-.+.++..+++.|.-|..++.|.. .-+.+|. T Consensus 63 ~l-----~~~~~~~~~~~~~~~~~~~l~~~l~~~PN~~---~t~~~f~~~~v~~lll~Gna~~~i~r~~~---G~~~~L~ 131 (416) T protein:vir:12 63 KL-----PIHTYKRTDGGIERKPEHKSAHAVYARPNPY---MTAFTWKKLMMTHVLTWGNAYSYIQFGSH---GYPEALF 131 (416) T ss_pred hC-----ceEEEEecCCccccccccHHHHHHHhhcccC---CCHHHHHHHHHHHHhhcCCeEEEEEECCC---CcEEEEE Confidence 32 222211111100 0 00111122211111 22223344556667788988876665432 2388999 Q ss_pred EeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeecccccCCCC Q lcl|NC_019530. 178 MLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLANG 257 (560) Q Consensus 178 ~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~~ 257 (560) +|+|..|..+. +..++ ..||.|.. .+..+.++.+.|.|... ...++- T Consensus 132 ~l~~~~v~v~~-----~~~~~-------~~~~~~~~-------------------~g~~~~~~~~eiih~~~--~~~~~~ 178 (416) T protein:vir:12 132 PLRPDYTNAYV-----HPTTG-------MLWYQTVL-------------------NGKAIELYDYEVLHFKG--LSTDGI 178 (416) T ss_pred EECCcceEEEE-----eCCCc-------EEEEEEec-------------------CCeEEEecCccEEEecC--cCCCCc Confidence 99998887542 11121 11222211 12345688888887763 344555 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccch Q lcl|NC_019530. 258 IVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLM 337 (560) Q Consensus 258 ~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~m 337 (560) +..|.|+.|.+++......+....=+=-..+.-+-|..++ +.+.+..+++ +++-+++..| .|.+ + T Consensus 179 ~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~-~~~~~~~~~~-------~~~~------~ 243 (416) T protein:vir:12 179 HGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVP-AFLDEKPKEN-VRKEWKRVNK-------VENI------A 243 (416) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecC-CCCCHHHHHH-HHHHHHHHhc-------CCCe------e Confidence 6678999999999988888887665544556667777776 3455544433 3333332211 1221 1 Q ss_pred hhHhhhcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHH- Q lcl|NC_019530. 338 GIAEDYWLPRREGQNATEIATVGG-GNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKF- 415 (560) Q Consensus 338 smlEDywLpRReGgrgTEIsTLpG-g~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kf- 415 (560) .++ .|++++.|.- ...+.-++-.+|..+.+.++++||.+-|+...+.+ .+++.+..+. |..+ T Consensus 244 -vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~sn~e~~~~~---f~~~~ 307 (416) T protein:vir:12 244 -IID----------YGLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKAT--FSNIEHQSIE---YVRNT 307 (416) T ss_pred -ecC----------CCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCC--cccHHHHHHH---HHHHH Confidence 221 2455555532 12233345566778899999999999997544322 3333434333 4433 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHH Q lcl|NC_019530. 416 CAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDY 495 (560) Q Consensus 416 i~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~ 495 (560) |.-+-.+ +.+.|-+ .++++.++. -..+|.|+ -+ .+... -...|++.+..+-.- -++|.+- T Consensus 308 l~P~~~~----ie~~l~~-----~l~~~~~~~--~g~~i~fd--~~----~l~~~-d~~~~~~~~~~~~~~--G~~T~NE 367 (416) T protein:vir:12 308 LQPWIVN----FEQELNV-----KLFLDHDQK--SGHYVKFN--ID----SELRG-DSKTQAEYLKTLHET--GVLNKDE 367 (416) T ss_pred HHHHHHH----HHHHHHH-----hhcCchhhc--CCceEEee--ch----hhhcc-CHHHHHHHHHHHHhC--CCcCHHH Confidence 2223222 2222322 334555442 12234443 22 22211 123455555544222 3778888 Q ss_pred HHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 496 VMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 496 i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) ++. .|.|.+-| --++-+ .+++..+.+..+ .++.....|.+.+.= T Consensus 368 ~R~-~~gl~Pi~--ggd~~~---------------~~~n~~~~~~~~---~~~~~~~~~~~~gge 411 (416) T protein:vir:12 368 IRE-LLERNPIE--NGDKYI---------------SSLNYVFLDFLE---EYQRLKAGGAMKGGD 411 (416) T ss_pred HHH-HhCCCCCC--Ccceee---------------eccccccccccc---hhhccccccccCCCC Confidence 774 46664421 000000 001111100000 000011111000000 No 131 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=95.88 E-value=0.0013 Score=36.39 Aligned_cols=405 Identities=13% Similarity=0.092 Sum_probs=172.6 Q ss_pred cccchhh---------HHHHHHHHHHHhhccchhHHHH-------------------------------HhhcceeeecC Q lcl|NC_019530. 66 VESELLS---------VKTVVEEYQSMAQQPEIRKAVD-------------------------------IIVNDVVTCEE 105 (560) Q Consensus 66 ~e~~~~~---------~~~LI~~YR~ma~~PEvd~AI~-------------------------------eIvneaiv~d~ 105 (560) ++....+ ..+.+.+|+.+..+.+-..+|. .||+-.+-+ - T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~y-l 79 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGY-V 79 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhh-e Confidence 3332211 2345666777666655544321 111111111 1 Q ss_pred CCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccc Q lcl|NC_019530. 106 DETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIR 185 (560) Q Consensus 106 ~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~ 185 (560) -..||++..++. ...+++.+.| + -+|+....++.+.|.+.|+-|.+.-+|. +|...+..+||..+- T Consensus 80 ~G~p~~~~~~d~-----~~~~~l~~~~----~-~~~~~~~~~l~~~~~~~G~a~~~~y~d~----~~~~~~~~~~p~~~~ 145 (470) T protein:vir:10 80 ASVFPDIDVGKD-----ADNKKIIDVL----G-DDRALTLNGLLVDSSNAGRAWLHYWIDE----DGNFRYGIIQPDQIT 145 (470) T ss_pred eccceeeecCch-----HHHHHHHHHH----h-hhHHHHHHHHHHHHhhcCeeEEEEEecC----CCceEEEEEcccceE Confidence 234555544432 2333444433 3 2577788889999999999999877754 467889999999888 Q ss_pred hhhhhcccCC-cccccccc--------ccchheeecCcccccccccccccccccccccc-----ceec-c-hhceeeee- Q lcl|NC_019530. 186 PVYIVEKAMR-EGGIEAIE--------SVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQ-----ELVF-D-DESIVYID- 248 (560) Q Consensus 186 ~vr~i~~~~~-~g~~~~~~--------~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~-----~~~i-~-~daI~y~h- 248 (560) ++..=....+ .-.+.+.. .+..+-+|.+..-.+.....+. .....+... .... + ...-++.| T Consensus 146 ~v~d~~~~~~~~a~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (470) T protein:vir:10 146 PIYATTLDNKLLGILRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATD-STVIEPYNIITSYDLSAGYETGQSNTLKHN 224 (470) T ss_pred EEEcCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCc-ceeccccccccccccccccccccccccccC Confidence 7742110000 00111110 0111122332222111110000 000000000 0000 0 00000111 Q ss_pred ccccc----CCCCcchhhhHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheee Q lcl|NC_019530. 249 SGEEP----LANGIVPGLLNPAIRPLNNLVT-TEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAY 323 (560) Q Consensus 249 sGl~~----~~~~~i~syL~~AiK~~NqL~m-~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vY 323 (560) +|-++ +++....|=|+..+...+.+.. +=+....-+.+..|-.-+.-.+.-+++ +.+. -+.+++- T Consensus 225 ~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~-----~~~~-~~~~~~~---- 294 (470) T protein:vir:10 225 FGRVPFIEFSKNKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLH-----QFMN-DLRKYKS---- 294 (470) T ss_pred CCeeeEEEeecCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccc-----hhhh-hhhhcCe---- Confidence 11111 1223344566665555555432 333343445555555444332211211 1111 1111111 Q ss_pred cccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccccch Q lcl|NC_019530. 324 DRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEM-DHVNYFREKLYDALMIPKSRLQEEGSINIGGSNL 402 (560) Q Consensus 324 d~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~ 402 (560) .+++-.+.+.|-.+..|--..+.... .-++-+.+.+|+-..+|- +..++. |...+ T Consensus 295 -------------------i~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~--~~~~~~---gn~Sg 350 (470) T protein:vir:10 295 -------------------IKINNTGNGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGID--PANFES---SNASG 350 (470) T ss_pred -------------------EeccCCCCCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCC--CCcccc---ccchH Confidence 12333233344445665554554433 344667778888899994 333322 22222 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 403 AEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNT 482 (560) Q Consensus 403 ~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~ 482 (560) ..|..-...--.-+.+.++.|...+...++.=+-+-|+. .-+|. .|.+.|...--=.+... +++++. T Consensus 351 ~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~~~-~~d~~-----~i~i~f~~~~p~d~~e~-------~~~~~~ 417 (470) T protein:vir:10 351 VAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS-DADKR-----HISQHWTRTKVEDSLTK-------AQIVST 417 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-Ccccc-----eeeEEeccCCCCCHHHH-------HHHHHH Confidence 333222222233466677777777777666433333542 23342 35666654432222222 333444 Q ss_pred hhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcC--CCCCCCccccccccccc Q lcl|NC_019530. 483 VEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKG--LYPKVQADETGNYSGSD 539 (560) Q Consensus 483 ~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~--~~~~p~~~e~g~~~~~~ 539 (560) + +| .+|.+++++. |...++ .+++-++|++|..+. ..+++.+.+.++...+. T Consensus 418 ~---~g-~iS~et~l~~-~p~v~D-~~~E~eri~~E~~e~~~~~~~~~~~~~~~~dde~ 470 (470) T protein:vir:10 418 V---AN-YSSKEAVAKA-NPIVDD-WQQELKDLAKDKEENDPYSNQADELNGKGVNDEQ 470 (470) T ss_pred H---hc-cCcHHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHhhccccccCCCCCCCCC Confidence 3 45 4899999976 665432 344455555554222 11222111111111111 No 132 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=95.79 E-value=0.0015 Score=36.14 Aligned_cols=376 Identities=12% Similarity=0.117 Sum_probs=163.9 Q ss_pred chhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhh Q lcl|NC_019530. 6 RGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQ 85 (560) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~ 85 (560) -|+|..++ .. +. ..+..+... +........++....++. . +...+ T Consensus 1 Mg~~~~~~----------~~----~~----~~~~~~~~~---~~~~~~~~~~~~~~~~v~-------~-------~~al~ 45 (385) T protein:vir:10 1 MGLLTPRN----------FN----KR----KAKNMVYPS---NPAFFTTTVGGMQLSYVS-------A-------LSALQ 45 (385) T ss_pred Cccccchh----------cc----cc----ccccccccc---chhhhhhhccccCccccC-------H-------HHhhc Confidence 12222111 00 00 001111110 000011111111111111 1 12356 Q ss_pred ccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHH----HHHHhhhhhhhhhh Q lcl|NC_019530. 86 QPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQ----KIRKWYVDGRQAYH 161 (560) Q Consensus 86 ~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~----~~R~wYvDGr~~~h 161 (560) +|-|.+||.-|.+.+.-. |+.+ .... ...+++-=+-...+++ ++..+..+|.-|.. T Consensus 46 ~~~v~~~i~~ia~~ia~~-----p~~v--~~~~-------------~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~ 105 (385) T protein:vir:10 46 NTNVYSVINRIASDVASA-----HFKT--ENTA-------------TLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIP 105 (385) T ss_pred cHHHHHHHHHHHHHHhhC-----ceee--eccc-------------hhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEE Confidence 788999998777765433 2322 2211 1112222222223333 44466678988876 Q ss_pred heecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecch Q lcl|NC_019530. 162 VIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDD 241 (560) Q Consensus 162 kiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~ 241 (560) ++-| ..++.++||.++++++ +..+ -+|.|.. ...+....++. T Consensus 106 i~r~-------~~~~~p~~~~~v~~~~-----~~~~---------~~~~~~~-----------------~~~~~~~~~~~ 147 (385) T protein:vir:10 106 LVGQ-------NLEHIPNSDVQINYLP-----GNMG---------IVYTVLE-----------------SNDRPQMVLRQ 147 (385) T ss_pred EEcC-------ceeEeecCCceEEEEE-----cCCc---------eEEEEEE-----------------cCCceEEEEcc Confidence 5533 2678899998877642 1111 1122211 01123466888 Q ss_pred hceeeeec-ccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhh Q lcl|NC_019530. 242 ESIVYIDS-GEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNR 320 (560) Q Consensus 242 daI~y~hs-Gl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk 320 (560) +.|.|..- ..-..+.-+.+|.|..|.++++....+++...=+----+--+-+..++.+-..+..+ +=+++-+.+.... T Consensus 148 ~eiihik~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~-~~~~~~~~~~~~~ 226 (385) T protein:vir:10 148 DQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDL-ESAREEFEKANTG 226 (385) T ss_pred ccEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHH-HHHHHHHHHHhCc Confidence 88888752 111112334578999999999999999887665544445566777777444444333 3333333333211 Q ss_pred eeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchH---HHH-HHHHHHHHHhcCCCccccCCCCccc Q lcl|NC_019530. 321 NAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEM---DHV-NYFREKLYDALMIPKSRLQEEGSIN 396 (560) Q Consensus 321 ~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei---~DV-~YF~~kLy~aL~VP~sRl~~~~~~~ 396 (560) ...|. .+ .++ .|.+++.|.- +..++ .+. +|-.+.+.++++||...|+...+.+ T Consensus 227 ----~n~~~------~~-vl~----------~g~~~~~l~~--~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~ 283 (385) T protein:vir:10 227 ----DNSGR------LM-VLP----------DGFDYTQLEM--KTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTE 283 (385) T ss_pred ----cccCC------cc-ccC----------CCceEEecCC--ChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCC Confidence 11222 22 222 2566776643 33332 233 5557889999999999997644333 Q ss_pred ccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHH Q lcl|NC_019530. 397 IGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDR 476 (560) Q Consensus 397 l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~R 476 (560) ...+++ .....-| +..|+--+ ..+.+.|...|+ ++ .+.|+ +.++...+ +..| T Consensus 284 ~~~sn~---eq~~~~~---~~~l~P~~-~~ie~~l~~~l~-----~~---------~~~f~------~~~ll~~d-~~~~ 335 (385) T protein:vir:10 284 SQHSNI---DQIKATY---LANLNSYV-NPIVDELRLKMN-----AP---------DLELD------IKDMLDVD-DSAL 335 (385) T ss_pred cccccH---HHHHHHH---HHHHHHHH-HHHHHHHHHhhC-----Cc---------eEEee------chhhhccC-HHHH Confidence 344432 2222223 33454322 233344444432 21 13332 22333222 2344 Q ss_pred HHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccccCCc Q lcl|NC_019530. 477 LASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGST 556 (560) Q Consensus 477 l~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~ 556 (560) ++.++.+-.- -++|.+-++. ++++. | +|+ +.|+. ..++...+. .|-. T Consensus 336 ~~~~~~~~~~--G~~T~NE~R~-~~g~~-----------------p-~p~----~~~~~-------~~~~~~~~~-~g~~ 382 (385) T protein:vir:10 336 INQVSNLAKS--GVLGAEQAQF-ILTRS-----------------G-FLP----DNLPE-------FKPLTTQVK-GGDE 382 (385) T ss_pred HHHHHHHHhC--CCcCHHHHHH-HhCCC-----------------c-cCC----CCCcc-------ccCcccccC-CCCC Confidence 4444432211 3555555553 33331 1 111 00111 111111111 1111 Q ss_pred ccC Q lcl|NC_019530. 557 DDS 559 (560) Q Consensus 557 ~~~ 559 (560) +|- T Consensus 383 ~dn 385 (385) T protein:vir:10 383 GDN 385 (385) T ss_pred CCC Confidence 111 No 133 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=95.73 E-value=0.0015 Score=36.00 Aligned_cols=448 Identities=14% Similarity=0.068 Sum_probs=171.3 Q ss_pred hhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHH-----HHHHHH Q lcl|NC_019530. 8 FFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTV-----VEEYQS 82 (560) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~L-----I~~YR~ 82 (560) .+.+|-.-+..-..++.-.. . ..+...+........+ +++|+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~--~------------------------------~~~~~~~~~~~~i~~~i~~~~~~~~~~ 48 (503) T protein:vir:59 1 MADIYPLGKTHTEELNEIIV--E------------------------------SAKEIAEPDTTMIQKLIDEHNPEPLLK 48 (503) T ss_pred CcccccCChhhHHhHHHhhh--h------------------------------hhhhccchhHHHHHHHHHhhcHHHHHH Confidence 11111111000000000000 0 0000000000001111 234555 Q ss_pred HhhccchhHHHHHhh-----------------------------cceeeecCCCceEEEeeecccchhHHHHHHHHHHHH Q lcl|NC_019530. 83 MAQQPEIRKAVDIIV-----------------------------NDVVTCEEDETPVTVNLDKVEGISDTVKESITECFK 133 (560) Q Consensus 83 ma~~PEvd~AI~eIv-----------------------------neaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~ 133 (560) +..+++-+..|..-. -+..+.---..||++..++. ...+.+ + T Consensus 49 ~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~yl~g~~~~~~~~d~-----~~~~~l----~ 119 (503) T protein:vir:59 49 GVRYYMCENDIEKKRRTYYDAAGQQLVDDTKTNNRTSHAWHKLFVDQKTQYLVGEPVTFTSDNK-----TLLEYV----N 119 (503) T ss_pred HHHHhccccchhhccchhcccccccccccccccceeecchHHHHHHHHHhhhhcCCeeeccCcH-----HHHHHH----H Confidence 555544333221111 01111112234555544432 223333 2 Q ss_pred HHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCC-ccccccccc-------c Q lcl|NC_019530. 134 EVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEAIES-------V 205 (560) Q Consensus 134 ~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~~~~-------~ 205 (560) .+.+ .+|+....++.+..++.|+-|.+.-+|. +|-..++.+||+++-+++.=....+ ...+..+.. + T Consensus 120 ~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~----dg~~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~ 194 (503) T protein:vir:59 120 ELAD-DDFDDILNETVKNMSNKGIEYWHPFVDE----EGEFDYVIFPAEEMIVVYKDNTRRDILFALRYYSYKGIMGEET 194 (503) T ss_pred HHHh-cCHHHHHHHHHHHHhhCCeEEEEEeecC----CCceEEEEEccceeEEEEeCCCCCceEEEEEEEEEecCCCceE Confidence 2333 4799999999999999999998877654 3567899999999887642211110 011111110 0 Q ss_pred chheeecCccccccccccccc-cccc--cc-cccceecchhceeeeeccccc----CCCCcchhhhHHHHHHHHHHH-HH Q lcl|NC_019530. 206 TLKYYYNPNYNRNQFTGQSGT-SQNF--QP-SQQELVFDDESIVYIDSGEEP----LANGIVPGLLNPAIRPLNNLV-TT 276 (560) Q Consensus 206 ~e~~~Y~p~~~~~~~~g~~~~-~~~~--~~-~~~~~~i~~daI~y~hsGl~~----~~~~~i~syL~~AiK~~NqL~-m~ 276 (560) .-.-+|.+..-+......... .... .. ..........+.. +|.++ .++....|-|+.++.....+. ++ T Consensus 195 ~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~vPiv~~~nn~~~~sd~~~~~~liDa~d~~~ 271 (503) T protein:vir:59 195 QKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIG---WGRVPIIPFKNNEEMVSDLKFYKDLIDNYDSIT 271 (503) T ss_pred EEEEEEeCCcEEEEEEcCCcccccccccccccccceeecceecc---CCccceEEecCCCCCCcchhhhHHHHHHHHHHH Confidence 111134443222111110000 0000 00 0000000000000 11110 123334455666555555544 34 Q ss_pred HHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccce Q lcl|NC_019530. 277 EDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEI 356 (560) Q Consensus 277 EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEI 356 (560) -+.+..-+.++.|-+.+--.+.-+.+ +.... |..+ +++ .+| +++ .+ T Consensus 272 s~~~~~~~~~~~~~~v~~g~~~~~~~-----~~~~~-~~~~--~~~---------------------~~~--~~~---~~ 317 (503) T protein:vir:59 272 SSTMDSFSDFQQIVYVLKNYDGENPK-----EFTAN-LRYH--SVI---------------------KVS--GDG---GV 317 (503) T ss_pred HHHHHHHHHhcCCeeEeecCCccccc-----hhhhh-hhcc--cce---------------------ecc--CCC---cc Confidence 55555667777775544323222211 11110 1110 100 011 111 24 Q ss_pred eecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 357 ATVGGGNQLGEM-DHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQL 435 (560) Q Consensus 357 sTLpGg~nLgei-~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qL 435 (560) ..|-...+.+.. .-+.-+++.+|+...+|---.+.- + |...+..|......-..-+.+.++.|...+.++++.-+ T Consensus 318 ~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~---~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~ 393 (503) T protein:vir:59 318 DTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPETI-G---GGATGPALENLYALLDLKANMAERKIRAGLRLFFWFFA 393 (503) T ss_pred eeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-c---ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 444443344332 345667778888888883221111 1 22223444433333444566666666666666655433 Q ss_pred HhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCH---HHHHHHH Q lcl|NC_019530. 436 ILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSD---EEVKEQQ 512 (560) Q Consensus 436 iLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tD---eEI~e~~ 512 (560) -+-++....++.+ .. .|.+.|... ...-+. +.++.+..+-.- | .+|++++++. |..++ +|++.++ T Consensus 394 ~~~~~~~~~~~~~-~~-~i~i~f~~~-~p~d~~------~~~~~~~kl~~~-G-iiS~et~l~~-l~~v~d~~~E~~ri~ 461 (503) T protein:vir:59 394 EYLRNTGKGDFNP-DK-ELTMTFTRT-RIQNDS------EIVQSLVQGVTG-G-IMSKETAVAR-NPFVQDPEEELARIE 461 (503) T ss_pred HHHHhccCccccc-cc-ceeEEeCCC-CCCCHH------HHHHHHHHHHhC-C-CCchHHHHHh-CCCCCCHHHHHHHHH Confidence 3334433333332 22 367777433 222211 234444433221 3 5899999977 55443 4444444 Q ss_pred HHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccccCCcccC Q lcl|NC_019530. 513 TKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDDS 559 (560) Q Consensus 513 kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~ 559 (560) ++.+++.+. ......++.|.....++.+.+-..+ -.||.+-+ T Consensus 462 ~E~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~g~~~ 503 (503) T protein:vir:59 462 EEMNQYAEM--QGNLLDDEGGDDDLEEDDPNAGAAE---SGGAGQVS 503 (503) T ss_pred HHHHHHHhh--hccccCccCCCCCCCcCCCCCCccc---CCCCCCcC Confidence 333322210 1111111222211111111111111 11222222 No 134 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=95.64 E-value=0.0017 Score=35.78 Aligned_cols=444 Identities=11% Similarity=0.062 Sum_probs=167.6 Q ss_pred hhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEec--cccceeccccccch----hhHHHHH-------- Q lcl|NC_019530. 12 FGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQG--GANTYNYVGVESEL----LSVKTVV-------- 77 (560) Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~--~~~~~~y~~~e~~~----~~~~~LI-------- 77 (560) .=-|--|..+++.. .. -...+.. +..+ .|...+... .....+| T Consensus 1 ~~~~~~~~~~~~~~---~~-------------------~~~~~~~~~n~~~-~~~~~~~~~~~~~~~i~~~i~~~~~~~~ 57 (511) T protein:vir:10 1 MLKVNEFETDTDLR---GN-------------------INYLFNDEANVVY-TYDGTESDLLQNVNEVSKCIEHHMDYQR 57 (511) T ss_pred Cccccchhhhhhhh---hh-------------------hhhhhhhhhcCCc-cCchhhhhcccCHHHHHHHHHHHHHhhH Confidence 00000011111100 00 0001111 1111 111111111 1111222 Q ss_pred HHHHHHhhccchhHHH----------------------HHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHH Q lcl|NC_019530. 78 EEYQSMAQQPEIRKAV----------------------DIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEV 135 (560) Q Consensus 78 ~~YR~ma~~PEvd~AI----------------------~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~i 135 (560) .+|+.+..+.+=+..| .-||+ ..+.=--..||++..++ +...+.+..+ T Consensus 58 ~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~-~~~~yl~g~p~~~~~~d---------~~~~~~l~~~ 127 (511) T protein:vir:10 58 PRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISD-FINGYFLGNPIQYQDDD---------KDVLEAIEAF 127 (511) T ss_pred HHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHH-HHhhhhcccCceeecCc---------hHHHHHHHHH Confidence 2344444433322211 01111 11111123444443332 1344667778 Q ss_pred HHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCC-cccccccc---------cc Q lcl|NC_019530. 136 MHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEAIE---------SV 205 (560) Q Consensus 136 l~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~~~---------~~ 205 (560) .+..+|+....++.+.+.+.|+-|.+.-+|. +|-..+..+||+.+-+|..-....+ ..++.... .. T Consensus 128 ~~~n~~~~~~~~~~~~~~i~G~ay~~vy~de----dg~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~ 203 (511) T protein:vir:10 128 NDLNDVESHNRSLGLDLSIYGKAYEIMIRNQ----DDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDE 203 (511) T ss_pred HhhcCHHHHHHHHHHHHHhcCeeEEEEEeCC----CCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecccCccce Confidence 8888999999999999999999887665543 4668899999999988753211111 11111110 00 Q ss_pred chh-eeecCccccccccccccccccc-----cccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHH-HHH Q lcl|NC_019530. 206 TLK-YYYNPNYNRNQFTGQSGTSQNF-----QPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVT-TED 278 (560) Q Consensus 206 ~e~-~~Y~p~~~~~~~~g~~~~~~~~-----~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m-~ED 278 (560) ..+ -+|++..-+............. ..+-.--++|. |.|. ++....|-++..+....-+.. +=+ T Consensus 204 ~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPv--v~f~-------nn~~g~gd~e~v~~liDa~d~~~S~ 274 (511) T protein:vir:10 204 VFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPI--TEFS-------NNERRKGDYEKVITLIDLYDNAESD 274 (511) T ss_pred EEEEEEEeCCcEEEEEecCCCcccccccccccccccCcceeE--EEec-------CCCCCCCchhhhHHHHHHHHHHHHH Confidence 111 1355543332222111100000 00000001221 1121 222334556655555443332 222 Q ss_pred HHHHHHHhcCccceEEE---ccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhh-hcccc--cCCCC Q lcl|NC_019530. 279 ATVIYAITRAPEKRAFY---LDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAED-YWLPR--REGQN 352 (560) Q Consensus 279 alVIyRitRAPeRRvFy---IDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlED-ywLpR--ReGgr 352 (560) ....-+-++.|-+-+.= .|.+.+++.+. ..+-.+.. .+.+. ...+. T Consensus 275 ~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~----------------------------~~~~~~~~~~~~~~~~~~~~~ 326 (511) T protein:vir:10 275 TANYMSDLNDAMLLIKGNLNLDPVEVRKQKE----------------------------ANVLFLEPTVYADSEGRETEG 326 (511) T ss_pred HHHHHHHhhCceeeeeccccCCchhhccchh----------------------------ccceecccccccccccccCCC Confidence 22222334444433321 11111111100 00000000 00000 01112 Q ss_pred ccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 353 ATEIATVGGGNQLGEM-DHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFL 431 (560) Q Consensus 353 gTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~L 431 (560) |..+..|-...+...+ .-+.-+.+.+|.-.++|---. ++|. |...+..|.--+.....-+.+.++.|...+...+ T Consensus 327 ~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~---~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~ 402 (511) T protein:vir:10 327 SVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKD---DNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRA 402 (511) T ss_pred CcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccc---cccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2334444433333322 345667778888888885222 1211 1222333433333344445566666666666555 Q ss_pred HHHHHh----ccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHH Q lcl|NC_019530. 432 RRQLIL----KGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEE 507 (560) Q Consensus 432 k~qLiL----kgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeE 507 (560) +.-+-+ .++-.+.+|. .|.+.|..---=. +.+.++++..+ +| .+|.+++++. |..+++ T Consensus 403 ~li~~~~~~~~~~~~~~d~~-----~i~i~f~~~~p~d-------~~~~~~~~~kl---~G-~iS~et~~~~-l~~v~d- 464 (511) T protein:vir:10 403 KLLETILKNTRSIDANKDFN-----TVRYVYNRNLPKS-------LIEELKAYIDS---GG-KISQTTLMSL-FSFFQD- 464 (511) T ss_pred HHHHHHHHhhCCcccccccc-----eeeEEeCCCCCcC-------HHHHHHHHHHH---hc-cCcHHHHHHh-CCCCCC- Confidence 442222 1222345553 3667775421111 12345556655 35 4999999977 555432 Q ss_pred HHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 508 VKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 508 I~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) -+++.++|++|..+. .+.......++..+.... .-. ..+...+. T Consensus 465 ~~~E~~ri~~E~~~~-~~~~~~~~~~~~~~~~~~--~~~------~~~~~~~~ 508 (511) T protein:vir:10 465 PELEVKKIEEDEKES-IKKAQKGIYKDPRDINDD--EQD------DDTKDTVD 508 (511) T ss_pred HHHHHHHHHHHHHHH-HHHHhhhcccCCCCCCCC--CCC------CcccCccc Confidence 233344555554322 111111111111111110 000 11111111 No 135 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=95.41 E-value=0.0021 Score=35.24 Aligned_cols=386 Identities=12% Similarity=0.118 Sum_probs=164.2 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCc-eeEeccccceeccccccchhhHHHHHHH Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGS-IILQGGANTYNYVGVESELLSVKTVVEE 79 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~-~~i~~~~~~~~y~~~e~~~~~~~~LI~~ 79 (560) |. .|||+.| . ++ .........+...+...+.+ ...-++ ..+.++ + T Consensus 1 m~---m~~~~~~----------~-----~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~v-------~------- 46 (392) T protein:vir:74 1 MI---LPILNFI----------N-----QT-NDPPEAGSVQSYFPDGNDAQIMESLLG-DNNEWV-------S------- 46 (392) T ss_pred Cc---chhhhhh----------h-----cc-cCcccccccccccccCchhhhhhhccC-CCCccc-------c------- Confidence 21 1222211 1 00 00111111111111111111 000001 001111 1 Q ss_pred HHHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhh Q lcl|NC_019530. 80 YQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQA 159 (560) Q Consensus 80 YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~ 159 (560) -+..+++|-|..||+-|.+.+.- +.+.+.... .+.+.+.-.. .+.-..-.+.++..+++.|.-| T Consensus 47 ~~~al~~~~v~~~v~~ia~~ia~-------lp~~~~~~~------~~~l~~~PN~---~~t~~~f~~~~~~~lll~Gna~ 110 (392) T protein:vir:74 47 ARAALRNSDLFSIILQLSSDLAI-------VKINAEKKK------NQGIIDNPST---NANKHGFWQSMFAQLLLGGEAF 110 (392) T ss_pred hhhhhcchHHHHHHHHHHHhhcc-------Cceeeccch------hhhhhhhcCC---CCCHHHHHHHHHHHhhhcCCEE Confidence 12346789999999877776542 222332221 0111111111 1111122344566788889888 Q ss_pred hhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceec Q lcl|NC_019530. 160 YHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVF 239 (560) Q Consensus 160 ~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i 239 (560) ..++.|. ...+.+|++|+|.+|..++ +.+++. .+|.|... + ...+....+ T Consensus 111 ~~i~r~~---~G~~~~L~~i~~~~v~v~~-----~~~~~~-------~~y~~~~~-------~--------~~~~~~~~~ 160 (392) T protein:vir:74 111 AYRWRNA---NGADMKWEYLRPSQVNTYY-----FEYENG-------MYYNITFD-------D--------PKIEPILQA 160 (392) T ss_pred EEEEECC---CCcEEEEEEEcCceeEEEE-----cCCCce-------EEEEEEec-------C--------CccceeEEE Confidence 7777654 3448999999999887542 222321 12223210 0 011234568 Q ss_pred chhceeeeecccccCCCC-cchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhh Q lcl|NC_019530. 240 DDESIVYIDSGEEPLANG-IVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFK 318 (560) Q Consensus 240 ~~daI~y~hsGl~~~~~~-~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~k 318 (560) +.+.|.|.+. ...++. +..|-|..|+..+.....++....=+=---+--+-+..++-+..+..++.+.++ ..|. T Consensus 161 ~~~evih~~~--~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~~~~~~~---~~~~ 235 (392) T protein:vir:74 161 PQSDLIHMKL--LSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRS---RSFM 235 (392) T ss_pred cCccEEEecC--CCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHH---HHHh Confidence 8888888873 344442 467999999999999888887766554455556667777766555444433322 2333 Q ss_pred hheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccc Q lcl|NC_019530. 319 NRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGSINI 397 (560) Q Consensus 319 nk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l 397 (560) .. ...|. .+ .++ .|++++.|.-...-.| ++=.+|..+.+.++++||...++..+..+ T Consensus 236 ~~----~n~g~------~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~- 293 (392) T protein:vir:74 236 KR----SRSGG------PV-VLD----------DLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQ- 293 (392) T ss_pred cc----ccCCC------ee-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc- Confidence 21 11221 22 222 2566776643222222 45567888999999999999996533211 Q ss_pred cccchhhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHH Q lcl|NC_019530. 398 GGSNLAEITQEELRFSK-FCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDR 476 (560) Q Consensus 398 ~~~~~~eItRDElkF~K-fi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~R 476 (560) ++. +-.+. |.. .+.-+.+++ .+.|...|. +.-+++. . ..+..| .. .+ T Consensus 294 --~~~-e~~~~---~~~~~l~p~~~~i----e~~l~~~l~-----~~~~~~~--~----~~~~~d--~~---------~~ 341 (392) T protein:vir:74 294 --SSI-QQISG---MYASALNRYLRPA----ISELEYKLS-----DHISVNM--R----PAIDPL--GD---------NY 341 (392) T ss_pred --cHH-HHHHH---HHHHHHHHHHHHH----HHHHHHhcc-----chhcccc--h----hhhcCC--HH---------HH Confidence 211 11122 322 222222222 222333332 1111111 0 000011 00 11 Q ss_pred HHHHHHhhhhhc-chhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccc-cccccccCCCCCCccccc Q lcl|NC_019530. 477 LASLNTVEPFVG-SIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETG-NYSGSDVSPLKFKPETIP 551 (560) Q Consensus 477 l~~l~~~~p~vG-ky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g-~~~~~~~~~~~~~p~~~~ 551 (560) .. .++..++ -.+|..-++ ++. ...|..|+-....++ ..-++.. ..|.+| T Consensus 342 ~~---~~~~l~~~g~~t~near------------~~~------~~~g~~pne~r~~enl~~~~~Gd-----~~~p~p 392 (392) T protein:vir:74 342 LS---TISTATRWGALAENQAT------------FVL------QEAGYIPKDLPAPENTNKKTTGQ-----SNEPVP 392 (392) T ss_pred HH---HHHHHHhCCCcCHHHHH------------HHH------HhCCCCccccchhcCCCCCCCCC-----CCCCCC Confidence 11 1222211 233333333 221 123443321111111 1111111 122333 No 136 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=95.38 E-value=0.0022 Score=35.18 Aligned_cols=395 Identities=14% Similarity=0.132 Sum_probs=171.0 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) |+.|+.+... +..++...+..+.+.+ +..... .+.. -.+....++|-|..||. T Consensus 1 Mgl~~~~f~~-----~~~~~~~~~~~~~~~~----------~~~~~~--~g~~----------v~~~~al~~~~v~~~v~ 53 (409) T protein:vir:84 1 MSLFTRIFSG-----PSEERTLTKISGIPSP----------AEDWAM--HGDR----------PGANSAMTLGAFYACVT 53 (409) T ss_pred CchhhhhhcC-----CCcccccccccccccc----------cchhhc--cCcc----------cchhhhhccHHHHHHHH Confidence 3333322111 1111111111111100 000000 0000 01223456889999998 Q ss_pred HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhC----Cchhh----HHHHHHhhhhhhhhhhheecC Q lcl|NC_019530. 95 IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMD----FDNTA----YQKIRKWYVDGRQAYHVIVDP 166 (560) Q Consensus 95 eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~----f~~~~----~~~~R~wYvDGr~~~hkiiD~ 166 (560) -|.+.+.-+ |+.+--+. +. .+. +...++.+|+ -...+ +.++..+++.|.-|..+. . T Consensus 54 ~ia~~iA~l-----p~~~~~~~-~~----~~~----~~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~--~ 117 (409) T protein:vir:84 54 LLADTVASL-----SIDAYRKK-DN----VRI----PVSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYIS--A 117 (409) T ss_pred HHHHhhhhC-----ceEEEEec-CC----ccc----ccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEE--E Confidence 887776533 22221111 10 000 1122333332 22223 444556667786664332 2 Q ss_pred CCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceee Q lcl|NC_019530. 167 TNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVY 246 (560) Q Consensus 167 ~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y 246 (560) .+...-+.+|++|+|..+.... ..+..+ ..+ ++.|. .. ..+++.+.|.| T Consensus 118 ~~~~g~~~~L~~l~p~~v~v~~---~~~~~~-~~~------~~~~~-------~~--------------g~~~~~~dvih 166 (409) T protein:vir:84 118 RDEANRPTAIMPIHPDCIHVTD---AKDEDG-DWI------EPVYR-------ID--------------GKVVPNHRIMH 166 (409) T ss_pred ECCCCceEEEEEEcCceeEEEE---cCCCcc-eEE------EEEec-------CC--------------ceEEchhhEEE Confidence 3334558999999997776431 112111 000 11111 01 12366777777 Q ss_pred eecccccCCC-CcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecc Q lcl|NC_019530. 247 IDSGEEPLAN-GIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDR 325 (560) Q Consensus 247 ~hsGl~~~~~-~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~ 325 (560) ... ...++ -+..|-++.|.+.+.....+++...-+----+--+-+.-++ |+|.+..+++..+.....+.| T Consensus 167 ~~~--~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~n------ 237 (409) T protein:vir:84 167 IKR--YPVAGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSD-ADLTPDQVKQTQKQWIQSHHN------ 237 (409) T ss_pred ecC--CCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecC-CCCCHHHHHHHHHHHHHHhcc------ Confidence 753 23333 24668899999888888777776653332223334555554 577777776666655544432 Q ss_pred cCCcccCcccchhhHhhhcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhh Q lcl|NC_019530. 326 TTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGG-GNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAE 404 (560) Q Consensus 326 ~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpG-g~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~e 404 (560) .|. .+ .++ .|.+++.+-- ...+.-++-.++..+.+.++++||.+.|+...+-+...++.++ T Consensus 238 -~g~------~~-vl~----------~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~ 299 (409) T protein:vir:84 238 -RRL------PA-VMS----------AGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEE 299 (409) T ss_pred -CCC------ee-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHH Confidence 232 11 121 2445555432 1122334455677899999999999999764443444444332 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019530. 405 ITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVE 484 (560) Q Consensus 405 ItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~ 484 (560) .. +.|..++ |+--+. .+.+.|...| ..| .+|.|++ ..|...+ +..|++.+..+- T Consensus 300 ~~---~~f~~~~--l~P~~~-~ie~~l~~~L-~~g------------~~i~fd~------~~l~~~d-~~~~~~~~~~~~ 353 (409) T protein:vir:84 300 QG---INFVRHT--LLPWLR-CIEQALDTFL-PRG------------QFVKFNV------DGLMRGD-VTARFTAYQMGL 353 (409) T ss_pred HH---HHHHHHH--HHHHHH-HHHHHHHHhc-cCC------------CeEEEec------hhhhccC-HHHHHHHHHHHH Confidence 22 2354442 332222 2233333332 112 1233332 2333222 234555544443 Q ss_pred hhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccc---cccccc-cccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 485 PFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADE---TGNYSG-SDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 485 p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e---~g~~~~-~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) .- -++|.+-+++ ++.+.+ -|..++ +.+..+ ++.++.+...+.-|-++ |++.= T Consensus 354 ~~--G~~t~NE~R~-~~g~~p--------------------~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~-~~gn~ 409 (409) T protein:vir:84 354 QN--GIWSVNEVRA-WEDAPP--------------------IPEGDIHLQPMNFVPLGYVPPEEPAQEPQPNSA-TEGNK 409 (409) T ss_pred hC--CCcCHHHHHH-HhCCCC--------------------CCCcceeeecccccccccCCccccCcCCCCCCc-cCCCC Confidence 22 3666666663 345532 122111 111111 11111111122222222 22222 No 137 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=95.33 E-value=0.0023 Score=35.07 Aligned_cols=436 Identities=9% Similarity=0.040 Sum_probs=167.3 Q ss_pred hhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHH--------- Q lcl|NC_019530. 23 DTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAV--------- 93 (560) Q Consensus 23 ~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI--------- 93 (560) -++.+.++-++... ..............---+..-+..-..-+.+|+.+..+.+-...| T Consensus 1 ~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~~~~~ 68 (474) T protein:vir:95 1 MFNIIRMPWDKPYG------------EEVVEQLKPQFETQEEMIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDV 68 (474) T ss_pred CcceeecCCCCchh------------hHHHHhhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccCchhcccccccc Confidence 11111111110000 000000000000000000000000011122333333332222111 Q ss_pred ------------------HHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhh Q lcl|NC_019530. 94 ------------------DIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVD 155 (560) Q Consensus 94 ------------------~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvD 155 (560) ..||+ ..+.--...||++..++.+ ..+ .++.+++ -+|+.+..++.+.+++. T Consensus 69 ~~~~~~~~~~~ki~~n~~~~Ivd-~~~~~l~g~p~~~~~~d~~-----~~~----~l~~~~~-n~~~~~~~e~~~~~~~~ 137 (474) T protein:vir:95 69 YGNIDYDKPDWRITTNFHQNLVD-QKVSYVASKPVTYSCEDES-----VLK----IIHDVLD-TRWDNKLIDILTATSNK 137 (474) T ss_pred ccccccccccceeccchHHHHHH-HHHhhhccCCceeccCchH-----HHH----HHHHHHh-ccHHHHHHHHHHHHhhc Confidence 11122 1122233456666554421 223 3333443 47899999999999999 Q ss_pred hhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCC-cccccccc--ccchheeecCcccc-cccc-ccccccccc Q lcl|NC_019530. 156 GRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEAIE--SVTLKYYYNPNYNR-NQFT-GQSGTSQNF 230 (560) Q Consensus 156 Gr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~~~--~~~e~~~Y~p~~~~-~~~~-g~~~~~~~~ 230 (560) |+-|.+.-+|. +|-..+..++|+++-+|..=....+ ...+.... ....+-+|++..-+ +... +........ T Consensus 138 G~~~~~v~~d~----~~~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~ 213 (474) T protein:vir:95 138 GIDWLQVYINE----NGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYY 213 (474) T ss_pred CcEEEEEEecC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEEcCeeEEEEEeCCeEEEEEEcCCcccccccc Confidence 99887655543 3557899999998877632110000 00011100 11111123322111 0000 000000000 Q ss_pred cccc---cceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHH Q lcl|NC_019530. 231 QPSQ---QELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTEDATVIYAITRAPEKRAFYLDVGTLGKKSA 306 (560) Q Consensus 231 ~~~~---~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~EDalVIyRitRAPeRRvFyIDvGnlpk~KA 306 (560) .... ....-+...|-+++. .++....|=++..+.....+. ++-+....-+-.+.|-+-+.-.+..... T Consensus 214 ~~~~~~~~~~~~~~g~iPvv~~----~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~---- 285 (474) T protein:vir:95 214 GANHIQSHFSNGNWGRVPFIAF----KNNPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLE---- 285 (474) T ss_pred CcccccccccccCCCccceEee----cCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccch---- Confidence 0000 000000011111110 012223455666555555554 4555555667777776555433322111 Q ss_pred HHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCC Q lcl|NC_019530. 307 EEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEM-DHVNYFREKLYDALMIP 385 (560) Q Consensus 307 eqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~VP 385 (560) . +.+. +..+++=.=+++. +++.|-...+++.. .-+.-+.+.+|....+| T Consensus 286 -~--------~~~~-------------------~~~~~~i~~~~~~--~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p 335 (474) T protein:vir:95 286 -E--------FMRG-------------------LKYYKAINVDGDG--GVETIQVEVPVSSTKEYIDLMRAYIMEFGQGV 335 (474) T ss_pred -h--------hhhh-------------------hhccceeeccCCC--ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCc Confidence 0 0010 0011110011222 24444433455433 33566778889999999 Q ss_pred ccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHH Q lcl|NC_019530. 386 KSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIR 465 (560) Q Consensus 386 ~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~ 465 (560) - +..++ +. |...+..|..-+..-..-+.+.+..|...+.++++.-+-+-|+ ..+|. .|.+.|...---. T Consensus 336 ~--~~~~~-~~-~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~--~~d~~-----~i~v~f~~~~p~d 404 (474) T protein:vir:95 336 D--FQTDK-FG-SAPSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNL--KMDVK-----DIEISFNFNRMMN 404 (474) T ss_pred c--ccccc-cc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Ccccc-----eeeEEeccCCCcC Confidence 3 22222 11 1122334544444444456777777777777777654444454 23342 3556664332221 Q ss_pred HHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCC-CCccccccccccccCCCC Q lcl|NC_019530. 466 EQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPK-VQADETGNYSGSDVSPLK 544 (560) Q Consensus 466 Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~-p~~~e~g~~~~~~~~~~~ 544 (560) +. +.++++.++ | .+|.+++++. |..+++ -+++.++|++|..+..... +..+-.++...++..+.. T Consensus 405 ~~-------e~a~~~~~~----g-~iS~et~i~~-l~~v~d-~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~ 470 (474) T protein:vir:95 405 DA-------EQSQIIAQS----Q-YLSRETLVKS-SPLVDD-YKAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSND 470 (474) T ss_pred HH-------HHHHHHHhc----C-CCchHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHhcccccccccCCCCcCCCCCcc Confidence 21 233344443 4 5899999976 555432 2233444555532111111 111101111111112222 Q ss_pred CCcc Q lcl|NC_019530. 545 FKPE 548 (560) Q Consensus 545 ~~p~ 548 (560) -.|+ T Consensus 471 ~~~~ 474 (474) T protein:vir:95 471 KESE 474 (474) T ss_pred CCCC Confidence 2333 No 138 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=95.26 E-value=0.0024 Score=34.93 Aligned_cols=407 Identities=12% Similarity=0.094 Sum_probs=176.7 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) |++|+.+.. +..+. ....|... ...+..+. ..+.+... +. ....++|-|.+||. T Consensus 1 Mg~f~~lf~------r~~~~------~~~~~~~~--~~~~~~~~--~~~~g~~v---~~-------~~al~~~~v~~~i~ 54 (414) T protein:vir:44 1 MVFFSGLFQ------RKSDA------PVTTPAEL--ADAIGLSY--DTYTGKQI---SS-------QRAMRLTAVFSCVR 54 (414) T ss_pred Cchhhhhhc------cCccC------cccchhhH--hHhhccCc--cccCCcee---ch-------hhhhccHHHHHHHH Confidence 444443211 11110 01111000 00000000 01111100 10 12246888999997 Q ss_pred Hhhccee-----eecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCC Q lcl|NC_019530. 95 IIVNDVV-----TCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNK 169 (560) Q Consensus 95 eIvneai-----v~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~ 169 (560) -|.+.+. ++..++...... . ...+-..+..+-. ..+....-.+.++..++..|.-|..++.+ T Consensus 55 ~Ia~~ia~~p~~~~~~~~~~~~~~-~-----~~~~~~lL~~~PN---~~~t~~~f~~~~~~~~ll~Gna~~~i~~~---- 121 (414) T protein:vir:44 55 VLAESVGMLPCNLYHLNGSLKQRA-T-----GERLHKLISTHPN---GYMTPQEFWELVVTCLCLRGNFYAYKVKA---- 121 (414) T ss_pred HHHHHhccCceEEEEecCCceeec-c-----cchHHHHHHhhcc---cCCCHHHHHHHHHHHHhhcCCeEEEEEeC---- Confidence 7766543 222221111000 0 0001111111111 01222233456677788889877654422 Q ss_pred ccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeec Q lcl|NC_019530. 170 KGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDS 249 (560) Q Consensus 170 k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hs 249 (560) ...+.+|.+|+|.++...+ +..+ +.+|.|.. ..+....++.+.|.|... T Consensus 122 ~g~~~~L~~l~~~~v~~~~-----~~~~--------~~~y~~~~------------------~~g~~~~~~~~evih~~~ 170 (414) T protein:vir:44 122 FGEVAELLPVDPGCVVPKL-----NSSW--------EPVYQVTF------------------PDGSTDVLSQEDIWHVRT 170 (414) T ss_pred CCcEEEEEEEcCceEEEEE-----CCCC--------cEEEEEEe------------------cCceEEEEccccEEEecC Confidence 3568999999998886532 1111 11122211 112344678888887762 Q ss_pred ccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCc Q lcl|NC_019530. 250 GEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGK 329 (560) Q Consensus 250 Gl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGe 329 (560) ...++-+..|-+..|..++.....+++...-+----+--+-++.++ ++|.+..+++..+.+...|+. T Consensus 171 --~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g---------- 237 (414) T protein:vir:44 171 --LTLDGLVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTE-QTLSDQAYERLKKDFEERHTG---------- 237 (414) T ss_pred --CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC-CCCCHHHHHHHHHHHHHHhcC---------- Confidence 2445556678899999988888888877665544445557788877 466666665555554444432 Q ss_pred ccCcccchhhHhhhcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHH Q lcl|NC_019530. 330 ITGNAHLMGIAEDYWLPRREGQNATEIATVGG-GNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQE 408 (560) Q Consensus 330 v~d~~~~msmlEDywLpRReGgrgTEIsTLpG-g~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRD 408 (560) +.+.-+.+ .++ .|++++.|.- ...+.-++-.++..+.+.++++||.+.|+..++.+ .+++++..+. T Consensus 238 ~~n~~~~~-vl~----------~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t--~~n~e~~~~~ 304 (414) T protein:vir:44 238 LGNAHRPM-ILE----------MGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRAT--FNNIEELGLG 304 (414) T ss_pred ccccCcce-ecC----------CCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHHHH Confidence 11111122 121 2556666632 12222234456778889999999999997644322 3333333222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhc Q lcl|NC_019530. 409 ELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVG 488 (560) Q Consensus 409 ElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vG 488 (560) |.+++ |+- +...+.+.|-. +++++.++. .+.++|..+ ++...+ +..|.+.++.+- -+ T Consensus 305 ---~~~~~--l~P-~~~~ie~~ln~-----~L~~~~~~~-----~~~i~fd~~----~ll~~d-~~~~~~~~~~~~--~~ 361 (414) T protein:vir:44 305 ---FINYS--LVP-YLTRIEQRINT-----GLVRKSKQG-----VFYAKFNAG----ALLRGD-MKSRFEAYATGI--NW 361 (414) T ss_pred ---HHHHH--HHH-HHHHHHHHHHh-----hcCCccccC-----ceEEEEech----hhhccC-HHHHHHHHHHHH--hC Confidence 44432 322 12223333333 445656542 234445433 232211 223444444322 12 Q ss_pred chhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCc-cccccccccccCCCCCCccccccc Q lcl|NC_019530. 489 SIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQA-DETGNYSGSDVSPLKFKPETIPFS 553 (560) Q Consensus 489 ky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~-~e~g~~~~~~~~~~~~~p~~~~~~ 553 (560) -+++.+-++. ++.|..-+ --++.+-.-. . -..|.. .+.+.++..++.+ -|.| T Consensus 362 G~~t~NE~R~-~~gl~p~~--ggD~~~~~~n--~-~~~~~~~~~~~~~~~~~~~d-------~~~~ 414 (414) T protein:vir:44 362 GIYSPNDCRD-LEDMNPRP--GGDVYLTPMN--M-TTKPSDGSKAGKQKDNANAD-------ETTS 414 (414) T ss_pred CCcCHHHHHH-HhCCCCCC--Ccceeccccc--c-cccCCccccCCCCCCCCCCC-------CCCC Confidence 4677777773 45554311 0000000000 0 000000 0000000000000 0000 No 139 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=95.18 E-value=0.0026 Score=34.77 Aligned_cols=435 Identities=11% Similarity=0.073 Sum_probs=172.3 Q ss_pred cCCCCCCCCceeE-------ecccc-ceeccccccchhh----HHH--------HHHHHHHHhhccchhH-HHHH----- Q lcl|NC_019530. 42 VALDDAQDGSIIL-------QGGAN-TYNYVGVESELLS----VKT--------VVEEYQSMAQQPEIRK-AVDI----- 95 (560) Q Consensus 42 ~~~~~~~dG~~~i-------~~~~~-~~~y~~~e~~~~~----~~~--------LI~~YR~ma~~PEvd~-AI~e----- 95 (560) +-+...-+|+.-. ....+ .+.+...+....+ .+. ...+|+.+..+.+-+. .|.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~~~~ 80 (501) T protein:vir:27 1 MEQTLFTDSTGQDLVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLQFGRRK 80 (501) T ss_pred CCceeEEeccchhhhhhcccChhHHHhhccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccCccC Confidence 1111111111000 00000 0111111111000 111 2234454444443321 1100 Q ss_pred ----------------hhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhh Q lcl|NC_019530. 96 ----------------IVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQA 159 (560) Q Consensus 96 ----------------Ivneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~ 159 (560) || +..+.---..||++..++.+ ..+.+.+.+..+++..+|+....++.+..++.|+-| T Consensus 81 ~~~~~~~ki~~n~~k~Iv-d~~~~yl~g~p~~~~~~d~~-----~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~ 154 (501) T protein:vir:27 81 DREMADKRAVHNYGRMIS-KFKTGYLAGNPIRVEYDDND-----NNSQNDDTIKRIGRINDIDSHNRTLIRDLSQTGRAY 154 (501) T ss_pred ccccccceeccchHHHHH-HHHhhhhcccCeeEecCCcc-----chHHHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEE Confidence 11 11111113355666665543 234556667778888999999999999999999998 Q ss_pred hhheecCCCCccCeeEeEEeCccccchhhhhcccCC-cccccccc------ccchheeecCcccccccccccccccc-cc Q lcl|NC_019530. 160 YHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEAIE------SVTLKYYYNPNYNRNQFTGQSGTSQN-FQ 231 (560) Q Consensus 160 ~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~~~------~~~e~~~Y~p~~~~~~~~g~~~~~~~-~~ 231 (560) .+.-.|.+ |=..+..+||+.+-+|..=....+ ..++.... .+...-+|++..-+. +......... .. T Consensus 155 ~~vy~ded----~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~~~~vyt~~~v~~-~~~~~~~~~~~~~ 229 (501) T protein:vir:27 155 EVIYRNEY----DETRIKRLNPLETFVIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNEHIYT-LDASDDFNEISVT 229 (501) T ss_pred EEEEeCCC----CceEEEEEccceeEEEecCCCCCceEEEEEEEEeeecCCcEEEEEEEeCCeEEE-EEeCCceeecccc Confidence 87666543 446688999999877642111110 01111110 111122344432221 1111000000 00 Q ss_pred c-cccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHH Q lcl|NC_019530. 232 P-SQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVT-TEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEY 309 (560) Q Consensus 232 ~-~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m-~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqY 309 (560) + +.+ ++| |++. .++....|=++.++.....+.. +=+....-+..+.|-+-+.-.+..+.+... T Consensus 230 ~~~~g--~vP---vv~~------~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~---- 294 (501) T protein:vir:27 230 THAFG--TVP---ITEF------LNNVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQA---- 294 (501) T ss_pred ccCCC--ccc---EEEe------cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccch---- Confidence 0 001 111 2221 1223334555555444444332 233333445556665555432222211000 Q ss_pred HHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccC----CCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCC Q lcl|NC_019530. 310 MTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRRE----GQNATEIATVGGGNQLGEM-DHVNYFREKLYDALMI 384 (560) Q Consensus 310 l~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRRe----GgrgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~V 384 (560) ..++..+ ..+++--+ ++.+..+..|-.-.+...+ .-++-+.+.+|.-.++ T Consensus 295 ------------------~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~ 349 (501) T protein:vir:27 295 ------------------SDMKRTR-------LMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNI 349 (501) T ss_pred ------------------hhhhhcC-------ceeecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCC Confidence 0000000 11221111 1112234444322222222 2345567778888898 Q ss_pred CccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC---hhHHHHhhhhhceeeeeec Q lcl|NC_019530. 385 PKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTD---EKDWNEKIKPFIKFEFTSD 461 (560) Q Consensus 385 P~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit---~eew~~~i~~~I~~~f~~D 461 (560) |---.+ +|. |...+..|.--......-+.+.++.|..-+...++.-+-+-++.. ..+| ..|.+.|... T Consensus 350 p~~~~~---~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~-----~~i~v~f~~~ 420 (501) T protein:vir:27 350 PDMSDT---NFS-GNTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDE-----SLLKITFTPN 420 (501) T ss_pred cccCcc---ccc-cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc-----ccceEEeCCC Confidence 842222 211 222233343333334455666677777776666654333323322 2223 1367777543 Q ss_pred chHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCH---HHHHHHHHHHHHhhhcCCCCCCCcccccccccc Q lcl|NC_019530. 462 SYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSD---EEVKEQQTKIAEEKKKGLYPKVQADETGNYSGS 538 (560) Q Consensus 462 n~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tD---eEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~ 538 (560) -.-.+ .+-++++..+. | .+|.+++++. |...+ +|++.++++-+++...+...+-+ ++.|+.... T Consensus 421 ~p~n~-------~e~ad~~~kl~---g-~iS~et~l~~-l~~v~D~~~E~eri~~E~~e~~~~~~~~~~~-~~~~~~~d~ 487 (501) T protein:vir:27 421 LPKSL-------NEQVSILTGLG---G-QVSQETALSL-SGLVESPNEELDKINKEVSEIDFKGYSNDFN-EHVGKYTDE 487 (501) T ss_pred CCcCH-------HHHHHHHHHHh---c-cCcHHHHHHh-CCCCCCHHHHHHHHHHHHHhhhHhhhcCccc-cccccccCC Confidence 22122 22334555553 4 4899999987 44443 44444443333222222111111 111111100 Q ss_pred ccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 539 DVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 539 ~~~~~~~~p~~~~~~~~~~~~~ 560 (560) ... .+++|..= T Consensus 488 ~~~-----------~~~d~~e~ 498 (501) T protein:vir:27 488 VKE-----------THTDDFER 498 (501) T ss_pred CCC-----------Cccccccc Confidence 000 11111111 No 140 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=95.17 E-value=0.0026 Score=34.74 Aligned_cols=387 Identities=15% Similarity=0.163 Sum_probs=164.3 Q ss_pred chhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhh Q lcl|NC_019530. 6 RGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQ 85 (560) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~ 85 (560) -|+|++|+ +|...... .++......++.. . .-...|..++. T Consensus 1 Mg~~~~f~---------------~k~~~~~~---~~~~~~~~~~~~~---------~------------~~~~~~~~~~~ 41 (403) T protein:vir:80 1 MGLFNFFR---------------RKTRSEPT---NAISWFLTQEAYD---------T------------LAIPGYTRLSD 41 (403) T ss_pred Cccccccc---------------cccccccc---chhhhhccccccc---------c------------cccchhhhhhh Confidence 23332221 01000000 0000000000000 0 00122456788 Q ss_pred ccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHH-hCCchhhHHHHHHh----hhh--hhh Q lcl|NC_019530. 86 QPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHL-MDFDNTAYQKIRKW----YVD--GRQ 158 (560) Q Consensus 86 ~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l-l~f~~~~~~~~R~w----YvD--Gr~ 158 (560) +|.|.+||+-|.+.+.-. |+.+-=...++ . ..+......+|+. =+=...++++++.+ +.+ |.- T Consensus 42 ~~~V~~~I~~ia~~iA~~-----p~~~~~~~~~g-~----~~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna 111 (403) T protein:vir:80 42 NPEVRMAVHKIAELISSM-----TIHLMQNTDNG-D----IRIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNS 111 (403) T ss_pred hHHHHHHHHHHHHhhhhC-----ceEEEEecCCc-e----eecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccE Confidence 999999998776654321 23221111111 0 1122222222221 11112445554433 444 333 Q ss_pred hhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCcccccccccccccccccccccccee Q lcl|NC_019530. 159 AYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELV 238 (560) Q Consensus 159 ~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~ 238 (560) |..++. +.+.-+.+|.+|+|..|..+. +..+. . +.|+ + .. T Consensus 112 ~i~~~~---~~~g~~~~L~~l~p~~v~~~~-----~~~g~-------~--~~y~---------~--------------~~ 151 (403) T protein:vir:80 112 VVFPKY---TTSGLIDELIPLAPSKVSFVD-----TDTGY-------Q--IWYQ---------G--------------KA 151 (403) T ss_pred EEEEEE---cCCCcEEEEEEEcCCeeEEEE-----cCCce-------E--EEEe---------e--------------cc Confidence 443332 223348899999998887642 11120 0 1232 1 12 Q ss_pred cchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhh Q lcl|NC_019530. 239 FDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFK 318 (560) Q Consensus 239 i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~k 318 (560) ++.+.|.|..-.....+.=...|.+..+..+++.....++...-+----+--+-|..++- .+....+++..+.+..+|. T Consensus 152 ~~~~eiih~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-~~~~~~~~~~~~~~~~~~~ 230 (403) T protein:vir:80 152 YNYDEVLHFIVNPDPEKPYMGRGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDA-ATAELSSEEGRNAVFKKYL 230 (403) T ss_pred cchhhEEEEeccCCCcCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCChHHHHHHHHHHHHHHh Confidence 455666654421122222246688999999999998888877665544455566766764 4555555555555444443 Q ss_pred hheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCccccc Q lcl|NC_019530. 319 NRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIG 398 (560) Q Consensus 319 nk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~ 398 (560) .- ..+|.+ + + +| -.+..+++++.|. -..+.-++-.++-...+.++++||...|+-... T Consensus 231 ~~----~~~g~~------~-~-----~~-~~~~~~~~~~~l~-~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~---- 288 (403) T protein:vir:80 231 EA----SEAGQP------W-I-----IP-AELLDVEQVKPLS-LKDLAIHETVELDKRTVAGIFGVPAFLLGVGKY---- 288 (403) T ss_pred hh----hhcCCe------e-e-----ec-ccccccceeccCC-HHHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCc---- Confidence 22 112321 1 1 11 1122334454442 234444555677788899999999998842111 Q ss_pred ccchhhhhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHH Q lcl|NC_019530. 399 GSNLAEITQEELRFSKFCA-GLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRL 477 (560) Q Consensus 399 ~~~~~eItRDElkF~Kfi~-rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl 477 (560) + +. .|..|+. .|+ -+...+.+.|-+.| +++.++ .|.|..+ .+-..+ ..+|+ T Consensus 289 ----~---~~--~~~~f~~~~l~-P~~~~ie~~l~~kl-----l~~~~~--------~~~f~~~----~ll~~d-~~~~~ 340 (403) T protein:vir:80 289 ----D---KD--EYNNFINSTIL-PIAKGIEQELTRKL-----LISPDL--------YFKFNPR----SLYAYD-LKELA 340 (403) T ss_pred ----c---HH--HHHHHHHHHHH-HHHHHHHHHHHHhc-----cCCCCc--------EEEeech----hhhccC-HHHHH Confidence 1 11 1333432 233 22233334444444 444442 3334322 221111 33455 Q ss_pred HHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcc---ccccccccccCCCCCCcccccccC Q lcl|NC_019530. 478 ASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQAD---ETGNYSGSDVSPLKFKPETIPFSG 554 (560) Q Consensus 478 ~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~---e~g~~~~~~~~~~~~~p~~~~~~~ 554 (560) +.+..+-. +-++|.+-++. .+.|.+.+ ..+ .+++..| +..-++--...| T Consensus 341 ~~~~~~~~--~Gi~t~NE~R~-~~gl~p~~--------------------ggd~~~~~~n~~p-----l~~~~~~~~~k~ 392 (403) T protein:vir:80 341 EVGSNMYV--RGLMEGNEVRD-WLGLSPKE--------------------GLSELVILENYIP-----LDKIGDQNKLKG 392 (403) T ss_pred HHHHHHHh--CCCcCHHHHHH-HhCCCCCC--------------------CCCeEeecccccc-----hhhccchhhccC Confidence 55554422 24677777764 35654321 111 0111110 000001001111 Q ss_pred CcccCC Q lcl|NC_019530. 555 STDDSI 560 (560) Q Consensus 555 ~~~~~~ 560 (560) |.++.- T Consensus 393 ge~~~~ 398 (403) T protein:vir:80 393 GEKGGA 398 (403) T ss_pred CCCCCC Confidence 111111 No 141 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=95.11 E-value=0.0027 Score=34.62 Aligned_cols=433 Identities=10% Similarity=0.066 Sum_probs=165.7 Q ss_pred cccccccCCCCCCCCc-eeEeccccceeccccccc-hhhHHHHHHHHHHHhhccchhHHHH------------------- Q lcl|NC_019530. 36 LTKATVVALDDAQDGS-IILQGGANTYNYVGVESE-LLSVKTVVEEYQSMAQQPEIRKAVD------------------- 94 (560) Q Consensus 36 ~~k~~s~~~~~~~dG~-~~i~~~~~~~~y~~~e~~-~~~~~~LI~~YR~ma~~PEvd~AI~------------------- 94 (560) -..+.+.|...+..+. +.......... ...-.. +..-.+.+.+|..+..+.+-+..|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ 79 (474) T protein:vir:95 1 MINIIRMPWDKPYGEEVVEQMKPKVETQ-EEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDW 79 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccch-HHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhccccccccccc Confidence 1122222222222111 00000000000 000000 1111223445555555444332221 Q ss_pred --------HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecC Q lcl|NC_019530. 95 --------IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDP 166 (560) Q Consensus 95 --------eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~ 166 (560) -||+ ..+.---..||++..++. ...+.++.+++ -+|+....++.+.+++.|+-|.+.-+|. T Consensus 80 ki~~n~~k~Iv~-~~~~yl~g~p~~~~~~~~---------~~~~~l~~~~~-n~~~~~~~~l~~~~~~~G~~~~~~~~d~ 148 (474) T protein:vir:95 80 RITTNFHQNLVD-QKVSYVAGKPVTYAHDDD---------KVLDVIHQVLD-TRWDNKLIDILTAASNKGIDWLQVYINE 148 (474) T ss_pred ccccchHHHHHH-hhhhhhcccCceeccCCh---------HHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEeeeCC Confidence 1111 111112345555544432 22344444544 4799999999999999999998766654 Q ss_pred CCCccCeeEeEEeCccccchhhhhcccCC-cccccc--ccccchheeecCccccccccccccccccccccccce-----e Q lcl|NC_019530. 167 TNKKGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEA--IESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQEL-----V 238 (560) Q Consensus 167 ~~~k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~--~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~-----~ 238 (560) +|-..+..+||+.+-+|..-....+ ...+.. .++...+-+|.+..-.+.....+............. . T Consensus 149 ----~~~~~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (474) T protein:vir:95 149 ----DGELKLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFST 224 (474) T ss_pred ----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccc Confidence 4668899999999888742110000 000100 111111222333221111000000000000000000 0 Q ss_pred cchhceeeeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhh Q lcl|NC_019530. 239 FDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKF 317 (560) Q Consensus 239 i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~ 317 (560) -.-..|-.++. .++....|=|+..+.....+. ++-+....-+-.+.|-+-+.-.+.-++. +. T Consensus 225 ~~~~~vPvv~~----~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~-----~~-------- 287 (474) T protein:vir:95 225 GSWERVPFIAF----KNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLS-----EF-------- 287 (474) T ss_pred cCCCccceEEe----cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCccccc-----ch-------- Confidence 00000111110 022223355555555554443 3333333445555554433322111110 11 Q ss_pred hhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCccc Q lcl|NC_019530. 318 KNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGSIN 396 (560) Q Consensus 318 knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~ 396 (560) +..+..+++-.-+++ -.+..|-...+.+. -.-+.-+.+.+|...++|---.+ +|. T Consensus 288 -------------------~~~~~~~~~i~~~~~--~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~---~~~ 343 (474) T protein:vir:95 288 -------------------MEGLKYYKAINVSSD--GGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTD---KFG 343 (474) T ss_pred -------------------hhhhhccceeeccCC--CceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcccc---ccc Confidence 111111111111111 12444433223222 23455567778999999842221 211 Q ss_pred ccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHH Q lcl|NC_019530. 397 IGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDR 476 (560) Q Consensus 397 l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~R 476 (560) +...+..|..-......-+.+.++.|...+...|+.-+-+-|+ . .+|. .|.+.|...---.++ +. T Consensus 344 -~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~-~-~d~~-----~i~i~f~~~~p~~~~-------e~ 408 (474) T protein:vir:95 344 -SATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKI-K-LDAK-----EIEITFNFNVMVNDL-------EQ 408 (474) T ss_pred -cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-C-cccc-----eeeEEecCCCccCHH-------HH Confidence 1122223433333334556667767777777766654444454 2 2332 367777544322222 22 Q ss_pred HHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcC--CCCCCCccccccccccccCCCCCCcc Q lcl|NC_019530. 477 LASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKG--LYPKVQADETGNYSGSDVSPLKFKPE 548 (560) Q Consensus 477 l~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~--~~~~p~~~e~g~~~~~~~~~~~~~p~ 548 (560) ++++.++ | .+|.+++++. |..+++ -+++-++|++|..+- ..+.+.....++ +.....+..-..| T Consensus 409 a~~~~~~----g-iiS~et~~~~-lp~v~D-~~~E~eri~~E~~~~~~~~~~~~~~~~~~-~~~~~~~~~~e~~ 474 (474) T protein:vir:95 409 SQIGAQS----Q-YLSKETLVRH-HPWVDD-PKAELERLDEEQLELNKQLPNLDDGGADG-AQQQQQSENNQSK 474 (474) T ss_pred HHHHHHc----C-CCChHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHhhccccccccCCC-CCCcCCCCccccC Confidence 3334332 3 6999999966 665542 233344555553211 111111000000 0000000011111 No 142 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=95.11 E-value=0.0027 Score=34.62 Aligned_cols=433 Identities=10% Similarity=0.066 Sum_probs=165.7 Q ss_pred cccccccCCCCCCCCc-eeEeccccceeccccccc-hhhHHHHHHHHHHHhhccchhHHHH------------------- Q lcl|NC_019530. 36 LTKATVVALDDAQDGS-IILQGGANTYNYVGVESE-LLSVKTVVEEYQSMAQQPEIRKAVD------------------- 94 (560) Q Consensus 36 ~~k~~s~~~~~~~dG~-~~i~~~~~~~~y~~~e~~-~~~~~~LI~~YR~ma~~PEvd~AI~------------------- 94 (560) -..+.+.|...+..+. +.......... ...-.. +..-.+.+.+|..+..+.+-+..|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ 79 (474) T protein:vir:96 1 MINIIRMPWDKPYGEEVVEQMKPKVETQ-EEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDW 79 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccch-HHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhccccccccccc Confidence 1122222222222111 00000000000 000000 1111223445555555444332221 Q ss_pred --------HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecC Q lcl|NC_019530. 95 --------IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDP 166 (560) Q Consensus 95 --------eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~ 166 (560) -||+ ..+.---..||++..++. ...+.++.+++ -+|+....++.+.+++.|+-|.+.-+|. T Consensus 80 ki~~n~~k~Iv~-~~~~yl~g~p~~~~~~~~---------~~~~~l~~~~~-n~~~~~~~~l~~~~~~~G~~~~~~~~d~ 148 (474) T protein:vir:96 80 RITTNFHQNLVD-QKVSYVAGKPVTYAHDDD---------KVLDVIHQVLD-TRWDNKLIDILTAASNKGIDWLQVYINE 148 (474) T ss_pred ccccchHHHHHH-hhhhhhcccCceeccCCh---------HHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEeeeCC Confidence 1111 111112345555544432 22344444544 4799999999999999999998766654 Q ss_pred CCCccCeeEeEEeCccccchhhhhcccCC-cccccc--ccccchheeecCccccccccccccccccccccccce-----e Q lcl|NC_019530. 167 TNKKGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEA--IESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQEL-----V 238 (560) Q Consensus 167 ~~~k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~--~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~-----~ 238 (560) +|-..+..+||+.+-+|..-....+ ...+.. .++...+-+|.+..-.+.....+............. . T Consensus 149 ----~~~~~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (474) T protein:vir:96 149 ----DGELKLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFST 224 (474) T ss_pred ----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccc Confidence 4668899999999888742110000 000100 111111222333221111000000000000000000 0 Q ss_pred cchhceeeeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhh Q lcl|NC_019530. 239 FDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKF 317 (560) Q Consensus 239 i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~ 317 (560) -.-..|-.++. .++....|=|+..+.....+. ++-+....-+-.+.|-+-+.-.+.-++. +. T Consensus 225 ~~~~~vPvv~~----~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~-----~~-------- 287 (474) T protein:vir:96 225 GSWERVPFIAF----KNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLS-----EF-------- 287 (474) T ss_pred cCCCccceEEe----cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCccccc-----ch-------- Confidence 00000111110 022223355555555554443 3333333445555554433322111110 11 Q ss_pred hhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCccc Q lcl|NC_019530. 318 KNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGSIN 396 (560) Q Consensus 318 knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~ 396 (560) +..+..+++-.-+++ -.+..|-...+.+. -.-+.-+.+.+|...++|---.+ +|. T Consensus 288 -------------------~~~~~~~~~i~~~~~--~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~---~~~ 343 (474) T protein:vir:96 288 -------------------MEGLKYYKAINVSSD--GGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTD---KFG 343 (474) T ss_pred -------------------hhhhhccceeeccCC--CceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcccc---ccc Confidence 111111111111111 12444433223222 23455567778999999842221 211 Q ss_pred ccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHH Q lcl|NC_019530. 397 IGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDR 476 (560) Q Consensus 397 l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~R 476 (560) +...+..|..-......-+.+.++.|...+...|+.-+-+-|+ . .+|. .|.+.|...---.++ +. T Consensus 344 -~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~-~-~d~~-----~i~i~f~~~~p~~~~-------e~ 408 (474) T protein:vir:96 344 -SATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKI-K-LDAK-----EIEITFNFNVMVNDL-------EQ 408 (474) T ss_pred -cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-C-cccc-----eeeEEecCCCccCHH-------HH Confidence 1122223433333334556667767777777766654444454 2 2332 367777544322222 22 Q ss_pred HHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcC--CCCCCCccccccccccccCCCCCCcc Q lcl|NC_019530. 477 LASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKG--LYPKVQADETGNYSGSDVSPLKFKPE 548 (560) Q Consensus 477 l~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~--~~~~p~~~e~g~~~~~~~~~~~~~p~ 548 (560) ++++.++ | .+|.+++++. |..+++ -+++-++|++|..+- ..+.+.....++ +.....+..-..| T Consensus 409 a~~~~~~----g-iiS~et~~~~-lp~v~D-~~~E~eri~~E~~~~~~~~~~~~~~~~~~-~~~~~~~~~~e~~ 474 (474) T protein:vir:96 409 SQIGAQS----Q-YLSKETLVRH-HPWVDD-PKAELERLDEEQLELNKQLPNLDDGGADG-AQQQQQSENNQSK 474 (474) T ss_pred HHHHHHc----C-CCChHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHhhccccccccCCC-CCCcCCCCccccC Confidence 3334332 3 6999999966 665542 233344555553211 111111000000 0000000011111 No 143 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=95.05 E-value=0.0029 Score=34.52 Aligned_cols=419 Identities=11% Similarity=0.113 Sum_probs=164.1 Q ss_pred cccccCCCCCCCCceeEeccccceeccccccc-----hhh--HHHHHHHHHHHhhccchhHHH----------------- Q lcl|NC_019530. 38 KATVVALDDAQDGSIILQGGANTYNYVGVESE-----LLS--VKTVVEEYQSMAQQPEIRKAV----------------- 93 (560) Q Consensus 38 k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~-----~~~--~~~LI~~YR~ma~~PEvd~AI----------------- 93 (560) -... .+.........+...... +.. .++.+.+|+.+..+.+=...| T Consensus 1 ~~~~----------~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~ 70 (479) T protein:vir:79 1 MLNI----------YISETDLIKVQLKKESTINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDF 70 (479) T ss_pred CCCc----------eecccceEeeccccCChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCccccccccccccccccccc Confidence 0000 000000101111111000 000 112334455554444322211 Q ss_pred ------------HHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhh Q lcl|NC_019530. 94 ------------DIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYH 161 (560) Q Consensus 94 ------------~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~h 161 (560) ..||+..+-+ --..||++..++.+ +.+..+.+.+ .+|+....++.+...+.|+-|.+ T Consensus 71 ~~~~~ki~~~~~~~Ivd~~~~~-l~g~p~~~~~~~~~---------~~~~~~~~~~-n~~~~~~~~~~~~~~~~G~~~~~ 139 (479) T protein:vir:79 71 TKVNNKAINNYHKLLVDQKVGY-SVGNPIVFNADDDN---------LTKLLNDLLG-EEFDDTITELYLNASNKGVEWLH 139 (479) T ss_pred ccCcceeecchHHHHHHHHHhh-hhcCCceeccCCHH---------HHHHHHHHHh-cCHHHHHHHHHHHHHhcCeEEEE Confidence 1122211111 12345555444321 2222333333 58999999999999999999887 Q ss_pred heecCCCCccCeeEeEEeCccccchhhhhcccCC-cccccccc-------ccchheeecCcccccccccccccc-ccc-- Q lcl|NC_019530. 162 VIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEAIE-------SVTLKYYYNPNYNRNQFTGQSGTS-QNF-- 230 (560) Q Consensus 162 kiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~~~-------~~~e~~~Y~p~~~~~~~~g~~~~~-~~~-- 230 (560) .-+|. +|-..+..+||+.+-+|..-....+ ...+.... .+...-+|.+..-.+.....+... ... T Consensus 140 v~~d~----~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~y~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~ 215 (479) T protein:vir:79 140 PYINR----KGEFKYVIIPAEEAIPIWDSKRQRELVAFIRFYYIEDIDGNKIKRVEYYTENDITYFIERGNSFIQEFLYD 215 (479) T ss_pred EEeCC----CCceEEEEEccceeEEEEeCCCCCceEEEEEEEEEeecCCceEEEEEEEeCCcEEEEEecCCccccccccc Confidence 66543 4667899999998877632110000 00011100 000111222221111100000000 000 Q ss_pred ------cccccceecch-----hceeeeecccccCCCCcchhhhHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccC Q lcl|NC_019530. 231 ------QPSQQELVFDD-----ESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVT-TEDATVIYAITRAPEKRAFYLDV 298 (560) Q Consensus 231 ------~~~~~~~~i~~-----daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m-~EDalVIyRitRAPeRRvFyIDv 298 (560) ........... ..|-+++. +++....|-++..+....-+.. +-+....-+.++-|-.-+--.+ T Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~----~nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~- 290 (479) T protein:vir:79 216 EYGKMTDIQEGHFRINNKEQGWGKVPFIPF----KNNEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYP- 290 (479) T ss_pred ccccccccccccccccccccCCCcccEEEe----cCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCC- Confidence 00000000000 01111111 1333344666665555554442 3344445555565543322111 Q ss_pred CCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHH Q lcl|NC_019530. 299 GTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLG-EMDHVNYFREK 377 (560) Q Consensus 299 Gnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLg-ei~DV~YF~~k 377 (560) |. ...+... .+ +.. .|+. + +++-+.+.-|.+ .+.. .-..++-+++. T Consensus 291 ~~----~~~~~~~--------~~------------~~~-~~i~---~---~~~~~~~~l~~~--~~~~~~~~~~~~l~~~ 337 (479) T protein:vir:79 291 GT----SLQEFID--------NI------------RYY-KSIK---V---DGGGGVDKLEIN--IPVEAKKELLDRLEKN 337 (479) T ss_pred cc----ccccchh--------hh------------hhc-ccee---c---CCCCcceEEecc--CCHHHHHHHHHHHHHH Confidence 11 0000000 00 000 0110 1 122223333333 3332 22445666777 Q ss_pred HHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceee Q lcl|NC_019530. 378 LYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFE 457 (560) Q Consensus 378 Ly~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~ 457 (560) +|+...+|-. ..++. |...+..|..-...-..-+.+.+..|...+.++++.-+-+-++....+++. ..|.+. T Consensus 338 i~~~s~~p~~--~~~~~---gn~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~---~~i~i~ 409 (479) T protein:vir:79 338 IIIFGQGVNP--ESQNT---GDKSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISGNKSYDY---KTVQIT 409 (479) T ss_pred HHHHhCcccc--ccccc---cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccc---ccceEE Confidence 8888899843 22222 222234444443444455677777777777777765444434444333332 236677 Q ss_pred eeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccc Q lcl|NC_019530. 458 FTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSG 537 (560) Q Consensus 458 f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~ 537 (560) |..----.+.. -+++++.+ +| .+|++++++. |...++ .+++-++|++|...-.-.. +..++.. T Consensus 410 f~~~~p~~~~~-------~a~~~~kl---~g-~iS~et~l~~-l~~v~d-~~~E~~ri~~E~~~~~~~~---~~~~~~~- 472 (479) T protein:vir:79 410 FNHSMIINEAE-------KIDMAAKS---TG-IVSDETIVSN-HPWVED-VNDELERLKKQEDTQKEYD---DLIPNNQ- 472 (479) T ss_pred eCCCCCcCHHH-------HHHHHHHH---hc-cCcHHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHHH---hccCccc- Confidence 74432222322 23444444 35 4999999976 565432 3344455555542110000 0000000 Q ss_pred cccCCCCCCcccccccCCcccC Q lcl|NC_019530. 538 SDVSPLKFKPETIPFSGSTDDS 559 (560) Q Consensus 538 ~~~~~~~~~p~~~~~~~~~~~~ 559 (560) .+ .+|++ T Consensus 473 ~~---------------~~~e~ 479 (479) T protein:vir:79 473 DG---------------VIDET 479 (479) T ss_pred CC---------------CcCcC Confidence 00 00111 No 144 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=94.74 E-value=0.0036 Score=33.97 Aligned_cols=417 Identities=12% Similarity=0.104 Sum_probs=172.0 Q ss_pred CCcccc-----hhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCC-CceeEeccccceeccccccchhhHH Q lcl|NC_019530. 1 MAGYGR-----GFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQD-GSIILQGGANTYNYVGVESELLSVK 74 (560) Q Consensus 1 ~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~d-G~~~i~~~~~~~~y~~~e~~~~~~~ 74 (560) |--|.. +|+++= .+.+ ......+..+.+ .-+...|..+- .....-.+ +.+.. . T Consensus 1 ~~~~~~~~~~~~~~~~~---~~~~-~~~~~~lf~~~e------~R~~~~~~~~~~~~~~~~~~-~~~~~----~------ 59 (441) T protein:vir:79 1 MHWYNTDCYFVDFKSRK---QSRK-ELVVVGIFYKNE------KRDLQYNEDDLQMMVQTLPG-FQGTK----L------ 59 (441) T ss_pred CccccCccccccccccc---cchh-hhhccccccccc------cccccCCCcchHHHHHHhcc-cCccc----c------ Confidence 322221 222110 0000 000000111111 01111111110 00000001 11100 0 Q ss_pred HHHHHH--HHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCch----h---- Q lcl|NC_019530. 75 TVVEEY--QSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDN----T---- 144 (560) Q Consensus 75 ~LI~~Y--R~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~----~---- 144 (560) ..| ..-+.+|.|..||.-|.+.+.-. |+.+.=+... . .-+.++.+|+-.- . T Consensus 60 ---~~~~~~~al~~~~V~~cv~~Ia~~iA~l-----p~~~~~~~~~--~---------~~~~~~~lL~~~PN~~~t~~~f 120 (441) T protein:vir:79 60 ---RQYKDIEAIRHSDIFTAVMMIASDLARM-----PIRVTVNGQI--N---------YSDRIVNLLNTRPNPMYNGYIF 120 (441) T ss_pred ---cccchhhhhccHHHHHHHHHHHHhhccC-----ceeeecCccc--c---------ccchHHHHHhcccCcCCCHHHH Confidence 011 12356788999997666655432 3333211110 0 1123444443221 1 Q ss_pred hHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccc Q lcl|NC_019530. 145 AYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQS 224 (560) Q Consensus 145 ~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~ 224 (560) -+.++..++..|.-|..++.|.. .=+++|.+|+|..|..++ +..|. .++.|.. .++. T Consensus 121 ~~~~~~~lll~Gnay~~i~r~~~---G~~~~L~~i~~~~v~v~~-----d~~g~--------~~~~~~~------~~~~- 177 (441) T protein:vir:79 121 KLVVFVSALLTSHGYIEITRDKT---GEPMNLTFRKTSEIELKS-----DARGR--------LYYFHQR------IDSN- 177 (441) T ss_pred HHHHHHHHhhcCCeEEEEEECCC---CcEEEEEEEcCceeEEEE-----CCCcc--------EEEEEEE------eccC- Confidence 34555667788998887776532 227899999998887532 12231 1222210 0010 Q ss_pred cccccccccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchH Q lcl|NC_019530. 225 GTSQNFQPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKK 304 (560) Q Consensus 225 ~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~ 304 (560) ..+....++.+.|.|... .+.++-+.+|-|+.|.+++.....+++...=+=---|--+-|..++ |.+... T Consensus 178 -------~~~~~~~~~~~dvih~k~--~~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~ 247 (441) T protein:vir:79 178 -------GNNIERNVKFEDMLDIKF--YSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNK 247 (441) T ss_pred -------CceeEEEEccccEEEecc--CCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC-CCCCCH Confidence 112234577777777653 3445445678899999998888878777654433345556777777 455444 Q ss_pred HHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcC Q lcl|NC_019530. 305 SAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGG-NQLGEMDHVNYFREKLYDALM 383 (560) Q Consensus 305 KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~ 383 (560) +|.+=+++-+++. .+| .++..+.+ .++ .|.+++.|.=. ..+.-++-.++..+.+.++++ T Consensus 248 e~~e~~r~~~~~~--------~~G-~~nag~~~-vl~----------~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fg 307 (441) T protein:vir:79 248 KARDRAREEFHKS--------FSG-TKQAGKVV-VLD----------ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFG 307 (441) T ss_pred HHHHHHHHHHHHH--------hcC-ccccCcce-ecC----------CCceEEEccCChhHHHHHHHHHHhHHHHHHHhC Confidence 4433333322211 122 22222232 333 24556655321 122223445677888999999 Q ss_pred CCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecch Q lcl|NC_019530. 384 IPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSY 463 (560) Q Consensus 384 VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~ 463 (560) ||.+.|+.... .++.++. .+-|..-+.-+-.++ .+.|-..| .+ ++. . ..+.|.- T Consensus 308 VPp~~lg~~~~----~~s~~q~---~~~~~~tl~P~~~~i----e~eln~kl-----~~--~~~---~--~~~~fd~--- 361 (441) T protein:vir:79 308 IPLHKFGIETA----NMSITDA---NLDYLSTLKPYITCV----CAELNFKF-----ND--EYV---N--REFKFDT--- 361 (441) T ss_pred CCHHHcCCCCC----CccHHHH---HHHHHHHHHHHHHHH----HHHHhhhc-----cc--ccc---C--ceEEeec--- Confidence 99999964332 1112232 222433333333332 23333322 22 121 1 2233422 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcc---cccccccccc Q lcl|NC_019530. 464 IREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQAD---ETGNYSGSDV 540 (560) Q Consensus 464 f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~---e~g~~~~~~~ 540 (560) .++...+ ...|++.++.+-. .-++|.+-+++ .+.+.. ++.++.+ ..++..+-++ T Consensus 362 -~~llr~D-~~~~~~~~~~~i~--~G~~T~NE~R~-~~gl~P------------------i~ggd~~~~~~~~n~~~~~~ 418 (441) T protein:vir:79 362 -TEIRVVD-EKTQAEIDKINID--SGKMNIDEIRQ-RDGLAP------------------IPGGNGSIHRVDLNHVNIEL 418 (441) T ss_pred -hhhhccC-HHHHHHHHHHHHh--CCCcCHHHHHH-HhCCCC------------------CCCCCcceEeeccccccccc Confidence 2222211 2344554443322 13566666663 344422 1111111 1122222111 Q ss_pred CCC--CCCcccc--cccCCcccC Q lcl|NC_019530. 541 SPL--KFKPETI--PFSGSTDDS 559 (560) Q Consensus 541 ~~~--~~~p~~~--~~~~~~~~~ 559 (560) .+. .-+++.. +.-||.++- T Consensus 419 ~~~~~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:79 419 VDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred ccccccccccccccccCCCCCCC Confidence 111 0011111 112333333 No 145 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=94.74 E-value=0.0036 Score=33.97 Aligned_cols=417 Identities=12% Similarity=0.104 Sum_probs=172.0 Q ss_pred CCcccc-----hhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCC-CceeEeccccceeccccccchhhHH Q lcl|NC_019530. 1 MAGYGR-----GFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQD-GSIILQGGANTYNYVGVESELLSVK 74 (560) Q Consensus 1 ~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~d-G~~~i~~~~~~~~y~~~e~~~~~~~ 74 (560) |--|.. +|+++= .+.+ ......+..+.+ .-+...|..+- .....-.+ +.+.. . T Consensus 1 ~~~~~~~~~~~~~~~~~---~~~~-~~~~~~lf~~~e------~R~~~~~~~~~~~~~~~~~~-~~~~~----~------ 59 (441) T protein:vir:94 1 MHWYNTDCYFVDFKSRK---QSRK-ELVVVGIFYKNE------KRDLQYNEDDLQMMVQTLPG-FQGTK----L------ 59 (441) T ss_pred CccccCccccccccccc---cchh-hhhccccccccc------cccccCCCcchHHHHHHhcc-cCccc----c------ Confidence 322221 222110 0000 000000111111 01111111110 00000001 11100 0 Q ss_pred HHHHHH--HHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCch----h---- Q lcl|NC_019530. 75 TVVEEY--QSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDN----T---- 144 (560) Q Consensus 75 ~LI~~Y--R~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~----~---- 144 (560) ..| ..-+.+|.|..||.-|.+.+.-. |+.+.=+... . .-+.++.+|+-.- . T Consensus 60 ---~~~~~~~al~~~~V~~cv~~Ia~~iA~l-----p~~~~~~~~~--~---------~~~~~~~lL~~~PN~~~t~~~f 120 (441) T protein:vir:94 60 ---RQYKDIEAIRHSDIFTAVMMIASDLARM-----PIRVTVNGQI--N---------YSDRIVNLLNTRPNPMYNGYIF 120 (441) T ss_pred ---cccchhhhhccHHHHHHHHHHHHhhccC-----ceeeecCccc--c---------ccchHHHHHhcccCcCCCHHHH Confidence 011 12356788999997666655432 3333211110 0 1123444443221 1 Q ss_pred hHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccc Q lcl|NC_019530. 145 AYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQS 224 (560) Q Consensus 145 ~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~ 224 (560) -+.++..++..|.-|..++.|.. .=+++|.+|+|..|..++ +..|. .++.|.. .++. T Consensus 121 ~~~~~~~lll~Gnay~~i~r~~~---G~~~~L~~i~~~~v~v~~-----d~~g~--------~~~~~~~------~~~~- 177 (441) T protein:vir:94 121 KLVVFVSALLTSHGYIEITRDKT---GEPMNLTFRKTSEIELKS-----DARGR--------LYYFHQR------IDSN- 177 (441) T ss_pred HHHHHHHHhhcCCeEEEEEECCC---CcEEEEEEEcCceeEEEE-----CCCcc--------EEEEEEE------eccC- Confidence 34555667788998887776532 227899999998887532 12231 1222210 0010 Q ss_pred cccccccccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchH Q lcl|NC_019530. 225 GTSQNFQPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKK 304 (560) Q Consensus 225 ~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~ 304 (560) ..+....++.+.|.|... .+.++-+.+|-|+.|.+++.....+++...=+=---|--+-|..++ |.+... T Consensus 178 -------~~~~~~~~~~~dvih~k~--~~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~ 247 (441) T protein:vir:94 178 -------GNNIERNVKFEDMLDIKF--YSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNK 247 (441) T ss_pred -------CceeEEEEccccEEEecc--CCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC-CCCCCH Confidence 112234577777777653 3445445678899999998888878777654433345556777777 455444 Q ss_pred HHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcC Q lcl|NC_019530. 305 SAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGG-NQLGEMDHVNYFREKLYDALM 383 (560) Q Consensus 305 KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~ 383 (560) +|.+=+++-+++. .+| .++..+.+ .++ .|.+++.|.=. ..+.-++-.++..+.+.++++ T Consensus 248 e~~e~~r~~~~~~--------~~G-~~nag~~~-vl~----------~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fg 307 (441) T protein:vir:94 248 KARDRAREEFHKS--------FSG-TKQAGKVV-VLD----------ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFG 307 (441) T ss_pred HHHHHHHHHHHHH--------hcC-ccccCcce-ecC----------CCceEEEccCChhHHHHHHHHHHhHHHHHHHhC Confidence 4433333322211 122 22222232 333 24556655321 122223445677888999999 Q ss_pred CCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecch Q lcl|NC_019530. 384 IPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSY 463 (560) Q Consensus 384 VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~ 463 (560) ||.+.|+.... .++.++. .+-|..-+.-+-.++ .+.|-..| .+ ++. . ..+.|.- T Consensus 308 VPp~~lg~~~~----~~s~~q~---~~~~~~tl~P~~~~i----e~eln~kl-----~~--~~~---~--~~~~fd~--- 361 (441) T protein:vir:94 308 IPLHKFGIETA----NMSITDA---NLDYLSTLKPYITCV----CAELNFKF-----ND--EYV---N--REFKFDT--- 361 (441) T ss_pred CCHHHcCCCCC----CccHHHH---HHHHHHHHHHHHHHH----HHHHhhhc-----cc--ccc---C--ceEEeec--- Confidence 99999964332 1112232 222433333333332 23333322 22 121 1 2233422 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcc---cccccccccc Q lcl|NC_019530. 464 IREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQAD---ETGNYSGSDV 540 (560) Q Consensus 464 f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~---e~g~~~~~~~ 540 (560) .++...+ ...|++.++.+-. .-++|.+-+++ .+.+.. ++.++.+ ..++..+-++ T Consensus 362 -~~llr~D-~~~~~~~~~~~i~--~G~~T~NE~R~-~~gl~P------------------i~ggd~~~~~~~~n~~~~~~ 418 (441) T protein:vir:94 362 -TEIRVVD-EKTQAEIDKINID--SGKMNIDEIRQ-RDGLAP------------------IPGGNGSIHRVDLNHVNIEL 418 (441) T ss_pred -hhhhccC-HHHHHHHHHHHHh--CCCcCHHHHHH-HhCCCC------------------CCCCCcceEeeccccccccc Confidence 2222211 2344554443322 13566666663 344422 1111111 1122222111 Q ss_pred CCC--CCCcccc--cccCCcccC Q lcl|NC_019530. 541 SPL--KFKPETI--PFSGSTDDS 559 (560) Q Consensus 541 ~~~--~~~p~~~--~~~~~~~~~ 559 (560) .+. .-+++.. +.-||.++- T Consensus 419 ~~~~~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:94 419 VDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred ccccccccccccccccCCCCCCC Confidence 111 0011111 112333333 No 146 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=94.60 E-value=0.004 Score=33.74 Aligned_cols=401 Identities=12% Similarity=0.069 Sum_probs=167.6 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCcee-EeccccceeccccccchhhHHHHHHHHHHHhhccchhHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSII-LQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAV 93 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~-i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI 93 (560) |++- +.+.. .+. ...|.+ ++-.. +-++....+.+.+ |. ...+++|-|..|| T Consensus 1 m~~~-------~~~~~----~~~----~~~~~~-~~~~~~~~g~~~s~~~~~v-----~~-------~~al~~~~v~~cv 52 (419) T protein:vir:80 1 MFFS-------RQLLS----NLG----QTQPGS-GGWVSALLGSARSEAGQVV-----TP-------ASALSLTVLQNCV 52 (419) T ss_pred CCcc-------ccccc----ccC----cCCCCc-chhhHHhhcccccccCccc-----Ch-------HHhhccHHHHHHH Confidence 1111 11000 000 000100 00000 0011100000111 00 1345789999999 Q ss_pred HHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHH-hCCchhh----HHHHHHhhhhhhhhhhheecCCC Q lcl|NC_019530. 94 DIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHL-MDFDNTA----YQKIRKWYVDGRQAYHVIVDPTN 168 (560) Q Consensus 94 ~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l-l~f~~~~----~~~~R~wYvDGr~~~hkiiD~~~ 168 (560) +-|.+.+.-. |+ .+-...+ +..++.......++|+. =+=...+ +.++..+++.|.-|..++.|.. T Consensus 53 ~~ia~~ia~l-----p~--~~~~~~~--~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~- 122 (419) T protein:vir:80 53 TLLAESIAQL-----PV--ELYERSG--DDRKPATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQD- 122 (419) T ss_pred HHHHHhhccC-----ce--EEEEecC--CCcccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC- Confidence 8887765432 22 2222111 11111111111222221 1112223 4455566777887776665542 Q ss_pred CccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeee Q lcl|NC_019530. 169 KKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYID 248 (560) Q Consensus 169 ~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~h 248 (560) .-+.+|++|+|..|..++ +.++. .+|.+. |. ..++.+.|.|.. T Consensus 123 --G~~~~L~~i~~~~v~i~~-----~~~~~--------~~y~~~---------~~-------------~~~~~~~i~h~~ 165 (419) T protein:vir:80 123 --GVIQGLYPLDNEAVTVMK-----GPDLK--------PMYRVA---------GA-------------DPLPQRLVHHVR 165 (419) T ss_pred --CcEEEEEEecCceEEEEE-----CCCce--------EEEEEc---------Cc-------------cccchhheEEec Confidence 237899999998887532 11121 112221 11 125666676665 Q ss_pred cccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCC Q lcl|NC_019530. 249 SGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTG 328 (560) Q Consensus 249 sGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TG 328 (560) . .+.++-+.+|-++.|..++.....+++...=+---.+--+-++.++.. ++..+.++-+..+...+....-=....| T Consensus 166 ~--~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~-~~~~~~~~~~~~~~~~~~~~~~g~~n~g 242 (419) T protein:vir:80 166 W--MSINGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPTD-APALKDQASVDRITDGWNAKFGGSGNAK 242 (419) T ss_pred C--CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCC-CCcccCHHHHHHHHHHHHHHhcCccccC Confidence 3 344555677889999999998888887766554455556667777642 2222222223333333333321001112 Q ss_pred cccCcccchhhHhhhcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhH Q lcl|NC_019530. 329 KITGNAHLMGIAEDYWLPRREGQNATEIATVGG-GNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQ 407 (560) Q Consensus 329 ev~d~~~~msmlEDywLpRReGgrgTEIsTLpG-g~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItR 407 (560) .+ + +++ .|.+++-|.- ...+.-++-.++-.+.+.++++||...|+..++.+ .+++.+..+ T Consensus 243 ~~------~-vl~----------~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t--~~n~e~~~~ 303 (419) T protein:vir:80 243 KV------A-LLQ----------EGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERAT--FSNIEHQSL 303 (419) T ss_pred Cc------e-ecC----------CCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCC--cccHHHHHH Confidence 21 1 222 2556665532 12222334455778999999999999997543322 233333333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_019530. 408 EELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFV 487 (560) Q Consensus 408 DElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~v 487 (560) .|..++ |+--+ ..+.+.|- ++++++.++. ..+|. |..+ ++... -+..|++.++.+-. T Consensus 304 ---~f~~~~--l~P~~-~~ie~~l~-----~kll~~~~~~---~~~i~--fd~~----~l~~~-d~~~~~~~~~~~~~-- 360 (419) T protein:vir:80 304 ---QFVIYT--LLPWV-KRHEQAKT-----RDLLLPSERK---QYFIE--YNLA----GLLRG-DQSSRYAAYAVGRQ-- 360 (419) T ss_pred ---HHHHHH--HHHHH-HHHHHHHh-----hhccCccccC---CeEEE--Eech----hhhcc-CHHHHHHHHHHHHh-- Confidence 365553 22111 12223333 3445655542 12233 3322 22211 23345555544311 Q ss_pred cchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccc-cCCCCCCcccccccCCcc--------c Q lcl|NC_019530. 488 GSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSD-VSPLKFKPETIPFSGSTD--------D 558 (560) Q Consensus 488 Gky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~-~~~~~~~p~~~~~~~~~~--------~ 558 (560) +-+++.+-+++ ++.+. |-|. ||..... +--..-.|+..|.....+ + T Consensus 361 ~G~~T~NE~R~-~~g~~--------------------p~~g----GD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~ 415 (419) T protein:vir:80 361 WGWLSINDIRR-LENMP--------------------PVKG----GDIYLSPMNMVDASKPQPIPMGKTEPTKAALDEIG 415 (419) T ss_pred CCCcCHHHHHH-HhCCC--------------------CCCC----cceeeeccccccccccccccCCCCCchhhhHHHHH Confidence 13444444442 23331 2122 2211000 000000111111111111 1 Q ss_pred CC Q lcl|NC_019530. 559 SI 560 (560) Q Consensus 559 ~~ 560 (560) .+ T Consensus 416 ~~ 417 (419) T protein:vir:80 416 RI 417 (419) T ss_pred hh Confidence 11 No 147 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=94.43 E-value=0.0044 Score=33.49 Aligned_cols=381 Identities=11% Similarity=0.094 Sum_probs=162.2 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) |+.|+..- +..+. .++.. ++-.....+.+... . ..+..-+. +..+++|-|.+||. T Consensus 1 M~~f~~~~-------~~~~~-----~~~~~----~~~~~~~~~~~~~~-~-~~~~~v~~-------~~al~~~~v~~~i~ 55 (386) T protein:vir:49 1 MPIFNITN-------LATES-----PPINQ----ESFFDIADSDFLAS-L-NSSEWVSA-------ENALKNSDLFSIIS 55 (386) T ss_pred Cchhhhhc-------cCCCC-----cccch----hhhhhhhhcccccc-c-cCCceech-------hhhhccHHHHHHHH Confidence 33322110 00000 00111 11000000000000 0 00000011 12346899999997 Q ss_pred HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCee Q lcl|NC_019530. 95 IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIK 174 (560) Q Consensus 95 eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~ 174 (560) -|.+.+.- +.+.+.... .+.+.++=. ..+....=.+.++..++..|.-|..++.|... -+. T Consensus 56 ~ia~~ia~-------~p~~~~~~~------~~~l~~~PN---~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g---~~~ 116 (386) T protein:vir:49 56 QLSNDLAT-------AKITTSRKQ------LQGIVDNPS---NNANRFNFYQSIFAQMLLGGEAFAYRWRNDNG---RDM 116 (386) T ss_pred HHHHHhhh-------Cceeeccch------hhhhhhccC---CCCCHHHHHHHHHHHhhhcCCEEEEEEECCCC---cEE Confidence 77665532 222222221 111211111 11122223356677788899888877665533 378 Q ss_pred EeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeecccccC Q lcl|NC_019530. 175 KLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPL 254 (560) Q Consensus 175 eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~ 254 (560) +|.+|+|..|..++ +.+++ ..+|.|.. ++ ...+....++.+.|.|... ... T Consensus 117 ~l~~i~~~~v~v~~-----~~~~~-------~~~y~~~~-------~~--------~~~~~~~~~~~~evih~~~--~~~ 167 (386) T protein:vir:49 117 KWEYLRPSQVSFNR-----LDNQN-------GLYYNITF-------DD--------PHIAPKQHVPQNDILHFRL--LSV 167 (386) T ss_pred EEEEecCceeEEEE-----cCCCc-------eEEEEEEE-------cC--------ccccceeEEccccEEEecC--CCC Confidence 99999998887643 11121 11233321 00 0112345677788877763 333 Q ss_pred CCC-cchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCc Q lcl|NC_019530. 255 ANG-IVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGN 333 (560) Q Consensus 255 ~~~-~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~ 333 (560) +++ ...|.|..|++.+.....+++...=+--..+--+-+..++-...+. .+++ ++..+. ... ...|.+ T Consensus 168 ~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~-~~~~-~~~~~~----~~~--~n~g~~--- 236 (386) T protein:vir:49 168 DGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGLLD-FKTK-VSRSRQ----AMK--QMQGGP--- 236 (386) T ss_pred CCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChH-HHHH-HHHHHH----Hhc--cCCCCc--- Confidence 333 4679999999999999999888876666667777888887554443 3332 333222 111 223432 Q ss_pred ccchhhHhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHH Q lcl|NC_019530. 334 AHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE---MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEEL 410 (560) Q Consensus 334 ~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDEl 410 (560) + .++ .|.+++.|. .+..+ ++-.++....+.++++||.+.|+.+.+ ..++++.+ |. T Consensus 237 ---~-vl~----------~g~~~~~l~--~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~---~~~~~~~~---~~ 294 (386) T protein:vir:49 237 ---L-VLD----------DLEDFTPLE--IKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDGD---QQSSLEMI---YN 294 (386) T ss_pred ---e-ecC----------CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC---ccchHHHH---HH Confidence 1 111 255666663 23333 344578889999999999999975433 22222211 21 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcch Q lcl|NC_019530. 411 RFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSI 490 (560) Q Consensus 411 kF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky 490 (560) .+..| |+..+..+.. .|...| ...+.++. + ++.+.+. ..+...++.+ +-+-+ T Consensus 295 ~~~~~---i~~~l~~i~~-~~~~~l---------------~~~~~~~~--~----~~~~~d~-~~~~~~~~~l--~~~g~ 346 (386) T protein:vir:49 295 IYFKS---VSRYLRPFVS-EMSKKL---------------SCEVDVDI--S----PAVDPTG-SNYISLINSM--VKSGT 346 (386) T ss_pred HHHHH---HHHHHHHHHH-HHHHHh---------------cchhcccc--h----hhhccCH-HHHHHHHHHH--HhCCC Confidence 12222 3332222221 122222 11111111 0 0000000 0111111111 00122 Q ss_pred hhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 491 FSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 491 ~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) +|..|+-++.. ..|..+++... + ..|...|.-||.++-= T Consensus 347 ------------~t~nE~r~~l~------~~~~~~~~~~~--~-----------~~~~~~~~~gGd~~~~ 385 (386) T protein:vir:49 347 ------------LAQNQGLYILQ------QAEILPKELPD--G-----------KNPNRTSLKGGEINEQ 385 (386) T ss_pred ------------cCHHHHHHHHh------hCCCCCCcCcc--h-----------hccCCCCCCCCCCCCC Confidence 34444333211 22332221111 0 0111112223332222 No 148 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=94.39 E-value=0.0045 Score=33.43 Aligned_cols=449 Identities=12% Similarity=0.055 Sum_probs=181.7 Q ss_pred hhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhh----HHHHH--------HH Q lcl|NC_019530. 12 FGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLS----VKTVV--------EE 79 (560) Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~----~~~LI--------~~ 79 (560) .-+--+.+..+..+.+..++...- ...+.+...+....+ ...+| .+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~----------------------~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r 58 (501) T protein:vir:96 1 MEQTLFTDSTGQERVLNLRFHRES----------------------RIRYRADNLEELMVNNWELLKNFINHHKLRQAPR 58 (501) T ss_pred CceeeeeecccceeccccccchhH----------------------HhhhcccccccccCChHHHHHHHHHHHHHHHHHH Confidence 000001111111111111110000 000111111111000 11122 23 Q ss_pred HHHHhhccchhH-HH---------------------HHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHH Q lcl|NC_019530. 80 YQSMAQQPEIRK-AV---------------------DIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMH 137 (560) Q Consensus 80 YR~ma~~PEvd~-AI---------------------~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ 137 (560) |+.+..+.+-.. .| .-||+ ..+.---..||++..++.+ -.+.+.+.+..+++ T Consensus 59 ~~~~~~yY~g~~~~i~~~~~~~~~~~~~~ri~~n~~k~Ivd-~~~~yl~g~p~~~~~~~~~-----~~~~~~~~l~~~~~ 132 (501) T protein:vir:96 59 IQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISK-FKTGYLAGNPIRVEYDDND-----DNSQNDDAIKRIGR 132 (501) T ss_pred HHHHHHHhcCCCCcccCccccCccccccceeecchHHHHHH-HHhhhhcccCeeEeeCCcc-----chhHHHHHHHHHHH Confidence 444433332221 11 00111 1111122456666665532 22456677778888 Q ss_pred HhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCC-cccccccc------ccchhee Q lcl|NC_019530. 138 LMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEAIE------SVTLKYY 210 (560) Q Consensus 138 ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~~~------~~~e~~~ 210 (560) ..+|+....++.+..++.|+-|.+.-.|. +|-..+..+||+.+-+|..=....+ ..++.... ....+-+ T Consensus 133 ~n~~~~~~~~~~~~~~~~G~a~~~v~~de----dg~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~v 208 (501) T protein:vir:96 133 INDLDSLNRTLIRDLSQTGRAYEVIYRSE----YDETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQSAKDVVEI 208 (501) T ss_pred hcCHHHHHHHHHHHHhhcCeEEEEEEEcC----CCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeecCCCcEEEEEE Confidence 89999999999999999999998777654 3557789999999887742111000 01111110 0111223 Q ss_pred ecCccccccccccccccccccccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCc Q lcl|NC_019530. 211 YNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTEDATVIYAITRAP 289 (560) Q Consensus 211 Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~EDalVIyRitRAP 289 (560) |.+..-+. +............+-.--++| |++. .+.....|=++..+.....+. ++=+....-+..+.| T Consensus 209 yt~~~i~~-~~~~~~~~~~~~~~~~~g~vP---vv~~------~nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~ 278 (501) T protein:vir:96 209 YTDEHIYT-LDASDDFNEISVTTHAFGTVP---ITEY------LNNIDGIGDYETELYLIDLYDSAESDTANHMSDMADA 278 (501) T ss_pred EcCCcEEE-EeeCCCceeccccccCCCccc---eEEe------cCCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCc Confidence 44432221 111111000000000000112 2221 122233455555544444433 444445555666677 Q ss_pred cceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCC----CCccceeecCCCCCc Q lcl|NC_019530. 290 EKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREG----QNATEIATVGGGNQL 365 (560) Q Consensus 290 eRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReG----grgTEIsTLpGg~nL 365 (560) -+-+.-.+....+... .. |.. ++ .++++-..+ +.+..+..|-...+. T Consensus 279 ~l~i~G~~~~~~~~~~-----~~-~~~--~~---------------------~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 329 (501) T protein:vir:96 279 ILAIYGDLALPKGMQA-----SD-MKR--TR---------------------LMQLKPPKSADGKEGTVKAEYLTKSYDV 329 (501) T ss_pred eeeeecccccCcccch-----hh-hhh--cC---------------------eeeecccccccccccCcceeeEeccCCH Confidence 6665443322211100 00 000 11 122222222 122345555443333 Q ss_pred chH-HHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCh- Q lcl|NC_019530. 366 GEM-DHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDE- 443 (560) Q Consensus 366 gei-~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~- 443 (560) ..+ .-++-+++.+|...++|---++.-+ .| ..+..|.--.......+.+.++.|..-+...++.-+-+-++..+ T Consensus 330 ~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n---~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~ 405 (501) T protein:vir:96 330 SGAEAYKTRLNRDIHIFTNTPDMSDTNFS-GN---TSGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEF 405 (501) T ss_pred HHHHHHHHHHHHHHHHHhCCcccCccccc-cc---chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 222 2245556777888899843332221 12 22344544444456677777777777777777654433333221 Q ss_pred hHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCC Q lcl|NC_019530. 444 KDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGL 523 (560) Q Consensus 444 eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~ 523 (560) .+++ . ..|.+.|..--.-. +.+.++++..+. | .+|.+++++. |...++ .+++.++|++|..+- T Consensus 406 ~~~d--~-~~i~i~f~~~~p~n-------~~e~ad~~~kl~---g-~iS~et~~~~-l~~v~D-~~~E~~ri~~E~~~~- 468 (501) T protein:vir:96 406 KDFD--E-SLLKITFTPNLPKS-------LNEQVSILTGLG---G-QVSQETALSL-SGLVES-PNEELDKINKEMSEI- 468 (501) T ss_pred cccc--c-ccceEEeCCCCCcC-------HHHHHHHHHHHh---c-cCchHHHHHh-CCCCCC-HHHHHHHHHHHHHHh- Confidence 1221 1 13666675432212 224455566663 5 4999999987 555432 234445555554221 Q ss_pred CCCCCccccccccccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 524 YPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 524 ~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) .+++...+..+.. +.... + .-.+.+|+.= T Consensus 469 ~~~~~~~~~~~~~-~~~~~-----~--~~e~~~d~~e 497 (501) T protein:vir:96 469 DFKGYSNDFNEHV-GKYTD-----E--VKETHTDDFE 497 (501) T ss_pred hccccccchhhcc-cccCC-----c--CCCCCCCccc Confidence 1111111000000 00000 0 0001111111 No 149 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=94.26 E-value=0.0049 Score=33.24 Aligned_cols=471 Identities=12% Similarity=0.106 Sum_probs=180.1 Q ss_pred CCcc--cchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeE---eccccceeccccccchhhHHH Q lcl|NC_019530. 1 MAGY--GRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIIL---QGGANTYNYVGVESELLSVKT 75 (560) Q Consensus 1 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i---~~~~~~~~y~~~e~~~~~~~~ 75 (560) ||-. -|.+|.|++---+.... + ...++...+|+.-...--.-|...-+ .++.....-.....+..+-.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~-----~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 74 (535) T protein:vir:10 1 MAILKDLRNAFSLSNKKSTSYIE-----L-GDYDKDIVNKAIRPGRASARDTVDGIDIADGNVAGQYSVASISDVLSTKK 74 (535) T ss_pred ChhhHHHHHHHHhhhhhhhhhHH-----H-hhhhHHHHHhhhhhhhhhhhccccccccccCCcccccccCccccccCHHH Confidence 4432 24444444321111111 1 11133333333211111111111111 111100000001111112233 Q ss_pred HHHHHHHH-hhccchhHHHHHhhcceeee------cCCCceEEEeeecccch-hHHHHHHHHHHHHHHHHH-hCCchhhH Q lcl|NC_019530. 76 VVEEYQSM-AQQPEIRKAVDIIVNDVVTC------EEDETPVTVNLDKVEGI-SDTVKESITECFKEVMHL-MDFDNTAY 146 (560) Q Consensus 76 LI~~YR~m-a~~PEvd~AI~eIvneaiv~------d~~~~~V~l~Ld~~~~~-s~sik~~I~eeF~~il~l-l~f~~~~~ 146 (560) | +.+ ...|-|..||+.+.+.+.++ ......+.+.|...+.- ++..+.+. ..+.++|+. -+-....+ T Consensus 75 l----~~~~~~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~~-~~l~~lL~~~PN~~~~~~ 149 (535) T protein:vir:10 75 L----LKAYADNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKRA-HEIEDFIYNTGSEYYEWR 149 (535) T ss_pred H----HHHhccChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhhh-hHHHHHHHhCCCCCCChh Confidence 3 333 46788888888877765432 12233345555443211 11111111 122233321 11111222 Q ss_pred H----HHHHh----hhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccc Q lcl|NC_019530. 147 Q----KIRKW----YVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRN 218 (560) Q Consensus 147 ~----~~R~w----YvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~ 218 (560) + +++.. ++-|.-.|-.|+ .+...-+.+|++|||.+|.... +..+. +....|+.|.. T Consensus 150 ~~~~~~~~~lv~d~l~~~g~ay~~i~--r~~~G~~~~L~~l~p~~V~v~~-----d~~~~----~~~~~~~~~~~----- 213 (535) T protein:vir:10 150 DTFPRLLTKIINDMYVQDQINIERIF--KNDSNELDHFNAVDASKVVISY-----SPRSK----DQPRKFEQFVS----- 213 (535) T ss_pred HHHHHHHHHHHHHHHhhCCceEEEEE--ECCCCcEEEEEEeCCceeEEEE-----cCccc----cCceEEEEEec----- Confidence 2 22222 222333333332 2334458999999999887642 11111 11223333431 Q ss_pred cccccccccccccccccceecchhceeeeecc-ccc-CCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEc Q lcl|NC_019530. 219 QFTGQSGTSQNFQPSQQELVFDDESIVYIDSG-EEP-LANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYL 296 (560) Q Consensus 219 ~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsG-l~~-~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyI 296 (560) .+....++.+.|.|.+-. +-+ ..+++.+|-|+.|.+.+.....+++...=+----+--+-|..+ T Consensus 214 --------------~~~~~~~~~~eiih~~~~~~~~~~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~ 279 (535) T protein:vir:10 214 --------------ETKSVKFSERNLTFINYWNLSDTDRRGYGYSPVEASIPLIRAIYDTEQFNARFFSQGGTTRGILVI 279 (535) T ss_pred --------------CceeEEECcccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEe Confidence 112345777777776521 111 2345567889999999999988887655443333444566666 Q ss_pred cCCC---CchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchH---HH Q lcl|NC_019530. 297 DVGT---LGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEM---DH 370 (560) Q Consensus 297 DvGn---lpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei---~D 370 (560) +... +.+..+ +.+...+++.. .| ..+..+. .+++. .|.++.-|- .+..++ +- T Consensus 280 ~~~~~~~ls~e~~----e~lk~~~~~~~-----~G-~~nag~~-~vl~~---------~g~~~~~l~--~~~~D~qfle~ 337 (535) T protein:vir:10 280 DQDGDAQANQMML----AGIRRQWTSQG-----SG-LGGAWKI-PILAA---------KDAKFVNMT--QNSRDMEFDKF 337 (535) T ss_pred cCCCCcccCHHHH----HHHHHHHHHHh-----cC-ccccccc-ccccC---------CCceEEecC--CChhHHHHHHH Confidence 5432 222222 22222333221 12 1222222 23331 134443333 233333 33 Q ss_pred HHHHHHHHHHhcCCCccccCCCCccccccc--chhhh--hHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHhccCCC Q lcl|NC_019530. 371 VNYFREKLYDALMIPKSRLQEEGSINIGGS--NLAEI--TQEELRFSKFC----AGLRRRYSHFFMEFLRRQLILKGVTD 442 (560) Q Consensus 371 V~YF~~kLy~aL~VP~sRl~~~~~~~l~~~--~~~eI--tRDElkF~Kfi----~rLr~rfs~if~d~Lk~qLiLkgiit 442 (560) .++..+..-++.+||...|+-....+...+ ..... +--|-.+..|+ .-+.+++.. .|-.. +++ T Consensus 338 ~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie~----~ln~~-----Ll~ 408 (535) T protein:vir:10 338 LNFMIYDTAAIFQMQPEEINFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLSFIEQ----VINDK-----IMR 408 (535) T ss_pred HHHHHHHHHHHhCCCHHHhccccCcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHH----HHhhh-----ccc Confidence 456788899999999999865333221111 00000 01122222332 222222222 22222 233 Q ss_pred hhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHH---HHHHHHHHH--- Q lcl|NC_019530. 443 EKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEE---VKEQQTKIA--- 516 (560) Q Consensus 443 ~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeE---I~e~~kqIe--- 516 (560) ..+ ..+.|+|.. +.... ...|.++.+.+. .-++|.+-+++ .+.|..-| +--+.-+.. T Consensus 409 ~~~------~~~~f~f~~------l~~~d-~~~r~~~~~~~~---~g~lT~NE~R~-~~gl~piegGD~~~~~~~~~~~~ 471 (535) T protein:vir:10 409 YVD------TDYRFSFTL------GDAQD-KLQEEQVWKLKL---ANGYFINEYRK-DHGLKTVDGLDVPGFIGSAENFI 471 (535) T ss_pred ccC------CeEEEEecc------ccccC-HHHHHHHHHHHH---cCCCCHHHHHH-HhCCCCCCCccccccccchhhcc Confidence 221 134555532 12112 234555554332 23579998885 47876532 100000000 Q ss_pred Hhhh--cCCCCCCCcccccccc----------ccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 517 EEKK--KGLYPKVQADETGNYS----------GSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 517 ~E~~--~~~~~~p~~~e~g~~~----------~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) .... .+--|++.++...... .-+....+-+|+......|+.++. T Consensus 472 ~~~~~~~~~~p~~~~~~~~~~~~~~~q~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 527 (535) T protein:vir:10 472 NATGFGQPNVPDSSDDSGSTLGERERQERIQHSKDYEKGKDDPKSPLPKPSESDDV 527 (535) T ss_pred cccccccccCCCCCCCccccCCccccCcccccccccccCCCCCCCCCCcCCCCCcc Confidence 0000 0101111111000000 000001122333222223333333 No 150 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=94.23 E-value=0.005 Score=33.19 Aligned_cols=446 Identities=11% Similarity=0.034 Sum_probs=174.8 Q ss_pred cccccccCCCCCCCCce-----eEeccccc-eeccccccch----hhHHHHH--------HHHHHHhhccchhHHHHHhh Q lcl|NC_019530. 36 LTKATVVALDDAQDGSI-----ILQGGANT-YNYVGVESEL----LSVKTVV--------EEYQSMAQQPEIRKAVDIIV 97 (560) Q Consensus 36 ~~k~~s~~~~~~~dG~~-----~i~~~~~~-~~y~~~e~~~----~~~~~LI--------~~YR~ma~~PEvd~AI~eIv 97 (560) --+.++++.. +|.+ ......+. +.+...+... .-.+.+| .+|+.+..+.+-+.- .|. T Consensus 1 ~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~--~i~ 75 (502) T protein:vir:48 1 MMEQTLFTDS---TGQDLVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENH--DVL 75 (502) T ss_pred CceeEEEEec---chhHHHhhcccChhHHhhhcccchhhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--ccc Confidence 1111111110 0100 01111111 1111111111 1111222 234444333332210 011 Q ss_pred c-----------------------ceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhh Q lcl|NC_019530. 98 N-----------------------DVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYV 154 (560) Q Consensus 98 n-----------------------eaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYv 154 (560) + +..+.---..||++..++. ...+.+.+.+..+++..+|+....++.+...+ T Consensus 76 ~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~-----~~~~~~~~~l~~~~~~N~~~~~~~~~~~~~~~ 150 (502) T protein:vir:48 76 KSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDN-----EDNSQNDDAIKRIGRINDIDTHNRNLIRDLSQ 150 (502) T ss_pred ccccccccccccceeecchHHHHHHHHhhhhcccCeeEecCCc-----cchhHHHHHHHHHHhhcCHhHHHHHHHHHHhh Confidence 1 1111111233444444432 23356777788888889999999999999999 Q ss_pred hhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCC-cccccccc-----c-cchheeecCcccccccccccccc Q lcl|NC_019530. 155 DGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEAIE-----S-VTLKYYYNPNYNRNQFTGQSGTS 227 (560) Q Consensus 155 DGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~~~-----~-~~e~~~Y~p~~~~~~~~g~~~~~ 227 (560) .|+-|.+.-+|. +|-..++.+||+.+-+|..=....+ ..++.... . +...-+|++..-+. +.+..... T Consensus 151 ~G~a~~~v~~de----dg~~~i~~~~p~~~~~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~iyt~~~i~~-~~~~~~~~ 225 (502) T protein:vir:48 151 TGRAYEVIYRSE----YDETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNQHIYT-LDASDSFN 225 (502) T ss_pred cCeEEEEEEeCC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEEeecCCcEEEEEEEeCCeEEE-EEeCCcee Confidence 999998776654 3557799999999977742111110 01111110 0 01111344432221 11111100 Q ss_pred ccccccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHH Q lcl|NC_019530. 228 QNFQPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTEDATVIYAITRAPEKRAFYLDVGTLGKKSA 306 (560) Q Consensus 228 ~~~~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~EDalVIyRitRAPeRRvFyIDvGnlpk~KA 306 (560) ....-+..--++| |++. .++....|=++.++....-+. ++=+....-+-.+.|-+-+.=......+... T Consensus 226 ~~~~~~~~~g~vP---vv~~------~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~- 295 (502) T protein:vir:48 226 EISVTPHAFGTVP---ITEF------LNNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGMQA- 295 (502) T ss_pred eccceecCCCccc---eEEe------cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccccch- Confidence 0000000000111 1111 123334455555444433332 2233333444555555444322111111000 Q ss_pred HHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHH-HHHHHHHHHHHhcCCC Q lcl|NC_019530. 307 EEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMD-HVNYFREKLYDALMIP 385 (560) Q Consensus 307 eqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~-DV~YF~~kLy~aL~VP 385 (560) .- +.+ .++.+ +. -+..--..+.+..+.+|-...+...+. -++-+.+.+|+..++| T Consensus 296 -~~----~~~--~~~~~----------------~~-~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p 351 (502) T protein:vir:48 296 -SD----MKR--TRLMQ----------------LK-PPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTP 351 (502) T ss_pred -hh----hhh--cceee----------------cc-ccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCC Confidence 00 000 01100 00 000000112233455554433433333 3566777888888888 Q ss_pred ccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC---CChhHHHHhhhhhceeeeeecc Q lcl|NC_019530. 386 KSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGV---TDEKDWNEKIKPFIKFEFTSDS 462 (560) Q Consensus 386 ~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgi---it~eew~~~i~~~I~~~f~~Dn 462 (560) ---++ +|. |...+..|..-......-+.+.++.|..-+...++.-+-+-++ ....+|. .|.+.|.. + T Consensus 352 ~~~~~---~~~-~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~-----~i~i~f~~-~ 421 (502) T protein:vir:48 352 DMSDN---HFS-GNASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDES-----RLKITFTP-N 421 (502) T ss_pred CcCcc---ccc-cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc-----cceEEeCC-C Confidence 42222 211 2222344544444445556666666666666666553332232 2223332 36777743 3 Q ss_pred hHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCC--CCCCcccccccc-ccc Q lcl|NC_019530. 463 YIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLY--PKVQADETGNYS-GSD 539 (560) Q Consensus 463 ~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~--~~p~~~e~g~~~-~~~ 539 (560) .-.- +.+.++++..+. | .+|.+++++. |..+++ -+++.++|++|...--. .++...+.++.+ .+. T Consensus 422 ~p~d------~~e~a~~~~kl~---g-~iS~et~l~~-l~~v~D-~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~d~~ 489 (502) T protein:vir:48 422 LPKS------LYEQVSILNDLG---G-QVSQETALSL-SGLVEN-PTEELDKINEESSKIDFKGYPSYFYDNVGKYTDEV 489 (502) T ss_pred CCcC------HHHHHHHHHHHh---c-cCcHHHHHHh-CCCCCC-HHHHHHHHHHHHHhhhhhcccccccccccccCCCc Confidence 3221 123445555553 4 4899999977 666442 22334444444321100 111111111111 111 Q ss_pred cCCCCCCcccccc Q lcl|NC_019530. 540 VSPLKFKPETIPF 552 (560) Q Consensus 540 ~~~~~~~p~~~~~ 552 (560) ....+-.-|-+|- T Consensus 490 ~e~~~~~~~~~~~ 502 (502) T protein:vir:48 490 KETHTDDFERVYE 502 (502) T ss_pred cCCCCcCcCCCCC Confidence 1111111122222 No 151 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=94.16 E-value=0.0052 Score=33.11 Aligned_cols=382 Identities=11% Similarity=0.116 Sum_probs=161.3 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCcee-EeccccceeccccccchhhHHHHHHH Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSII-LQGGANTYNYVGVESELLSVKTVVEE 79 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~-i~~~~~~~~y~~~e~~~~~~~~LI~~ 79 (560) |. -|||.. +.. +...........+.. +|... +-+... +..+.. +. T Consensus 1 m~---m~~f~~----------~~~-----~~~~~~~~~~~~~~~----~~~~~~~~~~~~-----~~~~~~------v~- 46 (392) T protein:vir:10 1 MI---LPILNF----------INQ-----TNDPPEVGSVQSYFP----DGNDAQIMESLL-----GDNNEW------VS- 46 (392) T ss_pred Cc---chhhhh----------hhc-----ccccccccccccccc----cCchhhhhhhhc-----CCCCce------ec- Confidence 11 122211 110 000000011111111 11111 000000 000000 11 Q ss_pred HHHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchh----hHHHHHHhhhh Q lcl|NC_019530. 80 YQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNT----AYQKIRKWYVD 155 (560) Q Consensus 80 YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~----~~~~~R~wYvD 155 (560) -+..+++|-|.+||.-|.+.+.-+ .+.+.... . ..+.+ -=+-... -+.++..+++. T Consensus 47 ~~~al~~~~v~~~i~~ia~~ia~l-------p~~~~~~~--~----~~l~~-------~PN~~~t~~~f~~~~~~~lll~ 106 (392) T protein:vir:10 47 ARAALRNSDLFSIILQLSSDLAIV-------KINAEKKK--N----QGIID-------NPSTNANKHGFWQSMFAQLLLG 106 (392) T ss_pred hHHhhccHHHHHHHHHHHHhhccC-------ceeeccch--h----hhHhh-------cCCCCCCHHHHHHHHHHHhhhc Confidence 123356899999998777765432 22232221 1 11111 1122222 34455678888 Q ss_pred hhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCcccccccccccccccccccccc Q lcl|NC_019530. 156 GRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQ 235 (560) Q Consensus 156 Gr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~ 235 (560) |.-|..++.|. ...+.+|++|+|..|..++ +.+++. .+|.|... +. ..+. T Consensus 107 Gna~~~i~r~~---~g~~~~L~~l~~~~v~~~~-----~~~~~~-------~~y~~~~~-------~~--------~~~~ 156 (392) T protein:vir:10 107 GEAFAYRWRNA---NGADMKWEYLRPSQVNTYY-----FEYENG-------MYYNITFD-------DP--------KIEP 156 (392) T ss_pred CcEEEEEEECC---CCcEEEEEEEcCceeEEEE-----cCCCce-------EEEEEEec-------Cc--------ccce Confidence 99888877665 3448999999999887643 222321 12333211 00 1122 Q ss_pred ceecchhceeeeecccccCCCC-cchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHH Q lcl|NC_019530. 236 ELVFDDESIVYIDSGEEPLANG-IVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMM 314 (560) Q Consensus 236 ~~~i~~daI~y~hsGl~~~~~~-~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im 314 (560) ...++.+.|.|... .+.++. +..|.|..|...++....+++...=+=---+--+-+..++.+..+..++.+.++ T Consensus 157 ~~~~~~~eiih~~~--~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~--- 231 (392) T protein:vir:10 157 ILQAPQSDLIHMKL--LSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRS--- 231 (392) T ss_pred eEEEccccEEEecC--CCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHH--- Confidence 45577888888763 343442 468999999999999998887765333333445566667655444444433322 Q ss_pred HhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCC Q lcl|NC_019530. 315 GKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEG 393 (560) Q Consensus 315 ~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~ 393 (560) ..|+.. ...|. .+ .++ .|++++.|.-...-.+ ++=.+|..+.+.++++||...|+..+ T Consensus 232 ~~~~~~----~~~g~------~~-vl~----------~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~ 290 (392) T protein:vir:10 232 RSFMKR----SRSGG------PV-VLD----------DLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQG 290 (392) T ss_pred HHHhcc----ccCCC------ee-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC Confidence 233321 11222 11 121 2567777754333333 55567888999999999999996433 Q ss_pred cccccccchhhhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHH Q lcl|NC_019530. 394 SINIGGSNLAEITQEELRFSKFC-AGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAI 472 (560) Q Consensus 394 ~~~l~~~~~~eItRDElkF~Kfi-~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei 472 (560) ..+ ++. +-.+. |..++ .-+-++ +.+.|...|. +.-+++. + ..+..|. . T Consensus 291 ~~~---~~~-~~~~~---f~~~~l~P~~~~----ie~~l~~~L~-----~~~~~d~--~----~~~~~d~--~------- 339 (392) T protein:vir:10 291 DQQ---SSI-QQISG---MYASALNRYLRP----AISELEYKLS-----DHISVNM--R----PAIDPLG--D------- 339 (392) T ss_pred Ccc---cHH-HHHHH---HHHHHHHHHHHH----HHHHHHHhcc-----ccccccc--h----hhhccCH--H------- Confidence 211 111 11122 44322 222222 2223333332 1111111 0 0011110 0 Q ss_pred HHHHHHHHHHhhhhh-cchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccc-ccccccCCCCCCcccc Q lcl|NC_019530. 473 LNDRLASLNTVEPFV-GSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGN-YSGSDVSPLKFKPETI 550 (560) Q Consensus 473 ~~~Rl~~l~~~~p~v-Gky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~-~~~~~~~~~~~~p~~~ 550 (560) .+... ++..+ +-.+|..-++ ++. ...|..|+-....++- .-+++ -..|.+ T Consensus 340 --~~~~~---~~~l~~~g~~t~nE~r------------~~l------~~~g~~p~e~r~~e~l~~~~~G-----d~~~p~ 391 (392) T protein:vir:10 340 --NYLST---ISTATRWGALAENQAT------------FVL------QEAGYIPKDLPAPENTNKKTTG-----QSNEPV 391 (392) T ss_pred --HHHHH---HHHHHhCCCcCHHHHH------------HHH------HhcCCCccccchhcCCCCCCCC-----CCCCCC Confidence 11111 11111 1233333332 211 1234332211111110 00111 112333 Q ss_pred c Q lcl|NC_019530. 551 P 551 (560) Q Consensus 551 ~ 551 (560) | T Consensus 392 p 392 (392) T protein:vir:10 392 P 392 (392) T ss_pred C Confidence 3 No 152 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=94.16 E-value=0.0052 Score=33.11 Aligned_cols=382 Identities=11% Similarity=0.116 Sum_probs=161.3 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCcee-EeccccceeccccccchhhHHHHHHH Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSII-LQGGANTYNYVGVESELLSVKTVVEE 79 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~-i~~~~~~~~y~~~e~~~~~~~~LI~~ 79 (560) |. -|||.. +.. +...........+.. +|... +-+... +..+.. +. T Consensus 1 m~---m~~f~~----------~~~-----~~~~~~~~~~~~~~~----~~~~~~~~~~~~-----~~~~~~------v~- 46 (392) T protein:vir:39 1 MI---LPILNF----------INQ-----TNDPPEVGSVQSYFP----DGNDAQIMESLL-----GDNNEW------VS- 46 (392) T ss_pred Cc---chhhhh----------hhc-----ccccccccccccccc----cCchhhhhhhhc-----CCCCce------ec- Confidence 11 122211 110 000000011111111 11111 000000 000000 11 Q ss_pred HHHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchh----hHHHHHHhhhh Q lcl|NC_019530. 80 YQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNT----AYQKIRKWYVD 155 (560) Q Consensus 80 YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~----~~~~~R~wYvD 155 (560) -+..+++|-|.+||.-|.+.+.-+ .+.+.... . ..+.+ -=+-... -+.++..+++. T Consensus 47 ~~~al~~~~v~~~i~~ia~~ia~l-------p~~~~~~~--~----~~l~~-------~PN~~~t~~~f~~~~~~~lll~ 106 (392) T protein:vir:39 47 ARAALRNSDLFSIILQLSSDLAIV-------KINAEKKK--N----QGIID-------NPSTNANKHGFWQSMFAQLLLG 106 (392) T ss_pred hHHhhccHHHHHHHHHHHHhhccC-------ceeeccch--h----hhHhh-------cCCCCCCHHHHHHHHHHHhhhc Confidence 123356899999998777765432 22232221 1 11111 1122222 34455678888 Q ss_pred hhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCcccccccccccccccccccccc Q lcl|NC_019530. 156 GRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQ 235 (560) Q Consensus 156 Gr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~ 235 (560) |.-|..++.|. ...+.+|++|+|..|..++ +.+++. .+|.|... +. ..+. T Consensus 107 Gna~~~i~r~~---~g~~~~L~~l~~~~v~~~~-----~~~~~~-------~~y~~~~~-------~~--------~~~~ 156 (392) T protein:vir:39 107 GEAFAYRWRNA---NGADMKWEYLRPSQVNTYY-----FEYENG-------MYYNITFD-------DP--------KIEP 156 (392) T ss_pred CcEEEEEEECC---CCcEEEEEEEcCceeEEEE-----cCCCce-------EEEEEEec-------Cc--------ccce Confidence 99888877665 3448999999999887643 222321 12333211 00 1122 Q ss_pred ceecchhceeeeecccccCCCC-cchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHH Q lcl|NC_019530. 236 ELVFDDESIVYIDSGEEPLANG-IVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMM 314 (560) Q Consensus 236 ~~~i~~daI~y~hsGl~~~~~~-~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im 314 (560) ...++.+.|.|... .+.++. +..|.|..|...++....+++...=+=---+--+-+..++.+..+..++.+.++ T Consensus 157 ~~~~~~~eiih~~~--~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~--- 231 (392) T protein:vir:39 157 ILQAPQSDLIHMKL--LSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRS--- 231 (392) T ss_pred eEEEccccEEEecC--CCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHH--- Confidence 45577888888763 343442 468999999999999998887765333333445566667655444444433322 Q ss_pred HhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCC Q lcl|NC_019530. 315 GKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEG 393 (560) Q Consensus 315 ~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~ 393 (560) ..|+.. ...|. .+ .++ .|++++.|.-...-.+ ++=.+|..+.+.++++||...|+..+ T Consensus 232 ~~~~~~----~~~g~------~~-vl~----------~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~ 290 (392) T protein:vir:39 232 RSFMKR----SRSGG------PV-VLD----------DLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQG 290 (392) T ss_pred HHHhcc----ccCCC------ee-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC Confidence 233321 11222 11 121 2567777754333333 55567888999999999999996433 Q ss_pred cccccccchhhhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHH Q lcl|NC_019530. 394 SINIGGSNLAEITQEELRFSKFC-AGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAI 472 (560) Q Consensus 394 ~~~l~~~~~~eItRDElkF~Kfi-~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei 472 (560) ..+ ++. +-.+. |..++ .-+-++ +.+.|...|. +.-+++. + ..+..|. . T Consensus 291 ~~~---~~~-~~~~~---f~~~~l~P~~~~----ie~~l~~~L~-----~~~~~d~--~----~~~~~d~--~------- 339 (392) T protein:vir:39 291 DQQ---SSI-QQISG---MYASALNRYLRP----AISELEYKLS-----DHISVNM--R----PAIDPLG--D------- 339 (392) T ss_pred Ccc---cHH-HHHHH---HHHHHHHHHHHH----HHHHHHHhcc-----ccccccc--h----hhhccCH--H------- Confidence 211 111 11122 44322 222222 2223333332 1111111 0 0011110 0 Q ss_pred HHHHHHHHHHhhhhh-cchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccc-ccccccCCCCCCcccc Q lcl|NC_019530. 473 LNDRLASLNTVEPFV-GSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGN-YSGSDVSPLKFKPETI 550 (560) Q Consensus 473 ~~~Rl~~l~~~~p~v-Gky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~-~~~~~~~~~~~~p~~~ 550 (560) .+... ++..+ +-.+|..-++ ++. ...|..|+-....++- .-+++ -..|.+ T Consensus 340 --~~~~~---~~~l~~~g~~t~nE~r------------~~l------~~~g~~p~e~r~~e~l~~~~~G-----d~~~p~ 391 (392) T protein:vir:39 340 --NYLST---ISTATRWGALAENQAT------------FVL------QEAGYIPKDLPAPENTNKKTTG-----QSNEPV 391 (392) T ss_pred --HHHHH---HHHHHhCCCcCHHHHH------------HHH------HhcCCCccccchhcCCCCCCCC-----CCCCCC Confidence 11111 11111 1233333332 211 1234332211111110 00111 112333 Q ss_pred c Q lcl|NC_019530. 551 P 551 (560) Q Consensus 551 ~ 551 (560) | T Consensus 392 p 392 (392) T protein:vir:39 392 P 392 (392) T ss_pred C Confidence 3 No 153 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=94.14 E-value=0.0053 Score=33.07 Aligned_cols=403 Identities=11% Similarity=0.103 Sum_probs=170.1 Q ss_pred hhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHHHhhccee Q lcl|NC_019530. 22 VDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVV 101 (560) Q Consensus 22 ~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneai 101 (560) ....++|.+.... +.......+|.... .. ...+... +. +..+++|.|..||.-|.+.+. T Consensus 1 m~~~~~~~~~~~~------~~~~~~~~~~~~~~---~~--~~~g~~v---~~-------~~al~~~~v~~~i~~ia~~ia 59 (419) T protein:vir:57 1 MFIPQFWKGRPSE------NRVNWQVVPGGMRS---SS--SQAGVII---TP-------ETALALSAVRACVTLLAESVA 59 (419) T ss_pred CcchhhhccCCcc------cccccccccccccc---cc--ccCCcee---ch-------HHhhccHHHHHHHHHHHHhhc Confidence 2333333332111 11111111111110 00 0001100 11 123567889999987777654 Q ss_pred eecCCCceEEEeeecccchhHHHHHHHHH-HHHHHHH-HhCCchh----hHHHHHHhhhhhhhhhhheecCCCCccCeeE Q lcl|NC_019530. 102 TCEEDETPVTVNLDKVEGISDTVKESITE-CFKEVMH-LMDFDNT----AYQKIRKWYVDGRQAYHVIVDPTNKKGGIKK 175 (560) Q Consensus 102 v~d~~~~~V~l~Ld~~~~~s~sik~~I~e-eF~~il~-ll~f~~~----~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~e 175 (560) -. |+.+-=...++ .++.+.+ -...+|+ --+-... .+.++..+++.|.-|..++.|. ..-+.+ T Consensus 60 ~l-----p~~~~~~~~~g----~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~---~G~~~~ 127 (419) T protein:vir:57 60 QL-----PCVLYRRTENG----GREIAFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNG---RGDITE 127 (419) T ss_pred cC-----ceEEEEEcCCC----ceeccccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECC---CCcEEE Confidence 32 22221111111 1111111 1222221 1122223 3455567778898877766554 334899 Q ss_pred eEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeecccccCC Q lcl|NC_019530. 176 LVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLA 255 (560) Q Consensus 176 Lr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~ 255 (560) |.+|+|..|...+ +.++ ..||.|. + ..-.++.+.|.|... ...+ T Consensus 128 L~pl~~~~v~v~~-----~~~g--------~~~y~~~---------~------------~~~~~~~~~vih~r~--~~~d 171 (419) T protein:vir:57 128 LIPINPHKVIVLK-----GPDG--------MPYYDIP---------S------------IGEILPMRMVHHIKS--FSLD 171 (419) T ss_pred EEEEcCcceEEEE-----CCCc--------eEEEEEc---------C------------CceEEchhhEEEecC--cCCC Confidence 9999998887532 2222 1233332 0 012367777777753 2345 Q ss_pred CCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCccc Q lcl|NC_019530. 256 NGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAH 335 (560) Q Consensus 256 ~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~ 335 (560) +-+..|.+..|.+.+....-+++...=+----+--+-+...+.. +.....++-...+..++.++.- | .++.-+ T Consensus 172 ~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-~~~~~~~e~~~~~~~~~~~~~~-----g-~~nag~ 244 (419) T protein:vir:57 172 GYIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFE-AKAIASQAAVDAILAKWTERYG-----G-VRNAFS 244 (419) T ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCc-CCcccCHHHHHHHHHHHHHHhc-----c-cccccc Confidence 55567889999998888887777665443333444455555421 1222222333444445544321 1 111111 Q ss_pred chhhHhhhcccccCCCCccceeecCCCCCcchHH---HHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHH Q lcl|NC_019530. 336 LMGIAEDYWLPRREGQNATEIATVGGGNQLGEMD---HVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRF 412 (560) Q Consensus 336 ~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~---DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF 412 (560) .+ .++ .|+++..|- .+.-++. =.++..+.++++++||..-|+..+..+ .+++++..+ -| T Consensus 245 ~~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~sn~e~~~~---~f 306 (419) T protein:vir:57 245 VG-MLQ----------EGMTYKQLS--QDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKST--NNNIEHQGL---QY 306 (419) T ss_pred ce-ecC----------CCceEEEcC--CChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCc--cccHHHHHH---HH Confidence 11 121 256666664 3444433 334666889999999999997544322 233333222 25 Q ss_pred HHHH-HHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchh Q lcl|NC_019530. 413 SKFC-AGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIF 491 (560) Q Consensus 413 ~Kfi-~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~ 491 (560) .+++ .-+..+ +. +-+-++++++.++. . ..+.|..+.. .. .-+..|++.++.+-. .-++ T Consensus 307 ~~~~l~P~~~~----ie-----~~l~~~ll~~~~~~---~--~~i~fd~~~l----l~-~d~~~~~~~~~~~~~--~G~~ 365 (419) T protein:vir:57 307 VIYTMLAILKR----HE-----SAMMRDLLLPSERR---D--FYIEFNVSSL----LR-GDQKSRYESYALGRQ--WGWL 365 (419) T ss_pred HHHHHHHHHHH----HH-----HHHHhhccCccccC---C--eEEEEechhh----hc-cCHHHHHHHHHHHHh--CCCc Confidence 4442 222211 12 22334555665542 2 3444444332 11 122345555444321 1255 Q ss_pred hHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccc---ccccccc-ccCC-CCCCcccccccCCcccCC Q lcl|NC_019530. 492 SIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADE---TGNYSGS-DVSP-LKFKPETIPFSGSTDDSI 560 (560) Q Consensus 492 S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e---~g~~~~~-~~~~-~~~~p~~~~~~~~~~~~~ 560 (560) +.+-++. ++.+. |-|..|+ +++..+. ...+ .+=.|+-+|-+=+..-+= T Consensus 366 T~NE~R~-~~gl~--------------------p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 418 (419) T protein:vir:57 366 SVNDIRR-MENLT--------------------PIPGGDKYLTPLNMVDSKALTGIGKATPQQLKDIEAILCTR 418 (419) T ss_pred CHHHHHH-HhCCC--------------------CCCCcCeeeeccccccccccccccCCCcccCcchhhhhhcc Confidence 5555552 23331 2232221 1111000 0000 011111111111100000 No 154 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=94.01 E-value=0.0057 Score=32.90 Aligned_cols=413 Identities=12% Similarity=0.138 Sum_probs=175.5 Q ss_pred CCccc-chhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHH Q lcl|NC_019530. 1 MAGYG-RGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEE 79 (560) Q Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~ 79 (560) |.--- -|||++. +.+ +.+..........++ .|-+..+.. .... .. ...... . T Consensus 1 ~~~~~~mg~f~r~---~~~---------~~~~~~~~~~~~~~~-~~~~~~~~~------~~~~-~~-~~g~~v------~ 53 (432) T protein:vir:81 1 MPDEKKLGLFGQL---KAM---------FVPPDPVDIGGGQTF-TPVNATARD------LGII-IS-DTGAAV------N 53 (432) T ss_pred CCchhhcchhhhh---hhh---------ccccccccccccccc-ccCccchhh------hccc-cc-ccCccc------c Confidence 11100 0233321 111 111111000000011 111000000 0000 00 000000 1 Q ss_pred HHHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHH-hCCchhh----HHHHHHhhh Q lcl|NC_019530. 80 YQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHL-MDFDNTA----YQKIRKWYV 154 (560) Q Consensus 80 YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l-l~f~~~~----~~~~R~wYv 154 (560) ....+++|-|..||.-|.+.+.-+ |+.+--...++ .++.+.....++|+. =+-...+ +.++..++. T Consensus 54 ~~~al~~~~V~~~i~~Ia~~ia~l-----p~~~y~~~~~g----~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll 124 (432) T protein:vir:81 54 ADAIMRLDAVAACVKLVSQAIAAM-----PLTMYMRTPDG----RKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLL 124 (432) T ss_pred hHhhhccHHHHHHHHHHHHhhhhC-----ceeeEEecCCc----ceecccchHHHHHHhcccccCCHHHHHHHHHHHHhh Confidence 123457799999997777765433 33322121111 111111222222221 0111222 344556777 Q ss_pred hhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccc Q lcl|NC_019530. 155 DGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQ 234 (560) Q Consensus 155 DGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~ 234 (560) +|.-|..++.+ ...+++|.+|+|-++...+ +..| ..+|.|.. ..+ T Consensus 125 ~Gnayv~i~~~----~g~~~~L~~l~~~~v~v~~-----~~~g--------~~~y~~~~------------------~~g 169 (432) T protein:vir:81 125 DGTAYVRKVVT----DGRIESLQYLANDRLTITT-----DPKG--------NTAYRYRR------------------TDG 169 (432) T ss_pred cCCeEEEEEec----CCcEEEEEEEcCCceEEEE-----CCCC--------cEEEEEEe------------------cCc Confidence 88776655443 2468999999998776642 1122 11232321 112 Q ss_pred cceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhc--CccceEEEccCCCCchHHHHHHHHH Q lcl|NC_019530. 235 QELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITR--APEKRAFYLDVGTLGKKSAEEYMTM 312 (560) Q Consensus 235 ~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitR--APeRRvFyIDvGnlpk~KAeqYl~~ 312 (560) ....++.+.|.|... .+.++-+.+|-|+.|.+.+..-..+++... +.++ +--.-|..+| +.|-+..++...+ T Consensus 170 ~~~~~~~~~iih~r~--~~~dg~~G~spi~~~~~~i~~~~~~~~~~~--~~f~ng~~~~gil~~~-~~l~~e~~~~~~~- 243 (432) T protein:vir:81 170 QMIDIPKQQIWKIMG--YSLDGENGLSAIRYGAQIFGTAIAAEAQAA--RAFRNGQLQSVYYQID-RFLTDDQYDSFAK- 243 (432) T ss_pred eEEEEccccEEEecC--CCCCCcccccHHHHHHHHHHHHHHHHHHHH--HHHhcCCCcceEEecC-CCCCHHHHHHHHH- Confidence 345688888877752 344555667899999988887777766543 2232 2223445554 4555444433322 Q ss_pred HHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCcccc Q lcl|NC_019530. 313 MMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE---MDHVNYFREKLYDALMIPKSRL 389 (560) Q Consensus 313 im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl 389 (560) +|..- ...|. .+ .++ .|++++.|. .+..+ ++-.+|....+.++++||...| T Consensus 244 ---~~~~~----~nag~------~~-vl~----------~g~~~~~l~--~~~~d~q~le~~~~~~~~Ia~~fgVPp~~l 297 (432) T protein:vir:81 244 ---KVSGS----VEAGR------AP-LLE----------GGMDVKSLG--LNPVDAQLLQSRQYSVESICRFFGVPPSMI 297 (432) T ss_pred ---HHhhh----hcCCC------ce-ecC----------CCceEEEcc--CCHHHHHHHHHHHHHHHHHHHHhCCCHHHc Confidence 22210 11122 11 222 245566553 33333 3444578889999999999999 Q ss_pred CCCCcccc-cccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHH Q lcl|NC_019530. 390 QEEGSINI-GGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQ 468 (560) Q Consensus 390 ~~~~~~~l-~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elk 468 (560) +.....+. ..++..+..+. |.++ .|+--+. .+.+.|-. +++++.++. ...++|..+ ++. T Consensus 298 g~~~~~~~~~~sn~eq~~~~---f~~~--tl~P~~~-~ie~~l~~-----kLl~~~~~~-----~~~~~fd~~----~ll 357 (432) T protein:vir:81 298 GHSSAGTTSWGSGIESQQLG---FLTM--TLSPWLR-RIEQSIAL-----NLLSPAERR-----RYFADFDTS----ALL 357 (432) T ss_pred CCcCCccccccchHHHHHHH---HHHH--HHHHHHH-HHHHHHHh-----hccCccccC-----ceEEEeech----hhh Confidence 75433221 22333333333 5543 3432221 23333333 445665542 234445433 222 Q ss_pred HHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcc----ccccccccccCCCC Q lcl|NC_019530. 469 ENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQAD----ETGNYSGSDVSPLK 544 (560) Q Consensus 469 e~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~----e~g~~~~~~~~~~~ 544 (560) ... +.+|.+.++.+-. +-++|.+-+++ .|.+.. -|..+ .+++..+-+..+.+ T Consensus 358 r~d-~~~r~~~~~~~~~--~G~~t~NE~R~-~~glpp--------------------~~g~~~~~~~~~~~~pl~~~~~~ 413 (432) T protein:vir:81 358 RAD-SAARSSYYSQLVN--NGLMTRDEARE-IEGLPK--------------------LGGNAAVLTVQSAMVPLDSIGLQ 413 (432) T ss_pred ccC-HHHHHHHHHHHHh--CCCCCHHHHHH-HhCCCC--------------------CCCCcceEeecCcccchhhhccC Confidence 221 3456666555421 23666666663 344421 11111 12222222222233 Q ss_pred CCcccccccCCcccCC Q lcl|NC_019530. 545 FKPETIPFSGSTDDSI 560 (560) Q Consensus 545 ~~p~~~~~~~~~~~~~ 560 (560) -.|+.....|..++.- T Consensus 414 ~~~~~~~~~~n~~~~~ 429 (432) T protein:vir:81 414 ASPEPASGLGNQQQDK 429 (432) T ss_pred CCCCCCCCCCCccccc Confidence 3343333233222222 No 155 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=93.95 E-value=0.0058 Score=32.83 Aligned_cols=400 Identities=12% Similarity=0.085 Sum_probs=170.0 Q ss_pred hhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccc Q lcl|NC_019530. 9 FGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPE 88 (560) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PE 88 (560) -|+|. . . ++ ..+..++....+....-.+........+ +. ..-+.+|- T Consensus 1 Mg~f~------~------~-----~~----r~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~-------~~al~~~~ 47 (416) T protein:vir:45 1 MGIFY------K------N-----EK----RDLQYNEDDLQMMVQTLPGFQGTKLRQY-----KD-------IEAIRHSD 47 (416) T ss_pred CCccc------c------c-----cc----ccccCCCcchhHHHHHhccccccCcccc-----ch-------hhhhcchH Confidence 22221 0 0 00 0111111111111100000000000000 00 12356778 Q ss_pred hhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCc--------hhhHHHHHHhhhhhhhhh Q lcl|NC_019530. 89 IRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFD--------NTAYQKIRKWYVDGRQAY 160 (560) Q Consensus 89 vd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~--------~~~~~~~R~wYvDGr~~~ 160 (560) |..||+-|-+.+.-+ |+.+.=+... . . -+.++.+|+-. .--+.++..+...|.-|. T Consensus 48 v~~cv~~Ia~~iA~~-----p~~~~~~~~~--~--~-------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~ 111 (416) T protein:vir:45 48 IFTAVMMIASDLARM-----PIRVTVNGQI--N--Y-------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYI 111 (416) T ss_pred HHHHHHHHHHhhccC-----ceEEecCccc--c--c-------cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEE Confidence 899997776655432 3333211111 0 0 12233333321 223566677788999888 Q ss_pred hheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecc Q lcl|NC_019530. 161 HVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFD 240 (560) Q Consensus 161 hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~ 240 (560) .++.|.. .=+++|.+|+|..+..++ +..|. .++.|.. ..+. .......++ T Consensus 112 ~i~r~~~---G~~~~L~~i~~~~v~v~~-----~~~g~--------~~~~~~~------~~~~--------~~~~~~~~~ 161 (416) T protein:vir:45 112 EITRDKT---GEPMNLTFRKTSEIELKS-----DARGR--------LYYFHQR------IDSN--------GNNIERNVK 161 (416) T ss_pred EEEECCC---CcEEEEEEEcCceeEEEE-----CCCcc--------EEEEEEE------ecCC--------CceeEEEEc Confidence 7776542 226899999998886542 22231 1222210 0010 011234577 Q ss_pred hhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhh Q lcl|NC_019530. 241 DESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNR 320 (560) Q Consensus 241 ~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk 320 (560) .+.|.|.+. .+.++=...|-|+.|++++......++..-=+=-.-+--+-|..++ |.+...+|.+=+++-.++.. T Consensus 162 ~~evihir~--~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~~~~~~~~~~~~~~~~~~~-- 236 (416) T protein:vir:45 162 FEDMLDIKF--YSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFHKSF-- 236 (416) T ss_pred cccEEEecc--CCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC-CCCCCHHHHHHHHHHHHHHh-- Confidence 777777763 3444445668899999999888888887664433444556666776 45544444333333222211 Q ss_pred eeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCc-chHHHHHHHHHHHHHhcCCCccccCCCCcccccc Q lcl|NC_019530. 321 NAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQL-GEMDHVNYFREKLYDALMIPKSRLQEEGSINIGG 399 (560) Q Consensus 321 ~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nL-gei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~ 399 (560) +| .++.-+.+ .++ .|.+++.|.-...- .-++-.++.++.+.++++||.+.|+.+... T Consensus 237 ------~g-~~nag~~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~---- 294 (416) T protein:vir:45 237 ------SG-TKQAGKVV-VLD----------ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETAN---- 294 (416) T ss_pred ------cC-ccccCcee-ecC----------CCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCC---- Confidence 12 12222222 222 24556655321111 224445667889999999999999643321 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHH Q lcl|NC_019530. 400 SNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLAS 479 (560) Q Consensus 400 ~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~ 479 (560) ++.++. .+-|..-|.-+-.++..-+. ..| .+ ++. ...+. |. +.++...+ ...|++. T Consensus 295 ~~~~~~---~~~~~~~l~P~~~~ie~~ln----~~l-----~~--~~~---~~~~~--f~----~~~l~~~D-~~~~~~~ 350 (416) T protein:vir:45 295 MSITDA---NLDYLSTLKPYITCVCAELN----FKF-----ND--EYV---NREFK--FD----TTEIRVVD-EKTQAEI 350 (416) T ss_pred ccHHHH---HHHHHHHHHHHHHHHHHHHh----hhc-----cc--ccc---CceEE--Ee----chhhhccC-HHHHHHH Confidence 111222 23243333333333333222 222 22 121 12233 32 22332211 2344554 Q ss_pred HHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcc---ccccccccccCCC--CCCccc--ccc Q lcl|NC_019530. 480 LNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQAD---ETGNYSGSDVSPL--KFKPET--IPF 552 (560) Q Consensus 480 l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~---e~g~~~~~~~~~~--~~~p~~--~~~ 552 (560) ++.+-.- -++|.+-+++ .|++.. ++.++.+ ..++..+-++.+. .-+++. -+. T Consensus 351 ~~~~~~~--G~~T~NE~R~-~~gl~p------------------~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~ 409 (416) T protein:vir:45 351 DKINIDS--GKMNIDEIRQ-RDGLAP------------------IPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKL 409 (416) T ss_pred HHHHHhC--CCcCHHHHHH-HhCCCC------------------CCCCCcceEeecccccccccccccCccccccccccc Confidence 4443221 2556665553 244421 2222221 1222222222211 011111 123 Q ss_pred cCCcccC Q lcl|NC_019530. 553 SGSTDDS 559 (560) Q Consensus 553 ~~~~~~~ 559 (560) .||.++- T Consensus 410 kgGe~n~ 416 (416) T protein:vir:45 410 KGGEENE 416 (416) T ss_pred CCCCCCC Confidence 3444444 No 156 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=93.95 E-value=0.0058 Score=32.83 Aligned_cols=400 Identities=12% Similarity=0.085 Sum_probs=170.0 Q ss_pred hhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccc Q lcl|NC_019530. 9 FGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPE 88 (560) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PE 88 (560) -|+|. . . ++ ..+..++....+....-.+........+ +. ..-+.+|- T Consensus 1 Mg~f~------~------~-----~~----r~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~-------~~al~~~~ 47 (416) T protein:vir:81 1 MGIFY------K------N-----EK----RDLQYNEDDLQMMVQTLPGFQGTKLRQY-----KD-------IEAIRHSD 47 (416) T ss_pred CCccc------c------c-----cc----ccccCCCcchhHHHHHhccccccCcccc-----ch-------hhhhcchH Confidence 22221 0 0 00 0111111111111100000000000000 00 12356778 Q ss_pred hhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCc--------hhhHHHHHHhhhhhhhhh Q lcl|NC_019530. 89 IRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFD--------NTAYQKIRKWYVDGRQAY 160 (560) Q Consensus 89 vd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~--------~~~~~~~R~wYvDGr~~~ 160 (560) |..||+-|-+.+.-+ |+.+.=+... . . -+.++.+|+-. .--+.++..+...|.-|. T Consensus 48 v~~cv~~Ia~~iA~~-----p~~~~~~~~~--~--~-------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~ 111 (416) T protein:vir:81 48 IFTAVMMIASDLARM-----PIRVTVNGQI--N--Y-------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYI 111 (416) T ss_pred HHHHHHHHHHhhccC-----ceEEecCccc--c--c-------cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEE Confidence 899997776655432 3333211111 0 0 12233333321 223566677788999888 Q ss_pred hheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecc Q lcl|NC_019530. 161 HVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFD 240 (560) Q Consensus 161 hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~ 240 (560) .++.|.. .=+++|.+|+|..+..++ +..|. .++.|.. ..+. .......++ T Consensus 112 ~i~r~~~---G~~~~L~~i~~~~v~v~~-----~~~g~--------~~~~~~~------~~~~--------~~~~~~~~~ 161 (416) T protein:vir:81 112 EITRDKT---GEPMNLTFRKTSEIELKS-----DARGR--------LYYFHQR------IDSN--------GNNIERNVK 161 (416) T ss_pred EEEECCC---CcEEEEEEEcCceeEEEE-----CCCcc--------EEEEEEE------ecCC--------CceeEEEEc Confidence 7776542 226899999998886542 22231 1222210 0010 011234577 Q ss_pred hhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhh Q lcl|NC_019530. 241 DESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNR 320 (560) Q Consensus 241 ~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk 320 (560) .+.|.|.+. .+.++=...|-|+.|++++......++..-=+=-.-+--+-|..++ |.+...+|.+=+++-.++.. T Consensus 162 ~~evihir~--~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~~~~~~~~~~~~~~~~~~~-- 236 (416) T protein:vir:81 162 FEDMLDIKF--YSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFHKSF-- 236 (416) T ss_pred cccEEEecc--CCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC-CCCCCHHHHHHHHHHHHHHh-- Confidence 777777763 3444445668899999999888888887664433444556666776 45544444333333222211 Q ss_pred eeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCc-chHHHHHHHHHHHHHhcCCCccccCCCCcccccc Q lcl|NC_019530. 321 NAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQL-GEMDHVNYFREKLYDALMIPKSRLQEEGSINIGG 399 (560) Q Consensus 321 ~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nL-gei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~ 399 (560) +| .++.-+.+ .++ .|.+++.|.-...- .-++-.++.++.+.++++||.+.|+.+... T Consensus 237 ------~g-~~nag~~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~---- 294 (416) T protein:vir:81 237 ------SG-TKQAGKVV-VLD----------ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETAN---- 294 (416) T ss_pred ------cC-ccccCcee-ecC----------CCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCC---- Confidence 12 12222222 222 24556655321111 224445667889999999999999643321 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHH Q lcl|NC_019530. 400 SNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLAS 479 (560) Q Consensus 400 ~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~ 479 (560) ++.++. .+-|..-|.-+-.++..-+. ..| .+ ++. ...+. |. +.++...+ ...|++. T Consensus 295 ~~~~~~---~~~~~~~l~P~~~~ie~~ln----~~l-----~~--~~~---~~~~~--f~----~~~l~~~D-~~~~~~~ 350 (416) T protein:vir:81 295 MSITDA---NLDYLSTLKPYITCVCAELN----FKF-----ND--EYV---NREFK--FD----TTEIRVVD-EKTQAEI 350 (416) T ss_pred ccHHHH---HHHHHHHHHHHHHHHHHHHh----hhc-----cc--ccc---CceEE--Ee----chhhhccC-HHHHHHH Confidence 111222 23243333333333333222 222 22 121 12233 32 22332211 2344554 Q ss_pred HHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcc---ccccccccccCCC--CCCccc--ccc Q lcl|NC_019530. 480 LNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQAD---ETGNYSGSDVSPL--KFKPET--IPF 552 (560) Q Consensus 480 l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~---e~g~~~~~~~~~~--~~~p~~--~~~ 552 (560) ++.+-.- -++|.+-+++ .|++.. ++.++.+ ..++..+-++.+. .-+++. -+. T Consensus 351 ~~~~~~~--G~~T~NE~R~-~~gl~p------------------~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~ 409 (416) T protein:vir:81 351 DKINIDS--GKMNIDEIRQ-RDGLAP------------------IPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKL 409 (416) T ss_pred HHHHHhC--CCcCHHHHHH-HhCCCC------------------CCCCCcceEeecccccccccccccCccccccccccc Confidence 4443221 2556665553 244421 2222221 1222222222211 011111 123 Q ss_pred cCCcccC Q lcl|NC_019530. 553 SGSTDDS 559 (560) Q Consensus 553 ~~~~~~~ 559 (560) .||.++- T Consensus 410 kgGe~n~ 416 (416) T protein:vir:81 410 KGGEENE 416 (416) T ss_pred CCCCCCC Confidence 3444444 No 157 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=93.59 E-value=0.007 Score=32.38 Aligned_cols=439 Identities=11% Similarity=0.050 Sum_probs=169.1 Q ss_pred chhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccch---------hhHHHH Q lcl|NC_019530. 6 RGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESEL---------LSVKTV 76 (560) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~---------~~~~~L 76 (560) --|.-|..|+.- +.+++- +-...+.+...+. ....+. .+.+... ..-.+. T Consensus 1 ~~~~~~~~~~~~--~~~~~~-----------~~~~~~~~~~~~~-----~~~~~~---~~~~~~~~~~~i~~~i~~~~~~ 59 (492) T protein:vir:94 1 MQFIQLISQVAQ--ALIKGG-----------NILYPSQPTQTEI-----FDAIVR---TNNKPETLEEMIVRYIKQHLEK 59 (492) T ss_pred ChHHHHHHHHHH--HHhcCC-----------ceeecCccchhhh-----hhcccc---cCCchhhHHHHHHHHHHHHHHH Confidence 112222222110 011111 1111111111110 000000 0111110 001122 Q ss_pred HHHHHHHhhccchhHHH---------------------------HHhhcceeeecCCCceEEEeeecccchhHHHHHHHH Q lcl|NC_019530. 77 VEEYQSMAQQPEIRKAV---------------------------DIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESIT 129 (560) Q Consensus 77 I~~YR~ma~~PEvd~AI---------------------------~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~ 129 (560) +.+|+.+..+.+-+..| .-||+- .+.---..||++..++. ... T Consensus 60 ~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~-~~~yl~G~p~~~~~~d~---------~~~ 129 (492) T protein:vir:94 60 LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQ-KVSYIVGKPIAFKHTDD---------EVV 129 (492) T ss_pred HHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHH-HHhhhcccCceeccCch---------HHH Confidence 34455555544433211 011111 11111234555544432 223 Q ss_pred HHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccC-Ccccccccc--ccc Q lcl|NC_019530. 130 ECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAM-REGGIEAIE--SVT 206 (560) Q Consensus 130 eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~-~~g~~~~~~--~~~ 206 (560) +..+.+++ .+|+....++.+..++.|+-|.+.-+|. +|-..++.+||+.+-++..-.... ....+.... ... T Consensus 130 ~~l~~~~~-n~~~~~~~~~~~~a~~~G~a~~~v~~d~----dg~~~~~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~ 204 (492) T protein:vir:94 130 KRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDE----EGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENET 204 (492) T ss_pred HHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEecC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEeeccce Confidence 34444443 4789999999999999999998766543 466789999999987763211000 000111111 000 Q ss_pred hheeecCcccc-cccccccccccccccccccee-------cchhceeeeecccccCCCCcchhhhHHHHHHHHHHH-HHH Q lcl|NC_019530. 207 LKYYYNPNYNR-NQFTGQSGTSQNFQPSQQELV-------FDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTE 277 (560) Q Consensus 207 e~~~Y~p~~~~-~~~~g~~~~~~~~~~~~~~~~-------i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~E 277 (560) ..-+|++..-. +...+... ............ +..=.|++.. ++....|=++..+....-+. ++= T Consensus 205 ~~~~y~~~~v~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~g~vPvv~~~------nn~~~~sd~e~v~~liDa~d~~~S 277 (492) T protein:vir:94 205 KVEYWDKVTVNYYVYENGSL-IPDYSNNLENSKTHFSTGSWGKIPFIPFK------NNDLEISDIFMYKTLIDAYNRRLS 277 (492) T ss_pred eEEEEecCeEEEEEEecCee-eeccccccccccccccccCCCccceEEec------CCCCCCCchHHHHHHHHHHHHHHH Confidence 11122221111 00000000 000000000000 0000111111 12223345554444433333 234 Q ss_pred HHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCcccee Q lcl|NC_019530. 278 DATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIA 357 (560) Q Consensus 278 DalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIs 357 (560) +....-+.+..|-+-+.-.+..+.+. .+..+..++..--++ +..+. T Consensus 278 ~~~~~~~~~~~p~lv~~g~~~~~~~~--------------------------------~~~~~~~~~~~~~~~--~~~~~ 323 (492) T protein:vir:94 278 DLSNTFKDSNELTYVLKNYDDQELPE--------------------------------FKRLLRYYGAIKVSD--NGGVD 323 (492) T ss_pred HHHHHHHHhcCceeeeecCCcccchh--------------------------------hHHHHhhccceecCC--CCcce Confidence 44445566666655443332222111 111111122211112 22355 Q ss_pred ecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 358 TVGGGNQLGEM-DHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLI 436 (560) Q Consensus 358 TLpGg~nLgei-~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLi 436 (560) +|-...+.+.+ .-+.-+.+.+|+-.++|-- ..+ .|. |...+..|..-+.....-+.+.++.|..-+...++.-+- T Consensus 324 ~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~--~~~-~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~ 399 (492) T protein:vir:94 324 TIQVEVPVENSKKYLDELYQKIMLFGQAVDF--SSD-KFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFE 399 (492) T ss_pred eEeccCCHHHHHHHHHHHHHHHHHHhCCcCC--Ccc-ccc-cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 55443343333 3446677788888999842 221 111 112233454444445566777777777777776665444 Q ss_pred hccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCH---HHHHHHHH Q lcl|NC_019530. 437 LKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSD---EEVKEQQT 513 (560) Q Consensus 437 Lkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tD---eEI~e~~k 513 (560) +-|+ ..+|. .|.+.|...---.+ .+.++++..+ +| .+|.+++++. |..++ +|++++.+ T Consensus 400 ~~~~--~~~~~-----~i~v~f~~~~p~~~-------~e~~~~~~kl---~g-iiS~et~~~~-l~~v~d~~~E~eri~~ 460 (492) T protein:vir:94 400 HFDI--KGEHK-----DVDISFNYNKVANT-------ELQVQTAQQS---MG-IVSHETVLEN-HPFVEDLQAELERIEQ 460 (492) T ss_pred HhcC--Ccccc-----eeeEEecCCCCCCH-------HHHHHHHHHH---hc-cCchHHHHHh-CCCCCCHHHHHHHHHH Confidence 4443 33443 25566643322112 2234555555 35 4999999976 55544 34444443 Q ss_pred HHHHhhhcCCCCCCCcccccc-ccccccCCCCCCcc Q lcl|NC_019530. 514 KIAEEKKKGLYPKVQADETGN-YSGSDVSPLKFKPE 548 (560) Q Consensus 514 qIe~E~~~~~~~~p~~~e~g~-~~~~~~~~~~~~p~ 548 (560) +.++.++. .++....+. .++++.....=.-| T Consensus 461 E~~~~~~~----~~~~~~~~~~~~~~~~~~~~~e~e 492 (492) T protein:vir:94 461 EQMEYNKQ----LPNLDDGGADSAQQQERSNNKESE 492 (492) T ss_pred HHHHHHhh----ccccccccCCCCccccCCccccCC Confidence 33333221 111110000 00011110000011 No 158 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=93.40 E-value=0.0077 Score=32.18 Aligned_cols=425 Identities=10% Similarity=0.059 Sum_probs=163.5 Q ss_pred hhhhhhhhhhhhhhcccccch-hhhccccccccCCCCCCCCceeEeccccceeccccccch-hhHHHHHHHHHHHhhccc Q lcl|NC_019530. 11 LFGGGGLVNAKVDTDKLAQKQ-DELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESEL-LSVKTVVEEYQSMAQQPE 88 (560) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~-~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~-~~~~~LI~~YR~ma~~PE 88 (560) .|+.+. ...++.-..+ -|++..+ ...+.... ..+ ..-..-+.+|+.+..+.+ T Consensus 1 ~~~~~~-----~~~~~~~~~~~~~~~~~~-------~~~~~~~i--------------~~~i~~~~~~~~~~~~~~~YY~ 54 (474) T protein:vir:94 1 MFNIIR-----MPWDKPYGEEVVEQLKPQ-------FETQEEMI--------------VRLIDDHRKQLDKITVGQRYYD 54 (474) T ss_pred Cccccc-----ccCCCchhhHHHHhhhhc-------ccCHHHHH--------------HHHHHHHHHHHHHHHHHHHHhc Confidence 111000 0011100000 0000000 00000000 000 000011122333333222 Q ss_pred hhHH---------------------------HHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCC Q lcl|NC_019530. 89 IRKA---------------------------VDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDF 141 (560) Q Consensus 89 vd~A---------------------------I~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f 141 (560) -+.. ..-||+ ..+.---..||++..++.+ ..+..+.+++ .+| T Consensus 55 g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd-~~~~~l~g~p~~~~~~d~~---------~~~~l~~~~~-n~~ 123 (474) T protein:vir:94 55 KDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVD-QKVSYVASKPVTYSCEDEN---------VLKVIHDVLD-TRW 123 (474) T ss_pred cccchhcccchhccccccccccCcceeecchHHHHHH-HHHhhhhcCCceeccCcHH---------HHHHHHHHHh-ccH Confidence 2111 111221 1122223455565544422 2222333433 578 Q ss_pred chhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhccc-CCcccccc--ccccchheeecCccccc Q lcl|NC_019530. 142 DNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKA-MREGGIEA--IESVTLKYYYNPNYNRN 218 (560) Q Consensus 142 ~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~-~~~g~~~~--~~~~~e~~~Y~p~~~~~ 218 (560) +....++.+.+++.|+-|.+.-+|. +|-..+..+||+.+-++..-... .....+.. .+...-+-+|.+..-+. T Consensus 124 ~~~~~e~~~~~~~~G~~~~~~~~d~----~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~ 199 (474) T protein:vir:94 124 DNKLIDILTATSNKGIDWLQVYINE----NGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTY 199 (474) T ss_pred HHHHHHHHHHHhhcCceEEEEEecC----CCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEE Confidence 9999999999999999888665543 46678999999998877421000 00001111 11111112244332111 Q ss_pred --cccccccccccccc--------cccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhc Q lcl|NC_019530. 219 --QFTGQSGTSQNFQP--------SQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTEDATVIYAITR 287 (560) Q Consensus 219 --~~~g~~~~~~~~~~--------~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~EDalVIyRitR 287 (560) ..++.......... +...-++| +++.- ++....|=++..+.....+. ++-+....-+.++ T Consensus 200 y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP---vv~~~------nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~ 270 (474) T protein:vir:94 200 YVLENGGLIPDYYYGANHVQSHFSNGNWGRVP---FIAFK------NNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESV 270 (474) T ss_pred EEEcCCccccccccCcCcccccccccCCCccc---eEEec------CCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 01111000000000 00000111 12110 12233455666555555544 3344444445566 Q ss_pred CccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch Q lcl|NC_019530. 288 APEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE 367 (560) Q Consensus 288 APeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge 367 (560) .|-+-+.-.+.- +...++..+..+++-.=+++.+ ++.|-.-.+.+. T Consensus 271 ~~~lv~~g~~~~--------------------------------~~~~~~~~~~~~~~i~~~~~~~--~~~l~~~~~~~~ 316 (474) T protein:vir:94 271 ELIYILKGYEGE--------------------------------DLEEFMRGLKYYKAINVDGDGG--VETIQVEVPVSS 316 (474) T ss_pred CceeeeecCCcc--------------------------------cchhhhhhhhccceeeccCCCc--eeEEeecCCHHH Confidence 665443321110 0011111122222222223323 444444344443 Q ss_pred HH-HHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHH Q lcl|NC_019530. 368 MD-HVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDW 446 (560) Q Consensus 368 i~-DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew 446 (560) .. -+.-+.+.+|+...+|- +..+ ++. |...+..|..-+.....-+.+.+..|...+..+++.-+-+-|+ ..+| T Consensus 317 ~~~~~~~l~~~I~~~s~~p~--~~~~-~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~--~~d~ 390 (474) T protein:vir:94 317 TKEYIDLMRVYIMEFGQGVD--FQTD-KFG-SAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNL--KTDV 390 (474) T ss_pred HHHHHHHHHHHHHHHhCccc--cCcc-ccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Cccc Confidence 33 45667777888899983 2221 111 2222334544444444556667777777666666654444454 2344 Q ss_pred HHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcC---- Q lcl|NC_019530. 447 NEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKG---- 522 (560) Q Consensus 447 ~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~---- 522 (560) . .|.+.|...---.+. +-++++.++ | .+|.+++++. |...++ .+++.++|++|..+- T Consensus 391 ~-----~i~v~f~~~~p~~~~-------e~a~~~~~~----g-~iS~et~l~~-l~~v~D-~~~E~eri~~E~~~~~~~~ 451 (474) T protein:vir:94 391 K-----DIEISFNFNRMMNDA-------EQSQIIAQS----Q-YLSRETLVKS-SPLVDD-YKAELERIEQEQMEYNKQL 451 (474) T ss_pred c-----eeeEEeccCcccCHH-------HHHHHHHHc----C-CCCHHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHhhc Confidence 3 366777433221222 222334442 3 5899999977 454332 234444555554221 Q ss_pred -CCCCCCccccccccccccCCCCCCcc Q lcl|NC_019530. 523 -LYPKVQADETGNYSGSDVSPLKFKPE 548 (560) Q Consensus 523 -~~~~p~~~e~g~~~~~~~~~~~~~p~ 548 (560) -+.+..++ .+..+-.+..=.-| T Consensus 452 ~~~~~~~~~----~~~~~~~~~~~~~e 474 (474) T protein:vir:94 452 PNLDDGGAD----GAQQQEGSNNKESE 474 (474) T ss_pred cccCCCCCC----CcccCCCCcccccC Confidence 11111111 00001111111111 No 159 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=93.40 E-value=0.0077 Score=32.18 Aligned_cols=425 Identities=10% Similarity=0.059 Sum_probs=163.5 Q ss_pred hhhhhhhhhhhhhhcccccch-hhhccccccccCCCCCCCCceeEeccccceeccccccch-hhHHHHHHHHHHHhhccc Q lcl|NC_019530. 11 LFGGGGLVNAKVDTDKLAQKQ-DELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESEL-LSVKTVVEEYQSMAQQPE 88 (560) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~-~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~-~~~~~LI~~YR~ma~~PE 88 (560) .|+.+. ...++.-..+ -|++..+ ...+.... ..+ ..-..-+.+|+.+..+.+ T Consensus 1 ~~~~~~-----~~~~~~~~~~~~~~~~~~-------~~~~~~~i--------------~~~i~~~~~~~~~~~~~~~YY~ 54 (474) T protein:vir:97 1 MFNIIR-----MPWDKPYGEEVVEQLKPQ-------FETQEEMI--------------VRLIDDHRKQLDKITVGQRYYD 54 (474) T ss_pred Cccccc-----ccCCCchhhHHHHhhhhc-------ccCHHHHH--------------HHHHHHHHHHHHHHHHHHHHhc Confidence 111000 0011100000 0000000 00000000 000 000011122333333222 Q ss_pred hhHH---------------------------HHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCC Q lcl|NC_019530. 89 IRKA---------------------------VDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDF 141 (560) Q Consensus 89 vd~A---------------------------I~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f 141 (560) -+.. ..-||+ ..+.---..||++..++.+ ..+..+.+++ .+| T Consensus 55 g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd-~~~~~l~g~p~~~~~~d~~---------~~~~l~~~~~-n~~ 123 (474) T protein:vir:97 55 KDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVD-QKVSYVASKPVTYSCEDEN---------VLKVIHDVLD-TRW 123 (474) T ss_pred cccchhcccchhccccccccccCcceeecchHHHHHH-HHHhhhhcCCceeccCcHH---------HHHHHHHHHh-ccH Confidence 2111 111221 1122223455565544422 2222333433 578 Q ss_pred chhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhccc-CCcccccc--ccccchheeecCccccc Q lcl|NC_019530. 142 DNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKA-MREGGIEA--IESVTLKYYYNPNYNRN 218 (560) Q Consensus 142 ~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~-~~~g~~~~--~~~~~e~~~Y~p~~~~~ 218 (560) +....++.+.+++.|+-|.+.-+|. +|-..+..+||+.+-++..-... .....+.. .+...-+-+|.+..-+. T Consensus 124 ~~~~~e~~~~~~~~G~~~~~~~~d~----~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~ 199 (474) T protein:vir:97 124 DNKLIDILTATSNKGIDWLQVYINE----NGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTY 199 (474) T ss_pred HHHHHHHHHHHhhcCceEEEEEecC----CCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEE Confidence 9999999999999999888665543 46678999999998877421000 00001111 11111112244332111 Q ss_pred --cccccccccccccc--------cccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhc Q lcl|NC_019530. 219 --QFTGQSGTSQNFQP--------SQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTEDATVIYAITR 287 (560) Q Consensus 219 --~~~g~~~~~~~~~~--------~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~EDalVIyRitR 287 (560) ..++.......... +...-++| +++.- ++....|=++..+.....+. ++-+....-+.++ T Consensus 200 y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP---vv~~~------nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~ 270 (474) T protein:vir:97 200 YVLENGGLIPDYYYGANHVQSHFSNGNWGRVP---FIAFK------NNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESV 270 (474) T ss_pred EEEcCCccccccccCcCcccccccccCCCccc---eEEec------CCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 01111000000000 00000111 12110 12233455666555555544 3344444445566 Q ss_pred CccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch Q lcl|NC_019530. 288 APEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE 367 (560) Q Consensus 288 APeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge 367 (560) .|-+-+.-.+.- +...++..+..+++-.=+++.+ ++.|-.-.+.+. T Consensus 271 ~~~lv~~g~~~~--------------------------------~~~~~~~~~~~~~~i~~~~~~~--~~~l~~~~~~~~ 316 (474) T protein:vir:97 271 ELIYILKGYEGE--------------------------------DLEEFMRGLKYYKAINVDGDGG--VETIQVEVPVSS 316 (474) T ss_pred CceeeeecCCcc--------------------------------cchhhhhhhhccceeeccCCCc--eeEEeecCCHHH Confidence 665443321110 0011111122222222223323 444444344443 Q ss_pred HH-HHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHH Q lcl|NC_019530. 368 MD-HVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDW 446 (560) Q Consensus 368 i~-DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew 446 (560) .. -+.-+.+.+|+...+|- +..+ ++. |...+..|..-+.....-+.+.+..|...+..+++.-+-+-|+ ..+| T Consensus 317 ~~~~~~~l~~~I~~~s~~p~--~~~~-~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~--~~d~ 390 (474) T protein:vir:97 317 TKEYIDLMRVYIMEFGQGVD--FQTD-KFG-SAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNL--KTDV 390 (474) T ss_pred HHHHHHHHHHHHHHHhCccc--cCcc-ccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Cccc Confidence 33 45667777888899983 2221 111 2222334544444444556667777777666666654444454 2344 Q ss_pred HHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcC---- Q lcl|NC_019530. 447 NEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKG---- 522 (560) Q Consensus 447 ~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~---- 522 (560) . .|.+.|...---.+. +-++++.++ | .+|.+++++. |...++ .+++.++|++|..+- T Consensus 391 ~-----~i~v~f~~~~p~~~~-------e~a~~~~~~----g-~iS~et~l~~-l~~v~D-~~~E~eri~~E~~~~~~~~ 451 (474) T protein:vir:97 391 K-----DIEISFNFNRMMNDA-------EQSQIIAQS----Q-YLSRETLVKS-SPLVDD-YKAELERIEQEQMEYNKQL 451 (474) T ss_pred c-----eeeEEeccCcccCHH-------HHHHHHHHc----C-CCCHHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHhhc Confidence 3 366777433221222 222334442 3 5899999977 454332 234444555554221 Q ss_pred -CCCCCCccccccccccccCCCCCCcc Q lcl|NC_019530. 523 -LYPKVQADETGNYSGSDVSPLKFKPE 548 (560) Q Consensus 523 -~~~~p~~~e~g~~~~~~~~~~~~~p~ 548 (560) -+.+..++ .+..+-.+..=.-| T Consensus 452 ~~~~~~~~~----~~~~~~~~~~~~~e 474 (474) T protein:vir:97 452 PNLDDGGAD----GAQQQEGSNNKESE 474 (474) T ss_pred cccCCCCCC----CcccCCCCcccccC Confidence 11111111 00001111111111 No 160 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=93.20 E-value=0.0084 Score=31.96 Aligned_cols=404 Identities=13% Similarity=0.096 Sum_probs=174.6 Q ss_pred hhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccc Q lcl|NC_019530. 9 FGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPE 88 (560) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PE 88 (560) .+++...++- + +..| .+ +....-.++............ +. +..+++|- T Consensus 1 m~~~~~~~~~--------------~----~~~s--~~----~~w~~~~~~~~~~~~~~g~~v-t~-------~~al~~~~ 48 (421) T protein:vir:10 1 MFIPQMFEGK--------------K----RSVS--GG----GFWEAMLGGVRSSHSKAGVMI-TP-------ETALALSA 48 (421) T ss_pred CCCcchhccc--------------c----cccC--cc----hhhHHHhhhhccCcccCCcee-ch-------HHhhccHH Confidence 1111111111 1 1111 00 010000011111111000000 11 13468899 Q ss_pred hhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHH-HHHHHHHHH-hCCchhhHH----HHHHhhhhhhhhhhh Q lcl|NC_019530. 89 IRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESIT-ECFKEVMHL-MDFDNTAYQ----KIRKWYVDGRQAYHV 162 (560) Q Consensus 89 vd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~-eeF~~il~l-l~f~~~~~~----~~R~wYvDGr~~~hk 162 (560) |..||.-|-+.+.-+ |+.|--.+.++ .++.+. ...-++|+. -+-...+++ ++..++++|.-|..+ T Consensus 49 v~~~i~~Ia~~iA~l-----p~~~~~~~~~g----~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i 119 (421) T protein:vir:10 49 VRACVTLLAESVAQL-----PVELYRRDKNG----GRQRATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSII 119 (421) T ss_pred HHHHHHHHHHhhccC-----ceEEEEEcCCC----ceeecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEE Confidence 999997777765432 33221111111 011111 112222221 222333444 445567889888877 Q ss_pred eecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchh Q lcl|NC_019530. 163 IVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDE 242 (560) Q Consensus 163 iiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~d 242 (560) +.|.. .-+.+|.+|+|-+|..++ +.+| ..||.+.. .| -.++.+ T Consensus 120 ~r~~~---G~~~~L~~l~~~~v~v~~-----~~~g--------~~~y~~~~-------~g--------------~~~~~~ 162 (421) T protein:vir:10 120 DRDGK---GYPKELIPINPKKVIVLK-----GPDG--------MPYYEIPE-------IG--------------ETLPMR 162 (421) T ss_pred EEcCC---CcEEEEEEecCceEEEEE-----CCCc--------eEEEEEcC-------CC--------------cEEchh Confidence 66542 347899999998887642 2122 11222210 11 135667 Q ss_pred ceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhhee Q lcl|NC_019530. 243 SIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNA 322 (560) Q Consensus 243 aI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~v 322 (560) .|.|.+. ...++-+..|.|+.|.+++.....+++...=+=---+--+-+...+- +++..+.++-...+..+++++.- T Consensus 163 eiih~~~--~~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-~~~~~~~~e~~~~~~~~~~~~~~ 239 (421) T protein:vir:10 163 MMHHVKV--FSLDGYIGSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPK-EAPAIKSQEKIDQLLAKWTDRYS 239 (421) T ss_pred hEEEecC--cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecC-ccCccCCHHHHHHHHHHHHHHhc Confidence 7777763 34454456788999999988887777665433333344455666663 33333333444444555555421 Q ss_pred ecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCccccCCCCcccccc Q lcl|NC_019530. 323 YDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE---MDHVNYFREKLYDALMIPKSRLQEEGSINIGG 399 (560) Q Consensus 323 Yd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~ 399 (560) | ..+..+.+ .++ .|++++.|- .+.-+ ++-.++..+.+.++.+||..-|+..+..+ . T Consensus 240 -----g-~~n~~~~~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~ 298 (421) T protein:vir:10 240 -----G-INNMFSVA-LLQ----------EGMSYKQMS--QDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKAT--N 298 (421) T ss_pred -----C-ccccCcce-ecC----------CCceEEecC--CChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCc--c Confidence 2 11111221 121 356666664 33333 33445788889999999999996544322 2 Q ss_pred cchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHH Q lcl|NC_019530. 400 SNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLAS 479 (560) Q Consensus 400 ~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~ 479 (560) ++.++..+. |.++ .|+--+ ..+.+.|-+ ++++++++. ...++|..+.. ... -+..|++. T Consensus 299 sn~e~~~~~---f~~~--tl~P~~-~~ie~~ln~-----kL~~~~~~~-----~~~v~fd~~~l----~~~-d~~~~~~~ 357 (421) T protein:vir:10 299 NNIEHQGLQ---FVMY--TLLAWL-KRHEGALQR-----DLLLPSERR-----DLYIEFNVSGL----LRG-DQKSRYES 357 (421) T ss_pred ccHHHHHHH---HHHH--HHHHHH-HHHHHHHhh-----hccCccccC-----CeEEEEechhh----hcc-CHHHHHHH Confidence 344433333 5443 333211 122233333 445665542 23455544432 111 12344544 Q ss_pred HHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccc---ccccc-ccccCCCCCCcccccccCC Q lcl|NC_019530. 480 LNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADE---TGNYS-GSDVSPLKFKPETIPFSGS 555 (560) Q Consensus 480 l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e---~g~~~-~~~~~~~~~~p~~~~~~~~ 555 (560) ++.+-. .-+++.+-+++ +|.+.+ -|..++ +++.. .++..+..-.|. +-.+. T Consensus 358 ~~~~~~--~G~~T~NE~R~-~~gl~p--------------------~~ggD~~~~~~n~~~~~~~~~~~~~~~--~~~~~ 412 (421) T protein:vir:10 358 YALGRQ--WGWLSVNDIRR-MENLPP--------------------IAGGDKYLTPLNMVDSAQIIPGDKKPT--AQQMA 412 (421) T ss_pred HHHHHh--CCCcCHHHHHH-HhCCCC--------------------CCCcceeeeccccccccccccCCCCcc--cccCc Confidence 444321 13566666663 344422 122111 11111 011111111111 11222 Q ss_pred cccCC Q lcl|NC_019530. 556 TDDSI 560 (560) Q Consensus 556 ~~~~~ 560 (560) .+|-+ T Consensus 413 e~d~~ 417 (421) T protein:vir:10 413 EIDTI 417 (421) T ss_pred ccccc Confidence 33333 No 161 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=93.19 E-value=0.0085 Score=31.95 Aligned_cols=390 Identities=9% Similarity=0.043 Sum_probs=194.4 Q ss_pred eeEecccc---ceeccccccchh-----hHHHHHHHHHHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHH Q lcl|NC_019530. 52 IILQGGAN---TYNYVGVESELL-----SVKTVVEEYQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDT 123 (560) Q Consensus 52 ~~i~~~~~---~~~y~~~e~~~~-----~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~s 123 (560) ......-+ ..+|.+ ...+. -..++-..||... +=+.-||+..++ .++.+-. +.-+ + T Consensus 1 l~~~~~r~~~~~~yY~g-~~~~~~~~~~~p~~~~~~~~~v~--nw~~~~Vds~a~-rl~~~Gf--------~~~d---~- 64 (410) T protein:vir:95 1 MNLYQSRVNLRYKHYAM-QHYEAPTGITIPAHIRAKYQAVL--GWAAKGVDSLAD-RLIFRAF--------ANDD---F- 64 (410) T ss_pred CCcchhhHHHHHHHhcC-CCCccccchhccHHHHhHHHhhc--chhHHHHHHhHh-hhccccc--------cCCC---c- Confidence 11111111 111211 11111 1122322344322 334445544433 1222211 1111 1 Q ss_pred HHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCcccccccc Q lcl|NC_019530. 124 VKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIE 203 (560) Q Consensus 124 ik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~ 203 (560) .+..|....+|+.+..+.++.=++.||-|. +|-+. .+|-..++.++|+.+-.+..=.+....-.+.+.. T Consensus 65 -------~l~~i~~~N~ld~~~~~~~~~al~~G~sf~--~v~~~--~d~~~~i~~~sP~~~~~i~Dp~~~~~~~al~~~~ 133 (410) T protein:vir:95 65 -------NVTEIFDRNNPDIFFDSAILSALIGSCSFV--YISKG--EDDEVRLQVIESSNATGVIDPITGLLVEGYAVLA 133 (410) T ss_pred -------hHHHHHhhcChHHHHHHHHHHHHHhCceeE--EEecC--CCCceEEEEEcccceEEEEeCCCCceEEEEEEEE Confidence 245667778899999999999999999665 33332 2344678889888877653211000000011100 Q ss_pred -----ccchheeecCccccccccccccccccccccccceecchhceeeeecccccCCCCcchhhh-HHHHHHHHH-HHHH Q lcl|NC_019530. 204 -----SVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLANGIVPGLL-NPAIRPLNN-LVTT 276 (560) Q Consensus 204 -----~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL-~~AiK~~Nq-L~m~ 276 (560) ......+|.|..... +..... ...+ +..-+ .+| .|.|+|-.=++.+ ..-|-+ +..+...+- -+.| T Consensus 134 ~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~-~~~~g-~vP--vV~f~n~~~l~~~--~G~s~I~~~v~~l~da~~r~~ 205 (410) T protein:vir:95 134 RDDYNRPTLEAYFEPNATHF-IPKDGE-PYSV-TNETG-IPL--LVPVIHRPDAVRP--FGRSRITRAGMYYQKYAKRTL 205 (410) T ss_pred ecCCCeEEEEEEEeCCcEEE-EeeCCc-cccc-cCCCC-Ccc--eEEecccccCCcc--CCccccchhHHHHHHHHHHHH Confidence 011223444432221 111100 0001 11101 111 2455543222222 112222 222222222 2678 Q ss_pred HHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccce Q lcl|NC_019530. 277 EDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEI 356 (560) Q Consensus 277 EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEI 356 (560) .++++.=...=.|.|-++=+|.-.-|..+-. +..|. =..+|.-++|-+.+| T Consensus 206 ~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~-----------------~~~~~------------i~~~~~~~~~~~~~v 256 (410) T protein:vir:95 206 ERADITAEFYSWPQKYILGLDPDAEPMEKWK-----------------ATVSS------------LLTISSSDKGVKPSV 256 (410) T ss_pred HHHHHHHHHhcchhheeeccCCCCCcCchhh-----------------hhhhh------------heeccCCCCCCcceE Confidence 8888999999999998876654221221111 11111 145677777777889 Q ss_pred eecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 357 ATVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQL 435 (560) Q Consensus 357 sTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qL 435 (560) ..++++ +|+. ++-++=.-..+....++|..-|+..+. |- +.+..|.-.|....+-+.+.|+.|..-+.++++.-+ T Consensus 257 ~q~~~~-~l~~~~~~l~~l~~~~a~~s~lP~~~lg~~~~-Np--sSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~ 332 (410) T protein:vir:95 257 GQFTTA-SMSPFTEQLRTAAAGFAGEMGLTLDDLGFVSD-NP--SSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAA 332 (410) T ss_pred EecCCC-ChHHHHHHHHHHHHHHhhhcCCCHHHhccccC-ch--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 889884 4443 344555556666777999888876543 11 223457777888999999999999999999999988 Q ss_pred HhccCCC--hhHHHHhhhhhceeeee--ecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHH Q lcl|NC_019530. 436 ILKGVTD--EKDWNEKIKPFIKFEFT--SDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQ 511 (560) Q Consensus 436 iLkgiit--~eew~~~i~~~I~~~f~--~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~ 511 (560) .+.+-.. +.+|.. +.+.|. .| .++-.+.++.+.+..+..-+=.+.+.++++ +.|++||++|... T Consensus 333 ~i~~~~~~~~~~~~~-----~~v~W~p~~d------~~~~s~a~~aDa~~Kl~~a~~g~~~~~~~~-~~lg~~~~~~~~~ 400 (410) T protein:vir:95 333 CLRDEFRYTRSQFVR-----TAVKWEPLFE------ADANTMTMIGDGVVKLNQALPGYINAETIR-DLTGIAGDMSAKP 400 (410) T ss_pred HHhcCCCCcccccce-----eeEEeeecCC------cchhhHHHHHHHHHHHHHhccCCccHHHHH-HhcCCChHHHHHH Confidence 7765433 445532 333342 23 122334556665444433211356667777 5599999987643 Q ss_pred HHHHHHhhhcCC Q lcl|NC_019530. 512 QTKIAEEKKKGL 523 (560) Q Consensus 512 ~kqIe~E~~~~~ 523 (560) . +++....|- T Consensus 401 ~--~~e~~~~g~ 410 (410) T protein:vir:95 401 V--VSEGGSNGE 410 (410) T ss_pred H--HHHHHhCCC Confidence 2 222223332 No 162 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=92.91 E-value=0.0095 Score=31.67 Aligned_cols=436 Identities=10% Similarity=-0.000 Sum_probs=172.7 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCC---C-------C----ceeEeccccceeccccccchhhHHHHHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQ---D-------G----SIILQGGANTYNYVGVESELLSVKTVVEEY 80 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~---d-------G----~~~i~~~~~~~~y~~~e~~~~~~~~LI~~Y 80 (560) |+.++.....-+ ...+...+..+...+. + + ....-+|+... .....+..-+ = T Consensus 1 M~~~~~l~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~-~~~~~g~~v~-------~ 66 (466) T protein:vir:81 1 MRLIDRLLSTRG------AAPRMSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTE-LAPDTFVGLA-------T 66 (466) T ss_pred CchhHHHhhccC------cccccchhhhhhhhhhhhccccccccccccHHHHHhhcccccc-ccCccccccc-------h Confidence 333332222211 0111111100100000 0 0 00000111100 0000000001 1 Q ss_pred HHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHH-HHHHHHHhCCchhhHHH----HHHhhhh Q lcl|NC_019530. 81 QSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITEC-FKEVMHLMDFDNTAYQK----IRKWYVD 155 (560) Q Consensus 81 R~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~ee-F~~il~ll~f~~~~~~~----~R~wYvD 155 (560) +..+.+|-|..||.-|.+.+.- -|+.|.=+... -++.+.+. ...+++-=+-...++++ +..+... T Consensus 67 ~~a~~~~~v~~~i~~Ia~~ia~-----lp~~~~~~~~~-----~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~ 136 (466) T protein:vir:81 67 QAYQANGPVFACMLVRQLVFSS-----VRFRWQRLRDG-----KPSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLA 136 (466) T ss_pred hhhhccHHHHHHHHHHHHhhcc-----CceEEEEecCC-----ceeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhc Confidence 1345689999999766665432 23332211111 01111111 11122211223344444 4445566 Q ss_pred hhhhhhheecCC-----CCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccc Q lcl|NC_019530. 156 GRQAYHVIVDPT-----NKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNF 230 (560) Q Consensus 156 Gr~~~hkiiD~~-----~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~ 230 (560) |.=|..++.+.. ....-+.+|++|+|-++..++ .. ++.. .-.|.|.. ++. T Consensus 137 Gnay~~i~r~~~g~l~~~~~g~~~~l~~l~~~~v~~~~---~~--~~~~------~~~y~~~~-------~~~------- 191 (466) T protein:vir:81 137 GNSYWTIVDGEFVRMRPDWVDVVVEERMVRGGRGELGG---GQ--LGWR------KVGYLYTE-------GGR------- 191 (466) T ss_pred CCeEEEEEecCccccccccCcceeEEEEecCcceEEEE---cC--CCce------EEEEEEEe-------cCc------- Confidence 877776665432 112236899999998776643 11 1210 11123321 111 Q ss_pred cccccceecchhceeeeecccc-cCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHH Q lcl|NC_019530. 231 QPSQQELVFDDESIVYIDSGEE-PLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEY 309 (560) Q Consensus 231 ~~~~~~~~i~~daI~y~hsGl~-~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqY 309 (560) ........++.+.|.|.. ++- +.+.-+.+|.+..|++++.....+++...=+=---+--.-|+..+ +.|.+..+++. T Consensus 192 ~~~~~~~~~~~~dviHir-~~~~~~d~~~G~s~i~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~ 269 (466) T protein:vir:81 192 QSGNESVGFLAEDVVHFA-PIPDPLASYRGMSWLTPILREIRADQAMSKHQAKFFDNGATVNLVIKHN-PMADPAAVKKW 269 (466) T ss_pred ccccceeeeccccEEEEc-CCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecC-CCCCHHHHHHH Confidence 111234567888887775 121 234445678999999998888877766543333334445566665 45666666555 Q ss_pred HHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCc Q lcl|NC_019530. 310 MTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE---MDHVNYFREKLYDALMIPK 386 (560) Q Consensus 310 l~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~ 386 (560) .+.+...|+-- ...|. .+ .++ -|.+++.|.- +..+ ++-.++..+.+.++.+||. T Consensus 270 ~~~~~~~~~g~----~n~g~------~~-vl~----------~g~~~~~l~~--~~~d~q~le~~~~~~~~Ia~~fgVPp 326 (466) T protein:vir:81 270 ADEVNSKHAGV----DNAWK------NL-NLY----------PGADADVVGS--NLQEIDFKNVRGGGETRIAAAAGVPP 326 (466) T ss_pred HHHHHHHhcCc----ccccc------ce-EcC----------CCceEEEccC--ChhHHHHHHHHHHHHHHHHHHhCCCH Confidence 55544444321 11122 11 222 2456666643 3333 4445688999999999999 Q ss_pred cccCCCCc-ccccccchhhhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchH Q lcl|NC_019530. 387 SRLQEEGS-INIGGSNLAEITQEELRFSKFC-AGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYI 464 (560) Q Consensus 387 sRl~~~~~-~~l~~~~~~eItRDElkF~Kfi-~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f 464 (560) ..|+-..+ .....+++++..+. |.+++ .-+-.++ .+.|-+.| +++.+. . .+.++|..+.. T Consensus 327 ~~lG~~~~~~~st~sn~eq~~~~---f~~~tl~P~~~~i----e~~l~~~L-----~~~~~~---~--~~~~~f~~~~l- 388 (466) T protein:vir:81 327 VIVGLSEGLAAATYSNYGQARRR---LADGTAHPLWQNL----SGCIGHVM-----PDMGPD---V--RLWYDADDVPF- 388 (466) T ss_pred HHcccccCCCccccccHHHHHHH---HHHHHHHHHHHHH----HHHHHhhc-----CCcccC---c--ceEEEecchhh- Confidence 99964322 12233444444444 65554 3333333 33343333 333321 1 23455543322 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCC Q lcl|NC_019530. 465 REQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLK 544 (560) Q Consensus 465 ~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~ 544 (560) ... -++.|.++......++ ..+.... +|..|+-.. ...-...+++. ......+..+......+ T Consensus 389 ---lr~-d~~~r~~~~~~~~~~~------~~~~~~g--~t~nE~r~~----~~~gd~~~~~~-~~~~~~~~~~~~~~~~~ 451 (466) T protein:vir:81 389 ---LRE-DEKDAADIQKVRAETI------NTLITAG--YEPESVVAA----VNSGDLRLLKH-TGLTSVQLLPPGVSASA 451 (466) T ss_pred ---hcc-CHHHHHHHHHHHHHHH------HHHHHcC--CChhhcccc----ccCCccccccC-CCcchhhhccccccccc Confidence 111 1234444322211111 1222222 233333311 11000001110 00000000000000011 Q ss_pred CCcccccccCCcccC Q lcl|NC_019530. 545 FKPETIPFSGSTDDS 559 (560) Q Consensus 545 ~~p~~~~~~~~~~~~ 559 (560) -.|+...--|..++. T Consensus 452 ~~~~~~~~Gg~~ngn 466 (466) T protein:vir:81 452 SSDTPTSGGADDNGN 466 (466) T ss_pred CCCCcccCCCCcCCC Confidence 111111111222222 No 163 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=92.85 E-value=0.0097 Score=31.61 Aligned_cols=416 Identities=13% Similarity=0.110 Sum_probs=171.1 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) +- ....--+..+++|.+.... + ..+ +..-..-+.+|+.+..+.+=...|. T Consensus 1 ~~----~~~~~~~~~~~~~~~~~~~--------------i---------~~~---i~~~~~~~~r~~~~~~yy~g~~~i~ 50 (453) T protein:vir:73 1 MN----LKPIKLMTYSRDEEITDKV--------------V---------NDF---MKKHQEEVERYEYLGNMYKGIMEIS 50 (453) T ss_pred Cc----cccceeeeccccccCCHHH--------------H---------HHH---HHHHHHHHHHHHHHHHHhccccchh Confidence 00 0000000000011000000 0 000 0000111234444443333322221 Q ss_pred H--------------------hhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhh Q lcl|NC_019530. 95 I--------------------IVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYV 154 (560) Q Consensus 95 e--------------------Ivneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYv 154 (560) . ||+ ..+.--...||++..++ +...+.+..+++-.+|+....++.+..++ T Consensus 51 ~~~~~~~~~~~~ki~~n~~~~ivd-~~~~~l~g~~~~~~~~d---------~~~~~~l~~~~~~n~~~~~~~~~~~~~~~ 120 (453) T protein:vir:73 51 SQKAKDSWKPDNRLTNNFAKYIVD-TFVGYFNGIPIKKTHDD---------KSVLEAMQLFDNLNDMEDEESELAKIACV 120 (453) T ss_pred cCCCCCccCccceeecchHHHHHH-HhhhhhcccCceeecCC---------hHHHHHHHHHHHhcChhHHHHHHHHHHHh Confidence 1 111 00111112333333222 23455677777778999999999999999 Q ss_pred hhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCC-cccccc---ccccchheeecCccccccccccccccccc Q lcl|NC_019530. 155 DGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEA---IESVTLKYYYNPNYNRNQFTGQSGTSQNF 230 (560) Q Consensus 155 DGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~---~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~ 230 (560) .|+-|.+.-.|. +|-..+..++|+.+-++..-..... ...+.. .++....-+|.+..-+ .+.+..+.-... T Consensus 121 ~G~~~~~v~~d~----~~~~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~-~~~~~~~~~~~~ 195 (453) T protein:vir:73 121 YGRAYELMYQNE----STESEVIYCSPLNVFMVYDDSIKQKPLFAVYYGFDEEGNLSGTVYTLLETI-SITGKAGEVKFG 195 (453) T ss_pred cCeEEEEEEeCC----CCceEEEEEcccceEEEEeCCCCceeEEEEEEEEecCceEEEEEEeCCeEE-EEEecCCceEEc Confidence 999988776654 4667788999999877642111100 000100 0111112234433211 111111100000 Q ss_pred cc-cccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHH Q lcl|NC_019530. 231 QP-SQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEE 308 (560) Q Consensus 231 ~~-~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeq 308 (560) .+ +..--++|. |.|++ +....|-++..+.....+. ++-+....-+.++.|-+-+.-. +++. + T Consensus 196 ~~~~~~~g~vPv--v~~~n-------~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~---~~~~----~ 259 (453) T protein:vir:73 196 ESTYNVYSDLPI--VEYNF-------NEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGA---EVDE----E 259 (453) T ss_pred cceeccCCceeE--EEecC-------CCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecC---CCCc----h Confidence 00 000001221 12222 1222344444444443332 2344444556667777765422 1111 1 Q ss_pred HHHHHHHhhh-hheeecccCCcccCcccchhhHhhhcccccCC--CCccceeecCCCCCcchH-HHHHHHHHHHHHhcCC Q lcl|NC_019530. 309 YMTMMMGKFK-NRNAYDRTTGKITGNAHLMGIAEDYWLPRREG--QNATEIATVGGGNQLGEM-DHVNYFREKLYDALMI 384 (560) Q Consensus 309 Yl~~im~k~k-nk~vYd~~TGev~d~~~~msmlEDywLpRReG--grgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~V 384 (560) - +...+ ++.. -+.+.. +...| +.+.++.+|-...+.+.+ ..+.-+++.+|...++ T Consensus 260 ~----~~~~~~~~~~----------------~~~~~~-~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~ 318 (453) T protein:vir:73 260 D----AKNIKDNRLI----------------NFFDKN-SNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMA 318 (453) T ss_pred h----hhcccccccc----------------cccccc-cccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCC Confidence 1 11111 1110 000000 11111 122346665443444433 3356678888999999 Q ss_pred CccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-CChhHHHHhhhhhceeeeeecch Q lcl|NC_019530. 385 PKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGV-TDEKDWNEKIKPFIKFEFTSDSY 463 (560) Q Consensus 385 P~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgi-it~eew~~~i~~~I~~~f~~Dn~ 463 (560) |- +..+. +|...+..|.--+.....-+.+.++.|...+.+.++.-+-+.+. -.+.+|. .|.+.|...-- T Consensus 319 p~--~~~~~---~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~-----~i~v~f~~~~p 388 (453) T protein:vir:73 319 AN--ISDEN---FGNSSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNASNKDAWK-----DIEYTFTRNEP 388 (453) T ss_pred cc--cCccc---ccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccc-----cceEEeCCCCC Confidence 94 33332 23333445555555556667777777777777776654433322 1233442 36677744322 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCH---HHHHHHHHHHHHhhhc---CCCCCCCccccccc Q lcl|NC_019530. 464 IREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSD---EEVKEQQTKIAEEKKK---GLYPKVQADETGNY 535 (560) Q Consensus 464 f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tD---eEI~e~~kqIe~E~~~---~~~~~p~~~e~g~~ 535 (560) -.+ .+.+++++.+. | .+|.+++++. |..++ +||+.++++-+++.+. +...+|. ++.|+- T Consensus 389 ~~~-------~~~a~~~~k~~---g-iis~et~~~~-~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~-~~~~~~ 453 (453) T protein:vir:73 389 KDI-------KEQAETANILK---G-ITSEETALSV-ISVIPDVQAEMEKIKKKKLLQLSLTRTSNLVRMK-QMRGNL 453 (453) T ss_pred CCH-------HHHHHHHHHHh---c-cCcHHHHHHh-CCCCCCHHHHHHHHHHHHHHHHHHHHhccCCcch-hhhcCC Confidence 112 23344555553 5 5999999976 55543 4555443332222211 1111111 122222 No 164 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=92.57 E-value=0.011 Score=31.35 Aligned_cols=408 Identities=12% Similarity=0.082 Sum_probs=199.1 Q ss_pred cccchhhhccccccccCCCCCCCCceeEeccccceeccccccchh-----hHHHHHHHHHHHhhccchhHHHHHhhccee Q lcl|NC_019530. 27 LAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELL-----SVKTVVEEYQSMAQQPEIRKAVDIIVNDVV 101 (560) Q Consensus 27 ~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~-----~~~~LI~~YR~ma~~PEvd~AI~eIvneai 101 (560) +..+--+++......-.+-. --...+|.+ ...+. -..++-..||... +=+.-||+.+++ .+ T Consensus 1 m~~~~i~~L~~~~~~~~~r~----------~~~~~yy~g-~~~~~~~~~~~p~~~~~~~~~v~--nw~~~~Vd~~a~-rl 66 (422) T protein:vir:97 1 MNYMGMGYLRRKLALFKTGV----------DKRYRYYAM-DDRDDTRSIVMPNNVREMYRSVL--EWTAKGVDSLAD-RI 66 (422) T ss_pred CChHHHHHHHHHHHHHHHHH----------HHHHHHHhc-CCChhhcCccccHHHHHHHHhhc--chhHHHHHHHHh-cc Confidence 21122222222111110000 000111111 11110 1122222222212 233445544333 11 Q ss_pred eecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCc Q lcl|NC_019530. 102 TCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDS 181 (560) Q Consensus 102 v~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDP 181 (560) +.+- ++..+ .+...+....+|+....+.++.=++.||=|. +|-++ +..|-..++.++| T Consensus 67 ~~~G--------f~~~d-----------~~l~~~w~~N~ld~~~~~~~~~al~~G~sf~--~v~~~-~~~~~p~i~~~sp 124 (422) T protein:vir:97 67 IFRE--------FTNDD-----------FNAWEIFKANNPDIFFDTAIQSALIASCCFV--YIMPG-AEDGLPKMQVIEA 124 (422) T ss_pred ccce--------eeCCc-----------hhHHHHHHhcChHHHHHHHHHHHHHhcceeE--EEeeC-CCCCeeEEEEech Confidence 1110 11111 1244567778899999999999999999777 44232 2346678999999 Q ss_pred cccchhhhhcccCCcccccccc----ccchheeecCccccccccccccccccccccccceecchhceeeeecccccCCCC Q lcl|NC_019530. 182 RCIRPVYIVEKAMREGGIEAIE----SVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLANG 257 (560) Q Consensus 182 r~i~~vr~i~~~~~~g~~~~~~----~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~~ 257 (560) +.+-.++.=.+....-.+.+.. +......|-|....+.+... ...... +..-+ .+| .|.|++-.-+....| T Consensus 125 ~~~~~i~D~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~g-~vP--vv~~~n~~~~~~~~G 199 (422) T protein:vir:97 125 SKATGILDPTTFLLTEGYAILESDSNGNPTLEAYFTDKDIWYYPKK-GKPYNI-KNPTG-HPL--LVPIIHRPDAVRPFG 199 (422) T ss_pred hhEEEEEeCCCCcceeeEEEEEecCCCcEEEEEEEcCceEEEEcCC-Cccccc-cCCCC-Ccc--eEEecccCCCccccC Confidence 8887663211100000011110 00001112121111111111 000000 11101 122 344555322222222 Q ss_pred cchhhh-HHHHHHHHH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCccc Q lcl|NC_019530. 258 IVPGLL-NPAIRPLNN-LVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAH 335 (560) Q Consensus 258 ~i~syL-~~AiK~~Nq-L~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~ 335 (560) . |-+ +..+...+- -++|.++++.=...=.|.|-++=+|--.-|..+-.. T Consensus 200 ~--s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~--------------------------- 250 (422) T protein:vir:97 200 R--SRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYVLGMDPDAKPMEKWRA--------------------------- 250 (422) T ss_pred c--cccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcccCcccccCchhhh--------------------------- Confidence 1 222 222222221 256788889999999999988765532212111111 Q ss_pred chhhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHH Q lcl|NC_019530. 336 LMGIAEDYWLPRREGQNATEIATVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSK 414 (560) Q Consensus 336 ~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~K 414 (560) .|-.=..+|.-+.|-+.+|..+++. +|+- ++-++-.-..+...-++|.+-|+..+. | -+.+..|.-.|....+ T Consensus 251 --~~~~i~~~~~de~~~~~~v~q~~~~-~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~-N--psSa~Ai~a~~~~L~~ 324 (422) T protein:vir:97 251 --TVSTLLEISKDEDGDKPTVGQFTTA-SMAPFMEHLKMYASLFAGGSGLTLDDLGFPSD-N--PSSVESIKAAHENLRA 324 (422) T ss_pred --hhhhhhccCCCCCCCcceeeecCCC-ChhHHHHHHHHHHHHHhcccCCCHHHhccccC-c--hhHHHHHHHHHHHHHH Confidence 1112245677777777889999884 4442 333444444455556999988876654 1 1224568888888999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCC--ChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhh Q lcl|NC_019530. 415 FCAGLRRRYSHFFMEFLRRQLILKGVT--DEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFS 492 (560) Q Consensus 415 fi~rLr~rfs~if~d~Lk~qLiLkgii--t~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S 492 (560) -+.+-|+.|..-+...++.-+.+.|-. .+++|.. +.+.|. .++= .++..+.+..+.+..+-.=+=.+.+ T Consensus 325 ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~~-----~~~~w~-p~~~---~~~~s~a~~aDa~~Kl~~a~~~~~~ 395 (422) T protein:vir:97 325 AGRKAQRSFSSGFLNVAYIAVCLRDEFPYLRNQFMD-----TVIKWE-PLFE---ADANMLTLVGDGAIKLNQAIPGFMD 395 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCcccchhhcc-----ceEEEc-cCCC---CChHHHHHHHHHHHHHHhhcccccc Confidence 999999999999999999988776532 2456632 455554 1110 2344455666665555442224667 Q ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHhhhcC Q lcl|NC_019530. 493 IDYVMRNVLRMSDEEVKEQQTKIAEEKKKG 522 (560) Q Consensus 493 ~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~ 522 (560) .+++++. |++|+.+++ ...++++.-+| T Consensus 396 ~~~~~~~-lg~~~~~~~--~~~~~~~~~d~ 422 (422) T protein:vir:97 396 ADVIRDL-TGVKGADKP--IPAITEVTTDG 422 (422) T ss_pred HHHHHHH-cCCCchhHH--HHHHHhhhccC Confidence 7888855 899876554 34566666566 No 165 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=92.55 E-value=0.011 Score=31.34 Aligned_cols=427 Identities=12% Similarity=0.036 Sum_probs=168.7 Q ss_pred cccchhhhccccccccCCCCCCCCceeEeccccceeccccccchh-----hHHHHHHHHHHHhhccchhHHHHHhhccee Q lcl|NC_019530. 27 LAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELL-----SVKTVVEEYQSMAQQPEIRKAVDIIVNDVV 101 (560) Q Consensus 27 ~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~-----~~~~LI~~YR~ma~~PEvd~AI~eIvneai 101 (560) +..-..++...+...-. .. .-... -....+|.+- ..+. ...++-+..+ ...++=+.-+|+..+. T Consensus 1 ~~~~t~~~~~~~l~~~~---~~-~~~r~--~~l~~Yy~g~-~~i~~~~~~~~~~~~~~~~-k~~~n~~~~ivd~~~~--- 69 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRI---DD-GMSRV--RLLARYSNGD-APLPELTRNTSAAWRSFQR-EARTNWGLMVRDSVAD--- 69 (456) T ss_pred CCCCCHHHHHHHHHHHH---HH-HHHHH--HHHHHHHhcC-CCchhcCcccChhhhhhhh-hhhcchHHHHHHHHHh--- Confidence 00000000000000000 00 00000 0000111110 0000 0011111000 0112222333322222 Q ss_pred eecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCc Q lcl|NC_019530. 102 TCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDS 181 (560) Q Consensus 102 v~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDP 181 (560) .-..+||++..++-. ...+.+..+.+..+|+....++++.-++.||-|.+.-.| .+|-..++.+|| T Consensus 70 --~l~~~~~~~~~~~d~--------~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d----~~g~~~i~~~~p 135 (456) T protein:vir:10 70 --RIIPNGITVGGSADS--------DLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR----DDGTATITADSP 135 (456) T ss_pred --hhccCCeecCCCCCc--------chHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeC----CCCceEEEEEcc Confidence 222345554433211 112345556667789999999999999999977643322 356678889999 Q ss_pred cccchhhhhcccCC-cccccccc---------------c----cchheeecCccccccccccccccccccccccceecch Q lcl|NC_019530. 182 RCIRPVYIVEKAMR-EGGIEAIE---------------S----VTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDD 241 (560) Q Consensus 182 r~i~~vr~i~~~~~-~g~~~~~~---------------~----~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~ 241 (560) +.+-.+..=....+ ...+.... + ++..+.|..... .......+...... ....-... T Consensus 136 ~~~~~i~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~--~~~~~~~~ 212 (456) T protein:vir:10 136 ETMVVSVDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRR-RLVTRISDSWVPVG--DAVVTGSP 212 (456) T ss_pred ceeEEEEcCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccc-eeeeecCCceeecc--ccCCCCCc Confidence 88765532110000 00000000 0 000011110000 00000000000000 00000000 Q ss_pred hceeeeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHH-HHHHHHHHhhhh Q lcl|NC_019530. 242 ESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTEDATVIYAITRAPEKRAFYLDVGTLGKKSAE-EYMTMMMGKFKN 319 (560) Q Consensus 242 daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAe-qYl~~im~k~kn 319 (560) =.|+|.+ +....|=++..+.....+. ++=|.++.-...--|.|-+.-.+.+. +...+. .=+ +.+..++ T Consensus 213 ~pvv~~~-------N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~-~~~d~~g~~~-~~~~~~~- 282 (456) T protein:vir:10 213 PPVVVYQ-------NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGL-PNVDENGNAI-DYASIFE- 282 (456) T ss_pred eeEEEec-------CCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccc-cccccccccc-chhhhhh- Confidence 1122222 1122344444443333322 11233333344444444433222111 110000 000 0000111 Q ss_pred heeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCccccc Q lcl|NC_019530. 320 RNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGSINIG 398 (560) Q Consensus 320 k~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~ 398 (560) +..|. .|. ...|..|..++. .+++. ++-++-.-..+...-++|..-|+..++ |. T Consensus 283 -----~~~~~-------------~~~----~~~~~~~~q~~~-~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~-N~- 337 (456) T protein:vir:10 283 -----AAPGA-------------LWE----LPPGVDIWESQA-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQ- 337 (456) T ss_pred -----hhccc-------------ccc----CCCCcceEEecc-cChhHHHHHHHHHHHHHHhccCCChHHhccccc-Ch- Confidence 11121 132 123455667775 33443 344677777788888999888865432 22 Q ss_pred ccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHH Q lcl|NC_019530. 399 GSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLA 478 (560) Q Consensus 399 ~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~ 478 (560) | +..|.--+..+-.-+.+.|+.|..-+.+.++.-+.+.|.. + + ..|.+.|.. +-..- +.+.++ T Consensus 338 -S-g~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~--~-~-----~~~~v~w~~-~~~~~------~~~~ad 400 (456) T protein:vir:10 338 -S-AEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES--V-E-----DTVDVSFES-PDRVT------LGEKYS 400 (456) T ss_pred -H-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--c-c-----cceeEEecC-CCCcC------HHHHHH Confidence 2 3556666667778888899999999999999888888853 2 1 125555633 21111 223455 Q ss_pred HHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHH-HHHHhhhcCCCCCCCccccccccccccCCC Q lcl|NC_019530. 479 SLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQT-KIAEEKKKGLYPKVQADETGNYSGSDVSPL 543 (560) Q Consensus 479 ~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~k-qIe~E~~~~~~~~p~~~e~g~~~~~~~~~~ 543 (560) ++..+..- -..|..++ ..+|++++++|++++. .+++|. ++.-.+|.+. |.+++.- T Consensus 401 a~~kl~~~--gi~~~~~~-~~~lg~~~~~i~~~e~er~~~e~-~~~~~~~~~~------~~~~~~~ 456 (456) T protein:vir:10 401 AASLAKAA--GESWASIR-RNILNYNADQIKQDDLDRAREQI-TLFAGNPVQR------PQEDGSR 456 (456) T ss_pred HHHHHHHc--CCChHHHH-HhhCCCCHHHHHHHHHHHHHHHH-HHHhhhhhhc------CCCCCCC Confidence 55554321 24665555 5789999999875433 333332 2211111111 1111111 No 166 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=92.55 E-value=0.011 Score=31.34 Aligned_cols=427 Identities=12% Similarity=0.036 Sum_probs=168.7 Q ss_pred cccchhhhccccccccCCCCCCCCceeEeccccceeccccccchh-----hHHHHHHHHHHHhhccchhHHHHHhhccee Q lcl|NC_019530. 27 LAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELL-----SVKTVVEEYQSMAQQPEIRKAVDIIVNDVV 101 (560) Q Consensus 27 ~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~-----~~~~LI~~YR~ma~~PEvd~AI~eIvneai 101 (560) +..-..++...+...-. .. .-... -....+|.+- ..+. ...++-+..+ ...++=+.-+|+..+. T Consensus 1 ~~~~t~~~~~~~l~~~~---~~-~~~r~--~~l~~Yy~g~-~~i~~~~~~~~~~~~~~~~-k~~~n~~~~ivd~~~~--- 69 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRI---DD-GMSRV--RLLARYSNGD-APLPELTRNTSAAWRSFQR-EARTNWGLMVRDSVAD--- 69 (456) T ss_pred CCCCCHHHHHHHHHHHH---HH-HHHHH--HHHHHHHhcC-CCchhcCcccChhhhhhhh-hhhcchHHHHHHHHHh--- Confidence 00000000000000000 00 00000 0000111110 0000 0011111000 0112222333322222 Q ss_pred eecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCc Q lcl|NC_019530. 102 TCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDS 181 (560) Q Consensus 102 v~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDP 181 (560) .-..+||++..++-. ...+.+..+.+..+|+....++++.-++.||-|.+.-.| .+|-..++.+|| T Consensus 70 --~l~~~~~~~~~~~d~--------~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d----~~g~~~i~~~~p 135 (456) T protein:vir:10 70 --RIIPNGITVGGSADS--------DLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR----DDGTATITADSP 135 (456) T ss_pred --hhccCCeecCCCCCc--------chHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeC----CCCceEEEEEcc Confidence 222345554433211 112345556667789999999999999999977643322 356678889999 Q ss_pred cccchhhhhcccCC-cccccccc---------------c----cchheeecCccccccccccccccccccccccceecch Q lcl|NC_019530. 182 RCIRPVYIVEKAMR-EGGIEAIE---------------S----VTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDD 241 (560) Q Consensus 182 r~i~~vr~i~~~~~-~g~~~~~~---------------~----~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~ 241 (560) +.+-.+..=....+ ...+.... + ++..+.|..... .......+...... ....-... T Consensus 136 ~~~~~i~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~--~~~~~~~~ 212 (456) T protein:vir:10 136 ETMVVSVDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRR-RLVTRISDSWVPVG--DAVVTGSP 212 (456) T ss_pred ceeEEEEcCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccc-eeeeecCCceeecc--ccCCCCCc Confidence 88765532110000 00000000 0 000011110000 00000000000000 00000000 Q ss_pred hceeeeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHH-HHHHHHHHhhhh Q lcl|NC_019530. 242 ESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLV-TTEDATVIYAITRAPEKRAFYLDVGTLGKKSAE-EYMTMMMGKFKN 319 (560) Q Consensus 242 daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~-m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAe-qYl~~im~k~kn 319 (560) =.|+|.+ +....|=++..+.....+. ++=|.++.-...--|.|-+.-.+.+. +...+. .=+ +.+..++ T Consensus 213 ~pvv~~~-------N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~-~~~d~~g~~~-~~~~~~~- 282 (456) T protein:vir:10 213 PPVVVYQ-------NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGL-PNVDENGNAI-DYASIFE- 282 (456) T ss_pred eeEEEec-------CCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccc-cccccccccc-chhhhhh- Confidence 1122222 1122344444443333322 11233333344444444433222111 110000 000 0000111 Q ss_pred heeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCccccc Q lcl|NC_019530. 320 RNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGSINIG 398 (560) Q Consensus 320 k~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~ 398 (560) +..|. .|. ...|..|..++. .+++. ++-++-.-..+...-++|..-|+..++ |. T Consensus 283 -----~~~~~-------------~~~----~~~~~~~~q~~~-~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~-N~- 337 (456) T protein:vir:10 283 -----AAPGA-------------LWE----LPPGVDIWESQA-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQ- 337 (456) T ss_pred -----hhccc-------------ccc----CCCCcceEEecc-cChhHHHHHHHHHHHHHHhccCCChHHhccccc-Ch- Confidence 11121 132 123455667775 33443 344677777788888999888865432 22 Q ss_pred ccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHH Q lcl|NC_019530. 399 GSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLA 478 (560) Q Consensus 399 ~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~ 478 (560) | +..|.--+..+-.-+.+.|+.|..-+.+.++.-+.+.|.. + + ..|.+.|.. +-..- +.+.++ T Consensus 338 -S-g~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~--~-~-----~~~~v~w~~-~~~~~------~~~~ad 400 (456) T protein:vir:10 338 -S-AEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES--V-E-----DTVDVSFES-PDRVT------LGEKYS 400 (456) T ss_pred -H-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--c-c-----cceeEEecC-CCCcC------HHHHHH Confidence 2 3556666667778888899999999999999888888853 2 1 125555633 21111 223455 Q ss_pred HHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHH-HHHHhhhcCCCCCCCccccccccccccCCC Q lcl|NC_019530. 479 SLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQT-KIAEEKKKGLYPKVQADETGNYSGSDVSPL 543 (560) Q Consensus 479 ~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~k-qIe~E~~~~~~~~p~~~e~g~~~~~~~~~~ 543 (560) ++..+..- -..|..++ ..+|++++++|++++. .+++|. ++.-.+|.+. |.+++.- T Consensus 401 a~~kl~~~--gi~~~~~~-~~~lg~~~~~i~~~e~er~~~e~-~~~~~~~~~~------~~~~~~~ 456 (456) T protein:vir:10 401 AASLAKAA--GESWASIR-RNILNYNADQIKQDDLDRAREQI-TLFAGNPVQR------PQEDGSR 456 (456) T ss_pred HHHHHHHc--CCChHHHH-HhhCCCCHHHHHHHHHHHHHHHH-HHHhhhhhhc------CCCCCCC Confidence 55554321 24665555 5789999999875433 333332 2211111111 1111111 No 167 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=92.51 E-value=0.0024 Score=34.99 Aligned_cols=193 Identities=12% Similarity=0.144 Sum_probs=85.5 Q ss_pred EEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccC--C-----CCccceeecCCCCCc Q lcl|NC_019530. 293 AFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRRE--G-----QNATEIATVGGGNQL 365 (560) Q Consensus 293 vFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRRe--G-----grgTEIsTLpGg~nL 365 (560) ||.++. +.+-+- ...++ -++.|.+..++ |+ + +.+=+..++. .+| T Consensus 1 V~k~~~------------------l~~~~~--~~~~~---~~~r~~~~~~~----~~~~~~~~ld~~~e~~e~~~--~~l 51 (201) T protein:vir:10 1 MWKAKG------------------LADLCD--DSDGA---ARLRLAQVDNN----SGVGQAIGIDADSEEYNVLN--SDI 51 (201) T ss_pred CccchH------------------HHHHhc--CChHH---HHHHHHHHHHh----hhhhhhheeecCCcceeeee--cCc Confidence 333211 000000 00000 01111111111 11 0 0001122222 466 Q ss_pred chHHH-HHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChh Q lcl|NC_019530. 366 GEMDH-VNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEK 444 (560) Q Consensus 366 gei~D-V~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~e 444 (560) +-++| +..|...+=-+.++|+.||-.++.-.++.++-+ |.-.|..+|..+|.+. +..+|++ |+.-+. -++ T Consensus 52 sGl~d~l~~~~~~iaa~s~iP~t~LfG~sp~Glnatge~----d~~nyyd~i~~~Qe~~---l~p~le~-l~~~~~-~~~ 122 (201) T protein:vir:10 52 GGIDTFLSQKFDRIVALSGIHEIILKGKNVGGVSASQNT----ALETFYGYVDRKRKAE---LLPLLEF-LLPFIV-TEQ 122 (201) T ss_pred CChHHHHHHHHHHHHhHhcCchhhhcCCCCccccccchh----HHHHHHHHHHHHHHHH---HHHHHHH-HHHhhc-CCC Confidence 67777 467888899999999999966654344433212 3345999999999533 3344443 222233 334 Q ss_pred HHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCC Q lcl|NC_019530. 445 DWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLY 524 (560) Q Consensus 445 ew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~ 524 (560) + ++|.|..=..-+|...+||.....++++.+-.- -.+|.+-+++.+-.. ...|-. T Consensus 123 ~--------~~~~f~pL~~~s~kekAei~~~~a~a~~~~~~~--g~i~~~e~r~~L~~~---------------~~~~~~ 177 (201) T protein:vir:10 123 E--------WSVEFNPLSQVSDKDKSEILEKNVNSVAALIAA--GIIDADEARDTLRAI---------------STEVKI 177 (201) T ss_pred C--------ceEeeCCCCCCCHHHHHHHHHHHHHHHHHHHHc--CCCCHHHHHHHHHhc---------------CCcCCC Confidence 4 456666666678888889988888888775332 245555555442111 111211 Q ss_pred CCCCccccccccccccCCCCCCcccccccCCccc Q lcl|NC_019530. 525 PKVQADETGNYSGSDVSPLKFKPETIPFSGSTDD 558 (560) Q Consensus 525 ~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~ 558 (560) ++-.-++.-+.+..+ +|+- =|.|+ T Consensus 178 ~~~~~~~~~~~~e~~------dp~~----~~~~~ 201 (201) T protein:vir:10 178 GEGSIQTEVVINESE------DPLD----VSANN 201 (201) T ss_pred CCCCCCccccccccC------CCCC----CCCCC Confidence 110000000000000 0000 01111 No 168 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=91.55 E-value=0.015 Score=30.54 Aligned_cols=407 Identities=11% Similarity=0.064 Sum_probs=166.1 Q ss_pred chhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHH-h Q lcl|NC_019530. 6 RGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSM-A 84 (560) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~m-a 84 (560) -||+..+|. ...+.+-+++..-.|.... .+...-... + ..++ . T Consensus 1 Mg~~~~~~~---------------------~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~-~-------~~~~~~ 44 (423) T protein:vir:81 1 MGFLQKLGL---------------------APSVVATPEPIELVGPIFE-------SLKLSTKNM-T-------VEQIWE 44 (423) T ss_pred CchhHhhcc---------------------ccccccCcccccccccccc-------ccccccchh-h-------HHHHHH Confidence 233333321 0111111111111222111 111110111 1 1122 4 Q ss_pred hccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCc---hhh----HHHHHHhhhhhh Q lcl|NC_019530. 85 QQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFD---NTA----YQKIRKWYVDGR 157 (560) Q Consensus 85 ~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~---~~~----~~~~R~wYvDGr 157 (560) .+|-|..||+-|.+.+--. |+.|-=...++ .++.+.+ ..+.+||.-. ..+ +.++..+.+.|. T Consensus 45 ~~~~v~~~i~~ia~~ia~l-----p~~~~~~~~dg----~~~~~~~--~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn 113 (423) T protein:vir:81 45 DQPHLRTVTTFIARNVASL-----QLQAFERVEDG----GRERVRE--GHLARVCKLANSDMTMYDLLERTMFDLCLYDE 113 (423) T ss_pred hhhHHHHHHHHHHHhHhhC-----ceEEEEEecCC----ceeeecc--chHHHHhhcCCCCCCHHHHHHHHHHHHhhcCC Confidence 7899999997777754321 22221111111 1112211 1122333211 123 344556678898 Q ss_pred hhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccce Q lcl|NC_019530. 158 QAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQEL 237 (560) Q Consensus 158 ~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~ 237 (560) -|..+..|..- ...+.+|+++.|..+.. ....+..+ ..+|.|.- . ....+..+ T Consensus 114 a~~~i~rd~~~-~~~~~~l~p~~~~~v~~---~~~~~~~~--------~~~Y~~~~-------~--------~~~~g~~~ 166 (423) T protein:vir:81 114 FFWLLPGDLGV-DTPTLDIRPIPVSWVQR---RAYKDGWG--------SLDYIIIE-------S--------GDNDGRSV 166 (423) T ss_pred eEEEEEecCCc-CcceEEEeecccceeee---eeccCCCc--------ceEEEEEE-------e--------cCCCceEE Confidence 77655444322 12344455544433322 11112111 11122210 0 01123456 Q ss_pred ecchhceeeeecccccCCCC-cchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHh Q lcl|NC_019530. 238 VFDDESIVYIDSGEEPLANG-IVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGK 316 (560) Q Consensus 238 ~i~~daI~y~hsGl~~~~~~-~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k 316 (560) .++.+.|.|.+. ...++. +..|-++.|..++.....+++...=+=---+.-+-|+..|-...|.+-.++-.+.+..+ T Consensus 167 ~~~~~evih~r~--~~~~~~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~~~~~ 244 (423) T protein:vir:81 167 KVPGERVIHRHG--YNPKTMKRGKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAESRTRFMAN 244 (423) T ss_pred EEcccceEEecC--CCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHHHHHHHHHH Confidence 788898888872 233333 46788999999888877777764433222345566777775543322222222334444 Q ss_pred hhhhee-ecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHH---HHHHHHHHhcCCCccccCCC Q lcl|NC_019530. 317 FKNRNA-YDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVN---YFREKLYDALMIPKSRLQEE 392 (560) Q Consensus 317 ~knk~v-Yd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~---YF~~kLy~aL~VP~sRl~~~ 392 (560) ++.... --..+|++ + .++ .|.+++.|. .+.-++.-++ +-...+.++++||...++.- T Consensus 245 ~~~~~~~~~~n~g~~------~-vl~----------~g~~~~~l~--~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~ 305 (423) T protein:vir:81 245 LRASFSPKSSDVGGT------L-LLE----------DGMKAENFH--TTSKDEQTVETTKLSLQTVAQVYGINPTMVGQL 305 (423) T ss_pred HHHHhccccccCCcc------e-ecC----------CCceEEecc--CChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCC Confidence 443321 11223432 2 222 245666663 3444444433 55677999999999988643 Q ss_pred CcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHH Q lcl|NC_019530. 393 GSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAI 472 (560) Q Consensus 393 ~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei 472 (560) ++.+ .++.++..+. |..++ |+-.+ ..+.+.|-..| +++.+|+. ....+.|..+ .|....+ T Consensus 306 ~~~t--~sn~e~~~~~---f~~~~--L~P~~-~~ie~~l~~~L-----~~~~~~~~---~~~~~~fd~~----~llr~d~ 365 (423) T protein:vir:81 306 DNAN--YSNVREFRKA---LYGDN--LGSWI-RIIQDVMNLFL-----LPRVGIDN---EKFYFEFNLE----EKLRASF 365 (423) T ss_pred CCCC--cccHHHHHHH---HHHHH--HHHHH-HHHHHHHhhhh-----cCcccccc---CccEEEecch----hhhccCH Confidence 3322 2333333333 65552 33211 22344444444 44444421 1123334222 2222222 Q ss_pred HHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccc---cccccccccCCCCCCccc Q lcl|NC_019530. 473 LNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADE---TGNYSGSDVSPLKFKPET 549 (560) Q Consensus 473 ~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e---~g~~~~~~~~~~~~~p~~ 549 (560) +.|.+.++.+-.-. -++|.+-+++ ++.| -|-|..|+ +.+..+++.. T Consensus 366 -~~r~~~~~~~l~~~-G~~T~NE~R~-~~gl--------------------~p~~gGD~~~~p~n~~~~~~~-------- 414 (423) T protein:vir:81 366 -EEAAEIKRAAVGNV-AWMTINEVRA-MDNL--------------------PSIDGGDDLARPLNTEFGDSE-------- 414 (423) T ss_pred -HHHHHHHHHHHhCC-CCcCHHHHHH-HhCC--------------------CCCCCcceeecccccccCccC-------- Confidence 34454444321111 2455555542 2222 22222221 1111111111 Q ss_pred ccccCCcccC Q lcl|NC_019530. 550 IPFSGSTDDS 559 (560) Q Consensus 550 ~~~~~~~~~~ 559 (560) +..|.+.++ T Consensus 415 -~~~~~~~~t 423 (423) T protein:vir:81 415 -DAPGEEVET 423 (423) T ss_pred -CCCCCCCCC Confidence 111222222 No 169 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=91.46 E-value=0.016 Score=30.47 Aligned_cols=419 Identities=12% Similarity=0.136 Sum_probs=175.6 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHH Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEY 80 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~Y 80 (560) .---|-||||+. +++-.. .+..-.....|+. |.+..+.- . ++.-...... . +. T Consensus 2 ~~~~~~g~~~~~---~~~~~~---------~~~~~~~~~~~~~-~~~~~~~~-~-~~~~~~~g~~----v-~~------- 54 (432) T protein:vir:97 2 PDEKKLGLLGQL---KAMFVP---------PDPVDIGGGQTFT-PVNATARD-L-GIIISDTGAA----V-NA------- 54 (432) T ss_pred CCcccCchhhhh---HhhcCC---------ccccccccccccc-cCchhhhh-h-cccccccCcc----c-ch------- Confidence 222344566543 222111 0000000001111 11100000 0 0000000000 0 11 Q ss_pred HHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHH-hCCchhhHH----HHHHhhhh Q lcl|NC_019530. 81 QSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHL-MDFDNTAYQ----KIRKWYVD 155 (560) Q Consensus 81 R~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l-l~f~~~~~~----~~R~wYvD 155 (560) ...+++|-|..||.-|-+.+.-+ |+.|--...++ .++.+..-.-++|+. =+=...+++ ++..+++. T Consensus 55 ~~a~~~~aV~~~v~~Ia~~ia~l-----p~~~y~~~~~g----~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~ 125 (432) T protein:vir:97 55 DAIMRLDAVAACVKLVSQAVAAM-----PLMMYMRTPDG----RKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLD 125 (432) T ss_pred HhhhcchHHHHHHHHHHHhhccC-----ceEEEEecCCC----cccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhc Confidence 23457899999997776655432 44333222221 222222222233321 111233333 44566777 Q ss_pred hhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCcccccccccccccccccccccc Q lcl|NC_019530. 156 GRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQ 235 (560) Q Consensus 156 Gr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~ 235 (560) |.-|..++.+. ..+.+|.+|+|.++..++ +..|. .+|.|.. ..+. T Consensus 126 Gnay~~~~~~~----g~~~~L~~l~p~~v~v~~-----~~~g~--------~~y~~~~------------------~~g~ 170 (432) T protein:vir:97 126 GTAYVRKVVTD----GRIESLQYLANDRLTITT-----DTKGN--------TAYRYRR------------------TDGQ 170 (432) T ss_pred CCeEEEEEecC----CcEEEEEEEcCcceEEEE-----cCCCc--------EEEEEEe------------------cCce Confidence 87776665532 358999999998887642 12221 1222210 1123 Q ss_pred ceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHH Q lcl|NC_019530. 236 ELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMG 315 (560) Q Consensus 236 ~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~ 315 (560) ...++.+.|.|... .+.++-+.+|.|..|.+++.....+++...=+=---+--.-|..+| +.|-+..++. +++ T Consensus 171 ~~~~~~~~iih~r~--~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~-~~l~~e~~~~-~~~--- 243 (432) T protein:vir:97 171 MIDIPRQQIWKIMG--YSLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQID-RFLTDDQYDS-FSK--- 243 (432) T ss_pred EEEEccccEEEecC--cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecC-CCCCHHHHHH-HHH--- Confidence 45688888887752 3445556778999999999877777765432211122223455555 3444443332 222 Q ss_pred hhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCccccCCC Q lcl|NC_019530. 316 KFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE---MDHVNYFREKLYDALMIPKSRLQEE 392 (560) Q Consensus 316 k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~~ 392 (560) +|..- ...|+ .+ +++ .|.+++.|. .+..+ ++-.+|....+.++++||...|+.. T Consensus 244 ~~~~~----~nag~------~~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~ 300 (432) T protein:vir:97 244 KVSGS----VEAGR------AP-LLE----------GGMDVKSLG--LNPVDAQLLQSRQYSVESICRFFGVPPSMIGHS 300 (432) T ss_pred HHhhh----hcCCC------ce-ecC----------CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCc Confidence 22110 01122 11 222 244455552 23333 3345688899999999999999754 Q ss_pred Ccccc-cccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHH Q lcl|NC_019530. 393 GSINI-GGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENA 471 (560) Q Consensus 393 ~~~~l-~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~E 471 (560) ...+. -.++.++..+. |.++ .|+--+ ..+.+.|-+ +++++.++. .+.++|..+ .+... T Consensus 301 ~~~t~~~~s~~e~~~~~---f~~~--tl~P~~-~~ie~~ln~-----kLl~~~e~~-----~~~~~fd~~----~llr~- 359 (432) T protein:vir:97 301 SAGTTSWGSGIESQQLG---FLTM--TLSPWL-RRIEQSIAL-----NLLTPAERR-----RYFADFDTS----ALLRA- 359 (432) T ss_pred CCcccccchhHHHHHHH---HHHH--HHHHHH-HHHHHHHhh-----hccCccccC-----ceEEEeech----hhhcc- Confidence 32111 12333333222 5544 343211 122233332 445665542 134444433 22222 Q ss_pred HHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCccccc Q lcl|NC_019530. 472 ILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIP 551 (560) Q Consensus 472 i~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~ 551 (560) -...|.+.++.+ ..+-++|.+-+++ ++.|...+ ..+ ..+. .+++..+-++.+.+-.|+.-- T Consensus 360 d~~~r~~~~~~~--~~~G~~T~NE~R~-~~glpp~~--g~~---------~~~~-----~~~~~~pl~~~~~~~~~~~~~ 420 (432) T protein:vir:97 360 DSAARSSYYSQL--VNNGLMTRDEARE-IEGLPKLG--GNA---------AVLT-----VQSAMVPLDSIGLQASPEPAS 420 (432) T ss_pred CHHHHHHHHHHH--HhCCCCCHHHHHH-HhCCCCCC--CCc---------ceEe-----ecccccchhhhcccCCCCCCC Confidence 133566665544 2223677777763 35552211 000 0000 111111222222222222110 Q ss_pred ccCCc-ccCC Q lcl|NC_019530. 552 FSGST-DDSI 560 (560) Q Consensus 552 ~~~~~-~~~~ 560 (560) ..++. .+.+ T Consensus 421 ~~~~~~~~~~ 430 (432) T protein:vir:97 421 GLGNQQQDKV 430 (432) T ss_pred CCCCcccccc Confidence 01111 1111 No 170 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=90.73 E-value=0.019 Score=29.98 Aligned_cols=387 Identities=12% Similarity=0.117 Sum_probs=143.5 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) |+.+ +|+ .+++ .-........+++.. ..+..+-. +. +..+.+|.|..||. T Consensus 1 mg~~-~~~-~~~~--~~~~~~~~~~~~~~~------------------~~~~~~~~--t~------~~~~~~~~v~~cv~ 50 (403) T protein:vir:10 1 MGFK-SWI-TEKL--NPGQRIIRDMEPVSH------------------RTNRKPFT--TG------QAYSKIEILNRTAN 50 (403) T ss_pred Ccch-hhh-hhcc--chhhhhhhccccccc------------------ccCCcccc--cH------HHHHHHHHHHHHHH Confidence 3322 222 1111 000011111111100 00000000 00 12346788889996 Q ss_pred HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHH-hCCchhhHH----HHHHhhhhhhhhhhheecCCCC Q lcl|NC_019530. 95 IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHL-MDFDNTAYQ----KIRKWYVDGRQAYHVIVDPTNK 169 (560) Q Consensus 95 eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l-l~f~~~~~~----~~R~wYvDGr~~~hkiiD~~~~ 169 (560) -|.+.+..+ .+.|.........-...+...+.++|+. =+-.-.+.+ ++-.++..|.-|.+ ++. T Consensus 51 ~Ia~~ia~~-------p~~v~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~--~~~--- 118 (403) T protein:vir:10 51 MVIDSAAEC-------SYTVGDKYNIVTYANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIY--WDG--- 118 (403) T ss_pred HHHHHHhhC-------ceeEeecccccccccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEE--EeC--- Confidence 666544322 1222111100111111222334455543 222233333 33445567876643 221 Q ss_pred ccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeee- Q lcl|NC_019530. 170 KGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYID- 248 (560) Q Consensus 170 k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~h- 248 (560) .+|.+|.|..+... .+..+ ++ -+|.|+ ++ ..++.+.|.|.. T Consensus 119 ----~~l~~l~~~~~~v~-----~~~~~---~~----~~~~~~--------~~--------------~~~~~~eiih~~~ 160 (403) T protein:vir:10 119 ----TSLYHVPAALMQVE-----ADANK---FI----KKFIFN--------NQ--------------INYRVDEIIFIKD 160 (403) T ss_pred ----ceeEeecCcceEEE-----EcCCc---eE----EEEEec--------Cc--------------eeecccceEEecc Confidence 35677777554421 11111 11 112221 01 112233333332 Q ss_pred cccccC--CCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeeccc Q lcl|NC_019530. 249 SGEEPL--ANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRT 326 (560) Q Consensus 249 sGl~~~--~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~ 326 (560) ...+.. ++-..+|-+..|++.++.-..++....=+----+.-.-|...+ +.|-+..+++.-+.+-..|... .. T Consensus 161 ~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~l~~e~~~~~~~~~~~~~~g~----~n 235 (403) T protein:vir:10 161 NSYVCGTNSQISGQSRVATVIDSLEKRSKMLNFKEKFLDNGTVIGLILETD-EILNKKLRERKQEELQLDYNPS----TG 235 (403) T ss_pred cccccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeC-CCCCHHHHHHHHHHHHHHhCCc----cc Confidence 111111 2223456777777777766666654432211223334455544 4565555544333333333211 11 Q ss_pred CCcccCcccchhhHhhhcccccCCCCccceeecC---CCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchh Q lcl|NC_019530. 327 TGKITGNAHLMGIAEDYWLPRREGQNATEIATVG---GGNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLA 403 (560) Q Consensus 327 TGev~d~~~~msmlEDywLpRReGgrgTEIsTLp---Gg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~ 403 (560) .|++ + .++ | |.+++.+. --..+..++..+|..+.+.++++||..-|+... .++.+ T Consensus 236 ~g~~------~-vl~--------~--g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-----~sn~e 293 (403) T protein:vir:10 236 QSSV------L-ILD--------G--GMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGGN-----NANIR 293 (403) T ss_pred Ccce------e-ecC--------C--CceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-----CcCHH Confidence 1222 1 122 1 33444332 122344577788999999999999999985321 23333 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019530. 404 EITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTV 483 (560) Q Consensus 404 eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~ 483 (560) +. -..|.+++ |+-.+.. +.+.|-.. +..++.+++.......+ + ...|++.++.+ T Consensus 294 ~~---~~~f~~~t--l~P~~~~-ie~~l~~~---------------L~~~~~~d~~~~~~l~~--D---~~~~~~~~~~~ 347 (403) T protein:vir:10 294 PN---IELFYYMT--IIPMLNK-LTSSLTFF---------------FGYKITPNTKEVAALTP--D---KEAEAKHLTSL 347 (403) T ss_pred HH---HHHHHHHH--HHHHHHH-HHHHHHHh---------------cCceeeeccchhhhccc--C---HHHHHHHHHHH Confidence 32 23365543 3222211 22222222 22234444322211100 0 12444444432 Q ss_pred hhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCC-Ccc----ccccccccccCCCCCCcccccccCCccc Q lcl|NC_019530. 484 EPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKV-QAD----ETGNYSGSDVSPLKFKPETIPFSGSTDD 558 (560) Q Consensus 484 ~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p-~~~----e~g~~~~~~~~~~~~~p~~~~~~~~~~~ 558 (560) -. .-+++.+-++ .++++..-+ +|--+..+.+. .+. ..|+++ -+ | .++|.+ T Consensus 348 ~~--~G~lT~NE~R-~~~gl~pi~---------~~~~d~~~~p~n~~~~~~~~~~~e~--------~~----~-~~~~~g 402 (403) T protein:vir:10 348 VN--NGIITGNEAR-SELNLEPLD---------DEQMNKIRIPANVAGSATGVSGQEG--------GR----P-KGSTEG 402 (403) T ss_pred Hh--CCCcCHHHHH-HHhCCCCCC---------cccccccccccccccccccCCCCcC--------CC----C-CCCcCC Confidence 11 2366777776 336664421 11111111110 000 000000 00 0 122222 Q ss_pred C Q lcl|NC_019530. 559 S 559 (560) Q Consensus 559 ~ 559 (560) - T Consensus 403 ~ 403 (403) T protein:vir:10 403 D 403 (403) T ss_pred C Confidence 2 No 171 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=90.71 E-value=0.019 Score=29.97 Aligned_cols=419 Identities=12% Similarity=0.119 Sum_probs=179.2 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHH Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEY 80 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~Y 80 (560) .---|-|+||+. +++-.. .+..-.....| ..|.+..+.-. +.....+...+ +. T Consensus 2 ~~~~~~~~~~~~---~~~~~~---------~~~~~~~~~~~-~~~~~~~~~~~--~~~~s~~g~~v-----~~------- 54 (432) T protein:vir:10 2 PDEKKLGLLGQL---KAMFVP---------PDPVDIGGGQT-FTPVNATARDL--GIIISDTGAAV-----NA------- 54 (432) T ss_pred CCCcccchhhhh---HhhcCC---------ccccccccccc-cccCcchhhhh--cccccccCccc-----ch------- Confidence 222334555543 222111 01100000011 11111110000 00000000000 11 Q ss_pred HHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHH-hCCchh----hHHHHHHhhhh Q lcl|NC_019530. 81 QSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHL-MDFDNT----AYQKIRKWYVD 155 (560) Q Consensus 81 R~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l-l~f~~~----~~~~~R~wYvD 155 (560) ...+++|-|..||.-|-+.+.-+ |+.|--...++ .++.+..-.-.+|+. =+-... -+.++..+++. T Consensus 55 ~~al~~~~V~~~i~~Ia~~ia~l-----p~~~y~~~~~g----~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~ 125 (432) T protein:vir:10 55 DAIMRLDAVAACVKLVSQAIAAM-----PLTMYMRTPDG----RKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLD 125 (432) T ss_pred hhhhcchHHHHHHHHHHHhhhhC-----ceeEEEecCCC----cccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhc Confidence 23457899999997776655432 33332222221 112222222222221 111123 34455567788 Q ss_pred hhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCcccccccccccccccccccccc Q lcl|NC_019530. 156 GRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQ 235 (560) Q Consensus 156 Gr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~ 235 (560) |.-|..++.+ ...+++|.+|+|.++..++ +..|. .+|.|.. ..+. T Consensus 126 Gnay~~~~~~----~g~~~~L~~l~~~~v~v~~-----~~~g~--------~~y~~~~------------------~~g~ 170 (432) T protein:vir:10 126 GTAYVRKVVT----DGRIESLQYLANDRLTITT-----DTKGN--------TAYRYRR------------------TDGQ 170 (432) T ss_pred CCeEEEEEec----CCcEEEEEEEcCCceEEEE-----cCCCc--------EEEEEEe------------------cCce Confidence 8777655542 2468999999998887653 11221 1222211 1123 Q ss_pred ceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHH Q lcl|NC_019530. 236 ELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMG 315 (560) Q Consensus 236 ~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~ 315 (560) ...++.+.|.|... .+.++-+.+|.|..|.+++......++...=+=---+.-.-|..+| +.|-+...++..+. ++ T Consensus 171 ~~~~~~~~iih~~~--~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~l~~e~~~~~~~~-~~ 246 (432) T protein:vir:10 171 MIDIPKQQIWKIMG--YSLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQID-RFLTDDQYDSFAKK-VS 246 (432) T ss_pred EEEEcCccEEEecC--CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecC-CCCCHHHHHHHHHH-Hh Confidence 45678888877753 3445556779999999999888777775443222223334555555 45554444432222 11 Q ss_pred hhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCccccCCC Q lcl|NC_019530. 316 KFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE---MDHVNYFREKLYDALMIPKSRLQEE 392 (560) Q Consensus 316 k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~~ 392 (560) ... ..|+ .+ +++ .|++++.|. .+..+ ++-.+|....+.++++||...|+.. T Consensus 247 ~~~-------nag~------~~-vl~----------~g~~~~~l~--~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~ 300 (432) T protein:vir:10 247 GSV-------EAGR------AP-LLE----------GGMDVKSLG--LNPVDAQLLQSRQYSVESICRFFGVPPSMIGHS 300 (432) T ss_pred hhh-------hCCC------ce-ecC----------CCceEEEcc--CChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCc Confidence 111 1122 11 232 244555553 33333 3344688889999999999999754 Q ss_pred Cc-ccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHH Q lcl|NC_019530. 393 GS-INIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENA 471 (560) Q Consensus 393 ~~-~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~E 471 (560) .. .....++..+..+. |..+ .|+--+. .+.+.|-+ +++++.++. ...++|..+ ++... T Consensus 301 ~~~t~~~~sn~e~~~~~---f~~~--tl~P~~~-~ie~~ln~-----kL~~~~~~~-----~~~~~fd~~----~ll~~- 359 (432) T protein:vir:10 301 SAGTTSWGSGIESQQLG---FLSM--TLSPWLR-RIEQSIAL-----NLLSPAERR-----RYFADFDTS----ALLRA- 359 (432) T ss_pred cCCcccccchHHHHHHH---HHHH--HHHHHHH-HHHHHHHh-----hhcCccccC-----ceEEEeech----hhhcc- Confidence 32 12122333333333 5443 3433221 23333333 334554432 244555443 22222 Q ss_pred HHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCccccc Q lcl|NC_019530. 472 ILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIP 551 (560) Q Consensus 472 i~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~ 551 (560) -..+|.+.++.+- .+-++|.+-+++ +|.|..-+ ..+.. +. .+++..+-+..+.+-.|+..- T Consensus 360 d~~~r~~~~~~~~--~~G~~T~NE~R~-~~glppi~--g~~~~---------~~-----~~~~~~pl~~~~~~~~~~~~~ 420 (432) T protein:vir:10 360 DSAARSSYYSQLV--NNGLMTRDEARE-IEGLPKLG--GNAAV---------LT-----VQSAMVPLDSIGLQASPEPAS 420 (432) T ss_pred CHHHHHHHHHHHH--hCCCCCHHHHHH-HhCCCCCC--CCcce---------Ee-----ecCcccchhhhcccCCCCCCC Confidence 2345666665542 223677777774 35653211 00000 00 011111111111111222111 Q ss_pred ccCCc-ccCC Q lcl|NC_019530. 552 FSGST-DDSI 560 (560) Q Consensus 552 ~~~~~-~~~~ 560 (560) ..+.. .+.+ T Consensus 421 ~~~~~~~~~~ 430 (432) T protein:vir:10 421 GLGNQQQDKV 430 (432) T ss_pred CCCCcccccc Confidence 11111 1111 No 172 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=89.56 E-value=0.026 Score=29.31 Aligned_cols=376 Identities=10% Similarity=0.091 Sum_probs=156.3 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) |++|+.. .++ +..... ...+.+............+ +..- .-+..+++|-|.+||. T Consensus 1 M~~f~~~------~~~---~~~~~~-------~~~~~~~~~~~~~~~~~~~--~~~v-------~~~~~~~~~~v~~~i~ 55 (386) T protein:vir:48 1 MPIFNIT------NLA---TESPPI-------SQGGFFDITDPDFLSTLNG--SEWV-------SAESALRNSDLFSIIN 55 (386) T ss_pred Ccccccc------ccc---cccccc-------ccccccccccchhcccccC--Ccee-------chhhhhcchHHHHHHH Confidence 2222210 000 000111 1111111111100000000 0000 1122357899999998 Q ss_pred HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHH----HHHHhhhhhhhhhhheecCCCCc Q lcl|NC_019530. 95 IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQ----KIRKWYVDGRQAYHVIVDPTNKK 170 (560) Q Consensus 95 eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~----~~R~wYvDGr~~~hkiiD~~~~k 170 (560) -|.+.+.-+ .+.+.... .+.+++--+-.-.+++ +++.+++.|.-|.-++.|. . T Consensus 56 ~ia~~ia~~-------p~~~~~~~-------------~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~---~ 112 (386) T protein:vir:48 56 QLSNDLATV-------KLTASRKQ-------------LQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNE---N 112 (386) T ss_pred HHHHhhccC-------ceeeccch-------------hHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECC---C Confidence 888876543 22222211 1122222333334444 4556788898888666654 2 Q ss_pred cCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeecc Q lcl|NC_019530. 171 GGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSG 250 (560) Q Consensus 171 ~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsG 250 (560) .-+.+|++|+|..+..++ +..++ ..+|.|.-. +. ..+....++.+.|.|.. T Consensus 113 g~~~~L~~l~~~~v~v~~-----~~~~~-------~~~y~~~~~-------~~--------~~~~~~~~~~~evih~~-- 163 (386) T protein:vir:48 113 GRDMKWEYLRPSQVSFNR-----LDNKD-------GIYYNITFD-------DP--------RIPPKQHVPQGDVLHFK-- 163 (386) T ss_pred CcEEEEEEecCceeEEEE-----cCCCc-------eEEEEEEec-------Cc--------cccceeEecCccEEEec-- Confidence 347899999998887542 11121 122333211 10 11234567777777765 Q ss_pred cccCCC-CcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCc Q lcl|NC_019530. 251 EEPLAN-GIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGK 329 (560) Q Consensus 251 l~~~~~-~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGe 329 (560) ..+.++ -+..|.|..|++++.....+++...=+----+--+-+.-.+- .+.+...++ +++.. +... ...|. T Consensus 164 ~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~-~~~~e~~~~-~~~~~----~~~~--~n~g~ 235 (386) T protein:vir:48 164 LLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKG-GGLLDFKTK-LSRSR----QAMK--QMQGG 235 (386) T ss_pred CCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCCHHHHHH-HHHHH----HHhh--cCCCC Confidence 223333 246789999999999988888875543333333344444443 333333322 32222 2211 11232 Q ss_pred ccCcccchhhHhhhcccccCCCCccceeecCCCCCcchH---HHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhh Q lcl|NC_019530. 330 ITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEM---DHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEIT 406 (560) Q Consensus 330 v~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei---~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eIt 406 (560) + + .++ .|.++..|. .+.-++ +=.++....+.++++||..-|+..++. ++.++-. T Consensus 236 ~------~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~----~~~e~~~ 292 (386) T protein:vir:48 236 P------L-VLD----------DLEEFTPLE--IKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQGDQ----QSSLEMS 292 (386) T ss_pred c------e-ecC----------CCceEEEcC--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc----ccHHHHH Confidence 1 1 121 256666664 333333 334666789999999999999653332 2233333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_019530. 407 QEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPF 486 (560) Q Consensus 407 RDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~ 486 (560) ++ |.+++ |+--+. .+.+.|-..|. +.-+.+ + ...+..|.+. +... ++.. T Consensus 293 ~~---~~~~~--l~P~~~-~ie~~l~~~l~-----~~~~~~--~----~~~~~~d~~~--------~~~~------~~~l 341 (386) T protein:vir:48 293 LD---LYNKA--VSRYLR-PFLSELSQKLS-----CDVDAD--I----LPAVDPTGSN--------SVSR------INSM 341 (386) T ss_pred HH---HHHHH--HHHHHH-HHHHHHHHhhc-----chhhcc--h----hhhhccChHH--------HHHH------HHHH Confidence 33 54443 322111 12222222221 110100 0 0011112111 0111 1111 Q ss_pred h-cchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 487 V-GSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 487 v-Gky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) + +-+++..-+++ +|.+ .|+.+ ++...+ ..++..|..||.++-= T Consensus 342 ~~~g~~t~nE~r~-~lg~-----------------~~~~~--~~~~~~-----------~~~~~~~~~gGd~~~~ 385 (386) T protein:vir:48 342 VKSGTLAQNQGLY-ILQQ-----------------AEILP--KELPEG-----------ENPNKTTLKGGEINGE 385 (386) T ss_pred HhCCCcCHHHHHH-Hhhc-----------------CCCCC--ccchhh-----------cCCCCCccCCCCCCCC Confidence 1 22344444432 2222 22211 110000 1122223334333322 No 173 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=88.97 E-value=0.029 Score=29.01 Aligned_cols=326 Identities=13% Similarity=0.099 Sum_probs=136.7 Q ss_pred cccchhhhc--------cccccccC---CCCCCCCceeEeccccceecc--c--cccchhhHHHHHHHHHHHhhccchhH Q lcl|NC_019530. 27 LAQKQDELL--------LTKATVVA---LDDAQDGSIILQGGANTYNYV--G--VESELLSVKTVVEEYQSMAQQPEIRK 91 (560) Q Consensus 27 ~~~~~~e~~--------~~k~~s~~---~~~~~dG~~~i~~~~~~~~y~--~--~e~~~~~~~~LI~~YR~ma~~PEvd~ 91 (560) +.++..... ....+++. |.+-.+|.... .+-..+. + +++.+ +...|-+-+|..+.|--+-. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~---~~~~~~~~~~~~~~pp~-~~~~la~l~~~~~~h~~~i~ 76 (346) T protein:vir:10 1 MKKQLRKNLTQNDRLQPQAQTEIFSFGDPIPVLDRADIL---NYLECSAMYEKWYNPPM-SFDGLAKSLRSSTHHESAII 76 (346) T ss_pred CCcccCCCCCcccccccccCeEEEecCCcceecCchhHH---HHHHHhhcCCceEecCC-CHHHHHHHHHhhhhcchhhh Confidence 211111100 00000110 00011111000 0000000 0 12222 33445555555554421111 Q ss_pred HHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCcc Q lcl|NC_019530. 92 AVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKG 171 (560) Q Consensus 92 AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~ 171 (560) |=.+++.. -++.-+. .+.-.+| ..++-.|++-|.-|++++.|. .. T Consensus 77 ~k~n~l~~-------------l~~~Pn~---------------~~t~~~f----~~~~~d~ll~Gnay~~i~r~~---~G 121 (346) T protein:vir:10 77 TKANILLS-------------TCEVDSR---------------YLSRRDL----SSFVKDYLVFGNAYFEVVRNR---LG 121 (346) T ss_pred hhhhhHHH-------------HHhCCCC---------------CCCHHHH----HHHHHHHHhcCCeEEEEEEcC---CC Confidence 11111000 0000000 0001112 233446777899999888654 44 Q ss_pred CeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeeccc Q lcl|NC_019530. 172 GIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGE 251 (560) Q Consensus 172 GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl 251 (560) .+.+|.+|+|.+|+.. .+.++ . -|..+.+ .+....++.+.|.|... T Consensus 122 ~~~~L~pl~~~~v~~~-----~~~~~-~-------~~~~~~~-------------------~g~~~~~~~~dIih~r~-- 167 (346) T protein:vir:10 122 QVQRIESPLAKYVRKG-----LEAGQ-F-------YYVPQRF-------------------DHQEHEFAKGSIYHLLE-- 167 (346) T ss_pred cEEEEEEecCCceEEE-----EcCCe-E-------EEEEEcc-------------------CCeEEEEecccEEEecC-- Confidence 5889999999887642 11111 0 0111111 12355678888877762 Q ss_pred ccC-CCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcc Q lcl|NC_019530. 252 EPL-ANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKI 330 (560) Q Consensus 252 ~~~-~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev 330 (560) .+. +.-+.+|-+..|+..+..-...++...=|=--=|--.-|+|+--++|.+..+++ +++-+.+.+ | . T Consensus 168 ~~~~~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG~~~~~il~~~d~~l~~e~~~~-i~~~~~~~~---------g-~ 236 (346) T protein:vir:10 168 PDINQDIYGLPQYLSALQSAWLNESATLFRRKYFLNGAHAGFVFYMSDASQKQEDVEN-IRQQLKQSK---------G-V 236 (346) T ss_pred CCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHH-HHHHHHHhc---------C-c Confidence 232 233456778888777766555555432211111334566777556766554443 444344332 2 1 Q ss_pred cCcccchhhHhhhcccccCCCCccceeecCCCCCcchH--HHH-HHHHHHHHHhcCCCccccCCCCcccccccchhhhhH Q lcl|NC_019530. 331 TGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEM--DHV-NYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQ 407 (560) Q Consensus 331 ~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei--~DV-~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItR 407 (560) .+..+.+ +++ --....|.+++.|. ...-++ -.+ .+-++...++.+||...++--.+-+.+.+++++..+ T Consensus 237 ~n~~~~~-vl~-----~~~~~~gi~~~pis--~~~~d~qf~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s~~e~~~~ 308 (346) T protein:vir:10 237 GNFKNLF-VHA-----PNGKKDGIQIIPIA--DVSAKDEFFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFGNVADAAE 308 (346) T ss_pred cccCcee-Eec-----CCCCccceeEEecC--CChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHH Confidence 1111211 111 11112355555553 222222 222 244667999999999998643332234444444444 Q ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHH Q lcl|NC_019530. 408 EELRFSKF-CAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIRE 466 (560) Q Consensus 408 DElkF~Kf-i~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~E 466 (560) . |.+. |.-|+.+|..++..+.. + + |+|+-..=--++| T Consensus 309 ~---f~~~~l~P~~~~iee~n~~L~~-e-----~-------------i~F~~~~ll~~~~ 346 (346) T protein:vir:10 309 V---FFITEIEPLQERLKEFNQWLGQ-E-----V-------------IKFKPSKLLQRTQ 346 (346) T ss_pred H---HHHHHHHHHHHHHHHHHhhccc-c-----e-------------eeechhhhcccCC Confidence 4 6666 78888888775543211 1 1 1111000000111 No 174 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=87.83 E-value=0.036 Score=28.49 Aligned_cols=402 Identities=14% Similarity=0.135 Sum_probs=173.5 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHH Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEY 80 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~Y 80 (560) |+ --+|+++. ++.|....+.- .+.....-|... .+++.+ + ..-+. T Consensus 10 ~~-~~~g~~~~------~~~~f~~~~~~-------------~~~~~~~~~~~~--~~~~~~---~---~~v~~------- 54 (424) T protein:vir:18 10 LR-TNNGWWAR------LKSWFVGGRLV-------------TPNQGSQTGPVS--AHGYLG---D---SSIND------- 54 (424) T ss_pred cC-CCCchHHH------HHhhccccccc-------------cccchhhccccc--cccccc---c---ccccH------- Confidence 11 12333332 12222111110 000000011110 000000 0 00011 Q ss_pred HHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhC----CchhhHH----HHHHh Q lcl|NC_019530. 81 QSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMD----FDNTAYQ----KIRKW 152 (560) Q Consensus 81 R~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~----f~~~~~~----~~R~w 152 (560) +..+++|-|..||.-|.+.+.-. |+.|-=...+. .++++.. -..+..+|+ -...+++ ++..+ T Consensus 55 ~~al~~~~v~~cv~~Ia~~iA~l-----p~~vy~~~~~~----~~~~~~~-~~~l~~lL~~~PN~~~t~~~f~~~~~~~l 124 (424) T protein:vir:18 55 ERILQISTVWRCVSLISTLTACL-----PLDVFETDQND----NRKKVDL-SNPLARLLRYSPNQYMTAQEFREAMTMQL 124 (424) T ss_pred HHhhccHHHHHHHHHHHHhhccC-----ceEEEEeccCC----ceeeecc-ccHHHHHHhhccCCCCCHHHHHHHHHHHH Confidence 24567888999997777655422 22221111010 1111100 011222222 2233333 34455 Q ss_pred hhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccc Q lcl|NC_019530. 153 YVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQP 232 (560) Q Consensus 153 YvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~ 232 (560) +..|.-|.-++.|. ..-+++|.+|+|..|...+ . ++ .-+|.|.. T Consensus 125 ll~Gnay~~i~r~~---~G~~~~L~~l~~~~v~v~~----~---~~-------~~~y~~~~------------------- 168 (424) T protein:vir:18 125 CFYGNAYALVDRNS---AGDVISLLPLQSANMDVKL----V---GK-------KVVYRYQR------------------- 168 (424) T ss_pred hhcCCeEEEEEECC---CCcEEEEEEecCcceEEEE----c---CC-------eEEEEEEe------------------- Confidence 66787776554432 2237899999998876421 1 11 11233321 Q ss_pred cccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHH-HHHhcCccceEEEccCCCCchHHHHHHHH Q lcl|NC_019530. 233 SQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVI-YAITRAPEKRAFYLDVGTLGKKSAEEYMT 311 (560) Q Consensus 233 ~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVI-yRitRAPeRRvFyIDvGnlpk~KAeqYl~ 311 (560) .+....++.+.|.|... ...++-..+|-|..|..++.....+++...= |.=.-.| +-+...+-+.+.+..+++ ++ T Consensus 169 ~g~~~~~~~~eVihir~--~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~-~gil~~~~~~l~~e~~~~-~~ 244 (424) T protein:vir:18 169 DSEYADFSQKEIFHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKS-PQILSTGEKVLTEQQRSQ-VE 244 (424) T ss_pred CCeEEEeccccEEEecC--cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCc-ceEEEeCCcCCCHHHHHH-HH Confidence 12345688888887753 2345556678899999999887777775543 3332233 456666655566554433 34 Q ss_pred HHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccC Q lcl|NC_019530. 312 MMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGG-NQLGEMDHVNYFREKLYDALMIPKSRLQ 390 (560) Q Consensus 312 ~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~ 390 (560) +.+.++.. |. +.-+.+ .++ .|.+++.|.=. ..+.-++--++....+.++++||.+.|+ T Consensus 245 ~~~~~~~~--------~~--nag~~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg 303 (424) T protein:vir:18 245 ENFKEIAG--------GP--VKKRLW-ILE----------AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVG 303 (424) T ss_pred HHHHHHhC--------Cc--ccCCce-ecc----------CCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhC Confidence 43433321 21 111122 222 25566665321 2222234455777899999999999997 Q ss_pred CCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHH Q lcl|NC_019530. 391 EEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQEN 470 (560) Q Consensus 391 ~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~ 470 (560) ...+.+..+++..+. -+.|.+++ |+- +...+.+.|-+ .+++++++. ...+.|.-| ++... T Consensus 304 ~~~~~t~~~sn~eq~---~~~f~~~t--l~P-~~~~ie~~ln~-----~L~~~~~~~-----~~~~~fd~~----~llr~ 363 (424) T protein:vir:18 304 DVEKSTSWGSGIEQQ---NLGFLQYT--LQP-YISRWENSIQR-----WLIPSKDVG-----RLHAEHNLD----GLLRG 363 (424) T ss_pred CCCCcccccccHHHH---HHHHHHHH--HHH-HHHHHHHHHHh-----hcCCccccC-----CeEEEEech----hhhcc Confidence 655444444443333 23355443 322 11223344433 445666552 133444333 22222 Q ss_pred HHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccc---cccccccccCCCCCCc Q lcl|NC_019530. 471 AILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADE---TGNYSGSDVSPLKFKP 547 (560) Q Consensus 471 Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e---~g~~~~~~~~~~~~~p 547 (560) -...|++.+..+-. .-+++.+-+++ .+.|.+ -|..|+ +++..+-++.+..-+ T Consensus 364 -d~~~r~~~~~~~~~--~G~~T~NE~R~-~~gl~p--------------------i~ggD~~~~~~n~~~l~~~~~~~~- 418 (424) T protein:vir:18 364 -DSASRAAFMKAMGE--SGLRTINEMRR-TDNMPP--------------------LPGGDVAMRQAQYVPITDLGTNKE- 418 (424) T ss_pred -CHHHHHHHHHHHHh--CCCcCHHHHHH-HhCCCC--------------------CCCcCeeeeccCccchhhhhccCC- Confidence 12356665554421 13566665553 244421 122121 122111111111111 Q ss_pred ccccccCCc Q lcl|NC_019530. 548 ETIPFSGST 556 (560) Q Consensus 548 ~~~~~~~~~ 556 (560) |.-++. T Consensus 419 ---~~~n~a 424 (424) T protein:vir:18 419 ---PRNNGA 424 (424) T ss_pred ---ccccCC Confidence 111111 No 175 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=87.31 E-value=0.04 Score=28.27 Aligned_cols=413 Identities=13% Similarity=0.080 Sum_probs=190.3 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHH Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEY 80 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~Y 80 (560) |+. |+++.-+. ++......+....+ ++... . .++.....+-++....-++.....++.| T Consensus 1 ~~~------~i~~~~g~---~~~~~~~~~~~~~~-------ia~~~-~----~~~~~~~~~~~p~~~~il~~~~~~~~~y 59 (491) T protein:vir:79 1 MSK------GLWVSPTE---FVKFGEPDKSLSSQ-------IATRA-R----SIDFFALGMYLPNPDPVLKALGKDIRVY 59 (491) T ss_pred CCC------eeeCCCCC---cccccccchhHHHH-------Hhhhc-c----ccccccccccCcchhHHHhhccCCHHHH Confidence 554 22222222 23222221111111 11100 0 0000001111222222222222246899 Q ss_pred HHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhh Q lcl|NC_019530. 81 QSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAY 160 (560) Q Consensus 81 R~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~ 160 (560) +.|..++.|.++++.+..-+.-.+ ..+...+. ++.+.+.|.+ .++-++|+.-..+++.-. -.|--++ T Consensus 60 ~~m~~D~~i~s~l~~Rk~av~~~~-------w~i~~~~~-~~~~a~~i~e----~l~~~~~~~~i~~~lda~-~~G~s~~ 126 (491) T protein:vir:79 60 RELRADAHVGGCVRRRKAAVKALE-------WGLDRGKA-KSRVAKSIAD----VFADLDLSRIATEMLDAV-LYGYQPM 126 (491) T ss_pred HHHhhChHHHHHHHHHHHHHhCCC-------cEEecCCC-CHHHHHHHHH----HHhcCCHHHHHHHHHHhh-hhcceeE Confidence 999999999999998877554332 12221111 2223444444 444567887777776533 3677777 Q ss_pred hheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecc Q lcl|NC_019530. 161 HVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFD 240 (560) Q Consensus 161 hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~ 240 (560) +++.+.++..-.+.+|...+|+.+.+ +|...-...+... ...+..++ T Consensus 127 Ei~w~~~~g~~~~~~l~~r~~~~f~~-------------------------d~~~~l~l~~~~~--------~~~g~~lp 173 (491) T protein:vir:79 127 EITWGKVGNYIVPIDVVGKPADWFVY-------------------------DPENQLRFRSKEH--------WVQGEELP 173 (491) T ss_pred EEEEeecCCeeeEEeeeeecccceee-------------------------ccCCceEEeecCC--------CCCceeec Confidence 77776654444445666666654443 2211110001110 01122333 Q ss_pred -hhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHh-cCccceEEEccCCCCchHHHHHHHHHHHHhhh Q lcl|NC_019530. 241 -DESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAIT-RAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFK 318 (560) Q Consensus 241 -~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRit-RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~k 318 (560) ..-|+|+|.- ...+....|-|+.|..+|-=.+......+.+=-. =.|- |+...|-|+-.+.|. ..++. +.... T Consensus 174 ~~k~i~~~~~~--~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~-~igky~~~a~~~ek~-~l~~a-l~~~~ 248 (491) T protein:vir:79 174 ARKFLVPRQEA--TYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPM-LVGKHPRSASDAETN-LLLDR-LEDMV 248 (491) T ss_pred CCCeEEEEecC--CCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCe-EEEecCCCCCHHHHH-HHHHH-HHHHh Confidence 3466777742 2233445688999999888777777666665443 3444 445557776555443 22222 22222 Q ss_pred hheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch----HHHHHHHHHHHHHhcCCCccc-cCCCC Q lcl|NC_019530. 319 NRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE----MDHVNYFREKLYDALMIPKSR-LQEEG 393 (560) Q Consensus 319 nk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge----i~DV~YF~~kLy~aL~VP~sR-l~~~~ 393 (560) +. .|=| +| .|++|+.+.-+..-|. ..=++|..++.-+++- +- |.+++ T Consensus 249 ~~------a~~v--------------iP-----~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iL---GqtlTt~~ 300 (491) T protein:vir:79 249 QD------AVAV--------------IP-----DDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIALL---GQNQTTEA 300 (491) T ss_pred cC------eEEE--------------ec-----CCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHHh---hhhhccCc Confidence 21 1101 12 4688888753222222 2338899998877762 21 33333 Q ss_pred cccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHH Q lcl|NC_019530. 394 SINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAIL 473 (560) Q Consensus 394 ~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~ 473 (560) +. ++ .++++-.+ -+...++..++..+..|.++++--+.+++--. +...|.|.. ..+ +. T Consensus 301 ~g--s~-a~~~vh~~--v~~~i~~~D~~~i~~tln~li~~l~~~N~~~~---------~~p~f~~~e------~ee--~~ 358 (491) T protein:vir:79 301 TS--TR-ASAQAGLE--VTDDIRDGDKAIVVEAMNMLIRWICDLNFDGA---------ARPVFDMWE------QEQ--VD 358 (491) T ss_pred cc--ch-hhHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC---------CcceEeecC------cCc--hh Confidence 21 11 22344322 36777888888888888886666555665311 123444432 222 22 Q ss_pred HHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCccccccc Q lcl|NC_019530. 474 NDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFS 553 (560) Q Consensus 474 ~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~ 553 (560) ..+.+.++.+-+. |-=++.+|++++ +++...+.++.- .+.+... .+..+ ...+.. T Consensus 359 ~~~a~~~~~L~~~-G~~i~~~~~~e~-~Gip~~~~~e~~-----------~~~~~~~-~~~~~-----------~~~~~~ 413 (491) T protein:vir:79 359 EIQAGRDEKLTRA-GARFTPAYFKRA-YNLQDGDLDERP-----------LPVSAVD-AVGAA-----------SFAEFE 413 (491) T ss_pred HHHHHHHHHHHhC-CCccCHHHHHHH-hCCCCCCCCccc-----------cCcCccc-ccccc-----------cccccC Confidence 2334444444443 544899999865 676543322110 1111000 00000 000111 Q ss_pred CCcccCC Q lcl|NC_019530. 554 GSTDDSI 560 (560) Q Consensus 554 ~~~~~~~ 560 (560) +..++.. T Consensus 414 ~~~~~~~ 420 (491) T protein:vir:79 414 APDQDAL 420 (491) T ss_pred CCCCcch Confidence 1111111 No 176 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=86.31 E-value=0.046 Score=27.89 Aligned_cols=414 Identities=10% Similarity=0.062 Sum_probs=179.2 Q ss_pred cccchhhhccccccccCCCCCCCCceeEeccccceeccc-c---ccchhhHHH--HHHHHHHHhhccchhHHHHHhhcce Q lcl|NC_019530. 27 LAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVG-V---ESELLSVKT--VVEEYQSMAQQPEIRKAVDIIVNDV 100 (560) Q Consensus 27 ~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~-~---e~~~~~~~~--LI~~YR~ma~~PEvd~AI~eIvnea 100 (560) +.++...+. + +|...+.....+|.+ . ...+...++ -++.|+.|..++.|.++++.+..-+ T Consensus 1 v~~~~l~~e------~--------at~~~~~d~~~~~~~~l~~~~~~il~~a~~g~~~~y~~l~~D~~i~s~l~~rk~av 66 (488) T protein:vir:99 1 MEKPALGRE------I--------ATSGDGRDITRPFISGLQVPNDSILQRRGGNDLRVYEEILSDAQVKTVWGQRQLAV 66 (488) T ss_pred CCccchhHH------H--------HHHHhhhhhhccccCCCCCCChHHHHhhccCCHHHHHHHhhChHHHHHHHHHHHHH Confidence 111110000 0 010000000011111 0 011111111 1689999999999999998877655 Q ss_pred eeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeC Q lcl|NC_019530. 101 VTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLD 180 (560) Q Consensus 101 iv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LD 180 (560) .-.+ ..|.--+.++ .-+++.++....++-++|+.-..+++-- ...|--+++++...++..-.+..|.+.+ T Consensus 67 ~~~~-----w~i~p~~~~~----~~~~~ae~v~~~l~~~~~~~~l~~~lda-~~~G~s~~Ei~w~~~~g~~~~~~l~~r~ 136 (488) T protein:vir:99 67 VSRE-----WKVEAGGDRP----IDQAAAEHLEQQLQRVGWDRVTSKMLFG-VFYGYAVSELIYGRDDRYITLEAIKVRN 136 (488) T ss_pred hcCC-----ceEEcCCCCh----HHHHHHHHHHHHHhCCCHHHHHHHHHhh-hhhcceeEEEEEeecCCeeeEeeeeeec Confidence 4332 1221111111 1234555666666667888877777654 3377777888776544333444555555 Q ss_pred ccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceec--chhceeeeecccccCCCCc Q lcl|NC_019530. 181 SRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVF--DDESIVYIDSGEEPLANGI 258 (560) Q Consensus 181 Pr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i--~~daI~y~hsGl~~~~~~~ 258 (560) |+.+. |++...-...+++.. ..+..+ +..=+++.|.- ....-. T Consensus 137 ~~~f~-------------------------~d~~~~l~~~~~~~~--------~~g~~lp~~~~~i~~~~~~--~~g~p~ 181 (488) T protein:vir:99 137 RRRFR-------------------------YDQDGGLRLLTPNNM--------FEGEPCPAPYFWHFSTGAD--NDDEPY 181 (488) T ss_pred cccee-------------------------ecCCCceEEeccCCC--------CCccccccCceEEEEeecC--CCCCcc Confidence 54333 322111111111111 112223 22334445431 112334 Q ss_pred chhhhHHHHHHHHHHHHHHHHHHHHHHh-cCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccch Q lcl|NC_019530. 259 VPGLLNPAIRPLNNLVTTEDATVIYAIT-RAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLM 337 (560) Q Consensus 259 i~syL~~AiK~~NqL~m~EDalVIyRit-RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~m 337 (560) ..|-|.+|..+|--.+......+.+--. =.|-|-.-|=+.|.=+..| .+.++. +....+.- |=| T Consensus 182 g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a~~~ek-~~l~~a-v~~~~~~~------~~v------- 246 (488) T protein:vir:99 182 GLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTATPEDK-AKLLAA-LHAIQTDS------AII------- 246 (488) T ss_pred cchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCCCHHHH-HHHHHH-HHHHhcCc------EEE------- Confidence 4689999999988888877777776443 3565533332233323322 223332 22232221 111 Q ss_pred hhHhhhcccccCCCCccceeecCCCCCcch--HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHH Q lcl|NC_019530. 338 GIAEDYWLPRREGQNATEIATVGGGNQLGE--MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKF 415 (560) Q Consensus 338 smlEDywLpRReGgrgTEIsTLpGg~nLge--i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kf 415 (560) +| .|++|..+..+..-+. ..=++|..++.-+++-=- =|..+++ ++|.+..-+-+|+ +... T Consensus 247 -------iP-----~~~~ie~~ea~~~~~~~~~~li~~~d~~Isk~iLGq--tlts~~~---~Gs~a~~~vh~~v-~~d~ 308 (488) T protein:vir:99 247 -------MP-----AGMQAELLEAGRSGTADYKTLHDTMDATIAKVGLGQ--VASTQGT---PGRLGNDDLQADV-RLDL 308 (488) T ss_pred -------ec-----CCceeEEeecCCCChHHHHHHHHHHHHHHHHHHhhh--hhccccc---ccchhhHHHHHHH-HHHH Confidence 12 3688888865332222 234788888887764211 1333332 1222222234444 4555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHH Q lcl|NC_019530. 416 CAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDY 495 (560) Q Consensus 416 i~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~ 495 (560) +..-.+.++..|..-|-..|+.=|. .... -+.+.|++... |-++.+.+.+..+-+..|-=++.+| T Consensus 309 ~~aDa~~i~~tln~~li~~l~~~N~-~~~~-----~p~~~~~~~e~---------edl~~~a~~~~~l~~~~G~~i~~~~ 373 (488) T protein:vir:99 309 VKADADLICESFNLGPARWLTEWNF-PGAQ-----PPRVYRVIEEP---------EDITAKAERDEKVFRMSGFRPTRGY 373 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCc-CCcC-----CceeEecCCCc---------ccHHHHHHHHHHHHhhcCCCCCHHH Confidence 6666666666665433333333332 2211 13344444332 3344555555555555565688899 Q ss_pred HHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcc-ccccccccccCCCCCCccccc-ccCCcccCC Q lcl|NC_019530. 496 VMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQAD-ETGNYSGSDVSPLKFKPETIP-FSGSTDDSI 560 (560) Q Consensus 496 i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~-e~g~~~~~~~~~~~~~p~~~~-~~~~~~~~~ 560 (560) ++++ +++...+-.+. ..-+.|... .++.... + .....-++..- ++...++.+ T Consensus 374 i~e~-~Gip~~~~~~~----------~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~~~~~ 427 (488) T protein:vir:99 374 VQET-YGVEVESTQAE----------ATAPTPSTEFAEGDQPS-D-PAAAMAPQLAEAMQPVVGNWT 427 (488) T ss_pred HHHH-cCCCCcccccc----------cccCCCcccCCCCCCCC-C-chHHHHHHHHHHHHHHHHHHH Confidence 9866 77754321110 001111111 0000000 0 00000000000 000000000 No 177 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=86.06 E-value=0.048 Score=27.80 Aligned_cols=395 Identities=11% Similarity=0.060 Sum_probs=197.3 Q ss_pred cccchhhhccccccccCCCCCCCCceeEeccccceeccccc----cchhhHHHHHHHHHHHhhccchhHHHHHhhcceee Q lcl|NC_019530. 27 LAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVE----SELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVT 102 (560) Q Consensus 27 ~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e----~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv 102 (560) +...--.++......-. +.+ .-...+|.+-. ....-..++-..||... +=+.-||+.+++= ++ T Consensus 1 ~~~~~i~~L~~~~~~~~-~r~---------~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~--nw~~~iVds~a~r-l~ 67 (409) T protein:vir:94 1 MTEKGIGYLRFKLSVHK-RRA---------EMRYDQYAMKYVDRFKGITIPQALSQQYRSIL--GWCAKGVDSLADR-LV 67 (409) T ss_pred CCHHHHHHHHHHHHHHh-HHH---------HHHHHHhcccCchhhcChhhhHHHHHHHhhhc--chhHHHHHHhHhh-cc Confidence 11111111111110000 000 00011111110 01111233444555433 3344455544331 11 Q ss_pred ecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCcc Q lcl|NC_019530. 103 CEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSR 182 (560) Q Consensus 103 ~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr 182 (560) .+- ++..+ +.+..|....+|+.+..+.++.=++.||-|. +|-++ .+|-..++.++|+ T Consensus 68 ~~G--------f~~~d-----------~~l~~i~~~N~ld~~~~~~~~~aliyG~sf~--~v~~~--~dg~~~i~~~sp~ 124 (409) T protein:vir:94 68 FRE--------FENDD-----------FTVNEIFEENNPDIFFDSAVLSSLIASCSFT--YISKG--ENDAVRLQVIEAV 124 (409) T ss_pred cCc--------ccCCc-----------hHHHHHHHhcChhHHHHHHHHHHHHhcceeE--EEecC--CCCceEEEEeccc Confidence 111 11111 1345677788899999999999999999665 33332 2455678888887 Q ss_pred ccchhhhhcccCCcccccccc-----ccchheeecCccccccccccccccccccccccceecchhceeeeecccccCCCC Q lcl|NC_019530. 183 CIRPVYIVEKAMREGGIEAIE-----SVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLANG 257 (560) Q Consensus 183 ~i~~vr~i~~~~~~g~~~~~~-----~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~~ 257 (560) .+-.+..=.+..-.-.+.+.. ......+|.|..-+... ..++..... ++.-+ .+| .|.|++-.-.+..-| T Consensus 125 ~~~~i~D~~~~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~-~n~~g-~vP--vV~f~n~~~~~~~~G 199 (409) T protein:vir:94 125 NATGIIDPITGLLTEGYAVLERDENNNVVLEAHFLPDRTDYYY-RDSRNNISI-ANPTG-HPL--LVPIIHRPDAVRPFG 199 (409) T ss_pred eEEEEEecCCCceeeeEEEEEecCCCceEEEEEEecCcEEEEE-ecCceeEee-eCCCC-Ccc--eEEeccccccccccC Confidence 766553211000000000000 00111234443222211 111111111 11111 122 345555322222222 Q ss_pred cchhhhHHHHHH-HHH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCccc Q lcl|NC_019530. 258 IVPGLLNPAIRP-LNN-LVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAH 335 (560) Q Consensus 258 ~i~syL~~AiK~-~Nq-L~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~ 335 (560) .|-+-..+.+ .+- -+.+.++++.=...=.|.|-++=+|...-|..+-..++ +| T Consensus 200 --~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~--------~~--------------- 254 (409) T protein:vir:94 200 --RSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEPMETWKATV--------SS--------------- 254 (409) T ss_pred --ccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCcccchhhhhH--------HH--------------- Confidence 2323222221 121 25678889999999999999986654322222111111 11 Q ss_pred chhhHhhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHH Q lcl|NC_019530. 336 LMGIAEDYWLPRREGQNATEIATVGGGNQLG-EMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSK 414 (560) Q Consensus 336 ~msmlEDywLpRReGgrgTEIsTLpGg~nLg-ei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~K 414 (560) =..+|.-+.|-+.+|..++++. |+ =++-++=.-..+...-++|.+-|+..+. | -+.+..|.-.|....+ T Consensus 255 ------i~~~~~d~dg~~~~v~q~~~~~-l~~~~~~l~~~~~~~a~~t~lP~~~lg~~~~-N--psSa~Al~a~~~~L~~ 324 (409) T protein:vir:94 255 ------MLQFTKDEDGDKPTLGQFTQPS-MSPFTEQLRTAAAGFAGETGLTLDDLGFVSD-N--PSSVEAIKASHENLRL 324 (409) T ss_pred ------hhcCCCCCCCCCceEEecCCCC-hhHHHHHHHHHHHHHhhhcCCCHHHhccccC-c--hhHHHHHHHHHHHHHH Confidence 1346766777778898998844 44 3455556666777778999888876554 1 1223557777888889 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCC--hhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhh Q lcl|NC_019530. 415 FCAGLRRRYSHFFMEFLRRQLILKGVTD--EKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFS 492 (560) Q Consensus 415 fi~rLr~rfs~if~d~Lk~qLiLkgiit--~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S 492 (560) -+.+-|+.|..-+..+++.-+.+.|-.. +++|.. +.+.|. ++.-++ +-.+.++.+.+..+..-+=.+.+ T Consensus 325 ~a~~k~~~fg~~~~~~~rla~~i~~~~~~~~~~~~~-----~~v~W~-p~~~~~---~~~~a~~aDa~~Kl~~ag~~~~~ 395 (409) T protein:vir:94 325 AGRKAQRSLGAGLLNVAYLAACLRDDAPYLREQFRK-----TKPKWE-PLFEAD---ASMLSLIGDGAIKLNQAIPEFIN 395 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCCCcccccccc-----ceEEec-cCCCcc---hHHHHHHHHHHHHHHHhcccccc Confidence 9999999999999999998776655433 456632 555564 332222 33446676666666554224566 Q ss_pred HHHHHHHHhCCCHHH Q lcl|NC_019530. 493 IDYVMRNVLRMSDEE 507 (560) Q Consensus 493 ~~~i~k~IL~~tDeE 507 (560) .++++. -|++|+.| T Consensus 396 ~~~~~~-~lG~~~~d 409 (409) T protein:vir:94 396 KDTIRD-LTGIEGGE 409 (409) T ss_pred hhHHHH-HcCCCCCC Confidence 787774 49999999 No 178 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=85.30 E-value=0.054 Score=27.54 Aligned_cols=409 Identities=11% Similarity=0.095 Sum_probs=162.6 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) |+.+. ..... .+. .......+.|.....+++ |++. +-+++|.|..||. T Consensus 1 m~~~~----------~~~~~-~~~--~~~~~~~~~~~~~~~~g~----~~~~---------------~Al~~~~V~~cv~ 48 (417) T protein:vir:38 1 MKLFR----------GLATE-VDP--HWADHLLDSGVIPSFRGG----YLGI---------------SALRNSDVLTAVS 48 (417) T ss_pred Ccccc----------ccccC-CCc--cchhhhcccccccccCCc----eech---------------hhcccHHHHHHHH Confidence 22221 10000 000 000001122332222222 2211 1146788999997 Q ss_pred HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHH-hCCchhhHH----HHHHhhhhhhhhhhheecCCCC Q lcl|NC_019530. 95 IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHL-MDFDNTAYQ----KIRKWYVDGRQAYHVIVDPTNK 169 (560) Q Consensus 95 eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l-l~f~~~~~~----~~R~wYvDGr~~~hkiiD~~~~ 169 (560) -|-+.+.-. |+.+--...++..+ .....++|+. =+=...+.+ ++....+.|.-|..++.|... T Consensus 49 ~ia~~iA~l-----p~~~~~~~~~~~~~------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g- 116 (417) T protein:vir:38 49 IVSGDVSRF-----PLVITDSSTDEVID------LANIEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPIT- 116 (417) T ss_pred HHHHhhccC-----eeEEEEcCCcceec------cchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCC- Confidence 777655422 33332222222111 0112222221 122223333 355566778777766654422 Q ss_pred ccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeec Q lcl|NC_019530. 170 KGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDS 249 (560) Q Consensus 170 k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hs 249 (560) .-+.+|.+|.|..+...+ ...+ ..+|.|.+. ++ +....++.+.|.|... T Consensus 117 -~~~~~l~~l~p~~v~v~~-----~~~~--------~~~y~~~~~------~~-----------~~~~~~~~~dviH~r~ 165 (417) T protein:vir:38 117 -NEPAMFEFYAPSQTQVDT-----SDPD--------NIIYRFTPY------NS-----------SMQKVCGFEDVIHWKF 165 (417) T ss_pred -CEEEEEEEeCCceEEEEE-----cCCC--------eEEEEEEEc------CC-----------cEEEEecCcceEEecC Confidence 227889999998887532 1112 112333311 11 1123456666666552 Q ss_pred ccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCc Q lcl|NC_019530. 250 GEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGK 329 (560) Q Consensus 250 Gl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGe 329 (560) .+.+.-+.+|-|..|.+++.....++....=+=-.=+.-+-|+..+ |.|.+.++++..+.+-..|.- ...|. T Consensus 166 --~~~d~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~-~~l~~e~~~~~~~~~~~~~~g-----~n~g~ 237 (417) T protein:vir:38 166 --FSYDTIMGRSPLLSLGDEIGLQESGVSTLQKFFKSGLKGSIIKAKE-SRLSAEARQKIREDFERAQAG-----ADAGS 237 (417) T ss_pred --CCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC-CCCCHHHHHHHHHHHHHHhcc-----cccCC Confidence 3445555778999999988887777775543222223444565555 556665554433322222211 11232 Q ss_pred ccCcccchhhHhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhh Q lcl|NC_019530. 330 ITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE---MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEIT 406 (560) Q Consensus 330 v~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eIt 406 (560) .+ .+ + .|.+.+.|. .+..+ ++--+|-.+.+.++++||.+.|+..+. .+++++.. T Consensus 238 ------~~-vl-----~-----~g~~~~~l~--~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~----~s~~e~~~ 294 (417) T protein:vir:38 238 ------PI-IV-----D-----ATMDYQPLE--VDTNVLNLINSNNYSTAQIAKALRVPAYRLAQNSP----NQSVKQLA 294 (417) T ss_pred ------ce-ec-----c-----CCceEEEcc--CCHHHHHHHHHHHhhHHHHHHHhCCCHHHhCCCCc----chhHHHHH Confidence 22 12 1 245555542 22222 233344467788999999999964322 22333333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_019530. 407 QEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPF 486 (560) Q Consensus 407 RDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~ 486 (560) +. |.+. .|+-.+.. +.+.|-. .++++.++.. ..+.|-.+ ++.... +.++...++ T Consensus 295 ~~---~~~~--tl~P~~~~-ie~~l~~-----~Ll~~~~~~~-----~~~~fd~~----~l~~~~----~~~~~~~~~-- 348 (417) T protein:vir:38 295 DD---YIRN--DLPFYFEP-ITSEFEL-----KLLDDAQRHQ-----YCIGFDTK----SVNGLP----IADVNTAVN-- 348 (417) T ss_pred HH---HHHH--HHHHHHHH-HHHHHHh-----hhcChhhccc-----ceEEechh----hhhHHH----HHHHHHHHh-- Confidence 32 4432 44432222 2233333 3356665521 23334222 221111 111112221 Q ss_pred hcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccccCCcccC Q lcl|NC_019530. 487 VGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDDS 559 (560) Q Consensus 487 vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~ 559 (560) .-+++.+-+++ ++.+..-+=-..++-.-.- .-.......+ .+-.......++..-..+--...|+..+| T Consensus 349 -~G~~T~NE~R~-~~gl~pi~~g~~d~~~~~~-n~~~~d~~~~-~~~~~~~~~kgg~~~~~~~~~~~~~~~~~ 417 (417) T protein:vir:38 349 -GGLWTGNEGRA-ELGKKPLKDPNMDRIQSTL-NTVFLDQKEA-YQAEHAAELKGGDTNAKGNQNGSGTNANS 417 (417) T ss_pred -CCCcCHHHHHH-HhCCCCCCCCCCCeeeecc-cccccccccc-cccccccccCCCCCCCCCCCcCCCCcCCC Confidence 23677777763 4666432110000000000 0000000000 00000000001111111112223333344 No 179 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=83.29 E-value=0.07 Score=26.93 Aligned_cols=439 Identities=12% Similarity=0.063 Sum_probs=170.7 Q ss_pred hhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhH--HHHHHHHHHHhhcc Q lcl|NC_019530. 10 GLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSV--KTVVEEYQSMAQQP 87 (560) Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~--~~LI~~YR~ma~~P 87 (560) =||. +.+..+..=+-+...|.+... .+ .++.+. .....+|+.+..+. T Consensus 1 ~~~~----~~~~~~~~~~~~~~~~~l~~~--------------------------~i-~~li~~~~~~~~~r~~~l~~YY 49 (506) T protein:vir:94 1 MDYD----LTEHKQANLIYQESLENLTPN--------------------------KI-MKFITHHFNYQRPRLEMLDDYY 49 (506) T ss_pred CCcc----hhhhhcceeecccchhcCCHH--------------------------HH-HHHHHHHHHHHHHHHHHHHHHh Confidence 1111 111111111101100100000 00 000000 01112344444333 Q ss_pred chhHHH-----------------------HHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchh Q lcl|NC_019530. 88 EIRKAV-----------------------DIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNT 144 (560) Q Consensus 88 Evd~AI-----------------------~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~ 144 (560) +-+..+ -.||+- .+.=--..||++..++. ...+.+..+++-.+|+.. T Consensus 50 ~g~~~~i~~~~~~~~~~~~~~~ki~~n~~~~Iv~~-~~~~l~G~p~~~~~~d~---------~~~~~l~~~~~~N~~~~~ 119 (506) T protein:vir:94 50 QGYNLKILDKQSRRHEDGKADHRATHSFAKYIADF-QTSYSVGNPINVKLPDD---------GSNSGFDTFNKANDVDAE 119 (506) T ss_pred cCCCccccccccccccccCCcceeecchHHHHHHH-hhhhhcccCceeecCcc---------hHHHHHHHHHhccCHhHH Confidence 322211 111111 11111234555544432 234567777778899999 Q ss_pred hHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCC-cccccccc-----c-----cch-heeec Q lcl|NC_019530. 145 AYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMR-EGGIEAIE-----S-----VTL-KYYYN 212 (560) Q Consensus 145 ~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~-~g~~~~~~-----~-----~~e-~~~Y~ 212 (560) ..++.+.+.+.|+-|.+.-+|. +|-..+..+||+++-+|..=....+ .-.+.... + +.. ...|. T Consensus 120 ~~~~~~~~~~~G~a~~~v~~de----d~~~~i~~~~p~~~~~v~dd~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~yt 195 (506) T protein:vir:94 120 NYDLFLDMSRYGRAYEYVYRGE----DNEEHLAKLDPLDTFVIYSTDVDPKPIMAVRYHQIELVDDNQVSTINYVPETWT 195 (506) T ss_pred HHHHHHHHHhcCeEEEEEEecC----CCeeEEEEEcccceEEEecCCCCCceEEEEEEEeeeeccCCceeEEEEEEEEEe Confidence 9999999999999988766653 4667899999999988752111110 00011000 0 000 11233 Q ss_pred Cccccccccccccccccc-cccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHH--------HHHHHHHHHHH Q lcl|NC_019530. 213 PNYNRNQFTGQSGTSQNF-QPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNN--------LVTTEDATVIY 283 (560) Q Consensus 213 p~~~~~~~~g~~~~~~~~-~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~Nq--------L~m~EDalVIy 283 (560) +..- +.+.+.+...... ...-.--++|. |.|+ +.....|-++..+..... ...+++..-.+ T Consensus 196 ~~~~-~~~~~~~~~~~~~~~~~~~~g~vPv--v~~~-------n~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~ 265 (506) T protein:vir:94 196 ADTY-TLYNPTPIMGKMQVDTTKPITTFPV--VEFK-------NSNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAM 265 (506) T ss_pred CceE-EEeccccCccceeccccccCCccce--EEec-------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHH Confidence 3221 1122211111000 00000001221 1111 122223444444433322 33445544444 Q ss_pred HHhcCcc-ceEEEccCCCC-----------chHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCC Q lcl|NC_019530. 284 AITRAPE-KRAFYLDVGTL-----------GKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQ 351 (560) Q Consensus 284 RitRAPe-RRvFyIDvGnl-----------pk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGg 351 (560) ++...-. ....-.+.... ....+.+++.+ |. +++. ++++-..+. T Consensus 266 l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--~~~~---------------------~~~~~~~~~ 321 (506) T protein:vir:94 266 LIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKE-MK--DANM---------------------LLLKSGMTV 321 (506) T ss_pred HHHhcCccccccchhccccccccccccccccccchhHHHhh-hh--hcCe---------------------eeecccccc Confidence 4443311 11111111110 00111111111 00 0111 222222221 Q ss_pred Ccc----ceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 352 NAT----EIATVGGGNQLGE-MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHF 426 (560) Q Consensus 352 rgT----EIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~i 426 (560) .|+ .+..|--..++.. -.-+.-..+.+|....+|- +..+ ++. |...+..|.--+..-..-+.+.|..|..- T Consensus 322 ~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~-~~~-~n~Sg~Aik~~~~~l~~k~~~k~~~~~~~ 397 (506) T protein:vir:94 322 NGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPD--LTDE-NFA-SNSSGVAMQYKVLGTVELASTKRRMFERG 397 (506) T ss_pred cCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccc--cccc-ccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 122 2222222122221 2335556778888899994 2222 211 22223334433333446677777777777 Q ss_pred HHHHHHHHHHhcc---CCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCC Q lcl|NC_019530. 427 FMEFLRRQLILKG---VTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRM 503 (560) Q Consensus 427 f~d~Lk~qLiLkg---iit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~ 503 (560) +...++.-+-+-+ ....-+|. .|.+.|...---.+... ++++..+ +| .+|.++++.. |.. T Consensus 398 l~~~~~li~~~~~~~~~~~~~d~~-----~i~i~f~~~~p~d~~e~-------a~~~~kl---~g-~iS~et~~~~-lp~ 460 (506) T protein:vir:94 398 LYARYQIISDIENSIHGDWTFDPQ-----ELTFTFRDNLPADNISQ-------IKALVQA---GA-TLPQKYLYQQ-LPG 460 (506) T ss_pred HHHHHHHHHHHHHhcCCccccccc-----cceEEeCCCCCcCHHHH-------HHHHHHH---hc-cCChHHHHHh-CCC Confidence 7776655333222 22233342 25666644433223222 3344444 34 5999999976 666 Q ss_pred CHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 504 SDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 504 tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) +++. +++.++|++|.... .+..+..+..+ .. .+.-...+.++-.| T Consensus 461 v~d~-~~E~~ri~~E~~~~---~~~~~~~~~~~------~~--~~~~~~~~~~~~e~ 505 (506) T protein:vir:94 461 VTNP-QDIVDMMKEQSANG---DYSFDQNGVIS------ND--GQTNTTATQTDEEV 505 (506) T ss_pred CCCH-HHHHHHHHHHHHHH---hhcchhhcCCC------cc--cCccccccccccCC Confidence 6532 23445555554221 11111111111 11 11223455555556 No 180 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=81.91 E-value=0.081 Score=26.55 Aligned_cols=415 Identities=11% Similarity=0.107 Sum_probs=177.2 Q ss_pred ccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHHHhhcceeeecCCCceEEEeeeccc Q lcl|NC_019530. 39 ATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVE 118 (560) Q Consensus 39 ~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~ 118 (560) .+|+ +..-|....-+ .-.+.++ +. ...+++|-|..||.-|.+.+.-. | +.+...+ T Consensus 1 ~~~~---~~~~g~~~~~~---~~~~~~~-----~~-------~~~~~~~~V~acV~~Ia~~iA~l-----p--l~l~~~~ 55 (723) T protein:vir:94 1 MTTF---PSGAGGWNAWS---ADSVFGN-----GA-------KGWSNSAVAYRCISMLANNAASV-----D--LVVRGPD 55 (723) T ss_pred Cccc---ccCCCcccccc---ccccccc-----cH-------HHHhhhHHHHHHHHHHHHhhccc-----e--eEEEcCC Confidence 2222 12222211100 0011111 11 12367899999997666554322 2 3332222 Q ss_pred chhHHHHHHHHHHHHHHHHHhC----CchhhHHH----HHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhh Q lcl|NC_019530. 119 GISDTVKESITECFKEVMHLMD----FDNTAYQK----IRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIV 190 (560) Q Consensus 119 ~~s~sik~~I~eeF~~il~ll~----f~~~~~~~----~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i 190 (560) + +.. +...++.+|+ -..++.++ +..+...|.-|..++.+..+...-+.+|.+|+|+.+..+. T Consensus 56 ~--~~~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~-- 125 (723) T protein:vir:94 56 G--ELD------ELHPLSQLWNVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVA-- 125 (723) T ss_pred C--ccc------hhhHHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEee-- Confidence 1 111 1123444443 22334443 4455678998888877655555557899999997665431 Q ss_pred cccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeecccccCCCCcchhhhHHHHHHH Q lcl|NC_019530. 191 EKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPL 270 (560) Q Consensus 191 ~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~ 270 (560) ..+ + -.+.......|.|. ...++.+.++.+.|.|..- .-..++-+.+|-|..|.+++ T Consensus 126 -~~~--~-~~~~~~~~~~y~~~------------------~~~G~~~~~~~~dIiHir~-~~~~dg~~G~Spi~~a~~~i 182 (723) T protein:vir:94 126 -TRA--A-DAVPQAQIIGYVIE------------------RTDGVRVPVLADEMLWLRF-SDPYDPLAVMAPWKAARAAV 182 (723) T ss_pred -cCC--C-ccceeeeeeEEEEE------------------ecCceeEEecccceEEecC-CCCCCCcccccHHHHHHHHH Confidence 111 1 11111111112221 1223455677888877752 11234446778999999999 Q ss_pred HHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHh---hhcccc Q lcl|NC_019530. 271 NNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAE---DYWLPR 347 (560) Q Consensus 271 NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlE---DywLpR 347 (560) .....+++.. .++++-=-+--..|-+++|.+..+++-.+.+...|.-- ...|+ .+ ++| .-+... T Consensus 183 ~~~~aa~~~~--~~~f~NG~~p~giL~~~~l~~e~~~~~~~~~~~~~~G~----~Nagk------~~-vL~g~~~~~~vl 249 (723) T protein:vir:94 183 DADFYAATWQ--RQSFKNGARPGGVVNLGDMDEQTFTKTVAAFRSQVEGV----QNAGR------HL-LIAGQGSDGGAA 249 (723) T ss_pred HHHHHHHHHH--HHHHhcCCCcceEEEcCCCCHHHHHHHHHHHHHHhhch----hhcCc------ce-eecccccccccc Confidence 9888887754 34443322222333345676666555444443333210 11232 11 122 111111 Q ss_pred cCCCCccceeecCCCCCcchHHHH---HHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 348 REGQNATEIATVGGGNQLGEMDHV---NYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYS 424 (560) Q Consensus 348 ReGgrgTEIsTLpGg~nLgei~DV---~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs 424 (560) +.|++++.|. .+.-++.-+ +|-.+..-++.+||...|...+. .++..+- .+.|..++ |+- +. T Consensus 250 ---~~G~~~~~l~--~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~~st----~sN~e~~---~~~f~~~t--L~P-~~ 314 (723) T protein:vir:94 250 ---GKGATFTSLS--MSPAEMDYINSRMHSAEEVMLAFGIRKDALLGGST----YENQAEA---KAAVWTET--LIP-QM 314 (723) T ss_pred ---cCCceEEEcc--CCHHHHHHHHHHHHhHHHHHHHhCCChhHcCCCCC----cccHHHH---HHHHHHHH--HHH-HH Confidence 1345555553 343344333 34456689999999988854332 1222222 22354332 332 22 Q ss_pred HHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCC Q lcl|NC_019530. 425 HFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMS 504 (560) Q Consensus 425 ~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~t 504 (560) ..+.+.|-..|+ +...| .+.|+|..+..... -+..|.+.++.+-. +-++|.+-++. ++.+. T Consensus 315 ~~ie~~ln~~Ll-----~~~g~------~~~~~f~~~~lLr~-----D~~~r~~~~~~~v~--~G~~T~NE~R~-~lglp 375 (723) T protein:vir:94 315 EVMASITDLQLL-----PDIGW------TVEWDFNSVPALQE-----DLEAQAGRNQGYLV--NDVLMVDEVRA-TIGLD 375 (723) T ss_pred HHHHHHHhHhhc-----ccccC------ceEEeecchhhhhc-----CHHHHHHHHHHHHh--CCCcCHHHHHH-HhCCC Confidence 334445555443 22222 25566654322211 11234444433211 24778887774 47774 Q ss_pred HHHHHHHHHHHH-HhhhcCCC--CCCCc-cccccccccccCCCCC---CcccccccCCc----ccCC Q lcl|NC_019530. 505 DEEVKEQQTKIA-EEKKKGLY--PKVQA-DETGNYSGSDVSPLKF---KPETIPFSGST----DDSI 560 (560) Q Consensus 505 DeEI~e~~kqIe-~E~~~~~~--~~p~~-~e~g~~~~~~~~~~~~---~p~~~~~~~~~----~~~~ 560 (560) +-+ .=+.++- .-...... +.|.. .++|....-++..... +.+-+|..+++ |++- T Consensus 376 Pi~--gGd~~~~~~p~~~~~a~~~~~~p~~~e~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~ 440 (723) T protein:vir:94 376 PLP--GGIGQMTLTPYRAQFAPAPAPAPAVEEGAARMLALLERVAADRPLPELPVRATTVLHHDPGP 440 (723) T ss_pred CCC--CCcccceeccccccccCCCCCCccchhhhHhhhhhccccccccCcCCCCCCCCCCCCCCccc Confidence 321 0000000 00000000 11111 1222222222212111 11112222222 2222 No 181 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=76.95 E-value=0.13 Score=25.44 Aligned_cols=395 Identities=11% Similarity=0.046 Sum_probs=195.8 Q ss_pred cccchhhhccccccccCCCCCCCCceeEeccccceeccccc----cchhhHHHHHHHHHHHhhccchhHHHHHhhcceee Q lcl|NC_019530. 27 LAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVE----SELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVT 102 (560) Q Consensus 27 ~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e----~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv 102 (560) +.+.--+++......-.+- + .-...+|.+-. -...-..++-..||... +=+.-||+.+++ .++ T Consensus 1 ~~~~~i~~L~~~~~~~~~r-~---------~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~--nw~~~iVds~a~-rl~ 67 (409) T protein:vir:16 1 MTEKGIGYLRFKLSVHKRR-A---------EMRYEQYAMKHVDRFKGITIPQALSQQYRSIL--GWCAKGVDSLAD-RLV 67 (409) T ss_pred CCHHHHHHHHHHHHHHhHH-H---------HHHHHHHhccCchhhcchhhhHHHHHHHhhhc--ChhHHHHHHhHh-hcc Confidence 1111111111111100000 0 00011111110 00111223333344332 233334444332 111 Q ss_pred ecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCcc Q lcl|NC_019530. 103 CEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSR 182 (560) Q Consensus 103 ~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr 182 (560) .+- ++..+ +....|....+|+.+..+.++.=++.||-|. +|-+.. +|-..++.++|+ T Consensus 68 ~~G--------f~~~d-----------~~l~~i~~~N~ld~~~~~~~~~al~yG~sf~--~v~~~~--dg~~~i~~~sP~ 124 (409) T protein:vir:16 68 FRE--------FENDD-----------FTVNEIFEENNPDIFFDSTVLSALIASCSFT--YISKGE--NDAVRLQVIEAT 124 (409) T ss_pred ccc--------ccCcc-----------hHHHHHHHhcChhHHHHHHHHHHHHhCceeE--EEecCC--CCceEEEEEccc Confidence 111 11111 2345677788899999999999999999665 443432 456788899998 Q ss_pred ccchhhhhcccCCcccccccc----cc-chheeecCccccccccccccccccccccccceecchhceeeeecccccCCCC Q lcl|NC_019530. 183 CIRPVYIVEKAMREGGIEAIE----SV-TLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEPLANG 257 (560) Q Consensus 183 ~i~~vr~i~~~~~~g~~~~~~----~~-~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~~ 257 (560) .+-.+..=.+....-.+.+.. +. ..+-+|.|.......... ...... +..-+ .+| .|.|++.--++.+-| T Consensus 125 ~~~~i~D~~~~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~g-~vP--vV~f~n~~~~~~~~G 199 (409) T protein:vir:16 125 NATGIIDPITGLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDS-RNNISI-ANPTG-NPL--LVPIIHRPDAVRPFG 199 (409) T ss_pred ceEEEeecccccceeeeEEEEecCCCceEEEEEEecCcEEEEEecC-ccccce-ecCCC-Ccc--eEEecccccccccCC Confidence 776553111000000000000 01 112334443322211111 100001 11111 122 345555322222222 Q ss_pred cchhhhHHHH-HHHHH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCccc Q lcl|NC_019530. 258 IVPGLLNPAI-RPLNN-LVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAH 335 (560) Q Consensus 258 ~i~syL~~Ai-K~~Nq-L~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~ 335 (560) .|-+-..+ ...+- -+.+.++++.=...=.|.|-++=+|...-|..+ ++ +..| T Consensus 200 --~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~-----------~~------~~~~------- 253 (409) T protein:vir:16 200 --RSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEPMET-----------WK------ATVS------- 253 (409) T ss_pred --ccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCCccch-----------hh------hhhh------- Confidence 23232212 11122 256788889989999999999876643223211 11 1111 Q ss_pred chhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHH Q lcl|NC_019530. 336 LMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKF 415 (560) Q Consensus 336 ~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kf 415 (560) .=..+|.-+.|-+.+|..++++.=-+=++-++-.-..++..-++|.+-|+..+. |- +.+..|.-.|....+- T Consensus 254 -----~i~~~~~d~~g~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~-Np--sSa~Ai~a~~~~L~~k 325 (409) T protein:vir:16 254 -----SMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSD-NP--SSVEAIKASHENLRLA 325 (409) T ss_pred -----HhhccCCCCCCCCceEEecCCCChhHHHHHHHHHHHHHhhhcCCCHHHcccccC-ch--hHHHHHHHHHHHHHHH Confidence 113367667777788999988532233566666677788888999988876553 21 2245677788889999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCC--ChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhh- Q lcl|NC_019530. 416 CAGLRRRYSHFFMEFLRRQLILKGVT--DEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFS- 492 (560) Q Consensus 416 i~rLr~rfs~if~d~Lk~qLiLkgii--t~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S- 492 (560) +.+.|+.|..-+..+++.-+.+.|-. .+++|. .+.+.|. ++.. .++..+.++.+.+..+..- |..+. T Consensus 326 a~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~-----~~~v~W~-~~~~---~~~~s~a~~aDa~~Kl~~a-~~~~~~ 395 (409) T protein:vir:16 326 GRKAQRSLGAGLLNVAYLAACLRDDVPYLREQFS-----KTKPKWE-PLFE---ADASMLSLIGDGAIKLNQA-IPEFIN 395 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCccchhhc-----cceEEec-CCCC---cchhhHHHHHHHHHHHHhh-cccccc Confidence 99999999999999999888776542 234442 2455553 2211 1234456676766666553 54443 Q ss_pred HHHHHHHHhCCCHHH Q lcl|NC_019530. 493 IDYVMRNVLRMSDEE 507 (560) Q Consensus 493 ~~~i~k~IL~~tDeE 507 (560) .+++. +-|++|+.| T Consensus 396 ~~v~~-~~~g~~~~d 409 (409) T protein:vir:16 396 KDTIR-DLTGIKGAE 409 (409) T ss_pred hhHHH-HhccCCCCC Confidence 46665 558999999 No 182 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=76.44 E-value=0.14 Score=25.34 Aligned_cols=483 Identities=12% Similarity=0.102 Sum_probs=191.2 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCC--CCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhh-ccchhH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALD--DAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQ-QPEIRK 91 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~--~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~-~PEvd~ 91 (560) |--.+..++..+++... . ..-++ .++.+...- .-.++..-..++... +-.+|. .|+. .|=+.. T Consensus 1 ~~~~~~~~~~~~~~~~~--~------~~~~~~~~~~~~~~~~-~~~~~~~~~~~~p~~-~~~~L~----~~~e~~~~~~~ 66 (651) T protein:vir:99 1 MTDTTGETQETKVHVEG--L------GGEADLAKSPNSTQIP-DHRIQSHNVGVNPPY-NPDRLA----AFLELNETLAT 66 (651) T ss_pred CCCccceeeeeEEEeec--c------cccccccccccccccc-hhhhcccCCCCCCCC-CHHHHH----HHHhcChHHHH Confidence 11111111111111000 0 00000 001111100 001111112233333 455565 7777 888999 Q ss_pred HHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHH-------HHHhCCchhhHHHHHHhhh----hhhhhh Q lcl|NC_019530. 92 AVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEV-------MHLMDFDNTAYQKIRKWYV----DGRQAY 160 (560) Q Consensus 92 AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~i-------l~ll~f~~~~~~~~R~wYv----DGr~~~ 160 (560) ||+.+.+...-..-+-.| ..+++..+ .++..++++++..+.- ..-++|+....+++++-.- -|--|+ T Consensus 67 ~i~~~~~~iag~g~~~~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~i 144 (651) T protein:vir:99 67 GIRKKSRYEVGFGFDLVP-AQGVDGDD-ASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLAL 144 (651) T ss_pred HHHHHhhhhhccCceeee-cccCCCCc-cchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhh Confidence 996655543211111111 12223332 3555566665543332 2334667777777664433 366677 Q ss_pred hheecCCCCccCeeEeEEeCccccchhhhhcc------------cCCc-cc------cccccccchh-eeecCccccc-- Q lcl|NC_019530. 161 HVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEK------------AMRE-GG------IEAIESVTLK-YYYNPNYNRN-- 218 (560) Q Consensus 161 hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~------------~~~~-g~------~~~~~~~~e~-~~Y~p~~~~~-- 218 (560) +.|.+.... +..|.+++|..++..+.-.. .... .. +.+.....-| ..|.-..... T Consensus 145 eiIrn~~g~---pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~ 221 (651) T protein:vir:99 145 EMLTDIEGR---PVGLAYVPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEV 221 (651) T ss_pred hhhhcCccc---hhhhhhcChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceee Confidence 777654332 56777888876543221000 0000 00 0000000011 1111000000 Q ss_pred --cccc-------------------ccccc--ccccccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHH Q lcl|NC_019530. 219 --QFTG-------------------QSGTS--QNFQPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVT 275 (560) Q Consensus 219 --~~~g-------------------~~~~~--~~~~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m 275 (560) ...+ ..... ...........++.+.|.|..-. -..++-+.+|-|..|+..+..... T Consensus 222 ~~~~~~~~v~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~eViHir~~-~~~~g~~G~spl~~a~~~i~~a~~ 300 (651) T protein:vir:99 222 VIDESGDEPTIRYREDEESEREPIFVDRETGDVTTGDANGLENRPANELIFIPNP-SILEDDYGVPDWVSAIRTISADEA 300 (651) T ss_pred eeccCCcceeEEeccCcceeeeeecccceeeeEEEcCCCceeEecccceEEecCC-CCCCCcccccHHHHHHHHHHHHHH Confidence 0000 00000 00111223345677777776521 112444667889999999999888 Q ss_pred HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhh---h-cccccCCC Q lcl|NC_019530. 276 TEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAED---Y-WLPRREGQ 351 (560) Q Consensus 276 ~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlED---y-wLpRReGg 351 (560) +++...-|=---+--+-|+.+.-+.|-+..++. +++.+++... ..|+ .| .|+- . +++. + T Consensus 301 a~~~~~~~f~NG~~p~gil~~~~~~ls~e~~~~-lr~~~~~~~~------nagk------~~-vL~~~~~~~~~~~---~ 363 (651) T protein:vir:99 301 AKDYNRDFFDNDTIPRMVIKVTGGELSEESKRD-LRQMLNGLRE------ESHR------AV-VLEVEKFQSQLDE---D 363 (651) T ss_pred HHHHHHHHHhccCCCceEEEecCCCCCHHHHHH-HHHHHHHHhc------cCCc------eE-Eeecccccccccc---c Confidence 888766444334566777777655555554443 3444432111 1121 11 1110 0 0000 1 Q ss_pred CccceeecCCCCCcc-h---HHHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHH-HHHHHHHHHHH Q lcl|NC_019530. 352 NATEIATVGGGNQLG-E---MDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKF-CAGLRRRYSHF 426 (560) Q Consensus 352 rgTEIsTLpGg~nLg-e---i~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kf-i~rLr~rfs~i 426 (560) .|.++..|- ...- + ++=-+|....+-++.+||.+.|+..++. .++++.+..+. |.++ +.-+.+++ T Consensus 364 ~g~~~~pls--~~~~~D~qfle~r~~~~~eIa~afgVPp~~lG~~~~~--~~sn~E~~~~~---f~~~tL~P~~~~i--- 433 (651) T protein:vir:99 364 VEIELEPMG--QGISEEMDFRQFREKNEHEIAKVLEVPPVKIGVTDSA--NRSNSDQQDKD---FALEVIQPEQHTF--- 433 (651) T ss_pred CCceEEEcC--cCchhhHHHHHHHHHHHHHHHHHhCCCHHHhccCCCC--CcccHHHHHHH---HHHHHHHHHHHHH--- Confidence 133333331 1111 1 2222566777999999999999765442 34444433332 5433 33344333 Q ss_pred HHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHH Q lcl|NC_019530. 427 FMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDE 506 (560) Q Consensus 427 f~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDe 506 (560) .+.|-+.| +++.+... ...|.|+|..+. +...+ ...|.+.++.+- -+-++|.+-+++ .++|..- T Consensus 434 -e~eln~kL-----l~~~e~~~--~~~i~~ef~~~~----llr~D-~~~~~e~~~~~i--~~G~~T~NE~R~-~lglppi 497 (651) T protein:vir:99 434 -AEWLYQII-----HQQALGVT--DWTIEYELRGAD----QPKQE-AQLAEQRVRAMR--LAGVGLVDEARE-ELGLDPL 497 (651) T ss_pred -HHHHHHhh-----cCcccccc--CceEEEEeccch----hhhcc-HHHHHHHHHHHH--hCCCcCHHHHHH-HhCCCCC Confidence 33333333 33333211 123555555432 11111 123333333322 124788888874 4777431 Q ss_pred ---HHHHH----HH-----HHHHhhhcCCCCCCCccccccccccccCC-CCCCc--ccccccCCcccCC Q lcl|NC_019530. 507 ---EVKEQ----QT-----KIAEEKKKGLYPKVQADETGNYSGSDVSP-LKFKP--ETIPFSGSTDDSI 560 (560) Q Consensus 507 ---EI~e~----~k-----qIe~E~~~~~~~~p~~~e~g~~~~~~~~~-~~~~p--~~~~~~~~~~~~~ 560 (560) ..... +. ..+.-.+++.-.+|.+.+.|+.-...+.. +..++ +++|.+-|.=.|| T Consensus 498 ~~~~gd~~l~~~~~~~~g~~~~gge~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~~~~v~ss~~~~~ 566 (651) T protein:vir:99 498 GEPYGEMTLSEFEAEVAGDVAGGGETEAVHEPPEENKIGEREWDTVKSELTTKDPIEQMQFSSSNLDEG 566 (651) T ss_pred CCccccccccccccccccccccCCCCcccccCccccccccchhhhhhhhhcccchhhhhhHHHHHHHhh Confidence 11100 00 00000011111112222222221112211 11111 3444444433333 No 183 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=73.66 E-value=0.17 Score=24.84 Aligned_cols=405 Identities=13% Similarity=0.133 Sum_probs=172.6 Q ss_pred CCcc-cchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHH Q lcl|NC_019530. 1 MAGY-GRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEE 79 (560) Q Consensus 1 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~ 79 (560) |--- -||+++.+ + .+....+. ......+ .-|.... .+...+.. . +. T Consensus 8 ~~~~~~~g~~~~~---~---~~~~~~~~-------~~~~~~~------~~~~~~~-~~~~~~~~------v-~~------ 54 (424) T protein:vir:18 8 IDLRTNNGWWARL---Q---SWFVGGRL-------VTPNQGS------QTGPVSA-HGHLGDSS------I-ND------ 54 (424) T ss_pred EeecCCCchHHHH---H---hhhccccc-------ccccccc------ccccccc-cccccccc------c-cH------ Confidence 2111 24555442 1 11111111 0000011 1111111 00000000 0 11 Q ss_pred HHHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHH--HHHHHHHHHH-hCCchhhHHH----HHHh Q lcl|NC_019530. 80 YQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESI--TECFKEVMHL-MDFDNTAYQK----IRKW 152 (560) Q Consensus 80 YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I--~eeF~~il~l-l~f~~~~~~~----~R~w 152 (560) +..+++|-|..||.-|-+.+.- -|+. +-..++.. .++++ ......+|+. =+-...++++ +..+ T Consensus 55 -~~al~~~~v~~cv~~Ia~~iA~-----lp~~--~~~~~~~~--~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l 124 (424) T protein:vir:18 55 -ERILQISTVWRCVSLISTLTAC-----LPLD--VFETDQND--NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQL 124 (424) T ss_pred -HHhhccHHHHHHHHHHHHhhcc-----CceE--EEEeecCC--ceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHH Confidence 2446778899999766655432 1222 21111000 11111 1112222221 2233444444 3445 Q ss_pred hhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccc Q lcl|NC_019530. 153 YVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQP 232 (560) Q Consensus 153 YvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~ 232 (560) +..|.-|.-++.|. ..-+++|.+|+|-.|...+ . ++ ..+|.|.. T Consensus 125 ll~Gnay~~i~r~~---~G~~~~L~pl~~~~V~v~~----~---~~-------~~~y~~~~------------------- 168 (424) T protein:vir:18 125 CFYGNAYALVDRNS---AGDVISLLPLQSANMDVKL----V---GK-------KVVYRYQR------------------- 168 (424) T ss_pred hhcCCeEEEEEECC---CCcEEEEEEecCcceEEEE----c---CC-------eEEEEEEe------------------- Confidence 56687776554433 2228899999998876421 1 11 11233321 Q ss_pred cccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHH Q lcl|NC_019530. 233 SQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTM 312 (560) Q Consensus 233 ~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~ 312 (560) .+....++.+.|.|... ...++-..+|-++.|.+++.....+++...=+----+--+-|...+-+.+.+..+++. ++ T Consensus 169 ~g~~~~~~~~eIih~r~--~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~-~~ 245 (424) T protein:vir:18 169 DSEYADFSQKEIFHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV-EE 245 (424) T ss_pred CCeEEEeccccEEEecC--cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCCHHHHHHH-HH Confidence 12345688888887762 3445556678899999998887777776543322223234566666565665554432 32 Q ss_pred HHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCC Q lcl|NC_019530. 313 MMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGG-NQLGEMDHVNYFREKLYDALMIPKSRLQE 391 (560) Q Consensus 313 im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~ 391 (560) .++++. +|+ +.-+.+ .++ .|++++.|.=. ..+.-++-.+|..+.+.++++||...|+. T Consensus 246 ~~~~~~--------~g~--nag~~~-vl~----------~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~ 304 (424) T protein:vir:18 246 NFKEIA--------GGP--VKKRLW-ILE----------AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGD 304 (424) T ss_pred HHHHHh--------CCc--ccCCce-ecc----------CCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCC Confidence 222221 121 111112 222 25666666321 22222444567788999999999999975 Q ss_pred CCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHH Q lcl|NC_019530. 392 EGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENA 471 (560) Q Consensus 392 ~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~E 471 (560) ..+-+..+++..+..+ .|.+++ |+- +...+.+.|-+ +++++.++. . ..+.|..|. +.. . T Consensus 305 ~~~~t~~~sn~eq~~~---~f~~~t--l~P-~~~~ie~~l~~-----~L~~~~~~~---~--~~~~fd~~~----llr-~ 363 (424) T protein:vir:18 305 VEKSTSWGSGIEQQNL---GFLQYT--LQP-YISRWENSIQR-----WLIPAKDVG---R--IHAEHNLDG----LLR-G 363 (424) T ss_pred CCCcccccccHHHHHH---HHHHHH--HHH-HHHHHHHHHHh-----hcCCccccC---C--eEEEEechh----hhc-c Confidence 5443334444433333 365542 322 11223333333 445655542 1 334443332 211 1 Q ss_pred HHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcc---ccccccccccCCCCCCcc Q lcl|NC_019530. 472 ILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQAD---ETGNYSGSDVSPLKFKPE 548 (560) Q Consensus 472 i~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~---e~g~~~~~~~~~~~~~p~ 548 (560) -..+|++.+..+-. .-++|.+-++. .+.|.. -|..| .+++..+-+..+..-+ T Consensus 364 d~~~r~~~~~~~~~--~G~~T~NE~R~-~~gl~p--------------------i~gGD~~~~~~n~~~l~~~~~~~~-- 418 (424) T protein:vir:18 364 DSASRAAFMKAMGE--AGLRTINEMRR-TDNLPP--------------------LPGGDVAMRQSQYVPITDLGTNKE-- 418 (424) T ss_pred CHHHHHHHHHHHHh--CCCcCHHHHHH-HhCCCC--------------------CCCcCeeeeccCccchHhhhccCC-- Confidence 23345555554421 12556555553 234321 12211 1111111111000001 Q ss_pred cccccCCc Q lcl|NC_019530. 549 TIPFSGST 556 (560) Q Consensus 549 ~~~~~~~~ 556 (560) |.-.|. T Consensus 419 --p~~~ga 424 (424) T protein:vir:18 419 --PRNNGA 424 (424) T ss_pred --CccCCC Confidence 111111 No 184 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=71.95 E-value=0.19 Score=24.55 Aligned_cols=415 Identities=13% Similarity=0.136 Sum_probs=166.3 Q ss_pred CCcccc-----hhhhhhhhhhhhhhhhh---hcccccchhhhccccccccCCCCCCCCceeEe--ccccceeccccccch Q lcl|NC_019530. 1 MAGYGR-----GFFGLFGGGGLVNAKVD---TDKLAQKQDELLLTKATVVALDDAQDGSIILQ--GGANTYNYVGVESEL 70 (560) Q Consensus 1 ~~~~~~-----~~~~~~~~~~~~~~~~~---~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~--~~~~~~~y~~~e~~~ 70 (560) |--|.. -|+|+ +...+ ...+..+.+. -+...+. .+....+. .+....+...+. T Consensus 1 ~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~f~~~e~------r~~~~~~-~~~~~~~~~~~~~~~~~~~~~~--- 63 (441) T protein:vir:98 1 MHWYNTDCYFVDFKSR-------KQSRKELVVVGIFYKNEK------RDLQYNE-DDLQMMVQTLPGFQGTKLRQYK--- 63 (441) T ss_pred CceecCccceeccccc-------cchhhhhhcccccccccc------ccccCCC-cchHHHHHHhhcccccCccccc--- Confidence 222211 01100 00000 0000000010 0111111 11000000 000000000000 Q ss_pred hhHHHHHHHHHHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCc----hh-- Q lcl|NC_019530. 71 LSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFD----NT-- 144 (560) Q Consensus 71 ~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~----~~-- 144 (560) . ..-+++|-|..||.-|.+.+.-. |+.+-=+.... .-+.++.+|.-. -. T Consensus 64 -~--------~~al~~~~V~acv~~Ia~~iA~l-----pl~~~~~~~~~-----------~~~~~~~lL~~~PN~~~t~~ 118 (441) T protein:vir:98 64 -D--------IEAIRHSDIFTAVMMIASDLARM-----PIRVTVNGQIN-----------YSDRIVNLLNTRPNPMYNGY 118 (441) T ss_pred -h--------hhhhccHHHHHHHHHHHHhhccC-----ceEEecCCccc-----------ccchHHHHHhcccccCCCHH Confidence 0 12356788999997777665532 23321111100 012233443321 12 Q ss_pred --hHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccc Q lcl|NC_019530. 145 --AYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTG 222 (560) Q Consensus 145 --~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g 222 (560) -+.++..+...|.-|..++.|.. .-+++|.+|+|..|...+ +.+|. .+|.|.. .++ T Consensus 119 ~f~~~l~~~lll~Gnay~~i~r~~~---G~~~~L~~i~~~~v~v~~-----~~~g~--------~~~~~~~------~~~ 176 (441) T protein:vir:98 119 IFKLVVFVSALLTSHGYIEITRDKT---GEPMNLTFRKTSEIELKL-----DARGR--------LYYFHQR------IDS 176 (441) T ss_pred HHHHHHHHHHhhcCCeEEEEEEcCC---CcEEEEEEEcCceeEEEE-----CCCCc--------EEEEEEE------ecc Confidence 34455666778998887776542 237899999998887642 22231 1222210 000 Q ss_pred cccccccccccccceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCc Q lcl|NC_019530. 223 QSGTSQNFQPSQQELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLG 302 (560) Q Consensus 223 ~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlp 302 (560) . ..+....++.+.|.|... .+.++=+.+|-|+.|.+++.....+++...=+=---+--+-|..++ |.+. T Consensus 177 ~--------~~~~~~~~~~~dviHir~--~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~-~~~~ 245 (441) T protein:vir:98 177 N--------GNNIERNVKFEDMLDIKF--YSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLD 245 (441) T ss_pred C--------cceeeEEEccccEEEecc--CCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC-CCCC Confidence 0 112234577777777653 2344445568888888888877777776542212223345555665 4443 Q ss_pred hHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHh Q lcl|NC_019530. 303 KKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGG-NQLGEMDHVNYFREKLYDA 381 (560) Q Consensus 303 k~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~a 381 (560) ..+|.+=+++ .++... +| ..+.-+.+ .++ .|.+++.|.-. ..+.-++--+|..+.+.++ T Consensus 246 ~~e~~~~~~~---~~~~~~-----~G-~~nag~~~-vl~----------~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~ 305 (441) T protein:vir:98 246 NKKARDRARE---EFHKSF-----SG-TKQAGKVV-VLD----------ESMTFDQLEVDTEVLKLIRENKSSTREIAGV 305 (441) T ss_pred CHHHHHHHHH---HHHHHh-----cC-ccccCcce-ecC----------CCceEEEccCChhHHHHHHHHHHhHHHHHHH Confidence 3444332322 222211 12 22222232 332 24566666321 1222234456677889999 Q ss_pred cCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeec Q lcl|NC_019530. 382 LMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSD 461 (560) Q Consensus 382 L~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~D 461 (560) ++||...|+.+.+ .++.++.. +-| +..|+--+. .+.+.|-..| .++. . .. .+.|.. T Consensus 306 fgVPp~~lg~~~~----~~s~~q~~---~~y---~~tl~P~~~-~ie~~ln~~L-----~~~~--~---~~--~~~fd~- 361 (441) T protein:vir:98 306 FGIPLHKFGIETA----NMSITDAN---LDY---LSTLKPYIT-CVCAELNFKF-----NDEY--V---NR--EFKFDT- 361 (441) T ss_pred hCCCHHHcCCCCC----CccHHHHH---HHH---HHHHHHHHH-HHHHHHHhhc-----cccc--c---Cc--eEEEec- Confidence 9999999964332 11122322 223 345553222 2333333333 2221 1 12 233322 Q ss_pred chHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcc---cccccccc Q lcl|NC_019530. 462 SYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQAD---ETGNYSGS 538 (560) Q Consensus 462 n~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~---e~g~~~~~ 538 (560) .+|...+ +..|++.++.+-.- -++|.+-+++ ++.+.+- +.++.+ .+++..+- T Consensus 362 ---~~llr~d-~~~~~~~~~~~~~~--G~~T~NE~R~-~~gl~pi------------------~gGd~~~~~~~~n~~~~ 416 (441) T protein:vir:98 362 ---TEIRVVD-EKTQAEIDKINIDS--GKMNIDEIRQ-RDGLAPI------------------PGGNGSIHRVDLNHVNI 416 (441) T ss_pred ---hhhhccC-HHHHHHHHHHHHhC--CCcCHHHHHH-HhCCCCC------------------CCCCcceEeeccccccc Confidence 2222211 23455555443222 2566666663 3444221 111111 11222111 Q ss_pred ccCCC--CCCccc--ccccCCcccC Q lcl|NC_019530. 539 DVSPL--KFKPET--IPFSGSTDDS 559 (560) Q Consensus 539 ~~~~~--~~~p~~--~~~~~~~~~~ 559 (560) ++.+. .-+++. -...||.++- T Consensus 417 ~~~~~~q~~~~~~~~~~~kgGe~ne 441 (441) T protein:vir:98 417 ELVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred ccccccccccccccccccCCCCCCC Confidence 11110 000111 1122333333 No 185 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=64.40 E-value=0.3 Score=23.45 Aligned_cols=440 Identities=12% Similarity=0.050 Sum_probs=181.1 Q ss_pred ccccCCCCCCCC-ceeEeccccc---eecccccc--chhhHHHHHHHHHHHh-hccchhHHHHHhhcceeeecCCCceEE Q lcl|NC_019530. 39 ATVVALDDAQDG-SIILQGGANT---YNYVGVES--ELLSVKTVVEEYQSMA-QQPEIRKAVDIIVNDVVTCEEDETPVT 111 (560) Q Consensus 39 ~~s~~~~~~~dG-~~~i~~~~~~---~~y~~~e~--~~~~~~~LI~~YR~ma-~~PEvd~AI~eIvneaiv~d~~~~~V~ 111 (560) -++.+.|+..-. .-++.+.+-. ..|...+. .++ ..+.++.|+.|. .++.|.++++.+..-+.-.+ -. T Consensus 1 ~~~~~~~~~p~~~~g~~~~~~~~~~~~~~~~~e~~~~lr-~~~~~~ly~~m~e~D~~i~s~l~~rk~av~~~~-----w~ 74 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFGSGVVDGWTVWDPFEQTPELQ-WPQSVAVYSRMDNEDSRVTSLLEAISLPIRSTP-----WR 74 (469) T ss_pred CCCcccCCCCccchhhhhhcccccchhhccccccccccc-cccchHHHHHHHhhChHHHHHHHHHHHHHhcCC-----ce Confidence 222222222211 1111111111 22222222 122 235688999996 69999999988876543222 11 Q ss_pred EeeecccchhHHHHHHHHHHHHHHHHHh-------------CCchhhHHHHHHhhhhhhhhhhheecCCCC-ccCeeEeE Q lcl|NC_019530. 112 VNLDKVEGISDTVKESITECFKEVMHLM-------------DFDNTAYQKIRKWYVDGRQAYHVIVDPTNK-KGGIKKLV 177 (560) Q Consensus 112 l~Ld~~~~~s~sik~~I~eeF~~il~ll-------------~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~-k~GI~eLr 177 (560) | .--+ -++.+.+.+.+.....+... .|.....+++-..+-.|--++++|...... ..|-..+. T Consensus 75 v--~p~~-~~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~dG~~~~~ 151 (469) T protein:vir:10 75 I--RANG-ASDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPDGRFWLR 151 (469) T ss_pred E--ecCC-CCHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCCCceeee Confidence 1 1111 24556777777665544321 234445555555566787888888765431 23433333 Q ss_pred EeCccccchhhhhcccCCccccccccccchheeecCccccccccccccc----cccccccccceecchh-ceeeeecccc Q lcl|NC_019530. 178 MLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGT----SQNFQPSQQELVFDDE-SIVYIDSGEE 252 (560) Q Consensus 178 ~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~----~~~~~~~~~~~~i~~d-aI~y~hsGl~ 252 (560) .|-||.=+.+. +|.|++....-...+.... ............+|+. -|+|.|.. T Consensus 152 ~l~~rp~~~i~-------------------~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lp~~k~i~~~~~~-- 210 (469) T protein:vir:10 152 KLAPRPQWTIS-------------------KFNVAPDGGLESIEQIAPPARTRGSLYVANIAPPEIPVNRLVVYTRNK-- 210 (469) T ss_pred eeeecCcccce-------------------eeeeccCCceeeeeecCcccccccccccCCCCccccccCcEEEEEecC-- Confidence 33332211110 1222221111111110000 0000011223344444 47888853 Q ss_pred cCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHh-cCccceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCccc Q lcl|NC_019530. 253 PLANGIVPGLLNPAIRPLNNLVTTEDATVIYAIT-RAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKIT 331 (560) Q Consensus 253 ~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRit-RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~ 331 (560) ...+-.-.|-|.+|..+|-=.+......+.+=-. =.|-| |...+.|.-...| +-|.+++..+++- ...|=| T Consensus 211 ~~g~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~-vgky~~~a~~~ek--~~l~~a~~~~~~g----~~a~~i- 282 (469) T protein:vir:10 211 RPGQWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIP-VGTASSATDEDEV--RKMAALARSVRGG----INAGVG- 282 (469) T ss_pred CCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcce-EEecCCCCCHHHH--HHHHHHHHHHhcC----CceEEE- Confidence 1222334578999988877666666555554332 24554 5566666544433 3333444444331 111111 Q ss_pred CcccchhhHhhhcccccCCCCccceeecCCCCCc-chHHHHHHHHHHHHHhcCCCccccCCCCcccccccc-hhhhhHHH Q lcl|NC_019530. 332 GNAHLMGIAEDYWLPRREGQNATEIATVGGGNQL-GEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSN-LAEITQEE 409 (560) Q Consensus 332 d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nL-gei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~-~~eItRDE 409 (560) +| .|++|..+..+.+. .-..=++|..+++-+++--.. |..+++ |+|. ++++-. | T Consensus 283 -------------ip-----~~~~ie~~ea~g~~~~~~~li~~~d~~Isk~iLG~t--lTs~~~---gGS~a~~~vh~-e 338 (469) T protein:vir:10 283 -------------LA-----QGQILELLGVSGNLPDIRRAIEGHDRSIALSGLAHF--LNLDGK---GGSYALASVLE-D 338 (469) T ss_pred -------------cc-----CCceEEEeecCCCchHHHHHHHHHHHHHHHHHhccc--ccccCc---cchhhHHHHHH-H Confidence 11 46788877653222 222336788888777664332 223321 1222 233322 2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcc Q lcl|NC_019530. 410 LRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGS 489 (560) Q Consensus 410 lkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGk 489 (560) -|...++...+.++..+..-|-..|+.=|. -++ ...-+|.|..- | .+.+...+++..|..+==.+-. T Consensus 339 -v~~d~~~sDa~~i~~tln~~li~~l~~lN~-g~~------~~~P~~~~~~~----e-~~~~~~a~~i~~l~~~G~~~~~ 405 (469) T protein:vir:10 339 -PFTQAVHAYATSICRIANQHIIEDLVDINF-GVD------TPAPVLTFDPI----G-SRQDLTAAAVKLLYDAGVFDDD 405 (469) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CCC------CCccEEEecCC----C-CcHHHHHHHHHHHHhcCCccCc Confidence 255566666666677776544444443332 111 11234444211 1 1233344444444432111001 Q ss_pred hhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccccCCcccC Q lcl|NC_019530. 490 IFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDDS 559 (560) Q Consensus 490 y~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~ 559 (560) -++.+|+++. +++...+-.+. ..+.+.++. .|.+...+.........-.-.|..-+-++--+|. T Consensus 406 ~~~~~~~~e~-~gip~~~~~~~--~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~da 469 (469) T protein:vir:10 406 PAVKRAIRQR-FNLPSELNDTP--SAEPEEPAA---VPNQSAAPARTRSSGNADARARAPKADQGVLFDA 469 (469) T ss_pred cccHHHHHHH-hCCCCCCCCcc--cccchhccc---CCCCCccccccCCCCCcccccccCCChHHhhccC Confidence 3677888855 67642222111 111111110 0100000000000000000011111111111222 No 186 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=61.39 E-value=0.35 Score=23.05 Aligned_cols=246 Identities=14% Similarity=0.085 Sum_probs=111.0 Q ss_pred hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_019530. 15 GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD 94 (560) Q Consensus 15 ~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~ 94 (560) |+.|+.+ ++ +.... ++...........+........+ + -..-.++|-|..||+ T Consensus 1 MglF~~~-----------~~---r~~~~-~~~~~~~~~~~~~~~~~~~~~~v-----~-------~~~al~~~~v~~~i~ 53 (251) T protein:vir:46 1 MGIFYKN-----------EK---RDLQY-NEDDLQMMVQTLPSFQGTKLRQY-----K-------DIEAIRHSDIFTAVM 53 (251) T ss_pred CCccccc-----------cc---cccCC-CccchhhhhhhhccccCcCccee-----c-------hhhhhccHHHHHHHH Confidence 2222110 00 00000 01000000000001100010101 0 112346788999997 Q ss_pred HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHH----HHHhhhhhhhhhhheecCCCCc Q lcl|NC_019530. 95 IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQK----IRKWYVDGRQAYHVIVDPTNKK 170 (560) Q Consensus 95 eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~----~R~wYvDGr~~~hkiiD~~~~k 170 (560) -|.+.+.-. |+.+.=+........+...+. .--+-...++++ +..++..|.-|.-++.|.. T Consensus 54 ~ia~~iA~l-----p~~~~~~~~~~~~~~~~~ll~-------~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~--- 118 (251) T protein:vir:46 54 MIASDLARM-----PIRVTVNGQINYSDRIVNLLN-------TRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT--- 118 (251) T ss_pred HHHHhHhhC-----ceEEeeCccccccchHHHHHh-------ccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC--- Confidence 666654322 222211111110111111111 112233334444 4455667888887776553 Q ss_pred cCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeecc Q lcl|NC_019530. 171 GGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSG 250 (560) Q Consensus 171 ~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsG 250 (560) .-+.+|.+|+|.+|..++ +.+|. .++.|... . ....+....++.+.|.|... T Consensus 119 G~~~~L~~i~~~~v~v~~-----~~~g~--------~~~~~~~~------~--------~~~~g~~~~~~~~diiH~r~- 170 (251) T protein:vir:46 119 GEPMNLTFRKTSEIELKS-----DARGR--------LYYFHQRI------D--------SNGNNIERNVKFEDMLDIKF- 170 (251) T ss_pred CcEEEEEEECCceEEEEE-----CCCCc--------EEEEEEEe------c--------cCCcceeEEECCccEEEecC- Confidence 348999999998887642 11231 12222110 0 01123456788888888873 Q ss_pred cccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHH-HHHHHHHHHhhhhheeecccCCc Q lcl|NC_019530. 251 EEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSA-EEYMTMMMGKFKNRNAYDRTTGK 329 (560) Q Consensus 251 l~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KA-eqYl~~im~k~knk~vYd~~TGe 329 (560) .+.++-+..|-|+.|..++......++...-+=---+--+-+..++ |.|...+| ++..+.....|.-- ...|+ T Consensus 171 -~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~~e~~~~~~~~~~~~~~g~----~n~g~ 244 (251) T protein:vir:46 171 -YSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFPKVLVEL----NKLGK 244 (251) T ss_pred -cCCCCeeecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeC-CCCCCHHHHHHHHHHHHHHhcCc----ccccc Confidence 3556556778899999999888888876554333334446677777 56644444 44444444444321 12233 Q ss_pred ccCcccchhhHh Q lcl|NC_019530. 330 ITGNAHLMGIAE 341 (560) Q Consensus 330 v~d~~~~msmlE 341 (560) |-. .|=| T Consensus 245 ~~~-----gm~~ 251 (251) T protein:vir:46 245 LSY-----SMNQ 251 (251) T ss_pred ccc-----ccCC Confidence 211 1112 No 187 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=54.16 E-value=0.51 Score=22.19 Aligned_cols=408 Identities=12% Similarity=0.034 Sum_probs=176.8 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHH Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEY 80 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~Y 80 (560) .--+|.|--.+| +.+... +. .++.+.|...+ .....+....+.++. - T Consensus 8 ~~~~~~~~~~~~------~~lf~~-----~~-------~~~~~~~~~~~-~~~~~~~~~~~~~vs--------------~ 54 (424) T protein:vir:45 8 HWLWPEGGRVLL------DALFRS-----KS-------LENPSTPITGD-AVDTDGLFRADVYVS--------------P 54 (424) T ss_pred ceecCcchhHHH------Hhhccc-----cC-------CCCCccccchh-hhhhhccccCCceec--------------h Confidence 111222211111 111110 00 00111111111 000000000111111 1 Q ss_pred HHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHH-hCCchhhHH----HHHHhhhh Q lcl|NC_019530. 81 QSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHL-MDFDNTAYQ----KIRKWYVD 155 (560) Q Consensus 81 R~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~l-l~f~~~~~~----~~R~wYvD 155 (560) +..+++|-|..||.-|.+.+.-. |+.|-=+... ..++......-++|+. =+-...+++ ++..++.. T Consensus 55 ~~al~~~~v~~cv~~Ia~~iA~l-----p~~v~~~~~~----~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~ 125 (424) T protein:vir:45 55 ETAMKLAAVYSCIYVLSSSLAQM-----PLHVMRRHKG----KVEPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGW 125 (424) T ss_pred HHhhccHHHHHHHHHHHHHHhhC-----ceEEEEecCC----ceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhc Confidence 23456888999997777765432 3333211111 1111111122222221 111223333 44566777 Q ss_pred hhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCcccccccccccccccccccccc Q lcl|NC_019530. 156 GRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQ 235 (560) Q Consensus 156 Gr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~ 235 (560) |.-|..++.|. ..-+++|.+|+|.++...+ .++ +..|.|.. ... T Consensus 126 Gna~~~i~r~~---~G~~~~L~~l~~~~v~i~~-------~~~-------~~~y~~~~-------------------~~~ 169 (424) T protein:vir:45 126 GNGYTWVKRNR---RGEVISLDCCMPWETTLMN-------TGG-------RYTYGLYN-------------------EYG 169 (424) T ss_pred CCeEEEEEEcC---CCcEEEEEEecCceEEEEE-------cCC-------eEEEEEEe-------------------cCc Confidence 88887666544 3447899999998876532 111 11122211 011 Q ss_pred ceecchhceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHH Q lcl|NC_019530. 236 ELVFDDESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMG 315 (560) Q Consensus 236 ~~~i~~daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~ 315 (560) ..+++.+.|.|... .+.++-+..|.++.|...+.....+++...=+----|--+-|+.++- .|.+.++++.-+.+-. T Consensus 170 ~~~~~~~eVih~r~--~~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~ 246 (424) T protein:vir:45 170 AFAISPDDMIHIRA--LGNNQKMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKS-GLNKESWGWLKDQWQK 246 (424) T ss_pred eEEECcccEEEecC--cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC-CCCHHHHHHHHHHHHH Confidence 33577777777752 35566677899999999999988888876644444455567777774 4665555443333322 Q ss_pred hhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCc Q lcl|NC_019530. 316 KFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGG-NQLGEMDHVNYFREKLYDALMIPKSRLQEEGS 394 (560) Q Consensus 316 k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~ 394 (560) .|+- .. ...|. .+ .++ .|.++..|.-- ..+.-++-.++-.+.+.++++||...|+..++ T Consensus 247 ~~~g--~~-~n~g~------~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~ 306 (424) T protein:vir:45 247 ASQA--LR-RQENK------TM-LLP----------ADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEK 306 (424) T ss_pred Hhcc--cc-ccCCc------ee-EcC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC Confidence 2221 00 01122 11 121 24555555321 12222455568888999999999999975443 Q ss_pred ccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHH Q lcl|NC_019530. 395 INIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILN 474 (560) Q Consensus 395 ~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~ 474 (560) .+ .+++.+..+. |.++ .|+--+ ..+.+.|-. +++++.++.. -..+.|..+. +... -+. T Consensus 307 ~t--~sn~eq~~~~---f~~~--tL~P~~-~~ie~~ln~-----kLl~~~e~~~----g~~i~fd~~~----llr~-d~~ 364 (424) T protein:vir:45 307 AT--FSNISAQAIQ---FVRY--TMMPWV-TNWEQELNR-----RLFTRAELAA----GYYVRFNLTG----LLRG-TPQ 364 (424) T ss_pred CC--cccHHHHHHH---HHHH--HHHHHH-HHHHHHHHH-----hcCChhhhcC----CcEEEeechh----hhcc-CHH Confidence 22 2344444443 5444 232211 112222322 3456555431 2344454332 2111 133 Q ss_pred HHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccC--CCCCCcccc-c Q lcl|NC_019530. 475 DRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVS--PLKFKPETI-P 551 (560) Q Consensus 475 ~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~--~~~~~p~~~-~ 551 (560) .|.+.++.+-.- -++|.+-+++ ++.+. |-|. ||......+ +..=.++.- + T Consensus 365 ~r~~~~~~~~~~--g~~T~NE~R~-~~gl~--------------------pi~g----gD~~~~~~n~~~~~~~~~~~~~ 417 (424) T protein:vir:45 365 ERAQFYHFAITD--GWMSRNEARA-FEDMN--------------------PVEG----LDEMLVSVNAANPAGDFKPPKN 417 (424) T ss_pred HHHHHHHHHHhC--CCcCHHHHHH-HhCCC--------------------CCCC----cceeeecccccccccccCCCCC Confidence 555555543321 3556555553 23332 1121 222110000 000011111 1 Q ss_pred ccCCccc Q lcl|NC_019530. 552 FSGSTDD 558 (560) Q Consensus 552 ~~~~~~~ 558 (560) --|.+++ T Consensus 418 ~~~~~~~ 424 (424) T protein:vir:45 418 DEGKTNE 424 (424) T ss_pred CCCCCCC Confidence 2233444 No 188 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=44.04 E-value=0.82 Score=21.05 Aligned_cols=455 Identities=13% Similarity=0.084 Sum_probs=173.3 Q ss_pred cchhhhccccccccCCCCCCCCce-eEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHHHhhcceeeecCCC Q lcl|NC_019530. 29 QKQDELLLTKATVVALDDAQDGSI-ILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVVTCEEDE 107 (560) Q Consensus 29 ~~~~e~~~~k~~s~~~~~~~dG~~-~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneaiv~d~~~ 107 (560) ..+-|... |-.+|...-.-. ..+.+.+...|......++ ..++++.|+.|..++.|.++++.+..-+.-++= T Consensus 1 ~~~~~~~~----~gl~p~rl~~i~~~~~~~~~~~~~~~~~~~Lr-~~~~~~ly~~m~~D~hi~s~l~~Rk~av~~~~w-- 73 (488) T protein:vir:95 1 MADITETQ----ESLPPFRMGEVGSLGLKVKNGRIYEEPRQALR-FPESIKTFQLMMRDPAVAASVNIIKMFVRKVNW-- 73 (488) T ss_pred CCCccccC----CCCCHHHHHHHHHHhhccccchhhccchhhhc-ccchHHHHHHHhhChHHHHHHHHHHHHHhcCCc-- Confidence 11111111 111221110000 0111112222222333333 345788999999999999999888776553321 Q ss_pred ceEEEeeecccchhHHHHH-HHHHHHHHHHHHhC--CchhhHHHHHHhhhhhhhhhhheecCCCCccCeeEeEEeCcccc Q lcl|NC_019530. 108 TPVTVNLDKVEGISDTVKE-SITECFKEVMHLMD--FDNTAYQKIRKWYVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCI 184 (560) Q Consensus 108 ~~V~l~Ld~~~~~s~sik~-~I~eeF~~il~ll~--f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i 184 (560) .+.--++-++..|. .+.+.++++++-+. |..-..+++ ..+-.|--+++++.+......|+..+.+-|.+-. T Consensus 74 -----~v~p~~~~~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~~~ 147 (488) T protein:vir:95 74 -----RFVPPKGKEQDPKMLERADFFNSLMDDMEHDWADFINSVM-SFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGLIG 147 (488) T ss_pred -----eEecCCCCchhHHHHHHHHHHHHHHhccCccHHHHHHHHH-HhhcccceeeeeeeeccccccccccccccCCeee Confidence 12111111333332 34677777777665 334444443 2344566666677665443333333222232111 Q ss_pred chhhhhcccCCccccccccccchheeecCcccccccccccc-------ccccccccccceecchh-ceeeeecccccCCC Q lcl|NC_019530. 185 RPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSG-------TSQNFQPSQQELVFDDE-SIVYIDSGEEPLAN 256 (560) Q Consensus 185 ~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~-------~~~~~~~~~~~~~i~~d-aI~y~hsGl~~~~~ 256 (560) .. ..+.. .+..-.+|.|++...---...+.. ....+.-....+.||+. =|+|.|.- ...+ T Consensus 148 ~~-~i~~R---------pq~~~~~f~~d~d~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~kfi~~~~~~--~~g~ 215 (488) T protein:vir:95 148 WA-KLPIR---------NQSTLDKWYFDEDFRRVTGVRQNLRNVSHIAGAINLGERPLTRKLPRAKFMLFKYDD--EYGN 215 (488) T ss_pred ee-eeeec---------CcccccceeeccCCCceeecccccccccccccccccccccccccccccceEEEeecC--CCCc Confidence 00 00000 001112344543221100000000 00000001123445444 47777742 2223 Q ss_pred CcchhhhHHHHHHHHHHHHHHHHHHHHHHh-cCccceEEE----ccCC-CCchHHHHHHHHHHHHhhhhheeecccCCcc Q lcl|NC_019530. 257 GIVPGLLNPAIRPLNNLVTTEDATVIYAIT-RAPEKRAFY----LDVG-TLGKKSAEEYMTMMMGKFKNRNAYDRTTGKI 330 (560) Q Consensus 257 ~~i~syL~~AiK~~NqL~m~EDalVIyRit-RAPeRRvFy----IDvG-nlpk~KAeqYl~~im~k~knk~vYd~~TGev 330 (560) -.-.|-|.+|..+|-=.+...-..+.+=-- =.|-.-..| .+.- .--+.+..+.++.|....+.. ...|-| T Consensus 216 p~g~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~~~~~~~----~~ag~i 291 (488) T protein:vir:95 216 PEGRSPLLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVVNDMIAN----DRAGLI 291 (488) T ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHHHHhhcc----chhhee Confidence 334588999988886555554444433110 022222222 1111 111222334444444443332 223322 Q ss_pred cCcccchhhHhhhcccccCCCCccceeec--CCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccch-hhhhH Q lcl|NC_019530. 331 TGNAHLMGIAEDYWLPRREGQNATEIATV--GGGNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNL-AEITQ 407 (560) Q Consensus 331 ~d~~~~msmlEDywLpRReGgrgTEIsTL--pGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~-~eItR 407 (560) -..---....+++ .|+..+ .|++...-..=++|..+..-+++--..--++.+++ ++.+ +++ - T Consensus 292 iP~g~~~~~k~~~----------~e~~l~~~~~~~~~~~~~li~~~d~~Isk~iLGqtLT~~~~~~----Gs~Al~~v-h 356 (488) T protein:vir:95 292 WPRYIDPDTKEDI----------FEFSLVSRQGAKAYDTGSIIDRYSKQIMMAFMSDVLAMGQSKY----GSFSLADS-K 356 (488) T ss_pred eccccccccchhh----------hhhhccccccCCchhHHHHHHHHHHHHHHHHhccccccccCcc----hhhhHHHH-H Confidence 1000000000111 122222 23333334455899999988887433211222211 2222 333 2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_019530. 408 EELRFSKFCAGLRRRYSHFFME-FLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPF 486 (560) Q Consensus 408 DElkF~Kfi~rLr~rfs~if~d-~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~ 486 (560) .|+ |...++...+.++..|.. +++.-+.++.- ..+. -+.+.|+.... .+.+-+.+++..|.. - T Consensus 357 ~ev-~~~i~~aDa~~i~~tln~~li~~l~~~Nfg-~~~~-----~P~~~~~~~e~------~Dl~~~ae~~~~L~~---~ 420 (488) T protein:vir:95 357 TSL-LAMSVDILLKQIKNVINRDLVAQTYALNMW-DDEE-----HVQITYDDIET------PDLEAIGSYIQKTVA---V 420 (488) T ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-CCCC-----ccEEEecCcCh------hhHHHHHHHHHHHHh---C Confidence 233 445566666666666654 44444444421 1111 12233332222 333334444444443 2 Q ss_pred hcchhh----HHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccccC-CcccCC Q lcl|NC_019530. 487 VGSIFS----IDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSG-STDDSI 560 (560) Q Consensus 487 vGky~S----~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~-~~~~~~ 560 (560) |-.++ .+|++++ +++...+-.+ .+ ..+ ..|.++.....+ .+..++-.+.+= .-|.++ T Consensus 421 -G~~i~~~~~~~~i~e~-~gip~~~~~e---~~----~~~--~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~ 482 (488) T protein:vir:95 421 -GALEVDKELSNKLREH-IGLPPADESQ---PV----SEK--LSPNSQSRSGDG------YKTAGEGTAKTPSAKDPST 482 (488) T ss_pred -CCccccHHHHHHHHHH-hCCCCCCCCc---cc----ccc--CCCCCCCCCCcc------cCCCcccCCcccccccchh Confidence 54454 5778755 6664321110 00 001 011111000000 000011010000 111222 No 189 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=41.24 E-value=0.94 Score=20.74 Aligned_cols=399 Identities=11% Similarity=0.050 Sum_probs=164.4 Q ss_pred hhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHHHhhccee Q lcl|NC_019530. 22 VDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVDIIVNDVV 101 (560) Q Consensus 22 ~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~eIvneai 101 (560) ..+.+.+ . .....+...++ +--...+ ++....+.+-+ |. ...+++|-|.+||+-|-+.+. T Consensus 1 ~~~~r~~----~--~~~~~~~~~~~-~~~~~~~-g~~~s~~~~~v-----t~-------~~al~~~~v~~~v~~ia~~iA 60 (419) T protein:vir:14 1 MFFSRQL----L--SNLGQTQMSAG-GWVSALL-GSSRSDSGQVV-----TP-------ASALALTVLQNCVTLLAESIA 60 (419) T ss_pred Ccccccc----c--ccccccccCcc-hhhHHhh-cCCCccCCccc-----ch-------HHhhccHHHHHHHHHHHHhhc Confidence 1111110 0 00001111110 0000011 11111111111 11 134578889999987777543 Q ss_pred eecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhC----CchhhHHH----HHHhhhhhhhhhhheecCCCCccCe Q lcl|NC_019530. 102 TCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMD----FDNTAYQK----IRKWYVDGRQAYHVIVDPTNKKGGI 173 (560) Q Consensus 102 v~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~----f~~~~~~~----~R~wYvDGr~~~hkiiD~~~~k~GI 173 (560) -. |+.+-=...+ ..++.. ...+.++|. -...+.++ +..+.+.|.-|..++.|.. .-+ T Consensus 61 ~l-----p~~~~~~~~~----~~~~~~---~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~---G~~ 125 (419) T protein:vir:14 61 QL-----PIELYERSGE----DRKPAT---DHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSD---GVI 125 (419) T ss_pred cC-----ceEEEEecCC----cccccc---ccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC---CcE Confidence 22 2222111111 001111 112223332 12234444 4556677887776655442 337 Q ss_pred eEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchhceeeeeccccc Q lcl|NC_019530. 174 KKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDESIVYIDSGEEP 253 (560) Q Consensus 174 ~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~daI~y~hsGl~~ 253 (560) ++|++|+|..|..++ +.++. .+|.|. +. ..++.+.|.|... .+ T Consensus 126 ~~l~pl~~~~v~v~~-----~~~~~--------~~y~~~---------~~-------------~~~~~~~i~h~~~--~~ 168 (419) T protein:vir:14 126 QGLYPLDNEAVTVMR-----GSDLK--------PVYRVR---------GS-------------DPMPQRLVHHVRW--MS 168 (419) T ss_pred EEEEEecCceEEEEE-----CCCce--------EEEEEc---------cC-------------cccchhheeEecC--cC Confidence 899999998887642 11221 122221 11 0144555655542 33 Q ss_pred CCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCc--hHHHHHHHHHHHHhhhhheeecccCCccc Q lcl|NC_019530. 254 LANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLG--KKSAEEYMTMMMGKFKNRNAYDRTTGKIT 331 (560) Q Consensus 254 ~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlp--k~KAeqYl~~im~k~knk~vYd~~TGev~ 331 (560) .++-+..|-+..|..++.....+++...=+----+--+-++..+-..-+ ..++. +.+...++.+. +| .. T Consensus 169 ~dg~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~---~~~~~~~~~~~-----~g-~~ 239 (419) T protein:vir:14 169 INGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASV---DRITDGWNAKF-----GG-SG 239 (419) T ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHH---HHHHHHHHHHh-----cC-cc Confidence 4444566888999988888888877655444444566777777642211 12222 22223333221 12 11 Q ss_pred CcccchhhHhhhcccccCCCCccceeecCCCCCcchH---HHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHH Q lcl|NC_019530. 332 GNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEM---DHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQE 408 (560) Q Consensus 332 d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei---~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRD 408 (560) +..+.+ .++ .|.+++.|. .+..++ +--++..+.+.++++||...+....+.+ .++..+..+. T Consensus 240 nag~~~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t--~s~~E~~~~~ 304 (419) T protein:vir:14 240 NAKKVA-LLQ----------EGMTFRPLS--MTNVDAALIDALRLSALDIARIYKIPAHMVNELERAT--FSNIEHQSLQ 304 (419) T ss_pred ccCCce-ecC----------CCceEEEcc--CChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCC--cccHHHHHHH Confidence 111222 232 256666664 333333 3345778999999999999997543322 2323333322 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhc Q lcl|NC_019530. 409 ELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVG 488 (560) Q Consensus 409 ElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vG 488 (560) |..++ |+- +...++ +-+-++++++.++. ..+|. |..+. |.. .-+..|++.++.+-. + T Consensus 305 ---f~~~~--L~P-----~~~~ie-~~l~~kll~~~~~~---~~~i~--fd~~~----l~r-~d~~~~~~~~~~~~~--~ 361 (419) T protein:vir:14 305 ---FVIYT--LLP-----WVKRHE-QAKTRDLLLPSERK---QYFIE--YNLAG----LLR-GDQSSRYAAYAVGRQ--W 361 (419) T ss_pred ---HHHHH--HHH-----HHHHHH-HHHhhhccCccccC---CeEEE--Eechh----hhc-cCHHHHHHHHHHHHh--C Confidence 44432 322 111111 22233456776653 22344 43332 211 123345555554321 1 Q ss_pred chhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccc---cccccccccCCCCCCcccccccCCcccC--C Q lcl|NC_019530. 489 SIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADE---TGNYSGSDVSPLKFKPETIPFSGSTDDS--I 560 (560) Q Consensus 489 ky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e---~g~~~~~~~~~~~~~p~~~~~~~~~~~~--~ 560 (560) -++|.+-++. .+.+ -|-|.-|. +++..+.+.....-.++.-|..+.++.. + T Consensus 362 G~~T~NE~R~-~~gl--------------------~p~~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~e~~~~ 417 (419) T protein:vir:14 362 GWLSINDIRR-LENM--------------------PPVKGGDIYLSPMNMVDASKPQQLPVGKSEPTKAAIDEIGRI 417 (419) T ss_pred CCcCHHHHHH-HhCC--------------------CCCCCcCeeeeccccccccccccccCCCCCCccccccchhcc Confidence 3455555542 2333 12222221 1111111111000011112222222221 1 No 190 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=40.72 E-value=0.96 Score=20.68 Aligned_cols=414 Identities=13% Similarity=0.089 Sum_probs=189.4 Q ss_pred CCcccchhhhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHH Q lcl|NC_019530. 1 MAGYGRGFFGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEY 80 (560) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~Y 80 (560) |+. |..+-.|.- +......+..-.. ++ ... .+.. +-+++ .++...+.-++..-..++.| T Consensus 1 m~~---~i~~~~g~p------~~~~~~~~~~~~~-------ia-~~~-~~~~-~~~~~--~~~~~~~~iLr~~~~~~~~y 59 (491) T protein:vir:10 1 MSK---GLWVSPTEF------VTFGEPDKSLSSQ-------IA-TRA-RSID-FFALG--MYLPNPDPVLKALGKDIRVY 59 (491) T ss_pred CCC---ceeCCCCCc------cCcccCChHHHHH-------HH-hhh-cccc-ccccc--CCccchHHHHHhcCCCHHHH Confidence 554 222222221 2222111111111 11 000 0100 11111 12222322222222236799 Q ss_pred HHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhh Q lcl|NC_019530. 81 QSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAY 160 (560) Q Consensus 81 R~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~ 160 (560) +.|..++.|.++++.+..-+.-.+ ..+...+. ++...+.|.+.| +-++|+.-..+++.- ...|--++ T Consensus 60 ~~m~~D~~i~s~l~~Rk~av~~~~-------w~i~~~~~-~~~~~e~v~e~l----~~~~~~~~l~~~lda-~~~G~s~~ 126 (491) T protein:vir:10 60 RELRADAHVGGCVRRRKAAVKALE-------WGLDRGKA-KSRVAKSIADVF----ADLDLSRIVTEMLDA-VLYGYQPM 126 (491) T ss_pred HHHhhChHHHHHHHHHHHHHhCCC-------cEEecCCC-CHHHHHHHHHHH----hcCCHHHHHHHHHHh-hhhcceeE Confidence 999999999999988866544221 12221111 233444555544 445777777777643 34677777 Q ss_pred hheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecc Q lcl|NC_019530. 161 HVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFD 240 (560) Q Consensus 161 hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~ 240 (560) +++.......-.+.+|.+.+|+.+.+- ++++.. |.+. + ....+..++ T Consensus 127 Ei~w~~~~g~~~~~~l~~r~~~~f~~d-------~~~~l~----------~~~~-------~---------~~~~g~~l~ 173 (491) T protein:vir:10 127 EITWGKVGNYIVPIDVVGKPADWFVYD-------PENQLR----------FRSK-------D---------HWMQGEELP 173 (491) T ss_pred EEEEeecCCeeEEEEeeeecccceeec-------cCCceE----------EecC-------C---------CCCCcceec Confidence 777765443334455666666544431 112111 1100 0 001122333 Q ss_pred h-hceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHh-cCccceEEEccCCCCchHHHHHHHHHHHHhhh Q lcl|NC_019530. 241 D-ESIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAIT-RAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFK 318 (560) Q Consensus 241 ~-daI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRit-RAPeRRvFyIDvGnlpk~KAeqYl~~im~k~k 318 (560) + .-|+|+|.- ....-...|-|++|..+|--.+...-..+.+=-. =.|-|-. ..|.|.-.+.|. +.++. +.... T Consensus 174 ~~k~i~~~~~~--~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~ig-ky~~~a~~~ek~-~l~~a-l~~~~ 248 (491) T protein:vir:10 174 ARKFLVPRQEA--TYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVG-KHPRSASDGEKN-LLLDC-LEDMV 248 (491) T ss_pred CCCEEEEEecC--CCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEE-ecCCCCCHHHHH-HHHHH-HHHHh Confidence 3 367777753 2223445588999999988888666665555433 3455444 447776444432 23332 22222 Q ss_pred hheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchH----HHHHHHHHHHHHhcCCCccccCCCCc Q lcl|NC_019530. 319 NRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEM----DHVNYFREKLYDALMIPKSRLQEEGS 394 (560) Q Consensus 319 nk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei----~DV~YF~~kLy~aL~VP~sRl~~~~~ 394 (560) +.-+ + | +| .|++|+.+.-+.+-|.. .=++|..++.-+++-= .=|.++++ T Consensus 249 ~~a~-----~-v--------------iP-----~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLG--qtlTt~~~ 301 (491) T protein:vir:10 249 QDAV-----A-V--------------VP-----DDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIALLG--QNQTTEAT 301 (491) T ss_pred cCcE-----E-E--------------ec-----CCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHHhh--hhcccCcc Confidence 2211 1 1 22 46889888543333332 3388998888777531 11333332 Q ss_pred ccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHH Q lcl|NC_019530. 395 INIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILN 474 (560) Q Consensus 395 ~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~ 474 (560) . ++ .++++-.+ -+...++..++..+..+..+++--+.+++- .. +..+|.|... . |..+ T Consensus 302 g--s~-a~~~vh~~--v~~di~~~D~~~i~~tln~li~~l~~~N~~-~~--------~~p~f~~~~~------~--e~~~ 359 (491) T protein:vir:10 302 S--TR-ASAQAGLE--VTDDIRDGDKAVVSEAMNMLIRWICDLNFD-GA--------DRPVFDMWEQ------E--QVDE 359 (491) T ss_pred c--ch-hHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-CC--------CcceEEecCc------C--chhH Confidence 1 12 22444332 366777777777778888766655555543 21 1234444321 1 2233 Q ss_pred HHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCcccccccC Q lcl|NC_019530. 475 DRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKPETIPFSG 554 (560) Q Consensus 475 ~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~ 554 (560) .+.+.+..+-+. |==++.+|++++ +++...+.++. + .+.+.....+... .-...+ T Consensus 360 ~~a~~~~~L~~~-G~~i~~~~i~e~-~Gip~~~~~~~----------~-~~~~~~~~~~~~~------------~~~~~~ 414 (491) T protein:vir:10 360 IQAGRDQKLTQA-GARFTPAYFKRA-YNLQDGDLDER----------P-LPVSAVDTVGAAS------------FAEFEA 414 (491) T ss_pred HHHHHHHHHHhC-CCcCCHHHHHHH-hCCCCCCcCcc----------c-cccCCCCCccccc------------ccccCC Confidence 444555555443 444899999866 67654333211 0 1111101000000 000111 Q ss_pred CcccCC Q lcl|NC_019530. 555 STDDSI 560 (560) Q Consensus 555 ~~~~~~ 560 (560) ...+.. T Consensus 415 ~~~~~~ 420 (491) T protein:vir:10 415 PDQDAL 420 (491) T ss_pred CCCCch Confidence 111111 No 191 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=38.77 E-value=1.1 Score=20.47 Aligned_cols=431 Identities=13% Similarity=0.054 Sum_probs=147.7 Q ss_pred cCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccchhHHHH--------------------------- Q lcl|NC_019530. 42 VALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPEIRKAVD--------------------------- 94 (560) Q Consensus 42 ~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PEvd~AI~--------------------------- 94 (560) +++| -.-.........+..-+.... ....+++|+.+..+++=...|. T Consensus 1 ~~~~-~~~~~~~~~~~~~~~~i~~~~-----~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~n 74 (537) T protein:vir:78 1 MTSP-LLNKPIDQLGGLLNTEITTYM-----ASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHG 74 (537) T ss_pred CCcc-cccccHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccc Confidence 1111 111111111111111100000 0123455666655555444321 Q ss_pred ---HhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCCCcc Q lcl|NC_019530. 95 ---IIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTNKKG 171 (560) Q Consensus 95 ---eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~~k~ 171 (560) .||+-.+-+ --..||++..++.+ + +.+.+.+..+++ -+|++..+++.+.+.+.|+-|.|.-+|. + T Consensus 75 f~k~Ivd~~~~y-l~G~Pv~~~~~d~~--~----~e~~~~l~~~~~-~~~~~~~~el~~~~s~~G~ay~~~y~de----~ 142 (537) T protein:vir:78 75 FFTELVDQLAQY-LLSNGVEVKVKDED--N----TQLDEILQEYFD-EDFQATIDTLVTNASKKGFEGIFARTTS----E 142 (537) T ss_pred hHHHHHHHHhhh-hcccCceeecCcch--h----HHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeeEEEeeecC----C Confidence 111111111 22456666554421 2 234455555554 5788899999999999999888777665 3 Q ss_pred CeeEeEEeCccccchhhhhcccCCccccccccccch------------------heeecCcccccc--cccccccc--cc Q lcl|NC_019530. 172 GIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTL------------------KYYYNPNYNRNQ--FTGQSGTS--QN 229 (560) Q Consensus 172 GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e------------------~~~Y~p~~~~~~--~~g~~~~~--~~ 229 (560) |-..+..+||+.+=+|.. ..+ +..-.++- +-+|.+..-++. .++..... .. T Consensus 143 ~~~~~~~i~p~~~~pv~d-----~~~--~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~ 215 (537) T protein:vir:78 143 GKLKFQTVDGLTLIPVFD-----DYG--VLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLD 215 (537) T ss_pred CceEEEEEccceeEEEEc-----CCC--CceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCccccccccc Confidence 567788999988765531 101 11000000 112222211110 00000000 00 Q ss_pred ccccccc----eecc----hhc------eeeee-cccccC----CCCcchhhhHHHHHHHHHHHHHH-HHHHHHHHhcCc Q lcl|NC_019530. 230 FQPSQQE----LVFD----DES------IVYID-SGEEPL----ANGIVPGLLNPAIRPLNNLVTTE-DATVIYAITRAP 289 (560) Q Consensus 230 ~~~~~~~----~~i~----~da------I~y~h-sGl~~~----~~~~i~syL~~AiK~~NqL~m~E-DalVIyRitRAP 289 (560) ....... ..+. .+. ....| ||-+++ ++-...|=|+..+.....+.++= +..-..+-++.| T Consensus 216 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ 295 (537) T protein:vir:78 216 EAYNPNPAPHVLAIEESTDADFEDTDGYQVLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEA 295 (537) T ss_pred ccccccccceeeeccccccccccccccccccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCc Confidence 0000000 0000 000 00001 222211 22233455665555444433211 111111122222 Q ss_pred cceEEEccCCCCchHHHHHHHHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcch-H Q lcl|NC_019530. 290 EKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGE-M 368 (560) Q Consensus 290 eRRvFyIDvGnlpk~KAeqYl~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i 368 (560) ---+.-. +|+ +...++..+..+.+..-.| .|..+..|--..+..- - T Consensus 296 ilvi~g~------------------------------~~~--~~~~~~~~l~~~~~i~v~~-d~~~v~~l~~~~~~~~~e 342 (537) T protein:vir:78 296 IYVVKGF------------------------------SGD--STDKLRQNIKAKKMIGVNG-DNAGMEIQTVSIPYEARK 342 (537) T ss_pred eeeeecC------------------------------CCc--cchhHHHHHhhcCceeecC-CCCceeEEEecCCHHHHH Confidence 1111111 111 1111222222332211111 1122333332222211 1 Q ss_pred HHHHHHHHHHHHhcCCCccccCCCCcccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HhccCCChhH Q lcl|NC_019530. 369 DHVNYFREKLYDALMIPKSRLQEEGSINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQL---ILKGVTDEKD 445 (560) Q Consensus 369 ~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qL---iLkgiit~ee 445 (560) .-+.-..+.+|+-..+|-. ..... |...+..|..--.---.-+.+.++.|...|...|+.=+ -++|. ..-+ T Consensus 343 ~~ld~L~~~I~~~s~~~~~--~~~~~---gn~SGvAlk~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~~-~~~d 416 (537) T protein:vir:78 343 AKMDIDVENIYRSGMGFNS--TAVGD---GNVTNVVIKSRYTLLAMKARKMETSLRKVLRWCADMVVSDIALRGL-GEYD 416 (537) T ss_pred HHHHHHHHHHHHhcCCCCC--ccccc---cCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC-cccc Confidence 2244455566666666642 22222 22212222110000111233444444444443333222 22343 2333 Q ss_pred HHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHH-HHHHHHHHHHHhh----- Q lcl|NC_019530. 446 WNEKIKPFIKFEFTSDSYIREQQENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDE-EVKEQQTKIAEEK----- 519 (560) Q Consensus 446 w~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDe-EI~e~~kqIe~E~----- 519 (560) |. .|.+.|...---.+...++ ++..+. .+-.+|.+++++. +-..|+ |.+ +++++|. T Consensus 417 ~~-----~i~i~f~~~~P~n~~e~a~-------~~~~l~--~~giiS~eT~l~~-~p~vdd~e~e---k~~~ee~~~~~~ 478 (537) T protein:vir:78 417 SN-----DICFEIEPHVLANELDIAT-------TRKTEA--ETEALKIGNIMTV-APRIGDDETL---KLIAEELDLDYN 478 (537) T ss_pred cc-----eeeEEeccCCCCCHHHHHH-------HHHHHH--hcCcchHHHHHHh-CCCCCCHHHH---HHHHHHHHhhhh Confidence 42 4677777653333322222 222211 2247899999976 555443 222 2222221 Q ss_pred --hcC----------CCCCCCccccccccccccCCCCCCcccccccCCcccCC Q lcl|NC_019530. 520 --KKG----------LYPKVQADETGNYSGSDVSPLKFKPETIPFSGSTDDSI 560 (560) Q Consensus 520 --~~~----------~~~~p~~~e~g~~~~~~~~~~~~~p~~~~~~~~~~~~~ 560 (560) +.. ..++++ ++.+++.++. -+|..-|-.++.|.+. T Consensus 479 ~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~---~~~~~d~~~~~~~~~~ 525 (537) T protein:vir:78 479 ELKDALAEQDAQSLDVSPDVQ---AMLDGLPVNA---NQPPVDPNQPVADPNV 525 (537) T ss_pred hhhhhhhhhcccccCcCcchh---hhcCCCCCCC---CCCCCCccCCCCCCCC Confidence 000 011111 1111111110 0011111222222222 No 192 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=36.89 E-value=1.1 Score=20.26 Aligned_cols=391 Identities=12% Similarity=0.094 Sum_probs=154.6 Q ss_pred chhh-hhhhh-----hhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeE-ec-cccceeccccccchhhHHHHH Q lcl|NC_019530. 6 RGFF-GLFGG-----GGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIIL-QG-GANTYNYVGVESELLSVKTVV 77 (560) Q Consensus 6 ~~~~-~~~~~-----~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i-~~-~~~~~~y~~~e~~~~~~~~LI 77 (560) -||. +|||. +.....+++.. .-+.......-...++....+...- .+ +++...+..-.....+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~t~---- 72 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVDYN----PGDPDMVEFRGPEEEPEARALPWIRPTAWSGYPESWATPSWGSAQD---- 72 (409) T ss_pred CchhhhhcccccCCCccccccccccc----CCCCceeeccCCCcchhhhhcccccccccccccccccccCccccch---- Confidence 2222 23332 11111111111 0011111110111111111111100 00 11110111001111111 Q ss_pred HHHHHHhhccchhHHHHHhhcceeeecCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHh-CCchhh----HHHHHHh Q lcl|NC_019530. 78 EEYQSMAQQPEIRKAVDIIVNDVVTCEEDETPVTVNLDKVEGISDTVKESITECFKEVMHLM-DFDNTA----YQKIRKW 152 (560) Q Consensus 78 ~~YR~ma~~PEvd~AI~eIvneaiv~d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll-~f~~~~----~~~~R~w 152 (560) +.+++.|-|..||+-|-+.+.-. |+.+- ++.. .. ++..++++.- +-...+ +.++..+ T Consensus 73 ---~~~~~~~~v~acV~~Ia~~iA~l-----pl~~~-~~~~-~~--------~~~~~ll~~~PN~~~t~~~f~~~l~~~l 134 (409) T protein:vir:83 73 ---KLRTLIDVAWACIDLNASVLSSM-----PIYRM-RNGR-II--------DSVAWMSNPDPEVYTSWQEFAKQLFWDF 134 (409) T ss_pred ---hhHhhhHHHHHHHHHHHHhhccC-----ceEEe-eCCc-cc--------cchhhhcccCCCCCCCHHHHHHHHHHHH Confidence 34667889999997666644322 22221 1111 11 1122222210 011223 3344555 Q ss_pred hhhhhhhhhheecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccc Q lcl|NC_019530. 153 YVDGRQAYHVIVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQP 232 (560) Q Consensus 153 YvDGr~~~hkiiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~ 232 (560) .. |.-|..+|.... ..=+++|.+|+|.++...+ .+ ++ ..+|.|+ +.. T Consensus 135 ll-Gnay~~~i~r~~--~G~~~~L~pl~p~~v~v~~---~~--~g--------~~~y~~~---------~~~-------- 181 (409) T protein:vir:83 135 QL-GEAFVLPMAHGS--DGYPIRFRVVPPWLVNVEL---KK--GA--------RREYRIG---------GLN-------- 181 (409) T ss_pred hh-CCcEEEEEEECC--CCcEEEEEEECCcceEEEE---cC--Cc--------eEEEEEc---------ccc-------- Confidence 55 777766553221 1227899999998876532 11 12 1223332 110 Q ss_pred cccceecchhceeeeeccccc-CCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhc--CccceEEEccCCCCchHHHHHH Q lcl|NC_019530. 233 SQQELVFDDESIVYIDSGEEP-LANGIVPGLLNPAIRPLNNLVTTEDATVIYAITR--APEKRAFYLDVGTLGKKSAEEY 309 (560) Q Consensus 233 ~~~~~~i~~daI~y~hsGl~~-~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitR--APeRRvFyIDvGnlpk~KAeqY 309 (560) . .+.|.|+.. .. .+..+.+|-|+.|...+..-...++... +.+. |--.-|...| +.|-+.++++. T Consensus 182 ------~-~~eiiHir~--~~~~~~~~G~spi~~~~~~i~~~~a~~~~~~--~~f~nga~p~gil~~~-~~ls~e~~~~~ 249 (409) T protein:vir:83 182 ------V-TDEILHIRY--QGNTADAHGHGPLESAAPRQVVIGLLQKYVQ--NLAETGGVPLYWLGVE-RRLSETEAVDL 249 (409) T ss_pred ------C-ccceEEeCC--CCCCCCcccccHHHHHHHHHHHHHHHHHHHH--HHHhcCCCcceEeecC-CCCCHHHHHHH Confidence 0 134554431 22 2344567889988888887777777543 3333 2223344444 45666666665 Q ss_pred HHHHHHhhhhheeecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCcccc Q lcl|NC_019530. 310 MTMMMGKFKNRNAYDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRL 389 (560) Q Consensus 310 l~~im~k~knk~vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl 389 (560) .+.....|. ...|. .+ +++ +|+.-++..++.. ..+.-++-.+|-.+.+-++.+||...| T Consensus 250 ~~~~~~~~~------~nag~------~~-il~-------~g~~~~~~~~~s~-~d~q~le~r~~~~~eIa~~fgVPp~ll 308 (409) T protein:vir:83 250 MDRWIESRS------KYAGH------PA-LVT-------GGATLNQAKSMSA-QDLSLMELTQFNEARIAILLGVPPFLV 308 (409) T ss_pred HHHHHHhhC------CccCc------cc-eec-------CCcccccccCCCH-HHHHHHHHHHhhHHHHHHHhCCCHHHc Confidence 554433221 12232 11 233 1222122222221 111112223445566889999999999 Q ss_pred CCCC-cccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhhhhceeeeeecchHHHHH Q lcl|NC_019530. 390 QEEG-SINIGGSNLAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQ 468 (560) Q Consensus 390 ~~~~-~~~l~~~~~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~~~I~~~f~~Dn~f~Elk 468 (560) +-.+ +-+.+.++.++..+. |.++ .|+--+ ..+.+.|-..|+.. .+ +++|++ .++. T Consensus 309 g~~~~~~~~tysn~eq~~~~---f~~~--tL~P~~-~~ie~~l~~~Ll~~-----~~-------~~~f~~------~~ll 364 (409) T protein:vir:83 309 GLPGATGSLTYSNIEQLFSF---HDRS--SLRPKA-TAVMAALDRWALPS-----PQ-------HLELNR------DDYT 364 (409) T ss_pred cCCCCccccccccHHHHHHH---HHHH--HHHHHH-HHHHHHHHHhhCCC-----Cc-------EEEeeh------hhhh Confidence 7432 223344554444443 5432 333222 22334444444432 21 233322 2333 Q ss_pred HHHHHHHHHHHHHHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccccc-cccc Q lcl|NC_019530. 469 ENAILNDRLASLNTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYS-GSDV 540 (560) Q Consensus 469 e~Ei~~~Rl~~l~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~-~~~~ 540 (560) ..++ .+|.+.++.+-.- -++|.+-+++. +.| | | .+-|+.. .+++ T Consensus 365 r~d~-~~r~~~~~~~~~~--G~lT~NE~R~~-~gl---------------------p-p--~~ggd~l~~~gv 409 (409) T protein:vir:83 365 RPSL-VERATAYKIMIEA--GVMEPNEARAM-ERL---------------------H-S--EAAAVRLSGGGV 409 (409) T ss_pred ccCH-HHHHHHHHHHHhC--CCcCHHHHHHH-hCC---------------------C-C--CCCCcccCCCCC Confidence 3332 3566666554322 34555554421 222 1 1 1222222 1222 No 193 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=28.15 E-value=1.8 Score=19.22 Aligned_cols=369 Identities=13% Similarity=0.105 Sum_probs=149.7 Q ss_pred hhhhhhhhhhhhhhhhcccccchhhhccccccccCCCCCCCCceeEeccccceeccccccchhhHHHHHHHHHHHhhccc Q lcl|NC_019530. 9 FGLFGGGGLVNAKVDTDKLAQKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLSVKTVVEEYQSMAQQPE 88 (560) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~~~~LI~~YR~ma~~PE 88 (560) -|+|+-..++..+.-+ . +..... .+.. .. + .| +.+- T Consensus 1 Mg~f~~~~~~~~~~~~----------------------~-~~~~~~-------~~~~---~~------~-~~----~~~~ 36 (378) T protein:vir:16 1 MNLFGKVVSFSRGKLN----------------------N-DTQRVT-------AWQN---EA------V-EY----TSAF 36 (378) T ss_pred Cccchhhhhhhccccc----------------------C-Ccceee-------eccc---ch------h-hH----HHHH Confidence 4444444443332100 0 000000 0100 00 0 11 3334 Q ss_pred hhHHHHHhhcceeeecCCCceEEEee-ecccchhHHHHHHHHHHHHHHHHH-hCCchhhHH----HHHHhhhhhhhhhhh Q lcl|NC_019530. 89 IRKAVDIIVNDVVTCEEDETPVTVNL-DKVEGISDTVKESITECFKEVMHL-MDFDNTAYQ----KIRKWYVDGRQAYHV 162 (560) Q Consensus 89 vd~AI~eIvneaiv~d~~~~~V~l~L-d~~~~~s~sik~~I~eeF~~il~l-l~f~~~~~~----~~R~wYvDGr~~~hk 162 (560) |..||.-|.+.+..+ |+.+-- .......+..++.+.....++|+. =+=.-.+++ ++..+...|.-|..+ T Consensus 37 v~~~i~~Ia~~iA~l-----~~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~ 111 (378) T protein:vir:16 37 VTNIHNKIANEITKV-----EFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYA 111 (378) T ss_pred HHHHHHHHHhhhhhC-----ceeEEEEcccccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEE Confidence 888897777765433 222211 111111222223333444444442 122223444 344666778766655 Q ss_pred eecCCCCccCeeEeEEeCccccchhhhhcccCCccccccccccchheeecCccccccccccccccccccccccceecchh Q lcl|NC_019530. 163 IVDPTNKKGGIKKLVMLDSRCIRPVYIVEKAMREGGIEAIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDE 242 (560) Q Consensus 163 iiD~~~~k~GI~eLr~LDPr~i~~vr~i~~~~~~g~~~~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~d 242 (560) +.|... .++.++-| .+....++.+ T Consensus 112 ~~d~~~-----g~~~~l~~---------------------------------------------------~~~~~~~~~~ 135 (378) T protein:vir:16 112 VFDDNT-----GELLDLLF---------------------------------------------------ADDKKEYKPE 135 (378) T ss_pred EeecCC-----ceEEEEEe---------------------------------------------------cCCeeEeccc Confidence 544211 12222221 0112346667 Q ss_pred ceeeeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhhhhee Q lcl|NC_019530. 243 SIVYIDSGEEPLANGIVPGLLNPAIRPLNNLVTTEDATVIYAITRAPEKRAFYLDVGTLGKKSAEEYMTMMMGKFKNRNA 322 (560) Q Consensus 243 aI~y~hsGl~~~~~~~i~syL~~AiK~~NqL~m~EDalVIyRitRAPeRRvFyIDvGnlpk~KAeqYl~~im~k~knk~v 322 (560) .|.|..+- .+.....|.|+.|.+.++. ++ .-+--|-+..++ +.|....+.+....+...|++..- T Consensus 136 diih~r~~---~~~~~~~s~l~~~~~~i~~------~~-----~~~~~~g~l~~~-~~l~~~~~~~~~~~~~~~~~~~~~ 200 (378) T protein:vir:16 136 ELVRLTSP---FYINEDTSILDNALASIQT------KL-----EQGKLRGLLKIN-AFLDIDNTQEYREKALTTIKNMQE 200 (378) T ss_pred ceEEecCc---cCccchhHHHHHHHHHHHH------HH-----hcCccceeeEeC-CcCCHHHHHHHHHHHHHHHHHhhc Confidence 78877642 3455567888888776532 11 122223344443 455555555555555555554322 Q ss_pred ecccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccccch Q lcl|NC_019530. 323 YDRTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQLGEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSNL 402 (560) Q Consensus 323 Yd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~~ 402 (560) |. +.-+.+ .++ .|.+++.|.-.....++...+|-++.+.++++||.+.|.. +.+ T Consensus 201 -----~~--~~g~~~-vl~----------~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~g--------~~~ 254 (378) T protein:vir:16 201 -----GS--SYNGLT-PVD----------NKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILLG--------TAS 254 (378) T ss_pred -----cc--ccccce-EcC----------CCceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcC--------Cch Confidence 21 111122 121 2455666655445556788899999999999999998832 222 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHhhh--hhceeeeeecchHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 403 AEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLILKGVTDEKDWNEKIK--PFIKFEFTSDSYIREQQENAILNDRLASL 480 (560) Q Consensus 403 ~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLiLkgiit~eew~~~i~--~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l 480 (560) .+..+. |..+ .|+-.+ ..+.+.|.. .+++++++..... ....++|.-+ .+.... +..|++.+ T Consensus 255 e~~~~~---f~~~--tl~P~~-~~ie~~l~~-----kLl~~~e~~~~~~~~~~~~~~f~~~----~l~~~d-~~~~~~~~ 318 (378) T protein:vir:16 255 QEQQIY---FYNS--TIIPLL-IQLEKELTY-----KLISTNRRRVVKGNLYYERIIVDNQ----LFKFAT-LKELIDLY 318 (378) T ss_pred HHHHHH---HHHH--HHHHHH-HHHHHHHHh-----hcCChhhhhhhhhcccccceeeccc----hhhhcC-HHHHHHHH Confidence 222221 4333 233211 222333333 4456666643221 1122333222 222221 23555554 Q ss_pred HHhhhhhcchhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccccccccccCCCCCCc-ccccccCCcccC Q lcl|NC_019530. 481 NTVEPFVGSIFSIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADETGNYSGSDVSPLKFKP-ETIPFSGSTDDS 559 (560) Q Consensus 481 ~~~~p~vGky~S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e~g~~~~~~~~~~~~~p-~~~~~~~~~~~~ 559 (560) ..+-.- -++|.+-++. ++++..-+ .-+ -++. +.+..+-+..+..... .-.+-.|.++.- T Consensus 319 ~~~~~~--G~~T~NE~R~-~~g~~p~~--ggD---------~~~~------~~n~~~~~~~~~~~~~~~~~~~~~e~~ne 378 (378) T protein:vir:16 319 HENING--PIFTQNQLLV-KMGEQPIE--GGD---------VYIA------NLNAVAVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred HHHHhC--CCcCHHHHHH-HhCCCCCC--CCC---------eEee------ccccccccchhhhcCccCCCCCCCCCCCC Confidence 443322 2677777763 45553211 000 0000 0000000000000000 000000111111 No 194 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=21.38 E-value=2.6 Score=18.30 Aligned_cols=416 Identities=12% Similarity=0.036 Sum_probs=175.0 Q ss_pred cchhhhccccccccCCCCCCCCceeEeccccceeccccccchhh----HHHHHHHHHHHh-hccchhHHHHHhhcceeee Q lcl|NC_019530. 29 QKQDELLLTKATVVALDDAQDGSIILQGGANTYNYVGVESELLS----VKTVVEEYQSMA-QQPEIRKAVDIIVNDVVTC 103 (560) Q Consensus 29 ~~~~e~~~~k~~s~~~~~~~dG~~~i~~~~~~~~y~~~e~~~~~----~~~LI~~YR~ma-~~PEvd~AI~eIvneaiv~ 103 (560) .+-+= ...+.|--.-+-++..++.+....|.+-++..+. ..+=++-|++|. .++.|.++++..-.-+.-. T Consensus 1 ~~~~~-----~~~p~~~~~~~~~~~~~~~~~~~g~~~~D~~lr~~gg~~~~~~~l~~~m~e~D~~v~s~l~~Rk~av~~~ 75 (446) T protein:vir:98 1 MNMEV-----RNAPTPAIRRRTIYAMEHLGLATSYLSEDGGYKRAGKPTYQQLSAWDEAAQTEPIIAQGLDSIALSVLNK 75 (446) T ss_pred Ccccc-----cCCCchhhhhhhhhccccchhhcccCCcchHhhhcCCChHHHHHHHHHHHhcchHHHHHHHHHHHHhhcC Confidence 00000 0001111111222333333333334444443321 223357899997 4999999998876655544 Q ss_pred cCCCceEEEeeecccchhHHHHHHHHHHHHHHHHHhCCchhhHHHHHHhhhhhhhhhhheecCCC----Cc---cCeeEe Q lcl|NC_019530. 104 EEDETPVTVNLDKVEGISDTVKESITECFKEVMHLMDFDNTAYQKIRKWYVDGRQAYHVIVDPTN----KK---GGIKKL 176 (560) Q Consensus 104 d~~~~~V~l~Ld~~~~~s~sik~~I~eeF~~il~ll~f~~~~~~~~R~wYvDGr~~~hkiiD~~~----~k---~GI~eL 176 (560) +=.-.| + ++ ++.+..+++|+-++|+--.+.+. .-...|=-+++++..... |+ .|++.+ T Consensus 76 ~w~V~p---------~-~~----~~a~~v~~~l~~~~~~~~~~~~l-dai~~G~s~~Eivw~~~~g~~~p~~~~d~~~~~ 140 (446) T protein:vir:98 76 VGPYQH---------G-DK----RIKKFIDDQLRNRAKTWISHCVK-SIMTYGFSLSEQIYAHGARDNMPATVLDDIVNY 140 (446) T ss_pred CceecC---------c-cH----HHHHHHHHHHhhcCchhHHHHHH-HHHhhCceeeeEEEeecccccccchhhcccccc Confidence 322222 1 22 34555566667777766666653 223356666666664321 11 112222 Q ss_pred EEeCccccchhhhhcccCCccccc--cccccchheeecCccccccccccccccccccccccceecchhc-eeeeeccccc Q lcl|NC_019530. 177 VMLDSRCIRPVYIVEKAMREGGIE--AIESVTLKYYYNPNYNRNQFTGQSGTSQNFQPSQQELVFDDES-IVYIDSGEEP 253 (560) Q Consensus 177 r~LDPr~i~~vr~i~~~~~~g~~~--~~~~~~e~~~Y~p~~~~~~~~g~~~~~~~~~~~~~~~~i~~da-I~y~hsGl~~ 253 (560) .|+.++.+ .+..+.+- ...+...+.......-.....+... ......+..+.||..- ++|.|..- T Consensus 141 ---~~~~~r~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~g~~~~iP~~kfi~~~~~~~-- 208 (446) T protein:vir:98 141 ---HPLQVMLI-----ANDNGRIVDGDTVTASQYKSGYWVPLPPYRIGDPP--KKVDVVGSHVRLPSHKRLFINYNTK-- 208 (446) T ss_pred ---ccccceee-----eccCCccccccccchhhcccccccCcccchhhhhh--hhcccCcccccccccceEEEEecCC-- Confidence 22222111 01111000 0001111111110000000000000 0011223345556544 67777532 Q ss_pred CCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHh-cCccceEEEccCCCCchHH-------HHHHHHH-HHHhhhhheeec Q lcl|NC_019530. 254 LANGIVPGLLNPAIRPLNNLVTTEDATVIYAIT-RAPEKRAFYLDVGTLGKKS-------AEEYMTM-MMGKFKNRNAYD 324 (560) Q Consensus 254 ~~~~~i~syL~~AiK~~NqL~m~EDalVIyRit-RAPeRRvFyIDvGnlpk~K-------AeqYl~~-im~k~knk~vYd 324 (560) ..+-+-.|-|+.|..+|-=.+-.+...+.+=-- =.|-|.--| +-|..++.| ++..+.+ +++..++--. | T Consensus 209 ~~~p~G~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vGky-p~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~~~-d 286 (446) T protein:vir:98 209 GNNPWGTSCLTSVLDYSIFKRAFRDMMLIALDRYGTPLIYVIV-PPGNTGVVEEAPDGTEITTTIAEQAEDALRRLST-D 286 (446) T ss_pred CCCccccchHHHHHHHHHHHHhhHHHHHHHHhHcCCceeEEee-cCCCCcccccchhHHHHHHHHHHHHHHHHHhccc-c Confidence 222344588999999887766666555554333 244444333 344433332 2222221 3333333211 1 Q ss_pred ccCCcccCcccchhhHhhhcccccCCCCccceeecCCCCCc--chHHHHHHHHHHHHHhcCCCccccCCCCcccccccc- Q lcl|NC_019530. 325 RTTGKITGNAHLMGIAEDYWLPRREGQNATEIATVGGGNQL--GEMDHVNYFREKLYDALMIPKSRLQEEGSINIGGSN- 401 (560) Q Consensus 325 ~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nL--gei~DV~YF~~kLy~aL~VP~sRl~~~~~~~l~~~~- 401 (560) .-| +++- --+| -|+||..+..+.+. .-..=|+|+.+..-+++-...=-|+.+.+- ++|. T Consensus 287 -a~~-------ii~~---~~~P-----~g~eie~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~--~GS~a 348 (446) T protein:vir:98 287 -SGL-------VLTQ---LSKE-----QPVQVGALTTGNNFSDSFERAISLCDNNMLMGMGIPNLLVQNRETT--FGTGR 348 (446) T ss_pred -cee-------eeec---ccCC-----CCceEEeeccccCChhhHHHHHHHHHHHHHHHHhcccccccccccc--cchhh Confidence 011 1100 0001 35778776554332 233458999999999887775444433221 1221 Q ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hccCCChhHHHHhhhhhceeeeeecchHHHHHHHHHHHHHHHHH Q lcl|NC_019530. 402 LAEITQEELRFSKFCAGLRRRYSHFFMEFLRRQLI-LKGVTDEKDWNEKIKPFIKFEFTSDSYIREQQENAILNDRLASL 480 (560) Q Consensus 402 ~~eItRDElkF~Kfi~rLr~rfs~if~d~Lk~qLi-Lkgiit~eew~~~i~~~I~~~f~~Dn~f~Elke~Ei~~~Rl~~l 480 (560) +++. ..|+ |...++...+.++..|..-|-..|+ +++= ..+.-...+.+.+++++ .|-.+.+.+.+++. T Consensus 349 la~v-h~~V-~~d~~~aDa~~i~~tln~~Li~~l~~lNf~-~~~~~~~~~~~~~~~~~------~e~eDl~~~a~~~~-- 417 (446) T protein:vir:98 349 ASEI-QLEL-FDGKINSIFDTVIHAFTEQVIGNLIRLNFD-PALYPLASNTGYITRLP------GRATDLAALVEAIK-- 417 (446) T ss_pred hHHH-HHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-ccccccccccccceecc------CChhhHHHHHHHHH-- Confidence 2322 3333 5566666777777777644434444 4421 11110011112233333 34344444444444 Q ss_pred HHhhhhhcchh--hHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccc Q lcl|NC_019530. 481 NTVEPFVGSIF--SIDYVMRNVLRMSDEEVKEQQTKIAEEKKKGLYPKVQADE 531 (560) Q Consensus 481 ~~~~p~vGky~--S~~~i~k~IL~~tDeEI~e~~kqIe~E~~~~~~~~p~~~e 531 (560) .+-+. |-.. +.+||++. +++ |++.++. T Consensus 418 -~L~~~-G~~~p~~~~~ire~-~gi---------------------P~~~~~~ 446 (446) T protein:vir:98 418 -QMHDM-GFLVDGDKDHIRSI-TGL---------------------PDAISST 446 (446) T ss_pred -HHHhC-CccccccHHHHHHH-hCc---------------------CCCCCCC Confidence 44433 3222 45666655 443 3322222 Done!