Query lcl|NC_018087.1_cdsid_YP_006489123.1 [gene=ZZ1p339] [protein=gp20 portal head vertex protein] [protein_id=YP_006489123.1] [location=complement(133628..135190)] Match_columns 520 No_of_seqs 77 out of 89 Neff 4.1 Searched_HMMs 1612 Date Thu Nov 7 16:09:37 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_339 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_339_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:108049 Length: 524 100.0 2E-286 1E-289 1587.1 45.9 518 3-520 1-522 (524) 2 protein:vir:6896 Length: 523 # 100.0 6E-286 4E-289 1584.3 46.0 517 3-520 1-521 (523) 3 protein:vir:103458 Length: 524 100.0 1E-283 7E-287 1571.8 45.8 517 3-520 1-522 (524) 4 protein:vir:7208 Length: 524 # 100.0 1E-283 8E-287 1571.5 45.8 517 3-520 1-522 (524) 5 protein:vir:101189 Length: 516 100.0 2E-283 1E-286 1571.1 46.0 513 6-520 1-515 (516) 6 protein:vir:101806 Length: 516 100.0 2E-283 1E-286 1571.1 46.0 513 6-520 1-515 (516) 7 protein:vir:98265 Length: 524 100.0 3E-283 2E-286 1569.7 47.1 518 1-520 1-522 (524) 8 protein:vir:100598 Length: 516 100.0 1E-282 9E-286 1565.7 46.3 513 6-520 1-515 (516) 9 protein:vir:81017 Length: 521 100.0 9E-282 5E-285 1561.4 46.5 515 3-520 1-519 (521) 10 protein:vir:106282 Length: 521 100.0 2E-281 1E-284 1559.9 46.4 514 3-520 1-519 (521) 11 protein:vir:6596 Length: 521 # 100.0 3E-281 2E-284 1558.8 46.4 515 3-520 1-519 (521) 12 protein:vir:104892 Length: 558 100.0 6E-278 3E-281 1540.6 45.2 501 8-520 1-512 (558) 13 protein:vir:103177 Length: 533 100.0 1E-274 7E-278 1522.6 43.9 493 8-520 1-500 (533) 14 protein:vir:5665 Length: 511 # 100.0 3E-273 2E-276 1514.3 43.8 504 12-519 1-511 (511) 15 protein:vir:104500 Length: 537 100.0 7E-273 4E-276 1512.6 43.6 494 7-520 1-501 (537) 16 protein:vir:106999 Length: 564 100.0 5E-273 3E-276 1513.4 41.4 499 8-520 1-515 (564) 17 protein:vir:5839 Length: 533 # 100.0 1E-234 7E-238 1303.2 37.6 463 9-520 1-471 (533) 18 protein:vir:107742 Length: 537 99.4 7.8E-13 4.9E-16 87.0 26.7 430 11-520 1-515 (537) 19 protein:vir:94049 Length: 532 99.3 7E-11 4.3E-14 76.3 27.3 438 3-520 1-499 (532) 20 protein:vir:79538 Length: 502 99.2 8.9E-10 5.5E-13 70.2 33.8 451 1-519 1-502 (502) 21 protein:vir:96068 Length: 765 99.2 2.1E-10 1.3E-13 73.7 25.7 437 1-520 4-514 (765) 22 protein:vir:104338 Length: 422 99.1 7.1E-10 4.4E-13 70.7 26.1 393 37-518 1-422 (422) 23 protein:vir:80040 Length: 461 99.1 1.6E-09 1E-12 68.7 27.6 425 15-520 1-459 (461) 24 protein:vir:5249 Length: 437 # 99.1 2.5E-09 1.5E-12 67.8 28.8 405 25-518 1-437 (437) 25 protein:vir:107662 Length: 427 99.0 5E-09 3.1E-12 66.1 27.2 396 35-520 1-423 (427) 26 protein:vir:99563 Length: 862 99.0 7.9E-09 4.9E-12 65.0 26.7 439 1-520 66-550 (862) 27 protein:vir:105782 Length: 449 98.8 3.3E-08 2E-11 61.6 24.5 412 1-520 1-441 (449) 28 protein:vir:101418 Length: 569 98.7 7.8E-08 4.8E-11 59.6 27.3 477 3-520 1-541 (569) 29 protein:vir:38 Length: 496 # N 98.6 2.3E-07 1.4E-10 57.0 25.4 438 4-520 1-491 (496) 30 protein:vir:95542 Length: 548 98.6 2.3E-07 1.5E-10 56.9 33.0 444 1-520 1-485 (548) 31 protein:vir:96579 Length: 576 98.6 2.4E-07 1.5E-10 56.9 30.6 454 1-520 1-518 (576) 32 protein:vir:3843 Length: 397 # 98.6 2.5E-07 1.5E-10 56.8 24.6 376 9-518 1-397 (397) 33 protein:vir:95599 Length: 563 98.6 2.6E-07 1.6E-10 56.7 28.6 444 1-520 1-519 (563) 34 protein:vir:99312 Length: 563 98.6 2.6E-07 1.6E-10 56.7 28.6 444 1-520 1-519 (563) 35 protein:vir:96738 Length: 505 98.5 3.5E-07 2.2E-10 56.0 31.5 440 16-520 1-490 (505) 36 protein:vir:3420 Length: 533 # 98.5 4.4E-07 2.7E-10 55.4 29.8 453 27-520 1-513 (533) 37 protein:vir:6382 Length: 553 # 98.5 4.6E-07 2.8E-10 55.3 31.2 463 3-520 1-528 (553) 38 protein:vir:79647 Length: 435 98.4 6.1E-07 3.8E-10 54.7 28.5 398 12-520 1-423 (435) 39 protein:vir:3153 Length: 467 # 98.4 6.9E-07 4.3E-10 54.4 27.2 382 78-520 1-445 (467) 40 protein:vir:389 Length: 530 # 98.4 7.5E-07 4.7E-10 54.2 32.3 450 29-520 1-511 (530) 41 protein:vir:80644 Length: 551 98.4 8.5E-07 5.3E-10 53.9 30.0 444 5-520 1-512 (551) 42 protein:vir:4194 Length: 540 # 98.3 1.4E-06 9E-10 52.6 22.4 418 11-520 1-450 (540) 43 protein:vir:10321 Length: 495 98.3 1.6E-06 9.7E-10 52.4 29.8 448 1-520 1-477 (495) 44 protein:vir:79703 Length: 505 98.2 2.2E-06 1.4E-09 51.6 29.5 433 1-520 1-502 (505) 45 protein:vir:80959 Length: 499 98.2 2.4E-06 1.5E-09 51.4 27.1 428 4-520 1-494 (499) 46 protein:vir:79772 Length: 648 98.2 3E-06 1.9E-09 50.8 30.6 437 3-520 1-484 (648) 47 protein:vir:63755 Length: 547 98.2 3E-06 1.9E-09 50.8 28.3 439 9-520 1-510 (547) 48 protein:vir:102080 Length: 429 98.2 3E-06 1.9E-09 50.8 27.4 404 1-520 1-421 (429) 49 protein:vir:78227 Length: 480 98.1 3.9E-06 2.4E-09 50.2 23.4 393 68-520 1-463 (480) 50 protein:vir:5737 Length: 419 # 98.1 5E-06 3.1E-09 49.6 25.5 393 9-520 1-414 (419) 51 protein:vir:93610 Length: 454 98.1 5.6E-06 3.4E-09 49.4 26.0 403 11-520 1-437 (454) 52 protein:vir:105002 Length: 432 98.0 7E-06 4.3E-09 48.9 26.8 406 1-520 1-424 (432) 53 protein:vir:107605 Length: 432 98.0 7E-06 4.3E-09 48.9 26.8 406 1-520 1-424 (432) 54 protein:vir:102855 Length: 432 98.0 7E-06 4.3E-09 48.9 26.8 406 1-520 1-424 (432) 55 protein:vir:3028 Length: 500 # 98.0 7.9E-06 4.9E-09 48.5 26.2 443 1-520 1-497 (500) 56 protein:vir:9815 Length: 500 # 98.0 7.9E-06 4.9E-09 48.5 26.2 443 1-520 1-497 (500) 57 protein:vir:102727 Length: 945 98.0 9.2E-06 5.7E-09 48.2 25.0 426 1-520 50-529 (945) 58 protein:vir:97060 Length: 432 98.0 9.5E-06 5.9E-09 48.1 25.1 404 3-520 1-430 (432) 59 protein:vir:80796 Length: 574 97.9 1.2E-05 7.7E-09 47.5 27.9 442 1-520 1-521 (574) 60 protein:vir:1587 Length: 508 # 97.9 1.3E-05 8.1E-09 47.4 28.1 423 43-520 1-502 (508) 61 protein:vir:9922 Length: 489 # 97.9 1.3E-05 8.2E-09 47.3 24.9 427 46-520 1-483 (489) 62 protein:vir:78907 Length: 518 97.8 1.8E-05 1.1E-08 46.7 29.9 412 45-520 1-510 (518) 63 protein:vir:2683 Length: 412 # 97.8 1.8E-05 1.1E-08 46.6 27.7 391 9-520 1-412 (412) 64 protein:vir:4156 Length: 542 # 97.8 1.8E-05 1.1E-08 46.6 25.4 408 11-520 1-438 (542) 65 protein:vir:81095 Length: 416 97.8 2.1E-05 1.3E-08 46.3 23.5 389 9-519 1-416 (416) 66 protein:vir:4598 Length: 416 # 97.8 2.1E-05 1.3E-08 46.3 23.5 389 9-519 1-416 (416) 67 protein:vir:100249 Length: 431 97.8 2.2E-05 1.4E-08 46.1 26.7 412 9-520 1-427 (431) 68 protein:vir:100882 Length: 383 97.7 2.3E-05 1.4E-08 46.0 26.4 373 9-518 1-383 (383) 69 protein:vir:7853 Length: 518 # 97.7 2.7E-05 1.7E-08 45.7 28.0 398 26-520 1-426 (518) 70 protein:vir:4337 Length: 434 # 97.7 2.8E-05 1.7E-08 45.6 26.6 406 1-519 1-434 (434) 71 protein:vir:78537 Length: 480 97.6 3.4E-05 2.1E-08 45.1 25.2 404 42-520 1-463 (480) 72 protein:vir:78641 Length: 278 97.6 3.5E-05 2.2E-08 45.0 21.6 269 105-451 1-278 (278) 73 protein:vir:960 Length: 413 # 97.6 3.6E-05 2.2E-08 44.9 25.8 395 1-520 1-413 (413) 74 protein:vir:10362 Length: 432 97.6 3.7E-05 2.3E-08 44.9 27.1 401 3-520 1-430 (432) 75 protein:vir:3989 Length: 392 # 97.6 3.8E-05 2.4E-08 44.8 23.3 380 7-517 1-392 (392) 76 protein:vir:1023 Length: 392 # 97.6 3.8E-05 2.4E-08 44.8 23.3 380 7-517 1-392 (392) 77 protein:vir:95378 Length: 406 97.5 5.2E-05 3.2E-08 44.1 23.3 382 9-520 1-405 (406) 78 protein:vir:105461 Length: 470 97.5 5.2E-05 3.2E-08 44.1 29.3 397 44-519 1-470 (470) 79 protein:vir:98444 Length: 434 97.5 5.8E-05 3.6E-08 43.8 28.6 393 61-520 1-425 (434) 80 protein:vir:9702 Length: 406 # 97.5 5.9E-05 3.6E-08 43.8 23.8 379 12-520 1-399 (406) 81 protein:vir:96980 Length: 409 97.4 6.6E-05 4.1E-08 43.5 26.8 386 3-520 1-395 (409) 82 protein:vir:80134 Length: 403 97.4 7.1E-05 4.4E-08 43.3 24.4 380 9-518 1-403 (403) 83 protein:vir:101648 Length: 518 97.4 7.8E-05 4.8E-08 43.1 26.3 400 26-520 1-432 (518) 84 protein:vir:105064 Length: 421 97.4 7.9E-05 4.9E-08 43.1 20.6 398 10-520 1-417 (421) 85 protein:vir:5961 Length: 503 # 97.4 8.6E-05 5.3E-08 42.9 27.4 412 42-520 1-490 (503) 86 protein:vir:4995 Length: 384 # 97.3 9.4E-05 5.8E-08 42.7 27.2 373 9-520 1-383 (384) 87 protein:vir:4828 Length: 382 # 97.3 0.0001 6.2E-08 42.5 26.2 369 9-519 1-382 (382) 88 protein:vir:2341 Length: 488 # 97.3 0.00011 6.7E-08 42.3 24.5 435 34-520 1-472 (488) 89 protein:vir:4454 Length: 414 # 97.2 0.00013 7.9E-08 41.9 27.2 386 9-520 1-411 (414) 90 protein:vir:7768 Length: 484 # 97.2 0.00014 8.7E-08 41.7 21.3 434 27-520 1-471 (484) 91 protein:vir:7407 Length: 392 # 97.2 0.00014 8.9E-08 41.7 25.0 362 1-480 1-392 (392) 92 protein:vir:94426 Length: 409 97.2 0.00014 8.9E-08 41.7 28.2 391 3-520 1-409 (409) 93 protein:vir:99916 Length: 504 97.1 0.00016 9.8E-08 41.4 24.3 429 43-520 1-483 (504) 94 protein:vir:4952 Length: 386 # 97.1 0.00017 1E-07 41.3 24.5 377 9-519 1-386 (386) 95 protein:vir:1236 Length: 483 # 97.1 0.00017 1.1E-07 41.2 29.3 407 34-520 1-477 (483) 96 protein:vir:94101 Length: 474 97.1 0.00018 1.1E-07 41.1 26.4 428 9-520 1-469 (474) 97 protein:vir:105889 Length: 474 97.1 0.00018 1.1E-07 41.1 26.4 428 9-520 1-469 (474) 98 protein:vir:1326 Length: 457 # 97.1 0.00018 1.1E-07 41.1 26.1 408 9-520 1-433 (457) 99 protein:vir:98883 Length: 517 96.9 0.00024 1.5E-07 40.4 26.6 437 1-519 1-517 (517) 100 protein:vir:8418 Length: 409 # 96.9 0.00027 1.7E-07 40.2 25.7 383 9-520 1-404 (409) 101 protein:vir:102118 Length: 409 96.9 0.00029 1.8E-07 40.0 24.4 391 3-519 1-409 (409) 102 protein:vir:1082 Length: 359 # 96.8 0.00032 2E-07 39.7 22.5 339 1-475 1-359 (359) 103 protein:vir:6240 Length: 457 # 96.8 0.00033 2E-07 39.7 26.5 409 9-520 1-433 (457) 104 protein:vir:483 Length: 413 # 96.8 0.00034 2.1E-07 39.6 25.3 391 21-520 1-410 (413) 105 protein:vir:81072 Length: 432 96.8 0.00036 2.3E-07 39.4 28.6 403 3-520 1-430 (432) 106 protein:vir:101647 Length: 460 96.7 0.00036 2.3E-07 39.4 29.2 420 4-520 1-460 (460) 107 protein:vir:102950 Length: 471 96.7 0.00038 2.3E-07 39.4 29.9 381 62-520 1-470 (471) 108 protein:vir:4854 Length: 386 # 96.7 0.0004 2.5E-07 39.2 26.6 376 9-519 1-386 (386) 109 protein:vir:100187 Length: 385 96.7 0.00043 2.7E-07 39.0 26.8 363 1-520 1-370 (385) 110 protein:vir:9871 Length: 429 # 96.6 0.00049 3E-07 38.8 29.0 384 46-520 1-428 (429) 111 protein:vir:101541 Length: 694 96.6 0.00052 3.2E-07 38.6 24.2 424 1-520 54-523 (694) 112 protein:vir:4509 Length: 424 # 96.6 0.00052 3.2E-07 38.6 25.5 399 3-519 1-424 (424) 113 protein:vir:1380 Length: 422 # 96.5 0.00053 3.3E-07 38.5 26.9 400 9-518 1-422 (422) 114 protein:vir:94805 Length: 492 96.5 0.00055 3.4E-07 38.5 25.9 415 9-520 1-486 (492) 115 protein:vir:105292 Length: 478 96.5 0.00058 3.6E-07 38.3 27.7 408 8-520 1-469 (478) 116 protein:vir:106639 Length: 481 96.4 0.00062 3.8E-07 38.2 27.2 422 1-520 28-476 (481) 117 protein:vir:1266 Length: 416 # 96.4 0.0007 4.3E-07 37.9 26.2 393 3-519 1-416 (416) 118 protein:vir:78589 Length: 695 96.2 0.00088 5.4E-07 37.3 23.8 435 1-520 35-524 (695) 119 protein:vir:3609 Length: 452 # 96.2 0.00092 5.7E-07 37.2 31.7 400 27-520 1-445 (452) 120 protein:vir:106571 Length: 499 96.1 0.00099 6.2E-07 37.0 30.5 397 42-520 1-475 (499) 121 protein:vir:96494 Length: 501 96.1 0.001 6.2E-07 37.0 27.8 449 1-520 3-485 (501) 122 protein:vir:7987 Length: 456 # 96.0 0.0011 6.8E-07 36.8 21.5 389 35-519 1-456 (456) 123 protein:vir:79984 Length: 441 96.0 0.0011 6.9E-07 36.8 26.8 408 1-519 1-441 (441) 124 protein:vir:9408 Length: 441 # 96.0 0.0011 6.9E-07 36.8 26.8 408 1-519 1-441 (441) 125 protein:vir:3964 Length: 453 # 96.0 0.0012 7.3E-07 36.7 29.2 400 16-519 1-453 (453) 126 protein:vir:104082 Length: 485 95.9 0.0012 7.6E-07 36.5 26.8 429 29-520 1-469 (485) 127 protein:vir:189 Length: 424 # 95.9 0.0013 8.1E-07 36.4 29.7 399 1-518 8-424 (424) 128 protein:vir:4782 Length: 522 # 95.9 0.0013 8.2E-07 36.4 28.6 426 1-520 1-507 (522) 129 protein:vir:2500 Length: 501 # 95.8 0.0014 9E-07 36.2 27.8 411 1-520 21-486 (501) 130 protein:vir:107112 Length: 478 95.8 0.0015 9.2E-07 36.1 28.7 395 45-520 1-469 (478) 131 protein:vir:2427 Length: 485 # 95.8 0.0015 9.3E-07 36.1 24.6 415 34-520 1-472 (485) 132 protein:vir:99522 Length: 470 95.7 0.0015 9.5E-07 36.0 30.3 416 24-520 1-470 (470) 133 protein:vir:97447 Length: 474 95.7 0.0016 9.8E-07 35.9 28.6 395 26-520 1-468 (474) 134 protein:vir:94498 Length: 474 95.7 0.0016 9.8E-07 35.9 28.6 395 26-520 1-468 (474) 135 protein:vir:93747 Length: 472 95.7 0.0016 9.9E-07 35.9 29.9 397 45-520 1-465 (472) 136 protein:vir:80680 Length: 441 95.6 0.0017 1.1E-06 35.7 28.8 397 38-520 1-441 (441) 137 protein:vir:9306 Length: 511 # 95.6 0.0018 1.1E-06 35.7 30.3 426 8-520 1-496 (511) 138 protein:vir:96179 Length: 468 95.5 0.0019 1.2E-06 35.5 29.7 399 45-520 1-467 (468) 139 protein:vir:100150 Length: 437 95.5 0.002 1.2E-06 35.4 28.5 406 1-520 1-431 (437) 140 protein:vir:96240 Length: 511 95.5 0.002 1.3E-06 35.3 28.9 426 8-520 1-496 (511) 141 protein:vir:98396 Length: 441 95.4 0.0021 1.3E-06 35.3 26.3 406 3-519 1-441 (441) 142 protein:vir:93943 Length: 409 95.4 0.0022 1.4E-06 35.1 27.8 390 3-520 1-409 (409) 143 protein:vir:3648 Length: 695 # 95.3 0.0023 1.4E-06 35.1 24.8 436 1-520 35-524 (695) 144 protein:vir:99781 Length: 511 95.3 0.0024 1.5E-06 34.9 28.4 419 8-520 1-496 (511) 145 protein:vir:95113 Length: 474 95.1 0.0027 1.7E-06 34.7 26.5 395 30-520 1-467 (474) 146 protein:vir:94666 Length: 723 94.8 0.0034 2.1E-06 34.1 25.0 386 43-520 1-406 (723) 147 protein:vir:100691 Length: 535 94.8 0.0035 2.2E-06 34.0 28.9 441 1-520 1-492 (535) 148 protein:vir:79063 Length: 491 94.4 0.0045 2.8E-06 33.5 25.1 401 3-520 1-420 (491) 149 protein:vir:96366 Length: 511 94.4 0.0045 2.8E-06 33.4 29.5 425 8-520 1-496 (511) 150 protein:vir:78805 Length: 511 94.4 0.0045 2.8E-06 33.4 29.5 425 8-520 1-496 (511) 151 protein:vir:2732 Length: 501 # 94.1 0.0054 3.4E-06 33.0 27.7 443 1-520 8-481 (501) 152 protein:vir:97336 Length: 492 94.0 0.0058 3.6E-06 32.9 28.1 415 9-520 1-485 (492) 153 protein:vir:95806 Length: 440 93.9 0.0062 3.8E-06 32.7 29.1 386 60-519 1-440 (440) 154 protein:vir:94546 Length: 506 93.6 0.0069 4.3E-06 32.4 24.3 419 3-520 1-505 (506) 155 protein:vir:4898 Length: 502 # 93.5 0.0075 4.6E-06 32.2 27.7 442 3-520 1-493 (502) 156 protein:vir:4223 Length: 486 # 93.3 0.0081 5E-06 32.0 22.0 430 29-520 1-474 (486) 157 protein:vir:105819 Length: 456 93.2 0.0083 5.2E-06 32.0 23.9 411 32-519 1-456 (456) 158 protein:vir:102602 Length: 456 93.2 0.0083 5.2E-06 32.0 23.9 411 32-519 1-456 (456) 159 protein:vir:9359 Length: 348 # 93.1 0.009 5.6E-06 31.8 26.6 325 103-520 1-348 (348) 160 protein:vir:103951 Length: 511 93.1 0.009 5.6E-06 31.8 29.8 421 8-520 1-496 (511) 161 protein:vir:99072 Length: 479 92.9 0.0096 5.9E-06 31.7 29.9 417 1-520 1-451 (479) 162 protein:vir:79043 Length: 479 92.8 0.0098 6.1E-06 31.6 27.5 407 11-520 1-479 (479) 163 protein:vir:733 Length: 453 # 92.7 0.01 6.3E-06 31.5 29.9 405 5-520 1-449 (453) 164 protein:vir:80333 Length: 419 92.7 0.01 6.4E-06 31.5 21.7 392 3-520 1-409 (419) 165 protein:vir:81152 Length: 411 92.7 0.01 6.4E-06 31.5 27.9 387 12-517 1-411 (411) 166 protein:vir:8184 Length: 474 # 92.6 0.011 6.6E-06 31.4 25.6 409 45-518 1-474 (474) 167 protein:vir:4698 Length: 251 # 92.4 0.012 7.2E-06 31.2 16.8 242 9-331 1-251 (251) 168 protein:vir:106716 Length: 698 91.8 0.014 8.7E-06 30.7 25.7 418 1-520 55-524 (698) 169 protein:vir:8100 Length: 466 # 91.6 0.015 9.3E-06 30.6 23.2 435 1-520 1-463 (466) 170 protein:vir:1884 Length: 424 # 91.5 0.015 9.5E-06 30.5 28.1 401 1-518 8-424 (424) 171 protein:vir:97171 Length: 512 90.7 0.02 1.2E-05 30.0 29.2 429 8-520 1-497 (512) 172 protein:vir:81218 Length: 423 90.4 0.021 1.3E-05 29.8 28.0 407 12-520 1-421 (423) 173 protein:vir:99853 Length: 488 89.4 0.026 1.6E-05 29.2 25.2 393 18-520 1-401 (488) 174 protein:vir:9568 Length: 410 # 87.9 0.036 2.2E-05 28.5 27.3 396 9-511 1-410 (410) 175 protein:vir:103219 Length: 201 87.5 0.037 2.3E-05 28.4 9.6 189 283-518 1-201 (201) 176 protein:vir:96266 Length: 474 84.7 0.059 3.6E-05 27.3 28.0 402 30-520 1-473 (474) 177 protein:vir:95899 Length: 474 84.7 0.059 3.6E-05 27.3 28.0 402 30-520 1-473 (474) 178 protein:vir:100328 Length: 346 83.9 0.065 4E-05 27.1 22.2 316 27-454 1-346 (346) 179 protein:vir:96839 Length: 474 83.5 0.068 4.2E-05 27.0 29.0 394 45-520 1-467 (474) 180 protein:vir:102330 Length: 451 83.0 0.072 4.5E-05 26.8 31.6 385 46-517 1-451 (451) 181 protein:vir:9751 Length: 422 # 82.8 0.074 4.6E-05 26.8 27.3 409 3-510 1-422 (422) 182 protein:vir:94742 Length: 409 80.5 0.094 5.9E-05 26.2 28.6 396 3-495 1-409 (409) 183 protein:vir:107880 Length: 491 79.0 0.11 6.8E-05 25.9 24.6 407 3-520 1-420 (491) 184 protein:vir:3868 Length: 417 # 78.7 0.11 7E-05 25.8 26.8 381 9-520 1-407 (417) 185 protein:vir:1661 Length: 378 # 76.6 0.13 8.3E-05 25.4 23.5 356 12-519 1-378 (378) 186 protein:vir:8317 Length: 409 # 74.3 0.16 9.9E-05 25.0 25.7 383 12-520 1-404 (409) 187 protein:vir:1634 Length: 409 # 70.4 0.21 0.00013 24.3 27.7 398 3-495 1-409 (409) 188 protein:vir:108215 Length: 469 54.6 0.5 0.00031 22.2 27.6 421 27-520 1-466 (469) 189 protein:vir:267 Length: 348 # 50.3 0.61 0.00038 21.7 24.0 338 16-470 1-348 (348) 190 protein:vir:93867 Length: 378 45.2 0.78 0.00048 21.2 24.1 354 12-518 1-378 (378) 191 protein:vir:98567 Length: 340 44.9 0.79 0.00049 21.2 24.6 325 1-441 1-340 (340) 192 protein:vir:1431 Length: 419 # 35.8 1.2 0.00075 20.1 28.2 390 3-520 1-414 (419) 193 protein:vir:78083 Length: 537 30.5 1.6 0.00098 19.5 27.0 410 34-520 1-501 (537) 194 protein:vir:79511 Length: 448 30.0 1.6 0.001 19.4 23.5 422 15-520 1-433 (448) 195 protein:vir:5691 Length: 344 # 25.6 2 0.0013 18.9 22.2 332 1-451 1-344 (344) No 1 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=100.00 E-value=1.8e-286 Score=1587.13 Aligned_cols=518 Identities=65% Similarity=1.091 Sum_probs=506.9 Q ss_pred cc-cccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeeccc--ccccccccccccccccccchhHHHHHHHHH Q lcl|NC_018087. 3 ML-ADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQ--DIAYNGVFQKLYGSQDPTATSTRELINTYR 79 (520) Q Consensus 3 ~~-~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~--~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR 79 (520) |+ |+++|+|||||+++++++++++++++++|++||+++|||++|+++ .++++|++|++|+++++.++|+++||++|| T Consensus 1 ~~~~~~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR 80 (524) T protein:vir:10 1 MANFNTILSFLKPWANEDEKEYKQQINNNLESVTAPKLDDGAREIETQEQNIPYNALMQQMFGSNEPEVKNTRELIDTYR 80 (524) T ss_pred CCchhhHHHHhhhhhcchhhhhhhhhccCCCccccCCCCCCceeeccCcccccchhhhhhhhhcccchhhhHHHHHHHHH Confidence 43 789999999999999999999999999999999999999999997 446678888999999999999999999999 Q ss_pred HHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEE Q lcl|NC_018087. 80 SLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHK 159 (520) Q Consensus 80 ~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hk 159 (520) +||+|||||+||++|||||||||++++||+|+|++++||++||++|+|||++||+||+|+++||++||+||||||+|||| T Consensus 81 ~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~Ld~~~~s~siK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHk 160 (524) T protein:vir:10 81 NLMNNYEVDNAVQEIVSDAIVYEDDKEVVALNLDGTDFSQSIKDKILAEFSEVLNLLNFQRKGTDHFQRWYVDSRIFFHK 160 (524) T ss_pred HHhhccchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeceEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeec Q lcl|NC_018087. 160 IINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHS 239 (520) Q Consensus 160 vid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hS 239 (520) |||+++||+||+|||+||||+|++||++.+++.+|+.+++++.|||+|+|+.++|+++++.++++++||||++||||||| T Consensus 161 iid~~~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~~~~~~~~~~~~~~ikI~~dAIvy~~S 240 (524) T protein:vir:10 161 IINPKKMKDGVQELRRLDPRQVQYIREIVTRMEDGVKIVDGYREFFVYDTGHESYCADGRIYSAGTKVKIPRAAVVYAHS 240 (524) T ss_pred EeeCCCccccceeeeeeCCccceeeeeecccCcccchhhcchhhheeecCCCcccccCcceecCCcceecchhheeeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCc Q lcl|NC_018087. 240 GLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGK 319 (520) Q Consensus 240 GL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGe 319 (520) ||+|||++.++||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+ T Consensus 241 GL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnlPk~KAeqYl~~im~k~kNKlvYDa~TGe 320 (524) T protein:vir:10 241 GLLDCCGKNIIGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRVFYIDTGNMPSRKAAAQMQHIMNTMKNRVVYDASTGK 320 (524) T ss_pred CcccCCCCceeccchHhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeccCCe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCC-ccccccccchhhHHH Q lcl|NC_018087. 320 VKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQ-TQNVFDMSTAISRDE 398 (520) Q Consensus 320 v~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~-~~~~~G~~~eItRDE 398 (520) |+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++ +++.+||++|||||| T Consensus 321 v~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~f~~gr~~EItRDE 400 (524) T protein:vir:10 321 IKNQQHNMSMTEDYWLQRRDGKAVTEVDTMPGATGMSDMDDVLYFRTALYRALRIPESRIPSESNSGVMFDAGTAITRDE 400 (524) T ss_pred eccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCchhccCCCCccccccccchhhHHH Confidence 9999999999999999999999999999999999999999999999999999999999998765 345569999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchh Q lcl|NC_018087. 399 LSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKY 478 (520) Q Consensus 399 lkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky 478 (520) +||+|||+|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++|||||| T Consensus 401 iKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky 480 (524) T protein:vir:10 401 LKFAKWIRQLQNKFEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRINMLTMAEPFIGKY 480 (524) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 479 ISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 479 ~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) ||++||||+||+|||+||++++|||++|+++|+|++|++|+= T Consensus 481 ~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~ 522 (524) T protein:vir:10 481 ISHQTAMKDFLQMTDEEINQEAKQIEEESKEARFQNPDEEEE 522 (524) T ss_pred chhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCChhhh Confidence 999999999999999999999999999999999999988776 No 2 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=100.00 E-value=6e-286 Score=1584.26 Aligned_cols=517 Identities=66% Similarity=1.097 Sum_probs=506.9 Q ss_pred cccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeeccccc----ccccccccccccccccchhHHHHHHHH Q lcl|NC_018087. 3 MLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDI----AYNGVFQKLYGSQDPTATSTRELINTY 78 (520) Q Consensus 3 ~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~----a~~g~~~~~~~~~~~~~~~~~~LI~~Y 78 (520) |.| +.|+|||||+++|++++++++++.++|++||+++|||++++++.. +++|.+|++|+|++++++|+++||++| T Consensus 1 m~f-~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~Y 79 (523) T protein:vir:68 1 MKF-NILSLFAPWAKMDERDYKDQEKENLESITSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLKSTRELIDTY 79 (523) T ss_pred CCC-chhhhhhhhhhhhhhhhhhhhhccCCCccccCCCCcceeeeccccccccccchhhhhhhhccccccchHHHHHHHH Confidence 888 599999999999999999999999999999999999999997622 457888889999999999999999999 Q ss_pred HHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEE Q lcl|NC_018087. 79 RSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFH 158 (520) Q Consensus 79 R~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~h 158 (520) |+||+|||||+||++|||||||||++++||+|+|++++||++||++|+|||++||++|+|+++||++||+||||||+||| T Consensus 80 R~ma~~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fh 159 (523) T protein:vir:68 80 RNLMTNYEVDNAVSEIVSDAIVYEDDTEVVSINLDNTKFSPNIKSMMLDEFNEVLNHLSFQRKGSDHFRRWYVDSRIFFH 159 (523) T ss_pred HHHhhccchhhHHHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhheeeeEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEee Q lcl|NC_018087. 159 KIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAH 238 (520) Q Consensus 159 kvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~h 238 (520) ||||+++||+||+|||+||||+|++||++.++.++|+.++++++|||+|+|..++|+++++.+++|++||||++|||||| T Consensus 160 Kiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~g~~vi~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~h 239 (523) T protein:vir:68 160 KIIDPKRPKEGIKELRRLDPRQVQYVREVITTTEAGVKIVKGYKEYFIYDTSHESYACDGRIYEAGTKIKIPKAAIVYAH 239 (523) T ss_pred EEeeCCCccccceeeeeeCCcceeEEEeecCCCCcchhhhhhhhhheeeccccccccccccccCCCcceecchhheeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCC Q lcl|NC_018087. 239 SGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTG 318 (520) Q Consensus 239 SGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TG 318 (520) |||+|||++.++||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++|| T Consensus 240 SGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNKlvYDa~TG 319 (523) T protein:vir:68 240 SGLVDCCGKNIIGYLHRAIKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPSRKAAEHMQHVMNTMKNRIAYDATTG 319 (523) T ss_pred ccceeCCCCceeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhhcceeEEeccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHH Q lcl|NC_018087. 319 KVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDE 398 (520) Q Consensus 319 ev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDE 398 (520) +|+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++|++.|||++|||||| T Consensus 320 ev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDE 399 (523) T protein:vir:68 320 KIKNQQHIMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDVRWFRNALYMALRIPITRIPSDQGGIQFDAGTSITRDE 399 (523) T ss_pred eeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCcceeecCCCcceecccccchhHHH Confidence 99999999999999999999999999999999999999999999999999999999999999887655679999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchh Q lcl|NC_018087. 399 LSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKY 478 (520) Q Consensus 399 lkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky 478 (520) +||+|||+|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++|||||| T Consensus 400 ikF~KFI~rLR~rFs~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky 479 (523) T protein:vir:68 400 LSFGKFIRELQHKFEEIFLDPLKTNLILKGIITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINMLQMAEPFIGKY 479 (523) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 479 ISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 479 ~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) ||++||||+||+|||+||++++|||++|+++|+|++|++|+= T Consensus 480 ~s~~yi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~e~~ 521 (523) T protein:vir:68 480 ISHRTAMKDILQMSDEEIEQEAKQIEEESKEARFQDPDQEQE 521 (523) T ss_pred chhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhh Confidence 999999999999999999999999999999999999999876 No 3 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=100.00 E-value=1.2e-283 Score=1571.75 Aligned_cols=517 Identities=65% Similarity=1.086 Sum_probs=504.3 Q ss_pred cccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccc--c--ccccccccccccccccchhHHHHHHHH Q lcl|NC_018087. 3 MLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQD--I--AYNGVFQKLYGSQDPTATSTRELINTY 78 (520) Q Consensus 3 ~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~--~--a~~g~~~~~~~~~~~~~~~~~~LI~~Y 78 (520) |-| ++|+||+||+++|+.+++++++.+++|++||+++|||.++++.. . .++|.++.++++++++++|+++||++| T Consensus 1 m~~-~~L~~~~~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~Y 79 (524) T protein:vir:10 1 MKF-NVLSLFAPWAKMDERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTY 79 (524) T ss_pred CCC-chhhHhhccccCcchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHH Confidence 777 59999999999999999999999999999999999999997642 2 356777888888999999999999999 Q ss_pred HHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEE Q lcl|NC_018087. 79 RSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFH 158 (520) Q Consensus 79 R~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~h 158 (520) |+||+|||||+||++|||||||||++++||+|+|+++++|++||++|+|||++||++|+|+++||++||+||||||+||| T Consensus 80 R~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fh 159 (524) T protein:vir:10 80 RNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFNDVLNHLSFQRKGSDHFRRWYVDSRIFFH 159 (524) T ss_pred HHHhhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEee Q lcl|NC_018087. 159 KIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAH 238 (520) Q Consensus 159 kvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~h 238 (520) ||||+++||+||+|||+||||+|++||++.+++.+|+.++++++|||+|+|+.++|+++++.++++++||||++|||||| T Consensus 160 Kiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~dAI~y~h 239 (524) T protein:vir:10 160 KIIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKAAIVYAH 239 (524) T ss_pred EEeeCCCccccceeeeeeCCccceeeeeeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecchhheeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCC Q lcl|NC_018087. 239 SGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTG 318 (520) Q Consensus 239 SGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TG 318 (520) |||+|||++.++||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++|| T Consensus 240 SGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TG 319 (524) T protein:vir:10 240 SGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTG 319 (524) T ss_pred ccceeCCCCceeccchhhhHHHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCC-ccccccccchhhHH Q lcl|NC_018087. 319 KVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQ-TQNVFDMSTAISRD 397 (520) Q Consensus 319 ev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~-~~~~~G~~~eItRD 397 (520) +|+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++ +++.|||++||||| T Consensus 320 ev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRD 399 (524) T protein:vir:10 320 KIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDVRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRD 399 (524) T ss_pred eeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHH Confidence 99999999999999999999999999999999999999999999999999999999999998775 44556999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccch Q lcl|NC_018087. 398 ELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGK 477 (520) Q Consensus 398 ElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgk 477 (520) |+||+|||+|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||| T Consensus 400 EikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGk 479 (524) T protein:vir:10 400 ELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINMLTMAEPFIGK 479 (524) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 478 YISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 478 y~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) |||++||||+||+|||+||++++|||++|+++|+|++|++||= T Consensus 480 y~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~ 522 (524) T protein:vir:10 480 YISHRTAMKDILQMTDEEIEQEAKQIEEESKEARFQDPDQEQE 522 (524) T ss_pred cchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhh Confidence 9999999999999999999999999999999999999999876 No 4 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=100.00 E-value=1.3e-283 Score=1571.46 Aligned_cols=517 Identities=65% Similarity=1.084 Sum_probs=504.2 Q ss_pred cccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccc--c--ccccccccccccccccchhHHHHHHHH Q lcl|NC_018087. 3 MLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQD--I--AYNGVFQKLYGSQDPTATSTRELINTY 78 (520) Q Consensus 3 ~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~--~--a~~g~~~~~~~~~~~~~~~~~~LI~~Y 78 (520) |-| ++|+||+||+++|+.+++++++.+++|++||+++|||.++++.. . .++|.++.++++++++++|+++||++| T Consensus 1 m~~-~~L~~~~~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~Y 79 (524) T protein:vir:72 1 MKF-NVLSLFAPWAKMDERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTY 79 (524) T ss_pred CCC-chhhHhhccccCcchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHH Confidence 777 59999999999999999999999999999999999999997642 2 356777888888999999999999999 Q ss_pred HHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEE Q lcl|NC_018087. 79 RSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFH 158 (520) Q Consensus 79 R~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~h 158 (520) |+||+|||||+||++|||||||||++++||+|+|+++++|++||++|+|||++||++|+|+++||++||+||||||+||| T Consensus 80 R~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fh 159 (524) T protein:vir:72 80 RNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFSDVLNHLSFQRKGSDHFRRWYVDSRIFFH 159 (524) T ss_pred HHHhhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEee Q lcl|NC_018087. 159 KIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAH 238 (520) Q Consensus 159 kvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~h 238 (520) ||||+++||+||+|||+||||+|++||++.+++.+|+.++++++|||+|+|+.++|+++++.++++++||||++|||||| T Consensus 160 Kiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~dAI~y~h 239 (524) T protein:vir:72 160 KIIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKAAVVYAH 239 (524) T ss_pred EEEeCCCccccceeeeeeCCccceeeeeeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecchhheeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCC Q lcl|NC_018087. 239 SGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTG 318 (520) Q Consensus 239 SGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TG 318 (520) |||+|||++.++||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++|| T Consensus 240 SGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TG 319 (524) T protein:vir:72 240 SGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTG 319 (524) T ss_pred ccceeCCCCceeccchhhhHhHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCC-ccccccccchhhHH Q lcl|NC_018087. 319 KVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQ-TQNVFDMSTAISRD 397 (520) Q Consensus 319 ev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~-~~~~~G~~~eItRD 397 (520) +|+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++ +++.|||++||||| T Consensus 320 ev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRD 399 (524) T protein:vir:72 320 KIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDIRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRD 399 (524) T ss_pred eeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHH Confidence 99999999999999999999999999999999999999999999999999999999999998775 44556999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccch Q lcl|NC_018087. 398 ELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGK 477 (520) Q Consensus 398 ElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgk 477 (520) |+||+|||+|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||| T Consensus 400 EikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGk 479 (524) T protein:vir:72 400 ELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINMLTMAEPFIGK 479 (524) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 478 YISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 478 y~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) |||++||||+||+|||+||++++|||++|+++|+|++|++|+= T Consensus 480 y~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~ 522 (524) T protein:vir:72 480 YISHRTAMKDILQMTDEEIEQEAKQIEEESKEARFQDPDQEQE 522 (524) T ss_pred cchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhh Confidence 9999999999999999999999999999999999999999866 No 5 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=100.00 E-value=1.5e-283 Score=1571.05 Aligned_cols=513 Identities=54% Similarity=0.898 Sum_probs=502.6 Q ss_pred ccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeeccccc-ccccccccccccccccchhHHHHHHHHHHHhhc Q lcl|NC_018087. 6 DSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDI-AYNGVFQKLYGSQDPTATSTRELINTYRSLLNN 84 (520) Q Consensus 6 ~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~-a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~ 84 (520) -+.|+|||||+++|+.+++++++++++||+||+++|||++|+++.. ++.||++++|+++++.++|+++||++||+||+| T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~ 80 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINN 80 (516) T ss_pred CCchHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhc Confidence 4678999999999999999999999999999999999999998743 446777777779999999999999999999999 Q ss_pred cchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecC Q lcl|NC_018087. 85 YEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPN 164 (520) Q Consensus 85 pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~ 164 (520) ||||+||++|||||||||++++||+|+|+++++|++||++|+|||++|++||+|+++||++||+||||||+|||||+| T Consensus 81 pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid-- 158 (516) T protein:vir:10 81 PEVERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMP-- 158 (516) T ss_pred cchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEec-- Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred CCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccccC Q lcl|NC_018087. 165 RPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDC 244 (520) Q Consensus 165 ~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~ 244 (520) |||+||+|||+||||+|++||++.++..+|+.+++++++||+|+|+...|+.+++.|+++++||||+||||||||||+|| T Consensus 159 ~~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSGL~d~ 238 (516) T protein:vir:10 159 NPKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSAVVYASSGLMDC 238 (516) T ss_pred CccccceeeeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeechhheeeecccceeC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCcccccc Q lcl|NC_018087. 245 CGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQA 324 (520) Q Consensus 245 ~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~ 324 (520) |++.++||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+|+|++ T Consensus 239 ~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddr 318 (516) T protein:vir:10 239 SDRGIIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQK 318 (516) T ss_pred CCCceeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccch Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCcccc-ccccchhhHHHHHHHH Q lcl|NC_018087. 325 NMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNV-FDMSTAISRDELSFDK 403 (520) Q Consensus 325 ~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~-~G~~~eItRDElkF~K 403 (520) |+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++++++ |||++||||||+||+| T Consensus 319 k~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~K 398 (516) T protein:vir:10 319 RNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRK 398 (516) T ss_pred hhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHH Confidence 999999999999999999999999999999999999999999999999999999999998876 6999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHH Q lcl|NC_018087. 404 FISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHT 483 (520) Q Consensus 404 FI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~ 483 (520) ||+|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||||||++| T Consensus 399 FI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~y 478 (516) T protein:vir:10 399 FVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDY 478 (516) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 484 AMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 484 i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) |||+||+|||+||++++|||++|+++|+|++|++|+= T Consensus 479 i~k~ILr~tDeei~~e~k~I~~E~~~~~~~~p~~~~~ 515 (516) T protein:vir:10 479 VMKNILQMTEEQIAQEEKQIEQEAGIKRFQNPENEDD 515 (516) T ss_pred HHHHHhcCCHhhHHHHHHHHHHhhhCCCCCCCCcccc Confidence 9999999999999999999999999999999988877 No 6 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=100.00 E-value=1.5e-283 Score=1571.05 Aligned_cols=513 Identities=54% Similarity=0.898 Sum_probs=502.6 Q ss_pred ccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeeccccc-ccccccccccccccccchhHHHHHHHHHHHhhc Q lcl|NC_018087. 6 DSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDI-AYNGVFQKLYGSQDPTATSTRELINTYRSLLNN 84 (520) Q Consensus 6 ~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~-a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~ 84 (520) -+.|+|||||+++|+.+++++++++++||+||+++|||++|+++.. ++.||++++|+++++.++|+++||++||+||+| T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~ 80 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINN 80 (516) T ss_pred CCchHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhc Confidence 4678999999999999999999999999999999999999998743 446777777779999999999999999999999 Q ss_pred cchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecC Q lcl|NC_018087. 85 YEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPN 164 (520) Q Consensus 85 pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~ 164 (520) ||||+||++|||||||||++++||+|+|+++++|++||++|+|||++|++||+|+++||++||+||||||+|||||+| T Consensus 81 pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid-- 158 (516) T protein:vir:10 81 PEVERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMP-- 158 (516) T ss_pred cchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEec-- Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred CCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccccC Q lcl|NC_018087. 165 RPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDC 244 (520) Q Consensus 165 ~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~ 244 (520) |||+||+|||+||||+|++||++.++..+|+.+++++++||+|+|+...|+.+++.|+++++||||+||||||||||+|| T Consensus 159 ~~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSGL~d~ 238 (516) T protein:vir:10 159 NPKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSAVVYASSGLMDC 238 (516) T ss_pred CccccceeeeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeechhheeeecccceeC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCcccccc Q lcl|NC_018087. 245 CGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQA 324 (520) Q Consensus 245 ~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~ 324 (520) |++.++||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+|+|++ T Consensus 239 ~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddr 318 (516) T protein:vir:10 239 SDRGIIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQK 318 (516) T ss_pred CCCceeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccch Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCcccc-ccccchhhHHHHHHHH Q lcl|NC_018087. 325 NMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNV-FDMSTAISRDELSFDK 403 (520) Q Consensus 325 ~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~-~G~~~eItRDElkF~K 403 (520) |+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++++++ |||++||||||+||+| T Consensus 319 k~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~K 398 (516) T protein:vir:10 319 RNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRK 398 (516) T ss_pred hhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHH Confidence 999999999999999999999999999999999999999999999999999999999998876 6999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHH Q lcl|NC_018087. 404 FISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHT 483 (520) Q Consensus 404 FI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~ 483 (520) ||+|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||||||++| T Consensus 399 FI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~y 478 (516) T protein:vir:10 399 FVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDY 478 (516) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 484 AMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 484 i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) |||+||+|||+||++++|||++|+++|+|++|++|+= T Consensus 479 i~k~ILr~tDeei~~e~k~I~~E~~~~~~~~p~~~~~ 515 (516) T protein:vir:10 479 VMKNILQMTEEQIAQEEKQIEQEAGIKRFQNPENEDD 515 (516) T ss_pred HHHHHhcCCHhhHHHHHHHHHHhhhCCCCCCCCcccc Confidence 9999999999999999999999999999999988877 No 7 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=100.00 E-value=2.7e-283 Score=1569.70 Aligned_cols=518 Identities=50% Similarity=0.870 Sum_probs=503.3 Q ss_pred Cccccc-cchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeeccc--ccccccccccccccccccchhHHHHHHH Q lcl|NC_018087. 1 MSMLAD-SDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQ--DIAYNGVFQKLYGSQDPTATSTRELINT 77 (520) Q Consensus 1 ~~~~~~-~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~--~~a~~g~~~~~~~~~~~~~~~~~~LI~~ 77 (520) |.|++. ++|+||+||.++|+++++++++++++|++||+++|||.+|+++ +++++|+++++|+++++.++|+++||++ T Consensus 1 ~~~~~~~~~l~~~~~~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~y~~~e~~~~~~~eLI~~ 80 (524) T protein:vir:98 1 MNFLGFGNVLSFFKNFAREDEIELEQQLKNDTGSVAPPKNNDGAYEIETDLNNQKYAGVFQQFYSGQDPAIQNKEQLINT 80 (524) T ss_pred CCCcchhhHHHHhhhhhhhhhhhHhhhhcCCcccccCCCCCCCceeecCCCCcceecceeeeeccccccccchHHHHHHH Confidence 888865 9999999999999999999999999999999999999999986 5677899999999999999999999999 Q ss_pred HHHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeE Q lcl|NC_018087. 78 YRSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFF 157 (520) Q Consensus 78 YR~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~ 157 (520) ||+||+|||||+||++||||||||+++++||+|+|++++||++||++|+|||++||+||+|+++||++||+||||||+|| T Consensus 81 YR~ma~~pEvd~Av~eIVneaIv~~~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~f 160 (524) T protein:vir:98 81 YRGIMSYPEVENAVSEIIDDAIVNEQGKDIITMDLAKTNFSKAIQDKIVEEFDNVLNIYDFDNMGARLFRDWYVDSRIYF 160 (524) T ss_pred HHHHhhccchhhHHHhhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceeEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeecCCCCCCeeeeEecCccceeeeeeccCCC-CcccccccceecceeecCcccccccccceecCCcceecCcccEEE Q lcl|NC_018087. 158 HKIINPNRPKDGIIELRRLDPRNVQFVRELDTKM-ENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVY 236 (520) Q Consensus 158 hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~-~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y 236 (520) ||||| ++|++||+|||+||||+|++||++.++. .+|+.+++++.|||+|+|...+|+.+++.|++|++||||++|||| T Consensus 161 hkiid-~~~~kGI~ELr~lDPr~i~~vr~~~~~~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dAIvy 239 (524) T protein:vir:98 161 HKIMH-KDESKGIRELRQLDPRCMELIRESITETLDGGVKVFRGYREFFVYSAPKAGYTYNGQIYQANQKIKIPRSAIVY 239 (524) T ss_pred EEEEc-CCCCcceeeeeeeCCccceeeeeccccccccchhhccceeeeeeeccCCCccccccceecCCCceeechhheee Confidence 99999 6677899999999999999999988887 578899999999999999988999999999999999999999999 Q ss_pred eecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecC Q lcl|NC_018087. 237 AHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDAR 316 (520) Q Consensus 237 ~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~ 316 (520) |||||+||+++ ++||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+||||+ T Consensus 240 ~hSGL~d~~~~-iisyLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~ 318 (524) T protein:vir:98 240 AHSGLEDCSNN-IIGYLHRAVKPANQLRLLEDAMVIYRITRAPERRVFYIDVGQMGGNKATQYVNNIAQGLKNRVVYDAR 318 (524) T ss_pred eccCcccCCCC-eeeehhHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecc Confidence 99999999975 67999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhH Q lcl|NC_018087. 317 TGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISR 396 (520) Q Consensus 317 TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItR 396 (520) ||+|+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+.+++++.+||++|||| T Consensus 319 TGevrddrk~msMlEDyWLpRReGgrgTEItTLpggqnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItR 398 (524) T protein:vir:98 319 TGTVKNQQNNLSMTEDYWLMRRDGKAITEVSTLPGGQNFSDMDDIKWFNRKLYEALRVPLSRMPRDDGGMQIGGGGEITR 398 (524) T ss_pred CceeeccccccchhhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCceeccCCCCccccccccchhH Confidence 99999999999999999999999999999999999999999999999999999999999999986544444599999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_018087. 397 DELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIG 476 (520) Q Consensus 397 DElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vg 476 (520) ||+||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++|||| T Consensus 399 DEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvG 478 (524) T protein:vir:98 399 DELKFSKFIRTLQIQFSPVLSDPLKTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQVEGVVG 478 (524) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 477 KYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 477 ky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) ||||++||||+||+|||+||++++|||++|+++|+|++|++|+= T Consensus 479 ky~s~dyi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~p~~e~~ 522 (524) T protein:vir:98 479 KYVSHKYIMKEILRMSDEDIDEQAKLIEEESKEERFKNPEAEEE 522 (524) T ss_pred cccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCcCCccccc Confidence 99999999999999999999999999999999999999998877 No 8 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=100.00 E-value=1.5e-282 Score=1565.68 Aligned_cols=513 Identities=54% Similarity=0.910 Sum_probs=501.5 Q ss_pred ccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccc-cccccccccccccccchhHHHHHHHHHHHhhc Q lcl|NC_018087. 6 DSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIA-YNGVFQKLYGSQDPTATSTRELINTYRSLLNN 84 (520) Q Consensus 6 ~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a-~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~ 84 (520) -+.|+|||||.++|+.+++++++++++||+||+++|||++++++..+ ..||+++.|+|+++.++++++||++||+||+| T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~d~~~~~~~~~~LI~~YR~ma~~ 80 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKQGHESIATPKKDDGATEIEAREGESSYNALMQQFFGIDNNISGTKDLINTYRQLTNN 80 (516) T ss_pred CCchHhcccccchhhHHHHhhhcCCCCcccCCCCccCceeeecCcccccccceeeeeecccCccccHHHHHHHHHHhhhc Confidence 46789999999999999999999999999999999999999997443 34666666669999999999999999999999 Q ss_pred cchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecC Q lcl|NC_018087. 85 YEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPN 164 (520) Q Consensus 85 pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~ 164 (520) ||||+||++|||||||||++++||+|+|+++++|++||++|+|||++|++||+|+++||++||+||||||+|||||+| T Consensus 81 pEvd~Av~eIvneaiv~d~~~~pV~l~l~~~e~s~sik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid-- 158 (516) T protein:vir:10 81 PEVERAVANIVNEAVVYEKGHKVVSLDLDDTEFSSSIKDKILEEFDEICRLLDASRKLDTLFRRWYIDSRIFFHKIMP-- 158 (516) T ss_pred cchhHHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhhhhcceEEEEEEec-- Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred CCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccccC Q lcl|NC_018087. 165 RPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDC 244 (520) Q Consensus 165 ~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~ 244 (520) |||+||+|||+||||+|++||++.+++.+|+.++++++|||+|.++.++|+.+++.|+++++||||++|||||||||+|| T Consensus 159 ~~k~GI~elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~daI~y~hSGl~d~ 238 (516) T protein:vir:10 159 NPKEGIVELRRLDPRHVEYYREIVTSDVGGTSVVKGYREFFVYTTGNEGYAYNGRLFEPNTRIKIPRSAIVYAHSGLQDC 238 (516) T ss_pred CcccceeeeeeeCCcceeeEEeeecccCcchhhhhceeeeeeeecCccceeccccccCCCCceecchhheeeeecCcccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCcccccc Q lcl|NC_018087. 245 CGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQA 324 (520) Q Consensus 245 ~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~ 324 (520) |++.++||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+|+|++ T Consensus 239 ~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~KNklvYDa~TGev~ddr 318 (516) T protein:vir:10 239 SDRGIVGYLHNAVKPANQLKLLEDALVIYRITRAPERRVFYIDVGNMPNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQK 318 (516) T ss_pred CCCceeceehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccch Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCcccc-ccccchhhHHHHHHHH Q lcl|NC_018087. 325 NMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNV-FDMSTAISRDELSFDK 403 (520) Q Consensus 325 ~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~-~G~~~eItRDElkF~K 403 (520) |+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++++++ |||++||||||+||+| T Consensus 319 k~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItRDEiKF~K 398 (516) T protein:vir:10 319 RNLSMTEDYWLMRRDGKSVTEVTSLPGAQTMGEMDDVRWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITRDELDFRK 398 (516) T ss_pred hhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHH Confidence 999999999999999999999999999999999999999999999999999999999998876 6999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHH Q lcl|NC_018087. 404 FISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHT 483 (520) Q Consensus 404 FI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~ 483 (520) ||+|||+|||.+|.++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||||||++| T Consensus 399 FI~rLR~rFs~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~~y 478 (516) T protein:vir:10 399 FIVQLQHNFEEIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVGKYVSHDY 478 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 484 AMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 484 i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) |||+||+|||+||++++|||++|+++|+|++|++|+= T Consensus 479 i~k~ILr~tDeei~~~~k~I~~E~~~~~~~~p~~e~~ 515 (516) T protein:vir:10 479 VMKNILQMTDEQIAQEEKQIEKEANVKRFQNPENEDD 515 (516) T ss_pred HHHHHhcCCHhHHHHHHHHHHHhhhCCCCCCCCcccc Confidence 9999999999999999999999999999999998877 No 9 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=100.00 E-value=8.7e-282 Score=1561.45 Aligned_cols=515 Identities=54% Similarity=0.912 Sum_probs=502.5 Q ss_pred cccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeeccc---ccccccccccccccccccchhHHHHHHHHH Q lcl|NC_018087. 3 MLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQ---DIAYNGVFQKLYGSQDPTATSTRELINTYR 79 (520) Q Consensus 3 ~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~---~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR 79 (520) |. .+|+||+||..++.++++++++++++||+||+++|||.+++++ +.+..||++++|+|++++++|++|||++|| T Consensus 1 ~~--~~l~~~~~~~~~~~~~~~~~~~~~~~s~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~~~~~~~eLI~~YR 78 (521) T protein:vir:81 1 MF--SRLKMLARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNLPASAWNSLTQQFYSTDQKISTTKQLVNTYR 78 (521) T ss_pred Cc--chhhhhHhhcCchhhhHHhhhccCccccccCCCCCCceEecccCCCcceeecceeeeecccccchhhHHHHHHHHH Confidence 43 6899999999999999999999999999999999999999985 446679999999999999999999999999 Q ss_pred HHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEE Q lcl|NC_018087. 80 SLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHK 159 (520) Q Consensus 80 ~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hk 159 (520) +||+|||||+||++|||||||||++++||+|+|+++++|++||+||+|||++||+||+|+++||++||+||||||+|||| T Consensus 79 ~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhk 158 (521) T protein:vir:81 79 GLMNNHEVENAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHK 158 (521) T ss_pred HHhhccchhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeec Q lcl|NC_018087. 160 IINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHS 239 (520) Q Consensus 160 vid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hS 239 (520) ||| ++||+||+|||+||||+|++||++.++..+++.++++++|||+|+|+...|+.+++.++++++||||++||||||| T Consensus 159 iid-~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~hS 237 (521) T protein:vir:81 159 IIG-KNPKDGIVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYAHS 237 (521) T ss_pred EEc-CCccccceeeeeeCCcceeeeeeecccccCccceecceeeeeeeecCCccccccceeecCCcceeechhheeeeec Confidence 999 8999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCc Q lcl|NC_018087. 240 GLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGK 319 (520) Q Consensus 240 GL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGe 319 (520) ||+|||++.++||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+ T Consensus 238 Gl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~im~k~kNklvYDa~TGe 317 (521) T protein:vir:81 238 GLMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGK 317 (521) T ss_pred cceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeeccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCc-cccccccchhhHHH Q lcl|NC_018087. 320 VKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQT-QNVFDMSTAISRDE 398 (520) Q Consensus 320 v~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~-~~~~G~~~eItRDE 398 (520) |+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++++ ++.+||++|||||| T Consensus 318 v~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDE 397 (521) T protein:vir:81 318 LKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDE 397 (521) T ss_pred ccccccccchhhhhcccccCCCcccceeecccCCCCChHHHHHHHHHHHHHHhCCccccccCCCCcceeccccchhhHHH Confidence 99999999999999999999999999999999999999999999999999999999999965543 23349999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchh Q lcl|NC_018087. 399 LSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKY 478 (520) Q Consensus 399 lkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky 478 (520) +||+|||+|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++|||||| T Consensus 398 iKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky 477 (521) T protein:vir:81 398 LEFSKFIRTRQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKY 477 (521) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 479 ISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 479 ~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) ||++||||+||+|||+||++++|||++|+++|+|++|++++= T Consensus 478 ~s~dyi~k~ILr~tDeei~~~~k~I~~E~~~~~~~~p~~~~~ 519 (521) T protein:vir:81 478 FSNQTVMRDILKYTDDQMDTEKKQIEEEANDPRFKQTPDEIE 519 (521) T ss_pred cchHHHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCCccccc Confidence 999999999999999999999999999999999999998777 No 10 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=100.00 E-value=1.7e-281 Score=1559.90 Aligned_cols=514 Identities=49% Similarity=0.849 Sum_probs=499.3 Q ss_pred cccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeeccc--ccccccccccccccccccchhHHHHHHHHHH Q lcl|NC_018087. 3 MLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQ--DIAYNGVFQKLYGSQDPTATSTRELINTYRS 80 (520) Q Consensus 3 ~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~--~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ 80 (520) |.| ..|+|||||+++++.+++++++++++|++||+++|||++|+++ .++++|++++.|+|.++.++|+++||++||+ T Consensus 1 m~~-~~l~lf~f~~k~~e~~~~~~~~~~~~s~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~~~~n~~eLI~~YR~ 79 (521) T protein:vir:10 1 MNP-IFLKLLQPWMKDDEKRVQSDLSDRIDSFAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAPKIQNTKDLINQYRS 79 (521) T ss_pred CCc-chhHHhhhhhhhhhhHHhhhhccCccccccccCCCCceeeccCCCccccccchhhhhhccccccchHHHHHHHHHH Confidence 777 6999999999999999999999999999999999999999998 4456899999999999999999999999999 Q ss_pred HhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEe Q lcl|NC_018087. 81 LLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKI 160 (520) Q Consensus 81 ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkv 160 (520) ||+|||||+||++|||||||||++++||+|+|+++++|++||++|+|||++||+||+|+++||++||+||||||+||||| T Consensus 80 ma~~pEvd~Av~eIvneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHki 159 (521) T protein:vir:10 80 LSKYHEVDNAIDEIINDAIVQEDNRDTVYLDLDKTDWNESVKEMVREEFRTILKLLKFEREGKRHFRRWYVDSRIYFHKM 159 (521) T ss_pred HhhccchhhHHHhhhcceEEecCCCceEEEEecCcccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeeEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccc-cccccceecCCcceecCcccEEEeec Q lcl|NC_018087. 161 INPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELES-YQCGHQHFAAGTKIKIPYSAMVYAHS 239 (520) Q Consensus 161 id~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~-~~~~~~~~~~~~~~~I~~~aI~y~hS 239 (520) ||+++||+||+|||+||||+|++||++.+++++++.++++++|||+|+|...+ +++++ +++++|+||++||||||| T Consensus 160 id~~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~~g---~~~~~vkI~~daI~y~hS 236 (521) T protein:vir:10 160 IDPARPKDGIKELRLLDPRNVEYYRVNLKSNENGNDVYKGVKEFFTYGATEDNRYNISG---NSNNLVQIPIDAIVYSHS 236 (521) T ss_pred eeCCCccccceeeeeeCCcceeeeeeecCCCCCcchhhccceeeeeeccCCCceecCCC---CCCcceeechhheeeecc Confidence 99999999999999999999999999999999999999999999999987644 33332 468889999999999999 Q ss_pred ccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCc Q lcl|NC_018087. 240 GLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGK 319 (520) Q Consensus 240 GL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGe 319 (520) ||+|||++.++||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+ T Consensus 237 GL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~iM~k~kNklVYDa~TGe 316 (521) T protein:vir:10 237 GKVDIDGKTIVGYLHNVIKPANQLKMLEDAMVIYRITRAPERRVFYIDVGTMPNKKATQHLNNVMQGLKNRVVYDSSTGK 316 (521) T ss_pred cceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHH Q lcl|NC_018087. 320 VKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDEL 399 (520) Q Consensus 320 v~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDEl 399 (520) |+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++++++||+++||||||+ T Consensus 317 v~ddrk~msMlEDyWLpRReGgrgTEI~TLpggqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr~~EItRDEi 396 (521) T protein:vir:10 317 VKNSSNNLAMTEDYWLMRRDGKATTEVSTLPGAQSMGEMDDVRWFNRKLYESMKIPLSRLPQEGAGVTFGAGNDITRDEL 396 (521) T ss_pred eccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCCceecccccchhHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999976566799999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhc--ccch Q lcl|NC_018087. 400 SFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEP--YIGK 477 (520) Q Consensus 400 kF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p--~vgk 477 (520) ||+|||+|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|+++++++|| |||| T Consensus 397 kF~KFI~rLR~rFs~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~yvGk 476 (521) T protein:vir:10 397 QFTKYIRGLQQQFEPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEVTGK 476 (521) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCcccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999 9999 Q ss_pred hhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 478 YISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 478 y~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) |||++||||+||+|||+||++++|||++|+++|+|++|++|+= T Consensus 477 y~s~dyi~k~ILr~tDeeik~~~k~I~~E~~~~~~~~p~~e~~ 519 (521) T protein:vir:10 477 YLSHEYVMKNILRMSDEDIKTEREKIDGELKDSVYKNPEDPME 519 (521) T ss_pred ccchHHHHHHHhcCCHhHHHHHHHHHHHhhhCCCCCCCcchhh Confidence 9999999999999999999999999999999999999999866 No 11 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=100.00 E-value=2.6e-281 Score=1558.83 Aligned_cols=515 Identities=55% Similarity=0.920 Sum_probs=501.1 Q ss_pred cccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccc---cccccccccccccccccchhHHHHHHHHH Q lcl|NC_018087. 3 MLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQD---IAYNGVFQKLYGSQDPTATSTRELINTYR 79 (520) Q Consensus 3 ~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~---~a~~g~~~~~~~~~~~~~~~~~~LI~~YR 79 (520) |. ..|+||+||.++|+++++++++++++|++||+++|||++++++. .+..||++++|+|++++++|++|||++|| T Consensus 1 ~~--~~l~~~~~~~~~d~~~~~e~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~~~~~~~eLI~~YR 78 (521) T protein:vir:65 1 MF--SRLKMLARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQKISTTKQLVNTYR 78 (521) T ss_pred Cc--cchhhhhhccCchhhHHHhhhccCCCcccCCCCCCCceeecccCCccccccccceeeeccccchhhhHHHHHHHHH Confidence 44 57999999999999999999999999999999999999999863 34458999999999999999999999999 Q ss_pred HHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEE Q lcl|NC_018087. 80 SLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHK 159 (520) Q Consensus 80 ~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hk 159 (520) +||+|||||+||++|||||||||++++||+|+|+++++|++||+||+|||++||+||+|+++||++||+||||||+|||| T Consensus 79 ~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhk 158 (521) T protein:vir:65 79 GLMNNHEVENAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHK 158 (521) T ss_pred HHhhccchhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceeEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeec Q lcl|NC_018087. 160 IINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHS 239 (520) Q Consensus 160 vid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hS 239 (520) ||| ++||+||+|||+||||+|++||++.++.++++.++++++|||+|+|+...|+.+++.++++++||||++||||||| T Consensus 159 iid-~~pk~GI~ELr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~hS 237 (521) T protein:vir:65 159 IIG-KNPKDGIVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYAHS 237 (521) T ss_pred EEc-CCccccceeeeeeCCcceeeeeeecccccCCcceecceeeeeeeecCCcceeccceeecCCcceeechhheeeeec Confidence 999 8999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCc Q lcl|NC_018087. 240 GLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGK 319 (520) Q Consensus 240 GL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGe 319 (520) ||+|||++.++||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+ T Consensus 238 Gl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGe 317 (521) T protein:vir:65 238 GLMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGK 317 (521) T ss_pred cceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeeccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCc-cccccccchhhHHH Q lcl|NC_018087. 320 VKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQT-QNVFDMSTAISRDE 398 (520) Q Consensus 320 v~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~-~~~~G~~~eItRDE 398 (520) |+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+.+++ ++.+||++|||||| T Consensus 318 v~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~EItRDE 397 (521) T protein:vir:65 318 LKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDE 397 (521) T ss_pred ccccccccchhhhhcccccCCCCccceeecccCCCcChHHHHHHHHHHHHHHhCCCceeccCCCCcceeccccchhhHHH Confidence 99999999999999999999999999999999999999999999999999999999999865543 33349999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchh Q lcl|NC_018087. 399 LSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKY 478 (520) Q Consensus 399 lkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky 478 (520) +||+|||+|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++|||||| T Consensus 398 iKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky 477 (521) T protein:vir:65 398 LEFSKFIRTLQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKY 477 (521) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 479 ISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 479 ~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) ||++||||+||+|||+||++++|||++|+++|+|++|++++= T Consensus 478 ~S~dyi~k~ILr~tDeei~~~~k~I~~E~~~~~~~~p~~~~~ 519 (521) T protein:vir:65 478 FSNQTVMRDILKYTDDQMDTEKKQIEEEANDPRFKQTPDEIE 519 (521) T ss_pred cchHHHHHHHhccCHHHHHHHHHHHHHhhhCCCCCCCccccc Confidence 999999999999999999999999999999999999998777 No 12 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=100.00 E-value=5.5e-278 Score=1540.60 Aligned_cols=501 Identities=34% Similarity=0.614 Sum_probs=484.3 Q ss_pred chhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccch Q lcl|NC_018087. 8 DLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEV 87 (520) Q Consensus 8 ~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEv 87 (520) .++|||||++.+++ ++++++||+||+++||++.++ +||+++.|+|+++.++|+++||++||+||+|||| T Consensus 1 m~~lfgf~~~~~~~-----~~~~~~s~~~p~~ddg~~~~~------~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEv 69 (558) T protein:vir:10 1 MAKLFGFSIEETQK-----KSTSIISPVPKNNEDGVDNFI------SSGFYGQYVDIEGAYRSEYDLIRRYREMALHPEA 69 (558) T ss_pred Ccchhcchhhhhhh-----hccCCccccCCCcccccccee------ccceeeeeecccchhhhHHHHHHHHHHHhhccch Confidence 67999999997664 488999999999999998775 5888888899999999999999999999999999 Q ss_pred hHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCCC Q lcl|NC_018087. 88 DNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPK 167 (520) Q Consensus 88 d~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k 167 (520) |+||++|||||||||++++||+|+|+++++|++||++|+|||++|++||+|+++||++|||||||||+|||||||++||| T Consensus 70 d~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRiyfHKiid~k~pk 149 (558) T protein:vir:10 70 DGAIEDVVNEAIVSDLYDSPVEVELSNLNASNTLKKKIREEFRYIKEMMDFDKKSHEIFRNWYVDGRVFYLKVIDTKNPQ 149 (558) T ss_pred hhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCeeeeEecCccceeeeeeccCCCCcccc-----------cccceecceeecCcccccccccceecCCcceecCcccEEE Q lcl|NC_018087. 168 DGIIELRRLDPRNVQFVRELDTKMENGVK-----------VVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVY 236 (520) Q Consensus 168 ~GI~elr~lDPr~i~~vr~i~~~~~~~~~-----------~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y 236 (520) +||+|||+||||+|++||++.++..|++. ++.++.|||+|+|+...++..+..++++++||||++|||| T Consensus 150 ~GI~ELr~lDPr~i~~Vr~i~~~~~~~~~~~~~~~~~~~~~~~~~~eyy~Y~~~~~~~~~~~~~~~~~~~vkI~~dAI~y 229 (558) T protein:vir:10 150 EGIQDLRYIDPLKIKFIRQEKRKPGNQDPAIRVRSEQDVVPNPEFEEFYIYTPKVQHPTGMVGQMGGKNSIKIAKDSITM 229 (558) T ss_pred ccceeeeeeCcccceeeeeeccccccccceeeeecccceeeccceeEeeeecCCcccccccceeecCCCceeechhheee Confidence 99999999999999999999999765543 4457889999999988787777778889999999999999 Q ss_pred eecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecC Q lcl|NC_018087. 237 AHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDAR 316 (520) Q Consensus 237 ~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~ 316 (520) |||||+|||++.++||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++ T Consensus 230 ~hSGL~d~~~~~i~syLhkAIKp~NQLkmlEDAlVIYRitRAPERRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~ 309 (558) T protein:vir:10 230 CTSGLVDRNKNRVLSYLHKAIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLKEVMSRYRNKLVYDAN 309 (558) T ss_pred ecccceecCCCeeeecchHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhH Q lcl|NC_018087. 317 TGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISR 396 (520) Q Consensus 317 TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItR 396 (520) ||+|+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++++|++ ||++|||| T Consensus 310 TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnLgem~DV~YF~kKLy~aLnVP~SRl~~e~~f~~-Gr~~EItR 388 (558) T protein:vir:10 310 TGEVRDDRKFMSMMEDFWLPRREGGRGTEITTLPGGQNLGELSDVDYFQKKLYRALGVPESRIAAEGGFNL-GRSSEILR 388 (558) T ss_pred CceecccchhhhhHhhhcccccCCCCccceeeccccCCcchHHHHHHHHHHHHHHhCCCccccCCCCcccc-cccchhhH Confidence 99999999999999999999999999999999999999999999999999999999999999999988776 99999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_018087. 397 DELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIG 476 (520) Q Consensus 397 DElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vg 476 (520) ||+||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++|||| T Consensus 389 DEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvG 468 (558) T protein:vir:10 389 DELKFAKFVGRLRKRFAAMFNDMLKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIG 468 (558) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 477 KYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 477 ky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) ||||++||||+||+|||+||++++|||++|+++|+|++|++++- T Consensus 469 ky~S~dyi~k~ILr~tDeeI~~~~kqI~~E~k~~~~~~p~~~~~ 512 (558) T protein:vir:10 469 KYYSTEYVRKRVLRQTDMEIEEIDTQIEDEIQKGIIPDPSQIDP 512 (558) T ss_pred cccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCccccCh Confidence 99999999999999999999999999999999999999998866 No 13 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=100.00 E-value=1.1e-274 Score=1522.57 Aligned_cols=493 Identities=36% Similarity=0.666 Sum_probs=478.3 Q ss_pred chhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccch Q lcl|NC_018087. 8 DLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEV 87 (520) Q Consensus 8 ~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEv 87 (520) .-+||||.++..++ +++++||+||+++||+.++. +|++++.|++++++++|++|||++||+||+|||| T Consensus 1 m~~lfg~~i~~~~~------~~~~~s~~~~~~~dg~~~i~------~~~~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEv 68 (533) T protein:vir:10 1 MSQLFGFSLERAKK------APKGPSFVQKDNLDGSQPVS------GGGYYGYTVDFDGQVRNEYQLISRYREMVLQPEC 68 (533) T ss_pred Cccccccccccccc------cccCCCCCCCCcccccceee------cccccceeeecccccchHHHHHHHHHHHhhccch Confidence 45899999999866 48899999999999999997 4789999999999999999999999999999999 Q ss_pred hHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCCC Q lcl|NC_018087. 88 DNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPK 167 (520) Q Consensus 88 d~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k 167 (520) |+||++|||||||||++++||+|+|+++++|++||++|+|||++||+||+|+++||++|||||||||+|||||||++||| T Consensus 69 d~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk 148 (533) T protein:vir:10 69 DSAVDDIVNETICGNFDDVPVSVELSNLKVSDKIKKLIREEFGEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPDNPQ 148 (533) T ss_pred hhHHHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCeeeeEecCccceeeeeeccCCCCcccc-------cccceecceeecCcccccccccceecCCcceecCcccEEEeecc Q lcl|NC_018087. 168 DGIIELRRLDPRNVQFVRELDTKMENGVK-------VVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSG 240 (520) Q Consensus 168 ~GI~elr~lDPr~i~~vr~i~~~~~~~~~-------~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSG 240 (520) +||+|||+||||+|++||++.++.+++++ +++++.|||+|+|+. ..++++++||||++|||||||| T Consensus 149 ~GI~ELr~lDPr~i~~vr~i~~~~~~~~~~~~~~~~v~~~~~eyf~Ynp~g-------~~~~~~~~vkI~~dAI~y~hSG 221 (533) T protein:vir:10 149 GGLIELRYIDPRKIRKINETEQKRPEQLRGLPLNQQLSPKSAEYFLYDPKG-------LKNSTTQGLKIAPDSICYVHSG 221 (533) T ss_pred ccceeeeeccccceeeeeeeeccCCCccceeecchhhhccceeeeeecccc-------ccccCCCceecchhheeeeecc Confidence 99999999999999999999999988865 789999999999862 2346788899999999999999 Q ss_pred cccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCcc Q lcl|NC_018087. 241 LVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKV 320 (520) Q Consensus 241 L~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev 320 (520) |+|||++.++||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+| T Consensus 222 l~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev 301 (533) T protein:vir:10 222 IMDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEI 301 (533) T ss_pred ceeCCCCceeccchHhHHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCcee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHH Q lcl|NC_018087. 321 KNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELS 400 (520) Q Consensus 321 ~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElk 400 (520) +|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++++|++ ||++||||||+| T Consensus 302 ~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~DV~YF~kKLY~aLnVP~SRl~~e~~f~~-Gr~~EItRDEiK 380 (533) T protein:vir:10 302 KDDKKFMSMLEDFWLPRREGGRGTEITTLPGGQNLGELEDVKYFQKKLYKSLNVPGSRLETETTFNV-GRAAEITRDEVK 380 (533) T ss_pred cccchhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCcccc-cccchhhHHHHH Confidence 9999999999999999999999999999999999999999999999999999999999999998876 999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhh Q lcl|NC_018087. 401 FDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYIS 480 (520) Q Consensus 401 F~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S 480 (520) |+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++|||||||| T Consensus 381 F~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S 460 (533) T protein:vir:10 381 FQKFVARLRKRFSELFTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFS 460 (533) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 481 NHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 481 ~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) ++||||+||+|||+||++++|||++|+++|+|++|++|.= T Consensus 461 ~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~~~~ 500 (533) T protein:vir:10 461 VEYMRRQVLKQTDVEMKEIDKQIESEMESGIIADPAAEMD 500 (533) T ss_pred hHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCCcchhh Confidence 9999999999999999999999999999999999988733 No 14 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=100.00 E-value=3.4e-273 Score=1514.34 Aligned_cols=504 Identities=43% Similarity=0.756 Sum_probs=479.3 Q ss_pred hcchhhhhhhHHHhhhccCCCcccCCCCCCCceeeccccc--ccccccccccccccccchhHHHHHHHHHHHhhccchhH Q lcl|NC_018087. 12 FAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDI--AYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDN 89 (520) Q Consensus 12 f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~--a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~ 89 (520) |+||.++|+++++++++++++|++||+++|||++|+++.. +.+|+++++|.+.++.++++ |||++||+||+|||||+ T Consensus 1 ~~~w~~~de~~~~~~~~~~~~S~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~-eLI~~YR~ma~~pEvd~ 79 (511) T protein:vir:56 1 MKFWTKEEEQDIQKIEKNPVRSFSAPDNVDGAKEIHTNLLAPQLGHAIIPSDAQSEGTIPVK-ELIKSYRALAEYHEVDD 79 (511) T ss_pred CCCccchhhhhhhhhccCCcccccCCCCCCCceEEecccccceecceeccccccccCccchH-HHHHHHHHHhhccchhh Confidence 9999999999999999999999999999999999999743 45688999999999999885 99999999999999999 Q ss_pred HHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCCCCC Q lcl|NC_018087. 90 AVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDG 169 (520) Q Consensus 90 Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~G 169 (520) ||++|||||||||++++||+|+|++++||++||++|+|||++||+||+|+++||++||+||||||+||||||||+ +| T Consensus 80 Av~eIvne~iv~d~~~~pV~l~ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~k---~G 156 (511) T protein:vir:56 80 AIQEIVDEAIVYENDKEVVWLNLDNTDFSENIKAKINEEFDRVVSLLQMRKHGYKWFRKWYVDSRIYFHKILDKD---NN 156 (511) T ss_pred HHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeccc---cc Confidence 999999999999999999999999999999999999999999999999999999999999999999999999965 49 Q ss_pred eeeeEecCccceeeeeeccCCCCcccccccceecceeecCcc-cccccccceecCCcceecCcccEEEeecccccCC--C Q lcl|NC_018087. 170 IIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTEL-ESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDCC--G 246 (520) Q Consensus 170 I~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~-~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~~--~ 246 (520) |+|||+||||+|++||++.++..+++.+++++.|||+|+|.. +..+..+..+.+++.|+||++|||||||||+||| + T Consensus 157 I~eLr~lDPr~i~~vr~i~~~~~~~~~v~~~~~ey~~Y~~~~~~~~~~~~~~~~~~~~vkI~~daI~y~hSGL~d~~~~~ 236 (511) T protein:vir:56 157 IIELRPLNPMKMELVREIQKETIDGVEVVKGTLEYYVYKQSDYKMPSWMSATNRAQTSFRIPKDAIVFAHSGLMRGCADD 236 (511) T ss_pred eeehhhcCcccchhhhhhhcccccccccccceeeeeEecCCCcccCcccccccccccceeechhheeeecccceeccCCC Confidence 999999999999999999999999999999999999999863 2222222223356779999999999999999965 4 Q ss_pred CcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCcccccccc Q lcl|NC_018087. 247 KNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANM 326 (520) Q Consensus 247 ~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~ 326 (520) +.++||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+|+|++|+ T Consensus 237 g~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~ 316 (511) T protein:vir:56 237 PYIIGYLDRAIKPANQLKMLEDALVIYRLARAPERRVFYVDVGNLPTQKAQQYVNGIMQNVKNRVVYDTQTGQVKNTTNA 316 (511) T ss_pred CeeeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchhh Confidence 46999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred chhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCC--ccccccccchhhHHHHHHHHH Q lcl|NC_018087. 327 MALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQ--TQNVFDMSTAISRDELSFDKF 404 (520) Q Consensus 327 msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~--~~~~~G~~~eItRDElkF~KF 404 (520) ||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++ +++.|||++||||||+||+|| T Consensus 317 msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~EItRDEiKF~KF 396 (511) T protein:vir:56 317 MSMLEDYYLPRREGSKGTEVSTLPGGQSLGDIEDVLYFNRKLYKAMRIPTSRAASEDQTGGINFGQGAEITRDELKFTKF 396 (511) T ss_pred hhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCccccccccchhhhHHHHHHHHH Confidence 999999999999999999999999999999999999999999999999999999764 345569999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHH Q lcl|NC_018087. 405 ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTA 484 (520) Q Consensus 405 I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i 484 (520) |+|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||||||++|| T Consensus 397 I~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~~yi 476 (511) T protein:vir:56 397 VKRLQTKFETVITDPLKHQLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGKYYSHKYI 476 (511) T ss_pred HHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhccccchHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhCCCHHHHHHHHHHHHHhhhcCCccCCcccc Q lcl|NC_018087. 485 MKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEE 519 (520) Q Consensus 485 ~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~ 519 (520) ||+||+|||+||++++|||++|+++++|++|++.= T Consensus 477 ~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~e~~f 511 (511) T protein:vir:56 477 QKNILRLSDDQITAMQSEIDEEETNPRFQQDDQGF 511 (511) T ss_pred HHHHhccCHHHHHHHHHHHHHhhcCCCCCCcccCC Confidence 99999999999999999999999999999987666 No 15 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=100.00 E-value=7e-273 Score=1512.62 Aligned_cols=494 Identities=36% Similarity=0.681 Sum_probs=478.0 Q ss_pred cchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccc Q lcl|NC_018087. 7 SDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYE 86 (520) Q Consensus 7 ~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pE 86 (520) -.-+||||++++.++ .++++|++||+++||+.++. .|++++.|+++++.++|++|||++||+||+||| T Consensus 1 ~~~~lfg~~i~~~~~------~~~~~s~~~~~~~dg~~~~~------~~~~~g~~~~~e~~~~~~~eLI~~YR~ma~~pE 68 (537) T protein:vir:10 1 MAQQLFGFSLQRAKK------VPKGPSFVQKDSLDGSQPIV------GGGYFGYSVDFDGTIRNDHELITRYREMVLNPE 68 (537) T ss_pred Cccccccceeecccc------cccCCcccCCCcccccceee------cccccccccccccccchHHHHHHHHHHHhhccc Confidence 235899999998765 48999999999999999986 588999999999999999999999999999999 Q ss_pred hhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCC Q lcl|NC_018087. 87 VDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRP 166 (520) Q Consensus 87 vd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~ 166 (520) ||+||++|||||||||++++||+|+|+++++|++||++|+|||++||+||+|+++||++|||||||||+|||||||++|| T Consensus 69 vd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fhKiid~k~p 148 (537) T protein:vir:10 69 CDSAVDDVVNETICGNFDDVPISIDLHNLKQSEKIKKLIRSEFDEILRLLDFDNRAYEIFRRWYVDGRLFFHKVIDPKKP 148 (537) T ss_pred hhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCeeeeEecCccceeeeeeccCCCCcccc-------cccceecceeecCcccccccccceecCCcceecCcccEEEeec Q lcl|NC_018087. 167 KDGIIELRRLDPRNVQFVRELDTKMENGVK-------VVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHS 239 (520) Q Consensus 167 k~GI~elr~lDPr~i~~vr~i~~~~~~~~~-------~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hS 239 (520) |+||+|||+||||+|++||+|.++++++.+ +++++.+||+|+|+. ..++++++||||++||+|||| T Consensus 149 k~GI~ELr~lDPr~i~~vR~i~~~~~~~~~~~~~~~~v~~~~~eyf~ynp~g-------~~~~~~~~vkI~~dAI~y~hS 221 (537) T protein:vir:10 149 RQGLVELRYVDPRKIRKVTEYEAKRPEALRTQDLNQQLTQQSASYFLYNPKG-------LKNSTNQGMKIAPDSIAYCHS 221 (537) T ss_pred cccceeeeeeCCccceeeEeecccCCccceEEecceeeeecccceeeecccc-------ccccCCCceeccHhheeeecc Confidence 999999999999999999999999987776 567788999999862 345788899999999999999 Q ss_pred ccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCc Q lcl|NC_018087. 240 GLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGK 319 (520) Q Consensus 240 GL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGe 319 (520) ||+|||++.++||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+ T Consensus 222 Gl~d~n~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGe 301 (537) T protein:vir:10 222 GIQDLNKNMVLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGE 301 (537) T ss_pred cceeCCCCeeeeeehhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHH Q lcl|NC_018087. 320 VKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDEL 399 (520) Q Consensus 320 v~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDEl 399 (520) |+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++++|++ ||++||||||+ T Consensus 302 v~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnlgem~DV~YF~kKLy~aLnVP~SRl~~e~~f~~-Gr~~EItRDEi 380 (537) T protein:vir:10 302 IKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNI-GRAAEITRDEV 380 (537) T ss_pred ecccchhhhhhhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCcccc-cccchhhHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999998776 99999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhh Q lcl|NC_018087. 400 SFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYI 479 (520) Q Consensus 400 kF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~ 479 (520) ||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||||| T Consensus 381 KF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~ 460 (537) T protein:vir:10 381 KFQKFIARLRKRFSELFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYF 460 (537) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 480 SNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 480 S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) |++||||+||+|||+||++++|||++|+++|+|++|++++= T Consensus 461 s~dyi~k~ILr~tDeeI~~~~k~I~~E~k~~~~~~p~~~~~ 501 (537) T protein:vir:10 461 SANYIRTKVLKQTESEIKEIDKEIKQEIADGVIMDPQAMQA 501 (537) T ss_pred chHHHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCcccccc Confidence 99999999999999999999999999999999999988665 No 16 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=100.00 E-value=5e-273 Score=1513.42 Aligned_cols=499 Identities=33% Similarity=0.619 Sum_probs=477.1 Q ss_pred chhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccc--cchhHHHHHHHHHHHhhcc Q lcl|NC_018087. 8 DLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDP--TATSTRELINTYRSLLNNY 85 (520) Q Consensus 8 ~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~--~~~~~~~LI~~YR~ma~~p 85 (520) .-+||||.+++++. ++++|++||+++||++.++ ||++++|++..+ .++|+++||++||+||+|| T Consensus 1 m~~lfgf~i~~~~~-------~~~~S~vpp~~~~~~~~i~-------~g~~g~~v~~~g~~~~~n~~eLI~~YR~ma~~p 66 (564) T protein:vir:10 1 MSQLFGFLINEKEG-------QKGQSPVPPNDEASVSTVA-------GGYFGTYVDTSGGQNSRNEYELIRRYRDMSLHP 66 (564) T ss_pred Ccchhcceeeeecc-------CCCCCcccCCcCCChhhhh-------ccccceeeecccccchhhHHHHHHHHHHHhhcc Confidence 45899999987664 6899999999999999985 778888887775 6899999999999999999 Q ss_pred chhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCC Q lcl|NC_018087. 86 EVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNR 165 (520) Q Consensus 86 Evd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~ 165 (520) |||+||++||||||||+++++||+|+|++++||++||+||+|||++||+||+|+++||++||+||||||+|||||||+++ T Consensus 67 EVd~Av~eIVneaIv~d~~~~pV~vdL~~~~~s~siK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~ 146 (564) T protein:vir:10 67 EVDSAIDEIVNEFVVNDGDDKPVEVDLQNLEIGSGVKKKIRDEFNRILRMMNFNVNAHEIIRNWYVDGRSHYHKVIDLDN 146 (564) T ss_pred chhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeeCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCeeeeEecCccceeeeeeccCCC-Ccccccccce---------ecceeecCc----ccccccccceecCCcceecCc Q lcl|NC_018087. 166 PKDGIIELRRLDPRNVQFVRELDTKM-ENGVKVVKGY---------REYFLYDTE----LESYQCGHQHFAAGTKIKIPY 231 (520) Q Consensus 166 ~k~GI~elr~lDPr~i~~vr~i~~~~-~~~~~~~~~~---------~ey~~y~~~----~~~~~~~~~~~~~~~~~~I~~ 231 (520) ||+||+|||+||||+|++||++.+++ .++..+++++ .|||+|+|+ ..++++|++.++++++|+||. T Consensus 147 pk~GI~eLr~lDPr~i~~vr~i~~~~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~~~~g~~~~~~~~~~~~~~~~ikI~~ 226 (564) T protein:vir:10 147 PKKGILELRYIDSLKIRKVRQKLKDVDPNRKEIEKGTALQYDYGDFIEYYIYNPKGFAGNIPMVTGSMDWSNQEGIKIAS 226 (564) T ss_pred hhhhhhhhhhhcccceeeeeeeccccccccceeeeeeeeeccccccccceeeccccccCcccccccccccccccceeech Confidence 99999999999999999999999986 5677777765 499999954 456677888888999999999 Q ss_pred ccEEEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhccee Q lcl|NC_018087. 232 SAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRI 311 (520) Q Consensus 232 ~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knkl 311 (520) +|||||||||+|||++.++||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+ T Consensus 227 daI~y~hSGL~d~~~~~i~gyLhkAIKp~NQLkmlEDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNkl 306 (564) T protein:vir:10 227 DAIAQSTSGLMDLNKKMTLSFLHKAIKSLNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLRDVMSRYRNKL 306 (564) T ss_pred hhcceecccceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCcccccccc Q lcl|NC_018087. 312 SYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMS 391 (520) Q Consensus 312 vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~ 391 (520) |||++||+|+|++|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++++++.+||+ T Consensus 307 VYDa~TGevrddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~DV~YF~kKLY~aLnVP~SRl~~e~~~f~~Gr~ 386 (564) T protein:vir:10 307 VYDGQTGEIRDDKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELKDVEYFKKKLYNSLNLPPSRLTDDNKAFNLGKS 386 (564) T ss_pred EEeccCceecccchhhhhHhhhcccccCCCcccceeeccccCCcchHHHHHHHHHHHHHHhCCCcccccCCCceeecccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999754555999 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018087. 392 TAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLM 471 (520) Q Consensus 392 ~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~ 471 (520) +|||||||||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|+++++++ T Consensus 387 ~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~ 466 (564) T protein:vir:10 387 TEILRDELKFTKFIGRLRKRFAQLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQM 466 (564) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 472 EPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 472 ~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) +||||||||++||||+||+|||+||++++|||++|+++|++++|++++- T Consensus 467 dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~~ 515 (564) T protein:vir:10 467 DPFVGKYFSTEYIRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVNM 515 (564) T ss_pred hhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhhc Confidence 9999999999999999999999999999999999999999999966665 No 17 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=100.00 E-value=1.1e-234 Score=1303.18 Aligned_cols=463 Identities=17% Similarity=0.303 Sum_probs=435.8 Q ss_pred hhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeeccc---ccccccccccccccccccchhHHHHHHHHHHHh-hc Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQ---DIAYNGVFQKLYGSQDPTATSTRELINTYRSLL-NN 84 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~---~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma-~~ 84 (520) +-.|+||.++++.+++++++++..|+++|+++||+++|+.+ +++++|.++++|++ .++|+++||++||+|| +| T Consensus 1 ~~~~~~w~~~de~~~~~~~~~~~~~~~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~gg---~~~n~~eLI~~YR~ma~~~ 77 (533) T protein:vir:58 1 MPSLEKYKKLNEAVNFTNFLSPMYGMGAPHGAGGSSMIPINMYHPFATAGYASRFYGG---IEFNRFFLYDMYDRMDYTD 77 (533) T ss_pred CCCcchhhhhhHHHHHHHhhchhhcccCccCCCCCccccCCCCcchhhhhhhhhhhcc---ccccHHHHHHHHHHhhccC Confidence 66799999999999999999999999999999999999976 45667778888864 5779999999999997 68 Q ss_pred cchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecC Q lcl|NC_018087. 85 YEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPN 164 (520) Q Consensus 85 pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~ 164 (520) ||||+||++||||||||+++++||+|+|++++||++||++|+ ++|||+++||++||+||||||+||||+++ T Consensus 78 pEVd~AideIvneaiv~d~~~~pV~v~l~~~e~s~~iK~kI~-------~lldf~~~~~~~fR~WYVDGriy~Hkiik-- 148 (533) T protein:vir:58 78 PLISTVLDIIADECTIPNENGNIVDVVTKDIELAKAILSYLD-------YVINIEKNAYPIIRNMIKYGDMFLHILEK-- 148 (533) T ss_pred cchhhHHHhhhceeeEecCCCceeEeecccccccHHHHHHHH-------HHhcchhhhhHHHHhhhhcceeEEEeccC-- Confidence 999999999999999999999999999999999999998765 79999999999999999999999999765 Q ss_pred CCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccccC Q lcl|NC_018087. 165 RPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDC 244 (520) Q Consensus 165 ~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~ 244 (520) +|++||+|||+||||+|++||++.++ .+||+|++... ++++++++++||++||+||||||+|| T Consensus 149 ~~k~GI~elr~lDPr~i~~vr~~~t~-----------~eyyvy~~~~~------~~~s~~~~~kI~~daI~y~~SGl~d~ 211 (533) T protein:vir:58 149 GSDGTIEKFQVVSPYIFSKRYNPETD-----------TWYYVITDVYR------NVVSGYFNEDIPEEDVIHFSHKIDTN 211 (533) T ss_pred CcccchhhheecCCeeeEEEEeeccc-----------eEEEeeccccc------ccccCccccccchhheeeeeeccccC Confidence 78999999999999999999999876 38999998743 24577888999999999999999999 Q ss_pred CCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCcccccc Q lcl|NC_018087. 245 CGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQA 324 (520) Q Consensus 245 ~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~ 324 (520) |++.++||||+|||||||||||||||||||+||||+|||||||||||||.||+|||++||++||||+|||++||+|+|++ T Consensus 212 ~~~~iisyLhkAiKp~NQLkmiEDAlVIYRisRAPeRRvFYIDVGNlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddr 291 (533) T protein:vir:58 212 FFPYGRSYLESARAIWNQLRLMEDALMLYRVVRSVDRRVFYVDVGNVPPDKINEYLTNIAMQYKRDYWVRNNQNQFLGID 291 (533) T ss_pred CCCceehhhhHHHHHHHHHHHHHHHHHHHhhcCChhheEEEEeecCCCccCHHHHHHHHHHhcccceEEeccCCeEeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccc---hhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHH Q lcl|NC_018087. 325 NMM---ALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSF 401 (520) Q Consensus 325 ~~m---smlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF 401 (520) |+| ||||||||||||||||||||||||| |||+|+||+||++|||+|||||+|||+++++| ||++||||||||| T Consensus 292 k~m~~~sMlEDyWLpRReGgrgTEI~TLpGg-~lgemeDV~YF~kkLy~ALnVP~sRl~~e~~f---gr~~eItRDEiKF 367 (533) T protein:vir:58 292 NYFSIESILKDYFIPRRGDRRAVEIDILQGS-KVDLAEDVEYMLNRLISALKVPKAFIGYEGDV---NAKNTLATQDIKF 367 (533) T ss_pred chhhhhhhHhhhcccccCCCccceeeecCCC-CCCcHHHHHHHHHHHHHHhCCCeeecCCCCCC---ccchhhhHHHHHH Confidence 999 9999999999999999999999987 59999999999999999999999999999874 9999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhH Q lcl|NC_018087. 402 DKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISN 481 (520) Q Consensus 402 ~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~ 481 (520) +|||+|||+||+++| +.||+||||||++|| +|+|++||||+|+|++|||++|++++++++||||| T Consensus 368 ~KFI~rLR~rF~~ll----~~qLilk~iit~eew-------~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~dpyvgk---- 432 (533) T protein:vir:58 368 NNTIKRIQGFFVEEL----ERMVRMNKEFADQDF-------RLVMNRSNSIVEGERFAVIEQRIGIAERLKGWVRE---- 432 (533) T ss_pred HHHHHHHHHHHHHHH----hcccccccCcchhhe-------eeeeeccchHHHHHHHHHHHHHHHHHHHhcchhhH---- Confidence 999999999998876 559999999999999 59999999999999999999999999999999998 Q ss_pred HHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCcc-ccC Q lcl|NC_018087. 482 HTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEP-EEI 520 (520) Q Consensus 482 ~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~-e~~ 520 (520) +||||+||+||| ||++++++|++|.++|+|++|+. +|+ T Consensus 433 ~yi~k~ILr~td-ei~~q~e~ie~E~~~~~~~~~~~~~e~ 471 (533) T protein:vir:58 433 DWIYSNILQIPY-DLKPQEEVAEAAGGGGLFDTGGFGEET 471 (533) T ss_pred HHHHHHHhcCCh-hhhHHHHHHHHhhcCCCCCCCCccccc Confidence 589999999998 77777899999999999999954 455 No 18 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.43 E-value=7.8e-13 Score=86.95 Aligned_cols=430 Identities=13% Similarity=0.154 Sum_probs=204.0 Q ss_pred hhcchhhhhhhHHHhhhccC--------------------------CCcccCCCC-----CCCceeeccccc----ccc- Q lcl|NC_018087. 11 MFAFWHKVDDTEYDKIINDK--------------------------AESITAPKF-----DDGATEVDSQDI----AYN- 54 (520) Q Consensus 11 ~f~~~~~~~~~~~~~~~~~~--------------------------~~s~~~p~~-----~dg~~~i~~~~~----a~~- 54 (520) ||+||.|......++.+-.+ ...+.+|.. .||+...-..+. +.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~ 80 (537) T protein:vir:10 1 MFKFWRKKTVEAVQSSIAERIEPRVGIFGAGDDEKPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTF 80 (537) T ss_pred CCCccccccccccccccccccccccCCCcccchhhHHHHHHhhhhccCCCCCccCcccccccccccchhccccccchhhh Confidence 99999886633222210000 001111111 122111000000 000 Q ss_pred c---cc------ccccccccccchhHHHHHHHHHHHhhccchhHHHHhhhceeeEecCCCcEEEEeecc-chhhhHHHHH Q lcl|NC_018087. 55 G---VF------QKLYGSQDPTATSTRELINTYRSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQ-TAFTENIRNL 124 (520) Q Consensus 55 g---~~------~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~-~~~s~~ik~~ 124 (520) | +. ..+|... .+ .-++|...|+ .|+++..||+.++++|+- + .+.|..++ .+.+....++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~--~~-~~~~l~a~Y~---~~~l~r~iVd~~A~d~~r--~---~~~i~~~~~~~~~~~~~~~ 149 (537) T protein:vir:10 81 SAYANPNLSEGLVLWYAQQ--AF-IGHQMCALIA---THWLVNKACSQMPRDAMR--K---GYKIISDDGNELDPKDAKF 149 (537) T ss_pred hhhccccccchhhhhcccc--CC-ccHHHHHHHH---hCchhhhhhhhhhHHhhc--C---CceeecCCcccccHHHHHH Confidence 0 00 0111111 11 1246666665 699999999999999943 2 23333332 2344444445 Q ss_pred HHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCC-------------CCCCeeeeEecCccceeee--eeccC Q lcl|NC_018087. 125 ISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNR-------------PKDGIIELRRLDPRNVQFV--RELDT 189 (520) Q Consensus 125 I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~-------------~k~GI~elr~lDPr~i~~v--r~i~~ 189 (520) |..+.+ -|++..+..+.++.=-+.|.=+.-..++-.+ .+.+++.|+.|||..+.+. ..+.. T Consensus 150 l~~~~~----~l~~~~~l~~a~~~~rlyG~~~i~i~v~~~D~~~~~~Pl~~~~i~kg~~k~l~vidp~~~~~~~~~~~~~ 225 (537) T protein:vir:10 150 IDRYDR----AFNIKKHAIQFVRKGRIFGIRIALFKVDSPDPYYYEKPFNIDGVMPGAYKGIVQIDPYWCAPLLDAQASS 225 (537) T ss_pred HHHHHH----HhhHHHHHHHHHHhcccccceEEEEeecCcCCcccccccccccccccceeEEEEechhhcccccchhhhc Confidence 554444 3455566666555323335433333332111 2335778888888766653 11111 Q ss_pred CCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecc-----cccCCCCcchhhhHHHHHHHHHHH Q lcl|NC_018087. 190 KMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSG-----LVDCCGKNIIGYLHRAVKPANQLK 264 (520) Q Consensus 190 ~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSG-----L~d~~~~~~~syL~~aik~~NqL~ 264 (520) |... ..| |.| +.|.. .+.+||++-|+..... +-.+.++...|-|+++...+.+.. T Consensus 226 ---dp~s-----p~f--g~P--~~y~v--------~g~~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~ 285 (537) T protein:vir:10 226 ---NPVS-----MHF--YEP--TYWLI--------NGKKYHRSHLAIYINDEVVDFLKPSYIYGGVPLPQQIMERVYAAE 285 (537) T ss_pred ---cCCc-----ccc--CCc--eeeee--------cCeEecceeEEEecCCCCchhhhcccCcccccHHHHHHHHHHHHH Confidence 1110 000 111 11111 1237888877765422 122334456788888766554443 Q ss_pred HHHH--HHHHHHHhcCccceEEEccCCC-CchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCC Q lcl|NC_018087. 265 LLED--AMMIYRITRAPDRRVFYIDTGN-MPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGK 341 (520) Q Consensus 265 m~ED--alVIyRi~RApeRRvFyIDvGn-lpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGg 341 (520) .... +.++|+- - =+++.+|... |....+-.-.-+.++++|+ ..| .|-+ -.+ T Consensus 286 ~t~~~~~~l~~~~---~-~~v~k~~~~~~l~~~~~~~~r~~~~~~~r~------n~g-------~~~i-------d~e-- 339 (537) T protein:vir:10 286 RTANEGPMLAMTK---R-QTVLKVDAAQVLANKQQFDETMSWWTATRD------NYQ-------VRVV-------DKD-- 339 (537) T ss_pred HHHHHHHHHHHhc---C-CceeeechHHhhcCHHHHHHHHHHHHhhcC------Ccc-------eeEe-------cCC-- Confidence 3222 2333322 2 2366665321 2112111111122333332 111 1100 000 Q ss_pred CCcceeecCCCCCcChHHHH-HHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 342 AVTEVETLPGMTGMNEMDDI-LYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPL 420 (520) Q Consensus 342 rgTEIsTLpGg~nLgei~DV-~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~L 420 (520) +-+++++- .+|+-++|+ ..|...+=.+++||+.||-.++..++ +.+ =.-|.-.|+.+|+++|.++..++..++ T Consensus 340 -~e~~e~~~--~~lsgl~~~l~~~~~~iAa~~~IP~t~L~G~sp~Gl-nat--Ge~D~~~yyd~I~~~Qe~l~p~l~~l~ 413 (537) T protein:vir:10 340 -NEDVVQID--TTLNDLDKVIMNQYQLVCAIARTPAPKMLGTVPTGF-NST--GDYEEASYHEECESTQDDMRPLIDRHH 413 (537) T ss_pred -CceeEEEe--ccCCCHHHHHHHHHHHHHhhhCCCceeeccCCcccc-ccc--hhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11222222 245556664 56777788888999999865543222 211 122445599999999998877766655 Q ss_pred HHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCC--------- Q lcl|NC_018087. 421 KSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQM--------- 491 (520) Q Consensus 421 k~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~--------- 491 (520) + |+.+.-..++ ..++|.|..=..-++...+|+...+.++++.+-.- -.+|.+-+++. |+. T Consensus 414 ~--ll~~~~~~~~------~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~--G~i~~~Evr~~-L~~~~~~g~~~l 482 (537) T protein:vir:10 414 Q--LVCRSHLRKR------IRVKVEFPPMDAPKESERADTFLKKMQAAKLAFEM--GAVDGVDVNEY-LRMDPTLGFTSI 482 (537) T ss_pred H--HHHHhcCCCC------cceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHc--CCCCHHHHHHH-HhccCccccccc Confidence 3 3332222221 24788898888889999999999999999887443 26788887765 433 Q ss_pred ----CHHHHHHHHHHHHHhhhc--CCccCCccccC Q lcl|NC_018087. 492 ----SDEDIAAERKLIDEELSD--KIFNPPEPEEI 520 (520) Q Consensus 492 ----tDeeI~~~~kqi~~E~~~--~~~~~p~~e~~ 520 (520) +++++++. .++.|.+. ..-++|++.|- T Consensus 483 ~~~~~~ed~e~~--~~~~~~~~~~~~~~~~~~~~~ 515 (537) T protein:vir:10 483 TPAMRPTDAEDI--DVDDEGKPVRIIEDQPAPSEM 515 (537) T ss_pred cCCCChhhhhcc--cCCccCCcCCCCCCCCCcccc Confidence 33333322 12222222 12223333333 No 19 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.25 E-value=7e-11 Score=76.25 Aligned_cols=438 Identities=11% Similarity=0.095 Sum_probs=202.7 Q ss_pred cccccch--------hhhcchhhhhhhHHH-h--------hhccCCCcccCCCCCCCceeeccccccccccc---ccccc Q lcl|NC_018087. 3 MLADSDL--------KMFAFWHKVDDTEYD-K--------IINDKAESITAPKFDDGATEVDSQDIAYNGVF---QKLYG 62 (520) Q Consensus 3 ~~~~~~l--------~~f~~~~~~~~~~~~-~--------~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~---~~~~~ 62 (520) |+ |+.. .-++-....+.+.-. + .+.+ ...+|+..+++...++.. .+.+++- +.++. T Consensus 1 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~a~~-~g~~~~~~~~~~~~~ 76 (532) T protein:vir:94 1 MA-DTDPTPRPEITYATLQQAQRVDAKRATHTSLGLATAHEIDP--TAYSPYERNAAQNAMAMD-YGLQTGRNGRNALSF 76 (532) T ss_pred CC-CCCCCCCcceehhhhhhHhhhhhhhhhhhhhhhhhhhhhcc--cccccccccccccccccc-cccCccccccccccc Confidence 22 2111 112222222222110 0 1111 112333333322212110 0000100 01111 Q ss_pred cccccchhHHHHHHHHHHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccc-hhhhHHHHHHHHHHHHHHHHhcchhh Q lcl|NC_018087. 63 SQDPTATSTRELINTYRSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQT-AFTENIRNLISDEFNSVLNMLNFQRK 141 (520) Q Consensus 63 ~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~-~~s~~ik~~I~eeF~~i~~ll~f~k~ 141 (520) . ....-..++|...|+ .||++..||+.++++|+.. .+.|.-++. ++.+....+|..+.+. |++..+ T Consensus 77 ~-~~~~~~~~~l~a~Y~---~~~l~r~~Vd~~aed~~r~-----~~~i~~~~~~~~~~~~~~~i~~~~~~----l~v~~~ 143 (532) T protein:vir:94 77 V-EATSWPGFPTLALLA---QLPEYRTMHETPADECVRA-----WGKITCSSKDELAADKATRITQKLEQ----YNVRTL 143 (532) T ss_pred c-cccccchHHHHHHHH---cCchhhhhhccchHHHhhC-----CceEeeCCccccchHHHHHHHHHHHh----hhHHHH Confidence 1 111124557777776 5999999999999999742 223322222 2344455555555444 344555 Q ss_pred hHHHHHh--hccccceeEEEe-------------eecCC-CCCCeeeeEecCccceeeeeeccCCCCcccccccceecce Q lcl|NC_018087. 142 GSDHFKR--WYVDSRVFFHKI-------------INPNR-PKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYF 205 (520) Q Consensus 142 g~~~fRr--WYvDgri~~hkv-------------id~~~-~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~ 205 (520) ..+.+|. -|=+|-++++.. ++++. .+.+++.|+.|||..+.+- ... ..| T Consensus 144 l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p~~l~~~~I~~g~~~~l~vld~~~v~p~-~~~--~~d------------ 208 (532) T protein:vir:94 144 VRTVVIHDQAYGGAHVFPHLKMDGDSVPADAPLLLSPSFVQRGCLIGFATIEPMWLSPN-AYN--ATD------------ 208 (532) T ss_pred HHHHHHhhhcccceEEEEEeccCCccccccccccccccccccceeeEEEeechheeccc-ccc--ccc------------ Confidence 5554442 333334454432 11111 1223567777777655441 000 000 Q ss_pred eecCcccccccccceecCCcceecCcccEEEeeccc-----ccCCCCcchhhhHHHHHHHHHHHHHHHHH--HHHHHhcC Q lcl|NC_018087. 206 LYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGL-----VDCCGKNIIGYLHRAVKPANQLKLLEDAM--MIYRITRA 278 (520) Q Consensus 206 ~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL-----~d~~~~~~~syL~~aik~~NqL~m~EDal--VIyRi~RA 278 (520) |... .++-...+...++.+||+|-+++....- ....++...|.|+++...+.+......+. ++++-. T Consensus 209 ---p~sp-~fg~P~~y~v~~g~~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~-- 282 (532) T protein:vir:94 209 ---PTLP-SFYKPDSWIATSGKKIHSSRIHTVVGRPVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFS-- 282 (532) T ss_pred ---cccc-ccCCceeEEEccCeeeccceEEEecCCCchhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcC-- Confidence 1000 0111111111223478998887763221 22234456788988877777665544433 344422 Q ss_pred ccceEEEccCCCCc-hHHHHHHHHH--HHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCc-ceeecCCCCC Q lcl|NC_018087. 279 PDRRVFYIDTGNMP-ARKAAQHMQH--IMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVT-EVETLPGMTG 354 (520) Q Consensus 279 peRRvFyIDvGnlp-k~KAeqyl~~--im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgT-EIsTLpGg~n 354 (520) =.|+.++..++- ....++..+. .++++|+ .+|- +- ++ +++ +++++. .+ T Consensus 283 --~~v~k~~~a~~ls~~~~~~~~~r~~~~~~~~~------n~g~-------~~------id-----~~~e~~e~~~--~~ 334 (532) T protein:vir:94 283 --MTNLATDMAQLLAPGGAQSLDARLQLFNLYRD------NRNI-------GA------LD-----KGTEEIQQTN--TP 334 (532) T ss_pred --CceeeechHHhhcchhHHHHHHHHHHHHhhcC------Cccc-------eE------Ec-----CCCceeEEEe--cc Confidence 234445432221 1111221111 1122221 1111 10 00 111 344442 34 Q ss_pred cChHHHH-HHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCCh Q lcl|NC_018087. 355 MNEMDDI-LYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHK-FEEIFLSPLKSNLLLKRVITE 432 (520) Q Consensus 355 Lgei~DV-~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~r-Fs~if~d~Lk~QLiLkgi~t~ 432 (520) |+-++|+ ..|...+=.+++||+.||-.++...+ +.+++= |.-.|+.||+++|.. +..+...+++. |++.-... T Consensus 335 lsgl~~~l~~~~~~iAaa~~IP~t~LfG~sp~Gl-nstGe~--D~~~yyd~I~s~Qe~~l~p~le~l~~~-l~~s~~g~- 409 (532) T protein:vir:94 335 LSGLDSLQAQSQEQMAAVSHIPLVKLLGITPNGL-NASSDG--EIRVWYDFIAGYQATNLTPLMEWIIDL-IQLSEYGQ- 409 (532) T ss_pred cCCHHHHHHHHHHHHHhHhCCCeeeeecCCcccc-cccchH--HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHhcCC- Confidence 4555554 78899999999999999976654433 222221 444599999999955 45555554432 22211111 Q ss_pred hhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCC-----------HHHHHHHHH Q lcl|NC_018087. 433 DEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMS-----------DEDIAAERK 501 (520) Q Consensus 433 eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~t-----------DeeI~~~~k 501 (520) ..+.++|.|..=...++...+|+...+.++++.+-.- -.+|.+-+++. |++. ++++++... T Consensus 410 -----~~~d~~~~f~pL~~~s~kEkAei~~~~a~a~~~~~~~--Gvi~~~Evr~~-l~~~~~~~~~~~~~~~~~~~~~~~ 481 (532) T protein:vir:94 410 -----IDPGLAWEWSPLMELDDKELAEVRQLNASTDSTLMEL--GVIDAKMVQQR-LAADPTSGYAGALGERDELDDVEE 481 (532) T ss_pred -----CCCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhc--CCCCHHHHHHH-HhcCCccccccccccccccccccc Confidence 1235778898777788888899999999998877332 26788888865 4432 234444444 Q ss_pred HHHHhhhcCCccCCccccC Q lcl|NC_018087. 502 LIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 502 qi~~E~~~~~~~~p~~e~~ 520 (520) ++++...+. ..+|...+- T Consensus 482 ~~~~~~~~~-~~~~~~~~~ 499 (532) T protein:vir:94 482 IAKQLMAAA-LNPPATAPQ 499 (532) T ss_pred hhhhhcccc-cCCCCCCCC Confidence 444333322 222222111 No 20 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.17 E-value=8.9e-10 Score=70.20 Aligned_cols=451 Identities=13% Similarity=0.084 Sum_probs=243.6 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRS 80 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ 80 (520) |+++ |.+..+|.+-...........+. .-+|+.--. . .+..............+...|..+=|. T Consensus 1 mn~~-dr~i~~~sP~~~~~R~~ar~~~~----------~y~aa~~~r----~-~~~~~~~~s~~~~~~~~~~~lr~RaRd 64 (502) T protein:vir:79 1 MAIL-DDVIGVFSPGWKAARLRSRAVIQ----------AYEAVKTTR----T-HKARRENRTADQLSQYGAVSLREQARY 64 (502) T ss_pred CchH-hhHHhhcChHHHHHHHhhHHHHh----------hccccCccc----c-cCCCCCCCChHHHHHHHHHHHHHHHHH Confidence 7654 77777876633332222222110 112221000 0 111111111111112256677788888 Q ss_pred H-hhccchhHHHHhhhceeeEecCCCcE-EEEeeccchhhhHHHHHHHHHHHHHHH------HhcchhhhHHHHHhhccc Q lcl|NC_018087. 81 L-LNNYEVDNAVQEIVSDAIVYEEGFDV-VSIDLDQTAFTENIRNLISDEFNSVLN------MLNFQRKGSDHFKRWYVD 152 (520) Q Consensus 81 m-a~~pEvd~Ai~eIvneaiv~d~~~~~-V~l~Ld~~~~s~~ik~~I~eeF~~i~~------ll~f~k~g~~~fRrWYvD 152 (520) | .++|-+..||+.+++-+|=.. +-.+ ..+.-++..+.+.+.++|..+|+.-++ .++|..--...+|.|.+| T Consensus 65 l~rNn~~a~~av~~~~~nvVG~g-gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~d 143 (502) T protein:vir:79 65 LDNNHDLVIGVFDKLEERVVGKN-GIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRD 143 (502) T ss_pred HHhcChHHHHHHHHHHHhhccCC-ceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhC Confidence 8 599999999999998876332 1111 123344555677788899999987764 566666666799999999 Q ss_pred cceeEEEeeecCCC-CCCe---eeeEecCccceeeeeeccCCCCcccccccce--------ecceeecCcccccccccce Q lcl|NC_018087. 153 SRVFFHKIINPNRP-KDGI---IELRRLDPRNVQFVRELDTKMENGVKVVKGY--------REYFLYDTELESYQCGHQH 220 (520) Q Consensus 153 gri~~hkvid~~~~-k~GI---~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~--------~ey~~y~~~~~~~~~~~~~ 220 (520) |-.|..+++++... +.|. ..|..|||..|.-- ..++..+..|+ .-|+++....-. T Consensus 144 GE~f~~~~~~~~~~~~~g~~~~l~lq~iepd~l~~~------~~~~~~i~~GVe~d~~Gr~~aY~i~~~hPgd------- 210 (502) T protein:vir:79 144 GEVFAQMVSGRINSLTPSAGVHFWLEALEPDFIPMT------SDESNRLNQGVFVDDWGRPEKYLVYKSRPVS------- 210 (502) T ss_pred CceEEEEeecccCccCCCcccceEEEEecchhcCCC------CCCCCeeEeeeEECCCCceEEEEEeecCCCC------- Confidence 99999999976532 3333 47888999777432 12333444443 457777432111 Q ss_pred ecCCcceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHH Q lcl|NC_018087. 221 FAAGTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHM 300 (520) Q Consensus 221 ~~~~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl 300 (520) ......++||.+.|+|..... .+.---.+|.|..+++.+.+|.-.+||...-...-|-.=-+..-+.|.-....+ T Consensus 211 ~~~~~~~rvpA~~vlH~f~~~-r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~---- 285 (502) T protein:vir:79 211 GRQMETKEVDAERMLHLKFVR-RLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDG---- 285 (502) T ss_pred CcccceeEechhheEEeeccc-CCccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccccc---- Confidence 112234799999999997653 344334579999999999999999999999988888765444444333111000 Q ss_pred HHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHH-HHHHHHHHHhcCCChhhc Q lcl|NC_018087. 301 QHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDI-LYFRKALYMALRVPLSRI 379 (520) Q Consensus 301 ~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV-~YF~kkLy~aL~VP~SRl 379 (520) .+.. +.....+|-.-==++ .-.-|.+|..+.....-+..++. +...+.+=.+|+||-.-| T Consensus 286 ------------~~~~-----~~~~~~~l~pG~i~~--~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~l 346 (502) T protein:vir:79 286 ------------NGSK-----ENERELTIQPGIIYD--DLKPGEEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSST 346 (502) T ss_pred ------------CCCC-----CccccccccCCcccc--ccCCCceeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHH Confidence 0000 111111110000000 01124455555554444444443 334444667899999888 Q ss_pred cCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHhcCCCChhhHHhhhhceEEEe--eccchHH Q lcl|NC_018087. 380 PDEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLK----SNLLLKRVITEDEWEAELNNIKIVF--HKNSYFS 453 (520) Q Consensus 380 ~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk----~QLiLkgi~t~eew~~~~~~I~~~f--~~Dn~f~ 453 (520) ..+-+ +.-+.+.---+.|-+.+.++|+.|..-|..++- ...+|.|.++.-.|..-.......| ..--+.- T Consensus 347 t~D~s----~nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iD 422 (502) T protein:vir:79 347 ARNYN----GTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWID 422 (502) T ss_pred hcccc----chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccC Confidence 77633 212344445677999999999988887776533 3567788776433332222333333 3334455 Q ss_pred HHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHH-HHHHHHHHHhhhcCCccCCc---------------- Q lcl|NC_018087. 454 EMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDI-AAERKLIDEELSDKIFNPPE---------------- 516 (520) Q Consensus 454 ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI-~~~~kqi~~E~~~~~~~~p~---------------- 516 (520) .+||+.-...+++. -+-|.+-+..+ .+..-+|. +|.++..+...+.++..+.+ T Consensus 423 P~Ke~~a~~~~i~~---------Gl~t~~~~~a~-~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~e 492 (502) T protein:vir:79 423 PVKEAEAWKIQIRG---------GAATESDWVRA-GGRNPDDVKRRRKAEIDENRKLDLVFDTDPASDKGGSSAATKRQE 492 (502) T ss_pred hHHHHHHHHHHHHc---------CCCCHHHHHHH-cCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCCCCCCCCCCCCCCC Confidence 66776655444431 12244444433 23322221 12222222222233321111 Q ss_pred -------ccc Q lcl|NC_018087. 517 -------PEE 519 (520) Q Consensus 517 -------~e~ 519 (520) .|| T Consensus 493 ~~~~~~~~e~ 502 (502) T protein:vir:79 493 PQHTDDQSEE 502 (502) T ss_pred CCCCCCCCCC Confidence 111 No 21 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.16 E-value=2.1e-10 Score=73.66 Aligned_cols=437 Identities=11% Similarity=0.113 Sum_probs=207.7 Q ss_pred Ccccc----------------------------ccchhhhcchhhhhhhHHHhhhccCCCcccCCC------C--CCCce Q lcl|NC_018087. 1 MSMLA----------------------------DSDLKMFAFWHKVDDTEYDKIINDKAESITAPK------F--DDGAT 44 (520) Q Consensus 1 ~~~~~----------------------------~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~------~--~dg~~ 44 (520) +|.+| --.+....=|....++- .+-.+..-+..|. . .+|+. T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~a~ds~~~~~~~ 80 (765) T protein:vir:96 4 LSWIFGRKKDNAACSESAPEKVARIPQHDPLDPMIKLGKIRGWNVEPEKA---PVIRSVKDFLEPGLSVAMDSAYGDGPT 80 (765) T ss_pred eeeecccccccccccccCchhhhhcCCCCCcccchhHHHHhhcccccccC---CCCCCCCcccCcccceecccccccccc Confidence 00000 01111111222222211 1111111111111 1 11111 Q ss_pred eecccccccccccc--ccccc----ccccchhHHHHHHHHHHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhh Q lcl|NC_018087. 45 EVDSQDIAYNGVFQ--KLYGS----QDPTATSTRELINTYRSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFT 118 (520) Q Consensus 45 ~i~~~~~a~~g~~~--~~~~~----~~~~~~~~~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s 118 (520) ..-. +..|+.. .+... .....-..++|...|++ |+.+..||+.++++|+- + .+.|.-+..+.+ T Consensus 81 ~~~~---~~~~~~~~~~~~~~~~~~~~~~~f~gyql~alY~~---~~l~rkiVd~pAeDa~R--~---g~~I~~~~~e~~ 149 (765) T protein:vir:96 81 PAAK---AAAGGQNPYVVPTMLQDWYNSQGFIGYQACAIISQ---HWLVDKACSMSGEDAAR--N---GWELKSDGRKLS 149 (765) T ss_pred chHH---HhhhccCccchhhHHHhhhcccCCccHHHHHHHHh---CchhhhhhhcchHHhhc--C---CceeecCccccC Confidence 1100 0001110 00000 00111233567777765 99999999999999964 2 345555555555 Q ss_pred hHHHHHHHHHHHHHHHHhcchhhhHHHHH--hhccccceeEEEeeec----------CC-CCCCeeeeEecCccceeeee Q lcl|NC_018087. 119 ENIRNLISDEFNSVLNMLNFQRKGSDHFK--RWYVDSRVFFHKIINP----------NR-PKDGIIELRRLDPRNVQFVR 185 (520) Q Consensus 119 ~~ik~~I~eeF~~i~~ll~f~k~g~~~fR--rWYvDgri~~hkvid~----------~~-~k~GI~elr~lDPr~i~~vr 185 (520) +...++|..+.+. |++..+..+.+| |-|-.|-+++-.-.+. ++ .+.+++.|+.|||..+...- T Consensus 150 ~~~~~~l~~~~~r----l~v~~~l~ea~~~~RlyGga~i~i~i~~~D~~~l~~PL~~~~I~kg~~kgl~vldp~~~~~~~ 225 (765) T protein:vir:96 150 DEQSALIARRDME----FRVKDNLVELNRFKNVFGVRIALFVVESDDPDYYEKPFNPDGIAPGSYKGISQIDPYWAMPQL 225 (765) T ss_pred HHHHHHHHHHHHH----hhHHHHHHHHHHHhhhceeeEEEEEecccCcchhhccccccccccceeeEEEEechhhccccc Confidence 5555555544444 456777777777 7787777765432211 11 12245667777775544421 Q ss_pred eccCCC-CcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccc-----cCCCCcchhhhHHHHHH Q lcl|NC_018087. 186 ELDTKM-ENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLV-----DCCGKNIIGYLHRAVKP 259 (520) Q Consensus 186 ~i~~~~-~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~-----d~~~~~~~syL~~aik~ 259 (520) ..+. .|... + ..|.| +.|.. + +-+||++-|+.....-+ +..++...|-|+++... T Consensus 226 --v~e~~~Dp~s------p-~fg~P--~~y~i-------~-g~~IH~SRli~~~g~~lpd~lk~~~~~~G~Svlq~~yd~ 286 (765) T protein:vir:96 226 --TAESTADPSA------E-HFYEP--DFWII-------S-GKKYHRSHLVVVRGPQPPDILKPTYIFGGIPLTQRIYER 286 (765) T ss_pred --chhccccccc------c-ccCcc--eeeee-------c-CceeccceEEEecCCCchhhhccccCccCccHHHHHHHH Confidence 0000 01000 0 00111 11111 1 12678887666543332 23344567888887655 Q ss_pred HHHHHHH--HHHHHHHHHhcCccceEEEccCCCCc-hHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhccc Q lcl|NC_018087. 260 ANQLKLL--EDAMMIYRITRAPDRRVFYIDTGNMP-ARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQ 336 (520) Q Consensus 260 ~NqL~m~--EDalVIyRi~RApeRRvFyIDvGnlp-k~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLp 336 (520) +..+... +-+.++++- -. +++.+|....- ..++-..--+.++++|+ ..| .+-+ T Consensus 287 I~~~~~t~~~~a~Ll~k~---~~-~v~k~~~~~~l~~~~~l~~r~~~~~~~r~------n~g-------~~~i------- 342 (765) T protein:vir:96 287 VYAAERTANEAPLLAMSK---RT-STIHVDVEKAIANEDAFNARLAFWIANRD------NHG-------VKVI------- 342 (765) T ss_pred HHHHHHHHHHHHHHHHHh---cc-ceeeechHhhhccHHHHHHHHHHHHHhcC------Cce-------eEEe------- Confidence 5554322 344455542 22 35666655321 11111111122333332 011 1111 Q ss_pred ccCCCCCcceeecCCCCCcChHHHH-HHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHH-HH Q lcl|NC_018087. 337 RRDGKAVTEVETLPGMTGMNEMDDI-LYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKF-EE 414 (520) Q Consensus 337 RReGgrgTEIsTLpGg~nLgei~DV-~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rF-s~ 414 (520) +++-+++++. .+|+-++|+ ..|...+=-+.+||+.||-.++..++ +.+++= |.-.|+.+|+.+|... .. T Consensus 343 ----d~ee~~e~~s--~~lsgl~d~l~~~~~~iAaas~IP~t~LfGqsp~Gl-nATGe~--D~~nYyD~I~s~Qe~~l~p 413 (765) T protein:vir:96 343 ----GIDETMEQFD--TNLSDFDSVIMNQYQLVAAIAKTPATKLLGTSPKGF-NATGEH--ETISYHEELESIQEHIFDP 413 (765) T ss_pred ----cCCcceeEEe--cccCCHHHHHHHHHHHHHhhhCCCeeeeccCCcccc-cCcchH--HHHHHHHHHHHHHHHHHHH Confidence 0112455444 356667774 67889999999999999977652222 222221 4455999999999654 44 Q ss_pred HHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHh----- Q lcl|NC_018087. 415 IFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFL----- 489 (520) Q Consensus 415 if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL----- 489 (520) +...+++ -|++-+.+. +.+.|.|..=..=+|...+|+...+.++++.+-.- -.+|.+-+++.+- T Consensus 414 ~le~L~~-li~~s~~i~--------~d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~--Gvis~dEvR~~L~~~~~~ 482 (765) T protein:vir:96 414 LLERHYL-LLAKSESID--------VQLEIVWNPVDSTTSQQQAELNNKKAATDEIYINS--GVVSPDEVRERLRDDPRS 482 (765) T ss_pred HHHHHHH-HHHHhcCCC--------CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhc--CCCCHHHHHHHHhccccC Confidence 4443333 344544332 35888899888888999999999999999887333 2578888887532 Q ss_pred ---CCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 490 ---QMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 490 ---~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) .++++|++.+. .+..|..+. ...+.+.+. T Consensus 483 g~~~l~d~~~e~~~-~~~pe~~~~-~~~~~~~~~ 514 (765) T protein:vir:96 483 GYNRLTDDQAETEP-GMSPENLAE-LEKAGAQSA 514 (765) T ss_pred CCCCCCcccccccc-CCCcccccc-ccCCCcccc Confidence 24555554221 121111111 011111111 No 22 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.10 E-value=7.1e-10 Score=70.73 Aligned_cols=393 Identities=12% Similarity=0.102 Sum_probs=189.8 Q ss_pred CCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccch Q lcl|NC_018087. 37 PKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTA 116 (520) Q Consensus 37 p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~ 116 (520) =...||-...-.+- .-.+.+++++.. ++-.+|...| +.|+.+..||+.++++|+- + .+.|. T Consensus 1 ~~~~D~~~n~~~gg-~~~~~~~~~~~~-----~~~~~l~a~Y---~~~~l~~~~Vd~~aed~~r--~---g~~i~----- 61 (422) T protein:vir:10 1 MVKTDSYANIFLGG-SDGSEIYGSLQN-----QAPTILASLY---ADNALVRRIIDTIPETALA--A---GFHID----- 61 (422) T ss_pred CccchhhHHHHcCC-CCCccccCcccc-----cCHHHHHHHH---HhChhhHHHHhhhhHHHhc--C---Ccccc----- Confidence 12222221111000 000112222221 2345666666 5799999999999999962 2 22232 Q ss_pred hhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEee-ecC------CCCCCeeeeEecCccceeeeeeccC Q lcl|NC_018087. 117 FTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKII-NPN------RPKDGIIELRRLDPRNVQFVRELDT 189 (520) Q Consensus 117 ~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvi-d~~------~~k~GI~elr~lDPr~i~~vr~i~~ 189 (520) ++.-+.++.++++. |++..+..+.+|.=-+-|.=+.-..+ |+. +++..++.++.+||..+.+. .+. T Consensus 62 -~~~~~~~~~~~~~~----l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~-~~~- 134 (422) T protein:vir:10 62 -GIDDEPAFWSRWDD----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQ-TRE- 134 (422) T ss_pred -CCCHHHHHHHHHHH----hhHHHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccccch-hcc- Confidence 22234456666654 45667777755543344433333333 322 35667888999999777652 111 Q ss_pred CCCcccc-cccceecceeecCcccccccccceecCCcceecCcccEEEeecc-----cccCCCCcchhhhHHH-HHHHHH Q lcl|NC_018087. 190 KMENGVK-VVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSG-----LVDCCGKNIIGYLHRA-VKPANQ 262 (520) Q Consensus 190 ~~~~~~~-~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSG-----L~d~~~~~~~syL~~a-ik~~Nq 262 (520) .|... .+.... +|.+.+ ...+.+.+||++=++..+.- |-..+++...|-|.++ ..-+.. T Consensus 135 --~dp~s~~fg~P~-~y~v~~-----------~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~ 200 (422) T protein:vir:10 135 --ENPRNARFGEPL-TYRITT-----------NESDMFYDVHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILDSIKD 200 (422) T ss_pred --cCccccccCcce-EEEEec-----------CCCCcceeeccceeEEeCCCCchhhhcccCCcccchhHHHHHHHHHHH Confidence 11111 111111 221111 12233478998876655311 1122333345667654 232222 Q ss_pred HH-HHHH-HHHHHHHhcCccceEEEccC-CCC---chHHHHHHHH-HHHHhhcceeEeecCCCccccccccchhhhhhcc Q lcl|NC_018087. 263 LK-LLED-AMMIYRITRAPDRRVFYIDT-GNM---PARKAAQHMQ-HIMNSHRNRISYDARTGKVKNQANMMALTEDYWL 335 (520) Q Consensus 263 L~-m~ED-alVIyRi~RApeRRvFyIDv-Gnl---pk~KAeqyl~-~im~~~knklvYd~~TGev~d~~~~msmlEDywL 335 (520) +. ..+- +.++++-. =+|+.++. .++ +....+.-.+ ..+.+.|+ .+|-+. + T Consensus 201 ~~~~~~~~~~l~~~~~----~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~------~~~~~~-------l------ 257 (422) T protein:vir:10 201 YTNCERLATQLLKRKQ----QAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSG------VGQAIG-------I------ 257 (422) T ss_pred HHHHHHHHHHHHHHhc----cccccchhHHHhcCCccchHHHHHHHHHHHHhcC------Ccccee-------E------ Confidence 22 2222 34455532 23455552 221 1111111111 11222222 112111 1 Q ss_pred cccCCCCCcceeecCCCCCcChHHH-HHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHH-HH Q lcl|NC_018087. 336 QRRDGKAVTEVETLPGMTGMNEMDD-ILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHK-FE 413 (520) Q Consensus 336 pRReGgrgTEIsTLpGg~nLgei~D-V~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~r-Fs 413 (520) . +.+-+++++. .+++-++| +..|...+-.+.+||+.||-.++...+ +.++ .-|.-.|+.+|+.+|.. +. T Consensus 258 ---~-~~~e~~e~~~--~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~Gl-natg--d~d~~~yyd~i~~~Qe~~l~ 328 (422) T protein:vir:10 258 ---D-AESEEYSVLN--SDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGV-SSSQ--NTALETFHKLVDRKRNAELL 328 (422) T ss_pred ---e-cCCcceEEEe--cccCChHHHHHHHHHHHHhhhCCCeeeeccCCcccc-cccc--hHHHHHHHHHHHHHHHHHHH Confidence 0 0111333331 23444555 478999999999999999976655433 2211 11233599999999964 44 Q ss_pred HHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCC- Q lcl|NC_018087. 414 EIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMS- 492 (520) Q Consensus 414 ~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~t- 492 (520) .+...+++. |+-.+ .++|.|..-..=+|...+|+...+.++++.+-.- -.++.+-+++. |+.. T Consensus 329 p~l~~l~~~------i~~s~-------~~~~~f~pL~~~sekekaei~~~~a~a~~~~~~~--g~i~~~e~r~~-L~~~~ 392 (422) T protein:vir:10 329 PILEFLIPF------IVNAE-------EWSVEFNPLAQESSKDKAEILEKNVNSIAALIAA--GAMDIDEARDT-LRTIA 392 (422) T ss_pred HHHHHHHHH------hcccC-------CcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhc--CCCCHHHHHHH-hhhhc Confidence 444333222 22223 4678898888889999999999999999887443 25688777755 3221 Q ss_pred -----HHHHHHHHHHHHHhhhcCCccCCccc Q lcl|NC_018087. 493 -----DEDIAAERKLIDEELSDKIFNPPEPE 518 (520) Q Consensus 493 -----DeeI~~~~kqi~~E~~~~~~~~p~~e 518 (520) ..++.+++-..+++.+.+. .+|+++ T Consensus 393 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~d 422 (422) T protein:vir:10 393 PEVKINDGSVETEVTISETSNDPL-EVPTDD 422 (422) T ss_pred ccccCCCCCCccccchhhcCCCCC-CCCCCC Confidence 1112222222333333332 333333 No 23 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.09 E-value=1.6e-09 Score=68.73 Aligned_cols=425 Identities=12% Similarity=0.066 Sum_probs=195.8 Q ss_pred hhhhhhhHHHhhhccCCCcccCCCCCCCceeecc-c-ccccccccccccccccccchhHHHHHHHHHHHhhccchhHHHH Q lcl|NC_018087. 15 WHKVDDTEYDKIINDKAESITAPKFDDGATEVDS-Q-DIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNAVQ 92 (520) Q Consensus 15 ~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~-~-~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai~ 92 (520) -.+.++... +.+..+.+.-...... | .++-.+...+.|.. .. .-+-.+|...|| .|+-+..||+ T Consensus 1 ~~~~~~a~~---------~~~~~~a~~~~~~~~~~g~~~~~d~~~~~~~~~-~~-~~~~~~l~~lY~---~~~l~r~iVd 66 (461) T protein:vir:80 1 MYSIDKAKQ---------AKIDSKIVNRNDFMVGHGKANSRDKLTRQTPGN-GQ-KLDLKACENLYA---SNSIAMNIVD 66 (461) T ss_pred Cccchhhhh---------hhhhhhhhhhhHHHhhcCCcchhhhhhccccCc-cc-ccCHHHHHHHHH---hCCccchhhc Confidence 122222210 1111111111100000 0 01111212222211 11 113345556665 7999999999 Q ss_pred hhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCC------ Q lcl|NC_018087. 93 EIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRP------ 166 (520) Q Consensus 93 eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~------ 166 (520) .++.+|+- + .+.|.-+ +++.+++|.++++. |++..+..+.+|.=-+.|.=+.-..+...++ T Consensus 67 ~~a~d~~r--~---g~~i~~~----~~~~~~~~~~~~~~----l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~ 133 (461) T protein:vir:80 67 IISEDMVR--A---GWSLKTD----NKEMKKNIESKWRK----LKTKDRFQKLYADKRLYGDGFLSIGVVSSNREQADLS 133 (461) T ss_pred cchHHhhc--C---CeeeecC----CHHHHHHHHHHHHH----hhHHHHHHHHHHhhcccccEEEEEEeecCCccccCcc Confidence 99999974 2 2344333 34455566666554 3445555565554334444333332321111 Q ss_pred ----CCCeeeeEecCccceeee--eeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecc Q lcl|NC_018087. 167 ----KDGIIELRRLDPRNVQFV--RELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSG 240 (520) Q Consensus 167 ----k~GI~elr~lDPr~i~~v--r~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSG 240 (520) +.++..+.+|+|--...+ ..+. .|...-.-+--++|.+.+.......-...+....+.+||++-|+++..+ T Consensus 134 ~pl~~~~~~~~~~l~~~~~~~i~~~~~~---~dp~sp~fg~P~~y~i~~~~~~~~~~~~~~~~~~~~~iH~SRii~~~~~ 210 (461) T protein:vir:80 134 TAIDPKTIKSIPYINTFNTQKVTQLYLN---QDMFSEHFGEVEFFEVNRVSQLGEEILSGTTASTSEQIHRSRIIHEQGL 210 (461) T ss_pred CCcccccccceeEEEeccccccchhhhc---ccCcCcccccceEEEEeccccccccccccccCccceEEccccEEEecCC Confidence 233444555544222221 0111 1111101111223333322211111122334455689999999988444 Q ss_pred cccCCCCcchhhhHHHHHHHHHHHHHHH--HHHHHHHhcCccceEEEccCC-CCchHHHHHHHHHHHHhhcceeEeecCC Q lcl|NC_018087. 241 LVDCCGKNIIGYLHRAVKPANQLKLLED--AMMIYRITRAPDRRVFYIDTG-NMPARKAAQHMQHIMNSHRNRISYDART 317 (520) Q Consensus 241 L~d~~~~~~~syL~~aik~~NqL~m~ED--alVIyRi~RApeRRvFyIDvG-nlpk~KAeqyl~~im~~~knklvYd~~T 317 (520) - .++.....|.|+++...+..+..... +.+++ ++..+ +|.+|.- .+......+ +...+..+++ .+ T Consensus 211 ~-~~~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~---~~~~~-v~k~~~l~~~~~~~~~~-~~~~~~~~~~------~~ 278 (461) T protein:vir:80 211 R-FEGETKGRSIFESLYDIITVMDTSLWSVGQILY---DFAFK-VYKTDDIDALNKDDKAN-LTAMLDFMFR------TE 278 (461) T ss_pred C-CCccccCcchHHHHHHHHHHHHHHHHHHHHHHH---HhCCC-ceecchHHhhhchHHHH-HHHHHHHhcC------Cc Confidence 3 44444567899887666655543332 22333 34333 4555421 111111112 2222333321 11 Q ss_pred CccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHH-HHHHHHHHHhcCCChhhccCCCccccccccchhhH Q lcl|NC_018087. 318 GKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDI-LYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISR 396 (520) Q Consensus 318 Gev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV-~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItR 396 (520) |- + + +. + +-+++++- .+|+-++|+ ..|...+-.+.++|+.||-.++.+.+ + +=.= T Consensus 279 g~-------~--~----~d----~-~e~~e~~~--~~lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~---a-sge~ 334 (461) T protein:vir:80 279 AL-------A--I----IK----G-DEQLTKES--TNVSGMKDLLDYGWDYLAGAVRMPKTVLKGQEAGTL---T-GAQY 334 (461) T ss_pred eE-------E--E----Ec----C-CcceEEEe--cCcCCHHHHHHHHHHHHhhhhcCCeeeeecccCCcc---c-cchH Confidence 21 0 0 10 0 11222222 234555554 68888999999999999976654322 1 1112 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHH-H--H-hcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018087. 397 DELSFDKFISELQHK-FEEIFLSPLKSN-L--L-LKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLM 471 (520) Q Consensus 397 DElkF~KFI~rLr~r-Fs~if~d~Lk~Q-L--i-Lkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~ 471 (520) |.-.|..+|.++|.. +..+...+++.= + . +..++.++ ...++|.|..=..-+|...+|++..+.++++.+ T Consensus 335 D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~-----~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~ 409 (461) T protein:vir:80 335 DVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPD-----SFEWAIEFNPLWNLDSKTDAEVRKLTAEADQIY 409 (461) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCcc-----ccceEEEeCCCCCCCHHHHHHHHHHHHHHHHHH Confidence 333599999999965 455444444321 1 1 11122222 134678888777788989999999999998877 Q ss_pred hcccchhhhHHHHHHHHhC---CC--------HHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 472 EPYIGKYISNHTAMKDFLQ---MS--------DEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 472 ~p~vgky~S~~~i~k~IL~---~t--------DeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) -.- -.+|.+-++....+ ++ +.|+++.+++.+ ..+.+|+. T Consensus 410 ~~~--g~is~~e~r~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~e~~ 459 (461) T protein:vir:80 410 IVN--GVLDPDEVKETRFGRFGLENSSKFSGDSAEIDKLAKLVY--------DAYAKKNA 459 (461) T ss_pred Hhc--CCCCHHHHHHHHHHhcCCCCCccCCCCCchhhhhhhhcc--------ccccccCC Confidence 332 25787777754322 11 123333333322 23333444 No 24 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.09 E-value=2.5e-09 Score=67.77 Aligned_cols=405 Identities=12% Similarity=0.128 Sum_probs=202.8 Q ss_pred hhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchhHHHHhhhceeeEecCC Q lcl|NC_018087. 25 KIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNAVQEIVSDAIVYEEG 104 (520) Q Consensus 25 ~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~d~~ 104 (520) -..-++-.|++. --| +.- + ...+.++.+. ..+-.+|...| +.|+.+..||+.++.+|+-+--. T Consensus 1 ~~~~D~~~~~~~---~~g-~~~---~----~~~~~~~~~~---~~~~~~l~a~Y---~~~~l~~~~vd~~a~d~~r~~~~ 63 (437) T protein:vir:52 1 MKFFDGIKSLAL---KLG-SKQ---E----QTYYSPSLSL---TDDLVQLEALW---RDNWIANKVCIKRPEDMVRNWRE 63 (437) T ss_pred CchhhhhHhHHh---cCC-Ccc---c----cceeecCccc---cccHHHHHHHH---HhCchhhHHhhcchHHhhcCCce Confidence 011111112111 011 000 0 1111122221 12344666666 57999999999999998754322 Q ss_pred CcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecC------CCCCCeeeeEecCc Q lcl|NC_018087. 105 FDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPN------RPKDGIIELRRLDP 178 (520) Q Consensus 105 ~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~------~~k~GI~elr~lDP 178 (520) |.-++ ..++-.+++.+ .++-|++..+..+.+|-==+-|.=+.-.++|.. +++.+++.++.+|| T Consensus 64 -----i~~~d--~~~~~~~~~~~----~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~ 132 (437) T protein:vir:52 64 -----IYSND--LNSKQLDLFTK----FERSLKLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPK 132 (437) T ss_pred -----EecCC--CCHHHHHHHHH----HHHhhcHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEech Confidence 22122 12222234443 344445556666644422245555555566643 35678899999999 Q ss_pred cceeeeeeccCCCCcccc-cccceecceeecCcccccccccceecCCcceecCcccEEEeecc--cccCCCCcchhhhHH Q lcl|NC_018087. 179 RNVQFVRELDTKMENGVK-VVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSG--LVDCCGKNIIGYLHR 255 (520) Q Consensus 179 r~i~~vr~i~~~~~~~~~-~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSG--L~d~~~~~~~syL~~ 255 (520) .++.++-... .|... .+.....|.| . +.+.+++||++-|++.... -.+.+++..+|.|++ T Consensus 133 ~~v~~~~~~~---~dp~s~~fg~p~~y~v-~-------------~~~~~~~iH~SRii~~~~~~~~~~~~~~~G~s~le~ 195 (437) T protein:vir:52 133 WKISPTGTKD---DDVLSPNFGRYSEYSI-L-------------GGSQSITVHHSRLIILNANDAPLSDNDIWGVSDLEK 195 (437) T ss_pred hhcccccccc---ccccccccCcceEEEE-e-------------cCCcceeEccceeEEecCccCCCccccccCCchHHH Confidence 8887642111 11111 1111111221 1 1123478999998876432 234455567899999 Q ss_pred HHHHHHHHHHHHHH--HHHHHHhcCccceEEEccCC--CCch--HHHHHHHHHHHHhhcceeEeecCCCccccccccchh Q lcl|NC_018087. 256 AVKPANQLKLLEDA--MMIYRITRAPDRRVFYIDTG--NMPA--RKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMAL 329 (520) Q Consensus 256 aik~~NqL~m~EDa--lVIyRi~RApeRRvFyIDvG--nlpk--~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msm 329 (520) +...+..+...+.+ .++++ +... ++.++.- .|.. ..+..-..+.++++|+ ....+-| T Consensus 196 ~~~~i~~~~~~~~~~~~l~~~---~~~~-v~k~~~l~~~l~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~ 258 (437) T protein:vir:52 196 IIDVLKRFDSASVNVGDLIFE---SKID-IFKIAGLSDKIAAGMENEVASVISAVQEIKS-------------ATNSLLL 258 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHH---cCCC-ceecchHHHHhcCCcHHHHHHHHHHHHHhcC-------------CCceEEE Confidence 87777555444432 23443 4333 4555420 1111 1111112222333332 1111211 Q ss_pred hhhhcccccCCCCCcceeecCCCCCcChHHHH-HHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHH Q lcl|NC_018087. 330 TEDYWLQRRDGKAVTEVETLPGMTGMNEMDDI-LYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISEL 408 (520) Q Consensus 330 lEDywLpRReGgrgTEIsTLpGg~nLgei~DV-~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rL 408 (520) +++-+++++. .+++-++|+ ..|+..+=.+.+||+.||-.++...+ + + -.-|.-.|..+|+++ T Consensus 259 -----------d~~~~~e~~~--~~~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s~~Gl-a-s--ge~D~~~yyd~i~~~ 321 (437) T protein:vir:52 259 -----------DAENEYDRKE--LTFTGLKDLLTEFRNAVAGAADMPVTILFGQSVSGL-A-S--GDEDIQNYHEAIRRL 321 (437) T ss_pred -----------cCCcceEEEe--cCcCCHHHHHHHHHHHHHHHhcCchhhhcCcCcccc-c-c--cHHHHHHHHHHHHHH Confidence 0112344443 245556665 68899999999999999977665433 2 1 223455599999999 Q ss_pred HHH-HHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHH Q lcl|NC_018087. 409 QHK-FEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKD 487 (520) Q Consensus 409 r~r-Fs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~ 487 (520) |.. +..+...+++. |++. +|-.+.+.++|.|..=..-++...+|+...+.++++.+-.- -.+|.+-+++. T Consensus 322 Qe~~l~p~le~l~~~-i~~~------~~g~~~~~~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~--g~i~~~e~r~~ 392 (437) T protein:vir:52 322 QETRLRPIFEIIDPL-ICNE------LFGGLPADWWFEFVPLTTVKQEQQINMLNTFATAANTLIQN--GVLNEYQIANE 392 (437) T ss_pred HHHHHHHHHHHHHHH-HHHH------hcCCCCCcceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhc--CCCCHHHHHHH Confidence 964 55555554442 2222 12222345788998887888989999999999998886332 15677777765 Q ss_pred HhC-------CCHHHHHHHHHHH------H-HhhhcCC-ccCCccc Q lcl|NC_018087. 488 FLQ-------MSDEDIAAERKLI------D-EELSDKI-FNPPEPE 518 (520) Q Consensus 488 IL~-------~tDeeI~~~~kqi------~-~E~~~~~-~~~p~~e 518 (520) |+ ++++++++...-- + .|...+. ...|.++ T Consensus 393 -L~~~g~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (437) T protein:vir:52 393 -LRESGLFANISAEHIEELKNADEFAGNFEEPEKMEGAQVQNSEDQ 437 (437) T ss_pred -HHhcCCCCCCCccccccccCCCCCCCccCCCCCCCCCCCCCCCCC Confidence 43 4455544322100 0 0000000 0011111 No 25 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.01 E-value=5e-09 Score=66.11 Aligned_cols=396 Identities=12% Similarity=0.090 Sum_probs=187.6 Q ss_pred cCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchhHHHHhhhceeeEecCCCcEEEEeecc Q lcl|NC_018087. 35 TAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQ 114 (520) Q Consensus 35 ~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~ 114 (520) .+--+.||=..+-.+ ...|..+.++.. ....+|...| +.|+.+..||+.++++|+-+- +.|.-+ T Consensus 1 ~~~~~~d~~~~~~~~--~~~~~~~~~~~~-----~~~~~l~a~Y---~~~~l~~~~Vd~~aed~~r~g-----~~i~g~- 64 (427) T protein:vir:10 1 MKIVKHDGYNDIFNG--GADGSPKPFFMS-----DASYHVGSFY---NDNATAKRIVDVIPEEMVTAG-----FKMSGV- 64 (427) T ss_pred CCccccchHHHHhhc--CCCCcccCcccc-----CchHHHHHHH---HcCchhhhhhccchHHhhcCC-----ccccCc- Confidence 333333433222100 001111222211 1233665555 579999999999999998432 223221 Q ss_pred chhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeee-------cCCCCCCeeeeEecCccceeeeeec Q lcl|NC_018087. 115 TAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIIN-------PNRPKDGIIELRRLDPRNVQFVREL 187 (520) Q Consensus 115 ~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid-------~~~~k~GI~elr~lDPr~i~~vr~i 187 (520) .-+++|..++ +-|++..+..+.+|.=-+-|.=+.-..++ |..++.+++.|+.+||..+.+. .+ T Consensus 65 -----~~~~~~~~~~----~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~-~~ 134 (427) T protein:vir:10 65 -----KDEKEFKSLW----DSYKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFAITVE-KR 134 (427) T ss_pred -----cHHHHHHHHH----HHhhHHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhccccc-cc Confidence 1123444444 44566677777665433444433333332 3356788999999999776552 11 Q ss_pred cCCCCcccc-cccceecceeecCcccccccccceecCCcceecCcccEEEeecc-c---c-cCCCCcchhhhHHHHHHHH Q lcl|NC_018087. 188 DTKMENGVK-VVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSG-L---V-DCCGKNIIGYLHRAVKPAN 261 (520) Q Consensus 188 ~~~~~~~~~-~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSG-L---~-d~~~~~~~syL~~aik~~N 261 (520) . .|... .+... ++|.+.+. ....+++||+|=++....- + . ..+++...|.|.+++ ++ T Consensus 135 ~---~dp~s~~fg~P-~~y~v~~~-----------~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~--~~ 197 (427) T protein:vir:10 135 V---TNARSPRYGEP-EIYKVSPG-----------DNMQPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSL--ID 197 (427) T ss_pred c---cCccccccCcc-eEEEEecC-----------CCCcceEEccccEEEecCCCchhhhcccCCcccchhhhHHH--HH Confidence 1 11111 01111 12211111 1123478999887765311 1 1 122333456666543 45 Q ss_pred HHHHHHH-----HHHHHHHhcCccceEEEc-cCCCC---chHHHHHHHHH-HHHhhcceeEeecCCCccccccccchhhh Q lcl|NC_018087. 262 QLKLLED-----AMMIYRITRAPDRRVFYI-DTGNM---PARKAAQHMQH-IMNSHRNRISYDARTGKVKNQANMMALTE 331 (520) Q Consensus 262 qL~m~ED-----alVIyRi~RApeRRvFyI-DvGnl---pk~KAeqyl~~-im~~~knklvYd~~TGev~d~~~~msmlE 331 (520) .|+-+|. +.++++- -.+ |+.+ +++++ +....+.-++. .+.+.|+ .+|-+ .+ T Consensus 198 ~i~~~~~~~~~~~~l~~k~---~~~-v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~------~~~~~-------~l-- 258 (427) T protein:vir:10 198 AICDYDYCESLATQILRRK---QQA-VWKVKGLAEMCDDDDAQYAARLRLAQVDDNSG------VGRAI-------GI-- 258 (427) T ss_pred HHHHHHHHHHHHHHHHHHh---ccc-cccchhHHHHhcCccchHHHHHHHHHHHHhcC------cccce-------ee-- Confidence 5554443 3344442 222 3334 33221 11111111111 1112221 11111 01 Q ss_pred hhcccccCCCCCcceeecCCCCCcChHHH-HHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHH Q lcl|NC_018087. 332 DYWLQRRDGKAVTEVETLPGMTGMNEMDD-ILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQH 410 (520) Q Consensus 332 DywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~ 410 (520) | +.+-+++++. .+|+-++| +..|...+=.+.+||+.||-.++...+.+-+.+ |.-.|..+|+++|. T Consensus 259 ~--------~~~e~~e~~~--~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~sp~Glnstgd~---D~~nyyd~i~~~Qe 325 (427) T protein:vir:10 259 D--------AETEEYDVLN--SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNT---ALETFYKLVDRKRE 325 (427) T ss_pred e--------cCCCceeEEe--cccCChHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhH---HHHHHHHHHHHHHH Confidence 0 1112233332 23444555 478999999999999999976655444222222 55569999999995 Q ss_pred H-HHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHh Q lcl|NC_018087. 411 K-FEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFL 489 (520) Q Consensus 411 r-Fs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL 489 (520) . +..+...+++ | |+.. +.++|.|..-..=+|...+|+...+.++++.+-.-. .++.+-+++.+- T Consensus 326 ~~l~p~l~~l~~--~----i~~s-------~~~~~~f~pL~~~s~kEkaei~~~~a~a~~~~~~~g--vi~~~e~r~~L~ 390 (427) T protein:vir:10 326 EDYRPLLEFLLP--F----IVDE-------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQ--IIDLEEARDTLR 390 (427) T ss_pred HHHHHHHHHHHH--H----hhcC-------CCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcC--CCCHHHHHHHHH Confidence 4 3333332222 2 2222 247788999999999999999999999998874431 467777775431 Q ss_pred CCCHHH-HHHH-HHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 490 QMSDED-IAAE-RKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 490 ~~tDee-I~~~-~kqi~~E~~~~~~~~p~~e~~ 520 (520) ..++++ +... .-.+++..+..-.++|..|+- T Consensus 391 ~~~~~~~~~~~~~~~~e~~~~~~e~~p~~~e~~ 423 (427) T protein:vir:10 391 SIAPEFKLKDGNNINIREPEETTEPEPGLGEKL 423 (427) T ss_pred hhhccccCCCCccccccccchhcCCCCCCCCCC Confidence 111110 0000 000111111111122222222 No 26 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=98.97 E-value=7.9e-09 Score=64.99 Aligned_cols=439 Identities=10% Similarity=0.041 Sum_probs=187.2 Q ss_pred Cccccc-cchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeeccc--------ccccccccccccccccccchhH Q lcl|NC_018087. 1 MSMLAD-SDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQ--------DIAYNGVFQKLYGSQDPTATST 71 (520) Q Consensus 1 ~~~~~~-~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~--------~~a~~g~~~~~~~~~~~~~~~~ 71 (520) |.+--- .-..+++-..-.+..- .......+-...||-.....+ .++.+-. .+.......-.. T Consensus 66 ~~~~~~~~~~~~~~~~~a~~~a~------~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~---~~~~~~~~~f~g 136 (862) T protein:vir:99 66 VEISDSVNAKSVSGKNFAMDSAV------RSAIKAITGFAMDDGGGAPVPIGAEGKQSSYAVPEA---LQDWYLSQGFIG 136 (862) T ss_pred ccccccccchhhhhhhhcchhhc------chhhhhhhhhhhhcchhhhhhccccccccccccchh---ccccccccCccc Confidence 111000 1111111111111100 000000111111221111100 0000000 011111222233 Q ss_pred HHHHHHHHHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhcc Q lcl|NC_018087. 72 RELINTYRSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYV 151 (520) Q Consensus 72 ~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYv 151 (520) ++|...|+ .|+.+..||+.++++|+-.--. .... -++.+..+...++|.++++. |++..+..+.+|-=-+ T Consensus 137 yql~alY~---~~~larkiVd~pAeDatR~g~~--I~~~-~d~~e~~~e~~~~ie~~~~r----L~v~~~l~eair~~RL 206 (862) T protein:vir:99 137 HQACALIA---QHWLVDKACSLAGEDAIRNGWH--LKSL-GEGEEIDEESLEKFKAIDVE----FKVKENLIEFNRFKNV 206 (862) T ss_pred HHHHHHHH---hCchhhhhhhhhhHHHhhCCce--Eeec-CcccccCHHHHHHHHHHHHH----hhHHHHHHHHHHhccc Confidence 46666665 5999999999999999642211 1111 11222334444555555544 3444554443332223 Q ss_pred ccceeEEEeeecCCC-------------CCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCccccccccc Q lcl|NC_018087. 152 DSRVFFHKIINPNRP-------------KDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGH 218 (520) Q Consensus 152 Dgri~~hkvid~~~~-------------k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~ 218 (520) .|+-+.-.+++-.+| +.+++.|+.|||..+..+-. .++..+|....|. .. T Consensus 207 yGga~ililv~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp~w~~p~~v----------------~~~~~Dp~sp~yG-kP 269 (862) T protein:vir:99 207 FGIRVAIFVVDSEDPDYYEKPFNPDGITPGSYRGISQIDPYWMMPMLT----------------AESTADPSSQFFY-EP 269 (862) T ss_pred ccceEEEEEecCcCchhhhcCcCcccccccceeEEEEechhhhccccc----------------ccccccccccccC-Cc Confidence 444333333332222 22457777888765554210 0111111111110 00 Q ss_pred ceecCCcceecCcccEEEeecccc-----cCCCCcchhhhHHHHHHHHHHHHH--HHHHHHHHHhcCccceEEEccCCC- Q lcl|NC_018087. 219 QHFAAGTKIKIPYSAMVYAHSGLV-----DCCGKNIIGYLHRAVKPANQLKLL--EDAMMIYRITRAPDRRVFYIDTGN- 290 (520) Q Consensus 219 ~~~~~~~~~~I~~~aI~y~hSGL~-----d~~~~~~~syL~~aik~~NqL~m~--EDalVIyRi~RApeRRvFyIDvGn- 290 (520) ..+..+ +-+||++-++.....-+ ...++..+|-|+++...+...... .-+.++++.. -+++.+|... T Consensus 270 ~~y~I~-g~~IH~SRliif~g~~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~----l~v~ktd~l~~ 344 (862) T protein:vir:99 270 EFWIIS-GQKYHRSHLIIARGPQPADILKPTYIFGGIPLVQRIYERVYAAERTANEAPLLAMNKR----TTAIHTDTAKA 344 (862) T ss_pred eeeeec-CeeeccceeEEecCCCchhhhhccCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhc----cceeechhHhh Confidence 011111 12677776655432211 333445678888665444333211 2233454432 2344555432 Q ss_pred CchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHH-HHHHHHHHH Q lcl|NC_018087. 291 MPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDD-ILYFRKALY 369 (520) Q Consensus 291 lpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~kkLy 369 (520) |....+-..=..+++++|+- .| .+- + +++-+++++- .+|+-++| +..|...+= T Consensus 345 l~~ed~l~~r~~~~~~~rdN------~G-------i~l------i-----D~eEe~e~ls--~slSGL~dll~~~~q~IA 398 (862) T protein:vir:99 345 IANEDKFIQRLMFWVRYRDN------HA-------VKV------L-----GTDETMEQFD--TSLADFDAVIMGQYQLVA 398 (862) T ss_pred hccHHHHHHHHHHHHhccCc------ce-------eEE------e-----cCCCceeEEe--cccCChHHHHHHHHHHHH Confidence 22211111001234444431 11 110 0 0112344443 34444555 577888999 Q ss_pred HhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeec Q lcl|NC_018087. 370 MALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHK-FEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHK 448 (520) Q Consensus 370 ~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~r-Fs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~ 448 (520) .+.++|+.||-.++.-++ +.+++ -|.-.|..+|+++|.. +..++.. |- .|+......+ ..+.|.|.. T Consensus 399 aas~IP~tiLfGqspaGl-nATGE--~D~~nYyD~I~s~QE~~L~P~Ler-L~-~li~~~lg~~-------~d~~ieFnp 466 (862) T protein:vir:99 399 SIAKTPATKLLGTAPKGF-NSTGE--FETISYHEELESIQEHVYMPFLQR-HY-LISRLSLGIQ-------HEIDVVMEP 466 (862) T ss_pred hhhCCCceeecccCcccc-cCchH--HHHHHHHHHHHHHHHHHHHHHHHH-HH-HHHHHhcCCC-------CcceEEeCC Confidence 999999999876652222 22222 1344599999999964 4443333 32 2333222222 347788887 Q ss_pred cchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHh--------CCCHHHHHHHHHHHHHhhhcC------CccC Q lcl|NC_018087. 449 NSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFL--------QMSDEDIAAERKLIDEELSDK------IFNP 514 (520) Q Consensus 449 Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL--------~~tDeeI~~~~kqi~~E~~~~------~~~~ 514 (520) =..=+|...+|+.....++++.+-.- -.+|.+-++.... .++|+++++..-.-.++..+. .-.. T Consensus 467 L~~~sekEkAEi~kk~Aea~~~lv~s--GvispdEvR~~L~~~~~~g~~~l~ded~E~d~~~~~e~~~~~e~~g~a~~~a 544 (862) T protein:vir:99 467 VASMTAQQQADLNKTKAEGGKVLIDG--GVISPDEERNRIRDDKRSGYNRLTKEDAEETPGASPENLAAYQKAGAAQETA 544 (862) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHhc--CCCCHHHHHHHHHhcCCcCCCCCCcccccccCCCCcccccccccCCcccccc Confidence 77788888999999999888877332 2578888886521 245566653211111111110 0111 Q ss_pred CccccC Q lcl|NC_018087. 515 PEPEEI 520 (520) Q Consensus 515 p~~e~~ 520 (520) |.+++- T Consensus 545 p~de~~ 550 (862) T protein:vir:99 545 SAKETQ 550 (862) T ss_pred cccccc Confidence 111111 No 27 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=98.80 E-value=3.3e-08 Score=61.63 Aligned_cols=412 Identities=13% Similarity=0.119 Sum_probs=191.2 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeeccc--ccccccccccccccccccchhHHHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQ--DIAYNGVFQKLYGSQDPTATSTRELINTY 78 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~--~~a~~g~~~~~~~~~~~~~~~~~~LI~~Y 78 (520) || ++ |.|- +|....+...--..||-.....+ +.. ...+.++++. ...+-.+|...| T Consensus 1 ~~---~~-~~~~--------------~~~~~~~~~~~~~rd~l~~~~~glg~~r-~~~~~~~g~~---~~~~~~~l~~~Y 58 (449) T protein:vir:10 1 MT---DK-LTLA--------------VNHALNDARMARARMGLMVPTMGLDNKR-HSAWCEYGFP---ELVTYENLYSLY 58 (449) T ss_pred Cc---hh-hHHH--------------HhhhcchhHHHHHHHHHHHHHhcCCccc-chhhhhcCCc---ccCCHHHHHHHH Confidence 11 11 1111 01111111100011111111111 111 1123333332 345677899999 Q ss_pred HHHhhccchhHHHHhhhceeeEecCCCcEE-EEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHH--Hhhccccce Q lcl|NC_018087. 79 RSLLNNYEVDNAVQEIVSDAIVYEEGFDVV-SIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHF--KRWYVDSRV 155 (520) Q Consensus 79 R~ma~~pEvd~Ai~eIvneaiv~d~~~~~V-~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~f--RrWYvDgri 155 (520) | .+.....||+-++++|..- ....+ ..+.+......+++.++.+-|.. + +-.+..+.. +|-|-.+-+ T Consensus 59 r---~~~ia~~iVd~~~d~~~~~--~~~i~~g~~~~~~~~~~~~e~~~~~l~~~--~---~~~~l~ea~~~~rl~Gga~i 128 (449) T protein:vir:10 59 R---RGGIAHGAVEKLVGKCWQT--NPEIIEGDDADDSEDETSWEKKSKQVFTN--R---LWRSFAEADRRRLVGRYAGI 128 (449) T ss_pred h---cCchhHHHHHhhhhhhhhc--CcccccCccccchhhhHHHHHHHHHHHHH--H---HHHHHHHHHHhhhccCcEEE Confidence 8 6778888999999877321 11111 12223333334444455433321 1 112333333 334555556 Q ss_pred eEEEeee------cCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceec Q lcl|NC_018087. 156 FFHKIIN------PNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKI 229 (520) Q Consensus 156 ~~hkvid------~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I 229 (520) +++.- | |-+++.||..|+.++...|.+- ++.. |...-.=+--+||-+.+...+ ++..+++| T Consensus 129 ~i~v~-d~~~l~~Pl~~~~~i~~i~v~~~~~i~~~-~~~~---dp~sp~yg~P~~y~v~~~~~g--------~~~~~~~i 195 (449) T protein:vir:10 129 LLHIR-DEKDWNLPATKGRGLQKVSVSWAGSLKVA-EWDT---GINSKTYGQPKLWKYTERLPN--------GSSRRVDI 195 (449) T ss_pred EEEec-CCCCCCcccccCcceeeEEeeccccCChh-hhhc---CCCCCCCCCceEEEEeeeccC--------CCccceee Confidence 65532 2 2233445555555554333321 1111 100000011122222211111 22345688 Q ss_pred CcccEEEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHH------HHHHhcC----ccceEEEccCCCCchHHHH-- Q lcl|NC_018087. 230 PYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMM------IYRITRA----PDRRVFYIDTGNMPARKAA-- 297 (520) Q Consensus 230 ~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalV------IyRi~RA----peRRvFyIDvGnlpk~KAe-- 297 (520) |++=++..-.+ .....|+|+++ ||.|.-+|-+.. .-...|. -++ .+|+.+|...++. T Consensus 196 H~SRl~~~~~~-----~~~g~~~L~~~---yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~---~~~~~~l~~~~~~~~ 264 (449) T protein:vir:10 196 HPDRVFILGDY-----SEDAIGFLEPA---YNAFVSLEKVEGGSGESFLKNAARQLNVNFEK---EIDFTNLASLYGVSI 264 (449) T ss_pred ccceeEeecCC-----CCCChhHHHHH---HHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhh---hhhhhhhhHHhhCCc Confidence 88876644211 11245788876 565555544321 1111111 112 2455555543321 Q ss_pred ----HHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHH-HHHHHHHHHHhc Q lcl|NC_018087. 298 ----QHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDD-ILYFRKALYMAL 372 (520) Q Consensus 298 ----qyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~kkLy~aL 372 (520) +-+++.+..+.. .++ ..|-+ + +-+.+++. .+++-++| +..|...+=-+. T Consensus 265 e~~~~~~~~~~~~~~~------~~~------~~~i~---------~---~~d~~~~~--~~~sgl~d~l~~~~q~iaaa~ 318 (449) T protein:vir:10 265 DELQDKFNEVAGEINR------GND------VLMTT---------Q---GATVTPLV--TSVADPTATYNVNLQTAAAGV 318 (449) T ss_pred hHHHHHHHHHHHHHhc------cch------heeec---------C---CcceEEEe--cccCChhHHHHHHHHHHHHHh Confidence 112222221110 011 01111 1 11222222 14455666 566888899999 Q ss_pred CCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchH Q lcl|NC_018087. 373 RVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYF 452 (520) Q Consensus 373 ~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f 452 (520) +||+.||-.++...+ .++ -|.-.|...|...|.++......++.. |+.-++..+. +.+.|.|..=..= T Consensus 319 ~IP~t~L~Gqsp~gl--nst---~D~~nyyd~i~~~Q~~l~p~le~l~~~-l~~s~~g~~~------~d~~i~f~pL~~~ 386 (449) T protein:vir:10 319 DIPTRILIGNQQAER--SST---EDQKYFNARCQSRRVDLSFEIEDFCDK-LIELKIIDAV------AKKAVIWDDLNEQ 386 (449) T ss_pred CCCeeeeeccCcccc--ccc---hhHHHHHHHHHHHHHhhhHHHHHHHHH-HHHhhcCCCC------CceeEEeCCCCCC Confidence 999999977665433 233 255569999999999988877776654 6666666543 3588999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccc-hhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 453 SEMKTIEITERRVNVLSLMEPYIG-KYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 453 ~ElKe~Ei~~~R~~~~~~~~p~vg-ky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) +|...+||.....++++.+-..++ ..|+.+-++ ..+++...+ ..+. ++++++|. T Consensus 387 t~kEkAei~k~~A~a~~~~~~ag~~~~~~~~EiR-~~~~~~~~~------------~~~~-~~e~~de~ 441 (449) T protein:vir:10 387 TGTEKLTNAKTMGEINQTMLGSGDNPAFSREEIR-TAAGYDNDD------------EEPL-GEEDGDEE 441 (449) T ss_pred CHHHHHHHHHHHHHHHHHHHHccccCCcCHHHHH-HHhcccCCC------------CCCC-CCCCCccc Confidence 999999999999999887755432 245665555 234442211 1111 11112222 No 28 >protein:vir:101418 Length: 569 # NCBI annotation: Prt # Family: family:all:9458 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006480;genbank:gi:46401636;genbank:GeneID:2777482 Probab=98.73 E-value=7.8e-08 Score=59.56 Aligned_cols=477 Identities=15% Similarity=0.120 Sum_probs=265.3 Q ss_pred ccccc-chhhhcch--h-hhhhhHHHhhhccCCCcccCCCCCCCceeeccc-cccc---cccccc--------cccc--- Q lcl|NC_018087. 3 MLADS-DLKMFAFW--H-KVDDTEYDKIINDKAESITAPKFDDGATEVDSQ-DIAY---NGVFQK--------LYGS--- 63 (520) Q Consensus 3 ~~~~~-~l~~f~~~--~-~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~-~~a~---~g~~~~--------~~~~--- 63 (520) |+-++ -|.-+.-- . -.+..+-++.+. |..+-++-.|..---++ ++|- .++-.+ ++.| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~s~~g~p~~~~~~~~~~~~~~t~~~D~~~~g~~~ 76 (569) T protein:vir:10 1 MADNKITLSSVRKALAGVFKDNGERDNILL----SALAVHGGSGYLFSRAGAPVQLSGFLGGKPGDSGMAGDGLVDGSRF 76 (569) T ss_pred CCcchhHHHHHHHHHhhhhhcCCccchhhh----hhheeecCcceEEeecCcchhhhhhhccCccccchhhhhHHHHHHH Confidence 55442 22211000 0 001111111111 11121211111100001 1111 111111 1111 Q ss_pred ---ccccchhHHHHHHHHHHHhhccchhHHHHhhhceeeEecCC-CcEEEEeeccc--hhhhHHHHHHHHHHHH-HHHHh Q lcl|NC_018087. 64 ---QDPTATSTRELINTYRSLLNNYEVDNAVQEIVSDAIVYEEG-FDVVSIDLDQT--AFTENIRNLISDEFNS-VLNML 136 (520) Q Consensus 64 ---~~~~~~~~~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~d~~-~~~V~l~Ld~~--~~s~~ik~~I~eeF~~-i~~ll 136 (520) --.-.++..|++-.|-+|+.+|-|..|+..=|.-|.=.++- .+.|.|.--.. .-....++||-.|... +..++ T Consensus 77 ~~~~~~~pr~R~qiY~~~eeM~~~p~Ia~AlniHVtaALggde~TGd~vfI~p~~~~~~a~~daakai~~el~~dl~~~i 156 (569) T protein:vir:10 77 IFDEVQLPEDRLQRYPLLEEMAVYSTIATALNIHITHALSFDKKTGQTFSIVPVHNGNDSDYDAAQALCGELMNDIGRTI 156 (569) T ss_pred HhhhccCchhHHHHHHHHHHHhcCchhhhhhhhhhheeecccccccceEEEEeecCCCCCcchHHHHHHHHHHHHHHHHH Confidence 11234799999999999999999999999999988877654 35666633211 1223334466666665 66775 Q ss_pred cchhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEe---cCccceeeeeeccCCCCcccccccceecceeecCcccc Q lcl|NC_018087. 137 NFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRR---LDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELES 213 (520) Q Consensus 137 ~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~---lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~ 213 (520) ++..+.+-+.--+=|.-|-.+- -+.++||.+|.- -=|.-|++. +-+.+.+.= ...|....+. T Consensus 157 --Nr~~~~lA~~~~aFGdsYaRiY---~~~~~GV~dl~~s~yt~PsfIqpF-------E~g~~tvGF---~~~~~~~~~~ 221 (569) T protein:vir:10 157 --NKEVAGWAFIMSVFGVAYVRPY---AKEGIGITSFECSYYTLPSFIKEF-------EVSGNLAGF---SGDYLKDASG 221 (569) T ss_pred --HHHhhHHHHHHHhhhhhheeee---ccCCceeEEEEecccccccccchh-------hhcCceEEe---ecccCCcccc Confidence 7888888888777776655544 356679998742 223222221 222221110 0011111000 Q ss_pred cccccceecCCcceecCcc----cEEEeecccc------cC-CC------CcchhhhHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018087. 214 YQCGHQHFAAGTKIKIPYS----AMVYAHSGLV------DC-CG------KNIIGYLHRAVKPANQLKLLEDAMMIYRIT 276 (520) Q Consensus 214 ~~~~~~~~~~~~~~~I~~~----aI~y~hSGL~------d~-~~------~~~~syL~~aik~~NqL~m~EDalVIyRi~ 276 (520) ....-..-.-..+|+|.= .+.=+|+|.. |+ +. ....|||+.|-+||-+|..-=-+|--=|+- T Consensus 222 -ti~~l~p~qm~rmKmPrm~~i~q~~~v~~g~~~~~L~~d~~~~~Pi~psn~GgSFL~~ae~pf~~l~~Al~sL~~qri~ 300 (569) T protein:vir:10 222 -KMVFADPWAIIPMKIPYWRPKSNLMPVHTGHKAYSLLDNPEERTPIETQNYGTSLLEYAYEPYMNLRSAIRSLKATRFN 300 (569) T ss_pred -ceeeechhhhhhhcccceeeccccchhhhhhhheeecccccccccccchhhhhHHHHHHHhHHHHHHHHHHhccchhhH Confidence 000000000011233332 2233333332 11 11 134689999999999999988888888999 Q ss_pred cCccceEEEccCCCCchHHHHHHHHHHHHhhcc---eeEeecCCCccccccccchhhhhhcccccCCCCC-cceeecCCC Q lcl|NC_018087. 277 RAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRN---RISYDARTGKVKNQANMMALTEDYWLQRRDGKAV-TEVETLPGM 352 (520) Q Consensus 277 RApeRRvFyIDvGnlpk~KAeqyl~~im~~~kn---klvYd~~TGev~d~~~~msmlEDywLpRReGgrg-TEIsTLpGg 352 (520) -|.--|+.=+-.-.|||.++.+|++.+-.-.|+ .+.==+..|+- |-=.----||-=+.|+| -.|+|=.+- T Consensus 301 dSv~~~~Itlnm~gM~p~qr~~y~r~lt~~LKr~~d~ie~a~~gg~~------~~~~~~H~LPv~gekq~~~tvDt~~~~ 374 (569) T protein:vir:10 301 ASKIDRIIGLAMNSLDPVKAADYSRTITQTLKRAADLMERRARGANN------MPTVTNTLLPIMGDGKGQMTIDTQTIQ 374 (569) T ss_pred HHHHhHHhhccccCCCHHHHhHHHHHHHHHHHHHHHHHHHHhccCcc------ccccceeeeeeecCccccccccccccc Confidence 999999999999999999999999987665554 33322334441 11111223677676774 477777777 Q ss_pred CCcChHHHHHHHHHHHHHhcCCChhhcc--CCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--C Q lcl|NC_018087. 353 TGMNEMDDILYFRKALYMALRVPLSRIP--DEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLK--R 428 (520) Q Consensus 353 ~nLgei~DV~YF~kkLy~aL~VP~SRl~--~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLk--g 428 (520) .+.--|+||...-|.|--|||+-+|=|. +.=++++ |.++-+. --..=..=-+-||.-.++.|..++-.++-.| + T Consensus 375 A~~~gIEdvM~~~R~LagaLGlD~SMlGwAD~LsGGL-GeGG~fr-tSaQaa~RS~~iRqa~~e~in~iidiH~~fKYge 452 (569) T protein:vir:10 375 ADINGIEDILTYMRQLAAALGLDYTLLGWADQMSGGL-GEGGFLR-TAIQAAMRASWIQQGVEEFIQRAIDIHLAFKYGK 452 (569) T ss_pred cCcccHHHHHHHHHHHHhhhccchhHhhHHHHhcccc-cccHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCc Confidence 7777999999999999999999999874 2222233 4432221 1111111124478888888888888888776 6 Q ss_pred CCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHh-------hc---ccchhhhHHHHHHHHhCCCHHHHHH Q lcl|NC_018087. 429 VITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLM-------EP---YIGKYISNHTAMKDFLQMSDEDIAA 498 (520) Q Consensus 429 i~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~-------~p---~vgky~S~~~i~k~IL~~tDeeI~~ 498 (520) +-.++|+. -+++|.+++.=.|-.+.+-..+|++.++-+ .. ..-.==-..|+.+++|.| |+.+. T Consensus 453 vf~~~drP-----~~V~F~s~~tAl~~E~~~n~~~raN~a~i~~Q~la~l~e~n~Lg~de~~m~y~l~d~~~~-De~~~- 525 (569) T protein:vir:10 453 VYPEGDRP-----YKIEFHSVNTALQQEHNDNRDSQANYATIVTQILDAVSNNSVLANSDAFKRYLFSDVLEI-DEKIS- 525 (569) T ss_pred ccCCCCcc-----eEEEeccchHHHHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccHHHHHHHHHHHhhc-chhHH- Confidence 66777666 789999999999999999999998876544 22 100000134566677777 33322 Q ss_pred HHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 499 ERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 499 ~~kqi~~E~~~~~~~~p~~e~~ 520 (520) +.+-.|. .++|++||- T Consensus 526 --e~l~ae~----~akp~DEe~ 541 (569) T protein:vir:10 526 --EALVNEL----KAKSEDDDH 541 (569) T ss_pred --HHHHhhc----CCCcchhHH Confidence 2223343 445777765 No 29 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=98.59 E-value=2.3e-07 Score=56.97 Aligned_cols=438 Identities=11% Similarity=0.104 Sum_probs=203.8 Q ss_pred ccccchhhhcchhhhhh--hHHHhhhccCCCcccCCCCCCCceeecccccccccccccc---cccccccchhHHHHHHHH Q lcl|NC_018087. 4 LADSDLKMFAFWHKVDD--TEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKL---YGSQDPTATSTRELINTY 78 (520) Q Consensus 4 ~~~~~l~~f~~~~~~~~--~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~---~~~~~~~~~~~~~LI~~Y 78 (520) .+++....|+-|+++-- +..+..+ .+....++|+.- .-|...---|.|-+... .....+...+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~---~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~-------- 68 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGLLKALKDVK-DHKKVNANDEDY---KYIDMWKRLYQGHYAEWHNLNYEHNGNPVN-------- 68 (496) T ss_pred ChhHHHHHHHHHHHHhccchhhHHHH-hcCCCcCCHHHH---HHHHHHHHHhcCCCchhhcchhccCCCccc-------- Confidence 67888888888888632 2222222 112222222111 11111000011111000 0000000000 Q ss_pred HHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEE Q lcl|NC_018087. 79 RSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFH 158 (520) Q Consensus 79 R~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~h 158 (520) +.....+-...+++..++=+ -.+||++.+++. ...+.++.+++--+|++...+++....+-|..|+| T Consensus 69 ~~~~~~n~~k~i~~~~a~~l-----~~~p~~i~~~d~--------~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~ 135 (496) T protein:vir:38 69 RRQLSMNLPKVTAKYMSKLL-----FNEKVKINIDDK--------AAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIK 135 (496) T ss_pred cceeecchHHHHHHHHhhhh-----hCCcceEeeCCh--------HHHHHHHHHHhccCHHHHHHHHHHHHhhhCcEEEE Confidence 00011122222333332211 136777877764 33445566777778999999999999999999999 Q ss_pred EeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccc----cc--e--e------------cceeecCccccccccc Q lcl|NC_018087. 159 KIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVV----KG--Y--R------------EYFLYDTELESYQCGH 218 (520) Q Consensus 159 kvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~----~~--~--~------------ey~~y~~~~~~~~~~~ 218 (520) ..+|.+ |-..+..++|.++.+++.-.......+.+. .+ + . +|.+|.... +. T Consensus 136 ~~~D~~----~~~~i~~v~~~~~~P~~~~~~~~~~~~f~~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~-~~---- 206 (496) T protein:vir:38 136 VYHDGN----KNVKVSFATADCMYPLSNDSENVDECVIANSFHKNNKYYTLLEWNEWQGDVYTVTTELYQSDD-PN---- 206 (496) T ss_pred EEEcCC----CcEEEEEEcccceEEEEecCCcEEEEEEEEEEEeCCeEEEEEEEEEEeCceEEEEEEEEecCC-cc---- Confidence 999843 446789999999988643222211111110 00 0 0 111121110 00 Q ss_pred ceecCCccee-------c---------CcccEEEeeccc---ccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_018087. 219 QHFAAGTKIK-------I---------PYSAMVYAHSGL---VDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAP 279 (520) Q Consensus 219 ~~~~~~~~~~-------I---------~~~aI~y~hSGL---~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RAp 279 (520) ..+..|. + +..-++|.--.. .++.....+|=|+.++.....|-..=..+ .+-++.- T Consensus 207 ---~~g~~v~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~--~~~~~~~ 281 (496) T protein:vir:38 207 ---ELGTKVSLTLLFDDIEPVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSY--YQEFKLG 281 (496) T ss_pred ---ccCccccccccccccccceeecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHH--HHHHhhc Confidence 0011111 1 111122221000 11222334677888887777775544443 3667776 Q ss_pred cceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccc-cCCCCCcceeecCCCCCcCh- Q lcl|NC_018087. 280 DRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQR-RDGKAVTEVETLPGMTGMNE- 357 (520) Q Consensus 280 eRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpR-ReGgrgTEIsTLpGg~nLge- 357 (520) .+|+|. +.. ++..-. |..++... -++.-.+.|-... -+++.|.-|+++.+.-...+ T Consensus 282 ~~~i~v-~~~-------------~l~~~~-----~~~g~~~~---~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~~ 339 (496) T protein:vir:38 282 KKKVLV-PSS-------------FVKTAV-----NLDGSTTQ---YFDSTDEAFFLYQGDQDDNGKAIKDISVEIRSTEF 339 (496) T ss_pred ccceec-chH-------------HhhccC-----CCCCcccc---CCCCccceEEEeecCCCcccccceeeccccCHHHH Confidence 666654 211 111000 11111111 0111111111111 11222334666665322222 Q ss_pred HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHhcCCC Q lcl|NC_018087. 358 MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKS-------NLLLKRVI 430 (520) Q Consensus 358 i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~-------QLiLkgi~ 430 (520) ..-+..+.+.+....++|-+-+..++++.- -+++|.-....-..-+.+.++.|...+.++++. .+.++|.. T Consensus 340 ~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~--tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~~ 417 (496) T protein:vir:38 340 IESINAMLRIYAMQVGLSAGTFTFDENGLK--TATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKFIEAYSGEV 417 (496) T ss_pred HHHHHHHHHHHHHhhCCChhhcCCCccccc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Confidence 334667778899999999988876554321 245654443333333444444444444444443 34445432 Q ss_pred ChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcC Q lcl|NC_018087. 431 TEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDK 510 (520) Q Consensus 431 t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~ 510 (520) - + ...+.++|...-.-.+.. .++.+.++.. .| .+|.+++.++....||+|.+++.++|++|.... T Consensus 418 ~----~--~~~i~v~f~d~i~~d~~~-------~~~~~~~~~~-~G-iiS~et~l~~~~~~~d~ea~~el~ri~~E~~~~ 482 (496) T protein:vir:38 418 V----E--LDTITVDFDDSIAQDEDT-------TINRYTNAKN-QG-MIPLKIALQRAWNITEAEADEWAEMLAKEKQAE 482 (496) T ss_pred C----C--ccceEEEeCCCCCCCHHH-------HHHHHHHHHh-cC-CCCHHHHHHhcCCCChHHHHHHHHHHHHhhhcc Confidence 2 1 234788887432222222 2222222211 13 479999988888999999999999999988755 Q ss_pred CccCCccccC Q lcl|NC_018087. 511 IFNPPEPEEI 520 (520) Q Consensus 511 ~~~~p~~e~~ 520 (520) . +.++...+ T Consensus 483 ~-~~~d~~~~ 491 (496) T protein:vir:38 483 M-PNNDMNGI 491 (496) T ss_pred C-ccccccCC Confidence 2 32222222 No 30 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=98.59 E-value=2.3e-07 Score=56.94 Aligned_cols=444 Identities=12% Similarity=0.080 Sum_probs=235.9 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRS 80 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ 80 (520) |+++ |.....|.+-.-.....-...+ ..-+|+..-. ..++..........-..+...|..+=|. T Consensus 1 Mn~i-Dr~i~~~sP~~a~~R~~ar~~~----------~~y~aa~~~r-----~~~~~~~~~s~~~~i~~~~~~lr~RaRd 64 (548) T protein:vir:95 1 MNLI-DRLLEPLAPELVARRLAAREAI----------QAYEAARPGR-----THKAKRQPLGADTSLQKSAVSMREQCRK 64 (548) T ss_pred CchH-HhHhhhcchHHHHHHHHhHHHh----------ccccccCccc-----cccccCCCCChHHHHHHHHHHHHHHHHH Confidence 6544 4555555432211111000000 1122222111 1111111111112222356678888888 Q ss_pred H-hhccchhHHHHhhhceeeEec-CCCcEEEEeeccchhhhHHHHHHHHHHHHHHH------HhcchhhhHHHHHhhccc Q lcl|NC_018087. 81 L-LNNYEVDNAVQEIVSDAIVYE-EGFDVVSIDLDQTAFTENIRNLISDEFNSVLN------MLNFQRKGSDHFKRWYVD 152 (520) Q Consensus 81 m-a~~pEvd~Ai~eIvneaiv~d-~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~------ll~f~k~g~~~fRrWYvD 152 (520) | .++|-+..||+-+++-+|=.. -.-.|-.+..+. +..+.+.+.|...|..-++ .++|..--...+|.|.+| T Consensus 65 L~rNn~~a~~av~~~~~nvVG~~G~~i~p~~l~~d~-~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~d 143 (548) T protein:vir:95 65 LDEDHDLVTGLLDRLEERVVGGSGIGVEPLPLRLDG-SVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRD 143 (548) T ss_pred HHhcChHHHHHHHHHHHhccCccccceeeeecCCCH-HHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhC Confidence 8 789999999999998877421 233444444443 3455566678888877664 455666566689999999 Q ss_pred cceeEEEeeecCCC-CCCe---eeeEecCccceeeeeeccCCCCcccccccce--------ecceeecCcccccccccce Q lcl|NC_018087. 153 SRVFFHKIINPNRP-KDGI---IELRRLDPRNVQFVRELDTKMENGVKVVKGY--------REYFLYDTELESYQCGHQH 220 (520) Q Consensus 153 gri~~hkvid~~~~-k~GI---~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~--------~ey~~y~~~~~~~~~~~~~ 220 (520) |-+|..+..++... ..|. ..|..|+|..|.-= ....+..+..|+ .-|+++....-..... T Consensus 144 GE~f~~~~~~~~~~~~~g~~~~~~lqliepd~l~~~-----~~~~~~~i~~GIE~D~~Grp~aY~i~~~hPgd~~~~--- 215 (548) T protein:vir:95 144 GEGLAQKLMGRVPNYTFATSVPFALELLEPDYLPFS-----YNNLSKGIVQGIERDTWRRKRAYHLLKDHPGNLQTL--- 215 (548) T ss_pred CceEEEeeecccccccCCcccceEEEEechhhcCCC-----CCCCCCceeeeeEECCCCceEEEEEeecCCCccccc--- Confidence 99999998875432 2232 47888999877531 111222333443 3577775332111110 Q ss_pred ecCCcceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHH Q lcl|NC_018087. 221 FAAGTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHM 300 (520) Q Consensus 221 ~~~~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl 300 (520) ......++||.+.|+|+.-- ..+.-.-.+|.|..+++.+.+|.-.+||..+-...-|-.=-+..=+-+... . T Consensus 216 ~~~~~~~rvpA~~VlHif~~-~r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~~~---~---- 287 (548) T protein:vir:95 216 GGSLAVKRVEAERIIHIAYR-KRIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPDSY---T---- 287 (548) T ss_pred ccccceeeechhHheecccc-cCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccc---c---- Confidence 11123478999999998644 334333457899999999999999999999998888876433332222110 0 Q ss_pred HHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCC---------cChHHHHHH-HHHHHHH Q lcl|NC_018087. 301 QHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTG---------MNEMDDILY-FRKALYM 370 (520) Q Consensus 301 ~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~n---------Lgei~DV~Y-F~kkLy~ 370 (520) ...+ -.+.....+| + -|+-|.+|+.|+. -+..++... ..+.+=. T Consensus 288 --------------~~~~-~~~~~~~~~~-~----------pG~iv~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~IAa 341 (548) T protein:vir:95 288 --------------VEPG-KDRKNRTIPI-A----------PGMVFDDLEPGEDVGMIESNRPNPFLEGFRNGQLRMIGA 341 (548) T ss_pred --------------CCCC-cccccccccc-c----------CCccccccCCCceeeecCCCCCCCCHHHHHHHHHHHHHh Confidence 0000 0011111111 1 1333445544433 333333333 3333556 Q ss_pred hcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHhcCCCChhhHHhhhhceEEEe Q lcl|NC_018087. 371 ALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKS----NLLLKRVITEDEWEAELNNIKIVF 446 (520) Q Consensus 371 aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~----QLiLkgi~t~eew~~~~~~I~~~f 446 (520) +|+||-+-|..+-+.| =+.+.-.-+.|-+.+.++|..|..-|..++-. ..+|.|.++.-.|..-.......| T Consensus 342 glGipYe~ltgD~s~n----YSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W 417 (548) T protein:vir:95 342 GTRSTYSSVSRAYDGT----YSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVY 417 (548) T ss_pred hcCCCHHHHhcccchh----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeee Confidence 7999999887764322 23344456679999999999988777776433 466888775322222233445555 Q ss_pred ec--cchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhh----hcCCccCCcccc- Q lcl|NC_018087. 447 HK--NSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEEL----SDKIFNPPEPEE- 519 (520) Q Consensus 447 ~~--Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~----~~~~~~~p~~e~- 519 (520) .. --+.-.+||+.-...+++. -+-|.+-+..+ .+..-+| ..+|+..|. .-++..+..+.. T Consensus 418 ~~P~~~~iDP~Kea~A~~~~i~~---------Gl~T~~~~~a~-~G~D~~e---v~~q~a~E~~~~~~~GL~~~~~~~~~ 484 (548) T protein:vir:95 418 QGPVMPWINPMHEANAWELLVKA---------GFADEAEVARA-RGRDPRE---LKKSRETEIKANRAAGLVFSSDAYHQ 484 (548) T ss_pred ecCCccccChHHHHHHHHHHHHc---------CCCCHHHHHHH-hCCCHHH---HHHHHHHHHHHHHHcCCCCCCccccc Confidence 33 3455667777666555442 22355544444 3443333 333333332 223221111100 Q ss_pred C Q lcl|NC_018087. 520 I 520 (520) Q Consensus 520 ~ 520 (520) . T Consensus 485 ~ 485 (548) T protein:vir:95 485 L 485 (548) T ss_pred c Confidence 0 No 31 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=98.58 E-value=2.4e-07 Score=56.90 Aligned_cols=454 Identities=11% Similarity=0.072 Sum_probs=199.1 Q ss_pred Cccccccchhhhcchhhhhh--------hHHHhhhccC-CCcccCCCCCCCceeecccccccc-cccccccccccccchh Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDD--------TEYDKIINDK-AESITAPKFDDGATEVDSQDIAYN-GVFQKLYGSQDPTATS 70 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~--------~~~~~~~~~~-~~s~~~p~~~dg~~~i~~~~~a~~-g~~~~~~~~~~~~~~~ 70 (520) |---.-+.++-|.......+ .++.+.+..- .-+-..++...|.+.....+++.. .|-.++... -..+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~p~~~~~~~~~~~~~~-p~~~~~ 79 (576) T protein:vir:96 1 MVTRLADIFKRLRLGRDYEDIIDTVPIDDGLQANIRNIEEKSKELNKSLYGKQQAYAEPFLEVMDTNPEFRTK-RSYMKN 79 (576) T ss_pred ChhhHHHHHHHHhccCccccchhhhhcccChhHHHHHhhhhhhhhccccCCccchhhcceeeeeecCCCcccc-Ccchhh Confidence 11000112222220000000 0000000000 012234444444444333332100 111111110 001111 Q ss_pred HHHHHHHHHHHhhccchhHHHHhhhceeeEe------cCCCcEEEEeeccc--hhhhHHHHHHHHHHHHHHHHhcch--- Q lcl|NC_018087. 71 TRELINTYRSLLNNYEVDNAVQEIVSDAIVY------EEGFDVVSIDLDQT--AFTENIRNLISDEFNSVLNMLNFQ--- 139 (520) Q Consensus 71 ~~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~------d~~~~~V~l~Ld~~--~~s~~ik~~I~eeF~~i~~ll~f~--- 139 (520) -..+-+.-+.++.+|=|..||+.|.+.+.++ ..+.-...|.|.+. ..++.-+..+...-..+.+++... T Consensus 80 ~~~~~~~l~~~~~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~ 159 (576) T protein:vir:96 80 SDNLHDVLKQFGNNPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDID 159 (576) T ss_pred hhhhHHHHHHhhcCHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCc Confidence 1122233355567899999999999877653 22222223333333 223323333332222333443321 Q ss_pred -hhhHHHHHh----hccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCccccc Q lcl|NC_018087. 140 -RKGSDHFKR----WYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESY 214 (520) Q Consensus 140 -k~g~~~fRr----WYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~ 214 (520) .+..++++. +++-|.-|+.++.+.+ ....+++|.+|||..|.+++... +.. +.....|+.+.. T Consensus 160 ~~t~~~f~~~lv~dlll~Gna~~~i~~~rd-~~g~~~~L~pl~p~~V~v~~~~d-----g~~-~~~~~~~~~~~~----- 227 (576) T protein:vir:96 160 RDSFQSFCRKIVRDTYTYDQVNFEKVFNKK-NATTMDKFIAVDPSTIFYATDKN-----GKI-IKGGKRFVQVIN----- 227 (576) T ss_pred cccHHHHHHHHHHHHHhcCCeEEEEEEecC-CCCceEEEEEeCCceeEEEECCC-----Cce-eeeeeEEEEecC----- Confidence 145555555 5677999999998843 33349999999999999975432 211 111111111111 Q ss_pred ccccceecCCcceecCcccEEEeeccc-ccCC-CCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCC-CC Q lcl|NC_018087. 215 QCGHQHFAAGTKIKIPYSAMVYAHSGL-VDCC-GKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTG-NM 291 (520) Q Consensus 215 ~~~~~~~~~~~~~~I~~~aI~y~hSGL-~d~~-~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvG-nl 291 (520) .+....++.+.|+|.--+. .|.. +...+|-|+.|.+++.....++....=+=---|.-+-|..++.+ .+ T Consensus 228 --------~~~~~~~~~~dii~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~l 299 (576) T protein:vir:96 228 --------KKVVASFTSREMAMGIRNPRTELSSSGYGLSEVEIAMKQFIAYNNTETFNDRFFSHGGTTRGILQIKSEQQQ 299 (576) T ss_pred --------CceEEEecccceEEEeecCCCCcccCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCC Confidence 1122467777877632222 2222 44567889999999888887777665443444566777777764 45 Q ss_pred chHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHH Q lcl|NC_018087. 292 PARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYM 370 (520) Q Consensus 292 pk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~ 370 (520) .+..+++-.+.+-..|+- ..+..+..-.++ .|.+++.|.-...-.+ ++-.+|..+.+.+ T Consensus 300 s~e~~~~lr~~~~~~~~G----------~~nag~~p~vl~----------~G~~~~~ls~~~~d~qfle~~~~~~~~Ia~ 359 (576) T protein:vir:96 300 SQRALENFKREWKSSFSG----------INGSWQVPVVMA----------DDIKFVNMTPTANDMQFEKWLTYLINIISA 359 (576) T ss_pred CHHHHHHHHHHHHHHhcc----------ccccccceeecC----------CCceEEeccCChhhHHHHHHHHHhHHHHHH Confidence 555554444444334442 011111111222 1567777743333333 4444677899999 Q ss_pred hcCCChhhccCC-Ccccccc-ccchhhHH---HH--HHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhce Q lcl|NC_018087. 371 ALRVPLSRIPDE-QTQNVFD-MSTAISRD---EL--SFDKF-ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNI 442 (520) Q Consensus 371 aL~VP~SRl~~~-~~~~~~G-~~~eItRD---El--kF~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I 442 (520) +.+||...|... .+..+++ .++.+|+. +. .|.++ +.-+..++...|.. .|+ +. + ...+ T Consensus 360 afgVPp~~lG~~~~~~~~g~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~----~Ll-----~~--~---~~~~ 425 (576) T protein:vir:96 360 LYGIDPAEIGFPNRGGATGGKGGNTLNEADPGKKQQQSQNKGLQPLLRFIEDLINT----HII-----SE--Y---SDKY 425 (576) T ss_pred HhCCCHHHccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHh----hhc-----hh--c---cCce Confidence 999999999643 3322211 12333332 22 24443 44444444444433 222 21 1 1346 Q ss_pred EEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHH-------------------HHHHHHH Q lcl|NC_018087. 443 KIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDI-------------------AAERKLI 503 (520) Q Consensus 443 ~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI-------------------~~~~kqi 503 (520) .++|.+...=++ .+...+...+ ..-+++..-++. .+.+..-+= +..+.+- T Consensus 426 ~~~f~r~d~~~~-------~e~~~~~~~~---~~G~lT~NE~R~-~~gl~piegGD~~~~~~~~~~~~~~~~~~~~e~~~ 494 (576) T protein:vir:96 426 VFQFVGGDTKSE-------LDKIKILQEE---VKTYKTVNEARK-EKGLKPIEGGDVLLDGSFIQSMSLNTQKEQYEDTK 494 (576) T ss_pred EEEeccCCHHHH-------HHHHHHHHHH---hcCccCHHHHHH-HhCCCCCCCcceeccccccccccccccCCCCCCcc Confidence 777876654322 1122222222 112557776664 466643210 0000000 Q ss_pred HHhhhcCC-----ccCCc--cccC Q lcl|NC_018087. 504 DEELSDKI-----FNPPE--PEEI 520 (520) Q Consensus 504 ~~E~~~~~-----~~~p~--~e~~ 520 (520) +++..++. -++|. .++- T Consensus 495 ~~~~~~~~~~~~~~~~~~~~~~~s 518 (576) T protein:vir:96 495 QKERFDMIQQFLNSPDDEEPQQES 518 (576) T ss_pred ccccccccccccCCCCCCCCCCCC Confidence 01000000 00111 0000 No 32 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=98.58 E-value=2.5e-07 Score=56.79 Aligned_cols=376 Identities=15% Similarity=0.166 Sum_probs=186.7 Q ss_pred hhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchh Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVD 88 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd 88 (520) |.||+.|.+. .+..|...|.-. ..+ .|+..+.++. .+..+++|.|. T Consensus 1 M~~f~~~~~~----------~~~~~~~~~~~~---~~~-------~~~~~~~~v~--------------~~~al~~~~V~ 46 (397) T protein:vir:38 1 MPLLKLNKSH----------SQGFSLNDPDWV---NFL-------TGGEAQKYVS--------------ADTALKNSDIF 46 (397) T ss_pred Ccchhhhhcc----------cCcccCCchhhh---hhh-------cCCcCCceec--------------hHHhhccHHHH Confidence 6667655322 122233322111 011 1111111211 12335799999 Q ss_pred HHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhH----HHHHhhccccceeEEEeeecC Q lcl|NC_018087. 89 NAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGS----DHFKRWYVDSRVFFHKIINPN 164 (520) Q Consensus 89 ~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~----~~fRrWYvDgri~~hkvid~~ 164 (520) .||.-|.+.+..++-. .++ ..+..++.-=|=...++ .+++.+.++|.-|+.++-|. T Consensus 47 ~~v~~ia~~ia~~p~~-------~~~------------~~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~- 106 (397) T protein:vir:38 47 SLIMQLSGDLAMVRYT-------SES------------DRSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNT- 106 (397) T ss_pred HHHHHHHHHHhhCccc-------ccc------------cHHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECC- Confidence 9999999888654321 111 11112221112222333 45666889999999988662 Q ss_pred CCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccccC Q lcl|NC_018087. 165 RPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDC 244 (520) Q Consensus 165 ~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~ 244 (520) ...+++|.+|||..++.++... +...+ |.+.-. ....+..+.++.+.|+|.. ..++ T Consensus 107 --~g~~~~l~~l~~~~v~i~~~~~-----~~~~~-----y~~~~~----------~~~~~~~~~~~~~eiih~~--~~~~ 162 (397) T protein:vir:38 107 --NGVDLSWEYLRPSQVQPMLLQD-----GSGLI-----YNINFD----------EPAIGYMENVPAADVIHIR--LLSK 162 (397) T ss_pred --CCcEEEEEEEcCceeEEEEcCC-----CceEE-----EEEEec----------cccccceeEecCccEEEec--CCCC Confidence 2358999999999998754332 11110 111100 0012234678999998885 3445 Q ss_pred CCC-cchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccc Q lcl|NC_018087. 245 CGK-NIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQ 323 (520) Q Consensus 245 ~~~-~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~ 323 (520) ++. ...|-|..|.+.+.....+++...-+---.+.-+-++.++.+ +.+. +.+-+++.+..++. | ++. T Consensus 163 ~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~-~~~e-~~~~~~~~~~~~~~--------~--~n~ 230 (397) T protein:vir:38 163 NGGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKG-GLLD-AETRIARSKEISKQ--------I--HNS 230 (397) T ss_pred CCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC-CCHH-HHHHHHHHHHHHhc--------c--ccc Confidence 554 458899999999999999988877655556666777777765 3333 34444444443322 1 111 Q ss_pred cccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHH Q lcl|NC_018087. 324 ANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFD 402 (520) Q Consensus 324 ~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~ 402 (520) .+.+ .+ + .|.+++.|.-..+..+ ++=.++..+.+.++++||..-|....+. .+.+.....-|. T Consensus 231 ~~~~-vl-----~-----~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~-----~~~~e~~~~~~~ 294 (397) T protein:vir:38 231 DGPV-VI-----D-----ALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQ-----QSSITQISGQYA 294 (397) T ss_pred CCce-ec-----C-----CCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc-----ccHHHHHHHHHH Confidence 1111 11 1 2566776654444444 4556788999999999999998643321 122322222232 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHH Q lcl|NC_018087. 403 KFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNH 482 (520) Q Consensus 403 KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~ 482 (520) +-+.-+...+.. .|-.. +.++.+|+ +.|.-+- -+..|.+.++.+-. +.+++.+ T Consensus 295 ~~l~P~~~~ie~----~ln~~-----l~~~~~~~-------~~~~~~~---------d~~~~~~~~~~~~~--~G~~t~n 347 (397) T protein:vir:38 295 KSLNRYVQAIVG----ELNDK-----LHANISAN-------IRFAIDA---------MGDQYASTISSSVK--GGTIAGN 347 (397) T ss_pred HHHHHHHHHHHH----HHHHh-----ccChhccc-------ccccccC---------CHHHHHHHHHHHHh--CCCcCHH Confidence 223333333322 22222 23333332 2222111 13455665555422 2467888 Q ss_pred HHHHHHhCCCHH---HHHH---HH---------HHHHHhhhcCCccCCccc Q lcl|NC_018087. 483 TAMKDFLQMSDE---DIAA---ER---------KLIDEELSDKIFNPPEPE 518 (520) Q Consensus 483 ~i~k~IL~~tDe---eI~~---~~---------kqi~~E~~~~~~~~p~~e 518 (520) -+++ +|++.+- |.-. .. ++-+.+..++--+..++| T Consensus 348 E~R~-~lg~~p~~~~d~~~~~~~~~~~~~~~~~~~g~~~~~~~~e~~~~~~ 397 (397) T protein:vir:38 348 QARF-ILQNSGYLAKDLPDPEKEPQQAIQLIQQEGGENDGNNSDERGSDPE 397 (397) T ss_pred HHHH-HhCCCCCCCCccccccccccccccccccccCCCCCCCCCCCCCCCC Confidence 8885 4666431 1000 00 000001111111112222 No 33 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=98.57 E-value=2.6e-07 Score=56.70 Aligned_cols=444 Identities=9% Similarity=0.060 Sum_probs=198.5 Q ss_pred Cccccccchhh--------------------hcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccc Q lcl|NC_018087. 1 MSMLADSDLKM--------------------FAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKL 60 (520) Q Consensus 1 ~~~~~~~~l~~--------------------f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~ 60 (520) |.=|| ..+.| +.+..-+.+....+.++++..+..+.-..+=+.... ..+++ T Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~--------~~~~~ 71 (563) T protein:vir:95 1 MADLF-KQFRLGKDYGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMD--------TNPEF 71 (563) T ss_pred Chhhh-hhhhcccccccccccceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhc--------ccccc Confidence 11111 11111 111111111111222222222222222222111111 11222 Q ss_pred cccccccchhHHHHHHHHHHHhhccchhHHHHhhhceeeEec------CCCcEEEEeeccchh--hhHHHHHHHHHHHHH Q lcl|NC_018087. 61 YGSQDPTATSTRELINTYRSLLNNYEVDNAVQEIVSDAIVYE------EGFDVVSIDLDQTAF--TENIRNLISDEFNSV 132 (520) Q Consensus 61 ~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~d------~~~~~V~l~Ld~~~~--s~~ik~~I~eeF~~i 132 (520) +.. .+-+.+...|-+.=|.++.+|-|..||+.+.+.+.++- ++.-...|.|.+... ++.-+..+. ..... T Consensus 72 ~~~-~~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~-~l~~~ 149 (563) T protein:vir:95 72 RDK-RSYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMK-RIEDF 149 (563) T ss_pred ccc-ccCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhH-HHHHH Confidence 222 22233444444445566678999999999999876531 111112333433321 222222222 22222 Q ss_pred HHHhcch-----hhhHHHHHh----hccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceec Q lcl|NC_018087. 133 LNMLNFQ-----RKGSDHFKR----WYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYRE 203 (520) Q Consensus 133 ~~ll~f~-----k~g~~~fRr----WYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~e 203 (520) +..+..+ .+..+++++ .++.|.-|+.+++.. +...-+.+|.+|||..|..++.-..... +.... T Consensus 150 l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~r-d~~G~~~~L~pl~p~~V~v~~~~~g~~~------~~~~~ 222 (563) T protein:vir:95 150 IVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNK-NNKTKLEKFIAVDPSTIFYATDKKGKII------KGGKR 222 (563) T ss_pred hhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEe-cCCCceEEEEEeCCceeEEEECCCCcee------cccee Confidence 2222222 134555444 566788899888873 3333599999999999998655332111 11111 Q ss_pred ceeecCcccccccccceecCCcceecCcccEEEeecc-cccCC-CCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccc Q lcl|NC_018087. 204 YFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSG-LVDCC-GKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDR 281 (520) Q Consensus 204 y~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSG-L~d~~-~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeR 281 (520) |+.+.. .+....++.+.|+|..-+ ..|+. +...+|-|+.|++++.....+|+...=+=---+.-+ T Consensus 223 y~~~~~-------------g~~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~ 289 (563) T protein:vir:95 223 FVQVVD-------------KRVVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTR 289 (563) T ss_pred EEEEeC-------------CceeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCc Confidence 111100 111246677776653222 23322 455688999999999998888887665544556677 Q ss_pred eEEEccCCC-CchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HH Q lcl|NC_018087. 282 RVFYIDTGN-MPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MD 359 (520) Q Consensus 282 RvFyIDvGn-lpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~ 359 (520) -|.-++.+. |.+..+++.-+.+-..|+. ..+..+..-.+ +.|.+++.|.-...-.+ ++ T Consensus 290 giL~~~~~~~ls~e~~~~~~~~~~~~~~G----------~~nagk~~~vl----------~~G~~~~~l~~~~~d~qfle 349 (563) T protein:vir:95 290 GILQIRSDQQQSQHALENFKREWKSSLSG----------INGSWQIPVVM----------ADDIKFVNMTPTANDMQFEK 349 (563) T ss_pred eEEEeCCCCCCCHHHHHHHHHHHHHHhcc----------ccccccceEEc----------CCCceEEeccCChhHHHHHH Confidence 778888764 4544444433333333432 11111111111 12456666654333334 44 Q ss_pred HHHHHHHHHHHhcCCChhhccCCCccccccc--cchhhHH---HHH--HHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCC Q lcl|NC_018087. 360 DILYFRKALYMALRVPLSRIPDEQTQNVFDM--STAISRD---ELS--FDKF-ISELQHKFEEIFLSPLKSNLLLKRVIT 431 (520) Q Consensus 360 DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~--~~eItRD---Elk--F~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t 431 (520) --+|..+...++.+||...|....+...+|. ++.+++. +.. |... +.-+..++...|.. .| ++ T Consensus 350 ~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~----~L-----~~ 420 (563) T protein:vir:95 350 WLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNR----HI-----IS 420 (563) T ss_pred HHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHh----hh-----ch Confidence 5567889999999999999964332222222 2333332 222 4333 34444444443333 22 22 Q ss_pred hhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHH---H----------H- Q lcl|NC_018087. 432 EDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDED---I----------A- 497 (520) Q Consensus 432 ~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDee---I----------~- 497 (520) ..+ .++.++|.+...=+. .+.++...-...-++|..-+++ .++|.+-+ + . T Consensus 421 ~~~-----~~~~~~f~r~D~~~~----------~e~~~~~~~~~~G~lT~NE~R~-~~gl~Pi~gGD~~~~~~~~~~~~~ 484 (563) T protein:vir:95 421 EYG-----DKYTFQFVGGDTKSA----------TDKLNILKLETQIFKTVNEARE-EQGKKPIEGGDIILDASFLQGTAQ 484 (563) T ss_pred hcc-----cccEEEeccCCHHHH----------HHHHHHHHHhcCCccCHHHHHH-HhCCCCCCCcceeecccccccccc Confidence 221 357778876644222 2222211111123456666664 35664322 0 0 Q ss_pred -HHHHHHHHhhh-----------cCCccCCccccC Q lcl|NC_018087. 498 -AERKLIDEELS-----------DKIFNPPEPEEI 520 (520) Q Consensus 498 -~~~kqi~~E~~-----------~~~~~~p~~e~~ 520 (520) +..++.+.+.. ++--.+|+.|.- T Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (563) T protein:vir:95 485 LQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQS 519 (563) T ss_pred cccccCCCccccchhhhhcccccCCCCCCCCCCCC Confidence 00000000000 000111111111 No 34 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=98.57 E-value=2.6e-07 Score=56.70 Aligned_cols=444 Identities=9% Similarity=0.060 Sum_probs=198.5 Q ss_pred Cccccccchhh--------------------hcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccc Q lcl|NC_018087. 1 MSMLADSDLKM--------------------FAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKL 60 (520) Q Consensus 1 ~~~~~~~~l~~--------------------f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~ 60 (520) |.=|| ..+.| +.+..-+.+....+.++++..+..+.-..+=+.... ..+++ T Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~--------~~~~~ 71 (563) T protein:vir:99 1 MADLF-KQFRLGKDYGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMD--------TNPEF 71 (563) T ss_pred Chhhh-hhhhcccccccccccceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhc--------ccccc Confidence 11111 11111 111111111111222222222222222222111111 11222 Q ss_pred cccccccchhHHHHHHHHHHHhhccchhHHHHhhhceeeEec------CCCcEEEEeeccchh--hhHHHHHHHHHHHHH Q lcl|NC_018087. 61 YGSQDPTATSTRELINTYRSLLNNYEVDNAVQEIVSDAIVYE------EGFDVVSIDLDQTAF--TENIRNLISDEFNSV 132 (520) Q Consensus 61 ~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~d------~~~~~V~l~Ld~~~~--s~~ik~~I~eeF~~i 132 (520) +.. .+-+.+...|-+.=|.++.+|-|..||+.+.+.+.++- ++.-...|.|.+... ++.-+..+. ..... T Consensus 72 ~~~-~~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~-~l~~~ 149 (563) T protein:vir:99 72 RDK-RSYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMK-RIEDF 149 (563) T ss_pred ccc-ccCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhH-HHHHH Confidence 222 22233444444445566678999999999999876531 111112333433321 222222222 22222 Q ss_pred HHHhcch-----hhhHHHHHh----hccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceec Q lcl|NC_018087. 133 LNMLNFQ-----RKGSDHFKR----WYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYRE 203 (520) Q Consensus 133 ~~ll~f~-----k~g~~~fRr----WYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~e 203 (520) +..+..+ .+..+++++ .++.|.-|+.+++.. +...-+.+|.+|||..|..++.-..... +.... T Consensus 150 l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~r-d~~G~~~~L~pl~p~~V~v~~~~~g~~~------~~~~~ 222 (563) T protein:vir:99 150 IVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNK-NNKTKLEKFIAVDPSTIFYATDKKGKII------KGGKR 222 (563) T ss_pred hhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEe-cCCCceEEEEEeCCceeEEEECCCCcee------cccee Confidence 2222222 134555444 566788899888873 3333599999999999998655332111 11111 Q ss_pred ceeecCcccccccccceecCCcceecCcccEEEeecc-cccCC-CCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccc Q lcl|NC_018087. 204 YFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSG-LVDCC-GKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDR 281 (520) Q Consensus 204 y~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSG-L~d~~-~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeR 281 (520) |+.+.. .+....++.+.|+|..-+ ..|+. +...+|-|+.|++++.....+|+...=+=---+.-+ T Consensus 223 y~~~~~-------------g~~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~ 289 (563) T protein:vir:99 223 FVQVVD-------------KRVVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTR 289 (563) T ss_pred EEEEeC-------------CceeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCc Confidence 111100 111246677776653222 23322 455688999999999998888887665544556677 Q ss_pred eEEEccCCC-CchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HH Q lcl|NC_018087. 282 RVFYIDTGN-MPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MD 359 (520) Q Consensus 282 RvFyIDvGn-lpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~ 359 (520) -|.-++.+. |.+..+++.-+.+-..|+. ..+..+..-.+ +.|.+++.|.-...-.+ ++ T Consensus 290 giL~~~~~~~ls~e~~~~~~~~~~~~~~G----------~~nagk~~~vl----------~~G~~~~~l~~~~~d~qfle 349 (563) T protein:vir:99 290 GILQIRSDQQQSQHALENFKREWKSSLSG----------INGSWQIPVVM----------ADDIKFVNMTPTANDMQFEK 349 (563) T ss_pred eEEEeCCCCCCCHHHHHHHHHHHHHHhcc----------ccccccceEEc----------CCCceEEeccCChhHHHHHH Confidence 778888764 4544444433333333432 11111111111 12456666654333334 44 Q ss_pred HHHHHHHHHHHhcCCChhhccCCCccccccc--cchhhHH---HHH--HHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCC Q lcl|NC_018087. 360 DILYFRKALYMALRVPLSRIPDEQTQNVFDM--STAISRD---ELS--FDKF-ISELQHKFEEIFLSPLKSNLLLKRVIT 431 (520) Q Consensus 360 DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~--~~eItRD---Elk--F~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t 431 (520) --+|..+...++.+||...|....+...+|. ++.+++. +.. |... +.-+..++...|.. .| ++ T Consensus 350 ~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~----~L-----~~ 420 (563) T protein:vir:99 350 WLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNR----HI-----IS 420 (563) T ss_pred HHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHh----hh-----ch Confidence 5567889999999999999964332222222 2333332 222 4333 34444444443333 22 22 Q ss_pred hhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHH---H----------H- Q lcl|NC_018087. 432 EDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDED---I----------A- 497 (520) Q Consensus 432 ~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDee---I----------~- 497 (520) ..+ .++.++|.+...=+. .+.++...-...-++|..-+++ .++|.+-+ + . T Consensus 421 ~~~-----~~~~~~f~r~D~~~~----------~e~~~~~~~~~~G~lT~NE~R~-~~gl~Pi~gGD~~~~~~~~~~~~~ 484 (563) T protein:vir:99 421 EYG-----DKYTFQFVGGDTKSA----------TDKLNILKLETQIFKTVNEARE-EQGKKPIEGGDIILDASFLQGTAQ 484 (563) T ss_pred hcc-----cccEEEeccCCHHHH----------HHHHHHHHHhcCCccCHHHHHH-HhCCCCCCCcceeecccccccccc Confidence 221 357778876644222 2222211111123456666664 35664322 0 0 Q ss_pred -HHHHHHHHhhh-----------cCCccCCccccC Q lcl|NC_018087. 498 -AERKLIDEELS-----------DKIFNPPEPEEI 520 (520) Q Consensus 498 -~~~kqi~~E~~-----------~~~~~~p~~e~~ 520 (520) +..++.+.+.. ++--.+|+.|.- T Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (563) T protein:vir:99 485 LQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQS 519 (563) T ss_pred cccccCCCccccchhhhhcccccCCCCCCCCCCCC Confidence 00000000000 000111111111 No 35 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=98.53 E-value=3.5e-07 Score=55.95 Aligned_cols=440 Identities=13% Similarity=0.098 Sum_probs=226.9 Q ss_pred hhhhhhHHHhhhccCCCcccCCCC----------CCCceeecccccccccccc--ccccc-ccccchhHHHHHHHHHHH- Q lcl|NC_018087. 16 HKVDDTEYDKIINDKAESITAPKF----------DDGATEVDSQDIAYNGVFQ--KLYGS-QDPTATSTRELINTYRSL- 81 (520) Q Consensus 16 ~~~~~~~~~~~~~~~~~s~~~p~~----------~dg~~~i~~~~~a~~g~~~--~~~~~-~~~~~~~~~~LI~~YR~m- 81 (520) +++... +-.+-++.++++.|.. =+|+..-. ..+++. ....+ ...-..+-..|..+=|.| T Consensus 1 ~~r~~~--~~~~~dr~i~~~~~~~~~~~~~~~~~y~aa~~~r-----~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~ 73 (505) T protein:vir:96 1 MKRAEK--KPSLAQRMVNWAWYRYVEPQKNAARAFEAARRDR-----LGKAWLRRASRLSADEEIYADLASLVQRAREQS 73 (505) T ss_pred CCCCcc--ccchhhcccchhhhhhHHHHHHhhhhcccccCCC-----ccccccCCCCCCChHHHHHHHHHHHHHHHHHHH Confidence 111110 0011122222222211 11111100 111221 01111 111223566788888999 Q ss_pred hhccchhHHHHhhhceeeEecCCCcEEE-EeeccchhhhHHHHHHHHHHHHHHH--------HhcchhhhHHHHHhhccc Q lcl|NC_018087. 82 LNNYEVDNAVQEIVSDAIVYEEGFDVVS-IDLDQTAFTENIRNLISDEFNSVLN--------MLNFQRKGSDHFKRWYVD 152 (520) Q Consensus 82 a~~pEvd~Ai~eIvneaiv~d~~~~~V~-l~Ld~~~~s~~ik~~I~eeF~~i~~--------ll~f~k~g~~~fRrWYvD 152 (520) .++|-+..||+.+++-+|=+ .+-.|.. +.-....+++.+.++|..+|..-++ .++|..--...+|.|.+| T Consensus 74 rNn~~a~~av~~~~~nvVG~-~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~d 152 (505) T protein:vir:96 74 INNPYAKRFYQLLKNNVIGP-KGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARD 152 (505) T ss_pred hcChHHHHHHHHHHHHhcCC-CcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhC Confidence 59999999999999887632 2222221 2222345667788899999988653 344544456689999999 Q ss_pred cceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccce--------ecceeec--Ccccccccccceec Q lcl|NC_018087. 153 SRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGY--------REYFLYD--TELESYQCGHQHFA 222 (520) Q Consensus 153 gri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~--------~ey~~y~--~~~~~~~~~~~~~~ 222 (520) |-.|......+.. .-| ..|..|||..|.--..- ...++..+..|+ .-|+++. |.......+. . T Consensus 153 GE~f~~~~~~~~~-~~~-~~lqliepd~l~~~~n~--~~~~~~~i~~GIe~d~~Gr~~aY~i~~~hPgd~~~~~~~---~ 225 (505) T protein:vir:96 153 GEVLVREHRGYPN-KWG-YALQILECDRLDLNYNA--DLQNGNRIRMSIELDAWERPVAYHLLVNHPGDNSYCYHY---A 225 (505) T ss_pred CceEEEEeecCCC-Ccc-eEEEEechhhcCCCCCc--ccCCcCeEEeceEECCCCceEEEEEeecCCCcccccccc---c Confidence 9999877765321 223 35888999777542111 112333333333 4677774 3322221111 1 Q ss_pred CCcceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHH Q lcl|NC_018087. 223 AGTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQH 302 (520) Q Consensus 223 ~~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~ 302 (520) .....+||.+.|+|+..-. .+.---.+|.|+.+++.+.+|.-.+||..+-...-|-.=-+..=|.+.+.... T Consensus 226 ~~~~~rvpa~~vlH~f~~~-r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~------- 297 (505) T protein:vir:96 226 GQTYERVPADEIIHTFVPW-RPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYDQPP------- 297 (505) T ss_pred cccccccCHhHhhhhhccc-CCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCCCcc------- Confidence 2334689999998886432 23333457999999999999999999999988887765444433444332110 Q ss_pred HHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCC---------CcChHHHHH-HHHHHHHHhc Q lcl|NC_018087. 303 IMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMT---------GMNEMDDIL-YFRKALYMAL 372 (520) Q Consensus 303 im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~---------nLgei~DV~-YF~kkLy~aL 372 (520) .| ..|+ ...+ -+.|| |.+|+.|+ .-+..++.. -..+.+=.+| T Consensus 298 ----------~~-~~~~-----~~~~-----------l~pG~-i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iaagl 349 (505) T protein:vir:96 298 ----------ED-DQGE-----IVEE-----------VEAGT-YQLLPYGIRFKEHKIDHPHTNFGAFVKSSLRGVAAGM 349 (505) T ss_pred ----------cc-ccCc-----cccc-----------cCCce-eeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhc Confidence 00 0111 0000 11233 55555554 334444433 3334456789 Q ss_pred CCChhhccCCCc-cccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHhcCCCChhhHHhhh-hceEEEe Q lcl|NC_018087. 373 RVPLSRIPDEQT-QNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLK----SNLLLKRVITEDEWEAEL-NNIKIVF 446 (520) Q Consensus 373 ~VP~SRl~~~~~-~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk----~QLiLkgi~t~eew~~~~-~~I~~~f 446 (520) +||-+-|..+-+ .|. +.+.-.-+.|-+.+.++|..|..-|..++- ...+|.|.++.-.+.... -...+.. T Consensus 350 gi~ye~lt~D~s~~nY----SS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~ 425 (505) T protein:vir:96 350 GPAYNRLAHDLEGVNF----SSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQP 425 (505) T ss_pred CCCHHHHhcccccccH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeecc Confidence 999888875532 221 233334566999999999999875655432 256677877643333110 1233333 Q ss_pred eccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHH-HHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 447 HKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAE-RKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 447 ~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~-~kqi~~E~~~~~~~~p~~e~~ 520 (520) -.--+.-.+||+.-...+++. -.-|.+-+..+ .+..-+|.-++ +...+...+.++-..+.+... T Consensus 426 p~~~~iDP~Ke~~a~~~~i~~---------G~~t~~~~~a~-~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~ 490 (505) T protein:vir:96 426 RGWDWVDPAKDSKAHSESIKN---------RTRSRSSIIRA-AGDDPEDVFDEIAWEEQLMRDKGVNPTPPEQES 490 (505) T ss_pred CCccccChHHHHHHHHHHHHc---------CCCCHHHHHHH-cCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCCC Confidence 344455677777766555542 12244444444 34433332222 222222223444222222222 No 36 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=98.50 E-value=4.4e-07 Score=55.44 Aligned_cols=453 Identities=10% Similarity=0.038 Sum_probs=227.6 Q ss_pred hccCCCcccCCCCCCCceeec---------ccccccccccccccccccc-cchhHHHHHHHHHHH-hhccchhHHHHhhh Q lcl|NC_018087. 27 INDKAESITAPKFDDGATEVD---------SQDIAYNGVFQKLYGSQDP-TATSTRELINTYRSL-LNNYEVDNAVQEIV 95 (520) Q Consensus 27 ~~~~~~s~~~p~~~dg~~~i~---------~~~~a~~g~~~~~~~~~~~-~~~~~~~LI~~YR~m-a~~pEvd~Ai~eIv 95 (520) ++. +.+++....+++.... .+.....++.+....+-+. ...+...|..+=|.| .++|-+..||+-++ T Consensus 1 ~~~--p~~~~~~~~~~~~~~~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 78 (533) T protein:vir:34 1 MKT--PTIPTLLGPDGMTSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQ 78 (533) T ss_pred CCC--chhhhhhcccccchHHHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 111 1111112222221110 0000001122211222222 223556677788888 59999999999999 Q ss_pred ceeeEecCCCcEEE------EeeccchhhhHHHHHHHHHHHHHH----------HHhcchhhhHHHHHhhccccceeEEE Q lcl|NC_018087. 96 SDAIVYEEGFDVVS------IDLDQTAFTENIRNLISDEFNSVL----------NMLNFQRKGSDHFKRWYVDSRVFFHK 159 (520) Q Consensus 96 neaiv~d~~~~~V~------l~Ld~~~~s~~ik~~I~eeF~~i~----------~ll~f~k~g~~~fRrWYvDgri~~hk 159 (520) +-+|=. +..|-. |.++. +.++.+.++|..+|..-+ ..++|..--...+|.|.+||-.|..+ T Consensus 79 ~nvVG~--Gi~~~~~p~~~~lg~~~-~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~ 155 (533) T protein:vir:34 79 DHIVGS--FFRLSHRPSWRYLGIGE-EEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQA 155 (533) T ss_pred HHhhCC--CceeeeccchhhcCCCh-hHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEe Confidence 887642 433321 33333 346777788888887654 45556555567899999999999999 Q ss_pred eeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccce--------ecceeecCccccccc-ccceecCCcceecC Q lcl|NC_018087. 160 IINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGY--------REYFLYDTELESYQC-GHQHFAAGTKIKIP 230 (520) Q Consensus 160 vid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~--------~ey~~y~~~~~~~~~-~~~~~~~~~~~~I~ 230 (520) .+++.....=-..|..|||..|.--+. ..++..+..|+ .-|+++.....+... ...+. ...+.+| T Consensus 156 ~~~~~~g~~~~~~lq~ie~d~l~~~~~----~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~--~~~~~v~ 229 (533) T protein:vir:34 156 TWDTSSSRLFRTQFRMVSPKRISNPNN----TGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWI--PRELPGG 229 (533) T ss_pred eeccCCCCccceEEEEechhhcCCCCC----CCCCCceEeeeEECCCCCeEEEEEeecCCCCcccccccee--eeeeccC Confidence 988542211135688899977764221 23444444444 357777432111111 10110 1124577 Q ss_pred cccEEEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcce Q lcl|NC_018087. 231 YSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNR 310 (520) Q Consensus 231 ~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knk 310 (520) .+.|+|..... .+.-.-.+|.|..++..+.+|.-.+||...-...-|-.=-++.=+.+......+. ...-....... T Consensus 230 a~~VlH~f~~~-r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~--~~~~~~~~~~~ 306 (533) T protein:vir:34 230 RASFIHVFEPV-EDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFI--LGANSQEQRER 306 (533) T ss_pred hhHeeeecccc-CCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCcccccccc--cCCCccccccc Confidence 88898887553 3443445799999999999999999999998888887644443333321111000 00000000000 Q ss_pred eEeecCCCccccccccchhhhhhcccc---cCCC------CCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhcc Q lcl|NC_018087. 311 ISYDARTGKVKNQANMMALTEDYWLQR---RDGK------AVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIP 380 (520) Q Consensus 311 lvYd~~TGev~d~~~~msmlEDywLpR---ReGg------rgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~ 380 (520) + ....+...++.-.+ =++| -|.+|+.+..+..-+. .+=++-..+.+=.+|+||-+-|. T Consensus 307 ~------------~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt 374 (533) T protein:vir:34 307 L------------TGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLS 374 (533) T ss_pred c------------cccchhhhhccCcceeeccCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHh Confidence 0 00011111111000 0111 1233444433333333 33344455666678999998887 Q ss_pred CCCc-cccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHhcCCCCh------hhHHhhhh--ceEEEee Q lcl|NC_018087. 381 DEQT-QNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKS----NLLLKRVITE------DEWEAELN--NIKIVFH 447 (520) Q Consensus 381 ~~~~-~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~----QLiLkgi~t~------eew~~~~~--~I~~~f~ 447 (520) .+-+ .|- +.+.-.-+.|-+.+.++|..|..=|..++-. ..+|.|.++. +-|..-.. ...+..- T Consensus 375 ~D~s~~nY----SS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p 450 (533) T protein:vir:34 375 RNYAQMSY----STARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGS 450 (533) T ss_pred hhcccccH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccC Confidence 6632 221 2344446679999999999888655544333 4467887752 22332222 2334344 Q ss_pred ccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHH-HHHHhhhcCCccCCccccC Q lcl|NC_018087. 448 KNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERK-LIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 448 ~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~k-qi~~E~~~~~~~~p~~e~~ 520 (520) .--+.--+||+.-...+++. -.-|.+-+..+ .+..-+|..++.+ ..+...+.++-. |...-. T Consensus 451 ~~~~iDP~Ke~~a~~~~i~~---------G~~s~~~~~a~-~G~D~~ev~~q~a~e~~~~~~~gl~~-~~~~~~ 513 (533) T protein:vir:34 451 GRMAIDGLKEVQEAVMLIEA---------GLSTYEKECAK-RGDDYQEIFAQQVRETMERRAAGLKP-PAWAAA 513 (533) T ss_pred CccccChHHHHHHHHHHHHc---------CCCCHHHHHHH-cCCCHHHHHHHHHHHHHHHHhcCCCC-CCCCCc Confidence 44555667777666555442 22355555444 3443333322222 222222223311 111111 No 37 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=98.49 E-value=4.6e-07 Score=55.34 Aligned_cols=463 Identities=8% Similarity=-0.013 Sum_probs=233.8 Q ss_pred cccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCC-CCceeeccccccccccccccccccccc-chhHHHHHHHHHH Q lcl|NC_018087. 3 MLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFD-DGATEVDSQDIAYNGVFQKLYGSQDPT-ATSTRELINTYRS 80 (520) Q Consensus 3 ~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~-dg~~~i~~~~~a~~g~~~~~~~~~~~~-~~~~~~LI~~YR~ 80 (520) |+ ++.-...+..... .++.+... -++...++......+++.....+.+.. ..+...|..+=|. T Consensus 1 m~-~~~~r~~~~~a~~--------------~~~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRd 65 (553) T protein:vir:63 1 MT-KVTVRKLSEVTSG--------------RPEQSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRD 65 (553) T ss_pred Cc-chhhhhhcccccc--------------cchhhhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHH Confidence 21 1111111111100 00000000 011111111111112222222222222 2355667888888 Q ss_pred H-hhccchhHHHHhhhceeeEecCCCcEEE-Ee-----eccchhhhHHHHHHHHHHHHHH----------HHhcchhhhH Q lcl|NC_018087. 81 L-LNNYEVDNAVQEIVSDAIVYEEGFDVVS-ID-----LDQTAFTENIRNLISDEFNSVL----------NMLNFQRKGS 143 (520) Q Consensus 81 m-a~~pEvd~Ai~eIvneaiv~d~~~~~V~-l~-----Ld~~~~s~~ik~~I~eeF~~i~----------~ll~f~k~g~ 143 (520) | .++|-+..||+.+++-+|= . +..|-. .+ -.+-++.+.+.++|..+|..-+ ..|+|..--. T Consensus 66 L~rNn~~a~~av~~~~~nvVG-~-Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~ 143 (553) T protein:vir:63 66 MADNDGFTNGAVGYQRDSIVG-A-QYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIR 143 (553) T ss_pred HHhcChHHHHHHHHHHHhhcc-C-CceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHH Confidence 8 8899999999999988764 2 433321 11 1123567888888988887643 5566766667 Q ss_pred HHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccce--------ecceeec--Ccccc Q lcl|NC_018087. 144 DHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGY--------REYFLYD--TELES 213 (520) Q Consensus 144 ~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~--------~ey~~y~--~~~~~ 213 (520) ..+|.|.+||-.|......+.....--..|..|||..|.--.. ..++..+..|+ .-|++++ |.... T Consensus 144 l~~r~~~~dGE~~~~~~~~~~~~~~~~~~lq~ie~drl~~~~~----~~~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~ 219 (553) T protein:vir:63 144 LGVVGYVKTGEVLATAEWDRAANRPYATCFQMVSTDRLSNPYQ----QLDTPTLRRGVQYDKRGRPQGYWIQVAHPGDLY 219 (553) T ss_pred HHHHHHHhCCceEEEeeeccCCCCcccceEEEechhhcCCCCC----CCCCCeeEeeeEECCCCceEEEEeeccCCCccc Confidence 7899999999999998887532211124678889977765322 23444555554 3678875 33221 Q ss_pred ccccc-ceec-CCcceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCC Q lcl|NC_018087. 214 YQCGH-QHFA-AGTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNM 291 (520) Q Consensus 214 ~~~~~-~~~~-~~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnl 291 (520) +..+. .... .....++|.+.|+|+...+ .+.---.+|.|+.+++.+.+|.-.+||...-...-|-. +.+|-.+ . T Consensus 220 ~~~~~~~~~~r~~~~~~v~a~~vlH~f~~~-r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~--a~fi~~~-~ 295 (553) T protein:vir:63 220 QMAPDMYKWKFVQQSKPWGRRQVIHILEPR-EPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASY--AAAIESE-L 295 (553) T ss_pred cccccccceeeeccccccChhHheeccccc-CCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhh--eeeeecC-C Confidence 11111 1100 1112478999999887553 34333457999999999999999999999998888866 3333322 2 Q ss_pred chHHHHHHHHHHH-----------------HhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCC Q lcl|NC_018087. 292 PARKAAQHMQHIM-----------------NSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTG 354 (520) Q Consensus 292 pk~KAeqyl~~im-----------------~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~n 354 (520) |...+.+.+..-- ..+..+.+..-.-|.|.. -.-|.+|+.+..... T Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~-----------------L~pGe~i~~~~p~~p 358 (553) T protein:vir:63 296 PPEFIHSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPH-----------------LFPGTKLNLKPMGTP 358 (553) T ss_pred ChhhhhhhcccccccccccccccccccccccccccccceeecCceeee-----------------cCCCCeeeecCCCCC Confidence 3333322222110 000111111111111100 011334444444333 Q ss_pred cCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHhcCC Q lcl|NC_018087. 355 MNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLK----SNLLLKRV 429 (520) Q Consensus 355 Lge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk----~QLiLkgi 429 (520) -+. .+=++...+.+=.+|+||-+-|..+-+... =+.+.-.-+.|-+.+.++|..|..-|..++- ...+|.|. T Consensus 359 ~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~n---YSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~ 435 (553) T protein:vir:63 359 GGVGSEFEASLNRHLASAFGMSYEEFTRDFSKAN---YSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGE 435 (553) T ss_pred CCCHHHHHHHHHHHHHhhcCCCHHHHhhhccccc---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC Confidence 333 344455566666789999988876632211 1234444667999999999998877777643 35577776 Q ss_pred CCh-hhHHhh--------h--hceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHH Q lcl|NC_018087. 430 ITE-DEWEAE--------L--NNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAA 498 (520) Q Consensus 430 ~t~-eew~~~--------~--~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~ 498 (520) ++. .-|... . -...+..-.-.+.--+||+.-...+++. -+-|.+-+..+ ++..-+|..+ T Consensus 436 i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~---------G~~t~~~~~a~-~G~D~~~v~~ 505 (553) T protein:vir:63 436 VPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVMRIDA---------GLSTYEREIAR-LGGDFRKSFA 505 (553) T ss_pred ccCCCcccchhhcchhhhhhhhceeeecCCccccChHHHHHHHHHHHHc---------CCCCHHHHHHH-hCCCHHHHHH Confidence 642 211110 0 1133444444555667777655555442 12244444444 2433332222 Q ss_pred HH-HHHHHhhhcCCccCCccccC Q lcl|NC_018087. 499 ER-KLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 499 ~~-kqi~~E~~~~~~~~p~~e~~ 520 (520) +. +..+.-.+.|+..+..+... T Consensus 506 q~a~e~~~~~~~Gl~~~~~~~~~ 528 (553) T protein:vir:63 506 QRAREDALLKKYGLTFNLSAKRS 528 (553) T ss_pred HHHHHHHHHHHcCCCCCCCCccc Confidence 21 12222222343222111111 No 38 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=98.45 E-value=6.1e-07 Score=54.65 Aligned_cols=398 Identities=14% Similarity=0.137 Sum_probs=180.4 Q ss_pred hcchhhhhhhHHHhhhccCCCcccCCCCCCCceee-cccccccccccccccccccccchhHHHHHHHHHHHhhccchhHH Q lcl|NC_018087. 12 FAFWHKVDDTEYDKIINDKAESITAPKFDDGATEV-DSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNA 90 (520) Q Consensus 12 f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i-~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~A 90 (520) +|.+..++.+. .. ..||-... ...+ +...+..+.+... +-.+|...| +.|+.+..| T Consensus 1 ~~~~m~~~~~~----------~~----~~D~~~~~~~~~~---g~~~~~~~~~~~~---~~~~l~~~Y---~~~~l~~~~ 57 (435) T protein:vir:79 1 MGVFMSDKVKA----------IT----KEDGYNEIFGSKD---GTFRPNAFYMQRA---AFKALSQFY---EEDGMARRI 57 (435) T ss_pred CCccccccccc----------ch----hhcchhhhhcccc---cccccCcccCCcC---CHHHHHHHH---hcCchhhhh Confidence 34444333211 00 11222110 0000 0111112212111 223555555 468999999 Q ss_pred HHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccc--eeEEEeeec----- Q lcl|NC_018087. 91 VQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSR--VFFHKIINP----- 163 (520) Q Consensus 91 i~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgr--i~~hkvid~----- 163 (520) |+.++++|+-.-- .|.-+ .-++++..+++ -|++..+..+.+|.=-+.|. ++++.. |. T Consensus 58 Vd~~aed~~r~g~-----~i~g~------~~~~~~~~~~~----~l~~~~~l~~a~~~~rl~G~~~i~i~~~-d~~~~~~ 121 (435) T protein:vir:79 58 VDVIPEEMVTPGF-----KVDGV------KNEKSFKSRWD----ELRLNAKIIDALSWSRLFGGSAILAVVA-DNKMLKS 121 (435) T ss_pred hccchHHhhcCCc-----eecCC------ChHHHHHHHHH----HhhHHHHHHHHHHhhhccccEEEEEEec-CCCCccc Confidence 9999999987542 22211 11234443333 44556677776554344454 444322 32 Q ss_pred -CCCCCCeeeeEecCccceeeeeeccCCCCcccc-cccceecceeecCcccccccccceecCCcceecCcccEEEeeccc Q lcl|NC_018087. 164 -NRPKDGIIELRRLDPRNVQFVRELDTKMENGVK-VVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGL 241 (520) Q Consensus 164 -~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~-~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL 241 (520) -+++..|+.++.+||.++.+. .+. .|... .+.....|.| .+. +...+.+||+|=++..+..- T Consensus 122 Pl~~~g~i~~i~v~d~~~i~~~-~~~---~dp~sp~fg~P~~y~v-~~~-----------~~~~~~~iH~SRli~~~g~~ 185 (435) T protein:vir:79 122 PVKPGAQLEDIRVYDRYQITIH-ERE---TNARSVRYGEPKLYKI-SPG-----------GDIPEFFVHYSRICIIDGER 185 (435) T ss_pred ccccCCceeeEEeechhhccch-hhc---cCCcccccCcceEEEE-ecC-----------CCCCceEEcceeEEEecCCc Confidence 245667888999999777652 111 11111 1111111211 111 12345789999877653221 Q ss_pred c-----cCCCCcchhhh-HHHHHHHHHHHHHHH--HHHHHHHhcCccceEEEcc-CCCC-----chHHHHHHHHHHHHhh Q lcl|NC_018087. 242 V-----DCCGKNIIGYL-HRAVKPANQLKLLED--AMMIYRITRAPDRRVFYID-TGNM-----PARKAAQHMQHIMNSH 307 (520) Q Consensus 242 ~-----d~~~~~~~syL-~~aik~~NqL~m~ED--alVIyRi~RApeRRvFyID-vGnl-----pk~KAeqyl~~im~~~ 307 (520) + ...++...|-| +++...+.+...... +.++++- -. +|+.++ +.++ ....+..-+ ..++++ T Consensus 186 ~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~---~~-~v~~~~~l~~~~~~~~~~~~~~~r~-~~~~~~ 260 (435) T protein:vir:79 186 VSNEKRRQNDGWGASILNKRLIEAIVDYNYCQELATQLLRRK---QQ-AVWKARDLALMCDDEEGRYAARLRL-AQVDDE 260 (435) T ss_pred chhhhccccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHh---cC-ccccchhHHHhhcCccchHHHHHHH-HHHHHh Confidence 1 11223334444 455443333322222 2344443 12 234442 2221 111111111 112333 Q ss_pred cceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHH-HHHHHHHHHhcCCChhhccCCCccc Q lcl|NC_018087. 308 RNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDI-LYFRKALYMALRVPLSRIPDEQTQN 386 (520) Q Consensus 308 knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV-~YF~kkLy~aL~VP~SRl~~~~~~~ 386 (520) |+. +|. |-+ | | ..-+++++. .+|+-++|+ .+|...+=.+.+||+.||-.++... T Consensus 261 ~~~------~~~-------~~i--~-------~-~~e~~e~~~--~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~g 315 (435) T protein:vir:79 261 SGV------GKA-------IGI--D-------A-TDEEYEVLN--SDVSGVPEFLQEKIDRIVALTGIHEIIIKNKNTGG 315 (435) T ss_pred cCC------CCc-------eeE--e-------c-CCcceEEEe--cccCCHHHHHHHHHHHHHhhhCCCeeeeccCCccc Confidence 321 111 101 0 1 001233332 245555664 7899999999999999997666544 Q ss_pred cccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHH Q lcl|NC_018087. 387 VFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVN 466 (520) Q Consensus 387 ~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~ 466 (520) +.+-+.+ |.-.|..+|+++|.. .+..+|++-+-| ++-. +.+.|.|..=..=+|...+|+...+.+ T Consensus 316 lnstgd~---d~~~yyd~i~~~Qe~---~l~p~l~~l~~l--i~~s-------~d~~~~f~pL~~~sekEkAei~~~~a~ 380 (435) T protein:vir:79 316 VSASQNT---ALETFYKLIDRKRVE---DYKPILEFLLPF--MISE-------TEWSIEFEPLSVPSDKDKAEIMAKNVE 380 (435) T ss_pred cccchhH---HHHHHHHHHHHHHHH---HHHHHHHHHHHH--hhcC-------CCCeEEeCCCCCCCHHHHHHHHHHHHH Confidence 4222222 445599999999853 333334431111 1211 346788887777788888999999999 Q ss_pred HHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 467 VLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 467 ~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) +++.+-.- -.++.+-+++.+-.+. .+. .+..+.... + |+++++ T Consensus 381 a~~~~~~~--g~i~~~e~r~~L~~~~-~~~-----~~~~~~~~~-~--~~~~d~ 423 (435) T protein:vir:79 381 SVVKLKAE--QAINLKETRDTLRSIC-PDL-----KIMDNDNIE-L--PEPEDL 423 (435) T ss_pred HHHHHHhc--CCCCHHHHHHHHHHhc-ccc-----CCCCccccc-C--CccccC Confidence 88887332 1456666665431110 000 000011111 0 111111 No 39 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=98.43 E-value=6.9e-07 Score=54.37 Aligned_cols=382 Identities=10% Similarity=0.072 Sum_probs=173.1 Q ss_pred HHHHhh-ccchhHHHHhhhceeeEecCCCcEEEEeeccc-hhhhHHHHHHHHHHHHHHHHhc-------------chhhh Q lcl|NC_018087. 78 YRSLLN-NYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQT-AFTENIRNLISDEFNSVLNMLN-------------FQRKG 142 (520) Q Consensus 78 YR~ma~-~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~-~~s~~ik~~I~eeF~~i~~ll~-------------f~k~g 142 (520) -|+|+. +|-|..||+.|.+.+.- -|+.|....- .-.. +.....+.+...|. +.... T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~-----~p~~i~~~~~~~~~~----~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~ 71 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAG-----FGINIIPHPEAEDPD----RDGEQYERVWDFWFGDDSNWQVGPMESERATA 71 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhc-----CCeEEEEccCccccc----chhhhhhhHHHHhhccCCCccccchhhHhhHH Confidence 677755 69999999999998852 2333322211 1111 11222222222111 12233 Q ss_pred HHHHHhh----ccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceee-cC------c- Q lcl|NC_018087. 143 SDHFKRW----YVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLY-DT------E- 210 (520) Q Consensus 143 ~~~fRrW----YvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y-~~------~- 210 (520) .++++.+ ++.|.-|..++-| ....+++|.+|||.+|+..+....- +....+..-||.+ +. . T Consensus 72 ~~~~~~~~~~l~l~Gn~~i~~~r~---~~G~~~~l~~l~~~~v~~~~d~~~~----~~~~~~~~~~~~~~~~~~~~~~~~ 144 (467) T protein:vir:31 72 TNVLQTAWTDYEAIGWLTIEILTQ---TDGTPTGLAYVPGHTIRKRMDERGF----VQLLEEKEKYFGVAGDRYQTNGNG 144 (467) T ss_pred HHHHHHHHHHHHhcCCeEEEEEEC---CCCcEEEEEEeCCceeEeeeeccee----EeecCCceeeEEeccccceeeccc Confidence 4444443 4569999998855 3335899999999999876443211 1111111112221 11 0 Q ss_pred -ccccccccceecCCcceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcC-ccceEEEccC Q lcl|NC_018087. 211 -LESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRA-PDRRVFYIDT 288 (520) Q Consensus 211 -~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RA-peRRvFyIDv 288 (520) .............+..+.+|.+.|+|.. ..-..++...+|-+..|.+.+..-..++....=+ ..++ --+-+..+.- T Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~diih~r-~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~-f~ng~~p~gil~~~~ 222 (467) T protein:vir:31 145 DLDPVFVDADDGSTGTSVSNPANELIFKR-NHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDF-FENDGVPRIAIIVKG 222 (467) T ss_pred ceeeeeeeeccccccceeEeccccEEEec-CCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHH-HhccCCCceEEEecC Confidence 1111111222334556899999999884 2223455567888999888876666555443221 1122 2234455554 Q ss_pred CCCchHHHHHHHHHHHHh-hcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCC------------- Q lcl|NC_018087. 289 GNMPARKAAQHMQHIMNS-HRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTG------------- 354 (520) Q Consensus 289 Gnlpk~KAeqyl~~im~~-~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~n------------- 354 (520) +.+.+ ++.+-+++.+++ |++.... .++.--++..|-.+..|+||.. T Consensus 223 ~~l~~-e~~~~~~~~~~~~~~~~~~~-------------------~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~ 282 (467) T protein:vir:31 223 AELTE-KGREEMRNLIEDNNEDNHRT-------------------AFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTV 282 (467) T ss_pred cCCCH-HHHHHHHHHHHhhhcchhhh-------------------hhhhhcccccccccccccCCCcccccceeEEeccc Confidence 55544 444455544433 4432110 0111111112222333333321 Q ss_pred ----cChHHHH-HHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_018087. 355 ----MNEMDDI-LYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKF-ISELQHKFEEIFLSPLKSNLLLKR 428 (520) Q Consensus 355 ----Lgei~DV-~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KF-I~rLr~rFs~if~d~Lk~QLiLkg 428 (520) -.|+-+. ++..+..-++.+||.+-|....+.+. + +.+...-..|.++ +.-+..++...|...| T Consensus 283 ~~~~d~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~-~--s~~e~~~~~f~~~~l~P~~~~ie~~ln~~l-------- 351 (467) T protein:vir:31 283 GIDEEASFLEFRGRNEHDILKVHDVPPVIAGVVESGAF-S--TDAEEQRKEFAEETIQPKQHDFGELLYELV-------- 351 (467) T ss_pred cChhhHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCc-c--cCHHHHHHHHHHHHHHHHHHHHHHHHHHhh-------- Confidence 2232233 34555699999999988854322221 2 2333333445444 4555555555444322 Q ss_pred CCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCH---HHHHHHHH---H Q lcl|NC_018087. 429 VITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSD---EDIAAERK---L 502 (520) Q Consensus 429 i~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tD---eeI~~~~k---q 502 (520) ++.. .......|+|++..--.-.. +.|++.+..+- -.-++|..-+++. +++.+ ++...... . T Consensus 352 -~~~~-~~~~~~~i~f~~~~l~~~d~-------~~~~~~~~~~~--~~G~~T~NE~R~~-~Gl~pi~d~~~~~~~~~~~~ 419 (467) T protein:vir:31 352 -HKQG-LDAPDWTIEFELAKPDTKLQ-------DVEIASQRVQA--MQGLLTVNELRDE-FGFEPFPEEHVYGGETLVAE 419 (467) T ss_pred -cchh-hccCCceEEEecchhhccCH-------HHHHHHHHHHH--hCCCcCHHHHHHH-hCCCCCCcccccCCcccccc Confidence 2211 11112346666553322233 34444444431 1225677777744 66642 11100000 0 Q ss_pred HH-------HhhhcC-CccCCccccC Q lcl|NC_018087. 503 ID-------EELSDK-IFNPPEPEEI 520 (520) Q Consensus 503 i~-------~E~~~~-~~~~p~~e~~ 520 (520) .. ....++ -.++.+++++ T Consensus 420 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 445 (467) T protein:vir:31 420 VTGGSGPGGGIGDQIEQLVEDRADEI 445 (467) T ss_pred cccccCCCCcccCcCCCCCCCcccch Confidence 00 000000 0001112222 No 40 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=98.42 E-value=7.5e-07 Score=54.16 Aligned_cols=450 Identities=9% Similarity=0.012 Sum_probs=229.8 Q ss_pred cCCCcccCCCCCCCc------eeeccccccccccccccccccccc-chhHHHHHHHHHHH-hhccchhHHHHhhhceeeE Q lcl|NC_018087. 29 DKAESITAPKFDDGA------TEVDSQDIAYNGVFQKLYGSQDPT-ATSTRELINTYRSL-LNNYEVDNAVQEIVSDAIV 100 (520) Q Consensus 29 ~~~~s~~~p~~~dg~------~~i~~~~~a~~g~~~~~~~~~~~~-~~~~~~LI~~YR~m-a~~pEvd~Ai~eIvneaiv 100 (520) -+......|+...-. ...+.+.....+++.....+-+.. ..+...|..+=|.| .++|-+..||+.+++-+|= T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG 80 (530) T protein:vir:38 1 MKIPSLVGPDGKTSLREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQDHIVG 80 (530) T ss_pred CccceeecCccccchHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhhC Confidence 122222222211110 000100001112222222222222 33566788888888 5999999999999988764 Q ss_pred ecCCCcEEE------EeeccchhhhHHHHHHHHHHHHHHH----------HhcchhhhHHHHHhhccccceeEEEeeecC Q lcl|NC_018087. 101 YEEGFDVVS------IDLDQTAFTENIRNLISDEFNSVLN----------MLNFQRKGSDHFKRWYVDSRVFFHKIINPN 164 (520) Q Consensus 101 ~d~~~~~V~------l~Ld~~~~s~~ik~~I~eeF~~i~~----------ll~f~k~g~~~fRrWYvDgri~~hkvid~~ 164 (520) . +..|-. |.++. +.++.+.++|..+|..-++ .++|..--...+|.|.+||-.|.-+..++. T Consensus 81 ~--Gi~~~~~p~~~~l~~~~-~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~~ 157 (530) T protein:vir:38 81 S--FFRLSYRPSWRYLGINE-EDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQATWDSD 157 (530) T ss_pred C--CceeeeccchhhcCCCH-hHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEeeeccC Confidence 3 443332 22222 3467788899999987553 455655566689999999999999988853 Q ss_pred CCCCCeeeeEecCccceeeeeeccCCCCcccccccce--------ecceeecCcccccccccceecCCcceecCcccEEE Q lcl|NC_018087. 165 RPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGY--------REYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVY 236 (520) Q Consensus 165 ~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~--------~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y 236 (520) ....=-..|..|+|..|.--.. ..++..+..|+ .-|+++.....+.........+ ..+.+|.+.|+| T Consensus 158 ~g~~~~~~lq~ie~d~l~~~~~----~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~-~~~~v~a~~vlH 232 (530) T protein:vir:38 158 STRLFRTQFKMVSPKRVSNPNN----IGDTRNCRAGVKINDSGAALGYYVSDDGYPGWMAQNWTYIP-RELPGGRPSFIH 232 (530) T ss_pred CCCccceEEEEechhhcCCCCC----CCCCCeeEeeeEECCCCceEEEEEeeccCCCccccccceee-eeeccChhHeEe Confidence 2111125688899977654322 23444444444 4677774322122111111111 125677778988 Q ss_pred eecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHH-------------HHHHHHH Q lcl|NC_018087. 237 AHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKA-------------AQHMQHI 303 (520) Q Consensus 237 ~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KA-------------eqyl~~i 303 (520) +..-. .+.---.+|.|..+++.+.+|.-.+||...-...-|-.=-++.=+.+......+ ..+.. - T Consensus 233 ~f~~~-r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 310 (530) T protein:vir:38 233 VFEPM-EDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGADNKEQQSKLTGWLG-E 310 (530) T ss_pred ecccc-CCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccccccccccCCcccccccccccch-h Confidence 86442 334344578999999999999999999998888777653333323322111100 00000 0 Q ss_pred HHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCC Q lcl|NC_018087. 304 MNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDE 382 (520) Q Consensus 304 m~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~ 382 (520) +..+.+.-+..-.-|.|. .-.-|-+|+.+..+.--+. -+=++...+.+=.+|+||-+-|..+ T Consensus 311 ~~~~~~~~~~~l~pG~i~-----------------~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D 373 (530) T protein:vir:38 311 MAAYYSAAPVRLGGARVP-----------------HLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRN 373 (530) T ss_pred hhhcccccceeccCceee-----------------ecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcc Confidence 111111111111222111 0012445555554433333 3444555566677899999888765 Q ss_pred Cc-cccccccchhhHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHhcCCCCh------hhHHhhhhce--EEEeec Q lcl|NC_018087. 383 QT-QNVFDMSTAISRDELSFDKFISELQHKFEEIF-----LSPLKSNLLLKRVITE------DEWEAELNNI--KIVFHK 448 (520) Q Consensus 383 ~~-~~~~G~~~eItRDElkF~KFI~rLr~rFs~if-----~d~Lk~QLiLkgi~t~------eew~~~~~~I--~~~f~~ 448 (520) -+ .|- +.+.-.-+.|-+.+.+.|..|..=| .--|+ ..+|.|.++. +.|....... .+..-. T Consensus 374 ~s~~nY----SS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~-~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~ 448 (530) T protein:vir:38 374 YSQMSY----STARASANESWAYFMGRRKFVASRQACQMFLCWLE-EAIVRRVVTLPSKARFSFQEARTAWGNANWIGSG 448 (530) T ss_pred cccccH----HHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHH-HHHHcCCccCCCCCCCCchhhHHhhhceeeecCC Confidence 32 221 2333345669999999999887644 33344 4577887763 3333322222 333334 Q ss_pred cchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHH-HHHHhhhcCCccCCccccC Q lcl|NC_018087. 449 NSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERK-LIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 449 Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~k-qi~~E~~~~~~~~p~~e~~ 520 (520) --+.-.+||+.-...+++. -.-|.+-+..+ .+..-+|.-++.+ ..+...+-++-.+-..... T Consensus 449 ~~~iDP~Ke~~a~~~~i~~---------G~~s~~~~~a~-~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~ 511 (530) T protein:vir:38 449 RMAIDGLKEVQEAVMLIEA---------GLSTYEKECAK-RGDDYQEIFAQQVRESMERRAAGLNPPAWAAAA 511 (530) T ss_pred ccccChHHHHHHHHHHHHc---------CCCCHHHHHHH-cCCCHHHHHHHHHHHHHHHHHcCCCCCCCcccc Confidence 4455667877766555542 22344544444 3443333222221 2222222333221111111 No 41 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=98.40 E-value=8.5e-07 Score=53.87 Aligned_cols=444 Identities=13% Similarity=0.100 Sum_probs=203.1 Q ss_pred cccchhhhcchhhh---hhhHHHh-------hhccCCCcc-cCCCCCCCceeeccccccccccc-ccccccccccchhHH Q lcl|NC_018087. 5 ADSDLKMFAFWHKV---DDTEYDK-------IINDKAESI-TAPKFDDGATEVDSQDIAYNGVF-QKLYGSQDPTATSTR 72 (520) Q Consensus 5 ~~~~l~~f~~~~~~---~~~~~~~-------~~~~~~~s~-~~p~~~dg~~~i~~~~~a~~g~~-~~~~~~~~~~~~~~~ 72 (520) .++-|.+|+-.... .+.-++. .++.+..++ +.-+..+|.+...+.+.- .+.- ++.| ..-+.+.+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~-~~~~~~~~~-~~r~~~~~~~ 78 (551) T protein:vir:80 1 MKNKLGLFESIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVI-GSMSANPGF-KTKPSIRNNQ 78 (551) T ss_pred CchhhhhHHHhhhccCChhhcccccccccceeeecccccHHHHHHhhccCcceeecccc-cceecCccc-ccCccccChh Confidence 34444444333310 0000000 000000000 011112222211111110 0011 1112 1233445666 Q ss_pred HHHHHHHHHhhccchhHHHHhhhceeeEecC------CCcEEEEeecc--chhhhHHHHHHHHHHHHHHHHhcch----- Q lcl|NC_018087. 73 ELINTYRSLLNNYEVDNAVQEIVSDAIVYEE------GFDVVSIDLDQ--TAFTENIRNLISDEFNSVLNMLNFQ----- 139 (520) Q Consensus 73 ~LI~~YR~ma~~pEvd~Ai~eIvneaiv~d~------~~~~V~l~Ld~--~~~s~~ik~~I~eeF~~i~~ll~f~----- 139 (520) +|-+..+.++.+|-|..||+.|+|.+..+-. +.-+-.+.+.+ .+.++.-+..+. .-..+++-.+.. T Consensus 79 ~l~~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~-~i~~~l~~pn~~~~p~~ 157 (551) T protein:vir:80 79 DLHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIK-RIESFIEKTGVDNDINR 157 (551) T ss_pred HHHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHH-HHHHHHHhcCCCCCCcc Confidence 7777777888899999999999997664321 22222333332 222332222221 122233333333 Q ss_pred hhhHHHHHhhc----cccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccc Q lcl|NC_018087. 140 RKGSDHFKRWY----VDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQ 215 (520) Q Consensus 140 k~g~~~fRrWY----vDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~ 215 (520) .+..++++.|. +-|.-|+.++-|. ++ -+.+|.+|||.+|+++......... ... +|++... T Consensus 158 ~s~~~f~~~lv~dlll~Gnay~~i~rd~-~G--~~~~L~~l~p~~V~v~~~~~g~~~~-----~~~--~y~~~~~----- 222 (551) T protein:vir:80 158 DSFSSFVKKIVRDTYMYDQVNFEKVFNR-NQ--SMVRFVAKDPTTIFFATTADGKIPD-----NGN--RFVQVID----- 222 (551) T ss_pred chHHHHHHHHHHHHHhcCCEEEEEEECC-CC--cEEEEEEeCCceeEEEECCcccccc-----Cce--EEEEEeC----- Confidence 24456666664 5599999988763 22 4999999999999986433321111 111 1222110 Q ss_pred cccceecCCcceecCcccEEEeecc-cccCC-CCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccC-CCCc Q lcl|NC_018087. 216 CGHQHFAAGTKIKIPYSAMVYAHSG-LVDCC-GKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDT-GNMP 292 (520) Q Consensus 216 ~~~~~~~~~~~~~I~~~aI~y~hSG-L~d~~-~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDv-Gnlp 292 (520) .+..+.++.+.|+|++-. +.++. +...+|-|+.|+..+.....++....=+----+--+-+..+.. ++|. T Consensus 223 -------g~~~~~~~~~eiiH~~~n~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~lt 295 (551) T protein:vir:80 223 -------QKIVATFNAREMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQS 295 (551) T ss_pred -------CcEEEEEcccceEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCCC Confidence 112357888889888532 22222 3446788999999999888888876544333355566666654 3455 Q ss_pred hHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHH---HHHHHHHHH Q lcl|NC_018087. 293 ARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDD---ILYFRKALY 369 (520) Q Consensus 293 k~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~D---V~YF~kkLy 369 (520) +..+++.-+.+...|.-- ...|. . .++. +.|.++..|.. +..++.- .+|..+..- T Consensus 296 ~e~~~~lk~~~~~~~~G~----~nag~------~-~vl~---------~~g~~~~~l~~--~~~D~qfle~~~~~~~~Ia 353 (551) T protein:vir:80 296 QHALEIFKREWKNSLSGI----NGSWQ------I-PVVS---------AEDVKFVNMTP--SARDMEFEKWLNYLINVIS 353 (551) T ss_pred HHHHHHHHHHHHHHhcCc----cccCc------c-cccc---------CCCceEEEccC--ChhHHHHHHHHHHHHHHHH Confidence 544444444443334310 01121 1 1221 11345666643 3333333 455778899 Q ss_pred HhcCCChhhccCCCcccccc-ccchhhHH---H--HHHHH-HHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhce Q lcl|NC_018087. 370 MALRVPLSRIPDEQTQNVFD-MSTAISRD---E--LSFDK-FISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNI 442 (520) Q Consensus 370 ~aL~VP~SRl~~~~~~~~~G-~~~eItRD---E--lkF~K-FI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I 442 (520) ++.+||...|...+....+| .++.+++. + ..|.. -+.-+..++...|...| ++.. ...+ T Consensus 354 ~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~L---------~~~~-----~~~~ 419 (551) T protein:vir:80 354 ALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHI---------VAEF-----GDKY 419 (551) T ss_pred HHhcCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhh---------cccc-----CCce Confidence 99999999986432211111 12222322 2 22433 35556666665555432 2221 1347 Q ss_pred EEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCH-HHH----------H-----HHHHHHHHh Q lcl|NC_018087. 443 KIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSD-EDI----------A-----AERKLIDEE 506 (520) Q Consensus 443 ~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tD-eeI----------~-----~~~kqi~~E 506 (520) .++|.....-.+ .+|.++...+.. -++|..-+++ .+.+.. .+- . ...++.+.+ T Consensus 420 ~f~f~~~~~~~~-------~~~~~~~~~~~~---g~lT~NE~R~-~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~ 488 (551) T protein:vir:80 420 TFQFVGGDIKSE-------LESVKILAEKAK---VAMTVNEVRK-ELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHE 488 (551) T ss_pred EEEeeccChhhH-------HHHHHHHHHHhc---CCcCHHHHHH-HhCCCCCCCCCceeecccccccccccccccCcchh Confidence 777775554333 233343333321 2468887774 467754 110 0 000011111 Q ss_pred hhcCC--------c--cCCccccC Q lcl|NC_018087. 507 LSDKI--------F--NPPEPEEI 520 (520) Q Consensus 507 ~~~~~--------~--~~p~~e~~ 520 (520) ..+.. - ..|++++- T Consensus 489 ~~~~~~~~~~~~~~~~~~~~~~~~ 512 (551) T protein:vir:80 489 KQQSNLQMLQEQTGNRVSTDVEDI 512 (551) T ss_pred hhhhccccccCcCCCCCCCCCCCC Confidence 11100 0 01111111 No 42 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=98.31 E-value=1.4e-06 Score=52.60 Aligned_cols=418 Identities=9% Similarity=0.024 Sum_probs=188.4 Q ss_pred hhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchhHH Q lcl|NC_018087. 11 MFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNA 90 (520) Q Consensus 11 ~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~A 90 (520) ||.|........--..+++...|.+.+.... +.+..+..+ -..|- +.+..+|-|.+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~pp~~-------~~~La---~~~~~n~~v~sc 57 (540) T protein:vir:41 1 MFNYHLSIKSLEKYRAIKGDTDSQALKEDRF-------------EEYVEPKVH-------PLVLL---SLLQVNPYHASA 57 (540) T ss_pred CCCcccChhhccchhhhhccccccccccCCC-------------CccccCCCC-------HHHHH---HHHHhcHHHHHH Confidence 6666555444332223334444444432211 111111111 11222 233567888999 Q ss_pred HHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHH----hhccccceeEEEeeecCCC Q lcl|NC_018087. 91 VQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFK----RWYVDSRVFFHKIINPNRP 166 (520) Q Consensus 91 i~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fR----rWYvDgri~~hkvid~~~~ 166 (520) |+-|.+.+.-+.- .+.-++..+.+ | +-+...++.++++ .+.+.|.-|+.++-|. + T Consensus 58 I~~ia~~ia~~~~-----~i~~~~~~~~~---------~-----lpN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~-~- 116 (540) T protein:vir:41 58 CSIKANDILRTGY-----LIDGDDGGVEE---------L-----LRACRPSFEFILLQALEDLQVFNYCTLEVVRDD-Q- 116 (540) T ss_pred HHHHHHHHhcCCc-----eEecCccchhh---------h-----ccCCCCCHHHHHHHHHHHHHhcCCeEEEEEECC-C- Confidence 9999888764332 22222221110 1 1244445555444 4677899999999763 2 Q ss_pred CCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccccCCC Q lcl|NC_018087. 167 KDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDCCG 246 (520) Q Consensus 167 k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~~~ 246 (520) ..+.+|.+|||..|+..+.-.. ......+...+|.-... +..............+|.+.|+|..-+ -..++ T Consensus 117 -G~~~~L~~i~~~~V~v~~~~~~----~~~~~d~~~~~~~~~~~---~~~~~~~~~g~~~~~~~~~eViHir~~-~~~~~ 187 (540) T protein:vir:41 117 -GEPVRLDYIPAHTVRVHRDGSR----YMQTWDGIHVTYFKDYR---YEGEVNPDNGEDQDGVGANEIIFIHLP-SPICS 187 (540) T ss_pred -CcEEEEEEeCCcceEEeEcCce----eEeeecCceeeeeeccc---ccceeeccccccceeecccceEEecCC-CCCCC Confidence 2499999999999987542110 01111111111111100 001111222233467888999887522 12345 Q ss_pred CcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCch----HHHHHHHHHHHHh-hcceeEeecCCCccc Q lcl|NC_018087. 247 KNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPA----RKAAQHMQHIMNS-HRNRISYDARTGKVK 321 (520) Q Consensus 247 ~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk----~KAeqyl~~im~~-~knklvYd~~TGev~ 321 (520) ...+|-|..|.+++.....+++...=+----|--.-|..++.+-.++ .++.+-+++.+.+ +.+.. .|-.. T Consensus 188 ~~G~Spi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~-----~g~~~ 262 (540) T protein:vir:41 188 YYGVPRYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNF-----KYLKE 262 (540) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHh-----ccccc Confidence 56688999999988888777776543332334455667776443332 2233333333322 21111 11111 Q ss_pred cccccchhhhhhcccccCCCCCcceeecCCCCCcChHH---HHHHHHHHHHHhcCCChhhccC-CCccccccccchhhHH Q lcl|NC_018087. 322 NQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMD---DILYFRKALYMALRVPLSRIPD-EQTQNVFDMSTAISRD 397 (520) Q Consensus 322 d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~---DV~YF~kkLy~aL~VP~SRl~~-~~~~~~~G~~~eItRD 397 (520) +..+.| .++.=. .+..|.+++.|. .+.-+++ -.++..+...++++||...|.. +++... .+ .+.-. T Consensus 263 nag~~~-vLe~~~----~~~~g~~~~pl~--~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n--~s-n~eq~ 332 (540) T protein:vir:41 263 APHTPL-VFSIPG----GDTVEVTFTPLN--TSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLG--GN-FAEVA 332 (540) T ss_pred cccceE-EEecCC----CcccceeEEecc--cchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCC--cc-cHHHH Confidence 222222 222100 112345555553 3333333 3457778899999999999853 222221 11 12222 Q ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_018087. 398 ELSFDKF-ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIG 476 (520) Q Consensus 398 ElkF~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vg 476 (520) ...|.+. +.-+..++...+...|.++ .+ ..+.+.|..+.. ...++ ..+++.+. -. T Consensus 333 ~~~f~~~tL~P~~~~ie~~ln~~L~~~---------~~-----~~~~i~f~~~~l----l~~D~-~~~~~~lv-----~~ 388 (540) T protein:vir:41 333 RRTYYESVVRPQQEIVSSVLTDFIQLK---------LD-----PGARFVFNEEIL----MESEF-VHNYALLV-----QC 388 (540) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhc---------cC-----CceEEEecchhh----cchHH-HHHHHHHH-----hC Confidence 3335443 6667777776666544222 11 124556665432 22232 23333321 12 Q ss_pred hhhhHHHHHHHHhCCCHHHHHHH------HHHHHHhhhcCCccCCcc------------ccC Q lcl|NC_018087. 477 KYISNHTAMKDFLQMSDEDIAAE------RKLIDEELSDKIFNPPEP------------EEI 520 (520) Q Consensus 477 ky~S~~~i~k~IL~~tDeeI~~~------~kqi~~E~~~~~~~~p~~------------e~~ 520 (520) -+++.+-++.+++.+...+-.-+ .........+.-.++|++ +++ T Consensus 389 G~lT~NE~Re~L~g~e~gdd~~l~p~n~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~ 450 (540) T protein:vir:41 389 GVLTPSEVREKLFGLDGGPDMFMVPSSIGKSAMKRQKRNYEKNQINEIKRTYAKYKPRIQEI 450 (540) T ss_pred CCCCHHHHHHHhCcCcCCCcccccccccccccccccccccCCCCccccccccchhcccccCc Confidence 35677777755434432110000 000000000000001100 000 No 43 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=98.30 E-value=1.6e-06 Score=52.41 Aligned_cols=448 Identities=13% Similarity=0.093 Sum_probs=228.9 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRS 80 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ 80 (520) |+|++- .++..-...-. .+.-..-+|+..... . .+..+...+.+ ...+...|..+=|. T Consensus 1 m~~~~~---~~~a~~~~~~~-------------~~~~~~y~aa~~~~~----~-~~~~~~s~d~~-~~~~~~~lr~RaRd 58 (495) T protein:vir:10 1 MNMTPS---GYQSLASGLLV-------------PVGASAYEGASGGHR----W-QDIGDYGPDTA-VASGIQTLRARSHH 58 (495) T ss_pred CCcccc---cccccchhhhh-------------HHHhhhhhccccCcc----c-CCCCCCChhHH-HHHHHHHHHHHHHH Confidence 777765 22222111000 000011122211110 0 01111111222 22356678888888 Q ss_pred H-hhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHH------HhcchhhhHHHHHhhcccc Q lcl|NC_018087. 81 L-LNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLN------MLNFQRKGSDHFKRWYVDS 153 (520) Q Consensus 81 m-a~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~------ll~f~k~g~~~fRrWYvDg 153 (520) | .++|-+..||+-+++-+|=. +..| .-..-++.+.++|..+|..-.+ .++|..--...+|.|.+|| T Consensus 59 l~rNn~~a~~av~~~~~~vVG~--Gi~p-----~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dG 131 (495) T protein:vir:10 59 NVRNNPWATNAVATWVAAAVGN--GLTP-----RWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSG 131 (495) T ss_pred HHhcChHHHHHHHHHHHhhcCC--Cccc-----ccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCC Confidence 8 78999999999999987643 3222 2122245677788888888775 4556555566899999999 Q ss_pred ceeEEEeeecCCCC-CCeeeeEecCcccee-eeeeccCCCCcccccccce--------ecceeecCcc-cccccccceec Q lcl|NC_018087. 154 RVFFHKIINPNRPK-DGIIELRRLDPRNVQ-FVRELDTKMENGVKVVKGY--------REYFLYDTEL-ESYQCGHQHFA 222 (520) Q Consensus 154 ri~~hkvid~~~~k-~GI~elr~lDPr~i~-~vr~i~~~~~~~~~~~~~~--------~ey~~y~~~~-~~~~~~~~~~~ 222 (520) -.|.-+.+++..+. .--..|.-|||..|. ..-+ ....++..+..|+ .-|+++.... ..+..+ . T Consensus 132 E~f~~~~~~~~~~g~~~~~~lqliepd~l~~~~~~--~~~~~g~~i~~GIe~d~~Gr~vaY~i~~~hpgd~~~~~----~ 205 (495) T protein:vir:10 132 EAFVIKKPRPLSEGLSVPLQLQIIEPDMLASDIPD--ETLPSGGYVKGGIRFSNGGKRKAYCFYRNHPAESSLIG----D 205 (495) T ss_pred ceEEEEeecccCCCCccceEEEEechhhcCCCCCC--CCCCCCCEEEeceEECCCCceEEEEEeecCCCcccccc----c Confidence 99988877754332 223689999998875 2211 1113344444443 4677774321 111111 1 Q ss_pred CCcceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHH Q lcl|NC_018087. 223 AGTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQH 302 (520) Q Consensus 223 ~~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~ 302 (520) ...-++||.+.|+|.. - ..+.---.+|.|+..+ .+++|.-.+||...-...-|-.=-+..=+.|. .-+.+-.. T Consensus 206 ~~~~~rvpA~~vlH~f-~-~r~gQ~RGis~la~i~-~l~~l~~y~dael~~a~i~A~~~~fi~~~~~~---~~~~~~~~- 278 (495) T protein:vir:10 206 PVDTVWIKAEHVLHVT-V-LTVRSDAGAPWFQLLL-RLNELDQYEDAELVRKKTAALFAAFIQEATAD---STGGPTIG- 278 (495) T ss_pred ccceeeechhheEecc-c-cCCCcccCcchhHHHH-HHHHhhHHHHHHHHHHHHhhhheeeeecCCCc---cccccccC- Confidence 2234789999998885 3 3454444478998655 58999999999999998888652222222221 11100000 Q ss_pred HHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHH-HHHHHHHHHHhcCCChhhccC Q lcl|NC_018087. 303 IMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDD-ILYFRKALYMALRVPLSRIPD 381 (520) Q Consensus 303 im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~kkLy~aL~VP~SRl~~ 381 (520) ..++-.+.....+| +.==++. =.-|.+|+.+.....-+..++ ++...+.+=.+|+||-+-|.. T Consensus 279 -------------~~~~~~~~~~~~~l-~pG~i~~--L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltg 342 (495) T protein:vir:10 279 -------------QPKRSKGGKRITGL-NPGTLQY--LQPGQEVKFSNPADVGTTYEPWLRYQLLSIAKGYGITYEMLTG 342 (495) T ss_pred -------------ccccccCcccceec-CCceeee--cCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhc Confidence 00111111111111 0000000 012445666655544455555 566666777889999988876 Q ss_pred CCccccccccchhhHHHHHHHHHHHHHHHH-HHHHHHHH-----HHHHHHhcCCC-ChhhHHhhhhceEEEe--eccchH Q lcl|NC_018087. 382 EQTQNVFDMSTAISRDELSFDKFISELQHK-FEEIFLSP-----LKSNLLLKRVI-TEDEWEAELNNIKIVF--HKNSYF 452 (520) Q Consensus 382 ~~~~~~~G~~~eItRDElkF~KFI~rLr~r-Fs~if~d~-----Lk~QLiLkgi~-t~eew~~~~~~I~~~f--~~Dn~f 452 (520) +-+... =+.+.-.-+.|-+.+.++|.+ |..-|..+ |+. .+|.|.+ .+.-|+.-.......| -.--+- T Consensus 343 D~s~~n---YSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~-a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~v 418 (495) T protein:vir:10 343 DLRGVN---YSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDF-AVASGAVVIPDYLQRRRYYNRVSWRTPRWEEV 418 (495) T ss_pred cccccc---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHcCCCCCCCchhhhHhhhccccccCCcccc Confidence 543221 123444456799999999875 55444443 443 4566655 4544443323233334 444456 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHH-HHHHhhhcCCccCCccccC Q lcl|NC_018087. 453 SEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERK-LIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 453 ~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~k-qi~~E~~~~~~~~p~~e~~ 520 (520) -.+||+.-...+++. -.-|.+-+..+ .+..-+|.-++.+ ..+...+.|+.-+.++.-. T Consensus 419 DP~Ke~~A~~~~i~~---------G~~s~~~~~a~-~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~p~~~ 477 (495) T protein:vir:10 419 DPLKKHLADLGDVRA---------GFAPISDKQAE-RGYDMEELFDMISDANQLIDEYDLRLDSDPRYV 477 (495) T ss_pred ChHHHHHHHHHHHHc---------CCCCHHHHHHH-cCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCcC Confidence 677877766555542 22355555544 3443333222211 2222223333211111111 No 44 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=98.24 E-value=2.2e-06 Score=51.60 Aligned_cols=433 Identities=10% Similarity=0.114 Sum_probs=206.0 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRS 80 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ 80 (520) |+|+ +.+-.+|.-|...-... +.+++-.+. | .|+. .....+.|+.||+ T Consensus 1 m~~~-~~ik~~~~~~~~~~~~~--~~~~~i~d~---~-------~i~~-------------------~~~~~~~i~~~~~ 48 (505) T protein:vir:79 1 MAFW-DTLKNLFRKGSAAVGMT--KSLGQIIDD---P-------RINL-------------------PADEVERIARDKR 48 (505) T ss_pred CchH-HHHHHHHHHhhhhhcch--hhhhhhhcc---c-------CCCC-------------------CHHHHHHHHHHHH Confidence 7764 44555555443322111 111110000 0 0000 0011133344444 Q ss_pred Hh--hccch--------------------hHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcc Q lcl|NC_018087. 81 LL--NNYEV--------------------DNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNF 138 (520) Q Consensus 81 ma--~~pEv--------------------d~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f 138 (520) |. ++|.+ ..++++.++ +=-+ +||++.+++. .-++..+.++.--+| T Consensus 49 ~Y~g~~~~l~~~~~~~~~~~~~~~slnl~~~i~~~~A~----ll~~-e~~~i~~~d~--------~~~e~l~~i~~~n~f 115 (505) T protein:vir:79 49 YYMDDFKQVTHKNSYGDTQKHELQSVNVTKLASAKLAS----LIFN-EQCQVTVSDE--------TANDFLDDVFQQNDF 115 (505) T ss_pred HhcCCCccccccccCCCccccceeecchHHHHHHHHHh----hhcC-CCceeecCCh--------HHHHHHHHHHHhccH Confidence 42 12222 122222222 1111 3556666653 234445667766778 Q ss_pred hhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccc------cc------eeccee Q lcl|NC_018087. 139 QRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVV------KG------YREYFL 206 (520) Q Consensus 139 ~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~------~~------~~ey~~ 206 (520) .....+.+..+.+-|..+|+..+|.. =..+..++|.++-++..-.........+. +. ..||+. T Consensus 116 ~~~~~~~~e~a~a~G~~~~k~~~D~~-----~~~i~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~ 190 (505) T protein:vir:79 116 YTTFEEKLEEWIALGSGCVRPYVDSG-----KIKLAWATADQVYPLQADTNQVNELAIASRTTEVENHRTIYYTLLEFHQ 190 (505) T ss_pred HHHHHHHHHHHhhcCCeEEEEEEeCC-----ceEEEEEcCCeeEEEEEcCCCeEEEEEEEEEEEecCCcceEEEEEEEEE Confidence 99999999999999999999999942 24588899988888632111111111110 00 012211 Q ss_pred ecCcccccccccceec------CCcce---ecCc-----ccEEEeecc-----cccC---C-----CCcchhhhHHHHHH Q lcl|NC_018087. 207 YDTELESYQCGHQHFA------AGTKI---KIPY-----SAMVYAHSG-----LVDC---C-----GKNIIGYLHRAVKP 259 (520) Q Consensus 207 y~~~~~~~~~~~~~~~------~~~~~---~I~~-----~aI~y~hSG-----L~d~---~-----~~~~~syL~~aik~ 259 (520) ... ..|......|. .|.+| .+|. +.+++.+.. -+.+ | .+..+|-++.|.-. T Consensus 191 ~~~--~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~ 268 (505) T protein:vir:79 191 WDH--GDYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTV 268 (505) T ss_pred ecC--ceEEEEEEEEecCCCCccCcccchhhcccccccCcceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHH Confidence 110 01111111110 01111 1111 122222111 0111 1 12346778888877 Q ss_pred HHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccC Q lcl|NC_018087. 260 ANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRD 339 (520) Q Consensus 260 ~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRRe 339 (520) +..|-..=+ -+.|.+|.=.+|||. |-.=|++...-....... + .-++| .+.+.+.++.- + T Consensus 269 id~lD~~~s--~~~~e~~~g~~~i~v-~~~~l~~~~~~~~~~~~~---~-~~~fd------~~~~~y~~~~~-------~ 328 (505) T protein:vir:79 269 IDAINRTHD--QFVDEVKKGQRRLIV-PAEWLKTGSSYGGQASET---H-PPMFD------PDETVYQAMYG-------D 328 (505) T ss_pred HHHHHHHHH--HHHHHHHhcccceee-chHHhcccCCCCcccccc---c-ccCCC------ccceeeeeccC-------C Confidence 766664433 456777877777665 211110000000000000 0 00011 02222333221 1 Q ss_pred CCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 340 GKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLS 418 (520) Q Consensus 340 GgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d 418 (520) ++ +.-|+++.+.=-..+ .+-+..+.+.+....+++-+-|..++++. .-++||....-.-..-+.+.|+.|...+.+ T Consensus 329 ~~-~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~--~TAtei~s~~~~l~~t~~~~~~~~~~al~~ 405 (505) T protein:vir:79 329 AS-EVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGI--QTATEVVTNNSQTYQTRSSYITQVEKTIKA 405 (505) T ss_pred CC-CCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCcccc--chHHHHHHHHhHHHHHHHHHHHHHHHHHHH Confidence 22 234888876422232 34577778888889999998887665422 345676655555566677777777777777 Q ss_pred HHHHHHHhcCCCC-------hhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCC Q lcl|NC_018087. 419 PLKSNLLLKRVIT-------EDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQM 491 (520) Q Consensus 419 ~Lk~QLiLkgi~t-------~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~ 491 (520) +++.=|.|..+.- ...++.-...+.++|...-.-. +++++- ..+.+++ . | .+|.++++++.... T Consensus 406 li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d--~~~~~~-~~~~~v~--~---G-i~s~e~~l~~~~~~ 476 (505) T protein:vir:79 406 LTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVD--QESKRA-ADLQAVQ--A---Q-VMPKKQFLMRNYGL 476 (505) T ss_pred HHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCC--HHHHHH-HHHHHHH--c---C-CCCHHHHHHhcCCC Confidence 7776554433211 1111111124667776332222 222221 1111211 1 2 47999988888999 Q ss_pred CHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 492 SDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 492 tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) ||+|-+++.++|++|.... .|+..++ T Consensus 477 ~eeea~~el~ri~~E~~~~---~p~~~~~ 502 (505) T protein:vir:79 477 DEEEADEWLAQIDAENSTA---EPEFNQF 502 (505) T ss_pred ChHHHHHHHHHHHHhcccc---CCCchhc Confidence 9999999999999997653 4666677 No 45 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=98.22 E-value=2.4e-06 Score=51.43 Aligned_cols=428 Identities=12% Similarity=0.116 Sum_probs=202.6 Q ss_pred ccccchhhhcchhhhhhhHHHhhhccC--CCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHH Q lcl|NC_018087. 4 LADSDLKMFAFWHKVDDTEYDKIINDK--AESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSL 81 (520) Q Consensus 4 ~~~~~l~~f~~~~~~~~~~~~~~~~~~--~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~m 81 (520) .|++....++=|.++--. .+.+++. ...++.|. ...+-|+.+|.+ T Consensus 1 m~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~i~~~~-------------------------------~~~~~i~~~~~~ 47 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGL--LKSLKDVTDHKKVNAND-------------------------------EDYKYIDMWKRL 47 (499) T ss_pred ChhHHHHHHHHHHHHhcc--ccchhhhhcCCCCcCCH-------------------------------HHHHHHHHHHHH Confidence 567666666666655211 0011000 00000000 111222333333 Q ss_pred hh------------------------ccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhc Q lcl|NC_018087. 82 LN------------------------NYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLN 137 (520) Q Consensus 82 a~------------------------~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~ 137 (520) .. .+-...+++..++=+ - .+||++.+++... .+..+.++.--+ T Consensus 48 Y~g~~~~~~~~~~~~~~~~~~~~~~s~n~~~~iv~~~a~~l----~-~ep~~i~~~d~~~--------~e~l~~~~~~n~ 114 (499) T protein:vir:80 48 YQGNYAEWHNLNYEHNGNPVNRRQLSMNLPKVTAKYMSKLL----F-NEKVKINIDDETA--------EEFVLNVLKTNG 114 (499) T ss_pred hcCCcchhhccccccCCCccccceeecchHHHHHHHHHHhh----h-CCcceEeeCCHHH--------HHHHHHHHhhcc Confidence 21 122222333333211 1 2577777776433 334455665566 Q ss_pred chhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccc----cc----eecceeecC Q lcl|NC_018087. 138 FQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVV----KG----YREYFLYDT 209 (520) Q Consensus 138 f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~----~~----~~ey~~y~~ 209 (520) |++...+++..-..-|..|||..+|.+ |=..+..++|.++-++..-.........+. .+ ..||..++. T Consensus 115 f~~~~~~~~~~a~~~G~~~~~~~~D~~----~~~~i~~v~a~~~~Pi~~d~~~~~~~~f~~~~~~~~~~y~~lE~h~~~~ 190 (499) T protein:vir:80 115 FTKNMERYIEYGEAMGGFVIKVYHDGN----KNVKVSFATADCMYPLSNDSENVDECLIANSFHKNNKYYKLLEWNEWKG 190 (499) T ss_pred HHHHHHHHHHHHhhcCcEEEEEEECCC----CcEEEEEEcCCceEEEEecCCCeEEEEEEEEEeecCeEEEEEEEEEecc Confidence 899999999999999999999999943 345689999999998643221111111110 00 012222211 Q ss_pred -ccccccccccee------cCCccee-------c---------CcccEEEeeccc---ccCCCCcchhhhHHHHHHHHHH Q lcl|NC_018087. 210 -ELESYQCGHQHF------AAGTKIK-------I---------PYSAMVYAHSGL---VDCCGKNIIGYLHRAVKPANQL 263 (520) Q Consensus 210 -~~~~~~~~~~~~------~~~~~~~-------I---------~~~aI~y~hSGL---~d~~~~~~~syL~~aik~~NqL 263 (520) ....|......| ..|.++. + +..-++|..-.. .++.-...+|-++.|...+..| T Consensus 191 ~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~l 270 (499) T protein:vir:80 191 EKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVPLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTL 270 (499) T ss_pred cceeeEEEEEEEEeccCccccCcccchhhhccCcCCceeecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHH Confidence 111121111111 0111110 1 111122221110 0112223467888888888888 Q ss_pred HHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCC-ccc-cccccchhhhhhcccccCCC Q lcl|NC_018087. 264 KLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTG-KVK-NQANMMALTEDYWLQRRDGK 341 (520) Q Consensus 264 ~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TG-ev~-d~~~~msmlEDywLpRReGg 341 (520) -..-+.+ .|..+.=.+|+|. +..=++ ...-++.... ... +++.+..+ +=..++ T Consensus 271 D~~~s~~--~~e~~~~~~~i~v-~~~~l~----------------~~~~~~g~~~~~~~~~~~~~~~~------~~~~~~ 325 (499) T protein:vir:80 271 DLMFDSY--YQEFKLGKKKVLV-PSSFVK----------------TAVNLDGSTTQYFDSTDEAFFLY------QGEQDD 325 (499) T ss_pred HHHHHHH--HHHHHhcccceec-chhhhh----------------ccCCCCCCcccCCCcccceeeEe------eccCCC Confidence 7766664 3778887777764 211110 0001111110 001 11111110 001122 Q ss_pred CCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 342 AVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPL 420 (520) Q Consensus 342 rgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~L 420 (520) .|--|+++.+.-.-.+ .+-+..+.+.+....|+|-+-+..++++. --++||.-....-..-+...++.|..-+.+++ T Consensus 326 ~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~--~TAtei~s~~~~l~~~~~~~~~~~~~~l~~l~ 403 (499) T protein:vir:80 326 NGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGL--KTATEVVSEKSETYQTKNSHSQLIEQGIKEMI 403 (499) T ss_pred CcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccc--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2234777766443332 35677788899999999988887654422 13456644433322234444444444444444 Q ss_pred HHHHHh---cCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHH Q lcl|NC_018087. 421 KSNLLL---KRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIA 497 (520) Q Consensus 421 k~QLiL---kgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~ 497 (520) +.=|-+ -+...-..|+ ...+.++|...-.-.+..++ +.+.++-- .| .+|.+++..+....||+|.+ T Consensus 404 ~~il~~~~~~~~~~~~~~~--~~~v~v~f~d~i~~d~~~~~-------~~~~~~~~-~G-i~S~et~l~~~~~~~d~ea~ 472 (499) T protein:vir:80 404 VSILEVGKLIKAYDGDTVE--LDTITVDFDDSIAQDEDTTI-------NRYTTAKN-QG-MIPLKIALQRAWNITEAEAD 472 (499) T ss_pred HHHHHHHHHhccccCCCCC--ccceEEEeCCCCCCCHHHHH-------HHHHHHHH-cC-CCCHHHHHhhcCCCChHHHH Confidence 432211 1111112222 24688888543333332221 22222210 02 46899988888899999999 Q ss_pred HHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 498 AERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 498 ~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) ++.++|++|.... .+.|+..-+ T Consensus 473 ~el~~i~~E~~~~-~~~~d~~g~ 494 (499) T protein:vir:80 473 EWAEMLAKEKQAE-IPNNDMTGI 494 (499) T ss_pred HHHHHHHHHhhcC-CCCCCcccc Confidence 9999999998765 344433333 No 46 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=98.18 E-value=3e-06 Score=50.85 Aligned_cols=437 Identities=12% Similarity=0.056 Sum_probs=197.1 Q ss_pred cccccchhhhcchhh-------hhhhHHHhhhc-----cCCCcccCCCCCCCceeeccc-cccc----cccccccccc-- Q lcl|NC_018087. 3 MLADSDLKMFAFWHK-------VDDTEYDKIIN-----DKAESITAPKFDDGATEVDSQ-DIAY----NGVFQKLYGS-- 63 (520) Q Consensus 3 ~~~~~~l~~f~~~~~-------~~~~~~~~~~~-----~~~~s~~~p~~~dg~~~i~~~-~~a~----~g~~~~~~~~-- 63 (520) |+ -++-+ -|||-. .++..---.++ ..+++..||...-+..-..-. -.++ .+...+.+++ T Consensus 1 ~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~d~~~~~~~r~g~~~~~~~~g~~~ 78 (648) T protein:vir:79 1 MA-RKVWG-RGFWSRISLMWRDEDDDKEPLVLEESMQLGEAPGAMPKGGGGGGSAKRDPKMSLVKRIGLAIMDGGGGGRD 78 (648) T ss_pred Cc-cchhc-chhhhhhhhhccCccccccccccccccccCCCccccCCCCcccccccccchhHHHHHhHHHHHhhcCCccc Confidence 32 22221 245543 22211000111 122333333322222111110 0111 0111111111 Q ss_pred ccccchhHHHHHHHHHHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhH Q lcl|NC_018087. 64 QDPTATSTRELINTYRSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGS 143 (520) Q Consensus 64 ~~~~~~~~~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~ 143 (520) .....-+-..|-+- ...+|-|..||+.|.+.+.-.. ..|.-++-.-.+..+.+ .++..-+...+++ T Consensus 79 ~~epp~d~~~l~~l---~~~np~V~~aI~iia~~ia~l~-----~~i~~~~~~~~~~~~~~------~ll~rPn~~~t~~ 144 (648) T protein:vir:79 79 FEEPEFDFNEITSA---YNTEGYVRQAVDKYIEMMFKAD-----WDFVSKNPNAVEYIRMR------FTLMAEATQIPTN 144 (648) T ss_pred cccCCcCHHHHHHH---HhcChHHHHHHHHHHHHHhhCc-----ceEEecCCccchhhHHH------HHhhccCCCCCHH Confidence 22222233333332 2469999999999888765432 22222211111111111 1222334445666 Q ss_pred HHHHh----hccccceeEEEeeecCCC------------CCCeeeeEecCccceeeeeeccCCCCcccccccceecceee Q lcl|NC_018087. 144 DHFKR----WYVDSRVFFHKIINPNRP------------KDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLY 207 (520) Q Consensus 144 ~~fRr----WYvDgri~~hkvid~~~~------------k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y 207 (520) ++++. +.+-|--|..++-|.+.. ..-+.+|.+|+|.+++..++. ++. -.+|+| T Consensus 145 ~f~~~l~~~lll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~d~-----~g~------~~~Y~y 213 (648) T protein:vir:79 145 QLFIEIAEDLVKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKRDK-----FGM------IKGWQQ 213 (648) T ss_pred HHHHHHHHHHHhcCCeEEEEEecCCCccchhhhhhhhccccceeeeEeecCceeEEEEcC-----CCc------eeeeEE Confidence 65555 457898899888663320 112578888999888775431 111 123445 Q ss_pred cCcccccccccceecCCcceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHH-HHHHhcCccceEEEc Q lcl|NC_018087. 208 DTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMM-IYRITRAPDRRVFYI 286 (520) Q Consensus 208 ~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalV-IyRi~RApeRRvFyI 286 (520) .+. +.+.++.++++.|+|...+ -+.++...+|-|..|+..+.....+++..- .++=- |--.-+..+ T Consensus 214 ~~~-----------g~~~~~~~~~~dIIHik~~-~~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NG-a~P~gil~~ 280 (648) T protein:vir:79 214 EQE-----------GQDKPQKFKPEDIVHIYYK-REKGRAFGTPWLLPALDDIRALRQVEENVLRLVYRN-LHPLWHVKV 280 (648) T ss_pred Eec-----------CCceeEEecCccEEEEccC-CCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhcc-CCccEEEEe Confidence 322 1123356788888877522 234555668888888888877666665443 22222 222455555 Q ss_pred cCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh---HHHHHH Q lcl|NC_018087. 287 DTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE---MDDILY 363 (520) Q Consensus 287 DvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~Y 363 (520) ..+......+++.++.+-..|++..+ . +|+...+...++-....-+ ++=.++ T Consensus 281 ~~~~~~~e~~k~~~e~~~~~~~~~~i---~----------------------gg~v~~~~~~i~~~~s~~dlqfle~rk~ 335 (648) T protein:vir:79 281 GLEQEGFGAEEGEVDLVRGEVENMDV---E----------------------GGMVTTERVNISSIASNQIIDAKEYLKH 335 (648) T ss_pred CCCccchHHHHHHHHHHHHhcccccc---c----------------------ccccccceeeccccCCHHHHHHHHHHHH Confidence 55555455555666665555554221 1 1221222222222112112 223467 Q ss_pred HHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceE Q lcl|NC_018087. 364 FRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIK 443 (520) Q Consensus 364 F~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~ 443 (520) ..+..-++.+||...|...++.+ ....+.. ..-|...|..|+..+..++...+-+.+++.+-+. .|-.....++ T Consensus 336 ~~~eIa~aFgVPP~lLG~~~~ss--~stae~~--~~~~~~~i~~l~~~i~~~le~~~~~~ll~e~~l~--~~l~~d~~ie 409 (648) T protein:vir:79 336 FEQRAFTVLGVSELMMGRGGTAS--RSTGDNL--SSDFKDRIKALQKVMATFINEFMVKEILMEGGFD--PVLNPDDKVE 409 (648) T ss_pred HHHHHHHHhCCCHhHcccCCCcc--chHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcc--ccccccceEE Confidence 88899999999999886432221 1111222 2236777888887777777665555555544332 2322234567 Q ss_pred EEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHH----HHHHH---HHHh-hhcCCccCC Q lcl|NC_018087. 444 IVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIA----AERKL---IDEE-LSDKIFNPP 515 (520) Q Consensus 444 ~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~----~~~kq---i~~E-~~~~~~~~p 515 (520) |+|..-..-.+.. |.+.+.++ +-+-++|.+-++. .+.+..-+=. ....+ ...+ ...+..+.| T Consensus 410 F~~~~Llr~D~~~-------~a~~~~~l--~~~GilT~NEaR~-~lGlpPi~~g~~~~~l~~~~~~~~~~~~~~~~~~~~ 479 (648) T protein:vir:79 410 FRFNEIDMDSKIK-------LENQAVFL--YEHNAISEDEMRE-LIGRDPVDDGEGRAKMHLQMVTIAQATALAALAPTP 479 (648) T ss_pred EeecccchhhHHH-------HHHHHHHH--HhCCCcCHHHHHH-HhCCCCCCCCCCccccccccccchhccccccCCCCC Confidence 7765222222222 22333222 2234678888885 4787653200 00000 0001 111111111 Q ss_pred ccccC Q lcl|NC_018087. 516 EPEEI 520 (520) Q Consensus 516 ~~e~~ 520 (520) ..+.- T Consensus 480 ~~~~~ 484 (648) T protein:vir:79 480 AGGSS 484 (648) T ss_pred CCCCC Confidence 11111 No 47 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=98.18 E-value=3e-06 Score=50.84 Aligned_cols=439 Identities=13% Similarity=0.125 Sum_probs=193.4 Q ss_pred hhhhcchhhhh------------hhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHH Q lcl|NC_018087. 9 LKMFAFWHKVD------------DTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELIN 76 (520) Q Consensus 9 l~~f~~~~~~~------------~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~ 76 (520) |.+|.-..++. +..++..+.+...+ ..-+..++.....+++.-..=.++.-| ..-+...+..+|-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~k~~~~~~~~~~~~~~~~~~~~~g~-~~~~~~~~~~~l~~ 78 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQE-QISKAMNNKEVAYSQPVIGSMSANPGF-KTKPSIRNNQDLHG 78 (547) T ss_pred CchhhhhhhhcCCccccccccccccccchhhhhhhHH-HHHHhhcccchhhhchhhheeeccccc-ccCCccCChhHHHH Confidence 22222221111 00011111111000 111111111111111110000000111 12233345555544 Q ss_pred HHHHHhhccchhHHHHhhhceeeEec------CCCc--EEEEeeccchhhhHHHHHHHHHHHHHHHHhcchh-----hhH Q lcl|NC_018087. 77 TYRSLLNNYEVDNAVQEIVSDAIVYE------EGFD--VVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQR-----KGS 143 (520) Q Consensus 77 ~YR~ma~~pEvd~Ai~eIvneaiv~d------~~~~--~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k-----~g~ 143 (520) .=+..+.+|-|..||+.|+|.+..+- .+.- .|.+.-......+.-+..+. +.+.+++-.++.. +.. T Consensus 79 l~~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~-~l~~~l~~pn~~~~p~~~s~~ 157 (547) T protein:vir:63 79 VLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIK-RIESFIEKTGVDNDINRDSFS 157 (547) T ss_pred HHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHH-HHHHHHHhhCCCCCCccchHH Confidence 44455778999999999998766431 1111 23332222233333333332 3333444444432 445 Q ss_pred HHHHhh----ccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccc Q lcl|NC_018087. 144 DHFKRW----YVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQ 219 (520) Q Consensus 144 ~~fRrW----YvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~ 219 (520) ++++.| ++-|.-|+.++-|. ++ -+.+|.+|||..|+.+.... +.....+. +|++.. T Consensus 158 ~f~~~lv~d~ll~Gn~~~~i~rd~-~G--~~~~L~~l~p~~V~~~~~~~-----g~~~~~~~--~y~~~~---------- 217 (547) T protein:vir:63 158 SFVKKIVRDTYMYDQVNFEKVFNR-NQ--SMVRFVAKDPTTIFFATTAD-----GKIPDNGN--RFVQVI---------- 217 (547) T ss_pred HHHHHHHHHHHhhCCEEEEEEECC-CC--cEEEEEEecCceeEEEECCc-----cccccCce--EEEEEc---------- Confidence 566665 46699999888763 22 39999999999999864322 21111111 122210 Q ss_pred eecCCcceecCcccEEEeecc-cccCC-CCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCC-CCchHHH Q lcl|NC_018087. 220 HFAAGTKIKIPYSAMVYAHSG-LVDCC-GKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTG-NMPARKA 296 (520) Q Consensus 220 ~~~~~~~~~I~~~aI~y~hSG-L~d~~-~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvG-nlpk~KA 296 (520) ..+..+.++.+.|+|.+.. +.++. +...+|-|..|++++.....++....=+----+--+-|..+..+ +|. .++ T Consensus 218 --~~~~~~~~~~~eiih~r~n~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~ls-~e~ 294 (547) T protein:vir:63 218 --DQKIVATFNAREMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQS-QHA 294 (547) T ss_pred --CCcEEEEeccccEEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCCC-HHH Confidence 0112247888889888632 22222 34567889999999988888877665444444555666666654 344 344 Q ss_pred HHHHHHHH-HhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHH---HHHHHHHHHhc Q lcl|NC_018087. 297 AQHMQHIM-NSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDI---LYFRKALYMAL 372 (520) Q Consensus 297 eqyl~~im-~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV---~YF~kkLy~aL 372 (520) .+-+++-+ ..|. | ..+..+. .++.+ .|.++..|- .+..++.-+ +|..+.+-++. T Consensus 295 ~~~lk~~~~~~~~---------G-~~nagk~-~vl~~---------~g~~~~~l~--~~~~d~qfle~~~~~~~~Ia~af 352 (547) T protein:vir:63 295 LEIFKREWKNSLS---------G-INGSWQI-PVVSA---------EDVKFVNMT--PSARDMEFEKWLNYLINVISALY 352 (547) T ss_pred HHHHHHHHHHHhc---------C-ccccccc-ccccC---------CCceEEEcC--CChhHHHHHHHHHHHHHHHHHHh Confidence 33333333 3343 1 1111121 12211 134555553 344444443 44668899999 Q ss_pred CCChhhccCCCcc-ccccccchhhHH---H--HHHHH-HHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEE Q lcl|NC_018087. 373 RVPLSRIPDEQTQ-NVFDMSTAISRD---E--LSFDK-FISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIV 445 (520) Q Consensus 373 ~VP~SRl~~~~~~-~~~G~~~eItRD---E--lkF~K-FI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~ 445 (520) +||...|...+.. .....++.+++. + ..|.. -+.-+..++...|...| + +. + ...+.++ T Consensus 353 gVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~tL~P~~~~ie~~ln~~L----~-----~~--~---~~~~~~~ 418 (547) T protein:vir:63 353 GIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHI----V-----AE--F---GDKYTFQ 418 (547) T ss_pred CCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhc----c-----cc--c---CCceEEE Confidence 9999998643211 111112223322 2 22333 35555555555444432 1 21 1 1346677 Q ss_pred eeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHH-H----------HHHH-----HHHHHHhhhc Q lcl|NC_018087. 446 FHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDE-D----------IAAE-----RKLIDEELSD 509 (520) Q Consensus 446 f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDe-e----------I~~~-----~kqi~~E~~~ 509 (520) |.....-.+ .+|..+...+. .-+++..-++.. +.|... + +... .++.+.|..+ T Consensus 419 f~~~~~~~~-------~~~~~~~~~~~---~g~lT~NE~R~~-~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~ 487 (547) T protein:vir:63 419 FVGGDIKSE-------LESVKILAEKA---KVAMTVNEVRKE-LNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQ 487 (547) T ss_pred eeccccccH-------HHHHHHHHHHh---CCCcCHHHHHHH-hCCCCCCCCCceeecccccccccccccccCCccccch Confidence 765444333 22333322221 124677777744 677541 1 0000 0011111000 Q ss_pred ------------CCccCCccccC Q lcl|NC_018087. 510 ------------KIFNPPEPEEI 520 (520) Q Consensus 510 ------------~~~~~p~~e~~ 520 (520) .--++|+++-. T Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~~ 510 (547) T protein:vir:63 488 SNLQMLQEQTGNRVSTDVEDIPD 510 (547) T ss_pred hhccccccccCCCCCCCCCCCCC Confidence 00011111111 No 48 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=98.18 E-value=3e-06 Score=50.84 Aligned_cols=404 Identities=9% Similarity=0.100 Sum_probs=189.5 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRS 80 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ 80 (520) |+.+- +||+|-. +.. ..+ .+.+..+... .+ +.|. ......+ +. +. T Consensus 1 M~~~~----~~f~~~~-r~~----------~~~-~~~~~~~~~~-~~-----~~g~-~~~~~~v-----~~-------~~ 45 (429) T protein:vir:10 1 MDSVK----KFFNFEK-RQT----------SQV-IELNKDDEKL-LE-----WLGI-SPSTISV-----KG-------KN 45 (429) T ss_pred Cchhh----hhhcccc-cCc----------ccc-cccCCChHHH-HH-----HhcC-CCCccee-----ch-------hh Confidence 33221 2333321 111 111 1111111110 11 1111 1111000 01 12 Q ss_pred HhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHH-hcchhhhHH----HHHhhccccce Q lcl|NC_018087. 81 LLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNM-LNFQRKGSD----HFKRWYVDSRV 155 (520) Q Consensus 81 ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~l-l~f~k~g~~----~fRrWYvDgri 155 (520) .+++|-|.+||+-|.+.+.-.+ +.+.-+.. +..+.....-..++|+. =|=..++.+ ++..+.+.|.- T Consensus 46 al~~~~v~~~i~~ia~~ia~l~-----~~~~~~~~---~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna 117 (429) T protein:vir:10 46 ALKVATVFACIKILSESVSKLP-----LKIYQEDE---YGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNS 117 (429) T ss_pred hhccHHHHHHHHHHHHhhccCc-----eEEEEecC---CceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCe Confidence 4578999999999988876532 22221211 00011111112222221 111233434 44557788999 Q ss_pred eEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEE Q lcl|NC_018087. 156 FFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMV 235 (520) Q Consensus 156 ~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~ 235 (520) |+.++-|.. ..+.+|.+|+|..+..+++-..... .....||.++.. +..+.++.+.|+ T Consensus 118 y~~i~r~~~---G~~~~L~~i~~~~v~v~~~~~~~~~------~~~~~~~~~~~~-------------g~~~~~~~~evi 175 (429) T protein:vir:10 118 YANIEFDRK---GKVQALWPIDASKVTVYIDDVGLLN------SKTKMWYVVNTG-------------GQQRVLKPEEIL 175 (429) T ss_pred EEEEEECCC---CcEEEEEEEcCceeEEEEcCccccc------ccceEEEEEccC-------------CeEEEEccccEE Confidence 999986632 2389999999999988654221111 111223333222 234578999999 Q ss_pred EeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeec Q lcl|NC_018087. 236 YAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDA 315 (520) Q Consensus 236 y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~ 315 (520) |..-+ ...++...+|.|..|.+++.....++....=+----+.-+-+..++ +.|.+.++++..+.+...|.. T Consensus 176 h~~~~-~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~-~~l~~e~~~~~~~~~~~~~~g------ 247 (429) T protein:vir:10 176 HFKNG-ITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-GDLNEDAKKVFRENFESMSSG------ 247 (429) T ss_pred EecCC-CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhcc------ Confidence 88533 2455656789999999999999888887766655555556777776 567777776665555444432 Q ss_pred CCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHH---HHHHHHHHHHHhcCCChhhccCCCccccccccc Q lcl|NC_018087. 316 RTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMD---DILYFRKALYMALRVPLSRIPDEQTQNVFDMST 392 (520) Q Consensus 316 ~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~---DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~ 392 (520) .+|..+.+ .++ + |.+++.|. .+..+++ --++..+.+.++++||.+-|....+.. .+ T Consensus 248 ----~~n~~~~~-vl~-------~---g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~----~s 306 (429) T protein:vir:10 248 ----LQNSHRIA-LMP-------V---GYQFQPIS--LNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT----LN 306 (429) T ss_pred ----ccccCcee-ecC-------C---CceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC----cc Confidence 11111222 221 2 45666653 2333333 335678889999999998885322211 11 Q ss_pred hhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018087. 393 AISRDELSFDKF-ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLM 471 (520) Q Consensus 393 eItRDElkF~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~ 471 (520) .+.-.-..|.++ |.-+-..+..- +=+.++++.+|. ....+.|.-+. +... -+..|.+.++.+ T Consensus 307 n~e~~~~~f~~~~l~P~~~~ie~~---------ln~kl~~~~~~~---~g~~~~fd~~~----ll~~-d~~~~~~~~~~~ 369 (429) T protein:vir:10 307 NIEQQQQQFYTDTLQATLTMYEQE---------MTYKLFLDSELD---KGFYSKFNVDA----ILRA-DIKTRYEAYRTG 369 (429) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHH---------HHHhhcChhhcC---CCcEEEeechh----hhcC-CHHHHHHHHHHH Confidence 122222224332 22222222221 222445666654 22345554332 2211 124456666555 Q ss_pred hcccchhhhHHHHHHHHhCCCHHHHHHHHHHHH-------HhhhcCCccC-CccccC Q lcl|NC_018087. 472 EPYIGKYISNHTAMKDFLQMSDEDIAAERKLID-------EELSDKIFNP-PEPEEI 520 (520) Q Consensus 472 ~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~-------~E~~~~~~~~-p~~e~~ 520 (520) -.- -++|.+-++. .|++.+.+ .-++.+- ++..+...+. -+.++. T Consensus 370 ~~~--G~~T~NE~R~-~~gl~p~~--ggD~~~~~~n~~~~d~~~~~~~k~g~~~~~~ 421 (429) T protein:vir:10 370 IQG--GFLKPNEARS-KEDLPPEA--GGDRLLVNGNMLPIDMAGQAYLKGGDTNGEV 421 (429) T ss_pred HhC--CCcCHHHHHH-HhCCCCCC--CcCeeeecccccchhhccccccCCCCCCCCC Confidence 322 3678888874 46765421 1111000 0000000000 011111 No 49 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=98.13 E-value=3.9e-06 Score=50.24 Aligned_cols=393 Identities=12% Similarity=0.067 Sum_probs=186.2 Q ss_pred chhHHHH----H-------HHHHHHhhccchh-------------------------HHHHhhhceeeEecCCCcEEEEe Q lcl|NC_018087. 68 ATSTREL----I-------NTYRSLLNNYEVD-------------------------NAVQEIVSDAIVYEEGFDVVSID 111 (520) Q Consensus 68 ~~~~~~L----I-------~~YR~ma~~pEvd-------------------------~Ai~eIvneaiv~d~~~~~V~l~ 111 (520) ..++.++ + .+|+.+..|++=+ -+|+..++-. + .+.+.+. T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l-~----~~g~~~~ 75 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL-D----IEGFRIS 75 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhh-c----cCceecC Confidence 1111111 1 1222222222222 1222211111 0 0111111 Q ss_pred eccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeec--CCCCCCeeeeEecCccceeeeeec-- Q lcl|NC_018087. 112 LDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINP--NRPKDGIIELRRLDPRNVQFVREL-- 187 (520) Q Consensus 112 Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~--~~~k~GI~elr~lDPr~i~~vr~i-- 187 (520) ++. ...+....|++-=+|+....++++.-.+-|+-|.+.-... ....+|..-++.+||+.+..+.+- T Consensus 76 -~d~--------~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~i~~~~p~~~~~~~D~~~ 146 (480) T protein:vir:78 76 -EDS--------EGLEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRN 146 (480) T ss_pred -CCc--------hhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeEEEEEcccceEEEEcCCC Confidence 111 2233445556556788888999999999999876654221 123568888999999999887652 Q ss_pred cCCCCcccccc------cceecceeecCccccccc--cc---ceecCCcce--ecC-cccEEEeecccccCCCCcchhhh Q lcl|NC_018087. 188 DTKMENGVKVV------KGYREYFLYDTELESYQC--GH---QHFAAGTKI--KIP-YSAMVYAHSGLVDCCGKNIIGYL 253 (520) Q Consensus 188 ~~~~~~~~~~~------~~~~ey~~y~~~~~~~~~--~~---~~~~~~~~~--~I~-~~aI~y~hSGL~d~~~~~~~syL 253 (520) ..+..-.++.+ +.+..+.+|.+....+.. ++ ........+ .++ ...|.|++-- +..+..+.|=| T Consensus 147 ~~~~~~~i~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~--~~~~~~G~s~i 224 (480) T protein:vir:78 147 TRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDP--RLGNRYGRSEI 224 (480) T ss_pred ccceEEEEEEEEeecCCCceEEEEEEeCCeEEEEEecCCCccccccccccccCCCCCcceEEeeccc--ccCCccCcccc Confidence 22333333321 112223344443211110 00 000000000 000 0123333311 12222233444 Q ss_pred HHHHHHH-HH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhh Q lcl|NC_018087. 254 HRAVKPA-NQ-LKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTE 331 (520) Q Consensus 254 ~~aik~~-Nq-L~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlE 331 (520) .+.++++ .. =+++-+.+++-...-.|.|-|.=.+....+..+. +. +.-.+.. T Consensus 225 ~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~---------------------~~-----~~~~~~~ 278 (480) T protein:vir:78 225 SPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGE---------------------NT-----TLDIYYG 278 (480) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhhcCCccccccccc---------------------cc-----hhhhhhh Confidence 4433221 11 1355566777777777777654222222111100 00 0001111 Q ss_pred -hhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHH Q lcl|NC_018087. 332 -DYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQH 410 (520) Q Consensus 332 -DywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~ 410 (520) ..|++ |..+++.++++.+-=.-++-++-....++..-++|..=|...+. +. ..+..|---+.....-+.+.|+ T Consensus 279 ~~~~~~----~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~-n~-~Sg~Alk~~~~~l~~ka~~~~~ 352 (480) T protein:vir:78 279 RILTLA----SEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSE-NP-ASAEAIIATDSRIVKMAERKGR 352 (480) T ss_pred hhccCC----CCCceEEecCccCHHHHHHHHHHHHHHHhcccCCChHHhccccC-cc-hHHHHHHHHHHHHHHHHHHHHH Confidence 12332 33467888887432223444666667777778888766643321 10 1222344445557777889999 Q ss_pred HHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhC Q lcl|NC_018087. 411 KFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQ 490 (520) Q Consensus 411 rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~ 490 (520) .|..-+...++.=+.+.|.--..+|. .|.+.|..-..=+. .+.++.+.++-.-.+-.+|.+++... |. T Consensus 353 ~f~~~l~~~~~l~~~~~g~~~~~~~~----~i~v~f~~~~~~s~-------~~~ad~~~kl~~~g~~~~s~et~~~~-lg 420 (480) T protein:vir:78 353 IFGGAWERAMRIAMQIMGREVTEEYT----RLETVWRDPSTPTV-------AAKADAVSKLYANGQGPIPKEQARID-LG 420 (480) T ss_pred HHHHHHHHHHHHHHHHcCCCccccce----eeeEEecCCCCCCH-------HHHHHHHHHHHHhccccCCHHHHHhc-CC Confidence 99999999998877777754445543 46677753332222 23444444444433455799999965 89 Q ss_pred CCHHHHHHHHHHHHHhhhcCC-------------ccCCccccC Q lcl|NC_018087. 491 MSDEDIAAERKLIDEELSDKI-------------FNPPEPEEI 520 (520) Q Consensus 491 ~tDeeI~~~~kqi~~E~~~~~-------------~~~p~~e~~ 520 (520) +++++++++++..+++..+.+ .+.|++-+. T Consensus 421 ~~~d~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 463 (480) T protein:vir:78 421 YTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTET 463 (480) T ss_pred CCHhHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCCCCCCC Confidence 999999888765444443221 111111111 No 50 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=98.08 E-value=5e-06 Score=49.64 Aligned_cols=393 Identities=12% Similarity=0.102 Sum_probs=186.6 Q ss_pred hhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchh Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVD 88 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd 88 (520) |.||.||.++.... ...+...+|.-... ....+..+ + .+..+++|.|. T Consensus 1 m~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~-------~~~~g~~v-------~-------~~~al~~~~v~ 48 (419) T protein:vir:57 1 MFIPQFWKGRPSEN-----------RVNWQVVPGGMRSS-------SSQAGVII-------T-------PETALALSAVR 48 (419) T ss_pred CcchhhhccCCccc-----------cccccccccccccc-------cccCCcee-------c-------hHHhhccHHHH Confidence 78899998764421 11111111110000 00001111 1 12235678899 Q ss_pred HHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHH-HHHHHHH-HhcchhhhHHH----HHhhccccceeEEEeee Q lcl|NC_018087. 89 NAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISD-EFNSVLN-MLNFQRKGSDH----FKRWYVDSRVFFHKIIN 162 (520) Q Consensus 89 ~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~e-eF~~i~~-ll~f~k~g~~~----fRrWYvDgri~~hkvid 162 (520) .||+-|.+.+.-. |+.|-=.... ..++.+.+ -...+|+ --|-..++.++ +..+.+.|.-|+.++-+ T Consensus 49 ~~i~~ia~~ia~l-----p~~~~~~~~~---g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~ 120 (419) T protein:vir:57 49 ACVTLLAESVAQL-----PCVLYRRTEN---GGREIAFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRN 120 (419) T ss_pred HHHHHHHHhhccC-----ceEEEEEcCC---CceeccccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEC Confidence 9999999987643 2222000000 00111110 1122222 13334455554 44577899988888755 Q ss_pred cCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccc Q lcl|NC_018087. 163 PNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLV 242 (520) Q Consensus 163 ~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~ 242 (520) . ..-+++|.+|+|..+...+. .++. .||.|.. .+-.+|.+.|+|... . T Consensus 121 ~---~G~~~~L~pl~~~~v~v~~~-----~~g~-------~~y~~~~---------------~~~~~~~~~vih~r~--~ 168 (419) T protein:vir:57 121 G---RGDITELIPINPHKVIVLKG-----PDGM-------PYYDIPS---------------IGEILPMRMVHHIKS--F 168 (419) T ss_pred C---CCcEEEEEEEcCcceEEEEC-----CCce-------EEEEEcC---------------CceEEchhhEEEecC--c Confidence 2 22489999999999987432 2222 1333321 112578888887752 3 Q ss_pred cCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCcccc Q lcl|NC_018087. 243 DCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKN 322 (520) Q Consensus 243 d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d 322 (520) ..++...+|.+..|.+.+.....+++...=+----+--+-+...+.. +.....++-...+..++..+.- | .++ T Consensus 169 ~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-~~~~~~~e~~~~~~~~~~~~~~-----g-~~n 241 (419) T protein:vir:57 169 SLDGYIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFE-AKAIASQAAVDAILAKWTERYG-----G-VRN 241 (419) T ss_pred CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCc-CCcccCHHHHHHHHHHHHHHhc-----c-ccc Confidence 55666778999999999888888877665443333444555555422 1222222333444444444221 1 111 Q ss_pred ccccchhhhhhcccccCCCCCcceeecCCCCCcChHHH---HHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHH Q lcl|NC_018087. 323 QANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDD---ILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDEL 399 (520) Q Consensus 323 ~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~D---V~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDEl 399 (520) ..+.+ .++ .|+++..|.- +.-++.- .++..+.++++.+||.+-|...++.. .+.+.-.-+ T Consensus 242 ag~~~-vl~----------~g~~~~~l~~--~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t----~sn~e~~~~ 304 (419) T protein:vir:57 242 AFSVG-MLQ----------EGMTYKQLSQ--DNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKST----NNNIEHQGL 304 (419) T ss_pred cccce-ecC----------CCceEEEcCC--ChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCc----cccHHHHHH Confidence 11222 121 2566666643 3333332 34566889999999998886432211 111222223 Q ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchh Q lcl|NC_018087. 400 SFDKF-ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKY 478 (520) Q Consensus 400 kF~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky 478 (520) -|.++ +.-+...+..- +-+.+.++.++. ...+.|.-+.. .. .-+..|++.++.+-.- -+ T Consensus 305 ~f~~~~l~P~~~~ie~~---------l~~~ll~~~~~~----~~~i~fd~~~l----l~-~d~~~~~~~~~~~~~~--G~ 364 (419) T protein:vir:57 305 QYVIYTMLAILKRHESA---------MMRDLLLPSERR----DFYIEFNVSSL----LR-GDQKSRYESYALGRQW--GW 364 (419) T ss_pred HHHHHHHHHHHHHHHHH---------HHhhccCccccC----CeEEEEechhh----hc-cCHHHHHHHHHHHHhC--CC Confidence 35444 33333333222 333455655543 24455543332 11 1234566666654322 36 Q ss_pred hhHHHHHHHHhCCCHHHHHHHHH-----------HHHHhhhcCCccCCccccC Q lcl|NC_018087. 479 ISNHTAMKDFLQMSDEDIAAERK-----------LIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 479 ~S~~~i~k~IL~~tDeeI~~~~k-----------qi~~E~~~~~~~~p~~e~~ 520 (520) ++.+-++. ++++.+-+ .-++ +++...+..--+.|+.+-+ T Consensus 365 ~T~NE~R~-~~gl~p~~--ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~ 414 (419) T protein:vir:57 365 LSVNDIRR-MENLTPIP--GGDKYLTPLNMVDSKALTGIGKATPQQLKDIEAI 414 (419) T ss_pred cCHHHHHH-HhCCCCCC--CcCeeeeccccccccccccccCCCcccCcchhhh Confidence 78888874 57776421 1111 1111111111122233333 No 51 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=98.06 E-value=5.6e-06 Score=49.39 Aligned_cols=403 Identities=14% Similarity=0.108 Sum_probs=186.0 Q ss_pred hhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccc-cccccccchhHHHHHHHHHHHhhccchhH Q lcl|NC_018087. 11 MFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKL-YGSQDPTATSTRELINTYRSLLNNYEVDN 89 (520) Q Consensus 11 ~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~-~~~~~~~~~~~~~LI~~YR~ma~~pEvd~ 89 (520) ||.||...+..+ ...+|....+...+.. +.+..+.+. ..|. .+. -...+++|.|.. T Consensus 1 ~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~---~~~~~~~g~~~~g~--~v~--------~~~al~~~~V~~ 57 (454) T protein:vir:93 1 MWNLLRRTRKNQ----------KSGRDVREAGWTSLFQ---AVAEPFAGAWQQGV--KAD--------PEAVLSFHAVFA 57 (454) T ss_pred CCCccccCcccc----------cccccccchhhhhhhh---hhhhhhcchhhcCc--ccC--------hHHhhccHHHHH Confidence 888887754432 1111111111111100 000111110 0010 111 124567899999 Q ss_pred HHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHH-HHHHHHHHhcchhhhHH----HHHhhccccceeEEEeeecC Q lcl|NC_018087. 90 AVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISD-EFNSVLNMLNFQRKGSD----HFKRWYVDSRVFFHKIINPN 164 (520) Q Consensus 90 Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~e-eF~~i~~ll~f~k~g~~----~fRrWYvDgri~~hkvid~~ 164 (520) ||+-|.+.+.-. |+.|-=... +..++.+.+ .+..++.-=|-..++.+ ++..+.+.|--|.-++-+.+ T Consensus 58 ~v~~Ia~~iA~l-----p~~~~~~~~---~g~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~ 129 (454) T protein:vir:93 58 CISLISQDIAKM-----RLRLMQTDA---QGIRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNAR 129 (454) T ss_pred HHHHHHHhhccC-----ceEEEEecc---CCccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCC Confidence 999999877644 333211111 111111211 11112222233344555 34457888999998886632 Q ss_pred CCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccccC Q lcl|NC_018087. 165 RPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDC 244 (520) Q Consensus 165 ~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~ 244 (520) .-+.+|.+|+|..++.++. .+|.- +|.|.... ..+.+..+.++.+.|+|+--+. .. T Consensus 130 ---G~~~~L~~i~~~~v~v~~~-----~~g~~-------~y~~~~~~--------~~~~~~~~~~~~~eViH~k~~~-~~ 185 (454) T protein:vir:93 130 ---GQIKELRILDWNRVEPLVA-----DDGEV-------FYRITPDR--------NCGITEAVTVPAREVIHDRFNC-FF 185 (454) T ss_pred ---CcEEEEEEEcCcceEEEEc-----CCCcE-------EEEEEecc--------ccccceeEEecCcceEEeccCC-CC Confidence 2389999999999988532 22221 23332211 1112234678999999885443 44 Q ss_pred CCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEee-cCCCccccc Q lcl|NC_018087. 245 CGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYD-ARTGKVKNQ 323 (520) Q Consensus 245 ~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd-~~TGev~d~ 323 (520) ++...+|-+..|.+.+.....+++...=+=---+--+-+..++ |.|.+..+++- ...++.. |. ..+|. T Consensus 186 ~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~----~~~~~~~--~~g~n~g~---- 254 (454) T protein:vir:93 186 HPLIGLPPVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIP-GSITEENAKKL----KSNWDSG--YTGENAGK---- 254 (454) T ss_pred CCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecC-CCCCHHHHHHH----HHHHHHH--hcccccCC---- Confidence 5656789999999999999999987653322223345666666 56665544432 2223221 11 11222 Q ss_pred cccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHH Q lcl|NC_018087. 324 ANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFD 402 (520) Q Consensus 324 ~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~ 402 (520) .+ +++ + |.+++.|.=...-.| ++-.++....+.++++||...|...++. .+ +.+.-....|. T Consensus 255 --~~-vl~-------~---g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~-t~---sn~e~~~~~f~ 317 (454) T protein:vir:93 255 --TA-ILS-------N---GAKYNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPP-SS---DNVEALEQQYY 317 (454) T ss_pred --ce-ecc-------C---CceEEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCC-cc---hhHHHHHHHHH Confidence 11 221 1 344544432112122 3344466789999999999988643221 11 11222222343 Q ss_pred H-HHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhH Q lcl|NC_018087. 403 K-FISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISN 481 (520) Q Consensus 403 K-FI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~ 481 (520) + -|.-+..++..-+...| .+..+ ..+.|.-+ ++...+ +..|++.+..+-.- -+++. T Consensus 318 ~~~l~P~~~~ie~~ln~~L---------~~~~~-------~~~~f~~~----~ll~~D-~~~r~~~~~~~~~~--G~~T~ 374 (454) T protein:vir:93 318 SQCLQTLIESIELLLDEAL---------ETGEN-------ESTEFDVT----TLLRMD-SERRMKTLGDAVKN--TLLTP 374 (454) T ss_pred HHHHHHHHHHHHHHHHHhh---------cCCCC-------cEEEeech----hhhccC-HHHHHHHHHHHHhC--CCcCH Confidence 3 24444455544444322 12222 13344322 222221 24566666655322 35677 Q ss_pred HHHHHHHhCCCHHH-------------HHHHHHHHHHhhhcC----CccCC--cccc------C Q lcl|NC_018087. 482 HTAMKDFLQMSDED-------------IAAERKLIDEELSDK----IFNPP--EPEE------I 520 (520) Q Consensus 482 ~~i~k~IL~~tDee-------------I~~~~kqi~~E~~~~----~~~~p--~~e~------~ 520 (520) +-++. .+.+.+-+ +....++-..+.+.. ..++| .++. - T Consensus 375 NE~R~-~~gl~pi~ggD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~ 437 (454) T protein:vir:93 375 NEARK-RENLPPLAGGDALYLQQQNYSLEALSRRDAREDPFASSGKTASVPQAVAASDGNKAIT 437 (454) T ss_pred HHHHH-HhCCCCCCCCCeeeeccCccchHhhhccCcccCCCCCCccCCCCCCCCCCCCCCCCcc Confidence 77774 46665422 111111111111000 00000 0000 0 No 52 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=98.02 E-value=7e-06 Score=48.85 Aligned_cols=406 Identities=10% Similarity=0.131 Sum_probs=195.4 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeeccccccccccc-ccccccccccchhHHHHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVF-QKLYGSQDPTATSTRELINTYR 79 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~-~~~~~~~~~~~~~~~~LI~~YR 79 (520) |++ +|.+.++|+|-.+. .. ...+.++ ++..-+ .+.|.. .+.+ + + -+ T Consensus 1 M~~-~~r~~~~~~~~~r~-~~--------~~~~~~~----~~~~~~-----~~~g~~~~~~~------v-~-------~~ 47 (432) T protein:vir:10 1 MKI-VDSVKKFFNFEKRQ-TS--------QVIELNK----DDEKLL-----EWLGISPSTIS------V-K-------GK 47 (432) T ss_pred CCh-HHHHHHhcCccccC-cc--------cccccCC----chHHHH-----HHhCCCcCccc------c-c-------hh Confidence 553 45666677754221 11 1111111 100000 111111 0111 1 1 12 Q ss_pred HHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHH-hcchhhhHHHHHh----hccccc Q lcl|NC_018087. 80 SLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNM-LNFQRKGSDHFKR----WYVDSR 154 (520) Q Consensus 80 ~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~l-l~f~k~g~~~fRr----WYvDgr 154 (520) ..+++|.|.+||+-|.+.+.-. |+.|.-+.. ...+.........+|+. =|-..++.++++. +.+.|- T Consensus 48 ~al~~~~v~~~i~~ia~~ia~l-----p~~~~~~~~---~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn 119 (432) T protein:vir:10 48 NALKVATVFACIKILSESVSKL-----PLKIYQEDE---YGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGN 119 (432) T ss_pred hhhccHHHHHHHHHHHHhhccC-----ceEEEEecC---CceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCC Confidence 3467899999999998877643 222211110 00111111222333321 2223445554444 566799 Q ss_pred eeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccE Q lcl|NC_018087. 155 VFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAM 234 (520) Q Consensus 155 i~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI 234 (520) -|+.++-|. + .-+++|.+|+|..++.+++-. +... .....||.++. .+..+.++++.| T Consensus 120 ay~~i~r~~-~--G~~~~L~~i~~~~v~v~~d~~-----~~~~-~~~~~~y~~~~-------------~g~~~~~~~~ei 177 (432) T protein:vir:10 120 SYANIEFDR-K--GKVQALWPIDASKVTVYIDDV-----GLLN-SKTKMWYVVNT-------------GGQQRVLKPEEI 177 (432) T ss_pred eEEEEEECC-C--CcEEEEEEEcCceeEEEEcCc-----cccc-ccceEEEEEec-------------CCeEEEEccccE Confidence 999988662 2 238999999999998864322 1111 11122333322 123467899999 Q ss_pred EEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEee Q lcl|NC_018087. 235 VYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYD 314 (520) Q Consensus 235 ~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd 314 (520) +|..-+ ...++...+|.|..|++++.....+++...=+----+.-+-+..++ +.|.+..+++..+.+...|.. T Consensus 178 ih~r~~-~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g----- 250 (432) T protein:vir:10 178 LHFKNG-ITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-GDLNEDAKKVFRENFESMSSG----- 250 (432) T ss_pred EEecCC-CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhcc----- Confidence 988533 2456666789999999999999888887766655445556777776 467766666555544444432 Q ss_pred cCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh---HHHHHHHHHHHHHhcCCChhhccCCCcccccccc Q lcl|NC_018087. 315 ARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE---MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMS 391 (520) Q Consensus 315 ~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~ 391 (520) ++|..+.+ .++ .|.++..|. .+..+ ++-.++..+.+.++++||.+-|...++.. . T Consensus 251 -----~~n~~~~~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~----~ 308 (432) T protein:vir:10 251 -----LQNSHRIA-LMP----------VGYQFQPIS--LNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT----L 308 (432) T ss_pred -----cccCCcce-ecC----------CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC----c Confidence 11111222 221 245566653 22233 33445778999999999998885322211 1 Q ss_pred chhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 392 TAISRDELSFDKF-ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSL 470 (520) Q Consensus 392 ~eItRDElkF~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~ 470 (520) +.+.-.-..|.+. |.-+-.++..-| -+.++++.+|. ..+.+.|..+ ++...+ +..|++++.. T Consensus 309 s~~e~~~~~~~~~~l~P~~~~ie~~l---------n~kLl~~~~~~---~g~~~~fd~~----~l~~~d-~~~~~~~~~~ 371 (432) T protein:vir:10 309 NNIEQQQQQFYTDTLQATLTMYEQEM---------TYKLFLDSELD---KGFYSKFNVD----AILRAD-IKTRYEAYRT 371 (432) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHH---------HHhhcChhhcC---CCcEEEeech----hhhcCC-HHHHHHHHHH Confidence 1121122224332 222333332222 22345666554 2234455422 222211 2446666655 Q ss_pred hhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHH-------HhhhcCCccC-CccccC Q lcl|NC_018087. 471 MEPYIGKYISNHTAMKDFLQMSDEDIAAERKLID-------EELSDKIFNP-PEPEEI 520 (520) Q Consensus 471 ~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~-------~E~~~~~~~~-p~~e~~ 520 (520) +-.- -+++.+-+++ ++++.+.+ .-++-+- ++..+...+. -+.++. T Consensus 372 ~~~~--G~~t~NE~R~-~~g~~pi~--ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~~ 424 (432) T protein:vir:10 372 GIQG--GFLKPNEARS-KEDLPPEA--GGDRLLVNGNMLPIDMAGQAYLKGGDTNGEV 424 (432) T ss_pred HHhC--CCcCHHHHHH-HhCCCCCC--CCCeEeecccccchhhccccccCCCCCCCCC Confidence 5332 3678888874 47775421 1110000 0000000000 011111 No 53 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=98.02 E-value=7e-06 Score=48.85 Aligned_cols=406 Identities=10% Similarity=0.131 Sum_probs=195.4 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeeccccccccccc-ccccccccccchhHHHHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVF-QKLYGSQDPTATSTRELINTYR 79 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~-~~~~~~~~~~~~~~~~LI~~YR 79 (520) |++ +|.+.++|+|-.+. .. ...+.++ ++..-+ .+.|.. .+.+ + + -+ T Consensus 1 M~~-~~r~~~~~~~~~r~-~~--------~~~~~~~----~~~~~~-----~~~g~~~~~~~------v-~-------~~ 47 (432) T protein:vir:10 1 MKI-VDSVKKFFNFEKRQ-TS--------QVIELNK----DDEKLL-----EWLGISPSTIS------V-K-------GK 47 (432) T ss_pred CCh-HHHHHHhcCccccC-cc--------cccccCC----chHHHH-----HHhCCCcCccc------c-c-------hh Confidence 553 45666677754221 11 1111111 100000 111111 0111 1 1 12 Q ss_pred HHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHH-hcchhhhHHHHHh----hccccc Q lcl|NC_018087. 80 SLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNM-LNFQRKGSDHFKR----WYVDSR 154 (520) Q Consensus 80 ~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~l-l~f~k~g~~~fRr----WYvDgr 154 (520) ..+++|.|.+||+-|.+.+.-. |+.|.-+.. ...+.........+|+. =|-..++.++++. +.+.|- T Consensus 48 ~al~~~~v~~~i~~ia~~ia~l-----p~~~~~~~~---~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn 119 (432) T protein:vir:10 48 NALKVATVFACIKILSESVSKL-----PLKIYQEDE---YGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGN 119 (432) T ss_pred hhhccHHHHHHHHHHHHhhccC-----ceEEEEecC---CceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCC Confidence 3467899999999998877643 222211110 00111111222333321 2223445554444 566799 Q ss_pred eeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccE Q lcl|NC_018087. 155 VFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAM 234 (520) Q Consensus 155 i~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI 234 (520) -|+.++-|. + .-+++|.+|+|..++.+++-. +... .....||.++. .+..+.++++.| T Consensus 120 ay~~i~r~~-~--G~~~~L~~i~~~~v~v~~d~~-----~~~~-~~~~~~y~~~~-------------~g~~~~~~~~ei 177 (432) T protein:vir:10 120 SYANIEFDR-K--GKVQALWPIDASKVTVYIDDV-----GLLN-SKTKMWYVVNT-------------GGQQRVLKPEEI 177 (432) T ss_pred eEEEEEECC-C--CcEEEEEEEcCceeEEEEcCc-----cccc-ccceEEEEEec-------------CCeEEEEccccE Confidence 999988662 2 238999999999998864322 1111 11122333322 123467899999 Q ss_pred EEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEee Q lcl|NC_018087. 235 VYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYD 314 (520) Q Consensus 235 ~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd 314 (520) +|..-+ ...++...+|.|..|++++.....+++...=+----+.-+-+..++ +.|.+..+++..+.+...|.. T Consensus 178 ih~r~~-~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g----- 250 (432) T protein:vir:10 178 LHFKNG-ITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-GDLNEDAKKVFRENFESMSSG----- 250 (432) T ss_pred EEecCC-CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhcc----- Confidence 988533 2456666789999999999999888887766655445556777776 467766666555544444432 Q ss_pred cCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh---HHHHHHHHHHHHHhcCCChhhccCCCcccccccc Q lcl|NC_018087. 315 ARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE---MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMS 391 (520) Q Consensus 315 ~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~ 391 (520) ++|..+.+ .++ .|.++..|. .+..+ ++-.++..+.+.++++||.+-|...++.. . T Consensus 251 -----~~n~~~~~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~----~ 308 (432) T protein:vir:10 251 -----LQNSHRIA-LMP----------VGYQFQPIS--LNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT----L 308 (432) T ss_pred -----cccCCcce-ecC----------CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC----c Confidence 11111222 221 245566653 22233 33445778999999999998885322211 1 Q ss_pred chhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 392 TAISRDELSFDKF-ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSL 470 (520) Q Consensus 392 ~eItRDElkF~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~ 470 (520) +.+.-.-..|.+. |.-+-.++..-| -+.++++.+|. ..+.+.|..+ ++...+ +..|++++.. T Consensus 309 s~~e~~~~~~~~~~l~P~~~~ie~~l---------n~kLl~~~~~~---~g~~~~fd~~----~l~~~d-~~~~~~~~~~ 371 (432) T protein:vir:10 309 NNIEQQQQQFYTDTLQATLTMYEQEM---------TYKLFLDSELD---KGFYSKFNVD----AILRAD-IKTRYEAYRT 371 (432) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHH---------HHhhcChhhcC---CCcEEEeech----hhhcCC-HHHHHHHHHH Confidence 1121122224332 222333332222 22345666554 2234455422 222211 2446666655 Q ss_pred hhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHH-------HhhhcCCccC-CccccC Q lcl|NC_018087. 471 MEPYIGKYISNHTAMKDFLQMSDEDIAAERKLID-------EELSDKIFNP-PEPEEI 520 (520) Q Consensus 471 ~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~-------~E~~~~~~~~-p~~e~~ 520 (520) +-.- -+++.+-+++ ++++.+.+ .-++-+- ++..+...+. -+.++. T Consensus 372 ~~~~--G~~t~NE~R~-~~g~~pi~--ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~~ 424 (432) T protein:vir:10 372 GIQG--GFLKPNEARS-KEDLPPEA--GGDRLLVNGNMLPIDMAGQAYLKGGDTNGEV 424 (432) T ss_pred HHhC--CCcCHHHHHH-HhCCCCCC--CCCeEeecccccchhhccccccCCCCCCCCC Confidence 5332 3678888874 47775421 1110000 0000000000 011111 No 54 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=98.02 E-value=7e-06 Score=48.85 Aligned_cols=406 Identities=10% Similarity=0.131 Sum_probs=195.4 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeeccccccccccc-ccccccccccchhHHHHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVF-QKLYGSQDPTATSTRELINTYR 79 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~-~~~~~~~~~~~~~~~~LI~~YR 79 (520) |++ +|.+.++|+|-.+. .. ...+.++ ++..-+ .+.|.. .+.+ + + -+ T Consensus 1 M~~-~~r~~~~~~~~~r~-~~--------~~~~~~~----~~~~~~-----~~~g~~~~~~~------v-~-------~~ 47 (432) T protein:vir:10 1 MKI-VDSVKKFFNFEKRQ-TS--------QVIELNK----DDEKLL-----EWLGISPSTIS------V-K-------GK 47 (432) T ss_pred CCh-HHHHHHhcCccccC-cc--------cccccCC----chHHHH-----HHhCCCcCccc------c-c-------hh Confidence 553 45666677754221 11 1111111 100000 111111 0111 1 1 12 Q ss_pred HHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHH-hcchhhhHHHHHh----hccccc Q lcl|NC_018087. 80 SLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNM-LNFQRKGSDHFKR----WYVDSR 154 (520) Q Consensus 80 ~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~l-l~f~k~g~~~fRr----WYvDgr 154 (520) ..+++|.|.+||+-|.+.+.-. |+.|.-+.. ...+.........+|+. =|-..++.++++. +.+.|- T Consensus 48 ~al~~~~v~~~i~~ia~~ia~l-----p~~~~~~~~---~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn 119 (432) T protein:vir:10 48 NALKVATVFACIKILSESVSKL-----PLKIYQEDE---YGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGN 119 (432) T ss_pred hhhccHHHHHHHHHHHHhhccC-----ceEEEEecC---CceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCC Confidence 3467899999999998877643 222211110 00111111222333321 2223445554444 566799 Q ss_pred eeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccE Q lcl|NC_018087. 155 VFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAM 234 (520) Q Consensus 155 i~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI 234 (520) -|+.++-|. + .-+++|.+|+|..++.+++-. +... .....||.++. .+..+.++++.| T Consensus 120 ay~~i~r~~-~--G~~~~L~~i~~~~v~v~~d~~-----~~~~-~~~~~~y~~~~-------------~g~~~~~~~~ei 177 (432) T protein:vir:10 120 SYANIEFDR-K--GKVQALWPIDASKVTVYIDDV-----GLLN-SKTKMWYVVNT-------------GGQQRVLKPEEI 177 (432) T ss_pred eEEEEEECC-C--CcEEEEEEEcCceeEEEEcCc-----cccc-ccceEEEEEec-------------CCeEEEEccccE Confidence 999988662 2 238999999999998864322 1111 11122333322 123467899999 Q ss_pred EEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEee Q lcl|NC_018087. 235 VYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYD 314 (520) Q Consensus 235 ~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd 314 (520) +|..-+ ...++...+|.|..|++++.....+++...=+----+.-+-+..++ +.|.+..+++..+.+...|.. T Consensus 178 ih~r~~-~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g----- 250 (432) T protein:vir:10 178 LHFKNG-ITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-GDLNEDAKKVFRENFESMSSG----- 250 (432) T ss_pred EEecCC-CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhcc----- Confidence 988533 2456666789999999999999888887766655445556777776 467766666555544444432 Q ss_pred cCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh---HHHHHHHHHHHHHhcCCChhhccCCCcccccccc Q lcl|NC_018087. 315 ARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE---MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMS 391 (520) Q Consensus 315 ~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~ 391 (520) ++|..+.+ .++ .|.++..|. .+..+ ++-.++..+.+.++++||.+-|...++.. . T Consensus 251 -----~~n~~~~~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~----~ 308 (432) T protein:vir:10 251 -----LQNSHRIA-LMP----------VGYQFQPIS--LNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT----L 308 (432) T ss_pred -----cccCCcce-ecC----------CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC----c Confidence 11111222 221 245566653 22233 33445778999999999998885322211 1 Q ss_pred chhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 392 TAISRDELSFDKF-ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSL 470 (520) Q Consensus 392 ~eItRDElkF~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~ 470 (520) +.+.-.-..|.+. |.-+-.++..-| -+.++++.+|. ..+.+.|..+ ++...+ +..|++++.. T Consensus 309 s~~e~~~~~~~~~~l~P~~~~ie~~l---------n~kLl~~~~~~---~g~~~~fd~~----~l~~~d-~~~~~~~~~~ 371 (432) T protein:vir:10 309 NNIEQQQQQFYTDTLQATLTMYEQEM---------TYKLFLDSELD---KGFYSKFNVD----AILRAD-IKTRYEAYRT 371 (432) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHH---------HHhhcChhhcC---CCcEEEeech----hhhcCC-HHHHHHHHHH Confidence 1121122224332 222333332222 22345666554 2234455422 222211 2446666655 Q ss_pred hhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHH-------HhhhcCCccC-CccccC Q lcl|NC_018087. 471 MEPYIGKYISNHTAMKDFLQMSDEDIAAERKLID-------EELSDKIFNP-PEPEEI 520 (520) Q Consensus 471 ~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~-------~E~~~~~~~~-p~~e~~ 520 (520) +-.- -+++.+-+++ ++++.+.+ .-++-+- ++..+...+. -+.++. T Consensus 372 ~~~~--G~~t~NE~R~-~~g~~pi~--ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~~ 424 (432) T protein:vir:10 372 GIQG--GFLKPNEARS-KEDLPPEA--GGDRLLVNGNMLPIDMAGQAYLKGGDTNGEV 424 (432) T ss_pred HHhC--CCcCHHHHHH-HhCCCCCC--CCCeEeecccccchhhccccccCCCCCCCCC Confidence 5332 3678888874 47775421 1110000 0000000000 011111 No 55 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=97.99 E-value=7.9e-06 Score=48.54 Aligned_cols=443 Identities=12% Similarity=0.173 Sum_probs=203.0 Q ss_pred Cccccccchhhhcchhhhhh-hHHHhhhccCCCcccCCCCCCCceeecccccccccccccc-cccccccchhHHHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDD-TEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKL-YGSQDPTATSTRELINTY 78 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~-~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~-~~~~~~~~~~~~~LI~~Y 78 (520) |+|+ +.+-++|+--..+-- +.+.+......+++ +| +-...|..+..-|.|-+... |.+..+... .+.+ T Consensus 1 m~~~-~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~-~~---~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~-----~~~~ 70 (500) T protein:vir:30 1 MGVI-QKIKNLVTRSKYVMTTQSLTNITDHPKIAI-SK---LEYDRITTNLKYYKSDWDSVLYLNTDGETK-----KRDL 70 (500) T ss_pred CchH-HHHHHHHHHHHHHhhcchhhhhhccccccC-CH---HHHHHHHHHHHHhcCCCCCcccccCCCCcc-----cCce Confidence 6653 333333331111100 11111111111111 11 11111111111122221111 111111110 1111 Q ss_pred HHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEE Q lcl|NC_018087. 79 RSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFH 158 (520) Q Consensus 79 R~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~h 158 (520) .+| +--..++++.++= +. .+++++.+++. ...+..+.+++--+|.+...+.+....+-|..+|+ T Consensus 71 ~sl---nl~~~i~~~~A~l--v~---~e~~~i~~~d~--------~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k 134 (500) T protein:vir:30 71 NHL---PIARTAAKKIASL--VF---NEQAEIKVDDD--------AANEFISETLKNDRFNKNFERYLESCLALGGLAMR 134 (500) T ss_pred eec---chHHHHHHHHhhh--hc---CCcceEecCCh--------HHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEE Confidence 111 1112222222221 11 14455666543 34456677777788999999999999999999999 Q ss_pred EeeecCCCCCCeeeeEecCccceeeeeeccCCCCccccc---c---cce------ecceeecCccccccccccee----- Q lcl|NC_018087. 159 KIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKV---V---KGY------REYFLYDTELESYQCGHQHF----- 221 (520) Q Consensus 159 kvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~---~---~~~------~ey~~y~~~~~~~~~~~~~~----- 221 (520) ..+|..+ ..+.+++|.++-+++--........-. . ++- .||+...+. ..|..-...| T Consensus 135 ~~~d~~~-----~~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~-~~~~I~n~ly~~~~~ 208 (500) T protein:vir:30 135 PYVDGDK-----VRVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSS-DDYVISNELYRSDDK 208 (500) T ss_pred EEEeCCc-----eEEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCC-ceeEEEEEEEecccc Confidence 9998432 347889999988863311111100000 0 000 122211111 1111111111 Q ss_pred -cCCcceec---C---cccEEEeecc--c---c--------cCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccc Q lcl|NC_018087. 222 -AAGTKIKI---P---YSAMVYAHSG--L---V--------DCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDR 281 (520) Q Consensus 222 -~~~~~~~I---~---~~aI~y~hSG--L---~--------d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeR 281 (520) .-|.++.+ + ++.+++.+.. | + ++.-+..+|-++.|.-.+..|-..-+.+. |..|.=++ T Consensus 209 ~~lG~~v~l~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~--~e~~~g~~ 286 (500) T protein:vir:30 209 AKVGSRVPLSEVYKDLKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFM--WEVKMGQR 286 (500) T ss_pred cccCcccccccccCCcCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHH--HHHHhCcc Confidence 11111110 0 1122222111 0 1 11222356888999888888877777665 78888777 Q ss_pred eEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccc-------cccccchhhhhhcccccCCCCCcceeecCCCCC Q lcl|NC_018087. 282 RVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVK-------NQANMMALTEDYWLQRRDGKAVTEVETLPGMTG 354 (520) Q Consensus 282 RvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~-------d~~~~msmlEDywLpRReGgrgTEIsTLpGg~n 354 (520) |+|. +..=++ ...+..+|+.- +++.+..|- =-+++ +.-|+.+...=- T Consensus 287 ~i~v-~~~~l~------------------~~~~~~~g~~~~~~~~d~~~~~~~~~~------~~~~~-~~~i~~~~~~ir 340 (500) T protein:vir:30 287 RVAV-PESLTA------------------LTVRTTDGDVVPRPRFESDQNVYIRMG------GRDLD-SSAIQDLTTPIR 340 (500) T ss_pred eeee-chHHhc------------------ccCCCCCccccCCcccCCCcceEEEcC------CCCCc-CcceeEeccccC Confidence 7765 211100 01111222210 111122110 00111 123555543222 Q ss_pred cCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh------- Q lcl|NC_018087. 355 MNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLL------- 426 (520) Q Consensus 355 Lge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiL------- 426 (520) ..+ ..-+.++.+.+=.+.+++-+.|..++++. --++||.-.+-.-..-+.+.|+.|...+.++++.=|-| T Consensus 341 ~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~--~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~ 418 (500) T protein:vir:30 341 ADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSM--KTATEIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLY 418 (500) T ss_pred hHHHHHHHHHHHHHHHHHhCCCccccccCcCcc--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 222 23355667777777888888887665422 23456654444455566677777777777766665433 Q ss_pred cCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHh Q lcl|NC_018087. 427 KRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEE 506 (520) Q Consensus 427 kgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E 506 (520) .+... .+ ..+.++|. |+.+.. +++++ +..+.+++ . | .+|.++.+++...+||+|.+++.++|++| T Consensus 419 ~~~~~-~~-----~~v~v~f~-d~i~~d-~~~~~-~~~~~~v~--a---G-i~s~~~~i~~~~g~~eeea~~~l~~i~~E 483 (500) T protein:vir:30 419 QSEVP-SM-----DNISISLD-DGVFTD-RDAEL-DYWIKVVN--A---G-FGTREMAIQKVLNVTEEKAQEIAAEINTG 483 (500) T ss_pred CCCCC-CC-----cceEEEeC-CCCCCC-HHHHH-HHHHHHHH--c---C-CCCHHHHHHhcCCCCHHHHHHHHHHHHHh Confidence 22222 21 34788885 444433 22221 11111111 1 2 36888877777899999999999999999 Q ss_pred hhcCCccCCccccC Q lcl|NC_018087. 507 LSDKIFNPPEPEEI 520 (520) Q Consensus 507 ~~~~~~~~p~~e~~ 520 (520) .....-...++.+| T Consensus 484 ~~~~~~~~~~~~~~ 497 (500) T protein:vir:30 484 IVDEINQQRTDTHL 497 (500) T ss_pred ccccCCCCCccccc Confidence 87665444455566 No 56 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=97.99 E-value=7.9e-06 Score=48.54 Aligned_cols=443 Identities=12% Similarity=0.173 Sum_probs=203.0 Q ss_pred Cccccccchhhhcchhhhhh-hHHHhhhccCCCcccCCCCCCCceeecccccccccccccc-cccccccchhHHHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDD-TEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKL-YGSQDPTATSTRELINTY 78 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~-~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~-~~~~~~~~~~~~~LI~~Y 78 (520) |+|+ +.+-++|+--..+-- +.+.+......+++ +| +-...|..+..-|.|-+... |.+..+... .+.+ T Consensus 1 m~~~-~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~-~~---~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~-----~~~~ 70 (500) T protein:vir:98 1 MGVI-QKIKNLVTRSKYVMTTQSLTNITDHPKIAI-SK---LEYDRITTNLKYYKSDWDSVLYLNTDGETK-----KRDL 70 (500) T ss_pred CchH-HHHHHHHHHHHHHhhcchhhhhhccccccC-CH---HHHHHHHHHHHHhcCCCCCcccccCCCCcc-----cCce Confidence 6653 333333331111100 11111111111111 11 11111111111122221111 111111110 1111 Q ss_pred HHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEE Q lcl|NC_018087. 79 RSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFH 158 (520) Q Consensus 79 R~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~h 158 (520) .+| +--..++++.++= +. .+++++.+++. ...+..+.+++--+|.+...+.+....+-|..+|+ T Consensus 71 ~sl---nl~~~i~~~~A~l--v~---~e~~~i~~~d~--------~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k 134 (500) T protein:vir:98 71 NHL---PIARTAAKKIASL--VF---NEQAEIKVDDD--------AANEFISETLKNDRFNKNFERYLESCLALGGLAMR 134 (500) T ss_pred eec---chHHHHHHHHhhh--hc---CCcceEecCCh--------HHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEE Confidence 111 1112222222221 11 14455666543 34456677777788999999999999999999999 Q ss_pred EeeecCCCCCCeeeeEecCccceeeeeeccCCCCccccc---c---cce------ecceeecCccccccccccee----- Q lcl|NC_018087. 159 KIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKV---V---KGY------REYFLYDTELESYQCGHQHF----- 221 (520) Q Consensus 159 kvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~---~---~~~------~ey~~y~~~~~~~~~~~~~~----- 221 (520) ..+|..+ ..+.+++|.++-+++--........-. . ++- .||+...+. ..|..-...| T Consensus 135 ~~~d~~~-----~~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~-~~~~I~n~ly~~~~~ 208 (500) T protein:vir:98 135 PYVDGDK-----VRVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSS-DDYVISNELYRSDDK 208 (500) T ss_pred EEEeCCc-----eEEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCC-ceeEEEEEEEecccc Confidence 9998432 347889999988863311111100000 0 000 122211111 1111111111 Q ss_pred -cCCcceec---C---cccEEEeecc--c---c--------cCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccc Q lcl|NC_018087. 222 -AAGTKIKI---P---YSAMVYAHSG--L---V--------DCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDR 281 (520) Q Consensus 222 -~~~~~~~I---~---~~aI~y~hSG--L---~--------d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeR 281 (520) .-|.++.+ + ++.+++.+.. | + ++.-+..+|-++.|.-.+..|-..-+.+. |..|.=++ T Consensus 209 ~~lG~~v~l~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~--~e~~~g~~ 286 (500) T protein:vir:98 209 AKVGSRVPLSEVYKDLKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFM--WEVKMGQR 286 (500) T ss_pred cccCcccccccccCCcCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHH--HHHHhCcc Confidence 11111110 0 1122222111 0 1 11222356888999888888877777665 78888777 Q ss_pred eEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccc-------cccccchhhhhhcccccCCCCCcceeecCCCCC Q lcl|NC_018087. 282 RVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVK-------NQANMMALTEDYWLQRRDGKAVTEVETLPGMTG 354 (520) Q Consensus 282 RvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~-------d~~~~msmlEDywLpRReGgrgTEIsTLpGg~n 354 (520) |+|. +..=++ ...+..+|+.- +++.+..|- =-+++ +.-|+.+...=- T Consensus 287 ~i~v-~~~~l~------------------~~~~~~~g~~~~~~~~d~~~~~~~~~~------~~~~~-~~~i~~~~~~ir 340 (500) T protein:vir:98 287 RVAV-PESLTA------------------LTVRTTDGDVVPRPRFESDQNVYIRMG------GRDLD-SSAIQDLTTPIR 340 (500) T ss_pred eeee-chHHhc------------------ccCCCCCccccCCcccCCCcceEEEcC------CCCCc-CcceeEeccccC Confidence 7765 211100 01111222210 111122110 00111 123555543222 Q ss_pred cCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh------- Q lcl|NC_018087. 355 MNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLL------- 426 (520) Q Consensus 355 Lge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiL------- 426 (520) ..+ ..-+.++.+.+=.+.+++-+.|..++++. --++||.-.+-.-..-+.+.|+.|...+.++++.=|-| T Consensus 341 ~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~--~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~ 418 (500) T protein:vir:98 341 ADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSM--KTATEIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLY 418 (500) T ss_pred hHHHHHHHHHHHHHHHHHhCCCccccccCcCcc--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 222 23355667777777888888887665422 23456654444455566677777777777766665433 Q ss_pred cCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHh Q lcl|NC_018087. 427 KRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEE 506 (520) Q Consensus 427 kgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E 506 (520) .+... .+ ..+.++|. |+.+.. +++++ +..+.+++ . | .+|.++.+++...+||+|.+++.++|++| T Consensus 419 ~~~~~-~~-----~~v~v~f~-d~i~~d-~~~~~-~~~~~~v~--a---G-i~s~~~~i~~~~g~~eeea~~~l~~i~~E 483 (500) T protein:vir:98 419 QSEVP-SM-----DNISISLD-DGVFTD-RDAEL-DYWIKVVN--A---G-FGTREMAIQKVLNVTEEKAQEIAAEINTG 483 (500) T ss_pred CCCCC-CC-----cceEEEeC-CCCCCC-HHHHH-HHHHHHHH--c---C-CCCHHHHHHhcCCCCHHHHHHHHHHHHHh Confidence 22222 21 34788885 444433 22221 11111111 1 2 36888877777899999999999999999 Q ss_pred hhcCCccCCccccC Q lcl|NC_018087. 507 LSDKIFNPPEPEEI 520 (520) Q Consensus 507 ~~~~~~~~p~~e~~ 520 (520) .....-...++.+| T Consensus 484 ~~~~~~~~~~~~~~ 497 (500) T protein:vir:98 484 IVDEINQQRTDTHL 497 (500) T ss_pred ccccCCCCCccccc Confidence 87665444455566 No 57 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=97.96 E-value=9.2e-06 Score=48.20 Aligned_cols=426 Identities=10% Similarity=0.032 Sum_probs=188.5 Q ss_pred Ccccccc--chhhhcchhhhhhhHHHh-hhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHH Q lcl|NC_018087. 1 MSMLADS--DLKMFAFWHKVDDTEYDK-IINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINT 77 (520) Q Consensus 1 ~~~~~~~--~l~~f~~~~~~~~~~~~~-~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~ 77 (520) -.++.|+ +-...=|-.. ...++ ++..+--.-.||--.+--..-+. ...+.+-... . ..+-+++- T Consensus 50 ~~~~~~~~~~~~~~~~~~~---~~~kk~~i~~pfkkk~~~~~~d~f~~s~e-s~s~vtsls~---p------daf~~vnV 116 (945) T protein:vir:10 50 RALAWNSTVVYSIIIFRKN---QVLKKEKIIVPYNHQEPPFKFNLFEYSPE-SLMYLPSISD---P------DAFFLINL 116 (945) T ss_pred hhhhccceeeeeeeeehhh---hHHHhhcccccccccccchhhhhhhccCc-cceecccccC---c------cceeeehh Confidence 1111110 0011111111 10000 11111000011111100000000 0000000000 0 01112333 Q ss_pred HHHH-hhccchhHHHHhhhceeeEecCCCcEEEE--eeccchhhhHHHHHHHHHHHHHHHHh---cchhhhHH------- Q lcl|NC_018087. 78 YRSL-LNNYEVDNAVQEIVSDAIVYEEGFDVVSI--DLDQTAFTENIRNLISDEFNSVLNML---NFQRKGSD------- 144 (520) Q Consensus 78 YR~m-a~~pEvd~Ai~eIvneaiv~d~~~~~V~l--~Ld~~~~s~~ik~~I~eeF~~i~~ll---~f~k~g~~------- 144 (520) ++.+ +++|-|..||+-|.+.+.-. |+.+ ..++.......+. +. ....+..+| |=..++.+ T Consensus 117 s~~~AlknsaV~scI~~IA~sIAsL-----PlklYrr~edG~~~~~~kk-~~-~~hpL~~LL~rPNp~mT~~eFwqsFl~ 189 (945) T protein:vir:10 117 FRKYRFNNDSKLIKVSEIPKKLTSK-----ELEIYKHIEDKHVNYYLKR-IR-DARNILEFLERPDPYFSEVNSWEYLLG 189 (945) T ss_pred hhhhhhccHHHHHHHHHHHhhhccC-----ceEEEEecccCcccccccc-cc-cchHHHHHHhCCCcccChhHHHHHHHH Confidence 3444 67899999999999877533 2222 1112211111111 11 223344444 22334444 Q ss_pred -HHHhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecC Q lcl|NC_018087. 145 -HFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAA 223 (520) Q Consensus 145 -~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~ 223 (520) +++.+.+.|.-|+.++-| .+ ..+++|.+|||..+++++.- ++.. ..+|++.. + . T Consensus 190 ~Lv~dLLL~GNAYieIiRd-~~--G~ii~L~pLdPs~Vti~~dd-----DG~~-----~y~Yv~~i-------d-----G 244 (945) T protein:vir:10 190 MVLDDILTIDRGAIVKIRD-EQ--GNLVAITPVDGTTIKPILSE-----DTGI-----VVGYVQEV-------D-----G 244 (945) T ss_pred HHHHHHhhcCCeEEEEEEC-CC--CcEEEEEEECCcceEEEEcC-----CCcE-----EEEEEEec-------C-----C Confidence 446678889999998855 22 24889999999999886432 2211 01122111 1 1 Q ss_pred CcceecCcccEEEeecccccCCCC---cchhhhHHHHHHHHHHHHHHHH-HHHHHHhcCccceEEEccCCCCchH----- Q lcl|NC_018087. 224 GTKIKIPYSAMVYAHSGLVDCCGK---NIIGYLHRAVKPANQLKLLEDA-MMIYRITRAPDRRVFYIDTGNMPAR----- 294 (520) Q Consensus 224 ~~~~~I~~~aI~y~hSGL~d~~~~---~~~syL~~aik~~NqL~m~EDa-lVIyRi~RApeRRvFyIDvGnlpk~----- 294 (520) +....++.+.++|. .-...++|. ..+|-|..|.+.+.....+++. .-.|+--.|.-+-+..++.++.... T Consensus 245 ~~~~~v~a~DvIlh-irn~s~DG~~~GyGlSPIeaa~~aI~~alAaek~aar~FskNGa~PsGILsvkg~~~~d~k~~~~ 323 (945) T protein:vir:10 245 AIVAHFDKRDVVLF-RQNLTPDVYMYGYSLPPIEILYKVILSDIFIDKGNLDYYRKGGSIPEGILAIEPPSYKEGDIYPQ 323 (945) T ss_pred ceEEEecCCceEEE-eccCCCCcccccCCchHHHHHHHHHHHHHHHHHHHHHHHHhCCCccceEEEecCccccccccccc Confidence 12246677665543 222344433 3577899999888776666654 4444445577788999887653322 Q ss_pred ---HHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHH Q lcl|NC_018087. 295 ---KAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYM 370 (520) Q Consensus 295 ---KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~ 370 (520) ++.+-+++.+++... |. +....+ ++ ..|.+++.|.....-.| ++-..|..+...+ T Consensus 324 LseEq~erlKe~wee~~s--------G~--NnG~pi-VL----------deGmef~pLs~s~~DaQfLEsrkfs~eeIAr 382 (945) T protein:vir:10 324 LSREQLESIQRQLQAIMM--------GD--YTQVPI-LS----------GGKFTWIDFKGKRRDMQFKELAEFVARKICA 382 (945) T ss_pred cCHHHHHHHHHHHHHHhC--------Cc--ccccce-ec----------CCCceEEEccCChhHHHHHHHHHHHHHHHHH Confidence 222223333333211 21 111111 12 23677887754333333 3345566788999 Q ss_pred hcCCChhhccCCCccccccccchhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeecc Q lcl|NC_018087. 371 ALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKF-ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKN 449 (520) Q Consensus 371 aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~D 449 (520) +.+||.+.|...++.+ .+.+.-...-|..+ +..+..++...+...|. +.. ....+.++|..+ T Consensus 383 AFGVPP~lLG~~e~st----~SNiEqq~~~Fv~~tL~Pil~~IEqeLNrkLl---------~~~----eg~~i~fdFd~l 445 (945) T protein:vir:10 383 VYQVSPQDVGILEGSN----KATAEVMASLTKAKGLEPLMATISKGFDEVVS---------EFR----NEKDIKLWFKED 445 (945) T ss_pred HhCCCHHHcccCCCCC----cchHHHHHHHHHHHHHHHHHHHHHHHHHHhcc---------ccc----cCceeEEEecch Confidence 9999999996433211 12233334456544 67777777666554331 111 134578888777 Q ss_pred chHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHH----HH------HHHHHHHHhh------------ Q lcl|NC_018087. 450 SYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDED----IA------AERKLIDEEL------------ 507 (520) Q Consensus 450 n~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDee----I~------~~~kqi~~E~------------ 507 (520) ..-.. ..|.++++.+-.- -++|.+-+++. +++.+-+ .- ....+..+.. T Consensus 446 dl~D~-------ksraEal~kli~s--GiLTiNEvRe~-lGLpPIeGGD~lli~~nn~~P~d~~~ka~~ga~p~q~aq~~ 515 (945) T protein:vir:10 446 DLEKE-------RDWWNIIQGQLNT--GFRSINEARME-KGLEPVPWGDVPFSGLRNWKPEDEQAKAQQGAMPPQLAQAM 515 (945) T ss_pred hccCH-------HHHHHHHHHHHhC--CCcCHHHHHHH-hCCCCCCCcceeeeccccccccccccccccCCCCcccccCC Confidence 65433 3455555544221 25677777743 6554421 00 0000000000 Q ss_pred -hcCCccCCccccC Q lcl|NC_018087. 508 -SDKIFNPPEPEEI 520 (520) Q Consensus 508 -~~~~~~~p~~e~~ 520 (520) .++.-+.++++|= T Consensus 516 ~dqp~~kGGe~dEn 529 (945) T protein:vir:10 516 ADQPSQQGGGVDEN 529 (945) T ss_pred CCCCCCCCCCCCCC Confidence 0000000000000 No 58 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=97.95 E-value=9.5e-06 Score=48.13 Aligned_cols=404 Identities=13% Similarity=0.154 Sum_probs=188.6 Q ss_pred cccccchhhhcchhhhhhhHHHhhhccCCCcccCC--CCCCCcee-ecccccccccccccccccccccchhHHHHHHHHH Q lcl|NC_018087. 3 MLADSDLKMFAFWHKVDDTEYDKIINDKAESITAP--KFDDGATE-VDSQDIAYNGVFQKLYGSQDPTATSTRELINTYR 79 (520) Q Consensus 3 ~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p--~~~dg~~~-i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR 79 (520) |+++..+++|+.-. . .++|| .+..|..+ -+.+.+. ..+.....+....+ .-. T Consensus 1 ~~~~~~~g~~~~~~---~------------~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~g~~v--------~~~ 55 (432) T protein:vir:97 1 MPDEKKLGLLGQLK---A------------MFVPPDPVDIGGGQTFTPVNATA--RDLGIIISDTGAAV--------NAD 55 (432) T ss_pred CCCcccCchhhhhH---h------------hcCCccccccccccccccCchhh--hhhcccccccCccc--------chH Confidence 99999999997321 0 11111 11111111 1111000 00000000000111 112 Q ss_pred HHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHH-hcchhhhHHHHH----hhccccc Q lcl|NC_018087. 80 SLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNM-LNFQRKGSDHFK----RWYVDSR 154 (520) Q Consensus 80 ~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~l-l~f~k~g~~~fR----rWYvDgr 154 (520) ..+++|-|..||+-|.+.+.-+ |+.|--+.. +..++.+..-..++++. =|=..++.++.+ .+.+.|. T Consensus 56 ~a~~~~aV~~~v~~Ia~~ia~l-----p~~~y~~~~---~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gn 127 (432) T protein:vir:97 56 AIMRLDAVAACVKLVSQAVAAM-----PLMMYMRTP---DGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGT 127 (432) T ss_pred hhhcchHHHHHHHHHHHhhccC-----ceEEEEecC---CCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCC Confidence 3467899999999999877644 333322111 11111221122222211 122345555444 4677899 Q ss_pred eeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccE Q lcl|NC_018087. 155 VFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAM 234 (520) Q Consensus 155 i~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI 234 (520) -|..++-+ + ..+++|.+|+|..+..+++- +|.. -|.++.. .+..+.++.+.| T Consensus 128 ay~~~~~~--~--g~~~~L~~l~p~~v~v~~~~-----~g~~------~y~~~~~-------------~g~~~~~~~~~i 179 (432) T protein:vir:97 128 AYVRKVVT--D--GRIESLQYLANDRLTITTDT-----KGNT------AYRYRRT-------------DGQMIDIPRQQI 179 (432) T ss_pred eEEEEEec--C--CcEEEEEEEcCcceEEEEcC-----CCcE------EEEEEec-------------CceEEEEccccE Confidence 88888764 2 25899999999999886432 2211 1221111 122367899999 Q ss_pred EEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEee Q lcl|NC_018087. 235 VYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYD 314 (520) Q Consensus 235 ~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd 314 (520) +|.. + ...++...+|.|..|.+++.....+++...=+=---+--.-|...| +.|-+..+++ ++ .+|.. . T Consensus 180 ih~r-~-~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~-~~l~~e~~~~-~~---~~~~~--~-- 248 (432) T protein:vir:97 180 WKIM-G-YSLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQID-RFLTDDQYDS-FS---KKVSG--S-- 248 (432) T ss_pred EEec-C-cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecC-CCCCHHHHHH-HH---HHHhh--h-- Confidence 8884 2 3556667789999999999877777765433222222334455555 3454443333 22 22221 0 Q ss_pred cCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh---HHHHHHHHHHHHHhcCCChhhccCCCcccccccc Q lcl|NC_018087. 315 ARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE---MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMS 391 (520) Q Consensus 315 ~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~ 391 (520) ...|. .+ .++ + |.+++.|. .+..+ ++-.+|....+.++++||.+-|....... +..+ T Consensus 249 ~nag~------~~-vl~-------~---g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t-~~~~ 308 (432) T protein:vir:97 249 VEAGR------AP-LLE-------G---GMDVKSLG--LNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGT-TSWG 308 (432) T ss_pred hcCCC------ce-ecC-------C---CceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcc-cccc Confidence 01122 11 222 2 44555552 22233 33356788899999999999986432211 1222 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018087. 392 TAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLM 471 (520) Q Consensus 392 ~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~ 471 (520) +.+.-.-+.|.++ .|+--+.. +.+.| =.++.++.++. ...++|..+ .+...+ ...|.+.++.+ T Consensus 309 s~~e~~~~~f~~~--tl~P~~~~-ie~~l-----n~kLl~~~e~~----~~~~~fd~~----~llr~d-~~~r~~~~~~~ 371 (432) T protein:vir:97 309 SGIESQQLGFLTM--TLSPWLRR-IEQSI-----ALNLLTPAERR----RYFADFDTS----ALLRAD-SAARSSYYSQL 371 (432) T ss_pred hhHHHHHHHHHHH--HHHHHHHH-HHHHH-----hhhccCccccC----ceEEEeech----hhhccC-HHHHHHHHHHH Confidence 3333333445543 33332222 22222 22345555442 234555432 222222 35577777665 Q ss_pred hcccchhhhHHHHHHHHhCCCHHHHHHHHH------------HHHH-hhhcCC--ccCCccccC Q lcl|NC_018087. 472 EPYIGKYISNHTAMKDFLQMSDEDIAAERK------------LIDE-ELSDKI--FNPPEPEEI 520 (520) Q Consensus 472 ~p~vgky~S~~~i~k~IL~~tDeeI~~~~k------------qi~~-E~~~~~--~~~p~~e~~ 520 (520) -. +-++|.+-+++ .+.|...+ ..+. .+.+ -.+++- -.+.+..++ T Consensus 372 ~~--~G~~T~NE~R~-~~glpp~~--g~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~~~~~~~~ 430 (432) T protein:vir:97 372 VN--NGLMTRDEARE-IEGLPKLG--GNAAVLTVQSAMVPLDSIGLQASPEPASGLGNQQQDKV 430 (432) T ss_pred Hh--CCCCCHHHHHH-HhCCCCCC--CCcceEeecccccchhhhcccCCCCCCCCCCCcccccc Confidence 22 24678888874 47775432 1111 0000 000110 111111112 No 59 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=97.89 E-value=1.2e-05 Score=47.49 Aligned_cols=442 Identities=12% Similarity=0.118 Sum_probs=200.0 Q ss_pred Ccccccc--------chhhh---cchhhhhhhHHHhhhccCCCcc-cCCCCCCCceeecccccc----cccccccccccc Q lcl|NC_018087. 1 MSMLADS--------DLKMF---AFWHKVDDTEYDKIINDKAESI-TAPKFDDGATEVDSQDIA----YNGVFQKLYGSQ 64 (520) Q Consensus 1 ~~~~~~~--------~l~~f---~~~~~~~~~~~~~~~~~~~~s~-~~p~~~dg~~~i~~~~~a----~~g~~~~~~~~~ 64 (520) |.-..|+ +.+|+ .|.+...+.+ +..++.+.-+- ..-+..++...+.....+ +++++ +. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~ 74 (574) T protein:vir:80 1 MPKWLDKALGIEKSSIEETRNMENYKMHLREID-TNVVNNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGY-----KT 74 (574) T ss_pred CcchhhhhhccchhhHHHHHhhhhhccccchhh-hhhhhccCCCHHHHHHhHhhhcccccchhhhhccccccc-----cC Confidence 3222232 22222 2221111111 00111111110 011222222222222111 11222 22 Q ss_pred cccchhHHHHHHHHHHHhhccchhHHHHhhhceeeEe------cCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhc- Q lcl|NC_018087. 65 DPTATSTRELINTYRSLLNNYEVDNAVQEIVSDAIVY------EEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLN- 137 (520) Q Consensus 65 ~~~~~~~~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~------d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~- 137 (520) .+.+.+...|-+.-+..+..|-|..||+.+++-+..| +...=|..|.+.+..-... .+...+...+.++|. T Consensus 75 ~~~~~~~~~~~~~l~~~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~--~~~~~~~~~l~~ll~~ 152 (574) T protein:vir:80 75 KPSIRNSQDLHKTLKKFGNNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPT--SHDIANIKRIESFLEN 152 (574) T ss_pred cCccCCcccHHHHHHhhccChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCcc--chhhhhhhHHHHHHhc Confidence 2334455555555666778899999999998876544 2223344454443321110 112223334444442 Q ss_pred ----ch---hhhHHHH----HhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceeccee Q lcl|NC_018087. 138 ----FQ---RKGSDHF----KRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFL 206 (520) Q Consensus 138 ----f~---k~g~~~f----RrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~ 206 (520) ++ ....+++ +.+++-|--|+.++-|.. ..|.+|.+|||..|.+++........+.. -|+. T Consensus 153 ~~~~~nP~~~s~~ef~~~lv~~lll~Gnayi~i~r~~~---G~~~~L~pl~p~~V~v~~d~~~~~~~~~~------~y~~ 223 (574) T protein:vir:80 153 TAQFRDPNRDNFTTFCKKLVRATYMYDQVNFEKVFDKD---GNFIKFDTVDPTTIFLATNGEGKLIKNGE------RFVQ 223 (574) T ss_pred cCCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEECCC---CcEEEEEEEcCceeEEEEcCccccccCce------EEEE Confidence 11 1233444 445677999999997633 25999999999999997654432221111 0222 Q ss_pred ecCcccccccccceecCCcceecCcccEEEeecccc-c-CCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEE Q lcl|NC_018087. 207 YDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLV-D-CCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVF 284 (520) Q Consensus 207 y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~-d-~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvF 284 (520) +... +....++.+.|+|+.-... + .++...+|-|+.|+.++.....+++...-+=---+.-+-|. T Consensus 224 ~~~g-------------~~~~~~~~~eiih~~~~~~~~~~~~~~G~spi~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil 290 (574) T protein:vir:80 224 VIDN-------------RIVAKFNERELAFAVRNPRADIEVGQYGYPELEIALKQFIAHENTEVFNDRFFSHGGTTRGIL 290 (574) T ss_pred EeCC-------------ceEEEEccccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEE Confidence 2111 1224678888888752221 1 12445678899999999988888887655444446667778 Q ss_pred EccCCC-CchHHHHHHHHHHHHh-hcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHH-- Q lcl|NC_018087. 285 YIDTGN-MPARKAAQHMQHIMNS-HRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDD-- 360 (520) Q Consensus 285 yIDvGn-lpk~KAeqyl~~im~~-~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~D-- 360 (520) .++.+. |-+..+ +=+++-++. |.- ..+..+. .++++ .|.++.-|.. +..++.- T Consensus 291 ~~~~~~~ls~e~~-~~lk~~~~~~~~G----------~~n~g~~-~vl~~---------~G~~~~~l~~--s~~D~qfle 347 (574) T protein:vir:80 291 HVKTGQQQSQQAL-DIFRREWRSSLAG----------INGSWQI-PVVSA---------EDVKFVNMTP--SANDMQFEK 347 (574) T ss_pred EeCCCCCCCHHHH-HHHHHHHHHHhcc----------ccccccc-eeecC---------CCceEEEccC--ChhHHHHHH Confidence 887665 444333 333333332 321 1111111 12211 2456666643 3334433 Q ss_pred -HHHHHHHHHHhcCCChhhccCCCcccccccc-chhhHH--H---HHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCCh Q lcl|NC_018087. 361 -ILYFRKALYMALRVPLSRIPDEQTQNVFDMS-TAISRD--E---LSFDKF-ISELQHKFEEIFLSPLKSNLLLKRVITE 432 (520) Q Consensus 361 -V~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~-~eItRD--E---lkF~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t~ 432 (520) -+|..+...++.+||...|...+.....|.+ ...++. | +.|..+ +.-+..++...|.. .|+ +. T Consensus 348 ~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~----~Ll-----~~ 418 (574) T protein:vir:80 348 WLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEKMQASQNKGLQPLLRFIEDTVNT----YIV-----AE 418 (574) T ss_pred HHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHh----hhh-----hh Confidence 3557888999999999998643322211322 222222 2 224443 44444444444433 222 22 Q ss_pred hhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHH-------------HH-- Q lcl|NC_018087. 433 DEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDED-------------IA-- 497 (520) Q Consensus 433 eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDee-------------I~-- 497 (520) .+ ..+.+.|.....=++. +...+.. ....-+++..=++. .|.|.+-+ +. T Consensus 419 ~~-----~~~~~~f~~~d~~~~~-------~~~~~~~---~~~~G~lT~NE~R~-~lgl~Pi~gGD~~~~~~n~~~~~~~ 482 (574) T protein:vir:80 419 FG-----EKYQFQFRGGDLSAQL-------DKLKIIE---QEGKVFRTVNEIRH-DKGLEPIKGGDVILNGVHIQAIGQA 482 (574) T ss_pred cC-----CceEEEecccchhhHH-------HHHHHHH---HHhCCccCHHHHHH-HhCCCCCCCCCEeeeccceeecccc Confidence 22 3466777765432221 1111111 11123677777775 46665432 00 Q ss_pred HHHHHHHHhhhcCC--------ccCCccc--------cC Q lcl|NC_018087. 498 AERKLIDEELSDKI--------FNPPEPE--------EI 520 (520) Q Consensus 498 ~~~kqi~~E~~~~~--------~~~p~~e--------~~ 520 (520) ...++.+.+..+.. -++|+.. +. T Consensus 483 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 521 (574) T protein:vir:80 483 LQEEQLEYQRSQDRLNRLLELSGGDVEQPEPEEPKDSQN 521 (574) T ss_pred cccccCCccchhccccccccccCCCCCCCCCCCCCCccc Confidence 00000000000000 0011100 00 No 60 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=97.88 E-value=1.3e-05 Score=47.36 Aligned_cols=423 Identities=12% Similarity=0.093 Sum_probs=198.1 Q ss_pred ceeeccccccccccccc--cccccc-----ccc---hhHHHHHHHHHHHhh--ccchhHHH----------------Hhh Q lcl|NC_018087. 43 ATEVDSQDIAYNGVFQK--LYGSQD-----PTA---TSTRELINTYRSLLN--NYEVDNAV----------------QEI 94 (520) Q Consensus 43 ~~~i~~~~~a~~g~~~~--~~~~~~-----~~~---~~~~~LI~~YR~ma~--~pEvd~Ai----------------~eI 94 (520) -.+++.=-..+-.+.+. ....+. .++ ....+.|+++++|.. +|-++.-. ..| T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~~~~sln~~~~i 80 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKRLKNTINMAKTA 80 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccCCCCccccceeecchHHHH Confidence 11111100011000000 010100 111 123345666667743 33332100 111 Q ss_pred hceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeE Q lcl|NC_018087. 95 VSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELR 174 (520) Q Consensus 95 vneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr 174 (520) +++..=+=- .+++++.+.+.+. -.+..+.|+.-=+|.+...+.+....+-|..+|+..+|.. + ..+. T Consensus 81 ~~~~A~lv~-~e~~~i~v~~~~~-------~~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~----~-~~i~ 147 (508) T protein:vir:15 81 ARRIASVVF-NEKAEIHVKDNNE-------ADKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRPYIDGN----H-IKIA 147 (508) T ss_pred HHHHHhhhh-CCCceEEeCCchH-------HHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEeCC----e-eEEE Confidence 111000000 0244454432211 1122355666667899999999999999999999999943 2 4588 Q ss_pred ecCccceeeeeeccCCCCccccccc------ceeccee------ecCcccccccccceec------CCcce---ecCc-- Q lcl|NC_018087. 175 RLDPRNVQFVRELDTKMENGVKVVK------GYREYFL------YDTELESYQCGHQHFA------AGTKI---KIPY-- 231 (520) Q Consensus 175 ~lDPr~i~~vr~i~~~~~~~~~~~~------~~~ey~~------y~~~~~~~~~~~~~~~------~~~~~---~I~~-- 231 (520) +++|.++-+++.-.........+.. .-..||. .... ..|......|. -|..+ .+|. T Consensus 148 ~v~ad~~~P~~~d~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~-~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~ 226 (508) T protein:vir:15 148 WVRADQFYPLQSNTNDISEAAIASRTQRTESNQTKYYTLLEFHQWQDN-GSYQITNELYKSDSPDIVGNQVPLSTLPVYK 226 (508) T ss_pred EEcCCeeEEEEEcCCCeEEEEEEEEEEeecCCCceEEEEEEEEEEecC-cceEEEEEEEecCCchhcCcccchhhccccc Confidence 8999888776321111101100000 0001221 1000 01111111111 11111 1111 Q ss_pred ---ccEEEeeccc-----c--------cCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHH Q lcl|NC_018087. 232 ---SAMVYAHSGL-----V--------DCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARK 295 (520) Q Consensus 232 ---~aI~y~hSGL-----~--------d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~K 295 (520) +.+++.+... + +++.+..+|-++.|.-.+..|-..-+. +.|.+|.-.+|+|.-+.- T Consensus 227 ~l~~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~--~~~e~~~~~~~i~v~~~~------ 298 (508) T protein:vir:15 227 ELAPQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQ--FIWEIRLGQKHIAVQPGM------ 298 (508) T ss_pred CCCcceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHH--HHHHHHhcccceeechHH------ Confidence 1223322111 1 111223467888888777777665555 558888888888863210 Q ss_pred HHHHHHHHHHhhcceeEeecCCCccc--cccccchhhhhhcccccCCCCCcceeecCCCCCcC-hHHHHHHHHHHHHHhc Q lcl|NC_018087. 296 AAQHMQHIMNSHRNRISYDARTGKVK--NQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMN-EMDDILYFRKALYMAL 372 (520) Q Consensus 296 Aeqyl~~im~~~knklvYd~~TGev~--d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLg-ei~DV~YF~kkLy~aL 372 (520) + -+|..+|.+- +++-+..|-. +...|..|+.+...--.+ -.+-+..+.+.+.... T Consensus 299 --------l-------~~d~~~~~~~~~~~~~~~~~~~-------~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~ 356 (508) T protein:vir:15 299 --------L-------RFDDEHKPTFDTEQNVYVGVLS-------DDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQI 356 (508) T ss_pred --------h-------cCCCCCccccCCCCeeEEeccC-------CCCCCCceeEeecccChHHHHHHHHHHHHHHHHHh Confidence 0 1233433321 2222222211 112233366555432222 2445777788899999 Q ss_pred CCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-------cCCCChh--hHHhhhhceE Q lcl|NC_018087. 373 RVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLL-------KRVITED--EWEAELNNIK 443 (520) Q Consensus 373 ~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiL-------kgi~t~e--ew~~~~~~I~ 443 (520) +++.+-|..++++. .-++||...+-.-..-+.+.|+.|...+.++++.=|-| ++..... .+......+. T Consensus 357 gls~~~f~~~~~~~--~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~ 434 (508) T protein:vir:15 357 GLSTGTFSYSNDGV--KTATEVVSNNSMTYQTRSSYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIE 434 (508) T ss_pred CCCchhcccccCcc--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceE Confidence 99988887665422 34567765555555566666666666666665554332 2211111 1122223466 Q ss_pred EEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 444 IVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 444 ~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) ++|...-.-. +++++- .+.++... | .+|++++.++....||||-+++.++|++|........+.-..+ T Consensus 435 v~f~D~i~~d--~~~~~~-----~~~~~v~a-G-i~s~e~~i~~~~g~~deea~~el~ri~~E~~~~~~~~~~~~~~ 502 (508) T protein:vir:15 435 CHFDDGVFVN--KDKQLE-----EDAKVLAI-G-ALSKQTFLQRNYGMTDEQAAEELAKIQSEAPTDTFEGGRSAIL 502 (508) T ss_pred EEeCCCCCCC--HHHHHH-----HHHHHHhc-C-CCCHHHHHHhcCCCChHHHHHHHHHHHHhccccCccccccccC Confidence 7776332222 233222 22222111 2 4788888777789999999999999999987664333322222 No 61 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=97.88 E-value=1.3e-05 Score=47.34 Aligned_cols=427 Identities=12% Similarity=0.041 Sum_probs=179.8 Q ss_pred eccccc-ccccccccccccccccch-hHHHHHHHHHHHhhccchhHHH---------------------HhhhceeeEec Q lcl|NC_018087. 46 VDSQDI-AYNGVFQKLYGSQDPTAT-STRELINTYRSLLNNYEVDNAV---------------------QEIVSDAIVYE 102 (520) Q Consensus 46 i~~~~~-a~~g~~~~~~~~~~~~~~-~~~~LI~~YR~ma~~pEvd~Ai---------------------~eIvneaiv~d 102 (520) +..+++ .......-..-.+..-+. -..+...+|+.+..+++=+..| ..||+-.+-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~~~~iv~~~~~~- 79 (489) T protein:vir:99 1 MLQEDFEAIDYESKLWIDQLKNYISRFKAEQLERLKELKRYYLGDNNIKYRPAKTDKYAADNRIASDFAKYITVFEQGY- 79 (489) T ss_pred CCccceeeeCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccCCcceeecchHHHHHHHHhhh- Confidence 111111 000000000000000000 0112233444444443322111 1122211111 Q ss_pred CCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCcccee Q lcl|NC_018087. 103 EGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQ 182 (520) Q Consensus 103 ~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~ 182 (520) --.+||++..++.. +.+..+.+.+--+|+....++.+.+.+-|+-|....+++....+|-..+..+||+++. T Consensus 80 l~g~~~~~~~~d~~--------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~~~i~~~~p~~~~ 151 (489) T protein:vir:99 80 MLGVPVEYKNENKD--------LQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTEVKLYQLPAEQTF 151 (489) T ss_pred hccCCceeecCChh--------HHHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcceEEEEEcccceE Confidence 11256666655443 3334455666678888999999999999999999888776667788999999999999 Q ss_pred eeeeccC--CCCcccccc-------cceecceeecCccc-cccccc---ceecC----Ccce-ecCcccEEEeecccccC Q lcl|NC_018087. 183 FVRELDT--KMENGVKVV-------KGYREYFLYDTELE-SYQCGH---QHFAA----GTKI-KIPYSAMVYAHSGLVDC 244 (520) Q Consensus 183 ~vr~i~~--~~~~~~~~~-------~~~~ey~~y~~~~~-~~~~~~---~~~~~----~~~~-~I~~~aI~y~hSGL~d~ 244 (520) ++..-.. +..-.++.+ +.+..+.+|.+..- .+.... ..+.. ...+ +|| .|.|. T Consensus 152 ~v~dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~vP--vv~~~------- 222 (489) T protein:vir:99 152 VIYDDTYQRNSLMAVHFYDIDYGSGKRKQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFFKGVP--VNEYA------- 222 (489) T ss_pred EEEcCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEecCCCcccceecccccccCCcee--EEEee------- Confidence 8864221 111222111 11223446655421 111110 00000 0001 122 12221 Q ss_pred CCCcchhhhHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCcc-cc Q lcl|NC_018087. 245 CGKNIIGYLHRAVKPANQLKL-LEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKV-KN 322 (520) Q Consensus 245 ~~~~~~syL~~aik~~NqL~m-~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev-~d 322 (520) |+....|-++..+.....+.. +-+....-+..+.|-+-+.-..... ...-+.....+.. -++..+.. .. T Consensus 223 n~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~---~~~~~~~~~~~~~------~~~~~~~~~~~ 293 (489) T protein:vir:99 223 NNEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTG---ADENDYLDDGRLN------PNGRLAISIGF 293 (489) T ss_pred cCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCccc---ccchhhhhhcccc------ccccccccccc Confidence 223344555555444444422 2233333334445544443322111 1111111111100 00111100 00 Q ss_pred ccccchhhhhhcccccCC--CCCcceeecCCCCCcChH-HHHHHHHHHHHHhcCCChhhccCCCccccccccchhh--HH Q lcl|NC_018087. 323 QANMMALTEDYWLQRRDG--KAVTEVETLPGMTGMNEM-DDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAIS--RD 397 (520) Q Consensus 323 ~~~~msmlEDywLpRReG--grgTEIsTLpGg~nLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eIt--RD 397 (520) ....+ +++..... |.+..+.-|.-..+.... .-+.-+.+.+|+-.++|- +.+++.. |..+... .- T Consensus 294 ~~~~~-----~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~--~~~~~~~---~n~Sg~Al~~~ 363 (489) T protein:vir:99 294 KKAQV-----LILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPD--TQDMKFS---GVQSGESMKYK 363 (489) T ss_pred cccee-----eeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCcc--ccccccc---ccchHHHHHHH Confidence 11111 11111111 112234444333333222 234566778888889984 2222111 2222222 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHh-hhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_018087. 398 ELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEA-ELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIG 476 (520) Q Consensus 398 ElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~-~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vg 476 (520) +..-..-+.+-|+.|...+..+++.=+-+-++.-...|.. ....|.+.|....--.+... ++++.++. | T Consensus 364 ~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~-------~~~~~kl~---g 433 (489) T protein:vir:99 364 LMASDNYREKQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEI-------VTAAQNLY---G 433 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHH-------HHHHHHHh---c Confidence 2222333555556666666665553222223222222221 23457888865444334333 34444443 4 Q ss_pred hhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCC-------ccCCccccC Q lcl|NC_018087. 477 KYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKI-------FNPPEPEEI 520 (520) Q Consensus 477 ky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~-------~~~p~~e~~ 520 (520) .+|.+++++.+=..|+++-+++-++|++|..+.. ..++++++= T Consensus 434 -iis~et~~~~l~~v~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 483 (489) T protein:vir:99 434 -IVSDQTIFEILNTVTGVDAEAELKRLKEEADKKQSLPEPRLVGDASGQEE 483 (489) T ss_pred -cCCHHHHHHhcCCCCchhHHHHHHHHHHHHHHHhccccccccCCCCCCcC Confidence 3799999988666777777777777777754432 111111111 No 62 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=97.81 E-value=1.8e-05 Score=46.66 Aligned_cols=412 Identities=14% Similarity=0.057 Sum_probs=200.4 Q ss_pred eeccc--ccccccccccccccccccchhHHHHHHHHHHHhhc--------------------c----------chhHHHH Q lcl|NC_018087. 45 EVDSQ--DIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNN--------------------Y----------EVDNAVQ 92 (520) Q Consensus 45 ~i~~~--~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~--------------------p----------Evd~Ai~ 92 (520) |.-.+ ..++.+-+.+-..+ ...+||..|-+++.. | --..++. T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~l~~~i~~ 74 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNG------SEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVHDKLMNSGTGNEIVV 74 (518) T ss_pred CcchhhHHHHHHHhhcCCCCc------cchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCccccccccCChHHHHHH Confidence 11000 11222222211111 122445444333211 1 1112223 Q ss_pred hhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCCCCCeee Q lcl|NC_018087. 93 EIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIE 172 (520) Q Consensus 93 eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~e 172 (520) ++++=+ -.++|++.+...+.++ -+..++..+.|++-.+|.++..+.+..+..-|..+|+..+|... .. T Consensus 75 ~~A~ll-----~~e~~~i~v~~~~~~d--~e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~-----~~ 142 (518) T protein:vir:78 75 VAAEYI-----SGKPLSIDVTGVNGSK--DENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINILNGR-----PS 142 (518) T ss_pred HHHHhh-----cCCCceEEecCccccC--cHHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEECCe-----eE Confidence 322221 1135566654333222 13456666788888899999999999999999999999998432 46 Q ss_pred eEecCccceeeeeeccCCC-----------Cccccccc-------------------ceecceeecCcccccccccceec Q lcl|NC_018087. 173 LRRLDPRNVQFVRELDTKM-----------ENGVKVVK-------------------GYREYFLYDTELESYQCGHQHFA 222 (520) Q Consensus 173 lr~lDPr~i~~vr~i~~~~-----------~~~~~~~~-------------------~~~ey~~y~~~~~~~~~~~~~~~ 222 (520) +..++|-++.++..- ... .++..++. ++-.|-+|..... ... T Consensus 143 i~~v~ad~~~P~~~~-g~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~-------~~v 214 (518) T protein:vir:78 143 ISVHSSSQFWIDFKN-NEPFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGD-------KTT 214 (518) T ss_pred EEEEcCCeeEEEeec-CcEEEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCc-------ccc Confidence 888999888885321 100 01111110 0001111111000 000 Q ss_pred CCcceec-----------------------CcccEEEeecc---cccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018087. 223 AGTKIKI-----------------------PYSAMVYAHSG---LVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRIT 276 (520) Q Consensus 223 ~~~~~~I-----------------------~~~aI~y~hSG---L~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~ 276 (520) +.+.+.+ ++..+.|..-. -.+.+-+..+|-|+.|.-++..|-..=+ -+.|.. T Consensus 215 ~~~~~~~~~~l~~~~~~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s--~~~~e~ 292 (518) T protein:vir:78 215 PISAERLPEQITSYLHTNDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFT--VYMREG 292 (518) T ss_pred cccccccccccccccccccCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHH--HHHHHH Confidence 0110111 11112222110 0111222357888888887777766655 456888 Q ss_pred cCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhccccc----CCCC-CcceeecCC Q lcl|NC_018087. 277 RAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRR----DGKA-VTEVETLPG 351 (520) Q Consensus 277 RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRR----eGgr-gTEIsTLpG 351 (520) |.=++|||.-+ .=|+ ...+..++.-. ....--.+++.+.. +|+. .+.|+++.. T Consensus 293 ~~g~~~i~v~~-~~l~------------------~~~~~~~~~~~---~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~ 350 (518) T protein:vir:78 293 EKTKTKIAASE-RMFR------------------KKVNKSTDKEE---WSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQG 350 (518) T ss_pred HhCCceeeech-hHhc------------------cCCCCCCCccc---cccCCCCceEEEecCcCCCCCccccceeeeec Confidence 88777777621 1110 01111111100 00000113333221 1222 234666654 Q ss_pred CCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cCC Q lcl|NC_018087. 352 MTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLL-KRV 429 (520) Q Consensus 352 g~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiL-kgi 429 (520) .=.-.+ ..-+..+.+.+..+.+++-+-|..+++ .. -++||.+..-+-...+.+.|+.+...+.++++.=|-| +.. T Consensus 351 ~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~-~~--TATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~ 427 (518) T protein:vir:78 351 DFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNR-EV--KATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGG 427 (518) T ss_pred ccChHHHHHHHHHHHHHHHHhhCCChhhcCcccc-cc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 322222 233667778888888888877754322 11 3466666655566678888888888777777764332 322 Q ss_pred CChhhHHhh--hhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHH-hCCCHHHHHHHHHHHHHh Q lcl|NC_018087. 430 ITEDEWEAE--LNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDF-LQMSDEDIAAERKLIDEE 506 (520) Q Consensus 430 ~t~eew~~~--~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~I-L~~tDeeI~~~~kqi~~E 506 (520) .-...|... ...+.++|...-.-.+..+++ .++++.-. | .+|+++.+++. -..||+|.+++-++|++| T Consensus 428 ~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~-------~~~~~v~a-G-imS~e~~i~~~~~~~~deea~~e~~ri~~E 498 (518) T protein:vir:78 428 TNNKEKAIMRDEIRVIIEFPDPMSVNLNELSS-------TLNNMNSA-L-AMSVEEKVKLIHPKWEDEEIQAEVKRIYLE 498 (518) T ss_pred cCccccccCCCceeEEEEeCCCCCCCHHHHHH-------HHHHHHhc-C-CCCHHHHHHHhCCCCCHHHHHHHHHHHHHH Confidence 221111111 124777776432222322222 22222111 2 57999877664 368999999999999999 Q ss_pred hhcCCccCCccccC Q lcl|NC_018087. 507 LSDKIFNPPEPEEI 520 (520) Q Consensus 507 ~~~~~~~~p~~e~~ 520 (520) .... +.|+++++ T Consensus 499 ~~~~--~~~~p~~~ 510 (518) T protein:vir:78 499 NAIG--EVPDPEAI 510 (518) T ss_pred hccc--CCCCCccc Confidence 7665 46777777 No 63 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=97.81 E-value=1.8e-05 Score=46.64 Aligned_cols=391 Identities=12% Similarity=0.122 Sum_probs=177.9 Q ss_pred hhhhcchhhhhhhHHHhhhccCCC-cccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccch Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAE-SITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEV 87 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~-s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEv 87 (520) |.||+-..-.. .++++.. ....+. +... +.+.+.....+.++ ..+..+++|-| T Consensus 1 m~~~~~~~~~~------~~~~~~~~~~~~~~---~~~~-----~~~~~~~~~~~~~v------------~~~~a~~~~~v 54 (412) T protein:vir:26 1 MNVIAKENIVT------RIKKKLIDNWIDQS---TSKL-----YDFSPWKNRSFWGV------------INNTLETNETI 54 (412) T ss_pred Cccchhhhhhh------hhhhhHhhhhhccc---cccc-----ccccccCCcccccc------------chhhhhccHHH Confidence 33332111111 1111110 000111 0000 01111111111111 12345788999 Q ss_pred hHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHH-hcchhhhHHH----HHhhccccceeEEEeee Q lcl|NC_018087. 88 DNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNM-LNFQRKGSDH----FKRWYVDSRVFFHKIIN 162 (520) Q Consensus 88 d~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~l-l~f~k~g~~~----fRrWYvDgri~~hkvid 162 (520) ..||+-|.+.+.-.+ +.+.=+... .......+|+. =+-..+++++ +..+.++|--|..++-| T Consensus 55 ~~~i~~ia~~iA~lp-----~~~~~~~~~--------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~ 121 (412) T protein:vir:26 55 FSAITKLSNSMASLP-----LKMYEDYKV--------VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERD 121 (412) T ss_pred HHHHHHHHHhHhhCc-----eeEeecccc--------ccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEEC Confidence 999999999887543 222111111 11122223321 2223445554 44578889999888755 Q ss_pred cCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccc Q lcl|NC_018087. 163 PNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLV 242 (520) Q Consensus 163 ~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~ 242 (520) . ...+++|.+|+|..+++.++-. +.. -+|.|... .+.++.++.+.|+|.. +.- T Consensus 122 ~---~G~~~~L~~l~~~~v~v~~~~~-----~~~------~~y~~~~~------------~g~~~~~~~~evih~~-~~~ 174 (412) T protein:vir:26 122 I---YHQPSKLFLLNPDVVEMLIENQ-----SRE------LYYSIHAA------------TGNKLIVHNMDMLHFK-HIV 174 (412) T ss_pred C---CCcEEEEEEEcCceeEEEEeCC-----CcE------EEEEEEcC------------CceEEEEccccEEEeC-CCC Confidence 2 2248899999999998864321 111 12222211 1234678999998883 222 Q ss_pred cCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCcccc Q lcl|NC_018087. 243 DCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKN 322 (520) Q Consensus 243 d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d 322 (520) ..++...+|-|..|.+.......+++. .+..-.+.| . +..-..+.+-+.++++..+.+.+.+.| .|. T Consensus 175 ~~~~~~G~s~i~~~~~~i~~~~a~~~~-~~~~~~~~~-~-~i~~~~~~l~~e~~~~~~~~~~~~~~~-------~g~--- 241 (412) T protein:vir:26 175 ASNMVQGISPIDVLKNTTDFDNAVRTF-NLTEMQKPD-S-FMLKYGSNVGKEKRQQVLEDFKQYYEE-------NGG--- 241 (412) T ss_pred CCCCcccccHHHHHHHHHHHHHHHHHH-HHHhcCCCC-c-eEEecCCCCCHHHHHHHHHHHHHHhhc-------CCC--- Confidence 345555677777777776666666655 344444433 3 333345667777666666555444432 232 Q ss_pred ccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHH---HHHHHHHHHhcCCChhhccCCCccccccccchhhHHHH Q lcl|NC_018087. 323 QANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDI---LYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDEL 399 (520) Q Consensus 323 ~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV---~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDEl 399 (520) .+ .+ + .|.+++.|. .+.-+++-+ .|-...+.++++||...|...++. .++...+..+ T Consensus 242 ---~~-vl--------~--~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~-~~sn~e~~~~--- 301 (412) T protein:vir:26 242 ---IL-FQ--------E--PGVEIEPLP--KKYVSEDIVASENLTRERVANVFQLPSVFLNARSNT-NFAKNEELNR--- 301 (412) T ss_pred ---ee-ec--------C--CCceEEEcC--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC-CcccHHHHHH--- Confidence 11 11 1 256777774 333333333 356688999999999988754322 2122222222 Q ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchh Q lcl|NC_018087. 400 SFDKF-ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKY 478 (520) Q Consensus 400 kF~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky 478 (520) -|.++ |.-+..++.. .|- ..+.++.+|.. ...+.|.-+ ++.... +.+|++.+..+-.- -+ T Consensus 302 ~f~~~~l~P~~~~ie~----~ln-----~kLl~~~~~~~---~~~~~fd~~----~l~~~d-~~~~~~~~~~~~~~--G~ 362 (412) T protein:vir:26 302 FYLQHTLLPIVKQYEE----EFN-----RKLLTKTDREK---NRYFKFNVK----SYLRAD-SATQAEVYFKAVRS--GY 362 (412) T ss_pred HHHHHHHHHHHHHHHH----HHH-----hhcCCcccccC---cceEEeech----hhhccC-HHHHHHHHHHHHhC--CC Confidence 24444 3333333222 122 23455555542 233444422 222222 34566655554322 35 Q ss_pred hhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCC---------cc--CCccccC Q lcl|NC_018087. 479 ISNHTAMKDFLQMSDEDIAAERKLIDEELSDKI---------FN--PPEPEEI 520 (520) Q Consensus 479 ~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~---------~~--~p~~e~~ 520 (520) ++.+-++. +|++.+-+ .-++-+-.-.-.++ .+ +.++.|= T Consensus 363 ~t~NE~R~-~~gl~p~~--ggD~~~~~~n~~~~~~~~~~~~~~~gG~~n~~e~ 412 (412) T protein:vir:26 363 YTINDIRE-WEDLPPVE--GGDKPLISGDLYPIDTPLELRKSLKGGDKNVNES 412 (412) T ss_pred cCHHHHHH-HhCCCCCC--CcCeeeecccccccccchhhcccccCCCCCcCCC Confidence 67777774 46665432 00110000000000 00 0000000 No 64 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=97.81 E-value=1.8e-05 Score=46.62 Aligned_cols=408 Identities=11% Similarity=0.082 Sum_probs=180.2 Q ss_pred hhc--chhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHH-hhccch Q lcl|NC_018087. 11 MFA--FWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSL-LNNYEV 87 (520) Q Consensus 11 ~f~--~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~m-a~~pEv 87 (520) ||. |+++.-++ .+...+....|.+- +..++++. ..... +... +..| ..+|-| T Consensus 1 ~~~~~~~i~s~~~-~~~i~~~~~~s~~~------------~~~~~~~~-~~pp~-------~~~~----la~l~~~n~~v 55 (542) T protein:vir:41 1 MFNYHLSIRSLEK-YKAIKREEVESQAL------------GETRFEEY-VEPKV-------NPLV----LLSLLQVNPYH 55 (542) T ss_pred Ccccccccccccc-chhhhhcccccccc------------ccccCCcc-ccCCC-------CHHH----HHHHHhhcHHH Confidence 444 34443322 11111222222111 11111122 11111 2222 2233 457899 Q ss_pred hHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHH----HHHhhccccceeEEEeeec Q lcl|NC_018087. 88 DNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSD----HFKRWYVDSRVFFHKIINP 163 (520) Q Consensus 88 d~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~----~fRrWYvDgri~~hkvid~ 163 (520) ..||+-|.+.+.-+. +.+.-++.. ....| +-+-.-++.+ +++.+++-|--|++++-|. T Consensus 56 ~scI~~ia~~IA~l~-----~~~~~~~~~--------~l~~~-----lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~ 117 (542) T protein:vir:41 56 ASACSIKANDIIRTG-----YILEGDDEG--------VVDEF-----IRACKPSFEYVLLRALEDLQVFNYCTLEVVRDD 117 (542) T ss_pred HHHHHHHHHHHhhCc-----eeeecccch--------hhhhh-----cCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC Confidence 999999999876542 223222211 11111 1233334444 4555777899999998664 Q ss_pred CCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecce--eecCcccccccccceecCCcceecCcccEEEeeccc Q lcl|NC_018087. 164 NRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYF--LYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGL 241 (520) Q Consensus 164 ~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~--~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL 241 (520) + .-+.+|++|||.+|+..++... ......+...++ .|... ...+.-....+..++.+.|+|.. . T Consensus 118 ~---G~~~~L~~l~~~~v~v~~d~~~----~~~~~~~~~~~~~~~y~~~-----~~~~~~~g~~~~~~~~~eIiHir--~ 183 (542) T protein:vir:41 118 R---GDPIRFEYIPSHTIRVHKDGSR----YRQTWDGVNITHFKDYRYE-----GEINPETGEDQDSVGANELVFIH--I 183 (542) T ss_pred C---CcEEEEEEEcCcceEEEEcCCe----eEeeecCCcceeEEeeccc-----ccccccccccccccCcccEEEec--C Confidence 3 3599999999999987643211 000111111111 11111 01111111223567888888774 2 Q ss_pred cc-CCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCC---------CCchHHHHHHHHHHHHhhccee Q lcl|NC_018087. 242 VD-CCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTG---------NMPARKAAQHMQHIMNSHRNRI 311 (520) Q Consensus 242 ~d-~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvG---------nlpk~KAeqyl~~im~~~knkl 311 (520) .+ .++...+|-+..|+..+.....+++...-+=--.+--+-|.++..+ .+-+..++..-+.+...|+. T Consensus 184 ~~~~~~~~Glspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g-- 261 (542) T protein:vir:41 184 PSPVCSYYGVPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKH-- 261 (542) T ss_pred CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhh-- Confidence 23 3455668899999998888877776654443333556667777643 22222332222333333332 Q ss_pred EeecCCCccccccccchhhhhhcccccCC-CCCcceeecCCCCCcChHH---HHHHHHHHHHHhcCCChhhccCCCcccc Q lcl|NC_018087. 312 SYDARTGKVKNQANMMALTEDYWLQRRDG-KAVTEVETLPGMTGMNEMD---DILYFRKALYMALRVPLSRIPDEQTQNV 387 (520) Q Consensus 312 vYd~~TGev~d~~~~msmlEDywLpRReG-grgTEIsTLpGg~nLgei~---DV~YF~kkLy~aL~VP~SRl~~~~~~~~ 387 (520) -..+..+.+ . |+.-.| ..|.+++.|. .+..+++ -..+..+.+.++++||...|....+... T Consensus 262 -------~~~n~gk~~-v-----L~~~~~~~~g~~~~pl~--~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~ 326 (542) T protein:vir:41 262 -------LKEAPHTPL-V-----FSIPGGDTVKVTFTPLN--TSQKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPL 326 (542) T ss_pred -------hhcccCcee-E-----eeccCCcccceeEEEcC--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCccc Confidence 001111111 1 111111 1344555553 3333333 2355678899999999999864332221 Q ss_pred ccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHH Q lcl|NC_018087. 388 FDMSTAISRDELSF-DKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVN 466 (520) Q Consensus 388 ~G~~~eItRDElkF-~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~ 466 (520) +++ .+.-.-..| ..-+.-+++++...+...|- ++.++ .+.+.|..+.....-+ ..+++ T Consensus 327 -n~s-n~Eq~~~~f~~~tL~P~~~~ie~~ln~~L~---------~~~~~-----~~~~~f~~~~ll~~d~-----~~~~~ 385 (542) T protein:vir:41 327 -GGN-FAEVTRRTYYESVVRPQQNIISSILTDFFQ---------VKFNP-----KTRFKFNDETLLESDS-----VRNCA 385 (542) T ss_pred -ccc-cHHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------cccCC-----ceEEEecchhhcchHH-----HHHHH Confidence 222 122222334 34456677777666664332 22222 3456666544433211 11222 Q ss_pred HHHHhhcccchhhhHHHHHHHHhCCCHHH--HHHH----HHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 467 VLSLMEPYIGKYISNHTAMKDFLQMSDED--IAAE----RKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 467 ~~~~~~p~vgky~S~~~i~k~IL~~tDee--I~~~----~kqi~~E~~~~~~~~p~~e~~ 520 (520) ++ .-.-+++.+-++.++..+..-+ .-.. .++++...++ +.+.+..|+ T Consensus 386 ~~-----v~~GilT~NE~Re~L~g~~pgdd~~l~p~~~~~~~~~~~~~n--~~~~~~~~~ 438 (542) T protein:vir:41 386 LL-----VQSGVLTPAEARERLFGLDGGPDIFMVPSKGAAKSVKRQERN--YEKNQIREI 438 (542) T ss_pred HH-----HhCCCCCHHHHHHhhCCCCCCCccccccccccccccccCCcC--CCCCchhhh Confidence 21 1223567777765433332211 0000 0000000000 011111111 No 65 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=97.77 E-value=2.1e-05 Score=46.27 Aligned_cols=389 Identities=12% Similarity=0.132 Sum_probs=181.0 Q ss_pred hhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchh Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVD 88 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd 88 (520) |++|.+ .... .+..++++..+. +. ...++.|.....++. ..-+.+|-|. T Consensus 1 Mg~f~~---~~~r----------~~~~~~~~~~~~--~~-~~~~~~~~~~~~~~~---------------~~al~~~~v~ 49 (416) T protein:vir:81 1 MGIFYK---NEKR----------DLQYNEDDLQMM--VQ-TLPGFQGTKLRQYKD---------------IEAIRHSDIF 49 (416) T ss_pred CCcccc---cccc----------cccCCCcchhHH--HH-HhccccccCccccch---------------hhhhcchHHH Confidence 344432 1110 111111111111 10 000011111001100 1235678899 Q ss_pred HHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcch----hhhH----HHHHhhccccceeEEEe Q lcl|NC_018087. 89 NAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQ----RKGS----DHFKRWYVDSRVFFHKI 160 (520) Q Consensus 89 ~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~----k~g~----~~fRrWYvDgri~~hkv 160 (520) .||+-|.+.+.-++ +.+. ++.... .-+.+..+|+-. .++. .++..+.+.|.-|..++ T Consensus 50 ~cv~~Ia~~iA~~p-----~~~~-~~~~~~---------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~ 114 (416) T protein:vir:81 50 TAVMMIASDLARMP-----IRVT-VNGQIN---------YSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEIT 114 (416) T ss_pred HHHHHHHHhhccCc-----eEEe-cCcccc---------ccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEE Confidence 99999988777542 3332 111111 112344444322 2233 34555788999999887 Q ss_pred eecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecc Q lcl|NC_018087. 161 INPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSG 240 (520) Q Consensus 161 id~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSG 240 (520) -|. +. -+++|.+|+|..+..+++-. |.- .|+.+..... ..+....++.+.|+|.. T Consensus 115 r~~-~G--~~~~L~~i~~~~v~v~~~~~-----g~~------~~~~~~~~~~---------~~~~~~~~~~~evihir-- 169 (416) T protein:vir:81 115 RDK-TG--EPMNLTFRKTSEIELKSDAR-----GRL------YYFHQRIDSN---------GNNIERNVKFEDMLDIK-- 169 (416) T ss_pred ECC-CC--cEEEEEEEcCceeEEEECCC-----ccE------EEEEEEecCC---------CceeEEEEccccEEEec-- Confidence 652 21 28899999999998754322 211 1222111100 01112468888898875 Q ss_pred cccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHH-hhcceeEeecCCCc Q lcl|NC_018087. 241 LVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMN-SHRNRISYDARTGK 319 (520) Q Consensus 241 L~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~-~~knklvYd~~TGe 319 (520) ....++...+|-|+.|++++......++...-+--.-+--+-|..++ |.+...+|.+-+++-.+ .|+ | T Consensus 170 ~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~~~~~~~~~~~~~~~~~~~~---------g- 238 (416) T protein:vir:81 170 FYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFHKSFS---------G- 238 (416) T ss_pred cCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC-CCCCCHHHHHHHHHHHHHHhc---------C- Confidence 24667767789999999999888888887765444445566677777 45544455444443333 232 2 Q ss_pred cccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHH Q lcl|NC_018087. 320 VKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDE 398 (520) Q Consensus 320 v~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDE 398 (520) .++..+.+ .++ + |.+++.|.-.....| ++-..+.++.+.++++||.+.|..+.+ ++.+.-.. T Consensus 239 ~~nag~~~-vl~-------~---g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~------~~~~~~~~ 301 (416) T protein:vir:81 239 TKQAGKVV-VLD-------E---SMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA------NMSITDAN 301 (416) T ss_pred ccccCcee-ecC-------C---CceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC------CccHHHHH Confidence 12222222 221 1 456666543222222 344566778999999999988864322 11222233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchh Q lcl|NC_018087. 399 LSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKY 478 (520) Q Consensus 399 lkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky 478 (520) +-|..-+.-+-.++..-+...| .+. +. ...|+|++ .++...+ ...|.+.++.+-.- -+ T Consensus 302 ~~~~~~l~P~~~~ie~~ln~~l---------~~~--~~--~~~~~f~~------~~l~~~D-~~~~~~~~~~~~~~--G~ 359 (416) T protein:vir:81 302 LDYLSTLKPYITCVCAELNFKF---------NDE--YV--NREFKFDT------TEIRVVD-EKTQAEIDKINIDS--GK 359 (416) T ss_pred HHHHHHHHHHHHHHHHHHhhhc---------ccc--cc--CceEEEec------hhhhccC-HHHHHHHHHHHHhC--CC Confidence 4344444444444444333322 221 11 22344332 2222221 24456666554322 36 Q ss_pred hhHHHHHHHHhCCCHHH-----HHHH------HHHHHH------hhhcCCccCCcccc Q lcl|NC_018087. 479 ISNHTAMKDFLQMSDED-----IAAE------RKLIDE------ELSDKIFNPPEPEE 519 (520) Q Consensus 479 ~S~~~i~k~IL~~tDee-----I~~~------~kqi~~------E~~~~~~~~p~~e~ 519 (520) ++.+-+++ .|++.+-+ +-.. .+.+.+ +....-.+-.++-| T Consensus 360 ~T~NE~R~-~~gl~p~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:81 360 MNIDEIRQ-RDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEENE 416 (416) T ss_pred cCHHHHHH-HhCCCCCCCCCcceEeecccccccccccccCcccccccccccCCCCCCC Confidence 67777774 36664421 0000 000000 00001112222223 No 66 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=97.77 E-value=2.1e-05 Score=46.27 Aligned_cols=389 Identities=12% Similarity=0.132 Sum_probs=181.0 Q ss_pred hhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchh Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVD 88 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd 88 (520) |++|.+ .... .+..++++..+. +. ...++.|.....++. ..-+.+|-|. T Consensus 1 Mg~f~~---~~~r----------~~~~~~~~~~~~--~~-~~~~~~~~~~~~~~~---------------~~al~~~~v~ 49 (416) T protein:vir:45 1 MGIFYK---NEKR----------DLQYNEDDLQMM--VQ-TLPGFQGTKLRQYKD---------------IEAIRHSDIF 49 (416) T ss_pred CCcccc---cccc----------cccCCCcchhHH--HH-HhccccccCccccch---------------hhhhcchHHH Confidence 344432 1110 111111111111 10 000011111001100 1235678899 Q ss_pred HHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcch----hhhH----HHHHhhccccceeEEEe Q lcl|NC_018087. 89 NAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQ----RKGS----DHFKRWYVDSRVFFHKI 160 (520) Q Consensus 89 ~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~----k~g~----~~fRrWYvDgri~~hkv 160 (520) .||+-|.+.+.-++ +.+. ++.... .-+.+..+|+-. .++. .++..+.+.|.-|..++ T Consensus 50 ~cv~~Ia~~iA~~p-----~~~~-~~~~~~---------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~ 114 (416) T protein:vir:45 50 TAVMMIASDLARMP-----IRVT-VNGQIN---------YSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEIT 114 (416) T ss_pred HHHHHHHHhhccCc-----eEEe-cCcccc---------ccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEE Confidence 99999988777542 3332 111111 112344444322 2233 34555788999999887 Q ss_pred eecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecc Q lcl|NC_018087. 161 INPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSG 240 (520) Q Consensus 161 id~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSG 240 (520) -|. +. -+++|.+|+|..+..+++-. |.- .|+.+..... ..+....++.+.|+|.. T Consensus 115 r~~-~G--~~~~L~~i~~~~v~v~~~~~-----g~~------~~~~~~~~~~---------~~~~~~~~~~~evihir-- 169 (416) T protein:vir:45 115 RDK-TG--EPMNLTFRKTSEIELKSDAR-----GRL------YYFHQRIDSN---------GNNIERNVKFEDMLDIK-- 169 (416) T ss_pred ECC-CC--cEEEEEEEcCceeEEEECCC-----ccE------EEEEEEecCC---------CceeEEEEccccEEEec-- Confidence 652 21 28899999999998754322 211 1222111100 01112468888898875 Q ss_pred cccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHH-hhcceeEeecCCCc Q lcl|NC_018087. 241 LVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMN-SHRNRISYDARTGK 319 (520) Q Consensus 241 L~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~-~~knklvYd~~TGe 319 (520) ....++...+|-|+.|++++......++...-+--.-+--+-|..++ |.+...+|.+-+++-.+ .|+ | T Consensus 170 ~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~~~~~~~~~~~~~~~~~~~~---------g- 238 (416) T protein:vir:45 170 FYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFHKSFS---------G- 238 (416) T ss_pred cCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC-CCCCCHHHHHHHHHHHHHHhc---------C- Confidence 24667767789999999999888888887765444445566677777 45544455444443333 232 2 Q ss_pred cccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHH Q lcl|NC_018087. 320 VKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDE 398 (520) Q Consensus 320 v~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDE 398 (520) .++..+.+ .++ + |.+++.|.-.....| ++-..+.++.+.++++||.+.|..+.+ ++.+.-.. T Consensus 239 ~~nag~~~-vl~-------~---g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~------~~~~~~~~ 301 (416) T protein:vir:45 239 TKQAGKVV-VLD-------E---SMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA------NMSITDAN 301 (416) T ss_pred ccccCcee-ecC-------C---CceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC------CccHHHHH Confidence 12222222 221 1 456666543222222 344566778999999999988864322 11222233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchh Q lcl|NC_018087. 399 LSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKY 478 (520) Q Consensus 399 lkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky 478 (520) +-|..-+.-+-.++..-+...| .+. +. ...|+|++ .++...+ ...|.+.++.+-.- -+ T Consensus 302 ~~~~~~l~P~~~~ie~~ln~~l---------~~~--~~--~~~~~f~~------~~l~~~D-~~~~~~~~~~~~~~--G~ 359 (416) T protein:vir:45 302 LDYLSTLKPYITCVCAELNFKF---------NDE--YV--NREFKFDT------TEIRVVD-EKTQAEIDKINIDS--GK 359 (416) T ss_pred HHHHHHHHHHHHHHHHHHhhhc---------ccc--cc--CceEEEec------hhhhccC-HHHHHHHHHHHHhC--CC Confidence 4344444444444444333322 221 11 22344332 2222221 24456666554322 36 Q ss_pred hhHHHHHHHHhCCCHHH-----HHHH------HHHHHH------hhhcCCccCCcccc Q lcl|NC_018087. 479 ISNHTAMKDFLQMSDED-----IAAE------RKLIDE------ELSDKIFNPPEPEE 519 (520) Q Consensus 479 ~S~~~i~k~IL~~tDee-----I~~~------~kqi~~------E~~~~~~~~p~~e~ 519 (520) ++.+-+++ .|++.+-+ +-.. .+.+.+ +....-.+-.++-| T Consensus 360 ~T~NE~R~-~~gl~p~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:45 360 MNIDEIRQ-RDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEENE 416 (416) T ss_pred cCHHHHHH-HhCCCCCCCCCcceEeecccccccccccccCcccccccccccCCCCCCC Confidence 67777774 36664421 0000 000000 00001112222223 No 67 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=97.75 E-value=2.2e-05 Score=46.11 Aligned_cols=412 Identities=11% Similarity=0.106 Sum_probs=192.2 Q ss_pred hhhhcchhhhhhhHHHhhhccCCC---cccCCCCCCCceeeccc-ccccccccccccccccccchhHHHHHHHHHHHhhc Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAE---SITAPKFDDGATEVDSQ-DIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNN 84 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~---s~~~p~~~dg~~~i~~~-~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~ 84 (520) ++||.|+-.++..+- .....++ +..+|....+....... +.+.. .+.+ -....+.. + .....+++ T Consensus 1 Mgl~d~~r~~~~~~~--~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~-~~~~-~~~~~g~~------v-~~~~al~~ 69 (431) T protein:vir:10 1 MGLFDFIRREKQPEA--QARPHVEPSFQASTPTTSIPGETFEGLDDPRLK-EYIR-RGELNGGT------G-RETRALRN 69 (431) T ss_pred CcchhhhhcCccccc--ccccccccccccccccccccccccccccchHHH-Hhhc-cCccCcce------e-chhhhhcc Confidence 667777655433321 1111111 11222222211112110 00000 0000 00001100 0 12344678 Q ss_pred cchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcc----hhhhHH----HHHhhcccccee Q lcl|NC_018087. 85 YEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNF----QRKGSD----HFKRWYVDSRVF 156 (520) Q Consensus 85 pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f----~k~g~~----~fRrWYvDgri~ 156 (520) |-|..||+-|.+.+.-. |+.|-=.+ + -++... -..+..+|+. ..++++ ++..+.+.|.-| T Consensus 70 ~~V~~ci~~Ia~~iA~l-----p~~v~~~~----~-~~~~~~--~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~ 137 (431) T protein:vir:10 70 MAVLRCVTLISGTIGML-----PMNLISSD----D-SKQVLT--DDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESM 137 (431) T ss_pred HHHHHHHHHHHHhhccC-----ceEEEEec----C-ceeeec--cchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeE Confidence 99999999998887633 22221110 0 001111 1223334432 233444 344567889988 Q ss_pred EEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEE Q lcl|NC_018087. 157 FHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVY 236 (520) Q Consensus 157 ~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y 236 (520) ..++-| ..++++|.+|||..+...+.- ++.- +|.|... .+..+.++.+.|+| T Consensus 138 ~~i~r~----~g~~~~L~pl~~~~v~~~~~~-----~~~~-------~y~~~~~------------~g~~~~~~~~dViH 189 (431) T protein:vir:10 138 ARIVWS----GNRPIRLIPMDRGSAKGRLTS-----TWQI-------VYDYTTP------------TGDKIELPAREVFH 189 (431) T ss_pred EEEEEc----CCceEEEEEEcCceeEEEEcC-----CCeE-------EEEEEeC------------CceEEEEchhhEEE Confidence 888765 246899999999999875321 1111 2222111 12346789999988 Q ss_pred eecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecC Q lcl|NC_018087. 237 AHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDAR 316 (520) Q Consensus 237 ~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~ 316 (520) .. + ...++...+|.++.|.+++.....+++...=+----|--+-|.-.+ ++|.+.++++.-+.+...|.. T Consensus 190 ir-~-~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~ls~e~~~~~~~~~~~~~~g------- 259 (431) T protein:vir:10 190 LR-D-LSIDGVSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIEVP-KELSDNAYGRMKASVQENHTG------- 259 (431) T ss_pred ec-C-cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecC-CCCCHHHHHHHHHHHHHHhcC------- Confidence 74 3 3567777789999999999888888887766555555556666666 467766665555554444542 Q ss_pred CCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhh Q lcl|NC_018087. 317 TGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAIS 395 (520) Q Consensus 317 TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eIt 395 (520) .+|..+.+ .+ ++ |.+++.|.-...-.| ++--+|-...+.++++||..-|....+. . + +.+. T Consensus 260 ---~~n~g~~~-vl--------~~--g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~-t-~--sn~e 321 (431) T protein:vir:10 260 ---SENAGSWM-LL--------EE--GATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTS-W-G--SGIE 321 (431) T ss_pred ---ccccCCce-ec--------CC--CceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCC-c-c--ccHH Confidence 11111222 11 11 445555532111112 2333445678999999999988743321 1 1 1122 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhh--c Q lcl|NC_018087. 396 RDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLME--P 473 (520) Q Consensus 396 RDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~--p 473 (520) -.-+.|.++ .|+--+.. +.+.|- +.+.+++++. ...+.|.- ++|...+ +..|.+.++.+= + T Consensus 322 q~~~~f~~~--tL~P~~~~-ie~~ln-----~~Ll~~~~~~----~~~~~fd~----~~llr~d-~~~r~~~~~~~~~~G 384 (431) T protein:vir:10 322 QLAIFFIQY--GLSHWFVS-WEQAAA-----RAFLPEKMLG----QRQFKFNE----GALLRGT-LNDQAAFFSKALGAG 384 (431) T ss_pred HHHHHHHHH--HHHHHHHH-HHHHHH-----hhccChhhcC----CceEEEec----hhhhccC-HHHHHHHHHHHHhcc Confidence 222334443 23332221 222222 2334555443 23444542 2333332 466676666552 2 Q ss_pred ccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 474 YIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 474 ~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) .-..++|.+-+++ ++.|..-+=..-++.. .+....+.+..+|- T Consensus 385 ~~~g~lT~NE~R~-~~gl~p~~~~~gD~~~---~p~n~~~~~~~~~~ 427 (431) T protein:vir:10 385 GQSPWMKQNEVRE-MLDLPRADDPVADQLR---NPMTQKQKGSGDEP 427 (431) T ss_pred cccCccCHHHHHH-HhCCCCCCCcccccee---cccccccCCCCCCC Confidence 2234677777774 4666432111111110 01111111111111 No 68 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=97.74 E-value=2.3e-05 Score=46.02 Aligned_cols=373 Identities=13% Similarity=0.065 Sum_probs=171.5 Q ss_pred hhhhcch-hhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccch Q lcl|NC_018087. 9 LKMFAFW-HKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEV 87 (520) Q Consensus 9 l~~f~~~-~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEv 87 (520) +++|.+| .++. +..+.+++.+...-... .|+....++.- +.-.++|-| T Consensus 1 Mg~~~~~~~~k~----------~~~~~~~~~~~~~~~~~-------~~~~~~~~v~~--------------~~~l~~~~v 49 (383) T protein:vir:10 1 MGLLTPKNFSKR----------NAKNMVYPSNPAFFTTT-------VGGMQLSYVSA--------------LSALQNTNV 49 (383) T ss_pred CCcccccccccc----------cccccccccchhhhhhh-------ccCccccccch--------------hHhhcchHH Confidence 4444432 1111 12233333332211111 12222222211 123457889 Q ss_pred hHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHH----HHhhccccceeEEEeeec Q lcl|NC_018087. 88 DNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDH----FKRWYVDSRVFFHKIINP 163 (520) Q Consensus 88 d~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~----fRrWYvDgri~~hkvid~ 163 (520) ..||+-|.+.+.-. |+. +.+... ..+++--|=..++.++ +..+.++|.-|..++=+ T Consensus 50 ~~~i~~ia~~ia~~-----~~~--~~~~~~------------~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~- 109 (383) T protein:vir:10 50 YSVINRIASDVSSA-----HFK--TENTAT------------LNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ- 109 (383) T ss_pred HHHHHHHHHhhccC-----cee--ecccch------------hhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC- Confidence 99999999876654 222 222111 1233322333344444 44456789888876522 Q ss_pred CCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeeccccc Q lcl|NC_018087. 164 NRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVD 243 (520) Q Consensus 164 ~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d 243 (520) ..++.+++|-.+.+.+. .++. + |.++.. ..+..++++.+.|+|.. ... T Consensus 110 ------~~~~~p~~~~~v~~~~~-----~~~~--~-----~~~~~~------------~~~~~~~~~~~evih~r--~~~ 157 (383) T protein:vir:10 110 ------NLEHIPNSDVQINYLPG-----NMGI--V-----YTVLES------------NDRPKMVLRQDQMLHFR--LMP 157 (383) T ss_pred ------ceeEeecCcceEEEEEc-----CCce--E-----EEEEEc------------CCceEEEEcccceEEec--cCC Confidence 46788899877766422 1111 1 111111 11234678999998874 122 Q ss_pred CC---CCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCcc Q lcl|NC_018087. 244 CC---GKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKV 320 (520) Q Consensus 244 ~~---~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev 320 (520) ++ +...+|.|..|.+++.....++....=+----+--+-+..++.+ +-..++.+-+++.++++..- ...|. T Consensus 158 ~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~-~~~~e~~~~~~~~~~~~~~~----~n~~~- 231 (383) T protein:vir:10 158 DPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNY-LSDGKDLESAREEFEKANTG----DNSGR- 231 (383) T ss_pred CCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCC-CCCHHHHHHHHHHHHHHhCc----cccCC- Confidence 22 33468999999999999888888765443333555566666643 43344444455555544321 12222 Q ss_pred ccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHH-HHHHHHHHHhcCCChhhccCCCccccccccchhhHHH Q lcl|NC_018087. 321 KNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDI-LYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDE 398 (520) Q Consensus 321 ~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV-~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDE 398 (520) .+ .+ ..|.+++.|.-...-.+ +.+. .+-.+.+.++++||.+.|....+...-+.+ +.-.. T Consensus 232 -----~~-vl----------~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn--~eq~~ 293 (383) T protein:vir:10 232 -----LM-VL----------PDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSN--IDQIK 293 (383) T ss_pred -----cc-cc----------CCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCcccc--HHHHH Confidence 22 22 12677887754333222 2333 344688999999999998643221111111 22112 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchh Q lcl|NC_018087. 399 LSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKY 478 (520) Q Consensus 399 lkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky 478 (520) .-|.+-+.-+-+.+ .+.|...|. + +.|+|++ +. +... -+..|.+.+..+-.- -+ T Consensus 294 ~~~~~~l~P~~~~i----e~~l~~~l~-----~--------~~~~f~~--~~----l~~~-d~~~~~~~~~~~~~~--G~ 347 (383) T protein:vir:10 294 ATYLANLNSYVNPI----VDELRLKMN-----A--------PDLELDI--KD----MLDV-DDSILINQVSNLAKS--GV 347 (383) T ss_pred HHHHHHHHHHHHHH----HHHHHHhhC-----C--------ceEEeec--hh----hhcc-CHHHHHHHHHHHHhC--CC Confidence 22433333333333 333333331 1 2244333 22 1111 123455555444222 35 Q ss_pred hhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccc Q lcl|NC_018087. 479 ISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPE 518 (520) Q Consensus 479 ~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e 518 (520) ++.+-+++ +|++..-+=.+ +.+-....+-.+-.++| T Consensus 348 ~t~nE~R~-~lg~~p~~~~d---~~~~~~~~~~~~gGd~e 383 (383) T protein:vir:10 348 LGAEQAQF-ILTRSGFLPDN---LPEFKPLTNETKGGDDK 383 (383) T ss_pred cCHHHHHH-HhCCCcccCCc---ccccCCCcccCCCCCCC Confidence 67777664 45554421110 00000000101112222 No 69 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=97.70 E-value=2.7e-05 Score=45.66 Aligned_cols=398 Identities=11% Similarity=0.063 Sum_probs=195.3 Q ss_pred hhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchhHHHHhhhceeeEecCCC Q lcl|NC_018087. 26 IINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNAVQEIVSDAIVYEEGF 105 (520) Q Consensus 26 ~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~d~~~ 105 (520) -+.....++++|...+=+..+.. +.+.+...+..- .+...+.- .-.+++|-|..||+-|.+.+.-. T Consensus 1 ~~~~~~~~~~~p~~~~~~~~~~~------~~~~~~~~g~~~--~~~~~~~~--~~~~~~~~V~acV~~IA~~iA~l---- 66 (518) T protein:vir:78 1 MLLANGQTLSAPAMAELSPQMQD------SYYYAPAVGMQL--ERQFSLYG--GIYKNQPWVRTVIAKRAQALARL---- 66 (518) T ss_pred CcccCceeeccchhhhhhhhhhh------cccccceeceec--ccccchhh--HHhhhhHHHHHHHHHHHHhhccC---- Confidence 55677778888865443333321 111111111111 11111111 11257899999999999987643 Q ss_pred cEEEEeeccchhhhHHHHHHHHHHHHHHHHh----cchhhhHHHHHhhc----cccceeEEEeeecCCCCCCeeeeEecC Q lcl|NC_018087. 106 DVVSIDLDQTAFTENIRNLISDEFNSVLNML----NFQRKGSDHFKRWY----VDSRVFFHKIINPNRPKDGIIELRRLD 177 (520) Q Consensus 106 ~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll----~f~k~g~~~fRrWY----vDgri~~hkvid~~~~k~GI~elr~lD 177 (520) |+.|--.+.. ...++.+..+.+| |-..++.++.+.|. +.|.-|..++-| ....+.+|.+|+ T Consensus 67 -p~~l~~~~~~-------~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~---~~G~~~~L~~l~ 135 (518) T protein:vir:78 67 -PVKCMFTSGD-------TETEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKN---KSGTPEKLMPMH 135 (518) T ss_pred -ceEEEEEcCC-------ccccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEc---CCCcEEEEEEEC Confidence 3333222111 1112233333333 33345666666554 669989988754 223589999999 Q ss_pred ccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccccCCCC-cchhhhHHH Q lcl|NC_018087. 178 PRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDCCGK-NIIGYLHRA 256 (520) Q Consensus 178 Pr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~~~~-~~~syL~~a 256 (520) |..+...+.-. ++ ..+|.|..... ..+..++++.+.|+|.. ...++|. ..+|-|..| T Consensus 136 p~~Vtv~~~~~----~~-------~~~y~~~~~~~---------~~~~~~~~~~~eIiHir--~~~~dg~~~G~Spi~~~ 193 (518) T protein:vir:78 136 PSRVAIKRNSR----TG-------RYEYYFQAGAG---------VGTQLVSFADDEVVPIR--FFNPDGLERGLSLMESL 193 (518) T ss_pred CCceEEEEcCC----CC-------EEEEEEEecCC---------ccceeEEecCCcEEEec--CCCCCcccccccHHHHH Confidence 99888754321 11 11233322110 01123678999998875 3445554 357889999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhccc Q lcl|NC_018087. 257 VKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQ 336 (520) Q Consensus 257 ik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLp 336 (520) .+++.....+++...=+----+.-+-|...+ |.|.+..+++.-+.+...|+-- ...|.+ + .++ T Consensus 194 ~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~-~~ls~e~~~~~k~~~~~~~~G~----~nag~~------~-vL~----- 256 (518) T protein:vir:78 194 KSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSPEAQQRLREQFDRAHAGS----SNTGKT------M-VVE----- 256 (518) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCccEEEecC-CCCCHHHHHHHHHHHHHHhcCc----ccCCce------e-EcC----- Confidence 9988888888877544444445566677776 6676666655444444444320 011221 1 222 Q ss_pred ccCCCCCcceeecCCCCCcChH---HHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHH-HHHHHHHH Q lcl|NC_018087. 337 RRDGKAVTEVETLPGMTGMNEM---DDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKF-ISELQHKF 412 (520) Q Consensus 337 RReGgrgTEIsTLpGg~nLgei---~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KF-I~rLr~rF 412 (520) + |.+++.|. .+.-++ +-.+|....+.++++||...|...++. .+ +.+.-.-+.|..+ |.-+-.++ T Consensus 257 --~---G~~~~~l~--~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~s-t~---sn~e~~~~~f~~~tL~P~~~~i 325 (518) T protein:vir:78 257 --E---GMEPIPLQ--LTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRA-TF---SNISAQMRAFYRDTMAIPIARI 325 (518) T ss_pred --C---CceEEecc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCC-Cc---hhHHHHHHHHHHHHHHHHHHHH Confidence 1 44555553 333333 334477799999999999888532211 11 1222222335444 44444444 Q ss_pred HHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCC Q lcl|NC_018087. 413 EEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMS 492 (520) Q Consensus 413 s~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~t 492 (520) ...+...| ++.. + ....++ |.-+ ++...+ ++.|.+.+..+-.. -+++.+-++. .+++. T Consensus 326 e~eln~~L---------~~~~--~-~~~~~~--fd~~----~Llr~D-~~~r~~~~~~~~~~--G~lT~NE~R~-~~gl~ 383 (518) T protein:vir:78 326 QSAMDKYV---------GQYW--V-RKNRMK--FDID----DVIQPD-WEAKSESTQKMVNS--GVATPNEGRE-IMGLP 383 (518) T ss_pred HHHHHHhh---------cccc--c-CcceEE--eech----hhhccC-HHHHHHHHHHHHhC--CCcCHHHHHH-HhCCC Confidence 44444332 2221 1 112333 3322 222222 24566666666433 3667777774 46776 Q ss_pred HHHHHHHHH---------------HHHHhhhcCCccCCccccC Q lcl|NC_018087. 493 DEDIAAERK---------------LIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 493 DeeI~~~~k---------------qi~~E~~~~~~~~p~~e~~ 520 (520) .-+=..-++ ...+....+.-++|.+..- T Consensus 384 pie~~~gD~~~v~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~ 426 (518) T protein:vir:78 384 RSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPASTPV 426 (518) T ss_pred CCCCCCCceeeecccceecccccccccCCCCCCCCCCCCcccc Confidence 533000000 0000001111112221111 No 70 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=97.69 E-value=2.8e-05 Score=45.55 Aligned_cols=406 Identities=15% Similarity=0.111 Sum_probs=189.5 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhccCCCcccCCCC--CCCceeecccccccccccccccccccccchhHHHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKF--DDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTY 78 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~--~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~Y 78 (520) |+=..++++. ++.+ ++++. .-|...+..++. .+-..+.|.... +. .-+ .. T Consensus 1 ~~~~l~~~~~-------------------~~~~-~~~~~~~~~~~~~~~~~~~----~~~~~~~g~~~~--~g-~~v-~~ 52 (434) T protein:vir:43 1 MSKSLGKVLS-------------------SATS-APRSSLFGWGGKTIRLTDG----AFWSQFLGRESS--SG-KKV-TV 52 (434) T ss_pred Cccchhhhhh-------------------hccc-ccchhhhcccccccccCch----HHHHHHhcCCcc--CC-cee-ch Confidence 3322222221 1111 11110 111122211111 010001111000 00 001 23 Q ss_pred HHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHH-HHHHHHHHHH-hcchhhhHHHHH----hhccc Q lcl|NC_018087. 79 RSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLI-SDEFNSVLNM-LNFQRKGSDHFK----RWYVD 152 (520) Q Consensus 79 R~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I-~eeF~~i~~l-l~f~k~g~~~fR----rWYvD 152 (520) +..+++|-|..||+-|.+.+-.. |+.|--.+. +..+..+ ......+|+. =|-..++.++.+ ...++ T Consensus 53 ~~al~~~~V~~~i~~ia~~ia~l-----p~~~~~~~~---~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~ 124 (434) T protein:vir:43 53 DKAMKLSAVWACVRLISTSVAGL-----PLGVYERKA---DGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLW 124 (434) T ss_pred hhhhccHHHHHHHHHHHHhhhhC-----ceEEEEEcC---CCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhc Confidence 45678899999999998877642 222211110 0001111 1111222221 233455666544 45677 Q ss_pred cceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcc Q lcl|NC_018087. 153 SRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYS 232 (520) Q Consensus 153 gri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~ 232 (520) |--|..+.-+ ...+++|.+|+|..+..++.- +|. ..|+++... +..+.++.+ T Consensus 125 Gnay~~i~~~----~G~~~~L~~l~p~~v~~~~~~-----~g~------~~y~~~~~~-------------g~~~~~~~~ 176 (434) T protein:vir:43 125 GNAYAEIRRA----AGRPAALDFLLPSRVDLECDE-----NGR------LKYFYTTKK-------------GARREIERT 176 (434) T ss_pred CCeEEEEEeC----CCcEEEEEEEcCcceEEEEcC-----CCe------EEEEEEecC-------------ceEEEEccc Confidence 8887665422 235899999999999875431 221 123333221 234689999 Q ss_pred cEEEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeE Q lcl|NC_018087. 233 AMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRIS 312 (520) Q Consensus 233 aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklv 312 (520) .|+|.+ +. +.+|...+|-+..|+..+.....+++...-+----+--.-+..++ +.|.+.++ +=+++.++++..- T Consensus 177 eVih~~-~~-~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~l~~e~~-~~~r~~~~~~~g~-- 250 (434) T protein:vir:43 177 NMLHIP-AF-TLDGRIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVD-RILQPAQR-EEFREYVKSVSGA-- 250 (434) T ss_pred cEEEec-Cc-CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecC-CCCCHHHH-HHHHHHHHHhcCc-- Confidence 999986 33 667767788899999988888888776544332223334555554 45655444 4457666654320 Q ss_pred eecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh---HHHHHHHHHHHHHhcCCChhhccCCCcccccc Q lcl|NC_018087. 313 YDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE---MDDILYFRKALYMALRVPLSRIPDEQTQNVFD 389 (520) Q Consensus 313 Yd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G 389 (520) ...|. .+ +++ .|.+++.|. .+..+ ++-.++..+.+.++++||..-|....+....+ T Consensus 251 --~nag~------~~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~ 309 (434) T protein:vir:43 251 --MNSGR------SP-VLE----------QGITPETIG--INPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWG 309 (434) T ss_pred --cccCC------cc-ccC----------CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCcccc Confidence 11222 21 221 256666663 23222 33455678889999999998885432222112 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 390 MSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLS 469 (520) Q Consensus 390 ~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~ 469 (520) . .+.-.-+-|.++ .|+--+..| .+. +=..+.+.+++. ...+.|.-+. +... -+..|.+.++ T Consensus 310 s--~~e~~~~~f~~~--~L~P~~~~i-e~~-----ln~kL~~~~~~~----~~~~~fd~~~----llr~-d~~~r~~~~~ 370 (434) T protein:vir:43 310 T--GLEQQMLAFLTF--SISSITNQI-QQC-----VNKRLLTAPERI----RYYAEFSLEG----FLKA-DSAGRAAWYS 370 (434) T ss_pred c--hHHHHHHHHHHH--HHHHHHHHH-HHH-----HHhhcCChhhhc----CceEEEechh----hhcc-CHHHHHHHHH Confidence 1 122222224433 233322221 111 222456666654 2334444322 2221 1244566655 Q ss_pred HhhcccchhhhHHHHHHHHhCCCHH-------------HHHHHHH----HHHHhhhcCCccCCcccc Q lcl|NC_018087. 470 LMEPYIGKYISNHTAMKDFLQMSDE-------------DIAAERK----LIDEELSDKIFNPPEPEE 519 (520) Q Consensus 470 ~~~p~vgky~S~~~i~k~IL~~tDe-------------eI~~~~k----qi~~E~~~~~~~~p~~e~ 519 (520) .+-.- -+++.+-++. .+++.+- -++...+ +-.++.....-++|++|| T Consensus 371 ~~~~~--G~~T~NE~R~-~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 371 TMAQN--GFMTRNEGRR-KENLPELPGGDILTVQSNLVPIDQLGQSNKSQAVRAALMNWFSQPEPQE 434 (434) T ss_pred HHHhC--CCcCHHHHHH-HhCCCCCCCCCeEeeccCccchhhhhccCCCcchhhhhhccCCCCCCCC Confidence 55222 3567777774 3565531 1112211 111222233456788888 No 71 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=97.64 E-value=3.4e-05 Score=45.10 Aligned_cols=404 Identities=10% Similarity=0.052 Sum_probs=184.8 Q ss_pred CceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchhHHH---------------------HhhhceeeE Q lcl|NC_018087. 42 GATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNAV---------------------QEIVSDAIV 100 (520) Q Consensus 42 g~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai---------------------~eIvneaiv 100 (520) =+|+-+ ... .+...... -..+|..+..+++=+..| .-||+-.+- T Consensus 1 ~~t~~d--------~i~----~L~~~~~~---~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~ 65 (480) T protein:vir:78 1 MTTYHE--------HVE----RLQGLLAR---DLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSD 65 (480) T ss_pred CCCHHH--------HHH----HHHHHHHH---HHHHHHHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHh Confidence 000000 000 00000000 001122222222211111 111110000 Q ss_pred ecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeee--cCCCCCCeeeeEecCc Q lcl|NC_018087. 101 YEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIIN--PNRPKDGIIELRRLDP 178 (520) Q Consensus 101 ~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid--~~~~k~GI~elr~lDP 178 (520) += ..+.+.+. ++.+ ..+..+.|++--+|+....++++.-.+-|+-|.+.--. .....+|-.-++.+|| T Consensus 66 ~l-~~~g~~~~-~d~~--------~~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~i~~~~p 135 (480) T protein:vir:78 66 RL-DIEGFRIS-EDSE--------GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESP 135 (480) T ss_pred hh-ccCceecC-CCch--------hHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeEEEEEcc Confidence 00 00011111 1111 12334455555678889999999999999988775421 0112457778999999 Q ss_pred cceeeeeec--cCCCCccccccc------ceecceeecCccccccc--cc---ceecCCc----ce-ecCcccEEEeecc Q lcl|NC_018087. 179 RNVQFVREL--DTKMENGVKVVK------GYREYFLYDTELESYQC--GH---QHFAAGT----KI-KIPYSAMVYAHSG 240 (520) Q Consensus 179 r~i~~vr~i--~~~~~~~~~~~~------~~~ey~~y~~~~~~~~~--~~---~~~~~~~----~~-~I~~~aI~y~hSG 240 (520) +.+-.+.+- ..+..-.++.+. .+..+-+|.+....+.. ++ +...... ++ ++| .|.|++-- T Consensus 136 ~~~~~i~D~~~~~~~~~~i~~~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP--vv~f~n~~ 213 (480) T protein:vir:78 136 LYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVP--VVPLTNDP 213 (480) T ss_pred cceEEEEcCCCccceEEEEEEEEeecCCcceEEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcc--eEEeeccc Confidence 999887652 222333333221 11223344443211111 00 0000000 00 111 13343321 Q ss_pred cccCCCCcchhhhHHHH----HHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecC Q lcl|NC_018087. 241 LVDCCGKNIIGYLHRAV----KPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDAR 316 (520) Q Consensus 241 L~d~~~~~~~syL~~ai----k~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~ 316 (520) +..+..+.|=|.+.+ ..+| +++-+.+++-....-|.|-+.=.+....+..+ T Consensus 214 --~~~~~~G~sdi~~~i~~l~Da~~--~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~--------------------- 268 (480) T protein:vir:78 214 --RLGNRYGRSEISPELRKVTDAAS--RTLMNLQSASQILGTPLRVISGVTTDELTNDG--------------------- 268 (480) T ss_pred --ccCCccCccchhHHHHHHHHHHH--HHHHHHHHHHHhhcchhhhhhCCCcccccccc--------------------- Confidence 112222344444433 3333 35567777777777787765422222111100 Q ss_pred CCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhh Q lcl|NC_018087. 317 TGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAIS 395 (520) Q Consensus 317 TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eIt 395 (520) .+.+ -...+-...|++ |-+.++-++++.+ +.- ++-++-....++..-++|..-|...+. +. ..+..|. T Consensus 269 ~~~~----~~~~~~~~~~~~----~~~~~~~~~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~-n~-~Sg~Al~ 337 (480) T protein:vir:78 269 ENTT----LDIYYGRILTLA----SEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSE-NP-ASAEAII 337 (480) T ss_pred ccch----hhhhhhhhccCC----CCCceEEecCccC-HHHHHHHHHHHHHHHhcccCCCHHHhccccC-ch-hHHHHHH Confidence 0000 000111223443 2346788888743 332 333566666677777888766643221 10 1222344 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_018087. 396 RDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYI 475 (520) Q Consensus 396 RDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~v 475 (520) --+...-.-+.+.|+.|..-+...++.=+.+.|---..+| ..|.+.|..-..=+. .+.++.+.++-.-. T Consensus 338 ~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~~~~~~~~~----~~i~v~w~~~~~~s~-------~~~ad~~~kl~~~g 406 (480) T protein:vir:78 338 ATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTV-------AAKADAVSKLYANG 406 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccc----eeeeEEecCCCCCCH-------HHHHHHHHHHHHhc Confidence 4455567778899999999999988877777774433344 347788864433333 23455555554444 Q ss_pred chhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcC-------CccCCcccc------C Q lcl|NC_018087. 476 GKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDK-------IFNPPEPEE------I 520 (520) Q Consensus 476 gky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~-------~~~~p~~e~------~ 520 (520) +..+|.+++.. +|.+++++++++++..++|.... .-.++++.. - T Consensus 407 ~~~~s~et~~~-~lg~~~d~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 463 (480) T protein:vir:78 407 QGPIPKEQARI-DLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTET 463 (480) T ss_pred ccCCCHHHHHh-cCCCCHhHHHHHHHHHHHHHHHHHHHhhccccCCCccccCCCCCCC Confidence 45679999884 58999999988776555443221 111222211 1 No 72 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=97.63 E-value=3.5e-05 Score=45.02 Aligned_cols=269 Identities=12% Similarity=0.141 Sum_probs=140.1 Q ss_pred CcEEEEeeccchhhhHHHHHHHHHHHHHHHH-hcchhhhHH----HHHhhccccceeEEEeeecCCCCCCeeeeEecCcc Q lcl|NC_018087. 105 FDVVSIDLDQTAFTENIRNLISDEFNSVLNM-LNFQRKGSD----HFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPR 179 (520) Q Consensus 105 ~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~l-l~f~k~g~~----~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr 179 (520) -..+.+.+-.. .+.. .....++|+. =|-.-.+.+ +++.+.+.|.-|+.++-+ .+ ..+++|.+|+|. T Consensus 1 ia~l~~~~~~~--~~~~----~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~-~~--G~~~~l~~l~~~ 71 (278) T protein:vir:78 1 MASLPLKMYED--YKVV----NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERD-IY--HQPSKLFLLNPD 71 (278) T ss_pred CccceeEEEec--Cccc----ccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEEC-CC--CcEEEEEEECCc Confidence 11112222111 0111 1122233321 122334444 455577889999998866 22 248999999999 Q ss_pred ceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccccCCCCcchhhhHHHHHH Q lcl|NC_018087. 180 NVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKP 259 (520) Q Consensus 180 ~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~ 259 (520) .+...+.- ++.. .||.|... .+..+.++.+.|+|.. +.-..++....|.+..|.++ T Consensus 72 ~v~v~~~~-----~~~~------~~y~~~~~------------~g~~~~~~~~evih~~-~~~~~~~~~G~s~~~~~~~~ 127 (278) T protein:vir:78 72 VVEMLIEN-----QSRE------LYYSIHAA------------TGNKLIVHNMDMLHFK-HIVASNMVQGISPIDVLKNT 127 (278) T ss_pred eeEEEEcC-----CCce------EEEEEEcC------------CceEEEEccccEEEEC-CCCCCCCeeeccHHHHHHHH Confidence 99874321 1111 13333211 1234688999998884 22234455668899999999 Q ss_pred HHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccC Q lcl|NC_018087. 260 ANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRD 339 (520) Q Consensus 260 ~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRRe 339 (520) +.....++..- .+...+.| .-++. .-+.|.+..+++..+.+-..+. ..|. .+ .++ T Consensus 128 i~~~~~~~~~~-~~~~~~~~-~~i~~-~~~~l~~e~~~~~~~~~~~~~~-------~~g~------~~-vl~-------- 182 (278) T protein:vir:78 128 TDFDNAVRTFN-LTEMQKPD-SFMLK-YGSNVGKEKRQQVLEDFKQYYE-------ENGG------IL-FQE-------- 182 (278) T ss_pred HHHHHHHHHHH-HHHhcCCC-cEEEE-eCCCCCHHHHHHHHHHHHHHhc-------cCCC------ce-ecC-------- Confidence 99888887764 45555554 44444 4467777666554433322221 2232 22 221 Q ss_pred CCCCcceeecCCCCCcChHHHH---HHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHH-HHHHHHHHHHH Q lcl|NC_018087. 340 GKAVTEVETLPGMTGMNEMDDI---LYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKF-ISELQHKFEEI 415 (520) Q Consensus 340 GgrgTEIsTLpGg~nLgei~DV---~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KF-I~rLr~rFs~i 415 (520) .|+++..|. .+.-+++-+ ++..+.+.++++||.+-+....+.+ + +.+.-....|..+ |..+..++..- T Consensus 183 --~g~~~~~l~--~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~---~-sn~~~~~~~~~~~~l~P~~~~i~~~ 254 (278) T protein:vir:78 183 --PGVEIEPLP--KKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTN---F-AKNEELNRFYLQHTLLPIVKQYEEE 254 (278) T ss_pred --CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC---c-ccHHHHHHHHHHHHHHHHHHHHHHH Confidence 256777774 334444433 5788899999999988885432211 1 1121112234443 44444444443 Q ss_pred HHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccch Q lcl|NC_018087. 416 FLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSY 451 (520) Q Consensus 416 f~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~ 451 (520) |.. . +.++.++. -..+|+|+.. .- T Consensus 255 ln~----~-----L~~~~e~~-~g~~~~f~~~--~l 278 (278) T protein:vir:78 255 FNR----K-----LLTKTDRE-KIGILNLTLN--LI 278 (278) T ss_pred HHh----h-----cCChhHhc-CCceEEEecc--cC Confidence 332 2 35666655 2234555543 22 No 73 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=97.62 E-value=3.6e-05 Score=44.94 Aligned_cols=395 Identities=11% Similarity=0.051 Sum_probs=183.4 Q ss_pred Ccccccc----chhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHH Q lcl|NC_018087. 1 MSMLADS----DLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELIN 76 (520) Q Consensus 1 ~~~~~~~----~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~ 76 (520) |+...+. -|+||++.....++. +...+.+.+|... ... .+ ++.++- .+ T Consensus 1 ~~~~~~~~~~~~m~~F~~~~~~~~~~-----~~~~~~~~~~~~~----~~~------~~----~~~~~~---------~~ 52 (413) T protein:vir:96 1 MPGVSEIRKDKNLKFFNNKRSPTEES-----KAKDEIPKAPQVV----MTL------PN----FFKELI---------SD 52 (413) T ss_pred CCccchhhhhhcCCccccCCCcchhh-----hhhcccccccccc----ccc------hh----hHhhhc---------cc Confidence 7766663 234555533222211 1111111111100 000 00 011111 11 Q ss_pred HHHHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcc----hhhhHHH----HHh Q lcl|NC_018087. 77 TYRSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNF----QRKGSDH----FKR 148 (520) Q Consensus 77 ~YR~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f----~k~g~~~----fRr 148 (520) .|..++++|-|..||+-|.+.+.-+ |+.+.-+... -++.+.. .+..+|+. .-+++++ +.. T Consensus 53 ~~~~~~~~~~v~~cI~~ia~~ia~~-----~~~~~~~~~~----~~~~~~~---~~~~ll~~~PN~~~t~~~f~~~~~~~ 120 (413) T protein:vir:96 53 GYTKLSDSPEVRMAVDCIADLVSNM-----TIQLMQNGET----GDKRIKN---DLSRVVDIEPNKYLSRKTFIQWLVRS 120 (413) T ss_pred hhHHHhhchHHHHHHHHHHHhhccC-----ceEEEEecCC----Ccccccc---HHHHHHHhccccCCCHHHHHHHHHHH Confidence 2445788999999999999988643 2222111111 1111222 22333321 2344554 444 Q ss_pred hccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCccee Q lcl|NC_018087. 149 WYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIK 228 (520) Q Consensus 149 WYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~ 228 (520) +.+.|.-|.-++-|.+. .-+++|.+|||..|...+. .+.. +|.|... +.. T Consensus 121 lll~Gn~~~~i~r~~~g--~~~~~L~~l~~~~v~~~~~-----~~~~--------~y~~~~~---------------~~~ 170 (413) T protein:vir:96 121 MLLEGNGNAVVKPQVSG--DKIIGLTPISPYKVTFNVS-----DDDL--------DYSITFD---------------NKE 170 (413) T ss_pred HhhcCCeEEEEEEcCCC--CceEEEEEecCceeEEEEc-----CCeE--------EEEEeec---------------CcE Confidence 56779988887755222 2378999999999887432 1111 2222111 013 Q ss_pred cCcccEEEeecccccC-CCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhh Q lcl|NC_018087. 229 IPYSAMVYAHSGLVDC-CGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSH 307 (520) Q Consensus 229 I~~~aI~y~hSGL~d~-~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~ 307 (520) ++++.|+|.-. ..++ ++....|.+..|.+++.....+++...=+----+.-+-+..++ ++|.+..+++..+.+...| T Consensus 171 ~~~~evih~k~-~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~ 248 (413) T protein:vir:96 171 YDPSTLLHFVL-NPSIERPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVD-SDSDELSDEEGRENFEEMY 248 (413) T ss_pred EchhhEEEEec-cCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeC-CCCCHHHHHHHHHHHHHHh Confidence 46677777631 2233 3445789999999999998888887766555566667788877 5677776666555544444 Q ss_pred cceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCcccc Q lcl|NC_018087. 308 RNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNV 387 (520) Q Consensus 308 knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~ 387 (520) .. .++..+.+ .++ -+|...+++..+.- ..+.-++-..|-.+.+.++++||...|.. T Consensus 249 ~g----------~~n~g~~~-vl~------~~~~~~~~~~~~~~-~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~------ 304 (413) T protein:vir:96 249 LK----------RKEAGKPW-IIP------EGMVNVQQIKPLTL-NDLAINDAVTLDKKTVAGIFGVPAFLLGV------ 304 (413) T ss_pred cC----------ccccCcee-eec------CCcccccccccCCh-hHHHHHHHHHHHHHHHHHHhCCCHHHcCC------ Confidence 32 11111122 111 11111123332321 22323344557788999999999987741 Q ss_pred ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHH Q lcl|NC_018087. 388 FDMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNV 467 (520) Q Consensus 388 ~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~ 467 (520) |.+++-+ -..|.++ .|+--.. .+.+.|-..|+ ++ ...+.|. +.++...+ +.+|.+. T Consensus 305 -~~~~~~~--~~~~~~~--~l~P~~~-~ie~~ln~~ll-----~~--------~~~~~fd----~~~ll~~d-~~~~~~~ 360 (413) T protein:vir:96 305 -GTYNKDE--FNNFINT--KIMSIAQ-VIQQTYNKLIV-----EE--------DMYFSLN----PRSLYNYS-LTEMVSA 360 (413) T ss_pred -CcchHHH--HHHHHHH--HHHHHHH-HHHHHHHHhhC-----CC--------CcEEEEe----chhhhccC-HHHHHHH Confidence 1112111 1224332 2333222 24444444442 21 1334443 22333332 3456666 Q ss_pred HHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhh-----hcCCccCCccccC Q lcl|NC_018087. 468 LSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEEL-----SDKIFNPPEPEEI 520 (520) Q Consensus 468 ~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~-----~~~~~~~p~~e~~ 520 (520) +..+-.- -+++.+-++. .+++.+.+ .-++.+-.-. ..+-.+++..+|= T Consensus 361 ~~~~~~~--G~~t~NE~R~-~~g~~p~~--~gd~~~~~~n~~~~~~~~~~~~~~~~dt 413 (413) T protein:vir:96 361 GAQMTQL--NALRRNEFRN-WVGMPPDA--EMDDLLVLENYLQQKDLVNQKKLIQDET 413 (413) T ss_pred HHHHHhC--CCcCHHHHHH-HhCCCCCC--CcceeeecccccchhhcccccCCCCCCC Confidence 6655332 2567777764 46665532 1111100000 0000000011111 No 74 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=97.62 E-value=3.7e-05 Score=44.90 Aligned_cols=401 Identities=12% Similarity=0.133 Sum_probs=191.0 Q ss_pred cccccchhhhcchhhhhhhHHHhhhccCCCcccCC--CCCCC-ceeecccccccccccccccccccccchhHHHHHHHHH Q lcl|NC_018087. 3 MLADSDLKMFAFWHKVDDTEYDKIINDKAESITAP--KFDDG-ATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYR 79 (520) Q Consensus 3 ~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p--~~~dg-~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR 79 (520) |+++.-+++|+.- +. .++|| .+..| ...-+.+.++.. +.....+....+ .-. T Consensus 1 ~~~~~~~~~~~~~---~~------------~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~s~~g~~v--------~~~ 55 (432) T protein:vir:10 1 MPDEKKLGLLGQL---KA------------MFVPPDPVDIGGGQTFTPVNATARD--LGIIISDTGAAV--------NAD 55 (432) T ss_pred CCCCcccchhhhh---Hh------------hcCCccccccccccccccCcchhhh--hcccccccCccc--------chh Confidence 9999999999731 11 11111 11111 111111111100 000000100111 113 Q ss_pred HHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhc----chhhhHHH----HHhhcc Q lcl|NC_018087. 80 SLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLN----FQRKGSDH----FKRWYV 151 (520) Q Consensus 80 ~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~----f~k~g~~~----fRrWYv 151 (520) ..+++|-|..||+-|.+.+.-+ |+.|--+.. +..++.+. .-+..+|+ =..++.++ +..+.+ T Consensus 56 ~al~~~~V~~~i~~Ia~~ia~l-----p~~~y~~~~---~g~~~~~~---~~l~~lL~~~PN~~~t~~~f~~~l~~~lll 124 (432) T protein:vir:10 56 AIMRLDAVAACVKLVSQAIAAM-----PLTMYMRTP---DGRKEAVN---HPLYTLLLDGPNSTQTAFDFWQVVVTRLLL 124 (432) T ss_pred hhhcchHHHHHHHHHHHhhhhC-----ceeEEEecC---CCcccccc---cHHHHHHHhcccccCCHHHHHHHHHHHHhh Confidence 3567899999999999877644 222211111 11111111 22333332 22444444 445778 Q ss_pred ccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCc Q lcl|NC_018087. 152 DSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPY 231 (520) Q Consensus 152 Dgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~ 231 (520) .|--|..++-+ + ..+++|.+|+|..+..+++- +|.. -|.++... +..++++. T Consensus 125 ~Gnay~~~~~~--~--g~~~~L~~l~~~~v~v~~~~-----~g~~------~y~~~~~~-------------g~~~~~~~ 176 (432) T protein:vir:10 125 DGTAYVRKVVT--D--GRIESLQYLANDRLTITTDT-----KGNT------AYRYRRTD-------------GQMIDIPK 176 (432) T ss_pred cCCeEEEEEec--C--CcEEEEEEEcCCceEEEEcC-----CCcE------EEEEEecC-------------ceEEEEcC Confidence 89988887653 2 35899999999999886532 2221 12221111 22368899 Q ss_pred ccEEEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhccee Q lcl|NC_018087. 232 SAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRI 311 (520) Q Consensus 232 ~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knkl 311 (520) +.|+|... ...+|...+|.|..|.+++......++...=+=---+.-.-|..+| +.|-+...++..+ +|.. T Consensus 177 ~~iih~~~--~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~l~~e~~~~~~~----~~~~-- 247 (432) T protein:vir:10 177 QQIWKIMG--YSLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQID-RFLTDDQYDSFAK----KVSG-- 247 (432) T ss_pred ccEEEecC--CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecC-CCCCHHHHHHHHH----HHhh-- Confidence 99988742 3566767789999999999888777775443322223445566665 4555444443222 2221 Q ss_pred EeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChH---HHHHHHHHHHHHhcCCChhhccCCCccccc Q lcl|NC_018087. 312 SYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEM---DDILYFRKALYMALRVPLSRIPDEQTQNVF 388 (520) Q Consensus 312 vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei---~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~ 388 (520) ..+..+.+ .++ + |++++.|. .+..++ +-.+|....+.++++||...|....... + T Consensus 248 --------~~nag~~~-vl~-------~---g~~~~~l~--~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t-~ 305 (432) T protein:vir:10 248 --------SVEAGRAP-LLE-------G---GMDVKSLG--LNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGT-T 305 (432) T ss_pred --------hhhCCCce-ecC-------C---CceEEEcc--CChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCc-c Confidence 11111121 222 2 44555553 233333 3346888889999999999996432211 1 Q ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 389 DMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVL 468 (520) Q Consensus 389 G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~ 468 (520) +.++.+.-.-+-|..+ .|+--+.. +.+.|-. .++++.++. ...++|..+. +...+ ..+|.+.+ T Consensus 306 ~~~sn~e~~~~~f~~~--tl~P~~~~-ie~~ln~-----kL~~~~~~~----~~~~~fd~~~----ll~~d-~~~r~~~~ 368 (432) T protein:vir:10 306 SWGSGIESQQLGFLSM--TLSPWLRR-IEQSIAL-----NLLSPAERR----RYFADFDTSA----LLRAD-SAARSSYY 368 (432) T ss_pred cccchHHHHHHHHHHH--HHHHHHHH-HHHHHHh-----hhcCccccC----ceEEEeechh----hhccC-HHHHHHHH Confidence 2222232223335443 34332222 2222222 344554432 3455565332 22222 35567776 Q ss_pred HHhhcccchhhhHHHHHHHHhCCCHHHHHHHH------------HHHHH-hhhcCCccCCccc--cC Q lcl|NC_018087. 469 SLMEPYIGKYISNHTAMKDFLQMSDEDIAAER------------KLIDE-ELSDKIFNPPEPE--EI 520 (520) Q Consensus 469 ~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~------------kqi~~-E~~~~~~~~p~~e--~~ 520 (520) +.+-. .-++|.+-+++ .|++..-+ ..+ ..+.+ -.+++--..++++ ++ T Consensus 369 ~~~~~--~G~~T~NE~R~-~~glppi~--g~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~~~~~~~~ 430 (432) T protein:vir:10 369 SQLVN--NGLMTRDEARE-IEGLPKLG--GNAAVLTVQSAMVPLDSIGLQASPEPASGLGNQQQDKV 430 (432) T ss_pred HHHHh--CCCCCHHHHHH-HhCCCCCC--CCcceEeecCcccchhhhcccCCCCCCCCCCCcccccc Confidence 66522 24678888885 47775432 111 11100 0011111111111 11 No 75 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=97.61 E-value=3.8e-05 Score=44.80 Aligned_cols=380 Identities=12% Similarity=0.125 Sum_probs=170.3 Q ss_pred cchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccc Q lcl|NC_018087. 7 SDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYE 86 (520) Q Consensus 7 ~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pE 86 (520) =.++||+|+.+..... ...++..+-. +|.. .+..+.+. +..+. -+.. +..+++|- T Consensus 1 m~m~~f~~~~~~~~~~-------~~~~~~~~~~-~~~~------~~~~~~~~----~~~~~------~v~~-~~al~~~~ 55 (392) T protein:vir:39 1 MILPILNFINQTNDPP-------EVGSVQSYFP-DGND------AQIMESLL----GDNNE------WVSA-RAALRNSD 55 (392) T ss_pred Ccchhhhhhhcccccc-------cccccccccc-cCch------hhhhhhhc----CCCCc------eech-HHhhccHH Confidence 2346777665433221 1111111111 1111 11111111 11110 0111 33457899 Q ss_pred hhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHH----HHHhhccccceeEEEeee Q lcl|NC_018087. 87 VDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSD----HFKRWYVDSRVFFHKIIN 162 (520) Q Consensus 87 vd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~----~fRrWYvDgri~~hkvid 162 (520) |..||+-|.+.+.-++ +.+ .+.. . +.+++-=|-..++.+ ++..+++.|--|..++-| T Consensus 56 v~~~i~~ia~~ia~lp-----~~~--~~~~-----~-------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~ 116 (392) T protein:vir:39 56 LFSIILQLSSDLAIVK-----INA--EKKK-----N-------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRN 116 (392) T ss_pred HHHHHHHHHHhhccCc-----eee--ccch-----h-------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEEC Confidence 9999999999875432 221 1110 0 112222333444454 444688899999998866 Q ss_pred cCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccc Q lcl|NC_018087. 163 PNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLV 242 (520) Q Consensus 163 ~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~ 242 (520) .. ..+++|.+|+|..|+..+.-. +... +|-|... ....+..+.++.+.|+|.. .. T Consensus 117 ~~---g~~~~L~~l~~~~v~~~~~~~-----~~~~------~y~~~~~---------~~~~~~~~~~~~~eiih~~--~~ 171 (392) T protein:vir:39 117 AN---GADMKWEYLRPSQVNTYYFEY-----ENGM------YYNITFD---------DPKIEPILQAPQSDLIHMK--LL 171 (392) T ss_pred CC---CcEEEEEEEcCceeEEEEcCC-----CceE------EEEEEec---------CcccceeEEEccccEEEec--CC Confidence 32 349999999999998764321 1111 1111110 0011223578889998885 34 Q ss_pred cCCCC-cchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccc Q lcl|NC_018087. 243 DCCGK-NIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVK 321 (520) Q Consensus 243 d~~~~-~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~ 321 (520) ++++. ..+|.|..|...++....+++...-+=---+--+-+..++.+..+..++.+.++ ..|+.. ...|. T Consensus 172 ~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~---~~~~~~----~~~g~-- 242 (392) T protein:vir:39 172 SIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRS---RSFMKR----SRSGG-- 242 (392) T ss_pred CCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHH---HHHhcc----ccCCC-- Confidence 55553 468999999999999998887766443334555667777765555444433322 233321 11121 Q ss_pred cccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHH Q lcl|NC_018087. 322 NQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELS 400 (520) Q Consensus 322 d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElk 400 (520) .+ .+ | .|++++.|.-.....+ ++=.+|..+.+.++++||...|...+. ++...-.-.. T Consensus 243 ----~~-vl-----~-----~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~------~~~~~~~~~~ 301 (392) T protein:vir:39 243 ----PV-VL-----D-----DLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGD------QQSSIQQISG 301 (392) T ss_pred ----ee-ec-----C-----CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC------cccHHHHHHH Confidence 11 11 1 2567777754444444 455678889999999999988864321 1111111122 Q ss_pred HHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccc-hHHHHHHHHHHHH----HHHHHHHhhcc Q lcl|NC_018087. 401 FDKF-ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNS-YFSEMKTIEITER----RVNVLSLMEPY 474 (520) Q Consensus 401 F~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn-~f~ElKe~Ei~~~----R~~~~~~~~p~ 474 (520) |..+ +.-+-.++..-+.. .|. +.=+++ +...+..|. .+++.-+ .+++. |-+..+.+.. T Consensus 302 f~~~~l~P~~~~ie~~l~~----~L~-----~~~~~d-----~~~~~~~d~~~~~~~~~-~l~~~g~~t~nE~r~~l~~- 365 (392) T protein:vir:39 302 MYASALNRYLRPAISELEY----KLS-----DHISVN-----MRPAIDPLGDNYLSTIS-TATRWGALAENQATFVLQE- 365 (392) T ss_pred HHHHHHHHHHHHHHHHHHH----hcc-----cccccc-----chhhhccCHHHHHHHHH-HHHhCCCcCHHHHHHHHHh- Confidence 4432 22222333222222 221 111111 000111111 1111000 01100 0011111100 Q ss_pred cchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCcc Q lcl|NC_018087. 475 IGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEP 517 (520) Q Consensus 475 vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~ 517 (520) .| +...+ ++ ....+.. .+.|--++|-+ T Consensus 366 ~g-~~p~e-~r-~~e~l~~-------------~~~Gd~~~p~p 392 (392) T protein:vir:39 366 AG-YIPKD-LP-APENTNK-------------KTTGQSNEPVP 392 (392) T ss_pred cC-CCccc-cc-hhcCCCC-------------CCCCCCCCCCC Confidence 01 12222 11 1112211 12222334444 No 76 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=97.61 E-value=3.8e-05 Score=44.80 Aligned_cols=380 Identities=12% Similarity=0.125 Sum_probs=170.3 Q ss_pred cchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccc Q lcl|NC_018087. 7 SDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYE 86 (520) Q Consensus 7 ~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pE 86 (520) =.++||+|+.+..... ...++..+-. +|.. .+..+.+. +..+. -+.. +..+++|- T Consensus 1 m~m~~f~~~~~~~~~~-------~~~~~~~~~~-~~~~------~~~~~~~~----~~~~~------~v~~-~~al~~~~ 55 (392) T protein:vir:10 1 MILPILNFINQTNDPP-------EVGSVQSYFP-DGND------AQIMESLL----GDNNE------WVSA-RAALRNSD 55 (392) T ss_pred Ccchhhhhhhcccccc-------cccccccccc-cCch------hhhhhhhc----CCCCc------eech-HHhhccHH Confidence 2346777665433221 1111111111 1111 11111111 11110 0111 33457899 Q ss_pred hhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHH----HHHhhccccceeEEEeee Q lcl|NC_018087. 87 VDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSD----HFKRWYVDSRVFFHKIIN 162 (520) Q Consensus 87 vd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~----~fRrWYvDgri~~hkvid 162 (520) |..||+-|.+.+.-++ +.+ .+.. . +.+++-=|-..++.+ ++..+++.|--|..++-| T Consensus 56 v~~~i~~ia~~ia~lp-----~~~--~~~~-----~-------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~ 116 (392) T protein:vir:10 56 LFSIILQLSSDLAIVK-----INA--EKKK-----N-------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRN 116 (392) T ss_pred HHHHHHHHHHhhccCc-----eee--ccch-----h-------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEEC Confidence 9999999999875432 221 1110 0 112222333444454 444688899999998866 Q ss_pred cCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccc Q lcl|NC_018087. 163 PNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLV 242 (520) Q Consensus 163 ~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~ 242 (520) .. ..+++|.+|+|..|+..+.-. +... +|-|... ....+..+.++.+.|+|.. .. T Consensus 117 ~~---g~~~~L~~l~~~~v~~~~~~~-----~~~~------~y~~~~~---------~~~~~~~~~~~~~eiih~~--~~ 171 (392) T protein:vir:10 117 AN---GADMKWEYLRPSQVNTYYFEY-----ENGM------YYNITFD---------DPKIEPILQAPQSDLIHMK--LL 171 (392) T ss_pred CC---CcEEEEEEEcCceeEEEEcCC-----CceE------EEEEEec---------CcccceeEEEccccEEEec--CC Confidence 32 349999999999998764321 1111 1111110 0011223578889998885 34 Q ss_pred cCCCC-cchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccc Q lcl|NC_018087. 243 DCCGK-NIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVK 321 (520) Q Consensus 243 d~~~~-~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~ 321 (520) ++++. ..+|.|..|...++....+++...-+=---+--+-+..++.+..+..++.+.++ ..|+.. ...|. T Consensus 172 ~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~---~~~~~~----~~~g~-- 242 (392) T protein:vir:10 172 SIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRS---RSFMKR----SRSGG-- 242 (392) T ss_pred CCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHH---HHHhcc----ccCCC-- Confidence 55553 468999999999999998887766443334555667777765555444433322 233321 11121 Q ss_pred cccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHH Q lcl|NC_018087. 322 NQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELS 400 (520) Q Consensus 322 d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElk 400 (520) .+ .+ | .|++++.|.-.....+ ++=.+|..+.+.++++||...|...+. ++...-.-.. T Consensus 243 ----~~-vl-----~-----~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~------~~~~~~~~~~ 301 (392) T protein:vir:10 243 ----PV-VL-----D-----DLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGD------QQSSIQQISG 301 (392) T ss_pred ----ee-ec-----C-----CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC------cccHHHHHHH Confidence 11 11 1 2567777754444444 455678889999999999988864321 1111111122 Q ss_pred HHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccc-hHHHHHHHHHHHH----HHHHHHHhhcc Q lcl|NC_018087. 401 FDKF-ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNS-YFSEMKTIEITER----RVNVLSLMEPY 474 (520) Q Consensus 401 F~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn-~f~ElKe~Ei~~~----R~~~~~~~~p~ 474 (520) |..+ +.-+-.++..-+.. .|. +.=+++ +...+..|. .+++.-+ .+++. |-+..+.+.. T Consensus 302 f~~~~l~P~~~~ie~~l~~----~L~-----~~~~~d-----~~~~~~~d~~~~~~~~~-~l~~~g~~t~nE~r~~l~~- 365 (392) T protein:vir:10 302 MYASALNRYLRPAISELEY----KLS-----DHISVN-----MRPAIDPLGDNYLSTIS-TATRWGALAENQATFVLQE- 365 (392) T ss_pred HHHHHHHHHHHHHHHHHHH----hcc-----cccccc-----chhhhccCHHHHHHHHH-HHHhCCCcCHHHHHHHHHh- Confidence 4432 22222333222222 221 111111 000111111 1111000 01100 0011111100 Q ss_pred cchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCcc Q lcl|NC_018087. 475 IGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEP 517 (520) Q Consensus 475 vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~ 517 (520) .| +...+ ++ ....+.. .+.|--++|-+ T Consensus 366 ~g-~~p~e-~r-~~e~l~~-------------~~~Gd~~~p~p 392 (392) T protein:vir:10 366 AG-YIPKD-LP-APENTNK-------------KTTGQSNEPVP 392 (392) T ss_pred cC-CCccc-cc-hhcCCCC-------------CCCCCCCCCCC Confidence 01 12222 11 1112211 12222334444 No 77 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=97.52 E-value=5.2e-05 Score=44.08 Aligned_cols=382 Identities=13% Similarity=0.091 Sum_probs=176.8 Q ss_pred hhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchh Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVD 88 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd 88 (520) ++||.++.+.... +. +....+.... +.++... .... ..+....++|.|. T Consensus 1 Mg~f~~~~~~~~~--------~~----~~~~~~~~~~-------~~~~~~~-----~~~~-------~~~~~~~~~~~v~ 49 (406) T protein:vir:95 1 MGLFDRWRRTKRK--------SK----IRADTGYVGL-------FMSGEDV-----SFLV-------PGYVRLSDNPEVR 49 (406) T ss_pred Ccchhhhcccccc--------cc----ccccchhhhh-------hccCccc-----Cccc-------cCHHHHhhcHHHH Confidence 4444443222111 00 0000010000 1111000 0000 1133556899999 Q ss_pred HHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHh----cchhhhHHHHH----hhccccc--eeEE Q lcl|NC_018087. 89 NAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNML----NFQRKGSDHFK----RWYVDSR--VFFH 158 (520) Q Consensus 89 ~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll----~f~k~g~~~fR----rWYvDgr--i~~h 158 (520) +||+-|.+.+.-++-. +.... ++. .+ .+ .+.+..+| |=..+++++++ .++++|. .|.. T Consensus 50 ~~i~~ia~~ia~~~~~--~~~~~-~~~--~~----~~---~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~ 117 (406) T protein:vir:95 50 MAVHKIADLISSMTIY--LMQNT-EDG--DI----RI---RNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVF 117 (406) T ss_pred HHHHHHHHhhccCceE--EEEec-CCc--ce----ee---cchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEE Confidence 9999999988644221 11110 110 01 11 11222222 12234445444 4566665 4445 Q ss_pred EeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEee Q lcl|NC_018087. 159 KIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAH 238 (520) Q Consensus 159 kvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~h 238 (520) ++-+ .+.-+++|.+|+|..+..++.- ++.. | .| + ...++.+.|+|.- T Consensus 118 ~~~~---~~g~~~~l~~i~~~~v~~~~~~-----~~~~-------~-~~---------~--------~~~~~~~evih~~ 164 (406) T protein:vir:95 118 PKYT---ADGLIDELVPLTPSKVNFLDTP-----DGYQ-------V-LY---------G--------GQTFNYDEVLHFI 164 (406) T ss_pred EEEC---CCCcEEEEEEEcCceeEEEEcC-----CeEE-------E-Ee---------c--------cEEEchhHEEEee Confidence 5433 2334899999999999875332 1111 0 11 1 1246777787764 Q ss_pred cccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCC Q lcl|NC_018087. 239 SGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTG 318 (520) Q Consensus 239 SGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TG 318 (520) -.-...++....|.+..|..++.....+++...-+.---+.-+-+..++. .+.+..+++..+.+..+|+.-. ..| T Consensus 165 ~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~-~l~~e~~~~~~~~~~~~~~g~~----n~~ 239 (406) T protein:vir:95 165 YNPDPERPYIGRGYRVVLKDIADNLKQATATKKSFMSGKYMPSLIVKVDA-ATAELSSEEGRNAVFKKYLQAT----EAG 239 (406) T ss_pred ccCCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCCHHHHHHHHHHHHHHhcccc----ccC Confidence 22234455567899999999999998888888777666677777777774 5777788777777766665310 011 Q ss_pred ccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHH Q lcl|NC_018087. 319 KVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDE 398 (520) Q Consensus 319 ev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDE 398 (520) +.+=+..+ |..-++++.+.. .-+.-++-.++....++++++||..-|.. |..+ | T Consensus 240 ------~~~v~~~~-------~~~~~~~~~~~~-~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~-------~~~~-----~ 293 (406) T protein:vir:95 240 ------QPWIIPAE-------LLEVEQVKPLSL-KDIAINEAVELDKRTVAGMFGVPAFLLGI-------GEFN-----R 293 (406) T ss_pred ------CceeecCC-------CccccccccCCh-hHHHHHHHHHHHHHHHHHHhCCCHHHcCC-------CCch-----H Confidence 11100111 111233333322 22223455578889999999999977731 1111 1 Q ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccch Q lcl|NC_018087. 399 LSFDKFIS-ELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGK 477 (520) Q Consensus 399 lkF~KFI~-rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgk 477 (520) -.+..|+. .|+- +...+.+.|-..| +++.++ .+.|..+ ++...+ ...|.+.+..+-.- - T Consensus 294 ~~~~~~~~~~l~P-~~~~ie~~l~~~l-----~~~~~~-------~~~fd~~----~l~~~d-~~~~~~~~~~l~~~--G 353 (406) T protein:vir:95 294 DEYNNFINSTILP-IAKGIEQELTRKL-----LISPDL-------YFKFNPR----SLYAYD-LKELAEVGSNMYVR--G 353 (406) T ss_pred HHHHHHHHHHHHH-HHHHHHHHHHHhc-----CCCCCc-------EEEeech----hhhcCC-HHHHHHHHHHHHhC--C Confidence 22222322 1222 2222333333333 344432 3444322 232221 23466666555332 3 Q ss_pred hhhHHHHHHHHhCCCHHHHHHHHHH-----------HHHhh-hcCCccCCccccC Q lcl|NC_018087. 478 YISNHTAMKDFLQMSDEDIAAERKL-----------IDEEL-SDKIFNPPEPEEI 520 (520) Q Consensus 478 y~S~~~i~k~IL~~tDeeI~~~~kq-----------i~~E~-~~~~~~~p~~e~~ 520 (520) +++.+-+++ .|++.+.+ .-++. +.+.. ..+--++.+..+= T Consensus 354 ~~t~NE~R~-~~gl~p~~--~gd~~~~~~n~~~~~~~~~~~~~k~g~~~~~~~~~ 405 (406) T protein:vir:95 354 IMEGNEVRD-WLGLSPKE--GLSELVILENYIPLDKIGDQSKLKGGDNSGADGQT 405 (406) T ss_pred CcCHHHHHH-HhCCCCCC--CcceeeeccCccchhhcccccccCCCCCCCCCCCC Confidence 668888874 46775421 11110 00000 0000000000000 No 78 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=97.52 E-value=5.2e-05 Score=44.07 Aligned_cols=397 Identities=13% Similarity=0.099 Sum_probs=176.7 Q ss_pred eeecccccccccccccccccccc-cchhHHHHHHHHHHHhhccchhHHH------------------------------- Q lcl|NC_018087. 44 TEVDSQDIAYNGVFQKLYGSQDP-TATSTRELINTYRSLLNNYEVDNAV------------------------------- 91 (520) Q Consensus 44 ~~i~~~~~a~~g~~~~~~~~~~~-~~~~~~~LI~~YR~ma~~pEvd~Ai------------------------------- 91 (520) -+++ .-..+-. ......+.+.+|+.+..+.+-..+| T Consensus 1 ~~~~------------~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~ 68 (470) T protein:vir:10 1 MELD------------ALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFY 68 (470) T ss_pred CchH------------HHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchH Confidence 0000 0001111 1112245566777776665544322 Q ss_pred HhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCCCCCee Q lcl|NC_018087. 92 QEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGII 171 (520) Q Consensus 92 ~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~ 171 (520) ..||+-.+-+=- ..||++..++.+..+ ++.+.|+. +|+..-.++.+.|.+-|+-|.+.-+|. +|.. T Consensus 69 k~Iv~~~~~yl~-G~p~~~~~~d~~~~~----~l~~~~~~-----~~~~~~~~l~~~~~~~G~a~~~~y~d~----~~~~ 134 (470) T protein:vir:10 69 QLLVDQEAGYVA-SVFPDIDVGKDADNK----KIIDVLGD-----DRALTLNGLLVDSSNAGRAWLHYWIDE----DGNF 134 (470) T ss_pred HHHHHhhhhhee-ccceeeecCchHHHH----HHHHHHhh-----hHHHHHHHHHHHHhhcCeeEEEEEecC----CCce Confidence 123332222222 267777776654444 34433331 456666778899999999999988763 3678 Q ss_pred eeEecCccceeeeeecc--CCCCccccccc--------ceecceeecCcc-cccccccceecCCcce-ec--------Cc Q lcl|NC_018087. 172 ELRRLDPRNVQFVRELD--TKMENGVKVVK--------GYREYFLYDTEL-ESYQCGHQHFAAGTKI-KI--------PY 231 (520) Q Consensus 172 elr~lDPr~i~~vr~i~--~~~~~~~~~~~--------~~~ey~~y~~~~-~~~~~~~~~~~~~~~~-~I--------~~ 231 (520) .+..+||..+-++.+-. ++....++.+. .+.-+-+|++.. ..|...++........ .+ .. T Consensus 135 ~~~~~~p~~~~~v~d~~~~~~~~a~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (470) T protein:vir:10 135 RYGIIQPDQITPIYATTLDNKLLGILRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYE 214 (470) T ss_pred EEEEEcccceEEEEcCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccc Confidence 89999999999885422 22222222111 011223343322 1111111100000000 00 00 Q ss_pred cc--EEEee-ccccc----CCCCcchhhhHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHH Q lcl|NC_018087. 232 SA--MVYAH-SGLVD----CCGKNIIGYLHRAVKPANQLKL-LEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHI 303 (520) Q Consensus 232 ~a--I~y~h-SGL~d----~~~~~~~syL~~aik~~NqL~m-~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~i 303 (520) .. -++.| .|.+. +++....|=|+..+...+.+.. +=+..-.-+.+..|-.-+.-.+.-+++. . ..- T Consensus 215 ~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~-----~-~~~ 288 (470) T protein:vir:10 215 TGQSNTLKHNFGRVPFIEFSKNKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQ-----F-MND 288 (470) T ss_pred cccccccccCCCeeeEEEeecCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccch-----h-hhh Confidence 00 00000 01111 1222344656655555554432 3333334455555554443322112111 1 111 Q ss_pred HHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHH-HHHHHHHHHHHhcCCChhhccCC Q lcl|NC_018087. 304 MNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMD-DILYFRKALYMALRVPLSRIPDE 382 (520) Q Consensus 304 m~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~-DV~YF~kkLy~aL~VP~SRl~~~ 382 (520) +.+++-. +++-.+.|.|..+..|--..+..... -++-+.+.+|+-..+|- +.++ T Consensus 289 ~~~~~~i-----------------------~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~--~~~~ 343 (470) T protein:vir:10 289 LRKYKSI-----------------------KINNTGNGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGID--PANF 343 (470) T ss_pred hhhcCeE-----------------------eccCCCCCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCC--CCcc Confidence 2222211 23333334444566666555554333 34566778888899994 3333 Q ss_pred CccccccccchhhHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHH Q lcl|NC_018087. 383 QTQNVFDMSTAISRD--ELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEI 460 (520) Q Consensus 383 ~~~~~~G~~~eItRD--ElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei 460 (520) + ||.+|..+.. ...--.-+.+.++.|...+..+++.=+-+-|+. .-+| ..|.+.|.+.--=.+...++ T Consensus 344 ~----~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~~~-~~d~----~~i~i~f~~~~p~d~~e~~~- 413 (470) T protein:vir:10 344 E----SSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS-DADK----RHISQHWTRTKVEDSLTKAQ- 413 (470) T ss_pred c----cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-Cccc----ceeeEEeccCCCCCHHHHHH- Confidence 2 2444444321 111223356666666666666655433222432 2233 35777776655544433333 Q ss_pred HHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcC--Cc--------cCCcccc Q lcl|NC_018087. 461 TERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDK--IF--------NPPEPEE 519 (520) Q Consensus 461 ~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~--~~--------~~p~~e~ 519 (520) +++.+ +| .+|.+++++. |...++ -+++-++|++|..+. .. ..+++|| T Consensus 414 ------~~~~~---~g-~iS~et~l~~-~p~v~D-~~~E~eri~~E~~e~~~~~~~~~~~~~~~~dde~ 470 (470) T protein:vir:10 414 ------IVSTV---AN-YSSKEAVAKA-NPIVDD-WQQELKDLAKDKEENDPYSNQADELNGKGVNDEQ 470 (470) T ss_pred ------HHHHH---hc-cCcHHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHhhccccccCCCCCCCCC Confidence 34444 34 3799999976 665432 233344444443222 11 1223333 No 79 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=97.48 E-value=5.8e-05 Score=43.79 Aligned_cols=393 Identities=10% Similarity=0.044 Sum_probs=190.8 Q ss_pred cccccccchhHHHHHHHHHHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchh Q lcl|NC_018087. 61 YGSQDPTATSTRELINTYRSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQR 140 (520) Q Consensus 61 ~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k 140 (520) |. .++..+-.+.-++++-..=+.-+|+..++=+. .+ + +...+.+..+. ...|..-=+|+. T Consensus 1 ~l-----~~~~~~~~~~~~~~~v~n~~~~ivd~~~~~l~-~~-g-----f~~~d~~~~~~--------~~~i~~~N~~d~ 60 (434) T protein:vir:98 1 ML-----PKNAEQAFLDFQRKARTNFCGLIANASVHRLL-AL-G-----VTGPDGEPDTR--------ASRWWQANRLDS 60 (434) T ss_pred CC-----CCCccHHHHHhhhhhhccchHHHHHHHHhhhc-cC-c-----eecCCCchHHH--------HHHHHHhcChhH Confidence 10 11111111122222333455667776666332 11 1 12233333332 233444457888 Q ss_pred hhHHHHHhhccccceeEEEeeecCC---CCCCeeeeEecCccceeeeeeccCCC-Cccccccc------ceecceeecCc Q lcl|NC_018087. 141 KGSDHFKRWYVDSRVFFHKIINPNR---PKDGIIELRRLDPRNVQFVRELDTKM-ENGVKVVK------GYREYFLYDTE 210 (520) Q Consensus 141 ~g~~~fRrWYvDgri~~hkvid~~~---~k~GI~elr~lDPr~i~~vr~i~~~~-~~~~~~~~------~~~ey~~y~~~ 210 (520) ...+.++.-++.||-|+..-.++.. ..++-..++-+||+.+-.+.+-.... .-++.++. .....++|+.. T Consensus 61 ~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~~~~~~~~~~~~~~~ 140 (434) T protein:vir:98 61 RQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDIDGFGYARVFFDDTS 140 (434) T ss_pred HHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEeccCCceEEEEEEeCcE Confidence 8899999999999999988766332 11223347788998877765422111 11111110 01111222211 Q ss_pred ccccccccc----eecCCcce-ec--C---------cccEEEeecccccCCCCcchhhhHHHHHHHHHH-HHHHHHHHHH Q lcl|NC_018087. 211 LESYQCGHQ----HFAAGTKI-KI--P---------YSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQL-KLLEDAMMIY 273 (520) Q Consensus 211 ~~~~~~~~~----~~~~~~~~-~I--~---------~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL-~m~EDalVIy 273 (520) ...+..... .......+ .. + ...|.|++.-..+.. ..|=++..+....-+ +++-+.++.- T Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~~---g~sd~e~vi~liDa~~~~~s~~~~~~ 217 (434) T protein:vir:98 141 FPYRTRERTGARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGED---PEPEFAGVLDIQDRVNLGILNRMAAS 217 (434) T ss_pred EEEEEeeccccccccccccceecccccccccCCCCccceEEeccCCCcCcC---CcchhhhHHHHHHHHHHHHHHHHHHH Confidence 111110000 00000000 00 0 112345554322221 334444433333322 3455667777 Q ss_pred HHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCC Q lcl|NC_018087. 274 RITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMT 353 (520) Q Consensus 274 Ri~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~ 353 (520) +.+-.|.|-+. |--+.. .-|...+.+.....+..-....|+.- +-++++..+++.. T Consensus 218 ~~~a~p~~~i~----G~~~~~-----------------~~~~~~~~~~~~~~~~~~~~~i~~~~---~~~~~~~q~~~~~ 273 (434) T protein:vir:98 218 RFSGFRQKWIK----GHKFAK-----------------RTDPATGMTVVDQPFVPSPSAVWASE---GENTQFGQLDATD 273 (434) T ss_pred HHhcchhhhhc----CCCccc-----------------ccccccccchhhhhhhccccccccCC---CCCceEEEecCcc Confidence 77777766553 211110 00122222222222222222345543 2246777888743 Q ss_pred CcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChh Q lcl|NC_018087. 354 GMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITED 433 (520) Q Consensus 354 nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~e 433 (520) --+-++=++-.-..+....++|.+-|..+.+ + ..+..|.-.+...-.-+.+.|+.|..-+..+++.-+.+.|+ + + T Consensus 274 ~~~~~~~l~~~i~~~~~~~~~p~~~~~~~~~-n--~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~g~-~-~ 348 (434) T protein:vir:98 274 LSGFLKEHASDVRDMLTISQTPTYLYATDLV-N--ISADTIGALDILHVAKVREHIASFSEGLESVLALAAAQAGV-P-E 348 (434) T ss_pred hHHHHHHHHHHHHHHhcccCCCHHHhccccC-C--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-C-h Confidence 3233333566677888889999877763211 1 12234444455567778888888888888888877777774 2 2 Q ss_pred hHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhc---- Q lcl|NC_018087. 434 EWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSD---- 509 (520) Q Consensus 434 ew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~---- 509 (520) +| ..+++.|..-..=+. .+..+++..+..- | +|.++++ +.|.++++||+...++.+++... T Consensus 349 ~~----~~~~v~w~~~~~~s~-------~~~ada~~kl~~~-g--~~~e~~~-~~lg~~~~e~~r~~~e~~~~~~~~~~~ 413 (434) T protein:vir:98 349 DY----TEAEVRWANPAHVTM-------AVKADAATKLKSI-G--YPLDVIA-EELDESPARVRRIVAGAASQALLAASL 413 (434) T ss_pred hh----eeeeEEecCCCCCCH-------HHHHHHHHHHHhc-C--CcHHHHH-HhCCCCHHHHHHHHHHHHHHHHHHHhh Confidence 22 247788865443332 3455566666442 2 5888887 45899999998877766654332 Q ss_pred -CCccCCccccC Q lcl|NC_018087. 510 -KIFNPPEPEEI 520 (520) Q Consensus 510 -~~~~~p~~e~~ 520 (520) +...+|.+.+. T Consensus 414 ~~~~~~~~~g~~ 425 (434) T protein:vir:98 414 LPAPGAPSAGNV 425 (434) T ss_pred hccCCCCCCCCC Confidence 11223333222 No 80 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=97.48 E-value=5.9e-05 Score=43.78 Aligned_cols=379 Identities=12% Similarity=0.131 Sum_probs=171.3 Q ss_pred hcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchhHHH Q lcl|NC_018087. 12 FAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNAV 91 (520) Q Consensus 12 f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai 91 (520) ++||.++... ... .++.-..+ .+|+....|+.. .-+++|.|..|| T Consensus 1 m~~f~~~~~~------------~~~--~~~~~~~~------~~~~~~~~~~~~---------------~Al~~~~V~~~i 45 (406) T protein:vir:97 1 MSFFQPLGTS------------KVS--YDDYISSV------LAGDVSQKYLGV---------------SALKNSDILTAT 45 (406) T ss_pred CccccccCCC------------CCC--cchHHHHH------hcCCCCcccccc---------------hhhccHHHHHHH Confidence 5566432111 111 11111111 112222222111 024678999999 Q ss_pred HhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcc----hhhhHHHH----HhhccccceeEEEeeec Q lcl|NC_018087. 92 QEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNF----QRKGSDHF----KRWYVDSRVFFHKIINP 163 (520) Q Consensus 92 ~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f----~k~g~~~f----RrWYvDgri~~hkvid~ 163 (520) +-|.+.+.-++ +.+.-.+.+ ++.+ ..+..+|+. ..++.++. ....++|--|.-++-|. T Consensus 46 ~~Ia~~iA~lp-----~~~~~~~g~-------~~~~--~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~ 111 (406) T protein:vir:97 46 SIIAGDIARFP-----LVKKDVNGD-------IIHD--EDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDP 111 (406) T ss_pred HHHHHhhhhCe-----eEEEecCcc-------cccc--chHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC Confidence 99999877542 222111111 1111 123444432 23444444 44677888888776543 Q ss_pred CCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeeccccc Q lcl|NC_018087. 164 NRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVD 243 (520) Q Consensus 164 ~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d 243 (520) +. ..+.+|.+++|..+...+. .++. -+|.|... ..+..+.++.+.|+|.. ... T Consensus 112 ~~--g~~~~L~~i~p~~v~v~~~-----~~~~-------~~y~~~~~-----------~~~~~~~~~~~evih~r--~~~ 164 (406) T protein:vir:97 112 KT--NQALQFQFYRPSETTVEET-----DNHE-------IVYTFTDM-----------LTAKQVKCFAHDVIHWK--FFS 164 (406) T ss_pred CC--CeEEEEEEECCCeeEEEEc-----CCce-------EEEEEEec-----------CCceEEEEccccEEEec--CCC Confidence 22 2378999999999987533 1111 12222111 11334678899998774 456 Q ss_pred CCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccc Q lcl|NC_018087. 244 CCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQ 323 (520) Q Consensus 244 ~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~ 323 (520) .++...+|-|..|.+++.....+++...=+----++- .++...-+.|-+..+++ +++-+.++.. ...+|. T Consensus 165 ~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~-~~i~~~~~~l~~e~~~~-~~~~~~~~~~----g~n~g~---- 234 (406) T protein:vir:97 165 HDTILGRSPLLSLGDEIDLQTGGINTLIKFFKDGFSS-GILTMKGAQLSGDARQR-ARQEFEKMRE----GSVGGS---- 234 (406) T ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCC-ceEEecCCCCCHHHHHH-HHHHHHHHhc----ccccCc---- Confidence 6776678999999999888778887665443344553 45555556665544443 3333333221 012232 Q ss_pred cccchhhhhhcccccCCCCCcceeecCCCCCcChHHH-HHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHH Q lcl|NC_018087. 324 ANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDD-ILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFD 402 (520) Q Consensus 324 ~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~ 402 (520) .+ .++ .|.+++.|.-..+--|+-+ -+|-.+.+-++.+||..-|...+.. . .+.-....|. T Consensus 235 --~~-vl~----------~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~----~--~~e~~~~~f~ 295 (406) T protein:vir:97 235 --PL-VFD----------STMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVNSPN----Q--SVAQLMEDYV 295 (406) T ss_pred --ee-ecC----------CCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCCCc----c--hHHHHHHHHH Confidence 22 121 2556666643222222212 2333677888999999988543221 1 1221222354 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHH Q lcl|NC_018087. 403 KFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNH 482 (520) Q Consensus 403 KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~ 482 (520) +++ |+-.+.. +.+.|-. .+.++.+|.. -.|++++.. +++.|++.+..+-. +-.++.+ T Consensus 296 ~~~--l~P~~~~-ie~~l~~-----kll~~~~~~~--~~i~fd~~~-----------~~~~~~~~~~~~~~--~g~~T~N 352 (406) T protein:vir:97 296 TND--LPFYFDA-ITSELGL-----KTLNDKDRRL--YHIEFDTRS-----------VTGRNVDEIVKLVN--NQILTPN 352 (406) T ss_pred HHH--HHHHHHH-HHHHHhh-----hhcChhhccc--eeEEEecCc-----------cchhhHHHHHHHHh--CCCcCHH Confidence 442 3332222 2222222 2345665542 234444321 34455555444311 1245555 Q ss_pred HHHHHHhCCCHHHHHHHHH-----------HHHHhhhcCCccCCccccC Q lcl|NC_018087. 483 TAMKDFLQMSDEDIAAERK-----------LIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 483 ~i~k~IL~~tDeeI~~~~k-----------qi~~E~~~~~~~~p~~e~~ 520 (520) -++. +|++.+-+=..-++ .+ +|-.++.-....--|. T Consensus 353 E~R~-~~g~~p~~~~~gD~~~~~~n~~~~~~~-~~~~~~~~~~~~gg~~ 399 (406) T protein:vir:97 353 QGLV-ELGKQKSTDPNMDRYQSSLNYVFLDKK-EEYQDKVGIKGKGGEV 399 (406) T ss_pred HHHH-HhCCCCCCCCCCCeEeeccCccchhcc-cccccccccccCCCCC Confidence 5553 34443211000000 00 0000000000000000 No 81 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=97.44 E-value=6.6e-05 Score=43.49 Aligned_cols=386 Identities=12% Similarity=0.125 Sum_probs=182.5 Q ss_pred cccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHh Q lcl|NC_018087. 3 MLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLL 82 (520) Q Consensus 3 ~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma 82 (520) |.-+++++=++-+. +..-+..+. . +.... .+.....|.++. -.... T Consensus 1 ~~~~~~~~~~k~~~------~~~~~~~~~--------~-~~~~~-------~~~~~~~~~~v~------------~~~a~ 46 (409) T protein:vir:96 1 MAKENIVTRIKKKL------IDNWIDQSA--------S-KLYDF-------SPWKNKSFWGVI------------NNTLE 46 (409) T ss_pred CccccchhhhhhHH------hhhhhcccc--------c-ccccc-------ccccCccccccc------------hhhHh Confidence 66666655433321 111111111 0 00000 011111111111 11234 Q ss_pred hccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHH-hcchhhhHHHH----HhhccccceeE Q lcl|NC_018087. 83 NNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNM-LNFQRKGSDHF----KRWYVDSRVFF 157 (520) Q Consensus 83 ~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~l-l~f~k~g~~~f----RrWYvDgri~~ 157 (520) ++|-|..||+-|.+.+.-++ +.+. +..+..+ ....++|+. =|-..+++++. ..+.++|.-|+ T Consensus 47 ~~~~V~~ci~~ia~~ia~lp-----~~~~-~~~~~~~-------~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~ 113 (409) T protein:vir:96 47 TNETIFSAITKLSNSMASLP-----LKMY-EDYKVVN-------TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYV 113 (409) T ss_pred hhHHHHHHHHHHHHhhhhCc-----eEEe-ecccccc-------hhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEE Confidence 67889999999998887543 2221 1111111 111222321 22234455544 44677899888 Q ss_pred EEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEe Q lcl|NC_018087. 158 HKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYA 237 (520) Q Consensus 158 hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~ 237 (520) .++-|. ..-+++|.+|+|..+..+++- ++.. -||.|... .+..+.++.+.|+|. T Consensus 114 ~i~r~~---~G~~~~L~~l~~~~v~v~~~~-----~~~~------~~y~~~~~------------~g~~~~~~~~evih~ 167 (409) T protein:vir:96 114 LIERDI---YHQPSKLFLLNPDVVEMLIEN-----QSRE------LYYSIHAA------------TGNKLIVHNMDMLHF 167 (409) T ss_pred EEEECC---CCcEEEEEEEcCceeEEEEeC-----CCcE------EEEEEEcC------------CceEEEEccccEEEe Confidence 877542 223899999999999886431 1111 12222111 123467888888887 Q ss_pred ecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCC Q lcl|NC_018087. 238 HSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDART 317 (520) Q Consensus 238 hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~T 317 (520) - +.-..++-..+|.|..|.........++.. ......+.| . +...-.+.|.+.+++...+...+.|.| . T Consensus 168 r-~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~-~~~~~~~~~-~-~i~~~~~~l~~e~~~~~~~~~~~~~~n-------~ 236 (409) T protein:vir:96 168 K-HIVASNMVQGISPIDVLKNTTDFDNAVRTF-NLTEMQKPD-S-FMLKYGSNVSTEKRQQVLEDFKQYYEE-------N 236 (409) T ss_pred C-CCCCCCccccccHHHHHHHHHHHHHHHHHH-HHHhcCCCc-e-eEEecCCCCCHHHHHHHHHHHHHHhhc-------C Confidence 3 222345555678888887777766666554 345544444 2 344445777777776666655554432 2 Q ss_pred CccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh---HHHHHHHHHHHHHhcCCChhhccCCCccccccccchh Q lcl|NC_018087. 318 GKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE---MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAI 394 (520) Q Consensus 318 Gev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eI 394 (520) |.+ + .++ -|.+++.|. .+.-+ ++-..+..+.+.++++||.+.|...++. .++. + T Consensus 237 g~~------~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~-~~s~---~ 293 (409) T protein:vir:96 237 GGI------L-FQE----------PGVEIEPLP--KKYVSEDIVASENLTRERVANVFQLPSIFLNARSNT-NFAK---N 293 (409) T ss_pred CCe------e-ecC----------CCceEEEcC--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC-Cccc---H Confidence 321 1 121 256777774 23333 3333456688999999999988643321 1111 2 Q ss_pred hHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_018087. 395 SRDELSFDKF-ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEP 473 (520) Q Consensus 395 tRDElkF~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p 473 (520) .-.-+.|..+ |.-+-.++..- +-+.++++.++. ....+.|.-+ ++...+ +..|++++..+-. T Consensus 294 e~~~~~f~~~~l~P~~~~ie~~---------l~~~Ll~~~~~~---~g~~i~fd~~----~ll~~d-~~~~~e~~~~~~~ 356 (409) T protein:vir:96 294 EELNRFYLQHTLLPIVKQYEEE---------FNRKLLTKTDRE---KNRYFKFNVK----SYLRAD-SATQAEVYFKAVR 356 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHH---------HHhhcCCccccc---CcceEEeech----hhhccC-HHHHHHHHHHHHh Confidence 1112235444 33333333221 223345565554 2244555432 333333 3556666665532 Q ss_pred ccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 474 YIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 474 ~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) - -+++..-++. .|++.+-+ --++-+- +....+-...++- T Consensus 357 ~--G~~T~NE~R~-~~g~~pi~--ggD~~~~---~~n~~~~~~~~~~ 395 (409) T protein:vir:96 357 S--GYYTINDIRE-WEDLPPVE--GGDKPLI---SGDLYPIDTPLEL 395 (409) T ss_pred C--CCCCHHHHHH-HhCCCCCC--Ccceeee---cccccccccchhh Confidence 2 2667777764 46765431 0010000 0000000000000 No 82 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=97.42 E-value=7.1e-05 Score=43.31 Aligned_cols=380 Identities=13% Similarity=0.111 Sum_probs=179.0 Q ss_pred hhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchh Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVD 88 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd 88 (520) |+||+||.++... +++.|.... . ++..+... -...|..++.+|.|. T Consensus 1 Mg~~~~f~~k~~~-----------~~~~~~~~~----~-------~~~~~~~~------------~~~~~~~~~~~~~V~ 46 (403) T protein:vir:80 1 MGLFNFFRRKTRS-----------EPTNAISWF----L-------TQEAYDTL------------AIPGYTRLSDNPEVR 46 (403) T ss_pred Ccccccccccccc-----------cccchhhhh----c-------cccccccc------------ccchhhhhhhhHHHH Confidence 6777766543211 111110000 0 00000000 012355688899999 Q ss_pred HHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHH-hcchhhhHHHHHhh----ccccc--eeEEEee Q lcl|NC_018087. 89 NAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNM-LNFQRKGSDHFKRW----YVDSR--VFFHKII 161 (520) Q Consensus 89 ~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~l-l~f~k~g~~~fRrW----YvDgr--i~~hkvi 161 (520) .||+-|.+.+.-. |+.|--+...-.+ .+......+++. =|=..+++++++.+ +.+|. -|..++- T Consensus 47 ~~I~~ia~~iA~~-----p~~~~~~~~~g~~----~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~ 117 (403) T protein:vir:80 47 MAVHKIAELISSM-----TIHLMQNTDNGDI----RIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKY 117 (403) T ss_pred HHHHHHHHhhhhC-----ceEEEEecCCcee----ecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEE Confidence 9999998876532 2221101000001 111112222221 11123455555543 44443 4555543 Q ss_pred ecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeeccc Q lcl|NC_018087. 162 NPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGL 241 (520) Q Consensus 162 d~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL 241 (520) | ...-+++|.+|+|..+.+++.- ++.. + .|.. ..++.+.|+|..-+. T Consensus 118 ~---~~g~~~~L~~l~p~~v~~~~~~-----~g~~-------~-~y~~-----------------~~~~~~eiih~~~~~ 164 (403) T protein:vir:80 118 T---TSGLIDELIPLAPSKVSFVDTD-----TGYQ-------I-WYQG-----------------KAYNYDEVLHFIVNP 164 (403) T ss_pred c---CCCcEEEEEEEcCCeeEEEEcC-----CceE-------E-EEee-----------------cccchhhEEEEeccC Confidence 3 2234899999999999875332 2211 1 1211 135678888765433 Q ss_pred ccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccc Q lcl|NC_018087. 242 VDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVK 321 (520) Q Consensus 242 ~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~ 321 (520) ...++....|.+..+..+++.....++...-+----+--+-|..++. .+....+++..+.+..+|..- ..+|.+ T Consensus 165 ~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-~~~~~~~~~~~~~~~~~~~~~----~~~g~~- 238 (403) T protein:vir:80 165 DPEKPYMGRGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDA-ATAELSSEEGRNAVFKKYLEA----SEAGQP- 238 (403) T ss_pred CCcCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCChHHHHHHHHHHHHHHhhh----hhcCCe- Confidence 44445557889999999999999888877666554455666777764 455555666555555544321 012221 Q ss_pred cccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHH Q lcl|NC_018087. 322 NQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSF 401 (520) Q Consensus 322 d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF 401 (520) + + +| -++..+++++.+. -..+.-++-.++-...+.++++||..-|.- |..+ +. .| T Consensus 239 -----~-~-----~~-~~~~~~~~~~~l~-~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~-------~~~~---~~--~~ 293 (403) T protein:vir:80 239 -----W-I-----IP-AELLDVEQVKPLS-LKDLAIHETVELDKRTVAGIFGVPAFLLGV-------GKYD---KD--EY 293 (403) T ss_pred -----e-e-----ec-ccccccceeccCC-HHHHHHHHHHHHhHHHHHHHhCCCHHHcCC-------CCcc---HH--HH Confidence 1 0 11 1122334444442 133433455567778899999999977731 1111 11 12 Q ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhh Q lcl|NC_018087. 402 DKFIS-ELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYIS 480 (520) Q Consensus 402 ~KFI~-rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S 480 (520) ..|+. .|+--. ..+.+.|-..| +++.++ .+.|.-+.. -..+ ..+|.+.+..+-. +-+++ T Consensus 294 ~~f~~~~l~P~~-~~ie~~l~~kl-----l~~~~~-------~~~f~~~~l----l~~d-~~~~~~~~~~~~~--~Gi~t 353 (403) T protein:vir:80 294 NNFINSTILPIA-KGIEQELTRKL-----LISPDL-------YFKFNPRSL----YAYD-LKELAEVGSNMYV--RGLME 353 (403) T ss_pred HHHHHHHHHHHH-HHHHHHHHHhc-----cCCCCc-------EEEeechhh----hccC-HHHHHHHHHHHHh--CCCcC Confidence 23332 233222 22333333333 344443 344543222 1111 3456666665533 23678 Q ss_pred HHHHHHHHhCCCHHHH-------------H--HHHHHHHHhhhcCCccCCccc Q lcl|NC_018087. 481 NHTAMKDFLQMSDEDI-------------A--AERKLIDEELSDKIFNPPEPE 518 (520) Q Consensus 481 ~~~i~k~IL~~tDeeI-------------~--~~~kqi~~E~~~~~~~~p~~e 518 (520) ..-++. .+++.+.+= + ....+.+...+++- +.+.| T Consensus 354 ~NE~R~-~~gl~p~~ggd~~~~~~n~~pl~~~~~~~~~k~ge~~~~--~~~~~ 403 (403) T protein:vir:80 354 GNEVRD-WLGLSPKEGLSELVILENYIPLDKIGDQNKLKGGEKGGA--DGQTD 403 (403) T ss_pred HHHHHH-HhCCCCCCCCCeEeecccccchhhccchhhccCCCCCCC--CCCCC Confidence 888875 477765320 0 00000110011110 11111 No 83 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=97.39 E-value=7.8e-05 Score=43.12 Aligned_cols=400 Identities=10% Similarity=0.037 Sum_probs=186.6 Q ss_pred hhccCCCcccCCCCCCCceeecc-cccccccccccccccccccchhHHHHHHHHHH-HhhccchhHHHHhhhceeeEecC Q lcl|NC_018087. 26 IINDKAESITAPKFDDGATEVDS-QDIAYNGVFQKLYGSQDPTATSTRELINTYRS-LLNNYEVDNAVQEIVSDAIVYEE 103 (520) Q Consensus 26 ~~~~~~~s~~~p~~~dg~~~i~~-~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~-ma~~pEvd~Ai~eIvneaiv~d~ 103 (520) -+.....++++|...+=...+.. ...++.. ...+.. ....|-. .+++|-|..||+-|.+.+.-. T Consensus 1 ~~~~~~~~~~~p~~~e~~~~~~~~~~~~~~~-----~~~~~~-------~~~~~~~~a~~~~~V~acV~~IA~~iA~l-- 66 (518) T protein:vir:10 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAV-----GMQLER-------QFSLYGGIYKNQPWVRTVIAKRAQALARL-- 66 (518) T ss_pred CcccCceeecCchhhhhhhhhhccccccccc-----ceeccc-------ccchhhHHHhhhHHHHHHHHHHHHhhccC-- Confidence 45567777777753332222211 0011110 001110 1122222 357899999999999987522 Q ss_pred CCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhh----ccccceeEEEeeecCCCCCCeeeeEecCcc Q lcl|NC_018087. 104 GFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRW----YVDSRVFFHKIINPNRPKDGIIELRRLDPR 179 (520) Q Consensus 104 ~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrW----YvDgri~~hkvid~~~~k~GI~elr~lDPr 179 (520) |+.+.-.+.+-. ++.....+..+++-=|-..++.++.+.| .+.|.-|..++-|. ...+++|.+|+|. T Consensus 67 ---pl~l~~~~~~~~---~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~---~G~~~~L~~l~p~ 137 (518) T protein:vir:10 67 ---PVKCMFTSGDTE---TEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK---SGTPEKLMPMHPS 137 (518) T ss_pred ---ceEEEEEcCCCc---eeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECC---CCcEEEEEEECCC Confidence 222211111000 1111111222333334445666665554 46799998887542 2348999999999 Q ss_pred ceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccccCCCC-cchhhhHHHHH Q lcl|NC_018087. 180 NVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDCCGK-NIIGYLHRAVK 258 (520) Q Consensus 180 ~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~~~~-~~~syL~~aik 258 (520) .+...+.-. ++. -+|.|.... ...+..++++.+.|+|.. ...++|. ..+|-|..|.+ T Consensus 138 ~v~v~~~~~----~~~-------~~y~~~~~~---------~~~~~~~~~~~~eViHir--~~s~dg~~~G~spi~~a~~ 195 (518) T protein:vir:10 138 RVAIKRNSR----TGR-------YEYYFQAGA---------GVGTQLVSFADDEVVPIR--FFNPDGLERGLSLMESLKS 195 (518) T ss_pred ceEEEEcCC----CCE-------EEEEEEecC---------CccceEEEecCCcEEEec--CCCCCcccccccHHHHHHH Confidence 888754321 111 123332210 001123678999998885 3445664 46788999998 Q ss_pred HHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhccccc Q lcl|NC_018087. 259 PANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRR 338 (520) Q Consensus 259 ~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRR 338 (520) ++.....+++...=+----+.-+-|.-.+ +.|.+..+++--+.+-..|+- . ...|. .+ .++ T Consensus 196 ~i~~~~a~~~~~~~~f~ng~~p~gil~~~-~~ls~e~~~~~k~~~~~~~~G--~--~nag~------v~-vL~------- 256 (518) T protein:vir:10 196 TIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQFDRAHSG--S--SNTGK------TM-VVE------- 256 (518) T ss_pred HHHHHHHHHHHHHHHHhcCCCccEEEecC-CCCCHHHHHHHHHHHHHHhcC--c--cccCc------ce-EcC------- Confidence 88888888877443333334455666665 445555444333323233321 0 11122 11 222 Q ss_pred CCCCCcceeecCCCCCcCh---HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHH-HHHHHHHHHH Q lcl|NC_018087. 339 DGKAVTEVETLPGMTGMNE---MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKF-ISELQHKFEE 414 (520) Q Consensus 339 eGgrgTEIsTLpGg~nLge---i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KF-I~rLr~rFs~ 414 (520) + |.+++.|. .+.-+ ++-.+|..+.+.++++||...|....+ +.+ +.+.-.-+.|..+ |.-+-.++.. T Consensus 257 ~---G~~~~~l~--~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~-~t~---sn~eq~~~~f~~~tL~P~l~~ie~ 327 (518) T protein:vir:10 257 E---GMEPIPLQ--LTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDR-ATF---SNISAQMRAFYRDTMAIPIARIQS 327 (518) T ss_pred C---CceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCC-CCc---hhHHHHHHHHHHHHHHHHHHHHHH Confidence 2 44555553 22222 444557889999999999988853221 111 1122222334443 3444444444 Q ss_pred HHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHH Q lcl|NC_018087. 415 IFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDE 494 (520) Q Consensus 415 if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDe 494 (520) .+...| .+..+ -...|+|+. +++...+ ++.|.+.++.+-.. -++|.+-+++ .+++..- T Consensus 328 ~ln~~L---------~~~~~---~~~~~~fd~------~~llr~D-~~~r~~~~~~~~~~--G~lT~NE~R~-~~Gl~pi 385 (518) T protein:vir:10 328 AMDKYV---------GQYWV---RKNRMKFDI------DDVIQPD-WEAKSESTQKMVNS--GVATPNEGRE-IMGLPRS 385 (518) T ss_pred HHHHhh---------ccccc---CCceEEEec------hhhhccC-HHHHHHHHHHHHhC--CCcCHHHHHH-HhCCCCC Confidence 333322 23221 122344332 2222222 24566666666433 3667777774 4777543 Q ss_pred HHHHHHH---------------HHHHhhhcCCccCCccc------cC Q lcl|NC_018087. 495 DIAAERK---------------LIDEELSDKIFNPPEPE------EI 520 (520) Q Consensus 495 eI~~~~k---------------qi~~E~~~~~~~~p~~e------~~ 520 (520) |-..-++ .-.+....+.-++|.+. |- T Consensus 386 e~~~gD~~~~~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~ 432 (518) T protein:vir:10 386 DDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPASTPVASLDQS 432 (518) T ss_pred CCCCCCeeeecccceecccccccccCCCCCCCCCCCCcccccccccc Confidence 2100000 00000011111122211 11 No 84 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=97.38 E-value=7.9e-05 Score=43.07 Aligned_cols=398 Identities=12% Similarity=0.093 Sum_probs=185.6 Q ss_pred hhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchhH Q lcl|NC_018087. 10 KMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDN 89 (520) Q Consensus 10 ~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~ 89 (520) =||..|.+..++ |.+.+. + .. +..|+.....+.-...+.. +..+++|-|.. T Consensus 1 m~~~~~~~~~~~-----------~~s~~~---~---w~----~~~~~~~~~~~~~g~~vt~--------~~al~~~~v~~ 51 (421) T protein:vir:10 1 MFIPQMFEGKKR-----------SVSGGG---F---WE----AMLGGVRSSHSKAGVMITP--------ETALALSAVRA 51 (421) T ss_pred CCCcchhccccc-----------ccCcch---h---hH----HHhhhhccCcccCCceech--------HHhhccHHHHH Confidence 234445544432 221111 0 00 0011111111111111111 13468899999 Q ss_pred HHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHH-HHHHHHHH-HhcchhhhHHHHH----hhccccceeEEEeeec Q lcl|NC_018087. 90 AVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLIS-DEFNSVLN-MLNFQRKGSDHFK----RWYVDSRVFFHKIINP 163 (520) Q Consensus 90 Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~-eeF~~i~~-ll~f~k~g~~~fR----rWYvDgri~~hkvid~ 163 (520) ||+-|.+.+.-+ |+.|--.+.. . -++.+. .....+|+ --|-..++.++.+ .+.+.|.-|..++-|. T Consensus 52 ~i~~Ia~~iA~l-----p~~~~~~~~~--g-~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~ 123 (421) T protein:vir:10 52 CVTLLAESVAQL-----PVELYRRDKN--G-GRQRATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDG 123 (421) T ss_pred HHHHHHHhhccC-----ceEEEEEcCC--C-ceeecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC Confidence 999999987643 2222111000 0 000111 11122222 1344455666544 4678899999888663 Q ss_pred CCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeeccccc Q lcl|NC_018087. 164 NRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVD 243 (520) Q Consensus 164 ~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d 243 (520) . .-+++|.+|+|..+..++. .++.. ||.+... +-.+|.+.|+|... .. T Consensus 124 ~---G~~~~L~~l~~~~v~v~~~-----~~g~~-------~y~~~~~---------------g~~~~~~eiih~~~--~~ 171 (421) T protein:vir:10 124 K---GYPKELIPINPKKVIVLKG-----PDGMP-------YYEIPEI---------------GETLPMRMMHHVKV--FS 171 (421) T ss_pred C---CcEEEEEEecCceEEEEEC-----CCceE-------EEEEcCC---------------CcEEchhhEEEecC--cC Confidence 2 2489999999999987542 22221 2222111 11467888887753 45 Q ss_pred CCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccc Q lcl|NC_018087. 244 CCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQ 323 (520) Q Consensus 244 ~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~ 323 (520) .++...+|.|+.|.+++.....+++...=+=---+--+-+...+- +++..+.++-...+..+++++.- | ..+. T Consensus 172 ~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-~~~~~~~~e~~~~~~~~~~~~~~-----g-~~n~ 244 (421) T protein:vir:10 172 LDGYIGSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPK-EAPAIKSQEKIDQLLAKWTDRYS-----G-INNM 244 (421) T ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecC-ccCccCCHHHHHHHHHHHHHHhc-----C-cccc Confidence 667677899999999998888877765543333344556677664 33323333333444444444321 1 1111 Q ss_pred cccchhhhhhcccccCCCCCcceeecCCCCCcChH---HHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHH Q lcl|NC_018087. 324 ANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEM---DDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELS 400 (520) Q Consensus 324 ~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei---~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElk 400 (520) .+.+ .++ .|++++.|.- +.-++ +-.++..+.+.++.+||..-|...++.. + +.+.-.-+- T Consensus 245 ~~~~-vl~----------~g~~~~~l~~--~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t-~---sn~e~~~~~ 307 (421) T protein:vir:10 245 FSVA-LLQ----------EGMSYKQMSQ--DNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKAT-N---NNIEHQGLQ 307 (421) T ss_pred Ccce-ecC----------CCceEEecCC--ChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCc-c---ccHHHHHHH Confidence 1222 111 2566766643 33333 3345788889999999998875322211 1 112222223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhh Q lcl|NC_018087. 401 FDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYIS 480 (520) Q Consensus 401 F~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S 480 (520) |.++ .|+--+.. +.+.|- +.+++++++. ...++|..+... .. -+..|.+.++.+-. .-+++ T Consensus 308 f~~~--tl~P~~~~-ie~~ln-----~kL~~~~~~~----~~~v~fd~~~l~----~~-d~~~~~~~~~~~~~--~G~~T 368 (421) T protein:vir:10 308 FVMY--TLLAWLKR-HEGALQ-----RDLLLPSERR----DLYIEFNVSGLL----RG-DQKSRYESYALGRQ--WGWLS 368 (421) T ss_pred HHHH--HHHHHHHH-HHHHHh-----hhccCccccC----CeEEEEechhhh----cc-CHHHHHHHHHHHHh--CCCcC Confidence 4443 23222221 222222 2345555543 244556543321 11 12445555555422 23678 Q ss_pred HHHHHHHHhCCCHHHHHH----------HHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 481 NHTAMKDFLQMSDEDIAA----------ERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 481 ~~~i~k~IL~~tDeeI~~----------~~kqi~~E~~~~~~~~p~~e~~ 520 (520) .+-++. .|++.+-+=-+ .....+.+.+..--++.+.+++ T Consensus 369 ~NE~R~-~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~e~d~~ 417 (421) T protein:vir:10 369 VNDIRR-MENLPPIAGGDKYLTPLNMVDSAQIIPGDKKPTAQQMAEIDTI 417 (421) T ss_pred HHHHHH-HhCCCCCCCcceeeeccccccccccccCCCCcccccCcccccc Confidence 888875 46765422000 0111111222221122233333 No 85 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=97.36 E-value=8.6e-05 Score=42.87 Aligned_cols=412 Identities=12% Similarity=0.087 Sum_probs=175.2 Q ss_pred Cceeecccccccccccccccccccc----cchhHHHH-----HHHHHHHhhccchhHHH--------------------- Q lcl|NC_018087. 42 GATEVDSQDIAYNGVFQKLYGSQDP----TATSTREL-----INTYRSLLNNYEVDNAV--------------------- 91 (520) Q Consensus 42 g~~~i~~~~~a~~g~~~~~~~~~~~----~~~~~~~L-----I~~YR~ma~~pEvd~Ai--------------------- 91 (520) -|..++...+.+............. .......+ +++|+.+..+++-+..| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~ 80 (503) T protein:vir:59 1 MADIYPLGKTHTEELNEIIVESAKEIAEPDTTMIQKLIDEHNPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKT 80 (503) T ss_pred CcccccCChhhHHhHHHhhhhhhhhccchhHHHHHHHHHhhcHHHHHHHHHHhccccchhhccchhcccccccccccccc Confidence 2222222211111100000000000 00011111 23455555555544322 Q ss_pred ---------HhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeee Q lcl|NC_018087. 92 ---------QEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIIN 162 (520) Q Consensus 92 ---------~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid 162 (520) ..||+-.+-+= -.+||++..++.+..+ .+.+ | .+ =+|+....++.+...+-|+-|.+.-+| T Consensus 81 ~~ri~~n~~~~ivd~~~~yl-~g~~~~~~~~d~~~~~----~l~~-~---~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d 150 (503) T protein:vir:59 81 NNRTSHAWHKLFVDQKTQYL-VGEPVTFTSDNKTLLE----YVNE-L---AD-DDFDDILNETVKNMSNKGIEYWHPFVD 150 (503) T ss_pred cceeecchHHHHHHHHHhhh-hcCCeeeccCcHHHHH----HHHH-H---Hh-cCHHHHHHHHHHHHhhCCeEEEEEeec Confidence 11222111111 2366777666654444 3332 2 11 278888889999999999999988776 Q ss_pred cCCCCCCeeeeEecCccceeeeeeccCC--CCccccccc-------ceecceeecCcc-cccccccceecCCcc------ Q lcl|NC_018087. 163 PNRPKDGIIELRRLDPRNVQFVRELDTK--MENGVKVVK-------GYREYFLYDTEL-ESYQCGHQHFAAGTK------ 226 (520) Q Consensus 163 ~~~~k~GI~elr~lDPr~i~~vr~i~~~--~~~~~~~~~-------~~~ey~~y~~~~-~~~~~~~~~~~~~~~------ 226 (520) . +|-..++.+||+.+.++.+-... ..-.++.+. .+.-.-+|++.. ..|............ T Consensus 151 ~----dg~~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~ 226 (503) T protein:vir:59 151 E----EGEFDYVIFPAEEMIVVYKDNTRRDILFALRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNP 226 (503) T ss_pred C----CCceEEEEEccceeEEEEeCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCccccccccccccc Confidence 3 36678999999999988653321 111121111 011112344332 111111111111100 Q ss_pred ---eecCcccEEEeeccccc----CCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHH Q lcl|NC_018087. 227 ---IKIPYSAMVYAHSGLVD----CCGKNIIGYLHRAVKPANQLK-LLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQ 298 (520) Q Consensus 227 ---~~I~~~aI~y~hSGL~d----~~~~~~~syL~~aik~~NqL~-m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeq 298 (520) +.....+. -.|.+. .++....|=++.++.....+. ++-+.+..-+.++.|-+.+--.+.-+.+ + T Consensus 227 ~~~~~~~~~~~---~~~~vPiv~~~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~-----~ 298 (503) T protein:vir:59 227 RPHMTKGGQAI---GWGRVPIIPFKNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPK-----E 298 (503) T ss_pred ccceeecceec---cCCccceEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccc-----h Confidence 00000000 011110 123334565666555555544 3355555567777775543322222211 1 Q ss_pred HHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChH-HHHHHHHHHHHHhcCCChh Q lcl|NC_018087. 299 HMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEM-DDILYFRKALYMALRVPLS 377 (520) Q Consensus 299 yl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei-~DV~YF~kkLy~aL~VP~S 377 (520) .... |..++ +. .+| ++ ..+..|-...+.+.. .-+.-+++.+|+...+|-- T Consensus 299 ~~~~-~~~~~--~~---------------------~~~--~~---~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~ 349 (503) T protein:vir:59 299 FTAN-LRYHS--VI---------------------KVS--GD---GGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDN 349 (503) T ss_pred hhhh-hhccc--ce---------------------ecc--CC---CcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCC Confidence 1110 11111 00 011 11 124555444444433 2346667788888888841 Q ss_pred hccCCCccccccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHH Q lcl|NC_018087. 378 RIPDEQTQNVFDMSTAIS--RDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEM 455 (520) Q Consensus 378 Rl~~~~~~~~~G~~~eIt--RDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~El 455 (520) .++. ..|..+... .....-..-+.+.+..|...+.++++.=+-+-++....++... ..|.+.|...---.+. T Consensus 350 --~~~~---~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~-~~i~i~f~~~~p~d~~ 423 (503) T protein:vir:59 350 --SPET---IGGGATGPALENLYALLDLKANMAERKIRAGLRLFFWFFAEYLRNTGKGDFNPD-KELTMTFTRTRIQNDS 423 (503) T ss_pred --Cccc---ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccc-cceeEEeCCCCCCCHH Confidence 1121 113222222 2222233445666666666666665543333344444433333 3488888655444442 Q ss_pred HHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCH---HHHHHHHHHHHHhhhcC-CccCCcc-----cc---C Q lcl|NC_018087. 456 KTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSD---EDIAAERKLIDEELSDK-IFNPPEP-----EE---I 520 (520) Q Consensus 456 Ke~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tD---eeI~~~~kqi~~E~~~~-~~~~p~~-----e~---~ 520 (520) +.++++..+-.- | .+|.+++++. |...+ +|++.+.++.+++.+.. -..+++. +| . T Consensus 424 -------~~~~~~~kl~~~-G-iiS~et~l~~-l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (503) T protein:vir:59 424 -------EIVQSLVQGVTG-G-IMSKETAVAR-NPFVQDPEEELARIEEEMNQYAEMQGNLLDDEGGDDDLEEDDPN 490 (503) T ss_pred -------HHHHHHHHHHhC-C-CCchHHHHHh-CCCCCCHHHHHHHHHHHHHHHHhhhccccCccCCCCCCCcCCCC Confidence 344444444211 2 4799999977 54443 55555544333222211 1111111 11 1 No 86 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=97.33 E-value=9.4e-05 Score=42.67 Aligned_cols=373 Identities=12% Similarity=0.122 Sum_probs=169.9 Q ss_pred hhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccc-cccccccccchhHHHHHHHHHHHhhccch Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQ-KLYGSQDPTATSTRELINTYRSLLNNYEV 87 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~-~~~~~~~~~~~~~~~LI~~YR~ma~~pEv 87 (520) ++||.++.+. +.+|+...+........ ++-|... +.++. -+...++|-| T Consensus 1 Mglf~~~~~~--------------~~~~~~~~~~~~~~~~~--~~~~~~~~~~~v~--------------~~~al~~~~V 50 (384) T protein:vir:49 1 MPIFNITNLA--------------TESPPSNQDSFFDITDP--EFLDALNGSEWVS--------------AETALKNSDL 50 (384) T ss_pred CccccccccC--------------cccccccchhhccccch--hhcccccCCceec--------------hhhhhccHHH Confidence 5666543211 22233222222111100 0001000 11111 1223578999 Q ss_pred hHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHH----HHHhhccccceeEEEeeec Q lcl|NC_018087. 88 DNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSD----HFKRWYVDSRVFFHKIINP 163 (520) Q Consensus 88 d~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~----~fRrWYvDgri~~hkvid~ 163 (520) ..||+-|.+.+.-++- .+ .+. ..+.++.--|=..++.+ ++..+.+.|.-|.-++-|. T Consensus 51 ~~~i~~Ia~~ia~l~~-----~~--~~~------------~~~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~ 111 (384) T protein:vir:49 51 FSIISQLSNDLATAKI-----TT--SRK------------QLQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNE 111 (384) T ss_pred HHHHHHHHHHHhhCce-----ee--ecc------------hhhhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECC Confidence 9999999988765432 11 111 11223322333344554 4555788899999887652 Q ss_pred CCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeeccccc Q lcl|NC_018087. 164 NRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVD 243 (520) Q Consensus 164 ~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d 243 (520) +. -+++|.+|+|..++.++.- ++... +|-|... ++ ..+..+.++.+.|+|.. ... T Consensus 112 -~g--~~~~L~~l~~~~v~v~~~~-----~~~~~------~y~~~~~--~~-------~~~~~~~~~~~eVih~~--~~~ 166 (384) T protein:vir:49 112 -NG--RDMKWEYLRPSQVSFNRLD-----NQNGL------YYNITFD--DP-------RIPPKQHVPQGDILHFR--LLS 166 (384) T ss_pred -CC--cEEEEEEEcCceeEEEEcC-----CCceE------EEEEEec--Cc-------cccceeEecCccEEEec--CCC Confidence 22 3899999999999875421 12111 1111111 00 12234678999998885 234 Q ss_pred CCC-CcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCcccc Q lcl|NC_018087. 244 CCG-KNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKN 322 (520) Q Consensus 244 ~~~-~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d 322 (520) +++ ...+|-|..|+..++....++....-+--.-+--+-+..++.+..+..++++ ..+++... ...|. T Consensus 167 ~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~----~~~~~~~~----~n~~~--- 235 (384) T protein:vir:49 167 VDGGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKTKQ----SRSRQAMK----QMQGG--- 235 (384) T ss_pred CCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHH----HHHHHhcc----cCCcc--- Confidence 444 3457899999999998888888776554444666777777766655554433 23333210 11222 Q ss_pred ccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHH Q lcl|NC_018087. 323 QANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSF 401 (520) Q Consensus 323 ~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF 401 (520) .+ +=.+ |.++..|.-...-.+ ++-.++..+.+.++++||.+.|...++. .++. +.-|--+ T Consensus 236 ---~~--------vl~~---g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~----~~~~-~~~~~~~ 296 (384) T protein:vir:49 236 ---PL--------VLDD---LEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDK----QSSL-EMIYNIY 296 (384) T ss_pred ---ce--------ecCC---CceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc----cccH-HHHHHHH Confidence 21 1122 456666643223333 4556788899999999999998754321 1111 1112223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhH Q lcl|NC_018087. 402 DKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISN 481 (520) Q Consensus 402 ~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~ 481 (520) ..|+.-.-.-+..-+...|-..|.+. +... ...+.++.+..-.++++ +...+. T Consensus 297 ~~~i~~~l~pi~~~i~~~l~~~l~~~-~~~~-------------~~~~~~~~~~~~~~l~~-------------~~~~t~ 349 (384) T protein:vir:49 297 FKAVSRFLRPFVSELSKKLSCEVDAD-ILPA-------------VDPTGSNYIGLINSMVK-------------TGTLAQ 349 (384) T ss_pred HHHHHHHHHHHHHHHHHHhchhhhhh-hhhh-------------hhccchHHHHHHHHHhh-------------cCcccH Confidence 33433322222222222221111110 0000 01111111111001110 001122 Q ss_pred HHHHHHHh---CCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 482 HTAMKDFL---QMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 482 ~~i~k~IL---~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) .-+++... -++ .|+ .+.|...| .+..++.|- T Consensus 350 ~e~~~~l~~~g~~~-ne~------r~~~~~~p-~~gGd~~~~ 383 (384) T protein:vir:49 350 NQGLYVLQQAEILP-KDL------PEGETDST-LKGGETNEQ 383 (384) T ss_pred HHHHHHHhhCCCCC-hhH------HHHcCCCC-CCCCCCCCC Confidence 22221110 022 111 11123333 344444444 No 87 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=97.30 E-value=0.0001 Score=42.50 Aligned_cols=369 Identities=10% Similarity=0.102 Sum_probs=170.7 Q ss_pred hhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchh Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVD 88 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd 88 (520) ++||.+..+. .|.+.++....... ++.+. ..+-. .+ + -+..+++|-|. T Consensus 1 Mg~f~~~~~~-----------------~~~~~~~~~~~~~~--~~~~~----~~~~~-~v-~-------~~~~l~~~~v~ 48 (382) T protein:vir:48 1 MPIFNLATES-----------------PPDNQGGFFDVVDS--DFLAS----LKGNE-WV-S-------AETALRNSDLF 48 (382) T ss_pred CccccccccC-----------------Ccccccccccchhh--hcccc----ccCCc-cc-c-------hHhhhccHHHH Confidence 4555432111 11111221111100 00000 00000 01 0 02225789999 Q ss_pred HHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHH----HhhccccceeEEEeeecC Q lcl|NC_018087. 89 NAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHF----KRWYVDSRVFFHKIINPN 164 (520) Q Consensus 89 ~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~f----RrWYvDgri~~hkvid~~ 164 (520) .||+-|.+.+.-++- .+.... . +.++.-=|-..++.++. ..+.+.|.-|..++-| + T Consensus 49 ~~i~~ia~~ia~~~~-------~~~~~~-----~-------~~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd-~ 108 (382) T protein:vir:48 49 SIINQLSNDLATVKL-------ITSRKK-----L-------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRN-E 108 (382) T ss_pred HHHHHHHHhhccCce-------eeecch-----h-------hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEEC-C Confidence 999999998765432 111111 1 11222233344555544 4567889999998865 2 Q ss_pred CCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccccC Q lcl|NC_018087. 165 RPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDC 244 (520) Q Consensus 165 ~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~ 244 (520) + .-+++|.+|+|..+..++... +... +|-|... +. ..+.++.++.+.|+|.. ...+ T Consensus 109 ~--G~~~~l~~i~~~~v~v~~~~~-----~~~~------~y~~~~~--~~-------~~~~~~~~~~~evih~~--~~~~ 164 (382) T protein:vir:48 109 N--GRDMKWEYLRPSQVSFNRLDN-----KDGI------YYNITFD--DP-------RIPPKQHVPQNDVLHFR--LLSV 164 (382) T ss_pred C--CcEEEEEEEcCceeEEEEcCC-----CCeE------EEEEEec--Cc-------cccceeEEcCccEEEec--CCCC Confidence 2 238899999999998864322 1111 1111111 00 11234678888888775 3445 Q ss_pred CC-CcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccc Q lcl|NC_018087. 245 CG-KNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQ 323 (520) Q Consensus 245 ~~-~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~ 323 (520) ++ -...|.|..|++++.....+++...-+----+--+-+..++.+ +.+..+++..+..-..++| .|.+ T Consensus 165 ~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~-~~~e~~~~~~~~~~~~~~n-------~g~~--- 233 (382) T protein:vir:48 165 DGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGG-GLLDFKTKLSRSRQAMKQM-------QGGP--- 233 (382) T ss_pred CCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC-CChHHHHHHHHHHHhhccC-------CCCe--- Confidence 54 3458899999999999999988877766666777888888754 4444454444433222211 2321 Q ss_pred cccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHH Q lcl|NC_018087. 324 ANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFD 402 (520) Q Consensus 324 ~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~ 402 (520) + .+ .+ |.+++.|.-...-.+ ++-.++..+.+.++++||...|...+. ++.+.-.-..|. T Consensus 234 ---~-vl-------~~---g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~------~~~~~~~~~~~~ 293 (382) T protein:vir:48 234 ---L-VL-------DD---LEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGD------QQSSLEMSSDLY 293 (382) T ss_pred ---e-Ec-------CC---CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC------cccHHHHHHHHH Confidence 1 11 12 445655542222223 455678889999999999998853221 111111112233 Q ss_pred H-HHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhH Q lcl|NC_018087. 403 K-FISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISN 481 (520) Q Consensus 403 K-FI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~ 481 (520) + .|.-+-+.+..-+..-| .++.+.+. ...+..|.+ -.+.+++.+-. +-.++. T Consensus 294 ~~~l~p~~~~i~~~l~~~l---------~~~~~~~~-----~~~~~~~~~--------~~~~~~~~l~~-----~g~~t~ 346 (382) T protein:vir:48 294 SKAVSRYLRPFLSELSQKL---------SCDVDADI-----FPAVDPTGS--------NYISRINSLVK-----TGTLAQ 346 (382) T ss_pred HHHHHHHHHHHHHHHHHHh---------cChhhhhh-----hhhhccchh--------HHHHHHHHHhh-----cCccCH Confidence 3 22333333333222221 22222211 011111111 11122222211 123344 Q ss_pred HHHHHHHhC----CCHHHHHHHHHHHHHhhhcCCcc--CCcccc Q lcl|NC_018087. 482 HTAMKDFLQ----MSDEDIAAERKLIDEELSDKIFN--PPEPEE 519 (520) Q Consensus 482 ~~i~k~IL~----~tDeeI~~~~kqi~~E~~~~~~~--~p~~e~ 519 (520) .-++ ++|. ++++..+ .|...+..+ +.++++ T Consensus 347 ~e~r-~~l~~~g~~~~~~~~-------~~~~~~~~~GGd~~~~~ 382 (382) T protein:vir:48 347 NQGL-YILQQAEILPKELPN-------GENPNSTLKGGEEDGQD 382 (382) T ss_pred HHHH-HHHhhCCCCCcchhh-------hhcCCCCCCCCCCCCCC Confidence 4444 2231 3332211 111111111 233333 No 88 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=97.28 E-value=0.00011 Score=42.35 Aligned_cols=435 Identities=9% Similarity=0.044 Sum_probs=192.6 Q ss_pred ccCCCCCCCceeecc---c---ccccccccccccccccccch----h-HHHHHHHHHHHhhccchhHHHHhhhceeeEec Q lcl|NC_018087. 34 ITAPKFDDGATEVDS---Q---DIAYNGVFQKLYGSQDPTAT----S-TRELINTYRSLLNNYEVDNAVQEIVSDAIVYE 102 (520) Q Consensus 34 ~~~p~~~dg~~~i~~---~---~~a~~g~~~~~~~~~~~~~~----~-~~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~d 102 (520) .++-.+.|-...+.. . ...-.--+..+|-|- ..+. . ..++ +.+|. .++=+.-+|+..++=. +.+ T Consensus 1 ~~~~~~~d~~~~i~~L~~~~~~~~~r~~~~~~Yy~g~-~~i~~~~~~~~~~~-~~~~~--~~n~~~~ivd~~a~~l-~~~ 75 (488) T protein:vir:23 1 MAETESIDPEKLRDQLLDAFENKQNELKSSKAYYDAE-RRPDAIGLAVPLDM-RKYLA--HVGYPRTYVDAIAERQ-ELE 75 (488) T ss_pred CCcccCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-cchhhcCcccchhh-hhhhh--hcchHHHHHHHHHHhh-hcc Confidence 222222222211110 0 000000000111110 0000 0 0000 11111 1122223333333211 100 Q ss_pred CCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeec----CCCCCCeeeeEecCc Q lcl|NC_018087. 103 EGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINP----NRPKDGIIELRRLDP 178 (520) Q Consensus 103 ~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~----~~~k~GI~elr~lDP 178 (520) -..-+. ......+..-.+...+.+..|...-+|+....++.+.+++-|+-|...-.+. -.+.+|..-++.++| T Consensus 76 Gf~~~~---~~~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p 152 (488) T protein:vir:23 76 GFRIPS---ANGEEPESGGENDPASELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVPLIRVEPP 152 (488) T ss_pred ceeccC---CcccccccccchhHHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcceEEEecc Confidence 000000 0001111111223445667777777899999999999999999876654321 224556677889999 Q ss_pred cceeeeeecc-CCCCcccccc-----cceecceeecCccccccc-cccee-c---CCcce-ecCcccEEEeecccccCCC Q lcl|NC_018087. 179 RNVQFVRELD-TKMENGVKVV-----KGYREYFLYDTELESYQC-GHQHF-A---AGTKI-KIPYSAMVYAHSGLVDCCG 246 (520) Q Consensus 179 r~i~~vr~i~-~~~~~~~~~~-----~~~~ey~~y~~~~~~~~~-~~~~~-~---~~~~~-~I~~~aI~y~hSGL~d~~~ 246 (520) +.+-.+.+=. ....-++..+ +.+..+.+|.+....++. +.... . ....+ ++|- |.|++.. +..+ T Consensus 153 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPv--v~f~n~~--~~~~ 228 (488) T protein:vir:23 153 TALYAEVDPRTRKVLYAIRAIYGADGNEIVSATLYLPDTTMTWLRAEGEWEAPTSTPHGLEMVPV--IPISNRT--RLSD 228 (488) T ss_pred ceeEEEEecCCCceEEEEEEEEecCCCcEEEEEEEecCcEEEEEecCCceEeccccccCCCCcce--EEecccc--ccCC Confidence 9988876411 1111222211 112223355443221111 10000 0 00011 2222 4455432 2223 Q ss_pred CcchhhhHHHHHHH-H-HHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCcccccc Q lcl|NC_018087. 247 KNIIGYLHRAVKPA-N-QLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQA 324 (520) Q Consensus 247 ~~~~syL~~aik~~-N-qL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~ 324 (520) ..+.|-|.+.++++ . -=+++-+..+.-....-|.|-|.=.+....+.... +.+....+..| T Consensus 229 ~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~-----------~~~~~~~~~~~------ 291 (488) T protein:vir:23 229 LYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGINAE-----------TGQRMFDAYMA------ 291 (488) T ss_pred cCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCccccccccc-----------ccchhhhhhhh------ Confidence 33456665554443 2 22455566666666667777554222111111000 00001111112 Q ss_pred ccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHH Q lcl|NC_018087. 325 NMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKF 404 (520) Q Consensus 325 ~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KF 404 (520) .-|.- ++|.+.++-++++.+-=+-++-++=.-..++...++|..-|...+. +. ..+..|.--+..+-.- T Consensus 292 -------~v~~~--~~g~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~-n~-~Sg~Al~~~~~~l~~k 360 (488) T protein:vir:23 292 -------RILAF--EGGEGAHAEQFSAAELRNFVDALDALDRKAASYSGLPPQYLSSSSD-NP-ASAEAIKAAESRLVKK 360 (488) T ss_pred -------hhccC--CCCCCceeEecCCCChHHHHHHHHHHHHHHhcccCCCHHHhccccC-cc-hHHHHHHHHHHHHHHH Confidence 12322 2344456777887442223333444455566677888766643221 11 1223455555557777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCC-ChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHH Q lcl|NC_018087. 405 ISELQHKFEEIFLSPLKSNLLLKRVI-TEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHT 483 (520) Q Consensus 405 I~rLr~rFs~if~d~Lk~QLiLkgi~-t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~ 483 (520) +.+.++.|..-+...++.=+.+.|.. ...+| ..|.+.|.....=+. .+.++++..+..-....+|.++ T Consensus 361 ~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~----~~i~v~f~~~~~~s~-------~~~ada~~kl~~~g~~~~s~et 429 (488) T protein:vir:23 361 VERKNKIFGGAWEQAMRLAYKMVKGGDIPTEY----YRMETVWRDPSTPTY-------AAKADAAAKLFANGAGLIPRER 429 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcchhh----ccceEEecCCCCCCH-------HHHHHHHHHHHhcccccCCHHH Confidence 88888888888888888776665543 22333 357888864443333 3444555555433334679999 Q ss_pred HHHHHhCCCHHHHHHHHHHHHHhhhcCC-------ccCCccccC Q lcl|NC_018087. 484 AMKDFLQMSDEDIAAERKLIDEELSDKI-------FNPPEPEEI 520 (520) Q Consensus 484 i~k~IL~~tDeeI~~~~kqi~~E~~~~~-------~~~p~~e~~ 520 (520) ++.. |.+++++++++++..++|..+.. -..+++++- T Consensus 430 ~~~~-l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (488) T protein:vir:23 430 GWVD-MGYTIVEREQMRQWLEQDQKQGLGLIGSLYGASTPEGKP 472 (488) T ss_pred HHHh-CCCCchHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccC Confidence 9966 79999988877654444432210 011111111 No 89 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=97.22 E-value=0.00013 Score=41.94 Aligned_cols=386 Identities=10% Similarity=0.050 Sum_probs=187.1 Q ss_pred hhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchh Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVD 88 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd 88 (520) ++||..|.++.+. . +.+.|.. -...+. +++ ....+. .+ +. ....++|-|. T Consensus 1 Mg~f~~lf~r~~~---------~-~~~~~~~--~~~~~~-------~~~-~~~~g~--~v-~~-------~~al~~~~v~ 50 (414) T protein:vir:44 1 MVFFSGLFQRKSD---------A-PVTTPAE--LADAIG-------LSY-DTYTGK--QI-SS-------QRAMRLTAVF 50 (414) T ss_pred CchhhhhhccCcc---------C-cccchhh--HhHhhc-------cCc-cccCCc--ee-ch-------hhhhccHHHH Confidence 6677655554321 1 1111110 000110 000 011110 00 10 1235688899 Q ss_pred HHHHhhhceeeEec-----CCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhc----chhhhHH----HHHhhccccce Q lcl|NC_018087. 89 NAVQEIVSDAIVYE-----EGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLN----FQRKGSD----HFKRWYVDSRV 155 (520) Q Consensus 89 ~Ai~eIvneaiv~d-----~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~----f~k~g~~----~fRrWYvDgri 155 (520) .||+-|.+.+.-++ .+++... .. .-..+..+|+ =...+.+ ++..+.+.|.- T Consensus 51 ~~i~~Ia~~ia~~p~~~~~~~~~~~~--------------~~--~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna 114 (414) T protein:vir:44 51 SCVRVLAESVGMLPCNLYHLNGSLKQ--------------RA--TGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNF 114 (414) T ss_pred HHHHHHHHHhccCceEEEEecCCcee--------------ec--ccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCe Confidence 99999988865222 2211110 00 0111223332 2233333 55567889999 Q ss_pred eEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEE Q lcl|NC_018087. 156 FFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMV 235 (520) Q Consensus 156 ~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~ 235 (520) |..++-+ ...+.+|.+|+|.++...++- ++.. .|.++.+ .+....++.+.|+ T Consensus 115 ~~~i~~~----~g~~~~L~~l~~~~v~~~~~~-----~~~~------~y~~~~~-------------~g~~~~~~~~evi 166 (414) T protein:vir:44 115 YAYKVKA----FGEVAELLPVDPGCVVPKLNS-----SWEP------VYQVTFP-------------DGSTDVLSQEDIW 166 (414) T ss_pred EEEEEeC----CCcEEEEEEEcCceEEEEECC-----CCcE------EEEEEec-------------CceEEEEccccEE Confidence 8876532 246999999999998875321 1111 1222111 1223578999998 Q ss_pred EeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeec Q lcl|NC_018087. 236 YAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDA 315 (520) Q Consensus 236 y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~ 315 (520) |.. + ...++...+|-+..|..++.....+++...-+----+--+-++.+| ++|.+..+++..+.+...|+. T Consensus 167 h~~-~-~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g------ 237 (414) T protein:vir:44 167 HVR-T-LTLDGLVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTE-QTLSDQAYERLKKDFEERHTG------ 237 (414) T ss_pred Eec-C-CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC-CCCCHHHHHHHHHHHHHHhcC------ Confidence 885 3 3567777788999999999888888877766555455567788887 467766666655555555542 Q ss_pred CCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh---HHHHHHHHHHHHHhcCCChhhccCCCccccccccc Q lcl|NC_018087. 316 RTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE---MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMST 392 (520) Q Consensus 316 ~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~ 392 (520) +.|..+.+ .++ .|++++.|.- +.-+ ++-.++....+.++++||.+-|...++.. . + T Consensus 238 ----~~n~~~~~-vl~----------~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t--~--~ 296 (414) T protein:vir:44 238 ----LGNAHRPM-ILE----------MGLDWKSMAL--NAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRAT--F--N 296 (414) T ss_pred ----ccccCcce-ecC----------CCceEEEccC--ChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--c--c Confidence 11111222 111 2456666632 2223 34455778889999999999886433211 0 1 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018087. 393 AISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLME 472 (520) Q Consensus 393 eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~ 472 (520) .+.-.-..|.++ .|+--. ..+.+.|- +.++++.++. .+.+.|..+ ++...+ +..|.+.++.+- T Consensus 297 n~e~~~~~~~~~--~l~P~~-~~ie~~ln-----~~L~~~~~~~----~~~i~fd~~----~ll~~d-~~~~~~~~~~~~ 359 (414) T protein:vir:44 297 NIEELGLGFINY--SLVPYL-TRIEQRIN-----TGLVRKSKQG----VFYAKFNAG----ALLRGD-MKSRFEAYATGI 359 (414) T ss_pred cHHHHHHHHHHH--HHHHHH-HHHHHHHH-----hhcCCccccC----ceEEEEech----hhhccC-HHHHHHHHHHHH Confidence 111111234433 232211 12223222 3445656553 234555533 222222 234555555442 Q ss_pred cccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHh---------hhcCCccCCccccC Q lcl|NC_018087. 473 PYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEE---------LSDKIFNPPEPEEI 520 (520) Q Consensus 473 p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E---------~~~~~~~~p~~e~~ 520 (520) . +-+++.+-++. +++|.+-+ .-++.+-.- .+.+-=+++..++- T Consensus 360 ~--~G~~t~NE~R~-~~gl~p~~--ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~d~ 411 (414) T protein:vir:44 360 N--WGIYSPNDCRD-LEDMNPRP--GGDVYLTPMNMTTKPSDGSKAGKQKDNANADE 411 (414) T ss_pred h--CCCcCHHHHHH-HhCCCCCC--CcceecccccccccCCccccCCCCCCCCCCCC Confidence 2 23678888884 57775422 111111000 00000011111111 No 90 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=97.18 E-value=0.00014 Score=41.70 Aligned_cols=434 Identities=10% Similarity=0.053 Sum_probs=190.4 Q ss_pred hccCCCcccCCCCCCCceeeccc---ccccccccccccccccccchh-HHHHHHHHHHH-hhccchhHHHHhhhceeeEe Q lcl|NC_018087. 27 INDKAESITAPKFDDGATEVDSQ---DIAYNGVFQKLYGSQDPTATS-TRELINTYRSL-LNNYEVDNAVQEIVSDAIVY 101 (520) Q Consensus 27 ~~~~~~s~~~p~~~dg~~~i~~~---~~a~~g~~~~~~~~~~~~~~~-~~~LI~~YR~m-a~~pEvd~Ai~eIvneaiv~ 101 (520) +......-.+.+.++=...+... -..----...+|.|- ..+.. ....-+.+|.. +.+.=+.-+|+..+.-.. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~G~-~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~-~ 78 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTERTQDLGDNTAYYESE-RRPDAVGVTVPQQMQKLLAHVGYPRLYIDAIAARQE-L 78 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-ccchhcccccchhHHhhhhhcCcHHHHHHHHHhhhc-c Confidence 21111111111111100000000 000000001111111 00000 00001111111 111222223333332111 0 Q ss_pred cCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCC----CCCeeeeEecC Q lcl|NC_018087. 102 EEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRP----KDGIIELRRLD 177 (520) Q Consensus 102 d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~----k~GI~elr~lD 177 (520) +.+++. +.. .-.+.+..|..--+|+....++++.=.+.|+-|++.-.+.... ..+...++.++ T Consensus 79 ----~g~~~~-~~~--------~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~ 145 (484) T protein:vir:77 79 ----EGFRLG-GAD--------KADEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEP 145 (484) T ss_pred ----CceecC-Ccc--------hhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEec Confidence 111111 111 1234566677777889999999999999999888766553321 23455689999 Q ss_pred ccceeeeeeccC-CCCcccccc-----cceecceeecCcccccc-c-ccceec---CCcce-ecCcccEEEeecccccCC Q lcl|NC_018087. 178 PRNVQFVRELDT-KMENGVKVV-----KGYREYFLYDTELESYQ-C-GHQHFA---AGTKI-KIPYSAMVYAHSGLVDCC 245 (520) Q Consensus 178 Pr~i~~vr~i~~-~~~~~~~~~-----~~~~ey~~y~~~~~~~~-~-~~~~~~---~~~~~-~I~~~aI~y~hSGL~d~~ 245 (520) |+.+..+.+-.+ +..-++.++ +.+..+.+|.+....+. . ++.... ....+ ++| -|.|++.- +.. T Consensus 146 p~~~~~~~D~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vP--vv~f~N~~--~~~ 221 (484) T protein:vir:77 146 PTNLYAQIDPRTRQVMRAIRAIEDEEGNEVIGATLYLPNNTVIWNREDGQWVQVANVAHNLEMVP--VIPIPNRT--RLS 221 (484) T ss_pred cceeEEEecCCCCceEEEEEEEEeecCCcEEEEEEEecCeEEEEEecCCceEeeccccCCCCCcc--eEEecccc--ccC Confidence 999887754211 111111111 11222334433221110 0 000000 00111 233 24455422 222 Q ss_pred CCcchhhhHHHHHHH-HHH-HHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccc Q lcl|NC_018087. 246 GKNIIGYLHRAVKPA-NQL-KLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQ 323 (520) Q Consensus 246 ~~~~~syL~~aik~~-NqL-~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~ 323 (520) +..+.|=+.+.++++ ..+ +.+-+.+++-+..-.|.|-+.-.+....+.... +.....++..| T Consensus 222 ~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~-----------~~~~~~~~~~~----- 285 (484) T protein:vir:77 222 DLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVKGEELGVDPE-----------TGQTLFDAYLA----- 285 (484) T ss_pred ccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCCcchhccccc-----------ccchhhhhhhh----- Confidence 222344455444333 232 445566666666667777554333333221110 00011111111 Q ss_pred cccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHH Q lcl|NC_018087. 324 ANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDK 403 (520) Q Consensus 324 ~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~K 403 (520) .+|..- +-++.+.++++.+-=+-++-++-.-.++....++|.+-|...+. +. ..+..|.--+..+-. T Consensus 286 --------~~~~~~---~~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~-n~-~Sg~Al~~~~~~l~~ 352 (484) T protein:vir:77 286 --------RILAFE---DHESKAQQFSAAELRNFVDALDALDRKAAAYTGLPPYYLSFSSE-NP-ASAEAIRSSESRLVK 352 (484) T ss_pred --------hhcccC---CCCceeEeecCCChHHHHHHHHHHHHHHhcccCCCHHHhccccC-cc-hHHHHHHHHHHHHHH Confidence 345432 12456777776442223444555556666677888877754322 11 123345555666777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCC-hhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHH Q lcl|NC_018087. 404 FISELQHKFEEIFLSPLKSNLLLKRVIT-EDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNH 482 (520) Q Consensus 404 FI~rLr~rFs~if~d~Lk~QLiLkgi~t-~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~ 482 (520) -+.+.|+.|..-+...++.-+.+.|... ..+| ..|.+.|..-..-+. .+.++.+.++..-....+|.+ T Consensus 353 ka~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~----~~i~v~w~~~~~~s~-------~~~ad~~~kl~~~g~gi~s~e 421 (484) T protein:vir:77 353 TVERKNKIFGGAWEQAMRVAYKVMNGGDIPPEY----YRMESIWRDPSTPTY-------AAKADAATKLYNNGQGVIPKE 421 (484) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCCCccccc----ccceEEecCCCCCCH-------HHHHHHHHHHHhccCCCCCHH Confidence 7888899999988888887666665321 1222 347888864443333 344555555543322467999 Q ss_pred HHHHHHhCCCHHHHHHHHHHHHHhhhcCC-------------ccCCccccC Q lcl|NC_018087. 483 TAMKDFLQMSDEDIAAERKLIDEELSDKI-------------FNPPEPEEI 520 (520) Q Consensus 483 ~i~k~IL~~tDeeI~~~~kqi~~E~~~~~-------------~~~p~~e~~ 520 (520) +++.. |.+++.+++++++..++|.-++. ..+++..+- T Consensus 422 t~~~~-l~~~~~~~~e~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (484) T protein:vir:77 422 RARID-MGYSITEREEMRKWDEEEQAQGLGLMGTMFGTDPSGGGNPDNPET 471 (484) T ss_pred HHHhc-CCCChhHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCCCCc Confidence 98855 89999999887665554432110 011111111 No 91 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=97.17 E-value=0.00014 Score=41.66 Aligned_cols=362 Identities=12% Similarity=0.102 Sum_probs=169.4 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRS 80 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ 80 (520) |. ++||+|+.+..... ..+..+...+...+. .+ ....++..+.++ . -+. T Consensus 1 m~------m~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~--~~----~~~~~~~~g~~v-------------~-~~~ 49 (392) T protein:vir:74 1 MI------LPILNFINQTNDPP-----EAGSVQSYFPDGNDA--QI----MESLLGDNNEWV-------------S-ARA 49 (392) T ss_pred Cc------chhhhhhhcccCcc-----cccccccccccCchh--hh----hhhccCCCCccc-------------c-hhh Confidence 44 35666655432211 011111111111110 00 000011111111 1 133 Q ss_pred HhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHH----HHHhhcccccee Q lcl|NC_018087. 81 LLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSD----HFKRWYVDSRVF 156 (520) Q Consensus 81 ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~----~fRrWYvDgri~ 156 (520) .+++|-|..||+-|.+.+.-. |+.+ .... ... +++-=|-..++.+ ++..+++.|.-| T Consensus 50 al~~~~v~~~v~~ia~~ia~l-----p~~~--~~~~-----~~~-------l~~~PN~~~t~~~f~~~~~~~lll~Gna~ 110 (392) T protein:vir:74 50 ALRNSDLFSIILQLSSDLAIV-----KINA--EKKK-----NQG-------IIDNPSTNANKHGFWQSMFAQLLLGGEAF 110 (392) T ss_pred hhcchHHHHHHHHHHHhhccC-----ceee--ccch-----hhh-------hhhhcCCCCCHHHHHHHHHHHhhhcCCEE Confidence 467899999999999987543 2221 1110 011 1222222334444 455688899999 Q ss_pred EEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEE Q lcl|NC_018087. 157 FHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVY 236 (520) Q Consensus 157 ~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y 236 (520) ..++-|. ...+++|.+|+|..|+..+.-. +... -|.++... ...+..+.++.+.|+| T Consensus 111 ~~i~r~~---~G~~~~L~~i~~~~v~v~~~~~-----~~~~-----~y~~~~~~----------~~~~~~~~~~~~evih 167 (392) T protein:vir:74 111 AYRWRNA---NGADMKWEYLRPSQVNTYYFEY-----ENGM-----YYNITFDD----------PKIEPILQAPQSDLIH 167 (392) T ss_pred EEEEECC---CCcEEEEEEEcCceeEEEEcCC-----CceE-----EEEEEecC----------CccceeEEEcCccEEE Confidence 8888653 2349999999999998764322 1111 11111110 0112235788999988 Q ss_pred eecccccCCCC-cchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeec Q lcl|NC_018087. 237 AHSGLVDCCGK-NIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDA 315 (520) Q Consensus 237 ~hSGL~d~~~~-~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~ 315 (520) .. ...+++. ..+|-|..|+..+.....+++...-+=---+--+-+..++-+..+..++.+.++ ..|+.. . T Consensus 168 ~~--~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~~~~~~~---~~~~~~----~ 238 (392) T protein:vir:74 168 MK--LLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRS---RSFMKR----S 238 (392) T ss_pred ec--CCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHH---HHHhcc----c Confidence 86 3556663 468999999999999988888766555555666777777766555544433322 333321 1 Q ss_pred CCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchh Q lcl|NC_018087. 316 RTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAI 394 (520) Q Consensus 316 ~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eI 394 (520) ..| +.+ .++ .|++++.|.-.....| ++=.+|..+...++++||..-|...+... +.+ T Consensus 239 n~g------~~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~-----~~~ 296 (392) T protein:vir:74 239 RSG------GPV-VLD----------DLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQ-----SSI 296 (392) T ss_pred cCC------Cee-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc-----cHH Confidence 112 122 221 2566777743333333 44467888999999999998885432111 111 Q ss_pred hHHHHHHHH-HHHHHHHHHHHHHHHHHHH----------------------HHHhcCCCChhhHHhhhhceEEEeeccch Q lcl|NC_018087. 395 SRDELSFDK-FISELQHKFEEIFLSPLKS----------------------NLLLKRVITEDEWEAELNNIKIVFHKNSY 451 (520) Q Consensus 395 tRDElkF~K-FI~rLr~rFs~if~d~Lk~----------------------QLiLkgi~t~eew~~~~~~I~~~f~~Dn~ 451 (520) .- -..|.. .+.-+.+++..-+...|-+ .|+-.|++|.+|.-++.... .+..+ T Consensus 297 e~-~~~~~~~~l~p~~~~ie~~l~~~l~~~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~--g~~pn-- 371 (392) T protein:vir:74 297 QQ-ISGMYASALNRYLRPAISELEYKLSDHISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEA--GYIPK-- 371 (392) T ss_pred HH-HHHHHHHHHHHHHHHHHHHHHHhccchhcccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhC--CCCcc-- Confidence 11 111322 2333333333323222211 23333444444443322110 01100 Q ss_pred HHHHHHHHHHHHHHHHHHHh-hcccchhhh Q lcl|NC_018087. 452 FSEMKTIEITERRVNVLSLM-EPYIGKYIS 480 (520) Q Consensus 452 f~ElKe~Ei~~~R~~~~~~~-~p~vgky~S 480 (520) |. +++.++ ..+ .+=...-.. T Consensus 372 --e~------r~~enl-~~~~~Gd~~~p~p 392 (392) T protein:vir:74 372 --DL------PAPENT-NKKTTGQSNEPVP 392 (392) T ss_pred --cc------chhcCC-CCCCCCCCCCCCC Confidence 00 111111 111 111122334 No 92 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=97.17 E-value=0.00014 Score=41.66 Aligned_cols=391 Identities=12% Similarity=0.094 Sum_probs=183.6 Q ss_pred cccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHh Q lcl|NC_018087. 3 MLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLL 82 (520) Q Consensus 3 ~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma 82 (520) |..++++.-++-+.-.+. ++. |.+ +... +.+.....|.++ ..+... T Consensus 1 ~~~~~~~~~~k~~~~~~~------~~~---~~~------~~~~-------~~~~~~~~~~~v------------~~~~a~ 46 (409) T protein:vir:94 1 MAKENIVTRIKKKLIDNW------IDQ---SAS------KLYD-------FSPWKNKSFWGV------------INNTLE 46 (409) T ss_pred CcccccchhhhhHHhhhh------hcC---Ccc------cccc-------cccccCcccccc------------chhhhh Confidence 777777765554331111 100 110 0000 001111111111 111245 Q ss_pred hccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHH-hcchhhhHH----HHHhhccccceeE Q lcl|NC_018087. 83 NNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNM-LNFQRKGSD----HFKRWYVDSRVFF 157 (520) Q Consensus 83 ~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~l-l~f~k~g~~----~fRrWYvDgri~~ 157 (520) ++|.|..||+-|.+.+.-++ +.+- +.++.. .....++|+. =|-..++++ ++..+.+.|.-|. T Consensus 47 ~~~~v~~~i~~Ia~~ia~lp-----~~~~-~~~~~~-------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~ 113 (409) T protein:vir:94 47 TNETIFSAITKLSNSMASLP-----LKMY-EDYKVV-------NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYV 113 (409) T ss_pred ccHHHHHHHHHHHHhhhhCc-----eeEe-eccccc-------chhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEE Confidence 68899999999998887543 2121 111111 1112223321 233334444 4555788899888 Q ss_pred EEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEe Q lcl|NC_018087. 158 HKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYA 237 (520) Q Consensus 158 hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~ 237 (520) .++-|. ..-+++|.+|+|..+..+++- ++.. -||.|... .+.++.++.+.|+|. T Consensus 114 ~i~r~~---~G~~~~L~~l~~~~v~v~~~~-----~~~~------~~y~~~~~------------~g~~~~~~~~dvih~ 167 (409) T protein:vir:94 114 LIERDI---YHQPSKLFLLNPDVVEMLIEN-----QSRE------LYYSIHAA------------TGNKLIVHNMDMLHF 167 (409) T ss_pred EEEECC---CCcEEEEEEEcCceeEEEEeC-----CCcE------EEEEEEcC------------CceEEEEccccEEEe Confidence 877552 223889999999999886431 1111 12222111 123467899999888 Q ss_pred ecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCC Q lcl|NC_018087. 238 HSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDART 317 (520) Q Consensus 238 hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~T 317 (520) - +.-..++...+|-|..|.+.......++.. -+....+.| .++..-.+.+.+.+++...+.+.+.|. ++ T Consensus 168 r-~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~-~~~~~~~~~--~~i~~~~~~l~~e~~~~~~~~~~~~~~-------~~ 236 (409) T protein:vir:94 168 K-HIVASNMVQGISPIDVLKNTTDFDNAVRTF-NLTEMQKPD--SFMLKYGSNVGKEKRQQVLEDFKQYYE-------EN 236 (409) T ss_pred c-CCCCCCccccccHHHHHHHHHHHHHHHHHH-HHHhcCCCC--eeEEecCCCCCHHHHHHHHHHHHHHhh-------cC Confidence 3 222334445567777777776666556554 344444444 233334556666666555555444343 23 Q ss_pred CccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhH Q lcl|NC_018087. 318 GKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISR 396 (520) Q Consensus 318 Gev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItR 396 (520) |.+ + .++ .|.+++.|.-...-.| ++-..|-.+.+.++++||...|...++. .+ +.+.- T Consensus 237 g~~------~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~-~~---sn~e~ 295 (409) T protein:vir:94 237 GGI------L-FQE----------PGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNT-NF---AKNEE 295 (409) T ss_pred CCe------e-ecC----------CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC-Cc---ccHHH Confidence 321 1 121 2567877753322222 3334566788999999999988743321 11 12222 Q ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_018087. 397 DELSFDKF-ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYI 475 (520) Q Consensus 397 DElkF~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~v 475 (520) .-+.|.++ +.-+-.++..- +-+.++++.+|.. ...+.|..+ ++...+ +..|++.+..+-.- T Consensus 296 ~~~~f~~~~l~P~~~~ie~~---------ln~~Ll~~~~~~~---~~~i~fd~~----~ll~~d-~~~~~~~~~~~~~~- 357 (409) T protein:vir:94 296 LNRFYLQHTLLPIVKQYEEE---------FNRKLLTKTDREK---NRYFKFNVK----SYLRAD-SATQAEVYFKAVRS- 357 (409) T ss_pred HHHHHHHHHHHHHHHHHHHH---------HHHhhCCcccccC---cceEEeech----hhhccC-HHHHHHHHHHHHhC- Confidence 23335554 33333333221 2223456665542 234445432 333333 35566666655322 Q ss_pred chhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCC---------cc--CCccccC Q lcl|NC_018087. 476 GKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKI---------FN--PPEPEEI 520 (520) Q Consensus 476 gky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~---------~~--~p~~e~~ 520 (520) -+++..-++. ++++.+-+ --++-+-.-.-.++ .+ +.+..|= T Consensus 358 -G~~T~NE~R~-~~g~~p~~--ggD~~~~~~n~~~~~~~~~~~~~~kGG~~n~~e~ 409 (409) T protein:vir:94 358 -GYYTINDIRE-WEDLPPVE--GGDKPLISGDLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred -CCcCHHHHHH-HhCCCCCC--CcCeEeecccccccccchhhcccccCCCCCcCCC Confidence 3667777764 46665432 00000000000000 00 0000000 No 93 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=97.13 E-value=0.00016 Score=41.43 Aligned_cols=429 Identities=13% Similarity=0.089 Sum_probs=198.5 Q ss_pred ceeecccccccccccccccccccccchhHHHHH-------HHHHHHhhccchhHHHHhhhc--------eeeEecCCCcE Q lcl|NC_018087. 43 ATEVDSQDIAYNGVFQKLYGSQDPTATSTRELI-------NTYRSLLNNYEVDNAVQEIVS--------DAIVYEEGFDV 107 (520) Q Consensus 43 ~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI-------~~YR~ma~~pEvd~Ai~eIvn--------eaiv~d~~~~~ 107 (520) -|+|.++..++..-..+..-+ ......+|+ .+|+.+..|++-+..|..+-. -.+|+.=..-+ T Consensus 1 ~~~~~~~~~~~~~~~~~l~~~---e~~~i~~L~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n~~~~i 77 (504) T protein:vir:99 1 MTEETTSASKFTFRIPELNDD---VVDKVNGLYQQLVDRTPRNLLRASFYDGKYAIRQIGNLIPPEYLRTATVLGWSAKA 77 (504) T ss_pred CCccCCcccccccccCCCCHH---HHHHHHHHHHHHHHHhHHHHHHHHHHhccccchhccccccHHHHHHhhccCcHHHH Confidence 444444333322111111100 000111222 234444444443333322110 00111110000 Q ss_pred E-----EEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCcccee Q lcl|NC_018087. 108 V-----SIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQ 182 (520) Q Consensus 108 V-----~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~ 182 (520) | .|.++.--+.+. +.-.+....|...=+|+....+..+.=++.||-| -.|+... ..++..-++.++|+.+- T Consensus 78 Vd~~a~rl~~~Gf~~~d~--~~~~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af-~~v~~~~-d~~~~~~I~~~sP~~~~ 153 (504) T protein:vir:99 78 VDTLARRCNLESFVWPDG--DYGSIGGPDVWDENFFATKANNAMVSSLIHGPAF-LINTEGG-AGEPDSLIHVKSAMQAT 153 (504) T ss_pred HHHHHhhhccceeeCCCC--ChhhHHHHHHHHhcChhhHHHHHHHHHHhhCcee-EEEecCC-CCCceeEEEEeccceeE Confidence 0 000010000000 0012345556666778888889999999999966 4444322 23345567888998877 Q ss_pred eeeeccC-CCCcccccc----cc-eecceeecCccccccc--cc-cee---cCCcceecCcccEEEeeccccc-CCCCcc Q lcl|NC_018087. 183 FVRELDT-KMENGVKVV----KG-YREYFLYDTELESYQC--GH-QHF---AAGTKIKIPYSAMVYAHSGLVD-CCGKNI 249 (520) Q Consensus 183 ~vr~i~~-~~~~~~~~~----~~-~~ey~~y~~~~~~~~~--~~-~~~---~~~~~~~I~~~aI~y~hSGL~d-~~~~~~ 249 (520) .+.+-.. ...-+..++ ++ ....-+|.+....+.. +. ... .++. .-+| .|.|++..-.+ +.|..- T Consensus 154 ~iyD~~~~~~~~a~~~~~~d~~g~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~-~gvP--vV~~~n~~~~~~~~G~se 230 (504) T protein:vir:99 154 GEWNSRRNAMDSLLSITSRDAEGHPTGIALYEDGVTVTADMDDDGDWHADVRTHK-LGVP--VEVLPYKPREDRPLGSSR 230 (504) T ss_pred EEEeCCCCceeEEEEEEEecCCCeEEEEEEEcCCcEEEEEEcCCceeeeccccCC-CCcc--eEEecccccCccccCccc Confidence 6654111 111111111 01 1112244443222110 10 000 0111 1133 67787764433 233221 Q ss_pred hh-hhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCcccc-ccccc Q lcl|NC_018087. 250 IG-YLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKN-QANMM 327 (520) Q Consensus 250 ~s-yL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d-~~~~m 327 (520) +. .|-..+..+| +.+-+.++.=...=.|.|-|+=.+-..++... |+..+ -+..+ T Consensus 231 i~~~v~~l~Da~~--~~~~~~~~~~e~~a~p~r~i~G~~~~~~~~~d----------------------~~~~~~~~~~~ 286 (504) T protein:vir:99 231 ITRPVMSLQQRAL--KGCIRMDGHADVYSFPQLILLGADAKNFRNKD----------------------GSMKPAWQIAL 286 (504) T ss_pred chhhHHHHHHHHH--HHHHHHHHHHHHhcchhhhhccCCcccccccc----------------------ccccchhhhhh Confidence 11 2333333333 45666677777777777766533322211111 11000 00000 Q ss_pred hhhhhhcccccC-----CCCCcceeecCCCCCcChH-HHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHH Q lcl|NC_018087. 328 ALTEDYWLQRRD-----GKAVTEVETLPGMTGMNEM-DDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSF 401 (520) Q Consensus 328 smlEDywLpRRe-----GgrgTEIsTLpGg~nLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF 401 (520) + .=+.||.-+ ++-++++..+++++ |.-. +=++=.-..+...-++|.+-|.-.+..+. ..+..|.-.+... T Consensus 287 ~--~i~~~~~~~~~~~~~~~~~~~~q~~~~~-l~~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~-sSa~Ai~~~~~~L 362 (504) T protein:vir:99 287 A--RVFALPDDEDEPDAARARADVKQFPASS-PQPHIEMLEQIAMMFSGETSIPVESLGFSNRANP-TSADAYIASREDL 362 (504) T ss_pred h--hhhcCCCccccccccCccceeeecCCCC-hHHHHHHHHHHHHHHHhhhCCCHHHhcccccccc-cHHHHHHHHHHHH Confidence 0 012233221 23356788888853 4433 33555555666668999877642221111 2334666777788 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCC--hhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhh Q lcl|NC_018087. 402 DKFISELQHKFEEIFLSPLKSNLLLKRVIT--EDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYI 479 (520) Q Consensus 402 ~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t--~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~ 479 (520) .+-+.+.|++|..-+..++|.-+.+.+... ..+| ..+.+.|..-..=+. .++.+++..+..-+..++ T Consensus 363 ~~ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~~~~----~~~~v~w~d~~~~s~-------a~~aDa~~Kl~~ag~~l~ 431 (504) T protein:vir:99 363 IAEAEGATDDWSPAFRRSMIRALAIKNGLDRIPPEW----KTIDSKFRSPLYLSK-------AAQADAGAKMLGAGPEWL 431 (504) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccc----ccceeEecCCCccCH-------HHHHHHHHHHHhhccccc Confidence 888999999999999999998887766543 2333 246777854333222 556777777766555566 Q ss_pred hHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccC-----------CccccC Q lcl|NC_018087. 480 SNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNP-----------PEPEEI 520 (520) Q Consensus 480 S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~-----------p~~e~~ 520 (520) +...+.-..|++|++||+....+.+++...++... +..++- T Consensus 432 ~~~~~l~~~lg~~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~~~~~~ 483 (504) T protein:vir:99 432 KETEVGLELLGLTPQQAKRALAERRRASSVSIIEALNRRQQEAATAGEDQDQ 483 (504) T ss_pred cchHHHHhhcCCCHHHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCCCCCCc Confidence 65444446679999999887776666543222110 011111 No 94 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=97.11 E-value=0.00017 Score=41.27 Aligned_cols=377 Identities=10% Similarity=0.132 Sum_probs=169.3 Q ss_pred hhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchh Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVD 88 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd 88 (520) ++||.++.+..++ ++.+.++...... .+.... ...+.. +.. +..+++|-|. T Consensus 1 M~~f~~~~~~~~~--------------~~~~~~~~~~~~~-----~~~~~~---~~~~~~------v~~-~~al~~~~v~ 51 (386) T protein:vir:49 1 MPIFNITNLATES--------------PPINQESFFDIAD-----SDFLAS---LNSSEW------VSA-ENALKNSDLF 51 (386) T ss_pred CchhhhhccCCCC--------------cccchhhhhhhhh-----cccccc---ccCCce------ech-hhhhccHHHH Confidence 6667665432221 1111111111100 000000 000100 000 1235689999 Q ss_pred HHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHH----HHHhhccccceeEEEeeecC Q lcl|NC_018087. 89 NAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSD----HFKRWYVDSRVFFHKIINPN 164 (520) Q Consensus 89 ~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~----~fRrWYvDgri~~hkvid~~ 164 (520) .||+-|.+.+.-++- .+... ..+.++.--+-..++.+ ++..+++.|--|.-++-|.. T Consensus 52 ~~i~~ia~~ia~~p~-------~~~~~------------~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~ 112 (386) T protein:vir:49 52 SIISQLSNDLATAKI-------TTSRK------------QLQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDN 112 (386) T ss_pred HHHHHHHHHhhhCce-------eeccc------------hhhhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCC Confidence 999999987754222 11111 11223322333344444 55568889999998886632 Q ss_pred CCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccccC Q lcl|NC_018087. 165 RPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDC 244 (520) Q Consensus 165 ~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~ 244 (520) .-+++|.+|+|..+..++.- ++... +|.|... .+ ..+..+.++.+.|+|.. ...+ T Consensus 113 ---g~~~~l~~i~~~~v~v~~~~-----~~~~~------~y~~~~~--~~-------~~~~~~~~~~~evih~~--~~~~ 167 (386) T protein:vir:49 113 ---GRDMKWEYLRPSQVSFNRLD-----NQNGL------YYNITFD--DP-------HIAPKQHVPQNDILHFR--LLSV 167 (386) T ss_pred ---CcEEEEEEecCceeEEEEcC-----CCceE------EEEEEEc--Cc-------cccceeEEccccEEEec--CCCC Confidence 23889999999999876432 11111 1122110 00 11233678889988875 3455 Q ss_pred CC-CcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccc Q lcl|NC_018087. 245 CG-KNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQ 323 (520) Q Consensus 245 ~~-~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~ 323 (520) ++ ...+|.|..|++.+.....+++...-+--..+--+-+..++.+..+.. ++ .++..+... + ...|.+ T Consensus 168 ~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~-~~-~~~~~~~~~-----~-~n~g~~--- 236 (386) T protein:vir:49 168 DGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGLLDF-KT-KVSRSRQAM-----K-QMQGGP--- 236 (386) T ss_pred CCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChHH-HH-HHHHHHHHh-----c-cCCCCc--- Confidence 55 346899999999999999998888766666677788888886554433 33 333333221 1 122321 Q ss_pred cccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHH Q lcl|NC_018087. 324 ANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFD 402 (520) Q Consensus 324 ~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~ 402 (520) + .+ + .|.+++.|.-.....+ ++=.++....+.++++||.+.|..+++. .++ .+..+..+. T Consensus 237 ---~-vl--------~--~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~----~~~-~~~~~~~~~ 297 (386) T protein:vir:49 237 ---L-VL--------D--DLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDGDQ----QSS-LEMIYNIYF 297 (386) T ss_pred ---e-ec--------C--CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc----cch-HHHHHHHHH Confidence 1 11 1 2556766632222212 3345788889999999999999643221 111 111122222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHH Q lcl|NC_018087. 403 KFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNH 482 (520) Q Consensus 403 KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~ 482 (520) .| |+..+..+... |...| +. .+. |..+ ++.+.+. ..+...++.+ +-+-.++.. T Consensus 298 ~~---i~~~l~~i~~~-~~~~l-~~-------------~~~--~~~~----~~~~~d~-~~~~~~~~~l--~~~g~~t~n 350 (386) T protein:vir:49 298 KS---VSRYLRPFVSE-MSKKL-SC-------------EVD--VDIS----PAVDPTG-SNYISLINSM--VKSGTLAQN 350 (386) T ss_pred HH---HHHHHHHHHHH-HHHHh-cc-------------hhc--ccch----hhhccCH-HHHHHHHHHH--HhCCCcCHH Confidence 22 33333332221 22222 11 111 1110 0000000 1112222211 011233444 Q ss_pred HHHHHHhC---CCHHHHHHHHHHHHHhhhcCCccCCcccc Q lcl|NC_018087. 483 TAMKDFLQ---MSDEDIAAERKLIDEELSDKIFNPPEPEE 519 (520) Q Consensus 483 ~i~k~IL~---~tDeeI~~~~kqi~~E~~~~~~~~p~~e~ 519 (520) -+++ +|. +...|+-..+..-....+- .+.++++ T Consensus 351 E~r~-~l~~~~~~~~~~~~~~~~~~~~~~g---Gd~~~~~ 386 (386) T protein:vir:49 351 QGLY-ILQQAEILPKELPDGKNPNRTSLKG---GEINEQD 386 (386) T ss_pred HHHH-HHhhCCCCCCcCcchhccCCCCCCC---CCCCCCC Confidence 4442 221 1111111100000000000 1223333 No 95 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=97.09 E-value=0.00017 Score=41.18 Aligned_cols=407 Identities=13% Similarity=0.113 Sum_probs=173.0 Q ss_pred ccCCCCCCCceeeccccccccccccc-ccccccccch---------hHHHHHHHHHHHhhccchhHHH------------ Q lcl|NC_018087. 34 ITAPKFDDGATEVDSQDIAYNGVFQK-LYGSQDPTAT---------STRELINTYRSLLNNYEVDNAV------------ 91 (520) Q Consensus 34 ~~~p~~~dg~~~i~~~~~a~~g~~~~-~~~~~~~~~~---------~~~~LI~~YR~ma~~pEvd~Ai------------ 91 (520) .+..---.|-.....|+++.. -++. ++.+.+.... .-...+.+|+.+..+++-...| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~e~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~ 79 (483) T protein:vir:12 1 MAQALIKGGNILYPSQPTQTE-IFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGA 79 (483) T ss_pred CccchhcCCceeecCcchhhh-hhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc Confidence 111111223333333333221 1111 1111111111 0112234555555554443211 Q ss_pred ---------------HhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhcccccee Q lcl|NC_018087. 92 ---------------QEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVF 156 (520) Q Consensus 92 ---------------~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~ 156 (520) ..||+..+-+ --..||++..++.+..+ .+. .+++ -+|+....++++...+-|+-| T Consensus 80 ~~~~~~~~ki~~n~~k~Ivd~~~~~-l~G~p~~~~~~d~~~~~----~l~----~~~~-n~~~~~~~~~~~~~~~~G~~y 149 (483) T protein:vir:12 80 VDPLKPDDRMITNFHANLVDQKVSY-IVGKPIAFKHTDDEVVK----RID----EVLG-NRFDDKLHSVLTGASNKGIEW 149 (483) T ss_pred ccccccccccccchHHHHHHHHhhh-hcccCceeccCChHHHH----HHH----HHHh-ccHHHHHHHHHHHHhhCCeEE Confidence 1112211111 12366777666654433 222 2332 267788888899999999999 Q ss_pred EEEeeecCCCCCCeeeeEecCccceeeeeecc--CCCCccccccc--c----------eecceeecCcccccccccc--- Q lcl|NC_018087. 157 FHKIINPNRPKDGIIELRRLDPRNVQFVRELD--TKMENGVKVVK--G----------YREYFLYDTELESYQCGHQ--- 219 (520) Q Consensus 157 ~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~--~~~~~~~~~~~--~----------~~ey~~y~~~~~~~~~~~~--- 219 (520) .+.-+|. +|-..++.+||+.+-++.+-. .+..-.++.+. . ...||++............ T Consensus 150 ~~v~~d~----d~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~ 225 (483) T protein:vir:12 150 LHPYLDE----EGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLEN 225 (483) T ss_pred EEEEEcC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeecceEEEEEecCeEEEEEEeCCeeeecccccccc Confidence 9887763 356789999999998875421 11112222111 1 1112222211100000000 Q ss_pred ---eecCCcceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHH Q lcl|NC_018087. 220 ---HFAAGTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLK-LLEDAMMIYRITRAPDRRVFYIDTGNMPARK 295 (520) Q Consensus 220 ---~~~~~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~-m~EDalVIyRi~RApeRRvFyIDvGnlpk~K 295 (520) ...++.-=+|| |+++. ++....|=++..+.....+. ++=+....-+.++.|-+-+.-.+.-+++ T Consensus 226 ~~~~~~~~~~g~vP---vv~~~------nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~--- 293 (483) T protein:vir:12 226 SKTHFSTGSWGKIP---FIPFK------NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELP--- 293 (483) T ss_pred cccccccCCCCccc---eEEec------CCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccch--- Confidence 00011101222 23331 12223444444333333332 3455555556677775543322222211 Q ss_pred HHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChH-HHHHHHHHHHHHhcCC Q lcl|NC_018087. 296 AAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEM-DDILYFRKALYMALRV 374 (520) Q Consensus 296 Aeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei-~DV~YF~kkLy~aL~V 374 (520) +..+ .+..++...-++ |.++.+|-...+.... .-+.-+.+.+|+..++ T Consensus 294 --~~~~---------------------------~~~~~~~~~~~~--~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~ 342 (483) T protein:vir:12 294 --EFKR---------------------------LLRYYGAIKVSD--NGGVDTIQVEVPVENSKKYLDELYQKIMLFGQA 342 (483) T ss_pred --hHHH---------------------------hhhhccccccCC--CCcceEEeecCCHHHHHHHHHHHHHHHHHHhCC Confidence 1111 111111111111 1235555444444333 3345666778888999 Q ss_pred ChhhccCCCccccccccc--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchH Q lcl|NC_018087. 375 PLSRIPDEQTQNVFDMST--AISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYF 452 (520) Q Consensus 375 P~SRl~~~~~~~~~G~~~--eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f 452 (520) |- +.++. +. |..+ .|.--+.....-+.+.++.|...+..+++.=+-+-|+ ..+|. .|.+.|....-- T Consensus 343 p~--~~~~~-~~--~n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~--~~~~~----~i~v~f~~~~p~ 411 (483) T protein:vir:12 343 VD--FSSDK-FG--SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI--KGEHK----DVDISFNYNKVA 411 (483) T ss_pred CC--CCccc-cc--cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CCccc----eeeEEeCCCCCC Confidence 85 22221 11 2222 2322333344556777777777777776653333332 34554 456777544433 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCC--HHHHHHHHHHHHHhhhcC-Cc----cC--CccccC Q lcl|NC_018087. 453 SEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMS--DEDIAAERKLIDEELSDK-IF----NP--PEPEEI 520 (520) Q Consensus 453 ~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~t--DeeI~~~~kqi~~E~~~~-~~----~~--p~~e~~ 520 (520) .+... +++++.+.+ .+|.+++++.+-..+ ++|++++.++-++..++. -+ .+ +++++= T Consensus 412 ~~~~~-------a~~~~kl~G----iiS~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~ 477 (483) T protein:vir:12 412 NTELQ-------VQTAQQSMG----IVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQQERS 477 (483) T ss_pred CHHHH-------HHHHHHHhc----cCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccccccCCcccCCCC Confidence 33222 344555543 379999998743333 245555555443333221 00 00 111111 No 96 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=97.08 E-value=0.00018 Score=41.12 Aligned_cols=428 Identities=15% Similarity=0.109 Sum_probs=173.8 Q ss_pred hhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccc--hhHHHHHHHH-----HHH Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTA--TSTRELINTY-----RSL 81 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~--~~~~~LI~~Y-----R~m 81 (520) +.|.+|-...++++...++-.+-+.--.++.. .+... ..+|-+ +.+.-... .....+-+.| +.. T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~----r~~~~----~~~y~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 71 (474) T protein:vir:94 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRE----RMVNL----YNRYKT-HIDYVPIFKRRPIEEKEDFETGGNVRRL 71 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhH----HHHHH----HHHHhh-hcchhhhhcchhhhhhhhhhhccccccc Confidence 33444443333332211110000000000000 00000 000000 00000000 0000000000 000 Q ss_pred -------hhccchhHHHHhhhceeeEecCCCcEEEEeeccc-hhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhcccc Q lcl|NC_018087. 82 -------LNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQT-AFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDS 153 (520) Q Consensus 82 -------a~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~-~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDg 153 (520) ..++=....|+..+. +=- ..||++..++. +-.+ .+.+.++.+++--+|+....++.+...+-| T Consensus 72 ~~~~~~ki~~n~~~~ivd~~~~----yl~-g~pv~~~~~~~~~~~e----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G 142 (474) T protein:vir:94 72 DVSVNNKLNNSFDSEIVDTRVG----YLH-GVPVTYDLDENAEKNE----KLKKFITNFAIRNSVDDEDSEIGKMAAICG 142 (474) T ss_pred ccCcccccccchHHHHHHhHhh----hee-ccceeEeeCCCCcchH----HHHHHHHHHHhhcCHhHHHHHHHHHHhhcC Confidence 112222333333332 222 36888877543 2334 555566666666788999999999999999 Q ss_pred ceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCccccccc------c--eecceeecCccc-cccccc-ceec- Q lcl|NC_018087. 154 RVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVK------G--YREYFLYDTELE-SYQCGH-QHFA- 222 (520) Q Consensus 154 ri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~------~--~~ey~~y~~~~~-~~~~~~-~~~~- 222 (520) +-|.+.-+|. +|-..++.+||+.+-+|.+-..+..-.++.+. + +....+|++.-. .+...+ ..+. T Consensus 143 ~a~~~~~~d~----~~~~~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~ 218 (474) T protein:vir:94 143 YGARLAYIDT----NGDIRIKNIDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQE 218 (474) T ss_pred eEEEEEEeCC----CCeeEEEEEcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccc Confidence 9888765552 35678999999999888643222211111110 0 112234433211 111110 0000 Q ss_pred ---CCcce-ecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHH Q lcl|NC_018087. 223 ---AGTKI-KIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKL-LEDAMMIYRITRAPDRRVFYIDTGNMPARKAA 297 (520) Q Consensus 223 ---~~~~~-~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m-~EDalVIyRi~RApeRRvFyIDvGnlpk~KAe 297 (520) ....+ +||- |.| +|+....|-++..+.....+.. +-+....-+-++.|-+-+.-. .++. T Consensus 219 ~~~~~~~~g~vPv--v~~-------~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~---~~~~---- 282 (474) T protein:vir:94 219 VGRYEHLFDYNPL--FGV-------PNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGM---GMSE---- 282 (474) T ss_pred cccccCCCCccce--EEe-------cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccC---CCCc---- Confidence 00001 1221 122 2333445666666655555443 333333444455554433221 1221 Q ss_pred HHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcC-hHHHHHHHHHHHHHhcCCCh Q lcl|NC_018087. 298 QHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMN-EMDDILYFRKALYMALRVPL 376 (520) Q Consensus 298 qyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLg-ei~DV~YF~kkLy~aL~VP~ 376 (520) +-+..+. . + |. .|++ ++ |..++.|--..+.. .-.-+.-+.+.+|....+|- T Consensus 283 ~~~~~~~-~--~--------~~-------------i~~~--~~--~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~ 334 (474) T protein:vir:94 283 EMIQETQ-K--S--------GA-------------FELF--DK--DMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVN 334 (474) T ss_pred hhhhhhh-h--c--------ce-------------eEec--CC--CCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcc Confidence 1111110 0 1 11 1221 11 12344543333332 23345667788889888884 Q ss_pred hhccCCCccccccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHH Q lcl|NC_018087. 377 SRIPDEQTQNVFDMSTAIS--RDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSE 454 (520) Q Consensus 377 SRl~~~~~~~~~G~~~eIt--RDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~E 454 (520) +..++. .|..+..+ .-......-+.+.+..|..-+...++.=+-+-++......+.-...+.+.|....--.+ T Consensus 335 --~~~~~~---~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~ 409 (474) T protein:vir:94 335 --FNSDEF---NGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNK 409 (474) T ss_pred --cccccc---cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCH Confidence 222211 13222222 11122233455555666665555555433222222111111122357888886665555 Q ss_pred HHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCcc-------CCccccC Q lcl|NC_018087. 455 MKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFN-------PPEPEEI 520 (520) Q Consensus 455 lKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~-------~p~~e~~ 520 (520) ...++ +++.+. | .+|.+++++. |...+ +.+++-++|++|..+..-+ ++++++- T Consensus 410 ~e~a~-------~~~kl~---g-~iS~et~~~~-l~~v~-d~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~ 469 (474) T protein:vir:94 410 LEESQ-------VLINLK---G-QVSERTRLGQ-SQLVD-DVDYELDEMEKESLEFNDKLPDIDEGDANDKSQ 469 (474) T ss_pred HHHHH-------HHHHHh---c-cCchHHHHHh-CCCCC-CHHHHHHHHHHHHHHHHhhcccccCCCcCCCCc Confidence 44444 444443 3 3799999977 55543 2344444444444221111 1111111 No 97 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=97.08 E-value=0.00018 Score=41.12 Aligned_cols=428 Identities=15% Similarity=0.109 Sum_probs=173.8 Q ss_pred hhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccc--hhHHHHHHHH-----HHH Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTA--TSTRELINTY-----RSL 81 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~--~~~~~LI~~Y-----R~m 81 (520) +.|.+|-...++++...++-.+-+.--.++.. .+... ..+|-+ +.+.-... .....+-+.| +.. T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~----r~~~~----~~~y~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 71 (474) T protein:vir:10 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRE----RMVNL----YNRYKT-HIDYVPIFKRRPIEEKEDFETGGNVRRL 71 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhH----HHHHH----HHHHhh-hcchhhhhcchhhhhhhhhhhccccccc Confidence 33444443333332211110000000000000 00000 000000 00000000 0000000000 000 Q ss_pred -------hhccchhHHHHhhhceeeEecCCCcEEEEeeccc-hhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhcccc Q lcl|NC_018087. 82 -------LNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQT-AFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDS 153 (520) Q Consensus 82 -------a~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~-~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDg 153 (520) ..++=....|+..+. +=- ..||++..++. +-.+ .+.+.++.+++--+|+....++.+...+-| T Consensus 72 ~~~~~~ki~~n~~~~ivd~~~~----yl~-g~pv~~~~~~~~~~~e----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G 142 (474) T protein:vir:10 72 DVSVNNKLNNSFDSEIVDTRVG----YLH-GVPVTYDLDENAEKNE----KLKKFITNFAIRNSVDDEDSEIGKMAAICG 142 (474) T ss_pred ccCcccccccchHHHHHHhHhh----hee-ccceeEeeCCCCcchH----HHHHHHHHHHhhcCHhHHHHHHHHHHhhcC Confidence 112222333333332 222 36888877543 2334 555566666666788999999999999999 Q ss_pred ceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCccccccc------c--eecceeecCccc-cccccc-ceec- Q lcl|NC_018087. 154 RVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVK------G--YREYFLYDTELE-SYQCGH-QHFA- 222 (520) Q Consensus 154 ri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~------~--~~ey~~y~~~~~-~~~~~~-~~~~- 222 (520) +-|.+.-+|. +|-..++.+||+.+-+|.+-..+..-.++.+. + +....+|++.-. .+...+ ..+. T Consensus 143 ~a~~~~~~d~----~~~~~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~ 218 (474) T protein:vir:10 143 YGARLAYIDT----NGDIRIKNIDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQE 218 (474) T ss_pred eEEEEEEeCC----CCeeEEEEEcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccc Confidence 9888765552 35678999999999888643222211111110 0 112234433211 111110 0000 Q ss_pred ---CCcce-ecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHH Q lcl|NC_018087. 223 ---AGTKI-KIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKL-LEDAMMIYRITRAPDRRVFYIDTGNMPARKAA 297 (520) Q Consensus 223 ---~~~~~-~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m-~EDalVIyRi~RApeRRvFyIDvGnlpk~KAe 297 (520) ....+ +||- |.| +|+....|-++..+.....+.. +-+....-+-++.|-+-+.-. .++. T Consensus 219 ~~~~~~~~g~vPv--v~~-------~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~---~~~~---- 282 (474) T protein:vir:10 219 VGRYEHLFDYNPL--FGV-------PNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGM---GMSE---- 282 (474) T ss_pred cccccCCCCccce--EEe-------cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccC---CCCc---- Confidence 00001 1221 122 2333445666666655555443 333333444455554433221 1221 Q ss_pred HHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcC-hHHHHHHHHHHHHHhcCCCh Q lcl|NC_018087. 298 QHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMN-EMDDILYFRKALYMALRVPL 376 (520) Q Consensus 298 qyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLg-ei~DV~YF~kkLy~aL~VP~ 376 (520) +-+..+. . + |. .|++ ++ |..++.|--..+.. .-.-+.-+.+.+|....+|- T Consensus 283 ~~~~~~~-~--~--------~~-------------i~~~--~~--~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~ 334 (474) T protein:vir:10 283 EMIQETQ-K--S--------GA-------------FELF--DK--DMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVN 334 (474) T ss_pred hhhhhhh-h--c--------ce-------------eEec--CC--CCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcc Confidence 1111110 0 1 11 1221 11 12344543333332 23345667788889888884 Q ss_pred hhccCCCccccccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHH Q lcl|NC_018087. 377 SRIPDEQTQNVFDMSTAIS--RDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSE 454 (520) Q Consensus 377 SRl~~~~~~~~~G~~~eIt--RDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~E 454 (520) +..++. .|..+..+ .-......-+.+.+..|..-+...++.=+-+-++......+.-...+.+.|....--.+ T Consensus 335 --~~~~~~---~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~ 409 (474) T protein:vir:10 335 --FNSDEF---NGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNK 409 (474) T ss_pred --cccccc---cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCH Confidence 222211 13222222 11122233455555666665555555433222222111111122357888886665555 Q ss_pred HHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCcc-------CCccccC Q lcl|NC_018087. 455 MKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFN-------PPEPEEI 520 (520) Q Consensus 455 lKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~-------~p~~e~~ 520 (520) ...++ +++.+. | .+|.+++++. |...+ +.+++-++|++|..+..-+ ++++++- T Consensus 410 ~e~a~-------~~~kl~---g-~iS~et~~~~-l~~v~-d~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~ 469 (474) T protein:vir:10 410 LEESQ-------VLINLK---G-QVSERTRLGQ-SQLVD-DVDYELDEMEKESLEFNDKLPDIDEGDANDKSQ 469 (474) T ss_pred HHHHH-------HHHHHh---c-cCchHHHHHh-CCCCC-CHHHHHHHHHHHHHHHHhhcccccCCCcCCCCc Confidence 44444 444443 3 3799999977 55543 2344444444444221111 1111111 No 98 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=97.08 E-value=0.00018 Score=41.10 Aligned_cols=408 Identities=12% Similarity=0.057 Sum_probs=186.1 Q ss_pred hhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchh Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVD 88 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd 88 (520) |+||..|.++...........+..++.-|. .. . .|+.... | .-|. -+..+++|.|. T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~~~~~~~~~~----~~-------~-~~~~~~~--g---------~~V~-~~~al~~~~V~ 56 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEARAWEPYDPS----IY-------N-LGAVAAS--G---------ETVT-PHDALQVSAVF 56 (457) T ss_pred CchhhhhhcccccccccccccccccccchH----HH-------h-hcccccC--C---------ceec-hHHhhccHHHH Confidence 333333322222110000000111111110 00 0 0110000 0 0111 13456789999 Q ss_pred HHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcc---hhhhHHHHHhhc----cccceeEEEee Q lcl|NC_018087. 89 NAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNF---QRKGSDHFKRWY----VDSRVFFHKII 161 (520) Q Consensus 89 ~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f---~k~g~~~fRrWY----vDgri~~hkvi 161 (520) .||+-|.+.+.-+ |+.+--+.. ...++... ..++.+|+- ...+.++++.+. +.|--|..+. T Consensus 57 ~~v~~Ia~~iA~l-----p~~~~~~~~---~~~~~~~~---~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~- 124 (457) T protein:vir:13 57 ASVRLLSETIATL-----PLSTYSKRG---GSRKEIVT---PEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVR- 124 (457) T ss_pred HHHHHHHHhhccC-----ceEEEEecC---Cccccccc---chHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEE- Confidence 9999999987654 222222111 11111111 223344443 234566666544 4687776654 Q ss_pred ecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeeccc Q lcl|NC_018087. 162 NPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGL 241 (520) Q Consensus 162 d~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL 241 (520) + + ...+++|.+|+|..+..++....... ...|+.|.....+ + ......++++.|+|.. . T Consensus 125 ~--~-~g~~~~l~~l~p~~v~v~~~~~~~~~--------~~~~~~y~~~~~~-----~---~~~~~~~~~~diih~~--~ 183 (457) T protein:vir:13 125 W--Q-GPNIVGLDVLDPTKIHVHMVMVDGLR--------RKVFEAYDIDADG-----N---EVLLGWFTPRDVLHIP--G 183 (457) T ss_pred e--c-CCcEEEEEEEccCceEEEEecCCCcc--------ceeEEEEEEecCC-----c---eeeEEeeCccceEEec--C Confidence 3 2 24799999999999998755442111 1123333322110 0 0112356788887775 3 Q ss_pred ccCCC-CcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCcc Q lcl|NC_018087. 242 VDCCG-KNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKV 320 (520) Q Consensus 242 ~d~~~-~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev 320 (520) ..+++ -..+|-+..|.+.+.....+++...-+=---+--+-|..++ |.|-+..+++..+.+...|+. . T Consensus 184 ~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~ls~e~~~~~~~~~~~~~~g--~-------- 252 (457) T protein:vir:13 184 MMLPGDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVP-GTMSEEGLARAREAWRAANSG--V-------- 252 (457) T ss_pred CCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcC-CCCCHHHHHHHHHHHHHHhcC--c-------- Confidence 44544 35678899999988888888876554444444555666665 566665555544444444432 1 Q ss_pred ccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHH---HHHHHHHHHHhcCCChhhccCCCccccccccchhhHH Q lcl|NC_018087. 321 KNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDD---ILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRD 397 (520) Q Consensus 321 ~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~D---V~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRD 397 (520) ++..+.+ .++ + |.+++.|. .+..++.= -+|....+.++++||...|..-.+... .++.+... T Consensus 253 ~nag~~~-vl~-------~---g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--~~sn~eq~ 317 (457) T protein:vir:13 253 DNAHRVA-LLT-------E---GAKFSKVA--MSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTS--WGSGLAEQ 317 (457) T ss_pred cccCcce-ecC-------C---CceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCccc--ccchHHHH Confidence 1111122 121 2 45555553 23333332 347788899999999998853222111 11334444 Q ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_018087. 398 ELSFDKF-ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIG 476 (520) Q Consensus 398 ElkF~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vg 476 (520) -+.|.++ |.-+..++..- |-. .+.++.+. ....+.|.-+. +... -+..|.+++..+-.- T Consensus 318 ~~~f~~~tl~P~~~~ie~~----ln~-----~L~~~~~~----~~~~i~fd~~~----l~~~-D~~~r~~~~~~~~~~-- 377 (457) T protein:vir:13 318 NIAFTMFSLRPWLERIEAG----FNR-----LLFAETAD----RFRFVKFNLDE----IKRG-APKERMELWSLGLQN-- 377 (457) T ss_pred HHHHHHHHHHHHHHHHHHH----HHH-----hhcCcccc----CceeEEeechh----hhcc-CHHHHHHHHHHHHhC-- Confidence 5557665 34444444332 222 33343332 12234444332 2222 224566666655322 Q ss_pred hhhhHHHHHHHHhCCCHHH-----HHHH-------HHHHHHh-hhcCCccCCccccC Q lcl|NC_018087. 477 KYISNHTAMKDFLQMSDED-----IAAE-------RKLIDEE-LSDKIFNPPEPEEI 520 (520) Q Consensus 477 ky~S~~~i~k~IL~~tDee-----I~~~-------~kqi~~E-~~~~~~~~p~~e~~ 520 (520) -++|.+-++. .+.|.+-+ .--. .++-+.+ ...+-.++|..++- T Consensus 378 G~~T~NE~R~-~~gl~Pi~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (457) T protein:vir:13 378 GIYSIDEVRA-AEDMTPLPDGLGEKYRVPLNLGEVGEEPEPEPAPAPPAIEPPAEEP 433 (457) T ss_pred CCcCHHHHHH-HhCCCCCCCCcccceeeccccccccccccccccCCCCCCCCCcccc Confidence 3678888874 47776421 0000 0000000 01111111111111 No 99 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=96.95 E-value=0.00024 Score=40.41 Aligned_cols=437 Identities=11% Similarity=0.095 Sum_probs=205.6 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRS 80 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ 80 (520) |+|+- .+-.+|+-+...-- ..++. +-.++....++ .+..+-|++|+. T Consensus 1 m~~~~-~ik~~~~~~~~~~~----------~~~~~-~~~~~~~i~~~---------------------~~~~~~I~~w~~ 47 (517) T protein:vir:98 1 MKVIQ-RIKNFFKRGGYALS----------GQTLK-SINDHEKINID---------------------PNELARIERNLR 47 (517) T ss_pred CchHH-HHHHHHHHHHHHhc----------ccchh-HhhcCCceecC---------------------HHHHHHHHHHHH Confidence 66542 22222221111000 00110 00111011110 012233444444 Q ss_pred Hh--hccchh--------------------HHHHhhhceeeEecCCCcEEEEeeccchhh--hH-HHHHHHHHHHHHHHH Q lcl|NC_018087. 81 LL--NNYEVD--------------------NAVQEIVSDAIVYEEGFDVVSIDLDQTAFT--EN-IRNLISDEFNSVLNM 135 (520) Q Consensus 81 ma--~~pEvd--------------------~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s--~~-ik~~I~eeF~~i~~l 135 (520) |. .+|+|. .+..++.+=+ -.++|++.+++.... ++ ....-++-.+.++.- T Consensus 48 ~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~~~i~~~~A~Ll-----~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~ 122 (517) T protein:vir:98 48 QYEGDYPQVEYINSQGKIQERDYMTLNLRKLSADVLSGLV-----FNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQH 122 (517) T ss_pred HhcCCCcccccccccccccccceeecCcHHHHHHHhhhhh-----cCCcceEEecccccccccccchhHHHHHHHHHHHh Confidence 42 223321 1222222110 114556666544321 11 112234556788888 Q ss_pred hcchhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccc------cceeccee--- Q lcl|NC_018087. 136 LNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVV------KGYREYFL--- 206 (520) Q Consensus 136 l~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~------~~~~ey~~--- 206 (520) -+|.+...+.+.....-|-.+|-..+|. |-+.+..++|.++-+++--......++-+. ++-..||+ T Consensus 123 n~f~~~~~~~~e~a~a~G~~a~k~~~d~-----~~~~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE 197 (517) T protein:vir:98 123 NKFIKNLSDYLEPTFALGGLTVRPYVDN-----GEIEFSWALANAFYPLRSNSNGISEGVMKSVTTKVIGNKTVYYTLLE 197 (517) T ss_pred ccHHHHHHHHHHHHhhhCCEEEEEEEeC-----CeeEEEEEcCCeeEEEEecCCCeEEEEEEEEEEEeecCCceEEEEEE Confidence 8899999999999999999999888883 234588999999888643111111111000 00011221 Q ss_pred -ecCcc-----cccccccceecCCc----ceecC--------cccEEEeecc-----c--------ccCCCCcchhhhHH Q lcl|NC_018087. 207 -YDTEL-----ESYQCGHQHFAAGT----KIKIP--------YSAMVYAHSG-----L--------VDCCGKNIIGYLHR 255 (520) Q Consensus 207 -y~~~~-----~~~~~~~~~~~~~~----~~~I~--------~~aI~y~hSG-----L--------~d~~~~~~~syL~~ 255 (520) +.+.. ..|......|..++ +..|| .+.+++.+.. - .++..+..+|-++. T Consensus 198 ~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L~~~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~ 277 (517) T protein:vir:98 198 FHEWEKTEEGESLYVITNELYKSDNEGEIGKRIPLEELYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDN 277 (517) T ss_pred EEecCceeccCCcEEEEEEEEecCCCccccccccccccccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhh Confidence 01110 01111111111000 11111 1122222210 0 11122345678888 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcc Q lcl|NC_018087. 256 AVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWL 335 (520) Q Consensus 256 aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywL 335 (520) |+-++--|-..-+. +.|..|.=.+|+|. +..=|++.... +.-....++| .+++.++.+..+ T Consensus 278 a~~~~d~lD~~~s~--~~~e~~~g~~~i~v-p~~~l~~~~~~-------~g~~~~~~~d------~~~~~y~~~~~~--- 338 (517) T protein:vir:98 278 SVSTLKKINDTYDQ--FWWEIKMGQRTVFV-SDVMLRTVPDE-------SGMPPPQVFD------PDVNVYKSIRMG--- 338 (517) T ss_pred hHHHHHHHHHHHHH--HHHHHHhCCcceec-ChhhhccccCC-------CCcccCCCCC------cccceeeeccCC--- Confidence 88777777654444 45888887777775 22211100000 0000000111 122233332111 Q ss_pred cccCCCCCcceeecCCCCCc-ChHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 336 QRRDGKAVTEVETLPGMTGM-NEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKFEE 414 (520) Q Consensus 336 pRReGgrgTEIsTLpGg~nL-gei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~ 414 (520) .|+. -|+++.+.==. .-..-+.++.+.+-...++|-+-|..++.+ . .-++||...+-.-..-+.+.|+.+.. T Consensus 339 ---~~~~--~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~-~-kTATEi~s~~~~~~~t~~~~~~~~~~ 411 (517) T protein:vir:98 339 ---TDEE--FVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRS-M-KTATEIVSENDLTYRTRNDHVYEVEQ 411 (517) T ss_pred ---CCCC--ceeeeccccchHHHHHHHHHHHHHHHHHhCCCcccccccccc-c-ccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1211 25554441111 124456677888888999999999876543 2 34678877777777788888888888 Q ss_pred HHHHHHHHHHHhcCC---CChhhHHhhhhceEEEeeccchHH-HHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhC Q lcl|NC_018087. 415 IFLSPLKSNLLLKRV---ITEDEWEAELNNIKIVFHKNSYFS-EMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQ 490 (520) Q Consensus 415 if~d~Lk~QLiLkgi---~t~eew~~~~~~I~~~f~~Dn~f~-ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~ 490 (520) .+.++++.-|.|... +...-+. ...+.++|. |+.+. +..+++.+ .++.-. | .+|....+++..+ T Consensus 412 aL~~lv~~i~~l~~~~~~~~~~~~~--~~~v~v~f~-D~i~~D~~~~~~~~-------~~~v~a-G-~ms~~~~i~~~~g 479 (517) T protein:vir:98 412 FIKGLVISVLELAKTYKLFGGEIPS--AEHIGVDFD-DGVFQDRSALLRFY-------GQAKTF-G-FIPTVEAIQRIFK 479 (517) T ss_pred HHHHHHHHHHHHHHHHhhcCCCCCC--CcceEEEcC-CCCCCCHHHHHHHH-------HHHHhc-C-CCCHHHHHHHhCC Confidence 888888876654321 1111001 124777775 33332 22222222 122111 3 3688887777889 Q ss_pred CCHHHHHHHHHHHHHhhhcCCccCCc----------ccc Q lcl|NC_018087. 491 MSDEDIAAERKLIDEELSDKIFNPPE----------PEE 519 (520) Q Consensus 491 ~tDeeI~~~~kqi~~E~~~~~~~~p~----------~e~ 519 (520) +||+|-+++-.+|++|....- +-+. +|| T Consensus 480 ~~eeeA~~e~~~i~~E~~~~~-~~~~~~~~~~~~~gd~e 517 (517) T protein:vir:98 480 VPKKTAEQWLEEIRKDQIELD-PVTISQRAQKRMFGDEE 517 (517) T ss_pred CChHHHHHHHHHHHHhccccC-CCCccccccCCCCCCCC Confidence 999999999999999887542 2121 222 No 100 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=96.90 E-value=0.00027 Score=40.17 Aligned_cols=383 Identities=12% Similarity=0.089 Sum_probs=178.7 Q ss_pred hhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchh Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVD 88 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd 88 (520) ++||.-|.+..... .+.+... |.... +....+.+.+ -.+....++|-|. T Consensus 1 Mgl~~~~f~~~~~~---------~~~~~~~---~~~~~-----~~~~~~~g~~--------------v~~~~al~~~~v~ 49 (409) T protein:vir:84 1 MSLFTRIFSGPSEE---------RTLTKIS---GIPSP-----AEDWAMHGDR--------------PGANSAMTLGAFY 49 (409) T ss_pred CchhhhhhcCCCcc---------ccccccc---ccccc-----cchhhccCcc--------------cchhhhhccHHHH Confidence 55555333322110 0111111 11100 0001111111 1233445789999 Q ss_pred HHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHh----cchhhhHHH----HHhhccccceeEEEe Q lcl|NC_018087. 89 NAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNML----NFQRKGSDH----FKRWYVDSRVFFHKI 160 (520) Q Consensus 89 ~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll----~f~k~g~~~----fRrWYvDgri~~hkv 160 (520) .||+.|.+.+.-++- .+--++ +..+. +...+..+| |-...++++ +..+.+.|--|..+. T Consensus 50 ~~v~~ia~~iA~lp~-----~~~~~~----~~~~~----~~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~ 116 (409) T protein:vir:84 50 ACVTLLADTVASLSI-----DAYRKK----DNVRI----PVSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYIS 116 (409) T ss_pred HHHHHHHHhhhhCce-----EEEEec----CCccc----ccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEE Confidence 999999999864421 111110 00010 112233333 233444444 445778888886665 Q ss_pred eecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecc Q lcl|NC_018087. 161 INPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSG 240 (520) Q Consensus 161 id~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSG 240 (520) +. +...-+++|.+|+|..++....-. .++.. -+.+|. . ++.+++.+.|+|.. T Consensus 117 ~~--~~~g~~~~L~~l~p~~v~v~~~~~---~~~~~------~~~~~~-------~--------~g~~~~~~dvih~~-- 168 (409) T protein:vir:84 117 AR--DEANRPTAIMPIHPDCIHVTDAKD---EDGDW------IEPVYR-------I--------DGKVVPNHRIMHIK-- 168 (409) T ss_pred EE--CCCCceEEEEEEcCceeEEEEcCC---CcceE------EEEEec-------C--------CceEEchhhEEEec-- Confidence 54 233358999999998887642211 11111 011111 1 11346788888774 Q ss_pred cccCCCC-cchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCc Q lcl|NC_018087. 241 LVDCCGK-NIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGK 319 (520) Q Consensus 241 L~d~~~~-~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGe 319 (520) ...+++. ..+|-++.|.+.+.....+++...-+----+--+-+.-++ |+|.+..+++..+.....+.| .|. T Consensus 169 ~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~n-------~g~ 240 (409) T protein:vir:84 169 RYPVAGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSD-ADLTPDQVKQTQKQWIQSHHN-------RRL 240 (409) T ss_pred CCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecC-CCCCHHHHHHHHHHHHHHhcc-------CCC Confidence 2444543 4578899999888888777776664333333345555554 678777777766666555433 232 Q ss_pred cccccccchhhhhhcccccCCCCCcceeecCCCC-CcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHH Q lcl|NC_018087. 320 VKNQANMMALTEDYWLQRRDGKAVTEVETLPGMT-GMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDE 398 (520) Q Consensus 320 v~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~-nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDE 398 (520) .+ .++ .|.+++.+--.. .+.-++-.++..+.+.++++||.+.|....+.+..+ +.+.-.- T Consensus 241 ------~~-vl~----------~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~--sn~e~~~ 301 (409) T protein:vir:84 241 ------PA-VMS----------AGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWG--TGIEEQG 301 (409) T ss_pred ------ee-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccc--chHHHHH Confidence 11 121 144555553211 122244455778999999999999986433222111 2233333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchh Q lcl|NC_018087. 399 LSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKY 478 (520) Q Consensus 399 lkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky 478 (520) +.|..++ |+--+.. +.+.|-..| .+| ..|+|++ ..+...++ ..|++.+..+-.- -. T Consensus 302 ~~f~~~~--l~P~~~~-ie~~l~~~L-~~g-----------~~i~fd~------~~l~~~d~-~~~~~~~~~~~~~--G~ 357 (409) T protein:vir:84 302 INFVRHT--LLPWLRC-IEQALDTFL-PRG-----------QFVKFNV------DGLMRGDV-TARFTAYQMGLQN--GI 357 (409) T ss_pred HHHHHHH--HHHHHHH-HHHHHHHhc-cCC-----------CeEEEec------hhhhccCH-HHHHHHHHHHHhC--CC Confidence 3454432 3222221 222222222 122 2344433 23333333 5566666655332 36 Q ss_pred hhHHHHHHHHhCCCHHHHHHHHH-----------HHHHhhhcCCccCCccccC Q lcl|NC_018087. 479 ISNHTAMKDFLQMSDEDIAAERK-----------LIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 479 ~S~~~i~k~IL~~tDeeI~~~~k-----------qi~~E~~~~~~~~p~~e~~ 520 (520) ++.+-+++ ++.+.+- +.-++ +...+.+. .+|++..- T Consensus 358 ~t~NE~R~-~~g~~p~--~ggD~~~~~~n~~~~~~~~~~~~~---~~~~~~~~ 404 (409) T protein:vir:84 358 WSVNEVRA-WEDAPPI--PEGDIHLQPMNFVPLGYVPPEEPA---QEPQPNSA 404 (409) T ss_pred cCHHHHHH-HhCCCCC--CCcceeeecccccccccCCccccC---cCCCCCCc Confidence 78888775 4677542 11111 00000000 11111111 No 101 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=96.87 E-value=0.00029 Score=39.99 Aligned_cols=391 Identities=9% Similarity=0.086 Sum_probs=179.4 Q ss_pred cccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHh Q lcl|NC_018087. 3 MLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLL 82 (520) Q Consensus 3 ~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma 82 (520) |+|.+.. .. +..+...++...+. -+ ++...+.++. -+..+ T Consensus 1 m~f~~~~--------~~----------~~~~~~~~~~~~~~-~~-------g~~~~~~~v~--------------~~~al 40 (409) T protein:vir:10 1 MLFRKGF--------KN----------QSQEISIDDKKILE-WL-------GINPSETYVN--------------GKSCL 40 (409) T ss_pred Ccccccc--------cC----------cCCCCCCChHHHHH-Hh-------cCCcCcceec--------------hhhhh Confidence 5544221 11 11122222111000 00 0111111211 12346 Q ss_pred hccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhc----chhhhHHHH----Hhhccccc Q lcl|NC_018087. 83 NNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLN----FQRKGSDHF----KRWYVDSR 154 (520) Q Consensus 83 ~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~----f~k~g~~~f----RrWYvDgr 154 (520) ++|-|..||+-|.+.+.-++ +.|--+... .+.+ +-..+..+|+ =..++.++. ..+.+.|. T Consensus 41 ~~~~v~~~i~~ia~~ia~lp-----~~~~~~~~~-~~~~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn 108 (409) T protein:vir:10 41 KQATVFGCIRILSDNISKLP-----IKIYQKKDG-IKRV------PDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGN 108 (409) T ss_pred ccHHHHHHHHHHHHhhhhCc-----eEEEEecCC-eeec------cCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCC Confidence 79999999999988876432 212111100 0000 0111222332 233444444 44677899 Q ss_pred eeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccE Q lcl|NC_018087. 155 VFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAM 234 (520) Q Consensus 155 i~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI 234 (520) -|..++-+. + .-+++|.+|+|..|+.+.+-. +... ....-+|.|... .+....++.+.| T Consensus 109 a~~~i~r~~-~--G~~~~L~~i~~~~V~v~~~~~-----~~~~-~~~~~~y~~~~~------------~g~~~~~~~~ev 167 (409) T protein:vir:10 109 AYVALDFKK-N--GEIKGLYPLKSDGMKIFVDDT-----GLLN-SENNVWYLYTDD------------LGQRHKFMSDEI 167 (409) T ss_pred eEEEEEEcC-C--CcEEEEEEEcCCceEEEEcCC-----cccc-ccceEEEEEEeC------------CceeEEeccccE Confidence 999888652 2 248999999999998764322 1110 111112222211 123468899999 Q ss_pred EEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHh-hcceeEe Q lcl|NC_018087. 235 VYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNS-HRNRISY 313 (520) Q Consensus 235 ~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~-~knklvY 313 (520) +|.. + ..+++...+|-|+.|..++.....+++...=+=--.+.-+-|..++ +.+.+..+++ +++.+++ |.- T Consensus 168 ih~r-~-~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~l~~e~~~~-~~~~~~~~~~g---- 239 (409) T protein:vir:10 168 LHFK-G-LTADGLAGLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYA-GDLNPEAEEV-FKENFERMSSG---- 239 (409) T ss_pred EEec-C-cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC-CCCCHHHHHH-HHHHHHHHhcc---- Confidence 9885 2 3567767789999999999888888776554433344556777776 4566554433 4433333 221 Q ss_pred ecCCCccccccccchhhhhhcccccCCCCCcceeecCCC-CCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccc Q lcl|NC_018087. 314 DARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGM-TGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMST 392 (520) Q Consensus 314 d~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~ 392 (520) .++..+.+ +++ .|++++.|.-. ..+.-++-.++..+.+.++++||.+-|...++.+ + + T Consensus 240 ------~~n~~~~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~--~--~ 298 (409) T protein:vir:10 240 ------LKNAHRIA-MLP----------IGYKFEPISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRAT--H--S 298 (409) T ss_pred ------ccccCCce-ecC----------CCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCc--c--c Confidence 11111222 221 24566666421 1222344566889999999999999886332211 1 1 Q ss_pred hhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018087. 393 AISRDELSFDKF-ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLM 471 (520) Q Consensus 393 eItRDElkF~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~ 471 (520) .+.-.-..|..+ |.-+..++ .+.|-. .++++.+.. ....+.|.-+. +... -+..|.+.++.+ T Consensus 299 ~~e~~~~~f~~~~l~P~~~~i----e~~ln~-----kL~~~~~~~---~~~~~~fd~~~----ll~~-d~~~~~~~~~~~ 361 (409) T protein:vir:10 299 NITEQNREFYIDTLQSILNMY----ELEINY-----KLFLISEIK---NGFYSKFNVDT----ILRA-DIKTRYESYKEA 361 (409) T ss_pred cHHHHHHHHHHHHHHHHHHHH----HHHHHH-----hhcCchhcc---CCcEEEEechh----hhcc-CHHHHHHHHHHH Confidence 122222334443 22222222 222222 234444322 22334444322 2221 123455555544 Q ss_pred hcccchhhhHHHHHHHHhCCCHHHHHHHHHHHH-------HhhhcCCccCCcccc Q lcl|NC_018087. 472 EPYIGKYISNHTAMKDFLQMSDEDIAAERKLID-------EELSDKIFNPPEPEE 519 (520) Q Consensus 472 ~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~-------~E~~~~~~~~p~~e~ 519 (520) -.- -+++.+-++. +|++.+-+ --++.+- +...+... +.- |+ T Consensus 362 ~~~--G~~T~NE~R~-~lgl~p~~--ggD~~~~~~n~~~~~~~~~~~~-kgG-e~ 409 (409) T protein:vir:10 362 IQN--GFKTPNEIRE-LEEDEPLE--GGDVLLINGNMIPVKMAGEQYS-KGG-EK 409 (409) T ss_pred HhC--CCcCHHHHHH-HhCCCCCC--CcCeeeeccCccchhhcccccc-ccC-CC Confidence 222 2567777764 46664321 0011000 00000000 000 00 No 102 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=96.82 E-value=0.00032 Score=39.74 Aligned_cols=339 Identities=12% Similarity=0.107 Sum_probs=150.7 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRS 80 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ 80 (520) || +|+.. ... . + +..++.......+ ++...+.++.- +. T Consensus 1 M~--------~~~~f---~~r--------~--~---~~~~~~~~~~~~~----~~~~~~~~v~~--------------~~ 38 (359) T protein:vir:10 1 MS--------ILNPF---ERR--------S--S---ITPNNYYPFMVQN----GSIVPNSLVDA--------------TE 38 (359) T ss_pred Cc--------ccchh---hcc--------c--c---CCCCcchhhhhcc----ccccCCcccCH--------------HH Confidence 33 33311 100 1 1 1111211111100 01111112111 12 Q ss_pred HhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHH----hhcccccee Q lcl|NC_018087. 81 LLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFK----RWYVDSRVF 156 (520) Q Consensus 81 ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fR----rWYvDgri~ 156 (520) -+++|-|..||+-|.+.+.-.+ +.+. ..+..+++-=|-..++.++.+ .+..+|--| T Consensus 39 al~~~av~~cv~~ia~~ia~~p---------~~~~-----------~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay 98 (359) T protein:vir:10 39 ALKNSDLYAVTSLISSDIAGTR---------FIGN-----------QVFTSVLNNPSHLTNAFSFWQTAILNLLLNGNVF 98 (359) T ss_pred hhcchHHHHHHHHHHHhhhcCc---------cccc-----------hHHHHHhhcccccCCHHHHHHHHHHhccccCceE Confidence 3567889999999988654221 1111 112223333333445555544 455679888 Q ss_pred EEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEE Q lcl|NC_018087. 157 FHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVY 236 (520) Q Consensus 157 ~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y 236 (520) ..++-| .+ .-+++|.+|+|..+..... .++.. |.++.. ..+...+++.+.|+| T Consensus 99 ~~i~r~-~~--g~~~~l~~l~~~~v~i~~~-----~~~~~-------y~~~~~------------~~~~~~~~~~~evih 151 (359) T protein:vir:10 99 LAILKG-DN--SLMKELRLIPSNAITIDLT-----DDTLT-------YEVNQF------------DDYPSAKYNASEMIH 151 (359) T ss_pred EEEEEC-CC--CeEEEEEEeCCceEEEEEc-----CCeEE-------EEEEec------------CCceEEEEcccceEE Confidence 877644 22 2378999999988876321 11111 111111 112235788888887 Q ss_pred eec---ccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEe Q lcl|NC_018087. 237 AHS---GLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISY 313 (520) Q Consensus 237 ~hS---GL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvY 313 (520) .-. +.-..+|...+|.|+.|..++.....+++...-+=---+--+-+..++-|++.+..+++ +++-..++.. . T Consensus 152 ~~~~~~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~-~~~~~~~~~~--~- 227 (359) T protein:vir:10 152 VKIMAYGVDTLHNLVGHSPLESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQGTLSSEAKDS-IRKEFEKANG--G- 227 (359) T ss_pred eccCCCCCCccCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHH-HHHHHHHHhC--c- Confidence 732 11123555678999999999998888887655332222334667777777766655443 3333344321 0 Q ss_pred ecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChH---HHHHHHHHHHHHhcCCChhhccCCCccccccc Q lcl|NC_018087. 314 DARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEM---DDILYFRKALYMALRVPLSRIPDEQTQNVFDM 390 (520) Q Consensus 314 d~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei---~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~ 390 (520) ...|++ + .++ .|.+++.|. .+.-++ +-.+|-...+.++++||.+-|...++. T Consensus 228 -~n~g~~------~-vl~----------~g~~~~~l~--~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~----- 282 (359) T protein:vir:10 228 -NNSGRV------M-VLD----------QSADFSTVS--INADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQ----- 282 (359) T ss_pred -cccCCc------e-ecC----------CCcceeeec--CCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCcc----- Confidence 112221 1 221 255666663 333333 344566788999999999998532211 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHH---- Q lcl|NC_018087. 391 STAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVN---- 466 (520) Q Consensus 391 ~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~---- 466 (520) .+..+.-|-.+..|+...-.....-+...|-+++.+ + ....+. |. .++..+++. ..++ T Consensus 283 ~~~~~~~e~~~~~~l~~~l~p~~~~l~~~l~~~~~~---------~-~~~~~~--~d-----~~~~~~~~~-~~~~~G~~ 344 (359) T protein:vir:10 283 QSSLDQIKDLYVNALNRFIEPLISELRIKCDSSIGV---------D-MSPITD--YS-----NSVFKADIL-NWVKEGII 344 (359) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcc---------c-chhhhh--cC-----HHHHHHHHH-HHHhCCCc Confidence 111111122233332111111111111111111111 1 000011 11 112222211 1111 Q ss_pred ------HHHHhhccc Q lcl|NC_018087. 467 ------VLSLMEPYI 475 (520) Q Consensus 467 ------~~~~~~p~v 475 (520) .+-.+.|+. T Consensus 345 t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 345 EPTEAKTLLESKGII 359 (359) T ss_pred CHHHHHHHhCCCCCC Confidence 111335554 No 103 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=96.80 E-value=0.00033 Score=39.68 Aligned_cols=409 Identities=13% Similarity=0.059 Sum_probs=184.1 Q ss_pred hhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccc-cccccccccccccchhHHHHHHHHHHHhhccch Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYN-GVFQKLYGSQDPTATSTRELINTYRSLLNNYEV 87 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~-g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEv 87 (520) ++||..|.++... ++.+. ..|...-+..+.... |+- .. .+.. +. -+..+++|-| T Consensus 1 Mg~~~~l~~~~~~------------~~~~~-~~~~~~~~~~~~~~~~~~~--~~---~g~~------v~-~~~al~~~~v 55 (457) T protein:vir:62 1 MGFWSALFGRGHS------------PALDA-AEGRAWEPYDPSIYNLGAT--AS---SGER------VT-PHDALQVSAV 55 (457) T ss_pred Cchhhhhhccccc------------ccccc-ccccccccchhhhhhcccc--cc---CCce------ec-hHHhhccHHH Confidence 4444444332211 11110 011110000000000 100 00 0100 00 1345678999 Q ss_pred hHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHH-HHHHHHHHHHhcchhhhHHHHHh----hccccceeEEEeee Q lcl|NC_018087. 88 DNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLI-SDEFNSVLNMLNFQRKGSDHFKR----WYVDSRVFFHKIIN 162 (520) Q Consensus 88 d~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I-~eeF~~i~~ll~f~k~g~~~fRr----WYvDgri~~hkvid 162 (520) -.||+-|.+.+.-+ |+.|--+.- ..++.+ .-.+..+++--+=.-++.++++. +.+.|--|.-+. + T Consensus 56 ~~~i~~ia~~iA~l-----p~~~~~~~~----~~~~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~-~ 125 (457) T protein:vir:62 56 FASVRLLSETIATL-----PLSTYSKRG----GTRKEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVR-W 125 (457) T ss_pred HHHHHHHHHhHhhC-----ceEEEEecC----CccccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEE-e Confidence 99999998877533 222221111 111111 11122222211112345555555 556688776553 3 Q ss_pred cCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccc Q lcl|NC_018087. 163 PNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLV 242 (520) Q Consensus 163 ~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~ 242 (520) ...++.+|.+|+|..+...+..... .. ...|+.|.-...+ + ......++++.|+|+. .. T Consensus 126 ---~~g~~~~l~~l~p~~v~v~~~~~~~----~~----~~~~~~y~~~~~g-----~---~~~~~~~~~~eiih~r--~~ 184 (457) T protein:vir:62 126 ---AGPNIAGLDVLDPTKIHVHMVMVDG----LR----RKVFEAYDIDADG-----N---EVLLGWFTPRDVLHIP--GM 184 (457) T ss_pred ---CCCcEEEEEEEcCcceEEEEeccCC----cc----ceeEEEEEEccCC-----c---eeEEEeeCccceEEec--CC Confidence 2357999999999999886543311 11 1123333221110 0 0112357888887775 34 Q ss_pred cCCC-CcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccc Q lcl|NC_018087. 243 DCCG-KNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVK 321 (520) Q Consensus 243 d~~~-~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~ 321 (520) .+++ ...+|-++.|++.+.....+++...-+=---+--+-|..++ |.|-+..+++..+.+...|+. .+ T Consensus 185 ~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~ls~e~~~~~~~~~~~~~~G----------~~ 253 (457) T protein:vir:62 185 MLPGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVP-GTMSEEGLARAREAWRAANSG----------VD 253 (457) T ss_pred CCCCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcC-CCCCHHHHHHHHHHHHHHhcC----------cc Confidence 5555 35688999999999888888877654433334555677776 566665555444444333432 11 Q ss_pred cccccchhhhhhcccccCCCCCcceeecCCCCCcChHH---HHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHH Q lcl|NC_018087. 322 NQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMD---DILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDE 398 (520) Q Consensus 322 d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~---DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDE 398 (520) +..+.+ .++ .|.+++.|. .+.-++. =-+|-...+.++++||...|..-.+.+..| +.+...- T Consensus 254 nag~~~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~--sn~eq~~ 318 (457) T protein:vir:62 254 NAHRVA-LLT----------EGAKFSKVA--MSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWG--SGLAEQN 318 (457) T ss_pred ccCcce-ecC----------CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCccccc--chHHHHH Confidence 111222 221 245555553 3333333 234677889999999999885433222111 2244444 Q ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccch Q lcl|NC_018087. 399 LSFDKFI-SELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGK 477 (520) Q Consensus 399 lkF~KFI-~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgk 477 (520) +.|.+++ .-+..++.. .|-.. ++++.+. ....+.|.-+.. ... -++.|++++..+-.- - T Consensus 319 ~~f~~~~l~P~~~~ie~----~ln~~-----L~~~~~~----~~~~i~fd~~~l----~~~-d~~~r~~~~~~~~~~--G 378 (457) T protein:vir:62 319 IAFTMFSLRPWLERIEA----GFNRL-----LFAETAD----RFRFVKFNLDEI----KRG-APKERMELWSLGLQN--G 378 (457) T ss_pred HHHHHHHHHHHHHHHHH----HHHhh-----hcCcccc----CceEEEeechhh----hcc-CHHHHHHHHHHHHhC--C Confidence 4566653 333333333 23333 3444332 223444543332 111 234566666654322 3 Q ss_pred hhhHHHHHHHHhCCCHHHHHHHHH-----------HHHHhhh--cCCccCCccccC Q lcl|NC_018087. 478 YISNHTAMKDFLQMSDEDIAAERK-----------LIDEELS--DKIFNPPEPEEI 520 (520) Q Consensus 478 y~S~~~i~k~IL~~tDeeI~~~~k-----------qi~~E~~--~~~~~~p~~e~~ 520 (520) +++.+-+++ +++|..-+=-.-++ ...+..+ .+--.+|.+++- T Consensus 379 ~~T~NE~R~-~~gl~pi~~g~~D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (457) T protein:vir:62 379 IYSIDEVRA-AEDMTPLPDGLGEKYRVPLNLGEIGEEPEPEPAPAPPAIDPPAEEP 433 (457) T ss_pred CcCHHHHHH-HhCCCCCCCCCcceeeeccccccccccccccccCCCccCCCCccCC Confidence 678888885 46775431000000 0000001 111111111111 No 104 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=96.79 E-value=0.00034 Score=39.60 Aligned_cols=391 Identities=9% Similarity=0.073 Sum_probs=184.7 Q ss_pred hHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchhHHHHhhhceeeE Q lcl|NC_018087. 21 TEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNAVQEIVSDAIV 100 (520) Q Consensus 21 ~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai~eIvneaiv 100 (520) .-++...+.+... +.... . ... .++++++.. +.+. .+ +. ....++|-|..||+-|.+.+.- T Consensus 1 ~~f~~~f~r~~~~--~~~~~-~-~~~----~~~~~~~~~-~~g~--~v-~~-------~~~l~~~~v~~~i~~Ia~~iA~ 61 (413) T protein:vir:48 1 MFFSGLFQRKSDA--PVTTP-A-ELA----EAIGLSYDT-YTGK--RI-SS-------QRAMRLTAVYSCVRVLAESVGM 61 (413) T ss_pred CccchhhccCccC--Cccch-H-HHH----HhhhcCccc-ccCc--ee-ch-------hhhhccHHHHHHHHHHHHhhhh Confidence 3333333322211 11100 0 000 011111111 1110 01 11 2345789999999999998763 Q ss_pred ecCCCcEEEEeeccchhhhHHHHHH-HHHHHHHHHH-hcchhhhHH----HHHhhccccceeEEEeeecCCCCCCeeeeE Q lcl|NC_018087. 101 YEEGFDVVSIDLDQTAFTENIRNLI-SDEFNSVLNM-LNFQRKGSD----HFKRWYVDSRVFFHKIINPNRPKDGIIELR 174 (520) Q Consensus 101 ~d~~~~~V~l~Ld~~~~s~~ik~~I-~eeF~~i~~l-l~f~k~g~~----~fRrWYvDgri~~hkvid~~~~k~GI~elr 174 (520) ++ +.+- .. ++..+..+ .....++|+. =|-..++.+ ++..+.+.|.-|..++-+ ...+.+|. T Consensus 62 ~p-----~~~~--~~--~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~----~g~~~~L~ 128 (413) T protein:vir:48 62 LP-----CSLY--KI--SGTLKTRVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA----LGEVVELL 128 (413) T ss_pred Cc-----eEEE--Ee--cCCcceeecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC----CCcEEEEE Confidence 32 1111 00 11111111 1111222221 122344444 444567789888776532 23589999 Q ss_pred ecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccccCCCCcchhhhH Q lcl|NC_018087. 175 RLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDCCGKNIIGYLH 254 (520) Q Consensus 175 ~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~ 254 (520) +|+|..+....+ .++.. -|.++.+ .+....++.+.|+|... ...++...+|-|+ T Consensus 129 ~l~~~~v~~~~~-----~~~~~------~y~~~~~-------------~g~~~~~~~~evih~~~--~~~d~~~G~s~i~ 182 (413) T protein:vir:48 129 PIDPGCVEPKLN-----SQWQP------VYQVTFP-------------DGSVDVLTQDEIWHVRT--LTLDGLVGLNPIA 182 (413) T ss_pred EEcCceEEEEEc-----CCceE------EEEEEec-------------CceEEEEccccEEEecC--cCCCCcccccHHH Confidence 999999887432 11111 1222111 12234688899988852 3556667789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhc Q lcl|NC_018087. 255 RAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYW 334 (520) Q Consensus 255 ~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDyw 334 (520) .|.+++.....+++...-+---.+.-+-++.++ +.+.+..+++-.+.+...|+. . ...|. .| .+ T Consensus 183 ~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~~~~e~~~~~~~~~~~~~~g--~--~n~g~------~~-vl---- 246 (413) T protein:vir:48 183 YAREAISLAAATEEHGARLFGNGAVTSGVLRTE-QKLTPDAYERLKKDFEERHTG--L--GNAHR------PM-IL---- 246 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCcceEEEeC-CCCCHHHHHHHHHHHHHHhcC--c--cccCc------ce-ec---- Confidence 999999888888887766555556667888887 567776666665555555543 0 11121 11 11 Q ss_pred ccccCCCCCcceeecCCCCCcCh---HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHH-HHHHHH Q lcl|NC_018087. 335 LQRRDGKAVTEVETLPGMTGMNE---MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKF-ISELQH 410 (520) Q Consensus 335 LpRReGgrgTEIsTLpGg~nLge---i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KF-I~rLr~ 410 (520) ..|.++..|. .+..+ ++-.++....+.++++||..-|...++.. .+.+.-..+.|.++ |.-+-. T Consensus 247 ------~~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t----~~n~e~~~~~f~~~~i~P~~~ 314 (413) T protein:vir:48 247 ------EMGLDWKSMA--LNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRAT----FNNIEELGLGFINYSLVPYLT 314 (413) T ss_pred ------CCCceEEecc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCC----cccHHHHHHHHHHHHHHHHHH Confidence 1255666663 23333 34455778899999999998886432211 11122223335443 222222 Q ss_pred HHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhC Q lcl|NC_018087. 411 KFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQ 490 (520) Q Consensus 411 rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~ 490 (520) ++.. . +-+.++++.++. ..+|+|++ + ++...+ +..|.++++.+-. +-+++.+-++. +++ T Consensus 315 ~ie~----~-----l~~~L~~~~~~~--~~~~~fd~--~----~l~~~d-~~~~~~~~~~~~~--~g~~T~NE~R~-~~g 373 (413) T protein:vir:48 315 RIEQ----R-----INTGLVRESKQG--KFYAKFNA--G----ALLRGD-MKSRFEAYATGIN--WGIYSPNDCRD-LED 373 (413) T ss_pred HHHH----H-----HHhhccCccccC--CeEEEEec--h----hhhccC-HHHHHHHHHHHHh--CCCcCHHHHHH-HhC Confidence 2222 1 334556666553 22344433 2 333221 2445666554422 23567777774 467 Q ss_pred CCHHHHHHHHHHHHHhhhcC-----Cc--cCCc--cccC Q lcl|NC_018087. 491 MSDEDIAAERKLIDEELSDK-----IF--NPPE--PEEI 520 (520) Q Consensus 491 ~tDeeI~~~~kqi~~E~~~~-----~~--~~p~--~e~~ 520 (520) +.+-| .-++..-.-...+ -- ++++ +++- T Consensus 374 ~~p~~--ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~ 410 (413) T protein:vir:48 374 MNPRP--GGDVYLTPMNMTTSPSAGDDNGKKKESGDADK 410 (413) T ss_pred CCCCC--CcceeeccccccccccccccCCCCCCCCCccc Confidence 75431 1111000000000 00 0000 0000 No 105 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=96.75 E-value=0.00036 Score=39.44 Aligned_cols=403 Identities=12% Similarity=0.166 Sum_probs=186.3 Q ss_pred cccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeec-ccccccccccccccccccccchhHHHHHHHHHHH Q lcl|NC_018087. 3 MLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVD-SQDIAYNGVFQKLYGSQDPTATSTRELINTYRSL 81 (520) Q Consensus 3 ~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~-~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~m 81 (520) |-....++||+-. +.. . +...|....+..... .+..+ ..+....++....+ ..... T Consensus 1 ~~~~~~mg~f~r~---~~~-----~-----~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~g~~v--------~~~~a 57 (432) T protein:vir:81 1 MPDEKKLGLFGQL---KAM-----F-----VPPDPVDIGGGQTFTPVNATA--RDLGIIISDTGAAV--------NADAI 57 (432) T ss_pred CCchhhcchhhhh---hhh-----c-----ccccccccccccccccCccch--hhhcccccccCccc--------chHhh Confidence 6666777777632 111 0 001111111111111 00000 11111111111111 12344 Q ss_pred hhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcc----hhhhHHH----HHhhcccc Q lcl|NC_018087. 82 LNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNF----QRKGSDH----FKRWYVDS 153 (520) Q Consensus 82 a~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f----~k~g~~~----fRrWYvDg 153 (520) +++|-|..||..|.+.+.-++ +.|--+.. +..++.+. .-+..+|+. ..++.++ +..+.++| T Consensus 58 l~~~~V~~~i~~Ia~~ia~lp-----~~~y~~~~---~g~~~~~~---~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~G 126 (432) T protein:vir:81 58 MRLDAVAACVKLVSQAIAAMP-----LTMYMRTP---DGRKEAVN---HPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDG 126 (432) T ss_pred hccHHHHHHHHHHHHhhhhCc-----eeeEEecC---Ccceeccc---chHHHHHHhcccccCCHHHHHHHHHHHHhhcC Confidence 578999999999999877542 22211100 11111111 112333322 2344443 44577889 Q ss_pred ceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCccc Q lcl|NC_018087. 154 RVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSA 233 (520) Q Consensus 154 ri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~a 233 (520) .-|..++-+ + ..+++|.+|+|..+...++ .+|.. +|.|.. ..+..+.++.+. T Consensus 127 nayv~i~~~--~--g~~~~L~~l~~~~v~v~~~-----~~g~~-------~y~~~~------------~~g~~~~~~~~~ 178 (432) T protein:vir:81 127 TAYVRKVVT--D--GRIESLQYLANDRLTITTD-----PKGNT-------AYRYRR------------TDGQMIDIPKQQ 178 (432) T ss_pred CeEEEEEec--C--CcEEEEEEEcCCceEEEEC-----CCCcE-------EEEEEe------------cCceEEEEcccc Confidence 998887654 2 3589999999999988643 22221 222211 112346889999 Q ss_pred EEEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhc--CccceEEEccCCCCchHHHHHHHHHHHHhhccee Q lcl|NC_018087. 234 MVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITR--APDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRI 311 (520) Q Consensus 234 I~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~R--ApeRRvFyIDvGnlpk~KAeqyl~~im~~~knkl 311 (520) |+|.. + ...+|-..+|-|+.|.+.+..-..+++... +.++ +--.-+..+| +.|.+..+++.. .+|+.- T Consensus 179 iih~r-~-~~~dg~~G~spi~~~~~~i~~~~~~~~~~~--~~f~ng~~~~gil~~~-~~l~~e~~~~~~----~~~~~~- 248 (432) T protein:vir:81 179 IWKIM-G-YSLDGENGLSAIRYGAQIFGTAIAAEAQAA--RAFRNGQLQSVYYQID-RFLTDDQYDSFA----KKVSGS- 248 (432) T ss_pred EEEec-C-CCCCCcccccHHHHHHHHHHHHHHHHHHHH--HHHhcCCCcceEEecC-CCCCHHHHHHHH----HHHhhh- Confidence 98874 2 355666678899999988887777766543 2232 2223455554 555544443332 223210 Q ss_pred EeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChH---HHHHHHHHHHHHhcCCChhhccCCCccccc Q lcl|NC_018087. 312 SYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEM---DDILYFRKALYMALRVPLSRIPDEQTQNVF 388 (520) Q Consensus 312 vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei---~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~ 388 (520) ...|. .+ .++ + |++++.|. .+.-++ +-.++....+.++.+||..-|........ T Consensus 249 ---~nag~------~~-vl~-------~---g~~~~~l~--~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~- 305 (432) T protein:vir:81 249 ---VEAGR------AP-LLE-------G---GMDVKSLG--LNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTT- 305 (432) T ss_pred ---hcCCC------ce-ecC-------C---CceEEEcc--CCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccc- Confidence 01121 11 222 2 45566553 333333 33457788999999999999864332111 Q ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 389 DMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVL 468 (520) Q Consensus 389 G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~ 468 (520) +.++.+.-.-+-|.++ .|+--+.. +.+-|-. ++.++.++. ...+.|.-+ ++.... +++|.+.+ T Consensus 306 ~~~sn~eq~~~~f~~~--tl~P~~~~-ie~~l~~-----kLl~~~~~~----~~~~~fd~~----~llr~d-~~~r~~~~ 368 (432) T protein:vir:81 306 SWGSGIESQQLGFLTM--TLSPWLRR-IEQSIAL-----NLLSPAERR----RYFADFDTS----ALLRAD-SAARSSYY 368 (432) T ss_pred cccchHHHHHHHHHHH--HHHHHHHH-HHHHHHh-----hccCccccC----ceEEEeech----hhhccC-HHHHHHHH Confidence 2222232222335443 33332222 2222222 345555543 244555433 222222 35677777 Q ss_pred HHhhcccchhhhHHHHHHHHhCCCHHH----HHHHHHH---HHHhhhcC------CccCCccccC Q lcl|NC_018087. 469 SLMEPYIGKYISNHTAMKDFLQMSDED----IAAERKL---IDEELSDK------IFNPPEPEEI 520 (520) Q Consensus 469 ~~~~p~vgky~S~~~i~k~IL~~tDee----I~~~~kq---i~~E~~~~------~~~~p~~e~~ 520 (520) +.+-. .-+++.+-++. .|++..-+ .-..... ++.-..++ --.+.+.+++ T Consensus 369 ~~~~~--~G~~t~NE~R~-~~glpp~~g~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~n~~~~~~ 430 (432) T protein:vir:81 369 SQLVN--NGLMTRDEARE-IEGLPKLGGNAAVLTVQSAMVPLDSIGLQASPEPASGLGNQQQDKV 430 (432) T ss_pred HHHHh--CCCCCHHHHHH-HhCCCCCCCCcceEeecCcccchhhhccCCCCCCCCCCCCcccccc Confidence 66632 23678888885 47776422 1000000 01100000 0111111222 No 106 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=96.75 E-value=0.00036 Score=39.43 Aligned_cols=420 Identities=11% Similarity=0.056 Sum_probs=184.6 Q ss_pred ccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhh Q lcl|NC_018087. 4 LADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLN 83 (520) Q Consensus 4 ~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~ 83 (520) +++-.=+++ + .++ +...+ ..++- +. .++..+.+....... ... ...++ T Consensus 1 ~~~~~~~~~--------~-------~~~-~~~~~-~~~~~--~~----~~g~~~~~~~~~~~~-------~~~--~~a~~ 48 (460) T protein:vir:10 1 MANRIIRAL--------R-------ELT-GLDNK-FNDAF--IK----YIGQTFTKYDNNGKT-------YLE--QGYNI 48 (460) T ss_pred CchhHHHHH--------h-------hhh-ccCCC-chHHH--HH----hhccccCCCccchhh-------hhH--HHHhc Confidence 111111111 1 000 11000 00100 00 001111111100000 000 12467 Q ss_pred ccchhHHHHhhhceeeEe-----cCCCcEEEEeecc-chhhhHHHHHHHHHHHHHHH--------------HhcchhhhH Q lcl|NC_018087. 84 NYEVDNAVQEIVSDAIVY-----EEGFDVVSIDLDQ-TAFTENIRNLISDEFNSVLN--------------MLNFQRKGS 143 (520) Q Consensus 84 ~pEvd~Ai~eIvneaiv~-----d~~~~~V~l~Ld~-~~~s~~ik~~I~eeF~~i~~--------------ll~f~k~g~ 143 (520) +|.|-.||+-|.+.+.-+ ....+...-.... ....+.+...++..+++.+. -=|-..++. T Consensus 49 ~~~v~~~v~~ia~~iA~lp~~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~ 128 (460) T protein:vir:10 49 NPDVYSCISQMAAKTVAVPYTIKVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWA 128 (460) T ss_pred chHHHHHHHHHHHhhhhCceEEEeccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHH Confidence 899999999999886533 2222221111111 11223333444444444332 223334555 Q ss_pred HHHH----hhccccceeEEEeeecCCCCCC-eeeeEecCccceeeeeeccCCCCcccccccceecceeecCccccccccc Q lcl|NC_018087. 144 DHFK----RWYVDSRVFFHKIINPNRPKDG-IIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGH 218 (520) Q Consensus 144 ~~fR----rWYvDgri~~hkvid~~~~k~G-I~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~ 218 (520) ++.+ .+.+.|.-|..++-+......| +.+|.+|+|..+...+... +...... | ....|. T Consensus 129 ~f~~~~~~~lll~Gnay~~i~r~~~~~~~G~~~~L~~l~~~~v~v~~~~~-----~~~~~~~------~--~~~~~~--- 192 (460) T protein:vir:10 129 DIYSLYKTYMRLNGNCYFYLMSPDDGINAGVPSQMYVLPAHLIKIVLKDD-----INLLSTD------S--PIKSYM--- 192 (460) T ss_pred HHHHHHHHHHhhcCCeEEEEEecCCCccCceeEEEEEEcCceEEEEEcCC-----Cceeeee------e--eeeEEE--- Confidence 5444 4678899999888775544445 6789999999998853322 2111110 0 000110 Q ss_pred ceecCCcceecCcccEEEee--cccccCC--CCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchH Q lcl|NC_018087. 219 QHFAAGTKIKIPYSAMVYAH--SGLVDCC--GKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPAR 294 (520) Q Consensus 219 ~~~~~~~~~~I~~~aI~y~h--SGL~d~~--~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~ 294 (520) ....+..+.++.+.|+|+. +-..+++ +...+|.+..|.+.+.....+++...-+--.-++-. ..+..-+.|.+. T Consensus 193 -~~~~g~~~~~~~~evih~r~~~~~~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~~~-~i~~~~~~l~~e 270 (460) T protein:vir:10 193 -LIQGDQFIEFNEDEVIHTKYANPNFDLQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQNGGVFG-FIHGGSTGLTQP 270 (460) T ss_pred -EecCceeEEecccceEEEecCCCCcccccCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc-eeeecCCCCCHH Confidence 1112334688999998873 2333443 334578899998888888888887776655556554 456666777777 Q ss_pred HHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcC Q lcl|NC_018087. 295 KAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALR 373 (520) Q Consensus 295 KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~ 373 (520) .+++..+.+...|+.. ...|. .+ .++ .|.+++.|.-...-.| ++-.+|..+.+.++++ T Consensus 271 ~~~~~~~~~~~~~~g~----~n~g~------~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fg 329 (460) T protein:vir:10 271 QADSLKQRLTEMDKSP----DRLSQ------IA-GAS----------GEIAFTKISLNTDELKPFDYLKYDQKAICNALG 329 (460) T ss_pred HHHHHHHHHHHHhcCc----cccCC------ce-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhC Confidence 7666655555555421 01222 22 121 2556666643222222 4455677899999999 Q ss_pred CChhhccCC-CccccccccchhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccch Q lcl|NC_018087. 374 VPLSRIPDE-QTQNVFDMSTAISRDELSFDKF-ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSY 451 (520) Q Consensus 374 VP~SRl~~~-~~~~~~G~~~eItRDElkF~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~ 451 (520) ||.+-|... ++... +. .+.-.-..|..+ |.-+-.++..-|.. . ++++.+.. ....|.++|.. . T Consensus 330 VPp~~lg~~~~~t~~-~s--n~e~~~~~f~~~~l~P~~~~ie~~ln~----k-----l~~~~~~~-~~~~i~~d~~~--l 394 (460) T protein:vir:10 330 WSDKLLNNNEGGGLN-TG--NLEEERKRVVTDNIQPDLVILKQAFDK----K-----FIKRFKGY-ENAVIEWDISE--L 394 (460) T ss_pred CCHHHhCCCCCCCCc-cc--cHHHHHHHHHHHHHHHHHHHHHHHHHH----h-----hcCccccc-CCceEEeecch--h Confidence 999888643 22111 11 122222334443 33333333332222 2 23333221 12234444431 1 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHH--HHHH---HH---HHHHHhhhcCCccCCccccC Q lcl|NC_018087. 452 FSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDE--DIAA---ER---KLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 452 f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDe--eI~~---~~---kqi~~E~~~~~~~~p~~e~~ 520 (520) .+++ +-++.|.+++. .-.++.+-+++ +++|..- +--+ .. .-++ ...+...+.++...= T Consensus 395 -~~l~--~d~~~~~~~~~------~g~~T~NE~R~-~~g~~pi~~~~gD~~~~~~n~~~~~-~~~~~~~~~~~nq~~ 460 (460) T protein:vir:10 395 -PEMQ--TDMVAMASWLN------TIPVTPNEIRI-AMKYETLNQDGMDIVFMPSNKVRID-DVSNNLIDSAFNQNQ 460 (460) T ss_pred -hhHH--HHHHHHHHHHh------CCCCCHHHHHH-HhCCCCCCCCCCCeeeecccccchh-hcccccCCCcccCCC Confidence 1111 11222322221 12456666663 3555421 1000 00 0000 000000000000000 No 107 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=96.73 E-value=0.00038 Score=39.35 Aligned_cols=381 Identities=11% Similarity=0.078 Sum_probs=167.1 Q ss_pred ccccccchhHHHH-------HHHHHHHhhccchhH-----------------------------------HHHhhhceee Q lcl|NC_018087. 62 GSQDPTATSTREL-------INTYRSLLNNYEVDN-----------------------------------AVQEIVSDAI 99 (520) Q Consensus 62 ~~~~~~~~~~~~L-------I~~YR~ma~~pEvd~-----------------------------------Ai~eIvneai 99 (520) .+++.-.+-...+ +.+|..+..+.+-.. =...||+-.+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 80 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKK 80 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhh Confidence 2333222212222 223333333332221 1223333322 Q ss_pred EecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCcc Q lcl|NC_018087. 100 VYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPR 179 (520) Q Consensus 100 v~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr 179 (520) -+=- ..||++..++.+..+ ..+.+.+ =+|+....++.+.+.+-|+-|.+.-+|.+ +|-..+..+||+ T Consensus 81 ~yl~-G~p~~~~~~~~~~~~--------~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~---~g~~~~~~~~p~ 147 (471) T protein:vir:10 81 AYAL-TYPPTFDVDDKKVND--------MIVDVLG-DDYERISKQLCVNAGNAGIAWLHVWKDAS---DNSFRYACVDSK 147 (471) T ss_pred hhhc-ccCceeccCChHHHH--------HHHHHHh-cCHHHHHHHHHHHHhhCCeEEEEEEeeCC---CCeeEEEEEccc Confidence 2222 367777766654333 3333343 37788888899999999999999988855 467789999999 Q ss_pred ceeeeeeccC--CCCcccccc--------cceecceeecCc-ccccccccce---------------ecCCc-------- Q lcl|NC_018087. 180 NVQFVRELDT--KMENGVKVV--------KGYREYFLYDTE-LESYQCGHQH---------------FAAGT-------- 225 (520) Q Consensus 180 ~i~~vr~i~~--~~~~~~~~~--------~~~~ey~~y~~~-~~~~~~~~~~---------------~~~~~-------- 225 (520) .+-++.+-.. +...+++.+ +.+.-+-+|++. ...|...... ...+. T Consensus 148 ~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (471) T protein:vir:10 148 EVIPIYSKSLDKKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFK 227 (471) T ss_pred ceEEEEcCCCCCceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCccccccccccccccccccccccccccccc Confidence 9887754321 111222211 111122333322 1111111000 00000 Q ss_pred -ce-ecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHH Q lcl|NC_018087. 226 -KI-KIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLK-LLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQH 302 (520) Q Consensus 226 -~~-~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~-m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~ 302 (520) ++ +|| |+++ +|+....|-|+..+.....+. ++=+..-.-+-+..|-+-+.-.+.-.++ +.+.. T Consensus 228 ~~~g~iP---vv~~------~n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~-----~~~~~ 293 (471) T protein:vir:10 228 HDFGLVP---FIPF------KNNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQ-----EFLED 293 (471) T ss_pred CCCCcee---EEEe------ccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccc-----hhHHH Confidence 00 122 2222 122233455554444333333 1222222234444553333222211111 11111 Q ss_pred HHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccC Q lcl|NC_018087. 303 IMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPD 381 (520) Q Consensus 303 im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~ 381 (520) +.++ +...-..+| .+.|-.++.|--..+... -.-+.-..+.+|...++|- +.+ T Consensus 294 -~~~~--~~i~~~~~~---------------------~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~--~~~ 347 (471) T protein:vir:10 294 -LKRY--KMIKMDNDG---------------------MGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVN--PET 347 (471) T ss_pred -hhcC--CeEEecCCC---------------------CccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcC--CCc Confidence 1111 111111111 122334555544344432 3344667788888999884 222 Q ss_pred CCccccccccchhhHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHH Q lcl|NC_018087. 382 EQTQNVFDMSTAISRDELSF---DKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTI 458 (520) Q Consensus 382 ~~~~~~~G~~~eItRDElkF---~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~ 458 (520) ++ ||.++..+.. .++ ..-+.+.|+.|...+...++.=+-+-| ..+|. .+.+.|...---.+...+ T Consensus 348 ~~----~gn~Sg~Alk-~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~---~~d~~----~i~i~f~~~~p~n~~e~~ 415 (471) T protein:vir:10 348 DK----LGNSSGVALK-FLYSLLELKAGNMETQFRSGYATLVKMILKHLG---LSDKL----KIKQTWTRNSINNDTEMA 415 (471) T ss_pred cc----ccCccHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---cCCCc----eeEEEeCCCCCCCHHHHH Confidence 22 2544444432 222 234666677777766666643222222 23443 467778766655554333 Q ss_pred HHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCC------ccccC Q lcl|NC_018087. 459 EITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPP------EPEEI 520 (520) Q Consensus 459 Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p------~~e~~ 520 (520) + +++.+. | .+|.+++++.+-..+|.+ ++-++|++|..+..-..| +++|. T Consensus 416 ~-------~~~kl~---g-~iS~et~~~~~p~v~D~~--~E~eri~~E~~~~~~~~~~~~~~~~~~e~ 470 (471) T protein:vir:10 416 Q-------VVSTLA---T-ITSRENVAKSNPIVEDWQ--DELRLQKAEQEGRSEKLYDMEEVEHESEV 470 (471) T ss_pred H-------HHHHHh---c-cCchHHHHHhCCCCCCHH--HHHHHHHHHHHHHHhcccccCCCCCcccc Confidence 3 333442 3 479999998865555422 333444444322211111 22222 No 108 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=96.70 E-value=0.0004 Score=39.22 Aligned_cols=376 Identities=11% Similarity=0.133 Sum_probs=173.0 Q ss_pred hhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchh Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVD 88 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd 88 (520) ++||+++.+.. .+|+....+....... ++.+... + +.. + .-+..+++|-|. T Consensus 1 M~~f~~~~~~~--------------~~~~~~~~~~~~~~~~--~~~~~~~----~--~~~------v-~~~~~~~~~~v~ 51 (386) T protein:vir:48 1 MPIFNITNLAT--------------ESPPISQGGFFDITDP--DFLSTLN----G--SEW------V-SAESALRNSDLF 51 (386) T ss_pred Ccccccccccc--------------cccccccccccccccc--hhccccc----C--Cce------e-chhhhhcchHHH Confidence 66666543321 1233322222222110 1111000 0 000 0 112235789999 Q ss_pred HHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHH----HHhhccccceeEEEeeecC Q lcl|NC_018087. 89 NAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDH----FKRWYVDSRVFFHKIINPN 164 (520) Q Consensus 89 ~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~----fRrWYvDgri~~hkvid~~ 164 (520) .||..|.+.+.-++- .+.+. -.+.++.--|-.-++.++ +..+.+.|.-|+-++-|. T Consensus 52 ~~i~~ia~~ia~~p~-------~~~~~------------~~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~- 111 (386) T protein:vir:48 52 SIINQLSNDLATVKL-------TASRK------------QLQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNE- 111 (386) T ss_pred HHHHHHHHhhccCce-------eeccc------------hhHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECC- Confidence 999999999866422 12211 123344444555566665 445788899999887662 Q ss_pred CCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccccC Q lcl|NC_018087. 165 RPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDC 244 (520) Q Consensus 165 ~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~ 244 (520) + .-+++|.+|+|..+...+.- ++... +|-|.-. +. ..+..+.++.+.|+|.. ..++ T Consensus 112 ~--g~~~~L~~l~~~~v~v~~~~-----~~~~~------~y~~~~~--~~-------~~~~~~~~~~~evih~~--~~~~ 167 (386) T protein:vir:48 112 N--GRDMKWEYLRPSQVSFNRLD-----NKDGI------YYNITFD--DP-------RIPPKQHVPQGDVLHFK--LLSV 167 (386) T ss_pred C--CcEEEEEEecCceeEEEEcC-----CCceE------EEEEEec--Cc-------cccceeEecCccEEEec--CCCC Confidence 2 24899999999999875432 22211 1111110 00 11233577888888774 3456 Q ss_pred CC-CcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccc Q lcl|NC_018087. 245 CG-KNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQ 323 (520) Q Consensus 245 ~~-~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~ 323 (520) ++ ...+|-|..|.+++.....+++...=+----+--+-+.-.+-+ +.+...++ +++..... + ...|. T Consensus 168 ~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~-~~~e~~~~-~~~~~~~~-----~-~n~g~---- 235 (386) T protein:vir:48 168 DGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKGG-GLLDFKTK-LSRSRQAM-----K-QMQGG---- 235 (386) T ss_pred CCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCC-CCHHHHHH-HHHHHHHh-----h-cCCCC---- Confidence 65 3468899999999999888888755443333444455555443 33333332 33222221 1 11222 Q ss_pred cccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHH Q lcl|NC_018087. 324 ANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFD 402 (520) Q Consensus 324 ~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~ 402 (520) .+ .++ .|.++..|.-.....| ++=.++....+.++++||..-|...++. +.+.-..+.|. T Consensus 236 --~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~------~~~e~~~~~~~ 296 (386) T protein:vir:48 236 --PL-VLD----------DLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQGDQ------QSSLEMSLDLY 296 (386) T ss_pred --ce-ecC----------CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc------ccHHHHHHHHH Confidence 11 111 2566766643222222 2333566789999999999888543221 11222223344 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhH Q lcl|NC_018087. 403 KFI-SELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISN 481 (520) Q Consensus 403 KFI-~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~ 481 (520) +++ .-+-..+..-|..-|-+++ +.+ +...+..|.+. +...++.+ +-+-.++. T Consensus 297 ~~~l~P~~~~ie~~l~~~l~~~~---------~~~-----~~~~~~~d~~~--------~~~~~~~l-----~~~g~~t~ 349 (386) T protein:vir:48 297 NKAVSRYLRPFLSELSQKLSCDV---------DAD-----ILPAVDPTGSN--------SVSRINSM-----VKSGTLAQ 349 (386) T ss_pred HHHHHHHHHHHHHHHHHhhcchh---------hcc-----hhhhhccChHH--------HHHHHHHH-----HhCCCcCH Confidence 432 3333333333322221111 000 11111222211 11111111 11334566 Q ss_pred HHHHHHHhCCCH---HHHHHHHHHHHHhhhcCCccCCcccc Q lcl|NC_018087. 482 HTAMKDFLQMSD---EDIAAERKLIDEELSDKIFNPPEPEE 519 (520) Q Consensus 482 ~~i~k~IL~~tD---eeI~~~~kqi~~E~~~~~~~~p~~e~ 519 (520) .-+++ +|.+.. .|+...+..-. ...++ -+++++| T Consensus 350 nE~r~-~lg~~~~~~~~~~~~~~~~~-~~~~g--Gd~~~~~ 386 (386) T protein:vir:48 350 NQGLY-ILQQAEILPKELPEGENPNK-TTLKG--GEINGED 386 (386) T ss_pred HHHHH-HhhcCCCCCccchhhcCCCC-CccCC--CCCCCCC Confidence 66654 344322 22221111000 11111 1455555 No 109 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=96.66 E-value=0.00043 Score=39.02 Aligned_cols=363 Identities=13% Similarity=0.104 Sum_probs=171.2 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRS 80 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ 80 (520) |+ +|. ..... +.+..+.+.+...+. .. ...|+....++. . +. T Consensus 1 Mg--------~~~---~~~~~------~~~~~~~~~~~~~~~---~~----~~~~~~~~~~v~-------~-------~~ 42 (385) T protein:vir:10 1 MG--------LLT---PRNFN------KRKAKNMVYPSNPAF---FT----TTVGGMQLSYVS-------A-------LS 42 (385) T ss_pred Cc--------ccc---chhcc------cccccccccccchhh---hh----hhccccCccccC-------H-------HH Confidence 33 332 11100 011111111111110 00 011222221111 1 12 Q ss_pred HhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHH----hhcccccee Q lcl|NC_018087. 81 LLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFK----RWYVDSRVF 156 (520) Q Consensus 81 ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fR----rWYvDgri~ 156 (520) ..++|-|..||+-|.+.+.-. |+.+. +.. .+.+++-=|-..++.++.+ .+.++|.-| T Consensus 43 al~~~~v~~~i~~ia~~ia~~-----p~~v~--~~~------------~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~ 103 (385) T protein:vir:10 43 ALQNTNVYSVINRIASDVASA-----HFKTE--NTA------------TLNRLESPSSLIGRFSFWQGALMQLCLSGNDY 103 (385) T ss_pred hhccHHHHHHHHHHHHHHhhC-----ceeee--ccc------------hhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeE Confidence 356788999999999987654 23322 110 1223332334445555444 466889999 Q ss_pred EEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEE Q lcl|NC_018087. 157 FHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVY 236 (520) Q Consensus 157 ~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y 236 (520) ..++=| ..++.++||-.+++++. .++.. |+++... .+..+.++.+.|+| T Consensus 104 ~~i~r~-------~~~~~p~~~~~v~~~~~-----~~~~~-------~~~~~~~------------~~~~~~~~~~eiih 152 (385) T protein:vir:10 104 IPLVGQ-------NLEHIPNSDVQINYLPG-----NMGIV-------YTVLESN------------DRPQMVLRQDQMLH 152 (385) T ss_pred EEEEcC-------ceeEeecCCceEEEEEc-----CCceE-------EEEEEcC------------CceEEEEccccEEE Confidence 887622 46788999988877532 11111 2221111 12346789999988 Q ss_pred eeccccc-CCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeec Q lcl|NC_018087. 237 AHSGLVD-CCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDA 315 (520) Q Consensus 237 ~hSGL~d-~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~ 315 (520) +.---.+ -++-..+|.|..|.++++....+++...-+----+--+-+..++.+-..+. +.+-+++-+++.... . T Consensus 153 ik~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e-~~~~~~~~~~~~~~~----~ 227 (385) T protein:vir:10 153 FRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGK-DLESAREEFEKANTG----D 227 (385) T ss_pred eccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHH-HHHHHHHHHHHHhCc----c Confidence 7511111 122245789999999999999998876655555556677777774444443 333444444443220 1 Q ss_pred CCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHH-HHHHHHHHHhcCCChhhccCCCccccccccch Q lcl|NC_018087. 316 RTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDI-LYFRKALYMALRVPLSRIPDEQTQNVFDMSTA 393 (520) Q Consensus 316 ~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV-~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~e 393 (520) ..|. .+ .++ .|.+++.|.-...-.+ +.+. +|-.+.+.++++||..-|....+.+. ..+. T Consensus 228 n~~~------~~-vl~----------~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~--~~sn 288 (385) T protein:vir:10 228 NSGR------LM-VLP----------DGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTES--QHSN 288 (385) T ss_pred ccCC------cc-ccC----------CCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCc--cccc Confidence 2222 22 221 2567777753322223 2233 55578899999999999864322111 1122 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_018087. 394 ISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEP 473 (520) Q Consensus 394 ItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p 473 (520) +.....-|. ..|+--+.. +.+.|-..|+ ++ .|+|++. ++...+ +..|.+.++.+-. T Consensus 289 ~eq~~~~~~---~~l~P~~~~-ie~~l~~~l~-----~~--------~~~f~~~------~ll~~d-~~~~~~~~~~~~~ 344 (385) T protein:vir:10 289 IDQIKATYL---ANLNSYVNP-IVDELRLKMN-----AP--------DLELDIK------DMLDVD-DSALINQVSNLAK 344 (385) T ss_pred HHHHHHHHH---HHHHHHHHH-HHHHHHHhhC-----Cc--------eEEeech------hhhccC-HHHHHHHHHHHHh Confidence 322222333 345433332 3333333332 21 2443322 333333 2456666655432 Q ss_pred ccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 474 YIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 474 ~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) - -+++.+-++. ++++.. +|+.+..+. T Consensus 345 ~--G~~T~NE~R~-~~g~~p------------------~p~~~~~~~ 370 (385) T protein:vir:10 345 S--GVLGAEQAQF-ILTRSG------------------FLPDNLPEF 370 (385) T ss_pred C--CCcCHHHHHH-HhCCCc------------------cCCCCCccc Confidence 2 2556666663 344422 111111111 No 110 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=96.59 E-value=0.00049 Score=38.75 Aligned_cols=384 Identities=14% Similarity=0.123 Sum_probs=177.5 Q ss_pred ecccccccccccccccccccccc-hhHHHHHHHHHHHhhccchhHHH--------------------HhhhceeeEecCC Q lcl|NC_018087. 46 VDSQDIAYNGVFQKLYGSQDPTA-TSTRELINTYRSLLNNYEVDNAV--------------------QEIVSDAIVYEEG 104 (520) Q Consensus 46 i~~~~~a~~g~~~~~~~~~~~~~-~~~~~LI~~YR~ma~~pEvd~Ai--------------------~eIvneaiv~d~~ 104 (520) +. ...-.++ +.-..-+.+|+.+..+++-+..| .-||+-.+-+= - T Consensus 1 l~--------------~~~l~~~i~~~~~~~~r~~~l~~yy~g~~~il~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l-~ 65 (429) T protein:vir:98 1 MT--------------KDLLSELIQKHRSFNLSYSAYKQLYEGDHAILQQKQKEQYKPDNRLVVNFAKYIVDTFNGYF-I 65 (429) T ss_pred CC--------------HHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccCCCcceeecchHHHHHHHHhhhh-c Confidence 00 0000001 01112234555555554444321 11111111110 1 Q ss_pred CcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceeee Q lcl|NC_018087. 105 FDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFV 184 (520) Q Consensus 105 ~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~v 184 (520) ..||++..++. ...+.++.+.+-.+|+....++++.+.+-|+-|.+..+| ++|-..++.+||+.+..+ T Consensus 66 g~~~~~~~~~~--------~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d----~~g~~~~~~~~p~~~~~v 133 (429) T protein:vir:98 66 GVPVQTSHENK--------QVSNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFND----ENAEAGITYLTPLEAFIV 133 (429) T ss_pred ccCceeecCCh--------HHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEec----CCCcEEEEEEcccceEEE Confidence 25556555543 233455566666788999999999999999988887665 247889999999999887 Q ss_pred eeccCC--CCcccccc--c-ceecceeecCcc-cccccccceecCCc----ce-ecCcccEEEeecccccCCCCcchhhh Q lcl|NC_018087. 185 RELDTK--MENGVKVV--K-GYREYFLYDTEL-ESYQCGHQHFAAGT----KI-KIPYSAMVYAHSGLVDCCGKNIIGYL 253 (520) Q Consensus 185 r~i~~~--~~~~~~~~--~-~~~ey~~y~~~~-~~~~~~~~~~~~~~----~~-~I~~~aI~y~hSGL~d~~~~~~~syL 253 (520) .+-... ..-.++.. . ....+.+|.... ..+..+...+.... ++ ++| |+++ +|+....|-+ T Consensus 134 ~dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP---vv~~------~n~~~g~sd~ 204 (429) T protein:vir:98 134 YDDSIRQKPLFAVRYFYNKGGVLEGSYSDASNITYFKDGEKGIEIGESEPHPFDGVP---MIEY------VENEERQSLL 204 (429) T ss_pred EeCCCCCceEEEEEEEEecCceEEEEEEeCceEEEEEecCCceEecccccccCCccc---eEEe------cCCCCCCCcH Confidence 542211 11112211 1 111122222221 11111111111000 00 122 2222 2233345656 Q ss_pred HHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhh Q lcl|NC_018087. 254 HRAVKPANQLK-LLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTED 332 (520) Q Consensus 254 ~~aik~~NqL~-m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlED 332 (520) +..+.....+. ++-+....-+..+.|.+-+.-.+. +. +-+++++. +++. T Consensus 205 e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~~~---~~----~~~~~~~~---~~~~-------------------- 254 (429) T protein:vir:98 205 ASVVTLINAFNKAISEKANDVEYFADAYLKILGAEL---DD----ETLKSLRD---TRII-------------------- 254 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCC---Cc----chhhhHhh---Ccee-------------------- Confidence 66555555543 344555556777777776643222 11 11111111 1111 Q ss_pred hcccccCCCCCcceeecCCCCCcChHHH-HHHHHHHHHHhcCCChhhccCCCccccccccchhhH--HHHHHHHHHHHHH Q lcl|NC_018087. 333 YWLQRRDGKAVTEVETLPGMTGMNEMDD-ILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISR--DELSFDKFISELQ 409 (520) Q Consensus 333 ywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItR--DElkF~KFI~rLr 409 (520) .+|- +||.+..+..|--..+.+.... ++-+.+.+|+...+|- +..++ ||..+.... -+.....-+.+.| T Consensus 255 -~~~~-~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~--~~~~~----~gn~Sg~Al~~~~~~l~~k~~~~~ 326 (429) T protein:vir:98 255 -NLKD-TDAQQLTVEFLQKPDADATQEHLLDRLENLIFRTAMVAN--ISDES----FGTASGIALRYRLQAMDNLAKTKE 326 (429) T ss_pred -eccC-CCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCccc--cCccc----cccchHHHHHHHHHHHHHHHHHHH Confidence 1222 3345556666644445554443 5788888899999993 33332 243333332 2333444556666 Q ss_pred HHHHHHHHHHHHHHHHhcCCCC-hhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHH Q lcl|NC_018087. 410 HKFEEIFLSPLKSNLLLKRVIT-EDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDF 488 (520) Q Consensus 410 ~rFs~if~d~Lk~QLiLkgi~t-~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~I 488 (520) +.|..-+..+++.=+-+-++.. ..+| ..|.+.|.....-.+.. -+++++++. | .+|.+++++. T Consensus 327 ~~~~~~l~~~~~li~~~~~~~~~~~d~----~~i~v~f~~~~p~~~~~-------~a~~~~kl~---g-~is~et~~~~- 390 (429) T protein:vir:98 327 RKFMSGMNRRYKLIASYPTSKIGPKDW----IGIKYKFTRNLPANLLE-------ESQIAGNLA---G-IVSEETQVGV- 390 (429) T ss_pred HHHHHHHHHHHHHHHHHhccCCCcccc----ccceEEeCCCCCcCHHH-------HHHHHHHHh---c-cCchHHHHHh- Confidence 6666666555544333323222 2222 24778887555444432 344555553 3 4799999977 Q ss_pred hCCCHHHHHHHHHHHHHhhh-------cCCccCCccccC Q lcl|NC_018087. 489 LQMSDEDIAAERKLIDEELS-------DKIFNPPEPEEI 520 (520) Q Consensus 489 L~~tDeeI~~~~kqi~~E~~-------~~~~~~p~~e~~ 520 (520) |...++ -+++-++|++|.. ..+..+.++... T Consensus 391 l~~v~d-~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 428 (429) T protein:vir:98 391 LSIVEN-PQKEIERKNSDKSTLISRQAGGLNGQNTTTIL 428 (429) T ss_pred CCCCCC-HHHHHHHHHHHHHHHHHHHHhhhcCCCCCCCC Confidence 565431 1222333333332 222222222222 No 111 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=96.55 E-value=0.00052 Score=38.60 Aligned_cols=424 Identities=12% Similarity=0.098 Sum_probs=170.5 Q ss_pred Cc----cccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHH Q lcl|NC_018087. 1 MS----MLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELIN 76 (520) Q Consensus 1 ~~----~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~ 76 (520) .. .-|--++.|-.... .+ . |.-.|.....+. +.-......+....+|.++.. ==+- T Consensus 54 ~~~~~~~~~~~~~~~~~~~~-~~---------~---~~~~~~~~~~~~-~~~~~~~~~~~~l~~~~~~~F------~Gy~ 113 (694) T protein:vir:10 54 LDAAPVAEPSPSLRLARQFE-VD---------V---SNYTPRERRAAS-YALDFNGTSMDALSFVTSSGF------PGFP 113 (694) T ss_pred hcccccCCCCcchhhhhhcc-cc---------c---cCCCccccchhh-hhhccCcccccchhhhhccCc------chHH Confidence 00 00111222211110 00 0 111111111110 110000000000011111110 0122 Q ss_pred HHHHHhhccchhHHHHhhhceeeEe------cCCCcE----EEEeeccchhhh-HHHHHHHHHHHHHHHHhcchhhhHHH Q lcl|NC_018087. 77 TYRSLLNNYEVDNAVQEIVSDAIVY------EEGFDV----VSIDLDQTAFTE-NIRNLISDEFNSVLNMLNFQRKGSDH 145 (520) Q Consensus 77 ~YR~ma~~pEvd~Ai~eIvneaiv~------d~~~~~----V~l~Ld~~~~s~-~ik~~I~eeF~~i~~ll~f~k~g~~~ 145 (520) .--.|||+||.+.+++-|..||+-. ....+. +++.-+...-++ .-.++|..|++.+ +...+..+. T Consensus 114 ~la~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl----~V~~~l~ea 189 (694) T protein:vir:10 114 TLVLLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERL----RIRDAVRTT 189 (694) T ss_pred HHHHHhhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHH----HHHHHHHHH Confidence 3356899999999999999999543 222211 222222222222 2334666666553 222333332 Q ss_pred HHhhccccc--eeEEEeeec------------CCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcc Q lcl|NC_018087. 146 FKRWYVDSR--VFFHKIINP------------NRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTEL 211 (520) Q Consensus 146 fRrWYvDgr--i~~hkvid~------------~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~ 211 (520) ++-=-+-|. +|+..-=|. +-+|.+++.|+.|||..+.+- ....++-+.+ ..|.|+ T Consensus 190 ik~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~---------~~n~~dP~sp-dfgkP~- 258 (694) T protein:vir:10 190 VIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPN---------NYNSINPVAD-DFYKPS- 258 (694) T ss_pred HHhhccccceEEEEEeecCccccccccccccccccCcceeeeEeecccccccc---------hhhhccchhh-ccCCCc- Confidence 222122232 233322221 123456777999998766661 1111111111 111111 Q ss_pred cccccccceecCCcceecCcccEEEeecccccCC------CCcchhhhHHHHHHHHH-HHHHHHHHHHHHHhcCccceEE Q lcl|NC_018087. 212 ESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDCC------GKNIIGYLHRAVKPANQ-LKLLEDAMMIYRITRAPDRRVF 284 (520) Q Consensus 212 ~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~~------~~~~~syL~~aik~~Nq-L~m~EDalVIyRi~RApeRRvF 284 (520) .|... +.+||.+=++... |---|+ ++..+|.+..+..-..+ +++...+.=+- +.+.-+ ++ T Consensus 259 -~y~V~--------G~~IH~SRL~~f~-g~plPd~LKp~y~~~G~Sv~q~~~e~V~~~~rT~~~v~~Li--~~~~v~-~l 325 (694) T protein:vir:10 259 -TWWMI--------GTEVHATRLHTIV-SRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIV--KQFSVS-GI 325 (694) T ss_pred -eEEEe--------ceEEeeeeEEEec-CCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHH--HhhhhH-HH Confidence 11111 1144444433222 211111 33345655555543332 33333332221 111111 01 Q ss_pred EccCC-CCchHHHHHHH--HHHHHhhcceeEeecCCCccccccccch-hhhhhcccccCCCCCcceeecCCCCCcChHHH Q lcl|NC_018087. 285 YIDTG-NMPARKAAQHM--QHIMNSHRNRISYDARTGKVKNQANMMA-LTEDYWLQRRDGKAVTEVETLPGMTGMNEMDD 360 (520) Q Consensus 285 yIDvG-nlpk~KAeqyl--~~im~~~knklvYd~~TGev~d~~~~ms-mlEDywLpRReGgrgTEIsTLpGg~nLgei~D 360 (520) -.|.. -|.....++.. -+++++||.-. |-+ .+. -.|||- .++ .+|+-++| T Consensus 326 k~dla~~L~~g~~~~l~~R~eli~~~Rsn~------G~~-----llDk~~Eefe----------q~s-----tslSGLdd 379 (694) T protein:vir:10 326 LMDLAQALMPGANVDLSMRAELINRYRDNR------NIL-----FLDKATEEFF----------QFN-----TPLSGLDA 379 (694) T ss_pred HHHHHHhhcChhHHHHHHHHHHHHHhcCcc------ceE-----EEecCCcceE----------EEe-----cccCCHHH Confidence 11211 01111122222 25566776311 110 000 013443 222 47888999 Q ss_pred HH-HHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHhcCCCChhh Q lcl|NC_018087. 361 IL-YFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKS-----NLLLKRVITEDE 434 (520) Q Consensus 361 V~-YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~-----QLiLkgi~t~ee 434 (520) |. =|..-+=-+.+||+.||=..+.-++ ..++| -|.-.|...|..+|. ..+..+|++ |+-+-|.+. T Consensus 380 Vi~qf~q~VAgaa~IPltkLfGqSPkGl-NATGE--~D~rnYYD~I~s~Qe---~~L~p~L~rl~~ii~rS~~G~id--- 450 (694) T protein:vir:10 380 LQAQAQEQMSAVSHIPLIKLLGITPTGL-NASSE--GEIRVWYDYVRAYQR---NALQQLMNDVIVMIQLSLFGAVD--- 450 (694) T ss_pred HHHHHHHHHHhhhcCchhhhhccCcccc-cccch--hhHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHhcCCCC--- Confidence 75 4888888899999999865553333 12221 244558888887775 334444433 444445443 Q ss_pred HHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccC Q lcl|NC_018087. 435 WEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNP 514 (520) Q Consensus 435 w~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~ 514 (520) ..|.|+|+.=..-+|..-+||...+.+.....-.- --++.+-|+..+ .+|.+=-= -..++ +..+ ... T Consensus 451 -----p~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~--gvI~~~evr~rL--~~d~~s~Y-~~~~D-~~d~--p~~ 517 (694) T protein:vir:10 451 -----PSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQE--QVIRPDQVAARL--NTEPDGPY-AGKLD-ANDD--PGV 517 (694) T ss_pred -----CcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHh--cCCCHHHHHHHH--hcCCCccc-ccccc-cccC--CCc Confidence 45889999888888888899988888764443111 012333333221 00000000 00000 1111 112 Q ss_pred CccccC Q lcl|NC_018087. 515 PEPEEI 520 (520) Q Consensus 515 p~~e~~ 520 (520) |.++|| T Consensus 518 ~~~~~~ 523 (694) T protein:vir:10 518 PADDDI 523 (694) T ss_pred Cccchh Confidence 333344 No 112 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=96.55 E-value=0.00052 Score=38.59 Aligned_cols=399 Identities=13% Similarity=0.128 Sum_probs=190.0 Q ss_pred cc---------cccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccc-cccccccccchhHH Q lcl|NC_018087. 3 ML---------ADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQ-KLYGSQDPTATSTR 72 (520) Q Consensus 3 ~~---------~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~-~~~~~~~~~~~~~~ 72 (520) |+ ++-.+.+|....+..+. .+++.|-..+ .+. ..|.+. +.++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~lf~~~~~----------~~~~~~~~~~---~~~-----~~~~~~~~~~vs--------- 53 (424) T protein:vir:45 1 MLYCWWAHWLWPEGGRVLLDALFRSKSL----------ENPSTPITGD---AVD-----TDGLFRADVYVS--------- 53 (424) T ss_pred CeeEeeeceecCcchhHHHHhhccccCC----------CCCccccchh---hhh-----hhccccCCceec--------- Confidence 32 23334444433332221 1222221111 111 011111 11111 Q ss_pred HHHHHHHHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHH-hcchhhhHHHH----H Q lcl|NC_018087. 73 ELINTYRSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNM-LNFQRKGSDHF----K 147 (520) Q Consensus 73 ~LI~~YR~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~l-l~f~k~g~~~f----R 147 (520) -+..+++|-|..||+-|.+.+.-.+ +.|-=+.. ...+........++|+. =|-..+++++. . T Consensus 54 -----~~~al~~~~v~~cv~~Ia~~iA~lp-----~~v~~~~~---~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~ 120 (424) T protein:vir:45 54 -----PETAMKLAAVYSCIYVLSSSLAQMP-----LHVMRRHK---GKVEPARDHPAFYLVHDEPNTWQTSYKWRELKQR 120 (424) T ss_pred -----hHHhhccHHHHHHHHHHHHHHhhCc-----eEEEEecC---CceeecccchHHHHHHhhcccCCCHHHHHHHHHH Confidence 1334578899999999999876442 22211110 11111111112222221 22234555543 3 Q ss_pred hhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcce Q lcl|NC_018087. 148 RWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKI 227 (520) Q Consensus 148 rWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~ 227 (520) .+.+.|.-|..++-| ...-+++|.+|+|..+...+. ++.. -|.++++ ++.. T Consensus 121 ~lll~Gna~~~i~r~---~~G~~~~L~~l~~~~v~i~~~------~~~~------~y~~~~~--------------~~~~ 171 (424) T protein:vir:45 121 HILGWGNGYTWVKRN---RRGEVISLDCCMPWETTLMNT------GGRY------TYGLYNE--------------YGAF 171 (424) T ss_pred HHhhcCCeEEEEEEc---CCCcEEEEEEecCceEEEEEc------CCeE------EEEEEec--------------CceE Confidence 467789999887754 223489999999998876421 1111 1222211 1234 Q ss_pred ecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhh Q lcl|NC_018087. 228 KIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSH 307 (520) Q Consensus 228 ~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~ 307 (520) +++++.|+|.. + .++++...+|-++.|...+.....+++...=+----|--+-|+.++. .|.+.++++.-+.+-..| T Consensus 172 ~~~~~eVih~r-~-~~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~ 248 (424) T protein:vir:45 172 AISPDDMIHIR-A-LGNNQKMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKS-GLNKESWGWLKDQWQKAS 248 (424) T ss_pred EECcccEEEec-C-cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC-CCCHHHHHHHHHHHHHHh Confidence 68888888875 2 56778788999999999999988888876654444455567777774 466655544434333333 Q ss_pred cceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccc Q lcl|NC_018087. 308 RNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQN 386 (520) Q Consensus 308 knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~ 386 (520) +- ..+ +.|. .+ .+ + .|.++..|.=...-.| ++-.++-.+.+.++++||.+.|....+. T Consensus 249 ~g--~~~-n~g~------~~-vl--------~--~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~- 307 (424) T protein:vir:45 249 QA--LRR-QENK------TM-LL--------P--ADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKA- 307 (424) T ss_pred cc--ccc-cCCc------ee-Ec--------C--CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC- Confidence 32 100 1121 11 11 1 2455555532222222 4445578889999999999998643221 Q ss_pred cccccchhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHH Q lcl|NC_018087. 387 VFDMSTAISRDELSFDKF-ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRV 465 (520) Q Consensus 387 ~~G~~~eItRDElkF~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~ 465 (520) .+ +.+.-.-+.|..+ +.-+-.++..- | =+.+.+..++. ....+.|.-+.. ... -+..|. T Consensus 308 t~---sn~eq~~~~f~~~tL~P~~~~ie~~----l-----n~kLl~~~e~~---~g~~i~fd~~~l----lr~-d~~~r~ 367 (424) T protein:vir:45 308 TF---SNISAQAIQFVRYTMMPWVTNWEQE----L-----NRRLFTRAELA---AGYYVRFNLTGL----LRG-TPQERA 367 (424) T ss_pred Cc---ccHHHHHHHHHHHHHHHHHHHHHHH----H-----HHhcCChhhhc---CCcEEEeechhh----hcc-CHHHHH Confidence 11 1122222234433 22222222221 2 22344555443 223455543332 111 245666 Q ss_pred HHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHH--------HHhhhcCCccCCcccc Q lcl|NC_018087. 466 NVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLI--------DEELSDKIFNPPEPEE 519 (520) Q Consensus 466 ~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi--------~~E~~~~~~~~p~~e~ 519 (520) +.++.+-.- -+++.+-++. ++.+.+-+ .-++-+ ..+...+.-.+++++| T Consensus 368 ~~~~~~~~~--g~~T~NE~R~-~~gl~pi~--ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~ 424 (424) T protein:vir:45 368 QFYHFAITD--GWMSRNEARA-FEDMNPVE--GLDEMLVSVNAANPAGDFKPPKNDEGKTNE 424 (424) T ss_pred HHHHHHHhC--CCcCHHHHHH-HhCCCCCC--CcceeeecccccccccccCCCCCCCCCCCC Confidence 766665432 3678888875 47775421 111110 1111111122233333 No 113 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=96.54 E-value=0.00053 Score=38.55 Aligned_cols=400 Identities=10% Similarity=0.104 Sum_probs=180.1 Q ss_pred hhhhc-chhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccch Q lcl|NC_018087. 9 LKMFA-FWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEV 87 (520) Q Consensus 9 l~~f~-~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEv 87 (520) ++||. +|.+..... +.. .....++..+-.+++ .+..+.......+.. +..+++|-| T Consensus 1 MG~f~~lf~~~~~~~-~~~-~~~~~~~~~~~~~~~-------------~~~~~g~~~~~~v~~--------~~al~~~~v 57 (422) T protein:vir:13 1 MGFLRGLFNKKNNND-EKR-SNYDEDIGIDISDSN-------------FWEKFGIKLNFSVRG--------KRALKENTV 57 (422) T ss_pred CchhhhhhhccCCcc-chh-hhhhhccccccCcch-------------hhhhccccCCcccch--------hhhhccHHH Confidence 44553 333322210 000 000111111111111 111111111111111 122467889 Q ss_pred hHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhc----chhhhHH----HHHhhccccceeEEE Q lcl|NC_018087. 88 DNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLN----FQRKGSD----HFKRWYVDSRVFFHK 159 (520) Q Consensus 88 d~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~----f~k~g~~----~fRrWYvDgri~~hk 159 (520) ..||+-|.+.+.-. |+.+-=+... +++ ..+..+|+ =..++.+ ++..+.+.|.-|..+ T Consensus 58 ~~ci~~ia~~iA~l-----p~~~~~~~~~----~~~------~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i 122 (422) T protein:vir:13 58 YVCTKIRAESIGKL-----SLKIYKDKEE----YKE------HELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYI 122 (422) T ss_pred HHHHHHHHHhhhhC-----ceEEEecCcc----ccc------chHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEE Confidence 99999888876543 2222111111 110 12233332 2223334 444578889999999 Q ss_pred eeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeec Q lcl|NC_018087. 160 IINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHS 239 (520) Q Consensus 160 vid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hS 239 (520) +-|. + .-+++|.+|+|..+..+.+-. +.....+ ..+|.|... .+...+++++.|+|.. T Consensus 123 ~r~~-~--G~~~~L~~i~~~~v~~~~~~~-----~~~~~~~-~~~y~~~~~------------~g~~~~~~~~eiih~~- 180 (422) T protein:vir:13 123 ERDR-K--GKIIGLYPINSDNVTKIIDDD-----NFLSSLS-KVWYVVTDK------------NGKEHKLLPDEMLHFI- 180 (422) T ss_pred EECC-C--CcEEEEEEECCcceEEEEcCC-----cceeccc-eEEEEEEeC------------CCeEEEEcccceEEEc- Confidence 8763 2 249999999999999865422 2111111 113333221 1223578899998885 Q ss_pred ccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCc Q lcl|NC_018087. 240 GLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGK 319 (520) Q Consensus 240 GL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGe 319 (520) +-...++....|-+..|.+++.....+++...=+----+.-+-+...+ ++|-+..+++..+.+...|.- T Consensus 181 ~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g---------- 249 (422) T protein:vir:13 181 GDITLDGLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYV-GDLDEKAKKIFKKEFESMSNG---------- 249 (422) T ss_pred CCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeC-CCCCHHHHHHHHHHHHHHhcC---------- Confidence 224556666789999999998888888776554333335566677776 466665555544444433321 Q ss_pred cccccccchhhhhhcccccCCCCCcceeecCCC-CCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHH Q lcl|NC_018087. 320 VKNQANMMALTEDYWLQRRDGKAVTEVETLPGM-TGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDE 398 (520) Q Consensus 320 v~d~~~~msmlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDE 398 (520) ..+..+.+ .++ + |++++.|.=. ..+.-++-.++....+.++++||.+-|....+.. + +.+.-.- T Consensus 250 ~~n~~~~~-vl~-------~---g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~-~---sn~e~~~ 314 (422) T protein:vir:13 250 LENAHSIS-LLP-------F---GYQFQPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERAT-F---NNLTEQQ 314 (422) T ss_pred ccccCCce-ecC-------C---CceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC-c---ccHHHHH Confidence 11111222 121 2 3444444211 1122244556778899999999998886433211 1 1122222 Q ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccch Q lcl|NC_018087. 399 LSFDKF-ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGK 477 (520) Q Consensus 399 lkF~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgk 477 (520) ..|..+ +.-+-.++..- +-..++++.+.. ....+.|..+. +...+ +..|.++++.+-.- - T Consensus 315 ~~f~~~~l~P~~~~ie~~---------l~~~Ll~~~~~~---~g~~i~fd~~~----l~r~d-~~~~~~~~~~~~~~--G 375 (422) T protein:vir:13 315 KDFYVTTLQSSLTVYEQE---------IQDKLFSQYETL---QDVKAEFNVDT----ILRSD-IKTRYEAYRIGIQG--G 375 (422) T ss_pred HHHHHHHHHHHHHHHHHH---------HHHhhCChhhhc---CCceEEeechh----hhcCC-HHHHHHHHHHHHhC--C Confidence 234322 22222222221 222334443321 22344454222 21111 24455555544322 2 Q ss_pred hhhHHHHHHHHhCCCHHHHHHHHH-----------HHHHhhhcCCccCCccc Q lcl|NC_018087. 478 YISNHTAMKDFLQMSDEDIAAERK-----------LIDEELSDKIFNPPEPE 518 (520) Q Consensus 478 y~S~~~i~k~IL~~tDeeI~~~~k-----------qi~~E~~~~~~~~p~~e 518 (520) ++|.+-++. .|++.+-| .-++ .+.+..+.+= +...+ T Consensus 376 ~~T~NE~R~-~~gl~p~~--ggD~~~~~~n~~~l~~~~~~~~~~g--~~~g~ 422 (422) T protein:vir:13 376 FIEANEARR-RENLPPVE--GGDRLLVNGNMIPIEMAGEQYKKGG--EKGGK 422 (422) T ss_pred CcCHHHHHH-HhCCCCCC--CcCeeeeccCccchhhcccccccCC--CcCCC Confidence 567777774 46665421 0010 0000000000 11111 No 114 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=96.52 E-value=0.00055 Score=38.46 Aligned_cols=415 Identities=13% Similarity=0.138 Sum_probs=176.0 Q ss_pred hhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeeccccccccccccc-ccccccccch---------hHHHHHHHH Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQK-LYGSQDPTAT---------STRELINTY 78 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~-~~~~~~~~~~---------~~~~LI~~Y 78 (520) +.|...-.++.. ..+ -.|-...+.|+++.. -+.. +..+.+.... .-.+.+.+| T Consensus 1 ~~~~~~~~~~~~----~~~------------~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~ 63 (492) T protein:vir:94 1 MQFIQLISQVAQ----ALI------------KGGNILYPSQPTQTE-IFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEI 63 (492) T ss_pred ChHHHHHHHHHH----HHh------------cCCceeecCccchhh-hhhcccccCCchhhHHHHHHHHHHHHHHHHHHH Confidence 112111111111 111 123344444444432 2221 1111111110 011223455 Q ss_pred HHHhhccchhHHH---------------------------HhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHH Q lcl|NC_018087. 79 RSLLNNYEVDNAV---------------------------QEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNS 131 (520) Q Consensus 79 R~ma~~pEvd~Ai---------------------------~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~ 131 (520) +.+..+++-+..| ..||+-.+-+ --..||++..++.+..+.+ +. T Consensus 64 ~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~y-l~G~p~~~~~~d~~~~~~l--------~~ 134 (492) T protein:vir:94 64 SIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY-IVGKPIAFKHTDDEVVKRI--------DE 134 (492) T ss_pred HHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHHhh-hcccCceeccCchHHHHHH--------HH Confidence 5555555443211 1122211111 1236667766665444432 22 Q ss_pred HHHHhcchhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeec--cCCCCccccccc--ceecceee Q lcl|NC_018087. 132 VLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVREL--DTKMENGVKVVK--GYREYFLY 207 (520) Q Consensus 132 i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i--~~~~~~~~~~~~--~~~ey~~y 207 (520) +++ =+|+....++++..++-|+-|.+.-+|. +|-..++.+||+.+-.+.+- ..+..-+++.+. ....+.+| T Consensus 135 ~~~-n~~~~~~~~~~~~a~~~G~a~~~v~~d~----dg~~~~~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~~y 209 (492) T protein:vir:94 135 VLG-NRFDDKLHSVLTGASNKGIEWLHPYLDE----EGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYW 209 (492) T ss_pred HHh-ccHHHHHHHHHHHHhhCCeEEEEEEecC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEeeccceeEEEE Confidence 332 2677888889999999999999877662 36678999999998887541 111112222111 00111122 Q ss_pred cCc-cccccc--ccc-------------eecCCcceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHH Q lcl|NC_018087. 208 DTE-LESYQC--GHQ-------------HFAAGTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLK-LLEDAM 270 (520) Q Consensus 208 ~~~-~~~~~~--~~~-------------~~~~~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~-m~EDal 270 (520) ++. ...|.. +.. ...++.-=+|| |+++- |+....|=++..+....-+. ++=+.. T Consensus 210 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP---vv~~~------nn~~~~sd~e~v~~liDa~d~~~S~~~ 280 (492) T protein:vir:94 210 DKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP---FIPFK------NNDLEISDIFMYKTLIDAYNRRLSDLS 280 (492) T ss_pred ecCeEEEEEEecCeeeeccccccccccccccccCCCccc---eEEec------CCCCCCCchHHHHHHHHHHHHHHHHHH Confidence 221 111100 000 00001000122 22221 12223454544443333333 234444 Q ss_pred HHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecC Q lcl|NC_018087. 271 MIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLP 350 (520) Q Consensus 271 VIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLp 350 (520) ..-+.+..|-+-+.-.+..+.+. .+..+..++..--++ |..+.+|- T Consensus 281 ~~~~~~~~p~lv~~g~~~~~~~~--------------------------------~~~~~~~~~~~~~~~--~~~~~~l~ 326 (492) T protein:vir:94 281 NTFKDSNELTYVLKNYDDQELPE--------------------------------FKRLLRYYGAIKVSD--NGGVDTIQ 326 (492) T ss_pred HHHHHhcCceeeeecCCcccchh--------------------------------hHHHHhhccceecCC--CCcceeEe Confidence 44566666655443332222111 111111122211122 22355554 Q ss_pred CCCCcChH-HHHHHHHHHHHHhcCCChhhccCCCccccccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_018087. 351 GMTGMNEM-DDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTA--ISRDELSFDKFISELQHKFEEIFLSPLKSNLLLK 427 (520) Q Consensus 351 Gg~nLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~e--ItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLk 427 (520) ...+.+.+ .-+.-..+.+|+-.++|- +.++. +. |..+. |..-+.....-+.+.++.|..-+..+++.=+-+- T Consensus 327 ~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~-~~--~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~ 401 (492) T protein:vir:94 327 VEVPVENSKKYLDELYQKIMLFGQAVD--FSSDK-FG--SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHF 401 (492) T ss_pred ccCCHHHHHHHHHHHHHHHHHHhCCcC--CCccc-cc--cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 44444433 334667778888999985 22222 11 22222 2222333445566777777776666666433333 Q ss_pred CCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCH---HHHHHHHHHHH Q lcl|NC_018087. 428 RVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSD---EDIAAERKLID 504 (520) Q Consensus 428 gi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tD---eeI~~~~kqi~ 504 (520) |+ ..+|. .|.+.|....--.+ .+.++++..+. | .+|.+++++. |..++ +|++.+.++.+ T Consensus 402 ~~--~~~~~----~i~v~f~~~~p~~~-------~e~~~~~~kl~---g-iiS~et~~~~-l~~v~d~~~E~eri~~E~~ 463 (492) T protein:vir:94 402 DI--KGEHK----DVDISFNYNKVANT-------ELQVQTAQQSM---G-IVSHETVLEN-HPFVEDLQAELERIEQEQM 463 (492) T ss_pred cC--Ccccc----eeeEEecCCCCCCH-------HHHHHHHHHHh---c-cCchHHHHHh-CCCCCCHHHHHHHHHHHHH Confidence 32 33444 46677754444333 23345555554 4 3799999976 55544 45544444433 Q ss_pred HhhhcCC-cc------CCccccC Q lcl|NC_018087. 505 EELSDKI-FN------PPEPEEI 520 (520) Q Consensus 505 ~E~~~~~-~~------~p~~e~~ 520 (520) +..++.- +. .|++++= T Consensus 464 ~~~~~~~~~~~~~~~~~~~~~~~ 486 (492) T protein:vir:94 464 EYNKQLPNLDDGGADSAQQQERS 486 (492) T ss_pred HHHhhccccccccCCCCccccCC Confidence 3322210 00 0111110 No 115 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=96.49 E-value=0.00058 Score=38.33 Aligned_cols=408 Identities=13% Similarity=0.095 Sum_probs=168.3 Q ss_pred chhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccch Q lcl|NC_018087. 8 DLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEV 87 (520) Q Consensus 8 ~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEv 87 (520) ...+=-+|.+.--.|+-..+ -+....+ ... .+.+ +..-..-+.+|+.+..+++- T Consensus 1 ~~~~~~~~~~~~~~e~~~~~-------~~~~~~~-~~~-----------i~~~-------i~~~~~~~~~~~~~~~yY~g 54 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQI-------KPKYETQ-EEM-----------ILRL-------VREHKENIDNITMGERYYNH 54 (478) T ss_pred CccccCCCCchhHHHHHHHH-------hhccCCc-HHH-----------HHHH-------HHHHHHHHHHHHHHHHHhcC Confidence 00000112222111111111 0000000 000 0000 01111122333333333332 Q ss_pred hHH---------------------------HHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchh Q lcl|NC_018087. 88 DNA---------------------------VQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQR 140 (520) Q Consensus 88 d~A---------------------------i~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k 140 (520) ... ...||+-.+-+ --..||++..++.+..+ .|.+ +++ -+|+. T Consensus 55 ~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~-l~g~~~~~~~~~d~~~~----~l~~----~~~-n~~~~ 124 (478) T protein:vir:10 55 HPDILDAPPKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAY-AVANPVTFGVDNDKALK----QIQH----TLN-HKWDD 124 (478) T ss_pred CCchhccccccccccccccccccceeccchHHHHHHHHHhh-hccCCeeeecCChHHHH----HHHH----HHh-cCHHH Confidence 211 12222221111 12366777666553322 3333 333 37888 Q ss_pred hhHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeecc--CCCCcccccc--cceecceeecCc-ccccc Q lcl|NC_018087. 141 KGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELD--TKMENGVKVV--KGYREYFLYDTE-LESYQ 215 (520) Q Consensus 141 ~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~--~~~~~~~~~~--~~~~ey~~y~~~-~~~~~ 215 (520) ...++.+.+++-|+-|++.-+|. +|-..+..+||+.+.++..-. .+..-.++.+ .+...+.+|.+. ...|. T Consensus 125 ~~~~~~~~~~~~G~~~~~~~~d~----~g~~~~~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~y~~~~i~~~~ 200 (478) T protein:vir:10 125 KLVDILTAASNKGIEWVQPYVDE----EGEFKTFRVPAEQAVPIWTNKERDELQAFIRVYELDGAERVEYWTKDDVTYYE 200 (478) T ss_pred HHHHHHHHHHhcCeEEEEEEecC----CCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEE Confidence 88999999999999998877663 367789999999998874421 1112222211 111223333321 11111 Q ss_pred cccc---------eec----------CCcceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHH Q lcl|NC_018087. 216 CGHQ---------HFA----------AGTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLK-LLEDAMMIYRI 275 (520) Q Consensus 216 ~~~~---------~~~----------~~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~-m~EDalVIyRi 275 (520) .... ..+ +..--++| |+++ +++....|=++..+....-+. ++=+....-+- T Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP---vv~~------~n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~ 271 (478) T protein:vir:10 201 LKEGQLIPDFYRSDDHIQPHYYQGNKLMSWGRVP---FIPF------KNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDE 271 (478) T ss_pred EcCCeeeccccccccccccceecccccccCCccc---eEEe------ccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1000 000 00001222 1211 122233454444444333333 44455555566 Q ss_pred hcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCc Q lcl|NC_018087. 276 TRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGM 355 (520) Q Consensus 276 ~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nL 355 (520) ++.|-+-+.-.+..+.. +....+....=++++.-+|| ++.+|-...+. T Consensus 272 ~~~p~~~~~g~~~~~~~-----------------------------~~~~~~~~~~~~~~~~~~~~---~~~~l~~~~~~ 319 (478) T protein:vir:10 272 SVELIYILKGYEGEDMK-----------------------------DFMHNLKYYKAISVAGESGS---GVDTIKVEVPI 319 (478) T ss_pred hhCceeeeecCCccccc-----------------------------hhhhhhhhcceEEecCCCCC---cceEEeecCCh Confidence 67775543322221110 00000110111234433433 45555444454 Q ss_pred ChH-HHHHHHHHHHHHhcCCChhhccCCCccccccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCh Q lcl|NC_018087. 356 NEM-DDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAIS--RDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITE 432 (520) Q Consensus 356 gei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eIt--RDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~ 432 (520) ..+ .-+.-+.+.+|+-.++|-- ..++ + -|..+..+ .-...-..-+.+.+..|...+..+++.=+-+.|+ . T Consensus 320 ~~~~~~~~~l~~~i~~~s~~p~~--~~~~-~--~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~g~--~ 392 (478) T protein:vir:10 320 DSVKEYTKMLRDYIIEFGQGVDF--QQDK-F--GNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRL--D 392 (478) T ss_pred HHHHHHHHHHHHHHHHHhCcccc--Cccc-c--ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--C Confidence 433 4566778889999999842 2221 1 12222222 2222233335556666666666655544444443 2 Q ss_pred hhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhc--- Q lcl|NC_018087. 433 DEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSD--- 509 (520) Q Consensus 433 eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~--- 509 (520) .+| ..|.+.|..-.--.+.. .+++++.+.+ .+|.+++++. |...++ .+++-++|++|..+ T Consensus 393 ~~~----~~i~i~f~~~~p~d~~e-------~a~~~~kl~g----~iS~et~~~~-l~~v~D-~~~E~~ri~~E~~~~~~ 455 (478) T protein:vir:10 393 VKV----QDIEITFNFNVMVNELE-------NSQIAMNSTG----LLSKETILSN-HAWVED-PVAEMERIEQENIELNQ 455 (478) T ss_pred ccc----ccceEEecCCCCCCHHH-------HHHHHHHHhC----CCChHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHh Confidence 233 34677775444333433 3445555543 3799999976 566432 22333333333321 Q ss_pred ---CCccCCccccC Q lcl|NC_018087. 510 ---KIFNPPEPEEI 520 (520) Q Consensus 510 ---~~~~~p~~e~~ 520 (520) ...+.+..++= T Consensus 456 ~~~~~~~~~~~~~~ 469 (478) T protein:vir:10 456 QLPDIEEGLNGEQQ 469 (478) T ss_pred hccccccccCCCCC Confidence 11111111111 No 116 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=96.45 E-value=0.00062 Score=38.18 Aligned_cols=422 Identities=10% Similarity=0.056 Sum_probs=177.5 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCce-eecccccccccccccccccccccchhHHHHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGAT-EVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYR 79 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~-~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR 79 (520) +.|-+..+.++..=+....-..++.-. .- -.|.+ .+...... +........+ T Consensus 28 ~~~~~~~i~~~i~~~~~~~~~~~~~~~-----~y-----Y~g~~~~i~~~~~~--------~~~~~~~~~~--------- 80 (481) T protein:vir:10 28 ELLKEENLRNFISRHQTEQVPRLEMLE-----SY-----YLNRNTDILAGERR--------LQKYGDKADH--------- 80 (481) T ss_pred hhcCHHHHHHHHHHHHHHHHHHHHHHH-----HH-----hcCCCcccccCccc--------cccccccccc--------- Confidence 111111112211111101000010000 00 00000 00000000 0000000000 Q ss_pred HHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEE Q lcl|NC_018087. 80 SLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHK 159 (520) Q Consensus 80 ~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hk 159 (520) . ...+=+.-+|+..+.-. -..||++..++.... +.+..++.-.+|+....++.+..++-|+-|.+. T Consensus 81 k-i~~n~~~~ivd~~~~~l-----~g~~~~~~~~d~~~~--------~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~ 146 (481) T protein:vir:10 81 R-AVHNYAKYVSRFIVGYL-----TGNPITITHQDNQTN--------DKIIELNDLNDADEVNSDLALNLSIYGRAYEIV 146 (481) T ss_pred e-eecchHHHHHHHHHhhh-----ccCCceEecCChhHH--------HHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEE Confidence 0 01222233333333222 136677776655333 345566666789999999999999999999988 Q ss_pred eeecCCCCCCeeeeEecCccceeeeeeccC--CCCccccccc-------ceecceeecCcc-cccccccceecCC----c Q lcl|NC_018087. 160 IINPNRPKDGIIELRRLDPRNVQFVRELDT--KMENGVKVVK-------GYREYFLYDTEL-ESYQCGHQHFAAG----T 225 (520) Q Consensus 160 vid~~~~k~GI~elr~lDPr~i~~vr~i~~--~~~~~~~~~~-------~~~ey~~y~~~~-~~~~~~~~~~~~~----~ 225 (520) -+|. +|-..++.+||+++.++.+-.. +..-.++.+. .+..+-+|.+.. ..+..++.....- . T Consensus 147 ~~d~----dg~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~ 222 (481) T protein:vir:10 147 YRDF----EDRDTFKVLDPKSTFVVYDQTLDKKVVAGVRYFEKQDKDKVPVQHVEVYTTDKIYYIEIKGGTYHRVEEVEH 222 (481) T ss_pred EeCC----CCeEEEEEEcccceEEEEcCCCCCceEEEEEEEEEeeCCCceEEEEEEEecCeEEEEEecCCceeecccccc Confidence 8763 4778899999999998754321 1112222111 111222444332 1111111111000 0 Q ss_pred ce-ecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHH Q lcl|NC_018087. 226 KI-KIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLK-LLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHI 303 (520) Q Consensus 226 ~~-~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~-m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~i 303 (520) .+ +|| |++.. |+....|-++..+....-+. ++-+....-+.++.|-+-+. |..+ T Consensus 223 ~~g~vP---vv~~~------n~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~----g~~~----------- 278 (481) T protein:vir:10 223 YYNDVP---IIEYL------NDQFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAII----GNVD----------- 278 (481) T ss_pred cCCcee---EEEee------cCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEee----cCcC----------- Confidence 01 222 23221 12223454444333333222 23333334444555544332 1111 Q ss_pred HHhhcceeEeecCCCc-cccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHH-HHHHHHHHHHhcCCChhhccC Q lcl|NC_018087. 304 MNSHRNRISYDARTGK-VKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDD-ILYFRKALYMALRVPLSRIPD 381 (520) Q Consensus 304 m~~~knklvYd~~TGe-v~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~kkLy~aL~VP~SRl~~ 381 (520) .|..+|. ++.++........-+.+ .+.+.++..|....+...+.. +.-.++.+|...++|---.+. T Consensus 279 ---------~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~ 346 (481) T protein:vir:10 279 ---------LDSEDAKAFRDANMIHLEPGTNANG---SEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQ 346 (481) T ss_pred ---------CCccchhhhhhccceeccccccccC---CCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc Confidence 1111111 11111111111111111 122335666555444444333 566677888998998422221 Q ss_pred CCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHH Q lcl|NC_018087. 382 EQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEIT 461 (520) Q Consensus 382 ~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~ 461 (520) .+.+. .+..|.........-+.+.|..|...+..+++.=+-+-++--..+++ ...+.+.|.....-.+... T Consensus 347 -~~~n~--Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~--~~~i~v~f~~~~~~~~~~~---- 417 (481) T protein:vir:10 347 -FSGVQ--SGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLKQHN--YAELTITFTPNLPKSMMES---- 417 (481) T ss_pred -ccccc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccc--cceeeEEeCCCCCcCHHHH---- Confidence 11111 11223333334556678888888888888775433332332222222 2457888876665555333 Q ss_pred HHHHHHHHHhhcccchhhhHHHHHHHHhCCCH---HHHHHHHHHHHHhhhc---CCccCCc--cccC Q lcl|NC_018087. 462 ERRVNVLSLMEPYIGKYISNHTAMKDFLQMSD---EDIAAERKLIDEELSD---KIFNPPE--PEEI 520 (520) Q Consensus 462 ~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tD---eeI~~~~kqi~~E~~~---~~~~~p~--~e~~ 520 (520) ++++.++.+ .+|.+++++. |...+ +|++.++++-+++.+. ..++++. .++. T Consensus 418 ---a~~~~kl~g----~is~et~~~~-l~~i~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 476 (481) T protein:vir:10 418 ---INAFNALSG----GVSESTRLSL-LDFIDNPKEELEKMQEEEAQREKQADKRGYGEAFENHLNV 476 (481) T ss_pred ---HHHHHHHhc----cCChHHHHHh-CCCCCCHHHHHHHHHHHHHHHHhhhhhccCCccCCCCCCC Confidence 344455533 3799999977 55543 4665555544333322 1122211 1111 No 117 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=96.37 E-value=0.0007 Score=37.89 Aligned_cols=393 Identities=9% Similarity=0.058 Sum_probs=182.5 Q ss_pred cccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHh Q lcl|NC_018087. 3 MLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLL 82 (520) Q Consensus 3 ~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma 82 (520) |++++.+ ++.... ..+. ++ ..+ .-. .+++|..+.....+ .....+ T Consensus 1 m~~~~~f-------~~~~~~--------~~~~-~~-~~~----~~~--~~~~~~~~~~~~~v------------~~~~al 45 (416) T protein:vir:12 1 MLLERMF-------EKRSGS--------SDHE-DG-FNN----ILL--NMFGGRKTASGERV------------SESNSL 45 (416) T ss_pred Cccchhc-------ccccCc--------cccC-cc-chh----HHH--HhhcCcccccCcee------------chhhhh Confidence 6665433 222210 0000 00 000 000 01112111111111 123456 Q ss_pred hccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHH-HHHHHHHHhcchhhhHH----HHHhhccccceeE Q lcl|NC_018087. 83 NNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISD-EFNSVLNMLNFQRKGSD----HFKRWYVDSRVFF 157 (520) Q Consensus 83 ~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~e-eF~~i~~ll~f~k~g~~----~fRrWYvDgri~~ 157 (520) ++|.|..||+-|.+.+.-++- .+--+.. ...+..... .+.-++.-=|=..++.+ ++..+.+.|.-|. T Consensus 46 ~~~~v~~~i~~Ia~~ia~l~~-----~~~~~~~---~~~~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~ 117 (416) T protein:vir:12 46 VQPDIFACVNVLSDDIAKLPI-----HTYKRTD---GGIERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYS 117 (416) T ss_pred ccHHHHHHHHHHHHhhhhCce-----EEEEecC---CccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEE Confidence 789999999999988764431 1110000 000111000 11111111122233444 4445677899998 Q ss_pred EEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEe Q lcl|NC_018087. 158 HKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYA 237 (520) Q Consensus 158 hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~ 237 (520) .++-|. + .-+.+|.+|||..++.+..-. ++.. |+.|.. .+..+.++.+.|+|. T Consensus 118 ~i~r~~-~--G~~~~L~~l~~~~v~v~~~~~----~~~~-------~~~~~~-------------~g~~~~~~~~eiih~ 170 (416) T protein:vir:12 118 YIQFGS-H--GYPEALFPLRPDYTNAYVHPT----TGML-------WYQTVL-------------NGKAIELYDYEVLHF 170 (416) T ss_pred EEEECC-C--CcEEEEEEECCcceEEEEeCC----CcEE-------EEEEec-------------CCeEEEecCccEEEe Confidence 887552 2 238999999999998753221 1111 222211 123468899999888 Q ss_pred ecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCC Q lcl|NC_018087. 238 HSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDART 317 (520) Q Consensus 238 hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~T 317 (520) . ...+++...+|.|+.|.+++......+....=+=-..+.-+-|..++ +.+.+..+++ +++-.++..+ . T Consensus 171 ~--~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~-~~~~~~~~~~-------~ 239 (416) T protein:vir:12 171 K--GLSTDGIHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVP-AFLDEKPKEN-VRKEWKRVNK-------V 239 (416) T ss_pred c--CcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecC-CCCCHHHHHH-HHHHHHHHhc-------C Confidence 5 24677777789999999999998888887665555556667777776 3455544433 3333333211 1 Q ss_pred CccccccccchhhhhhcccccCCCCCcceeecCCC-CCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhH Q lcl|NC_018087. 318 GKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGM-TGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISR 396 (520) Q Consensus 318 Gev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg-~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItR 396 (520) |. .+ .++ + |++++.|.=. ..+.-++-.++....+.++++||.+-|....+.+ .+.+.- T Consensus 240 ~~------~~-vl~-------~---g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t----~sn~e~ 298 (416) T protein:vir:12 240 EN------IA-IID-------Y---GLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKAT----FSNIEH 298 (416) T ss_pred CC------ee-ecC-------C---CceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCC----cccHHH Confidence 22 11 221 2 4555555321 1222244556778899999999999886432211 111111 Q ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_018087. 397 DELSFDKF-ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYI 475 (520) Q Consensus 397 DElkF~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~v 475 (520) .-..|..+ |.-+-.++..-|.. .++++.++. ....+.|.-+. +...+ ...|.+.+..+-.- T Consensus 299 ~~~~f~~~~l~P~~~~ie~~l~~---------~l~~~~~~~---~g~~i~fd~~~----l~~~d-~~~~~~~~~~~~~~- 360 (416) T protein:vir:12 299 QSIEYVRNTLQPWIVNFEQELNV---------KLFLDHDQK---SGHYVKFNIDS----ELRGD-SKTQAEYLKTLHET- 360 (416) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH---------hhcCchhhc---CCceEEeechh----hhccC-HHHHHHHHHHHHhC- Confidence 11224322 33333333332222 234444332 22334444322 22221 24566666655332 Q ss_pred chhhhHHHHHHHHhCCCHHHHHHHHHHH--------------HHhhhcCCcc--CCcccc Q lcl|NC_018087. 476 GKYISNHTAMKDFLQMSDEDIAAERKLI--------------DEELSDKIFN--PPEPEE 519 (520) Q Consensus 476 gky~S~~~i~k~IL~~tDeeI~~~~kqi--------------~~E~~~~~~~--~p~~e~ 519 (520) -+++.+-++. .|.|.+-+ .-++.+ ++-...+-.+ +++.|= T Consensus 361 -G~~T~NE~R~-~~gl~Pi~--ggd~~~~~~n~~~~~~~~~~~~~~~~~~~~gge~~~~g 416 (416) T protein:vir:12 361 -GVLNKDEIRE-LLERNPIE--NGDKYISSLNYVFLDFLEEYQRLKAGGAMKGGDNKNEG 416 (416) T ss_pred -CCcCHHHHHH-HhCCCCCC--CcceeeeccccccccccchhhccccccccCCCCCcCCC Confidence 3678888875 46775531 111100 0000000000 111111 No 118 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=96.20 E-value=0.00088 Score=37.34 Aligned_cols=435 Identities=12% Similarity=0.103 Sum_probs=171.6 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhcc-CCCcccCCCCC-----------CCceeeccccccccccccc-cccccccc Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIIND-KAESITAPKFD-----------DGATEVDSQDIAYNGVFQK-LYGSQDPT 67 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~~-~~~s~~~p~~~-----------dg~~~i~~~~~a~~g~~~~-~~~~~~~~ 67 (520) ..-....+..=+|--. -++. .+..++-|+-- .+.+..+.+...++=.+++ ++.++.-. T Consensus 35 ~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 105 (695) T protein:vir:78 35 TATAAQPVPADMGRRG---------ALNALDAAPVAEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFV 105 (695) T ss_pred hhccccccchhhcccc---------cccccccccccCCCcccccceeceeccccCCccccchhhhhhcccccccccchhh Confidence 0000111111111000 0000 00111111110 1111111111001000011 11111000 Q ss_pred chhHHHHHHHHHHHhhccchhHHHHhhhceeeEe------cCCCcE----EEEeeccchhhh-HHHHHHHHHHHHHHHHh Q lcl|NC_018087. 68 ATSTRELINTYRSLLNNYEVDNAVQEIVSDAIVY------EEGFDV----VSIDLDQTAFTE-NIRNLISDEFNSVLNML 136 (520) Q Consensus 68 ~~~~~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~------d~~~~~----V~l~Ld~~~~s~-~ik~~I~eeF~~i~~ll 136 (520) ..+-+==+-.--.|||+||.+.+++-|..||+-. ....+. +++.-+...-++ .-.++|..|++.+ T Consensus 106 ~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erL---- 181 (695) T protein:vir:78 106 TSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERL---- 181 (695) T ss_pred hccCcchHHHHHHHhhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHH---- Confidence 0001111223456899999999999999999543 222211 222222222222 2334666666553 Q ss_pred cchhhhHHHHHhhccccc--eeEEEeeec------------CCCCCCeeeeEecCccceeeeeeccCCCCccccccccee Q lcl|NC_018087. 137 NFQRKGSDHFKRWYVDSR--VFFHKIINP------------NRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYR 202 (520) Q Consensus 137 ~f~k~g~~~fRrWYvDgr--i~~hkvid~------------~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ 202 (520) +...+..+.++-=-+-|. +|+..-=|. +-+|.+++.|+.|||..+.+- ..+.++-+. T Consensus 182 ~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~---------~~n~~dP~s 252 (695) T protein:vir:78 182 RIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPN---------NYNSINPVA 252 (695) T ss_pred HHHHHHHHHHHhhccccceEEEEEeccCccccccccccccccccCcceeeeEeecccccccc---------hhhhccchh Confidence 222333332222222232 233332221 123456777999998766661 111111111 Q ss_pred cceeecCcccccccccceecCCcceecCcccEEEeecccccCC------CCcchhhhHHHHHHHHH-HHHHHHHHHHHHH Q lcl|NC_018087. 203 EYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDCC------GKNIIGYLHRAVKPANQ-LKLLEDAMMIYRI 275 (520) Q Consensus 203 ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~~------~~~~~syL~~aik~~Nq-L~m~EDalVIyRi 275 (520) + ..|.|+ .|... +.+||.+=++... |---|+ ++..+|.+..+..-..+ +++...+.=+- T Consensus 253 p-dfgkP~--~y~V~--------G~kIH~SRL~~f~-g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li-- 318 (695) T protein:vir:78 253 D-DFYKPS--TWWMI--------GTEVHATRLHTIV-SRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIV-- 318 (695) T ss_pred h-ccCCCc--eEEEe--------ceEEeeeeEEEec-CCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHH-- Confidence 1 111111 11111 1144444433322 221111 33345655555543332 33333332221 Q ss_pred hcCccceEEEccCC-CCchHHHHHHH--HHHHHhhcceeEeecCCCccccccccch-hhhhhcccccCCCCCcceeecCC Q lcl|NC_018087. 276 TRAPDRRVFYIDTG-NMPARKAAQHM--QHIMNSHRNRISYDARTGKVKNQANMMA-LTEDYWLQRRDGKAVTEVETLPG 351 (520) Q Consensus 276 ~RApeRRvFyIDvG-nlpk~KAeqyl--~~im~~~knklvYd~~TGev~d~~~~ms-mlEDywLpRReGgrgTEIsTLpG 351 (520) +.+.-+ ++-.|.. -|.....++.. -+++++||.-. |-+ .+. =.|||- .++ T Consensus 319 ~~~~v~-~lk~dla~~L~~g~~~~l~~R~eli~~~Rsn~------G~~-----llDk~~Eefe----------q~s---- 372 (695) T protein:vir:78 319 KQFSVS-GILMDLAQALMPGANVDLSMRAELINRYRDNR------NIL-----FLDKATEEFF----------QFN---- 372 (695) T ss_pred HhhhhH-HHHHHHHHhhcChhHHHHHHHHHHHHHhcCcc------ceE-----EEecCCcceE----------EEe---- Confidence 111111 1111211 01111122222 25566776311 110 000 013443 222 Q ss_pred CCCcChHHHHH-HHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHH Q lcl|NC_018087. 352 MTGMNEMDDIL-YFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKS-----NLL 425 (520) Q Consensus 352 g~nLgei~DV~-YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~-----QLi 425 (520) .+|+-++||. =|..-+=-+.+||+.||=..+.-++ ..++| -|.-.|...|..+|. ..+..+|++ |+- T Consensus 373 -tslSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGl-NATGE--~D~rnYYD~I~s~Qe---~~L~p~L~rl~~ii~rS 445 (695) T protein:vir:78 373 -TPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGL-NASSE--GEIRVWYDYVRAYQR---NALQQLMNDVIVMIQLS 445 (695) T ss_pred -cccCCHHHHHHHHHHHHHhhhcCchhhhhccCCccc-cccch--hhHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHH Confidence 4788899975 4888888899999999865553333 12221 244558888888775 334444433 444 Q ss_pred hcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHH Q lcl|NC_018087. 426 LKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDE 505 (520) Q Consensus 426 Lkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~ 505 (520) +-|.+. ..|.|+|+.=..-+|..-+||...+.+.....-.- --++.+-|+..+ .+|.+=-= -..++ T Consensus 446 ~~G~id--------pdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~--gvI~~~evr~rL--~~d~~s~Y-~~~~D- 511 (695) T protein:vir:78 446 LFGAVD--------PSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQE--QVIRPDQVAARL--NTEPDGPY-AGKLD- 511 (695) T ss_pred hcCCCC--------CcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHh--cCCCHHHHHHHH--hcCCCccc-ccccc- Confidence 445443 45889999888888888899988888764443111 012333333221 00000000 00000 Q ss_pred hhhcCCccCCccccC Q lcl|NC_018087. 506 ELSDKIFNPPEPEEI 520 (520) Q Consensus 506 E~~~~~~~~p~~e~~ 520 (520) +..+ ...|.++|| T Consensus 512 ~~d~--p~~~~~~~~ 524 (695) T protein:vir:78 512 ANDD--PGVPADDDI 524 (695) T ss_pred cccC--CCcCccchh Confidence 1111 112334444 No 119 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=96.17 E-value=0.00092 Score=37.24 Aligned_cols=400 Identities=16% Similarity=0.146 Sum_probs=179.1 Q ss_pred hcc-CCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchhHHH-------------- Q lcl|NC_018087. 27 IND-KAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNAV-------------- 91 (520) Q Consensus 27 ~~~-~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai-------------- 91 (520) ++- +..-+.-|+..+=.. ...+.+.... ..-+.+|+.+..+.+=...| T Consensus 1 ~~~~~~~~~~~~~~~~~~~----------~~i~~~i~~~-------~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~k 63 (452) T protein:vir:36 1 MKYKPPKLMTFSKDEPITV----------EVVTKFMEKH-------KLEVARYEYLKNMYLGIMAIDDEPAKDSWKPDNR 63 (452) T ss_pred CcccCceeEEcCCccCCCH----------HHHHHHHHHH-------HHHHHHHHHHHHHhccccccccCccccccCccce Confidence 100 111122222221100 0011111111 11123444444444332211 Q ss_pred ------HhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCC Q lcl|NC_018087. 92 ------QEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNR 165 (520) Q Consensus 92 ------~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~ 165 (520) ..||+-.+-+ --.+|+++..++. ...+....+++--+|+....+..+.+.+-|+-|++.-+|. T Consensus 64 i~~n~~~~ivd~~~~~-l~g~~~~~~~~d~--------~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~-- 132 (452) T protein:vir:36 64 LAVNFTKYIVDTFTGY-FNGIPVKKSHSDK--------EILTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDE-- 132 (452) T ss_pred eecchHHHHHHHHhhh-hcccCceeecCCh--------hHHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecC-- Confidence 1111111111 1125555555443 2334555666667889999999999999999999887763 Q ss_pred CCCCeeeeEecCccceeeeeeccCC--CCcccccc---cceecceeecCccc-cccccc--ceecCCc--ce-ecCcccE Q lcl|NC_018087. 166 PKDGIIELRRLDPRNVQFVRELDTK--MENGVKVV---KGYREYFLYDTELE-SYQCGH--QHFAAGT--KI-KIPYSAM 234 (520) Q Consensus 166 ~k~GI~elr~lDPr~i~~vr~i~~~--~~~~~~~~---~~~~ey~~y~~~~~-~~~~~~--~~~~~~~--~~-~I~~~aI 234 (520) +|-..++.+||+.+.++.+-... ..-.++.. .+.....+|++..- .+.... ....... .+ +|| | T Consensus 133 --~g~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~iP---v 207 (452) T protein:vir:36 133 --DTQTNVVYNSPENMFMVYDDTVKQEPLFAVRYGVDEDKKLQGEVYTLLETIKISGENDEISFGEGTYNPYPDLP---V 207 (452) T ss_pred --CCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEEecCeEEEEEEcCCceEEecceeccCCccc---E Confidence 36678999999999988543211 11111111 11122234544321 111110 0000000 01 222 2 Q ss_pred EEeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhc-ceeE Q lcl|NC_018087. 235 VYAHSGLVDCCGKNIIGYLHRAVKPANQLK-LLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHR-NRIS 312 (520) Q Consensus 235 ~y~hSGL~d~~~~~~~syL~~aik~~NqL~-m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~k-nklv 312 (520) +++ +|+....|-++..+.....+. ++-+....-+..+.|-+-+.- +.++... +.+.+ +++. T Consensus 208 v~~------~n~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g---~~~~~~~--------~~~~~~~~~~ 270 (452) T protein:vir:36 208 VEF------YFNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLG---AAVEEED--------LKNIRSNRVI 270 (452) T ss_pred EEe------cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeec---CCcCchh--------hhhhhhcceE Confidence 222 223334555555444444443 334444555667777665532 2222111 11111 1111 Q ss_pred eecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHH-HHHHHHHHHHhcCCChhhccCCCcccccccc Q lcl|NC_018087. 313 YDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDD-ILYFRKALYMALRVPLSRIPDEQTQNVFDMS 391 (520) Q Consensus 313 Yd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~ 391 (520) - ++--+.+.|..+.+|....+.+.+.- +.-+.+.+|.-.++|- +..++ ||.. T Consensus 271 ~---------------------~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~----~gn~ 323 (452) T protein:vir:36 271 N---------------------YYADGEGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVAN--ISDES----FGSS 323 (452) T ss_pred E---------------------ecCCCCccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCccc--cCccc----ccCC Confidence 1 11111233445666666555544333 5667788888899994 33322 2333 Q ss_pred c--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-CChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 392 T--AISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRV-ITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVL 468 (520) Q Consensus 392 ~--eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi-~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~ 468 (520) + .|..-+.....-+.+.++.|...+...++.=+-+.+. -...+|. .|.+.|...---.+. +.++++ T Consensus 324 Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~----~i~i~f~~~~p~d~~-------~~a~~~ 392 (452) T protein:vir:36 324 SGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNVSNKDSWK----DIEYTFTRNEPKDIK-------EQAETA 392 (452) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccc----cceEEeCCCCCcCHH-------HHHHHH Confidence 2 2222233344556667777777777766653333232 2333444 467777654443342 334445 Q ss_pred HHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcC-------CccCCccccC Q lcl|NC_018087. 469 SLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDK-------IFNPPEPEEI 520 (520) Q Consensus 469 ~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~-------~~~~p~~e~~ 520 (520) +.+. | .+|.+++++. |..+++ .+++-++|++|..+. .-+.+..++. T Consensus 393 ~k~~---g-~iS~et~~~~-~~~~~d-~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 445 (452) T protein:vir:36 393 NILM---G-ITSQETALSV-ISVIPD-VQAEMEKIKKEEASTAIFDKDKQPSEKGTDTV 445 (452) T ss_pred HHHh---c-cCChHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHHHhhccCCCCccccc Confidence 5553 3 3799999976 565532 333344444443221 1111111111 No 120 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=96.11 E-value=0.00099 Score=37.05 Aligned_cols=397 Identities=12% Similarity=0.065 Sum_probs=175.0 Q ss_pred Cceeeccccccccccccccccccccc---------chhHHHHHHHHHHHhhccchhHHH--------------------H Q lcl|NC_018087. 42 GATEVDSQDIAYNGVFQKLYGSQDPT---------ATSTRELINTYRSLLNNYEVDNAV--------------------Q 92 (520) Q Consensus 42 g~~~i~~~~~a~~g~~~~~~~~~~~~---------~~~~~~LI~~YR~ma~~pEvd~Ai--------------------~ 92 (520) -|..++.. +.+.... ++.-..-+.+|+.+..+++-...| . T Consensus 1 ~~~~~~~~-----------~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~ 69 (499) T protein:vir:10 1 MAVVIDKD-----------LLDDVNEPNIEAINYAIRELQNRKKRLDKLSDYYNGKQEIEKHEFDNATVEAANVMVNHAK 69 (499) T ss_pred Cccchhhh-----------HHhhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCCcCcCCCCcceeecchHH Confidence 11111110 0000000 000111123444444443333221 1 Q ss_pred hhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCC------ Q lcl|NC_018087. 93 EIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRP------ 166 (520) Q Consensus 93 eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~------ 166 (520) .||+..+-+= -..||++..++. ...+++..+++--+|+....++.+.+.+.|+-|.+.-+|.+.. T Consensus 70 ~Iv~~~~~~l-~g~p~~~~~~~~--------~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~ 140 (499) T protein:vir:10 70 YITDMNVGFM-TGNPVKYVAEKG--------KNIDDILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDE 140 (499) T ss_pred HHHHHHhhhh-cccCceeecCCh--------hHHHHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEeccccccccccc Confidence 1222211111 125666655543 2344566777777899999999999999999998887774321 Q ss_pred -------CCCeeeeEecCccceeeeeeccCCCC--cccccc--------cceecceeecCcc-cccccccceecCCc--- Q lcl|NC_018087. 167 -------KDGIIELRRLDPRNVQFVRELDTKME--NGVKVV--------KGYREYFLYDTEL-ESYQCGHQHFAAGT--- 225 (520) Q Consensus 167 -------k~GI~elr~lDPr~i~~vr~i~~~~~--~~~~~~--------~~~~ey~~y~~~~-~~~~~~~~~~~~~~--- 225 (520) +..-..+..+||+.+-.|.+-..... -+++.+ +.+..+-+|++.- ..|...+....... T Consensus 141 ~~~~~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~ 220 (499) T protein:vir:10 141 LGNEKLTPNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEVSANDPI 220 (499) T ss_pred ccccccccccceEEEEEcccceEEEecCCCCcceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccccCccee Confidence 22235588899999888765332211 111110 0112223454432 22221111110000 Q ss_pred ------ce-ecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHH Q lcl|NC_018087. 226 ------KI-KIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLK-LLEDAMMIYRITRAPDRRVFYIDTGNMPARKAA 297 (520) Q Consensus 226 ------~~-~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~-m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAe 297 (520) .+ +|| ||++ +|+....|=++.++.....+. ++-+....-+-...|-+-+.-.+.+... T Consensus 221 ~~~~~~~~g~vP---vv~~------~n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~----- 286 (499) T protein:vir:10 221 VYDGENLFGAVP---IIEF------RNNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGLGDDK----- 286 (499) T ss_pred cccccCCCCccc---eEEe------cCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccc----- Confidence 11 222 2222 112233455555555555444 3345555556666776665533322211 Q ss_pred HHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChH-HHHHHHHHHHHHhcCCCh Q lcl|NC_018087. 298 QHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEM-DDILYFRKALYMALRVPL 376 (520) Q Consensus 298 qyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei-~DV~YF~kkLy~aL~VP~ 376 (520) .....+..+-+.--.+..|..+++|-...+.... .-+.-+.+.+|+-..+|- T Consensus 287 ---------------------------~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~ 339 (499) T protein:vir:10 287 ---------------------------DDIQRLKRGAIEAPPREEGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPN 339 (499) T ss_pred ---------------------------chhhhhhhcceeccCCCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCccc Confidence 1111111111111122223446666554444333 344666777888888884 Q ss_pred hhccCCCccccccccchhhHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-ChhhHHhhhhceEEEeeccchHH Q lcl|NC_018087. 377 SRIPDEQTQNVFDMSTAISRD--ELSFDKFISELQHKFEEIFLSPLKSNLLLKRVI-TEDEWEAELNNIKIVFHKNSYFS 453 (520) Q Consensus 377 SRl~~~~~~~~~G~~~eItRD--ElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~-t~eew~~~~~~I~~~f~~Dn~f~ 453 (520) +.++. +.|..+..+.. ......-+.+.++.|...+.++++.=+-+-++. ...+| ..+.+.|....--. T Consensus 340 --~~~~~---~~gn~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~d~----~~i~i~f~~~~p~n 410 (499) T protein:vir:10 340 --MNDEK---FMGNVSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIKGANDDA----SGCKISLVANIPSN 410 (499) T ss_pred --CCchh---hcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccc----ccceEEeCCCCCCC Confidence 22221 11333333322 222233455556666666666665543332221 22233 35677786555444 Q ss_pred HHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCC---HHHHHHHHHHHHHhhh------cCCccCC-ccccC Q lcl|NC_018087. 454 EMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMS---DEDIAAERKLIDEELS------DKIFNPP-EPEEI 520 (520) Q Consensus 454 ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~t---DeeI~~~~kqi~~E~~------~~~~~~p-~~e~~ 520 (520) + .+.+++++.+. | .+|.+++++. |... ++|++.+.++-++..+ .+.-+++ ++++- T Consensus 411 ~-------~e~~~~~~kl~---g-~iS~et~~~~-l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~ 475 (499) T protein:vir:10 411 L-------SDVVNNVKNAD---G-IIPRKYTYSW-LPDVDNPQDVIDEMNQQDAETIKKNQEALRGQDPDRLELEDK 475 (499) T ss_pred H-------HHHHHHHHHHh---c-cCChHHHHHh-CCCCCCHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCC Confidence 4 34444555553 3 3799999977 5553 3455554443332211 1111111 11111 No 121 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=96.11 E-value=0.001 Score=37.03 Aligned_cols=449 Identities=10% Similarity=0.040 Sum_probs=182.1 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeeccc-cc---ccccccccccccccccchhHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQ-DI---AYNGVFQKLYGSQDPTATSTRELIN 76 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~-~~---a~~g~~~~~~~~~~~~~~~~~~LI~ 76 (520) -....++.|.-.-+..+.-......-...+.+..++++.+--...|... +. .+. -+..+|.|....+........ T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~-~~~~yY~g~~~~i~~~~~~~~ 81 (501) T protein:vir:96 3 QTLFTDSTGQERVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQ-ELLDYARGENHDVLKSGRRKD 81 (501) T ss_pred eeeeeecccceeccccccchhHHhhhcccccccccCChHHHHHHHHHHHHHHHHHHHH-HHHHHhcCCCCcccCccccCc Confidence 0000111111111000000000000000111111111000000000000 00 000 000001010000000000000 Q ss_pred HHH--HHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccc Q lcl|NC_018087. 77 TYR--SLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSR 154 (520) Q Consensus 77 ~YR--~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgr 154 (520) ..+ .-..++=..-+|+..+.-++ ..||++..++.+-++. +.+.++.++.--+|+....++++..++-|+ T Consensus 82 ~~~~~~ri~~n~~k~Ivd~~~~yl~-----g~p~~~~~~~~~~~~~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ 152 (501) T protein:vir:96 82 NEMADKRAVHNYGRMISKFKTGYLA-----GNPIRVEYDDNDDNSQ----NDDAIKRIGRINDLDSLNRTLIRDLSQTGR 152 (501) T ss_pred cccccceeecchHHHHHHHHhhhhc-----ccCeeEeeCCccchhH----HHHHHHHHHHhcCHHHHHHHHHHHHhhcCe Confidence 000 00112223333333333222 4778888777655553 444455667677899999999999999999 Q ss_pred eeEEEeeecCCCCCCeeeeEecCccceeeeeeccC--CCCcccccc------cceecceeecCcc-cccccccce----e Q lcl|NC_018087. 155 VFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDT--KMENGVKVV------KGYREYFLYDTEL-ESYQCGHQH----F 221 (520) Q Consensus 155 i~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~--~~~~~~~~~------~~~~ey~~y~~~~-~~~~~~~~~----~ 221 (520) -|.+.-.|. +|-..+..+||+.+..|.+-.. +..-+++.+ .+...+-+|++.. ..+..++.. . T Consensus 153 a~~~v~~de----dg~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~ 228 (501) T protein:vir:96 153 AYEVIYRSE----YDETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQSAKDVVEIYTDEHIYTLDASDDFNEISV 228 (501) T ss_pred EEEEEEEcC----CCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeecCCCcEEEEEEEcCCcEEEEeeCCCceeccc Confidence 998887763 3667899999999998854321 112222221 1112233554432 111111100 0 Q ss_pred cCCcce-ecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHH Q lcl|NC_018087. 222 AAGTKI-KIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLK-LLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQH 299 (520) Q Consensus 222 ~~~~~~-~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~-m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqy 299 (520) .++ .+ ++| |+++ .|+....|=++..+.....+. ++=+....-+..+.|-+-+.-.+....+... T Consensus 229 ~~~-~~g~vP---vv~~------~nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~---- 294 (501) T protein:vir:96 229 TTH-AFGTVP---ITEY------LNNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQA---- 294 (501) T ss_pred ccc-CCCccc---eEEe------cCCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccch---- Confidence 000 11 222 3332 122234565565554444443 4444455556666776665443322211100 Q ss_pred HHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCC----CCCcceeecCCCCCcChHH-HHHHHHHHHHHhcCC Q lcl|NC_018087. 300 MQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDG----KAVTEVETLPGMTGMNEMD-DILYFRKALYMALRV 374 (520) Q Consensus 300 l~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReG----grgTEIsTLpGg~nLgei~-DV~YF~kkLy~aL~V 374 (520) .+ |.. + ..++++-.++ +.+..+..|-+..+...+. -++-+++.+|...++ T Consensus 295 -~~-~~~--~---------------------~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~ 349 (501) T protein:vir:96 295 -SD-MKR--T---------------------RLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNT 349 (501) T ss_pred -hh-hhh--c---------------------CeeeecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCC Confidence 00 100 0 1122222221 2223455554433332222 234556777888899 Q ss_pred ChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCh-hhHHhhhhceEEEeeccchHH Q lcl|NC_018087. 375 PLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITE-DEWEAELNNIKIVFHKNSYFS 453 (520) Q Consensus 375 P~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~-eew~~~~~~I~~~f~~Dn~f~ 453 (520) |---++.-+ .+. .+..|.--.......+.+.++.|..-+..+++.=+-+-++... .+++ ...|.+.|...-.-. T Consensus 350 p~~~~~~~~-~n~--Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d--~~~i~i~f~~~~p~n 424 (501) T protein:vir:96 350 PDMSDTNFS-GNT--SGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKDFD--ESLLKITFTPNLPKS 424 (501) T ss_pred cccCccccc-ccc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc--cccceEEeCCCCCcC Confidence 842222111 111 1122222233345667777777777777776654433333221 1222 234778886544444 Q ss_pred HHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhc----CCccCCcc---ccC Q lcl|NC_018087. 454 EMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSD----KIFNPPEP---EEI 520 (520) Q Consensus 454 ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~----~~~~~p~~---e~~ 520 (520) + .+.++++..+.+ .+|.+++++. |...++ -+++.++|++|..+ +...+-++ ++- T Consensus 425 ~-------~e~ad~~~kl~g----~iS~et~~~~-l~~v~D-~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 485 (501) T protein:vir:96 425 L-------NEQVSILTGLGG----QVSQETALSL-SGLVES-PNEELDKINKEMSEIDFKGYSNDFNEHVGKYT 485 (501) T ss_pred H-------HHHHHHHHHHhc----cCchHHHHHh-CCCCCC-HHHHHHHHHHHHHHhhccccccchhhcccccC Confidence 4 344455666643 3799999987 555432 22333444444332 21111100 000 No 122 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=96.03 E-value=0.0011 Score=36.81 Aligned_cols=389 Identities=9% Similarity=0.042 Sum_probs=168.2 Q ss_pred cCCCCCCCceeecccccccccccccccccccccchh-HHHHHHHHHHHhhccchh------------------------- Q lcl|NC_018087. 35 TAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATS-TRELINTYRSLLNNYEVD------------------------- 88 (520) Q Consensus 35 ~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~-~~~LI~~YR~ma~~pEvd------------------------- 88 (520) .-|.+.+ ++...+.. -.....+|+.+..+++-+ T Consensus 1 ~~~~t~~---------------------~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~ 59 (456) T protein:vir:79 1 MTASTPA---------------------EWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNW 59 (456) T ss_pred CCCCCHH---------------------HHHHHHHHHHHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcch Confidence 0000000 00000000 011122233333333333 Q ss_pred --HHHHhhhceeeEecCCCcEEEEee-ccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCC Q lcl|NC_018087. 89 --NAVQEIVSDAIVYEEGFDVVSIDL-DQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNR 165 (520) Q Consensus 89 --~Ai~eIvneaiv~d~~~~~V~l~L-d~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~ 165 (520) .+|+..+.-++ .++|++.. ++.+..+ .+..+..--+|+....++++.-.+-|+-|.+.-.| + T Consensus 60 ~~~ivd~~~~~l~-----~~g~~~~~~~d~~~~~--------~~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~-e- 124 (456) T protein:vir:79 60 GLMVRDSVADRII-----PNGITVGGSADSDLAL--------RARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR-D- 124 (456) T ss_pred HHHHHHHHHhhhc-----cCCeecCCCCCccHHH--------HHHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeC-C- Confidence 33333332221 13343322 1222233 23334444577888889999999999988775544 3 Q ss_pred CCCCeeeeEecCccceeeeeeccCC--CCccccccc----ceecceeecCcc-ccc-------ccc--cceecCCcc-ee Q lcl|NC_018087. 166 PKDGIIELRRLDPRNVQFVRELDTK--MENGVKVVK----GYREYFLYDTEL-ESY-------QCG--HQHFAAGTK-IK 228 (520) Q Consensus 166 ~k~GI~elr~lDPr~i~~vr~i~~~--~~~~~~~~~----~~~ey~~y~~~~-~~~-------~~~--~~~~~~~~~-~~ 228 (520) +|-..++.++|+.+-.+.+=... ....+..+. ....+.+|.+.. ..+ ... .......+. +. T Consensus 125 --dg~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 202 (456) T protein:vir:79 125 --DGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVP 202 (456) T ss_pred --CCceEEEEeccceeEEEEcCCCCCceEEEEEEEEecCCceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceee Confidence 34557889999988777541111 111211111 011111222111 000 000 000000000 00 Q ss_pred c-------CcccEEEeecccccCCCCcchhhhHHHHHHHHHH--------HHHHHHHHHHHHhcCccceEEEccCCCCch Q lcl|NC_018087. 229 I-------PYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQL--------KLLEDAMMIYRITRAPDRRVFYIDTGNMPA 293 (520) Q Consensus 229 I-------~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL--------~m~EDalVIyRi~RApeRRvFyIDvGnlpk 293 (520) + +.--|++. ++...+|=++..+.....+ ..+|.....+|+.-....-.+-.|..+-+- T Consensus 203 ~~~~~~~~~~~pvv~~-------~N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i 275 (456) T protein:vir:79 203 VGDAVVTGSPPPVVVY-------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAI 275 (456) T ss_pred cccccCCCCceeEEEe-------cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCccccccccccccc Confidence 0 00111221 2223345555544433322 233333333333332221111111111000 Q ss_pred HHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcC Q lcl|NC_018087. 294 RKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALR 373 (520) Q Consensus 294 ~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~ 373 (520) ...+.++. ..| -.|+ +..|.+|..++..+-=+-.+-++-+-..++...+ T Consensus 276 --------~~~~~~~~------~~~-------------~~~~----~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~t~ 324 (456) T protein:vir:79 276 --------DYASIFEA------APG-------------ALWE----LPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATK 324 (456) T ss_pred --------chhhhhhh------hcc-------------cccc----CCCCcceeeecccChHHHHHHHHHHHHHHHhhcC Confidence 00011110 001 1122 1123445455543222233446777778888889 Q ss_pred CChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHH Q lcl|NC_018087. 374 VPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFS 453 (520) Q Consensus 374 VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ 453 (520) +|..-|...++ |. .+..|---+..+-.-+.+.|+.|..-+...++.=+.+.|.. +. ..|++.|..-..=+ T Consensus 325 ~p~~~~~~~~~-N~--Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~g~~--~~-----~~i~v~w~~~~~~s 394 (456) T protein:vir:79 325 TPLPMLMPDSA-NQ--SAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES--VE-----DTVDVSFESPDRVT 394 (456) T ss_pred CChhHhccccc-Cc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--cc-----ccceEEeCCCCCcC Confidence 99987764432 11 22344445555777788889999999999988878888842 22 24788886544333 Q ss_pred HHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCc------cCCcccc Q lcl|NC_018087. 454 EMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIF------NPPEPEE 519 (520) Q Consensus 454 ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~------~~p~~e~ 519 (520) . .+..+++..+..- ...|.+.++ .+|++|++||++.+.+-..+..++.. ++|++.- T Consensus 395 ~-------~~~ada~~kl~~~--G~~~~~~~~-~~lg~~~~~i~~~e~~r~~~e~~~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:79 395 L-------GEKYSAASLAKAA--GESWASIRR-NILNYNADQIKQDDLDRAREQITLFAGNPVQRPQEDGSR 456 (456) T ss_pred H-------HHHHHHHHHHHhc--CCChHHHHH-hcCCCCHHHHHHHHHHHHHHHHHHHhhhHhhcCCCCCCC Confidence 3 3345555544321 245666554 68999999987644433333233322 2222222 No 123 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=96.02 E-value=0.0011 Score=36.78 Aligned_cols=408 Identities=12% Similarity=0.128 Sum_probs=187.8 Q ss_pred CccccccchhhhcchhhhhhhHHHhhh----ccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKII----NDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELIN 76 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~----~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~ 76 (520) |. -.|+..-|++|..+++.+..-.-. +..+.+.+.|..+ ..+.. ...+++++.. . . T Consensus 1 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~--~~~~~----~~~~~~~~~~----~---------~ 60 (441) T protein:vir:79 1 MH-WYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDD--LQMMV----QTLPGFQGTK----L---------R 60 (441) T ss_pred Cc-cccCccccccccccccchhhhhccccccccccccccCCCcc--hHHHH----HHhcccCccc----c---------c Confidence 21 123444444444443332200000 0011122222111 11100 0001111111 0 0 Q ss_pred HH--HHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcch----hhhHH----HH Q lcl|NC_018087. 77 TY--RSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQ----RKGSD----HF 146 (520) Q Consensus 77 ~Y--R~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~----k~g~~----~f 146 (520) .| ..-+++|.|..||+-|.+.+.-. |+.+. ++.+.. .-+.++.+|+-. -++.+ ++ T Consensus 61 ~~~~~~al~~~~V~~cv~~Ia~~iA~l-----p~~~~-~~~~~~---------~~~~~~~lL~~~PN~~~t~~~f~~~~~ 125 (441) T protein:vir:79 61 QYKDIEAIRHSDIFTAVMMIASDLARM-----PIRVT-VNGQIN---------YSDRIVNLLNTRPNPMYNGYIFKLVVF 125 (441) T ss_pred ccchhhhhccHHHHHHHHHHHHhhccC-----ceeee-cCcccc---------ccchHHHHHhcccCcCCCHHHHHHHHH Confidence 11 12367889999999998877654 23332 111111 123345555432 22334 44 Q ss_pred HhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcc Q lcl|NC_018087. 147 KRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTK 226 (520) Q Consensus 147 RrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~ 226 (520) ..+.+.|--|..++-|. +. -+++|.+|+|..+..+++ .++.- .|+++..... ..+.. T Consensus 126 ~~lll~Gnay~~i~r~~-~G--~~~~L~~i~~~~v~v~~d-----~~g~~------~~~~~~~~~~---------~~~~~ 182 (441) T protein:vir:79 126 VSALLTSHGYIEITRDK-TG--EPMNLTFRKTSEIELKSD-----ARGRL------YYFHQRIDSN---------GNNIE 182 (441) T ss_pred HHHhhcCCeEEEEEECC-CC--cEEEEEEEcCceeEEEEC-----CCccE------EEEEEEeccC---------CceeE Confidence 44678899999988652 21 289999999999987533 12211 1222211100 01122 Q ss_pred eecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHh Q lcl|NC_018087. 227 IKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNS 306 (520) Q Consensus 227 ~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~ 306 (520) ..++.+.|+|.. ....+|-..+|-|+.|.+++.....+++...=+=---|--+-|..++ |.+...+|.+=+++-+++ T Consensus 183 ~~~~~~dvih~k--~~~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~e~~e~~r~~~~~ 259 (441) T protein:vir:79 183 RNVKFEDMLDIK--FYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFHK 259 (441) T ss_pred EEEccccEEEec--cCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC-CCCCCHHHHHHHHHHHHH Confidence 467888888775 24566767789999999999888888877654433445567777777 455444554434433322 Q ss_pred -hcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcC-hHHHHHHHHHHHHHhcCCChhhccCCCc Q lcl|NC_018087. 307 -HRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMN-EMDDILYFRKALYMALRVPLSRIPDEQT 384 (520) Q Consensus 307 -~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLg-ei~DV~YF~kkLy~aL~VP~SRl~~~~~ 384 (520) |. | .++..+.+ .++ + |.+++.|.=...-. -++-.++..+.+.++++||.+.|....+ T Consensus 260 ~~~---------G-~~nag~~~-vl~-------~---G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~ 318 (441) T protein:vir:79 260 SFS---------G-TKQAGKVV-VLD-------E---SMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA 318 (441) T ss_pred Hhc---------C-ccccCcce-ecC-------C---CceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCC Confidence 22 2 22222232 222 1 45566554222111 2344466778899999999998864322 Q ss_pred cccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHH Q lcl|NC_018087. 385 QNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERR 464 (520) Q Consensus 385 ~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R 464 (520) ++.+.-..+-|..-+.-+-.++..-+.. .| .+ ++. ...+.|.-+ ++...+ ...| T Consensus 319 ------~~s~~q~~~~~~~tl~P~~~~ie~eln~----kl-----~~--~~~----~~~~~fd~~----~llr~D-~~~~ 372 (441) T protein:vir:79 319 ------NMSITDANLDYLSTLKPYITCVCAELNF----KF-----ND--EYV----NREFKFDTT----EIRVVD-EKTQ 372 (441) T ss_pred ------CccHHHHHHHHHHHHHHHHHHHHHHHhh----hc-----cc--ccc----CceEEeech----hhhccC-HHHH Confidence 1112222333443344333333333332 22 22 111 233444322 222222 3456 Q ss_pred HHHHHHhhcccchhhhHHHHHHHHhCCCHHH-----H----------HHHH-HHHHH-hhhcCCccCCcccc Q lcl|NC_018087. 465 VNVLSLMEPYIGKYISNHTAMKDFLQMSDED-----I----------AAER-KLIDE-ELSDKIFNPPEPEE 519 (520) Q Consensus 465 ~~~~~~~~p~vgky~S~~~i~k~IL~~tDee-----I----------~~~~-kqi~~-E~~~~~~~~p~~e~ 519 (520) .+.++.+-.- -+++.+-++. .+++.+-+ + +... .|..+ ...+.--+-+|+.| T Consensus 373 ~~~~~~~i~~--G~~T~NE~R~-~~gl~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:79 373 AEIDKINIDS--GKMNIDEIRQ-RDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred HHHHHHHHhC--CCcCHHHHHH-HhCCCCCCCCCcceEeecccccccccccccccccccccccccCCCCCCC Confidence 6666555322 3678888874 46775421 1 0000 00000 00011122333333 No 124 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=96.02 E-value=0.0011 Score=36.78 Aligned_cols=408 Identities=12% Similarity=0.128 Sum_probs=187.8 Q ss_pred CccccccchhhhcchhhhhhhHHHhhh----ccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKII----NDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELIN 76 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~----~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~ 76 (520) |. -.|+..-|++|..+++.+..-.-. +..+.+.+.|..+ ..+.. ...+++++.. . . T Consensus 1 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~--~~~~~----~~~~~~~~~~----~---------~ 60 (441) T protein:vir:94 1 MH-WYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDD--LQMMV----QTLPGFQGTK----L---------R 60 (441) T ss_pred Cc-cccCccccccccccccchhhhhccccccccccccccCCCcc--hHHHH----HHhcccCccc----c---------c Confidence 21 123444444444443332200000 0011122222111 11100 0001111111 0 0 Q ss_pred HH--HHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcch----hhhHH----HH Q lcl|NC_018087. 77 TY--RSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQ----RKGSD----HF 146 (520) Q Consensus 77 ~Y--R~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~----k~g~~----~f 146 (520) .| ..-+++|.|..||+-|.+.+.-. |+.+. ++.+.. .-+.++.+|+-. -++.+ ++ T Consensus 61 ~~~~~~al~~~~V~~cv~~Ia~~iA~l-----p~~~~-~~~~~~---------~~~~~~~lL~~~PN~~~t~~~f~~~~~ 125 (441) T protein:vir:94 61 QYKDIEAIRHSDIFTAVMMIASDLARM-----PIRVT-VNGQIN---------YSDRIVNLLNTRPNPMYNGYIFKLVVF 125 (441) T ss_pred ccchhhhhccHHHHHHHHHHHHhhccC-----ceeee-cCcccc---------ccchHHHHHhcccCcCCCHHHHHHHHH Confidence 11 12367889999999998877654 23332 111111 123345555432 22334 44 Q ss_pred HhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcc Q lcl|NC_018087. 147 KRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTK 226 (520) Q Consensus 147 RrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~ 226 (520) ..+.+.|--|..++-|. +. -+++|.+|+|..+..+++ .++.- .|+++..... ..+.. T Consensus 126 ~~lll~Gnay~~i~r~~-~G--~~~~L~~i~~~~v~v~~d-----~~g~~------~~~~~~~~~~---------~~~~~ 182 (441) T protein:vir:94 126 VSALLTSHGYIEITRDK-TG--EPMNLTFRKTSEIELKSD-----ARGRL------YYFHQRIDSN---------GNNIE 182 (441) T ss_pred HHHhhcCCeEEEEEECC-CC--cEEEEEEEcCceeEEEEC-----CCccE------EEEEEEeccC---------CceeE Confidence 44678899999988652 21 289999999999987533 12211 1222211100 01122 Q ss_pred eecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHh Q lcl|NC_018087. 227 IKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNS 306 (520) Q Consensus 227 ~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~ 306 (520) ..++.+.|+|.. ....+|-..+|-|+.|.+++.....+++...=+=---|--+-|..++ |.+...+|.+=+++-+++ T Consensus 183 ~~~~~~dvih~k--~~~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~e~~e~~r~~~~~ 259 (441) T protein:vir:94 183 RNVKFEDMLDIK--FYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFHK 259 (441) T ss_pred EEEccccEEEec--cCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC-CCCCCHHHHHHHHHHHHH Confidence 467888888775 24566767789999999999888888877654433445567777777 455444554434433322 Q ss_pred -hcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcC-hHHHHHHHHHHHHHhcCCChhhccCCCc Q lcl|NC_018087. 307 -HRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMN-EMDDILYFRKALYMALRVPLSRIPDEQT 384 (520) Q Consensus 307 -~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLg-ei~DV~YF~kkLy~aL~VP~SRl~~~~~ 384 (520) |. | .++..+.+ .++ + |.+++.|.=...-. -++-.++..+.+.++++||.+.|....+ T Consensus 260 ~~~---------G-~~nag~~~-vl~-------~---G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~ 318 (441) T protein:vir:94 260 SFS---------G-TKQAGKVV-VLD-------E---SMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA 318 (441) T ss_pred Hhc---------C-ccccCcce-ecC-------C---CceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCC Confidence 22 2 22222232 222 1 45566554222111 2344466778899999999998864322 Q ss_pred cccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHH Q lcl|NC_018087. 385 QNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERR 464 (520) Q Consensus 385 ~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R 464 (520) ++.+.-..+-|..-+.-+-.++..-+.. .| .+ ++. ...+.|.-+ ++...+ ...| T Consensus 319 ------~~s~~q~~~~~~~tl~P~~~~ie~eln~----kl-----~~--~~~----~~~~~fd~~----~llr~D-~~~~ 372 (441) T protein:vir:94 319 ------NMSITDANLDYLSTLKPYITCVCAELNF----KF-----ND--EYV----NREFKFDTT----EIRVVD-EKTQ 372 (441) T ss_pred ------CccHHHHHHHHHHHHHHHHHHHHHHHhh----hc-----cc--ccc----CceEEeech----hhhccC-HHHH Confidence 1112222333443344333333333332 22 22 111 233444322 222222 3456 Q ss_pred HHHHHHhhcccchhhhHHHHHHHHhCCCHHH-----H----------HHHH-HHHHH-hhhcCCccCCcccc Q lcl|NC_018087. 465 VNVLSLMEPYIGKYISNHTAMKDFLQMSDED-----I----------AAER-KLIDE-ELSDKIFNPPEPEE 519 (520) Q Consensus 465 ~~~~~~~~p~vgky~S~~~i~k~IL~~tDee-----I----------~~~~-kqi~~-E~~~~~~~~p~~e~ 519 (520) .+.++.+-.- -+++.+-++. .+++.+-+ + +... .|..+ ...+.--+-+|+.| T Consensus 373 ~~~~~~~i~~--G~~T~NE~R~-~~gl~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:94 373 AEIDKINIDS--GKMNIDEIRQ-RDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred HHHHHHHHhC--CCcCHHHHHH-HhCCCCCCCCCcceEeecccccccccccccccccccccccccCCCCCCC Confidence 6666555322 3678888874 46775421 1 0000 00000 00011122333333 No 125 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=95.98 E-value=0.0012 Score=36.65 Aligned_cols=400 Identities=17% Similarity=0.153 Sum_probs=179.8 Q ss_pred hhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchhHHH---- Q lcl|NC_018087. 16 HKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNAV---- 91 (520) Q Consensus 16 ~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai---- 91 (520) .|.+ +-.-+.=|++..=... ....+..... .-+.+|+.+..+++=+..| T Consensus 1 ~~~~----------~~~~~~~p~d~~~~~~----------~l~~~i~~~~-------~~~~r~~~~~~yy~g~~~i~~~~ 53 (453) T protein:vir:39 1 MKYK----------PPKLMTFPKDEPITNE----------VVTKFMEKHR-------LEVARYEYLKNMYRGIMAIDAEP 53 (453) T ss_pred Ceec----------CCcceEcCCCCCCCHH----------HHHHHHHHHH-------HHHHHHHHHHHHhhccCchhcCC Confidence 1111 1113333333321111 0111111111 1122333333333322111 Q ss_pred ----------------HhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccce Q lcl|NC_018087. 92 ----------------QEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRV 155 (520) Q Consensus 92 ----------------~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri 155 (520) ..||+-.+-+= -..||++..++. ...+.+..+.+--+|+....+..+.+++-|+- T Consensus 54 ~~~~~~~~~ki~~n~~~~ivd~~~~~l-~g~~~~~~~~d~--------~~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~ 124 (453) T protein:vir:39 54 TKDLWKPDNRLTVNFTKYIVDTFTGYF-NGIPVKKSHSDK--------ETLSKLQEFDNLNDMEDEESELAKMACIYGRA 124 (453) T ss_pred CccccCccceeecchHHHHHHHHhhhh-cccCceeccCCh--------HHHHHHHHHHHhcChhHHHHHHHHHHhhcCeE Confidence 11111110000 124445544432 23445677777788999999999999999998 Q ss_pred eEEEeeecCCCCCCeeeeEecCccceeeeeeccCCC--Cccccccc---ceecceeecCcccc-cccccceecC----Cc Q lcl|NC_018087. 156 FFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKM--ENGVKVVK---GYREYFLYDTELES-YQCGHQHFAA----GT 225 (520) Q Consensus 156 ~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~--~~~~~~~~---~~~ey~~y~~~~~~-~~~~~~~~~~----~~ 225 (520) |.+.-.|. +|-..++.+||+.+..+.+-.... .-.++... ...-.-+|.+..-. +..++..+.. .. T Consensus 125 ~~~v~~d~----~g~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~~yt~~~i~~~~~~~~~~~~~~~~~~ 200 (453) T protein:vir:39 125 FELLYQNE----ETQTNVIYNTPENMFMVYDDTIKQEPLFAVRYGYDDDYKLYGEVYTKETTYALNGTMGFYNMTEQAPN 200 (453) T ss_pred EEEEEecC----CCceEEEEEcccceEEEecCCCCCeEEEEEEEEEeCCeEEEEEEEeCCeEEEEEecCCceeeeccccc Confidence 88876652 467889999999999986532211 11122111 11112334443211 1111111100 00 Q ss_pred ce-ecCcccEEEeecccccCCCCcchhhhHHHHHHHHHH-HHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHH Q lcl|NC_018087. 226 KI-KIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQL-KLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHI 303 (520) Q Consensus 226 ~~-~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL-~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~i 303 (520) .+ .|| |+++. ++....|=++..+....-+ +++-+....-+..+.|-+-+.-. +++... T Consensus 201 ~~g~vP---vv~~~------n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~---~~~~~~-------- 260 (453) T protein:vir:39 201 PFDDLP---VVEFY------FNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGA---AVEEED-------- 260 (453) T ss_pred CCCcee---EEEec------CCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecC---CCCchh-------- Confidence 11 222 22221 2223345454444433333 34555666667777786655432 223211 Q ss_pred HHhhc-ceeEeecCCCccccccccchhhhhhccccc-CCCCCcceeecCCCCCcChHH-HHHHHHHHHHHhcCCChhhcc Q lcl|NC_018087. 304 MNSHR-NRISYDARTGKVKNQANMMALTEDYWLQRR-DGKAVTEVETLPGMTGMNEMD-DILYFRKALYMALRVPLSRIP 380 (520) Q Consensus 304 m~~~k-nklvYd~~TGev~d~~~~msmlEDywLpRR-eGgrgTEIsTLpGg~nLgei~-DV~YF~kkLy~aL~VP~SRl~ 380 (520) +...+ +++. . ++-. +.+.|.++.+|....+.+.+. -++-+.+.+|...++|- +. T Consensus 261 ~~~~~~~~~~-~--------------------~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~ 317 (453) T protein:vir:39 261 LKNIRSNRVI-N--------------------YYGESSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVAN--IS 317 (453) T ss_pred hhhhhhccee-e--------------------ecCCCCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcc--cc Confidence 11111 1111 0 1111 112345677776555655554 35667777888888984 32 Q ss_pred CCCccccccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-ChhhHHhhhhceEEEeeccchHHHHHH Q lcl|NC_018087. 381 DEQTQNVFDMSTAIS--RDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVI-TEDEWEAELNNIKIVFHKNSYFSEMKT 457 (520) Q Consensus 381 ~~~~~~~~G~~~eIt--RDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~-t~eew~~~~~~I~~~f~~Dn~f~ElKe 457 (520) .++ ||.++..+ --+.....-+.+.|+.|..-+...++.=+-+-+.. ...+| ..|.+.|...-.=.+ T Consensus 318 ~~~----~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~----~~i~v~f~~~~p~~~--- 386 (453) T protein:vir:39 318 DES----FGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVSNKEAW----KDIEYTFTRNEPKDI--- 386 (453) T ss_pred ccc----ccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccc----ccceEEeCCCCCcCH--- Confidence 222 24333322 22223345567777777777777766533332322 22233 356788864433333 Q ss_pred HHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCC---------------ccCCcccc Q lcl|NC_018087. 458 IEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKI---------------FNPPEPEE 519 (520) Q Consensus 458 ~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~---------------~~~p~~e~ 519 (520) .+.+++++.+. | .+|.+++++. |...++ .+++-++|++|..+.. -.++++|| T Consensus 387 ----~~~a~~~~kl~---g-~is~et~l~~-l~~v~D-~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~e 453 (453) T protein:vir:39 387 ----KEQAETANILM---G-ITSQETALSV-ISVIPD-VQAEMEKIKKEEASTAIFDKDKQPSEKGTDTVVPETNEE 453 (453) T ss_pred ----HHHHHHHHHHh---c-cCChHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCCCcCCC Confidence 33345555553 3 3799999976 565542 2233334444433221 01122222 No 126 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=95.94 E-value=0.0012 Score=36.53 Aligned_cols=429 Identities=12% Similarity=0.056 Sum_probs=188.2 Q ss_pred cCCCcccCCCCCCCceeecccccccc------cccccccccccccchh-HHHHHHHHHHH-hhccchhHHHHhhhceeeE Q lcl|NC_018087. 29 DKAESITAPKFDDGATEVDSQDIAYN------GVFQKLYGSQDPTATS-TRELINTYRSL-LNNYEVDNAVQEIVSDAIV 100 (520) Q Consensus 29 ~~~~s~~~p~~~dg~~~i~~~~~a~~------g~~~~~~~~~~~~~~~-~~~LI~~YR~m-a~~pEvd~Ai~eIvneaiv 100 (520) -...-+.+...++....+..=-..+. --+..+|-|- ..+.. ....=..+|.+ +.+.=+..+|+..++-.. T Consensus 1 ~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~r~~~~~~Yy~G~-~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~- 78 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIARDEMVSAFEDSTQNLKTNTSYYEAE-RRPEAIGVTVPIQMQSLLAHVGYPRLYVDSIAERQA- 78 (485) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CcchhcCCCCChhhhhhhhhcCcHHHHHHHHHhhhc- Confidence 01111111112222211110000000 0000111110 00000 00000011111 111222334443333221 Q ss_pred ecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecC----CCCCCeeeeEec Q lcl|NC_018087. 101 YEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPN----RPKDGIIELRRL 176 (520) Q Consensus 101 ~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~----~~k~GI~elr~l 176 (520) . +.+++. ++.+.+ +.+..|..--+|+....++++.-.+-|+-|...-.+.. .+.+|-..++.+ T Consensus 79 ~----~g~~~~-~~~~~~--------~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~ 145 (485) T protein:vir:10 79 V----EGFRFG-DADEAD--------EELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRVE 145 (485) T ss_pred c----cceecC-CCchhH--------HHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEEE Confidence 1 011111 111222 33445555667888889999999999999887655532 234566678899 Q ss_pred CccceeeeeeccCC-CCcccccc----cc-eecceeecCccccccc--cccee-c--CCcce-ecCcccEEEeecccccC Q lcl|NC_018087. 177 DPRNVQFVRELDTK-MENGVKVV----KG-YREYFLYDTELESYQC--GHQHF-A--AGTKI-KIPYSAMVYAHSGLVDC 244 (520) Q Consensus 177 DPr~i~~vr~i~~~-~~~~~~~~----~~-~~ey~~y~~~~~~~~~--~~~~~-~--~~~~~-~I~~~aI~y~hSGL~d~ 244 (520) +|+.+..+.+-... ..-+.... .+ ..-..+|.+....+.. ++... . ....+ +|| .|.|++-. ++ T Consensus 146 ~p~~~~~~~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vP--vv~~~n~~--~~ 221 (485) T protein:vir:10 146 PPTRMYAEIDPRIGRVSKAIRVAYDAEGNEIQAATLYTPNDIFGWYRVENEWQEWFNNPHGLGVVP--VVPIPNRT--RL 221 (485) T ss_pred ccceeEEEEcCCCCceeEEEEEEEeeCCCeEEEEEEEeCCeEEEEEEcCCceEEeccccCCCCccc--EEEecccc--cc Confidence 99998877652211 11111111 11 1112344443211111 00000 0 00011 222 25555532 33 Q ss_pred CCCcchhhhHHH----HHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCcc Q lcl|NC_018087. 245 CGKNIIGYLHRA----VKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKV 320 (520) Q Consensus 245 ~~~~~~syL~~a----ik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev 320 (520) .+....|-+.+. +..+| +++-+..++-..+-.|.|-+.=.+..+.+...- + .....++..| T Consensus 222 ~~~~G~s~i~~~v~~liDa~~--~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~---------~--~~~~~~~~~~-- 286 (485) T protein:vir:10 222 SDLYGTSEITPELRSMTDAAA--RILMLMQATAELMGVPQRLIFGIKPEEIGVDPE---------T--GQTLFDAYLA-- 286 (485) T ss_pred CCCCCccchhHHHHHHHHHHH--HHHHHHHHHHHhhcchHHHHhcCCccccccccc---------c--cchhhhhccc-- Confidence 232334444433 23333 356677777777777777655333222221100 0 0000011111 Q ss_pred ccccccchhhhhhcccccCCCCCcceeecCCCCCcC-hHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHH Q lcl|NC_018087. 321 KNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMN-EMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDEL 399 (520) Q Consensus 321 ~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLg-ei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDEl 399 (520) ..|+.- +-+.++-.+++.+ ++ -++-++=.-..++..-++|.+-|...+. +. ..+..|..-+. T Consensus 287 -----------~i~~~~---~~d~k~~q~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~-n~-~Sg~Al~~~~~ 349 (485) T protein:vir:10 287 -----------RILAFE---DAEGKIQQFSAAE-LANFTNALDQIAKQVAAYTGLPPQYLSTAAD-NP-ASAEAIRAAES 349 (485) T ss_pred -----------ceeccC---CCCceEEeecccc-hHHHHHHHHHHHHHHhcccCCCHHHhccccC-ch-hHHHHHHHHHH Confidence 223321 1123455565532 22 1222333334455557778776643322 11 12345666677 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC-hhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchh Q lcl|NC_018087. 400 SFDKFISELQHKFEEIFLSPLKSNLLLKRVIT-EDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKY 478 (520) Q Consensus 400 kF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t-~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky 478 (520) .+..-+.+.|..|..-+...++.-+.+.+... ..+| ..|.+.|.....-+. .+..+++..+..-+-.. T Consensus 350 ~l~~k~~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~----~~i~v~w~~~~~~~~-------~~~ada~~kl~~ag~~~ 418 (485) T protein:vir:10 350 RLIKKVERKNSIFGGAWEEAMRLAYRMMKGGDVPPDM----LRMETVWRDPSTPTY-------AAKADAASKLYNGGTGV 418 (485) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCcccc----eeeeEEecCCCCCCH-------HHHHHHHHHHHhccccC Confidence 78888889999999888888887665555321 1122 358888875544444 33445555554332246 Q ss_pred hhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCC-----ccCC-----ccccC Q lcl|NC_018087. 479 ISNHTAMKDFLQMSDEDIAAERKLIDEELSDKI-----FNPP-----EPEEI 520 (520) Q Consensus 479 ~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~-----~~~p-----~~e~~ 520 (520) +|.+++++ .|.+++++++++++..+++..++. ...| ++.+- T Consensus 419 ~s~et~~~-~lg~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (485) T protein:vir:10 419 IPRERARK-DMGYSIAEREEMRRWDEEEAAMGLGLIGTMVDPNPTVPGSPSP 469 (485) T ss_pred CCHHHHHH-hCCCCHhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCCc Confidence 79999995 599999999988776666544321 1111 11111 No 127 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=95.88 E-value=0.0013 Score=36.38 Aligned_cols=399 Identities=12% Similarity=0.126 Sum_probs=184.8 Q ss_pred Cccccc-cchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHH Q lcl|NC_018087. 1 MSMLAD-SDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYR 79 (520) Q Consensus 1 ~~~~~~-~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR 79 (520) |-|+.+ ..++-|+-|.... +...|. +|+...+.+ ..|-..+.. + |. + T Consensus 8 ~~~~~~~g~~~~~~~~f~~~-------------~~~~~~--~~~~~~~~~---~~~~~~~~~------v-~~-------~ 55 (424) T protein:vir:18 8 IDLRTNNGWWARLKSWFVGG-------------RLVTPN--QGSQTGPVS---AHGYLGDSS------I-ND-------E 55 (424) T ss_pred cccCCCCchHHHHHhhcccc-------------cccccc--chhhccccc---ccccccccc------c-cH-------H Confidence 777766 3333333332211 112221 121111111 011111111 1 11 3 Q ss_pred HHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHh----cchhhhHHHHH----hhcc Q lcl|NC_018087. 80 SLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNML----NFQRKGSDHFK----RWYV 151 (520) Q Consensus 80 ~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll----~f~k~g~~~fR----rWYv 151 (520) ..+++|-|..||+-|.+.+...+ +.|-=... +..+.++. .-..+..+| |-..++.++.+ .+.+ T Consensus 56 ~al~~~~v~~cv~~Ia~~iA~lp-----~~vy~~~~---~~~~~~~~-~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll 126 (424) T protein:vir:18 56 RILQISTVWRCVSLISTLTACLP-----LDVFETDQ---NDNRKKVD-LSNPLARLLRYSPNQYMTAQEFREAMTMQLCF 126 (424) T ss_pred HhhccHHHHHHHHHHHHhhccCc-----eEEEEecc---CCceeeec-cccHHHHHHhhccCCCCCHHHHHHHHHHHHhh Confidence 45778889999999999886432 22110000 00000100 001122333 22345555443 4566 Q ss_pred ccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCc Q lcl|NC_018087. 152 DSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPY 231 (520) Q Consensus 152 Dgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~ 231 (520) .|.-|.-++-+ .+ .-+++|.+|+|..+...+. +... +|.|.. .+..+.+++ T Consensus 127 ~Gnay~~i~r~-~~--G~~~~L~~l~~~~v~v~~~-------~~~~------~y~~~~-------------~g~~~~~~~ 177 (424) T protein:vir:18 127 YGNAYALVDRN-SA--GDVISLLPLQSANMDVKLV-------GKKV------VYRYQR-------------DSEYADFSQ 177 (424) T ss_pred cCCeEEEEEEC-CC--CcEEEEEEecCcceEEEEc-------CCeE------EEEEEe-------------CCeEEEecc Confidence 78888876643 22 2389999999999876321 1111 122211 123468899 Q ss_pred ccEEEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhccee Q lcl|NC_018087. 232 SAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRI 311 (520) Q Consensus 232 ~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knkl 311 (520) +.|+|.. + ...++...+|-+..|+.++.....+++...=+----+--+-+...+-+.+.+..++ -+++.+.++.. T Consensus 178 ~eVihir-~-~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~-~~~~~~~~~~~-- 252 (424) T protein:vir:18 178 KEIFHLK-G-FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRS-QVEENFKEIAG-- 252 (424) T ss_pred ccEEEec-C-cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCcCCCHHHHH-HHHHHHHHHhC-- Confidence 9998884 2 35567677899999999998887777765433222232345666666656655443 34444444322 Q ss_pred EeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccc Q lcl|NC_018087. 312 SYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDM 390 (520) Q Consensus 312 vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~ 390 (520) |. +..+.+ .++ + |.+++.|.=...-.| ++--++....+.++++||.+-|....+.+..| T Consensus 253 ------~~--nag~~~-vl~--------~--g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~- 312 (424) T protein:vir:18 253 ------GP--VKKRLW-ILE--------A--GFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWG- 312 (424) T ss_pred ------Cc--ccCCce-ecc--------C--CceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCccccc- Confidence 21 111122 221 1 456666632211222 33345777889999999999986433222112 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 391 STAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSL 470 (520) Q Consensus 391 ~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~ 470 (520) +.+.-..+.|.+++ |+--... +.+.|- +.++++.++. ...+.|.-+. +...+ ...|.+.+.. T Consensus 313 -sn~eq~~~~f~~~t--l~P~~~~-ie~~ln-----~~L~~~~~~~----~~~~~fd~~~----llr~d-~~~r~~~~~~ 374 (424) T protein:vir:18 313 -SGIEQQNLGFLQYT--LQPYISR-WENSIQ-----RWLIPSKDVG----RLHAEHNLDG----LLRGD-SASRAAFMKA 374 (424) T ss_pred -ccHHHHHHHHHHHH--HHHHHHH-HHHHHH-----hhcCCccccC----CeEEEEechh----hhccC-HHHHHHHHHH Confidence 22333344455442 3222222 233222 3345566553 2344454332 22221 2556666666 Q ss_pred hhcccchhhhHHHHHHHHhCCCHHHHHHHHH--------HHHHhhhcCCccCCccc Q lcl|NC_018087. 471 MEPYIGKYISNHTAMKDFLQMSDEDIAAERK--------LIDEELSDKIFNPPEPE 518 (520) Q Consensus 471 ~~p~vgky~S~~~i~k~IL~~tDeeI~~~~k--------qi~~E~~~~~~~~p~~e 518 (520) +-. .-+++.+-+++ .++|.+-+ .-++ -+..-.+++ -|..+.. T Consensus 375 ~~~--~G~~T~NE~R~-~~gl~pi~--ggD~~~~~~n~~~l~~~~~~~-~~~~n~a 424 (424) T protein:vir:18 375 MGE--SGLRTINEMRR-TDNMPPLP--GGDVAMRQAQYVPITDLGTNK-EPRNNGA 424 (424) T ss_pred HHh--CCCcCHHHHHH-HhCCCCCC--CcCeeeeccCccchhhhhccC-CccccCC Confidence 522 23667777774 36765421 1000 000000000 0011111 No 128 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=95.88 E-value=0.0013 Score=36.38 Aligned_cols=426 Identities=13% Similarity=0.131 Sum_probs=190.3 Q ss_pred CccccccchhhhcchhhhhhhHHH-hhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYD-KIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYR 79 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~-~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR 79 (520) |+|+ +.+-.+| ++--.... +.+++-...+-++-++ ....-|+..+ T Consensus 1 m~~~-~~~k~~~----~k~~~~~~~~~~~~i~~~~~i~~~~-----------------------------~~~~~i~~~~ 46 (522) T protein:vir:47 1 MSLF-QKVKDFF----SRGRYYMQTSNLNSILEHPKIAVTQ-----------------------------EEYDRIKRNL 46 (522) T ss_pred CchH-HHHHHHH----HHHHHHhhcccchhccccCCCCCCH-----------------------------HHHHHHHHHH Confidence 5443 2222222 11111100 0011000011111111 1222233333 Q ss_pred HHhh--ccch--------------------hHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhc Q lcl|NC_018087. 80 SLLN--NYEV--------------------DNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLN 137 (520) Q Consensus 80 ~ma~--~pEv--------------------d~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~ 137 (520) .|.. +|++ ..++.+.++=+ -+ +++++.+++. ...+..+.++.--+ T Consensus 47 ~~y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~~~~A~lv----~~-e~~~i~v~d~--------~~~~~l~~~l~~n~ 113 (522) T protein:vir:47 47 VYYQSKWDDVQYKNTDGDIKSRPMNHLPIARTASKKIASLV----YN-EQATITTKNE--------ILQKFLDDMLTNDR 113 (522) T ss_pred HHhcCCcccccccccCcchhcccceecchHHHHHHHHhhhh----cC-CcceeecCCh--------HHHHHHHHHHhhcc Confidence 3321 1211 11122211111 11 4556666543 44556677777788 Q ss_pred chhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCccccccc------------ceecce Q lcl|NC_018087. 138 FQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVK------------GYREYF 205 (520) Q Consensus 138 f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~------------~~~ey~ 205 (520) |.....+.+-.+..-|-.+|...+|. |-..+.+++|-++.+++--......++-++. -..||- T Consensus 114 f~~~~~~~~e~a~a~G~~a~k~~~d~-----~~~~i~~v~ad~~~P~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~h 188 (522) T protein:vir:47 114 FNKNFERYLESCLALGGLAMRPYIDG-----DKVRVAFIQAPVFFPLESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFH 188 (522) T ss_pred hHHHHHHHHHHhhccCCEEEEEEEcC-----CceEEEEEcCCceEEEEEcCCceEEEEEEEEEEeecccceeEEEEEEEe Confidence 99999999999999999999999983 3356889999988887321111111111000 001221 Q ss_pred eecCc----------cccccccccee------cCCcce---ecC-----cccEEEeec-----ccc--------cCCCCc Q lcl|NC_018087. 206 LYDTE----------LESYQCGHQHF------AAGTKI---KIP-----YSAMVYAHS-----GLV--------DCCGKN 248 (520) Q Consensus 206 ~y~~~----------~~~~~~~~~~~------~~~~~~---~I~-----~~aI~y~hS-----GL~--------d~~~~~ 248 (520) -+... -..|.+....| .-|.+| .+| ++.+++.+- +.+ ++.-+. T Consensus 189 e~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~spl 268 (522) T protein:vir:47 189 EWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLSELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPL 268 (522) T ss_pred eecccccccccccccCCceEEEEEEeecCCCcccCccccccccccccCCCCceEeCCCCcceEEEecCCcccccccCCCc Confidence 11000 00111100000 001111 000 111222110 001 111223 Q ss_pred chhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccch Q lcl|NC_018087. 249 IIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMA 328 (520) Q Consensus 249 ~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~ms 328 (520) .+|-++.|.-.+--|-..=+ .+.|.+|.=++|||. |-.=++ ..-+..+|+...-..+- T Consensus 269 G~S~~~~~~~~id~lD~~~s--~~~~e~~~g~~~i~v-~~~~l~------------------~~~~~~~g~~~~~~~fd- 326 (522) T protein:vir:47 269 GLSIFDNAKTTIDFINRSYD--EFMWEVRMGQRRVIV-PEHLTQ------------------RQYQRPDGTIDFRPRFD- 326 (522) T ss_pred CCchhhhhHHHHHHHHHHHH--HHHHHHHhccceeec-chHHhc------------------cCCCCCCcccccccccC- Confidence 45777777776666653333 456778887888765 111100 01122333211000000 Q ss_pred hhhhhcccccCC-CCCcceeecCCCCCcChHH-HHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHH Q lcl|NC_018087. 329 LTEDYWLQRRDG-KAVTEVETLPGMTGMNEMD-DILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFIS 406 (520) Q Consensus 329 mlEDywLpRReG-grgTEIsTLpGg~nLgei~-DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~ 406 (520) --+..|.+-..+ +-|--|+++...--.++.. =+..+.+.+=...+++-+.+..++++. --++||...+-.-..-+. T Consensus 327 ~~~~~f~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~~--kTAtEi~s~~~~~~~t~~ 404 (522) T protein:vir:47 327 VEQNVYMQIGGSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQGM--KTATEIVSENSDTYQMRS 404 (522) T ss_pred cccceEeecCCCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCcccccc--ccHHHHHHHHHHHHHHHH Confidence 001112211110 0111255555433333222 234555555566777776666554421 235666655555555677 Q ss_pred HHHHHHHHHHHHHHHHHHHh-------cCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhh Q lcl|NC_018087. 407 ELQHKFEEIFLSPLKSNLLL-------KRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYI 479 (520) Q Consensus 407 rLr~rFs~if~d~Lk~QLiL-------kgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~ 479 (520) +.|+.+...+.++++.=|.| .+... .+ ..|.++|. |+.+.. +++++-..+ .+++ . | .+ T Consensus 405 ~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~-~~-----~~i~v~f~-D~i~~D-~~~~~~~~~-~~v~--a---G-~~ 469 (522) T protein:vir:47 405 SIVALVEQSIKELCVSMCELGKAVGVYSGEIP-EL-----DDISVNLD-DGVFTD-RHAELDYWA-KMVA--A---G-FS 469 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhccCCCC-Cc-----ceeEEEcC-CCCCCC-HHHHHHHHH-HHHh--c---C-CC Confidence 77777777777666665433 33222 22 33778887 555544 222222211 1111 1 2 57 Q ss_pred hHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 480 SNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 480 S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) |.++.+++....||+|.+++-++|++|.... .|.+..+ T Consensus 470 s~e~~i~~~~g~~eeea~~el~ri~~E~~~~---~~~~~~~ 507 (522) T protein:vir:47 470 TKKRAIGKTLNISGVEAEKELNAINSELLPM---NDAELAI 507 (522) T ss_pred CHHHHHHhcCCCChHHHHHHHHHHHHhhccC---CCCCCCC Confidence 8888766778999999999999999997653 2222233 No 129 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=95.79 E-value=0.0014 Score=36.15 Aligned_cols=411 Identities=11% Similarity=0.066 Sum_probs=192.4 Q ss_pred Cccccccch----hhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHH Q lcl|NC_018087. 1 MSMLADSDL----KMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELIN 76 (520) Q Consensus 1 ~~~~~~~~l----~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~ 76 (520) -.|-.+.++ +|+..|..+.. .++... . .|.|-....+.+ .. +-+ T Consensus 21 ~~~~~~~~~~l~~~l~~~~~~~~~-rl~~l~-~----------------------YY~G~~~~~~~~--~~------~~~ 68 (501) T protein:vir:25 21 DSMSREQLGALVADMWRLHISERQ-WLDRIY-E----------------------YTKGLRGRPEVP--EG------ASD 68 (501) T ss_pred ccCChHHHHHHHHHHHHHHHHHHH-HHHHHH-H----------------------HHhcCCCchhcc--cc------CCh Confidence 122222222 33333332221 111100 0 000100000000 00 001 Q ss_pred HHHH---HhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhcccc Q lcl|NC_018087. 77 TYRS---LLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDS 153 (520) Q Consensus 77 ~YR~---ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDg 153 (520) .|+. .+-+.=+.-+|+-.++-. +.+ ..+ +.+.+-.+ ....|...-+|+...++.++.-++-| T Consensus 69 ~~~~~~~~~v~n~~~~ivd~~a~~l-~~~----gf~--~~d~~~~~--------~l~~i~~~N~~d~~~~~~~~~a~i~G 133 (501) T protein:vir:25 69 EVKELAKLSVKNVLSLVRDSFAQNL-SVV----GYR--NALAKEND--------PAWEMWQRNRMDARQAEVHRPALTYG 133 (501) T ss_pred hhhhhHhhhhcChHHHHHHHHHhhh-ccc----cee--cCCccchH--------HHHHHHHhcChhHHHHHHHHHHhhcC Confidence 2222 121223334444433321 111 111 11221122 23345556677888899999999999 Q ss_pred ceeEEEeeecCCCCCCeeeeEecCccceeeee-eccCC--CCcccccc------cceecceeecCccccc---------- Q lcl|NC_018087. 154 RVFFHKIINPNRPKDGIIELRRLDPRNVQFVR-ELDTK--MENGVKVV------KGYREYFLYDTELESY---------- 214 (520) Q Consensus 154 ri~~hkvid~~~~k~GI~elr~lDPr~i~~vr-~i~~~--~~~~~~~~------~~~~ey~~y~~~~~~~---------- 214 (520) |-|.+.-.| +. | ..++-++|+.+-.|- +-... ..-.++.. ......-+|++..... T Consensus 134 ~ay~~v~~d-e~---~-~~i~~~sp~~~~~iy~D~~~~~~~~~ai~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~ 208 (501) T protein:vir:25 134 ASYVTVTPT-DE---G-PVFRTRSPRQILAVYADPSVDAWPQYALETWVAQKDAKPHRRGVLYDDTYMYELDLGEVVLGD 208 (501) T ss_pred ceEEEEecC-CC---C-CeEEEeccccEEEEEecCCCCcceeEEEEEEeeccccCcceeEEEecCeeEEEEecCceeeee Confidence 988766554 22 3 357778998876542 11111 11111110 0001111222221100 Q ss_pred -ccccceecCCcce-ecC----------c---ccEEEeecccccCCCCcchhhhHHHH---HHHHHHHHHHHHHHHHHHh Q lcl|NC_018087. 215 -QCGHQHFAAGTKI-KIP----------Y---SAMVYAHSGLVDCCGKNIIGYLHRAV---KPANQLKLLEDAMMIYRIT 276 (520) Q Consensus 215 -~~~~~~~~~~~~~-~I~----------~---~aI~y~hSGL~d~~~~~~~syL~~ai---k~~NqL~m~EDalVIyRi~ 276 (520) ..+.....+.+.. ..+ - .-|.|+..-...+ .+.|-++..+ ..+| +++-++++.-... T Consensus 209 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~~~~~---~g~sdie~v~~l~Da~~--~~~s~~~~~~e~~ 283 (501) T protein:vir:25 209 AGGGQATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGRDADD---MIVGEVAPLILLQQAIN--SVNFDRLIVSRFG 283 (501) T ss_pred ccccccccccccccccccccccccccCCccceeeEeccCccccCc---cccchhhhhHHHHHHHH--HHHHHHHHHHHhh Confidence 0000000000000 000 0 1122333221111 2345444333 3333 3555777777878 Q ss_pred cCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcC Q lcl|NC_018087. 277 RAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMN 356 (520) Q Consensus 277 RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLg 356 (520) -.|.|-+.=.+....+..++ +. | ..|+. +| -+++|.++|+.+--+ T Consensus 284 a~p~~~i~G~~~~~~~~~~~----------~~---------~-------------~i~~~--~~-~~~~~~q~~~~~~~~ 328 (501) T protein:vir:25 284 ANPQRVISGWTGSKAEVLKA----------SA---------L-------------RVWTF--ED-PEVKAQAFPPASVEP 328 (501) T ss_pred ccHHHHHhCCCCCccchhhh----------cc---------c-------------ceecc--CC-CCceEEEecccChHH Confidence 88887776444333321110 11 1 12332 12 235677888754323 Q ss_pred hHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHH Q lcl|NC_018087. 357 EMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWE 436 (520) Q Consensus 357 ei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~ 436 (520) -++-++-.-..+...-++|..-+...++ | -.+..|.--+....+-+.+.|+.|..-+..+++.-+.++|.....+| T Consensus 329 ~~~~l~~~i~~i~~~s~~P~~~~~~~~~-N--~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~~~~~~~~- 404 (501) T protein:vir:25 329 YNLILEEMLQHVAMVAQISPAQVTGKMI-N--VSAEALAAAEANQQRKLAAKRESFGESWEQLLRLAAEMDDDPDTAAD- 404 (501) T ss_pred HHHHHHHHHHHHHhhcCCChhhhccccC-C--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccc- Confidence 3344555555666677888765553221 1 12334444555577888999999999999999998888886543333 Q ss_pred hhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCc Q lcl|NC_018087. 437 AELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPE 516 (520) Q Consensus 437 ~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~ 516 (520) ..|.+.|..-..=+. .+..+++..+..- | +|.+++...++.+|++||+++.++.+++..+++...+. T Consensus 405 ---~~i~v~w~~~~~~s~-------~~~ada~~kl~~~-g--is~et~~~~~~g~~~~~ie~~~~~~~e~~~~~~~~~~~ 471 (501) T protein:vir:25 405 ---SGAEVLWRDTEARSF-------GAVVDGITKLASA-G--IPIEHLLSMVPGMTQQTIQAIKDSLRGGEVKSLVDKLL 471 (501) T ss_pred ---eeeeEEecCCCCCCH-------HHHHHHHHHHHhc-C--CCHHHHHHHcCCCCHHHHHHHHHHHHHHhHHHHHHHhh Confidence 247777754332222 4566666666543 3 69999999999999999998888777776654432221 Q ss_pred c-----------ccC Q lcl|NC_018087. 517 P-----------EEI 520 (520) Q Consensus 517 ~-----------e~~ 520 (520) . ++- T Consensus 472 ~~~~~~~~~~~~~~~ 486 (501) T protein:vir:25 472 SNEPAPVPPPPPQAA 486 (501) T ss_pred ccCcCCCCCCCCCCC Confidence 1 111 No 130 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=95.77 E-value=0.0015 Score=36.08 Aligned_cols=395 Identities=12% Similarity=0.108 Sum_probs=166.6 Q ss_pred eeccc---ccccccccccccccccccchhH-----------HHHHHHHHHHhhccchhHH-------------------- Q lcl|NC_018087. 45 EVDSQ---DIAYNGVFQKLYGSQDPTATST-----------RELINTYRSLLNNYEVDNA-------------------- 90 (520) Q Consensus 45 ~i~~~---~~a~~g~~~~~~~~~~~~~~~~-----------~~LI~~YR~ma~~pEvd~A-------------------- 90 (520) |++.+ +.+++.- +.-.++.+.... .+-+.+|+.+..+.+-... T Consensus 1 ~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~ 77 (478) T protein:vir:10 1 MISINWPWDKPYHEQ---VVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKPD 77 (478) T ss_pred CccccccCCchhhhH---HHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccchhhhccccccccccc Confidence 33322 0011100 011111111111 1122345555444333221 Q ss_pred -------HHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeec Q lcl|NC_018087. 91 -------VQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINP 163 (520) Q Consensus 91 -------i~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~ 163 (520) ...||+-.+-+= -..||++..++.+..+ .+.+-|+ =+|+..-.++.+.+.+-|+-|.+.-+|. T Consensus 78 ~ki~~n~~k~ivd~~~~yl-~g~p~~~~~~~~~~~~----~l~~~~~-----n~~~~~~~~~~~~~~~~G~~~~~v~~d~ 147 (478) T protein:vir:10 78 WRMYTNYHQNLVDQKVAYA-VANPVTFGVDNDKALK----QIQHTLN-----HKWDDKLVDILTAASNKGIEWVQPYVDE 147 (478) T ss_pred ceeccchHHHHHHHHhhhh-cccCceeecCChHHHH----HHHHHHh-----ccHHHHHHHHHHHHhhCCeEEEEEEecC Confidence 112222211111 1266677666654333 3333332 2677888888999999999998877763 Q ss_pred CCCCCCeeeeEecCccceeeeeecc--CCCCccccc--ccceecceeecCc-ccccccccceec---------CCcce-- Q lcl|NC_018087. 164 NRPKDGIIELRRLDPRNVQFVRELD--TKMENGVKV--VKGYREYFLYDTE-LESYQCGHQHFA---------AGTKI-- 227 (520) Q Consensus 164 ~~~k~GI~elr~lDPr~i~~vr~i~--~~~~~~~~~--~~~~~ey~~y~~~-~~~~~~~~~~~~---------~~~~~-- 227 (520) +|-..+..+||+.+.++.+-. .+..-.++. ..+...+-+|.+. ...|........ ....+ T Consensus 148 ----~~~~~~~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (478) T protein:vir:10 148 ----EGEFKTFRVPAEQAVPIWTNKERDELQAFIRVYELDGAERVEYWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQ 223 (478) T ss_pred ----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEeeeCceEEEEEeCCcEEEEEecCCeeeccccccccccccceec Confidence 356789999999998875421 111111111 1111122222221 111111000000 00000 Q ss_pred --------ecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHH Q lcl|NC_018087. 228 --------KIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLK-LLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQ 298 (520) Q Consensus 228 --------~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~-m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeq 298 (520) +|| |+++ +|+....|-|+..+....-+. ++-+....-+-++.|-+-+.-.+..+... . T Consensus 224 ~~~~~~~g~vP---vv~~------~n~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~~~~~~--~-- 290 (478) T protein:vir:10 224 GNKLMSWGRVP---FIPF------KNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKD--F-- 290 (478) T ss_pred ccccccCCcce---EEEe------ccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCcccccc--h-- Confidence 111 1111 122334566666555555444 33444444466666654443222211110 0 Q ss_pred HHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHH-HHHHHHHHHHhcCCChh Q lcl|NC_018087. 299 HMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDD-ILYFRKALYMALRVPLS 377 (520) Q Consensus 299 yl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~kkLy~aL~VP~S 377 (520) .. -|..++ =+|++.-+ |.+++.|-...+...+.. +.-+.+.+|...++|- T Consensus 291 -~~-~~~~~~-----------------------~~~~~~~~---~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~- 341 (478) T protein:vir:10 291 -MH-NLKYYK-----------------------AISVAGES---GSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVD- 341 (478) T ss_pred -hh-hhhhCc-----------------------eeEecCCC---CCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCcC- Confidence 00 011111 11233222 234666655555554433 6677888999999994 Q ss_pred hccCCCccccccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHH Q lcl|NC_018087. 378 RIPDEQTQNVFDMSTAIS--RDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEM 455 (520) Q Consensus 378 Rl~~~~~~~~~G~~~eIt--RDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~El 455 (520) +.+++ +. |..+..+ .-..-...-+.+.+..|...+..+|+.=+-+.|. ..+|. .|.+.|...---.+. T Consensus 342 -~~~~~-~~--~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~--~~d~~----~i~i~f~~~~p~~~~ 411 (478) T protein:vir:10 342 -FQQDK-FG--NSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRL--DVRVQ----DIEITFNFNVMVNEL 411 (478) T ss_pred -cCccc-cc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Ccccc----cceEEeCCCCCCCHH Confidence 22221 11 2222222 2222233335555566666666655543333343 33343 467777544433333 Q ss_pred HHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCH---HHHHHHHHHHHHhhhc-C-CccCCccccC Q lcl|NC_018087. 456 KTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSD---EDIAAERKLIDEELSD-K-IFNPPEPEEI 520 (520) Q Consensus 456 Ke~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tD---eeI~~~~kqi~~E~~~-~-~~~~p~~e~~ 520 (520) .. +++++.+.+ .+|.+++++. |...+ +|++.++++-++..++ + ..+..++++- T Consensus 412 e~-------~~~~~~~~g----~iS~et~i~~-~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~ 469 (478) T protein:vir:10 412 EN-------SQIAMNSTG----LLSKETILGN-HSWVQDPVAEMERIEQENIELNQQLPDIEEGLNDEQQ 469 (478) T ss_pred HH-------HHHHHHHhC----CCChHHHHHh-CCCCCCHHHHHHHHHHHHHHHHHhccccCCCCccccc Confidence 33 334444433 3799999976 45432 3333333332222221 1 1111111111 No 131 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=95.76 E-value=0.0015 Score=36.07 Aligned_cols=415 Identities=13% Similarity=0.089 Sum_probs=181.2 Q ss_pred ccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccc-------------------------hh Q lcl|NC_018087. 34 ITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYE-------------------------VD 88 (520) Q Consensus 34 ~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pE-------------------------vd 88 (520) .++|=. |.+.......+... +.... ..-..+|+.+..+++ +. T Consensus 1 ~~~~i~--~~~~~~~~~~~~~~--------L~~~~---~~~~~r~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~ 67 (485) T protein:vir:24 1 MTAPLP--GQEEIADPAIARDE--------MVSAF---EDQNQNLRSNTSYYEAERRPEAIGVTVPVQMQSLLAHVGYPR 67 (485) T ss_pred CCCCCC--CCCcccchHHHHHH--------HHHHH---HHHHHHHHHHHHHHhccCchhhcCcccchhhhhhhhccchHH Confidence 222211 11111100000000 00000 000112222222222 12 Q ss_pred HHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCC--- Q lcl|NC_018087. 89 NAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNR--- 165 (520) Q Consensus 89 ~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~--- 165 (520) -+|+..+.-. + .+.+++. ++.+. .+.++.|.+-=+|+....++++.-.+.||-|.+.-.|.+. T Consensus 68 ~ivd~~~~~l-~----~~g~~~~-~~~~~--------~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~ 133 (485) T protein:vir:24 68 LYVDSIAERQ-A----VEGFRLG-DADEA--------DEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDL 133 (485) T ss_pred HHHHHHhhhh-c----cCceecC-CCchh--------HHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCccccc Confidence 2222222111 0 0112211 11111 2234455555577888899999999999998876655332 Q ss_pred -CCCCeeeeEecCccceeeeeeccCC-CCcccccc-----cceecceeecCcccccc-c-ccceecCC---cce-ecCcc Q lcl|NC_018087. 166 -PKDGIIELRRLDPRNVQFVRELDTK-MENGVKVV-----KGYREYFLYDTELESYQ-C-GHQHFAAG---TKI-KIPYS 232 (520) Q Consensus 166 -~k~GI~elr~lDPr~i~~vr~i~~~-~~~~~~~~-----~~~~ey~~y~~~~~~~~-~-~~~~~~~~---~~~-~I~~~ 232 (520) ..+|-..++.++|+.+-.+.+-... ..-..... +.+..+-+|.+....+. . ++...... ..+ .+| T Consensus 134 ~~~~~~~~i~~~~p~~~~~i~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vP-- 211 (485) T protein:vir:24 134 GWDPNVPLIRVEPPTRMYAEIDPRIGRPAKAIRVAYDAEGNEIQAATLYTPNETFGWFRAEGEWVEWFSDPHGLGAVP-- 211 (485) T ss_pred ccCCCcceEEEeccceeEEEeeCCcCceeEEEEEEEeecCCeEEEEEEEcCCcEEEEEecCCceEeecccccCCCccc-- Confidence 2345567889999988777552211 11111111 11122233443321111 0 11000000 000 111 Q ss_pred cEEEeecccccCCCCcchhhhHHHHHHH-HH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcce Q lcl|NC_018087. 233 AMVYAHSGLVDCCGKNIIGYLHRAVKPA-NQ-LKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNR 310 (520) Q Consensus 233 aI~y~hSGL~d~~~~~~~syL~~aik~~-Nq-L~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knk 310 (520) .|.|.+.. ++.+..+.|-|.+.++++ .. -+++-+..++-...-.|.|-+.=.+....+...- +.. T Consensus 212 vv~f~n~~--~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~-----------~~~ 278 (485) T protein:vir:24 212 VVPLPNRT--RLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPE-----------TGQ 278 (485) T ss_pred EEEeccCc--ccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCccccccccc-----------ccc Confidence 13344322 222223345455444433 22 2556677777777777777554222111110000 000 Q ss_pred eEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccc Q lcl|NC_018087. 311 ISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDM 390 (520) Q Consensus 311 lvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~ 390 (520) .+.++..| ..|+.--+ +.++..+++.+-=.-++-++=.-..+...-++|..-|...+..+ .. T Consensus 279 ~~~~~~~~-------------~i~~~~~~---~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~--~S 340 (485) T protein:vir:24 279 TLFDAYLA-------------RILAFEDA---EGKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNP--AS 340 (485) T ss_pred chhhhccc-------------ceeccCCC---CceEEeecccchHHHHHHHHHHHHHHhcccCCCHHHhccccCcc--hH Confidence 11112222 23443222 23455666532111222233333334444577776554332111 12 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-ChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 391 STAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVI-TEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLS 469 (520) Q Consensus 391 ~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~-t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~ 469 (520) +..|.--+...-.-+.+.|..|..-+...++.=+.+.+-. ...+ ...|.+.|.....=+. .+.++.+. T Consensus 341 g~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~d----~~~i~v~f~~~~~~s~-------~~~ad~~~ 409 (485) T protein:vir:24 341 AEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLMKGGDVPPD----MLRMETVWRDPSTPTY-------AAKADAAT 409 (485) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCccc----cceeeEEecCCCCCCH-------HHHHHHHH Confidence 2344445666777888999999998888888755554422 2222 2467888865443333 34455555 Q ss_pred HhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCC---------cc-CCccc---cC Q lcl|NC_018087. 470 LMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKI---------FN-PPEPE---EI 520 (520) Q Consensus 470 ~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~---------~~-~p~~e---~~ 520 (520) .+..-.-..+|.+++++. |.++++++++++++.++|..++. -+ +|..+ |- T Consensus 410 kl~~~g~~~~s~et~~~~-l~~~~d~~~e~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 472 (485) T protein:vir:24 410 KLYGNGQGVIPRERARKD-MGYSIAEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPNPTPA 472 (485) T ss_pred HHHhcccccCCHHHHHhh-CCCCHhHHHHHHHHHHHHhhhhhhHHHhhcccCCCCCCCCCCCCC Confidence 554332246799999954 99999999987776666543321 01 11111 11 No 132 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=95.74 E-value=0.0015 Score=36.02 Aligned_cols=416 Identities=13% Similarity=0.052 Sum_probs=182.0 Q ss_pred HhhhccCCCcccCCCC--CCCceeecccccccccccccccccccccchhHH--HHHHHHHHHhhccchhHHH-------- Q lcl|NC_018087. 24 DKIINDKAESITAPKF--DDGATEVDSQDIAYNGVFQKLYGSQDPTATSTR--ELINTYRSLLNNYEVDNAV-------- 91 (520) Q Consensus 24 ~~~~~~~~~s~~~p~~--~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~--~LI~~YR~ma~~pEvd~Ai-------- 91 (520) -+.|+.+..+..+++- -+...++. ...-.++-+.. ....+|+.+..+++-+..| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~ 66 (470) T protein:vir:99 1 MKDINYGRDKVTGNSSFIFPKGEKLT--------------SNELLGFIAYNETVLKPRYRENMKLYLGKHKILTAPEKET 66 (470) T ss_pred CccccCCcccccCCceEEeCCCCCcC--------------HHHHHHHHHHHHHhhHHHHHHHHHHhccccccccCccccc Confidence 1122222211111110 00000000 00000010110 1122355554444433221 Q ss_pred -----------HhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEe Q lcl|NC_018087. 92 -----------QEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKI 160 (520) Q Consensus 92 -----------~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkv 160 (520) ..||+..+-+= -..||++...+. .+-.+.+..+..--+|+....++++...+-|+.|.+.- T Consensus 67 ~~~~ki~~n~~~~Ivd~~~~~l-~g~p~~~~~~~d-------~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~ 138 (470) T protein:vir:99 67 GADNRIVVNSAKYVVDVYNGYF-CGIEPKLALLND-------SSKIDEIARWNRQENFFDTINEISKQCDIFGRSIASIY 138 (470) T ss_pred CCcceeecchHHHHHHHHhhhh-ccCCeeEeeCCc-------hhHHHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEE Confidence 12222211111 124555544332 11223445556666889999999999999999888776 Q ss_pred eecCCCCCCeeeeEecCccceeeeeeccCCC--Ccccccc----cc-ee-cceeecCcccccccccce--------ecCC Q lcl|NC_018087. 161 INPNRPKDGIIELRRLDPRNVQFVRELDTKM--ENGVKVV----KG-YR-EYFLYDTELESYQCGHQH--------FAAG 224 (520) Q Consensus 161 id~~~~k~GI~elr~lDPr~i~~vr~i~~~~--~~~~~~~----~~-~~-ey~~y~~~~~~~~~~~~~--------~~~~ 224 (520) +|. +|-..+..+||+.+.++.+-.... .-.++.+ .+ .. -+.+|.+.......+... ..++ T Consensus 139 ~d~----dg~~~i~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (470) T protein:vir:99 139 QGE----DARPHLMYSSPNHAFIIYDDTVQRQPLAFVHYQIDNSNNWTDAYGVIQYADKFYKFKGYDIEEDTNAAGYAIN 214 (470) T ss_pred eCC----CCeEEEEEEccceeEEEEcCCCCcceEEEEEEEEEecCCeeEEEEEEEecCeEEEEEeccccccccccccccc Confidence 652 467789999999998875432211 1111111 00 01 122333321111000000 0000 Q ss_pred cce-ecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHH Q lcl|NC_018087. 225 TKI-KIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLK-LLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQH 302 (520) Q Consensus 225 ~~~-~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~-m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~ 302 (520) ++ +|| |+++ .|+....|=++..+....-+. ++=+....-+..+.|.+-+.- ..++..+.-+-+.. T Consensus 215 -~~g~vP---vv~~------~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g---~~~~~~~~g~~~~~ 281 (470) T protein:vir:99 215 -PYGLVP---AVEF------FENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIG---FKLPEDDEGNPKFD 281 (470) T ss_pred -CCCccc---eEee------cCCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeec---CCcccccccchhhh Confidence 11 222 2221 122223454454444433333 455555566677777665532 22221111011111 Q ss_pred HHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHH-HHHHHHHHHHhcCCChhhccC Q lcl|NC_018087. 303 IMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDD-ILYFRKALYMALRVPLSRIPD 381 (520) Q Consensus 303 im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~kkLy~aL~VP~SRl~~ 381 (520) + -.+++. -+|=.+++.|..+.+|....+...... +.-+.+.+|...++|- +.. T Consensus 282 ~---~~~~~~---------------------~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~--~~~ 335 (470) T protein:vir:99 282 F---KNNRVL---------------------YVSQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPN--IQD 335 (470) T ss_pred h---hhccee---------------------eecCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCcc--ccc Confidence 0 011111 122233344556888877667666654 7888899999999994 222 Q ss_pred CC-ccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHH Q lcl|NC_018087. 382 EQ-TQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEI 460 (520) Q Consensus 382 ~~-~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei 460 (520) ++ +++. .+..|...+.....-+.+.+..|...+...++.=+.+-+.....+++ ...|.+.|...-.-.+.. T Consensus 336 ~~~~~n~--Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~--~~~i~v~f~~~~p~~~~e---- 407 (470) T protein:vir:99 336 KNFAGNS--SGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNNKQDQEL--WSELDFKFTRNLPEDMAS---- 407 (470) T ss_pred cccccCc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccc--cccceEEeCCCCCcCHHH---- Confidence 21 1111 22333333344455566777777777777666533333333322222 235888886554444433 Q ss_pred HHHHHHHHHHhhcccchhhhHHHHHHHHhCCCH--HHHHHHHHHHHHh----hhcC-----CccCCccccC Q lcl|NC_018087. 461 TERRVNVLSLMEPYIGKYISNHTAMKDFLQMSD--EDIAAERKLIDEE----LSDK-----IFNPPEPEEI 520 (520) Q Consensus 461 ~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tD--eeI~~~~kqi~~E----~~~~-----~~~~p~~e~~ 520 (520) .+++++.+. | .+|.++++.. |...| +|++.+.++-++. .+.. --.+|++||= T Consensus 408 ---~a~~~~kl~---g-iis~et~l~~-l~~vd~~~E~eri~~E~~~~~~~~~~~~~~~d~~~~d~~~ee~ 470 (470) T protein:vir:99 408 ---AIDNAKNAE---G-IVSKKTQLGM-IPDIEPDAEMKQIAKEKADAIKQTQQLSMPIDILKRDNNAEEE 470 (470) T ss_pred ---HHHHHHHHh---c-cCCHHHHHHh-CCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCCcCCCCCCccCC Confidence 334444443 4 3799999987 55554 4454444332221 1111 0011222111 No 133 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=95.71 E-value=0.0016 Score=35.93 Aligned_cols=395 Identities=13% Similarity=0.127 Sum_probs=167.0 Q ss_pred hhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHH-----------HHHHHHHhhccchhHH---- Q lcl|NC_018087. 26 IINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTREL-----------INTYRSLLNNYEVDNA---- 90 (520) Q Consensus 26 ~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~L-----------I~~YR~ma~~pEvd~A---- 90 (520) -+|.=..-.+.|.+.- ..-.+......+.++ +.+|+.+..+++-+.. T Consensus 1 ~~~~~~~~~~~~~~~~------------------~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~ 62 (474) T protein:vir:97 1 MFNIIRMPWDKPYGEE------------------VVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQ 62 (474) T ss_pred CcccccccCCCchhhH------------------HHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcc Confidence 1111111111111100 011112221122222 2244444444333221 Q ss_pred -----------------------HHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHH Q lcl|NC_018087. 91 -----------------------VQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFK 147 (520) Q Consensus 91 -----------------------i~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fR 147 (520) ...||+-.+-+= -..||++..++.+..+ .+. .++. -+|+....++.+ T Consensus 63 ~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l-~g~p~~~~~~d~~~~~----~l~----~~~~-n~~~~~~~e~~~ 132 (474) T protein:vir:97 63 MKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYV-ASKPVTYSCEDENVLK----VIH----DVLD-TRWDNKLIDILT 132 (474) T ss_pred cchhccccccccccCcceeecchHHHHHHHHHhhh-hcCCceeccCcHHHHH----HHH----HHHh-ccHHHHHHHHHH Confidence 122222211111 2366677666544333 222 2222 367888889999 Q ss_pred hhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeecc--CCCCcccccc--cceecceeecCcc-cccccccceec Q lcl|NC_018087. 148 RWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELD--TKMENGVKVV--KGYREYFLYDTEL-ESYQCGHQHFA 222 (520) Q Consensus 148 rWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~--~~~~~~~~~~--~~~~ey~~y~~~~-~~~~~~~~~~~ 222 (520) .+.+-|+-|.+.-+| + +|-..+..+||+.+.++.+-. .+..-.++.+ .+...+-+|++.. ..|...+.... T Consensus 133 ~~~~~G~~~~~~~~d-~---~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~ 208 (474) T protein:vir:97 133 ATSNKGIDWLQVYIN-E---NGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLI 208 (474) T ss_pred HHhhcCceEEEEEec-C---CCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccc Confidence 999999998886655 3 367889999999999875421 1111222211 1222233444331 11211111111 Q ss_pred CC---------------cceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEc Q lcl|NC_018087. 223 AG---------------TKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLK-LLEDAMMIYRITRAPDRRVFYI 286 (520) Q Consensus 223 ~~---------------~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~-m~EDalVIyRi~RApeRRvFyI 286 (520) .. .-=+|| |+++ +++....|=++..+.....+. ++-+....-+.++.|-+-+.-. T Consensus 209 ~~~~~~~~~~~~~~~~~~~g~vP---vv~~------~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~ 279 (474) T protein:vir:97 209 PDYYYGANHVQSHFSNGNWGRVP---FIAF------KNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGY 279 (474) T ss_pred cccccCcCcccccccccCCCccc---eEEe------cCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecC Confidence 00 000122 2222 122234565666555555554 3344444445556665433221 Q ss_pred cCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHH-HHHHHH Q lcl|NC_018087. 287 DTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMD-DILYFR 365 (520) Q Consensus 287 DvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~-DV~YF~ 365 (520) +.-. ...++..+..+++-.=+++. +++.|-...+.+... -+.-+. T Consensus 280 ~~~~--------------------------------~~~~~~~~~~~~~i~~~~~~--~~~~l~~~~~~~~~~~~~~~l~ 325 (474) T protein:vir:97 280 EGED--------------------------------LEEFMRGLKYYKAINVDGDG--GVETIQVEVPVSSTKEYIDLMR 325 (474) T ss_pred Cccc--------------------------------chhhhhhhhccceeeccCCC--ceeEEeecCCHHHHHHHHHHHH Confidence 1100 01111111222221122332 344444444444433 446667 Q ss_pred HHHHHhcCCChhhccCCCccccccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceE Q lcl|NC_018087. 366 KALYMALRVPLSRIPDEQTQNVFDMSTA--ISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIK 443 (520) Q Consensus 366 kkLy~aL~VP~SRl~~~~~~~~~G~~~e--ItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~ 443 (520) +.+|+...+|- +.+++.. |..+. |..-......-+.+.+..|...+..+++.=+-+-|+ ..+|. .|. T Consensus 326 ~~I~~~s~~p~--~~~~~~~---~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~--~~d~~----~i~ 394 (474) T protein:vir:97 326 VYIMEFGQGVD--FQTDKFG---SAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNL--KTDVK----DIE 394 (474) T ss_pred HHHHHHhCccc--cCccccc---cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Ccccc----eee Confidence 77888899984 2222211 22222 222223344445666666666666666543333343 23443 467 Q ss_pred EEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcC-----CccCCcc- Q lcl|NC_018087. 444 IVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDK-----IFNPPEP- 517 (520) Q Consensus 444 ~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~-----~~~~p~~- 517 (520) +.|....--.+...+ +++.++ + .+|.+++++. |...++ .+++.++|++|..+. .+.+..+ T Consensus 395 v~f~~~~p~~~~e~a-------~~~~~~-g----~iS~et~l~~-l~~v~D-~~~E~eri~~E~~~~~~~~~~~~~~~~~ 460 (474) T protein:vir:97 395 ISFNFNRMMNDAEQS-------QIIAQS-Q----YLSRETLVKS-SPLVDD-YKAELERIEQEQMEYNKQLPNLDDGGAD 460 (474) T ss_pred EEeccCcccCHHHHH-------HHHHHc-C----CCCHHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHhhccccCCCCCC Confidence 778544433333222 334443 2 4799999977 444331 223333444333221 1111100 Q ss_pred -----ccC Q lcl|NC_018087. 518 -----EEI 520 (520) Q Consensus 518 -----e~~ 520 (520) |+= T Consensus 461 ~~~~~~~~ 468 (474) T protein:vir:97 461 GAQQQEGS 468 (474) T ss_pred CcccCCCC Confidence 000 No 134 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=95.71 E-value=0.0016 Score=35.93 Aligned_cols=395 Identities=13% Similarity=0.127 Sum_probs=167.0 Q ss_pred hhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHH-----------HHHHHHHhhccchhHH---- Q lcl|NC_018087. 26 IINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTREL-----------INTYRSLLNNYEVDNA---- 90 (520) Q Consensus 26 ~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~L-----------I~~YR~ma~~pEvd~A---- 90 (520) -+|.=..-.+.|.+.- ..-.+......+.++ +.+|+.+..+++-+.. T Consensus 1 ~~~~~~~~~~~~~~~~------------------~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~ 62 (474) T protein:vir:94 1 MFNIIRMPWDKPYGEE------------------VVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQ 62 (474) T ss_pred CcccccccCCCchhhH------------------HHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcc Confidence 1111111111111100 011112221122222 2244444444333221 Q ss_pred -----------------------HHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHH Q lcl|NC_018087. 91 -----------------------VQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFK 147 (520) Q Consensus 91 -----------------------i~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fR 147 (520) ...||+-.+-+= -..||++..++.+..+ .+. .++. -+|+....++.+ T Consensus 63 ~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l-~g~p~~~~~~d~~~~~----~l~----~~~~-n~~~~~~~e~~~ 132 (474) T protein:vir:94 63 MKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYV-ASKPVTYSCEDENVLK----VIH----DVLD-TRWDNKLIDILT 132 (474) T ss_pred cchhccccccccccCcceeecchHHHHHHHHHhhh-hcCCceeccCcHHHHH----HHH----HHHh-ccHHHHHHHHHH Confidence 122222211111 2366677666544333 222 2222 367888889999 Q ss_pred hhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeecc--CCCCcccccc--cceecceeecCcc-cccccccceec Q lcl|NC_018087. 148 RWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELD--TKMENGVKVV--KGYREYFLYDTEL-ESYQCGHQHFA 222 (520) Q Consensus 148 rWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~--~~~~~~~~~~--~~~~ey~~y~~~~-~~~~~~~~~~~ 222 (520) .+.+-|+-|.+.-+| + +|-..+..+||+.+.++.+-. .+..-.++.+ .+...+-+|++.. ..|...+.... T Consensus 133 ~~~~~G~~~~~~~~d-~---~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~ 208 (474) T protein:vir:94 133 ATSNKGIDWLQVYIN-E---NGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLI 208 (474) T ss_pred HHhhcCceEEEEEec-C---CCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccc Confidence 999999998886655 3 367889999999999875421 1111222211 1222233444331 11211111111 Q ss_pred CC---------------cceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEc Q lcl|NC_018087. 223 AG---------------TKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLK-LLEDAMMIYRITRAPDRRVFYI 286 (520) Q Consensus 223 ~~---------------~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~-m~EDalVIyRi~RApeRRvFyI 286 (520) .. .-=+|| |+++ +++....|=++..+.....+. ++-+....-+.++.|-+-+.-. T Consensus 209 ~~~~~~~~~~~~~~~~~~~g~vP---vv~~------~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~ 279 (474) T protein:vir:94 209 PDYYYGANHVQSHFSNGNWGRVP---FIAF------KNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGY 279 (474) T ss_pred cccccCcCcccccccccCCCccc---eEEe------cCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecC Confidence 00 000122 2222 122234565666555555554 3344444445556665433221 Q ss_pred cCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHH-HHHHHH Q lcl|NC_018087. 287 DTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMD-DILYFR 365 (520) Q Consensus 287 DvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~-DV~YF~ 365 (520) +.-. ...++..+..+++-.=+++. +++.|-...+.+... -+.-+. T Consensus 280 ~~~~--------------------------------~~~~~~~~~~~~~i~~~~~~--~~~~l~~~~~~~~~~~~~~~l~ 325 (474) T protein:vir:94 280 EGED--------------------------------LEEFMRGLKYYKAINVDGDG--GVETIQVEVPVSSTKEYIDLMR 325 (474) T ss_pred Cccc--------------------------------chhhhhhhhccceeeccCCC--ceeEEeecCCHHHHHHHHHHHH Confidence 1100 01111111222221122332 344444444444433 446667 Q ss_pred HHHHHhcCCChhhccCCCccccccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceE Q lcl|NC_018087. 366 KALYMALRVPLSRIPDEQTQNVFDMSTA--ISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIK 443 (520) Q Consensus 366 kkLy~aL~VP~SRl~~~~~~~~~G~~~e--ItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~ 443 (520) +.+|+...+|- +.+++.. |..+. |..-......-+.+.+..|...+..+++.=+-+-|+ ..+|. .|. T Consensus 326 ~~I~~~s~~p~--~~~~~~~---~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~--~~d~~----~i~ 394 (474) T protein:vir:94 326 VYIMEFGQGVD--FQTDKFG---SAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNL--KTDVK----DIE 394 (474) T ss_pred HHHHHHhCccc--cCccccc---cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Ccccc----eee Confidence 77888899984 2222211 22222 222223344445666666666666666543333343 23443 467 Q ss_pred EEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcC-----CccCCcc- Q lcl|NC_018087. 444 IVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDK-----IFNPPEP- 517 (520) Q Consensus 444 ~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~-----~~~~p~~- 517 (520) +.|....--.+...+ +++.++ + .+|.+++++. |...++ .+++.++|++|..+. .+.+..+ T Consensus 395 v~f~~~~p~~~~e~a-------~~~~~~-g----~iS~et~l~~-l~~v~D-~~~E~eri~~E~~~~~~~~~~~~~~~~~ 460 (474) T protein:vir:94 395 ISFNFNRMMNDAEQS-------QIIAQS-Q----YLSRETLVKS-SPLVDD-YKAELERIEQEQMEYNKQLPNLDDGGAD 460 (474) T ss_pred EEeccCcccCHHHHH-------HHHHHc-C----CCCHHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHhhccccCCCCCC Confidence 778544433333222 334443 2 4799999977 444331 223333444333221 1111100 Q ss_pred -----ccC Q lcl|NC_018087. 518 -----EEI 520 (520) Q Consensus 518 -----e~~ 520 (520) |+= T Consensus 461 ~~~~~~~~ 468 (474) T protein:vir:94 461 GAQQQEGS 468 (474) T ss_pred CcccCCCC Confidence 000 No 135 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=95.70 E-value=0.0016 Score=35.92 Aligned_cols=397 Identities=12% Similarity=0.080 Sum_probs=171.9 Q ss_pred eeccccccc---cccccccccc-cccc-----chhHHHHHHHHHHHhhccchhHH------------------------- Q lcl|NC_018087. 45 EVDSQDIAY---NGVFQKLYGS-QDPT-----ATSTRELINTYRSLLNNYEVDNA------------------------- 90 (520) Q Consensus 45 ~i~~~~~a~---~g~~~~~~~~-~~~~-----~~~~~~LI~~YR~ma~~pEvd~A------------------------- 90 (520) +++.+++.. +..+...+.. +..+ +..-...+.+|+.+..+++-... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~ 80 (472) T protein:vir:93 1 MYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMIT 80 (472) T ss_pred CCCCCCcchhhhhceeeecCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhcccccccccccccccc Confidence 333332211 1222221100 0000 01112233556555555433321 Q ss_pred --HHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCCCC Q lcl|NC_018087. 91 --VQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKD 168 (520) Q Consensus 91 --i~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~ 168 (520) ...||+-.+-+= -..|+++..++.+..+ .+ +.+++ -+|+....++++.+++-|+-|.+.-+|. + T Consensus 81 n~~~~ivd~~~~~l-~g~~~~~~~~d~~~~~----~l----~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~----d 146 (472) T protein:vir:93 81 NFHANLVDQKVSYI-VGKPIAFKHTDDEVVK----RI----DEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDE----E 146 (472) T ss_pred chHHHHHHHHhhhh-cccCeeeccCChHHHH----HH----HHHHh-ccHHHHHHHHHHHHhhcCeEEEEEEECC----C Confidence 111222211111 2366777766654433 22 23333 2678888899999999999888766552 3 Q ss_pred CeeeeEecCccceeeeeecc--CCCCccccccc----ce--------ecceeecCcccccccccc------eecCCccee Q lcl|NC_018087. 169 GIIELRRLDPRNVQFVRELD--TKMENGVKVVK----GY--------REYFLYDTELESYQCGHQ------HFAAGTKIK 228 (520) Q Consensus 169 GI~elr~lDPr~i~~vr~i~--~~~~~~~~~~~----~~--------~ey~~y~~~~~~~~~~~~------~~~~~~~~~ 228 (520) |-..+..+||+.+.++.+-. .+..-.++.+. .. ..+|.+............ ...++.-=+ T Consensus 147 ~~~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (472) T protein:vir:93 147 GEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGK 226 (472) T ss_pred CceEEEEEcccceEEEEcCCCCCceEEEEEEEEeecceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCC Confidence 56789999999998875421 11112222111 00 112222211110000000 000010001 Q ss_pred cCcccEEEeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhh Q lcl|NC_018087. 229 IPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLK-LLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSH 307 (520) Q Consensus 229 I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~-m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~ 307 (520) || |+++. |+....|=++..+....-+. ++-+....-+....|-+-+.-.+.-.. .+..+. +..+ T Consensus 227 vP---vv~~~------nn~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~-----~~~~~~-~~~~ 291 (472) T protein:vir:93 227 IP---FIPFK------NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQEL-----PEFKRL-LRYY 291 (472) T ss_pred cc---eEEec------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcccc-----hhhHHH-Hhhc Confidence 21 22221 12223454554444333332 455555556666666444432221111 111111 1111 Q ss_pred cceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccc Q lcl|NC_018087. 308 RNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQN 386 (520) Q Consensus 308 knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~ 386 (520) + -..+ ++ |..+.+|-...+.+. ..-+.-+++.+|+..++|-- .+++ ++ T Consensus 292 ~-----------------------~~~~---~~--~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~--~~~~-~~ 340 (472) T protein:vir:93 292 G-----------------------AIKV---SD--NGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDF--SSDK-FG 340 (472) T ss_pred c-----------------------cccc---CC--CCcceeEeecCCHHHHHHHHHHHHHHHHHHhCCCCC--Cccc-cc Confidence 1 0111 11 123555543334333 33456677788889999852 2221 11 Q ss_pred ccccc--chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHH Q lcl|NC_018087. 387 VFDMS--TAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERR 464 (520) Q Consensus 387 ~~G~~--~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R 464 (520) |.+ ..|.--+.....-+.+.++.|...+.++++.=+-+-|+ ..+|. .|.+.|....--.+. +. T Consensus 341 --~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~--~~~~~----~i~v~f~~~~p~~~~-------~~ 405 (472) T protein:vir:93 341 --SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI--KGEHK----DVDISFNYNKVANTE-------LQ 405 (472) T ss_pred --cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Ccccc----eeeEEeCCCCCCCHH-------HH Confidence 222 22333344455667788888888888877754333343 34554 466777533332232 22 Q ss_pred HHHHHHhhcccchhhhHHHHHHHHhCCCH--HHHHHHHHHHHHhhhcCC-cc----CCcc-ccC Q lcl|NC_018087. 465 VNVLSLMEPYIGKYISNHTAMKDFLQMSD--EDIAAERKLIDEELSDKI-FN----PPEP-EEI 520 (520) Q Consensus 465 ~~~~~~~~p~vgky~S~~~i~k~IL~~tD--eeI~~~~kqi~~E~~~~~-~~----~p~~-e~~ 520 (520) ++++..+. | .+|.+++++.+-..+| +|++.++++-++.+++.. ++ ++.+ +|- T Consensus 406 ~~~~~k~~---g-iis~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~ 465 (472) T protein:vir:93 406 VQTAQQSM---G-IVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQQER 465 (472) T ss_pred HHHHHHHh---c-cCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhccCcCcccCCCCCCCCC Confidence 44455553 4 3799999987433443 455554443333222210 11 1111 111 No 136 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=95.62 E-value=0.0017 Score=35.73 Aligned_cols=397 Identities=12% Similarity=0.073 Sum_probs=181.0 Q ss_pred CCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchhHH-------------------------HH Q lcl|NC_018087. 38 KFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNA-------------------------VQ 92 (520) Q Consensus 38 ~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~A-------------------------i~ 92 (520) =++|-...+.. + ++.-.....+|+.+..|++=... |+ T Consensus 1 ~~~~~~~~i~~---------------l---~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd 62 (441) T protein:vir:80 1 MNSDELALIEG---------------M---YDRIQRLSSWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVD 62 (441) T ss_pred CCccHHHHHHH---------------H---HHHHHHHHHHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHH Confidence 11111000100 0 00000111223333333322222 22 Q ss_pred hhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCCCCCeee Q lcl|NC_018087. 93 EIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIE 172 (520) Q Consensus 93 eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~e 172 (520) ..++-. + +..+...+ . +..+.+.+--+|+....++++.-.+-|+-|.+.-.| ++|-.. T Consensus 63 ~~~~~l-~------~~g~~~~d---~--------~~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d----~~g~~~ 120 (441) T protein:vir:80 63 ALEERL-D------WLGWTNGD---G--------YGLDGVYAANRLATASCDVHLDALIFGLSFVAIIPH----GDGTVS 120 (441) T ss_pred HHHhhh-c------cccccCCC---h--------HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeC----CCCceE Confidence 111110 0 00010000 1 233445555678888899999999999988876544 347778 Q ss_pred eEecCccceeeeeeccCCC-Ccccccc----cceecceeecCcccccc--cccceecC----Ccce-ecCcccEEEeecc Q lcl|NC_018087. 173 LRRLDPRNVQFVRELDTKM-ENGVKVV----KGYREYFLYDTELESYQ--CGHQHFAA----GTKI-KIPYSAMVYAHSG 240 (520) Q Consensus 173 lr~lDPr~i~~vr~i~~~~-~~~~~~~----~~~~ey~~y~~~~~~~~--~~~~~~~~----~~~~-~I~~~aI~y~hSG 240 (520) ++.+||+.+..+.+-.... .-....+ .......+|.+..-.+. .+...... ...+ ++| .|.|.+-- T Consensus 121 i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~~~~~~~~vy~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP--vv~~~n~~ 198 (441) T protein:vir:80 121 VRPQSPKNCTGKFSADGSRLDAGLVVQQTCDPEVVEAELLLPDVIVQVERRGSREWVEVDRIPNVLGAVP--LVPIVNRR 198 (441) T ss_pred EEEEccceEEEEEeCCCCceeEEEEEEEEecCceEEEEEEecCeEEEEEEcCCcceeeccccccCCCcee--EEEeeccc Confidence 9999999998876422111 1010110 11122234433221111 11100000 0001 122 13333321 Q ss_pred cccCCCCcchhhhHHHHHHHH-H-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCC Q lcl|NC_018087. 241 LVDCCGKNIIGYLHRAVKPAN-Q-LKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTG 318 (520) Q Consensus 241 L~d~~~~~~~syL~~aik~~N-q-L~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TG 318 (520) - .......|-|...++++- . -+++-+..++-+.+..|.|-+.=.+.+..+. +. .+. ..| T Consensus 199 ~--~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~G~~~~~~~~--------~~---~~~------~~~ 259 (441) T protein:vir:80 199 R--TSRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVTGVSADEFSQ--------PG---WVL------SMA 259 (441) T ss_pred c--CCccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeeecCCcccccc--------ch---hhh------ccc Confidence 1 111122343433333321 1 2456677778888888877554222111110 00 000 001 Q ss_pred ccccccccchhhhhhc-ccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHH Q lcl|NC_018087. 319 KVKNQANMMALTEDYW-LQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRD 397 (520) Q Consensus 319 ev~d~~~~msmlEDyw-LpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRD 397 (520) -+| +|--++|.+.++..+|+++-=.-++-++=.-..++...++|.+-|...+. +. ..+..|.-- T Consensus 260 -------------~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~-~~-~Sg~Al~~~ 324 (441) T protein:vir:80 260 -------------SVWAVDKDDDGDTPNVGSFPVNSPTPYSDQMRLLAQLTAGEAAVPERYFGFITS-NP-PSGEALAAE 324 (441) T ss_pred -------------ccccCCCCCCCCcceeEecCccchHHHHHHHHHHHHHHhcccCCCHHHhccCCC-cc-hHHHHHHHH Confidence 122 34444455567777886432223333444556677778888766643321 10 111223333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccch Q lcl|NC_018087. 398 ELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGK 477 (520) Q Consensus 398 ElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgk 477 (520) ....-.-+.+.++.|..-+...++.=+-+.|. ..++......+.+.|.....=+. .+.++.+.++..-+-. T Consensus 325 ~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~--~~~~~~~~~~i~~~f~~~~~~~~-------~e~ad~~~kl~~~g~~ 395 (441) T protein:vir:80 325 ESRLVKRAERRQTSFGQGWLSVGFLAAKALDS--RVDEADFFGDVGLRWRDASTPTR-------AATADAVTKLVGAGIL 395 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CCcccccceeeeEEeCCCCCcCH-------HHHHHHHHHHHhcCcc Confidence 44455667788888888888877754444443 33444445678888875443333 4455555555544323 Q ss_pred hhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhc----CCccCCccccC Q lcl|NC_018087. 478 YISNHTAMKDFLQMSDEDIAAERKLIDEELSD----KIFNPPEPEEI 520 (520) Q Consensus 478 y~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~----~~~~~p~~e~~ 520 (520) ..|.++++ ..|..+++|++++.++-+++... .-..+-..+|+ T Consensus 396 ~~s~~~~~-~~l~~~~~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~ 441 (441) T protein:vir:80 396 PADSRTVL-EMLGLDDVQVEAVMRHRAESSDPLAVLAGAISRQTNEV 441 (441) T ss_pred cccHHHHH-HhCCCCHHHHHHHHHHHHHHHHHHHHHhhhhhcccccC Confidence 45788887 45899999999876644333221 11122233344 No 137 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=95.61 E-value=0.0018 Score=35.69 Aligned_cols=426 Identities=11% Similarity=0.058 Sum_probs=176.8 Q ss_pred chhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccc---cchhHHHHHH-------- Q lcl|NC_018087. 8 DLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDP---TATSTRELIN-------- 76 (520) Q Consensus 8 ~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~---~~~~~~~LI~-------- 76 (520) +|+.=.|-.-.|-..+ ++. .+-.-. ++.|. +....+. ...-...+|+ T Consensus 1 ~~~~~~~~~~~~~~~~---~~~---~~~~~~---------------n~~~~-~~~~e~~~~~~~~~i~~~i~~~~~~~~~ 58 (511) T protein:vir:93 1 MLKVNEFETDTDLRGN---INY---LFNDEA---------------NVVYT-YDGTESDLLQNVNEVSKYIEHHMDYQRP 58 (511) T ss_pred Cccccchhhhhhhhhh---hhh---hhhhhh---------------CCccc-ccchhhhhhccHHHHHHHHHHHHHhhHH Confidence 4444344333333221 111 010000 11111 1100111 1111222332 Q ss_pred HHHHHhhccchhHHH----------------------HhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHH Q lcl|NC_018087. 77 TYRSLLNNYEVDNAV----------------------QEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLN 134 (520) Q Consensus 77 ~YR~ma~~pEvd~Ai----------------------~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ 134 (520) +|+.+..+.+-...| .-||+..+-+= -..||++..++.+ ..+.++.+++ T Consensus 59 r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl-~g~p~~~~~~d~~--------~~~~l~~~~~ 129 (511) T protein:vir:93 59 RLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYF-LGNPIQYQDDDKD--------VLEVIEAFND 129 (511) T ss_pred HHHHHHHHhcccCccccccCcCcccccCcceeecchHHHHHHHHhhhh-cccCeeeccCChH--------HHHHHHHHHh Confidence 344444443332221 12222211111 2366676665543 3345566666 Q ss_pred HhcchhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccC--CCCccccccc----------cee Q lcl|NC_018087. 135 MLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDT--KMENGVKVVK----------GYR 202 (520) Q Consensus 135 ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~--~~~~~~~~~~----------~~~ 202 (520) --+|+....++.+...+-|+-|.+.-.|. +|-..+..+||+.+.+|.+-.. +..-+++.+. .+. T Consensus 130 ~n~~~~~~~~~~~~~~~~G~ay~~vy~de----~~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~~~~~~~~~~ 205 (511) T protein:vir:93 130 LNDVESHNRSLGLDLSIYGKAYELMIRNQ----DDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVF 205 (511) T ss_pred hcCHhHHHHHHHHHHHhcCeeEEEEEeCC----CCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccceEE Confidence 67889999999999999999999877662 3667899999999998865322 1122222111 011 Q ss_pred cceeecCcc-cccccccceec----------CCcceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHHH-HHHHH Q lcl|NC_018087. 203 EYFLYDTEL-ESYQCGHQHFA----------AGTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKL-LEDAM 270 (520) Q Consensus 203 ey~~y~~~~-~~~~~~~~~~~----------~~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m-~EDal 270 (520) -+-+|++.. ..|..++.... +..-=+|| .|.|. ++....|-++..+.....+.. +=+.. T Consensus 206 ~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vP--vv~~~-------nn~~g~gd~e~v~~liDa~d~~~S~~~ 276 (511) T protein:vir:93 206 TVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMP--ITEFS-------NNERRKGDYEKVITLIDLYDNAESDTA 276 (511) T ss_pred EEEEEeCCcEEEEEecCCCccccccccccccccCCCccc--eEEec-------CCCCCCCchhhHHHHHHHHHHHHHHHH Confidence 122555442 22211111100 00001222 12222 222335666665555554432 22333 Q ss_pred HHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccc-cccccchhhh-hhc--ccccCCCCCcce Q lcl|NC_018087. 271 MIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVK-NQANMMALTE-DYW--LQRRDGKAVTEV 346 (520) Q Consensus 271 VIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~-d~~~~msmlE-Dyw--LpRReGgrgTEI 346 (520) ..-+-++.|-+-+.=..... +++++ +....+-.+. ..| ...-..+.|..+ T Consensus 277 ~~~~~~~~~~lv~~G~~~~~--------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 330 (511) T protein:vir:93 277 NYMSDLNDAMLLIKGNLNLD--------------------------PVEVRKQKEANVLFLEPTVYADSEGRETEGSVDG 330 (511) T ss_pred HHHHHhhCcceeeecCcccC--------------------------chhhcccccccceecccccccccccccCCCCcce Confidence 33344445544333111000 00000 0111111111 111 111122233455 Q ss_pred eecCCCCCcChH-HHHHHHHHHHHHhcCCChhhccCCCccccccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 347 ETLPGMTGMNEM-DDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAIS--RDELSFDKFISELQHKFEEIFLSPLKSN 423 (520) Q Consensus 347 sTLpGg~nLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eIt--RDElkF~KFI~rLr~rFs~if~d~Lk~Q 423 (520) ..|-...+...+ .-+.-..+.+|+-.++|-- ..++ +. |..+..+ .-...-..-+.+.++.|..-+...++.= T Consensus 331 ~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~--~~~~-~~--~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li 405 (511) T protein:vir:93 331 GYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNM--KDDN-FS--GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLL 405 (511) T ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCCccc--cccc-cc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 555443333322 2334456777888888852 2221 11 2222222 2222223344555555555555544432 Q ss_pred HHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCH---HHHHHHH Q lcl|NC_018087. 424 LLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSD---EDIAAER 500 (520) Q Consensus 424 LiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tD---eeI~~~~ 500 (520) +-+-++....++..-...+.+.|...---.+ .+.++++..+. | .+|.+++++. |..++ +||+.++ T Consensus 406 ~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~-------~e~~~~~~kl~---g-~iS~et~~~~-l~~v~d~~~E~~ri~ 473 (511) T protein:vir:93 406 ETILKNTWSIDANKDFNTVRYVYNRNLPKSL-------IEELKAYIDSG---G-KISQTTLMSL-FSFFQDPELEVKKIE 473 (511) T ss_pred HHHHHhccCcccccccccceEEeCCCCCCCH-------HHHHHHHHHHh---c-cCchHHHHHh-CCCCCCHHHHHHHHH Confidence 2221222222232223456777865333333 23344555553 3 3799999976 55543 5666655 Q ss_pred HHHHHhhhc---CCccCCccccC Q lcl|NC_018087. 501 KLIDEELSD---KIFNPPEPEEI 520 (520) Q Consensus 501 kqi~~E~~~---~~~~~p~~e~~ 520 (520) ++-+++.+. ....+|...+= T Consensus 474 ~E~~~~~~~~~~~~~~~~~~~~~ 496 (511) T protein:vir:93 474 EDEKESIKKAQKGIYKDPRDIND 496 (511) T ss_pred HHHHHHHHHHhhhcccCCCCCCC Confidence 554444332 11222222111 No 138 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=95.51 E-value=0.0019 Score=35.46 Aligned_cols=399 Identities=12% Similarity=0.090 Sum_probs=165.8 Q ss_pred eeccc----cccccccccccccccccc-------chhHHHHHHHHHHHhhccchhHH----------------------- Q lcl|NC_018087. 45 EVDSQ----DIAYNGVFQKLYGSQDPT-------ATSTRELINTYRSLLNNYEVDNA----------------------- 90 (520) Q Consensus 45 ~i~~~----~~a~~g~~~~~~~~~~~~-------~~~~~~LI~~YR~ma~~pEvd~A----------------------- 90 (520) |+... ..-+...+...+...+.. +..-..-+++|+.+..+++-... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki 80 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQYETQEEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWRM 80 (468) T ss_pred CccccCCcCceeehheeecccccccCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccccccccccccccccc Confidence 33221 001111111111000000 00111122334444333332211 Q ss_pred ----HHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCC Q lcl|NC_018087. 91 ----VQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRP 166 (520) Q Consensus 91 ----i~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~ 166 (520) ...||+..+-+= -..||++..++.+..+ .++.++. =+|+....++.+.+.+-|+-|.+.-+|. T Consensus 81 ~~n~~~~Iv~~~~~~l-~g~p~~~~~~d~~~~~--------~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~--- 147 (468) T protein:vir:96 81 YTNYHQNLVDQKVAYA-VANPVTYGTEDEKSLK--------TIQEVLN-HKWDDKLVDILTAASNKGVEWIQPYVDE--- 147 (468) T ss_pred ccchHHHHHHHHHhhh-ccCCceeccCChHHHH--------HHHHHHh-cCHHHHHHHHHHHHhhcCeEEEEEEEcC--- Confidence 111222111111 1256666665543333 2333332 2677888889999999999999877653 Q ss_pred CCCeeeeEecCccceeeeeecc--CCCCcccccc--cceecceeecCc-ccccccccce---------ecCCcceecCcc Q lcl|NC_018087. 167 KDGIIELRRLDPRNVQFVRELD--TKMENGVKVV--KGYREYFLYDTE-LESYQCGHQH---------FAAGTKIKIPYS 232 (520) Q Consensus 167 k~GI~elr~lDPr~i~~vr~i~--~~~~~~~~~~--~~~~ey~~y~~~-~~~~~~~~~~---------~~~~~~~~I~~~ 232 (520) +|-..+..+||+.+.++..-. .+..-.++.+ .+...+-+|.+. ...|...... ............ T Consensus 148 -~~~~~i~~~~p~~~~~v~~~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (468) T protein:vir:96 148 -QGEFKTFRVPAEQAIPIWTNKERDELKAFIRLYELDGGERVEYWTANDVTFYELKDGQLIPDYYQGEEHVQAHYYVGNK 226 (468) T ss_pred -CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCceeecccccccccccceeeccc Confidence 366789999999998874311 1111111110 011111222221 1111100000 000000000000 Q ss_pred -------cEEEeecccccCCCCcchhhhHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHH Q lcl|NC_018087. 233 -------AMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKL-LEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIM 304 (520) Q Consensus 233 -------aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m-~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im 304 (520) -|+++ +|+....|-++..+.....|.+ +-+..-.-+-.+.|-+-+.-.+... .++.+.. | T Consensus 227 ~~~~~~iPvv~~------~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~-----~~~~~~~-~ 294 (468) T protein:vir:96 227 SMSWNRVPFIPF------KNNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEGED-----LEEFMYN-L 294 (468) T ss_pred cccCCcccEEEe------cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccc-----cchhhhh-h Confidence 02221 1222345666665555555542 3333444456666654333211111 1111110 1 Q ss_pred HhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChH-HHHHHHHHHHHHhcCCChhhccCCC Q lcl|NC_018087. 305 NSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEM-DDILYFRKALYMALRVPLSRIPDEQ 383 (520) Q Consensus 305 ~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei-~DV~YF~kkLy~aL~VP~SRl~~~~ 383 (520) ..++ -++++= +++. .+..|....+.... .-++-+.+.+|+..++|- +.+++ T Consensus 295 ~~~~-----------------------~i~~~~-d~~~--~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~ 346 (468) T protein:vir:96 295 KYYK-----------------------AINVDG-DGSG--GVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVD--FQQDK 346 (468) T ss_pred hcCc-----------------------eEEecC-CCCC--cceEEeecCChHHHHHHHHHHHHHHHHHhCccc--ccccc Confidence 1111 112322 2222 24444443344333 336778888999999984 33222 Q ss_pred ccccccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHH Q lcl|NC_018087. 384 TQNVFDMSTAIS--RDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEIT 461 (520) Q Consensus 384 ~~~~~G~~~eIt--RDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~ 461 (520) .. |..+..+ ........-+.+.+..|...+..+++.=+-+.|+. -+| ..|.+.|...---.+...++ T Consensus 347 ~~---~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~--~d~----~~i~i~f~~~~p~d~~e~a~-- 415 (468) T protein:vir:96 347 FG---NSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKLS--IKV----QDVEITFNFNVMVNELEQSQ-- 415 (468) T ss_pred cc---cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--ccc----ceeeEEecCCCCcCHHHHHH-- Confidence 11 2232222 11222334466777777777777666544444542 233 34677776555444433333 Q ss_pred HHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcC-----CccCCccccC Q lcl|NC_018087. 462 ERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDK-----IFNPPEPEEI 520 (520) Q Consensus 462 ~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~-----~~~~p~~e~~ 520 (520) ++.++ | .+|.+++++. |...++ -+++.++|++|..+. -+...++.|= T Consensus 416 -----~~~~~----g-~iS~et~i~~-l~~v~D-~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 467 (468) T protein:vir:96 416 -----IGVNS----Q-YLSKETVVTN-HPWVDD-PVAEMERIDQEELALPSIEEGLNGKENNEP 467 (468) T ss_pred -----HHHhc----C-CCchHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHHhhccCCCCCCCC Confidence 23332 2 5799999977 555431 234444444444322 1222222222 No 139 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=95.47 E-value=0.002 Score=35.38 Aligned_cols=406 Identities=14% Similarity=0.131 Sum_probs=190.7 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhccC-CCcccCCCC-CCCceeecccccccccccccccccccccchhHHHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIINDK-AESITAPKF-DDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTY 78 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~~~-~~s~~~p~~-~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~Y 78 (520) |--.+.+.|+-+ +.+ ..++-+|-. .|++- . .++.|+-.. .+. .+ .. T Consensus 1 ~~~~~~~~~~~~---------------~~~~~~~~g~~~s~~~~~~-~----~~~~~~~~~--~g~--~v--------~~ 48 (437) T protein:vir:10 1 MKQGKQRALGRI---------------KSSFLKWLGVPISLTDGSF-W----SAWGGMGSS--SGE--TV--------TA 48 (437) T ss_pred CCcchhhhhhhh---------------HHhhhhhcCCcccCCchhH-H----HhhcccccC--CCc--ee--------ch Confidence 111122221111 111 111122211 11110 0 011111110 010 01 12 Q ss_pred HHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHH-HHHHHHHHH-HhcchhhhHHHHH----hhccc Q lcl|NC_018087. 79 RSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLI-SDEFNSVLN-MLNFQRKGSDHFK----RWYVD 152 (520) Q Consensus 79 R~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I-~eeF~~i~~-ll~f~k~g~~~fR----rWYvD 152 (520) ...+.+|-|..||+-|.+.+.-.+-. .....-+.. +..+ ......+|+ --|-...++++.+ .+.+. T Consensus 49 ~~al~~~~v~~ci~~Ia~~ia~lp~~--~~~~~~~g~------~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~ 120 (437) T protein:vir:10 49 DSALQLSAVWSCVRLIAETIATLPLN--LYQTKPDGT------RVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLW 120 (437) T ss_pred HhhhccHHHHHHHHHHHHHHhhCcee--EEEEcCCCc------eeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhc Confidence 34468899999999999987543110 111100100 0000 111222232 2334445555444 45677 Q ss_pred cceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcc Q lcl|NC_018087. 153 SRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYS 232 (520) Q Consensus 153 gri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~ 232 (520) |--|..++-| ...+++|.+|+|..+...+. .++.- .|.++.+. +....++.+ T Consensus 121 Gnay~~i~r~----~g~~~~L~~l~p~~v~i~~~-----~~g~~------~y~~~~~~-------------g~~~~~~~~ 172 (437) T protein:vir:10 121 GNGYARKLRS----AGVLIGLELMLPQRTTVKRL-----TSGAL------QYTYRNVD-------------GTVSTLAED 172 (437) T ss_pred CCeEEEEEec----CCcEEEEEEEcCcceEEEEC-----CCCeE------EEEEEecC-------------ceEEEEccc Confidence 9888876644 24689999999999887542 12211 12222111 223578999 Q ss_pred cEEEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeE Q lcl|NC_018087. 233 AMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRIS 312 (520) Q Consensus 233 aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklv 312 (520) .|+|.. + ...++...+|-+..|.+++.....+++...=+----+--+-|..++ +.|.+.++++..+.+-.+|+. . T Consensus 173 dIih~r-~-~~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g--~ 247 (437) T protein:vir:10 173 DVFHVR-G-FSLDGLMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTD-QILQKEKRAEIRTDLAEQFGG--A 247 (437) T ss_pred cEEEec-C-cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhcC--c Confidence 998874 2 3456667788999999999988888877665555556667777776 678888877766665555543 1 Q ss_pred eecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCcccccccc Q lcl|NC_018087. 313 YDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMS 391 (520) Q Consensus 313 Yd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~ 391 (520) - ..|. .+ .++ .|++++.|.-...-.+ ++=-++-.+.+.++++||...|....+.+. .. T Consensus 248 ~--nag~------~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~--~~ 306 (437) T protein:vir:10 248 M--QAGK------TM-VLE----------AGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTS--WG 306 (437) T ss_pred c--ccCc------ce-ecc----------CCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccc--cc Confidence 0 1122 11 221 2566666642222223 222345678899999999999864332211 11 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018087. 392 TAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLM 471 (520) Q Consensus 392 ~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~ 471 (520) +.+.-.-+.|..+ .|+--+.. +.+. +=+.++++.++.. ..|+|++ + ++.... +..|.+.++.+ T Consensus 307 sn~e~~~~~f~~~--tl~P~~~~-ie~~-----l~~kll~~~e~~~--~~~~fd~--~----~ll~~d-~~~r~~~~~~~ 369 (437) T protein:vir:10 307 TGIEQQTLGFLTF--TLRPWLTR-IEQA-----ARRSLLRPGERDQ--FYAEFSV--E----GLLRAD-SAGRAAFYSTM 369 (437) T ss_pred chHHHHHHHHHHH--HHHHHHHH-HHHH-----HHhhccCccccCc--eEEEEec--h----hhhccC-HHHHHHHHHHH Confidence 2232223334443 23322221 2222 2223456666642 2344433 2 232222 46667776655 Q ss_pred hcccchhhhHHHHHHHHhCCCHHH----HHHHHH---HHHHhhhcCC---------ccCCccccC Q lcl|NC_018087. 472 EPYIGKYISNHTAMKDFLQMSDED----IAAERK---LIDEELSDKI---------FNPPEPEEI 520 (520) Q Consensus 472 ~p~vgky~S~~~i~k~IL~~tDee----I~~~~k---qi~~E~~~~~---------~~~p~~e~~ 520 (520) -.- -++|.+-++. +|++.+-+ +-.... -+++...+.. -.+.++|+. T Consensus 370 ~~~--G~~T~NE~R~-~~gl~pi~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (437) T protein:vir:10 370 TQN--GLMTRDECRA-KENLPPMGGNAAVLTVQSALLPIDKLGEHTTATAAQDALKAWLYQEEKT 431 (437) T ss_pred HhC--CCcCHHHHHH-HhCCCCCCCCcceEeecCcccchhhccCcCCCcchhccccccCCCCCCC Confidence 222 3668888884 47775432 000000 0111111100 001222222 No 140 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=95.45 E-value=0.002 Score=35.34 Aligned_cols=426 Identities=12% Similarity=0.063 Sum_probs=175.5 Q ss_pred chhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccc---cchhHHHHHH-------- Q lcl|NC_018087. 8 DLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDP---TATSTRELIN-------- 76 (520) Q Consensus 8 ~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~---~~~~~~~LI~-------- 76 (520) +|+.=.|-.-.|-..+ ++. .+-.-. ++.|. +...... ...-...+|+ T Consensus 1 ~~~~~~~~~~~~~~~~---~~~---~~~~~~---------------n~~~~-~~~~e~~~~~~~~~i~~~i~~~~~~~~~ 58 (511) T protein:vir:96 1 MLKVNEFETDTDLRGN---INY---LFNDEA---------------NVVYT-YDGTESDLLQNVNEVSKYIEHHMDYQRP 58 (511) T ss_pred Cccccchhhhhhhhhh---hhh---hhhhhh---------------CCccc-cchhhhhhhccHHHHHHHHHHHHHhhHH Confidence 3444344333333221 111 010000 11111 1100111 1111222222 Q ss_pred HHHHHhhccchhHHH----------------------HhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHH Q lcl|NC_018087. 77 TYRSLLNNYEVDNAV----------------------QEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLN 134 (520) Q Consensus 77 ~YR~ma~~pEvd~Ai----------------------~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ 134 (520) +|+.+..+.+=+..+ .-||+..+-+= -..||++..++.+ ..+.+..+++ T Consensus 59 r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl-~g~p~~~~~~~~~--------~~~~l~~~~~ 129 (511) T protein:vir:96 59 RLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYF-LGNPIQYQDDDKD--------VLEAIEAFND 129 (511) T ss_pred HHHHHHHHhcccCccccccCcCcccccCcceeecchHHHHHHHHHhhh-ccCCceeecCchH--------HHHHHHHHHh Confidence 354454433322111 12222211111 1366677665542 3345667777 Q ss_pred HhcchhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCC--CCccccccc----------cee Q lcl|NC_018087. 135 MLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTK--MENGVKVVK----------GYR 202 (520) Q Consensus 135 ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~--~~~~~~~~~----------~~~ 202 (520) --+|+....++.+...+-|+-|.+.-+|. +|-..+..+||+.+.+|.+-... ..-+++.+. .+. T Consensus 130 ~n~~~~~~~~~~~~~~i~G~a~~~vy~de----d~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~ 205 (511) T protein:vir:96 130 LNDVESHNRSLGLDLSIYGKAYELMIRNQ----DDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVF 205 (511) T ss_pred hcCHHHHHHHHHHHHHhcCeeEEEEEeCC----CCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccceEE Confidence 77899999999999999999888876652 36788999999999998653321 112222110 011 Q ss_pred cceeecCcc-cccccccceec----------CCcceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHHHHH-HHH Q lcl|NC_018087. 203 EYFLYDTEL-ESYQCGHQHFA----------AGTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLE-DAM 270 (520) Q Consensus 203 ey~~y~~~~-~~~~~~~~~~~----------~~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~E-Dal 270 (520) .+-+|++.. ..+..++.... +..--+|| |++. +|+....|-++..+.....+..+- +.. T Consensus 206 ~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vP---vv~~------~nn~~g~gd~e~v~~liDa~d~~~S~~~ 276 (511) T protein:vir:96 206 TVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMP---ITEF------SNNERRKGDYEKVITLIDLYDNAESDTA 276 (511) T ss_pred EEEEEeCCcEEEEEecCCCcccccccccccccccCCcee---eEEe------cCCCCCCCchhhhHHHHHHHHHHHHHHH Confidence 122555442 11211111000 00000222 2221 122234566665555554443322 222 Q ss_pred HHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccc-cccccchhhhhhc---ccccCCCCCcce Q lcl|NC_018087. 271 MIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVK-NQANMMALTEDYW---LQRRDGKAVTEV 346 (520) Q Consensus 271 VIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~-d~~~~msmlEDyw---LpRReGgrgTEI 346 (520) ..-+-++.|-+-+.- .++ ..++++. +....+-.+.... ......+.|..+ T Consensus 277 ~~~~~~~~~~lv~~g----~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 330 (511) T protein:vir:96 277 NYMSDLNDAMLLIKG----NLN----------------------LDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDG 330 (511) T ss_pred HHHHHhhCceeeeec----Ccc----------------------CCchhhcccccccceecccccccccccccCCCCcce Confidence 222334444332221 100 0001111 1111111111110 001112224456 Q ss_pred eecCCCCCcChH-HHHHHHHHHHHHhcCCChhhccCCCccccccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 347 ETLPGMTGMNEM-DDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAIS--RDELSFDKFISELQHKFEEIFLSPLKSN 423 (520) Q Consensus 347 sTLpGg~nLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eIt--RDElkF~KFI~rLr~rFs~if~d~Lk~Q 423 (520) ..|-...+...+ .-+.-+.+.+|.-.++|-- .+++ +. |..+..+ .-...-..-+.+.+..|..-+...++.= T Consensus 331 ~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~--~~~~-~~--~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li 405 (511) T protein:vir:96 331 GYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNM--KDDN-FS--GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLL 405 (511) T ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCCccc--cccc-cc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 666555454433 3345667888888888862 2221 11 2222222 2222233334445555555555544432 Q ss_pred HHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCH---HHHHHHH Q lcl|NC_018087. 424 LLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSD---EDIAAER 500 (520) Q Consensus 424 LiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tD---eeI~~~~ 500 (520) +-+-++....+++.-...+++.|...---.+ .+.++++..+. | .+|.+++++. |..++ +|++.+. T Consensus 406 ~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~-------~e~~~~~~kl~---G-~iS~et~l~~-l~~v~D~~~E~~ri~ 473 (511) T protein:vir:96 406 ETILKNTWSIDANKDFNTVRYVYNRNLPKSL-------IEELKAYIDSG---G-KISQTTLMSL-FSFFQDPELEVKKIE 473 (511) T ss_pred HHHHHhhcCcccccccccceEEeCCCCCCCH-------HHHHHHHHHHh---c-cCChHHHHHh-CCCCCCHHHHHHHHH Confidence 2222222222222223457788864333333 23344555553 3 3799999976 55543 4555554 Q ss_pred HHHHHhhhc---CCccCCccccC Q lcl|NC_018087. 501 KLIDEELSD---KIFNPPEPEEI 520 (520) Q Consensus 501 kqi~~E~~~---~~~~~p~~e~~ 520 (520) ++-+++.+. .....|++.+= T Consensus 474 ~E~~~~~~~~~~~~~~~~~~~~~ 496 (511) T protein:vir:96 474 EDEKESIKKAQKGIYKDPRDIND 496 (511) T ss_pred HHHHHHHHHHhhccccCCCCCCC Confidence 443332222 12222222111 No 141 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=95.44 E-value=0.0021 Score=35.30 Aligned_cols=406 Identities=12% Similarity=0.131 Sum_probs=178.0 Q ss_pred cc-cccchhhhcchhhhhhhHHHhhhc-------cCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHH Q lcl|NC_018087. 3 ML-ADSDLKMFAFWHKVDDTEYDKIIN-------DKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTREL 74 (520) Q Consensus 3 ~~-~~~~l~~f~~~~~~~~~~~~~~~~-------~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~L 74 (520) |- .|+..-|+.|-.+++... .+. ..+.+..-| .++..... ...+++++.. . ..+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~---~~~~~~~f~~~e~r~~~~~--~~~~~~~~----~~~~~~~~~~--~-~~~~~---- 64 (441) T protein:vir:98 1 MHWYNTDCYFVDFKSRKQSRK---ELVVVGIFYKNEKRDLQYN--EDDLQMMV----QTLPGFQGTK--L-RQYKD---- 64 (441) T ss_pred CceecCccceeccccccchhh---hhhccccccccccccccCC--CcchHHHH----HHhhcccccC--c-cccch---- Confidence 21 112222222211111110 000 000111111 11111100 0001111110 0 00111 Q ss_pred HHHHHHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcch----hhhHH----HH Q lcl|NC_018087. 75 INTYRSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQ----RKGSD----HF 146 (520) Q Consensus 75 I~~YR~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~----k~g~~----~f 146 (520) ..-+++|-|..||+-|.+.+.-.+ +.+. ++.... .-+.++.+|+-. -++.+ ++ T Consensus 65 ----~~al~~~~V~acv~~Ia~~iA~lp-----l~~~-~~~~~~---------~~~~~~~lL~~~PN~~~t~~~f~~~l~ 125 (441) T protein:vir:98 65 ----IEAIRHSDIFTAVMMIASDLARMP-----IRVT-VNGQIN---------YSDRIVNLLNTRPNPMYNGYIFKLVVF 125 (441) T ss_pred ----hhhhccHHHHHHHHHHHHhhccCc-----eEEe-cCCccc---------ccchHHHHHhcccccCCCHHHHHHHHH Confidence 123678899999999999877542 3332 111111 112244444322 23334 44 Q ss_pred HhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcc Q lcl|NC_018087. 147 KRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTK 226 (520) Q Consensus 147 RrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~ 226 (520) ....+.|.-|..++-|. + .-+++|.+|+|..+.+.++ .+|.- -|+++..... +.+.. T Consensus 126 ~~lll~Gnay~~i~r~~-~--G~~~~L~~i~~~~v~v~~~-----~~g~~------~~~~~~~~~~---------~~~~~ 182 (441) T protein:vir:98 126 VSALLTSHGYIEITRDK-T--GEPMNLTFRKTSEIELKLD-----ARGRL------YYFHQRIDSN---------GNNIE 182 (441) T ss_pred HHHhhcCCeEEEEEEcC-C--CcEEEEEEEcCceeEEEEC-----CCCcE------EEEEEEeccC---------cceee Confidence 44677899999988662 2 2389999999999988543 22211 1222211100 11122 Q ss_pred eecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHH-HHH Q lcl|NC_018087. 227 IKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQH-IMN 305 (520) Q Consensus 227 ~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~-im~ 305 (520) ..++++.|+|+. ..+.++...+|-|+.|.+++.....+++...=+=.--+--+-|..++ |.+...+|.+=+++ ... T Consensus 183 ~~~~~~dviHir--~~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~-~~~~~~e~~~~~~~~~~~ 259 (441) T protein:vir:98 183 RNVKFEDMLDIK--FYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFHK 259 (441) T ss_pred EEEccccEEEec--cCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC-CCCCCHHHHHHHHHHHHH Confidence 568888888775 24556666788899998888877777776553222223445566666 44433444333333 222 Q ss_pred hhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCc Q lcl|NC_018087. 306 SHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQT 384 (520) Q Consensus 306 ~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~ 384 (520) .|. | ..+..+.+ .++ .|.+++.|.=...-.| ++--++..+.+.++++||...|..+.+ T Consensus 260 ~~~---------G-~~nag~~~-vl~----------~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~ 318 (441) T protein:vir:98 260 SFS---------G-TKQAGKVV-VLD----------ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA 318 (441) T ss_pred Hhc---------C-ccccCcce-ecC----------CCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCC Confidence 232 2 22222233 222 1456666642222222 344466677899999999999853322 Q ss_pred cccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHH Q lcl|NC_018087. 385 QNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERR 464 (520) Q Consensus 385 ~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R 464 (520) ++.+.-..+-|. ..|+--+.. +.+.|-..| .++. . . ..+.|.-+ ++...+ +..| T Consensus 319 ------~~s~~q~~~~y~---~tl~P~~~~-ie~~ln~~L-----~~~~--~--~--~~~~fd~~----~llr~d-~~~~ 372 (441) T protein:vir:98 319 ------NMSITDANLDYL---STLKPYITC-VCAELNFKF-----NDEY--V--N--REFKFDTT----EIRVVD-EKTQ 372 (441) T ss_pred ------CccHHHHHHHHH---HHHHHHHHH-HHHHHHhhc-----cccc--c--C--ceEEEech----hhhccC-HHHH Confidence 112222233343 445432222 233332222 2221 1 1 23344322 222221 3456 Q ss_pred HHHHHHhhcccchhhhHHHHHHHHhCCCHHH-----H----------HHHH-HHHHH-hhhcCCccCCcccc Q lcl|NC_018087. 465 VNVLSLMEPYIGKYISNHTAMKDFLQMSDED-----I----------AAER-KLIDE-ELSDKIFNPPEPEE 519 (520) Q Consensus 465 ~~~~~~~~p~vgky~S~~~i~k~IL~~tDee-----I----------~~~~-kqi~~-E~~~~~~~~p~~e~ 519 (520) .+.++.+-.- -+++.+-+++ .+++.+-+ + +... .|..+ +..+.--+-.++.| T Consensus 373 ~~~~~~~~~~--G~~T~NE~R~-~~gl~pi~gGd~~~~~~~~n~~~~~~~~~~q~~~~~~~~~~~kgGe~ne 441 (441) T protein:vir:98 373 AEIDKINIDS--GKMNIDEIRQ-RDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred HHHHHHHHhC--CCcCHHHHHH-HhCCCCCCCCCcceEeecccccccccccccccccccccccccCCCCCCC Confidence 6666655322 3678888874 46665421 1 0000 00000 00001122333333 No 142 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=95.35 E-value=0.0022 Score=35.12 Aligned_cols=390 Identities=11% Similarity=0.079 Sum_probs=181.3 Q ss_pred cccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHh Q lcl|NC_018087. 3 MLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLL 82 (520) Q Consensus 3 ~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma 82 (520) |...+.++-++-+....-. +.+..+..-| .+.....+.++ ..+..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~----------------~~~~~~~~~~v------------~~~~~~ 46 (409) T protein:vir:93 1 MAKENIVTRIKKKLIDNWI------DQSTSKLYDF----------------SPWKNRSFWGV------------INNTLE 46 (409) T ss_pred CCccchhhhhhhhhhhhhh------cccccccccc----------------ccccCcccccc------------chhhhh Confidence 7777766655543322211 1111111100 11111111111 112346 Q ss_pred hccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHH-hcchhhhHHH----HHhhccccceeE Q lcl|NC_018087. 83 NNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNM-LNFQRKGSDH----FKRWYVDSRVFF 157 (520) Q Consensus 83 ~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~l-l~f~k~g~~~----fRrWYvDgri~~ 157 (520) ++|-|..||+-|.+.+.-.+ +.+- +.++ .. .....++|+. =|=..+++++ +..+.++|--|. T Consensus 47 ~~~~V~~ci~~Ia~~ia~lp-----~~~~-~~~~---~~----~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~ 113 (409) T protein:vir:93 47 TNETIFSAITKLSNSMASLP-----LKMY-EDYK---VV----NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYV 113 (409) T ss_pred ccHHHHHHHHHHHHhhhhCc-----eeEe-eccc---cc----cchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEE Confidence 88999999999999887432 1111 1111 11 1112223322 2223345554 445678899988 Q ss_pred EEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEe Q lcl|NC_018087. 158 HKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYA 237 (520) Q Consensus 158 hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~ 237 (520) .++-|. ...+++|.+|+|..+.+.++- ++.. -+|.|... .+.++.++.+.|+|. T Consensus 114 ~i~r~~---~G~~~~L~~l~~~~v~~~~~~-----~~~~------~~y~~~~~------------~g~~~~~~~~eVih~ 167 (409) T protein:vir:93 114 LIERDI---YHQPSKLFLLNPDVVEMLIEN-----QSRE------LYYSIHAA------------TGNKLIVHNMDMLHF 167 (409) T ss_pred EEEECC---CCcEEEEEEEcCceeEEEEeC-----CCcE------EEEEEEcC------------CceEEEEccccEEEe Confidence 887652 223899999999999885431 1111 12222211 123467899999888 Q ss_pred ecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCC Q lcl|NC_018087. 238 HSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDART 317 (520) Q Consensus 238 hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~T 317 (520) . +.-..++...+|-|..|.++......++..- + .-..+|..-+ ..--+.+.+.+++...+.+.+.|.| . T Consensus 168 r-~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~-~-~~~~~~~~~i-~~~~~~l~~e~~~~~~~~~~~~~~~-------~ 236 (409) T protein:vir:93 168 K-HIVASNMVQGISPIDVLKNTTDFDNAVRTFN-L-TEMQKPDSFM-LKYGSNVGKEKRQQVLEDFKQYYEE-------N 236 (409) T ss_pred C-CCCCCCccccccHHHHHHHHHHHHHHHHHHH-H-HhcCCCCceE-EecCCCCCHHHHHHHHHHHHHHhhc-------C Confidence 4 2223344455677777777666555555542 3 3333343333 3334566666665555544443432 2 Q ss_pred CccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHH---HHHHHHHHHHHhcCCChhhccCCCccccccccchh Q lcl|NC_018087. 318 GKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMD---DILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAI 394 (520) Q Consensus 318 Gev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~---DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eI 394 (520) |. .+ .+ +| |.+++.|. .+.-+++ -..|-...+.++++||.+-|...++.+ ++. + T Consensus 237 g~------~~-vl--------~~--g~~~~~l~--~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~-~sn---~ 293 (409) T protein:vir:93 237 GG------IL-FQ--------EP--GVEIEPLP--KKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTN-FAK---N 293 (409) T ss_pred CC------ee-ec--------CC--CceEEEcC--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC-ccc---H Confidence 32 11 11 12 56777774 2333333 334667889999999998886433221 111 2 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_018087. 395 SRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPY 474 (520) Q Consensus 395 tRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~ 474 (520) .-.-..|..++ |+-.+.. +.+-|- ..++++.++.. ...|.|.-+ ++...+ +..|.+.+..+-.- T Consensus 294 e~~~~~f~~~~--l~P~~~~-ie~~l~-----~~Ll~~~~~~~---~~~~~fd~~----~ll~~d-~~~~~~~~~~~~~~ 357 (409) T protein:vir:93 294 EELNRFYLQHT--LLPIVKQ-YEEEFN-----RKLLTKTDREK---NRYFKFNVK----SYLRAD-SATQAEVYFKAVRS 357 (409) T ss_pred HHHHHHHHHHH--HHHHHHH-HHHHHH-----hhcCCcccccC---cceEEeech----hhhccC-HHHHHHHHHHHHhC Confidence 11122355442 3332222 222222 23456665542 234555432 333333 35566666655332 Q ss_pred cchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCC--------ccC---CccccC Q lcl|NC_018087. 475 IGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKI--------FNP---PEPEEI 520 (520) Q Consensus 475 vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~--------~~~---p~~e~~ 520 (520) -+++..-+++ +|++.+-+ --++-+-.-.-.++ ... -+..|= T Consensus 358 --G~~T~NE~R~-~~g~~p~~--ggD~~~~~~n~~~~~~~~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:93 358 --GYYTINDIRE-WEDLPPVE--GGDKPLISGDLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred --CCcCHHHHHH-HhCCCCCC--CcCeeeecccccccccchhhcccccCCCCCcCCC Confidence 3667777775 46765432 00000000000000 000 000000 No 143 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=95.33 E-value=0.0023 Score=35.07 Aligned_cols=436 Identities=12% Similarity=0.097 Sum_probs=172.7 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCC---ceeecccccccc--------ccccc-ccccccccc Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDG---ATEVDSQDIAYN--------GVFQK-LYGSQDPTA 68 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg---~~~i~~~~~a~~--------g~~~~-~~~~~~~~~ 68 (520) ..-....+..=|.--.-- . - -.+..++-|+---+ +.++..+...++ =.+++ ++.++.-.. T Consensus 35 ~~~~~~~~~~~~~~~~~~--~----~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 106 (695) T protein:vir:36 35 TAAAAQPVPADFARRGAL--N----A--LDAAPVVEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVT 106 (695) T ss_pred hhccccccchhhhhcccc--c----c--cccccccCCCcccccceeceecccccCccccchhhhhhcccccccccchhhh Confidence 111111222111110000 0 0 01111122221111 011221111110 00011 111110000 Q ss_pred hhHHHHHHHHHHHhhccchhHHHHhhhceeeEe------cCCCcE----EEEeeccchhhh-HHHHHHHHHHHHHHHHhc Q lcl|NC_018087. 69 TSTRELINTYRSLLNNYEVDNAVQEIVSDAIVY------EEGFDV----VSIDLDQTAFTE-NIRNLISDEFNSVLNMLN 137 (520) Q Consensus 69 ~~~~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~------d~~~~~----V~l~Ld~~~~s~-~ik~~I~eeF~~i~~ll~ 137 (520) .+-+==+-.--.|||+||.+.+++-|..||+-. ....+. +++.-+..+-++ .-.++|..|++.+ ++ T Consensus 107 ~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~erL-~V-- 183 (695) T protein:vir:36 107 SSGFPGFPTLVLLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERL-RI-- 183 (695) T ss_pred ccCcchHHHHHHHhhccchhhHHHHHHHHhhcccceecccchhhhhhccccccccccccCchHHHHHHHHHHHHH-HH-- Confidence 000111223456899999999999999999543 222211 222222222122 2334666676632 22 Q ss_pred chhhhHHHHHhhccccc--eeEEEeeec------------CCCCCCeeeeEecCccceeeeeeccCCCCcccccccceec Q lcl|NC_018087. 138 FQRKGSDHFKRWYVDSR--VFFHKIINP------------NRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYRE 203 (520) Q Consensus 138 f~k~g~~~fRrWYvDgr--i~~hkvid~------------~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~e 203 (520) ..+..+.++-=-+-|. +|+..-=|. +-+|.+++.|+.|||..+.+- ..+.++-+.+ T Consensus 184 -~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~---------~~n~~dP~sp 253 (695) T protein:vir:36 184 -RDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPN---------NYNSINPVAD 253 (695) T ss_pred -HHHHHHHHHhhccccceEEEEEeccCccccccccccccccccCcceeeeEeecccccccc---------hhhhccchhh Confidence 3333333322222232 233332221 123456777999998766661 1111111111 Q ss_pred ceeecCcccccccccceecCCcceecCcccEEEeecccccCC------CCcchhhhHHHHHHHHH-HHHHHHHHHHHHHh Q lcl|NC_018087. 204 YFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDCC------GKNIIGYLHRAVKPANQ-LKLLEDAMMIYRIT 276 (520) Q Consensus 204 y~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~~------~~~~~syL~~aik~~Nq-L~m~EDalVIyRi~ 276 (520) ..|.|+ .|... +.+||.+=++... |---|+ ++..+|.+..+..-..+ +++...+.=+- . T Consensus 254 -dfgkP~--~y~V~--------G~kIH~SRL~~f~-g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li--~ 319 (695) T protein:vir:36 254 -DFYKPS--TWWMI--------GTEVHATRLHTIV-SRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIV--K 319 (695) T ss_pred -ccCCCc--eEEEe--------ceEEeeeeEEEec-CCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHH--H Confidence 111111 11111 1144444433322 211111 33345655555543332 33333322221 1 Q ss_pred cCccceEEEccCC-CCchHHHHHHH--HHHHHhhcceeEeecCCCccccccccch-hhhhhcccccCCCCCcceeecCCC Q lcl|NC_018087. 277 RAPDRRVFYIDTG-NMPARKAAQHM--QHIMNSHRNRISYDARTGKVKNQANMMA-LTEDYWLQRRDGKAVTEVETLPGM 352 (520) Q Consensus 277 RApeRRvFyIDvG-nlpk~KAeqyl--~~im~~~knklvYd~~TGev~d~~~~ms-mlEDywLpRReGgrgTEIsTLpGg 352 (520) ++.-+ ++-.|.. -|.....++.. -+++++||.-. |-+ .+. =.|||- .++ T Consensus 320 ~~~v~-~lk~dla~aL~~g~~~~l~~R~eli~~~Rsn~------G~~-----llDk~~Eefe----------q~s----- 372 (695) T protein:vir:36 320 QFSVS-GILMDLAQALMPGANVDLSMRAELINRYRDNR------NIL-----FLDKATEEFF----------QFN----- 372 (695) T ss_pred hhhHH-HHHHHHHHhhcChhHHHHHHHHHHHHHhcCcc------ceE-----EEecCCcceE----------EEe----- Confidence 11111 0011210 00111122222 25566776311 110 000 013443 222 Q ss_pred CCcChHHHHH-HHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHh Q lcl|NC_018087. 353 TGMNEMDDIL-YFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKS-----NLLL 426 (520) Q Consensus 353 ~nLgei~DV~-YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~-----QLiL 426 (520) .+|+-++||. =|..-+=-+.+||+.||=..+.-++ ..++| -|.-.|...|..+|. ..+..+|++ |+-+ T Consensus 373 tslSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGl-NATGE--~D~rnYYD~I~s~Qe---~~L~p~L~rl~~ii~rS~ 446 (695) T protein:vir:36 373 TPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGL-NASSE--GEIRVWYDYVRAYQR---NALQQLMNDVIVMIQLSL 446 (695) T ss_pred cccCCHHHHHHHHHHHHHhhhcCchhhhhccCcccc-cccch--hhHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHh Confidence 4788899975 4888888899999999865553333 12221 244558888888775 334444433 4444 Q ss_pred cCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHh Q lcl|NC_018087. 427 KRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEE 506 (520) Q Consensus 427 kgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E 506 (520) -|.+. ..|.|+|+.=..-+|..-+||...+.+.....-.- --++.+-|+..+ .+|.+=-= -..++ + T Consensus 447 ~G~id--------pdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~--gvI~~~evr~rL--~~d~~s~Y-~~~~D-~ 512 (695) T protein:vir:36 447 FGAVD--------PSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQE--QVIRPDQVAARL--NTEPDGPY-AGKLD-A 512 (695) T ss_pred cCCCC--------CcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHh--cCCCHHHHHHHH--hcCCCccc-ccccc-c Confidence 45443 45889999888888888899988888764443111 012333333221 00000000 00000 1 Q ss_pred hhcCCccCCccccC Q lcl|NC_018087. 507 LSDKIFNPPEPEEI 520 (520) Q Consensus 507 ~~~~~~~~p~~e~~ 520 (520) ..+ ...|.++|| T Consensus 513 ~d~--p~~~~~~~~ 524 (695) T protein:vir:36 513 NDD--PGVPADDDI 524 (695) T ss_pred ccC--CCcCccchh Confidence 111 112333444 No 144 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=95.27 E-value=0.0024 Score=34.94 Aligned_cols=419 Identities=11% Similarity=0.062 Sum_probs=174.7 Q ss_pred chhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccc---cchhHHHHHH-------- Q lcl|NC_018087. 8 DLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDP---TATSTRELIN-------- 76 (520) Q Consensus 8 ~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~---~~~~~~~LI~-------- 76 (520) +|+.=.|-.-.|-..+-+ . .+-+-. ++.|.- ...-+. ...-...+|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~---~---~~~~~~---------------n~~~~~-~~~e~~~~~~~~~i~~~i~~~~~~~~~ 58 (511) T protein:vir:99 1 MLKVNEFETDTDLRGNIN---Y---LFNDEA---------------NVVYTY-DGTESDLLQNVNEVSKYIEHHMDYQRP 58 (511) T ss_pred Cccccchhhhhhhhhhhh---h---hhhhhh---------------CCcccc-chhhhhhhccHHHHHHHHHHHHHhhHH Confidence 444444444333332111 0 010000 111111 101111 1111222222 Q ss_pred HHHHHhhccchhHHH----------------------HhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHH Q lcl|NC_018087. 77 TYRSLLNNYEVDNAV----------------------QEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLN 134 (520) Q Consensus 77 ~YR~ma~~pEvd~Ai----------------------~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ 134 (520) +|+.+..+.+=...+ .-||+-.+-+= -..||++..++.+ ..+.+..+++ T Consensus 59 r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl-~g~p~~~~~~d~~--------~~~~l~~~~~ 129 (511) T protein:vir:99 59 RLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYF-LGNPIQYQDDDKD--------VLEAIEAFND 129 (511) T ss_pred HHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHHhhh-cccCceeecCchH--------HHHHHHHHHh Confidence 344454443322211 11222111111 1256666655432 3345566666 Q ss_pred HhcchhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCC--CCccccccc----------cee Q lcl|NC_018087. 135 MLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTK--MENGVKVVK----------GYR 202 (520) Q Consensus 135 ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~--~~~~~~~~~----------~~~ 202 (520) --+|+....++.+...+-|+-|.+.-.|. +|-..+..+||+.+-+|.+-... ..-.++.+. .+. T Consensus 130 ~n~~~~~~~~~~~~~~i~G~a~~~vy~de----d~~~~i~~~~p~~~~~vyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~ 205 (511) T protein:vir:99 130 LNDVESHNRSLGLDLSIYGKAYELMIRNQ----DDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVF 205 (511) T ss_pred hcCHhHHHHHHHHHHHhcCeeEEEEEeCC----CCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecccCccceEE Confidence 67889999999999999999999877663 36788999999999998653321 112222110 011 Q ss_pred cceeecCcc-cccccccceec----------CCcceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHHHH-HHHH Q lcl|NC_018087. 203 EYFLYDTEL-ESYQCGHQHFA----------AGTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLL-EDAM 270 (520) Q Consensus 203 ey~~y~~~~-~~~~~~~~~~~----------~~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~-EDal 270 (520) .+-+|++.. ..|..++.... ++.-=.|| .|.|+. +....|=++..+.....+..+ =+.. T Consensus 206 ~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vP--vv~~~n-------n~~g~sd~e~v~~liDa~d~~~S~~~ 276 (511) T protein:vir:99 206 TVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMP--ITEFSN-------NERRKGDYEKVITLIDLYDNAESDTA 276 (511) T ss_pred EEEEEeCCcEEEEEecCCccccccccccccccCCCCccc--eEEecC-------CCCCCCchhhhHHHHHHHHHHHHHHH Confidence 112455432 22211111100 00000222 122222 222345455444444433322 1222 Q ss_pred HHHHHhcCccceEEE---ccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhccccc--------C Q lcl|NC_018087. 271 MIYRITRAPDRRVFY---IDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRR--------D 339 (520) Q Consensus 271 VIyRi~RApeRRvFy---IDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRR--------e 339 (520) ..-+-++.|-+-+.- .|.+.+++.+ ... ..|++.. . T Consensus 277 ~~~~~~~~~~lv~~G~~~~~~~~~~~~~----------------------------~~~-----~~~~~~~~~~~~~~~~ 323 (511) T protein:vir:99 277 NYMSDLNDAMLLIKGNLNLDPVEVRKQK----------------------------EAN-----VLFLEPTVYADSEGRE 323 (511) T ss_pred HHHHHhhchhhhhccCcccCchhhcccc----------------------------ccc-----ceeccccccccccccc Confidence 222334444433321 1111111100 000 1222221 1 Q ss_pred CCCCcceeecCCCCCcChHHH-HHHHHHHHHHhcCCChhhccCCCccccccccch--hhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 340 GKAVTEVETLPGMTGMNEMDD-ILYFRKALYMALRVPLSRIPDEQTQNVFDMSTA--ISRDELSFDKFISELQHKFEEIF 416 (520) Q Consensus 340 GgrgTEIsTLpGg~nLgei~D-V~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~e--ItRDElkF~KFI~rLr~rFs~if 416 (520) .+.|..+..|-...+...... +.-+.+.+|+...+|---.+ ++. |.++. |..-...-..-+.+.++.|..-+ T Consensus 324 ~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~---~~~--gn~Sg~Alk~~~~~l~~ka~~k~~~~~~~l 398 (511) T protein:vir:99 324 TEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDD---NFS--GTQSGEAMKYKLFGLEQRTKTKEGLFTKGL 398 (511) T ss_pred CCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccc---ccc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 122345555554444443333 56667788888888862221 111 22222 22222233444556666666666 Q ss_pred HHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCH--- Q lcl|NC_018087. 417 LSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSD--- 493 (520) Q Consensus 417 ~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tD--- 493 (520) ...++.=+-+-++...-++..-...+.+.|....--.+ .+.++++..+. | .+|.+++++. |...+ T Consensus 399 ~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~-------~e~~~~~~kl~---G-iiS~et~l~~-l~~v~D~~ 466 (511) T protein:vir:99 399 RRRAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPKSL-------IEELKAYIDSG---G-KISQTTLMSL-FSFFQDPE 466 (511) T ss_pred HHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCH-------HHHHHHHHHHh---c-cCCHHHHHHh-CCCCCCHH Confidence 66555422222222222222222346777764433333 22334455553 3 3799999987 55543 Q ss_pred HHHHHHHHHHHHhhhcC---CccCCccccC Q lcl|NC_018087. 494 EDIAAERKLIDEELSDK---IFNPPEPEEI 520 (520) Q Consensus 494 eeI~~~~kqi~~E~~~~---~~~~p~~e~~ 520 (520) +|++.+.++-+++.+.. ..++++..+- T Consensus 467 ~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 496 (511) T protein:vir:99 467 LEVKKIEEDEKESIKKAQKNMYQDPRNIND 496 (511) T ss_pred HHHHHHHHHHHHHHHHHhhcccccCCCCCC Confidence 45555444433322211 1212111111 No 145 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=95.13 E-value=0.0027 Score=34.66 Aligned_cols=395 Identities=14% Similarity=0.134 Sum_probs=167.3 Q ss_pred CCCcccCCCCCCCceeecccccccccccccccccccccchhHH-----------HHHHHHHHHhhccchhH--------- Q lcl|NC_018087. 30 KAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTR-----------ELINTYRSLLNNYEVDN--------- 89 (520) Q Consensus 30 ~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~-----------~LI~~YR~ma~~pEvd~--------- 89 (520) -..-+.-|....-+.++ .-.++....... .-+.+|+.+..+++-.. T Consensus 1 ~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~~~ 66 (474) T protein:vir:95 1 MFNIIRMPWDKPYGEEV--------------VEQLKPQFETQEEMIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKV 66 (474) T ss_pred CcceeecCCCCchhhHH--------------HHhhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccCchhcccccc Confidence 11122222222111110 000011111111 11223444433332221 Q ss_pred ------------------HHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhcc Q lcl|NC_018087. 90 ------------------AVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYV 151 (520) Q Consensus 90 ------------------Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYv 151 (520) -...||+-.+-+ --..||++..++.+..+ .+.+ ++. =+|+....++.+.+.+ T Consensus 67 ~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~-l~g~p~~~~~~d~~~~~----~l~~----~~~-n~~~~~~~e~~~~~~~ 136 (474) T protein:vir:95 67 DVYGNIDYDKPDWRITTNFHQNLVDQKVSY-VASKPVTYSCEDESVLK----IIHD----VLD-TRWDNKLIDILTATSN 136 (474) T ss_pred ccccccccccccceeccchHHHHHHHHHhh-hccCCceeccCchHHHH----HHHH----HHh-ccHHHHHHHHHHHHhh Confidence 111222222211 12367777776644333 3332 222 3678888999999999 Q ss_pred ccceeEEEeeecCCCCCCeeeeEecCccceeeeeecc--CCCCcccccc--cceecceeecCc-ccccccccceecC--- Q lcl|NC_018087. 152 DSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELD--TKMENGVKVV--KGYREYFLYDTE-LESYQCGHQHFAA--- 223 (520) Q Consensus 152 Dgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~--~~~~~~~~~~--~~~~ey~~y~~~-~~~~~~~~~~~~~--- 223 (520) -|+-|.+.-+| + +|-..+..+||+++.+|.+-. .+..-.++.+ .+...+-+|++. ...|........+ T Consensus 137 ~G~~~~~v~~d-~---~~~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~ 212 (474) T protein:vir:95 137 KGIDWLQVYIN-E---NGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYY 212 (474) T ss_pred cCcEEEEEEec-C---CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEEcCeeEEEEEeCCeEEEEEEcCCccccccc Confidence 99988886655 2 366789999999998874321 1111112111 111122233222 1111111100000 Q ss_pred ------------CcceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCC Q lcl|NC_018087. 224 ------------GTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLK-LLEDAMMIYRITRAPDRRVFYIDTGN 290 (520) Q Consensus 224 ------------~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~-m~EDalVIyRi~RApeRRvFyIDvGn 290 (520) ..-=+|| |+++ .++....|=++..+.....+. ++-+....-+-.+.|-+-+.-.+... T Consensus 213 ~~~~~~~~~~~~~~~g~iP---vv~~------~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~ 283 (474) T protein:vir:95 213 YGANHIQSHFSNGNWGRVP---FIAF------KNNPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQD 283 (474) T ss_pred cCcccccccccccCCCccc---eEee------cCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccc Confidence 0000222 2221 112233455666555555554 45555555667777765544332221 Q ss_pred CchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHH-HHHHHHHHHH Q lcl|NC_018087. 291 MPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMD-DILYFRKALY 369 (520) Q Consensus 291 lpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~-DV~YF~kkLy 369 (520) ... ...-+..++ . +++ +++. +++.|-...+++... -+.-+.+.+| T Consensus 284 ~~~------~~~~~~~~~-----------------~------i~~---~~~~--~~~~l~~~~~~~~~~~~~~~l~~~i~ 329 (474) T protein:vir:95 284 LEE------FMRGLKYYK-----------------A------INV---DGDG--GVETIQVEVPVSSTKEYIDLMRAYIM 329 (474) T ss_pred chh------hhhhhhccc-----------------e------eec---cCCC--ceeEEeecCCHHHHHHHHHHHHHHHH Confidence 110 000010000 0 111 1222 344444444554433 3466778889 Q ss_pred HhcCCChhhccCCCccccccccc--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEee Q lcl|NC_018087. 370 MALRVPLSRIPDEQTQNVFDMST--AISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFH 447 (520) Q Consensus 370 ~aL~VP~SRl~~~~~~~~~G~~~--eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~ 447 (520) ...++|- +.+++.. |..+ .|..-...-..-+.+.+..|...+.++++.=+-+-|+ ..+| ..|.+.|. T Consensus 330 ~~s~~p~--~~~~~~~---~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~--~~d~----~~i~v~f~ 398 (474) T protein:vir:95 330 EFGQGVD--FQTDKFG---SAPSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNL--KMDV----KDIEISFN 398 (474) T ss_pred HHhCCcc--ccccccc---ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Cccc----ceeeEEec Confidence 9999993 3222211 2222 2333333333446666677777666666543333343 2333 45677775 Q ss_pred ccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhh----cC-CccCC-----cc Q lcl|NC_018087. 448 KNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELS----DK-IFNPP-----EP 517 (520) Q Consensus 448 ~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~----~~-~~~~p-----~~ 517 (520) ...--.+.. .++++.++ | .+|.+++++. |..+++ -+++-++|++|.. +. .+.++ ++ T Consensus 399 ~~~p~d~~e-------~a~~~~~~----g-~iS~et~i~~-l~~v~d-~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~ 464 (474) T protein:vir:95 399 FNRMMNDAE-------QSQIIAQS----Q-YLSRETLVKS-SPLVDD-YKAELERIEQEQMEYNKQLPNLDDGGADGAQQ 464 (474) T ss_pred cCCCcCHHH-------HHHHHHhc----C-CCchHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHhcccccccccCCCCcC Confidence 444433322 22334443 3 4799999976 555432 1233334444432 11 11111 11 Q ss_pred ccC Q lcl|NC_018087. 518 EEI 520 (520) Q Consensus 518 e~~ 520 (520) ++. T Consensus 465 ~~~ 467 (474) T protein:vir:95 465 QER 467 (474) T ss_pred CCC Confidence 111 No 146 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=94.81 E-value=0.0034 Score=34.09 Aligned_cols=386 Identities=11% Similarity=0.029 Sum_probs=177.4 Q ss_pred ceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHH Q lcl|NC_018087. 43 ATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIR 122 (520) Q Consensus 43 ~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik 122 (520) -++++++....++.....+.++.. ...+++|-|..||.-|.+.+.-. |+.|.-.+.+.. T Consensus 1 ~~~~~~~~g~~~~~~~~~~~~~~~------------~~~~~~~~V~acV~~Ia~~iA~l-----pl~l~~~~~~~~---- 59 (723) T protein:vir:94 1 MTTFPSGAGGWNAWSADSVFGNGA------------KGWSNSAVAYRCISMLANNAASV-----DLVVRGPDGELD---- 59 (723) T ss_pred CcccccCCCccccccccccccccH------------HHHhhhHHHHHHHHHHHHhhccc-----eeEEEcCCCccc---- Confidence 233332211111111111111111 12467899999999998876533 222221111111 Q ss_pred HHHHHHHHHHHHHh----cchhhhHHHHHh----hccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcc Q lcl|NC_018087. 123 NLISDEFNSVLNML----NFQRKGSDHFKR----WYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENG 194 (520) Q Consensus 123 ~~I~eeF~~i~~ll----~f~k~g~~~fRr----WYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~ 194 (520) +-..++.+| |-..++.++.+. +...|.-|..++.+..+...-+.+|..|+|+.+..+..- ++ T Consensus 60 -----~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~-----~~ 129 (723) T protein:vir:94 60 -----ELHPLSQLWNVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATR-----AA 129 (723) T ss_pred -----hhhHHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecC-----CC Confidence 112344444 334455554444 567899999988775555555789999999877664221 11 Q ss_pred cccccceecceeecCcccccccccceecCCcceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 195 VKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYR 274 (520) Q Consensus 195 ~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyR 274 (520) ........-.|.|... .+..+.++.+.|+|.. +.-..++-..+|-|..|.+++.....+++.. .+ T Consensus 130 ~~~~~~~~~~y~~~~~------------~G~~~~~~~~dIiHir-~~~~~dg~~G~Spi~~a~~~i~~~~aa~~~~--~~ 194 (723) T protein:vir:94 130 DAVPQAQIIGYVIERT------------DGVRVPVLADEMLWLR-FSDPYDPLAVMAPWKAARAAVDADFYAATWQ--RQ 194 (723) T ss_pred ccceeeeeeEEEEEec------------CceeEEecccceEEec-CCCCCCCcccccHHHHHHHHHHHHHHHHHHH--HH Confidence 1111111111221110 1223578889998874 2212355567899999999998888887754 33 Q ss_pred HhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhh---hhcccccCCCCCcceeecCC Q lcl|NC_018087. 275 ITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTE---DYWLQRRDGKAVTEVETLPG 351 (520) Q Consensus 275 i~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlE---DywLpRReGgrgTEIsTLpG 351 (520) .++-=-+--..|-++++.+..+++..+.+...|.- .+|.-+.+ +++ .-+...- .|++++.|. T Consensus 195 ~f~NG~~p~giL~~~~l~~e~~~~~~~~~~~~~~G----------~~Nagk~~-vL~g~~~~~~vl~---~G~~~~~l~- 259 (723) T protein:vir:94 195 SFKNGARPGGVVNLGDMDEQTFTKTVAAFRSQVEG----------VQNAGRHL-LIAGQGSDGGAAG---KGATFTSLS- 259 (723) T ss_pred HHhcCCCcceEEEcCCCCHHHHHHHHHHHHHHhhc----------hhhcCcce-eeccccccccccc---CCceEEEcc- Confidence 34332222223334567666665544444443331 11111222 111 1111222 245565553 Q ss_pred CCCcChHHHH---HHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_018087. 352 MTGMNEMDDI---LYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKR 428 (520) Q Consensus 352 g~nLgei~DV---~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkg 428 (520) .+.-++.-+ +|-.+..-++.+||..-|..++.++ .+.-..+.|..+ .|+-.+ ..+.+.|-..|+ T Consensus 260 -~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~~st~s------N~e~~~~~f~~~--tL~P~~-~~ie~~ln~~Ll--- 326 (723) T protein:vir:94 260 -MSPAEMDYINSRMHSAEEVMLAFGIRKDALLGGSTYE------NQAEAKAAVWTE--TLIPQM-EVMASITDLQLL--- 326 (723) T ss_pred -CCHHHHHHHHHHHHhHHHHHHHhCCChhHcCCCCCcc------cHHHHHHHHHHH--HHHHHH-HHHHHHHhHhhc--- Confidence 333344333 3445668999999987775332211 122223345432 233322 223334444443 Q ss_pred CCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHH--H-- Q lcl|NC_018087. 429 VITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLI--D-- 504 (520) Q Consensus 429 i~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi--~-- 504 (520) +..+| .+.++|..+-.... =+..|.+.++.+-. .-+++.+-++. +|++.+-+= -+.++ . T Consensus 327 --~~~g~-----~~~~~f~~~~lLr~-----D~~~r~~~~~~~v~--~G~~T~NE~R~-~lglpPi~g--Gd~~~~~~p~ 389 (723) T protein:vir:94 327 --PDIGW-----TVEWDFNSVPALQE-----DLEAQAGRNQGYLV--NDVLMVDEVRA-TIGLDPLPG--GIGQMTLTPY 389 (723) T ss_pred --ccccC-----ceEEeecchhhhhc-----CHHHHHHHHHHHHh--CCCcCHHHHHH-HhCCCCCCC--Ccccceeccc Confidence 22223 36677763322211 12345555544322 23667777774 477754310 00000 0 Q ss_pred --HhhhcCCccCCccccC Q lcl|NC_018087. 505 --EELSDKIFNPPEPEEI 520 (520) Q Consensus 505 --~E~~~~~~~~p~~e~~ 520 (520) +-.+.+ -+.|..+|= T Consensus 390 ~~~~a~~~-~~~p~~~e~ 406 (723) T protein:vir:94 390 RAQFAPAP-APAPAVEEG 406 (723) T ss_pred cccccCCC-CCCccchhh Confidence 000000 112222222 No 147 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=94.79 E-value=0.0035 Score=34.05 Aligned_cols=441 Identities=11% Similarity=0.094 Sum_probs=178.5 Q ss_pred Cccccc--cchhhhcchhh--hhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHH Q lcl|NC_018087. 1 MSMLAD--SDLKMFAFWHK--VDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELIN 76 (520) Q Consensus 1 ~~~~~~--~~l~~f~~~~~--~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~ 76 (520) |..+-| +...||.=--. ..-.++++..-++++-+..-...|...-++..++-+.|.. .....++...+ . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~------~~~~~~~~~~~-~ 73 (535) T protein:vir:10 1 MAILKDLRNAFSLSNKKSTSYIELGDYDKDIVNKAIRPGRASARDTVDGIDIADGNVAGQY------SVASISDVLST-K 73 (535) T ss_pred ChhhHHHHHHHHhhhhhhhhhHHHhhhhHHHHHhhhhhhhhhhhccccccccccCCccccc------ccCccccccCH-H Confidence 554444 22222211000 0011122211122221111111222222222111111211 11111121111 3 Q ss_pred HHHHH-hhccchhHHHHhhhceeeEec------CCCcEEEEeeccch--hhhHHHHHHHHHHHHHHHHhc----chhhhH Q lcl|NC_018087. 77 TYRSL-LNNYEVDNAVQEIVSDAIVYE------EGFDVVSIDLDQTA--FTENIRNLISDEFNSVLNMLN----FQRKGS 143 (520) Q Consensus 77 ~YR~m-a~~pEvd~Ai~eIvneaiv~d------~~~~~V~l~Ld~~~--~s~~ik~~I~eeF~~i~~ll~----f~k~g~ 143 (520) .++.+ ..+|-|..||+.+.+.+.++- ...-.+.|.|.+.. -++.-+. +...+.++|. -...+. T Consensus 74 ~l~~~~~~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~----~~~~l~~lL~~~PN~~~~~~ 149 (535) T protein:vir:10 74 KLLKAYADNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIK----RAHEIEDFIYNTGSEYYEWR 149 (535) T ss_pred HHHHHhccChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhh----hhhHHHHHHHhCCCCCCChh Confidence 34444 468989999999888766531 11122233333322 2222222 2233334442 222344 Q ss_pred HHHHhh--------ccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccc Q lcl|NC_018087. 144 DHFKRW--------YVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQ 215 (520) Q Consensus 144 ~~fRrW--------YvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~ 215 (520) +.|+.| ++-|.-.|-.|+- +....+++|.+|||.+|++..+..... ....|+.|... T Consensus 150 ~~~~~~~~~lv~d~l~~~g~ay~~i~r--~~~G~~~~L~~l~p~~V~v~~d~~~~~--------~~~~~~~~~~~----- 214 (535) T protein:vir:10 150 DTFPRLLTKIINDMYVQDQINIERIFK--NDSNELDHFNAVDASKVVISYSPRSKD--------QPRKFEQFVSE----- 214 (535) T ss_pred HHHHHHHHHHHHHHHhhCCceEEEEEE--CCCCcEEEEEEeCCceeEEEEcCcccc--------CceEEEEEecC----- Confidence 333222 2223343433332 233349999999999999864433211 11123333221 Q ss_pred cccceecCCcceecCcccEEEeecc-cccC-CCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCC--- Q lcl|NC_018087. 216 CGHQHFAAGTKIKIPYSAMVYAHSG-LVDC-CGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGN--- 290 (520) Q Consensus 216 ~~~~~~~~~~~~~I~~~aI~y~hSG-L~d~-~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGn--- 290 (520) +..++++.+.|+|..-. +.+. ++...+|-|+.|.+.+.....+++...=+----+--+-|..++.+. T Consensus 215 --------~~~~~~~~~eiih~~~~~~~~~~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~ 286 (535) T protein:vir:10 215 --------TKSVKFSERNLTFINYWNLSDTDRRGYGYSPVEASIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQ 286 (535) T ss_pred --------ceeEEECcccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcc Confidence 22357888888887521 1222 2345678899999999999888876654433334445677776432 Q ss_pred CchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHH Q lcl|NC_018087. 291 MPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALY 369 (520) Q Consensus 291 lpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy 369 (520) +.+..+ +=++ ..++... - | ..+..+. .+++. + |.++.-|--...-.| ++=..+..+..- T Consensus 287 ls~e~~-e~lk---~~~~~~~--~---G-~~nag~~-~vl~~------~---g~~~~~l~~~~~D~qfle~~~~~~~eIa 346 (535) T protein:vir:10 287 ANQMML-AGIR---RQWTSQG--S---G-LGGAWKI-PILAA------K---DAKFVNMTQNSRDMEFDKFLNFMIYDTA 346 (535) T ss_pred cCHHHH-HHHH---HHHHHHh--c---C-ccccccc-ccccC------C---CceEEecCCChhHHHHHHHHHHHHHHHH Confidence 322222 2222 2222211 0 1 1111111 23322 1 334444332222122 222446788899 Q ss_pred HhcCCChhhccCC--Cccccc-cccchhhHH--HHHHHHH----HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhh Q lcl|NC_018087. 370 MALRVPLSRIPDE--QTQNVF-DMSTAISRD--ELSFDKF----ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELN 440 (520) Q Consensus 370 ~aL~VP~SRl~~~--~~~~~~-G~~~eItRD--ElkF~KF----I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~ 440 (520) ++.+||...|... ++.+.. +.+....++ |-.+..| +.-+..++...+.. .| ++..+ . T Consensus 347 ~afgVPp~~lG~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie~~ln~----~L-----l~~~~-----~ 412 (535) T protein:vir:10 347 AIFQMQPEEINFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLSFIEQVIND----KI-----MRYVD-----T 412 (535) T ss_pred HHhCCCHHHhccccCcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhh----hc-----ccccC-----C Confidence 9999999888532 222111 111111111 2223333 33344444433332 22 22222 2 Q ss_pred ceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHH---HHH----HHHHH----HHhhhc Q lcl|NC_018087. 441 NIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDED---IAA----ERKLI----DEELSD 509 (520) Q Consensus 441 ~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDee---I~~----~~kqi----~~E~~~ 509 (520) .+.|+|.. +.... ...|.++.+.+.. -.++..-+++. +.|.+-+ +-- .+..+ ..+... T Consensus 413 ~~~f~f~~------l~~~d-~~~r~~~~~~~~~---g~lT~NE~R~~-~gl~piegGD~~~~~~~~~~~~~~~~~~~~~~ 481 (535) T protein:vir:10 413 DYRFSFTL------GDAQD-KLQEEQVWKLKLA---NGYFINEYRKD-HGLKTVDGLDVPGFIGSAENFINATGFGQPNV 481 (535) T ss_pred eEEEEecc------ccccC-HHHHHHHHHHHHc---CCCCHHHHHHH-hCCCCCCCccccccccchhhcccccccccccC Confidence 35566642 22222 2334554443321 23688888854 7775431 100 00000 000000 Q ss_pred CCccCCccccC Q lcl|NC_018087. 510 KIFNPPEPEEI 520 (520) Q Consensus 510 ~~~~~p~~e~~ 520 (520) +--.+|.++++ T Consensus 482 p~~~~~~~~~~ 492 (535) T protein:vir:10 482 PDSSDDSGSTL 492 (535) T ss_pred CCCCCCccccC Confidence 00001111111 No 148 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=94.42 E-value=0.0045 Score=33.47 Aligned_cols=401 Identities=13% Similarity=0.068 Sum_probs=187.9 Q ss_pred cccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeeccc-----ccccccccccccccccccchhHHHHHHH Q lcl|NC_018087. 3 MLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQ-----DIAYNGVFQKLYGSQDPTATSTRELINT 77 (520) Q Consensus 3 ~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~-----~~a~~g~~~~~~~~~~~~~~~~~~LI~~ 77 (520) |.+ ..+.--|-+++... |..... .+++.. ..+..|..... +.-++.....++. T Consensus 1 ~~~-~i~~~~g~~~~~~~----------------~~~~~~-~~ia~~~~~~~~~~~~~~~p~~----~~il~~~~~~~~~ 58 (491) T protein:vir:79 1 MSK-GLWVSPTEFVKFGE----------------PDKSLS-SQIATRARSIDFFALGMYLPNP----DPVLKALGKDIRV 58 (491) T ss_pred CCC-eeeCCCCCcccccc----------------cchhHH-HHHhhhccccccccccccCcch----hHHHhhccCCHHH Confidence 433 11111111221111 100000 011110 00111222222 1112222224689 Q ss_pred HHHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeE Q lcl|NC_018087. 78 YRSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFF 157 (520) Q Consensus 78 YR~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~ 157 (520) |++|..++.|.++++.+..-+.-.+=. |.-... ++.+.+.|++-| +-++|+.-..++..- ..-|--++ T Consensus 59 y~~m~~D~~i~s~l~~Rk~av~~~~w~-----i~~~~~--~~~~a~~i~e~l----~~~~~~~~i~~~lda-~~~G~s~~ 126 (491) T protein:vir:79 59 YRELRADAHVGGCVRRRKAAVKALEWG-----LDRGKA--KSRVAKSIADVF----ADLDLSRIATEMLDA-VLYGYQPM 126 (491) T ss_pred HHHHhhChHHHHHHHHHHHHHhCCCcE-----EecCCC--CHHHHHHHHHHH----hcCCHHHHHHHHHHh-hhhcceeE Confidence 999999999999999998766543221 211111 222334444433 334666655554432 23577888 Q ss_pred EEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcc-cEEE Q lcl|NC_018087. 158 HKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYS-AMVY 236 (520) Q Consensus 158 hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~-aI~y 236 (520) ++++..++..-.+.+|...+|+.+.+-+ .++.. +.+. .....++.+|+. -|+| T Consensus 127 Ei~w~~~~g~~~~~~l~~r~~~~f~~d~------~~~l~----------l~~~----------~~~~~g~~lp~~k~i~~ 180 (491) T protein:vir:79 127 EITWGKVGNYIVPIDVVGKPADWFVYDP------ENQLR----------FRSK----------EHWVQGEELPARKFLVP 180 (491) T ss_pred EEEEeecCCeeeEEeeeeecccceeecc------CCceE----------Eeec----------CCCCCceeecCCCeEEE Confidence 8888876655556677777776665521 11111 0000 011223455554 4666 Q ss_pred eecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHh-cCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeec Q lcl|NC_018087. 237 AHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRIT-RAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDA 315 (520) Q Consensus 237 ~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~-RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~ 315 (520) +|. -+..+....|-|+.|..+|-=.+......+.+=-. =.|- |+...|-|.-.+.|. ..++.+. ...+ | T Consensus 181 ~~~--~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~-~igky~~~a~~~ek~-~l~~al~-~~~~----~- 250 (491) T protein:vir:79 181 RQE--ATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPM-LVGKHPRSASDAETN-LLLDRLE-DMVQ----D- 250 (491) T ss_pred Eec--CCCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCe-EEEecCCCCCHHHHH-HHHHHHH-HHhc----C- Confidence 662 22233445788999988887777776666655443 3444 455557776555443 2222222 2221 1 Q ss_pred CCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh----HHHHHHHHHHHHHhcCCChhh-ccCCCccccccc Q lcl|NC_018087. 316 RTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE----MDDILYFRKALYMALRVPLSR-IPDEQTQNVFDM 390 (520) Q Consensus 316 ~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge----i~DV~YF~kkLy~aL~VP~SR-l~~~~~~~~~G~ 390 (520) .| -. +| .|++|+.+.-+..-|. ..=++|..++.-+++- +- |.++++++ + T Consensus 251 -a~------~v--------iP-----~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iL---GqtlTt~~~gs---~ 304 (491) T protein:vir:79 251 -AV------AV--------IP-----DDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIALL---GQNQTTEATST---R 304 (491) T ss_pred -eE------EE--------ec-----CCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHHh---hhhhccCcccc---h Confidence 01 11 22 3688888853322232 2338888888877762 11 33443322 2 Q ss_pred c-chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHH-HHHHHHHHHH Q lcl|NC_018087. 391 S-TAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTI-EITERRVNVL 468 (520) Q Consensus 391 ~-~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~-Ei~~~R~~~~ 468 (520) + +++-. | -+...++..++..+..|..+++-=+.+.+-.. ..+.+.|... .+. +...+++..+ T Consensus 305 a~~~vh~-~-v~~~i~~~D~~~i~~tln~li~~l~~~N~~~~--------~~p~f~~~e~------ee~~~~~a~~~~~L 368 (491) T protein:vir:79 305 ASAQAGL-E-VTDDIRDGDKAIVVEAMNMLIRWICDLNFDGA--------ARPVFDMWEQ------EQVDEIQAGRDEKL 368 (491) T ss_pred hhHHHHH-H-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC--------CcceEeecCc------CchhHHHHHHHHHH Confidence 2 33322 2 25667777777777777775554444444311 2344554422 221 2334444444 Q ss_pred HHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCC-----ccCCccccC Q lcl|NC_018087. 469 SLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKI-----FNPPEPEEI 520 (520) Q Consensus 469 ~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~-----~~~p~~e~~ 520 (520) ..+ |-=++.+|++++ +++...+.++...........+- ...|+.+.+ T Consensus 369 ~~~----G~~i~~~~~~e~-~Gip~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 420 (491) T protein:vir:79 369 TRA----GARFTPAYFKRA-YNLQDGDLDERPLPVSAVDAVGAASFAEFEAPDQDAL 420 (491) T ss_pred HhC----CCccCHHHHHHH-hCCCCCCCCccccCcCcccccccccccccCCCCCcch Confidence 443 434899999966 78765443332111111110000 000001111 No 149 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=94.40 E-value=0.0045 Score=33.44 Aligned_cols=425 Identities=11% Similarity=0.064 Sum_probs=173.3 Q ss_pred chhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccc----cchhHHHHHH------- Q lcl|NC_018087. 8 DLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDP----TATSTRELIN------- 76 (520) Q Consensus 8 ~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~----~~~~~~~LI~------- 76 (520) +|+.=.|-.-.+-.. .++. .+-.- .++.|. |.+.+. ...-..++|+ T Consensus 1 ~~~~~~~~~~~~~~~---~~~~---~~~~~---------------~n~~~~--~~~~e~~~~~~~~~i~~~i~~~~~~~~ 57 (511) T protein:vir:96 1 MLKVNEFETDTDLRG---NINY---LFNDE---------------ANVVYT--YDGTESDLLQNVNEVSKYIEHHMDYQR 57 (511) T ss_pred Cccccchhhhhhhhh---hhhh---hhhhh---------------hCCccc--ccchhhhhhcCHHHHHHHHHHHHHhhh Confidence 334333333222221 1111 00000 011111 111111 1111222332 Q ss_pred -HHHHHhhccchhHHH----------------------HhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHH Q lcl|NC_018087. 77 -TYRSLLNNYEVDNAV----------------------QEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVL 133 (520) Q Consensus 77 -~YR~ma~~pEvd~Ai----------------------~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~ 133 (520) +|+.+..+.+=...+ .-||+..+-+ --..||++..++.+ ..+.+..++ T Consensus 58 ~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~y-l~g~p~~~~~~d~~--------~~~~l~~~~ 128 (511) T protein:vir:96 58 PRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGY-FLGNPIQYQDDDKD--------VLEAIEAFN 128 (511) T ss_pred HHHHHHHHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhh-hcccCceeecCchH--------HHHHHHHHH Confidence 344444333322211 1222222111 12366777665542 334566777 Q ss_pred HHhcchhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCC--CCccccccc----------ce Q lcl|NC_018087. 134 NMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTK--MENGVKVVK----------GY 201 (520) Q Consensus 134 ~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~--~~~~~~~~~----------~~ 201 (520) .--+|+..-.++.+.+.+-|+-|.+.-+|. +|-..+..+||+.+.+|.+-... ..-+++.+. .+ T Consensus 129 ~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~----dg~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~ 204 (511) T protein:vir:96 129 DLNDVESHNRSLGLDLSIYGKAYELMIRNQ----DDETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDEV 204 (511) T ss_pred hhcChhHHHHHHHHHHHhcCeeEEEEEeCC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceE Confidence 777889999999999999999988876652 36788999999999998653221 122222111 01 Q ss_pred ecceeecCcc-cccccccceecC----------CcceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHHHH-HHH Q lcl|NC_018087. 202 REYFLYDTEL-ESYQCGHQHFAA----------GTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLL-EDA 269 (520) Q Consensus 202 ~ey~~y~~~~-~~~~~~~~~~~~----------~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~-EDa 269 (520) ..+-+|++.. ..|..++..... ..--.+| .|.|+. +....|=++..+.....+..+ -+. T Consensus 205 ~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vP--vv~~~n-------~~~g~gd~e~v~~liDa~~~~~S~~ 275 (511) T protein:vir:96 205 FTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMP--ITEFSN-------NERRKGDYEKVITLIDLYDNAESDT 275 (511) T ss_pred EEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccc--eEEecC-------CCCCCCchhhhHHHHHHHHHHHHHH Confidence 1223555442 222111111000 0000122 122221 112345455444444333321 111 Q ss_pred HHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccc-cccccchhhhh---hcccccCCCCCcc Q lcl|NC_018087. 270 MMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVK-NQANMMALTED---YWLQRRDGKAVTE 345 (520) Q Consensus 270 lVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~-d~~~~msmlED---ywLpRReGgrgTE 345 (520) ...-+.++.|-+-+.-....+ .++++ +....+-.+.. +.......+.|.. T Consensus 276 ~~~~~~~~~~~lv~~G~~~~~--------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (511) T protein:vir:96 276 ANYMSDLNDAMLLIKGNLNLD--------------------------PVEVRKQKEANVLFLEPTVYVDAEGRETEGSVD 329 (511) T ss_pred HHHHHHhhcchhheecCccCC--------------------------chhhcccccccceeccccceeccccccCCCCcc Confidence 122233344433332211010 00010 00001100000 0011112233445 Q ss_pred eeecCCCCCcChH-HHHHHHHHHHHHhcCCChhhccCCCccccccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 346 VETLPGMTGMNEM-DDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTA--ISRDELSFDKFISELQHKFEEIFLSPLKS 422 (520) Q Consensus 346 IsTLpGg~nLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~e--ItRDElkF~KFI~rLr~rFs~if~d~Lk~ 422 (520) +..|-...+...+ .-+.-+.+.+|.-.++|-- ..++ +. |..+. |..-...-..-+.+.++.|..-+...++. T Consensus 330 ~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~--~~~~-~~--~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l 404 (511) T protein:vir:96 330 GGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNM--KDDN-FS--GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKL 404 (511) T ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc--cccc-cc--cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5555554444433 3345567788888888852 2111 10 22222 22222223344555555566555555544 Q ss_pred HHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCH---HHHHHH Q lcl|NC_018087. 423 NLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSD---EDIAAE 499 (520) Q Consensus 423 QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tD---eeI~~~ 499 (520) =+-+-++...-++..-...+.+.|...---.+ .+.++++..+. | .+|.+++++. |..++ +||+.+ T Consensus 405 i~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~-------~e~~d~~~kl~---G-~iS~et~l~~-l~~v~d~~~El~ri 472 (511) T protein:vir:96 405 LETILKNTRSIDANKDFNTVRYVYNRNLPKSL-------IEELKAYIDSG---G-KISQTTLMSL-FSFFQDPELEVKKI 472 (511) T ss_pred HHHHHHhcCCCccccccccceEEeCCCCCcCH-------HHHHHHHHHHh---c-cCChHHHHHh-CCCCCCHHHHHHHH Confidence 22222222221122222356788865433333 22344555553 3 3799999976 66654 455554 Q ss_pred HHHHHHhhhcC---CccCCccccC Q lcl|NC_018087. 500 RKLIDEELSDK---IFNPPEPEEI 520 (520) Q Consensus 500 ~kqi~~E~~~~---~~~~p~~e~~ 520 (520) +++-+++.+.. ...+|...+= T Consensus 473 ~~E~~~~~~~~~~~~~~~~~~~~~ 496 (511) T protein:vir:96 473 EEDEKESIKKAQKGIYKDPRDIND 496 (511) T ss_pred HHHHHHHHHHHhhccccCCCCCCC Confidence 44433332211 1222222111 No 150 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=94.40 E-value=0.0045 Score=33.44 Aligned_cols=425 Identities=11% Similarity=0.064 Sum_probs=173.3 Q ss_pred chhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccc----cchhHHHHHH------- Q lcl|NC_018087. 8 DLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDP----TATSTRELIN------- 76 (520) Q Consensus 8 ~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~----~~~~~~~LI~------- 76 (520) +|+.=.|-.-.+-.. .++. .+-.- .++.|. |.+.+. ...-..++|+ T Consensus 1 ~~~~~~~~~~~~~~~---~~~~---~~~~~---------------~n~~~~--~~~~e~~~~~~~~~i~~~i~~~~~~~~ 57 (511) T protein:vir:78 1 MLKVNEFETDTDLRG---NINY---LFNDE---------------ANVVYT--YDGTESDLLQNVNEVSKYIEHHMDYQR 57 (511) T ss_pred Cccccchhhhhhhhh---hhhh---hhhhh---------------hCCccc--ccchhhhhhcCHHHHHHHHHHHHHhhh Confidence 334333333222221 1111 00000 011111 111111 1111222332 Q ss_pred -HHHHHhhccchhHHH----------------------HhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHH Q lcl|NC_018087. 77 -TYRSLLNNYEVDNAV----------------------QEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVL 133 (520) Q Consensus 77 -~YR~ma~~pEvd~Ai----------------------~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~ 133 (520) +|+.+..+.+=...+ .-||+..+-+ --..||++..++.+ ..+.+..++ T Consensus 58 ~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~y-l~g~p~~~~~~d~~--------~~~~l~~~~ 128 (511) T protein:vir:78 58 PRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGY-FLGNPIQYQDDDKD--------VLEAIEAFN 128 (511) T ss_pred HHHHHHHHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhh-hcccCceeecCchH--------HHHHHHHHH Confidence 344444333322211 1222222111 12366777665542 334566777 Q ss_pred HHhcchhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCC--CCccccccc----------ce Q lcl|NC_018087. 134 NMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTK--MENGVKVVK----------GY 201 (520) Q Consensus 134 ~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~--~~~~~~~~~----------~~ 201 (520) .--+|+..-.++.+.+.+-|+-|.+.-+|. +|-..+..+||+.+.+|.+-... ..-+++.+. .+ T Consensus 129 ~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~----dg~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~ 204 (511) T protein:vir:78 129 DLNDVESHNRSLGLDLSIYGKAYELMIRNQ----DDETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDEV 204 (511) T ss_pred hhcChhHHHHHHHHHHHhcCeeEEEEEeCC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceE Confidence 777889999999999999999988876652 36788999999999998653221 122222111 01 Q ss_pred ecceeecCcc-cccccccceecC----------CcceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHHHH-HHH Q lcl|NC_018087. 202 REYFLYDTEL-ESYQCGHQHFAA----------GTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLL-EDA 269 (520) Q Consensus 202 ~ey~~y~~~~-~~~~~~~~~~~~----------~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~-EDa 269 (520) ..+-+|++.. ..|..++..... ..--.+| .|.|+. +....|=++..+.....+..+ -+. T Consensus 205 ~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vP--vv~~~n-------~~~g~gd~e~v~~liDa~~~~~S~~ 275 (511) T protein:vir:78 205 FTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMP--ITEFSN-------NERRKGDYEKVITLIDLYDNAESDT 275 (511) T ss_pred EEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccc--eEEecC-------CCCCCCchhhhHHHHHHHHHHHHHH Confidence 1223555442 222111111000 0000122 122221 112345455444444333321 111 Q ss_pred HHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccc-cccccchhhhh---hcccccCCCCCcc Q lcl|NC_018087. 270 MMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVK-NQANMMALTED---YWLQRRDGKAVTE 345 (520) Q Consensus 270 lVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~-d~~~~msmlED---ywLpRReGgrgTE 345 (520) ...-+.++.|-+-+.-....+ .++++ +....+-.+.. +.......+.|.. T Consensus 276 ~~~~~~~~~~~lv~~G~~~~~--------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (511) T protein:vir:78 276 ANYMSDLNDAMLLIKGNLNLD--------------------------PVEVRKQKEANVLFLEPTVYVDAEGRETEGSVD 329 (511) T ss_pred HHHHHHhhcchhheecCccCC--------------------------chhhcccccccceeccccceeccccccCCCCcc Confidence 122233344433332211010 00010 00001100000 0011112233445 Q ss_pred eeecCCCCCcChH-HHHHHHHHHHHHhcCCChhhccCCCccccccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 346 VETLPGMTGMNEM-DDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTA--ISRDELSFDKFISELQHKFEEIFLSPLKS 422 (520) Q Consensus 346 IsTLpGg~nLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~e--ItRDElkF~KFI~rLr~rFs~if~d~Lk~ 422 (520) +..|-...+...+ .-+.-+.+.+|.-.++|-- ..++ +. |..+. |..-...-..-+.+.++.|..-+...++. T Consensus 330 ~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~--~~~~-~~--~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l 404 (511) T protein:vir:78 330 GGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNM--KDDN-FS--GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKL 404 (511) T ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc--cccc-cc--cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5555554444433 3345567788888888852 2111 10 22222 22222223344555555566555555544 Q ss_pred HHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCH---HHHHHH Q lcl|NC_018087. 423 NLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSD---EDIAAE 499 (520) Q Consensus 423 QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tD---eeI~~~ 499 (520) =+-+-++...-++..-...+.+.|...---.+ .+.++++..+. | .+|.+++++. |..++ +||+.+ T Consensus 405 i~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~-------~e~~d~~~kl~---G-~iS~et~l~~-l~~v~d~~~El~ri 472 (511) T protein:vir:78 405 LETILKNTRSIDANKDFNTVRYVYNRNLPKSL-------IEELKAYIDSG---G-KISQTTLMSL-FSFFQDPELEVKKI 472 (511) T ss_pred HHHHHHhcCCCccccccccceEEeCCCCCcCH-------HHHHHHHHHHh---c-cCChHHHHHh-CCCCCCHHHHHHHH Confidence 22222222221122222356788865433333 22344555553 3 3799999976 66654 455554 Q ss_pred HHHHHHhhhcC---CccCCccccC Q lcl|NC_018087. 500 RKLIDEELSDK---IFNPPEPEEI 520 (520) Q Consensus 500 ~kqi~~E~~~~---~~~~p~~e~~ 520 (520) +++-+++.+.. ...+|...+= T Consensus 473 ~~E~~~~~~~~~~~~~~~~~~~~~ 496 (511) T protein:vir:78 473 EEDEKESIKKAQKGIYKDPRDIND 496 (511) T ss_pred HHHHHHHHHHHhhccccCCCCCCC Confidence 44433332211 1222222111 No 151 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=94.10 E-value=0.0054 Score=33.01 Aligned_cols=443 Identities=10% Similarity=0.048 Sum_probs=173.9 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeeccc--c--cccccccccccccccccchhHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQ--D--IAYNGVFQKLYGSQDPTATSTRELIN 76 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~--~--~a~~g~~~~~~~~~~~~~~~~~~LI~ 76 (520) .|-+-...+++ .| .+....+|. .+..+..++++.+--...|... - .-+. -...+|.|--..+.+...... T Consensus 8 ~~~~~~~~~~~-~~-~~~~~~~~~---~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~-~l~~yY~g~~~~i~~~~~~~~ 81 (501) T protein:vir:27 8 DSTGQDLVLNL-RF-HRESRIRYR---ADNLEELMVNNWELLKNFINHHKLRQAPRIQ-ELLDYARGENHDVLQFGRRKD 81 (501) T ss_pred eccchhhhhhc-cc-ChhHHHhhc---cccccccccccHHHHHHHHHHHHHHHHHHHH-HHHHHhcCCCccccccCccCc Confidence 11111111110 00 000000000 0000000000000000000000 0 0000 000000000000000000000 Q ss_pred HHHH--HhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccc Q lcl|NC_018087. 77 TYRS--LLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSR 154 (520) Q Consensus 77 ~YR~--ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgr 154 (520) .++. -..++=..-+|+..+.-.+ .+||++..++.+-.+.+. +.+..++..-+|+....++.+...+-|+ T Consensus 82 ~~~~~~ki~~n~~k~Ivd~~~~yl~-----g~p~~~~~~d~~~~~~~~----~~l~~~~~~n~~~~~~~~~~~~~~~~G~ 152 (501) T protein:vir:27 82 REMADKRAVHNYGRMISKFKTGYLA-----GNPIRVEYDDNDNNSQND----DTIKRIGRINDIDSHNRTLIRDLSQTGR 152 (501) T ss_pred cccccceeccchHHHHHHHHhhhhc-----ccCeeEecCCccchHHHH----HHHHHHHHhcChhHHHHHHHHHHhhCCe Confidence 0000 0112333333333333221 367788777665445443 3445567778999999999999999999 Q ss_pred eeEEEeeecCCCCCCeeeeEecCccceeeeeeccCC--CCcccccc------cceecceeecCccc-ccccccceecCC- Q lcl|NC_018087. 155 VFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTK--MENGVKVV------KGYREYFLYDTELE-SYQCGHQHFAAG- 224 (520) Q Consensus 155 i~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~--~~~~~~~~------~~~~ey~~y~~~~~-~~~~~~~~~~~~- 224 (520) -|.+.-.|. +|=..+..+||+.+.+|.+-... ..-+++.+ +++..+-+|++... .+..++..-..+ T Consensus 153 a~~~vy~de----d~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~~~~vyt~~~v~~~~~~~~~~~~~~ 228 (501) T protein:vir:27 153 AYEVIYRNE----YDETRIKRLNPLETFVIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNEHIYTLDASDDFNEISV 228 (501) T ss_pred EEEEEEeCC----CCceEEEEEccceeEEEecCCCCCceEEEEEEEEeeecCCcEEEEEEEeCCeEEEEEeCCceeeccc Confidence 988877663 25567899999999988543321 11222211 11222334544321 111111000000 Q ss_pred --cce-ecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHH Q lcl|NC_018087. 225 --TKI-KIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKL-LEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHM 300 (520) Q Consensus 225 --~~~-~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m-~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl 300 (520) ..+ +|| |++. +|+....|=++.++.....+.. +=+....-+.++.|-+-+.-.+..+.+.. T Consensus 229 ~~~~~g~vP---vv~~------~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~------ 293 (501) T protein:vir:27 229 TTHAFGTVP---ITEF------LNNVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQ------ 293 (501) T ss_pred cccCCCccc---EEEe------cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccc------ Confidence 001 222 3332 1233345555555554444432 23333344555666555443222221100 Q ss_pred HHHHHhhcceeEeecCCCccccccccchhhhhhcccccC----CCCCcceeecCCCCCcChH-HHHHHHHHHHHHhcCCC Q lcl|NC_018087. 301 QHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRD----GKAVTEVETLPGMTGMNEM-DDILYFRKALYMALRVP 375 (520) Q Consensus 301 ~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRRe----GgrgTEIsTLpGg~nLgei-~DV~YF~kkLy~aL~VP 375 (520) ...++..+ ..+++--+ ++.+..+..|-...+.... .-+.-+.+.+|.-.++| T Consensus 294 ----------------~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p 350 (501) T protein:vir:27 294 ----------------ASDMKRTR-------LMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIP 350 (501) T ss_pred ----------------hhhhhhcC-------ceeecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCc Confidence 00011000 11221111 1112234444332222222 22455677788888988 Q ss_pred hhhccCCCccccccccchhhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCh-hhHHhhhhceEEEeeccchH Q lcl|NC_018087. 376 LSRIPDEQTQNVFDMSTAISR--DELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITE-DEWEAELNNIKIVFHKNSYF 452 (520) Q Consensus 376 ~SRl~~~~~~~~~G~~~eItR--DElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~-eew~~~~~~I~~~f~~Dn~f 452 (520) ---. ++ +. |.++..+. -...-..-+.+.++.|..-+..+++.=+-+-++... .+++ ...|.+.|...-.- T Consensus 351 ~~~~--~~-~~--~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d--~~~i~v~f~~~~p~ 423 (501) T protein:vir:27 351 DMSD--TN-FS--GNTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFD--ESLLKITFTPNLPK 423 (501) T ss_pred ccCc--cc-cc--cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--cccceEEeCCCCCc Confidence 4222 11 11 32222221 122234556666677777666666543333233221 1111 23578888654443 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCH---HHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 453 SEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSD---EDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 453 ~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tD---eeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) .+.. -++++..+.+ .+|.+++++. |...+ +|++.++++-+++..+... +...++. T Consensus 424 n~~e-------~ad~~~kl~g----~iS~et~l~~-l~~v~D~~~E~eri~~E~~e~~~~~~~-~~~~~~~ 481 (501) T protein:vir:27 424 SLNE-------QVSILTGLGG----QVSQETALSL-SGLVESPNEELDKINKEVSEIDFKGYS-NDFNEHV 481 (501) T ss_pred CHHH-------HHHHHHHHhc----cCcHHHHHHh-CCCCCCHHHHHHHHHHHHHhhhHhhhc-Ccccccc Confidence 4432 3344555543 3799999987 44443 5555555544433333321 1111112 No 152 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=93.98 E-value=0.0058 Score=32.86 Aligned_cols=415 Identities=13% Similarity=0.120 Sum_probs=171.7 Q ss_pred hhhhcchhhhhhhHHHhhhccCCCcccCCCCC-CCceeeccccccccccccc-ccccccccchh---------HHHHHHH Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAESITAPKFD-DGATEVDSQDIAYNGVFQK-LYGSQDPTATS---------TRELINT 77 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~-dg~~~i~~~~~a~~g~~~~-~~~~~~~~~~~---------~~~LI~~ 77 (520) +.|...- |-+++.-- .|-...+.++++.. -+.. +..+.+..... -...+.+ T Consensus 1 ~~~~~~~-----------------~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r 62 (492) T protein:vir:97 1 MQFIQLI-----------------SQVAQALIKGGNILYPSQPTQTE-IFDAIVRTNNKPETLEEMIVRYIKQHLEKLPE 62 (492) T ss_pred ChHHHHH-----------------HHHHHHHhcCCceeeccchhhhh-HhhhcccCCCchhhHHHHHHHHHHHHHHHHHH Confidence 1111111 11222111 23333333444321 2221 12222221111 1122344 Q ss_pred HHHHhhccchhHHH---------------------------HhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHH Q lcl|NC_018087. 78 YRSLLNNYEVDNAV---------------------------QEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFN 130 (520) Q Consensus 78 YR~ma~~pEvd~Ai---------------------------~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~ 130 (520) |+.+..+++-...| ..||+-.+-+ --..||++..++.+..+ ..+ T Consensus 63 ~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~y-l~g~p~~~~~~d~~~~~--------~l~ 133 (492) T protein:vir:97 63 ISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY-IVGKPIAFKHTDDEVVK--------RID 133 (492) T ss_pred HHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHHHhhh-hcccCceeccCchHHHH--------HHH Confidence 55555554443211 1111111111 12266666666554333 233 Q ss_pred HHHHHhcchhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeec--cCCCCccccccc--c------ Q lcl|NC_018087. 131 SVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVREL--DTKMENGVKVVK--G------ 200 (520) Q Consensus 131 ~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i--~~~~~~~~~~~~--~------ 200 (520) .+++ -+|+....++.+.+++-|+-|.+.-+|. +|-..++.+||+.+.++.+- ..+..-.++.+. . T Consensus 134 ~~~~-n~~~~~~~~~~~~~~~~G~a~~~v~~d~----dg~~~~~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~ 208 (492) T protein:vir:97 134 EVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDE----EGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEY 208 (492) T ss_pred HHHh-ccHHHHHHHHHHHHhhcCeEEEEEEecC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEeeccceeEEE Confidence 3333 3677888889999999999888877652 36678999999999887542 111212222111 0 Q ss_pred ----eecceeecCccccccccc------ceecCCcceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHH-HHHHH Q lcl|NC_018087. 201 ----YREYFLYDTELESYQCGH------QHFAAGTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLK-LLEDA 269 (520) Q Consensus 201 ----~~ey~~y~~~~~~~~~~~------~~~~~~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~-m~EDa 269 (520) ...+|.+........... -...++.-=.|| |+++. ++....|=++..+....-+. ++=+. T Consensus 209 y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP---vv~~~------nn~~g~sd~e~v~~liDa~d~~~S~~ 279 (492) T protein:vir:97 209 WDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP---FIPFK------NNDLEISDIFMYKTLIDAYNRRLSDL 279 (492) T ss_pred EecCeEEEEEEecCeeeecccccccccccccccCCCCCcc---eEEec------CCCCCCCchHhHHHHHHHHHHHHHHH Confidence 011222221110000000 000011000222 23221 12223444444333333332 23344 Q ss_pred HHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeec Q lcl|NC_018087. 270 MMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETL 349 (520) Q Consensus 270 lVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTL 349 (520) ...-+-++-|-+-+.-.+.-+.+ +... .+..++...-++ |..+.+| T Consensus 280 ~~~~~~~~~~~l~~~g~~~~~~~-----~~~~---------------------------~~~~~~~~~~~~--~~~~~~l 325 (492) T protein:vir:97 280 SNTFKDSNELTYVLKNYDDQELP-----EFKR---------------------------LLRYYGAIKVSD--NGGVDTI 325 (492) T ss_pred HHHHHHhccceeeeecCCcccch-----hHHH---------------------------HHhhccceecCC--CCcceeE Confidence 44445555554444322211111 1111 111111111111 2235555 Q ss_pred CCCCCcChH-HHHHHHHHHHHHhcCCChhhccCCCccccccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018087. 350 PGMTGMNEM-DDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTA--ISRDELSFDKFISELQHKFEEIFLSPLKSNLLL 426 (520) Q Consensus 350 pGg~nLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~e--ItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiL 426 (520) -...+...+ .-+.-.++.+|+-.++|- +.++. ++ |..+. |.--+.....-+.+.++.|..-+..+++.=+-+ T Consensus 326 ~~~~~~~~~~~~~~~L~~~I~~~s~~p~--~~~~~-~~--~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~ 400 (492) T protein:vir:97 326 QVEVPVENSKKYLDELYQKIMLFGQAVD--FSSDK-FG--SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEH 400 (492) T ss_pred eccCCHHHHHHHHHHHHHHHHHHhCCCC--CCccc-cc--cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 444444322 334556677888888884 22222 11 22222 222233344556777777777777766643333 Q ss_pred cCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCH--HHHHHHHHHHH Q lcl|NC_018087. 427 KRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSD--EDIAAERKLID 504 (520) Q Consensus 427 kgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tD--eeI~~~~kqi~ 504 (520) -|+ ..+|. .|.+.|....--.+. +.+++++.+.+ .+|.+++++.+-..+| +|++.+.++-+ T Consensus 401 ~~~--~~~~~----~i~v~f~~~~p~~~~-------e~a~~~~kl~G----~iS~et~l~~l~~v~d~~~Eleri~~E~~ 463 (492) T protein:vir:97 401 FDI--KGEHK----DVDISFNYNKVANTE-------LQVQTAQQSMG----IVSHETVLENHPFVEDLQAELERIEQEQT 463 (492) T ss_pred hcC--Ccccc----eeeEEecCCCCCCHH-------HHHHHHHHHhc----cCchHHHHHhCCCCCCHHHHHHHHHHHHH Confidence 332 34454 456777544333332 23455555543 3799999987433333 45554444333 Q ss_pred HhhhcCC-ccCC-----ccccC Q lcl|NC_018087. 505 EELSDKI-FNPP-----EPEEI 520 (520) Q Consensus 505 ~E~~~~~-~~~p-----~~e~~ 520 (520) +..++.. .... .++|- T Consensus 464 ~~~~~~~~~~~~~~~~~~~~~~ 485 (492) T protein:vir:97 464 EYNKQLPNLDDGGADSAQQQER 485 (492) T ss_pred HHHHhhhccccCCCCCCccccc Confidence 2222210 0000 00011 No 153 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=93.86 E-value=0.0062 Score=32.70 Aligned_cols=386 Identities=14% Similarity=0.069 Sum_probs=179.4 Q ss_pred ccccccccchhHHHHHHHHHHHhhccchhHHH----------------------HhhhceeeEecCCCcEEEEeeccchh Q lcl|NC_018087. 60 LYGSQDPTATSTRELINTYRSLLNNYEVDNAV----------------------QEIVSDAIVYEEGFDVVSIDLDQTAF 117 (520) Q Consensus 60 ~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai----------------------~eIvneaiv~d~~~~~V~l~Ld~~~~ 117 (520) +-.+.-. +-..+|+.+..+++-+..+ ..||+..+-+= -..||++..++..- T Consensus 1 ~~~~~~~------~~~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l-~g~~~~~~~~~~~~ 73 (440) T protein:vir:95 1 MLAAFLG------SQKQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYV-IGNPVSIGVMEGGS 73 (440) T ss_pred ChhhHHH------HHHHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhhe-eccCceEeeCCCcc Confidence 1111111 1233444444444433221 11222111100 22455554443321 Q ss_pred hhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCC--CCccc Q lcl|NC_018087. 118 TENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTK--MENGV 195 (520) Q Consensus 118 s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~--~~~~~ 195 (520) + ...+.++.++.--+|+....++.+.+.+-|+-|.+.-+|. +|-..+..++|+.+.++.+-... ..-.+ T Consensus 74 ~-----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~----~~~~~i~~~~p~~~~~~~d~~~~~~~~~~i 144 (440) T protein:vir:95 74 A-----DQLSTIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDK----DKVDRVVLISPLEMFVIRDLTVEQNIIAAV 144 (440) T ss_pred H-----HHHHHHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecC----CCceEEEEEcccceEEEEcCCCCCceEEEE Confidence 1 2233556667667889999999999999999998877662 35567899999999998643221 11122 Q ss_pred cc--ccceecceeecCcc-cccccccceecCC-------cce-ecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHH Q lcl|NC_018087. 196 KV--VKGYREYFLYDTEL-ESYQCGHQHFAAG-------TKI-KIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLK 264 (520) Q Consensus 196 ~~--~~~~~ey~~y~~~~-~~~~~~~~~~~~~-------~~~-~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~ 264 (520) +. ......+.+|++.. ..|....+....- .++ +|| |+++ +|+....|=++..+.....+. T Consensus 145 ~~~~~~~~~~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP---vv~~------~n~~~g~sd~e~v~~lida~~ 215 (440) T protein:vir:95 145 HLPIYADKVNMTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVP---VVEW------WNNRFRMGDYESEISLIDAYD 215 (440) T ss_pred EEEEecCceEEEEEeCCeEEEEEEecCCccceeecceeeccCceee---EEEe------eCCCCCCCchhhhHHHHHHHH Confidence 11 11111223454332 1121111000000 000 111 1221 122223454444444444333 Q ss_pred -HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCC-Cccccccccchhhhhhccccc---- Q lcl|NC_018087. 265 -LLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDART-GKVKNQANMMALTEDYWLQRR---- 338 (520) Q Consensus 265 -m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~T-Gev~d~~~~msmlEDywLpRR---- 338 (520) ++-+.+..-+-.+.|-+-+.-.+.+.-. +..+ +.+++.+. .|++.. T Consensus 216 ~~~s~~~~~~~~~~~~~~v~~g~~~~~~~---------------------~~e~~~~~~~~~~-------~~~~~~~~~~ 267 (440) T protein:vir:95 216 AGQSDTANYMSDLNDAMLLVKGDLDGIKL---------------------SPEDAAKMKDANM-------LFLKTGISTT 267 (440) T ss_pred HHHHHHHHHHHHhhcceeeeecccccCCC---------------------Cccchhhhhhccc-------eecccccccc Confidence 3344455556666666555432111100 0111 11111111 233222 Q ss_pred CCCCCcceeecCCCCCcChH-HHHHHHHHHHHHhcCCChhhccCCCccccccccchhh--HHHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 339 DGKAVTEVETLPGMTGMNEM-DDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAIS--RDELSFDKFISELQHKFEEI 415 (520) Q Consensus 339 eGgrgTEIsTLpGg~nLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eIt--RDElkF~KFI~rLr~rFs~i 415 (520) .++.+..++.|-...++... .-++-+.+.+|...++|- +..++ + -|..+... --+.....-+.+.|..|..- T Consensus 268 ~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p~--~~~~~-~--~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~ 342 (440) T protein:vir:95 268 GQQTTADASYIYKQYDVNGTEAYKNRLANDIHRFSRIPN--LDDDR-F--NSTSSGIALLYKMIGLEQVRKDKETYFTKA 342 (440) T ss_pred cCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcc--ccccc-c--cccchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22334567777666666544 347788889999999984 22111 1 12222222 22233444577777778887 Q ss_pred HHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHH Q lcl|NC_018087. 416 FLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDED 495 (520) Q Consensus 416 f~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDee 495 (520) +.++|+.=+-+-+++...+|+. ..+.+.|..--.-.+.. .+++++.+. | .+|.++++.. |..+|.+ T Consensus 343 l~~~~~li~~~~~~~~~~~~~~--~~v~i~f~~~~p~~~~~-------~ad~~~kl~---g-~iS~et~~~~-l~~~d~~ 408 (440) T protein:vir:95 343 LRRRYELISNIHKAINGPVIEA--NKLTFTFHPNIPQDVWT-------EIKAYIEAG---G-EISQETLMEN-ASFTDYK 408 (440) T ss_pred HHHHHHHHHHHHhhcCCccccc--ccceEEeCCCCCCCHHH-------HHHHHHHHh---c-cCcHHHHHHh-CCCCCcH Confidence 7777776444444555555553 35777787555555533 344444443 3 4799999987 5655432 Q ss_pred HHHHHHHHHHhhhcCC------c----cCCcccc Q lcl|NC_018087. 496 IAAERKLIDEELSDKI------F----NPPEPEE 519 (520) Q Consensus 496 I~~~~kqi~~E~~~~~------~----~~p~~e~ 519 (520) ++.++|++|..... . ...+++| T Consensus 409 --~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~e 440 (440) T protein:vir:95 409 --TEHSRILKQGGSSDLEIGQIVGDADVGQADTE 440 (440) T ss_pred --HHHHHHHHHHHHhhhhHHhhccCCCCCCcCCC Confidence 22233333332211 0 0111122 No 154 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=93.64 E-value=0.0069 Score=32.44 Aligned_cols=419 Identities=11% Similarity=0.048 Sum_probs=164.1 Q ss_pred cccc---------------------cchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeeccccccccccccccc Q lcl|NC_018087. 3 MLAD---------------------SDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLY 61 (520) Q Consensus 3 ~~~~---------------------~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~ 61 (520) |+.| .+++|..--..+....++. + ..+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~l~~~~i~~li~~~~~~~~~r~~~-l------------------------------~~YY 49 (506) T protein:vir:94 1 MDYDLTEHKQANLIYQESLENLTPNKIMKFITHHFNYQRPRLEM-L------------------------------DDYY 49 (506) T ss_pred CCcchhhhhcceeecccchhcCCHHHHHHHHHHHHHHHHHHHHH-H------------------------------HHHh Confidence 5444 2222211111110000100 0 0000 Q ss_pred ccccccchh-HHHHHHHHHH--HhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcc Q lcl|NC_018087. 62 GSQDPTATS-TRELINTYRS--LLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNF 138 (520) Q Consensus 62 ~~~~~~~~~-~~~LI~~YR~--ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f 138 (520) .|--..+.. ...+.+.++. -..+|=+.-+|+..+.-. -..||++..++... .+..+.+.+--+| T Consensus 50 ~g~~~~i~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l-----~G~p~~~~~~d~~~--------~~~l~~~~~~N~~ 116 (506) T protein:vir:94 50 QGYNLKILDKQSRRHEDGKADHRATHSFAKYIADFQTSYS-----VGNPINVKLPDDGS--------NSGFDTFNKANDV 116 (506) T ss_pred cCCCccccccccccccccCCcceeecchHHHHHHHhhhhh-----cccCceeecCcchH--------HHHHHHHHhccCH Confidence 000000000 0000000000 011222233333333221 13677776665422 3345556666788 Q ss_pred hhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCC--Cccccccc----------cee-cce Q lcl|NC_018087. 139 QRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKM--ENGVKVVK----------GYR-EYF 205 (520) Q Consensus 139 ~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~--~~~~~~~~----------~~~-ey~ 205 (520) +....++.+.+.+-|+-|.+.-+|. +|-..+..+||+.+.+|.+-.... .-+++.+. .+. -+. T Consensus 117 ~~~~~~~~~~~~~~G~a~~~v~~de----d~~~~i~~~~p~~~~~v~dd~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~ 192 (506) T protein:vir:94 117 DAENYDLFLDMSRYGRAYEYVYRGE----DNEEHLAKLDPLDTFVIYSTDVDPKPIMAVRYHQIELVDDNQVSTINYVPE 192 (506) T ss_pred hHHHHHHHHHHHhcCeEEEEEEecC----CCeeEEEEEcccceEEEecCCCCCceEEEEEEEeeeeccCCceeEEEEEEE Confidence 9999999999999999998877662 367889999999999986532211 11111100 011 111 Q ss_pred eecCcccccccccc-eec----CCcce-ecCcccEEEeecccccCCCCcchhhhHHHHHHHH--------HHHHHHHHHH Q lcl|NC_018087. 206 LYDTELESYQCGHQ-HFA----AGTKI-KIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPAN--------QLKLLEDAMM 271 (520) Q Consensus 206 ~y~~~~~~~~~~~~-~~~----~~~~~-~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~N--------qL~m~EDalV 271 (520) +|.+.......+.. ... ...++ +||- |.|+ |+....|-++..+.... ....+++..- T Consensus 193 ~yt~~~~~~~~~~~~~~~~~~~~~~~~g~vPv--v~~~-------n~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~ 263 (506) T protein:vir:94 193 TWTADTYTLYNPTPIMGKMQVDTTKPITTFPV--VEFK-------NSNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNE 263 (506) T ss_pred EEeCceEEEeccccCccceeccccccCCccce--EEec-------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhh Confidence 23332211111100 000 00011 2221 2222 22223344444333332 2334455544 Q ss_pred HHHHhcCcc-ceEEEccCCCC-----------chHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccC Q lcl|NC_018087. 272 IYRITRAPD-RRVFYIDTGNM-----------PARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRD 339 (520) Q Consensus 272 IyRi~RApe-RRvFyIDvGnl-----------pk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRRe 339 (520) .+++...-. ....-.+.... ....+.+.+.+ |. +++ -++++-.. T Consensus 264 ~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--~~~---------------------~~~~~~~~ 319 (506) T protein:vir:94 264 AMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKE-MK--DAN---------------------MLLLKSGM 319 (506) T ss_pred HHHHHhcCccccccchhccccccccccccccccccchhHHHhh-hh--hcC---------------------eeeecccc Confidence 444443321 11111111110 00011111110 00 011 12222222 Q ss_pred CCCCc----ceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccch--hhHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 340 GKAVT----EVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTA--ISRDELSFDKFISELQHKF 412 (520) Q Consensus 340 GgrgT----EIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~e--ItRDElkF~KFI~rLr~rF 412 (520) +..|+ .+..|--..++.. -.-+.-..+.+|...++|- +.+++.. |.++. |.--+..-..-+.+.|..| T Consensus 320 ~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~~~---~n~Sg~Aik~~~~~l~~k~~~k~~~~ 394 (506) T protein:vir:94 320 TVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPD--LTDENFA---SNSSGVAMQYKVLGTVELASTKRRMF 394 (506) T ss_pred cccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccc--ccccccc---ccchHHHHHHHHHHHHHHHHHHHHHH Confidence 21121 2222222222221 2234556778888899994 2222211 22222 2222222335667777777 Q ss_pred HHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCC Q lcl|NC_018087. 413 EEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMS 492 (520) Q Consensus 413 s~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~t 492 (520) ..-+...++.=+-+-++. ...++.-...+++.|...---.+...++ ++..+. | .+|.++++.. |..+ T Consensus 395 ~~~l~~~~~li~~~~~~~-~~~~~~d~~~i~i~f~~~~p~d~~e~a~-------~~~kl~---g-~iS~et~~~~-lp~v 461 (506) T protein:vir:94 395 ERGLYARYQIISDIENSI-HGDWTFDPQELTFTFRDNLPADNISQIK-------ALVQAG---A-TLPQKYLYQQ-LPGV 461 (506) T ss_pred HHHHHHHHHHHHHHHHhc-CCccccccccceEEeCCCCCcCHHHHHH-------HHHHHh---c-cCChHHHHHh-CCCC Confidence 777777665433222221 1222222345778886655555544443 344443 3 4899999976 6665 Q ss_pred H---HHHHHHHHHHHHhhhcC--CccCCc-----------cccC Q lcl|NC_018087. 493 D---EDIAAERKLIDEELSDK--IFNPPE-----------PEEI 520 (520) Q Consensus 493 D---eeI~~~~kqi~~E~~~~--~~~~p~-----------~e~~ 520 (520) + +|++.+.++-+++.+.. -..+++ +||| T Consensus 462 ~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 505 (506) T protein:vir:94 462 TNPQDIVDMMKEQSANGDYSFDQNGVISNDGQTNTTATQTDEEV 505 (506) T ss_pred CCHHHHHHHHHHHHHHHhhcchhhcCCCcccCccccccccccCC Confidence 5 44444443332221110 001111 1222 No 155 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=93.47 E-value=0.0075 Score=32.25 Aligned_cols=442 Identities=10% Similarity=0.054 Sum_probs=168.7 Q ss_pred cccc-----cchhh----hcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecc----c-cc---cccccccccccccc Q lcl|NC_018087. 3 MLAD-----SDLKM----FAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDS----Q-DI---AYNGVFQKLYGSQD 65 (520) Q Consensus 3 ~~~~-----~~l~~----f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~----~-~~---a~~g~~~~~~~~~~ 65 (520) |+-- +-+.- +.|. +....++.. +..+..+.+ ....+.. . .. .+ --+..+|.|-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~---~~~~~~~~~----~~~~i~~~i~~h~~~~~~rl-~~l~~yY~g~~ 71 (502) T protein:vir:48 1 MMEQTLFTDSTGQDLVLNLRFH-RESRIRYRA---DNLEELMVN----NWELLKNFINHHKLRQAPRI-QELLDYARGEN 71 (502) T ss_pred CceeEEEEecchhHHHhhcccC-hhHHhhhcc---cchhhhccc----cHHHHHHHHHHHHHHHHHHH-HHHHHHhcCCC Confidence 1110 00000 0110 000111100 000000000 0000000 0 00 00 00000000000 Q ss_pred ccchhHHHHHHHHH--HHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhH Q lcl|NC_018087. 66 PTATSTRELINTYR--SLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGS 143 (520) Q Consensus 66 ~~~~~~~~LI~~YR--~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~ 143 (520) ..+.+........+ .-..++=..-+|+..+.-. -..||++..++.+-.+ .+.+.++.++.--+|+.... T Consensus 72 ~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl-----~g~p~~~~~~d~~~~~----~~~~~l~~~~~~N~~~~~~~ 142 (502) T protein:vir:48 72 HDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYL-----AGNPIRVEYDDNEDNS----QNDDAIKRIGRINDIDTHNR 142 (502) T ss_pred ccccccccccccccccceeecchHHHHHHHHhhhh-----cccCeeEecCCccchh----HHHHHHHHHHhhcCHhHHHH Confidence 00000000000000 0011122222233222211 1366777766554444 44555666777789999999 Q ss_pred HHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeecc--CCCCcccccc------cceecceeecCcccc-c Q lcl|NC_018087. 144 DHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELD--TKMENGVKVV------KGYREYFLYDTELES-Y 214 (520) Q Consensus 144 ~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~--~~~~~~~~~~------~~~~ey~~y~~~~~~-~ 214 (520) ++.+...+-|+-|.+.-+|. +|-..++.+||+.+..|.+-. .+..-+++.+ .+...+-+|++..-. + T Consensus 143 ~~~~~~~~~G~a~~~v~~de----dg~~~i~~~~p~~~~~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~iyt~~~i~~~ 218 (502) T protein:vir:48 143 NLIRDLSQTGRAYEVIYRSE----YDETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNQHIYTL 218 (502) T ss_pred HHHHHHhhcCeEEEEEEeCC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEEeecCCcEEEEEEEeCCeEEEE Confidence 99999999999998877663 356779999999999875422 1111222211 011112244433211 1 Q ss_pred ccccceecCC---cce-ecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCC Q lcl|NC_018087. 215 QCGHQHFAAG---TKI-KIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLK-LLEDAMMIYRITRAPDRRVFYIDTG 289 (520) Q Consensus 215 ~~~~~~~~~~---~~~-~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~-m~EDalVIyRi~RApeRRvFyIDvG 289 (520) ..++..-..+ ..+ +|| |++. +|+....|=++.++....-+. ++=+....-+-++.|-+-+.-.... T Consensus 219 ~~~~~~~~~~~~~~~~g~vP---vv~~------~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~ 289 (502) T protein:vir:48 219 DASDSFNEISVTPHAFGTVP---ITEF------LNNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLAL 289 (502) T ss_pred EeCCceeeccceecCCCccc---eEEe------cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccc Confidence 1110000000 000 122 2221 122334455555444433332 2233333444555555444322111 Q ss_pred CCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHH-HHHHHHHH Q lcl|NC_018087. 290 NMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDD-ILYFRKAL 368 (520) Q Consensus 290 nlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~kkL 368 (520) ..+.... -+.++ ++.+ +...... -+.+.+..+.+|-...+...+.- ++-+.+.+ T Consensus 290 ~~~~~~~------~~~~~--~~~~-------------~~~~~~~----~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I 344 (502) T protein:vir:48 290 PQGMQAS------DMKRT--RLMQ-------------LKPPKSA----DGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDI 344 (502) T ss_pred ccccchh------hhhhc--ceee-------------ccccccc----cccccCcceeEeeecCCHHHHHHHHHHHHHHH Confidence 1110000 00000 0000 0000000 01122335555544444444433 56667788 Q ss_pred HHhcCCChhhccCCCccccccccc--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCh-hhHHhhhhceEEE Q lcl|NC_018087. 369 YMALRVPLSRIPDEQTQNVFDMST--AISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITE-DEWEAELNNIKIV 445 (520) Q Consensus 369 y~aL~VP~SRl~~~~~~~~~G~~~--eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~-eew~~~~~~I~~~ 445 (520) |+..++|---++.- + |..+ .|..-......-+.+.++.|..-+...++.=+-+-++... .+++ ...|.+. T Consensus 345 ~~~s~~p~~~~~~~-~----~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d--~~~i~i~ 417 (502) T protein:vir:48 345 HVFTNTPDMSDNHF-S----GNASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFD--ESRLKIT 417 (502) T ss_pred HHHhCCCCcCcccc-c----cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--cccceEE Confidence 88888884222111 1 2222 2222222334455666666666666666543322232211 1122 2457888 Q ss_pred eeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhc----CCcc-------- Q lcl|NC_018087. 446 FHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSD----KIFN-------- 513 (520) Q Consensus 446 f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~----~~~~-------- 513 (520) |.....-.+.. .++++.++.+ .+|.+++++. |..+++. +++-++|++|..+ +.-+ T Consensus 418 f~~~~p~d~~e-------~a~~~~kl~g----~iS~et~l~~-l~~v~D~-~~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 484 (502) T protein:vir:48 418 FTPNLPKSLYE-------QVSILNDLGG----QVSQETALSL-SGLVENP-TEELDKINEESSKIDFKGYPSYFYDNVGK 484 (502) T ss_pred eCCCCCcCHHH-------HHHHHHHHhc----cCcHHHHHHh-CCCCCCH-HHHHHHHHHHHHhhhhhcccccccccccc Confidence 86544433432 3345555543 3799999988 6664421 2333334333322 1100 Q ss_pred --CCccccC Q lcl|NC_018087. 514 --PPEPEEI 520 (520) Q Consensus 514 --~p~~e~~ 520 (520) +..+|+- T Consensus 485 ~~d~~~e~~ 493 (502) T protein:vir:48 485 YTDEVKETH 493 (502) T ss_pred cCCCccCCC Confidence 0001111 No 156 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=93.28 E-value=0.0081 Score=32.04 Aligned_cols=430 Identities=11% Similarity=0.054 Sum_probs=187.0 Q ss_pred cCCCcccCCCCCCCceeeccccccccc------ccccccccccccchhHH-HHHHHHHHH-hhccchhHHHHhhhceeeE Q lcl|NC_018087. 29 DKAESITAPKFDDGATEVDSQDIAYNG------VFQKLYGSQDPTATSTR-ELINTYRSL-LNNYEVDNAVQEIVSDAIV 100 (520) Q Consensus 29 ~~~~s~~~p~~~dg~~~i~~~~~a~~g------~~~~~~~~~~~~~~~~~-~LI~~YR~m-a~~pEvd~Ai~eIvneaiv 100 (520) -..+.+-....++....+..=-..+.. -...+|-|- ..+.... ..=+.+|.+ +-..=+.-+|+..+.-..+ T Consensus 1 ~~~~~~~~~e~~~~~~~~~~l~~~~~~~~~r~~~l~~YY~G~-~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~~l~~ 79 (486) T protein:vir:42 1 MTAPLPGMEEIEDPAVVREEMISAFEDASKDLASNTSYYDAE-RRPEAIGVTVPREMQQLLAHVGYPRLYVDSVAERQAV 79 (486) T ss_pred CCCCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhccc-CcchhcccccchhHhhhhhccchHHHHHHHHHhhhcc Confidence 111111122222222111100000000 001111111 1110000 000111111 0111223333333322211 Q ss_pred ecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecC----CCCCCeeeeEec Q lcl|NC_018087. 101 YEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPN----RPKDGIIELRRL 176 (520) Q Consensus 101 ~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~----~~k~GI~elr~l 176 (520) + .++ ..+. + ...++++.|+.--+|+....++++.-.+.||-|.+.-.+.. ...+|...++.+ T Consensus 80 -~----g~~--~~~~---~----~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~ 145 (486) T protein:vir:42 80 -E----GFR--LGDA---D----EADEELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVE 145 (486) T ss_pred -c----cee--cCCC---c----hhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEe Confidence 0 111 1111 1 12234566666667888999999999999999888765532 245678889999 Q ss_pred CccceeeeeeccC-CCCcccccc-----cceecceeecCcccccc-c-ccceecCC---cce-ecCcccEEEeecccccC Q lcl|NC_018087. 177 DPRNVQFVRELDT-KMENGVKVV-----KGYREYFLYDTELESYQ-C-GHQHFAAG---TKI-KIPYSAMVYAHSGLVDC 244 (520) Q Consensus 177 DPr~i~~vr~i~~-~~~~~~~~~-----~~~~ey~~y~~~~~~~~-~-~~~~~~~~---~~~-~I~~~aI~y~hSGL~d~ 244 (520) ||+.+-.+.+-.. ...-++... +.+...-+|.+....+. . ++.-.... ..+ ++| .|.|++- .++ T Consensus 146 ~p~~~~~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vP--vv~~~n~--~~~ 221 (486) T protein:vir:42 146 PPTRMHAEIDPRINRVSKAIRVAYDKEGNEIQAATLYTPMETIGWFRADGEWAEWFNVPHGLGVVP--VVPLPNR--TRL 221 (486) T ss_pred cccceEEEEeCCCCCeEEEEEEEEecCCCeEEEEEEEcCCcEEEEEecCCcEEeecceecCCCCce--EEEeccc--ccc Confidence 9998888765211 111111111 01111223433321111 0 00000000 000 111 1334431 122 Q ss_pred CCCcchhhhH----HHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCcc Q lcl|NC_018087. 245 CGKNIIGYLH----RAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKV 320 (520) Q Consensus 245 ~~~~~~syL~----~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev 320 (520) .+....|-++ ..+..+|. .+-+..++-...-.|.|-+.-.+....+.... +.+.+.++..| T Consensus 222 ~~~~G~s~i~~~v~~liDa~~~--~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~-----------~~~~~~~~~~~-- 286 (486) T protein:vir:42 222 SDLYGTSEITPELRSMTDAAAR--ILMLMQATAELMGVPQRLIFGIKPEEIGVDSE-----------TGQTLFDAYLA-- 286 (486) T ss_pred CCCCCcccchhhHHHHHHHHHH--HHHHHHHHHHhhcchHHHhhcCCccccccccc-----------cccchhhhhhc-- Confidence 2222233333 23333332 34455555566666666554332222110000 01111122222 Q ss_pred ccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHH Q lcl|NC_018087. 321 KNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELS 400 (520) Q Consensus 321 ~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElk 400 (520) ..|..-.+ ..++..+|+.+-=.-++-++=.-.++...-++|..-|...+... ..+..|.--+.. T Consensus 287 -----------~~~~~~~~---~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~--~Sg~Al~~~~~~ 350 (486) T protein:vir:42 287 -----------RILAFEDA---EGKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNP--ASAEAIRAAESR 350 (486) T ss_pred -----------hhcccCCC---CceEEeecccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCch--hHHHHHHHHHHH Confidence 22322111 23466677643222334444444555666788876664332111 123345556667 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC-hhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhh Q lcl|NC_018087. 401 FDKFISELQHKFEEIFLSPLKSNLLLKRVIT-EDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYI 479 (520) Q Consensus 401 F~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t-~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~ 479 (520) ...-+.+.|..|..-+...++.-+.+.+... +.+| ..|++.|.....=+. .+.++.+..+..-+...+ T Consensus 351 l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~d~----~~i~v~w~~~~~~s~-------~~~ad~~~kl~~~~~g~~ 419 (486) T protein:vir:42 351 LIKKVERKNLMFGGAWEEAMRIAYRIMKGGDVPPDM----LRMETVWRDPSTPTY-------AAKADAATKLYGNGQGVI 419 (486) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc----eeeeEEecCCCCCCH-------HHHHHHHHHHHhcccCCC Confidence 7788899999999999999987777766532 2222 358888865544443 334444444443333467 Q ss_pred hHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCC---------c------cCCccccC Q lcl|NC_018087. 480 SNHTAMKDFLQMSDEDIAAERKLIDEELSDKI---------F------NPPEPEEI 520 (520) Q Consensus 480 S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~---------~------~~p~~e~~ 520 (520) |.++++ ..|.+++++++++++.-+++...+. - +.+.++.. T Consensus 420 s~et~~-~~lg~~~d~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 474 (486) T protein:vir:42 420 PRERAR-IDMGYSVKEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPSPTAPPK 474 (486) T ss_pred CHHHHH-hcCCCChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCCCCCCC Confidence 999998 5699999999887765444432211 1 11111111 No 157 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=93.23 E-value=0.0083 Score=31.99 Aligned_cols=411 Identities=9% Similarity=0.064 Sum_probs=175.4 Q ss_pred CcccCCCCC--CCceeecccccccccccccccccccccc-----hhHHHHHHHHHHHhhccchhHHHHhhhceeeEecCC Q lcl|NC_018087. 32 ESITAPKFD--DGATEVDSQDIAYNGVFQKLYGSQDPTA-----TSTRELINTYRSLLNNYEVDNAVQEIVSDAIVYEEG 104 (520) Q Consensus 32 ~s~~~p~~~--dg~~~i~~~~~a~~g~~~~~~~~~~~~~-----~~~~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~d~~ 104 (520) -++..|..- .-....... ..-.--+..+|-|- ..+ ....++-+..+ ...++=+.-+|+..+.-++ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~-~~r~~~l~~Yy~g~-~~i~~~~~~~~~~~~~~~~-k~~~n~~~~ivd~~~~~l~----- 72 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDG-MSRVRLLARYSNGD-APLPELTRNTSAAWRSFQR-EARTNWGLMVRDSVADRII----- 72 (456) T ss_pred CCCCCHHHHHHHHHHHHHHH-HHHHHHHHHHHhcC-CCchhcCcccChhhhhhhh-hhhcchHHHHHHHHHhhhc----- Confidence 011111000 000000000 00000001111110 000 00011111111 1222333444444443222 Q ss_pred CcEEEEeec-cchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceee Q lcl|NC_018087. 105 FDVVSIDLD-QTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQF 183 (520) Q Consensus 105 ~~~V~l~Ld-~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~ 183 (520) .+||++... +.+.. +.+..+..--+|+....++++.-.+.||-|.+.-.| .+|-..++.+||+.+-. T Consensus 73 ~~~~~~~~~~d~~~~--------~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d----~~g~~~i~~~~p~~~~~ 140 (456) T protein:vir:10 73 PNGITVGGSADSDLA--------LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR----DDGTATITADSPETMVV 140 (456) T ss_pred cCCeecCCCCCcchH--------HHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeC----CCCceEEEEEccceeEE Confidence 245555432 22222 233344555678888889999999999987654433 34677788999998877 Q ss_pred eeeccCCC--Cccccccc---------------ceec----ceeecCcccccccccceecCCcceec-------CcccEE Q lcl|NC_018087. 184 VRELDTKM--ENGVKVVK---------------GYRE----YFLYDTELESYQCGHQHFAAGTKIKI-------PYSAMV 235 (520) Q Consensus 184 vr~i~~~~--~~~~~~~~---------------~~~e----y~~y~~~~~~~~~~~~~~~~~~~~~I-------~~~aI~ 235 (520) +.+-..+. ...++... ++.. .++|...... .....++..+.+ ..--|+ T Consensus 141 i~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~pvv 216 (456) T protein:vir:10 141 SVDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRR----LVTRISDSWVPVGDAVVTGSPPPVV 216 (456) T ss_pred EEcCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccce----eeeecCCceeeccccCCCCCceeEE Confidence 65422111 11111110 0001 1111111000 000001110100 011133 Q ss_pred EeecccccCCCCcchhhhHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEee Q lcl|NC_018087. 236 YAHSGLVDCCGKNIIGYLHRAVKPANQLKL-LEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYD 314 (520) Q Consensus 236 y~hSGL~d~~~~~~~syL~~aik~~NqL~m-~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd 314 (520) +.+ +...+|=++..+.....+.. +=|.++.-...--|.|-+.-.+.+. |.. | T Consensus 217 ~~~-------N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~-~~~-------------------d 269 (456) T protein:vir:10 217 VYQ-------NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGL-PNV-------------------D 269 (456) T ss_pred Eec-------CCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccc-ccc-------------------c Confidence 332 22334555554444433332 2233344444444444433222111 100 0 Q ss_pred cCCCccccccccchhhhh-hcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccc Q lcl|NC_018087. 315 ARTGKVKNQANMMALTED-YWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMST 392 (520) Q Consensus 315 ~~TGev~d~~~~msmlED-ywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~ 392 (520) .+|..-+..+....-.+ .|. +..|+.|..++.. ++.. ++-++-.-..+....++|.+-|...++ | ..+. T Consensus 270 -~~g~~~~~~~~~~~~~~~~~~----~~~~~~~~q~~~~-~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~-N--~Sg~ 340 (456) T protein:vir:10 270 -ENGNAIDYASIFEAAPGALWE----LPPGVDIWESQAN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-N--QSAE 340 (456) T ss_pred -ccccccchhhhhhhhcccccc----CCCCcceEEeccc-ChhHHHHHHHHHHHHHHhccCCChHHhccccc-C--hHHH Confidence 01110000000000001 132 1134567777753 3433 444677777788888999877754332 1 1233 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018087. 393 AISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLME 472 (520) Q Consensus 393 eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~ 472 (520) .|.--+..+-.-+.+.|+.|..-+.+.++.-+.+.|... + ..+++.|..-..=+. .+.++++..+. T Consensus 341 Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~~---~----~~~~v~w~~~~~~~~-------~~~ada~~kl~ 406 (456) T protein:vir:10 341 GAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV---E----DTVDVSFESPDRVTL-------GEKYSAASLAK 406 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc---c----cceeEEecCCCCcCH-------HHHHHHHHHHH Confidence 455556667778888999999989999988888888431 1 357888865433332 23345555543 Q ss_pred cccchhhhHHHHHHHHhCCCHHHHHHHHH-HHHHhhh---cC--CccCCcccc Q lcl|NC_018087. 473 PYIGKYISNHTAMKDFLQMSDEDIAAERK-LIDEELS---DK--IFNPPEPEE 519 (520) Q Consensus 473 p~vgky~S~~~i~k~IL~~tDeeI~~~~k-qi~~E~~---~~--~~~~p~~e~ 519 (520) .- ...|...+ ..+|++++++|++.+. .+++|.. .. -.|+|++.- T Consensus 407 ~~--gi~~~~~~-~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:10 407 AA--GESWASIR-RNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQEDGSR 456 (456) T ss_pred Hc--CCChHHHH-HhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhcCCCCCCC Confidence 21 23465555 5789999998875433 3333322 11 123444444 No 158 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=93.23 E-value=0.0083 Score=31.99 Aligned_cols=411 Identities=9% Similarity=0.064 Sum_probs=175.4 Q ss_pred CcccCCCCC--CCceeecccccccccccccccccccccc-----hhHHHHHHHHHHHhhccchhHHHHhhhceeeEecCC Q lcl|NC_018087. 32 ESITAPKFD--DGATEVDSQDIAYNGVFQKLYGSQDPTA-----TSTRELINTYRSLLNNYEVDNAVQEIVSDAIVYEEG 104 (520) Q Consensus 32 ~s~~~p~~~--dg~~~i~~~~~a~~g~~~~~~~~~~~~~-----~~~~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~d~~ 104 (520) -++..|..- .-....... ..-.--+..+|-|- ..+ ....++-+..+ ...++=+.-+|+..+.-++ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~-~~r~~~l~~Yy~g~-~~i~~~~~~~~~~~~~~~~-k~~~n~~~~ivd~~~~~l~----- 72 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDG-MSRVRLLARYSNGD-APLPELTRNTSAAWRSFQR-EARTNWGLMVRDSVADRII----- 72 (456) T ss_pred CCCCCHHHHHHHHHHHHHHH-HHHHHHHHHHHhcC-CCchhcCcccChhhhhhhh-hhhcchHHHHHHHHHhhhc----- Confidence 011111000 000000000 00000001111110 000 00011111111 1222333444444443222 Q ss_pred CcEEEEeec-cchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceee Q lcl|NC_018087. 105 FDVVSIDLD-QTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQF 183 (520) Q Consensus 105 ~~~V~l~Ld-~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~ 183 (520) .+||++... +.+.. +.+..+..--+|+....++++.-.+.||-|.+.-.| .+|-..++.+||+.+-. T Consensus 73 ~~~~~~~~~~d~~~~--------~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d----~~g~~~i~~~~p~~~~~ 140 (456) T protein:vir:10 73 PNGITVGGSADSDLA--------LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR----DDGTATITADSPETMVV 140 (456) T ss_pred cCCeecCCCCCcchH--------HHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeC----CCCceEEEEEccceeEE Confidence 245555432 22222 233344555678888889999999999987654433 34677788999998877 Q ss_pred eeeccCCC--Cccccccc---------------ceec----ceeecCcccccccccceecCCcceec-------CcccEE Q lcl|NC_018087. 184 VRELDTKM--ENGVKVVK---------------GYRE----YFLYDTELESYQCGHQHFAAGTKIKI-------PYSAMV 235 (520) Q Consensus 184 vr~i~~~~--~~~~~~~~---------------~~~e----y~~y~~~~~~~~~~~~~~~~~~~~~I-------~~~aI~ 235 (520) +.+-..+. ...++... ++.. .++|...... .....++..+.+ ..--|+ T Consensus 141 i~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~pvv 216 (456) T protein:vir:10 141 SVDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRR----LVTRISDSWVPVGDAVVTGSPPPVV 216 (456) T ss_pred EEcCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccce----eeeecCCceeeccccCCCCCceeEE Confidence 65422111 11111110 0001 1111111000 000001110100 011133 Q ss_pred EeecccccCCCCcchhhhHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEee Q lcl|NC_018087. 236 YAHSGLVDCCGKNIIGYLHRAVKPANQLKL-LEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYD 314 (520) Q Consensus 236 y~hSGL~d~~~~~~~syL~~aik~~NqL~m-~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd 314 (520) +.+ +...+|=++..+.....+.. +=|.++.-...--|.|-+.-.+.+. |.. | T Consensus 217 ~~~-------N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~-~~~-------------------d 269 (456) T protein:vir:10 217 VYQ-------NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGL-PNV-------------------D 269 (456) T ss_pred Eec-------CCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccc-ccc-------------------c Confidence 332 22334555554444433332 2233344444444444433222111 100 0 Q ss_pred cCCCccccccccchhhhh-hcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccc Q lcl|NC_018087. 315 ARTGKVKNQANMMALTED-YWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMST 392 (520) Q Consensus 315 ~~TGev~d~~~~msmlED-ywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~ 392 (520) .+|..-+..+....-.+ .|. +..|+.|..++.. ++.. ++-++-.-..+....++|.+-|...++ | ..+. T Consensus 270 -~~g~~~~~~~~~~~~~~~~~~----~~~~~~~~q~~~~-~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~-N--~Sg~ 340 (456) T protein:vir:10 270 -ENGNAIDYASIFEAAPGALWE----LPPGVDIWESQAN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-N--QSAE 340 (456) T ss_pred -ccccccchhhhhhhhcccccc----CCCCcceEEeccc-ChhHHHHHHHHHHHHHHhccCCChHHhccccc-C--hHHH Confidence 01110000000000001 132 1134567777753 3433 444677777788888999877754332 1 1233 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018087. 393 AISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLME 472 (520) Q Consensus 393 eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~ 472 (520) .|.--+..+-.-+.+.|+.|..-+.+.++.-+.+.|... + ..+++.|..-..=+. .+.++++..+. T Consensus 341 Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~~---~----~~~~v~w~~~~~~~~-------~~~ada~~kl~ 406 (456) T protein:vir:10 341 GAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV---E----DTVDVSFESPDRVTL-------GEKYSAASLAK 406 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc---c----cceeEEecCCCCcCH-------HHHHHHHHHHH Confidence 455556667778888999999989999988888888431 1 357888865433332 23345555543 Q ss_pred cccchhhhHHHHHHHHhCCCHHHHHHHHH-HHHHhhh---cC--CccCCcccc Q lcl|NC_018087. 473 PYIGKYISNHTAMKDFLQMSDEDIAAERK-LIDEELS---DK--IFNPPEPEE 519 (520) Q Consensus 473 p~vgky~S~~~i~k~IL~~tDeeI~~~~k-qi~~E~~---~~--~~~~p~~e~ 519 (520) .- ...|...+ ..+|++++++|++.+. .+++|.. .. -.|+|++.- T Consensus 407 ~~--gi~~~~~~-~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:10 407 AA--GESWASIR-RNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQEDGSR 456 (456) T ss_pred Hc--CCChHHHH-HhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhcCCCCCCC Confidence 21 23465555 5789999998875433 3333322 11 123444444 No 159 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=93.06 E-value=0.009 Score=31.81 Aligned_cols=325 Identities=12% Similarity=0.133 Sum_probs=152.9 Q ss_pred CCCcEEEEeeccchhhhHHHHHHHHHHHHHHHH-hcchhhhHHHHHh----hccccceeEEEeeecCCCCCCeeeeEecC Q lcl|NC_018087. 103 EGFDVVSIDLDQTAFTENIRNLISDEFNSVLNM-LNFQRKGSDHFKR----WYVDSRVFFHKIINPNRPKDGIIELRRLD 177 (520) Q Consensus 103 ~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~l-l~f~k~g~~~fRr----WYvDgri~~hkvid~~~~k~GI~elr~lD 177 (520) --.-|+.|. ++.+ .+......+|+. =|-..++.++.+. |.++|--|..++-| .+. -+++|..|+ T Consensus 1 ia~lp~~~~-~~~~-------~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~-~~G--~~~~L~~l~ 69 (348) T protein:vir:93 1 MASLPLKMY-EDYK-------VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERD-IYH--QPSKLFLLN 69 (348) T ss_pred CcccceEeE-ecCc-------CcccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEC-CCC--cEEEEEEEc Confidence 111222221 1111 111222233332 2334466665554 67789988887765 222 388999999 Q ss_pred ccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccccCCCCcchhhhHHHH Q lcl|NC_018087. 178 PRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAV 257 (520) Q Consensus 178 Pr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~ai 257 (520) |..++.+.+- ++.. ..|.++.+ .+..+.++.+.|+|.- +....++-..+|.+..|. T Consensus 70 ~~~v~~~~~~-----~~~~-----~~y~~~~~-------------~g~~~~~~~~eiih~r-~~~~~~~~~G~s~~~~~~ 125 (348) T protein:vir:93 70 PDVVEMLIEN-----QSRE-----LYYSIHAA-------------TGNKLIVHNMDMLHFK-HIVASNMVQGISPIDVLK 125 (348) T ss_pred CCceEEEEeC-----CCcE-----EEEEEEcC-------------CCeEEEEccccEEEec-CCCCCCceeeccHHHHHH Confidence 9999875331 1111 11222211 1234678999998872 222334445678888888 Q ss_pred HHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccc Q lcl|NC_018087. 258 KPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQR 337 (520) Q Consensus 258 k~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpR 337 (520) ..+.....++... .....+.| . +...-.+++-+.++++..+.....|+| .|. .+ . |+ T Consensus 126 ~~i~~~~~~~~~~-~~~~~~~~-~-~i~~~~~~l~~e~~~~~~~~~~~~~~n-------~~~------~~-v-----l~- 182 (348) T protein:vir:93 126 NTTDFDNAVRTFN-LTEMQKPD-S-FMLKYGSNVSTEKRQQVLEDFKQYYEE-------NGG------IL-F-----QE- 182 (348) T ss_pred HHHHHHHHHHHHH-HHhcCCCc-e-eEEecCCCCCHHHHHHHHHHHHHHhhc-------CCC------ee-e-----cC- Confidence 8877666666653 33333333 2 333344566666665555554443322 232 11 1 11 Q ss_pred cCCCCCcceeecCCCCCcChHHH---HHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHH-HHHHHHHHH Q lcl|NC_018087. 338 RDGKAVTEVETLPGMTGMNEMDD---ILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKF-ISELQHKFE 413 (520) Q Consensus 338 ReGgrgTEIsTLpGg~nLgei~D---V~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KF-I~rLr~rFs 413 (520) .|.+++.|. .+.-+++= -++..+.+.++++||..-|...++.+ ++...+.. ..|.++ +.-+-.+++ T Consensus 183 ----~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~-~~~~e~~~---~~~~~~~l~P~~~~ie 252 (348) T protein:vir:93 183 ----PGVEIEPLP--KKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTN-FAKNEELN---RFYLQHTLLPIVKQYE 252 (348) T ss_pred ----CCceEEEcC--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC-cccHHHHH---HHHHHHHHHHHHHHHH Confidence 256777774 34444433 34678889999999999887443322 12221111 123332 233333322 Q ss_pred HHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCH Q lcl|NC_018087. 414 EIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSD 493 (520) Q Consensus 414 ~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tD 493 (520) .- +-..++++.+|.. ..+|+|+.. ++...+ ++.|.+++..+-. .-+++.+-++. .|++.+ T Consensus 253 ~~---------l~~~l~~~~~~~~-g~~i~fd~~------~l~~~d-~~~~a~~~~~~~~--~G~~T~NE~R~-~~g~~p 312 (348) T protein:vir:93 253 EE---------FNRKLLTKTDREK-NRYFKFNVK------SYLRAD-SATQAEVYFKAVR--SGYYTINDIRE-WEDLPP 312 (348) T ss_pred HH---------HHHhhCCcccccC-cceEEeech------hhhccC-HHHHHHHHHHHHh--CCCCCHHHHHH-HhCCCC Confidence 22 2234566666652 223444332 222222 3556666655522 23567777764 356543 Q ss_pred HHHHHHHHHHHHhhhcCCcc--------------CCccccC Q lcl|NC_018087. 494 EDIAAERKLIDEELSDKIFN--------------PPEPEEI 520 (520) Q Consensus 494 eeI~~~~kqi~~E~~~~~~~--------------~p~~e~~ 520 (520) -+ .-++-+ .+...++ +.+.+|= T Consensus 313 ~~--ggD~~~---~~~n~~~~~~~~~~~~~~~gg~~n~~~~ 348 (348) T protein:vir:93 313 VE--GGDKPL---ISGDLYPIDTPLELRKSLKGGDKNVNES 348 (348) T ss_pred CC--CcCeEe---ecccccccccchhhcccccCCCCCcCCC Confidence 21 000000 0000000 1111111 No 160 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=93.05 E-value=0.009 Score=31.81 Aligned_cols=421 Identities=11% Similarity=0.051 Sum_probs=170.1 Q ss_pred chhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeeccccccccccccccc--ccccccchhHHHHH--------HH Q lcl|NC_018087. 8 DLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLY--GSQDPTATSTRELI--------NT 77 (520) Q Consensus 8 ~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~--~~~~~~~~~~~~LI--------~~ 77 (520) +|+.=.|-.-.|-..+-+. .+-.-. ++.|.-+. .+......-...+| .+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~------~~~~~~---------------n~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r 59 (511) T protein:vir:10 1 MLKVNEFETDTDLRGNINY------LFNDEA---------------NVVYTYDGTESDLLQNVNEVSKCIEHHMDYQRPR 59 (511) T ss_pred Cccccchhhhhhhhhhhhh------hhhhhh---------------cCCccCchhhhhcccCHHHHHHHHHHHHHhhHHH Confidence 4444444333333321111 000000 11111000 00000000111111 23 Q ss_pred HHHHhhccchhHHH----------------------HhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHH Q lcl|NC_018087. 78 YRSLLNNYEVDNAV----------------------QEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNM 135 (520) Q Consensus 78 YR~ma~~pEvd~Ai----------------------~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~l 135 (520) |+.+..+.+=...| .-||+..+-+= -..||++..++.+ ..+.++.+.+- T Consensus 60 ~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl-~g~p~~~~~~d~~--------~~~~l~~~~~~ 130 (511) T protein:vir:10 60 LKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYF-LGNPIQYQDDDKD--------VLEAIEAFNDL 130 (511) T ss_pred HHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhh-cccCceeecCchH--------HHHHHHHHHhh Confidence 33333332211111 11222211111 1266666655442 33455666666 Q ss_pred hcchhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCC--CCccccccc----------ceec Q lcl|NC_018087. 136 LNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTK--MENGVKVVK----------GYRE 203 (520) Q Consensus 136 l~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~--~~~~~~~~~----------~~~e 203 (520) -+|+....++.+.+.+-|+-|.+.-+| + +|-..+..+||+.+.+|.+-..+ ..-+++.+. .+.. T Consensus 131 n~~~~~~~~~~~~~~i~G~ay~~vy~d-e---dg~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~ 206 (511) T protein:vir:10 131 NDVESHNRSLGLDLSIYGKAYEIMIRN-Q---DDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFT 206 (511) T ss_pred cCHHHHHHHHHHHHHhcCeeEEEEEeC-C---CCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecccCccceEEE Confidence 788899999999999999988877665 2 46788999999999998653322 122222111 0111 Q ss_pred ceeecCcc-cccccccceec----------CCcceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHHH-HHHHHH Q lcl|NC_018087. 204 YFLYDTEL-ESYQCGHQHFA----------AGTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKL-LEDAMM 271 (520) Q Consensus 204 y~~y~~~~-~~~~~~~~~~~----------~~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m-~EDalV 271 (520) +-+|++.. ..+..++.... +..--+|| |++. +++....|-++..+....-+.. +=+... T Consensus 207 ~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vP---vv~f------~nn~~g~gd~e~v~~liDa~d~~~S~~~~ 277 (511) T protein:vir:10 207 VDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMP---ITEF------SNNERRKGDYEKVITLIDLYDNAESDTAN 277 (511) T ss_pred EEEEeCCcEEEEEecCCCcccccccccccccccCccee---EEEe------cCCCCCCCchhhhHHHHHHHHHHHHHHHH Confidence 22555432 22211111000 00001222 2221 1222345666665555443332 122222 Q ss_pred HHHHhcCccceEEE---ccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhh-hcccc--cCCCCCcc Q lcl|NC_018087. 272 IYRITRAPDRRVFY---IDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTED-YWLQR--RDGKAVTE 345 (520) Q Consensus 272 IyRi~RApeRRvFy---IDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlED-ywLpR--ReGgrgTE 345 (520) .-+-++.|-+-+.- .|.+.+++.+ ...+-.+.. .+.+. ...+.|.. T Consensus 278 ~~~~~~~~~lv~~g~~~~~~~~~~~~~----------------------------~~~~~~~~~~~~~~~~~~~~~~~~d 329 (511) T protein:vir:10 278 YMSDLNDAMLLIKGNLNLDPVEVRKQK----------------------------EANVLFLEPTVYADSEGRETEGSVD 329 (511) T ss_pred HHHHhhCceeeeeccccCCchhhccch----------------------------hccceecccccccccccccCCCCcc Confidence 22333444333221 0111111100 001100000 00000 01112334 Q ss_pred eeecCCCCCcChH-HHHHHHHHHHHHhcCCChhhccCCCccccccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 346 VETLPGMTGMNEM-DDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAIS--RDELSFDKFISELQHKFEEIFLSPLKS 422 (520) Q Consensus 346 IsTLpGg~nLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eIt--RDElkF~KFI~rLr~rFs~if~d~Lk~ 422 (520) +..|-...+...+ .-+.-+.+.+|.-.++|-- ..++ +. |..+..+ .-......-+.+.+..|..-+...++. T Consensus 330 ~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~--~~~~-~~--~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l 404 (511) T protein:vir:10 330 GGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNM--KDDN-FS--GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKL 404 (511) T ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc--cccc-cc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4444443333333 3445667788888888862 2221 11 2222222 222223344555566666666555544 Q ss_pred HHHhcC----CCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCH---HH Q lcl|NC_018087. 423 NLLLKR----VITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSD---ED 495 (520) Q Consensus 423 QLiLkg----i~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tD---ee 495 (520) =+-+-+ +-...+| ..+++.|...--=.+ .+.++++..+. | .+|.+++++. |...+ +| T Consensus 405 i~~~~~~~~~~~~~~d~----~~i~i~f~~~~p~d~-------~~~~~~~~kl~---G-~iS~et~~~~-l~~v~d~~~E 468 (511) T protein:vir:10 405 LETILKNTRSIDANKDF----NTVRYVYNRNLPKSL-------IEELKAYIDSG---G-KISQTTLMSL-FSFFQDPELE 468 (511) T ss_pred HHHHHHhhCCccccccc----ceeeEEeCCCCCcCH-------HHHHHHHHHHh---c-cCcHHHHHHh-CCCCCCHHHH Confidence 222212 1223333 357788865333333 33345555553 3 3799999977 55543 45 Q ss_pred HHHHHHHHHHhhhcC---CccCCccccC Q lcl|NC_018087. 496 IAAERKLIDEELSDK---IFNPPEPEEI 520 (520) Q Consensus 496 I~~~~kqi~~E~~~~---~~~~p~~e~~ 520 (520) |+.++++-+++.+.. ...+|++.+= T Consensus 469 ~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 496 (511) T protein:vir:10 469 VKKIEEDEKESIKKAQKGIYKDPRDIND 496 (511) T ss_pred HHHHHHHHHHHHHHHhhhcccCCCCCCC Confidence 554444433332221 1222221111 No 161 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=92.89 E-value=0.0096 Score=31.65 Aligned_cols=417 Identities=12% Similarity=0.074 Sum_probs=182.6 Q ss_pred Cccccccchh-----------hhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccch Q lcl|NC_018087. 1 MSMLADSDLK-----------MFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTAT 69 (520) Q Consensus 1 ~~~~~~~~l~-----------~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~ 69 (520) |-=+|+..|. |+.-|.++-. .++.. .-.|.|-..-.+.. .. T Consensus 1 ~~~~p~~~l~~~~~~~~~~~~l~~~~~~~~~-r~~~~-----------------------~~YY~g~~~i~~~~----~~ 52 (479) T protein:vir:99 1 MIDLPDEDLSSEGLAKYLETKVFPKMNTECE-RLDDF-----------------------EAWTKNGQEVPDLA----TR 52 (479) T ss_pred CccCCcccCChhHHHHHHHHHHHHHHHHHhH-HHHHH-----------------------HHHHhcCCcccccc----cc Confidence 4444443221 2222211110 00000 00111111000100 01 Q ss_pred hHHHHHHHHHHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhh Q lcl|NC_018087. 70 STRELINTYRSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRW 149 (520) Q Consensus 70 ~~~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrW 149 (520) +.....+..+.++.+.=+.-+|+..++-+. +..+...+.+.. +++..|++.=+|+....++++.- T Consensus 53 ~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~-------~~gf~~~d~~~~--------~~~~~i~~~N~~d~~~~~~~~~a 117 (479) T protein:vir:99 53 HKNKEREVLQQLSRKPWMGLMVNSFAQQLI-------VDGYRKTGTNEN--------AKGWDTWRLNQMDKQQFWLNRAV 117 (479) T ss_pred cCChhHHHHHHHhhcCcHHHHHHHHHhhcc-------cccccCCCchhh--------HHHHHHHHhcChhHHHHHHHHHH Confidence 112333444444334445556666555331 111222222222 23444555557788888899999 Q ss_pred ccccceeEEEeeecC--CCCCCeeeeEecCccceeeeeeccCCCCccc--cc--ccceecceeecCcc-cccccccceec Q lcl|NC_018087. 150 YVDSRVFFHKIINPN--RPKDGIIELRRLDPRNVQFVRELDTKMENGV--KV--VKGYREYFLYDTEL-ESYQCGHQHFA 222 (520) Q Consensus 150 YvDgri~~hkvid~~--~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~--~~--~~~~~ey~~y~~~~-~~~~~~~~~~~ 222 (520) .+-|+-|. .|+-.. ...+|...++.+||+.+-.+-+ ....... .. ......+.+|.... ..+..+..... T Consensus 118 ~~~G~af~-~v~~~~~~~d~~g~~~i~~~~p~~~~~iyd--d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 194 (479) T protein:vir:99 118 LTFGYAFI-KVTSGISPLDGTTVARIKCIDPRDAFAIWE--DPYWDEWPKYLLERQPNGQYWWWTEEDYSIFEFKQGKFI 194 (479) T ss_pred hhcCceEE-EEecCCCCcCCCCceEEEEechhheEEEec--CCcccceeeEEEeecCceeEEEEecceEEEEEecCCcee Confidence 99999655 443211 2356888899999999887632 1111111 11 11111222222111 11111100000 Q ss_pred -----CCcceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHH-HHHHHHHHHHHHhcCccceEEEccCCCCchHHH Q lcl|NC_018087. 223 -----AGTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQL-KLLEDAMMIYRITRAPDRRVFYIDTGNMPARKA 296 (520) Q Consensus 223 -----~~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL-~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KA 296 (520) ++.-=++| .|.|+|..-.++. ..|=++..+.....+ +.+-+..++-.....|.|-++-. .++.... T Consensus 195 ~~~~~~h~~g~vP--vv~f~n~~~~~~~---g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~---~~~~~~~ 266 (479) T protein:vir:99 195 YRETVSHDYGHIP--FVRYVNVMDLRGV---CYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATGL---MLPEGAN 266 (479) T ss_pred eccccccCCCCcc--eEEeecCCCcCcC---CcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCC---Ccccccc Confidence 00000122 3556664333322 234333333322222 34556666667777777765421 1211000 Q ss_pred HHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCCh Q lcl|NC_018087. 297 AQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPL 376 (520) Q Consensus 297 eqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~ 376 (520) . ....+.-+.++ .|.. .+.+.++-++|+..--.-++-++=.-..+...-++|. T Consensus 267 ~--~~~~~~~~~~~----------------------i~~~---~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~t~~p~ 319 (479) T protein:vir:99 267 A--DQEKMRFAQES----------------------MLIS---QNEKASFGAIPAAPLDGLLNAYKESLLEFLALAQLPP 319 (479) T ss_pred c--chhcccccccc----------------------ceee---cCCCceEEEecccchHHHHHHHHHHHHHHhccCCCCH Confidence 0 00000000011 1211 1224567777764322222323333334444456666 Q ss_pred hhccCCCccccccccchhhHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccch Q lcl|NC_018087. 377 SRIPDEQTQNVFDMSTAISRDEL-----SFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSY 451 (520) Q Consensus 377 SRl~~~~~~~~~G~~~eItRDEl-----kF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~ 451 (520) .=+ |.++..|-+-+ ..-.-+.+.|+.|..-+...|+.=+.+.|..-..++ -.|.+.|..-.. T Consensus 320 ~~~---------g~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~~~~~~~~~----~~i~~~w~~~~~ 386 (479) T protein:vir:99 320 HIA---------GQIVNVAADALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKIEGRTEEATD----LDFTITWQDVTI 386 (479) T ss_pred HHc---------ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccc----eeeeEEecCCCC Confidence 333 22222333333 355678889999999999999987777876433222 236777743222 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhh-----cCCccCCccccC Q lcl|NC_018087. 452 FSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELS-----DKIFNPPEPEEI 520 (520) Q Consensus 452 f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~-----~~~~~~p~~e~~ 520 (520) =+. .+..+.+.++..- | .+|.+++++.+...|+.+++.+.+..+++.. +.+.+.+++.+- T Consensus 387 ~s~-------~~~ad~~~kl~~a-g-~is~et~l~~l~gv~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (479) T protein:vir:99 387 QSL-------AQFADAWAKMVES-L-KIPAEGVWDMIPNLDQSTVNGWKEIYDREGDFGKYMRKLQNGPDPAEQ 451 (479) T ss_pred CCH-------HHHHHHHHHHHhc-C-CCCHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccc Confidence 122 2344555554322 3 3899999999889999998877655444421 122222221111 No 162 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=92.83 E-value=0.0098 Score=31.60 Aligned_cols=407 Identities=10% Similarity=0.086 Sum_probs=167.3 Q ss_pred hhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchhHH Q lcl|NC_018087. 11 MFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNA 90 (520) Q Consensus 11 ~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~A 90 (520) |+.-++ .|++- + .....+.....-...-.... .+..+.+|+.+..+.+=... T Consensus 1 ~~~~~~------------------~~~~~------~---~~~~~~~~~~~~~~~i~~~~-~~~~~~~~~~~~~yy~g~~~ 52 (479) T protein:vir:79 1 MLNIYI------------------SETDL------I---KVQLKKESTINLVKVIEHYI-LKHRPEKYKQGEEYYYGNTD 52 (479) T ss_pred CCCcee------------------cccce------E---eeccccCChhHHHHHHHHHH-hhhhHHHHHHHHHHhccCCc Confidence 000000 00000 0 00000000000001001110 11233445555544432221 Q ss_pred -----------------------------HHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhh Q lcl|NC_018087. 91 -----------------------------VQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRK 141 (520) Q Consensus 91 -----------------------------i~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~ 141 (520) ...||+..+-+= -..||++..++.+..+ .+ +.+.+ =+|+.. T Consensus 53 i~~~~~~~~~~~~~~~~~~~~~~ki~~~~~~~Ivd~~~~~l-~g~p~~~~~~~~~~~~----~~----~~~~~-n~~~~~ 122 (479) T protein:vir:79 53 VNNKRRYYLLDGAKVDDFTKVNNKAINNYHKLLVDQKVGYS-VGNPIVFNADDDNLTK----LL----NDLLG-EEFDDT 122 (479) T ss_pred ccccccccccccccccccccCcceeecchHHHHHHHHHhhh-hcCCceeccCCHHHHH----HH----HHHHh-cCHHHH Confidence 122333222111 1256677666543322 22 22222 388899 Q ss_pred hHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeecc--CCCCccccccc-------ceecceeecCcc- Q lcl|NC_018087. 142 GSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELD--TKMENGVKVVK-------GYREYFLYDTEL- 211 (520) Q Consensus 142 g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~--~~~~~~~~~~~-------~~~ey~~y~~~~- 211 (520) ..++.+...+-|+.|.+.-+|. +|-..++.+||+.+.++.+-. .+..-+++.+. .+..+-+|.+.. T Consensus 123 ~~~~~~~~~~~G~~~~~v~~d~----~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~y~~~~~~~~~~~~~e~y~~~~i 198 (479) T protein:vir:79 123 ITELYLNASNKGVEWLHPYINR----KGEFKYVIIPAEEAIPIWDSKRQRELVAFIRFYYIEDIDGNKIKRVEYYTENDI 198 (479) T ss_pred HHHHHHHHHhcCeEEEEEEeCC----CCceEEEEEccceeEEEEeCCCCCceEEEEEEEEEeecCCceEEEEEEEeCCcE Confidence 9999999999999999877652 366789999999998875321 11122222110 111122332221 Q ss_pred cccccccceec--------------CCcceecCcc-----c--EEEeecccccCCCCcchhhhHHHHHHHHHHHH-HHHH Q lcl|NC_018087. 212 ESYQCGHQHFA--------------AGTKIKIPYS-----A--MVYAHSGLVDCCGKNIIGYLHRAVKPANQLKL-LEDA 269 (520) Q Consensus 212 ~~~~~~~~~~~--------------~~~~~~I~~~-----a--I~y~hSGL~d~~~~~~~syL~~aik~~NqL~m-~EDa 269 (520) ..|......+. +.....+... . |+++ +|+....|-++..+....-+.. +-+. T Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~------~nn~~g~sd~~~v~~liDa~d~~~S~~ 272 (479) T protein:vir:79 199 TYFIERGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPF------KNNEKCVSDLTFYKSLIDIYDNNISTL 272 (479) T ss_pred EEEEecCCcccccccccccccccccccccccccccccCCCcccEEEe------cCCCCCCcchhhhHHHHHHHHHHHHHH Confidence 11111111000 0000000000 0 1111 1233345666665555554442 3344 Q ss_pred HHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeec Q lcl|NC_018087. 270 MMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETL 349 (520) Q Consensus 270 lVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTL 349 (520) ...-+.++-|-.-+--.+ |. ...+-.. . ++. .+.+. + +|+.+ ++.| T Consensus 273 ~~~~~~~~~~~~v~~g~~-~~----~~~~~~~--------~---------~~~-~~~i~------~---~~~~~--~~~l 318 (479) T protein:vir:79 273 ADNLDEIQEVIYVLKEYP-GT----SLQEFID--------N---------IRY-YKSIK------V---DGGGG--VDKL 318 (479) T ss_pred HHHHHHhhCceeeeecCC-cc----ccccchh--------h---------hhh-cccee------c---CCCCc--ceEE Confidence 445555555543321111 11 0000000 0 000 00111 1 12222 3343 Q ss_pred CCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018087. 350 PGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRD--ELSFDKFISELQHKFEEIFLSPLKSNLLL 426 (520) Q Consensus 350 pGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRD--ElkF~KFI~rLr~rFs~if~d~Lk~QLiL 426 (520) ....+... -.-++-+++.+|+...+|-. ..++ +|.+|..+.. ...-..-+.+.+..|...+.++++.=+-+ T Consensus 319 ~~~~~~~~~~~~~~~l~~~i~~~s~~p~~--~~~~----~gn~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~ 392 (479) T protein:vir:79 319 EINIPVEAKKELLDRLEKNIIIFGQGVNP--ESQN----TGDKSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEY 392 (479) T ss_pred eccCCHHHHHHHHHHHHHHHHHHhCcccc--cccc----ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33333322 23355667778888898852 2222 2444443322 22233446667777777777766653333 Q ss_pred cCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHh Q lcl|NC_018087. 427 KRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEE 506 (520) Q Consensus 427 kgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E 506 (520) -++....+++ ...+.+.|...---.+...+ ++++.+. | .+|.+++++. |...++ .+++-++|++| T Consensus 393 ~~~~~~~~~~--~~~i~i~f~~~~p~~~~~~a-------~~~~kl~---g-~iS~et~l~~-l~~v~d-~~~E~~ri~~E 457 (479) T protein:vir:79 393 LKISGNKSYD--YKTVQITFNHSMIINEAEKI-------DMAAKST---G-IVSDETIVSN-HPWVED-VNDELERLKKQ 457 (479) T ss_pred HhccCCCccc--cccceEEeCCCCCcCHHHHH-------HHHHHHh---c-cCcHHHHHHh-CCCCCC-HHHHHHHHHHH Confidence 3333333333 23577888655444443333 3444443 4 3799999977 555432 22333344333 Q ss_pred hhc--------CCccCCccccC Q lcl|NC_018087. 507 LSD--------KIFNPPEPEEI 520 (520) Q Consensus 507 ~~~--------~~~~~p~~e~~ 520 (520) ..+ +-..++..+|- T Consensus 458 ~~~~~~~~~~~~~~~~~~~~e~ 479 (479) T protein:vir:79 458 EDTQKEYDDLIPNNQDGVIDET 479 (479) T ss_pred HHHHHHHHhccCcccCCCcCcC Confidence 321 11122223333 No 163 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=92.73 E-value=0.01 Score=31.50 Aligned_cols=405 Identities=15% Similarity=0.132 Sum_probs=173.6 Q ss_pred cc-cchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhh Q lcl|NC_018087. 5 AD-SDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLN 83 (520) Q Consensus 5 ~~-~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~ 83 (520) .| .--++|-|.. ++.-.++. .+.+.... ..-..+|+.+.. T Consensus 1 ~~~~~~~~~~~~~--~~~~~~~~------------------------------i~~~i~~~-------~~~~~r~~~~~~ 41 (453) T protein:vir:73 1 MNLKPIKLMTYSR--DEEITDKV------------------------------VNDFMKKH-------QEEVERYEYLGN 41 (453) T ss_pred Cccccceeeeccc--cccCCHHH------------------------------HHHHHHHH-------HHHHHHHHHHHH Confidence 00 0011111110 00000000 00000000 011223333333 Q ss_pred ccchhHHH--------------------HhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhH Q lcl|NC_018087. 84 NYEVDNAV--------------------QEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGS 143 (520) Q Consensus 84 ~pEvd~Ai--------------------~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~ 143 (520) +++=...| .-||+..+-+ -..+|+++..++. ...+..+.++.--+|+.... T Consensus 42 yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~-l~g~~~~~~~~d~--------~~~~~l~~~~~~n~~~~~~~ 112 (453) T protein:vir:73 42 MYKGIMEISSQKAKDSWKPDNRLTNNFAKYIVDTFVGY-FNGIPIKKTHDDK--------SVLEAMQLFDNLNDMEDEES 112 (453) T ss_pred HhccccchhcCCCCCccCccceeecchHHHHHHHhhhh-hcccCceeecCCh--------HHHHHHHHHHHhcChhHHHH Confidence 33322211 1122111111 1124555544432 23334555555568899999 Q ss_pred HHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCC--cccc---cccceecceeecCcc-cccccc Q lcl|NC_018087. 144 DHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKME--NGVK---VVKGYREYFLYDTEL-ESYQCG 217 (520) Q Consensus 144 ~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~--~~~~---~~~~~~ey~~y~~~~-~~~~~~ 217 (520) ++.+..++-|+-|.+.-.|. +|-..+..++|+.+.++.+-..... -.++ ...+.....+|.+.. ..+... T Consensus 113 ~~~~~~~~~G~~~~~v~~d~----~~~~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~ 188 (453) T protein:vir:73 113 ELAKIACVYGRAYELMYQNE----STESEVIYCSPLNVFMVYDDSIKQKPLFAVYYGFDEEGNLSGTVYTLLETISITGK 188 (453) T ss_pred HHHHHHHhcCeEEEEEEeCC----CCceEEEEEcccceEEEEeCCCCceeEEEEEEEEecCceEEEEEEeCCeEEEEEec Confidence 99999999999999887763 3667889999999988865322111 1111 111222234444432 111111 Q ss_pred cceecCCc----ce-ecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCC Q lcl|NC_018087. 218 HQHFAAGT----KI-KIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLK-LLEDAMMIYRITRAPDRRVFYIDTGNM 291 (520) Q Consensus 218 ~~~~~~~~----~~-~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~-m~EDalVIyRi~RApeRRvFyIDvGnl 291 (520) ...+.... .+ +||- |.|++ +....|-++..+.....+. ++-+....-+.++.|.+-+.-. ++ T Consensus 189 ~~~~~~~~~~~~~~g~vPv--v~~~n-------~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~---~~ 256 (453) T protein:vir:73 189 AGEVKFGESTYNVYSDLPI--VEYNF-------NEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGA---EV 256 (453) T ss_pred CCceEEccceeccCCceeE--EEecC-------CCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecC---CC Confidence 11100000 00 2221 22222 2223444454444443332 2334444456667777655422 11 Q ss_pred chHHHHHHHHHHHHhhc-ceeEeecCCCccccccccchhhhhhcccccCC--CCCcceeecCCCCCcChHH-HHHHHHHH Q lcl|NC_018087. 292 PARKAAQHMQHIMNSHR-NRISYDARTGKVKNQANMMALTEDYWLQRRDG--KAVTEVETLPGMTGMNEMD-DILYFRKA 367 (520) Q Consensus 292 pk~KAeqyl~~im~~~k-nklvYd~~TGev~d~~~~msmlEDywLpRReG--grgTEIsTLpGg~nLgei~-DV~YF~kk 367 (520) +. + -++..+ ++.. -+.+.. +...| +.+.++.+|-...+.+.+. -+.-+++. T Consensus 257 ~~----~----~~~~~~~~~~~----------------~~~~~~-~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~ 311 (453) T protein:vir:73 257 DE----E----DAKNIKDNRLI----------------NFFDKN-SNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERS 311 (453) T ss_pred Cc----h----hhhcccccccc----------------cccccc-cccccccccCceeEEeeecCCHHHHHHHHHHHHHH Confidence 11 1 111111 1110 000000 11111 1223466665444444333 35667888 Q ss_pred HHHhcCCChhhccCCCccccccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CCChhhHHhhhhceEE Q lcl|NC_018087. 368 LYMALRVPLSRIPDEQTQNVFDMSTAIS--RDELSFDKFISELQHKFEEIFLSPLKSNLLLKR-VITEDEWEAELNNIKI 444 (520) Q Consensus 368 Ly~aL~VP~SRl~~~~~~~~~G~~~eIt--RDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkg-i~t~eew~~~~~~I~~ 444 (520) +|...++|- +..++ ||.++... --+.....-+.+.++.|..-+...++.=+-+.+ .-...+| ..|.+ T Consensus 312 I~~~s~~p~--~~~~~----~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~----~~i~v 381 (453) T protein:vir:73 312 IFQFTMAAN--ISDEN----FGNSSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNASNKDAW----KDIEY 381 (453) T ss_pred HHHHhCCcc--cCccc----ccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccc----ccceE Confidence 899999984 33322 24333322 223334455667777777777776664333322 2222333 35778 Q ss_pred EeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCH---HHHHHHHHHHHHhhhcCCc-cCCccccC Q lcl|NC_018087. 445 VFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSD---EDIAAERKLIDEELSDKIF-NPPEPEEI 520 (520) Q Consensus 445 ~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tD---eeI~~~~kqi~~E~~~~~~-~~p~~e~~ 520 (520) .|....--.+. +.+++++.+. | .+|.+++++. |..++ +||+.++++-+++.+...- ....+++. T Consensus 382 ~f~~~~p~~~~-------~~a~~~~k~~---g-iis~et~~~~-~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 449 (453) T protein:vir:73 382 TFTRNEPKDIK-------EQAETANILK---G-ITSEETALSV-ISVIPDVQAEMEKIKKKKLLQLSLTRTSNLVRMKQM 449 (453) T ss_pred EeCCCCCCCHH-------HHHHHHHHHh---c-cCcHHHHHHh-CCCCCCHHHHHHHHHHHHHHHHHHHHhccCCcchhh Confidence 88654443443 3344455554 3 4799999976 55543 5665555533332222111 11122222 No 164 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=92.70 E-value=0.01 Score=31.47 Aligned_cols=392 Identities=11% Similarity=0.077 Sum_probs=176.1 Q ss_pred cccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHh Q lcl|NC_018087. 3 MLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLL 82 (520) Q Consensus 3 ~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma 82 (520) |.++ .+.+... +.+.|...+--... -|+.. +.-...+ | -...+ T Consensus 1 m~~~-------~~~~~~~------------~~~~~~~~~~~~~~-------~g~~~---s~~~~~v-~-------~~~al 43 (419) T protein:vir:80 1 MFFS-------RQLLSNL------------GQTQPGSGGWVSAL-------LGSAR---SEAGQVV-T-------PASAL 43 (419) T ss_pred CCcc-------ccccccc------------CcCCCCcchhhHHh-------hcccc---cccCccc-C-------hHHhh Confidence 3222 1111110 11112111100000 01100 0000011 1 12345 Q ss_pred hccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcc----hhhhHHHHH----hhccccc Q lcl|NC_018087. 83 NNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNF----QRKGSDHFK----RWYVDSR 154 (520) Q Consensus 83 ~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f----~k~g~~~fR----rWYvDgr 154 (520) ++|-|..||+.|.+.+.-.+ +.|.=... +. ++.+ .-..+..+|+. ..++.++.+ .+.+.|- T Consensus 44 ~~~~v~~cv~~ia~~ia~lp-----~~~~~~~~---~~-~~~~--~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gn 112 (419) T protein:vir:80 44 SLTVLQNCVTLLAESIAQLP-----VELYERSG---DD-RKPA--TDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGN 112 (419) T ss_pred ccHHHHHHHHHHHHhhccCc-----eEEEEecC---CC-cccc--cccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCC Confidence 78999999999999876442 22211110 00 0111 01123334332 344555444 4677799 Q ss_pred eeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccE Q lcl|NC_018087. 155 VFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAM 234 (520) Q Consensus 155 i~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI 234 (520) -|.-++-+. ++ -+.+|.+|+|..+...+.- ++.. .|.+.+ ...+|.+-| T Consensus 113 a~~~i~r~~-~G--~~~~L~~i~~~~v~i~~~~-----~~~~------~y~~~~-----------------~~~~~~~~i 161 (419) T protein:vir:80 113 SYSFIDRDQ-DG--VIQGLYPLDNEAVTVMKGP-----DLKP------MYRVAG-----------------ADPLPQRLV 161 (419) T ss_pred eEEEEEECC-CC--cEEEEEEecCceEEEEECC-----CceE------EEEEcC-----------------ccccchhhe Confidence 888877553 22 3899999999999875321 1111 111111 113567777 Q ss_pred EEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEee Q lcl|NC_018087. 235 VYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYD 314 (520) Q Consensus 235 ~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd 314 (520) +|.-. ...++...+|-++.|..++.....+++...=+---.+--+-++.++... +..+.++-++.+...++...- T Consensus 162 ~h~~~--~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~-~~~~~~~~~~~~~~~~~~~~~-- 236 (419) T protein:vir:80 162 HHVRW--MSINGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDA-PALKDQASVDRITDGWNAKFG-- 236 (419) T ss_pred EEecC--CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCC-CcccCHHHHHHHHHHHHHHhc-- Confidence 66642 4566777789999999999988888877665555556666677776422 222222223333333333221 Q ss_pred cCCCccccccccchhhhhhcccccCCCCCcceeecCCCCC-cChHHHHHHHHHHHHHhcCCChhhccCCCccccccccch Q lcl|NC_018087. 315 ARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTG-MNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTA 393 (520) Q Consensus 315 ~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~n-Lgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~e 393 (520) | .++..+.+ .++ .|.+++-|.-... +.-++-.++-.+.+.++++||..-|...++.. + +. T Consensus 237 ---g-~~n~g~~~-vl~----------~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t-~---~n 297 (419) T protein:vir:80 237 ---G-SGNAKKVA-LLQ----------EGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERAT-F---SN 297 (419) T ss_pred ---C-ccccCCce-ecC----------CCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCC-c---cc Confidence 1 11111121 221 2456666542111 12233445778999999999998886432211 1 11 Q ss_pred hhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018087. 394 ISRDELSFDKF-ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLME 472 (520) Q Consensus 394 ItRDElkF~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~ 472 (520) +.-.-+.|..+ |.-+..++ .+.|- +.++++.++.. ..+.|..+ ++... -+..|++.++.+- T Consensus 298 ~e~~~~~f~~~~l~P~~~~i----e~~l~-----~kll~~~~~~~----~~i~fd~~----~l~~~-d~~~~~~~~~~~~ 359 (419) T protein:vir:80 298 IEHQSLQFVIYTLLPWVKRH----EQAKT-----RDLLLPSERKQ----YFIEYNLA----GLLRG-DQSSRYAAYAVGR 359 (419) T ss_pred HHHHHHHHHHHHHHHHHHHH----HHHHh-----hhccCccccCC----eEEEEech----hhhcc-CHHHHHHHHHHHH Confidence 22222345544 22222222 22222 23445555432 23444322 22222 2355666666542 Q ss_pred cccchhhhHHHHHHHHhCCCHHHHHHHHHHH------HHhhhcC-CccCCccccC Q lcl|NC_018087. 473 PYIGKYISNHTAMKDFLQMSDEDIAAERKLI------DEELSDK-IFNPPEPEEI 520 (520) Q Consensus 473 p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi------~~E~~~~-~~~~p~~e~~ 520 (520) . .-+++.+-+++ ++++.+-+ --++.. .....++ --.+|++++- T Consensus 360 ~--~G~~T~NE~R~-~~g~~p~~--gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~ 409 (419) T protein:vir:80 360 Q--WGWLSINDIRR-LENMPPVK--GGDIYLSPMNMVDASKPQPIPMGKTEPTKA 409 (419) T ss_pred h--CCCcCHHHHHH-HhCCCCCC--CcceeeeccccccccccccccCCCCCchhh Confidence 2 23567777774 46665421 111111 0000000 0011111111 No 165 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=92.69 E-value=0.01 Score=31.47 Aligned_cols=387 Identities=12% Similarity=0.126 Sum_probs=176.0 Q ss_pred hcchhhhhhhHHHhhhccCC--CcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchhH Q lcl|NC_018087. 12 FAFWHKVDDTEYDKIINDKA--ESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDN 89 (520) Q Consensus 12 f~~~~~~~~~~~~~~~~~~~--~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~ 89 (520) +|||.+... ....+. .+.+.| .+ .++.|+ ... .-+..+.+|-|.. T Consensus 1 MG~~~~~~~-----~~~~~~~~~~~~~~-------~~----~~~~g~--------~~~---------~~~~al~~~~V~~ 47 (411) T protein:vir:81 1 MGWWSRLTR-----FFRPRNETVDMTNP-------LL----LQWLGV--------DPD---------TPRNQLSEATYFA 47 (411) T ss_pred CchHHHHHh-----hccCcccccccchH-------HH----HHHhcC--------ccc---------ChhhhhccHHHHH Confidence 444433211 000000 000000 00 011111 110 0134467899999 Q ss_pred HHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcc----hhhhHHHH----HhhccccceeEEEee Q lcl|NC_018087. 90 AVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNF----QRKGSDHF----KRWYVDSRVFFHKII 161 (520) Q Consensus 90 Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f----~k~g~~~f----RrWYvDgri~~hkvi 161 (520) ||+-|.+.+.-. |+.+--++. +..+ ++ .-..+..+|+- ..++.++. ..+.++|--|..++. T Consensus 48 ~v~~Ia~~iA~l-----p~~~~~~~~---~~~~-~~--~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r 116 (411) T protein:vir:81 48 CLKILSESLGKL-----PLKMYQKTE---RGIV-KS--DREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQY 116 (411) T ss_pred HHHHHHHhHhhC-----ceeEEEecC---Ccee-ee--cccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEe Confidence 999999977532 111110000 0000 00 01122333322 23444444 445678988888876 Q ss_pred ecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeeccc Q lcl|NC_018087. 162 NPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGL 241 (520) Q Consensus 162 d~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL 241 (520) | ...+.+|.+|+|..+.++++-.. . ......-||.|... ..+..+.++.+.|+|..-+ T Consensus 117 ~----~g~~~~l~~l~~~~v~~~~~~~~-----~-~~~~~~~~~~~~~~-----------~~g~~~~~~~~eiih~k~~- 174 (411) T protein:vir:81 117 S----GPQLQALWILPSQYVTIVVDDRG-----L-LGEKNAIWYRYNDP-----------YDGKMYVFRNDEILHFKTS- 174 (411) T ss_pred c----CCceEEEEEECCceEEEEEcCcc-----c-ccccceEEEEEEec-----------CCceEEEEccccEEEEcCC- Confidence 5 24689999999999998644221 1 11111112222111 1123467899999998533 Q ss_pred ccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccc Q lcl|NC_018087. 242 VDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVK 321 (520) Q Consensus 242 ~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~ 321 (520) .+.++...+|-+..|...+.....+++...-+----+--+-+...+ +.|.+..+++..+.+...|.- .+ T Consensus 175 ~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g----------~~ 243 (411) T protein:vir:81 175 VTFDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYT-GDLNQEARDRLVKGFEQFANG----------SK 243 (411) T ss_pred CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC-CCCCHHHHHHHHHHHHHHhcC----------cc Confidence 3456667788999998888888888776655444434445566665 466666555444443333321 11 Q ss_pred cccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHH Q lcl|NC_018087. 322 NQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELS 400 (520) Q Consensus 322 d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElk 400 (520) +..+.+ .++ .|.+++.|.-...-.| ++-.++..+.+.++++||..-|....+.+. +. +.-.-+. T Consensus 244 n~g~~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~-~n---~e~~~~~ 308 (411) T protein:vir:81 244 NAGKII-PVP----------LGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSSY-AS---AEAQNLA 308 (411) T ss_pred ccCCce-ecC----------CCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCc-hh---HHHHHHH Confidence 111122 121 2456666632111112 344567899999999999988853322111 11 1111223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhh Q lcl|NC_018087. 401 FDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYIS 480 (520) Q Consensus 401 F~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S 480 (520) |.++ .|+--+.. +.+.|-. .+.++.++. ....+.|.-+. +...+ ...|.+.++.+-.- -+++ T Consensus 309 f~~~--~l~P~~~~-ie~~l~~-----~ll~~~~~~---~~~~~~fd~~~----ll~~d-~~~~~~~~~~~~~~--g~~t 370 (411) T protein:vir:81 309 FYVD--TLLYVLKQ-YEEEITY-----KILSNDLIS---QGHYFKFNVNV----ILRAD-IKTQMDSLSTAVQN--GIMT 370 (411) T ss_pred HHHH--HHHHHHHH-HHHHHHh-----hcCChhhcC---CCcEEEeechh----hhccC-HHHHHHHHHHHHhC--CCcC Confidence 4443 13322211 2222222 344554443 22344454332 22211 23345555444221 2556 Q ss_pred HHHHHHHHhCCCHHH-------------HHHHHHHHHHhhhcCCccCCcc Q lcl|NC_018087. 481 NHTAMKDFLQMSDED-------------IAAERKLIDEELSDKIFNPPEP 517 (520) Q Consensus 481 ~~~i~k~IL~~tDee-------------I~~~~kqi~~E~~~~~~~~p~~ 517 (520) .+-++ +++++.+.+ ++...++ ..| -.|. T Consensus 371 ~NE~R-~~~gl~p~~ggD~~~~~~n~~pl~~~~~~---~~k-----gGd~ 411 (411) T protein:vir:81 371 PNEAR-DYLDMPADDYGNNLMANGNYIPLSMLGAN---YGK-----GGDS 411 (411) T ss_pred HHHHH-HHhCCCCCCCCCeeeeccCccchhhhhhh---hcc-----CCCC Confidence 66665 346654321 0000000 000 1111 No 166 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=92.64 E-value=0.011 Score=31.41 Aligned_cols=409 Identities=10% Similarity=0.064 Sum_probs=196.6 Q ss_pred eeccccccccccc------------------------cccccccccc---chhHHHHHHHHHHHhhccchhHHHHhhhc- Q lcl|NC_018087. 45 EVDSQDIAYNGVF------------------------QKLYGSQDPT---ATSTRELINTYRSLLNNYEVDNAVQEIVS- 96 (520) Q Consensus 45 ~i~~~~~a~~g~~------------------------~~~~~~~~~~---~~~~~~LI~~YR~ma~~pEvd~Ai~eIvn- 96 (520) +|+.+++...|.- ..+|.|-..- ..+..+-.+.+|.+.+. +.-||+-.++ T Consensus 1 ~~~~~~~~~~gl~~~~~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~nw--~~~~Vd~~a~r 78 (474) T protein:vir:81 1 MIQQQTVRIPSLSNDENALINGLLAQIENLRWKNLLRTSYYENKRTIQYVGTLIPPQYFNLGLVLGW--TGKAVDALARR 78 (474) T ss_pred CcCCCcCcCCCCChhHHHHHHHHHHHHHHHhhHHHHHHHHhccCCChhhccccccHHHHHHHhhcCh--HHHHHHHHHhh Confidence 3332222111110 0111111000 00000001122222222 2234444332 Q ss_pred ---eeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCCCCCeeee Q lcl|NC_018087. 97 ---DAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIEL 173 (520) Q Consensus 97 ---eaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~el 173 (520) +-+++.++ ..-+. .-..|..-=+|+....+.++.=++-||-|.-.- ...+.++...+ T Consensus 79 l~~~Gf~~~d~----------~~~~~--------~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~--~~~d~~~~~~i 138 (474) T protein:vir:81 79 CNLEGFVWPDG----------DLDSL--------GGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINT--VGEDDEPEALI 138 (474) T ss_pred hcccceECCCC----------Cccch--------HHHHHHHhcChhHHHHHHHHHHHhhCceeEEEe--cCCCCCceeEE Confidence 11111110 00011 123445556677788888889999999984433 33445567778 Q ss_pred EecCccceeeeeeccCCC-Ccccccc----c-ceecceeecCcccc-ccc-ccce--ec--CCcceecCcccEEEeeccc Q lcl|NC_018087. 174 RRLDPRNVQFVRELDTKM-ENGVKVV----K-GYREYFLYDTELES-YQC-GHQH--FA--AGTKIKIPYSAMVYAHSGL 241 (520) Q Consensus 174 r~lDPr~i~~vr~i~~~~-~~~~~~~----~-~~~ey~~y~~~~~~-~~~-~~~~--~~--~~~~~~I~~~aI~y~hSGL 241 (520) +.++|+.+-.+.+=.+.. .-+..+. + ......+|.+..-. +.. ++.. .. .-..+-+| .|.|+|.-- T Consensus 139 ~~~sp~~~~~~~D~~~~~~~~al~~~~~~~~g~~~~~~ly~~~~~~~~~~~~~~~~w~~~~~~~~~gvP--vV~~~n~~~ 216 (474) T protein:vir:81 139 HVKDASEATGEWNRRRRGLNNLLSIIDKDKEGKVLSLALYLDNETVTAQRDKATLKWQVDRDEHVYGVP--AQVLPYKPA 216 (474) T ss_pred EEeccceEEEEEeCCCCcceeeeEEEEEcCCCcEEEEEEEeCCcEEEEEEcCccceeeeccCCCCCCcc--eEEeccccc Confidence 999999887765421111 1111111 1 11223344333211 110 0000 00 00111133 678887633 Q ss_pred c-cCCCCcchhhhHHHHHHHHH-HHHHHHHHHHHHHhcCccceEEEccCCCC------chHHHHHHHHHHHHhhcceeEe Q lcl|NC_018087. 242 V-DCCGKNIIGYLHRAVKPANQ-LKLLEDAMMIYRITRAPDRRVFYIDTGNM------PARKAAQHMQHIMNSHRNRISY 313 (520) Q Consensus 242 ~-d~~~~~~~syL~~aik~~Nq-L~m~EDalVIyRi~RApeRRvFyIDvGnl------pk~KAeqyl~~im~~~knklvY 313 (520) + ++.|..-+ -+..+...+- -+.+.+.++.=...=.|+|-|+=.+-... |..+-.+++-.|+ T Consensus 217 ~~~~~G~s~i--~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~~~~~~d~d~~~~~~~~~~~~~i~--------- 285 (474) T protein:vir:81 217 PKRPFGQSRI--TKPMMGLQDAGVRELARREGHMDVFSYPEFWLLGADESALKNADGTIKSVWEARLGRIK--------- 285 (474) T ss_pred ccCcCCcccc--chhHHHHHHHHHHHHHHHHHHHHHhcchhheeecCChhhcccccccccchhhhhHHHHh--------- Confidence 2 22332211 1122222121 25678888888899899988864332111 1111112222111 Q ss_pred ecCCCccccccccchhhhhhcccccC-CC----CCcceeecCCCCCcChH-HHHHHHHHHHHHhcCCChhhccCCCcccc Q lcl|NC_018087. 314 DARTGKVKNQANMMALTEDYWLQRRD-GK----AVTEVETLPGMTGMNEM-DDILYFRKALYMALRVPLSRIPDEQTQNV 387 (520) Q Consensus 314 d~~TGev~d~~~~msmlEDywLpRRe-Gg----rgTEIsTLpGg~nLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~ 387 (520) .+|.-+ |. .+++|..+|+.+ |.-. +-++=.-..+-...++|.+-|.-.+..|. T Consensus 286 --------------------~~~~d~d~~~~~~~~~~~~q~~~a~-l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np 344 (474) T protein:vir:81 286 --------------------GLPDDADADIPQLARADVKQFPAAS-PDAHWSDINGLAKLFAREASLPDTAVAISGLSNP 344 (474) T ss_pred --------------------cCCCcccccccccccccccccCCCC-hhHHHHHHHHHHHHHHhhhCCCHHHhcccccccc Confidence 133322 21 235677777743 4432 22443444555567999877652111111 Q ss_pred ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHH Q lcl|NC_018087. 388 FDMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNV 467 (520) Q Consensus 388 ~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~ 467 (520) ..+..|.-.|....+-+.+.|+.|..=+..+++.-+.+.|-....+|..--..+.+.|..-..=+ +.++.+. T Consensus 345 -~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~~s-------~a~~aDa 416 (474) T protein:vir:81 345 -TSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVAIDEIPDEWKSIDAKWRDPRYLS-------KSAQADA 416 (474) T ss_pred -cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccchhhccceeEecCCCccC-------HHHHHHH Confidence 23446777777888889999999999999999999999887766665554456777775333222 2455666 Q ss_pred HHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCcc--------CCccc Q lcl|NC_018087. 468 LSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFN--------PPEPE 518 (520) Q Consensus 468 ~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~--------~p~~e 518 (520) +..+..- |.-+....+..++|++|+.||+.....++++.-...+. .+.++ T Consensus 417 ~~Kl~~a-~~~~~~~~~~~~~lg~t~~~i~~~~~~~~~~~~~~~~~~l~~~~~~~~~aq 474 (474) T protein:vir:81 417 GMKQLAA-VPWLAETEVGLELIGLTPQQARRAMADKRRVQGRGTLQALIDRSNNGATAQ 474 (474) T ss_pred HHHHHhc-ccCCCcHHHHHhhcCCCHHHHHHHHHHHHHHhHHHHHHHHHhcCCCCCCCC Confidence 6555432 33344445555789999999988666555554333222 12222 No 167 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=92.39 E-value=0.012 Score=31.19 Aligned_cols=242 Identities=13% Similarity=0.125 Sum_probs=121.9 Q ss_pred hhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchh Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVD 88 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd 88 (520) ++||. +. ++ .+..+|. .+....+.. .+++.+.. ...+. -..-+++|-|. T Consensus 1 MglF~---~~-~~----------r~~~~~~-~~~~~~~~~-----~~~~~~~~---~~~v~--------~~~al~~~~v~ 49 (251) T protein:vir:46 1 MGIFY---KN-EK----------RDLQYNE-DDLQMMVQT-----LPSFQGTK---LRQYK--------DIEAIRHSDIF 49 (251) T ss_pred CCccc---cc-cc----------cccCCCc-cchhhhhhh-----hccccCcC---cceec--------hhhhhccHHHH Confidence 34442 21 11 1122221 111111110 01111100 00111 12235788899 Q ss_pred HHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHh----cchhhhHHHHH----hhccccceeEEEe Q lcl|NC_018087. 89 NAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNML----NFQRKGSDHFK----RWYVDSRVFFHKI 160 (520) Q Consensus 89 ~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll----~f~k~g~~~fR----rWYvDgri~~hkv 160 (520) .||+-|.+.+.-.+ +.+.=+..... . ..++.+| +-.-++.++.+ .+.+.|.-|.-++ T Consensus 50 ~~i~~ia~~iA~lp-----~~~~~~~~~~~-------~---~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~ 114 (251) T protein:vir:46 50 TAVMMIASDLARMP-----IRVTVNGQINY-------S---DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEIT 114 (251) T ss_pred HHHHHHHHhHhhCc-----eEEeeCccccc-------c---chHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEE Confidence 99999988775432 22211111000 0 1223333 34445555544 4567799999888 Q ss_pred eecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecc Q lcl|NC_018087. 161 INPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSG 240 (520) Q Consensus 161 id~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSG 240 (520) -|.+ .-+++|.+|+|..+..++.- ++.- .|+++... +. ..+....++.+.|+|.. + T Consensus 115 r~~~---G~~~~L~~i~~~~v~v~~~~-----~g~~------~~~~~~~~---~~------~~g~~~~~~~~diiH~r-~ 170 (251) T protein:vir:46 115 RDKT---GEPMNLTFRKTSEIELKSDA-----RGRL------YYFHQRID---SN------GNNIERNVKFEDMLDIK-F 170 (251) T ss_pred ECCC---CcEEEEEEECCceEEEEECC-----CCcE------EEEEEEec---cC------CcceeEEECCccEEEec-C Confidence 6632 24999999999999886432 2211 12221111 00 01223688999998886 3 Q ss_pred cccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHH-HHHHHHHHHhhcceeEeecCCCc Q lcl|NC_018087. 241 LVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKA-AQHMQHIMNSHRNRISYDARTGK 319 (520) Q Consensus 241 L~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KA-eqyl~~im~~~knklvYd~~TGe 319 (520) ...+|...+|-|+.|..++......++...-+----+--+-+..++ |.|...+| ++..+.....|.-- ...|. T Consensus 171 -~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~~e~~~~~~~~~~~~~~g~----~n~g~ 244 (251) T protein:vir:46 171 -YSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFPKVLVEL----NKLGK 244 (251) T ss_pred -cCCCCeeecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeC-CCCCCHHHHHHHHHHHHHHhcCc----ccccc Confidence 4677777789999999999888888876654433334456777777 56644444 44444444445420 12344 Q ss_pred cccccccchhhh Q lcl|NC_018087. 320 VKNQANMMALTE 331 (520) Q Consensus 320 v~d~~~~msmlE 331 (520) |-. .|-| T Consensus 245 ~~~-----gm~~ 251 (251) T protein:vir:46 245 LSY-----SMNQ 251 (251) T ss_pred ccc-----ccCC Confidence 331 1222 No 168 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=91.84 E-value=0.014 Score=30.75 Aligned_cols=418 Identities=11% Similarity=0.115 Sum_probs=168.6 Q ss_pred Ccc----ccccchhhhcchhhhhhhHHHhhhccCCCcccC----CCCCC-CceeecccccccccccccccccccccchhH Q lcl|NC_018087. 1 MSM----LADSDLKMFAFWHKVDDTEYDKIINDKAESITA----PKFDD-GATEVDSQDIAYNGVFQKLYGSQDPTATST 71 (520) Q Consensus 1 ~~~----~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~----p~~~d-g~~~i~~~~~a~~g~~~~~~~~~~~~~~~~ 71 (520) +.- -|.-+|.|-.-+- . .. +.-+|.. .-.-| +.+..+.-.+..+.+|.+ T Consensus 55 ~~~~~~~~~~~~~~~~~~~~-~-------~~--~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~G------------ 112 (698) T protein:vir:10 55 LDAAPVAEPSPSLRLARQFE-V-------DV--SNYTPRERRAASYALDFNGTSMDALSFVTSSGFPG------------ 112 (698) T ss_pred cccccccCCCccccccccce-e-------cc--ccCCccccchhhhhhcccccccccchhhhccCcch------------ Confidence 111 1112333322111 0 00 0111100 00001 001111001111112222 Q ss_pred HHHHHHHHHHhhccchhHHHHhhhceeeEe------cCCCcE----EEEeeccchhhh-HHHHHHHHHHHHHHHHhcchh Q lcl|NC_018087. 72 RELINTYRSLLNNYEVDNAVQEIVSDAIVY------EEGFDV----VSIDLDQTAFTE-NIRNLISDEFNSVLNMLNFQR 140 (520) Q Consensus 72 ~~LI~~YR~ma~~pEvd~Ai~eIvneaiv~------d~~~~~----V~l~Ld~~~~s~-~ik~~I~eeF~~i~~ll~f~k 140 (520) +-.--.|||+||.+.+++.|..||+-. ....+. +++.-+...-++ .-.++|..|++.+ +... T Consensus 113 ---y~~la~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl----~V~~ 185 (698) T protein:vir:10 113 ---FPTLVLLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERL----RIRD 185 (698) T ss_pred ---HHHHHHHhhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHH----HHHH Confidence 223356899999999999999999543 222211 222222222222 2334666666543 2233 Q ss_pred hhHHHHHhhccccc--eeEEEeeec------------CCCCCCeeeeEecCccceeeeeeccCCCCcccccccceeccee Q lcl|NC_018087. 141 KGSDHFKRWYVDSR--VFFHKIINP------------NRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFL 206 (520) Q Consensus 141 ~g~~~fRrWYvDgr--i~~hkvid~------------~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~ 206 (520) +.-+.++-=-+-|. +|+.+-=|. .-+|.+++.|+.|||..+.+- ....++-+.+ .. T Consensus 186 ~l~eai~~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGL~ViDp~~vtP~---------~~n~~dP~sp-df 255 (698) T protein:vir:10 186 AVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPN---------NYNSINPVAD-DF 255 (698) T ss_pred HHHHHHHhcccccceEEEEEeecCccccccccccccccccCccceeeeeecccccccc---------hhhhccchhh-cc Confidence 33343322222222 233332221 123446777888888666651 1111111111 11 Q ss_pred ecCcccccccccceecCCcceecCcccEEEeecccccCC------CCcchhhhHHHHHHHHH-HHHHHHHHH-HHHHhcC Q lcl|NC_018087. 207 YDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDCC------GKNIIGYLHRAVKPANQ-LKLLEDAMM-IYRITRA 278 (520) Q Consensus 207 y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~~------~~~~~syL~~aik~~Nq-L~m~EDalV-IyRi~RA 278 (520) |.|+ .| ...++ +||.+=++... |--.|+ ++..+|.+.++..-..+ +++...+.- +...... T Consensus 256 gkP~--~y-------~V~G~-~IH~SRL~~~v-g~pvpd~LKp~y~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~~~ 324 (698) T protein:vir:10 256 YKPS--TW-------WMIGS-EVHATRLHTIV-SRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVS 324 (698) T ss_pred CCCc--eE-------EEecc-eecceeEEEec-CCCchhhhcchhccCCccHHHHHHHHHHHHHHHhhhHHHHHHHhhHH Confidence 1111 11 11111 44554443322 111111 33345655555544333 333332221 1111111 Q ss_pred ccceEEEccCCC-CchHHHHHHHH--HHHHhhcceeEeecCCCccccccccch-hhhhhcccccCCCCCcceeecCCCCC Q lcl|NC_018087. 279 PDRRVFYIDTGN-MPARKAAQHMQ--HIMNSHRNRISYDARTGKVKNQANMMA-LTEDYWLQRRDGKAVTEVETLPGMTG 354 (520) Q Consensus 279 peRRvFyIDvGn-lpk~KAeqyl~--~im~~~knklvYd~~TGev~d~~~~ms-mlEDywLpRReGgrgTEIsTLpGg~n 354 (520) . +..|... |.....++... +++++||.-. |-+ .+. -.|||- .++ .+ T Consensus 325 ~----l~~dla~aL~~g~~~~l~~R~eli~~~Rsn~------G~~-----llDk~~Eefe----------q~s-----t~ 374 (698) T protein:vir:10 325 G----ILMDLAQALTPGANVDLSMRAELINRYRDNR------NIL-----FLDKATEEFF----------QFN-----TP 374 (698) T ss_pred H----HHHHHHHhcCChhhHHHHHHHHHHHHhcCcc------ceE-----EEecCCcceE----------EEe-----cC Confidence 0 0111110 00011112221 4455665310 110 000 013333 222 47 Q ss_pred cChHHHHH-HHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHhcC Q lcl|NC_018087. 355 MNEMDDIL-YFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKS-----NLLLKR 428 (520) Q Consensus 355 Lgei~DV~-YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~-----QLiLkg 428 (520) |+-++||. =|..-+=-+.+||+.||=..+.-++ ..++| -|.-.|...|..+|. ..+..+|++ |+-+-| T Consensus 375 lSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGl-NATGE--~D~rnYYD~I~s~Qe---~~L~p~L~rl~~ii~rS~~G 448 (698) T protein:vir:10 375 LSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGL-NASSE--GEIRVWYDYVRAYQR---NALQQLMNDVIVMIQLSLFG 448 (698) T ss_pred cCCHHHHHHHHHHHHHhhhcCchhhhhccCCccc-Cccch--hhHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHhcC Confidence 88888874 5888888899999999865554333 12221 244558888887775 333344433 333444 Q ss_pred CCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhh Q lcl|NC_018087. 429 VITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELS 508 (520) Q Consensus 429 i~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~ 508 (520) -+. ..|.|+|+.=..-+|..-+||...+.+.-+..-.- --++.+-|+... .+|.+=-= --+..+.. T Consensus 449 ~id--------p~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~--gvI~~~evr~rL--~~d~~s~Y--~~~~d~~d 514 (698) T protein:vir:10 449 AVD--------PSIKWQWNALRELDDLEVAEARYKQAQSDVLYVQE--QVIRPDQVAARL--NTEPDGPY--AGKLDAND 514 (698) T ss_pred CCC--------CcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHh--cCCCHHHHHHHH--hccCCCcc--ccccCCcc Confidence 443 45899999888888988999988888864443111 012333333221 11100000 00000111 Q ss_pred cCCccCCccccC Q lcl|NC_018087. 509 DKIFNPPEPEEI 520 (520) Q Consensus 509 ~~~~~~p~~e~~ 520 (520) +|- .|++.+| T Consensus 515 ~p~--~~~~~~~ 524 (698) T protein:vir:10 515 DPG--APADDDI 524 (698) T ss_pred cCC--CCCCCcc Confidence 111 1222233 No 169 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=91.63 E-value=0.015 Score=30.59 Aligned_cols=435 Identities=9% Similarity=-0.021 Sum_probs=175.0 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRS 80 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ 80 (520) |+|+- -.++.++=-.+..-.+..+.. +...+...-...+++..--. ...+|+-.... .-.+..- .=+. T Consensus 1 M~~~~-~l~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~--~~~~g~~~~~~-~~~g~~v-------~~~~ 68 (466) T protein:vir:81 1 MRLID-RLLSTRGAAPRMSIDDYAQML-NEFAFNGIGYGFGGGVPRIQ--QTLAGPSTELA-PDTFVGL-------ATQA 68 (466) T ss_pred CchhH-HHhhccCcccccchhhhhhhh-hhhhccccccccccccHHHH--Hhhcccccccc-Ccccccc-------chhh Confidence 55542 222222110000000000000 00000000011111100000 00011111100 0011000 1123 Q ss_pred HhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHH-HHHHHHHhcchhhhHHHHHh----hccccce Q lcl|NC_018087. 81 LLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDE-FNSVLNMLNFQRKGSDHFKR----WYVDSRV 155 (520) Q Consensus 81 ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~ee-F~~i~~ll~f~k~g~~~fRr----WYvDgri 155 (520) .+++|-|..||+-|.+.+... |+.|.=+.. .-+..+.+. ...+++-=|-..+++++.+. +.+.|-- T Consensus 69 a~~~~~v~~~i~~Ia~~ia~l-----p~~~~~~~~----~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna 139 (466) T protein:vir:81 69 YQANGPVFACMLVRQLVFSSV-----RFRWQRLRD----GKPSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNS 139 (466) T ss_pred hhccHHHHHHHHHHHHhhccC-----ceEEEEecC----CceeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCe Confidence 466899999999999886543 222211110 001111110 11122222334456665444 4567888 Q ss_pred eEEEeeecCC-----CCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecC Q lcl|NC_018087. 156 FFHKIINPNR-----PKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIP 230 (520) Q Consensus 156 ~~hkvid~~~-----~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~ 230 (520) |..++-+..- ...-+.+|.+|+|..+...+.. ++... -+|.|... +.. ..+....++ T Consensus 140 y~~i~r~~~g~l~~~~~g~~~~l~~l~~~~v~~~~~~-----~~~~~-----~~y~~~~~-------~~~-~~~~~~~~~ 201 (466) T protein:vir:81 140 YWTIVDGEFVRMRPDWVDVVVEERMVRGGRGELGGGQ-----LGWRK-----VGYLYTEG-------GRQ-SGNESVGFL 201 (466) T ss_pred EEEEEecCccccccccCcceeEEEEecCcceEEEEcC-----CCceE-----EEEEEEec-------Ccc-cccceeeec Confidence 8888755221 1123789999999988885432 22110 01222111 111 112345789 Q ss_pred cccEEEeeccccc-CCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcc Q lcl|NC_018087. 231 YSAMVYAHSGLVD-CCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRN 309 (520) Q Consensus 231 ~~aI~y~hSGL~d-~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~kn 309 (520) .+.|+|.. ++.+ .++...+|.+..|++++.....+++...=+=---+--.-|+..| +.|.+..+++..+.+...|+- T Consensus 202 ~~dviHir-~~~~~~d~~~G~s~i~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g 279 (466) T protein:vir:81 202 AEDVVHFA-PIPDPLASYRGMSWLTPILREIRADQAMSKHQAKFFDNGATVNLVIKHN-PMADPAAVKKWADEVNSKHAG 279 (466) T ss_pred cccEEEEc-CCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecC-CCCCHHHHHHHHHHHHHHhcC Confidence 99998874 3433 34556789999999998888888766543333334445566666 457766666555555555542 Q ss_pred eeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccc Q lcl|NC_018087. 310 RISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVF 388 (520) Q Consensus 310 klvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~ 388 (520) . ...|. .+ .++ + |.+++.|.-...-.| ++-.++..+.+.++.+||...|.-..+. T Consensus 280 --~--~n~g~------~~-vl~-------~---g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lG~~~~~--- 335 (466) T protein:vir:81 280 --V--DNAWK------NL-NLY-------P---GADADVVGSNLQEIDFKNVRGGGETRIAAAAGVPPVIVGLSEGL--- 335 (466) T ss_pred --c--ccccc------ce-EcC-------C---CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccccCC--- Confidence 0 11122 11 221 2 456666643222222 3445688999999999999988532111 Q ss_pred cccc--hhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHH--HHHHHHHHHH Q lcl|NC_018087. 389 DMST--AISRDELSFDKFI-SELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFS--EMKTIEITER 463 (520) Q Consensus 389 G~~~--eItRDElkF~KFI-~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~--ElKe~Ei~~~ 463 (520) ++++ .+.-.-+.|.+++ .-+-.++..-+. ..| .+..+ ...+.++|..+..-. ....+|+... T Consensus 336 ~~st~sn~eq~~~~f~~~tl~P~~~~ie~~l~----~~L-----~~~~~----~~~~~~~f~~~~llr~d~~~r~~~~~~ 402 (466) T protein:vir:81 336 AAATYSNYGQARRRLADGTAHPLWQNLSGCIG----HVM-----PDMGP----DVRLWYDADDVPFLREDEKDAADIQKV 402 (466) T ss_pred CccccccHHHHHHHHHHHHHHHHHHHHHHHHH----hhc-----CCccc----CcceEEEecchhhhccCHHHHHHHHHH Confidence 1221 1222223354443 444444444332 222 23222 122455665443221 1112222222 Q ss_pred HHHHHHHhhcccchhhhHHHHHHH-------HhCCCHHHHHHHHHHHHHhhhcCCccCCccc----cC Q lcl|NC_018087. 464 RVNVLSLMEPYIGKYISNHTAMKD-------FLQMSDEDIAAERKLIDEELSDKIFNPPEPE----EI 520 (520) Q Consensus 464 R~~~~~~~~p~vgky~S~~~i~k~-------IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e----~~ 520 (520) |.+.+..+-.- | ++.+-++.- .+..++....+ .+.....++ ...+++. +- T Consensus 403 ~~~~~~~~~~~-g--~t~nE~r~~~~~gd~~~~~~~~~~~~~---~~~~~~~~~-~~~~~~~~~Gg~~ 463 (466) T protein:vir:81 403 RAETINTLITA-G--YEPESVVAAVNSGDLRLLKHTGLTSVQ---LLPPGVSAS-ASSDTPTSGGADD 463 (466) T ss_pred HHHHHHHHHHc-C--CChhhccccccCCccccccCCCcchhh---hcccccccc-cCCCCcccCCCCc Confidence 33322222111 0 122222210 00011100000 000000000 0000000 00 No 170 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=91.55 E-value=0.015 Score=30.53 Aligned_cols=401 Identities=12% Similarity=0.125 Sum_probs=182.7 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRS 80 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ 80 (520) |-...|. |+|.+-..-- ...+..++..+.. -| .+. ..+-..+.. ++. +. T Consensus 8 ~~~~~~~-----g~~~~~~~~~----~~~~~~~~~~~~~-~~--~~~-----~~~~~~~~~------v~~--------~~ 56 (424) T protein:vir:18 8 IDLRTNN-----GWWARLQSWF----VGGRLVTPNQGSQ-TG--PVS-----AHGHLGDSS------IND--------ER 56 (424) T ss_pred EeecCCC-----chHHHHHhhh----ccccccccccccc-cc--ccc-----ccccccccc------ccH--------HH Confidence 5555541 3343322211 0001111111111 01 010 001000101 101 34 Q ss_pred HhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHH--HHHHHHHH-HhcchhhhHHHHHh----hcccc Q lcl|NC_018087. 81 LLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLIS--DEFNSVLN-MLNFQRKGSDHFKR----WYVDS 153 (520) Q Consensus 81 ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~--eeF~~i~~-ll~f~k~g~~~fRr----WYvDg 153 (520) .+++|-|..||.-|.+.+.-++ +.|-=.. .+..+.++. .....+|+ --|-..++.++.+. +...| T Consensus 57 al~~~~v~~cv~~Ia~~iA~lp-----~~~~~~~---~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~G 128 (424) T protein:vir:18 57 ILQISTVWRCVSLISTLTACLP-----LDVFETD---QNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYG 128 (424) T ss_pred hhccHHHHHHHHHHHHhhccCc-----eEEEEee---cCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcC Confidence 4678889999999988875432 2111000 000011110 11112222 12334556665444 55668 Q ss_pred ceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCccc Q lcl|NC_018087. 154 RVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSA 233 (520) Q Consensus 154 ri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~a 233 (520) .-|.-++-|. + .-+++|.+|+|..+...+. +... +|.|.. .+..++++.+. T Consensus 129 nay~~i~r~~-~--G~~~~L~pl~~~~V~v~~~-------~~~~------~y~~~~-------------~g~~~~~~~~e 179 (424) T protein:vir:18 129 NAYALVDRNS-A--GDVISLLPLQSANMDVKLV-------GKKV------VYRYQR-------------DSEYADFSQKE 179 (424) T ss_pred CeEEEEEECC-C--CcEEEEEEecCcceEEEEc-------CCeE------EEEEEe-------------CCeEEEecccc Confidence 8888776442 2 2389999999998875321 1111 222211 12346889999 Q ss_pred EEEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEe Q lcl|NC_018087. 234 MVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISY 313 (520) Q Consensus 234 I~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvY 313 (520) |+|.. + ...++...+|-++.|++++.....+++...=+----+--+-+...+-+.+.+..+++ +++.++++.. T Consensus 180 Iih~r-~-~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~-~~~~~~~~~~---- 252 (424) T protein:vir:18 180 IFHLK-G-FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQ-VEENFKEIAG---- 252 (424) T ss_pred EEEec-C-cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCCHHHHHH-HHHHHHHHhC---- Confidence 98885 2 456777778999999999988777777654433333333556667666666555443 3333333221 Q ss_pred ecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccc Q lcl|NC_018087. 314 DARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMST 392 (520) Q Consensus 314 d~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~ 392 (520) |+ +..+.+ .++ .|++++.|.=...-.| ++-.+|..+.+.++.+||.+.|....+.+..| + T Consensus 253 ----g~--nag~~~-vl~----------~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~--s 313 (424) T protein:vir:18 253 ----GP--VKKRLW-ILE----------AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWG--S 313 (424) T ss_pred ----Cc--ccCCce-ecc----------CCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccc--c Confidence 21 111111 221 1566666642222222 34446778889999999999986432211111 2 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018087. 393 AISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLME 472 (520) Q Consensus 393 eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~ 472 (520) .+.-..+.|.++ .|+--... +.+.|- +.++++.++. ...+.|.-+. +... -+..|.+.+..+- T Consensus 314 n~eq~~~~f~~~--tl~P~~~~-ie~~l~-----~~L~~~~~~~----~~~~~fd~~~----llr~-d~~~r~~~~~~~~ 376 (424) T protein:vir:18 314 GIEQQNLGFLQY--TLQPYISR-WENSIQ-----RWLIPAKDVG----RIHAEHNLDG----LLRG-DSASRAAFMKAMG 376 (424) T ss_pred cHHHHHHHHHHH--HHHHHHHH-HHHHHH-----hhcCCccccC----CeEEEEechh----hhcc-CHHHHHHHHHHHH Confidence 233334456544 23222221 222222 2345555543 2344454332 2111 2355666666652 Q ss_pred cccchhhhHHHHHHHHhCCCHHHHHHHHH--------HHHHhhhcCCccCCccc Q lcl|NC_018087. 473 PYIGKYISNHTAMKDFLQMSDEDIAAERK--------LIDEELSDKIFNPPEPE 518 (520) Q Consensus 473 p~vgky~S~~~i~k~IL~~tDeeI~~~~k--------qi~~E~~~~~~~~p~~e 518 (520) .- -+++.+-++. .+.|.+-+ .-++ -+..-.+++ -|..+.. T Consensus 377 ~~--G~~T~NE~R~-~~gl~pi~--gGD~~~~~~n~~~l~~~~~~~-~p~~~ga 424 (424) T protein:vir:18 377 EA--GLRTINEMRR-TDNLPPLP--GGDVAMRQSQYVPITDLGTNK-EPRNNGA 424 (424) T ss_pred hC--CCcCHHHHHH-HhCCCCCC--CcCeeeeccCccchHhhhccC-CCccCCC Confidence 22 3567777774 46665421 1111 001100111 0111111 No 171 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=90.69 E-value=0.02 Score=29.95 Aligned_cols=429 Identities=10% Similarity=0.016 Sum_probs=171.3 Q ss_pred chhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHH----HH--------H Q lcl|NC_018087. 8 DLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTR----EL--------I 75 (520) Q Consensus 8 ~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~----~L--------I 75 (520) +||.=-|-.+.+=.. ...+.=|..+.. .| .|.+++....... .+ . T Consensus 1 ~~~~~~~~~~~~~~~--------~~~~~~~~~~~~-------------~~--~~~~~e~~~~~~~~~i~~~i~~~~~~~~ 57 (512) T protein:vir:97 1 MLKANEFETDTDLRE--------NRNYLFNDEANV-------------VY--TYDGTESDLLQNINEVSKYIEHHMDYQR 57 (512) T ss_pred CccceeccCceeeee--------Cceeeecccccc-------------cc--ccCchhhhhhhhHHHHHHHHHHHHHhhH Confidence 222222222111110 001111111111 11 1222222222111 11 2 Q ss_pred HHHHHHhhccchhHHH----------------------HhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHH Q lcl|NC_018087. 76 NTYRSLLNNYEVDNAV----------------------QEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVL 133 (520) Q Consensus 76 ~~YR~ma~~pEvd~Ai----------------------~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~ 133 (520) .+|+.+..+++=...| .-||+..+-+= -..||++..++.+ ..+.++.++ T Consensus 58 ~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl-~g~p~~~~~~d~~--------~~~~l~~~~ 128 (512) T protein:vir:97 58 PRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYF-LGNPIQCQDDDKD--------VLEAIEAFN 128 (512) T ss_pred HHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhh-cccCceeccCChH--------HHHHHHHHH Confidence 2355554444333221 11111111111 1255555554432 334455666 Q ss_pred HHhcchhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCC--CCccccccc---------c-e Q lcl|NC_018087. 134 NMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTK--MENGVKVVK---------G-Y 201 (520) Q Consensus 134 ~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~--~~~~~~~~~---------~-~ 201 (520) +--+|+....++.+...+-|+-|.+.-+|. +|-..+..+||+.+..+.+-... ..-+++.+. . + T Consensus 129 ~~n~~~~~~~~~~~~~~i~G~ay~~vy~de----d~~~~i~~~~p~~~~~iyd~~~~~~~~~~vr~~~~~~~~~~~~~~~ 204 (512) T protein:vir:97 129 DLNDVESHNRSLGLDLSIYGKAYELMIRNQ----DDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEV 204 (512) T ss_pred hhcCHHHHHHHHHHHHHhcCeEEEEEEeCC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceE Confidence 667889999999999999999999877663 35678999999999988553221 112222110 0 1 Q ss_pred ecceeecCccc-ccccccceecCCcceec-----Ccc--cEEEeecccccCCCCcchhhhHHHHHHHHHHHH-HHHHHHH Q lcl|NC_018087. 202 REYFLYDTELE-SYQCGHQHFAAGTKIKI-----PYS--AMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKL-LEDAMMI 272 (520) Q Consensus 202 ~ey~~y~~~~~-~~~~~~~~~~~~~~~~I-----~~~--aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m-~EDalVI 272 (520) ...-+|++... .|..++......+.... +-. -|++. +++....|-++.++.....+.. +=+.... T Consensus 205 ~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~------~nn~~~~gd~e~v~~liDa~d~~~S~~~~~ 278 (512) T protein:vir:97 205 FTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEF------SNNERRKGDYEKVITLIDLYDNAESDTANY 278 (512) T ss_pred EEEEEEeCCcEEEEEecCCCcccccccccccccccCcccceEee------cCCCCCCCchhhhHHHHHHHHHHHHHHHHH Confidence 11125544321 11111100000000000 000 01211 1222345666665555544443 2222223 Q ss_pred HHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhh----hcccccCCCCCcceee Q lcl|NC_018087. 273 YRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTED----YWLQRRDGKAVTEVET 348 (520) Q Consensus 273 yRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlED----ywLpRReGgrgTEIsT 348 (520) -+-++.|-+-+.-....+ ..+. .+.+ ...+..+++ ...+.-+++.|..+.. T Consensus 279 ~~~~~~~~lv~~G~~~~~-----~~~~-----~~~~---------------~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 333 (512) T protein:vir:97 279 MSDLNDAMLLIKGNLNLD-----PVEV-----RKQK---------------EANVLFLEPTVYENRDTGIETEGSVDGGY 333 (512) T ss_pred HHHhcCceeeeecCccCC-----chhh-----hhhh---------------hcccccccccchhhcccccCCCCCcceEE Confidence 344444444332111111 0000 0000 000101111 1111112222333444 Q ss_pred cCCCCCcCh-HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 349 LPGMTGMNE-MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAIS--RDELSFDKFISELQHKFEEIFLSPLKSNLL 425 (520) Q Consensus 349 LpGg~nLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eIt--RDElkF~KFI~rLr~rFs~if~d~Lk~QLi 425 (520) |-...+... -.-+.-+.+.+|+-.++|---.+. +. |.++..+ .-...-..-+.+.++.|..-+...++.=+- T Consensus 334 l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~---~~--gn~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~ 408 (512) T protein:vir:97 334 IYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDN---FS--GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLET 408 (512) T ss_pred EeecCCHHHHHHHHHHHHHHHHHHhCCcccCccc---cc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 443333332 233556677788888888632221 11 2222222 222223333455555555555554443222 Q ss_pred hcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCH---HHHHHHHHH Q lcl|NC_018087. 426 LKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSD---EDIAAERKL 502 (520) Q Consensus 426 Lkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tD---eeI~~~~kq 502 (520) +-++...-++..-...+++.|...---.+ .+.++++..+. | .+|.+++++. |...+ +||+.+.++ T Consensus 409 ~~~~~~~~~~~~d~~~i~~~f~~~~p~~~-------~e~~~~~~kl~---g-iiS~et~~~~-l~~v~d~~~E~eri~~E 476 (512) T protein:vir:97 409 ILKNTRSIDANKDFNTVRYVYNRNLPKSL-------IEELKAYIDSG---G-KISQTTLMSL-FSFFQDPELEVKKIEED 476 (512) T ss_pred HHHhcCCcccccccccceEEeCCCCCcCH-------HHHHHHHHHHh---c-cCchHHHHHh-CCCCCCHHHHHHHHHHH Confidence 21222222222222357788865333333 23344555553 3 3799999977 56543 555555555 Q ss_pred HHHhhhcC---CccCCccccC Q lcl|NC_018087. 503 IDEELSDK---IFNPPEPEEI 520 (520) Q Consensus 503 i~~E~~~~---~~~~p~~e~~ 520 (520) -+++.+.. ...+|...+- T Consensus 477 ~~~~~~~~~~~~~~~~~~~~~ 497 (512) T protein:vir:97 477 EKESIKKAQKGIYKDPRDIND 497 (512) T ss_pred HHHHHHHHhhcccCCCCCCCC Confidence 44433322 2222222222 No 172 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=90.36 E-value=0.021 Score=29.75 Aligned_cols=407 Identities=10% Similarity=0.043 Sum_probs=173.9 Q ss_pred hcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHH-HhhccchhHH Q lcl|NC_018087. 12 FAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRS-LLNNYEVDNA 90 (520) Q Consensus 12 f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~-ma~~pEvd~A 90 (520) +||+.+-.. ..+..+ +|+.. . + .|.....+.+....+ .... +.++|-|..| T Consensus 1 Mg~~~~~~~-------~~~~~~-~~~~~-~----~-------~~~~~~~~~~~~~~~--------~~~~~~~~~~~v~~~ 52 (423) T protein:vir:81 1 MGFLQKLGL-------APSVVA-TPEPI-E----L-------VGPIFESLKLSTKNM--------TVEQIWEDQPHLRTV 52 (423) T ss_pred CchhHhhcc-------cccccc-Ccccc-c----c-------ccccccccccccchh--------hHHHHHHhhhHHHHH Confidence 333332211 111111 11111 0 0 011111010110011 1122 2578999999 Q ss_pred HHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhc---chhhhHHHH----HhhccccceeEEEeeec Q lcl|NC_018087. 91 VQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLN---FQRKGSDHF----KRWYVDSRVFFHKIINP 163 (520) Q Consensus 91 i~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~---f~k~g~~~f----RrWYvDgri~~hkvid~ 163 (520) |+-|.+.+--.+ +.|-=.. .+..++.+.+ ..+..+|. =..++.++. ..+.+.|--|..+.-|. T Consensus 53 i~~ia~~ia~lp-----~~~~~~~---~dg~~~~~~~--~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~ 122 (423) T protein:vir:81 53 TTFIARNVASLQ-----LQAFERV---EDGGRERVRE--GHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDL 122 (423) T ss_pred HHHHHHhHhhCc-----eEEEEEe---cCCceeeecc--chHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC Confidence 999999776432 1110000 0011112211 11223332 112344443 44668898887766553 Q ss_pred CCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeeccccc Q lcl|NC_018087. 164 NRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVD 243 (520) Q Consensus 164 ~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d 243 (520) .....+.+|+++.|..++.-+ . .++. +-..|.++.... ..+..+.+|.+.|+|.. ... T Consensus 123 -~~~~~~~~l~p~~~~~v~~~~-~----~~~~----~~~~Y~~~~~~~----------~~g~~~~~~~~evih~r--~~~ 180 (423) T protein:vir:81 123 -GVDTPTLDIRPIPVSWVQRRA-Y----KDGW----GSLDYIIIESGD----------NDGRSVKVPGERVIHRH--GYN 180 (423) T ss_pred -CcCcceEEEeecccceeeeee-c----cCCC----cceEEEEEEecC----------CCceEEEEcccceEEec--CCC Confidence 223356677777665554421 1 1111 111121211110 01223688999998876 344 Q ss_pred CCCC-cchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeE-eecCCCccc Q lcl|NC_018087. 244 CCGK-NIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRIS-YDARTGKVK 321 (520) Q Consensus 244 ~~~~-~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklv-Yd~~TGev~ 321 (520) +++. ..+|-+..|..++.....+++...=+=---+.-+-|+..|-...|.+-.++-.+.+..+++.... --..+|.+ T Consensus 181 ~~~~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~~~~~~~~~~~~~~~n~g~~- 259 (423) T protein:vir:81 181 PKTMKRGKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAESRTRFMANLRASFSPKSSDVGGT- 259 (423) T ss_pred CCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHHHHHHHHHHHHHHhccccccCCcc- Confidence 5554 46788999998888777777764443222345666777776544332222223333444443221 11223432 Q ss_pred cccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHH---HHHHHHHHhcCCChhhccCCCccccccccchhhHHH Q lcl|NC_018087. 322 NQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDIL---YFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDE 398 (520) Q Consensus 322 d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~---YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDE 398 (520) + .++ .|.+++.|. .+.-+++-++ +-.....++.+||..-|..-++ +. .+.+.-.- T Consensus 260 -----~-vl~----------~g~~~~~l~--~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~-~t---~sn~e~~~ 317 (423) T protein:vir:81 260 -----L-LLE----------DGMKAENFH--TTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDN-AN---YSNVREFR 317 (423) T ss_pred -----e-ecC----------CCceEEecc--CChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCC-CC---cccHHHHH Confidence 2 222 245666663 3344444433 5567799999999887753211 11 01122223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchh Q lcl|NC_018087. 399 LSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKY 478 (520) Q Consensus 399 lkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky 478 (520) ..|..+ .|+-.+. .+.+.|-. .+.++.+|+.-...|+|+.. .+...++ +.|.+.++.+=.-. -+ T Consensus 318 ~~f~~~--~L~P~~~-~ie~~l~~-----~L~~~~~~~~~~~~~~fd~~------~llr~d~-~~r~~~~~~~l~~~-G~ 381 (423) T protein:vir:81 318 KALYGD--NLGSWIR-IIQDVMNL-----FLLPRVGIDNEKFYFEFNLE------EKLRASF-EEAAEIKRAAVGNV-AW 381 (423) T ss_pred HHHHHH--HHHHHHH-HHHHHHhh-----hhcCccccccCccEEEecch------hhhccCH-HHHHHHHHHHHhCC-CC Confidence 335554 2332222 12232222 23454444433334444333 2322222 45555555431111 25 Q ss_pred hhHHHHHHHHhCCCHHHHHHHHHHHHHhhhc-CCccCCccccC Q lcl|NC_018087. 479 ISNHTAMKDFLQMSDEDIAAERKLIDEELSD-KIFNPPEPEEI 520 (520) Q Consensus 479 ~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~-~~~~~p~~e~~ 520 (520) ++.+-+++ ++.|.+-+ .-++.+-...-. +--+++..|+. T Consensus 382 ~T~NE~R~-~~gl~p~~--gGD~~~~p~n~~~~~~~~~~~~~~ 421 (423) T protein:vir:81 382 MTINEVRA-MDNLPSID--GGDDLARPLNTEFGDSEDAPGEEV 421 (423) T ss_pred cCHHHHHH-HhCCCCCC--CcceeecccccccCccCCCCCCCC Confidence 67777774 46665421 112211111000 00111222222 No 173 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=89.42 E-value=0.026 Score=29.23 Aligned_cols=393 Identities=11% Similarity=0.012 Sum_probs=182.8 Q ss_pred hhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHH--HHHHHHHHhhccchhHHHHhhh Q lcl|NC_018087. 18 VDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRE--LINTYRSLLNNYEVDNAVQEIV 95 (520) Q Consensus 18 ~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~--LI~~YR~ma~~pEvd~Ai~eIv 95 (520) ++...+.+++.. ..+|.+.. .++..+..... ..+-..++ -++.|++|..++.|.++++.+. T Consensus 1 v~~~~l~~e~at---------~~~~~d~~-------~~~~~~l~~~~-~~il~~a~~g~~~~y~~l~~D~~i~s~l~~rk 63 (488) T protein:vir:99 1 MEKPALGREIAT---------SGDGRDIT-------RPFISGLQVPN-DSILQRRGGNDLRVYEEILSDAQVKTVWGQRQ 63 (488) T ss_pred CCccchhHHHHH---------HHhhhhhh-------ccccCCCCCCC-hHHHHhhccCCHHHHHHHhhChHHHHHHHHHH Confidence 111111111100 00010000 11111111111 11111111 1689999999999999999998 Q ss_pred ceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCCCCCeeeeEe Q lcl|NC_018087. 96 SDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIELRR 175 (520) Q Consensus 96 neaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~elr~ 175 (520) .-+.-.+=...|- .++.-.+ ++.++....++-++|+.-..+++.- ..-|--++++++..++..-.+..|.. T Consensus 64 ~av~~~~w~i~p~----~~~~~~~----~~ae~v~~~l~~~~~~~~l~~~lda-~~~G~s~~Ei~w~~~~g~~~~~~l~~ 134 (488) T protein:vir:99 64 LAVVSREWKVEAG----GDRPIDQ----AAAEHLEQQLQRVGWDRVTSKMLFG-VFYGYAVSELIYGRDDRYITLEAIKV 134 (488) T ss_pred HHHhcCCceEEcC----CCChHHH----HHHHHHHHHHhCCCHHHHHHHHHhh-hhhcceeEEEEEeecCCeeeEeeeee Confidence 7765443221111 1122233 3344444445456777666666643 34688889999987665555667777 Q ss_pred cCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCc--ccEEEeecccccCCCCcchhhh Q lcl|NC_018087. 176 LDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPY--SAMVYAHSGLVDCCGKNIIGYL 253 (520) Q Consensus 176 lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~--~aI~y~hSGL~d~~~~~~~syL 253 (520) .+|+.+.+-+ .++... ... ..+..+..+|. .=+++.| --...+....|-| T Consensus 135 r~~~~f~~d~------~~~l~~--------~~~------------~~~~~g~~lp~~~~~i~~~~--~~~~g~p~g~gLl 186 (488) T protein:vir:99 135 RNRRRFRYDQ------DGGLRL--------LTP------------NNMFEGEPCPAPYFWHFSTG--ADNDDEPYGLGLA 186 (488) T ss_pred ecccceeecC------CCceEE--------ecc------------CCCCCccccccCceEEEEee--cCCCCCcccchHH Confidence 7776555421 111110 000 01112344443 3344444 2233344557899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh-cCccceEEEcc-CCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhh Q lcl|NC_018087. 254 HRAVKPANQLKLLEDAMMIYRIT-RAPDRRVFYID-TGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTE 331 (520) Q Consensus 254 ~~aik~~NqL~m~EDalVIyRi~-RApeRRvFyID-vGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlE 331 (520) .+|..+|--.+......+.+--. =.|-| +...| .|.=+..| .+.++.+. ...+.- | -. T Consensus 187 ~~~~w~~~fK~~~~~~w~~f~E~yG~P~~-igky~~~~a~~~ek-~~l~~av~-~~~~~~------~------~v----- 246 (488) T protein:vir:99 187 HWLYWPVFFKRNGIKFWLIFLDKFGMPTA-VGRYDDKTATPEDK-AKLLAALH-AIQTDS------A------II----- 246 (488) T ss_pred HHHHHHHHHHHhhHHHHHHHHHHcCCcee-eeecCCCCCCHHHH-HHHHHHHH-HHhcCc------E------EE----- Confidence 99999988888777777766443 35655 44334 33323332 22333322 222211 1 11 Q ss_pred hhcccccCCCCCcceeecCCCCCcCh--HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHH Q lcl|NC_018087. 332 DYWLQRRDGKAVTEVETLPGMTGMNE--MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQ 409 (520) Q Consensus 332 DywLpRReGgrgTEIsTLpGg~nLge--i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr 409 (520) +| .|++|+.+..+..-+. ..=++|..++.-+++-=-. |.++++.+ +++..-+-+|+ +...+..-. T Consensus 247 ---iP-----~~~~ie~~ea~~~~~~~~~~li~~~d~~Isk~iLGqt--lts~~~~G--s~a~~~vh~~v-~~d~~~aDa 313 (488) T protein:vir:99 247 ---MP-----AGMQAELLEAGRSGTADYKTLHDTMDATIAKVGLGQV--ASTQGTPG--RLGNDDLQADV-RLDLVKADA 313 (488) T ss_pred ---ec-----CCceeEEeecCCCChHHHHHHHHHHHHHHHHHHhhhh--hccccccc--chhhHHHHHHH-HHHHHHHHH Confidence 12 3678888874333222 2347788888777642211 33333221 23333344444 444455555 Q ss_pred HHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHh Q lcl|NC_018087. 410 HKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFL 489 (520) Q Consensus 410 ~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL 489 (520) +.++..+..-|-..|+.=|. ... -...+.+++... |-++.+.+.+..+-+..|-=++.+|++++ + T Consensus 314 ~~i~~tln~~li~~l~~~N~-~~~----~~p~~~~~~~e~---------edl~~~a~~~~~l~~~~G~~i~~~~i~e~-~ 378 (488) T protein:vir:99 314 DLICESFNLGPARWLTEWNF-PGA----QPPRVYRVIEEP---------EDITAKAERDEKVFRMSGFRPTRGYVQET-Y 378 (488) T ss_pred HHHHHHHHHHHHHHHHHhCc-CCc----CCceeEecCCCc---------ccHHHHHHHHHHHHhhcCCCCCHHHHHHH-c Confidence 55555555323333333232 111 123444544422 33344455555554444555899999966 7 Q ss_pred CCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 490 QMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 490 ~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) ++...+-.+.. ..+ -+.+.+.+. T Consensus 379 Gip~~~~~~~~-------~~~-~~~~~~~~~ 401 (488) T protein:vir:99 379 GVEVESTQAEA-------TAP-TPSTEFAEG 401 (488) T ss_pred CCCCccccccc-------ccC-CCcccCCCC Confidence 88765432211 001 111111111 No 174 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=87.87 E-value=0.036 Score=28.50 Aligned_cols=396 Identities=11% Similarity=0.047 Sum_probs=197.4 Q ss_pred hhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchh Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVD 88 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd 88 (520) |.++ .|.++- -.-.|.|-..-.+.++.- ..++-..+|... +=+. T Consensus 1 l~~~-----------------------~~r~~~-------~~~yY~g~~~~~~~~~~~----p~~~~~~~~~v~--nw~~ 44 (410) T protein:vir:95 1 MNLY-----------------------QSRVNL-------RYKHYAMQHYEAPTGITI----PAHIRAKYQAVL--GWAA 44 (410) T ss_pred CCcc-----------------------hhhHHH-------HHHHhcCCCCccccchhc----cHHHHhHHHhhc--chhH Confidence 0000 000000 000112221111212111 113333455333 3344 Q ss_pred HHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCCCC Q lcl|NC_018087. 89 NAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKD 168 (520) Q Consensus 89 ~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~ 168 (520) -||+..++=..+ +-. + .++. +...+..--+|+....+.++.=++.||-|.- |... .+ T Consensus 45 ~~Vds~a~rl~~-~Gf-~-----~~d~------------~l~~i~~~N~ld~~~~~~~~~al~~G~sf~~-v~~~---~d 101 (410) T protein:vir:95 45 KGVDSLADRLIF-RAF-A-----NDDF------------NVTEIFDRNNPDIFFDSAILSALIGSCSFVY-ISKG---ED 101 (410) T ss_pred HHHHHhHhhhcc-ccc-c-----CCCc------------hHHHHHhhcChHHHHHHHHHHHHHhCceeEE-EecC---CC Confidence 466665542211 110 0 1111 2345566677888889999999999996544 4332 24 Q ss_pred CeeeeEecCccceeeeeeccCC-CCccccccc-----ceecceeecCccccccc--ccceecCCcceecCcccEEEeecc Q lcl|NC_018087. 169 GIIELRRLDPRNVQFVRELDTK-MENGVKVVK-----GYREYFLYDTELESYQC--GHQHFAAGTKIKIPYSAMVYAHSG 240 (520) Q Consensus 169 GI~elr~lDPr~i~~vr~i~~~-~~~~~~~~~-----~~~ey~~y~~~~~~~~~--~~~~~~~~~~~~I~~~aI~y~hSG 240 (520) |-..++.++|+.+.-+.+=.+. ..-+..+.. ......+|.+....+.. ++....++ +.- -...|.|+|.. T Consensus 102 ~~~~i~~~sP~~~~~i~Dp~~~~~~~al~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~g-~vPvV~f~n~~ 179 (410) T protein:vir:95 102 DEVRLQVIESSNATGVIDPITGLLVEGYAVLARDDYNRPTLEAYFEPNATHFIPKDGEPYSVTN-ETG-IPLLVPVIHRP 179 (410) T ss_pred CceEEEEEcccceEEEEeCCCCceEEEEEEEEecCCCeEEEEEEEeCCcEEEEeeCCccccccC-CCC-CcceEEecccc Confidence 5567899999998877642111 111111111 11123344433211111 10000011 110 02346666543 Q ss_pred cccCCCCcchhhh-HHHHHHHHH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCC Q lcl|NC_018087. 241 LVDCCGKNIIGYL-HRAVKPANQ-LKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTG 318 (520) Q Consensus 241 L~d~~~~~~~syL-~~aik~~Nq-L~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TG 318 (520) -. ......|-+ +..+...+- -+.|.++++.=...=.|.|-++=+|...-|..+-..+ T Consensus 180 ~l--~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~------------------- 238 (410) T protein:vir:95 180 DA--VRPFGRSRITRAGMYYQKYAKRTLERADITAEFYSWPQKYILGLDPDAEPMEKWKAT------------------- 238 (410) T ss_pred cC--CccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeeccCCCCCcCchhhhh------------------- Confidence 22 222223322 222222222 2678888899899999999887655422222111111 Q ss_pred ccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHH Q lcl|NC_018087. 319 KVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDE 398 (520) Q Consensus 319 ev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDE 398 (520) +--=..+|.-++|-+.+|..+|+++=-+=++=++=.-..+....++|.+-|...+. |. ..+..|.-.| T Consensus 239 ----------~~~i~~~~~~~~~~~~~v~q~~~~~l~~~~~~l~~l~~~~a~~s~lP~~~lg~~~~-Np-sSa~Al~a~~ 306 (410) T protein:vir:95 239 ----------VSSLLTISSSDKGVKPSVGQFTTASMSPFTEQLRTAAAGFAGEMGLTLDDLGFVSD-NP-SSVEAIKASH 306 (410) T ss_pred ----------hhhheeccCCCCCCcceEEecCCCChHHHHHHHHHHHHHHhhhcCCCHHHhccccC-ch-hHHHHHHHHH Confidence 11114457767777788988988543333344555556666777999877764432 11 2334567778 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC--hhhHHhhhhceEEEeec--cchHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_018087. 399 LSFDKFISELQHKFEEIFLSPLKSNLLLKRVIT--EDEWEAELNNIKIVFHK--NSYFSEMKTIEITERRVNVLSLMEPY 474 (520) Q Consensus 399 lkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t--~eew~~~~~~I~~~f~~--Dn~f~ElKe~Ei~~~R~~~~~~~~p~ 474 (520) ....+-+.+-|+.|..-+..+++.-+.+.+-.. +.+|. .+.+.|.. |- + +--+.++.+.+..+..- T Consensus 307 ~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~----~~~v~W~p~~d~---~---~~s~a~~aDa~~Kl~~a 376 (410) T protein:vir:95 307 ENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFV----RTAVKWEPLFEA---D---ANTMTMIGDGVVKLNQA 376 (410) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccccc----eeeEEeeecCCc---c---hhhHHHHHHHHHHHHHh Confidence 889999999999999999999999888866543 33443 34555541 21 1 11234555554444332 Q ss_pred cchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCC Q lcl|NC_018087. 475 IGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKI 511 (520) Q Consensus 475 vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~ 511 (520) +-.+.+.++++ +.|++||+||.... +++...+|- T Consensus 377 ~~g~~~~~~~~-~~lg~~~~~~~~~~--~~e~~~~g~ 410 (410) T protein:vir:95 377 LPGYINAETIR-DLTGIAGDMSAKPV--VSEGGSNGE 410 (410) T ss_pred ccCCccHHHHH-HhcCCChHHHHHHH--HHHHHhCCC Confidence 22345777777 55999999876533 333333332 No 175 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=87.53 E-value=0.037 Score=28.43 Aligned_cols=189 Identities=14% Similarity=0.111 Sum_probs=93.0 Q ss_pred EEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCC----CCCcceeecCCCCCcChH Q lcl|NC_018087. 283 VFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDG----KAVTEVETLPGMTGMNEM 358 (520) Q Consensus 283 vFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReG----grgTEIsTLpGg~nLgei 358 (520) ||.++. +.+-+ +...+++ ++.|.+..++==. .+| +.+=+..++. .+|+-+ T Consensus 1 V~k~~~------------------l~~~~--~~~~~~~---~~r~~~~~~~~~~-~~~~~ld~~~e~~e~~~--~~lsGl 54 (201) T protein:vir:10 1 MWKAKG------------------LADLC--DDSDGAA---RLRLAQVDNNSGV-GQAIGIDADSEEYNVLN--SDIGGI 54 (201) T ss_pred CccchH------------------HHHHh--cCChHHH---HHHHHHHHHhhhh-hhhheeecCCcceeeee--cCcCCh Confidence 333221 00000 0000111 1112111111000 000 0001222222 356667 Q ss_pred HH-HHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhcCCCChhhHH Q lcl|NC_018087. 359 DD-ILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFISELQHKF-EEIFLSPLKSNLLLKRVITEDEWE 436 (520) Q Consensus 359 ~D-V~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI~rLr~rF-s~if~d~Lk~QLiLkgi~t~eew~ 436 (520) +| +..|...+=-+.++|+.||-.++..++ +.+++ -|.-.|..+|..+|.+. ..+...++ .-+... T Consensus 55 ~d~l~~~~~~iaa~s~iP~t~LfG~sp~Gl-natge--~d~~nyyd~i~~~Qe~~l~p~le~l~-----~~~~~~----- 121 (201) T protein:vir:10 55 DTFLSQKFDRIVALSGIHEIILKGKNVGGV-SASQN--TALETFYGYVDRKRKAELLPLLEFLL-----PFIVTE----- 121 (201) T ss_pred HHHHHHHHHHHHhHhcCchhhhcCCCCccc-cccch--hHHHHHHHHHHHHHHHHHHHHHHHHH-----HhhcCC----- Confidence 77 457888899999999999977665554 22221 13346999999999533 33333322 223322 Q ss_pred hhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCC------HHHHHHHHHHHHHhhhcC Q lcl|NC_018087. 437 AELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMS------DEDIAAERKLIDEELSDK 510 (520) Q Consensus 437 ~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~t------DeeI~~~~kqi~~E~~~~ 510 (520) +.++|.|..=..=+|...+||.....++++.+-.- -.+|.+-+++.+-... +..|......-+.| .| T Consensus 122 ---~~~~~~f~pL~~~s~kekAei~~~~a~a~~~~~~~--g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~~e~~--dp 194 (201) T protein:vir:10 122 ---QEWSVEFNPLSQVSDKDKSEILEKNVNSVAALIAA--GIIDADEARDTLRAISTEVKIGEGSIQTEVVINESE--DP 194 (201) T ss_pred ---CCceEeeCCCCCCCHHHHHHHHHHHHHHHHHHHHc--CCCCHHHHHHHHHhcCCcCCCCCCCCCccccccccC--CC Confidence 35778999989999999999999999999887432 2467777776543321 11111111111111 11 Q ss_pred CccCCccc Q lcl|NC_018087. 511 IFNPPEPE 518 (520) Q Consensus 511 ~~~~p~~e 518 (520) .-.|+++ T Consensus 195 -~~~~~~~ 201 (201) T protein:vir:10 195 -LDVSANN 201 (201) T ss_pred -CCCCCCC Confidence 1123333 No 176 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=84.65 E-value=0.059 Score=27.33 Aligned_cols=402 Identities=14% Similarity=0.140 Sum_probs=164.2 Q ss_pred CCCcccCCCCCCCceeeccc---cc-ccccccccccccccccchhHHHHHHHHHHHhhccchhHHH-------------- Q lcl|NC_018087. 30 KAESITAPKFDDGATEVDSQ---DI-AYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNAV-------------- 91 (520) Q Consensus 30 ~~~s~~~p~~~dg~~~i~~~---~~-a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai-------------- 91 (520) -..-+.-|....-+-++-.. .. ...-..+.+ ++.-.+.+.+|..+..+.+-+..| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-------i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~ 73 (474) T protein:vir:96 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRL-------INNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNID 73 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHH-------HHHHHHHHHHHHHHHHHhcccCccccccchhhhccccc Confidence 01111111111111111000 00 000000001 111122334455555444433221 Q ss_pred -------------HhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEE Q lcl|NC_018087. 92 -------------QEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFH 158 (520) Q Consensus 92 -------------~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~h 158 (520) .-||+-.+-+ --..||++..++.+..+ .+ +.+++ =+|+..-.++.+.+.+-|+-|.+ T Consensus 74 ~~~~~~ki~~n~~k~Iv~~~~~y-l~g~p~~~~~~~~~~~~----~l----~~~~~-n~~~~~~~~l~~~~~~~G~~~~~ 143 (474) T protein:vir:96 74 YTKPDWRITTNFHQNLVDQKVSY-VAGKPVTYAHDDDKVLD----VI----HQVLD-TRWDNKLIDILTAASNKGIDWLQ 143 (474) T ss_pred ccccccccccchHHHHHHhhhhh-hcccCceeccCChHHHH----HH----HHHHh-ccHHHHHHHHHHHHhhCCeEEEE Confidence 2222222222 12367777666543332 33 33332 36788888999999999999988 Q ss_pred EeeecCCCCCCeeeeEecCccceeeeeecc--CCCCcccccc--cceecceeecCcc-cccccccceec----------- Q lcl|NC_018087. 159 KIINPNRPKDGIIELRRLDPRNVQFVRELD--TKMENGVKVV--KGYREYFLYDTEL-ESYQCGHQHFA----------- 222 (520) Q Consensus 159 kvid~~~~k~GI~elr~lDPr~i~~vr~i~--~~~~~~~~~~--~~~~ey~~y~~~~-~~~~~~~~~~~----------- 222 (520) .-+|. +|-..++.+||+.+.++.+-. .+..-.++.+ .+...+-+|.+.. ..|......+. T Consensus 144 ~~~d~----~~~~~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~ 219 (474) T protein:vir:96 144 VYINE----DGELKLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQ 219 (474) T ss_pred eeeCC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeecccccccccc Confidence 77653 367889999999999875421 1111222211 1112233444321 11111111000 Q ss_pred ----CCcceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHH Q lcl|NC_018087. 223 ----AGTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLK-LLEDAMMIYRITRAPDRRVFYIDTGNMPARKAA 297 (520) Q Consensus 223 ----~~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~-m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAe 297 (520) ++.--+|| |+++ .++....|=|+..+.....+. ++-+....-+-++.|-+-+.-.+.-++. T Consensus 220 ~~~~~~~~~~vP---vv~~------~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~----- 285 (474) T protein:vir:96 220 THFSTGSWERVP---FIAF------KNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLS----- 285 (474) T ss_pred CcccccCCCccc---eEEe------cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCccccc----- Confidence 00000111 1211 122234455555555554443 2333333445555554433322111111 Q ss_pred HHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCCh Q lcl|NC_018087. 298 QHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPL 376 (520) Q Consensus 298 qyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~ 376 (520) +. +.-+..+++-.-+++. .+..|-...+.+. -.-+.-+.+.+|...++|- T Consensus 286 ~~---------------------------~~~~~~~~~i~~~~~~--~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~ 336 (474) T protein:vir:96 286 EF---------------------------MEGLKYYKAINVSSDG--GVETIQVEVPVASTKEYLDMMRAYIVEFGQGVD 336 (474) T ss_pred ch---------------------------hhhhhccceeeccCCC--ceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcC Confidence 00 1111111111111111 2444433333322 2335556677899999984 Q ss_pred hhccCCCccccccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHH Q lcl|NC_018087. 377 SRIPDEQTQNVFDMSTAIS--RDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSE 454 (520) Q Consensus 377 SRl~~~~~~~~~G~~~eIt--RDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~E 454 (520) - .+++ ++ |..+..+ .-......-+.+.+..|...+...|+.=+-+-|+ ..+| ..|.+.|....--.+ T Consensus 337 ~--~~~~-~~--~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~--~~d~----~~i~i~f~~~~p~~~ 405 (474) T protein:vir:96 337 F--QTDK-FG--SATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKI--KLDA----KEIEITFNFNVMVND 405 (474) T ss_pred c--cccc-cc--cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Cccc----ceeeEEecCCCccCH Confidence 2 2222 11 2233322 1122233445666666666666666643333343 2233 457788865544444 Q ss_pred HHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcC--CccC--------------Cccc Q lcl|NC_018087. 455 MKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDK--IFNP--------------PEPE 518 (520) Q Consensus 455 lKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~--~~~~--------------p~~e 518 (520) ...+ +++.++ | .+|.+++++. |..+++ -+++-++|++|..+. ..+. |+++ T Consensus 406 ~e~a-------~~~~~~----g-iiS~et~~~~-lp~v~D-~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (474) T protein:vir:96 406 LEQS-------QIGAQS----Q-YLSKETLVRH-HPWVDD-PKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSENN 471 (474) T ss_pred HHHH-------HHHHHc----C-CCChHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCcc Confidence 3333 333332 2 5799999966 565432 222233444332211 1111 1111 Q ss_pred cC Q lcl|NC_018087. 519 EI 520 (520) Q Consensus 519 ~~ 520 (520) |. T Consensus 472 e~ 473 (474) T protein:vir:96 472 QS 473 (474) T ss_pred cc Confidence 11 No 177 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=84.65 E-value=0.059 Score=27.33 Aligned_cols=402 Identities=14% Similarity=0.140 Sum_probs=164.2 Q ss_pred CCCcccCCCCCCCceeeccc---cc-ccccccccccccccccchhHHHHHHHHHHHhhccchhHHH-------------- Q lcl|NC_018087. 30 KAESITAPKFDDGATEVDSQ---DI-AYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNAV-------------- 91 (520) Q Consensus 30 ~~~s~~~p~~~dg~~~i~~~---~~-a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai-------------- 91 (520) -..-+.-|....-+-++-.. .. ...-..+.+ ++.-.+.+.+|..+..+.+-+..| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-------i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~ 73 (474) T protein:vir:95 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRL-------INNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNID 73 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHH-------HHHHHHHHHHHHHHHHHhcccCccccccchhhhccccc Confidence 01111111111111111000 00 000000001 111122334455555444433221 Q ss_pred -------------HhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEE Q lcl|NC_018087. 92 -------------QEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFH 158 (520) Q Consensus 92 -------------~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~h 158 (520) .-||+-.+-+ --..||++..++.+..+ .+ +.+++ =+|+..-.++.+.+.+-|+-|.+ T Consensus 74 ~~~~~~ki~~n~~k~Iv~~~~~y-l~g~p~~~~~~~~~~~~----~l----~~~~~-n~~~~~~~~l~~~~~~~G~~~~~ 143 (474) T protein:vir:95 74 YTKPDWRITTNFHQNLVDQKVSY-VAGKPVTYAHDDDKVLD----VI----HQVLD-TRWDNKLIDILTAASNKGIDWLQ 143 (474) T ss_pred ccccccccccchHHHHHHhhhhh-hcccCceeccCChHHHH----HH----HHHHh-ccHHHHHHHHHHHHhhCCeEEEE Confidence 2222222222 12367777666543332 33 33332 36788888999999999999988 Q ss_pred EeeecCCCCCCeeeeEecCccceeeeeecc--CCCCcccccc--cceecceeecCcc-cccccccceec----------- Q lcl|NC_018087. 159 KIINPNRPKDGIIELRRLDPRNVQFVRELD--TKMENGVKVV--KGYREYFLYDTEL-ESYQCGHQHFA----------- 222 (520) Q Consensus 159 kvid~~~~k~GI~elr~lDPr~i~~vr~i~--~~~~~~~~~~--~~~~ey~~y~~~~-~~~~~~~~~~~----------- 222 (520) .-+|. +|-..++.+||+.+.++.+-. .+..-.++.+ .+...+-+|.+.. ..|......+. T Consensus 144 ~~~d~----~~~~~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~ 219 (474) T protein:vir:95 144 VYINE----DGELKLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQ 219 (474) T ss_pred eeeCC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeecccccccccc Confidence 77653 367889999999999875421 1111222211 1112233444321 11111111000 Q ss_pred ----CCcceecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHH Q lcl|NC_018087. 223 ----AGTKIKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLK-LLEDAMMIYRITRAPDRRVFYIDTGNMPARKAA 297 (520) Q Consensus 223 ----~~~~~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~-m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAe 297 (520) ++.--+|| |+++ .++....|=|+..+.....+. ++-+....-+-++.|-+-+.-.+.-++. T Consensus 220 ~~~~~~~~~~vP---vv~~------~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~----- 285 (474) T protein:vir:95 220 THFSTGSWERVP---FIAF------KNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLS----- 285 (474) T ss_pred CcccccCCCccc---eEEe------cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCccccc----- Confidence 00000111 1211 122234455555555554443 2333333445555554433322111111 Q ss_pred HHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHHHHHHHHHHHHhcCCCh Q lcl|NC_018087. 298 QHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDDILYFRKALYMALRVPL 376 (520) Q Consensus 298 qyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~DV~YF~kkLy~aL~VP~ 376 (520) +. +.-+..+++-.-+++. .+..|-...+.+. -.-+.-+.+.+|...++|- T Consensus 286 ~~---------------------------~~~~~~~~~i~~~~~~--~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~ 336 (474) T protein:vir:95 286 EF---------------------------MEGLKYYKAINVSSDG--GVETIQVEVPVASTKEYLDMMRAYIVEFGQGVD 336 (474) T ss_pred ch---------------------------hhhhhccceeeccCCC--ceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcC Confidence 00 1111111111111111 2444433333322 2335556677899999984 Q ss_pred hhccCCCccccccccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHH Q lcl|NC_018087. 377 SRIPDEQTQNVFDMSTAIS--RDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSE 454 (520) Q Consensus 377 SRl~~~~~~~~~G~~~eIt--RDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~E 454 (520) - .+++ ++ |..+..+ .-......-+.+.+..|...+...|+.=+-+-|+ ..+| ..|.+.|....--.+ T Consensus 337 ~--~~~~-~~--~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~--~~d~----~~i~i~f~~~~p~~~ 405 (474) T protein:vir:95 337 F--QTDK-FG--SATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKI--KLDA----KEIEITFNFNVMVND 405 (474) T ss_pred c--cccc-cc--cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Cccc----ceeeEEecCCCccCH Confidence 2 2222 11 2233322 1122233445666666666666666643333343 2233 457788865544444 Q ss_pred HHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcC--CccC--------------Cccc Q lcl|NC_018087. 455 MKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDK--IFNP--------------PEPE 518 (520) Q Consensus 455 lKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~--~~~~--------------p~~e 518 (520) ...+ +++.++ | .+|.+++++. |..+++ -+++-++|++|..+. ..+. |+++ T Consensus 406 ~e~a-------~~~~~~----g-iiS~et~~~~-lp~v~D-~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (474) T protein:vir:95 406 LEQS-------QIGAQS----Q-YLSKETLVRH-HPWVDD-PKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSENN 471 (474) T ss_pred HHHH-------HHHHHc----C-CCChHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCcc Confidence 3333 333332 2 5799999966 565432 222233444332211 1111 1111 Q ss_pred cC Q lcl|NC_018087. 519 EI 520 (520) Q Consensus 519 ~~ 520 (520) |. T Consensus 472 e~ 473 (474) T protein:vir:95 472 QS 473 (474) T ss_pred cc Confidence 11 No 178 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=83.91 E-value=0.065 Score=27.10 Aligned_cols=316 Identities=14% Similarity=0.055 Sum_probs=135.6 Q ss_pred hccCCCcccCCCC-CCC--ceee-ccc-c---------------cccccccccccccccccchhHHHHHHHHHHHhhccc Q lcl|NC_018087. 27 INDKAESITAPKF-DDG--ATEV-DSQ-D---------------IAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYE 86 (520) Q Consensus 27 ~~~~~~s~~~p~~-~dg--~~~i-~~~-~---------------~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pE 86 (520) ++++.....|.+. ... ..++ .-+ + .+.+|.++.+.+ +...|-+-+|..+.|-- T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~pp~-------~~~~la~l~~~~~~h~~ 73 (346) T protein:vir:10 1 MKKQLRKNLTQNDRLQPQAQTEIFSFGDPIPVLDRADILNYLECSAMYEKWYNPPM-------SFDGLAKSLRSSTHHES 73 (346) T ss_pred CCcccCCCCCcccccccccCeEEEecCCcceecCchhHHHHHHHhhcCCceEecCC-------CHHHHHHHHHhhhhcch Confidence 1111111111100 000 0000 000 0 000111111111 12223344444433311 Q ss_pred hhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcc---hhhh---HHHHHhhccccceeEEEe Q lcl|NC_018087. 87 VDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNF---QRKG---SDHFKRWYVDSRVFFHKI 160 (520) Q Consensus 87 vd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f---~k~g---~~~fRrWYvDgri~~hkv 160 (520) +-.| |. ..+..++.- .-+. .+++..|.+-|--|++++ T Consensus 74 ~i~~---------------------------------k~----n~l~~l~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~i~ 116 (346) T protein:vir:10 74 AIIT---------------------------------KA----NILLSTCEVDSRYLSRRDLSSFVKDYLVFGNAYFEVV 116 (346) T ss_pred hhhh---------------------------------hh----hhHHHHHhCCCCCCCHHHHHHHHHHHHhcCCeEEEEE Confidence 1111 00 112222210 0011 123345667799999998 Q ss_pred eecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecc Q lcl|NC_018087. 161 INPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSG 240 (520) Q Consensus 161 id~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSG 240 (520) -+ ....+++|.+|+|..++..+ ..++.. |.++.. .+..+.++++.|+|.. T Consensus 117 r~---~~G~~~~L~pl~~~~v~~~~-----~~~~~~-------~~~~~~-------------~g~~~~~~~~dIih~r-- 166 (346) T protein:vir:10 117 RN---RLGQVQRIESPLAKYVRKGL-----EAGQFY-------YVPQRF-------------DHQEHEFAKGSIYHLL-- 166 (346) T ss_pred Ec---CCCcEEEEEEecCCceEEEE-----cCCeEE-------EEEEcc-------------CCeEEEEecccEEEec-- Confidence 55 33359999999998887521 112111 112211 1234678999998774 Q ss_pred cccC-CCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCc Q lcl|NC_018087. 241 LVDC-CGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGK 319 (520) Q Consensus 241 L~d~-~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGe 319 (520) ..++ ++-..+|-+..|+..+..-...++...=|=--=|--.-|+|+.-++|.+..+++ +++-+.+.+. . T Consensus 167 ~~~~~~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG~~~~~il~~~d~~l~~e~~~~-i~~~~~~~~g---------~ 236 (346) T protein:vir:10 167 EPDINQDIYGLPQYLSALQSAWLNESATLFRRKYFLNGAHAGFVFYMSDASQKQEDVEN-IRQQLKQSKG---------V 236 (346) T ss_pred CCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHH-HHHHHHHhcC---------c Confidence 3455 454567888888877766555555432221112345667777556776554443 4444444332 0 Q ss_pred cccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHH-HHHHHHHHhcCCChhhccC-CCccccccccchhhHH Q lcl|NC_018087. 320 VKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDIL-YFRKALYMALRVPLSRIPD-EQTQNVFDMSTAISRD 397 (520) Q Consensus 320 v~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~-YF~kkLy~aL~VP~SRl~~-~~~~~~~G~~~eItRD 397 (520) .|.++.+ + ++--....|.+++.|.--..-.|+-.++ +-.+...++.+||...+.- +++...|| .+.-. T Consensus 237 -~n~~~~~-v-----l~~~~~~~gi~~~pis~~~~d~qf~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s---~~e~~ 306 (346) T protein:vir:10 237 -GNFKNLF-V-----HAPNGKKDGIQIIPIADVSAKDEFFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFG---NVADA 306 (346) T ss_pred -cccCcee-E-----ecCCCCccceeEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcc---cHHHH Confidence 1111111 1 1111112355666654322223333333 4466799999999998852 22211122 23333 Q ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHH Q lcl|NC_018087. 398 ELSFDKF-ISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSE 454 (520) Q Consensus 398 ElkF~KF-I~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~E 454 (520) ..-|.+. |.-|+.+|.+++..+.. .+ |+|+-..=--++| T Consensus 307 ~~~f~~~~l~P~~~~iee~n~~L~~------e~------------i~F~~~~ll~~~~ 346 (346) T protein:vir:10 307 AEVFFITEIEPLQERLKEFNQWLGQ------EV------------IKFKPSKLLQRTQ 346 (346) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccc------ce------------eeechhhhcccCC Confidence 4445555 78888888876554211 11 1111100000111 No 179 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=83.46 E-value=0.068 Score=26.98 Aligned_cols=394 Identities=12% Similarity=0.086 Sum_probs=168.2 Q ss_pred eeccc-ccccccccccccccccccch-----------hHHHHHHHHHHHhhccchhHHH--------------------- Q lcl|NC_018087. 45 EVDSQ-DIAYNGVFQKLYGSQDPTAT-----------STRELINTYRSLLNNYEVDNAV--------------------- 91 (520) Q Consensus 45 ~i~~~-~~a~~g~~~~~~~~~~~~~~-----------~~~~LI~~YR~ma~~pEvd~Ai--------------------- 91 (520) |++.+ +... -..-.+.-.++.... .-..-+.+|+.+..+.+-+..| T Consensus 1 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~k 79 (474) T protein:vir:96 1 MIVIFWPNEK-PYHERVVEQIKPKYETQEEMIIRLINDHKPKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKPDWR 79 (474) T ss_pred CeeeccCCCc-hhhhhHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhccCCcchhccchhcccccccccccchh Confidence 44432 0000 000000001111111 1122345566665554433221 Q ss_pred ------HhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCC Q lcl|NC_018087. 92 ------QEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNR 165 (520) Q Consensus 92 ------~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~ 165 (520) ..||+-.+-+= -..||++..++.+..+ ++. .+++ =+|+..-.++.+...+-|+-|.+.-+|. T Consensus 80 i~~n~~~~Ivd~~~~~l-~g~p~~~~~~d~~~~~----~l~----~~~~-n~~~~~~~~~~~~~~~~G~~~~~~y~d~-- 147 (474) T protein:vir:96 80 MFTNYHQNLVDQKVAYA-VANPVTFSSDDDKSLK----TIQ----EVLN-HKWDDKLVDILTAASNKGIEWLQPYIDE-- 147 (474) T ss_pred cccchHHHHHHhhhhhh-cccCceeecCchHHHH----HHH----HHHh-cCHHHHHHHHHHHHHhcCeeEEEEEecC-- Confidence 12222222221 1277777776654333 332 3332 2667777888899999999988776652 Q ss_pred CCCCeeeeEecCccceeeeeec--cCCCCcccccc--cceecceeecCc-cccccccc---------ceecCCcc----- Q lcl|NC_018087. 166 PKDGIIELRRLDPRNVQFVREL--DTKMENGVKVV--KGYREYFLYDTE-LESYQCGH---------QHFAAGTK----- 226 (520) Q Consensus 166 ~k~GI~elr~lDPr~i~~vr~i--~~~~~~~~~~~--~~~~ey~~y~~~-~~~~~~~~---------~~~~~~~~----- 226 (520) +|-..+..+||+.+.++.+- ..+..-.++.+ .+...+.+|.+. ...|.... ........ T Consensus 148 --~~~~~i~~~~p~~~~~v~d~~~~~~~~~~vr~~~~~~~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (474) T protein:vir:96 148 --NGEFKTFRVPAEQAIPIWTNKERDTLKAFIRYYRLDGAERVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYYVGN 225 (474) T ss_pred --CCceEEEEEcccceEEEEcCCCCCceEEEEEEEeecCceEEEEEeCCeEEEEEecCCceeeccccccccccccccccc Confidence 35677999999999887542 11222222211 112223333322 11111100 00000000 Q ss_pred -----eecCcccEEEeecccccCCCCcchhhhHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHH Q lcl|NC_018087. 227 -----IKIPYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLK-LLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHM 300 (520) Q Consensus 227 -----~~I~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~-m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl 300 (520) =+|| |+++ +|+....|=++..+...+.+. ++-+..-.-+.++.|-+-+.-.+.... . +.+ T Consensus 226 ~~~~~g~iP---vv~~------~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~--~---~~~ 291 (474) T protein:vir:96 226 KRVSWGRVP---FIPF------KNNPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYEGQDL--D---EFM 291 (474) T ss_pred cccCCCcee---EEEe------ccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccc--c---chh Confidence 0111 1111 122334565666555555554 233334444566666543332211110 0 000 Q ss_pred HHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChH-HHHHHHHHHHHHhcCCChhhc Q lcl|NC_018087. 301 QHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEM-DDILYFRKALYMALRVPLSRI 379 (520) Q Consensus 301 ~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei-~DV~YF~kkLy~aL~VP~SRl 379 (520) . -|.. .++. ++| +.|.+++.|-...++... .-+.-..+.+|+..++|-. T Consensus 292 ~-~~~~--~~~i---------------------~~~----~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~-- 341 (474) T protein:vir:96 292 R-NLKY--YKAI---------------------NVD----GDGSGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDF-- 341 (474) T ss_pred h-hhhc--CceE---------------------Eec----CCCCceeEEeecCChHHHHHHHHHHHHHHHHHhCCccc-- Confidence 0 0111 1111 111 123346665544444333 3345667789999999952 Q ss_pred cCCCccccccccchhhHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHH Q lcl|NC_018087. 380 PDEQTQNVFDMSTAISRD--ELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKT 457 (520) Q Consensus 380 ~~~~~~~~~G~~~eItRD--ElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe 457 (520) .+++.. |..+..+.. ..---.-+.+.+..|..-+..+|+.=|-+.|+ ..+| ..|.+.|...--..+..- T Consensus 342 ~~~~~~---~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~--~~~~----~~i~i~f~~~~p~~~~e~ 412 (474) T protein:vir:96 342 QQDKFG---NSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKL--NIKV----QDVEITFNFNVMVNELEQ 412 (474) T ss_pred cccccc---cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Cccc----ceeeEEeccCCCcCHHHH Confidence 222211 222333311 11122345566666666666666553344443 2233 346677765544444322 Q ss_pred HHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCC------ccCCccccC Q lcl|NC_018087. 458 IEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKI------FNPPEPEEI 520 (520) Q Consensus 458 ~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~------~~~p~~e~~ 520 (520) + +++.++ | .+|.+++++. |...++ -+++.++|++|..+.. ..+.+..+- T Consensus 413 ~-------~~~~~a----g-~iS~et~~~~-~~~v~d-~~~E~~ri~~E~~e~~~~~~~~~~~~~~~~~ 467 (474) T protein:vir:96 413 S-------QIGVQS----Q-YLSKETVVTN-HPWVDD-PVAELERIEQDNIDFNKQLPPLEGDANGRAQ 467 (474) T ss_pred H-------HHHHhc----C-CCchHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHhcccccccccccccC Confidence 2 233332 2 4799999976 554432 2345555555543211 111111111 No 180 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=82.97 E-value=0.072 Score=26.84 Aligned_cols=385 Identities=11% Similarity=0.120 Sum_probs=172.2 Q ss_pred ecccccccccccccccccccccchhHHHHHHHHHHHhhccchhHHH---------------------------Hhhhcee Q lcl|NC_018087. 46 VDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNAV---------------------------QEIVSDA 98 (520) Q Consensus 46 i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai---------------------------~eIvnea 98 (520) +.. -.+.--+..-..-+.+|+.+..+++-+..| ..||+.. T Consensus 1 l~~-------------~~i~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~ 67 (451) T protein:vir:10 1 MEL-------------EKIRAIISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEK 67 (451) T ss_pred CCH-------------HHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhh Confidence 000 000000111112334455555555443211 1122222 Q ss_pred eEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCC----CCCCeeeeE Q lcl|NC_018087. 99 IVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNR----PKDGIIELR 174 (520) Q Consensus 99 iv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~----~k~GI~elr 174 (520) +-+= -..||++..++. +...+++. + .++ =+|+....++.+.+.+-|+-|.+.-+|.+. +.+|-..+. T Consensus 68 ~~yl-~G~p~~~~~~~~---~~~~~~~~-~---~~~-n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~ 138 (451) T protein:vir:10 68 ASYM-FTYPVLFDIDNN---KELNEKVT-D---VLG-NEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYG 138 (451) T ss_pred hhhe-ecccceeecCCc---HHHHHHHH-H---Hhc-cCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEE Confidence 2221 125566554443 22222332 2 222 367888889999999999999998887432 334677789 Q ss_pred ecCccceeeeeecc--CCCCcccccc------------cceecceeecCcc-cccccccceecCCcc---------e-ec Q lcl|NC_018087. 175 RLDPRNVQFVRELD--TKMENGVKVV------------KGYREYFLYDTEL-ESYQCGHQHFAAGTK---------I-KI 229 (520) Q Consensus 175 ~lDPr~i~~vr~i~--~~~~~~~~~~------------~~~~ey~~y~~~~-~~~~~~~~~~~~~~~---------~-~I 229 (520) .++|+.+-++.+-. .+..-.++.+ +.+.-+-+|++.. ..|...... ..+.. + +| T Consensus 139 ~i~p~~~~~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~-~~~~~~~~~~~~~~~g~v 217 (451) T protein:vir:10 139 VVNTEEIIPIYRNGIERELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVS-CCGSQIEHITVQHRFNSV 217 (451) T ss_pred EEcccceEEEEcCCCCCceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccC-ccccccccccccCCCCee Confidence 99999988875421 1111111111 0011122343331 111110000 00000 1 22 Q ss_pred CcccEEEeecccccCCCCcchhhhHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhc Q lcl|NC_018087. 230 PYSAMVYAHSGLVDCCGKNIIGYLHRAVKPANQLKL-LEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHR 308 (520) Q Consensus 230 ~~~aI~y~hSGL~d~~~~~~~syL~~aik~~NqL~m-~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~k 308 (520) | |+++ .++....|-++..+.....+.+ +=+..-.-+-+.-|-+-+.-.+... .+ +.+.. |..++ T Consensus 218 P---vv~~------~nn~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~-~~----~~~~~-~~~~~ 282 (451) T protein:vir:10 218 P---FVEF------SNNIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGED-TS----EFLKE-LKRYK 282 (451) T ss_pred e---EEEe------ccCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCccc-ch----hhHHH-HhhCC Confidence 2 2222 1233345666666555555442 3333334455666655443322211 11 11111 11222 Q ss_pred ceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHH-HHHHHHHHHHhcCCChhhccCCCcccc Q lcl|NC_018087. 309 NRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDD-ILYFRKALYMALRVPLSRIPDEQTQNV 387 (520) Q Consensus 309 nklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~D-V~YF~kkLy~aL~VP~SRl~~~~~~~~ 387 (520) --++.. .+.+.|-.+..|....+...... +.-+.+.+|+..++|- +.+++ T Consensus 283 ~i~~~~-----------------------~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~---- 333 (451) T protein:vir:10 283 TIKTET-----------------------DSEGDSGGLKTMQIEIPTEARKIILEILKKQIYESGQGLQ--QDTEN---- 333 (451) T ss_pred eEEecC-----------------------cCCccCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCccc--ccccc---- Confidence 111111 11222334666655556665555 7888899999999994 32222 Q ss_pred ccccchhhHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHH Q lcl|NC_018087. 388 FDMSTAISRD--ELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRV 465 (520) Q Consensus 388 ~G~~~eItRD--ElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~ 465 (520) ||.+|..+.. ......-+.+.+..|...+..+|+.=+-+-| ..+|. .|.+.|...---.+ .+.+ T Consensus 334 ~gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~---~~d~~----~i~i~f~~~~p~n~-------~e~~ 399 (451) T protein:vir:10 334 FGNASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLG---VTDYK----KIQQTYTRNMMSND-------LEDA 399 (451) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC---CCCcc----ceeEEecCCCCCCH-------HHHH Confidence 2444443322 2223334555666666666555544332223 33454 46677765444333 2234 Q ss_pred HHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhh------cCCccCCcc Q lcl|NC_018087. 466 NVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELS------DKIFNPPEP 517 (520) Q Consensus 466 ~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~------~~~~~~p~~ 517 (520) +++..+. | .+|.+++++. |...|. .+++.++|++|.. ..-+++=++ T Consensus 400 ~~~~kl~---g-~iS~et~~~~-~p~v~d-~~~e~~~~~ee~~~~~~~~~~~~~~~~~ 451 (451) T protein:vir:10 400 DIATKSV---G-IIPTKIILRH-HPWVDD-VEEAEKLYLEEKKIQASKVSDDYNNFTE 451 (451) T ss_pred HHHHHHh---c-cCchHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHHHHhhcCCCCC Confidence 4555553 4 3799999977 555441 2333333333322 222333222 No 181 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=82.82 E-value=0.074 Score=26.79 Aligned_cols=409 Identities=12% Similarity=0.072 Sum_probs=207.8 Q ss_pred cccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHh Q lcl|NC_018087. 3 MLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLL 82 (520) Q Consensus 3 ~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma 82 (520) |=-..+-.|...|......- .... -.|.|-..-.+.+.. + ..++-..+|... T Consensus 1 m~~~~i~~L~~~~~~~~~r~-~~~~-----------------------~yy~g~~~~~~~~~~--~--p~~~~~~~~~v~ 52 (422) T protein:vir:97 1 MNYMGMGYLRRKLALFKTGV-DKRY-----------------------RYYAMDDRDDTRSIV--M--PNNVREMYRSVL 52 (422) T ss_pred CChHHHHHHHHHHHHHHHHH-HHHH-----------------------HHHhcCCChhhcCcc--c--cHHHHHHHHhhc Confidence 43334555555555543321 1100 011111111111111 1 122323334323 Q ss_pred hccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeee Q lcl|NC_018087. 83 NNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIIN 162 (520) Q Consensus 83 ~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid 162 (520) + =+.-||+.+++=..+ + +. + .++. +...+...-+|+....+.++.=++.||-|.-.--+ T Consensus 53 n--w~~~~Vd~~a~rl~~-~-Gf---~--~~d~------------~l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~ 111 (422) T protein:vir:97 53 E--WTAKGVDSLADRIIF-R-EF---T--NDDF------------NAWEIFKANNPDIFFDTAIQSALIASCCFVYIMPG 111 (422) T ss_pred c--hhHHHHHHHHhcccc-c-ee---e--CCch------------hHHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeC Confidence 3 335567766552222 1 11 1 1111 23456666778888889999999999977665433 Q ss_pred cCCCCCCeeeeEecCccceeeeeeccCCCC-cccccc----cceecceee-cCccccc-ccccceecCCcceecCcccEE Q lcl|NC_018087. 163 PNRPKDGIIELRRLDPRNVQFVRELDTKME-NGVKVV----KGYREYFLY-DTELESY-QCGHQHFAAGTKIKIPYSAMV 235 (520) Q Consensus 163 ~~~~k~GI~elr~lDPr~i~~vr~i~~~~~-~~~~~~----~~~~ey~~y-~~~~~~~-~~~~~~~~~~~~~~I~~~aI~ 235 (520) +.+|...++.++|+.+--+.+=.+... -+..+. .+.....+| .+....+ ..++.....-.+.-. ...|. T Consensus 112 ---~~~~~p~i~~~sp~~~~~i~D~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~-vPvv~ 187 (422) T protein:vir:97 112 ---AEDGLPKMQVIEASKATGILDPTTFLLTEGYAILESDSNGNPTLEAYFTDKDIWYYPKKGKPYNIKNPTGH-PLLVP 187 (422) T ss_pred ---CCCCeeEEEEechhhEEEEEeCCCCcceeeEEEEEecCCCcEEEEEEEcCceEEEEcCCCccccccCCCCC-cceEE Confidence 234677899999999988765221111 111111 111111222 1111111 111111110111110 12355 Q ss_pred Eeeccc-ccCCCCcchhhh-HHHHHHHHH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeE Q lcl|NC_018087. 236 YAHSGL-VDCCGKNIIGYL-HRAVKPANQ-LKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRIS 312 (520) Q Consensus 236 y~hSGL-~d~~~~~~~syL-~~aik~~Nq-L~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklv 312 (520) |++..- .++.|. |-+ +..+...+- -+.+.++++.=...=.|.|-++=+|--.-|..+-..+ T Consensus 188 ~~n~~~~~~~~G~---s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~------------- 251 (422) T protein:vir:97 188 IIHRPDAVRPFGR---SRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYVLGMDPDAKPMEKWRAT------------- 251 (422) T ss_pred ecccCCCccccCc---cccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcccCcccccCchhhhh------------- Confidence 555422 223332 322 222222222 2567888899899999999887554322122111111 Q ss_pred eecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcC-hHHHHHHHHHHHHHhcCCChhhccCCCcccccccc Q lcl|NC_018087. 313 YDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMN-EMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMS 391 (520) Q Consensus 313 Yd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLg-ei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~ 391 (520) |-.=..+|.-+.|-+.+|..++++. |+ =++-++-.-..+....++|.+-|...+.... .+ T Consensus 252 ----------------~~~i~~~~~de~~~~~~v~q~~~~~-l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~Nps--Sa 312 (422) T protein:vir:97 252 ----------------VSTLLEISKDEDGDKPTVGQFTTAS-MAPFMEHLKMYASLFAGGSGLTLDDLGFPSDNPS--SV 312 (422) T ss_pred ----------------hhhhhccCCCCCCCcceeeecCCCC-hhHHHHHHHHHHHHHhcccCCCHHHhccccCchh--HH Confidence 1123446777777778899998854 44 3333444444555556999877765442111 23 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC--hhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 392 TAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVIT--EDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLS 469 (520) Q Consensus 392 ~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t--~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~ 469 (520) ..|.-.|....+-+.+-|+.|..-+..+++.-+.+.+-.. +++|. .+.+.|. .++-. ++..+.+..+.+. T Consensus 313 ~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~----~~~~~w~-p~~~~---~~~s~a~~aDa~~ 384 (422) T protein:vir:97 313 ESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEFPYLRNQFM----DTVIKWE-PLFEA---DANMLTLVGDGAI 384 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccchhhc----cceEEEc-cCCCC---ChHHHHHHHHHHH Confidence 4566778889999999999999999999999887766443 33443 3566665 22211 2333455555555 Q ss_pred HhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcC Q lcl|NC_018087. 470 LMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDK 510 (520) Q Consensus 470 ~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~ 510 (520) .+-.-+-.+.+.+++++. |++|+.+++ ...++++.-+| T Consensus 385 Kl~~a~~~~~~~~~~~~~-lg~~~~~~~--~~~~~~~~~d~ 422 (422) T protein:vir:97 385 KLNQAIPGFMDADVIRDL-TGVKGADKP--IPAITEVTTDG 422 (422) T ss_pred HHHhhccccccHHHHHHH-cCCCchhHH--HHHHHhhhccC Confidence 543333345677888755 899876554 34566666666 No 182 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=80.49 E-value=0.094 Score=26.20 Aligned_cols=396 Identities=10% Similarity=0.033 Sum_probs=207.8 Q ss_pred cccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHh Q lcl|NC_018087. 3 MLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLL 82 (520) Q Consensus 3 ~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma 82 (520) |=-+-+-.|..-|...... ++... -.|.|-..-.+.++. -..++-..||... T Consensus 1 ~~~~~i~~L~~~~~~~~~r-~~~~~-----------------------~yY~g~~~~~~~~~~----~p~~~~~~~~~v~ 52 (409) T protein:vir:94 1 MTEKGIGYLRFKLSVHKRR-AEMRY-----------------------DQYAMKYVDRFKGIT----IPQALSQQYRSIL 52 (409) T ss_pred CCHHHHHHHHHHHHHHhHH-HHHHH-----------------------HHhcccCchhhcChh----hhHHHHHHHhhhc Confidence 4444445555555443322 11100 001111111122221 1224555666544 Q ss_pred hccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeee Q lcl|NC_018087. 83 NNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIIN 162 (520) Q Consensus 83 ~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid 162 (520) + =+.-||+.+++=..+ + +.+ .++ ++...+...-+|+....+.++.=++.||-|. .|.- T Consensus 53 n--w~~~iVds~a~rl~~-~-Gf~-----~~d------------~~l~~i~~~N~ld~~~~~~~~~aliyG~sf~-~v~~ 110 (409) T protein:vir:94 53 G--WCAKGVDSLADRLVF-R-EFE-----NDD------------FTVNEIFEENNPDIFFDSAVLSSLIASCSFT-YISK 110 (409) T ss_pred c--hhHHHHHHhHhhccc-C-ccc-----CCc------------hHHHHHHHhcChhHHHHHHHHHHHHhcceeE-EEec Confidence 3 344577766553322 1 111 111 1345566667788888999999999999554 4443 Q ss_pred cCCCCCCeeeeEecCccceeeeeeccCCC-Ccccccc-----cceecceeecCccccc--ccccceecCCcceecCcccE Q lcl|NC_018087. 163 PNRPKDGIIELRRLDPRNVQFVRELDTKM-ENGVKVV-----KGYREYFLYDTELESY--QCGHQHFAAGTKIKIPYSAM 234 (520) Q Consensus 163 ~~~~k~GI~elr~lDPr~i~~vr~i~~~~-~~~~~~~-----~~~~ey~~y~~~~~~~--~~~~~~~~~~~~~~I~~~aI 234 (520) . .+|-..++.++|+.+--+.+-.+.. .-+.++. .+.....+|.+....+ ..++.....-++.-. ...| T Consensus 111 ~---~dg~~~i~~~sp~~~~~i~D~~~~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~g~-vPvV 186 (409) T protein:vir:94 111 G---ENDAVRLQVIEAVNATGIIDPITGLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGH-PLLV 186 (409) T ss_pred C---CCCceEEEEeccceEEEEEecCCCceeeeEEEEEecCCCceEEEEEEecCcEEEEEecCceeEeeeCCCCC-cceE Confidence 2 2466688999998887765422111 1111111 0111122333322111 111111111111111 1245 Q ss_pred EEeecccc-cCCCCcchhhhHHHH-HHHHH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhccee Q lcl|NC_018087. 235 VYAHSGLV-DCCGKNIIGYLHRAV-KPANQ-LKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRI 311 (520) Q Consensus 235 ~y~hSGL~-d~~~~~~~syL~~ai-k~~Nq-L~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knkl 311 (520) .|++-.-. ++.| .|-+-..+ ...+- -+.+.++++.=...=.|.|-++=+|...-|..+-..++. T Consensus 187 ~f~n~~~~~~~~G---~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~---------- 253 (409) T protein:vir:94 187 PIIHRPDAVRPFG---RSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEPMETWKATVS---------- 253 (409) T ss_pred EeccccccccccC---ccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCcccchhhhhHH---------- Confidence 66653322 2222 33332212 11121 256788899999999999999866543222221111111 Q ss_pred EeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCcccccccc Q lcl|NC_018087. 312 SYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMS 391 (520) Q Consensus 312 vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~ 391 (520) .=..+|.-+.|-+.+|..+|+++=-+=++=++=.-..+....++|.+-|...+... ..+ T Consensus 254 -------------------~i~~~~~d~dg~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~t~lP~~~lg~~~~Np--sSa 312 (409) T protein:vir:94 254 -------------------SMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSDNP--SSV 312 (409) T ss_pred -------------------HhhcCCCCCCCCCceEEecCCCChhHHHHHHHHHHHHHhhhcCCCHHHhccccCch--hHH Confidence 11335666667778899998855333345555666677777899987776543211 233 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC--hhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 392 TAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVIT--EDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLS 469 (520) Q Consensus 392 ~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t--~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~ 469 (520) ..|.-.|....+-+.|-|+.|..-+..++|.-+.+.+-.. +++|. .+.+.|. ++.-++ +--+.++.+.+. T Consensus 313 ~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~~~~~~~~----~~~v~W~-p~~~~~---~~~~a~~aDa~~ 384 (409) T protein:vir:94 313 EAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAPYLREQFR----KTKPKWE-PLFEAD---ASMLSLIGDGAI 384 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccccc----cceEEec-cCCCcc---hHHHHHHHHHHH Confidence 4566667778889999999999999999998777665443 34443 4667776 433333 333466666666 Q ss_pred HhhcccchhhhHHHHHHHHhCCCHHH Q lcl|NC_018087. 470 LMEPYIGKYISNHTAMKDFLQMSDED 495 (520) Q Consensus 470 ~~~p~vgky~S~~~i~k~IL~~tDee 495 (520) .+..-+=.+.+.+.++.. |++|+.| T Consensus 385 Kl~~ag~~~~~~~~~~~~-lG~~~~d 409 (409) T protein:vir:94 385 KLNQAIPEFINKDTIRDL-TGIEGGE 409 (409) T ss_pred HHHHhcccccchhHHHHH-cCCCCCC Confidence 665543345677777755 9999999 No 183 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=78.99 E-value=0.11 Score=25.86 Aligned_cols=407 Identities=11% Similarity=0.034 Sum_probs=184.4 Q ss_pred cccccchhhhcchhhhhhhHH-HhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHH Q lcl|NC_018087. 3 MLADSDLKMFAFWHKVDDTEY-DKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSL 81 (520) Q Consensus 3 ~~~~~~l~~f~~~~~~~~~~~-~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~m 81 (520) |.+ .+++.+.+.-+... ++.+. .-++.. ..+.. +. ++|. .....++-++.....++.|++| T Consensus 1 m~~----~i~~~~g~p~~~~~~~~~~~----~~ia~~-~~~~~-~~------~~~~--~~~~~~~iLr~~~~~~~~y~~m 62 (491) T protein:vir:10 1 MSK----GLWVSPTEFVTFGEPDKSLS----SQIATR-ARSID-FF------ALGM--YLPNPDPVLKALGKDIRVYREL 62 (491) T ss_pred CCC----ceeCCCCCccCcccCChHHH----HHHHhh-hcccc-cc------cccC--CccchHHHHHhcCCCHHHHHHH Confidence 433 22222222111000 00000 000100 01111 11 1111 1122222222221236799999 Q ss_pred hhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEee Q lcl|NC_018087. 82 LNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKII 161 (520) Q Consensus 82 a~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvi 161 (520) ..++.|.++++....-+.-.+= .|.--.. ++...+.|.+-|+ -++|+.-..++..- ..-|--.+++++ T Consensus 63 ~~D~~i~s~l~~Rk~av~~~~w-----~i~~~~~--~~~~~e~v~e~l~----~~~~~~~l~~~lda-~~~G~s~~Ei~w 130 (491) T protein:vir:10 63 RADAHVGGCVRRRKAAVKALEW-----GLDRGKA--KSRVAKSIADVFA----DLDLSRIVTEMLDA-VLYGYQPMEITW 130 (491) T ss_pred hhChHHHHHHHHHHHHHhCCCc-----EEecCCC--CHHHHHHHHHHHh----cCCHHHHHHHHHHh-hhhcceeEEEEE Confidence 9999999999999776543221 1111010 2223334444332 34565555555432 336778888888 Q ss_pred ecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcc-cEEEeecc Q lcl|NC_018087. 162 NPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYS-AMVYAHSG 240 (520) Q Consensus 162 d~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~-aI~y~hSG 240 (520) ..+...-.+.++...+|+.+.+-+ .++... .+. .+...++.+|+. -|+|+|.. T Consensus 131 ~~~~g~~~~~~l~~r~~~~f~~d~------~~~l~~----------~~~----------~~~~~g~~l~~~k~i~~~~~~ 184 (491) T protein:vir:10 131 GKVGNYIVPIDVVGKPADWFVYDP------ENQLRF----------RSK----------DHWMQGEELPARKFLVPRQEA 184 (491) T ss_pred eecCCeeEEEEeeeecccceeecc------CCceEE----------ecC----------CCCCCcceecCCCEEEEEecC Confidence 866544455677777776555411 112110 000 011223455555 47777633 Q ss_pred cccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHh-cCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCc Q lcl|NC_018087. 241 LVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRIT-RAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGK 319 (520) Q Consensus 241 L~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~-RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGe 319 (520) +..+....|-|..|..+|--.+......+.+=-. =.|-|- ...|.|.-.+.|. +.++.+ ....+.- | T Consensus 185 --~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~i-gky~~~a~~~ek~-~l~~al-~~~~~~a------~- 252 (491) T protein:vir:10 185 --TYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLV-GKHPRSASDGEKN-LLLDCL-EDMVQDA------V- 252 (491) T ss_pred --CCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEE-EecCCCCCHHHHH-HHHHHH-HHHhcCc------E- Confidence 2233445788999998888887666665555433 345544 4447776444443 233322 2222210 1 Q ss_pred cccccccchhhhhhcccccCCCCCcceeecCCCCCcChHH----HHHHHHHHHHHhcCCChhhccCCCcccccccc-chh Q lcl|NC_018087. 320 VKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMD----DILYFRKALYMALRVPLSRIPDEQTQNVFDMS-TAI 394 (520) Q Consensus 320 v~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~----DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~-~eI 394 (520) -. +| .|++|+.+.-+.+-|..+ =++|..++.-+++-= .=|.+++++ +++ +++ T Consensus 253 -----~v--------iP-----~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLG--qtlTt~~~g---s~a~~~v 309 (491) T protein:vir:10 253 -----AV--------VP-----DDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIALLG--QNQTTEATS---TRASAQA 309 (491) T ss_pred -----EE--------ec-----CCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHHhh--hhcccCccc---chhHHHH Confidence 11 22 368899886443344333 388898887776531 113344332 222 333 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_018087. 395 SRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPY 474 (520) Q Consensus 395 tRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~ 474 (520) -.+ -+...++..++..+..+..+++-=+.+.+... ...++.|... . |..+.+.+.+..+.+. T Consensus 310 h~~--v~~di~~~D~~~i~~tln~li~~l~~~N~~~~--------~~p~f~~~~~------~--e~~~~~a~~~~~L~~~ 371 (491) T protein:vir:10 310 GLE--VTDDIRDGDKAVVSEAMNMLIRWICDLNFDGA--------DRPVFDMWEQ------E--QVDEIQAGRDQKLTQA 371 (491) T ss_pred HHH--HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC--------CcceEEecCc------C--chhHHHHHHHHHHHhC Confidence 222 25566666666666666664444333443211 2344555422 1 2223344444444433 Q ss_pred cchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcC-----CccCCccccC Q lcl|NC_018087. 475 IGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDK-----IFNPPEPEEI 520 (520) Q Consensus 475 vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~-----~~~~p~~e~~ 520 (520) |==++.+|++++ +++...+.++...........+ -...|+..++ T Consensus 372 -G~~i~~~~i~e~-~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 420 (491) T protein:vir:10 372 -GARFTPAYFKRA-YNLQDGDLDERPLPVSAVDTVGAASFAEFEAPDQDAL 420 (491) T ss_pred -CCcCCHHHHHHH-hCCCCCCcCccccccCCCCCcccccccccCCCCCCch Confidence 434899999876 7776544433211110000000 0000000011 No 184 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=78.67 E-value=0.11 Score=25.79 Aligned_cols=381 Identities=12% Similarity=0.114 Sum_probs=166.7 Q ss_pred hhhhcchhhhhhhHHHhhhccCCCcccCCCC--CCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccc Q lcl|NC_018087. 9 LKMFAFWHKVDDTEYDKIINDKAESITAPKF--DDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYE 86 (520) Q Consensus 9 l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~--~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pE 86 (520) ++||..-.. + ..+. .++. +.|.... ..|+ |++.. -+++|. T Consensus 1 m~~~~~~~~--~----------~~~~-~~~~~~~~~~~~~------~~g~----~~~~~---------------Al~~~~ 42 (417) T protein:vir:38 1 MKLFRGLAT--E----------VDPH-WADHLLDSGVIPS------FRGG----YLGIS---------------ALRNSD 42 (417) T ss_pred Ccccccccc--C----------CCcc-chhhhcccccccc------cCCc----eechh---------------hcccHH Confidence 444421000 0 0000 0000 1111111 0122 22211 147889 Q ss_pred hhHHHHhhhceeeEecCCCcEEEEeeccc-hhhhHHHHHHHHHHHHHHHH-hcchhhhHHH----HHhhccccceeEEEe Q lcl|NC_018087. 87 VDNAVQEIVSDAIVYEEGFDVVSIDLDQT-AFTENIRNLISDEFNSVLNM-LNFQRKGSDH----FKRWYVDSRVFFHKI 160 (520) Q Consensus 87 vd~Ai~eIvneaiv~d~~~~~V~l~Ld~~-~~s~~ik~~I~eeF~~i~~l-l~f~k~g~~~----fRrWYvDgri~~hkv 160 (520) |-.||+.|.+.+.-. |+.+.-... ...+ . .....+|+. =|=..++.++ +....+.|--|..++ T Consensus 43 V~~cv~~ia~~iA~l-----p~~~~~~~~~~~~~---~---~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~ 111 (417) T protein:vir:38 43 VLTAVSIVSGDVSRF-----PLVITDSSTDEVID---L---ANIEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIV 111 (417) T ss_pred HHHHHHHHHHhhccC-----eeEEEEcCCcceec---c---chHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEE Confidence 999999999877643 333321111 1111 0 111122221 2333445554 334567788888877 Q ss_pred eecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecc Q lcl|NC_018087. 161 INPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSG 240 (520) Q Consensus 161 id~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSG 240 (520) -|... .-+..|.+|+|..+...+.- ++ . -+|.|... ..+....++.+.|+|.. T Consensus 112 r~~~g--~~~~~l~~l~p~~v~v~~~~-----~~-~------~~y~~~~~-----------~~~~~~~~~~~dviH~r-- 164 (417) T protein:vir:38 112 RDPIT--NEPAMFEFYAPSQTQVDTSD-----PD-N------IIYRFTPY-----------NSSMQKVCGFEDVIHWK-- 164 (417) T ss_pred EcCCC--CEEEEEEEeCCceEEEEEcC-----CC-e------EEEEEEEc-----------CCcEEEEecCcceEEec-- Confidence 54322 23788999999999875321 11 1 11222111 11122457778887774 Q ss_pred cccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCcc Q lcl|NC_018087. 241 LVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKV 320 (520) Q Consensus 241 L~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev 320 (520) ..+.++...+|-|..|.+++.....++....=+=-.-+.-+-|...+ |.|.+.++++.-+.+-..|.- ...|. T Consensus 165 ~~~~d~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~-~~l~~e~~~~~~~~~~~~~~g-----~n~g~- 237 (417) T protein:vir:38 165 FFSYDTIMGRSPLLSLGDEIGLQESGVSTLQKFFKSGLKGSIIKAKE-SRLSAEARQKIREDFERAQAG-----ADAGS- 237 (417) T ss_pred CCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC-CCCCHHHHHHHHHHHHHHhcc-----cccCC- Confidence 24667767789999999998887777776543222224445566555 556666555443333222221 11232 Q ss_pred ccccccchhhhhhcccccCCCCCcceeecCCCCCcCh---HHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHH Q lcl|NC_018087. 321 KNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE---MDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRD 397 (520) Q Consensus 321 ~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge---i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRD 397 (520) .+ . |+ + |.+.+.|. -+..+ ++--+|-...+.++++||.+.|...+.. +.+.-. T Consensus 238 -----~~-v-----l~--~---g~~~~~l~--~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~------s~~e~~ 293 (417) T protein:vir:38 238 -----PI-I-----VD--A---TMDYQPLE--VDTNVLNLINSNNYSTAQIAKALRVPAYRLAQNSPN------QSVKQL 293 (417) T ss_pred -----ce-e-----cc--C---CceEEEcc--CCHHHHHHHHHHHhhHHHHHHHhCCCHHHhCCCCcc------hhHHHH Confidence 22 1 21 1 44555542 22222 2233344677889999999999632211 112212 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccch Q lcl|NC_018087. 398 ELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGK 477 (520) Q Consensus 398 ElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgk 477 (520) -+.|.+. .|+-.+..| .+-|- ..++++.++.. +.+.|..++ ..++. +.++...++ .- T Consensus 294 ~~~~~~~--tl~P~~~~i-e~~l~-----~~Ll~~~~~~~----~~~~fd~~~-l~~~~-------~~~~~~~~~---~G 350 (417) T protein:vir:38 294 ADDYIRN--DLPFYFEPI-TSEFE-----LKLLDDAQRHQ----YCIGFDTKS-VNGLP-------IADVNTAVN---GG 350 (417) T ss_pred HHHHHHH--HHHHHHHHH-HHHHH-----hhhcChhhccc----ceEEechhh-hhHHH-------HHHHHHHHh---CC Confidence 2223332 344333332 22222 23355555532 334443222 11111 111112222 23 Q ss_pred hhhHHHHHHHHhCCCHHHHHHHHH--------------HHH-HhhhcCCccCCccccC Q lcl|NC_018087. 478 YISNHTAMKDFLQMSDEDIAAERK--------------LID-EELSDKIFNPPEPEEI 520 (520) Q Consensus 478 y~S~~~i~k~IL~~tDeeI~~~~k--------------qi~-~E~~~~~~~~p~~e~~ 520 (520) +++.+-+++ ++++.+-+=-..++ +.+ .+..+..-.++++|.. T Consensus 351 ~~T~NE~R~-~~gl~pi~~g~~d~~~~~~n~~~~d~~~~~~~~~~~~~kgg~~~~~~~ 407 (417) T protein:vir:38 351 LWTGNEGRA-ELGKKPLKDPNMDRIQSTLNTVFLDQKEAYQAEHAAELKGGDTNAKGN 407 (417) T ss_pred CcCHHHHHH-HhCCCCCCCCCCCeeeecccccccccccccccccccccCCCCCCCCCC Confidence 567777764 46775421100000 000 0111111123333333 No 185 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=76.61 E-value=0.13 Score=25.38 Aligned_cols=356 Identities=11% Similarity=0.103 Sum_probs=161.0 Q ss_pred hcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchhHHH Q lcl|NC_018087. 12 FAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNAV 91 (520) Q Consensus 12 f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai 91 (520) +|||.+..... .+. .| .++..... +.+ +. + .| +++-|..|| T Consensus 1 Mg~f~~~~~~~---------~~~-~~--~~~~~~~~-------------~~~-~~-~--------~~----~~~~v~~~i 41 (378) T protein:vir:16 1 MNLFGKVVSFS---------RGK-LN--NDTQRVTA-------------WQN-EA-V--------EY----TSAFVTNIH 41 (378) T ss_pred Cccchhhhhhh---------ccc-cc--CCcceeee-------------ccc-ch-h--------hH----HHHHHHHHH Confidence 77777644221 000 01 11111110 100 00 0 12 334488999 Q ss_pred HhhhceeeEecCCCcEEEEeeccc-hhhhHHHHHHHHHHHHHHHH-hcchhhhHHHHH----hhccccceeEEEeeecCC Q lcl|NC_018087. 92 QEIVSDAIVYEEGFDVVSIDLDQT-AFTENIRNLISDEFNSVLNM-LNFQRKGSDHFK----RWYVDSRVFFHKIINPNR 165 (520) Q Consensus 92 ~eIvneaiv~d~~~~~V~l~Ld~~-~~s~~ik~~I~eeF~~i~~l-l~f~k~g~~~fR----rWYvDgri~~hkvid~~~ 165 (520) +-|.+.+..++-. -+..... ...+.....+.....++|+. =|=.-+++++.+ .+...|.-|..++.|... T Consensus 42 ~~Ia~~iA~l~~~----~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~ 117 (378) T protein:vir:16 42 NKIANEITKVEFN----HVKYKKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNT 117 (378) T ss_pred HHHHhhhhhCcee----EEEEcccccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCC Confidence 9999987754421 0111111 11122222233333444432 222335555444 577789999888876321 Q ss_pred CCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccccCC Q lcl|NC_018087. 166 PKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDCC 245 (520) Q Consensus 166 ~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~~ 245 (520) .++.++-|. +..+.++.+.|+|..+- .+ T Consensus 118 -----g~~~~l~~~--------------------------------------------~~~~~~~~~diih~r~~---~~ 145 (378) T protein:vir:16 118 -----GELLDLLFA--------------------------------------------DDKKEYKPEELVRLTSP---FY 145 (378) T ss_pred -----ceEEEEEec--------------------------------------------CCeeEecccceEEecCc---cC Confidence 223332220 11245677888888642 33 Q ss_pred CCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccc Q lcl|NC_018087. 246 GKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQAN 325 (520) Q Consensus 246 ~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~ 325 (520) +..+.|.|+.|.+.++. ++ .-+--|-+..++ +.|.+..+.+....+...|++..- |. +..+ T Consensus 146 ~~~~~s~l~~~~~~i~~------~~-----~~~~~~g~l~~~-~~l~~~~~~~~~~~~~~~~~~~~~-----~~--~~g~ 206 (378) T protein:vir:16 146 INEDTSILDNALASIQT------KL-----EQGKLRGLLKIN-AFLDIDNTQEYREKALTTIKNMQE-----GS--SYNG 206 (378) T ss_pred ccchhHHHHHHHHHHHH------HH-----hcCccceeeEeC-CcCCHHHHHHHHHHHHHHHHHhhc-----cc--cccc Confidence 44567888888766532 11 122223344443 455555555555555555554222 21 1112 Q ss_pred cchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHHH Q lcl|NC_018087. 326 MMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKFI 405 (520) Q Consensus 326 ~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KFI 405 (520) .+. + ..|.+++.|.-.....++...+|-++.+.++++||.+.|. |..+|-. -+-|..+ T Consensus 207 ~~v-l----------~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~--------g~~~e~~--~~~f~~~- 264 (378) T protein:vir:16 207 LTP-V----------DNKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILL--------GTASQEQ--QIYFYNS- 264 (378) T ss_pred ceE-c----------CCCceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHhc--------CCchHHH--HHHHHHH- Confidence 221 1 1255666666555666788899999999999999998873 2222211 1123332 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhh---hceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHH Q lcl|NC_018087. 406 SELQHKFEEIFLSPLKSNLLLKRVITEDEWEAEL---NNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNH 482 (520) Q Consensus 406 ~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~---~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~ 482 (520) .|+-.+.. +.+- +=..+++++++.... ....++|.-+ .+.... +..|++.+..+-.- -+++.+ T Consensus 265 -tl~P~~~~-ie~~-----l~~kLl~~~e~~~~~~~~~~~~~~f~~~----~l~~~d-~~~~~~~~~~~~~~--G~~T~N 330 (378) T protein:vir:16 265 -TIIPLLIQ-LEKE-----LTYKLISTNRRRVVKGNLYYERIIVDNQ----LFKFAT-LKELIDLYHENING--PIFTQN 330 (378) T ss_pred -HHHHHHHH-HHHH-----HHhhcCChhhhhhhhhcccccceeeccc----hhhhcC-HHHHHHHHHHHHhC--CCcCHH Confidence 23332222 2222 223456777766432 1223444322 222222 23566665555332 266777 Q ss_pred HHHHHHhCCCHHH-------------HHHHHHHHHHhhhcCCccCCcccc Q lcl|NC_018087. 483 TAMKDFLQMSDED-------------IAAERKLIDEELSDKIFNPPEPEE 519 (520) Q Consensus 483 ~i~k~IL~~tDee-------------I~~~~kqi~~E~~~~~~~~p~~e~ 519 (520) -++. ++++.+-+ +.... +.+...+++.-.+-+..| T Consensus 331 E~R~-~~g~~p~~ggD~~~~~~n~~~~~~~~-~~~~~~~~~~~~~e~~ne 378 (378) T protein:vir:16 331 QLLV-KMGEQPIEGGDVYIANLNAVAVKNLS-DLQGSRKDVTSTDETNNQ 378 (378) T ss_pred HHHH-HhCCCCCCCCCeEeeccccccccchh-hhcCccCCCCCCCCCCCC Confidence 7764 46654321 01111 111111111111111122 No 186 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=74.33 E-value=0.16 Score=24.95 Aligned_cols=383 Identities=11% Similarity=0.108 Sum_probs=161.9 Q ss_pred hcchhhhhhhH----H-------Hh-hhccCCCcccCCCCCCCceeecc-cccccccccccccccccccchhHHHHHHHH Q lcl|NC_018087. 12 FAFWHKVDDTE----Y-------DK-IINDKAESITAPKFDDGATEVDS-QDIAYNGVFQKLYGSQDPTATSTRELINTY 78 (520) Q Consensus 12 f~~~~~~~~~~----~-------~~-~~~~~~~s~~~p~~~dg~~~i~~-~~~a~~g~~~~~~~~~~~~~~~~~~LI~~Y 78 (520) +|||..--.-- + +- .-.+...++..|.....+..... ..... |++...+..-.+...+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~~~~~~~~~~t~------- 72 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAW-SGYPESWATPSWGSAQD------- 72 (409) T ss_pred CchhhhhcccccCCCcccccccccccCCCCceeeccCCCcchhhhhccccccccc-ccccccccccCccccch------- Confidence 67776543310 0 00 00001111222221111111000 01111 22222221111211111 Q ss_pred HHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHH-hcchhhhHHH----HHhhcccc Q lcl|NC_018087. 79 RSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNM-LNFQRKGSDH----FKRWYVDS 153 (520) Q Consensus 79 R~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~l-l~f~k~g~~~----fRrWYvDg 153 (520) +.++++|-|..||+-|.+.+.-. |+.+- ++.... ++...+++. =|-..++.++ +..+.+ | T Consensus 73 ~~~~~~~~v~acV~~Ia~~iA~l-----pl~~~-~~~~~~--------~~~~~ll~~~PN~~~t~~~f~~~l~~~lll-G 137 (409) T protein:vir:83 73 KLRTLIDVAWACIDLNASVLSSM-----PIYRM-RNGRII--------DSVAWMSNPDPEVYTSWQEFAKQLFWDFQL-G 137 (409) T ss_pred hhHhhhHHHHHHHHHHHHhhccC-----ceEEe-eCCccc--------cchhhhcccCCCCCCCHHHHHHHHHHHHhh-C Confidence 45677899999999999876543 22221 111111 122222211 0111334443 344555 7 Q ss_pred ceeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCccc Q lcl|NC_018087. 154 RVFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSA 233 (520) Q Consensus 154 ri~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~a 233 (520) --|+.++....+. -+++|..|+|..+.+.+. .++. ..|.+ +.. . .++. T Consensus 138 nay~~~i~r~~~G--~~~~L~pl~p~~v~v~~~-----~~g~------~~y~~-~~~---------~---------~~~e 185 (409) T protein:vir:83 138 EAFVLPMAHGSDG--YPIRFRVVPPWLVNVELK-----KGAR------REYRI-GGL---------N---------VTDE 185 (409) T ss_pred CcEEEEEEECCCC--cEEEEEEECCcceEEEEc-----CCce------EEEEE-ccc---------c---------Cccc Confidence 7787777543332 289999999999887432 1221 11211 110 0 1244 Q ss_pred EEEeecccccC-CCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhc--CccceEEEccCCCCchHHHHHHHHHHHHhhcce Q lcl|NC_018087. 234 MVYAHSGLVDC-CGKNIIGYLHRAVKPANQLKLLEDAMMIYRITR--APDRRVFYIDTGNMPARKAAQHMQHIMNSHRNR 310 (520) Q Consensus 234 I~y~hSGL~d~-~~~~~~syL~~aik~~NqL~m~EDalVIyRi~R--ApeRRvFyIDvGnlpk~KAeqyl~~im~~~knk 310 (520) |+|+- .+.+ ++...+|-|+.|...+..-...++... +.+. |--.-|...| +.|.+.++++..+.....|. T Consensus 186 iiHir--~~~~~~~~~G~spi~~~~~~i~~~~a~~~~~~--~~f~nga~p~gil~~~-~~ls~e~~~~~~~~~~~~~~-- 258 (409) T protein:vir:83 186 ILHIR--YQGNTADAHGHGPLESAAPRQVVIGLLQKYVQ--NLAETGGVPLYWLGVE-RRLSETEAVDLMDRWIESRS-- 258 (409) T ss_pred eEEeC--CCCCCCCcccccHHHHHHHHHHHHHHHHHHHH--HHHhcCCCcceEeecC-CCCCHHHHHHHHHHHHHhhC-- Confidence 65542 2333 334567888888888887777777543 3333 2233444444 45666666665555544332 Q ss_pred eEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccc Q lcl|NC_018087. 311 ISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDM 390 (520) Q Consensus 311 lvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~ 390 (520) ...|. .+ +++ +|+.-++..++.. ..+.-++--+|-.+..-++.+||..-|.-.+..+-.+. T Consensus 259 ----~nag~------~~-il~-------~g~~~~~~~~~s~-~d~q~le~r~~~~~eIa~~fgVPp~llg~~~~~~~~ty 319 (409) T protein:vir:83 259 ----KYAGH------PA-LVT-------GGATLNQAKSMSA-QDLSLMELTQFNEARIAILLGVPPFLVGLPGATGSLTY 319 (409) T ss_pred ----CccCc------cc-eec-------CCcccccccCCCH-HHHHHHHHHHhhHHHHHHHhCCCHHHccCCCCcccccc Confidence 12232 11 222 1222122222211 01111222234456688899999988753221110011 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 391 STAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSL 470 (520) Q Consensus 391 ~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~ 470 (520) + .+.-.-+.|.++ .|+--+.. +.+.|-..|+ ++. +.|+ |.-+ ++...++ ..|.+.++. T Consensus 320 s-n~eq~~~~f~~~--tL~P~~~~-ie~~l~~~Ll-----~~~------~~~~--f~~~----~llr~d~-~~r~~~~~~ 377 (409) T protein:vir:83 320 S-NIEQLFSFHDRS--SLRPKATA-VMAALDRWAL-----PSP------QHLE--LNRD----DYTRPSL-VERATAYKI 377 (409) T ss_pred c-cHHHHHHHHHHH--HHHHHHHH-HHHHHHHhhC-----CCC------cEEE--eehh----hhhccCH-HHHHHHHHH Confidence 1 132223334432 33332222 2333333443 322 1344 4322 3444443 467777766 Q ss_pred hhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 471 MEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 471 ~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) +-.- -++|.+-+++. +.|. |.+..++. T Consensus 378 ~~~~--G~lT~NE~R~~-~glp--------------------p~~ggd~l 404 (409) T protein:vir:83 378 MIEA--GVMEPNEARAM-ERLH--------------------SEAAAVRL 404 (409) T ss_pred HHhC--CCcCHHHHHHH-hCCC--------------------CCCCCccc Confidence 5432 24555555432 3332 22222333 No 187 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=70.37 E-value=0.21 Score=24.30 Aligned_cols=398 Identities=10% Similarity=0.010 Sum_probs=210.0 Q ss_pred cccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHh Q lcl|NC_018087. 3 MLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLL 82 (520) Q Consensus 3 ~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma 82 (520) |=-+-+-.|..-|....... .... -.|.|-..-.+.++. -..++-..||... T Consensus 1 ~~~~~i~~L~~~~~~~~~r~-~~~~-----------------------~yY~g~~~~~~~~~~----~p~~~~~~~~~v~ 52 (409) T protein:vir:16 1 MTEKGIGYLRFKLSVHKRRA-EMRY-----------------------EQYAMKHVDRFKGIT----IPQALSQQYRSIL 52 (409) T ss_pred CCHHHHHHHHHHHHHHhHHH-HHHH-----------------------HHHhccCchhhcchh----hhHHHHHHHhhhc Confidence 44445555655555433221 1100 011111111122221 1224445566544 Q ss_pred hccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeee Q lcl|NC_018087. 83 NNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIIN 162 (520) Q Consensus 83 ~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid 162 (520) +=+.-||+.+++=..+ + +.+ .++ ++...|...-+|+....+.++.=++.||-|. .|.. T Consensus 53 --nw~~~iVds~a~rl~~-~-Gf~-----~~d------------~~l~~i~~~N~ld~~~~~~~~~al~yG~sf~-~v~~ 110 (409) T protein:vir:16 53 --GWCAKGVDSLADRLVF-R-EFE-----NDD------------FTVNEIFEENNPDIFFDSTVLSALIASCSFT-YISK 110 (409) T ss_pred --ChhHHHHHHhHhhccc-c-ccc-----Ccc------------hHHHHHHHhcChhHHHHHHHHHHHHhCceeE-EEec Confidence 3444577766553332 1 111 111 1345566667788899999999999999655 4443 Q ss_pred cCCCCCCeeeeEecCccceeeeeeccCCC-Ccccccc----cc-eecceeecCcccccc--cccceecCCcceecCcccE Q lcl|NC_018087. 163 PNRPKDGIIELRRLDPRNVQFVRELDTKM-ENGVKVV----KG-YREYFLYDTELESYQ--CGHQHFAAGTKIKIPYSAM 234 (520) Q Consensus 163 ~~~~k~GI~elr~lDPr~i~~vr~i~~~~-~~~~~~~----~~-~~ey~~y~~~~~~~~--~~~~~~~~~~~~~I~~~aI 234 (520) . .+|-..++.++|+.+--+.+=.+.. .-+..+. .+ ...+.+|.+....++ .++.....-.+.-. ...| T Consensus 111 ~---~dg~~~i~~~sP~~~~~i~D~~~~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~-vPvV 186 (409) T protein:vir:16 111 G---ENDAVRLQVIEATNATGIIDPITGLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGN-PLLV 186 (409) T ss_pred C---CCCceEEEEEcccceEEEeecccccceeeeEEEEecCCCceEEEEEEecCcEEEEEecCccccceecCCCC-cceE Confidence 2 2466789999998887765421111 1111111 11 112334433321111 11111111111110 1256 Q ss_pred EEeecccccCCCCcchhhhHH-HHHHHHH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeE Q lcl|NC_018087. 235 VYAHSGLVDCCGKNIIGYLHR-AVKPANQ-LKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRIS 312 (520) Q Consensus 235 ~y~hSGL~d~~~~~~~syL~~-aik~~Nq-L~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklv 312 (520) .|++.--. ......|-+-. .+...+- -+.+.++++.=...=.|.|-++=+|...-|..+=..+ T Consensus 187 ~f~n~~~~--~~~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~------------- 251 (409) T protein:vir:16 187 PIIHRPDA--VRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEPMETWKAT------------- 251 (409) T ss_pred Eecccccc--cccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCCccchhhhh------------- Confidence 66653221 12222333322 2222122 2567888899889999999888765432232111000 Q ss_pred eecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccc Q lcl|NC_018087. 313 YDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMST 392 (520) Q Consensus 313 Yd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~ 392 (520) +-.=..+|.-+.|-+.+|..+++++=-+=++=++=.-..+....++|.+-|...+. |- ..+. T Consensus 252 ----------------~~~i~~~~~d~~g~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~-Np-sSa~ 313 (409) T protein:vir:16 252 ----------------VSSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSD-NP-SSVE 313 (409) T ss_pred ----------------hhHhhccCCCCCCCCceEEecCCCChhHHHHHHHHHHHHHhhhcCCCHHHcccccC-ch-hHHH Confidence 11112356666777788999988643334566666677788888999977765432 11 2334 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018087. 393 AISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLME 472 (520) Q Consensus 393 eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~ 472 (520) .|.-.|....+-+.+-|+.|..-+..+++.-+.+.+-... +......+.+.|. ++.. .++.-+.+..+.+..+. T Consensus 314 Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~--~~~~~~~~~v~W~-~~~~---~~~~s~a~~aDa~~Kl~ 387 (409) T protein:vir:16 314 AIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVPY--LREQFSKTKPKWE-PLFE---ADASMLSLIGDGAIKLN 387 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--cchhhccceEEec-CCCC---cchhhHHHHHHHHHHHH Confidence 5777788899999999999999999999998887665321 1222234667775 2211 12333466667776665 Q ss_pred cccchhh-hHHHHHHHHhCCCHHH Q lcl|NC_018087. 473 PYIGKYI-SNHTAMKDFLQMSDED 495 (520) Q Consensus 473 p~vgky~-S~~~i~k~IL~~tDee 495 (520) .- |+.+ ..+++. +-|++|+.| T Consensus 388 ~a-~~~~~~~~v~~-~~~g~~~~d 409 (409) T protein:vir:16 388 QA-IPEFINKDTIR-DLTGIKGAE 409 (409) T ss_pred hh-cccccchhHHH-HhccCCCCC Confidence 53 4444 456665 558999999 No 188 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=54.58 E-value=0.5 Score=22.23 Aligned_cols=421 Identities=11% Similarity=0.062 Sum_probs=179.9 Q ss_pred hccCCCcccCCCCCCCceeeccccccccccccccccccc--ccchhHHHHHHHHHHHh-hccchhHHHHhhhceeeEecC Q lcl|NC_018087. 27 INDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQD--PTATSTRELINTYRSLL-NNYEVDNAVQEIVSDAIVYEE 103 (520) Q Consensus 27 ~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~--~~~~~~~~LI~~YR~ma-~~pEvd~Ai~eIvneaiv~d~ 103 (520) +-.. .+||.-..-.-++.++..+..|. .+-..+ ..++ ....++.|++|. .++.|.++++.+..-+.-.+= T Consensus 1 ~~~~---~~~~~p~~~~g~~~~~~~~~~~~---~~~~~e~~~~lr-~~~~~~ly~~m~e~D~~i~s~l~~rk~av~~~~w 73 (469) T protein:vir:10 1 MTER---VKTAAPVSEAGYVFGSGVVDGWT---VWDPFEQTPELQ-WPQSVAVYSRMDNEDSRVTSLLEAISLPIRSTPW 73 (469) T ss_pred CCCc---ccCCCCccchhhhhhcccccchh---hccccccccccc-cccchHHHHHHHhhChHHHHHHHHHHHHHhcCCc Confidence 1111 12221111112222211111111 111111 1121 135688999995 699999999999866543321 Q ss_pred CCcEEEEeeccchhhhHHHHHHHHHHHHHHHH-------------hcchhhhHHHHHhhccccceeEEEeeecCC-CCCC Q lcl|NC_018087. 104 GFDVVSIDLDQTAFTENIRNLISDEFNSVLNM-------------LNFQRKGSDHFKRWYVDSRVFFHKIINPNR-PKDG 169 (520) Q Consensus 104 ~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~l-------------l~f~k~g~~~fRrWYvDgri~~hkvid~~~-~k~G 169 (520) . |+-.. -++.+.+.+.+.....+.. .+|.....+++-..+.-|--++++|+.... ..+| T Consensus 74 ~-----v~p~~--~~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~dG 146 (469) T protein:vir:10 74 R-----IRANG--ASDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPDG 146 (469) T ss_pred e-----EecCC--CCHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCCC Confidence 1 11111 1334444444433222210 123333334444456678888899987543 1234 Q ss_pred ee---eeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCccc-EEEeecccccCC Q lcl|NC_018087. 170 II---ELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSA-MVYAHSGLVDCC 245 (520) Q Consensus 170 I~---elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~a-I~y~hSGL~d~~ 245 (520) -. .|...+|+.++. +.-...++....... .+ .....+..+....+++.||+.- |+|.|.. ... T Consensus 147 ~~~~~~l~~rp~~~i~~---~~~~~~~~l~~~~~~------~~--~~~~~~~~~~~~~~~~~lp~~k~i~~~~~~--~~g 213 (469) T protein:vir:10 147 RFWLRKLAPRPQWTISK---FNVAPDGGLESIEQI------AP--PARTRGSLYVANIAPPEIPVNRLVVYTRNK--RPG 213 (469) T ss_pred ceeeeeeeecCccccee---eeeccCCceeeeeec------Cc--ccccccccccCCCCccccccCcEEEEEecC--CCC Confidence 34 444444444432 222222222211111 00 0001111122223456677665 7788742 223 Q ss_pred CCcchhhhHHHHHHHHHHHHHHHHHHHHHHh-cCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCcccccc Q lcl|NC_018087. 246 GKNIIGYLHRAVKPANQLKLLEDAMMIYRIT-RAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQA 324 (520) Q Consensus 246 ~~~~~syL~~aik~~NqL~m~EDalVIyRi~-RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~ 324 (520) +....|-|.+|..+|-=.+......+.+=-. =.|-| |...+.|.-...| +-|.+++..+++- ...|-| T Consensus 214 ~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~-vgky~~~a~~~ek--~~l~~a~~~~~~g----~~a~~i---- 282 (469) T protein:vir:10 214 QWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIP-VGTASSATDEDEV--RKMAALARSVRGG----INAGVG---- 282 (469) T ss_pred CcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcce-EEecCCCCCHHHH--HHHHHHHHHHhcC----CceEEE---- Confidence 3344788888888877666655555544332 24554 5566666544443 3344444444331 011111 Q ss_pred ccchhhhhhcccccCCCCCcceeecCCCCCcC-hHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHH Q lcl|NC_018087. 325 NMMALTEDYWLQRRDGKAVTEVETLPGMTGMN-EMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDK 403 (520) Q Consensus 325 ~~msmlEDywLpRReGgrgTEIsTLpGg~nLg-ei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~K 403 (520) +| .|++|..+..+.+.. -..=++|..++.-+++--.. |..+++++ +++..=+-.|+ |.. T Consensus 283 ----------ip-----~~~~ie~~ea~g~~~~~~~li~~~d~~Isk~iLG~t--lTs~~~gG--S~a~~~vh~ev-~~d 342 (469) T protein:vir:10 283 ----------LA-----QGQILELLGVSGNLPDIRRAIEGHDRSIALSGLAHF--LNLDGKGG--SYALASVLEDP-FTQ 342 (469) T ss_pred ----------cc-----CCceEEEeecCCCchHHHHHHHHHHHHHHHHHhccc--ccccCccc--hhhHHHHHHHH-HHH Confidence 11 367888776543322 12226777777766654333 33332111 12211111221 444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccch-hhhHH Q lcl|NC_018087. 404 FISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGK-YISNH 482 (520) Q Consensus 404 FI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgk-y~S~~ 482 (520) .++...+.++..+..-|-..|+.=+. ......-++.|..- | .+.+...+++..+..+ ++... -++.+ T Consensus 343 ~~~sDa~~i~~tln~~li~~l~~lN~------g~~~~~P~~~~~~~----e-~~~~~~a~~i~~l~~~-G~~~~~~~~~~ 410 (469) T protein:vir:10 343 AVHAYATSICRIANQHIIEDLVDINF------GVDTPAPVLTFDPI----G-SRQDLTAAAVKLLYDA-GVFDDDPAVKR 410 (469) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcC------CCCCCccEEEecCC----C-CcHHHHHHHHHHHHhc-CCccCccccHH Confidence 55555556666665444444443331 11122344555321 1 2233445555555443 11111 25788 Q ss_pred HHHHHHhCCCHHHHHHHHHHHHHhhhcCCcc---CCccc------------------cC Q lcl|NC_018087. 483 TAMKDFLQMSDEDIAAERKLIDEELSDKIFN---PPEPE------------------EI 520 (520) Q Consensus 483 ~i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~---~p~~e------------------~~ 520 (520) |+++. +++...+-.+...+ ...++..-. .|+.. ++ T Consensus 411 ~~~e~-~gip~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 466 (469) T protein:vir:10 411 AIRQR-FNLPSELNDTPSAE--PEEPAAVPNQSAAPARTRSSGNADARARAPKADQGVL 466 (469) T ss_pred HHHHH-hCCCCCCCCccccc--chhcccCCCCCccccccCCCCCcccccccCCChHHhh Confidence 99866 77753333222111 111111000 01100 00 No 189 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=50.33 E-value=0.61 Score=21.75 Aligned_cols=338 Identities=12% Similarity=0.080 Sum_probs=145.0 Q ss_pred hhhhhhHHHhhhccCCCcccCCCCC-CCceeecccc-cccccccccccccccccchhHHHHHHHHHHHhhccchhHHHHh Q lcl|NC_018087. 16 HKVDDTEYDKIINDKAESITAPKFD-DGATEVDSQD-IAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNAVQE 93 (520) Q Consensus 16 ~~~~~~~~~~~~~~~~~s~~~p~~~-dg~~~i~~~~-~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai~e 93 (520) .+ ++.....-.+...+..+=... |+-+.....+ +.+-+.+.+-..+.-.-.-+..-|-+-+|.-..| .+||.- T Consensus 1 ~~--~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~epp~~~~~La~l~~~n~~h---~~~i~~ 75 (348) T protein:vir:26 1 MT--EQLIHSHTTDGTESKSVYSFDPNPEPVDTNSWMTRYCELFYNDFDDYWEPPISLKGLAEIANANGYH---GSLLKA 75 (348) T ss_pred CC--ccccchhhccccCCceEEEecCCCeeecCcchHHHHHHHHhcCCCccccCCCCHHHHHHHHhhhhhh---hhhHhh Confidence 11 000000000111111111111 2222222111 1111222111111111112334455666654444 344443 Q ss_pred hhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeecCCCCCCeeee Q lcl|NC_018087. 94 IVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPNRPKDGIIEL 173 (520) Q Consensus 94 Ivneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~~~k~GI~el 173 (520) -.|-....= .| +-..| +.+|. +++..|.+-|.-|+.++-+ ....+++| T Consensus 76 k~N~l~~~~---~P------n~~~t-------~~~f~-------------~~~~d~ll~Gnay~~~~rn---~~G~~~~L 123 (348) T protein:vir:26 76 RANYVAGRF---MN------GGGLP-------MYKMN-------------SACWDYFGLGMSAFVKIRS---YLKNVIAL 123 (348) T ss_pred hhhHHhhcc---cC------CCCCC-------HHHHH-------------HHHHHHHhcCCeEEEEEEc---CCCcEEEE Confidence 333211100 00 00011 11222 2233455569999999865 23359999 Q ss_pred EecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccccCC-CCcchhh Q lcl|NC_018087. 174 RRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDCC-GKNIIGY 252 (520) Q Consensus 174 r~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~~-~~~~~sy 252 (520) ..|+|..++..+ ++. ||.+... +..+.++++.|+|.. ..|++ +-..+|- T Consensus 124 ~~l~~~~v~~~~-------d~~--------~~~~~~~-------------g~~~~f~~~dIiHir--~~~~~~~~~Gls~ 173 (348) T protein:vir:26 124 EPLPMVHMRKRK-------NGD--------FVQLLRN-------------NEQKVFKAKDVIFIP--QYDPQQQIYGLPD 173 (348) T ss_pred EEecCceeEeee-------cCc--------EEEEEec-------------CeEEEEcCccEEEEc--CCCCCCCcccccH Confidence 999997766521 121 3322211 234577889998775 35654 4445777 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHH--hc--CccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccch Q lcl|NC_018087. 253 LHRAVKPANQLKLLEDAMMIYRI--TR--APDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMA 328 (520) Q Consensus 253 L~~aik~~NqL~m~EDalVIyRi--~R--ApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~ms 328 (520) +..|++.+.. -.+.-.||. .+ |--.-|+|+.-+++-+..+ +-+++-+...+ |. .|.++.+ T Consensus 174 ~~~a~~si~l----~~~a~~~~~~~f~NGa~pg~Il~~~~~~ls~e~~-~~lk~~~~~~~---------G~-~n~~~~~- 237 (348) T protein:vir:26 174 YLGSIQSSLL----NRDATLFRRRYYLNGAHMGFIFYATDPNLSEADE-KALKEKIASSK---------GI-GNFRSMF- 237 (348) T ss_pred HHHHHHHHHH----HHHHHHHHHHHHhccCCCceEEEecCCCCCHHHH-HHHHHHHHHhc---------Cc-cccccee- Confidence 7778776643 333334432 22 3345677775556654443 33444444322 11 1111111 Q ss_pred hhhhhcccccCCCCCcceeecCCCCCcChHHHHHHH-HHHHHHhcCCChhhccC-CCccccccccchhhHHHHHHH-HHH Q lcl|NC_018087. 329 LTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYF-RKALYMALRVPLSRIPD-EQTQNVFDMSTAISRDELSFD-KFI 405 (520) Q Consensus 329 mlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF-~kkLy~aL~VP~SRl~~-~~~~~~~G~~~eItRDElkF~-KFI 405 (520) + +.--+...|.+++-|.--..-.|.-.++=| ..-+.++.+||...+.- +++...||...+..+. |. .-+ T Consensus 238 v-----l~~~g~~~Gi~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp~llGi~~~~~~~~sn~e~~~~~---f~~~~l 309 (348) T protein:vir:26 238 V-----NIPNGKEKGIQLIPVGDIATKDEFERIKNITAQDIFVGHRFPAGMGGMLPQQGANVPDPLKVSQV---YDFYEV 309 (348) T ss_pred E-----EcCCCCccceeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCHHHccccCCCCCccccHHHHHHH---HHHHHH Confidence 1 111112246667666554444555556555 44599999999988752 2222233433333332 33 345 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 406 SELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSL 470 (520) Q Consensus 406 ~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~ 470 (520) .-++.+|...+.+- |.+ +++ ..++|+|..+ .+|.+..+- T Consensus 310 ~P~~~~ie~~ln~~----l~~-----~~~-----~~~~fdl~~~------------~e~~~~~a~ 348 (348) T protein:vir:26 310 IPVCKRFMDAVNND----PEI-----PDN-----LKLKFNLNPG------------VESANGSAV 348 (348) T ss_pred HHHHHHHHHHHhhh----hCC-----CCc-----cEEEEecCcc------------cccchhhcC Confidence 66677666644442 211 111 1233443221 222222222 No 190 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=45.25 E-value=0.78 Score=21.19 Aligned_cols=354 Identities=12% Similarity=0.092 Sum_probs=155.6 Q ss_pred hcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchhHHH Q lcl|NC_018087. 12 FAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNAV 91 (520) Q Consensus 12 f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai 91 (520) +|||.+...-. .+ ..| ++.+ ... ++.+.. + .| +.+=|..|| T Consensus 1 Mg~f~~~~~f~---------~~-~~~-~~~~-~~~-----~~~~~~----------~--------~~----~~~~v~~~i 41 (378) T protein:vir:93 1 MNLFGKVVSFS---------RG-KLN-NDTQ-RVT-----AWQNEA----------V--------EY----TSAFVTNIH 41 (378) T ss_pred Cccchhhhhhh---------cc-ccC-CCcc-eee-----ecccch----------h--------HH----HHHHHHHHH Confidence 67776644211 00 011 1111 000 111100 0 01 334488899 Q ss_pred HhhhceeeEecCCCcEEEEeec-cc-hhhhHHHHHHHHHHHHHHHH-hcchhhhHHHHH----hhccccceeEEEeeecC Q lcl|NC_018087. 92 QEIVSDAIVYEEGFDVVSIDLD-QT-AFTENIRNLISDEFNSVLNM-LNFQRKGSDHFK----RWYVDSRVFFHKIINPN 164 (520) Q Consensus 92 ~eIvneaiv~d~~~~~V~l~Ld-~~-~~s~~ik~~I~eeF~~i~~l-l~f~k~g~~~fR----rWYvDgri~~hkvid~~ 164 (520) +-|.+.+.-.+ +.+--. +. ...+.....+.....++|+. =+=.-+++++.+ .+..+|.-|.+++.|.. T Consensus 42 ~~Ia~~iA~lp-----~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~ 116 (378) T protein:vir:93 42 NKIANEITKVE-----FNHVKYKKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDN 116 (378) T ss_pred HHHHhhhhhCc-----eeeEEEcccccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecC Confidence 99998876432 222111 11 11222222222233333332 223344555444 57789999988887632 Q ss_pred CCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccccC Q lcl|NC_018087. 165 RPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDC 244 (520) Q Consensus 165 ~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~ 244 (520) . -++.++-| .+.+++++.+.|+|..+- . T Consensus 117 ~-----g~~~~l~~--------------------------------------------~~~~~~~~~~diih~r~~---~ 144 (378) T protein:vir:93 117 T-----GELLDLLF--------------------------------------------ADDKKEYKTEELVRLTSP---F 144 (378) T ss_pred C-----ceEEEEEe--------------------------------------------cCCeeEeccceeEEecCc---c Confidence 1 12222211 012346678888887632 2 Q ss_pred CCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCcccccc Q lcl|NC_018087. 245 CGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQA 324 (520) Q Consensus 245 ~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~ 324 (520) ++..+.|-|+.|...+.. ....+--+-+..++ |.|.+..+.+....+...|++..- |. +.. T Consensus 145 ~~~~~~s~l~~~~~~i~~-----------~~~~~~~~g~l~~~-~~l~~~~~~~~~~~~~~~~~~~~~-----~~--~~~ 205 (378) T protein:vir:93 145 YINEDTSILDNALASIQT-----------KLEQGKLRGLLKIN-AFLDIDNTQEYREKALTTIKNMQE-----GS--SYN 205 (378) T ss_pred ccchhhHHHHHHHHHHHH-----------HHhcCcccceeeeC-CcCCHHHHHHHHHHHHHHHHHhhc-----cc--ccc Confidence 344467778777665432 11223234444443 344444444444444444544221 21 111 Q ss_pred ccchhhhhhcccccCCCCCcceeecCCCCCcChHHHHHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHHHH Q lcl|NC_018087. 325 NMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMDDILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFDKF 404 (520) Q Consensus 325 ~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~KF 404 (520) ..+. + ..|.+++.|.-.....+++..+|-.+.+.++++||.+.|. |..+| -.+..| T Consensus 206 ~~~~-l----------~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~--------g~~~e-----~~~~~f 261 (378) T protein:vir:93 206 GLTP-V----------DNKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILL--------GTATQ-----EQQIYF 261 (378) T ss_pred cceE-c----------CCCceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHhc--------CCcHH-----HHHHHH Confidence 1221 1 1245666665555566788889999999999999998873 22222 122223 Q ss_pred H-HHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhh---hceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhh Q lcl|NC_018087. 405 I-SELQHKFEEIFLSPLKSNLLLKRVITEDEWEAEL---NNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYIS 480 (520) Q Consensus 405 I-~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~---~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S 480 (520) + ..|+-.+.. +.+. +-..++++.++.... ..+.+.|.-+ ++... -+..|.+.+..+-.- -+++ T Consensus 262 ~~~tl~P~~~~-ie~~-----l~~kLl~~~er~~~~~~~~~~~~~fd~~----~l~~~-d~~~~~~~~~~~~~~--G~~t 328 (378) T protein:vir:93 262 YNSTIIPLLIQ-LEKE-----LTYKLISTNRRRVVKGNLYYERIIVDNQ----LFKFA-TLKELIDLYHENING--PIFT 328 (378) T ss_pred HHHHHHHHHHH-HHHH-----HHhhcCChhHhhhhhhcccccceeeccc----hhhhc-CHHHHHHHHHHHHhC--CCcC Confidence 2 223332222 2222 334567777766432 2233444322 22211 124566666655332 2567 Q ss_pred HHHHHHHHhCCCHHH-------------HHHHHHHHHHhhhcCCccCCccc Q lcl|NC_018087. 481 NHTAMKDFLQMSDED-------------IAAERKLIDEELSDKIFNPPEPE 518 (520) Q Consensus 481 ~~~i~k~IL~~tDee-------------I~~~~kqi~~E~~~~~~~~p~~e 518 (520) .+-++. .++|.+-+ +....++-..+..++--.+.+.| T Consensus 329 ~NE~R~-~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:93 329 QNQLLV-KMGEQPIEGGDVYIANLNAVAVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred HHHHHH-HhCCCCCCCCCeeeeccccccccchhhhcCccCCCCCCCCCCCC Confidence 777764 46665321 00011110001111000111111 No 191 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=44.93 E-value=0.79 Score=21.15 Aligned_cols=325 Identities=14% Similarity=0.125 Sum_probs=136.9 Q ss_pred CccccccchhhhcchhhhhhhHHH--hhhccCCCc--ccCCCC-CCCceeecccccccccccccccccccccchhHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYD--KIINDKAES--ITAPKF-DDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELI 75 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~--~~~~~~~~s--~~~p~~-~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI 75 (520) || + ++..+... ..-..+.++ +-.|-- .+|++...--..+.+|.++.+.++.. -|- T Consensus 1 m~----------~---~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~pp~~~~-------~la 60 (340) T protein:vir:98 1 MS----------K---RKPRKAVAMTASAPQKMEAFTFGEPVPVLDKRDILDYVECISNGKWYEPPVSFS-------GLA 60 (340) T ss_pred CC----------C---CCCCccccccccCccceeEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHH-------HHH Confidence 11 0 00000000 000000011 111110 11111111001122333333333322 355 Q ss_pred HHHHHHhhccchhHHHHhhhceeeE-ecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccc Q lcl|NC_018087. 76 NTYRSLLNNYEVDNAVQEIVSDAIV-YEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSR 154 (520) Q Consensus 76 ~~YR~ma~~pEvd~Ai~eIvneaiv-~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgr 154 (520) +-+|.-..| .+||.--+|-..- +.-+ -..| +.+|.. ++-.|.+=|- T Consensus 61 ~l~~a~~~h---~s~i~~k~n~l~~~~~Pn----------~~lt-------~~~f~~-------------~~~d~ll~Gn 107 (340) T protein:vir:98 61 KSLRSAVHH---SSPIYVKRNVLASTYIPH----------PLLS-------RQDFSR-------------FALDYLVFGN 107 (340) T ss_pred HHHHhcccc---chhhhhhhhHHhhccCCC----------CCCC-------HHHHHH-------------HHHHHHhcCC Confidence 556655554 3344333332111 1111 0111 122321 2223445599 Q ss_pred eeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccE Q lcl|NC_018087. 155 VFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAM 234 (520) Q Consensus 155 i~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI 234 (520) -|+.++-+ ....+++|.+++|..++.. .++.. ||.+... +..+.++++.| T Consensus 108 ay~~~~rn---~~G~~~~L~pl~~~~vr~~-------~~~~~-------~~~~~~~-------------~~~~~~~~~eV 157 (340) T protein:vir:98 108 AFLEQRHS---VTGQLIKLLTSPAKYTRRG-------VDDSV-------FWFVENF-------------TQPHEFAPDTV 157 (340) T ss_pred eEEEEEEC---CCCcEEEEEEeCCceEEEc-------ccCcE-------EEEEecC-------------CeEEEEccccE Confidence 99998854 3335999999999766652 12211 3333211 23457888899 Q ss_pred EEeecccccCC-CCcchhhhHHHHHHHHHHHHHHHHHHHHHH--h--cCccceEEEccCCCCchHHHHHHHHHHHHhhcc Q lcl|NC_018087. 235 VYAHSGLVDCC-GKNIIGYLHRAVKPANQLKLLEDAMMIYRI--T--RAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRN 309 (520) Q Consensus 235 ~y~hSGL~d~~-~~~~~syL~~aik~~NqL~m~EDalVIyRi--~--RApeRRvFyIDvGnlpk~KAeqyl~~im~~~kn 309 (520) +|.. ..|+. +-..+|-+..|++.+.. -.+.-.||. . -|--.-|+|+.-+.+. .++.+=+++-+..++ T Consensus 158 iHir--~~~~~~~~~Gls~~~~a~~si~l----~~aa~~~~~~~f~NGa~pg~il~~~~~~ls-~e~~~~lk~~~~~~~- 229 (340) T protein:vir:98 158 FHLL--EPDINQEIYGLPEYLSALNSAWL----NESATLFRRKYYQNGAHAGYIMYVTDPAQS-ATDVESLRDAMRNSK- 229 (340) T ss_pred EEEc--CCCCCCCcccccHHHHHHHHHHH----HHHHHHHHHHHHhccCCCceEEEecCCCCC-HHHHHHHHHHHHHhc- Confidence 8775 35653 44557777767665432 223333322 2 2456677887655555 444444555554432 Q ss_pred eeEeecCCCccccccccchhhhhhcccccCCCC--CcceeecCCCCCcChHHHHH-HHHHHHHHhcCCChhhccC-CCcc Q lcl|NC_018087. 310 RISYDARTGKVKNQANMMALTEDYWLQRRDGKA--VTEVETLPGMTGMNEMDDIL-YFRKALYMALRVPLSRIPD-EQTQ 385 (520) Q Consensus 310 klvYd~~TGev~d~~~~msmlEDywLpRReGgr--gTEIsTLpGg~nLgei~DV~-YF~kkLy~aL~VP~SRl~~-~~~~ 385 (520) | ..|.++.+ ++. .||+ |.+++-|.--..-.|.-.++ +-++...++.+||...+.- +++. T Consensus 230 --------G-~~n~~~~~-vl~-------~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t 292 (340) T protein:vir:98 230 --------G-LGNFKNLF-FYS-------PNGKPDGIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPFQLMGGKPENI 292 (340) T ss_pred --------C-ccccCcee-Eec-------CCCCccceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCC Confidence 1 11111111 111 1222 45555443322222222332 3456689999999998853 2222 Q ss_pred ccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhc Q lcl|NC_018087. 386 NVFDMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNN 441 (520) Q Consensus 386 ~~~G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~ 441 (520) ..||...+..+. =...-+.-|+.+|.++... | .-.++.-++++-.... T Consensus 293 ~~~sn~e~~~~~--f~~~~l~Pl~~~iee~n~~-L-----~~e~~rF~~~~l~~~d 340 (340) T protein:vir:98 293 GSLGDVEKVAKV--FVRNELSPLQDRFREVNDW-L-----GMEVIRFKEYTLDNPE 340 (340) T ss_pred CccccHHHHHHH--HHHHHHHHHHHHHHHHHhc-c-----cccccccCccccccCC Confidence 223433333332 1223456677777764432 1 1223333333322222 No 192 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=35.80 E-value=1.2 Score=20.13 Aligned_cols=390 Identities=10% Similarity=0.044 Sum_probs=171.8 Q ss_pred cccccchhhhcchhhhhhhHHHhhhccCCCcccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHh Q lcl|NC_018087. 3 MLADSDLKMFAFWHKVDDTEYDKIINDKAESITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLL 82 (520) Q Consensus 3 ~~~~~~l~~f~~~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma 82 (520) |+|...+ . +....+.++.+|-....- +++... .+..-|. ...+ T Consensus 1 ~~~~r~~----------~---------~~~~~~~~~~~~~~~~~~------g~~~s~-----~~~~vt~-------~~al 43 (419) T protein:vir:14 1 MFFSRQL----------L---------SNLGQTQMSAGGWVSALL------GSSRSD-----SGQVVTP-------ASAL 43 (419) T ss_pred Ccccccc----------c---------ccccccccCcchhhHHhh------cCCCcc-----CCcccch-------HHhh Confidence 3332111 1 011111222222111000 011100 0000011 1346 Q ss_pred hccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhc----chhhhHHHHHh----hccccc Q lcl|NC_018087. 83 NNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLN----FQRKGSDHFKR----WYVDSR 154 (520) Q Consensus 83 ~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~----f~k~g~~~fRr----WYvDgr 154 (520) ++|-|..||+.|.+.+.-.+- .|-=... +. +..+ ....+.++|+ -..++.++.+. +.+.|- T Consensus 44 ~~~~v~~~v~~ia~~iA~lp~-----~~~~~~~---~~-~~~~--~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gn 112 (419) T protein:vir:14 44 ALTVLQNCVTLLAESIAQLPI-----ELYERSG---ED-RKPA--TDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGN 112 (419) T ss_pred ccHHHHHHHHHHHHhhccCce-----EEEEecC---Cc-cccc--cccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCC Confidence 788899999999998763321 1110000 00 0011 1122333332 23455565444 667788 Q ss_pred eeEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccE Q lcl|NC_018087. 155 VFFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAM 234 (520) Q Consensus 155 i~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI 234 (520) -|..++-|. + .-+++|.+|+|..+...+.- ++.. .|.++... .++.+-| T Consensus 113 a~~~i~r~~-~--G~~~~l~pl~~~~v~v~~~~-----~~~~------~y~~~~~~-----------------~~~~~~i 161 (419) T protein:vir:14 113 SYSFIDRDS-D--GVIQGLYPLDNEAVTVMRGS-----DLKP------VYRVRGSD-----------------PMPQRLV 161 (419) T ss_pred eEEEEEECC-C--CcEEEEEEecCceEEEEECC-----CceE------EEEEccCc-----------------ccchhhe Confidence 888877553 2 23899999999999875332 2211 12111111 2456666 Q ss_pred EEeecccccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCc--hHHHHHHHHHHHHh-hccee Q lcl|NC_018087. 235 VYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAMMIYRITRAPDRRVFYIDTGNMP--ARKAAQHMQHIMNS-HRNRI 311 (520) Q Consensus 235 ~y~hSGL~d~~~~~~~syL~~aik~~NqL~m~EDalVIyRi~RApeRRvFyIDvGnlp--k~KAeqyl~~im~~-~knkl 311 (520) +|.. ....++...+|-+..|..++.....+++...=+----+--+-++..+...-+ ..++.+-+++.+++ |+ T Consensus 162 ~h~~--~~~~dg~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 236 (419) T protein:vir:14 162 HHVR--WMSINGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDRITDGWNAKFG--- 236 (419) T ss_pred eEec--CcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhc--- Confidence 6653 2455666778899999998888888877665554445666778877643211 22332223333222 22 Q ss_pred EeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcChH---HHHHHHHHHHHHhcCCChhhccCCCccccc Q lcl|NC_018087. 312 SYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNEM---DDILYFRKALYMALRVPLSRIPDEQTQNVF 388 (520) Q Consensus 312 vYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLgei---~DV~YF~kkLy~aL~VP~SRl~~~~~~~~~ 388 (520) | ..+..+.+ .++ .|.+++.|. .+..++ +--++..+.+.++++||..-|....+.. + T Consensus 237 ------g-~~nag~~~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t-~ 295 (419) T protein:vir:14 237 ------G-SGNAKKVA-LLQ----------EGMTFRPLS--MTNVDAALIDALRLSALDIARIYKIPAHMVNELERAT-F 295 (419) T ss_pred ------C-ccccCCce-ecC----------CCceEEEcc--CChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCC-c Confidence 1 11111222 222 255666664 233333 3334778999999999998886432211 1 Q ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHH Q lcl|NC_018087. 389 DMSTAISRDELSFDKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVL 468 (520) Q Consensus 389 G~~~eItRDElkF~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~ 468 (520) . .+.-.-+.|..++ |+--... +.+ -+-+.++++.++.. ..+.|..+. +... -+..|.+.+ T Consensus 296 -s--~~E~~~~~f~~~~--L~P~~~~-ie~-----~l~~kll~~~~~~~----~~i~fd~~~----l~r~-d~~~~~~~~ 355 (419) T protein:vir:14 296 -S--NIEHQSLQFVIYT--LLPWVKR-HEQ-----AKTRDLLLPSERKQ----YFIEYNLAG----LLRG-DQSSRYAAY 355 (419) T ss_pred -c--cHHHHHHHHHHHH--HHHHHHH-HHH-----HHhhhccCccccCC----eEEEEechh----hhcc-CHHHHHHHH Confidence 1 1111112233322 2221111 111 12234556666542 234444322 2111 234566666 Q ss_pred HHhhcccchhhhHHHHHHHHhCCCHHHH----------HHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 469 SLMEPYIGKYISNHTAMKDFLQMSDEDI----------AAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 469 ~~~~p~vgky~S~~~i~k~IL~~tDeeI----------~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) +.+-.- -+++.+-++. .+++.+-+= ....+.=+.+..++--.+...+|+ T Consensus 356 ~~~~~~--G~~T~NE~R~-~~gl~p~~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~e~ 414 (419) T protein:vir:14 356 AVGRQW--GWLSINDIRR-LENMPPVKGGDIYLSPMNMVDASKPQQLPVGKSEPTKAAIDEI 414 (419) T ss_pred HHHHhC--CCcCHHHHHH-HhCCCCCCCcCeeeeccccccccccccccCCCCCCccccccch Confidence 655222 3567777774 466543210 000000000011110011222223 No 193 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=30.47 E-value=1.6 Score=19.50 Aligned_cols=410 Identities=10% Similarity=0.025 Sum_probs=148.7 Q ss_pred ccCCCCCCCceeecccccccccccccccccccccchhHHHHHHHHHHHhhccchhHHH---------------------- Q lcl|NC_018087. 34 ITAPKFDDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNAV---------------------- 91 (520) Q Consensus 34 ~~~p~~~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~Ai---------------------- 91 (520) .+||--.---. ...+.+..-+..-. ....+++|+.+..+++=...| T Consensus 1 ~~~~~~~~~~~-------~~~~~~~~~i~~~~-----~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~n 68 (537) T protein:vir:78 1 MTSPLLNKPID-------QLGGLLNTEITTYM-----ASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASN 68 (537) T ss_pred CCcccccccHH-------HHHHHHHHHHHHHH-----HHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccc Confidence 22221110000 00111111110000 012345566665555544322 Q ss_pred --------HhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccceeEEEeeec Q lcl|NC_018087. 92 --------QEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINP 163 (520) Q Consensus 92 --------~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~ 163 (520) ..||+..+-|= -..||++..++..- + .+.+.+..+++ -+|++.-.++.+.+.+-|+-|.|.=+|. T Consensus 69 nki~~nf~k~Ivd~~~~yl-~G~Pv~~~~~d~~~-~----e~~~~l~~~~~-~~~~~~~~el~~~~s~~G~ay~~~y~de 141 (537) T protein:vir:78 69 VKISHGFFTELVDQLAQYL-LSNGVEVKVKDEDN-T----QLDEILQEYFD-EDFQATIDTLVTNASKKGFEGIFARTTS 141 (537) T ss_pred cccccchHHHHHHHHhhhh-cccCceeecCcchh-H----HHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeeEEEeeecC Confidence 11222222221 13677776554322 1 23334444443 3677888899999999999998877663 Q ss_pred CCCCCCeeeeEecCccceeeeeeccCCCCcccccc------------cceecceeecCcc-cccccccceec----CCcc Q lcl|NC_018087. 164 NRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVV------------KGYREYFLYDTEL-ESYQCGHQHFA----AGTK 226 (520) Q Consensus 164 ~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~------------~~~~ey~~y~~~~-~~~~~~~~~~~----~~~~ 226 (520) +|-..+..+||+.+-+|.+-..+...-.+++ +.+.-+-+|++.. ..|-....... .... T Consensus 142 ----~~~~~~~~i~p~~~~pv~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~ 217 (537) T protein:vir:78 142 ----EGKLKFQTVDGLTLIPVFDDYGVLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEA 217 (537) T ss_pred ----CCceEEEEEccceeEEEEcCCCCceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCccccccccccc Confidence 3677899999999888754222111111110 0111122333321 11111110000 0000 Q ss_pred eecC-cccEEEe------------------e-cccccC----CCCcchhhhHHHHHHHHHHHHHH-HHHHHHHHhcCccc Q lcl|NC_018087. 227 IKIP-YSAMVYA------------------H-SGLVDC----CGKNIIGYLHRAVKPANQLKLLE-DAMMIYRITRAPDR 281 (520) Q Consensus 227 ~~I~-~~aI~y~------------------h-SGL~d~----~~~~~~syL~~aik~~NqL~m~E-DalVIyRi~RApeR 281 (520) +... ...+..+ | .|.+.. ++....|=|+..+.....+.++= +..-...-+..|-- T Consensus 218 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~il 297 (537) T protein:vir:78 218 YNPNPAPHVLAIEESTDADFEDTDGYQVLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIY 297 (537) T ss_pred ccccccceeeeccccccccccccccccccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCcee Confidence 0000 0000000 0 111111 12223455665554444433211 11111111222211 Q ss_pred eEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccccchhhhhhcccccCCCCCcceeecCCCCCcCh-HHH Q lcl|NC_018087. 282 RVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQANMMALTEDYWLQRRDGKAVTEVETLPGMTGMNE-MDD 360 (520) Q Consensus 282 RvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~~msmlEDywLpRReGgrgTEIsTLpGg~nLge-i~D 360 (520) -+.-. +|+ +...++.-+..+.+..-.| .|..++.|--..+..- -.- T Consensus 298 vi~g~------------------------------~~~--~~~~~~~~l~~~~~i~v~~-d~~~v~~l~~~~~~~~~e~~ 344 (537) T protein:vir:78 298 VVKGF------------------------------SGD--STDKLRQNIKAKKMIGVNG-DNAGMEIQTVSIPYEARKAK 344 (537) T ss_pred eeecC------------------------------CCc--cchhHHHHHhhcCceeecC-CCCceeEEEecCCHHHHHHH Confidence 11111 111 1111122222222211111 1222344333222211 112 Q ss_pred HHHHHHHHHHhcCCChhhccCCCccccccccchhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHh Q lcl|NC_018087. 361 ILYFRKALYMALRVPLSRIPDEQTQNVFDMSTAISRDELSFD---KFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEA 437 (520) Q Consensus 361 V~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~~eItRDElkF~---KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~ 437 (520) +.-..+.+|+-..+|-. .... +|.+|..+..= +|. .-+.+.++.|...|...|+.=+-+-++....+|+ T Consensus 345 ld~L~~~I~~~s~~~~~--~~~~----~gn~SGvAlk~-~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~~~~~d- 416 (537) T protein:vir:78 345 MDIDVENIYRSGMGFNS--TAVG----DGNVTNVVIKS-RYTLLAMKARKMETSLRKVLRWCADMVVSDIALRGLGEYD- 416 (537) T ss_pred HHHHHHHHHHhcCCCCC--cccc----ccCCcHHHHHH-HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccc- Confidence 44455566666666642 2221 25444443321 111 1233333333333333332211111222212233 Q ss_pred hhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHHHHHHHhCCCHHHHHHHHHHHHHhhhc------CC Q lcl|NC_018087. 438 ELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHTAMKDFLQMSDEDIAAERKLIDEELSD------KI 511 (520) Q Consensus 438 ~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~i~k~IL~~tDeeI~~~~kqi~~E~~~------~~ 511 (520) ...|.+.|.+.---.+...++++. .+. .+-.+|.+++++. |...|+. +..+++++|..+ .. T Consensus 417 -~~~i~i~f~~~~P~n~~e~a~~~~-------~l~--~~giiS~eT~l~~-~p~vdd~--e~ek~~~ee~~~~~~~~~~~ 483 (537) T protein:vir:78 417 -SNDICFEIEPHVLANELDIATTRK-------TEA--ETEALKIGNIMTV-APRIGDD--ETLKLIAEELDLDYNELKDA 483 (537) T ss_pred -cceeeEEeccCCCCCHHHHHHHHH-------HHH--hcCcchHHHHHHh-CCCCCCH--HHHHHHHHHHHhhhhhhhhh Confidence 235778888666555543343332 221 1235699999976 5554432 111222222110 00 Q ss_pred ccC---------CccccC Q lcl|NC_018087. 512 FNP---------PEPEEI 520 (520) Q Consensus 512 ~~~---------p~~e~~ 520 (520) ..+ |+.++. T Consensus 484 ~~~~~~~~~~~~~~~~~~ 501 (537) T protein:vir:78 484 LAEQDAQSLDVSPDVQAM 501 (537) T ss_pred hhhhcccccCcCcchhhh Confidence 011 111111 No 194 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=29.97 E-value=1.6 Score=19.44 Aligned_cols=422 Identities=14% Similarity=0.043 Sum_probs=159.5 Q ss_pred hhhhhhhHHHhhhccCCCcccCCCCCCCceeecc----cccccccccccccccccccchhHHHHHHHHHHHhhccchhHH Q lcl|NC_018087. 15 WHKVDDTEYDKIINDKAESITAPKFDDGATEVDS----QDIAYNGVFQKLYGSQDPTATSTRELINTYRSLLNNYEVDNA 90 (520) Q Consensus 15 ~~~~~~~~~~~~~~~~~~s~~~p~~~dg~~~i~~----~~~a~~g~~~~~~~~~~~~~~~~~~LI~~YR~ma~~pEvd~A 90 (520) =+|+..+- ..+ +..+......++.+.... .+..++|.. ...+++..+. ...++.|+.|..++.|.++ T Consensus 1 m~k~~~k~--~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~---~~~~~~iLr~-~~~~~ly~~m~~D~hi~s~ 71 (448) T protein:vir:79 1 MAKRGRKP--KEL---VPGPGSIDPSDVPKLEGASVPVMSTSYDVVV---DREFDELLQG-KDGLLVYHKMLSDGTVKNA 71 (448) T ss_pred CCCCCCCC--ccc---cCcccccccccchhhhhhhhhhccccccccc---ccchhHhhcc-ccchHHHHHHhhChHHHHH Confidence 11111110 000 011111111111111111 011112222 2233333333 3467899999999999999 Q ss_pred HHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHH---HHHHHhcchhhhHHHHHhhccccceeEEEeeecC-CC Q lcl|NC_018087. 91 VQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFN---SVLNMLNFQRKGSDHFKRWYVDSRVFFHKIINPN-RP 166 (520) Q Consensus 91 i~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~---~i~~ll~f~k~g~~~fRrWYvDgri~~hkvid~~-~~ 166 (520) ++....-+.-.+=...|- .+..-.+.+-+.+++-.. ...+...|..-..++.. =..-|--+++++.... +. T Consensus 72 l~~Rk~av~~~~w~v~p~----~~~~~~~~~ae~v~~~l~~~~~~~~~~~f~~~~~~~ld-a~~~G~s~~Eivw~~~~~g 146 (448) T protein:vir:79 72 LNYIFGRIRSAKWYVEPA----STDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIYEN-AYIYGMAAGEIVLTLGADG 146 (448) T ss_pred HHHHHHHHhcCCceEecC----CCCHHHHHHHHHHHHHhhhhhhhhccCCHHHHHHHHHH-hhhhcceeEEEEeeecCCC Confidence 999987554332221110 111111222222222111 00111223222222211 1233556677776532 22 Q ss_pred CCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEEEeecccccCCC Q lcl|NC_018087. 167 KDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMVYAHSGLVDCCG 246 (520) Q Consensus 167 k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~y~hSGL~d~~~ 246 (520) .-++..|...+|+.+...+ -...++..... .... ...+ ....+++.||..-+++.| . -...+ T Consensus 147 ~~~~~~l~~r~~~~~~~f~---~~~d~~l~~~~-------~~~~---~~~~---~~~~~~~~lP~~~~i~~~-~-~~~g~ 208 (448) T protein:vir:79 147 KLILDKIVPIHPFNIDEVL---YDEEGGPKALK-------LSGE---VKGG---SQFVSGLEIPIWKTVVFL-H-NDDGS 208 (448) T ss_pred ceecccccccCCcccccee---eecCCceEEee-------cCCc---cccc---ccCCCccccccceEEEEe-c-CccCC Confidence 2234444444554443311 11122222110 0000 0000 011234566766655433 1 12223 Q ss_pred CcchhhhHHHHHHHHHHHHHHHHHHHHH-HhcCccceEEEccCCCCchHHHHHHHHHHHHhhcceeEeecCCCccccccc Q lcl|NC_018087. 247 KNIIGYLHRAVKPANQLKLLEDAMMIYR-ITRAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRISYDARTGKVKNQAN 325 (520) Q Consensus 247 ~~~~syL~~aik~~NqL~m~EDalVIyR-i~RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklvYd~~TGev~d~~~ 325 (520) ....|-|..|..+|-=.+......+.+= +.=.|-| |...+.|.-...+..+-|..++...+- +...|- T Consensus 209 p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~-vgky~~ga~~~~~~~~~l~~av~~i~~----g~~a~~------ 277 (448) T protein:vir:79 209 FTGQSALRAAVPHWLAKRALILLINHGLERFMIGVP-TLTIPKSVRQGTKQWEAAKEIVKNFVQ----KPRHGI------ 277 (448) T ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCceE-EEecCCCCCcCHHHHHHHHHHHHHHhc----CCceEE------ Confidence 3457888888888776666665555443 3344665 666666654333333334444433321 011111 Q ss_pred cchhhhhhcccccCCCCCcceeecCCCCCcChHH-HHHHHHHHHHHhcCCChhhccCCCcccccccc-chhhHHHHHHHH Q lcl|NC_018087. 326 MMALTEDYWLQRRDGKAVTEVETLPGMTGMNEMD-DILYFRKALYMALRVPLSRIPDEQTQNVFDMS-TAISRDELSFDK 403 (520) Q Consensus 326 ~msmlEDywLpRReGgrgTEIsTLpGg~nLgei~-DV~YF~kkLy~aL~VP~SRl~~~~~~~~~G~~-~eItRDElkF~K 403 (520) . +| .|++|..+..+.+.+... =++|..+..-+++-=-. |..+++. |.+ ....--.-.+.+ T Consensus 278 i--------iP-----~~~~ie~~ea~~~~~~~~~~i~~~d~~Isk~iLGqt--lTs~~~~---g~~~~~~~~~~~v~~~ 339 (448) T protein:vir:79 278 I--------LP-----DDWKFDTVDLKSAMPDAIPYLTYHDAGIARALGIDF--NTVQLNM---GVQAINIGEFVSLTQQ 339 (448) T ss_pred E--------ec-----CCceEEEEecCCCcccHHHHHHHHHHHHHHHHhhhh--hcccccc---chhhhhhhhHHHHHHH Confidence 1 11 457888877655544322 24566666655543221 3333322 222 121111112334 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccchHHHHHHHHHHHHHHHHHHHhhcccchhhhHHH Q lcl|NC_018087. 404 FISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSYFSEMKTIEITERRVNVLSLMEPYIGKYISNHT 483 (520) Q Consensus 404 FI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Ei~~~R~~~~~~~~p~vgky~S~~~ 483 (520) -+..-.+.++..|..-|-..|+.=+.=.. .-...+.|+.. |-.++.-+.+++..+..+. ..+.++ T Consensus 340 ~~~aDa~~i~~tln~~li~~l~~lNfg~~----~~~P~~~f~~~------e~~Dl~~~a~~~~~l~~~~-----~~~~~~ 404 (448) T protein:vir:79 340 TIISLQREFASAVNLYLIPKLVLPNWPSA----TRFPRLTFEME------ERNDFSAAANLMGMLINAV-----KDSEDI 404 (448) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCc----CCCcEEEecCC------ChHHHHHHHHHhhhhhccc-----hhhHHH Confidence 44444444555555433333332221011 11133444433 3345444556665554332 112333 Q ss_pred HHHHHhCCCHHHHHHHHHHHHHhhhcCCccCCccccC Q lcl|NC_018087. 484 AMKDFLQMSDEDIAAERKLIDEELSDKIFNPPEPEEI 520 (520) Q Consensus 484 i~k~IL~~tDeeI~~~~kqi~~E~~~~~~~~p~~e~~ 520 (520) ++ +.+..-+..=.++. + .+-.+++..|.- T Consensus 405 ~~-~~~~~p~~~~~~~~-----~--a~~~~~~~~~~~ 433 (448) T protein:vir:79 405 PT-ELKALIDALPSKMR-----R--ALGVVDEVREAV 433 (448) T ss_pred HH-HhhcCCCCCCCccc-----c--ccCCCCcccccc Confidence 33 22333211000000 0 000001110100 No 195 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=25.59 E-value=2 Score=18.89 Aligned_cols=332 Identities=13% Similarity=0.082 Sum_probs=135.5 Q ss_pred CccccccchhhhcchhhhhhhHHHhhhc--cCCCccc--CCCC-CCCceeecccccccccccccccccccccchhHHHHH Q lcl|NC_018087. 1 MSMLADSDLKMFAFWHKVDDTEYDKIIN--DKAESIT--APKF-DDGATEVDSQDIAYNGVFQKLYGSQDPTATSTRELI 75 (520) Q Consensus 1 ~~~~~~~~l~~f~~~~~~~~~~~~~~~~--~~~~s~~--~p~~-~dg~~~i~~~~~a~~g~~~~~~~~~~~~~~~~~~LI 75 (520) ||= --.+......+.+.. .+.++++ -|-- .++.+..+--..+.+|-++.+.++. .-|. T Consensus 1 ~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~-------~~la 63 (344) T protein:vir:56 1 MSK----------KKGKTPQPAAKTMTASAPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPVSF-------TGLA 63 (344) T ss_pred CCC----------CCCCCCchhhHHhhcCCCceEEEEcCCceeecCcchhhhHHHhhhcCccccCCCCH-------HHHH Confidence 110 000000000000000 0001111 1100 0111110000112223333333222 2355 Q ss_pred HHHHHHhhccchhHHHHhhhceeeEecCCCcEEEEeeccchhhhHHHHHHHHHHHHHHHHhcchhhhHHHHHhhccccce Q lcl|NC_018087. 76 NTYRSLLNNYEVDNAVQEIVSDAIVYEEGFDVVSIDLDQTAFTENIRNLISDEFNSVLNMLNFQRKGSDHFKRWYVDSRV 155 (520) Q Consensus 76 ~~YR~ma~~pEvd~Ai~eIvneaiv~d~~~~~V~l~Ld~~~~s~~ik~~I~eeF~~i~~ll~f~k~g~~~fRrWYvDgri 155 (520) +-+|.-+.|.- ||...+|-...+= .| +--.| +.+|.. ++-.+.+-|.- T Consensus 64 ~~~~a~~~h~s---~i~~k~n~l~~~~---~P------np~~t-------~~~f~~-------------~~~d~ll~Gna 111 (344) T protein:vir:56 64 KSLRAAVHHSS---PIYVKRNILASTF---IP------HPWLS-------QQDFSR-------------FVLDFLVFGNA 111 (344) T ss_pred HHHhhhhhhCc---cceehhhhHHhhc---CC------CCCCC-------HHHHHH-------------HHHHHHhcCCe Confidence 55665555522 2222222111000 00 00111 112321 22234556999 Q ss_pred eEEEeeecCCCCCCeeeeEecCccceeeeeeccCCCCcccccccceecceeecCcccccccccceecCCcceecCcccEE Q lcl|NC_018087. 156 FFHKIINPNRPKDGIIELRRLDPRNVQFVRELDTKMENGVKVVKGYREYFLYDTELESYQCGHQHFAAGTKIKIPYSAMV 235 (520) Q Consensus 156 ~~hkvid~~~~k~GI~elr~lDPr~i~~vr~i~~~~~~~~~~~~~~~ey~~y~~~~~~~~~~~~~~~~~~~~~I~~~aI~ 235 (520) |+.++-+ ....+++|.+|+|..++..+ ++. .||.+.. .+..+..+++.|+ T Consensus 112 y~~~~rn---~~G~~~~L~pl~~~~v~~~~-------~~~-------~~~~~~~-------------~g~~~~~~~~dIi 161 (344) T protein:vir:56 112 FLEKRYS---TTGKVIRLETSPAKYTRRGV-------EED-------VYWWVPS-------------FNEPTAFAPGSVF 161 (344) T ss_pred EEEEEEC---CCCcEEEEEEeCCceeEEee-------cCC-------EEEEEec-------------CCeEEEEcCccEE Confidence 9999865 33359999999998776521 111 1222221 1334677889998 Q ss_pred EeecccccCC-CCcchhhhHHHHHHHHHHHHHHHHHHHHHHh--cCccceEEEccCCCCchHHHHHHHHHHHHhhcceeE Q lcl|NC_018087. 236 YAHSGLVDCC-GKNIIGYLHRAVKPANQLKLLEDAMMIYRIT--RAPDRRVFYIDTGNMPARKAAQHMQHIMNSHRNRIS 312 (520) Q Consensus 236 y~hSGL~d~~-~~~~~syL~~aik~~NqL~m~EDalVIyRi~--RApeRRvFyIDvGnlpk~KAeqyl~~im~~~knklv 312 (520) |.. ..|++ +-..+|-+..|+..+..=...+..- .|.. =|--.-|+|+.-++|-+..+ +-|++-+..-+ T Consensus 162 Hir--~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~--~~~f~NGa~pg~Il~~~d~~ls~e~~-~~lk~~~~~~~---- 232 (344) T protein:vir:56 162 HLL--EPDINQELYGLPEYLSALNSAWLNESATLFR--RKYYENGAHAGYIMYVTDAVQDRNDI-EMLRENMVKSK---- 232 (344) T ss_pred EEC--CCCCCCCcccccHHHHHHHHHHHHHHHHHHH--HHHHhccCCCceEEEecCCCCCHHHH-HHHHHHHHHhc---- Confidence 775 35654 4456777777776655322222211 1222 25566777775556654433 33333333211 Q ss_pred eecCCCccccccccchhhhhhcccccCC-CCCcceeecCCCCCcChHHHHH-HHHHHHHHhcCCChhhccC-CCcccccc Q lcl|NC_018087. 313 YDARTGKVKNQANMMALTEDYWLQRRDG-KAVTEVETLPGMTGMNEMDDIL-YFRKALYMALRVPLSRIPD-EQTQNVFD 389 (520) Q Consensus 313 Yd~~TGev~d~~~~msmlEDywLpRReG-grgTEIsTLpGg~nLgei~DV~-YF~kkLy~aL~VP~SRl~~-~~~~~~~G 389 (520) |. +..+.| +|.--.| ..|.++.-|.--..-.|+-.++ +-.+-+-++.+||...+.- +++...|| T Consensus 233 -----g~--~~~r~l------~l~~p~g~~~G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp~llGi~~~~t~~~~ 299 (344) T protein:vir:56 233 -----GR--NNFKNL------FLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLG 299 (344) T ss_pred -----CC--CCccce------EEecCCCCccceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccc Confidence 11 122222 1111111 1244555443322222322332 3345588999999998853 22222223 Q ss_pred ccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhhHHhhhhceEEEeeccch Q lcl|NC_018087. 390 MSTAISRDELSF-DKFISELQHKFEEIFLSPLKSNLLLKRVITEDEWEAELNNIKIVFHKNSY 451 (520) Q Consensus 390 ~~~eItRDElkF-~KFI~rLr~rFs~if~d~Lk~QLiLkgi~t~eew~~~~~~I~~~f~~Dn~ 451 (520) ...+..+ -| ..-+.-|+.+|.++...+ .-.++.-.+|+ ...|+- T Consensus 300 n~eq~~~---~f~~~tL~Pl~~~ie~~n~~l------~~~~~~F~~y~---------l~~~~~ 344 (344) T protein:vir:56 300 DIEKVAK---VFVRNELIPLQDRIREINGWI------GQEVIRFKNYS---------LDTDNG 344 (344) T ss_pred cHHHHHH---HHHHHHHHHHHHHHHHHHhhh------ccccccCCCcc---------ccccCC Confidence 3333332 33 334566788887765553 22333333333 222222 Done!